• header-logo.png Department of Electronics and Electrical Engineering
    Indian Institute of Technology Guwahati
header-logo.png Department of Electronics
and Electrical Engineering
Prithwijit Guha

Dr. Prithwijit Guha

Associate Professor

Key Research Areas : Computer Vision, Pattern Recognition, Signal Processing, Robotics.

Phone No: +91 361 2583452 (O) | Email: pguha@iitg.ac.in| Room No: 0101

Degree Department & Institution Year
Ph.D. Information Systems, Department of Electrical Engineering, Indian Institute of Technology Kanpur 2009
M.Tech. Signal Processing, Department of Electrical Engineering, Indian Institute of Technology Kanpur 2001
B.E. Department of Electrical Engineering, Jadavpur University 1999

 

Role Institution/Organization From To
Associate Professor Department of Electronics & Electrical Engineering, Indian Institute of Technology Guwahati 14 August 2022 Present
Assistant Professor Department of Electronics & Electrical Engineering, Indian Institute of Technology Guwahati 01 October 2012 13 August 2022
Team Leader Computer Vision Group, TCS Innovation Labs, New Delhi 17 May 2010 30 September 2012
Visiting Faculty Department of Computer Science & Engineering, Indian Institute of Technology Kanpur 29 December 2010 08 May 2011
Visiting Faculty Laxmi Narayan Mittal Institute of Information Technology 01 January 2009 23 April 2010
Research Associate Computer Vision Lab, Swiss Federal Institute of Technology Zurich (ETHZ) 21 September 2001 30 September 2002

 

Book Chapter

1. Guha P., Mukerjee A., Venkatesh K.S., "Occlusion sequence mining for activity discovery from surveillance videos", Pattern Recognition Technologies and Applications: Recent Advances , (DOI - 10.4018/978-1-59904-807-9.ch009) ,pp.212-226, [2008].

Journal Publications

1. Rituparna Choudhury, Shaik Rafi Ahamed, Prithwijit Guha, "FPGA Implementation of Batch-Mode Depth-Pipelined Two Means Decision Tree", IEEE Embedded Systems Letters [2022]. , 10.1109/LES.2022.3190001

2. Mrinmoy Bhattacharjee, S.R. Mahadeva Prasanna, Prithwijit Guha, "Speech/Music Classification using Phase-based and Magnitude-based Features", Speech Communication (Elsevier) , vol.142 , (DOI - https://doi.org/10.1016/j.specom.2022.06.005) ,pp.34-48, [2022].

3. Mrinmoy Bhattacharjee, S.R. Mahadeva Prasanna, Prithwijit Guha, "Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning", IEEE/ACM Transactions on Audio Speech and Language Processing [2022]. , doi: 10.1109/TASLP.2022.3164199

4. Aakansha Mishra, Ashish Anand, Prithwijit Guha, "Dual Attention and Question Categorization based Visual Question Answering", IEEE Transactions on Artificial Intelligence [2022]. , doi: 10.1109/TAI.2022.3160418

5. Raghvendra Kannao, Prithwijit Guha, Bidyut Baran Chaudhuri, "Only Overlay Text: Novel Features for TV News Broadcast Video Segmentation", Multimedia Tools and Applications [2022]. , https://doi.org/10.1007/s11042-022-12917-w

6. Shikha Baghel, S.R. Mahadeva Prasanna, Prithwijit Guha, "Overlapped Speech Detection using Phase Features", The Journal of the Acoustical Society of America (JASA) , vol.150 , (DOI - 4) ,pp.2770-2781, [2021]. , https://doi.org/10.1121/10.0006614

7. Rituparna Choudhury, Shaik Rafi Ahamed, Prithwijit Guha, "Efficient Hardware Implementation of Decision Tree Training Accelerator", Springer Nature Computer Science , vol.2 , (DOI - 360) [2021]. , https://doi.org/10.1007/s42979-021-00748-9

8. Rituparna Choudhury, Shaik Rafi Ahamed, Prithwijit Guha, "

Training Accelerator for Two Means Decision Tree

", IEEE Transactions on Very Large Scale Integration (VLSI) Systems , vol.29 , (DOI - 7) ,pp.1465-1469, [2021]. , 10.1109/TVLSI.2021.3076081

9. Shikha Baghel, S.R. Mahadeva Prasanna, Prithwijit Guha, "Exploration of Excitation Source Information for Shouted and Normal Speech Classification", The Journal of the Acoustical Society of America (JASA) , vol.147 , (DOI - 2) ,pp.1250-1261, [2020]. , https://doi.org/10.1121/10.0000757

10. Raghvendra Kannao, Prithwijit Guha, "

A System for Semantic Segmentation of TV News Broadcast Videos

", Multimedia Tools and Applications , vol.79 , (DOI - 9) ,pp.6191-6225, [2020]. , https://doi.org/10.1007/s11042-019-08445-9

11. Mrinmoy Bhattacharjee, S.R. Mahadeva Prasanna, Prithwijit Guha, "Speech/Music Classification using Features from Spectral Peaks", IEEE/ACM Transactions on Audio Speech and Language Processing , vol.28 ,pp.1549-1559, [2020]. , 10.1109/TASLP.2020.2993152

12. Raghvendra Kannao, Prithwijit Guha, "Segmenting with Style: Detecting Program and Story Boundaries in TV News Broadcast Videos", Multimedia Tools and Applications , vol.78 , (DOI - 22) ,pp.31925-31957, [2019]. , https://doi.org/10.1007/s11042-019-7699-9

13. Raghvendra Kannao, Prithwijit Guha, "Success based Locally Weighted Multiple Kernel Combination", Pattern Recognition , vol.68 ,pp.38-51, [2017]. , https://doi.org/10.1016/j.patcog.2017.02.029

14. Tripuresh Mishra, Prithwijit Guha, Ashish Dutta, K.S. Venkatesh, "Stochastic Re-grasp Planning for Vision Aided Capture of Deforming and Moving Object, , Elsevier, Volume 19, Number 4, pp. 510-519, June 2009", Journal of Mechatronics (Elsevier) , vol.19 , (DOI - 4) ,pp.510-519, [2009]. , https://doi.org/10.1016/j.mechatronics.2008.12.002

15. Prasad Kulkarni, Dip Goswami, Prithwijit Guha, Ashish Dutta, "Path Planning for a Statically Stable Biped Robot using PRM and Reinforcement Learning", Journal of Intelligent and Robotic Systems , vol.47 , (DOI - 3) ,pp.197-214, [2006]. , https://doi.org/10.1007/s10846-006-9071-3

16. R. Chakrabarti, P. K. Hota, Prithwijit Guha, "Economic Load Scheduling Applying Artificial Neural Networks", Journal of the Institution of Engineers (IE India) , vol.83 ,pp.8-12, [2002].

17. M. Gopala Krishna, Prithwijit Guha, B.M. Karan, "Classification of Rolled Bloom using Artificial Neural Networks", Steel Times International , vol.26 , (DOI - 3) ,pp.26-27, [2002].

Conference Publications

1. Tiwari A., Trivedi G., Guha P., "Design of a Low Power Bfloat16 Pipelined MAC Unit for Deep Neural Network Applications", TENSYMP 2021 - 2021 IEEE Region 10 Symposium , (DOI - 10.1109/TENSYMP52854.2021.9550912) [2021].

2. Choudhury R., Ahamed S.R., Guha P., "FPGA Implementation of Low Complexity Hybrid Decision Tree Training Accelerator", Midwest Symposium on Circuits and Systems , vol.2021-August , (DOI - 10.1109/MWSCAS47672.2021.9531848) ,pp.511-514, [2021].

3. Baghel S., Prasanna S.R.M., Guha P., "Effect of high-energy voiced speech segments and speaker gender on shouted speech detection", 2021 National Conference on Communications, NCC 2021 , (DOI - 10.1109/NCC52529.2021.9530078) [2021].

4. Bhattacharjee M., Mahadeva Prasanna S.R., Guha P., "Detection of speech overlapped with low-energy music using pyknograms", 2021 National Conference on Communications, NCC 2021 , (DOI - 10.1109/NCC52529.2021.9530150) [2021].

5. Baghel S., Bhattacharjee M., Prasanna S.R.M., Guha P., "Automatic detection of shouted speech segments in Indian news debates", Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH , vol.5 , (DOI - 10.21437/Interspeech.2021-1592) ,pp.3666-3670, [2021].

6. Goel V., Chandak M., Anand A., Guha P., "Iq-vqa: Intelligent visual question answering", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.12662 LNCS , (DOI - 10.1007/978-3-030-68790-8_28) ,pp.357-370, [2021].

7. Vats S., Jain S., Guha P., "A Novel Ensemble Framework for Face Search", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.12665 LNCS , (DOI - 10.1007/978-3-030-68821-9_43) ,pp.514-528, [2021].

8. Choudhury R., Ahamed S.R., Guha P., "Efficient Hardware Implementation of Decision Tree Training Accelerator", Proceedings - 2020 6th IEEE International Symposium on Smart Electronic Systems, iSES 2020 , (DOI - 10.1109/iSES50453.2020.00055) ,pp.212-215, [2020].

9. Mishra A., Anand A., Guha P., "CQ-VQA: Visual Question Answering on Categorized Questions", Proceedings of the International Joint Conference on Neural Networks , (DOI - 10.1109/IJCNN48605.2020.9206913) [2020].

10. Bhattacharjee M., Mahadeva Prasanna S.R., Guha P., "Classification of Speech vs. Speech with Background Music", SPCOM 2020 - International Conference on Signal Processing and Communications , (DOI - 10.1109/SPCOM50965.2020.9179491) [2020].

11. Baghel S., Mahadeva Prasanna S.R., Guhal P., "Overlapped/Non-Overlapped Speech Transition Point Detection Using Bag-of-Audio-Words", SPCOM 2020 - International Conference on Signal Processing and Communications , (DOI - 10.1109/SPCOM50965.2020.9179591) [2020].

12. Baghel S., Mahadeva Prasanna S.R., Guha P., "Analysis of excitation source characteristics for shouted and normal speech classification", 26th National Conference on Communications, NCC 2020 , (DOI - 10.1109/NCC48643.2020.9056079) [2020].

13. Francis M., Guha P., "Siamese fully convolutional tracker with motion correction", Proceedings - International Conference on Pattern Recognition , (DOI - 10.1109/ICPR48806.2021.9412986) ,pp.2218-2225, [2020].

14. Mishra A., Anand A., Guha P., "Multi-stage attention based visual question answering", Proceedings - International Conference on Pattern Recognition , (DOI - 10.1109/ICPR48806.2021.9413173) ,pp.9407-9414, [2020].

15. Godbole A., Bhat S., Guha P., "Progressively Balanced Multi-class Neural Trees", 2018 24th National Conference on Communications, NCC 2018 , (DOI - 10.1109/NCC.2018.8599945) [2019].

16. Baghel S., Bhattacharjee M., Prasanna S.R.M., Guha P., "Shouted and Normal Speech Classification Using 1D CNN", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.11942 LNCS , (DOI - 10.1007/978-3-030-34872-4_52) ,pp.472-480, [2019].

17. Nakum G., Guha P., Baruah R.D., "Visual Object Tracking Using Perceptron Forests and Optical Flow", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.11942 LNCS , (DOI - 10.1007/978-3-030-34872-4_58) ,pp.525-532, [2019].

18. Kate P., Francis M., Guha P., "Visual Tracking with Breeding Fireflies using Brightness from Background-Foreground Information", Proceedings - International Conference on Pattern Recognition , vol.2018-August , (DOI - 10.1109/ICPR.2018.8546216) ,pp.2570-2575, [2018].

19. Baghel S., Mahadeva Prasanna S.R., Guha P., "Excitation source feature for discriminating shouted and normal speech", SPCOM 2018 - 12th International Conference on Signal Processing and Communications , (DOI - 10.1109/SPCOM.2018.8724482) ,pp.167-171, [2018].

20. Baghel S., Prasanna S.R.M., Guha P., "Classification of multi speaker shouted speech and single speaker normal speech", IEEE Region 10 Annual International Conference, Proceedings/TENCON , vol.2017-December , (DOI - 10.1109/TENCON.2017.8228261) ,pp.2388-2392, [2017].

21. Baghel S., Khonglah B.K., Prasanna S.R.M., Guha P., "Shouted/normal speech classification using speech-specific features", IEEE Region 10 Annual International Conference, Proceedings/TENCON , (DOI - 10.1109/TENCON.2016.7848298) ,pp.1655-1659, [2017].

22. Francis M., Guha P., "Object Tracking with Classification Score Weighted Histogram of Sparse Codes", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.10597 LNCS , (DOI - 10.1007/978-3-319-69900-4_21) ,pp.162-169, [2017].

23. Ghosh Dastidar J., Guha P., Pal S., Ahmed N., "Intrusion detection using SVM in a video sequence and signaling an alarm through Arduino UNO", Computational Science and Engineering - Proceedings of the International Conference on Computational Science and Engineering, ICCSE2016 ,pp.127-130, [2017].

24. Kannao R., Guha P., "Generic TV advertisement detection using progressively balanced perceptron trees", ACM International Conference Proceeding Series , (DOI - 10.1145/3009977.3009995) [2016].

25. Kannao R., Dandi D., Yellapu S., Guha P., "News program detection in TV broadcast videos", MM 2016 - Proceedings of the 2016 ACM Multimedia Conference , (DOI - 10.1145/2964284.2967281) ,pp.546-550, [2016].

26. Francis M., Rajesh R., Guha P., "PD-Shift: Patch detector shift based tracker", 2015 5th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics, NCVPRIPG 2015 , (DOI - 10.1109/NCVPRIPG.2015.7489997) [2016].

27. Kannao R., Guha P., "A novel local success weighted ensemble classifier", Proceedings - 3rd IAPR Asian Conference on Pattern Recognition, ACPR 2015 , (DOI - 10.1109/ACPR.2015.7486547) ,pp.469-473, [2016].

28. Rajesh R., Francis M., Guha P., "Tracking under scaling and rotations using stochastic mean shift", 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015 , (DOI - 10.1109/INDICON.2015.7443816) [2016].

29. Kannao R., Guha P., "Overlay text extraction from TV news broadcast", 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015 , (DOI - 10.1109/INDICON.2015.7443440) [2016].

30. Kannao R., Guha P., "TV advertisement detection for news channels using Local Success Weighted SVM Ensemble", 12th IEEE International Conference Electronics, Energy, Environment, Communication, Computer, Control: (E3-C3), INDICON 2015 , (DOI - 10.1109/INDICON.2015.7443801) [2016].

31. Kannao R., Guha P., "Story segmentation in TV news broadcast", Proceedings - International Conference on Pattern Recognition , vol.0 , (DOI - 10.1109/ICPR.2016.7900085) ,pp.2948-2953, [2016].

32. Shankar T., Dwivedy S.K., Guha P., "Reinforcement Learning via Recurrent Convolutional Neural Networks", Proceedings - International Conference on Pattern Recognition , vol.0 , (DOI - 10.1109/ICPR.2016.7900026) ,pp.2592-2597, [2016].

33. Saikia G., Shivagunde S., Saradhi V.V., Kannao R.D., Guha P., "Multiple kernel learning using data envelopment analysis and feature vector selection and projection", Proceedings - International Conference on Pattern Recognition , vol.0 , (DOI - 10.1109/ICPR.2016.7899686) ,pp.520-524, [2016].

34. Kannao R., Guha P., "TV commercial detection using success based locally weighted kernel combination", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.9516 , (DOI - 10.1007/978-3-319-27671-7_66) ,pp.793-805, [2016].

35. Garg S., Kumar S., Ratnakaram R., Guha P., "An occlusion reasoning scheme for monocular pedestrian tracking in dynamic scenes", AVSS 2015 - 12th IEEE International Conference on Advanced Video and Signal Based Surveillance , (DOI - 10.1109/AVSS.2015.7301781) [2015].

36. Garg S., Hassan E., Kumar S., Guha P., "A hierarchical frame-by-frame association method based on graph matching for multi-object tracking", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.9474 , (DOI - 10.1007/978-3-319-27857-5_13) ,pp.138-150, [2015].

37. Lobo M., Singh M.P., Kannao R., Guha P., "A novel method for face track linking in videos", ACM International Conference Proceeding Series , vol.14 , (DOI - 10.1145/2683483.2683551) [2014].

38. Vyas A., Kannao R., Bhargava V., Guha P., "Commercial block detection in broadcast news videos", ACM International Conference Proceeding Series , vol.14 , (DOI - 10.1145/2683483.2683546) [2014].

39. Guha P., Mukerjee A., "Unsupervised language learning for discovered visual concepts", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.7727 LNCS, issue PART 4 , (DOI - 10.1007/978-3-642-37447-0_40) ,pp.524-537, [2013].

40. Pande N., Jain M., Kapil D., Guha P., "The video face book", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.7131 LNCS , (DOI - 10.1007/978-3-642-27355-1_46) ,pp.495-506, [2012].

41. Guha P., Jain M., Pande N., Oberoi T., "Multiple face tracking with appearance modes and reasoning", Proceedings of the 2011 International Conference on Image Processing, Computer Vision, and Pattern Recognition, IPCV 2011 , vol.1 ,pp.375-380, [2011].

42. Guha P., Mukerjee A., Venkatesh K.S., "OCS-14: You can get occluded in fourteen ways", IJCAI International Joint Conference on Artificial Intelligence , (DOI - 10.5591/978-1-57735-516-8/IJCAI11-280) ,pp.1665-1670, [2011].

43. Guha P., Mukerjee A., Subramanian V.K., "Formulation, detection and application of occlusion states (Oc-7) in the context of multiple object tracking", 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2011 , (DOI - 10.1109/AVSS.2011.6027318) ,pp.191-196, [2011].

44. Guha P., Mukerjee A., Venkatesh K.S., "Activity discovery using compressed suffix trees", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.6979 LNCS, issue PART 2 , (DOI - 10.1007/978-3-642-24088-1_8) ,pp.69-78, [2011].

45. Pande N., Guha P., "OSiMa: Human pose estimation from a single image", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.6744 LNCS , (DOI - 10.1007/978-3-642-21786-9_34) ,pp.200-205, [2011].

46. Nandi S., Guha P., Venkatesh K.S., "Objects from animacy: Discovery in joint shape and haar feature space", Proceedings - 6th Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2008 , (DOI - 10.1109/ICVGIP.2008.78) ,pp.730-737, [2008].

47. Sinha A.K., Guha P., Mukerjee A., "Back to the future: Robust foreground extraction with reversed-time background modeling", Proceedings - International Conference on Pattern Recognition , (DOI - 10.1109/icpr.2008.4761526) [2008].

48. Guha P., Mukerjee A., "Language label learning for visual concepts discovered from video sequences", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4840 LNAI , (DOI - 10.1007/978-3-540-77343-6_6) ,pp.91-105, [2007].

49. Mishra T., Guha P., Dutta A., Venkatesh K.S., "Efficient continuous re-grasp planning for moving and deforming planar objects", Proceedings - IEEE International Conference on Robotics and Automation , vol.2006 , (DOI - 10.1109/ROBOT.2006.1642073) ,pp.2472-2477, [2006].

50. Guha P., Mukerjee A., Venkatesh K.S., Mitra P., "Activity discovery from surveillance videos", Proceedings - International Conference on Pattern Recognition , vol.1 , (DOI - 10.1109/ICPR.2006.209) ,pp.433-436, [2006].

51. Guha P., Palai D., Venkatesh K.S., Mukerjee A., "A multiscale co-linearity statistic based approach to robust background modeling", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3851 LNCS , (DOI - 10.1007/11612032_31) ,pp.297-306, [2006].

52. Guha P., Mukerjee A., Venkatesh K.S., "Appearance based multiple agent tracking under complex occlusions", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4099 LNAI , (DOI - 10.1007/11801603_63) ,pp.593-602, [2006].

53. Guha P., Mukerjee A., Venkatesh K.S., "Efficient occlusion handling for multiple agent tracking by reasoning with surveillance event primitives", Proceedings - 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, VS-PETS , vol.2005 , (DOI - 10.1109/VSPETS.2005.1570897) ,pp.49-56, [2005].

54. Guha P., Palai D., Goswami D., Mukerjee A., "DynaTracker: Target tracking in active video surveillance systems", 2005 International Conference on Advanced Robotics, ICAR '05, Proceedings , vol.2005 , (DOI - 10.1109/ICAR.2005.1507473) ,pp.621-627, [2005].

55. Guha P., Vaghela P., Mitra P., Venkatesh K.S., Mukerjee A., "Hybrid hierarchical learning from dynamic scenes", Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3776 LNCS , (DOI - 10.1007/11590316_28) ,pp.212-217, [2005].