Master’s Thesis Presentation • Artificial Intelligence — Analysis of Textual and Non-Textual Sources of Sentiment in GitHubExport this event to calendar

Tuesday, May 26, 2020 10:00 AM EDT

Please note: This master’s thesis presentation will be given online.

Nalin De Zoysa, Master’s candidate
David R. Cheriton School of Computer Science

GitHub is a collaborative platform that is used primarily for the development of software. In order to gain more insight into how teams work on GitHub, we wish to analyze the sentiment content available via communication on the platform.

In order to do so, we first use existing sentiment analysis classifiers and compare the GitHub data to other social networks, Twitter and Reddit. By identifying that users are able to provide reactions to other users posts on GitHub, we use this as an indicator or label of sentiment information. Using this we first investigate whether repeated user interaction has an impact on sentiment and find that it is positively correlated to the amount of prior interaction as well as the directness of interaction. We also investigate if metrics corresponding to a user’s status or power in a project correlate with positive sentiment received and find that it does.

We then build sentiment classifiers using both textual and non-textual information, both which outperform the generic sentiment scorer systems. In addition we show that a sentiment classifier built using only non-textual information can perform at a comparable level to a text-based classifier, indicating that there is significant sentiment information contained in non-textual information in the GitHub network.

Location 
Online presentation
200 University Avenue West

Waterloo, ON N2L 3G1
Canada

S M T W T F S
29
30
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
1
2
  1. 2024 (96)
    1. April (19)
    2. March (27)
    3. February (25)
    4. January (25)
  2. 2023 (296)
    1. December (20)
    2. November (28)
    3. October (15)
    4. September (25)
    5. August (30)
    6. July (30)
    7. June (22)
    8. May (23)
    9. April (32)
    10. March (31)
    11. February (18)
    12. January (22)
  3. 2022 (245)
  4. 2021 (210)
  5. 2020 (217)
  6. 2019 (255)
  7. 2018 (217)
  8. 2017 (36)
  9. 2016 (21)
  10. 2015 (36)
  11. 2014 (33)
  12. 2013 (23)
  13. 2012 (4)
  14. 2011 (1)
  15. 2010 (1)
  16. 2009 (1)
  17. 2008 (1)