Utilizing Social Media Data for Estimating Transit Performance Metrics in a Pre- and Post-COVID-19 World


Public transit is a key mode of transportation in megacities like NYC. The performance metrics of public transit such as journey time, service provision, station and subway environment, etc. are measured continually and provided to the public by the MTA for each month.  The perception of transit service by the customer is also seen as a measure that may influence the choice of transit as a mode by the user. The perception of transit users can be affected by different factors ranging from, delay, availability of elevators, cleanliness of the station environment, and subway car facilities. In a post-COVID 19 lockdown time period, the transit service frequencies and ridership will be impacted. Also, the public perception of service may be based on completely different factors such as transit agencies’ safeguards against public health risks. 

This perception of transit can be captured through the sentiment of posts made by users on social media. Performance metrics based on user perception can provide the service provider a customer-facing view of their service and help enhance their service and address the user concerns, especially from a public health standpoint in a post-COVID lockdown world, appropriately.

Research Objectives

Objectives of the project include the following:

  • Literature Review: Entails a review of literature of past studies that utilized social media in the domain of public transit. Social media has been used by agencies to address customer service issues and information dissemination. Few studies also propose the use of social media to assess customer perception of public transit service. Various methodologies and metrics proposed in the literature for customer perception of transit will be compiled in this task.
  • Collecting & processing transit data: This task is to collect and analyze real-time transit data to extract relevant performance metrics. Data includes real-time MTA transit data, service alert data, and service metrics comparisons before and after COVID-19.
  • Social media data: Involves the collection, processing, and analysis of social media posts regarding public transit in NYC. Data will be compiled, extracted, and analyzed for the customer sentiments within the media posts both before and after the pandemic.
  • Co-analysis of social media and transit data: Entails a comparison of transit performance metrics estimated with user sentiments. User-facing metrics will be evaluated.
  • Production of a final report presenting the findings of all the tasks on data collection, processing, and analysis as a final report.
  • Production of final metrics indicating public perception of subway service, station environment, and public health measures before and after COVID-19 lockdown based on social media
  • Depending on the findings of the project, an algorithm to generate the metrics measuring public perception of transit service will be made available.


Camille Kamga

CCNY University Lead, C2SMART

Camille Kamga is the Principal Investigator on this project.

Sandeep Mudigonda

Senior Research Associate, CCNY

Sandeep Mudigonda is a Co-Principal Investigator on this project.