DIMACS
DIMACS REU 2020

General Information

Student: Jessica Moreira
School: Barnard College
Major: Economics
Minor: Education
E-mail: jdm2240@barnard.edu
Project: The Impact of COVID-19 on the Internet Usage

Project Goals

As the pandemic hit the US, states started to adopt quarantine and lockdown policies and, thus, people started staying at home more. In this new social context, people have been using the Internet on a larger scale to connect with friends and family, to work, and to study. Thus, the goals of this project are to understand how the Internet has been impacted by quarantine and lockdown policies and to infer, through data analysis, how people's Internet usage behavior has changed during this period.

Weekly Log

Week 1 (05/25-05/31):

During the first week, I focused on learning in more depth about the Federal Communications Commission's (FCC) Broadband Measurement Data, as this is the data set that we will be analyzing. I read the latest report that the FCC released and could understand how the measurements were made as well as the components of the Internet that they tested. In addition, I studied the fundamentals of how the Internet works and learned about protocols, data transmission, and related topics. Besides being mentored by Dr. Henning Schulzrinne, I have also been mentored by Jan Janak, a PhD student at the Internet Real-Time Laboratory. In this first week, Jan helped me learn the fundamentals of the Internet by explaining some topics to me and answering questions that I had.

In addition, I attended the introductory events of DIMACS and Barnard Computer Science Summer Research, and worked on my initial DIMACS presentation, scheduled for next Tuesday. I also met with Jan Janak on Monday, Wednesday (Dr. Schulzrinne was also on the call) and Friday so we could discuss the project and its progress.


Week 2 (06/01-06/07):

During the second week of research, my main goal was to test possible platforms that I can use throughout the summer to analyze data. I started testing Datalab - an application on Google Cloud - in which it is possible to analyze and visualize data through Jupyter notebooks. I uploaded the FCC's data for March and started graphing it so I could start analyzing how the quarantine has impacted Internet usage. However, while I was learning how to navigate the platform, an incident happened - the region and zone that my Google Cloud and Datalab account was connected to stopped working, and in order to be able to use it again, I had to set a new default region and zone as well as a new VM. While doing so, I ended up losing all the data and the progress that I had made with my code. I couldn't recover it, so I created a new notebook and started from scratch. Through this incident, I also learned how to download the notebooks, and now every time that I need to log off from Datalab, I download the notebooks in case this incident happens again.

Moreover, I had my weekly meeting with Jan Janak and Dr. Schulzrinne on Wednesday and also talked to Jan on Monday, besides giving my first presentation about my project to the other DIMACS REU participants on Tuesday. I also attended the talk with Dr. Lydia Chilton on Friday and worked on my website page.

Week 3 (06/08-06/14):

During the third week, I continued to work on the graphs of total bytes downloaded and average download speed for the months of January, February and March. In addition, I looked more closely at the characteristics of the households from which the FCC collects its data, and I also participated in the data analysis bootcamp hosted by DIMACS. Moreover, I started to work on a more detailed analysis of the data, going from a monthly analysis to a weekly analysis. I also had meetings with my supervisor and mentor.

Week 4 (06/15-06/21):

During this week, I continued to work on the more detailed graphs of download speed and total bytes downloaded per week. I computed the ratios of total bytes downloaded and of average download speed between March and January and between March and February, and I computed the same ratios for total bytes uploaded and average upload speed. Moreover, I produced the graphs for these ratios and attended the events hosted by DIMACS REU. Finally, I also attended the weekly talk of the FCBA Summer Series.
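
A minimal sketch of how such month-over-month ratios could be computed, assuming the measurements have already been reduced to (date, byte count) pairs. The record layout, the function names and the sample numbers are illustrative assumptions, not the FCC data set's actual schema or values.

```python
from collections import defaultdict

def monthly_totals(records):
    """Sum byte counts per (year, month).

    Each record is a hypothetical (iso_date, byte_count) pair.
    """
    totals = defaultdict(int)
    for iso_date, byte_count in records:
        year, month, _day = iso_date.split("-")
        totals[(int(year), int(month))] += byte_count
    return dict(totals)

def month_ratio(totals, numerator, denominator):
    """Ratio of one month's total to another's, e.g. March over January."""
    return totals[numerator] / totals[denominator]

# Made-up sample records, not real measurements.
sample = [
    ("2020-01-10", 100), ("2020-01-20", 100),
    ("2020-02-15", 250),
    ("2020-03-05", 150), ("2020-03-25", 150),
]

totals = monthly_totals(sample)
march_vs_january = month_ratio(totals, (2020, 3), (2020, 1))
march_vs_february = month_ratio(totals, (2020, 3), (2020, 2))
print(march_vs_january, march_vs_february)
```

A ratio above 1 means more bytes were transferred in March than in the comparison month. The same functions would apply to upload totals.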

Week 5 (06/22-06/28):

This week I computed more detailed ratios of total bytes downloaded and uploaded, and of average download and upload speed, between March and January and between March and February, separately for weekdays and weekends. I also started to locate the households with heavier download and upload usage as well as the households with lighter download and upload usage. Moreover, I participated in the IFIP Networking 2020 conference and started reading some papers that my supervisor recommended on the same issue that I have been investigating.
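
The weekday and weekend split described above can be sketched as follows. This is only an illustration under assumptions: the (date, byte count) record layout and the sample values are hypothetical, and Saturday and Sunday are taken as the weekend.

```python
from datetime import date

def split_by_day_type(records):
    """Sum byte counts separately for weekdays and weekends.

    Each record is a hypothetical (iso_date, byte_count) pair.
    date.weekday() returns 0 for Monday through 6 for Sunday.
    """
    totals = {"weekday": 0, "weekend": 0}
    for iso_date, byte_count in records:
        day = date.fromisoformat(iso_date)
        key = "weekend" if day.weekday() >= 5 else "weekday"
        totals[key] += byte_count
    return totals

# Made-up sample records, not real measurements.
january = [("2020-01-06", 100), ("2020-01-04", 50)]   # Monday, Saturday
march = [("2020-03-02", 150), ("2020-03-07", 100)]    # Monday, Saturday

jan_totals = split_by_day_type(january)
mar_totals = split_by_day_type(march)

# March-over-January ratio for each day type.
ratios = {key: mar_totals[key] / jan_totals[key] for key in jan_totals}
print(ratios)
```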

Week 6 (06/29-07/05):

During this week, I focused on locating the light users and calculating their ratios of download and upload data consumed between March and January of 2020, and I also started some geographical analysis. Moreover, I read some reference papers related to my research topic and created a reference section in my paper based on the work that other people have done on changes to the Internet due to COVID-19. I also had to deal with some memory problems on Google Cloud, and I attended the DIMACS events.
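
One possible way to separate light and heavy users is to rank households by total bytes and take the bottom and top share of the ranking. The sketch below assumes a 25% cutoff and hypothetical household identifiers and totals; the log does not state which threshold was actually used.

```python
def classify_households(totals_by_household, fraction=0.25):
    """Return the lightest and heaviest households by total bytes.

    `fraction` is an assumed cutoff: the bottom and top share of the
    ranking that count as light and heavy users respectively.
    """
    ranked = sorted(totals_by_household, key=totals_by_household.get)
    k = max(1, int(len(ranked) * fraction))
    return {"light": ranked[:k], "heavy": ranked[-k:]}

# Made-up totals per household, not real measurements.
sample_totals = {
    "u1": 5, "u2": 40, "u3": 12, "u4": 300,
    "u5": 75, "u6": 8, "u7": 150, "u8": 60,
}

groups = classify_households(sample_totals)
print(groups)
```

Ranking avoids choosing an absolute byte threshold in advance, at the cost of making the groups depend on the set of households in the sample.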

Week 7 (07/06-07/12):

This week I started calculating the data downloaded and uploaded in 2019 in order to analyze the trends of Internet consumption that would be considered "normal" or "expected" if the COVID-19 outbreak had not happened. With these trends as a baseline, it becomes possible to measure the real impact of COVID-19 on Internet usage. I also attended the DIMACS events and started using some other tables available in the data set.
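
As an illustration of how a 2019 baseline could be used, the sketch below divides the 2020 March-over-January ratio by the same ratio for 2019, so that the usual seasonal change is factored out. This normalization is an assumption made for the example, not necessarily the method used in the project, and the totals are made up.

```python
def seasonal_ratio(totals, year, later_month=3, earlier_month=1):
    """Ratio of a later month's total to an earlier month's in one year."""
    return totals[(year, later_month)] / totals[(year, earlier_month)]

def baseline_adjusted_ratio(totals, year, baseline_year):
    """Change in `year` relative to the change seen in `baseline_year`.

    A value above 1 means usage grew more than the baseline year's
    seasonal trend alone would suggest.
    """
    return seasonal_ratio(totals, year) / seasonal_ratio(totals, baseline_year)

# Made-up monthly totals keyed by (year, month), not real measurements.
sample_totals = {
    (2019, 1): 200, (2019, 3): 210,
    (2020, 1): 220, (2020, 3): 330,
}

expected = seasonal_ratio(sample_totals, 2019)
observed = seasonal_ratio(sample_totals, 2020)
adjusted = baseline_adjusted_ratio(sample_totals, 2020, 2019)
print(expected, observed, adjusted)
```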

Week 8 (07/13-07/19):

During this week, we found some errors in the data set that we were using, which invalidated some of the results that I had already found. As a result, I had to redo some experiments and analyses and generate new graphs using the right data. For instance, I generated graphs of the average download and upload data consumed, of the ratios of average consumption between March and January of 2020 for weekdays and weekends and for heavy and light users, and of the regional analysis. In addition, I started preparing my final DIMACS presentation and attended DIMACS and Barnard events.

Week 9 (07/20-07/26):

During this week, I continued updating my graphs and analysis with the right data sets and started preparing some more polished graphs to include in the paper. I also finished preparing my final DIMACS presentation and presented it to the participants of the DIMACS REU program. In addition, I attended the DIMACS events.

Week 10 (07/27-07/31):

During my final week of work this summer, I focused on writing the first draft of my research paper. I also attended the Barnard CS final event and conducted some final experiments and analysis with the data set.

Presentations

  • First Presentation
  • Final Presentation

My Mentors

  • Dr. Henning Schulzrinne
  • PhD student Jan Janak

Acknowledgements

I would like to thank my supervisor, Professor Henning Schulzrinne, and my mentor, Jan Janak, for all their guidance and help this summer; the Computer Science department at Barnard College and Craig Newmark Philanthropies for making the funding of my project possible; and the DIMACS REU for making this experience even more meaningful.