The RickyRenuncia Project's main objective is to preserve digital materials (video, audio, images, news, tweets, among others) related to the resignation of the Governor of Puerto Rico, Ricardo Rosselló. The RickyRenuncia Project created three case studies using Twitter data to share with Information Sciences students and educators alike the challenges and opportunities we have encountered while working on our project. The examples highlight our approach and tools used to capture, curate, and standardize social media data for long-term curation.
Capturing Twitter data by Joel Blanco: This notebook uses the #RickyRenuncia dataset to guide students through the process of capturing Twitter data using Twarc. It includes information about exporting the dataset to GeoJSON format and generating reports and visualizations using Twarc-report. It also provides information about version 2 of Twarc and how to use it with Twitter version 2 API.
Multimedia Rating an Approach to Curation by Eduardo Beltran: This notebook accesses a subset from the prehydrated sample offered to the participants to construct rankings due to multiple public metrics available in the Twitter API. It is currently working on top of API v1.1 as the v2 API has inconsistencies that have changed over the past 4 months in the delivery of this information. This notebook also proposes an interactive curation method to filter out any unwanted or unrelated documents directly from the Jupyter notebooks interface, as reused memes without direct artistic relevance to the incident are reused within the captured dataset.
Mapping Twitter data to a Social Media Posting compliant RDF by Irmarie Fraticelli: This Jupyter Notebook has as an objective to introduce students to RDF and Schema.org—collaborative community activity with a mission to create, maintain, and promote schemas for structured data on the online and/or electronic format. The notebook will show how to map Twitter data to a Social Media Posting compliant RDF. The notebook also presents mechanisms to insert data to a Content Management System (CMS), such as Omeka-S for long-term preservation.