Reimagining digital preservation of music for Alzheimer’s disease: can artificial intelligence be a potential mirror of musical patterns that matter? An online lecture by DCI 2020-21 fellow Dr Yunhyong Kim University of Glasgow hosted by the Digital Curation Institute, Faculty of Information, University of Toronto Friday, 22 May 2020, 12 noon – 1:30pm Online The […]
Please join us for this timely talk and conversation at the Faculty of Information at the Bissell building in room BL528 on September 12 at 4pm! Abstract: How does data structure the political economy of Airbnb, Uber, and other leading “platform economy” firms? In this presentation, I answer this question through a spatial-big-data examination of short-term […]
DCI Lecture, Feb 14 2019, 4pm Please join us for the next DCI Lecture on February 14 at the Bissell building, BL728 (7th floor), at 4pm. Light refreshments will be served. Prof. Feenberg will speak for about an hour, followed by a moderated Q&A period. Andrew Feenberg is Canada Research Chair in Philosophy of Technology in […]
Save the date! Friday, 28 September 2018, 5pm, BL 728 We would like to invite you to the next DCI Lecture, “ Data Reuse from the Reusers Point of View”, given by leading archival studies and digital preservation scholar, Dr Elizabeth Yakel. Please join us at 5pm on September 28, 2018 for a DCI Lecture in BL […]
Please join us on October 15, 2018 at 4:15pm in BL 728 for a DCI Lecture by Niels Brügger, Professor in Internet Studies and Digital Humanities at Aarhus University, Denmark. Abstract: Since the online web disappears rapidly, any scholar who wants to include the web of the past in his study has to rely on that someone has archived […]
Please join us at 4pm on May 10, 2018 for a DCI Lecture in BL 417 (Bissell building, 4th floor) by this year’s Fellow at the DCI, Dr. Marie Ferrario from the University of Lancaster! Values in Computing, Connecting the Bits. As we watch digital technology unleashing its power on personal, social and organizational aspects of human life, we […]
Please join us at 4pm on September 21, 2017 for a DCI Lecture in BL 728 (Bissell building, 7th floor) by Prof. Cecilia Aragon from the University of Washington.
Extraordinary advances in our ability to acquire and generate data are transforming the fundamental nature of discovery across domains. Much of the research in the field of data science has focused on automated methods of analyzing data such as machine learning and new database techniques. However, the human aspects of data science, including how to maximize scientific creativity and human insight, how to address ethical concerns, and the consideration of societal impacts, are vital to the future of data science. Human-centered data science is a necessary part of the success of 21st century discovery. I will discuss promising research in this area, describe ongoing initiatives at the UW eScience Institute, and speculate upon future directions for data science.
Cecilia Aragon is a Professor in the Department of Human Centered Design & Engineering, Senior Data Science Fellow at the eScience Institute, and Director of the Human Centered Data Science Lab at the University of Washington in Seattle, US. She earned her Ph.D. in computer science from UC Berkeley in 2004. Her research focuses on human-centered data science, an emerging field at the intersection of computer-supported cooperative work (CSCW) and the statistical and computational techniques of data science. She has published over 200 papers in the areas of HCI, CSCW, data science, visual analytics, machine learning, and astrophysics. In 2008, she received the Presidential Early Career Award for Scientists and Engineers (PECASE), the highest honor bestowed by the US government on outstanding scientists in the early stages of their careers.
Lab website: https://depts.washington.edu/hdsl/
Faculty website: http://faculty.washington.edu/aragon
Please join us at 4pm on March 16th for a lecture in BL 728 (Bissell building, 7th floor) by the inaugural Marshall McLuhan Centenary Fellow in Digital Sustainability, Professor Ian Milligan (Waterloo, History). Milligan’s lecture, “Unleashing the Infinite Archive: Exploring Born-Digital Cultural Heritage at Scale through Interdisciplinary Collaboration,” explores several issues that are central to the DCI’s mission. […]
The Digital Curation Institute is pleased to announce a panel discussion on Thursday, March 2nd at 4pm in Bissell Room 507. All are welcome, and light refreshments will be provided. This panel, “Studying the Past Through Technology: An Interdisciplinary Roundtable,” brings diverse perspectives to bear on the question of how we can study the past […]
The sheer amount of social, cultural, and political information that is generated and, crucially, preserved every day presents new exciting opportunities to historians. A large amount of this information is being contained within web archives, which contain billions of web pages. Scholars broaching topics dating back to the mid-1990s will find their projects enhanced by web data – military historians can use forum posts by soldiers, social historians can track aspects of everyday life through blogs and comments, political historians can study changing sentiment, tropes, and link structures, and economic historians can explore the rise and fall of businesses webpages. Yet this tremendous opportunity is mitigated to some degree by the sheer challenge of dealing with all that data: we have more information than ever before, but the scale is overwhelming.
We have several common tensions, however, beyond basic ones of having enough storage and computational power to deal with all of this information. I will focus on two. The first is that while historians largely want to work with content, technological limitations push us towards rich metadata. The second is that without basic understanding of the conceptual structure of the web archive, from crawl structure to the biases, we can generate wildly misleading results – a problem for historians with most digitized sources.
In this talk, I explore these tensions as they have played out over three case studies that I have studied: the Internet Archive’s March-December 2011 Wide Web Scrape (WARC files), the 2009 GeoCities end-of-life torrent (a wget-compiled collection of mirrored websites), and the 2005-Present Archive-It collections of Canadian political parties, unions, and organizations (WAT files, which contain derivative data). For each archive, I briefly discuss the usage, technical, and ethical challenges that such collections present for historians: problems of too much data, processing time, and the difficulties in applying cutting-edge natural language processing.
Ian Milligan is an assistant professor of digital and Canadian history at the University of Waterloo. There, he is principal investigator of the web archives for historical research group (https://uwaterloo.ca/web-archive-group/), which is supported by an Ontario Early Researcher Award and SSHRC. Milligan serves as a co-editor of the Programming Historian (programminghistorian.org). He has published several articles looking at the impact of born-digital sources on historians and has a forthcoming co-authored book, Exploring Big Historical Data: The Historians’ Macroscope on digital methods with Imperial College Press. His first book, Rebel Youth: 1960s Labour Unrest, Young Workers, and New Leftists in English Canada, appeared in 2014.
The lecture takes place at 16:00-17:30 on Thursday, 29th of October 2015, in Room 728 (7th floor) at the iSchool, Bissell Building, 140 St. George Street.