COLLEGE OF COMPUTER SCIENCE NYCU
All are welcome to join this seminar!
Please fill out the form to register: https://forms.gle/6Gmnhw6bYyWFsTcJ6
Organized by the College of Computer Science
【Speaker】
Prof. J. Stephen Downie, Associate Dean
School of Information Sciences, University of Illinois at Urbana Champaign
【Topic】
Open Access Data for Open Community Development: A TORCHLITE Project Update
【Time】113/04/24 (Wed), 15:30-16:30
【Location】EC122、EC114
【Abstract】
The HathiTrust Research Center (HTRC) provides analytic access to 18 million volumes found in the HathiTrust Digital Library (HTDL). Roughly 10 million of the volumes in the collection are under copyright restrictions, and cannot be freely shared with researchers. In an effort to provide more open access to HathiTrust’s materials, the HTRC has released its Extracted Features 2.0 Dataset which contains over 2.9 trillion unigram tokens found on each of the 6 billion pages in the corpus. This talk provides a briefing on HTRC’s ongoing “Tools for Open Research and Computation with HathiTrust: Leveraging Intelligent Text Extraction” (TORCHLITE) project. Funded by the National Endowment for Humanities (NEH), TORCHLITE strives to create easy-to-use text analysis tools, dashboards and application programming interfaces (APIs) to facilitate open community research and development use of the uniquely valuable HTDL data. The talk will highlight motivations, challenges, and accomplishments of the TORCHLITE to date, along with its upcoming next steps.
【Biography】J. Stephen Downie is the associate dean for research and a professor at the School of Information Sciences at the University of Illinois Urbana Champaign. He is the also the Illinois codirector of the HathiTrust Research Center since 2012. He has been an active participant in the digital libraries and digital humanities research domains. He is best known for helping to establish a vibrant music information retrieval research community. Since 2005, he has directed the annual Music Information Retrieval Evaluation eXchange (MIREX). He also was a founder of the International Society Music Information Retrieval (ISMIR) and its first president.