The World Wide Web is the single largest repository of digital culture and knowledge. By strategically collecting, analyzing, and visualizing web data, business intelligence can extract decision-relevant insights, digital social sciences can explore current societal trends and social networks, and digital humanities can study cultural and historical questions using digital media. Additionally, the web is a focal point of computer science research for developing information systems and AI applications.
The Immersive Web Observatory (IWO), a BMBF-funded infrastructure at the Digital Bauhaus Lab at Bauhaus-Universität Weimar, to which we have access, leverages this potential by providing an extensive web crawl corpus encompassing 8 Petabytes of data, covering both current and historical web content taken from the Internet Archive’s web archive. It is an invaluable data resource for projects across various disciplines, particularly in information retrieval, data mining, and visualization. The IWO further facilitates knowledge and technology transfer to local businesses through project collaborations, demonstrators, and open-access publications, and fosters the training of data scientists specializing in big data and cognitive computing.
Further Information: https://www.uni-weimar.de/fileadmin/user/uni/dezernate/dfo/TOP-Projekte/2017/2017_14_Hagen_IWO-buw-projektbeschreibung.pdf