ARCH (Archives Research Compute Hub) is a platform for building research collections, analyzing them computationally, and generating datasets from terabytes and even petabytes of data. ARCH supports the open publication and preservation of user-generated datasets created from thousands of libraries, archives, and memory organizations worldwide, giving researchers, students, and information professionals the power to study and understand digital collections in new ways.
Streamline Data-Driven Research
ARCH leverages the Internet Archive’s non-profit infrastructure and open-source tools to streamline computational use of digital collections. Librarians, collection managers, and educators can provide ARCH to their researchers and students in order to facilitate sophisticated research processes that would otherwise require coding/scripting skills and significant computing resources.
ARCH was made possible in part by funding from the Mellon Foundation and via a long-running collaboration with the Archives Unleashed project of the University of Waterloo and York University.