University of Scranton Web Archives Collection Guide

University of Scranton Web Archives

Collection Guide


Periodic captures of University of Scranton web content, including the main University website ( as well as related sites (such as and
Subset of University of Scranton Archives
University of Scranton websites
Date Range
2000 - Present

Access and Use

Digital Access

Access to captured web content is publicly available via our Archive-It portal.

Physical Access

No physical access is available for materials in this collection.


Many of the elements incorporated into the Web sites (e.g., photographs, articles, graphical representations) are protected by copyright. The materials may also be subject to publicity rights, privacy rights, or other legal interests.

Responsibility for making an independent legal assessment of an item and securing any necessary permissions ultimately rests with the person desiring to use the item. You will need permission from the copyright owners or rights holders for reproduction, distribution, or other use of protected items beyond that allowed by fair use or other statutory exemptions. Researchers should consult the sites themselves for information about rights, contacts, and permissions.

Preferred Citation

[Web site title], [date], University of Scranton Web Archives. [Wayback Machine URL (e.g.,]

Linked Data


Scope and Content Note

The University of Scranton Web Archives Collection includes official University sites (like,, and, event-specific sites (like, decommissioned sites (like, and affiliated but external sites (like We capture publicly available content from these sites, including images, PDF documents, and embedded audio/video media. We do not capture restricted or sensitive content on password-protected pages, such as the my.scranton portal or the Desire2Learn learning management system. We plan to capture official University accounts on social media platforms (including Facebook, Twitter, YouTube, Flickr, and Instagram) in the near future.


Browse the collection's seed catalog

Acquisition Information

The Weinberg Memorial Library partners with Archive-It, a subscription service of the Internet Archive, to capture, preserve, and make accessible web content created by and related to the University of Scranton community.

The Library selects relevant web domains or particular websites (called seeds) for capture, and then directs Archive-It's web crawler to periodically create copies to ensure long-term access. After crawling the web site or web domain, the captured content is indexed and published for access.

Processing Notes


We generally crawl major University websites (like on a bimonthly basis, although some dynamic pages (like Royal News) are crawled more frequently.

Data Extraction

Archive-It's data extraction service allows data for specific hosts in Internet Archive's General Archive to be transferred into Archive-It collections. In 2013, the Library contracted with Archive-It to extract General Archive data for several hosts into our Archive-It collection, making the materials (dated 2000-2013) both full-text searchable and better integrated with the Library's crawls. Hosts extracted were:



Back up copies of WARC files from web crawls are stored in the Library’s DuraCloud repository for long-term preservation. Deposits are automated using Archive-It Sync.


Suggest a Site

Do you know of a University-related website or page that we should preserve? Please email the URL and a brief explanation to

Questions or Concerns

Please contact us with any questions or concerns regarding our crawls or captured collections, including requests for content removal.