University of Scranton Web Archives Collection Guide

Summary

Abstract
Periodic captures of University of Scranton web content, including the main University website (www.scranton.edu) as well as related sites (such as admissions.scranton.edu and athletics.scranton.edu).
Relation
Subset of University of Scranton Archives
Source
University of Scranton websites
Date Range
2000 - Present
Language(s)
English

Access and Use

Digital Access

Captured web content is publicly available via our Archive-It portal.

Physical Access

No physical access is available for materials in this collection.

Rights

Many of the elements incorporated into the websites (e.g., photographs, articles, graphical representations) are protected by copyright. The materials may also be subject to publicity rights, privacy rights, or other legal interests.

Responsibility for making an independent legal assessment of an item and securing any necessary permissions ultimately rests with the person desiring to use the item. You will need permission from the copyright owners or rights holders for reproduction, distribution, or other use of protected items beyond that allowed by fair use or other statutory exemptions. Researchers should consult the sites themselves for information about rights, contacts, and permissions.

Preferred Citation

[Web site title], [date], University of Scranton Web Archives. [Wayback Machine URL (e.g., http://wayback.archive-it.org/3006/20140717175752/http://125th.scranton.edu/#)]
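
For researchers who assemble citations programmatically, the template above can be sketched in Python. This is only an illustration, not an official tool: the function names are hypothetical, the base URL and collection number (3006) are copied from the example URL above, the 14-digit segment is read as a year-month-day-hour-minute-second capture timestamp, and the written date style (e.g., "July 17, 2014") is an assumption, since the template does not prescribe one.

```python
from datetime import datetime

# Both values are copied from the example URL in the citation template.
WAYBACK_BASE = "http://wayback.archive-it.org"
COLLECTION_ID = "3006"


def wayback_url(original_url: str, captured_at: datetime) -> str:
    """Build a Wayback Machine URL for one capture (hypothetical helper)."""
    # The example URL carries a 14-digit timestamp: YYYYMMDDhhmmss.
    timestamp = captured_at.strftime("%Y%m%d%H%M%S")
    return f"{WAYBACK_BASE}/{COLLECTION_ID}/{timestamp}/{original_url}"


def format_citation(site_title: str, original_url: str, captured_at: datetime) -> str:
    """Fill in the preferred-citation template (hypothetical helper)."""
    # Assumed date style: month name, day, year. The template only says [date].
    date_text = f"{captured_at:%B} {captured_at.day}, {captured_at.year}"
    return (
        f"{site_title}, {date_text}, University of Scranton Web Archives. "
        f"{wayback_url(original_url, captured_at)}"
    )


if __name__ == "__main__":
    # The site title here is a placeholder; use the title shown on the captured page.
    capture = datetime(2014, 7, 17, 17, 57, 52)
    print(format_citation("125th Anniversary", "http://125th.scranton.edu/", capture))
```

The capture timestamp can be copied directly from the address bar when viewing a captured page in the Archive-It portal.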


Contents

Scope and Content Note

The University of Scranton Web Archives Collection includes official University sites (like www.scranton.edu, admissions.scranton.edu, and athletics.scranton.edu), event-specific sites (like 125th.scranton.edu), decommissioned sites (like academic.scranton.edu), and affiliated but external sites (like www.thescrantonplayers.com). We capture publicly available content from these sites, including images, PDF documents, and embedded audio/video media. We do not capture restricted or sensitive content on password-protected pages, such as the my.scranton portal or the Desire2Learn learning management system. We plan to capture official University accounts on social media platforms (including Facebook, Twitter, YouTube, Flickr, and Instagram) in the near future.

Inventory

Browse the collection's seed catalog


Acquisition Information

The Weinberg Memorial Library partners with Archive-It, a subscription service of the Internet Archive, to capture, preserve, and make accessible web content created by and related to the University of Scranton community.

The Library selects relevant web domains or particular websites (called seeds) for capture, and then directs Archive-It's web crawler to periodically create copies to ensure long-term access. After the crawler finishes with a website or web domain, the captured content is indexed and published for access.


Processing Notes

Frequency

We generally crawl major University websites (like www.scranton.edu) on a bimonthly basis, although some dynamic pages (like Royal News) are crawled more frequently.

Data Extraction

Archive-It's data extraction service allows data for specific hosts in the Internet Archive's General Archive to be transferred into Archive-It collections. In 2013, the Library contracted with Archive-It to extract General Archive data for several hosts into our Archive-It collection, making the materials (dated 2000-2013) both full-text searchable and better integrated with the Library's crawls. The extracted hosts were:

  • www.scranton.edu
  • matrix.scranton.edu
  • www.uofs.edu
  • academic.scranton.edu
  • academic.uofs.edu

Preservation

Backup copies of WARC files from web crawls are stored in the Library’s DuraCloud repository for long-term preservation. Deposits are automated using Archive-It Sync.

Other

Suggest a Site

Do you know of a University-related website or page that we should preserve? Please email the URL and a brief explanation to digitalcollections@scranton.edu.

Questions or Concerns

Please contact us with any questions or concerns regarding our crawls or captured collections, including requests for content removal.