WARC
Definition
Web ARChive: an ISO standard file format (ISO 28500) for storing web crawl data, including HTTP headers, page content, and metadata. It is used by the Internet Archive and other web preservation projects.
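To make the layout concrete, here is a minimal sketch of reading a single uncompressed WARC record in Python. It assumes the usual record shape: a version line, named header fields, a blank line, then a content block whose size is given by Content-Length. The function name read_warc_record and the sample record are hypothetical, made up for illustration. Real archives are usually gzip-compressed and are better handled by a dedicated library.

```python
import io


def read_warc_record(stream):
    """Read one WARC record from a binary stream.

    Returns (version, headers, block). Hypothetical helper for illustration;
    it does no validation beyond checking the version line.
    """
    version = stream.readline().strip().decode("utf-8")
    if not version.startswith("WARC/"):
        raise ValueError(f"not a WARC record: {version!r}")

    # Named fields run until the first blank line.
    headers = {}
    while True:
        line = stream.readline()
        if line in (b"\r\n", b"\n", b""):
            break
        name, _, value = line.decode("utf-8").partition(":")
        # Field names are treated as case-insensitive, so normalize them.
        headers[name.strip().lower()] = value.strip()

    # The content block is exactly Content-Length bytes long.
    block = stream.read(int(headers["content-length"]))
    stream.read(4)  # skip the two CRLFs that terminate the record
    return version, headers, block


# A made-up response record: the block holds the captured HTTP response,
# including its own HTTP headers and the page content.
http_block = (
    b"HTTP/1.1 200 OK\r\n"
    b"Content-Type: text/html\r\n"
    b"\r\n"
    b"<html>hi</html>"
)
header_lines = [
    b"WARC/1.0",
    b"WARC-Type: response",
    b"WARC-Target-URI: http://example.com/",
    b"WARC-Date: 2020-01-01T00:00:00Z",
    b"WARC-Record-ID: <urn:uuid:00000000-0000-0000-0000-000000000000>",
    b"Content-Type: application/http; msgtype=response",
    b"Content-Length: " + str(len(http_block)).encode("ascii"),
]
sample = b"\r\n".join(header_lines) + b"\r\n\r\n" + http_block + b"\r\n\r\n"

version, headers, block = read_warc_record(io.BytesIO(sample))
print(version, headers["warc-type"], headers["warc-target-uri"], len(block))
```

The sketch shows how the three parts of the definition map onto a record: the metadata lives in the WARC header fields, while the HTTP headers and page content travel together inside the content block.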