<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Digital Preservation Q&amp;A - Recent questions tagged scope</title>
<link>https://qanda.digipres.org/tag/scope</link>
<description>Powered by Question2Answer</description>
<item>
<title>How should you scope a crawl for web archiving online discussion forums</title>
<link>https://qanda.digipres.org/74/should-you-scope-crawl-archiving-online-discussion-forums</link>
<description>&lt;p&gt;
	A lot of popular online discussion platforms (phpbb, vbullitan, invision powerboard, etc.) generate a lot of different kinds of URLs for the same discussion threads and digital assets and do a lot of strange things with links for pagination and such. You can eaisly get stuck in a range of &lt;a href=&quot;https://webarchive.jira.com/wiki/display/ARIH/How+to+Identify+and+Avoid+Crawler+Traps&quot; rel=&quot;nofollow&quot;&gt;crawler traps&lt;/a&gt;. What are some good tactics to use when trying to scope crawling online discussion forms to archve them? So, going into scoping and planning to archive a discussion forum what kinds of ideas and tactics should one be thinking about/considering?&lt;/p&gt;</description>
<guid isPermaLink="true">https://qanda.digipres.org/74/should-you-scope-crawl-archiving-online-discussion-forums</guid>
<pubDate>Thu, 05 Jun 2014 13:35:47 +0000</pubDate>
</item>
<item>
<title>Is digitisation on topic?</title>
<link>https://qanda.digipres.org/5/is-digitisation-on-topic</link>
<description>&lt;p&gt;
	There have already been a couple of questions focused on digitisation rather than digital preservation. I would assume that these are actually off topic and should be closed? The boundary is of course a little fuzzy.&lt;/p&gt;
&lt;p&gt;
	I would propose that this community should be involved in decisions related to the results of a digitisation initiative. For example, the file formats and metadata schemas used, where/how the results are stored, and so on. However, questions focused on how to engage in digitisation, what types of scanners should be used or resolution to digitise at, will be off topic.&lt;/p&gt;
&lt;p&gt;
	&lt;a href=&quot;http://anjackson.github.io/zombse/032013%20Digital%20Preservation/static/questions/1.html&quot; rel=&quot;nofollow&quot;&gt;This question&lt;/a&gt; is about OCR, although is phrased in the title as a more general digitisation question. Is this in scope? Its a tough call.&lt;/p&gt;
&lt;p&gt;
	As many of us in the digitisation community are well aware, confusing digital preservation for digitisation is a common mistake. I'd suggest adding some clear scoping detail on this to the &quot;What kind of questions should I not ask here?&quot; section of the FAQ. Examples may well be necessary to keep things clear.&lt;/p&gt;
&lt;p&gt;
	Paul Wheatley&lt;/p&gt;
&lt;p&gt;
	(from &lt;a href=&quot;http://anjackson.github.io/zombse/032013%20Digital%20Preservation%20Meta/static/questions/3.html&quot; rel=&quot;nofollow&quot;&gt;ZDPSE&lt;/a&gt;)&lt;/p&gt;</description>
<guid isPermaLink="true">https://qanda.digipres.org/5/is-digitisation-on-topic</guid>
<pubDate>Mon, 02 Dec 2013 13:24:06 +0000</pubDate>
</item>
</channel>
</rss>