<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=ISO-8859-1">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<br>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style><head>
<style></style>
<table class="backgroundTable" width="100%" bgcolor="#ffffff"
cellpadding="0" cellspacing="0">
<tbody>
<tr>
<td valign="top" align="left">
<table cellpadding="0" cellspacing="0">
<tbody>
<tr>
<td style="border-top: 0px solid rgb(0, 0, 0);
border-bottom: 1px solid rgb(255, 255, 255);
background-color: rgb(255, 255, 255); text-align:
center;" align="center"><span style="font-size:
10px; color: rgb(96, 96, 96); line-height: 200%;
font-family: verdana; text-decoration: none;">Email
not
displaying correctly? <a
href="http://www.oplin.org/4cast/"
style="font-size: 10px; color: rgb(0, 0, 255);
line-height: 200%; font-family: verdana;
text-decoration: none;">View
it in your browser.</a></span></td>
</tr>
<tr>
<td style="border-top: 0px solid rgb(51, 51, 51);
border-bottom: 0px solid rgb(255, 255, 255);
background-color: rgb(255, 255, 255);">
<center><a href=""><img id="editableImg1"
src="cid:part2.08010601.06010108@oplin.org"
title="OPLIN" alt="OPLIN 4Cast" border="0"
align="middle"></a></center>
</td>
</tr>
</tbody>
</table>
<table style="width: 763px; height: 877px;"
bgcolor="#ffffff" cellpadding="20" cellspacing="0">
<tbody>
<tr>
<td style="font-size: 12px; color: rgb(0, 0, 0);
line-height: 150%; font-family: trebuchet ms;"
valign="top"
background="http://www.oplin.org/4cast/wp-content/themes/4cast/images/kubrickbgwide.jpg"
bgcolor="#ffffff">
<p><!-- Make sure you modify the 4Cast title in this section -->
<span style="font-size: 20px; font-weight: bold;
color: rgb(0, 0, 0); font-family: arial;
line-height: 110%;">OPLIN 4Cast #301: Tackling
the big jobs</span><br>
<!-- Make sure you modify the date of the 4Cast in this section -->
<span style="font-size: 11px; font-weight: normal;
color: rgb(102, 102, 102); font-style: italic;
font-family: arial;">September 26th, 2012</span></p>
<!-- Begin copy of Web Source here -->
<p style="text-align: justify;font-size: 16px;
font-family: arial; line-height: 110%;"><img
class="alignleft size-full wp-image-3095"
title="lateral file cabinet"
src="cid:part4.08090806.06010609@oplin.org"
alt="" width="124" height="118">In this day and
age of user-generated Internet content - Web 2.0,
if you want to call it that - there are astounding
amounts of information being generated in very
short time spans. It has been <a
href="http://blogs.loc.gov/digitalpreservation/2012/05/a-vision-of-the-role-and-future-of-web-archives-the-web-archive-in-todays-world/">pointed
out</a>, for example, that every 24 hours there
are more than twice as many words posted to
Twitter as were printed in the entire New York
Times over the last 60 years. If you are an
archivist, the enormity of the task of saving even
a little of this Internet material for future
research seems overwhelming, and when you factor
in other non-print information it seems even
worse. Yet a few organizations, most notably the
Internet Archive and the Library of Congress in
the United States, have tackled portions of the
job. Below are some recent news stories about the
problem and their latest efforts to capture and
provide effective access to huge amounts of
information that might otherwise be beyond the
reach of many.
</p>
<div> </div>
<ul style="text-align: left;">
<li style="text-align: justify; font-size: 16px;
font-family: arial; line-height: 110%;"><a
href="http://www.businessweek.com/articles/2012-09-20/the-disappearing-web-decay-is-eating-our-history">The
disappearing web: Decay is eating our history</a>
(Businessweek/Mathew Ingram) "They took a number
of recent major news events over the past three
years-including the Egyptian revolution, Michael
Jackson's death, the elections and related
protests in Iran, and the outbreak of the H1N1
virus-and tracked the links that were shared on
Twitter about each. Following the links to their
ultimate source showed that an alarming number
of them had simply vanished. In fact, the
researchers said that within a year of these
events, an average of 11 percent of the material
that was linked to had disappeared completely
(and another 20 percent had been archived), and
after two-and-a-half years, close to 30 percent
had been lost altogether and 41 percent had been
archived."</li>
<li style="text-align: justify; font-size: 16px;
font-family: arial; line-height: 110%;"><a
href="http://blog.archive.org/2012/09/17/launch-of-tv-news-search-borrow-with-350000-broadcasts/">Launch
of TV news search & borrow with 350,000
broadcasts</a> (Internet Archive Blog/Brewster
Kahle) "Like library collections of books and
newspapers, this accessible archive of TV news
enables anyone to reference and compare
statements from this influential medium. The
collection now contains 350,000 news programs
collected over 3 years from national U.S.
networks and stations in San Francisco and
Washington D.C. The archive is updated with new
broadcasts 24 hours after they are aired. Older
materials are also being added."</li>
<li style="text-align: justify; font-size: 16px;
font-family: arial; line-height: 110%;"><a
href="http://blogs.loc.gov/loc/2012/09/congress-gov-unveiled-today/">Congress.gov
unveiled today</a> (Library of Congress
Blog/Erin Allen) "The Congress.gov site includes
bill status and summary, bill text and member
profiles and other new features like
comprehensive searching across bill text,
summary and statuses; persistent URLs for search
results; Members' legislative history and
biographical profiles; and maintenance of
existing features such as links to video of the
House and Senate floor, top searched bills and
the save/share feature."</li>
<li style="text-align: justify; font-size: 16px;
font-family: arial; line-height: 110%;"><a
href="http://www.buzzfeed.com/jwherrman/so-is-the-library-of-congress-still-archiving-twi">So,
is the Library of Congress still archiving
Twitter?</a> (BuzzFeed/John Herrman) "Serving
up billions upon billions of tweets in even the
most basic way is a hard job for a technology
company, much less for a government agency whose
<a
href="http://www.loc.gov/about/reports/budget/fy2013.pdf">requested
budget</a> [pdf] for 'Digital Initiatives' in
2013 - all of them, including web archiving,
historic newspapers, the online American history
archive, the veteran's history project, early
sound recordings - is under $50m, and actually <em>lower</em>
than it was in 2011."</li>
</ul>
<div style="text-align: left;"> </div>
<p style="text-align: left; font-size: 20px;
font-family: arial; line-height: 110%;"><small><strong><em>Tweet
fact:</em></strong></small><br>
</p>
<div style="text-align: justify; font-size: 16px;
font-family: arial; line-height: 110%;">When the
Library of Congress announced in April 2010 that
it was going to archive Twitter, there were 50
million tweets a day. Now there are 400 million a
day.
</div>
<div style="text-align: left;"> </div>
<!-- End paste of web source here --> </td>
</tr>
<tr>
<td style="" solid="" background-color:="" rgb(255,=""
255,="" 255);="" >="" valign="top" width="760"
background="http://www.oplin.org/4cast/wp-content/themes/4cast/images/kubrickbgwide.jpg"><span
style="font-size: 10px; color: rgb(96, 96, 96);
line-height: 100%; font-family: verdana;">
<hr><!-- Begin standard subscription verbiage -->
<div style="text-align: justify;">The <strong><em>OPLIN
4cast</em></strong>
is a weekly compilation of
recent headlines, topics, and trends that could
impact public
libraries. You can subscribe to it in a variety
of ways, such as: <br>
</div>
<div style="text-align: left;"> </div>
<ul>
<li style="text-align: justify;"><strong>RSS
feed.</strong>
You
can receive the OPLIN 4cast
via RSS feed by subscribing to the following
URL:
<a class="moz-txt-link-freetext" href="http://www.oplin.org/4cast/index.php/?feed=rss2">http://www.oplin.org/4cast/index.php/?feed=rss2</a>.
</li>
<li style="text-align: justify;"><strong>Live
Bookmark.</strong>
If you're using the Firefox
web browser, you can go to the 4cast website
(<a class="moz-txt-link-freetext" href="http://www.oplin.org/4cast/">http://www.oplin.org/4cast/</a>) and click on the
orange "radio wave" icon
on the right side of the address bar. In
Internet Explorer 7, click on
the same icon to view or subscribe to the
4cast RSS feed. </li>
<li style="text-align: justify;"><strong>E-mail.</strong>
You
can have the OPLIN 4cast
delivered via e-mail (a'la OPLINlist and
OPLINtech) by subscribing to
the 4cast mailing list at
<a class="moz-txt-link-freetext" href="http://mail.oplin.org/mailman/listinfo/OPLIN4cast">http://mail.oplin.org/mailman/listinfo/OPLIN4cast</a>.
</li>
</ul>
</span> </td>
</tr>
<tr>
<td style="border-top: 0px solid rgb(255, 255, 255);
background-color: rgb(255, 255, 204);" valign="top"
width="760"
background="http://www.oplin.org/4cast/wp-content/themes/4cast/images/kubrickfooter.jpg">
<br>
</td>
</tr>
</tbody>
</table>
</td>
</tr>
</tbody>
</table>
<title>OPLIN 4Cast</title>
<style>
.headerTop { background-color:#FFFFFF; border-top:0px solid #000000; border-bottom:1px solid #FFFFFF; text-align:center; }
.adminText { font-size:16px; color:#0000FF; line-height:200%; font-family:verdana; text-decoration:none; }
.headerBar { background-color:#FFFFFF; border-top:0px solid #333333; border-bottom:0px solid #FFFFFF; }
.title { font-size:20px; font-weight:bold; color:#000000; font-family:arial; line-height:110%; }
.subTitle { font-size:11px; font-weight:normal; color:#000000; font-style:italic; font-family:arial; }
.defaultText { font-size:12px; color:#000000; line-height:150%; font-family:trebuchet ms; }
.footerRow { background-color:#FFFFCC; border-top:0px solid #FFFFFF; }
.footerText { font-size:10px; color:#996600; line-height:100%; font-family:verdana; }
a { color:#0000FF; color:#0000FF; color:#0000FF; }
</style>
</body>
</html>