<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CAIRSS &#187; harvesting</title>
	<atom:link href="http://cairss.caul.edu.au/blog/category/harvesting/feed/" rel="self" type="application/rss+xml" />
	<link>http://cairss.caul.edu.au/blog</link>
	<description>The primary function of CAIRSS is to offer support for Repository Managers in the higher education sector in Australia.</description>
	<lastBuildDate>Wed, 21 Dec 2011 01:55:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>Getting into Google, Google Scholar and other search engines</title>
		<link>http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/</link>
		<comments>http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/#comments</comments>
		<pubDate>Tue, 24 Aug 2010 23:46:58 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[harvesting]]></category>
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/</guid>
		<description><![CDATA[Quick start Getting into Google et al. The basics Why be indexed? Strategies for being indexed Getting into Google Scholar Troubleshooting Not being indexed? Not seeing traffic referred from search engines? The same thing is being indexed multiple times? Other resources [This is a draft of a document for the CAIRSS website – we are [...]]]></description>
			<content:encoded><![CDATA[<div>
<div class="page-toc">
<ul>
<li><a href="#id2"><span>Quick start</span></a></li>
<li><a href="#id5"><span>Getting into Google et al.</span></a>
<ul>
<li><a href="#id6"><span>The basics</span></a></li>
<li><a href="#id7"><span>Why be indexed?</span></a></li>
<li><a href="#id8"><span>Strategies for being indexed</span></a></li>
</ul>
</li>
<li><a href="#id9"><span>Getting into Google Scholar</span></a></li>
<li><a href="#id10"><span>Troubleshooting</span></a>
<ul>
<li><a href="#id11"><span>Not being indexed?</span></a></li>
<li><a href="#id14"><span>Not seeing traffic referred from search engines?</span></a></li>
<li><a href="#id15"><span>The same thing is being indexed multiple times?</span></a></li>
<li><a href="#id16"><span>Other resources</span></a></li>
</ul>
</li>
</ul>
</div>
<div>
<p>[This is a draft of a document for the CAIRSS website <span class="spCh spChx2013">–</span> we are putting it here on the blog so people can give feedback before it goes live. Please comment below if you have a question that is not answered, a suggestion or if you have spotted a typo.]</p>
<h1 id="232_quick-start_1" ><a name="id2"></a>Quick start</h1>
<p>If you want to get started straight away, we offer this summary of three things to do:</p>
<ol class="lin" style="list-style: decimal;">
<li><strong>Get into Google Scholar</strong>. The single best starting point for getting good exposure is to <a href="http://scholar.google.com.au/intl/en/scholar/inclusion.html"><span class="Internet_20_link">follow Google&#8217;s instructions for how to be indexed in Google Scholar</span></a> and then contact them using <a href="http://www.google.com/support/scholar/bin/request.py"><span class="Internet_20_link">this form</span></a>. For more discussion on this see below in the section on <a href="https://cairss.caul.edu.au/rep.CAIRSS/www/search-engines/how-to-get-indexed.htm#scholar"><span class="Internet_20_link">Getting into Google Scholar</span></a>.</li>
<li><strong>Get harvested by as many aggregation sites possible</strong>. To do this, follow the <a href="../harvesting/harvesting.htm"><span class="Internet_20_link">CAIRSS harvesting guide</span></a>.</li>
<li><strong>Build a good site</strong>. Don&#8217;t spend time working on search engine optimization<span class="Footnote_20_anchor"><span class="footnote" style="vertical-align: super;"><a name="ftn1-text"></a></span></span>, spend time on repository usability and promoting the repository to your target audience so that people cite your research outputs and link to them.
<ul class="lib">
<li>A site with incoming links will be indexed by Google et al. without the repository manager doing anything, as long as you don&#8217;t ban web crawlers from indexing your site.</li>
<li>Google Scholar is more particular about what it indexes, hence the advice in point one. Follow their guide.</li>
<li>Make sure your repository has clear browse-paths, eg by Author, Subject or Title, so users can discover content without searching. This helps the search engines&#8217; robot harvesters to find and index your content.  Frequent updates will generally increase the frequency with which search engines index your site; frequent additions of fresh content may influence search engine rankings, but this is not clear.</li>
</ul>
</li>
</ol>
<h1 id="232_getting-into-google-_1" ><a name="id5"></a>Getting into Google et al.</h1>
<h2 id="232_the-basics_1" ><a name="id6"></a>The basics</h2>
<p>Since the early days of the web, there have been services which provide full-text search indexes of the web. These indexes are constructed by web-crawling software which essentially browses the web a page at a time and indexes its textual content. Broadly speaking, the crawlers follow links, and cannot discover pages that are not linked from somewhere.</p>
<p>Google is famous for improving search results, about ten years ago using the <a href="http://en.wikipedia.org/wiki/PageRank"><span class="Internet_20_link">PageRank</span></a><span class="spCh spChx2122">™</span> algorithm which took into account how many links pointed to the pages it was indexing to help order search results. The indexing processes used by search engines today are trade secrets and differ from search engine to search engine, so trying to optimise your exposure by worrying about them is something that would be very resource intensive, not to mention risky, as attempts to &#8216;game&#8217; the search engines typically result in your site being dropped from the index.</p>
<h2 id="232_why-be-indexed_1" ><a name="id7"></a>Why be indexed?</h2>
<p>Most repositories are designed to promote the research output of a repository by making it discoverable. The main source for discovery, by far, is search engines. Even though the majority of traffic to your repository may not be coming from a search engine, chances are a lot of the traffic is from links and citations where the <strong>original</strong> discovery was via a search engine.</p>
<p>A recent very informal survey of CAIRSS  sites showed that typically around 40% of traffic is coming from search engines and almost all of that search engine traffic is from Google<span class="Footnote_20_anchor"><span class="footnote" style="vertical-align: super;"><a name="ftn2-text"></a></span></span>.</p>
<p>Most CAIRSS respondents to the informal survey reported that aggregating sites such as Australian Research Online (<a href="http://research.nla.gov.au/"><span class="Internet_20_link">ARO</span></a>) and <a href="http://trove.nla.gov.au/"><span class="Internet_20_link">Trove</span></a> account for only a few percent of overall traffic, but it is still important to be represented in such sites for a couple of reasons:</p>
<ul class="lib">
<li>Their incoming links almost certainly raise the status of your repository with the search engines.</li>
<li>An 2008 survey of visitors to the ARO site showed that users were mainly researchers and students seeking information on a specific topic. These are the kinds of searches which will likely result in increased downloads and citations.</li>
</ul>
<p>It is very difficult to make direct comparisons between sites as there is a huge amount of variation in the way statistics are collected. CAIRSS is working on a guide to repository stats.</p>
<h2 id="232_strategies-for-being_1" ><a name="id8"></a>Strategies for being indexed</h2>
<ul class="lib">
<li>Make sure the repository is harvested by discovery services. See <a href="http://cairss.caul.edu.au/www/harvesting/service_providers.htm"><span class="Internet_20_link">this CAIRSS page listing discovery services</span></a>. As noted above, these are typically not currently major sources of traffic but they do contribute to the rank of your site. And <a href="http://trove.nla.gov.au/"><span class="Internet_20_link">Trove</span></a> in particular is experiencing very rapid growth, so we may see increases in traffic from there as users searching for Australian content of all kinds discover things from institutional repositories alongside the many other kinds of content indexed in Trove.</li>
<li>Follow the advice set out in the <a href="http://scholar.google.com.au/intl/en/scholar/inclusion.html"><span class="Internet_20_link">Google Scholar Inclusion Guidelines for Webmasters</span></a>. All of that advice is relevant to making your site visible in Google et al. as well as Google Scholar, with the exception of the <em>Indexing Guidelines</em> section.</li>
</ul>
<h1 id="232_getting-into-google-_2" ><a name="id9"></a><a name="Scholar"></a>Getting into Google Scholar</h1>
<p>Google Scholar is very important as it is the largest open index of scholarship accessible to most readers. It does a good job of finding multiple versions of scholarly articles and theses, including various publisher and database sites as well as open access versions and providing interfaces that make it easy for users to download articles into reference management software like Zotero, EndNote and Mendeley (to name a few). Getting Open Access versions of articles into Google Scholar is a key way to further the OA agenda.</p>
<p>CAIRSS and the Trove team from the National Library of Australia have been in discussion with the Google Scholar team. Google Scholar is interested in:</p>
<ul class="lib">
<li>Full text, by which they mean HTML and PDF versions of scholarly material.
<p>If your repository does not have a lot of full text it will not be well-indexed by Google Scholar.</li>
<li>Material that that they won&#8217;t find elsewhere like fulltext thesis content.</li>
</ul>
<p>Google Scholar is not set up to index discovery sites like the NLA&#8217;s Trove, it is designed to work on repositories which contain the full text of articles.  So even if the NLA had enough metadata available to them, as supplied over OAI-PMH in Dublin Core then they could not make an effective Google Scholar web site. That said, being indexed by Trove is important as it will make your repository more visible to search engines, and increase traffic (if only by a little bit).</p>
<p>The Google Scholar documentation is very clear that the full-text download for an item must be &#8216;in a subdirectory&#8217; that is, the URL must be &#8216;under&#8217; the metadata page. To use a real example, this is the metadata page for an item in the USQ repository:</p>
<blockquote class="bq"><p><a href="http://eprints.usq.edu.au/3838/"><span class="Internet_20_link"><span style="font-style: normal;"><span class="T3">http://eprints.usq.edu.au/3838/</span></span></span></a></p></blockquote>
<p>And the full text is in a &#8216;subdirectory&#8217; (actually there are not really directories involved):</p>
<blockquote class="bq"><p><a href="http://eprints.usq.edu.au/3838/1/BuildingInstitutionalInfrastructureInRegionalAustralia.pdf"><span class="Internet_20_link">http://eprints.usq.edu.au/3838/1/BuildingInstitutionalInfrastructureInRegionalAustralia.pdf</span></a></p></blockquote>
<p>This should not, in general be a problem for your repository as most are set up so that data streams are referenced in this way but if, for example, you have made very zealous use of handles and try to reference a datastream via a handle or DOI then Google Scholar may not index it. This could be a problem also for some of non-mainstream proprietary repository software some of which appears to make use of indirect links. Contact CAIRSS if you have a problem.</p>
<p>The best way to be indexed in Google Scholar is to add metadata to your summary pages. This is covered in <span class="spCh spChx201c">“</span>Indexing Guidelines<span class="spCh spChx201d">”</span> in <a href="http://scholar.google.com.au/intl/en/scholar/inclusion.html"><span class="Internet_20_link">their guide</span></a>. There are a variety of formats that can work but one in particular is recommended. This is supported &#8216;out of the box&#8217; by EPrints <span class="spCh spChx2013">–</span> for other sites CAIRSS can assist in networking you with sites using your software.</p>
<p>CAIRSS sites: if you have configured your repository or want help then please contact CAIRSS and we will compile a technical guide using information from the community on how to set up the metadata for various software.</p>
<p>We are not going to reproduce Google&#8217;s very clear advice here, and recommend that CAIRSS sites<a href="http://scholar.google.com.au/intl/en/scholar/inclusion.html"><span class="Internet_20_link"> follow their guide</span></a> and then contact them using <a href="http://www.google.com/support/scholar/bin/request.py"><span class="Internet_20_link">this form</span></a>.</p>
<h1 id="232_troubleshooting_1" ><a name="id10"></a>Troubleshooting</h1>
<h2 id="232_not-being-indexed_1" ><a name="id11"></a>Not being indexed?</h2>
<p>The number one reason you might not be able to find content in Google et al. is that your robots.txt file prevents access to their crawlers. This is worth checking. For example CAIRSS discovered that one of the commercial software vendors has recently shipped updates to their software which by default blocks access to web crawlers.</p>
<p>Other common reasons for a lack of indexed pages would be:</p>
<ul class="lib">
<li>If your site has had extended periods of down-time search engines will eventually start dropping your content from the index.</li>
<li>If your site has changed its URLs without putting in place redirects so that the old URLs map to new ones. This is a basic governance process which needs to be in place for all of an institutions websites, but is particularly important for repositories. If you change domain names, or software, or upgrade some software packages then you may need to add redirect rules which permanently redirect browser-clients to the new home of URL.
<p>Note that using Handles does not automatically or magically fix this problem for two main reasons:</p>
<ul class="lib">
<li>The Handles database needs to be updated with new URLs, something which we have found that even commercial vendors do not do when upgrading a repository.</li>
<li>Many incoming links to the repository will use the &#8216;ordinary&#8217; URL for full-text items and metadata pages, if these links break will undermine the integrity of your repository and its usefulness in exposing research via increased readership and citation rates.</li>
</ul>
</li>
</ul>
<h2 id="232_not-seeing-traffic-r_1" ><a name="id14"></a>Not seeing traffic referred from search engines?</h2>
<p>This is not necessarily a question to do with search engines, but as noted above, we have found that some repository software uses a lot of redirects. The URL a user sees and clicks on  may be to one university site, which then redirects to the repository. The stats module in the repository may then report that all the traffic is coming from university referrers, even though the original referring site has about 40% chance of being a search engine.</p>
<h2 id="232_the-same-thing-is-be_1" ><a name="id15"></a>The same thing is being indexed multiple times?</h2>
<p>If the same content appears in your site on more than one URL then that can dilute the impact of the content in a search index. This can happen when the repository adds session information to its URLs or when multiple views of the same content can be set up. The solution is to add some metadata all the pages showing the same content to say which version is the canonical one. See <a href="http://googlewebmastercentral.blogspot.com/2009/10/reunifying-duplicate-content-on-your.html"><span class="Internet_20_link">this guide</span></a>, which works for multiple search engines.</p>
<h2 id="232_other-resources_1" ><a name="id16"></a>Other resources</h2>
<p>We will not attempt to provide a complete guide here, and do not have the resources to keep it up to date. For general indexing /search issues try <a href="http://sites.google.com/site/webmasterhelpforum/en/faq--crawling--indexing---ranking"><span class="Internet_20_link">Google&#8217;s web master help forum</span></a>.</p>
<p>This document was compiled by Peter Sefton with input from the CAIRSS team and the CAIRSS community.</p>
<p class="center">Copyright USQ, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. &lt;<a onclick="javascript:window.open(&quot;http://creativecommons.org/licenses/by-sa/2.5/au/&quot;);return false;" href="http://creativecommons.org/licenses/by-sa/2.5/au/"><span class="Internet_20_link">http://creativecommons.org/licenses/by-sa/2.5/au/</span></a>&gt;</p>
<p class="center" style="margin-bottom: 0.1654in; margin-top: 0in;"><span class="Default_20_Paragraph_20_Font"><span style="country: US; language: en;"><span class="T4"><a name="graphics1"></a><img class="fr1" style="border: 0px; vertical-align: top;" src="http://cairss.caul.edu.au/blog/wp-content/uploads/2010/08/m40ca94ba.png" alt="graphics1" width="88" height="31" /></span></span></span></p>
<hr />
<div style="font-size: .9em;"><span class="footnote-defined"><a name="ftn1"></a>For a blunt exposition on why <span class="spCh spChx201c">“</span>Search Engine Optimization<span class="spCh spChx201d">”</span> is not helpful see <a href="http://powazek.com/posts/2090"><span class="Internet_20_link">Spammers, Evildoers and Opportunists</span></a>, by Derek Powazek.</span></div>
<div style="font-size: .9em;"><span class="footnote-defined"><a name="ftn2"></a> There are lots of complexities here, for example many ARROW/VITAL sites banned crawlers to make previous versions of the software more stable, meaning they have small footprints in Google, and some sites have URLs which are re-directed a couple of times meaning the repository statistics module sees only referring traffic from the same institution.</span></div>
</div>
</div>
<h3 class="bsuite_related_bypageviews">People who looked at this item also looked at&#8230;</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2010/08/30/cairss-weekly-update-2010-08-27/'>CAIRSS Weekly Update 2010-08-27</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/08/05/i-only-have-uris-for-you-vicki-and-peters-adventures-in-linked-data-land/'>I only have URI&#8217;s for you: Vicki and Peter&#8217;s adventures in linked-data land</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/11/04/cairss-weekly-update-2010-11-04/'>CAIRSS Weekly Update 2010-11-04</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/06/23/two-ands-events-and-what-they-mean-for-cairss/'>Two ANDS events and what they mean for CAIRSS</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/'>How to export contents from an Institutional Repository to a Spreadsheet</a></li>
</ul>
<h3 class="bsuite_related">Related items</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2009/10/12/harvesting-from%c2%a0flickr/'>Harvesting from Flickr</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Harvesting from Flickr</title>
		<link>http://cairss.caul.edu.au/blog/2009/10/12/harvesting-from%c2%a0flickr/</link>
		<comments>http://cairss.caul.edu.au/blog/2009/10/12/harvesting-from%c2%a0flickr/#comments</comments>
		<pubDate>Mon, 12 Oct 2009 05:06:51 +0000</pubDate>
		<dc:creator>caulcairss</dc:creator>
				<category><![CDATA[flickr]]></category>
		<category><![CDATA[harvesting]]></category>

		<guid isPermaLink="false">http://caulcairss.wordpress.com/2009/10/12/harvesting-from%c2%a0flickr/</guid>
		<description><![CDATA[Basic features  Registering Uploading content Editing content Organizing content Harvesting content Flickr API      Flickr is a web suite capable of hosting still images and videos. CAIRSS has received questions from the community regarding harvesting from this suite in particular.  There are other types of photography management software available such as Picasa, F-Spot and DigiKam. However these are out of the scope of [...]]]></description>
			<content:encoded><![CDATA[<div>
<div class="page-toc">
<ul>
<li><a href="#id1">Registering</a></li>
<li><a href="#id2">Uploading content</a></li>
<li><a href="#id4">Editing content</a></li>
<li><a href="#id6">Organizing content</a></li>
<li><a href="#id8">Harvesting content</a></li>
<li><a href="#id10">Flickr API</a></li>
</ul>
</div>
<div>
<p>Flickr is a web suite capable of hosting still images and videos. CAIRSS has received questions from the community regarding harvesting from this suite in particular.</p>
<p>There are other types of photography management software available such as <a href="http://picasa.google.com.au/">Picasa</a>, <a href="http://f-spot.org/Main_Page">F-Spot</a> and <a href="http://www.digikam.org/">DigiKam</a>. However these are out of the scope of this blog post.</p>
<h1><a id="id1" name="id1"></a>Registering</h1>
<p>To get started with <a href="http://www.flickr.com/">Flickr</a> you have to <a href="https://na.edit.yahoo.com/registration?intl=us&amp;done=https://login.yahoo.com/config/validate?.src=flickr&amp;.pc=5134&amp;.scrumb=0&amp;.pd=c%3DE0.GahOp2e4MjkX.5l2HgAoLkpmyPvccpVM-&amp;.intl=us&amp;.done=http%3A%2F%2Fwww.flickr.com%2Fsignin%2Fyahoo%2F&amp;src=flickr&amp;last=&amp;partner=yahoo_default&amp;domain=&amp;yahooid=">register for a Yahoo account</a>. Once registered you are able to create a Flickr account and customize your profile.</p>
<h1><a id="id2" name="id2"></a>Uploading content</h1>
<p>There are quite a few ways to upload content to Flickr, examples of these include:</p>
<dl>
<dt><a href="http://www.flickr.com/tools/uploadr/">Flickr Uploadr</a></dt>
<dd>A client application available for Windows and Mac. This software is an official Flickr tool and allows the user to add titles, tags, descriptions and sets.</dd>
<dt><a href="http://www.apple.com/ilife/iphoto/">iPhoto</a> and <a href="http://www.apple.com/aperture/">Aperture</a></dt>
<dd>iPhoto and Aperture are client applications for the Mac, features include uploading, editing and organizing images. </dd>
<dt><a href="http://www.flickr.com/help/photos/">Email</a></dt>
<dd>Flickr is capable of accepting content from a users computer or mobile device via email.</dd>
</dl>
<h1><a id="id4" name="id4"></a>Editing content</h1>
<dl>
<dt><a href="http://www.picnik.com/app#/home/welcome">Picnik</a> </dt>
<dd>Picnik is an online photo editing and official Flickr partner.</dd>
</dl>
<h1><a id="id6" name="id6"></a>Organizing content</h1>
<dl>
<dt><a href="http://www.flickr.com/tools/organizr.gne">Organizr</a></dt>
<dd>Organizr, an official Flickr tool, is an online photo organizer and editor used to search and browse content as well as organize content into collections and sets.</dd>
</dl>
<h1><a id="id8" name="id8"></a>Harvesting content</h1>
<p>There is a growing number of third party applications (not official flickr applications) available for harvesting content from Flickr. Examples of these include:</p>
<dl>
<dt><a href="http://www.flishr.com/">flishr</a></dt>
<dd>Upload, Download and search tool for Windows (requires .NET framework).</dd>
<dt><a href="http://code.google.com/p/dfo/">Dfo</a></dt>
<dd>Desktop Flickr Organizer for Gnome. Preferably installed in Ubuntu Linux this application allows online and offline editing including adding, removing and editing of photos, tags, sets and comments.</dd>
<dt><a href="http://greggman.com/pages/flickrdown.htm">Flickrdown</a></dt>
<dd>Windows application that allows downloading of photos in bulk using sets, requires .NET framework.</dd>
<dt><a href="http://sourceforge.net/projects/flickrexport/">flickrexport</a></dt>
<dd>A plugin capable of directly exporting flickr images via iPhoto or Aperture.</dd>
<dt><a href="http://code.google.com/p/onairbustour/wiki/flump">Flump</a></dt>
<dd>An Adobe Air application available for Windows, Mac and Linux.</dd>
</dl>
<h1><a id="id10" name="id10"></a>Flickr API</h1>
<p>Flickr has provided an <a href="http://www.flickr.com/services/api/">API for non-commercial use</a> by outside developers.</p>
<p>Interacting with the Flickr API can be done using most of the common programming languages such as Java, .NET, PHP, Python, Ruby and Perl.</p>
<p>If there is sufficient interest from the <a href="http://cairss.caul.edu.au/www/index.htm">CAIRSS</a> community, CAIRSS Central can move to  investigate creating a customized application using Java, PHP etc or even a command line script using Curl, Python or Perl.</p>
<p>If anybody has successfully harvested from Flickr and would like to contribute their information to the CAIRSS community or if you are a member of the CAIRSS community and would like to request assistance with this topic please contact <a href="mailto:cairss-technical@caul.edu.au">cairss-technical@caul.edu.au</a></p>
</div>
</div>
<h3 class="bsuite_related_bypageviews">People who looked at this item also looked at&#8230;</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/'>Getting into Google, Google Scholar and other search engines</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/08/05/i-only-have-uris-for-you-vicki-and-peters-adventures-in-linked-data-land/'>I only have URI&#8217;s for you: Vicki and Peter&#8217;s adventures in linked-data land</a></li>
</ul>
<h3 class="bsuite_related">Related items</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/'>Getting into Google, Google Scholar and other search engines</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cairss.caul.edu.au/blog/2009/10/12/harvesting-from%c2%a0flickr/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

