<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CAIRSS &#187; Equella</title>
	<atom:link href="http://cairss.caul.edu.au/blog/category/equella/feed/" rel="self" type="application/rss+xml" />
	<link>http://cairss.caul.edu.au/blog</link>
	<description>The primary function of CAIRSS is to offer support for Repository Managers in the higher education sector in Australia.</description>
	<lastBuildDate>Wed, 21 Dec 2011 01:55:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>How to export contents from an Institutional Repository to a Spreadsheet</title>
		<link>http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/</link>
		<comments>http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/#comments</comments>
		<pubDate>Wed, 20 Jan 2010 05:51:25 +0000</pubDate>
		<dc:creator>tpmccallum</dc:creator>
				<category><![CDATA[Digital Commons]]></category>
		<category><![CDATA[DigiTool]]></category>
		<category><![CDATA[DSpace]]></category>
		<category><![CDATA[EPrints]]></category>
		<category><![CDATA[Equella]]></category>
		<category><![CDATA[ERA]]></category>
		<category><![CDATA[Fedora]]></category>
		<category><![CDATA[Fez]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[OAI-PMH]]></category>
		<category><![CDATA[SOLR]]></category>

		<guid isPermaLink="false">http://caulcairss.wordpress.com/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/</guid>
		<description><![CDATA[The idea The tools The solution Extracting the data Converting the data Reporting the data Screen Shots Using the Flexible Repository Export Reporter (FoREveR) The idea A short time ago CAIRSS was approached by a Repository Manager from within the CAIRSS community to assist with exporting the contents of their repository to a spreadsheet. It [...]]]></description>
			<content:encoded><![CDATA[<div>
<div class="page-toc">
<ul>
<li><a href="#id2">The idea</a></li>
<li><a href="#id3">The tools</a></li>
<li><a href="#id4">The solution</a>
<ul>
<li><a href="#id5">Extracting the data</a></li>
<li><a href="#id6">Converting the data</a></li>
<li><a href="#id9">Reporting the data</a></li>
<li><a href="#id10">Screen Shots</a></li>
</ul>
</li>
<li><a href="#id14">Using the Flexible Repository Export Reporter (FoREveR)</a></li>
</ul>
</div>
<div>
<h1><a id="id2" name="id2"></a>The idea</h1>
<p>A short time ago <a href="http://cairss.caul.edu.au/www/index.htm">CAIRSS</a> was approached by a Repository Manager from within the CAIRSS community to assist with exporting the contents of their repository to a spreadsheet. It was made apparent that accomplishing this task would greatly assist with Institutional Repository management tasks and most importantly ERA related work.</p>
<h1><a id="id3" name="id3"></a>The tools</h1>
<p>There are many ways that data can be extracted, moved and converted. The wisest choice is to  use tools that are interoperable. An example of this would be choosing <a href="http://www.openarchives.org/pmh/">OAI-PMH</a> to extract data rather than attempting to communicate with an individual Repositories data storage device or database etc.</p>
<h1><a id="id4" name="id4"></a>The solution</h1>
<p>Our CAIRSS Technical Officer Tim McCallum has completed a solution to address this task in the form of a Java Web Application. FoREveR &#8211; Flexible Repository Export Reporter.</p>
<h2><a id="id5" name="id5"></a>Extracting the data</h2>
<p>The data extraction is carried out using an OAI-PMH harvester. In this instance <a href="http://fascinator.usq.edu.au/">The Fascinator</a> was used to accomplish this task. With regards to recent trends in Institutional Repository development and the use of <a href="http://lucene.apache.org/solr/">SOLR</a> the next step was an easy choice; simply extract the data from The Fascinator using a SOLR query. As an added bonus SOLR is able to supply the data in <a href="http://en.wikipedia.org/wiki/JSON">JSON</a> (JavaScript Object Notation) format.</p>
<h2><a id="id6" name="id6"></a>Converting the data</h2>
<h3><a id="id7" name="id7"></a>Overview</h3>
<p>After testing different methods of converting the data including XSLT and Python some research was done revealing some excellent JSON libraries written in Java. The final choice was Java given the fact that the JSON libraries could meet the requirements for this application and that OAI-PMH, The Fascinator and SOLR were all already written in Java.</p>
<h3><a id="id8" name="id8"></a>Technical</h3>
<p>The JSON data returned is the result of an HTTP request (can be set to fetch all by default). This data is converted to Java Maps and Java ArrayLists for further processing. The application loops through every record that has been returned and creates a Java Set (unique list/master list). This Set is then displayed in the users browser. This is a last minute chance to select or deselect metadata before the final report is written. It is sometimes the case that a metadata field containing a large amount of content is best left out, as this can make the spreadsheet unmanageable from an end users perspective.</p>
<h2><a id="id9" name="id9"></a>Reporting the data</h2>
<p>Once approved the application creates an HTML file with all data saved to a table. The table includes table headings, table rows, table data cells and unordered lists for repeating information. This file can be opened in Microsoft Excel and Open Office spreadsheet applications or viewed in a browser.</p>
<h2><a id="id10" name="id10"></a>Screen Shots</h2>
<h3><a id="id11" name="id11"></a>Optional SOLR Query</h3>
<p><span style="display:block;"><a name="graphics1"></a><img class="fr1" style="border:0;vertical-align:top;" src="http://caulcairss.files.wordpress.com/2010/01/m46abed93_552x193.jpeg" alt="graphics1" width="552" height="193" /></span>Note: It is not necessary to know SOLR query syntax, the application can be set to get everything by default. This may be an area to address with the community and feedback is welcome.</p>
<h3><a id="id12" name="id12"></a>Feedback</h3>
<p><span style="display:block;"><a name="graphics2"></a><img class="fr2" style="border:0;vertical-align:top;" src="http://caulcairss.files.wordpress.com/2010/01/505e3c00_355x528.jpeg" alt="graphics2" width="355" height="528" /></span></p>
<h3><a id="id13" name="id13"></a>Small sample of spreadsheet output</h3>
<p><span style="display:block;"><a name="graphics3"></a><img class="fr1" style="border:0;vertical-align:top;" src="http://caulcairss.files.wordpress.com/2010/01/m366c3f82_552x235.jpeg" alt="graphics3" width="552" height="235" /></span></p>
<h1><a id="id14" name="id14"></a>Using the Flexible Repository Export Reporter (FoREveR)</h1>
<p>As this software is in the very early stages of its life cycle reports can be created by CAIRSS and emailed out to you. Please <a href="http://cairss.caul.edu.au/www/contact_cairss.htm">contact CAIRSS Central</a> if you think that your institution can benefit from the use of this tool.</p>
<p>The source code is available at <a href="http://cairss.caul.edu.au/trac/browser/code/FoREveR">http://cairss.caul.edu.au/trac/browser/code/FoREveR</a> for your interest, however it has not been extensively tested. All feedback is welcome. CAIRSS will endeavor to improve and enhance the software to meet your needs.</p>
</div>
</div>
<h3 class="bsuite_related_bypageviews">People who looked at this item also looked at&#8230;</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2011/03/23/cairss-weekly-update-2011-03-23/'>CAIRSS Weekly Update 2011-03-23</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/09/22/cairss-weekly-update-2010-09-22/'>CAIRSS Weekly Update 2010-09-22</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2011/10/14/cairss-update-2011-10-14/'>CAIRSS Update 2011-10-14</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/08/05/i-only-have-uris-for-you-vicki-and-peters-adventures-in-linked-data-land/'>I only have URI&#8217;s for you: Vicki and Peter&#8217;s adventures in linked-data land</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/'>Getting into Google, Google Scholar and other search engines</a></li>
</ul>
<h3 class="bsuite_related">Related items</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2009/10/13/australian-repository-software-in-use/'>Australian repository software in use</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/03/09/cairss-repository-software-sandbox/'>CAIRSS Repository Software Sandbox</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2009/05/28/open-repositories-conference-09/'>Open Repositories Conference 09</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2009/05/13/what-do-you-get-when-you-combine-dspace-and-fedora-%e2%80%a6-duraspace/'>What do you get when you combine DSpace and Fedora? … DuraSpace</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/04/06/cairss-assistance-in-era-seer-repository-testing-2010/'>CAIRSS assistance in ERA-SEER repository testing 2010</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Australian repository software in use</title>
		<link>http://cairss.caul.edu.au/blog/2009/10/13/australian-repository-software-in-use/</link>
		<comments>http://cairss.caul.edu.au/blog/2009/10/13/australian-repository-software-in-use/#comments</comments>
		<pubDate>Tue, 13 Oct 2009 03:58:41 +0000</pubDate>
		<dc:creator>caulcairss</dc:creator>
				<category><![CDATA[Digital Commons]]></category>
		<category><![CDATA[DigiTool]]></category>
		<category><![CDATA[DSpace]]></category>
		<category><![CDATA[EPrints]]></category>
		<category><![CDATA[Equella]]></category>
		<category><![CDATA[Fedora]]></category>
		<category><![CDATA[Fez]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[VITAL]]></category>

		<guid isPermaLink="false">http://caulcairss.wordpress.com/2009/10/13/australian-repository-software-in-use/</guid>
		<description><![CDATA[CAIRSS now has a current list of Australian university research repositories (see: http://cairss.caul.edu.au/www/repository_software/repository_software.htm). As outlined on this CAIRSS webpage, all 39 Australian universities have a research repository, with seven various repository software options currently in use. CAIRSS will be working in the future to list which version of the software each installation uses. People who [...]]]></description>
			<content:encoded><![CDATA[<div>
<div class="page-toc">
<div>
<p>CAIRSS now has a current list of Australian university research repositories (see: <a href="http://cairss.caul.edu.au/www/repository_software/repository_software.htm"><span class="Internet_20_link">http://cairss.caul.edu.au/www/repository_software/repository_software.htm</span></a>).</p>
<p>As outlined on this CAIRSS webpage, all 39 Australian universities have a research repository, with seven various repository software options currently in use.</p>
<p>CAIRSS will be working in the future to list which version of the software each installation uses.</p>
</div>
</div>
</div>
<h3 class="bsuite_related_bypageviews">People who looked at this item also looked at&#8230;</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2010/12/22/thoughts-from-eresearch-australasia-librarians-get-involved-in-research-data-management-2/'>Thoughts from eResearch Australasia &#8211; Librarians, get involved in research data management</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/03/09/cairss-repository-software-sandbox/'>CAIRSS Repository Software Sandbox</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/08/25/getting-into-google-google-scholar-and-other-search-engines/'>Getting into Google, Google Scholar and other search engines</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/06/23/two-ands-events-and-what-they-mean-for-cairss/'>Two ANDS events and what they mean for CAIRSS</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/'>How to export contents from an Institutional Repository to a Spreadsheet</a></li>
</ul>
<h3 class="bsuite_related">Related items</h3>
<ul class="bsuite_related">
<li><a href='http://cairss.caul.edu.au/blog/2010/01/20/how-to-export-contents-from-an-institutional-repository-to-a-spreadsheet/'>How to export contents from an Institutional Repository to a Spreadsheet</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2010/03/09/cairss-repository-software-sandbox/'>CAIRSS Repository Software Sandbox</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2009/05/28/open-repositories-conference-09/'>Open Repositories Conference 09</a></li>
<li><a href='http://cairss.caul.edu.au/blog/2009/05/13/what-do-you-get-when-you-combine-dspace-and-fedora-%e2%80%a6-duraspace/'>What do you get when you combine DSpace and Fedora? … DuraSpace</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://cairss.caul.edu.au/blog/2009/10/13/australian-repository-software-in-use/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

