CAIRSS

CAIRSS Weekly Update 2010-07-30

Events

Activity

Provide a forum to represent the collective interests of repository managers around Australia.

  • See CAIRSS website for details on 2010 CAIRSS Community Day: http://cairss.caul.edu.au/www/events/cairss_community_day_2010.htm. Program currently under development (to be released in August).
  • A group of Vital software users have started planning a meeting to coincide with the 2010 CAIRSS Community Day. Contact Vicki Picasso from Newcastle University for details.

Provide support for ADT.

  • Tim is now working with the institutions requiring migration of theses content from old VT-ETD system to IRs. There are still 7 institutions to commence this work, and 5 in progress currently. A new section outlining these ADT changes and new requirements has been added to the CAIRSS website at: http://cairss.caul.edu.au/www/theses/new_nla_theses_view.htm. Contact Tim at CAIRSS for assistance migrating your theses content from VT-ETD to your IR
    cairss-technical@caul.edu.au.

Assistance with the integration of repositories with the requirements of the ERA and the Higher Education Research Data Collection (HERDC) exercises.

  • For possible interest, advice from the ARC on book chapter scanning was posted to the CAIRSS elist this week: Q. For ERA book chapter RODAs, will the chapter be the only thing required? Is there any chance that the whole book may be requested e.g. to view the chapter in context ?
    A. Thank you for your question regarding Book Chapters and peer review. If a book chapter is submitted for peer review, the whole book does not need to be made available for review; just the book chapter. Regards, ERA Team. http://groups.google.com.au/group/cairss/browse_thread/thread/8e9e4db14da43fc

Assistance with the understanding of managing copyright issues in the repository environment.

  • The Perth CAIRSS Copyright Workshop was held yesterday. A special thankyou to Julie Woodland from Curtin University of Technology for assistance in organising this event.
  • See events section of this blog post for the Brisbane CAIRSS Copyright Workshop dates and details. Registrations are about to close for the Brisbane workshop.
  • Remember CAIRSS Community members can contact Robin for repository related queries on cairss-copyright@caul.edu.au.

Other.

  • Peter Sefton, CAIRSS Senior Advisor is investigating the Ranking Web of World Repositories http://repositories.webometrics.info/. Information to be posted to CAIRSS elist shortly.

Open Repositories 2010 – eResearch – mccallum

Over the last couple of years we have seen a shift in the software architecture used in Institutional Repositories. Institutional Repositories seem to have moved away from single software application models and have leaned towards using combinations of flexible and scalable web components, as well as more productive software development frameworks. So far these architectural changes have improved harvesting, indexing and general performance for the end user. But what about the researcher?

I am not exactly sure what term or phrase to reference when discussing the progressive interaction between a researcher and the software that will showcase or disseminate their research output, so allow me to refer to it using terminology that I have picked up recently such as AWE (Academic Working Environment) or eResearch Platform.

Am I correct in assuming that a researchers willingness to participate in actively making research documents and data available, as well as promoting ones personal profile might be elevated during the honeymoon phase of a research project? Based on conversations with many accomplished researchers I believe this is the case, however I have no hard evidence to back this up.

The good news is that advances in the AWE area from a software perspective are now available, creating an opportunity for change. Hopefully these advances will result in increased interaction from researchers and richer content on the web, these changes may not occur simply due to the fact that the software is available but may require innovation from within an institution or even a national mandate.

During the International Conference on Open Repositories 2010 I was made aware of systems including eSciDoc , Islandora and The Fascinator amongst others which address these issues but are unfortunately beyond the scope of this blog post. More to come.

I hope that this has created food for thought, all comments and thoughts are welcome.

Copyright Tim McCallum, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. < http://creativecommons.org/licenses/by-sa/2.5/au/ >

graphics1

This post was written in OpenOffice.org, using templates and tools provided by the Integrated Content Environment project and published to WordPress using The Fascinator.

CAIRSS Weekly Update 2010-07-16

Events

Activity

Provide a forum to represent the collective interests of repository managers around Australia.

  • 2010 CAIRSS Community Day to be held on Tuesday 23rd November 2010 in Melbourne. Registrations are now open. See CAIRSS website for further details at: http://cairss.caul.edu.au/www/events/cairss_community_day_2010.htm.
  • 2010 CAIRSS Community Day program is under construction and will be announced in August. Thankyou to those repository managers who have sent in program ideas. If you have a repository development you are interested in presenting on the day, or a topic area you would like to see addressed please contact Katy at cairss@caul.edu.au.
  • The new Australian and New Zealand DSpace users group had their first teleconference this week which CAIRSS participated in. Reminder that a new DSpace section has been added to the CAIRSS website (http://cairss.caul.edu.au/www/repository_software/dspace.htm). The new webpage outlines details for a new closed Australian and New Zealand DSpace users elist that is managed by Sten Christensen. If any other software groups want to use the CAIRSS teleconference facilities to discuss software issues, please contact Katy at cairss@caul.edu.au.
  • A group of Vital software users have started planning a meeting to coincide with the 2010 CAIRSS Community Day. Contact Vicki Picasso from Newcastle University for details.

Provide support for ADT.

  • Tim is back from Spain and CAIRSS is now working with the institutions requiring migration of theses content from old VT-ETD system to IRs. A new section outlining these ADT changes and new requirements has been added to the CAIRSS website at: http://cairss.caul.edu.au/www/theses/new_nla_theses_view.htm. Contact CAIRSS if you need assistance migrating your theses content from VT-ETD to your IR.

Assistance with the integration of repositories with the requirements of the ERA and the Higher Education Research Data Collection (HERDC) exercises.

  • The 18th June ERA upload file window date has passed and most of the community seems to have survived the first leg of ERA 2010. Please continue to submit any ERA SEER repository queries to CAIRSS as we continue to work closely with the ARC in this area.
  • ARC ERA faqs page (http://www.arc.gov.au/era/faq.htm) has some information on ‘when’ you may be asked for non-IR RODAs.
  • John Lamps at Deakin has developed a tool for looking up ERA journal rankings that you might not have yet seen. It is available at: http://lamp.infosys.deakin.edu.au/era/.

Assistance with the understanding of managing copyright issues in the repository environment.

  • Based on feedback from the Sydney CAIRSS Copyright workshop, Robin Wright the CAIRSS Copyright Specialist has provided a new updated version of the ‘CAIRSS and Copyright: A good practice guide’. See: http://cairss.caul.edu.au/www/copyright/a_good_practice_guide.htm. This includes additional or revised information in sections 2.6, 3.9, 3.12 and 3.13.
  • See events section of this blog post for the Perth and Brisbane CAIRSS Copyright Workshop dates and details. Registrations are still open for the Brisbane workshop.
  • A poster of possible interest on copyright permissions was recently presented at an ALA Convention. Asking for Permission: A Survey of Copyright Workflows for Institutional Repositories by Hanlon & Ramirez: http://works.bepress.com/marisa_ramirez/14/.
  • Remember CAIRSS Community members can contact Robin for repository related queries on cairss-copyright@caul.edu.au.

Provide a watching brief on trends and developments in repositories, Open Access, scholarly communication and dissemination of research in repositories both in Australia and overseas.

Open Repositories 2010 – Digital Media – McCallum

Today is the last day of the conference, I have been looking forward to attending the session on digital media as I believe that this is a topic with a big future. I have noticed allot of activity with Java programming libraries regarding media content and visualisation in the last few years. Below are some details of the work that the University of York has done recently.

“YODL2: Developing a search interface for multimedia content at the University of York” (Nigel Verghese Thomas and Julie Allinson).

Originally YODL was an implementation of Muradora as access control was an important requirement for the University of York.

The system currently holds 10 thousand objects and contains preview, thumbnail and fullscreen versions for all content.

The University of York has moved away from Muradora and created YODL2. As I understand it, this was due to the development of Muradora being halted and the start of performance issues as the system reached 10 thousand objects.

YODL2 has been written in Java. The architecture of the system is based on dependancy injection principles and uses SOLR 1.4, Spring and Fedora 3.3. The new design has an improved user interface (eg browser based zoom and handle functionality for viewing images and page turning within the browser) and performance gains.

The University of York performed a survey to assist them with interface product choice, the prototype was even written in other languages and frameworks including .NET and C#. It seems they put allot of effort into doing homework on finding the right tools to use. The candidates for enhancing the user experience were Java FX, Flex and Silverlight. Flex was chosen due to cross platform support and the fact that it is well documented. Although Silverlight would have worked on Windows and Mac there was concern that Linux users would require Moonlight (an open source implementation of Silverlight).

With YODL2, the data for display is drawn from Fedora. Sequence information is stored in RDF which when compounded provides the logic for navigating through say a book, for example there is “has next” and “has previous” value associated to pages within a book.

There were demonstrations on in-browser deep zoom, browsing image search results and paging through a book object.

This session brought us to the end of the conference which overall was a great success. The Palacio de Congresos de Madrid is a great venue, everything went as smoothly as one would expect a conference of this size would. A few minor hiccups with the wireless was definitely outweighed by the very friendly staff and the daily 3 course lunches.

Like everyone else here I have made many new friends and contacts including staff from 2 different repository support services, Welsh Repository Network and Repositories Support Project.

Please leave comments on this blog or email cairss@caul.edu.au if you have any questions regarding the conference.

Copyright Tim McCallum, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. < http://creativecommons.org/licenses/by-sa/2.5/au/ >

graphics1

This post was written in OpenOffice.org, using templates and tools provided by the Integrated Content Environment project and published to WordPress using The Fascinator.

Open Repositories 2010 – Day 2 – Terminology Services – Sefton

On day 2 by far the most striking paper for me was by Michael Durbin [PDF]: Terminology Services in a Digital Repository. He described a set of tools for creating controlled vocabularies and making them available to services such as metadata forms. This is an important area or repositories if we can share terminology using a linked-data approach then we can improve discoverability. There’s a page about the service at OCLC’s site.

At the moment in the IR world we have a variety of ways to refer to the same thing so you get Article, Journal and Article and Journal Article, and so on, all referring to (more or less) the same thing. Using a terminology server, we could all use the term from the bibliographic ontology with the URI: http://purl.org/ontology/bibo/Article. That doesn’t mean the users have to see that URI, you can still choose what you want to show on a web page as part of a description, but when services such as Trove or ARO aggregate repository content we can be much more confident that the right kinds of resources are being grouped together.

What the users depositing content will see need not change much, they will still be presented with drop-down lists and auto-complete fields, but behind the scenes repositories will not be storing plain-old text strings as metadata, but a combination of an identifier (a URL) and (maybe) a text string.

There are a similar efforts elsewhere. Chatting to Neil Jefferies of Oxford, during the Duraspace-hosted drinks on the third day I learned that Oxford has a similar service under development. In my last post I mentioned BibApp as another important service which can do a similar thing for people’s names. And I know that ANDS have been exploring the idea of an Australian vocabulary service.

In the ADFI Software R&D team’s forthcoming work for ANDS under the metadata stores programme we will be making a service which will work for both names and terms. I hope that as this new class of software becomes more common and the various efforts mature we can start to work out some standards for interoperability.

Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. <http://creativecommons.org/licenses/by-sa/2.5/au/>

Open Repositories 2010 – Update – McCallum

Today we reached the point in the conference on Open Repositories 2010 where the general sessions ceased and the user group sessions began. The user group sessions that I attended were very practical and slightly more technical than the general sessions. One talk included an excellent live presentation using DSpace where the audience were able to log in and participate.

Statistics

I have heard some good discussions regarding the relevance and accuracy of repository statistics back home and have personally experienced persistence issues where statistics have disappeared during the administration and upgrading of software. I decided to attend the talk named named DSpace 1.6 useage statistics, what can it do for you? (Ben Bosman) in the hope of discovering some new facts about statistics which I could share.

It was noted during the talk that 36% of DSpace users requested features relating to statistics. The latest version of DSpace (1.6.x) now includes a sophisticated statistics package, which was a contribution by @MIRE. It is worth noting that the logging and collection of the statistics ships with DSpace for free, however the statistics visualisation package which filters and displays the statistics in a browser can be purchased from @MIRE. It is possible to create your own visualisation software by simply querying the stored statistics.

DSpace can trigger a statistics event in 4 ways. A community home page visit, a collection home page visit, an item visit or a bitstream download. When an event is triggered, the information that is stored includes (but is not limited to) the IP address of the visitor, the referring web site, the longitude and latitude of the visitor as well as the continent code (eg Asia) and country code of the visitor (eg Japan).

These events are stored in a raw format using XML. The information is then indexed using SOLR, meaning the indexing and retrieval of information is fast and efficient. This architecture allows the use of an additional/separate machine for the SOLR component, saving CPU and RAM resources in the repository and allowing load balancing for the SOLR machine if required.

When we think of statistics the first thing that comes to mind is traffic, hits and popularity, however statistics based on metadata and bitstreams allow us to measure the growth of repository over time and analyse the repository based on information about the objects and their associated bitstreams. The collection of statistics in this system is highly configurable.

Having rich data in any form is a good thing I guess and even if traffic and popularity of single objects is not your goal this type of granular statistics collection on any platform may add value one day when justifying the existence, history or importance of your repository.

Fedora

I gathered some notes on Fedora as it is widely used in Australian repositories, below is a guide/roadmap of Fedora for the near future.

Fedora 3.3 will be released on December the 18th. New features will include the ability to ingest and provide local content using the file:// address. We were advised that the RESTful API in this version is no longer experimental and is now ready for use. Also RELS-INT relationships are supported.

Fedora 3.4 is coming soon, no date set at this point. The release candidate is out now users are able to assist in the testing of this release. This version includes size attribute improvements, scaling of logging without restarting and 25 bug fixes.

Fedora 3.5 will pave the way for 4.0 and has the following features pencilled in. A RESTful relationship API (currently versions are using SOAP to manage relationships), as well as the use of Spring and OSGi, enabling the community to contribute code more efficiently. Improvements to the Fedora command line interface was mentioned also.

Copyright Tim McCallum, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. < http://creativecommons.org/licenses/by-sa/2.5/au/ >

graphics1

This post was written in OpenOffice.org, using templates and tools provided by the Integrated Content Environment project and published to WordPress using The Fascinator.

Open Repositiories 2010 – Strategic overview of DuraSpace, and roadmaps for DSpace and Fedora – Parker

Developments in the collaboration between Fedora and DSpace are of obvious interest to the Australian repositories community, where the majority of research repositories are powered by one or other of these technologies.

The DuraSpace collaboration has gained a lot of attention since the DSpace and Fedora communities joined forces in 2008. There won’t be a code release for a while that integrates the two but it looks like a number of alliances have been formed and the community has substantially expanded. Eight of Fedora’s eleven code committers are now from the community rather than DuraSpace. The DuraSpace team members are no longer affiliated with their original universities— all now work independently for DuraSpace. And DuraSpace is a nonprofit organisation, which means they’re trying to move away from reliance on grant funding and towards a more sustainable model based on sponsorship and services.

The DuraSpace Strategy

According to Chief Technology Officer Brad McLean, DuraSpace’s mission statement is deliberately broad to allow support for community-led projects that may not directly relate to DSpace or Fedora. One of the objectives for 2010 is to “support more independent user-facing projects such as Blacklight”. This attitude explains DuraSpace’s interest in DepositMO, SWORD, Drupal/WordPress integration at Islandora and a PKP collaboration with the Big Digital Machine at Indiana University.

What about DSpace and Fedora?

DuraSpace still sees DSpace as a “turnkey reference platform for repositories” and a “user experience incubator”, while Fedora is an “easily installed, configured and maintained repository toolkit”. One possible future arrangement that DuraSpace envisages is DSpace running on Fedora running on DuraCloud. However many institutions could take a while to be ready for cloud computing. So for the moment DuraSpace is focussing on a future for DSpace with “Fedora Inside”. The goal is to retain the out-of-the-box experience of DSpace but to benefit from the strengths of the “robust” Fedora architecture.

Tim Donoghue from DSpace makes it clear that DSpace with “Fedora Inside” is still some way off—perhaps 2011 or 2012—as the software will need to go through some intermediary stages first. They’re attempting to follow more predictable release schedules; the next version of DSpace is 1.7, scheduled for December 2010.

One of the most interesting anticipated features in 1.7 (from this repository manager’s point of view) is support for different submission processes based on publication type. The example Donoghue gives here is a separate workflow for depositing theses as opposed to other publication types. This could be really useful for Australian DSpace repositories. Also, DSpace already supports import using SWORD but in 1.7 the developers are also looking at the possibility of exporting data using SWORD, which would theoretically allow DSpace to export to Fedora or to EPrints or perhaps to other repository software.

Brad McLean described how “decoupling of the user interface from workbenches” has already occurred in repositories (see for example Manakin, Blacklight and SWORD), and wonders whether the admin side of repository software will follow the same trend. I think repository managers should welcome any attempts to enhance our admin tools. We’ve focussed first on improving the usability of the end-user interfaces to our systems—this is naturally a matter of urgency—but it’s high time that repositories became easier for repository managers to use, too.

For more information on DuraSpace, see: http://www.duraspace.org/

Copyright Rebecca Parker, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. <http://creativecommons.org/licenses/by-sa/2.5/au/>

Open Repositories 2010 – Update – Parker

The winners of the poster competition (as decided by popular vote)were announced at the conference dinner last night. It’s interesting that while the presentations at Open Repositories are mostly technical (often at too high a level for most repository managers), the posters focus more on general repository management issues like open access policy, publishing trends and copyright. This flavour was reflected in the winner of the competition, a poster titled “Open Research Online:a self-archiving success story” (Smith & Yates) and the runner up, a poster from the Sherpa team outlining “Major improvements to the RoMEO service” (Smith & Millington).

Smith and Yates from Open University UK challenged the popular belief that self-deposit doesn’t work without a university-level mandate. They reported impressive contribution statistics (higher than most coverage levels in mandated repositories as indicated by my own reading of the literature). They argue that the secret to encouraging contribution is providing a service that appeals to researchers rather than just university administration. This is something we’re well aware of in Australia, and it’s something I believe we’re going to need to observe carefully in the post-ERA days.

The RoMEO team are building a new version of their database expected to be released later this year. Last year they improved the clarity of policies around archiving published versions within their database. This was a great development and something from which we in Australia have benefited for some time through the OAKList database.

Now Sherpa is working on an ambitious project to reorganise their records around journal-level policies, rather than publisher-level records. This will be particularly useful for society journals published by Wiley (previously under the Blackwell banner), where both copyright ownership and embargo periods differ between journals but this can be difficult to establish from the publisher or journal website.

Copyright Rebecca Parker, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. <http://creativecommons.org/licenses/by-sa/2.5/au/>

HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%

Open Repositories 2010 – Day 1 – Drury

Day one began with the conference being opened by the keynote speaker, David De Roure, speaking about a recent service, “myexperiment”. Myexperiment is essentially a social virtual research environment which also includes a repository of research methods. The environment has as little mandating of activity as possible, with the researchers able to take a more flexible approach to what can be included. The repository consists of “packs” which don’t just contain a workflow, but also any number of objects attached which relate somehow to that workflow. David also mentioned what he called the six ‘Rs’ of research object behaviours – that is must be: replayable, repeatable, reproducible, reusable, repurposable, and reliable. He also added in a seventh ‘R’ which was ‘referencable’. The environment is currently in its second generation, where the key characteristic is re-use of an increasing pool of tools etc, and provenance analytics are coming into play. In the anticipated third generation, they are looking at the global re-use of the tools, data, and methods in what they call a ‘radical sharing’. ‘myexperiment’ was summed up as being an evolving social infrastructure for sharing.

Session one of the day included two main areas of interest. Firstly that of Research Data. Matthias Razum spoke about eSciDoc, a portal which sits on top of data, and facilitates sharing and collaboration. In eSciDoc, the researcher creates an experiment, then as the researcher does the actual experiment, all data goes straight into eSciDoc. From there the researcher can perform the analysis, visualisation, etc. Mark Hedges then followed on to speak about gathering of data, and made several interesting points: firstly that researcher practices vary, but there are commonalities. For example, many researchers keep their data on their desktop, process the data, output the data (in the form of a publication), then submit some data to a databank – and what is not submitted to a databank is lost. Mark mentioned the OPM – the Open Provenance Model, and discussed the Model’s concepts of node types and the relationships between the nodes called edges which are stored using RELS-EXT. Finally, Julian Jonier from Kitware spoke about the work being done on collecting medical imaging and its related data in a system called MIDAS. He spoke a little about the structure of the system as well as some of the challenges that this area of data faces such as the massive datasets, the challenge of many different formats, and the lack of standardisation across the format metadata schemas.

Session two also included two main areas, one of which was Adminstrative Systems. Sally Rumsey of Oxford spoke first on the blurring of the boundaries between the IR and the research information registry. During this presentation she spoke about the research activity data registry which does not store the actual data, only the metadata. She also mentioned the entities that are used including Person (and persona), Project, Funding Body, Organisational Unit, etc, along with the other controlled vocabularies that are used. Les Carr from Southampton then presented on Research Assessment and the impact on the openess agenda. He mentioned CRIS (Current Research Information Systems) which seem to be becoming quite widespread. Along with CRIS he also highlighted CERIF which is a standard data schema to interoperate between systems. CERIF-ed Irs may contain many separate datasets, all linked by explicit relationships. He explained how their Eprints repository prior to the CERIF contained funder information which was just added as metadata, and how the Eprints repository now contains funders as their own objects for example, a paper links to affiliated projects instead of just naming them. To finish up the session, Arnoud Jippes from the Netherlands spoke about NARCIS and Research information services on a national scale. The Netherlands has a very unified approach all unversities use the same CRIS system called METIS. All IRs are harvested by NARCIS, as well as the National Library. He also mentioned that all the repositories use the DAI – the Digital Author Identifier.

The last session for the day included a session on Repository Frameworks. Stephen Abrams spoke frst on the work being done by the California Digital Library (CDL) and University of California San Diego Supercomputer Center. It is a collaboration between CDL and 10 campuses and several peer institutions. They are looking at how to manage and add value to a body of trusted digital content. Their approach is to build complexity through composition, not addition in other words they are taking a very modular approach. They keep everything as small blocks which are easier to replace when they outlive their usefulness. The implementation project which Stephen focussed on is the MERRIT project, which is aiming to consolidate 140TB of existing content. Tom Cramer from Stanford then spoke about the Hydra project, a collaboration between Hull and Virginia Universities, and Duraspace. He mentioned how IRs are well established, but there has been no framework to produce applications that interface between IRs and users. They have identified shared functions including deposit, manage, search, etc. Their approach has been one body, many heads, which is a component approach including components such as Fedora, ActiveFedora, Solr, Blacklight, along with a Hydra Plugin, Services, and a portal called Hydrangea. They have recognised that no single institution can resource the development of a full range of solutions of its own, so they have also taken a one body many heads approach to the community which shares the load. Finally Alex Wade from Microsoft Research spoke on the tools that Microsoft Research has been developing to assist with dataset lifecycle management, including Pivot and Zentity.

Copyright Caroline Drury, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. <http://creativecommons.org/licenses/by-sa/2.5/au/>

HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%

Open Repositories 2010 – Day 1 – Sefton

My top few things from day one or OR10.

  1. From the keynote, which was a presentation by David De Roure, professor of eResearch from Oxford. He looked at the way the research process is making its way onto the web.

    He said something along the lines of Methods are first-class citizens, which made me think of the ANDS IS O2146 data model for describing research data: Collections, Parties, Services and Activities. And now, along comes ‘methods’. How to adapt? I think it’s worth ANDS considering an information architecture which is flexible enough to adapt to insights like this, and emerging needs there a few of us suggesting that using RDF metadata might be a way forward.

  2. BibApp is an interesting bit of software. [PDF] It’s built around the idea of researcher profiles, and includes services for name-disambiguation, along the lines of the work done by NicNames and the forthcoming ARDC-PIP. Tim Sherrat of ANDS pointed it out to the ANDS eList recently.
  3. There was a presentation on an ongoing repository evaluation process [PDF] which was possibly a bit long-winded for what amounted to a description of a methodology, we’ll see how useful this is, but I think that ‘objective’ measures of software are much less important than narrative that helps you to understand what it does (benchmarks like whether it actually works for more than five repository items are important of course).
  4. From the poster sessions:CAIRSS and the NLA probably need to talk to OCLC [PDF], as we have been doing with the Google Scholar team, to see how we can make sure we’re feeding the best possible quality data through to their open-access aggregator.

Copyright Peter Sefton, 2010. Licensed under Creative Commons Attribution-Share Alike 2.5 Australia. <http://creativecommons.org/licenses/by-sa/2.5/au/>

HTTP://DBPEDIA.ORG/SNORQL/?QUERY=SELECT+%3FRESOURCE%0D%0AWHERE+{+%0D%0A%3FRESOURCE+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%2FBIRTHPLACE%3E+%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FRESOURCE%2FSYDNEY%3E+%3B%0D%0A%3CHTTP%3A%2F%2FDBPEDIA.ORG%2FONTOLOGY%2FPERSON%

This post was written in OpenOffice.org, using templates and tools provided by the Integrated Content Environment project and published to WordPress using The Fascinator.