tag:blogger.com,1999:blog-2495500078306016514.post1398706900405367617..comments2023-01-19T06:00:58.290-08:00Comments on UK Council of Research Repositories: Are your repository policies worth the HTML they are written in?Dominic Tatehttp://www.blogger.com/profile/16810507522854441709noreply@blogger.comBlogger5125tag:blogger.com,1999:blog-2495500078306016514.post-33939786092071934642012-03-22T17:02:54.254-07:002012-03-22T17:02:54.254-07:00Pat, this echoes the experience we've had with...Pat, this echoes the experience we've had with CORE. I've already blogged on the issues of linking from the metadata record to the thing described (with a particular focus on 'fulltext') at http://core-project.kmi.open.ac.uk/node/31 - but I haven't mentioned that resumption token issues are among the more common of the problems we've encountered when harvesting records.ostephenshttps://www.blogger.com/profile/09316058230103850251noreply@blogger.comtag:blogger.com,1999:blog-2495500078306016514.post-39791506442197414312012-03-22T08:53:28.179-07:002012-03-22T08:53:28.179-07:00Hi Pat. I also tried to do some OAI-PMH harvesting...Hi Pat. I also tried to do some OAI-PMH harvesting (for one evening) a couple of years ago, and posted a very, very badly written blog post about it http://www.nostuff.org/words/2008/playing-with-oai-pmh-with-simple-dc/<br /><br />In short, I found it difficult to count the number of full text items as the same DC field was used by so many thingsChris Keenehttps://www.blogger.com/profile/15435507593145209174noreply@blogger.comtag:blogger.com,1999:blog-2495500078306016514.post-3968300976426355092012-03-22T08:51:53.397-07:002012-03-22T08:51:53.397-07:00This is a very valid point.
Like you, we tried t...This is a very valid point. <br /><br />Like you, we tried to make our policy fairly realistic and open:<br />"Metadata<br />Anyone may access the metadata free of charge.<br />The metadata may be re-used in any medium without prior permission for not-for-profit purposes and re-sold commercially provided the OAI Identifier and/or a link to the original metadata record are given." If anything, if I was reviewing it today I would remove the need to link back (it's probably unrealistic)<br /><br />IR records are just webpages, and anyone who has ever looked at a web log file will know there are thousands of web crawlers (some of which may be nice enough to refer to your robots file, most will not) to index and reuse the data.<br /><br />I'm also not convinced that we need author's/publisher's permissions to redistribute. When we share our Catalogue bibliographic records we do not ask each individual author/publisher if they mind us passing on a record describing their book, and I don't see any real difference with IRs. <br /><br />Information on the web will be reused, this can only be a good thing for us.Chris Keenehttps://www.blogger.com/profile/15435507593145209174noreply@blogger.comtag:blogger.com,1999:blog-2495500078306016514.post-81058863366830191372012-03-22T05:47:20.327-07:002012-03-22T05:47:20.327-07:00A side note, I tried to do some oai-pmh harvesting...A side note, I tried to do some oai-pmh harvesting recently across a group of repositories and it just wasn't worth it. More often than not resumption tokens didn't work, and the metadata often lacked any identifier to explain what the metadata was for - which made harvesting it somewhat frivolous. <br /><br />In the end I reverted back to screen scraping the sites, as that was a lot more reliable.Pathttps://www.blogger.com/profile/09884646385829861282noreply@blogger.comtag:blogger.com,1999:blog-2495500078306016514.post-28284255320052491792012-03-22T05:10:10.229-07:002012-03-22T05:10:10.229-07:00Thanks for this thoughtful response Nick. I'm ...Thanks for this thoughtful response Nick. I'm going to follow up in the next couple of weeks with a post describing what CORE is actually going to do about harvesting metadata and fulltext. While it's been important to do this investigation, as an aggregator we've got to make a decision about what we do, and we obviously want to balance the needs of the repository owners with our own aims and mission.<br /><br />I think the approach we are planning to take walks this line, but it is really important for us as a project to get feedback from the 'repository' community and to continue the discussion whatever decisions we make now for the purposes of our current project. I hope that this blog and other UKCoRR channels will provide a way of doing this.ostephenshttps://www.blogger.com/profile/09316058230103850251noreply@blogger.com