This page is now "retired" as it has been superseded by the Comprehensive Knowledge Archive Network (CKAN) site.

General

  1. wikipedia: DONE

  2. www.archive.org (WONTDO -- not a single entity)

  3. project gutenberg DONE

  4. Christian Classics Ethereal Library DONE

  5. world wide molecular matrix - at Cambridge University DSpace (DONE - crystaleye)

  6. Website Attica: http://www.chass.utoronto.ca/attica/

  7. MIT OpenCourseWare (not open really but should probably do ...)
  8. Connexions DONE

    • Knowledge should be free, open, and shared. Connexions is a rapidly growing collection of free scholarly materials and a powerful set of free software tools to help
    • http://cnx.rice.edu/

  9. http://www.opencontent.org/ (WONTDO - no specific open material)

    • OpenContent.org is reinventing itself as a portal into high quality, open access educational materials and educational discussions. Using the box above you can search the OpenCourseWare collection and its official Español and Portugues translations, the Connexions collection, and the Open Learning Support support forums

  10. Mathematics (WONTDO - not sure of size or status)

  11. http://www.mathforge.net ?? not so sure

  12. http://www.opentextbook.org + http://www.opengeodata.org (WONTDO - both starter projects, not sources of data)

  13. https://www.bioforge.net Community for Biological Innovation (WONTDO - inactive)

  14. http://www.keithbriggs.net -- online departure board information (WONTDO - no data it seems)

Property Data

Software

  • okftext
    • text, pdf, latex, html, xml
  • lilypond
    • tags: music

Geodata

Music

Musicbrainz

DONE

Texts

Economics

  1. title: innovation in specific industries DONE

    • url: http://www.hss.cmu.edu/departments/sds/faculty/klepper/archive.html

    • description: The first data set contains all the data used in the analyses reported in the paper by Steven Klepper and Kenneth L. Simons entitled "The Making of an Oligopoly: Firm Survival and Technological Change in the Evolution of the U.S. Tire Industry," Journal of Political Economy, 2000, vol. 108, no. 4, pp. 728-760. The other two data sets contain all the data used in the analyses reported in the paper by Steven Klepper and Kenneth L. Simons entitled "Dominance by Birthright: Entry of Prior Radio Producers and Competitive Ramifications in the U.S. Television Receiver Industry," Strategic Management Journal, vol. 21, pp. 997-1016. Of these two, the first pertains to the firms that produced radios and the second to firms that produced televisions. Explanations of how each data set is organized, and the downloads themselves, are linked from the url above.
  2. title: repec bibliography DONE

History

  1. history event markup language DONE

    • url: http://www.heml.org/

    • status: inactive (no change since 2004)
    • type: tool and data
    • description: tools for producing timelines and geographic charts. The data is there to demo the tools rather than to be comprehensive

Closed Data

Bibliomania.com

Bibliomania.com went bankrupt, but from the site we have:

What is the copyright status of texts on Bibliomania.com?

Most texts on our site are in the public domain. However Bibliomania.com Ltd has copyright in the HTML versions we have created for our web site. You are free to download these texts for personal use, but they may not be used for any commercial purpose, or republished in any form (including on the internet) without our prior email permission. Bibliomania.com Ltd has and will take legal action worldwide to protect its rights.

Please use the comments board to email us for copyright permission.

How do I cite a Bibliomania work?

We do not have full bibliographic data for the texts on Bibliomania, and they were typed from scratch, repaginated and reformatted hence these works are an original edition and should be cited as copyright Bibliomania.com Ltd 2000.

genuki.org.uk/big/eng/YKS/

Genuki provides historical and genealogical information, including information on Yorkshire. Its claims relate not to copyright (though what copyright could there be in 100-year-old photographs?) but to the assertion of database rights in information, much of which dates from 1892.

http://www.genuki.org.uk/big/eng/YKS/Misc/conditions.html

((( All the material which is to be found on the Genuki Yorkshire site (any page which has a URL starting with "www.genuki.org.uk/big/eng/YKS/") is held in a database by me, and software to which I own the copyright, is used to extract the relevant data and generate the pages which you see on the Genuki Yorkshire site. A United Kingdom Act of 1997 specifically covers the compiling and use of database material. The notice below is required to be displayed in order to give me protection under this Act: Database Right, all databases used for this website are covered by the 1997 Database Regulations. Colin Hinson (and others as stated on the relevant pages) are the makers of the database used for this website and the owner of the database rights. First published in 1997. )))

Music Databases

Music databases (including open ones such as mutopia) claim copyright in the typesetting of their musical scores. While there is a compilation type copyright (for presentation) in most jurisdictions, I don't really see how this would cover the representation of the score in a musical notation such as lilypond or **kern when the original music is out of copyright.

One of the more outrageous (and stupid) examples of using this copyright to close access is on http://www.musedata.org/ (what makes it particularly bad is that this is an academic project):

The research license: http://www.musedata.org/legal/licen.html -- MuseData files are provided free of charge to academic and non-commercial users but they remain the intellectual property of the Center for Computer Assisted Research in the Humanities, Braun #129, Stanford University, Stanford, CA, 94305-3067, USA.

Before downloading any materials from this site, please indicate your acceptance of the terms of this license agreement

All other prospective users must contact the Center for Computer Assisted Research in the Humanities, Braun #129, Stanford University, Stanford, CA 94305-3076, USA, before downloading, copying, or redistributing any data, in whole or in part, as found here or in derivative versions, in any format, electronic or otherwise, found at this site. --

The same site also runs themefinder which starts off its about page with:

  • Both the notated images and underlying data representations used by Themefinder are protected by international copyright laws. Visitors to this site are free to use Themefinder to search for musical themes for personal, teaching and non-commercial research purposes. However, any attempt to download the database, in whole or in part, will be considered a breach of copyright, and may lead to denial of access or legal action. The notated images are copyright © 1999-2000 by the Center for Computer Assisted Research in the Humanities. The encoded thematic material is copyright © 1999-2000 by David Huron and CCARH. The encoded European folksongs are copyright © 1999 by the estate of Helmut Schaffrath and used by permission. The Latin Motet thematic material is copyright © 1993 by Harry B. Lincoln and used by permission.

Shakespeare

Given Shakespeare's public domain status, it is incredible how much material claims copyright in his works or in related information, e.g.:

Open Source Shakespeare

Despite its name, it bears this statement at the bottom of each page:

Program code and database © 2003-2006 Bernini Communications LLC. If copyrighted, texts are the property of their respective owners. About the texts used in OSS • Privacy policy

It does contain interesting information on the way most Shakespeare texts end up being copyrighted again:

UK (ex-) Government Data

National archives

  • free to access and transcribe
  • 25GBP per copy

Not yet investigated properly

  • landregistry
  • companieshouse
  • ons (Office for National Statistics)
  • met office: can't even find out what is available and what it costs

Sites Contacted without Response