schenizzle

Libraries, Classification, Cataloging, Carrboro, Duke

LC MARC codelists in XML

Posted by Sean Chen on February 15, 2008

I saw the announcement about LCCN permalink. But surfing around I then noticed that there were XML versions of the MARC codelists including language codes, country codes and geographic area codes. No relators and sources yet. This is terribly handy if your application needs up to date codelist or you are trying to process MARC files and need to translate dcu to Washington, D. C.

Thanks LC. Lets keep putting more things out there!

Posted in General | No Comments »

OMG WTF?!

Posted by Sean Chen on November 10, 2007

Radiohead covering Joy Division/New Order’s Ceremony

Posted in General, Music | No Comments »

Highlights from: LCSH Weekly List 42 (October 17, 2007)

Posted by Sean Chen on November 6, 2007

I missed posting about Dildos [sh2007007162] appearing on a September list. But in that vein … in good old Week 42:

(C) 150 Sex and popular music [May Subd Geog] [sp2007007020]
450 UF Popular music and sex
550 BT Popular music

It hasn’t made it into the distribution yet, but it is always funny to think, how many books with literary warrant passed someone’s desk to create this SH and it is only being authorized now.

(A) 150 Web 2.0 [sp2007008319]
053 TK5105.88817
550 BT World Wide Web

Who says SACO moves fast?

Posted in LCSH | No Comments »

UNC School of Information and Library Science, Fall 2007

Posted by Sean Chen on November 6, 2007

Fall 2007 is rolling now. Two classes for me this semester, strangely I think both are going to be about networks, social computing, electronic identity, and information.

INLS 490: Online Social Networks

http://www.ibiblio.org/fred/inls_490/

Class has a link stream setup at: http://del.icio.us/tag/inls490

Syllabus is pretty good. I’m pretty much doing this class for some deep background and to learn a bit more about research methods that people are applying in this little bit of our information space.

INLS 500: Introduction to Human Information Interaction

http://sils.unc.edu/programs/courses/descriptions.html#180

No public syllabus or website.  Its being done in a combination of Blackboard, in-class and Second Life.

The way I look at this class is that it is the introduction to the information science literature class.  Everyone in the program (both library and information science tracks) has to take this one.  It is the only required class for both tracks.

But … we spent the first half of the semester using Second Life as an in-class discussion tool … i.e. did small group discussions in it.  Well this was … you can imagine how that worked out.  The Second Life part has disappeared and now the class is being done in the usual manner.

Posted in SILS | No Comments »

Future of (Universal?) Bibliographic Control

Posted by Sean Chen on July 19, 2007

I tossed in the universal part because that was the underlaying promise of over forty of years of work by the library community. In a sense what we have now is a result of that aim. From IBSD, MARC, AACR2, FRBR and the rest of the alphabet soup has been a soupcon of hope that one day we’d have the systems and standards which would lead to one record done internationally for any work. I was reminded of this when I heard Michael Gorman talk at the American Association of Law Libraries annual meeting in New Orleans this past week, which is another story on to itself.

Of course the world is complicated and we live in a Babel of a bibliographic (or now more precisely descriptive) universe. So for my part here is what I feel still needs to be done. There has been obviously, with the Working Group on the Future of Bibliographic Control, either an attempt at a search for next steps and a vision of how things should be (or as the cynics and skeptics see it an attempt for political cover with making some really difficult choices).

So my laundry list, or more possibly wish list which really concentrates on the bibliographic utilities (read: OCLC) since that is the pointy end of the spear that most catalogers do their work and where most of our data eventually ends up.

  • Authorizing national enhance editing on a larger scale. In a sense distributing more of the work into more hands. Wikis work with the wisdom of crowds. Records aren’t sacred, they are wrong and incomplete the barrier to fixing things should be minimal. Shouldn’t cataloging on a more expansive model work with the wisdom and experience of trained catalogers?
  • Lowering the bar to getting back enhanced records for use and reuse. Sure I see that services like OCLC’s PromptCat have been successful for a lot of libraries, but the I think the ongoing costs have been a bit to too much for some to swallow. This should be something that all our bibliographic systems should be taking advantage of. Along with this comes the thought that records don’t need to be perfect the first time they come in, but that tendency comes because its an expensive operation to go back. Going back and benefiting from the work that others do behind you in the processing stream needs to be cheaper, faster and easier.
  • There needs to be a better incentivisation of the entire record creation, enhancing and reconsumption process. Sure we’ve brought our vendors into the process, and obviously there is now pressure on publishers to push more out upstream. But seems like it has been difficult get library administrators, systems, catalogers all pulling in the same direction with this.
  • More training and implementation support throughout the library community. My sense of things has been that things worked OK so far. But we really need to be taking advantage of new ways to distribute training and support to everyone who wants it. This has to be cheaper and easier.

Is there an emerging consensus on the future and what to do to do with bib control? I don’t know.

Posted in Cataloging, Libraries | No Comments »

Subject Heading Validation

Posted by Sean Chen on July 6, 2007

Bouncing around on the PCClist:

As announced in the CDS bulletin of May 25 2007 http://www.loc.gov/cds/notices/2007-05-25-Subject_Authority_Validation_Records.pdf CPSO has begun a project to create subject authority records for every subject string appearing in bibliographic records to aid Library of Congress catalogers, and external users in the validation of LCSH subject heading strings. Effective immediately subject authority records are being created for valid subject strings obtained from bibliographic records. Formerly, these subject strings did not prompt the creation of subject authority records, because they contained free-floating subdivision[s].

Some of these records are being created manually by the Cataloging Policy and Support Office staff, and some will be generated by machine, but all of them will be reviewed before distribution occurs. We anticipate at least 200 records per month at the start of this project. These records will NOT be printed in the annual editions of LCSH (the “red books”). The records can be identified by the legend “[proposed validation record]” appearing at the end of the 1xx string. This legend will be removed once the records have been approved and distributed. Additionally each record will contain a 667 field with this data: “Record generated for validation purposes.”

This is actually really cool. If you are getting a distribution of the entire subject authority file you’ll eventually be able to validate against entire subject headings rather than piecing together from individual subfields.

This also is really good from a subject cataloging point of view because we’ll be able to just pull a valid string from the file rather than saying … is this geographically subdivideable … is this one usable under a war or only animals and plants etc. A lot of individual decisions can be taken out and we can concentrate on what the book is about rather than trying to remember arcane subdivision usage rules and notes.

What isn’t clear is the source from where the subject headings will be pulled from. I’m curious if they are going to limit to the set of records that are DLC only and thus limit to DLC practice or will they include PCC records too. One reassuring things is that the individual subject headings will be subject to review. And one last point is you won’t see these headings in the weekly lists and in the red books, since really they are meant for plumbing.

Posted in Cataloging, LCSH, Libraries | No Comments »

Economics and Organization of Bibliographic Data

Posted by Sean Chen on June 9, 2007

In the background paper for the third meeting of the Library of Congress’ Working Group on the Future of Bibliographic Control there are a series of questions that the meeting requests comment on. One of them stood out for me:

4) A recurrent theme of the previous meetings was more fully integrating bibliographic data (such as MARC records, terminologies, authority files, et al.), which currently exist as “data silos,” into the fabric of the World Wide Web. In particular, terminologies and authorities were seen as important resources that could be used in a variety of ways. From a design perspective, how do we move from “data silos” to “data services,” that increase the potential value of bibliographic data by treating them as interconnected resource collections, addressable via URIs and accessible over Web protocols? Organizationally, how might this goal be accomplished, supported, and maintained? Economically, what factors need to be considered?

We’ve been talking around this.

Much of the discourse until now has been of the nature, “Wouldn’t it be great if … LCC was available in an open web service … MARC went away … webify our infrastructure … get rid of catalogers … have better OPACS.”

So what makes a silo? I’m pretty sure I can speak for everyone and say we (libraries) want to be relevant in the information future. How have we backed ourselves into a corner? And what exactly is that corner?

I am kind of suspicious of talk about all silos being bad. I think there can be an argument made on the behalf of silos. Silos exist for a reason, the information needs of a community are different. Then again there is probably a stronger argument that our silos exist because of the way we have acquired, collected, organized, and developed resources with our vendors and within our libraries.

Who do I see as the custodian, maintainer, of the bibliographic future of web services, standards, and open data? So who are the players:

So … who has been actually doing anything?

The distinct impression I get with LC is that they are initiating this entire process because there is a crisis in scalability and more importantly severe budgetary pressures. Are they really going to take, and more imporantly be capable of making the next steps? Including drastic organization changes, significant changes in legislation to make things happen, and of course dealing everything that being a Federal agency entails?

OCLC? Is the cooperative really moving in that direction? I get a sense they are. Worldcat.org is supposed to open up more later in the year. But is this the right direction? We’re actually talking about a different beast, opening up and revealing a lot more of the plumbing for a whole host of applications that we can only just begin to imagine. I actually have heard very little comment from their representatives from the meeting summaries. In my mind OCLC is probably the place where the organization, support and funding will be headed. This is something that member libraries, need and appear to want.

ARL has a stake, but being the plumbers of our bibliographic future isn’t in their charge.

The search corporations? Are they thinking in a long enough horizon here. Google probably, I’m not really sure what Microsoft was bringing to the table (they are listed as having a representative).

Our library system vendors? I don’t think they are that interested in selling us a new way of doing things that may very well put them all out of business?

So … do we need a new organization to do this? This organization will just not have to be a big player in making standards happen. But would also have to create systems, get people to buy into them, provide services over a very long horizon, and be a good custodian of our cataloging as a “public good”. I don’t think we will have to reinvent the wheel, especially with things happening in Internet time, but it might be necessary.

One last thought, my hunch is that moving forward with the opening up of bibliographic data will cost a bunch, but … I think in the long run it will pay for itself in terms of improved retrieval, resource sharing, new opportunities created, and just plain coolness. The alternative is even more irrelevance of libraries and their staggering collections.

Of course … anyone have hard numbers, I would be curious to see some real calculations or studies. Heck, a methodology would be a start. I’m not sure what would be a comparable situation, or how you can calculate value for metadata, and the wealth it in turn provides access to, or creates in new knowledge.

Posted in Cataloging, Libraries | 1 Comment »

Facebook Platform

Posted by Sean Chen on May 25, 2007

So are libraries and higher ed going to jump on? Or is this more hype than substance?
What sort of services as (libraries and librarians) can we develop that would actually be useful in the social environment that is Facebook? What is it that we do that takes advantage of the environment and platform?

Well Fred Stutzman has some thoughts on it.

Posted in General | No Comments »

Legislative intent?

Posted by Sean Chen on May 18, 2007

Its pretty well known in law that Associate Justice Antonin Scalia has nothing but contempt of legislative history and Congressional intent when it creates laws. So a colleague’s friend just published a book with Hein looking at some of Scalia’s decisions when he was on the DC Court of Appeals. [edit: Ignore my previous bad summarization: The book is about the reaction of the Federal appeals courts to Justice Scalia's criticism of legislative intent and statutory interpretation]

What good is legislative history?

Anyways the comment my colleague made in passing:

(01:59:23 PM) colleague: sca|1a 1s 1n ur supr3m3 c0urt, k1||1ng ur c0ngr3ss10na| 1nt3nt

I’m pretty easily amused these days.

Posted in Law | 1 Comment »

Sign of the Apocalypse?

Posted by Sean Chen on May 4, 2007

It started when I went to go check an unusual seen from reference: “State of Rhode Island and Providence Plantations.”

I knew I’d see it eventually, but the first time seeing it was quite a shock.

A citation to Wikipedia in the National Authority File.

Rhode Island
Personally I am fine with it in this case, but for something more obscure … well I guess it depends upon how much you trust wikipedia.

Posted in General | No Comments »