Skip to content

Scripts to reconcile and serialize to MARC21 TMS metadata.

License

Notifications You must be signed in to change notification settings

BookOps-CAT/tms-migration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tms-migration

Scripts to reconcile and serialize to MARC21 TMS metadata.

TMS MSSQL Database connectivity

Use sqlalchemy with PyODBC driver to connect. Additional Microsoft ODBC Driver 13 for SQL Server is required. Unable to connect using pymssql driver.

Work notes

9/11/2024

Title types occurrences

  • NULL has 71 entries.
  • COLLECTION has 126702 entries.
  • PUBLISHED_PORTFOLIO has 8316 entries.
  • SERIES has 53259 entries.
  • BOOK_OR_ALBUM_TITLE has 32919 entries.
  • TITLE has 410055 entries.
  • SUBTITLE_OR_TITLE_TRANSLATION has 3386 entries.
  • PERIODICAL_OR_SERIAL_PUBLICATION has 1773 entries.
  • TITLE_CONTINUED has 614 entries.
  • DESCRIPTIVE_TITLE has 51589 entries.
  • PORTFOLIO_OR_SET has 7365 entries.
  • ALTERNATE_TITLE has 2950 entries.
  • CAPTION has 1526 entries.
  • FOLDER_OR_SUBSERIES has 49482 entries.
  • ADDITIONAL_AUTHORS has 2651 entries.
  • LOCATION_DEPICTED has 40867 entries.

New classes

  • ObjTitles tbl (Virginia suggested Descriptive)
  • TitleTypes tbl

9/10/2024 Connecting to original TMS DB

  • Successfully tested access to TMS DB using DBeaver
  • Failed to connect using sqlachemy and pymssql driver. Successful with PyODBC and installation of Microsoft ODBC Driver 13 for SQL Server. Oof!
  • plan to pull data directly from TMS DB and store manipulated data in SQLite

9/5/2024

Objects count by department:

  • 'Photography Collection' (1) has 275592 objects.
  • 'Print Collection' (2) has 60461 objects.
  • 'Spencer Collection' (3) has 5728 objects.
  • 'Other NYPL' (10) has 84 objects.
  • 'DL' (12) has 2155 objects.
  • 'Art & Architecture' (13) has 193 objects.
  • 'Paintings & Sculpture' (14) has 157 objects.
  • 'Ephemera' (16) has 92509 objects.
  • 'Picture Collection' (18) has 10009 objects.
  • 'DEACCESSIONED' (49) has 39 objects.
  • 'LPA DANCE' (50) has 76 objects.

8/22/2024 Missing tables in Tony's DB copy

  • AddressType, ConAddresses, ConEMail, ConPhones, EMailType tbls: consult with Wallach staff; export to a sheet?
  • AltNum tbl: use TableID & ID to figure out the relation to Objects or Constituents tbls
  • AuthorityAltNum tbl: DID NOT SEE IT AMONG EXPORTED TABLES! may include valuable data
  • Associations, AssocParents & Relationships tbls: may be useful in figuring out the relation between different objects
  • AuthorityAltNums tbl: links to authority records for constituents; have not see it!
  • Classification tbl: use to map bibliographic record type and Sierra material type
  • Conditions tbl: use to map item fixed fields and notes regarding poor condition
  • ConTypes tbl: needed for Constituents tbl to properly code in MARC (700 vs 710, for example) if no authority link is available; different than role!
  • DimensionElements & DimItemElemXrefs tbls: easier mapping than from Objects tbl where displayed valued are concatenated
  • Exhibitions, ExhibitionTitles, ExhObjXref tbls: Virginia suggested mapping to 585 MARC field; data needed
  • ObjGeography tbl: columns for different geographical names
  • RoleType vs RolesTypes = duplicate table? Ask Tony
  • FlabLables & StatusFlag tbls: map to item record coding?
  • Terms, TermTypes, TermMasterThes, ThesXrefs (includes relation to Object, Constituents), ThesXrefTypes, ThesaurusBases tbls: 6xx MARC fields
  • TextEntries, TextTypes tbls: needed for bib mapping, captions, etc.

About

Scripts to reconcile and serialize to MARC21 TMS metadata.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages