I wanted to give users a way to search a limited-access archival digital collection of over 36,000 records. The collection contained metadata, and approximately 60% of it also had machine-generated OCR data.
To solve this, I decided to batch load the "searchable" metadata and OCR data into a three-field MySQL database and make it searchable via a PHP script. The database contained an index field, a title field and a search field. The index and title fields are largely self-explanatory; the index field holds a value from which a URL to the digital object concerned can be generated. The search field, however, contained a concatenation of all the relevant searchable metadata fields, with the machine-generated OCR data appended if it was available.
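The layout described above can be sketched roughly as follows. This is only an illustration, not the script I actually used: the table name `records`, the column names `record_index`, `title` and `search_text`, the column types, and the helper `build_search_text()` are all assumptions made up for this example. It also assumes PHP 7.1 or later.

```php
<?php
// Hypothetical schema for the three-field layout described in the post.
// Table name, column names and column types are assumptions.
const CREATE_TABLE_SQL = '
CREATE TABLE records (
    record_index VARCHAR(64)  NOT NULL PRIMARY KEY,
    title        VARCHAR(255) NOT NULL,
    search_text  MEDIUMTEXT   NOT NULL
)';

/**
 * Build the value for the search field: all searchable metadata values
 * joined with spaces, followed by the OCR text when there is any.
 * (Hypothetical helper, named for this example only.)
 */
function build_search_text(array $metadataFields, ?string $ocrText = null): string
{
    $parts = [];
    foreach ($metadataFields as $value) {
        $value = trim((string) $value);
        if ($value !== '') {
            $parts[] = $value; // skip empty metadata fields
        }
    }

    $ocrText = trim((string) $ocrText);
    if ($ocrText !== '') {
        $parts[] = $ocrText; // OCR exists for only part of the collection
    }

    return implode(' ', $parts);
}
```

For a record without OCR data, the second argument is simply left out and the search field contains the metadata alone.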
I then wrote a short PHP script to search the search field and return the results. I was surprised at how efficiently this process worked for our needs, and decided that the basic code to search a single field of a MySQL database might be of value to others! Copy the code for achieving this here. If you save this file as search.php and then edit it in the appropriately commented locations, you should get it working quite quickly. (This code was working on my test server before I made it available.)
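For readers who just want the general shape of such a search, here is a minimal sketch. It is not the search.php file itself. It assumes a PDO connection, a substring match with `LIKE`, and the hypothetical table `records` with columns `record_index`, `title` and `search_text`; the function names `like_pattern()` and `search_records()` are also invented for this example.

```php
<?php
/**
 * Turn a user's search term into a LIKE pattern that matches it anywhere
 * in the field. Backslash, % and _ are escaped so they are treated as
 * literal characters (backslash is MySQL's default LIKE escape character).
 */
function like_pattern(string $term): string
{
    $escaped = strtr($term, ['\\' => '\\\\', '%' => '\\%', '_' => '\\_']);
    return '%' . $escaped . '%';
}

/**
 * Search the single search field and return the matching rows.
 * There is no relevance ranking: rows are ordered by the search field.
 * Table and column names are assumptions.
 */
function search_records(PDO $pdo, string $term): array
{
    $sql = 'SELECT record_index, title
              FROM records
             WHERE search_text LIKE :pattern
             ORDER BY search_text';

    // A prepared statement keeps the user's input out of the SQL text.
    $stmt = $pdo->prepare($sql);
    $stmt->execute([':pattern' => like_pattern($term)]);

    return $stmt->fetchAll(PDO::FETCH_ASSOC);
}
```

Each returned row carries the index value, from which the URL to the digital object can be built, and the title to display. When printing either into a page, pass it through `htmlspecialchars()` first.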
My PHP code makes no attempt to evaluate the relevance of the results; they are simply displayed sorted on the search field. I have since successfully used this PHP search script as the basis of more complex search scripts on other projects.
Saturday, October 17, 2009
3 comments:
Is there a publicly available link?
How's the library going these days?
how do u do?