LIS 7410 - Digital Libraries
Spring 2011 -- Section 01
Homework Assignment 2 (Research Track)
Survey Paper
The survey paper examines a particular aspect of digital libraries or their applications. You will have to read at least four
papers of high quality and write a paper that not only summarizes the papers' contributions but also clearly differentiates
each paper's strengths and weaknesses. Do not forget to cite what you quote. This can be a separate exercise, but
you are highly recommended to incorporate this exercise into your term project so that you can save some time.
Your survey paper should be no longer than six single-column, single-spaced pages. The more concise you are at summarizing the
points, the more likely that you'll receive a higher grade for the class.
Once you've chosen an area for your survey paper, the instructor will help suggest two to three references that
you can start with. You can decide whether to accept the suggested papers and to supplement/replace
the papers to round out your survey.
The following is the past grading criteria. A smilar criteria will be applied to this semester's survey papers although
the points for each criteria may vary.
Total: [100 points]
- Objective Part [75 points]
- Knowledge of papers [50 points]
- Presentation [17 points]
- Organization of papers into hierarchy:
- Definition of terms on first use:
- Introduction and Conclusion (Encapsulation):
- Abstract and indexing helpfulness:
- Succinctness [5 points]
- Writing and citation style [3 points]
-Critical Part [25 points]
- Future trends [10 points]
- Critique of papers [15 points]
The following are a list of topics for the survey paper and some suggested readings for the survey.
If you have a topic of your own interest, please discuss it with the instructor.
Please note that as some of the topics below are very broad, you may have to choose only a subset of the suggested
readings to build your survey paper around. Many of the readings suggested here come from recent conferences,
so it will require you to read background work. Remember that the four paper requirements is a minimum;
you may have to read many more than four to get a coherent overview of the topic.
You are recommended to search papers from journals (such as JASIST, IP&M, IJDL), conferences (such as JCDL, ECDL, ICADL, ACM SIGIR, ACL,
WWW, ASIST), and digital libraries (such as ACM DL, IEEE CS DL).
Currently LSU does not have organizational access to the ACM Digital Library. If you need to download
a specific paper from the ACM DL, please let the instructor know.
Note that the references below simply serve indicative purposes; they do not share a consistent style and are not error-free,
but I expect you to implement a consistent style (such as the ACM style) in your survey report.
- Automated Collection Building
- JCDL 2002 Collection synthesis Donna Bergmark Pages: 253 - 262
- G. Pant, K. Tsioutsiouliklis, J. Johnson, C.L. Giles: Panorama: Extending Digital Libraries with Topical Crawlers.
Proc. ACM/IEEE Joint Conference on Digital Libraries (JCDL 2004).
- Standards used in the DL Metadata and Markup
There are too many metadata formats to overview successfully in a survey paper.
You should concentrate on one or two that have a similar purpose and pursue these in depth.
EAD:
- Christopher J. Prom, Thomas G. Habing, 2002. Using the Open Archives Initiative Protocols with EAD. JCDL 2002.
- EAD Development, http://www.loc.gov/ead/eaddev.html
Dublin Core:
- Jewel Ward, 2003. A Quantitative Analysis of Unqualified Dublin Core Metadata Element Set Usage
within Data Providers Registered with the Open Archives Initiative. JCDL 2003.
- The Dublin Core and Warwick Framework A Review of the Literature, March 1995-September 1997, D-Lib Magazine, January 1998.
- Bioinformation and Genomic Data in DLs
- Papers from BIOLINK 2004: Link.
- papers from ACL 2003 Workshop on Biomedicine: Link.
- Zoi Lacroix, Omar Boucelma, Mehdi Essid: The biological integration system. 45-49, WIDM 2003.
Link.
- S. B. Davidson and et al. Biokleisli: a digital library for biomedical researchers. Intnl. J. on
Digital Libraries, 1(1):36.
- Erjavec, T., Kim, J.D., Ohta, T., Tateisi, Y. & Tsujii, J., Encoding Biomedical Resources in TEI: The Case of the
GENIA Corpus, NLP in Biomedicine, ACL 2003 Workshop Program.
- Finding Gene Names Using FlyBase, PDF.
- Correction and Analysis of User Queries or Documents
Document correction:
- Bibliographic attribute extraction from erroneous references based on a statistical model Atsuhiro Takasu, JCDL 2003.
- Seung-Taek Park, David M. Pennock, C. Lee Giles, Robert Krovetz, Analysis of lexical signatures for finding lost or
related documents, SIGIR 2002.
- Digital Library Social Policy
Digital Divide:
- Hoffman. The Evolution of the Digital Divide: How Gaps in Internet Access May Impact Electronic Commerce.
Link.
- Bridging the Digital Divide: The Story of the Free Internet. PDF.
- Linda A. Jackson, Gretchen Barbatsis, Frank A. Biocca, Alexander von Eye, Yong Zhao, Hiram E. Fitzgerald.
Home Internet Use in Low-Income Families: Is Access Enough to Eliminate the Digital Divide?
In Media access: social and psychological dimensions of new technology use (edited by Erik P. Bucy, John E. Newhagen).
Information Ecology:
- Nardi, Bonnie A., 1999. Librarians: A Keystone Species. In Information Ecologies, MIT Press.
- Adams and Blanford. The developing roles of digital library intermediaries.
PDF.
Preservation:
- Sully, Sarah E., 1997. JSTOR: An IP Practitioner's Perspective. D-Lib Magazine, January, 1997.
Link.
- Michael A. Keller, Vicky Reich, and Andrew Herkovic, What is a library anymore anyway?, First Monday, 8(5), 2003.
Link.
- Vicky Reich & David S. H. Rosenthal, 2001. D-Lib Magazine, 7(6), June 2001.
Link.
- Fresko, M., 1995. Long Term Preservation of Electronic Materials. A Report of a Workshop Organised by
JISC/British Library, held at the University of Warwick on 27-28 November 1995. British Library R & D Report 6238.
Link.
Press Release on the Internet Archive
http://www.archive.org/about/press_release.php.
- Examples of Domain-Specific DLs
- Intelligent Agents in DLs
- Interoperability between DLs
- Metadata Extraction and Indexing
A study of manual methods:
- Catherine C. Marshall, Making metadata: a study of metadata creation for a mixed physical-digital collection,
Proceedings of the third ACM conference on Digital libraries, p.162-171, June 23-26, 1998, Pittsburgh,
Pennsylvania, United States. Link.
Text:
- Nina Wacholder, David K. Evans, Judith Klavans: Automatic identification and organization of index terms for
interactive browsing. JCDL 2001, 126-134.
- Hui Han, C. Lee Giles, Eren Manavoglu, Hongyuan Zha, Zhenyue Zhang, Edward A. Fox.
Automatic document metadata extraction using support vector machines
Images:
- Generating fuzzy semantic metadata describing spatial relations from images using the R-histogram.
International Conference on Digital Libraries archive 2004,
Proceedings of the 2004 joint ACM/IEEE conference on Digital libraries,
Link.
- Metadata Harvesting and Metasearching
- Mobile platform DL usability
- Catherine C. Marshall, Christine Ruotolo: Reading-in-the-small: a study of reading on small form factor devices. 56-64.
Electronic Edition (DOI: 10.1145/544220.544230)
- Seeing the Whole in Parts: Text Summarization for Web Browsing on Handheld Devices.
Link.
- Jones, M., Buchanan, G., Thimbleby, H., Sorting out Searching on Small Screen Devices,
Conference on Mobile HCI. Link.
- Multilingual Text Segmentation
- Unsupervised Learning of Arabic Stemming Using a Parallel Corpus Monica Rogati, Scott McCarley and Yiming Yang, ACL 2003
- Qiang Zhou, Local context templates for Chinese constituent boundary prediction. COLING 2000, 975-981.
Electronic Edition: PDF.
- Gary Kacmarcik, Chris Brockett, Hisami Suzuki: Robust Segmentation of Japanese Text into a Lattice for Parsing. COLING 2000.
390-396. PDF.
- Find more papers in ACL and Coling conferences.
- Music in DLs
Categorization:
- G. Tzanetakis and P. Cook, Musical genre classification of audio signals,
IEEE Transactions on Speech and Audio Processing, 10(5), 2002.
Querying:
- Fang-Fei Kuo, Man-Kwan Shan, 2004. Looking for new, not known music only: music retrieval by melody style.
International Conference on Digital Libraries, 2004, 243-251.
Indexing:
- Content-Based Indexing of Musical Scores. Link.
Also find papers in ISMIR, ACM MM.
- New Media for DL Blogging, IM, Wiki
Blogging:
- Diane Schiano, Bonnie Nardi, Michelle Gumbrecht and Luke Swartz, Blogging by the Rest of Us, CHI 2004.
- Natalie S. Glance, Matthew Hurst and Takashi Tomokiyo, Intelliseek Applied Research Center,
BlogPulse: Automated Trend Discovery for Weblogs.
- Ravi Kumar, Jasmine Novak, Prabhakar Raghavan and Andrew Tomkins, 2003. On the bursty evolution of blogspace
WWW, 2003.
Instant Messaging:
- Grinter, Rebecca and Leysia Palen, 2002. Instant Messaging in Teen Life. In
Proceedings of the 2002 ACM Conference on Computer Supported Cooperative WorkIssacs, E., Walendowski, A., Whittaker, S., Schiano, D. and Kamm, C., 2002.
The Character, Functions, and Styles of Instant Messaging in the Workplace. Proc. CSCW 2002.
ACM Press, 2002, 11-20.
- Patterns of use in the DL / Web
- Diane Kelly, Colleen Cool: The effects of topic familiarity on information search behavior. 74-75.
Electronic Edition (DOI: 10.1145/544220.544232), JCDL
- Sharing encountered information: digital libraries get a social life, JCDL.
Link.
Query Analysis:
- Steve Cronen-Townsend, Yun Zhou, W. Bruce Croft, Predicting query performance.
Proceedings of the 25th ACM SIGIR, Link.
- Daniel E. Rose, Danny Levinson, 2004. Understanding user goals in web search, WWW 2004, 13-19.
- Sally Jo Cunningham, Chris Knowles, Nina Reeves, 2001.
An ethnographic study of technical support workers: why we didn't build a tech support digital library.
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries, 189-198
- Phrasal Searching Techniques
- Einat Amitay, 1998. Using Common Hypertext Links to Identify the Best Phrasal Description of Target Web Documents.
SIGIR '98.
- Efficient phrase querying with an auxiliary index.
Proceedings of the 25th ACM SIGIR, Link.
- David D. Lewis, An evaluation of phrasal and clustered representations on a text categorization task.
Link.
- Question answering systems
- A great list of reads on Bos and Webber's reading group
(you have it a bit easier as someone has done some selection for you):
Link.
Also find papers in ACM SIGIR
- Lynette Hirschman and Rob Gaizauskas, 2001. Natural Language Question Answering: The View from Here.
Natural Language Engineering, 7.
- Harabagiu Moldovan, et al., 2001. FALCON: Boosting Knowledge for Answer Engines. In Proceedings of The Ninth
Text REtrieval Conference (TREC 9).
-
- Hui Yang, Tat-Seng Chua, Shuguang Wang, Chun-Keat Koh, 2003. Structured use of external knowledge for
event-based open domain question answering, Proceedings of 23rd ACM SIGIR, 33-40.
- Recommender Systems
- Also find papers in SIGIR, WWW, Machine Learning
- Rong Jin, Joyce Y. Chai, Luo Si, 2007. Content-based filtering & collaborative filtering:
An automatic weighting scheme for collaborative filtering, Proceedings of the 27th ACM SIGIR, July 2004.
- Shyong K. Lam, John Riedl, Shilling recommender systems for fun and profit.
Proceedings of the 13th international conference on World Wide Web Reputation networks.
- Speech in DLs
- Spatial and Geographic data in DLs
- Temporal data in DLs
- Tools to build a DL
- Text classification for DLs
- Paul N. Bennett, 2003. Using asymmetric distributions to improve text classifier probability estimates.
SIGIR 2003. 111 - 118
- Lijuan Cai, Thomas Hofmann, 2003. 182-189. Text categorization by boosting automatically extracted concepts. SIGIR 2003.
- Dou Shen, Zheng Chen, Qiang Yang, Hua-Jun Zeng, Benyu Zhang, Yuchang Lu, Wei-Ying Ma, 2004.
Web-page classification through summarization. SIGIR 2004. 242-249.
- Yong-Bae Lee, Sung Hyon Myaeng, 2002. Text genre classification with genre-revealing and subject-revealing features. 145-150.
- User interfaces in DLs
Others from Visualization Workshop in JCDL 2001
- IdeaKeeper notepads: scaffolding digital library information analysis in online inquiry Conference on Human Factors in
Computing Systems Extended abstracts of the 2004 conference on Human factors and computing systems.
Link.
Visualization of Scientific Research:
- Katy Borner and Shashikant Penumarthy. 2003. Social Diffusion Patterns in Three-Dimensional Virtual Worlds.
Information Visualization, 2(3), pp. 182-198, 2003.
Multimodal, media specific:
- E.-P. Lim, D. H.-L. Goh, Z. Liu, W.-K. Ng, C. S.-G. Khoo, S. E. Higgins, 2002.
G-Portal: A Map-based Digital Library for Distributed Geospatial and Georeferenced Resources, JCDL 2002.
- Mor Naaman, Yee Jiun Song, Andreas Paepcke, and Hector Garcia-Molina, 2004. Automatic organization for digital
photographs with geographic coordinates. JCDL 2004.
- David A. Smith. Detecting and Browsing Events in Unstructured text, 73-80.
Reading:
- Yi-Chun Chu, David Bainbridge, Matt Jones, and Ian H. Witten Realistic books: A bizarre homage to an obsolete medium?. JCDL 2004.
General, Legacy:
- B. Shneiderman, D. Feldman, A. Rose, and X.F. Grau, 1999. Visualizing Digital Library Search Results with Categorical and
Hierarchical Axes. Proceedings of 5th ACM Digital Library Conference, 1999, ACM, pp. 57-65.
- Video in DLs
- Alan F. Smeaton, 2001. Indexing, browsing, and searching of digital video and digtial audio information,
Lectures on information retrieval, Springer-Verlag New York, Inc., New York, NY, 2001
- Mounia Lalmas, 2002. Video retrieval using an MPEG-7 based inference network Andrew Graves,
Proceedings of the 25th ACM SIGIR, 339-346. Link.
- Susan Gauch, Wei Li and John Gauch, 1997. The VISION Digital Video Library,
Information Processing & Management, 33(4), April 1997, pp. 413-426.
Acknowledgement to Min-Yen Kan.
Back to main page
Yejun Wu