Papers
Note: the following list does not necessarily reflect the state of the art of the corresponding research
areas. It is mainly intended for two purposes: (1) suggested reading for a database course, and (2) reminding
records for myself.
Advanced Data Models
Maybe we need a separate page for such recommended readings.
- XML
- Essential XML Quick Reference by Skonnard and Gudgin, Addison-Wesley 1999.
- XML Tutorial
- Schema Languages, chapter 4 in An Introduction to XML and Web Technologies by Anders Møller and Michael I. Schwartzbach
Addison-Wesley, January 2006.
- XQuery: An XML Query Language by Chamberlin in IBM Systems Journal 41(4), 2002.
- XML: Model, Schemas, Types, Logics, and Queries by Klarlund et al. In "Logics for Emerging Applications of Databases", 2003, Springer.
- Data on the Web by Abiteboul et al., Morgan Kaufmann, 2000.
- XPath Leashed by Michael Benedikt and Christoph Koch, to appear in
ACM Computing survey.
- Conjunctive Queries over Trees by Gottlob,
Koch and Schulz, in Journal of ACM 53(2), March 2006.
- XPath Query Containment by Thomas Schwentick in ACM SIGMOD RECORD 33(1), 2004.
- XML Search: Languages, INEX and Scoring by Amer-Yahia and Lalmas
in SIGMOD Record 35(4), Dec 2006.
- Indexing XML Data Stored in a Relational Database by Pal et al. in VLDB 2004.
- Data Management for XML by Jennifer Widom.
- Reading List by Dan Suciu
- Query Optimization for XML by McHugh and Widom in VLDB'99.
- Mixed Mode XML Query Processing by Halverson et al. in VLDB 2003.
- Native XML Support in DB2 Universal Database by Nicola and van der Linden in VLDB 2005.
- XML Query Processing: Storage and Query Model Interplay by Ioana Manolescu, a tutorial given at EDBT Summer School 2004.
- Efficiently Querying Large XML Data Repositories: A Survey by Gou and Chirkova in TKDE 19(10), 2007.
- Ontology
- The Semantic Web by Tim Berners-Lee, James Hendler and Ora Lassila, Scientific American Magazine, May 2001.
- The Semantic Web : The Roles of XML and RDF by Decker et al. in IEEE Internet Computing, Sep-Oct, 2000.
- Framework for The Semantic Web: An RDF Tutorial by Decker et al. in IEEE Internet Computing, Nov-Dec, 2000.
- RDF Primer by Manola, F., Miller, E. (eds.)
- Web Ontology Language: OWL by Grigoris Antoniou and Frank van Harmelen in Handbook on Ontologies, Springer, 2004.
- Semantic Web Tutorials
- A Semantic Web Primer by Antoniou and Van Harmelen, MIT Press, 2004.
- Annotea: An Open RDF Infrastructure for Shared Web Annotations by Kahan et al. in WWW10, 2001.
- Simile project at MIT, exploiting semantic web inside web browser.
- Jena, an open source project.
- Protege, an academic ontology project at Stanford.
- SPARQL, a W3C query language for RDF.
- Scalable Semantic Web Data Management Using Vertical Partitioning by Abadi et al. in VLDB 2007.
-
State-of-the-art Survey from TONES by Cali, Calvanese, and et al., 2005.
- Publications of Noy.
- Description Logic
- Introduction to Description Logic,
a course by Enrico Franconi, Faculty of Computer Science, Free University of Bolzano, Italy
-
Basic Description Logic by Baader and Nutt in Description Logic Handbook, Baader et al.(Ed.), Cambridge University Press, 2002.
-
Description Logics by Baader, Horrocks, and Sattler in Frank van Harmelen, Vladimir Lifschitz, and Bruce Porter (ed.), Handbook of Knowledge Representation. Elsevier, 2007.
-
Description Logic by Baader and Lutz, in Patrick Blackburn, Johan van Benthem, and Frank Wolter (Ed.), The Handbook of Modal Logic, pages 757–820. Elsevier, 2006.
- UML, or OR Mapping?
Implementation of Databases
- Overview
- Query Optimization
- Query Evaluation
- Query Evaluation Techniques for Large Databases
by Goetz Graefe in ACM Computing Surveys, vol. 25, No. 2, 1993.
- Join Processing in Relational Databases
by P. Mishra and M. Eich in ACM Computing Surveys, vol. 24, No. 1, 1992.
- Implementing Sorting in Database Systems
by Goetz Graefe in ACM Computing Surveys, vol. 38, No. 3, 2006.
- Join Processing in Database Systems with Large Memories
by L. Shapiro in ACM TODS 11(3), 1986.
- An Amateur's Introduction to Recursive Query Processing Strategies
by François Bancilhon and Raghu Ramakrishnan in ACM SIGMOD 1986.
- Concurrency Control & Recovery
- The Transaction Concept: Virtues and Limitations
by Jim Gray, VLDB'81.
- Transaction Processing
by A. Thomasian, in Computer Science Handbook, 2nd Ed., 1997.
- Principles of Transaction-Oriented Database Recovery
by T. Haerder and A. Reuter, ACM Computing Surveys 15(4), 1983.
- The Notions of Consistency and Predicate Locks in a Database System
by K. Eswaran, J. Gray, R. Lorie, and I. Traiger in CACM 19(11), 1976.
- Granularity of Locks and Degrees of Consistency in a Shared Data Base
by Gray et al. in IFIP Working Conference on Modeling of Data Base Management System, 1979.
- On Optimistic Methods for Concurrency Control
by Kung and Robinson in ACM TODS 6(2), 1981.
- Efficient Locking for Concurrent Operations on B-Trees
by Lehman and Yao in ACM TODS 6(4).
- Concurrency Control and Recovery
by Michael Franklin, Computer Science Handbook 2nd Ed., 1997.
- Concurrency Control: Methods, Performance, and Analysis
by A. Thomasian, ACM Computing Surveys 30(1), 1998.
- Concurrency Control in Advanced Database Applications
by N. Barghouti and G. Kaiser, ACM Computing Surveys 23(3), 1991.
- ARIES/NT: A Recovery Method Based on Write-Ahead Logging for Nested Transactions
by Kurt Rothermel and C. Mohan in VLDB'89. See also Repeating History Beyond ARIES
by C. Mohan in VLDB'99.
- ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial Rollbacks Using Write-Ahead Logging
by Mohan et al. in TODS'92.
- ARIES/IM: An Efficient and High Concurrency Index Management Method Using Write-Ahead Logging
by Mohan and Levine in SIGMOD'92.
Research Areas
- Database Theory
- Elements of Relational Database Theory by P. Kanellakis in J. van Leeuwen ed. Handbook of Theoretical Computer Science, Vol. B, MIT Press, 1991.
- Constraint Satisfaction and Database Theory: a Tutorial by M. Vardi
in PODS 2000. See also Database Querying and Constraint Programming by P. Kanellakis
in SIGACT News 25, 4 (Dec. 1994), 22-87. And Constraint programming and database languages: a tutorial
by P. Kanellakis in PODS'95.
- Conjunctive Query Containment and Constraint Satisfaction by Kolaitis and Vardi in PODS'98.
See also the extended version in Journal of Computer System Science 61(2): 302-332 (2000).
-
Database and Finite Model Theory by Victor Vianu in Immerman and Kolaitis(Eds.): Descriptive Complexity and Finite Models, 1996. See also
Elements of Finite Model Theory by Libkin, Springer 2004. And Finite model theory and its applications by
Grädel, Kolaitis, Libkin, Marx, Spencer, Vardi, Venema, and Weinstein, Springer 2007.
- Fundamentals of Dependency Theory by Vardi in In: Trends in Theoretical Computer Science (E. Borger, ed.), pp. 171-224. Computer Science Press 1988.
See also The Theory of Data Dependencies: a Survey by Fagin and Vardi in Anshel and Gewirtz(Eds.) Mathematics of Information Processing, Symp. in Appl. Math, vol 34, 1986.
- Description logics for databases by Borgida, Lenzerini, and Rosati. In Baader et al.(Eds.): Description Logic Handbook: theory, Implementation, and Applications, Cambridge University Press 2003.
- Foundations of Databases by Abiteboul et al. Addison Wesley, 1994.
- Principles of Database and Knowledge-base Systems by J. Ullman, Vol 1-2, Computer Science Press, 1988-1989.
- Optimal Implementation of Conjunctive Queries in Relational Data Bases
by A. Chandra and P. Merlin in ACM Symposium on Theory of Computing, 1977.
- Equivalences among relational expressions by Aho and Ullman in SIAM J Comput 8, 2 (1979), 218-246
-
The Theory of Relational Databases by D. Maier,
Computer Science Press, 1983.
- Relational Database Theory by Atzeni and De Antonellis, the Benjamin/Cummings Publishing Company, 1993.
- Computable Queries for Relational Data Bases by A.K. Chandra and D. Harel in Journal of Computer and System Science 21, pp. 156-178, 1980.
- Relational Queries Computable in Polynomial Time (extended abstract)
by Neil Immerman in ACM Symposium on Theory of Computing, 1982. See also the full version on the author's homepage.
- The complexity of relational query languages (Extended Abstract)
by Vardi in ACM Symposium on Theory of Computing, 1982.
- Perspectives on database theory by M. Yannakakis in SIGACT News 27, 3 (Sep. 1996), 25-49.
- Database Principles Column in SIGMOD
Record
- PODS and ICDT.
- Logic for Database and Information Systems edited by Chomicki and Saake. Kluwer, 1999.
- Logic for Emerging Applications of Databases edited by Chomicki, van der Meyden, and Saake. Springer, 2003.
- On Efficient Reasoning with Implication Constraints by Zhang and Ozsoyoglu in DOOD'93.
See also Implication and Referential Constraints: A New Formal Reasoning in IEEE TOKDE 9(6), 1997. And the PhD dissertation of Xubo Zhang at Case Western Reserve Univ., 1994.
- Database States and Their Tableaux by Mendelzon in TODS 9(2) 1984.
- An amateur's introduction to recursive query processing strategies by Bancilhon and Ramakrishnan in SIGMOD'86.
- The Chase Revisited by Deutsch, Nash, and Remmel, in PODS'08.
- Data Integration
- Data Integration: A Theoretical Perspective
by M. Lenzerini in PODS'02.
- Answering Queries Using Views
by Levy, Mendelzon and Sagiv in PODS'95.
- Answering Queries Using Views: A Survey
by A. Halevy in VLDB Journal 10(4), 2001. See also another paper dedicated to data integration
Logic-based techniques in data integration by Levy in Minker (ed.) Logic-based artificial
intelligence, 2000.
- Information Integration Using Logical Views
by Jeffrey D. Ullman in ICDT 1997: 19-40. See also the extended
version in Theoretical computer science 239 (2000) and Tsimmis Project Publications.
- Answering Recursive Queries Using Views by Duschka and Genesereth in PODS'97. See also Recursive Query Plans for Data Integration by Duschka, Genesereth, and Levy in
Journal of Logic Programming 43(1), 2000.
- Answering Queries Using Templates With Binding Patterns by Rajaraman, Sagiv and Ullman in PODS'05.
- Mediators in the Architecture of Future Information Systems
by G. Wiederhold in IEEE Computer 25(3), 1992.
- Querying Heterogeneous Information Sources Using Source Descriptions
by A. Levy and A. Rajaraman and J. Ordille in VLDB'96. See also Data Integration: The Teenage Years in VLDB'06.
- Query Containment for Data Integration Systems by Millstein, Levy and Friedman in PODS 2000. See also the extended version in Journal of Computer and System Science 66 (2003), 20-39.
- Complexity of Answering Queries Using Materialized Views
by Abiteboul et al. in PODS'98.
- Navigational plans for data integration by M. Friedman et al. in AAAI'99.
- MiniCon: A scalable algorithm for answering queries using views by R. Pottinger and A. Halevy in VLDB Journal 10(2-3), 2001.
- A Logic-Based Approach to Data Integration
by J. Grant and J. Minker in Theory and Practice of Logic Programming 2(3), 2002.
- View-based Query Processing: On the Relationship between
Rewriting, Answering and Losslessness
by D. Calvanese et al. in Theoretical Computer Science 371(3), 2007.
- Model Management
(To be specified)
- Overview
- Schema Matching and Mapping Generation
- Mapping Composition
- Model & Data Translation
- Schema Integration
- Inclusion and Equivalence Between Relational Database Schemata by Atzeni et al. in Theoretical Computer Science 19(1982).
- Relative Information Capacity of Simple Relational Database Schemata by Hull in PODS'84.
- Equivalence of Relational Database Schemes by Beeri et al. in STOC 1979: 319-329.
- Equivalence of Keyed Relational Schemas by Conjunctive Queries by Albert, Ioannidis, and Ramakrishnan in Journal of Computer and System Science 58,1999.
- The Use of Information Capacity in Schema Integration and Translation by Miller et al. in VLDB'93.
See also Schema Equivalence in Heterogeneous Systems: Bridging Theory and Practice by Miller et al. in Information
Systems, 19(1) 1994.
- Theoretical Aspects of Schema Merging by Buneman et al. in EDBT'92.
- Model Independent Assertions for Integration of Heterogeneous Schemas by Spaccapietra et al. in VLDB Journal 1(1992).
See also Issues and Approaches in Database Integration by Parent and Spaccapietra in CACM, 1998.
- A Comparative Analysis of Methodologies for Database Schema Integration by Batini et al. in ACM Computing Survey 18(4), 1986.
- A Formal View Integration Method by Biskup and Convent in SIGMOD 1986.
- Towards A Sound View Integration Methodology by Casanova and Vidal in PODS'83.
- Schema Merging and Mapping Creation for Relational Sources by Pottinger and Bernstein in EDBT'08. See also
Processing Queries and Merging Schemas in Support of Data Integration by Pottinger, PhD dissertation, Univ. Washington 2002.
- Interactive Generation of Integrated Schemas by Chiticariu et al. in SIGMOD'08.
- Bootstrapping pay-as-you-go data integration systems by Sarma et al. in SIGMOD'08.
-
On the Logical Foundation of Schema Integration and Evolution in Heterogeneous Database Systems by Lakshmanan et al. in DOOD'93.
- Representing and Using Interschema Knowledge in Cooperative Information Systems by Catarci and Lenzerini in
Journal of Intelligent and Cooperative Information Systems, 2(4):375--398.
- A Logical Basis for Schema Integration by Johannesson in RIDE-IMS 1993: 86-95.
- A Formalism of Semantic Schema Integration by McBrien and Poulovassilis in Information Systems 23(5), 1998.
- SPHINX: Schema Integration by Example by Barbancon and Miranker in J. Intelligent Information System, 29(2007).
- Relational Languages for Metadata Integration by Wyss and Robertson in TODS 30(2), 2005.
- Semantics of Database Transformations by Davidson et al. In Selected Papers From A Workshop on Semantics in Databases L. Libkin and B. Thalheim, Eds. LNCS, vol. 1358. Springer-Verlag, London, 55-91.
- Integrity constraint integration in heterogeneous databases: An enhanced methodology for schema integration by Ramesh and Ram in Information Systems 22(8), 1997.
- Dataspaces (to be specified)
- Overview
- Pay-as-you-go integration
- Structured Query Over Text
- Structured Querying of Web Text by Cafarella et al. in CIDR'07.
See also a shorter version Structured Queries over Web Text by Cafarella et al. in IEEE Data Engineering Bulletin 29(4), 2006.
- Answering Structured Queries on Unstructured Data by Liu, Dong,and Halevy. in WebDB'06.
- Integration of Heterogeneous Databases without Common Domains Using Queries Based on Textual Similarity by William Cohen in SIGMOD'98.
- Approximate String Joins in a Database (Almost) for Free by Gravano et al, VLDB 2001.
- A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data by Yu et al. in VLDB'07.
- Bringing Precision to Desktop Search: A Predicate-based Desktop Search Architecture by Dittrich et al. Tech. Report, ETH.
- Crossing the Structure Chasm by Halevy et al. in CIDR'03.
- SQL queries over unstructured data by Jain et al. in ICDE'07.
- A system for integrating unstructured data into relational databases by Mansuri and Sarawagi in ICDE'06.
- Optimizing SQL Queries over Text Databasesby Jain et al. in ICDE'08.
- Uncovering the Relational Web by Cafarella et al. in WebDB'08.
- Lightweight Database Wrapper for Unstructured Data by Muezzinoglu and Badia in WebDB'08.
- Cimple Project at Wisconsin
- Avatar Project at IBM Almaden
- Information Extraction
- Keyword over all
- DBXplorer: A System for Keyword-Based Search over Relational Databases by Agrawal, Chaudhuri, and Das, in ICDE'02.
- Keyword Searching and Browsing in Databases using BANKS by Bhalotia et al. in ICDE'02.
- DISCOVER: Keyword Search in Relational Databases by Hristidis and Papakonstantinou in VLDB'02.
- Efficient IR-style Keyword Search over Relational Databases by Hristidis et al. in VLDB'03.
- Indexing Dataspaces by Dong and Halevy in SIGMOD'07.
- Précis: from unstructured keywords as queries to structured databases as answers
by Simitsis et al. in VLDB J. 17(1), 2008.
- SPARK: Top- Keyword Query in Relational Databases by Luo et al. in SIGMOD'07.
- Efficient Keyword Search Across Heterogeneous Relational Databases by Sayyadian et al. in ICDE'07.
- Personal Information System
- SEMEX at U. Washington.
- iMeMex at ETH Zurich.
- Reference Reconciliation in Complex Information Spaces by Dong, Halevy, and Madhavan in SIGMOD'05.
- Lineage, Uncertainty, and Inconsistency
- Data Lineage
or Data Provenance
- Practical lineage tracing in data warehouses by Cui and Widom in ICDE'00.
- Tracing the lineage of view data in a warehousing environment by Cui and Widom in TODS 25(2),
2000.
- Lineage Tracing for General Data Warehouse Tranformations by Cui and Widom in VLDB 2001.
- Why and Where: A Characterization of Data Provenance. by Buneman et al. in ICDT 2001
- Provenance in Databases by Buneman and Tan, a tutorial
at SIGMOD'07
- Research Problems in Data Provenance by Wang-Chiew Tan in IEEE Data Engineering Bulletin, vol 27, no. 4, pages 45-52, 2004
- Data Provenance: Some Basic Issues by Buneman et al. in Foundations of Software Technology and Theoretical Computer Science (2000)
- An annotation management system for relational databases by
Bhagwat et al. in VLDB'04.
- A Survey of Data Provenance Techniques by Simmhan
et al. , technical report of Indiana University.
- Overview of record linkage and current research directions by Winkler, Technical Report RRS 2006/02, US Bureau of the Census.
- Incomplete Information
and Uncertainty
- Incomplete Information in Relational Databases by Imielinski and Lipski in Journal of ACM 31(4), 1984.
- Logical Approach to Incomplete Information: a survey by Van der Meyden in Logics For Databases and Information Systems, Kluwer, 1998.
- Current Approaches to Handling Imperfect Information in Data and Knowledge Bases by Parsons in IEEE TKDE 3(3), 1996.
- Query Answering Techniques on Uncertain/Probabilistic Data by Jian Pei et al. A tutorial in SIGMOD'08.
- 10106 Worlds and Beyond: Efficient
Representation and Processing of Incomplete Information. by Antova et al. in ICDE'07.
606-615
- Uncertainty in Databases and Knowledge Bases, Part V of Advanced Database Systems by Zaniolo, Morgan Kaufmann, 1997.
- Working Model for Uncertain Data by Das Sarma et al. in
ICDE'06.
- Dependency Satisfaction in Databases with Incomplete
Information by Grahne in VLDB'84.
- Incomplete Information, chapter 19 of Foundations of Databases by Abiteboul et al., Addison-Wesley, 1995.
- Incomplete Information in the Relational Data Model, chapter 5 of A Guided Tour of Relational Databases and Beyond by Levene and Loizou, Springer, 1999.
- Models for incomplete and probabilistic information by Green and Talnen in EDBT Workshops 2006.
- The Problem of Incomplete Information in Relational
Databases. By Gösta Grahne in
Lecture Notes in Computer Science, vol. 554. Springer-Verlag,
Berlin 1991.
- Uncertainty Management in Information Systems by Motro and Smets (Ed.), Kluwer 1996.
- Probabilistic Data
- IEEE Data Engineering Bulletin, Special Issue on Probabilistic Data Management, 29(1), 2006
-
Probabilistic Databases, invited talk by Suciu at SIGMOD'05.
See also Management of probabilistic data: foundations and challenges by Dalvi and Suciu in PODS'07.
- Models for Incomplete and Probabilistic Information by Green and Tannen in IEEE Data Eng. Bull. 29(1): 17-24 (2006).
- A Probabilistic Relational Model and Algebra by Dey and Sarkar in TODS 21(3) 1996.
- A Probabilistic Relational Algebra for the Integration of Information Retrieval and Database Systems by Fuhr and Roellke in TOIS 15(1) 1997.
- Efficient Query Evaluation on Probabilistic Databases by Dalvi and Suciu
in VLDB'04. See also the journal version in VLDB J. 16(4), 2007.
- ProbView: A Flexible Probabilistic Database System by Lakshmanan et al. in TODS, 1997.
- Probabilistic frame-based systems by Koller and Preffer in AAAI, 1998.
- The management of probabilistic Data by Barbara et al. in
IEEE ToKDE'92.
- Prl: a logical approach to probabilistic relational models
by Getoor and Grant in Machine Learning Journal 62, 2006.
- MayBMS at Cornell
- Probabilistic Databases Project at UW
- Trio System at Standford
- Databases with Uncertainty and Lineage by Benjelloun et al. in VLDB J. 17, 2008.
- Trio: A System for Integrated Management of Data, Accuracy, and Lineage by Jennifer Widom in CIDR'05.
- Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS by Mutsuzaki et al. in CIDR'07.
- ULDBs: Databases with Uncertainty and Lineage by Benjelloun et al. in VLDB'06.
- Inconsistency
- Reconciling while tolerating disagreement in collaborative data sharing by Taylor and Ives in SIGMOD'06.
- Inconsistency Tolerance in P2P Data Integration: An Epistemic Logic Approach by Calvanese et al. in DBPL 2005.
- Clean answers over dirty databases: A probabilistic approach by Andritsos, Fuxman, and Miller in ICDE'06.
- Dependencies Revisited for Improving Data Quality by Wenfei Fan in PODS'08.
- Inconsistency Tolerance by Bertossi et al. (Ed.), Springer, 2005.
- ConQuer Project at U. of Toronto
- Integrity Enforcement (Repairing)
- Deriving production rules for constraint maintenance by Ceri and Widom in VLDB'90. See
also Automatic generation of production rules for integrity maintenance by Ceri et al. in TODS 19(3), 1994.
- A cost-based model and effective heuristic for repairing constraints by value modification by Bohannon et al. in SIGMOD'05.
- Preferred repairs for inconsistent databases by Greco et al. in IDEAS'03.
- Efficient Approximation Algorithms for Repairing Inconsistent Databasesby Andrei Lopatenko and Loreto Bravo in ICDE'07.
- Consistent Query Answering
(CQA)
- Consistent query answering in inconsistent databases by Arenas et al. in PODS'99.
- Query answering in inconsistent databases by Bertossi and Chomicki in Logics for Emerging Applications of Databases, 2003.
- Consistent Query Answering in databases by Bertossi in SIGMOD Record 35(2), 2006.
- Consistent Query Answering: Five Easy Pieces by Chomicki
in ICDT 2007.
- Consistent Query Answers in Virtual Data Integration Systems by Bertossi and Bravo in Inconsistency Tolerance 2005.
- Conquer: efficient management of inconsistent databases
by Fuxman et al. in SIGMOD'05.
- Scalar aggregation in inconsistent databases by Arenas et al. in Theoretical Computer Science 296(3), 2003.
- Repair localization for query answering from inconsistent databases by Eiter et al. in ACM TODS 33(2), 2008.
- Preference-driven querying of inconsistent relational databases by Staworko et al. in EDBT Workshop, 2006.
- A Logic programming approach to the integration, repairing and querying of inconsistent databases by Greco et al. in International Conference on Logic Programming, 2001.
- OLAP over uncertain and imprecise data by Burdick et al. in VLDB J. 16(1), 2007.
- View Update
- Updating relational views by Furtado and Casanova in Query Processing in Database Systems by Kim et al.(Ed.), Springer 1985.
- Update semantics of relational views by Bancilhon and Spy in TODS 6(4), 1981. See also Keller's comment in TODS 12(3) 1987 and The impact of the constant complement approach towards view updating by Lechtenbörger in PODS'03.
- Using constraint satisfaction for view update by Hua Shu in J. Intelligent Information Systems, 15(2) 2000
- Updating Relational Databases through weak instance interfaces by Atzeni and Torlone in TODS 17(4) 1992.
- On the computation of relational view complements by Lechtenbörger and Vossen in PODS'02.
- The impact of the constant complement approach towards view updating by Lechtenbörger in PODS'03.
- On propagation of deletions and annotations through views by Buneman et al. in PODS'02.
- Updates of relational views by Cosmadakis and Papadimitriou in J. ACM 31(4), 1984.
- On complementary and independent mappings on databases by Keller and Ullman in SIGMOD 1984.
- Run-time translation of view tuple deletions using data lineage by Cui and Widom, Tech. Report Stanford Univ. 2001.
- On the updatability of relational views by Dayal and Bernstein in VLDB'78.
- Properties and update semantics of consistent views by Gottlob in TODS 13(4), 1988.
- Update and retrieval in a relational database through a universal schema interface by Brosda and Vossen in TODS 13(4) 1988.
- PATAXÓ: A framework to allow updates through XML views by Braganholo et al. in ACM Transactions on Database Systems (TODS), v.31 n.3, p.839-886, September 2006
- View updates in relational databases with an independent scheme by Langerak in TODS 15(1) 1990.
- Updates through views: A new hope by Kotidis et al. in ICDE'06.
- Relational lenses: a language for updatable views by Bohannon et al. in PODS'06.
- Updating knowledge bases while maintaining their consistency by Teniente and Olive in VLDB J. 4(2), 1995. See also Consistency preserving updates in deductive databases by Mayol and Teniente in Data & Knowledge Engineering 47, 2003.
- Compiling Mappings to Bridge Applications and Databases
by Melnik et al. in SIGMOD'07.
- Top-k
- Combining Fuzzy Information: An Overview by Fagin in SIGMOD Record 31(2), 2002
-
A Survey of Top-k Query Processing Techniques in Relational Database
Systems by Ihab F. Ilyas, George Beskales, and Mohamed A. Soliman
To appear in the ACM Computing Surveys, 2008
- Top-k selection queries over relational databases: mapping strategies and performance evaluation by Bruno et al. in TODS, 2002.
- Combining Fuzzy Information from Multiple Systems by Fagin in PODS'96.
- Optimal Aggregation Algorithms for Middleware by Fagin et al. in PODS'01. See also the journal version in J. Comput. Syst. Sci. 66(4): 614-656 (2003).
- Top-k Query Evaluation with Probabilistic Guarantees by Theobald et al. in VLDB'04.
- On saying "Enough Already!" in SQL by Carey and Kossman in SIGMOD'97. See also Reducing the braking distance of an SQL query engine in VLDB'98.
- Probabilistic Optimization of top N queries by Donjerkovic and Ramakrishnan in SIGKDD'00.
-
Supporting top-k join queries in relational databases. by Ilyas et
al. in VLDB Journal 13(3), pp. 207-221, 2004.
-
Reducing the Braking Distance of an SQL Query Engine by Carey and Kossmann in VLDB, pp. 158-169, 1998.
- Efficient Maintenance of Materialized Top-k Views by Yi et al. in ICDE'03.
- RankSQL: Query Algebra and Optimization for Relational
Topk Queries by Li et al. in SIGMOD'05.
- Answering Top-k Queries Using Views by Das et al. in VLDB'06.
-
SPARK: Top-k Keyword Query in Relational Databases by Luo et al. in SIGMOD
, 2007.
- Top-k Query Processing in Uncertain Databases by Soliman et al. in ICDE'07.
- Efficient Top-k Query Evaluation on Probabilistic Data by Re, Dalvi, and Suciu in ICDE'07.
- KLEE: A Framework for Distributed Top-k Query Algorithms by Michel et al. in VLDB'05.
- The Threshold Algorithm: From Middleware Systems to the Relational Engine by Bruno and Wang in IEEE ToKDE 19(4), 2007.
- OLAP
- Basics and Overviews
-
An overview of data warehousing and olap technology by Chaudhuri and Dayal in SIGMOD Record 26(1), 1997.
-
Aggregate View Management in Data Warehouses by Yannis Kotidis in Handbook of Massive Data Set, Kluwer 2002.
- Data Warehousing by Theodore Johnson in Handbook of Massive Data Set, Kluwer 2002.
- Data Cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals by Jim Gray et al. in J. Data Mining and Knowledge Discovery, 1(1), 1997.
- An array-based algorithm for simultaneous multidimensional aggregates by Zhao et al. in SIGMOD'97.
- A survey of logical models for OLAP databases by Vassiliadis and Sellis in SIGMOD Record 28(4), 1999.
- Fundamentals of Data Warehouses (2nd Ed.) by Jarke et al, Springer, 2003.
- Materialized Views: Techniques, Implementations, and Applications by Gupta and Mumick(ed.), the MIT Press, 1998.
- Data warehousing and OLAP: a research-oriented bibliography by A. Mendelzon, C. Hurtado, and D. Lemire
- View Selection
- Materialized views selection in a multidimensional database. by Baralis et al. in VLDB'97.
- The view selection problem has an exponential time lower bound for conjunctive queries and views. by Rada Chirkova in PODS 2002.
- A formal perspective on the view selection problem by Chirkova, Halevy, and Suciu in VLDB 2001.
- On the complexity of the view selection problem by Karloff and Mihail in PODS'99.
- Materialized view selection for multidimensional datasets by Shukla et al. in VLDB'98.
- Data warehouse configuration by Theodoratos and Sellis in VLDB'97.
- Selection of views to materialize under a maintenance cost constraint by Gupta and Mumick in ICDT'99.
- Selection of views to materialize in a data warehouse by Gupta in ICDT'97. See also the extended version by Gupta and
Mumick in IEEE TKDE 17(1), 2005.
- Index selection for OLAP by Gupta et al. in ICDE'97.
- Implementing data cubes efficiently by Harinarayan et al. in SIGMOD'96.
- Algorithms for materialized view design in data warehousing environment by Yang et al. in VLDB'97.
- View selection using randomized search by Kalnis et al. in Data Knowledge Engineering, 42(1), 2002.
- Speeding up warehouse physical design using a randomized algorithm by Lee and Hammer in Design and Management of Data Warehouses, workshop at CAiSE, 1999.
- Caching multidimensional queries using chunks by Deshpande et al. in SIGMOD'98.
- WATCHMAN: A data warehouse intelligent cache manager by Scheuermann, Shim, and Vingralek in VLDB'96.
- Semantic data caching and replacement by Dar et al. in VLDB 1996.
- DynaMat: a dynamic view management system for data warehouses. by Kotidis and Roussopoulos in SIGMOD'99.
See also A case for dynamic view management by Kotidis and Roussopoulos in TODS Vol. 26, 2001.
- Dynamic Materialization of Query Views for Data Warehouse Workloads by Phan and Li in ICDE'08.
- Exact and inexact methods for selecting views and indexes for OLAP performance improvement by Talebi et al. in EDBT'08.
- Materialized view in oracle by Bello et al. in VLDB'98.
- On the complexity of the view-selection problem by Karloff and Mihail in PODS'99.
- Answering complex SQL queries using automatic summary tables by Zaharioudakis et al. in SIGMOD'00.
- View Selection for Stream Processing by Gupta, Halevy, and Suciu in WebDB'02.
- Multiple Materialized View Selection for XPath Query Rewriting by Tan et al. in ICDE'08.
- Materialized View Selection and Maintenance Using
MultiQuery Optimization by Mistry et al. in SIGMOD'01.
- View Maintenance
- Deriving Production Rules for Incremental View Maintenance by Ceri and Widom in VLDB'91.
- Making views self-maintainable for data warehousing by Quass, Gupta, Mumick, and Widom in PDIS, 1996
- Incremental Maintenance of views with duplicates by Griffin and Libkin in SIGMOD'95.
- Maintaining views incrementally by Gupta, Mumick, and Subrahmanian in SIGMOD'93.
- Maintenance of data cubes and summary tables in a warehouse by Mumick et al. in SIGMOD'97.
- View maintenance in a warehousing environment by Zhuge et al. in SIGMOD'95.
- View maintenance issues for the chronicle data model by Jagadish, Mumick, and Silberschatz in PODS'95.
-
Maintenance of materialized views: Problems, techniques and applications by Gupta and Mumick in IEEE Bulletin on Data Engineering, 18(2), 1995.
- Materialized view maintenance and integrity constraint checking: trading space for time by Ross et al. in SIGMOD'96.
- Incremental Maintenance of Externally Materialized Views by Staudt and Jarke in VLDB'96.
- Efficient Maintenance of Materialized Outer-Join Views by Larson and Zhou in ICDE'07.
- Storage, computation, and indexing
- Peer-to-Peer Data Management (To be specified)
- Schema Mediation in Peer Data Management System by Halevy et al. in ICDE'03.
- Mapping Data in Peer-to-peer systems: Semantics and Algorithmic Issues by Kementsietsidis et al. in SIGMOD'03.
- ORCHESTRA: facilitating collaborative data sharing by Green et al. in SIGMOD'07.
- PeerDB: A P2P-based system for Data sharing by Ng et al. in ICDE'03.
- RACCOON: A Peer-based System for Data Integration and Sharing by Li et al. in ICDE'04.
- Query Answering in Peer-to-Peer Data Exchange Systems by Bertossi and Bravo in EDBT Workshop 2004.
- Efficient Query Reformulation in Peer Data Management Systems by Tatarinov and Halevy in SIGMOD'04.
- Logical foundations of peer-to-peer data integration by Calvanese et al. in PODS'04.
- On reconciling data exchange, data integration, and peer data management by De Giacomo et al. in PODS'07.
- The Semantics of Consistency and Trust in Peer Data Exchange
Systems by Bertossi and Bravo in
LPAR 2007: 107-122
-
The Architecture of PIER: an
Internet-Scale Query Processor. by Huebsch et al. in
Conf. on Innovative Data Research (CIDR), pp. 28-43, 2005.
- Hyperion Project at U. Toronto
- Stream Data Management
(To be specified)
- Semi-Structured Data Management
(To be specified)
- Sensor Data Management
(To be specified)
- Prolog:
I am trying to learn prolog. The following is a list of references i consider worth reading.
- The Semantics of Predicate Logic as a Programming Language by Van Emden and Kowalski. in J. ACM 23(4), 1976.
-
Logic Programming by Apt in Handbook of Theoretical Computer Science, Volume B, 1990.
- The Art of Prolog, 2nd Ed. By Shapiro and Sterling. Cambridge, 1994.
- Programming in Prolog, 5th Ed. By Clocksin and Mellish, Springer, 2003.
- The Craft of Prolog. By O'Keefe, MIT Press, 1990.
- From Logic Programming to Prolog. By Apt, Prentice Hall, 1997.
- Foundations of Logic Programming (2nd Ed.) by Lloyd, Springer-Verlag,
1987.
- Learn Prolog Now. By Blackburn et al., College Publication, 2006.
- Logic, Programming and Prolog(2nd Ed.) by by Ulf Nilsson and Jan Maluszynski, John Wiley & Sons Ltd., 1995.
-
Prolog - Programming in Logic by Tschorn, Wallum and Steffens. Online book under GNU.
- Search Engines:
(I was asked to help build some slides about query processing in search engines. Here are some
references i find useful, although i have no chance reading all of them)
- The PageRank Citation Ranking: Bringing Order to the Web
by L. Page et al., Tech. Report, Stanford Univ. 1998. See also:The Anatomy of a Large-Scale Hypertextual Web Search Engine
by Sergey Brin and Lawrence Page in Computer Networks 30(1-7): 107-117 (1998).
- Bigtable: A Distributed Storage System for Structured Data
by Fay Chang et al., OSDI'06.
- MapReduce: Simplified Data Processing on Large Clusters
by Jeffrey Dean and Sanjay Ghemawat, OSDI'04. See also Experiences with MapReduce, an abstraction for large-scale computation
by Jeffrey Dean in PACT'06 and Google's MapReduce Programming Model Revisited by Ralf Lammel in Science of Computer Programming Journal, 2006.
-
MapReduce: A major step backwards by DeWitt and Stonebreaker in
The Database Column, Jan 18, 2008.
- The Web as a Graph by Kumar et al. in PODS 2000. See also
The web as a graph: measurements, models, and methods by Kleinberg et al. in proceedings of the 5th International Computing and combinatorics Conference,1999.
- Map-reduce-merge: simplified relational data processing on large clusters by Yang et al. in SIGMOD'07.
- Inverted Files for Text Search Engines
by J. Zobel and A. Moffat in ACM Computing Survey 2006.
- Optimized Query Execution in Large Search Engines by Long and Suel in VLDB'03.
- Learning Search Engine Specific Query Transformations for Question Answering by Agichten et al. in WWW10,
2001.
- Modern Information Retrieval by Baeza-Yates and Ribeiro-Neto, Addison-Wesley, 1999.
- Combining Systems and Databases: A Search Engine Retrospective by Brewer in Readings in Databases, 4th Ed.
-
Introduction to Information Retrieval by Jeffery Dean in PACT '06. ACM.
Manning et al., Cambridge University Press, 2008.
- Mining the Web: Discovering Knowledge from Hypertext Data by Soumen Chakrabarti, Morgan Kaufmann, 2002.(Second edition coming)
- Managing Gigabytes by Witten et al., Morgan Kaufmann, 1999.
- Google's PageRank and Beyond: The Science of Search Engine Rankings by Amy N. Langville and Carl D. Meyer, Princeton University Press, 2006.
- Understanding Search Engines: Mathematical Modeling and Text Retrieval, Second Edition by
Michael W. Berry and Murray Browne, SIAM, 2005.
- Open Source Projects such as
Lucene,
Egothor, and
Nutch
- Web Service Composition
(The following is the content of literature research of an abandoned thesis topic for a master student.)
- Uncategorized
- C-store:A Column Oriented DBMS
by M. Stonebraker et al., VLDB'05. See also
C-Store Project Home.
- Operating System Support for Database Mangement
by Michael Stonebreaker in Commun. ACM 24, 7 (Jul. 1981), 412-418.
- A Sophisticate's Introduction to Database Normalization Theory by Beeri, Bernstein, and Goodman, in VLDB'78.
- Semantic database modeling: survey, applications, and research issues. by Hull and King in ACM Computing Survey 19(1987) 201-260.
- A Survey of Theoretical Research on Typed Complex Database Objects by Richard Hull in XP Workshop on Database Theory 1986.
- Semantic Data Model by Peckham and Maryanski in ACM Computing Survey 20(3), 153-190, 1988.
- Rules In Database Systems by Ceri and Ramakrishnan in ACM Computing Survey 28, 1996.
- An Information-Theoretic Approach to Normal Forms for Relational and XML Data by Arenas and Libkin in Journal of ACM 52(2),2005.
- Integration DB and IR Technologies: what is the sound of one hand clapping. by Chaudhuri, Ramakrishnan, and Weikum in CIDR 2005.
See also DB & IR : Both Sides Now by Weikum in SIGMOD'07.
- Of Objects and Databases: A Decade of Turmoil by Carey and DeWitt in VLDB'96.
- Introduction to Data Mining and Knowledge Discovery by Two Crows Corporation.
-
Modern Information Retrieval: A Brief Overview by Singhal, Amit
. Bulletin of the IEEE Computer Society Technical Committee on
Data Engineering 24 (4): 35-43, 2001.
- The stable model semantics for logic programming by M. Gelfond and V. Lifschitz, in
5th ICLP, pp. 1070-1080
- The well-founded semantics for general logic programs by A. Van Gelder, K. A. Ross, and J. S. Schlipf in J. ACM 1991.
- Automata Theory for XML Researchers by Frank Neven in SIGMOD Record 31(3), 2002.
- Foundations of preferences in Database Systems by Kiessling in VLDB'02. See also Preference SQL - Design, Implementation, Experiences by Kiessling and Koestler in VLDB'02.
- Data Fusion in Three Steps: Resolving Schema, Tuple, and Value Inconsistencies by Naumann et al. in IEEE Data Engineering Bulletin 29(2), 2006.
- On the semantics of updates in databases by Fagin et al. in PODS'83.
- One table stores all: enabling painless free-and-easy data publishing and sharing by Yu et al. in CIDR'07.
- The case for a wide-table approach to manage sparse relational data sets by Chu et al. in SIGMOD'07.
- Text Databases and Information Retrieval by Riloff and Hollaar in ACM Computing Survey 28(1), 1996.
- A Query Language and Optimization Techniques for Unstructured Data by Buneman et al. in SIGMOD'96.
- Just-in-time databases and the World-Wide Web by Spertus and Stein. In
CIKM'98.
- Expressiveness and tractability in knowledge representation and reasoning by Brachman and Levesque in Computational Intelligence journal 3, 78-93 (1987).
- Link Analysis Ranking: Algorithms, Theory, and Experiments by Borodin et al. in ACM Trans. on Internet Technology 5(1), 2005.
- Web Mining by Johannes Fürnkranz, draft book chapter, 2004.
- Data Debugger: An Operator-Centric Approach for Data Quality Solutions by Chaudhuri et al. in IEEE Data Engineering Bulletin 29(2), 2006.
- Data Streams: Algorithms and Applications by Muthukrishnan in Foundations and Trends in Theoretical Computer Science 1(2), 2005.