Intelligent Document Retrieval: Exploiting Markup Structure / Edition 1

Hardcover (Print)

Overview

Collections of digital documents can nowadays be found everywhere in institutions, universities or companies. Examples are Web sites or intranets. But searching them for information can still be painful. Searches often return either large numbers of matches or no suitable matches at all.

Such document collections vary widely in size and in how much structure they carry. What they have in common is that they typically do have some structure and that they cover a limited range of topics. The second point sets them apart from the Web in general.

The type of search system that we propose in this book can suggest ways of refining or relaxing the query to assist a user in the search process. To suggest sensible query modifications, we need to know what the documents are about. This requires explicit knowledge about the document collection, encoded in some electronic form. Such knowledge is typically not available, so we construct it automatically.
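
As a rough illustration of how markup might serve as a cue to what a document is about, the sketch below collects the words that occur in more than one markup context of an HTML page. It is not taken from the book: the tag set, the threshold, and the names CUE_TAGS, CueCollector and candidate_terms are all assumptions made for this example.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re

# Markup contexts treated as cues. This tag set is an assumption for illustration.
CUE_TAGS = {"title", "h1", "h2", "b", "a"}


class CueCollector(HTMLParser):
    """Record, for each word, the set of cue tags it appears inside."""

    def __init__(self):
        super().__init__()
        self.open_tags = []               # cue tags that are currently open
        self.contexts = defaultdict(set)  # word -> set of cue tags

    def handle_starttag(self, tag, attrs):
        if tag in CUE_TAGS:
            self.open_tags.append(tag)

    def handle_endtag(self, tag):
        if tag in self.open_tags:
            # Close the most recently opened occurrence of this tag.
            idx = len(self.open_tags) - 1 - self.open_tags[::-1].index(tag)
            del self.open_tags[idx]

    def handle_data(self, data):
        # Only text inside at least one open cue tag is recorded.
        for word in re.findall(r"[a-z]+", data.lower()):
            for tag in self.open_tags:
                self.contexts[word].add(tag)


def candidate_terms(html, min_contexts=2):
    """Return words seen in at least `min_contexts` distinct cue tags (hypothetical threshold)."""
    collector = CueCollector()
    collector.feed(html)
    collector.close()
    return sorted(w for w, tags in collector.contexts.items() if len(tags) >= min_contexts)


page = (
    "<html><head><title>Library opening hours</title></head><body>"
    "<h1>Library</h1><p>The <b>library</b> is open daily. "
    "See <a href='/hours'>opening hours</a>.</p></body></html>"
)
terms = candidate_terms(page)
```

Terms selected this way could then be offered to a user as possible query refinements. How the book actually builds and applies its domain model is described in its Chapter 3.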

Editorial Reviews

From the Publisher
From the reviews:

"The main idea of this book, based on the author’s PhD thesis, is to use markup information as a series of cues to the significance of words and concepts in a text, thus enhancing the indexing of that text. The technique is developed for collections of texts with a specific focus, such as a Web site or a collection of documents … . The presented approach is attractive, because it can be adapted to different contexts in a straightforward manner … ." (D. T. Barnard, Computing Reviews, July, 2006)

Product Details

  • ISBN-13: 9781402037672
  • Publisher: Springer Netherlands
  • Publication date: 12/1/2005
  • Series: The Information Retrieval Series, #17
  • Edition description: 2005
  • Edition number: 1
  • Pages: 198
  • Product dimensions: 9.21 (w) x 6.14 (h) x 0.56 (d)

Table of Contents

Kruschwitz "Intelligent Document Retrieval: Exploiting Markup Structure"

Foreword
Preface
List of Figures
List of Tables

1 Introduction
1.1 Introductory Examples
1.2 Using Markup to Extract Knowledge
1.3 Applying the Extracted Knowledge
1.4 Structure of the Book

Part I The Model

2 Related Work
2.1 Information Retrieval
2.2 Information Extraction
2.3 Clustering
2.4 Classification
2.5 Web Search Techniques
2.6 Ontologies
2.7 Layout Analysis
2.8 Web Search Studies
2.9 Navigating Concept Hierarchies
2.10 Dialogue Systems
2.11 Usability Issues
2.12 Concluding Remarks on Related Work

3 Data Analysis and Domain Model Construction
3.1 Documents
3.2 Concepts
3.3 A Domain Model Based on Concepts
3.4 Model Structure
3.5 Model Construction
3.6 Using the Model for Query Modification
3.7 Implementational Issues

4 Incorporating Additional Knowledge
4.1 Internal Knowledge
4.2 External Knowledge

5 A Dialogue System for Partially Structured Data
5.1 Dialogue as Movement in Space
5.2 Dialogue Example
5.3 Static vs. Dynamic Clusters
5.4 Real User Queries
5.5 Properties
5.5.1 Document Properties
5.5.2 System Properties
5.5.3 Goal Description
5.6 Dialogue
5.6.1 High Level Dialogue States
5.6.2 Low Level Dialogue States
5.6.3 Constructing Potential Choices
5.6.4 Dialogue Strategies
5.6.5 Customization

Part II Practical Applications

6 UKSearch - Intelligent Web Search
6.1 Indexing Web Pages
6.2 The UKSearch System
6.2.1 Indexing and Model Construction
6.2.2 Dialogue Strategy
6.3 Sample Domain 1: Essex University
6.3.1 Index Tables
6.3.2 Domain Model
6.3.3 Concepts vs. Real User Queries
6.4 Sample Domain 2: BBC News
6.4.1 Index Tables
6.4.2 Domain Model
6.4.3 Adjusted Dialogue Strategy
6.5 Implementational Issues

7 UKSearch - Evaluation and Discussion
7.1 Log Analysis
7.1.1 System Setup
7.1.2 Results
7.1.3 Discussion
7.2 Investigating Domain Model Relations
7.2.1 Task and Setup
7.2.2 Results
7.2.3 Discussion
7.3 Task-Based Evaluation: Essex University
7.3.1 Search Tasks
7.3.2 Experimental Setup
7.3.3 Procedure
7.3.4 Results
7.3.5 Discussion
7.4 Task-Based Evaluation: BBC News
7.4.1 Search Tasks
7.4.2 Experimental Setup and Procedure
7.4.3 Results
7.4.4 Discussion

8 YPA - Searching Classified Directories
8.1 System Overview
8.2 Indexing Classified Advertisements
8.2.1 Structure of the Backend
8.2.2 Domain Model Construction
8.3 Dialogue Strategy in the YPA
8.3.1 Properties
8.3.2 Dialogue Setup
8.3.3 Dialogue Function
8.3.4 Calculation of Potential Choices
8.4 Implementational Issues

9 Future Directions and Conclusions
9.1 Towards Evolving Domain Models
9.2 Dialogue Management
9.3 An Outlook on Future Evaluations
9.4 Conclusions

References
Index
