![An Incrementally Trainable Statistical Approach to Information Extraction: Based on Token Classification and Rich Context Model - Christian Siefkes - Bøger - VDM Verlag - 9783639001464 - 4. juli 2008](https://imusic.b-cdn.net/images/item/original/464/9783639001464.jpg?christian-siefkes-2008-an-incrementally-trainable-statistical-approach-to-information-extraction-based-on-token-classification-and-rich-context-model-paperback-bog&class=scaled&v=1497099319)
Fortæl dine venner om denne vare:
An Incrementally Trainable Statistical Approach to Information Extraction: Based on Token Classification and Rich Context Model
Christian Siefkes
Bestilles fra fjernlager
An Incrementally Trainable Statistical Approach to Information Extraction: Based on Token Classification and Rich Context Model
Christian Siefkes
Most of the information stored in digital form is hidden in naturallanguage texts. The purpose of Information Extraction (IE) is to finddesired pieces of information in unstructured or weakly structured textsand store them in a form that is suitable for automatic querying andprocessing. This book presents a innovative approach to statistical informationextraction. It introduces a new algorithm which supports functionality notavailable in previous IE systems, such as interactive incremental trainingto reduce the human training effort. The system also utilizes new sourcesof information, employing rich tree-based context representations tocombine document structure (HTML or XML markup) with linguistic andsemantic information. The resulting IE system is designed as a generic framework for statisticalinformation extraction. All core components can be modified or exchangedindependently of each other. This book is of interest for professionals who have to deal with largeamounts of weakly structured information and seek ways to automate thisprocess, as well as for researchers and practitioners active in the fieldsof text mining and text classification.
Medie | Bøger Paperback Bog (Bog med blødt omslag og limet ryg) |
Udgivet | 4. juli 2008 |
ISBN13 | 9783639001464 |
Forlag | VDM Verlag |
Antal sider | 220 |
Mål | 299 g |
Sprog | Engelsk |
Mere med Christian Siefkes
Se alt med Christian Siefkes ( f.eks. Paperback Bog og Hardcover bog )