What is the extraction of information?

Sometimes known as obtaining information, information extraction (IE) is a process that is used with computer systems to allow the relevant data to be extracted from larger data using a set of predefined criteria. The idea of ​​extraction of information is to make it easy to identify and assimilate data that is relevant to a particular activity without having to manually go through a large amount of information to detect the exact data required. The process is similar to concept mining or scratching on the web because all these approaches seek to collect useful information from the wider fund of available data.

General approach to information extraction requires the use of programming that is able to scan the sources of information that are considered to be readable machines. This may include printed documents that have been scanned into some kind of electronic files, documents prepared JAKO tables or document processing documents or even data contained in readable fields in the database. Usually, parameters are set to allow the software program to get access to these data sources, and quickly prefers them through specific criteria and pulls out certain types of information from the available fund. This process usually differs from a simple search process in that the method requires non -conformity of specific words or phrases in itself, but instead uses a process called natural language processing, which helps not only in the evaluation of real words, but also the context and meaning of this context.

Complexity associated with extraction of information makes it difficult to use this approach on a global scale, although there are IE tools that work very well with limited amounts of data, such as data sources associated with electronic files.And corporation or even resource fund including a limited number of ZPRavodaj channels. With this approach it is possible to identify an event type, possibly even reduce revenues to include a certain number of participants in the event and have data organized by date.

As with many forms of technology, the tools used to extract information are constantly increasing. Since the beginning of the 21st century, the ability to set parameters and the use of the ever -growing electronic data authorities in the search for relevant information has increased significantly. This includes the ability to solve large volumes of unstructured data and use these parameters to bring some order or structure to these data, which is all the more useful for future search.

IN OTHER LANGUAGES

Was this article helpful? Thanks for the feedback Thanks for the feedback

How can we help? How can we help?