Text mining is the discovery by computer of new, previously unknown information, by automatically extracting information from different written sources or Text( e.g. academic publications, patents , regulatory documents)
Text mining is a variation on a filed called data mining. The difference between regular data mining and text mining is that in text mining the patterns are extracted from natural language text rather than from structured databases of facts.
Currently researchers spend over 50% of their time searching for information through millions of scientific publication.
The burgeoning growth of published text means that even the most avid reader cannot hope to keep up with all the reading in a field, let alone adjacent fields.
Text mining offers a solution to this problem by replacing or supplementing the human reader with automatic systems undeterred by the text explosion . It involves analyzing a large collection of documents to discover previously unknown information. The information might be relationships or patterns that are buried in the document collection and which would otherwise be extremely difficult.The most active,and I think promising,application area for text mining is in the Biosciences(Proteomics & genomics).
2 comments:
Thanks for sharing the information.It is very valuable for the professionals in the field of Bioinformatics.
thanks for the info. i would like to know more about this. can some one help me in this regard. i am a medical doctor with patents & regulatory affairs knowledge domain knowledge & exposure.
bye
dr.r.r.srinivas
Post a Comment