{"id":24165,"date":"2021-01-21T19:24:30","date_gmt":"2021-01-21T19:24:30","guid":{"rendered":"https:\/\/www.rightsdirect.com\/?post_type=blog_post&#038;p=24165"},"modified":"2021-01-21T19:24:30","modified_gmt":"2021-01-21T19:24:30","slug":"fuenf-empfehlungen-fuer-smartes-text-und-data-mining","status":"publish","type":"blog_post","link":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/","title":{"rendered":"Five Recommendations for Smart Text and Data Mining"},"content":{"rendered":"<p><a href=\"https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=3712813\">Recent research<\/a> shows that submission rates at scientific journals rose exponentially in the first months of 2020. As the volume of available information keeps growing, R&amp;D-intensive companies are increasingly turning to text and data mining of full-text scientific literature, both at scale and within individual projects, to extract information and strengthen their knowledge supply chain. Naturally, these efforts vary in scope and requirements from company to company, or even from project to project. When working with full-text content, a knowledge manager should weigh many factors when trying to develop an optimal workflow suited to the organization\u2019s needs. Below, we take a closer look at a few of these factors.<\/p>\n<p><strong>1) End-to-End Workflow<\/strong><\/p>\n<p>As a knowledge manager, you need to understand your company\u2019s expected end-to-end workflow for text mining full-text literature. It can be useful to map the anticipated inputs and outputs at each phase of the workflow and to clarify expected timelines and business criticality. 
This applies both to any backend data processing pipeline and to the dependent end-user workflows. By looking at this workflow as one continuous stream, a knowledge manager can ensure that adjustments upstream do not break processes downstream.<\/p>\n<p><strong>2) Corpus Parameters<\/strong><\/p>\n<p>The parameters for defining a full-text corpus of scientific literature will vary depending on the organization\u2019s end-to-end workflow. For example, the dimensions of a corpus used in a text mining process for specific projects \u2013 such as a pharmacovigilance workflow \u2013 will differ from those used within broader initiatives to process scientific information at scale, apply machine learning or artificial intelligence capabilities, or construct knowledge graph representations. In narrower use cases, specific queries may rely on keywords or subject-related metadata (such as Medical Subject Headings, known as MeSH, or other indexing aids) that will pull relevant content based on the project specifications. The broader the use case, the less likely an organization is to be able to pre-filter for specific topics; in these cases, time-based, journal-based, or other broad selection criteria need to be applied. Based on the end-to-end workflow envisioned, knowledge managers can help their stakeholders by identifying key questions that will define the approach to creating a useful corpus, such as:<\/p>\n<ul>\n<li>What are the desired outputs from processing full-text content?<\/li>\n<li>Is there an ongoing need for new literature, or will a backfile of historical research suffice?<\/li>\n<li>What is the expected outcome of the text and data mining effort?<\/li>\n<li>What journals, timelines, and fields of research are most relevant to the project?<\/li>\n<\/ul>\n<p><strong>3) Volume<\/strong><\/p>\n<p>As described above, there are different approaches to defining a corpus of scientific literature. 
Those parameters will naturally affect the volume of content in question. Looking back at the two use cases mentioned above, an organization that is text mining for specific projects may analyze only a handful, dozens, or hundreds of articles at a time. Larger-scale initiatives, with broader corpus parameters, may result in the processing of hundreds of thousands, or even millions, of articles. Beyond the project\u2019s current needs, it is also important to consider how the project will be maintained over time and what content it is likely to need in the future. Predicting the amount of content needed for text mining going forward is an important exercise, but it can be challenging. One method that helps with this estimation is analyzing current or backfile needs, then using this metric to forecast future needs. This calculation can help a knowledge manager choose an appropriate method for consuming the content and predict costs.<\/p>\n<p><strong>4) Timeliness<\/strong><\/p>\n<p>For some organizations, timeliness is essential to their text mining use case. As a knowledge manager, you should understand stakeholders\u2019 expectations for how soon after publication an article should yield outputs from a text and data mining pipeline. Delays can come from the lag between an article\u2019s publication and its appearance in a data feed or availability through a public API, from batch or asynchronous processing rules, and from similar sources; any of these can affect business commitments, service level agreements, and expectations.<\/p>\n<p><strong>5) Licensing<\/strong><\/p>\n<p>Published scientific literature is a valuable asset and an important investment for an organization. 
Based on the expected end-to-end workflow and corpus parameters, the knowledge manager should determine whether content needs can be satisfied through existing subscriptions\/licenses, or whether these should be augmented through extended licensing or through transactional steps. Any expected transactional steps should likewise be factored into the workflow, and their effect on timeliness requirements measured.<\/p>\n<p>Considering these five factors will help a knowledge manager identify the optimal workflow for their organization to consume full-text content. Text mining, for many, is a new and exciting technology that can drastically improve research and innovation within an organization. Understanding and analyzing these five considerations makes the potentially daunting task of developing a new workflow to support text mining significantly more manageable.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Recent research shows that submission rates at scientific journals rose exponentially in the first months of 2020.<\/p>\n","protected":false},"author":189,"featured_media":24166,"template":"","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":"","_links_to":"","_links_to_target":""},"internal_tag":[],"topic":[],"coauthors":[],"class_list":["post-24165","blog_post","type-blog_post","status-publish","has-post-thumbnail","hentry"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - RightsDirect<\/title>\n<meta name=\"description\" content=\"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link 
rel=\"canonical\" href=\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - RightsDirect\" \/>\n<meta property=\"og:description\" content=\"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/\" \/>\n<meta property=\"og:site_name\" content=\"RightsDirect\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/RightsDirect\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png\" \/>\n\t<meta property=\"og:image:width\" content=\"700\" \/>\n\t<meta property=\"og:image:height\" content=\"300\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"twitter:label1\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data1\" content=\"4\u00a0Minuten\" \/>\n\t<meta name=\"twitter:label2\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data2\" content=\"Garrett Dintaman\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/\",\"url\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/\",\"name\":\"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - 
RightsDirect\",\"isPartOf\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png\",\"datePublished\":\"2021-01-21T19:24:30+00:00\",\"description\":\"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage\",\"url\":\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png\",\"contentUrl\":\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png\",\"width\":700,\"height\":300},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.rightsdirect.com\/de\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Blog 
Posts\",\"item\":\"https:\/\/www.rightsdirect.com\/de\/blog\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#website\",\"url\":\"https:\/\/www.rightsdirect.com\/de\/\",\"name\":\"RightsDirect\",\"description\":\"Global Copyright Compliance Solutions | Rights Licensing | Copyright Education\",\"publisher\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.rightsdirect.com\/de\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#organization\",\"name\":\"RightsDirect\",\"url\":\"https:\/\/www.rightsdirect.com\/de\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"de\",\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2016\/05\/RightsDirect-Logo.RGB-300ppi.jpg\",\"contentUrl\":\"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2016\/05\/RightsDirect-Logo.RGB-300ppi.jpg\",\"width\":2000,\"height\":1200,\"caption\":\"RightsDirect\"},\"image\":{\"@id\":\"https:\/\/www.rightsdirect.com\/de\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/RightsDirect\",\"https:\/\/x.com\/RightsDirect\",\"https:\/\/www.linkedin.com\/company\/rightsdirect\",\"https:\/\/www.youtube.com\/user\/copyrightclear\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - RightsDirect","description":"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/","og_locale":"de_DE","og_type":"article","og_title":"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - RightsDirect","og_description":"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.","og_url":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/","og_site_name":"RightsDirect","article_publisher":"https:\/\/www.facebook.com\/RightsDirect","og_image":[{"width":700,"height":300,"url":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png","type":"image\/png"}],"twitter_misc":{"Gesch\u00e4tzte Lesezeit":"4\u00a0Minuten","Written by":"Garrett Dintaman"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/","url":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/","name":"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und Data-Mining - 
RightsDirect","isPartOf":{"@id":"https:\/\/www.rightsdirect.com\/de\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage"},"image":{"@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage"},"thumbnailUrl":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png","datePublished":"2021-01-21T19:24:30+00:00","description":"J\u00fcngste Untersuchungen zeigen, dass die Einreichungsrate bei wissenschaftlichen Zeitschriften in den ersten Monaten des Jahres 2020 exponentiell gestiegen ist.","breadcrumb":{"@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/"]}]},{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#primaryimage","url":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png","contentUrl":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2021\/01\/Open-Access-Research-Content-Corporate-World.png","width":700,"height":300},{"@type":"BreadcrumbList","@id":"https:\/\/www.rightsdirect.com\/de\/blog\/fuenf-empfehlungen-fuer-smartes-text-und-data-mining\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.rightsdirect.com\/de\/"},{"@type":"ListItem","position":2,"name":"Blog Posts","item":"https:\/\/www.rightsdirect.com\/de\/blog\/"},{"@type":"ListItem","position":3,"name":"F\u00fcnf Empfehlungen f\u00fcr smartes Text- und 
Data-Mining"}]},{"@type":"WebSite","@id":"https:\/\/www.rightsdirect.com\/de\/#website","url":"https:\/\/www.rightsdirect.com\/de\/","name":"RightsDirect","description":"Global Copyright Compliance Solutions | Rights Licensing | Copyright Education","publisher":{"@id":"https:\/\/www.rightsdirect.com\/de\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.rightsdirect.com\/de\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"},{"@type":"Organization","@id":"https:\/\/www.rightsdirect.com\/de\/#organization","name":"RightsDirect","url":"https:\/\/www.rightsdirect.com\/de\/","logo":{"@type":"ImageObject","inLanguage":"de","@id":"https:\/\/www.rightsdirect.com\/de\/#\/schema\/logo\/image\/","url":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2016\/05\/RightsDirect-Logo.RGB-300ppi.jpg","contentUrl":"https:\/\/www.rightsdirect.com\/wp-content\/uploads\/sites\/6\/2016\/05\/RightsDirect-Logo.RGB-300ppi.jpg","width":2000,"height":1200,"caption":"RightsDirect"},"image":{"@id":"https:\/\/www.rightsdirect.com\/de\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/RightsDirect","https:\/\/x.com\/RightsDirect","https:\/\/www.linkedin.com\/company\/rightsdirect","https:\/\/www.youtube.com\/user\/copyrightclear"]}]}},"acf":[],"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"_links":{"self":[{"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/blog_post\/24165","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/blog_post"}],"about":[{"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/types\/blog_post"}],"author":[{"embeddable":true,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/users\/189"}],"version-history":[{"count":2,"href":"https:\/\/www.rightsdirect.com\/de\/wp-js
on\/wp\/v2\/blog_post\/24165\/revisions"}],"predecessor-version":[{"id":24173,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/blog_post\/24165\/revisions\/24173"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/media\/24166"}],"wp:attachment":[{"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/media?parent=24165"}],"wp:term":[{"taxonomy":"internal_tag","embeddable":true,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/internal_tag?post=24165"},{"taxonomy":"topic","embeddable":true,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/topic?post=24165"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.rightsdirect.com\/de\/wp-json\/wp\/v2\/coauthors?post=24165"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}