{"id":2081,"date":"2018-05-09T05:40:17","date_gmt":"2018-05-09T05:40:17","guid":{"rendered":"http:\/\/www.dekuzu.com\/en\/?p=2081"},"modified":"2018-05-09T05:40:17","modified_gmt":"2018-05-09T05:40:17","slug":"technical-aspects-of-text-and-data-mining-research-in-copyright-directive","status":"publish","type":"post","link":"https:\/\/www.dekuzu.com\/en\/2018\/05\/technical-aspects-of-text-and-data-mining-research-in-copyright-directive.html","title":{"rendered":"Technical aspects of text and data mining research in copyright directive"},"content":{"rendered":"<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">A new very useful research, requested by policy department for citizens\u2019 rights and constitutional affairs, has been published. The author of research, Eleonora Rosati, has briefly but informative and understandable way outlined the main issues with text and data mining exception to copyright. The entire research available <a href=\"http:\/\/www.dekuzu.com\/en\/docs\/IPOL_BRI(2018)604942_EN.pdf\">here<\/a>, below some technical points of exception \u2013 its three steps.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\"><!--more--><\/span><\/p>\n<hr \/>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">TDM activities can take place through different procedures and with different goals, the only common element being that of analysing and extracting associations between concepts to identify new patterns and relations. By means of a necessary simplification, it appears however possible to distinguish three common \u2013 yet not all necessary \u2013 steps in TDM processes:<\/span><\/p>\n<ol style=\"text-align: justify;\">\n<li><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Access to content (Step 1);<\/span><\/li>\n<li><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Extraction and\/or copying of content (Step 2);<\/span><\/li>\n<li><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Mining of text and\/or data and knowledge discovery (Step 3)<\/span><\/li>\n<\/ol>\n<p style=\"text-align: center;\"><strong><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Step 1 \u2013 Access to content<\/span><\/strong><\/p>\n<p><a href=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-2082\" src=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1-1024x333.jpg\" alt=\"\" width=\"758\" height=\"246\" srcset=\"https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1-1024x333.jpg 1024w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1-300x97.jpg 300w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1-768x250.jpg 768w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-1.jpg 1382w\" sizes=\"auto, (max-width: 758px) 100vw, 758px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">The primary distinction to be made is between content that is freely accessible and content that is not, and in relation to which access permission, i.e. a licence, may be required. In relation to the former, freedom of access does not necessarily entail that the content (text and data) is also free of legal restrictions. In relation to the latter, an issue might be also that of identifying the subjects from whom permission is to be sought, i.e. the relevant rightholders. Problems might also arise in relation to orphan works, these being works and other subject-matter that are protected by copyright or related rights and for which no rightholder has been identified or for which the rightholder, even if identified, has not been located.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">If a licence is required and is successfully secured, its resulting scope determines the types of activities that the licensee is entitled to undertake in relation to the content to which access has been secured. It is worth recalling that exceeding the scope of the licence secured might expose the licensee to liability for infringing acts. Some publishers include the possibility of undertaking TDM activities within the scope of the licences available, but that is not always the case. In particular, if acts of extraction and\/or copying of content are needed to undertake TDM, then further issues should be considered by the licensee who is not also explicitly allowed to perform TDM on the licensed content.<\/span><\/p>\n<p style=\"text-align: center;\"><strong><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Step 2 \u2013 Extraction and\/or copying of content<\/span><\/strong><\/p>\n<p style=\"text-align: justify;\"><a href=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-2083\" src=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2-1024x885.jpg\" alt=\"\" width=\"758\" height=\"655\" srcset=\"https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2-1024x885.jpg 1024w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2-300x259.jpg 300w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2-768x664.jpg 768w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-2.jpg 1359w\" sizes=\"auto, (max-width: 758px) 100vw, 758px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Lawful access to content \u2013 whether because such content is freely accessible or access has been obtained through a licence \u2013 does not necessarily entitle one to undertake TDM in respect of such content (text or data). This is because to undertake TDM it may be necessary to undertake certain propaedeutic activities, including extracting and\/or copying the content, for which specific authorization may be required. Not all TDM practices require necessarily the extraction and\/or copying of content.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Not all acts of copying are necessarily subject to the control of the relevant rightholder. If the content extracted and\/or copied is included in a database, then both copyright and the sui generis (database) right might come into consideration, as well as other aspects in the event that neither vests in the database considered.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">With regard to copyright, the author of a database is entitled to prevent a number of acts, including the reproduction \u2013 whether temporary or permanent \u2013 by any means and in any form, in whole or in part of the expression of the database which is protectable by copyright, i.e. expression that is sufficiently original. The only mandatory limitation to the rights of the copyright holder relates to the performance by the lawful user of a database or of a copy thereof of any acts that are necessary for the purposes of accessing the content of the databases and its normal use.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">With regard to the sui generis (database) right, the maker of a database who has made qualitatively and\/or quantitatively a substantial investment in either the obtaining, verification or presentation of the contents is entitled to prevent extraction and\/or re-utilization of the whole or of a substantial part, evaluated qualitatively and\/or quantitatively, of the contents of that database. Restrictions may also subsist in relation to databases that are protected by neither copyright nor the sui generis (database) right.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Also related (neighbouring) rights might come into consideration in relation to perspective acts of copying finalized to the undertaking of TDM activities. Should the proposed press publishers\u2019 rights be ultimately adopted, acts of reproduction in respect of press publications might also require authorization of press publishers.<\/span><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Finally, it should be noted that not only might intellectual property rights limit the activities underlying Step 2, but also other areas of the law might be relevant at this stage. In this sense, the application of data protection and privacy laws to the realm of text and data extraction should be considered. Another area that might be relevant is contract law, especially in relation to contractual restrictions and \u2013 where applicable \u2013 contractual restrictions of TDM.<\/span><\/p>\n<p style=\"text-align: center;\"><strong><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Step 3 \u2013 Mining of text and\/or data and knowledge discovery<\/span><\/strong><\/p>\n<p style=\"text-align: justify;\"><a href=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-2084\" src=\"http:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3-1024x456.jpg\" alt=\"\" width=\"758\" height=\"338\" srcset=\"https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3-1024x456.jpg 1024w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3-300x134.jpg 300w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3-768x342.jpg 768w, https:\/\/www.dekuzu.com\/en\/wp-content\/uploads\/2018\/04\/TDM-step-3.jpg 1767w\" sizes=\"auto, (max-width: 758px) 100vw, 758px\" \/><\/a><\/p>\n<p style=\"text-align: justify;\"><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">This step is also propaedeutic to the realization of the goal underlying predictive TDM, which is not just mere extraction of information, but rather knowledge discovery. In addition to the steps discussed above, which consist of identifying the content to use and securing access to it, in most cases stages in text and data mining processes include:<\/span><\/p>\n<ul style=\"text-align: justify; list-style-type: circle;\">\n<li><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Pre-processing of relevant text and data (Stage A);<\/span><\/li>\n<li><span style=\"font-family: 'times new roman', times, serif; font-size: 14pt;\">Extractino of structured data (Stage B).<\/span><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>A new very useful research, requested by policy department for citizens\u2019 rights and constitutional affairs, has been published. The author of research, Eleonora Rosati, has briefly but informative and understandable way outlined the main issues with text and data mining<\/p>\n<div class=\"more-link-wrapper\"><a class=\"more-link\" href=\"https:\/\/www.dekuzu.com\/en\/2018\/05\/technical-aspects-of-text-and-data-mining-research-in-copyright-directive.html\">Continue reading<span class=\"screen-reader-text\">Technical aspects of text and data mining research in copyright directive<\/span><\/a><\/div>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,11,31,22,6,18,20,13],"tags":[],"class_list":["post-2081","post","type-post","status-publish","format-standard","hentry","category-copyright","category-eu","category-exceptions-and-limitations","category-fair-use","category-intellectual-property","category-law","category-legal-proposal","category-research","entry"],"_links":{"self":[{"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/posts\/2081","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/comments?post=2081"}],"version-history":[{"count":0,"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/posts\/2081\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/media?parent=2081"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/categories?post=2081"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dekuzu.com\/en\/wp-json\/wp\/v2\/tags?post=2081"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}