{"id":22396,"date":"2024-01-17T09:45:24","date_gmt":"2024-01-17T09:45:24","guid":{"rendered":"https:\/\/www.imprima.com\/?p=22396"},"modified":"2024-02-28T18:59:50","modified_gmt":"2024-02-28T18:59:50","slug":"llm-based-ai-how-accurate-is-it","status":"publish","type":"post","link":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it","title":{"rendered":"LLM-based AI &#8211; how accurate is it"},"content":{"rendered":"\n<p>In our previous post \u201c<strong><a href=\"https:\/\/www.imprima.com\/blog\/should-we-blindly-trust-ai\" target=\"_blank\" rel=\"noreferrer noopener\">Should we blindly trust AI?<\/a><\/strong>\u201d, we discussed the main benefit of an accurate AI tool for information extraction. And that is that it saves the user time, and not that it takes the human out of the loop.<\/p>\n\n\n\n<p>At Imprima we like to be open about how accurate AI can be. Here we share the results from our latest tests.<\/p>\n\n\n\n<p>For this experiment, we had a test set of legal documents, which contained 40 different data points (By \u201cdata points\u201d we mean pieces of key information such as clauses, names and dates, such as: effective date, expiration date, anti-assignment, governing law, exclusivity, non-compete, change of control etc.).<\/p>\n\n\n\n<p><strong>We extracted these datapoints with Imprima\u2019s <a href=\"https:\/\/www.imprima.com\/ai-due-diligence\/automated-contract-summaries\" target=\"_blank\" rel=\"noreferrer noopener\">Smart Summaries<\/a> tool.<\/strong><\/p>\n\n\n\n<p>Here we show the test results for a selection of data points:<\/p>\n\n\n\n<div style=\"height:42px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image size-large\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"567\" src=\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/accuracy-table-1024x567.png\" alt=\"Smart Summaries Accuracy table\" class=\"wp-image-22398\" srcset=\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/accuracy-table-1024x567.png 1024w, https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/accuracy-table-300x166.png 300w, https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/accuracy-table-768x425.png 768w, https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/accuracy-table.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">In this experiment we selected the data points for which the most data was available for verification (at least 75 data points for verification for each unique datapoint), in order to optimise the statistical significance of the experiment. Recall is a measure of how many of the data points are found, where 0% means none were found, and 100% means all were found. A False Positive means a data point that was incorrectly identified as a valid result.<\/figcaption><\/figure>\n\n\n\n<div style=\"height:42px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><Span style=\"font-weight:800\">The main take aways from these test results are:<\/span><\/p>\n\n\n\n<ul>\n<li style=\"padding-bottom: 18px; font-weight: 500;\">A very high recall \u2013 over 90% on average \u2013 is achieved for these data points (as we have discussed in a <a href=\"https:\/\/www.imprima.com\/blog\/smart-vdrs-accuracy\"><strong>previous blog post<\/strong><\/a>, that is higher than with manual information extraction). <\/li>\n\n<li style=\"padding-bottom: 18px; font-weight: 500;\">The number of false positives is very low (ca. 0.05 False Positives per document on average, in other words only 1 False Positive for every 20 pages).<\/li>\n<\/ul>\n\n\n\n<p>Discarding such a low number of false positives wouldn\u2019t take long, especially since the user doesn\u2019t have to search for them. All extracted data points have been identified and highlighted by the Smart Summaries tool and the user will be able to jump directly to them and discard the few incorrect ones.<\/p>\n\n\n\n<p>And as said, the real goal of high accuracy is to save time, and not to take the human out of the loop (and don\u2019t trust any claims that a human in the loop is not necessary for any AI tool to be used to extract and classify information from legal and other documentation).<\/p>\n\n\n\n<div style=\"height:42px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><strong>If you are interested in learning more about how Imprima\u2019s AI tool can help with extracting key information from documents, visit the <a href=\"https:\/\/www.imprima.com\/ai-due-diligence\/automated-contract-summaries\" target=\"_blank\" rel=\"noreferrer noopener\">Smart Summaries<\/a> product page, or <a href=\"https:\/\/www.imprima.com\/contact\/sales\" target=\"_blank\" rel=\"noreferrer noopener\">contact us<\/a> for a demo.<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In our previous post \u201cShould we blindly trust AI?\u201d, we discussed the main benefit of an accurate AI tool for information extraction. And that is that it saves the user time, and not that it takes the human out of the loop. At Imprima we like to be open about how accurate AI can be. [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":22405,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[169,170,163,171],"tags":[],"class_list":["post-22396","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-artificial-intelligence","category-legal","category-ma","category-vdr"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.4 (Yoast SEO v26.4) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Smart Summaries - LLM-based information extraction<\/title>\n<meta name=\"description\" content=\"Accuracy of AI-based information extraction - see how well Imprima&#039;s Smart Summaries tool performs extracting key data points from documents\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"LLM-based AI - how accurate is it\" \/>\n<meta property=\"og:description\" content=\"Accuracy of AI-based information extraction - see how well Imprima&#039;s Smart Summaries tool performs extracting key data points from documents\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\" \/>\n<meta property=\"og:site_name\" content=\"Imprima\" \/>\n<meta property=\"article:published_time\" content=\"2024-01-17T09:45:24+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-02-28T18:59:50+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"648\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Marcus Tolan\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Marcus Tolan\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\"},\"author\":{\"name\":\"Marcus Tolan\",\"@id\":\"https:\/\/www.imprima.com\/#\/schema\/person\/91ac413cefc7cd6c4ec7034f7cef47f2\"},\"headline\":\"LLM-based AI &#8211; how accurate is it\",\"datePublished\":\"2024-01-17T09:45:24+00:00\",\"dateModified\":\"2024-02-28T18:59:50+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\"},\"wordCount\":419,\"publisher\":{\"@id\":\"https:\/\/www.imprima.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg\",\"articleSection\":[\"Artificial Intelligence\",\"Legal\",\"M&amp;A\",\"VDR\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\",\"url\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\",\"name\":\"Smart Summaries - LLM-based information extraction\",\"isPartOf\":{\"@id\":\"https:\/\/www.imprima.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg\",\"datePublished\":\"2024-01-17T09:45:24+00:00\",\"dateModified\":\"2024-02-28T18:59:50+00:00\",\"description\":\"Accuracy of AI-based information extraction - see how well Imprima's Smart Summaries tool performs extracting key data points from documents\",\"breadcrumb\":{\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage\",\"url\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg\",\"contentUrl\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg\",\"width\":1200,\"height\":648,\"caption\":\"AI LLM image\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.imprima.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"LLM-based AI &#8211; how accurate is it\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.imprima.com\/#website\",\"url\":\"https:\/\/www.imprima.com\/\",\"name\":\"Imprima\",\"description\":\"Secure Online Data Room Services by Imprima\",\"publisher\":{\"@id\":\"https:\/\/www.imprima.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.imprima.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.imprima.com\/#organization\",\"name\":\"Imprima\",\"url\":\"https:\/\/www.imprima.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.imprima.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2021\/05\/imprima-logo-new.svg\",\"contentUrl\":\"https:\/\/www.imprima.com\/wp-content\/uploads\/2021\/05\/imprima-logo-new.svg\",\"width\":507.43,\"height\":149.18,\"caption\":\"Imprima\"},\"image\":{\"@id\":\"https:\/\/www.imprima.com\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.imprima.com\/#\/schema\/person\/91ac413cefc7cd6c4ec7034f7cef47f2\",\"name\":\"Marcus Tolan\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Smart Summaries - LLM-based information extraction","description":"Accuracy of AI-based information extraction - see how well Imprima's Smart Summaries tool performs extracting key data points from documents","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it","og_locale":"en_US","og_type":"article","og_title":"LLM-based AI - how accurate is it","og_description":"Accuracy of AI-based information extraction - see how well Imprima's Smart Summaries tool performs extracting key data points from documents","og_url":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it","og_site_name":"Imprima","article_published_time":"2024-01-17T09:45:24+00:00","article_modified_time":"2024-02-28T18:59:50+00:00","og_image":[{"width":1200,"height":648,"url":"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg","type":"image\/jpeg"}],"author":"Marcus Tolan","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Marcus Tolan","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#article","isPartOf":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it"},"author":{"name":"Marcus Tolan","@id":"https:\/\/www.imprima.com\/#\/schema\/person\/91ac413cefc7cd6c4ec7034f7cef47f2"},"headline":"LLM-based AI &#8211; how accurate is it","datePublished":"2024-01-17T09:45:24+00:00","dateModified":"2024-02-28T18:59:50+00:00","mainEntityOfPage":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it"},"wordCount":419,"publisher":{"@id":"https:\/\/www.imprima.com\/#organization"},"image":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage"},"thumbnailUrl":"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg","articleSection":["Artificial Intelligence","Legal","M&amp;A","VDR"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it","url":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it","name":"Smart Summaries - LLM-based information extraction","isPartOf":{"@id":"https:\/\/www.imprima.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage"},"image":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage"},"thumbnailUrl":"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg","datePublished":"2024-01-17T09:45:24+00:00","dateModified":"2024-02-28T18:59:50+00:00","description":"Accuracy of AI-based information extraction - see how well Imprima's Smart Summaries tool performs extracting key data points from documents","breadcrumb":{"@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#primaryimage","url":"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg","contentUrl":"https:\/\/www.imprima.com\/wp-content\/uploads\/2024\/01\/V2C.jpg","width":1200,"height":648,"caption":"AI LLM image"},{"@type":"BreadcrumbList","@id":"https:\/\/www.imprima.com\/blog\/llm-based-ai-how-accurate-is-it#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.imprima.com\/"},{"@type":"ListItem","position":2,"name":"LLM-based AI &#8211; how accurate is it"}]},{"@type":"WebSite","@id":"https:\/\/www.imprima.com\/#website","url":"https:\/\/www.imprima.com\/","name":"Imprima","description":"Secure Online Data Room Services by Imprima","publisher":{"@id":"https:\/\/www.imprima.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.imprima.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.imprima.com\/#organization","name":"Imprima","url":"https:\/\/www.imprima.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.imprima.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.imprima.com\/wp-content\/uploads\/2021\/05\/imprima-logo-new.svg","contentUrl":"https:\/\/www.imprima.com\/wp-content\/uploads\/2021\/05\/imprima-logo-new.svg","width":507.43,"height":149.18,"caption":"Imprima"},"image":{"@id":"https:\/\/www.imprima.com\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.imprima.com\/#\/schema\/person\/91ac413cefc7cd6c4ec7034f7cef47f2","name":"Marcus Tolan"}]}},"_links":{"self":[{"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/posts\/22396","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/comments?post=22396"}],"version-history":[{"count":0,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/posts\/22396\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/media\/22405"}],"wp:attachment":[{"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/media?parent=22396"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/categories?post=22396"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.imprima.com\/wp-json\/wp\/v2\/tags?post=22396"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}