Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-10-23 12:55:31

updated at

2025-10-23 12:55:32

Address

url

https://www.cs.jhu.edu/news/ai-lawyers-need-to-hit-the-books-study-finds-llms-flunk-law-101/

url length

92

url crc

50075

url crc32

3655713691
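The three url fields above can be cross-checked against each other. The sketch below is illustrative only: the constant and function names are hypothetical and not part of the record. It checks that the recorded url length matches the url string, and that the 16-bit `url crc` value equals the low 16 bits of the recorded `url crc32`, which holds for these numbers. Whether the indexer computes `url crc32` with the standard zlib CRC-32 over the UTF-8 bytes of the url is an assumption, so that comparison is reported rather than asserted.

```python
import zlib

# Values copied from the record above.
URL = "https://www.cs.jhu.edu/news/ai-lawyers-need-to-hit-the-books-study-finds-llms-flunk-law-101/"
RECORDED_LENGTH = 92
RECORDED_CRC = 50075
RECORDED_CRC32 = 3655713691


def low16(value: int) -> int:
    """Return the low 16 bits of an unsigned integer."""
    return value & 0xFFFF


def check_record() -> dict:
    """Cross-check the url length, url crc, and url crc32 fields."""
    return {
        # Character count of the url string vs. the recorded length.
        "length_matches": len(URL) == RECORDED_LENGTH,
        # The short crc compared with the low 16 bits of the crc32 field.
        "crc_is_low16_of_crc32": low16(RECORDED_CRC32) == RECORDED_CRC,
        # Assumption: the indexer may use a different polynomial, seed,
        # or url normalization, so this comparison may be False.
        "zlib_crc32_matches": zlib.crc32(URL.encode("utf-8")) == RECORDED_CRC32,
    }


if __name__ == "__main__":
    for name, ok in check_record().items():
        print(f"{name}: {ok}")
```

If the last check prints False, the recorded value likely comes from a different CRC variant or a normalized form of the url; the first two checks do not depend on that assumption.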

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2802737439

Source

domain id

491243395

domain tld

2295

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280903.0/warc/CC-MAIN-20250811192912-20250811222912-00999.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

128.220.13.64

Publication date

2025-08-11 19:55:26

Fetch attempts

0

Original html size

117765

Normalized and saved size

86871

Content

title

AI lawyers need to hit the books: Study finds LLMs flunk Law 101

excerpt

content

People are beginning to worry that AI is coming for their jobs, but lawyers and paralegals shouldn’t be concerned just yet, according to a new study by Johns Hopkins researchers. Despite OpenAI’s claim that ChatGPT has passed the bar exam, a Hopkins team including Andrew Blair-Stanek, a fifth-year PhD student in the Whiting School of Engineering’s Department of Computer Science, has shown in a series of experiments that the most powerful large language models, or LLMs, can’t even perform basic legal tasks correctly. “We find LLMs are like very sloppy paralegals,” says Blair-Stanek, who is also a professor at the University of Maryland Francis King Carey School of Law. Blair-Stanek first noticed that LLMs struggled with basic legal text retrieval while working to use AI to identify tax shelters. When the most powerful LLMs failed to accurately retrieve text from specific citations, and even failed at the same kind of retrieval in de...

author

updated

1762854376

Text analysis

block type

0

extracted fields

233

extracted bits

featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

AI applications (149)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

3330

text words

646

text unique words

332

text lines

1

text sentences

22

text paragraphs

1

text words per sentence

29

text matched phrases

11

text matched dictionaries

3