Main

type

0 (not classified)

status

30 (imported + raw text content deleted)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-04 23:09:57

updated at

2025-11-04 23:09:59

Address

url

https://1918redsox.com/pedro/archive/0503b.htm

url length

46

url crc

40982

url crc32

1032495126

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2906316237

Source

domain id

89948473

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280115.80/warc/CC-MAIN-20250809202940-20250809232940-00683.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

104.21.37.230

Publication date

2025-08-09 21:08:38

Fetch attempts

0

Original html size

59260

Normalized and saved size

52354

Content

title

News about Pedro Martinez, Red Sox pitcher, for the 2003 Boston Red Sox season (May 16-31)

excerpt

content

News about Pedro Martinez, Red Sox pitcher, for the 2003 Boston Red Sox season (May 16-31)
pedro martinez
News Archive for May 16-31, 2003
Older links may no longer work.
May 31, 2003
Roger Clemens is such a tool. He told the New York Times: “It’s pretty simple, the way I look at it. I became a Hall of Famer here [New York], with my numbers here and what I’ve done here … When Duquette said that I was done [which he actually didn’t do; see May 23 below], if I’d have taken his advice and went home, I wouldn’t have been a Hall of Famer. …”
Travis Nelson’s Boy of Summer blog posted the numbers (and offered some nice perspective):
Yrs  TM   GS   CG   CG%   SHO  IP       IP/GS  BB/9  SO    K/9   W    L    W/Yr  W%    ERA   *ERA+
13   BOS  382  100  26.2  38   2776.00  7.27   2.78  2590  8.4   192  111  14.8  63.4  3.06  151
2    TOR  67   14   20.9  6    498.67   7.44   2.82  563   10.2  41   13   20.5  75.9  2.33  203
5    NYY  133  2    1.5   ...

author

updated

1766874407

Text analysis

block type

0

extracted fields

233

extracted bits

featured image
title
full content
content was extracted heuristically
OpenGraph suggests this is an article

detected location

0

detected language

1 (English)

category id

227

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

20469

text words

5064

text unique words

1652

text lines

1

text sentences

242

text paragraphs

1

text words per sentence

20

text matched phrases

9

text matched dictionaries

4