Main

type

0 (not classified)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-11-08 19:50:27

updated at

2025-11-08 19:50:28

Address

url

https://developer.imdb.com/documentation/bulk-data-documentation/data-dictionary/names/

url length

87

url crc

51093

url crc32

2559821717

location type

1 (url matches target location, page_location is empty)

canonical status

10 (verified canonical url)

canonical page id

2935192144

Source

domain id

33932173

domain tld

2211

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280076.69/warc/CC-MAIN-20250809045158-20250809075158-00565.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

18.160.10.95

Publication date

2025-08-09 06:06:44

Fetch attempts

0

Original html size

52743

Normalized and saved size

27628

Content

title

excerpt

content

Data Dictionary - Name

nameId: The unique IMDb ID for the name in question. Each IMDb ID appears exactly once.

remappedTo: Two IMDb IDs can be created for a single entity within our system before IMDb identifies that they actually represent the same person or title. When this happens, we maintain the data associated with both identifiers in the data set, duplicating the data. If there are duplicate name entities for a person, remappedTo provides the IMDb ID of the primary name entity for this person. See “Duplicate IDs” in the “Changes to Entities and Resolving IDs” section of “Key Concepts” for more information.

name: The primary name by which this person is known, usually the one by which they are most often credited. For more information about how IMDb defines the primary name, see the IMDb help site.

awards: A list of awards that this person has won or been nominated for. This includes the name and category of the award, the name and year of the award event, the titles the...
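The remappedTo description above suggests a simple lookup for collapsing duplicate name entities onto their primary ID. The following is a minimal sketch, not IMDb's own tooling: the function name `resolve_primary_id`, the in-memory dict layout, and the sample IDs are all hypothetical, and following a chain of several remappedTo links is an assumption (the excerpt only says that remappedTo gives the primary entity's ID).

```python
from typing import Dict, Optional


def resolve_primary_id(name_id: str, remapped_to: Dict[str, Optional[str]]) -> str:
    """Return the primary nameId for name_id by following remappedTo links.

    remapped_to maps each nameId to its remappedTo value, or None when the
    record is already the primary entity. This layout is an assumption made
    for illustration; the real data set may be shaped differently.
    """
    seen = set()
    current = name_id
    while True:
        if current in seen:
            # Defensive guard: a loop in the links would otherwise never end.
            raise ValueError(f"remappedTo cycle detected at {current}")
        seen.add(current)
        target = remapped_to.get(current)
        if not target:
            # No remappedTo value (or unknown ID): treat current as primary.
            return current
        current = target


# Hypothetical records: the second ID is a duplicate of the first.
sample = {
    "nm0000001": None,
    "nm9999998": "nm0000001",
}

primary = resolve_primary_id("nm9999998", sample)
```

An ID that is missing from the mapping is returned unchanged, so the helper can be applied to every nameId in a data set without a separate existence check.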

author

updated

2025-11-26 23:10:40

Text analysis

block type

0

extracted fields

96

extracted bits

full content
content was extracted heuristically

detected location

0

detected language

1 (English)

category id

Movies and TV series (81)

index version

2025110801

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

3496

text words

711

text unique words

269

text lines

1

text sentences

44

text paragraphs

1

text words per sentence

16

text matched phrases

2

text matched dictionaries

2