id
type
0 (not classified)
status
21 (imported old-v2, waiting for another import)
review version
0
cleanup version
0
pending deletion
0 (-)
created at
2025-11-08 19:50:27
updated at
2025-11-08 19:50:28
url
https://developer.imdb.com/documentation/bulk-data-documentation/data-dictionary/names/
url length
87
url crc
51093
url crc32
2559821717
location type
1 (url matches target location, page_location is empty)
canonical status
10 (verified canonical url)
canonical page id
domain id
domain tld
2211
domain parts
0
originating warc id
-
originating url
https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151280076.69/warc/CC-MAIN-20250809045158-20250809075158-00565.warc.gz
source type
11 (CommonCrawl)
server ip
Publication date
2025-08-09 06:06:44
Fetch attempts
0
Original html size
52743
Normalized and saved size
27628
title
excerpt
content
Data Dictionary - Name nameId The unique IMDb ID for the name in question. Each IMDb ID appears exactly once. remappedTo It is possible for two IMDb IDs to be created for a single entity within our system before IMDb identifies that they actually represent the same person or title. When this happens, we maintain the data associated with both identifiers in the data set, duplicating the data. If there are duplicate name entities for a person, remappedTo provides the IMDb ID of the primary name entity for this person. See “Duplicate IDs” in the “Changes to Entities and Resolving IDs” section of “Key Concepts” for more information. name The primary name by which this person is known, usually the one by which they are most often credited. For more information about how IMDb defines the primary name, see the IMDb help site. awards A list of awards that this person has won or been nominated for. This includes the name and category of the award, the name and year of the award event, the titles the...
author
updated
2025-11-26 23:10:40
block type
0
extracted fields
96
extracted bits
full content
content was extracted heuristically
detected location
0
detected language
1 (English)
category id
index version
2025110801
paywall score
0
spam phrases
0
text nonlatin
0
text cyrillic
0
text characters
3496
text words
711
text unique words
269
text lines
1
text sentences
44
text paragraphs
1
text words per sentence
16
text matched phrases
2
text matched dictionaries
2
links self subdomains
0
links other subdomains
0
links other domains
0
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
6
links ext leaks
0
links ext ugc
0
links ext klim
0
links ext generic
0
status
0
updated
2025-11-26 23:10:40
image author
featured image