Main

type

5 (blog/news article)

status

21 (imported old-v2, waiting for another import)

review version

0

cleanup version

0

pending deletion

0 (-)

created at

2025-12-10 18:16:51

updated at

2025-12-10 18:16:52

Address

url

https://sindimaq.org.br/sindimaq/blog/cat/60

url length

44

url crc

12438

url crc32

1645293718

location type

1 (url matches target location, page_location is empty)

canonical status

2 (missing canonical tag in html)

canonical page id

-

Source

domain id

294437885

domain tld

76

domain parts

0

originating warc id

-

originating url

https://data.commoncrawl.org/crawl-data/CC-MAIN-2025-33/segments/1754151279847.20/warc/CC-MAIN-20250805032822-20250805062822-00872.warc.gz

source type

11 (CommonCrawl)

Server response

server ip

3.171.61.21

Publication date

2025-08-05 04:10:43

Fetch attempts

0

Original html size

27722

Normalized and saved size

21458

Content

title

Blog - SINDIMAQ

excerpt

content

Os mecanismos alternativos de soluções de controvérsias tributárias tiveram papel importante nesses últimos dois anos, principalmente por conta da pandemia que enfrentamos, ocasionando sensíveis impactos financeiros nas empresas que do dia para a noite se tornaram devedores perante o fisco.
 N...

author

updated

1767234632

Text analysis

block type

0

extracted fields

104

extracted bits

title
full content
content was extracted heuristically

detected location

0

detected language

10 (Portuguese)

category id

Koronawirus (17)

index version

2025123101

paywall score

0

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

250

text words

41

text unique words

39

text lines

1

text sentences

2

text paragraphs

1

text words per sentence

20

text matched phrases

1

text matched dictionaries

2