id
name
processing priority
4
site type
0 (generic, awaiting analysis)
review version
11
html import
20 (imported)
first seen date
2024-02-05 06:39:17
expired found date
-
created at
2024-06-07 16:14:09
updated at
2025-12-31 01:53:30
length
9
crc
3129
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
-
previous id
0
replaced with id
0
related id
-
dns primary id
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
—
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
55208
mp size raw text
3895
mp inner links count
6
mp inner links status
20 (imported)
title
yiyun's Blog
description
Data Science, Machine Learning, Fullstack Web Developer
image
site name
yiyun's Blog
author
yiyun
updated
2025-12-19 04:52:42
raw text
yiyun's Blog yiyun's Blog 愚蠢的是我 首页 分类 标签 归档 关于 作品集 友链 RSS 搜索 0% Theme NexT works best with JavaScript enabled CSS 双飞翼布局 发表于 2024-01-05 更新于 2024-01-09 分类于 前端 , HTML/CSS/JavaScript 本文字数: 474 阅读时长 ≈ 2 分钟 引言 面试题:写一个左中右布局占满屏幕,其中左、右俩块固定宽 200,中间自适应宽,要求先加载中间块,请写出结构及样式。 阅读全文 » Web 前端 | 反爬虫 | 字体反爬 发表于 2023-10-02 更新于 2024-01-14 分类于 前端 , 反爬虫 本文字数: 4.6k 阅读时长 ≈ 17 分钟 引言 字体反爬 本质就是字符集映射, 将原本字符的 Unicode 码进行偏移(体现在自己的字体文件中), 而网页中不再展示原文本字符, 而是使用自定义字体文件中对应的 HTML 字符实体 案例 - 志愿军:雄兵出击_购票_剧情介绍_演职人员_图集_猫眼电影 可以发现即使浏览器审查元素也无法获取原文本字符, 这样即使使用 Selenium 等 Browser Headless 也无法直接获取原文本字符 阅读全文 » 《深入架构原理与实践》| 读书笔记 发表于 2023-09-30 更新于 2023-11-15 分类于 后端 , 架构 本文字数: 2.7k 阅读时长 ≈ 10 分钟 引言 随着云计算的兴起,技术架构的关注点也从集群可用性治理,发展到云原生和 FinOps 成本管理。 该书涵盖了网络、容器、网关、微服务与分布式、云原生、质量监测和成本管理方面的内容,帮助读者快速理清云时代下的技术架构体系。 本笔记大多为个人理解后的知识点, 仅供参考 ...
redirect type
0 (-)
block type
0 (no issues)
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
index version
2025110801
spam phrases
0
text nonlatin
1532
text cyrillic
0
text characters
2430
text words
498
text unique words
308
text lines
249
text sentences
1
text paragraphs
2
text words per sentence
255
text matched phrases
6
text matched dictionaries
4
links self subdomains
0
links other subdomains
links other domains
3 - maoyan.com, hexo.io, upyun.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
2
links ext klim
0
links ext generic
2
dol status
0
dol updated
2025-12-19 04:52:42
rss path
rss status
3 (priority 3 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
30 (processing completed, results pushed to table crawler_sitemaps.ext_domain_sitemap_lists)
sitemap review version
1
sitemap urls count
169
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
2024-02-14 17:55:38
sitemap process date
2024-12-07 21:26:53
sitemap first import date
-
sitemap last import date
-