id
name
processing priority
2
site type
0 (generic, awaiting analysis)
review version
11
html import
20 (imported)
first seen date
2024-10-13 06:44:07
expired found date
-
created at
2024-10-13 06:44:06
updated at
2024-10-13 06:44:07
length
13
crc
56410
tld
2211
nm parts
0
nm random digits
0
nm rare letters
0
is subdomain of id
54631194 (moeci.com)
previous id
0
replaced with id
0
related id
-
dns primary id
0
dns alternative id
0
lifecycle status
0 (unclassified, or currently active)
deleted subdomains
0
page imported products
0
page imported random
0
page imported parking
0
count skipped due to recent timeouts on the same server IP
0
count content received but rejected due to 11-799
0
count dns errors
0
count cert errors
0
count timeouts
0
count http 429
0
count http 404
0
count http 403
0
count http 5xx
0
next operation date
-
server bits
-
server ip
-
mp import status
20
mp rejected date
-
mp saved date
-
mp size orig
55037
mp size raw text
3776
mp inner links count
10
mp inner links status
10 (links queued, awaiting import)
title
yiyun's Blog
description
Data Science, Machine Learning, Fullstack Web Developer
image
site name
yiyun's Blog
author
yiyun
updated
2026-03-02 16:11:51
raw text
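yiyun's Blog. The foolish one is me. RFM Analysis: Determining Users' Core Value, and Other Notes. Posted 2024-07-20, updated 2024-07-24, in Data Analysis. Word count: 3.2k. Reading time ≈ 12 min. Introduction: RFM (Recency, Frequency, Monetary) analysis is essentially an "extension" of user profiling. It uses three metrics (a user's most recent purchase, purchase frequency, and purchase amount) to divide users into categories or clusters that describe their value. CSS Double-Wing Layout. Posted 2024-01-05, updated 2024-01-09, in Frontend, HTML/CSS/JavaScript. Word count: 474. Reading time ≈ 2 min. Introduction: Interview question: write a left-center-right layout that fills the screen, where the left and right blocks have a fixed width of 200 and the center block is fluid; the center block must load first. Write out the structure and styles. Web Frontend | Anti-Scraping | Font-Based Anti-Scraping. Posted 2023-10-02, updated 2024-01-14, in Frontend, Anti-Scraping. Word count: 4.6k. Reading time ≈ 17 min. Introduction: Font-based anti-scraping is essentially a character-set mapping: the Unicode code points of the original characters are shifted (as reflected in a custom font file), and the page no longer shows the original text characters but instead uses the corresponding HTML character entities from the custom font file. Example: 志愿军:雄兵出击 (tickets, synopsis, cast and crew, gallery; Maoyan Movies). Even inspecting the element in the browser does not reveal the original characters, so even a headless browser driven by a tool such as Selenium cannot obtain the original text directly. 《深入架构原理与实践》 | Reading Notes. Posted 2023-0...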
redirect type
0 (-)
block type
0 (no issues)
detected language
126 (language undetectable (empty document, too short, or engines disagree))
category id
index version
1
spam phrases
0
text nonlatin
1566
text cyrillic
0
text characters
2390
text words
477
text unique words
302
text lines
231
text sentences
1
text paragraphs
2
text words per sentence
255
text matched phrases
0
text matched dictionaries
0
links self subdomains
0
links other subdomains
links other domains
14 - moeci.com, maoyan.com, hexo.io, upyun.com
links spam adult
0
links spam random
0
links spam expired
0
links ext activities
0
links ext ecommerce
0
links ext finance
0
links ext crypto
0
links ext booking
0
links ext news
0
links ext leaks
0
links ext ugc
2
links ext klim
0
links ext generic
2
dol status
0
dol updated
2026-03-02 16:11:51
rss path
rss status
1 (priority 1 already searched, no matches found)
rss found date
-
rss size orig
0
rss items
0
rss spam phrases
0
rss detected language
0 (awaiting analysis)
inbefore feed id
-
inbefore status
0 (new)
sitemap path
sitemap status
1 (priority 1 already searched, no matches found)
sitemap review version
2
sitemap urls count
0
sitemap urls adult
0
sitemap filtered products
0
sitemap filtered videos
0
sitemap found date
-
sitemap process date
-
sitemap first import date
-
sitemap last import date
-