beckerfuffle.com - NetAtlas

Main

id

98179444

name

beckerfuffle.com · homepage snapshot

processing priority

4

site type

0 (generic, awaiting analysis)

review version

11

html import

27 (unknown)

Events

first seen date

2024-02-12 07:14:40

expired found date

-

created at

2024-06-16 21:04:32

updated at

2025-05-01 01:24:54

Domain name statistics

length

16

crc

62807

tld

2211

nm parts

0

nm random digits

0

nm rare letters

0

Connections

is subdomain of id

-

previous id

0

replaced with id

0

related id

-

dns primary id

28195159

dns alternative id

0

lifecycle status

0 (unclassified, or currently active)

Subdomains and pages

deleted subdomains

0

page imported products

0

page imported random

0

page imported parking

0

Error counters

count skipped due to recent timeouts on the same server IP

0

count content received but rejected due to 11-799

0

count dns errors

0

count cert errors

7

count timeouts

0

count http 429

0

count http 404

0

count http 403

0

count http 5xx

0

next operation date

2025-05-15 01:24:54

Server

server bits

—

server ip

-

Mainpage statistics

mp import status

27

mp rejected date

-

mp saved date

-

mp size orig

66051

mp size raw text

31671

mp inner links count

8

mp inner links status

10 (links queued, awaiting import)

Open Graph

title

description

I'm a Senior Data Scientist at Penn Medicine where I'm building machine learning systems to improve patient outcomes by providing real-time predictive applications that empower clinicians to identify

image

site name

author

Michael Becker

updated

2026-02-25 13:57:03

raw text

Beckerfuffle Blog Archives Talks About Beckerfuffle Go fuffle yourself! Nov 24, 2014 Comments community conferences open source pydata python PyData NYC: The Really Short Version Here are my notes from PyData with links for more details. This isn’t a complete list, and in some cases my notes don’t really do justice to the actual talks, but I hope that these will be helpful to anyone who’s feeling PyData FOMO until the videos are released. Disclaimer: I took almost no notes on the second day so a bunch of my favorite talks are missing. High Performance Text Processing with Rosetta This library is a highly optimized NLP library with a focus on memory efficiency. TextFileStreamer - provides a streaming tokenizer. Good for memory efficiency. DBStreamer - Ditto but for data in a DB. Can be easily combined with online learning methods. An IPython notebook with example code can be found here . Python in the Hadoop/Spark Ecosys...

Text analysis

redirect type

0 (-)

block type

0 (no issues)

detected language

1 (English)

category id

Zastosowania AI (149)

index version

1

spam phrases

0

Text statistics

text nonlatin

0

text cyrillic

0

text characters

24277

text words

5226

text unique words

1493

text lines

606

text sentences

276

text paragraphs

69

text words per sentence

18

text matched phrases

0

text matched dictionaries

0

Link statistics

links self subdomains

0

links other subdomains

28 - nbviewer.ipython.org, blaze.pydata.org, numba.pydata.org, storm.apache.org, kafka.apache.org, libcloud.apache.org, blog.sashalaundy.com, csvkit.readthedocs.org, scikit-learn-laboratory.readthedocs.org, docs.python.org, wiki.python.org, us.pycon.org, mail.python.org, engineering.aweber.com, blog.liveramp.com

links other domains

71 - pydata.org, pythonhosted.org, cloudera.com, continuum.io, checkgermany.de, pyvideo.org, hilarymason.com, dataists.com, aweber.jobs, scrapy.org, scikit-learn.org, aweber.com, fperez.org, jesstess.com, python.org, jvns.ca, plot.ly, camdp.com, ogrisel.com, goo.gl, lxml.de, crummy.com, nltk.org, octopress.org, alexgaribay.com

links spam adult

0

links spam random

0

links spam expired

0

links ext activities

6

links ext ecommerce

0

links ext finance

0

links ext crypto

0

links ext booking

0

links ext news

0

links ext leaks

0

links ext ugc

32 - linkedin.com, twitter.com, en.wikipedia.org, meta.wikimedia.org, dumps.wikimedia.org

links ext klim

0

links ext generic

1

dol status

0

dol updated

2026-02-25 13:57:03

RSS

rss path

rss status

0 (new)

rss found date

-

rss size orig

0

rss items

0

rss spam phrases

0

rss detected language

0 (awaiting analysis)

inbefore feed id

-

inbefore status

0 (new)

Sitemap

sitemap path

sitemap status

0 (new)

sitemap review version

1

sitemap urls count

0

sitemap urls adult

0

sitemap filtered products

0

sitemap filtered videos

0

sitemap found date

-

sitemap process date

-

sitemap first import date

-

sitemap last import date

-