UPDATE (February 4, 2024): This is the discussion about this project on HN: here. Please read in particular @dang's comment regarding the core assumption of this project: here. On a personal note, the number of Stories removed yesterday (Saturday, February 3, 2024) was the lowest ever recorded by the service, and that count includes 2 duplicate Stories. As a side note, always check whether a Story in the list is a duplicate: that is a perfectly legitimate reason for removal, and unfortunately the service has no way of detecting it automatically!
The purpose of this project is to try to understand the type and scale of the moderation of the Hacker News Front Page.
NOTE: I love Hacker News. I try to read it every day. In the case of OnnxStream (here for example), 95% of the comments were helpful and intelligent. I also understand that moderating a site with huge traffic and where users are basically anonymous must be a very difficult task.
Returning to the purpose of this project: from what I have been able to see, the "public" (i.e. observable from the outside) moderation of the Front Page relies on two main tools: changing the title of a Story (which, intentionally or not, affects how its rank grows) and removing the Story outright.
Regarding the first type of moderation, an excellent site is already available that tracks changes to Story titles. Here instead I will focus on the second type.
For the reasons explained in the "Why?" section below, I developed, for personal use, a small application that logs every Story removed from the Front Page. I later discovered that no tool/website provides this type of information, so I decided to make it public here. It was a difficult decision, but my rationale is: is it better to have more transparency or less transparency?
If you know of a tool/website similar to this, please let me know: I will archive this repo or set it to private.
One very positive outcome for this project would be a list similar to this one, but available directly among the HN lists. Another would be for HN to notify a user when a Story is penalized on the Front Page, perhaps indicating the number of flags and/or the reason.
Why? (Feel free to skip this part.)
A friend of mine posted two Stories on Hacker News related to OnnxStream (31 days apart), the first related to SDXL Turbo support and the second related to TinyLlama and Mistral 7B support.
In the case of the first, the Story was among the first on the Front Page, until its title was changed from "Stable Diffusion Turbo on a Raspberry Pi Zero 2 generates an image in 29 minutes" to "OnnxStream: Stable Diffusion XL 1.0 Base on a Raspberry Pi Zero 2". This effectively "killed" the Story. One user pointed out that the new title didn't reflect the spirit of the Story (thanks @practice9).
In the case of the second, the Story was in third place on the Front Page, less than an hour after the submission. In this case it was simply removed from the Front Page.
Perplexed by this discovery, I sent an email to the moderator. @dang, who was very kind and quick to respond, explained that the Story had been flagged by users even though it never showed the explicit [flagged] marker, and that he could therefore only guess at the reasons for the flags. His hypothesis was that (some?) users might be fed up with news related to LLMs.
While I have no reason to doubt Daniel's good faith, it's hard to believe that HN users would be tired of LLM-related news.
So I decided to develop a small console application to determine the frequency of this phenomenon (actually I was also motivated by the prospect of writing some C# code, after more than 2 years of complete abstinence). I subsequently discovered that there were no tools/websites that monitored this specific phenomenon and I therefore decided to make it public here.
Using the official HN API, the service fetches 90 Top Stories every minute and makes a comparison with the first 30 Top Stories (i.e. the Front Page) fetched the previous minute. It logs all missing Stories here. The assumption is that a Story cannot go from the top 30 to a position greater than 90 in a single minute, without having been explicitly removed. If a Story reappears on the Front Page, it is removed from this log. All Stories present in the second-chance pool are excluded from the log. Title and URL are those from when the Story first appeared in the top 30. The number of points and comments and the rank are those from when the Story was removed from the Front Page. The ID points to the news.social-protocols.org page for that Story, which provides a graph of the Story's position on the Front Page over time.
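The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the service's actual code (which is written in C#): the function names `fetch_top_ids`, `find_removed` and `update_log` are hypothetical, the `topstories.json` endpoint of the official HN API is assumed to be the one the service polls, and the set of second-chance Story IDs is taken as an input because the text does not say how the service obtains it.

```python
import json
import urllib.request

# Assumption: the service polls this endpoint of the official HN API.
TOP_STORIES_URL = "https://hacker-news.firebaseio.com/v0/topstories.json"
FRONT_PAGE_SIZE = 30  # Stories that make up the Front Page
WINDOW_SIZE = 90      # Stories fetched on every poll


def fetch_top_ids(limit=WINDOW_SIZE):
    """Return the IDs of the current top `limit` Stories, in rank order."""
    with urllib.request.urlopen(TOP_STORIES_URL, timeout=10) as resp:
        return json.load(resp)[:limit]


def find_removed(prev_top, curr_top, second_chance=frozenset()):
    """Return (id, rank) pairs for Stories that were in the previous top 30
    but are missing from the current top 90.

    `rank` is the 1-based position the Story held in the previous fetch.
    `second_chance` is a hypothetical set of IDs to exclude from the result.
    """
    window = set(curr_top[:WINDOW_SIZE])
    removed = []
    for rank, story_id in enumerate(prev_top[:FRONT_PAGE_SIZE], start=1):
        if story_id not in window and story_id not in second_chance:
            removed.append((story_id, rank))
    return removed


def update_log(log, prev_top, curr_top, second_chance=frozenset()):
    """Apply one polling step to `log`, a dict mapping Story ID to last rank."""
    # A Story that is back on the Front Page is dropped from the log.
    for story_id in curr_top[:FRONT_PAGE_SIZE]:
        log.pop(story_id, None)
    # Stories that vanished from the 90-Story window are recorded.
    for story_id, rank in find_removed(prev_top, curr_top, second_chance):
        log[story_id] = rank
    return log
```

A polling loop would keep the previous result of `fetch_top_ids()`, call `update_log(log, prev, curr)` once a minute, and then set `prev = curr`. Comparing the top 30 against a wider window of 90 is what keeps a Story that merely slipped down the ranking from being logged as removed.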
NOTE: always check whether a Story is a duplicate: that is a perfectly legitimate reason for removal, and unfortunately the service has no way of detecting it automatically!
- 42104374 #2 14 points 0 comments -> The mirror of fascism in big tech
- 42103910 #7 15 points 15 comments -> Who Would Support Deploying the Military to Domestic Protests?
- 42105398 #27 11 points 0 comments -> Australian scientists thought to be on the verge of curing paralysis
- 42104762 #12 211 points 80 comments -> Apple threatened workers over their talk about pay and remote work, feds charge
- 42100876 #15 220 points 64 comments -> JVM Anatomy Quarks
- 42103761 #18 136 points 188 comments -> Standing Desk Might Be as Bad as Sitting All Day
- 42105508 #18 40 points 15 comments -> Mattel apologises after Wicked movie dolls link to porn site on packaging
- 42105835 #19 23 points 4 comments -> Inside M4 chips: P cores
- 42106724 #18 -> "The Hatpin Peril" Terrorized Men Who Couldn't Handle the 20th-Century Woman
- 42106504 #25 7 points 0 comments -> A sit-down with Ubuntu founder Mark 'SABDFL' Shuttleworth
- 42107120 #13 20 points 8 comments -> Memecoins Are the Point
- 42107157 #5 67 points 2 comments -> AI protein-prediction tool AlphaFold3 is now open source
- 42107966 #25 3 points 0 comments -> Traditional Japanese Salted Aged Salmon Recipe for 1,000 Years [video]
- 42108247 #24 4 points 0 comments -> Tariffs Won't Work, Just as McKinley's Didn't
- 42030233 #18 12 points 1 comments -> Data movement bottlenecks to large-scale model training: Scaling past 1e28 FLOP
- 42109712 #24 13 points 4 comments -> LICENSE.TXT [video]
- 42110357 #29 6 points 0 comments -> Foundry worker puts wet scrap metal into furnace
- 42112154 #8 2 points 0 comments -> Racketeer Influenced and Corrupt Organizations Act
- 42073712 #22 7 points 0 comments -> Harnessing Vision for Computation (2008) [pdf]
- 42112945 #10 16 points 11 comments -> Der Spiegel Proclaims the End of the West
- 42111597 #15 78 points 45 comments -> Git and Jujutsu: In Miniature
- 42112005 #22 166 points 17 comments -> This morning for no obvious reason, I remembered the Fuel Rats
- 42032451 #17 116 points 32 comments -> I2P Anonymous Network
- 42113398 #28 25 points 1 comments -> Nearly 70% of Gaza war dead are women and children, UN says
- 42113296 #21 39 points 4 comments -> I Will Always Be Angry About Software Engineering
- 42075115 #10 42 points 11 comments -> FLUX1.1 [Pro] Ultra and Raw Modes
- 42115578 #19 37 points 40 comments -> Genetic Discrimination Is Coming for Us All
- 42116220 #27 3 points 1 comments -> Solanine
- 42116967 #2 76 points 31 comments -> US Senate to Revive Software Patents with PERA Bill Vote on Thursday
- 42081729 #17 5 points 0 comments -> Method for the determination of the luminance of two-photon vision stimuli
- 42072293 #26 17 points 3 comments -> Product Quality Through Change Management
- 42115453 #23 7 points 19 comments -> Show HN: We built the simplest Online Unit Converter for everyday use
- 42116936 #29 11 points 3 comments -> North Korean hackers create Flutter apps to bypass macOS security
- 42081010 #14 24 points 40 comments -> How do you juggle WFH with a baby?
- 42121122 #3 17 points 6 comments -> Nutrient Density and Other Stupid Magic Words
- 42119733 #25 30 points 25 comments -> Review: The fastest of the M4 MacBook Pros might be the least interesting one
- 42122113 #29 9 points 1 comments -> Why Boeing Killed DEI
- 42123070 #24 48 points 22 comments -> Cock.li Is on Red Alert
- 42122434 #16 130 points 57 comments -> Micron Launches First 60TB PCIe Gen5 SSD with 12GB/S Read Speeds
- 42120311 #17 522 points 519 comments -> M4 Mac mini's efficiency is incredible
- 42119141 #14 105 points 34 comments -> Voice acting in Space Quest V
- 42060081 #14 20 points 4 comments -> The Product-Market Fit Scale
- 42122740 #28 36 points 1 comments -> Backblaze Drive Stats for Q3 2024
- 42125275 #23 5 points 2 comments -> Forth May Be the Answer
- 42124409 #24 14 points 4 comments -> OpenAI and others seek new path to smarter AI as current methods hit limitations
- 42075663 #16 9 points 0 comments -> A comic uses fluid dynamics to explain how groups of people move
- 42125720 #27 3 points 0 comments -> Airlines Hate 'Skiplagging.' Meet the Man Who Helps Travelers Pull It Off
- 42121548 #22 86 points 129 comments -> Manjaro is experimenting with opt-out telemetry
- 42093090 #24 10 points 0 comments -> How a neuron learns
- 42126384 #29 8 points 0 comments -> US Congress to hold second hearing on UFOs in attempt to 'pull back curtain'
- 42127161 #28 21 points 22 comments -> Journalism's fight for survival in a postliterate democracy
- 42125989 #21 6 points 1 comments -> Show HN: 8arms – Marketing Campaign Generation
- 42127304 #14 25 points 41 comments -> Just Eat Is Selling Grubhub to Marc Lore's Wonder for $650M
- 42127877 #13 9 points 2 comments -> Codeium launches Windsurf – the first agentic IDE
- 42129080 #8 5 points 1 comments -> Why Sabine Hossenfelder Is Just Wrong
- 42129791 #17 33 points 23 comments -> FBI seizes Polymarket CEO's phone, electronics
- 42129779 #11 8 points 3 comments -> Show HN: I Built Tinder, but for Entrepreneurs and Investors
- 42133026 #21 -> 14yo suicides after Santa Clara schoolmates bully him about being homeless
- 42133624 #13 145 points 41 comments -> GOG's Preservation Program Is the DRM-Free Store Refocusing on the Classics
- 42133980 #16 80 points 33 comments -> WebVM 2.0: A complete Linux Desktop Environment in the browser via WebAssembly
- 42108145 #20 166 points 47 comments -> New elliptic curve breaks 18-year-old record
- 42128438 #15 170 points 21 comments -> The Beginner's Guide to Visual Prompt Injections
- 42135277 #25 7 points 6 comments -> Amazon Haul–a broad selection of products $20 or less
- 42110664 #25 67 points 5 comments -> Debian Junior Desktop live image
- 42086289 #15 63 points 65 comments -> Choose Boring Technology and LLMs
- 42135358 #26 12 points 9 comments -> I Regret Ever Open Sourcing Java – Jill Malony Ratkevic
- 42094889 #26 142 points 30 comments -> The number given as % CPU in Activity Monitor
- 42135531 #28 5 points 1 comments -> Hosting a hobby project without credit card and How I did it
- 42134366 #16 59 points 122 comments -> Async Await Is the Worst Thing to Happen to Programming
- 42136095 #18 10 points 2 comments -> FBI raids home and seizes phone of Polymarket founder
- 42136210 #21 4 points 0 comments -> How the Ivy League Broke America
- 42136705 #3 15 points 1 comments -> Not Satire: The Onion Acquires Infowars
- 42136327 #10 105 points 34 comments -> The Onion wins Alex Jones' Infowars in bankruptcy auction
- 42138384 #12 4 points 4 comments -> How AI is beating VCs in their own game
- 42138976 #19 24 points 10 comments -> Mark Zuckerberg Wants to Feed You More AI Slop
- 42138319 #7 93 points 1 comments -> AI Progress Stalls as OpenAI, Google and Anthropic Hit Roadblocks
- 42140102 #6 9 points 0 comments -> Show HN: 16 year olds building an Integrated Writing Environment using AI
- 42139989 #21 20 points 12 comments -> Prompt Injecting Your Way to Shell: OpenAI's Containerized ChatGPT Environment
- 42140664 #6 8 points 12 comments -> Valuing Humans in the Age of Superintelligence: HumaneRank
- 42144541 #17 9 points 2 comments -> Reducing the cost of a single Google Cloud Dataflow Pipeline by Over 60%
- 42144873 #9 8 points 0 comments -> Terence Tao: Solving problems by abstracting away highly relevant information
- 42145643 #8 16 points 9 comments -> Arrested and Charged Because My 11yo Was Walking 1 Mile from Home
- 42104714 #9 47 points 13 comments -> Assembly Optimization Tips by Mark Larson
- 42102531 #18 3 points 1 comments -> New, Original, Multi-Purpose Programming Language
- 42143265 #10 256 points 188 comments -> New Apple security feature reboots iPhones after 3 days, researchers confirm
- 42137527 #14 437 points 214 comments -> AI makes tech debt more expensive
- 42142900 #27 18 points 1 comments -> Attestations: A new generation of signatures on PyPI
- 42139044 #29 256 points 74 comments -> Red Hat to contribute container tech (Podman, bootc, ComposeFS...) to CNCF
- 42138115 #10 644 points 235 comments -> Daisy, an AI granny wasting scammers' time
- 42145612 #8 8 points 0 comments -> Tesla pushes update for highway driving, but only for newer vehicles
- 42095263 #12 100 points 25 comments -> My simple knowledge management and time tracking system
- 42145047 #24 12 points 40 comments -> Europe's flying taxi dreams falter as cash runs short
- 42146042 #21 6 points 1 comments -> Amazon S3 now supports up to 1M buckets per AWS account
- 42145222 #18 4 points 2 comments -> Packages, Not Programs
- 42145704 #16 13 points 8 comments -> FlutterFlash - Ship your next Flutter app, at lightning speed
- 42146188 #28 12 points 40 comments -> Congress hears testimony on aliens the government hides
- 42148171 #23 15 points 0 comments -> Carl Sagan testifying before Congress in 1985 on climate change [video]
- 42148789 #18 3 points 0 comments -> Egg Consumption and 4-Year Change in Cognitive Function in Older Men and Women
- 42148817 #19 6 points 2 comments -> The Early Christian Strategy
- 42148864 #5 8 points 0 comments -> I Used Star Methodology to Crack Amazon Leadership Principles Interview
- 42149027 #27 47 points 3 comments -> Bill Atkinson Has Pancreatic Cancer
- 42150504 #5 38 points 12 comments -> National Security Just Called, They Can't See the Email Traffic
- 42082754 #28 6 points 1 comments -> A Visual Guide to the Aztec Pantheon
- 42151986 #17 19 points 2 comments -> Jawboning in Plain Sight: The Unconstitutional Censorship Tolerated by the DMCA
- 42092670 #29 7 points 0 comments -> Philosopher of Change: How Henri Bergson's View of Reality Came to Be
- 42100364 #11 16 points 11 comments -> Solo round-the-world sailor Cole Brauer:first two weeks I cried every single day
- 42099646 #26 8 points 6 comments -> Pokemon TCG Pocket Hub – Open-Source Multilingual Card Game Resource Platform
- 42151569 #12 94 points 85 comments -> Why did people rub snow on frozen feet? (2017)
- 42155986 #15 10 points 1 comments -> Micromelo Undatus
- 42152341 #16 56 points 19 comments -> Getting Started with FPGAs
- 42152928 #20 141 points 72 comments -> M4 MacBook Pros use a quantum dot (QD) film rather than a red KSF phosphor film
- 42149694 #22 140 points 29 comments -> The Practical Guide to Scaling Django
- 42153577 #27 81 points 21 comments -> How public key cryptography works, using only simple math
- 42152068 #28 128 points 75 comments -> FTC to launch investigation into Microsoft's cloud business
- 42150278 #29 415 points 697 comments -> Maybe Bluesky has "won"
- 42156608 #10 4 points 2 comments -> E/acc as a religion:A Technologist's Right to Choose
- 42157862 #12 28 points 9 comments -> Pakistani religious body declares using VPN is against Islamic law
- 42157606 #21 12 points 0 comments -> Stephen Spoonamore suggests presidential vote manipulated in swing states
- 42158401 #10 28 points 8 comments -> Blizzard-made Warcraft 1&2 remasters look like Zynga games made by blind duck
- 42159589 #15 9 points 1 comments -> Judge Blocks The Onion Bid for Alex Jones's InfoWars
- 42102690 #17 11 points 15 comments -> Where Does the Magic Go as We Grow Older? (2020)
- 42160129 #7 13 points 8 comments -> Microsoft and My Tinfoil Journey
- 42160528 #9 16 points 16 comments -> Could DOGE cut $2T? The legal and practical reality of government reform
- 42160713 #11 20 points 1 comments -> Potential algorithmic bias on platform X during the 2024 US election
- 42161348 #13 10 points 1 comments -> Speaker Johnson says Gaetz ethics report should not be released
- 42162028 #25 5 points 0 comments -> Why does the same font look better on macOS?
- 42161741 #15 5 points 0 comments -> I Learned to Stop Being a "Chill Girl" and Start Being Me
- 42098326 #17 32 points 6 comments -> Eau de Nil, the Light-Green Color of Egypt-Obsessed Europe (2018)
- 42164120 #27 5 points 1 comments -> Why are Indian and African wildlife so similar?
- 42124568 #23 10 points 2 comments -> Show HN: Knight's Graph – game based on the Knight's tour problem
- 42164226 #24 25 points 40 comments -> A $12k Surgery to Change Eye Color Is Surging in Popularity
- 42105281 #16 9 points 2 comments -> Timing-Sensitive Analysis in Python
- 42164828 #24 3 points 1 comments -> Why your house is a terrible investment