Show HN: I made Google Trends for Hacker News by indexing 18 years of comments
- zX41ZdbW - 6545 seconds ago
I host a publicly open database with Hacker News data at https://play.clickhouse.com/play?user=play#U0VMRUNUICogRlJPT...
So you can create any similar service with a single SQL query and an HTML page.
I also hosted it as a publicly accessible data lake, which you can query from anywhere: https://github.com/ClickHouse/ClickHouse/issues/29693#issuec...
It is also updated in real-time.
- Aachen - 6826 seconds ago
Google Trends is about searches.
This is about published text. More like if Google Trends counted word occurrences on webpages. Or if Google Ngrams counted webpages instead of books
People don't write much about non-newsworthy things whereas many people search "burger" anytime they want a burger delivery. The datasets aren't usable in the same way
Edit: not to say it's not a cool product! Just keep this in mind and enjoy using it :)
- gslepak - 4757 seconds ago
Very cool! There seems to be a bug here: https://hackernewstrends.com/?q=vim&q=emacs&q=zed
For some reason the results cut off at 2018-10 even though "Popular Comparisons" preview shows more.
- simonpure - 11626 seconds ago
Hug of death
` /api/hn -> 504 An error occurred with your deployment FUNCTION_INVOCATION_TIMEOUT cle1::c8vgv-1782399959042-aeba3cae05ff `
- kaelyx - 9377 seconds ago
Hello, /api/hn -> 502 {"error":"Your database has been temporarily rate-limited, please contact support@upstash.com for further details."}
- smalltorch - 10537 seconds ago
Reminds me of this side project I'm working on.
https://gitlab/here_forawhile/torum
It's an HN clone that syncs with HN and lets you establish smaller private communities that can discuss anything that's on HN without actually being on HN.
It also indexes the DB and lets you search through it, which I find really useful for finding things that pique my interest.
- kpw94 - 7284 seconds ago
The huge spike of "lk-99" in science & frontier tech is amusing...
This is a cool concept. I would love a positive/negative sentiment score computed for each comment that refers to a given word, so you can see trends of "cloudflare (positive)" vs "cloudflare (negative)", where the first counts comments only if the sentiment score is greater than, say, 0.6 and the second counts comments only if it is less than 0.4 (assuming a [0,1] sentiment score).
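The thresholding described above could be sketched roughly as follows. This is only an illustration: it assumes each comment has already been scored in [0, 1] by some sentiment model (not shown), and the function name, the sample data, and the default thresholds of 0.6 and 0.4 are hypothetical, taken from the suggestion rather than from the site.

```python
from collections import Counter

def bucket_by_sentiment(scored_comments, positive_above=0.6, negative_below=0.4):
    """Count comments per (month, label).

    scored_comments: iterable of (month, score) pairs, score assumed in [0, 1].
    Scores between the two thresholds are treated as neutral and skipped.
    """
    counts = Counter()
    for month, score in scored_comments:
        if score > positive_above:
            counts[(month, "positive")] += 1
        elif score < negative_below:
            counts[(month, "negative")] += 1
    return counts

# Made-up scores for comments mentioning one term.
sample = [("2024-01", 0.9), ("2024-01", 0.2), ("2024-01", 0.5), ("2024-02", 0.7)]
trend = bucket_by_sentiment(sample)
```

Leaving a neutral band between the two thresholds keeps low-confidence scores out of both lines, at the cost of ignoring some comments entirely.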
- arjie - 10111 seconds ago
One useful feature would be to normalize by total so that I can see changes in something as opposed to just total site growth. Right now I have to chart a single generic parameter but if I pick poorly it’ll confuse the issue.
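A minimal sketch of that normalization, assuming per-month counts for a term and per-month totals for all comments are both available. The function name and the numbers are invented for illustration and are not taken from the site's data.

```python
def normalize_by_total(term_counts, total_counts):
    """Return each month's share of comments mentioning the term.

    Months with no recorded activity are skipped to avoid dividing by zero.
    """
    shares = {}
    for month, total in total_counts.items():
        if total > 0:
            shares[month] = term_counts.get(month, 0) / total
    return shares

# Hypothetical counts: raw mentions quadruple, but the site grew eightfold.
term_counts = {"2012-01": 50, "2022-01": 200}
total_counts = {"2012-01": 10_000, "2022-01": 80_000}
shares = normalize_by_total(term_counts, total_counts)
```

With these made-up numbers the raw count rises from 50 to 200 while the share falls from 0.5% to 0.25%, which is the kind of difference a normalized view would expose.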
- linmer - 5328 seconds ago
Cool! I want to suggest something. Imagine I want to go to a specific date when some topic was hot. I can read the date from your website and then go to it, but it would be better if I could click on some sort of button, or on the points on the graph, to go to that date. It would be easy to implement; you just need links like this: https://news.ycombinator.com/front?day=2026-05-24
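Building such a link from a data point's date could look like the sketch below. The URL pattern is the one quoted in the comment above; the helper name is hypothetical.

```python
from datetime import date
from urllib.parse import urlencode

def front_page_url(day: date) -> str:
    """Build a link to the HN front page for a given day (YYYY-MM-DD)."""
    return "https://news.ycombinator.com/front?" + urlencode({"day": day.isoformat()})

url = front_page_url(date(2026, 5, 24))
print(url)
```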
- bluecoconut - 8090 seconds ago
Very cool!
one subtle consistency bug that made it hard for me to interpret when I was clicking around: the small thumbnail plot vs the full plot often (always?) seem to use different colors.
The blue / orange gets assigned to the opposite labels in the A vs. B when you click, which made it confusing to understand.
- Petersipoi - 4139 seconds ago
It's funny how "trump" dwarfs just about any other term. Truly a hacker forum.
- ytkimirti - 14840 seconds ago
Hello HN,
This was a small project of mine after I found out that I can simply download the whole Hacker News archive (~48GB) and play around with it.
You can compare terms just like in google trends and you can also see the exact posts & comments from that time.
I like that you can discover what went crazy in the timeline; these show up as small bursts of activity, and it's quite fun to play around with. https://hackernewstrends.com/?q=litecoin&q=dogecoin&q=solana...
I also have a separate page for the "Who is Hiring?" posts. Here is the distribution of programming languages across every monthly "Who is hiring?" post on HN: https://hackernewstrends.com/who-is-hiring
Any kind of feedback is welcome.
- jtolmar - 6473 seconds ago
It looks like some of these terms aren't indexed (or the site is just too hug of deathed right now), but I'd like to see the graph of like, social media, iot, cryptocurrency, ai.
The transition between crypto and ai on the graphs is already pretty funny. https://hackernewstrends.com/?q=crypto&q=chatgpt
- maxignol - 2162 seconds ago
Funny one x) Though I ain’t sure if even more data is useful on hackernews
- dwoosley - 6291 seconds ago
Almost all of the major vulnerabilities and hacks are just single spikes at the time they happened, tailing off after that… except Stuxnet. Stuxnet was much more interesting than most other attacks since it was very political and openly published. Of course, the thing that attack was about is still a news headline today as well.
- sinuhe69 - 11030 seconds ago
IMO, using AI to assign keywords to a broader group of strictly synonymous keywords would make the comparison much more helpful.
Because in general we want to know the trend of categories more than of a word, asking for “auto pilot” for ex. should include “self driving”, FSD etc.
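The aggregation step of that idea might look like the sketch below. It deliberately swaps the AI-based assignment for a hand-written mapping, so it only shows how counts for synonymous keywords could be merged once the groups exist. The group name, the keyword list, and the counts are all hypothetical.

```python
from collections import defaultdict

# Hypothetical, hand-written group; the comment proposes deriving this with AI.
SYNONYM_GROUPS = {
    "self-driving": ["auto pilot", "autopilot", "self driving", "fsd"],
}

def group_counts(term_counts, groups):
    """Merge per-term monthly counts into per-group monthly counts.

    term_counts: dict mapping (month, term) -> count.
    Terms that belong to no group are ignored.
    """
    term_to_group = {term: group for group, terms in groups.items() for term in terms}
    merged = defaultdict(int)
    for (month, term), count in term_counts.items():
        group = term_to_group.get(term.lower())
        if group is not None:
            merged[(month, group)] += count
    return dict(merged)

# Made-up counts for illustration.
counts = {("2023-01", "auto pilot"): 3, ("2023-01", "FSD"): 5, ("2023-01", "vim"): 9}
merged = group_counts(counts, SYNONYM_GROUPS)
```

Summing per-term counts can double count a comment that uses two synonyms at once; deduplicating by comment ID would avoid that if IDs are available.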
- aberrahmane_b - 4713 seconds ago
Great project. The popular comparisons are probably the most useful part because they show the relay race between tools pretty clearly.
One thing I’d like to see is normalization by total HN activity over time.
- Insanity - 5652 seconds ago
This looks quite nice! But suspiciously absent data points.. no Java or Go for the languages? Seems odd. No Amazon in companies, yet I think it's often mentioned.
I wondered if "go" got filtered out because it's also just a regular word.
Either way, very cool!
- dom96 - 11281 seconds ago
Very cool idea. Shows programming language trends pretty well.
- jianfenglin - 3989 seconds ago
Glad to see that the raw data is also shared. Very cool, but why does the openai vs anthropic graph have no data post 2019?
- chfritz - 5806 seconds ago
Great idea! Now you are running into the same issue Google Trends had to solve: term disambiguation. For instance, "atom" is ambiguous in a comparison of editors like this: https://hackernewstrends.com/?q=sublime&q=atom&q=vscode. Given LLMs it might be possible to use an embedding vector (with context) instead of a text string for indexing, and if you do, this problem might go away.
- jazzpush2 - 5165 seconds ago
This is a great project. It'd be fun to look at some of the more popular startups over time, both those that ended up successful and those that didn't.
- upmostly - 2720 seconds ago
Looking at this makes me think HN is peak design aesthetic.
- linzhangrun - 6667 seconds ago
Great job! I've also been wanting to do similar statistics recently, to find out when LLMs became the absolute dominant topic on HN. Now it seems like half of the posts are about LLMs.
- stopachka - 6129 seconds ago
Nice! Would love a brief explanation of the infrastructure. I see the "Powered by Upstash Redis Search" note, but why choose Upstash Redis Search vs something else?
- cloudkj - 11016 seconds ago
This is great, I was just hoping to find a tool like this, specifically scoped to "Show HN" posts. Is there a way to do that?
- ltrg - 6212 seconds ago
It would be super interesting to see if HN mentions serve as a leading indicator of company performance/valuations -- I wouldn't be surprised.
- scarecrw - 11519 seconds ago
Very cool!
I'd love to have some sort of normalization option to separate more subtle positive trends from the general increase in number of posts.
- ytkimirti - 10553 seconds ago
We had to take the site down for a second, it'll be online in a few minutes. Thanks for trying it out
- corv - 9000 seconds ago
The 'flash vs html5' chart looks strange juxtaposed with that conclusion
- dacox - 3361 seconds ago
Very cool! Not sure if something is broken, but there seems to be no data past 2019 on any of the queries that I can see
- NoSalt - 11923 seconds ago
Woah, great work!
I am really liking the trend for "linux": https://hackernewstrends.com/?q=linux
- SoKamil - 7307 seconds ago
Are those raw numbers or adjusted for active users at a given point in time?
- flakiness - 11163 seconds ago
The example comparisons made me smile. Well done!
- igcorreia - 8107 seconds ago
The colors of the lines of the big graph are inverted compared to the smaller ones.
- rightbyte - 10902 seconds ago
Nice. Are the data points on the y-axis normalized by the total number of comments at that time?
Edit: Nvm seems like absolute count if you click the graph.
- jahala - 9592 seconds ago
Really cool! Where would you get the data for something like this? Is it open, or is it scraped?
- WhitneyLand - 2967 seconds ago
First, great work.
Reminds me that I wish there was a modern way to do this for the words people speak and write online. I want to literally know when people started putting literally twice in sentences.
Ngram seems out of date and piecemeal. Now Corpus seems like they try, but the UX is terrible.
- chris_money202 - 10381 seconds ago
Love this, seems to struggle with newly indexed words. Will try again when the FP load is gone
- NooneAtAll3 - 9113 seconds ago
I'd be interested in "google ngram for hacker news" instead
- Cider9986 - 6363 seconds ago
Scrolling is totally broken for me.
- joelres - 10523 seconds ago
Really beautiful, informative, and functional layout. Great work!
- docheinestages - 11220 seconds ago
But can it discover new trends without having to type the keywords?
- mkgeorge7 - 7755 seconds ago
This is actually very cool!
- mkgeorge7 - 7767 seconds ago
This is actually very cool!
- GL26 - 11460 seconds ago
Insane! I don't know if it's possible, but it would be huge if we had access to the localisation of the trends
- joe_the_user - 4879 seconds ago
The topic comparisons are pretty boring and search is disabled. Perhaps I'll remember to return to this. But I can't think of much it gives that plain Google nGram viewer doesn't.
- drchaim - 10688 seconds ago
Too slow or broken right now
- lazystar - 11160 seconds ago
nice. i guess AWS still had nothing to fear from GCP/Azure. ty for this
- some_furry - 11393 seconds ago
https://hackernewstrends.com/?q=furries&q=furry
Hmm, did I break something?
- thomasgeelens - 6210 seconds ago
oeeh hug of death, congrats!
- k33n - 6874 seconds ago
This is quite useful at-a-glance
- jdw64 - 10879 seconds ago
COOOOOOOOOOL!!!!!!
- vachina - 11621 seconds ago
This is the only HN submission I ever upvoted because it is amazing
- ProofHouse - 9016 seconds ago
Yup your upstash is rate limited
- clacker-o-matic - 12090 seconds ago
ooh this is sick! really nice ui too!
- nailer - 4655 seconds ago
> API design, era by era: REST becomes the web's default 2012–15, then the post-REST generation splits: gRPC for service-to-service from 2016, GraphQL for the client from 2017.
No. Looking at the diagram, REST is the default until 2017, GraphQL is briefly popular around the early 2020s, then the web returns to REST.
- oystersauce8 - 11896 seconds ago
love it
- ethanlipson - 1054 seconds ago
[dead]
- robertpduncan - 1607 seconds ago
[dead]
- JFGAi - 9454 seconds ago
[dead]