Message from @porco
Discord ID: 453298779005911080
i was born near rotterdam tho
I refuse to believe you were born
where's your birth certificate?
<:mthink:451087817829777429>
conjured from the depths below
```
cat /var/log/pornspider/videos.log | grep 'asian' | grep -v 'Os.GNULinux' | wc -l
9934
```
```
cat /var/log/pornspider/videos.log | grep 'asian' | grep 'Os.GNULinux' | wc -l
8711
```
Linux users are 15 % of the user base and they watch almost as much asian porn as the other 85% combined
Linux users confirmed weebs
kek
Where's that filter
I can't promote pornspider when it lacks basic features
we didn't need a porn statistic to know linux users are weird
```
cat /var/log/pornspider/videos.log | grep 'forcing girl to install gentoo' | grep 'Os.GNULinux' | wc -l
1
```
<:blobhyperthink:427568506905559040>
By the way I've been meaning to ask but do you incorporate any C# in your pornspider?
yes, the codebase is roughly
50 % C++
25 % C#
15 % Haskell
10 % HTML, CSS, Javascript
<:googlethink:328536447219138561>
How do you combine them
Not asking about last part but C# and C++?
A ton of independent services
And they share a data source
Like, there's a shared library which has a database connection and a basic mapping layer
So wait, C++ is doing the scraping or what?
Exactly, C++ is mostly scraping the websites and also partially building search indexes
Huh, I thought you were scraping in C#, and was asking since I tried making a scraper before in C#
what exactly is this pornspider doing
But aside from linkin shit I didn't exactly manage how to get data to work properly
@Spicy it indexes a lot of porn websites and provides a cross site search over all of them
I made that while I was temporarily jobless because I really had nothing to do
how much effort did it take to not get banned everywhere
and then some of the users' data like os gets stored to do some analytics?
probably just regular logs
Well that's the Haskell part. My girlfriend is doing a lot of data analysis in haskell. Removes illegal content, keeps the huge amount of data consistent
Just search logs, yes. I don't even log IPs
and its just a web interface?
Site interface?
web
Ah the only GUI is web