• OK, it's on.
  • Please note that many, many Email Addresses used for spam, are not accepted at registration. Select a respectable Free email.
  • Done now. Domine miserere nobis.

Statistical Analysis of intpf

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Nice, mediocrity, just what I was going for

I can guess my word cloud: well, though, maybe, perhaps, guess, some, however, also, might, but, people

All my previous 500 posts are basically the same
that's thing about that TF-IDF algorithm – even if those are the words that comprise most of your posts, those words are not emphasized because they are common words among many of the posters.


Minuend and mine:
 

Attachments

  • minuend_wc.jpg
    minuend_wc.jpg
    64.6 KB · Views: 433
  • serac_wc.jpg
    serac_wc.jpg
    64 KB · Views: 426

Minuend

pat pat
Local time
Today 6:01 AM
Joined
Jan 1, 2009
Messages
4,142
-->
That's cheating:mad:

Though, I do have very strong opinions about digestion behavior that tend to be alienated
 

Polaris

Prolific Member
Local time
Yesterday 6:01 PM
Joined
Oct 13, 2009
Messages
2,261
-->
I’ve had the same thought as Minu. I sort of know what my cloud words would be in terms of topics I’ve been preoccupied in the last year. I don’t think it would be a nice cloud, more like looming doom and gloom in the form of negative words.

Funny how you can look at a word cloud and instantly feel like you’d rather not want to know that person. I think mine will be equally repellant. Kind of how I feel about people in general though, so no surprises there.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Absurdity and Polaris below.
(what's up with pineapples?)
 

Attachments

  • absurdity_wc.jpg
    absurdity_wc.jpg
    62.8 KB · Views: 411
  • polaris_wc.jpg
    polaris_wc.jpg
    61.2 KB · Views: 415

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Random one of the day: Nebulous
 

Attachments

  • nebulous_wc.jpg
    nebulous_wc.jpg
    60.9 KB · Views: 410

PmjPmj

Full of stars.
Local time
Today 5:01 AM
Joined
Sep 18, 2012
Messages
1,396
-->
Location
UK
My cloud gave me a bloody good chuckle. And reminded me that I need to try harder :<

Thank you, kind sir.
 

Creeping Death

Consigliere
Local time
Yesterday 11:01 PM
Joined
Oct 10, 2016
Messages
860
-->
Location
Omnipresent
I'm curious about mine now. The words for the year pic in the op was about what I expected.
 

baccheion

Active Member
Local time
Today 1:01 AM
Joined
May 2, 2016
Messages
277
-->
Everyone in the thread gets a cloud!
This is baccehion, gopher, gps, hado, haim, respectively
How many words have I written this year, how many unique words, and where would I rank if included in the original post? Can you run the same program on other forums?

Have you considered aggregating then LZMA-compressing posts, then sorting by compression ratio? Each post could be compressed to get an average for each member (can apply standard error), but a giant post seems more telling? I saw this used to rank songwriters. The less the compression ratio, the more "complex/rich" a person's posts. Including the overall number of posts would help filter the final list.
 

Niclmaki

Disturber of the Peace
Local time
Today 1:01 AM
Joined
Oct 21, 2012
Messages
550
-->
Location
Canada
Pretty cool. Do those emojis count as a word? Or are they ignored? Or get the weirdo formatting in their “count”:elephant:
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Pretty cool. Do those emojis count as a word? Or are they ignored? Or get the weirdo formatting in their “count”:elephant:
nah, emojis are excluded. They show up as hyperlinks in the html code and are thus removed.
 

Attachments

  • niclmaki_wc.png
    niclmaki_wc.png
    63.8 KB · Views: 384

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
How many words have I written this year, how many unique words, and where would I rank if included in the original post? Can you run the same program on other forums?

Have you considered aggregating then LZMA-compressing posts, then sorting by compression ratio? Each post could be compressed to get an average for each member (can apply standard error), but a giant post seems more telling? I saw this used to rank songwriters. The less the compression ratio, the more "complex/rich" a person's posts. Including the overall number of posts would help filter the final list.

Words written this year: 3482
Unique words: 1018
Unique dictionary words: 876
In terms of number of words written your rank is 68

The compression idea is interesting. Might look at that when I have some time.

The code can probably be run on other forums but it will probably need some modification. It depends on the html structure of the threads. Dunno how much similarity there is between forums in that regard.
 

Rixus

I introverted think. Therefore, I am.
Local time
Today 5:01 AM
Joined
Nov 21, 2016
Messages
1,276
-->
Location
United Kingdon
Come on then - let's see mine.
 

Niclmaki

Disturber of the Peace
Local time
Today 1:01 AM
Joined
Oct 21, 2012
Messages
550
-->
Location
Canada
Nice, mediocrity, just what I was going for

I can guess my word cloud: well, though, maybe, perhaps, guess, some, however, also, might, but, people

All my previous 500 posts are basically the same





Shit and fuck were my first guesses. Not sure what to make of the milk and bowl, though. Hmmmm

What is your word cloud, serac?

500 posts! Heck all you guys post a lot. I never really looked at the post count before.
 

baccheion

Active Member
Local time
Today 1:01 AM
Joined
May 2, 2016
Messages
277
-->
Words written this year: 3482
Unique words: 1018
Unique dictionary words: 876
In terms of number of words written your rank is 68

The compression idea is interesting. Might look at that when I have some time.

The code can probably be run on other forums but it will probably need some modification. It depends on the html structure of the threads. Dunno how much similarity there is between forums in that regard.

What about the second graph (unique words)? I'm more interested in the density of unique words (something like unique_dictionary_words / total_dictionary_words) and the resulting graph/rank. If you end up trying the compression approach I mention, that would address everything.
 

Nebulous

Well-Known Member
Local time
Today 1:01 AM
Joined
Mar 11, 2016
Messages
909
-->
Location
Just North of Normal
Random one of the day: Nebulous
NEAT
“Daydreaming” not surprised

This stuff’s so coolllllllll I love this kind of thing so muccchhhhh
 

crippli

disturbed
Local time
Today 6:01 AM
Joined
Jan 15, 2008
Messages
1,779
-->
Can you match clouds, as to find who on the site would be best to sleep with?
 

Black Rose

An unbreakable bond
Local time
Yesterday 11:01 PM
Joined
Apr 4, 2010
Messages
10,783
-->
Location
with mama
Can you match clouds, as to find who on the site would be best to sleep with?

terrific idea.

group people with similarities in a high dimension graph of complex word similitude.

Maybe discover a persons MBTI type as well.
 

Happy

sorry for english
Local time
Today 4:01 PM
Joined
Apr 26, 2013
Messages
1,336
-->
Location
Yes
Dang I gotta know. Hit me plz.
 

QuickTwist

Spiritual "Woo"
Local time
Today 12:01 AM
Joined
Jan 24, 2013
Messages
7,182
-->
Location
...
According to my psychological profile, I am a 40-year-old female INTJ.

https://applymagicsauce.com/demo.html

Also includes the big five.

Interesting shit.

It says I am a 25-29 yo Male who is very unlikely to be gay who is an INFP. I think my Big 5 was:

Liberal and Artistic (O): 57%
Organized and Hardworking (C): 53% lol
Contemplative (E): 34%
Team Working and Trusting (A): 51%
Laid Back and Relaxed (N): 42%

Thanks for doing this Serec. That is quite a project you had there. How long did it take you to complete this?

My Title seems to fit based on this so I think I will keep it for the time being.

I was going to go to sleep, look what you made me do...

Can you do my word salad Serec? I don't really use big words tho.
 

Rixus

I introverted think. Therefore, I am.
Local time
Today 5:01 AM
Joined
Nov 21, 2016
Messages
1,276
-->
Location
United Kingdon
According to my psychological profile, I am a 40-year-old female INTJ.

https://applymagicsauce.com/demo.html

Also includes the big five.

Apparently, my digital profile suggests I'm a 29 year old INTP, Single, Morman, Conservative Female with a strong interest in art and is highly unlikely to be gay.

And apparently, liking rock music and a couple of fitness pages makes me less intellectual. I'm also apparently quite unsatisfied with life.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Rixus, Happy below. @QT I already did yours – it's on the first page somewhere
 

Attachments

  • rixus_wc.jpg
    rixus_wc.jpg
    61.1 KB · Views: 381
  • happy_wc.jpg
    happy_wc.jpg
    62.3 KB · Views: 391

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
That is quite a project you had there. How long did it take you to complete this?

No too long tbh. A sunday afternoon for the code that retrieves the threads and did the cloud thing while waiting for some computations to finish at work
 

Happy

sorry for english
Local time
Today 4:01 PM
Joined
Apr 26, 2013
Messages
1,336
-->
Location
Yes
Haha that was fun. Thanks
 

Helvete

Pizdec
Local time
Today 4:01 PM
Joined
Dec 28, 2013
Messages
1,541
-->
Are there any spare clouds I could have?
 

PmjPmj

Full of stars.
Local time
Today 5:01 AM
Joined
Sep 18, 2012
Messages
1,396
-->
Location
UK
The test AK linked has me as a 32 year old male INTJ.

Accurate test is accurate. I mean sure, I'm probably an E rather than an I, but it had me close to borderline anyway.

Spooky shit bruh.
 

PmjPmj

Full of stars.
Local time
Today 5:01 AM
Joined
Sep 18, 2012
Messages
1,396
-->
Location
UK
Oh, wait. I analysed some emails and it now thinks I'm 26/f/INTP.

Hawt.
 

Rixus

I introverted think. Therefore, I am.
Local time
Today 5:01 AM
Joined
Nov 21, 2016
Messages
1,276
-->
Location
United Kingdon
Oh, wait. I analysed some emails and it now thinks I'm 26/f/INTP.

Hawt.

Ah, fellow INTP female in their late 20's. Just like me.
Are you a Conservative Mormon, as well?
 

PmjPmj

Full of stars.
Local time
Today 5:01 AM
Joined
Sep 18, 2012
Messages
1,396
-->
Location
UK
Libtard leaning, apparently.
 

QuickTwist

Spiritual "Woo"
Local time
Today 12:01 AM
Joined
Jan 24, 2013
Messages
7,182
-->
Location
...
Serec, I have a favor to ask. I want to take the results from the cloud you did for me and plug it into IBM watson's personality utility and see what I get and see if it's any different.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Serec, I have a favor to ask. I want to take the results from the cloud you did for me and plug it into IBM watson's personality utility and see what I get and see if it's any different.

What sort of format does it take as input? The word clouds are based on giving each word a numerical weight
 

Rixus

I introverted think. Therefore, I am.
Local time
Today 5:01 AM
Joined
Nov 21, 2016
Messages
1,276
-->
Location
United Kingdon
Why don't words like "The", "And", "It" and so on come up?
I managed to get a "her" in there on mine. You'd think those simple pronouns and conjunctions words would litter our posts more so than the other words we find.

Though why "piles" is in my cloud, I'm not entirely sure.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Why don't words like &quot;The&quot;, &quot;And&quot;, &quot;It&quot; and so on come up?
I managed to get a &quot;her&quot; in there on mine. You'd think those simple pronouns and conjunctions words would litter our posts more so than the other words we find.

Though why &quot;piles&quot; is in my cloud, I'm not entirely sure.

It's the TF-IDF algorithm. It deflates the weights of the words by how often they are used in the total collection of words, i.e. across all posts on the forum. For example since everyone uses "the" very frequently, its importance gets diminished.

It is surprising that "her" shows up in your cloud, but it just means you're using that word much more frequently than everyone else on the forum.
 

QuickTwist

Spiritual "Woo"
Local time
Today 12:01 AM
Joined
Jan 24, 2013
Messages
7,182
-->
Location
...
What sort of format does it take as input? The word clouds are based on giving each word a numerical weight

I think all it does is take the words you use and put them into different categories. Therefore, I don't think I actually need to write sentences, but can just enter in how many times of each word from the word cloud I used. So having a rundown of how many times I used each word in the word cloud you did for me should suffice.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
I think all it does is take the words you use and put them into different categories. Therefore, I don't think I actually need to write sentences, but can just enter in how many times of each word from the word cloud I used. So having a rundown of how many times I used each word in the word cloud you did for me should suffice.
Something like this? These are the weights of each word
 

Attachments

  • quicktwist.txt
    60.7 KB · Views: 377

QuickTwist

Spiritual "Woo"
Local time
Today 12:01 AM
Joined
Jan 24, 2013
Messages
7,182
-->
Location
...
Something like this? These are the weights of each word

Thanks, I appreciate that. Unfortunately, I really have no idea how many times this equates to with each word and that is what I need.
 

Ex-User (14663)

Prolific Member
Local time
Today 5:01 AM
Joined
Jun 7, 2017
Messages
2,939
-->
Thanks, I appreciate that. Unfortunately, I really have no idea how many times this equates to with each word and that is what I need.

Well, I imagine you don't need the actual count, but just the relative count? In that case just multiply each weight with a scaling factor, say, 10000, and you get something akin to a relative count

Also note that these are not the actual frequencies but weights generated by TF-IDF algorithm.
 
Top Bottom