Statistics

You are currently browsing the archive for the Statistics category.

We created a simple to read list to get an overview of new names: http://popodeus.com/statistics/names/new/

If you think the list could be better, you should comment and tell us what you think of it. We’ll be updating that page with some more neat things real soon.

Today there was a funny name that stuck out from the others:

Shayla Swordfishtrombone

This is most likely still within the game rules, but is borderline stupid looking. Have you found any funny names you think should be mentioned? Let’s catch them all!

We’re reaching end of July and Popmundo forum threads are at around 1,670,xxx. That is, 1.67 millions and counting up.

Popodeus forum search only contains a small fraction of all the threads created daily inside Popmundo. The main reason for this is our conservative indexing speed and one access node (server). Current status of indexed threads from 1,000,000 up to 1,670,xxx looks like this:

As you can see, there’s much more red than white or green. And we have to fix that! So we’re going to take a different approach to this.

I will soon start a coordinated project hopefully with at least 10 volunteers so we can get most of the missing threads into the forum index. If you’re interested, drop me a mail at alanas.anikonis@popodeus.com and I’ll send more details when things are ready. Please state in your mail who your Popmundo character is and what forums you are subscribed to, and if you would not mind subscribing to forums that you don’t usually read. After indexing is done, you can of course unsubscribe. The indexing will most likely be done via your web browser and a Greasemonkey script, so make sure you have Firefox 3.5 or later installed. We can probably get started on this sometime in August, so it’s no rush, I have a few security issues to take care of first.

Don’t hesitate. There’s not much work involved on your part. The indexing is mostly automated and you only have to leave on your computer and either watch it happen or go watch some TV :D

More forum statistics coming soon!

Don’t miss out the new simple helper tool we’ve added to Popodeus.

http://popodeus.com/statistics/artist/

We added the Tour Banner helper, so you can easily pull a list of any artist’s tour schedule.

Then you can copy-paste this text into a Google Spreadsheet or maybe Photoshop, if you plan on making a tour banner for your artist.

Try it out and send us feedback!

Later today we will launch a daily list of name changes that happen daily.

Your name is tracked by Popodeus, be sure of it. And if other players pay enough attention, we will find the offensive names quicker.

To be updated….

On the Crew page in Popmundo http://www.popmundo.com/Common/Crew.asp there are numbers on how many people use a certain language as their primary language.

Running a simple script I made, which summarizes all numbers and calculates percentages, the page looks like this:

(Sum is 229,071 users)

We can most certainly assume that many pick either US or UK English as their game interface language, even if it’s not their mother tongue (I always play the game in US English myself).
And we can most certainly assume that nobody selects Turkish unless they actually speak the language.

So to see Turkish alone is 49%, we can be pretty sure that over 50% of the players speak and know Turkish… (if we include people who don’t pick Turkish even if they could).
Even Brazilian Portuguese only reaches a feeble 16.3% of the user base, and it’s supposed to be the second biggest language in the game.

So in case anyone ever wonders: Turkish IS the main / dominant language in the game. Accept it :)

Top 10 languages are:

112,200 Türkçe 48.98%
37,365 Português, Brasil 16.31%
17,087 Español 7.46%
13,504 US English 5.9%
6,761 Lietuvių 2.95%
6,138 Polski 2.68%
6,075 Italiano 2.65%
3,852 Chinese, Singapore 1.68%
3,680 Eesti 1.61%
3,613 Suomi 1.58%

Tags:

Happy Christmas and Merry New Year! Time to make a few promises for 2010.

Currently I’m on a vinter vacation and haven’t been home since New Year’s Eve, so there hasn’t been much happening the past week (or actually two…)

So, what will Popodeus be up to in 2010?

  • Snuggly 2.0 will be launched (yes, seriously…).
  • Better Forum Search and more up to date content.
  • A couple new and improved Name Search interfaces.
  • More graphs! Everyone loves graphs. Right?

I will release new stuff as soon as I get back home later this week.

It sure ain’t pretty

I was working on some old and new numbers again and decided to create a few pretty graphs for everyone. After all, number and graphs are always a joy to stare at! ;)

First up is a an updated graph of something I first posted in Trendy Charts. I never was happy with how the graph looked in the first place. Google completely ignored the date stamp, thus the graph was drawn in improper scaling. I don’t like to produce data that looks false.

Here’s a new one created with proper tools:

Number of characters in Popmundo

If you ask me, it sure ain’t a pretty view. Why did I ever choose purple in the first place? Oh, and that quick decline of players is awful too.

Second graph is data from last 19 days, plotting the number of logged in users that can be expected during day and night. Since central Europe dominates, the game is at its slowest during 5-7am and the busiest hours are during 17-22 (6-10pm) which really doesn’t come as a surprise. If East Asians and Americans were in greater numbers, the line would be much flatter.

All times in CET.

Logged in users per hour

Here’s the same data broken down into 3 hour intervals and split into weekdays:

people_online

What can be seen from this is that Thursday has been a really busy day during the last 19 days. People also like to stay up late on Fridays to play Popo.

Final graph is the raw data this information was based on:

Logged in users

All of these graph are continually updated on Popodeus website in our statistics page. Make sure to visit it often!

Back in Coverage of forum-index I posted (a sort of) scatter plot for my forum database index. Since then I’ve pulled in lots more forum posts.

Index coverage for Forum Search

This plot covers forum posts id 1,200,000 to 1,450,000. Black spots are unindexed so far. Green spots are posted in Turkish forums, white ones are all other languages. If you look carefully, some few orange and blue spots are Dutch and Finnish respectively.

That’s it for now.

I hadn’t run my letter frequency analyzer on all Popmundo names in a while now, which I do from time to time in order to find those odd (or just plain broken and illegal) names.

Special letters usually stick out like a sore thumb when you break down every name into separate letters and count each how many times they appear. For a human, doing this type of counting by hand would be insane.

In 288,348 names, there is a total of 4,022,233 letters (including spaces, dots, basically anything that takes space horizontally.)
Out of four million letters, there are only 158 unique ones:
Ė 1 ľ 1 1 1 1 ? 1
М 1 Ū 1 · 1 Þ 1 1 1
2 2 2 2 2 ð 2
Ą 3 3 3 þ 5 ù 5 6
À 6 ű 7 Ő 8 œ 10 Ţ 10 ì 10
à 19 . 20 , 20 Ć 23 Õ 25 Ø 26
Ó 31 Ä 34 Ż 35 Ś 37 Â 39 ò 45
ő 47 Ú 47 ß 59 67 ź 67 đ 71
Đ 73 ´ 76 æ 83 Í 89 Ł 93 ń 98
ï 113 Ž 120 ą 126 ę 127 Å 127 ţ 135
ż 137 ś 144 õ 162 û 201 Š 206 Č 210
ë 249 É 258 å 314 ø 358 ê 381 ž 402
ô 424 ă 431 î 449 è 581 Á 724 ñ 833
878 š 903 ł 916 Ü 924 ū 948 â 970
ú 982 č 1113 ã 1184 Q 1543 ć 1588 X 1662
ä 1697 q 2508 ó 3118 - 3166 U 3577 Ş 3891
x 4277 İ 4380 Ç 4601 ė 4704 á 5075 í 5430
Ö 5609 Z 6281 ö 6459 é 7278 W 8287 I 8466
j 8493 w 9889 ç 10127 O 10899 ğ 11410 V 11534
Y 11968 ş 13519 f 15612 N 16635 F 17522 H 21571
P 22442 J 22456 R 23374 ü 23900 T 24351 p 24606
ı 25901 L 25979 E 26900 D 28964 K 31164 G 32232
b 33843 C 36470 v 38267 B 40795 z 42677 g 46002
S 48279 M 48344 A 59444 h 62611 y 63447 c 63622
k 67516 m 73191 d 88623 u 112437 t 126844 s 127070
o 182954 l 204371 r 257355 i 259978 n 262577 322280
e 328487 a 442350
Unsurprisingly, the most common letters are a, e, n, i, r… Even in an international game like Popmundo, those letters dominate.
I’m not going to go extensively into the list this time, but I want to point out the small little dot that has appeared into the list:
The little boy who has this name since October 24, 2009 is Apol·linar Tortora.


Most common first names

David : 1124
John : 842
Paul : 738
Michael : 697
Emma : 659
Emily : 623
Zeynep : 592
Anna : 586
Kevin : 584
Pınar : 572
Daniel : 563
Mark : 561
Su : 535
Scott : 529
Danny : 524
Alex : 520
Jessica : 510
Elizabeth : 485
Chris : 483
Andy : 471
Isabella : 470
Stephen : 468
Hande : 467
Grace : 465
Ava : 463
Özlem : 462
James : 461
Olivia : 461
Rachel : 457
Ian : 451
Simin : 447
Steve : 447
Samantha : 445
Madison : 439
Peter : 438
Sezen : 430
Sophia : 427
Eva : 418
Laura : 415
Lauren : 410
Amanda : 402
Robert : 401
Alexis : 398
Alan : 395
Baby : 395
Matthew : 393
Abigail : 385
Hannah : 377
Brian : 376
Jonathan : 374
José : 368
Ada : 364
Gary : 364
Aslı : 361
Richard : 355
Yasemin : 351
Sarah : 348
Isabel : 346
Megan : 345
Sydney : 342
Nisan : 340
Chloe : 336
Christopher : 334
Ege : 333
Deniz : 329
Nil : 329
Başak : 328
Victoria : 326
Lee : 323
Emre : 322
Neil : 321
Yaprak : 316
Seda : 315
Dean : 313
Manuel : 310
Ekin : 305
Thomas : 305
Andrea : 304
Jason : 300
Ana : 295
Seçkin : 292
Mehtap : 291
Ogün : 291
Demir : 289
Özge : 288
Tuba : 287
Morgan : 283
Jamie : 282
Brianna : 281
Gabriel : 280
Darren : 272
Oya : 269
Berrak : 267
Ece : 267
Eric : 266
Diana : 265
Efe : 262
Paco : 262
Gavin : 258
Sandra : 258


Most common family names

Williams : 414
Johnson : 405
Taylor : 404
Şahin : 400
Murray : 324
Çetin : 304
Arslan : 300
Yıldız : 300
Kaya : 293
Yılmaz : 289
Jackson : 269
Rüştü : 268
Koç : 262
Duran : 256
Martin : 256
Kılıç : 253
Küçük : 248
Alemdar : 247
Karahan : 246
Evans : 245
Özdal : 239
Karahanlı : 238
Cullen : 236
Aktaş : 234
Campbell : 224
Manisalı : 222
Tuna : 217
Edwards : 216
Kahraman : 213
Şen : 213
Çakar : 212
Sever : 210
Avcı : 209
Yakın : 206
Doğan : 204
Durukan : 203
Yüksel : 203
Scott : 202
Thomas : 202
Yıldırım : 200
Kaynar : 199
Gercek : 197
Aydın : 194
Coşkun : 194
Gelik : 194
Aksoy : 189
Anık : 189
Vargın : 189
Yücel : 188
de Oliveira : 187
Bursa : 186
James : 186
Coşar : 185
López : 185
Ward : 185
Akyol : 184
Berkes : 184
Hughes : 182
Murat : 179
dos Santos : 179
Black : 177
Genç : 177
Keskin : 177
Harris : 176
Gündüz : 173
İzmir : 173
Güzelyurt : 172
Erkan : 171
Topaloğlu : 171
Tural : 171
Göztepeli : 170
Hoca : 170
Türktaş : 170
Şirin : 167
Yeğin : 166
Çolak : 166
Silva : 165
King : 164
Göztepe : 163
Clark : 162
Güzelizmir : 162
Tatlıses : 162
Korkmaz : 161
Müge : 161
Akut : 160
Avcıbaşı : 160
Gonçalves : 159
Orga : 159
Rodrigues : 157
Özkan : 157
Karabekir : 155
Oransayoğlu : 155
Ruiz : 154
Satan : 154
Üstün : 154
Şekercioğlu : 154
Ersoy : 153
Murphy : 153
de Carvalho : 153
İlker : 152

I wanted a quick visual of how much of the forum I have indexed… The current thread id maximum is at around 1.42 millions, and I only have 33k threads in my database. Lots of empty space in there.. ugh. I can do better, I promise!

This is my first visualization, blue dot is an indexed thread. Top left corner is id zero, and it grows to the right, 1000 pixels/threads to the right. So the 20th line represents threadids 20,000-20,999. First pixel on line 1400 is threadid 1,400,000-1,400,999 etc. Click on image for full size view.

Popmundo Forum Search database coverage

Later I might add color to show which thread is in Turkish, English, Spanish etc. unless I make a traditional pie chart or something.

Talk is cheap

As asked in the previous blog entry, I’m posting some of the most commonly used words (terms, as Lucene calls them) used in the Popmundo forums.

Extracting the top terms is quite simple, but getting relevant data, well… this is how it looks like:

141,846  i
108,782 de
87,742   you
84,692    me
80,864    que
71,235    la
59,152    en

… ok, so number is how many times it occurs (in 646k entries). Hardly any interesting details in the top most common terms.

I shouldn’t even index those words in the first place, they’re called “stop words”, and removing them from the search database would make it much smaller. (Actually I started filtering out stop words not too long ago, so searching for them should not be possible for posts indexed in the last month or so, unless I made a configuration mistake)

Scrolling down in the list these words pop up:

vip, solo, band, rock, idea, skills, show, thread, tour, basic, money, ceo, zombie, club… (I didn’t pick any words that I didn’t understand, in Spanish, Turkish etc.)
Well, hardly anything surprising there. Zombie is mentioned 5,750 times.

Full list of 400 most common terms: popmundo_forum_top_terms_2009.txt

« Older entries