Thank you for doing and sharing this
Comment on Not for me, tho
tetris11@lemmy.ml 4 months ago
Which language provides the most random alphabetically sorted sequence?
Data
| Base | English | Dutch | German | Turkish |
|------+---------+-------+--------+---------|
|    1 |       8 |     8 |      8 |       6 |
|    2 |      11 |     3 |      3 |       5 |
|    3 |       5 |     1 |      1 |       1 |
|    4 |       4 |    11 |     11 |       9 |
|    5 |       9 |     9 |      5 |       4 |
|    6 |       1 |    10 |      9 |       2 |
|    7 |       7 |    12 |      6 |      10 |
|    8 |       6 |     2 |      7 |      11 |
|    9 |      10 |     4 |      4 |      12 |
|   10 |       3 |     5 |     10 |       8 |
|   11 |      12 |     6 |      2 |       3 |
|   12 |       2 |     7 |     12 |       7 |
- Sourced from comments in thread
 
Plot with Pearson Score
Code
```bash
# Plot each language's alphabetical rank against the base sequence and label the Pearson correlations
gnuplot -p -e '
    set xlabel "Base Sequence";
    set ylabel "Alphabetic";
    set xtics (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12);
    set ytics (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12);
    stats "alphabetic.tab" using 1:2 name "E";
    stats "" using 1:3 name "D";
    stats "" using 1:4 name "G";
    stats "" using 1:5 name "T";
    set label 1 sprintf("Corr. [Eng = %.3f], [Dut = %.3f], [Ger = %.3f], [Tur = %.3f]", E_correlation, D_correlation, G_correlation, T_correlation) at graph 0.95,0.05 right;
    plot "" using 1:2 with lines title columnhead(2),
         "" using 1:3 with lines title columnhead(3),
         "" using 1:4 with lines title columnhead(4),
         "" using 1:5 with lines title columnhead(5)
'
```
It looks like the most random language is Dutch (its correlation with the base sequence is closest to zero), and Turkish appears to be the least random (the consecutive 10, 11, 12 run probably skewed it).
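For anyone who wants to double-check the correlations without gnuplot, here is a minimal Python sketch that recomputes the Pearson coefficient of each column against the base sequence. The numbers are copied from the table above; the names `pearson`, `RANKS` and `CORRELATIONS` are just illustrative and not part of the original script.

```python
from math import sqrt

# Base sequence 1..12 and the alphabetical ranks copied from the table above.
BASE = list(range(1, 13))
RANKS = {
    "English": [8, 11, 5, 4, 9, 1, 7, 6, 10, 3, 12, 2],
    "Dutch":   [8, 3, 1, 11, 9, 10, 12, 2, 4, 5, 6, 7],
    "German":  [8, 3, 1, 11, 5, 9, 6, 7, 4, 10, 2, 12],
    "Turkish": [6, 5, 1, 9, 4, 2, 10, 11, 12, 8, 3, 7],
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Correlation of every language column with the base sequence.
CORRELATIONS = {lang: pearson(BASE, ranks) for lang, ranks in RANKS.items()}

if __name__ == "__main__":
    # List languages from "most random" (|r| nearest 0) to "least random".
    for lang, r in sorted(CORRELATIONS.items(), key=lambda kv: abs(kv[1])):
        print(f"{lang:8s} {r:+.3f}")
```

With the table as given, this lists Dutch first (about -0.035) and Turkish last (about +0.301), with English (about -0.154) and German (about +0.231) in between, which matches the ordering described above.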
FeatherConstrictor@sh.itjust.works 4 months ago
null@slrpnk.net 4 months ago
This is the second comment I’ve seen like this from you.
Please never stop.
Resonosity@lemmy.dbzer0.com 4 months ago
c/dataisbeautiful
diemartin@sh.itjust.works 4 months ago
I didn’t expect someone to put that much effort into it.
Thanks! This is awesome!
jaybone@lemmy.zip 4 months ago
You put a lot of work into this.