Date: 2004-03-26 22:48:00
Tags: livejournal, toys
new lj toy

Ok, time for you to try out a fun lj toy. The LJ Markov Random Text Generator is now fully operational. Enter your LJ username, and it will produce randomly generated text based on your journal contents, with often bizarre and hilarious results.

Let me know if you have any comments or suggestions!

Thanks to [info]goulo for helping test an earlier version of this.

Username:
[info]equiraptor
2004-03-27T06:42:39Z
Apparently, I feel safer in the garage. :)
[info]pasketti
2004-03-27T15:55:39Z
That is so cool.

Years ago, I wrote one of those to mangle BBS postings. Great fun!
[info]ghewgill
2004-03-27T22:04:56Z
Yes, what was once old is new again. Having livejournal offer freely available, easily parseable text dumps written by millions of different people, sure makes the source text problem easier!
[info]kvarko
2004-03-27T20:48:38Z
Rock! On!

I wrote programs to do that back in high school, and asked everyone to send me their text files, so I could build up good percentages. I remember there being some pretty amusing results. I also gave it several large purity tests to build percentages from and gave it "Have you ever" as a seed string, and that gave some good results, I think.
[info]ghewgill
2004-03-27T21:40:17Z
I have noticed that it works better when used against a personal journal, rather than against a community. I think this is because a single person's writing style is often consistent across posts, but a community usually has posts written by many different people. The differences in writing style give the markov model less opportunity to choose alternate word chain paths.

Now, this doesn't mean that it's not still fun to run it against communities like badspam. :)
[info]lindseykuper
2004-04-02T19:08:12Z
Yeah, that's a really good point. I've noticed it works well on journals of people who tend to use the same sorts of phrases over and over.

Awesome toy, by the way!
[info]kvarko
2004-03-27T20:53:19Z
If your LJ is in one langauge, with only a few posts or quotes in other languages, you don't get good results when words in those other languages are chosen as the seed words. Because it has very few other words to pick next, so it ends up quoting large sections of posts. I think I only have one entry with french in it, which is lyrics from a song, and the randomizer picked a french word to start with so it quoted the entire song :) My LJ is mostly in Esperanto, and it actually took three or four reloads before I got a text which was in Esperanto.
[info]void150
2004-03-30T14:44:04Z
The text generator is fantastic, I'm going to waste bloody ages playing with it! Well done!
[info]r_l
2004-03-31T01:04:11Z
It's great! Thank you so much.
[info]ex_udod985
2004-03-31T08:49:02Z
Great toy!
[info]rydel23
2004-03-31T10:39:09Z
Thank you so much! I am playing with already for two hours! It's hillarious. And some of the stuff that comes out is so creative and often times so smart and deep, I just can't believe it. Thank you!

[info]greenadine
2004-03-31T11:34:25Z
It's very funny :-)
But do you use only the 20 last journal entries ?
[info]ghewgill
2004-04-02T00:45:57Z
Yes, the RSS feed of your journal from livejournal only includes the most recent entries. See http://www.livejournal.com/support/faqbrowse.bml?faqid=149
[info]interaktiv
2004-03-31T13:26:08Z
Genious!
[info]urbansheep
2004-04-01T05:24:13Z
Absolutely incredible, as it is becoming a next stage of Burroughs' cut-up technique. Thanks!
[info]gone_one
2004-04-01T23:29:01Z
Nice toy. Really fun. :)
I wrote a thing like that few month before, and I'm interested to take a look at your code - can you show it to me? Please write to sergey(at)consal[dot]ru.
[info]lba
2004-04-05T14:40:20Z
Very fun toy, thank you!
I would like to apply it to some "friends only" journal entries, so need to add some authentication.
Can you mail me code? PleeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeZZZ... :-))
"For personal use only!", of cource! :-)
If "yes" — xav13@atmusic.org
[info]ghewgill
2004-04-05T14:47:42Z
I added the ability to read friends-only posts, see this post to find out how.

The source code is available here.
[info]lba
2004-04-07T08:21:15Z
Thanksthanksthanksthanks!!! It's cool! :-)

Now I'll try to add ability to mix various journals... May be funny... :-)
[info]ultranurd : Bug Report
2004-04-06T03:59:11Z
This is very, very cool! I did a trigram based system using the King James Bible as a corpus for a computational linguistics course. I got some great prophetic proclamations.

A lot of my friends are getting some great results from their journals, and I want to join in the fun... but it isn't working for my RSS feed. It says there's a parse error.

All I can figure is that there's something in my journal that your program doesn't like. My only thought is that I use XJournal, which inserts links to the iTunes Music Store for my currently playing music.

Any ideas why it won't work?

[info]ghewgill : Re: Bug Report
2004-04-06T04:59:23Z
It looks like your guess is right - your RSS feed contains two é and a ö which were not properly UTF-8 encoded. These appear within song names, and I guess that's what XJournal inserted. Since I manually edited the cached copy of your journal, it will work for about the next hour or so until the cached copy expires. I could try to preprocess the RSS data to try to fix this sort of encoding error before the XML parser has a chance to complain. Perhaps I'll run some tests to see how prevalent this kind of encoding error is.
[info]kharin
2004-04-06T15:22:53Z
Hmm. Any chance this could be adapted so that it would take rss/atom feeds from locations over than LJ?
[info]ghewgill
2004-04-07T04:36:37Z
Sure, I posted a link to the source code a couple of comments up. It should be easy to modify to pull text from an alternate source.
[info]dron48
2004-04-19T12:24:59Z
it just cool!
it is difficuld to make some like this, is it?
(anonymous) : Занимательно
2011-02-21T01:20:30Z
Занимательно. Отлично написанно.
Greg Hewgill <greg@hewgill.com>