Genetic algorithm and Tetris

Asked 13/10, 2011 at 22:55 Answered 14/10, 2011 at 18:26

Im creating a Tetris player using genetic algorithms, and facing some issues. I've read a lot of related works, but they don't give me enough details on the GA.

The problem is that my agent seems to get stucked very fast...Im using a evaluation function take covers 4 features:height, covered holes, flatness and number of cleared rows. I read some paper that uses the same evaluation, and is capable of doing thousands of rows.

After 600 generations, with a population of 100 agents, the best one is capable of doing only 260 rows on average, that's lame. All agents are playing the same piece sequence.

Details of my GA:

generations:600 population:100

genes: Array of 4 float values, between 0 and 1.

Uniform crossover happens at a certain probability, and swaps genes between two parents, with a certain probability.

Mutation occurs at a certain probability, here i've tried 3 different approaches:swap genes, replace the gene with a random value, or add some noise value to the gene.

I have a elite rate of 50%, and noticed that some good agents are being selected and given birth to worse agents, contaminating the population.

Selection is roulette wheel...

If someone could give me details on the best way to crossover and mutate, i appreciate!

Thanks, and sorry for the long post!

Conveyor answered 13/10, 2011 at 22:55 Comment(6)

questions: 1) how are you mapping genes to to behavior? and 2) how many separate test runs are in your test set? – Darwen 14/10, 2011 at 2:12

Hi, thanks for the comment.Each gene is a number between 0.0 and 1.0, representing the weight of some particular feature, like the number of holes in the board. Now i switched to Steady-state GA, and it seems to works better, a least its faster. Im not doing runs until it hits a least 2000 lines...i think the problem is in the mutation/crossover... – Conveyor 14/10, 2011 at 2:43

Hmmm, still not getting it. Can you share a link to that paper? – Darwen 14/10, 2011 at 2:54

That the problem, those papers don't give the details of the GA functions, but they have used only 4 genes and steady state GA.But here is one google.com.br/… – Conveyor 14/10, 2011 at 3:0

OK, I see. But you don't have a "bumpiness" factor, like they did in the paper. – Darwen 14/10, 2011 at 4:4

Yes i have, i call it flatness ...i think i must implement a binary chromossome of 1's and 0's, and translate that into a real number, it seems like they did that. – Conveyor 14/10, 2011 at 4:11

There seems to be some difference in the evaluation functions. You describe four features:

height,
covered holes,
flatness and
number of cleared rows

However, the paper you reference describes five features:

The function that the agent uses to determine the utility of a board’s state is the weighted linear sum of numerical features calculated from the state. Colin Fahey’s agents used these features: pile-height, the number of closed holes, and the number of wells (Fahey 2003). The features that we added are the number of lines that were just made, and a number that represents how “bumpy” the pile is.

(emphasis mine)

So it appears that you are missing the "wells" feature in your evaluation function and the makeup of the genes.

Darwen answered 14/10, 2011 at 18:26 Comment(2)

These 4 seems to be the most important, actually i have several others, including those cited on the paper. Im confused in how to do continuous GA, thats the whole problem...do you have any suggestions on that? Now i'm using 4 features, and 4 float values for each feature, so my chromossome is an array[16] of floats...no improvements made yet! My cpu is burning. – Conveyor 14/10, 2011 at 18:57

Tracking the wells is important because only an "I" can fill one without making a closed-hole (which are hard to get rid of). Try it with all five of the features & genes and see if that doesn't fix your problem. – Darwen 14/10, 2011 at 19:2

Unlike the paper, you should implement the 'next piece' aspect of the game.

Simulate all possible 'current piece' placements followed by 'next piece' before calculating the 'utility'.

For the sake of performance you can cache the 'next piece' placements for the best 'utility' so that they don't need to be recalculated as 'current piece' placements again.

While the calculations will be slower I believe your agents will evolve faster/smarter.

Descartes answered 14/10, 2011 at 13:5 Comment(1)

Yes, im planning to add this as well. But i know it can play very well without that, so ill try some other encoding scheme and see what happens.Thanks! – Conveyor 14/10, 2011 at 17:6

Recommended topics

Hot tags