Stini wrote: ↑22 Jul 2019, 16:54
Yeah, I have a very naive reward function at the moment, which essentially just gives some reward for getting closer to the next closest apple/flower as quickly as possible, so most of the internals are not feasible at the moment. It's really challenging to make it explore properly and come up with new and different styles. The model is indeed trained for each level separately right now, but I will try to make a more general AI in the future, which could play multiple and more challenging levels.
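For anyone curious what such a "naive" shaped reward might look like, here's a minimal sketch: reward the agent whenever it reduces its distance to the nearest uncollected apple/flower. All names and the distance metric here are my own illustration, not Stini's actual code.

```python
import math

def shaped_reward(bike_pos, targets, prev_dist):
    """Dense shaping reward: positive when the bike moves toward the
    nearest remaining apple/flower, negative when it moves away.
    bike_pos: (x, y); targets: list of (x, y) for uncollected items;
    prev_dist: distance to the nearest target on the previous step."""
    if not targets:
        return 0.0, prev_dist
    # Distance to the closest remaining target (straight-line, crude)
    dist = min(math.hypot(bike_pos[0] - tx, bike_pos[1] - ty)
               for tx, ty in targets)
    reward = prev_dist - dist  # progress made since the last step
    return reward, dist
```

This kind of shaping explains the behaviour described above: it pushes the agent along the shortest greedy path, so it never has an incentive to explore detours or alternative styles.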
Lol I was waiting for someone to try it out! Very interesting indeed!
Stini, how do you interact with the game? Is there a way to run it much faster than realtime (like in most reinforcement learning projects)? Do you grab the screen pixels at all, or does it train from a low-dimensional representation?
If you have access to the Elma source code this should be doable; otherwise, I'm curious how you manage it.
Also, do you run more than one copy of Elma at a time during training?
Other questions: what RL framework do you use? Is it something open-source like OpenAI Baselines, or is this your own implementation?
What kind of RL algorithm do you use? A2C, PPO, DQN?
Having worked with RL myself, I can say this is an extremely challenging problem for contemporary exploration algorithms. For non-trivial levels it basically amounts to an implicit traveling salesman problem.
I think it might be possible to make progress on harder levels by combining some kind of search over apple sequences and routes with RL producing the actual actions for each route. This might even have some style-finding potential on shorter levels.
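To make the search idea concrete, here's a toy sketch of the outer loop: brute-force over apple orderings (feasible only for small apple counts), scored by straight-line route length as a crude proxy. In a real system each candidate order would instead be scored by letting the RL policy attempt it; everything here is hypothetical illustration.

```python
import itertools
import math

def route_length(start, order):
    """Total straight-line length of visiting points in the given order.
    A crude proxy for route cost; a real system would score each order
    by actually letting the RL policy try to execute it."""
    total, pos = 0.0, start
    for point in order:
        total += math.dist(pos, point)
        pos = point
    return total

def best_apple_order(start, apples, flower):
    """Brute-force search over apple sequences, ending at the flower.
    The RL policy would then be trained to execute the chosen route."""
    best = min(itertools.permutations(apples),
               key=lambda order: route_length(start, list(order) + [flower]))
    return list(best)
```

For many apples you'd obviously need something smarter than permutations (TSP heuristics, beam search over partial routes), but the division of labour is the point: a discrete planner picks the apple sequence, and RL only has to learn the low-level driving for each leg.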
Totally agree with Milagros though, if bug bounce is patched this is even more interesting!
Also, very surprised at how human the recs look! They totally look like recs from an amateur player who has never seen a professional rec.