/EMAILHISTORY

http://github.com/deltaluca/flappy · #! · 497 lines · 382 code · 115 blank · 0 comment · 0 complexity · f1488137518d0ac1e79f0d18840c5479 MD5 · raw file

  1. Related work on AI engines:
  2. http://sites.google.com/site/diplomacyai/albert
  3. http://www.diplom.org/~diparch/resources/postal/artificial.htm
  4. http://www.daide.org.uk/wiki/Main_Page
  5. www.tofgarion.net/lectures/XX201/diplomacy/master-haard.pdf
  6. ------------------------------------------------------------------------------
  7. After writing the rest of the parsing rules (though without ast
  8. generation)
  9. I came across a few tokens which did not exist.
  10. And it turns out that the DAIDE homepage is a bit outdated....
  11. the low-level protocol on daide homepage is r13, whilst i have found
  12. an r14 on another page which has a few changes.
  13. Mainly old tokens that exists no longer exist, or new ones have been
  14. created (that are used int he grammar) with one token being replaced
  15. with another using the same 16bit value.
  16. Even with r14 however, there are two tokens which the r14 protocol
  17. document still do not give a value for (these being the tokens NAR and
  18. CCL) which are used int he grammar. how shit.
  19. :)
  20. reference for more recent protocol document:
  21. http://www.ellought.demon.co.uk/dipai/
  22. I'm hoping there is not some big changes should we fine a more recent
  23. one again.
  24. the syntax pdf is dated 7th march 2010. The r13 protcol document was
  25. dated 2003, and the r14 one that i've now found is dated 2006.
  26. ------------------------------------------------------------------------------
  27. Hey guys, I believe the next 2 weeks aren't going to be so bad as
  28. *most* courseworks are done and most of us have finished with
  29. interviews. We need some maximum effort so that we can actually hand
  30. in something that works. I know your futures important and shit but
  31. you have to complete the task in hand too! So when we mentioned last
  32. time that we should meet up every day, lets actually do that shall we?
  33. We don't have that many lectures this term so I know for a fact that
  34. everyone should have some spare time every day to get into labs and do
  35. some work. Also I know all of you guys have PMT's at lunch once a
  36. week, but I expect to see all of you in labs at lunchtime whenever you
  37. don't have PMT sessions. Obviously if you have commitments like
  38. societies and whatnot then that's fine, but I want to see at least 3
  39. people working together at one time. We definitely need more help on
  40. the server side of things and with the AI, it's mainly been Slemi so
  41. far (when I say mainly, I mean totally with a bit of help here and
  42. there) otherwise we'll end up handing in something that's most likely
  43. not going to work (or if it does work it would probably be shit). So
  44. in reply to this, I want everyone to tell me what you're schedule is
  45. like for the next 2 weeks. I.e. in terms of interviews, societies,
  46. other commitments and shit so I can try and get everyone together as
  47. often as possible (basically at least once a day). Furthermore, we
  48. need to start planning out our last report (I believe Luke's started
  49. that) and plan out what we're going to include in our hand in on the
  50. 11th. Then we can start thinking about how we can add on to it during
  51. christmas. Personally, I'm not really up for doing a huge amount of
  52. work over christmas as that's the only time us guys on the MEng course
  53. can really get some rest before diving back into term 2 and then
  54. placements straight after, but if it has to be done it will be done!
  55. Lets do this guys, if we all put some effort into this (as much effort
  56. as we can fit in) then I'm sure we can get a decent mark!
  57. Cliff 'I'm fucking so psyched up right now' Sun
  58. p.s. any questions or anything you wanna say, then please don't
  59. hesitate to reply
  60. --------------------------------------------------------------------------------
  61. also as i said in the commit message dumbbot does not work yet and
  62. runs out of memory very quickly. i indicated in the code with -- ERROR
  63. which chunk is causing it. if the code seems obscure please download
  64. the original dumbbot source and see DumbBot.cpp's
  65. generate_move_ish_function. it is mostly that except for their garbage
  66. sampler (we just square the weights and sample, they do weird
  67. retry-in-a-loop stuff) Slemmer, Andras [Reply All] Inbox 28 December
  68. 2011 10:54 AM Yes, sorry for that i was in a bit of a rush. shuffleM
  69. was added by me, i emailed the maintainer to add it but it will take a
  70. while i guess please do this (replace PROJECT_DIR) after git pull:
  71. cabal install random-shuffle cd
  72. ~/.cabal/packages/hackage.haskell.org/random-shuffle/0.0.2/ tar -zxf
  73. random-shuffle-0.0.2.tar.gz cp PROJECT_DIR/random-shuffle.patch .
  74. patch -p0 -i random-shuffle.patch cd random-shuffle-0.0.2 cabal
  75. install
  76. should work afterwards Feladó: Luke Tomlin [lt609@doc.ic.ac.uk]
  77. Küldve: 2011. december 28. 10:11 Címzett: Slemmer, Andras Tárgy: Re:
  78. guys...
  79. Hey,
  80. Thanks for that Slemi. I see you've done a hearty chunk of dumbbot as
  81. well! Awesome. However I'm having some compilation errors (sidenote:
  82. everything installs fine on my desktop once I've installed ghc 7.2.2,
  83. it only doesn't work on my laptop for some reason. No idea why).
  84. The compiler says the function shuffleM is not in scope. Whereabouts
  85. is this function defined?
  86. Hope you have a good holiday, wherever it might be!
  87. Cheers,
  88. Luke
  89. 2011/12/27 Slemmer, Andras <andras.slemmer09@imperial.ac.uk> hey,
  90. getAdjacentNodes is a MonadGameKnowledge monad computation and you are
  91. using it in a list comprehension which doesnt work. remember (=<<)'s
  92. lhs must return a monad computation. it deduces that "(intersect
  93. ...)" returns a list, therefore =<<'s monad has to be the list monad
  94. which is not the case. try using mapM over friendlies instead of a
  95. list comprehension!
  96. Feladó: Luke Tomlin [lt609@doc.ic.ac.uk] Küldve: 2011. december
  97. 27. 22:06 Címzett: Slemmer, Andras Tárgy: Re: guys...
  98. Hi Slemi,
  99. Quick question before you leave tomorrow and we all start screaming
  100. and dying,
  101. I have
  102. -- possible support moves let supportMoves = concat [map ((SupportMove
  103. unitPos otherUnit).provNodeToProv) $ (intersect adjacentNodes) =<<
  104. (getAdjacentNodes otherUnit) | otherUnit <- friendlies]
  105. Which probably doesn't work because I've done something wrong, but I
  106. get the error:
  107. Could not deduce (MonadGameKnowledge h0 []) arising from a use of
  108. `getAdjacentNodes' from the context (OrderClass o, MonadBrain o
  109. m, MonadGameKnowledge h m) bound by the type signature for
  110. genLegalOrders :: (OrderClass o, MonadBrain o m,
  111. MonadGameKnowledge h m) => Map.Map UnitPosition [OrderMovement]
  112. -> UnitPosition -> m (Map.Map UnitPosition [OrderMovement]) at
  113. Diplomacy/AI/SkelBot/Common.hs:(217,1)-(248,49)
  114. Could you shed any light on this? The code is commented out in
  115. Common.hs
  116. Thanks,
  117. Luke
  118. 2011/12/26 Slemmer, Andras <andras.slemmer09@imperial.ac.uk> Slemmer,
  119. Andras [Reply All] Inbox 26 December 2011 5:58 PM happy <relevant
  120. winter solstice substitute>!
  121. i'll be away from 28th until the 4th. i will try to implement dumbbot
  122. till the 28th. what i suggest is either 1. extend dumbbot to include
  123. diplomatic arrangements (add weightings according to allies/enemies
  124. plus some (possibly random *wink wink*) way of initiating these)
  125. 2. implement a novel ai. (in this case you actually have to start
  126. WORKING) either way please PLEASE ask now if you dont understand
  127. something, because i wont be available later.
  128. i added a Makefile in the root dir. 'make' will start the server and
  129. a gui observer instance. in the observer you will have to connect
  130. with the cli. you can do this either by "connect localhost 16713"
  131. "daide OBS" or "run try"
  132. once the observer is connected you can launch 7 randombots with 'make
  133. random7' see the Makefile for details on how this works and how you
  134. should add your own test make targets.
  135. in order for the above to work you have to have a wine-installed DAIDE
  136. server and a compiled (UserClient) observer. DAIDE server:
  137. http://www.ellought.demon.co.uk/dipai/aiserver.msi then use msiexec
  138. (comes with wine) to install UserClient: ask Luca, and/or see
  139. haxe_setup for details (btw Luca in the script the envs that are added
  140. to ~/.haxenv should also be exported before installing! forgot to push
  141. sry!)
  142. i'm also working on a branch that refactors the protocol with
  143. parsec(!) which is much neater and is something we can talk about if
  144. we wont have stuff to present. (it parses the stream into a
  145. computation how cool is that)
  146. another thing you want to implement is a general sanity checker for
  147. the moves in SkelBot.hs, this is very useful if you want to have more
  148. verbose er
  149. ---------------------------------------------------------------------------
  150. Hey,
  151. The report is due on the 9th. We also need to do some stuff for the
  152. presentation, but we can probably hack that together in a few hours so
  153. that's as much of a problem.
  154. I don't want to spend too long on this pattern weight thing, because
  155. obviously our main aim is to get some negotiation going on, so I'll
  156. probably have a fiddle around with that tomorrow or the day
  157. after. Unfortunately I think we're going to have to start from
  158. scratch, and Slemi isn't around to guide us through how the messaging
  159. works, if it works at all, so that'll be interesting. I guess if we
  160. can devise a clever method to handle it all though (ie. plan out in
  161. detail exactly how it will be done and implemented) then we can just
  162. code it off of that specification without too much problem when
  163. everyone is back. Plus those plans will be useful to put in the report
  164. I suppose.
  165. Let me know if anything crops up, I'm going to try to talk to Cliff
  166. too and get a plan going for when everyone is back.
  167. Cheers,
  168. Luke Hueser, Matthias Hey, unfortunately our internet was down
  169. (provider change) from 28th til today.. lll be back in London on the
  170. 3rd. When is the report due? Maybe I can finish that and work on the
  171. negotiation side as we go along. For the negotiation I thought about
  172. having a Fri 30/12 Luke Tomlin [luke.tomlin@googlemail.com] [Reply]
  173. [Reply All] [Forward] Actions To: Hueser, Matthias Inbox 28 December
  174. 2011 11:43 AM You replied on 30/12/2011 9:29 AM. Hi Matthias,
  175. Just thought I'd get a quick catch-up with you on the project - as
  176. you're probably aware, Slemi is now away until the 4th January, so
  177. it's up to the four of us now to get as much done as possible so the
  178. last week isn't too disastrous! I've started work on the
  179. pattern-weights learning bot recently, and most of the metrics/helper
  180. functions are down. The only hard bit now would be making the database
  181. and stuff, but I'm hoping to get that done over the next day or
  182. two. Hopefully it won't be too difficult!
  183. I'm hoping to make it as general as possible, so maybe we can apply it
  184. to negotiation too. I think the major thing at the moment is to get
  185. started with *something* on the negotiation side. Slemi mentioned (and
  186. it's in use in some of the other negotiation bots) a method that used
  187. a scoring mechanism for each province, and the messages would be sent
  188. to the enemy players based on their location around these provinces
  189. and the relative weightings.
  190. If we can get a plan of what needs to be done over the next week and a
  191. half, hopefully we'll have something good by the end of it! If by the
  192. time we're finished we have holdbot, randombot, dumbbot, patternweight
  193. (non-negotiation), negotation, and maybe some other flavour of
  194. negotiation, we'll probably be well set :D
  195. Let me know what you think. I'm trying to work on little bits of the
  196. report to in between.
  197. Hope your holidays are good!
  198. Cheers,
  199. Luke
  200. ----------------------------------------------------------------------
  201. Yo Matthias,
  202. Shall we meet in labs tomorrow morning, earlyish? Can you do 9 or is
  203. that too early? If we can write some detailed specifications for
  204. negotiation AI, we can pass them on to Slemi and he can hopefully work
  205. his magic.
  206. Cheers,
  207. Luke Sun, Cliff [Reply] [Reply All] [Forward] Actions To: Slemmer,
  208. Andras Cc: Hueser, Matthias‎; Tomlin, Luke‎; Deltodesco, Luca Deleted
  209. Items 03 January 2012 8:05 PM I'll be in labs from the 5th onwards,
  210. Matthias and Luke, will you guys be there? Slemmer, Andras [Reply
  211. All] Deleted Items 03 January 2012 8:01 PM sorry for the confusion i'm
  212. not in london yet (will be at 9th), only back from the 5dayish holiday
  213. during which i was internetless. Hueser, Matthias [Reply All] Sent
  214. Items 03 January 2012 7:11 PM Back in London.. can we meet in the labs
  215. tomorrow?
  216. Matthias Hueser Department of Computing Imperial College London
  217. E-mail: mh2308@doc.ic.ac.uk Mobile: 07833543707
  218. Slemmer, Andras [Reply All] Deleted Items 03 January 2012 6:52 PM Ok,
  219. back from holiday earlier.
  220. Cliff, Luke, Matthias:
  221. http://bbsimg.ngfiles.com/1/21840000/ngbbs4c9f9269c2bbd.jpg i finished
  222. dumbbot and i was hoping by now we could test it against our own
  223. ai... not exactly the case, i frankly don't really see the ai part
  224. getting anywhere. as you've done some reading on existing ais i was
  225. hoping you could suggest one that can be implemented in the little
  226. time we have left. i don't really expect coding cooperation at this
  227. point, but it would be very helpful if you could send me
  228. papers/sources on the chosen ai.
  229. Luca: Can you please add display of province IDs(numbers not tokens)
  230. to the GUI, it would be most helpful for debugging. It would also be
  231. nice to have pausing-rewinding.
  232. this is a shit email. i hate shitty emails slemi
  233. -----------------------------------------------------------------------
  234. Hi guys,
  235. Had a cheeky peek around and found these past reports from other
  236. years,
  237. 2005
  238. 2008
  239. 2009
  240. All three groups based their tactical bot around DumbBot, and used
  241. Daide. They operated a (very) watered down version of negotiation. I
  242. believe all three bots were written in Java.
  243. Some comments:
  244. 2005 Their negotiation worked purely on support moves - a list of
  245. 'useful supports' for each power is generated, in order of
  246. preference. The list is then iterated through, until one of the
  247. suggestions is accepted. If none are accepted, the list is iterated
  248. through again, but this time with a free 'IOU' attached to it, This
  249. continues until the other power accepts or the 'cost' of the IOUs is
  250. too high.
  251. 2008 Their end-product negotiation was not fully fledged, they had
  252. started to implement some form of trust and back stabbing mechanism
  253. based around a 'banking' metaphor, but then ran short of time.
  254. 2009 This paper looks the most advanced. Haven't read it all yet
  255. though.
  256. If we can even get a small subset of negotiation going on (such as
  257. making alliances ie. not attacking specific units if 'allied' with
  258. them, or backstabbing and trust and such) then we'll certainly be on
  259. par with these past papers, as we have a LOT to talk about with the
  260. framework and individual bots and game structure, as well as a
  261. fancy-ass GUI.
  262. Cool.
  263. Luke
  264. -------------------------------------------------------------------------
  265. Hey, just writing parts of the report and happy that I have some
  266. Diplo/AI I can write about in the 'our own contribution' - part of the
  267. report.
  268. @Luca Can I have an update on the GUI client? For two reasons a) There
  269. will be a chapter on it in the final report. b) Some screenshots of a
  270. clean map and some typical game situations (attack, standoff, convoy)
  271. would be handy for the 'Background section of the report'. Then I
  272. would not need to draw arrows and check copyright of web screenshots
  273. for inclusions.
  274. -----------------------------------------------------------
  275. >>First develop an ai that is 'aware' of groups, that is it considers
  276. them to be one body when making moves. This will ?>>default to every
  277. man for himself. Later this "grouping" should be weakened to
  278. one-to-one relationship strength and >>volatility between powers (we
  279. will also need a "certainty" metric to account for imperfect knowledge
  280. ofc) The certainty metric can be derived from a -character database-
  281. that is constructed from an analysis of previous moves. For instance
  282. if agreement and actual behaviour consistently diverge for some power
  283. we give them a low rating for trustworthiness.. Also we need to take
  284. into account what relationship our group has with the other power's
  285. group. If they acted as an adversary in the past, lying and
  286. back-stabbing is likely. Higher-level measures like volatility can be
  287. learned during the game and across games (we can simply measure the
  288. average duration a group persists). For other things there is a body
  289. of knowledge on the Diplomacy AI portal, which we can crawl through
  290. when the above is running.
  291. >>Second develop a 'passive' ai that does not actively seek alliances
  292. >>or propose orders, only reacts to other proposals >> and updates its
  293. >>knowledge (recalc. moves). Note that we will need external
  294. >>negotioating ais for testing this, we should >> get our hands on
  295. >>java clients to test on linux. This passive ai should also try to
  296. >>infer alliances based on attacks and potential conflicts that were
  297. >>avoided (simply >>looking at who attacked who would instantly make
  298. >>England and Turkey allies)
  299. This is the problem of reconciling results from our own Learning
  300. component and other powers suggestions. In my opinion acting upon
  301. other peoples suggestions should only be done if they are uttered in
  302. the context of a long-term strategy. I think this must be possible
  303. with the DAIDE syntax but only at higher press levels. If such a
  304. strategy is cross-checked for sound-ness (i.e. we are not leaving own
  305. supply centres exposed as a result and we are not neglecting easier
  306. targets as a result) positively we approve future move suggestions
  307. that are associated with this strategy. If such strategies succeed we
  308. can make note of the fact that the initiating power is clever and
  309. adopt their strategies / join their group more often in the future. I
  310. agree that we need to have some experiments. it would look against
  311. previous projects because they had few validation.
  312. Inferring alliances is a great idea, I wonder if there are formal
  313. alliances sealed by a DAIDE message, did not have time to read the
  314. relevant parts yet. However we cannot observe communication between
  315. other powers so inferring is well important.
  316. >> Problem-solving / planning as a team
  317. I liked Lukes idea that we consider our current group as a team and
  318. pretend that we can control all units to formulate a sound
  319. strategy. This works best with powers that are known to follow or
  320. suggestions / orders (hehe) and is fragile otherwise. On another level
  321. this also helps us to find worthy groups to enter, by pretending that
  322. they are in our group.
  323. >> The basis should be a shortest-path-tree on the map. Then for each
  324. >> of our supply centres we should make a list of pairs/map/whatever
  325. >> of (no. of turns)->(no. of units) for each group. This would
  326. >> describe how many turns it would take for the group to reach the sc
  327. >> with that many units. Then >>use this to calculate a raw "threat"
  328. >> metric for each sc. This should be weighted based on the knowledge
  329. >> of groups, meaning that if we are in group A then B will most
  330. >> likely >>not use all their units to attack us but also to attack C,
  331. >> and will divide its units based on how powerful A and C is. Later
  332. >> this should be refined with reasoning about where each enemy unit
  333. >> is, that is which enemy groups they are >>closest to. For the
  334. >> offensive side of things a quite old paper suggested using a
  335. >> Dynamic-Programming type approach. Commit one unit to a certain
  336. >> move / move-sequence and optimize on the success that can be
  337. >> reached with the remaining units. We should aim to quantify success
  338. >> of a move or a move-sequence: I think Luke already has code to
  339. >> measure aspects (by aspect I mean a function from the game state to
  340. >> a real number) of the map, these work like heuristics on a chess
  341. >> position. So the sucbe even better if we first developed a passive
  342. >> ai that is only considering enemy groups and will never be in one
  343. >> itself, as being in a group requires additional thought (for
  344. >> example as you pointed out an ally getting an sc is not necessarily
  345. >> increasing our utility)
  346. My thoughts on the negotiationless ai:
  347. The basis should be a shortest-path-tree on the map. Then for each of
  348. our supply centres we should make a list of pairs/map/whatever of
  349. (no. of turns)->(no. of units) for each group. This would describe how
  350. many turns it would take for the group to reach the sc with that many
  351. units. Then use this to calculate a raw "threat" metric for each sc.
  352. This should be weighted based on the knowledge of groups, meaning that
  353. if we are in group A then B will most likely not use all their units
  354. to attack us but also to attack C, and will divide its units based on
  355. how powerful A and C is. Later this should be refined with reasoning
  356. about where each enemy unit e we can come up with something.
  357. If we can implement these, we'll certainly have something to show and
  358. it won't differ hugely from what has been done in previous
  359. years. Except it won't be in Java. And we'll be submitting something
  360. waaay bigger than one AI. And it won't be in Java.
  361. Let me know what you guys think, or ideas on how to implement the
  362. little bits,
  363. Luke
  364. =========================================================================
  365. Hallo,
  366. So some development on inference ideas...
  367. So I'm in the process of writing some functions and stuff based off of
  368. what we spoke about yesterday via email, specifically on the stuff to
  369. do with passive inference of other enemies alliances, and then
  370. hopefully the part that determines if a proposed order set from
  371. another enemy is a worthwhile move.
  372. Hopefully if we can create enough of these negotiation tools, we'll
  373. have lots to talk about in the presentation and report even if they're
  374. not implemented in a functional AI. They can be put in as the 'future
  375. developments' section, possibly. Then during the presentations we
  376. could perhaps show how the 'trust' values change throughout play and
  377. discuss how it would influence negotiation in gameplay.
  378. The informal spec and the skeleton/partial implementation of the first
  379. half is sitting in the Bots folder because I couldn't think of a good
  380. place to put it.
  381. Let me know if you have any questions/improvements,
  382. Luke