PageRenderTime 151ms CodeModel.GetById 23ms RepoModel.GetById 0ms app.codeStats 0ms

/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_1.txt

https://bitbucket.org/evan13579b/soar-ziggurat
Plain Text | 16394 lines | 15674 code | 720 blank | 0 comment | 0 complexity | 79b9c8d1fecac5a1c13676e25c2fe8cf MD5 | raw file
Possible License(s): BSD-3-Clause
  1. Seeding... 1
  2. dir: dir isL
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 1 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_1.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-sleeping...
  20. /|\-/|sleeping...
  21. \1: O: O1 (predict-yes)
  22. I see 0 and I'm going to do: predict-yes
  23. ENV: Agent did: predict-yes for direction L in state State-A
  24. In State-A moving L
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  26. predict error 1
  27. dir: dir isU
  28. rule alias: '*'
  29. rule alias: '*'
  30. -/|\-/|2: O: O4 (predict-no)
  31. I see 0 and I'm going to do: predict-no
  32. ENV: Agent did: predict-no for direction U in state State-A
  33. In State-A moving U
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  35. predict error 0
  36. dir: dir isU
  37. \-3: O: O5 (predict-yes)
  38. I see 1 and I'm going to do: predict-yes
  39. ENV: Agent did: predict-yes for direction U in state State-A
  40. In State-A moving U
  41. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  42. predict error 1
  43. dir: dir isL
  44. /|\4: O: O7 (predict-yes)
  45. I see 0 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction L in state State-A
  47. In State-A moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  49. predict error 1
  50. dir: dir isR
  51. -/|5: O: O10 (predict-no)
  52. I see 0 and I'm going to do: predict-no
  53. ENV: Agent did: predict-no for direction R in state State-A
  54. In State-A moving R
  55. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  56. predict error 1
  57. dir: dir isR
  58. \-/6: O: O11 (predict-yes)
  59. I see 0 and I'm going to do: predict-yes
  60. ENV: Agent did: predict-yes for direction R in state State-B
  61. In State-B moving R
  62. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  63. predict error 1
  64. dir: dir isR
  65. |\7: O: O13 (predict-yes)
  66. I see 0 and I'm going to do: predict-yes
  67. ENV: Agent did: predict-yes for direction R in state State-B
  68. In State-B moving R
  69. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  70. predict error 1
  71. dir: dir isU
  72. -/|8: O: O16 (predict-no)
  73. I see 0 and I'm going to do: predict-no
  74. ENV: Agent did: predict-no for direction U in state State-B
  75. In State-B moving U
  76. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  77. predict error 0
  78. dir: dir isL
  79. \-/9: O: O18 (predict-no)
  80. I see 1 and I'm going to do: predict-no
  81. ENV: Agent did: predict-no for direction L in state State-B
  82. In State-B moving L
  83. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  84. predict error 1
  85. dir: dir isL
  86. |\-10: O: O20 (predict-no)
  87. I see 0 and I'm going to do: predict-no
  88. ENV: Agent did: predict-no for direction L in state State-A
  89. In State-A moving L
  90. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  91. predict error 0
  92. dir: dir isU
  93. /|\11: O: O22 (predict-no)
  94. I see 1 and I'm going to do: predict-no
  95. ENV: Agent did: predict-no for direction U in state State-A
  96. In State-A moving U
  97. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  98. predict error 0
  99. dir: dir isR
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. -12: O: O23 (predict-yes)
  105. I see 1 and I'm going to do: predict-yes
  106. ENV: Agent did: predict-yes for direction R in state State-A
  107. In State-A moving R
  108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  109. predict error 0
  110. dir: dir isU
  111. /|\13: O: O26 (predict-no)
  112. I see 1 and I'm going to do: predict-no
  113. ENV: Agent did: predict-no for direction U in state State-B
  114. In State-B moving U
  115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  116. predict error 0
  117. dir: dir isL
  118. -/|14: O: O28 (predict-no)
  119. I see 1 and I'm going to do: predict-no
  120. ENV: Agent did: predict-no for direction L in state State-B
  121. In State-B moving L
  122. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  123. predict error 1
  124. dir: dir isR
  125. \15: O: O30 (predict-no)
  126. I see 0 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction R in state State-A
  128. In State-A moving R
  129. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  130. predict error 1
  131. dir: dir isU
  132. -/16: O: O32 (predict-no)
  133. I see 0 and I'm going to do: predict-no
  134. ENV: Agent did: predict-no for direction U in state State-B
  135. In State-B moving U
  136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  137. predict error 0
  138. dir: dir isL
  139. |\-17: O: O33 (predict-yes)
  140. I see 1 and I'm going to do: predict-yes
  141. ENV: Agent did: predict-yes for direction L in state State-B
  142. In State-B moving L
  143. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  144. predict error 0
  145. dir: dir isU
  146. /|\18: O: O36 (predict-no)
  147. I see 1 and I'm going to do: predict-no
  148. ENV: Agent did: predict-no for direction U in state State-A
  149. In State-A moving U
  150. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  151. predict error 0
  152. dir: dir isU
  153. -/19: O: O38 (predict-no)
  154. I see 1 and I'm going to do: predict-no
  155. ENV: Agent did: predict-no for direction U in state State-A
  156. In State-A moving U
  157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  158. predict error 0
  159. dir: dir isL
  160. |\-/20: O: O39 (predict-yes)
  161. I see 1 and I'm going to do: predict-yes
  162. ENV: Agent did: predict-yes for direction L in state State-A
  163. In State-A moving L
  164. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  165. predict error 1
  166. dir: dir isL
  167. |\-21: O: O41 (predict-yes)
  168. I see 0 and I'm going to do: predict-yes
  169. ENV: Agent did: predict-yes for direction L in state State-A
  170. In State-A moving L
  171. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  172. predict error 1
  173. dir: dir isR
  174. /22: O: O43 (predict-yes)
  175. I see 0 and I'm going to do: predict-yes
  176. ENV: Agent did: predict-yes for direction R in state State-A
  177. In State-A moving R
  178. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  179. predict error 0
  180. dir: dir isU
  181. |23: O: O46 (predict-no)
  182. I see 1 and I'm going to do: predict-no
  183. ENV: Agent did: predict-no for direction U in state State-B
  184. In State-B moving U
  185. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  186. predict error 0
  187. dir: dir isR
  188. \-24: O: O47 (predict-yes)
  189. I see 1 and I'm going to do: predict-yes
  190. ENV: Agent did: predict-yes for direction R in state State-B
  191. In State-B moving R
  192. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  193. predict error 1
  194. dir: dir isL
  195. /|\25: O: O50 (predict-no)
  196. I see 0 and I'm going to do: predict-no
  197. ENV: Agent did: predict-no for direction L in state State-B
  198. In State-B moving L
  199. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  200. predict error 1
  201. dir: dir isR
  202. -/|26: O: O52 (predict-no)
  203. I see 0 and I'm going to do: predict-no
  204. ENV: Agent did: predict-no for direction R in state State-A
  205. In State-A moving R
  206. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  207. predict error 1
  208. dir: dir isL
  209. \-27: O: O53 (predict-yes)
  210. I see 0 and I'm going to do: predict-yes
  211. ENV: Agent did: predict-yes for direction L in state State-B
  212. In State-B moving L
  213. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  214. predict error 0
  215. dir: dir isL
  216. /|28: O: O55 (predict-yes)
  217. I see 1 and I'm going to do: predict-yes
  218. ENV: Agent did: predict-yes for direction L in state State-A
  219. In State-A moving L
  220. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  221. predict error 1
  222. dir: dir isR
  223. \-/29: O: O57 (predict-yes)
  224. I see 0 and I'm going to do: predict-yes
  225. ENV: Agent did: predict-yes for direction R in state State-A
  226. In State-A moving R
  227. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  228. predict error 0
  229. dir: dir isR
  230. |\30: O: O60 (predict-no)
  231. I see 1 and I'm going to do: predict-no
  232. ENV: Agent did: predict-no for direction R in state State-B
  233. In State-B moving R
  234. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  235. predict error 0
  236. dir: dir isL
  237. -/|\sleeping...
  238. -31: O: O61 (predict-yes)
  239. I see 1 and I'm going to do: predict-yes
  240. ENV: Agent did: predict-yes for direction L in state State-B
  241. In State-B moving L
  242. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  243. predict error 0
  244. dir: dir isL
  245. /32: O: O63 (predict-yes)
  246. I see 1 and I'm going to do: predict-yes
  247. ENV: Agent did: predict-yes for direction L in state State-A
  248. In State-A moving L
  249. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  250. predict error 1
  251. dir: dir isL
  252. |\-33: O: O65 (predict-yes)
  253. I see 0 and I'm going to do: predict-yes
  254. ENV: Agent did: predict-yes for direction L in state State-A
  255. In State-A moving L
  256. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  257. predict error 1
  258. dir: dir isR
  259. /|\34: O: O67 (predict-yes)
  260. I see 0 and I'm going to do: predict-yes
  261. ENV: Agent did: predict-yes for direction R in state State-A
  262. In State-A moving R
  263. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  264. predict error 0
  265. dir: dir isL
  266. -/35: O: O69 (predict-yes)
  267. I see 1 and I'm going to do: predict-yes
  268. ENV: Agent did: predict-yes for direction L in state State-B
  269. In State-B moving L
  270. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  271. predict error 0
  272. dir: dir isL
  273. |\-36: O: O72 (predict-no)
  274. I see 1 and I'm going to do: predict-no
  275. ENV: Agent did: predict-no for direction L in state State-A
  276. In State-A moving L
  277. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  278. predict error 0
  279. dir: dir isU
  280. /|37: O: O73 (predict-yes)
  281. I see 1 and I'm going to do: predict-yes
  282. ENV: Agent did: predict-yes for direction U in state State-A
  283. In State-A moving U
  284. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  285. predict error 1
  286. dir: dir isR
  287. \-/38: O: O76 (predict-no)
  288. I see 0 and I'm going to do: predict-no
  289. ENV: Agent did: predict-no for direction R in state State-A
  290. In State-A moving R
  291. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  292. predict error 1
  293. dir: dir isR
  294. |\-39: O: O77 (predict-yes)
  295. I see 0 and I'm going to do: predict-yes
  296. ENV: Agent did: predict-yes for direction R in state State-B
  297. In State-B moving R
  298. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  299. predict error 1
  300. dir: dir isL
  301. /|40: O: O79 (predict-yes)
  302. I see 0 and I'm going to do: predict-yes
  303. ENV: Agent did: predict-yes for direction L in state State-B
  304. In State-B moving L
  305. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  306. predict error 0
  307. dir: dir isU
  308. \-/41: O: O81 (predict-yes)
  309. I see 1 and I'm going to do: predict-yes
  310. ENV: Agent did: predict-yes for direction U in state State-A
  311. In State-A moving U
  312. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  313. predict error 1
  314. dir: dir isU
  315. |42: O: O84 (predict-no)
  316. I see 0 and I'm going to do: predict-no
  317. ENV: Agent did: predict-no for direction U in state State-A
  318. In State-A moving U
  319. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  320. predict error 0
  321. dir: dir isL
  322. \-/43: O: O85 (predict-yes)
  323. I see 1 and I'm going to do: predict-yes
  324. ENV: Agent did: predict-yes for direction L in state State-A
  325. In State-A moving L
  326. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  327. predict error 1
  328. dir: dir isL
  329. |\-44: O: O87 (predict-yes)
  330. I see 0 and I'm going to do: predict-yes
  331. ENV: Agent did: predict-yes for direction L in state State-A
  332. In State-A moving L
  333. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  334. predict error 1
  335. dir: dir isU
  336. /|\45: O: O90 (predict-no)
  337. I see 0 and I'm going to do: predict-no
  338. ENV: Agent did: predict-no for direction U in state State-A
  339. In State-A moving U
  340. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  341. predict error 0
  342. dir: dir isU
  343. -/46: O: O92 (predict-no)
  344. I see 1 and I'm going to do: predict-no
  345. ENV: Agent did: predict-no for direction U in state State-A
  346. In State-A moving U
  347. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  348. predict error 0
  349. dir: dir isU
  350. |\47: O: O94 (predict-no)
  351. I see 1 and I'm going to do: predict-no
  352. ENV: Agent did: predict-no for direction U in state State-A
  353. In State-A moving U
  354. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  355. predict error 0
  356. dir: dir isR
  357. -/48: O: O95 (predict-yes)
  358. I see 1 and I'm going to do: predict-yes
  359. ENV: Agent did: predict-yes for direction R in state State-A
  360. In State-A moving R
  361. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  362. predict error 0
  363. dir: dir isU
  364. |\-49: O: O98 (predict-no)
  365. I see 1 and I'm going to do: predict-no
  366. ENV: Agent did: predict-no for direction U in state State-B
  367. In State-B moving U
  368. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  369. predict error 0
  370. dir: dir isU
  371. /|\50: O: O100 (predict-no)
  372. I see 1 and I'm going to do: predict-no
  373. ENV: Agent did: predict-no for direction U in state State-B
  374. In State-B moving U
  375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  376. predict error 0
  377. dir: dir isL
  378. -/|\-/sleeping...
  379. |51: O: O102 (predict-no)
  380. I see 1 and I'm going to do: predict-no
  381. ENV: Agent did: predict-no for direction L in state State-B
  382. In State-B moving L
  383. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  384. predict error 1
  385. dir: dir isR
  386. rule alias: '*'
  387. rule alias: '*'
  388. \52: O: O103 (predict-yes)
  389. I see 0 and I'm going to do: predict-yes
  390. ENV: Agent did: predict-yes for direction R in state State-A
  391. In State-A moving R
  392. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  393. predict error 0
  394. dir: dir isU
  395. -/53: O: O106 (predict-no)
  396. I see 1 and I'm going to do: predict-no
  397. ENV: Agent did: predict-no for direction U in state State-B
  398. In State-B moving U
  399. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  400. predict error 0
  401. dir: dir isU
  402. |\-54: O: O107 (predict-yes)
  403. I see 1 and I'm going to do: predict-yes
  404. ENV: Agent did: predict-yes for direction U in state State-B
  405. In State-B moving U
  406. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  407. predict error 1
  408. dir: dir isR
  409. /|\55: O: O109 (predict-yes)
  410. I see 0 and I'm going to do: predict-yes
  411. ENV: Agent did: predict-yes for direction R in state State-B
  412. In State-B moving R
  413. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  414. predict error 1
  415. dir: dir isR
  416. -/|56: O: O111 (predict-yes)
  417. I see 0 and I'm going to do: predict-yes
  418. ENV: Agent did: predict-yes for direction R in state State-B
  419. In State-B moving R
  420. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  421. predict error 1
  422. dir: dir isL
  423. \-/57: O: O114 (predict-no)
  424. I see 0 and I'm going to do: predict-no
  425. ENV: Agent did: predict-no for direction L in state State-B
  426. In State-B moving L
  427. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  428. predict error 1
  429. dir: dir isL
  430. |\-58: O: O116 (predict-no)
  431. I see 0 and I'm going to do: predict-no
  432. ENV: Agent did: predict-no for direction L in state State-A
  433. In State-A moving L
  434. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  435. predict error 0
  436. dir: dir isU
  437. /|\59: O: O118 (predict-no)
  438. I see 1 and I'm going to do: predict-no
  439. ENV: Agent did: predict-no for direction U in state State-A
  440. In State-A moving U
  441. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  442. predict error 0
  443. dir: dir isR
  444. -/|60: O: O119 (predict-yes)
  445. I see 1 and I'm going to do: predict-yes
  446. ENV: Agent did: predict-yes for direction R in state State-A
  447. In State-A moving R
  448. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  449. predict error 0
  450. dir: dir isL
  451. \-61: O: O121 (predict-yes)
  452. I see 1 and I'm going to do: predict-yes
  453. ENV: Agent did: predict-yes for direction L in state State-B
  454. In State-B moving L
  455. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  456. predict error 0
  457. dir: dir isR
  458. rule alias: '*'
  459. rule alias: '*'
  460. rule alias: '*'
  461. rule alias: '*'
  462. rule alias: '*'
  463. rule alias: '*'
  464. rule alias: '*'
  465. rule alias: '*'
  466. rule alias: '*'
  467. rule alias: '*'
  468. rule alias: '*'
  469. rule alias: '*'
  470. /62: O: O123 (predict-yes)
  471. I see 1 and I'm going to do: predict-yes
  472. ENV: Agent did: predict-yes for direction R in state State-A
  473. In State-A moving R
  474. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  475. predict error 0
  476. dir: dir isU
  477. |\-63: O: O126 (predict-no)
  478. I see 1 and I'm going to do: predict-no
  479. ENV: Agent did: predict-no for direction U in state State-B
  480. In State-B moving U
  481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  482. predict error 0
  483. dir: dir isU
  484. /|\64: O: O128 (predict-no)
  485. I see 1 and I'm going to do: predict-no
  486. ENV: Agent did: predict-no for direction U in state State-B
  487. In State-B moving U
  488. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  489. predict error 0
  490. dir: dir isR
  491. -/65: O: O130 (predict-no)
  492. I see 1 and I'm going to do: predict-no
  493. ENV: Agent did: predict-no for direction R in state State-B
  494. In State-B moving R
  495. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  496. predict error 0
  497. dir: dir isR
  498. |\-66: O: O131 (predict-yes)
  499. I see 1 and I'm going to do: predict-yes
  500. ENV: Agent did: predict-yes for direction R in state State-B
  501. In State-B moving R
  502. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  503. predict error 1
  504. dir: dir isR
  505. /|\67: O: O133 (predict-yes)
  506. I see 0 and I'm going to do: predict-yes
  507. ENV: Agent did: predict-yes for direction R in state State-B
  508. In State-B moving R
  509. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  510. predict error 1
  511. dir: dir isU
  512. -/|68: O: O135 (predict-yes)
  513. I see 0 and I'm going to do: predict-yes
  514. ENV: Agent did: predict-yes for direction U in state State-B
  515. In State-B moving U
  516. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  517. predict error 1
  518. dir: dir isR
  519. \-/69: O: O138 (predict-no)
  520. I see 0 and I'm going to do: predict-no
  521. ENV: Agent did: predict-no for direction R in state State-B
  522. In State-B moving R
  523. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  524. predict error 0
  525. dir: dir isR
  526. |\-70: O: O140 (predict-no)
  527. I see 1 and I'm going to do: predict-no
  528. ENV: Agent did: predict-no for direction R in state State-B
  529. In State-B moving R
  530. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  531. predict error 0
  532. dir: dir isR
  533. /|\-71: O: O142 (predict-no)
  534. I see 1 and I'm going to do: predict-no
  535. ENV: Agent did: predict-no for direction R in state State-B
  536. In State-B moving R
  537. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  538. predict error 0
  539. dir: dir isL
  540. /72: O: O143 (predict-yes)
  541. I see 1 and I'm going to do: predict-yes
  542. ENV: Agent did: predict-yes for direction L in state State-B
  543. In State-B moving L
  544. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  545. predict error 0
  546. dir: dir isL
  547. |\-73: O: O145 (predict-yes)
  548. I see 1 and I'm going to do: predict-yes
  549. ENV: Agent did: predict-yes for direction L in state State-A
  550. In State-A moving L
  551. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  552. predict error 1
  553. dir: dir isU
  554. /|74: O: O148 (predict-no)
  555. I see 0 and I'm going to do: predict-no
  556. ENV: Agent did: predict-no for direction U in state State-A
  557. In State-A moving U
  558. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  559. predict error 0
  560. dir: dir isU
  561. \-75: O: O150 (predict-no)
  562. I see 1 and I'm going to do: predict-no
  563. ENV: Agent did: predict-no for direction U in state State-A
  564. In State-A moving U
  565. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  566. predict error 0
  567. dir: dir isR
  568. /|\76: O: O152 (predict-no)
  569. I see 1 and I'm going to do: predict-no
  570. ENV: Agent did: predict-no for direction R in state State-A
  571. In State-A moving R
  572. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  573. predict error 1
  574. dir: dir isR
  575. -/|77: O: O154 (predict-no)
  576. I see 0 and I'm going to do: predict-no
  577. ENV: Agent did: predict-no for direction R in state State-B
  578. In State-B moving R
  579. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  580. predict error 0
  581. dir: dir isL
  582. \-/78: O: O155 (predict-yes)
  583. I see 1 and I'm going to do: predict-yes
  584. ENV: Agent did: predict-yes for direction L in state State-B
  585. In State-B moving L
  586. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  587. predict error 0
  588. dir: dir isR
  589. |\-/79: O: O157 (predict-yes)
  590. I see 1 and I'm going to do: predict-yes
  591. ENV: Agent did: predict-yes for direction R in state State-A
  592. In State-A moving R
  593. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  594. predict error 0
  595. dir: dir isU
  596. |80: O: O160 (predict-no)
  597. I see 1 and I'm going to do: predict-no
  598. ENV: Agent did: predict-no for direction U in state State-B
  599. In State-B moving U
  600. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  601. predict error 0
  602. dir: dir isU
  603. \-/81: O: O161 (predict-yes)
  604. I see 1 and I'm going to do: predict-yes
  605. ENV: Agent did: predict-yes for direction U in state State-B
  606. In State-B moving U
  607. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  608. predict error 1
  609. dir: dir isR
  610. |82: O: O164 (predict-no)
  611. I see 0 and I'm going to do: predict-no
  612. ENV: Agent did: predict-no for direction R in state State-B
  613. In State-B moving R
  614. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  615. predict error 0
  616. dir: dir isU
  617. \-/83: O: O166 (predict-no)
  618. I see 1 and I'm going to do: predict-no
  619. ENV: Agent did: predict-no for direction U in state State-B
  620. In State-B moving U
  621. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  622. predict error 0
  623. dir: dir isL
  624. |\-84: O: O168 (predict-no)
  625. I see 1 and I'm going to do: predict-no
  626. ENV: Agent did: predict-no for direction L in state State-B
  627. In State-B moving L
  628. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  629. predict error 1
  630. dir: dir isR
  631. /|\85: O: O169 (predict-yes)
  632. I see 0 and I'm going to do: predict-yes
  633. ENV: Agent did: predict-yes for direction R in state State-A
  634. In State-A moving R
  635. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  636. predict error 0
  637. dir: dir isU
  638. -/86: O: O172 (predict-no)
  639. I see 1 and I'm going to do: predict-no
  640. ENV: Agent did: predict-no for direction U in state State-B
  641. In State-B moving U
  642. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  643. predict error 0
  644. dir: dir isR
  645. |\87: O: O174 (predict-no)
  646. I see 1 and I'm going to do: predict-no
  647. ENV: Agent did: predict-no for direction R in state State-B
  648. In State-B moving R
  649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  650. predict error 0
  651. dir: dir isR
  652. -/|88: O: O176 (predict-no)
  653. I see 1 and I'm going to do: predict-no
  654. ENV: Agent did: predict-no for direction R in state State-B
  655. In State-B moving R
  656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  657. predict error 0
  658. dir: dir isL
  659. \-/89: O: O177 (predict-yes)
  660. I see 1 and I'm going to do: predict-yes
  661. ENV: Agent did: predict-yes for direction L in state State-B
  662. In State-B moving L
  663. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  664. predict error 0
  665. dir: dir isR
  666. |\90: O: O179 (predict-yes)
  667. I see 1 and I'm going to do: predict-yes
  668. ENV: Agent did: predict-yes for direction R in state State-A
  669. In State-A moving R
  670. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  671. predict error 0
  672. dir: dir isU
  673. -/91: O: O182 (predict-no)
  674. I see 1 and I'm going to do: predict-no
  675. ENV: Agent did: predict-no for direction U in state State-B
  676. In State-B moving U
  677. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  678. predict error 0
  679. dir: dir isL
  680. |92: O: O183 (predict-yes)
  681. I see 1 and I'm going to do: predict-yes
  682. ENV: Agent did: predict-yes for direction L in state State-B
  683. In State-B moving L
  684. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  685. predict error 0
  686. dir: dir isU
  687. \-/93: O: O186 (predict-no)
  688. I see 1 and I'm going to do: predict-no
  689. ENV: Agent did: predict-no for direction U in state State-A
  690. In State-A moving U
  691. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  692. predict error 0
  693. dir: dir isU
  694. |\-94: O: O188 (predict-no)
  695. I see 1 and I'm going to do: predict-no
  696. ENV: Agent did: predict-no for direction U in state State-A
  697. In State-A moving U
  698. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  699. predict error 0
  700. dir: dir isU
  701. /|95: O: O190 (predict-no)
  702. I see 1 and I'm going to do: predict-no
  703. ENV: Agent did: predict-no for direction U in state State-A
  704. In State-A moving U
  705. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  706. predict error 0
  707. dir: dir isU
  708. \96: O: O191 (predict-yes)
  709. I see 1 and I'm going to do: predict-yes
  710. ENV: Agent did: predict-yes for direction U in state State-A
  711. In State-A moving U
  712. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  713. predict error 1
  714. dir: dir isU
  715. -/|97: O: O194 (predict-no)
  716. I see 0 and I'm going to do: predict-no
  717. ENV: Agent did: predict-no for direction U in state State-A
  718. In State-A moving U
  719. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  720. predict error 0
  721. dir: dir isR
  722. \-98: O: O196 (predict-no)
  723. I see 1 and I'm going to do: predict-no
  724. ENV: Agent did: predict-no for direction R in state State-A
  725. In State-A moving R
  726. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  727. predict error 1
  728. dir: dir isR
  729. /|\99: O: O198 (predict-no)
  730. I see 0 and I'm going to do: predict-no
  731. ENV: Agent did: predict-no for direction R in state State-B
  732. In State-B moving R
  733. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  734. predict error 0
  735. dir: dir isR
  736. -/|100: O: O200 (predict-no)
  737. I see 1 and I'm going to do: predict-no
  738. ENV: Agent did: predict-no for direction R in state State-B
  739. In State-B moving R
  740. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  741. predict error 0
  742. dir: dir isL
  743. \-/101: O: O201 (predict-yes)
  744. I see 1 and I'm going to do: predict-yes
  745. ENV: Agent did: predict-yes for direction L in state State-B
  746. In State-B moving L
  747. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  748. predict error 0
  749. dir: dir isU
  750. |\102: O: O203 (predict-yes)
  751. I see 1 and I'm going to do: predict-yes
  752. ENV: Agent did: predict-yes for direction U in state State-A
  753. In State-A moving U
  754. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  755. predict error 1
  756. dir: dir isR
  757. -/|103: O: O205 (predict-yes)
  758. I see 0 and I'm going to do: predict-yes
  759. ENV: Agent did: predict-yes for direction R in state State-A
  760. In State-A moving R
  761. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  762. predict error 0
  763. dir: dir isL
  764. \-104: O: O207 (predict-yes)
  765. I see 1 and I'm going to do: predict-yes
  766. ENV: Agent did: predict-yes for direction L in state State-B
  767. In State-B moving L
  768. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  769. predict error 0
  770. dir: dir isR
  771. /|\105: O: O209 (predict-yes)
  772. I see 1 and I'm going to do: predict-yes
  773. ENV: Agent did: predict-yes for direction R in state State-A
  774. In State-A moving R
  775. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  776. predict error 0
  777. dir: dir isR
  778. -/|106: O: O211 (predict-yes)
  779. I see 1 and I'm going to do: predict-yes
  780. ENV: Agent did: predict-yes for direction R in state State-B
  781. In State-B moving R
  782. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  783. predict error 1
  784. dir: dir isR
  785. \-107: O: O213 (predict-yes)
  786. I see 0 and I'm going to do: predict-yes
  787. ENV: Agent did: predict-yes for direction R in state State-B
  788. In State-B moving R
  789. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  790. predict error 1
  791. dir: dir isR
  792. /|108: O: O216 (predict-no)
  793. I see 0 and I'm going to do: predict-no
  794. ENV: Agent did: predict-no for direction R in state State-B
  795. In State-B moving R
  796. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  797. predict error 0
  798. dir: dir isR
  799. \-109: O: O218 (predict-no)
  800. I see 1 and I'm going to do: predict-no
  801. ENV: Agent did: predict-no for direction R in state State-B
  802. In State-B moving R
  803. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  804. predict error 0
  805. dir: dir isR
  806. /|\110: O: O220 (predict-no)
  807. I see 1 and I'm going to do: predict-no
  808. ENV: Agent did: predict-no for direction R in state State-B
  809. In State-B moving R
  810. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  811. predict error 0
  812. dir: dir isR
  813. -/111: O: O222 (predict-no)
  814. I see 1 and I'm going to do: predict-no
  815. ENV: Agent did: predict-no for direction R in state State-B
  816. In State-B moving R
  817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  818. predict error 0
  819. dir: dir isR
  820. |112: O: O223 (predict-yes)
  821. I see 1 and I'm going to do: predict-yes
  822. ENV: Agent did: predict-yes for direction R in state State-B
  823. In State-B moving R
  824. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  825. predict error 1
  826. dir: dir isL
  827. \-/113: O: O225 (predict-yes)
  828. I see 0 and I'm going to do: predict-yes
  829. ENV: Agent did: predict-yes for direction L in state State-B
  830. In State-B moving L
  831. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  832. predict error 0
  833. dir: dir isL
  834. |\-114: O: O227 (predict-yes)
  835. I see 1 and I'm going to do: predict-yes
  836. ENV: Agent did: predict-yes for direction L in state State-A
  837. In State-A moving L
  838. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  839. predict error 1
  840. dir: dir isL
  841. /|\-115: O: O229 (predict-yes)
  842. I see 0 and I'm going to do: predict-yes
  843. ENV: Agent did: predict-yes for direction L in state State-A
  844. In State-A moving L
  845. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  846. predict error 1
  847. dir: dir isR
  848. /|\-sleeping...
  849. /116: O: O231 (predict-yes)
  850. I see 0 and I'm going to do: predict-yes
  851. ENV: Agent did: predict-yes for direction R in state State-A
  852. In State-A moving R
  853. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  854. predict error 0
  855. dir: dir isU
  856. |\117: O: O234 (predict-no)
  857. I see 1 and I'm going to do: predict-no
  858. ENV: Agent did: predict-no for direction U in state State-B
  859. In State-B moving U
  860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  861. predict error 0
  862. dir: dir isU
  863. -/118: O: O236 (predict-no)
  864. I see 1 and I'm going to do: predict-no
  865. ENV: Agent did: predict-no for direction U in state State-B
  866. In State-B moving U
  867. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  868. predict error 0
  869. dir: dir isU
  870. |\-119: O: O238 (predict-no)
  871. I see 1 and I'm going to do: predict-no
  872. ENV: Agent did: predict-no for direction U in state State-B
  873. In State-B moving U
  874. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  875. predict error 0
  876. dir: dir isU
  877. /|\120: O: O239 (predict-yes)
  878. I see 1 and I'm going to do: predict-yes
  879. ENV: Agent did: predict-yes for direction U in state State-B
  880. In State-B moving U
  881. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  882. predict error 1
  883. dir: dir isL
  884. -/|121: O: O241 (predict-yes)
  885. I see 0 and I'm going to do: predict-yes
  886. ENV: Agent did: predict-yes for direction L in state State-B
  887. In State-B moving L
  888. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  889. predict error 0
  890. dir: dir isU
  891. rule alias: '*'
  892. rule alias: '*'
  893. \122: O: O244 (predict-no)
  894. I see 1 and I'm going to do: predict-no
  895. ENV: Agent did: predict-no for direction U in state State-A
  896. In State-A moving U
  897. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  898. predict error 0
  899. dir: dir isU
  900. -/|123: O: O246 (predict-no)
  901. I see 1 and I'm going to do: predict-no
  902. ENV: Agent did: predict-no for direction U in state State-A
  903. In State-A moving U
  904. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  905. predict error 0
  906. dir: dir isL
  907. \-124: O: O248 (predict-no)
  908. I see 1 and I'm going to do: predict-no
  909. ENV: Agent did: predict-no for direction L in state State-A
  910. In State-A moving L
  911. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  912. predict error 0
  913. dir: dir isL
  914. /|\125: O: O250 (predict-no)
  915. I see 1 and I'm going to do: predict-no
  916. ENV: Agent did: predict-no for direction L in state State-A
  917. In State-A moving L
  918. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  919. predict error 0
  920. dir: dir isL
  921. -/126: O: O252 (predict-no)
  922. I see 1 and I'm going to do: predict-no
  923. ENV: Agent did: predict-no for direction L in state State-A
  924. In State-A moving L
  925. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  926. predict error 0
  927. dir: dir isU
  928. |\-127: O: O254 (predict-no)
  929. I see 1 and I'm going to do: predict-no
  930. ENV: Agent did: predict-no for direction U in state State-A
  931. In State-A moving U
  932. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  933. predict error 0
  934. dir: dir isL
  935. /|\128: O: O256 (predict-no)
  936. I see 1 and I'm going to do: predict-no
  937. ENV: Agent did: predict-no for direction L in state State-A
  938. In State-A moving L
  939. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  940. predict error 0
  941. dir: dir isL
  942. -/129: O: O258 (predict-no)
  943. I see 1 and I'm going to do: predict-no
  944. ENV: Agent did: predict-no for direction L in state State-A
  945. In State-A moving L
  946. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  947. predict error 0
  948. dir: dir isR
  949. |\-130: O: O259 (predict-yes)
  950. I see 1 and I'm going to do: predict-yes
  951. ENV: Agent did: predict-yes for direction R in state State-A
  952. In State-A moving R
  953. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  954. predict error 0
  955. dir: dir isR
  956. /|131: O: O262 (predict-no)
  957. I see 1 and I'm going to do: predict-no
  958. ENV: Agent did: predict-no for direction R in state State-B
  959. In State-B moving R
  960. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  961. predict error 0
  962. dir: dir isL
  963. \132: O: O263 (predict-yes)
  964. I see 1 and I'm going to do: predict-yes
  965. ENV: Agent did: predict-yes for direction L in state State-B
  966. In State-B moving L
  967. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  968. predict error 0
  969. dir: dir isL
  970. -/|133: O: O266 (predict-no)
  971. I see 1 and I'm going to do: predict-no
  972. ENV: Agent did: predict-no for direction L in state State-A
  973. In State-A moving L
  974. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  975. predict error 0
  976. dir: dir isR
  977. \-/134: O: O267 (predict-yes)
  978. I see 1 and I'm going to do: predict-yes
  979. ENV: Agent did: predict-yes for direction R in state State-A
  980. In State-A moving R
  981. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  982. predict error 0
  983. dir: dir isL
  984. |\-135: O: O270 (predict-no)
  985. I see 1 and I'm going to do: predict-no
  986. ENV: Agent did: predict-no for direction L in state State-B
  987. In State-B moving L
  988. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  989. predict error 1
  990. dir: dir isL
  991. /|\136: O: O272 (predict-no)
  992. I see 0 and I'm going to do: predict-no
  993. ENV: Agent did: predict-no for direction L in state State-A
  994. In State-A moving L
  995. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  996. predict error 0
  997. dir: dir isU
  998. -/|137: O: O274 (predict-no)
  999. I see 1 and I'm going to do: predict-no
  1000. ENV: Agent did: predict-no for direction U in state State-A
  1001. In State-A moving U
  1002. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1003. predict error 0
  1004. dir: dir isR
  1005. \-/138: O: O276 (predict-no)
  1006. I see 1 and I'm going to do: predict-no
  1007. ENV: Agent did: predict-no for direction R in state State-A
  1008. In State-A moving R
  1009. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1010. predict error 1
  1011. dir: dir isL
  1012. |\-139: O: O277 (predict-yes)
  1013. I see 0 and I'm going to do: predict-yes
  1014. ENV: Agent did: predict-yes for direction L in state State-B
  1015. In State-B moving L
  1016. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1017. predict error 0
  1018. dir: dir isR
  1019. /|140: O: O279 (predict-yes)
  1020. I see 1 and I'm going to do: predict-yes
  1021. ENV: Agent did: predict-yes for direction R in state State-A
  1022. In State-A moving R
  1023. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1024. predict error 0
  1025. dir: dir isL
  1026. \-141: O: O282 (predict-no)
  1027. I see 1 and I'm going to do: predict-no
  1028. ENV: Agent did: predict-no for direction L in state State-B
  1029. In State-B moving L
  1030. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1031. predict error 1
  1032. dir: dir isR
  1033. /142: O: O283 (predict-yes)
  1034. I see 0 and I'm going to do: predict-yes
  1035. ENV: Agent did: predict-yes for direction R in state State-A
  1036. In State-A moving R
  1037. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1038. predict error 0
  1039. dir: dir isR
  1040. |\143: O: O286 (predict-no)
  1041. I see 1 and I'm going to do: predict-no
  1042. ENV: Agent did: predict-no for direction R in state State-B
  1043. In State-B moving R
  1044. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1045. predict error 0
  1046. dir: dir isL
  1047. -/|144: O: O287 (predict-yes)
  1048. I see 1 and I'm going to do: predict-yes
  1049. ENV: Agent did: predict-yes for direction L in state State-B
  1050. In State-B moving L
  1051. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1052. predict error 0
  1053. dir: dir isL
  1054. \-/145: O: O290 (predict-no)
  1055. I see 1 and I'm going to do: predict-no
  1056. ENV: Agent did: predict-no for direction L in state State-A
  1057. In State-A moving L
  1058. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1059. predict error 0
  1060. dir: dir isU
  1061. |\-146: O: O292 (predict-no)
  1062. I see 1 and I'm going to do: predict-no
  1063. ENV: Agent did: predict-no for direction U in state State-A
  1064. In State-A moving U
  1065. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1066. predict error 0
  1067. dir: dir isR
  1068. /|\147: O: O293 (predict-yes)
  1069. I see 1 and I'm going to do: predict-yes
  1070. ENV: Agent did: predict-yes for direction R in state State-A
  1071. In State-A moving R
  1072. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1073. predict error 0
  1074. dir: dir isL
  1075. -/|148: O: O295 (predict-yes)
  1076. I see 1 and I'm going to do: predict-yes
  1077. ENV: Agent did: predict-yes for direction L in state State-B
  1078. In State-B moving L
  1079. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1080. predict error 0
  1081. dir: dir isR
  1082. \-/149: O: O297 (predict-yes)
  1083. I see 1 and I'm going to do: predict-yes
  1084. ENV: Agent did: predict-yes for direction R in state State-A
  1085. In State-A moving R
  1086. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1087. predict error 0
  1088. dir: dir isU
  1089. |\150: O: O300 (predict-no)
  1090. I see 1 and I'm going to do: predict-no
  1091. ENV: Agent did: predict-no for direction U in state State-B
  1092. In State-B moving U
  1093. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1094. predict error 0
  1095. dir: dir isL
  1096. -/151: O: O301 (predict-yes)
  1097. I see 1 and I'm going to do: predict-yes
  1098. ENV: Agent did: predict-yes for direction L in state State-B
  1099. In State-B moving L
  1100. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1101. predict error 0
  1102. dir: dir isL
  1103. |152: O: O304 (predict-no)
  1104. I see 1 and I'm going to do: predict-no
  1105. ENV: Agent did: predict-no for direction L in state State-A
  1106. In State-A moving L
  1107. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1108. predict error 0
  1109. dir: dir isL
  1110. \-/153: O: O305 (predict-yes)
  1111. I see 1 and I'm going to do: predict-yes
  1112. ENV: Agent did: predict-yes for direction L in state State-A
  1113. In State-A moving L
  1114. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1115. predict error 1
  1116. dir: dir isU
  1117. |\154: O: O308 (predict-no)
  1118. I see 0 and I'm going to do: predict-no
  1119. ENV: Agent did: predict-no for direction U in state State-A
  1120. In State-A moving U
  1121. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1122. predict error 0
  1123. dir: dir isL
  1124. -/155: O: O310 (predict-no)
  1125. I see 1 and I'm going to do: predict-no
  1126. ENV: Agent did: predict-no for direction L in state State-A
  1127. In State-A moving L
  1128. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1129. predict error 0
  1130. dir: dir isU
  1131. |\156: O: O312 (predict-no)
  1132. I see 1 and I'm going to do: predict-no
  1133. ENV: Agent did: predict-no for direction U in state State-A
  1134. In State-A moving U
  1135. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1136. predict error 0
  1137. dir: dir isU
  1138. -/|157: O: O314 (predict-no)
  1139. I see 1 and I'm going to do: predict-no
  1140. ENV: Agent did: predict-no for direction U in state State-A
  1141. In State-A moving U
  1142. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1143. predict error 0
  1144. dir: dir isR
  1145. \-/158: O: O315 (predict-yes)
  1146. I see 1 and I'm going to do: predict-yes
  1147. ENV: Agent did: predict-yes for direction R in state State-A
  1148. In State-A moving R
  1149. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1150. predict error 0
  1151. dir: dir isL
  1152. |\-/159: O: O317 (predict-yes)
  1153. I see 1 and I'm going to do: predict-yes
  1154. ENV: Agent did: predict-yes for direction L in state State-B
  1155. In State-B moving L
  1156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1157. predict error 0
  1158. dir: dir isU
  1159. |\160: O: O320 (predict-no)
  1160. I see 1 and I'm going to do: predict-no
  1161. ENV: Agent did: predict-no for direction U in state State-A
  1162. In State-A moving U
  1163. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1164. predict error 0
  1165. dir: dir isU
  1166. -/161: O: O322 (predict-no)
  1167. I see 1 and I'm going to do: predict-no
  1168. ENV: Agent did: predict-no for direction U in state State-A
  1169. In State-A moving U
  1170. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1171. predict error 0
  1172. dir: dir isR
  1173. |162: O: O323 (predict-yes)
  1174. I see 1 and I'm going to do: predict-yes
  1175. ENV: Agent did: predict-yes for direction R in state State-A
  1176. In State-A moving R
  1177. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1178. predict error 0
  1179. dir: dir isL
  1180. \-/163: O: O325 (predict-yes)
  1181. I see 1 and I'm going to do: predict-yes
  1182. ENV: Agent did: predict-yes for direction L in state State-B
  1183. In State-B moving L
  1184. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1185. predict error 0
  1186. dir: dir isR
  1187. |\164: O: O327 (predict-yes)
  1188. I see 1 and I'm going to do: predict-yes
  1189. ENV: Agent did: predict-yes for direction R in state State-A
  1190. In State-A moving R
  1191. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1192. predict error 0
  1193. dir: dir isR
  1194. -/|165: O: O330 (predict-no)
  1195. I see 1 and I'm going to do: predict-no
  1196. ENV: Agent did: predict-no for direction R in state State-B
  1197. In State-B moving R
  1198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1199. predict error 0
  1200. dir: dir isR
  1201. \-/166: O: O331 (predict-yes)
  1202. I see 1 and I'm going to do: predict-yes
  1203. ENV: Agent did: predict-yes for direction R in state State-B
  1204. In State-B moving R
  1205. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1206. predict error 1
  1207. dir: dir isL
  1208. |\-167: O: O333 (predict-yes)
  1209. I see 0 and I'm going to do: predict-yes
  1210. ENV: Agent did: predict-yes for direction L in state State-B
  1211. In State-B moving L
  1212. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1213. predict error 0
  1214. dir: dir isR
  1215. /|168: O: O335 (predict-yes)
  1216. I see 1 and I'm going to do: predict-yes
  1217. ENV: Agent did: predict-yes for direction R in state State-A
  1218. In State-A moving R
  1219. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1220. predict error 0
  1221. dir: dir isL
  1222. \-/|169: O: O337 (predict-yes)
  1223. I see 1 and I'm going to do: predict-yes
  1224. ENV: Agent did: predict-yes for direction L in state State-B
  1225. In State-B moving L
  1226. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1227. predict error 0
  1228. dir: dir isL
  1229. \-170: O: O340 (predict-no)
  1230. I see 1 and I'm going to do: predict-no
  1231. ENV: Agent did: predict-no for direction L in state State-A
  1232. In State-A moving L
  1233. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1234. predict error 0
  1235. dir: dir isU
  1236. /|171: O: O342 (predict-no)
  1237. I see 1 and I'm going to do: predict-no
  1238. ENV: Agent did: predict-no for direction U in state State-A
  1239. In State-A moving U
  1240. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1241. predict error 0
  1242. dir: dir isU
  1243. \172: O: O343 (predict-yes)
  1244. I see 1 and I'm going to do: predict-yes
  1245. ENV: Agent did: predict-yes for direction U in state State-A
  1246. In State-A moving U
  1247. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1248. predict error 1
  1249. dir: dir isL
  1250. -/|173: O: O346 (predict-no)
  1251. I see 0 and I'm going to do: predict-no
  1252. ENV: Agent did: predict-no for direction L in state State-A
  1253. In State-A moving L
  1254. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1255. predict error 0
  1256. dir: dir isU
  1257. \-/174: O: O347 (predict-yes)
  1258. I see 1 and I'm going to do: predict-yes
  1259. ENV: Agent did: predict-yes for direction U in state State-A
  1260. In State-A moving U
  1261. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1262. predict error 1
  1263. dir: dir isL
  1264. |\-175: O: O350 (predict-no)
  1265. I see 0 and I'm going to do: predict-no
  1266. ENV: Agent did: predict-no for direction L in state State-A
  1267. In State-A moving L
  1268. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1269. predict error 0
  1270. dir: dir isU
  1271. /|176: O: O352 (predict-no)
  1272. I see 1 and I'm going to do: predict-no
  1273. ENV: Agent did: predict-no for direction U in state State-A
  1274. In State-A moving U
  1275. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1276. predict error 0
  1277. dir: dir isU
  1278. \-/177: O: O354 (predict-no)
  1279. I see 1 and I'm going to do: predict-no
  1280. ENV: Agent did: predict-no for direction U in state State-A
  1281. In State-A moving U
  1282. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1283. predict error 0
  1284. dir: dir isR
  1285. |\-178: O: O356 (predict-no)
  1286. I see 1 and I'm going to do: predict-no
  1287. ENV: Agent did: predict-no for direction R in state State-A
  1288. In State-A moving R
  1289. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1290. predict error 1
  1291. dir: dir isL
  1292. /|\179: O: O357 (predict-yes)
  1293. I see 0 and I'm going to do: predict-yes
  1294. ENV: Agent did: predict-yes for direction L in state State-B
  1295. In State-B moving L
  1296. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1297. predict error 0
  1298. dir: dir isL
  1299. -/|180: O: O360 (predict-no)
  1300. I see 1 and I'm going to do: predict-no
  1301. ENV: Agent did: predict-no for direction L in state State-A
  1302. In State-A moving L
  1303. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1304. predict error 0
  1305. dir: dir isU
  1306. \-181: O: O362 (predict-no)
  1307. I see 1 and I'm going to do: predict-no
  1308. ENV: Agent did: predict-no for direction U in state State-A
  1309. In State-A moving U
  1310. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1311. predict error 0
  1312. dir: dir isL
  1313. /182: O: O364 (predict-no)
  1314. I see 1 and I'm going to do: predict-no
  1315. ENV: Agent did: predict-no for direction L in state State-A
  1316. In State-A moving L
  1317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1318. predict error 0
  1319. dir: dir isU
  1320. |\-183: O: O366 (predict-no)
  1321. I see 1 and I'm going to do: predict-no
  1322. ENV: Agent did: predict-no for direction U in state State-A
  1323. In State-A moving U
  1324. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1325. predict error 0
  1326. dir: dir isU
  1327. /|184: O: O367 (predict-yes)
  1328. I see 1 and I'm going to do: predict-yes
  1329. ENV: Agent did: predict-yes for direction U in state State-A
  1330. In State-A moving U
  1331. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1332. predict error 1
  1333. dir: dir isR
  1334. \-185: O: O369 (predict-yes)
  1335. I see 0 and I'm going to do: predict-yes
  1336. ENV: Agent did: predict-yes for direction R in state State-A
  1337. In State-A moving R
  1338. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1339. predict error 0
  1340. dir: dir isL
  1341. /|\186: O: O371 (predict-yes)
  1342. I see 1 and I'm going to do: predict-yes
  1343. ENV: Agent did: predict-yes for direction L in state State-B
  1344. In State-B moving L
  1345. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1346. predict error 0
  1347. dir: dir isU
  1348. -/|187: O: O374 (predict-no)
  1349. I see 1 and I'm going to do: predict-no
  1350. ENV: Agent did: predict-no for direction U in state State-A
  1351. In State-A moving U
  1352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1353. predict error 0
  1354. dir: dir isU
  1355. \-/188: O: O376 (predict-no)
  1356. I see 1 and I'm going to do: predict-no
  1357. ENV: Agent did: predict-no for direction U in state State-A
  1358. In State-A moving U
  1359. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1360. predict error 0
  1361. dir: dir isU
  1362. |\-189: O: O378 (predict-no)
  1363. I see 1 and I'm going to do: predict-no
  1364. ENV: Agent did: predict-no for direction U in state State-A
  1365. In State-A moving U
  1366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1367. predict error 0
  1368. dir: dir isR
  1369. /|\190: O: O379 (predict-yes)
  1370. I see 1 and I'm going to do: predict-yes
  1371. ENV: Agent did: predict-yes for direction R in state State-A
  1372. In State-A moving R
  1373. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1374. predict error 0
  1375. dir: dir isR
  1376. -/|191: O: O381 (predict-yes)
  1377. I see 1 and I'm going to do: predict-yes
  1378. ENV: Agent did: predict-yes for direction R in state State-B
  1379. In State-B moving R
  1380. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1381. predict error 1
  1382. dir: dir isR
  1383. \192: O: O384 (predict-no)
  1384. I see 0 and I'm going to do: predict-no
  1385. ENV: Agent did: predict-no for direction R in state State-B
  1386. In State-B moving R
  1387. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1388. predict error 0
  1389. dir: dir isL
  1390. -/|193: O: O385 (predict-yes)
  1391. I see 1 and I'm going to do: predict-yes
  1392. ENV: Agent did: predict-yes for direction L in state State-B
  1393. In State-B moving L
  1394. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1395. predict error 0
  1396. dir: dir isU
  1397. \194: O: O388 (predict-no)
  1398. I see 1 and I'm going to do: predict-no
  1399. ENV: Agent did: predict-no for direction U in state State-A
  1400. In State-A moving U
  1401. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1402. predict error 0
  1403. dir: dir isR
  1404. -/|195: O: O389 (predict-yes)
  1405. I see 1 and I'm going to do: predict-yes
  1406. ENV: Agent did: predict-yes for direction R in state State-A
  1407. In State-A moving R
  1408. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1409. predict error 0
  1410. dir: dir isL
  1411. \-/196: O: O391 (predict-yes)
  1412. I see 1 and I'm going to do: predict-yes
  1413. ENV: Agent did: predict-yes for direction L in state State-B
  1414. In State-B moving L
  1415. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1416. predict error 0
  1417. dir: dir isL
  1418. |\-197: O: O394 (predict-no)
  1419. I see 1 and I'm going to do: predict-no
  1420. ENV: Agent did: predict-no for direction L in state State-A
  1421. In State-A moving L
  1422. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1423. predict error 0
  1424. dir: dir isR
  1425. /|\198: O: O395 (predict-yes)
  1426. I see 1 and I'm going to do: predict-yes
  1427. ENV: Agent did: predict-yes for direction R in state State-A
  1428. In State-A moving R
  1429. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1430. predict error 0
  1431. dir: dir isL
  1432. -/|199: O: O398 (predict-no)
  1433. I see 1 and I'm going to do: predict-no
  1434. ENV: Agent did: predict-no for direction L in state State-B
  1435. In State-B moving L
  1436. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1437. predict error 1
  1438. dir: dir isR
  1439. \-200: O: O399 (predict-yes)
  1440. I see 0 and I'm going to do: predict-yes
  1441. ENV: Agent did: predict-yes for direction R in state State-A
  1442. In State-A moving R
  1443. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1444. predict error 0
  1445. dir: dir isL
  1446. /|\-/|201: O: O401 (predict-yes)
  1447. I see 1 and I'm going to do: predict-yes
  1448. ENV: Agent did: predict-yes for direction L in state State-B
  1449. In State-B moving L
  1450. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1451. predict error 0
  1452. dir: dir isU
  1453. \202: O: O404 (predict-no)
  1454. I see 1 and I'm going to do: predict-no
  1455. ENV: Agent did: predict-no for direction U in state State-A
  1456. In State-A moving U
  1457. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1458. predict error 0
  1459. dir: dir isU
  1460. -/203: O: O406 (predict-no)
  1461. I see 1 and I'm going to do: predict-no
  1462. ENV: Agent did: predict-no for direction U in state State-A
  1463. In State-A moving U
  1464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1465. predict error 0
  1466. dir: dir isL
  1467. |\-204: O: O408 (predict-no)
  1468. I see 1 and I'm going to do: predict-no
  1469. ENV: Agent did: predict-no for direction L in state State-A
  1470. In State-A moving L
  1471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1472. predict error 0
  1473. dir: dir isL
  1474. /|\205: O: O410 (predict-no)
  1475. I see 1 and I'm going to do: predict-no
  1476. ENV: Agent did: predict-no for direction L in state State-A
  1477. In State-A moving L
  1478. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1479. predict error 0
  1480. dir: dir isL
  1481. -/206: O: O412 (predict-no)
  1482. I see 1 and I'm going to do: predict-no
  1483. ENV: Agent did: predict-no for direction L in state State-A
  1484. In State-A moving L
  1485. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1486. predict error 0
  1487. dir: dir isU
  1488. |\-/207: O: O413 (predict-yes)
  1489. I see 1 and I'm going to do: predict-yes
  1490. ENV: Agent did: predict-yes for direction U in state State-A
  1491. In State-A moving U
  1492. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1493. predict error 1
  1494. dir: dir isU
  1495. |\208: O: O416 (predict-no)
  1496. I see 0 and I'm going to do: predict-no
  1497. ENV: Agent did: predict-no for direction U in state State-A
  1498. In State-A moving U
  1499. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1500. predict error 0
  1501. dir: dir isR
  1502. -/|209: O: O417 (predict-yes)
  1503. I see 1 and I'm going to do: predict-yes
  1504. ENV: Agent did: predict-yes for direction R in state State-A
  1505. In State-A moving R
  1506. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1507. predict error 0
  1508. dir: dir isL
  1509. \-210: O: O419 (predict-yes)
  1510. I see 1 and I'm going to do: predict-yes
  1511. ENV: Agent did: predict-yes for direction L in state State-B
  1512. In State-B moving L
  1513. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1514. predict error 0
  1515. dir: dir isU
  1516. /|211: O: O422 (predict-no)
  1517. I see 1 and I'm going to do: predict-no
  1518. ENV: Agent did: predict-no for direction U in state State-A
  1519. In State-A moving U
  1520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1521. predict error 0
  1522. dir: dir isU
  1523. \212: O: O424 (predict-no)
  1524. I see 1 and I'm going to do: predict-no
  1525. ENV: Agent did: predict-no for direction U in state State-A
  1526. In State-A moving U
  1527. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1528. predict error 0
  1529. dir: dir isU
  1530. -/213: O: O426 (predict-no)
  1531. I see 1 and I'm going to do: predict-no
  1532. ENV: Agent did: predict-no for direction U in state State-A
  1533. In State-A moving U
  1534. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1535. predict error 0
  1536. dir: dir isR
  1537. |\-214: O: O427 (predict-yes)
  1538. I see 1 and I'm going to do: predict-yes
  1539. ENV: Agent did: predict-yes for direction R in state State-A
  1540. In State-A moving R
  1541. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1542. predict error 0
  1543. dir: dir isU
  1544. /|\-sleeping...
  1545. /215: O: O430 (predict-no)
  1546. I see 1 and I'm going to do: predict-no
  1547. ENV: Agent did: predict-no for direction U in state State-B
  1548. In State-B moving U
  1549. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1550. predict error 0
  1551. dir: dir isU
  1552. |\-/216: O: O432 (predict-no)
  1553. I see 1 and I'm going to do: predict-no
  1554. ENV: Agent did: predict-no for direction U in state State-B
  1555. In State-B moving U
  1556. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1557. predict error 0
  1558. dir: dir isR
  1559. |\-217: O: O434 (predict-no)
  1560. I see 1 and I'm going to do: predict-no
  1561. ENV: Agent did: predict-no for direction R in state State-B
  1562. In State-B moving R
  1563. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1564. predict error 0
  1565. dir: dir isU
  1566. /|\218: O: O436 (predict-no)
  1567. I see 1 and I'm going to do: predict-no
  1568. ENV: Agent did: predict-no for direction U in state State-B
  1569. In State-B moving U
  1570. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1571. predict error 0
  1572. dir: dir isL
  1573. -/219: O: O437 (predict-yes)
  1574. I see 1 and I'm going to do: predict-yes
  1575. ENV: Agent did: predict-yes for direction L in state State-B
  1576. In State-B moving L
  1577. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1578. predict error 0
  1579. dir: dir isU
  1580. |\-220: O: O440 (predict-no)
  1581. I see 1 and I'm going to do: predict-no
  1582. ENV: Agent did: predict-no for direction U in state State-A
  1583. In State-A moving U
  1584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1585. predict error 0
  1586. dir: dir isL
  1587. /|\221: O: O442 (predict-no)
  1588. I see 1 and I'm going to do: predict-no
  1589. ENV: Agent did: predict-no for direction L in state State-A
  1590. In State-A moving L
  1591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1592. predict error 0
  1593. dir: dir isL
  1594. -222: O: O443 (predict-yes)
  1595. I see 1 and I'm going to do: predict-yes
  1596. ENV: Agent did: predict-yes for direction L in state State-A
  1597. In State-A moving L
  1598. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1599. predict error 1
  1600. dir: dir isU
  1601. /|\-sleeping...
  1602. /223: O: O446 (predict-no)
  1603. I see 0 and I'm going to do: predict-no
  1604. ENV: Agent did: predict-no for direction U in state State-A
  1605. In State-A moving U
  1606. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1607. predict error 0
  1608. dir: dir isL
  1609. |\224: O: O448 (predict-no)
  1610. I see 1 and I'm going to do: predict-no
  1611. ENV: Agent did: predict-no for direction L in state State-A
  1612. In State-A moving L
  1613. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1614. predict error 0
  1615. dir: dir isU
  1616. -/|225: O: O449 (predict-yes)
  1617. I see 1 and I'm going to do: predict-yes
  1618. ENV: Agent did: predict-yes for direction U in state State-A
  1619. In State-A moving U
  1620. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1621. predict error 1
  1622. dir: dir isR
  1623. \-/226: O: O452 (predict-no)
  1624. I see 0 and I'm going to do: predict-no
  1625. ENV: Agent did: predict-no for direction R in state State-A
  1626. In State-A moving R
  1627. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1628. predict error 1
  1629. dir: dir isU
  1630. |\227: O: O454 (predict-no)
  1631. I see 0 and I'm going to do: predict-no
  1632. ENV: Agent did: predict-no for direction U in state State-B
  1633. In State-B moving U
  1634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1635. predict error 0
  1636. dir: dir isR
  1637. -/|228: O: O456 (predict-no)
  1638. I see 1 and I'm going to do: predict-no
  1639. ENV: Agent did: predict-no for direction R in state State-B
  1640. In State-B moving R
  1641. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1642. predict error 0
  1643. dir: dir isR
  1644. \-229: O: O458 (predict-no)
  1645. I see 1 and I'm going to do: predict-no
  1646. ENV: Agent did: predict-no for direction R in state State-B
  1647. In State-B moving R
  1648. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1649. predict error 0
  1650. dir: dir isL
  1651. /|\230: O: O459 (predict-yes)
  1652. I see 1 and I'm going to do: predict-yes
  1653. ENV: Agent did: predict-yes for direction L in state State-B
  1654. In State-B moving L
  1655. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1656. predict error 0
  1657. dir: dir isU
  1658. -/|231: O: O462 (predict-no)
  1659. I see 1 and I'm going to do: predict-no
  1660. ENV: Agent did: predict-no for direction U in state State-A
  1661. In State-A moving U
  1662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1663. predict error 0
  1664. dir: dir isR
  1665. \232: O: O463 (predict-yes)
  1666. I see 1 and I'm going to do: predict-yes
  1667. ENV: Agent did: predict-yes for direction R in state State-A
  1668. In State-A moving R
  1669. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1670. predict error 0
  1671. dir: dir isU
  1672. -/233: O: O465 (predict-yes)
  1673. I see 1 and I'm going to do: predict-yes
  1674. ENV: Agent did: predict-yes for direction U in state State-B
  1675. In State-B moving U
  1676. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1677. predict error 1
  1678. dir: dir isU
  1679. |\234: O: O468 (predict-no)
  1680. I see 0 and I'm going to do: predict-no
  1681. ENV: Agent did: predict-no for direction U in state State-B
  1682. In State-B moving U
  1683. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1684. predict error 0
  1685. dir: dir isL
  1686. -/|235: O: O469 (predict-yes)
  1687. I see 1 and I'm going to do: predict-yes
  1688. ENV: Agent did: predict-yes for direction L in state State-B
  1689. In State-B moving L
  1690. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1691. predict error 0
  1692. dir: dir isR
  1693. \-/236: O: O471 (predict-yes)
  1694. I see 1 and I'm going to do: predict-yes
  1695. ENV: Agent did: predict-yes for direction R in state State-A
  1696. In State-A moving R
  1697. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1698. predict error 0
  1699. dir: dir isL
  1700. |\237: O: O473 (predict-yes)
  1701. I see 1 and I'm going to do: predict-yes
  1702. ENV: Agent did: predict-yes for direction L in state State-B
  1703. In State-B moving L
  1704. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1705. predict error 0
  1706. dir: dir isL
  1707. -/|238: O: O476 (predict-no)
  1708. I see 1 and I'm going to do: predict-no
  1709. ENV: Agent did: predict-no for direction L in state State-A
  1710. In State-A moving L
  1711. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1712. predict error 0
  1713. dir: dir isL
  1714. \-239: O: O478 (predict-no)
  1715. I see 1 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction L in state State-A
  1717. In State-A moving L
  1718. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1719. predict error 0
  1720. dir: dir isU
  1721. /|\240: O: O479 (predict-yes)
  1722. I see 1 and I'm going to do: predict-yes
  1723. ENV: Agent did: predict-yes for direction U in state State-A
  1724. In State-A moving U
  1725. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1726. predict error 1
  1727. dir: dir isU
  1728. -/|241: O: O482 (predict-no)
  1729. I see 0 and I'm going to do: predict-no
  1730. ENV: Agent did: predict-no for direction U in state State-A
  1731. In State-A moving U
  1732. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1733. predict error 0
  1734. dir: dir isU
  1735. \242: O: O484 (predict-no)
  1736. I see 1 and I'm going to do: predict-no
  1737. ENV: Agent did: predict-no for direction U in state State-A
  1738. In State-A moving U
  1739. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1740. predict error 0
  1741. dir: dir isR
  1742. -/243: O: O485 (predict-yes)
  1743. I see 1 and I'm going to do: predict-yes
  1744. ENV: Agent did: predict-yes for direction R in state State-A
  1745. In State-A moving R
  1746. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1747. predict error 0
  1748. dir: dir isR
  1749. |\-244: O: O488 (predict-no)
  1750. I see 1 and I'm going to do: predict-no
  1751. ENV: Agent did: predict-no for direction R in state State-B
  1752. In State-B moving R
  1753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1754. predict error 0
  1755. dir: dir isU
  1756. /|\245: O: O490 (predict-no)
  1757. I see 1 and I'm going to do: predict-no
  1758. ENV: Agent did: predict-no for direction U in state State-B
  1759. In State-B moving U
  1760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1761. predict error 0
  1762. dir: dir isR
  1763. -/246: O: O492 (predict-no)
  1764. I see 1 and I'm going to do: predict-no
  1765. ENV: Agent did: predict-no for direction R in state State-B
  1766. In State-B moving R
  1767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1768. predict error 0
  1769. dir: dir isR
  1770. |\-247: O: O493 (predict-yes)
  1771. I see 1 and I'm going to do: predict-yes
  1772. ENV: Agent did: predict-yes for direction R in state State-B
  1773. In State-B moving R
  1774. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1775. predict error 1
  1776. dir: dir isL
  1777. /|\248: O: O495 (predict-yes)
  1778. I see 0 and I'm going to do: predict-yes
  1779. ENV: Agent did: predict-yes for direction L in state State-B
  1780. In State-B moving L
  1781. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1782. predict error 0
  1783. dir: dir isL
  1784. -/|249: O: O498 (predict-no)
  1785. I see 1 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction L in state State-A
  1787. In State-A moving L
  1788. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1789. predict error 0
  1790. dir: dir isL
  1791. \-/250: O: O500 (predict-no)
  1792. I see 1 and I'm going to do: predict-no
  1793. ENV: Agent did: predict-no for direction L in state State-A
  1794. In State-A moving L
  1795. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1796. predict error 0
  1797. dir: dir isU
  1798. |\-/251: O: O502 (predict-no)
  1799. I see 1 and I'm going to do: predict-no
  1800. ENV: Agent did: predict-no for direction U in state State-A
  1801. In State-A moving U
  1802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1803. predict error 0
  1804. dir: dir isR
  1805. |252: O: O503 (predict-yes)
  1806. I see 1 and I'm going to do: predict-yes
  1807. ENV: Agent did: predict-yes for direction R in state State-A
  1808. In State-A moving R
  1809. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1810. predict error 0
  1811. dir: dir isU
  1812. \-253: O: O506 (predict-no)
  1813. I see 1 and I'm going to do: predict-no
  1814. ENV: Agent did: predict-no for direction U in state State-B
  1815. In State-B moving U
  1816. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1817. predict error 0
  1818. dir: dir isR
  1819. /|\-sleeping...
  1820. /254: O: O508 (predict-no)
  1821. I see 1 and I'm going to do: predict-no
  1822. ENV: Agent did: predict-no for direction R in state State-B
  1823. In State-B moving R
  1824. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1825. predict error 0
  1826. dir: dir isL
  1827. |\-255: O: O509 (predict-yes)
  1828. I see 1 and I'm going to do: predict-yes
  1829. ENV: Agent did: predict-yes for direction L in state State-B
  1830. In State-B moving L
  1831. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1832. predict error 0
  1833. dir: dir isU
  1834. /|\256: O: O512 (predict-no)
  1835. I see 1 and I'm going to do: predict-no
  1836. ENV: Agent did: predict-no for direction U in state State-A
  1837. In State-A moving U
  1838. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1839. predict error 0
  1840. dir: dir isU
  1841. -/|257: O: O514 (predict-no)
  1842. I see 1 and I'm going to do: predict-no
  1843. ENV: Agent did: predict-no for direction U in state State-A
  1844. In State-A moving U
  1845. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1846. predict error 0
  1847. dir: dir isL
  1848. \-/258: O: O516 (predict-no)
  1849. I see 1 and I'm going to do: predict-no
  1850. ENV: Agent did: predict-no for direction L in state State-A
  1851. In State-A moving L
  1852. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1853. predict error 0
  1854. dir: dir isU
  1855. |\-259: O: O518 (predict-no)
  1856. I see 1 and I'm going to do: predict-no
  1857. ENV: Agent did: predict-no for direction U in state State-A
  1858. In State-A moving U
  1859. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1860. predict error 0
  1861. dir: dir isL
  1862. /|\260: O: O519 (predict-yes)
  1863. I see 1 and I'm going to do: predict-yes
  1864. ENV: Agent did: predict-yes for direction L in state State-A
  1865. In State-A moving L
  1866. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1867. predict error 1
  1868. dir: dir isL
  1869. -/|\261: O: O522 (predict-no)
  1870. I see 0 and I'm going to do: predict-no
  1871. ENV: Agent did: predict-no for direction L in state State-A
  1872. In State-A moving L
  1873. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1874. predict error 0
  1875. dir: dir isU
  1876. -262: O: O524 (predict-no)
  1877. I see 1 and I'm going to do: predict-no
  1878. ENV: Agent did: predict-no for direction U in state State-A
  1879. In State-A moving U
  1880. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1881. predict error 0
  1882. dir: dir isL
  1883. /|\263: O: O526 (predict-no)
  1884. I see 1 and I'm going to do: predict-no
  1885. ENV: Agent did: predict-no for direction L in state State-A
  1886. In State-A moving L
  1887. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1888. predict error 0
  1889. dir: dir isL
  1890. -/|264: O: O528 (predict-no)
  1891. I see 1 and I'm going to do: predict-no
  1892. ENV: Agent did: predict-no for direction L in state State-A
  1893. In State-A moving L
  1894. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1895. predict error 0
  1896. dir: dir isU
  1897. \-/265: O: O530 (predict-no)
  1898. I see 1 and I'm going to do: predict-no
  1899. ENV: Agent did: predict-no for direction U in state State-A
  1900. In State-A moving U
  1901. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1902. predict error 0
  1903. dir: dir isR
  1904. |\266: O: O531 (predict-yes)
  1905. I see 1 and I'm going to do: predict-yes
  1906. ENV: Agent did: predict-yes for direction R in state State-A
  1907. In State-A moving R
  1908. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1909. predict error 0
  1910. dir: dir isL
  1911. -/267: O: O533 (predict-yes)
  1912. I see 1 and I'm going to do: predict-yes
  1913. ENV: Agent did: predict-yes for direction L in state State-B
  1914. In State-B moving L
  1915. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1916. predict error 0
  1917. dir: dir isL
  1918. |\-268: O: O536 (predict-no)
  1919. I see 1 and I'm going to do: predict-no
  1920. ENV: Agent did: predict-no for direction L in state State-A
  1921. In State-A moving L
  1922. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1923. predict error 0
  1924. dir: dir isL
  1925. /|\269: O: O538 (predict-no)
  1926. I see 1 and I'm going to do: predict-no
  1927. ENV: Agent did: predict-no for direction L in state State-A
  1928. In State-A moving L
  1929. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1930. predict error 0
  1931. dir: dir isU
  1932. -/|270: O: O540 (predict-no)
  1933. I see 1 and I'm going to do: predict-no
  1934. ENV: Agent did: predict-no for direction U in state State-A
  1935. In State-A moving U
  1936. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1937. predict error 0
  1938. dir: dir isL
  1939. \-271: O: O542 (predict-no)
  1940. I see 1 and I'm going to do: predict-no
  1941. ENV: Agent did: predict-no for direction L in state State-A
  1942. In State-A moving L
  1943. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1944. predict error 0
  1945. dir: dir isU
  1946. /272: O: O544 (predict-no)
  1947. I see 1 and I'm going to do: predict-no
  1948. ENV: Agent did: predict-no for direction U in state State-A
  1949. In State-A moving U
  1950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1951. predict error 0
  1952. dir: dir isR
  1953. |\273: O: O545 (predict-yes)
  1954. I see 1 and I'm going to do: predict-yes
  1955. ENV: Agent did: predict-yes for direction R in state State-A
  1956. In State-A moving R
  1957. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1958. predict error 0
  1959. dir: dir isU
  1960. -/|274: O: O548 (predict-no)
  1961. I see 1 and I'm going to do: predict-no
  1962. ENV: Agent did: predict-no for direction U in state State-B
  1963. In State-B moving U
  1964. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1965. predict error 0
  1966. dir: dir isU
  1967. \-275: O: O550 (predict-no)
  1968. I see 1 and I'm going to do: predict-no
  1969. ENV: Agent did: predict-no for direction U in state State-B
  1970. In State-B moving U
  1971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1972. predict error 0
  1973. dir: dir isL
  1974. /|276: O: O551 (predict-yes)
  1975. I see 1 and I'm going to do: predict-yes
  1976. ENV: Agent did: predict-yes for direction L in state State-B
  1977. In State-B moving L
  1978. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1979. predict error 0
  1980. dir: dir isL
  1981. \-277: O: O554 (predict-no)
  1982. I see 1 and I'm going to do: predict-no
  1983. ENV: Agent did: predict-no for direction L in state State-A
  1984. In State-A moving L
  1985. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1986. predict error 0
  1987. dir: dir isR
  1988. /|\278: O: O555 (predict-yes)
  1989. I see 1 and I'm going to do: predict-yes
  1990. ENV: Agent did: predict-yes for direction R in state State-A
  1991. In State-A moving R
  1992. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1993. predict error 0
  1994. dir: dir isL
  1995. -/279: O: O557 (predict-yes)
  1996. I see 1 and I'm going to do: predict-yes
  1997. ENV: Agent did: predict-yes for direction L in state State-B
  1998. In State-B moving L
  1999. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2000. predict error 0
  2001. dir: dir isR
  2002. |\-280: O: O559 (predict-yes)
  2003. I see 1 and I'm going to do: predict-yes
  2004. ENV: Agent did: predict-yes for direction R in state State-A
  2005. In State-A moving R
  2006. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2007. predict error 0
  2008. dir: dir isL
  2009. /|281: O: O561 (predict-yes)
  2010. I see 1 and I'm going to do: predict-yes
  2011. ENV: Agent did: predict-yes for direction L in state State-B
  2012. In State-B moving L
  2013. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2014. predict error 0
  2015. dir: dir isL
  2016. \282: O: O564 (predict-no)
  2017. I see 1 and I'm going to do: predict-no
  2018. ENV: Agent did: predict-no for direction L in state State-A
  2019. In State-A moving L
  2020. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2021. predict error 0
  2022. dir: dir isU
  2023. -/|283: O: O566 (predict-no)
  2024. I see 1 and I'm going to do: predict-no
  2025. ENV: Agent did: predict-no for direction U in state State-A
  2026. In State-A moving U
  2027. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2028. predict error 0
  2029. dir: dir isL
  2030. \-284: O: O568 (predict-no)
  2031. I see 1 and I'm going to do: predict-no
  2032. ENV: Agent did: predict-no for direction L in state State-A
  2033. In State-A moving L
  2034. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2035. predict error 0
  2036. dir: dir isR
  2037. /|285: O: O569 (predict-yes)
  2038. I see 1 and I'm going to do: predict-yes
  2039. ENV: Agent did: predict-yes for direction R in state State-A
  2040. In State-A moving R
  2041. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2042. predict error 0
  2043. dir: dir isR
  2044. \-/286: O: O571 (predict-yes)
  2045. I see 1 and I'm going to do: predict-yes
  2046. ENV: Agent did: predict-yes for direction R in state State-B
  2047. In State-B moving R
  2048. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2049. predict error 1
  2050. dir: dir isL
  2051. |\-/287: O: O573 (predict-yes)
  2052. I see 0 and I'm going to do: predict-yes
  2053. ENV: Agent did: predict-yes for direction L in state State-B
  2054. In State-B moving L
  2055. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2056. predict error 0
  2057. dir: dir isL
  2058. |\-288: O: O576 (predict-no)
  2059. I see 1 and I'm going to do: predict-no
  2060. ENV: Agent did: predict-no for direction L in state State-A
  2061. In State-A moving L
  2062. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2063. predict error 0
  2064. dir: dir isU
  2065. /|\289: O: O578 (predict-no)
  2066. I see 1 and I'm going to do: predict-no
  2067. ENV: Agent did: predict-no for direction U in state State-A
  2068. In State-A moving U
  2069. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2070. predict error 0
  2071. dir: dir isU
  2072. -/290: O: O580 (predict-no)
  2073. I see 1 and I'm going to do: predict-no
  2074. ENV: Agent did: predict-no for direction U in state State-A
  2075. In State-A moving U
  2076. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2077. predict error 0
  2078. dir: dir isU
  2079. |\291: O: O582 (predict-no)
  2080. I see 1 and I'm going to do: predict-no
  2081. ENV: Agent did: predict-no for direction U in state State-A
  2082. In State-A moving U
  2083. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2084. predict error 0
  2085. dir: dir isL
  2086. -292: O: O584 (predict-no)
  2087. I see 1 and I'm going to do: predict-no
  2088. ENV: Agent did: predict-no for direction L in state State-A
  2089. In State-A moving L
  2090. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2091. predict error 0
  2092. dir: dir isL
  2093. /|\293: O: O586 (predict-no)
  2094. I see 1 and I'm going to do: predict-no
  2095. ENV: Agent did: predict-no for direction L in state State-A
  2096. In State-A moving L
  2097. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2098. predict error 0
  2099. dir: dir isR
  2100. -/|294: O: O587 (predict-yes)
  2101. I see 1 and I'm going to do: predict-yes
  2102. ENV: Agent did: predict-yes for direction R in state State-A
  2103. In State-A moving R
  2104. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2105. predict error 0
  2106. dir: dir isU
  2107. \-295: O: O590 (predict-no)
  2108. I see 1 and I'm going to do: predict-no
  2109. ENV: Agent did: predict-no for direction U in state State-B
  2110. In State-B moving U
  2111. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2112. predict error 0
  2113. dir: dir isR
  2114. /|296: O: O592 (predict-no)
  2115. I see 1 and I'm going to do: predict-no
  2116. ENV: Agent did: predict-no for direction R in state State-B
  2117. In State-B moving R
  2118. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2119. predict error 0
  2120. dir: dir isU
  2121. \-/297: O: O594 (predict-no)
  2122. I see 1 and I'm going to do: predict-no
  2123. ENV: Agent did: predict-no for direction U in state State-B
  2124. In State-B moving U
  2125. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2126. predict error 0
  2127. dir: dir isR
  2128. |\-298: O: O596 (predict-no)
  2129. I see 1 and I'm going to do: predict-no
  2130. ENV: Agent did: predict-no for direction R in state State-B
  2131. In State-B moving R
  2132. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2133. predict error 0
  2134. dir: dir isL
  2135. /|299: O: O597 (predict-yes)
  2136. I see 1 and I'm going to do: predict-yes
  2137. ENV: Agent did: predict-yes for direction L in state State-B
  2138. In State-B moving L
  2139. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2140. predict error 0
  2141. dir: dir isR
  2142. \-/300: O: O599 (predict-yes)
  2143. I see 1 and I'm going to do: predict-yes
  2144. ENV: Agent did: predict-yes for direction R in state State-A
  2145. In State-A moving R
  2146. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2147. predict error 0
  2148. dir: dir isL
  2149. |\-/|\301: O: O601 (predict-yes)
  2150. I see 1 and I'm going to do: predict-yes
  2151. ENV: Agent did: predict-yes for direction L in state State-B
  2152. In State-B moving L
  2153. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2154. predict error 0
  2155. dir: dir isL
  2156. -302: O: O604 (predict-no)
  2157. I see 1 and I'm going to do: predict-no
  2158. ENV: Agent did: predict-no for direction L in state State-A
  2159. In State-A moving L
  2160. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2161. predict error 0
  2162. dir: dir isL
  2163. /|\303: O: O606 (predict-no)
  2164. I see 1 and I'm going to do: predict-no
  2165. ENV: Agent did: predict-no for direction L in state State-A
  2166. In State-A moving L
  2167. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2168. predict error 0
  2169. dir: dir isL
  2170. -/|304: O: O608 (predict-no)
  2171. I see 1 and I'm going to do: predict-no
  2172. ENV: Agent did: predict-no for direction L in state State-A
  2173. In State-A moving L
  2174. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2175. predict error 0
  2176. dir: dir isU
  2177. \-/305: O: O610 (predict-no)
  2178. I see 1 and I'm going to do: predict-no
  2179. ENV: Agent did: predict-no for direction U in state State-A
  2180. In State-A moving U
  2181. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2182. predict error 0
  2183. dir: dir isR
  2184. |\-306: O: O611 (predict-yes)
  2185. I see 1 and I'm going to do: predict-yes
  2186. ENV: Agent did: predict-yes for direction R in state State-A
  2187. In State-A moving R
  2188. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2189. predict error 0
  2190. dir: dir isR
  2191. /307: O: O614 (predict-no)
  2192. I see 1 and I'm going to do: predict-no
  2193. ENV: Agent did: predict-no for direction R in state State-B
  2194. In State-B moving R
  2195. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2196. predict error 0
  2197. dir: dir isR
  2198. |\-308: O: O616 (predict-no)
  2199. I see 1 and I'm going to do: predict-no
  2200. ENV: Agent did: predict-no for direction R in state State-B
  2201. In State-B moving R
  2202. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2203. predict error 0
  2204. dir: dir isU
  2205. /|309: O: O618 (predict-no)
  2206. I see 1 and I'm going to do: predict-no
  2207. ENV: Agent did: predict-no for direction U in state State-B
  2208. In State-B moving U
  2209. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2210. predict error 0
  2211. dir: dir isR
  2212. \-/310: O: O620 (predict-no)
  2213. I see 1 and I'm going to do: predict-no
  2214. ENV: Agent did: predict-no for direction R in state State-B
  2215. In State-B moving R
  2216. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2217. predict error 0
  2218. dir: dir isL
  2219. |\-311: O: O621 (predict-yes)
  2220. I see 1 and I'm going to do: predict-yes
  2221. ENV: Agent did: predict-yes for direction L in state State-B
  2222. In State-B moving L
  2223. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2224. predict error 0
  2225. dir: dir isL
  2226. /312: O: O624 (predict-no)
  2227. I see 1 and I'm going to do: predict-no
  2228. ENV: Agent did: predict-no for direction L in state State-A
  2229. In State-A moving L
  2230. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2231. predict error 0
  2232. dir: dir isL
  2233. |\-313: O: O626 (predict-no)
  2234. I see 1 and I'm going to do: predict-no
  2235. ENV: Agent did: predict-no for direction L in state State-A
  2236. In State-A moving L
  2237. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2238. predict error 0
  2239. dir: dir isU
  2240. /|\314: O: O628 (predict-no)
  2241. I see 1 and I'm going to do: predict-no
  2242. ENV: Agent did: predict-no for direction U in state State-A
  2243. In State-A moving U
  2244. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2245. predict error 0
  2246. dir: dir isU
  2247. -/315: O: O630 (predict-no)
  2248. I see 1 and I'm going to do: predict-no
  2249. ENV: Agent did: predict-no for direction U in state State-A
  2250. In State-A moving U
  2251. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2252. predict error 0
  2253. dir: dir isL
  2254. |\316: O: O632 (predict-no)
  2255. I see 1 and I'm going to do: predict-no
  2256. ENV: Agent did: predict-no for direction L in state State-A
  2257. In State-A moving L
  2258. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2259. predict error 0
  2260. dir: dir isR
  2261. -/|317: O: O633 (predict-yes)
  2262. I see 1 and I'm going to do: predict-yes
  2263. ENV: Agent did: predict-yes for direction R in state State-A
  2264. In State-A moving R
  2265. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2266. predict error 0
  2267. dir: dir isR
  2268. \-318: O: O636 (predict-no)
  2269. I see 1 and I'm going to do: predict-no
  2270. ENV: Agent did: predict-no for direction R in state State-B
  2271. In State-B moving R
  2272. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2273. predict error 0
  2274. dir: dir isR
  2275. /|319: O: O638 (predict-no)
  2276. I see 1 and I'm going to do: predict-no
  2277. ENV: Agent did: predict-no for direction R in state State-B
  2278. In State-B moving R
  2279. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2280. predict error 0
  2281. dir: dir isR
  2282. \-320: O: O640 (predict-no)
  2283. I see 1 and I'm going to do: predict-no
  2284. ENV: Agent did: predict-no for direction R in state State-B
  2285. In State-B moving R
  2286. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2287. predict error 0
  2288. dir: dir isL
  2289. /|\321: O: O641 (predict-yes)
  2290. I see 1 and I'm going to do: predict-yes
  2291. ENV: Agent did: predict-yes for direction L in state State-B
  2292. In State-B moving L
  2293. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2294. predict error 0
  2295. dir: dir isL
  2296. -322: O: O644 (predict-no)
  2297. I see 1 and I'm going to do: predict-no
  2298. ENV: Agent did: predict-no for direction L in state State-A
  2299. In State-A moving L
  2300. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2301. predict error 0
  2302. dir: dir isL
  2303. /|\323: O: O646 (predict-no)
  2304. I see 1 and I'm going to do: predict-no
  2305. ENV: Agent did: predict-no for direction L in state State-A
  2306. In State-A moving L
  2307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2308. predict error 0
  2309. dir: dir isL
  2310. -/324: O: O648 (predict-no)
  2311. I see 1 and I'm going to do: predict-no
  2312. ENV: Agent did: predict-no for direction L in state State-A
  2313. In State-A moving L
  2314. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2315. predict error 0
  2316. dir: dir isR
  2317. |\325: O: O649 (predict-yes)
  2318. I see 1 and I'm going to do: predict-yes
  2319. ENV: Agent did: predict-yes for direction R in state State-A
  2320. In State-A moving R
  2321. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2322. predict error 0
  2323. dir: dir isL
  2324. -/|326: O: O651 (predict-yes)
  2325. I see 1 and I'm going to do: predict-yes
  2326. ENV: Agent did: predict-yes for direction L in state State-B
  2327. In State-B moving L
  2328. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2329. predict error 0
  2330. dir: dir isL
  2331. \-/327: O: O654 (predict-no)
  2332. I see 1 and I'm going to do: predict-no
  2333. ENV: Agent did: predict-no for direction L in state State-A
  2334. In State-A moving L
  2335. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2336. predict error 0
  2337. dir: dir isR
  2338. |\-328: O: O655 (predict-yes)
  2339. I see 1 and I'm going to do: predict-yes
  2340. ENV: Agent did: predict-yes for direction R in state State-A
  2341. In State-A moving R
  2342. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2343. predict error 0
  2344. dir: dir isL
  2345. /|\329: O: O657 (predict-yes)
  2346. I see 1 and I'm going to do: predict-yes
  2347. ENV: Agent did: predict-yes for direction L in state State-B
  2348. In State-B moving L
  2349. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2350. predict error 0
  2351. dir: dir isU
  2352. -/|330: O: O660 (predict-no)
  2353. I see 1 and I'm going to do: predict-no
  2354. ENV: Agent did: predict-no for direction U in state State-A
  2355. In State-A moving U
  2356. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2357. predict error 0
  2358. dir: dir isR
  2359. \331: O: O661 (predict-yes)
  2360. I see 1 and I'm going to do: predict-yes
  2361. ENV: Agent did: predict-yes for direction R in state State-A
  2362. In State-A moving R
  2363. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2364. predict error 0
  2365. dir: dir isU
  2366. -332: O: O664 (predict-no)
  2367. I see 1 and I'm going to do: predict-no
  2368. ENV: Agent did: predict-no for direction U in state State-B
  2369. In State-B moving U
  2370. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2371. predict error 0
  2372. dir: dir isL
  2373. /|\333: O: O665 (predict-yes)
  2374. I see 1 and I'm going to do: predict-yes
  2375. ENV: Agent did: predict-yes for direction L in state State-B
  2376. In State-B moving L
  2377. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2378. predict error 0
  2379. dir: dir isR
  2380. -/|334: O: O667 (predict-yes)
  2381. I see 1 and I'm going to do: predict-yes
  2382. ENV: Agent did: predict-yes for direction R in state State-A
  2383. In State-A moving R
  2384. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2385. predict error 0
  2386. dir: dir isU
  2387. \-/|335: O: O670 (predict-no)
  2388. I see 1 and I'm going to do: predict-no
  2389. ENV: Agent did: predict-no for direction U in state State-B
  2390. In State-B moving U
  2391. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2392. predict error 0
  2393. dir: dir isL
  2394. \-336: O: O671 (predict-yes)
  2395. I see 1 and I'm going to do: predict-yes
  2396. ENV: Agent did: predict-yes for direction L in state State-B
  2397. In State-B moving L
  2398. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2399. predict error 0
  2400. dir: dir isU
  2401. /|\337: O: O674 (predict-no)
  2402. I see 1 and I'm going to do: predict-no
  2403. ENV: Agent did: predict-no for direction U in state State-A
  2404. In State-A moving U
  2405. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2406. predict error 0
  2407. dir: dir isL
  2408. -/|338: O: O676 (predict-no)
  2409. I see 1 and I'm going to do: predict-no
  2410. ENV: Agent did: predict-no for direction L in state State-A
  2411. In State-A moving L
  2412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2413. predict error 0
  2414. dir: dir isU
  2415. \-339: O: O678 (predict-no)
  2416. I see 1 and I'm going to do: predict-no
  2417. ENV: Agent did: predict-no for direction U in state State-A
  2418. In State-A moving U
  2419. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2420. predict error 0
  2421. dir: dir isU
  2422. /|\340: O: O679 (predict-yes)
  2423. I see 1 and I'm going to do: predict-yes
  2424. ENV: Agent did: predict-yes for direction U in state State-A
  2425. In State-A moving U
  2426. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2427. predict error 1
  2428. dir: dir isU
  2429. -/341: O: O682 (predict-no)
  2430. I see 0 and I'm going to do: predict-no
  2431. ENV: Agent did: predict-no for direction U in state State-A
  2432. In State-A moving U
  2433. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2434. predict error 0
  2435. dir: dir isL
  2436. |342: O: O684 (predict-no)
  2437. I see 1 and I'm going to do: predict-no
  2438. ENV: Agent did: predict-no for direction L in state State-A
  2439. In State-A moving L
  2440. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2441. predict error 0
  2442. dir: dir isL
  2443. \-/343: O: O685 (predict-yes)
  2444. I see 1 and I'm going to do: predict-yes
  2445. ENV: Agent did: predict-yes for direction L in state State-A
  2446. In State-A moving L
  2447. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2448. predict error 1
  2449. dir: dir isR
  2450. |\344: O: O687 (predict-yes)
  2451. I see 0 and I'm going to do: predict-yes
  2452. ENV: Agent did: predict-yes for direction R in state State-A
  2453. In State-A moving R
  2454. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2455. predict error 0
  2456. dir: dir isU
  2457. -/|345: O: O690 (predict-no)
  2458. I see 1 and I'm going to do: predict-no
  2459. ENV: Agent did: predict-no for direction U in state State-B
  2460. In State-B moving U
  2461. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2462. predict error 0
  2463. dir: dir isL
  2464. \-/|346: O: O691 (predict-yes)
  2465. I see 1 and I'm going to do: predict-yes
  2466. ENV: Agent did: predict-yes for direction L in state State-B
  2467. In State-B moving L
  2468. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2469. predict error 0
  2470. dir: dir isU
  2471. \-347: O: O694 (predict-no)
  2472. I see 1 and I'm going to do: predict-no
  2473. ENV: Agent did: predict-no for direction U in state State-A
  2474. In State-A moving U
  2475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2476. predict error 0
  2477. dir: dir isL
  2478. /|348: O: O696 (predict-no)
  2479. I see 1 and I'm going to do: predict-no
  2480. ENV: Agent did: predict-no for direction L in state State-A
  2481. In State-A moving L
  2482. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2483. predict error 0
  2484. dir: dir isU
  2485. \-349: O: O698 (predict-no)
  2486. I see 1 and I'm going to do: predict-no
  2487. ENV: Agent did: predict-no for direction U in state State-A
  2488. In State-A moving U
  2489. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2490. predict error 0
  2491. dir: dir isL
  2492. /|\-sleeping...
  2493. /350: O: O700 (predict-no)
  2494. I see 1 and I'm going to do: predict-no
  2495. ENV: Agent did: predict-no for direction L in state State-A
  2496. In State-A moving L
  2497. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2498. predict error 0
  2499. dir: dir isL
  2500. |\-351: O: O702 (predict-no)
  2501. I see 1 and I'm going to do: predict-no
  2502. ENV: Agent did: predict-no for direction L in state State-A
  2503. In State-A moving L
  2504. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2505. predict error 0
  2506. dir: dir isU
  2507. /352: O: O704 (predict-no)
  2508. I see 1 and I'm going to do: predict-no
  2509. ENV: Agent did: predict-no for direction U in state State-A
  2510. In State-A moving U
  2511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2512. predict error 0
  2513. dir: dir isU
  2514. |\-353: O: O706 (predict-no)
  2515. I see 1 and I'm going to do: predict-no
  2516. ENV: Agent did: predict-no for direction U in state State-A
  2517. In State-A moving U
  2518. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2519. predict error 0
  2520. dir: dir isU
  2521. /|354: O: O708 (predict-no)
  2522. I see 1 and I'm going to do: predict-no
  2523. ENV: Agent did: predict-no for direction U in state State-A
  2524. In State-A moving U
  2525. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2526. predict error 0
  2527. dir: dir isU
  2528. \-/355: O: O710 (predict-no)
  2529. I see 1 and I'm going to do: predict-no
  2530. ENV: Agent did: predict-no for direction U in state State-A
  2531. In State-A moving U
  2532. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2533. predict error 0
  2534. dir: dir isU
  2535. |\-356: O: O712 (predict-no)
  2536. I see 1 and I'm going to do: predict-no
  2537. ENV: Agent did: predict-no for direction U in state State-A
  2538. In State-A moving U
  2539. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2540. predict error 0
  2541. dir: dir isU
  2542. /|\357: O: O714 (predict-no)
  2543. I see 1 and I'm going to do: predict-no
  2544. ENV: Agent did: predict-no for direction U in state State-A
  2545. In State-A moving U
  2546. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2547. predict error 0
  2548. dir: dir isL
  2549. -/358: O: O716 (predict-no)
  2550. I see 1 and I'm going to do: predict-no
  2551. ENV: Agent did: predict-no for direction L in state State-A
  2552. In State-A moving L
  2553. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2554. predict error 0
  2555. dir: dir isR
  2556. |\-359: O: O717 (predict-yes)
  2557. I see 1 and I'm going to do: predict-yes
  2558. ENV: Agent did: predict-yes for direction R in state State-A
  2559. In State-A moving R
  2560. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2561. predict error 0
  2562. dir: dir isL
  2563. /|\360: O: O719 (predict-yes)
  2564. I see 1 and I'm going to do: predict-yes
  2565. ENV: Agent did: predict-yes for direction L in state State-B
  2566. In State-B moving L
  2567. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2568. predict error 0
  2569. dir: dir isU
  2570. -/|361: O: O722 (predict-no)
  2571. I see 1 and I'm going to do: predict-no
  2572. ENV: Agent did: predict-no for direction U in state State-A
  2573. In State-A moving U
  2574. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2575. predict error 0
  2576. dir: dir isU
  2577. \362: O: O724 (predict-no)
  2578. I see 1 and I'm going to do: predict-no
  2579. ENV: Agent did: predict-no for direction U in state State-A
  2580. In State-A moving U
  2581. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2582. predict error 0
  2583. dir: dir isL
  2584. -/|363: O: O726 (predict-no)
  2585. I see 1 and I'm going to do: predict-no
  2586. ENV: Agent did: predict-no for direction L in state State-A
  2587. In State-A moving L
  2588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2589. predict error 0
  2590. dir: dir isL
  2591. \-/364: O: O728 (predict-no)
  2592. I see 1 and I'm going to do: predict-no
  2593. ENV: Agent did: predict-no for direction L in state State-A
  2594. In State-A moving L
  2595. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2596. predict error 0
  2597. dir: dir isU
  2598. |\-365: O: O730 (predict-no)
  2599. I see 1 and I'm going to do: predict-no
  2600. ENV: Agent did: predict-no for direction U in state State-A
  2601. In State-A moving U
  2602. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2603. predict error 0
  2604. dir: dir isU
  2605. /|\366: O: O732 (predict-no)
  2606. I see 1 and I'm going to do: predict-no
  2607. ENV: Agent did: predict-no for direction U in state State-A
  2608. In State-A moving U
  2609. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2610. predict error 0
  2611. dir: dir isR
  2612. -/|\367: O: O733 (predict-yes)
  2613. I see 1 and I'm going to do: predict-yes
  2614. ENV: Agent did: predict-yes for direction R in state State-A
  2615. In State-A moving R
  2616. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2617. predict error 0
  2618. dir: dir isR
  2619. -/368: O: O736 (predict-no)
  2620. I see 1 and I'm going to do: predict-no
  2621. ENV: Agent did: predict-no for direction R in state State-B
  2622. In State-B moving R
  2623. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2624. predict error 0
  2625. dir: dir isU
  2626. |\-369: O: O738 (predict-no)
  2627. I see 1 and I'm going to do: predict-no
  2628. ENV: Agent did: predict-no for direction U in state State-B
  2629. In State-B moving U
  2630. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2631. predict error 0
  2632. dir: dir isR
  2633. /|\370: O: O740 (predict-no)
  2634. I see 1 and I'm going to do: predict-no
  2635. ENV: Agent did: predict-no for direction R in state State-B
  2636. In State-B moving R
  2637. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2638. predict error 0
  2639. dir: dir isR
  2640. -371: O: O742 (predict-no)
  2641. I see 1 and I'm going to do: predict-no
  2642. ENV: Agent did: predict-no for direction R in state State-B
  2643. In State-B moving R
  2644. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2645. predict error 0
  2646. dir: dir isR
  2647. /372: O: O744 (predict-no)
  2648. I see 1 and I'm going to do: predict-no
  2649. ENV: Agent did: predict-no for direction R in state State-B
  2650. In State-B moving R
  2651. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2652. predict error 0
  2653. dir: dir isL
  2654. |\-373: O: O745 (predict-yes)
  2655. I see 1 and I'm going to do: predict-yes
  2656. ENV: Agent did: predict-yes for direction L in state State-B
  2657. In State-B moving L
  2658. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2659. predict error 0
  2660. dir: dir isL
  2661. /|374: O: O748 (predict-no)
  2662. I see 1 and I'm going to do: predict-no
  2663. ENV: Agent did: predict-no for direction L in state State-A
  2664. In State-A moving L
  2665. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2666. predict error 0
  2667. dir: dir isR
  2668. \-375: O: O749 (predict-yes)
  2669. I see 1 and I'm going to do: predict-yes
  2670. ENV: Agent did: predict-yes for direction R in state State-A
  2671. In State-A moving R
  2672. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2673. predict error 0
  2674. dir: dir isR
  2675. /|\376: O: O752 (predict-no)
  2676. I see 1 and I'm going to do: predict-no
  2677. ENV: Agent did: predict-no for direction R in state State-B
  2678. In State-B moving R
  2679. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2680. predict error 0
  2681. dir: dir isR
  2682. -/|377: O: O754 (predict-no)
  2683. I see 1 and I'm going to do: predict-no
  2684. ENV: Agent did: predict-no for direction R in state State-B
  2685. In State-B moving R
  2686. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2687. predict error 0
  2688. dir: dir isL
  2689. \-/378: O: O755 (predict-yes)
  2690. I see 1 and I'm going to do: predict-yes
  2691. ENV: Agent did: predict-yes for direction L in state State-B
  2692. In State-B moving L
  2693. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2694. predict error 0
  2695. dir: dir isR
  2696. |\-379: O: O757 (predict-yes)
  2697. I see 1 and I'm going to do: predict-yes
  2698. ENV: Agent did: predict-yes for direction R in state State-A
  2699. In State-A moving R
  2700. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2701. predict error 0
  2702. dir: dir isL
  2703. /|\380: O: O759 (predict-yes)
  2704. I see 1 and I'm going to do: predict-yes
  2705. ENV: Agent did: predict-yes for direction L in state State-B
  2706. In State-B moving L
  2707. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2708. predict error 0
  2709. dir: dir isL
  2710. -/|381: O: O762 (predict-no)
  2711. I see 1 and I'm going to do: predict-no
  2712. ENV: Agent did: predict-no for direction L in state State-A
  2713. In State-A moving L
  2714. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2715. predict error 0
  2716. dir: dir isL
  2717. \382: O: O764 (predict-no)
  2718. I see 1 and I'm going to do: predict-no
  2719. ENV: Agent did: predict-no for direction L in state State-A
  2720. In State-A moving L
  2721. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2722. predict error 0
  2723. dir: dir isU
  2724. -/|\383: O: O766 (predict-no)
  2725. I see 1 and I'm going to do: predict-no
  2726. ENV: Agent did: predict-no for direction U in state State-A
  2727. In State-A moving U
  2728. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2729. predict error 0
  2730. dir: dir isR
  2731. -/384: O: O768 (predict-no)
  2732. I see 1 and I'm going to do: predict-no
  2733. ENV: Agent did: predict-no for direction R in state State-A
  2734. In State-A moving R
  2735. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2736. predict error 1
  2737. dir: dir isR
  2738. |\-385: O: O770 (predict-no)
  2739. I see 0 and I'm going to do: predict-no
  2740. ENV: Agent did: predict-no for direction R in state State-B
  2741. In State-B moving R
  2742. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2743. predict error 0
  2744. dir: dir isR
  2745. /|\386: O: O772 (predict-no)
  2746. I see 1 and I'm going to do: predict-no
  2747. ENV: Agent did: predict-no for direction R in state State-B
  2748. In State-B moving R
  2749. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2750. predict error 0
  2751. dir: dir isL
  2752. -/|387: O: O773 (predict-yes)
  2753. I see 1 and I'm going to do: predict-yes
  2754. ENV: Agent did: predict-yes for direction L in state State-B
  2755. In State-B moving L
  2756. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2757. predict error 0
  2758. dir: dir isL
  2759. \-/388: O: O776 (predict-no)
  2760. I see 1 and I'm going to do: predict-no
  2761. ENV: Agent did: predict-no for direction L in state State-A
  2762. In State-A moving L
  2763. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2764. predict error 0
  2765. dir: dir isR
  2766. |\-389: O: O777 (predict-yes)
  2767. I see 1 and I'm going to do: predict-yes
  2768. ENV: Agent did: predict-yes for direction R in state State-A
  2769. In State-A moving R
  2770. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2771. predict error 0
  2772. dir: dir isR
  2773. /|390: O: O780 (predict-no)
  2774. I see 1 and I'm going to do: predict-no
  2775. ENV: Agent did: predict-no for direction R in state State-B
  2776. In State-B moving R
  2777. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2778. predict error 0
  2779. dir: dir isR
  2780. \-/391: O: O782 (predict-no)
  2781. I see 1 and I'm going to do: predict-no
  2782. ENV: Agent did: predict-no for direction R in state State-B
  2783. In State-B moving R
  2784. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2785. predict error 0
  2786. dir: dir isR
  2787. |392: O: O784 (predict-no)
  2788. I see 1 and I'm going to do: predict-no
  2789. ENV: Agent did: predict-no for direction R in state State-B
  2790. In State-B moving R
  2791. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2792. predict error 0
  2793. dir: dir isU
  2794. \-/|393: O: O786 (predict-no)
  2795. I see 1 and I'm going to do: predict-no
  2796. ENV: Agent did: predict-no for direction U in state State-B
  2797. In State-B moving U
  2798. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2799. predict error 0
  2800. dir: dir isU
  2801. \-/394: O: O788 (predict-no)
  2802. I see 1 and I'm going to do: predict-no
  2803. ENV: Agent did: predict-no for direction U in state State-B
  2804. In State-B moving U
  2805. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2806. predict error 0
  2807. dir: dir isL
  2808. |\-395: O: O789 (predict-yes)
  2809. I see 1 and I'm going to do: predict-yes
  2810. ENV: Agent did: predict-yes for direction L in state State-B
  2811. In State-B moving L
  2812. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2813. predict error 0
  2814. dir: dir isR
  2815. /|396: O: O791 (predict-yes)
  2816. I see 1 and I'm going to do: predict-yes
  2817. ENV: Agent did: predict-yes for direction R in state State-A
  2818. In State-A moving R
  2819. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2820. predict error 0
  2821. dir: dir isR
  2822. \-/397: O: O794 (predict-no)
  2823. I see 1 and I'm going to do: predict-no
  2824. ENV: Agent did: predict-no for direction R in state State-B
  2825. In State-B moving R
  2826. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2827. predict error 0
  2828. dir: dir isL
  2829. |\-398: O: O795 (predict-yes)
  2830. I see 1 and I'm going to do: predict-yes
  2831. ENV: Agent did: predict-yes for direction L in state State-B
  2832. In State-B moving L
  2833. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2834. predict error 0
  2835. dir: dir isR
  2836. /|\399: O: O797 (predict-yes)
  2837. I see 1 and I'm going to do: predict-yes
  2838. ENV: Agent did: predict-yes for direction R in state State-A
  2839. In State-A moving R
  2840. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2841. predict error 0
  2842. dir: dir isR
  2843. -/|\400: O: O800 (predict-no)
  2844. I see 1 and I'm going to do: predict-no
  2845. ENV: Agent did: predict-no for direction R in state State-B
  2846. In State-B moving R
  2847. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2848. predict error 0
  2849. dir: dir isU
  2850. -/|401: O: O802 (predict-no)
  2851. I see 1 and I'm going to do: predict-no
  2852. ENV: Agent did: predict-no for direction U in state State-B
  2853. In State-B moving U
  2854. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2855. predict error 0
  2856. dir: dir isU
  2857. \402: O: O803 (predict-yes)
  2858. I see 1 and I'm going to do: predict-yes
  2859. ENV: Agent did: predict-yes for direction U in state State-B
  2860. In State-B moving U
  2861. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2862. predict error 1
  2863. dir: dir isL
  2864. -/|403: O: O806 (predict-no)
  2865. I see 0 and I'm going to do: predict-no
  2866. ENV: Agent did: predict-no for direction L in state State-B
  2867. In State-B moving L
  2868. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2869. predict error 1
  2870. dir: dir isR
  2871. \404: O: O808 (predict-no)
  2872. I see 0 and I'm going to do: predict-no
  2873. ENV: Agent did: predict-no for direction R in state State-A
  2874. In State-A moving R
  2875. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2876. predict error 1
  2877. dir: dir isL
  2878. -405: O: O809 (predict-yes)
  2879. I see 0 and I'm going to do: predict-yes
  2880. ENV: Agent did: predict-yes for direction L in state State-B
  2881. In State-B moving L
  2882. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2883. predict error 0
  2884. dir: dir isL
  2885. /|406: O: O812 (predict-no)
  2886. I see 1 and I'm going to do: predict-no
  2887. ENV: Agent did: predict-no for direction L in state State-A
  2888. In State-A moving L
  2889. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2890. predict error 0
  2891. dir: dir isR
  2892. \-/407: O: O813 (predict-yes)
  2893. I see 1 and I'm going to do: predict-yes
  2894. ENV: Agent did: predict-yes for direction R in state State-A
  2895. In State-A moving R
  2896. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2897. predict error 0
  2898. dir: dir isU
  2899. |\-408: O: O815 (predict-yes)
  2900. I see 1 and I'm going to do: predict-yes
  2901. ENV: Agent did: predict-yes for direction U in state State-B
  2902. In State-B moving U
  2903. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2904. predict error 1
  2905. dir: dir isL
  2906. /|409: O: O817 (predict-yes)
  2907. I see 0 and I'm going to do: predict-yes
  2908. ENV: Agent did: predict-yes for direction L in state State-B
  2909. In State-B moving L
  2910. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2911. predict error 0
  2912. dir: dir isU
  2913. \-/410: O: O820 (predict-no)
  2914. I see 1 and I'm going to do: predict-no
  2915. ENV: Agent did: predict-no for direction U in state State-A
  2916. In State-A moving U
  2917. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2918. predict error 0
  2919. dir: dir isU
  2920. |\-411: O: O822 (predict-no)
  2921. I see 1 and I'm going to do: predict-no
  2922. ENV: Agent did: predict-no for direction U in state State-A
  2923. In State-A moving U
  2924. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2925. predict error 0
  2926. dir: dir isL
  2927. /412: O: O824 (predict-no)
  2928. I see 1 and I'm going to do: predict-no
  2929. ENV: Agent did: predict-no for direction L in state State-A
  2930. In State-A moving L
  2931. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2932. predict error 0
  2933. dir: dir isU
  2934. |\-413: O: O826 (predict-no)
  2935. I see 1 and I'm going to do: predict-no
  2936. ENV: Agent did: predict-no for direction U in state State-A
  2937. In State-A moving U
  2938. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2939. predict error 0
  2940. dir: dir isU
  2941. /|\414: O: O828 (predict-no)
  2942. I see 1 and I'm going to do: predict-no
  2943. ENV: Agent did: predict-no for direction U in state State-A
  2944. In State-A moving U
  2945. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2946. predict error 0
  2947. dir: dir isR
  2948. -/|415: O: O829 (predict-yes)
  2949. I see 1 and I'm going to do: predict-yes
  2950. ENV: Agent did: predict-yes for direction R in state State-A
  2951. In State-A moving R
  2952. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2953. predict error 0
  2954. dir: dir isU
  2955. \-/416: O: O832 (predict-no)
  2956. I see 1 and I'm going to do: predict-no
  2957. ENV: Agent did: predict-no for direction U in state State-B
  2958. In State-B moving U
  2959. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2960. predict error 0
  2961. dir: dir isU
  2962. |\417: O: O834 (predict-no)
  2963. I see 1 and I'm going to do: predict-no
  2964. ENV: Agent did: predict-no for direction U in state State-B
  2965. In State-B moving U
  2966. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2967. predict error 0
  2968. dir: dir isR
  2969. -/418: O: O836 (predict-no)
  2970. I see 1 and I'm going to do: predict-no
  2971. ENV: Agent did: predict-no for direction R in state State-B
  2972. In State-B moving R
  2973. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2974. predict error 0
  2975. dir: dir isU
  2976. |\-419: O: O838 (predict-no)
  2977. I see 1 and I'm going to do: predict-no
  2978. ENV: Agent did: predict-no for direction U in state State-B
  2979. In State-B moving U
  2980. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2981. predict error 0
  2982. dir: dir isU
  2983. /|420: O: O840 (predict-no)
  2984. I see 1 and I'm going to do: predict-no
  2985. ENV: Agent did: predict-no for direction U in state State-B
  2986. In State-B moving U
  2987. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2988. predict error 0
  2989. dir: dir isU
  2990. \-/421: O: O842 (predict-no)
  2991. I see 1 and I'm going to do: predict-no
  2992. ENV: Agent did: predict-no for direction U in state State-B
  2993. In State-B moving U
  2994. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2995. predict error 0
  2996. dir: dir isR
  2997. |422: O: O844 (predict-no)
  2998. I see 1 and I'm going to do: predict-no
  2999. ENV: Agent did: predict-no for direction R in state State-B
  3000. In State-B moving R
  3001. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3002. predict error 0
  3003. dir: dir isL
  3004. \423: O: O845 (predict-yes)
  3005. I see 1 and I'm going to do: predict-yes
  3006. ENV: Agent did: predict-yes for direction L in state State-B
  3007. In State-B moving L
  3008. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3009. predict error 0
  3010. dir: dir isL
  3011. -/|424: O: O848 (predict-no)
  3012. I see 1 and I'm going to do: predict-no
  3013. ENV: Agent did: predict-no for direction L in state State-A
  3014. In State-A moving L
  3015. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3016. predict error 0
  3017. dir: dir isL
  3018. \-425: O: O850 (predict-no)
  3019. I see 1 and I'm going to do: predict-no
  3020. ENV: Agent did: predict-no for direction L in state State-A
  3021. In State-A moving L
  3022. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3023. predict error 0
  3024. dir: dir isR
  3025. /|426: O: O851 (predict-yes)
  3026. I see 1 and I'm going to do: predict-yes
  3027. ENV: Agent did: predict-yes for direction R in state State-A
  3028. In State-A moving R
  3029. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3030. predict error 0
  3031. dir: dir isU
  3032. \-/427: O: O854 (predict-no)
  3033. I see 1 and I'm going to do: predict-no
  3034. ENV: Agent did: predict-no for direction U in state State-B
  3035. In State-B moving U
  3036. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3037. predict error 0
  3038. dir: dir isL
  3039. |\-428: O: O855 (predict-yes)
  3040. I see 1 and I'm going to do: predict-yes
  3041. ENV: Agent did: predict-yes for direction L in state State-B
  3042. In State-B moving L
  3043. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3044. predict error 0
  3045. dir: dir isU
  3046. /|\429: O: O858 (predict-no)
  3047. I see 1 and I'm going to do: predict-no
  3048. ENV: Agent did: predict-no for direction U in state State-A
  3049. In State-A moving U
  3050. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3051. predict error 0
  3052. dir: dir isU
  3053. -/|430: O: O859 (predict-yes)
  3054. I see 1 and I'm going to do: predict-yes
  3055. ENV: Agent did: predict-yes for direction U in state State-A
  3056. In State-A moving U
  3057. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3058. predict error 1
  3059. dir: dir isR
  3060. \-/431: O: O861 (predict-yes)
  3061. I see 0 and I'm going to do: predict-yes
  3062. ENV: Agent did: predict-yes for direction R in state State-A
  3063. In State-A moving R
  3064. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3065. predict error 0
  3066. dir: dir isR
  3067. |432: O: O864 (predict-no)
  3068. I see 1 and I'm going to do: predict-no
  3069. ENV: Agent did: predict-no for direction R in state State-B
  3070. In State-B moving R
  3071. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3072. predict error 0
  3073. dir: dir isL
  3074. \-/433: O: O865 (predict-yes)
  3075. I see 1 and I'm going to do: predict-yes
  3076. ENV: Agent did: predict-yes for direction L in state State-B
  3077. In State-B moving L
  3078. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3079. predict error 0
  3080. dir: dir isU
  3081. |\-434: O: O868 (predict-no)
  3082. I see 1 and I'm going to do: predict-no
  3083. ENV: Agent did: predict-no for direction U in state State-A
  3084. In State-A moving U
  3085. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3086. predict error 0
  3087. dir: dir isL
  3088. /|\435: O: O869 (predict-yes)
  3089. I see 1 and I'm going to do: predict-yes
  3090. ENV: Agent did: predict-yes for direction L in state State-A
  3091. In State-A moving L
  3092. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3093. predict error 1
  3094. dir: dir isU
  3095. -/|436: O: O872 (predict-no)
  3096. I see 0 and I'm going to do: predict-no
  3097. ENV: Agent did: predict-no for direction U in state State-A
  3098. In State-A moving U
  3099. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3100. predict error 0
  3101. dir: dir isU
  3102. \-/437: O: O874 (predict-no)
  3103. I see 1 and I'm going to do: predict-no
  3104. ENV: Agent did: predict-no for direction U in state State-A
  3105. In State-A moving U
  3106. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3107. predict error 0
  3108. dir: dir isR
  3109. |\-438: O: O875 (predict-yes)
  3110. I see 1 and I'm going to do: predict-yes
  3111. ENV: Agent did: predict-yes for direction R in state State-A
  3112. In State-A moving R
  3113. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3114. predict error 0
  3115. dir: dir isL
  3116. /|439: O: O877 (predict-yes)
  3117. I see 1 and I'm going to do: predict-yes
  3118. ENV: Agent did: predict-yes for direction L in state State-B
  3119. In State-B moving L
  3120. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3121. predict error 0
  3122. dir: dir isU
  3123. \-/440: O: O880 (predict-no)
  3124. I see 1 and I'm going to do: predict-no
  3125. ENV: Agent did: predict-no for direction U in state State-A
  3126. In State-A moving U
  3127. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3128. predict error 0
  3129. dir: dir isU
  3130. |\-441: O: O882 (predict-no)
  3131. I see 1 and I'm going to do: predict-no
  3132. ENV: Agent did: predict-no for direction U in state State-A
  3133. In State-A moving U
  3134. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3135. predict error 0
  3136. dir: dir isL
  3137. /442: O: O884 (predict-no)
  3138. I see 1 and I'm going to do: predict-no
  3139. ENV: Agent did: predict-no for direction L in state State-A
  3140. In State-A moving L
  3141. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3142. predict error 0
  3143. dir: dir isU
  3144. |\-443: O: O886 (predict-no)
  3145. I see 1 and I'm going to do: predict-no
  3146. ENV: Agent did: predict-no for direction U in state State-A
  3147. In State-A moving U
  3148. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3149. predict error 0
  3150. dir: dir isU
  3151. /|\444: O: O888 (predict-no)
  3152. I see 1 and I'm going to do: predict-no
  3153. ENV: Agent did: predict-no for direction U in state State-A
  3154. In State-A moving U
  3155. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3156. predict error 0
  3157. dir: dir isR
  3158. -/|\445: O: O889 (predict-yes)
  3159. I see 1 and I'm going to do: predict-yes
  3160. ENV: Agent did: predict-yes for direction R in state State-A
  3161. In State-A moving R
  3162. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3163. predict error 0
  3164. dir: dir isU
  3165. -/446: O: O892 (predict-no)
  3166. I see 1 and I'm going to do: predict-no
  3167. ENV: Agent did: predict-no for direction U in state State-B
  3168. In State-B moving U
  3169. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3170. predict error 0
  3171. dir: dir isR
  3172. |\-/447: O: O894 (predict-no)
  3173. I see 1 and I'm going to do: predict-no
  3174. ENV: Agent did: predict-no for direction R in state State-B
  3175. In State-B moving R
  3176. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3177. predict error 0
  3178. dir: dir isU
  3179. |\-448: O: O896 (predict-no)
  3180. I see 1 and I'm going to do: predict-no
  3181. ENV: Agent did: predict-no for direction U in state State-B
  3182. In State-B moving U
  3183. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3184. predict error 0
  3185. dir: dir isU
  3186. /|\449: O: O898 (predict-no)
  3187. I see 1 and I'm going to do: predict-no
  3188. ENV: Agent did: predict-no for direction U in state State-B
  3189. In State-B moving U
  3190. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3191. predict error 0
  3192. dir: dir isR
  3193. -/|450: O: O900 (predict-no)
  3194. I see 1 and I'm going to do: predict-no
  3195. ENV: Agent did: predict-no for direction R in state State-B
  3196. In State-B moving R
  3197. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3198. predict error 0
  3199. dir: dir isU
  3200. \-/451: O: O902 (predict-no)
  3201. I see 1 and I'm going to do: predict-no
  3202. ENV: Agent did: predict-no for direction U in state State-B
  3203. In State-B moving U
  3204. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3205. predict error 0
  3206. dir: dir isR
  3207. |452: O: O904 (predict-no)
  3208. I see 1 and I'm going to do: predict-no
  3209. ENV: Agent did: predict-no for direction R in state State-B
  3210. In State-B moving R
  3211. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3212. predict error 0
  3213. dir: dir isL
  3214. \-/453: O: O905 (predict-yes)
  3215. I see 1 and I'm going to do: predict-yes
  3216. ENV: Agent did: predict-yes for direction L in state State-B
  3217. In State-B moving L
  3218. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3219. predict error 0
  3220. dir: dir isL
  3221. |\-454: O: O908 (predict-no)
  3222. I see 1 and I'm going to do: predict-no
  3223. ENV: Agent did: predict-no for direction L in state State-A
  3224. In State-A moving L
  3225. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3226. predict error 0
  3227. dir: dir isL
  3228. /|455: O: O910 (predict-no)
  3229. I see 1 and I'm going to do: predict-no
  3230. ENV: Agent did: predict-no for direction L in state State-A
  3231. In State-A moving L
  3232. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3233. predict error 0
  3234. dir: dir isU
  3235. \-456: O: O912 (predict-no)
  3236. I see 1 and I'm going to do: predict-no
  3237. ENV: Agent did: predict-no for direction U in state State-A
  3238. In State-A moving U
  3239. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3240. predict error 0
  3241. dir: dir isU
  3242. /|\-457: O: O914 (predict-no)
  3243. I see 1 and I'm going to do: predict-no
  3244. ENV: Agent did: predict-no for direction U in state State-A
  3245. In State-A moving U
  3246. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3247. predict error 0
  3248. dir: dir isL
  3249. /|\458: O: O916 (predict-no)
  3250. I see 1 and I'm going to do: predict-no
  3251. ENV: Agent did: predict-no for direction L in state State-A
  3252. In State-A moving L
  3253. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3254. predict error 0
  3255. dir: dir isR
  3256. -/459: O: O918 (predict-no)
  3257. I see 1 and I'm going to do: predict-no
  3258. ENV: Agent did: predict-no for direction R in state State-A
  3259. In State-A moving R
  3260. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3261. predict error 1
  3262. dir: dir isR
  3263. |\-460: O: O920 (predict-no)
  3264. I see 0 and I'm going to do: predict-no
  3265. ENV: Agent did: predict-no for direction R in state State-B
  3266. In State-B moving R
  3267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3268. predict error 0
  3269. dir: dir isL
  3270. /|461: O: O921 (predict-yes)
  3271. I see 1 and I'm going to do: predict-yes
  3272. ENV: Agent did: predict-yes for direction L in state State-B
  3273. In State-B moving L
  3274. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3275. predict error 0
  3276. dir: dir isL
  3277. \462: O: O924 (predict-no)
  3278. I see 1 and I'm going to do: predict-no
  3279. ENV: Agent did: predict-no for direction L in state State-A
  3280. In State-A moving L
  3281. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3282. predict error 0
  3283. dir: dir isL
  3284. -/|463: O: O926 (predict-no)
  3285. I see 1 and I'm going to do: predict-no
  3286. ENV: Agent did: predict-no for direction L in state State-A
  3287. In State-A moving L
  3288. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3289. predict error 0
  3290. dir: dir isU
  3291. \-/464: O: O928 (predict-no)
  3292. I see 1 and I'm going to do: predict-no
  3293. ENV: Agent did: predict-no for direction U in state State-A
  3294. In State-A moving U
  3295. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3296. predict error 0
  3297. dir: dir isL
  3298. |\-465: O: O930 (predict-no)
  3299. I see 1 and I'm going to do: predict-no
  3300. ENV: Agent did: predict-no for direction L in state State-A
  3301. In State-A moving L
  3302. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3303. predict error 0
  3304. dir: dir isL
  3305. /|\466: O: O932 (predict-no)
  3306. I see 1 and I'm going to do: predict-no
  3307. ENV: Agent did: predict-no for direction L in state State-A
  3308. In State-A moving L
  3309. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3310. predict error 0
  3311. dir: dir isR
  3312. -/|467: O: O933 (predict-yes)
  3313. I see 1 and I'm going to do: predict-yes
  3314. ENV: Agent did: predict-yes for direction R in state State-A
  3315. In State-A moving R
  3316. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3317. predict error 0
  3318. dir: dir isL
  3319. \-/468: O: O935 (predict-yes)
  3320. I see 1 and I'm going to do: predict-yes
  3321. ENV: Agent did: predict-yes for direction L in state State-B
  3322. In State-B moving L
  3323. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3324. predict error 0
  3325. dir: dir isR
  3326. |\-469: O: O937 (predict-yes)
  3327. I see 1 and I'm going to do: predict-yes
  3328. ENV: Agent did: predict-yes for direction R in state State-A
  3329. In State-A moving R
  3330. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3331. predict error 0
  3332. dir: dir isR
  3333. /|\470: O: O939 (predict-yes)
  3334. I see 1 and I'm going to do: predict-yes
  3335. ENV: Agent did: predict-yes for direction R in state State-B
  3336. In State-B moving R
  3337. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3338. predict error 1
  3339. dir: dir isU
  3340. -/|471: O: O942 (predict-no)
  3341. I see 0 and I'm going to do: predict-no
  3342. ENV: Agent did: predict-no for direction U in state State-B
  3343. In State-B moving U
  3344. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3345. predict error 0
  3346. dir: dir isL
  3347. \472: O: O943 (predict-yes)
  3348. I see 1 and I'm going to do: predict-yes
  3349. ENV: Agent did: predict-yes for direction L in state State-B
  3350. In State-B moving L
  3351. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3352. predict error 0
  3353. dir: dir isL
  3354. -/|473: O: O946 (predict-no)
  3355. I see 1 and I'm going to do: predict-no
  3356. ENV: Agent did: predict-no for direction L in state State-A
  3357. In State-A moving L
  3358. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3359. predict error 0
  3360. dir: dir isR
  3361. \-/474: O: O947 (predict-yes)
  3362. I see 1 and I'm going to do: predict-yes
  3363. ENV: Agent did: predict-yes for direction R in state State-A
  3364. In State-A moving R
  3365. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3366. predict error 0
  3367. dir: dir isL
  3368. |\-475: O: O949 (predict-yes)
  3369. I see 1 and I'm going to do: predict-yes
  3370. ENV: Agent did: predict-yes for direction L in state State-B
  3371. In State-B moving L
  3372. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3373. predict error 0
  3374. dir: dir isR
  3375. /|\476: O: O951 (predict-yes)
  3376. I see 1 and I'm going to do: predict-yes
  3377. ENV: Agent did: predict-yes for direction R in state State-A
  3378. In State-A moving R
  3379. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3380. predict error 0
  3381. dir: dir isL
  3382. -/477: O: O953 (predict-yes)
  3383. I see 1 and I'm going to do: predict-yes
  3384. ENV: Agent did: predict-yes for direction L in state State-B
  3385. In State-B moving L
  3386. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3387. predict error 0
  3388. dir: dir isU
  3389. |\-478: O: O956 (predict-no)
  3390. I see 1 and I'm going to do: predict-no
  3391. ENV: Agent did: predict-no for direction U in state State-A
  3392. In State-A moving U
  3393. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3394. predict error 0
  3395. dir: dir isU
  3396. /|\479: O: O958 (predict-no)
  3397. I see 1 and I'm going to do: predict-no
  3398. ENV: Agent did: predict-no for direction U in state State-A
  3399. In State-A moving U
  3400. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3401. predict error 0
  3402. dir: dir isU
  3403. -/|480: O: O960 (predict-no)
  3404. I see 1 and I'm going to do: predict-no
  3405. ENV: Agent did: predict-no for direction U in state State-A
  3406. In State-A moving U
  3407. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3408. predict error 0
  3409. dir: dir isU
  3410. \-/481: O: O962 (predict-no)
  3411. I see 1 and I'm going to do: predict-no
  3412. ENV: Agent did: predict-no for direction U in state State-A
  3413. In State-A moving U
  3414. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3415. predict error 0
  3416. dir: dir isR
  3417. |482: O: O963 (predict-yes)
  3418. I see 1 and I'm going to do: predict-yes
  3419. ENV: Agent did: predict-yes for direction R in state State-A
  3420. In State-A moving R
  3421. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3422. predict error 0
  3423. dir: dir isR
  3424. \-/483: O: O966 (predict-no)
  3425. I see 1 and I'm going to do: predict-no
  3426. ENV: Agent did: predict-no for direction R in state State-B
  3427. In State-B moving R
  3428. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3429. predict error 0
  3430. dir: dir isU
  3431. |\-484: O: O968 (predict-no)
  3432. I see 1 and I'm going to do: predict-no
  3433. ENV: Agent did: predict-no for direction U in state State-B
  3434. In State-B moving U
  3435. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3436. predict error 0
  3437. dir: dir isU
  3438. /|485: O: O970 (predict-no)
  3439. I see 1 and I'm going to do: predict-no
  3440. ENV: Agent did: predict-no for direction U in state State-B
  3441. In State-B moving U
  3442. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3443. predict error 0
  3444. dir: dir isR
  3445. \486: O: O972 (predict-no)
  3446. I see 1 and I'm going to do: predict-no
  3447. ENV: Agent did: predict-no for direction R in state State-B
  3448. In State-B moving R
  3449. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3450. predict error 0
  3451. dir: dir isR
  3452. -/487: O: O974 (predict-no)
  3453. I see 1 and I'm going to do: predict-no
  3454. ENV: Agent did: predict-no for direction R in state State-B
  3455. In State-B moving R
  3456. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3457. predict error 0
  3458. dir: dir isL
  3459. |\-488: O: O975 (predict-yes)
  3460. I see 1 and I'm going to do: predict-yes
  3461. ENV: Agent did: predict-yes for direction L in state State-B
  3462. In State-B moving L
  3463. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3464. predict error 0
  3465. dir: dir isL
  3466. /489: O: O978 (predict-no)
  3467. I see 1 and I'm going to do: predict-no
  3468. ENV: Agent did: predict-no for direction L in state State-A
  3469. In State-A moving L
  3470. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3471. predict error 0
  3472. dir: dir isU
  3473. |\-490: O: O980 (predict-no)
  3474. I see 1 and I'm going to do: predict-no
  3475. ENV: Agent did: predict-no for direction U in state State-A
  3476. In State-A moving U
  3477. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3478. predict error 0
  3479. dir: dir isL
  3480. /|\491: O: O982 (predict-no)
  3481. I see 1 and I'm going to do: predict-no
  3482. ENV: Agent did: predict-no for direction L in state State-A
  3483. In State-A moving L
  3484. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3485. predict error 0
  3486. dir: dir isU
  3487. -492: O: O984 (predict-no)
  3488. I see 1 and I'm going to do: predict-no
  3489. ENV: Agent did: predict-no for direction U in state State-A
  3490. In State-A moving U
  3491. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3492. predict error 0
  3493. dir: dir isR
  3494. /|\493: O: O985 (predict-yes)
  3495. I see 1 and I'm going to do: predict-yes
  3496. ENV: Agent did: predict-yes for direction R in state State-A
  3497. In State-A moving R
  3498. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3499. predict error 0
  3500. dir: dir isU
  3501. -/494: O: O988 (predict-no)
  3502. I see 1 and I'm going to do: predict-no
  3503. ENV: Agent did: predict-no for direction U in state State-B
  3504. In State-B moving U
  3505. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3506. predict error 0
  3507. dir: dir isU
  3508. |\495: O: O990 (predict-no)
  3509. I see 1 and I'm going to do: predict-no
  3510. ENV: Agent did: predict-no for direction U in state State-B
  3511. In State-B moving U
  3512. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3513. predict error 0
  3514. dir: dir isU
  3515. -/496: O: O992 (predict-no)
  3516. I see 1 and I'm going to do: predict-no
  3517. ENV: Agent did: predict-no for direction U in state State-B
  3518. In State-B moving U
  3519. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3520. predict error 0
  3521. dir: dir isL
  3522. |\497: O: O993 (predict-yes)
  3523. I see 1 and I'm going to do: predict-yes
  3524. ENV: Agent did: predict-yes for direction L in state State-B
  3525. In State-B moving L
  3526. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3527. predict error 0
  3528. dir: dir isR
  3529. -498: O: O995 (predict-yes)
  3530. I see 1 and I'm going to do: predict-yes
  3531. ENV: Agent did: predict-yes for direction R in state State-A
  3532. In State-A moving R
  3533. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3534. predict error 0
  3535. dir: dir isR
  3536. /|\499: O: O998 (predict-no)
  3537. I see 1 and I'm going to do: predict-no
  3538. ENV: Agent did: predict-no for direction R in state State-B
  3539. In State-B moving R
  3540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3541. predict error 0
  3542. dir: dir isL
  3543. -/|500: O: O999 (predict-yes)
  3544. I see 1 and I'm going to do: predict-yes
  3545. ENV: Agent did: predict-yes for direction L in state State-B
  3546. In State-B moving L
  3547. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3548. predict error 0
  3549. dir: dir isR
  3550. \-/|\-501: O: O1001 (predict-yes)
  3551. I see 1 and I'm going to do: predict-yes
  3552. ENV: Agent did: predict-yes for direction R in state State-A
  3553. In State-A moving R
  3554. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3555. predict error 0
  3556. dir: dir isR
  3557. /502: O: O1004 (predict-no)
  3558. I see 1 and I'm going to do: predict-no
  3559. ENV: Agent did: predict-no for direction R in state State-B
  3560. In State-B moving R
  3561. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3562. predict error 0
  3563. dir: dir isR
  3564. |\503: O: O1006 (predict-no)
  3565. I see 1 and I'm going to do: predict-no
  3566. ENV: Agent did: predict-no for direction R in state State-B
  3567. In State-B moving R
  3568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3569. predict error 0
  3570. dir: dir isL
  3571. -/|504: O: O1007 (predict-yes)
  3572. I see 1 and I'm going to do: predict-yes
  3573. ENV: Agent did: predict-yes for direction L in state State-B
  3574. In State-B moving L
  3575. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3576. predict error 0
  3577. dir: dir isR
  3578. \-/505: O: O1009 (predict-yes)
  3579. I see 1 and I'm going to do: predict-yes
  3580. ENV: Agent did: predict-yes for direction R in state State-A
  3581. In State-A moving R
  3582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3583. predict error 0
  3584. dir: dir isR
  3585. |\506: O: O1012 (predict-no)
  3586. I see 1 and I'm going to do: predict-no
  3587. ENV: Agent did: predict-no for direction R in state State-B
  3588. In State-B moving R
  3589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3590. predict error 0
  3591. dir: dir isL
  3592. -507: O: O1013 (predict-yes)
  3593. I see 1 and I'm going to do: predict-yes
  3594. ENV: Agent did: predict-yes for direction L in state State-B
  3595. In State-B moving L
  3596. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3597. predict error 0
  3598. dir: dir isR
  3599. /|508: O: O1015 (predict-yes)
  3600. I see 1 and I'm going to do: predict-yes
  3601. ENV: Agent did: predict-yes for direction R in state State-A
  3602. In State-A moving R
  3603. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3604. predict error 0
  3605. dir: dir isU
  3606. \-509: O: O1018 (predict-no)
  3607. I see 1 and I'm going to do: predict-no
  3608. ENV: Agent did: predict-no for direction U in state State-B
  3609. In State-B moving U
  3610. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3611. predict error 0
  3612. dir: dir isU
  3613. /|510: O: O1020 (predict-no)
  3614. I see 1 and I'm going to do: predict-no
  3615. ENV: Agent did: predict-no for direction U in state State-B
  3616. In State-B moving U
  3617. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3618. predict error 0
  3619. dir: dir isR
  3620. \-/511: O: O1022 (predict-no)
  3621. I see 1 and I'm going to do: predict-no
  3622. ENV: Agent did: predict-no for direction R in state State-B
  3623. In State-B moving R
  3624. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3625. predict error 0
  3626. dir: dir isR
  3627. |512: O: O1024 (predict-no)
  3628. I see 1 and I'm going to do: predict-no
  3629. ENV: Agent did: predict-no for direction R in state State-B
  3630. In State-B moving R
  3631. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3632. predict error 0
  3633. dir: dir isR
  3634. \-513: O: O1026 (predict-no)
  3635. I see 1 and I'm going to do: predict-no
  3636. ENV: Agent did: predict-no for direction R in state State-B
  3637. In State-B moving R
  3638. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3639. predict error 0
  3640. dir: dir isL
  3641. /|\514: O: O1027 (predict-yes)
  3642. I see 1 and I'm going to do: predict-yes
  3643. ENV: Agent did: predict-yes for direction L in state State-B
  3644. In State-B moving L
  3645. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3646. predict error 0
  3647. dir: dir isL
  3648. -/|515: O: O1030 (predict-no)
  3649. I see 1 and I'm going to do: predict-no
  3650. ENV: Agent did: predict-no for direction L in state State-A
  3651. In State-A moving L
  3652. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3653. predict error 0
  3654. dir: dir isL
  3655. \-/516: O: O1032 (predict-no)
  3656. I see 1 and I'm going to do: predict-no
  3657. ENV: Agent did: predict-no for direction L in state State-A
  3658. In State-A moving L
  3659. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3660. predict error 0
  3661. dir: dir isR
  3662. |\-517: O: O1033 (predict-yes)
  3663. I see 1 and I'm going to do: predict-yes
  3664. ENV: Agent did: predict-yes for direction R in state State-A
  3665. In State-A moving R
  3666. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3667. predict error 0
  3668. dir: dir isU
  3669. /|\518: O: O1036 (predict-no)
  3670. I see 1 and I'm going to do: predict-no
  3671. ENV: Agent did: predict-no for direction U in state State-B
  3672. In State-B moving U
  3673. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3674. predict error 0
  3675. dir: dir isU
  3676. -/|519: O: O1038 (predict-no)
  3677. I see 1 and I'm going to do: predict-no
  3678. ENV: Agent did: predict-no for direction U in state State-B
  3679. In State-B moving U
  3680. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3681. predict error 0
  3682. dir: dir isR
  3683. \-520: O: O1040 (predict-no)
  3684. I see 1 and I'm going to do: predict-no
  3685. ENV: Agent did: predict-no for direction R in state State-B
  3686. In State-B moving R
  3687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3688. predict error 0
  3689. dir: dir isU
  3690. /|521: O: O1042 (predict-no)
  3691. I see 1 and I'm going to do: predict-no
  3692. ENV: Agent did: predict-no for direction U in state State-B
  3693. In State-B moving U
  3694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3695. predict error 0
  3696. dir: dir isR
  3697. \522: O: O1044 (predict-no)
  3698. I see 1 and I'm going to do: predict-no
  3699. ENV: Agent did: predict-no for direction R in state State-B
  3700. In State-B moving R
  3701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3702. predict error 0
  3703. dir: dir isU
  3704. -/|523: O: O1046 (predict-no)
  3705. I see 1 and I'm going to do: predict-no
  3706. ENV: Agent did: predict-no for direction U in state State-B
  3707. In State-B moving U
  3708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3709. predict error 0
  3710. dir: dir isR
  3711. \-/524: O: O1048 (predict-no)
  3712. I see 1 and I'm going to do: predict-no
  3713. ENV: Agent did: predict-no for direction R in state State-B
  3714. In State-B moving R
  3715. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3716. predict error 0
  3717. dir: dir isU
  3718. |\-525: O: O1050 (predict-no)
  3719. I see 1 and I'm going to do: predict-no
  3720. ENV: Agent did: predict-no for direction U in state State-B
  3721. In State-B moving U
  3722. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3723. predict error 0
  3724. dir: dir isU
  3725. /|526: O: O1052 (predict-no)
  3726. I see 1 and I'm going to do: predict-no
  3727. ENV: Agent did: predict-no for direction U in state State-B
  3728. In State-B moving U
  3729. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3730. predict error 0
  3731. dir: dir isL
  3732. \-/527: O: O1053 (predict-yes)
  3733. I see 1 and I'm going to do: predict-yes
  3734. ENV: Agent did: predict-yes for direction L in state State-B
  3735. In State-B moving L
  3736. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3737. predict error 0
  3738. dir: dir isL
  3739. |\-/528: O: O1056 (predict-no)
  3740. I see 1 and I'm going to do: predict-no
  3741. ENV: Agent did: predict-no for direction L in state State-A
  3742. In State-A moving L
  3743. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3744. predict error 0
  3745. dir: dir isR
  3746. |\-529: O: O1057 (predict-yes)
  3747. I see 1 and I'm going to do: predict-yes
  3748. ENV: Agent did: predict-yes for direction R in state State-A
  3749. In State-A moving R
  3750. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3751. predict error 0
  3752. dir: dir isR
  3753. /|\530: O: O1060 (predict-no)
  3754. I see 1 and I'm going to do: predict-no
  3755. ENV: Agent did: predict-no for direction R in state State-B
  3756. In State-B moving R
  3757. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3758. predict error 0
  3759. dir: dir isR
  3760. -/|531: O: O1062 (predict-no)
  3761. I see 1 and I'm going to do: predict-no
  3762. ENV: Agent did: predict-no for direction R in state State-B
  3763. In State-B moving R
  3764. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3765. predict error 0
  3766. dir: dir isL
  3767. \532: O: O1063 (predict-yes)
  3768. I see 1 and I'm going to do: predict-yes
  3769. ENV: Agent did: predict-yes for direction L in state State-B
  3770. In State-B moving L
  3771. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3772. predict error 0
  3773. dir: dir isR
  3774. -/533: O: O1065 (predict-yes)
  3775. I see 1 and I'm going to do: predict-yes
  3776. ENV: Agent did: predict-yes for direction R in state State-A
  3777. In State-A moving R
  3778. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3779. predict error 0
  3780. dir: dir isR
  3781. |\534: O: O1068 (predict-no)
  3782. I see 1 and I'm going to do: predict-no
  3783. ENV: Agent did: predict-no for direction R in state State-B
  3784. In State-B moving R
  3785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3786. predict error 0
  3787. dir: dir isR
  3788. -/|535: O: O1070 (predict-no)
  3789. I see 1 and I'm going to do: predict-no
  3790. ENV: Agent did: predict-no for direction R in state State-B
  3791. In State-B moving R
  3792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3793. predict error 0
  3794. dir: dir isU
  3795. \-/536: O: O1072 (predict-no)
  3796. I see 1 and I'm going to do: predict-no
  3797. ENV: Agent did: predict-no for direction U in state State-B
  3798. In State-B moving U
  3799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3800. predict error 0
  3801. dir: dir isR
  3802. |\537: O: O1074 (predict-no)
  3803. I see 1 and I'm going to do: predict-no
  3804. ENV: Agent did: predict-no for direction R in state State-B
  3805. In State-B moving R
  3806. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3807. predict error 0
  3808. dir: dir isU
  3809. -538: O: O1076 (predict-no)
  3810. I see 1 and I'm going to do: predict-no
  3811. ENV: Agent did: predict-no for direction U in state State-B
  3812. In State-B moving U
  3813. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3814. predict error 0
  3815. dir: dir isU
  3816. /|\539: O: O1078 (predict-no)
  3817. I see 1 and I'm going to do: predict-no
  3818. ENV: Agent did: predict-no for direction U in state State-B
  3819. In State-B moving U
  3820. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3821. predict error 0
  3822. dir: dir isU
  3823. -/|540: O: O1080 (predict-no)
  3824. I see 1 and I'm going to do: predict-no
  3825. ENV: Agent did: predict-no for direction U in state State-B
  3826. In State-B moving U
  3827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3828. predict error 0
  3829. dir: dir isR
  3830. \-/|541: O: O1082 (predict-no)
  3831. I see 1 and I'm going to do: predict-no
  3832. ENV: Agent did: predict-no for direction R in state State-B
  3833. In State-B moving R
  3834. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3835. predict error 0
  3836. dir: dir isU
  3837. \542: O: O1084 (predict-no)
  3838. I see 1 and I'm going to do: predict-no
  3839. ENV: Agent did: predict-no for direction U in state State-B
  3840. In State-B moving U
  3841. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3842. predict error 0
  3843. dir: dir isR
  3844. -/543: O: O1086 (predict-no)
  3845. I see 1 and I'm going to do: predict-no
  3846. ENV: Agent did: predict-no for direction R in state State-B
  3847. In State-B moving R
  3848. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3849. predict error 0
  3850. dir: dir isR
  3851. |\-544: O: O1088 (predict-no)
  3852. I see 1 and I'm going to do: predict-no
  3853. ENV: Agent did: predict-no for direction R in state State-B
  3854. In State-B moving R
  3855. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3856. predict error 0
  3857. dir: dir isR
  3858. /|\545: O: O1090 (predict-no)
  3859. I see 1 and I'm going to do: predict-no
  3860. ENV: Agent did: predict-no for direction R in state State-B
  3861. In State-B moving R
  3862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3863. predict error 0
  3864. dir: dir isR
  3865. -/546: O: O1092 (predict-no)
  3866. I see 1 and I'm going to do: predict-no
  3867. ENV: Agent did: predict-no for direction R in state State-B
  3868. In State-B moving R
  3869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3870. predict error 0
  3871. dir: dir isR
  3872. |\-547: O: O1094 (predict-no)
  3873. I see 1 and I'm going to do: predict-no
  3874. ENV: Agent did: predict-no for direction R in state State-B
  3875. In State-B moving R
  3876. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3877. predict error 0
  3878. dir: dir isR
  3879. /|\548: O: O1096 (predict-no)
  3880. I see 1 and I'm going to do: predict-no
  3881. ENV: Agent did: predict-no for direction R in state State-B
  3882. In State-B moving R
  3883. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3884. predict error 0
  3885. dir: dir isU
  3886. -/|549: O: O1098 (predict-no)
  3887. I see 1 and I'm going to do: predict-no
  3888. ENV: Agent did: predict-no for direction U in state State-B
  3889. In State-B moving U
  3890. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3891. predict error 0
  3892. dir: dir isU
  3893. \-550: O: O1100 (predict-no)
  3894. I see 1 and I'm going to do: predict-no
  3895. ENV: Agent did: predict-no for direction U in state State-B
  3896. In State-B moving U
  3897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3898. predict error 0
  3899. dir: dir isU
  3900. /|\551: O: O1102 (predict-no)
  3901. I see 1 and I'm going to do: predict-no
  3902. ENV: Agent did: predict-no for direction U in state State-B
  3903. In State-B moving U
  3904. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3905. predict error 0
  3906. dir: dir isU
  3907. -552: O: O1104 (predict-no)
  3908. I see 1 and I'm going to do: predict-no
  3909. ENV: Agent did: predict-no for direction U in state State-B
  3910. In State-B moving U
  3911. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3912. predict error 0
  3913. dir: dir isU
  3914. /|\553: O: O1106 (predict-no)
  3915. I see 1 and I'm going to do: predict-no
  3916. ENV: Agent did: predict-no for direction U in state State-B
  3917. In State-B moving U
  3918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3919. predict error 0
  3920. dir: dir isR
  3921. -/|554: O: O1108 (predict-no)
  3922. I see 1 and I'm going to do: predict-no
  3923. ENV: Agent did: predict-no for direction R in state State-B
  3924. In State-B moving R
  3925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3926. predict error 0
  3927. dir: dir isR
  3928. \-/555: O: O1110 (predict-no)
  3929. I see 1 and I'm going to do: predict-no
  3930. ENV: Agent did: predict-no for direction R in state State-B
  3931. In State-B moving R
  3932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3933. predict error 0
  3934. dir: dir isL
  3935. |\-556: O: O1111 (predict-yes)
  3936. I see 1 and I'm going to do: predict-yes
  3937. ENV: Agent did: predict-yes for direction L in state State-B
  3938. In State-B moving L
  3939. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3940. predict error 0
  3941. dir: dir isU
  3942. /|557: O: O1114 (predict-no)
  3943. I see 1 and I'm going to do: predict-no
  3944. ENV: Agent did: predict-no for direction U in state State-A
  3945. In State-A moving U
  3946. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3947. predict error 0
  3948. dir: dir isU
  3949. \-/558: O: O1116 (predict-no)
  3950. I see 1 and I'm going to do: predict-no
  3951. ENV: Agent did: predict-no for direction U in state State-A
  3952. In State-A moving U
  3953. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3954. predict error 0
  3955. dir: dir isR
  3956. |\-559: O: O1117 (predict-yes)
  3957. I see 1 and I'm going to do: predict-yes
  3958. ENV: Agent did: predict-yes for direction R in state State-A
  3959. In State-A moving R
  3960. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3961. predict error 0
  3962. dir: dir isL
  3963. /|\-560: O: O1119 (predict-yes)
  3964. I see 1 and I'm going to do: predict-yes
  3965. ENV: Agent did: predict-yes for direction L in state State-B
  3966. In State-B moving L
  3967. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3968. predict error 0
  3969. dir: dir isU
  3970. /|\561: O: O1122 (predict-no)
  3971. I see 1 and I'm going to do: predict-no
  3972. ENV: Agent did: predict-no for direction U in state State-A
  3973. In State-A moving U
  3974. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3975. predict error 0
  3976. dir: dir isR
  3977. -562: O: O1123 (predict-yes)
  3978. I see 1 and I'm going to do: predict-yes
  3979. ENV: Agent did: predict-yes for direction R in state State-A
  3980. In State-A moving R
  3981. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3982. predict error 0
  3983. dir: dir isR
  3984. /|563: O: O1126 (predict-no)
  3985. I see 1 and I'm going to do: predict-no
  3986. ENV: Agent did: predict-no for direction R in state State-B
  3987. In State-B moving R
  3988. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3989. predict error 0
  3990. dir: dir isL
  3991. \-/564: O: O1127 (predict-yes)
  3992. I see 1 and I'm going to do: predict-yes
  3993. ENV: Agent did: predict-yes for direction L in state State-B
  3994. In State-B moving L
  3995. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3996. predict error 0
  3997. dir: dir isR
  3998. |\-565: O: O1129 (predict-yes)
  3999. I see 1 and I'm going to do: predict-yes
  4000. ENV: Agent did: predict-yes for direction R in state State-A
  4001. In State-A moving R
  4002. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4003. predict error 0
  4004. dir: dir isU
  4005. /|566: O: O1132 (predict-no)
  4006. I see 1 and I'm going to do: predict-no
  4007. ENV: Agent did: predict-no for direction U in state State-B
  4008. In State-B moving U
  4009. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4010. predict error 0
  4011. dir: dir isR
  4012. \-/567: O: O1134 (predict-no)
  4013. I see 1 and I'm going to do: predict-no
  4014. ENV: Agent did: predict-no for direction R in state State-B
  4015. In State-B moving R
  4016. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4017. predict error 0
  4018. dir: dir isR
  4019. |\-568: O: O1136 (predict-no)
  4020. I see 1 and I'm going to do: predict-no
  4021. ENV: Agent did: predict-no for direction R in state State-B
  4022. In State-B moving R
  4023. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4024. predict error 0
  4025. dir: dir isR
  4026. /|\569: O: O1138 (predict-no)
  4027. I see 1 and I'm going to do: predict-no
  4028. ENV: Agent did: predict-no for direction R in state State-B
  4029. In State-B moving R
  4030. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4031. predict error 0
  4032. dir: dir isL
  4033. -/570: O: O1139 (predict-yes)
  4034. I see 1 and I'm going to do: predict-yes
  4035. ENV: Agent did: predict-yes for direction L in state State-B
  4036. In State-B moving L
  4037. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4038. predict error 0
  4039. dir: dir isR
  4040. |\-571: O: O1141 (predict-yes)
  4041. I see 1 and I'm going to do: predict-yes
  4042. ENV: Agent did: predict-yes for direction R in state State-A
  4043. In State-A moving R
  4044. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4045. predict error 0
  4046. dir: dir isU
  4047. /572: O: O1144 (predict-no)
  4048. I see 1 and I'm going to do: predict-no
  4049. ENV: Agent did: predict-no for direction U in state State-B
  4050. In State-B moving U
  4051. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4052. predict error 0
  4053. dir: dir isU
  4054. |\573: O: O1146 (predict-no)
  4055. I see 1 and I'm going to do: predict-no
  4056. ENV: Agent did: predict-no for direction U in state State-B
  4057. In State-B moving U
  4058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4059. predict error 0
  4060. dir: dir isR
  4061. -/|574: O: O1148 (predict-no)
  4062. I see 1 and I'm going to do: predict-no
  4063. ENV: Agent did: predict-no for direction R in state State-B
  4064. In State-B moving R
  4065. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4066. predict error 0
  4067. dir: dir isU
  4068. \-575: O: O1150 (predict-no)
  4069. I see 1 and I'm going to do: predict-no
  4070. ENV: Agent did: predict-no for direction U in state State-B
  4071. In State-B moving U
  4072. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4073. predict error 0
  4074. dir: dir isR
  4075. /|\576: O: O1152 (predict-no)
  4076. I see 1 and I'm going to do: predict-no
  4077. ENV: Agent did: predict-no for direction R in state State-B
  4078. In State-B moving R
  4079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4080. predict error 0
  4081. dir: dir isL
  4082. -/|577: O: O1153 (predict-yes)
  4083. I see 1 and I'm going to do: predict-yes
  4084. ENV: Agent did: predict-yes for direction L in state State-B
  4085. In State-B moving L
  4086. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4087. predict error 0
  4088. dir: dir isL
  4089. \-/578: O: O1156 (predict-no)
  4090. I see 1 and I'm going to do: predict-no
  4091. ENV: Agent did: predict-no for direction L in state State-A
  4092. In State-A moving L
  4093. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4094. predict error 0
  4095. dir: dir isU
  4096. |\-579: O: O1158 (predict-no)
  4097. I see 1 and I'm going to do: predict-no
  4098. ENV: Agent did: predict-no for direction U in state State-A
  4099. In State-A moving U
  4100. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4101. predict error 0
  4102. dir: dir isL
  4103. /|\580: O: O1160 (predict-no)
  4104. I see 1 and I'm going to do: predict-no
  4105. ENV: Agent did: predict-no for direction L in state State-A
  4106. In State-A moving L
  4107. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4108. predict error 0
  4109. dir: dir isL
  4110. -581: O: O1162 (predict-no)
  4111. I see 1 and I'm going to do: predict-no
  4112. ENV: Agent did: predict-no for direction L in state State-A
  4113. In State-A moving L
  4114. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4115. predict error 0
  4116. dir: dir isU
  4117. /582: O: O1164 (predict-no)
  4118. I see 1 and I'm going to do: predict-no
  4119. ENV: Agent did: predict-no for direction U in state State-A
  4120. In State-A moving U
  4121. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4122. predict error 0
  4123. dir: dir isR
  4124. |\-583: O: O1165 (predict-yes)
  4125. I see 1 and I'm going to do: predict-yes
  4126. ENV: Agent did: predict-yes for direction R in state State-A
  4127. In State-A moving R
  4128. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4129. predict error 0
  4130. dir: dir isR
  4131. /|584: O: O1168 (predict-no)
  4132. I see 1 and I'm going to do: predict-no
  4133. ENV: Agent did: predict-no for direction R in state State-B
  4134. In State-B moving R
  4135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4136. predict error 0
  4137. dir: dir isR
  4138. \-/585: O: O1170 (predict-no)
  4139. I see 1 and I'm going to do: predict-no
  4140. ENV: Agent did: predict-no for direction R in state State-B
  4141. In State-B moving R
  4142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4143. predict error 0
  4144. dir: dir isU
  4145. |\-586: O: O1172 (predict-no)
  4146. I see 1 and I'm going to do: predict-no
  4147. ENV: Agent did: predict-no for direction U in state State-B
  4148. In State-B moving U
  4149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4150. predict error 0
  4151. dir: dir isL
  4152. /|587: O: O1173 (predict-yes)
  4153. I see 1 and I'm going to do: predict-yes
  4154. ENV: Agent did: predict-yes for direction L in state State-B
  4155. In State-B moving L
  4156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4157. predict error 0
  4158. dir: dir isR
  4159. \-588: O: O1175 (predict-yes)
  4160. I see 1 and I'm going to do: predict-yes
  4161. ENV: Agent did: predict-yes for direction R in state State-A
  4162. In State-A moving R
  4163. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4164. predict error 0
  4165. dir: dir isU
  4166. /|\589: O: O1178 (predict-no)
  4167. I see 1 and I'm going to do: predict-no
  4168. ENV: Agent did: predict-no for direction U in state State-B
  4169. In State-B moving U
  4170. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4171. predict error 0
  4172. dir: dir isU
  4173. -/|590: O: O1180 (predict-no)
  4174. I see 1 and I'm going to do: predict-no
  4175. ENV: Agent did: predict-no for direction U in state State-B
  4176. In State-B moving U
  4177. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4178. predict error 0
  4179. dir: dir isL
  4180. \-591: O: O1181 (predict-yes)
  4181. I see 1 and I'm going to do: predict-yes
  4182. ENV: Agent did: predict-yes for direction L in state State-B
  4183. In State-B moving L
  4184. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4185. predict error 0
  4186. dir: dir isR
  4187. /592: O: O1183 (predict-yes)
  4188. I see 1 and I'm going to do: predict-yes
  4189. ENV: Agent did: predict-yes for direction R in state State-A
  4190. In State-A moving R
  4191. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4192. predict error 0
  4193. dir: dir isL
  4194. |\593: O: O1185 (predict-yes)
  4195. I see 1 and I'm going to do: predict-yes
  4196. ENV: Agent did: predict-yes for direction L in state State-B
  4197. In State-B moving L
  4198. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4199. predict error 0
  4200. dir: dir isR
  4201. -/|594: O: O1187 (predict-yes)
  4202. I see 1 and I'm going to do: predict-yes
  4203. ENV: Agent did: predict-yes for direction R in state State-A
  4204. In State-A moving R
  4205. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4206. predict error 0
  4207. dir: dir isL
  4208. \-/595: O: O1189 (predict-yes)
  4209. I see 1 and I'm going to do: predict-yes
  4210. ENV: Agent did: predict-yes for direction L in state State-B
  4211. In State-B moving L
  4212. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4213. predict error 0
  4214. dir: dir isU
  4215. |\-596: O: O1192 (predict-no)
  4216. I see 1 and I'm going to do: predict-no
  4217. ENV: Agent did: predict-no for direction U in state State-A
  4218. In State-A moving U
  4219. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4220. predict error 0
  4221. dir: dir isU
  4222. /|\597: O: O1194 (predict-no)
  4223. I see 1 and I'm going to do: predict-no
  4224. ENV: Agent did: predict-no for direction U in state State-A
  4225. In State-A moving U
  4226. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4227. predict error 0
  4228. dir: dir isL
  4229. -/598: O: O1196 (predict-no)
  4230. I see 1 and I'm going to do: predict-no
  4231. ENV: Agent did: predict-no for direction L in state State-A
  4232. In State-A moving L
  4233. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4234. predict error 0
  4235. dir: dir isL
  4236. |\-599: O: O1198 (predict-no)
  4237. I see 1 and I'm going to do: predict-no
  4238. ENV: Agent did: predict-no for direction L in state State-A
  4239. In State-A moving L
  4240. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4241. predict error 0
  4242. dir: dir isU
  4243. /|\-600: O: O1200 (predict-no)
  4244. I see 1 and I'm going to do: predict-no
  4245. ENV: Agent did: predict-no for direction U in state State-A
  4246. In State-A moving U
  4247. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4248. predict error 0
  4249. dir: dir isU
  4250. /|\601: O: O1202 (predict-no)
  4251. I see 1 and I'm going to do: predict-no
  4252. ENV: Agent did: predict-no for direction U in state State-A
  4253. In State-A moving U
  4254. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4255. predict error 0
  4256. dir: dir isU
  4257. -602: O: O1204 (predict-no)
  4258. I see 1 and I'm going to do: predict-no
  4259. ENV: Agent did: predict-no for direction U in state State-A
  4260. In State-A moving U
  4261. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4262. predict error 0
  4263. dir: dir isL
  4264. /|\603: O: O1206 (predict-no)
  4265. I see 1 and I'm going to do: predict-no
  4266. ENV: Agent did: predict-no for direction L in state State-A
  4267. In State-A moving L
  4268. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4269. predict error 0
  4270. dir: dir isU
  4271. -/604: O: O1208 (predict-no)
  4272. I see 1 and I'm going to do: predict-no
  4273. ENV: Agent did: predict-no for direction U in state State-A
  4274. In State-A moving U
  4275. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4276. predict error 0
  4277. dir: dir isR
  4278. |605: O: O1209 (predict-yes)
  4279. I see 1 and I'm going to do: predict-yes
  4280. ENV: Agent did: predict-yes for direction R in state State-A
  4281. In State-A moving R
  4282. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4283. predict error 0
  4284. dir: dir isL
  4285. \-/606: O: O1211 (predict-yes)
  4286. I see 1 and I'm going to do: predict-yes
  4287. ENV: Agent did: predict-yes for direction L in state State-B
  4288. In State-B moving L
  4289. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4290. predict error 0
  4291. dir: dir isR
  4292. |\-607: O: O1213 (predict-yes)
  4293. I see 1 and I'm going to do: predict-yes
  4294. ENV: Agent did: predict-yes for direction R in state State-A
  4295. In State-A moving R
  4296. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4297. predict error 0
  4298. dir: dir isU
  4299. /|\608: O: O1216 (predict-no)
  4300. I see 1 and I'm going to do: predict-no
  4301. ENV: Agent did: predict-no for direction U in state State-B
  4302. In State-B moving U
  4303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4304. predict error 0
  4305. dir: dir isU
  4306. -/|609: O: O1218 (predict-no)
  4307. I see 1 and I'm going to do: predict-no
  4308. ENV: Agent did: predict-no for direction U in state State-B
  4309. In State-B moving U
  4310. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4311. predict error 0
  4312. dir: dir isL
  4313. \-/|610: O: O1219 (predict-yes)
  4314. I see 1 and I'm going to do: predict-yes
  4315. ENV: Agent did: predict-yes for direction L in state State-B
  4316. In State-B moving L
  4317. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4318. predict error 0
  4319. dir: dir isR
  4320. \-611: O: O1221 (predict-yes)
  4321. I see 1 and I'm going to do: predict-yes
  4322. ENV: Agent did: predict-yes for direction R in state State-A
  4323. In State-A moving R
  4324. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4325. predict error 0
  4326. dir: dir isL
  4327. /612: O: O1223 (predict-yes)
  4328. I see 1 and I'm going to do: predict-yes
  4329. ENV: Agent did: predict-yes for direction L in state State-B
  4330. In State-B moving L
  4331. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4332. predict error 0
  4333. dir: dir isU
  4334. |\613: O: O1226 (predict-no)
  4335. I see 1 and I'm going to do: predict-no
  4336. ENV: Agent did: predict-no for direction U in state State-A
  4337. In State-A moving U
  4338. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4339. predict error 0
  4340. dir: dir isR
  4341. -/614: O: O1227 (predict-yes)
  4342. I see 1 and I'm going to do: predict-yes
  4343. ENV: Agent did: predict-yes for direction R in state State-A
  4344. In State-A moving R
  4345. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4346. predict error 0
  4347. dir: dir isU
  4348. |\-615: O: O1230 (predict-no)
  4349. I see 1 and I'm going to do: predict-no
  4350. ENV: Agent did: predict-no for direction U in state State-B
  4351. In State-B moving U
  4352. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4353. predict error 0
  4354. dir: dir isU
  4355. /|\616: O: O1232 (predict-no)
  4356. I see 1 and I'm going to do: predict-no
  4357. ENV: Agent did: predict-no for direction U in state State-B
  4358. In State-B moving U
  4359. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4360. predict error 0
  4361. dir: dir isR
  4362. -/|617: O: O1234 (predict-no)
  4363. I see 1 and I'm going to do: predict-no
  4364. ENV: Agent did: predict-no for direction R in state State-B
  4365. In State-B moving R
  4366. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4367. predict error 0
  4368. dir: dir isL
  4369. \-/618: O: O1235 (predict-yes)
  4370. I see 1 and I'm going to do: predict-yes
  4371. ENV: Agent did: predict-yes for direction L in state State-B
  4372. In State-B moving L
  4373. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4374. predict error 0
  4375. dir: dir isR
  4376. |\619: O: O1237 (predict-yes)
  4377. I see 1 and I'm going to do: predict-yes
  4378. ENV: Agent did: predict-yes for direction R in state State-A
  4379. In State-A moving R
  4380. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4381. predict error 0
  4382. dir: dir isL
  4383. -/|620: O: O1239 (predict-yes)
  4384. I see 1 and I'm going to do: predict-yes
  4385. ENV: Agent did: predict-yes for direction L in state State-B
  4386. In State-B moving L
  4387. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4388. predict error 0
  4389. dir: dir isL
  4390. \-/|621: O: O1242 (predict-no)
  4391. I see 1 and I'm going to do: predict-no
  4392. ENV: Agent did: predict-no for direction L in state State-A
  4393. In State-A moving L
  4394. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4395. predict error 0
  4396. dir: dir isU
  4397. \622: O: O1244 (predict-no)
  4398. I see 1 and I'm going to do: predict-no
  4399. ENV: Agent did: predict-no for direction U in state State-A
  4400. In State-A moving U
  4401. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4402. predict error 0
  4403. dir: dir isR
  4404. -/623: O: O1245 (predict-yes)
  4405. I see 1 and I'm going to do: predict-yes
  4406. ENV: Agent did: predict-yes for direction R in state State-A
  4407. In State-A moving R
  4408. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4409. predict error 0
  4410. dir: dir isU
  4411. |\624: O: O1248 (predict-no)
  4412. I see 1 and I'm going to do: predict-no
  4413. ENV: Agent did: predict-no for direction U in state State-B
  4414. In State-B moving U
  4415. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4416. predict error 0
  4417. dir: dir isL
  4418. -/|625: O: O1249 (predict-yes)
  4419. I see 1 and I'm going to do: predict-yes
  4420. ENV: Agent did: predict-yes for direction L in state State-B
  4421. In State-B moving L
  4422. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4423. predict error 0
  4424. dir: dir isU
  4425. \-/626: O: O1252 (predict-no)
  4426. I see 1 and I'm going to do: predict-no
  4427. ENV: Agent did: predict-no for direction U in state State-A
  4428. In State-A moving U
  4429. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4430. predict error 0
  4431. dir: dir isU
  4432. |\627: O: O1254 (predict-no)
  4433. I see 1 and I'm going to do: predict-no
  4434. ENV: Agent did: predict-no for direction U in state State-A
  4435. In State-A moving U
  4436. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4437. predict error 0
  4438. dir: dir isL
  4439. -/628: O: O1256 (predict-no)
  4440. I see 1 and I'm going to do: predict-no
  4441. ENV: Agent did: predict-no for direction L in state State-A
  4442. In State-A moving L
  4443. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4444. predict error 0
  4445. dir: dir isL
  4446. |629: O: O1258 (predict-no)
  4447. I see 1 and I'm going to do: predict-no
  4448. ENV: Agent did: predict-no for direction L in state State-A
  4449. In State-A moving L
  4450. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4451. predict error 0
  4452. dir: dir isR
  4453. \-/630: O: O1259 (predict-yes)
  4454. I see 1 and I'm going to do: predict-yes
  4455. ENV: Agent did: predict-yes for direction R in state State-A
  4456. In State-A moving R
  4457. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4458. predict error 0
  4459. dir: dir isR
  4460. |\-631: O: O1262 (predict-no)
  4461. I see 1 and I'm going to do: predict-no
  4462. ENV: Agent did: predict-no for direction R in state State-B
  4463. In State-B moving R
  4464. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4465. predict error 0
  4466. dir: dir isL
  4467. /632: O: O1263 (predict-yes)
  4468. I see 1 and I'm going to do: predict-yes
  4469. ENV: Agent did: predict-yes for direction L in state State-B
  4470. In State-B moving L
  4471. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4472. predict error 0
  4473. dir: dir isL
  4474. |\-633: O: O1266 (predict-no)
  4475. I see 1 and I'm going to do: predict-no
  4476. ENV: Agent did: predict-no for direction L in state State-A
  4477. In State-A moving L
  4478. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4479. predict error 0
  4480. dir: dir isL
  4481. /|634: O: O1268 (predict-no)
  4482. I see 1 and I'm going to do: predict-no
  4483. ENV: Agent did: predict-no for direction L in state State-A
  4484. In State-A moving L
  4485. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4486. predict error 0
  4487. dir: dir isR
  4488. \-635: O: O1269 (predict-yes)
  4489. I see 1 and I'm going to do: predict-yes
  4490. ENV: Agent did: predict-yes for direction R in state State-A
  4491. In State-A moving R
  4492. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4493. predict error 0
  4494. dir: dir isU
  4495. /636: O: O1272 (predict-no)
  4496. I see 1 and I'm going to do: predict-no
  4497. ENV: Agent did: predict-no for direction U in state State-B
  4498. In State-B moving U
  4499. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4500. predict error 0
  4501. dir: dir isL
  4502. |\637: O: O1273 (predict-yes)
  4503. I see 1 and I'm going to do: predict-yes
  4504. ENV: Agent did: predict-yes for direction L in state State-B
  4505. In State-B moving L
  4506. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4507. predict error 0
  4508. dir: dir isL
  4509. -/|638: O: O1276 (predict-no)
  4510. I see 1 and I'm going to do: predict-no
  4511. ENV: Agent did: predict-no for direction L in state State-A
  4512. In State-A moving L
  4513. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4514. predict error 0
  4515. dir: dir isU
  4516. \-639: O: O1278 (predict-no)
  4517. I see 1 and I'm going to do: predict-no
  4518. ENV: Agent did: predict-no for direction U in state State-A
  4519. In State-A moving U
  4520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4521. predict error 0
  4522. dir: dir isU
  4523. /|\640: O: O1280 (predict-no)
  4524. I see 1 and I'm going to do: predict-no
  4525. ENV: Agent did: predict-no for direction U in state State-A
  4526. In State-A moving U
  4527. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4528. predict error 0
  4529. dir: dir isU
  4530. -/|641: O: O1282 (predict-no)
  4531. I see 1 and I'm going to do: predict-no
  4532. ENV: Agent did: predict-no for direction U in state State-A
  4533. In State-A moving U
  4534. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4535. predict error 0
  4536. dir: dir isR
  4537. \642: O: O1283 (predict-yes)
  4538. I see 1 and I'm going to do: predict-yes
  4539. ENV: Agent did: predict-yes for direction R in state State-A
  4540. In State-A moving R
  4541. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4542. predict error 0
  4543. dir: dir isR
  4544. -/|643: O: O1286 (predict-no)
  4545. I see 1 and I'm going to do: predict-no
  4546. ENV: Agent did: predict-no for direction R in state State-B
  4547. In State-B moving R
  4548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4549. predict error 0
  4550. dir: dir isU
  4551. \-644: O: O1288 (predict-no)
  4552. I see 1 and I'm going to do: predict-no
  4553. ENV: Agent did: predict-no for direction U in state State-B
  4554. In State-B moving U
  4555. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4556. predict error 0
  4557. dir: dir isL
  4558. /|\645: O: O1289 (predict-yes)
  4559. I see 1 and I'm going to do: predict-yes
  4560. ENV: Agent did: predict-yes for direction L in state State-B
  4561. In State-B moving L
  4562. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4563. predict error 0
  4564. dir: dir isU
  4565. -/|646: O: O1292 (predict-no)
  4566. I see 1 and I'm going to do: predict-no
  4567. ENV: Agent did: predict-no for direction U in state State-A
  4568. In State-A moving U
  4569. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4570. predict error 0
  4571. dir: dir isL
  4572. \647: O: O1294 (predict-no)
  4573. I see 1 and I'm going to do: predict-no
  4574. ENV: Agent did: predict-no for direction L in state State-A
  4575. In State-A moving L
  4576. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4577. predict error 0
  4578. dir: dir isR
  4579. -/|648: O: O1295 (predict-yes)
  4580. I see 1 and I'm going to do: predict-yes
  4581. ENV: Agent did: predict-yes for direction R in state State-A
  4582. In State-A moving R
  4583. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4584. predict error 0
  4585. dir: dir isR
  4586. \-649: O: O1298 (predict-no)
  4587. I see 1 and I'm going to do: predict-no
  4588. ENV: Agent did: predict-no for direction R in state State-B
  4589. In State-B moving R
  4590. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4591. predict error 0
  4592. dir: dir isR
  4593. /|\-650: O: O1300 (predict-no)
  4594. I see 1 and I'm going to do: predict-no
  4595. ENV: Agent did: predict-no for direction R in state State-B
  4596. In State-B moving R
  4597. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4598. predict error 0
  4599. dir: dir isL
  4600. /|\-651: O: O1301 (predict-yes)
  4601. I see 1 and I'm going to do: predict-yes
  4602. ENV: Agent did: predict-yes for direction L in state State-B
  4603. In State-B moving L
  4604. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4605. predict error 0
  4606. dir: dir isL
  4607. /652: O: O1304 (predict-no)
  4608. I see 1 and I'm going to do: predict-no
  4609. ENV: Agent did: predict-no for direction L in state State-A
  4610. In State-A moving L
  4611. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4612. predict error 0
  4613. dir: dir isU
  4614. |\653: O: O1306 (predict-no)
  4615. I see 1 and I'm going to do: predict-no
  4616. ENV: Agent did: predict-no for direction U in state State-A
  4617. In State-A moving U
  4618. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4619. predict error 0
  4620. dir: dir isR
  4621. -/|654: O: O1307 (predict-yes)
  4622. I see 1 and I'm going to do: predict-yes
  4623. ENV: Agent did: predict-yes for direction R in state State-A
  4624. In State-A moving R
  4625. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4626. predict error 0
  4627. dir: dir isR
  4628. \-655: O: O1310 (predict-no)
  4629. I see 1 and I'm going to do: predict-no
  4630. ENV: Agent did: predict-no for direction R in state State-B
  4631. In State-B moving R
  4632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4633. predict error 0
  4634. dir: dir isL
  4635. /|\656: O: O1311 (predict-yes)
  4636. I see 1 and I'm going to do: predict-yes
  4637. ENV: Agent did: predict-yes for direction L in state State-B
  4638. In State-B moving L
  4639. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4640. predict error 0
  4641. dir: dir isU
  4642. -/|657: O: O1314 (predict-no)
  4643. I see 1 and I'm going to do: predict-no
  4644. ENV: Agent did: predict-no for direction U in state State-A
  4645. In State-A moving U
  4646. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4647. predict error 0
  4648. dir: dir isL
  4649. \-/|sleeping...
  4650. \658: O: O1316 (predict-no)
  4651. I see 1 and I'm going to do: predict-no
  4652. ENV: Agent did: predict-no for direction L in state State-A
  4653. In State-A moving L
  4654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4655. predict error 0
  4656. dir: dir isR
  4657. -/|659: O: O1317 (predict-yes)
  4658. I see 1 and I'm going to do: predict-yes
  4659. ENV: Agent did: predict-yes for direction R in state State-A
  4660. In State-A moving R
  4661. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4662. predict error 0
  4663. dir: dir isU
  4664. \-/660: O: O1320 (predict-no)
  4665. I see 1 and I'm going to do: predict-no
  4666. ENV: Agent did: predict-no for direction U in state State-B
  4667. In State-B moving U
  4668. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4669. predict error 0
  4670. dir: dir isU
  4671. |\-661: O: O1322 (predict-no)
  4672. I see 1 and I'm going to do: predict-no
  4673. ENV: Agent did: predict-no for direction U in state State-B
  4674. In State-B moving U
  4675. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4676. predict error 0
  4677. dir: dir isL
  4678. /662: O: O1323 (predict-yes)
  4679. I see 1 and I'm going to do: predict-yes
  4680. ENV: Agent did: predict-yes for direction L in state State-B
  4681. In State-B moving L
  4682. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4683. predict error 0
  4684. dir: dir isU
  4685. |\-663: O: O1326 (predict-no)
  4686. I see 1 and I'm going to do: predict-no
  4687. ENV: Agent did: predict-no for direction U in state State-A
  4688. In State-A moving U
  4689. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4690. predict error 0
  4691. dir: dir isU
  4692. /|664: O: O1328 (predict-no)
  4693. I see 1 and I'm going to do: predict-no
  4694. ENV: Agent did: predict-no for direction U in state State-A
  4695. In State-A moving U
  4696. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4697. predict error 0
  4698. dir: dir isL
  4699. \-/665: O: O1330 (predict-no)
  4700. I see 1 and I'm going to do: predict-no
  4701. ENV: Agent did: predict-no for direction L in state State-A
  4702. In State-A moving L
  4703. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4704. predict error 0
  4705. dir: dir isR
  4706. |\-666: O: O1331 (predict-yes)
  4707. I see 1 and I'm going to do: predict-yes
  4708. ENV: Agent did: predict-yes for direction R in state State-A
  4709. In State-A moving R
  4710. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4711. predict error 0
  4712. dir: dir isR
  4713. /|\667: O: O1334 (predict-no)
  4714. I see 1 and I'm going to do: predict-no
  4715. ENV: Agent did: predict-no for direction R in state State-B
  4716. In State-B moving R
  4717. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4718. predict error 0
  4719. dir: dir isU
  4720. -/|668: O: O1336 (predict-no)
  4721. I see 1 and I'm going to do: predict-no
  4722. ENV: Agent did: predict-no for direction U in state State-B
  4723. In State-B moving U
  4724. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4725. predict error 0
  4726. dir: dir isR
  4727. \669: O: O1338 (predict-no)
  4728. I see 1 and I'm going to do: predict-no
  4729. ENV: Agent did: predict-no for direction R in state State-B
  4730. In State-B moving R
  4731. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4732. predict error 0
  4733. dir: dir isU
  4734. -/670: O: O1340 (predict-no)
  4735. I see 1 and I'm going to do: predict-no
  4736. ENV: Agent did: predict-no for direction U in state State-B
  4737. In State-B moving U
  4738. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4739. predict error 0
  4740. dir: dir isU
  4741. |\-671: O: O1342 (predict-no)
  4742. I see 1 and I'm going to do: predict-no
  4743. ENV: Agent did: predict-no for direction U in state State-B
  4744. In State-B moving U
  4745. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4746. predict error 0
  4747. dir: dir isL
  4748. /672: O: O1343 (predict-yes)
  4749. I see 1 and I'm going to do: predict-yes
  4750. ENV: Agent did: predict-yes for direction L in state State-B
  4751. In State-B moving L
  4752. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4753. predict error 0
  4754. dir: dir isU
  4755. |\673: O: O1346 (predict-no)
  4756. I see 1 and I'm going to do: predict-no
  4757. ENV: Agent did: predict-no for direction U in state State-A
  4758. In State-A moving U
  4759. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4760. predict error 0
  4761. dir: dir isL
  4762. -/|674: O: O1348 (predict-no)
  4763. I see 1 and I'm going to do: predict-no
  4764. ENV: Agent did: predict-no for direction L in state State-A
  4765. In State-A moving L
  4766. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4767. predict error 0
  4768. dir: dir isL
  4769. \-/675: O: O1350 (predict-no)
  4770. I see 1 and I'm going to do: predict-no
  4771. ENV: Agent did: predict-no for direction L in state State-A
  4772. In State-A moving L
  4773. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4774. predict error 0
  4775. dir: dir isR
  4776. |\-676: O: O1351 (predict-yes)
  4777. I see 1 and I'm going to do: predict-yes
  4778. ENV: Agent did: predict-yes for direction R in state State-A
  4779. In State-A moving R
  4780. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4781. predict error 0
  4782. dir: dir isL
  4783. /|\677: O: O1353 (predict-yes)
  4784. I see 1 and I'm going to do: predict-yes
  4785. ENV: Agent did: predict-yes for direction L in state State-B
  4786. In State-B moving L
  4787. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4788. predict error 0
  4789. dir: dir isR
  4790. -/678: O: O1355 (predict-yes)
  4791. I see 1 and I'm going to do: predict-yes
  4792. ENV: Agent did: predict-yes for direction R in state State-A
  4793. In State-A moving R
  4794. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4795. predict error 0
  4796. dir: dir isL
  4797. |\-679: O: O1357 (predict-yes)
  4798. I see 1 and I'm going to do: predict-yes
  4799. ENV: Agent did: predict-yes for direction L in state State-B
  4800. In State-B moving L
  4801. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4802. predict error 0
  4803. dir: dir isR
  4804. /|\680: O: O1359 (predict-yes)
  4805. I see 1 and I'm going to do: predict-yes
  4806. ENV: Agent did: predict-yes for direction R in state State-A
  4807. In State-A moving R
  4808. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4809. predict error 0
  4810. dir: dir isU
  4811. -/|681: O: O1362 (predict-no)
  4812. I see 1 and I'm going to do: predict-no
  4813. ENV: Agent did: predict-no for direction U in state State-B
  4814. In State-B moving U
  4815. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4816. predict error 0
  4817. dir: dir isU
  4818. \682: O: O1364 (predict-no)
  4819. I see 1 and I'm going to do: predict-no
  4820. ENV: Agent did: predict-no for direction U in state State-B
  4821. In State-B moving U
  4822. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4823. predict error 0
  4824. dir: dir isL
  4825. -/|683: O: O1365 (predict-yes)
  4826. I see 1 and I'm going to do: predict-yes
  4827. ENV: Agent did: predict-yes for direction L in state State-B
  4828. In State-B moving L
  4829. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4830. predict error 0
  4831. dir: dir isL
  4832. \-684: O: O1368 (predict-no)
  4833. I see 1 and I'm going to do: predict-no
  4834. ENV: Agent did: predict-no for direction L in state State-A
  4835. In State-A moving L
  4836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4837. predict error 0
  4838. dir: dir isU
  4839. /|\685: O: O1370 (predict-no)
  4840. I see 1 and I'm going to do: predict-no
  4841. ENV: Agent did: predict-no for direction U in state State-A
  4842. In State-A moving U
  4843. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4844. predict error 0
  4845. dir: dir isL
  4846. -/|686: O: O1372 (predict-no)
  4847. I see 1 and I'm going to do: predict-no
  4848. ENV: Agent did: predict-no for direction L in state State-A
  4849. In State-A moving L
  4850. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4851. predict error 0
  4852. dir: dir isL
  4853. \-687: O: O1374 (predict-no)
  4854. I see 1 and I'm going to do: predict-no
  4855. ENV: Agent did: predict-no for direction L in state State-A
  4856. In State-A moving L
  4857. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4858. predict error 0
  4859. dir: dir isL
  4860. /|\688: O: O1376 (predict-no)
  4861. I see 1 and I'm going to do: predict-no
  4862. ENV: Agent did: predict-no for direction L in state State-A
  4863. In State-A moving L
  4864. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4865. predict error 0
  4866. dir: dir isL
  4867. -/689: O: O1378 (predict-no)
  4868. I see 1 and I'm going to do: predict-no
  4869. ENV: Agent did: predict-no for direction L in state State-A
  4870. In State-A moving L
  4871. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4872. predict error 0
  4873. dir: dir isL
  4874. |\-690: O: O1380 (predict-no)
  4875. I see 1 and I'm going to do: predict-no
  4876. ENV: Agent did: predict-no for direction L in state State-A
  4877. In State-A moving L
  4878. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4879. predict error 0
  4880. dir: dir isR
  4881. /|\691: O: O1381 (predict-yes)
  4882. I see 1 and I'm going to do: predict-yes
  4883. ENV: Agent did: predict-yes for direction R in state State-A
  4884. In State-A moving R
  4885. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4886. predict error 0
  4887. dir: dir isU
  4888. -692: O: O1384 (predict-no)
  4889. I see 1 and I'm going to do: predict-no
  4890. ENV: Agent did: predict-no for direction U in state State-B
  4891. In State-B moving U
  4892. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4893. predict error 0
  4894. dir: dir isU
  4895. /|\693: O: O1386 (predict-no)
  4896. I see 1 and I'm going to do: predict-no
  4897. ENV: Agent did: predict-no for direction U in state State-B
  4898. In State-B moving U
  4899. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4900. predict error 0
  4901. dir: dir isU
  4902. -/|694: O: O1388 (predict-no)
  4903. I see 1 and I'm going to do: predict-no
  4904. ENV: Agent did: predict-no for direction U in state State-B
  4905. In State-B moving U
  4906. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4907. predict error 0
  4908. dir: dir isR
  4909. \-/695: O: O1390 (predict-no)
  4910. I see 1 and I'm going to do: predict-no
  4911. ENV: Agent did: predict-no for direction R in state State-B
  4912. In State-B moving R
  4913. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4914. predict error 0
  4915. dir: dir isR
  4916. |\-696: O: O1392 (predict-no)
  4917. I see 1 and I'm going to do: predict-no
  4918. ENV: Agent did: predict-no for direction R in state State-B
  4919. In State-B moving R
  4920. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4921. predict error 0
  4922. dir: dir isR
  4923. /|\697: O: O1394 (predict-no)
  4924. I see 1 and I'm going to do: predict-no
  4925. ENV: Agent did: predict-no for direction R in state State-B
  4926. In State-B moving R
  4927. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4928. predict error 0
  4929. dir: dir isU
  4930. -/|698: O: O1396 (predict-no)
  4931. I see 1 and I'm going to do: predict-no
  4932. ENV: Agent did: predict-no for direction U in state State-B
  4933. In State-B moving U
  4934. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4935. predict error 0
  4936. dir: dir isR
  4937. \-/699: O: O1398 (predict-no)
  4938. I see 1 and I'm going to do: predict-no
  4939. ENV: Agent did: predict-no for direction R in state State-B
  4940. In State-B moving R
  4941. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4942. predict error 0
  4943. dir: dir isL
  4944. |\700: O: O1399 (predict-yes)
  4945. I see 1 and I'm going to do: predict-yes
  4946. ENV: Agent did: predict-yes for direction L in state State-B
  4947. In State-B moving L
  4948. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4949. predict error 0
  4950. dir: dir isL
  4951. -/701: O: O1402 (predict-no)
  4952. I see 1 and I'm going to do: predict-no
  4953. ENV: Agent did: predict-no for direction L in state State-A
  4954. In State-A moving L
  4955. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4956. predict error 0
  4957. dir: dir isU
  4958. |702: O: O1404 (predict-no)
  4959. I see 1 and I'm going to do: predict-no
  4960. ENV: Agent did: predict-no for direction U in state State-A
  4961. In State-A moving U
  4962. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4963. predict error 0
  4964. dir: dir isR
  4965. \-/|703: O: O1405 (predict-yes)
  4966. I see 1 and I'm going to do: predict-yes
  4967. ENV: Agent did: predict-yes for direction R in state State-A
  4968. In State-A moving R
  4969. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4970. predict error 0
  4971. dir: dir isR
  4972. \-/704: O: O1408 (predict-no)
  4973. I see 1 and I'm going to do: predict-no
  4974. ENV: Agent did: predict-no for direction R in state State-B
  4975. In State-B moving R
  4976. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4977. predict error 0
  4978. dir: dir isR
  4979. |\705: O: O1410 (predict-no)
  4980. I see 1 and I'm going to do: predict-no
  4981. ENV: Agent did: predict-no for direction R in state State-B
  4982. In State-B moving R
  4983. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4984. predict error 0
  4985. dir: dir isR
  4986. -/|706: O: O1412 (predict-no)
  4987. I see 1 and I'm going to do: predict-no
  4988. ENV: Agent did: predict-no for direction R in state State-B
  4989. In State-B moving R
  4990. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4991. predict error 0
  4992. dir: dir isR
  4993. \-/707: O: O1414 (predict-no)
  4994. I see 1 and I'm going to do: predict-no
  4995. ENV: Agent did: predict-no for direction R in state State-B
  4996. In State-B moving R
  4997. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4998. predict error 0
  4999. dir: dir isL
  5000. |\-708: O: O1415 (predict-yes)
  5001. I see 1 and I'm going to do: predict-yes
  5002. ENV: Agent did: predict-yes for direction L in state State-B
  5003. In State-B moving L
  5004. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5005. predict error 0
  5006. dir: dir isR
  5007. /|709: O: O1417 (predict-yes)
  5008. I see 1 and I'm going to do: predict-yes
  5009. ENV: Agent did: predict-yes for direction R in state State-A
  5010. In State-A moving R
  5011. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5012. predict error 0
  5013. dir: dir isR
  5014. \-710: O: O1420 (predict-no)
  5015. I see 1 and I'm going to do: predict-no
  5016. ENV: Agent did: predict-no for direction R in state State-B
  5017. In State-B moving R
  5018. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5019. predict error 0
  5020. dir: dir isL
  5021. /|\711: O: O1421 (predict-yes)
  5022. I see 1 and I'm going to do: predict-yes
  5023. ENV: Agent did: predict-yes for direction L in state State-B
  5024. In State-B moving L
  5025. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5026. predict error 0
  5027. dir: dir isU
  5028. -712: O: O1424 (predict-no)
  5029. I see 1 and I'm going to do: predict-no
  5030. ENV: Agent did: predict-no for direction U in state State-A
  5031. In State-A moving U
  5032. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5033. predict error 0
  5034. dir: dir isR
  5035. /|713: O: O1425 (predict-yes)
  5036. I see 1 and I'm going to do: predict-yes
  5037. ENV: Agent did: predict-yes for direction R in state State-A
  5038. In State-A moving R
  5039. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5040. predict error 0
  5041. dir: dir isR
  5042. \-/714: O: O1428 (predict-no)
  5043. I see 1 and I'm going to do: predict-no
  5044. ENV: Agent did: predict-no for direction R in state State-B
  5045. In State-B moving R
  5046. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5047. predict error 0
  5048. dir: dir isU
  5049. |\715: O: O1430 (predict-no)
  5050. I see 1 and I'm going to do: predict-no
  5051. ENV: Agent did: predict-no for direction U in state State-B
  5052. In State-B moving U
  5053. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5054. predict error 0
  5055. dir: dir isU
  5056. -/|716: O: O1432 (predict-no)
  5057. I see 1 and I'm going to do: predict-no
  5058. ENV: Agent did: predict-no for direction U in state State-B
  5059. In State-B moving U
  5060. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5061. predict error 0
  5062. dir: dir isU
  5063. \-717: O: O1434 (predict-no)
  5064. I see 1 and I'm going to do: predict-no
  5065. ENV: Agent did: predict-no for direction U in state State-B
  5066. In State-B moving U
  5067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5068. predict error 0
  5069. dir: dir isU
  5070. /|\718: O: O1436 (predict-no)
  5071. I see 1 and I'm going to do: predict-no
  5072. ENV: Agent did: predict-no for direction U in state State-B
  5073. In State-B moving U
  5074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5075. predict error 0
  5076. dir: dir isL
  5077. -/|719: O: O1437 (predict-yes)
  5078. I see 1 and I'm going to do: predict-yes
  5079. ENV: Agent did: predict-yes for direction L in state State-B
  5080. In State-B moving L
  5081. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5082. predict error 0
  5083. dir: dir isU
  5084. \-/720: O: O1440 (predict-no)
  5085. I see 1 and I'm going to do: predict-no
  5086. ENV: Agent did: predict-no for direction U in state State-A
  5087. In State-A moving U
  5088. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5089. predict error 0
  5090. dir: dir isL
  5091. |\-721: O: O1442 (predict-no)
  5092. I see 1 and I'm going to do: predict-no
  5093. ENV: Agent did: predict-no for direction L in state State-A
  5094. In State-A moving L
  5095. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5096. predict error 0
  5097. dir: dir isU
  5098. /722: O: O1444 (predict-no)
  5099. I see 1 and I'm going to do: predict-no
  5100. ENV: Agent did: predict-no for direction U in state State-A
  5101. In State-A moving U
  5102. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5103. predict error 0
  5104. dir: dir isU
  5105. |\-723: O: O1446 (predict-no)
  5106. I see 1 and I'm going to do: predict-no
  5107. ENV: Agent did: predict-no for direction U in state State-A
  5108. In State-A moving U
  5109. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5110. predict error 0
  5111. dir: dir isU
  5112. /|\724: O: O1448 (predict-no)
  5113. I see 1 and I'm going to do: predict-no
  5114. ENV: Agent did: predict-no for direction U in state State-A
  5115. In State-A moving U
  5116. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5117. predict error 0
  5118. dir: dir isL
  5119. -/|725: O: O1450 (predict-no)
  5120. I see 1 and I'm going to do: predict-no
  5121. ENV: Agent did: predict-no for direction L in state State-A
  5122. In State-A moving L
  5123. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5124. predict error 0
  5125. dir: dir isL
  5126. \-/726: O: O1452 (predict-no)
  5127. I see 1 and I'm going to do: predict-no
  5128. ENV: Agent did: predict-no for direction L in state State-A
  5129. In State-A moving L
  5130. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5131. predict error 0
  5132. dir: dir isU
  5133. |\727: O: O1454 (predict-no)
  5134. I see 1 and I'm going to do: predict-no
  5135. ENV: Agent did: predict-no for direction U in state State-A
  5136. In State-A moving U
  5137. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5138. predict error 0
  5139. dir: dir isR
  5140. -/|728: O: O1455 (predict-yes)
  5141. I see 1 and I'm going to do: predict-yes
  5142. ENV: Agent did: predict-yes for direction R in state State-A
  5143. In State-A moving R
  5144. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5145. predict error 0
  5146. dir: dir isR
  5147. \-729: O: O1458 (predict-no)
  5148. I see 1 and I'm going to do: predict-no
  5149. ENV: Agent did: predict-no for direction R in state State-B
  5150. In State-B moving R
  5151. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5152. predict error 0
  5153. dir: dir isU
  5154. /|\-730: O: O1460 (predict-no)
  5155. I see 1 and I'm going to do: predict-no
  5156. ENV: Agent did: predict-no for direction U in state State-B
  5157. In State-B moving U
  5158. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5159. predict error 0
  5160. dir: dir isL
  5161. /|\731: O: O1461 (predict-yes)
  5162. I see 1 and I'm going to do: predict-yes
  5163. ENV: Agent did: predict-yes for direction L in state State-B
  5164. In State-B moving L
  5165. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5166. predict error 0
  5167. dir: dir isR
  5168. -732: O: O1463 (predict-yes)
  5169. I see 1 and I'm going to do: predict-yes
  5170. ENV: Agent did: predict-yes for direction R in state State-A
  5171. In State-A moving R
  5172. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5173. predict error 0
  5174. dir: dir isR
  5175. /|\733: O: O1466 (predict-no)
  5176. I see 1 and I'm going to do: predict-no
  5177. ENV: Agent did: predict-no for direction R in state State-B
  5178. In State-B moving R
  5179. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5180. predict error 0
  5181. dir: dir isL
  5182. -/|734: O: O1467 (predict-yes)
  5183. I see 1 and I'm going to do: predict-yes
  5184. ENV: Agent did: predict-yes for direction L in state State-B
  5185. In State-B moving L
  5186. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5187. predict error 0
  5188. dir: dir isR
  5189. \-735: O: O1469 (predict-yes)
  5190. I see 1 and I'm going to do: predict-yes
  5191. ENV: Agent did: predict-yes for direction R in state State-A
  5192. In State-A moving R
  5193. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5194. predict error 0
  5195. dir: dir isU
  5196. /|\736: O: O1472 (predict-no)
  5197. I see 1 and I'm going to do: predict-no
  5198. ENV: Agent did: predict-no for direction U in state State-B
  5199. In State-B moving U
  5200. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5201. predict error 0
  5202. dir: dir isU
  5203. -/|737: O: O1474 (predict-no)
  5204. I see 1 and I'm going to do: predict-no
  5205. ENV: Agent did: predict-no for direction U in state State-B
  5206. In State-B moving U
  5207. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5208. predict error 0
  5209. dir: dir isL
  5210. \738: O: O1475 (predict-yes)
  5211. I see 1 and I'm going to do: predict-yes
  5212. ENV: Agent did: predict-yes for direction L in state State-B
  5213. In State-B moving L
  5214. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5215. predict error 0
  5216. dir: dir isR
  5217. -/739: O: O1477 (predict-yes)
  5218. I see 1 and I'm going to do: predict-yes
  5219. ENV: Agent did: predict-yes for direction R in state State-A
  5220. In State-A moving R
  5221. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5222. predict error 0
  5223. dir: dir isL
  5224. |\-740: O: O1479 (predict-yes)
  5225. I see 1 and I'm going to do: predict-yes
  5226. ENV: Agent did: predict-yes for direction L in state State-B
  5227. In State-B moving L
  5228. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5229. predict error 0
  5230. dir: dir isU
  5231. /|\741: O: O1482 (predict-no)
  5232. I see 1 and I'm going to do: predict-no
  5233. ENV: Agent did: predict-no for direction U in state State-A
  5234. In State-A moving U
  5235. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5236. predict error 0
  5237. dir: dir isL
  5238. -742: O: O1484 (predict-no)
  5239. I see 1 and I'm going to do: predict-no
  5240. ENV: Agent did: predict-no for direction L in state State-A
  5241. In State-A moving L
  5242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5243. predict error 0
  5244. dir: dir isL
  5245. /|743: O: O1486 (predict-no)
  5246. I see 1 and I'm going to do: predict-no
  5247. ENV: Agent did: predict-no for direction L in state State-A
  5248. In State-A moving L
  5249. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5250. predict error 0
  5251. dir: dir isR
  5252. \-/744: O: O1487 (predict-yes)
  5253. I see 1 and I'm going to do: predict-yes
  5254. ENV: Agent did: predict-yes for direction R in state State-A
  5255. In State-A moving R
  5256. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5257. predict error 0
  5258. dir: dir isU
  5259. |\745: O: O1490 (predict-no)
  5260. I see 1 and I'm going to do: predict-no
  5261. ENV: Agent did: predict-no for direction U in state State-B
  5262. In State-B moving U
  5263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5264. predict error 0
  5265. dir: dir isL
  5266. -/746: O: O1491 (predict-yes)
  5267. I see 1 and I'm going to do: predict-yes
  5268. ENV: Agent did: predict-yes for direction L in state State-B
  5269. In State-B moving L
  5270. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5271. predict error 0
  5272. dir: dir isL
  5273. |\747: O: O1494 (predict-no)
  5274. I see 1 and I'm going to do: predict-no
  5275. ENV: Agent did: predict-no for direction L in state State-A
  5276. In State-A moving L
  5277. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5278. predict error 0
  5279. dir: dir isU
  5280. -/|748: O: O1496 (predict-no)
  5281. I see 1 and I'm going to do: predict-no
  5282. ENV: Agent did: predict-no for direction U in state State-A
  5283. In State-A moving U
  5284. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5285. predict error 0
  5286. dir: dir isU
  5287. \-/749: O: O1498 (predict-no)
  5288. I see 1 and I'm going to do: predict-no
  5289. ENV: Agent did: predict-no for direction U in state State-A
  5290. In State-A moving U
  5291. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5292. predict error 0
  5293. dir: dir isU
  5294. |\750: O: O1500 (predict-no)
  5295. I see 1 and I'm going to do: predict-no
  5296. ENV: Agent did: predict-no for direction U in state State-A
  5297. In State-A moving U
  5298. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5299. predict error 0
  5300. dir: dir isL
  5301. -/|751: O: O1502 (predict-no)
  5302. I see 1 and I'm going to do: predict-no
  5303. ENV: Agent did: predict-no for direction L in state State-A
  5304. In State-A moving L
  5305. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5306. predict error 0
  5307. dir: dir isR
  5308. \752: O: O1503 (predict-yes)
  5309. I see 1 and I'm going to do: predict-yes
  5310. ENV: Agent did: predict-yes for direction R in state State-A
  5311. In State-A moving R
  5312. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5313. predict error 0
  5314. dir: dir isL
  5315. -/|753: O: O1505 (predict-yes)
  5316. I see 1 and I'm going to do: predict-yes
  5317. ENV: Agent did: predict-yes for direction L in state State-B
  5318. In State-B moving L
  5319. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5320. predict error 0
  5321. dir: dir isR
  5322. \-/754: O: O1507 (predict-yes)
  5323. I see 1 and I'm going to do: predict-yes
  5324. ENV: Agent did: predict-yes for direction R in state State-A
  5325. In State-A moving R
  5326. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5327. predict error 0
  5328. dir: dir isL
  5329. |\-755: O: O1509 (predict-yes)
  5330. I see 1 and I'm going to do: predict-yes
  5331. ENV: Agent did: predict-yes for direction L in state State-B
  5332. In State-B moving L
  5333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5334. predict error 0
  5335. dir: dir isR
  5336. /|\-sleeping...
  5337. /756: O: O1511 (predict-yes)
  5338. I see 1 and I'm going to do: predict-yes
  5339. ENV: Agent did: predict-yes for direction R in state State-A
  5340. In State-A moving R
  5341. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5342. predict error 0
  5343. dir: dir isU
  5344. |\-757: O: O1514 (predict-no)
  5345. I see 1 and I'm going to do: predict-no
  5346. ENV: Agent did: predict-no for direction U in state State-B
  5347. In State-B moving U
  5348. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5349. predict error 0
  5350. dir: dir isU
  5351. /|\758: O: O1516 (predict-no)
  5352. I see 1 and I'm going to do: predict-no
  5353. ENV: Agent did: predict-no for direction U in state State-B
  5354. In State-B moving U
  5355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5356. predict error 0
  5357. dir: dir isR
  5358. -/|759: O: O1518 (predict-no)
  5359. I see 1 and I'm going to do: predict-no
  5360. ENV: Agent did: predict-no for direction R in state State-B
  5361. In State-B moving R
  5362. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5363. predict error 0
  5364. dir: dir isL
  5365. \-/760: O: O1519 (predict-yes)
  5366. I see 1 and I'm going to do: predict-yes
  5367. ENV: Agent did: predict-yes for direction L in state State-B
  5368. In State-B moving L
  5369. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5370. predict error 0
  5371. dir: dir isR
  5372. |\-761: O: O1521 (predict-yes)
  5373. I see 1 and I'm going to do: predict-yes
  5374. ENV: Agent did: predict-yes for direction R in state State-A
  5375. In State-A moving R
  5376. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5377. predict error 0
  5378. dir: dir isR
  5379. /762: O: O1524 (predict-no)
  5380. I see 1 and I'm going to do: predict-no
  5381. ENV: Agent did: predict-no for direction R in state State-B
  5382. In State-B moving R
  5383. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5384. predict error 0
  5385. dir: dir isU
  5386. |\-763: O: O1526 (predict-no)
  5387. I see 1 and I'm going to do: predict-no
  5388. ENV: Agent did: predict-no for direction U in state State-B
  5389. In State-B moving U
  5390. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5391. predict error 0
  5392. dir: dir isU
  5393. /|\764: O: O1528 (predict-no)
  5394. I see 1 and I'm going to do: predict-no
  5395. ENV: Agent did: predict-no for direction U in state State-B
  5396. In State-B moving U
  5397. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5398. predict error 0
  5399. dir: dir isU
  5400. -/|765: O: O1530 (predict-no)
  5401. I see 1 and I'm going to do: predict-no
  5402. ENV: Agent did: predict-no for direction U in state State-B
  5403. In State-B moving U
  5404. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5405. predict error 0
  5406. dir: dir isU
  5407. \766: O: O1532 (predict-no)
  5408. I see 1 and I'm going to do: predict-no
  5409. ENV: Agent did: predict-no for direction U in state State-B
  5410. In State-B moving U
  5411. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5412. predict error 0
  5413. dir: dir isR
  5414. -/|767: O: O1534 (predict-no)
  5415. I see 1 and I'm going to do: predict-no
  5416. ENV: Agent did: predict-no for direction R in state State-B
  5417. In State-B moving R
  5418. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5419. predict error 0
  5420. dir: dir isU
  5421. \-768: O: O1536 (predict-no)
  5422. I see 1 and I'm going to do: predict-no
  5423. ENV: Agent did: predict-no for direction U in state State-B
  5424. In State-B moving U
  5425. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5426. predict error 0
  5427. dir: dir isU
  5428. /|\769: O: O1538 (predict-no)
  5429. I see 1 and I'm going to do: predict-no
  5430. ENV: Agent did: predict-no for direction U in state State-B
  5431. In State-B moving U
  5432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5433. predict error 0
  5434. dir: dir isL
  5435. -/|770: O: O1539 (predict-yes)
  5436. I see 1 and I'm going to do: predict-yes
  5437. ENV: Agent did: predict-yes for direction L in state State-B
  5438. In State-B moving L
  5439. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5440. predict error 0
  5441. dir: dir isL
  5442. \-771: O: O1542 (predict-no)
  5443. I see 1 and I'm going to do: predict-no
  5444. ENV: Agent did: predict-no for direction L in state State-A
  5445. In State-A moving L
  5446. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5447. predict error 0
  5448. dir: dir isR
  5449. /772: O: O1543 (predict-yes)
  5450. I see 1 and I'm going to do: predict-yes
  5451. ENV: Agent did: predict-yes for direction R in state State-A
  5452. In State-A moving R
  5453. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5454. predict error 0
  5455. dir: dir isR
  5456. |\-773: O: O1546 (predict-no)
  5457. I see 1 and I'm going to do: predict-no
  5458. ENV: Agent did: predict-no for direction R in state State-B
  5459. In State-B moving R
  5460. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5461. predict error 0
  5462. dir: dir isL
  5463. /|774: O: O1547 (predict-yes)
  5464. I see 1 and I'm going to do: predict-yes
  5465. ENV: Agent did: predict-yes for direction L in state State-B
  5466. In State-B moving L
  5467. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5468. predict error 0
  5469. dir: dir isR
  5470. \-/|775: O: O1549 (predict-yes)
  5471. I see 1 and I'm going to do: predict-yes
  5472. ENV: Agent did: predict-yes for direction R in state State-A
  5473. In State-A moving R
  5474. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5475. predict error 0
  5476. dir: dir isR
  5477. \-/776: O: O1552 (predict-no)
  5478. I see 1 and I'm going to do: predict-no
  5479. ENV: Agent did: predict-no for direction R in state State-B
  5480. In State-B moving R
  5481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5482. predict error 0
  5483. dir: dir isL
  5484. |\-777: O: O1553 (predict-yes)
  5485. I see 1 and I'm going to do: predict-yes
  5486. ENV: Agent did: predict-yes for direction L in state State-B
  5487. In State-B moving L
  5488. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5489. predict error 0
  5490. dir: dir isU
  5491. /|\-sleeping...
  5492. /778: O: O1556 (predict-no)
  5493. I see 1 and I'm going to do: predict-no
  5494. ENV: Agent did: predict-no for direction U in state State-A
  5495. In State-A moving U
  5496. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5497. predict error 0
  5498. dir: dir isR
  5499. |\-779: O: O1557 (predict-yes)
  5500. I see 1 and I'm going to do: predict-yes
  5501. ENV: Agent did: predict-yes for direction R in state State-A
  5502. In State-A moving R
  5503. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5504. predict error 0
  5505. dir: dir isL
  5506. /|\780: O: O1559 (predict-yes)
  5507. I see 1 and I'm going to do: predict-yes
  5508. ENV: Agent did: predict-yes for direction L in state State-B
  5509. In State-B moving L
  5510. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5511. predict error 0
  5512. dir: dir isL
  5513. -/|781: O: O1562 (predict-no)
  5514. I see 1 and I'm going to do: predict-no
  5515. ENV: Agent did: predict-no for direction L in state State-A
  5516. In State-A moving L
  5517. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5518. predict error 0
  5519. dir: dir isR
  5520. \782: O: O1563 (predict-yes)
  5521. I see 1 and I'm going to do: predict-yes
  5522. ENV: Agent did: predict-yes for direction R in state State-A
  5523. In State-A moving R
  5524. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5525. predict error 0
  5526. dir: dir isL
  5527. -/|783: O: O1565 (predict-yes)
  5528. I see 1 and I'm going to do: predict-yes
  5529. ENV: Agent did: predict-yes for direction L in state State-B
  5530. In State-B moving L
  5531. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5532. predict error 0
  5533. dir: dir isU
  5534. \-/784: O: O1568 (predict-no)
  5535. I see 1 and I'm going to do: predict-no
  5536. ENV: Agent did: predict-no for direction U in state State-A
  5537. In State-A moving U
  5538. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5539. predict error 0
  5540. dir: dir isR
  5541. |\785: O: O1569 (predict-yes)
  5542. I see 1 and I'm going to do: predict-yes
  5543. ENV: Agent did: predict-yes for direction R in state State-A
  5544. In State-A moving R
  5545. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5546. predict error 0
  5547. dir: dir isR
  5548. -/|786: O: O1572 (predict-no)
  5549. I see 1 and I'm going to do: predict-no
  5550. ENV: Agent did: predict-no for direction R in state State-B
  5551. In State-B moving R
  5552. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5553. predict error 0
  5554. dir: dir isL
  5555. \-/|787: O: O1573 (predict-yes)
  5556. I see 1 and I'm going to do: predict-yes
  5557. ENV: Agent did: predict-yes for direction L in state State-B
  5558. In State-B moving L
  5559. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5560. predict error 0
  5561. dir: dir isU
  5562. \-/788: O: O1576 (predict-no)
  5563. I see 1 and I'm going to do: predict-no
  5564. ENV: Agent did: predict-no for direction U in state State-A
  5565. In State-A moving U
  5566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5567. predict error 0
  5568. dir: dir isL
  5569. |\-/789: O: O1578 (predict-no)
  5570. I see 1 and I'm going to do: predict-no
  5571. ENV: Agent did: predict-no for direction L in state State-A
  5572. In State-A moving L
  5573. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5574. predict error 0
  5575. dir: dir isL
  5576. |\790: O: O1580 (predict-no)
  5577. I see 1 and I'm going to do: predict-no
  5578. ENV: Agent did: predict-no for direction L in state State-A
  5579. In State-A moving L
  5580. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5581. predict error 0
  5582. dir: dir isL
  5583. -/|791: O: O1582 (predict-no)
  5584. I see 1 and I'm going to do: predict-no
  5585. ENV: Agent did: predict-no for direction L in state State-A
  5586. In State-A moving L
  5587. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5588. predict error 0
  5589. dir: dir isU
  5590. \792: O: O1584 (predict-no)
  5591. I see 1 and I'm going to do: predict-no
  5592. ENV: Agent did: predict-no for direction U in state State-A
  5593. In State-A moving U
  5594. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5595. predict error 0
  5596. dir: dir isR
  5597. -/793: O: O1585 (predict-yes)
  5598. I see 1 and I'm going to do: predict-yes
  5599. ENV: Agent did: predict-yes for direction R in state State-A
  5600. In State-A moving R
  5601. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5602. predict error 0
  5603. dir: dir isU
  5604. |\-794: O: O1588 (predict-no)
  5605. I see 1 and I'm going to do: predict-no
  5606. ENV: Agent did: predict-no for direction U in state State-B
  5607. In State-B moving U
  5608. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5609. predict error 0
  5610. dir: dir isU
  5611. /|\795: O: O1590 (predict-no)
  5612. I see 1 and I'm going to do: predict-no
  5613. ENV: Agent did: predict-no for direction U in state State-B
  5614. In State-B moving U
  5615. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5616. predict error 0
  5617. dir: dir isU
  5618. -/|796: O: O1592 (predict-no)
  5619. I see 1 and I'm going to do: predict-no
  5620. ENV: Agent did: predict-no for direction U in state State-B
  5621. In State-B moving U
  5622. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5623. predict error 0
  5624. dir: dir isU
  5625. \-/797: O: O1594 (predict-no)
  5626. I see 1 and I'm going to do: predict-no
  5627. ENV: Agent did: predict-no for direction U in state State-B
  5628. In State-B moving U
  5629. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5630. predict error 0
  5631. dir: dir isU
  5632. |\798: O: O1596 (predict-no)
  5633. I see 1 and I'm going to do: predict-no
  5634. ENV: Agent did: predict-no for direction U in state State-B
  5635. In State-B moving U
  5636. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5637. predict error 0
  5638. dir: dir isU
  5639. -/|799: O: O1598 (predict-no)
  5640. I see 1 and I'm going to do: predict-no
  5641. ENV: Agent did: predict-no for direction U in state State-B
  5642. In State-B moving U
  5643. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5644. predict error 0
  5645. dir: dir isU
  5646. \-/800: O: O1600 (predict-no)
  5647. I see 1 and I'm going to do: predict-no
  5648. ENV: Agent did: predict-no for direction U in state State-B
  5649. In State-B moving U
  5650. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5651. predict error 0
  5652. dir: dir isL
  5653. |\-801: O: O1601 (predict-yes)
  5654. I see 1 and I'm going to do: predict-yes
  5655. ENV: Agent did: predict-yes for direction L in state State-B
  5656. In State-B moving L
  5657. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5658. predict error 0
  5659. dir: dir isR
  5660. /802: O: O1603 (predict-yes)
  5661. I see 1 and I'm going to do: predict-yes
  5662. ENV: Agent did: predict-yes for direction R in state State-A
  5663. In State-A moving R
  5664. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5665. predict error 0
  5666. dir: dir isR
  5667. |\-803: O: O1606 (predict-no)
  5668. I see 1 and I'm going to do: predict-no
  5669. ENV: Agent did: predict-no for direction R in state State-B
  5670. In State-B moving R
  5671. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5672. predict error 0
  5673. dir: dir isU
  5674. /|\804: O: O1608 (predict-no)
  5675. I see 1 and I'm going to do: predict-no
  5676. ENV: Agent did: predict-no for direction U in state State-B
  5677. In State-B moving U
  5678. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5679. predict error 0
  5680. dir: dir isU
  5681. -/|805: O: O1610 (predict-no)
  5682. I see 1 and I'm going to do: predict-no
  5683. ENV: Agent did: predict-no for direction U in state State-B
  5684. In State-B moving U
  5685. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5686. predict error 0
  5687. dir: dir isU
  5688. \-/806: O: O1612 (predict-no)
  5689. I see 1 and I'm going to do: predict-no
  5690. ENV: Agent did: predict-no for direction U in state State-B
  5691. In State-B moving U
  5692. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5693. predict error 0
  5694. dir: dir isU
  5695. |\807: O: O1614 (predict-no)
  5696. I see 1 and I'm going to do: predict-no
  5697. ENV: Agent did: predict-no for direction U in state State-B
  5698. In State-B moving U
  5699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5700. predict error 0
  5701. dir: dir isR
  5702. -808: O: O1616 (predict-no)
  5703. I see 1 and I'm going to do: predict-no
  5704. ENV: Agent did: predict-no for direction R in state State-B
  5705. In State-B moving R
  5706. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5707. predict error 0
  5708. dir: dir isU
  5709. /|\809: O: O1618 (predict-no)
  5710. I see 1 and I'm going to do: predict-no
  5711. ENV: Agent did: predict-no for direction U in state State-B
  5712. In State-B moving U
  5713. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5714. predict error 0
  5715. dir: dir isR
  5716. -/|\sleeping...
  5717. -810: O: O1620 (predict-no)
  5718. I see 1 and I'm going to do: predict-no
  5719. ENV: Agent did: predict-no for direction R in state State-B
  5720. In State-B moving R
  5721. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5722. predict error 0
  5723. dir: dir isR
  5724. /|\811: O: O1622 (predict-no)
  5725. I see 1 and I'm going to do: predict-no
  5726. ENV: Agent did: predict-no for direction R in state State-B
  5727. In State-B moving R
  5728. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5729. predict error 0
  5730. dir: dir isR
  5731. -812: O: O1624 (predict-no)
  5732. I see 1 and I'm going to do: predict-no
  5733. ENV: Agent did: predict-no for direction R in state State-B
  5734. In State-B moving R
  5735. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5736. predict error 0
  5737. dir: dir isU
  5738. /|\813: O: O1626 (predict-no)
  5739. I see 1 and I'm going to do: predict-no
  5740. ENV: Agent did: predict-no for direction U in state State-B
  5741. In State-B moving U
  5742. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5743. predict error 0
  5744. dir: dir isR
  5745. -/|814: O: O1628 (predict-no)
  5746. I see 1 and I'm going to do: predict-no
  5747. ENV: Agent did: predict-no for direction R in state State-B
  5748. In State-B moving R
  5749. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5750. predict error 0
  5751. dir: dir isL
  5752. \-/815: O: O1629 (predict-yes)
  5753. I see 1 and I'm going to do: predict-yes
  5754. ENV: Agent did: predict-yes for direction L in state State-B
  5755. In State-B moving L
  5756. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5757. predict error 0
  5758. dir: dir isL
  5759. |\-816: O: O1632 (predict-no)
  5760. I see 1 and I'm going to do: predict-no
  5761. ENV: Agent did: predict-no for direction L in state State-A
  5762. In State-A moving L
  5763. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5764. predict error 0
  5765. dir: dir isU
  5766. /|817: O: O1634 (predict-no)
  5767. I see 1 and I'm going to do: predict-no
  5768. ENV: Agent did: predict-no for direction U in state State-A
  5769. In State-A moving U
  5770. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5771. predict error 0
  5772. dir: dir isR
  5773. \-/818: O: O1635 (predict-yes)
  5774. I see 1 and I'm going to do: predict-yes
  5775. ENV: Agent did: predict-yes for direction R in state State-A
  5776. In State-A moving R
  5777. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5778. predict error 0
  5779. dir: dir isU
  5780. |819: O: O1638 (predict-no)
  5781. I see 1 and I'm going to do: predict-no
  5782. ENV: Agent did: predict-no for direction U in state State-B
  5783. In State-B moving U
  5784. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5785. predict error 0
  5786. dir: dir isL
  5787. \-/820: O: O1639 (predict-yes)
  5788. I see 1 and I'm going to do: predict-yes
  5789. ENV: Agent did: predict-yes for direction L in state State-B
  5790. In State-B moving L
  5791. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5792. predict error 0
  5793. dir: dir isR
  5794. |\-821: O: O1641 (predict-yes)
  5795. I see 1 and I'm going to do: predict-yes
  5796. ENV: Agent did: predict-yes for direction R in state State-A
  5797. In State-A moving R
  5798. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5799. predict error 0
  5800. dir: dir isU
  5801. /822: O: O1644 (predict-no)
  5802. I see 1 and I'm going to do: predict-no
  5803. ENV: Agent did: predict-no for direction U in state State-B
  5804. In State-B moving U
  5805. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5806. predict error 0
  5807. dir: dir isL
  5808. |\-823: O: O1645 (predict-yes)
  5809. I see 1 and I'm going to do: predict-yes
  5810. ENV: Agent did: predict-yes for direction L in state State-B
  5811. In State-B moving L
  5812. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5813. predict error 0
  5814. dir: dir isL
  5815. /|\-824: O: O1648 (predict-no)
  5816. I see 1 and I'm going to do: predict-no
  5817. ENV: Agent did: predict-no for direction L in state State-A
  5818. In State-A moving L
  5819. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5820. predict error 0
  5821. dir: dir isR
  5822. /|825: O: O1649 (predict-yes)
  5823. I see 1 and I'm going to do: predict-yes
  5824. ENV: Agent did: predict-yes for direction R in state State-A
  5825. In State-A moving R
  5826. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5827. predict error 0
  5828. dir: dir isL
  5829. \-/826: O: O1651 (predict-yes)
  5830. I see 1 and I'm going to do: predict-yes
  5831. ENV: Agent did: predict-yes for direction L in state State-B
  5832. In State-B moving L
  5833. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5834. predict error 0
  5835. dir: dir isL
  5836. |\-827: O: O1654 (predict-no)
  5837. I see 1 and I'm going to do: predict-no
  5838. ENV: Agent did: predict-no for direction L in state State-A
  5839. In State-A moving L
  5840. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5841. predict error 0
  5842. dir: dir isL
  5843. /|\828: O: O1656 (predict-no)
  5844. I see 1 and I'm going to do: predict-no
  5845. ENV: Agent did: predict-no for direction L in state State-A
  5846. In State-A moving L
  5847. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5848. predict error 0
  5849. dir: dir isR
  5850. -829: O: O1657 (predict-yes)
  5851. I see 1 and I'm going to do: predict-yes
  5852. ENV: Agent did: predict-yes for direction R in state State-A
  5853. In State-A moving R
  5854. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5855. predict error 0
  5856. dir: dir isR
  5857. /|\830: O: O1660 (predict-no)
  5858. I see 1 and I'm going to do: predict-no
  5859. ENV: Agent did: predict-no for direction R in state State-B
  5860. In State-B moving R
  5861. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5862. predict error 0
  5863. dir: dir isL
  5864. -/831: O: O1661 (predict-yes)
  5865. I see 1 and I'm going to do: predict-yes
  5866. ENV: Agent did: predict-yes for direction L in state State-B
  5867. In State-B moving L
  5868. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5869. predict error 0
  5870. dir: dir isL
  5871. |832: O: O1664 (predict-no)
  5872. I see 1 and I'm going to do: predict-no
  5873. ENV: Agent did: predict-no for direction L in state State-A
  5874. In State-A moving L
  5875. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5876. predict error 0
  5877. dir: dir isU
  5878. \-/833: O: O1666 (predict-no)
  5879. I see 1 and I'm going to do: predict-no
  5880. ENV: Agent did: predict-no for direction U in state State-A
  5881. In State-A moving U
  5882. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5883. predict error 0
  5884. dir: dir isR
  5885. |\834: O: O1667 (predict-yes)
  5886. I see 1 and I'm going to do: predict-yes
  5887. ENV: Agent did: predict-yes for direction R in state State-A
  5888. In State-A moving R
  5889. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5890. predict error 0
  5891. dir: dir isL
  5892. -/835: O: O1669 (predict-yes)
  5893. I see 1 and I'm going to do: predict-yes
  5894. ENV: Agent did: predict-yes for direction L in state State-B
  5895. In State-B moving L
  5896. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5897. predict error 0
  5898. dir: dir isU
  5899. |\-836: O: O1672 (predict-no)
  5900. I see 1 and I'm going to do: predict-no
  5901. ENV: Agent did: predict-no for direction U in state State-A
  5902. In State-A moving U
  5903. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5904. predict error 0
  5905. dir: dir isL
  5906. /|\837: O: O1674 (predict-no)
  5907. I see 1 and I'm going to do: predict-no
  5908. ENV: Agent did: predict-no for direction L in state State-A
  5909. In State-A moving L
  5910. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5911. predict error 0
  5912. dir: dir isR
  5913. -838: O: O1675 (predict-yes)
  5914. I see 1 and I'm going to do: predict-yes
  5915. ENV: Agent did: predict-yes for direction R in state State-A
  5916. In State-A moving R
  5917. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5918. predict error 0
  5919. dir: dir isU
  5920. /|\839: O: O1678 (predict-no)
  5921. I see 1 and I'm going to do: predict-no
  5922. ENV: Agent did: predict-no for direction U in state State-B
  5923. In State-B moving U
  5924. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5925. predict error 0
  5926. dir: dir isU
  5927. -/840: O: O1680 (predict-no)
  5928. I see 1 and I'm going to do: predict-no
  5929. ENV: Agent did: predict-no for direction U in state State-B
  5930. In State-B moving U
  5931. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5932. predict error 0
  5933. dir: dir isL
  5934. |\-841: O: O1681 (predict-yes)
  5935. I see 1 and I'm going to do: predict-yes
  5936. ENV: Agent did: predict-yes for direction L in state State-B
  5937. In State-B moving L
  5938. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5939. predict error 0
  5940. dir: dir isU
  5941. /842: O: O1684 (predict-no)
  5942. I see 1 and I'm going to do: predict-no
  5943. ENV: Agent did: predict-no for direction U in state State-A
  5944. In State-A moving U
  5945. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5946. predict error 0
  5947. dir: dir isR
  5948. |\-843: O: O1685 (predict-yes)
  5949. I see 1 and I'm going to do: predict-yes
  5950. ENV: Agent did: predict-yes for direction R in state State-A
  5951. In State-A moving R
  5952. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5953. predict error 0
  5954. dir: dir isU
  5955. /|844: O: O1688 (predict-no)
  5956. I see 1 and I'm going to do: predict-no
  5957. ENV: Agent did: predict-no for direction U in state State-B
  5958. In State-B moving U
  5959. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5960. predict error 0
  5961. dir: dir isU
  5962. \-845: O: O1690 (predict-no)
  5963. I see 1 and I'm going to do: predict-no
  5964. ENV: Agent did: predict-no for direction U in state State-B
  5965. In State-B moving U
  5966. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5967. predict error 0
  5968. dir: dir isR
  5969. /|846: O: O1692 (predict-no)
  5970. I see 1 and I'm going to do: predict-no
  5971. ENV: Agent did: predict-no for direction R in state State-B
  5972. In State-B moving R
  5973. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5974. predict error 0
  5975. dir: dir isU
  5976. \847: O: O1694 (predict-no)
  5977. I see 1 and I'm going to do: predict-no
  5978. ENV: Agent did: predict-no for direction U in state State-B
  5979. In State-B moving U
  5980. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5981. predict error 0
  5982. dir: dir isR
  5983. -/|848: O: O1696 (predict-no)
  5984. I see 1 and I'm going to do: predict-no
  5985. ENV: Agent did: predict-no for direction R in state State-B
  5986. In State-B moving R
  5987. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5988. predict error 0
  5989. dir: dir isU
  5990. \-849: O: O1698 (predict-no)
  5991. I see 1 and I'm going to do: predict-no
  5992. ENV: Agent did: predict-no for direction U in state State-B
  5993. In State-B moving U
  5994. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5995. predict error 0
  5996. dir: dir isU
  5997. /|850: O: O1700 (predict-no)
  5998. I see 1 and I'm going to do: predict-no
  5999. ENV: Agent did: predict-no for direction U in state State-B
  6000. In State-B moving U
  6001. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6002. predict error 0
  6003. dir: dir isU
  6004. \-/851: O: O1702 (predict-no)
  6005. I see 1 and I'm going to do: predict-no
  6006. ENV: Agent did: predict-no for direction U in state State-B
  6007. In State-B moving U
  6008. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6009. predict error 0
  6010. dir: dir isU
  6011. |852: O: O1704 (predict-no)
  6012. I see 1 and I'm going to do: predict-no
  6013. ENV: Agent did: predict-no for direction U in state State-B
  6014. In State-B moving U
  6015. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6016. predict error 0
  6017. dir: dir isU
  6018. \-853: O: O1706 (predict-no)
  6019. I see 1 and I'm going to do: predict-no
  6020. ENV: Agent did: predict-no for direction U in state State-B
  6021. In State-B moving U
  6022. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6023. predict error 0
  6024. dir: dir isL
  6025. /|\854: O: O1707 (predict-yes)
  6026. I see 1 and I'm going to do: predict-yes
  6027. ENV: Agent did: predict-yes for direction L in state State-B
  6028. In State-B moving L
  6029. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6030. predict error 0
  6031. dir: dir isL
  6032. -/|855: O: O1710 (predict-no)
  6033. I see 1 and I'm going to do: predict-no
  6034. ENV: Agent did: predict-no for direction L in state State-A
  6035. In State-A moving L
  6036. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6037. predict error 0
  6038. dir: dir isU
  6039. \-/856: O: O1712 (predict-no)
  6040. I see 1 and I'm going to do: predict-no
  6041. ENV: Agent did: predict-no for direction U in state State-A
  6042. In State-A moving U
  6043. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6044. predict error 0
  6045. dir: dir isU
  6046. |\857: O: O1714 (predict-no)
  6047. I see 1 and I'm going to do: predict-no
  6048. ENV: Agent did: predict-no for direction U in state State-A
  6049. In State-A moving U
  6050. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6051. predict error 0
  6052. dir: dir isR
  6053. -/|858: O: O1715 (predict-yes)
  6054. I see 1 and I'm going to do: predict-yes
  6055. ENV: Agent did: predict-yes for direction R in state State-A
  6056. In State-A moving R
  6057. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6058. predict error 0
  6059. dir: dir isR
  6060. \-859: O: O1718 (predict-no)
  6061. I see 1 and I'm going to do: predict-no
  6062. ENV: Agent did: predict-no for direction R in state State-B
  6063. In State-B moving R
  6064. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6065. predict error 0
  6066. dir: dir isR
  6067. /|\860: O: O1720 (predict-no)
  6068. I see 1 and I'm going to do: predict-no
  6069. ENV: Agent did: predict-no for direction R in state State-B
  6070. In State-B moving R
  6071. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6072. predict error 0
  6073. dir: dir isU
  6074. -/|861: O: O1722 (predict-no)
  6075. I see 1 and I'm going to do: predict-no
  6076. ENV: Agent did: predict-no for direction U in state State-B
  6077. In State-B moving U
  6078. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6079. predict error 0
  6080. dir: dir isU
  6081. \862: O: O1724 (predict-no)
  6082. I see 1 and I'm going to do: predict-no
  6083. ENV: Agent did: predict-no for direction U in state State-B
  6084. In State-B moving U
  6085. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6086. predict error 0
  6087. dir: dir isR
  6088. -/|863: O: O1726 (predict-no)
  6089. I see 1 and I'm going to do: predict-no
  6090. ENV: Agent did: predict-no for direction R in state State-B
  6091. In State-B moving R
  6092. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6093. predict error 0
  6094. dir: dir isL
  6095. \-/864: O: O1727 (predict-yes)
  6096. I see 1 and I'm going to do: predict-yes
  6097. ENV: Agent did: predict-yes for direction L in state State-B
  6098. In State-B moving L
  6099. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6100. predict error 0
  6101. dir: dir isU
  6102. |\865: O: O1730 (predict-no)
  6103. I see 1 and I'm going to do: predict-no
  6104. ENV: Agent did: predict-no for direction U in state State-A
  6105. In State-A moving U
  6106. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6107. predict error 0
  6108. dir: dir isR
  6109. -/|866: O: O1731 (predict-yes)
  6110. I see 1 and I'm going to do: predict-yes
  6111. ENV: Agent did: predict-yes for direction R in state State-A
  6112. In State-A moving R
  6113. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6114. predict error 0
  6115. dir: dir isL
  6116. \-/867: O: O1733 (predict-yes)
  6117. I see 1 and I'm going to do: predict-yes
  6118. ENV: Agent did: predict-yes for direction L in state State-B
  6119. In State-B moving L
  6120. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6121. predict error 0
  6122. dir: dir isL
  6123. |\868: O: O1736 (predict-no)
  6124. I see 1 and I'm going to do: predict-no
  6125. ENV: Agent did: predict-no for direction L in state State-A
  6126. In State-A moving L
  6127. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6128. predict error 0
  6129. dir: dir isU
  6130. -/|869: O: O1738 (predict-no)
  6131. I see 1 and I'm going to do: predict-no
  6132. ENV: Agent did: predict-no for direction U in state State-A
  6133. In State-A moving U
  6134. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6135. predict error 0
  6136. dir: dir isL
  6137. \-/870: O: O1740 (predict-no)
  6138. I see 1 and I'm going to do: predict-no
  6139. ENV: Agent did: predict-no for direction L in state State-A
  6140. In State-A moving L
  6141. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6142. predict error 0
  6143. dir: dir isL
  6144. |\871: O: O1742 (predict-no)
  6145. I see 1 and I'm going to do: predict-no
  6146. ENV: Agent did: predict-no for direction L in state State-A
  6147. In State-A moving L
  6148. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6149. predict error 0
  6150. dir: dir isL
  6151. -872: O: O1744 (predict-no)
  6152. I see 1 and I'm going to do: predict-no
  6153. ENV: Agent did: predict-no for direction L in state State-A
  6154. In State-A moving L
  6155. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6156. predict error 0
  6157. dir: dir isU
  6158. /|\873: O: O1746 (predict-no)
  6159. I see 1 and I'm going to do: predict-no
  6160. ENV: Agent did: predict-no for direction U in state State-A
  6161. In State-A moving U
  6162. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6163. predict error 0
  6164. dir: dir isU
  6165. -/|\874: O: O1748 (predict-no)
  6166. I see 1 and I'm going to do: predict-no
  6167. ENV: Agent did: predict-no for direction U in state State-A
  6168. In State-A moving U
  6169. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6170. predict error 0
  6171. dir: dir isU
  6172. -/875: O: O1750 (predict-no)
  6173. I see 1 and I'm going to do: predict-no
  6174. ENV: Agent did: predict-no for direction U in state State-A
  6175. In State-A moving U
  6176. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6177. predict error 0
  6178. dir: dir isR
  6179. |\-876: O: O1751 (predict-yes)
  6180. I see 1 and I'm going to do: predict-yes
  6181. ENV: Agent did: predict-yes for direction R in state State-A
  6182. In State-A moving R
  6183. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6184. predict error 0
  6185. dir: dir isR
  6186. /877: O: O1754 (predict-no)
  6187. I see 1 and I'm going to do: predict-no
  6188. ENV: Agent did: predict-no for direction R in state State-B
  6189. In State-B moving R
  6190. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6191. predict error 0
  6192. dir: dir isR
  6193. |\-878: O: O1756 (predict-no)
  6194. I see 1 and I'm going to do: predict-no
  6195. ENV: Agent did: predict-no for direction R in state State-B
  6196. In State-B moving R
  6197. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6198. predict error 0
  6199. dir: dir isR
  6200. /|\879: O: O1758 (predict-no)
  6201. I see 1 and I'm going to do: predict-no
  6202. ENV: Agent did: predict-no for direction R in state State-B
  6203. In State-B moving R
  6204. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6205. predict error 0
  6206. dir: dir isR
  6207. -880: O: O1760 (predict-no)
  6208. I see 1 and I'm going to do: predict-no
  6209. ENV: Agent did: predict-no for direction R in state State-B
  6210. In State-B moving R
  6211. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6212. predict error 0
  6213. dir: dir isU
  6214. /|\881: O: O1762 (predict-no)
  6215. I see 1 and I'm going to do: predict-no
  6216. ENV: Agent did: predict-no for direction U in state State-B
  6217. In State-B moving U
  6218. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6219. predict error 0
  6220. dir: dir isU
  6221. -882: O: O1764 (predict-no)
  6222. I see 1 and I'm going to do: predict-no
  6223. ENV: Agent did: predict-no for direction U in state State-B
  6224. In State-B moving U
  6225. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6226. predict error 0
  6227. dir: dir isR
  6228. /|\883: O: O1766 (predict-no)
  6229. I see 1 and I'm going to do: predict-no
  6230. ENV: Agent did: predict-no for direction R in state State-B
  6231. In State-B moving R
  6232. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6233. predict error 0
  6234. dir: dir isR
  6235. -/884: O: O1768 (predict-no)
  6236. I see 1 and I'm going to do: predict-no
  6237. ENV: Agent did: predict-no for direction R in state State-B
  6238. In State-B moving R
  6239. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6240. predict error 0
  6241. dir: dir isL
  6242. |\-885: O: O1769 (predict-yes)
  6243. I see 1 and I'm going to do: predict-yes
  6244. ENV: Agent did: predict-yes for direction L in state State-B
  6245. In State-B moving L
  6246. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6247. predict error 0
  6248. dir: dir isL
  6249. /|\-886: O: O1772 (predict-no)
  6250. I see 1 and I'm going to do: predict-no
  6251. ENV: Agent did: predict-no for direction L in state State-A
  6252. In State-A moving L
  6253. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6254. predict error 0
  6255. dir: dir isR
  6256. /|887: O: O1773 (predict-yes)
  6257. I see 1 and I'm going to do: predict-yes
  6258. ENV: Agent did: predict-yes for direction R in state State-A
  6259. In State-A moving R
  6260. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6261. predict error 0
  6262. dir: dir isR
  6263. \-/888: O: O1776 (predict-no)
  6264. I see 1 and I'm going to do: predict-no
  6265. ENV: Agent did: predict-no for direction R in state State-B
  6266. In State-B moving R
  6267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6268. predict error 0
  6269. dir: dir isR
  6270. |\-889: O: O1778 (predict-no)
  6271. I see 1 and I'm going to do: predict-no
  6272. ENV: Agent did: predict-no for direction R in state State-B
  6273. In State-B moving R
  6274. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6275. predict error 0
  6276. dir: dir isU
  6277. /|890: O: O1780 (predict-no)
  6278. I see 1 and I'm going to do: predict-no
  6279. ENV: Agent did: predict-no for direction U in state State-B
  6280. In State-B moving U
  6281. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6282. predict error 0
  6283. dir: dir isL
  6284. \-/891: O: O1781 (predict-yes)
  6285. I see 1 and I'm going to do: predict-yes
  6286. ENV: Agent did: predict-yes for direction L in state State-B
  6287. In State-B moving L
  6288. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6289. predict error 0
  6290. dir: dir isR
  6291. |892: O: O1783 (predict-yes)
  6292. I see 1 and I'm going to do: predict-yes
  6293. ENV: Agent did: predict-yes for direction R in state State-A
  6294. In State-A moving R
  6295. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6296. predict error 0
  6297. dir: dir isU
  6298. \-/893: O: O1786 (predict-no)
  6299. I see 1 and I'm going to do: predict-no
  6300. ENV: Agent did: predict-no for direction U in state State-B
  6301. In State-B moving U
  6302. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6303. predict error 0
  6304. dir: dir isU
  6305. |\-894: O: O1788 (predict-no)
  6306. I see 1 and I'm going to do: predict-no
  6307. ENV: Agent did: predict-no for direction U in state State-B
  6308. In State-B moving U
  6309. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6310. predict error 0
  6311. dir: dir isR
  6312. /|\895: O: O1790 (predict-no)
  6313. I see 1 and I'm going to do: predict-no
  6314. ENV: Agent did: predict-no for direction R in state State-B
  6315. In State-B moving R
  6316. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6317. predict error 0
  6318. dir: dir isR
  6319. -/896: O: O1792 (predict-no)
  6320. I see 1 and I'm going to do: predict-no
  6321. ENV: Agent did: predict-no for direction R in state State-B
  6322. In State-B moving R
  6323. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6324. predict error 0
  6325. dir: dir isR
  6326. |\897: O: O1794 (predict-no)
  6327. I see 1 and I'm going to do: predict-no
  6328. ENV: Agent did: predict-no for direction R in state State-B
  6329. In State-B moving R
  6330. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6331. predict error 0
  6332. dir: dir isU
  6333. -898: O: O1796 (predict-no)
  6334. I see 1 and I'm going to do: predict-no
  6335. ENV: Agent did: predict-no for direction U in state State-B
  6336. In State-B moving U
  6337. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6338. predict error 0
  6339. dir: dir isU
  6340. /|\899: O: O1798 (predict-no)
  6341. I see 1 and I'm going to do: predict-no
  6342. ENV: Agent did: predict-no for direction U in state State-B
  6343. In State-B moving U
  6344. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6345. predict error 0
  6346. dir: dir isU
  6347. -/|900: O: O1800 (predict-no)
  6348. I see 1 and I'm going to do: predict-no
  6349. ENV: Agent did: predict-no for direction U in state State-B
  6350. In State-B moving U
  6351. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6352. predict error 0
  6353. dir: dir isU
  6354. \-901: O: O1802 (predict-no)
  6355. I see 1 and I'm going to do: predict-no
  6356. ENV: Agent did: predict-no for direction U in state State-B
  6357. In State-B moving U
  6358. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6359. predict error 0
  6360. dir: dir isU
  6361. /902: O: O1804 (predict-no)
  6362. I see 1 and I'm going to do: predict-no
  6363. ENV: Agent did: predict-no for direction U in state State-B
  6364. In State-B moving U
  6365. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6366. predict error 0
  6367. dir: dir isU
  6368. |903: O: O1806 (predict-no)
  6369. I see 1 and I'm going to do: predict-no
  6370. ENV: Agent did: predict-no for direction U in state State-B
  6371. In State-B moving U
  6372. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6373. predict error 0
  6374. dir: dir isR
  6375. \-/904: O: O1808 (predict-no)
  6376. I see 1 and I'm going to do: predict-no
  6377. ENV: Agent did: predict-no for direction R in state State-B
  6378. In State-B moving R
  6379. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6380. predict error 0
  6381. dir: dir isR
  6382. |\-905: O: O1810 (predict-no)
  6383. I see 1 and I'm going to do: predict-no
  6384. ENV: Agent did: predict-no for direction R in state State-B
  6385. In State-B moving R
  6386. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6387. predict error 0
  6388. dir: dir isU
  6389. /|\906: O: O1812 (predict-no)
  6390. I see 1 and I'm going to do: predict-no
  6391. ENV: Agent did: predict-no for direction U in state State-B
  6392. In State-B moving U
  6393. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6394. predict error 0
  6395. dir: dir isR
  6396. -/|907: O: O1814 (predict-no)
  6397. I see 1 and I'm going to do: predict-no
  6398. ENV: Agent did: predict-no for direction R in state State-B
  6399. In State-B moving R
  6400. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6401. predict error 0
  6402. dir: dir isU
  6403. \-/908: O: O1816 (predict-no)
  6404. I see 1 and I'm going to do: predict-no
  6405. ENV: Agent did: predict-no for direction U in state State-B
  6406. In State-B moving U
  6407. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6408. predict error 0
  6409. dir: dir isR
  6410. |\-909: O: O1818 (predict-no)
  6411. I see 1 and I'm going to do: predict-no
  6412. ENV: Agent did: predict-no for direction R in state State-B
  6413. In State-B moving R
  6414. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6415. predict error 0
  6416. dir: dir isR
  6417. /|910: O: O1820 (predict-no)
  6418. I see 1 and I'm going to do: predict-no
  6419. ENV: Agent did: predict-no for direction R in state State-B
  6420. In State-B moving R
  6421. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6422. predict error 0
  6423. dir: dir isR
  6424. \-/911: O: O1822 (predict-no)
  6425. I see 1 and I'm going to do: predict-no
  6426. ENV: Agent did: predict-no for direction R in state State-B
  6427. In State-B moving R
  6428. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6429. predict error 0
  6430. dir: dir isL
  6431. |912: O: O1823 (predict-yes)
  6432. I see 1 and I'm going to do: predict-yes
  6433. ENV: Agent did: predict-yes for direction L in state State-B
  6434. In State-B moving L
  6435. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6436. predict error 0
  6437. dir: dir isR
  6438. \913: O: O1825 (predict-yes)
  6439. I see 1 and I'm going to do: predict-yes
  6440. ENV: Agent did: predict-yes for direction R in state State-A
  6441. In State-A moving R
  6442. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6443. predict error 0
  6444. dir: dir isR
  6445. -/|914: O: O1828 (predict-no)
  6446. I see 1 and I'm going to do: predict-no
  6447. ENV: Agent did: predict-no for direction R in state State-B
  6448. In State-B moving R
  6449. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6450. predict error 0
  6451. dir: dir isL
  6452. \-/915: O: O1829 (predict-yes)
  6453. I see 1 and I'm going to do: predict-yes
  6454. ENV: Agent did: predict-yes for direction L in state State-B
  6455. In State-B moving L
  6456. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6457. predict error 0
  6458. dir: dir isL
  6459. |\916: O: O1832 (predict-no)
  6460. I see 1 and I'm going to do: predict-no
  6461. ENV: Agent did: predict-no for direction L in state State-A
  6462. In State-A moving L
  6463. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6464. predict error 0
  6465. dir: dir isL
  6466. -/917: O: O1834 (predict-no)
  6467. I see 1 and I'm going to do: predict-no
  6468. ENV: Agent did: predict-no for direction L in state State-A
  6469. In State-A moving L
  6470. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6471. predict error 0
  6472. dir: dir isU
  6473. |\-918: O: O1836 (predict-no)
  6474. I see 1 and I'm going to do: predict-no
  6475. ENV: Agent did: predict-no for direction U in state State-A
  6476. In State-A moving U
  6477. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6478. predict error 0
  6479. dir: dir isR
  6480. /|\919: O: O1837 (predict-yes)
  6481. I see 1 and I'm going to do: predict-yes
  6482. ENV: Agent did: predict-yes for direction R in state State-A
  6483. In State-A moving R
  6484. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6485. predict error 0
  6486. dir: dir isL
  6487. -/|920: O: O1839 (predict-yes)
  6488. I see 1 and I'm going to do: predict-yes
  6489. ENV: Agent did: predict-yes for direction L in state State-B
  6490. In State-B moving L
  6491. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6492. predict error 0
  6493. dir: dir isU
  6494. \-/921: O: O1842 (predict-no)
  6495. I see 1 and I'm going to do: predict-no
  6496. ENV: Agent did: predict-no for direction U in state State-A
  6497. In State-A moving U
  6498. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6499. predict error 0
  6500. dir: dir isL
  6501. |922: O: O1844 (predict-no)
  6502. I see 1 and I'm going to do: predict-no
  6503. ENV: Agent did: predict-no for direction L in state State-A
  6504. In State-A moving L
  6505. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6506. predict error 0
  6507. dir: dir isR
  6508. \-923: O: O1845 (predict-yes)
  6509. I see 1 and I'm going to do: predict-yes
  6510. ENV: Agent did: predict-yes for direction R in state State-A
  6511. In State-A moving R
  6512. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6513. predict error 0
  6514. dir: dir isU
  6515. /|\924: O: O1848 (predict-no)
  6516. I see 1 and I'm going to do: predict-no
  6517. ENV: Agent did: predict-no for direction U in state State-B
  6518. In State-B moving U
  6519. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6520. predict error 0
  6521. dir: dir isU
  6522. -/925: O: O1850 (predict-no)
  6523. I see 1 and I'm going to do: predict-no
  6524. ENV: Agent did: predict-no for direction U in state State-B
  6525. In State-B moving U
  6526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6527. predict error 0
  6528. dir: dir isR
  6529. |\926: O: O1852 (predict-no)
  6530. I see 1 and I'm going to do: predict-no
  6531. ENV: Agent did: predict-no for direction R in state State-B
  6532. In State-B moving R
  6533. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6534. predict error 0
  6535. dir: dir isU
  6536. -/|927: O: O1854 (predict-no)
  6537. I see 1 and I'm going to do: predict-no
  6538. ENV: Agent did: predict-no for direction U in state State-B
  6539. In State-B moving U
  6540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6541. predict error 0
  6542. dir: dir isR
  6543. \-928: O: O1856 (predict-no)
  6544. I see 1 and I'm going to do: predict-no
  6545. ENV: Agent did: predict-no for direction R in state State-B
  6546. In State-B moving R
  6547. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6548. predict error 0
  6549. dir: dir isU
  6550. /|\929: O: O1858 (predict-no)
  6551. I see 1 and I'm going to do: predict-no
  6552. ENV: Agent did: predict-no for direction U in state State-B
  6553. In State-B moving U
  6554. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6555. predict error 0
  6556. dir: dir isR
  6557. -/|930: O: O1860 (predict-no)
  6558. I see 1 and I'm going to do: predict-no
  6559. ENV: Agent did: predict-no for direction R in state State-B
  6560. In State-B moving R
  6561. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6562. predict error 0
  6563. dir: dir isU
  6564. \-931: O: O1862 (predict-no)
  6565. I see 1 and I'm going to do: predict-no
  6566. ENV: Agent did: predict-no for direction U in state State-B
  6567. In State-B moving U
  6568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6569. predict error 0
  6570. dir: dir isU
  6571. /932: O: O1864 (predict-no)
  6572. I see 1 and I'm going to do: predict-no
  6573. ENV: Agent did: predict-no for direction U in state State-B
  6574. In State-B moving U
  6575. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6576. predict error 0
  6577. dir: dir isL
  6578. |\-933: O: O1865 (predict-yes)
  6579. I see 1 and I'm going to do: predict-yes
  6580. ENV: Agent did: predict-yes for direction L in state State-B
  6581. In State-B moving L
  6582. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6583. predict error 0
  6584. dir: dir isL
  6585. /|934: O: O1868 (predict-no)
  6586. I see 1 and I'm going to do: predict-no
  6587. ENV: Agent did: predict-no for direction L in state State-A
  6588. In State-A moving L
  6589. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6590. predict error 0
  6591. dir: dir isU
  6592. \-/935: O: O1870 (predict-no)
  6593. I see 1 and I'm going to do: predict-no
  6594. ENV: Agent did: predict-no for direction U in state State-A
  6595. In State-A moving U
  6596. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6597. predict error 0
  6598. dir: dir isL
  6599. |\936: O: O1872 (predict-no)
  6600. I see 1 and I'm going to do: predict-no
  6601. ENV: Agent did: predict-no for direction L in state State-A
  6602. In State-A moving L
  6603. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6604. predict error 0
  6605. dir: dir isL
  6606. -/|937: O: O1874 (predict-no)
  6607. I see 1 and I'm going to do: predict-no
  6608. ENV: Agent did: predict-no for direction L in state State-A
  6609. In State-A moving L
  6610. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6611. predict error 0
  6612. dir: dir isL
  6613. \-/938: O: O1876 (predict-no)
  6614. I see 1 and I'm going to do: predict-no
  6615. ENV: Agent did: predict-no for direction L in state State-A
  6616. In State-A moving L
  6617. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6618. predict error 0
  6619. dir: dir isR
  6620. |\939: O: O1877 (predict-yes)
  6621. I see 1 and I'm going to do: predict-yes
  6622. ENV: Agent did: predict-yes for direction R in state State-A
  6623. In State-A moving R
  6624. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6625. predict error 0
  6626. dir: dir isU
  6627. -/940: O: O1880 (predict-no)
  6628. I see 1 and I'm going to do: predict-no
  6629. ENV: Agent did: predict-no for direction U in state State-B
  6630. In State-B moving U
  6631. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6632. predict error 0
  6633. dir: dir isR
  6634. |\941: O: O1882 (predict-no)
  6635. I see 1 and I'm going to do: predict-no
  6636. ENV: Agent did: predict-no for direction R in state State-B
  6637. In State-B moving R
  6638. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6639. predict error 0
  6640. dir: dir isU
  6641. -942: O: O1884 (predict-no)
  6642. I see 1 and I'm going to do: predict-no
  6643. ENV: Agent did: predict-no for direction U in state State-B
  6644. In State-B moving U
  6645. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6646. predict error 0
  6647. dir: dir isU
  6648. /|\943: O: O1886 (predict-no)
  6649. I see 1 and I'm going to do: predict-no
  6650. ENV: Agent did: predict-no for direction U in state State-B
  6651. In State-B moving U
  6652. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6653. predict error 0
  6654. dir: dir isL
  6655. -/944: O: O1887 (predict-yes)
  6656. I see 1 and I'm going to do: predict-yes
  6657. ENV: Agent did: predict-yes for direction L in state State-B
  6658. In State-B moving L
  6659. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6660. predict error 0
  6661. dir: dir isR
  6662. |\945: O: O1889 (predict-yes)
  6663. I see 1 and I'm going to do: predict-yes
  6664. ENV: Agent did: predict-yes for direction R in state State-A
  6665. In State-A moving R
  6666. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6667. predict error 0
  6668. dir: dir isU
  6669. -/|946: O: O1892 (predict-no)
  6670. I see 1 and I'm going to do: predict-no
  6671. ENV: Agent did: predict-no for direction U in state State-B
  6672. In State-B moving U
  6673. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6674. predict error 0
  6675. dir: dir isR
  6676. \-/947: O: O1894 (predict-no)
  6677. I see 1 and I'm going to do: predict-no
  6678. ENV: Agent did: predict-no for direction R in state State-B
  6679. In State-B moving R
  6680. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6681. predict error 0
  6682. dir: dir isR
  6683. |\-948: O: O1896 (predict-no)
  6684. I see 1 and I'm going to do: predict-no
  6685. ENV: Agent did: predict-no for direction R in state State-B
  6686. In State-B moving R
  6687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6688. predict error 0
  6689. dir: dir isR
  6690. /|\949: O: O1898 (predict-no)
  6691. I see 1 and I'm going to do: predict-no
  6692. ENV: Agent did: predict-no for direction R in state State-B
  6693. In State-B moving R
  6694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6695. predict error 0
  6696. dir: dir isU
  6697. -/|950: O: O1900 (predict-no)
  6698. I see 1 and I'm going to do: predict-no
  6699. ENV: Agent did: predict-no for direction U in state State-B
  6700. In State-B moving U
  6701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6702. predict error 0
  6703. dir: dir isU
  6704. \-/|\-/|\-/--- Input Phase ---
  6705. =>WM: (13307: I2 ^dir U)
  6706. =>WM: (13306: I2 ^reward 1)
  6707. =>WM: (13305: I2 ^see 0)
  6708. =>WM: (13304: N950 ^status complete)
  6709. <=WM: (13293: I2 ^dir U)
  6710. <=WM: (13292: I2 ^reward 1)
  6711. <=WM: (13291: I2 ^see 0)
  6712. =>WM: (13308: I2 ^level-1 R0-root)
  6713. <=WM: (13294: I2 ^level-1 R0-root)
  6714. --- END Input Phase ---
  6715. --- Proposal Phase ---
  6716. --- Inner Elaboration Phase, active level 1 (S1) ---
  6717. Firing elaborate*copy-see-to-output-link
  6718. -->
  6719. (I3 ^see 0 +)
  6720. Firing elaborate*reward*based*on*reward
  6721. -->
  6722. (R954 ^value 1 +)
  6723. (R1 ^reward R954 +)
  6724. Firing propose*predict-yes
  6725. -->
  6726. (O1901 ^name predict-yes +)
  6727. (S1 ^operator O1901 +)
  6728. Firing propose*predict-no
  6729. -->
  6730. (O1902 ^name predict-no +)
  6731. (S1 ^operator O1902 +)
  6732. Firing rl*prefer*rvt*predict-no*H0*4
  6733. -->
  6734. (S1 ^operator O1900 = 0.9999999999999999)
  6735. Firing rl*prefer*rvt*predict-yes*H0*3
  6736. -->
  6737. (S1 ^operator O1899 = 0.)
  6738. Firing prefer*rvt*predict-yes*H0
  6739. -->
  6740. Firing prefer*rvt*predict-no*H0
  6741. -->
  6742. Firing elaborate*copy-dir-to-output-link
  6743. -->
  6744. (I3 ^dir U +)
  6745. inner elaboration loop at bottom goal.
  6746. Retracting elaborate*copy-see-to-output-link
  6747. -->
  6748. (I3 ^see 0 +)
  6749. Retracting propose*predict-no
  6750. -->
  6751. (O1900 ^name predict-no +)
  6752. (S1 ^operator O1900 +)
  6753. Retracting propose*predict-yes
  6754. -->
  6755. (O1899 ^name predict-yes +)
  6756. (S1 ^operator O1899 +)
  6757. Retracting elaborate*reward*based*on*reward
  6758. -->
  6759. (R953 ^value 1 +)
  6760. (R1 ^reward R953 +)
  6761. Retracting elaborate*copy-dir-to-output-link
  6762. -->
  6763. (I3 ^dir U +)
  6764. Retracting rl*prefer*rvt*predict-no*H0*4
  6765. -->
  6766. (S1 ^operator O1900 = 0.9999999999999999)
  6767. Retracting rl*prefer*rvt*predict-yes*H0*3
  6768. -->
  6769. (S1 ^operator O1899 = 0.)
  6770. =>WM: (13314: S1 ^operator O1902 +)
  6771. =>WM: (13313: S1 ^operator O1901 +)
  6772. =>WM: (13312: O1902 ^name predict-no)
  6773. =>WM: (13311: O1901 ^name predict-yes)
  6774. =>WM: (13310: R954 ^value 1)
  6775. =>WM: (13309: R1 ^reward R954)
  6776. <=WM: (13300: S1 ^operator O1899 +)
  6777. <=WM: (13301: S1 ^operator O1900 +)
  6778. <=WM: (13302: S1 ^operator O1900)
  6779. <=WM: (13295: R1 ^reward R953)
  6780. <=WM: (13298: O1900 ^name predict-no)
  6781. <=WM: (13297: O1899 ^name predict-yes)
  6782. <=WM: (13296: R953 ^value 1)
  6783. --- Inner Elaboration Phase, active level 1 (S1) ---
  6784. Firing prefer*rvt*predict-yes*H0
  6785. -->
  6786. Firing rl*prefer*rvt*predict-yes*H0*3
  6787. -->
  6788. (S1 ^operator O1901 = 0.)
  6789. Firing prefer*rvt*predict-no*H0
  6790. -->
  6791. Firing rl*prefer*rvt*predict-no*H0*4
  6792. -->
  6793. (S1 ^operator O1902 = 0.9999999999999999)
  6794. inner elaboration loop at bottom goal.
  6795. Retracting rl*prefer*rvt*predict-no*H0*4
  6796. -->
  6797. (S1 ^operator O1900 = 0.9999999999999999)
  6798. Retracting rl*prefer*rvt*predict-yes*H0*3
  6799. -->
  6800. (S1 ^operator O1899 = 0.)
  6801. --- END Proposal Phase ---
  6802. --- Decision Phase ---
  6803. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6804. =>WM: (13315: S1 ^operator O1902)
  6805. 951: O: O1902 (predict-no)
  6806. --- END Decision Phase ---
  6807. --- Application Phase ---
  6808. --- Firing Productions (PE) For State At Depth 1 ---
  6809. --- Inner Elaboration Phase, active level 1 (S1) ---
  6810. Firing apply*operator
  6811. -->
  6812. (I3 ^predict-no N951 + :O )
  6813. Firing apply*operator*complete
  6814. -->
  6815. (I3 ^predict-no N950 - :O )
  6816. inner elaboration loop at bottom goal.
  6817. --- Change Working Memory (PE) ---
  6818. =>WM: (13316: I3 ^predict-no N951)
  6819. <=WM: (13304: N950 ^status complete)
  6820. <=WM: (13303: I3 ^predict-no N950)
  6821. --- Firing Productions (IE) For State At Depth 1 ---
  6822. --- Inner Elaboration Phase, active level 1 (S1) ---
  6823. Firing monitor*world
  6824. -->
  6825. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6826. --- Change Working Memory (IE) ---
  6827. --- END Application Phase ---
  6828. --- Output Phase ---
  6829. ENV: Agent did: predict-no for direction U in state State-B
  6830. In State-B moving U
  6831. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6832. predict error 0
  6833. dir: dir isL
  6834. --- END Output Phase ---
  6835. |--- Input Phase ---
  6836. =>WM: (13320: I2 ^dir L)
  6837. =>WM: (13319: I2 ^reward 1)
  6838. =>WM: (13318: I2 ^see 0)
  6839. =>WM: (13317: N951 ^status complete)
  6840. <=WM: (13307: I2 ^dir U)
  6841. <=WM: (13306: I2 ^reward 1)
  6842. <=WM: (13305: I2 ^see 0)
  6843. =>WM: (13321: I2 ^level-1 R0-root)
  6844. <=WM: (13308: I2 ^level-1 R0-root)
  6845. --- END Input Phase ---
  6846. --- Proposal Phase ---
  6847. --- Inner Elaboration Phase, active level 1 (S1) ---
  6848. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  6849. -->
  6850. (S1 ^operator O1901 = 0.6597530378637458)
  6851. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  6852. -->
  6853. (S1 ^operator O1902 = 0.133561435542329)
  6854. Firing prefer*rvt*predict-no*H0*2*H1
  6855. -->
  6856. Firing prefer*rvt*predict-yes*H0*1*H1
  6857. -->
  6858. Firing elaborate*copy-see-to-output-link
  6859. -->
  6860. (I3 ^see 0 +)
  6861. Firing elaborate*reward*based*on*reward
  6862. -->
  6863. (R955 ^value 1 +)
  6864. (R1 ^reward R955 +)
  6865. Firing propose*predict-yes
  6866. -->
  6867. (O1903 ^name predict-yes +)
  6868. (S1 ^operator O1903 +)
  6869. Firing propose*predict-no
  6870. -->
  6871. (O1904 ^name predict-no +)
  6872. (S1 ^operator O1904 +)
  6873. Firing rl*prefer*rvt*predict-no*H0*2
  6874. -->
  6875. (S1 ^operator O1902 = 0.3212981720332201)
  6876. Firing rl*prefer*rvt*predict-yes*H0*1
  6877. -->
  6878. (S1 ^operator O1901 = 0.3402462579366619)
  6879. Firing prefer*rvt*predict-yes*H0
  6880. -->
  6881. Firing prefer*rvt*predict-no*H0
  6882. -->
  6883. Firing elaborate*copy-dir-to-output-link
  6884. -->
  6885. (I3 ^dir L +)
  6886. inner elaboration loop at bottom goal.
  6887. Retracting elaborate*copy-see-to-output-link
  6888. -->
  6889. (I3 ^see 0 +)
  6890. Retracting propose*predict-no
  6891. -->
  6892. (O1902 ^name predict-no +)
  6893. (S1 ^operator O1902 +)
  6894. Retracting propose*predict-yes
  6895. -->
  6896. (O1901 ^name predict-yes +)
  6897. (S1 ^operator O1901 +)
  6898. Retracting elaborate*reward*based*on*reward
  6899. -->
  6900. (R954 ^value 1 +)
  6901. (R1 ^reward R954 +)
  6902. Retracting elaborate*copy-dir-to-output-link
  6903. -->
  6904. (I3 ^dir U +)
  6905. Retracting rl*prefer*rvt*predict-no*H0*4
  6906. -->
  6907. (S1 ^operator O1902 = 0.9999999999999999)
  6908. Retracting rl*prefer*rvt*predict-yes*H0*3
  6909. -->
  6910. (S1 ^operator O1901 = 0.)
  6911. =>WM: (13328: S1 ^operator O1904 +)
  6912. =>WM: (13327: S1 ^operator O1903 +)
  6913. =>WM: (13326: I3 ^dir L)
  6914. =>WM: (13325: O1904 ^name predict-no)
  6915. =>WM: (13324: O1903 ^name predict-yes)
  6916. =>WM: (13323: R955 ^value 1)
  6917. =>WM: (13322: R1 ^reward R955)
  6918. <=WM: (13313: S1 ^operator O1901 +)
  6919. <=WM: (13314: S1 ^operator O1902 +)
  6920. <=WM: (13315: S1 ^operator O1902)
  6921. <=WM: (13299: I3 ^dir U)
  6922. <=WM: (13309: R1 ^reward R954)
  6923. <=WM: (13312: O1902 ^name predict-no)
  6924. <=WM: (13311: O1901 ^name predict-yes)
  6925. <=WM: (13310: R954 ^value 1)
  6926. --- Inner Elaboration Phase, active level 1 (S1) ---
  6927. Firing prefer*rvt*predict-yes*H0
  6928. -->
  6929. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  6930. -->
  6931. (S1 ^operator O1903 = 0.6597530378637458)
  6932. Firing rl*prefer*rvt*predict-yes*H0*1
  6933. -->
  6934. (S1 ^operator O1903 = 0.3402462579366619)
  6935. Firing prefer*rvt*predict-yes*H0*1*H1
  6936. -->
  6937. Firing prefer*rvt*predict-no*H0
  6938. -->
  6939. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  6940. -->
  6941. (S1 ^operator O1904 = 0.133561435542329)
  6942. Firing rl*prefer*rvt*predict-no*H0*2
  6943. -->
  6944. (S1 ^operator O1904 = 0.3212981720332201)
  6945. Firing prefer*rvt*predict-no*H0*2*H1
  6946. -->
  6947. inner elaboration loop at bottom goal.
  6948. Retracting rl*prefer*rvt*predict-no*H0*2
  6949. -->
  6950. (S1 ^operator O1902 = 0.3212981720332201)
  6951. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  6952. -->
  6953. (S1 ^operator O1902 = 0.133561435542329)
  6954. Retracting rl*prefer*rvt*predict-yes*H0*1
  6955. -->
  6956. (S1 ^operator O1901 = 0.3402462579366619)
  6957. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  6958. -->
  6959. (S1 ^operator O1901 = 0.6597530378637458)
  6960. --- END Proposal Phase ---
  6961. --- Decision Phase ---
  6962. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6963. =>WM: (13329: S1 ^operator O1903)
  6964. 952: O: O1903 (predict-yes)
  6965. --- END Decision Phase ---
  6966. --- Application Phase ---
  6967. --- Firing Productions (PE) For State At Depth 1 ---
  6968. --- Inner Elaboration Phase, active level 1 (S1) ---
  6969. Firing apply*operator
  6970. -->
  6971. (I3 ^predict-yes N952 + :O )
  6972. Firing apply*operator*complete
  6973. -->
  6974. (I3 ^predict-no N951 - :O )
  6975. inner elaboration loop at bottom goal.
  6976. --- Change Working Memory (PE) ---
  6977. =>WM: (13330: I3 ^predict-yes N952)
  6978. <=WM: (13317: N951 ^status complete)
  6979. <=WM: (13316: I3 ^predict-no N951)
  6980. --- Firing Productions (IE) For State At Depth 1 ---
  6981. --- Inner Elaboration Phase, active level 1 (S1) ---
  6982. Firing monitor*world
  6983. -->
  6984. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  6985. --- Change Working Memory (IE) ---
  6986. --- END Application Phase ---
  6987. --- Output Phase ---
  6988. ENV: Agent did: predict-yes for direction L in state State-B
  6989. In State-B moving L
  6990. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6991. predict error 0
  6992. dir: dir isR
  6993. --- END Output Phase ---
  6994. \-/--- Input Phase ---
  6995. =>WM: (13334: I2 ^dir R)
  6996. =>WM: (13333: I2 ^reward 1)
  6997. =>WM: (13332: I2 ^see 1)
  6998. =>WM: (13331: N952 ^status complete)
  6999. <=WM: (13320: I2 ^dir L)
  7000. <=WM: (13319: I2 ^reward 1)
  7001. <=WM: (13318: I2 ^see 0)
  7002. =>WM: (13335: I2 ^level-1 L1-root)
  7003. <=WM: (13321: I2 ^level-1 R0-root)
  7004. --- END Input Phase ---
  7005. --- Proposal Phase ---
  7006. --- Inner Elaboration Phase, active level 1 (S1) ---
  7007. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  7008. -->
  7009. (S1 ^operator O1903 = 0.8879101996662896)
  7010. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  7011. -->
  7012. (S1 ^operator O1904 = 0.02370016355578053)
  7013. Firing prefer*rvt*predict-no*H0*6*H1
  7014. -->
  7015. Firing prefer*rvt*predict-yes*H0*5*H1
  7016. -->
  7017. Firing elaborate*copy-see-to-output-link
  7018. -->
  7019. (I3 ^see 1 +)
  7020. Firing elaborate*reward*based*on*reward
  7021. -->
  7022. (R956 ^value 1 +)
  7023. (R1 ^reward R956 +)
  7024. Firing propose*predict-yes
  7025. -->
  7026. (O1905 ^name predict-yes +)
  7027. (S1 ^operator O1905 +)
  7028. Firing propose*predict-no
  7029. -->
  7030. (O1906 ^name predict-no +)
  7031. (S1 ^operator O1906 +)
  7032. Firing rl*prefer*rvt*predict-no*H0*6
  7033. -->
  7034. (S1 ^operator O1904 = 0.3993329903418046)
  7035. Firing rl*prefer*rvt*predict-yes*H0*5
  7036. -->
  7037. (S1 ^operator O1903 = 0.1121099638010357)
  7038. Firing prefer*rvt*predict-yes*H0
  7039. -->
  7040. Firing prefer*rvt*predict-no*H0
  7041. -->
  7042. Firing elaborate*copy-dir-to-output-link
  7043. -->
  7044. (I3 ^dir R +)
  7045. inner elaboration loop at bottom goal.
  7046. Retracting elaborate*copy-see-to-output-link
  7047. -->
  7048. (I3 ^see 0 +)
  7049. Retracting propose*predict-no
  7050. -->
  7051. (O1904 ^name predict-no +)
  7052. (S1 ^operator O1904 +)
  7053. Retracting propose*predict-yes
  7054. -->
  7055. (O1903 ^name predict-yes +)
  7056. (S1 ^operator O1903 +)
  7057. Retracting elaborate*reward*based*on*reward
  7058. -->
  7059. (R955 ^value 1 +)
  7060. (R1 ^reward R955 +)
  7061. Retracting elaborate*copy-dir-to-output-link
  7062. -->
  7063. (I3 ^dir L +)
  7064. Retracting rl*prefer*rvt*predict-no*H0*2
  7065. -->
  7066. (S1 ^operator O1904 = 0.3212981720332201)
  7067. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  7068. -->
  7069. (S1 ^operator O1904 = 0.133561435542329)
  7070. Retracting rl*prefer*rvt*predict-yes*H0*1
  7071. -->
  7072. (S1 ^operator O1903 = 0.3402462579366619)
  7073. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  7074. -->
  7075. (S1 ^operator O1903 = 0.6597530378637458)
  7076. =>WM: (13343: S1 ^operator O1906 +)
  7077. =>WM: (13342: S1 ^operator O1905 +)
  7078. =>WM: (13341: I3 ^dir R)
  7079. =>WM: (13340: O1906 ^name predict-no)
  7080. =>WM: (13339: O1905 ^name predict-yes)
  7081. =>WM: (13338: R956 ^value 1)
  7082. =>WM: (13337: R1 ^reward R956)
  7083. =>WM: (13336: I3 ^see 1)
  7084. <=WM: (13327: S1 ^operator O1903 +)
  7085. <=WM: (13329: S1 ^operator O1903)
  7086. <=WM: (13328: S1 ^operator O1904 +)
  7087. <=WM: (13326: I3 ^dir L)
  7088. <=WM: (13322: R1 ^reward R955)
  7089. <=WM: (13254: I3 ^see 0)
  7090. <=WM: (13325: O1904 ^name predict-no)
  7091. <=WM: (13324: O1903 ^name predict-yes)
  7092. <=WM: (13323: R955 ^value 1)
  7093. --- Inner Elaboration Phase, active level 1 (S1) ---
  7094. Firing prefer*rvt*predict-yes*H0
  7095. -->
  7096. Firing rl*prefer*rvt*predict-yes*H0*5
  7097. -->
  7098. (S1 ^operator O1905 = 0.1121099638010357)
  7099. Firing prefer*rvt*predict-yes*H0*5*H1
  7100. -->
  7101. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  7102. -->
  7103. (S1 ^operator O1905 = 0.8879101996662896)
  7104. Firing prefer*rvt*predict-no*H0
  7105. -->
  7106. Firing rl*prefer*rvt*predict-no*H0*6
  7107. -->
  7108. (S1 ^operator O1906 = 0.3993329903418046)
  7109. Firing prefer*rvt*predict-no*H0*6*H1
  7110. -->
  7111. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  7112. -->
  7113. (S1 ^operator O1906 = 0.02370016355578053)
  7114. inner elaboration loop at bottom goal.
  7115. Retracting rl*prefer*rvt*predict-no*H0*6
  7116. -->
  7117. (S1 ^operator O1904 = 0.3993329903418046)
  7118. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  7119. -->
  7120. (S1 ^operator O1904 = 0.02370016355578053)
  7121. Retracting rl*prefer*rvt*predict-yes*H0*5
  7122. -->
  7123. (S1 ^operator O1903 = 0.1121099638010357)
  7124. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  7125. -->
  7126. (S1 ^operator O1903 = 0.8879101996662896)
  7127. --- END Proposal Phase ---
  7128. --- Decision Phase ---
  7129. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.890323,0.0982824)
  7130. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
  7131. =>WM: (13344: S1 ^operator O1905)
  7132. 953: O: O1905 (predict-yes)
  7133. --- END Decision Phase ---
  7134. --- Application Phase ---
  7135. --- Firing Productions (PE) For State At Depth 1 ---
  7136. --- Inner Elaboration Phase, active level 1 (S1) ---
  7137. Firing apply*operator
  7138. -->
  7139. (I3 ^predict-yes N953 + :O )
  7140. Firing apply*operator*complete
  7141. -->
  7142. (I3 ^predict-yes N952 - :O )
  7143. inner elaboration loop at bottom goal.
  7144. --- Change Working Memory (PE) ---
  7145. =>WM: (13345: I3 ^predict-yes N953)
  7146. <=WM: (13331: N952 ^status complete)
  7147. <=WM: (13330: I3 ^predict-yes N952)
  7148. --- Firing Productions (IE) For State At Depth 1 ---
  7149. --- Inner Elaboration Phase, active level 1 (S1) ---
  7150. Firing monitor*world
  7151. -->
  7152. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7153. --- Change Working Memory (IE) ---
  7154. --- END Application Phase ---
  7155. --- Output Phase ---
  7156. ENV: Agent did: predict-yes for direction R in state State-A
  7157. In State-A moving R
  7158. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7159. predict error 0
  7160. dir: dir isR
  7161. --- END Output Phase ---
  7162. |\---- Input Phase ---
  7163. =>WM: (13349: I2 ^dir R)
  7164. =>WM: (13348: I2 ^reward 1)
  7165. =>WM: (13347: I2 ^see 1)
  7166. =>WM: (13346: N953 ^status complete)
  7167. <=WM: (13334: I2 ^dir R)
  7168. <=WM: (13333: I2 ^reward 1)
  7169. <=WM: (13332: I2 ^see 1)
  7170. =>WM: (13350: I2 ^level-1 R1-root)
  7171. <=WM: (13335: I2 ^level-1 L1-root)
  7172. --- END Input Phase ---
  7173. --- Proposal Phase ---
  7174. --- Inner Elaboration Phase, active level 1 (S1) ---
  7175. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  7176. -->
  7177. (S1 ^operator O1906 = 0.6006773674757838)
  7178. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  7179. -->
  7180. (S1 ^operator O1905 = 0.1602187148382515)
  7181. Firing prefer*rvt*predict-no*H0*6*H1
  7182. -->
  7183. Firing prefer*rvt*predict-yes*H0*5*H1
  7184. -->
  7185. Firing elaborate*copy-see-to-output-link
  7186. -->
  7187. (I3 ^see 1 +)
  7188. Firing elaborate*reward*based*on*reward
  7189. -->
  7190. (R957 ^value 1 +)
  7191. (R1 ^reward R957 +)
  7192. Firing propose*predict-yes
  7193. -->
  7194. (O1907 ^name predict-yes +)
  7195. (S1 ^operator O1907 +)
  7196. Firing propose*predict-no
  7197. -->
  7198. (O1908 ^name predict-no +)
  7199. (S1 ^operator O1908 +)
  7200. Firing rl*prefer*rvt*predict-no*H0*6
  7201. -->
  7202. (S1 ^operator O1906 = 0.3993329903418046)
  7203. Firing rl*prefer*rvt*predict-yes*H0*5
  7204. -->
  7205. (S1 ^operator O1905 = 0.1121099638010357)
  7206. Firing prefer*rvt*predict-yes*H0
  7207. -->
  7208. Firing prefer*rvt*predict-no*H0
  7209. -->
  7210. Firing elaborate*copy-dir-to-output-link
  7211. -->
  7212. (I3 ^dir R +)
  7213. inner elaboration loop at bottom goal.
  7214. Retracting elaborate*copy-see-to-output-link
  7215. -->
  7216. (I3 ^see 1 +)
  7217. Retracting propose*predict-no
  7218. -->
  7219. (O1906 ^name predict-no +)
  7220. (S1 ^operator O1906 +)
  7221. Retracting propose*predict-yes
  7222. -->
  7223. (O1905 ^name predict-yes +)
  7224. (S1 ^operator O1905 +)
  7225. Retracting elaborate*reward*based*on*reward
  7226. -->
  7227. (R956 ^value 1 +)
  7228. (R1 ^reward R956 +)
  7229. Retracting elaborate*copy-dir-to-output-link
  7230. -->
  7231. (I3 ^dir R +)
  7232. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  7233. -->
  7234. (S1 ^operator O1906 = 0.02370016355578053)
  7235. Retracting rl*prefer*rvt*predict-no*H0*6
  7236. -->
  7237. (S1 ^operator O1906 = 0.3993329903418046)
  7238. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  7239. -->
  7240. (S1 ^operator O1905 = 0.8879101996662896)
  7241. Retracting rl*prefer*rvt*predict-yes*H0*5
  7242. -->
  7243. (S1 ^operator O1905 = 0.1121099638010357)
  7244. =>WM: (13356: S1 ^operator O1908 +)
  7245. =>WM: (13355: S1 ^operator O1907 +)
  7246. =>WM: (13354: O1908 ^name predict-no)
  7247. =>WM: (13353: O1907 ^name predict-yes)
  7248. =>WM: (13352: R957 ^value 1)
  7249. =>WM: (13351: R1 ^reward R957)
  7250. <=WM: (13342: S1 ^operator O1905 +)
  7251. <=WM: (13344: S1 ^operator O1905)
  7252. <=WM: (13343: S1 ^operator O1906 +)
  7253. <=WM: (13337: R1 ^reward R956)
  7254. <=WM: (13340: O1906 ^name predict-no)
  7255. <=WM: (13339: O1905 ^name predict-yes)
  7256. <=WM: (13338: R956 ^value 1)
  7257. --- Inner Elaboration Phase, active level 1 (S1) ---
  7258. Firing prefer*rvt*predict-yes*H0
  7259. -->
  7260. Firing rl*prefer*rvt*predict-yes*H0*5
  7261. -->
  7262. (S1 ^operator O1907 = 0.1121099638010357)
  7263. Firing prefer*rvt*predict-yes*H0*5*H1
  7264. -->
  7265. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  7266. -->
  7267. (S1 ^operator O1907 = 0.1602187148382515)
  7268. Firing prefer*rvt*predict-no*H0
  7269. -->
  7270. Firing rl*prefer*rvt*predict-no*H0*6
  7271. -->
  7272. (S1 ^operator O1908 = 0.3993329903418046)
  7273. Firing prefer*rvt*predict-no*H0*6*H1
  7274. -->
  7275. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  7276. -->
  7277. (S1 ^operator O1908 = 0.6006773674757838)
  7278. inner elaboration loop at bottom goal.
  7279. Retracting rl*prefer*rvt*predict-no*H0*6
  7280. -->
  7281. (S1 ^operator O1906 = 0.3993329903418046)
  7282. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  7283. -->
  7284. (S1 ^operator O1906 = 0.6006773674757838)
  7285. Retracting rl*prefer*rvt*predict-yes*H0*5
  7286. -->
  7287. (S1 ^operator O1905 = 0.1121099638010357)
  7288. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  7289. -->
  7290. (S1 ^operator O1905 = 0.1602187148382515)
  7291. --- END Proposal Phase ---
  7292. --- Decision Phase ---
  7293. RL update rl*prefer*rvt*predict-yes*H0*5 0.619034 -0.506924 0.11211 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.895425,0.0942552)
  7294. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.38099 0.50692 0.88791 -> 0.380987 0.506921 0.887907(R,m,v=1,1,0)
  7295. =>WM: (13357: S1 ^operator O1908)
  7296. 954: O: O1908 (predict-no)
  7297. --- END Decision Phase ---
  7298. --- Application Phase ---
  7299. --- Firing Productions (PE) For State At Depth 1 ---
  7300. --- Inner Elaboration Phase, active level 1 (S1) ---
  7301. Firing apply*operator
  7302. -->
  7303. (I3 ^predict-no N954 + :O )
  7304. Firing apply*operator*complete
  7305. -->
  7306. (I3 ^predict-yes N953 - :O )
  7307. inner elaboration loop at bottom goal.
  7308. --- Change Working Memory (PE) ---
  7309. =>WM: (13358: I3 ^predict-no N954)
  7310. <=WM: (13346: N953 ^status complete)
  7311. <=WM: (13345: I3 ^predict-yes N953)
  7312. --- Firing Productions (IE) For State At Depth 1 ---
  7313. --- Inner Elaboration Phase, active level 1 (S1) ---
  7314. Firing monitor*world
  7315. -->
  7316. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7317. --- Change Working Memory (IE) ---
  7318. --- END Application Phase ---
  7319. --- Output Phase ---
  7320. ENV: Agent did: predict-no for direction R in state State-B
  7321. In State-B moving R
  7322. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7323. predict error 0
  7324. dir: dir isU
  7325. --- END Output Phase ---
  7326. /|\--- Input Phase ---
  7327. =>WM: (13362: I2 ^dir U)
  7328. =>WM: (13361: I2 ^reward 1)
  7329. =>WM: (13360: I2 ^see 0)
  7330. =>WM: (13359: N954 ^status complete)
  7331. <=WM: (13349: I2 ^dir R)
  7332. <=WM: (13348: I2 ^reward 1)
  7333. <=WM: (13347: I2 ^see 1)
  7334. =>WM: (13363: I2 ^level-1 R0-root)
  7335. <=WM: (13350: I2 ^level-1 R1-root)
  7336. --- END Input Phase ---
  7337. --- Proposal Phase ---
  7338. --- Inner Elaboration Phase, active level 1 (S1) ---
  7339. Firing elaborate*copy-see-to-output-link
  7340. -->
  7341. (I3 ^see 0 +)
  7342. Firing elaborate*reward*based*on*reward
  7343. -->
  7344. (R958 ^value 1 +)
  7345. (R1 ^reward R958 +)
  7346. Firing propose*predict-yes
  7347. -->
  7348. (O1909 ^name predict-yes +)
  7349. (S1 ^operator O1909 +)
  7350. Firing propose*predict-no
  7351. -->
  7352. (O1910 ^name predict-no +)
  7353. (S1 ^operator O1910 +)
  7354. Firing rl*prefer*rvt*predict-no*H0*4
  7355. -->
  7356. (S1 ^operator O1908 = 0.9999999999999999)
  7357. Firing rl*prefer*rvt*predict-yes*H0*3
  7358. -->
  7359. (S1 ^operator O1907 = 0.)
  7360. Firing prefer*rvt*predict-yes*H0
  7361. -->
  7362. Firing prefer*rvt*predict-no*H0
  7363. -->
  7364. Firing elaborate*copy-dir-to-output-link
  7365. -->
  7366. (I3 ^dir U +)
  7367. inner elaboration loop at bottom goal.
  7368. Retracting elaborate*copy-see-to-output-link
  7369. -->
  7370. (I3 ^see 1 +)
  7371. Retracting propose*predict-no
  7372. -->
  7373. (O1908 ^name predict-no +)
  7374. (S1 ^operator O1908 +)
  7375. Retracting propose*predict-yes
  7376. -->
  7377. (O1907 ^name predict-yes +)
  7378. (S1 ^operator O1907 +)
  7379. Retracting elaborate*reward*based*on*reward
  7380. -->
  7381. (R957 ^value 1 +)
  7382. (R1 ^reward R957 +)
  7383. Retracting elaborate*copy-dir-to-output-link
  7384. -->
  7385. (I3 ^dir R +)
  7386. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  7387. -->
  7388. (S1 ^operator O1908 = 0.6006773674757838)
  7389. Retracting rl*prefer*rvt*predict-no*H0*6
  7390. -->
  7391. (S1 ^operator O1908 = 0.3993329903418046)
  7392. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  7393. -->
  7394. (S1 ^operator O1907 = 0.1602187148382515)
  7395. Retracting rl*prefer*rvt*predict-yes*H0*5
  7396. -->
  7397. (S1 ^operator O1907 = 0.112106939280937)
  7398. =>WM: (13371: S1 ^operator O1910 +)
  7399. =>WM: (13370: S1 ^operator O1909 +)
  7400. =>WM: (13369: I3 ^dir U)
  7401. =>WM: (13368: O1910 ^name predict-no)
  7402. =>WM: (13367: O1909 ^name predict-yes)
  7403. =>WM: (13366: R958 ^value 1)
  7404. =>WM: (13365: R1 ^reward R958)
  7405. =>WM: (13364: I3 ^see 0)
  7406. <=WM: (13355: S1 ^operator O1907 +)
  7407. <=WM: (13356: S1 ^operator O1908 +)
  7408. <=WM: (13357: S1 ^operator O1908)
  7409. <=WM: (13341: I3 ^dir R)
  7410. <=WM: (13351: R1 ^reward R957)
  7411. <=WM: (13336: I3 ^see 1)
  7412. <=WM: (13354: O1908 ^name predict-no)
  7413. <=WM: (13353: O1907 ^name predict-yes)
  7414. <=WM: (13352: R957 ^value 1)
  7415. --- Inner Elaboration Phase, active level 1 (S1) ---
  7416. Firing prefer*rvt*predict-yes*H0
  7417. -->
  7418. Firing rl*prefer*rvt*predict-yes*H0*3
  7419. -->
  7420. (S1 ^operator O1909 = 0.)
  7421. Firing prefer*rvt*predict-no*H0
  7422. -->
  7423. Firing rl*prefer*rvt*predict-no*H0*4
  7424. -->
  7425. (S1 ^operator O1910 = 0.9999999999999999)
  7426. inner elaboration loop at bottom goal.
  7427. Retracting rl*prefer*rvt*predict-no*H0*4
  7428. -->
  7429. (S1 ^operator O1908 = 0.9999999999999999)
  7430. Retracting rl*prefer*rvt*predict-yes*H0*3
  7431. -->
  7432. (S1 ^operator O1907 = 0.)
  7433. --- END Proposal Phase ---
  7434. --- Decision Phase ---
  7435. RL update rl*prefer*rvt*predict-no*H0*6 0.558041 -0.158708 0.399333 -> 0.55804 -0.158708 0.399331(R,m,v=1,0.926829,0.0682328)
  7436. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441968 0.158709 0.600677 -> 0.441967 0.158709 0.600676(R,m,v=1,1,0)
  7437. =>WM: (13372: S1 ^operator O1910)
  7438. 955: O: O1910 (predict-no)
  7439. --- END Decision Phase ---
  7440. --- Application Phase ---
  7441. --- Firing Productions (PE) For State At Depth 1 ---
  7442. --- Inner Elaboration Phase, active level 1 (S1) ---
  7443. Firing apply*operator
  7444. -->
  7445. (I3 ^predict-no N955 + :O )
  7446. Firing apply*operator*complete
  7447. -->
  7448. (I3 ^predict-no N954 - :O )
  7449. inner elaboration loop at bottom goal.
  7450. --- Change Working Memory (PE) ---
  7451. =>WM: (13373: I3 ^predict-no N955)
  7452. <=WM: (13359: N954 ^status complete)
  7453. <=WM: (13358: I3 ^predict-no N954)
  7454. --- Firing Productions (IE) For State At Depth 1 ---
  7455. --- Inner Elaboration Phase, active level 1 (S1) ---
  7456. Firing monitor*world
  7457. -->
  7458. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7459. --- Change Working Memory (IE) ---
  7460. --- END Application Phase ---
  7461. --- Output Phase ---
  7462. ENV: Agent did: predict-no for direction U in state State-B
  7463. In State-B moving U
  7464. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7465. predict error 0
  7466. dir: dir isL
  7467. --- END Output Phase ---
  7468. -/|--- Input Phase ---
  7469. =>WM: (13377: I2 ^dir L)
  7470. =>WM: (13376: I2 ^reward 1)
  7471. =>WM: (13375: I2 ^see 0)
  7472. =>WM: (13374: N955 ^status complete)
  7473. <=WM: (13362: I2 ^dir U)
  7474. <=WM: (13361: I2 ^reward 1)
  7475. <=WM: (13360: I2 ^see 0)
  7476. =>WM: (13378: I2 ^level-1 R0-root)
  7477. <=WM: (13363: I2 ^level-1 R0-root)
  7478. --- END Input Phase ---
  7479. --- Proposal Phase ---
  7480. --- Inner Elaboration Phase, active level 1 (S1) ---
  7481. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  7482. -->
  7483. (S1 ^operator O1909 = 0.6597531434936846)
  7484. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  7485. -->
  7486. (S1 ^operator O1910 = 0.133561435542329)
  7487. Firing prefer*rvt*predict-no*H0*2*H1
  7488. -->
  7489. Firing prefer*rvt*predict-yes*H0*1*H1
  7490. -->
  7491. Firing elaborate*copy-see-to-output-link
  7492. -->
  7493. (I3 ^see 0 +)
  7494. Firing elaborate*reward*based*on*reward
  7495. -->
  7496. (R959 ^value 1 +)
  7497. (R1 ^reward R959 +)
  7498. Firing propose*predict-yes
  7499. -->
  7500. (O1911 ^name predict-yes +)
  7501. (S1 ^operator O1911 +)
  7502. Firing propose*predict-no
  7503. -->
  7504. (O1912 ^name predict-no +)
  7505. (S1 ^operator O1912 +)
  7506. Firing rl*prefer*rvt*predict-no*H0*2
  7507. -->
  7508. (S1 ^operator O1910 = 0.3212981720332201)
  7509. Firing rl*prefer*rvt*predict-yes*H0*1
  7510. -->
  7511. (S1 ^operator O1909 = 0.3402463635666008)
  7512. Firing prefer*rvt*predict-yes*H0
  7513. -->
  7514. Firing prefer*rvt*predict-no*H0
  7515. -->
  7516. Firing elaborate*copy-dir-to-output-link
  7517. -->
  7518. (I3 ^dir L +)
  7519. inner elaboration loop at bottom goal.
  7520. Retracting elaborate*copy-see-to-output-link
  7521. -->
  7522. (I3 ^see 0 +)
  7523. Retracting propose*predict-no
  7524. -->
  7525. (O1910 ^name predict-no +)
  7526. (S1 ^operator O1910 +)
  7527. Retracting propose*predict-yes
  7528. -->
  7529. (O1909 ^name predict-yes +)
  7530. (S1 ^operator O1909 +)
  7531. Retracting elaborate*reward*based*on*reward
  7532. -->
  7533. (R958 ^value 1 +)
  7534. (R1 ^reward R958 +)
  7535. Retracting elaborate*copy-dir-to-output-link
  7536. -->
  7537. (I3 ^dir U +)
  7538. Retracting rl*prefer*rvt*predict-no*H0*4
  7539. -->
  7540. (S1 ^operator O1910 = 0.9999999999999999)
  7541. Retracting rl*prefer*rvt*predict-yes*H0*3
  7542. -->
  7543. (S1 ^operator O1909 = 0.)
  7544. =>WM: (13385: S1 ^operator O1912 +)
  7545. =>WM: (13384: S1 ^operator O1911 +)
  7546. =>WM: (13383: I3 ^dir L)
  7547. =>WM: (13382: O1912 ^name predict-no)
  7548. =>WM: (13381: O1911 ^name predict-yes)
  7549. =>WM: (13380: R959 ^value 1)
  7550. =>WM: (13379: R1 ^reward R959)
  7551. <=WM: (13370: S1 ^operator O1909 +)
  7552. <=WM: (13371: S1 ^operator O1910 +)
  7553. <=WM: (13372: S1 ^operator O1910)
  7554. <=WM: (13369: I3 ^dir U)
  7555. <=WM: (13365: R1 ^reward R958)
  7556. <=WM: (13368: O1910 ^name predict-no)
  7557. <=WM: (13367: O1909 ^name predict-yes)
  7558. <=WM: (13366: R958 ^value 1)
  7559. --- Inner Elaboration Phase, active level 1 (S1) ---
  7560. Firing prefer*rvt*predict-yes*H0
  7561. -->
  7562. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  7563. -->
  7564. (S1 ^operator O1911 = 0.6597531434936846)
  7565. Firing rl*prefer*rvt*predict-yes*H0*1
  7566. -->
  7567. (S1 ^operator O1911 = 0.3402463635666008)
  7568. Firing prefer*rvt*predict-yes*H0*1*H1
  7569. -->
  7570. Firing prefer*rvt*predict-no*H0
  7571. -->
  7572. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  7573. -->
  7574. (S1 ^operator O1912 = 0.133561435542329)
  7575. Firing rl*prefer*rvt*predict-no*H0*2
  7576. -->
  7577. (S1 ^operator O1912 = 0.3212981720332201)
  7578. Firing prefer*rvt*predict-no*H0*2*H1
  7579. -->
  7580. inner elaboration loop at bottom goal.
  7581. Retracting rl*prefer*rvt*predict-no*H0*2
  7582. -->
  7583. (S1 ^operator O1910 = 0.3212981720332201)
  7584. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  7585. -->
  7586. (S1 ^operator O1910 = 0.133561435542329)
  7587. Retracting rl*prefer*rvt*predict-yes*H0*1
  7588. -->
  7589. (S1 ^operator O1909 = 0.3402463635666008)
  7590. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  7591. -->
  7592. (S1 ^operator O1909 = 0.6597531434936846)
  7593. --- END Proposal Phase ---
  7594. --- Decision Phase ---
  7595. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7596. =>WM: (13386: S1 ^operator O1911)
  7597. 956: O: O1911 (predict-yes)
  7598. --- END Decision Phase ---
  7599. --- Application Phase ---
  7600. --- Firing Productions (PE) For State At Depth 1 ---
  7601. --- Inner Elaboration Phase, active level 1 (S1) ---
  7602. Firing apply*operator
  7603. -->
  7604. (I3 ^predict-yes N956 + :O )
  7605. Firing apply*operator*complete
  7606. -->
  7607. (I3 ^predict-no N955 - :O )
  7608. inner elaboration loop at bottom goal.
  7609. --- Change Working Memory (PE) ---
  7610. =>WM: (13387: I3 ^predict-yes N956)
  7611. <=WM: (13374: N955 ^status complete)
  7612. <=WM: (13373: I3 ^predict-no N955)
  7613. --- Firing Productions (IE) For State At Depth 1 ---
  7614. --- Inner Elaboration Phase, active level 1 (S1) ---
  7615. Firing monitor*world
  7616. -->
  7617. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7618. --- Change Working Memory (IE) ---
  7619. --- END Application Phase ---
  7620. --- Output Phase ---
  7621. ENV: Agent did: predict-yes for direction L in state State-B
  7622. In State-B moving L
  7623. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7624. predict error 0
  7625. dir: dir isL
  7626. --- END Output Phase ---
  7627. \-/--- Input Phase ---
  7628. =>WM: (13391: I2 ^dir L)
  7629. =>WM: (13390: I2 ^reward 1)
  7630. =>WM: (13389: I2 ^see 1)
  7631. =>WM: (13388: N956 ^status complete)
  7632. <=WM: (13377: I2 ^dir L)
  7633. <=WM: (13376: I2 ^reward 1)
  7634. <=WM: (13375: I2 ^see 0)
  7635. =>WM: (13392: I2 ^level-1 L1-root)
  7636. <=WM: (13378: I2 ^level-1 R0-root)
  7637. --- END Input Phase ---
  7638. --- Proposal Phase ---
  7639. --- Inner Elaboration Phase, active level 1 (S1) ---
  7640. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  7641. -->
  7642. (S1 ^operator O1911 = 0.02884852834965246)
  7643. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  7644. -->
  7645. (S1 ^operator O1912 = 0.6787497288432303)
  7646. Firing prefer*rvt*predict-no*H0*2*H1
  7647. -->
  7648. Firing prefer*rvt*predict-yes*H0*1*H1
  7649. -->
  7650. Firing elaborate*copy-see-to-output-link
  7651. -->
  7652. (I3 ^see 1 +)
  7653. Firing elaborate*reward*based*on*reward
  7654. -->
  7655. (R960 ^value 1 +)
  7656. (R1 ^reward R960 +)
  7657. Firing propose*predict-yes
  7658. -->
  7659. (O1913 ^name predict-yes +)
  7660. (S1 ^operator O1913 +)
  7661. Firing propose*predict-no
  7662. -->
  7663. (O1914 ^name predict-no +)
  7664. (S1 ^operator O1914 +)
  7665. Firing rl*prefer*rvt*predict-no*H0*2
  7666. -->
  7667. (S1 ^operator O1912 = 0.3212981720332201)
  7668. Firing rl*prefer*rvt*predict-yes*H0*1
  7669. -->
  7670. (S1 ^operator O1911 = 0.3402463635666008)
  7671. Firing prefer*rvt*predict-yes*H0
  7672. -->
  7673. Firing prefer*rvt*predict-no*H0
  7674. -->
  7675. Firing elaborate*copy-dir-to-output-link
  7676. -->
  7677. (I3 ^dir L +)
  7678. inner elaboration loop at bottom goal.
  7679. Retracting elaborate*copy-see-to-output-link
  7680. -->
  7681. (I3 ^see 0 +)
  7682. Retracting propose*predict-no
  7683. -->
  7684. (O1912 ^name predict-no +)
  7685. (S1 ^operator O1912 +)
  7686. Retracting propose*predict-yes
  7687. -->
  7688. (O1911 ^name predict-yes +)
  7689. (S1 ^operator O1911 +)
  7690. Retracting elaborate*reward*based*on*reward
  7691. -->
  7692. (R959 ^value 1 +)
  7693. (R1 ^reward R959 +)
  7694. Retracting elaborate*copy-dir-to-output-link
  7695. -->
  7696. (I3 ^dir L +)
  7697. Retracting rl*prefer*rvt*predict-no*H0*2
  7698. -->
  7699. (S1 ^operator O1912 = 0.3212981720332201)
  7700. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  7701. -->
  7702. (S1 ^operator O1912 = 0.133561435542329)
  7703. Retracting rl*prefer*rvt*predict-yes*H0*1
  7704. -->
  7705. (S1 ^operator O1911 = 0.3402463635666008)
  7706. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  7707. -->
  7708. (S1 ^operator O1911 = 0.6597531434936846)
  7709. =>WM: (13399: S1 ^operator O1914 +)
  7710. =>WM: (13398: S1 ^operator O1913 +)
  7711. =>WM: (13397: O1914 ^name predict-no)
  7712. =>WM: (13396: O1913 ^name predict-yes)
  7713. =>WM: (13395: R960 ^value 1)
  7714. =>WM: (13394: R1 ^reward R960)
  7715. =>WM: (13393: I3 ^see 1)
  7716. <=WM: (13384: S1 ^operator O1911 +)
  7717. <=WM: (13386: S1 ^operator O1911)
  7718. <=WM: (13385: S1 ^operator O1912 +)
  7719. <=WM: (13379: R1 ^reward R959)
  7720. <=WM: (13364: I3 ^see 0)
  7721. <=WM: (13382: O1912 ^name predict-no)
  7722. <=WM: (13381: O1911 ^name predict-yes)
  7723. <=WM: (13380: R959 ^value 1)
  7724. --- Inner Elaboration Phase, active level 1 (S1) ---
  7725. Firing prefer*rvt*predict-yes*H0
  7726. -->
  7727. Firing rl*prefer*rvt*predict-yes*H0*1
  7728. -->
  7729. (S1 ^operator O1913 = 0.3402463635666008)
  7730. Firing prefer*rvt*predict-yes*H0*1*H1
  7731. -->
  7732. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  7733. -->
  7734. (S1 ^operator O1913 = 0.02884852834965246)
  7735. Firing prefer*rvt*predict-no*H0
  7736. -->
  7737. Firing rl*prefer*rvt*predict-no*H0*2
  7738. -->
  7739. (S1 ^operator O1914 = 0.3212981720332201)
  7740. Firing prefer*rvt*predict-no*H0*2*H1
  7741. -->
  7742. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  7743. -->
  7744. (S1 ^operator O1914 = 0.6787497288432303)
  7745. inner elaboration loop at bottom goal.
  7746. Retracting rl*prefer*rvt*predict-no*H0*2
  7747. -->
  7748. (S1 ^operator O1912 = 0.3212981720332201)
  7749. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  7750. -->
  7751. (S1 ^operator O1912 = 0.6787497288432303)
  7752. Retracting rl*prefer*rvt*predict-yes*H0*1
  7753. -->
  7754. (S1 ^operator O1911 = 0.3402463635666008)
  7755. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  7756. -->
  7757. (S1 ^operator O1911 = 0.02884852834965246)
  7758. --- END Proposal Phase ---
  7759. --- Decision Phase ---
  7760. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577179 -0.236932 0.340246(R,m,v=1,0.891026,0.0977254)
  7761. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
  7762. =>WM: (13400: S1 ^operator O1914)
  7763. 957: O: O1914 (predict-no)
  7764. --- END Decision Phase ---
  7765. --- Application Phase ---
  7766. --- Firing Productions (PE) For State At Depth 1 ---
  7767. --- Inner Elaboration Phase, active level 1 (S1) ---
  7768. Firing apply*operator
  7769. -->
  7770. (I3 ^predict-no N957 + :O )
  7771. Firing apply*operator*complete
  7772. -->
  7773. (I3 ^predict-yes N956 - :O )
  7774. inner elaboration loop at bottom goal.
  7775. --- Change Working Memory (PE) ---
  7776. =>WM: (13401: I3 ^predict-no N957)
  7777. <=WM: (13388: N956 ^status complete)
  7778. <=WM: (13387: I3 ^predict-yes N956)
  7779. --- Firing Productions (IE) For State At Depth 1 ---
  7780. --- Inner Elaboration Phase, active level 1 (S1) ---
  7781. Firing monitor*world
  7782. -->
  7783. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7784. --- Change Working Memory (IE) ---
  7785. --- END Application Phase ---
  7786. --- Output Phase ---
  7787. ENV: Agent did: predict-no for direction L in state State-A
  7788. In State-A moving L
  7789. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7790. predict error 0
  7791. dir: dir isL
  7792. --- END Output Phase ---
  7793. |\---- Input Phase ---
  7794. =>WM: (13405: I2 ^dir L)
  7795. =>WM: (13404: I2 ^reward 1)
  7796. =>WM: (13403: I2 ^see 0)
  7797. =>WM: (13402: N957 ^status complete)
  7798. <=WM: (13391: I2 ^dir L)
  7799. <=WM: (13390: I2 ^reward 1)
  7800. <=WM: (13389: I2 ^see 1)
  7801. =>WM: (13406: I2 ^level-1 L0-root)
  7802. <=WM: (13392: I2 ^level-1 L1-root)
  7803. --- END Input Phase ---
  7804. --- Proposal Phase ---
  7805. --- Inner Elaboration Phase, active level 1 (S1) ---
  7806. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  7807. -->
  7808. (S1 ^operator O1913 = -0.08284880498582387)
  7809. Firing rl*prefer*rvt*predict-no*H0*2*H1*21
  7810. -->
  7811. (S1 ^operator O1914 = 0.6786780143478275)
  7812. Firing prefer*rvt*predict-no*H0*2*H1
  7813. -->
  7814. Firing prefer*rvt*predict-yes*H0*1*H1
  7815. -->
  7816. Firing elaborate*copy-see-to-output-link
  7817. -->
  7818. (I3 ^see 0 +)
  7819. Firing elaborate*reward*based*on*reward
  7820. -->
  7821. (R961 ^value 1 +)
  7822. (R1 ^reward R961 +)
  7823. Firing propose*predict-yes
  7824. -->
  7825. (O1915 ^name predict-yes +)
  7826. (S1 ^operator O1915 +)
  7827. Firing propose*predict-no
  7828. -->
  7829. (O1916 ^name predict-no +)
  7830. (S1 ^operator O1916 +)
  7831. Firing rl*prefer*rvt*predict-no*H0*2
  7832. -->
  7833. (S1 ^operator O1914 = 0.3212981720332201)
  7834. Firing rl*prefer*rvt*predict-yes*H0*1
  7835. -->
  7836. (S1 ^operator O1913 = 0.3402464375075579)
  7837. Firing prefer*rvt*predict-yes*H0
  7838. -->
  7839. Firing prefer*rvt*predict-no*H0
  7840. -->
  7841. Firing elaborate*copy-dir-to-output-link
  7842. -->
  7843. (I3 ^dir L +)
  7844. inner elaboration loop at bottom goal.
  7845. Retracting elaborate*copy-see-to-output-link
  7846. -->
  7847. (I3 ^see 1 +)
  7848. Retracting propose*predict-no
  7849. -->
  7850. (O1914 ^name predict-no +)
  7851. (S1 ^operator O1914 +)
  7852. Retracting propose*predict-yes
  7853. -->
  7854. (O1913 ^name predict-yes +)
  7855. (S1 ^operator O1913 +)
  7856. Retracting elaborate*reward*based*on*reward
  7857. -->
  7858. (R960 ^value 1 +)
  7859. (R1 ^reward R960 +)
  7860. Retracting elaborate*copy-dir-to-output-link
  7861. -->
  7862. (I3 ^dir L +)
  7863. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  7864. -->
  7865. (S1 ^operator O1914 = 0.6787497288432303)
  7866. Retracting rl*prefer*rvt*predict-no*H0*2
  7867. -->
  7868. (S1 ^operator O1914 = 0.3212981720332201)
  7869. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  7870. -->
  7871. (S1 ^operator O1913 = 0.02884852834965246)
  7872. Retracting rl*prefer*rvt*predict-yes*H0*1
  7873. -->
  7874. (S1 ^operator O1913 = 0.3402464375075579)
  7875. =>WM: (13413: S1 ^operator O1916 +)
  7876. =>WM: (13412: S1 ^operator O1915 +)
  7877. =>WM: (13411: O1916 ^name predict-no)
  7878. =>WM: (13410: O1915 ^name predict-yes)
  7879. =>WM: (13409: R961 ^value 1)
  7880. =>WM: (13408: R1 ^reward R961)
  7881. =>WM: (13407: I3 ^see 0)
  7882. <=WM: (13398: S1 ^operator O1913 +)
  7883. <=WM: (13399: S1 ^operator O1914 +)
  7884. <=WM: (13400: S1 ^operator O1914)
  7885. <=WM: (13394: R1 ^reward R960)
  7886. <=WM: (13393: I3 ^see 1)
  7887. <=WM: (13397: O1914 ^name predict-no)
  7888. <=WM: (13396: O1913 ^name predict-yes)
  7889. <=WM: (13395: R960 ^value 1)
  7890. --- Inner Elaboration Phase, active level 1 (S1) ---
  7891. Firing prefer*rvt*predict-yes*H0
  7892. -->
  7893. Firing rl*prefer*rvt*predict-yes*H0*1
  7894. -->
  7895. (S1 ^operator O1915 = 0.3402464375075579)
  7896. Firing prefer*rvt*predict-yes*H0*1*H1
  7897. -->
  7898. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  7899. -->
  7900. (S1 ^operator O1915 = -0.08284880498582387)
  7901. Firing prefer*rvt*predict-no*H0
  7902. -->
  7903. Firing rl*prefer*rvt*predict-no*H0*2
  7904. -->
  7905. (S1 ^operator O1916 = 0.3212981720332201)
  7906. Firing prefer*rvt*predict-no*H0*2*H1
  7907. -->
  7908. Firing rl*prefer*rvt*predict-no*H0*2*H1*21
  7909. -->
  7910. (S1 ^operator O1916 = 0.6786780143478275)
  7911. inner elaboration loop at bottom goal.
  7912. Retracting rl*prefer*rvt*predict-no*H0*2
  7913. -->
  7914. (S1 ^operator O1914 = 0.3212981720332201)
  7915. Retracting rl*prefer*rvt*predict-no*H0*2*H1*21
  7916. -->
  7917. (S1 ^operator O1914 = 0.6786780143478275)
  7918. Retracting rl*prefer*rvt*predict-yes*H0*1
  7919. -->
  7920. (S1 ^operator O1913 = 0.3402464375075579)
  7921. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  7922. -->
  7923. (S1 ^operator O1913 = -0.08284880498582387)
  7924. --- END Proposal Phase ---
  7925. --- Decision Phase ---
  7926. RL update rl*prefer*rvt*predict-no*H0*2 0.641776 -0.320478 0.321298 -> 0.641768 -0.320477 0.321291(R,m,v=1,0.932432,0.0634308)
  7927. RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358272 0.320477 0.67875 -> 0.358265 0.320477 0.678743(R,m,v=1,1,0)
  7928. =>WM: (13414: S1 ^operator O1916)
  7929. 958: O: O1916 (predict-no)
  7930. --- END Decision Phase ---
  7931. --- Application Phase ---
  7932. --- Firing Productions (PE) For State At Depth 1 ---
  7933. --- Inner Elaboration Phase, active level 1 (S1) ---
  7934. Firing apply*operator
  7935. -->
  7936. (I3 ^predict-no N958 + :O )
  7937. Firing apply*operator*complete
  7938. -->
  7939. (I3 ^predict-no N957 - :O )
  7940. inner elaboration loop at bottom goal.
  7941. --- Change Working Memory (PE) ---
  7942. =>WM: (13415: I3 ^predict-no N958)
  7943. <=WM: (13402: N957 ^status complete)
  7944. <=WM: (13401: I3 ^predict-no N957)
  7945. --- Firing Productions (IE) For State At Depth 1 ---
  7946. --- Inner Elaboration Phase, active level 1 (S1) ---
  7947. Firing monitor*world
  7948. -->
  7949. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7950. --- Change Working Memory (IE) ---
  7951. --- END Application Phase ---
  7952. --- Output Phase ---
  7953. ENV: Agent did: predict-no for direction L in state State-A
  7954. In State-A moving L
  7955. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7956. predict error 0
  7957. dir: dir isR
  7958. --- END Output Phase ---
  7959. /|\--- Input Phase ---
  7960. =>WM: (13419: I2 ^dir R)
  7961. =>WM: (13418: I2 ^reward 1)
  7962. =>WM: (13417: I2 ^see 0)
  7963. =>WM: (13416: N958 ^status complete)
  7964. <=WM: (13405: I2 ^dir L)
  7965. <=WM: (13404: I2 ^reward 1)
  7966. <=WM: (13403: I2 ^see 0)
  7967. =>WM: (13420: I2 ^level-1 L0-root)
  7968. <=WM: (13406: I2 ^level-1 L0-root)
  7969. --- END Input Phase ---
  7970. --- Proposal Phase ---
  7971. --- Inner Elaboration Phase, active level 1 (S1) ---
  7972. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  7973. -->
  7974. (S1 ^operator O1915 = 0.8878774738146793)
  7975. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  7976. -->
  7977. (S1 ^operator O1916 = -0.1957074416057287)
  7978. Firing prefer*rvt*predict-no*H0*6*H1
  7979. -->
  7980. Firing prefer*rvt*predict-yes*H0*5*H1
  7981. -->
  7982. Firing elaborate*copy-see-to-output-link
  7983. -->
  7984. (I3 ^see 0 +)
  7985. Firing elaborate*reward*based*on*reward
  7986. -->
  7987. (R962 ^value 1 +)
  7988. (R1 ^reward R962 +)
  7989. Firing propose*predict-yes
  7990. -->
  7991. (O1917 ^name predict-yes +)
  7992. (S1 ^operator O1917 +)
  7993. Firing propose*predict-no
  7994. -->
  7995. (O1918 ^name predict-no +)
  7996. (S1 ^operator O1918 +)
  7997. Firing rl*prefer*rvt*predict-no*H0*6
  7998. -->
  7999. (S1 ^operator O1916 = 0.3993314366691663)
  8000. Firing rl*prefer*rvt*predict-yes*H0*5
  8001. -->
  8002. (S1 ^operator O1915 = 0.112106939280937)
  8003. Firing prefer*rvt*predict-yes*H0
  8004. -->
  8005. Firing prefer*rvt*predict-no*H0
  8006. -->
  8007. Firing elaborate*copy-dir-to-output-link
  8008. -->
  8009. (I3 ^dir R +)
  8010. inner elaboration loop at bottom goal.
  8011. Retracting elaborate*copy-see-to-output-link
  8012. -->
  8013. (I3 ^see 0 +)
  8014. Retracting propose*predict-no
  8015. -->
  8016. (O1916 ^name predict-no +)
  8017. (S1 ^operator O1916 +)
  8018. Retracting propose*predict-yes
  8019. -->
  8020. (O1915 ^name predict-yes +)
  8021. (S1 ^operator O1915 +)
  8022. Retracting elaborate*reward*based*on*reward
  8023. -->
  8024. (R961 ^value 1 +)
  8025. (R1 ^reward R961 +)
  8026. Retracting elaborate*copy-dir-to-output-link
  8027. -->
  8028. (I3 ^dir L +)
  8029. Retracting rl*prefer*rvt*predict-no*H0*2*H1*21
  8030. -->
  8031. (S1 ^operator O1916 = 0.6786780143478275)
  8032. Retracting rl*prefer*rvt*predict-no*H0*2
  8033. -->
  8034. (S1 ^operator O1916 = 0.3212909869017525)
  8035. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  8036. -->
  8037. (S1 ^operator O1915 = -0.08284880498582387)
  8038. Retracting rl*prefer*rvt*predict-yes*H0*1
  8039. -->
  8040. (S1 ^operator O1915 = 0.3402464375075579)
  8041. =>WM: (13427: S1 ^operator O1918 +)
  8042. =>WM: (13426: S1 ^operator O1917 +)
  8043. =>WM: (13425: I3 ^dir R)
  8044. =>WM: (13424: O1918 ^name predict-no)
  8045. =>WM: (13423: O1917 ^name predict-yes)
  8046. =>WM: (13422: R962 ^value 1)
  8047. =>WM: (13421: R1 ^reward R962)
  8048. <=WM: (13412: S1 ^operator O1915 +)
  8049. <=WM: (13413: S1 ^operator O1916 +)
  8050. <=WM: (13414: S1 ^operator O1916)
  8051. <=WM: (13383: I3 ^dir L)
  8052. <=WM: (13408: R1 ^reward R961)
  8053. <=WM: (13411: O1916 ^name predict-no)
  8054. <=WM: (13410: O1915 ^name predict-yes)
  8055. <=WM: (13409: R961 ^value 1)
  8056. --- Inner Elaboration Phase, active level 1 (S1) ---
  8057. Firing prefer*rvt*predict-yes*H0
  8058. -->
  8059. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  8060. -->
  8061. (S1 ^operator O1917 = 0.8878774738146793)
  8062. Firing rl*prefer*rvt*predict-yes*H0*5
  8063. -->
  8064. (S1 ^operator O1917 = 0.112106939280937)
  8065. Firing prefer*rvt*predict-yes*H0*5*H1
  8066. -->
  8067. Firing prefer*rvt*predict-no*H0
  8068. -->
  8069. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  8070. -->
  8071. (S1 ^operator O1918 = -0.1957074416057287)
  8072. Firing rl*prefer*rvt*predict-no*H0*6
  8073. -->
  8074. (S1 ^operator O1918 = 0.3993314366691663)
  8075. Firing prefer*rvt*predict-no*H0*6*H1
  8076. -->
  8077. inner elaboration loop at bottom goal.
  8078. Retracting rl*prefer*rvt*predict-no*H0*6
  8079. -->
  8080. (S1 ^operator O1916 = 0.3993314366691663)
  8081. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  8082. -->
  8083. (S1 ^operator O1916 = -0.1957074416057287)
  8084. Retracting rl*prefer*rvt*predict-yes*H0*5
  8085. -->
  8086. (S1 ^operator O1915 = 0.112106939280937)
  8087. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  8088. -->
  8089. (S1 ^operator O1915 = 0.8878774738146793)
  8090. --- END Proposal Phase ---
  8091. --- Decision Phase ---
  8092. RL update rl*prefer*rvt*predict-no*H0*2 0.641768 -0.320477 0.321291 -> 0.641773 -0.320478 0.321296(R,m,v=1,0.932886,0.0630328)
  8093. RL update rl*prefer*rvt*predict-no*H0*2*H1*21 0.3582 0.320478 0.678678 -> 0.358205 0.320478 0.678683(R,m,v=1,1,0)
  8094. =>WM: (13428: S1 ^operator O1917)
  8095. 959: O: O1917 (predict-yes)
  8096. --- END Decision Phase ---
  8097. --- Application Phase ---
  8098. --- Firing Productions (PE) For State At Depth 1 ---
  8099. --- Inner Elaboration Phase, active level 1 (S1) ---
  8100. Firing apply*operator
  8101. -->
  8102. (I3 ^predict-yes N959 + :O )
  8103. Firing apply*operator*complete
  8104. -->
  8105. (I3 ^predict-no N958 - :O )
  8106. inner elaboration loop at bottom goal.
  8107. --- Change Working Memory (PE) ---
  8108. =>WM: (13429: I3 ^predict-yes N959)
  8109. <=WM: (13416: N958 ^status complete)
  8110. <=WM: (13415: I3 ^predict-no N958)
  8111. --- Firing Productions (IE) For State At Depth 1 ---
  8112. --- Inner Elaboration Phase, active level 1 (S1) ---
  8113. Firing monitor*world
  8114. -->
  8115. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8116. --- Change Working Memory (IE) ---
  8117. --- END Application Phase ---
  8118. --- Output Phase ---
  8119. ENV: Agent did: predict-yes for direction R in state State-A
  8120. In State-A moving R
  8121. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8122. predict error 0
  8123. dir: dir isU
  8124. --- END Output Phase ---
  8125. -/|--- Input Phase ---
  8126. =>WM: (13433: I2 ^dir U)
  8127. =>WM: (13432: I2 ^reward 1)
  8128. =>WM: (13431: I2 ^see 1)
  8129. =>WM: (13430: N959 ^status complete)
  8130. <=WM: (13419: I2 ^dir R)
  8131. <=WM: (13418: I2 ^reward 1)
  8132. <=WM: (13417: I2 ^see 0)
  8133. =>WM: (13434: I2 ^level-1 R1-root)
  8134. <=WM: (13420: I2 ^level-1 L0-root)
  8135. --- END Input Phase ---
  8136. --- Proposal Phase ---
  8137. --- Inner Elaboration Phase, active level 1 (S1) ---
  8138. Firing elaborate*copy-see-to-output-link
  8139. -->
  8140. (I3 ^see 1 +)
  8141. Firing elaborate*reward*based*on*reward
  8142. -->
  8143. (R963 ^value 1 +)
  8144. (R1 ^reward R963 +)
  8145. Firing propose*predict-yes
  8146. -->
  8147. (O1919 ^name predict-yes +)
  8148. (S1 ^operator O1919 +)
  8149. Firing propose*predict-no
  8150. -->
  8151. (O1920 ^name predict-no +)
  8152. (S1 ^operator O1920 +)
  8153. Firing rl*prefer*rvt*predict-no*H0*4
  8154. -->
  8155. (S1 ^operator O1918 = 0.9999999999999999)
  8156. Firing rl*prefer*rvt*predict-yes*H0*3
  8157. -->
  8158. (S1 ^operator O1917 = 0.)
  8159. Firing prefer*rvt*predict-yes*H0
  8160. -->
  8161. Firing prefer*rvt*predict-no*H0
  8162. -->
  8163. Firing elaborate*copy-dir-to-output-link
  8164. -->
  8165. (I3 ^dir U +)
  8166. inner elaboration loop at bottom goal.
  8167. Retracting elaborate*copy-see-to-output-link
  8168. -->
  8169. (I3 ^see 0 +)
  8170. Retracting propose*predict-no
  8171. -->
  8172. (O1918 ^name predict-no +)
  8173. (S1 ^operator O1918 +)
  8174. Retracting propose*predict-yes
  8175. -->
  8176. (O1917 ^name predict-yes +)
  8177. (S1 ^operator O1917 +)
  8178. Retracting elaborate*reward*based*on*reward
  8179. -->
  8180. (R962 ^value 1 +)
  8181. (R1 ^reward R962 +)
  8182. Retracting elaborate*copy-dir-to-output-link
  8183. -->
  8184. (I3 ^dir R +)
  8185. Retracting rl*prefer*rvt*predict-no*H0*6
  8186. -->
  8187. (S1 ^operator O1918 = 0.3993314366691663)
  8188. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  8189. -->
  8190. (S1 ^operator O1918 = -0.1957074416057287)
  8191. Retracting rl*prefer*rvt*predict-yes*H0*5
  8192. -->
  8193. (S1 ^operator O1917 = 0.112106939280937)
  8194. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  8195. -->
  8196. (S1 ^operator O1917 = 0.8878774738146793)
  8197. =>WM: (13442: S1 ^operator O1920 +)
  8198. =>WM: (13441: S1 ^operator O1919 +)
  8199. =>WM: (13440: I3 ^dir U)
  8200. =>WM: (13439: O1920 ^name predict-no)
  8201. =>WM: (13438: O1919 ^name predict-yes)
  8202. =>WM: (13437: R963 ^value 1)
  8203. =>WM: (13436: R1 ^reward R963)
  8204. =>WM: (13435: I3 ^see 1)
  8205. <=WM: (13426: S1 ^operator O1917 +)
  8206. <=WM: (13428: S1 ^operator O1917)
  8207. <=WM: (13427: S1 ^operator O1918 +)
  8208. <=WM: (13425: I3 ^dir R)
  8209. <=WM: (13421: R1 ^reward R962)
  8210. <=WM: (13407: I3 ^see 0)
  8211. <=WM: (13424: O1918 ^name predict-no)
  8212. <=WM: (13423: O1917 ^name predict-yes)
  8213. <=WM: (13422: R962 ^value 1)
  8214. --- Inner Elaboration Phase, active level 1 (S1) ---
  8215. Firing prefer*rvt*predict-yes*H0
  8216. -->
  8217. Firing rl*prefer*rvt*predict-yes*H0*3
  8218. -->
  8219. (S1 ^operator O1919 = 0.)
  8220. Firing prefer*rvt*predict-no*H0
  8221. -->
  8222. Firing rl*prefer*rvt*predict-no*H0*4
  8223. -->
  8224. (S1 ^operator O1920 = 0.9999999999999999)
  8225. inner elaboration loop at bottom goal.
  8226. Retracting rl*prefer*rvt*predict-no*H0*4
  8227. -->
  8228. (S1 ^operator O1918 = 0.9999999999999999)
  8229. Retracting rl*prefer*rvt*predict-yes*H0*3
  8230. -->
  8231. (S1 ^operator O1917 = 0.)
  8232. --- END Proposal Phase ---
  8233. --- Decision Phase ---
  8234. RL update rl*prefer*rvt*predict-yes*H0*5 0.61903 -0.506923 0.112107 -> 0.619033 -0.506924 0.112109(R,m,v=1,0.896104,0.0937102)
  8235. RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380951 0.506926 0.887877 -> 0.380954 0.506926 0.88788(R,m,v=1,1,0)
  8236. =>WM: (13443: S1 ^operator O1920)
  8237. 960: O: O1920 (predict-no)
  8238. --- END Decision Phase ---
  8239. --- Application Phase ---
  8240. --- Firing Productions (PE) For State At Depth 1 ---
  8241. --- Inner Elaboration Phase, active level 1 (S1) ---
  8242. Firing apply*operator
  8243. -->
  8244. (I3 ^predict-no N960 + :O )
  8245. Firing apply*operator*complete
  8246. -->
  8247. (I3 ^predict-yes N959 - :O )
  8248. inner elaboration loop at bottom goal.
  8249. --- Change Working Memory (PE) ---
  8250. =>WM: (13444: I3 ^predict-no N960)
  8251. <=WM: (13430: N959 ^status complete)
  8252. <=WM: (13429: I3 ^predict-yes N959)
  8253. --- Firing Productions (IE) For State At Depth 1 ---
  8254. --- Inner Elaboration Phase, active level 1 (S1) ---
  8255. Firing monitor*world
  8256. -->
  8257. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8258. --- Change Working Memory (IE) ---
  8259. --- END Application Phase ---
  8260. --- Output Phase ---
  8261. ENV: Agent did: predict-no for direction U in state State-B
  8262. In State-B moving U
  8263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8264. predict error 0
  8265. dir: dir isU
  8266. --- END Output Phase ---
  8267. \-/--- Input Phase ---
  8268. =>WM: (13448: I2 ^dir U)
  8269. =>WM: (13447: I2 ^reward 1)
  8270. =>WM: (13446: I2 ^see 0)
  8271. =>WM: (13445: N960 ^status complete)
  8272. <=WM: (13433: I2 ^dir U)
  8273. <=WM: (13432: I2 ^reward 1)
  8274. <=WM: (13431: I2 ^see 1)
  8275. =>WM: (13449: I2 ^level-1 R1-root)
  8276. <=WM: (13434: I2 ^level-1 R1-root)
  8277. --- END Input Phase ---
  8278. --- Proposal Phase ---
  8279. --- Inner Elaboration Phase, active level 1 (S1) ---
  8280. Firing elaborate*copy-see-to-output-link
  8281. -->
  8282. (I3 ^see 0 +)
  8283. Firing elaborate*reward*based*on*reward
  8284. -->
  8285. (R964 ^value 1 +)
  8286. (R1 ^reward R964 +)
  8287. Firing propose*predict-yes
  8288. -->
  8289. (O1921 ^name predict-yes +)
  8290. (S1 ^operator O1921 +)
  8291. Firing propose*predict-no
  8292. -->
  8293. (O1922 ^name predict-no +)
  8294. (S1 ^operator O1922 +)
  8295. Firing rl*prefer*rvt*predict-no*H0*4
  8296. -->
  8297. (S1 ^operator O1920 = 0.9999999999999999)
  8298. Firing rl*prefer*rvt*predict-yes*H0*3
  8299. -->
  8300. (S1 ^operator O1919 = 0.)
  8301. Firing prefer*rvt*predict-yes*H0
  8302. -->
  8303. Firing prefer*rvt*predict-no*H0
  8304. -->
  8305. Firing elaborate*copy-dir-to-output-link
  8306. -->
  8307. (I3 ^dir U +)
  8308. inner elaboration loop at bottom goal.
  8309. Retracting elaborate*copy-see-to-output-link
  8310. -->
  8311. (I3 ^see 1 +)
  8312. Retracting propose*predict-no
  8313. -->
  8314. (O1920 ^name predict-no +)
  8315. (S1 ^operator O1920 +)
  8316. Retracting propose*predict-yes
  8317. -->
  8318. (O1919 ^name predict-yes +)
  8319. (S1 ^operator O1919 +)
  8320. Retracting elaborate*reward*based*on*reward
  8321. -->
  8322. (R963 ^value 1 +)
  8323. (R1 ^reward R963 +)
  8324. Retracting elaborate*copy-dir-to-output-link
  8325. -->
  8326. (I3 ^dir U +)
  8327. Retracting rl*prefer*rvt*predict-no*H0*4
  8328. -->
  8329. (S1 ^operator O1920 = 0.9999999999999999)
  8330. Retracting rl*prefer*rvt*predict-yes*H0*3
  8331. -->
  8332. (S1 ^operator O1919 = 0.)
  8333. =>WM: (13456: S1 ^operator O1922 +)
  8334. =>WM: (13455: S1 ^operator O1921 +)
  8335. =>WM: (13454: O1922 ^name predict-no)
  8336. =>WM: (13453: O1921 ^name predict-yes)
  8337. =>WM: (13452: R964 ^value 1)
  8338. =>WM: (13451: R1 ^reward R964)
  8339. =>WM: (13450: I3 ^see 0)
  8340. <=WM: (13441: S1 ^operator O1919 +)
  8341. <=WM: (13442: S1 ^operator O1920 +)
  8342. <=WM: (13443: S1 ^operator O1920)
  8343. <=WM: (13436: R1 ^reward R963)
  8344. <=WM: (13435: I3 ^see 1)
  8345. <=WM: (13439: O1920 ^name predict-no)
  8346. <=WM: (13438: O1919 ^name predict-yes)
  8347. <=WM: (13437: R963 ^value 1)
  8348. --- Inner Elaboration Phase, active level 1 (S1) ---
  8349. Firing prefer*rvt*predict-yes*H0
  8350. -->
  8351. Firing rl*prefer*rvt*predict-yes*H0*3
  8352. -->
  8353. (S1 ^operator O1921 = 0.)
  8354. Firing prefer*rvt*predict-no*H0
  8355. -->
  8356. Firing rl*prefer*rvt*predict-no*H0*4
  8357. -->
  8358. (S1 ^operator O1922 = 0.9999999999999999)
  8359. inner elaboration loop at bottom goal.
  8360. Retracting rl*prefer*rvt*predict-no*H0*4
  8361. -->
  8362. (S1 ^operator O1920 = 0.9999999999999999)
  8363. Retracting rl*prefer*rvt*predict-yes*H0*3
  8364. -->
  8365. (S1 ^operator O1919 = 0.)
  8366. --- END Proposal Phase ---
  8367. --- Decision Phase ---
  8368. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8369. =>WM: (13457: S1 ^operator O1922)
  8370. 961: O: O1922 (predict-no)
  8371. --- END Decision Phase ---
  8372. --- Application Phase ---
  8373. --- Firing Productions (PE) For State At Depth 1 ---
  8374. --- Inner Elaboration Phase, active level 1 (S1) ---
  8375. Firing apply*operator
  8376. -->
  8377. (I3 ^predict-no N961 + :O )
  8378. Firing apply*operator*complete
  8379. -->
  8380. (I3 ^predict-no N960 - :O )
  8381. inner elaboration loop at bottom goal.
  8382. --- Change Working Memory (PE) ---
  8383. =>WM: (13458: I3 ^predict-no N961)
  8384. <=WM: (13445: N960 ^status complete)
  8385. <=WM: (13444: I3 ^predict-no N960)
  8386. --- Firing Productions (IE) For State At Depth 1 ---
  8387. --- Inner Elaboration Phase, active level 1 (S1) ---
  8388. Firing monitor*world
  8389. -->
  8390. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8391. --- Change Working Memory (IE) ---
  8392. --- END Application Phase ---
  8393. --- Output Phase ---
  8394. ENV: Agent did: predict-no for direction U in state State-B
  8395. In State-B moving U
  8396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8397. predict error 0
  8398. dir: dir isU
  8399. --- END Output Phase ---
  8400. |--- Input Phase ---
  8401. =>WM: (13462: I2 ^dir U)
  8402. =>WM: (13461: I2 ^reward 1)
  8403. =>WM: (13460: I2 ^see 0)
  8404. =>WM: (13459: N961 ^status complete)
  8405. <=WM: (13448: I2 ^dir U)
  8406. <=WM: (13447: I2 ^reward 1)
  8407. <=WM: (13446: I2 ^see 0)
  8408. =>WM: (13463: I2 ^level-1 R1-root)
  8409. <=WM: (13449: I2 ^level-1 R1-root)
  8410. --- END Input Phase ---
  8411. --- Proposal Phase ---
  8412. --- Inner Elaboration Phase, active level 1 (S1) ---
  8413. Firing elaborate*copy-see-to-output-link
  8414. -->
  8415. (I3 ^see 0 +)
  8416. Firing elaborate*reward*based*on*reward
  8417. -->
  8418. (R965 ^value 1 +)
  8419. (R1 ^reward R965 +)
  8420. Firing propose*predict-yes
  8421. -->
  8422. (O1923 ^name predict-yes +)
  8423. (S1 ^operator O1923 +)
  8424. Firing propose*predict-no
  8425. -->
  8426. (O1924 ^name predict-no +)
  8427. (S1 ^operator O1924 +)
  8428. Firing rl*prefer*rvt*predict-no*H0*4
  8429. -->
  8430. (S1 ^operator O1922 = 0.9999999999999999)
  8431. Firing rl*prefer*rvt*predict-yes*H0*3
  8432. -->
  8433. (S1 ^operator O1921 = 0.)
  8434. Firing prefer*rvt*predict-yes*H0
  8435. -->
  8436. Firing prefer*rvt*predict-no*H0
  8437. -->
  8438. Firing elaborate*copy-dir-to-output-link
  8439. -->
  8440. (I3 ^dir U +)
  8441. inner elaboration loop at bottom goal.
  8442. Retracting elaborate*copy-see-to-output-link
  8443. -->
  8444. (I3 ^see 0 +)
  8445. Retracting propose*predict-no
  8446. -->
  8447. (O1922 ^name predict-no +)
  8448. (S1 ^operator O1922 +)
  8449. Retracting propose*predict-yes
  8450. -->
  8451. (O1921 ^name predict-yes +)
  8452. (S1 ^operator O1921 +)
  8453. Retracting elaborate*reward*based*on*reward
  8454. -->
  8455. (R964 ^value 1 +)
  8456. (R1 ^reward R964 +)
  8457. Retracting elaborate*copy-dir-to-output-link
  8458. -->
  8459. (I3 ^dir U +)
  8460. Retracting rl*prefer*rvt*predict-no*H0*4
  8461. -->
  8462. (S1 ^operator O1922 = 0.9999999999999999)
  8463. Retracting rl*prefer*rvt*predict-yes*H0*3
  8464. -->
  8465. (S1 ^operator O1921 = 0.)
  8466. =>WM: (13469: S1 ^operator O1924 +)
  8467. =>WM: (13468: S1 ^operator O1923 +)
  8468. =>WM: (13467: O1924 ^name predict-no)
  8469. =>WM: (13466: O1923 ^name predict-yes)
  8470. =>WM: (13465: R965 ^value 1)
  8471. =>WM: (13464: R1 ^reward R965)
  8472. <=WM: (13455: S1 ^operator O1921 +)
  8473. <=WM: (13456: S1 ^operator O1922 +)
  8474. <=WM: (13457: S1 ^operator O1922)
  8475. <=WM: (13451: R1 ^reward R964)
  8476. <=WM: (13454: O1922 ^name predict-no)
  8477. <=WM: (13453: O1921 ^name predict-yes)
  8478. <=WM: (13452: R964 ^value 1)
  8479. --- Inner Elaboration Phase, active level 1 (S1) ---
  8480. Firing prefer*rvt*predict-yes*H0
  8481. -->
  8482. Firing rl*prefer*rvt*predict-yes*H0*3
  8483. -->
  8484. (S1 ^operator O1923 = 0.)
  8485. Firing prefer*rvt*predict-no*H0
  8486. -->
  8487. Firing rl*prefer*rvt*predict-no*H0*4
  8488. -->
  8489. (S1 ^operator O1924 = 0.9999999999999999)
  8490. inner elaboration loop at bottom goal.
  8491. Retracting rl*prefer*rvt*predict-no*H0*4
  8492. -->
  8493. (S1 ^operator O1922 = 0.9999999999999999)
  8494. Retracting rl*prefer*rvt*predict-yes*H0*3
  8495. -->
  8496. (S1 ^operator O1921 = 0.)
  8497. --- END Proposal Phase ---
  8498. --- Decision Phase ---
  8499. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8500. =>WM: (13470: S1 ^operator O1924)
  8501. 962: O: O1924 (predict-no)
  8502. --- END Decision Phase ---
  8503. --- Application Phase ---
  8504. --- Firing Productions (PE) For State At Depth 1 ---
  8505. --- Inner Elaboration Phase, active level 1 (S1) ---
  8506. Firing apply*operator
  8507. -->
  8508. (I3 ^predict-no N962 + :O )
  8509. Firing apply*operator*complete
  8510. -->
  8511. (I3 ^predict-no N961 - :O )
  8512. inner elaboration loop at bottom goal.
  8513. --- Change Working Memory (PE) ---
  8514. =>WM: (13471: I3 ^predict-no N962)
  8515. <=WM: (13459: N961 ^status complete)
  8516. <=WM: (13458: I3 ^predict-no N961)
  8517. --- Firing Productions (IE) For State At Depth 1 ---
  8518. --- Inner Elaboration Phase, active level 1 (S1) ---
  8519. Firing monitor*world
  8520. -->
  8521. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8522. --- Change Working Memory (IE) ---
  8523. --- END Application Phase ---
  8524. --- Output Phase ---
  8525. ENV: Agent did: predict-no for direction U in state State-B
  8526. In State-B moving U
  8527. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8528. predict error 0
  8529. dir: dir isU
  8530. --- END Output Phase ---
  8531. \-/--- Input Phase ---
  8532. =>WM: (13475: I2 ^dir U)
  8533. =>WM: (13474: I2 ^reward 1)
  8534. =>WM: (13473: I2 ^see 0)
  8535. =>WM: (13472: N962 ^status complete)
  8536. <=WM: (13462: I2 ^dir U)
  8537. <=WM: (13461: I2 ^reward 1)
  8538. <=WM: (13460: I2 ^see 0)
  8539. =>WM: (13476: I2 ^level-1 R1-root)
  8540. <=WM: (13463: I2 ^level-1 R1-root)
  8541. --- END Input Phase ---
  8542. --- Proposal Phase ---
  8543. --- Inner Elaboration Phase, active level 1 (S1) ---
  8544. Firing elaborate*copy-see-to-output-link
  8545. -->
  8546. (I3 ^see 0 +)
  8547. Firing elaborate*reward*based*on*reward
  8548. -->
  8549. (R966 ^value 1 +)
  8550. (R1 ^reward R966 +)
  8551. Firing propose*predict-yes
  8552. -->
  8553. (O1925 ^name predict-yes +)
  8554. (S1 ^operator O1925 +)
  8555. Firing propose*predict-no
  8556. -->
  8557. (O1926 ^name predict-no +)
  8558. (S1 ^operator O1926 +)
  8559. Firing rl*prefer*rvt*predict-no*H0*4
  8560. -->
  8561. (S1 ^operator O1924 = 0.9999999999999999)
  8562. Firing rl*prefer*rvt*predict-yes*H0*3
  8563. -->
  8564. (S1 ^operator O1923 = 0.)
  8565. Firing prefer*rvt*predict-yes*H0
  8566. -->
  8567. Firing prefer*rvt*predict-no*H0
  8568. -->
  8569. Firing elaborate*copy-dir-to-output-link
  8570. -->
  8571. (I3 ^dir U +)
  8572. inner elaboration loop at bottom goal.
  8573. Retracting elaborate*copy-see-to-output-link
  8574. -->
  8575. (I3 ^see 0 +)
  8576. Retracting propose*predict-no
  8577. -->
  8578. (O1924 ^name predict-no +)
  8579. (S1 ^operator O1924 +)
  8580. Retracting propose*predict-yes
  8581. -->
  8582. (O1923 ^name predict-yes +)
  8583. (S1 ^operator O1923 +)
  8584. Retracting elaborate*reward*based*on*reward
  8585. -->
  8586. (R965 ^value 1 +)
  8587. (R1 ^reward R965 +)
  8588. Retracting elaborate*copy-dir-to-output-link
  8589. -->
  8590. (I3 ^dir U +)
  8591. Retracting rl*prefer*rvt*predict-no*H0*4
  8592. -->
  8593. (S1 ^operator O1924 = 0.9999999999999999)
  8594. Retracting rl*prefer*rvt*predict-yes*H0*3
  8595. -->
  8596. (S1 ^operator O1923 = 0.)
  8597. =>WM: (13482: S1 ^operator O1926 +)
  8598. =>WM: (13481: S1 ^operator O1925 +)
  8599. =>WM: (13480: O1926 ^name predict-no)
  8600. =>WM: (13479: O1925 ^name predict-yes)
  8601. =>WM: (13478: R966 ^value 1)
  8602. =>WM: (13477: R1 ^reward R966)
  8603. <=WM: (13468: S1 ^operator O1923 +)
  8604. <=WM: (13469: S1 ^operator O1924 +)
  8605. <=WM: (13470: S1 ^operator O1924)
  8606. <=WM: (13464: R1 ^reward R965)
  8607. <=WM: (13467: O1924 ^name predict-no)
  8608. <=WM: (13466: O1923 ^name predict-yes)
  8609. <=WM: (13465: R965 ^value 1)
  8610. --- Inner Elaboration Phase, active level 1 (S1) ---
  8611. Firing prefer*rvt*predict-yes*H0
  8612. -->
  8613. Firing rl*prefer*rvt*predict-yes*H0*3
  8614. -->
  8615. (S1 ^operator O1925 = 0.)
  8616. Firing prefer*rvt*predict-no*H0
  8617. -->
  8618. Firing rl*prefer*rvt*predict-no*H0*4
  8619. -->
  8620. (S1 ^operator O1926 = 0.9999999999999999)
  8621. inner elaboration loop at bottom goal.
  8622. Retracting rl*prefer*rvt*predict-no*H0*4
  8623. -->
  8624. (S1 ^operator O1924 = 0.9999999999999999)
  8625. Retracting rl*prefer*rvt*predict-yes*H0*3
  8626. -->
  8627. (S1 ^operator O1923 = 0.)
  8628. --- END Proposal Phase ---
  8629. --- Decision Phase ---
  8630. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8631. =>WM: (13483: S1 ^operator O1926)
  8632. 963: O: O1926 (predict-no)
  8633. --- END Decision Phase ---
  8634. --- Application Phase ---
  8635. --- Firing Productions (PE) For State At Depth 1 ---
  8636. --- Inner Elaboration Phase, active level 1 (S1) ---
  8637. Firing apply*operator
  8638. -->
  8639. (I3 ^predict-no N963 + :O )
  8640. Firing apply*operator*complete
  8641. -->
  8642. (I3 ^predict-no N962 - :O )
  8643. inner elaboration loop at bottom goal.
  8644. --- Change Working Memory (PE) ---
  8645. =>WM: (13484: I3 ^predict-no N963)
  8646. <=WM: (13472: N962 ^status complete)
  8647. <=WM: (13471: I3 ^predict-no N962)
  8648. --- Firing Productions (IE) For State At Depth 1 ---
  8649. --- Inner Elaboration Phase, active level 1 (S1) ---
  8650. Firing monitor*world
  8651. -->
  8652. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8653. --- Change Working Memory (IE) ---
  8654. --- END Application Phase ---
  8655. --- Output Phase ---
  8656. ENV: Agent did: predict-no for direction U in state State-B
  8657. In State-B moving U
  8658. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8659. predict error 0
  8660. dir: dir isL
  8661. --- END Output Phase ---
  8662. |\---- Input Phase ---
  8663. =>WM: (13488: I2 ^dir L)
  8664. =>WM: (13487: I2 ^reward 1)
  8665. =>WM: (13486: I2 ^see 0)
  8666. =>WM: (13485: N963 ^status complete)
  8667. <=WM: (13475: I2 ^dir U)
  8668. <=WM: (13474: I2 ^reward 1)
  8669. <=WM: (13473: I2 ^see 0)
  8670. =>WM: (13489: I2 ^level-1 R1-root)
  8671. <=WM: (13476: I2 ^level-1 R1-root)
  8672. --- END Input Phase ---
  8673. --- Proposal Phase ---
  8674. --- Inner Elaboration Phase, active level 1 (S1) ---
  8675. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  8676. -->
  8677. (S1 ^operator O1926 = 0.03900899329983293)
  8678. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  8679. -->
  8680. (S1 ^operator O1925 = 0.6597567463960877)
  8681. Firing prefer*rvt*predict-no*H0*2*H1
  8682. -->
  8683. Firing prefer*rvt*predict-yes*H0*1*H1
  8684. -->
  8685. Firing elaborate*copy-see-to-output-link
  8686. -->
  8687. (I3 ^see 0 +)
  8688. Firing elaborate*reward*based*on*reward
  8689. -->
  8690. (R967 ^value 1 +)
  8691. (R1 ^reward R967 +)
  8692. Firing propose*predict-yes
  8693. -->
  8694. (O1927 ^name predict-yes +)
  8695. (S1 ^operator O1927 +)
  8696. Firing propose*predict-no
  8697. -->
  8698. (O1928 ^name predict-no +)
  8699. (S1 ^operator O1928 +)
  8700. Firing rl*prefer*rvt*predict-no*H0*2
  8701. -->
  8702. (S1 ^operator O1926 = 0.3212956367143155)
  8703. Firing rl*prefer*rvt*predict-yes*H0*1
  8704. -->
  8705. (S1 ^operator O1925 = 0.3402464375075579)
  8706. Firing prefer*rvt*predict-yes*H0
  8707. -->
  8708. Firing prefer*rvt*predict-no*H0
  8709. -->
  8710. Firing elaborate*copy-dir-to-output-link
  8711. -->
  8712. (I3 ^dir L +)
  8713. inner elaboration loop at bottom goal.
  8714. Retracting elaborate*copy-see-to-output-link
  8715. -->
  8716. (I3 ^see 0 +)
  8717. Retracting propose*predict-no
  8718. -->
  8719. (O1926 ^name predict-no +)
  8720. (S1 ^operator O1926 +)
  8721. Retracting propose*predict-yes
  8722. -->
  8723. (O1925 ^name predict-yes +)
  8724. (S1 ^operator O1925 +)
  8725. Retracting elaborate*reward*based*on*reward
  8726. -->
  8727. (R966 ^value 1 +)
  8728. (R1 ^reward R966 +)
  8729. Retracting elaborate*copy-dir-to-output-link
  8730. -->
  8731. (I3 ^dir U +)
  8732. Retracting rl*prefer*rvt*predict-no*H0*4
  8733. -->
  8734. (S1 ^operator O1926 = 0.9999999999999999)
  8735. Retracting rl*prefer*rvt*predict-yes*H0*3
  8736. -->
  8737. (S1 ^operator O1925 = 0.)
  8738. =>WM: (13496: S1 ^operator O1928 +)
  8739. =>WM: (13495: S1 ^operator O1927 +)
  8740. =>WM: (13494: I3 ^dir L)
  8741. =>WM: (13493: O1928 ^name predict-no)
  8742. =>WM: (13492: O1927 ^name predict-yes)
  8743. =>WM: (13491: R967 ^value 1)
  8744. =>WM: (13490: R1 ^reward R967)
  8745. <=WM: (13481: S1 ^operator O1925 +)
  8746. <=WM: (13482: S1 ^operator O1926 +)
  8747. <=WM: (13483: S1 ^operator O1926)
  8748. <=WM: (13440: I3 ^dir U)
  8749. <=WM: (13477: R1 ^reward R966)
  8750. <=WM: (13480: O1926 ^name predict-no)
  8751. <=WM: (13479: O1925 ^name predict-yes)
  8752. <=WM: (13478: R966 ^value 1)
  8753. --- Inner Elaboration Phase, active level 1 (S1) ---
  8754. Firing prefer*rvt*predict-yes*H0
  8755. -->
  8756. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  8757. -->
  8758. (S1 ^operator O1927 = 0.6597567463960877)
  8759. Firing rl*prefer*rvt*predict-yes*H0*1
  8760. -->
  8761. (S1 ^operator O1927 = 0.3402464375075579)
  8762. Firing prefer*rvt*predict-yes*H0*1*H1
  8763. -->
  8764. Firing prefer*rvt*predict-no*H0
  8765. -->
  8766. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  8767. -->
  8768. (S1 ^operator O1928 = 0.03900899329983293)
  8769. Firing rl*prefer*rvt*predict-no*H0*2
  8770. -->
  8771. (S1 ^operator O1928 = 0.3212956367143155)
  8772. Firing prefer*rvt*predict-no*H0*2*H1
  8773. -->
  8774. inner elaboration loop at bottom goal.
  8775. Retracting rl*prefer*rvt*predict-no*H0*2
  8776. -->
  8777. (S1 ^operator O1926 = 0.3212956367143155)
  8778. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  8779. -->
  8780. (S1 ^operator O1926 = 0.03900899329983293)
  8781. Retracting rl*prefer*rvt*predict-yes*H0*1
  8782. -->
  8783. (S1 ^operator O1925 = 0.3402464375075579)
  8784. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  8785. -->
  8786. (S1 ^operator O1925 = 0.6597567463960877)
  8787. --- END Proposal Phase ---
  8788. --- Decision Phase ---
  8789. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8790. =>WM: (13497: S1 ^operator O1927)
  8791. 964: O: O1927 (predict-yes)
  8792. --- END Decision Phase ---
  8793. --- Application Phase ---
  8794. --- Firing Productions (PE) For State At Depth 1 ---
  8795. --- Inner Elaboration Phase, active level 1 (S1) ---
  8796. Firing apply*operator
  8797. -->
  8798. (I3 ^predict-yes N964 + :O )
  8799. Firing apply*operator*complete
  8800. -->
  8801. (I3 ^predict-no N963 - :O )
  8802. inner elaboration loop at bottom goal.
  8803. --- Change Working Memory (PE) ---
  8804. =>WM: (13498: I3 ^predict-yes N964)
  8805. <=WM: (13485: N963 ^status complete)
  8806. <=WM: (13484: I3 ^predict-no N963)
  8807. --- Firing Productions (IE) For State At Depth 1 ---
  8808. --- Inner Elaboration Phase, active level 1 (S1) ---
  8809. Firing monitor*world
  8810. -->
  8811. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8812. --- Change Working Memory (IE) ---
  8813. --- END Application Phase ---
  8814. --- Output Phase ---
  8815. ENV: Agent did: predict-yes for direction L in state State-B
  8816. In State-B moving L
  8817. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8818. predict error 0
  8819. dir: dir isR
  8820. --- END Output Phase ---
  8821. /|\--- Input Phase ---
  8822. =>WM: (13502: I2 ^dir R)
  8823. =>WM: (13501: I2 ^reward 1)
  8824. =>WM: (13500: I2 ^see 1)
  8825. =>WM: (13499: N964 ^status complete)
  8826. <=WM: (13488: I2 ^dir L)
  8827. <=WM: (13487: I2 ^reward 1)
  8828. <=WM: (13486: I2 ^see 0)
  8829. =>WM: (13503: I2 ^level-1 L1-root)
  8830. <=WM: (13489: I2 ^level-1 R1-root)
  8831. --- END Input Phase ---
  8832. --- Proposal Phase ---
  8833. --- Inner Elaboration Phase, active level 1 (S1) ---
  8834. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  8835. -->
  8836. (S1 ^operator O1927 = 0.8879071751461909)
  8837. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  8838. -->
  8839. (S1 ^operator O1928 = 0.02370016355578053)
  8840. Firing prefer*rvt*predict-no*H0*6*H1
  8841. -->
  8842. Firing prefer*rvt*predict-yes*H0*5*H1
  8843. -->
  8844. Firing elaborate*copy-see-to-output-link
  8845. -->
  8846. (I3 ^see 1 +)
  8847. Firing elaborate*reward*based*on*reward
  8848. -->
  8849. (R968 ^value 1 +)
  8850. (R1 ^reward R968 +)
  8851. Firing propose*predict-yes
  8852. -->
  8853. (O1929 ^name predict-yes +)
  8854. (S1 ^operator O1929 +)
  8855. Firing propose*predict-no
  8856. -->
  8857. (O1930 ^name predict-no +)
  8858. (S1 ^operator O1930 +)
  8859. Firing rl*prefer*rvt*predict-no*H0*6
  8860. -->
  8861. (S1 ^operator O1928 = 0.3993314366691663)
  8862. Firing rl*prefer*rvt*predict-yes*H0*5
  8863. -->
  8864. (S1 ^operator O1927 = 0.1121092773165946)
  8865. Firing prefer*rvt*predict-yes*H0
  8866. -->
  8867. Firing prefer*rvt*predict-no*H0
  8868. -->
  8869. Firing elaborate*copy-dir-to-output-link
  8870. -->
  8871. (I3 ^dir R +)
  8872. inner elaboration loop at bottom goal.
  8873. Retracting elaborate*copy-see-to-output-link
  8874. -->
  8875. (I3 ^see 0 +)
  8876. Retracting propose*predict-no
  8877. -->
  8878. (O1928 ^name predict-no +)
  8879. (S1 ^operator O1928 +)
  8880. Retracting propose*predict-yes
  8881. -->
  8882. (O1927 ^name predict-yes +)
  8883. (S1 ^operator O1927 +)
  8884. Retracting elaborate*reward*based*on*reward
  8885. -->
  8886. (R967 ^value 1 +)
  8887. (R1 ^reward R967 +)
  8888. Retracting elaborate*copy-dir-to-output-link
  8889. -->
  8890. (I3 ^dir L +)
  8891. Retracting rl*prefer*rvt*predict-no*H0*2
  8892. -->
  8893. (S1 ^operator O1928 = 0.3212956367143155)
  8894. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  8895. -->
  8896. (S1 ^operator O1928 = 0.03900899329983293)
  8897. Retracting rl*prefer*rvt*predict-yes*H0*1
  8898. -->
  8899. (S1 ^operator O1927 = 0.3402464375075579)
  8900. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  8901. -->
  8902. (S1 ^operator O1927 = 0.6597567463960877)
  8903. =>WM: (13511: S1 ^operator O1930 +)
  8904. =>WM: (13510: S1 ^operator O1929 +)
  8905. =>WM: (13509: I3 ^dir R)
  8906. =>WM: (13508: O1930 ^name predict-no)
  8907. =>WM: (13507: O1929 ^name predict-yes)
  8908. =>WM: (13506: R968 ^value 1)
  8909. =>WM: (13505: R1 ^reward R968)
  8910. =>WM: (13504: I3 ^see 1)
  8911. <=WM: (13495: S1 ^operator O1927 +)
  8912. <=WM: (13497: S1 ^operator O1927)
  8913. <=WM: (13496: S1 ^operator O1928 +)
  8914. <=WM: (13494: I3 ^dir L)
  8915. <=WM: (13490: R1 ^reward R967)
  8916. <=WM: (13450: I3 ^see 0)
  8917. <=WM: (13493: O1928 ^name predict-no)
  8918. <=WM: (13492: O1927 ^name predict-yes)
  8919. <=WM: (13491: R967 ^value 1)
  8920. --- Inner Elaboration Phase, active level 1 (S1) ---
  8921. Firing prefer*rvt*predict-yes*H0
  8922. -->
  8923. Firing rl*prefer*rvt*predict-yes*H0*5
  8924. -->
  8925. (S1 ^operator O1929 = 0.1121092773165946)
  8926. Firing prefer*rvt*predict-yes*H0*5*H1
  8927. -->
  8928. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  8929. -->
  8930. (S1 ^operator O1929 = 0.8879071751461909)
  8931. Firing prefer*rvt*predict-no*H0
  8932. -->
  8933. Firing rl*prefer*rvt*predict-no*H0*6
  8934. -->
  8935. (S1 ^operator O1930 = 0.3993314366691663)
  8936. Firing prefer*rvt*predict-no*H0*6*H1
  8937. -->
  8938. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  8939. -->
  8940. (S1 ^operator O1930 = 0.02370016355578053)
  8941. inner elaboration loop at bottom goal.
  8942. Retracting rl*prefer*rvt*predict-no*H0*6
  8943. -->
  8944. (S1 ^operator O1928 = 0.3993314366691663)
  8945. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  8946. -->
  8947. (S1 ^operator O1928 = 0.02370016355578053)
  8948. Retracting rl*prefer*rvt*predict-yes*H0*5
  8949. -->
  8950. (S1 ^operator O1927 = 0.1121092773165946)
  8951. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  8952. -->
  8953. (S1 ^operator O1927 = 0.8879071751461909)
  8954. --- END Proposal Phase ---
  8955. --- Decision Phase ---
  8956. RL update rl*prefer*rvt*predict-yes*H0*1 0.577179 -0.236932 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.89172,0.0971746)
  8957. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236934 0.659757 -> 0.422823 0.236934 0.659756(R,m,v=1,1,0)
  8958. =>WM: (13512: S1 ^operator O1929)
  8959. 965: O: O1929 (predict-yes)
  8960. --- END Decision Phase ---
  8961. --- Application Phase ---
  8962. --- Firing Productions (PE) For State At Depth 1 ---
  8963. --- Inner Elaboration Phase, active level 1 (S1) ---
  8964. Firing apply*operator
  8965. -->
  8966. (I3 ^predict-yes N965 + :O )
  8967. Firing apply*operator*complete
  8968. -->
  8969. (I3 ^predict-yes N964 - :O )
  8970. inner elaboration loop at bottom goal.
  8971. --- Change Working Memory (PE) ---
  8972. =>WM: (13513: I3 ^predict-yes N965)
  8973. <=WM: (13499: N964 ^status complete)
  8974. <=WM: (13498: I3 ^predict-yes N964)
  8975. --- Firing Productions (IE) For State At Depth 1 ---
  8976. --- Inner Elaboration Phase, active level 1 (S1) ---
  8977. Firing monitor*world
  8978. -->
  8979. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8980. --- Change Working Memory (IE) ---
  8981. --- END Application Phase ---
  8982. --- Output Phase ---
  8983. ENV: Agent did: predict-yes for direction R in state State-A
  8984. In State-A moving R
  8985. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8986. predict error 0
  8987. dir: dir isU
  8988. --- END Output Phase ---
  8989. -/|\sleeping...
  8990. ---- Input Phase ---
  8991. =>WM: (13517: I2 ^dir U)
  8992. =>WM: (13516: I2 ^reward 1)
  8993. =>WM: (13515: I2 ^see 1)
  8994. =>WM: (13514: N965 ^status complete)
  8995. <=WM: (13502: I2 ^dir R)
  8996. <=WM: (13501: I2 ^reward 1)
  8997. <=WM: (13500: I2 ^see 1)
  8998. =>WM: (13518: I2 ^level-1 R1-root)
  8999. <=WM: (13503: I2 ^level-1 L1-root)
  9000. --- END Input Phase ---
  9001. --- Proposal Phase ---
  9002. --- Inner Elaboration Phase, active level 1 (S1) ---
  9003. Firing elaborate*copy-see-to-output-link
  9004. -->
  9005. (I3 ^see 1 +)
  9006. Firing elaborate*reward*based*on*reward
  9007. -->
  9008. (R969 ^value 1 +)
  9009. (R1 ^reward R969 +)
  9010. Firing propose*predict-yes
  9011. -->
  9012. (O1931 ^name predict-yes +)
  9013. (S1 ^operator O1931 +)
  9014. Firing propose*predict-no
  9015. -->
  9016. (O1932 ^name predict-no +)
  9017. (S1 ^operator O1932 +)
  9018. Firing rl*prefer*rvt*predict-no*H0*4
  9019. -->
  9020. (S1 ^operator O1930 = 0.9999999999999999)
  9021. Firing rl*prefer*rvt*predict-yes*H0*3
  9022. -->
  9023. (S1 ^operator O1929 = 0.)
  9024. Firing prefer*rvt*predict-yes*H0
  9025. -->
  9026. Firing prefer*rvt*predict-no*H0
  9027. -->
  9028. Firing elaborate*copy-dir-to-output-link
  9029. -->
  9030. (I3 ^dir U +)
  9031. inner elaboration loop at bottom goal.
  9032. Retracting elaborate*copy-see-to-output-link
  9033. -->
  9034. (I3 ^see 1 +)
  9035. Retracting propose*predict-no
  9036. -->
  9037. (O1930 ^name predict-no +)
  9038. (S1 ^operator O1930 +)
  9039. Retracting propose*predict-yes
  9040. -->
  9041. (O1929 ^name predict-yes +)
  9042. (S1 ^operator O1929 +)
  9043. Retracting elaborate*reward*based*on*reward
  9044. -->
  9045. (R968 ^value 1 +)
  9046. (R1 ^reward R968 +)
  9047. Retracting elaborate*copy-dir-to-output-link
  9048. -->
  9049. (I3 ^dir R +)
  9050. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  9051. -->
  9052. (S1 ^operator O1930 = 0.02370016355578053)
  9053. Retracting rl*prefer*rvt*predict-no*H0*6
  9054. -->
  9055. (S1 ^operator O1930 = 0.3993314366691663)
  9056. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  9057. -->
  9058. (S1 ^operator O1929 = 0.8879071751461909)
  9059. Retracting rl*prefer*rvt*predict-yes*H0*5
  9060. -->
  9061. (S1 ^operator O1929 = 0.1121092773165946)
  9062. =>WM: (13525: S1 ^operator O1932 +)
  9063. =>WM: (13524: S1 ^operator O1931 +)
  9064. =>WM: (13523: I3 ^dir U)
  9065. =>WM: (13522: O1932 ^name predict-no)
  9066. =>WM: (13521: O1931 ^name predict-yes)
  9067. =>WM: (13520: R969 ^value 1)
  9068. =>WM: (13519: R1 ^reward R969)
  9069. <=WM: (13510: S1 ^operator O1929 +)
  9070. <=WM: (13512: S1 ^operator O1929)
  9071. <=WM: (13511: S1 ^operator O1930 +)
  9072. <=WM: (13509: I3 ^dir R)
  9073. <=WM: (13505: R1 ^reward R968)
  9074. <=WM: (13508: O1930 ^name predict-no)
  9075. <=WM: (13507: O1929 ^name predict-yes)
  9076. <=WM: (13506: R968 ^value 1)
  9077. --- Inner Elaboration Phase, active level 1 (S1) ---
  9078. Firing prefer*rvt*predict-yes*H0
  9079. -->
  9080. Firing rl*prefer*rvt*predict-yes*H0*3
  9081. -->
  9082. (S1 ^operator O1931 = 0.)
  9083. Firing prefer*rvt*predict-no*H0
  9084. -->
  9085. Firing rl*prefer*rvt*predict-no*H0*4
  9086. -->
  9087. (S1 ^operator O1932 = 0.9999999999999999)
  9088. inner elaboration loop at bottom goal.
  9089. Retracting rl*prefer*rvt*predict-no*H0*4
  9090. -->
  9091. (S1 ^operator O1930 = 0.9999999999999999)
  9092. Retracting rl*prefer*rvt*predict-yes*H0*3
  9093. -->
  9094. (S1 ^operator O1929 = 0.)
  9095. --- END Proposal Phase ---
  9096. --- Decision Phase ---
  9097. RL update rl*prefer*rvt*predict-yes*H0*5 0.619033 -0.506924 0.112109 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.896774,0.0931713)
  9098. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380987 0.506921 0.887907 -> 0.380984 0.506921 0.887905(R,m,v=1,1,0)
  9099. =>WM: (13526: S1 ^operator O1932)
  9100. 966: O: O1932 (predict-no)
  9101. --- END Decision Phase ---
  9102. --- Application Phase ---
  9103. --- Firing Productions (PE) For State At Depth 1 ---
  9104. --- Inner Elaboration Phase, active level 1 (S1) ---
  9105. Firing apply*operator
  9106. -->
  9107. (I3 ^predict-no N966 + :O )
  9108. Firing apply*operator*complete
  9109. -->
  9110. (I3 ^predict-yes N965 - :O )
  9111. inner elaboration loop at bottom goal.
  9112. --- Change Working Memory (PE) ---
  9113. =>WM: (13527: I3 ^predict-no N966)
  9114. <=WM: (13514: N965 ^status complete)
  9115. <=WM: (13513: I3 ^predict-yes N965)
  9116. --- Firing Productions (IE) For State At Depth 1 ---
  9117. --- Inner Elaboration Phase, active level 1 (S1) ---
  9118. Firing monitor*world
  9119. -->
  9120. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9121. --- Change Working Memory (IE) ---
  9122. --- END Application Phase ---
  9123. --- Output Phase ---
  9124. ENV: Agent did: predict-no for direction U in state State-B
  9125. In State-B moving U
  9126. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9127. predict error 0
  9128. dir: dir isL
  9129. --- END Output Phase ---
  9130. /|\--- Input Phase ---
  9131. =>WM: (13531: I2 ^dir L)
  9132. =>WM: (13530: I2 ^reward 1)
  9133. =>WM: (13529: I2 ^see 0)
  9134. =>WM: (13528: N966 ^status complete)
  9135. <=WM: (13517: I2 ^dir U)
  9136. <=WM: (13516: I2 ^reward 1)
  9137. <=WM: (13515: I2 ^see 1)
  9138. =>WM: (13532: I2 ^level-1 R1-root)
  9139. <=WM: (13518: I2 ^level-1 R1-root)
  9140. --- END Input Phase ---
  9141. --- Proposal Phase ---
  9142. --- Inner Elaboration Phase, active level 1 (S1) ---
  9143. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  9144. -->
  9145. (S1 ^operator O1932 = 0.03900899329983293)
  9146. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  9147. -->
  9148. (S1 ^operator O1931 = 0.6597562688105409)
  9149. Firing prefer*rvt*predict-no*H0*2*H1
  9150. -->
  9151. Firing prefer*rvt*predict-yes*H0*1*H1
  9152. -->
  9153. Firing elaborate*copy-see-to-output-link
  9154. -->
  9155. (I3 ^see 0 +)
  9156. Firing elaborate*reward*based*on*reward
  9157. -->
  9158. (R970 ^value 1 +)
  9159. (R1 ^reward R970 +)
  9160. Firing propose*predict-yes
  9161. -->
  9162. (O1933 ^name predict-yes +)
  9163. (S1 ^operator O1933 +)
  9164. Firing propose*predict-no
  9165. -->
  9166. (O1934 ^name predict-no +)
  9167. (S1 ^operator O1934 +)
  9168. Firing rl*prefer*rvt*predict-no*H0*2
  9169. -->
  9170. (S1 ^operator O1932 = 0.3212956367143155)
  9171. Firing rl*prefer*rvt*predict-yes*H0*1
  9172. -->
  9173. (S1 ^operator O1931 = 0.3402459599220111)
  9174. Firing prefer*rvt*predict-yes*H0
  9175. -->
  9176. Firing prefer*rvt*predict-no*H0
  9177. -->
  9178. Firing elaborate*copy-dir-to-output-link
  9179. -->
  9180. (I3 ^dir L +)
  9181. inner elaboration loop at bottom goal.
  9182. Retracting elaborate*copy-see-to-output-link
  9183. -->
  9184. (I3 ^see 1 +)
  9185. Retracting propose*predict-no
  9186. -->
  9187. (O1932 ^name predict-no +)
  9188. (S1 ^operator O1932 +)
  9189. Retracting propose*predict-yes
  9190. -->
  9191. (O1931 ^name predict-yes +)
  9192. (S1 ^operator O1931 +)
  9193. Retracting elaborate*reward*based*on*reward
  9194. -->
  9195. (R969 ^value 1 +)
  9196. (R1 ^reward R969 +)
  9197. Retracting elaborate*copy-dir-to-output-link
  9198. -->
  9199. (I3 ^dir U +)
  9200. Retracting rl*prefer*rvt*predict-no*H0*4
  9201. -->
  9202. (S1 ^operator O1932 = 0.9999999999999999)
  9203. Retracting rl*prefer*rvt*predict-yes*H0*3
  9204. -->
  9205. (S1 ^operator O1931 = 0.)
  9206. =>WM: (13540: S1 ^operator O1934 +)
  9207. =>WM: (13539: S1 ^operator O1933 +)
  9208. =>WM: (13538: I3 ^dir L)
  9209. =>WM: (13537: O1934 ^name predict-no)
  9210. =>WM: (13536: O1933 ^name predict-yes)
  9211. =>WM: (13535: R970 ^value 1)
  9212. =>WM: (13534: R1 ^reward R970)
  9213. =>WM: (13533: I3 ^see 0)
  9214. <=WM: (13524: S1 ^operator O1931 +)
  9215. <=WM: (13525: S1 ^operator O1932 +)
  9216. <=WM: (13526: S1 ^operator O1932)
  9217. <=WM: (13523: I3 ^dir U)
  9218. <=WM: (13519: R1 ^reward R969)
  9219. <=WM: (13504: I3 ^see 1)
  9220. <=WM: (13522: O1932 ^name predict-no)
  9221. <=WM: (13521: O1931 ^name predict-yes)
  9222. <=WM: (13520: R969 ^value 1)
  9223. --- Inner Elaboration Phase, active level 1 (S1) ---
  9224. Firing prefer*rvt*predict-yes*H0
  9225. -->
  9226. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  9227. -->
  9228. (S1 ^operator O1933 = 0.6597562688105409)
  9229. Firing rl*prefer*rvt*predict-yes*H0*1
  9230. -->
  9231. (S1 ^operator O1933 = 0.3402459599220111)
  9232. Firing prefer*rvt*predict-yes*H0*1*H1
  9233. -->
  9234. Firing prefer*rvt*predict-no*H0
  9235. -->
  9236. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  9237. -->
  9238. (S1 ^operator O1934 = 0.03900899329983293)
  9239. Firing rl*prefer*rvt*predict-no*H0*2
  9240. -->
  9241. (S1 ^operator O1934 = 0.3212956367143155)
  9242. Firing prefer*rvt*predict-no*H0*2*H1
  9243. -->
  9244. inner elaboration loop at bottom goal.
  9245. Retracting rl*prefer*rvt*predict-no*H0*2
  9246. -->
  9247. (S1 ^operator O1932 = 0.3212956367143155)
  9248. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  9249. -->
  9250. (S1 ^operator O1932 = 0.03900899329983293)
  9251. Retracting rl*prefer*rvt*predict-yes*H0*1
  9252. -->
  9253. (S1 ^operator O1931 = 0.3402459599220111)
  9254. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  9255. -->
  9256. (S1 ^operator O1931 = 0.6597562688105409)
  9257. --- END Proposal Phase ---
  9258. --- Decision Phase ---
  9259. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9260. =>WM: (13541: S1 ^operator O1933)
  9261. 967: O: O1933 (predict-yes)
  9262. --- END Decision Phase ---
  9263. --- Application Phase ---
  9264. --- Firing Productions (PE) For State At Depth 1 ---
  9265. --- Inner Elaboration Phase, active level 1 (S1) ---
  9266. Firing apply*operator
  9267. -->
  9268. (I3 ^predict-yes N967 + :O )
  9269. Firing apply*operator*complete
  9270. -->
  9271. (I3 ^predict-no N966 - :O )
  9272. inner elaboration loop at bottom goal.
  9273. --- Change Working Memory (PE) ---
  9274. =>WM: (13542: I3 ^predict-yes N967)
  9275. <=WM: (13528: N966 ^status complete)
  9276. <=WM: (13527: I3 ^predict-no N966)
  9277. --- Firing Productions (IE) For State At Depth 1 ---
  9278. --- Inner Elaboration Phase, active level 1 (S1) ---
  9279. Firing monitor*world
  9280. -->
  9281. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9282. --- Change Working Memory (IE) ---
  9283. --- END Application Phase ---
  9284. --- Output Phase ---
  9285. ENV: Agent did: predict-yes for direction L in state State-B
  9286. In State-B moving L
  9287. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9288. predict error 0
  9289. dir: dir isR
  9290. --- END Output Phase ---
  9291. -/|--- Input Phase ---
  9292. =>WM: (13546: I2 ^dir R)
  9293. =>WM: (13545: I2 ^reward 1)
  9294. =>WM: (13544: I2 ^see 1)
  9295. =>WM: (13543: N967 ^status complete)
  9296. <=WM: (13531: I2 ^dir L)
  9297. <=WM: (13530: I2 ^reward 1)
  9298. <=WM: (13529: I2 ^see 0)
  9299. =>WM: (13547: I2 ^level-1 L1-root)
  9300. <=WM: (13532: I2 ^level-1 R1-root)
  9301. --- END Input Phase ---
  9302. --- Proposal Phase ---
  9303. --- Inner Elaboration Phase, active level 1 (S1) ---
  9304. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  9305. -->
  9306. (S1 ^operator O1933 = 0.887904707276773)
  9307. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  9308. -->
  9309. (S1 ^operator O1934 = 0.02370016355578053)
  9310. Firing prefer*rvt*predict-no*H0*6*H1
  9311. -->
  9312. Firing prefer*rvt*predict-yes*H0*5*H1
  9313. -->
  9314. Firing elaborate*copy-see-to-output-link
  9315. -->
  9316. (I3 ^see 1 +)
  9317. Firing elaborate*reward*based*on*reward
  9318. -->
  9319. (R971 ^value 1 +)
  9320. (R1 ^reward R971 +)
  9321. Firing propose*predict-yes
  9322. -->
  9323. (O1935 ^name predict-yes +)
  9324. (S1 ^operator O1935 +)
  9325. Firing propose*predict-no
  9326. -->
  9327. (O1936 ^name predict-no +)
  9328. (S1 ^operator O1936 +)
  9329. Firing rl*prefer*rvt*predict-no*H0*6
  9330. -->
  9331. (S1 ^operator O1934 = 0.3993314366691663)
  9332. Firing rl*prefer*rvt*predict-yes*H0*5
  9333. -->
  9334. (S1 ^operator O1933 = 0.1121068094471768)
  9335. Firing prefer*rvt*predict-yes*H0
  9336. -->
  9337. Firing prefer*rvt*predict-no*H0
  9338. -->
  9339. Firing elaborate*copy-dir-to-output-link
  9340. -->
  9341. (I3 ^dir R +)
  9342. inner elaboration loop at bottom goal.
  9343. Retracting elaborate*copy-see-to-output-link
  9344. -->
  9345. (I3 ^see 0 +)
  9346. Retracting propose*predict-no
  9347. -->
  9348. (O1934 ^name predict-no +)
  9349. (S1 ^operator O1934 +)
  9350. Retracting propose*predict-yes
  9351. -->
  9352. (O1933 ^name predict-yes +)
  9353. (S1 ^operator O1933 +)
  9354. Retracting elaborate*reward*based*on*reward
  9355. -->
  9356. (R970 ^value 1 +)
  9357. (R1 ^reward R970 +)
  9358. Retracting elaborate*copy-dir-to-output-link
  9359. -->
  9360. (I3 ^dir L +)
  9361. Retracting rl*prefer*rvt*predict-no*H0*2
  9362. -->
  9363. (S1 ^operator O1934 = 0.3212956367143155)
  9364. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  9365. -->
  9366. (S1 ^operator O1934 = 0.03900899329983293)
  9367. Retracting rl*prefer*rvt*predict-yes*H0*1
  9368. -->
  9369. (S1 ^operator O1933 = 0.3402459599220111)
  9370. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  9371. -->
  9372. (S1 ^operator O1933 = 0.6597562688105409)
  9373. =>WM: (13555: S1 ^operator O1936 +)
  9374. =>WM: (13554: S1 ^operator O1935 +)
  9375. =>WM: (13553: I3 ^dir R)
  9376. =>WM: (13552: O1936 ^name predict-no)
  9377. =>WM: (13551: O1935 ^name predict-yes)
  9378. =>WM: (13550: R971 ^value 1)
  9379. =>WM: (13549: R1 ^reward R971)
  9380. =>WM: (13548: I3 ^see 1)
  9381. <=WM: (13539: S1 ^operator O1933 +)
  9382. <=WM: (13541: S1 ^operator O1933)
  9383. <=WM: (13540: S1 ^operator O1934 +)
  9384. <=WM: (13538: I3 ^dir L)
  9385. <=WM: (13534: R1 ^reward R970)
  9386. <=WM: (13533: I3 ^see 0)
  9387. <=WM: (13537: O1934 ^name predict-no)
  9388. <=WM: (13536: O1933 ^name predict-yes)
  9389. <=WM: (13535: R970 ^value 1)
  9390. --- Inner Elaboration Phase, active level 1 (S1) ---
  9391. Firing prefer*rvt*predict-yes*H0
  9392. -->
  9393. Firing rl*prefer*rvt*predict-yes*H0*5
  9394. -->
  9395. (S1 ^operator O1935 = 0.1121068094471768)
  9396. Firing prefer*rvt*predict-yes*H0*5*H1
  9397. -->
  9398. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  9399. -->
  9400. (S1 ^operator O1935 = 0.887904707276773)
  9401. Firing prefer*rvt*predict-no*H0
  9402. -->
  9403. Firing rl*prefer*rvt*predict-no*H0*6
  9404. -->
  9405. (S1 ^operator O1936 = 0.3993314366691663)
  9406. Firing prefer*rvt*predict-no*H0*6*H1
  9407. -->
  9408. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  9409. -->
  9410. (S1 ^operator O1936 = 0.02370016355578053)
  9411. inner elaboration loop at bottom goal.
  9412. Retracting rl*prefer*rvt*predict-no*H0*6
  9413. -->
  9414. (S1 ^operator O1934 = 0.3993314366691663)
  9415. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  9416. -->
  9417. (S1 ^operator O1934 = 0.02370016355578053)
  9418. Retracting rl*prefer*rvt*predict-yes*H0*5
  9419. -->
  9420. (S1 ^operator O1933 = 0.1121068094471768)
  9421. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  9422. -->
  9423. (S1 ^operator O1933 = 0.887904707276773)
  9424. --- END Proposal Phase ---
  9425. --- Decision Phase ---
  9426. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.892405,0.0966298)
  9427. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236934 0.659756 -> 0.422823 0.236933 0.659756(R,m,v=1,1,0)
  9428. =>WM: (13556: S1 ^operator O1935)
  9429. 968: O: O1935 (predict-yes)
  9430. --- END Decision Phase ---
  9431. --- Application Phase ---
  9432. --- Firing Productions (PE) For State At Depth 1 ---
  9433. --- Inner Elaboration Phase, active level 1 (S1) ---
  9434. Firing apply*operator
  9435. -->
  9436. (I3 ^predict-yes N968 + :O )
  9437. Firing apply*operator*complete
  9438. -->
  9439. (I3 ^predict-yes N967 - :O )
  9440. inner elaboration loop at bottom goal.
  9441. --- Change Working Memory (PE) ---
  9442. =>WM: (13557: I3 ^predict-yes N968)
  9443. <=WM: (13543: N967 ^status complete)
  9444. <=WM: (13542: I3 ^predict-yes N967)
  9445. --- Firing Productions (IE) For State At Depth 1 ---
  9446. --- Inner Elaboration Phase, active level 1 (S1) ---
  9447. Firing monitor*world
  9448. -->
  9449. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9450. --- Change Working Memory (IE) ---
  9451. --- END Application Phase ---
  9452. --- Output Phase ---
  9453. ENV: Agent did: predict-yes for direction R in state State-A
  9454. In State-A moving R
  9455. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9456. predict error 0
  9457. dir: dir isU
  9458. --- END Output Phase ---
  9459. \-/--- Input Phase ---
  9460. =>WM: (13561: I2 ^dir U)
  9461. =>WM: (13560: I2 ^reward 1)
  9462. =>WM: (13559: I2 ^see 1)
  9463. =>WM: (13558: N968 ^status complete)
  9464. <=WM: (13546: I2 ^dir R)
  9465. <=WM: (13545: I2 ^reward 1)
  9466. <=WM: (13544: I2 ^see 1)
  9467. =>WM: (13562: I2 ^level-1 R1-root)
  9468. <=WM: (13547: I2 ^level-1 L1-root)
  9469. --- END Input Phase ---
  9470. --- Proposal Phase ---
  9471. --- Inner Elaboration Phase, active level 1 (S1) ---
  9472. Firing elaborate*copy-see-to-output-link
  9473. -->
  9474. (I3 ^see 1 +)
  9475. Firing elaborate*reward*based*on*reward
  9476. -->
  9477. (R972 ^value 1 +)
  9478. (R1 ^reward R972 +)
  9479. Firing propose*predict-yes
  9480. -->
  9481. (O1937 ^name predict-yes +)
  9482. (S1 ^operator O1937 +)
  9483. Firing propose*predict-no
  9484. -->
  9485. (O1938 ^name predict-no +)
  9486. (S1 ^operator O1938 +)
  9487. Firing rl*prefer*rvt*predict-no*H0*4
  9488. -->
  9489. (S1 ^operator O1936 = 0.9999999999999999)
  9490. Firing rl*prefer*rvt*predict-yes*H0*3
  9491. -->
  9492. (S1 ^operator O1935 = 0.)
  9493. Firing prefer*rvt*predict-yes*H0
  9494. -->
  9495. Firing prefer*rvt*predict-no*H0
  9496. -->
  9497. Firing elaborate*copy-dir-to-output-link
  9498. -->
  9499. (I3 ^dir U +)
  9500. inner elaboration loop at bottom goal.
  9501. Retracting elaborate*copy-see-to-output-link
  9502. -->
  9503. (I3 ^see 1 +)
  9504. Retracting propose*predict-no
  9505. -->
  9506. (O1936 ^name predict-no +)
  9507. (S1 ^operator O1936 +)
  9508. Retracting propose*predict-yes
  9509. -->
  9510. (O1935 ^name predict-yes +)
  9511. (S1 ^operator O1935 +)
  9512. Retracting elaborate*reward*based*on*reward
  9513. -->
  9514. (R971 ^value 1 +)
  9515. (R1 ^reward R971 +)
  9516. Retracting elaborate*copy-dir-to-output-link
  9517. -->
  9518. (I3 ^dir R +)
  9519. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  9520. -->
  9521. (S1 ^operator O1936 = 0.02370016355578053)
  9522. Retracting rl*prefer*rvt*predict-no*H0*6
  9523. -->
  9524. (S1 ^operator O1936 = 0.3993314366691663)
  9525. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  9526. -->
  9527. (S1 ^operator O1935 = 0.887904707276773)
  9528. Retracting rl*prefer*rvt*predict-yes*H0*5
  9529. -->
  9530. (S1 ^operator O1935 = 0.1121068094471768)
  9531. =>WM: (13569: S1 ^operator O1938 +)
  9532. =>WM: (13568: S1 ^operator O1937 +)
  9533. =>WM: (13567: I3 ^dir U)
  9534. =>WM: (13566: O1938 ^name predict-no)
  9535. =>WM: (13565: O1937 ^name predict-yes)
  9536. =>WM: (13564: R972 ^value 1)
  9537. =>WM: (13563: R1 ^reward R972)
  9538. <=WM: (13554: S1 ^operator O1935 +)
  9539. <=WM: (13556: S1 ^operator O1935)
  9540. <=WM: (13555: S1 ^operator O1936 +)
  9541. <=WM: (13553: I3 ^dir R)
  9542. <=WM: (13549: R1 ^reward R971)
  9543. <=WM: (13552: O1936 ^name predict-no)
  9544. <=WM: (13551: O1935 ^name predict-yes)
  9545. <=WM: (13550: R971 ^value 1)
  9546. --- Inner Elaboration Phase, active level 1 (S1) ---
  9547. Firing prefer*rvt*predict-yes*H0
  9548. -->
  9549. Firing rl*prefer*rvt*predict-yes*H0*3
  9550. -->
  9551. (S1 ^operator O1937 = 0.)
  9552. Firing prefer*rvt*predict-no*H0
  9553. -->
  9554. Firing rl*prefer*rvt*predict-no*H0*4
  9555. -->
  9556. (S1 ^operator O1938 = 0.9999999999999999)
  9557. inner elaboration loop at bottom goal.
  9558. Retracting rl*prefer*rvt*predict-no*H0*4
  9559. -->
  9560. (S1 ^operator O1936 = 0.9999999999999999)
  9561. Retracting rl*prefer*rvt*predict-yes*H0*3
  9562. -->
  9563. (S1 ^operator O1935 = 0.)
  9564. --- END Proposal Phase ---
  9565. --- Decision Phase ---
  9566. RL update rl*prefer*rvt*predict-yes*H0*5 0.61903 -0.506923 0.112107 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.897436,0.0926385)
  9567. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380984 0.506921 0.887905 -> 0.380982 0.506921 0.887903(R,m,v=1,1,0)
  9568. =>WM: (13570: S1 ^operator O1938)
  9569. 969: O: O1938 (predict-no)
  9570. --- END Decision Phase ---
  9571. --- Application Phase ---
  9572. --- Firing Productions (PE) For State At Depth 1 ---
  9573. --- Inner Elaboration Phase, active level 1 (S1) ---
  9574. Firing apply*operator
  9575. -->
  9576. (I3 ^predict-no N969 + :O )
  9577. Firing apply*operator*complete
  9578. -->
  9579. (I3 ^predict-yes N968 - :O )
  9580. inner elaboration loop at bottom goal.
  9581. --- Change Working Memory (PE) ---
  9582. =>WM: (13571: I3 ^predict-no N969)
  9583. <=WM: (13558: N968 ^status complete)
  9584. <=WM: (13557: I3 ^predict-yes N968)
  9585. --- Firing Productions (IE) For State At Depth 1 ---
  9586. --- Inner Elaboration Phase, active level 1 (S1) ---
  9587. Firing monitor*world
  9588. -->
  9589. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9590. --- Change Working Memory (IE) ---
  9591. --- END Application Phase ---
  9592. --- Output Phase ---
  9593. ENV: Agent did: predict-no for direction U in state State-B
  9594. In State-B moving U
  9595. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9596. predict error 0
  9597. dir: dir isL
  9598. --- END Output Phase ---
  9599. |\---- Input Phase ---
  9600. =>WM: (13575: I2 ^dir L)
  9601. =>WM: (13574: I2 ^reward 1)
  9602. =>WM: (13573: I2 ^see 0)
  9603. =>WM: (13572: N969 ^status complete)
  9604. <=WM: (13561: I2 ^dir U)
  9605. <=WM: (13560: I2 ^reward 1)
  9606. <=WM: (13559: I2 ^see 1)
  9607. =>WM: (13576: I2 ^level-1 R1-root)
  9608. <=WM: (13562: I2 ^level-1 R1-root)
  9609. --- END Input Phase ---
  9610. --- Proposal Phase ---
  9611. --- Inner Elaboration Phase, active level 1 (S1) ---
  9612. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  9613. -->
  9614. (S1 ^operator O1938 = 0.03900899329983293)
  9615. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  9616. -->
  9617. (S1 ^operator O1937 = 0.6597559345006581)
  9618. Firing prefer*rvt*predict-no*H0*2*H1
  9619. -->
  9620. Firing prefer*rvt*predict-yes*H0*1*H1
  9621. -->
  9622. Firing elaborate*copy-see-to-output-link
  9623. -->
  9624. (I3 ^see 0 +)
  9625. Firing elaborate*reward*based*on*reward
  9626. -->
  9627. (R973 ^value 1 +)
  9628. (R1 ^reward R973 +)
  9629. Firing propose*predict-yes
  9630. -->
  9631. (O1939 ^name predict-yes +)
  9632. (S1 ^operator O1939 +)
  9633. Firing propose*predict-no
  9634. -->
  9635. (O1940 ^name predict-no +)
  9636. (S1 ^operator O1940 +)
  9637. Firing rl*prefer*rvt*predict-no*H0*2
  9638. -->
  9639. (S1 ^operator O1938 = 0.3212956367143155)
  9640. Firing rl*prefer*rvt*predict-yes*H0*1
  9641. -->
  9642. (S1 ^operator O1937 = 0.3402456256121283)
  9643. Firing prefer*rvt*predict-yes*H0
  9644. -->
  9645. Firing prefer*rvt*predict-no*H0
  9646. -->
  9647. Firing elaborate*copy-dir-to-output-link
  9648. -->
  9649. (I3 ^dir L +)
  9650. inner elaboration loop at bottom goal.
  9651. Retracting elaborate*copy-see-to-output-link
  9652. -->
  9653. (I3 ^see 1 +)
  9654. Retracting propose*predict-no
  9655. -->
  9656. (O1938 ^name predict-no +)
  9657. (S1 ^operator O1938 +)
  9658. Retracting propose*predict-yes
  9659. -->
  9660. (O1937 ^name predict-yes +)
  9661. (S1 ^operator O1937 +)
  9662. Retracting elaborate*reward*based*on*reward
  9663. -->
  9664. (R972 ^value 1 +)
  9665. (R1 ^reward R972 +)
  9666. Retracting elaborate*copy-dir-to-output-link
  9667. -->
  9668. (I3 ^dir U +)
  9669. Retracting rl*prefer*rvt*predict-no*H0*4
  9670. -->
  9671. (S1 ^operator O1938 = 0.9999999999999999)
  9672. Retracting rl*prefer*rvt*predict-yes*H0*3
  9673. -->
  9674. (S1 ^operator O1937 = 0.)
  9675. =>WM: (13584: S1 ^operator O1940 +)
  9676. =>WM: (13583: S1 ^operator O1939 +)
  9677. =>WM: (13582: I3 ^dir L)
  9678. =>WM: (13581: O1940 ^name predict-no)
  9679. =>WM: (13580: O1939 ^name predict-yes)
  9680. =>WM: (13579: R973 ^value 1)
  9681. =>WM: (13578: R1 ^reward R973)
  9682. =>WM: (13577: I3 ^see 0)
  9683. <=WM: (13568: S1 ^operator O1937 +)
  9684. <=WM: (13569: S1 ^operator O1938 +)
  9685. <=WM: (13570: S1 ^operator O1938)
  9686. <=WM: (13567: I3 ^dir U)
  9687. <=WM: (13563: R1 ^reward R972)
  9688. <=WM: (13548: I3 ^see 1)
  9689. <=WM: (13566: O1938 ^name predict-no)
  9690. <=WM: (13565: O1937 ^name predict-yes)
  9691. <=WM: (13564: R972 ^value 1)
  9692. --- Inner Elaboration Phase, active level 1 (S1) ---
  9693. Firing prefer*rvt*predict-yes*H0
  9694. -->
  9695. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  9696. -->
  9697. (S1 ^operator O1939 = 0.6597559345006581)
  9698. Firing rl*prefer*rvt*predict-yes*H0*1
  9699. -->
  9700. (S1 ^operator O1939 = 0.3402456256121283)
  9701. Firing prefer*rvt*predict-yes*H0*1*H1
  9702. -->
  9703. Firing prefer*rvt*predict-no*H0
  9704. -->
  9705. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  9706. -->
  9707. (S1 ^operator O1940 = 0.03900899329983293)
  9708. Firing rl*prefer*rvt*predict-no*H0*2
  9709. -->
  9710. (S1 ^operator O1940 = 0.3212956367143155)
  9711. Firing prefer*rvt*predict-no*H0*2*H1
  9712. -->
  9713. inner elaboration loop at bottom goal.
  9714. Retracting rl*prefer*rvt*predict-no*H0*2
  9715. -->
  9716. (S1 ^operator O1938 = 0.3212956367143155)
  9717. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  9718. -->
  9719. (S1 ^operator O1938 = 0.03900899329983293)
  9720. Retracting rl*prefer*rvt*predict-yes*H0*1
  9721. -->
  9722. (S1 ^operator O1937 = 0.3402456256121283)
  9723. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  9724. -->
  9725. (S1 ^operator O1937 = 0.6597559345006581)
  9726. --- END Proposal Phase ---
  9727. --- Decision Phase ---
  9728. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9729. =>WM: (13585: S1 ^operator O1939)
  9730. 970: O: O1939 (predict-yes)
  9731. --- END Decision Phase ---
  9732. --- Application Phase ---
  9733. --- Firing Productions (PE) For State At Depth 1 ---
  9734. --- Inner Elaboration Phase, active level 1 (S1) ---
  9735. Firing apply*operator
  9736. -->
  9737. (I3 ^predict-yes N970 + :O )
  9738. Firing apply*operator*complete
  9739. -->
  9740. (I3 ^predict-no N969 - :O )
  9741. inner elaboration loop at bottom goal.
  9742. --- Change Working Memory (PE) ---
  9743. =>WM: (13586: I3 ^predict-yes N970)
  9744. <=WM: (13572: N969 ^status complete)
  9745. <=WM: (13571: I3 ^predict-no N969)
  9746. --- Firing Productions (IE) For State At Depth 1 ---
  9747. --- Inner Elaboration Phase, active level 1 (S1) ---
  9748. Firing monitor*world
  9749. -->
  9750. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9751. --- Change Working Memory (IE) ---
  9752. --- END Application Phase ---
  9753. --- Output Phase ---
  9754. ENV: Agent did: predict-yes for direction L in state State-B
  9755. In State-B moving L
  9756. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9757. predict error 0
  9758. dir: dir isU
  9759. --- END Output Phase ---
  9760. /|\--- Input Phase ---
  9761. =>WM: (13590: I2 ^dir U)
  9762. =>WM: (13589: I2 ^reward 1)
  9763. =>WM: (13588: I2 ^see 1)
  9764. =>WM: (13587: N970 ^status complete)
  9765. <=WM: (13575: I2 ^dir L)
  9766. <=WM: (13574: I2 ^reward 1)
  9767. <=WM: (13573: I2 ^see 0)
  9768. =>WM: (13591: I2 ^level-1 L1-root)
  9769. <=WM: (13576: I2 ^level-1 R1-root)
  9770. --- END Input Phase ---
  9771. --- Proposal Phase ---
  9772. --- Inner Elaboration Phase, active level 1 (S1) ---
  9773. Firing elaborate*copy-see-to-output-link
  9774. -->
  9775. (I3 ^see 1 +)
  9776. Firing elaborate*reward*based*on*reward
  9777. -->
  9778. (R974 ^value 1 +)
  9779. (R1 ^reward R974 +)
  9780. Firing propose*predict-yes
  9781. -->
  9782. (O1941 ^name predict-yes +)
  9783. (S1 ^operator O1941 +)
  9784. Firing propose*predict-no
  9785. -->
  9786. (O1942 ^name predict-no +)
  9787. (S1 ^operator O1942 +)
  9788. Firing rl*prefer*rvt*predict-no*H0*4
  9789. -->
  9790. (S1 ^operator O1940 = 0.9999999999999999)
  9791. Firing rl*prefer*rvt*predict-yes*H0*3
  9792. -->
  9793. (S1 ^operator O1939 = 0.)
  9794. Firing prefer*rvt*predict-yes*H0
  9795. -->
  9796. Firing prefer*rvt*predict-no*H0
  9797. -->
  9798. Firing elaborate*copy-dir-to-output-link
  9799. -->
  9800. (I3 ^dir U +)
  9801. inner elaboration loop at bottom goal.
  9802. Retracting elaborate*copy-see-to-output-link
  9803. -->
  9804. (I3 ^see 0 +)
  9805. Retracting propose*predict-no
  9806. -->
  9807. (O1940 ^name predict-no +)
  9808. (S1 ^operator O1940 +)
  9809. Retracting propose*predict-yes
  9810. -->
  9811. (O1939 ^name predict-yes +)
  9812. (S1 ^operator O1939 +)
  9813. Retracting elaborate*reward*based*on*reward
  9814. -->
  9815. (R973 ^value 1 +)
  9816. (R1 ^reward R973 +)
  9817. Retracting elaborate*copy-dir-to-output-link
  9818. -->
  9819. (I3 ^dir L +)
  9820. Retracting rl*prefer*rvt*predict-no*H0*2
  9821. -->
  9822. (S1 ^operator O1940 = 0.3212956367143155)
  9823. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  9824. -->
  9825. (S1 ^operator O1940 = 0.03900899329983293)
  9826. Retracting rl*prefer*rvt*predict-yes*H0*1
  9827. -->
  9828. (S1 ^operator O1939 = 0.3402456256121283)
  9829. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  9830. -->
  9831. (S1 ^operator O1939 = 0.6597559345006581)
  9832. =>WM: (13599: S1 ^operator O1942 +)
  9833. =>WM: (13598: S1 ^operator O1941 +)
  9834. =>WM: (13597: I3 ^dir U)
  9835. =>WM: (13596: O1942 ^name predict-no)
  9836. =>WM: (13595: O1941 ^name predict-yes)
  9837. =>WM: (13594: R974 ^value 1)
  9838. =>WM: (13593: R1 ^reward R974)
  9839. =>WM: (13592: I3 ^see 1)
  9840. <=WM: (13583: S1 ^operator O1939 +)
  9841. <=WM: (13585: S1 ^operator O1939)
  9842. <=WM: (13584: S1 ^operator O1940 +)
  9843. <=WM: (13582: I3 ^dir L)
  9844. <=WM: (13578: R1 ^reward R973)
  9845. <=WM: (13577: I3 ^see 0)
  9846. <=WM: (13581: O1940 ^name predict-no)
  9847. <=WM: (13580: O1939 ^name predict-yes)
  9848. <=WM: (13579: R973 ^value 1)
  9849. --- Inner Elaboration Phase, active level 1 (S1) ---
  9850. Firing prefer*rvt*predict-yes*H0
  9851. -->
  9852. Firing rl*prefer*rvt*predict-yes*H0*3
  9853. -->
  9854. (S1 ^operator O1941 = 0.)
  9855. Firing prefer*rvt*predict-no*H0
  9856. -->
  9857. Firing rl*prefer*rvt*predict-no*H0*4
  9858. -->
  9859. (S1 ^operator O1942 = 0.9999999999999999)
  9860. inner elaboration loop at bottom goal.
  9861. Retracting rl*prefer*rvt*predict-no*H0*4
  9862. -->
  9863. (S1 ^operator O1940 = 0.9999999999999999)
  9864. Retracting rl*prefer*rvt*predict-yes*H0*3
  9865. -->
  9866. (S1 ^operator O1939 = 0.)
  9867. --- END Proposal Phase ---
  9868. --- Decision Phase ---
  9869. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.893082,0.0960911)
  9870. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422823 0.236933 0.659756 -> 0.422822 0.236933 0.659756(R,m,v=1,1,0)
  9871. =>WM: (13600: S1 ^operator O1942)
  9872. 971: O: O1942 (predict-no)
  9873. --- END Decision Phase ---
  9874. --- Application Phase ---
  9875. --- Firing Productions (PE) For State At Depth 1 ---
  9876. --- Inner Elaboration Phase, active level 1 (S1) ---
  9877. Firing apply*operator
  9878. -->
  9879. (I3 ^predict-no N971 + :O )
  9880. Firing apply*operator*complete
  9881. -->
  9882. (I3 ^predict-yes N970 - :O )
  9883. inner elaboration loop at bottom goal.
  9884. --- Change Working Memory (PE) ---
  9885. =>WM: (13601: I3 ^predict-no N971)
  9886. <=WM: (13587: N970 ^status complete)
  9887. <=WM: (13586: I3 ^predict-yes N970)
  9888. --- Firing Productions (IE) For State At Depth 1 ---
  9889. --- Inner Elaboration Phase, active level 1 (S1) ---
  9890. Firing monitor*world
  9891. -->
  9892. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9893. --- Change Working Memory (IE) ---
  9894. --- END Application Phase ---
  9895. --- Output Phase ---
  9896. ENV: Agent did: predict-no for direction U in state State-A
  9897. In State-A moving U
  9898. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9899. predict error 0
  9900. dir: dir isL
  9901. --- END Output Phase ---
  9902. ---- Input Phase ---
  9903. =>WM: (13605: I2 ^dir L)
  9904. =>WM: (13604: I2 ^reward 1)
  9905. =>WM: (13603: I2 ^see 0)
  9906. =>WM: (13602: N971 ^status complete)
  9907. <=WM: (13590: I2 ^dir U)
  9908. <=WM: (13589: I2 ^reward 1)
  9909. <=WM: (13588: I2 ^see 1)
  9910. =>WM: (13606: I2 ^level-1 L1-root)
  9911. <=WM: (13591: I2 ^level-1 L1-root)
  9912. --- END Input Phase ---
  9913. --- Proposal Phase ---
  9914. --- Inner Elaboration Phase, active level 1 (S1) ---
  9915. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  9916. -->
  9917. (S1 ^operator O1941 = 0.02884852834965246)
  9918. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  9919. -->
  9920. (S1 ^operator O1942 = 0.6787425437117627)
  9921. Firing prefer*rvt*predict-no*H0*2*H1
  9922. -->
  9923. Firing prefer*rvt*predict-yes*H0*1*H1
  9924. -->
  9925. Firing elaborate*copy-see-to-output-link
  9926. -->
  9927. (I3 ^see 0 +)
  9928. Firing elaborate*reward*based*on*reward
  9929. -->
  9930. (R975 ^value 1 +)
  9931. (R1 ^reward R975 +)
  9932. Firing propose*predict-yes
  9933. -->
  9934. (O1943 ^name predict-yes +)
  9935. (S1 ^operator O1943 +)
  9936. Firing propose*predict-no
  9937. -->
  9938. (O1944 ^name predict-no +)
  9939. (S1 ^operator O1944 +)
  9940. Firing rl*prefer*rvt*predict-no*H0*2
  9941. -->
  9942. (S1 ^operator O1942 = 0.3212956367143155)
  9943. Firing rl*prefer*rvt*predict-yes*H0*1
  9944. -->
  9945. (S1 ^operator O1941 = 0.3402453915952103)
  9946. Firing prefer*rvt*predict-yes*H0
  9947. -->
  9948. Firing prefer*rvt*predict-no*H0
  9949. -->
  9950. Firing elaborate*copy-dir-to-output-link
  9951. -->
  9952. (I3 ^dir L +)
  9953. inner elaboration loop at bottom goal.
  9954. Retracting elaborate*copy-see-to-output-link
  9955. -->
  9956. (I3 ^see 1 +)
  9957. Retracting propose*predict-no
  9958. -->
  9959. (O1942 ^name predict-no +)
  9960. (S1 ^operator O1942 +)
  9961. Retracting propose*predict-yes
  9962. -->
  9963. (O1941 ^name predict-yes +)
  9964. (S1 ^operator O1941 +)
  9965. Retracting elaborate*reward*based*on*reward
  9966. -->
  9967. (R974 ^value 1 +)
  9968. (R1 ^reward R974 +)
  9969. Retracting elaborate*copy-dir-to-output-link
  9970. -->
  9971. (I3 ^dir U +)
  9972. Retracting rl*prefer*rvt*predict-no*H0*4
  9973. -->
  9974. (S1 ^operator O1942 = 0.9999999999999999)
  9975. Retracting rl*prefer*rvt*predict-yes*H0*3
  9976. -->
  9977. (S1 ^operator O1941 = 0.)
  9978. =>WM: (13614: S1 ^operator O1944 +)
  9979. =>WM: (13613: S1 ^operator O1943 +)
  9980. =>WM: (13612: I3 ^dir L)
  9981. =>WM: (13611: O1944 ^name predict-no)
  9982. =>WM: (13610: O1943 ^name predict-yes)
  9983. =>WM: (13609: R975 ^value 1)
  9984. =>WM: (13608: R1 ^reward R975)
  9985. =>WM: (13607: I3 ^see 0)
  9986. <=WM: (13598: S1 ^operator O1941 +)
  9987. <=WM: (13599: S1 ^operator O1942 +)
  9988. <=WM: (13600: S1 ^operator O1942)
  9989. <=WM: (13597: I3 ^dir U)
  9990. <=WM: (13593: R1 ^reward R974)
  9991. <=WM: (13592: I3 ^see 1)
  9992. <=WM: (13596: O1942 ^name predict-no)
  9993. <=WM: (13595: O1941 ^name predict-yes)
  9994. <=WM: (13594: R974 ^value 1)
  9995. --- Inner Elaboration Phase, active level 1 (S1) ---
  9996. Firing prefer*rvt*predict-yes*H0
  9997. -->
  9998. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  9999. -->
  10000. (S1 ^operator O1943 = 0.02884852834965246)
  10001. Firing rl*prefer*rvt*predict-yes*H0*1
  10002. -->
  10003. (S1 ^operator O1943 = 0.3402453915952103)
  10004. Firing prefer*rvt*predict-yes*H0*1*H1
  10005. -->
  10006. Firing prefer*rvt*predict-no*H0
  10007. -->
  10008. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  10009. -->
  10010. (S1 ^operator O1944 = 0.6787425437117627)
  10011. Firing rl*prefer*rvt*predict-no*H0*2
  10012. -->
  10013. (S1 ^operator O1944 = 0.3212956367143155)
  10014. Firing prefer*rvt*predict-no*H0*2*H1
  10015. -->
  10016. inner elaboration loop at bottom goal.
  10017. Retracting rl*prefer*rvt*predict-no*H0*2
  10018. -->
  10019. (S1 ^operator O1942 = 0.3212956367143155)
  10020. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  10021. -->
  10022. (S1 ^operator O1942 = 0.6787425437117627)
  10023. Retracting rl*prefer*rvt*predict-yes*H0*1
  10024. -->
  10025. (S1 ^operator O1941 = 0.3402453915952103)
  10026. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  10027. -->
  10028. (S1 ^operator O1941 = 0.02884852834965246)
  10029. --- END Proposal Phase ---
  10030. --- Decision Phase ---
  10031. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10032. =>WM: (13615: S1 ^operator O1944)
  10033. 972: O: O1944 (predict-no)
  10034. --- END Decision Phase ---
  10035. --- Application Phase ---
  10036. --- Firing Productions (PE) For State At Depth 1 ---
  10037. --- Inner Elaboration Phase, active level 1 (S1) ---
  10038. Firing apply*operator
  10039. -->
  10040. (I3 ^predict-no N972 + :O )
  10041. Firing apply*operator*complete
  10042. -->
  10043. (I3 ^predict-no N971 - :O )
  10044. inner elaboration loop at bottom goal.
  10045. --- Change Working Memory (PE) ---
  10046. =>WM: (13616: I3 ^predict-no N972)
  10047. <=WM: (13602: N971 ^status complete)
  10048. <=WM: (13601: I3 ^predict-no N971)
  10049. --- Firing Productions (IE) For State At Depth 1 ---
  10050. --- Inner Elaboration Phase, active level 1 (S1) ---
  10051. Firing monitor*world
  10052. -->
  10053. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10054. --- Change Working Memory (IE) ---
  10055. --- END Application Phase ---
  10056. --- Output Phase ---
  10057. ENV: Agent did: predict-no for direction L in state State-A
  10058. In State-A moving L
  10059. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10060. predict error 0
  10061. dir: dir isR
  10062. --- END Output Phase ---
  10063. /|\--- Input Phase ---
  10064. =>WM: (13620: I2 ^dir R)
  10065. =>WM: (13619: I2 ^reward 1)
  10066. =>WM: (13618: I2 ^see 0)
  10067. =>WM: (13617: N972 ^status complete)
  10068. <=WM: (13605: I2 ^dir L)
  10069. <=WM: (13604: I2 ^reward 1)
  10070. <=WM: (13603: I2 ^see 0)
  10071. =>WM: (13621: I2 ^level-1 L0-root)
  10072. <=WM: (13606: I2 ^level-1 L1-root)
  10073. --- END Input Phase ---
  10074. --- Proposal Phase ---
  10075. --- Inner Elaboration Phase, active level 1 (S1) ---
  10076. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  10077. -->
  10078. (S1 ^operator O1943 = 0.8878798118503368)
  10079. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  10080. -->
  10081. (S1 ^operator O1944 = -0.1957074416057287)
  10082. Firing prefer*rvt*predict-no*H0*6*H1
  10083. -->
  10084. Firing prefer*rvt*predict-yes*H0*5*H1
  10085. -->
  10086. Firing elaborate*copy-see-to-output-link
  10087. -->
  10088. (I3 ^see 0 +)
  10089. Firing elaborate*reward*based*on*reward
  10090. -->
  10091. (R976 ^value 1 +)
  10092. (R1 ^reward R976 +)
  10093. Firing propose*predict-yes
  10094. -->
  10095. (O1945 ^name predict-yes +)
  10096. (S1 ^operator O1945 +)
  10097. Firing propose*predict-no
  10098. -->
  10099. (O1946 ^name predict-no +)
  10100. (S1 ^operator O1946 +)
  10101. Firing rl*prefer*rvt*predict-no*H0*6
  10102. -->
  10103. (S1 ^operator O1944 = 0.3993314366691663)
  10104. Firing rl*prefer*rvt*predict-yes*H0*5
  10105. -->
  10106. (S1 ^operator O1943 = 0.1121050819385843)
  10107. Firing prefer*rvt*predict-yes*H0
  10108. -->
  10109. Firing prefer*rvt*predict-no*H0
  10110. -->
  10111. Firing elaborate*copy-dir-to-output-link
  10112. -->
  10113. (I3 ^dir R +)
  10114. inner elaboration loop at bottom goal.
  10115. Retracting elaborate*copy-see-to-output-link
  10116. -->
  10117. (I3 ^see 0 +)
  10118. Retracting propose*predict-no
  10119. -->
  10120. (O1944 ^name predict-no +)
  10121. (S1 ^operator O1944 +)
  10122. Retracting propose*predict-yes
  10123. -->
  10124. (O1943 ^name predict-yes +)
  10125. (S1 ^operator O1943 +)
  10126. Retracting elaborate*reward*based*on*reward
  10127. -->
  10128. (R975 ^value 1 +)
  10129. (R1 ^reward R975 +)
  10130. Retracting elaborate*copy-dir-to-output-link
  10131. -->
  10132. (I3 ^dir L +)
  10133. Retracting rl*prefer*rvt*predict-no*H0*2
  10134. -->
  10135. (S1 ^operator O1944 = 0.3212956367143155)
  10136. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  10137. -->
  10138. (S1 ^operator O1944 = 0.6787425437117627)
  10139. Retracting rl*prefer*rvt*predict-yes*H0*1
  10140. -->
  10141. (S1 ^operator O1943 = 0.3402453915952103)
  10142. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  10143. -->
  10144. (S1 ^operator O1943 = 0.02884852834965246)
  10145. =>WM: (13628: S1 ^operator O1946 +)
  10146. =>WM: (13627: S1 ^operator O1945 +)
  10147. =>WM: (13626: I3 ^dir R)
  10148. =>WM: (13625: O1946 ^name predict-no)
  10149. =>WM: (13624: O1945 ^name predict-yes)
  10150. =>WM: (13623: R976 ^value 1)
  10151. =>WM: (13622: R1 ^reward R976)
  10152. <=WM: (13613: S1 ^operator O1943 +)
  10153. <=WM: (13614: S1 ^operator O1944 +)
  10154. <=WM: (13615: S1 ^operator O1944)
  10155. <=WM: (13612: I3 ^dir L)
  10156. <=WM: (13608: R1 ^reward R975)
  10157. <=WM: (13611: O1944 ^name predict-no)
  10158. <=WM: (13610: O1943 ^name predict-yes)
  10159. <=WM: (13609: R975 ^value 1)
  10160. --- Inner Elaboration Phase, active level 1 (S1) ---
  10161. Firing prefer*rvt*predict-yes*H0
  10162. -->
  10163. Firing rl*prefer*rvt*predict-yes*H0*5
  10164. -->
  10165. (S1 ^operator O1945 = 0.1121050819385843)
  10166. Firing prefer*rvt*predict-yes*H0*5*H1
  10167. -->
  10168. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  10169. -->
  10170. (S1 ^operator O1945 = 0.8878798118503368)
  10171. Firing prefer*rvt*predict-no*H0
  10172. -->
  10173. Firing rl*prefer*rvt*predict-no*H0*6
  10174. -->
  10175. (S1 ^operator O1946 = 0.3993314366691663)
  10176. Firing prefer*rvt*predict-no*H0*6*H1
  10177. -->
  10178. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  10179. -->
  10180. (S1 ^operator O1946 = -0.1957074416057287)
  10181. inner elaboration loop at bottom goal.
  10182. Retracting rl*prefer*rvt*predict-no*H0*6
  10183. -->
  10184. (S1 ^operator O1944 = 0.3993314366691663)
  10185. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  10186. -->
  10187. (S1 ^operator O1944 = -0.1957074416057287)
  10188. Retracting rl*prefer*rvt*predict-yes*H0*5
  10189. -->
  10190. (S1 ^operator O1943 = 0.1121050819385843)
  10191. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  10192. -->
  10193. (S1 ^operator O1943 = 0.8878798118503368)
  10194. --- END Proposal Phase ---
  10195. --- Decision Phase ---
  10196. RL update rl*prefer*rvt*predict-no*H0*2 0.641773 -0.320478 0.321296 -> 0.641767 -0.320477 0.32129(R,m,v=1,0.933333,0.0626398)
  10197. RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358265 0.320477 0.678743 -> 0.358259 0.320477 0.678737(R,m,v=1,1,0)
  10198. =>WM: (13629: S1 ^operator O1945)
  10199. 973: O: O1945 (predict-yes)
  10200. --- END Decision Phase ---
  10201. --- Application Phase ---
  10202. --- Firing Productions (PE) For State At Depth 1 ---
  10203. --- Inner Elaboration Phase, active level 1 (S1) ---
  10204. Firing apply*operator
  10205. -->
  10206. (I3 ^predict-yes N973 + :O )
  10207. Firing apply*operator*complete
  10208. -->
  10209. (I3 ^predict-no N972 - :O )
  10210. inner elaboration loop at bottom goal.
  10211. --- Change Working Memory (PE) ---
  10212. =>WM: (13630: I3 ^predict-yes N973)
  10213. <=WM: (13617: N972 ^status complete)
  10214. <=WM: (13616: I3 ^predict-no N972)
  10215. --- Firing Productions (IE) For State At Depth 1 ---
  10216. --- Inner Elaboration Phase, active level 1 (S1) ---
  10217. Firing monitor*world
  10218. -->
  10219. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10220. --- Change Working Memory (IE) ---
  10221. --- END Application Phase ---
  10222. --- Output Phase ---
  10223. ENV: Agent did: predict-yes for direction R in state State-A
  10224. In State-A moving R
  10225. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10226. predict error 0
  10227. dir: dir isU
  10228. --- END Output Phase ---
  10229. -/|--- Input Phase ---
  10230. =>WM: (13634: I2 ^dir U)
  10231. =>WM: (13633: I2 ^reward 1)
  10232. =>WM: (13632: I2 ^see 1)
  10233. =>WM: (13631: N973 ^status complete)
  10234. <=WM: (13620: I2 ^dir R)
  10235. <=WM: (13619: I2 ^reward 1)
  10236. <=WM: (13618: I2 ^see 0)
  10237. =>WM: (13635: I2 ^level-1 R1-root)
  10238. <=WM: (13621: I2 ^level-1 L0-root)
  10239. --- END Input Phase ---
  10240. --- Proposal Phase ---
  10241. --- Inner Elaboration Phase, active level 1 (S1) ---
  10242. Firing elaborate*copy-see-to-output-link
  10243. -->
  10244. (I3 ^see 1 +)
  10245. Firing elaborate*reward*based*on*reward
  10246. -->
  10247. (R977 ^value 1 +)
  10248. (R1 ^reward R977 +)
  10249. Firing propose*predict-yes
  10250. -->
  10251. (O1947 ^name predict-yes +)
  10252. (S1 ^operator O1947 +)
  10253. Firing propose*predict-no
  10254. -->
  10255. (O1948 ^name predict-no +)
  10256. (S1 ^operator O1948 +)
  10257. Firing rl*prefer*rvt*predict-no*H0*4
  10258. -->
  10259. (S1 ^operator O1946 = 0.9999999999999999)
  10260. Firing rl*prefer*rvt*predict-yes*H0*3
  10261. -->
  10262. (S1 ^operator O1945 = 0.)
  10263. Firing prefer*rvt*predict-yes*H0
  10264. -->
  10265. Firing prefer*rvt*predict-no*H0
  10266. -->
  10267. Firing elaborate*copy-dir-to-output-link
  10268. -->
  10269. (I3 ^dir U +)
  10270. inner elaboration loop at bottom goal.
  10271. Retracting elaborate*copy-see-to-output-link
  10272. -->
  10273. (I3 ^see 0 +)
  10274. Retracting propose*predict-no
  10275. -->
  10276. (O1946 ^name predict-no +)
  10277. (S1 ^operator O1946 +)
  10278. Retracting propose*predict-yes
  10279. -->
  10280. (O1945 ^name predict-yes +)
  10281. (S1 ^operator O1945 +)
  10282. Retracting elaborate*reward*based*on*reward
  10283. -->
  10284. (R976 ^value 1 +)
  10285. (R1 ^reward R976 +)
  10286. Retracting elaborate*copy-dir-to-output-link
  10287. -->
  10288. (I3 ^dir R +)
  10289. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  10290. -->
  10291. (S1 ^operator O1946 = -0.1957074416057287)
  10292. Retracting rl*prefer*rvt*predict-no*H0*6
  10293. -->
  10294. (S1 ^operator O1946 = 0.3993314366691663)
  10295. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  10296. -->
  10297. (S1 ^operator O1945 = 0.8878798118503368)
  10298. Retracting rl*prefer*rvt*predict-yes*H0*5
  10299. -->
  10300. (S1 ^operator O1945 = 0.1121050819385843)
  10301. =>WM: (13643: S1 ^operator O1948 +)
  10302. =>WM: (13642: S1 ^operator O1947 +)
  10303. =>WM: (13641: I3 ^dir U)
  10304. =>WM: (13640: O1948 ^name predict-no)
  10305. =>WM: (13639: O1947 ^name predict-yes)
  10306. =>WM: (13638: R977 ^value 1)
  10307. =>WM: (13637: R1 ^reward R977)
  10308. =>WM: (13636: I3 ^see 1)
  10309. <=WM: (13627: S1 ^operator O1945 +)
  10310. <=WM: (13629: S1 ^operator O1945)
  10311. <=WM: (13628: S1 ^operator O1946 +)
  10312. <=WM: (13626: I3 ^dir R)
  10313. <=WM: (13622: R1 ^reward R976)
  10314. <=WM: (13607: I3 ^see 0)
  10315. <=WM: (13625: O1946 ^name predict-no)
  10316. <=WM: (13624: O1945 ^name predict-yes)
  10317. <=WM: (13623: R976 ^value 1)
  10318. --- Inner Elaboration Phase, active level 1 (S1) ---
  10319. Firing prefer*rvt*predict-yes*H0
  10320. -->
  10321. Firing rl*prefer*rvt*predict-yes*H0*3
  10322. -->
  10323. (S1 ^operator O1947 = 0.)
  10324. Firing prefer*rvt*predict-no*H0
  10325. -->
  10326. Firing rl*prefer*rvt*predict-no*H0*4
  10327. -->
  10328. (S1 ^operator O1948 = 0.9999999999999999)
  10329. inner elaboration loop at bottom goal.
  10330. Retracting rl*prefer*rvt*predict-no*H0*4
  10331. -->
  10332. (S1 ^operator O1946 = 0.9999999999999999)
  10333. Retracting rl*prefer*rvt*predict-yes*H0*3
  10334. -->
  10335. (S1 ^operator O1945 = 0.)
  10336. --- END Proposal Phase ---
  10337. --- Decision Phase ---
  10338. RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.619031 -0.506923 0.112107(R,m,v=1,0.898089,0.0921117)
  10339. RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380954 0.506926 0.88788 -> 0.380957 0.506925 0.887882(R,m,v=1,1,0)
  10340. =>WM: (13644: S1 ^operator O1948)
  10341. 974: O: O1948 (predict-no)
  10342. --- END Decision Phase ---
  10343. --- Application Phase ---
  10344. --- Firing Productions (PE) For State At Depth 1 ---
  10345. --- Inner Elaboration Phase, active level 1 (S1) ---
  10346. Firing apply*operator
  10347. -->
  10348. (I3 ^predict-no N974 + :O )
  10349. Firing apply*operator*complete
  10350. -->
  10351. (I3 ^predict-yes N973 - :O )
  10352. inner elaboration loop at bottom goal.
  10353. --- Change Working Memory (PE) ---
  10354. =>WM: (13645: I3 ^predict-no N974)
  10355. <=WM: (13631: N973 ^status complete)
  10356. <=WM: (13630: I3 ^predict-yes N973)
  10357. --- Firing Productions (IE) For State At Depth 1 ---
  10358. --- Inner Elaboration Phase, active level 1 (S1) ---
  10359. Firing monitor*world
  10360. -->
  10361. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10362. --- Change Working Memory (IE) ---
  10363. --- END Application Phase ---
  10364. --- Output Phase ---
  10365. ENV: Agent did: predict-no for direction U in state State-B
  10366. In State-B moving U
  10367. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10368. predict error 0
  10369. dir: dir isL
  10370. --- END Output Phase ---
  10371. \-/--- Input Phase ---
  10372. =>WM: (13649: I2 ^dir L)
  10373. =>WM: (13648: I2 ^reward 1)
  10374. =>WM: (13647: I2 ^see 0)
  10375. =>WM: (13646: N974 ^status complete)
  10376. <=WM: (13634: I2 ^dir U)
  10377. <=WM: (13633: I2 ^reward 1)
  10378. <=WM: (13632: I2 ^see 1)
  10379. =>WM: (13650: I2 ^level-1 R1-root)
  10380. <=WM: (13635: I2 ^level-1 R1-root)
  10381. --- END Input Phase ---
  10382. --- Proposal Phase ---
  10383. --- Inner Elaboration Phase, active level 1 (S1) ---
  10384. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  10385. -->
  10386. (S1 ^operator O1948 = 0.03900899329983293)
  10387. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  10388. -->
  10389. (S1 ^operator O1947 = 0.6597557004837401)
  10390. Firing prefer*rvt*predict-no*H0*2*H1
  10391. -->
  10392. Firing prefer*rvt*predict-yes*H0*1*H1
  10393. -->
  10394. Firing elaborate*copy-see-to-output-link
  10395. -->
  10396. (I3 ^see 0 +)
  10397. Firing elaborate*reward*based*on*reward
  10398. -->
  10399. (R978 ^value 1 +)
  10400. (R1 ^reward R978 +)
  10401. Firing propose*predict-yes
  10402. -->
  10403. (O1949 ^name predict-yes +)
  10404. (S1 ^operator O1949 +)
  10405. Firing propose*predict-no
  10406. -->
  10407. (O1950 ^name predict-no +)
  10408. (S1 ^operator O1950 +)
  10409. Firing rl*prefer*rvt*predict-no*H0*2
  10410. -->
  10411. (S1 ^operator O1948 = 0.3212899096504038)
  10412. Firing rl*prefer*rvt*predict-yes*H0*1
  10413. -->
  10414. (S1 ^operator O1947 = 0.3402453915952103)
  10415. Firing prefer*rvt*predict-yes*H0
  10416. -->
  10417. Firing prefer*rvt*predict-no*H0
  10418. -->
  10419. Firing elaborate*copy-dir-to-output-link
  10420. -->
  10421. (I3 ^dir L +)
  10422. inner elaboration loop at bottom goal.
  10423. Retracting elaborate*copy-see-to-output-link
  10424. -->
  10425. (I3 ^see 1 +)
  10426. Retracting propose*predict-no
  10427. -->
  10428. (O1948 ^name predict-no +)
  10429. (S1 ^operator O1948 +)
  10430. Retracting propose*predict-yes
  10431. -->
  10432. (O1947 ^name predict-yes +)
  10433. (S1 ^operator O1947 +)
  10434. Retracting elaborate*reward*based*on*reward
  10435. -->
  10436. (R977 ^value 1 +)
  10437. (R1 ^reward R977 +)
  10438. Retracting elaborate*copy-dir-to-output-link
  10439. -->
  10440. (I3 ^dir U +)
  10441. Retracting rl*prefer*rvt*predict-no*H0*4
  10442. -->
  10443. (S1 ^operator O1948 = 0.9999999999999999)
  10444. Retracting rl*prefer*rvt*predict-yes*H0*3
  10445. -->
  10446. (S1 ^operator O1947 = 0.)
  10447. =>WM: (13658: S1 ^operator O1950 +)
  10448. =>WM: (13657: S1 ^operator O1949 +)
  10449. =>WM: (13656: I3 ^dir L)
  10450. =>WM: (13655: O1950 ^name predict-no)
  10451. =>WM: (13654: O1949 ^name predict-yes)
  10452. =>WM: (13653: R978 ^value 1)
  10453. =>WM: (13652: R1 ^reward R978)
  10454. =>WM: (13651: I3 ^see 0)
  10455. <=WM: (13642: S1 ^operator O1947 +)
  10456. <=WM: (13643: S1 ^operator O1948 +)
  10457. <=WM: (13644: S1 ^operator O1948)
  10458. <=WM: (13641: I3 ^dir U)
  10459. <=WM: (13637: R1 ^reward R977)
  10460. <=WM: (13636: I3 ^see 1)
  10461. <=WM: (13640: O1948 ^name predict-no)
  10462. <=WM: (13639: O1947 ^name predict-yes)
  10463. <=WM: (13638: R977 ^value 1)
  10464. --- Inner Elaboration Phase, active level 1 (S1) ---
  10465. Firing prefer*rvt*predict-yes*H0
  10466. -->
  10467. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  10468. -->
  10469. (S1 ^operator O1949 = 0.6597557004837401)
  10470. Firing rl*prefer*rvt*predict-yes*H0*1
  10471. -->
  10472. (S1 ^operator O1949 = 0.3402453915952103)
  10473. Firing prefer*rvt*predict-yes*H0*1*H1
  10474. -->
  10475. Firing prefer*rvt*predict-no*H0
  10476. -->
  10477. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  10478. -->
  10479. (S1 ^operator O1950 = 0.03900899329983293)
  10480. Firing rl*prefer*rvt*predict-no*H0*2
  10481. -->
  10482. (S1 ^operator O1950 = 0.3212899096504038)
  10483. Firing prefer*rvt*predict-no*H0*2*H1
  10484. -->
  10485. inner elaboration loop at bottom goal.
  10486. Retracting rl*prefer*rvt*predict-no*H0*2
  10487. -->
  10488. (S1 ^operator O1948 = 0.3212899096504038)
  10489. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  10490. -->
  10491. (S1 ^operator O1948 = 0.03900899329983293)
  10492. Retracting rl*prefer*rvt*predict-yes*H0*1
  10493. -->
  10494. (S1 ^operator O1947 = 0.3402453915952103)
  10495. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  10496. -->
  10497. (S1 ^operator O1947 = 0.6597557004837401)
  10498. --- END Proposal Phase ---
  10499. --- Decision Phase ---
  10500. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10501. =>WM: (13659: S1 ^operator O1949)
  10502. 975: O: O1949 (predict-yes)
  10503. --- END Decision Phase ---
  10504. --- Application Phase ---
  10505. --- Firing Productions (PE) For State At Depth 1 ---
  10506. --- Inner Elaboration Phase, active level 1 (S1) ---
  10507. Firing apply*operator
  10508. -->
  10509. (I3 ^predict-yes N975 + :O )
  10510. Firing apply*operator*complete
  10511. -->
  10512. (I3 ^predict-no N974 - :O )
  10513. inner elaboration loop at bottom goal.
  10514. --- Change Working Memory (PE) ---
  10515. =>WM: (13660: I3 ^predict-yes N975)
  10516. <=WM: (13646: N974 ^status complete)
  10517. <=WM: (13645: I3 ^predict-no N974)
  10518. --- Firing Productions (IE) For State At Depth 1 ---
  10519. --- Inner Elaboration Phase, active level 1 (S1) ---
  10520. Firing monitor*world
  10521. -->
  10522. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10523. --- Change Working Memory (IE) ---
  10524. --- END Application Phase ---
  10525. --- Output Phase ---
  10526. ENV: Agent did: predict-yes for direction L in state State-B
  10527. In State-B moving L
  10528. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10529. predict error 0
  10530. dir: dir isR
  10531. --- END Output Phase ---
  10532. |\---- Input Phase ---
  10533. =>WM: (13664: I2 ^dir R)
  10534. =>WM: (13663: I2 ^reward 1)
  10535. =>WM: (13662: I2 ^see 1)
  10536. =>WM: (13661: N975 ^status complete)
  10537. <=WM: (13649: I2 ^dir L)
  10538. <=WM: (13648: I2 ^reward 1)
  10539. <=WM: (13647: I2 ^see 0)
  10540. =>WM: (13665: I2 ^level-1 L1-root)
  10541. <=WM: (13650: I2 ^level-1 R1-root)
  10542. --- END Input Phase ---
  10543. --- Proposal Phase ---
  10544. --- Inner Elaboration Phase, active level 1 (S1) ---
  10545. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  10546. -->
  10547. (S1 ^operator O1949 = 0.8879029797681804)
  10548. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  10549. -->
  10550. (S1 ^operator O1950 = 0.02370016355578053)
  10551. Firing prefer*rvt*predict-no*H0*6*H1
  10552. -->
  10553. Firing prefer*rvt*predict-yes*H0*5*H1
  10554. -->
  10555. Firing elaborate*copy-see-to-output-link
  10556. -->
  10557. (I3 ^see 1 +)
  10558. Firing elaborate*reward*based*on*reward
  10559. -->
  10560. (R979 ^value 1 +)
  10561. (R1 ^reward R979 +)
  10562. Firing propose*predict-yes
  10563. -->
  10564. (O1951 ^name predict-yes +)
  10565. (S1 ^operator O1951 +)
  10566. Firing propose*predict-no
  10567. -->
  10568. (O1952 ^name predict-no +)
  10569. (S1 ^operator O1952 +)
  10570. Firing rl*prefer*rvt*predict-no*H0*6
  10571. -->
  10572. (S1 ^operator O1950 = 0.3993314366691663)
  10573. Firing rl*prefer*rvt*predict-yes*H0*5
  10574. -->
  10575. (S1 ^operator O1949 = 0.1121073478702461)
  10576. Firing prefer*rvt*predict-yes*H0
  10577. -->
  10578. Firing prefer*rvt*predict-no*H0
  10579. -->
  10580. Firing elaborate*copy-dir-to-output-link
  10581. -->
  10582. (I3 ^dir R +)
  10583. inner elaboration loop at bottom goal.
  10584. Retracting elaborate*copy-see-to-output-link
  10585. -->
  10586. (I3 ^see 0 +)
  10587. Retracting propose*predict-no
  10588. -->
  10589. (O1950 ^name predict-no +)
  10590. (S1 ^operator O1950 +)
  10591. Retracting propose*predict-yes
  10592. -->
  10593. (O1949 ^name predict-yes +)
  10594. (S1 ^operator O1949 +)
  10595. Retracting elaborate*reward*based*on*reward
  10596. -->
  10597. (R978 ^value 1 +)
  10598. (R1 ^reward R978 +)
  10599. Retracting elaborate*copy-dir-to-output-link
  10600. -->
  10601. (I3 ^dir L +)
  10602. Retracting rl*prefer*rvt*predict-no*H0*2
  10603. -->
  10604. (S1 ^operator O1950 = 0.3212899096504038)
  10605. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  10606. -->
  10607. (S1 ^operator O1950 = 0.03900899329983293)
  10608. Retracting rl*prefer*rvt*predict-yes*H0*1
  10609. -->
  10610. (S1 ^operator O1949 = 0.3402453915952103)
  10611. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  10612. -->
  10613. (S1 ^operator O1949 = 0.6597557004837401)
  10614. =>WM: (13673: S1 ^operator O1952 +)
  10615. =>WM: (13672: S1 ^operator O1951 +)
  10616. =>WM: (13671: I3 ^dir R)
  10617. =>WM: (13670: O1952 ^name predict-no)
  10618. =>WM: (13669: O1951 ^name predict-yes)
  10619. =>WM: (13668: R979 ^value 1)
  10620. =>WM: (13667: R1 ^reward R979)
  10621. =>WM: (13666: I3 ^see 1)
  10622. <=WM: (13657: S1 ^operator O1949 +)
  10623. <=WM: (13659: S1 ^operator O1949)
  10624. <=WM: (13658: S1 ^operator O1950 +)
  10625. <=WM: (13656: I3 ^dir L)
  10626. <=WM: (13652: R1 ^reward R978)
  10627. <=WM: (13651: I3 ^see 0)
  10628. <=WM: (13655: O1950 ^name predict-no)
  10629. <=WM: (13654: O1949 ^name predict-yes)
  10630. <=WM: (13653: R978 ^value 1)
  10631. --- Inner Elaboration Phase, active level 1 (S1) ---
  10632. Firing prefer*rvt*predict-yes*H0
  10633. -->
  10634. Firing rl*prefer*rvt*predict-yes*H0*5
  10635. -->
  10636. (S1 ^operator O1951 = 0.1121073478702461)
  10637. Firing prefer*rvt*predict-yes*H0*5*H1
  10638. -->
  10639. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  10640. -->
  10641. (S1 ^operator O1951 = 0.8879029797681804)
  10642. Firing prefer*rvt*predict-no*H0
  10643. -->
  10644. Firing rl*prefer*rvt*predict-no*H0*6
  10645. -->
  10646. (S1 ^operator O1952 = 0.3993314366691663)
  10647. Firing prefer*rvt*predict-no*H0*6*H1
  10648. -->
  10649. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  10650. -->
  10651. (S1 ^operator O1952 = 0.02370016355578053)
  10652. inner elaboration loop at bottom goal.
  10653. Retracting rl*prefer*rvt*predict-no*H0*6
  10654. -->
  10655. (S1 ^operator O1950 = 0.3993314366691663)
  10656. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  10657. -->
  10658. (S1 ^operator O1950 = 0.02370016355578053)
  10659. Retracting rl*prefer*rvt*predict-yes*H0*5
  10660. -->
  10661. (S1 ^operator O1949 = 0.1121073478702461)
  10662. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  10663. -->
  10664. (S1 ^operator O1949 = 0.8879029797681804)
  10665. --- END Proposal Phase ---
  10666. --- Decision Phase ---
  10667. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.89375,0.0955582)
  10668. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659756 -> 0.422822 0.236933 0.659756(R,m,v=1,1,0)
  10669. =>WM: (13674: S1 ^operator O1951)
  10670. 976: O: O1951 (predict-yes)
  10671. --- END Decision Phase ---
  10672. --- Application Phase ---
  10673. --- Firing Productions (PE) For State At Depth 1 ---
  10674. --- Inner Elaboration Phase, active level 1 (S1) ---
  10675. Firing apply*operator
  10676. -->
  10677. (I3 ^predict-yes N976 + :O )
  10678. Firing apply*operator*complete
  10679. -->
  10680. (I3 ^predict-yes N975 - :O )
  10681. inner elaboration loop at bottom goal.
  10682. --- Change Working Memory (PE) ---
  10683. =>WM: (13675: I3 ^predict-yes N976)
  10684. <=WM: (13661: N975 ^status complete)
  10685. <=WM: (13660: I3 ^predict-yes N975)
  10686. --- Firing Productions (IE) For State At Depth 1 ---
  10687. --- Inner Elaboration Phase, active level 1 (S1) ---
  10688. Firing monitor*world
  10689. -->
  10690. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10691. --- Change Working Memory (IE) ---
  10692. --- END Application Phase ---
  10693. --- Output Phase ---
  10694. ENV: Agent did: predict-yes for direction R in state State-A
  10695. In State-A moving R
  10696. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10697. predict error 0
  10698. dir: dir isR
  10699. --- END Output Phase ---
  10700. /|\--- Input Phase ---
  10701. =>WM: (13679: I2 ^dir R)
  10702. =>WM: (13678: I2 ^reward 1)
  10703. =>WM: (13677: I2 ^see 1)
  10704. =>WM: (13676: N976 ^status complete)
  10705. <=WM: (13664: I2 ^dir R)
  10706. <=WM: (13663: I2 ^reward 1)
  10707. <=WM: (13662: I2 ^see 1)
  10708. =>WM: (13680: I2 ^level-1 R1-root)
  10709. <=WM: (13665: I2 ^level-1 L1-root)
  10710. --- END Input Phase ---
  10711. --- Proposal Phase ---
  10712. --- Inner Elaboration Phase, active level 1 (S1) ---
  10713. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  10714. -->
  10715. (S1 ^operator O1952 = 0.6006758138031456)
  10716. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  10717. -->
  10718. (S1 ^operator O1951 = 0.1602187148382515)
  10719. Firing prefer*rvt*predict-no*H0*6*H1
  10720. -->
  10721. Firing prefer*rvt*predict-yes*H0*5*H1
  10722. -->
  10723. Firing elaborate*copy-see-to-output-link
  10724. -->
  10725. (I3 ^see 1 +)
  10726. Firing elaborate*reward*based*on*reward
  10727. -->
  10728. (R980 ^value 1 +)
  10729. (R1 ^reward R980 +)
  10730. Firing propose*predict-yes
  10731. -->
  10732. (O1953 ^name predict-yes +)
  10733. (S1 ^operator O1953 +)
  10734. Firing propose*predict-no
  10735. -->
  10736. (O1954 ^name predict-no +)
  10737. (S1 ^operator O1954 +)
  10738. Firing rl*prefer*rvt*predict-no*H0*6
  10739. -->
  10740. (S1 ^operator O1952 = 0.3993314366691663)
  10741. Firing rl*prefer*rvt*predict-yes*H0*5
  10742. -->
  10743. (S1 ^operator O1951 = 0.1121073478702461)
  10744. Firing prefer*rvt*predict-yes*H0
  10745. -->
  10746. Firing prefer*rvt*predict-no*H0
  10747. -->
  10748. Firing elaborate*copy-dir-to-output-link
  10749. -->
  10750. (I3 ^dir R +)
  10751. inner elaboration loop at bottom goal.
  10752. Retracting elaborate*copy-see-to-output-link
  10753. -->
  10754. (I3 ^see 1 +)
  10755. Retracting propose*predict-no
  10756. -->
  10757. (O1952 ^name predict-no +)
  10758. (S1 ^operator O1952 +)
  10759. Retracting propose*predict-yes
  10760. -->
  10761. (O1951 ^name predict-yes +)
  10762. (S1 ^operator O1951 +)
  10763. Retracting elaborate*reward*based*on*reward
  10764. -->
  10765. (R979 ^value 1 +)
  10766. (R1 ^reward R979 +)
  10767. Retracting elaborate*copy-dir-to-output-link
  10768. -->
  10769. (I3 ^dir R +)
  10770. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  10771. -->
  10772. (S1 ^operator O1952 = 0.02370016355578053)
  10773. Retracting rl*prefer*rvt*predict-no*H0*6
  10774. -->
  10775. (S1 ^operator O1952 = 0.3993314366691663)
  10776. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  10777. -->
  10778. (S1 ^operator O1951 = 0.8879029797681804)
  10779. Retracting rl*prefer*rvt*predict-yes*H0*5
  10780. -->
  10781. (S1 ^operator O1951 = 0.1121073478702461)
  10782. =>WM: (13686: S1 ^operator O1954 +)
  10783. =>WM: (13685: S1 ^operator O1953 +)
  10784. =>WM: (13684: O1954 ^name predict-no)
  10785. =>WM: (13683: O1953 ^name predict-yes)
  10786. =>WM: (13682: R980 ^value 1)
  10787. =>WM: (13681: R1 ^reward R980)
  10788. <=WM: (13672: S1 ^operator O1951 +)
  10789. <=WM: (13674: S1 ^operator O1951)
  10790. <=WM: (13673: S1 ^operator O1952 +)
  10791. <=WM: (13667: R1 ^reward R979)
  10792. <=WM: (13670: O1952 ^name predict-no)
  10793. <=WM: (13669: O1951 ^name predict-yes)
  10794. <=WM: (13668: R979 ^value 1)
  10795. --- Inner Elaboration Phase, active level 1 (S1) ---
  10796. Firing prefer*rvt*predict-yes*H0
  10797. -->
  10798. Firing rl*prefer*rvt*predict-yes*H0*5
  10799. -->
  10800. (S1 ^operator O1953 = 0.1121073478702461)
  10801. Firing prefer*rvt*predict-yes*H0*5*H1
  10802. -->
  10803. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  10804. -->
  10805. (S1 ^operator O1953 = 0.1602187148382515)
  10806. Firing prefer*rvt*predict-no*H0
  10807. -->
  10808. Firing rl*prefer*rvt*predict-no*H0*6
  10809. -->
  10810. (S1 ^operator O1954 = 0.3993314366691663)
  10811. Firing prefer*rvt*predict-no*H0*6*H1
  10812. -->
  10813. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  10814. -->
  10815. (S1 ^operator O1954 = 0.6006758138031456)
  10816. inner elaboration loop at bottom goal.
  10817. Retracting rl*prefer*rvt*predict-no*H0*6
  10818. -->
  10819. (S1 ^operator O1952 = 0.3993314366691663)
  10820. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  10821. -->
  10822. (S1 ^operator O1952 = 0.6006758138031456)
  10823. Retracting rl*prefer*rvt*predict-yes*H0*5
  10824. -->
  10825. (S1 ^operator O1951 = 0.1121073478702461)
  10826. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  10827. -->
  10828. (S1 ^operator O1951 = 0.1602187148382515)
  10829. --- END Proposal Phase ---
  10830. --- Decision Phase ---
  10831. RL update rl*prefer*rvt*predict-yes*H0*5 0.619031 -0.506923 0.112107 -> 0.619029 -0.506923 0.112106(R,m,v=1,0.898734,0.0915907)
  10832. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380982 0.506921 0.887903 -> 0.38098 0.506922 0.887901(R,m,v=1,1,0)
  10833. =>WM: (13687: S1 ^operator O1954)
  10834. 977: O: O1954 (predict-no)
  10835. --- END Decision Phase ---
  10836. --- Application Phase ---
  10837. --- Firing Productions (PE) For State At Depth 1 ---
  10838. --- Inner Elaboration Phase, active level 1 (S1) ---
  10839. Firing apply*operator
  10840. -->
  10841. (I3 ^predict-no N977 + :O )
  10842. Firing apply*operator*complete
  10843. -->
  10844. (I3 ^predict-yes N976 - :O )
  10845. inner elaboration loop at bottom goal.
  10846. --- Change Working Memory (PE) ---
  10847. =>WM: (13688: I3 ^predict-no N977)
  10848. <=WM: (13676: N976 ^status complete)
  10849. <=WM: (13675: I3 ^predict-yes N976)
  10850. --- Firing Productions (IE) For State At Depth 1 ---
  10851. --- Inner Elaboration Phase, active level 1 (S1) ---
  10852. Firing monitor*world
  10853. -->
  10854. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10855. --- Change Working Memory (IE) ---
  10856. --- END Application Phase ---
  10857. --- Output Phase ---
  10858. ENV: Agent did: predict-no for direction R in state State-B
  10859. In State-B moving R
  10860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10861. predict error 0
  10862. dir: dir isU
  10863. --- END Output Phase ---
  10864. -/|--- Input Phase ---
  10865. =>WM: (13692: I2 ^dir U)
  10866. =>WM: (13691: I2 ^reward 1)
  10867. =>WM: (13690: I2 ^see 0)
  10868. =>WM: (13689: N977 ^status complete)
  10869. <=WM: (13679: I2 ^dir R)
  10870. <=WM: (13678: I2 ^reward 1)
  10871. <=WM: (13677: I2 ^see 1)
  10872. =>WM: (13693: I2 ^level-1 R0-root)
  10873. <=WM: (13680: I2 ^level-1 R1-root)
  10874. --- END Input Phase ---
  10875. --- Proposal Phase ---
  10876. --- Inner Elaboration Phase, active level 1 (S1) ---
  10877. Firing elaborate*copy-see-to-output-link
  10878. -->
  10879. (I3 ^see 0 +)
  10880. Firing elaborate*reward*based*on*reward
  10881. -->
  10882. (R981 ^value 1 +)
  10883. (R1 ^reward R981 +)
  10884. Firing propose*predict-yes
  10885. -->
  10886. (O1955 ^name predict-yes +)
  10887. (S1 ^operator O1955 +)
  10888. Firing propose*predict-no
  10889. -->
  10890. (O1956 ^name predict-no +)
  10891. (S1 ^operator O1956 +)
  10892. Firing rl*prefer*rvt*predict-no*H0*4
  10893. -->
  10894. (S1 ^operator O1954 = 0.9999999999999999)
  10895. Firing rl*prefer*rvt*predict-yes*H0*3
  10896. -->
  10897. (S1 ^operator O1953 = 0.)
  10898. Firing prefer*rvt*predict-yes*H0
  10899. -->
  10900. Firing prefer*rvt*predict-no*H0
  10901. -->
  10902. Firing elaborate*copy-dir-to-output-link
  10903. -->
  10904. (I3 ^dir U +)
  10905. inner elaboration loop at bottom goal.
  10906. Retracting elaborate*copy-see-to-output-link
  10907. -->
  10908. (I3 ^see 1 +)
  10909. Retracting propose*predict-no
  10910. -->
  10911. (O1954 ^name predict-no +)
  10912. (S1 ^operator O1954 +)
  10913. Retracting propose*predict-yes
  10914. -->
  10915. (O1953 ^name predict-yes +)
  10916. (S1 ^operator O1953 +)
  10917. Retracting elaborate*reward*based*on*reward
  10918. -->
  10919. (R980 ^value 1 +)
  10920. (R1 ^reward R980 +)
  10921. Retracting elaborate*copy-dir-to-output-link
  10922. -->
  10923. (I3 ^dir R +)
  10924. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  10925. -->
  10926. (S1 ^operator O1954 = 0.6006758138031456)
  10927. Retracting rl*prefer*rvt*predict-no*H0*6
  10928. -->
  10929. (S1 ^operator O1954 = 0.3993314366691663)
  10930. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  10931. -->
  10932. (S1 ^operator O1953 = 0.1602187148382515)
  10933. Retracting rl*prefer*rvt*predict-yes*H0*5
  10934. -->
  10935. (S1 ^operator O1953 = 0.1121057987244822)
  10936. =>WM: (13701: S1 ^operator O1956 +)
  10937. =>WM: (13700: S1 ^operator O1955 +)
  10938. =>WM: (13699: I3 ^dir U)
  10939. =>WM: (13698: O1956 ^name predict-no)
  10940. =>WM: (13697: O1955 ^name predict-yes)
  10941. =>WM: (13696: R981 ^value 1)
  10942. =>WM: (13695: R1 ^reward R981)
  10943. =>WM: (13694: I3 ^see 0)
  10944. <=WM: (13685: S1 ^operator O1953 +)
  10945. <=WM: (13686: S1 ^operator O1954 +)
  10946. <=WM: (13687: S1 ^operator O1954)
  10947. <=WM: (13671: I3 ^dir R)
  10948. <=WM: (13681: R1 ^reward R980)
  10949. <=WM: (13666: I3 ^see 1)
  10950. <=WM: (13684: O1954 ^name predict-no)
  10951. <=WM: (13683: O1953 ^name predict-yes)
  10952. <=WM: (13682: R980 ^value 1)
  10953. --- Inner Elaboration Phase, active level 1 (S1) ---
  10954. Firing prefer*rvt*predict-yes*H0
  10955. -->
  10956. Firing rl*prefer*rvt*predict-yes*H0*3
  10957. -->
  10958. (S1 ^operator O1955 = 0.)
  10959. Firing prefer*rvt*predict-no*H0
  10960. -->
  10961. Firing rl*prefer*rvt*predict-no*H0*4
  10962. -->
  10963. (S1 ^operator O1956 = 0.9999999999999999)
  10964. inner elaboration loop at bottom goal.
  10965. Retracting rl*prefer*rvt*predict-no*H0*4
  10966. -->
  10967. (S1 ^operator O1954 = 0.9999999999999999)
  10968. Retracting rl*prefer*rvt*predict-yes*H0*3
  10969. -->
  10970. (S1 ^operator O1953 = 0.)
  10971. --- END Proposal Phase ---
  10972. --- Decision Phase ---
  10973. RL update rl*prefer*rvt*predict-no*H0*6 0.55804 -0.158708 0.399331 -> 0.558039 -0.158709 0.39933(R,m,v=1,0.927273,0.0678492)
  10974. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441967 0.158709 0.600676 -> 0.441966 0.158709 0.600675(R,m,v=1,1,0)
  10975. =>WM: (13702: S1 ^operator O1956)
  10976. 978: O: O1956 (predict-no)
  10977. --- END Decision Phase ---
  10978. --- Application Phase ---
  10979. --- Firing Productions (PE) For State At Depth 1 ---
  10980. --- Inner Elaboration Phase, active level 1 (S1) ---
  10981. Firing apply*operator
  10982. -->
  10983. (I3 ^predict-no N978 + :O )
  10984. Firing apply*operator*complete
  10985. -->
  10986. (I3 ^predict-no N977 - :O )
  10987. inner elaboration loop at bottom goal.
  10988. --- Change Working Memory (PE) ---
  10989. =>WM: (13703: I3 ^predict-no N978)
  10990. <=WM: (13689: N977 ^status complete)
  10991. <=WM: (13688: I3 ^predict-no N977)
  10992. --- Firing Productions (IE) For State At Depth 1 ---
  10993. --- Inner Elaboration Phase, active level 1 (S1) ---
  10994. Firing monitor*world
  10995. -->
  10996. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10997. --- Change Working Memory (IE) ---
  10998. --- END Application Phase ---
  10999. --- Output Phase ---
  11000. ENV: Agent did: predict-no for direction U in state State-B
  11001. In State-B moving U
  11002. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11003. predict error 0
  11004. dir: dir isU
  11005. --- END Output Phase ---
  11006. \-/|--- Input Phase ---
  11007. =>WM: (13707: I2 ^dir U)
  11008. =>WM: (13706: I2 ^reward 1)
  11009. =>WM: (13705: I2 ^see 0)
  11010. =>WM: (13704: N978 ^status complete)
  11011. <=WM: (13692: I2 ^dir U)
  11012. <=WM: (13691: I2 ^reward 1)
  11013. <=WM: (13690: I2 ^see 0)
  11014. =>WM: (13708: I2 ^level-1 R0-root)
  11015. <=WM: (13693: I2 ^level-1 R0-root)
  11016. --- END Input Phase ---
  11017. --- Proposal Phase ---
  11018. --- Inner Elaboration Phase, active level 1 (S1) ---
  11019. Firing elaborate*copy-see-to-output-link
  11020. -->
  11021. (I3 ^see 0 +)
  11022. Firing elaborate*reward*based*on*reward
  11023. -->
  11024. (R982 ^value 1 +)
  11025. (R1 ^reward R982 +)
  11026. Firing propose*predict-yes
  11027. -->
  11028. (O1957 ^name predict-yes +)
  11029. (S1 ^operator O1957 +)
  11030. Firing propose*predict-no
  11031. -->
  11032. (O1958 ^name predict-no +)
  11033. (S1 ^operator O1958 +)
  11034. Firing rl*prefer*rvt*predict-no*H0*4
  11035. -->
  11036. (S1 ^operator O1956 = 0.9999999999999999)
  11037. Firing rl*prefer*rvt*predict-yes*H0*3
  11038. -->
  11039. (S1 ^operator O1955 = 0.)
  11040. Firing prefer*rvt*predict-yes*H0
  11041. -->
  11042. Firing prefer*rvt*predict-no*H0
  11043. -->
  11044. Firing elaborate*copy-dir-to-output-link
  11045. -->
  11046. (I3 ^dir U +)
  11047. inner elaboration loop at bottom goal.
  11048. Retracting elaborate*copy-see-to-output-link
  11049. -->
  11050. (I3 ^see 0 +)
  11051. Retracting propose*predict-no
  11052. -->
  11053. (O1956 ^name predict-no +)
  11054. (S1 ^operator O1956 +)
  11055. Retracting propose*predict-yes
  11056. -->
  11057. (O1955 ^name predict-yes +)
  11058. (S1 ^operator O1955 +)
  11059. Retracting elaborate*reward*based*on*reward
  11060. -->
  11061. (R981 ^value 1 +)
  11062. (R1 ^reward R981 +)
  11063. Retracting elaborate*copy-dir-to-output-link
  11064. -->
  11065. (I3 ^dir U +)
  11066. Retracting rl*prefer*rvt*predict-no*H0*4
  11067. -->
  11068. (S1 ^operator O1956 = 0.9999999999999999)
  11069. Retracting rl*prefer*rvt*predict-yes*H0*3
  11070. -->
  11071. (S1 ^operator O1955 = 0.)
  11072. =>WM: (13714: S1 ^operator O1958 +)
  11073. =>WM: (13713: S1 ^operator O1957 +)
  11074. =>WM: (13712: O1958 ^name predict-no)
  11075. =>WM: (13711: O1957 ^name predict-yes)
  11076. =>WM: (13710: R982 ^value 1)
  11077. =>WM: (13709: R1 ^reward R982)
  11078. <=WM: (13700: S1 ^operator O1955 +)
  11079. <=WM: (13701: S1 ^operator O1956 +)
  11080. <=WM: (13702: S1 ^operator O1956)
  11081. <=WM: (13695: R1 ^reward R981)
  11082. <=WM: (13698: O1956 ^name predict-no)
  11083. <=WM: (13697: O1955 ^name predict-yes)
  11084. <=WM: (13696: R981 ^value 1)
  11085. --- Inner Elaboration Phase, active level 1 (S1) ---
  11086. Firing prefer*rvt*predict-yes*H0
  11087. -->
  11088. Firing rl*prefer*rvt*predict-yes*H0*3
  11089. -->
  11090. (S1 ^operator O1957 = 0.)
  11091. Firing prefer*rvt*predict-no*H0
  11092. -->
  11093. Firing rl*prefer*rvt*predict-no*H0*4
  11094. -->
  11095. (S1 ^operator O1958 = 0.9999999999999999)
  11096. inner elaboration loop at bottom goal.
  11097. Retracting rl*prefer*rvt*predict-no*H0*4
  11098. -->
  11099. (S1 ^operator O1956 = 0.9999999999999999)
  11100. Retracting rl*prefer*rvt*predict-yes*H0*3
  11101. -->
  11102. (S1 ^operator O1955 = 0.)
  11103. --- END Proposal Phase ---
  11104. --- Decision Phase ---
  11105. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11106. =>WM: (13715: S1 ^operator O1958)
  11107. 979: O: O1958 (predict-no)
  11108. --- END Decision Phase ---
  11109. --- Application Phase ---
  11110. --- Firing Productions (PE) For State At Depth 1 ---
  11111. --- Inner Elaboration Phase, active level 1 (S1) ---
  11112. Firing apply*operator
  11113. -->
  11114. (I3 ^predict-no N979 + :O )
  11115. Firing apply*operator*complete
  11116. -->
  11117. (I3 ^predict-no N978 - :O )
  11118. inner elaboration loop at bottom goal.
  11119. --- Change Working Memory (PE) ---
  11120. =>WM: (13716: I3 ^predict-no N979)
  11121. <=WM: (13704: N978 ^status complete)
  11122. <=WM: (13703: I3 ^predict-no N978)
  11123. --- Firing Productions (IE) For State At Depth 1 ---
  11124. --- Inner Elaboration Phase, active level 1 (S1) ---
  11125. Firing monitor*world
  11126. -->
  11127. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11128. --- Change Working Memory (IE) ---
  11129. --- END Application Phase ---
  11130. --- Output Phase ---
  11131. ENV: Agent did: predict-no for direction U in state State-B
  11132. In State-B moving U
  11133. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11134. predict error 0
  11135. dir: dir isL
  11136. --- END Output Phase ---
  11137. \-/--- Input Phase ---
  11138. =>WM: (13720: I2 ^dir L)
  11139. =>WM: (13719: I2 ^reward 1)
  11140. =>WM: (13718: I2 ^see 0)
  11141. =>WM: (13717: N979 ^status complete)
  11142. <=WM: (13707: I2 ^dir U)
  11143. <=WM: (13706: I2 ^reward 1)
  11144. <=WM: (13705: I2 ^see 0)
  11145. =>WM: (13721: I2 ^level-1 R0-root)
  11146. <=WM: (13708: I2 ^level-1 R0-root)
  11147. --- END Input Phase ---
  11148. --- Proposal Phase ---
  11149. --- Inner Elaboration Phase, active level 1 (S1) ---
  11150. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  11151. -->
  11152. (S1 ^operator O1957 = 0.6597532174346419)
  11153. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  11154. -->
  11155. (S1 ^operator O1958 = 0.133561435542329)
  11156. Firing prefer*rvt*predict-no*H0*2*H1
  11157. -->
  11158. Firing prefer*rvt*predict-yes*H0*1*H1
  11159. -->
  11160. Firing elaborate*copy-see-to-output-link
  11161. -->
  11162. (I3 ^see 0 +)
  11163. Firing elaborate*reward*based*on*reward
  11164. -->
  11165. (R983 ^value 1 +)
  11166. (R1 ^reward R983 +)
  11167. Firing propose*predict-yes
  11168. -->
  11169. (O1959 ^name predict-yes +)
  11170. (S1 ^operator O1959 +)
  11171. Firing propose*predict-no
  11172. -->
  11173. (O1960 ^name predict-no +)
  11174. (S1 ^operator O1960 +)
  11175. Firing rl*prefer*rvt*predict-no*H0*2
  11176. -->
  11177. (S1 ^operator O1958 = 0.3212899096504038)
  11178. Firing rl*prefer*rvt*predict-yes*H0*1
  11179. -->
  11180. (S1 ^operator O1957 = 0.3402452277833678)
  11181. Firing prefer*rvt*predict-yes*H0
  11182. -->
  11183. Firing prefer*rvt*predict-no*H0
  11184. -->
  11185. Firing elaborate*copy-dir-to-output-link
  11186. -->
  11187. (I3 ^dir L +)
  11188. inner elaboration loop at bottom goal.
  11189. Retracting elaborate*copy-see-to-output-link
  11190. -->
  11191. (I3 ^see 0 +)
  11192. Retracting propose*predict-no
  11193. -->
  11194. (O1958 ^name predict-no +)
  11195. (S1 ^operator O1958 +)
  11196. Retracting propose*predict-yes
  11197. -->
  11198. (O1957 ^name predict-yes +)
  11199. (S1 ^operator O1957 +)
  11200. Retracting elaborate*reward*based*on*reward
  11201. -->
  11202. (R982 ^value 1 +)
  11203. (R1 ^reward R982 +)
  11204. Retracting elaborate*copy-dir-to-output-link
  11205. -->
  11206. (I3 ^dir U +)
  11207. Retracting rl*prefer*rvt*predict-no*H0*4
  11208. -->
  11209. (S1 ^operator O1958 = 0.9999999999999999)
  11210. Retracting rl*prefer*rvt*predict-yes*H0*3
  11211. -->
  11212. (S1 ^operator O1957 = 0.)
  11213. =>WM: (13728: S1 ^operator O1960 +)
  11214. =>WM: (13727: S1 ^operator O1959 +)
  11215. =>WM: (13726: I3 ^dir L)
  11216. =>WM: (13725: O1960 ^name predict-no)
  11217. =>WM: (13724: O1959 ^name predict-yes)
  11218. =>WM: (13723: R983 ^value 1)
  11219. =>WM: (13722: R1 ^reward R983)
  11220. <=WM: (13713: S1 ^operator O1957 +)
  11221. <=WM: (13714: S1 ^operator O1958 +)
  11222. <=WM: (13715: S1 ^operator O1958)
  11223. <=WM: (13699: I3 ^dir U)
  11224. <=WM: (13709: R1 ^reward R982)
  11225. <=WM: (13712: O1958 ^name predict-no)
  11226. <=WM: (13711: O1957 ^name predict-yes)
  11227. <=WM: (13710: R982 ^value 1)
  11228. --- Inner Elaboration Phase, active level 1 (S1) ---
  11229. Firing prefer*rvt*predict-yes*H0
  11230. -->
  11231. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  11232. -->
  11233. (S1 ^operator O1959 = 0.6597532174346419)
  11234. Firing rl*prefer*rvt*predict-yes*H0*1
  11235. -->
  11236. (S1 ^operator O1959 = 0.3402452277833678)
  11237. Firing prefer*rvt*predict-yes*H0*1*H1
  11238. -->
  11239. Firing prefer*rvt*predict-no*H0
  11240. -->
  11241. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  11242. -->
  11243. (S1 ^operator O1960 = 0.133561435542329)
  11244. Firing rl*prefer*rvt*predict-no*H0*2
  11245. -->
  11246. (S1 ^operator O1960 = 0.3212899096504038)
  11247. Firing prefer*rvt*predict-no*H0*2*H1
  11248. -->
  11249. inner elaboration loop at bottom goal.
  11250. Retracting rl*prefer*rvt*predict-no*H0*2
  11251. -->
  11252. (S1 ^operator O1958 = 0.3212899096504038)
  11253. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  11254. -->
  11255. (S1 ^operator O1958 = 0.133561435542329)
  11256. Retracting rl*prefer*rvt*predict-yes*H0*1
  11257. -->
  11258. (S1 ^operator O1957 = 0.3402452277833678)
  11259. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  11260. -->
  11261. (S1 ^operator O1957 = 0.6597532174346419)
  11262. --- END Proposal Phase ---
  11263. --- Decision Phase ---
  11264. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11265. =>WM: (13729: S1 ^operator O1959)
  11266. 980: O: O1959 (predict-yes)
  11267. --- END Decision Phase ---
  11268. --- Application Phase ---
  11269. --- Firing Productions (PE) For State At Depth 1 ---
  11270. --- Inner Elaboration Phase, active level 1 (S1) ---
  11271. Firing apply*operator
  11272. -->
  11273. (I3 ^predict-yes N980 + :O )
  11274. Firing apply*operator*complete
  11275. -->
  11276. (I3 ^predict-no N979 - :O )
  11277. inner elaboration loop at bottom goal.
  11278. --- Change Working Memory (PE) ---
  11279. =>WM: (13730: I3 ^predict-yes N980)
  11280. <=WM: (13717: N979 ^status complete)
  11281. <=WM: (13716: I3 ^predict-no N979)
  11282. --- Firing Productions (IE) For State At Depth 1 ---
  11283. --- Inner Elaboration Phase, active level 1 (S1) ---
  11284. Firing monitor*world
  11285. -->
  11286. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11287. --- Change Working Memory (IE) ---
  11288. --- END Application Phase ---
  11289. --- Output Phase ---
  11290. ENV: Agent did: predict-yes for direction L in state State-B
  11291. In State-B moving L
  11292. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11293. predict error 0
  11294. dir: dir isR
  11295. --- END Output Phase ---
  11296. |\--- Input Phase ---
  11297. =>WM: (13734: I2 ^dir R)
  11298. =>WM: (13733: I2 ^reward 1)
  11299. =>WM: (13732: I2 ^see 1)
  11300. =>WM: (13731: N980 ^status complete)
  11301. <=WM: (13720: I2 ^dir L)
  11302. <=WM: (13719: I2 ^reward 1)
  11303. <=WM: (13718: I2 ^see 0)
  11304. =>WM: (13735: I2 ^level-1 L1-root)
  11305. <=WM: (13721: I2 ^level-1 R0-root)
  11306. --- END Input Phase ---
  11307. --- Proposal Phase ---
  11308. --- Inner Elaboration Phase, active level 1 (S1) ---
  11309. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  11310. -->
  11311. (S1 ^operator O1959 = 0.8879014306224164)
  11312. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  11313. -->
  11314. (S1 ^operator O1960 = 0.02370016355578053)
  11315. Firing prefer*rvt*predict-no*H0*6*H1
  11316. -->
  11317. Firing prefer*rvt*predict-yes*H0*5*H1
  11318. -->
  11319. Firing elaborate*copy-see-to-output-link
  11320. -->
  11321. (I3 ^see 1 +)
  11322. Firing elaborate*reward*based*on*reward
  11323. -->
  11324. (R984 ^value 1 +)
  11325. (R1 ^reward R984 +)
  11326. Firing propose*predict-yes
  11327. -->
  11328. (O1961 ^name predict-yes +)
  11329. (S1 ^operator O1961 +)
  11330. Firing propose*predict-no
  11331. -->
  11332. (O1962 ^name predict-no +)
  11333. (S1 ^operator O1962 +)
  11334. Firing rl*prefer*rvt*predict-no*H0*6
  11335. -->
  11336. (S1 ^operator O1960 = 0.3993303490983195)
  11337. Firing rl*prefer*rvt*predict-yes*H0*5
  11338. -->
  11339. (S1 ^operator O1959 = 0.1121057987244822)
  11340. Firing prefer*rvt*predict-yes*H0
  11341. -->
  11342. Firing prefer*rvt*predict-no*H0
  11343. -->
  11344. Firing elaborate*copy-dir-to-output-link
  11345. -->
  11346. (I3 ^dir R +)
  11347. inner elaboration loop at bottom goal.
  11348. Retracting elaborate*copy-see-to-output-link
  11349. -->
  11350. (I3 ^see 0 +)
  11351. Retracting propose*predict-no
  11352. -->
  11353. (O1960 ^name predict-no +)
  11354. (S1 ^operator O1960 +)
  11355. Retracting propose*predict-yes
  11356. -->
  11357. (O1959 ^name predict-yes +)
  11358. (S1 ^operator O1959 +)
  11359. Retracting elaborate*reward*based*on*reward
  11360. -->
  11361. (R983 ^value 1 +)
  11362. (R1 ^reward R983 +)
  11363. Retracting elaborate*copy-dir-to-output-link
  11364. -->
  11365. (I3 ^dir L +)
  11366. Retracting rl*prefer*rvt*predict-no*H0*2
  11367. -->
  11368. (S1 ^operator O1960 = 0.3212899096504038)
  11369. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  11370. -->
  11371. (S1 ^operator O1960 = 0.133561435542329)
  11372. Retracting rl*prefer*rvt*predict-yes*H0*1
  11373. -->
  11374. (S1 ^operator O1959 = 0.3402452277833678)
  11375. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  11376. -->
  11377. (S1 ^operator O1959 = 0.6597532174346419)
  11378. =>WM: (13743: S1 ^operator O1962 +)
  11379. =>WM: (13742: S1 ^operator O1961 +)
  11380. =>WM: (13741: I3 ^dir R)
  11381. =>WM: (13740: O1962 ^name predict-no)
  11382. =>WM: (13739: O1961 ^name predict-yes)
  11383. =>WM: (13738: R984 ^value 1)
  11384. =>WM: (13737: R1 ^reward R984)
  11385. =>WM: (13736: I3 ^see 1)
  11386. <=WM: (13727: S1 ^operator O1959 +)
  11387. <=WM: (13729: S1 ^operator O1959)
  11388. <=WM: (13728: S1 ^operator O1960 +)
  11389. <=WM: (13726: I3 ^dir L)
  11390. <=WM: (13722: R1 ^reward R983)
  11391. <=WM: (13694: I3 ^see 0)
  11392. <=WM: (13725: O1960 ^name predict-no)
  11393. <=WM: (13724: O1959 ^name predict-yes)
  11394. <=WM: (13723: R983 ^value 1)
  11395. --- Inner Elaboration Phase, active level 1 (S1) ---
  11396. Firing prefer*rvt*predict-yes*H0
  11397. -->
  11398. Firing rl*prefer*rvt*predict-yes*H0*5
  11399. -->
  11400. (S1 ^operator O1961 = 0.1121057987244822)
  11401. Firing prefer*rvt*predict-yes*H0*5*H1
  11402. -->
  11403. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  11404. -->
  11405. (S1 ^operator O1961 = 0.8879014306224164)
  11406. Firing prefer*rvt*predict-no*H0
  11407. -->
  11408. Firing rl*prefer*rvt*predict-no*H0*6
  11409. -->
  11410. (S1 ^operator O1962 = 0.3993303490983195)
  11411. Firing prefer*rvt*predict-no*H0*6*H1
  11412. -->
  11413. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  11414. -->
  11415. (S1 ^operator O1962 = 0.02370016355578053)
  11416. inner elaboration loop at bottom goal.
  11417. Retracting rl*prefer*rvt*predict-no*H0*6
  11418. -->
  11419. (S1 ^operator O1960 = 0.3993303490983195)
  11420. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  11421. -->
  11422. (S1 ^operator O1960 = 0.02370016355578053)
  11423. Retracting rl*prefer*rvt*predict-yes*H0*5
  11424. -->
  11425. (S1 ^operator O1959 = 0.1121057987244822)
  11426. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  11427. -->
  11428. (S1 ^operator O1959 = 0.8879014306224164)
  11429. --- END Proposal Phase ---
  11430. --- Decision Phase ---
  11431. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340245(R,m,v=1,0.89441,0.0950311)
  11432. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422821 0.236932 0.659753(R,m,v=1,1,0)
  11433. =>WM: (13744: S1 ^operator O1961)
  11434. 981: O: O1961 (predict-yes)
  11435. --- END Decision Phase ---
  11436. --- Application Phase ---
  11437. --- Firing Productions (PE) For State At Depth 1 ---
  11438. --- Inner Elaboration Phase, active level 1 (S1) ---
  11439. Firing apply*operator
  11440. -->
  11441. (I3 ^predict-yes N981 + :O )
  11442. Firing apply*operator*complete
  11443. -->
  11444. (I3 ^predict-yes N980 - :O )
  11445. inner elaboration loop at bottom goal.
  11446. --- Change Working Memory (PE) ---
  11447. =>WM: (13745: I3 ^predict-yes N981)
  11448. <=WM: (13731: N980 ^status complete)
  11449. <=WM: (13730: I3 ^predict-yes N980)
  11450. --- Firing Productions (IE) For State At Depth 1 ---
  11451. --- Inner Elaboration Phase, active level 1 (S1) ---
  11452. Firing monitor*world
  11453. -->
  11454. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11455. --- Change Working Memory (IE) ---
  11456. --- END Application Phase ---
  11457. --- Output Phase ---
  11458. ENV: Agent did: predict-yes for direction R in state State-A
  11459. In State-A moving R
  11460. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11461. predict error 0
  11462. dir: dir isU
  11463. --- END Output Phase ---
  11464. ---- Input Phase ---
  11465. =>WM: (13749: I2 ^dir U)
  11466. =>WM: (13748: I2 ^reward 1)
  11467. =>WM: (13747: I2 ^see 1)
  11468. =>WM: (13746: N981 ^status complete)
  11469. <=WM: (13734: I2 ^dir R)
  11470. <=WM: (13733: I2 ^reward 1)
  11471. <=WM: (13732: I2 ^see 1)
  11472. =>WM: (13750: I2 ^level-1 R1-root)
  11473. <=WM: (13735: I2 ^level-1 L1-root)
  11474. --- END Input Phase ---
  11475. --- Proposal Phase ---
  11476. --- Inner Elaboration Phase, active level 1 (S1) ---
  11477. Firing elaborate*copy-see-to-output-link
  11478. -->
  11479. (I3 ^see 1 +)
  11480. Firing elaborate*reward*based*on*reward
  11481. -->
  11482. (R985 ^value 1 +)
  11483. (R1 ^reward R985 +)
  11484. Firing propose*predict-yes
  11485. -->
  11486. (O1963 ^name predict-yes +)
  11487. (S1 ^operator O1963 +)
  11488. Firing propose*predict-no
  11489. -->
  11490. (O1964 ^name predict-no +)
  11491. (S1 ^operator O1964 +)
  11492. Firing rl*prefer*rvt*predict-no*H0*4
  11493. -->
  11494. (S1 ^operator O1962 = 0.9999999999999999)
  11495. Firing rl*prefer*rvt*predict-yes*H0*3
  11496. -->
  11497. (S1 ^operator O1961 = 0.)
  11498. Firing prefer*rvt*predict-yes*H0
  11499. -->
  11500. Firing prefer*rvt*predict-no*H0
  11501. -->
  11502. Firing elaborate*copy-dir-to-output-link
  11503. -->
  11504. (I3 ^dir U +)
  11505. inner elaboration loop at bottom goal.
  11506. Retracting elaborate*copy-see-to-output-link
  11507. -->
  11508. (I3 ^see 1 +)
  11509. Retracting propose*predict-no
  11510. -->
  11511. (O1962 ^name predict-no +)
  11512. (S1 ^operator O1962 +)
  11513. Retracting propose*predict-yes
  11514. -->
  11515. (O1961 ^name predict-yes +)
  11516. (S1 ^operator O1961 +)
  11517. Retracting elaborate*reward*based*on*reward
  11518. -->
  11519. (R984 ^value 1 +)
  11520. (R1 ^reward R984 +)
  11521. Retracting elaborate*copy-dir-to-output-link
  11522. -->
  11523. (I3 ^dir R +)
  11524. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  11525. -->
  11526. (S1 ^operator O1962 = 0.02370016355578053)
  11527. Retracting rl*prefer*rvt*predict-no*H0*6
  11528. -->
  11529. (S1 ^operator O1962 = 0.3993303490983195)
  11530. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  11531. -->
  11532. (S1 ^operator O1961 = 0.8879014306224164)
  11533. Retracting rl*prefer*rvt*predict-yes*H0*5
  11534. -->
  11535. (S1 ^operator O1961 = 0.1121057987244822)
  11536. =>WM: (13757: S1 ^operator O1964 +)
  11537. =>WM: (13756: S1 ^operator O1963 +)
  11538. =>WM: (13755: I3 ^dir U)
  11539. =>WM: (13754: O1964 ^name predict-no)
  11540. =>WM: (13753: O1963 ^name predict-yes)
  11541. =>WM: (13752: R985 ^value 1)
  11542. =>WM: (13751: R1 ^reward R985)
  11543. <=WM: (13742: S1 ^operator O1961 +)
  11544. <=WM: (13744: S1 ^operator O1961)
  11545. <=WM: (13743: S1 ^operator O1962 +)
  11546. <=WM: (13741: I3 ^dir R)
  11547. <=WM: (13737: R1 ^reward R984)
  11548. <=WM: (13740: O1962 ^name predict-no)
  11549. <=WM: (13739: O1961 ^name predict-yes)
  11550. <=WM: (13738: R984 ^value 1)
  11551. --- Inner Elaboration Phase, active level 1 (S1) ---
  11552. Firing prefer*rvt*predict-yes*H0
  11553. -->
  11554. Firing rl*prefer*rvt*predict-yes*H0*3
  11555. -->
  11556. (S1 ^operator O1963 = 0.)
  11557. Firing prefer*rvt*predict-no*H0
  11558. -->
  11559. Firing rl*prefer*rvt*predict-no*H0*4
  11560. -->
  11561. (S1 ^operator O1964 = 0.9999999999999999)
  11562. inner elaboration loop at bottom goal.
  11563. Retracting rl*prefer*rvt*predict-no*H0*4
  11564. -->
  11565. (S1 ^operator O1962 = 0.9999999999999999)
  11566. Retracting rl*prefer*rvt*predict-yes*H0*3
  11567. -->
  11568. (S1 ^operator O1961 = 0.)
  11569. --- END Proposal Phase ---
  11570. --- Decision Phase ---
  11571. RL update rl*prefer*rvt*predict-yes*H0*5 0.619029 -0.506923 0.112106 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.899371,0.0910756)
  11572. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.38098 0.506922 0.887901 -> 0.380978 0.506922 0.8879(R,m,v=1,1,0)
  11573. =>WM: (13758: S1 ^operator O1964)
  11574. 982: O: O1964 (predict-no)
  11575. --- END Decision Phase ---
  11576. --- Application Phase ---
  11577. --- Firing Productions (PE) For State At Depth 1 ---
  11578. --- Inner Elaboration Phase, active level 1 (S1) ---
  11579. Firing apply*operator
  11580. -->
  11581. (I3 ^predict-no N982 + :O )
  11582. Firing apply*operator*complete
  11583. -->
  11584. (I3 ^predict-yes N981 - :O )
  11585. inner elaboration loop at bottom goal.
  11586. --- Change Working Memory (PE) ---
  11587. =>WM: (13759: I3 ^predict-no N982)
  11588. <=WM: (13746: N981 ^status complete)
  11589. <=WM: (13745: I3 ^predict-yes N981)
  11590. --- Firing Productions (IE) For State At Depth 1 ---
  11591. --- Inner Elaboration Phase, active level 1 (S1) ---
  11592. Firing monitor*world
  11593. -->
  11594. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11595. --- Change Working Memory (IE) ---
  11596. --- END Application Phase ---
  11597. --- Output Phase ---
  11598. ENV: Agent did: predict-no for direction U in state State-B
  11599. In State-B moving U
  11600. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11601. predict error 0
  11602. dir: dir isR
  11603. --- END Output Phase ---
  11604. /|--- Input Phase ---
  11605. =>WM: (13763: I2 ^dir R)
  11606. =>WM: (13762: I2 ^reward 1)
  11607. =>WM: (13761: I2 ^see 0)
  11608. =>WM: (13760: N982 ^status complete)
  11609. <=WM: (13749: I2 ^dir U)
  11610. <=WM: (13748: I2 ^reward 1)
  11611. <=WM: (13747: I2 ^see 1)
  11612. =>WM: (13764: I2 ^level-1 R1-root)
  11613. <=WM: (13750: I2 ^level-1 R1-root)
  11614. --- END Input Phase ---
  11615. --- Proposal Phase ---
  11616. --- Inner Elaboration Phase, active level 1 (S1) ---
  11617. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  11618. -->
  11619. (S1 ^operator O1964 = 0.6006747262322989)
  11620. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  11621. -->
  11622. (S1 ^operator O1963 = 0.1602187148382515)
  11623. Firing prefer*rvt*predict-no*H0*6*H1
  11624. -->
  11625. Firing prefer*rvt*predict-yes*H0*5*H1
  11626. -->
  11627. Firing elaborate*copy-see-to-output-link
  11628. -->
  11629. (I3 ^see 0 +)
  11630. Firing elaborate*reward*based*on*reward
  11631. -->
  11632. (R986 ^value 1 +)
  11633. (R1 ^reward R986 +)
  11634. Firing propose*predict-yes
  11635. -->
  11636. (O1965 ^name predict-yes +)
  11637. (S1 ^operator O1965 +)
  11638. Firing propose*predict-no
  11639. -->
  11640. (O1966 ^name predict-no +)
  11641. (S1 ^operator O1966 +)
  11642. Firing rl*prefer*rvt*predict-no*H0*6
  11643. -->
  11644. (S1 ^operator O1964 = 0.3993303490983195)
  11645. Firing rl*prefer*rvt*predict-yes*H0*5
  11646. -->
  11647. (S1 ^operator O1963 = 0.1121047143224474)
  11648. Firing prefer*rvt*predict-yes*H0
  11649. -->
  11650. Firing prefer*rvt*predict-no*H0
  11651. -->
  11652. Firing elaborate*copy-dir-to-output-link
  11653. -->
  11654. (I3 ^dir R +)
  11655. inner elaboration loop at bottom goal.
  11656. Retracting elaborate*copy-see-to-output-link
  11657. -->
  11658. (I3 ^see 1 +)
  11659. Retracting propose*predict-no
  11660. -->
  11661. (O1964 ^name predict-no +)
  11662. (S1 ^operator O1964 +)
  11663. Retracting propose*predict-yes
  11664. -->
  11665. (O1963 ^name predict-yes +)
  11666. (S1 ^operator O1963 +)
  11667. Retracting elaborate*reward*based*on*reward
  11668. -->
  11669. (R985 ^value 1 +)
  11670. (R1 ^reward R985 +)
  11671. Retracting elaborate*copy-dir-to-output-link
  11672. -->
  11673. (I3 ^dir U +)
  11674. Retracting rl*prefer*rvt*predict-no*H0*4
  11675. -->
  11676. (S1 ^operator O1964 = 0.9999999999999999)
  11677. Retracting rl*prefer*rvt*predict-yes*H0*3
  11678. -->
  11679. (S1 ^operator O1963 = 0.)
  11680. =>WM: (13772: S1 ^operator O1966 +)
  11681. =>WM: (13771: S1 ^operator O1965 +)
  11682. =>WM: (13770: I3 ^dir R)
  11683. =>WM: (13769: O1966 ^name predict-no)
  11684. =>WM: (13768: O1965 ^name predict-yes)
  11685. =>WM: (13767: R986 ^value 1)
  11686. =>WM: (13766: R1 ^reward R986)
  11687. =>WM: (13765: I3 ^see 0)
  11688. <=WM: (13756: S1 ^operator O1963 +)
  11689. <=WM: (13757: S1 ^operator O1964 +)
  11690. <=WM: (13758: S1 ^operator O1964)
  11691. <=WM: (13755: I3 ^dir U)
  11692. <=WM: (13751: R1 ^reward R985)
  11693. <=WM: (13736: I3 ^see 1)
  11694. <=WM: (13754: O1964 ^name predict-no)
  11695. <=WM: (13753: O1963 ^name predict-yes)
  11696. <=WM: (13752: R985 ^value 1)
  11697. --- Inner Elaboration Phase, active level 1 (S1) ---
  11698. Firing prefer*rvt*predict-yes*H0
  11699. -->
  11700. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  11701. -->
  11702. (S1 ^operator O1965 = 0.1602187148382515)
  11703. Firing rl*prefer*rvt*predict-yes*H0*5
  11704. -->
  11705. (S1 ^operator O1965 = 0.1121047143224474)
  11706. Firing prefer*rvt*predict-yes*H0*5*H1
  11707. -->
  11708. Firing prefer*rvt*predict-no*H0
  11709. -->
  11710. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  11711. -->
  11712. (S1 ^operator O1966 = 0.6006747262322989)
  11713. Firing rl*prefer*rvt*predict-no*H0*6
  11714. -->
  11715. (S1 ^operator O1966 = 0.3993303490983195)
  11716. Firing prefer*rvt*predict-no*H0*6*H1
  11717. -->
  11718. inner elaboration loop at bottom goal.
  11719. Retracting rl*prefer*rvt*predict-no*H0*6
  11720. -->
  11721. (S1 ^operator O1964 = 0.3993303490983195)
  11722. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  11723. -->
  11724. (S1 ^operator O1964 = 0.6006747262322989)
  11725. Retracting rl*prefer*rvt*predict-yes*H0*5
  11726. -->
  11727. (S1 ^operator O1963 = 0.1121047143224474)
  11728. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  11729. -->
  11730. (S1 ^operator O1963 = 0.1602187148382515)
  11731. --- END Proposal Phase ---
  11732. --- Decision Phase ---
  11733. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11734. =>WM: (13773: S1 ^operator O1966)
  11735. 983: O: O1966 (predict-no)
  11736. --- END Decision Phase ---
  11737. --- Application Phase ---
  11738. --- Firing Productions (PE) For State At Depth 1 ---
  11739. --- Inner Elaboration Phase, active level 1 (S1) ---
  11740. Firing apply*operator
  11741. -->
  11742. (I3 ^predict-no N983 + :O )
  11743. Firing apply*operator*complete
  11744. -->
  11745. (I3 ^predict-no N982 - :O )
  11746. inner elaboration loop at bottom goal.
  11747. --- Change Working Memory (PE) ---
  11748. =>WM: (13774: I3 ^predict-no N983)
  11749. <=WM: (13760: N982 ^status complete)
  11750. <=WM: (13759: I3 ^predict-no N982)
  11751. --- Firing Productions (IE) For State At Depth 1 ---
  11752. --- Inner Elaboration Phase, active level 1 (S1) ---
  11753. Firing monitor*world
  11754. -->
  11755. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11756. --- Change Working Memory (IE) ---
  11757. --- END Application Phase ---
  11758. --- Output Phase ---
  11759. ENV: Agent did: predict-no for direction R in state State-B
  11760. In State-B moving R
  11761. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11762. predict error 0
  11763. dir: dir isL
  11764. --- END Output Phase ---
  11765. \-/--- Input Phase ---
  11766. =>WM: (13778: I2 ^dir L)
  11767. =>WM: (13777: I2 ^reward 1)
  11768. =>WM: (13776: I2 ^see 0)
  11769. =>WM: (13775: N983 ^status complete)
  11770. <=WM: (13763: I2 ^dir R)
  11771. <=WM: (13762: I2 ^reward 1)
  11772. <=WM: (13761: I2 ^see 0)
  11773. =>WM: (13779: I2 ^level-1 R0-root)
  11774. <=WM: (13764: I2 ^level-1 R1-root)
  11775. --- END Input Phase ---
  11776. --- Proposal Phase ---
  11777. --- Inner Elaboration Phase, active level 1 (S1) ---
  11778. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  11779. -->
  11780. (S1 ^operator O1965 = 0.6597534506519405)
  11781. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  11782. -->
  11783. (S1 ^operator O1966 = 0.133561435542329)
  11784. Firing prefer*rvt*predict-no*H0*2*H1
  11785. -->
  11786. Firing prefer*rvt*predict-yes*H0*1*H1
  11787. -->
  11788. Firing elaborate*copy-see-to-output-link
  11789. -->
  11790. (I3 ^see 0 +)
  11791. Firing elaborate*reward*based*on*reward
  11792. -->
  11793. (R987 ^value 1 +)
  11794. (R1 ^reward R987 +)
  11795. Firing propose*predict-yes
  11796. -->
  11797. (O1967 ^name predict-yes +)
  11798. (S1 ^operator O1967 +)
  11799. Firing propose*predict-no
  11800. -->
  11801. (O1968 ^name predict-no +)
  11802. (S1 ^operator O1968 +)
  11803. Firing rl*prefer*rvt*predict-no*H0*2
  11804. -->
  11805. (S1 ^operator O1966 = 0.3212899096504038)
  11806. Firing rl*prefer*rvt*predict-yes*H0*1
  11807. -->
  11808. (S1 ^operator O1965 = 0.3402454610006663)
  11809. Firing prefer*rvt*predict-yes*H0
  11810. -->
  11811. Firing prefer*rvt*predict-no*H0
  11812. -->
  11813. Firing elaborate*copy-dir-to-output-link
  11814. -->
  11815. (I3 ^dir L +)
  11816. inner elaboration loop at bottom goal.
  11817. Retracting elaborate*copy-see-to-output-link
  11818. -->
  11819. (I3 ^see 0 +)
  11820. Retracting propose*predict-no
  11821. -->
  11822. (O1966 ^name predict-no +)
  11823. (S1 ^operator O1966 +)
  11824. Retracting propose*predict-yes
  11825. -->
  11826. (O1965 ^name predict-yes +)
  11827. (S1 ^operator O1965 +)
  11828. Retracting elaborate*reward*based*on*reward
  11829. -->
  11830. (R986 ^value 1 +)
  11831. (R1 ^reward R986 +)
  11832. Retracting elaborate*copy-dir-to-output-link
  11833. -->
  11834. (I3 ^dir R +)
  11835. Retracting rl*prefer*rvt*predict-no*H0*6
  11836. -->
  11837. (S1 ^operator O1966 = 0.3993303490983195)
  11838. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  11839. -->
  11840. (S1 ^operator O1966 = 0.6006747262322989)
  11841. Retracting rl*prefer*rvt*predict-yes*H0*5
  11842. -->
  11843. (S1 ^operator O1965 = 0.1121047143224474)
  11844. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  11845. -->
  11846. (S1 ^operator O1965 = 0.1602187148382515)
  11847. =>WM: (13786: S1 ^operator O1968 +)
  11848. =>WM: (13785: S1 ^operator O1967 +)
  11849. =>WM: (13784: I3 ^dir L)
  11850. =>WM: (13783: O1968 ^name predict-no)
  11851. =>WM: (13782: O1967 ^name predict-yes)
  11852. =>WM: (13781: R987 ^value 1)
  11853. =>WM: (13780: R1 ^reward R987)
  11854. <=WM: (13771: S1 ^operator O1965 +)
  11855. <=WM: (13772: S1 ^operator O1966 +)
  11856. <=WM: (13773: S1 ^operator O1966)
  11857. <=WM: (13770: I3 ^dir R)
  11858. <=WM: (13766: R1 ^reward R986)
  11859. <=WM: (13769: O1966 ^name predict-no)
  11860. <=WM: (13768: O1965 ^name predict-yes)
  11861. <=WM: (13767: R986 ^value 1)
  11862. --- Inner Elaboration Phase, active level 1 (S1) ---
  11863. Firing prefer*rvt*predict-yes*H0
  11864. -->
  11865. Firing rl*prefer*rvt*predict-yes*H0*1
  11866. -->
  11867. (S1 ^operator O1967 = 0.3402454610006663)
  11868. Firing prefer*rvt*predict-yes*H0*1*H1
  11869. -->
  11870. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  11871. -->
  11872. (S1 ^operator O1967 = 0.6597534506519405)
  11873. Firing prefer*rvt*predict-no*H0
  11874. -->
  11875. Firing rl*prefer*rvt*predict-no*H0*2
  11876. -->
  11877. (S1 ^operator O1968 = 0.3212899096504038)
  11878. Firing prefer*rvt*predict-no*H0*2*H1
  11879. -->
  11880. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  11881. -->
  11882. (S1 ^operator O1968 = 0.133561435542329)
  11883. inner elaboration loop at bottom goal.
  11884. Retracting rl*prefer*rvt*predict-no*H0*2
  11885. -->
  11886. (S1 ^operator O1966 = 0.3212899096504038)
  11887. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  11888. -->
  11889. (S1 ^operator O1966 = 0.133561435542329)
  11890. Retracting rl*prefer*rvt*predict-yes*H0*1
  11891. -->
  11892. (S1 ^operator O1965 = 0.3402454610006663)
  11893. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  11894. -->
  11895. (S1 ^operator O1965 = 0.6597534506519405)
  11896. --- END Proposal Phase ---
  11897. --- Decision Phase ---
  11898. RL update rl*prefer*rvt*predict-no*H0*6 0.558039 -0.158709 0.39933 -> 0.558038 -0.158709 0.39933(R,m,v=1,0.927711,0.0674699)
  11899. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441966 0.158709 0.600675 -> 0.441965 0.158709 0.600674(R,m,v=1,1,0)
  11900. =>WM: (13787: S1 ^operator O1967)
  11901. 984: O: O1967 (predict-yes)
  11902. --- END Decision Phase ---
  11903. --- Application Phase ---
  11904. --- Firing Productions (PE) For State At Depth 1 ---
  11905. --- Inner Elaboration Phase, active level 1 (S1) ---
  11906. Firing apply*operator
  11907. -->
  11908. (I3 ^predict-yes N984 + :O )
  11909. Firing apply*operator*complete
  11910. -->
  11911. (I3 ^predict-no N983 - :O )
  11912. inner elaboration loop at bottom goal.
  11913. --- Change Working Memory (PE) ---
  11914. =>WM: (13788: I3 ^predict-yes N984)
  11915. <=WM: (13775: N983 ^status complete)
  11916. <=WM: (13774: I3 ^predict-no N983)
  11917. --- Firing Productions (IE) For State At Depth 1 ---
  11918. --- Inner Elaboration Phase, active level 1 (S1) ---
  11919. Firing monitor*world
  11920. -->
  11921. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11922. --- Change Working Memory (IE) ---
  11923. --- END Application Phase ---
  11924. --- Output Phase ---
  11925. ENV: Agent did: predict-yes for direction L in state State-B
  11926. In State-B moving L
  11927. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11928. predict error 0
  11929. dir: dir isU
  11930. --- END Output Phase ---
  11931. |\---- Input Phase ---
  11932. =>WM: (13792: I2 ^dir U)
  11933. =>WM: (13791: I2 ^reward 1)
  11934. =>WM: (13790: I2 ^see 1)
  11935. =>WM: (13789: N984 ^status complete)
  11936. <=WM: (13778: I2 ^dir L)
  11937. <=WM: (13777: I2 ^reward 1)
  11938. <=WM: (13776: I2 ^see 0)
  11939. =>WM: (13793: I2 ^level-1 L1-root)
  11940. <=WM: (13779: I2 ^level-1 R0-root)
  11941. --- END Input Phase ---
  11942. --- Proposal Phase ---
  11943. --- Inner Elaboration Phase, active level 1 (S1) ---
  11944. Firing elaborate*copy-see-to-output-link
  11945. -->
  11946. (I3 ^see 1 +)
  11947. Firing elaborate*reward*based*on*reward
  11948. -->
  11949. (R988 ^value 1 +)
  11950. (R1 ^reward R988 +)
  11951. Firing propose*predict-yes
  11952. -->
  11953. (O1969 ^name predict-yes +)
  11954. (S1 ^operator O1969 +)
  11955. Firing propose*predict-no
  11956. -->
  11957. (O1970 ^name predict-no +)
  11958. (S1 ^operator O1970 +)
  11959. Firing rl*prefer*rvt*predict-no*H0*4
  11960. -->
  11961. (S1 ^operator O1968 = 0.9999999999999999)
  11962. Firing rl*prefer*rvt*predict-yes*H0*3
  11963. -->
  11964. (S1 ^operator O1967 = 0.)
  11965. Firing prefer*rvt*predict-yes*H0
  11966. -->
  11967. Firing prefer*rvt*predict-no*H0
  11968. -->
  11969. Firing elaborate*copy-dir-to-output-link
  11970. -->
  11971. (I3 ^dir U +)
  11972. inner elaboration loop at bottom goal.
  11973. Retracting elaborate*copy-see-to-output-link
  11974. -->
  11975. (I3 ^see 0 +)
  11976. Retracting propose*predict-no
  11977. -->
  11978. (O1968 ^name predict-no +)
  11979. (S1 ^operator O1968 +)
  11980. Retracting propose*predict-yes
  11981. -->
  11982. (O1967 ^name predict-yes +)
  11983. (S1 ^operator O1967 +)
  11984. Retracting elaborate*reward*based*on*reward
  11985. -->
  11986. (R987 ^value 1 +)
  11987. (R1 ^reward R987 +)
  11988. Retracting elaborate*copy-dir-to-output-link
  11989. -->
  11990. (I3 ^dir L +)
  11991. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  11992. -->
  11993. (S1 ^operator O1968 = 0.133561435542329)
  11994. Retracting rl*prefer*rvt*predict-no*H0*2
  11995. -->
  11996. (S1 ^operator O1968 = 0.3212899096504038)
  11997. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  11998. -->
  11999. (S1 ^operator O1967 = 0.6597534506519405)
  12000. Retracting rl*prefer*rvt*predict-yes*H0*1
  12001. -->
  12002. (S1 ^operator O1967 = 0.3402454610006663)
  12003. =>WM: (13801: S1 ^operator O1970 +)
  12004. =>WM: (13800: S1 ^operator O1969 +)
  12005. =>WM: (13799: I3 ^dir U)
  12006. =>WM: (13798: O1970 ^name predict-no)
  12007. =>WM: (13797: O1969 ^name predict-yes)
  12008. =>WM: (13796: R988 ^value 1)
  12009. =>WM: (13795: R1 ^reward R988)
  12010. =>WM: (13794: I3 ^see 1)
  12011. <=WM: (13785: S1 ^operator O1967 +)
  12012. <=WM: (13787: S1 ^operator O1967)
  12013. <=WM: (13786: S1 ^operator O1968 +)
  12014. <=WM: (13784: I3 ^dir L)
  12015. <=WM: (13780: R1 ^reward R987)
  12016. <=WM: (13765: I3 ^see 0)
  12017. <=WM: (13783: O1968 ^name predict-no)
  12018. <=WM: (13782: O1967 ^name predict-yes)
  12019. <=WM: (13781: R987 ^value 1)
  12020. --- Inner Elaboration Phase, active level 1 (S1) ---
  12021. Firing prefer*rvt*predict-yes*H0
  12022. -->
  12023. Firing rl*prefer*rvt*predict-yes*H0*3
  12024. -->
  12025. (S1 ^operator O1969 = 0.)
  12026. Firing prefer*rvt*predict-no*H0
  12027. -->
  12028. Firing rl*prefer*rvt*predict-no*H0*4
  12029. -->
  12030. (S1 ^operator O1970 = 0.9999999999999999)
  12031. inner elaboration loop at bottom goal.
  12032. Retracting rl*prefer*rvt*predict-no*H0*4
  12033. -->
  12034. (S1 ^operator O1968 = 0.9999999999999999)
  12035. Retracting rl*prefer*rvt*predict-yes*H0*3
  12036. -->
  12037. (S1 ^operator O1967 = 0.)
  12038. --- END Proposal Phase ---
  12039. --- Decision Phase ---
  12040. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340245 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.895062,0.0945096)
  12041. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422821 0.236932 0.659753 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
  12042. =>WM: (13802: S1 ^operator O1970)
  12043. 985: O: O1970 (predict-no)
  12044. --- END Decision Phase ---
  12045. --- Application Phase ---
  12046. --- Firing Productions (PE) For State At Depth 1 ---
  12047. --- Inner Elaboration Phase, active level 1 (S1) ---
  12048. Firing apply*operator
  12049. -->
  12050. (I3 ^predict-no N985 + :O )
  12051. Firing apply*operator*complete
  12052. -->
  12053. (I3 ^predict-yes N984 - :O )
  12054. inner elaboration loop at bottom goal.
  12055. --- Change Working Memory (PE) ---
  12056. =>WM: (13803: I3 ^predict-no N985)
  12057. <=WM: (13789: N984 ^status complete)
  12058. <=WM: (13788: I3 ^predict-yes N984)
  12059. --- Firing Productions (IE) For State At Depth 1 ---
  12060. --- Inner Elaboration Phase, active level 1 (S1) ---
  12061. Firing monitor*world
  12062. -->
  12063. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12064. --- Change Working Memory (IE) ---
  12065. --- END Application Phase ---
  12066. --- Output Phase ---
  12067. ENV: Agent did: predict-no for direction U in state State-A
  12068. In State-A moving U
  12069. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12070. predict error 0
  12071. dir: dir isR
  12072. --- END Output Phase ---
  12073. /|--- Input Phase ---
  12074. =>WM: (13807: I2 ^dir R)
  12075. =>WM: (13806: I2 ^reward 1)
  12076. =>WM: (13805: I2 ^see 0)
  12077. =>WM: (13804: N985 ^status complete)
  12078. <=WM: (13792: I2 ^dir U)
  12079. <=WM: (13791: I2 ^reward 1)
  12080. <=WM: (13790: I2 ^see 1)
  12081. =>WM: (13808: I2 ^level-1 L1-root)
  12082. <=WM: (13793: I2 ^level-1 L1-root)
  12083. --- END Input Phase ---
  12084. --- Proposal Phase ---
  12085. --- Inner Elaboration Phase, active level 1 (S1) ---
  12086. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  12087. -->
  12088. (S1 ^operator O1969 = 0.8879003462203817)
  12089. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  12090. -->
  12091. (S1 ^operator O1970 = 0.02370016355578053)
  12092. Firing prefer*rvt*predict-no*H0*6*H1
  12093. -->
  12094. Firing prefer*rvt*predict-yes*H0*5*H1
  12095. -->
  12096. Firing elaborate*copy-see-to-output-link
  12097. -->
  12098. (I3 ^see 0 +)
  12099. Firing elaborate*reward*based*on*reward
  12100. -->
  12101. (R989 ^value 1 +)
  12102. (R1 ^reward R989 +)
  12103. Firing propose*predict-yes
  12104. -->
  12105. (O1971 ^name predict-yes +)
  12106. (S1 ^operator O1971 +)
  12107. Firing propose*predict-no
  12108. -->
  12109. (O1972 ^name predict-no +)
  12110. (S1 ^operator O1972 +)
  12111. Firing rl*prefer*rvt*predict-no*H0*6
  12112. -->
  12113. (S1 ^operator O1970 = 0.3993295877987267)
  12114. Firing rl*prefer*rvt*predict-yes*H0*5
  12115. -->
  12116. (S1 ^operator O1969 = 0.1121047143224474)
  12117. Firing prefer*rvt*predict-yes*H0
  12118. -->
  12119. Firing prefer*rvt*predict-no*H0
  12120. -->
  12121. Firing elaborate*copy-dir-to-output-link
  12122. -->
  12123. (I3 ^dir R +)
  12124. inner elaboration loop at bottom goal.
  12125. Retracting elaborate*copy-see-to-output-link
  12126. -->
  12127. (I3 ^see 1 +)
  12128. Retracting propose*predict-no
  12129. -->
  12130. (O1970 ^name predict-no +)
  12131. (S1 ^operator O1970 +)
  12132. Retracting propose*predict-yes
  12133. -->
  12134. (O1969 ^name predict-yes +)
  12135. (S1 ^operator O1969 +)
  12136. Retracting elaborate*reward*based*on*reward
  12137. -->
  12138. (R988 ^value 1 +)
  12139. (R1 ^reward R988 +)
  12140. Retracting elaborate*copy-dir-to-output-link
  12141. -->
  12142. (I3 ^dir U +)
  12143. Retracting rl*prefer*rvt*predict-no*H0*4
  12144. -->
  12145. (S1 ^operator O1970 = 0.9999999999999999)
  12146. Retracting rl*prefer*rvt*predict-yes*H0*3
  12147. -->
  12148. (S1 ^operator O1969 = 0.)
  12149. =>WM: (13816: S1 ^operator O1972 +)
  12150. =>WM: (13815: S1 ^operator O1971 +)
  12151. =>WM: (13814: I3 ^dir R)
  12152. =>WM: (13813: O1972 ^name predict-no)
  12153. =>WM: (13812: O1971 ^name predict-yes)
  12154. =>WM: (13811: R989 ^value 1)
  12155. =>WM: (13810: R1 ^reward R989)
  12156. =>WM: (13809: I3 ^see 0)
  12157. <=WM: (13800: S1 ^operator O1969 +)
  12158. <=WM: (13801: S1 ^operator O1970 +)
  12159. <=WM: (13802: S1 ^operator O1970)
  12160. <=WM: (13799: I3 ^dir U)
  12161. <=WM: (13795: R1 ^reward R988)
  12162. <=WM: (13794: I3 ^see 1)
  12163. <=WM: (13798: O1970 ^name predict-no)
  12164. <=WM: (13797: O1969 ^name predict-yes)
  12165. <=WM: (13796: R988 ^value 1)
  12166. --- Inner Elaboration Phase, active level 1 (S1) ---
  12167. Firing prefer*rvt*predict-yes*H0
  12168. -->
  12169. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  12170. -->
  12171. (S1 ^operator O1971 = 0.8879003462203817)
  12172. Firing rl*prefer*rvt*predict-yes*H0*5
  12173. -->
  12174. (S1 ^operator O1971 = 0.1121047143224474)
  12175. Firing prefer*rvt*predict-yes*H0*5*H1
  12176. -->
  12177. Firing prefer*rvt*predict-no*H0
  12178. -->
  12179. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  12180. -->
  12181. (S1 ^operator O1972 = 0.02370016355578053)
  12182. Firing rl*prefer*rvt*predict-no*H0*6
  12183. -->
  12184. (S1 ^operator O1972 = 0.3993295877987267)
  12185. Firing prefer*rvt*predict-no*H0*6*H1
  12186. -->
  12187. inner elaboration loop at bottom goal.
  12188. Retracting rl*prefer*rvt*predict-no*H0*6
  12189. -->
  12190. (S1 ^operator O1970 = 0.3993295877987267)
  12191. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  12192. -->
  12193. (S1 ^operator O1970 = 0.02370016355578053)
  12194. Retracting rl*prefer*rvt*predict-yes*H0*5
  12195. -->
  12196. (S1 ^operator O1969 = 0.1121047143224474)
  12197. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  12198. -->
  12199. (S1 ^operator O1969 = 0.8879003462203817)
  12200. --- END Proposal Phase ---
  12201. --- Decision Phase ---
  12202. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12203. =>WM: (13817: S1 ^operator O1971)
  12204. 986: O: O1971 (predict-yes)
  12205. --- END Decision Phase ---
  12206. --- Application Phase ---
  12207. --- Firing Productions (PE) For State At Depth 1 ---
  12208. --- Inner Elaboration Phase, active level 1 (S1) ---
  12209. Firing apply*operator
  12210. -->
  12211. (I3 ^predict-yes N986 + :O )
  12212. Firing apply*operator*complete
  12213. -->
  12214. (I3 ^predict-no N985 - :O )
  12215. inner elaboration loop at bottom goal.
  12216. --- Change Working Memory (PE) ---
  12217. =>WM: (13818: I3 ^predict-yes N986)
  12218. <=WM: (13804: N985 ^status complete)
  12219. <=WM: (13803: I3 ^predict-no N985)
  12220. --- Firing Productions (IE) For State At Depth 1 ---
  12221. --- Inner Elaboration Phase, active level 1 (S1) ---
  12222. Firing monitor*world
  12223. -->
  12224. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12225. --- Change Working Memory (IE) ---
  12226. --- END Application Phase ---
  12227. --- Output Phase ---
  12228. ENV: Agent did: predict-yes for direction R in state State-A
  12229. In State-A moving R
  12230. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12231. predict error 0
  12232. dir: dir isR
  12233. --- END Output Phase ---
  12234. \-/--- Input Phase ---
  12235. =>WM: (13822: I2 ^dir R)
  12236. =>WM: (13821: I2 ^reward 1)
  12237. =>WM: (13820: I2 ^see 1)
  12238. =>WM: (13819: N986 ^status complete)
  12239. <=WM: (13807: I2 ^dir R)
  12240. <=WM: (13806: I2 ^reward 1)
  12241. <=WM: (13805: I2 ^see 0)
  12242. =>WM: (13823: I2 ^level-1 R1-root)
  12243. <=WM: (13808: I2 ^level-1 L1-root)
  12244. --- END Input Phase ---
  12245. --- Proposal Phase ---
  12246. --- Inner Elaboration Phase, active level 1 (S1) ---
  12247. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  12248. -->
  12249. (S1 ^operator O1972 = 0.600673964932706)
  12250. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  12251. -->
  12252. (S1 ^operator O1971 = 0.1602187148382515)
  12253. Firing prefer*rvt*predict-no*H0*6*H1
  12254. -->
  12255. Firing prefer*rvt*predict-yes*H0*5*H1
  12256. -->
  12257. Firing elaborate*copy-see-to-output-link
  12258. -->
  12259. (I3 ^see 1 +)
  12260. Firing elaborate*reward*based*on*reward
  12261. -->
  12262. (R990 ^value 1 +)
  12263. (R1 ^reward R990 +)
  12264. Firing propose*predict-yes
  12265. -->
  12266. (O1973 ^name predict-yes +)
  12267. (S1 ^operator O1973 +)
  12268. Firing propose*predict-no
  12269. -->
  12270. (O1974 ^name predict-no +)
  12271. (S1 ^operator O1974 +)
  12272. Firing rl*prefer*rvt*predict-no*H0*6
  12273. -->
  12274. (S1 ^operator O1972 = 0.3993295877987267)
  12275. Firing rl*prefer*rvt*predict-yes*H0*5
  12276. -->
  12277. (S1 ^operator O1971 = 0.1121047143224474)
  12278. Firing prefer*rvt*predict-yes*H0
  12279. -->
  12280. Firing prefer*rvt*predict-no*H0
  12281. -->
  12282. Firing elaborate*copy-dir-to-output-link
  12283. -->
  12284. (I3 ^dir R +)
  12285. inner elaboration loop at bottom goal.
  12286. Retracting elaborate*copy-see-to-output-link
  12287. -->
  12288. (I3 ^see 0 +)
  12289. Retracting propose*predict-no
  12290. -->
  12291. (O1972 ^name predict-no +)
  12292. (S1 ^operator O1972 +)
  12293. Retracting propose*predict-yes
  12294. -->
  12295. (O1971 ^name predict-yes +)
  12296. (S1 ^operator O1971 +)
  12297. Retracting elaborate*reward*based*on*reward
  12298. -->
  12299. (R989 ^value 1 +)
  12300. (R1 ^reward R989 +)
  12301. Retracting elaborate*copy-dir-to-output-link
  12302. -->
  12303. (I3 ^dir R +)
  12304. Retracting rl*prefer*rvt*predict-no*H0*6
  12305. -->
  12306. (S1 ^operator O1972 = 0.3993295877987267)
  12307. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  12308. -->
  12309. (S1 ^operator O1972 = 0.02370016355578053)
  12310. Retracting rl*prefer*rvt*predict-yes*H0*5
  12311. -->
  12312. (S1 ^operator O1971 = 0.1121047143224474)
  12313. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  12314. -->
  12315. (S1 ^operator O1971 = 0.8879003462203817)
  12316. =>WM: (13830: S1 ^operator O1974 +)
  12317. =>WM: (13829: S1 ^operator O1973 +)
  12318. =>WM: (13828: O1974 ^name predict-no)
  12319. =>WM: (13827: O1973 ^name predict-yes)
  12320. =>WM: (13826: R990 ^value 1)
  12321. =>WM: (13825: R1 ^reward R990)
  12322. =>WM: (13824: I3 ^see 1)
  12323. <=WM: (13815: S1 ^operator O1971 +)
  12324. <=WM: (13817: S1 ^operator O1971)
  12325. <=WM: (13816: S1 ^operator O1972 +)
  12326. <=WM: (13810: R1 ^reward R989)
  12327. <=WM: (13809: I3 ^see 0)
  12328. <=WM: (13813: O1972 ^name predict-no)
  12329. <=WM: (13812: O1971 ^name predict-yes)
  12330. <=WM: (13811: R989 ^value 1)
  12331. --- Inner Elaboration Phase, active level 1 (S1) ---
  12332. Firing prefer*rvt*predict-yes*H0
  12333. -->
  12334. Firing rl*prefer*rvt*predict-yes*H0*5
  12335. -->
  12336. (S1 ^operator O1973 = 0.1121047143224474)
  12337. Firing prefer*rvt*predict-yes*H0*5*H1
  12338. -->
  12339. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  12340. -->
  12341. (S1 ^operator O1973 = 0.1602187148382515)
  12342. Firing prefer*rvt*predict-no*H0
  12343. -->
  12344. Firing rl*prefer*rvt*predict-no*H0*6
  12345. -->
  12346. (S1 ^operator O1974 = 0.3993295877987267)
  12347. Firing prefer*rvt*predict-no*H0*6*H1
  12348. -->
  12349. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  12350. -->
  12351. (S1 ^operator O1974 = 0.600673964932706)
  12352. inner elaboration loop at bottom goal.
  12353. Retracting rl*prefer*rvt*predict-no*H0*6
  12354. -->
  12355. (S1 ^operator O1972 = 0.3993295877987267)
  12356. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  12357. -->
  12358. (S1 ^operator O1972 = 0.600673964932706)
  12359. Retracting rl*prefer*rvt*predict-yes*H0*5
  12360. -->
  12361. (S1 ^operator O1971 = 0.1121047143224474)
  12362. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  12363. -->
  12364. (S1 ^operator O1971 = 0.1602187148382515)
  12365. --- END Proposal Phase ---
  12366. --- Decision Phase ---
  12367. RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.619027 -0.506923 0.112104(R,m,v=1,0.9,0.090566)
  12368. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380978 0.506922 0.8879 -> 0.380978 0.506922 0.8879(R,m,v=1,1,0)
  12369. =>WM: (13831: S1 ^operator O1974)
  12370. 987: O: O1974 (predict-no)
  12371. --- END Decision Phase ---
  12372. --- Application Phase ---
  12373. --- Firing Productions (PE) For State At Depth 1 ---
  12374. --- Inner Elaboration Phase, active level 1 (S1) ---
  12375. Firing apply*operator
  12376. -->
  12377. (I3 ^predict-no N987 + :O )
  12378. Firing apply*operator*complete
  12379. -->
  12380. (I3 ^predict-yes N986 - :O )
  12381. inner elaboration loop at bottom goal.
  12382. --- Change Working Memory (PE) ---
  12383. =>WM: (13832: I3 ^predict-no N987)
  12384. <=WM: (13819: N986 ^status complete)
  12385. <=WM: (13818: I3 ^predict-yes N986)
  12386. --- Firing Productions (IE) For State At Depth 1 ---
  12387. --- Inner Elaboration Phase, active level 1 (S1) ---
  12388. Firing monitor*world
  12389. -->
  12390. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12391. --- Change Working Memory (IE) ---
  12392. --- END Application Phase ---
  12393. --- Output Phase ---
  12394. ENV: Agent did: predict-no for direction R in state State-B
  12395. In State-B moving R
  12396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12397. predict error 0
  12398. dir: dir isL
  12399. --- END Output Phase ---
  12400. |\---- Input Phase ---
  12401. =>WM: (13836: I2 ^dir L)
  12402. =>WM: (13835: I2 ^reward 1)
  12403. =>WM: (13834: I2 ^see 0)
  12404. =>WM: (13833: N987 ^status complete)
  12405. <=WM: (13822: I2 ^dir R)
  12406. <=WM: (13821: I2 ^reward 1)
  12407. <=WM: (13820: I2 ^see 1)
  12408. =>WM: (13837: I2 ^level-1 R0-root)
  12409. <=WM: (13823: I2 ^level-1 R1-root)
  12410. --- END Input Phase ---
  12411. --- Proposal Phase ---
  12412. --- Inner Elaboration Phase, active level 1 (S1) ---
  12413. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  12414. -->
  12415. (S1 ^operator O1973 = 0.6597536139040494)
  12416. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  12417. -->
  12418. (S1 ^operator O1974 = 0.133561435542329)
  12419. Firing prefer*rvt*predict-no*H0*2*H1
  12420. -->
  12421. Firing prefer*rvt*predict-yes*H0*1*H1
  12422. -->
  12423. Firing elaborate*copy-see-to-output-link
  12424. -->
  12425. (I3 ^see 0 +)
  12426. Firing elaborate*reward*based*on*reward
  12427. -->
  12428. (R991 ^value 1 +)
  12429. (R1 ^reward R991 +)
  12430. Firing propose*predict-yes
  12431. -->
  12432. (O1975 ^name predict-yes +)
  12433. (S1 ^operator O1975 +)
  12434. Firing propose*predict-no
  12435. -->
  12436. (O1976 ^name predict-no +)
  12437. (S1 ^operator O1976 +)
  12438. Firing rl*prefer*rvt*predict-no*H0*2
  12439. -->
  12440. (S1 ^operator O1974 = 0.3212899096504038)
  12441. Firing rl*prefer*rvt*predict-yes*H0*1
  12442. -->
  12443. (S1 ^operator O1973 = 0.3402456242527754)
  12444. Firing prefer*rvt*predict-yes*H0
  12445. -->
  12446. Firing prefer*rvt*predict-no*H0
  12447. -->
  12448. Firing elaborate*copy-dir-to-output-link
  12449. -->
  12450. (I3 ^dir L +)
  12451. inner elaboration loop at bottom goal.
  12452. Retracting elaborate*copy-see-to-output-link
  12453. -->
  12454. (I3 ^see 1 +)
  12455. Retracting propose*predict-no
  12456. -->
  12457. (O1974 ^name predict-no +)
  12458. (S1 ^operator O1974 +)
  12459. Retracting propose*predict-yes
  12460. -->
  12461. (O1973 ^name predict-yes +)
  12462. (S1 ^operator O1973 +)
  12463. Retracting elaborate*reward*based*on*reward
  12464. -->
  12465. (R990 ^value 1 +)
  12466. (R1 ^reward R990 +)
  12467. Retracting elaborate*copy-dir-to-output-link
  12468. -->
  12469. (I3 ^dir R +)
  12470. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  12471. -->
  12472. (S1 ^operator O1974 = 0.600673964932706)
  12473. Retracting rl*prefer*rvt*predict-no*H0*6
  12474. -->
  12475. (S1 ^operator O1974 = 0.3993295877987267)
  12476. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  12477. -->
  12478. (S1 ^operator O1973 = 0.1602187148382515)
  12479. Retracting rl*prefer*rvt*predict-yes*H0*5
  12480. -->
  12481. (S1 ^operator O1973 = 0.112103955241023)
  12482. =>WM: (13845: S1 ^operator O1976 +)
  12483. =>WM: (13844: S1 ^operator O1975 +)
  12484. =>WM: (13843: I3 ^dir L)
  12485. =>WM: (13842: O1976 ^name predict-no)
  12486. =>WM: (13841: O1975 ^name predict-yes)
  12487. =>WM: (13840: R991 ^value 1)
  12488. =>WM: (13839: R1 ^reward R991)
  12489. =>WM: (13838: I3 ^see 0)
  12490. <=WM: (13829: S1 ^operator O1973 +)
  12491. <=WM: (13830: S1 ^operator O1974 +)
  12492. <=WM: (13831: S1 ^operator O1974)
  12493. <=WM: (13814: I3 ^dir R)
  12494. <=WM: (13825: R1 ^reward R990)
  12495. <=WM: (13824: I3 ^see 1)
  12496. <=WM: (13828: O1974 ^name predict-no)
  12497. <=WM: (13827: O1973 ^name predict-yes)
  12498. <=WM: (13826: R990 ^value 1)
  12499. --- Inner Elaboration Phase, active level 1 (S1) ---
  12500. Firing prefer*rvt*predict-yes*H0
  12501. -->
  12502. Firing rl*prefer*rvt*predict-yes*H0*1
  12503. -->
  12504. (S1 ^operator O1975 = 0.3402456242527754)
  12505. Firing prefer*rvt*predict-yes*H0*1*H1
  12506. -->
  12507. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  12508. -->
  12509. (S1 ^operator O1975 = 0.6597536139040494)
  12510. Firing prefer*rvt*predict-no*H0
  12511. -->
  12512. Firing rl*prefer*rvt*predict-no*H0*2
  12513. -->
  12514. (S1 ^operator O1976 = 0.3212899096504038)
  12515. Firing prefer*rvt*predict-no*H0*2*H1
  12516. -->
  12517. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  12518. -->
  12519. (S1 ^operator O1976 = 0.133561435542329)
  12520. inner elaboration loop at bottom goal.
  12521. Retracting rl*prefer*rvt*predict-no*H0*2
  12522. -->
  12523. (S1 ^operator O1974 = 0.3212899096504038)
  12524. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  12525. -->
  12526. (S1 ^operator O1974 = 0.133561435542329)
  12527. Retracting rl*prefer*rvt*predict-yes*H0*1
  12528. -->
  12529. (S1 ^operator O1973 = 0.3402456242527754)
  12530. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  12531. -->
  12532. (S1 ^operator O1973 = 0.6597536139040494)
  12533. --- END Proposal Phase ---
  12534. --- Decision Phase ---
  12535. RL update rl*prefer*rvt*predict-no*H0*6 0.558038 -0.158709 0.39933 -> 0.558038 -0.158709 0.399329(R,m,v=1,0.928144,0.0670947)
  12536. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441965 0.158709 0.600674 -> 0.441965 0.158709 0.600673(R,m,v=1,1,0)
  12537. =>WM: (13846: S1 ^operator O1975)
  12538. 988: O: O1975 (predict-yes)
  12539. --- END Decision Phase ---
  12540. --- Application Phase ---
  12541. --- Firing Productions (PE) For State At Depth 1 ---
  12542. --- Inner Elaboration Phase, active level 1 (S1) ---
  12543. Firing apply*operator
  12544. -->
  12545. (I3 ^predict-yes N988 + :O )
  12546. Firing apply*operator*complete
  12547. -->
  12548. (I3 ^predict-no N987 - :O )
  12549. inner elaboration loop at bottom goal.
  12550. --- Change Working Memory (PE) ---
  12551. =>WM: (13847: I3 ^predict-yes N988)
  12552. <=WM: (13833: N987 ^status complete)
  12553. <=WM: (13832: I3 ^predict-no N987)
  12554. --- Firing Productions (IE) For State At Depth 1 ---
  12555. --- Inner Elaboration Phase, active level 1 (S1) ---
  12556. Firing monitor*world
  12557. -->
  12558. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12559. --- Change Working Memory (IE) ---
  12560. --- END Application Phase ---
  12561. --- Output Phase ---
  12562. ENV: Agent did: predict-yes for direction L in state State-B
  12563. In State-B moving L
  12564. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12565. predict error 0
  12566. dir: dir isU
  12567. --- END Output Phase ---
  12568. /|--- Input Phase ---
  12569. =>WM: (13851: I2 ^dir U)
  12570. =>WM: (13850: I2 ^reward 1)
  12571. =>WM: (13849: I2 ^see 1)
  12572. =>WM: (13848: N988 ^status complete)
  12573. <=WM: (13836: I2 ^dir L)
  12574. <=WM: (13835: I2 ^reward 1)
  12575. <=WM: (13834: I2 ^see 0)
  12576. =>WM: (13852: I2 ^level-1 L1-root)
  12577. <=WM: (13837: I2 ^level-1 R0-root)
  12578. --- END Input Phase ---
  12579. --- Proposal Phase ---
  12580. --- Inner Elaboration Phase, active level 1 (S1) ---
  12581. Firing elaborate*copy-see-to-output-link
  12582. -->
  12583. (I3 ^see 1 +)
  12584. Firing elaborate*reward*based*on*reward
  12585. -->
  12586. (R992 ^value 1 +)
  12587. (R1 ^reward R992 +)
  12588. Firing propose*predict-yes
  12589. -->
  12590. (O1977 ^name predict-yes +)
  12591. (S1 ^operator O1977 +)
  12592. Firing propose*predict-no
  12593. -->
  12594. (O1978 ^name predict-no +)
  12595. (S1 ^operator O1978 +)
  12596. Firing rl*prefer*rvt*predict-no*H0*4
  12597. -->
  12598. (S1 ^operator O1976 = 0.9999999999999999)
  12599. Firing rl*prefer*rvt*predict-yes*H0*3
  12600. -->
  12601. (S1 ^operator O1975 = 0.)
  12602. Firing prefer*rvt*predict-yes*H0
  12603. -->
  12604. Firing prefer*rvt*predict-no*H0
  12605. -->
  12606. Firing elaborate*copy-dir-to-output-link
  12607. -->
  12608. (I3 ^dir U +)
  12609. inner elaboration loop at bottom goal.
  12610. Retracting elaborate*copy-see-to-output-link
  12611. -->
  12612. (I3 ^see 0 +)
  12613. Retracting propose*predict-no
  12614. -->
  12615. (O1976 ^name predict-no +)
  12616. (S1 ^operator O1976 +)
  12617. Retracting propose*predict-yes
  12618. -->
  12619. (O1975 ^name predict-yes +)
  12620. (S1 ^operator O1975 +)
  12621. Retracting elaborate*reward*based*on*reward
  12622. -->
  12623. (R991 ^value 1 +)
  12624. (R1 ^reward R991 +)
  12625. Retracting elaborate*copy-dir-to-output-link
  12626. -->
  12627. (I3 ^dir L +)
  12628. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  12629. -->
  12630. (S1 ^operator O1976 = 0.133561435542329)
  12631. Retracting rl*prefer*rvt*predict-no*H0*2
  12632. -->
  12633. (S1 ^operator O1976 = 0.3212899096504038)
  12634. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  12635. -->
  12636. (S1 ^operator O1975 = 0.6597536139040494)
  12637. Retracting rl*prefer*rvt*predict-yes*H0*1
  12638. -->
  12639. (S1 ^operator O1975 = 0.3402456242527754)
  12640. =>WM: (13860: S1 ^operator O1978 +)
  12641. =>WM: (13859: S1 ^operator O1977 +)
  12642. =>WM: (13858: I3 ^dir U)
  12643. =>WM: (13857: O1978 ^name predict-no)
  12644. =>WM: (13856: O1977 ^name predict-yes)
  12645. =>WM: (13855: R992 ^value 1)
  12646. =>WM: (13854: R1 ^reward R992)
  12647. =>WM: (13853: I3 ^see 1)
  12648. <=WM: (13844: S1 ^operator O1975 +)
  12649. <=WM: (13846: S1 ^operator O1975)
  12650. <=WM: (13845: S1 ^operator O1976 +)
  12651. <=WM: (13843: I3 ^dir L)
  12652. <=WM: (13839: R1 ^reward R991)
  12653. <=WM: (13838: I3 ^see 0)
  12654. <=WM: (13842: O1976 ^name predict-no)
  12655. <=WM: (13841: O1975 ^name predict-yes)
  12656. <=WM: (13840: R991 ^value 1)
  12657. --- Inner Elaboration Phase, active level 1 (S1) ---
  12658. Firing prefer*rvt*predict-yes*H0
  12659. -->
  12660. Firing rl*prefer*rvt*predict-yes*H0*3
  12661. -->
  12662. (S1 ^operator O1977 = 0.)
  12663. Firing prefer*rvt*predict-no*H0
  12664. -->
  12665. Firing rl*prefer*rvt*predict-no*H0*4
  12666. -->
  12667. (S1 ^operator O1978 = 0.9999999999999999)
  12668. inner elaboration loop at bottom goal.
  12669. Retracting rl*prefer*rvt*predict-no*H0*4
  12670. -->
  12671. (S1 ^operator O1976 = 0.9999999999999999)
  12672. Retracting rl*prefer*rvt*predict-yes*H0*3
  12673. -->
  12674. (S1 ^operator O1975 = 0.)
  12675. --- END Proposal Phase ---
  12676. --- Decision Phase ---
  12677. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.895706,0.0939938)
  12678. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422822 0.236932 0.659754 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
  12679. =>WM: (13861: S1 ^operator O1978)
  12680. 989: O: O1978 (predict-no)
  12681. --- END Decision Phase ---
  12682. --- Application Phase ---
  12683. --- Firing Productions (PE) For State At Depth 1 ---
  12684. --- Inner Elaboration Phase, active level 1 (S1) ---
  12685. Firing apply*operator
  12686. -->
  12687. (I3 ^predict-no N989 + :O )
  12688. Firing apply*operator*complete
  12689. -->
  12690. (I3 ^predict-yes N988 - :O )
  12691. inner elaboration loop at bottom goal.
  12692. --- Change Working Memory (PE) ---
  12693. =>WM: (13862: I3 ^predict-no N989)
  12694. <=WM: (13848: N988 ^status complete)
  12695. <=WM: (13847: I3 ^predict-yes N988)
  12696. --- Firing Productions (IE) For State At Depth 1 ---
  12697. --- Inner Elaboration Phase, active level 1 (S1) ---
  12698. Firing monitor*world
  12699. -->
  12700. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12701. --- Change Working Memory (IE) ---
  12702. --- END Application Phase ---
  12703. --- Output Phase ---
  12704. ENV: Agent did: predict-no for direction U in state State-A
  12705. In State-A moving U
  12706. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12707. predict error 0
  12708. dir: dir isR
  12709. --- END Output Phase ---
  12710. \-/--- Input Phase ---
  12711. =>WM: (13866: I2 ^dir R)
  12712. =>WM: (13865: I2 ^reward 1)
  12713. =>WM: (13864: I2 ^see 0)
  12714. =>WM: (13863: N989 ^status complete)
  12715. <=WM: (13851: I2 ^dir U)
  12716. <=WM: (13850: I2 ^reward 1)
  12717. <=WM: (13849: I2 ^see 1)
  12718. =>WM: (13867: I2 ^level-1 L1-root)
  12719. <=WM: (13852: I2 ^level-1 L1-root)
  12720. --- END Input Phase ---
  12721. --- Proposal Phase ---
  12722. --- Inner Elaboration Phase, active level 1 (S1) ---
  12723. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  12724. -->
  12725. (S1 ^operator O1977 = 0.8878995871389573)
  12726. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  12727. -->
  12728. (S1 ^operator O1978 = 0.02370016355578053)
  12729. Firing prefer*rvt*predict-no*H0*6*H1
  12730. -->
  12731. Firing prefer*rvt*predict-yes*H0*5*H1
  12732. -->
  12733. Firing elaborate*copy-see-to-output-link
  12734. -->
  12735. (I3 ^see 0 +)
  12736. Firing elaborate*reward*based*on*reward
  12737. -->
  12738. (R993 ^value 1 +)
  12739. (R1 ^reward R993 +)
  12740. Firing propose*predict-yes
  12741. -->
  12742. (O1979 ^name predict-yes +)
  12743. (S1 ^operator O1979 +)
  12744. Firing propose*predict-no
  12745. -->
  12746. (O1980 ^name predict-no +)
  12747. (S1 ^operator O1980 +)
  12748. Firing rl*prefer*rvt*predict-no*H0*6
  12749. -->
  12750. (S1 ^operator O1978 = 0.3993290548890118)
  12751. Firing rl*prefer*rvt*predict-yes*H0*5
  12752. -->
  12753. (S1 ^operator O1977 = 0.112103955241023)
  12754. Firing prefer*rvt*predict-yes*H0
  12755. -->
  12756. Firing prefer*rvt*predict-no*H0
  12757. -->
  12758. Firing elaborate*copy-dir-to-output-link
  12759. -->
  12760. (I3 ^dir R +)
  12761. inner elaboration loop at bottom goal.
  12762. Retracting elaborate*copy-see-to-output-link
  12763. -->
  12764. (I3 ^see 1 +)
  12765. Retracting propose*predict-no
  12766. -->
  12767. (O1978 ^name predict-no +)
  12768. (S1 ^operator O1978 +)
  12769. Retracting propose*predict-yes
  12770. -->
  12771. (O1977 ^name predict-yes +)
  12772. (S1 ^operator O1977 +)
  12773. Retracting elaborate*reward*based*on*reward
  12774. -->
  12775. (R992 ^value 1 +)
  12776. (R1 ^reward R992 +)
  12777. Retracting elaborate*copy-dir-to-output-link
  12778. -->
  12779. (I3 ^dir U +)
  12780. Retracting rl*prefer*rvt*predict-no*H0*4
  12781. -->
  12782. (S1 ^operator O1978 = 0.9999999999999999)
  12783. Retracting rl*prefer*rvt*predict-yes*H0*3
  12784. -->
  12785. (S1 ^operator O1977 = 0.)
  12786. =>WM: (13875: S1 ^operator O1980 +)
  12787. =>WM: (13874: S1 ^operator O1979 +)
  12788. =>WM: (13873: I3 ^dir R)
  12789. =>WM: (13872: O1980 ^name predict-no)
  12790. =>WM: (13871: O1979 ^name predict-yes)
  12791. =>WM: (13870: R993 ^value 1)
  12792. =>WM: (13869: R1 ^reward R993)
  12793. =>WM: (13868: I3 ^see 0)
  12794. <=WM: (13859: S1 ^operator O1977 +)
  12795. <=WM: (13860: S1 ^operator O1978 +)
  12796. <=WM: (13861: S1 ^operator O1978)
  12797. <=WM: (13858: I3 ^dir U)
  12798. <=WM: (13854: R1 ^reward R992)
  12799. <=WM: (13853: I3 ^see 1)
  12800. <=WM: (13857: O1978 ^name predict-no)
  12801. <=WM: (13856: O1977 ^name predict-yes)
  12802. <=WM: (13855: R992 ^value 1)
  12803. --- Inner Elaboration Phase, active level 1 (S1) ---
  12804. Firing prefer*rvt*predict-yes*H0
  12805. -->
  12806. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  12807. -->
  12808. (S1 ^operator O1979 = 0.8878995871389573)
  12809. Firing rl*prefer*rvt*predict-yes*H0*5
  12810. -->
  12811. (S1 ^operator O1979 = 0.112103955241023)
  12812. Firing prefer*rvt*predict-yes*H0*5*H1
  12813. -->
  12814. Firing prefer*rvt*predict-no*H0
  12815. -->
  12816. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  12817. -->
  12818. (S1 ^operator O1980 = 0.02370016355578053)
  12819. Firing rl*prefer*rvt*predict-no*H0*6
  12820. -->
  12821. (S1 ^operator O1980 = 0.3993290548890118)
  12822. Firing prefer*rvt*predict-no*H0*6*H1
  12823. -->
  12824. inner elaboration loop at bottom goal.
  12825. Retracting rl*prefer*rvt*predict-no*H0*6
  12826. -->
  12827. (S1 ^operator O1978 = 0.3993290548890118)
  12828. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  12829. -->
  12830. (S1 ^operator O1978 = 0.02370016355578053)
  12831. Retracting rl*prefer*rvt*predict-yes*H0*5
  12832. -->
  12833. (S1 ^operator O1977 = 0.112103955241023)
  12834. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  12835. -->
  12836. (S1 ^operator O1977 = 0.8878995871389573)
  12837. --- END Proposal Phase ---
  12838. --- Decision Phase ---
  12839. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12840. =>WM: (13876: S1 ^operator O1979)
  12841. 990: O: O1979 (predict-yes)
  12842. --- END Decision Phase ---
  12843. --- Application Phase ---
  12844. --- Firing Productions (PE) For State At Depth 1 ---
  12845. --- Inner Elaboration Phase, active level 1 (S1) ---
  12846. Firing apply*operator
  12847. -->
  12848. (I3 ^predict-yes N990 + :O )
  12849. Firing apply*operator*complete
  12850. -->
  12851. (I3 ^predict-no N989 - :O )
  12852. inner elaboration loop at bottom goal.
  12853. --- Change Working Memory (PE) ---
  12854. =>WM: (13877: I3 ^predict-yes N990)
  12855. <=WM: (13863: N989 ^status complete)
  12856. <=WM: (13862: I3 ^predict-no N989)
  12857. --- Firing Productions (IE) For State At Depth 1 ---
  12858. --- Inner Elaboration Phase, active level 1 (S1) ---
  12859. Firing monitor*world
  12860. -->
  12861. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12862. --- Change Working Memory (IE) ---
  12863. --- END Application Phase ---
  12864. --- Output Phase ---
  12865. ENV: Agent did: predict-yes for direction R in state State-A
  12866. In State-A moving R
  12867. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12868. predict error 0
  12869. dir: dir isU
  12870. --- END Output Phase ---
  12871. |\---- Input Phase ---
  12872. =>WM: (13881: I2 ^dir U)
  12873. =>WM: (13880: I2 ^reward 1)
  12874. =>WM: (13879: I2 ^see 1)
  12875. =>WM: (13878: N990 ^status complete)
  12876. <=WM: (13866: I2 ^dir R)
  12877. <=WM: (13865: I2 ^reward 1)
  12878. <=WM: (13864: I2 ^see 0)
  12879. =>WM: (13882: I2 ^level-1 R1-root)
  12880. <=WM: (13867: I2 ^level-1 L1-root)
  12881. --- END Input Phase ---
  12882. --- Proposal Phase ---
  12883. --- Inner Elaboration Phase, active level 1 (S1) ---
  12884. Firing elaborate*copy-see-to-output-link
  12885. -->
  12886. (I3 ^see 1 +)
  12887. Firing elaborate*reward*based*on*reward
  12888. -->
  12889. (R994 ^value 1 +)
  12890. (R1 ^reward R994 +)
  12891. Firing propose*predict-yes
  12892. -->
  12893. (O1981 ^name predict-yes +)
  12894. (S1 ^operator O1981 +)
  12895. Firing propose*predict-no
  12896. -->
  12897. (O1982 ^name predict-no +)
  12898. (S1 ^operator O1982 +)
  12899. Firing rl*prefer*rvt*predict-no*H0*4
  12900. -->
  12901. (S1 ^operator O1980 = 0.9999999999999999)
  12902. Firing rl*prefer*rvt*predict-yes*H0*3
  12903. -->
  12904. (S1 ^operator O1979 = 0.)
  12905. Firing prefer*rvt*predict-yes*H0
  12906. -->
  12907. Firing prefer*rvt*predict-no*H0
  12908. -->
  12909. Firing elaborate*copy-dir-to-output-link
  12910. -->
  12911. (I3 ^dir U +)
  12912. inner elaboration loop at bottom goal.
  12913. Retracting elaborate*copy-see-to-output-link
  12914. -->
  12915. (I3 ^see 0 +)
  12916. Retracting propose*predict-no
  12917. -->
  12918. (O1980 ^name predict-no +)
  12919. (S1 ^operator O1980 +)
  12920. Retracting propose*predict-yes
  12921. -->
  12922. (O1979 ^name predict-yes +)
  12923. (S1 ^operator O1979 +)
  12924. Retracting elaborate*reward*based*on*reward
  12925. -->
  12926. (R993 ^value 1 +)
  12927. (R1 ^reward R993 +)
  12928. Retracting elaborate*copy-dir-to-output-link
  12929. -->
  12930. (I3 ^dir R +)
  12931. Retracting rl*prefer*rvt*predict-no*H0*6
  12932. -->
  12933. (S1 ^operator O1980 = 0.3993290548890118)
  12934. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  12935. -->
  12936. (S1 ^operator O1980 = 0.02370016355578053)
  12937. Retracting rl*prefer*rvt*predict-yes*H0*5
  12938. -->
  12939. (S1 ^operator O1979 = 0.112103955241023)
  12940. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  12941. -->
  12942. (S1 ^operator O1979 = 0.8878995871389573)
  12943. =>WM: (13890: S1 ^operator O1982 +)
  12944. =>WM: (13889: S1 ^operator O1981 +)
  12945. =>WM: (13888: I3 ^dir U)
  12946. =>WM: (13887: O1982 ^name predict-no)
  12947. =>WM: (13886: O1981 ^name predict-yes)
  12948. =>WM: (13885: R994 ^value 1)
  12949. =>WM: (13884: R1 ^reward R994)
  12950. =>WM: (13883: I3 ^see 1)
  12951. <=WM: (13874: S1 ^operator O1979 +)
  12952. <=WM: (13876: S1 ^operator O1979)
  12953. <=WM: (13875: S1 ^operator O1980 +)
  12954. <=WM: (13873: I3 ^dir R)
  12955. <=WM: (13869: R1 ^reward R993)
  12956. <=WM: (13868: I3 ^see 0)
  12957. <=WM: (13872: O1980 ^name predict-no)
  12958. <=WM: (13871: O1979 ^name predict-yes)
  12959. <=WM: (13870: R993 ^value 1)
  12960. --- Inner Elaboration Phase, active level 1 (S1) ---
  12961. Firing prefer*rvt*predict-yes*H0
  12962. -->
  12963. Firing rl*prefer*rvt*predict-yes*H0*3
  12964. -->
  12965. (S1 ^operator O1981 = 0.)
  12966. Firing prefer*rvt*predict-no*H0
  12967. -->
  12968. Firing rl*prefer*rvt*predict-no*H0*4
  12969. -->
  12970. (S1 ^operator O1982 = 0.9999999999999999)
  12971. inner elaboration loop at bottom goal.
  12972. Retracting rl*prefer*rvt*predict-no*H0*4
  12973. -->
  12974. (S1 ^operator O1980 = 0.9999999999999999)
  12975. Retracting rl*prefer*rvt*predict-yes*H0*3
  12976. -->
  12977. (S1 ^operator O1979 = 0.)
  12978. --- END Proposal Phase ---
  12979. --- Decision Phase ---
  12980. RL update rl*prefer*rvt*predict-yes*H0*5 0.619027 -0.506923 0.112104 -> 0.619026 -0.506923 0.112103(R,m,v=1,0.900621,0.0900621)
  12981. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380978 0.506922 0.8879 -> 0.380977 0.506922 0.887899(R,m,v=1,1,0)
  12982. =>WM: (13891: S1 ^operator O1982)
  12983. 991: O: O1982 (predict-no)
  12984. --- END Decision Phase ---
  12985. --- Application Phase ---
  12986. --- Firing Productions (PE) For State At Depth 1 ---
  12987. --- Inner Elaboration Phase, active level 1 (S1) ---
  12988. Firing apply*operator
  12989. -->
  12990. (I3 ^predict-no N991 + :O )
  12991. Firing apply*operator*complete
  12992. -->
  12993. (I3 ^predict-yes N990 - :O )
  12994. inner elaboration loop at bottom goal.
  12995. --- Change Working Memory (PE) ---
  12996. =>WM: (13892: I3 ^predict-no N991)
  12997. <=WM: (13878: N990 ^status complete)
  12998. <=WM: (13877: I3 ^predict-yes N990)
  12999. --- Firing Productions (IE) For State At Depth 1 ---
  13000. --- Inner Elaboration Phase, active level 1 (S1) ---
  13001. Firing monitor*world
  13002. -->
  13003. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13004. --- Change Working Memory (IE) ---
  13005. --- END Application Phase ---
  13006. --- Output Phase ---
  13007. ENV: Agent did: predict-no for direction U in state State-B
  13008. In State-B moving U
  13009. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13010. predict error 0
  13011. dir: dir isU
  13012. --- END Output Phase ---
  13013. /--- Input Phase ---
  13014. =>WM: (13896: I2 ^dir U)
  13015. =>WM: (13895: I2 ^reward 1)
  13016. =>WM: (13894: I2 ^see 0)
  13017. =>WM: (13893: N991 ^status complete)
  13018. <=WM: (13881: I2 ^dir U)
  13019. <=WM: (13880: I2 ^reward 1)
  13020. <=WM: (13879: I2 ^see 1)
  13021. =>WM: (13897: I2 ^level-1 R1-root)
  13022. <=WM: (13882: I2 ^level-1 R1-root)
  13023. --- END Input Phase ---
  13024. --- Proposal Phase ---
  13025. --- Inner Elaboration Phase, active level 1 (S1) ---
  13026. Firing elaborate*copy-see-to-output-link
  13027. -->
  13028. (I3 ^see 0 +)
  13029. Firing elaborate*reward*based*on*reward
  13030. -->
  13031. (R995 ^value 1 +)
  13032. (R1 ^reward R995 +)
  13033. Firing propose*predict-yes
  13034. -->
  13035. (O1983 ^name predict-yes +)
  13036. (S1 ^operator O1983 +)
  13037. Firing propose*predict-no
  13038. -->
  13039. (O1984 ^name predict-no +)
  13040. (S1 ^operator O1984 +)
  13041. Firing rl*prefer*rvt*predict-no*H0*4
  13042. -->
  13043. (S1 ^operator O1982 = 0.9999999999999999)
  13044. Firing rl*prefer*rvt*predict-yes*H0*3
  13045. -->
  13046. (S1 ^operator O1981 = 0.)
  13047. Firing prefer*rvt*predict-yes*H0
  13048. -->
  13049. Firing prefer*rvt*predict-no*H0
  13050. -->
  13051. Firing elaborate*copy-dir-to-output-link
  13052. -->
  13053. (I3 ^dir U +)
  13054. inner elaboration loop at bottom goal.
  13055. Retracting elaborate*copy-see-to-output-link
  13056. -->
  13057. (I3 ^see 1 +)
  13058. Retracting propose*predict-no
  13059. -->
  13060. (O1982 ^name predict-no +)
  13061. (S1 ^operator O1982 +)
  13062. Retracting propose*predict-yes
  13063. -->
  13064. (O1981 ^name predict-yes +)
  13065. (S1 ^operator O1981 +)
  13066. Retracting elaborate*reward*based*on*reward
  13067. -->
  13068. (R994 ^value 1 +)
  13069. (R1 ^reward R994 +)
  13070. Retracting elaborate*copy-dir-to-output-link
  13071. -->
  13072. (I3 ^dir U +)
  13073. Retracting rl*prefer*rvt*predict-no*H0*4
  13074. -->
  13075. (S1 ^operator O1982 = 0.9999999999999999)
  13076. Retracting rl*prefer*rvt*predict-yes*H0*3
  13077. -->
  13078. (S1 ^operator O1981 = 0.)
  13079. =>WM: (13904: S1 ^operator O1984 +)
  13080. =>WM: (13903: S1 ^operator O1983 +)
  13081. =>WM: (13902: O1984 ^name predict-no)
  13082. =>WM: (13901: O1983 ^name predict-yes)
  13083. =>WM: (13900: R995 ^value 1)
  13084. =>WM: (13899: R1 ^reward R995)
  13085. =>WM: (13898: I3 ^see 0)
  13086. <=WM: (13889: S1 ^operator O1981 +)
  13087. <=WM: (13890: S1 ^operator O1982 +)
  13088. <=WM: (13891: S1 ^operator O1982)
  13089. <=WM: (13884: R1 ^reward R994)
  13090. <=WM: (13883: I3 ^see 1)
  13091. <=WM: (13887: O1982 ^name predict-no)
  13092. <=WM: (13886: O1981 ^name predict-yes)
  13093. <=WM: (13885: R994 ^value 1)
  13094. --- Inner Elaboration Phase, active level 1 (S1) ---
  13095. Firing prefer*rvt*predict-yes*H0
  13096. -->
  13097. Firing rl*prefer*rvt*predict-yes*H0*3
  13098. -->
  13099. (S1 ^operator O1983 = 0.)
  13100. Firing prefer*rvt*predict-no*H0
  13101. -->
  13102. Firing rl*prefer*rvt*predict-no*H0*4
  13103. -->
  13104. (S1 ^operator O1984 = 0.9999999999999999)
  13105. inner elaboration loop at bottom goal.
  13106. Retracting rl*prefer*rvt*predict-no*H0*4
  13107. -->
  13108. (S1 ^operator O1982 = 0.9999999999999999)
  13109. Retracting rl*prefer*rvt*predict-yes*H0*3
  13110. -->
  13111. (S1 ^operator O1981 = 0.)
  13112. --- END Proposal Phase ---
  13113. --- Decision Phase ---
  13114. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13115. =>WM: (13905: S1 ^operator O1984)
  13116. 992: O: O1984 (predict-no)
  13117. --- END Decision Phase ---
  13118. --- Application Phase ---
  13119. --- Firing Productions (PE) For State At Depth 1 ---
  13120. --- Inner Elaboration Phase, active level 1 (S1) ---
  13121. Firing apply*operator
  13122. -->
  13123. (I3 ^predict-no N992 + :O )
  13124. Firing apply*operator*complete
  13125. -->
  13126. (I3 ^predict-no N991 - :O )
  13127. inner elaboration loop at bottom goal.
  13128. --- Change Working Memory (PE) ---
  13129. =>WM: (13906: I3 ^predict-no N992)
  13130. <=WM: (13893: N991 ^status complete)
  13131. <=WM: (13892: I3 ^predict-no N991)
  13132. --- Firing Productions (IE) For State At Depth 1 ---
  13133. --- Inner Elaboration Phase, active level 1 (S1) ---
  13134. Firing monitor*world
  13135. -->
  13136. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13137. --- Change Working Memory (IE) ---
  13138. --- END Application Phase ---
  13139. --- Output Phase ---
  13140. ENV: Agent did: predict-no for direction U in state State-B
  13141. In State-B moving U
  13142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13143. predict error 0
  13144. dir: dir isL
  13145. --- END Output Phase ---
  13146. |\---- Input Phase ---
  13147. =>WM: (13910: I2 ^dir L)
  13148. =>WM: (13909: I2 ^reward 1)
  13149. =>WM: (13908: I2 ^see 0)
  13150. =>WM: (13907: N992 ^status complete)
  13151. <=WM: (13896: I2 ^dir U)
  13152. <=WM: (13895: I2 ^reward 1)
  13153. <=WM: (13894: I2 ^see 0)
  13154. =>WM: (13911: I2 ^level-1 R1-root)
  13155. <=WM: (13897: I2 ^level-1 R1-root)
  13156. --- END Input Phase ---
  13157. --- Proposal Phase ---
  13158. --- Inner Elaboration Phase, active level 1 (S1) ---
  13159. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  13160. -->
  13161. (S1 ^operator O1984 = 0.03900899329983293)
  13162. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  13163. -->
  13164. (S1 ^operator O1983 = 0.6597555366718975)
  13165. Firing prefer*rvt*predict-no*H0*2*H1
  13166. -->
  13167. Firing prefer*rvt*predict-yes*H0*1*H1
  13168. -->
  13169. Firing elaborate*copy-see-to-output-link
  13170. -->
  13171. (I3 ^see 0 +)
  13172. Firing elaborate*reward*based*on*reward
  13173. -->
  13174. (R996 ^value 1 +)
  13175. (R1 ^reward R996 +)
  13176. Firing propose*predict-yes
  13177. -->
  13178. (O1985 ^name predict-yes +)
  13179. (S1 ^operator O1985 +)
  13180. Firing propose*predict-no
  13181. -->
  13182. (O1986 ^name predict-no +)
  13183. (S1 ^operator O1986 +)
  13184. Firing rl*prefer*rvt*predict-no*H0*2
  13185. -->
  13186. (S1 ^operator O1984 = 0.3212899096504038)
  13187. Firing rl*prefer*rvt*predict-yes*H0*1
  13188. -->
  13189. (S1 ^operator O1983 = 0.3402457385292517)
  13190. Firing prefer*rvt*predict-yes*H0
  13191. -->
  13192. Firing prefer*rvt*predict-no*H0
  13193. -->
  13194. Firing elaborate*copy-dir-to-output-link
  13195. -->
  13196. (I3 ^dir L +)
  13197. inner elaboration loop at bottom goal.
  13198. Retracting elaborate*copy-see-to-output-link
  13199. -->
  13200. (I3 ^see 0 +)
  13201. Retracting propose*predict-no
  13202. -->
  13203. (O1984 ^name predict-no +)
  13204. (S1 ^operator O1984 +)
  13205. Retracting propose*predict-yes
  13206. -->
  13207. (O1983 ^name predict-yes +)
  13208. (S1 ^operator O1983 +)
  13209. Retracting elaborate*reward*based*on*reward
  13210. -->
  13211. (R995 ^value 1 +)
  13212. (R1 ^reward R995 +)
  13213. Retracting elaborate*copy-dir-to-output-link
  13214. -->
  13215. (I3 ^dir U +)
  13216. Retracting rl*prefer*rvt*predict-no*H0*4
  13217. -->
  13218. (S1 ^operator O1984 = 0.9999999999999999)
  13219. Retracting rl*prefer*rvt*predict-yes*H0*3
  13220. -->
  13221. (S1 ^operator O1983 = 0.)
  13222. =>WM: (13918: S1 ^operator O1986 +)
  13223. =>WM: (13917: S1 ^operator O1985 +)
  13224. =>WM: (13916: I3 ^dir L)
  13225. =>WM: (13915: O1986 ^name predict-no)
  13226. =>WM: (13914: O1985 ^name predict-yes)
  13227. =>WM: (13913: R996 ^value 1)
  13228. =>WM: (13912: R1 ^reward R996)
  13229. <=WM: (13903: S1 ^operator O1983 +)
  13230. <=WM: (13904: S1 ^operator O1984 +)
  13231. <=WM: (13905: S1 ^operator O1984)
  13232. <=WM: (13888: I3 ^dir U)
  13233. <=WM: (13899: R1 ^reward R995)
  13234. <=WM: (13902: O1984 ^name predict-no)
  13235. <=WM: (13901: O1983 ^name predict-yes)
  13236. <=WM: (13900: R995 ^value 1)
  13237. --- Inner Elaboration Phase, active level 1 (S1) ---
  13238. Firing prefer*rvt*predict-yes*H0
  13239. -->
  13240. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  13241. -->
  13242. (S1 ^operator O1985 = 0.6597555366718975)
  13243. Firing rl*prefer*rvt*predict-yes*H0*1
  13244. -->
  13245. (S1 ^operator O1985 = 0.3402457385292517)
  13246. Firing prefer*rvt*predict-yes*H0*1*H1
  13247. -->
  13248. Firing prefer*rvt*predict-no*H0
  13249. -->
  13250. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  13251. -->
  13252. (S1 ^operator O1986 = 0.03900899329983293)
  13253. Firing rl*prefer*rvt*predict-no*H0*2
  13254. -->
  13255. (S1 ^operator O1986 = 0.3212899096504038)
  13256. Firing prefer*rvt*predict-no*H0*2*H1
  13257. -->
  13258. inner elaboration loop at bottom goal.
  13259. Retracting rl*prefer*rvt*predict-no*H0*2
  13260. -->
  13261. (S1 ^operator O1984 = 0.3212899096504038)
  13262. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  13263. -->
  13264. (S1 ^operator O1984 = 0.03900899329983293)
  13265. Retracting rl*prefer*rvt*predict-yes*H0*1
  13266. -->
  13267. (S1 ^operator O1983 = 0.3402457385292517)
  13268. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  13269. -->
  13270. (S1 ^operator O1983 = 0.6597555366718975)
  13271. --- END Proposal Phase ---
  13272. --- Decision Phase ---
  13273. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13274. =>WM: (13919: S1 ^operator O1985)
  13275. 993: O: O1985 (predict-yes)
  13276. --- END Decision Phase ---
  13277. --- Application Phase ---
  13278. --- Firing Productions (PE) For State At Depth 1 ---
  13279. --- Inner Elaboration Phase, active level 1 (S1) ---
  13280. Firing apply*operator
  13281. -->
  13282. (I3 ^predict-yes N993 + :O )
  13283. Firing apply*operator*complete
  13284. -->
  13285. (I3 ^predict-no N992 - :O )
  13286. inner elaboration loop at bottom goal.
  13287. --- Change Working Memory (PE) ---
  13288. =>WM: (13920: I3 ^predict-yes N993)
  13289. <=WM: (13907: N992 ^status complete)
  13290. <=WM: (13906: I3 ^predict-no N992)
  13291. --- Firing Productions (IE) For State At Depth 1 ---
  13292. --- Inner Elaboration Phase, active level 1 (S1) ---
  13293. Firing monitor*world
  13294. -->
  13295. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13296. --- Change Working Memory (IE) ---
  13297. --- END Application Phase ---
  13298. --- Output Phase ---
  13299. ENV: Agent did: predict-yes for direction L in state State-B
  13300. In State-B moving L
  13301. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13302. predict error 0
  13303. dir: dir isR
  13304. --- END Output Phase ---
  13305. /|\--- Input Phase ---
  13306. =>WM: (13924: I2 ^dir R)
  13307. =>WM: (13923: I2 ^reward 1)
  13308. =>WM: (13922: I2 ^see 1)
  13309. =>WM: (13921: N993 ^status complete)
  13310. <=WM: (13910: I2 ^dir L)
  13311. <=WM: (13909: I2 ^reward 1)
  13312. <=WM: (13908: I2 ^see 0)
  13313. =>WM: (13925: I2 ^level-1 L1-root)
  13314. <=WM: (13911: I2 ^level-1 R1-root)
  13315. --- END Input Phase ---
  13316. --- Proposal Phase ---
  13317. --- Inner Elaboration Phase, active level 1 (S1) ---
  13318. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  13319. -->
  13320. (S1 ^operator O1985 = 0.8878990557819602)
  13321. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  13322. -->
  13323. (S1 ^operator O1986 = 0.02370016355578053)
  13324. Firing prefer*rvt*predict-no*H0*6*H1
  13325. -->
  13326. Firing prefer*rvt*predict-yes*H0*5*H1
  13327. -->
  13328. Firing elaborate*copy-see-to-output-link
  13329. -->
  13330. (I3 ^see 1 +)
  13331. Firing elaborate*reward*based*on*reward
  13332. -->
  13333. (R997 ^value 1 +)
  13334. (R1 ^reward R997 +)
  13335. Firing propose*predict-yes
  13336. -->
  13337. (O1987 ^name predict-yes +)
  13338. (S1 ^operator O1987 +)
  13339. Firing propose*predict-no
  13340. -->
  13341. (O1988 ^name predict-no +)
  13342. (S1 ^operator O1988 +)
  13343. Firing rl*prefer*rvt*predict-no*H0*6
  13344. -->
  13345. (S1 ^operator O1986 = 0.3993290548890118)
  13346. Firing rl*prefer*rvt*predict-yes*H0*5
  13347. -->
  13348. (S1 ^operator O1985 = 0.1121034238840259)
  13349. Firing prefer*rvt*predict-yes*H0
  13350. -->
  13351. Firing prefer*rvt*predict-no*H0
  13352. -->
  13353. Firing elaborate*copy-dir-to-output-link
  13354. -->
  13355. (I3 ^dir R +)
  13356. inner elaboration loop at bottom goal.
  13357. Retracting elaborate*copy-see-to-output-link
  13358. -->
  13359. (I3 ^see 0 +)
  13360. Retracting propose*predict-no
  13361. -->
  13362. (O1986 ^name predict-no +)
  13363. (S1 ^operator O1986 +)
  13364. Retracting propose*predict-yes
  13365. -->
  13366. (O1985 ^name predict-yes +)
  13367. (S1 ^operator O1985 +)
  13368. Retracting elaborate*reward*based*on*reward
  13369. -->
  13370. (R996 ^value 1 +)
  13371. (R1 ^reward R996 +)
  13372. Retracting elaborate*copy-dir-to-output-link
  13373. -->
  13374. (I3 ^dir L +)
  13375. Retracting rl*prefer*rvt*predict-no*H0*2
  13376. -->
  13377. (S1 ^operator O1986 = 0.3212899096504038)
  13378. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  13379. -->
  13380. (S1 ^operator O1986 = 0.03900899329983293)
  13381. Retracting rl*prefer*rvt*predict-yes*H0*1
  13382. -->
  13383. (S1 ^operator O1985 = 0.3402457385292517)
  13384. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  13385. -->
  13386. (S1 ^operator O1985 = 0.6597555366718975)
  13387. =>WM: (13933: S1 ^operator O1988 +)
  13388. =>WM: (13932: S1 ^operator O1987 +)
  13389. =>WM: (13931: I3 ^dir R)
  13390. =>WM: (13930: O1988 ^name predict-no)
  13391. =>WM: (13929: O1987 ^name predict-yes)
  13392. =>WM: (13928: R997 ^value 1)
  13393. =>WM: (13927: R1 ^reward R997)
  13394. =>WM: (13926: I3 ^see 1)
  13395. <=WM: (13917: S1 ^operator O1985 +)
  13396. <=WM: (13919: S1 ^operator O1985)
  13397. <=WM: (13918: S1 ^operator O1986 +)
  13398. <=WM: (13916: I3 ^dir L)
  13399. <=WM: (13912: R1 ^reward R996)
  13400. <=WM: (13898: I3 ^see 0)
  13401. <=WM: (13915: O1986 ^name predict-no)
  13402. <=WM: (13914: O1985 ^name predict-yes)
  13403. <=WM: (13913: R996 ^value 1)
  13404. --- Inner Elaboration Phase, active level 1 (S1) ---
  13405. Firing prefer*rvt*predict-yes*H0
  13406. -->
  13407. Firing rl*prefer*rvt*predict-yes*H0*5
  13408. -->
  13409. (S1 ^operator O1987 = 0.1121034238840259)
  13410. Firing prefer*rvt*predict-yes*H0*5*H1
  13411. -->
  13412. Firing rl*prefer*rvt*predict-yes*H0*5*H1*10
  13413. -->
  13414. (S1 ^operator O1987 = 0.8878990557819602)
  13415. Firing prefer*rvt*predict-no*H0
  13416. -->
  13417. Firing rl*prefer*rvt*predict-no*H0*6
  13418. -->
  13419. (S1 ^operator O1988 = 0.3993290548890118)
  13420. Firing prefer*rvt*predict-no*H0*6*H1
  13421. -->
  13422. Firing rl*prefer*rvt*predict-no*H0*6*H1*9
  13423. -->
  13424. (S1 ^operator O1988 = 0.02370016355578053)
  13425. inner elaboration loop at bottom goal.
  13426. Retracting rl*prefer*rvt*predict-no*H0*6
  13427. -->
  13428. (S1 ^operator O1986 = 0.3993290548890118)
  13429. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  13430. -->
  13431. (S1 ^operator O1986 = 0.02370016355578053)
  13432. Retracting rl*prefer*rvt*predict-yes*H0*5
  13433. -->
  13434. (S1 ^operator O1985 = 0.1121034238840259)
  13435. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  13436. -->
  13437. (S1 ^operator O1985 = 0.8878990557819602)
  13438. --- END Proposal Phase ---
  13439. --- Decision Phase ---
  13440. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.896341,0.0934835)
  13441. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659756 -> 0.422822 0.236933 0.659755(R,m,v=1,1,0)
  13442. =>WM: (13934: S1 ^operator O1987)
  13443. 994: O: O1987 (predict-yes)
  13444. --- END Decision Phase ---
  13445. --- Application Phase ---
  13446. --- Firing Productions (PE) For State At Depth 1 ---
  13447. --- Inner Elaboration Phase, active level 1 (S1) ---
  13448. Firing apply*operator
  13449. -->
  13450. (I3 ^predict-yes N994 + :O )
  13451. Firing apply*operator*complete
  13452. -->
  13453. (I3 ^predict-yes N993 - :O )
  13454. inner elaboration loop at bottom goal.
  13455. --- Change Working Memory (PE) ---
  13456. =>WM: (13935: I3 ^predict-yes N994)
  13457. <=WM: (13921: N993 ^status complete)
  13458. <=WM: (13920: I3 ^predict-yes N993)
  13459. --- Firing Productions (IE) For State At Depth 1 ---
  13460. --- Inner Elaboration Phase, active level 1 (S1) ---
  13461. Firing monitor*world
  13462. -->
  13463. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13464. --- Change Working Memory (IE) ---
  13465. --- END Application Phase ---
  13466. --- Output Phase ---
  13467. ENV: Agent did: predict-yes for direction R in state State-A
  13468. In State-A moving R
  13469. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13470. predict error 0
  13471. dir: dir isR
  13472. --- END Output Phase ---
  13473. -/|--- Input Phase ---
  13474. =>WM: (13939: I2 ^dir R)
  13475. =>WM: (13938: I2 ^reward 1)
  13476. =>WM: (13937: I2 ^see 1)
  13477. =>WM: (13936: N994 ^status complete)
  13478. <=WM: (13924: I2 ^dir R)
  13479. <=WM: (13923: I2 ^reward 1)
  13480. <=WM: (13922: I2 ^see 1)
  13481. =>WM: (13940: I2 ^level-1 R1-root)
  13482. <=WM: (13925: I2 ^level-1 L1-root)
  13483. --- END Input Phase ---
  13484. --- Proposal Phase ---
  13485. --- Inner Elaboration Phase, active level 1 (S1) ---
  13486. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  13487. -->
  13488. (S1 ^operator O1988 = 0.6006734320229912)
  13489. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  13490. -->
  13491. (S1 ^operator O1987 = 0.1602187148382515)
  13492. Firing prefer*rvt*predict-no*H0*6*H1
  13493. -->
  13494. Firing prefer*rvt*predict-yes*H0*5*H1
  13495. -->
  13496. Firing elaborate*copy-see-to-output-link
  13497. -->
  13498. (I3 ^see 1 +)
  13499. Firing elaborate*reward*based*on*reward
  13500. -->
  13501. (R998 ^value 1 +)
  13502. (R1 ^reward R998 +)
  13503. Firing propose*predict-yes
  13504. -->
  13505. (O1989 ^name predict-yes +)
  13506. (S1 ^operator O1989 +)
  13507. Firing propose*predict-no
  13508. -->
  13509. (O1990 ^name predict-no +)
  13510. (S1 ^operator O1990 +)
  13511. Firing rl*prefer*rvt*predict-no*H0*6
  13512. -->
  13513. (S1 ^operator O1988 = 0.3993290548890118)
  13514. Firing rl*prefer*rvt*predict-yes*H0*5
  13515. -->
  13516. (S1 ^operator O1987 = 0.1121034238840259)
  13517. Firing prefer*rvt*predict-yes*H0
  13518. -->
  13519. Firing prefer*rvt*predict-no*H0
  13520. -->
  13521. Firing elaborate*copy-dir-to-output-link
  13522. -->
  13523. (I3 ^dir R +)
  13524. inner elaboration loop at bottom goal.
  13525. Retracting elaborate*copy-see-to-output-link
  13526. -->
  13527. (I3 ^see 1 +)
  13528. Retracting propose*predict-no
  13529. -->
  13530. (O1988 ^name predict-no +)
  13531. (S1 ^operator O1988 +)
  13532. Retracting propose*predict-yes
  13533. -->
  13534. (O1987 ^name predict-yes +)
  13535. (S1 ^operator O1987 +)
  13536. Retracting elaborate*reward*based*on*reward
  13537. -->
  13538. (R997 ^value 1 +)
  13539. (R1 ^reward R997 +)
  13540. Retracting elaborate*copy-dir-to-output-link
  13541. -->
  13542. (I3 ^dir R +)
  13543. Retracting rl*prefer*rvt*predict-no*H0*6*H1*9
  13544. -->
  13545. (S1 ^operator O1988 = 0.02370016355578053)
  13546. Retracting rl*prefer*rvt*predict-no*H0*6
  13547. -->
  13548. (S1 ^operator O1988 = 0.3993290548890118)
  13549. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*10
  13550. -->
  13551. (S1 ^operator O1987 = 0.8878990557819602)
  13552. Retracting rl*prefer*rvt*predict-yes*H0*5
  13553. -->
  13554. (S1 ^operator O1987 = 0.1121034238840259)
  13555. =>WM: (13946: S1 ^operator O1990 +)
  13556. =>WM: (13945: S1 ^operator O1989 +)
  13557. =>WM: (13944: O1990 ^name predict-no)
  13558. =>WM: (13943: O1989 ^name predict-yes)
  13559. =>WM: (13942: R998 ^value 1)
  13560. =>WM: (13941: R1 ^reward R998)
  13561. <=WM: (13932: S1 ^operator O1987 +)
  13562. <=WM: (13934: S1 ^operator O1987)
  13563. <=WM: (13933: S1 ^operator O1988 +)
  13564. <=WM: (13927: R1 ^reward R997)
  13565. <=WM: (13930: O1988 ^name predict-no)
  13566. <=WM: (13929: O1987 ^name predict-yes)
  13567. <=WM: (13928: R997 ^value 1)
  13568. --- Inner Elaboration Phase, active level 1 (S1) ---
  13569. Firing prefer*rvt*predict-yes*H0
  13570. -->
  13571. Firing rl*prefer*rvt*predict-yes*H0*5
  13572. -->
  13573. (S1 ^operator O1989 = 0.1121034238840259)
  13574. Firing prefer*rvt*predict-yes*H0*5*H1
  13575. -->
  13576. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  13577. -->
  13578. (S1 ^operator O1989 = 0.1602187148382515)
  13579. Firing prefer*rvt*predict-no*H0
  13580. -->
  13581. Firing rl*prefer*rvt*predict-no*H0*6
  13582. -->
  13583. (S1 ^operator O1990 = 0.3993290548890118)
  13584. Firing prefer*rvt*predict-no*H0*6*H1
  13585. -->
  13586. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  13587. -->
  13588. (S1 ^operator O1990 = 0.6006734320229912)
  13589. inner elaboration loop at bottom goal.
  13590. Retracting rl*prefer*rvt*predict-no*H0*6
  13591. -->
  13592. (S1 ^operator O1988 = 0.3993290548890118)
  13593. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  13594. -->
  13595. (S1 ^operator O1988 = 0.6006734320229912)
  13596. Retracting rl*prefer*rvt*predict-yes*H0*5
  13597. -->
  13598. (S1 ^operator O1987 = 0.1121034238840259)
  13599. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  13600. -->
  13601. (S1 ^operator O1987 = 0.1602187148382515)
  13602. --- END Proposal Phase ---
  13603. --- Decision Phase ---
  13604. RL update rl*prefer*rvt*predict-yes*H0*5 0.619026 -0.506923 0.112103 -> 0.619026 -0.506923 0.112103(R,m,v=1,0.901235,0.0895637)
  13605. RL update rl*prefer*rvt*predict-yes*H0*5*H1*10 0.380977 0.506922 0.887899 -> 0.380976 0.506922 0.887899(R,m,v=1,1,0)
  13606. =>WM: (13947: S1 ^operator O1990)
  13607. 995: O: O1990 (predict-no)
  13608. --- END Decision Phase ---
  13609. --- Application Phase ---
  13610. --- Firing Productions (PE) For State At Depth 1 ---
  13611. --- Inner Elaboration Phase, active level 1 (S1) ---
  13612. Firing apply*operator
  13613. -->
  13614. (I3 ^predict-no N995 + :O )
  13615. Firing apply*operator*complete
  13616. -->
  13617. (I3 ^predict-yes N994 - :O )
  13618. inner elaboration loop at bottom goal.
  13619. --- Change Working Memory (PE) ---
  13620. =>WM: (13948: I3 ^predict-no N995)
  13621. <=WM: (13936: N994 ^status complete)
  13622. <=WM: (13935: I3 ^predict-yes N994)
  13623. --- Firing Productions (IE) For State At Depth 1 ---
  13624. --- Inner Elaboration Phase, active level 1 (S1) ---
  13625. Firing monitor*world
  13626. -->
  13627. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13628. --- Change Working Memory (IE) ---
  13629. --- END Application Phase ---
  13630. --- Output Phase ---
  13631. ENV: Agent did: predict-no for direction R in state State-B
  13632. In State-B moving R
  13633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13634. predict error 0
  13635. dir: dir isU
  13636. --- END Output Phase ---
  13637. \---- Input Phase ---
  13638. =>WM: (13952: I2 ^dir U)
  13639. =>WM: (13951: I2 ^reward 1)
  13640. =>WM: (13950: I2 ^see 0)
  13641. =>WM: (13949: N995 ^status complete)
  13642. <=WM: (13939: I2 ^dir R)
  13643. <=WM: (13938: I2 ^reward 1)
  13644. <=WM: (13937: I2 ^see 1)
  13645. =>WM: (13953: I2 ^level-1 R0-root)
  13646. <=WM: (13940: I2 ^level-1 R1-root)
  13647. --- END Input Phase ---
  13648. --- Proposal Phase ---
  13649. --- Inner Elaboration Phase, active level 1 (S1) ---
  13650. Firing elaborate*copy-see-to-output-link
  13651. -->
  13652. (I3 ^see 0 +)
  13653. Firing elaborate*reward*based*on*reward
  13654. -->
  13655. (R999 ^value 1 +)
  13656. (R1 ^reward R999 +)
  13657. Firing propose*predict-yes
  13658. -->
  13659. (O1991 ^name predict-yes +)
  13660. (S1 ^operator O1991 +)
  13661. Firing propose*predict-no
  13662. -->
  13663. (O1992 ^name predict-no +)
  13664. (S1 ^operator O1992 +)
  13665. Firing rl*prefer*rvt*predict-no*H0*4
  13666. -->
  13667. (S1 ^operator O1990 = 0.9999999999999999)
  13668. Firing rl*prefer*rvt*predict-yes*H0*3
  13669. -->
  13670. (S1 ^operator O1989 = 0.)
  13671. Firing prefer*rvt*predict-yes*H0
  13672. -->
  13673. Firing prefer*rvt*predict-no*H0
  13674. -->
  13675. Firing elaborate*copy-dir-to-output-link
  13676. -->
  13677. (I3 ^dir U +)
  13678. inner elaboration loop at bottom goal.
  13679. Retracting elaborate*copy-see-to-output-link
  13680. -->
  13681. (I3 ^see 1 +)
  13682. Retracting propose*predict-no
  13683. -->
  13684. (O1990 ^name predict-no +)
  13685. (S1 ^operator O1990 +)
  13686. Retracting propose*predict-yes
  13687. -->
  13688. (O1989 ^name predict-yes +)
  13689. (S1 ^operator O1989 +)
  13690. Retracting elaborate*reward*based*on*reward
  13691. -->
  13692. (R998 ^value 1 +)
  13693. (R1 ^reward R998 +)
  13694. Retracting elaborate*copy-dir-to-output-link
  13695. -->
  13696. (I3 ^dir R +)
  13697. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  13698. -->
  13699. (S1 ^operator O1990 = 0.6006734320229912)
  13700. Retracting rl*prefer*rvt*predict-no*H0*6
  13701. -->
  13702. (S1 ^operator O1990 = 0.3993290548890118)
  13703. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  13704. -->
  13705. (S1 ^operator O1989 = 0.1602187148382515)
  13706. Retracting rl*prefer*rvt*predict-yes*H0*5
  13707. -->
  13708. (S1 ^operator O1989 = 0.1121030519341281)
  13709. =>WM: (13961: S1 ^operator O1992 +)
  13710. =>WM: (13960: S1 ^operator O1991 +)
  13711. =>WM: (13959: I3 ^dir U)
  13712. =>WM: (13958: O1992 ^name predict-no)
  13713. =>WM: (13957: O1991 ^name predict-yes)
  13714. =>WM: (13956: R999 ^value 1)
  13715. =>WM: (13955: R1 ^reward R999)
  13716. =>WM: (13954: I3 ^see 0)
  13717. <=WM: (13945: S1 ^operator O1989 +)
  13718. <=WM: (13946: S1 ^operator O1990 +)
  13719. <=WM: (13947: S1 ^operator O1990)
  13720. <=WM: (13931: I3 ^dir R)
  13721. <=WM: (13941: R1 ^reward R998)
  13722. <=WM: (13926: I3 ^see 1)
  13723. <=WM: (13944: O1990 ^name predict-no)
  13724. <=WM: (13943: O1989 ^name predict-yes)
  13725. <=WM: (13942: R998 ^value 1)
  13726. --- Inner Elaboration Phase, active level 1 (S1) ---
  13727. Firing prefer*rvt*predict-yes*H0
  13728. -->
  13729. Firing rl*prefer*rvt*predict-yes*H0*3
  13730. -->
  13731. (S1 ^operator O1991 = 0.)
  13732. Firing prefer*rvt*predict-no*H0
  13733. -->
  13734. Firing rl*prefer*rvt*predict-no*H0*4
  13735. -->
  13736. (S1 ^operator O1992 = 0.9999999999999999)
  13737. inner elaboration loop at bottom goal.
  13738. Retracting rl*prefer*rvt*predict-no*H0*4
  13739. -->
  13740. (S1 ^operator O1990 = 0.9999999999999999)
  13741. Retracting rl*prefer*rvt*predict-yes*H0*3
  13742. -->
  13743. (S1 ^operator O1989 = 0.)
  13744. --- END Proposal Phase ---
  13745. --- Decision Phase ---
  13746. RL update rl*prefer*rvt*predict-no*H0*6 0.558038 -0.158709 0.399329 -> 0.558037 -0.158709 0.399329(R,m,v=1,0.928571,0.0667237)
  13747. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441965 0.158709 0.600673 -> 0.441964 0.158709 0.600673(R,m,v=1,1,0)
  13748. =>WM: (13962: S1 ^operator O1992)
  13749. 996: O: O1992 (predict-no)
  13750. --- END Decision Phase ---
  13751. --- Application Phase ---
  13752. --- Firing Productions (PE) For State At Depth 1 ---
  13753. --- Inner Elaboration Phase, active level 1 (S1) ---
  13754. Firing apply*operator
  13755. -->
  13756. (I3 ^predict-no N996 + :O )
  13757. Firing apply*operator*complete
  13758. -->
  13759. (I3 ^predict-no N995 - :O )
  13760. inner elaboration loop at bottom goal.
  13761. --- Change Working Memory (PE) ---
  13762. =>WM: (13963: I3 ^predict-no N996)
  13763. <=WM: (13949: N995 ^status complete)
  13764. <=WM: (13948: I3 ^predict-no N995)
  13765. --- Firing Productions (IE) For State At Depth 1 ---
  13766. --- Inner Elaboration Phase, active level 1 (S1) ---
  13767. Firing monitor*world
  13768. -->
  13769. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13770. --- Change Working Memory (IE) ---
  13771. --- END Application Phase ---
  13772. --- Output Phase ---
  13773. ENV: Agent did: predict-no for direction U in state State-B
  13774. In State-B moving U
  13775. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13776. predict error 0
  13777. dir: dir isU
  13778. --- END Output Phase ---
  13779. /|\--- Input Phase ---
  13780. =>WM: (13967: I2 ^dir U)
  13781. =>WM: (13966: I2 ^reward 1)
  13782. =>WM: (13965: I2 ^see 0)
  13783. =>WM: (13964: N996 ^status complete)
  13784. <=WM: (13952: I2 ^dir U)
  13785. <=WM: (13951: I2 ^reward 1)
  13786. <=WM: (13950: I2 ^see 0)
  13787. =>WM: (13968: I2 ^level-1 R0-root)
  13788. <=WM: (13953: I2 ^level-1 R0-root)
  13789. --- END Input Phase ---
  13790. --- Proposal Phase ---
  13791. --- Inner Elaboration Phase, active level 1 (S1) ---
  13792. Firing elaborate*copy-see-to-output-link
  13793. -->
  13794. (I3 ^see 0 +)
  13795. Firing elaborate*reward*based*on*reward
  13796. -->
  13797. (R1000 ^value 1 +)
  13798. (R1 ^reward R1000 +)
  13799. Firing propose*predict-yes
  13800. -->
  13801. (O1993 ^name predict-yes +)
  13802. (S1 ^operator O1993 +)
  13803. Firing propose*predict-no
  13804. -->
  13805. (O1994 ^name predict-no +)
  13806. (S1 ^operator O1994 +)
  13807. Firing rl*prefer*rvt*predict-no*H0*4
  13808. -->
  13809. (S1 ^operator O1992 = 0.9999999999999999)
  13810. Firing rl*prefer*rvt*predict-yes*H0*3
  13811. -->
  13812. (S1 ^operator O1991 = 0.)
  13813. Firing prefer*rvt*predict-yes*H0
  13814. -->
  13815. Firing prefer*rvt*predict-no*H0
  13816. -->
  13817. Firing elaborate*copy-dir-to-output-link
  13818. -->
  13819. (I3 ^dir U +)
  13820. inner elaboration loop at bottom goal.
  13821. Retracting elaborate*copy-see-to-output-link
  13822. -->
  13823. (I3 ^see 0 +)
  13824. Retracting propose*predict-no
  13825. -->
  13826. (O1992 ^name predict-no +)
  13827. (S1 ^operator O1992 +)
  13828. Retracting propose*predict-yes
  13829. -->
  13830. (O1991 ^name predict-yes +)
  13831. (S1 ^operator O1991 +)
  13832. Retracting elaborate*reward*based*on*reward
  13833. -->
  13834. (R999 ^value 1 +)
  13835. (R1 ^reward R999 +)
  13836. Retracting elaborate*copy-dir-to-output-link
  13837. -->
  13838. (I3 ^dir U +)
  13839. Retracting rl*prefer*rvt*predict-no*H0*4
  13840. -->
  13841. (S1 ^operator O1992 = 0.9999999999999999)
  13842. Retracting rl*prefer*rvt*predict-yes*H0*3
  13843. -->
  13844. (S1 ^operator O1991 = 0.)
  13845. =>WM: (13974: S1 ^operator O1994 +)
  13846. =>WM: (13973: S1 ^operator O1993 +)
  13847. =>WM: (13972: O1994 ^name predict-no)
  13848. =>WM: (13971: O1993 ^name predict-yes)
  13849. =>WM: (13970: R1000 ^value 1)
  13850. =>WM: (13969: R1 ^reward R1000)
  13851. <=WM: (13960: S1 ^operator O1991 +)
  13852. <=WM: (13961: S1 ^operator O1992 +)
  13853. <=WM: (13962: S1 ^operator O1992)
  13854. <=WM: (13955: R1 ^reward R999)
  13855. <=WM: (13958: O1992 ^name predict-no)
  13856. <=WM: (13957: O1991 ^name predict-yes)
  13857. <=WM: (13956: R999 ^value 1)
  13858. --- Inner Elaboration Phase, active level 1 (S1) ---
  13859. Firing prefer*rvt*predict-yes*H0
  13860. -->
  13861. Firing rl*prefer*rvt*predict-yes*H0*3
  13862. -->
  13863. (S1 ^operator O1993 = 0.)
  13864. Firing prefer*rvt*predict-no*H0
  13865. -->
  13866. Firing rl*prefer*rvt*predict-no*H0*4
  13867. -->
  13868. (S1 ^operator O1994 = 0.9999999999999999)
  13869. inner elaboration loop at bottom goal.
  13870. Retracting rl*prefer*rvt*predict-no*H0*4
  13871. -->
  13872. (S1 ^operator O1992 = 0.9999999999999999)
  13873. Retracting rl*prefer*rvt*predict-yes*H0*3
  13874. -->
  13875. (S1 ^operator O1991 = 0.)
  13876. --- END Proposal Phase ---
  13877. --- Decision Phase ---
  13878. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13879. =>WM: (13975: S1 ^operator O1994)
  13880. 997: O: O1994 (predict-no)
  13881. --- END Decision Phase ---
  13882. --- Application Phase ---
  13883. --- Firing Productions (PE) For State At Depth 1 ---
  13884. --- Inner Elaboration Phase, active level 1 (S1) ---
  13885. Firing apply*operator
  13886. -->
  13887. (I3 ^predict-no N997 + :O )
  13888. Firing apply*operator*complete
  13889. -->
  13890. (I3 ^predict-no N996 - :O )
  13891. inner elaboration loop at bottom goal.
  13892. --- Change Working Memory (PE) ---
  13893. =>WM: (13976: I3 ^predict-no N997)
  13894. <=WM: (13964: N996 ^status complete)
  13895. <=WM: (13963: I3 ^predict-no N996)
  13896. --- Firing Productions (IE) For State At Depth 1 ---
  13897. --- Inner Elaboration Phase, active level 1 (S1) ---
  13898. Firing monitor*world
  13899. -->
  13900. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13901. --- Change Working Memory (IE) ---
  13902. --- END Application Phase ---
  13903. --- Output Phase ---
  13904. ENV: Agent did: predict-no for direction U in state State-B
  13905. In State-B moving U
  13906. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13907. predict error 0
  13908. dir: dir isL
  13909. --- END Output Phase ---
  13910. -/|--- Input Phase ---
  13911. =>WM: (13980: I2 ^dir L)
  13912. =>WM: (13979: I2 ^reward 1)
  13913. =>WM: (13978: I2 ^see 0)
  13914. =>WM: (13977: N997 ^status complete)
  13915. <=WM: (13967: I2 ^dir U)
  13916. <=WM: (13966: I2 ^reward 1)
  13917. <=WM: (13965: I2 ^see 0)
  13918. =>WM: (13981: I2 ^level-1 R0-root)
  13919. <=WM: (13968: I2 ^level-1 R0-root)
  13920. --- END Input Phase ---
  13921. --- Proposal Phase ---
  13922. --- Inner Elaboration Phase, active level 1 (S1) ---
  13923. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  13924. -->
  13925. (S1 ^operator O1993 = 0.6597537281805257)
  13926. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  13927. -->
  13928. (S1 ^operator O1994 = 0.133561435542329)
  13929. Firing prefer*rvt*predict-no*H0*2*H1
  13930. -->
  13931. Firing prefer*rvt*predict-yes*H0*1*H1
  13932. -->
  13933. Firing elaborate*copy-see-to-output-link
  13934. -->
  13935. (I3 ^see 0 +)
  13936. Firing elaborate*reward*based*on*reward
  13937. -->
  13938. (R1001 ^value 1 +)
  13939. (R1 ^reward R1001 +)
  13940. Firing propose*predict-yes
  13941. -->
  13942. (O1995 ^name predict-yes +)
  13943. (S1 ^operator O1995 +)
  13944. Firing propose*predict-no
  13945. -->
  13946. (O1996 ^name predict-no +)
  13947. (S1 ^operator O1996 +)
  13948. Firing rl*prefer*rvt*predict-no*H0*2
  13949. -->
  13950. (S1 ^operator O1994 = 0.3212899096504038)
  13951. Firing rl*prefer*rvt*predict-yes*H0*1
  13952. -->
  13953. (S1 ^operator O1993 = 0.3402455472490794)
  13954. Firing prefer*rvt*predict-yes*H0
  13955. -->
  13956. Firing prefer*rvt*predict-no*H0
  13957. -->
  13958. Firing elaborate*copy-dir-to-output-link
  13959. -->
  13960. (I3 ^dir L +)
  13961. inner elaboration loop at bottom goal.
  13962. Retracting elaborate*copy-see-to-output-link
  13963. -->
  13964. (I3 ^see 0 +)
  13965. Retracting propose*predict-no
  13966. -->
  13967. (O1994 ^name predict-no +)
  13968. (S1 ^operator O1994 +)
  13969. Retracting propose*predict-yes
  13970. -->
  13971. (O1993 ^name predict-yes +)
  13972. (S1 ^operator O1993 +)
  13973. Retracting elaborate*reward*based*on*reward
  13974. -->
  13975. (R1000 ^value 1 +)
  13976. (R1 ^reward R1000 +)
  13977. Retracting elaborate*copy-dir-to-output-link
  13978. -->
  13979. (I3 ^dir U +)
  13980. Retracting rl*prefer*rvt*predict-no*H0*4
  13981. -->
  13982. (S1 ^operator O1994 = 0.9999999999999999)
  13983. Retracting rl*prefer*rvt*predict-yes*H0*3
  13984. -->
  13985. (S1 ^operator O1993 = 0.)
  13986. =>WM: (13988: S1 ^operator O1996 +)
  13987. =>WM: (13987: S1 ^operator O1995 +)
  13988. =>WM: (13986: I3 ^dir L)
  13989. =>WM: (13985: O1996 ^name predict-no)
  13990. =>WM: (13984: O1995 ^name predict-yes)
  13991. =>WM: (13983: R1001 ^value 1)
  13992. =>WM: (13982: R1 ^reward R1001)
  13993. <=WM: (13973: S1 ^operator O1993 +)
  13994. <=WM: (13974: S1 ^operator O1994 +)
  13995. <=WM: (13975: S1 ^operator O1994)
  13996. <=WM: (13959: I3 ^dir U)
  13997. <=WM: (13969: R1 ^reward R1000)
  13998. <=WM: (13972: O1994 ^name predict-no)
  13999. <=WM: (13971: O1993 ^name predict-yes)
  14000. <=WM: (13970: R1000 ^value 1)
  14001. --- Inner Elaboration Phase, active level 1 (S1) ---
  14002. Firing prefer*rvt*predict-yes*H0
  14003. -->
  14004. Firing rl*prefer*rvt*predict-yes*H0*1*H1*16
  14005. -->
  14006. (S1 ^operator O1995 = 0.6597537281805257)
  14007. Firing rl*prefer*rvt*predict-yes*H0*1
  14008. -->
  14009. (S1 ^operator O1995 = 0.3402455472490794)
  14010. Firing prefer*rvt*predict-yes*H0*1*H1
  14011. -->
  14012. Firing prefer*rvt*predict-no*H0
  14013. -->
  14014. Firing rl*prefer*rvt*predict-no*H0*2*H1*15
  14015. -->
  14016. (S1 ^operator O1996 = 0.133561435542329)
  14017. Firing rl*prefer*rvt*predict-no*H0*2
  14018. -->
  14019. (S1 ^operator O1996 = 0.3212899096504038)
  14020. Firing prefer*rvt*predict-no*H0*2*H1
  14021. -->
  14022. inner elaboration loop at bottom goal.
  14023. Retracting rl*prefer*rvt*predict-no*H0*2
  14024. -->
  14025. (S1 ^operator O1994 = 0.3212899096504038)
  14026. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  14027. -->
  14028. (S1 ^operator O1994 = 0.133561435542329)
  14029. Retracting rl*prefer*rvt*predict-yes*H0*1
  14030. -->
  14031. (S1 ^operator O1993 = 0.3402455472490794)
  14032. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  14033. -->
  14034. (S1 ^operator O1993 = 0.6597537281805257)
  14035. --- END Proposal Phase ---
  14036. --- Decision Phase ---
  14037. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14038. =>WM: (13989: S1 ^operator O1995)
  14039. 998: O: O1995 (predict-yes)
  14040. --- END Decision Phase ---
  14041. --- Application Phase ---
  14042. --- Firing Productions (PE) For State At Depth 1 ---
  14043. --- Inner Elaboration Phase, active level 1 (S1) ---
  14044. Firing apply*operator
  14045. -->
  14046. (I3 ^predict-yes N998 + :O )
  14047. Firing apply*operator*complete
  14048. -->
  14049. (I3 ^predict-no N997 - :O )
  14050. inner elaboration loop at bottom goal.
  14051. --- Change Working Memory (PE) ---
  14052. =>WM: (13990: I3 ^predict-yes N998)
  14053. <=WM: (13977: N997 ^status complete)
  14054. <=WM: (13976: I3 ^predict-no N997)
  14055. --- Firing Productions (IE) For State At Depth 1 ---
  14056. --- Inner Elaboration Phase, active level 1 (S1) ---
  14057. Firing monitor*world
  14058. -->
  14059. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14060. --- Change Working Memory (IE) ---
  14061. --- END Application Phase ---
  14062. --- Output Phase ---
  14063. ENV: Agent did: predict-yes for direction L in state State-B
  14064. In State-B moving L
  14065. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14066. predict error 0
  14067. dir: dir isL
  14068. --- END Output Phase ---
  14069. \-/--- Input Phase ---
  14070. =>WM: (13994: I2 ^dir L)
  14071. =>WM: (13993: I2 ^reward 1)
  14072. =>WM: (13992: I2 ^see 1)
  14073. =>WM: (13991: N998 ^status complete)
  14074. <=WM: (13980: I2 ^dir L)
  14075. <=WM: (13979: I2 ^reward 1)
  14076. <=WM: (13978: I2 ^see 0)
  14077. =>WM: (13995: I2 ^level-1 L1-root)
  14078. <=WM: (13981: I2 ^level-1 R0-root)
  14079. --- END Input Phase ---
  14080. --- Proposal Phase ---
  14081. --- Inner Elaboration Phase, active level 1 (S1) ---
  14082. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  14083. -->
  14084. (S1 ^operator O1995 = 0.02884852834965246)
  14085. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  14086. -->
  14087. (S1 ^operator O1996 = 0.678736816647851)
  14088. Firing prefer*rvt*predict-no*H0*2*H1
  14089. -->
  14090. Firing prefer*rvt*predict-yes*H0*1*H1
  14091. -->
  14092. Firing elaborate*copy-see-to-output-link
  14093. -->
  14094. (I3 ^see 1 +)
  14095. Firing elaborate*reward*based*on*reward
  14096. -->
  14097. (R1002 ^value 1 +)
  14098. (R1 ^reward R1002 +)
  14099. Firing propose*predict-yes
  14100. -->
  14101. (O1997 ^name predict-yes +)
  14102. (S1 ^operator O1997 +)
  14103. Firing propose*predict-no
  14104. -->
  14105. (O1998 ^name predict-no +)
  14106. (S1 ^operator O1998 +)
  14107. Firing rl*prefer*rvt*predict-no*H0*2
  14108. -->
  14109. (S1 ^operator O1996 = 0.3212899096504038)
  14110. Firing rl*prefer*rvt*predict-yes*H0*1
  14111. -->
  14112. (S1 ^operator O1995 = 0.3402455472490794)
  14113. Firing prefer*rvt*predict-yes*H0
  14114. -->
  14115. Firing prefer*rvt*predict-no*H0
  14116. -->
  14117. Firing elaborate*copy-dir-to-output-link
  14118. -->
  14119. (I3 ^dir L +)
  14120. inner elaboration loop at bottom goal.
  14121. Retracting elaborate*copy-see-to-output-link
  14122. -->
  14123. (I3 ^see 0 +)
  14124. Retracting propose*predict-no
  14125. -->
  14126. (O1996 ^name predict-no +)
  14127. (S1 ^operator O1996 +)
  14128. Retracting propose*predict-yes
  14129. -->
  14130. (O1995 ^name predict-yes +)
  14131. (S1 ^operator O1995 +)
  14132. Retracting elaborate*reward*based*on*reward
  14133. -->
  14134. (R1001 ^value 1 +)
  14135. (R1 ^reward R1001 +)
  14136. Retracting elaborate*copy-dir-to-output-link
  14137. -->
  14138. (I3 ^dir L +)
  14139. Retracting rl*prefer*rvt*predict-no*H0*2
  14140. -->
  14141. (S1 ^operator O1996 = 0.3212899096504038)
  14142. Retracting rl*prefer*rvt*predict-no*H0*2*H1*15
  14143. -->
  14144. (S1 ^operator O1996 = 0.133561435542329)
  14145. Retracting rl*prefer*rvt*predict-yes*H0*1
  14146. -->
  14147. (S1 ^operator O1995 = 0.3402455472490794)
  14148. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*16
  14149. -->
  14150. (S1 ^operator O1995 = 0.6597537281805257)
  14151. =>WM: (14002: S1 ^operator O1998 +)
  14152. =>WM: (14001: S1 ^operator O1997 +)
  14153. =>WM: (14000: O1998 ^name predict-no)
  14154. =>WM: (13999: O1997 ^name predict-yes)
  14155. =>WM: (13998: R1002 ^value 1)
  14156. =>WM: (13997: R1 ^reward R1002)
  14157. =>WM: (13996: I3 ^see 1)
  14158. <=WM: (13987: S1 ^operator O1995 +)
  14159. <=WM: (13989: S1 ^operator O1995)
  14160. <=WM: (13988: S1 ^operator O1996 +)
  14161. <=WM: (13982: R1 ^reward R1001)
  14162. <=WM: (13954: I3 ^see 0)
  14163. <=WM: (13985: O1996 ^name predict-no)
  14164. <=WM: (13984: O1995 ^name predict-yes)
  14165. <=WM: (13983: R1001 ^value 1)
  14166. --- Inner Elaboration Phase, active level 1 (S1) ---
  14167. Firing prefer*rvt*predict-yes*H0
  14168. -->
  14169. Firing rl*prefer*rvt*predict-yes*H0*1
  14170. -->
  14171. (S1 ^operator O1997 = 0.3402455472490794)
  14172. Firing prefer*rvt*predict-yes*H0*1*H1
  14173. -->
  14174. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  14175. -->
  14176. (S1 ^operator O1997 = 0.02884852834965246)
  14177. Firing prefer*rvt*predict-no*H0
  14178. -->
  14179. Firing rl*prefer*rvt*predict-no*H0*2
  14180. -->
  14181. (S1 ^operator O1998 = 0.3212899096504038)
  14182. Firing prefer*rvt*predict-no*H0*2*H1
  14183. -->
  14184. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  14185. -->
  14186. (S1 ^operator O1998 = 0.678736816647851)
  14187. inner elaboration loop at bottom goal.
  14188. Retracting rl*prefer*rvt*predict-no*H0*2
  14189. -->
  14190. (S1 ^operator O1996 = 0.3212899096504038)
  14191. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  14192. -->
  14193. (S1 ^operator O1996 = 0.678736816647851)
  14194. Retracting rl*prefer*rvt*predict-yes*H0*1
  14195. -->
  14196. (S1 ^operator O1995 = 0.3402455472490794)
  14197. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  14198. -->
  14199. (S1 ^operator O1995 = 0.02884852834965246)
  14200. --- END Proposal Phase ---
  14201. --- Decision Phase ---
  14202. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236933 0.340246 -> 0.577178 -0.236932 0.340246(R,m,v=1,0.89697,0.0929786)
  14203. RL update rl*prefer*rvt*predict-yes*H0*1*H1*16 0.422822 0.236932 0.659754 -> 0.422822 0.236932 0.659754(R,m,v=1,1,0)
  14204. =>WM: (14003: S1 ^operator O1998)
  14205. 999: O: O1998 (predict-no)
  14206. --- END Decision Phase ---
  14207. --- Application Phase ---
  14208. --- Firing Productions (PE) For State At Depth 1 ---
  14209. --- Inner Elaboration Phase, active level 1 (S1) ---
  14210. Firing apply*operator
  14211. -->
  14212. (I3 ^predict-no N999 + :O )
  14213. Firing apply*operator*complete
  14214. -->
  14215. (I3 ^predict-yes N998 - :O )
  14216. inner elaboration loop at bottom goal.
  14217. --- Change Working Memory (PE) ---
  14218. =>WM: (14004: I3 ^predict-no N999)
  14219. <=WM: (13991: N998 ^status complete)
  14220. <=WM: (13990: I3 ^predict-yes N998)
  14221. --- Firing Productions (IE) For State At Depth 1 ---
  14222. --- Inner Elaboration Phase, active level 1 (S1) ---
  14223. Firing monitor*world
  14224. -->
  14225. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14226. --- Change Working Memory (IE) ---
  14227. --- END Application Phase ---
  14228. --- Output Phase ---
  14229. ENV: Agent did: predict-no for direction L in state State-A
  14230. In State-A moving L
  14231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14232. predict error 0
  14233. dir: dir isU
  14234. --- END Output Phase ---
  14235. |\---- Input Phase ---
  14236. =>WM: (14008: I2 ^dir U)
  14237. =>WM: (14007: I2 ^reward 1)
  14238. =>WM: (14006: I2 ^see 0)
  14239. =>WM: (14005: N999 ^status complete)
  14240. <=WM: (13994: I2 ^dir L)
  14241. <=WM: (13993: I2 ^reward 1)
  14242. <=WM: (13992: I2 ^see 1)
  14243. =>WM: (14009: I2 ^level-1 L0-root)
  14244. <=WM: (13995: I2 ^level-1 L1-root)
  14245. --- END Input Phase ---
  14246. --- Proposal Phase ---
  14247. --- Inner Elaboration Phase, active level 1 (S1) ---
  14248. Firing elaborate*copy-see-to-output-link
  14249. -->
  14250. (I3 ^see 0 +)
  14251. Firing elaborate*reward*based*on*reward
  14252. -->
  14253. (R1003 ^value 1 +)
  14254. (R1 ^reward R1003 +)
  14255. Firing propose*predict-yes
  14256. -->
  14257. (O1999 ^name predict-yes +)
  14258. (S1 ^operator O1999 +)
  14259. Firing propose*predict-no
  14260. -->
  14261. (O2000 ^name predict-no +)
  14262. (S1 ^operator O2000 +)
  14263. Firing rl*prefer*rvt*predict-no*H0*4
  14264. -->
  14265. (S1 ^operator O1998 = 0.9999999999999999)
  14266. Firing rl*prefer*rvt*predict-yes*H0*3
  14267. -->
  14268. (S1 ^operator O1997 = 0.)
  14269. Firing prefer*rvt*predict-yes*H0
  14270. -->
  14271. Firing prefer*rvt*predict-no*H0
  14272. -->
  14273. Firing elaborate*copy-dir-to-output-link
  14274. -->
  14275. (I3 ^dir U +)
  14276. inner elaboration loop at bottom goal.
  14277. Retracting elaborate*copy-see-to-output-link
  14278. -->
  14279. (I3 ^see 1 +)
  14280. Retracting propose*predict-no
  14281. -->
  14282. (O1998 ^name predict-no +)
  14283. (S1 ^operator O1998 +)
  14284. Retracting propose*predict-yes
  14285. -->
  14286. (O1997 ^name predict-yes +)
  14287. (S1 ^operator O1997 +)
  14288. Retracting elaborate*reward*based*on*reward
  14289. -->
  14290. (R1002 ^value 1 +)
  14291. (R1 ^reward R1002 +)
  14292. Retracting elaborate*copy-dir-to-output-link
  14293. -->
  14294. (I3 ^dir L +)
  14295. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  14296. -->
  14297. (S1 ^operator O1998 = 0.678736816647851)
  14298. Retracting rl*prefer*rvt*predict-no*H0*2
  14299. -->
  14300. (S1 ^operator O1998 = 0.3212899096504038)
  14301. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  14302. -->
  14303. (S1 ^operator O1997 = 0.02884852834965246)
  14304. Retracting rl*prefer*rvt*predict-yes*H0*1
  14305. -->
  14306. (S1 ^operator O1997 = 0.3402456559346386)
  14307. =>WM: (14017: S1 ^operator O2000 +)
  14308. =>WM: (14016: S1 ^operator O1999 +)
  14309. =>WM: (14015: I3 ^dir U)
  14310. =>WM: (14014: O2000 ^name predict-no)
  14311. =>WM: (14013: O1999 ^name predict-yes)
  14312. =>WM: (14012: R1003 ^value 1)
  14313. =>WM: (14011: R1 ^reward R1003)
  14314. =>WM: (14010: I3 ^see 0)
  14315. <=WM: (14001: S1 ^operator O1997 +)
  14316. <=WM: (14002: S1 ^operator O1998 +)
  14317. <=WM: (14003: S1 ^operator O1998)
  14318. <=WM: (13986: I3 ^dir L)
  14319. <=WM: (13997: R1 ^reward R1002)
  14320. <=WM: (13996: I3 ^see 1)
  14321. <=WM: (14000: O1998 ^name predict-no)
  14322. <=WM: (13999: O1997 ^name predict-yes)
  14323. <=WM: (13998: R1002 ^value 1)
  14324. --- Inner Elaboration Phase, active level 1 (S1) ---
  14325. Firing prefer*rvt*predict-yes*H0
  14326. -->
  14327. Firing rl*prefer*rvt*predict-yes*H0*3
  14328. -->
  14329. (S1 ^operator O1999 = 0.)
  14330. Firing prefer*rvt*predict-no*H0
  14331. -->
  14332. Firing rl*prefer*rvt*predict-no*H0*4
  14333. -->
  14334. (S1 ^operator O2000 = 0.9999999999999999)
  14335. inner elaboration loop at bottom goal.
  14336. Retracting rl*prefer*rvt*predict-no*H0*4
  14337. -->
  14338. (S1 ^operator O1998 = 0.9999999999999999)
  14339. Retracting rl*prefer*rvt*predict-yes*H0*3
  14340. -->
  14341. (S1 ^operator O1997 = 0.)
  14342. --- END Proposal Phase ---
  14343. --- Decision Phase ---
  14344. RL update rl*prefer*rvt*predict-no*H0*2 0.641767 -0.320477 0.32129 -> 0.641763 -0.320477 0.321286(R,m,v=1,0.933775,0.0622517)
  14345. RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358259 0.320477 0.678737 -> 0.358255 0.320477 0.678733(R,m,v=1,1,0)
  14346. =>WM: (14018: S1 ^operator O2000)
  14347. 1000: O: O2000 (predict-no)
  14348. --- END Decision Phase ---
  14349. --- Application Phase ---
  14350. --- Firing Productions (PE) For State At Depth 1 ---
  14351. --- Inner Elaboration Phase, active level 1 (S1) ---
  14352. Firing apply*operator
  14353. -->
  14354. (I3 ^predict-no N1000 + :O )
  14355. Firing apply*operator*complete
  14356. -->
  14357. (I3 ^predict-no N999 - :O )
  14358. inner elaboration loop at bottom goal.
  14359. --- Change Working Memory (PE) ---
  14360. =>WM: (14019: I3 ^predict-no N1000)
  14361. <=WM: (14005: N999 ^status complete)
  14362. <=WM: (14004: I3 ^predict-no N999)
  14363. --- Firing Productions (IE) For State At Depth 1 ---
  14364. --- Inner Elaboration Phase, active level 1 (S1) ---
  14365. Firing monitor*world
  14366. -->
  14367. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14368. --- Change Working Memory (IE) ---
  14369. --- END Application Phase ---
  14370. --- Output Phase ---
  14371. ENV: Agent did: predict-no for direction U in state State-A
  14372. In State-A moving U
  14373. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14374. predict error 0
  14375. dir: dir isR
  14376. --- END Output Phase ---
  14377. /|\-/|\-/|\--- Input Phase ---
  14378. =>WM: (14023: I2 ^dir R)
  14379. =>WM: (14022: I2 ^reward 1)
  14380. =>WM: (14021: I2 ^see 0)
  14381. =>WM: (14020: N1000 ^status complete)
  14382. <=WM: (14008: I2 ^dir U)
  14383. <=WM: (14007: I2 ^reward 1)
  14384. <=WM: (14006: I2 ^see 0)
  14385. =>WM: (14024: I2 ^level-1 L0-root)
  14386. <=WM: (14009: I2 ^level-1 L0-root)
  14387. --- END Input Phase ---
  14388. --- Proposal Phase ---
  14389. --- Inner Elaboration Phase, active level 1 (S1) ---
  14390. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  14391. -->
  14392. (S1 ^operator O1999 = 0.8878820777819987)
  14393. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  14394. -->
  14395. (S1 ^operator O2000 = -0.1957074416057287)
  14396. Firing prefer*rvt*predict-no*H0*6*H1
  14397. -->
  14398. Firing prefer*rvt*predict-yes*H0*5*H1
  14399. -->
  14400. Firing elaborate*copy-see-to-output-link
  14401. -->
  14402. (I3 ^see 0 +)
  14403. Firing elaborate*reward*based*on*reward
  14404. -->
  14405. (R1004 ^value 1 +)
  14406. (R1 ^reward R1004 +)
  14407. Firing propose*predict-yes
  14408. -->
  14409. (O2001 ^name predict-yes +)
  14410. (S1 ^operator O2001 +)
  14411. Firing propose*predict-no
  14412. -->
  14413. (O2002 ^name predict-no +)
  14414. (S1 ^operator O2002 +)
  14415. Firing rl*prefer*rvt*predict-no*H0*6
  14416. -->
  14417. (S1 ^operator O2000 = 0.3993286818522114)
  14418. Firing rl*prefer*rvt*predict-yes*H0*5
  14419. -->
  14420. (S1 ^operator O1999 = 0.1121030519341281)
  14421. Firing prefer*rvt*predict-yes*H0
  14422. -->
  14423. Firing prefer*rvt*predict-no*H0
  14424. -->
  14425. Firing elaborate*copy-dir-to-output-link
  14426. -->
  14427. (I3 ^dir R +)
  14428. inner elaboration loop at bottom goal.
  14429. Retracting elaborate*copy-see-to-output-link
  14430. -->
  14431. (I3 ^see 0 +)
  14432. Retracting propose*predict-no
  14433. -->
  14434. (O2000 ^name predict-no +)
  14435. (S1 ^operator O2000 +)
  14436. Retracting propose*predict-yes
  14437. -->
  14438. (O1999 ^name predict-yes +)
  14439. (S1 ^operator O1999 +)
  14440. Retracting elaborate*reward*based*on*reward
  14441. -->
  14442. (R1003 ^value 1 +)
  14443. (R1 ^reward R1003 +)
  14444. Retracting elaborate*copy-dir-to-output-link
  14445. -->
  14446. (I3 ^dir U +)
  14447. Retracting rl*prefer*rvt*predict-no*H0*4
  14448. -->
  14449. (S1 ^operator O2000 = 0.9999999999999999)
  14450. Retracting rl*prefer*rvt*predict-yes*H0*3
  14451. -->
  14452. (S1 ^operator O1999 = 0.)
  14453. =>WM: (14031: S1 ^operator O2002 +)
  14454. =>WM: (14030: S1 ^operator O2001 +)
  14455. =>WM: (14029: I3 ^dir R)
  14456. =>WM: (14028: O2002 ^name predict-no)
  14457. =>WM: (14027: O2001 ^name predict-yes)
  14458. =>WM: (14026: R1004 ^value 1)
  14459. =>WM: (14025: R1 ^reward R1004)
  14460. <=WM: (14016: S1 ^operator O1999 +)
  14461. <=WM: (14017: S1 ^operator O2000 +)
  14462. <=WM: (14018: S1 ^operator O2000)
  14463. <=WM: (14015: I3 ^dir U)
  14464. <=WM: (14011: R1 ^reward R1003)
  14465. <=WM: (14014: O2000 ^name predict-no)
  14466. <=WM: (14013: O1999 ^name predict-yes)
  14467. <=WM: (14012: R1003 ^value 1)
  14468. --- Inner Elaboration Phase, active level 1 (S1) ---
  14469. Firing prefer*rvt*predict-yes*H0
  14470. -->
  14471. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  14472. -->
  14473. (S1 ^operator O2001 = 0.8878820777819987)
  14474. Firing rl*prefer*rvt*predict-yes*H0*5
  14475. -->
  14476. (S1 ^operator O2001 = 0.1121030519341281)
  14477. Firing prefer*rvt*predict-yes*H0*5*H1
  14478. -->
  14479. Firing prefer*rvt*predict-no*H0
  14480. -->
  14481. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  14482. -->
  14483. (S1 ^operator O2002 = -0.1957074416057287)
  14484. Firing rl*prefer*rvt*predict-no*H0*6
  14485. -->
  14486. (S1 ^operator O2002 = 0.3993286818522114)
  14487. Firing prefer*rvt*predict-no*H0*6*H1
  14488. -->
  14489. inner elaboration loop at bottom goal.
  14490. Retracting rl*prefer*rvt*predict-no*H0*6
  14491. -->
  14492. (S1 ^operator O2000 = 0.3993286818522114)
  14493. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  14494. -->
  14495. (S1 ^operator O2000 = -0.1957074416057287)
  14496. Retracting rl*prefer*rvt*predict-yes*H0*5
  14497. -->
  14498. (S1 ^operator O1999 = 0.1121030519341281)
  14499. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  14500. -->
  14501. (S1 ^operator O1999 = 0.8878820777819987)
  14502. --- END Proposal Phase ---
  14503. --- Decision Phase ---
  14504. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14505. =>WM: (14032: S1 ^operator O2001)
  14506. 1001: O: O2001 (predict-yes)
  14507. --- END Decision Phase ---
  14508. --- Application Phase ---
  14509. --- Firing Productions (PE) For State At Depth 1 ---
  14510. --- Inner Elaboration Phase, active level 1 (S1) ---
  14511. Firing apply*operator
  14512. -->
  14513. (I3 ^predict-yes N1001 + :O )
  14514. Firing apply*operator*complete
  14515. -->
  14516. (I3 ^predict-no N1000 - :O )
  14517. inner elaboration loop at bottom goal.
  14518. --- Change Working Memory (PE) ---
  14519. =>WM: (14033: I3 ^predict-yes N1001)
  14520. <=WM: (14020: N1000 ^status complete)
  14521. <=WM: (14019: I3 ^predict-no N1000)
  14522. --- Firing Productions (IE) For State At Depth 1 ---
  14523. --- Inner Elaboration Phase, active level 1 (S1) ---
  14524. Firing monitor*world
  14525. -->
  14526. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14527. --- Change Working Memory (IE) ---
  14528. --- END Application Phase ---
  14529. --- Output Phase ---
  14530. ENV: Agent did: predict-yes for direction R in state State-A
  14531. In State-A moving R
  14532. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14533. predict error 0
  14534. dir: dir isL
  14535. --- END Output Phase ---
  14536. ---- Input Phase ---
  14537. =>WM: (14037: I2 ^dir L)
  14538. =>WM: (14036: I2 ^reward 1)
  14539. =>WM: (14035: I2 ^see 1)
  14540. =>WM: (14034: N1001 ^status complete)
  14541. <=WM: (14023: I2 ^dir R)
  14542. <=WM: (14022: I2 ^reward 1)
  14543. <=WM: (14021: I2 ^see 0)
  14544. =>WM: (14038: I2 ^level-1 R1-root)
  14545. <=WM: (14024: I2 ^level-1 L0-root)
  14546. --- END Input Phase ---
  14547. --- Proposal Phase ---
  14548. --- Inner Elaboration Phase, active level 1 (S1) ---
  14549. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  14550. -->
  14551. (S1 ^operator O2002 = 0.03900899329983293)
  14552. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  14553. -->
  14554. (S1 ^operator O2001 = 0.6597553453917251)
  14555. Firing prefer*rvt*predict-no*H0*2*H1
  14556. -->
  14557. Firing prefer*rvt*predict-yes*H0*1*H1
  14558. -->
  14559. Firing elaborate*copy-see-to-output-link
  14560. -->
  14561. (I3 ^see 1 +)
  14562. Firing elaborate*reward*based*on*reward
  14563. -->
  14564. (R1005 ^value 1 +)
  14565. (R1 ^reward R1005 +)
  14566. Firing propose*predict-yes
  14567. -->
  14568. (O2003 ^name predict-yes +)
  14569. (S1 ^operator O2003 +)
  14570. Firing propose*predict-no
  14571. -->
  14572. (O2004 ^name predict-no +)
  14573. (S1 ^operator O2004 +)
  14574. Firing rl*prefer*rvt*predict-no*H0*2
  14575. -->
  14576. (S1 ^operator O2002 = 0.3212859007056656)
  14577. Firing rl*prefer*rvt*predict-yes*H0*1
  14578. -->
  14579. (S1 ^operator O2001 = 0.3402456559346386)
  14580. Firing prefer*rvt*predict-yes*H0
  14581. -->
  14582. Firing prefer*rvt*predict-no*H0
  14583. -->
  14584. Firing elaborate*copy-dir-to-output-link
  14585. -->
  14586. (I3 ^dir L +)
  14587. inner elaboration loop at bottom goal.
  14588. Retracting elaborate*copy-see-to-output-link
  14589. -->
  14590. (I3 ^see 0 +)
  14591. Retracting propose*predict-no
  14592. -->
  14593. (O2002 ^name predict-no +)
  14594. (S1 ^operator O2002 +)
  14595. Retracting propose*predict-yes
  14596. -->
  14597. (O2001 ^name predict-yes +)
  14598. (S1 ^operator O2001 +)
  14599. Retracting elaborate*reward*based*on*reward
  14600. -->
  14601. (R1004 ^value 1 +)
  14602. (R1 ^reward R1004 +)
  14603. Retracting elaborate*copy-dir-to-output-link
  14604. -->
  14605. (I3 ^dir R +)
  14606. Retracting rl*prefer*rvt*predict-no*H0*6
  14607. -->
  14608. (S1 ^operator O2002 = 0.3993286818522114)
  14609. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  14610. -->
  14611. (S1 ^operator O2002 = -0.1957074416057287)
  14612. Retracting rl*prefer*rvt*predict-yes*H0*5
  14613. -->
  14614. (S1 ^operator O2001 = 0.1121030519341281)
  14615. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  14616. -->
  14617. (S1 ^operator O2001 = 0.8878820777819987)
  14618. =>WM: (14046: S1 ^operator O2004 +)
  14619. =>WM: (14045: S1 ^operator O2003 +)
  14620. =>WM: (14044: I3 ^dir L)
  14621. =>WM: (14043: O2004 ^name predict-no)
  14622. =>WM: (14042: O2003 ^name predict-yes)
  14623. =>WM: (14041: R1005 ^value 1)
  14624. =>WM: (14040: R1 ^reward R1005)
  14625. =>WM: (14039: I3 ^see 1)
  14626. <=WM: (14030: S1 ^operator O2001 +)
  14627. <=WM: (14032: S1 ^operator O2001)
  14628. <=WM: (14031: S1 ^operator O2002 +)
  14629. <=WM: (14029: I3 ^dir R)
  14630. <=WM: (14025: R1 ^reward R1004)
  14631. <=WM: (14010: I3 ^see 0)
  14632. <=WM: (14028: O2002 ^name predict-no)
  14633. <=WM: (14027: O2001 ^name predict-yes)
  14634. <=WM: (14026: R1004 ^value 1)
  14635. --- Inner Elaboration Phase, active level 1 (S1) ---
  14636. Firing prefer*rvt*predict-yes*H0
  14637. -->
  14638. Firing rl*prefer*rvt*predict-yes*H0*1
  14639. -->
  14640. (S1 ^operator O2003 = 0.3402456559346386)
  14641. Firing prefer*rvt*predict-yes*H0*1*H1
  14642. -->
  14643. Firing rl*prefer*rvt*predict-yes*H0*1*H1*7
  14644. -->
  14645. (S1 ^operator O2003 = 0.6597553453917251)
  14646. Firing prefer*rvt*predict-no*H0
  14647. -->
  14648. Firing rl*prefer*rvt*predict-no*H0*2
  14649. -->
  14650. (S1 ^operator O2004 = 0.3212859007056656)
  14651. Firing prefer*rvt*predict-no*H0*2*H1
  14652. -->
  14653. Firing rl*prefer*rvt*predict-no*H0*2*H1*8
  14654. -->
  14655. (S1 ^operator O2004 = 0.03900899329983293)
  14656. inner elaboration loop at bottom goal.
  14657. Retracting rl*prefer*rvt*predict-no*H0*2
  14658. -->
  14659. (S1 ^operator O2002 = 0.3212859007056656)
  14660. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  14661. -->
  14662. (S1 ^operator O2002 = 0.03900899329983293)
  14663. Retracting rl*prefer*rvt*predict-yes*H0*1
  14664. -->
  14665. (S1 ^operator O2001 = 0.3402456559346386)
  14666. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  14667. -->
  14668. (S1 ^operator O2001 = 0.6597553453917251)
  14669. --- END Proposal Phase ---
  14670. --- Decision Phase ---
  14671. RL update rl*prefer*rvt*predict-yes*H0*5 0.619026 -0.506923 0.112103 -> 0.619028 -0.506923 0.112105(R,m,v=1,0.90184,0.0890707)
  14672. RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380957 0.506925 0.887882 -> 0.380959 0.506925 0.887884(R,m,v=1,1,0)
  14673. =>WM: (14047: S1 ^operator O2003)
  14674. 1002: O: O2003 (predict-yes)
  14675. --- END Decision Phase ---
  14676. --- Application Phase ---
  14677. --- Firing Productions (PE) For State At Depth 1 ---
  14678. --- Inner Elaboration Phase, active level 1 (S1) ---
  14679. Firing apply*operator
  14680. -->
  14681. (I3 ^predict-yes N1002 + :O )
  14682. Firing apply*operator*complete
  14683. -->
  14684. (I3 ^predict-yes N1001 - :O )
  14685. inner elaboration loop at bottom goal.
  14686. --- Change Working Memory (PE) ---
  14687. =>WM: (14048: I3 ^predict-yes N1002)
  14688. <=WM: (14034: N1001 ^status complete)
  14689. <=WM: (14033: I3 ^predict-yes N1001)
  14690. --- Firing Productions (IE) For State At Depth 1 ---
  14691. --- Inner Elaboration Phase, active level 1 (S1) ---
  14692. Firing monitor*world
  14693. -->
  14694. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14695. --- Change Working Memory (IE) ---
  14696. --- END Application Phase ---
  14697. --- Output Phase ---
  14698. ENV: Agent did: predict-yes for direction L in state State-B
  14699. In State-B moving L
  14700. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14701. predict error 0
  14702. dir: dir isL
  14703. --- END Output Phase ---
  14704. /|\--- Input Phase ---
  14705. =>WM: (14052: I2 ^dir L)
  14706. =>WM: (14051: I2 ^reward 1)
  14707. =>WM: (14050: I2 ^see 1)
  14708. =>WM: (14049: N1002 ^status complete)
  14709. <=WM: (14037: I2 ^dir L)
  14710. <=WM: (14036: I2 ^reward 1)
  14711. <=WM: (14035: I2 ^see 1)
  14712. =>WM: (14053: I2 ^level-1 L1-root)
  14713. <=WM: (14038: I2 ^level-1 R1-root)
  14714. --- END Input Phase ---
  14715. --- Proposal Phase ---
  14716. --- Inner Elaboration Phase, active level 1 (S1) ---
  14717. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  14718. -->
  14719. (S1 ^operator O2003 = 0.02884852834965246)
  14720. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  14721. -->
  14722. (S1 ^operator O2004 = 0.6787328077031127)
  14723. Firing prefer*rvt*predict-no*H0*2*H1
  14724. -->
  14725. Firing prefer*rvt*predict-yes*H0*1*H1
  14726. -->
  14727. Firing elaborate*copy-see-to-output-link
  14728. -->
  14729. (I3 ^see 1 +)
  14730. Firing elaborate*reward*based*on*reward
  14731. -->
  14732. (R1006 ^value 1 +)
  14733. (R1 ^reward R1006 +)
  14734. Firing propose*predict-yes
  14735. -->
  14736. (O2005 ^name predict-yes +)
  14737. (S1 ^operator O2005 +)
  14738. Firing propose*predict-no
  14739. -->
  14740. (O2006 ^name predict-no +)
  14741. (S1 ^operator O2006 +)
  14742. Firing rl*prefer*rvt*predict-no*H0*2
  14743. -->
  14744. (S1 ^operator O2004 = 0.3212859007056656)
  14745. Firing rl*prefer*rvt*predict-yes*H0*1
  14746. -->
  14747. (S1 ^operator O2003 = 0.3402456559346386)
  14748. Firing prefer*rvt*predict-yes*H0
  14749. -->
  14750. Firing prefer*rvt*predict-no*H0
  14751. -->
  14752. Firing elaborate*copy-dir-to-output-link
  14753. -->
  14754. (I3 ^dir L +)
  14755. inner elaboration loop at bottom goal.
  14756. Retracting elaborate*copy-see-to-output-link
  14757. -->
  14758. (I3 ^see 1 +)
  14759. Retracting propose*predict-no
  14760. -->
  14761. (O2004 ^name predict-no +)
  14762. (S1 ^operator O2004 +)
  14763. Retracting propose*predict-yes
  14764. -->
  14765. (O2003 ^name predict-yes +)
  14766. (S1 ^operator O2003 +)
  14767. Retracting elaborate*reward*based*on*reward
  14768. -->
  14769. (R1005 ^value 1 +)
  14770. (R1 ^reward R1005 +)
  14771. Retracting elaborate*copy-dir-to-output-link
  14772. -->
  14773. (I3 ^dir L +)
  14774. Retracting rl*prefer*rvt*predict-no*H0*2*H1*8
  14775. -->
  14776. (S1 ^operator O2004 = 0.03900899329983293)
  14777. Retracting rl*prefer*rvt*predict-no*H0*2
  14778. -->
  14779. (S1 ^operator O2004 = 0.3212859007056656)
  14780. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*7
  14781. -->
  14782. (S1 ^operator O2003 = 0.6597553453917251)
  14783. Retracting rl*prefer*rvt*predict-yes*H0*1
  14784. -->
  14785. (S1 ^operator O2003 = 0.3402456559346386)
  14786. =>WM: (14059: S1 ^operator O2006 +)
  14787. =>WM: (14058: S1 ^operator O2005 +)
  14788. =>WM: (14057: O2006 ^name predict-no)
  14789. =>WM: (14056: O2005 ^name predict-yes)
  14790. =>WM: (14055: R1006 ^value 1)
  14791. =>WM: (14054: R1 ^reward R1006)
  14792. <=WM: (14045: S1 ^operator O2003 +)
  14793. <=WM: (14047: S1 ^operator O2003)
  14794. <=WM: (14046: S1 ^operator O2004 +)
  14795. <=WM: (14040: R1 ^reward R1005)
  14796. <=WM: (14043: O2004 ^name predict-no)
  14797. <=WM: (14042: O2003 ^name predict-yes)
  14798. <=WM: (14041: R1005 ^value 1)
  14799. --- Inner Elaboration Phase, active level 1 (S1) ---
  14800. Firing prefer*rvt*predict-yes*H0
  14801. -->
  14802. Firing rl*prefer*rvt*predict-yes*H0*1
  14803. -->
  14804. (S1 ^operator O2005 = 0.3402456559346386)
  14805. Firing prefer*rvt*predict-yes*H0*1*H1
  14806. -->
  14807. Firing rl*prefer*rvt*predict-yes*H0*1*H1*18
  14808. -->
  14809. (S1 ^operator O2005 = 0.02884852834965246)
  14810. Firing prefer*rvt*predict-no*H0
  14811. -->
  14812. Firing rl*prefer*rvt*predict-no*H0*2
  14813. -->
  14814. (S1 ^operator O2006 = 0.3212859007056656)
  14815. Firing prefer*rvt*predict-no*H0*2*H1
  14816. -->
  14817. Firing rl*prefer*rvt*predict-no*H0*2*H1*17
  14818. -->
  14819. (S1 ^operator O2006 = 0.6787328077031127)
  14820. inner elaboration loop at bottom goal.
  14821. Retracting rl*prefer*rvt*predict-no*H0*2
  14822. -->
  14823. (S1 ^operator O2004 = 0.3212859007056656)
  14824. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  14825. -->
  14826. (S1 ^operator O2004 = 0.6787328077031127)
  14827. Retracting rl*prefer*rvt*predict-yes*H0*1
  14828. -->
  14829. (S1 ^operator O2003 = 0.3402456559346386)
  14830. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  14831. -->
  14832. (S1 ^operator O2003 = 0.02884852834965246)
  14833. --- END Proposal Phase ---
  14834. --- Decision Phase ---
  14835. RL update rl*prefer*rvt*predict-yes*H0*1 0.577178 -0.236932 0.340246 -> 0.577178 -0.236933 0.340246(R,m,v=1,0.89759,0.092479)
  14836. RL update rl*prefer*rvt*predict-yes*H0*1*H1*7 0.422822 0.236933 0.659755 -> 0.422822 0.236933 0.659755(R,m,v=1,1,0)
  14837. =>WM: (14060: S1 ^operator O2006)
  14838. 1003: O: O2006 (predict-no)
  14839. --- END Decision Phase ---
  14840. --- Application Phase ---
  14841. --- Firing Productions (PE) For State At Depth 1 ---
  14842. --- Inner Elaboration Phase, active level 1 (S1) ---
  14843. Firing apply*operator
  14844. -->
  14845. (I3 ^predict-no N1003 + :O )
  14846. Firing apply*operator*complete
  14847. -->
  14848. (I3 ^predict-yes N1002 - :O )
  14849. inner elaboration loop at bottom goal.
  14850. --- Change Working Memory (PE) ---
  14851. =>WM: (14061: I3 ^predict-no N1003)
  14852. <=WM: (14049: N1002 ^status complete)
  14853. <=WM: (14048: I3 ^predict-yes N1002)
  14854. --- Firing Productions (IE) For State At Depth 1 ---
  14855. --- Inner Elaboration Phase, active level 1 (S1) ---
  14856. Firing monitor*world
  14857. -->
  14858. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14859. --- Change Working Memory (IE) ---
  14860. --- END Application Phase ---
  14861. --- Output Phase ---
  14862. ENV: Agent did: predict-no for direction L in state State-A
  14863. In State-A moving L
  14864. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14865. predict error 0
  14866. dir: dir isR
  14867. --- END Output Phase ---
  14868. -/--- Input Phase ---
  14869. =>WM: (14065: I2 ^dir R)
  14870. =>WM: (14064: I2 ^reward 1)
  14871. =>WM: (14063: I2 ^see 0)
  14872. =>WM: (14062: N1003 ^status complete)
  14873. <=WM: (14052: I2 ^dir L)
  14874. <=WM: (14051: I2 ^reward 1)
  14875. <=WM: (14050: I2 ^see 1)
  14876. =>WM: (14066: I2 ^level-1 L0-root)
  14877. <=WM: (14053: I2 ^level-1 L1-root)
  14878. --- END Input Phase ---
  14879. --- Proposal Phase ---
  14880. --- Inner Elaboration Phase, active level 1 (S1) ---
  14881. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  14882. -->
  14883. (S1 ^operator O2005 = 0.8878843083245797)
  14884. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  14885. -->
  14886. (S1 ^operator O2006 = -0.1957074416057287)
  14887. Firing prefer*rvt*predict-no*H0*6*H1
  14888. -->
  14889. Firing prefer*rvt*predict-yes*H0*5*H1
  14890. -->
  14891. Firing elaborate*copy-see-to-output-link
  14892. -->
  14893. (I3 ^see 0 +)
  14894. Firing elaborate*reward*based*on*reward
  14895. -->
  14896. (R1007 ^value 1 +)
  14897. (R1 ^reward R1007 +)
  14898. Firing propose*predict-yes
  14899. -->
  14900. (O2007 ^name predict-yes +)
  14901. (S1 ^operator O2007 +)
  14902. Firing propose*predict-no
  14903. -->
  14904. (O2008 ^name predict-no +)
  14905. (S1 ^operator O2008 +)
  14906. Firing rl*prefer*rvt*predict-no*H0*6
  14907. -->
  14908. (S1 ^operator O2006 = 0.3993286818522114)
  14909. Firing rl*prefer*rvt*predict-yes*H0*5
  14910. -->
  14911. (S1 ^operator O2005 = 0.1121052824767091)
  14912. Firing prefer*rvt*predict-yes*H0
  14913. -->
  14914. Firing prefer*rvt*predict-no*H0
  14915. -->
  14916. Firing elaborate*copy-dir-to-output-link
  14917. -->
  14918. (I3 ^dir R +)
  14919. inner elaboration loop at bottom goal.
  14920. Retracting elaborate*copy-see-to-output-link
  14921. -->
  14922. (I3 ^see 1 +)
  14923. Retracting propose*predict-no
  14924. -->
  14925. (O2006 ^name predict-no +)
  14926. (S1 ^operator O2006 +)
  14927. Retracting propose*predict-yes
  14928. -->
  14929. (O2005 ^name predict-yes +)
  14930. (S1 ^operator O2005 +)
  14931. Retracting elaborate*reward*based*on*reward
  14932. -->
  14933. (R1006 ^value 1 +)
  14934. (R1 ^reward R1006 +)
  14935. Retracting elaborate*copy-dir-to-output-link
  14936. -->
  14937. (I3 ^dir L +)
  14938. Retracting rl*prefer*rvt*predict-no*H0*2*H1*17
  14939. -->
  14940. (S1 ^operator O2006 = 0.6787328077031127)
  14941. Retracting rl*prefer*rvt*predict-no*H0*2
  14942. -->
  14943. (S1 ^operator O2006 = 0.3212859007056656)
  14944. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*18
  14945. -->
  14946. (S1 ^operator O2005 = 0.02884852834965246)
  14947. Retracting rl*prefer*rvt*predict-yes*H0*1
  14948. -->
  14949. (S1 ^operator O2005 = 0.340245505735684)
  14950. =>WM: (14074: S1 ^operator O2008 +)
  14951. =>WM: (14073: S1 ^operator O2007 +)
  14952. =>WM: (14072: I3 ^dir R)
  14953. =>WM: (14071: O2008 ^name predict-no)
  14954. =>WM: (14070: O2007 ^name predict-yes)
  14955. =>WM: (14069: R1007 ^value 1)
  14956. =>WM: (14068: R1 ^reward R1007)
  14957. =>WM: (14067: I3 ^see 0)
  14958. <=WM: (14058: S1 ^operator O2005 +)
  14959. <=WM: (14059: S1 ^operator O2006 +)
  14960. <=WM: (14060: S1 ^operator O2006)
  14961. <=WM: (14044: I3 ^dir L)
  14962. <=WM: (14054: R1 ^reward R1006)
  14963. <=WM: (14039: I3 ^see 1)
  14964. <=WM: (14057: O2006 ^name predict-no)
  14965. <=WM: (14056: O2005 ^name predict-yes)
  14966. <=WM: (14055: R1006 ^value 1)
  14967. --- Inner Elaboration Phase, active level 1 (S1) ---
  14968. Firing prefer*rvt*predict-yes*H0
  14969. -->
  14970. Firing rl*prefer*rvt*predict-yes*H0*5
  14971. -->
  14972. (S1 ^operator O2007 = 0.1121052824767091)
  14973. Firing prefer*rvt*predict-yes*H0*5*H1
  14974. -->
  14975. Firing rl*prefer*rvt*predict-yes*H0*5*H1*20
  14976. -->
  14977. (S1 ^operator O2007 = 0.8878843083245797)
  14978. Firing prefer*rvt*predict-no*H0
  14979. -->
  14980. Firing rl*prefer*rvt*predict-no*H0*6
  14981. -->
  14982. (S1 ^operator O2008 = 0.3993286818522114)
  14983. Firing prefer*rvt*predict-no*H0*6*H1
  14984. -->
  14985. Firing rl*prefer*rvt*predict-no*H0*6*H1*19
  14986. -->
  14987. (S1 ^operator O2008 = -0.1957074416057287)
  14988. inner elaboration loop at bottom goal.
  14989. Retracting rl*prefer*rvt*predict-no*H0*6
  14990. -->
  14991. (S1 ^operator O2006 = 0.3993286818522114)
  14992. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  14993. -->
  14994. (S1 ^operator O2006 = -0.1957074416057287)
  14995. Retracting rl*prefer*rvt*predict-yes*H0*5
  14996. -->
  14997. (S1 ^operator O2005 = 0.1121052824767091)
  14998. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  14999. -->
  15000. (S1 ^operator O2005 = 0.8878843083245797)
  15001. --- END Proposal Phase ---
  15002. --- Decision Phase ---
  15003. RL update rl*prefer*rvt*predict-no*H0*2 0.641763 -0.320477 0.321286 -> 0.641761 -0.320477 0.321283(R,m,v=1,0.934211,0.0618682)
  15004. RL update rl*prefer*rvt*predict-no*H0*2*H1*17 0.358255 0.320477 0.678733 -> 0.358253 0.320477 0.67873(R,m,v=1,1,0)
  15005. =>WM: (14075: S1 ^operator O2007)
  15006. 1004: O: O2007 (predict-yes)
  15007. --- END Decision Phase ---
  15008. --- Application Phase ---
  15009. --- Firing Productions (PE) For State At Depth 1 ---
  15010. --- Inner Elaboration Phase, active level 1 (S1) ---
  15011. Firing apply*operator
  15012. -->
  15013. (I3 ^predict-yes N1004 + :O )
  15014. Firing apply*operator*complete
  15015. -->
  15016. (I3 ^predict-no N1003 - :O )
  15017. inner elaboration loop at bottom goal.
  15018. --- Change Working Memory (PE) ---
  15019. =>WM: (14076: I3 ^predict-yes N1004)
  15020. <=WM: (14062: N1003 ^status complete)
  15021. <=WM: (14061: I3 ^predict-no N1003)
  15022. --- Firing Productions (IE) For State At Depth 1 ---
  15023. --- Inner Elaboration Phase, active level 1 (S1) ---
  15024. Firing monitor*world
  15025. -->
  15026. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15027. --- Change Working Memory (IE) ---
  15028. --- END Application Phase ---
  15029. --- Output Phase ---
  15030. ENV: Agent did: predict-yes for direction R in state State-A
  15031. In State-A moving R
  15032. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15033. predict error 0
  15034. dir: dir isR
  15035. --- END Output Phase ---
  15036. |\-/--- Input Phase ---
  15037. =>WM: (14080: I2 ^dir R)
  15038. =>WM: (14079: I2 ^reward 1)
  15039. =>WM: (14078: I2 ^see 1)
  15040. =>WM: (14077: N1004 ^status complete)
  15041. <=WM: (14065: I2 ^dir R)
  15042. <=WM: (14064: I2 ^reward 1)
  15043. <=WM: (14063: I2 ^see 0)
  15044. =>WM: (14081: I2 ^level-1 R1-root)
  15045. <=WM: (14066: I2 ^level-1 L0-root)
  15046. --- END Input Phase ---
  15047. --- Proposal Phase ---
  15048. --- Inner Elaboration Phase, active level 1 (S1) ---
  15049. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  15050. -->
  15051. (S1 ^operator O2008 = 0.6006730589861906)
  15052. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  15053. -->
  15054. (S1 ^operator O2007 = 0.1602187148382515)
  15055. Firing prefer*rvt*predict-no*H0*6*H1
  15056. -->
  15057. Firing prefer*rvt*predict-yes*H0*5*H1
  15058. -->
  15059. Firing elaborate*copy-see-to-output-link
  15060. -->
  15061. (I3 ^see 1 +)
  15062. Firing elaborate*reward*based*on*reward
  15063. -->
  15064. (R1008 ^value 1 +)
  15065. (R1 ^reward R1008 +)
  15066. Firing propose*predict-yes
  15067. -->
  15068. (O2009 ^name predict-yes +)
  15069. (S1 ^operator O2009 +)
  15070. Firing propose*predict-no
  15071. -->
  15072. (O2010 ^name predict-no +)
  15073. (S1 ^operator O2010 +)
  15074. Firing rl*prefer*rvt*predict-no*H0*6
  15075. -->
  15076. (S1 ^operator O2008 = 0.3993286818522114)
  15077. Firing rl*prefer*rvt*predict-yes*H0*5
  15078. -->
  15079. (S1 ^operator O2007 = 0.1121052824767091)
  15080. Firing prefer*rvt*predict-yes*H0
  15081. -->
  15082. Firing prefer*rvt*predict-no*H0
  15083. -->
  15084. Firing elaborate*copy-dir-to-output-link
  15085. -->
  15086. (I3 ^dir R +)
  15087. inner elaboration loop at bottom goal.
  15088. Retracting elaborate*copy-see-to-output-link
  15089. -->
  15090. (I3 ^see 0 +)
  15091. Retracting propose*predict-no
  15092. -->
  15093. (O2008 ^name predict-no +)
  15094. (S1 ^operator O2008 +)
  15095. Retracting propose*predict-yes
  15096. -->
  15097. (O2007 ^name predict-yes +)
  15098. (S1 ^operator O2007 +)
  15099. Retracting elaborate*reward*based*on*reward
  15100. -->
  15101. (R1007 ^value 1 +)
  15102. (R1 ^reward R1007 +)
  15103. Retracting elaborate*copy-dir-to-output-link
  15104. -->
  15105. (I3 ^dir R +)
  15106. Retracting rl*prefer*rvt*predict-no*H0*6*H1*19
  15107. -->
  15108. (S1 ^operator O2008 = -0.1957074416057287)
  15109. Retracting rl*prefer*rvt*predict-no*H0*6
  15110. -->
  15111. (S1 ^operator O2008 = 0.3993286818522114)
  15112. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*20
  15113. -->
  15114. (S1 ^operator O2007 = 0.8878843083245797)
  15115. Retracting rl*prefer*rvt*predict-yes*H0*5
  15116. -->
  15117. (S1 ^operator O2007 = 0.1121052824767091)
  15118. =>WM: (14088: S1 ^operator O2010 +)
  15119. =>WM: (14087: S1 ^operator O2009 +)
  15120. =>WM: (14086: O2010 ^name predict-no)
  15121. =>WM: (14085: O2009 ^name predict-yes)
  15122. =>WM: (14084: R1008 ^value 1)
  15123. =>WM: (14083: R1 ^reward R1008)
  15124. =>WM: (14082: I3 ^see 1)
  15125. <=WM: (14073: S1 ^operator O2007 +)
  15126. <=WM: (14075: S1 ^operator O2007)
  15127. <=WM: (14074: S1 ^operator O2008 +)
  15128. <=WM: (14068: R1 ^reward R1007)
  15129. <=WM: (14067: I3 ^see 0)
  15130. <=WM: (14071: O2008 ^name predict-no)
  15131. <=WM: (14070: O2007 ^name predict-yes)
  15132. <=WM: (14069: R1007 ^value 1)
  15133. --- Inner Elaboration Phase, active level 1 (S1) ---
  15134. Firing prefer*rvt*predict-yes*H0
  15135. -->
  15136. Firing rl*prefer*rvt*predict-yes*H0*5
  15137. -->
  15138. (S1 ^operator O2009 = 0.1121052824767091)
  15139. Firing prefer*rvt*predict-yes*H0*5*H1
  15140. -->
  15141. Firing rl*prefer*rvt*predict-yes*H0*5*H1*12
  15142. -->
  15143. (S1 ^operator O2009 = 0.1602187148382515)
  15144. Firing prefer*rvt*predict-no*H0
  15145. -->
  15146. Firing rl*prefer*rvt*predict-no*H0*6
  15147. -->
  15148. (S1 ^operator O2010 = 0.3993286818522114)
  15149. Firing prefer*rvt*predict-no*H0*6*H1
  15150. -->
  15151. Firing rl*prefer*rvt*predict-no*H0*6*H1*11
  15152. -->
  15153. (S1 ^operator O2010 = 0.6006730589861906)
  15154. inner elaboration loop at bottom goal.
  15155. Retracting rl*prefer*rvt*predict-no*H0*6
  15156. -->
  15157. (S1 ^operator O2008 = 0.3993286818522114)
  15158. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  15159. -->
  15160. (S1 ^operator O2008 = 0.6006730589861906)
  15161. Retracting rl*prefer*rvt*predict-yes*H0*5
  15162. -->
  15163. (S1 ^operator O2007 = 0.1121052824767091)
  15164. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  15165. -->
  15166. (S1 ^operator O2007 = 0.1602187148382515)
  15167. --- END Proposal Phase ---
  15168. --- Decision Phase ---
  15169. RL update rl*prefer*rvt*predict-yes*H0*5 0.619028 -0.506923 0.112105 -> 0.61903 -0.506923 0.112107(R,m,v=1,0.902439,0.088583)
  15170. RL update rl*prefer*rvt*predict-yes*H0*5*H1*20 0.380959 0.506925 0.887884 -> 0.380961 0.506925 0.887886(R,m,v=1,1,0)
  15171. =>WM: (14089: S1 ^operator O2010)
  15172. 1005: O: O2010 (predict-no)
  15173. --- END Decision Phase ---
  15174. --- Application Phase ---
  15175. --- Firing Productions (PE) For State At Depth 1 ---
  15176. --- Inner Elaboration Phase, active level 1 (S1) ---
  15177. Firing apply*operator
  15178. -->
  15179. (I3 ^predict-no N1005 + :O )
  15180. Firing apply*operator*complete
  15181. -->
  15182. (I3 ^predict-yes N1004 - :O )
  15183. inner elaboration loop at bottom goal.
  15184. --- Change Working Memory (PE) ---
  15185. =>WM: (14090: I3 ^predict-no N1005)
  15186. <=WM: (14077: N1004 ^status complete)
  15187. <=WM: (14076: I3 ^predict-yes N1004)
  15188. --- Firing Productions (IE) For State At Depth 1 ---
  15189. --- Inner Elaboration Phase, active level 1 (S1) ---
  15190. Firing monitor*world
  15191. -->
  15192. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15193. --- Change Working Memory (IE) ---
  15194. --- END Application Phase ---
  15195. --- Output Phase ---
  15196. ENV: Agent did: predict-no for direction R in state State-B
  15197. In State-B moving R
  15198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15199. predict error 0
  15200. dir: dir isU
  15201. --- END Output Phase ---
  15202. |\---- Input Phase ---
  15203. =>WM: (14094: I2 ^dir U)
  15204. =>WM: (14093: I2 ^reward 1)
  15205. =>WM: (14092: I2 ^see 0)
  15206. =>WM: (14091: N1005 ^status complete)
  15207. <=WM: (14080: I2 ^dir R)
  15208. <=WM: (14079: I2 ^reward 1)
  15209. <=WM: (14078: I2 ^see 1)
  15210. =>WM: (14095: I2 ^level-1 R0-root)
  15211. <=WM: (14081: I2 ^level-1 R1-root)
  15212. --- END Input Phase ---
  15213. --- Proposal Phase ---
  15214. --- Inner Elaboration Phase, active level 1 (S1) ---
  15215. Firing elaborate*copy-see-to-output-link
  15216. -->
  15217. (I3 ^see 0 +)
  15218. Firing elaborate*reward*based*on*reward
  15219. -->
  15220. (R1009 ^value 1 +)
  15221. (R1 ^reward R1009 +)
  15222. Firing propose*predict-yes
  15223. -->
  15224. (O2011 ^name predict-yes +)
  15225. (S1 ^operator O2011 +)
  15226. Firing propose*predict-no
  15227. -->
  15228. (O2012 ^name predict-no +)
  15229. (S1 ^operator O2012 +)
  15230. Firing rl*prefer*rvt*predict-no*H0*4
  15231. -->
  15232. (S1 ^operator O2010 = 0.9999999999999999)
  15233. Firing rl*prefer*rvt*predict-yes*H0*3
  15234. -->
  15235. (S1 ^operator O2009 = 0.)
  15236. Firing prefer*rvt*predict-yes*H0
  15237. -->
  15238. Firing prefer*rvt*predict-no*H0
  15239. -->
  15240. Firing elaborate*copy-dir-to-output-link
  15241. -->
  15242. (I3 ^dir U +)
  15243. inner elaboration loop at bottom goal.
  15244. Retracting elaborate*copy-see-to-output-link
  15245. -->
  15246. (I3 ^see 1 +)
  15247. Retracting propose*predict-no
  15248. -->
  15249. (O2010 ^name predict-no +)
  15250. (S1 ^operator O2010 +)
  15251. Retracting propose*predict-yes
  15252. -->
  15253. (O2009 ^name predict-yes +)
  15254. (S1 ^operator O2009 +)
  15255. Retracting elaborate*reward*based*on*reward
  15256. -->
  15257. (R1008 ^value 1 +)
  15258. (R1 ^reward R1008 +)
  15259. Retracting elaborate*copy-dir-to-output-link
  15260. -->
  15261. (I3 ^dir R +)
  15262. Retracting rl*prefer*rvt*predict-no*H0*6*H1*11
  15263. -->
  15264. (S1 ^operator O2010 = 0.6006730589861906)
  15265. Retracting rl*prefer*rvt*predict-no*H0*6
  15266. -->
  15267. (S1 ^operator O2010 = 0.3993286818522114)
  15268. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*12
  15269. -->
  15270. (S1 ^operator O2009 = 0.1602187148382515)
  15271. Retracting rl*prefer*rvt*predict-yes*H0*5
  15272. -->
  15273. (S1 ^operator O2009 = 0.1121068438565158)
  15274. =>WM: (14103: S1 ^operator O2012 +)
  15275. =>WM: (14102: S1 ^operator O2011 +)
  15276. =>WM: (14101: I3 ^dir U)
  15277. =>WM: (14100: O2012 ^name predict-no)
  15278. =>WM: (14099: O2011 ^name predict-yes)
  15279. =>WM: (14098: R1009 ^value 1)
  15280. =>WM: (14097: R1 ^reward R1009)
  15281. =>WM: (14096: I3 ^see 0)
  15282. <=WM: (14087: S1 ^operator O2009 +)
  15283. <=WM: (14088: S1 ^operator O2010 +)
  15284. <=WM: (14089: S1 ^operator O2010)
  15285. <=WM: (14072: I3 ^dir R)
  15286. <=WM: (14083: R1 ^reward R1008)
  15287. <=WM: (14082: I3 ^see 1)
  15288. <=WM: (14086: O2010 ^name predict-no)
  15289. <=WM: (14085: O2009 ^name predict-yes)
  15290. <=WM: (14084: R1008 ^value 1)
  15291. --- Inner Elaboration Phase, active level 1 (S1) ---
  15292. Firing prefer*rvt*predict-yes*H0
  15293. -->
  15294. Firing rl*prefer*rvt*predict-yes*H0*3
  15295. -->
  15296. (S1 ^operator O2011 = 0.)
  15297. Firing prefer*rvt*predict-no*H0
  15298. -->
  15299. Firing rl*prefer*rvt*predict-no*H0*4
  15300. -->
  15301. (S1 ^operator O2012 = 0.9999999999999999)
  15302. inner elaboration loop at bottom goal.
  15303. Retracting rl*prefer*rvt*predict-no*H0*4
  15304. -->
  15305. (S1 ^operator O2010 = 0.9999999999999999)
  15306. Retracting rl*prefer*rvt*predict-yes*H0*3
  15307. -->
  15308. (S1 ^operator O2009 = 0.)
  15309. --- END Proposal Phase ---
  15310. --- Decision Phase ---
  15311. RL update rl*prefer*rvt*predict-no*H0*6 0.558037 -0.158709 0.399329 -> 0.558037 -0.158709 0.399328(R,m,v=1,0.928994,0.0663567)
  15312. RL update rl*prefer*rvt*predict-no*H0*6*H1*11 0.441964 0.158709 0.600673 -> 0.441964 0.158709 0.600673(R,m,v=1,1,0)
  15313. =>WM: (14104: S1 ^operator O2012)
  15314. 1006: O: O2012 (predict-no)
  15315. --- END Decision Phase ---
  15316. --- Application Phase ---
  15317. --- Firing Productions (PE) For State At Depth 1 ---
  15318. --- Inner Elaboration Phase, active level 1 (S1) ---
  15319. Firing apply*operator
  15320. -->
  15321. (I3 ^predict-no N1006 + :O )
  15322. Firing apply*operator*complete
  15323. -->
  15324. (I3 ^predict-no N1005 - :O )
  15325. inner elaboration loop at bottom goal.
  15326. --- Change Working Memory (PE) ---
  15327. =>WM: (14105: I3 ^predict-no N1006)
  15328. <=WM: (14091: N1005 ^status complete)
  15329. <=WM: (14090: I3 ^predict-no N1005)
  15330. --- Firing Productions (IE) For State At Depth 1 ---
  15331. --- Inner Elaboration Phase, active level 1 (S1) ---
  15332. Firing monitor*world
  15333. -->
  15334. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15335. --- Change Working Memory (IE) ---
  15336. --- END Application Phase ---
  15337. --- Output Phase ---
  15338. ENV: Agent did: predict-no for direction U in state State-B
  15339. In State-B moving U
  15340. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15341. predict error 0
  15342. dir: dir isR
  15343. --- END Output Phase ---
  15344. /|\--- Input Phase ---
  15345. =>WM: (14109: I2 ^dir R)
  15346. =>WM: (14108: I2 ^reward 1)
  15347. =>WM: (14107: I2 ^see 0)
  15348. =>WM: (14106: N1006 ^status complete)
  15349. <=WM: (14094: I2 ^dir U)
  15350. <=WM: (14093: I2 ^reward 1)
  15351. <=WM: (14092: I2 ^see 0)
  15352. =>WM: (14110: I2 ^level-1 R0-root)
  15353. <=WM: (14095: I2 ^level-1 R0-root)
  15354. --- END Input Phase ---
  15355. --- Proposal Phase ---
  15356. --- Inner Elaboration Phase, active level 1 (S1) ---
  15357. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15358. -->
  15359. (S1 ^operator O2011 = 0.08295067548181556)
  15360. Firing rl*prefer*rvt*predict-no*H0*6*H1*13
  15361. -->
  15362. (S1 ^operator O2012 = 0.600662900915969)
  15363. Firing prefer*rvt*predict-no*H0*6*H1
  15364. -->
  15365. Firing prefer*rvt*predict-yes*H0*5*H1
  15366. -->
  15367. Firing elaborate*copy-see-to-output-link
  15368. -->
  15369. (I3 ^see 0 +)
  15370. Firing elaborate*reward*based*on*reward
  15371. -->
  15372. (R1010 ^value 1 +)
  15373. (R1 ^reward R1010 +)
  15374. Firing propose*predict-yes
  15375. -->
  15376. (O2013 ^name predict-yes +)
  15377. (S1 ^operator O2013 +)
  15378. Firing propose*predict-no
  15379. -->
  15380. (O2014 ^name predict-no +)
  15381. (S1 ^operator O2014 +)
  15382. Firing rl*prefer*rvt*predict-no*H0*6
  15383. -->
  15384. (S1 ^operator O2012 = 0.3993284207264511)
  15385. Firing rl*prefer*rvt*predict-yes*H0*5
  15386. -->
  15387. (S1 ^operator O2011 = 0.1121068438565158)
  15388. Firing prefer*rvt*predict-yes*H0
  15389. -->
  15390. Firing prefer*rvt*predict-no*H0
  15391. -->
  15392. Firing elaborate*copy-dir-to-output-link
  15393. -->
  15394. (I3 ^dir R +)
  15395. inner elaboration loop at bottom goal.
  15396. Retracting elaborate*copy-see-to-output-link
  15397. -->
  15398. (I3 ^see 0 +)
  15399. Retracting propose*predict-no
  15400. -->
  15401. (O2012 ^name predict-no +)
  15402. (S1 ^operator O2012 +)
  15403. Retracting propose*predict-yes
  15404. -->
  15405. (O2011 ^name predict-yes +)
  15406. (S1 ^operator O2011 +)
  15407. Retracting elaborate*reward*based*on*reward
  15408. -->
  15409. (R1009 ^value 1 +)
  15410. (R1 ^reward R1009 +)
  15411. Retracting elaborate*copy-dir-to-output-link
  15412. -->
  15413. (I3 ^dir U +)
  15414. Retracting rl*prefer*rvt*predict-no*H0*4
  15415. -->
  15416. (S1 ^operator O2012 = 0.9999999999999999)
  15417. Retracting rl*prefer*rvt*predict-yes*H0*3
  15418. -->
  15419. (S1 ^operator O2011 = 0.)
  15420. =>WM: (14117: S1 ^operator O2014 +)
  15421. =>WM: (14116: S1 ^operator O2013 +)
  15422. =>WM: (14115: I3 ^dir R)
  15423. =>WM: (14114: O2014 ^name predict-no)
  15424. =>WM: (14113: O2013 ^name predict-yes)
  15425. =>WM: (14112: R1010 ^value 1)
  15426. =>WM: (14111: R1 ^reward R1010)
  15427. <=WM: (14102: S1 ^operator O2011 +)
  15428. <=WM: (14103: S1 ^operator O2012 +)
  15429. <=WM: (14104: S1 ^operator O2012)
  15430. <=WM: (14101: I3 ^dir U)
  15431. <=WM: (14097: R1 ^reward R1009)
  15432. <=WM: (14100: O2012 ^name predict-no)
  15433. <=WM: (14099: O2011 ^name predict-yes)
  15434. <=WM: (14098: R1009 ^value 1)
  15435. --- Inner Elaboration Phase, active level 1 (S1) ---
  15436. Firing prefer*rvt*predict-yes*H0
  15437. -->
  15438. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15439. -->
  15440. (S1 ^operator O2013 = 0.08295067548181556)
  15441. Firing rl*prefer*rvt*predict-yes*H0*5
  15442. -->
  15443. (S1 ^operator O2013 = 0.1121068438565158)
  15444. Firing prefer*rvt*predict-yes*H0*5*H1
  15445. -->
  15446. Firing prefer*rvt*predict-no*H0
  15447. -->
  15448. Firing rl*prefer*rvt*predict-no*H0*6*H1*13
  15449. -->
  15450. (S1 ^operator O2014 = 0.600662900915969)
  15451. Firing rl*prefer*rvt*predict-no*H0*6
  15452. -->
  15453. (S1 ^operator O2014 = 0.3993284207264511)
  15454. Firing prefer*rvt*predict-no*H0*6*H1
  15455. -->
  15456. inner elaboration loop at bottom goal.
  15457. Retracting rl*prefer*rvt*predict-no*H0*6
  15458. -->
  15459. (S1 ^operator O2012 = 0.3993284207264511)
  15460. Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
  15461. -->
  15462. (S1 ^operator O2012 = 0.600662900915969)
  15463. Retracting rl*prefer*rvt*predict-yes*H0*5
  15464. -->
  15465. (S1 ^operator O2011 = 0.1121068438565158)
  15466. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  15467. -->
  15468. (S1 ^operator O2011 = 0.08295067548181556)
  15469. --- END Proposal Phase ---
  15470. --- Decision Phase ---
  15471. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15472. =>WM: (14118: S1 ^operator O2014)
  15473. 1007: O: O2014 (predict-no)
  15474. --- END Decision Phase ---
  15475. --- Application Phase ---
  15476. --- Firing Productions (PE) For State At Depth 1 ---
  15477. --- Inner Elaboration Phase, active level 1 (S1) ---
  15478. Firing apply*operator
  15479. -->
  15480. (I3 ^predict-no N1007 + :O )
  15481. Firing apply*operator*complete
  15482. -->
  15483. (I3 ^predict-no N1006 - :O )
  15484. inner elaboration loop at bottom goal.
  15485. --- Change Working Memory (PE) ---
  15486. =>WM: (14119: I3 ^predict-no N1007)
  15487. <=WM: (14106: N1006 ^status complete)
  15488. <=WM: (14105: I3 ^predict-no N1006)
  15489. --- Firing Productions (IE) For State At Depth 1 ---
  15490. --- Inner Elaboration Phase, active level 1 (S1) ---
  15491. Firing monitor*world
  15492. -->
  15493. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15494. --- Change Working Memory (IE) ---
  15495. --- END Application Phase ---
  15496. --- Output Phase ---
  15497. ENV: Agent did: predict-no for direction R in state State-B
  15498. In State-B moving R
  15499. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15500. predict error 0
  15501. dir: dir isR
  15502. --- END Output Phase ---
  15503. -/|--- Input Phase ---
  15504. =>WM: (14123: I2 ^dir R)
  15505. =>WM: (14122: I2 ^reward 1)
  15506. =>WM: (14121: I2 ^see 0)
  15507. =>WM: (14120: N1007 ^status complete)
  15508. <=WM: (14109: I2 ^dir R)
  15509. <=WM: (14108: I2 ^reward 1)
  15510. <=WM: (14107: I2 ^see 0)
  15511. =>WM: (14124: I2 ^level-1 R0-root)
  15512. <=WM: (14110: I2 ^level-1 R0-root)
  15513. --- END Input Phase ---
  15514. --- Proposal Phase ---
  15515. --- Inner Elaboration Phase, active level 1 (S1) ---
  15516. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15517. -->
  15518. (S1 ^operator O2013 = 0.08295067548181556)
  15519. Firing rl*prefer*rvt*predict-no*H0*6*H1*13
  15520. -->
  15521. (S1 ^operator O2014 = 0.600662900915969)
  15522. Firing prefer*rvt*predict-no*H0*6*H1
  15523. -->
  15524. Firing prefer*rvt*predict-yes*H0*5*H1
  15525. -->
  15526. Firing elaborate*copy-see-to-output-link
  15527. -->
  15528. (I3 ^see 0 +)
  15529. Firing elaborate*reward*based*on*reward
  15530. -->
  15531. (R1011 ^value 1 +)
  15532. (R1 ^reward R1011 +)
  15533. Firing propose*predict-yes
  15534. -->
  15535. (O2015 ^name predict-yes +)
  15536. (S1 ^operator O2015 +)
  15537. Firing propose*predict-no
  15538. -->
  15539. (O2016 ^name predict-no +)
  15540. (S1 ^operator O2016 +)
  15541. Firing rl*prefer*rvt*predict-no*H0*6
  15542. -->
  15543. (S1 ^operator O2014 = 0.3993284207264511)
  15544. Firing rl*prefer*rvt*predict-yes*H0*5
  15545. -->
  15546. (S1 ^operator O2013 = 0.1121068438565158)
  15547. Firing prefer*rvt*predict-yes*H0
  15548. -->
  15549. Firing prefer*rvt*predict-no*H0
  15550. -->
  15551. Firing elaborate*copy-dir-to-output-link
  15552. -->
  15553. (I3 ^dir R +)
  15554. inner elaboration loop at bottom goal.
  15555. Retracting elaborate*copy-see-to-output-link
  15556. -->
  15557. (I3 ^see 0 +)
  15558. Retracting propose*predict-no
  15559. -->
  15560. (O2014 ^name predict-no +)
  15561. (S1 ^operator O2014 +)
  15562. Retracting propose*predict-yes
  15563. -->
  15564. (O2013 ^name predict-yes +)
  15565. (S1 ^operator O2013 +)
  15566. Retracting elaborate*reward*based*on*reward
  15567. -->
  15568. (R1010 ^value 1 +)
  15569. (R1 ^reward R1010 +)
  15570. Retracting elaborate*copy-dir-to-output-link
  15571. -->
  15572. (I3 ^dir R +)
  15573. Retracting rl*prefer*rvt*predict-no*H0*6
  15574. -->
  15575. (S1 ^operator O2014 = 0.3993284207264511)
  15576. Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
  15577. -->
  15578. (S1 ^operator O2014 = 0.600662900915969)
  15579. Retracting rl*prefer*rvt*predict-yes*H0*5
  15580. -->
  15581. (S1 ^operator O2013 = 0.1121068438565158)
  15582. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  15583. -->
  15584. (S1 ^operator O2013 = 0.08295067548181556)
  15585. =>WM: (14130: S1 ^operator O2016 +)
  15586. =>WM: (14129: S1 ^operator O2015 +)
  15587. =>WM: (14128: O2016 ^name predict-no)
  15588. =>WM: (14127: O2015 ^name predict-yes)
  15589. =>WM: (14126: R1011 ^value 1)
  15590. =>WM: (14125: R1 ^reward R1011)
  15591. <=WM: (14116: S1 ^operator O2013 +)
  15592. <=WM: (14117: S1 ^operator O2014 +)
  15593. <=WM: (14118: S1 ^operator O2014)
  15594. <=WM: (14111: R1 ^reward R1010)
  15595. <=WM: (14114: O2014 ^name predict-no)
  15596. <=WM: (14113: O2013 ^name predict-yes)
  15597. <=WM: (14112: R1010 ^value 1)
  15598. --- Inner Elaboration Phase, active level 1 (S1) ---
  15599. Firing prefer*rvt*predict-yes*H0
  15600. -->
  15601. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15602. -->
  15603. (S1 ^operator O2015 = 0.08295067548181556)
  15604. Firing rl*prefer*rvt*predict-yes*H0*5
  15605. -->
  15606. (S1 ^operator O2015 = 0.1121068438565158)
  15607. Firing prefer*rvt*predict-yes*H0*5*H1
  15608. -->
  15609. Firing prefer*rvt*predict-no*H0
  15610. -->
  15611. Firing rl*prefer*rvt*predict-no*H0*6*H1*13
  15612. -->
  15613. (S1 ^operator O2016 = 0.600662900915969)
  15614. Firing rl*prefer*rvt*predict-no*H0*6
  15615. -->
  15616. (S1 ^operator O2016 = 0.3993284207264511)
  15617. Firing prefer*rvt*predict-no*H0*6*H1
  15618. -->
  15619. inner elaboration loop at bottom goal.
  15620. Retracting rl*prefer*rvt*predict-no*H0*6
  15621. -->
  15622. (S1 ^operator O2014 = 0.3993284207264511)
  15623. Retracting rl*prefer*rvt*predict-no*H0*6*H1*13
  15624. -->
  15625. (S1 ^operator O2014 = 0.600662900915969)
  15626. Retracting rl*prefer*rvt*predict-yes*H0*5
  15627. -->
  15628. (S1 ^operator O2013 = 0.1121068438565158)
  15629. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  15630. -->
  15631. (S1 ^operator O2013 = 0.08295067548181556)
  15632. --- END Proposal Phase ---
  15633. --- Decision Phase ---
  15634. RL update rl*prefer*rvt*predict-no*H0*6 0.558037 -0.158709 0.399328 -> 0.558038 -0.158709 0.39933(R,m,v=1,0.929412,0.0659937)
  15635. RL update rl*prefer*rvt*predict-no*H0*6*H1*13 0.441955 0.158708 0.600663 -> 0.441956 0.158708 0.600664(R,m,v=1,1,0)
  15636. =>WM: (14131: S1 ^operator O2016)
  15637. 1008: O: O2016 (predict-no)
  15638. --- END Decision Phase ---
  15639. --- Application Phase ---
  15640. --- Firing Productions (PE) For State At Depth 1 ---
  15641. --- Inner Elaboration Phase, active level 1 (S1) ---
  15642. Firing apply*operator
  15643. -->
  15644. (I3 ^predict-no N1008 + :O )
  15645. Firing apply*operator*complete
  15646. -->
  15647. (I3 ^predict-no N1007 - :O )
  15648. inner elaboration loop at bottom goal.
  15649. --- Change Working Memory (PE) ---
  15650. =>WM: (14132: I3 ^predict-no N1008)
  15651. <=WM: (14120: N1007 ^status complete)
  15652. <=WM: (14119: I3 ^predict-no N1007)
  15653. --- Firing Productions (IE) For State At Depth 1 ---
  15654. --- Inner Elaboration Phase, active level 1 (S1) ---
  15655. Firing monitor*world
  15656. -->
  15657. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15658. --- Change Working Memory (IE) ---
  15659. --- END Application Phase ---
  15660. --- Output Phase ---
  15661. ENV: Agent did: predict-no for direction R in state State-B
  15662. In State-B moving R
  15663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15664. predict error 0
  15665. dir: dir isL
  15666. --- END Output Phase ---
  15667. \-/--- Input Phase ---
  15668. =>WM: (14136: I2 ^dir L)
  15669. =>WM: (14135: I2 ^reward 1)
  15670. =>WM: (14134: I2 ^see 0)
  15671. =>WM: (14133: N1008 ^status complete)
  15672. <=WM: (14123: I2 ^dir R)
  15673. <=WM: (14122: I2 ^reward 1)
  15674. <=WM: (14121: I2