/flipv2/20121112-100543-2.5K-ReLST-Wallace/stdout-flip-2.5K_0.txt

https://bitbucket.org/evan13579b/soar-ziggurat
Possible License(s): BSD-3-Clause
  1. Seeding... 0
  2. dir: dir isU
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 0 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_0.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-/|sleeping...
  20. \-/|\-/sleeping...
  21. |1: O: O1 (predict-yes)
  22. I see 0 and I'm going to do: predict-yes
  23. ENV: Agent did: predict-yes for direction U in state State-A
  24. In State-A moving U
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  26. predict error 1
  27. dir: dir isU
  28. rule alias: '*'
  29. rule alias: '*'
  30. \-/|\-/|2: O: O4 (predict-no)
  31. I see 0 and I'm going to do: predict-no
  32. ENV: Agent did: predict-no for direction U in state State-A
  33. In State-A moving U
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  35. predict error 0
  36. dir: dir isR
  37. \-/3: O: O5 (predict-yes)
  38. I see 1 and I'm going to do: predict-yes
  39. ENV: Agent did: predict-yes for direction R in state State-A
  40. In State-A moving R
  41. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  42. predict error 0
  43. dir: dir isL
  44. |\4: O: O7 (predict-yes)
  45. I see 1 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction L in state State-B
  47. In State-B moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  49. predict error 0
  50. dir: dir isR
  51. -/|5: O: O9 (predict-yes)
  52. I see 1 and I'm going to do: predict-yes
  53. ENV: Agent did: predict-yes for direction R in state State-A
  54. In State-A moving R
  55. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  56. predict error 0
  57. dir: dir isR
  58. \-/6: O: O11 (predict-yes)
  59. I see 1 and I'm going to do: predict-yes
  60. ENV: Agent did: predict-yes for direction R in state State-B
  61. In State-B moving R
  62. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  63. predict error 1
  64. dir: dir isU
  65. |\-/sleeping...
  66. |7: O: O14 (predict-no)
  67. I see 0 and I'm going to do: predict-no
  68. ENV: Agent did: predict-no for direction U in state State-B
  69. In State-B moving U
  70. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  71. predict error 0
  72. dir: dir isL
  73. \-8: O: O15 (predict-yes)
  74. I see 1 and I'm going to do: predict-yes
  75. ENV: Agent did: predict-yes for direction L in state State-B
  76. In State-B moving L
  77. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  78. predict error 0
  79. dir: dir isR
  80. /|9: O: O17 (predict-yes)
  81. I see 1 and I'm going to do: predict-yes
  82. ENV: Agent did: predict-yes for direction R in state State-A
  83. In State-A moving R
  84. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  85. predict error 0
  86. dir: dir isR
  87. \-/10: O: O19 (predict-yes)
  88. I see 1 and I'm going to do: predict-yes
  89. ENV: Agent did: predict-yes for direction R in state State-B
  90. In State-B moving R
  91. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  92. predict error 1
  93. dir: dir isU
  94. |\-11: O: O22 (predict-no)
  95. I see 0 and I'm going to do: predict-no
  96. ENV: Agent did: predict-no for direction U in state State-B
  97. In State-B moving U
  98. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  99. predict error 0
  100. dir: dir isR
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. rule alias: '*'
  105. /12: O: O24 (predict-no)
  106. I see 1 and I'm going to do: predict-no
  107. ENV: Agent did: predict-no for direction R in state State-B
  108. In State-B moving R
  109. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  110. predict error 0
  111. dir: dir isL
  112. |\13: O: O26 (predict-no)
  113. I see 1 and I'm going to do: predict-no
  114. ENV: Agent did: predict-no for direction L in state State-B
  115. In State-B moving L
  116. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  117. predict error 1
  118. dir: dir isU
  119. -/|\14: O: O28 (predict-no)
  120. I see 0 and I'm going to do: predict-no
  121. ENV: Agent did: predict-no for direction U in state State-A
  122. In State-A moving U
  123. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  124. predict error 0
  125. dir: dir isR
  126. -/15: O: O30 (predict-no)
  127. I see 1 and I'm going to do: predict-no
  128. ENV: Agent did: predict-no for direction R in state State-A
  129. In State-A moving R
  130. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  131. predict error 1
  132. dir: dir isL
  133. |\-16: O: O31 (predict-yes)
  134. I see 0 and I'm going to do: predict-yes
  135. ENV: Agent did: predict-yes for direction L in state State-B
  136. In State-B moving L
  137. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  138. predict error 0
  139. dir: dir isU
  140. /|\17: O: O34 (predict-no)
  141. I see 1 and I'm going to do: predict-no
  142. ENV: Agent did: predict-no for direction U in state State-A
  143. In State-A moving U
  144. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  145. predict error 0
  146. dir: dir isU
  147. -/|18: O: O36 (predict-no)
  148. I see 1 and I'm going to do: predict-no
  149. ENV: Agent did: predict-no for direction U in state State-A
  150. In State-A moving U
  151. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  152. predict error 0
  153. dir: dir isU
  154. \-/19: O: O38 (predict-no)
  155. I see 1 and I'm going to do: predict-no
  156. ENV: Agent did: predict-no for direction U in state State-A
  157. In State-A moving U
  158. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  159. predict error 0
  160. dir: dir isU
  161. |\-20: O: O40 (predict-no)
  162. I see 1 and I'm going to do: predict-no
  163. ENV: Agent did: predict-no for direction U in state State-A
  164. In State-A moving U
  165. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  166. predict error 0
  167. dir: dir isL
  168. /|\-21: O: O41 (predict-yes)
  169. I see 1 and I'm going to do: predict-yes
  170. ENV: Agent did: predict-yes for direction L in state State-A
  171. In State-A moving L
  172. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  173. predict error 1
  174. dir: dir isU
  175. /22: O: O44 (predict-no)
  176. I see 0 and I'm going to do: predict-no
  177. ENV: Agent did: predict-no for direction U in state State-A
  178. In State-A moving U
  179. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  180. predict error 0
  181. dir: dir isU
  182. |\-23: O: O46 (predict-no)
  183. I see 1 and I'm going to do: predict-no
  184. ENV: Agent did: predict-no for direction U in state State-A
  185. In State-A moving U
  186. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  187. predict error 0
  188. dir: dir isU
  189. /|\24: O: O48 (predict-no)
  190. I see 1 and I'm going to do: predict-no
  191. ENV: Agent did: predict-no for direction U in state State-A
  192. In State-A moving U
  193. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  194. predict error 0
  195. dir: dir isR
  196. -/25: O: O50 (predict-no)
  197. I see 1 and I'm going to do: predict-no
  198. ENV: Agent did: predict-no for direction R in state State-A
  199. In State-A moving R
  200. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  201. predict error 1
  202. dir: dir isL
  203. |\-26: O: O51 (predict-yes)
  204. I see 0 and I'm going to do: predict-yes
  205. ENV: Agent did: predict-yes for direction L in state State-B
  206. In State-B moving L
  207. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  208. predict error 0
  209. dir: dir isR
  210. /|27: O: O53 (predict-yes)
  211. I see 1 and I'm going to do: predict-yes
  212. ENV: Agent did: predict-yes for direction R in state State-A
  213. In State-A moving R
  214. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  215. predict error 0
  216. dir: dir isR
  217. \-/28: O: O55 (predict-yes)
  218. I see 1 and I'm going to do: predict-yes
  219. ENV: Agent did: predict-yes for direction R in state State-B
  220. In State-B moving R
  221. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  222. predict error 1
  223. dir: dir isU
  224. |\-29: O: O57 (predict-yes)
  225. I see 0 and I'm going to do: predict-yes
  226. ENV: Agent did: predict-yes for direction U in state State-B
  227. In State-B moving U
  228. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  229. predict error 1
  230. dir: dir isU
  231. /|30: O: O60 (predict-no)
  232. I see 0 and I'm going to do: predict-no
  233. ENV: Agent did: predict-no for direction U in state State-B
  234. In State-B moving U
  235. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  236. predict error 0
  237. dir: dir isR
  238. \-/31: O: O61 (predict-yes)
  239. I see 1 and I'm going to do: predict-yes
  240. ENV: Agent did: predict-yes for direction R in state State-B
  241. In State-B moving R
  242. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  243. predict error 1
  244. dir: dir isU
  245. |32: O: O64 (predict-no)
  246. I see 0 and I'm going to do: predict-no
  247. ENV: Agent did: predict-no for direction U in state State-B
  248. In State-B moving U
  249. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  250. predict error 0
  251. dir: dir isL
  252. \-33: O: O65 (predict-yes)
  253. I see 1 and I'm going to do: predict-yes
  254. ENV: Agent did: predict-yes for direction L in state State-B
  255. In State-B moving L
  256. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  257. predict error 0
  258. dir: dir isU
  259. /|34: O: O68 (predict-no)
  260. I see 1 and I'm going to do: predict-no
  261. ENV: Agent did: predict-no for direction U in state State-A
  262. In State-A moving U
  263. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  264. predict error 0
  265. dir: dir isR
  266. \-/35: O: O69 (predict-yes)
  267. I see 1 and I'm going to do: predict-yes
  268. ENV: Agent did: predict-yes for direction R in state State-A
  269. In State-A moving R
  270. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  271. predict error 0
  272. dir: dir isL
  273. |\36: O: O71 (predict-yes)
  274. I see 1 and I'm going to do: predict-yes
  275. ENV: Agent did: predict-yes for direction L in state State-B
  276. In State-B moving L
  277. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  278. predict error 0
  279. dir: dir isU
  280. -/37: O: O74 (predict-no)
  281. I see 1 and I'm going to do: predict-no
  282. ENV: Agent did: predict-no for direction U in state State-A
  283. In State-A moving U
  284. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  285. predict error 0
  286. dir: dir isR
  287. |\38: O: O75 (predict-yes)
  288. I see 1 and I'm going to do: predict-yes
  289. ENV: Agent did: predict-yes for direction R in state State-A
  290. In State-A moving R
  291. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  292. predict error 0
  293. dir: dir isU
  294. -/39: O: O77 (predict-yes)
  295. I see 1 and I'm going to do: predict-yes
  296. ENV: Agent did: predict-yes for direction U in state State-B
  297. In State-B moving U
  298. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  299. predict error 1
  300. dir: dir isU
  301. |\-40: O: O80 (predict-no)
  302. I see 0 and I'm going to do: predict-no
  303. ENV: Agent did: predict-no for direction U in state State-B
  304. In State-B moving U
  305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  306. predict error 0
  307. dir: dir isL
  308. /|\41: O: O81 (predict-yes)
  309. I see 1 and I'm going to do: predict-yes
  310. ENV: Agent did: predict-yes for direction L in state State-B
  311. In State-B moving L
  312. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  313. predict error 0
  314. dir: dir isR
  315. -42: O: O83 (predict-yes)
  316. I see 1 and I'm going to do: predict-yes
  317. ENV: Agent did: predict-yes for direction R in state State-A
  318. In State-A moving R
  319. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  320. predict error 0
  321. dir: dir isU
  322. /|\43: O: O86 (predict-no)
  323. I see 1 and I'm going to do: predict-no
  324. ENV: Agent did: predict-no for direction U in state State-B
  325. In State-B moving U
  326. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  327. predict error 0
  328. dir: dir isL
  329. -/44: O: O87 (predict-yes)
  330. I see 1 and I'm going to do: predict-yes
  331. ENV: Agent did: predict-yes for direction L in state State-B
  332. In State-B moving L
  333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  334. predict error 0
  335. dir: dir isL
  336. |\45: O: O89 (predict-yes)
  337. I see 1 and I'm going to do: predict-yes
  338. ENV: Agent did: predict-yes for direction L in state State-A
  339. In State-A moving L
  340. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  341. predict error 1
  342. dir: dir isU
  343. -/|46: O: O92 (predict-no)
  344. I see 0 and I'm going to do: predict-no
  345. ENV: Agent did: predict-no for direction U in state State-A
  346. In State-A moving U
  347. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  348. predict error 0
  349. dir: dir isL
  350. \-/47: O: O93 (predict-yes)
  351. I see 1 and I'm going to do: predict-yes
  352. ENV: Agent did: predict-yes for direction L in state State-A
  353. In State-A moving L
  354. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  355. predict error 1
  356. dir: dir isR
  357. |\48: O: O96 (predict-no)
  358. I see 0 and I'm going to do: predict-no
  359. ENV: Agent did: predict-no for direction R in state State-A
  360. In State-A moving R
  361. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  362. predict error 1
  363. dir: dir isL
  364. -/|49: O: O97 (predict-yes)
  365. I see 0 and I'm going to do: predict-yes
  366. ENV: Agent did: predict-yes for direction L in state State-B
  367. In State-B moving L
  368. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  369. predict error 0
  370. dir: dir isU
  371. \-/50: O: O100 (predict-no)
  372. I see 1 and I'm going to do: predict-no
  373. ENV: Agent did: predict-no for direction U in state State-A
  374. In State-A moving U
  375. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  376. predict error 0
  377. dir: dir isU
  378. |\-/|\sleeping...
  379. -sleeping...
  380. /sleeping...
  381. |sleeping...
  382. \sleeping...
  383. -sleeping...
  384. /sleeping...
  385. |sleeping...
  386. \sleeping...
  387. -sleeping...
  388. /sleeping...
  389. |sleeping...
  390. \sleeping...
  391. -sleeping...
  392. /sleeping...
  393. |sleeping...
  394. \sleeping...
  395. -sleeping...
  396. /sleeping...
  397. |sleeping...
  398. \sleeping...
  399. -sleeping...
  400. /sleeping...
  401. |sleeping...
  402. \sleeping...
  403. -sleeping...
  404. /sleeping...
  405. |sleeping...
  406. \sleeping...
  407. -sleeping...
  408. /sleeping...
  409. |sleeping...
  410. \sleeping...
  411. -sleeping...
  412. /sleeping...
  413. |sleeping...
  414. \sleeping...
  415. -sleeping...
  416. /sleeping...
  417. |sleeping...
  418. \sleeping...
  419. -sleeping...
  420. /sleeping...
  421. |sleeping...
  422. \sleeping...
  423. -sleeping...
  424. /sleeping...
  425. |sleeping...
  426. \sleeping...
  427. -sleeping...
  428. /sleeping...
  429. |sleeping...
  430. \sleeping...
  431. -sleeping...
  432. /sleeping...
  433. |sleeping...
  434. \sleeping...
  435. -51: O: O102 (predict-no)
  436. I see 1 and I'm going to do: predict-no
  437. ENV: Agent did: predict-no for direction U in state State-A
  438. In State-A moving U
  439. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  440. predict error 0
  441. dir: dir isR
  442. /52: O: O104 (predict-no)
  443. I see 1 and I'm going to do: predict-no
  444. ENV: Agent did: predict-no for direction R in state State-A
  445. In State-A moving R
  446. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  447. predict error 1
  448. dir: dir isL
  449. |\-53: O: O106 (predict-no)
  450. I see 0 and I'm going to do: predict-no
  451. ENV: Agent did: predict-no for direction L in state State-B
  452. In State-B moving L
  453. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  454. predict error 1
  455. dir: dir isL
  456. /|\54: O: O107 (predict-yes)
  457. I see 0 and I'm going to do: predict-yes
  458. ENV: Agent did: predict-yes for direction L in state State-A
  459. In State-A moving L
  460. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  461. predict error 1
  462. dir: dir isR
  463. -/|55: O: O109 (predict-yes)
  464. I see 0 and I'm going to do: predict-yes
  465. ENV: Agent did: predict-yes for direction R in state State-A
  466. In State-A moving R
  467. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  468. predict error 0
  469. dir: dir isU
  470. \-56: O: O112 (predict-no)
  471. I see 1 and I'm going to do: predict-no
  472. ENV: Agent did: predict-no for direction U in state State-B
  473. In State-B moving U
  474. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  475. predict error 0
  476. dir: dir isL
  477. /|\57: O: O114 (predict-no)
  478. I see 1 and I'm going to do: predict-no
  479. ENV: Agent did: predict-no for direction L in state State-B
  480. In State-B moving L
  481. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  482. predict error 1
  483. dir: dir isR
  484. -/58: O: O115 (predict-yes)
  485. I see 0 and I'm going to do: predict-yes
  486. ENV: Agent did: predict-yes for direction R in state State-A
  487. In State-A moving R
  488. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  489. predict error 0
  490. dir: dir isU
  491. |\-59: O: O118 (predict-no)
  492. I see 1 and I'm going to do: predict-no
  493. ENV: Agent did: predict-no for direction U in state State-B
  494. In State-B moving U
  495. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  496. predict error 0
  497. dir: dir isR
  498. /|\60: O: O119 (predict-yes)
  499. I see 1 and I'm going to do: predict-yes
  500. ENV: Agent did: predict-yes for direction R in state State-B
  501. In State-B moving R
  502. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  503. predict error 1
  504. dir: dir isU
  505. -/61: O: O122 (predict-no)
  506. I see 0 and I'm going to do: predict-no
  507. ENV: Agent did: predict-no for direction U in state State-B
  508. In State-B moving U
  509. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  510. predict error 0
  511. dir: dir isR
  512. rule alias: '*'
  513. rule alias: '*'
  514. rule alias: '*'
  515. rule alias: '*'
  516. rule alias: '*'
  517. rule alias: '*'
  518. rule alias: '*'
  519. rule alias: '*'
  520. rule alias: '*'
  521. rule alias: '*'
  522. rule alias: '*'
  523. |62: O: O123 (predict-yes)
  524. I see 1 and I'm going to do: predict-yes
  525. ENV: Agent did: predict-yes for direction R in state State-B
  526. In State-B moving R
  527. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  528. predict error 1
  529. dir: dir isU
  530. \-63: O: O126 (predict-no)
  531. I see 0 and I'm going to do: predict-no
  532. ENV: Agent did: predict-no for direction U in state State-B
  533. In State-B moving U
  534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  535. predict error 0
  536. dir: dir isR
  537. /|64: O: O127 (predict-yes)
  538. I see 1 and I'm going to do: predict-yes
  539. ENV: Agent did: predict-yes for direction R in state State-B
  540. In State-B moving R
  541. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  542. predict error 1
  543. dir: dir isR
  544. \-65: O: O129 (predict-yes)
  545. I see 0 and I'm going to do: predict-yes
  546. ENV: Agent did: predict-yes for direction R in state State-B
  547. In State-B moving R
  548. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  549. predict error 1
  550. dir: dir isR
  551. /|\66: O: O131 (predict-yes)
  552. I see 0 and I'm going to do: predict-yes
  553. ENV: Agent did: predict-yes for direction R in state State-B
  554. In State-B moving R
  555. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  556. predict error 1
  557. dir: dir isR
  558. -/|\67: O: O133 (predict-yes)
  559. I see 0 and I'm going to do: predict-yes
  560. ENV: Agent did: predict-yes for direction R in state State-B
  561. In State-B moving R
  562. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  563. predict error 1
  564. dir: dir isR
  565. -/68: O: O135 (predict-yes)
  566. I see 0 and I'm going to do: predict-yes
  567. ENV: Agent did: predict-yes for direction R in state State-B
  568. In State-B moving R
  569. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  570. predict error 1
  571. dir: dir isR
  572. |\69: O: O138 (predict-no)
  573. I see 0 and I'm going to do: predict-no
  574. ENV: Agent did: predict-no for direction R in state State-B
  575. In State-B moving R
  576. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  577. predict error 0
  578. dir: dir isL
  579. -/|\70: O: O139 (predict-yes)
  580. I see 1 and I'm going to do: predict-yes
  581. ENV: Agent did: predict-yes for direction L in state State-B
  582. In State-B moving L
  583. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  584. predict error 0
  585. dir: dir isL
  586. -/71: O: O141 (predict-yes)
  587. I see 1 and I'm going to do: predict-yes
  588. ENV: Agent did: predict-yes for direction L in state State-A
  589. In State-A moving L
  590. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  591. predict error 1
  592. dir: dir isL
  593. rule alias: '*'
  594. rule alias: '*'
  595. rule alias: '*'
  596. rule alias: '*'
  597. rule alias: '*'
  598. |72: O: O143 (predict-yes)
  599. I see 0 and I'm going to do: predict-yes
  600. ENV: Agent did: predict-yes for direction L in state State-A
  601. In State-A moving L
  602. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  603. predict error 1
  604. dir: dir isR
  605. \-/73: O: O146 (predict-no)
  606. I see 0 and I'm going to do: predict-no
  607. ENV: Agent did: predict-no for direction R in state State-A
  608. In State-A moving R
  609. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  610. predict error 1
  611. dir: dir isR
  612. |\-74: O: O147 (predict-yes)
  613. I see 0 and I'm going to do: predict-yes
  614. ENV: Agent did: predict-yes for direction R in state State-B
  615. In State-B moving R
  616. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  617. predict error 1
  618. dir: dir isR
  619. /|\75: O: O150 (predict-no)
  620. I see 0 and I'm going to do: predict-no
  621. ENV: Agent did: predict-no for direction R in state State-B
  622. In State-B moving R
  623. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  624. predict error 0
  625. dir: dir isL
  626. -/|76: O: O151 (predict-yes)
  627. I see 1 and I'm going to do: predict-yes
  628. ENV: Agent did: predict-yes for direction L in state State-B
  629. In State-B moving L
  630. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  631. predict error 0
  632. dir: dir isU
  633. \-/77: O: O154 (predict-no)
  634. I see 1 and I'm going to do: predict-no
  635. ENV: Agent did: predict-no for direction U in state State-A
  636. In State-A moving U
  637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  638. predict error 0
  639. dir: dir isU
  640. |\78: O: O156 (predict-no)
  641. I see 1 and I'm going to do: predict-no
  642. ENV: Agent did: predict-no for direction U in state State-A
  643. In State-A moving U
  644. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  645. predict error 0
  646. dir: dir isU
  647. -/|79: O: O158 (predict-no)
  648. I see 1 and I'm going to do: predict-no
  649. ENV: Agent did: predict-no for direction U in state State-A
  650. In State-A moving U
  651. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  652. predict error 0
  653. dir: dir isU
  654. \-80: O: O160 (predict-no)
  655. I see 1 and I'm going to do: predict-no
  656. ENV: Agent did: predict-no for direction U in state State-A
  657. In State-A moving U
  658. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  659. predict error 0
  660. dir: dir isU
  661. /|81: O: O162 (predict-no)
  662. I see 1 and I'm going to do: predict-no
  663. ENV: Agent did: predict-no for direction U in state State-A
  664. In State-A moving U
  665. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  666. predict error 0
  667. dir: dir isU
  668. rule alias: '*'
  669. rule alias: '*'
  670. rule alias: '*'
  671. \82: O: O164 (predict-no)
  672. I see 1 and I'm going to do: predict-no
  673. ENV: Agent did: predict-no for direction U in state State-A
  674. In State-A moving U
  675. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  676. predict error 0
  677. dir: dir isR
  678. -/|83: O: O165 (predict-yes)
  679. I see 1 and I'm going to do: predict-yes
  680. ENV: Agent did: predict-yes for direction R in state State-A
  681. In State-A moving R
  682. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  683. predict error 0
  684. dir: dir isR
  685. \-/84: O: O168 (predict-no)
  686. I see 1 and I'm going to do: predict-no
  687. ENV: Agent did: predict-no for direction R in state State-B
  688. In State-B moving R
  689. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  690. predict error 0
  691. dir: dir isU
  692. |\-85: O: O169 (predict-yes)
  693. I see 1 and I'm going to do: predict-yes
  694. ENV: Agent did: predict-yes for direction U in state State-B
  695. In State-B moving U
  696. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  697. predict error 1
  698. dir: dir isL
  699. /|\86: O: O172 (predict-no)
  700. I see 0 and I'm going to do: predict-no
  701. ENV: Agent did: predict-no for direction L in state State-B
  702. In State-B moving L
  703. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  704. predict error 1
  705. dir: dir isU
  706. -/|87: O: O174 (predict-no)
  707. I see 0 and I'm going to do: predict-no
  708. ENV: Agent did: predict-no for direction U in state State-A
  709. In State-A moving U
  710. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  711. predict error 0
  712. dir: dir isU
  713. \-/88: O: O176 (predict-no)
  714. I see 1 and I'm going to do: predict-no
  715. ENV: Agent did: predict-no for direction U in state State-A
  716. In State-A moving U
  717. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  718. predict error 0
  719. dir: dir isU
  720. |\-89: O: O178 (predict-no)
  721. I see 1 and I'm going to do: predict-no
  722. ENV: Agent did: predict-no for direction U in state State-A
  723. In State-A moving U
  724. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  725. predict error 0
  726. dir: dir isR
  727. /|\90: O: O180 (predict-no)
  728. I see 1 and I'm going to do: predict-no
  729. ENV: Agent did: predict-no for direction R in state State-A
  730. In State-A moving R
  731. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  732. predict error 1
  733. dir: dir isU
  734. -/91: O: O182 (predict-no)
  735. I see 0 and I'm going to do: predict-no
  736. ENV: Agent did: predict-no for direction U in state State-B
  737. In State-B moving U
  738. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  739. predict error 0
  740. dir: dir isR
  741. rule alias: '*'
  742. rule alias: '*'
  743. rule alias: '*'
  744. |92: O: O184 (predict-no)
  745. I see 1 and I'm going to do: predict-no
  746. ENV: Agent did: predict-no for direction R in state State-B
  747. In State-B moving R
  748. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  749. predict error 0
  750. dir: dir isR
  751. \-93: O: O186 (predict-no)
  752. I see 1 and I'm going to do: predict-no
  753. ENV: Agent did: predict-no for direction R in state State-B
  754. In State-B moving R
  755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  756. predict error 0
  757. dir: dir isR
  758. /|94: O: O187 (predict-yes)
  759. I see 1 and I'm going to do: predict-yes
  760. ENV: Agent did: predict-yes for direction R in state State-B
  761. In State-B moving R
  762. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  763. predict error 1
  764. dir: dir isU
  765. \-/95: O: O189 (predict-yes)
  766. I see 0 and I'm going to do: predict-yes
  767. ENV: Agent did: predict-yes for direction U in state State-B
  768. In State-B moving U
  769. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  770. predict error 1
  771. dir: dir isU
  772. |\-96: O: O192 (predict-no)
  773. I see 0 and I'm going to do: predict-no
  774. ENV: Agent did: predict-no for direction U in state State-B
  775. In State-B moving U
  776. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  777. predict error 0
  778. dir: dir isU
  779. /|\97: O: O194 (predict-no)
  780. I see 1 and I'm going to do: predict-no
  781. ENV: Agent did: predict-no for direction U in state State-B
  782. In State-B moving U
  783. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  784. predict error 0
  785. dir: dir isL
  786. -/98: O: O195 (predict-yes)
  787. I see 1 and I'm going to do: predict-yes
  788. ENV: Agent did: predict-yes for direction L in state State-B
  789. In State-B moving L
  790. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  791. predict error 0
  792. dir: dir isR
  793. |\-99: O: O197 (predict-yes)
  794. I see 1 and I'm going to do: predict-yes
  795. ENV: Agent did: predict-yes for direction R in state State-A
  796. In State-A moving R
  797. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  798. predict error 0
  799. dir: dir isR
  800. /|\100: O: O200 (predict-no)
  801. I see 1 and I'm going to do: predict-no
  802. ENV: Agent did: predict-no for direction R in state State-B
  803. In State-B moving R
  804. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  805. predict error 0
  806. dir: dir isR
  807. -/|101: O: O202 (predict-no)
  808. I see 1 and I'm going to do: predict-no
  809. ENV: Agent did: predict-no for direction R in state State-B
  810. In State-B moving R
  811. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  812. predict error 0
  813. dir: dir isU
  814. rule alias: '*'
  815. \-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|\sleeping...
  816. -sleeping...
  817. /sleeping...
  818. |sleeping...
  819. \sleeping...
  820. -sleeping...
  821. /sleeping...
  822. |sleeping...
  823. \sleeping...
  824. -sleeping...
  825. /sleeping...
  826. |sleeping...
  827. \sleeping...
  828. -sleeping...
  829. /sleeping...
  830. |sleeping...
  831. \sleeping...
  832. -sleeping...
  833. /102: O: O204 (predict-no)
  834. I see 1 and I'm going to do: predict-no
  835. ENV: Agent did: predict-no for direction U in state State-B
  836. In State-B moving U
  837. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  838. predict error 0
  839. dir: dir isL
  840. |\103: O: O206 (predict-no)
  841. I see 1 and I'm going to do: predict-no
  842. ENV: Agent did: predict-no for direction L in state State-B
  843. In State-B moving L
  844. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  845. predict error 1
  846. dir: dir isU
  847. -/|104: O: O208 (predict-no)
  848. I see 0 and I'm going to do: predict-no
  849. ENV: Agent did: predict-no for direction U in state State-A
  850. In State-A moving U
  851. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  852. predict error 0
  853. dir: dir isL
  854. \-/105: O: O209 (predict-yes)
  855. I see 1 and I'm going to do: predict-yes
  856. ENV: Agent did: predict-yes for direction L in state State-A
  857. In State-A moving L
  858. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  859. predict error 1
  860. dir: dir isL
  861. |\106: O: O211 (predict-yes)
  862. I see 0 and I'm going to do: predict-yes
  863. ENV: Agent did: predict-yes for direction L in state State-A
  864. In State-A moving L
  865. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  866. predict error 1
  867. dir: dir isU
  868. -/|107: O: O214 (predict-no)
  869. I see 0 and I'm going to do: predict-no
  870. ENV: Agent did: predict-no for direction U in state State-A
  871. In State-A moving U
  872. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  873. predict error 0
  874. dir: dir isL
  875. \-108: O: O215 (predict-yes)
  876. I see 1 and I'm going to do: predict-yes
  877. ENV: Agent did: predict-yes for direction L in state State-A
  878. In State-A moving L
  879. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  880. predict error 1
  881. dir: dir isU
  882. /|\109: O: O218 (predict-no)
  883. I see 0 and I'm going to do: predict-no
  884. ENV: Agent did: predict-no for direction U in state State-A
  885. In State-A moving U
  886. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  887. predict error 0
  888. dir: dir isL
  889. -/110: O: O219 (predict-yes)
  890. I see 1 and I'm going to do: predict-yes
  891. ENV: Agent did: predict-yes for direction L in state State-A
  892. In State-A moving L
  893. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  894. predict error 1
  895. dir: dir isL
  896. |\-111: O: O221 (predict-yes)
  897. I see 0 and I'm going to do: predict-yes
  898. ENV: Agent did: predict-yes for direction L in state State-A
  899. In State-A moving L
  900. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  901. predict error 1
  902. dir: dir isU
  903. rule alias: '*'
  904. rule alias: '*'
  905. rule alias: '*'
  906. /112: O: O224 (predict-no)
  907. I see 0 and I'm going to do: predict-no
  908. ENV: Agent did: predict-no for direction U in state State-A
  909. In State-A moving U
  910. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  911. predict error 0
  912. dir: dir isL
  913. |\-113: O: O225 (predict-yes)
  914. I see 1 and I'm going to do: predict-yes
  915. ENV: Agent did: predict-yes for direction L in state State-A
  916. In State-A moving L
  917. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  918. predict error 1
  919. dir: dir isR
  920. /|\114: O: O228 (predict-no)
  921. I see 0 and I'm going to do: predict-no
  922. ENV: Agent did: predict-no for direction R in state State-A
  923. In State-A moving R
  924. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  925. predict error 1
  926. dir: dir isU
  927. -/|115: O: O230 (predict-no)
  928. I see 0 and I'm going to do: predict-no
  929. ENV: Agent did: predict-no for direction U in state State-B
  930. In State-B moving U
  931. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  932. predict error 0
  933. dir: dir isR
  934. \-116: O: O232 (predict-no)
  935. I see 1 and I'm going to do: predict-no
  936. ENV: Agent did: predict-no for direction R in state State-B
  937. In State-B moving R
  938. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  939. predict error 0
  940. dir: dir isU
  941. /|117: O: O234 (predict-no)
  942. I see 1 and I'm going to do: predict-no
  943. ENV: Agent did: predict-no for direction U in state State-B
  944. In State-B moving U
  945. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  946. predict error 0
  947. dir: dir isL
  948. \-/118: O: O235 (predict-yes)
  949. I see 1 and I'm going to do: predict-yes
  950. ENV: Agent did: predict-yes for direction L in state State-B
  951. In State-B moving L
  952. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  953. predict error 0
  954. dir: dir isR
  955. |\119: O: O238 (predict-no)
  956. I see 1 and I'm going to do: predict-no
  957. ENV: Agent did: predict-no for direction R in state State-A
  958. In State-A moving R
  959. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  960. predict error 1
  961. dir: dir isR
  962. -/|120: O: O239 (predict-yes)
  963. I see 0 and I'm going to do: predict-yes
  964. ENV: Agent did: predict-yes for direction R in state State-B
  965. In State-B moving R
  966. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  967. predict error 1
  968. dir: dir isR
  969. \-/121: O: O242 (predict-no)
  970. I see 0 and I'm going to do: predict-no
  971. ENV: Agent did: predict-no for direction R in state State-B
  972. In State-B moving R
  973. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  974. predict error 0
  975. dir: dir isR
  976. rule alias: '*'
  977. rule alias: '*'
  978. rule alias: '*'
  979. rule alias: '*'
  980. rule alias: '*'
  981. rule alias: '*'
  982. rule alias: '*'
  983. rule alias: '*'
  984. |122: O: O244 (predict-no)
  985. I see 1 and I'm going to do: predict-no
  986. ENV: Agent did: predict-no for direction R in state State-B
  987. In State-B moving R
  988. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  989. predict error 0
  990. dir: dir isR
  991. \-123: O: O245 (predict-yes)
  992. I see 1 and I'm going to do: predict-yes
  993. ENV: Agent did: predict-yes for direction R in state State-B
  994. In State-B moving R
  995. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  996. predict error 1
  997. dir: dir isU
  998. /|\124: O: O248 (predict-no)
  999. I see 0 and I'm going to do: predict-no
  1000. ENV: Agent did: predict-no for direction U in state State-B
  1001. In State-B moving U
  1002. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1003. predict error 0
  1004. dir: dir isL
  1005. -/|125: O: O249 (predict-yes)
  1006. I see 1 and I'm going to do: predict-yes
  1007. ENV: Agent did: predict-yes for direction L in state State-B
  1008. In State-B moving L
  1009. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1010. predict error 0
  1011. dir: dir isL
  1012. \-/126: O: O251 (predict-yes)
  1013. I see 1 and I'm going to do: predict-yes
  1014. ENV: Agent did: predict-yes for direction L in state State-A
  1015. In State-A moving L
  1016. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1017. predict error 1
  1018. dir: dir isU
  1019. |\-127: O: O254 (predict-no)
  1020. I see 0 and I'm going to do: predict-no
  1021. ENV: Agent did: predict-no for direction U in state State-A
  1022. In State-A moving U
  1023. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1024. predict error 0
  1025. dir: dir isL
  1026. /|\128: O: O255 (predict-yes)
  1027. I see 1 and I'm going to do: predict-yes
  1028. ENV: Agent did: predict-yes for direction L in state State-A
  1029. In State-A moving L
  1030. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1031. predict error 1
  1032. dir: dir isL
  1033. -/|129: O: O257 (predict-yes)
  1034. I see 0 and I'm going to do: predict-yes
  1035. ENV: Agent did: predict-yes for direction L in state State-A
  1036. In State-A moving L
  1037. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1038. predict error 1
  1039. dir: dir isL
  1040. \-/130: O: O259 (predict-yes)
  1041. I see 0 and I'm going to do: predict-yes
  1042. ENV: Agent did: predict-yes for direction L in state State-A
  1043. In State-A moving L
  1044. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1045. predict error 1
  1046. dir: dir isU
  1047. |\-131: O: O262 (predict-no)
  1048. I see 0 and I'm going to do: predict-no
  1049. ENV: Agent did: predict-no for direction U in state State-A
  1050. In State-A moving U
  1051. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1052. predict error 0
  1053. dir: dir isU
  1054. /132: O: O264 (predict-no)
  1055. I see 1 and I'm going to do: predict-no
  1056. ENV: Agent did: predict-no for direction U in state State-A
  1057. In State-A moving U
  1058. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1059. predict error 0
  1060. dir: dir isL
  1061. |\133: O: O265 (predict-yes)
  1062. I see 1 and I'm going to do: predict-yes
  1063. ENV: Agent did: predict-yes for direction L in state State-A
  1064. In State-A moving L
  1065. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1066. predict error 1
  1067. dir: dir isR
  1068. -/|134: O: O268 (predict-no)
  1069. I see 0 and I'm going to do: predict-no
  1070. ENV: Agent did: predict-no for direction R in state State-A
  1071. In State-A moving R
  1072. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1073. predict error 1
  1074. dir: dir isL
  1075. \-/135: O: O269 (predict-yes)
  1076. I see 0 and I'm going to do: predict-yes
  1077. ENV: Agent did: predict-yes for direction L in state State-B
  1078. In State-B moving L
  1079. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1080. predict error 0
  1081. dir: dir isL
  1082. |\-136: O: O271 (predict-yes)
  1083. I see 1 and I'm going to do: predict-yes
  1084. ENV: Agent did: predict-yes for direction L in state State-A
  1085. In State-A moving L
  1086. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1087. predict error 1
  1088. dir: dir isL
  1089. /|\-sleeping...
  1090. /137: O: O274 (predict-no)
  1091. I see 0 and I'm going to do: predict-no
  1092. ENV: Agent did: predict-no for direction L in state State-A
  1093. In State-A moving L
  1094. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1095. predict error 0
  1096. dir: dir isR
  1097. |\138: O: O276 (predict-no)
  1098. I see 1 and I'm going to do: predict-no
  1099. ENV: Agent did: predict-no for direction R in state State-A
  1100. In State-A moving R
  1101. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1102. predict error 1
  1103. dir: dir isR
  1104. -/|139: O: O278 (predict-no)
  1105. I see 0 and I'm going to do: predict-no
  1106. ENV: Agent did: predict-no for direction R in state State-B
  1107. In State-B moving R
  1108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1109. predict error 0
  1110. dir: dir isL
  1111. \-/140: O: O279 (predict-yes)
  1112. I see 1 and I'm going to do: predict-yes
  1113. ENV: Agent did: predict-yes for direction L in state State-B
  1114. In State-B moving L
  1115. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1116. predict error 0
  1117. dir: dir isR
  1118. |\-141: O: O282 (predict-no)
  1119. I see 1 and I'm going to do: predict-no
  1120. ENV: Agent did: predict-no for direction R in state State-A
  1121. In State-A moving R
  1122. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1123. predict error 1
  1124. dir: dir isL
  1125. rule alias: '*'
  1126. /142: O: O283 (predict-yes)
  1127. I see 0 and I'm going to do: predict-yes
  1128. ENV: Agent did: predict-yes for direction L in state State-B
  1129. In State-B moving L
  1130. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1131. predict error 0
  1132. dir: dir isL
  1133. |\143: O: O286 (predict-no)
  1134. I see 1 and I'm going to do: predict-no
  1135. ENV: Agent did: predict-no for direction L in state State-A
  1136. In State-A moving L
  1137. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1138. predict error 0
  1139. dir: dir isU
  1140. -/|144: O: O288 (predict-no)
  1141. I see 1 and I'm going to do: predict-no
  1142. ENV: Agent did: predict-no for direction U in state State-A
  1143. In State-A moving U
  1144. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1145. predict error 0
  1146. dir: dir isL
  1147. \-/145: O: O290 (predict-no)
  1148. I see 1 and I'm going to do: predict-no
  1149. ENV: Agent did: predict-no for direction L in state State-A
  1150. In State-A moving L
  1151. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1152. predict error 0
  1153. dir: dir isR
  1154. |\146: O: O292 (predict-no)
  1155. I see 1 and I'm going to do: predict-no
  1156. ENV: Agent did: predict-no for direction R in state State-A
  1157. In State-A moving R
  1158. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1159. predict error 1
  1160. dir: dir isU
  1161. -/|147: O: O294 (predict-no)
  1162. I see 0 and I'm going to do: predict-no
  1163. ENV: Agent did: predict-no for direction U in state State-B
  1164. In State-B moving U
  1165. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1166. predict error 0
  1167. dir: dir isU
  1168. \-/148: O: O296 (predict-no)
  1169. I see 1 and I'm going to do: predict-no
  1170. ENV: Agent did: predict-no for direction U in state State-B
  1171. In State-B moving U
  1172. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1173. predict error 0
  1174. dir: dir isU
  1175. |\149: O: O298 (predict-no)
  1176. I see 1 and I'm going to do: predict-no
  1177. ENV: Agent did: predict-no for direction U in state State-B
  1178. In State-B moving U
  1179. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1180. predict error 0
  1181. dir: dir isL
  1182. -/|150: O: O299 (predict-yes)
  1183. I see 1 and I'm going to do: predict-yes
  1184. ENV: Agent did: predict-yes for direction L in state State-B
  1185. In State-B moving L
  1186. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1187. predict error 0
  1188. dir: dir isU
  1189. \-/151: O: O302 (predict-no)
  1190. I see 1 and I'm going to do: predict-no
  1191. ENV: Agent did: predict-no for direction U in state State-A
  1192. In State-A moving U
  1193. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1194. predict error 0
  1195. dir: dir isU
  1196. |152: O: O304 (predict-no)
  1197. I see 1 and I'm going to do: predict-no
  1198. ENV: Agent did: predict-no for direction U in state State-A
  1199. In State-A moving U
  1200. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1201. predict error 0
  1202. dir: dir isL
  1203. \-/153: O: O306 (predict-no)
  1204. I see 1 and I'm going to do: predict-no
  1205. ENV: Agent did: predict-no for direction L in state State-A
  1206. In State-A moving L
  1207. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1208. predict error 0
  1209. dir: dir isU
  1210. |\-154: O: O308 (predict-no)
  1211. I see 1 and I'm going to do: predict-no
  1212. ENV: Agent did: predict-no for direction U in state State-A
  1213. In State-A moving U
  1214. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1215. predict error 0
  1216. dir: dir isU
  1217. /|\-155: O: O310 (predict-no)
  1218. I see 1 and I'm going to do: predict-no
  1219. ENV: Agent did: predict-no for direction U in state State-A
  1220. In State-A moving U
  1221. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1222. predict error 0
  1223. dir: dir isR
  1224. /|156: O: O312 (predict-no)
  1225. I see 1 and I'm going to do: predict-no
  1226. ENV: Agent did: predict-no for direction R in state State-A
  1227. In State-A moving R
  1228. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1229. predict error 1
  1230. dir: dir isL
  1231. \-/157: O: O313 (predict-yes)
  1232. I see 0 and I'm going to do: predict-yes
  1233. ENV: Agent did: predict-yes for direction L in state State-B
  1234. In State-B moving L
  1235. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1236. predict error 0
  1237. dir: dir isR
  1238. |\-158: O: O316 (predict-no)
  1239. I see 1 and I'm going to do: predict-no
  1240. ENV: Agent did: predict-no for direction R in state State-A
  1241. In State-A moving R
  1242. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1243. predict error 1
  1244. dir: dir isR
  1245. /|159: O: O318 (predict-no)
  1246. I see 0 and I'm going to do: predict-no
  1247. ENV: Agent did: predict-no for direction R in state State-B
  1248. In State-B moving R
  1249. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1250. predict error 0
  1251. dir: dir isL
  1252. \-/160: O: O319 (predict-yes)
  1253. I see 1 and I'm going to do: predict-yes
  1254. ENV: Agent did: predict-yes for direction L in state State-B
  1255. In State-B moving L
  1256. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1257. predict error 0
  1258. dir: dir isR
  1259. |\-161: O: O322 (predict-no)
  1260. I see 1 and I'm going to do: predict-no
  1261. ENV: Agent did: predict-no for direction R in state State-A
  1262. In State-A moving R
  1263. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1264. predict error 1
  1265. dir: dir isR
  1266. /162: O: O324 (predict-no)
  1267. I see 0 and I'm going to do: predict-no
  1268. ENV: Agent did: predict-no for direction R in state State-B
  1269. In State-B moving R
  1270. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1271. predict error 0
  1272. dir: dir isR
  1273. |\-163: O: O326 (predict-no)
  1274. I see 1 and I'm going to do: predict-no
  1275. ENV: Agent did: predict-no for direction R in state State-B
  1276. In State-B moving R
  1277. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1278. predict error 0
  1279. dir: dir isR
  1280. /|\164: O: O328 (predict-no)
  1281. I see 1 and I'm going to do: predict-no
  1282. ENV: Agent did: predict-no for direction R in state State-B
  1283. In State-B moving R
  1284. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1285. predict error 0
  1286. dir: dir isL
  1287. -/|165: O: O329 (predict-yes)
  1288. I see 1 and I'm going to do: predict-yes
  1289. ENV: Agent did: predict-yes for direction L in state State-B
  1290. In State-B moving L
  1291. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1292. predict error 0
  1293. dir: dir isR
  1294. \166: O: O332 (predict-no)
  1295. I see 1 and I'm going to do: predict-no
  1296. ENV: Agent did: predict-no for direction R in state State-A
  1297. In State-A moving R
  1298. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1299. predict error 1
  1300. dir: dir isU
  1301. -/|167: O: O334 (predict-no)
  1302. I see 0 and I'm going to do: predict-no
  1303. ENV: Agent did: predict-no for direction U in state State-B
  1304. In State-B moving U
  1305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1306. predict error 0
  1307. dir: dir isL
  1308. \-/168: O: O335 (predict-yes)
  1309. I see 1 and I'm going to do: predict-yes
  1310. ENV: Agent did: predict-yes for direction L in state State-B
  1311. In State-B moving L
  1312. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1313. predict error 0
  1314. dir: dir isR
  1315. |\169: O: O338 (predict-no)
  1316. I see 1 and I'm going to do: predict-no
  1317. ENV: Agent did: predict-no for direction R in state State-A
  1318. In State-A moving R
  1319. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1320. predict error 1
  1321. dir: dir isL
  1322. -/|170: O: O339 (predict-yes)
  1323. I see 0 and I'm going to do: predict-yes
  1324. ENV: Agent did: predict-yes for direction L in state State-B
  1325. In State-B moving L
  1326. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1327. predict error 0
  1328. dir: dir isU
  1329. \-/171: O: O342 (predict-no)
  1330. I see 1 and I'm going to do: predict-no
  1331. ENV: Agent did: predict-no for direction U in state State-A
  1332. In State-A moving U
  1333. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1334. predict error 0
  1335. dir: dir isR
  1336. |172: O: O343 (predict-yes)
  1337. I see 1 and I'm going to do: predict-yes
  1338. ENV: Agent did: predict-yes for direction R in state State-A
  1339. In State-A moving R
  1340. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1341. predict error 0
  1342. dir: dir isL
  1343. \-/173: O: O345 (predict-yes)
  1344. I see 1 and I'm going to do: predict-yes
  1345. ENV: Agent did: predict-yes for direction L in state State-B
  1346. In State-B moving L
  1347. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1348. predict error 0
  1349. dir: dir isL
  1350. |\-174: O: O348 (predict-no)
  1351. I see 1 and I'm going to do: predict-no
  1352. ENV: Agent did: predict-no for direction L in state State-A
  1353. In State-A moving L
  1354. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1355. predict error 0
  1356. dir: dir isL
  1357. /|\175: O: O350 (predict-no)
  1358. I see 1 and I'm going to do: predict-no
  1359. ENV: Agent did: predict-no for direction L in state State-A
  1360. In State-A moving L
  1361. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1362. predict error 0
  1363. dir: dir isU
  1364. -/|176: O: O352 (predict-no)
  1365. I see 1 and I'm going to do: predict-no
  1366. ENV: Agent did: predict-no for direction U in state State-A
  1367. In State-A moving U
  1368. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1369. predict error 0
  1370. dir: dir isR
  1371. \177: O: O353 (predict-yes)
  1372. I see 1 and I'm going to do: predict-yes
  1373. ENV: Agent did: predict-yes for direction R in state State-A
  1374. In State-A moving R
  1375. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1376. predict error 0
  1377. dir: dir isL
  1378. -/|178: O: O355 (predict-yes)
  1379. I see 1 and I'm going to do: predict-yes
  1380. ENV: Agent did: predict-yes for direction L in state State-B
  1381. In State-B moving L
  1382. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1383. predict error 0
  1384. dir: dir isR
  1385. \-179: O: O357 (predict-yes)
  1386. I see 1 and I'm going to do: predict-yes
  1387. ENV: Agent did: predict-yes for direction R in state State-A
  1388. In State-A moving R
  1389. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1390. predict error 0
  1391. dir: dir isU
  1392. /|\180: O: O360 (predict-no)
  1393. I see 1 and I'm going to do: predict-no
  1394. ENV: Agent did: predict-no for direction U in state State-B
  1395. In State-B moving U
  1396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1397. predict error 0
  1398. dir: dir isR
  1399. -/|181: O: O362 (predict-no)
  1400. I see 1 and I'm going to do: predict-no
  1401. ENV: Agent did: predict-no for direction R in state State-B
  1402. In State-B moving R
  1403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1404. predict error 0
  1405. dir: dir isR
  1406. \182: O: O364 (predict-no)
  1407. I see 1 and I'm going to do: predict-no
  1408. ENV: Agent did: predict-no for direction R in state State-B
  1409. In State-B moving R
  1410. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1411. predict error 0
  1412. dir: dir isU
  1413. -/|183: O: O366 (predict-no)
  1414. I see 1 and I'm going to do: predict-no
  1415. ENV: Agent did: predict-no for direction U in state State-B
  1416. In State-B moving U
  1417. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1418. predict error 0
  1419. dir: dir isR
  1420. \-/184: O: O368 (predict-no)
  1421. I see 1 and I'm going to do: predict-no
  1422. ENV: Agent did: predict-no for direction R in state State-B
  1423. In State-B moving R
  1424. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1425. predict error 0
  1426. dir: dir isR
  1427. |\-185: O: O370 (predict-no)
  1428. I see 1 and I'm going to do: predict-no
  1429. ENV: Agent did: predict-no for direction R in state State-B
  1430. In State-B moving R
  1431. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1432. predict error 0
  1433. dir: dir isR
  1434. /|\186: O: O372 (predict-no)
  1435. I see 1 and I'm going to do: predict-no
  1436. ENV: Agent did: predict-no for direction R in state State-B
  1437. In State-B moving R
  1438. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1439. predict error 0
  1440. dir: dir isL
  1441. -/187: O: O373 (predict-yes)
  1442. I see 1 and I'm going to do: predict-yes
  1443. ENV: Agent did: predict-yes for direction L in state State-B
  1444. In State-B moving L
  1445. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1446. predict error 0
  1447. dir: dir isL
  1448. |\188: O: O376 (predict-no)
  1449. I see 1 and I'm going to do: predict-no
  1450. ENV: Agent did: predict-no for direction L in state State-A
  1451. In State-A moving L
  1452. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1453. predict error 0
  1454. dir: dir isR
  1455. -/|189: O: O377 (predict-yes)
  1456. I see 1 and I'm going to do: predict-yes
  1457. ENV: Agent did: predict-yes for direction R in state State-A
  1458. In State-A moving R
  1459. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1460. predict error 0
  1461. dir: dir isL
  1462. \-/190: O: O379 (predict-yes)
  1463. I see 1 and I'm going to do: predict-yes
  1464. ENV: Agent did: predict-yes for direction L in state State-B
  1465. In State-B moving L
  1466. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1467. predict error 0
  1468. dir: dir isR
  1469. |\-191: O: O381 (predict-yes)
  1470. I see 1 and I'm going to do: predict-yes
  1471. ENV: Agent did: predict-yes for direction R in state State-A
  1472. In State-A moving R
  1473. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1474. predict error 0
  1475. dir: dir isR
  1476. /192: O: O384 (predict-no)
  1477. I see 1 and I'm going to do: predict-no
  1478. ENV: Agent did: predict-no for direction R in state State-B
  1479. In State-B moving R
  1480. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1481. predict error 0
  1482. dir: dir isU
  1483. |\-193: O: O386 (predict-no)
  1484. I see 1 and I'm going to do: predict-no
  1485. ENV: Agent did: predict-no for direction U in state State-B
  1486. In State-B moving U
  1487. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1488. predict error 0
  1489. dir: dir isR
  1490. /|\194: O: O388 (predict-no)
  1491. I see 1 and I'm going to do: predict-no
  1492. ENV: Agent did: predict-no for direction R in state State-B
  1493. In State-B moving R
  1494. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1495. predict error 0
  1496. dir: dir isR
  1497. -/195: O: O390 (predict-no)
  1498. I see 1 and I'm going to do: predict-no
  1499. ENV: Agent did: predict-no for direction R in state State-B
  1500. In State-B moving R
  1501. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1502. predict error 0
  1503. dir: dir isR
  1504. |\-196: O: O392 (predict-no)
  1505. I see 1 and I'm going to do: predict-no
  1506. ENV: Agent did: predict-no for direction R in state State-B
  1507. In State-B moving R
  1508. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1509. predict error 0
  1510. dir: dir isU
  1511. /|\197: O: O394 (predict-no)
  1512. I see 1 and I'm going to do: predict-no
  1513. ENV: Agent did: predict-no for direction U in state State-B
  1514. In State-B moving U
  1515. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1516. predict error 0
  1517. dir: dir isR
  1518. -/198: O: O396 (predict-no)
  1519. I see 1 and I'm going to do: predict-no
  1520. ENV: Agent did: predict-no for direction R in state State-B
  1521. In State-B moving R
  1522. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1523. predict error 0
  1524. dir: dir isR
  1525. |\-199: O: O398 (predict-no)
  1526. I see 1 and I'm going to do: predict-no
  1527. ENV: Agent did: predict-no for direction R in state State-B
  1528. In State-B moving R
  1529. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1530. predict error 0
  1531. dir: dir isL
  1532. /|\200: O: O399 (predict-yes)
  1533. I see 1 and I'm going to do: predict-yes
  1534. ENV: Agent did: predict-yes for direction L in state State-B
  1535. In State-B moving L
  1536. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1537. predict error 0
  1538. dir: dir isR
  1539. -/|201: O: O401 (predict-yes)
  1540. I see 1 and I'm going to do: predict-yes
  1541. ENV: Agent did: predict-yes for direction R in state State-A
  1542. In State-A moving R
  1543. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1544. predict error 0
  1545. dir: dir isL
  1546. \-202: O: O403 (predict-yes)
  1547. I see 1 and I'm going to do: predict-yes
  1548. ENV: Agent did: predict-yes for direction L in state State-B
  1549. In State-B moving L
  1550. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1551. predict error 0
  1552. dir: dir isL
  1553. /|\203: O: O406 (predict-no)
  1554. I see 1 and I'm going to do: predict-no
  1555. ENV: Agent did: predict-no for direction L in state State-A
  1556. In State-A moving L
  1557. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1558. predict error 0
  1559. dir: dir isR
  1560. -/|204: O: O407 (predict-yes)
  1561. I see 1 and I'm going to do: predict-yes
  1562. ENV: Agent did: predict-yes for direction R in state State-A
  1563. In State-A moving R
  1564. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1565. predict error 0
  1566. dir: dir isR
  1567. \205: O: O410 (predict-no)
  1568. I see 1 and I'm going to do: predict-no
  1569. ENV: Agent did: predict-no for direction R in state State-B
  1570. In State-B moving R
  1571. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1572. predict error 0
  1573. dir: dir isR
  1574. -/206: O: O412 (predict-no)
  1575. I see 1 and I'm going to do: predict-no
  1576. ENV: Agent did: predict-no for direction R in state State-B
  1577. In State-B moving R
  1578. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1579. predict error 0
  1580. dir: dir isU
  1581. |\-207: O: O414 (predict-no)
  1582. I see 1 and I'm going to do: predict-no
  1583. ENV: Agent did: predict-no for direction U in state State-B
  1584. In State-B moving U
  1585. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1586. predict error 0
  1587. dir: dir isU
  1588. /|\208: O: O416 (predict-no)
  1589. I see 1 and I'm going to do: predict-no
  1590. ENV: Agent did: predict-no for direction U in state State-B
  1591. In State-B moving U
  1592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1593. predict error 0
  1594. dir: dir isR
  1595. -/209: O: O418 (predict-no)
  1596. I see 1 and I'm going to do: predict-no
  1597. ENV: Agent did: predict-no for direction R in state State-B
  1598. In State-B moving R
  1599. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1600. predict error 0
  1601. dir: dir isL
  1602. |\-210: O: O419 (predict-yes)
  1603. I see 1 and I'm going to do: predict-yes
  1604. ENV: Agent did: predict-yes for direction L in state State-B
  1605. In State-B moving L
  1606. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1607. predict error 0
  1608. dir: dir isR
  1609. /|\211: O: O421 (predict-yes)
  1610. I see 1 and I'm going to do: predict-yes
  1611. ENV: Agent did: predict-yes for direction R in state State-A
  1612. In State-A moving R
  1613. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1614. predict error 0
  1615. dir: dir isU
  1616. -212: O: O424 (predict-no)
  1617. I see 1 and I'm going to do: predict-no
  1618. ENV: Agent did: predict-no for direction U in state State-B
  1619. In State-B moving U
  1620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1621. predict error 0
  1622. dir: dir isU
  1623. /|\213: O: O426 (predict-no)
  1624. I see 1 and I'm going to do: predict-no
  1625. ENV: Agent did: predict-no for direction U in state State-B
  1626. In State-B moving U
  1627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1628. predict error 0
  1629. dir: dir isU
  1630. -214: O: O428 (predict-no)
  1631. I see 1 and I'm going to do: predict-no
  1632. ENV: Agent did: predict-no for direction U in state State-B
  1633. In State-B moving U
  1634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1635. predict error 0
  1636. dir: dir isL
  1637. /|\215: O: O429 (predict-yes)
  1638. I see 1 and I'm going to do: predict-yes
  1639. ENV: Agent did: predict-yes for direction L in state State-B
  1640. In State-B moving L
  1641. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1642. predict error 0
  1643. dir: dir isU
  1644. -/|216: O: O432 (predict-no)
  1645. I see 1 and I'm going to do: predict-no
  1646. ENV: Agent did: predict-no for direction U in state State-A
  1647. In State-A moving U
  1648. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1649. predict error 0
  1650. dir: dir isR
  1651. \-217: O: O433 (predict-yes)
  1652. I see 1 and I'm going to do: predict-yes
  1653. ENV: Agent did: predict-yes for direction R in state State-A
  1654. In State-A moving R
  1655. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1656. predict error 0
  1657. dir: dir isL
  1658. /|\218: O: O435 (predict-yes)
  1659. I see 1 and I'm going to do: predict-yes
  1660. ENV: Agent did: predict-yes for direction L in state State-B
  1661. In State-B moving L
  1662. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1663. predict error 0
  1664. dir: dir isU
  1665. -/|219: O: O437 (predict-yes)
  1666. I see 1 and I'm going to do: predict-yes
  1667. ENV: Agent did: predict-yes for direction U in state State-A
  1668. In State-A moving U
  1669. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1670. predict error 1
  1671. dir: dir isU
  1672. \-/220: O: O440 (predict-no)
  1673. I see 0 and I'm going to do: predict-no
  1674. ENV: Agent did: predict-no for direction U in state State-A
  1675. In State-A moving U
  1676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1677. predict error 0
  1678. dir: dir isR
  1679. |\-221: O: O441 (predict-yes)
  1680. I see 1 and I'm going to do: predict-yes
  1681. ENV: Agent did: predict-yes for direction R in state State-A
  1682. In State-A moving R
  1683. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1684. predict error 0
  1685. dir: dir isU
  1686. /222: O: O444 (predict-no)
  1687. I see 1 and I'm going to do: predict-no
  1688. ENV: Agent did: predict-no for direction U in state State-B
  1689. In State-B moving U
  1690. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1691. predict error 0
  1692. dir: dir isL
  1693. |\-223: O: O445 (predict-yes)
  1694. I see 1 and I'm going to do: predict-yes
  1695. ENV: Agent did: predict-yes for direction L in state State-B
  1696. In State-B moving L
  1697. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1698. predict error 0
  1699. dir: dir isL
  1700. /|224: O: O448 (predict-no)
  1701. I see 1 and I'm going to do: predict-no
  1702. ENV: Agent did: predict-no for direction L in state State-A
  1703. In State-A moving L
  1704. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1705. predict error 0
  1706. dir: dir isU
  1707. \-/225: O: O450 (predict-no)
  1708. I see 1 and I'm going to do: predict-no
  1709. ENV: Agent did: predict-no for direction U in state State-A
  1710. In State-A moving U
  1711. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1712. predict error 0
  1713. dir: dir isL
  1714. |226: O: O452 (predict-no)
  1715. I see 1 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction L in state State-A
  1717. In State-A moving L
  1718. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1719. predict error 0
  1720. dir: dir isU
  1721. \-/227: O: O454 (predict-no)
  1722. I see 1 and I'm going to do: predict-no
  1723. ENV: Agent did: predict-no for direction U in state State-A
  1724. In State-A moving U
  1725. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1726. predict error 0
  1727. dir: dir isR
  1728. |228: O: O455 (predict-yes)
  1729. I see 1 and I'm going to do: predict-yes
  1730. ENV: Agent did: predict-yes for direction R in state State-A
  1731. In State-A moving R
  1732. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1733. predict error 0
  1734. dir: dir isL
  1735. \-/229: O: O457 (predict-yes)
  1736. I see 1 and I'm going to do: predict-yes
  1737. ENV: Agent did: predict-yes for direction L in state State-B
  1738. In State-B moving L
  1739. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1740. predict error 0
  1741. dir: dir isL
  1742. |\-230: O: O460 (predict-no)
  1743. I see 1 and I'm going to do: predict-no
  1744. ENV: Agent did: predict-no for direction L in state State-A
  1745. In State-A moving L
  1746. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1747. predict error 0
  1748. dir: dir isR
  1749. /|\231: O: O462 (predict-no)
  1750. I see 1 and I'm going to do: predict-no
  1751. ENV: Agent did: predict-no for direction R in state State-A
  1752. In State-A moving R
  1753. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1754. predict error 1
  1755. dir: dir isU
  1756. -232: O: O464 (predict-no)
  1757. I see 0 and I'm going to do: predict-no
  1758. ENV: Agent did: predict-no for direction U in state State-B
  1759. In State-B moving U
  1760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1761. predict error 0
  1762. dir: dir isL
  1763. /|\233: O: O465 (predict-yes)
  1764. I see 1 and I'm going to do: predict-yes
  1765. ENV: Agent did: predict-yes for direction L in state State-B
  1766. In State-B moving L
  1767. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1768. predict error 0
  1769. dir: dir isU
  1770. -/|234: O: O468 (predict-no)
  1771. I see 1 and I'm going to do: predict-no
  1772. ENV: Agent did: predict-no for direction U in state State-A
  1773. In State-A moving U
  1774. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1775. predict error 0
  1776. dir: dir isL
  1777. \-/235: O: O470 (predict-no)
  1778. I see 1 and I'm going to do: predict-no
  1779. ENV: Agent did: predict-no for direction L in state State-A
  1780. In State-A moving L
  1781. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1782. predict error 0
  1783. dir: dir isU
  1784. |\-236: O: O472 (predict-no)
  1785. I see 1 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction U in state State-A
  1787. In State-A moving U
  1788. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1789. predict error 0
  1790. dir: dir isR
  1791. /|\237: O: O473 (predict-yes)
  1792. I see 1 and I'm going to do: predict-yes
  1793. ENV: Agent did: predict-yes for direction R in state State-A
  1794. In State-A moving R
  1795. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1796. predict error 0
  1797. dir: dir isL
  1798. -/238: O: O475 (predict-yes)
  1799. I see 1 and I'm going to do: predict-yes
  1800. ENV: Agent did: predict-yes for direction L in state State-B
  1801. In State-B moving L
  1802. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1803. predict error 0
  1804. dir: dir isL
  1805. |\239: O: O478 (predict-no)
  1806. I see 1 and I'm going to do: predict-no
  1807. ENV: Agent did: predict-no for direction L in state State-A
  1808. In State-A moving L
  1809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1810. predict error 0
  1811. dir: dir isR
  1812. -/|240: O: O479 (predict-yes)
  1813. I see 1 and I'm going to do: predict-yes
  1814. ENV: Agent did: predict-yes for direction R in state State-A
  1815. In State-A moving R
  1816. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1817. predict error 0
  1818. dir: dir isU
  1819. \-/|241: O: O482 (predict-no)
  1820. I see 1 and I'm going to do: predict-no
  1821. ENV: Agent did: predict-no for direction U in state State-B
  1822. In State-B moving U
  1823. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1824. predict error 0
  1825. dir: dir isU
  1826. \242: O: O484 (predict-no)
  1827. I see 1 and I'm going to do: predict-no
  1828. ENV: Agent did: predict-no for direction U in state State-B
  1829. In State-B moving U
  1830. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1831. predict error 0
  1832. dir: dir isL
  1833. -/|243: O: O485 (predict-yes)
  1834. I see 1 and I'm going to do: predict-yes
  1835. ENV: Agent did: predict-yes for direction L in state State-B
  1836. In State-B moving L
  1837. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1838. predict error 0
  1839. dir: dir isR
  1840. \-244: O: O487 (predict-yes)
  1841. I see 1 and I'm going to do: predict-yes
  1842. ENV: Agent did: predict-yes for direction R in state State-A
  1843. In State-A moving R
  1844. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1845. predict error 0
  1846. dir: dir isR
  1847. /|\245: O: O490 (predict-no)
  1848. I see 1 and I'm going to do: predict-no
  1849. ENV: Agent did: predict-no for direction R in state State-B
  1850. In State-B moving R
  1851. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1852. predict error 0
  1853. dir: dir isR
  1854. -/|246: O: O492 (predict-no)
  1855. I see 1 and I'm going to do: predict-no
  1856. ENV: Agent did: predict-no for direction R in state State-B
  1857. In State-B moving R
  1858. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1859. predict error 0
  1860. dir: dir isU
  1861. \-247: O: O494 (predict-no)
  1862. I see 1 and I'm going to do: predict-no
  1863. ENV: Agent did: predict-no for direction U in state State-B
  1864. In State-B moving U
  1865. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1866. predict error 0
  1867. dir: dir isL
  1868. /|248: O: O495 (predict-yes)
  1869. I see 1 and I'm going to do: predict-yes
  1870. ENV: Agent did: predict-yes for direction L in state State-B
  1871. In State-B moving L
  1872. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1873. predict error 0
  1874. dir: dir isL
  1875. \-249: O: O498 (predict-no)
  1876. I see 1 and I'm going to do: predict-no
  1877. ENV: Agent did: predict-no for direction L in state State-A
  1878. In State-A moving L
  1879. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1880. predict error 0
  1881. dir: dir isL
  1882. /|\250: O: O500 (predict-no)
  1883. I see 1 and I'm going to do: predict-no
  1884. ENV: Agent did: predict-no for direction L in state State-A
  1885. In State-A moving L
  1886. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1887. predict error 0
  1888. dir: dir isU
  1889. -/|251: O: O502 (predict-no)
  1890. I see 1 and I'm going to do: predict-no
  1891. ENV: Agent did: predict-no for direction U in state State-A
  1892. In State-A moving U
  1893. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1894. predict error 0
  1895. dir: dir isR
  1896. \252: O: O503 (predict-yes)
  1897. I see 1 and I'm going to do: predict-yes
  1898. ENV: Agent did: predict-yes for direction R in state State-A
  1899. In State-A moving R
  1900. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1901. predict error 0
  1902. dir: dir isU
  1903. -/253: O: O506 (predict-no)
  1904. I see 1 and I'm going to do: predict-no
  1905. ENV: Agent did: predict-no for direction U in state State-B
  1906. In State-B moving U
  1907. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1908. predict error 0
  1909. dir: dir isU
  1910. |\254: O: O508 (predict-no)
  1911. I see 1 and I'm going to do: predict-no
  1912. ENV: Agent did: predict-no for direction U in state State-B
  1913. In State-B moving U
  1914. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1915. predict error 0
  1916. dir: dir isU
  1917. -255: O: O510 (predict-no)
  1918. I see 1 and I'm going to do: predict-no
  1919. ENV: Agent did: predict-no for direction U in state State-B
  1920. In State-B moving U
  1921. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1922. predict error 0
  1923. dir: dir isL
  1924. /|\256: O: O511 (predict-yes)
  1925. I see 1 and I'm going to do: predict-yes
  1926. ENV: Agent did: predict-yes for direction L in state State-B
  1927. In State-B moving L
  1928. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1929. predict error 0
  1930. dir: dir isU
  1931. -/|257: O: O514 (predict-no)
  1932. I see 1 and I'm going to do: predict-no
  1933. ENV: Agent did: predict-no for direction U in state State-A
  1934. In State-A moving U
  1935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1936. predict error 0
  1937. dir: dir isU
  1938. \-258: O: O516 (predict-no)
  1939. I see 1 and I'm going to do: predict-no
  1940. ENV: Agent did: predict-no for direction U in state State-A
  1941. In State-A moving U
  1942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1943. predict error 0
  1944. dir: dir isR
  1945. /|259: O: O517 (predict-yes)
  1946. I see 1 and I'm going to do: predict-yes
  1947. ENV: Agent did: predict-yes for direction R in state State-A
  1948. In State-A moving R
  1949. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1950. predict error 0
  1951. dir: dir isU
  1952. \-/260: O: O519 (predict-yes)
  1953. I see 1 and I'm going to do: predict-yes
  1954. ENV: Agent did: predict-yes for direction U in state State-B
  1955. In State-B moving U
  1956. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1957. predict error 1
  1958. dir: dir isU
  1959. |\-261: O: O522 (predict-no)
  1960. I see 0 and I'm going to do: predict-no
  1961. ENV: Agent did: predict-no for direction U in state State-B
  1962. In State-B moving U
  1963. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1964. predict error 0
  1965. dir: dir isR
  1966. /262: O: O524 (predict-no)
  1967. I see 1 and I'm going to do: predict-no
  1968. ENV: Agent did: predict-no for direction R in state State-B
  1969. In State-B moving R
  1970. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1971. predict error 0
  1972. dir: dir isR
  1973. |\-263: O: O526 (predict-no)
  1974. I see 1 and I'm going to do: predict-no
  1975. ENV: Agent did: predict-no for direction R in state State-B
  1976. In State-B moving R
  1977. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1978. predict error 0
  1979. dir: dir isR
  1980. /|\264: O: O528 (predict-no)
  1981. I see 1 and I'm going to do: predict-no
  1982. ENV: Agent did: predict-no for direction R in state State-B
  1983. In State-B moving R
  1984. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1985. predict error 0
  1986. dir: dir isL
  1987. -265: O: O529 (predict-yes)
  1988. I see 1 and I'm going to do: predict-yes
  1989. ENV: Agent did: predict-yes for direction L in state State-B
  1990. In State-B moving L
  1991. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1992. predict error 0
  1993. dir: dir isR
  1994. /|\266: O: O531 (predict-yes)
  1995. I see 1 and I'm going to do: predict-yes
  1996. ENV: Agent did: predict-yes for direction R in state State-A
  1997. In State-A moving R
  1998. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1999. predict error 0
  2000. dir: dir isL
  2001. -/|267: O: O534 (predict-no)
  2002. I see 1 and I'm going to do: predict-no
  2003. ENV: Agent did: predict-no for direction L in state State-B
  2004. In State-B moving L
  2005. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2006. predict error 1
  2007. dir: dir isL
  2008. \-/268: O: O536 (predict-no)
  2009. I see 0 and I'm going to do: predict-no
  2010. ENV: Agent did: predict-no for direction L in state State-A
  2011. In State-A moving L
  2012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2013. predict error 0
  2014. dir: dir isR
  2015. |\-269: O: O537 (predict-yes)
  2016. I see 1 and I'm going to do: predict-yes
  2017. ENV: Agent did: predict-yes for direction R in state State-A
  2018. In State-A moving R
  2019. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2020. predict error 0
  2021. dir: dir isU
  2022. /|\270: O: O540 (predict-no)
  2023. I see 1 and I'm going to do: predict-no
  2024. ENV: Agent did: predict-no for direction U in state State-B
  2025. In State-B moving U
  2026. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2027. predict error 0
  2028. dir: dir isU
  2029. -/|271: O: O542 (predict-no)
  2030. I see 1 and I'm going to do: predict-no
  2031. ENV: Agent did: predict-no for direction U in state State-B
  2032. In State-B moving U
  2033. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2034. predict error 0
  2035. dir: dir isR
  2036. \272: O: O543 (predict-yes)
  2037. I see 1 and I'm going to do: predict-yes
  2038. ENV: Agent did: predict-yes for direction R in state State-B
  2039. In State-B moving R
  2040. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2041. predict error 1
  2042. dir: dir isR
  2043. -273: O: O546 (predict-no)
  2044. I see 0 and I'm going to do: predict-no
  2045. ENV: Agent did: predict-no for direction R in state State-B
  2046. In State-B moving R
  2047. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2048. predict error 0
  2049. dir: dir isL
  2050. /|\274: O: O547 (predict-yes)
  2051. I see 1 and I'm going to do: predict-yes
  2052. ENV: Agent did: predict-yes for direction L in state State-B
  2053. In State-B moving L
  2054. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2055. predict error 0
  2056. dir: dir isL
  2057. -/|275: O: O550 (predict-no)
  2058. I see 1 and I'm going to do: predict-no
  2059. ENV: Agent did: predict-no for direction L in state State-A
  2060. In State-A moving L
  2061. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2062. predict error 0
  2063. dir: dir isU
  2064. \-/276: O: O552 (predict-no)
  2065. I see 1 and I'm going to do: predict-no
  2066. ENV: Agent did: predict-no for direction U in state State-A
  2067. In State-A moving U
  2068. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2069. predict error 0
  2070. dir: dir isL
  2071. |\-277: O: O554 (predict-no)
  2072. I see 1 and I'm going to do: predict-no
  2073. ENV: Agent did: predict-no for direction L in state State-A
  2074. In State-A moving L
  2075. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2076. predict error 0
  2077. dir: dir isR
  2078. /|278: O: O555 (predict-yes)
  2079. I see 1 and I'm going to do: predict-yes
  2080. ENV: Agent did: predict-yes for direction R in state State-A
  2081. In State-A moving R
  2082. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2083. predict error 0
  2084. dir: dir isR
  2085. \-/279: O: O558 (predict-no)
  2086. I see 1 and I'm going to do: predict-no
  2087. ENV: Agent did: predict-no for direction R in state State-B
  2088. In State-B moving R
  2089. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2090. predict error 0
  2091. dir: dir isL
  2092. |\280: O: O559 (predict-yes)
  2093. I see 1 and I'm going to do: predict-yes
  2094. ENV: Agent did: predict-yes for direction L in state State-B
  2095. In State-B moving L
  2096. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2097. predict error 0
  2098. dir: dir isR
  2099. -/|281: O: O561 (predict-yes)
  2100. I see 1 and I'm going to do: predict-yes
  2101. ENV: Agent did: predict-yes for direction R in state State-A
  2102. In State-A moving R
  2103. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2104. predict error 0
  2105. dir: dir isL
  2106. \282: O: O564 (predict-no)
  2107. I see 1 and I'm going to do: predict-no
  2108. ENV: Agent did: predict-no for direction L in state State-B
  2109. In State-B moving L
  2110. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2111. predict error 1
  2112. dir: dir isL
  2113. -/|283: O: O565 (predict-yes)
  2114. I see 0 and I'm going to do: predict-yes
  2115. ENV: Agent did: predict-yes for direction L in state State-A
  2116. In State-A moving L
  2117. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2118. predict error 1
  2119. dir: dir isL
  2120. \-/284: O: O568 (predict-no)
  2121. I see 0 and I'm going to do: predict-no
  2122. ENV: Agent did: predict-no for direction L in state State-A
  2123. In State-A moving L
  2124. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2125. predict error 0
  2126. dir: dir isR
  2127. |\-285: O: O569 (predict-yes)
  2128. I see 1 and I'm going to do: predict-yes
  2129. ENV: Agent did: predict-yes for direction R in state State-A
  2130. In State-A moving R
  2131. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2132. predict error 0
  2133. dir: dir isL
  2134. /|\286: O: O571 (predict-yes)
  2135. I see 1 and I'm going to do: predict-yes
  2136. ENV: Agent did: predict-yes for direction L in state State-B
  2137. In State-B moving L
  2138. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2139. predict error 0
  2140. dir: dir isR
  2141. -/287: O: O573 (predict-yes)
  2142. I see 1 and I'm going to do: predict-yes
  2143. ENV: Agent did: predict-yes for direction R in state State-A
  2144. In State-A moving R
  2145. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2146. predict error 0
  2147. dir: dir isL
  2148. |288: O: O575 (predict-yes)
  2149. I see 1 and I'm going to do: predict-yes
  2150. ENV: Agent did: predict-yes for direction L in state State-B
  2151. In State-B moving L
  2152. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2153. predict error 0
  2154. dir: dir isR
  2155. \-/|289: O: O577 (predict-yes)
  2156. I see 1 and I'm going to do: predict-yes
  2157. ENV: Agent did: predict-yes for direction R in state State-A
  2158. In State-A moving R
  2159. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2160. predict error 0
  2161. dir: dir isL
  2162. \-290: O: O579 (predict-yes)
  2163. I see 1 and I'm going to do: predict-yes
  2164. ENV: Agent did: predict-yes for direction L in state State-B
  2165. In State-B moving L
  2166. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2167. predict error 0
  2168. dir: dir isU
  2169. /|291: O: O582 (predict-no)
  2170. I see 1 and I'm going to do: predict-no
  2171. ENV: Agent did: predict-no for direction U in state State-A
  2172. In State-A moving U
  2173. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2174. predict error 0
  2175. dir: dir isR
  2176. \292: O: O583 (predict-yes)
  2177. I see 1 and I'm going to do: predict-yes
  2178. ENV: Agent did: predict-yes for direction R in state State-A
  2179. In State-A moving R
  2180. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2181. predict error 0
  2182. dir: dir isU
  2183. -/|293: O: O585 (predict-yes)
  2184. I see 1 and I'm going to do: predict-yes
  2185. ENV: Agent did: predict-yes for direction U in state State-B
  2186. In State-B moving U
  2187. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2188. predict error 1
  2189. dir: dir isU
  2190. \-294: O: O587 (predict-yes)
  2191. I see 0 and I'm going to do: predict-yes
  2192. ENV: Agent did: predict-yes for direction U in state State-B
  2193. In State-B moving U
  2194. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2195. predict error 1
  2196. dir: dir isR
  2197. /|\295: O: O590 (predict-no)
  2198. I see 0 and I'm going to do: predict-no
  2199. ENV: Agent did: predict-no for direction R in state State-B
  2200. In State-B moving R
  2201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2202. predict error 0
  2203. dir: dir isR
  2204. -/|296: O: O592 (predict-no)
  2205. I see 1 and I'm going to do: predict-no
  2206. ENV: Agent did: predict-no for direction R in state State-B
  2207. In State-B moving R
  2208. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2209. predict error 0
  2210. dir: dir isU
  2211. \-/297: O: O594 (predict-no)
  2212. I see 1 and I'm going to do: predict-no
  2213. ENV: Agent did: predict-no for direction U in state State-B
  2214. In State-B moving U
  2215. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2216. predict error 0
  2217. dir: dir isR
  2218. |\-298: O: O596 (predict-no)
  2219. I see 1 and I'm going to do: predict-no
  2220. ENV: Agent did: predict-no for direction R in state State-B
  2221. In State-B moving R
  2222. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2223. predict error 0
  2224. dir: dir isL
  2225. /|\299: O: O597 (predict-yes)
  2226. I see 1 and I'm going to do: predict-yes
  2227. ENV: Agent did: predict-yes for direction L in state State-B
  2228. In State-B moving L
  2229. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2230. predict error 0
  2231. dir: dir isU
  2232. -/|300: O: O600 (predict-no)
  2233. I see 1 and I'm going to do: predict-no
  2234. ENV: Agent did: predict-no for direction U in state State-A
  2235. In State-A moving U
  2236. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2237. predict error 0
  2238. dir: dir isU
  2239. \-/|\-301: O: O602 (predict-no)
  2240. I see 1 and I'm going to do: predict-no
  2241. ENV: Agent did: predict-no for direction U in state State-A
  2242. In State-A moving U
  2243. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2244. predict error 0
  2245. dir: dir isU
  2246. /302: O: O604 (predict-no)
  2247. I see 1 and I'm going to do: predict-no
  2248. ENV: Agent did: predict-no for direction U in state State-A
  2249. In State-A moving U
  2250. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2251. predict error 0
  2252. dir: dir isR
  2253. |\303: O: O605 (predict-yes)
  2254. I see 1 and I'm going to do: predict-yes
  2255. ENV: Agent did: predict-yes for direction R in state State-A
  2256. In State-A moving R
  2257. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2258. predict error 0
  2259. dir: dir isR
  2260. -/|304: O: O608 (predict-no)
  2261. I see 1 and I'm going to do: predict-no
  2262. ENV: Agent did: predict-no for direction R in state State-B
  2263. In State-B moving R
  2264. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2265. predict error 0
  2266. dir: dir isU
  2267. \-/305: O: O610 (predict-no)
  2268. I see 1 and I'm going to do: predict-no
  2269. ENV: Agent did: predict-no for direction U in state State-B
  2270. In State-B moving U
  2271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2272. predict error 0
  2273. dir: dir isR
  2274. |\-306: O: O612 (predict-no)
  2275. I see 1 and I'm going to do: predict-no
  2276. ENV: Agent did: predict-no for direction R in state State-B
  2277. In State-B moving R
  2278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2279. predict error 0
  2280. dir: dir isL
  2281. /|\307: O: O613 (predict-yes)
  2282. I see 1 and I'm going to do: predict-yes
  2283. ENV: Agent did: predict-yes for direction L in state State-B
  2284. In State-B moving L
  2285. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2286. predict error 0
  2287. dir: dir isL
  2288. -308: O: O616 (predict-no)
  2289. I see 1 and I'm going to do: predict-no
  2290. ENV: Agent did: predict-no for direction L in state State-A
  2291. In State-A moving L
  2292. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2293. predict error 0
  2294. dir: dir isU
  2295. /|\309: O: O618 (predict-no)
  2296. I see 1 and I'm going to do: predict-no
  2297. ENV: Agent did: predict-no for direction U in state State-A
  2298. In State-A moving U
  2299. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2300. predict error 0
  2301. dir: dir isL
  2302. -/|310: O: O620 (predict-no)
  2303. I see 1 and I'm going to do: predict-no
  2304. ENV: Agent did: predict-no for direction L in state State-A
  2305. In State-A moving L
  2306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2307. predict error 0
  2308. dir: dir isL
  2309. \-311: O: O622 (predict-no)
  2310. I see 1 and I'm going to do: predict-no
  2311. ENV: Agent did: predict-no for direction L in state State-A
  2312. In State-A moving L
  2313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2314. predict error 0
  2315. dir: dir isR
  2316. /312: O: O623 (predict-yes)
  2317. I see 1 and I'm going to do: predict-yes
  2318. ENV: Agent did: predict-yes for direction R in state State-A
  2319. In State-A moving R
  2320. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2321. predict error 0
  2322. dir: dir isR
  2323. |\-313: O: O626 (predict-no)
  2324. I see 1 and I'm going to do: predict-no
  2325. ENV: Agent did: predict-no for direction R in state State-B
  2326. In State-B moving R
  2327. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2328. predict error 0
  2329. dir: dir isR
  2330. /|\314: O: O628 (predict-no)
  2331. I see 1 and I'm going to do: predict-no
  2332. ENV: Agent did: predict-no for direction R in state State-B
  2333. In State-B moving R
  2334. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2335. predict error 0
  2336. dir: dir isR
  2337. -/|315: O: O630 (predict-no)
  2338. I see 1 and I'm going to do: predict-no
  2339. ENV: Agent did: predict-no for direction R in state State-B
  2340. In State-B moving R
  2341. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2342. predict error 0
  2343. dir: dir isR
  2344. \-/316: O: O632 (predict-no)
  2345. I see 1 and I'm going to do: predict-no
  2346. ENV: Agent did: predict-no for direction R in state State-B
  2347. In State-B moving R
  2348. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2349. predict error 0
  2350. dir: dir isU
  2351. |\-317: O: O634 (predict-no)
  2352. I see 1 and I'm going to do: predict-no
  2353. ENV: Agent did: predict-no for direction U in state State-B
  2354. In State-B moving U
  2355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2356. predict error 0
  2357. dir: dir isR
  2358. /|318: O: O636 (predict-no)
  2359. I see 1 and I'm going to do: predict-no
  2360. ENV: Agent did: predict-no for direction R in state State-B
  2361. In State-B moving R
  2362. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2363. predict error 0
  2364. dir: dir isR
  2365. \319: O: O638 (predict-no)
  2366. I see 1 and I'm going to do: predict-no
  2367. ENV: Agent did: predict-no for direction R in state State-B
  2368. In State-B moving R
  2369. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2370. predict error 0
  2371. dir: dir isU
  2372. -/|320: O: O640 (predict-no)
  2373. I see 1 and I'm going to do: predict-no
  2374. ENV: Agent did: predict-no for direction U in state State-B
  2375. In State-B moving U
  2376. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2377. predict error 0
  2378. dir: dir isL
  2379. \-/321: O: O641 (predict-yes)
  2380. I see 1 and I'm going to do: predict-yes
  2381. ENV: Agent did: predict-yes for direction L in state State-B
  2382. In State-B moving L
  2383. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2384. predict error 0
  2385. dir: dir isU
  2386. |322: O: O644 (predict-no)
  2387. I see 1 and I'm going to do: predict-no
  2388. ENV: Agent did: predict-no for direction U in state State-A
  2389. In State-A moving U
  2390. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2391. predict error 0
  2392. dir: dir isR
  2393. \-323: O: O645 (predict-yes)
  2394. I see 1 and I'm going to do: predict-yes
  2395. ENV: Agent did: predict-yes for direction R in state State-A
  2396. In State-A moving R
  2397. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2398. predict error 0
  2399. dir: dir isR
  2400. /|\324: O: O648 (predict-no)
  2401. I see 1 and I'm going to do: predict-no
  2402. ENV: Agent did: predict-no for direction R in state State-B
  2403. In State-B moving R
  2404. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2405. predict error 0
  2406. dir: dir isL
  2407. -/325: O: O649 (predict-yes)
  2408. I see 1 and I'm going to do: predict-yes
  2409. ENV: Agent did: predict-yes for direction L in state State-B
  2410. In State-B moving L
  2411. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2412. predict error 0
  2413. dir: dir isU
  2414. |\-326: O: O652 (predict-no)
  2415. I see 1 and I'm going to do: predict-no
  2416. ENV: Agent did: predict-no for direction U in state State-A
  2417. In State-A moving U
  2418. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2419. predict error 0
  2420. dir: dir isU
  2421. /|\327: O: O653 (predict-yes)
  2422. I see 1 and I'm going to do: predict-yes
  2423. ENV: Agent did: predict-yes for direction U in state State-A
  2424. In State-A moving U
  2425. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2426. predict error 1
  2427. dir: dir isU
  2428. -/|328: O: O656 (predict-no)
  2429. I see 0 and I'm going to do: predict-no
  2430. ENV: Agent did: predict-no for direction U in state State-A
  2431. In State-A moving U
  2432. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2433. predict error 0
  2434. dir: dir isR
  2435. \-/329: O: O657 (predict-yes)
  2436. I see 1 and I'm going to do: predict-yes
  2437. ENV: Agent did: predict-yes for direction R in state State-A
  2438. In State-A moving R
  2439. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2440. predict error 0
  2441. dir: dir isU
  2442. |\-330: O: O660 (predict-no)
  2443. I see 1 and I'm going to do: predict-no
  2444. ENV: Agent did: predict-no for direction U in state State-B
  2445. In State-B moving U
  2446. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2447. predict error 0
  2448. dir: dir isL
  2449. /|331: O: O661 (predict-yes)
  2450. I see 1 and I'm going to do: predict-yes
  2451. ENV: Agent did: predict-yes for direction L in state State-B
  2452. In State-B moving L
  2453. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2454. predict error 0
  2455. dir: dir isR
  2456. \332: O: O663 (predict-yes)
  2457. I see 1 and I'm going to do: predict-yes
  2458. ENV: Agent did: predict-yes for direction R in state State-A
  2459. In State-A moving R
  2460. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2461. predict error 0
  2462. dir: dir isL
  2463. -/333: O: O665 (predict-yes)
  2464. I see 1 and I'm going to do: predict-yes
  2465. ENV: Agent did: predict-yes for direction L in state State-B
  2466. In State-B moving L
  2467. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2468. predict error 0
  2469. dir: dir isL
  2470. |\334: O: O668 (predict-no)
  2471. I see 1 and I'm going to do: predict-no
  2472. ENV: Agent did: predict-no for direction L in state State-A
  2473. In State-A moving L
  2474. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2475. predict error 0
  2476. dir: dir isU
  2477. -/|335: O: O670 (predict-no)
  2478. I see 1 and I'm going to do: predict-no
  2479. ENV: Agent did: predict-no for direction U in state State-A
  2480. In State-A moving U
  2481. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2482. predict error 0
  2483. dir: dir isL
  2484. \-336: O: O672 (predict-no)
  2485. I see 1 and I'm going to do: predict-no
  2486. ENV: Agent did: predict-no for direction L in state State-A
  2487. In State-A moving L
  2488. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2489. predict error 0
  2490. dir: dir isL
  2491. /|\337: O: O674 (predict-no)
  2492. I see 1 and I'm going to do: predict-no
  2493. ENV: Agent did: predict-no for direction L in state State-A
  2494. In State-A moving L
  2495. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2496. predict error 0
  2497. dir: dir isL
  2498. -/|338: O: O676 (predict-no)
  2499. I see 1 and I'm going to do: predict-no
  2500. ENV: Agent did: predict-no for direction L in state State-A
  2501. In State-A moving L
  2502. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2503. predict error 0
  2504. dir: dir isR
  2505. \-/339: O: O677 (predict-yes)
  2506. I see 1 and I'm going to do: predict-yes
  2507. ENV: Agent did: predict-yes for direction R in state State-A
  2508. In State-A moving R
  2509. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2510. predict error 0
  2511. dir: dir isR
  2512. |\340: O: O680 (predict-no)
  2513. I see 1 and I'm going to do: predict-no
  2514. ENV: Agent did: predict-no for direction R in state State-B
  2515. In State-B moving R
  2516. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2517. predict error 0
  2518. dir: dir isL
  2519. -/341: O: O681 (predict-yes)
  2520. I see 1 and I'm going to do: predict-yes
  2521. ENV: Agent did: predict-yes for direction L in state State-B
  2522. In State-B moving L
  2523. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2524. predict error 0
  2525. dir: dir isU
  2526. |342: O: O684 (predict-no)
  2527. I see 1 and I'm going to do: predict-no
  2528. ENV: Agent did: predict-no for direction U in state State-A
  2529. In State-A moving U
  2530. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2531. predict error 0
  2532. dir: dir isU
  2533. \-/343: O: O686 (predict-no)
  2534. I see 1 and I'm going to do: predict-no
  2535. ENV: Agent did: predict-no for direction U in state State-A
  2536. In State-A moving U
  2537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2538. predict error 0
  2539. dir: dir isL
  2540. |\-344: O: O688 (predict-no)
  2541. I see 1 and I'm going to do: predict-no
  2542. ENV: Agent did: predict-no for direction L in state State-A
  2543. In State-A moving L
  2544. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2545. predict error 0
  2546. dir: dir isR
  2547. /|\345: O: O689 (predict-yes)
  2548. I see 1 and I'm going to do: predict-yes
  2549. ENV: Agent did: predict-yes for direction R in state State-A
  2550. In State-A moving R
  2551. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2552. predict error 0
  2553. dir: dir isU
  2554. -/|346: O: O692 (predict-no)
  2555. I see 1 and I'm going to do: predict-no
  2556. ENV: Agent did: predict-no for direction U in state State-B
  2557. In State-B moving U
  2558. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2559. predict error 0
  2560. dir: dir isU
  2561. \-/347: O: O694 (predict-no)
  2562. I see 1 and I'm going to do: predict-no
  2563. ENV: Agent did: predict-no for direction U in state State-B
  2564. In State-B moving U
  2565. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2566. predict error 0
  2567. dir: dir isR
  2568. |\348: O: O696 (predict-no)
  2569. I see 1 and I'm going to do: predict-no
  2570. ENV: Agent did: predict-no for direction R in state State-B
  2571. In State-B moving R
  2572. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2573. predict error 0
  2574. dir: dir isU
  2575. -/|349: O: O698 (predict-no)
  2576. I see 1 and I'm going to do: predict-no
  2577. ENV: Agent did: predict-no for direction U in state State-B
  2578. In State-B moving U
  2579. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2580. predict error 0
  2581. dir: dir isL
  2582. \-350: O: O699 (predict-yes)
  2583. I see 1 and I'm going to do: predict-yes
  2584. ENV: Agent did: predict-yes for direction L in state State-B
  2585. In State-B moving L
  2586. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2587. predict error 0
  2588. dir: dir isR
  2589. /|\351: O: O701 (predict-yes)
  2590. I see 1 and I'm going to do: predict-yes
  2591. ENV: Agent did: predict-yes for direction R in state State-A
  2592. In State-A moving R
  2593. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2594. predict error 0
  2595. dir: dir isR
  2596. -352: O: O704 (predict-no)
  2597. I see 1 and I'm going to do: predict-no
  2598. ENV: Agent did: predict-no for direction R in state State-B
  2599. In State-B moving R
  2600. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2601. predict error 0
  2602. dir: dir isU
  2603. /|353: O: O706 (predict-no)
  2604. I see 1 and I'm going to do: predict-no
  2605. ENV: Agent did: predict-no for direction U in state State-B
  2606. In State-B moving U
  2607. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2608. predict error 0
  2609. dir: dir isL
  2610. \-/354: O: O707 (predict-yes)
  2611. I see 1 and I'm going to do: predict-yes
  2612. ENV: Agent did: predict-yes for direction L in state State-B
  2613. In State-B moving L
  2614. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2615. predict error 0
  2616. dir: dir isR
  2617. |\-355: O: O710 (predict-no)
  2618. I see 1 and I'm going to do: predict-no
  2619. ENV: Agent did: predict-no for direction R in state State-A
  2620. In State-A moving R
  2621. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2622. predict error 1
  2623. dir: dir isL
  2624. /|\356: O: O711 (predict-yes)
  2625. I see 0 and I'm going to do: predict-yes
  2626. ENV: Agent did: predict-yes for direction L in state State-B
  2627. In State-B moving L
  2628. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2629. predict error 0
  2630. dir: dir isR
  2631. -/|357: O: O713 (predict-yes)
  2632. I see 1 and I'm going to do: predict-yes
  2633. ENV: Agent did: predict-yes for direction R in state State-A
  2634. In State-A moving R
  2635. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2636. predict error 0
  2637. dir: dir isU
  2638. \-/358: O: O716 (predict-no)
  2639. I see 1 and I'm going to do: predict-no
  2640. ENV: Agent did: predict-no for direction U in state State-B
  2641. In State-B moving U
  2642. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2643. predict error 0
  2644. dir: dir isU
  2645. |\359: O: O718 (predict-no)
  2646. I see 1 and I'm going to do: predict-no
  2647. ENV: Agent did: predict-no for direction U in state State-B
  2648. In State-B moving U
  2649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2650. predict error 0
  2651. dir: dir isU
  2652. -/|360: O: O720 (predict-no)
  2653. I see 1 and I'm going to do: predict-no
  2654. ENV: Agent did: predict-no for direction U in state State-B
  2655. In State-B moving U
  2656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2657. predict error 0
  2658. dir: dir isL
  2659. \-361: O: O721 (predict-yes)
  2660. I see 1 and I'm going to do: predict-yes
  2661. ENV: Agent did: predict-yes for direction L in state State-B
  2662. In State-B moving L
  2663. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2664. predict error 0
  2665. dir: dir isL
  2666. /362: O: O724 (predict-no)
  2667. I see 1 and I'm going to do: predict-no
  2668. ENV: Agent did: predict-no for direction L in state State-A
  2669. In State-A moving L
  2670. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2671. predict error 0
  2672. dir: dir isL
  2673. |\363: O: O726 (predict-no)
  2674. I see 1 and I'm going to do: predict-no
  2675. ENV: Agent did: predict-no for direction L in state State-A
  2676. In State-A moving L
  2677. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2678. predict error 0
  2679. dir: dir isU
  2680. -/364: O: O728 (predict-no)
  2681. I see 1 and I'm going to do: predict-no
  2682. ENV: Agent did: predict-no for direction U in state State-A
  2683. In State-A moving U
  2684. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2685. predict error 0
  2686. dir: dir isU
  2687. |\-365: O: O730 (predict-no)
  2688. I see 1 and I'm going to do: predict-no
  2689. ENV: Agent did: predict-no for direction U in state State-A
  2690. In State-A moving U
  2691. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2692. predict error 0
  2693. dir: dir isR
  2694. /|\366: O: O731 (predict-yes)
  2695. I see 1 and I'm going to do: predict-yes
  2696. ENV: Agent did: predict-yes for direction R in state State-A
  2697. In State-A moving R
  2698. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2699. predict error 0
  2700. dir: dir isU
  2701. -/367: O: O734 (predict-no)
  2702. I see 1 and I'm going to do: predict-no
  2703. ENV: Agent did: predict-no for direction U in state State-B
  2704. In State-B moving U
  2705. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2706. predict error 0
  2707. dir: dir isU
  2708. |368: O: O735 (predict-yes)
  2709. I see 1 and I'm going to do: predict-yes
  2710. ENV: Agent did: predict-yes for direction U in state State-B
  2711. In State-B moving U
  2712. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2713. predict error 1
  2714. dir: dir isL
  2715. \369: O: O737 (predict-yes)
  2716. I see 0 and I'm going to do: predict-yes
  2717. ENV: Agent did: predict-yes for direction L in state State-B
  2718. In State-B moving L
  2719. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2720. predict error 0
  2721. dir: dir isL
  2722. -/370: O: O740 (predict-no)
  2723. I see 1 and I'm going to do: predict-no
  2724. ENV: Agent did: predict-no for direction L in state State-A
  2725. In State-A moving L
  2726. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2727. predict error 0
  2728. dir: dir isU
  2729. |\-371: O: O742 (predict-no)
  2730. I see 1 and I'm going to do: predict-no
  2731. ENV: Agent did: predict-no for direction U in state State-A
  2732. In State-A moving U
  2733. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2734. predict error 0
  2735. dir: dir isL
  2736. /372: O: O744 (predict-no)
  2737. I see 1 and I'm going to do: predict-no
  2738. ENV: Agent did: predict-no for direction L in state State-A
  2739. In State-A moving L
  2740. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2741. predict error 0
  2742. dir: dir isL
  2743. |\-373: O: O746 (predict-no)
  2744. I see 1 and I'm going to do: predict-no
  2745. ENV: Agent did: predict-no for direction L in state State-A
  2746. In State-A moving L
  2747. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2748. predict error 0
  2749. dir: dir isL
  2750. /|\374: O: O748 (predict-no)
  2751. I see 1 and I'm going to do: predict-no
  2752. ENV: Agent did: predict-no for direction L in state State-A
  2753. In State-A moving L
  2754. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2755. predict error 0
  2756. dir: dir isL
  2757. -/375: O: O750 (predict-no)
  2758. I see 1 and I'm going to do: predict-no
  2759. ENV: Agent did: predict-no for direction L in state State-A
  2760. In State-A moving L
  2761. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2762. predict error 0
  2763. dir: dir isU
  2764. |\376: O: O752 (predict-no)
  2765. I see 1 and I'm going to do: predict-no
  2766. ENV: Agent did: predict-no for direction U in state State-A
  2767. In State-A moving U
  2768. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2769. predict error 0
  2770. dir: dir isL
  2771. -/|377: O: O754 (predict-no)
  2772. I see 1 and I'm going to do: predict-no
  2773. ENV: Agent did: predict-no for direction L in state State-A
  2774. In State-A moving L
  2775. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2776. predict error 0
  2777. dir: dir isL
  2778. \-/378: O: O756 (predict-no)
  2779. I see 1 and I'm going to do: predict-no
  2780. ENV: Agent did: predict-no for direction L in state State-A
  2781. In State-A moving L
  2782. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2783. predict error 0
  2784. dir: dir isL
  2785. |\-379: O: O758 (predict-no)
  2786. I see 1 and I'm going to do: predict-no
  2787. ENV: Agent did: predict-no for direction L in state State-A
  2788. In State-A moving L
  2789. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2790. predict error 0
  2791. dir: dir isR
  2792. /|\380: O: O759 (predict-yes)
  2793. I see 1 and I'm going to do: predict-yes
  2794. ENV: Agent did: predict-yes for direction R in state State-A
  2795. In State-A moving R
  2796. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2797. predict error 0
  2798. dir: dir isU
  2799. -/|381: O: O762 (predict-no)
  2800. I see 1 and I'm going to do: predict-no
  2801. ENV: Agent did: predict-no for direction U in state State-B
  2802. In State-B moving U
  2803. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2804. predict error 0
  2805. dir: dir isR
  2806. \382: O: O764 (predict-no)
  2807. I see 1 and I'm going to do: predict-no
  2808. ENV: Agent did: predict-no for direction R in state State-B
  2809. In State-B moving R
  2810. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2811. predict error 0
  2812. dir: dir isU
  2813. -/|383: O: O766 (predict-no)
  2814. I see 1 and I'm going to do: predict-no
  2815. ENV: Agent did: predict-no for direction U in state State-B
  2816. In State-B moving U
  2817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2818. predict error 0
  2819. dir: dir isR
  2820. \-/384: O: O768 (predict-no)
  2821. I see 1 and I'm going to do: predict-no
  2822. ENV: Agent did: predict-no for direction R in state State-B
  2823. In State-B moving R
  2824. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2825. predict error 0
  2826. dir: dir isR
  2827. |385: O: O770 (predict-no)
  2828. I see 1 and I'm going to do: predict-no
  2829. ENV: Agent did: predict-no for direction R in state State-B
  2830. In State-B moving R
  2831. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2832. predict error 0
  2833. dir: dir isU
  2834. \-/386: O: O772 (predict-no)
  2835. I see 1 and I'm going to do: predict-no
  2836. ENV: Agent did: predict-no for direction U in state State-B
  2837. In State-B moving U
  2838. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2839. predict error 0
  2840. dir: dir isU
  2841. |\387: O: O774 (predict-no)
  2842. I see 1 and I'm going to do: predict-no
  2843. ENV: Agent did: predict-no for direction U in state State-B
  2844. In State-B moving U
  2845. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2846. predict error 0
  2847. dir: dir isU
  2848. -/|388: O: O776 (predict-no)
  2849. I see 1 and I'm going to do: predict-no
  2850. ENV: Agent did: predict-no for direction U in state State-B
  2851. In State-B moving U
  2852. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2853. predict error 0
  2854. dir: dir isU
  2855. \-389: O: O778 (predict-no)
  2856. I see 1 and I'm going to do: predict-no
  2857. ENV: Agent did: predict-no for direction U in state State-B
  2858. In State-B moving U
  2859. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2860. predict error 0
  2861. dir: dir isU
  2862. /|\390: O: O780 (predict-no)
  2863. I see 1 and I'm going to do: predict-no
  2864. ENV: Agent did: predict-no for direction U in state State-B
  2865. In State-B moving U
  2866. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2867. predict error 0
  2868. dir: dir isU
  2869. -/391: O: O782 (predict-no)
  2870. I see 1 and I'm going to do: predict-no
  2871. ENV: Agent did: predict-no for direction U in state State-B
  2872. In State-B moving U
  2873. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2874. predict error 0
  2875. dir: dir isL
  2876. |392: O: O784 (predict-no)
  2877. I see 1 and I'm going to do: predict-no
  2878. ENV: Agent did: predict-no for direction L in state State-B
  2879. In State-B moving L
  2880. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2881. predict error 1
  2882. dir: dir isR
  2883. \-393: O: O785 (predict-yes)
  2884. I see 0 and I'm going to do: predict-yes
  2885. ENV: Agent did: predict-yes for direction R in state State-A
  2886. In State-A moving R
  2887. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2888. predict error 0
  2889. dir: dir isR
  2890. /394: O: O788 (predict-no)
  2891. I see 1 and I'm going to do: predict-no
  2892. ENV: Agent did: predict-no for direction R in state State-B
  2893. In State-B moving R
  2894. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2895. predict error 0
  2896. dir: dir isU
  2897. |\-395: O: O790 (predict-no)
  2898. I see 1 and I'm going to do: predict-no
  2899. ENV: Agent did: predict-no for direction U in state State-B
  2900. In State-B moving U
  2901. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2902. predict error 0
  2903. dir: dir isR
  2904. /|\396: O: O792 (predict-no)
  2905. I see 1 and I'm going to do: predict-no
  2906. ENV: Agent did: predict-no for direction R in state State-B
  2907. In State-B moving R
  2908. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2909. predict error 0
  2910. dir: dir isU
  2911. -/397: O: O794 (predict-no)
  2912. I see 1 and I'm going to do: predict-no
  2913. ENV: Agent did: predict-no for direction U in state State-B
  2914. In State-B moving U
  2915. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2916. predict error 0
  2917. dir: dir isR
  2918. |\398: O: O796 (predict-no)
  2919. I see 1 and I'm going to do: predict-no
  2920. ENV: Agent did: predict-no for direction R in state State-B
  2921. In State-B moving R
  2922. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2923. predict error 0
  2924. dir: dir isR
  2925. -/|399: O: O798 (predict-no)
  2926. I see 1 and I'm going to do: predict-no
  2927. ENV: Agent did: predict-no for direction R in state State-B
  2928. In State-B moving R
  2929. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2930. predict error 0
  2931. dir: dir isU
  2932. \-/400: O: O800 (predict-no)
  2933. I see 1 and I'm going to do: predict-no
  2934. ENV: Agent did: predict-no for direction U in state State-B
  2935. In State-B moving U
  2936. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2937. predict error 0
  2938. dir: dir isU
  2939. |\-401: O: O802 (predict-no)
  2940. I see 1 and I'm going to do: predict-no
  2941. ENV: Agent did: predict-no for direction U in state State-B
  2942. In State-B moving U
  2943. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2944. predict error 0
  2945. dir: dir isR
  2946. /402: O: O804 (predict-no)
  2947. I see 1 and I'm going to do: predict-no
  2948. ENV: Agent did: predict-no for direction R in state State-B
  2949. In State-B moving R
  2950. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2951. predict error 0
  2952. dir: dir isL
  2953. |\-403: O: O805 (predict-yes)
  2954. I see 1 and I'm going to do: predict-yes
  2955. ENV: Agent did: predict-yes for direction L in state State-B
  2956. In State-B moving L
  2957. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2958. predict error 0
  2959. dir: dir isL
  2960. /|\404: O: O808 (predict-no)
  2961. I see 1 and I'm going to do: predict-no
  2962. ENV: Agent did: predict-no for direction L in state State-A
  2963. In State-A moving L
  2964. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2965. predict error 0
  2966. dir: dir isR
  2967. -/405: O: O809 (predict-yes)
  2968. I see 1 and I'm going to do: predict-yes
  2969. ENV: Agent did: predict-yes for direction R in state State-A
  2970. In State-A moving R
  2971. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2972. predict error 0
  2973. dir: dir isL
  2974. |\406: O: O811 (predict-yes)
  2975. I see 1 and I'm going to do: predict-yes
  2976. ENV: Agent did: predict-yes for direction L in state State-B
  2977. In State-B moving L
  2978. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2979. predict error 0
  2980. dir: dir isL
  2981. -/|407: O: O813 (predict-yes)
  2982. I see 1 and I'm going to do: predict-yes
  2983. ENV: Agent did: predict-yes for direction L in state State-A
  2984. In State-A moving L
  2985. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2986. predict error 1
  2987. dir: dir isU
  2988. \-/408: O: O816 (predict-no)
  2989. I see 0 and I'm going to do: predict-no
  2990. ENV: Agent did: predict-no for direction U in state State-A
  2991. In State-A moving U
  2992. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2993. predict error 0
  2994. dir: dir isU
  2995. |\-409: O: O818 (predict-no)
  2996. I see 1 and I'm going to do: predict-no
  2997. ENV: Agent did: predict-no for direction U in state State-A
  2998. In State-A moving U
  2999. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3000. predict error 0
  3001. dir: dir isL
  3002. /|\410: O: O820 (predict-no)
  3003. I see 1 and I'm going to do: predict-no
  3004. ENV: Agent did: predict-no for direction L in state State-A
  3005. In State-A moving L
  3006. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3007. predict error 0
  3008. dir: dir isR
  3009. -/|411: O: O821 (predict-yes)
  3010. I see 1 and I'm going to do: predict-yes
  3011. ENV: Agent did: predict-yes for direction R in state State-A
  3012. In State-A moving R
  3013. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3014. predict error 0
  3015. dir: dir isU
  3016. \412: O: O824 (predict-no)
  3017. I see 1 and I'm going to do: predict-no
  3018. ENV: Agent did: predict-no for direction U in state State-B
  3019. In State-B moving U
  3020. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3021. predict error 0
  3022. dir: dir isL
  3023. -/|413: O: O825 (predict-yes)
  3024. I see 1 and I'm going to do: predict-yes
  3025. ENV: Agent did: predict-yes for direction L in state State-B
  3026. In State-B moving L
  3027. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3028. predict error 0
  3029. dir: dir isR
  3030. \-/414: O: O827 (predict-yes)
  3031. I see 1 and I'm going to do: predict-yes
  3032. ENV: Agent did: predict-yes for direction R in state State-A
  3033. In State-A moving R
  3034. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3035. predict error 0
  3036. dir: dir isL
  3037. |\-415: O: O829 (predict-yes)
  3038. I see 1 and I'm going to do: predict-yes
  3039. ENV: Agent did: predict-yes for direction L in state State-B
  3040. In State-B moving L
  3041. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3042. predict error 0
  3043. dir: dir isL
  3044. /|\416: O: O832 (predict-no)
  3045. I see 1 and I'm going to do: predict-no
  3046. ENV: Agent did: predict-no for direction L in state State-A
  3047. In State-A moving L
  3048. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3049. predict error 0
  3050. dir: dir isU
  3051. -/417: O: O834 (predict-no)
  3052. I see 1 and I'm going to do: predict-no
  3053. ENV: Agent did: predict-no for direction U in state State-A
  3054. In State-A moving U
  3055. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3056. predict error 0
  3057. dir: dir isL
  3058. |\418: O: O836 (predict-no)
  3059. I see 1 and I'm going to do: predict-no
  3060. ENV: Agent did: predict-no for direction L in state State-A
  3061. In State-A moving L
  3062. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3063. predict error 0
  3064. dir: dir isL
  3065. -/|419: O: O838 (predict-no)
  3066. I see 1 and I'm going to do: predict-no
  3067. ENV: Agent did: predict-no for direction L in state State-A
  3068. In State-A moving L
  3069. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3070. predict error 0
  3071. dir: dir isR
  3072. \-420: O: O839 (predict-yes)
  3073. I see 1 and I'm going to do: predict-yes
  3074. ENV: Agent did: predict-yes for direction R in state State-A
  3075. In State-A moving R
  3076. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3077. predict error 0
  3078. dir: dir isR
  3079. /|\421: O: O842 (predict-no)
  3080. I see 1 and I'm going to do: predict-no
  3081. ENV: Agent did: predict-no for direction R in state State-B
  3082. In State-B moving R
  3083. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3084. predict error 0
  3085. dir: dir isU
  3086. -422: O: O843 (predict-yes)
  3087. I see 1 and I'm going to do: predict-yes
  3088. ENV: Agent did: predict-yes for direction U in state State-B
  3089. In State-B moving U
  3090. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3091. predict error 1
  3092. dir: dir isU
  3093. /|423: O: O846 (predict-no)
  3094. I see 0 and I'm going to do: predict-no
  3095. ENV: Agent did: predict-no for direction U in state State-B
  3096. In State-B moving U
  3097. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3098. predict error 0
  3099. dir: dir isU
  3100. \-/424: O: O848 (predict-no)
  3101. I see 1 and I'm going to do: predict-no
  3102. ENV: Agent did: predict-no for direction U in state State-B
  3103. In State-B moving U
  3104. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3105. predict error 0
  3106. dir: dir isL
  3107. |\425: O: O850 (predict-no)
  3108. I see 1 and I'm going to do: predict-no
  3109. ENV: Agent did: predict-no for direction L in state State-B
  3110. In State-B moving L
  3111. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3112. predict error 1
  3113. dir: dir isU
  3114. -/|426: O: O852 (predict-no)
  3115. I see 0 and I'm going to do: predict-no
  3116. ENV: Agent did: predict-no for direction U in state State-A
  3117. In State-A moving U
  3118. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3119. predict error 0
  3120. dir: dir isR
  3121. \-/427: O: O853 (predict-yes)
  3122. I see 1 and I'm going to do: predict-yes
  3123. ENV: Agent did: predict-yes for direction R in state State-A
  3124. In State-A moving R
  3125. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3126. predict error 0
  3127. dir: dir isR
  3128. |\-428: O: O856 (predict-no)
  3129. I see 1 and I'm going to do: predict-no
  3130. ENV: Agent did: predict-no for direction R in state State-B
  3131. In State-B moving R
  3132. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3133. predict error 0
  3134. dir: dir isR
  3135. /|\429: O: O858 (predict-no)
  3136. I see 1 and I'm going to do: predict-no
  3137. ENV: Agent did: predict-no for direction R in state State-B
  3138. In State-B moving R
  3139. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3140. predict error 0
  3141. dir: dir isL
  3142. -/|430: O: O860 (predict-no)
  3143. I see 1 and I'm going to do: predict-no
  3144. ENV: Agent did: predict-no for direction L in state State-B
  3145. In State-B moving L
  3146. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3147. predict error 1
  3148. dir: dir isR
  3149. \-/|431: O: O861 (predict-yes)
  3150. I see 0 and I'm going to do: predict-yes
  3151. ENV: Agent did: predict-yes for direction R in state State-A
  3152. In State-A moving R
  3153. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3154. predict error 0
  3155. dir: dir isL
  3156. \432: O: O863 (predict-yes)
  3157. I see 1 and I'm going to do: predict-yes
  3158. ENV: Agent did: predict-yes for direction L in state State-B
  3159. In State-B moving L
  3160. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3161. predict error 0
  3162. dir: dir isL
  3163. -/|433: O: O866 (predict-no)
  3164. I see 1 and I'm going to do: predict-no
  3165. ENV: Agent did: predict-no for direction L in state State-A
  3166. In State-A moving L
  3167. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3168. predict error 0
  3169. dir: dir isR
  3170. \434: O: O868 (predict-no)
  3171. I see 1 and I'm going to do: predict-no
  3172. ENV: Agent did: predict-no for direction R in state State-A
  3173. In State-A moving R
  3174. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3175. predict error 1
  3176. dir: dir isR
  3177. -/|435: O: O870 (predict-no)
  3178. I see 0 and I'm going to do: predict-no
  3179. ENV: Agent did: predict-no for direction R in state State-B
  3180. In State-B moving R
  3181. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3182. predict error 0
  3183. dir: dir isL
  3184. \-/436: O: O871 (predict-yes)
  3185. I see 1 and I'm going to do: predict-yes
  3186. ENV: Agent did: predict-yes for direction L in state State-B
  3187. In State-B moving L
  3188. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3189. predict error 0
  3190. dir: dir isR
  3191. |\-437: O: O873 (predict-yes)
  3192. I see 1 and I'm going to do: predict-yes
  3193. ENV: Agent did: predict-yes for direction R in state State-A
  3194. In State-A moving R
  3195. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3196. predict error 0
  3197. dir: dir isR
  3198. /|438: O: O876 (predict-no)
  3199. I see 1 and I'm going to do: predict-no
  3200. ENV: Agent did: predict-no for direction R in state State-B
  3201. In State-B moving R
  3202. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3203. predict error 0
  3204. dir: dir isR
  3205. \-/439: O: O878 (predict-no)
  3206. I see 1 and I'm going to do: predict-no
  3207. ENV: Agent did: predict-no for direction R in state State-B
  3208. In State-B moving R
  3209. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3210. predict error 0
  3211. dir: dir isU
  3212. |\-440: O: O879 (predict-yes)
  3213. I see 1 and I'm going to do: predict-yes
  3214. ENV: Agent did: predict-yes for direction U in state State-B
  3215. In State-B moving U
  3216. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3217. predict error 1
  3218. dir: dir isR
  3219. /|\441: O: O882 (predict-no)
  3220. I see 0 and I'm going to do: predict-no
  3221. ENV: Agent did: predict-no for direction R in state State-B
  3222. In State-B moving R
  3223. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3224. predict error 0
  3225. dir: dir isU
  3226. -442: O: O884 (predict-no)
  3227. I see 1 and I'm going to do: predict-no
  3228. ENV: Agent did: predict-no for direction U in state State-B
  3229. In State-B moving U
  3230. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3231. predict error 0
  3232. dir: dir isR
  3233. /|\443: O: O886 (predict-no)
  3234. I see 1 and I'm going to do: predict-no
  3235. ENV: Agent did: predict-no for direction R in state State-B
  3236. In State-B moving R
  3237. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3238. predict error 0
  3239. dir: dir isR
  3240. -/|444: O: O888 (predict-no)
  3241. I see 1 and I'm going to do: predict-no
  3242. ENV: Agent did: predict-no for direction R in state State-B
  3243. In State-B moving R
  3244. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3245. predict error 0
  3246. dir: dir isR
  3247. \-445: O: O890 (predict-no)
  3248. I see 1 and I'm going to do: predict-no
  3249. ENV: Agent did: predict-no for direction R in state State-B
  3250. In State-B moving R
  3251. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3252. predict error 0
  3253. dir: dir isR
  3254. /|446: O: O892 (predict-no)
  3255. I see 1 and I'm going to do: predict-no
  3256. ENV: Agent did: predict-no for direction R in state State-B
  3257. In State-B moving R
  3258. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3259. predict error 0
  3260. dir: dir isL
  3261. \447: O: O893 (predict-yes)
  3262. I see 1 and I'm going to do: predict-yes
  3263. ENV: Agent did: predict-yes for direction L in state State-B
  3264. In State-B moving L
  3265. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3266. predict error 0
  3267. dir: dir isU
  3268. -/|448: O: O896 (predict-no)
  3269. I see 1 and I'm going to do: predict-no
  3270. ENV: Agent did: predict-no for direction U in state State-A
  3271. In State-A moving U
  3272. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3273. predict error 0
  3274. dir: dir isR
  3275. \-/449: O: O897 (predict-yes)
  3276. I see 1 and I'm going to do: predict-yes
  3277. ENV: Agent did: predict-yes for direction R in state State-A
  3278. In State-A moving R
  3279. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3280. predict error 0
  3281. dir: dir isU
  3282. |\-450: O: O900 (predict-no)
  3283. I see 1 and I'm going to do: predict-no
  3284. ENV: Agent did: predict-no for direction U in state State-B
  3285. In State-B moving U
  3286. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3287. predict error 0
  3288. dir: dir isL
  3289. /451: O: O901 (predict-yes)
  3290. I see 1 and I'm going to do: predict-yes
  3291. ENV: Agent did: predict-yes for direction L in state State-B
  3292. In State-B moving L
  3293. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3294. predict error 0
  3295. dir: dir isU
  3296. |452: O: O904 (predict-no)
  3297. I see 1 and I'm going to do: predict-no
  3298. ENV: Agent did: predict-no for direction U in state State-A
  3299. In State-A moving U
  3300. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3301. predict error 0
  3302. dir: dir isU
  3303. \-/453: O: O906 (predict-no)
  3304. I see 1 and I'm going to do: predict-no
  3305. ENV: Agent did: predict-no for direction U in state State-A
  3306. In State-A moving U
  3307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3308. predict error 0
  3309. dir: dir isU
  3310. |\-454: O: O908 (predict-no)
  3311. I see 1 and I'm going to do: predict-no
  3312. ENV: Agent did: predict-no for direction U in state State-A
  3313. In State-A moving U
  3314. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3315. predict error 0
  3316. dir: dir isU
  3317. /|\-455: O: O910 (predict-no)
  3318. I see 1 and I'm going to do: predict-no
  3319. ENV: Agent did: predict-no for direction U in state State-A
  3320. In State-A moving U
  3321. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3322. predict error 0
  3323. dir: dir isU
  3324. /|\456: O: O912 (predict-no)
  3325. I see 1 and I'm going to do: predict-no
  3326. ENV: Agent did: predict-no for direction U in state State-A
  3327. In State-A moving U
  3328. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3329. predict error 0
  3330. dir: dir isU
  3331. -/|457: O: O914 (predict-no)
  3332. I see 1 and I'm going to do: predict-no
  3333. ENV: Agent did: predict-no for direction U in state State-A
  3334. In State-A moving U
  3335. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3336. predict error 0
  3337. dir: dir isR
  3338. \-/458: O: O915 (predict-yes)
  3339. I see 1 and I'm going to do: predict-yes
  3340. ENV: Agent did: predict-yes for direction R in state State-A
  3341. In State-A moving R
  3342. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3343. predict error 0
  3344. dir: dir isU
  3345. |\459: O: O918 (predict-no)
  3346. I see 1 and I'm going to do: predict-no
  3347. ENV: Agent did: predict-no for direction U in state State-B
  3348. In State-B moving U
  3349. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3350. predict error 0
  3351. dir: dir isL
  3352. -/460: O: O919 (predict-yes)
  3353. I see 1 and I'm going to do: predict-yes
  3354. ENV: Agent did: predict-yes for direction L in state State-B
  3355. In State-B moving L
  3356. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3357. predict error 0
  3358. dir: dir isU
  3359. |\-461: O: O922 (predict-no)
  3360. I see 1 and I'm going to do: predict-no
  3361. ENV: Agent did: predict-no for direction U in state State-A
  3362. In State-A moving U
  3363. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3364. predict error 0
  3365. dir: dir isR
  3366. /462: O: O923 (predict-yes)
  3367. I see 1 and I'm going to do: predict-yes
  3368. ENV: Agent did: predict-yes for direction R in state State-A
  3369. In State-A moving R
  3370. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3371. predict error 0
  3372. dir: dir isU
  3373. |\-463: O: O926 (predict-no)
  3374. I see 1 and I'm going to do: predict-no
  3375. ENV: Agent did: predict-no for direction U in state State-B
  3376. In State-B moving U
  3377. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3378. predict error 0
  3379. dir: dir isR
  3380. /|\464: O: O928 (predict-no)
  3381. I see 1 and I'm going to do: predict-no
  3382. ENV: Agent did: predict-no for direction R in state State-B
  3383. In State-B moving R
  3384. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3385. predict error 0
  3386. dir: dir isU
  3387. -/465: O: O930 (predict-no)
  3388. I see 1 and I'm going to do: predict-no
  3389. ENV: Agent did: predict-no for direction U in state State-B
  3390. In State-B moving U
  3391. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3392. predict error 0
  3393. dir: dir isL
  3394. |\466: O: O931 (predict-yes)
  3395. I see 1 and I'm going to do: predict-yes
  3396. ENV: Agent did: predict-yes for direction L in state State-B
  3397. In State-B moving L
  3398. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3399. predict error 0
  3400. dir: dir isL
  3401. -467: O: O934 (predict-no)
  3402. I see 1 and I'm going to do: predict-no
  3403. ENV: Agent did: predict-no for direction L in state State-A
  3404. In State-A moving L
  3405. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3406. predict error 0
  3407. dir: dir isU
  3408. /|\468: O: O936 (predict-no)
  3409. I see 1 and I'm going to do: predict-no
  3410. ENV: Agent did: predict-no for direction U in state State-A
  3411. In State-A moving U
  3412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3413. predict error 0
  3414. dir: dir isR
  3415. -/469: O: O937 (predict-yes)
  3416. I see 1 and I'm going to do: predict-yes
  3417. ENV: Agent did: predict-yes for direction R in state State-A
  3418. In State-A moving R
  3419. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3420. predict error 0
  3421. dir: dir isU
  3422. |\-470: O: O940 (predict-no)
  3423. I see 1 and I'm going to do: predict-no
  3424. ENV: Agent did: predict-no for direction U in state State-B
  3425. In State-B moving U
  3426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3427. predict error 0
  3428. dir: dir isU
  3429. /|\471: O: O942 (predict-no)
  3430. I see 1 and I'm going to do: predict-no
  3431. ENV: Agent did: predict-no for direction U in state State-B
  3432. In State-B moving U
  3433. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3434. predict error 0
  3435. dir: dir isR
  3436. -472: O: O944 (predict-no)
  3437. I see 1 and I'm going to do: predict-no
  3438. ENV: Agent did: predict-no for direction R in state State-B
  3439. In State-B moving R
  3440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3441. predict error 0
  3442. dir: dir isR
  3443. /|\473: O: O946 (predict-no)
  3444. I see 1 and I'm going to do: predict-no
  3445. ENV: Agent did: predict-no for direction R in state State-B
  3446. In State-B moving R
  3447. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3448. predict error 0
  3449. dir: dir isL
  3450. -/474: O: O947 (predict-yes)
  3451. I see 1 and I'm going to do: predict-yes
  3452. ENV: Agent did: predict-yes for direction L in state State-B
  3453. In State-B moving L
  3454. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3455. predict error 0
  3456. dir: dir isL
  3457. |\-475: O: O950 (predict-no)
  3458. I see 1 and I'm going to do: predict-no
  3459. ENV: Agent did: predict-no for direction L in state State-A
  3460. In State-A moving L
  3461. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3462. predict error 0
  3463. dir: dir isU
  3464. /|\476: O: O952 (predict-no)
  3465. I see 1 and I'm going to do: predict-no
  3466. ENV: Agent did: predict-no for direction U in state State-A
  3467. In State-A moving U
  3468. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3469. predict error 0
  3470. dir: dir isU
  3471. -/477: O: O954 (predict-no)
  3472. I see 1 and I'm going to do: predict-no
  3473. ENV: Agent did: predict-no for direction U in state State-A
  3474. In State-A moving U
  3475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3476. predict error 0
  3477. dir: dir isU
  3478. |\-478: O: O956 (predict-no)
  3479. I see 1 and I'm going to do: predict-no
  3480. ENV: Agent did: predict-no for direction U in state State-A
  3481. In State-A moving U
  3482. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3483. predict error 0
  3484. dir: dir isU
  3485. /479: O: O958 (predict-no)
  3486. I see 1 and I'm going to do: predict-no
  3487. ENV: Agent did: predict-no for direction U in state State-A
  3488. In State-A moving U
  3489. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3490. predict error 0
  3491. dir: dir isR
  3492. |\-480: O: O959 (predict-yes)
  3493. I see 1 and I'm going to do: predict-yes
  3494. ENV: Agent did: predict-yes for direction R in state State-A
  3495. In State-A moving R
  3496. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3497. predict error 0
  3498. dir: dir isL
  3499. /|481: O: O961 (predict-yes)
  3500. I see 1 and I'm going to do: predict-yes
  3501. ENV: Agent did: predict-yes for direction L in state State-B
  3502. In State-B moving L
  3503. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3504. predict error 0
  3505. dir: dir isL
  3506. \482: O: O964 (predict-no)
  3507. I see 1 and I'm going to do: predict-no
  3508. ENV: Agent did: predict-no for direction L in state State-A
  3509. In State-A moving L
  3510. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3511. predict error 0
  3512. dir: dir isR
  3513. -/483: O: O965 (predict-yes)
  3514. I see 1 and I'm going to do: predict-yes
  3515. ENV: Agent did: predict-yes for direction R in state State-A
  3516. In State-A moving R
  3517. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3518. predict error 0
  3519. dir: dir isR
  3520. |\-484: O: O968 (predict-no)
  3521. I see 1 and I'm going to do: predict-no
  3522. ENV: Agent did: predict-no for direction R in state State-B
  3523. In State-B moving R
  3524. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3525. predict error 0
  3526. dir: dir isU
  3527. /|\485: O: O970 (predict-no)
  3528. I see 1 and I'm going to do: predict-no
  3529. ENV: Agent did: predict-no for direction U in state State-B
  3530. In State-B moving U
  3531. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3532. predict error 0
  3533. dir: dir isU
  3534. -/|486: O: O972 (predict-no)
  3535. I see 1 and I'm going to do: predict-no
  3536. ENV: Agent did: predict-no for direction U in state State-B
  3537. In State-B moving U
  3538. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3539. predict error 0
  3540. dir: dir isR
  3541. \-487: O: O974 (predict-no)
  3542. I see 1 and I'm going to do: predict-no
  3543. ENV: Agent did: predict-no for direction R in state State-B
  3544. In State-B moving R
  3545. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3546. predict error 0
  3547. dir: dir isL
  3548. /|488: O: O975 (predict-yes)
  3549. I see 1 and I'm going to do: predict-yes
  3550. ENV: Agent did: predict-yes for direction L in state State-B
  3551. In State-B moving L
  3552. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3553. predict error 0
  3554. dir: dir isU
  3555. \-/489: O: O978 (predict-no)
  3556. I see 1 and I'm going to do: predict-no
  3557. ENV: Agent did: predict-no for direction U in state State-A
  3558. In State-A moving U
  3559. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3560. predict error 0
  3561. dir: dir isU
  3562. |\-490: O: O979 (predict-yes)
  3563. I see 1 and I'm going to do: predict-yes
  3564. ENV: Agent did: predict-yes for direction U in state State-A
  3565. In State-A moving U
  3566. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3567. predict error 1
  3568. dir: dir isL
  3569. /|\491: O: O982 (predict-no)
  3570. I see 0 and I'm going to do: predict-no
  3571. ENV: Agent did: predict-no for direction L in state State-A
  3572. In State-A moving L
  3573. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3574. predict error 0
  3575. dir: dir isR
  3576. -492: O: O983 (predict-yes)
  3577. I see 1 and I'm going to do: predict-yes
  3578. ENV: Agent did: predict-yes for direction R in state State-A
  3579. In State-A moving R
  3580. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3581. predict error 0
  3582. dir: dir isU
  3583. /|\493: O: O986 (predict-no)
  3584. I see 1 and I'm going to do: predict-no
  3585. ENV: Agent did: predict-no for direction U in state State-B
  3586. In State-B moving U
  3587. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3588. predict error 0
  3589. dir: dir isL
  3590. -/|494: O: O987 (predict-yes)
  3591. I see 1 and I'm going to do: predict-yes
  3592. ENV: Agent did: predict-yes for direction L in state State-B
  3593. In State-B moving L
  3594. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3595. predict error 0
  3596. dir: dir isU
  3597. \-495: O: O990 (predict-no)
  3598. I see 1 and I'm going to do: predict-no
  3599. ENV: Agent did: predict-no for direction U in state State-A
  3600. In State-A moving U
  3601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3602. predict error 0
  3603. dir: dir isU
  3604. /|496: O: O992 (predict-no)
  3605. I see 1 and I'm going to do: predict-no
  3606. ENV: Agent did: predict-no for direction U in state State-A
  3607. In State-A moving U
  3608. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3609. predict error 0
  3610. dir: dir isU
  3611. \-/497: O: O994 (predict-no)
  3612. I see 1 and I'm going to do: predict-no
  3613. ENV: Agent did: predict-no for direction U in state State-A
  3614. In State-A moving U
  3615. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3616. predict error 0
  3617. dir: dir isL
  3618. |\-498: O: O996 (predict-no)
  3619. I see 1 and I'm going to do: predict-no
  3620. ENV: Agent did: predict-no for direction L in state State-A
  3621. In State-A moving L
  3622. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3623. predict error 0
  3624. dir: dir isL
  3625. /|\-499: O: O998 (predict-no)
  3626. I see 1 and I'm going to do: predict-no
  3627. ENV: Agent did: predict-no for direction L in state State-A
  3628. In State-A moving L
  3629. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3630. predict error 0
  3631. dir: dir isR
  3632. /|500: O: O999 (predict-yes)
  3633. I see 1 and I'm going to do: predict-yes
  3634. ENV: Agent did: predict-yes for direction R in state State-A
  3635. In State-A moving R
  3636. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3637. predict error 0
  3638. dir: dir isL
  3639. \-/|\-501: O: O1001 (predict-yes)
  3640. I see 1 and I'm going to do: predict-yes
  3641. ENV: Agent did: predict-yes for direction L in state State-B
  3642. In State-B moving L
  3643. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3644. predict error 0
  3645. dir: dir isR
  3646. /502: O: O1003 (predict-yes)
  3647. I see 1 and I'm going to do: predict-yes
  3648. ENV: Agent did: predict-yes for direction R in state State-A
  3649. In State-A moving R
  3650. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3651. predict error 0
  3652. dir: dir isL
  3653. |\-503: O: O1005 (predict-yes)
  3654. I see 1 and I'm going to do: predict-yes
  3655. ENV: Agent did: predict-yes for direction L in state State-B
  3656. In State-B moving L
  3657. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3658. predict error 0
  3659. dir: dir isU
  3660. /|504: O: O1008 (predict-no)
  3661. I see 1 and I'm going to do: predict-no
  3662. ENV: Agent did: predict-no for direction U in state State-A
  3663. In State-A moving U
  3664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3665. predict error 0
  3666. dir: dir isU
  3667. \-505: O: O1010 (predict-no)
  3668. I see 1 and I'm going to do: predict-no
  3669. ENV: Agent did: predict-no for direction U in state State-A
  3670. In State-A moving U
  3671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3672. predict error 0
  3673. dir: dir isL
  3674. /506: O: O1012 (predict-no)
  3675. I see 1 and I'm going to do: predict-no
  3676. ENV: Agent did: predict-no for direction L in state State-A
  3677. In State-A moving L
  3678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3679. predict error 0
  3680. dir: dir isU
  3681. |\-507: O: O1014 (predict-no)
  3682. I see 1 and I'm going to do: predict-no
  3683. ENV: Agent did: predict-no for direction U in state State-A
  3684. In State-A moving U
  3685. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3686. predict error 0
  3687. dir: dir isL
  3688. /|508: O: O1016 (predict-no)
  3689. I see 1 and I'm going to do: predict-no
  3690. ENV: Agent did: predict-no for direction L in state State-A
  3691. In State-A moving L
  3692. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3693. predict error 0
  3694. dir: dir isL
  3695. \-/509: O: O1018 (predict-no)
  3696. I see 1 and I'm going to do: predict-no
  3697. ENV: Agent did: predict-no for direction L in state State-A
  3698. In State-A moving L
  3699. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3700. predict error 0
  3701. dir: dir isU
  3702. |\-510: O: O1020 (predict-no)
  3703. I see 1 and I'm going to do: predict-no
  3704. ENV: Agent did: predict-no for direction U in state State-A
  3705. In State-A moving U
  3706. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3707. predict error 0
  3708. dir: dir isU
  3709. /|\511: O: O1022 (predict-no)
  3710. I see 1 and I'm going to do: predict-no
  3711. ENV: Agent did: predict-no for direction U in state State-A
  3712. In State-A moving U
  3713. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3714. predict error 0
  3715. dir: dir isL
  3716. -512: O: O1024 (predict-no)
  3717. I see 1 and I'm going to do: predict-no
  3718. ENV: Agent did: predict-no for direction L in state State-A
  3719. In State-A moving L
  3720. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3721. predict error 0
  3722. dir: dir isL
  3723. /|\513: O: O1026 (predict-no)
  3724. I see 1 and I'm going to do: predict-no
  3725. ENV: Agent did: predict-no for direction L in state State-A
  3726. In State-A moving L
  3727. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3728. predict error 0
  3729. dir: dir isR
  3730. -/|514: O: O1027 (predict-yes)
  3731. I see 1 and I'm going to do: predict-yes
  3732. ENV: Agent did: predict-yes for direction R in state State-A
  3733. In State-A moving R
  3734. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3735. predict error 0
  3736. dir: dir isL
  3737. \-/515: O: O1029 (predict-yes)
  3738. I see 1 and I'm going to do: predict-yes
  3739. ENV: Agent did: predict-yes for direction L in state State-B
  3740. In State-B moving L
  3741. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3742. predict error 0
  3743. dir: dir isR
  3744. |516: O: O1031 (predict-yes)
  3745. I see 1 and I'm going to do: predict-yes
  3746. ENV: Agent did: predict-yes for direction R in state State-A
  3747. In State-A moving R
  3748. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3749. predict error 0
  3750. dir: dir isU
  3751. \-/517: O: O1034 (predict-no)
  3752. I see 1 and I'm going to do: predict-no
  3753. ENV: Agent did: predict-no for direction U in state State-B
  3754. In State-B moving U
  3755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3756. predict error 0
  3757. dir: dir isL
  3758. |\-518: O: O1035 (predict-yes)
  3759. I see 1 and I'm going to do: predict-yes
  3760. ENV: Agent did: predict-yes for direction L in state State-B
  3761. In State-B moving L
  3762. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3763. predict error 0
  3764. dir: dir isL
  3765. /|\-519: O: O1038 (predict-no)
  3766. I see 1 and I'm going to do: predict-no
  3767. ENV: Agent did: predict-no for direction L in state State-A
  3768. In State-A moving L
  3769. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3770. predict error 0
  3771. dir: dir isR
  3772. /|\520: O: O1039 (predict-yes)
  3773. I see 1 and I'm going to do: predict-yes
  3774. ENV: Agent did: predict-yes for direction R in state State-A
  3775. In State-A moving R
  3776. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3777. predict error 0
  3778. dir: dir isU
  3779. -/|521: O: O1042 (predict-no)
  3780. I see 1 and I'm going to do: predict-no
  3781. ENV: Agent did: predict-no for direction U in state State-B
  3782. In State-B moving U
  3783. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3784. predict error 0
  3785. dir: dir isL
  3786. \522: O: O1043 (predict-yes)
  3787. I see 1 and I'm going to do: predict-yes
  3788. ENV: Agent did: predict-yes for direction L in state State-B
  3789. In State-B moving L
  3790. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3791. predict error 0
  3792. dir: dir isU
  3793. -/523: O: O1046 (predict-no)
  3794. I see 1 and I'm going to do: predict-no
  3795. ENV: Agent did: predict-no for direction U in state State-A
  3796. In State-A moving U
  3797. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3798. predict error 0
  3799. dir: dir isR
  3800. |\-524: O: O1048 (predict-no)
  3801. I see 1 and I'm going to do: predict-no
  3802. ENV: Agent did: predict-no for direction R in state State-A
  3803. In State-A moving R
  3804. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3805. predict error 1
  3806. dir: dir isR
  3807. /|\525: O: O1050 (predict-no)
  3808. I see 0 and I'm going to do: predict-no
  3809. ENV: Agent did: predict-no for direction R in state State-B
  3810. In State-B moving R
  3811. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3812. predict error 0
  3813. dir: dir isL
  3814. -/|526: O: O1052 (predict-no)
  3815. I see 1 and I'm going to do: predict-no
  3816. ENV: Agent did: predict-no for direction L in state State-B
  3817. In State-B moving L
  3818. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3819. predict error 1
  3820. dir: dir isU
  3821. \-/527: O: O1054 (predict-no)
  3822. I see 0 and I'm going to do: predict-no
  3823. ENV: Agent did: predict-no for direction U in state State-A
  3824. In State-A moving U
  3825. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3826. predict error 0
  3827. dir: dir isU
  3828. |\-528: O: O1056 (predict-no)
  3829. I see 1 and I'm going to do: predict-no
  3830. ENV: Agent did: predict-no for direction U in state State-A
  3831. In State-A moving U
  3832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3833. predict error 0
  3834. dir: dir isR
  3835. /|\529: O: O1057 (predict-yes)
  3836. I see 1 and I'm going to do: predict-yes
  3837. ENV: Agent did: predict-yes for direction R in state State-A
  3838. In State-A moving R
  3839. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3840. predict error 0
  3841. dir: dir isL
  3842. -/|530: O: O1059 (predict-yes)
  3843. I see 1 and I'm going to do: predict-yes
  3844. ENV: Agent did: predict-yes for direction L in state State-B
  3845. In State-B moving L
  3846. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3847. predict error 0
  3848. dir: dir isU
  3849. \-/531: O: O1062 (predict-no)
  3850. I see 1 and I'm going to do: predict-no
  3851. ENV: Agent did: predict-no for direction U in state State-A
  3852. In State-A moving U
  3853. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3854. predict error 0
  3855. dir: dir isL
  3856. |532: O: O1063 (predict-yes)
  3857. I see 1 and I'm going to do: predict-yes
  3858. ENV: Agent did: predict-yes for direction L in state State-A
  3859. In State-A moving L
  3860. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3861. predict error 1
  3862. dir: dir isR
  3863. \-/533: O: O1065 (predict-yes)
  3864. I see 0 and I'm going to do: predict-yes
  3865. ENV: Agent did: predict-yes for direction R in state State-A
  3866. In State-A moving R
  3867. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3868. predict error 0
  3869. dir: dir isL
  3870. |\-534: O: O1067 (predict-yes)
  3871. I see 1 and I'm going to do: predict-yes
  3872. ENV: Agent did: predict-yes for direction L in state State-B
  3873. In State-B moving L
  3874. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3875. predict error 0
  3876. dir: dir isU
  3877. /535: O: O1070 (predict-no)
  3878. I see 1 and I'm going to do: predict-no
  3879. ENV: Agent did: predict-no for direction U in state State-A
  3880. In State-A moving U
  3881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3882. predict error 0
  3883. dir: dir isU
  3884. |\-/536: O: O1072 (predict-no)
  3885. I see 1 and I'm going to do: predict-no
  3886. ENV: Agent did: predict-no for direction U in state State-A
  3887. In State-A moving U
  3888. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3889. predict error 0
  3890. dir: dir isU
  3891. |\-537: O: O1074 (predict-no)
  3892. I see 1 and I'm going to do: predict-no
  3893. ENV: Agent did: predict-no for direction U in state State-A
  3894. In State-A moving U
  3895. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3896. predict error 0
  3897. dir: dir isL
  3898. /|\538: O: O1076 (predict-no)
  3899. I see 1 and I'm going to do: predict-no
  3900. ENV: Agent did: predict-no for direction L in state State-A
  3901. In State-A moving L
  3902. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3903. predict error 0
  3904. dir: dir isL
  3905. -/|539: O: O1078 (predict-no)
  3906. I see 1 and I'm going to do: predict-no
  3907. ENV: Agent did: predict-no for direction L in state State-A
  3908. In State-A moving L
  3909. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3910. predict error 0
  3911. dir: dir isU
  3912. \-/540: O: O1080 (predict-no)
  3913. I see 1 and I'm going to do: predict-no
  3914. ENV: Agent did: predict-no for direction U in state State-A
  3915. In State-A moving U
  3916. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3917. predict error 0
  3918. dir: dir isL
  3919. |\-541: O: O1082 (predict-no)
  3920. I see 1 and I'm going to do: predict-no
  3921. ENV: Agent did: predict-no for direction L in state State-A
  3922. In State-A moving L
  3923. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3924. predict error 0
  3925. dir: dir isR
  3926. /542: O: O1083 (predict-yes)
  3927. I see 1 and I'm going to do: predict-yes
  3928. ENV: Agent did: predict-yes for direction R in state State-A
  3929. In State-A moving R
  3930. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3931. predict error 0
  3932. dir: dir isL
  3933. |\-543: O: O1085 (predict-yes)
  3934. I see 1 and I'm going to do: predict-yes
  3935. ENV: Agent did: predict-yes for direction L in state State-B
  3936. In State-B moving L
  3937. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3938. predict error 0
  3939. dir: dir isL
  3940. /|544: O: O1088 (predict-no)
  3941. I see 1 and I'm going to do: predict-no
  3942. ENV: Agent did: predict-no for direction L in state State-A
  3943. In State-A moving L
  3944. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3945. predict error 0
  3946. dir: dir isL
  3947. \-/545: O: O1090 (predict-no)
  3948. I see 1 and I'm going to do: predict-no
  3949. ENV: Agent did: predict-no for direction L in state State-A
  3950. In State-A moving L
  3951. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3952. predict error 0
  3953. dir: dir isL
  3954. |\-546: O: O1092 (predict-no)
  3955. I see 1 and I'm going to do: predict-no
  3956. ENV: Agent did: predict-no for direction L in state State-A
  3957. In State-A moving L
  3958. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3959. predict error 0
  3960. dir: dir isL
  3961. /|\547: O: O1094 (predict-no)
  3962. I see 1 and I'm going to do: predict-no
  3963. ENV: Agent did: predict-no for direction L in state State-A
  3964. In State-A moving L
  3965. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3966. predict error 0
  3967. dir: dir isR
  3968. -/|548: O: O1095 (predict-yes)
  3969. I see 1 and I'm going to do: predict-yes
  3970. ENV: Agent did: predict-yes for direction R in state State-A
  3971. In State-A moving R
  3972. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3973. predict error 0
  3974. dir: dir isR
  3975. \-/549: O: O1098 (predict-no)
  3976. I see 1 and I'm going to do: predict-no
  3977. ENV: Agent did: predict-no for direction R in state State-B
  3978. In State-B moving R
  3979. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3980. predict error 0
  3981. dir: dir isU
  3982. |\550: O: O1100 (predict-no)
  3983. I see 1 and I'm going to do: predict-no
  3984. ENV: Agent did: predict-no for direction U in state State-B
  3985. In State-B moving U
  3986. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3987. predict error 0
  3988. dir: dir isL
  3989. -/551: O: O1102 (predict-no)
  3990. I see 1 and I'm going to do: predict-no
  3991. ENV: Agent did: predict-no for direction L in state State-B
  3992. In State-B moving L
  3993. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3994. predict error 1
  3995. dir: dir isR
  3996. |552: O: O1103 (predict-yes)
  3997. I see 0 and I'm going to do: predict-yes
  3998. ENV: Agent did: predict-yes for direction R in state State-A
  3999. In State-A moving R
  4000. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4001. predict error 0
  4002. dir: dir isR
  4003. \-/553: O: O1106 (predict-no)
  4004. I see 1 and I'm going to do: predict-no
  4005. ENV: Agent did: predict-no for direction R in state State-B
  4006. In State-B moving R
  4007. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4008. predict error 0
  4009. dir: dir isL
  4010. |\554: O: O1107 (predict-yes)
  4011. I see 1 and I'm going to do: predict-yes
  4012. ENV: Agent did: predict-yes for direction L in state State-B
  4013. In State-B moving L
  4014. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4015. predict error 0
  4016. dir: dir isR
  4017. -/555: O: O1109 (predict-yes)
  4018. I see 1 and I'm going to do: predict-yes
  4019. ENV: Agent did: predict-yes for direction R in state State-A
  4020. In State-A moving R
  4021. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4022. predict error 0
  4023. dir: dir isR
  4024. |556: O: O1112 (predict-no)
  4025. I see 1 and I'm going to do: predict-no
  4026. ENV: Agent did: predict-no for direction R in state State-B
  4027. In State-B moving R
  4028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4029. predict error 0
  4030. dir: dir isU
  4031. \-/557: O: O1114 (predict-no)
  4032. I see 1 and I'm going to do: predict-no
  4033. ENV: Agent did: predict-no for direction U in state State-B
  4034. In State-B moving U
  4035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4036. predict error 0
  4037. dir: dir isL
  4038. |\-558: O: O1115 (predict-yes)
  4039. I see 1 and I'm going to do: predict-yes
  4040. ENV: Agent did: predict-yes for direction L in state State-B
  4041. In State-B moving L
  4042. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4043. predict error 0
  4044. dir: dir isR
  4045. /|\559: O: O1117 (predict-yes)
  4046. I see 1 and I'm going to do: predict-yes
  4047. ENV: Agent did: predict-yes for direction R in state State-A
  4048. In State-A moving R
  4049. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4050. predict error 0
  4051. dir: dir isR
  4052. -/|560: O: O1120 (predict-no)
  4053. I see 1 and I'm going to do: predict-no
  4054. ENV: Agent did: predict-no for direction R in state State-B
  4055. In State-B moving R
  4056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4057. predict error 0
  4058. dir: dir isU
  4059. \-561: O: O1122 (predict-no)
  4060. I see 1 and I'm going to do: predict-no
  4061. ENV: Agent did: predict-no for direction U in state State-B
  4062. In State-B moving U
  4063. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4064. predict error 0
  4065. dir: dir isL
  4066. /562: O: O1123 (predict-yes)
  4067. I see 1 and I'm going to do: predict-yes
  4068. ENV: Agent did: predict-yes for direction L in state State-B
  4069. In State-B moving L
  4070. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4071. predict error 0
  4072. dir: dir isL
  4073. |\563: O: O1126 (predict-no)
  4074. I see 1 and I'm going to do: predict-no
  4075. ENV: Agent did: predict-no for direction L in state State-A
  4076. In State-A moving L
  4077. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4078. predict error 0
  4079. dir: dir isL
  4080. -/|564: O: O1128 (predict-no)
  4081. I see 1 and I'm going to do: predict-no
  4082. ENV: Agent did: predict-no for direction L in state State-A
  4083. In State-A moving L
  4084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4085. predict error 0
  4086. dir: dir isR
  4087. \-565: O: O1129 (predict-yes)
  4088. I see 1 and I'm going to do: predict-yes
  4089. ENV: Agent did: predict-yes for direction R in state State-A
  4090. In State-A moving R
  4091. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4092. predict error 0
  4093. dir: dir isU
  4094. /|566: O: O1132 (predict-no)
  4095. I see 1 and I'm going to do: predict-no
  4096. ENV: Agent did: predict-no for direction U in state State-B
  4097. In State-B moving U
  4098. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4099. predict error 0
  4100. dir: dir isU
  4101. \-/567: O: O1134 (predict-no)
  4102. I see 1 and I'm going to do: predict-no
  4103. ENV: Agent did: predict-no for direction U in state State-B
  4104. In State-B moving U
  4105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4106. predict error 0
  4107. dir: dir isL
  4108. |\-568: O: O1135 (predict-yes)
  4109. I see 1 and I'm going to do: predict-yes
  4110. ENV: Agent did: predict-yes for direction L in state State-B
  4111. In State-B moving L
  4112. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4113. predict error 0
  4114. dir: dir isR
  4115. /|\569: O: O1137 (predict-yes)
  4116. I see 1 and I'm going to do: predict-yes
  4117. ENV: Agent did: predict-yes for direction R in state State-A
  4118. In State-A moving R
  4119. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4120. predict error 0
  4121. dir: dir isU
  4122. -/|570: O: O1140 (predict-no)
  4123. I see 1 and I'm going to do: predict-no
  4124. ENV: Agent did: predict-no for direction U in state State-B
  4125. In State-B moving U
  4126. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4127. predict error 0
  4128. dir: dir isU
  4129. \-/571: O: O1142 (predict-no)
  4130. I see 1 and I'm going to do: predict-no
  4131. ENV: Agent did: predict-no for direction U in state State-B
  4132. In State-B moving U
  4133. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4134. predict error 0
  4135. dir: dir isR
  4136. |572: O: O1144 (predict-no)
  4137. I see 1 and I'm going to do: predict-no
  4138. ENV: Agent did: predict-no for direction R in state State-B
  4139. In State-B moving R
  4140. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4141. predict error 0
  4142. dir: dir isR
  4143. \-/573: O: O1146 (predict-no)
  4144. I see 1 and I'm going to do: predict-no
  4145. ENV: Agent did: predict-no for direction R in state State-B
  4146. In State-B moving R
  4147. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4148. predict error 0
  4149. dir: dir isU
  4150. |\-574: O: O1148 (predict-no)
  4151. I see 1 and I'm going to do: predict-no
  4152. ENV: Agent did: predict-no for direction U in state State-B
  4153. In State-B moving U
  4154. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4155. predict error 0
  4156. dir: dir isR
  4157. /|\575: O: O1150 (predict-no)
  4158. I see 1 and I'm going to do: predict-no
  4159. ENV: Agent did: predict-no for direction R in state State-B
  4160. In State-B moving R
  4161. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4162. predict error 0
  4163. dir: dir isL
  4164. -/|576: O: O1151 (predict-yes)
  4165. I see 1 and I'm going to do: predict-yes
  4166. ENV: Agent did: predict-yes for direction L in state State-B
  4167. In State-B moving L
  4168. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4169. predict error 0
  4170. dir: dir isR
  4171. \-577: O: O1153 (predict-yes)
  4172. I see 1 and I'm going to do: predict-yes
  4173. ENV: Agent did: predict-yes for direction R in state State-A
  4174. In State-A moving R
  4175. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4176. predict error 0
  4177. dir: dir isU
  4178. /|578: O: O1156 (predict-no)
  4179. I see 1 and I'm going to do: predict-no
  4180. ENV: Agent did: predict-no for direction U in state State-B
  4181. In State-B moving U
  4182. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4183. predict error 0
  4184. dir: dir isL
  4185. \-/579: O: O1157 (predict-yes)
  4186. I see 1 and I'm going to do: predict-yes
  4187. ENV: Agent did: predict-yes for direction L in state State-B
  4188. In State-B moving L
  4189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4190. predict error 0
  4191. dir: dir isR
  4192. |\-580: O: O1159 (predict-yes)
  4193. I see 1 and I'm going to do: predict-yes
  4194. ENV: Agent did: predict-yes for direction R in state State-A
  4195. In State-A moving R
  4196. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4197. predict error 0
  4198. dir: dir isR
  4199. /|581: O: O1162 (predict-no)
  4200. I see 1 and I'm going to do: predict-no
  4201. ENV: Agent did: predict-no for direction R in state State-B
  4202. In State-B moving R
  4203. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4204. predict error 0
  4205. dir: dir isR
  4206. \582: O: O1163 (predict-yes)
  4207. I see 1 and I'm going to do: predict-yes
  4208. ENV: Agent did: predict-yes for direction R in state State-B
  4209. In State-B moving R
  4210. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4211. predict error 1
  4212. dir: dir isL
  4213. -/|583: O: O1165 (predict-yes)
  4214. I see 0 and I'm going to do: predict-yes
  4215. ENV: Agent did: predict-yes for direction L in state State-B
  4216. In State-B moving L
  4217. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4218. predict error 0
  4219. dir: dir isL
  4220. \-584: O: O1168 (predict-no)
  4221. I see 1 and I'm going to do: predict-no
  4222. ENV: Agent did: predict-no for direction L in state State-A
  4223. In State-A moving L
  4224. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4225. predict error 0
  4226. dir: dir isU
  4227. /|\585: O: O1170 (predict-no)
  4228. I see 1 and I'm going to do: predict-no
  4229. ENV: Agent did: predict-no for direction U in state State-A
  4230. In State-A moving U
  4231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4232. predict error 0
  4233. dir: dir isR
  4234. -/|586: O: O1171 (predict-yes)
  4235. I see 1 and I'm going to do: predict-yes
  4236. ENV: Agent did: predict-yes for direction R in state State-A
  4237. In State-A moving R
  4238. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4239. predict error 0
  4240. dir: dir isL
  4241. \-/587: O: O1173 (predict-yes)
  4242. I see 1 and I'm going to do: predict-yes
  4243. ENV: Agent did: predict-yes for direction L in state State-B
  4244. In State-B moving L
  4245. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4246. predict error 0
  4247. dir: dir isL
  4248. |\588: O: O1176 (predict-no)
  4249. I see 1 and I'm going to do: predict-no
  4250. ENV: Agent did: predict-no for direction L in state State-A
  4251. In State-A moving L
  4252. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4253. predict error 0
  4254. dir: dir isU
  4255. -/589: O: O1178 (predict-no)
  4256. I see 1 and I'm going to do: predict-no
  4257. ENV: Agent did: predict-no for direction U in state State-A
  4258. In State-A moving U
  4259. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4260. predict error 0
  4261. dir: dir isR
  4262. |\590: O: O1179 (predict-yes)
  4263. I see 1 and I'm going to do: predict-yes
  4264. ENV: Agent did: predict-yes for direction R in state State-A
  4265. In State-A moving R
  4266. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4267. predict error 0
  4268. dir: dir isR
  4269. -/|591: O: O1182 (predict-no)
  4270. I see 1 and I'm going to do: predict-no
  4271. ENV: Agent did: predict-no for direction R in state State-B
  4272. In State-B moving R
  4273. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4274. predict error 0
  4275. dir: dir isL
  4276. \592: O: O1183 (predict-yes)
  4277. I see 1 and I'm going to do: predict-yes
  4278. ENV: Agent did: predict-yes for direction L in state State-B
  4279. In State-B moving L
  4280. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4281. predict error 0
  4282. dir: dir isU
  4283. -/593: O: O1186 (predict-no)
  4284. I see 1 and I'm going to do: predict-no
  4285. ENV: Agent did: predict-no for direction U in state State-A
  4286. In State-A moving U
  4287. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4288. predict error 0
  4289. dir: dir isR
  4290. |\-594: O: O1187 (predict-yes)
  4291. I see 1 and I'm going to do: predict-yes
  4292. ENV: Agent did: predict-yes for direction R in state State-A
  4293. In State-A moving R
  4294. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4295. predict error 0
  4296. dir: dir isU
  4297. /|\595: O: O1190 (predict-no)
  4298. I see 1 and I'm going to do: predict-no
  4299. ENV: Agent did: predict-no for direction U in state State-B
  4300. In State-B moving U
  4301. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4302. predict error 0
  4303. dir: dir isU
  4304. -/596: O: O1192 (predict-no)
  4305. I see 1 and I'm going to do: predict-no
  4306. ENV: Agent did: predict-no for direction U in state State-B
  4307. In State-B moving U
  4308. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4309. predict error 0
  4310. dir: dir isL
  4311. |\597: O: O1193 (predict-yes)
  4312. I see 1 and I'm going to do: predict-yes
  4313. ENV: Agent did: predict-yes for direction L in state State-B
  4314. In State-B moving L
  4315. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4316. predict error 0
  4317. dir: dir isU
  4318. -/598: O: O1196 (predict-no)
  4319. I see 1 and I'm going to do: predict-no
  4320. ENV: Agent did: predict-no for direction U in state State-A
  4321. In State-A moving U
  4322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4323. predict error 0
  4324. dir: dir isL
  4325. |\599: O: O1198 (predict-no)
  4326. I see 1 and I'm going to do: predict-no
  4327. ENV: Agent did: predict-no for direction L in state State-A
  4328. In State-A moving L
  4329. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4330. predict error 0
  4331. dir: dir isU
  4332. -/|600: O: O1200 (predict-no)
  4333. I see 1 and I'm going to do: predict-no
  4334. ENV: Agent did: predict-no for direction U in state State-A
  4335. In State-A moving U
  4336. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4337. predict error 0
  4338. dir: dir isL
  4339. \-/601: O: O1202 (predict-no)
  4340. I see 1 and I'm going to do: predict-no
  4341. ENV: Agent did: predict-no for direction L in state State-A
  4342. In State-A moving L
  4343. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4344. predict error 0
  4345. dir: dir isU
  4346. |602: O: O1204 (predict-no)
  4347. I see 1 and I'm going to do: predict-no
  4348. ENV: Agent did: predict-no for direction U in state State-A
  4349. In State-A moving U
  4350. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4351. predict error 0
  4352. dir: dir isL
  4353. \-603: O: O1206 (predict-no)
  4354. I see 1 and I'm going to do: predict-no
  4355. ENV: Agent did: predict-no for direction L in state State-A
  4356. In State-A moving L
  4357. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4358. predict error 0
  4359. dir: dir isL
  4360. /|\604: O: O1208 (predict-no)
  4361. I see 1 and I'm going to do: predict-no
  4362. ENV: Agent did: predict-no for direction L in state State-A
  4363. In State-A moving L
  4364. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4365. predict error 0
  4366. dir: dir isR
  4367. -/|605: O: O1209 (predict-yes)
  4368. I see 1 and I'm going to do: predict-yes
  4369. ENV: Agent did: predict-yes for direction R in state State-A
  4370. In State-A moving R
  4371. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4372. predict error 0
  4373. dir: dir isR
  4374. \-/606: O: O1212 (predict-no)
  4375. I see 1 and I'm going to do: predict-no
  4376. ENV: Agent did: predict-no for direction R in state State-B
  4377. In State-B moving R
  4378. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4379. predict error 0
  4380. dir: dir isR
  4381. |\-607: O: O1214 (predict-no)
  4382. I see 1 and I'm going to do: predict-no
  4383. ENV: Agent did: predict-no for direction R in state State-B
  4384. In State-B moving R
  4385. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4386. predict error 0
  4387. dir: dir isL
  4388. /|608: O: O1215 (predict-yes)
  4389. I see 1 and I'm going to do: predict-yes
  4390. ENV: Agent did: predict-yes for direction L in state State-B
  4391. In State-B moving L
  4392. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4393. predict error 0
  4394. dir: dir isL
  4395. \-/609: O: O1218 (predict-no)
  4396. I see 1 and I'm going to do: predict-no
  4397. ENV: Agent did: predict-no for direction L in state State-A
  4398. In State-A moving L
  4399. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4400. predict error 0
  4401. dir: dir isL
  4402. |\-610: O: O1220 (predict-no)
  4403. I see 1 and I'm going to do: predict-no
  4404. ENV: Agent did: predict-no for direction L in state State-A
  4405. In State-A moving L
  4406. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4407. predict error 0
  4408. dir: dir isU
  4409. /|\-611: O: O1222 (predict-no)
  4410. I see 1 and I'm going to do: predict-no
  4411. ENV: Agent did: predict-no for direction U in state State-A
  4412. In State-A moving U
  4413. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4414. predict error 0
  4415. dir: dir isU
  4416. /612: O: O1224 (predict-no)
  4417. I see 1 and I'm going to do: predict-no
  4418. ENV: Agent did: predict-no for direction U in state State-A
  4419. In State-A moving U
  4420. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4421. predict error 0
  4422. dir: dir isR
  4423. |\613: O: O1225 (predict-yes)
  4424. I see 1 and I'm going to do: predict-yes
  4425. ENV: Agent did: predict-yes for direction R in state State-A
  4426. In State-A moving R
  4427. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4428. predict error 0
  4429. dir: dir isL
  4430. -/614: O: O1227 (predict-yes)
  4431. I see 1 and I'm going to do: predict-yes
  4432. ENV: Agent did: predict-yes for direction L in state State-B
  4433. In State-B moving L
  4434. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4435. predict error 0
  4436. dir: dir isU
  4437. |\-615: O: O1230 (predict-no)
  4438. I see 1 and I'm going to do: predict-no
  4439. ENV: Agent did: predict-no for direction U in state State-A
  4440. In State-A moving U
  4441. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4442. predict error 0
  4443. dir: dir isL
  4444. /|\616: O: O1232 (predict-no)
  4445. I see 1 and I'm going to do: predict-no
  4446. ENV: Agent did: predict-no for direction L in state State-A
  4447. In State-A moving L
  4448. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4449. predict error 0
  4450. dir: dir isR
  4451. -/|617: O: O1233 (predict-yes)
  4452. I see 1 and I'm going to do: predict-yes
  4453. ENV: Agent did: predict-yes for direction R in state State-A
  4454. In State-A moving R
  4455. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4456. predict error 0
  4457. dir: dir isR
  4458. \-/618: O: O1236 (predict-no)
  4459. I see 1 and I'm going to do: predict-no
  4460. ENV: Agent did: predict-no for direction R in state State-B
  4461. In State-B moving R
  4462. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4463. predict error 0
  4464. dir: dir isL
  4465. |\-619: O: O1237 (predict-yes)
  4466. I see 1 and I'm going to do: predict-yes
  4467. ENV: Agent did: predict-yes for direction L in state State-B
  4468. In State-B moving L
  4469. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4470. predict error 0
  4471. dir: dir isU
  4472. /|620: O: O1240 (predict-no)
  4473. I see 1 and I'm going to do: predict-no
  4474. ENV: Agent did: predict-no for direction U in state State-A
  4475. In State-A moving U
  4476. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4477. predict error 0
  4478. dir: dir isL
  4479. \-/621: O: O1242 (predict-no)
  4480. I see 1 and I'm going to do: predict-no
  4481. ENV: Agent did: predict-no for direction L in state State-A
  4482. In State-A moving L
  4483. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4484. predict error 0
  4485. dir: dir isR
  4486. |622: O: O1243 (predict-yes)
  4487. I see 1 and I'm going to do: predict-yes
  4488. ENV: Agent did: predict-yes for direction R in state State-A
  4489. In State-A moving R
  4490. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4491. predict error 0
  4492. dir: dir isL
  4493. \-/623: O: O1245 (predict-yes)
  4494. I see 1 and I'm going to do: predict-yes
  4495. ENV: Agent did: predict-yes for direction L in state State-B
  4496. In State-B moving L
  4497. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4498. predict error 0
  4499. dir: dir isR
  4500. |\624: O: O1247 (predict-yes)
  4501. I see 1 and I'm going to do: predict-yes
  4502. ENV: Agent did: predict-yes for direction R in state State-A
  4503. In State-A moving R
  4504. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4505. predict error 0
  4506. dir: dir isR
  4507. -/|625: O: O1250 (predict-no)
  4508. I see 1 and I'm going to do: predict-no
  4509. ENV: Agent did: predict-no for direction R in state State-B
  4510. In State-B moving R
  4511. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4512. predict error 0
  4513. dir: dir isR
  4514. \-626: O: O1252 (predict-no)
  4515. I see 1 and I'm going to do: predict-no
  4516. ENV: Agent did: predict-no for direction R in state State-B
  4517. In State-B moving R
  4518. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4519. predict error 0
  4520. dir: dir isR
  4521. /|\627: O: O1254 (predict-no)
  4522. I see 1 and I'm going to do: predict-no
  4523. ENV: Agent did: predict-no for direction R in state State-B
  4524. In State-B moving R
  4525. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4526. predict error 0
  4527. dir: dir isR
  4528. -/628: O: O1256 (predict-no)
  4529. I see 1 and I'm going to do: predict-no
  4530. ENV: Agent did: predict-no for direction R in state State-B
  4531. In State-B moving R
  4532. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4533. predict error 0
  4534. dir: dir isR
  4535. |\-629: O: O1258 (predict-no)
  4536. I see 1 and I'm going to do: predict-no
  4537. ENV: Agent did: predict-no for direction R in state State-B
  4538. In State-B moving R
  4539. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4540. predict error 0
  4541. dir: dir isR
  4542. /|630: O: O1260 (predict-no)
  4543. I see 1 and I'm going to do: predict-no
  4544. ENV: Agent did: predict-no for direction R in state State-B
  4545. In State-B moving R
  4546. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4547. predict error 0
  4548. dir: dir isU
  4549. \-/631: O: O1262 (predict-no)
  4550. I see 1 and I'm going to do: predict-no
  4551. ENV: Agent did: predict-no for direction U in state State-B
  4552. In State-B moving U
  4553. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4554. predict error 0
  4555. dir: dir isU
  4556. |632: O: O1264 (predict-no)
  4557. I see 1 and I'm going to do: predict-no
  4558. ENV: Agent did: predict-no for direction U in state State-B
  4559. In State-B moving U
  4560. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4561. predict error 0
  4562. dir: dir isL
  4563. \-633: O: O1265 (predict-yes)
  4564. I see 1 and I'm going to do: predict-yes
  4565. ENV: Agent did: predict-yes for direction L in state State-B
  4566. In State-B moving L
  4567. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4568. predict error 0
  4569. dir: dir isR
  4570. /|\634: O: O1267 (predict-yes)
  4571. I see 1 and I'm going to do: predict-yes
  4572. ENV: Agent did: predict-yes for direction R in state State-A
  4573. In State-A moving R
  4574. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4575. predict error 0
  4576. dir: dir isR
  4577. -/|635: O: O1270 (predict-no)
  4578. I see 1 and I'm going to do: predict-no
  4579. ENV: Agent did: predict-no for direction R in state State-B
  4580. In State-B moving R
  4581. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4582. predict error 0
  4583. dir: dir isL
  4584. \-/636: O: O1271 (predict-yes)
  4585. I see 1 and I'm going to do: predict-yes
  4586. ENV: Agent did: predict-yes for direction L in state State-B
  4587. In State-B moving L
  4588. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4589. predict error 0
  4590. dir: dir isU
  4591. |637: O: O1274 (predict-no)
  4592. I see 1 and I'm going to do: predict-no
  4593. ENV: Agent did: predict-no for direction U in state State-A
  4594. In State-A moving U
  4595. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4596. predict error 0
  4597. dir: dir isR
  4598. \-/638: O: O1275 (predict-yes)
  4599. I see 1 and I'm going to do: predict-yes
  4600. ENV: Agent did: predict-yes for direction R in state State-A
  4601. In State-A moving R
  4602. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4603. predict error 0
  4604. dir: dir isR
  4605. |639: O: O1278 (predict-no)
  4606. I see 1 and I'm going to do: predict-no
  4607. ENV: Agent did: predict-no for direction R in state State-B
  4608. In State-B moving R
  4609. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4610. predict error 0
  4611. dir: dir isL
  4612. \-/640: O: O1279 (predict-yes)
  4613. I see 1 and I'm going to do: predict-yes
  4614. ENV: Agent did: predict-yes for direction L in state State-B
  4615. In State-B moving L
  4616. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4617. predict error 0
  4618. dir: dir isU
  4619. |\-641: O: O1282 (predict-no)
  4620. I see 1 and I'm going to do: predict-no
  4621. ENV: Agent did: predict-no for direction U in state State-A
  4622. In State-A moving U
  4623. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4624. predict error 0
  4625. dir: dir isR
  4626. /642: O: O1283 (predict-yes)
  4627. I see 1 and I'm going to do: predict-yes
  4628. ENV: Agent did: predict-yes for direction R in state State-A
  4629. In State-A moving R
  4630. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4631. predict error 0
  4632. dir: dir isR
  4633. |\-643: O: O1286 (predict-no)
  4634. I see 1 and I'm going to do: predict-no
  4635. ENV: Agent did: predict-no for direction R in state State-B
  4636. In State-B moving R
  4637. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4638. predict error 0
  4639. dir: dir isR
  4640. /|644: O: O1288 (predict-no)
  4641. I see 1 and I'm going to do: predict-no
  4642. ENV: Agent did: predict-no for direction R in state State-B
  4643. In State-B moving R
  4644. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4645. predict error 0
  4646. dir: dir isR
  4647. \-/645: O: O1290 (predict-no)
  4648. I see 1 and I'm going to do: predict-no
  4649. ENV: Agent did: predict-no for direction R in state State-B
  4650. In State-B moving R
  4651. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4652. predict error 0
  4653. dir: dir isU
  4654. |\-646: O: O1292 (predict-no)
  4655. I see 1 and I'm going to do: predict-no
  4656. ENV: Agent did: predict-no for direction U in state State-B
  4657. In State-B moving U
  4658. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4659. predict error 0
  4660. dir: dir isL
  4661. /|\647: O: O1294 (predict-no)
  4662. I see 1 and I'm going to do: predict-no
  4663. ENV: Agent did: predict-no for direction L in state State-B
  4664. In State-B moving L
  4665. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  4666. predict error 1
  4667. dir: dir isR
  4668. -/648: O: O1295 (predict-yes)
  4669. I see 0 and I'm going to do: predict-yes
  4670. ENV: Agent did: predict-yes for direction R in state State-A
  4671. In State-A moving R
  4672. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4673. predict error 0
  4674. dir: dir isL
  4675. |\-649: O: O1297 (predict-yes)
  4676. I see 1 and I'm going to do: predict-yes
  4677. ENV: Agent did: predict-yes for direction L in state State-B
  4678. In State-B moving L
  4679. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4680. predict error 0
  4681. dir: dir isL
  4682. /|\650: O: O1300 (predict-no)
  4683. I see 1 and I'm going to do: predict-no
  4684. ENV: Agent did: predict-no for direction L in state State-A
  4685. In State-A moving L
  4686. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4687. predict error 0
  4688. dir: dir isU
  4689. -/|651: O: O1302 (predict-no)
  4690. I see 1 and I'm going to do: predict-no
  4691. ENV: Agent did: predict-no for direction U in state State-A
  4692. In State-A moving U
  4693. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4694. predict error 0
  4695. dir: dir isL
  4696. \652: O: O1303 (predict-yes)
  4697. I see 1 and I'm going to do: predict-yes
  4698. ENV: Agent did: predict-yes for direction L in state State-A
  4699. In State-A moving L
  4700. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  4701. predict error 1
  4702. dir: dir isR
  4703. -/653: O: O1305 (predict-yes)
  4704. I see 0 and I'm going to do: predict-yes
  4705. ENV: Agent did: predict-yes for direction R in state State-A
  4706. In State-A moving R
  4707. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4708. predict error 0
  4709. dir: dir isL
  4710. |654: O: O1307 (predict-yes)
  4711. I see 1 and I'm going to do: predict-yes
  4712. ENV: Agent did: predict-yes for direction L in state State-B
  4713. In State-B moving L
  4714. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4715. predict error 0
  4716. dir: dir isR
  4717. \-/655: O: O1309 (predict-yes)
  4718. I see 1 and I'm going to do: predict-yes
  4719. ENV: Agent did: predict-yes for direction R in state State-A
  4720. In State-A moving R
  4721. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4722. predict error 0
  4723. dir: dir isU
  4724. |\-656: O: O1312 (predict-no)
  4725. I see 1 and I'm going to do: predict-no
  4726. ENV: Agent did: predict-no for direction U in state State-B
  4727. In State-B moving U
  4728. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4729. predict error 0
  4730. dir: dir isL
  4731. /|\657: O: O1313 (predict-yes)
  4732. I see 1 and I'm going to do: predict-yes
  4733. ENV: Agent did: predict-yes for direction L in state State-B
  4734. In State-B moving L
  4735. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4736. predict error 0
  4737. dir: dir isR
  4738. -/|658: O: O1315 (predict-yes)
  4739. I see 1 and I'm going to do: predict-yes
  4740. ENV: Agent did: predict-yes for direction R in state State-A
  4741. In State-A moving R
  4742. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4743. predict error 0
  4744. dir: dir isL
  4745. \-/659: O: O1317 (predict-yes)
  4746. I see 1 and I'm going to do: predict-yes
  4747. ENV: Agent did: predict-yes for direction L in state State-B
  4748. In State-B moving L
  4749. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4750. predict error 0
  4751. dir: dir isU
  4752. |\-660: O: O1320 (predict-no)
  4753. I see 1 and I'm going to do: predict-no
  4754. ENV: Agent did: predict-no for direction U in state State-A
  4755. In State-A moving U
  4756. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4757. predict error 0
  4758. dir: dir isU
  4759. /|\661: O: O1322 (predict-no)
  4760. I see 1 and I'm going to do: predict-no
  4761. ENV: Agent did: predict-no for direction U in state State-A
  4762. In State-A moving U
  4763. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4764. predict error 0
  4765. dir: dir isU
  4766. -662: O: O1324 (predict-no)
  4767. I see 1 and I'm going to do: predict-no
  4768. ENV: Agent did: predict-no for direction U in state State-A
  4769. In State-A moving U
  4770. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4771. predict error 0
  4772. dir: dir isU
  4773. /663: O: O1326 (predict-no)
  4774. I see 1 and I'm going to do: predict-no
  4775. ENV: Agent did: predict-no for direction U in state State-A
  4776. In State-A moving U
  4777. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4778. predict error 0
  4779. dir: dir isL
  4780. |\-664: O: O1328 (predict-no)
  4781. I see 1 and I'm going to do: predict-no
  4782. ENV: Agent did: predict-no for direction L in state State-A
  4783. In State-A moving L
  4784. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4785. predict error 0
  4786. dir: dir isL
  4787. /|\665: O: O1330 (predict-no)
  4788. I see 1 and I'm going to do: predict-no
  4789. ENV: Agent did: predict-no for direction L in state State-A
  4790. In State-A moving L
  4791. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4792. predict error 0
  4793. dir: dir isR
  4794. -/|666: O: O1331 (predict-yes)
  4795. I see 1 and I'm going to do: predict-yes
  4796. ENV: Agent did: predict-yes for direction R in state State-A
  4797. In State-A moving R
  4798. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4799. predict error 0
  4800. dir: dir isU
  4801. \-/667: O: O1334 (predict-no)
  4802. I see 1 and I'm going to do: predict-no
  4803. ENV: Agent did: predict-no for direction U in state State-B
  4804. In State-B moving U
  4805. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4806. predict error 0
  4807. dir: dir isU
  4808. |\668: O: O1336 (predict-no)
  4809. I see 1 and I'm going to do: predict-no
  4810. ENV: Agent did: predict-no for direction U in state State-B
  4811. In State-B moving U
  4812. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4813. predict error 0
  4814. dir: dir isU
  4815. -/|669: O: O1338 (predict-no)
  4816. I see 1 and I'm going to do: predict-no
  4817. ENV: Agent did: predict-no for direction U in state State-B
  4818. In State-B moving U
  4819. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4820. predict error 0
  4821. dir: dir isU
  4822. \-/670: O: O1340 (predict-no)
  4823. I see 1 and I'm going to do: predict-no
  4824. ENV: Agent did: predict-no for direction U in state State-B
  4825. In State-B moving U
  4826. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4827. predict error 0
  4828. dir: dir isL
  4829. |\-671: O: O1341 (predict-yes)
  4830. I see 1 and I'm going to do: predict-yes
  4831. ENV: Agent did: predict-yes for direction L in state State-B
  4832. In State-B moving L
  4833. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4834. predict error 0
  4835. dir: dir isU
  4836. /672: O: O1344 (predict-no)
  4837. I see 1 and I'm going to do: predict-no
  4838. ENV: Agent did: predict-no for direction U in state State-A
  4839. In State-A moving U
  4840. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4841. predict error 0
  4842. dir: dir isL
  4843. |\-673: O: O1346 (predict-no)
  4844. I see 1 and I'm going to do: predict-no
  4845. ENV: Agent did: predict-no for direction L in state State-A
  4846. In State-A moving L
  4847. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4848. predict error 0
  4849. dir: dir isL
  4850. /|\674: O: O1348 (predict-no)
  4851. I see 1 and I'm going to do: predict-no
  4852. ENV: Agent did: predict-no for direction L in state State-A
  4853. In State-A moving L
  4854. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4855. predict error 0
  4856. dir: dir isR
  4857. -/675: O: O1349 (predict-yes)
  4858. I see 1 and I'm going to do: predict-yes
  4859. ENV: Agent did: predict-yes for direction R in state State-A
  4860. In State-A moving R
  4861. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4862. predict error 0
  4863. dir: dir isL
  4864. |\676: O: O1351 (predict-yes)
  4865. I see 1 and I'm going to do: predict-yes
  4866. ENV: Agent did: predict-yes for direction L in state State-B
  4867. In State-B moving L
  4868. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4869. predict error 0
  4870. dir: dir isL
  4871. -/|677: O: O1354 (predict-no)
  4872. I see 1 and I'm going to do: predict-no
  4873. ENV: Agent did: predict-no for direction L in state State-A
  4874. In State-A moving L
  4875. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4876. predict error 0
  4877. dir: dir isR
  4878. \-/678: O: O1355 (predict-yes)
  4879. I see 1 and I'm going to do: predict-yes
  4880. ENV: Agent did: predict-yes for direction R in state State-A
  4881. In State-A moving R
  4882. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4883. predict error 0
  4884. dir: dir isU
  4885. |\679: O: O1358 (predict-no)
  4886. I see 1 and I'm going to do: predict-no
  4887. ENV: Agent did: predict-no for direction U in state State-B
  4888. In State-B moving U
  4889. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4890. predict error 0
  4891. dir: dir isR
  4892. -/|680: O: O1360 (predict-no)
  4893. I see 1 and I'm going to do: predict-no
  4894. ENV: Agent did: predict-no for direction R in state State-B
  4895. In State-B moving R
  4896. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4897. predict error 0
  4898. dir: dir isR
  4899. \-/681: O: O1362 (predict-no)
  4900. I see 1 and I'm going to do: predict-no
  4901. ENV: Agent did: predict-no for direction R in state State-B
  4902. In State-B moving R
  4903. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4904. predict error 0
  4905. dir: dir isU
  4906. |682: O: O1364 (predict-no)
  4907. I see 1 and I'm going to do: predict-no
  4908. ENV: Agent did: predict-no for direction U in state State-B
  4909. In State-B moving U
  4910. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4911. predict error 0
  4912. dir: dir isR
  4913. \683: O: O1366 (predict-no)
  4914. I see 1 and I'm going to do: predict-no
  4915. ENV: Agent did: predict-no for direction R in state State-B
  4916. In State-B moving R
  4917. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4918. predict error 0
  4919. dir: dir isL
  4920. -/|684: O: O1367 (predict-yes)
  4921. I see 1 and I'm going to do: predict-yes
  4922. ENV: Agent did: predict-yes for direction L in state State-B
  4923. In State-B moving L
  4924. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4925. predict error 0
  4926. dir: dir isU
  4927. \-/685: O: O1370 (predict-no)
  4928. I see 1 and I'm going to do: predict-no
  4929. ENV: Agent did: predict-no for direction U in state State-A
  4930. In State-A moving U
  4931. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4932. predict error 0
  4933. dir: dir isR
  4934. |\-686: O: O1371 (predict-yes)
  4935. I see 1 and I'm going to do: predict-yes
  4936. ENV: Agent did: predict-yes for direction R in state State-A
  4937. In State-A moving R
  4938. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4939. predict error 0
  4940. dir: dir isU
  4941. /|\687: O: O1374 (predict-no)
  4942. I see 1 and I'm going to do: predict-no
  4943. ENV: Agent did: predict-no for direction U in state State-B
  4944. In State-B moving U
  4945. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4946. predict error 0
  4947. dir: dir isR
  4948. -/|688: O: O1376 (predict-no)
  4949. I see 1 and I'm going to do: predict-no
  4950. ENV: Agent did: predict-no for direction R in state State-B
  4951. In State-B moving R
  4952. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4953. predict error 0
  4954. dir: dir isU
  4955. \-/689: O: O1378 (predict-no)
  4956. I see 1 and I'm going to do: predict-no
  4957. ENV: Agent did: predict-no for direction U in state State-B
  4958. In State-B moving U
  4959. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4960. predict error 0
  4961. dir: dir isR
  4962. |\-690: O: O1380 (predict-no)
  4963. I see 1 and I'm going to do: predict-no
  4964. ENV: Agent did: predict-no for direction R in state State-B
  4965. In State-B moving R
  4966. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4967. predict error 0
  4968. dir: dir isL
  4969. /|\691: O: O1381 (predict-yes)
  4970. I see 1 and I'm going to do: predict-yes
  4971. ENV: Agent did: predict-yes for direction L in state State-B
  4972. In State-B moving L
  4973. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4974. predict error 0
  4975. dir: dir isL
  4976. -692: O: O1384 (predict-no)
  4977. I see 1 and I'm going to do: predict-no
  4978. ENV: Agent did: predict-no for direction L in state State-A
  4979. In State-A moving L
  4980. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4981. predict error 0
  4982. dir: dir isU
  4983. /|\693: O: O1386 (predict-no)
  4984. I see 1 and I'm going to do: predict-no
  4985. ENV: Agent did: predict-no for direction U in state State-A
  4986. In State-A moving U
  4987. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4988. predict error 0
  4989. dir: dir isR
  4990. -/694: O: O1387 (predict-yes)
  4991. I see 1 and I'm going to do: predict-yes
  4992. ENV: Agent did: predict-yes for direction R in state State-A
  4993. In State-A moving R
  4994. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4995. predict error 0
  4996. dir: dir isL
  4997. |\695: O: O1389 (predict-yes)
  4998. I see 1 and I'm going to do: predict-yes
  4999. ENV: Agent did: predict-yes for direction L in state State-B
  5000. In State-B moving L
  5001. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5002. predict error 0
  5003. dir: dir isR
  5004. -/|696: O: O1391 (predict-yes)
  5005. I see 1 and I'm going to do: predict-yes
  5006. ENV: Agent did: predict-yes for direction R in state State-A
  5007. In State-A moving R
  5008. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5009. predict error 0
  5010. dir: dir isL
  5011. \-/697: O: O1393 (predict-yes)
  5012. I see 1 and I'm going to do: predict-yes
  5013. ENV: Agent did: predict-yes for direction L in state State-B
  5014. In State-B moving L
  5015. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5016. predict error 0
  5017. dir: dir isL
  5018. |\698: O: O1396 (predict-no)
  5019. I see 1 and I'm going to do: predict-no
  5020. ENV: Agent did: predict-no for direction L in state State-A
  5021. In State-A moving L
  5022. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5023. predict error 0
  5024. dir: dir isL
  5025. -/|699: O: O1398 (predict-no)
  5026. I see 1 and I'm going to do: predict-no
  5027. ENV: Agent did: predict-no for direction L in state State-A
  5028. In State-A moving L
  5029. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5030. predict error 0
  5031. dir: dir isL
  5032. \-/700: O: O1400 (predict-no)
  5033. I see 1 and I'm going to do: predict-no
  5034. ENV: Agent did: predict-no for direction L in state State-A
  5035. In State-A moving L
  5036. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5037. predict error 0
  5038. dir: dir isR
  5039. |\-701: O: O1401 (predict-yes)
  5040. I see 1 and I'm going to do: predict-yes
  5041. ENV: Agent did: predict-yes for direction R in state State-A
  5042. In State-A moving R
  5043. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5044. predict error 0
  5045. dir: dir isL
  5046. /702: O: O1403 (predict-yes)
  5047. I see 1 and I'm going to do: predict-yes
  5048. ENV: Agent did: predict-yes for direction L in state State-B
  5049. In State-B moving L
  5050. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5051. predict error 0
  5052. dir: dir isR
  5053. |\-703: O: O1405 (predict-yes)
  5054. I see 1 and I'm going to do: predict-yes
  5055. ENV: Agent did: predict-yes for direction R in state State-A
  5056. In State-A moving R
  5057. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5058. predict error 0
  5059. dir: dir isR
  5060. /|\704: O: O1408 (predict-no)
  5061. I see 1 and I'm going to do: predict-no
  5062. ENV: Agent did: predict-no for direction R in state State-B
  5063. In State-B moving R
  5064. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5065. predict error 0
  5066. dir: dir isU
  5067. -/|705: O: O1410 (predict-no)
  5068. I see 1 and I'm going to do: predict-no
  5069. ENV: Agent did: predict-no for direction U in state State-B
  5070. In State-B moving U
  5071. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5072. predict error 0
  5073. dir: dir isR
  5074. \-706: O: O1412 (predict-no)
  5075. I see 1 and I'm going to do: predict-no
  5076. ENV: Agent did: predict-no for direction R in state State-B
  5077. In State-B moving R
  5078. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5079. predict error 0
  5080. dir: dir isL
  5081. /|\707: O: O1413 (predict-yes)
  5082. I see 1 and I'm going to do: predict-yes
  5083. ENV: Agent did: predict-yes for direction L in state State-B
  5084. In State-B moving L
  5085. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5086. predict error 0
  5087. dir: dir isU
  5088. -/|708: O: O1416 (predict-no)
  5089. I see 1 and I'm going to do: predict-no
  5090. ENV: Agent did: predict-no for direction U in state State-A
  5091. In State-A moving U
  5092. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5093. predict error 0
  5094. dir: dir isR
  5095. \-/709: O: O1417 (predict-yes)
  5096. I see 1 and I'm going to do: predict-yes
  5097. ENV: Agent did: predict-yes for direction R in state State-A
  5098. In State-A moving R
  5099. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5100. predict error 0
  5101. dir: dir isR
  5102. |\-710: O: O1420 (predict-no)
  5103. I see 1 and I'm going to do: predict-no
  5104. ENV: Agent did: predict-no for direction R in state State-B
  5105. In State-B moving R
  5106. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5107. predict error 0
  5108. dir: dir isR
  5109. /|\711: O: O1422 (predict-no)
  5110. I see 1 and I'm going to do: predict-no
  5111. ENV: Agent did: predict-no for direction R in state State-B
  5112. In State-B moving R
  5113. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5114. predict error 0
  5115. dir: dir isR
  5116. -712: O: O1424 (predict-no)
  5117. I see 1 and I'm going to do: predict-no
  5118. ENV: Agent did: predict-no for direction R in state State-B
  5119. In State-B moving R
  5120. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5121. predict error 0
  5122. dir: dir isU
  5123. /|713: O: O1426 (predict-no)
  5124. I see 1 and I'm going to do: predict-no
  5125. ENV: Agent did: predict-no for direction U in state State-B
  5126. In State-B moving U
  5127. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5128. predict error 0
  5129. dir: dir isU
  5130. \-/714: O: O1428 (predict-no)
  5131. I see 1 and I'm going to do: predict-no
  5132. ENV: Agent did: predict-no for direction U in state State-B
  5133. In State-B moving U
  5134. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5135. predict error 0
  5136. dir: dir isU
  5137. |\-715: O: O1430 (predict-no)
  5138. I see 1 and I'm going to do: predict-no
  5139. ENV: Agent did: predict-no for direction U in state State-B
  5140. In State-B moving U
  5141. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5142. predict error 0
  5143. dir: dir isU
  5144. /|\-716: O: O1432 (predict-no)
  5145. I see 1 and I'm going to do: predict-no
  5146. ENV: Agent did: predict-no for direction U in state State-B
  5147. In State-B moving U
  5148. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5149. predict error 0
  5150. dir: dir isR
  5151. /|717: O: O1434 (predict-no)
  5152. I see 1 and I'm going to do: predict-no
  5153. ENV: Agent did: predict-no for direction R in state State-B
  5154. In State-B moving R
  5155. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5156. predict error 0
  5157. dir: dir isR
  5158. \-/718: O: O1436 (predict-no)
  5159. I see 1 and I'm going to do: predict-no
  5160. ENV: Agent did: predict-no for direction R in state State-B
  5161. In State-B moving R
  5162. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5163. predict error 0
  5164. dir: dir isU
  5165. |\-719: O: O1438 (predict-no)
  5166. I see 1 and I'm going to do: predict-no
  5167. ENV: Agent did: predict-no for direction U in state State-B
  5168. In State-B moving U
  5169. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5170. predict error 0
  5171. dir: dir isL
  5172. /|\720: O: O1439 (predict-yes)
  5173. I see 1 and I'm going to do: predict-yes
  5174. ENV: Agent did: predict-yes for direction L in state State-B
  5175. In State-B moving L
  5176. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5177. predict error 0
  5178. dir: dir isL
  5179. -/|721: O: O1442 (predict-no)
  5180. I see 1 and I'm going to do: predict-no
  5181. ENV: Agent did: predict-no for direction L in state State-A
  5182. In State-A moving L
  5183. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5184. predict error 0
  5185. dir: dir isL
  5186. \722: O: O1444 (predict-no)
  5187. I see 1 and I'm going to do: predict-no
  5188. ENV: Agent did: predict-no for direction L in state State-A
  5189. In State-A moving L
  5190. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5191. predict error 0
  5192. dir: dir isL
  5193. -/|723: O: O1446 (predict-no)
  5194. I see 1 and I'm going to do: predict-no
  5195. ENV: Agent did: predict-no for direction L in state State-A
  5196. In State-A moving L
  5197. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5198. predict error 0
  5199. dir: dir isL
  5200. \-724: O: O1448 (predict-no)
  5201. I see 1 and I'm going to do: predict-no
  5202. ENV: Agent did: predict-no for direction L in state State-A
  5203. In State-A moving L
  5204. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5205. predict error 0
  5206. dir: dir isR
  5207. /|\725: O: O1449 (predict-yes)
  5208. I see 1 and I'm going to do: predict-yes
  5209. ENV: Agent did: predict-yes for direction R in state State-A
  5210. In State-A moving R
  5211. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5212. predict error 0
  5213. dir: dir isL
  5214. -/|726: O: O1451 (predict-yes)
  5215. I see 1 and I'm going to do: predict-yes
  5216. ENV: Agent did: predict-yes for direction L in state State-B
  5217. In State-B moving L
  5218. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5219. predict error 0
  5220. dir: dir isU
  5221. \-/727: O: O1454 (predict-no)
  5222. I see 1 and I'm going to do: predict-no
  5223. ENV: Agent did: predict-no for direction U in state State-A
  5224. In State-A moving U
  5225. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5226. predict error 0
  5227. dir: dir isU
  5228. |\728: O: O1456 (predict-no)
  5229. I see 1 and I'm going to do: predict-no
  5230. ENV: Agent did: predict-no for direction U in state State-A
  5231. In State-A moving U
  5232. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5233. predict error 0
  5234. dir: dir isU
  5235. -/|729: O: O1458 (predict-no)
  5236. I see 1 and I'm going to do: predict-no
  5237. ENV: Agent did: predict-no for direction U in state State-A
  5238. In State-A moving U
  5239. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5240. predict error 0
  5241. dir: dir isR
  5242. \-/730: O: O1459 (predict-yes)
  5243. I see 1 and I'm going to do: predict-yes
  5244. ENV: Agent did: predict-yes for direction R in state State-A
  5245. In State-A moving R
  5246. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5247. predict error 0
  5248. dir: dir isU
  5249. |\-731: O: O1462 (predict-no)
  5250. I see 1 and I'm going to do: predict-no
  5251. ENV: Agent did: predict-no for direction U in state State-B
  5252. In State-B moving U
  5253. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5254. predict error 0
  5255. dir: dir isR
  5256. /732: O: O1464 (predict-no)
  5257. I see 1 and I'm going to do: predict-no
  5258. ENV: Agent did: predict-no for direction R in state State-B
  5259. In State-B moving R
  5260. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5261. predict error 0
  5262. dir: dir isR
  5263. |\-733: O: O1466 (predict-no)
  5264. I see 1 and I'm going to do: predict-no
  5265. ENV: Agent did: predict-no for direction R in state State-B
  5266. In State-B moving R
  5267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5268. predict error 0
  5269. dir: dir isL
  5270. /|\734: O: O1467 (predict-yes)
  5271. I see 1 and I'm going to do: predict-yes
  5272. ENV: Agent did: predict-yes for direction L in state State-B
  5273. In State-B moving L
  5274. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5275. predict error 0
  5276. dir: dir isU
  5277. -/|735: O: O1470 (predict-no)
  5278. I see 1 and I'm going to do: predict-no
  5279. ENV: Agent did: predict-no for direction U in state State-A
  5280. In State-A moving U
  5281. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5282. predict error 0
  5283. dir: dir isU
  5284. \-/736: O: O1472 (predict-no)
  5285. I see 1 and I'm going to do: predict-no
  5286. ENV: Agent did: predict-no for direction U in state State-A
  5287. In State-A moving U
  5288. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5289. predict error 0
  5290. dir: dir isL
  5291. |\-737: O: O1474 (predict-no)
  5292. I see 1 and I'm going to do: predict-no
  5293. ENV: Agent did: predict-no for direction L in state State-A
  5294. In State-A moving L
  5295. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5296. predict error 0
  5297. dir: dir isR
  5298. /|\738: O: O1475 (predict-yes)
  5299. I see 1 and I'm going to do: predict-yes
  5300. ENV: Agent did: predict-yes for direction R in state State-A
  5301. In State-A moving R
  5302. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5303. predict error 0
  5304. dir: dir isL
  5305. -/|739: O: O1477 (predict-yes)
  5306. I see 1 and I'm going to do: predict-yes
  5307. ENV: Agent did: predict-yes for direction L in state State-B
  5308. In State-B moving L
  5309. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5310. predict error 0
  5311. dir: dir isL
  5312. \-740: O: O1480 (predict-no)
  5313. I see 1 and I'm going to do: predict-no
  5314. ENV: Agent did: predict-no for direction L in state State-A
  5315. In State-A moving L
  5316. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5317. predict error 0
  5318. dir: dir isL
  5319. /|\741: O: O1482 (predict-no)
  5320. I see 1 and I'm going to do: predict-no
  5321. ENV: Agent did: predict-no for direction L in state State-A
  5322. In State-A moving L
  5323. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5324. predict error 0
  5325. dir: dir isR
  5326. -742: O: O1483 (predict-yes)
  5327. I see 1 and I'm going to do: predict-yes
  5328. ENV: Agent did: predict-yes for direction R in state State-A
  5329. In State-A moving R
  5330. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5331. predict error 0
  5332. dir: dir isL
  5333. /|\743: O: O1485 (predict-yes)
  5334. I see 1 and I'm going to do: predict-yes
  5335. ENV: Agent did: predict-yes for direction L in state State-B
  5336. In State-B moving L
  5337. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5338. predict error 0
  5339. dir: dir isR
  5340. -/744: O: O1487 (predict-yes)
  5341. I see 1 and I'm going to do: predict-yes
  5342. ENV: Agent did: predict-yes for direction R in state State-A
  5343. In State-A moving R
  5344. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5345. predict error 0
  5346. dir: dir isL
  5347. |745: O: O1489 (predict-yes)
  5348. I see 1 and I'm going to do: predict-yes
  5349. ENV: Agent did: predict-yes for direction L in state State-B
  5350. In State-B moving L
  5351. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5352. predict error 0
  5353. dir: dir isL
  5354. \-/746: O: O1492 (predict-no)
  5355. I see 1 and I'm going to do: predict-no
  5356. ENV: Agent did: predict-no for direction L in state State-A
  5357. In State-A moving L
  5358. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5359. predict error 0
  5360. dir: dir isU
  5361. |747: O: O1494 (predict-no)
  5362. I see 1 and I'm going to do: predict-no
  5363. ENV: Agent did: predict-no for direction U in state State-A
  5364. In State-A moving U
  5365. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5366. predict error 0
  5367. dir: dir isU
  5368. \-/748: O: O1496 (predict-no)
  5369. I see 1 and I'm going to do: predict-no
  5370. ENV: Agent did: predict-no for direction U in state State-A
  5371. In State-A moving U
  5372. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5373. predict error 0
  5374. dir: dir isL
  5375. |\-749: O: O1498 (predict-no)
  5376. I see 1 and I'm going to do: predict-no
  5377. ENV: Agent did: predict-no for direction L in state State-A
  5378. In State-A moving L
  5379. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5380. predict error 0
  5381. dir: dir isU
  5382. /|\750: O: O1500 (predict-no)
  5383. I see 1 and I'm going to do: predict-no
  5384. ENV: Agent did: predict-no for direction U in state State-A
  5385. In State-A moving U
  5386. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5387. predict error 0
  5388. dir: dir isL
  5389. -/|751: O: O1502 (predict-no)
  5390. I see 1 and I'm going to do: predict-no
  5391. ENV: Agent did: predict-no for direction L in state State-A
  5392. In State-A moving L
  5393. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5394. predict error 0
  5395. dir: dir isR
  5396. \752: O: O1503 (predict-yes)
  5397. I see 1 and I'm going to do: predict-yes
  5398. ENV: Agent did: predict-yes for direction R in state State-A
  5399. In State-A moving R
  5400. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5401. predict error 0
  5402. dir: dir isU
  5403. -/|753: O: O1506 (predict-no)
  5404. I see 1 and I'm going to do: predict-no
  5405. ENV: Agent did: predict-no for direction U in state State-B
  5406. In State-B moving U
  5407. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5408. predict error 0
  5409. dir: dir isL
  5410. \-754: O: O1507 (predict-yes)
  5411. I see 1 and I'm going to do: predict-yes
  5412. ENV: Agent did: predict-yes for direction L in state State-B
  5413. In State-B moving L
  5414. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5415. predict error 0
  5416. dir: dir isU
  5417. /|755: O: O1510 (predict-no)
  5418. I see 1 and I'm going to do: predict-no
  5419. ENV: Agent did: predict-no for direction U in state State-A
  5420. In State-A moving U
  5421. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5422. predict error 0
  5423. dir: dir isL
  5424. \-756: O: O1512 (predict-no)
  5425. I see 1 and I'm going to do: predict-no
  5426. ENV: Agent did: predict-no for direction L in state State-A
  5427. In State-A moving L
  5428. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5429. predict error 0
  5430. dir: dir isR
  5431. /|\757: O: O1513 (predict-yes)
  5432. I see 1 and I'm going to do: predict-yes
  5433. ENV: Agent did: predict-yes for direction R in state State-A
  5434. In State-A moving R
  5435. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5436. predict error 0
  5437. dir: dir isU
  5438. -/|758: O: O1516 (predict-no)
  5439. I see 1 and I'm going to do: predict-no
  5440. ENV: Agent did: predict-no for direction U in state State-B
  5441. In State-B moving U
  5442. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5443. predict error 0
  5444. dir: dir isL
  5445. \-/759: O: O1517 (predict-yes)
  5446. I see 1 and I'm going to do: predict-yes
  5447. ENV: Agent did: predict-yes for direction L in state State-B
  5448. In State-B moving L
  5449. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5450. predict error 0
  5451. dir: dir isU
  5452. |\-760: O: O1520 (predict-no)
  5453. I see 1 and I'm going to do: predict-no
  5454. ENV: Agent did: predict-no for direction U in state State-A
  5455. In State-A moving U
  5456. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5457. predict error 0
  5458. dir: dir isU
  5459. /761: O: O1522 (predict-no)
  5460. I see 1 and I'm going to do: predict-no
  5461. ENV: Agent did: predict-no for direction U in state State-A
  5462. In State-A moving U
  5463. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5464. predict error 0
  5465. dir: dir isR
  5466. |762: O: O1523 (predict-yes)
  5467. I see 1 and I'm going to do: predict-yes
  5468. ENV: Agent did: predict-yes for direction R in state State-A
  5469. In State-A moving R
  5470. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5471. predict error 0
  5472. dir: dir isL
  5473. \-/763: O: O1525 (predict-yes)
  5474. I see 1 and I'm going to do: predict-yes
  5475. ENV: Agent did: predict-yes for direction L in state State-B
  5476. In State-B moving L
  5477. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5478. predict error 0
  5479. dir: dir isL
  5480. |\-/sleeping...
  5481. |764: O: O1528 (predict-no)
  5482. I see 1 and I'm going to do: predict-no
  5483. ENV: Agent did: predict-no for direction L in state State-A
  5484. In State-A moving L
  5485. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5486. predict error 0
  5487. dir: dir isL
  5488. \-765: O: O1530 (predict-no)
  5489. I see 1 and I'm going to do: predict-no
  5490. ENV: Agent did: predict-no for direction L in state State-A
  5491. In State-A moving L
  5492. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5493. predict error 0
  5494. dir: dir isU
  5495. /|\766: O: O1532 (predict-no)
  5496. I see 1 and I'm going to do: predict-no
  5497. ENV: Agent did: predict-no for direction U in state State-A
  5498. In State-A moving U
  5499. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5500. predict error 0
  5501. dir: dir isR
  5502. -/767: O: O1533 (predict-yes)
  5503. I see 1 and I'm going to do: predict-yes
  5504. ENV: Agent did: predict-yes for direction R in state State-A
  5505. In State-A moving R
  5506. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5507. predict error 0
  5508. dir: dir isU
  5509. |\-768: O: O1536 (predict-no)
  5510. I see 1 and I'm going to do: predict-no
  5511. ENV: Agent did: predict-no for direction U in state State-B
  5512. In State-B moving U
  5513. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5514. predict error 0
  5515. dir: dir isR
  5516. /|\769: O: O1538 (predict-no)
  5517. I see 1 and I'm going to do: predict-no
  5518. ENV: Agent did: predict-no for direction R in state State-B
  5519. In State-B moving R
  5520. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5521. predict error 0
  5522. dir: dir isL
  5523. -/770: O: O1540 (predict-no)
  5524. I see 1 and I'm going to do: predict-no
  5525. ENV: Agent did: predict-no for direction L in state State-B
  5526. In State-B moving L
  5527. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  5528. predict error 1
  5529. dir: dir isR
  5530. |\-771: O: O1541 (predict-yes)
  5531. I see 0 and I'm going to do: predict-yes
  5532. ENV: Agent did: predict-yes for direction R in state State-A
  5533. In State-A moving R
  5534. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5535. predict error 0
  5536. dir: dir isU
  5537. /772: O: O1544 (predict-no)
  5538. I see 1 and I'm going to do: predict-no
  5539. ENV: Agent did: predict-no for direction U in state State-B
  5540. In State-B moving U
  5541. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5542. predict error 0
  5543. dir: dir isU
  5544. |\-773: O: O1546 (predict-no)
  5545. I see 1 and I'm going to do: predict-no
  5546. ENV: Agent did: predict-no for direction U in state State-B
  5547. In State-B moving U
  5548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5549. predict error 0
  5550. dir: dir isL
  5551. /|\774: O: O1547 (predict-yes)
  5552. I see 1 and I'm going to do: predict-yes
  5553. ENV: Agent did: predict-yes for direction L in state State-B
  5554. In State-B moving L
  5555. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5556. predict error 0
  5557. dir: dir isL
  5558. -/775: O: O1550 (predict-no)
  5559. I see 1 and I'm going to do: predict-no
  5560. ENV: Agent did: predict-no for direction L in state State-A
  5561. In State-A moving L
  5562. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5563. predict error 0
  5564. dir: dir isR
  5565. |\776: O: O1551 (predict-yes)
  5566. I see 1 and I'm going to do: predict-yes
  5567. ENV: Agent did: predict-yes for direction R in state State-A
  5568. In State-A moving R
  5569. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5570. predict error 0
  5571. dir: dir isL
  5572. -/|777: O: O1553 (predict-yes)
  5573. I see 1 and I'm going to do: predict-yes
  5574. ENV: Agent did: predict-yes for direction L in state State-B
  5575. In State-B moving L
  5576. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5577. predict error 0
  5578. dir: dir isU
  5579. \-/778: O: O1556 (predict-no)
  5580. I see 1 and I'm going to do: predict-no
  5581. ENV: Agent did: predict-no for direction U in state State-A
  5582. In State-A moving U
  5583. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5584. predict error 0
  5585. dir: dir isU
  5586. |\-779: O: O1558 (predict-no)
  5587. I see 1 and I'm going to do: predict-no
  5588. ENV: Agent did: predict-no for direction U in state State-A
  5589. In State-A moving U
  5590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5591. predict error 0
  5592. dir: dir isL
  5593. /|\780: O: O1560 (predict-no)
  5594. I see 1 and I'm going to do: predict-no
  5595. ENV: Agent did: predict-no for direction L in state State-A
  5596. In State-A moving L
  5597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5598. predict error 0
  5599. dir: dir isR
  5600. -/|781: O: O1561 (predict-yes)
  5601. I see 1 and I'm going to do: predict-yes
  5602. ENV: Agent did: predict-yes for direction R in state State-A
  5603. In State-A moving R
  5604. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5605. predict error 0
  5606. dir: dir isR
  5607. \782: O: O1564 (predict-no)
  5608. I see 1 and I'm going to do: predict-no
  5609. ENV: Agent did: predict-no for direction R in state State-B
  5610. In State-B moving R
  5611. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5612. predict error 0
  5613. dir: dir isL
  5614. -/783: O: O1565 (predict-yes)
  5615. I see 1 and I'm going to do: predict-yes
  5616. ENV: Agent did: predict-yes for direction L in state State-B
  5617. In State-B moving L
  5618. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5619. predict error 0
  5620. dir: dir isR
  5621. |\-784: O: O1567 (predict-yes)
  5622. I see 1 and I'm going to do: predict-yes
  5623. ENV: Agent did: predict-yes for direction R in state State-A
  5624. In State-A moving R
  5625. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5626. predict error 0
  5627. dir: dir isL
  5628. /|\785: O: O1569 (predict-yes)
  5629. I see 1 and I'm going to do: predict-yes
  5630. ENV: Agent did: predict-yes for direction L in state State-B
  5631. In State-B moving L
  5632. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5633. predict error 0
  5634. dir: dir isL
  5635. -/786: O: O1572 (predict-no)
  5636. I see 1 and I'm going to do: predict-no
  5637. ENV: Agent did: predict-no for direction L in state State-A
  5638. In State-A moving L
  5639. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5640. predict error 0
  5641. dir: dir isR
  5642. |\-787: O: O1573 (predict-yes)
  5643. I see 1 and I'm going to do: predict-yes
  5644. ENV: Agent did: predict-yes for direction R in state State-A
  5645. In State-A moving R
  5646. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5647. predict error 0
  5648. dir: dir isR
  5649. /|788: O: O1576 (predict-no)
  5650. I see 1 and I'm going to do: predict-no
  5651. ENV: Agent did: predict-no for direction R in state State-B
  5652. In State-B moving R
  5653. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5654. predict error 0
  5655. dir: dir isR
  5656. \-/789: O: O1578 (predict-no)
  5657. I see 1 and I'm going to do: predict-no
  5658. ENV: Agent did: predict-no for direction R in state State-B
  5659. In State-B moving R
  5660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5661. predict error 0
  5662. dir: dir isL
  5663. |\-790: O: O1579 (predict-yes)
  5664. I see 1 and I'm going to do: predict-yes
  5665. ENV: Agent did: predict-yes for direction L in state State-B
  5666. In State-B moving L
  5667. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5668. predict error 0
  5669. dir: dir isL
  5670. /|\791: O: O1582 (predict-no)
  5671. I see 1 and I'm going to do: predict-no
  5672. ENV: Agent did: predict-no for direction L in state State-A
  5673. In State-A moving L
  5674. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5675. predict error 0
  5676. dir: dir isL
  5677. -792: O: O1584 (predict-no)
  5678. I see 1 and I'm going to do: predict-no
  5679. ENV: Agent did: predict-no for direction L in state State-A
  5680. In State-A moving L
  5681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5682. predict error 0
  5683. dir: dir isU
  5684. /|\793: O: O1586 (predict-no)
  5685. I see 1 and I'm going to do: predict-no
  5686. ENV: Agent did: predict-no for direction U in state State-A
  5687. In State-A moving U
  5688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5689. predict error 0
  5690. dir: dir isL
  5691. -/794: O: O1588 (predict-no)
  5692. I see 1 and I'm going to do: predict-no
  5693. ENV: Agent did: predict-no for direction L in state State-A
  5694. In State-A moving L
  5695. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5696. predict error 0
  5697. dir: dir isU
  5698. |\795: O: O1590 (predict-no)
  5699. I see 1 and I'm going to do: predict-no
  5700. ENV: Agent did: predict-no for direction U in state State-A
  5701. In State-A moving U
  5702. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5703. predict error 0
  5704. dir: dir isL
  5705. -/|796: O: O1592 (predict-no)
  5706. I see 1 and I'm going to do: predict-no
  5707. ENV: Agent did: predict-no for direction L in state State-A
  5708. In State-A moving L
  5709. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5710. predict error 0
  5711. dir: dir isL
  5712. \-/797: O: O1594 (predict-no)
  5713. I see 1 and I'm going to do: predict-no
  5714. ENV: Agent did: predict-no for direction L in state State-A
  5715. In State-A moving L
  5716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5717. predict error 0
  5718. dir: dir isU
  5719. |\-798: O: O1596 (predict-no)
  5720. I see 1 and I'm going to do: predict-no
  5721. ENV: Agent did: predict-no for direction U in state State-A
  5722. In State-A moving U
  5723. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5724. predict error 0
  5725. dir: dir isR
  5726. /|799: O: O1597 (predict-yes)
  5727. I see 1 and I'm going to do: predict-yes
  5728. ENV: Agent did: predict-yes for direction R in state State-A
  5729. In State-A moving R
  5730. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5731. predict error 0
  5732. dir: dir isU
  5733. \-/800: O: O1600 (predict-no)
  5734. I see 1 and I'm going to do: predict-no
  5735. ENV: Agent did: predict-no for direction U in state State-B
  5736. In State-B moving U
  5737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5738. predict error 0
  5739. dir: dir isR
  5740. |\801: O: O1602 (predict-no)
  5741. I see 1 and I'm going to do: predict-no
  5742. ENV: Agent did: predict-no for direction R in state State-B
  5743. In State-B moving R
  5744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5745. predict error 0
  5746. dir: dir isU
  5747. -802: O: O1604 (predict-no)
  5748. I see 1 and I'm going to do: predict-no
  5749. ENV: Agent did: predict-no for direction U in state State-B
  5750. In State-B moving U
  5751. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5752. predict error 0
  5753. dir: dir isL
  5754. /|\803: O: O1605 (predict-yes)
  5755. I see 1 and I'm going to do: predict-yes
  5756. ENV: Agent did: predict-yes for direction L in state State-B
  5757. In State-B moving L
  5758. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5759. predict error 0
  5760. dir: dir isR
  5761. -/804: O: O1607 (predict-yes)
  5762. I see 1 and I'm going to do: predict-yes
  5763. ENV: Agent did: predict-yes for direction R in state State-A
  5764. In State-A moving R
  5765. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5766. predict error 0
  5767. dir: dir isL
  5768. |\-805: O: O1609 (predict-yes)
  5769. I see 1 and I'm going to do: predict-yes
  5770. ENV: Agent did: predict-yes for direction L in state State-B
  5771. In State-B moving L
  5772. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5773. predict error 0
  5774. dir: dir isU
  5775. /|\806: O: O1612 (predict-no)
  5776. I see 1 and I'm going to do: predict-no
  5777. ENV: Agent did: predict-no for direction U in state State-A
  5778. In State-A moving U
  5779. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5780. predict error 0
  5781. dir: dir isR
  5782. -/|807: O: O1613 (predict-yes)
  5783. I see 1 and I'm going to do: predict-yes
  5784. ENV: Agent did: predict-yes for direction R in state State-A
  5785. In State-A moving R
  5786. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5787. predict error 0
  5788. dir: dir isU
  5789. \-/808: O: O1616 (predict-no)
  5790. I see 1 and I'm going to do: predict-no
  5791. ENV: Agent did: predict-no for direction U in state State-B
  5792. In State-B moving U
  5793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5794. predict error 0
  5795. dir: dir isU
  5796. |\-809: O: O1618 (predict-no)
  5797. I see 1 and I'm going to do: predict-no
  5798. ENV: Agent did: predict-no for direction U in state State-B
  5799. In State-B moving U
  5800. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5801. predict error 0
  5802. dir: dir isR
  5803. /|810: O: O1620 (predict-no)
  5804. I see 1 and I'm going to do: predict-no
  5805. ENV: Agent did: predict-no for direction R in state State-B
  5806. In State-B moving R
  5807. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5808. predict error 0
  5809. dir: dir isL
  5810. \-/811: O: O1621 (predict-yes)
  5811. I see 1 and I'm going to do: predict-yes
  5812. ENV: Agent did: predict-yes for direction L in state State-B
  5813. In State-B moving L
  5814. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5815. predict error 0
  5816. dir: dir isU
  5817. |812: O: O1624 (predict-no)
  5818. I see 1 and I'm going to do: predict-no
  5819. ENV: Agent did: predict-no for direction U in state State-A
  5820. In State-A moving U
  5821. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5822. predict error 0
  5823. dir: dir isL
  5824. \-813: O: O1626 (predict-no)
  5825. I see 1 and I'm going to do: predict-no
  5826. ENV: Agent did: predict-no for direction L in state State-A
  5827. In State-A moving L
  5828. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5829. predict error 0
  5830. dir: dir isR
  5831. /|\814: O: O1627 (predict-yes)
  5832. I see 1 and I'm going to do: predict-yes
  5833. ENV: Agent did: predict-yes for direction R in state State-A
  5834. In State-A moving R
  5835. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5836. predict error 0
  5837. dir: dir isU
  5838. -/|815: O: O1630 (predict-no)
  5839. I see 1 and I'm going to do: predict-no
  5840. ENV: Agent did: predict-no for direction U in state State-B
  5841. In State-B moving U
  5842. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5843. predict error 0
  5844. dir: dir isL
  5845. \-/816: O: O1631 (predict-yes)
  5846. I see 1 and I'm going to do: predict-yes
  5847. ENV: Agent did: predict-yes for direction L in state State-B
  5848. In State-B moving L
  5849. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5850. predict error 0
  5851. dir: dir isR
  5852. |\-817: O: O1633 (predict-yes)
  5853. I see 1 and I'm going to do: predict-yes
  5854. ENV: Agent did: predict-yes for direction R in state State-A
  5855. In State-A moving R
  5856. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5857. predict error 0
  5858. dir: dir isL
  5859. /|\818: O: O1635 (predict-yes)
  5860. I see 1 and I'm going to do: predict-yes
  5861. ENV: Agent did: predict-yes for direction L in state State-B
  5862. In State-B moving L
  5863. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5864. predict error 0
  5865. dir: dir isL
  5866. -/|819: O: O1638 (predict-no)
  5867. I see 1 and I'm going to do: predict-no
  5868. ENV: Agent did: predict-no for direction L in state State-A
  5869. In State-A moving L
  5870. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5871. predict error 0
  5872. dir: dir isU
  5873. \-/820: O: O1640 (predict-no)
  5874. I see 1 and I'm going to do: predict-no
  5875. ENV: Agent did: predict-no for direction U in state State-A
  5876. In State-A moving U
  5877. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5878. predict error 0
  5879. dir: dir isR
  5880. |\-821: O: O1641 (predict-yes)
  5881. I see 1 and I'm going to do: predict-yes
  5882. ENV: Agent did: predict-yes for direction R in state State-A
  5883. In State-A moving R
  5884. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5885. predict error 0
  5886. dir: dir isL
  5887. /822: O: O1643 (predict-yes)
  5888. I see 1 and I'm going to do: predict-yes
  5889. ENV: Agent did: predict-yes for direction L in state State-B
  5890. In State-B moving L
  5891. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5892. predict error 0
  5893. dir: dir isR
  5894. |\823: O: O1645 (predict-yes)
  5895. I see 1 and I'm going to do: predict-yes
  5896. ENV: Agent did: predict-yes for direction R in state State-A
  5897. In State-A moving R
  5898. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5899. predict error 0
  5900. dir: dir isL
  5901. -/|824: O: O1647 (predict-yes)
  5902. I see 1 and I'm going to do: predict-yes
  5903. ENV: Agent did: predict-yes for direction L in state State-B
  5904. In State-B moving L
  5905. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5906. predict error 0
  5907. dir: dir isL
  5908. \-/825: O: O1650 (predict-no)
  5909. I see 1 and I'm going to do: predict-no
  5910. ENV: Agent did: predict-no for direction L in state State-A
  5911. In State-A moving L
  5912. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5913. predict error 0
  5914. dir: dir isR
  5915. |\826: O: O1651 (predict-yes)
  5916. I see 1 and I'm going to do: predict-yes
  5917. ENV: Agent did: predict-yes for direction R in state State-A
  5918. In State-A moving R
  5919. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5920. predict error 0
  5921. dir: dir isU
  5922. -/|827: O: O1654 (predict-no)
  5923. I see 1 and I'm going to do: predict-no
  5924. ENV: Agent did: predict-no for direction U in state State-B
  5925. In State-B moving U
  5926. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5927. predict error 0
  5928. dir: dir isR
  5929. \-/828: O: O1656 (predict-no)
  5930. I see 1 and I'm going to do: predict-no
  5931. ENV: Agent did: predict-no for direction R in state State-B
  5932. In State-B moving R
  5933. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5934. predict error 0
  5935. dir: dir isL
  5936. |\-829: O: O1657 (predict-yes)
  5937. I see 1 and I'm going to do: predict-yes
  5938. ENV: Agent did: predict-yes for direction L in state State-B
  5939. In State-B moving L
  5940. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5941. predict error 0
  5942. dir: dir isU
  5943. /|\-830: O: O1660 (predict-no)
  5944. I see 1 and I'm going to do: predict-no
  5945. ENV: Agent did: predict-no for direction U in state State-A
  5946. In State-A moving U
  5947. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5948. predict error 0
  5949. dir: dir isU
  5950. /|\831: O: O1662 (predict-no)
  5951. I see 1 and I'm going to do: predict-no
  5952. ENV: Agent did: predict-no for direction U in state State-A
  5953. In State-A moving U
  5954. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5955. predict error 0
  5956. dir: dir isU
  5957. -832: O: O1664 (predict-no)
  5958. I see 1 and I'm going to do: predict-no
  5959. ENV: Agent did: predict-no for direction U in state State-A
  5960. In State-A moving U
  5961. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5962. predict error 0
  5963. dir: dir isR
  5964. /|833: O: O1665 (predict-yes)
  5965. I see 1 and I'm going to do: predict-yes
  5966. ENV: Agent did: predict-yes for direction R in state State-A
  5967. In State-A moving R
  5968. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5969. predict error 0
  5970. dir: dir isU
  5971. \-834: O: O1668 (predict-no)
  5972. I see 1 and I'm going to do: predict-no
  5973. ENV: Agent did: predict-no for direction U in state State-B
  5974. In State-B moving U
  5975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5976. predict error 0
  5977. dir: dir isL
  5978. /|\835: O: O1669 (predict-yes)
  5979. I see 1 and I'm going to do: predict-yes
  5980. ENV: Agent did: predict-yes for direction L in state State-B
  5981. In State-B moving L
  5982. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5983. predict error 0
  5984. dir: dir isU
  5985. -/|836: O: O1672 (predict-no)
  5986. I see 1 and I'm going to do: predict-no
  5987. ENV: Agent did: predict-no for direction U in state State-A
  5988. In State-A moving U
  5989. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5990. predict error 0
  5991. dir: dir isU
  5992. \-/837: O: O1674 (predict-no)
  5993. I see 1 and I'm going to do: predict-no
  5994. ENV: Agent did: predict-no for direction U in state State-A
  5995. In State-A moving U
  5996. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5997. predict error 0
  5998. dir: dir isU
  5999. |\-838: O: O1676 (predict-no)
  6000. I see 1 and I'm going to do: predict-no
  6001. ENV: Agent did: predict-no for direction U in state State-A
  6002. In State-A moving U
  6003. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6004. predict error 0
  6005. dir: dir isR
  6006. /|\839: O: O1677 (predict-yes)
  6007. I see 1 and I'm going to do: predict-yes
  6008. ENV: Agent did: predict-yes for direction R in state State-A
  6009. In State-A moving R
  6010. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6011. predict error 0
  6012. dir: dir isR
  6013. -/840: O: O1680 (predict-no)
  6014. I see 1 and I'm going to do: predict-no
  6015. ENV: Agent did: predict-no for direction R in state State-B
  6016. In State-B moving R
  6017. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6018. predict error 0
  6019. dir: dir isR
  6020. |\-841: O: O1682 (predict-no)
  6021. I see 1 and I'm going to do: predict-no
  6022. ENV: Agent did: predict-no for direction R in state State-B
  6023. In State-B moving R
  6024. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6025. predict error 0
  6026. dir: dir isU
  6027. /842: O: O1684 (predict-no)
  6028. I see 1 and I'm going to do: predict-no
  6029. ENV: Agent did: predict-no for direction U in state State-B
  6030. In State-B moving U
  6031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6032. predict error 0
  6033. dir: dir isL
  6034. |\-843: O: O1685 (predict-yes)
  6035. I see 1 and I'm going to do: predict-yes
  6036. ENV: Agent did: predict-yes for direction L in state State-B
  6037. In State-B moving L
  6038. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6039. predict error 0
  6040. dir: dir isU
  6041. /|844: O: O1688 (predict-no)
  6042. I see 1 and I'm going to do: predict-no
  6043. ENV: Agent did: predict-no for direction U in state State-A
  6044. In State-A moving U
  6045. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6046. predict error 0
  6047. dir: dir isR
  6048. \-/845: O: O1689 (predict-yes)
  6049. I see 1 and I'm going to do: predict-yes
  6050. ENV: Agent did: predict-yes for direction R in state State-A
  6051. In State-A moving R
  6052. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6053. predict error 0
  6054. dir: dir isR
  6055. |\-846: O: O1692 (predict-no)
  6056. I see 1 and I'm going to do: predict-no
  6057. ENV: Agent did: predict-no for direction R in state State-B
  6058. In State-B moving R
  6059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6060. predict error 0
  6061. dir: dir isR
  6062. /|\847: O: O1694 (predict-no)
  6063. I see 1 and I'm going to do: predict-no
  6064. ENV: Agent did: predict-no for direction R in state State-B
  6065. In State-B moving R
  6066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6067. predict error 0
  6068. dir: dir isL
  6069. -/848: O: O1695 (predict-yes)
  6070. I see 1 and I'm going to do: predict-yes
  6071. ENV: Agent did: predict-yes for direction L in state State-B
  6072. In State-B moving L
  6073. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6074. predict error 0
  6075. dir: dir isL
  6076. |\-849: O: O1698 (predict-no)
  6077. I see 1 and I'm going to do: predict-no
  6078. ENV: Agent did: predict-no for direction L in state State-A
  6079. In State-A moving L
  6080. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6081. predict error 0
  6082. dir: dir isR
  6083. /|\850: O: O1699 (predict-yes)
  6084. I see 1 and I'm going to do: predict-yes
  6085. ENV: Agent did: predict-yes for direction R in state State-A
  6086. In State-A moving R
  6087. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6088. predict error 0
  6089. dir: dir isR
  6090. -/|851: O: O1702 (predict-no)
  6091. I see 1 and I'm going to do: predict-no
  6092. ENV: Agent did: predict-no for direction R in state State-B
  6093. In State-B moving R
  6094. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6095. predict error 0
  6096. dir: dir isR
  6097. \852: O: O1704 (predict-no)
  6098. I see 1 and I'm going to do: predict-no
  6099. ENV: Agent did: predict-no for direction R in state State-B
  6100. In State-B moving R
  6101. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6102. predict error 0
  6103. dir: dir isU
  6104. -/853: O: O1706 (predict-no)
  6105. I see 1 and I'm going to do: predict-no
  6106. ENV: Agent did: predict-no for direction U in state State-B
  6107. In State-B moving U
  6108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6109. predict error 0
  6110. dir: dir isR
  6111. |854: O: O1708 (predict-no)
  6112. I see 1 and I'm going to do: predict-no
  6113. ENV: Agent did: predict-no for direction R in state State-B
  6114. In State-B moving R
  6115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6116. predict error 0
  6117. dir: dir isL
  6118. \-/855: O: O1709 (predict-yes)
  6119. I see 1 and I'm going to do: predict-yes
  6120. ENV: Agent did: predict-yes for direction L in state State-B
  6121. In State-B moving L
  6122. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6123. predict error 0
  6124. dir: dir isU
  6125. |\-856: O: O1712 (predict-no)
  6126. I see 1 and I'm going to do: predict-no
  6127. ENV: Agent did: predict-no for direction U in state State-A
  6128. In State-A moving U
  6129. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6130. predict error 0
  6131. dir: dir isL
  6132. /|\857: O: O1714 (predict-no)
  6133. I see 1 and I'm going to do: predict-no
  6134. ENV: Agent did: predict-no for direction L in state State-A
  6135. In State-A moving L
  6136. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6137. predict error 0
  6138. dir: dir isR
  6139. -/858: O: O1715 (predict-yes)
  6140. I see 1 and I'm going to do: predict-yes
  6141. ENV: Agent did: predict-yes for direction R in state State-A
  6142. In State-A moving R
  6143. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6144. predict error 0
  6145. dir: dir isU
  6146. |\859: O: O1718 (predict-no)
  6147. I see 1 and I'm going to do: predict-no
  6148. ENV: Agent did: predict-no for direction U in state State-B
  6149. In State-B moving U
  6150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6151. predict error 0
  6152. dir: dir isU
  6153. -/|\860: O: O1720 (predict-no)
  6154. I see 1 and I'm going to do: predict-no
  6155. ENV: Agent did: predict-no for direction U in state State-B
  6156. In State-B moving U
  6157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6158. predict error 0
  6159. dir: dir isU
  6160. -/861: O: O1722 (predict-no)
  6161. I see 1 and I'm going to do: predict-no
  6162. ENV: Agent did: predict-no for direction U in state State-B
  6163. In State-B moving U
  6164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6165. predict error 0
  6166. dir: dir isR
  6167. |862: O: O1724 (predict-no)
  6168. I see 1 and I'm going to do: predict-no
  6169. ENV: Agent did: predict-no for direction R in state State-B
  6170. In State-B moving R
  6171. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6172. predict error 0
  6173. dir: dir isL
  6174. \-/863: O: O1725 (predict-yes)
  6175. I see 1 and I'm going to do: predict-yes
  6176. ENV: Agent did: predict-yes for direction L in state State-B
  6177. In State-B moving L
  6178. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6179. predict error 0
  6180. dir: dir isL
  6181. |\-864: O: O1728 (predict-no)
  6182. I see 1 and I'm going to do: predict-no
  6183. ENV: Agent did: predict-no for direction L in state State-A
  6184. In State-A moving L
  6185. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6186. predict error 0
  6187. dir: dir isU
  6188. /|\865: O: O1730 (predict-no)
  6189. I see 1 and I'm going to do: predict-no
  6190. ENV: Agent did: predict-no for direction U in state State-A
  6191. In State-A moving U
  6192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6193. predict error 0
  6194. dir: dir isU
  6195. -/|866: O: O1732 (predict-no)
  6196. I see 1 and I'm going to do: predict-no
  6197. ENV: Agent did: predict-no for direction U in state State-A
  6198. In State-A moving U
  6199. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6200. predict error 0
  6201. dir: dir isR
  6202. \-/867: O: O1733 (predict-yes)
  6203. I see 1 and I'm going to do: predict-yes
  6204. ENV: Agent did: predict-yes for direction R in state State-A
  6205. In State-A moving R
  6206. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6207. predict error 0
  6208. dir: dir isR
  6209. |\-868: O: O1736 (predict-no)
  6210. I see 1 and I'm going to do: predict-no
  6211. ENV: Agent did: predict-no for direction R in state State-B
  6212. In State-B moving R
  6213. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6214. predict error 0
  6215. dir: dir isU
  6216. /|\869: O: O1738 (predict-no)
  6217. I see 1 and I'm going to do: predict-no
  6218. ENV: Agent did: predict-no for direction U in state State-B
  6219. In State-B moving U
  6220. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6221. predict error 0
  6222. dir: dir isR
  6223. -/870: O: O1740 (predict-no)
  6224. I see 1 and I'm going to do: predict-no
  6225. ENV: Agent did: predict-no for direction R in state State-B
  6226. In State-B moving R
  6227. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6228. predict error 0
  6229. dir: dir isL
  6230. |\871: O: O1741 (predict-yes)
  6231. I see 1 and I'm going to do: predict-yes
  6232. ENV: Agent did: predict-yes for direction L in state State-B
  6233. In State-B moving L
  6234. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6235. predict error 0
  6236. dir: dir isU
  6237. -872: O: O1744 (predict-no)
  6238. I see 1 and I'm going to do: predict-no
  6239. ENV: Agent did: predict-no for direction U in state State-A
  6240. In State-A moving U
  6241. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6242. predict error 0
  6243. dir: dir isL
  6244. /873: O: O1746 (predict-no)
  6245. I see 1 and I'm going to do: predict-no
  6246. ENV: Agent did: predict-no for direction L in state State-A
  6247. In State-A moving L
  6248. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6249. predict error 0
  6250. dir: dir isR
  6251. |\874: O: O1747 (predict-yes)
  6252. I see 1 and I'm going to do: predict-yes
  6253. ENV: Agent did: predict-yes for direction R in state State-A
  6254. In State-A moving R
  6255. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6256. predict error 0
  6257. dir: dir isR
  6258. -/875: O: O1750 (predict-no)
  6259. I see 1 and I'm going to do: predict-no
  6260. ENV: Agent did: predict-no for direction R in state State-B
  6261. In State-B moving R
  6262. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6263. predict error 0
  6264. dir: dir isU
  6265. |\-876: O: O1752 (predict-no)
  6266. I see 1 and I'm going to do: predict-no
  6267. ENV: Agent did: predict-no for direction U in state State-B
  6268. In State-B moving U
  6269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6270. predict error 0
  6271. dir: dir isL
  6272. /|877: O: O1753 (predict-yes)
  6273. I see 1 and I'm going to do: predict-yes
  6274. ENV: Agent did: predict-yes for direction L in state State-B
  6275. In State-B moving L
  6276. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6277. predict error 0
  6278. dir: dir isL
  6279. \-/878: O: O1756 (predict-no)
  6280. I see 1 and I'm going to do: predict-no
  6281. ENV: Agent did: predict-no for direction L in state State-A
  6282. In State-A moving L
  6283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6284. predict error 0
  6285. dir: dir isR
  6286. |\-879: O: O1757 (predict-yes)
  6287. I see 1 and I'm going to do: predict-yes
  6288. ENV: Agent did: predict-yes for direction R in state State-A
  6289. In State-A moving R
  6290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6291. predict error 0
  6292. dir: dir isL
  6293. /|\880: O: O1759 (predict-yes)
  6294. I see 1 and I'm going to do: predict-yes
  6295. ENV: Agent did: predict-yes for direction L in state State-B
  6296. In State-B moving L
  6297. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6298. predict error 0
  6299. dir: dir isL
  6300. -/|881: O: O1762 (predict-no)
  6301. I see 1 and I'm going to do: predict-no
  6302. ENV: Agent did: predict-no for direction L in state State-A
  6303. In State-A moving L
  6304. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6305. predict error 0
  6306. dir: dir isL
  6307. \882: O: O1764 (predict-no)
  6308. I see 1 and I'm going to do: predict-no
  6309. ENV: Agent did: predict-no for direction L in state State-A
  6310. In State-A moving L
  6311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6312. predict error 0
  6313. dir: dir isR
  6314. -/|883: O: O1765 (predict-yes)
  6315. I see 1 and I'm going to do: predict-yes
  6316. ENV: Agent did: predict-yes for direction R in state State-A
  6317. In State-A moving R
  6318. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6319. predict error 0
  6320. dir: dir isL
  6321. \-884: O: O1767 (predict-yes)
  6322. I see 1 and I'm going to do: predict-yes
  6323. ENV: Agent did: predict-yes for direction L in state State-B
  6324. In State-B moving L
  6325. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6326. predict error 0
  6327. dir: dir isL
  6328. /|\885: O: O1770 (predict-no)
  6329. I see 1 and I'm going to do: predict-no
  6330. ENV: Agent did: predict-no for direction L in state State-A
  6331. In State-A moving L
  6332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6333. predict error 0
  6334. dir: dir isU
  6335. -/886: O: O1772 (predict-no)
  6336. I see 1 and I'm going to do: predict-no
  6337. ENV: Agent did: predict-no for direction U in state State-A
  6338. In State-A moving U
  6339. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6340. predict error 0
  6341. dir: dir isR
  6342. |\-887: O: O1773 (predict-yes)
  6343. I see 1 and I'm going to do: predict-yes
  6344. ENV: Agent did: predict-yes for direction R in state State-A
  6345. In State-A moving R
  6346. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6347. predict error 0
  6348. dir: dir isU
  6349. /|888: O: O1776 (predict-no)
  6350. I see 1 and I'm going to do: predict-no
  6351. ENV: Agent did: predict-no for direction U in state State-B
  6352. In State-B moving U
  6353. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6354. predict error 0
  6355. dir: dir isL
  6356. \-/889: O: O1777 (predict-yes)
  6357. I see 1 and I'm going to do: predict-yes
  6358. ENV: Agent did: predict-yes for direction L in state State-B
  6359. In State-B moving L
  6360. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6361. predict error 0
  6362. dir: dir isR
  6363. |\890: O: O1779 (predict-yes)
  6364. I see 1 and I'm going to do: predict-yes
  6365. ENV: Agent did: predict-yes for direction R in state State-A
  6366. In State-A moving R
  6367. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6368. predict error 0
  6369. dir: dir isR
  6370. -/891: O: O1782 (predict-no)
  6371. I see 1 and I'm going to do: predict-no
  6372. ENV: Agent did: predict-no for direction R in state State-B
  6373. In State-B moving R
  6374. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6375. predict error 0
  6376. dir: dir isL
  6377. |892: O: O1783 (predict-yes)
  6378. I see 1 and I'm going to do: predict-yes
  6379. ENV: Agent did: predict-yes for direction L in state State-B
  6380. In State-B moving L
  6381. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6382. predict error 0
  6383. dir: dir isL
  6384. \-/893: O: O1786 (predict-no)
  6385. I see 1 and I'm going to do: predict-no
  6386. ENV: Agent did: predict-no for direction L in state State-A
  6387. In State-A moving L
  6388. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6389. predict error 0
  6390. dir: dir isU
  6391. |\-894: O: O1788 (predict-no)
  6392. I see 1 and I'm going to do: predict-no
  6393. ENV: Agent did: predict-no for direction U in state State-A
  6394. In State-A moving U
  6395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6396. predict error 0
  6397. dir: dir isU
  6398. /|\895: O: O1790 (predict-no)
  6399. I see 1 and I'm going to do: predict-no
  6400. ENV: Agent did: predict-no for direction U in state State-A
  6401. In State-A moving U
  6402. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6403. predict error 0
  6404. dir: dir isR
  6405. -/896: O: O1791 (predict-yes)
  6406. I see 1 and I'm going to do: predict-yes
  6407. ENV: Agent did: predict-yes for direction R in state State-A
  6408. In State-A moving R
  6409. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6410. predict error 0
  6411. dir: dir isR
  6412. |\-/897: O: O1794 (predict-no)
  6413. I see 1 and I'm going to do: predict-no
  6414. ENV: Agent did: predict-no for direction R in state State-B
  6415. In State-B moving R
  6416. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6417. predict error 0
  6418. dir: dir isL
  6419. |\898: O: O1795 (predict-yes)
  6420. I see 1 and I'm going to do: predict-yes
  6421. ENV: Agent did: predict-yes for direction L in state State-B
  6422. In State-B moving L
  6423. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6424. predict error 0
  6425. dir: dir isR
  6426. -/|899: O: O1797 (predict-yes)
  6427. I see 1 and I'm going to do: predict-yes
  6428. ENV: Agent did: predict-yes for direction R in state State-A
  6429. In State-A moving R
  6430. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6431. predict error 0
  6432. dir: dir isU
  6433. \-/900: O: O1800 (predict-no)
  6434. I see 1 and I'm going to do: predict-no
  6435. ENV: Agent did: predict-no for direction U in state State-B
  6436. In State-B moving U
  6437. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6438. predict error 0
  6439. dir: dir isU
  6440. |\901: O: O1802 (predict-no)
  6441. I see 1 and I'm going to do: predict-no
  6442. ENV: Agent did: predict-no for direction U in state State-B
  6443. In State-B moving U
  6444. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6445. predict error 0
  6446. dir: dir isR
  6447. -902: O: O1804 (predict-no)
  6448. I see 1 and I'm going to do: predict-no
  6449. ENV: Agent did: predict-no for direction R in state State-B
  6450. In State-B moving R
  6451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6452. predict error 0
  6453. dir: dir isL
  6454. /|\903: O: O1805 (predict-yes)
  6455. I see 1 and I'm going to do: predict-yes
  6456. ENV: Agent did: predict-yes for direction L in state State-B
  6457. In State-B moving L
  6458. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6459. predict error 0
  6460. dir: dir isU
  6461. -/|\904: O: O1808 (predict-no)
  6462. I see 1 and I'm going to do: predict-no
  6463. ENV: Agent did: predict-no for direction U in state State-A
  6464. In State-A moving U
  6465. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6466. predict error 0
  6467. dir: dir isR
  6468. -/|905: O: O1809 (predict-yes)
  6469. I see 1 and I'm going to do: predict-yes
  6470. ENV: Agent did: predict-yes for direction R in state State-A
  6471. In State-A moving R
  6472. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6473. predict error 0
  6474. dir: dir isR
  6475. \-906: O: O1812 (predict-no)
  6476. I see 1 and I'm going to do: predict-no
  6477. ENV: Agent did: predict-no for direction R in state State-B
  6478. In State-B moving R
  6479. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6480. predict error 0
  6481. dir: dir isL
  6482. /|907: O: O1813 (predict-yes)
  6483. I see 1 and I'm going to do: predict-yes
  6484. ENV: Agent did: predict-yes for direction L in state State-B
  6485. In State-B moving L
  6486. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6487. predict error 0
  6488. dir: dir isL
  6489. \-/908: O: O1816 (predict-no)
  6490. I see 1 and I'm going to do: predict-no
  6491. ENV: Agent did: predict-no for direction L in state State-A
  6492. In State-A moving L
  6493. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6494. predict error 0
  6495. dir: dir isU
  6496. |\-909: O: O1818 (predict-no)
  6497. I see 1 and I'm going to do: predict-no
  6498. ENV: Agent did: predict-no for direction U in state State-A
  6499. In State-A moving U
  6500. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6501. predict error 0
  6502. dir: dir isR
  6503. /|\910: O: O1819 (predict-yes)
  6504. I see 1 and I'm going to do: predict-yes
  6505. ENV: Agent did: predict-yes for direction R in state State-A
  6506. In State-A moving R
  6507. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6508. predict error 0
  6509. dir: dir isU
  6510. -/911: O: O1822 (predict-no)
  6511. I see 1 and I'm going to do: predict-no
  6512. ENV: Agent did: predict-no for direction U in state State-B
  6513. In State-B moving U
  6514. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6515. predict error 0
  6516. dir: dir isL
  6517. |912: O: O1823 (predict-yes)
  6518. I see 1 and I'm going to do: predict-yes
  6519. ENV: Agent did: predict-yes for direction L in state State-B
  6520. In State-B moving L
  6521. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6522. predict error 0
  6523. dir: dir isL
  6524. \-/913: O: O1826 (predict-no)
  6525. I see 1 and I'm going to do: predict-no
  6526. ENV: Agent did: predict-no for direction L in state State-A
  6527. In State-A moving L
  6528. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6529. predict error 0
  6530. dir: dir isU
  6531. |\-914: O: O1828 (predict-no)
  6532. I see 1 and I'm going to do: predict-no
  6533. ENV: Agent did: predict-no for direction U in state State-A
  6534. In State-A moving U
  6535. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6536. predict error 0
  6537. dir: dir isU
  6538. /|\915: O: O1830 (predict-no)
  6539. I see 1 and I'm going to do: predict-no
  6540. ENV: Agent did: predict-no for direction U in state State-A
  6541. In State-A moving U
  6542. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6543. predict error 0
  6544. dir: dir isL
  6545. -/|916: O: O1832 (predict-no)
  6546. I see 1 and I'm going to do: predict-no
  6547. ENV: Agent did: predict-no for direction L in state State-A
  6548. In State-A moving L
  6549. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6550. predict error 0
  6551. dir: dir isL
  6552. \-/|917: O: O1834 (predict-no)
  6553. I see 1 and I'm going to do: predict-no
  6554. ENV: Agent did: predict-no for direction L in state State-A
  6555. In State-A moving L
  6556. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6557. predict error 0
  6558. dir: dir isU
  6559. \-918: O: O1836 (predict-no)
  6560. I see 1 and I'm going to do: predict-no
  6561. ENV: Agent did: predict-no for direction U in state State-A
  6562. In State-A moving U
  6563. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6564. predict error 0
  6565. dir: dir isL
  6566. /|\919: O: O1838 (predict-no)
  6567. I see 1 and I'm going to do: predict-no
  6568. ENV: Agent did: predict-no for direction L in state State-A
  6569. In State-A moving L
  6570. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6571. predict error 0
  6572. dir: dir isU
  6573. -/|920: O: O1840 (predict-no)
  6574. I see 1 and I'm going to do: predict-no
  6575. ENV: Agent did: predict-no for direction U in state State-A
  6576. In State-A moving U
  6577. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6578. predict error 0
  6579. dir: dir isU
  6580. \-921: O: O1842 (predict-no)
  6581. I see 1 and I'm going to do: predict-no
  6582. ENV: Agent did: predict-no for direction U in state State-A
  6583. In State-A moving U
  6584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6585. predict error 0
  6586. dir: dir isL
  6587. /922: O: O1844 (predict-no)
  6588. I see 1 and I'm going to do: predict-no
  6589. ENV: Agent did: predict-no for direction L in state State-A
  6590. In State-A moving L
  6591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6592. predict error 0
  6593. dir: dir isL
  6594. |\-923: O: O1846 (predict-no)
  6595. I see 1 and I'm going to do: predict-no
  6596. ENV: Agent did: predict-no for direction L in state State-A
  6597. In State-A moving L
  6598. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6599. predict error 0
  6600. dir: dir isU
  6601. /|\924: O: O1848 (predict-no)
  6602. I see 1 and I'm going to do: predict-no
  6603. ENV: Agent did: predict-no for direction U in state State-A
  6604. In State-A moving U
  6605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6606. predict error 0
  6607. dir: dir isR
  6608. -/925: O: O1849 (predict-yes)
  6609. I see 1 and I'm going to do: predict-yes
  6610. ENV: Agent did: predict-yes for direction R in state State-A
  6611. In State-A moving R
  6612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6613. predict error 0
  6614. dir: dir isR
  6615. |\926: O: O1852 (predict-no)
  6616. I see 1 and I'm going to do: predict-no
  6617. ENV: Agent did: predict-no for direction R in state State-B
  6618. In State-B moving R
  6619. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6620. predict error 0
  6621. dir: dir isR
  6622. -/|927: O: O1854 (predict-no)
  6623. I see 1 and I'm going to do: predict-no
  6624. ENV: Agent did: predict-no for direction R in state State-B
  6625. In State-B moving R
  6626. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6627. predict error 0
  6628. dir: dir isL
  6629. \-928: O: O1855 (predict-yes)
  6630. I see 1 and I'm going to do: predict-yes
  6631. ENV: Agent did: predict-yes for direction L in state State-B
  6632. In State-B moving L
  6633. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6634. predict error 0
  6635. dir: dir isR
  6636. /|\929: O: O1857 (predict-yes)
  6637. I see 1 and I'm going to do: predict-yes
  6638. ENV: Agent did: predict-yes for direction R in state State-A
  6639. In State-A moving R
  6640. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6641. predict error 0
  6642. dir: dir isL
  6643. -/|930: O: O1859 (predict-yes)
  6644. I see 1 and I'm going to do: predict-yes
  6645. ENV: Agent did: predict-yes for direction L in state State-B
  6646. In State-B moving L
  6647. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6648. predict error 0
  6649. dir: dir isU
  6650. \-931: O: O1862 (predict-no)
  6651. I see 1 and I'm going to do: predict-no
  6652. ENV: Agent did: predict-no for direction U in state State-A
  6653. In State-A moving U
  6654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6655. predict error 0
  6656. dir: dir isU
  6657. /932: O: O1864 (predict-no)
  6658. I see 1 and I'm going to do: predict-no
  6659. ENV: Agent did: predict-no for direction U in state State-A
  6660. In State-A moving U
  6661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6662. predict error 0
  6663. dir: dir isL
  6664. |\-933: O: O1866 (predict-no)
  6665. I see 1 and I'm going to do: predict-no
  6666. ENV: Agent did: predict-no for direction L in state State-A
  6667. In State-A moving L
  6668. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6669. predict error 0
  6670. dir: dir isU
  6671. /|\-934: O: O1868 (predict-no)
  6672. I see 1 and I'm going to do: predict-no
  6673. ENV: Agent did: predict-no for direction U in state State-A
  6674. In State-A moving U
  6675. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6676. predict error 0
  6677. dir: dir isL
  6678. /|\935: O: O1870 (predict-no)
  6679. I see 1 and I'm going to do: predict-no
  6680. ENV: Agent did: predict-no for direction L in state State-A
  6681. In State-A moving L
  6682. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6683. predict error 0
  6684. dir: dir isL
  6685. -/|936: O: O1872 (predict-no)
  6686. I see 1 and I'm going to do: predict-no
  6687. ENV: Agent did: predict-no for direction L in state State-A
  6688. In State-A moving L
  6689. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6690. predict error 0
  6691. dir: dir isL
  6692. \-/937: O: O1874 (predict-no)
  6693. I see 1 and I'm going to do: predict-no
  6694. ENV: Agent did: predict-no for direction L in state State-A
  6695. In State-A moving L
  6696. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6697. predict error 0
  6698. dir: dir isL
  6699. |\-938: O: O1876 (predict-no)
  6700. I see 1 and I'm going to do: predict-no
  6701. ENV: Agent did: predict-no for direction L in state State-A
  6702. In State-A moving L
  6703. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6704. predict error 0
  6705. dir: dir isL
  6706. /|\939: O: O1878 (predict-no)
  6707. I see 1 and I'm going to do: predict-no
  6708. ENV: Agent did: predict-no for direction L in state State-A
  6709. In State-A moving L
  6710. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6711. predict error 0
  6712. dir: dir isR
  6713. -/|940: O: O1879 (predict-yes)
  6714. I see 1 and I'm going to do: predict-yes
  6715. ENV: Agent did: predict-yes for direction R in state State-A
  6716. In State-A moving R
  6717. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6718. predict error 0
  6719. dir: dir isU
  6720. \-/|941: O: O1882 (predict-no)
  6721. I see 1 and I'm going to do: predict-no
  6722. ENV: Agent did: predict-no for direction U in state State-B
  6723. In State-B moving U
  6724. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6725. predict error 0
  6726. dir: dir isR
  6727. \942: O: O1884 (predict-no)
  6728. I see 1 and I'm going to do: predict-no
  6729. ENV: Agent did: predict-no for direction R in state State-B
  6730. In State-B moving R
  6731. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6732. predict error 0
  6733. dir: dir isL
  6734. -/|943: O: O1885 (predict-yes)
  6735. I see 1 and I'm going to do: predict-yes
  6736. ENV: Agent did: predict-yes for direction L in state State-B
  6737. In State-B moving L
  6738. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6739. predict error 0
  6740. dir: dir isR
  6741. \-/944: O: O1887 (predict-yes)
  6742. I see 1 and I'm going to do: predict-yes
  6743. ENV: Agent did: predict-yes for direction R in state State-A
  6744. In State-A moving R
  6745. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6746. predict error 0
  6747. dir: dir isR
  6748. |\945: O: O1890 (predict-no)
  6749. I see 1 and I'm going to do: predict-no
  6750. ENV: Agent did: predict-no for direction R in state State-B
  6751. In State-B moving R
  6752. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6753. predict error 0
  6754. dir: dir isL
  6755. -/|946: O: O1891 (predict-yes)
  6756. I see 1 and I'm going to do: predict-yes
  6757. ENV: Agent did: predict-yes for direction L in state State-B
  6758. In State-B moving L
  6759. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6760. predict error 0
  6761. dir: dir isU
  6762. \-/947: O: O1894 (predict-no)
  6763. I see 1 and I'm going to do: predict-no
  6764. ENV: Agent did: predict-no for direction U in state State-A
  6765. In State-A moving U
  6766. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6767. predict error 0
  6768. dir: dir isR
  6769. |\-948: O: O1895 (predict-yes)
  6770. I see 1 and I'm going to do: predict-yes
  6771. ENV: Agent did: predict-yes for direction R in state State-A
  6772. In State-A moving R
  6773. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6774. predict error 0
  6775. dir: dir isU
  6776. /|949: O: O1898 (predict-no)
  6777. I see 1 and I'm going to do: predict-no
  6778. ENV: Agent did: predict-no for direction U in state State-B
  6779. In State-B moving U
  6780. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6781. predict error 0
  6782. dir: dir isU
  6783. \-/950: O: O1900 (predict-no)
  6784. I see 1 and I'm going to do: predict-no
  6785. ENV: Agent did: predict-no for direction U in state State-B
  6786. In State-B moving U
  6787. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6788. predict error 0
  6789. dir: dir isR
  6790. |\-/|\-/|\--- Input Phase ---
  6791. =>WM: (13382: I2 ^dir R)
  6792. =>WM: (13381: I2 ^reward 1)
  6793. =>WM: (13380: I2 ^see 0)
  6794. =>WM: (13379: N950 ^status complete)
  6795. <=WM: (13368: I2 ^dir U)
  6796. <=WM: (13367: I2 ^reward 1)
  6797. <=WM: (13366: I2 ^see 0)
  6798. =>WM: (13383: I2 ^level-1 R1-root)
  6799. <=WM: (13369: I2 ^level-1 R1-root)
  6800. --- END Input Phase ---
  6801. --- Proposal Phase ---
  6802. --- Inner Elaboration Phase, active level 1 (S1) ---
  6803. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6804. -->
  6805. (S1 ^operator O1899 = -0.1070236389116304)
  6806. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6807. -->
  6808. (S1 ^operator O1900 = 0.66025212945601)
  6809. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6810. -->
  6811. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6812. -->
  6813. Firing elaborate*copy-see-to-output-link
  6814. -->
  6815. (I3 ^see 0 +)
  6816. Firing elaborate*reward*based*on*reward
  6817. -->
  6818. (R954 ^value 1 +)
  6819. (R1 ^reward R954 +)
  6820. Firing propose*predict-yes
  6821. -->
  6822. (O1901 ^name predict-yes +)
  6823. (S1 ^operator O1901 +)
  6824. Firing propose*predict-no
  6825. -->
  6826. (O1902 ^name predict-no +)
  6827. (S1 ^operator O1902 +)
  6828. Firing rl*prefer*rvt*predict-no*H0*4
  6829. -->
  6830. (S1 ^operator O1900 = 0.3397665963572414)
  6831. Firing rl*prefer*rvt*predict-yes*H0*3
  6832. -->
  6833. (S1 ^operator O1899 = 0.3377110766337923)
  6834. Firing prefer*rvt*predict-yes*H0
  6835. -->
  6836. Firing prefer*rvt*predict-no*H0
  6837. -->
  6838. Firing elaborate*copy-dir-to-output-link
  6839. -->
  6840. (I3 ^dir R +)
  6841. inner elaboration loop at bottom goal.
  6842. Retracting elaborate*copy-see-to-output-link
  6843. -->
  6844. (I3 ^see 0 +)
  6845. Retracting propose*predict-no
  6846. -->
  6847. (O1900 ^name predict-no +)
  6848. (S1 ^operator O1900 +)
  6849. Retracting propose*predict-yes
  6850. -->
  6851. (O1899 ^name predict-yes +)
  6852. (S1 ^operator O1899 +)
  6853. Retracting elaborate*reward*based*on*reward
  6854. -->
  6855. (R953 ^value 1 +)
  6856. (R1 ^reward R953 +)
  6857. Retracting elaborate*copy-dir-to-output-link
  6858. -->
  6859. (I3 ^dir U +)
  6860. Retracting rl*prefer*rvt*predict-no*H0*2
  6861. -->
  6862. (S1 ^operator O1900 = 1.)
  6863. Retracting rl*prefer*rvt*predict-yes*H0*1
  6864. -->
  6865. (S1 ^operator O1899 = 0.)
  6866. =>WM: (13390: S1 ^operator O1902 +)
  6867. =>WM: (13389: S1 ^operator O1901 +)
  6868. =>WM: (13388: I3 ^dir R)
  6869. =>WM: (13387: O1902 ^name predict-no)
  6870. =>WM: (13386: O1901 ^name predict-yes)
  6871. =>WM: (13385: R954 ^value 1)
  6872. =>WM: (13384: R1 ^reward R954)
  6873. <=WM: (13375: S1 ^operator O1899 +)
  6874. <=WM: (13376: S1 ^operator O1900 +)
  6875. <=WM: (13377: S1 ^operator O1900)
  6876. <=WM: (13360: I3 ^dir U)
  6877. <=WM: (13371: R1 ^reward R953)
  6878. <=WM: (13374: O1900 ^name predict-no)
  6879. <=WM: (13373: O1899 ^name predict-yes)
  6880. <=WM: (13372: R953 ^value 1)
  6881. --- Inner Elaboration Phase, active level 1 (S1) ---
  6882. Firing prefer*rvt*predict-yes*H0
  6883. -->
  6884. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6885. -->
  6886. (S1 ^operator O1901 = -0.1070236389116304)
  6887. Firing rl*prefer*rvt*predict-yes*H0*3
  6888. -->
  6889. (S1 ^operator O1901 = 0.3377110766337923)
  6890. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6891. -->
  6892. Firing prefer*rvt*predict-no*H0
  6893. -->
  6894. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6895. -->
  6896. (S1 ^operator O1902 = 0.66025212945601)
  6897. Firing rl*prefer*rvt*predict-no*H0*4
  6898. -->
  6899. (S1 ^operator O1902 = 0.3397665963572414)
  6900. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6901. -->
  6902. inner elaboration loop at bottom goal.
  6903. Retracting rl*prefer*rvt*predict-no*H0*4
  6904. -->
  6905. (S1 ^operator O1900 = 0.3397665963572414)
  6906. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6907. -->
  6908. (S1 ^operator O1900 = 0.66025212945601)
  6909. Retracting rl*prefer*rvt*predict-yes*H0*3
  6910. -->
  6911. (S1 ^operator O1899 = 0.3377110766337923)
  6912. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6913. -->
  6914. (S1 ^operator O1899 = -0.1070236389116304)
  6915. --- END Proposal Phase ---
  6916. --- Decision Phase ---
  6917. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6918. =>WM: (13391: S1 ^operator O1902)
  6919. 951: O: O1902 (predict-no)
  6920. --- END Decision Phase ---
  6921. --- Application Phase ---
  6922. --- Firing Productions (PE) For State At Depth 1 ---
  6923. --- Inner Elaboration Phase, active level 1 (S1) ---
  6924. Firing apply*operator
  6925. -->
  6926. (I3 ^predict-no N951 + :O )
  6927. Firing apply*operator*complete
  6928. -->
  6929. (I3 ^predict-no N950 - :O )
  6930. inner elaboration loop at bottom goal.
  6931. --- Change Working Memory (PE) ---
  6932. =>WM: (13392: I3 ^predict-no N951)
  6933. <=WM: (13379: N950 ^status complete)
  6934. <=WM: (13378: I3 ^predict-no N950)
  6935. --- Firing Productions (IE) For State At Depth 1 ---
  6936. --- Inner Elaboration Phase, active level 1 (S1) ---
  6937. Firing monitor*world
  6938. -->
  6939. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6940. --- Change Working Memory (IE) ---
  6941. --- END Application Phase ---
  6942. --- Output Phase ---
  6943. ENV: Agent did: predict-no for direction R in state State-B
  6944. In State-B moving R
  6945. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6946. predict error 0
  6947. dir: dir isL
  6948. --- END Output Phase ---
  6949. ---- Input Phase ---
  6950. =>WM: (13396: I2 ^dir L)
  6951. =>WM: (13395: I2 ^reward 1)
  6952. =>WM: (13394: I2 ^see 0)
  6953. =>WM: (13393: N951 ^status complete)
  6954. <=WM: (13382: I2 ^dir R)
  6955. <=WM: (13381: I2 ^reward 1)
  6956. <=WM: (13380: I2 ^see 0)
  6957. =>WM: (13397: I2 ^level-1 R0-root)
  6958. <=WM: (13383: I2 ^level-1 R1-root)
  6959. --- END Input Phase ---
  6960. --- Proposal Phase ---
  6961. --- Inner Elaboration Phase, active level 1 (S1) ---
  6962. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  6963. -->
  6964. (S1 ^operator O1901 = 0.735786774178754)
  6965. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  6966. -->
  6967. Firing elaborate*copy-see-to-output-link
  6968. -->
  6969. (I3 ^see 0 +)
  6970. Firing elaborate*reward*based*on*reward
  6971. -->
  6972. (R955 ^value 1 +)
  6973. (R1 ^reward R955 +)
  6974. Firing propose*predict-yes
  6975. -->
  6976. (O1903 ^name predict-yes +)
  6977. (S1 ^operator O1903 +)
  6978. Firing propose*predict-no
  6979. -->
  6980. (O1904 ^name predict-no +)
  6981. (S1 ^operator O1904 +)
  6982. Firing rl*prefer*rvt*predict-no*H0*6
  6983. -->
  6984. (S1 ^operator O1902 = 0.9996367744406318)
  6985. Firing rl*prefer*rvt*predict-yes*H0*5
  6986. -->
  6987. (S1 ^operator O1901 = 0.2640533371018167)
  6988. Firing prefer*rvt*predict-yes*H0
  6989. -->
  6990. Firing prefer*rvt*predict-no*H0
  6991. -->
  6992. Firing elaborate*copy-dir-to-output-link
  6993. -->
  6994. (I3 ^dir L +)
  6995. inner elaboration loop at bottom goal.
  6996. Retracting elaborate*copy-see-to-output-link
  6997. -->
  6998. (I3 ^see 0 +)
  6999. Retracting propose*predict-no
  7000. -->
  7001. (O1902 ^name predict-no +)
  7002. (S1 ^operator O1902 +)
  7003. Retracting propose*predict-yes
  7004. -->
  7005. (O1901 ^name predict-yes +)
  7006. (S1 ^operator O1901 +)
  7007. Retracting elaborate*reward*based*on*reward
  7008. -->
  7009. (R954 ^value 1 +)
  7010. (R1 ^reward R954 +)
  7011. Retracting elaborate*copy-dir-to-output-link
  7012. -->
  7013. (I3 ^dir R +)
  7014. Retracting rl*prefer*rvt*predict-no*H0*4
  7015. -->
  7016. (S1 ^operator O1902 = 0.3397665963572414)
  7017. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7018. -->
  7019. (S1 ^operator O1902 = 0.66025212945601)
  7020. Retracting rl*prefer*rvt*predict-yes*H0*3
  7021. -->
  7022. (S1 ^operator O1901 = 0.3377110766337923)
  7023. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7024. -->
  7025. (S1 ^operator O1901 = -0.1070236389116304)
  7026. =>WM: (13404: S1 ^operator O1904 +)
  7027. =>WM: (13403: S1 ^operator O1903 +)
  7028. =>WM: (13402: I3 ^dir L)
  7029. =>WM: (13401: O1904 ^name predict-no)
  7030. =>WM: (13400: O1903 ^name predict-yes)
  7031. =>WM: (13399: R955 ^value 1)
  7032. =>WM: (13398: R1 ^reward R955)
  7033. <=WM: (13389: S1 ^operator O1901 +)
  7034. <=WM: (13390: S1 ^operator O1902 +)
  7035. <=WM: (13391: S1 ^operator O1902)
  7036. <=WM: (13388: I3 ^dir R)
  7037. <=WM: (13384: R1 ^reward R954)
  7038. <=WM: (13387: O1902 ^name predict-no)
  7039. <=WM: (13386: O1901 ^name predict-yes)
  7040. <=WM: (13385: R954 ^value 1)
  7041. --- Inner Elaboration Phase, active level 1 (S1) ---
  7042. Firing prefer*rvt*predict-yes*H0
  7043. -->
  7044. Firing rl*prefer*rvt*predict-yes*H0*5
  7045. -->
  7046. (S1 ^operator O1903 = 0.2640533371018167)
  7047. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7048. -->
  7049. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7050. -->
  7051. (S1 ^operator O1903 = 0.735786774178754)
  7052. Firing prefer*rvt*predict-no*H0
  7053. -->
  7054. Firing rl*prefer*rvt*predict-no*H0*6
  7055. -->
  7056. (S1 ^operator O1904 = 0.9996367744406318)
  7057. inner elaboration loop at bottom goal.
  7058. Retracting rl*prefer*rvt*predict-no*H0*6
  7059. -->
  7060. (S1 ^operator O1902 = 0.9996367744406318)
  7061. Retracting rl*prefer*rvt*predict-yes*H0*5
  7062. -->
  7063. (S1 ^operator O1901 = 0.2640533371018167)
  7064. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7065. -->
  7066. (S1 ^operator O1901 = 0.735786774178754)
  7067. --- END Proposal Phase ---
  7068. --- Decision Phase ---
  7069. RL update rl*prefer*rvt*predict-no*H0*4 0.57025 -0.230483 0.339767 -> 0.570248 -0.230483 0.339765(R,m,v=1,0.87037,0.113527)
  7070. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.42977 0.230482 0.660252 -> 0.429768 0.230482 0.66025(R,m,v=1,1,0)
  7071. =>WM: (13405: S1 ^operator O1903)
  7072. 952: O: O1903 (predict-yes)
  7073. --- END Decision Phase ---
  7074. --- Application Phase ---
  7075. --- Firing Productions (PE) For State At Depth 1 ---
  7076. --- Inner Elaboration Phase, active level 1 (S1) ---
  7077. Firing apply*operator
  7078. -->
  7079. (I3 ^predict-yes N952 + :O )
  7080. Firing apply*operator*complete
  7081. -->
  7082. (I3 ^predict-no N951 - :O )
  7083. inner elaboration loop at bottom goal.
  7084. --- Change Working Memory (PE) ---
  7085. =>WM: (13406: I3 ^predict-yes N952)
  7086. <=WM: (13393: N951 ^status complete)
  7087. <=WM: (13392: I3 ^predict-no N951)
  7088. --- Firing Productions (IE) For State At Depth 1 ---
  7089. --- Inner Elaboration Phase, active level 1 (S1) ---
  7090. Firing monitor*world
  7091. -->
  7092. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7093. --- Change Working Memory (IE) ---
  7094. --- END Application Phase ---
  7095. --- Output Phase ---
  7096. ENV: Agent did: predict-yes for direction L in state State-B
  7097. In State-B moving L
  7098. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7099. predict error 0
  7100. dir: dir isU
  7101. --- END Output Phase ---
  7102. /|--- Input Phase ---
  7103. =>WM: (13410: I2 ^dir U)
  7104. =>WM: (13409: I2 ^reward 1)
  7105. =>WM: (13408: I2 ^see 1)
  7106. =>WM: (13407: N952 ^status complete)
  7107. <=WM: (13396: I2 ^dir L)
  7108. <=WM: (13395: I2 ^reward 1)
  7109. <=WM: (13394: I2 ^see 0)
  7110. =>WM: (13411: I2 ^level-1 L1-root)
  7111. <=WM: (13397: I2 ^level-1 R0-root)
  7112. --- END Input Phase ---
  7113. --- Proposal Phase ---
  7114. --- Inner Elaboration Phase, active level 1 (S1) ---
  7115. Firing elaborate*copy-see-to-output-link
  7116. -->
  7117. (I3 ^see 1 +)
  7118. Firing elaborate*reward*based*on*reward
  7119. -->
  7120. (R956 ^value 1 +)
  7121. (R1 ^reward R956 +)
  7122. Firing propose*predict-yes
  7123. -->
  7124. (O1905 ^name predict-yes +)
  7125. (S1 ^operator O1905 +)
  7126. Firing propose*predict-no
  7127. -->
  7128. (O1906 ^name predict-no +)
  7129. (S1 ^operator O1906 +)
  7130. Firing rl*prefer*rvt*predict-no*H0*2
  7131. -->
  7132. (S1 ^operator O1904 = 1.)
  7133. Firing rl*prefer*rvt*predict-yes*H0*1
  7134. -->
  7135. (S1 ^operator O1903 = 0.)
  7136. Firing prefer*rvt*predict-yes*H0
  7137. -->
  7138. Firing prefer*rvt*predict-no*H0
  7139. -->
  7140. Firing elaborate*copy-dir-to-output-link
  7141. -->
  7142. (I3 ^dir U +)
  7143. inner elaboration loop at bottom goal.
  7144. Retracting elaborate*copy-see-to-output-link
  7145. -->
  7146. (I3 ^see 0 +)
  7147. Retracting propose*predict-no
  7148. -->
  7149. (O1904 ^name predict-no +)
  7150. (S1 ^operator O1904 +)
  7151. Retracting propose*predict-yes
  7152. -->
  7153. (O1903 ^name predict-yes +)
  7154. (S1 ^operator O1903 +)
  7155. Retracting elaborate*reward*based*on*reward
  7156. -->
  7157. (R955 ^value 1 +)
  7158. (R1 ^reward R955 +)
  7159. Retracting elaborate*copy-dir-to-output-link
  7160. -->
  7161. (I3 ^dir L +)
  7162. Retracting rl*prefer*rvt*predict-no*H0*6
  7163. -->
  7164. (S1 ^operator O1904 = 0.9996367744406318)
  7165. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7166. -->
  7167. (S1 ^operator O1903 = 0.735786774178754)
  7168. Retracting rl*prefer*rvt*predict-yes*H0*5
  7169. -->
  7170. (S1 ^operator O1903 = 0.2640533371018167)
  7171. =>WM: (13419: S1 ^operator O1906 +)
  7172. =>WM: (13418: S1 ^operator O1905 +)
  7173. =>WM: (13417: I3 ^dir U)
  7174. =>WM: (13416: O1906 ^name predict-no)
  7175. =>WM: (13415: O1905 ^name predict-yes)
  7176. =>WM: (13414: R956 ^value 1)
  7177. =>WM: (13413: R1 ^reward R956)
  7178. =>WM: (13412: I3 ^see 1)
  7179. <=WM: (13403: S1 ^operator O1903 +)
  7180. <=WM: (13405: S1 ^operator O1903)
  7181. <=WM: (13404: S1 ^operator O1904 +)
  7182. <=WM: (13402: I3 ^dir L)
  7183. <=WM: (13398: R1 ^reward R955)
  7184. <=WM: (13370: I3 ^see 0)
  7185. <=WM: (13401: O1904 ^name predict-no)
  7186. <=WM: (13400: O1903 ^name predict-yes)
  7187. <=WM: (13399: R955 ^value 1)
  7188. --- Inner Elaboration Phase, active level 1 (S1) ---
  7189. Firing prefer*rvt*predict-yes*H0
  7190. -->
  7191. Firing rl*prefer*rvt*predict-yes*H0*1
  7192. -->
  7193. (S1 ^operator O1905 = 0.)
  7194. Firing prefer*rvt*predict-no*H0
  7195. -->
  7196. Firing rl*prefer*rvt*predict-no*H0*2
  7197. -->
  7198. (S1 ^operator O1906 = 1.)
  7199. inner elaboration loop at bottom goal.
  7200. Retracting rl*prefer*rvt*predict-no*H0*2
  7201. -->
  7202. (S1 ^operator O1904 = 1.)
  7203. Retracting rl*prefer*rvt*predict-yes*H0*1
  7204. -->
  7205. (S1 ^operator O1903 = 0.)
  7206. --- END Proposal Phase ---
  7207. --- Decision Phase ---
  7208. RL update rl*prefer*rvt*predict-yes*H0*5 0.554438 -0.290385 0.264053 -> 0.554451 -0.290385 0.264066(R,m,v=1,0.872093,0.112199)
  7209. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445404 0.290382 0.735787 -> 0.44542 0.290383 0.735802(R,m,v=1,1,0)
  7210. =>WM: (13420: S1 ^operator O1906)
  7211. 953: O: O1906 (predict-no)
  7212. --- END Decision Phase ---
  7213. --- Application Phase ---
  7214. --- Firing Productions (PE) For State At Depth 1 ---
  7215. --- Inner Elaboration Phase, active level 1 (S1) ---
  7216. Firing apply*operator
  7217. -->
  7218. (I3 ^predict-no N953 + :O )
  7219. Firing apply*operator*complete
  7220. -->
  7221. (I3 ^predict-yes N952 - :O )
  7222. inner elaboration loop at bottom goal.
  7223. --- Change Working Memory (PE) ---
  7224. =>WM: (13421: I3 ^predict-no N953)
  7225. <=WM: (13407: N952 ^status complete)
  7226. <=WM: (13406: I3 ^predict-yes N952)
  7227. --- Firing Productions (IE) For State At Depth 1 ---
  7228. --- Inner Elaboration Phase, active level 1 (S1) ---
  7229. Firing monitor*world
  7230. -->
  7231. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7232. --- Change Working Memory (IE) ---
  7233. --- END Application Phase ---
  7234. --- Output Phase ---
  7235. ENV: Agent did: predict-no for direction U in state State-A
  7236. In State-A moving U
  7237. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7238. predict error 0
  7239. dir: dir isR
  7240. --- END Output Phase ---
  7241. \-/--- Input Phase ---
  7242. =>WM: (13425: I2 ^dir R)
  7243. =>WM: (13424: I2 ^reward 1)
  7244. =>WM: (13423: I2 ^see 0)
  7245. =>WM: (13422: N953 ^status complete)
  7246. <=WM: (13410: I2 ^dir U)
  7247. <=WM: (13409: I2 ^reward 1)
  7248. <=WM: (13408: I2 ^see 1)
  7249. =>WM: (13426: I2 ^level-1 L1-root)
  7250. <=WM: (13411: I2 ^level-1 L1-root)
  7251. --- END Input Phase ---
  7252. --- Proposal Phase ---
  7253. --- Inner Elaboration Phase, active level 1 (S1) ---
  7254. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7255. -->
  7256. (S1 ^operator O1906 = -0.2714224023553999)
  7257. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7258. -->
  7259. (S1 ^operator O1905 = 0.6621942993402632)
  7260. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7261. -->
  7262. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7263. -->
  7264. Firing elaborate*copy-see-to-output-link
  7265. -->
  7266. (I3 ^see 0 +)
  7267. Firing elaborate*reward*based*on*reward
  7268. -->
  7269. (R957 ^value 1 +)
  7270. (R1 ^reward R957 +)
  7271. Firing propose*predict-yes
  7272. -->
  7273. (O1907 ^name predict-yes +)
  7274. (S1 ^operator O1907 +)
  7275. Firing propose*predict-no
  7276. -->
  7277. (O1908 ^name predict-no +)
  7278. (S1 ^operator O1908 +)
  7279. Firing rl*prefer*rvt*predict-no*H0*4
  7280. -->
  7281. (S1 ^operator O1906 = 0.3397650583271044)
  7282. Firing rl*prefer*rvt*predict-yes*H0*3
  7283. -->
  7284. (S1 ^operator O1905 = 0.3377110766337923)
  7285. Firing prefer*rvt*predict-yes*H0
  7286. -->
  7287. Firing prefer*rvt*predict-no*H0
  7288. -->
  7289. Firing elaborate*copy-dir-to-output-link
  7290. -->
  7291. (I3 ^dir R +)
  7292. inner elaboration loop at bottom goal.
  7293. Retracting elaborate*copy-see-to-output-link
  7294. -->
  7295. (I3 ^see 1 +)
  7296. Retracting propose*predict-no
  7297. -->
  7298. (O1906 ^name predict-no +)
  7299. (S1 ^operator O1906 +)
  7300. Retracting propose*predict-yes
  7301. -->
  7302. (O1905 ^name predict-yes +)
  7303. (S1 ^operator O1905 +)
  7304. Retracting elaborate*reward*based*on*reward
  7305. -->
  7306. (R956 ^value 1 +)
  7307. (R1 ^reward R956 +)
  7308. Retracting elaborate*copy-dir-to-output-link
  7309. -->
  7310. (I3 ^dir U +)
  7311. Retracting rl*prefer*rvt*predict-no*H0*2
  7312. -->
  7313. (S1 ^operator O1906 = 1.)
  7314. Retracting rl*prefer*rvt*predict-yes*H0*1
  7315. -->
  7316. (S1 ^operator O1905 = 0.)
  7317. =>WM: (13434: S1 ^operator O1908 +)
  7318. =>WM: (13433: S1 ^operator O1907 +)
  7319. =>WM: (13432: I3 ^dir R)
  7320. =>WM: (13431: O1908 ^name predict-no)
  7321. =>WM: (13430: O1907 ^name predict-yes)
  7322. =>WM: (13429: R957 ^value 1)
  7323. =>WM: (13428: R1 ^reward R957)
  7324. =>WM: (13427: I3 ^see 0)
  7325. <=WM: (13418: S1 ^operator O1905 +)
  7326. <=WM: (13419: S1 ^operator O1906 +)
  7327. <=WM: (13420: S1 ^operator O1906)
  7328. <=WM: (13417: I3 ^dir U)
  7329. <=WM: (13413: R1 ^reward R956)
  7330. <=WM: (13412: I3 ^see 1)
  7331. <=WM: (13416: O1906 ^name predict-no)
  7332. <=WM: (13415: O1905 ^name predict-yes)
  7333. <=WM: (13414: R956 ^value 1)
  7334. --- Inner Elaboration Phase, active level 1 (S1) ---
  7335. Firing prefer*rvt*predict-yes*H0
  7336. -->
  7337. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7338. -->
  7339. (S1 ^operator O1907 = 0.6621942993402632)
  7340. Firing rl*prefer*rvt*predict-yes*H0*3
  7341. -->
  7342. (S1 ^operator O1907 = 0.3377110766337923)
  7343. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7344. -->
  7345. Firing prefer*rvt*predict-no*H0
  7346. -->
  7347. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7348. -->
  7349. (S1 ^operator O1908 = -0.2714224023553999)
  7350. Firing rl*prefer*rvt*predict-no*H0*4
  7351. -->
  7352. (S1 ^operator O1908 = 0.3397650583271044)
  7353. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7354. -->
  7355. inner elaboration loop at bottom goal.
  7356. Retracting rl*prefer*rvt*predict-no*H0*4
  7357. -->
  7358. (S1 ^operator O1906 = 0.3397650583271044)
  7359. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7360. -->
  7361. (S1 ^operator O1906 = -0.2714224023553999)
  7362. Retracting rl*prefer*rvt*predict-yes*H0*3
  7363. -->
  7364. (S1 ^operator O1905 = 0.3377110766337923)
  7365. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7366. -->
  7367. (S1 ^operator O1905 = 0.6621942993402632)
  7368. --- END Proposal Phase ---
  7369. --- Decision Phase ---
  7370. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7371. =>WM: (13435: S1 ^operator O1907)
  7372. 954: O: O1907 (predict-yes)
  7373. --- END Decision Phase ---
  7374. --- Application Phase ---
  7375. --- Firing Productions (PE) For State At Depth 1 ---
  7376. --- Inner Elaboration Phase, active level 1 (S1) ---
  7377. Firing apply*operator
  7378. -->
  7379. (I3 ^predict-yes N954 + :O )
  7380. Firing apply*operator*complete
  7381. -->
  7382. (I3 ^predict-no N953 - :O )
  7383. inner elaboration loop at bottom goal.
  7384. --- Change Working Memory (PE) ---
  7385. =>WM: (13436: I3 ^predict-yes N954)
  7386. <=WM: (13422: N953 ^status complete)
  7387. <=WM: (13421: I3 ^predict-no N953)
  7388. --- Firing Productions (IE) For State At Depth 1 ---
  7389. --- Inner Elaboration Phase, active level 1 (S1) ---
  7390. Firing monitor*world
  7391. -->
  7392. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7393. --- Change Working Memory (IE) ---
  7394. --- END Application Phase ---
  7395. --- Output Phase ---
  7396. ENV: Agent did: predict-yes for direction R in state State-A
  7397. In State-A moving R
  7398. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7399. predict error 0
  7400. dir: dir isU
  7401. --- END Output Phase ---
  7402. |\--- Input Phase ---
  7403. =>WM: (13440: I2 ^dir U)
  7404. =>WM: (13439: I2 ^reward 1)
  7405. =>WM: (13438: I2 ^see 1)
  7406. =>WM: (13437: N954 ^status complete)
  7407. <=WM: (13425: I2 ^dir R)
  7408. <=WM: (13424: I2 ^reward 1)
  7409. <=WM: (13423: I2 ^see 0)
  7410. =>WM: (13441: I2 ^level-1 R1-root)
  7411. <=WM: (13426: I2 ^level-1 L1-root)
  7412. --- END Input Phase ---
  7413. --- Proposal Phase ---
  7414. --- Inner Elaboration Phase, active level 1 (S1) ---
  7415. Firing elaborate*copy-see-to-output-link
  7416. -->
  7417. (I3 ^see 1 +)
  7418. Firing elaborate*reward*based*on*reward
  7419. -->
  7420. (R958 ^value 1 +)
  7421. (R1 ^reward R958 +)
  7422. Firing propose*predict-yes
  7423. -->
  7424. (O1909 ^name predict-yes +)
  7425. (S1 ^operator O1909 +)
  7426. Firing propose*predict-no
  7427. -->
  7428. (O1910 ^name predict-no +)
  7429. (S1 ^operator O1910 +)
  7430. Firing rl*prefer*rvt*predict-no*H0*2
  7431. -->
  7432. (S1 ^operator O1908 = 1.)
  7433. Firing rl*prefer*rvt*predict-yes*H0*1
  7434. -->
  7435. (S1 ^operator O1907 = 0.)
  7436. Firing prefer*rvt*predict-yes*H0
  7437. -->
  7438. Firing prefer*rvt*predict-no*H0
  7439. -->
  7440. Firing elaborate*copy-dir-to-output-link
  7441. -->
  7442. (I3 ^dir U +)
  7443. inner elaboration loop at bottom goal.
  7444. Retracting elaborate*copy-see-to-output-link
  7445. -->
  7446. (I3 ^see 0 +)
  7447. Retracting propose*predict-no
  7448. -->
  7449. (O1908 ^name predict-no +)
  7450. (S1 ^operator O1908 +)
  7451. Retracting propose*predict-yes
  7452. -->
  7453. (O1907 ^name predict-yes +)
  7454. (S1 ^operator O1907 +)
  7455. Retracting elaborate*reward*based*on*reward
  7456. -->
  7457. (R957 ^value 1 +)
  7458. (R1 ^reward R957 +)
  7459. Retracting elaborate*copy-dir-to-output-link
  7460. -->
  7461. (I3 ^dir R +)
  7462. Retracting rl*prefer*rvt*predict-no*H0*4
  7463. -->
  7464. (S1 ^operator O1908 = 0.3397650583271044)
  7465. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7466. -->
  7467. (S1 ^operator O1908 = -0.2714224023553999)
  7468. Retracting rl*prefer*rvt*predict-yes*H0*3
  7469. -->
  7470. (S1 ^operator O1907 = 0.3377110766337923)
  7471. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7472. -->
  7473. (S1 ^operator O1907 = 0.6621942993402632)
  7474. =>WM: (13449: S1 ^operator O1910 +)
  7475. =>WM: (13448: S1 ^operator O1909 +)
  7476. =>WM: (13447: I3 ^dir U)
  7477. =>WM: (13446: O1910 ^name predict-no)
  7478. =>WM: (13445: O1909 ^name predict-yes)
  7479. =>WM: (13444: R958 ^value 1)
  7480. =>WM: (13443: R1 ^reward R958)
  7481. =>WM: (13442: I3 ^see 1)
  7482. <=WM: (13433: S1 ^operator O1907 +)
  7483. <=WM: (13435: S1 ^operator O1907)
  7484. <=WM: (13434: S1 ^operator O1908 +)
  7485. <=WM: (13432: I3 ^dir R)
  7486. <=WM: (13428: R1 ^reward R957)
  7487. <=WM: (13427: I3 ^see 0)
  7488. <=WM: (13431: O1908 ^name predict-no)
  7489. <=WM: (13430: O1907 ^name predict-yes)
  7490. <=WM: (13429: R957 ^value 1)
  7491. --- Inner Elaboration Phase, active level 1 (S1) ---
  7492. Firing prefer*rvt*predict-yes*H0
  7493. -->
  7494. Firing rl*prefer*rvt*predict-yes*H0*1
  7495. -->
  7496. (S1 ^operator O1909 = 0.)
  7497. Firing prefer*rvt*predict-no*H0
  7498. -->
  7499. Firing rl*prefer*rvt*predict-no*H0*2
  7500. -->
  7501. (S1 ^operator O1910 = 1.)
  7502. inner elaboration loop at bottom goal.
  7503. Retracting rl*prefer*rvt*predict-no*H0*2
  7504. -->
  7505. (S1 ^operator O1908 = 1.)
  7506. Retracting rl*prefer*rvt*predict-yes*H0*1
  7507. -->
  7508. (S1 ^operator O1907 = 0.)
  7509. --- END Proposal Phase ---
  7510. --- Decision Phase ---
  7511. RL update rl*prefer*rvt*predict-yes*H0*3 0.590111 -0.2524 0.337711 -> 0.59012 -0.252401 0.337719(R,m,v=1,0.89441,0.0950311)
  7512. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.40978 0.252415 0.662194 -> 0.40979 0.252413 0.662203(R,m,v=1,1,0)
  7513. =>WM: (13450: S1 ^operator O1910)
  7514. 955: O: O1910 (predict-no)
  7515. --- END Decision Phase ---
  7516. --- Application Phase ---
  7517. --- Firing Productions (PE) For State At Depth 1 ---
  7518. --- Inner Elaboration Phase, active level 1 (S1) ---
  7519. Firing apply*operator
  7520. -->
  7521. (I3 ^predict-no N955 + :O )
  7522. Firing apply*operator*complete
  7523. -->
  7524. (I3 ^predict-yes N954 - :O )
  7525. inner elaboration loop at bottom goal.
  7526. --- Change Working Memory (PE) ---
  7527. =>WM: (13451: I3 ^predict-no N955)
  7528. <=WM: (13437: N954 ^status complete)
  7529. <=WM: (13436: I3 ^predict-yes N954)
  7530. --- Firing Productions (IE) For State At Depth 1 ---
  7531. --- Inner Elaboration Phase, active level 1 (S1) ---
  7532. Firing monitor*world
  7533. -->
  7534. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7535. --- Change Working Memory (IE) ---
  7536. --- END Application Phase ---
  7537. --- Output Phase ---
  7538. ENV: Agent did: predict-no for direction U in state State-B
  7539. In State-B moving U
  7540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7541. predict error 0
  7542. dir: dir isR
  7543. --- END Output Phase ---
  7544. -/|--- Input Phase ---
  7545. =>WM: (13455: I2 ^dir R)
  7546. =>WM: (13454: I2 ^reward 1)
  7547. =>WM: (13453: I2 ^see 0)
  7548. =>WM: (13452: N955 ^status complete)
  7549. <=WM: (13440: I2 ^dir U)
  7550. <=WM: (13439: I2 ^reward 1)
  7551. <=WM: (13438: I2 ^see 1)
  7552. =>WM: (13456: I2 ^level-1 R1-root)
  7553. <=WM: (13441: I2 ^level-1 R1-root)
  7554. --- END Input Phase ---
  7555. --- Proposal Phase ---
  7556. --- Inner Elaboration Phase, active level 1 (S1) ---
  7557. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7558. -->
  7559. (S1 ^operator O1909 = -0.1070236389116304)
  7560. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7561. -->
  7562. (S1 ^operator O1910 = 0.6602503199844459)
  7563. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7564. -->
  7565. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7566. -->
  7567. Firing elaborate*copy-see-to-output-link
  7568. -->
  7569. (I3 ^see 0 +)
  7570. Firing elaborate*reward*based*on*reward
  7571. -->
  7572. (R959 ^value 1 +)
  7573. (R1 ^reward R959 +)
  7574. Firing propose*predict-yes
  7575. -->
  7576. (O1911 ^name predict-yes +)
  7577. (S1 ^operator O1911 +)
  7578. Firing propose*predict-no
  7579. -->
  7580. (O1912 ^name predict-no +)
  7581. (S1 ^operator O1912 +)
  7582. Firing rl*prefer*rvt*predict-no*H0*4
  7583. -->
  7584. (S1 ^operator O1910 = 0.3397650583271044)
  7585. Firing rl*prefer*rvt*predict-yes*H0*3
  7586. -->
  7587. (S1 ^operator O1909 = 0.3377188564178903)
  7588. Firing prefer*rvt*predict-yes*H0
  7589. -->
  7590. Firing prefer*rvt*predict-no*H0
  7591. -->
  7592. Firing elaborate*copy-dir-to-output-link
  7593. -->
  7594. (I3 ^dir R +)
  7595. inner elaboration loop at bottom goal.
  7596. Retracting elaborate*copy-see-to-output-link
  7597. -->
  7598. (I3 ^see 1 +)
  7599. Retracting propose*predict-no
  7600. -->
  7601. (O1910 ^name predict-no +)
  7602. (S1 ^operator O1910 +)
  7603. Retracting propose*predict-yes
  7604. -->
  7605. (O1909 ^name predict-yes +)
  7606. (S1 ^operator O1909 +)
  7607. Retracting elaborate*reward*based*on*reward
  7608. -->
  7609. (R958 ^value 1 +)
  7610. (R1 ^reward R958 +)
  7611. Retracting elaborate*copy-dir-to-output-link
  7612. -->
  7613. (I3 ^dir U +)
  7614. Retracting rl*prefer*rvt*predict-no*H0*2
  7615. -->
  7616. (S1 ^operator O1910 = 1.)
  7617. Retracting rl*prefer*rvt*predict-yes*H0*1
  7618. -->
  7619. (S1 ^operator O1909 = 0.)
  7620. =>WM: (13464: S1 ^operator O1912 +)
  7621. =>WM: (13463: S1 ^operator O1911 +)
  7622. =>WM: (13462: I3 ^dir R)
  7623. =>WM: (13461: O1912 ^name predict-no)
  7624. =>WM: (13460: O1911 ^name predict-yes)
  7625. =>WM: (13459: R959 ^value 1)
  7626. =>WM: (13458: R1 ^reward R959)
  7627. =>WM: (13457: I3 ^see 0)
  7628. <=WM: (13448: S1 ^operator O1909 +)
  7629. <=WM: (13449: S1 ^operator O1910 +)
  7630. <=WM: (13450: S1 ^operator O1910)
  7631. <=WM: (13447: I3 ^dir U)
  7632. <=WM: (13443: R1 ^reward R958)
  7633. <=WM: (13442: I3 ^see 1)
  7634. <=WM: (13446: O1910 ^name predict-no)
  7635. <=WM: (13445: O1909 ^name predict-yes)
  7636. <=WM: (13444: R958 ^value 1)
  7637. --- Inner Elaboration Phase, active level 1 (S1) ---
  7638. Firing prefer*rvt*predict-yes*H0
  7639. -->
  7640. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7641. -->
  7642. (S1 ^operator O1911 = -0.1070236389116304)
  7643. Firing rl*prefer*rvt*predict-yes*H0*3
  7644. -->
  7645. (S1 ^operator O1911 = 0.3377188564178903)
  7646. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7647. -->
  7648. Firing prefer*rvt*predict-no*H0
  7649. -->
  7650. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7651. -->
  7652. (S1 ^operator O1912 = 0.6602503199844459)
  7653. Firing rl*prefer*rvt*predict-no*H0*4
  7654. -->
  7655. (S1 ^operator O1912 = 0.3397650583271044)
  7656. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7657. -->
  7658. inner elaboration loop at bottom goal.
  7659. Retracting rl*prefer*rvt*predict-no*H0*4
  7660. -->
  7661. (S1 ^operator O1910 = 0.3397650583271044)
  7662. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7663. -->
  7664. (S1 ^operator O1910 = 0.6602503199844459)
  7665. Retracting rl*prefer*rvt*predict-yes*H0*3
  7666. -->
  7667. (S1 ^operator O1909 = 0.3377188564178903)
  7668. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7669. -->
  7670. (S1 ^operator O1909 = -0.1070236389116304)
  7671. --- END Proposal Phase ---
  7672. --- Decision Phase ---
  7673. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7674. =>WM: (13465: S1 ^operator O1912)
  7675. 956: O: O1912 (predict-no)
  7676. --- END Decision Phase ---
  7677. --- Application Phase ---
  7678. --- Firing Productions (PE) For State At Depth 1 ---
  7679. --- Inner Elaboration Phase, active level 1 (S1) ---
  7680. Firing apply*operator
  7681. -->
  7682. (I3 ^predict-no N956 + :O )
  7683. Firing apply*operator*complete
  7684. -->
  7685. (I3 ^predict-no N955 - :O )
  7686. inner elaboration loop at bottom goal.
  7687. --- Change Working Memory (PE) ---
  7688. =>WM: (13466: I3 ^predict-no N956)
  7689. <=WM: (13452: N955 ^status complete)
  7690. <=WM: (13451: I3 ^predict-no N955)
  7691. --- Firing Productions (IE) For State At Depth 1 ---
  7692. --- Inner Elaboration Phase, active level 1 (S1) ---
  7693. Firing monitor*world
  7694. -->
  7695. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7696. --- Change Working Memory (IE) ---
  7697. --- END Application Phase ---
  7698. --- Output Phase ---
  7699. ENV: Agent did: predict-no for direction R in state State-B
  7700. In State-B moving R
  7701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7702. predict error 0
  7703. dir: dir isR
  7704. --- END Output Phase ---
  7705. \-/--- Input Phase ---
  7706. =>WM: (13470: I2 ^dir R)
  7707. =>WM: (13469: I2 ^reward 1)
  7708. =>WM: (13468: I2 ^see 0)
  7709. =>WM: (13467: N956 ^status complete)
  7710. <=WM: (13455: I2 ^dir R)
  7711. <=WM: (13454: I2 ^reward 1)
  7712. <=WM: (13453: I2 ^see 0)
  7713. =>WM: (13471: I2 ^level-1 R0-root)
  7714. <=WM: (13456: I2 ^level-1 R1-root)
  7715. --- END Input Phase ---
  7716. --- Proposal Phase ---
  7717. --- Inner Elaboration Phase, active level 1 (S1) ---
  7718. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7719. -->
  7720. (S1 ^operator O1912 = 0.6601435952544124)
  7721. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7722. -->
  7723. (S1 ^operator O1911 = -0.1028953566115423)
  7724. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7725. -->
  7726. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7727. -->
  7728. Firing elaborate*copy-see-to-output-link
  7729. -->
  7730. (I3 ^see 0 +)
  7731. Firing elaborate*reward*based*on*reward
  7732. -->
  7733. (R960 ^value 1 +)
  7734. (R1 ^reward R960 +)
  7735. Firing propose*predict-yes
  7736. -->
  7737. (O1913 ^name predict-yes +)
  7738. (S1 ^operator O1913 +)
  7739. Firing propose*predict-no
  7740. -->
  7741. (O1914 ^name predict-no +)
  7742. (S1 ^operator O1914 +)
  7743. Firing rl*prefer*rvt*predict-no*H0*4
  7744. -->
  7745. (S1 ^operator O1912 = 0.3397650583271044)
  7746. Firing rl*prefer*rvt*predict-yes*H0*3
  7747. -->
  7748. (S1 ^operator O1911 = 0.3377188564178903)
  7749. Firing prefer*rvt*predict-yes*H0
  7750. -->
  7751. Firing prefer*rvt*predict-no*H0
  7752. -->
  7753. Firing elaborate*copy-dir-to-output-link
  7754. -->
  7755. (I3 ^dir R +)
  7756. inner elaboration loop at bottom goal.
  7757. Retracting elaborate*copy-see-to-output-link
  7758. -->
  7759. (I3 ^see 0 +)
  7760. Retracting propose*predict-no
  7761. -->
  7762. (O1912 ^name predict-no +)
  7763. (S1 ^operator O1912 +)
  7764. Retracting propose*predict-yes
  7765. -->
  7766. (O1911 ^name predict-yes +)
  7767. (S1 ^operator O1911 +)
  7768. Retracting elaborate*reward*based*on*reward
  7769. -->
  7770. (R959 ^value 1 +)
  7771. (R1 ^reward R959 +)
  7772. Retracting elaborate*copy-dir-to-output-link
  7773. -->
  7774. (I3 ^dir R +)
  7775. Retracting rl*prefer*rvt*predict-no*H0*4
  7776. -->
  7777. (S1 ^operator O1912 = 0.3397650583271044)
  7778. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7779. -->
  7780. (S1 ^operator O1912 = 0.6602503199844459)
  7781. Retracting rl*prefer*rvt*predict-yes*H0*3
  7782. -->
  7783. (S1 ^operator O1911 = 0.3377188564178903)
  7784. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7785. -->
  7786. (S1 ^operator O1911 = -0.1070236389116304)
  7787. =>WM: (13477: S1 ^operator O1914 +)
  7788. =>WM: (13476: S1 ^operator O1913 +)
  7789. =>WM: (13475: O1914 ^name predict-no)
  7790. =>WM: (13474: O1913 ^name predict-yes)
  7791. =>WM: (13473: R960 ^value 1)
  7792. =>WM: (13472: R1 ^reward R960)
  7793. <=WM: (13463: S1 ^operator O1911 +)
  7794. <=WM: (13464: S1 ^operator O1912 +)
  7795. <=WM: (13465: S1 ^operator O1912)
  7796. <=WM: (13458: R1 ^reward R959)
  7797. <=WM: (13461: O1912 ^name predict-no)
  7798. <=WM: (13460: O1911 ^name predict-yes)
  7799. <=WM: (13459: R959 ^value 1)
  7800. --- Inner Elaboration Phase, active level 1 (S1) ---
  7801. Firing prefer*rvt*predict-yes*H0
  7802. -->
  7803. Firing rl*prefer*rvt*predict-yes*H0*3
  7804. -->
  7805. (S1 ^operator O1913 = 0.3377188564178903)
  7806. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7807. -->
  7808. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7809. -->
  7810. (S1 ^operator O1913 = -0.1028953566115423)
  7811. Firing prefer*rvt*predict-no*H0
  7812. -->
  7813. Firing rl*prefer*rvt*predict-no*H0*4
  7814. -->
  7815. (S1 ^operator O1914 = 0.3397650583271044)
  7816. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7817. -->
  7818. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7819. -->
  7820. (S1 ^operator O1914 = 0.6601435952544124)
  7821. inner elaboration loop at bottom goal.
  7822. Retracting rl*prefer*rvt*predict-no*H0*4
  7823. -->
  7824. (S1 ^operator O1912 = 0.3397650583271044)
  7825. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7826. -->
  7827. (S1 ^operator O1912 = 0.6601435952544124)
  7828. Retracting rl*prefer*rvt*predict-yes*H0*3
  7829. -->
  7830. (S1 ^operator O1911 = 0.3377188564178903)
  7831. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7832. -->
  7833. (S1 ^operator O1911 = -0.1028953566115423)
  7834. --- END Proposal Phase ---
  7835. --- Decision Phase ---
  7836. RL update rl*prefer*rvt*predict-no*H0*4 0.570248 -0.230483 0.339765 -> 0.570247 -0.230483 0.339764(R,m,v=1,0.871166,0.112929)
  7837. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429768 0.230482 0.66025 -> 0.429766 0.230483 0.660249(R,m,v=1,1,0)
  7838. =>WM: (13478: S1 ^operator O1914)
  7839. 957: O: O1914 (predict-no)
  7840. --- END Decision Phase ---
  7841. --- Application Phase ---
  7842. --- Firing Productions (PE) For State At Depth 1 ---
  7843. --- Inner Elaboration Phase, active level 1 (S1) ---
  7844. Firing apply*operator
  7845. -->
  7846. (I3 ^predict-no N957 + :O )
  7847. Firing apply*operator*complete
  7848. -->
  7849. (I3 ^predict-no N956 - :O )
  7850. inner elaboration loop at bottom goal.
  7851. --- Change Working Memory (PE) ---
  7852. =>WM: (13479: I3 ^predict-no N957)
  7853. <=WM: (13467: N956 ^status complete)
  7854. <=WM: (13466: I3 ^predict-no N956)
  7855. --- Firing Productions (IE) For State At Depth 1 ---
  7856. --- Inner Elaboration Phase, active level 1 (S1) ---
  7857. Firing monitor*world
  7858. -->
  7859. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7860. --- Change Working Memory (IE) ---
  7861. --- END Application Phase ---
  7862. --- Output Phase ---
  7863. ENV: Agent did: predict-no for direction R in state State-B
  7864. In State-B moving R
  7865. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7866. predict error 0
  7867. dir: dir isL
  7868. --- END Output Phase ---
  7869. |\--- Input Phase ---
  7870. =>WM: (13483: I2 ^dir L)
  7871. =>WM: (13482: I2 ^reward 1)
  7872. =>WM: (13481: I2 ^see 0)
  7873. =>WM: (13480: N957 ^status complete)
  7874. <=WM: (13470: I2 ^dir R)
  7875. <=WM: (13469: I2 ^reward 1)
  7876. <=WM: (13468: I2 ^see 0)
  7877. =>WM: (13484: I2 ^level-1 R0-root)
  7878. <=WM: (13471: I2 ^level-1 R0-root)
  7879. --- END Input Phase ---
  7880. --- Proposal Phase ---
  7881. --- Inner Elaboration Phase, active level 1 (S1) ---
  7882. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7883. -->
  7884. (S1 ^operator O1913 = 0.7358024669452599)
  7885. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7886. -->
  7887. Firing elaborate*copy-see-to-output-link
  7888. -->
  7889. (I3 ^see 0 +)
  7890. Firing elaborate*reward*based*on*reward
  7891. -->
  7892. (R961 ^value 1 +)
  7893. (R1 ^reward R961 +)
  7894. Firing propose*predict-yes
  7895. -->
  7896. (O1915 ^name predict-yes +)
  7897. (S1 ^operator O1915 +)
  7898. Firing propose*predict-no
  7899. -->
  7900. (O1916 ^name predict-no +)
  7901. (S1 ^operator O1916 +)
  7902. Firing rl*prefer*rvt*predict-no*H0*6
  7903. -->
  7904. (S1 ^operator O1914 = 0.9996367744406318)
  7905. Firing rl*prefer*rvt*predict-yes*H0*5
  7906. -->
  7907. (S1 ^operator O1913 = 0.2640663414827097)
  7908. Firing prefer*rvt*predict-yes*H0
  7909. -->
  7910. Firing prefer*rvt*predict-no*H0
  7911. -->
  7912. Firing elaborate*copy-dir-to-output-link
  7913. -->
  7914. (I3 ^dir L +)
  7915. inner elaboration loop at bottom goal.
  7916. Retracting elaborate*copy-see-to-output-link
  7917. -->
  7918. (I3 ^see 0 +)
  7919. Retracting propose*predict-no
  7920. -->
  7921. (O1914 ^name predict-no +)
  7922. (S1 ^operator O1914 +)
  7923. Retracting propose*predict-yes
  7924. -->
  7925. (O1913 ^name predict-yes +)
  7926. (S1 ^operator O1913 +)
  7927. Retracting elaborate*reward*based*on*reward
  7928. -->
  7929. (R960 ^value 1 +)
  7930. (R1 ^reward R960 +)
  7931. Retracting elaborate*copy-dir-to-output-link
  7932. -->
  7933. (I3 ^dir R +)
  7934. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7935. -->
  7936. (S1 ^operator O1914 = 0.6601435952544124)
  7937. Retracting rl*prefer*rvt*predict-no*H0*4
  7938. -->
  7939. (S1 ^operator O1914 = 0.3397637965169674)
  7940. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7941. -->
  7942. (S1 ^operator O1913 = -0.1028953566115423)
  7943. Retracting rl*prefer*rvt*predict-yes*H0*3
  7944. -->
  7945. (S1 ^operator O1913 = 0.3377188564178903)
  7946. =>WM: (13491: S1 ^operator O1916 +)
  7947. =>WM: (13490: S1 ^operator O1915 +)
  7948. =>WM: (13489: I3 ^dir L)
  7949. =>WM: (13488: O1916 ^name predict-no)
  7950. =>WM: (13487: O1915 ^name predict-yes)
  7951. =>WM: (13486: R961 ^value 1)
  7952. =>WM: (13485: R1 ^reward R961)
  7953. <=WM: (13476: S1 ^operator O1913 +)
  7954. <=WM: (13477: S1 ^operator O1914 +)
  7955. <=WM: (13478: S1 ^operator O1914)
  7956. <=WM: (13462: I3 ^dir R)
  7957. <=WM: (13472: R1 ^reward R960)
  7958. <=WM: (13475: O1914 ^name predict-no)
  7959. <=WM: (13474: O1913 ^name predict-yes)
  7960. <=WM: (13473: R960 ^value 1)
  7961. --- Inner Elaboration Phase, active level 1 (S1) ---
  7962. Firing prefer*rvt*predict-yes*H0
  7963. -->
  7964. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7965. -->
  7966. (S1 ^operator O1915 = 0.7358024669452599)
  7967. Firing rl*prefer*rvt*predict-yes*H0*5
  7968. -->
  7969. (S1 ^operator O1915 = 0.2640663414827097)
  7970. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7971. -->
  7972. Firing prefer*rvt*predict-no*H0
  7973. -->
  7974. Firing rl*prefer*rvt*predict-no*H0*6
  7975. -->
  7976. (S1 ^operator O1916 = 0.9996367744406318)
  7977. inner elaboration loop at bottom goal.
  7978. Retracting rl*prefer*rvt*predict-no*H0*6
  7979. -->
  7980. (S1 ^operator O1914 = 0.9996367744406318)
  7981. Retracting rl*prefer*rvt*predict-yes*H0*5
  7982. -->
  7983. (S1 ^operator O1913 = 0.2640663414827097)
  7984. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7985. -->
  7986. (S1 ^operator O1913 = 0.7358024669452599)
  7987. --- END Proposal Phase ---
  7988. --- Decision Phase ---
  7989. RL update rl*prefer*rvt*predict-no*H0*4 0.570247 -0.230483 0.339764 -> 0.570255 -0.230484 0.339771(R,m,v=1,0.871951,0.112337)
  7990. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429656 0.230488 0.660144 -> 0.429665 0.230487 0.660152(R,m,v=1,1,0)
  7991. =>WM: (13492: S1 ^operator O1915)
  7992. 958: O: O1915 (predict-yes)
  7993. --- END Decision Phase ---
  7994. --- Application Phase ---
  7995. --- Firing Productions (PE) For State At Depth 1 ---
  7996. --- Inner Elaboration Phase, active level 1 (S1) ---
  7997. Firing apply*operator
  7998. -->
  7999. (I3 ^predict-yes N958 + :O )
  8000. Firing apply*operator*complete
  8001. -->
  8002. (I3 ^predict-no N957 - :O )
  8003. inner elaboration loop at bottom goal.
  8004. --- Change Working Memory (PE) ---
  8005. =>WM: (13493: I3 ^predict-yes N958)
  8006. <=WM: (13480: N957 ^status complete)
  8007. <=WM: (13479: I3 ^predict-no N957)
  8008. --- Firing Productions (IE) For State At Depth 1 ---
  8009. --- Inner Elaboration Phase, active level 1 (S1) ---
  8010. Firing monitor*world
  8011. -->
  8012. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8013. --- Change Working Memory (IE) ---
  8014. --- END Application Phase ---
  8015. --- Output Phase ---
  8016. ENV: Agent did: predict-yes for direction L in state State-B
  8017. In State-B moving L
  8018. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8019. predict error 0
  8020. dir: dir isU
  8021. --- END Output Phase ---
  8022. -/|--- Input Phase ---
  8023. =>WM: (13497: I2 ^dir U)
  8024. =>WM: (13496: I2 ^reward 1)
  8025. =>WM: (13495: I2 ^see 1)
  8026. =>WM: (13494: N958 ^status complete)
  8027. <=WM: (13483: I2 ^dir L)
  8028. <=WM: (13482: I2 ^reward 1)
  8029. <=WM: (13481: I2 ^see 0)
  8030. =>WM: (13498: I2 ^level-1 L1-root)
  8031. <=WM: (13484: I2 ^level-1 R0-root)
  8032. --- END Input Phase ---
  8033. --- Proposal Phase ---
  8034. --- Inner Elaboration Phase, active level 1 (S1) ---
  8035. Firing elaborate*copy-see-to-output-link
  8036. -->
  8037. (I3 ^see 1 +)
  8038. Firing elaborate*reward*based*on*reward
  8039. -->
  8040. (R962 ^value 1 +)
  8041. (R1 ^reward R962 +)
  8042. Firing propose*predict-yes
  8043. -->
  8044. (O1917 ^name predict-yes +)
  8045. (S1 ^operator O1917 +)
  8046. Firing propose*predict-no
  8047. -->
  8048. (O1918 ^name predict-no +)
  8049. (S1 ^operator O1918 +)
  8050. Firing rl*prefer*rvt*predict-no*H0*2
  8051. -->
  8052. (S1 ^operator O1916 = 1.)
  8053. Firing rl*prefer*rvt*predict-yes*H0*1
  8054. -->
  8055. (S1 ^operator O1915 = 0.)
  8056. Firing prefer*rvt*predict-yes*H0
  8057. -->
  8058. Firing prefer*rvt*predict-no*H0
  8059. -->
  8060. Firing elaborate*copy-dir-to-output-link
  8061. -->
  8062. (I3 ^dir U +)
  8063. inner elaboration loop at bottom goal.
  8064. Retracting elaborate*copy-see-to-output-link
  8065. -->
  8066. (I3 ^see 0 +)
  8067. Retracting propose*predict-no
  8068. -->
  8069. (O1916 ^name predict-no +)
  8070. (S1 ^operator O1916 +)
  8071. Retracting propose*predict-yes
  8072. -->
  8073. (O1915 ^name predict-yes +)
  8074. (S1 ^operator O1915 +)
  8075. Retracting elaborate*reward*based*on*reward
  8076. -->
  8077. (R961 ^value 1 +)
  8078. (R1 ^reward R961 +)
  8079. Retracting elaborate*copy-dir-to-output-link
  8080. -->
  8081. (I3 ^dir L +)
  8082. Retracting rl*prefer*rvt*predict-no*H0*6
  8083. -->
  8084. (S1 ^operator O1916 = 0.9996367744406318)
  8085. Retracting rl*prefer*rvt*predict-yes*H0*5
  8086. -->
  8087. (S1 ^operator O1915 = 0.2640663414827097)
  8088. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  8089. -->
  8090. (S1 ^operator O1915 = 0.7358024669452599)
  8091. =>WM: (13506: S1 ^operator O1918 +)
  8092. =>WM: (13505: S1 ^operator O1917 +)
  8093. =>WM: (13504: I3 ^dir U)
  8094. =>WM: (13503: O1918 ^name predict-no)
  8095. =>WM: (13502: O1917 ^name predict-yes)
  8096. =>WM: (13501: R962 ^value 1)
  8097. =>WM: (13500: R1 ^reward R962)
  8098. =>WM: (13499: I3 ^see 1)
  8099. <=WM: (13490: S1 ^operator O1915 +)
  8100. <=WM: (13492: S1 ^operator O1915)
  8101. <=WM: (13491: S1 ^operator O1916 +)
  8102. <=WM: (13489: I3 ^dir L)
  8103. <=WM: (13485: R1 ^reward R961)
  8104. <=WM: (13457: I3 ^see 0)
  8105. <=WM: (13488: O1916 ^name predict-no)
  8106. <=WM: (13487: O1915 ^name predict-yes)
  8107. <=WM: (13486: R961 ^value 1)
  8108. --- Inner Elaboration Phase, active level 1 (S1) ---
  8109. Firing prefer*rvt*predict-yes*H0
  8110. -->
  8111. Firing rl*prefer*rvt*predict-yes*H0*1
  8112. -->
  8113. (S1 ^operator O1917 = 0.)
  8114. Firing prefer*rvt*predict-no*H0
  8115. -->
  8116. Firing rl*prefer*rvt*predict-no*H0*2
  8117. -->
  8118. (S1 ^operator O1918 = 1.)
  8119. inner elaboration loop at bottom goal.
  8120. Retracting rl*prefer*rvt*predict-no*H0*2
  8121. -->
  8122. (S1 ^operator O1916 = 1.)
  8123. Retracting rl*prefer*rvt*predict-yes*H0*1
  8124. -->
  8125. (S1 ^operator O1915 = 0.)
  8126. --- END Proposal Phase ---
  8127. --- Decision Phase ---
  8128. RL update rl*prefer*rvt*predict-yes*H0*5 0.554451 -0.290385 0.264066 -> 0.554462 -0.290385 0.264077(R,m,v=1,0.872832,0.111641)
  8129. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.44542 0.290383 0.735802 -> 0.445432 0.290383 0.735815(R,m,v=1,1,0)
  8130. =>WM: (13507: S1 ^operator O1918)
  8131. 959: O: O1918 (predict-no)
  8132. --- END Decision Phase ---
  8133. --- Application Phase ---
  8134. --- Firing Productions (PE) For State At Depth 1 ---
  8135. --- Inner Elaboration Phase, active level 1 (S1) ---
  8136. Firing apply*operator
  8137. -->
  8138. (I3 ^predict-no N959 + :O )
  8139. Firing apply*operator*complete
  8140. -->
  8141. (I3 ^predict-yes N958 - :O )
  8142. inner elaboration loop at bottom goal.
  8143. --- Change Working Memory (PE) ---
  8144. =>WM: (13508: I3 ^predict-no N959)
  8145. <=WM: (13494: N958 ^status complete)
  8146. <=WM: (13493: I3 ^predict-yes N958)
  8147. --- Firing Productions (IE) For State At Depth 1 ---
  8148. --- Inner Elaboration Phase, active level 1 (S1) ---
  8149. Firing monitor*world
  8150. -->
  8151. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8152. --- Change Working Memory (IE) ---
  8153. --- END Application Phase ---
  8154. --- Output Phase ---
  8155. ENV: Agent did: predict-no for direction U in state State-A
  8156. In State-A moving U
  8157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8158. predict error 0
  8159. dir: dir isL
  8160. --- END Output Phase ---
  8161. \-/--- Input Phase ---
  8162. =>WM: (13512: I2 ^dir L)
  8163. =>WM: (13511: I2 ^reward 1)
  8164. =>WM: (13510: I2 ^see 0)
  8165. =>WM: (13509: N959 ^status complete)
  8166. <=WM: (13497: I2 ^dir U)
  8167. <=WM: (13496: I2 ^reward 1)
  8168. <=WM: (13495: I2 ^see 1)
  8169. =>WM: (13513: I2 ^level-1 L1-root)
  8170. <=WM: (13498: I2 ^level-1 L1-root)
  8171. --- END Input Phase ---
  8172. --- Proposal Phase ---
  8173. --- Inner Elaboration Phase, active level 1 (S1) ---
  8174. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8175. -->
  8176. (S1 ^operator O1917 = -0.181727099742844)
  8177. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8178. -->
  8179. Firing elaborate*copy-see-to-output-link
  8180. -->
  8181. (I3 ^see 0 +)
  8182. Firing elaborate*reward*based*on*reward
  8183. -->
  8184. (R963 ^value 1 +)
  8185. (R1 ^reward R963 +)
  8186. Firing propose*predict-yes
  8187. -->
  8188. (O1919 ^name predict-yes +)
  8189. (S1 ^operator O1919 +)
  8190. Firing propose*predict-no
  8191. -->
  8192. (O1920 ^name predict-no +)
  8193. (S1 ^operator O1920 +)
  8194. Firing rl*prefer*rvt*predict-no*H0*6
  8195. -->
  8196. (S1 ^operator O1918 = 0.9996367744406318)
  8197. Firing rl*prefer*rvt*predict-yes*H0*5
  8198. -->
  8199. (S1 ^operator O1917 = 0.2640770017585976)
  8200. Firing prefer*rvt*predict-yes*H0
  8201. -->
  8202. Firing prefer*rvt*predict-no*H0
  8203. -->
  8204. Firing elaborate*copy-dir-to-output-link
  8205. -->
  8206. (I3 ^dir L +)
  8207. inner elaboration loop at bottom goal.
  8208. Retracting elaborate*copy-see-to-output-link
  8209. -->
  8210. (I3 ^see 1 +)
  8211. Retracting propose*predict-no
  8212. -->
  8213. (O1918 ^name predict-no +)
  8214. (S1 ^operator O1918 +)
  8215. Retracting propose*predict-yes
  8216. -->
  8217. (O1917 ^name predict-yes +)
  8218. (S1 ^operator O1917 +)
  8219. Retracting elaborate*reward*based*on*reward
  8220. -->
  8221. (R962 ^value 1 +)
  8222. (R1 ^reward R962 +)
  8223. Retracting elaborate*copy-dir-to-output-link
  8224. -->
  8225. (I3 ^dir U +)
  8226. Retracting rl*prefer*rvt*predict-no*H0*2
  8227. -->
  8228. (S1 ^operator O1918 = 1.)
  8229. Retracting rl*prefer*rvt*predict-yes*H0*1
  8230. -->
  8231. (S1 ^operator O1917 = 0.)
  8232. =>WM: (13521: S1 ^operator O1920 +)
  8233. =>WM: (13520: S1 ^operator O1919 +)
  8234. =>WM: (13519: I3 ^dir L)
  8235. =>WM: (13518: O1920 ^name predict-no)
  8236. =>WM: (13517: O1919 ^name predict-yes)
  8237. =>WM: (13516: R963 ^value 1)
  8238. =>WM: (13515: R1 ^reward R963)
  8239. =>WM: (13514: I3 ^see 0)
  8240. <=WM: (13505: S1 ^operator O1917 +)
  8241. <=WM: (13506: S1 ^operator O1918 +)
  8242. <=WM: (13507: S1 ^operator O1918)
  8243. <=WM: (13504: I3 ^dir U)
  8244. <=WM: (13500: R1 ^reward R962)
  8245. <=WM: (13499: I3 ^see 1)
  8246. <=WM: (13503: O1918 ^name predict-no)
  8247. <=WM: (13502: O1917 ^name predict-yes)
  8248. <=WM: (13501: R962 ^value 1)
  8249. --- Inner Elaboration Phase, active level 1 (S1) ---
  8250. Firing prefer*rvt*predict-yes*H0
  8251. -->
  8252. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8253. -->
  8254. (S1 ^operator O1919 = -0.181727099742844)
  8255. Firing rl*prefer*rvt*predict-yes*H0*5
  8256. -->
  8257. (S1 ^operator O1919 = 0.2640770017585976)
  8258. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8259. -->
  8260. Firing prefer*rvt*predict-no*H0
  8261. -->
  8262. Firing rl*prefer*rvt*predict-no*H0*6
  8263. -->
  8264. (S1 ^operator O1920 = 0.9996367744406318)
  8265. inner elaboration loop at bottom goal.
  8266. Retracting rl*prefer*rvt*predict-no*H0*6
  8267. -->
  8268. (S1 ^operator O1918 = 0.9996367744406318)
  8269. Retracting rl*prefer*rvt*predict-yes*H0*5
  8270. -->
  8271. (S1 ^operator O1917 = 0.2640770017585976)
  8272. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8273. -->
  8274. (S1 ^operator O1917 = -0.181727099742844)
  8275. --- END Proposal Phase ---
  8276. --- Decision Phase ---
  8277. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8278. =>WM: (13522: S1 ^operator O1920)
  8279. 960: O: O1920 (predict-no)
  8280. --- END Decision Phase ---
  8281. --- Application Phase ---
  8282. --- Firing Productions (PE) For State At Depth 1 ---
  8283. --- Inner Elaboration Phase, active level 1 (S1) ---
  8284. Firing apply*operator
  8285. -->
  8286. (I3 ^predict-no N960 + :O )
  8287. Firing apply*operator*complete
  8288. -->
  8289. (I3 ^predict-no N959 - :O )
  8290. inner elaboration loop at bottom goal.
  8291. --- Change Working Memory (PE) ---
  8292. =>WM: (13523: I3 ^predict-no N960)
  8293. <=WM: (13509: N959 ^status complete)
  8294. <=WM: (13508: I3 ^predict-no N959)
  8295. --- Firing Productions (IE) For State At Depth 1 ---
  8296. --- Inner Elaboration Phase, active level 1 (S1) ---
  8297. Firing monitor*world
  8298. -->
  8299. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8300. --- Change Working Memory (IE) ---
  8301. --- END Application Phase ---
  8302. --- Output Phase ---
  8303. ENV: Agent did: predict-no for direction L in state State-A
  8304. In State-A moving L
  8305. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8306. predict error 0
  8307. dir: dir isU
  8308. --- END Output Phase ---
  8309. |\---- Input Phase ---
  8310. =>WM: (13527: I2 ^dir U)
  8311. =>WM: (13526: I2 ^reward 1)
  8312. =>WM: (13525: I2 ^see 0)
  8313. =>WM: (13524: N960 ^status complete)
  8314. <=WM: (13512: I2 ^dir L)
  8315. <=WM: (13511: I2 ^reward 1)
  8316. <=WM: (13510: I2 ^see 0)
  8317. =>WM: (13528: I2 ^level-1 L0-root)
  8318. <=WM: (13513: I2 ^level-1 L1-root)
  8319. --- END Input Phase ---
  8320. --- Proposal Phase ---
  8321. --- Inner Elaboration Phase, active level 1 (S1) ---
  8322. Firing elaborate*copy-see-to-output-link
  8323. -->
  8324. (I3 ^see 0 +)
  8325. Firing elaborate*reward*based*on*reward
  8326. -->
  8327. (R964 ^value 1 +)
  8328. (R1 ^reward R964 +)
  8329. Firing propose*predict-yes
  8330. -->
  8331. (O1921 ^name predict-yes +)
  8332. (S1 ^operator O1921 +)
  8333. Firing propose*predict-no
  8334. -->
  8335. (O1922 ^name predict-no +)
  8336. (S1 ^operator O1922 +)
  8337. Firing rl*prefer*rvt*predict-no*H0*2
  8338. -->
  8339. (S1 ^operator O1920 = 1.)
  8340. Firing rl*prefer*rvt*predict-yes*H0*1
  8341. -->
  8342. (S1 ^operator O1919 = 0.)
  8343. Firing prefer*rvt*predict-yes*H0
  8344. -->
  8345. Firing prefer*rvt*predict-no*H0
  8346. -->
  8347. Firing elaborate*copy-dir-to-output-link
  8348. -->
  8349. (I3 ^dir U +)
  8350. inner elaboration loop at bottom goal.
  8351. Retracting elaborate*copy-see-to-output-link
  8352. -->
  8353. (I3 ^see 0 +)
  8354. Retracting propose*predict-no
  8355. -->
  8356. (O1920 ^name predict-no +)
  8357. (S1 ^operator O1920 +)
  8358. Retracting propose*predict-yes
  8359. -->
  8360. (O1919 ^name predict-yes +)
  8361. (S1 ^operator O1919 +)
  8362. Retracting elaborate*reward*based*on*reward
  8363. -->
  8364. (R963 ^value 1 +)
  8365. (R1 ^reward R963 +)
  8366. Retracting elaborate*copy-dir-to-output-link
  8367. -->
  8368. (I3 ^dir L +)
  8369. Retracting rl*prefer*rvt*predict-no*H0*6
  8370. -->
  8371. (S1 ^operator O1920 = 0.9996367744406318)
  8372. Retracting rl*prefer*rvt*predict-yes*H0*5
  8373. -->
  8374. (S1 ^operator O1919 = 0.2640770017585976)
  8375. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8376. -->
  8377. (S1 ^operator O1919 = -0.181727099742844)
  8378. =>WM: (13535: S1 ^operator O1922 +)
  8379. =>WM: (13534: S1 ^operator O1921 +)
  8380. =>WM: (13533: I3 ^dir U)
  8381. =>WM: (13532: O1922 ^name predict-no)
  8382. =>WM: (13531: O1921 ^name predict-yes)
  8383. =>WM: (13530: R964 ^value 1)
  8384. =>WM: (13529: R1 ^reward R964)
  8385. <=WM: (13520: S1 ^operator O1919 +)
  8386. <=WM: (13521: S1 ^operator O1920 +)
  8387. <=WM: (13522: S1 ^operator O1920)
  8388. <=WM: (13519: I3 ^dir L)
  8389. <=WM: (13515: R1 ^reward R963)
  8390. <=WM: (13518: O1920 ^name predict-no)
  8391. <=WM: (13517: O1919 ^name predict-yes)
  8392. <=WM: (13516: R963 ^value 1)
  8393. --- Inner Elaboration Phase, active level 1 (S1) ---
  8394. Firing prefer*rvt*predict-yes*H0
  8395. -->
  8396. Firing rl*prefer*rvt*predict-yes*H0*1
  8397. -->
  8398. (S1 ^operator O1921 = 0.)
  8399. Firing prefer*rvt*predict-no*H0
  8400. -->
  8401. Firing rl*prefer*rvt*predict-no*H0*2
  8402. -->
  8403. (S1 ^operator O1922 = 1.)
  8404. inner elaboration loop at bottom goal.
  8405. Retracting rl*prefer*rvt*predict-no*H0*2
  8406. -->
  8407. (S1 ^operator O1920 = 1.)
  8408. Retracting rl*prefer*rvt*predict-yes*H0*1
  8409. -->
  8410. (S1 ^operator O1919 = 0.)
  8411. --- END Proposal Phase ---
  8412. --- Decision Phase ---
  8413. RL update rl*prefer*rvt*predict-no*H0*6 0.999637 0 0.999637 -> 0.999698 0 0.999698(R,m,v=1,0.903448,0.0878352)
  8414. =>WM: (13536: S1 ^operator O1922)
  8415. 961: O: O1922 (predict-no)
  8416. --- END Decision Phase ---
  8417. --- Application Phase ---
  8418. --- Firing Productions (PE) For State At Depth 1 ---
  8419. --- Inner Elaboration Phase, active level 1 (S1) ---
  8420. Firing apply*operator
  8421. -->
  8422. (I3 ^predict-no N961 + :O )
  8423. Firing apply*operator*complete
  8424. -->
  8425. (I3 ^predict-no N960 - :O )
  8426. inner elaboration loop at bottom goal.
  8427. --- Change Working Memory (PE) ---
  8428. =>WM: (13537: I3 ^predict-no N961)
  8429. <=WM: (13524: N960 ^status complete)
  8430. <=WM: (13523: I3 ^predict-no N960)
  8431. --- Firing Productions (IE) For State At Depth 1 ---
  8432. --- Inner Elaboration Phase, active level 1 (S1) ---
  8433. Firing monitor*world
  8434. -->
  8435. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8436. --- Change Working Memory (IE) ---
  8437. --- END Application Phase ---
  8438. --- Output Phase ---
  8439. ENV: Agent did: predict-no for direction U in state State-A
  8440. In State-A moving U
  8441. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8442. predict error 0
  8443. dir: dir isR
  8444. --- END Output Phase ---
  8445. /--- Input Phase ---
  8446. =>WM: (13541: I2 ^dir R)
  8447. =>WM: (13540: I2 ^reward 1)
  8448. =>WM: (13539: I2 ^see 0)
  8449. =>WM: (13538: N961 ^status complete)
  8450. <=WM: (13527: I2 ^dir U)
  8451. <=WM: (13526: I2 ^reward 1)
  8452. <=WM: (13525: I2 ^see 0)
  8453. =>WM: (13542: I2 ^level-1 L0-root)
  8454. <=WM: (13528: I2 ^level-1 L0-root)
  8455. --- END Input Phase ---
  8456. --- Proposal Phase ---
  8457. --- Inner Elaboration Phase, active level 1 (S1) ---
  8458. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8459. -->
  8460. (S1 ^operator O1922 = -0.2817060109291377)
  8461. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8462. -->
  8463. (S1 ^operator O1921 = 0.6623767743575877)
  8464. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8465. -->
  8466. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8467. -->
  8468. Firing elaborate*copy-see-to-output-link
  8469. -->
  8470. (I3 ^see 0 +)
  8471. Firing elaborate*reward*based*on*reward
  8472. -->
  8473. (R965 ^value 1 +)
  8474. (R1 ^reward R965 +)
  8475. Firing propose*predict-yes
  8476. -->
  8477. (O1923 ^name predict-yes +)
  8478. (S1 ^operator O1923 +)
  8479. Firing propose*predict-no
  8480. -->
  8481. (O1924 ^name predict-no +)
  8482. (S1 ^operator O1924 +)
  8483. Firing rl*prefer*rvt*predict-no*H0*4
  8484. -->
  8485. (S1 ^operator O1922 = 0.3397713875215998)
  8486. Firing rl*prefer*rvt*predict-yes*H0*3
  8487. -->
  8488. (S1 ^operator O1921 = 0.3377188564178903)
  8489. Firing prefer*rvt*predict-yes*H0
  8490. -->
  8491. Firing prefer*rvt*predict-no*H0
  8492. -->
  8493. Firing elaborate*copy-dir-to-output-link
  8494. -->
  8495. (I3 ^dir R +)
  8496. inner elaboration loop at bottom goal.
  8497. Retracting elaborate*copy-see-to-output-link
  8498. -->
  8499. (I3 ^see 0 +)
  8500. Retracting propose*predict-no
  8501. -->
  8502. (O1922 ^name predict-no +)
  8503. (S1 ^operator O1922 +)
  8504. Retracting propose*predict-yes
  8505. -->
  8506. (O1921 ^name predict-yes +)
  8507. (S1 ^operator O1921 +)
  8508. Retracting elaborate*reward*based*on*reward
  8509. -->
  8510. (R964 ^value 1 +)
  8511. (R1 ^reward R964 +)
  8512. Retracting elaborate*copy-dir-to-output-link
  8513. -->
  8514. (I3 ^dir U +)
  8515. Retracting rl*prefer*rvt*predict-no*H0*2
  8516. -->
  8517. (S1 ^operator O1922 = 1.)
  8518. Retracting rl*prefer*rvt*predict-yes*H0*1
  8519. -->
  8520. (S1 ^operator O1921 = 0.)
  8521. =>WM: (13549: S1 ^operator O1924 +)
  8522. =>WM: (13548: S1 ^operator O1923 +)
  8523. =>WM: (13547: I3 ^dir R)
  8524. =>WM: (13546: O1924 ^name predict-no)
  8525. =>WM: (13545: O1923 ^name predict-yes)
  8526. =>WM: (13544: R965 ^value 1)
  8527. =>WM: (13543: R1 ^reward R965)
  8528. <=WM: (13534: S1 ^operator O1921 +)
  8529. <=WM: (13535: S1 ^operator O1922 +)
  8530. <=WM: (13536: S1 ^operator O1922)
  8531. <=WM: (13533: I3 ^dir U)
  8532. <=WM: (13529: R1 ^reward R964)
  8533. <=WM: (13532: O1922 ^name predict-no)
  8534. <=WM: (13531: O1921 ^name predict-yes)
  8535. <=WM: (13530: R964 ^value 1)
  8536. --- Inner Elaboration Phase, active level 1 (S1) ---
  8537. Firing prefer*rvt*predict-yes*H0
  8538. -->
  8539. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8540. -->
  8541. (S1 ^operator O1923 = 0.6623767743575877)
  8542. Firing rl*prefer*rvt*predict-yes*H0*3
  8543. -->
  8544. (S1 ^operator O1923 = 0.3377188564178903)
  8545. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8546. -->
  8547. Firing prefer*rvt*predict-no*H0
  8548. -->
  8549. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8550. -->
  8551. (S1 ^operator O1924 = -0.2817060109291377)
  8552. Firing rl*prefer*rvt*predict-no*H0*4
  8553. -->
  8554. (S1 ^operator O1924 = 0.3397713875215998)
  8555. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8556. -->
  8557. inner elaboration loop at bottom goal.
  8558. Retracting rl*prefer*rvt*predict-no*H0*4
  8559. -->
  8560. (S1 ^operator O1922 = 0.3397713875215998)
  8561. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8562. -->
  8563. (S1 ^operator O1922 = -0.2817060109291377)
  8564. Retracting rl*prefer*rvt*predict-yes*H0*3
  8565. -->
  8566. (S1 ^operator O1921 = 0.3377188564178903)
  8567. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8568. -->
  8569. (S1 ^operator O1921 = 0.6623767743575877)
  8570. --- END Proposal Phase ---
  8571. --- Decision Phase ---
  8572. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8573. =>WM: (13550: S1 ^operator O1923)
  8574. 962: O: O1923 (predict-yes)
  8575. --- END Decision Phase ---
  8576. --- Application Phase ---
  8577. --- Firing Productions (PE) For State At Depth 1 ---
  8578. --- Inner Elaboration Phase, active level 1 (S1) ---
  8579. Firing apply*operator
  8580. -->
  8581. (I3 ^predict-yes N962 + :O )
  8582. Firing apply*operator*complete
  8583. -->
  8584. (I3 ^predict-no N961 - :O )
  8585. inner elaboration loop at bottom goal.
  8586. --- Change Working Memory (PE) ---
  8587. =>WM: (13551: I3 ^predict-yes N962)
  8588. <=WM: (13538: N961 ^status complete)
  8589. <=WM: (13537: I3 ^predict-no N961)
  8590. --- Firing Productions (IE) For State At Depth 1 ---
  8591. --- Inner Elaboration Phase, active level 1 (S1) ---
  8592. Firing monitor*world
  8593. -->
  8594. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8595. --- Change Working Memory (IE) ---
  8596. --- END Application Phase ---
  8597. --- Output Phase ---
  8598. ENV: Agent did: predict-yes for direction R in state State-A
  8599. In State-A moving R
  8600. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8601. predict error 0
  8602. dir: dir isU
  8603. --- END Output Phase ---
  8604. --- Input Phase ---
  8605. =>WM: (13555: I2 ^dir U)
  8606. =>WM: (13554: I2 ^reward 1)
  8607. =>WM: (13553: I2 ^see 1)
  8608. =>WM: (13552: N962 ^status complete)
  8609. <=WM: (13541: I2 ^dir R)
  8610. <=WM: (13540: I2 ^reward 1)
  8611. <=WM: (13539: I2 ^see 0)
  8612. =>WM: (13556: I2 ^level-1 R1-root)
  8613. <=WM: (13542: I2 ^level-1 L0-root)
  8614. --- END Input Phase ---
  8615. --- Proposal Phase ---
  8616. --- Inner Elaboration Phase, active level 1 (S1) ---
  8617. Firing elaborate*copy-see-to-output-link
  8618. -->
  8619. (I3 ^see 1 +)
  8620. Firing elaborate*reward*based*on*reward
  8621. -->
  8622. (R966 ^value 1 +)
  8623. (R1 ^reward R966 +)
  8624. Firing propose*predict-yes
  8625. -->
  8626. (O1925 ^name predict-yes +)
  8627. (S1 ^operator O1925 +)
  8628. Firing propose*predict-no
  8629. -->
  8630. (O1926 ^name predict-no +)
  8631. (S1 ^operator O1926 +)
  8632. Firing rl*prefer*rvt*predict-no*H0*2
  8633. -->
  8634. (S1 ^operator O1924 = 1.)
  8635. Firing rl*prefer*rvt*predict-yes*H0*1
  8636. -->
  8637. (S1 ^operator O1923 = 0.)
  8638. Firing prefer*rvt*predict-yes*H0
  8639. -->
  8640. Firing prefer*rvt*predict-no*H0
  8641. -->
  8642. Firing elaborate*copy-dir-to-output-link
  8643. -->
  8644. (I3 ^dir U +)
  8645. inner elaboration loop at bottom goal.
  8646. Retracting elaborate*copy-see-to-output-link
  8647. -->
  8648. (I3 ^see 0 +)
  8649. Retracting propose*predict-no
  8650. -->
  8651. (O1924 ^name predict-no +)
  8652. (S1 ^operator O1924 +)
  8653. Retracting propose*predict-yes
  8654. -->
  8655. (O1923 ^name predict-yes +)
  8656. (S1 ^operator O1923 +)
  8657. Retracting elaborate*reward*based*on*reward
  8658. -->
  8659. (R965 ^value 1 +)
  8660. (R1 ^reward R965 +)
  8661. Retracting elaborate*copy-dir-to-output-link
  8662. -->
  8663. (I3 ^dir R +)
  8664. Retracting rl*prefer*rvt*predict-no*H0*4
  8665. -->
  8666. (S1 ^operator O1924 = 0.3397713875215998)
  8667. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8668. -->
  8669. (S1 ^operator O1924 = -0.2817060109291377)
  8670. Retracting rl*prefer*rvt*predict-yes*H0*3
  8671. -->
  8672. (S1 ^operator O1923 = 0.3377188564178903)
  8673. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8674. -->
  8675. (S1 ^operator O1923 = 0.6623767743575877)
  8676. =>WM: (13564: S1 ^operator O1926 +)
  8677. =>WM: (13563: S1 ^operator O1925 +)
  8678. =>WM: (13562: I3 ^dir U)
  8679. =>WM: (13561: O1926 ^name predict-no)
  8680. =>WM: (13560: O1925 ^name predict-yes)
  8681. =>WM: (13559: R966 ^value 1)
  8682. =>WM: (13558: R1 ^reward R966)
  8683. =>WM: (13557: I3 ^see 1)
  8684. <=WM: (13548: S1 ^operator O1923 +)
  8685. <=WM: (13550: S1 ^operator O1923)
  8686. <=WM: (13549: S1 ^operator O1924 +)
  8687. <=WM: (13547: I3 ^dir R)
  8688. <=WM: (13543: R1 ^reward R965)
  8689. <=WM: (13514: I3 ^see 0)
  8690. <=WM: (13546: O1924 ^name predict-no)
  8691. <=WM: (13545: O1923 ^name predict-yes)
  8692. <=WM: (13544: R965 ^value 1)
  8693. --- Inner Elaboration Phase, active level 1 (S1) ---
  8694. Firing prefer*rvt*predict-yes*H0
  8695. -->
  8696. Firing rl*prefer*rvt*predict-yes*H0*1
  8697. -->
  8698. (S1 ^operator O1925 = 0.)
  8699. Firing prefer*rvt*predict-no*H0
  8700. -->
  8701. Firing rl*prefer*rvt*predict-no*H0*2
  8702. -->
  8703. (S1 ^operator O1926 = 1.)
  8704. inner elaboration loop at bottom goal.
  8705. Retracting rl*prefer*rvt*predict-no*H0*2
  8706. -->
  8707. (S1 ^operator O1924 = 1.)
  8708. Retracting rl*prefer*rvt*predict-yes*H0*1
  8709. -->
  8710. (S1 ^operator O1923 = 0.)
  8711. --- END Proposal Phase ---
  8712. --- Decision Phase ---
  8713. RL update rl*prefer*rvt*predict-yes*H0*3 0.59012 -0.252401 0.337719 -> 0.590111 -0.2524 0.337711(R,m,v=1,0.895062,0.0945096)
  8714. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.40999 0.252387 0.662377 -> 0.409979 0.252388 0.662368(R,m,v=1,1,0)
  8715. =>WM: (13565: S1 ^operator O1926)
  8716. 963: O: O1926 (predict-no)
  8717. --- END Decision Phase ---
  8718. --- Application Phase ---
  8719. --- Firing Productions (PE) For State At Depth 1 ---
  8720. --- Inner Elaboration Phase, active level 1 (S1) ---
  8721. Firing apply*operator
  8722. -->
  8723. (I3 ^predict-no N963 + :O )
  8724. Firing apply*operator*complete
  8725. -->
  8726. (I3 ^predict-yes N962 - :O )
  8727. inner elaboration loop at bottom goal.
  8728. --- Change Working Memory (PE) ---
  8729. =>WM: (13566: I3 ^predict-no N963)
  8730. <=WM: (13552: N962 ^status complete)
  8731. <=WM: (13551: I3 ^predict-yes N962)
  8732. --- Firing Productions (IE) For State At Depth 1 ---
  8733. --- Inner Elaboration Phase, active level 1 (S1) ---
  8734. Firing monitor*world
  8735. -->
  8736. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8737. --- Change Working Memory (IE) ---
  8738. --- END Application Phase ---
  8739. --- Output Phase ---
  8740. ENV: Agent did: predict-no for direction U in state State-B
  8741. In State-B moving U
  8742. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8743. predict error 0
  8744. dir: dir isL
  8745. --- END Output Phase ---
  8746. --- Input Phase ---
  8747. =>WM: (13570: I2 ^dir L)
  8748. =>WM: (13569: I2 ^reward 1)
  8749. =>WM: (13568: I2 ^see 0)
  8750. =>WM: (13567: N963 ^status complete)
  8751. <=WM: (13555: I2 ^dir U)
  8752. <=WM: (13554: I2 ^reward 1)
  8753. <=WM: (13553: I2 ^see 1)
  8754. =>WM: (13571: I2 ^level-1 R1-root)
  8755. <=WM: (13556: I2 ^level-1 R1-root)
  8756. --- END Input Phase ---
  8757. --- Proposal Phase ---
  8758. --- Inner Elaboration Phase, active level 1 (S1) ---
  8759. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8760. -->
  8761. (S1 ^operator O1925 = 0.7363235474336447)
  8762. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8763. -->
  8764. Firing elaborate*copy-see-to-output-link
  8765. -->
  8766. (I3 ^see 0 +)
  8767. Firing elaborate*reward*based*on*reward
  8768. -->
  8769. (R967 ^value 1 +)
  8770. (R1 ^reward R967 +)
  8771. Firing propose*predict-yes
  8772. -->
  8773. (O1927 ^name predict-yes +)
  8774. (S1 ^operator O1927 +)
  8775. Firing propose*predict-no
  8776. -->
  8777. (O1928 ^name predict-no +)
  8778. (S1 ^operator O1928 +)
  8779. Firing rl*prefer*rvt*predict-no*H0*6
  8780. -->
  8781. (S1 ^operator O1926 = 0.9996975476948911)
  8782. Firing rl*prefer*rvt*predict-yes*H0*5
  8783. -->
  8784. (S1 ^operator O1925 = 0.2640770017585976)
  8785. Firing prefer*rvt*predict-yes*H0
  8786. -->
  8787. Firing prefer*rvt*predict-no*H0
  8788. -->
  8789. Firing elaborate*copy-dir-to-output-link
  8790. -->
  8791. (I3 ^dir L +)
  8792. inner elaboration loop at bottom goal.
  8793. Retracting elaborate*copy-see-to-output-link
  8794. -->
  8795. (I3 ^see 1 +)
  8796. Retracting propose*predict-no
  8797. -->
  8798. (O1926 ^name predict-no +)
  8799. (S1 ^operator O1926 +)
  8800. Retracting propose*predict-yes
  8801. -->
  8802. (O1925 ^name predict-yes +)
  8803. (S1 ^operator O1925 +)
  8804. Retracting elaborate*reward*based*on*reward
  8805. -->
  8806. (R966 ^value 1 +)
  8807. (R1 ^reward R966 +)
  8808. Retracting elaborate*copy-dir-to-output-link
  8809. -->
  8810. (I3 ^dir U +)
  8811. Retracting rl*prefer*rvt*predict-no*H0*2
  8812. -->
  8813. (S1 ^operator O1926 = 1.)
  8814. Retracting rl*prefer*rvt*predict-yes*H0*1
  8815. -->
  8816. (S1 ^operator O1925 = 0.)
  8817. =>WM: (13579: S1 ^operator O1928 +)
  8818. =>WM: (13578: S1 ^operator O1927 +)
  8819. =>WM: (13577: I3 ^dir L)
  8820. =>WM: (13576: O1928 ^name predict-no)
  8821. =>WM: (13575: O1927 ^name predict-yes)
  8822. =>WM: (13574: R967 ^value 1)
  8823. =>WM: (13573: R1 ^reward R967)
  8824. =>WM: (13572: I3 ^see 0)
  8825. <=WM: (13563: S1 ^operator O1925 +)
  8826. <=WM: (13564: S1 ^operator O1926 +)
  8827. <=WM: (13565: S1 ^operator O1926)
  8828. <=WM: (13562: I3 ^dir U)
  8829. <=WM: (13558: R1 ^reward R966)
  8830. <=WM: (13557: I3 ^see 1)
  8831. <=WM: (13561: O1926 ^name predict-no)
  8832. <=WM: (13560: O1925 ^name predict-yes)
  8833. <=WM: (13559: R966 ^value 1)
  8834. --- Inner Elaboration Phase, active level 1 (S1) ---
  8835. Firing prefer*rvt*predict-yes*H0
  8836. -->
  8837. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8838. -->
  8839. (S1 ^operator O1927 = 0.7363235474336447)
  8840. Firing rl*prefer*rvt*predict-yes*H0*5
  8841. -->
  8842. (S1 ^operator O1927 = 0.2640770017585976)
  8843. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8844. -->
  8845. Firing prefer*rvt*predict-no*H0
  8846. -->
  8847. Firing rl*prefer*rvt*predict-no*H0*6
  8848. -->
  8849. (S1 ^operator O1928 = 0.9996975476948911)
  8850. inner elaboration loop at bottom goal.
  8851. Retracting rl*prefer*rvt*predict-no*H0*6
  8852. -->
  8853. (S1 ^operator O1926 = 0.9996975476948911)
  8854. Retracting rl*prefer*rvt*predict-yes*H0*5
  8855. -->
  8856. (S1 ^operator O1925 = 0.2640770017585976)
  8857. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8858. -->
  8859. (S1 ^operator O1925 = 0.7363235474336447)
  8860. --- END Proposal Phase ---
  8861. --- Decision Phase ---
  8862. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8863. =>WM: (13580: S1 ^operator O1927)
  8864. 964: O: O1927 (predict-yes)
  8865. --- END Decision Phase ---
  8866. --- Application Phase ---
  8867. --- Firing Productions (PE) For State At Depth 1 ---
  8868. --- Inner Elaboration Phase, active level 1 (S1) ---
  8869. Firing apply*operator
  8870. -->
  8871. (I3 ^predict-yes N964 + :O )
  8872. Firing apply*operator*complete
  8873. -->
  8874. (I3 ^predict-no N963 - :O )
  8875. inner elaboration loop at bottom goal.
  8876. --- Change Working Memory (PE) ---
  8877. =>WM: (13581: I3 ^predict-yes N964)
  8878. <=WM: (13567: N963 ^status complete)
  8879. <=WM: (13566: I3 ^predict-no N963)
  8880. --- Firing Productions (IE) For State At Depth 1 ---
  8881. --- Inner Elaboration Phase, active level 1 (S1) ---
  8882. Firing monitor*world
  8883. -->
  8884. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8885. --- Change Working Memory (IE) ---
  8886. --- END Application Phase ---
  8887. --- Output Phase ---
  8888. ENV: Agent did: predict-yes for direction L in state State-B
  8889. In State-B moving L
  8890. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8891. predict error 0
  8892. dir: dir isU
  8893. --- END Output Phase ---
  8894. --- Input Phase ---
  8895. =>WM: (13585: I2 ^dir U)
  8896. =>WM: (13584: I2 ^reward 1)
  8897. =>WM: (13583: I2 ^see 1)
  8898. =>WM: (13582: N964 ^status complete)
  8899. <=WM: (13570: I2 ^dir L)
  8900. <=WM: (13569: I2 ^reward 1)
  8901. <=WM: (13568: I2 ^see 0)
  8902. =>WM: (13586: I2 ^level-1 L1-root)
  8903. <=WM: (13571: I2 ^level-1 R1-root)
  8904. --- END Input Phase ---
  8905. --- Proposal Phase ---
  8906. --- Inner Elaboration Phase, active level 1 (S1) ---
  8907. Firing elaborate*copy-see-to-output-link
  8908. -->
  8909. (I3 ^see 1 +)
  8910. Firing elaborate*reward*based*on*reward
  8911. -->
  8912. (R968 ^value 1 +)
  8913. (R1 ^reward R968 +)
  8914. Firing propose*predict-yes
  8915. -->
  8916. (O1929 ^name predict-yes +)
  8917. (S1 ^operator O1929 +)
  8918. Firing propose*predict-no
  8919. -->
  8920. (O1930 ^name predict-no +)
  8921. (S1 ^operator O1930 +)
  8922. Firing rl*prefer*rvt*predict-no*H0*2
  8923. -->
  8924. (S1 ^operator O1928 = 1.)
  8925. Firing rl*prefer*rvt*predict-yes*H0*1
  8926. -->
  8927. (S1 ^operator O1927 = 0.)
  8928. Firing prefer*rvt*predict-yes*H0
  8929. -->
  8930. Firing prefer*rvt*predict-no*H0
  8931. -->
  8932. Firing elaborate*copy-dir-to-output-link
  8933. -->
  8934. (I3 ^dir U +)
  8935. inner elaboration loop at bottom goal.
  8936. Retracting elaborate*copy-see-to-output-link
  8937. -->
  8938. (I3 ^see 0 +)
  8939. Retracting propose*predict-no
  8940. -->
  8941. (O1928 ^name predict-no +)
  8942. (S1 ^operator O1928 +)
  8943. Retracting propose*predict-yes
  8944. -->
  8945. (O1927 ^name predict-yes +)
  8946. (S1 ^operator O1927 +)
  8947. Retracting elaborate*reward*based*on*reward
  8948. -->
  8949. (R967 ^value 1 +)
  8950. (R1 ^reward R967 +)
  8951. Retracting elaborate*copy-dir-to-output-link
  8952. -->
  8953. (I3 ^dir L +)
  8954. Retracting rl*prefer*rvt*predict-no*H0*6
  8955. -->
  8956. (S1 ^operator O1928 = 0.9996975476948911)
  8957. Retracting rl*prefer*rvt*predict-yes*H0*5
  8958. -->
  8959. (S1 ^operator O1927 = 0.2640770017585976)
  8960. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8961. -->
  8962. (S1 ^operator O1927 = 0.7363235474336447)
  8963. =>WM: (13594: S1 ^operator O1930 +)
  8964. =>WM: (13593: S1 ^operator O1929 +)
  8965. =>WM: (13592: I3 ^dir U)
  8966. =>WM: (13591: O1930 ^name predict-no)
  8967. =>WM: (13590: O1929 ^name predict-yes)
  8968. =>WM: (13589: R968 ^value 1)
  8969. =>WM: (13588: R1 ^reward R968)
  8970. =>WM: (13587: I3 ^see 1)
  8971. <=WM: (13578: S1 ^operator O1927 +)
  8972. <=WM: (13580: S1 ^operator O1927)
  8973. <=WM: (13579: S1 ^operator O1928 +)
  8974. <=WM: (13577: I3 ^dir L)
  8975. <=WM: (13573: R1 ^reward R967)
  8976. <=WM: (13572: I3 ^see 0)
  8977. <=WM: (13576: O1928 ^name predict-no)
  8978. <=WM: (13575: O1927 ^name predict-yes)
  8979. <=WM: (13574: R967 ^value 1)
  8980. --- Inner Elaboration Phase, active level 1 (S1) ---
  8981. Firing prefer*rvt*predict-yes*H0
  8982. -->
  8983. Firing rl*prefer*rvt*predict-yes*H0*1
  8984. -->
  8985. (S1 ^operator O1929 = 0.)
  8986. Firing prefer*rvt*predict-no*H0
  8987. -->
  8988. Firing rl*prefer*rvt*predict-no*H0*2
  8989. -->
  8990. (S1 ^operator O1930 = 1.)
  8991. inner elaboration loop at bottom goal.
  8992. Retracting rl*prefer*rvt*predict-no*H0*2
  8993. -->
  8994. (S1 ^operator O1928 = 1.)
  8995. Retracting rl*prefer*rvt*predict-yes*H0*1
  8996. -->
  8997. (S1 ^operator O1927 = 0.)
  8998. --- END Proposal Phase ---
  8999. --- Decision Phase ---
  9000. RL update rl*prefer*rvt*predict-yes*H0*5 0.554462 -0.290385 0.264077 -> 0.55443 -0.290385 0.264044(R,m,v=1,0.873563,0.111089)
  9001. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445932 0.290392 0.736324 -> 0.445895 0.290391 0.736286(R,m,v=1,1,0)
  9002. =>WM: (13595: S1 ^operator O1930)
  9003. 965: O: O1930 (predict-no)
  9004. --- END Decision Phase ---
  9005. --- Application Phase ---
  9006. --- Firing Productions (PE) For State At Depth 1 ---
  9007. --- Inner Elaboration Phase, active level 1 (S1) ---
  9008. Firing apply*operator
  9009. -->
  9010. (I3 ^predict-no N965 + :O )
  9011. Firing apply*operator*complete
  9012. -->
  9013. (I3 ^predict-yes N964 - :O )
  9014. inner elaboration loop at bottom goal.
  9015. --- Change Working Memory (PE) ---
  9016. =>WM: (13596: I3 ^predict-no N965)
  9017. <=WM: (13582: N964 ^status complete)
  9018. <=WM: (13581: I3 ^predict-yes N964)
  9019. --- Firing Productions (IE) For State At Depth 1 ---
  9020. --- Inner Elaboration Phase, active level 1 (S1) ---
  9021. Firing monitor*world
  9022. -->
  9023. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9024. --- Change Working Memory (IE) ---
  9025. --- END Application Phase ---
  9026. --- Output Phase ---
  9027. ENV: Agent did: predict-no for direction U in state State-A
  9028. In State-A moving U
  9029. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9030. predict error 0
  9031. dir: dir isL
  9032. --- END Output Phase ---
  9033. --- Input Phase ---
  9034. =>WM: (13600: I2 ^dir L)
  9035. =>WM: (13599: I2 ^reward 1)
  9036. =>WM: (13598: I2 ^see 0)
  9037. =>WM: (13597: N965 ^status complete)
  9038. <=WM: (13585: I2 ^dir U)
  9039. <=WM: (13584: I2 ^reward 1)
  9040. <=WM: (13583: I2 ^see 1)
  9041. =>WM: (13601: I2 ^level-1 L1-root)
  9042. <=WM: (13586: I2 ^level-1 L1-root)
  9043. --- END Input Phase ---
  9044. --- Proposal Phase ---
  9045. --- Inner Elaboration Phase, active level 1 (S1) ---
  9046. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9047. -->
  9048. (S1 ^operator O1929 = -0.181727099742844)
  9049. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9050. -->
  9051. Firing elaborate*copy-see-to-output-link
  9052. -->
  9053. (I3 ^see 0 +)
  9054. Firing elaborate*reward*based*on*reward
  9055. -->
  9056. (R969 ^value 1 +)
  9057. (R1 ^reward R969 +)
  9058. Firing propose*predict-yes
  9059. -->
  9060. (O1931 ^name predict-yes +)
  9061. (S1 ^operator O1931 +)
  9062. Firing propose*predict-no
  9063. -->
  9064. (O1932 ^name predict-no +)
  9065. (S1 ^operator O1932 +)
  9066. Firing rl*prefer*rvt*predict-no*H0*6
  9067. -->
  9068. (S1 ^operator O1930 = 0.9996975476948911)
  9069. Firing rl*prefer*rvt*predict-yes*H0*5
  9070. -->
  9071. (S1 ^operator O1929 = 0.2640444846619989)
  9072. Firing prefer*rvt*predict-yes*H0
  9073. -->
  9074. Firing prefer*rvt*predict-no*H0
  9075. -->
  9076. Firing elaborate*copy-dir-to-output-link
  9077. -->
  9078. (I3 ^dir L +)
  9079. inner elaboration loop at bottom goal.
  9080. Retracting elaborate*copy-see-to-output-link
  9081. -->
  9082. (I3 ^see 1 +)
  9083. Retracting propose*predict-no
  9084. -->
  9085. (O1930 ^name predict-no +)
  9086. (S1 ^operator O1930 +)
  9087. Retracting propose*predict-yes
  9088. -->
  9089. (O1929 ^name predict-yes +)
  9090. (S1 ^operator O1929 +)
  9091. Retracting elaborate*reward*based*on*reward
  9092. -->
  9093. (R968 ^value 1 +)
  9094. (R1 ^reward R968 +)
  9095. Retracting elaborate*copy-dir-to-output-link
  9096. -->
  9097. (I3 ^dir U +)
  9098. Retracting rl*prefer*rvt*predict-no*H0*2
  9099. -->
  9100. (S1 ^operator O1930 = 1.)
  9101. Retracting rl*prefer*rvt*predict-yes*H0*1
  9102. -->
  9103. (S1 ^operator O1929 = 0.)
  9104. =>WM: (13609: S1 ^operator O1932 +)
  9105. =>WM: (13608: S1 ^operator O1931 +)
  9106. =>WM: (13607: I3 ^dir L)
  9107. =>WM: (13606: O1932 ^name predict-no)
  9108. =>WM: (13605: O1931 ^name predict-yes)
  9109. =>WM: (13604: R969 ^value 1)
  9110. =>WM: (13603: R1 ^reward R969)
  9111. =>WM: (13602: I3 ^see 0)
  9112. <=WM: (13593: S1 ^operator O1929 +)
  9113. <=WM: (13594: S1 ^operator O1930 +)
  9114. <=WM: (13595: S1 ^operator O1930)
  9115. <=WM: (13592: I3 ^dir U)
  9116. <=WM: (13588: R1 ^reward R968)
  9117. <=WM: (13587: I3 ^see 1)
  9118. <=WM: (13591: O1930 ^name predict-no)
  9119. <=WM: (13590: O1929 ^name predict-yes)
  9120. <=WM: (13589: R968 ^value 1)
  9121. --- Inner Elaboration Phase, active level 1 (S1) ---
  9122. Firing prefer*rvt*predict-yes*H0
  9123. -->
  9124. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9125. -->
  9126. (S1 ^operator O1931 = -0.181727099742844)
  9127. Firing rl*prefer*rvt*predict-yes*H0*5
  9128. -->
  9129. (S1 ^operator O1931 = 0.2640444846619989)
  9130. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9131. -->
  9132. Firing prefer*rvt*predict-no*H0
  9133. -->
  9134. Firing rl*prefer*rvt*predict-no*H0*6
  9135. -->
  9136. (S1 ^operator O1932 = 0.9996975476948911)
  9137. inner elaboration loop at bottom goal.
  9138. Retracting rl*prefer*rvt*predict-no*H0*6
  9139. -->
  9140. (S1 ^operator O1930 = 0.9996975476948911)
  9141. Retracting rl*prefer*rvt*predict-yes*H0*5
  9142. -->
  9143. (S1 ^operator O1929 = 0.2640444846619989)
  9144. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9145. -->
  9146. (S1 ^operator O1929 = -0.181727099742844)
  9147. --- END Proposal Phase ---
  9148. --- Decision Phase ---
  9149. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9150. =>WM: (13610: S1 ^operator O1932)
  9151. 966: O: O1932 (predict-no)
  9152. --- END Decision Phase ---
  9153. --- Application Phase ---
  9154. --- Firing Productions (PE) For State At Depth 1 ---
  9155. --- Inner Elaboration Phase, active level 1 (S1) ---
  9156. Firing apply*operator
  9157. -->
  9158. (I3 ^predict-no N966 + :O )
  9159. Firing apply*operator*complete
  9160. -->
  9161. (I3 ^predict-no N965 - :O )
  9162. inner elaboration loop at bottom goal.
  9163. --- Change Working Memory (PE) ---
  9164. =>WM: (13611: I3 ^predict-no N966)
  9165. <=WM: (13597: N965 ^status complete)
  9166. <=WM: (13596: I3 ^predict-no N965)
  9167. --- Firing Productions (IE) For State At Depth 1 ---
  9168. --- Inner Elaboration Phase, active level 1 (S1) ---
  9169. Firing monitor*world
  9170. -->
  9171. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9172. --- Change Working Memory (IE) ---
  9173. --- END Application Phase ---
  9174. --- Output Phase ---
  9175. ENV: Agent did: predict-no for direction L in state State-A
  9176. In State-A moving L
  9177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9178. predict error 0
  9179. dir: dir isR
  9180. --- END Output Phase ---
  9181. --- Input Phase ---
  9182. =>WM: (13615: I2 ^dir R)
  9183. =>WM: (13614: I2 ^reward 1)
  9184. =>WM: (13613: I2 ^see 0)
  9185. =>WM: (13612: N966 ^status complete)
  9186. <=WM: (13600: I2 ^dir L)
  9187. <=WM: (13599: I2 ^reward 1)
  9188. <=WM: (13598: I2 ^see 0)
  9189. =>WM: (13616: I2 ^level-1 L0-root)
  9190. <=WM: (13601: I2 ^level-1 L1-root)
  9191. --- END Input Phase ---
  9192. --- Proposal Phase ---
  9193. --- Inner Elaboration Phase, active level 1 (S1) ---
  9194. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9195. -->
  9196. (S1 ^operator O1932 = -0.2817060109291377)
  9197. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9198. -->
  9199. (S1 ^operator O1931 = 0.6623675607605151)
  9200. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9201. -->
  9202. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9203. -->
  9204. Firing elaborate*copy-see-to-output-link
  9205. -->
  9206. (I3 ^see 0 +)
  9207. Firing elaborate*reward*based*on*reward
  9208. -->
  9209. (R970 ^value 1 +)
  9210. (R1 ^reward R970 +)
  9211. Firing propose*predict-yes
  9212. -->
  9213. (O1933 ^name predict-yes +)
  9214. (S1 ^operator O1933 +)
  9215. Firing propose*predict-no
  9216. -->
  9217. (O1934 ^name predict-no +)
  9218. (S1 ^operator O1934 +)
  9219. Firing rl*prefer*rvt*predict-no*H0*4
  9220. -->
  9221. (S1 ^operator O1932 = 0.3397713875215998)
  9222. Firing rl*prefer*rvt*predict-yes*H0*3
  9223. -->
  9224. (S1 ^operator O1931 = 0.3377110018583719)
  9225. Firing prefer*rvt*predict-yes*H0
  9226. -->
  9227. Firing prefer*rvt*predict-no*H0
  9228. -->
  9229. Firing elaborate*copy-dir-to-output-link
  9230. -->
  9231. (I3 ^dir R +)
  9232. inner elaboration loop at bottom goal.
  9233. Retracting elaborate*copy-see-to-output-link
  9234. -->
  9235. (I3 ^see 0 +)
  9236. Retracting propose*predict-no
  9237. -->
  9238. (O1932 ^name predict-no +)
  9239. (S1 ^operator O1932 +)
  9240. Retracting propose*predict-yes
  9241. -->
  9242. (O1931 ^name predict-yes +)
  9243. (S1 ^operator O1931 +)
  9244. Retracting elaborate*reward*based*on*reward
  9245. -->
  9246. (R969 ^value 1 +)
  9247. (R1 ^reward R969 +)
  9248. Retracting elaborate*copy-dir-to-output-link
  9249. -->
  9250. (I3 ^dir L +)
  9251. Retracting rl*prefer*rvt*predict-no*H0*6
  9252. -->
  9253. (S1 ^operator O1932 = 0.9996975476948911)
  9254. Retracting rl*prefer*rvt*predict-yes*H0*5
  9255. -->
  9256. (S1 ^operator O1931 = 0.2640444846619989)
  9257. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9258. -->
  9259. (S1 ^operator O1931 = -0.181727099742844)
  9260. =>WM: (13623: S1 ^operator O1934 +)
  9261. =>WM: (13622: S1 ^operator O1933 +)
  9262. =>WM: (13621: I3 ^dir R)
  9263. =>WM: (13620: O1934 ^name predict-no)
  9264. =>WM: (13619: O1933 ^name predict-yes)
  9265. =>WM: (13618: R970 ^value 1)
  9266. =>WM: (13617: R1 ^reward R970)
  9267. <=WM: (13608: S1 ^operator O1931 +)
  9268. <=WM: (13609: S1 ^operator O1932 +)
  9269. <=WM: (13610: S1 ^operator O1932)
  9270. <=WM: (13607: I3 ^dir L)
  9271. <=WM: (13603: R1 ^reward R969)
  9272. <=WM: (13606: O1932 ^name predict-no)
  9273. <=WM: (13605: O1931 ^name predict-yes)
  9274. <=WM: (13604: R969 ^value 1)
  9275. --- Inner Elaboration Phase, active level 1 (S1) ---
  9276. Firing prefer*rvt*predict-yes*H0
  9277. -->
  9278. Firing rl*prefer*rvt*predict-yes*H0*3
  9279. -->
  9280. (S1 ^operator O1933 = 0.3377110018583719)
  9281. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9282. -->
  9283. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9284. -->
  9285. (S1 ^operator O1933 = 0.6623675607605151)
  9286. Firing prefer*rvt*predict-no*H0
  9287. -->
  9288. Firing rl*prefer*rvt*predict-no*H0*4
  9289. -->
  9290. (S1 ^operator O1934 = 0.3397713875215998)
  9291. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9292. -->
  9293. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9294. -->
  9295. (S1 ^operator O1934 = -0.2817060109291377)
  9296. inner elaboration loop at bottom goal.
  9297. Retracting rl*prefer*rvt*predict-no*H0*4
  9298. -->
  9299. (S1 ^operator O1932 = 0.3397713875215998)
  9300. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9301. -->
  9302. (S1 ^operator O1932 = -0.2817060109291377)
  9303. Retracting rl*prefer*rvt*predict-yes*H0*3
  9304. -->
  9305. (S1 ^operator O1931 = 0.3377110018583719)
  9306. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9307. -->
  9308. (S1 ^operator O1931 = 0.6623675607605151)
  9309. --- END Proposal Phase ---
  9310. --- Decision Phase ---
  9311. RL update rl*prefer*rvt*predict-no*H0*6 0.999698 0 0.999698 -> 0.999748 0 0.999748(R,m,v=1,0.90411,0.0872933)
  9312. =>WM: (13624: S1 ^operator O1933)
  9313. 967: O: O1933 (predict-yes)
  9314. --- END Decision Phase ---
  9315. --- Application Phase ---
  9316. --- Firing Productions (PE) For State At Depth 1 ---
  9317. --- Inner Elaboration Phase, active level 1 (S1) ---
  9318. Firing apply*operator
  9319. -->
  9320. (I3 ^predict-yes N967 + :O )
  9321. Firing apply*operator*complete
  9322. -->
  9323. (I3 ^predict-no N966 - :O )
  9324. inner elaboration loop at bottom goal.
  9325. --- Change Working Memory (PE) ---
  9326. =>WM: (13625: I3 ^predict-yes N967)
  9327. <=WM: (13612: N966 ^status complete)
  9328. <=WM: (13611: I3 ^predict-no N966)
  9329. --- Firing Productions (IE) For State At Depth 1 ---
  9330. --- Inner Elaboration Phase, active level 1 (S1) ---
  9331. Firing monitor*world
  9332. -->
  9333. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9334. --- Change Working Memory (IE) ---
  9335. --- END Application Phase ---
  9336. --- Output Phase ---
  9337. ENV: Agent did: predict-yes for direction R in state State-A
  9338. In State-A moving R
  9339. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9340. predict error 0
  9341. dir: dir isR
  9342. --- END Output Phase ---
  9343. --- Input Phase ---
  9344. =>WM: (13629: I2 ^dir R)
  9345. =>WM: (13628: I2 ^reward 1)
  9346. =>WM: (13627: I2 ^see 1)
  9347. =>WM: (13626: N967 ^status complete)
  9348. <=WM: (13615: I2 ^dir R)
  9349. <=WM: (13614: I2 ^reward 1)
  9350. <=WM: (13613: I2 ^see 0)
  9351. =>WM: (13630: I2 ^level-1 R1-root)
  9352. <=WM: (13616: I2 ^level-1 L0-root)
  9353. --- END Input Phase ---
  9354. --- Proposal Phase ---
  9355. --- Inner Elaboration Phase, active level 1 (S1) ---
  9356. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9357. -->
  9358. (S1 ^operator O1933 = -0.1070236389116304)
  9359. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9360. -->
  9361. (S1 ^operator O1934 = 0.6602488383529777)
  9362. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9363. -->
  9364. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9365. -->
  9366. Firing elaborate*copy-see-to-output-link
  9367. -->
  9368. (I3 ^see 1 +)
  9369. Firing elaborate*reward*based*on*reward
  9370. -->
  9371. (R971 ^value 1 +)
  9372. (R1 ^reward R971 +)
  9373. Firing propose*predict-yes
  9374. -->
  9375. (O1935 ^name predict-yes +)
  9376. (S1 ^operator O1935 +)
  9377. Firing propose*predict-no
  9378. -->
  9379. (O1936 ^name predict-no +)
  9380. (S1 ^operator O1936 +)
  9381. Firing rl*prefer*rvt*predict-no*H0*4
  9382. -->
  9383. (S1 ^operator O1934 = 0.3397713875215998)
  9384. Firing rl*prefer*rvt*predict-yes*H0*3
  9385. -->
  9386. (S1 ^operator O1933 = 0.3377110018583719)
  9387. Firing prefer*rvt*predict-yes*H0
  9388. -->
  9389. Firing prefer*rvt*predict-no*H0
  9390. -->
  9391. Firing elaborate*copy-dir-to-output-link
  9392. -->
  9393. (I3 ^dir R +)
  9394. inner elaboration loop at bottom goal.
  9395. Retracting elaborate*copy-see-to-output-link
  9396. -->
  9397. (I3 ^see 0 +)
  9398. Retracting propose*predict-no
  9399. -->
  9400. (O1934 ^name predict-no +)
  9401. (S1 ^operator O1934 +)
  9402. Retracting propose*predict-yes
  9403. -->
  9404. (O1933 ^name predict-yes +)
  9405. (S1 ^operator O1933 +)
  9406. Retracting elaborate*reward*based*on*reward
  9407. -->
  9408. (R970 ^value 1 +)
  9409. (R1 ^reward R970 +)
  9410. Retracting elaborate*copy-dir-to-output-link
  9411. -->
  9412. (I3 ^dir R +)
  9413. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9414. -->
  9415. (S1 ^operator O1934 = -0.2817060109291377)
  9416. Retracting rl*prefer*rvt*predict-no*H0*4
  9417. -->
  9418. (S1 ^operator O1934 = 0.3397713875215998)
  9419. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9420. -->
  9421. (S1 ^operator O1933 = 0.6623675607605151)
  9422. Retracting rl*prefer*rvt*predict-yes*H0*3
  9423. -->
  9424. (S1 ^operator O1933 = 0.3377110018583719)
  9425. =>WM: (13637: S1 ^operator O1936 +)
  9426. =>WM: (13636: S1 ^operator O1935 +)
  9427. =>WM: (13635: O1936 ^name predict-no)
  9428. =>WM: (13634: O1935 ^name predict-yes)
  9429. =>WM: (13633: R971 ^value 1)
  9430. =>WM: (13632: R1 ^reward R971)
  9431. =>WM: (13631: I3 ^see 1)
  9432. <=WM: (13622: S1 ^operator O1933 +)
  9433. <=WM: (13624: S1 ^operator O1933)
  9434. <=WM: (13623: S1 ^operator O1934 +)
  9435. <=WM: (13617: R1 ^reward R970)
  9436. <=WM: (13602: I3 ^see 0)
  9437. <=WM: (13620: O1934 ^name predict-no)
  9438. <=WM: (13619: O1933 ^name predict-yes)
  9439. <=WM: (13618: R970 ^value 1)
  9440. --- Inner Elaboration Phase, active level 1 (S1) ---
  9441. Firing prefer*rvt*predict-yes*H0
  9442. -->
  9443. Firing rl*prefer*rvt*predict-yes*H0*3
  9444. -->
  9445. (S1 ^operator O1935 = 0.3377110018583719)
  9446. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9447. -->
  9448. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9449. -->
  9450. (S1 ^operator O1935 = -0.1070236389116304)
  9451. Firing prefer*rvt*predict-no*H0
  9452. -->
  9453. Firing rl*prefer*rvt*predict-no*H0*4
  9454. -->
  9455. (S1 ^operator O1936 = 0.3397713875215998)
  9456. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9457. -->
  9458. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9459. -->
  9460. (S1 ^operator O1936 = 0.6602488383529777)
  9461. inner elaboration loop at bottom goal.
  9462. Retracting rl*prefer*rvt*predict-no*H0*4
  9463. -->
  9464. (S1 ^operator O1934 = 0.3397713875215998)
  9465. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9466. -->
  9467. (S1 ^operator O1934 = 0.6602488383529777)
  9468. Retracting rl*prefer*rvt*predict-yes*H0*3
  9469. -->
  9470. (S1 ^operator O1933 = 0.3377110018583719)
  9471. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9472. -->
  9473. (S1 ^operator O1933 = -0.1070236389116304)
  9474. --- END Proposal Phase ---
  9475. --- Decision Phase ---
  9476. RL update rl*prefer*rvt*predict-yes*H0*3 0.590111 -0.2524 0.337711 -> 0.590104 -0.252399 0.337705(R,m,v=1,0.895706,0.0939938)
  9477. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409979 0.252388 0.662368 -> 0.409971 0.252389 0.66236(R,m,v=1,1,0)
  9478. =>WM: (13638: S1 ^operator O1936)
  9479. 968: O: O1936 (predict-no)
  9480. --- END Decision Phase ---
  9481. --- Application Phase ---
  9482. --- Firing Productions (PE) For State At Depth 1 ---
  9483. --- Inner Elaboration Phase, active level 1 (S1) ---
  9484. Firing apply*operator
  9485. -->
  9486. (I3 ^predict-no N968 + :O )
  9487. Firing apply*operator*complete
  9488. -->
  9489. (I3 ^predict-yes N967 - :O )
  9490. inner elaboration loop at bottom goal.
  9491. --- Change Working Memory (PE) ---
  9492. =>WM: (13639: I3 ^predict-no N968)
  9493. <=WM: (13626: N967 ^status complete)
  9494. <=WM: (13625: I3 ^predict-yes N967)
  9495. --- Firing Productions (IE) For State At Depth 1 ---
  9496. --- Inner Elaboration Phase, active level 1 (S1) ---
  9497. Firing monitor*world
  9498. -->
  9499. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9500. --- Change Working Memory (IE) ---
  9501. --- END Application Phase ---
  9502. --- Output Phase ---
  9503. ENV: Agent did: predict-no for direction R in state State-B
  9504. In State-B moving R
  9505. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9506. predict error 0
  9507. dir: dir isU
  9508. --- END Output Phase ---
  9509. /|--- Input Phase ---
  9510. =>WM: (13643: I2 ^dir U)
  9511. =>WM: (13642: I2 ^reward 1)
  9512. =>WM: (13641: I2 ^see 0)
  9513. =>WM: (13640: N968 ^status complete)
  9514. <=WM: (13629: I2 ^dir R)
  9515. <=WM: (13628: I2 ^reward 1)
  9516. <=WM: (13627: I2 ^see 1)
  9517. =>WM: (13644: I2 ^level-1 R0-root)
  9518. <=WM: (13630: I2 ^level-1 R1-root)
  9519. --- END Input Phase ---
  9520. --- Proposal Phase ---
  9521. --- Inner Elaboration Phase, active level 1 (S1) ---
  9522. Firing elaborate*copy-see-to-output-link
  9523. -->
  9524. (I3 ^see 0 +)
  9525. Firing elaborate*reward*based*on*reward
  9526. -->
  9527. (R972 ^value 1 +)
  9528. (R1 ^reward R972 +)
  9529. Firing propose*predict-yes
  9530. -->
  9531. (O1937 ^name predict-yes +)
  9532. (S1 ^operator O1937 +)
  9533. Firing propose*predict-no
  9534. -->
  9535. (O1938 ^name predict-no +)
  9536. (S1 ^operator O1938 +)
  9537. Firing rl*prefer*rvt*predict-no*H0*2
  9538. -->
  9539. (S1 ^operator O1936 = 1.)
  9540. Firing rl*prefer*rvt*predict-yes*H0*1
  9541. -->
  9542. (S1 ^operator O1935 = 0.)
  9543. Firing prefer*rvt*predict-yes*H0
  9544. -->
  9545. Firing prefer*rvt*predict-no*H0
  9546. -->
  9547. Firing elaborate*copy-dir-to-output-link
  9548. -->
  9549. (I3 ^dir U +)
  9550. inner elaboration loop at bottom goal.
  9551. Retracting elaborate*copy-see-to-output-link
  9552. -->
  9553. (I3 ^see 1 +)
  9554. Retracting propose*predict-no
  9555. -->
  9556. (O1936 ^name predict-no +)
  9557. (S1 ^operator O1936 +)
  9558. Retracting propose*predict-yes
  9559. -->
  9560. (O1935 ^name predict-yes +)
  9561. (S1 ^operator O1935 +)
  9562. Retracting elaborate*reward*based*on*reward
  9563. -->
  9564. (R971 ^value 1 +)
  9565. (R1 ^reward R971 +)
  9566. Retracting elaborate*copy-dir-to-output-link
  9567. -->
  9568. (I3 ^dir R +)
  9569. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9570. -->
  9571. (S1 ^operator O1936 = 0.6602488383529777)
  9572. Retracting rl*prefer*rvt*predict-no*H0*4
  9573. -->
  9574. (S1 ^operator O1936 = 0.3397713875215998)
  9575. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9576. -->
  9577. (S1 ^operator O1935 = -0.1070236389116304)
  9578. Retracting rl*prefer*rvt*predict-yes*H0*3
  9579. -->
  9580. (S1 ^operator O1935 = 0.3377045556949833)
  9581. =>WM: (13652: S1 ^operator O1938 +)
  9582. =>WM: (13651: S1 ^operator O1937 +)
  9583. =>WM: (13650: I3 ^dir U)
  9584. =>WM: (13649: O1938 ^name predict-no)
  9585. =>WM: (13648: O1937 ^name predict-yes)
  9586. =>WM: (13647: R972 ^value 1)
  9587. =>WM: (13646: R1 ^reward R972)
  9588. =>WM: (13645: I3 ^see 0)
  9589. <=WM: (13636: S1 ^operator O1935 +)
  9590. <=WM: (13637: S1 ^operator O1936 +)
  9591. <=WM: (13638: S1 ^operator O1936)
  9592. <=WM: (13621: I3 ^dir R)
  9593. <=WM: (13632: R1 ^reward R971)
  9594. <=WM: (13631: I3 ^see 1)
  9595. <=WM: (13635: O1936 ^name predict-no)
  9596. <=WM: (13634: O1935 ^name predict-yes)
  9597. <=WM: (13633: R971 ^value 1)
  9598. --- Inner Elaboration Phase, active level 1 (S1) ---
  9599. Firing prefer*rvt*predict-yes*H0
  9600. -->
  9601. Firing rl*prefer*rvt*predict-yes*H0*1
  9602. -->
  9603. (S1 ^operator O1937 = 0.)
  9604. Firing prefer*rvt*predict-no*H0
  9605. -->
  9606. Firing rl*prefer*rvt*predict-no*H0*2
  9607. -->
  9608. (S1 ^operator O1938 = 1.)
  9609. inner elaboration loop at bottom goal.
  9610. Retracting rl*prefer*rvt*predict-no*H0*2
  9611. -->
  9612. (S1 ^operator O1936 = 1.)
  9613. Retracting rl*prefer*rvt*predict-yes*H0*1
  9614. -->
  9615. (S1 ^operator O1935 = 0.)
  9616. --- END Proposal Phase ---
  9617. --- Decision Phase ---
  9618. RL update rl*prefer*rvt*predict-no*H0*4 0.570255 -0.230484 0.339771 -> 0.570253 -0.230483 0.33977(R,m,v=1,0.872727,0.111752)
  9619. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429766 0.230483 0.660249 -> 0.429764 0.230483 0.660247(R,m,v=1,1,0)
  9620. =>WM: (13653: S1 ^operator O1938)
  9621. 969: O: O1938 (predict-no)
  9622. --- END Decision Phase ---
  9623. --- Application Phase ---
  9624. --- Firing Productions (PE) For State At Depth 1 ---
  9625. --- Inner Elaboration Phase, active level 1 (S1) ---
  9626. Firing apply*operator
  9627. -->
  9628. (I3 ^predict-no N969 + :O )
  9629. Firing apply*operator*complete
  9630. -->
  9631. (I3 ^predict-no N968 - :O )
  9632. inner elaboration loop at bottom goal.
  9633. --- Change Working Memory (PE) ---
  9634. =>WM: (13654: I3 ^predict-no N969)
  9635. <=WM: (13640: N968 ^status complete)
  9636. <=WM: (13639: I3 ^predict-no N968)
  9637. --- Firing Productions (IE) For State At Depth 1 ---
  9638. --- Inner Elaboration Phase, active level 1 (S1) ---
  9639. Firing monitor*world
  9640. -->
  9641. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9642. --- Change Working Memory (IE) ---
  9643. --- END Application Phase ---
  9644. --- Output Phase ---
  9645. ENV: Agent did: predict-no for direction U in state State-B
  9646. In State-B moving U
  9647. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9648. predict error 0
  9649. dir: dir isU
  9650. --- END Output Phase ---
  9651. \-/--- Input Phase ---
  9652. =>WM: (13658: I2 ^dir U)
  9653. =>WM: (13657: I2 ^reward 1)
  9654. =>WM: (13656: I2 ^see 0)
  9655. =>WM: (13655: N969 ^status complete)
  9656. <=WM: (13643: I2 ^dir U)
  9657. <=WM: (13642: I2 ^reward 1)
  9658. <=WM: (13641: I2 ^see 0)
  9659. =>WM: (13659: I2 ^level-1 R0-root)
  9660. <=WM: (13644: I2 ^level-1 R0-root)
  9661. --- END Input Phase ---
  9662. --- Proposal Phase ---
  9663. --- Inner Elaboration Phase, active level 1 (S1) ---
  9664. Firing elaborate*copy-see-to-output-link
  9665. -->
  9666. (I3 ^see 0 +)
  9667. Firing elaborate*reward*based*on*reward
  9668. -->
  9669. (R973 ^value 1 +)
  9670. (R1 ^reward R973 +)
  9671. Firing propose*predict-yes
  9672. -->
  9673. (O1939 ^name predict-yes +)
  9674. (S1 ^operator O1939 +)
  9675. Firing propose*predict-no
  9676. -->
  9677. (O1940 ^name predict-no +)
  9678. (S1 ^operator O1940 +)
  9679. Firing rl*prefer*rvt*predict-no*H0*2
  9680. -->
  9681. (S1 ^operator O1938 = 1.)
  9682. Firing rl*prefer*rvt*predict-yes*H0*1
  9683. -->
  9684. (S1 ^operator O1937 = 0.)
  9685. Firing prefer*rvt*predict-yes*H0
  9686. -->
  9687. Firing prefer*rvt*predict-no*H0
  9688. -->
  9689. Firing elaborate*copy-dir-to-output-link
  9690. -->
  9691. (I3 ^dir U +)
  9692. inner elaboration loop at bottom goal.
  9693. Retracting elaborate*copy-see-to-output-link
  9694. -->
  9695. (I3 ^see 0 +)
  9696. Retracting propose*predict-no
  9697. -->
  9698. (O1938 ^name predict-no +)
  9699. (S1 ^operator O1938 +)
  9700. Retracting propose*predict-yes
  9701. -->
  9702. (O1937 ^name predict-yes +)
  9703. (S1 ^operator O1937 +)
  9704. Retracting elaborate*reward*based*on*reward
  9705. -->
  9706. (R972 ^value 1 +)
  9707. (R1 ^reward R972 +)
  9708. Retracting elaborate*copy-dir-to-output-link
  9709. -->
  9710. (I3 ^dir U +)
  9711. Retracting rl*prefer*rvt*predict-no*H0*2
  9712. -->
  9713. (S1 ^operator O1938 = 1.)
  9714. Retracting rl*prefer*rvt*predict-yes*H0*1
  9715. -->
  9716. (S1 ^operator O1937 = 0.)
  9717. =>WM: (13665: S1 ^operator O1940 +)
  9718. =>WM: (13664: S1 ^operator O1939 +)
  9719. =>WM: (13663: O1940 ^name predict-no)
  9720. =>WM: (13662: O1939 ^name predict-yes)
  9721. =>WM: (13661: R973 ^value 1)
  9722. =>WM: (13660: R1 ^reward R973)
  9723. <=WM: (13651: S1 ^operator O1937 +)
  9724. <=WM: (13652: S1 ^operator O1938 +)
  9725. <=WM: (13653: S1 ^operator O1938)
  9726. <=WM: (13646: R1 ^reward R972)
  9727. <=WM: (13649: O1938 ^name predict-no)
  9728. <=WM: (13648: O1937 ^name predict-yes)
  9729. <=WM: (13647: R972 ^value 1)
  9730. --- Inner Elaboration Phase, active level 1 (S1) ---
  9731. Firing prefer*rvt*predict-yes*H0
  9732. -->
  9733. Firing rl*prefer*rvt*predict-yes*H0*1
  9734. -->
  9735. (S1 ^operator O1939 = 0.)
  9736. Firing prefer*rvt*predict-no*H0
  9737. -->
  9738. Firing rl*prefer*rvt*predict-no*H0*2
  9739. -->
  9740. (S1 ^operator O1940 = 1.)
  9741. inner elaboration loop at bottom goal.
  9742. Retracting rl*prefer*rvt*predict-no*H0*2
  9743. -->
  9744. (S1 ^operator O1938 = 1.)
  9745. Retracting rl*prefer*rvt*predict-yes*H0*1
  9746. -->
  9747. (S1 ^operator O1937 = 0.)
  9748. --- END Proposal Phase ---
  9749. --- Decision Phase ---
  9750. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9751. =>WM: (13666: S1 ^operator O1940)
  9752. 970: O: O1940 (predict-no)
  9753. --- END Decision Phase ---
  9754. --- Application Phase ---
  9755. --- Firing Productions (PE) For State At Depth 1 ---
  9756. --- Inner Elaboration Phase, active level 1 (S1) ---
  9757. Firing apply*operator
  9758. -->
  9759. (I3 ^predict-no N970 + :O )
  9760. Firing apply*operator*complete
  9761. -->
  9762. (I3 ^predict-no N969 - :O )
  9763. inner elaboration loop at bottom goal.
  9764. --- Change Working Memory (PE) ---
  9765. =>WM: (13667: I3 ^predict-no N970)
  9766. <=WM: (13655: N969 ^status complete)
  9767. <=WM: (13654: I3 ^predict-no N969)
  9768. --- Firing Productions (IE) For State At Depth 1 ---
  9769. --- Inner Elaboration Phase, active level 1 (S1) ---
  9770. Firing monitor*world
  9771. -->
  9772. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9773. --- Change Working Memory (IE) ---
  9774. --- END Application Phase ---
  9775. --- Output Phase ---
  9776. ENV: Agent did: predict-no for direction U in state State-B
  9777. In State-B moving U
  9778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9779. predict error 0
  9780. dir: dir isL
  9781. --- END Output Phase ---
  9782. |\--- Input Phase ---
  9783. =>WM: (13671: I2 ^dir L)
  9784. =>WM: (13670: I2 ^reward 1)
  9785. =>WM: (13669: I2 ^see 0)
  9786. =>WM: (13668: N970 ^status complete)
  9787. <=WM: (13658: I2 ^dir U)
  9788. <=WM: (13657: I2 ^reward 1)
  9789. <=WM: (13656: I2 ^see 0)
  9790. =>WM: (13672: I2 ^level-1 R0-root)
  9791. <=WM: (13659: I2 ^level-1 R0-root)
  9792. --- END Input Phase ---
  9793. --- Proposal Phase ---
  9794. --- Inner Elaboration Phase, active level 1 (S1) ---
  9795. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9796. -->
  9797. (S1 ^operator O1939 = 0.735815301499146)
  9798. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9799. -->
  9800. Firing elaborate*copy-see-to-output-link
  9801. -->
  9802. (I3 ^see 0 +)
  9803. Firing elaborate*reward*based*on*reward
  9804. -->
  9805. (R974 ^value 1 +)
  9806. (R1 ^reward R974 +)
  9807. Firing propose*predict-yes
  9808. -->
  9809. (O1941 ^name predict-yes +)
  9810. (S1 ^operator O1941 +)
  9811. Firing propose*predict-no
  9812. -->
  9813. (O1942 ^name predict-no +)
  9814. (S1 ^operator O1942 +)
  9815. Firing rl*prefer*rvt*predict-no*H0*6
  9816. -->
  9817. (S1 ^operator O1940 = 0.9997480945179411)
  9818. Firing rl*prefer*rvt*predict-yes*H0*5
  9819. -->
  9820. (S1 ^operator O1939 = 0.2640444846619989)
  9821. Firing prefer*rvt*predict-yes*H0
  9822. -->
  9823. Firing prefer*rvt*predict-no*H0
  9824. -->
  9825. Firing elaborate*copy-dir-to-output-link
  9826. -->
  9827. (I3 ^dir L +)
  9828. inner elaboration loop at bottom goal.
  9829. Retracting elaborate*copy-see-to-output-link
  9830. -->
  9831. (I3 ^see 0 +)
  9832. Retracting propose*predict-no
  9833. -->
  9834. (O1940 ^name predict-no +)
  9835. (S1 ^operator O1940 +)
  9836. Retracting propose*predict-yes
  9837. -->
  9838. (O1939 ^name predict-yes +)
  9839. (S1 ^operator O1939 +)
  9840. Retracting elaborate*reward*based*on*reward
  9841. -->
  9842. (R973 ^value 1 +)
  9843. (R1 ^reward R973 +)
  9844. Retracting elaborate*copy-dir-to-output-link
  9845. -->
  9846. (I3 ^dir U +)
  9847. Retracting rl*prefer*rvt*predict-no*H0*2
  9848. -->
  9849. (S1 ^operator O1940 = 1.)
  9850. Retracting rl*prefer*rvt*predict-yes*H0*1
  9851. -->
  9852. (S1 ^operator O1939 = 0.)
  9853. =>WM: (13679: S1 ^operator O1942 +)
  9854. =>WM: (13678: S1 ^operator O1941 +)
  9855. =>WM: (13677: I3 ^dir L)
  9856. =>WM: (13676: O1942 ^name predict-no)
  9857. =>WM: (13675: O1941 ^name predict-yes)
  9858. =>WM: (13674: R974 ^value 1)
  9859. =>WM: (13673: R1 ^reward R974)
  9860. <=WM: (13664: S1 ^operator O1939 +)
  9861. <=WM: (13665: S1 ^operator O1940 +)
  9862. <=WM: (13666: S1 ^operator O1940)
  9863. <=WM: (13650: I3 ^dir U)
  9864. <=WM: (13660: R1 ^reward R973)
  9865. <=WM: (13663: O1940 ^name predict-no)
  9866. <=WM: (13662: O1939 ^name predict-yes)
  9867. <=WM: (13661: R973 ^value 1)
  9868. --- Inner Elaboration Phase, active level 1 (S1) ---
  9869. Firing prefer*rvt*predict-yes*H0
  9870. -->
  9871. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9872. -->
  9873. (S1 ^operator O1941 = 0.735815301499146)
  9874. Firing rl*prefer*rvt*predict-yes*H0*5
  9875. -->
  9876. (S1 ^operator O1941 = 0.2640444846619989)
  9877. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9878. -->
  9879. Firing prefer*rvt*predict-no*H0
  9880. -->
  9881. Firing rl*prefer*rvt*predict-no*H0*6
  9882. -->
  9883. (S1 ^operator O1942 = 0.9997480945179411)
  9884. inner elaboration loop at bottom goal.
  9885. Retracting rl*prefer*rvt*predict-no*H0*6
  9886. -->
  9887. (S1 ^operator O1940 = 0.9997480945179411)
  9888. Retracting rl*prefer*rvt*predict-yes*H0*5
  9889. -->
  9890. (S1 ^operator O1939 = 0.2640444846619989)
  9891. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9892. -->
  9893. (S1 ^operator O1939 = 0.735815301499146)
  9894. --- END Proposal Phase ---
  9895. --- Decision Phase ---
  9896. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9897. =>WM: (13680: S1 ^operator O1941)
  9898. 971: O: O1941 (predict-yes)
  9899. --- END Decision Phase ---
  9900. --- Application Phase ---
  9901. --- Firing Productions (PE) For State At Depth 1 ---
  9902. --- Inner Elaboration Phase, active level 1 (S1) ---
  9903. Firing apply*operator
  9904. -->
  9905. (I3 ^predict-yes N971 + :O )
  9906. Firing apply*operator*complete
  9907. -->
  9908. (I3 ^predict-no N970 - :O )
  9909. inner elaboration loop at bottom goal.
  9910. --- Change Working Memory (PE) ---
  9911. =>WM: (13681: I3 ^predict-yes N971)
  9912. <=WM: (13668: N970 ^status complete)
  9913. <=WM: (13667: I3 ^predict-no N970)
  9914. --- Firing Productions (IE) For State At Depth 1 ---
  9915. --- Inner Elaboration Phase, active level 1 (S1) ---
  9916. Firing monitor*world
  9917. -->
  9918. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9919. --- Change Working Memory (IE) ---
  9920. --- END Application Phase ---
  9921. --- Output Phase ---
  9922. ENV: Agent did: predict-yes for direction L in state State-B
  9923. In State-B moving L
  9924. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9925. predict error 0
  9926. dir: dir isR
  9927. --- END Output Phase ---
  9928. ---- Input Phase ---
  9929. =>WM: (13685: I2 ^dir R)
  9930. =>WM: (13684: I2 ^reward 1)
  9931. =>WM: (13683: I2 ^see 1)
  9932. =>WM: (13682: N971 ^status complete)
  9933. <=WM: (13671: I2 ^dir L)
  9934. <=WM: (13670: I2 ^reward 1)
  9935. <=WM: (13669: I2 ^see 0)
  9936. =>WM: (13686: I2 ^level-1 L1-root)
  9937. <=WM: (13672: I2 ^level-1 R0-root)
  9938. --- END Input Phase ---
  9939. --- Proposal Phase ---
  9940. --- Inner Elaboration Phase, active level 1 (S1) ---
  9941. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  9942. -->
  9943. (S1 ^operator O1942 = -0.2714224023553999)
  9944. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  9945. -->
  9946. (S1 ^operator O1941 = 0.6622033637991441)
  9947. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9948. -->
  9949. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9950. -->
  9951. Firing elaborate*copy-see-to-output-link
  9952. -->
  9953. (I3 ^see 1 +)
  9954. Firing elaborate*reward*based*on*reward
  9955. -->
  9956. (R975 ^value 1 +)
  9957. (R1 ^reward R975 +)
  9958. Firing propose*predict-yes
  9959. -->
  9960. (O1943 ^name predict-yes +)
  9961. (S1 ^operator O1943 +)
  9962. Firing propose*predict-no
  9963. -->
  9964. (O1944 ^name predict-no +)
  9965. (S1 ^operator O1944 +)
  9966. Firing rl*prefer*rvt*predict-no*H0*4
  9967. -->
  9968. (S1 ^operator O1942 = 0.339769731277316)
  9969. Firing rl*prefer*rvt*predict-yes*H0*3
  9970. -->
  9971. (S1 ^operator O1941 = 0.3377045556949833)
  9972. Firing prefer*rvt*predict-yes*H0
  9973. -->
  9974. Firing prefer*rvt*predict-no*H0
  9975. -->
  9976. Firing elaborate*copy-dir-to-output-link
  9977. -->
  9978. (I3 ^dir R +)
  9979. inner elaboration loop at bottom goal.
  9980. Retracting elaborate*copy-see-to-output-link
  9981. -->
  9982. (I3 ^see 0 +)
  9983. Retracting propose*predict-no
  9984. -->
  9985. (O1942 ^name predict-no +)
  9986. (S1 ^operator O1942 +)
  9987. Retracting propose*predict-yes
  9988. -->
  9989. (O1941 ^name predict-yes +)
  9990. (S1 ^operator O1941 +)
  9991. Retracting elaborate*reward*based*on*reward
  9992. -->
  9993. (R974 ^value 1 +)
  9994. (R1 ^reward R974 +)
  9995. Retracting elaborate*copy-dir-to-output-link
  9996. -->
  9997. (I3 ^dir L +)
  9998. Retracting rl*prefer*rvt*predict-no*H0*6
  9999. -->
  10000. (S1 ^operator O1942 = 0.9997480945179411)
  10001. Retracting rl*prefer*rvt*predict-yes*H0*5
  10002. -->
  10003. (S1 ^operator O1941 = 0.2640444846619989)
  10004. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  10005. -->
  10006. (S1 ^operator O1941 = 0.735815301499146)
  10007. =>WM: (13694: S1 ^operator O1944 +)
  10008. =>WM: (13693: S1 ^operator O1943 +)
  10009. =>WM: (13692: I3 ^dir R)
  10010. =>WM: (13691: O1944 ^name predict-no)
  10011. =>WM: (13690: O1943 ^name predict-yes)
  10012. =>WM: (13689: R975 ^value 1)
  10013. =>WM: (13688: R1 ^reward R975)
  10014. =>WM: (13687: I3 ^see 1)
  10015. <=WM: (13678: S1 ^operator O1941 +)
  10016. <=WM: (13680: S1 ^operator O1941)
  10017. <=WM: (13679: S1 ^operator O1942 +)
  10018. <=WM: (13677: I3 ^dir L)
  10019. <=WM: (13673: R1 ^reward R974)
  10020. <=WM: (13645: I3 ^see 0)
  10021. <=WM: (13676: O1942 ^name predict-no)
  10022. <=WM: (13675: O1941 ^name predict-yes)
  10023. <=WM: (13674: R974 ^value 1)
  10024. --- Inner Elaboration Phase, active level 1 (S1) ---
  10025. Firing prefer*rvt*predict-yes*H0
  10026. -->
  10027. Firing rl*prefer*rvt*predict-yes*H0*3
  10028. -->
  10029. (S1 ^operator O1943 = 0.3377045556949833)
  10030. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10031. -->
  10032. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10033. -->
  10034. (S1 ^operator O1943 = 0.6622033637991441)
  10035. Firing prefer*rvt*predict-no*H0
  10036. -->
  10037. Firing rl*prefer*rvt*predict-no*H0*4
  10038. -->
  10039. (S1 ^operator O1944 = 0.339769731277316)
  10040. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10041. -->
  10042. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10043. -->
  10044. (S1 ^operator O1944 = -0.2714224023553999)
  10045. inner elaboration loop at bottom goal.
  10046. Retracting rl*prefer*rvt*predict-no*H0*4
  10047. -->
  10048. (S1 ^operator O1942 = 0.339769731277316)
  10049. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10050. -->
  10051. (S1 ^operator O1942 = -0.2714224023553999)
  10052. Retracting rl*prefer*rvt*predict-yes*H0*3
  10053. -->
  10054. (S1 ^operator O1941 = 0.3377045556949833)
  10055. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10056. -->
  10057. (S1 ^operator O1941 = 0.6622033637991441)
  10058. --- END Proposal Phase ---
  10059. --- Decision Phase ---
  10060. RL update rl*prefer*rvt*predict-yes*H0*5 0.55443 -0.290385 0.264044 -> 0.554441 -0.290385 0.264056(R,m,v=1,0.874286,0.110542)
  10061. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445432 0.290383 0.735815 -> 0.445446 0.290383 0.735829(R,m,v=1,1,0)
  10062. =>WM: (13695: S1 ^operator O1943)
  10063. 972: O: O1943 (predict-yes)
  10064. --- END Decision Phase ---
  10065. --- Application Phase ---
  10066. --- Firing Productions (PE) For State At Depth 1 ---
  10067. --- Inner Elaboration Phase, active level 1 (S1) ---
  10068. Firing apply*operator
  10069. -->
  10070. (I3 ^predict-yes N972 + :O )
  10071. Firing apply*operator*complete
  10072. -->
  10073. (I3 ^predict-yes N971 - :O )
  10074. inner elaboration loop at bottom goal.
  10075. --- Change Working Memory (PE) ---
  10076. =>WM: (13696: I3 ^predict-yes N972)
  10077. <=WM: (13682: N971 ^status complete)
  10078. <=WM: (13681: I3 ^predict-yes N971)
  10079. --- Firing Productions (IE) For State At Depth 1 ---
  10080. --- Inner Elaboration Phase, active level 1 (S1) ---
  10081. Firing monitor*world
  10082. -->
  10083. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10084. --- Change Working Memory (IE) ---
  10085. --- END Application Phase ---
  10086. --- Output Phase ---
  10087. ENV: Agent did: predict-yes for direction R in state State-A
  10088. In State-A moving R
  10089. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10090. predict error 0
  10091. dir: dir isL
  10092. --- END Output Phase ---
  10093. /|--- Input Phase ---
  10094. =>WM: (13700: I2 ^dir L)
  10095. =>WM: (13699: I2 ^reward 1)
  10096. =>WM: (13698: I2 ^see 1)
  10097. =>WM: (13697: N972 ^status complete)
  10098. <=WM: (13685: I2 ^dir R)
  10099. <=WM: (13684: I2 ^reward 1)
  10100. <=WM: (13683: I2 ^see 1)
  10101. =>WM: (13701: I2 ^level-1 R1-root)
  10102. <=WM: (13686: I2 ^level-1 L1-root)
  10103. --- END Input Phase ---
  10104. --- Proposal Phase ---
  10105. --- Inner Elaboration Phase, active level 1 (S1) ---
  10106. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10107. -->
  10108. (S1 ^operator O1943 = 0.7362862485154646)
  10109. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10110. -->
  10111. Firing elaborate*copy-see-to-output-link
  10112. -->
  10113. (I3 ^see 1 +)
  10114. Firing elaborate*reward*based*on*reward
  10115. -->
  10116. (R976 ^value 1 +)
  10117. (R1 ^reward R976 +)
  10118. Firing propose*predict-yes
  10119. -->
  10120. (O1945 ^name predict-yes +)
  10121. (S1 ^operator O1945 +)
  10122. Firing propose*predict-no
  10123. -->
  10124. (O1946 ^name predict-no +)
  10125. (S1 ^operator O1946 +)
  10126. Firing rl*prefer*rvt*predict-no*H0*6
  10127. -->
  10128. (S1 ^operator O1944 = 0.9997480945179411)
  10129. Firing rl*prefer*rvt*predict-yes*H0*5
  10130. -->
  10131. (S1 ^operator O1943 = 0.2640558568198847)
  10132. Firing prefer*rvt*predict-yes*H0
  10133. -->
  10134. Firing prefer*rvt*predict-no*H0
  10135. -->
  10136. Firing elaborate*copy-dir-to-output-link
  10137. -->
  10138. (I3 ^dir L +)
  10139. inner elaboration loop at bottom goal.
  10140. Retracting elaborate*copy-see-to-output-link
  10141. -->
  10142. (I3 ^see 1 +)
  10143. Retracting propose*predict-no
  10144. -->
  10145. (O1944 ^name predict-no +)
  10146. (S1 ^operator O1944 +)
  10147. Retracting propose*predict-yes
  10148. -->
  10149. (O1943 ^name predict-yes +)
  10150. (S1 ^operator O1943 +)
  10151. Retracting elaborate*reward*based*on*reward
  10152. -->
  10153. (R975 ^value 1 +)
  10154. (R1 ^reward R975 +)
  10155. Retracting elaborate*copy-dir-to-output-link
  10156. -->
  10157. (I3 ^dir R +)
  10158. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10159. -->
  10160. (S1 ^operator O1944 = -0.2714224023553999)
  10161. Retracting rl*prefer*rvt*predict-no*H0*4
  10162. -->
  10163. (S1 ^operator O1944 = 0.339769731277316)
  10164. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10165. -->
  10166. (S1 ^operator O1943 = 0.6622033637991441)
  10167. Retracting rl*prefer*rvt*predict-yes*H0*3
  10168. -->
  10169. (S1 ^operator O1943 = 0.3377045556949833)
  10170. =>WM: (13708: S1 ^operator O1946 +)
  10171. =>WM: (13707: S1 ^operator O1945 +)
  10172. =>WM: (13706: I3 ^dir L)
  10173. =>WM: (13705: O1946 ^name predict-no)
  10174. =>WM: (13704: O1945 ^name predict-yes)
  10175. =>WM: (13703: R976 ^value 1)
  10176. =>WM: (13702: R1 ^reward R976)
  10177. <=WM: (13693: S1 ^operator O1943 +)
  10178. <=WM: (13695: S1 ^operator O1943)
  10179. <=WM: (13694: S1 ^operator O1944 +)
  10180. <=WM: (13692: I3 ^dir R)
  10181. <=WM: (13688: R1 ^reward R975)
  10182. <=WM: (13691: O1944 ^name predict-no)
  10183. <=WM: (13690: O1943 ^name predict-yes)
  10184. <=WM: (13689: R975 ^value 1)
  10185. --- Inner Elaboration Phase, active level 1 (S1) ---
  10186. Firing prefer*rvt*predict-yes*H0
  10187. -->
  10188. Firing rl*prefer*rvt*predict-yes*H0*5
  10189. -->
  10190. (S1 ^operator O1945 = 0.2640558568198847)
  10191. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10192. -->
  10193. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10194. -->
  10195. (S1 ^operator O1945 = 0.7362862485154646)
  10196. Firing prefer*rvt*predict-no*H0
  10197. -->
  10198. Firing rl*prefer*rvt*predict-no*H0*6
  10199. -->
  10200. (S1 ^operator O1946 = 0.9997480945179411)
  10201. inner elaboration loop at bottom goal.
  10202. Retracting rl*prefer*rvt*predict-no*H0*6
  10203. -->
  10204. (S1 ^operator O1944 = 0.9997480945179411)
  10205. Retracting rl*prefer*rvt*predict-yes*H0*5
  10206. -->
  10207. (S1 ^operator O1943 = 0.2640558568198847)
  10208. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10209. -->
  10210. (S1 ^operator O1943 = 0.7362862485154646)
  10211. --- END Proposal Phase ---
  10212. --- Decision Phase ---
  10213. RL update rl*prefer*rvt*predict-yes*H0*3 0.590104 -0.252399 0.337705 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.896341,0.0934835)
  10214. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.40979 0.252413 0.662203 -> 0.4098 0.252412 0.662212(R,m,v=1,1,0)
  10215. =>WM: (13709: S1 ^operator O1945)
  10216. 973: O: O1945 (predict-yes)
  10217. --- END Decision Phase ---
  10218. --- Application Phase ---
  10219. --- Firing Productions (PE) For State At Depth 1 ---
  10220. --- Inner Elaboration Phase, active level 1 (S1) ---
  10221. Firing apply*operator
  10222. -->
  10223. (I3 ^predict-yes N973 + :O )
  10224. Firing apply*operator*complete
  10225. -->
  10226. (I3 ^predict-yes N972 - :O )
  10227. inner elaboration loop at bottom goal.
  10228. --- Change Working Memory (PE) ---
  10229. =>WM: (13710: I3 ^predict-yes N973)
  10230. <=WM: (13697: N972 ^status complete)
  10231. <=WM: (13696: I3 ^predict-yes N972)
  10232. --- Firing Productions (IE) For State At Depth 1 ---
  10233. --- Inner Elaboration Phase, active level 1 (S1) ---
  10234. Firing monitor*world
  10235. -->
  10236. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10237. --- Change Working Memory (IE) ---
  10238. --- END Application Phase ---
  10239. --- Output Phase ---
  10240. ENV: Agent did: predict-yes for direction L in state State-B
  10241. In State-B moving L
  10242. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10243. predict error 0
  10244. dir: dir isU
  10245. --- END Output Phase ---
  10246. \--- Input Phase ---
  10247. =>WM: (13714: I2 ^dir U)
  10248. =>WM: (13713: I2 ^reward 1)
  10249. =>WM: (13712: I2 ^see 1)
  10250. =>WM: (13711: N973 ^status complete)
  10251. <=WM: (13700: I2 ^dir L)
  10252. <=WM: (13699: I2 ^reward 1)
  10253. <=WM: (13698: I2 ^see 1)
  10254. =>WM: (13715: I2 ^level-1 L1-root)
  10255. <=WM: (13701: I2 ^level-1 R1-root)
  10256. --- END Input Phase ---
  10257. --- Proposal Phase ---
  10258. --- Inner Elaboration Phase, active level 1 (S1) ---
  10259. Firing elaborate*copy-see-to-output-link
  10260. -->
  10261. (I3 ^see 1 +)
  10262. Firing elaborate*reward*based*on*reward
  10263. -->
  10264. (R977 ^value 1 +)
  10265. (R1 ^reward R977 +)
  10266. Firing propose*predict-yes
  10267. -->
  10268. (O1947 ^name predict-yes +)
  10269. (S1 ^operator O1947 +)
  10270. Firing propose*predict-no
  10271. -->
  10272. (O1948 ^name predict-no +)
  10273. (S1 ^operator O1948 +)
  10274. Firing rl*prefer*rvt*predict-no*H0*2
  10275. -->
  10276. (S1 ^operator O1946 = 1.)
  10277. Firing rl*prefer*rvt*predict-yes*H0*1
  10278. -->
  10279. (S1 ^operator O1945 = 0.)
  10280. Firing prefer*rvt*predict-yes*H0
  10281. -->
  10282. Firing prefer*rvt*predict-no*H0
  10283. -->
  10284. Firing elaborate*copy-dir-to-output-link
  10285. -->
  10286. (I3 ^dir U +)
  10287. inner elaboration loop at bottom goal.
  10288. Retracting elaborate*copy-see-to-output-link
  10289. -->
  10290. (I3 ^see 1 +)
  10291. Retracting propose*predict-no
  10292. -->
  10293. (O1946 ^name predict-no +)
  10294. (S1 ^operator O1946 +)
  10295. Retracting propose*predict-yes
  10296. -->
  10297. (O1945 ^name predict-yes +)
  10298. (S1 ^operator O1945 +)
  10299. Retracting elaborate*reward*based*on*reward
  10300. -->
  10301. (R976 ^value 1 +)
  10302. (R1 ^reward R976 +)
  10303. Retracting elaborate*copy-dir-to-output-link
  10304. -->
  10305. (I3 ^dir L +)
  10306. Retracting rl*prefer*rvt*predict-no*H0*6
  10307. -->
  10308. (S1 ^operator O1946 = 0.9997480945179411)
  10309. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10310. -->
  10311. (S1 ^operator O1945 = 0.7362862485154646)
  10312. Retracting rl*prefer*rvt*predict-yes*H0*5
  10313. -->
  10314. (S1 ^operator O1945 = 0.2640558568198847)
  10315. =>WM: (13722: S1 ^operator O1948 +)
  10316. =>WM: (13721: S1 ^operator O1947 +)
  10317. =>WM: (13720: I3 ^dir U)
  10318. =>WM: (13719: O1948 ^name predict-no)
  10319. =>WM: (13718: O1947 ^name predict-yes)
  10320. =>WM: (13717: R977 ^value 1)
  10321. =>WM: (13716: R1 ^reward R977)
  10322. <=WM: (13707: S1 ^operator O1945 +)
  10323. <=WM: (13709: S1 ^operator O1945)
  10324. <=WM: (13708: S1 ^operator O1946 +)
  10325. <=WM: (13706: I3 ^dir L)
  10326. <=WM: (13702: R1 ^reward R976)
  10327. <=WM: (13705: O1946 ^name predict-no)
  10328. <=WM: (13704: O1945 ^name predict-yes)
  10329. <=WM: (13703: R976 ^value 1)
  10330. --- Inner Elaboration Phase, active level 1 (S1) ---
  10331. Firing prefer*rvt*predict-yes*H0
  10332. -->
  10333. Firing rl*prefer*rvt*predict-yes*H0*1
  10334. -->
  10335. (S1 ^operator O1947 = 0.)
  10336. Firing prefer*rvt*predict-no*H0
  10337. -->
  10338. Firing rl*prefer*rvt*predict-no*H0*2
  10339. -->
  10340. (S1 ^operator O1948 = 1.)
  10341. inner elaboration loop at bottom goal.
  10342. Retracting rl*prefer*rvt*predict-no*H0*2
  10343. -->
  10344. (S1 ^operator O1946 = 1.)
  10345. Retracting rl*prefer*rvt*predict-yes*H0*1
  10346. -->
  10347. (S1 ^operator O1945 = 0.)
  10348. --- END Proposal Phase ---
  10349. --- Decision Phase ---
  10350. RL update rl*prefer*rvt*predict-yes*H0*5 0.554441 -0.290385 0.264056 -> 0.554414 -0.290386 0.264028(R,m,v=1,0.875,0.11)
  10351. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445895 0.290391 0.736286 -> 0.445864 0.29039 0.736254(R,m,v=1,1,0)
  10352. =>WM: (13723: S1 ^operator O1948)
  10353. 974: O: O1948 (predict-no)
  10354. --- END Decision Phase ---
  10355. --- Application Phase ---
  10356. --- Firing Productions (PE) For State At Depth 1 ---
  10357. --- Inner Elaboration Phase, active level 1 (S1) ---
  10358. Firing apply*operator
  10359. -->
  10360. (I3 ^predict-no N974 + :O )
  10361. Firing apply*operator*complete
  10362. -->
  10363. (I3 ^predict-yes N973 - :O )
  10364. inner elaboration loop at bottom goal.
  10365. --- Change Working Memory (PE) ---
  10366. =>WM: (13724: I3 ^predict-no N974)
  10367. <=WM: (13711: N973 ^status complete)
  10368. <=WM: (13710: I3 ^predict-yes N973)
  10369. --- Firing Productions (IE) For State At Depth 1 ---
  10370. --- Inner Elaboration Phase, active level 1 (S1) ---
  10371. Firing monitor*world
  10372. -->
  10373. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10374. --- Change Working Memory (IE) ---
  10375. --- END Application Phase ---
  10376. --- Output Phase ---
  10377. ENV: Agent did: predict-no for direction U in state State-A
  10378. In State-A moving U
  10379. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10380. predict error 0
  10381. dir: dir isU
  10382. --- END Output Phase ---
  10383. -/--- Input Phase ---
  10384. =>WM: (13728: I2 ^dir U)
  10385. =>WM: (13727: I2 ^reward 1)
  10386. =>WM: (13726: I2 ^see 0)
  10387. =>WM: (13725: N974 ^status complete)
  10388. <=WM: (13714: I2 ^dir U)
  10389. <=WM: (13713: I2 ^reward 1)
  10390. <=WM: (13712: I2 ^see 1)
  10391. =>WM: (13729: I2 ^level-1 L1-root)
  10392. <=WM: (13715: I2 ^level-1 L1-root)
  10393. --- END Input Phase ---
  10394. --- Proposal Phase ---
  10395. --- Inner Elaboration Phase, active level 1 (S1) ---
  10396. Firing elaborate*copy-see-to-output-link
  10397. -->
  10398. (I3 ^see 0 +)
  10399. Firing elaborate*reward*based*on*reward
  10400. -->
  10401. (R978 ^value 1 +)
  10402. (R1 ^reward R978 +)
  10403. Firing propose*predict-yes
  10404. -->
  10405. (O1949 ^name predict-yes +)
  10406. (S1 ^operator O1949 +)
  10407. Firing propose*predict-no
  10408. -->
  10409. (O1950 ^name predict-no +)
  10410. (S1 ^operator O1950 +)
  10411. Firing rl*prefer*rvt*predict-no*H0*2
  10412. -->
  10413. (S1 ^operator O1948 = 1.)
  10414. Firing rl*prefer*rvt*predict-yes*H0*1
  10415. -->
  10416. (S1 ^operator O1947 = 0.)
  10417. Firing prefer*rvt*predict-yes*H0
  10418. -->
  10419. Firing prefer*rvt*predict-no*H0
  10420. -->
  10421. Firing elaborate*copy-dir-to-output-link
  10422. -->
  10423. (I3 ^dir U +)
  10424. inner elaboration loop at bottom goal.
  10425. Retracting elaborate*copy-see-to-output-link
  10426. -->
  10427. (I3 ^see 1 +)
  10428. Retracting propose*predict-no
  10429. -->
  10430. (O1948 ^name predict-no +)
  10431. (S1 ^operator O1948 +)
  10432. Retracting propose*predict-yes
  10433. -->
  10434. (O1947 ^name predict-yes +)
  10435. (S1 ^operator O1947 +)
  10436. Retracting elaborate*reward*based*on*reward
  10437. -->
  10438. (R977 ^value 1 +)
  10439. (R1 ^reward R977 +)
  10440. Retracting elaborate*copy-dir-to-output-link
  10441. -->
  10442. (I3 ^dir U +)
  10443. Retracting rl*prefer*rvt*predict-no*H0*2
  10444. -->
  10445. (S1 ^operator O1948 = 1.)
  10446. Retracting rl*prefer*rvt*predict-yes*H0*1
  10447. -->
  10448. (S1 ^operator O1947 = 0.)
  10449. =>WM: (13736: S1 ^operator O1950 +)
  10450. =>WM: (13735: S1 ^operator O1949 +)
  10451. =>WM: (13734: O1950 ^name predict-no)
  10452. =>WM: (13733: O1949 ^name predict-yes)
  10453. =>WM: (13732: R978 ^value 1)
  10454. =>WM: (13731: R1 ^reward R978)
  10455. =>WM: (13730: I3 ^see 0)
  10456. <=WM: (13721: S1 ^operator O1947 +)
  10457. <=WM: (13722: S1 ^operator O1948 +)
  10458. <=WM: (13723: S1 ^operator O1948)
  10459. <=WM: (13716: R1 ^reward R977)
  10460. <=WM: (13687: I3 ^see 1)
  10461. <=WM: (13719: O1948 ^name predict-no)
  10462. <=WM: (13718: O1947 ^name predict-yes)
  10463. <=WM: (13717: R977 ^value 1)
  10464. --- Inner Elaboration Phase, active level 1 (S1) ---
  10465. Firing prefer*rvt*predict-yes*H0
  10466. -->
  10467. Firing rl*prefer*rvt*predict-yes*H0*1
  10468. -->
  10469. (S1 ^operator O1949 = 0.)
  10470. Firing prefer*rvt*predict-no*H0
  10471. -->
  10472. Firing rl*prefer*rvt*predict-no*H0*2
  10473. -->
  10474. (S1 ^operator O1950 = 1.)
  10475. inner elaboration loop at bottom goal.
  10476. Retracting rl*prefer*rvt*predict-no*H0*2
  10477. -->
  10478. (S1 ^operator O1948 = 1.)
  10479. Retracting rl*prefer*rvt*predict-yes*H0*1
  10480. -->
  10481. (S1 ^operator O1947 = 0.)
  10482. --- END Proposal Phase ---
  10483. --- Decision Phase ---
  10484. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10485. =>WM: (13737: S1 ^operator O1950)
  10486. 975: O: O1950 (predict-no)
  10487. --- END Decision Phase ---
  10488. --- Application Phase ---
  10489. --- Firing Productions (PE) For State At Depth 1 ---
  10490. --- Inner Elaboration Phase, active level 1 (S1) ---
  10491. Firing apply*operator
  10492. -->
  10493. (I3 ^predict-no N975 + :O )
  10494. Firing apply*operator*complete
  10495. -->
  10496. (I3 ^predict-no N974 - :O )
  10497. inner elaboration loop at bottom goal.
  10498. --- Change Working Memory (PE) ---
  10499. =>WM: (13738: I3 ^predict-no N975)
  10500. <=WM: (13725: N974 ^status complete)
  10501. <=WM: (13724: I3 ^predict-no N974)
  10502. --- Firing Productions (IE) For State At Depth 1 ---
  10503. --- Inner Elaboration Phase, active level 1 (S1) ---
  10504. Firing monitor*world
  10505. -->
  10506. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10507. --- Change Working Memory (IE) ---
  10508. --- END Application Phase ---
  10509. --- Output Phase ---
  10510. ENV: Agent did: predict-no for direction U in state State-A
  10511. In State-A moving U
  10512. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10513. predict error 0
  10514. dir: dir isR
  10515. --- END Output Phase ---
  10516. |\---- Input Phase ---
  10517. =>WM: (13742: I2 ^dir R)
  10518. =>WM: (13741: I2 ^reward 1)
  10519. =>WM: (13740: I2 ^see 0)
  10520. =>WM: (13739: N975 ^status complete)
  10521. <=WM: (13728: I2 ^dir U)
  10522. <=WM: (13727: I2 ^reward 1)
  10523. <=WM: (13726: I2 ^see 0)
  10524. =>WM: (13743: I2 ^level-1 L1-root)
  10525. <=WM: (13729: I2 ^level-1 L1-root)
  10526. --- END Input Phase ---
  10527. --- Proposal Phase ---
  10528. --- Inner Elaboration Phase, active level 1 (S1) ---
  10529. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10530. -->
  10531. (S1 ^operator O1950 = -0.2714224023553999)
  10532. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10533. -->
  10534. (S1 ^operator O1949 = 0.6622121600001568)
  10535. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10536. -->
  10537. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10538. -->
  10539. Firing elaborate*copy-see-to-output-link
  10540. -->
  10541. (I3 ^see 0 +)
  10542. Firing elaborate*reward*based*on*reward
  10543. -->
  10544. (R979 ^value 1 +)
  10545. (R1 ^reward R979 +)
  10546. Firing propose*predict-yes
  10547. -->
  10548. (O1951 ^name predict-yes +)
  10549. (S1 ^operator O1951 +)
  10550. Firing propose*predict-no
  10551. -->
  10552. (O1952 ^name predict-no +)
  10553. (S1 ^operator O1952 +)
  10554. Firing rl*prefer*rvt*predict-no*H0*4
  10555. -->
  10556. (S1 ^operator O1950 = 0.339769731277316)
  10557. Firing rl*prefer*rvt*predict-yes*H0*3
  10558. -->
  10559. (S1 ^operator O1949 = 0.3377121034427055)
  10560. Firing prefer*rvt*predict-yes*H0
  10561. -->
  10562. Firing prefer*rvt*predict-no*H0
  10563. -->
  10564. Firing elaborate*copy-dir-to-output-link
  10565. -->
  10566. (I3 ^dir R +)
  10567. inner elaboration loop at bottom goal.
  10568. Retracting elaborate*copy-see-to-output-link
  10569. -->
  10570. (I3 ^see 0 +)
  10571. Retracting propose*predict-no
  10572. -->
  10573. (O1950 ^name predict-no +)
  10574. (S1 ^operator O1950 +)
  10575. Retracting propose*predict-yes
  10576. -->
  10577. (O1949 ^name predict-yes +)
  10578. (S1 ^operator O1949 +)
  10579. Retracting elaborate*reward*based*on*reward
  10580. -->
  10581. (R978 ^value 1 +)
  10582. (R1 ^reward R978 +)
  10583. Retracting elaborate*copy-dir-to-output-link
  10584. -->
  10585. (I3 ^dir U +)
  10586. Retracting rl*prefer*rvt*predict-no*H0*2
  10587. -->
  10588. (S1 ^operator O1950 = 1.)
  10589. Retracting rl*prefer*rvt*predict-yes*H0*1
  10590. -->
  10591. (S1 ^operator O1949 = 0.)
  10592. =>WM: (13750: S1 ^operator O1952 +)
  10593. =>WM: (13749: S1 ^operator O1951 +)
  10594. =>WM: (13748: I3 ^dir R)
  10595. =>WM: (13747: O1952 ^name predict-no)
  10596. =>WM: (13746: O1951 ^name predict-yes)
  10597. =>WM: (13745: R979 ^value 1)
  10598. =>WM: (13744: R1 ^reward R979)
  10599. <=WM: (13735: S1 ^operator O1949 +)
  10600. <=WM: (13736: S1 ^operator O1950 +)
  10601. <=WM: (13737: S1 ^operator O1950)
  10602. <=WM: (13720: I3 ^dir U)
  10603. <=WM: (13731: R1 ^reward R978)
  10604. <=WM: (13734: O1950 ^name predict-no)
  10605. <=WM: (13733: O1949 ^name predict-yes)
  10606. <=WM: (13732: R978 ^value 1)
  10607. --- Inner Elaboration Phase, active level 1 (S1) ---
  10608. Firing prefer*rvt*predict-yes*H0
  10609. -->
  10610. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10611. -->
  10612. (S1 ^operator O1951 = 0.6622121600001568)
  10613. Firing rl*prefer*rvt*predict-yes*H0*3
  10614. -->
  10615. (S1 ^operator O1951 = 0.3377121034427055)
  10616. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10617. -->
  10618. Firing prefer*rvt*predict-no*H0
  10619. -->
  10620. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10621. -->
  10622. (S1 ^operator O1952 = -0.2714224023553999)
  10623. Firing rl*prefer*rvt*predict-no*H0*4
  10624. -->
  10625. (S1 ^operator O1952 = 0.339769731277316)
  10626. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10627. -->
  10628. inner elaboration loop at bottom goal.
  10629. Retracting rl*prefer*rvt*predict-no*H0*4
  10630. -->
  10631. (S1 ^operator O1950 = 0.339769731277316)
  10632. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10633. -->
  10634. (S1 ^operator O1950 = -0.2714224023553999)
  10635. Retracting rl*prefer*rvt*predict-yes*H0*3
  10636. -->
  10637. (S1 ^operator O1949 = 0.3377121034427055)
  10638. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10639. -->
  10640. (S1 ^operator O1949 = 0.6622121600001568)
  10641. --- END Proposal Phase ---
  10642. --- Decision Phase ---
  10643. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10644. =>WM: (13751: S1 ^operator O1951)
  10645. 976: O: O1951 (predict-yes)
  10646. --- END Decision Phase ---
  10647. --- Application Phase ---
  10648. --- Firing Productions (PE) For State At Depth 1 ---
  10649. --- Inner Elaboration Phase, active level 1 (S1) ---
  10650. Firing apply*operator
  10651. -->
  10652. (I3 ^predict-yes N976 + :O )
  10653. Firing apply*operator*complete
  10654. -->
  10655. (I3 ^predict-no N975 - :O )
  10656. inner elaboration loop at bottom goal.
  10657. --- Change Working Memory (PE) ---
  10658. =>WM: (13752: I3 ^predict-yes N976)
  10659. <=WM: (13739: N975 ^status complete)
  10660. <=WM: (13738: I3 ^predict-no N975)
  10661. --- Firing Productions (IE) For State At Depth 1 ---
  10662. --- Inner Elaboration Phase, active level 1 (S1) ---
  10663. Firing monitor*world
  10664. -->
  10665. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10666. --- Change Working Memory (IE) ---
  10667. --- END Application Phase ---
  10668. --- Output Phase ---
  10669. ENV: Agent did: predict-yes for direction R in state State-A
  10670. In State-A moving R
  10671. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10672. predict error 0
  10673. dir: dir isU
  10674. --- END Output Phase ---
  10675. /|\--- Input Phase ---
  10676. =>WM: (13756: I2 ^dir U)
  10677. =>WM: (13755: I2 ^reward 1)
  10678. =>WM: (13754: I2 ^see 1)
  10679. =>WM: (13753: N976 ^status complete)
  10680. <=WM: (13742: I2 ^dir R)
  10681. <=WM: (13741: I2 ^reward 1)
  10682. <=WM: (13740: I2 ^see 0)
  10683. =>WM: (13757: I2 ^level-1 R1-root)
  10684. <=WM: (13743: I2 ^level-1 L1-root)
  10685. --- END Input Phase ---
  10686. --- Proposal Phase ---
  10687. --- Inner Elaboration Phase, active level 1 (S1) ---
  10688. Firing elaborate*copy-see-to-output-link
  10689. -->
  10690. (I3 ^see 1 +)
  10691. Firing elaborate*reward*based*on*reward
  10692. -->
  10693. (R980 ^value 1 +)
  10694. (R1 ^reward R980 +)
  10695. Firing propose*predict-yes
  10696. -->
  10697. (O1953 ^name predict-yes +)
  10698. (S1 ^operator O1953 +)
  10699. Firing propose*predict-no
  10700. -->
  10701. (O1954 ^name predict-no +)
  10702. (S1 ^operator O1954 +)
  10703. Firing rl*prefer*rvt*predict-no*H0*2
  10704. -->
  10705. (S1 ^operator O1952 = 1.)
  10706. Firing rl*prefer*rvt*predict-yes*H0*1
  10707. -->
  10708. (S1 ^operator O1951 = 0.)
  10709. Firing prefer*rvt*predict-yes*H0
  10710. -->
  10711. Firing prefer*rvt*predict-no*H0
  10712. -->
  10713. Firing elaborate*copy-dir-to-output-link
  10714. -->
  10715. (I3 ^dir U +)
  10716. inner elaboration loop at bottom goal.
  10717. Retracting elaborate*copy-see-to-output-link
  10718. -->
  10719. (I3 ^see 0 +)
  10720. Retracting propose*predict-no
  10721. -->
  10722. (O1952 ^name predict-no +)
  10723. (S1 ^operator O1952 +)
  10724. Retracting propose*predict-yes
  10725. -->
  10726. (O1951 ^name predict-yes +)
  10727. (S1 ^operator O1951 +)
  10728. Retracting elaborate*reward*based*on*reward
  10729. -->
  10730. (R979 ^value 1 +)
  10731. (R1 ^reward R979 +)
  10732. Retracting elaborate*copy-dir-to-output-link
  10733. -->
  10734. (I3 ^dir R +)
  10735. Retracting rl*prefer*rvt*predict-no*H0*4
  10736. -->
  10737. (S1 ^operator O1952 = 0.339769731277316)
  10738. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10739. -->
  10740. (S1 ^operator O1952 = -0.2714224023553999)
  10741. Retracting rl*prefer*rvt*predict-yes*H0*3
  10742. -->
  10743. (S1 ^operator O1951 = 0.3377121034427055)
  10744. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10745. -->
  10746. (S1 ^operator O1951 = 0.6622121600001568)
  10747. =>WM: (13765: S1 ^operator O1954 +)
  10748. =>WM: (13764: S1 ^operator O1953 +)
  10749. =>WM: (13763: I3 ^dir U)
  10750. =>WM: (13762: O1954 ^name predict-no)
  10751. =>WM: (13761: O1953 ^name predict-yes)
  10752. =>WM: (13760: R980 ^value 1)
  10753. =>WM: (13759: R1 ^reward R980)
  10754. =>WM: (13758: I3 ^see 1)
  10755. <=WM: (13749: S1 ^operator O1951 +)
  10756. <=WM: (13751: S1 ^operator O1951)
  10757. <=WM: (13750: S1 ^operator O1952 +)
  10758. <=WM: (13748: I3 ^dir R)
  10759. <=WM: (13744: R1 ^reward R979)
  10760. <=WM: (13730: I3 ^see 0)
  10761. <=WM: (13747: O1952 ^name predict-no)
  10762. <=WM: (13746: O1951 ^name predict-yes)
  10763. <=WM: (13745: R979 ^value 1)
  10764. --- Inner Elaboration Phase, active level 1 (S1) ---
  10765. Firing prefer*rvt*predict-yes*H0
  10766. -->
  10767. Firing rl*prefer*rvt*predict-yes*H0*1
  10768. -->
  10769. (S1 ^operator O1953 = 0.)
  10770. Firing prefer*rvt*predict-no*H0
  10771. -->
  10772. Firing rl*prefer*rvt*predict-no*H0*2
  10773. -->
  10774. (S1 ^operator O1954 = 1.)
  10775. inner elaboration loop at bottom goal.
  10776. Retracting rl*prefer*rvt*predict-no*H0*2
  10777. -->
  10778. (S1 ^operator O1952 = 1.)
  10779. Retracting rl*prefer*rvt*predict-yes*H0*1
  10780. -->
  10781. (S1 ^operator O1951 = 0.)
  10782. --- END Proposal Phase ---
  10783. --- Decision Phase ---
  10784. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.59012 -0.252401 0.337718(R,m,v=1,0.89697,0.0929786)
  10785. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.4098 0.252412 0.662212 -> 0.409809 0.252411 0.662219(R,m,v=1,1,0)
  10786. =>WM: (13766: S1 ^operator O1954)
  10787. 977: O: O1954 (predict-no)
  10788. --- END Decision Phase ---
  10789. --- Application Phase ---
  10790. --- Firing Productions (PE) For State At Depth 1 ---
  10791. --- Inner Elaboration Phase, active level 1 (S1) ---
  10792. Firing apply*operator
  10793. -->
  10794. (I3 ^predict-no N977 + :O )
  10795. Firing apply*operator*complete
  10796. -->
  10797. (I3 ^predict-yes N976 - :O )
  10798. inner elaboration loop at bottom goal.
  10799. --- Change Working Memory (PE) ---
  10800. =>WM: (13767: I3 ^predict-no N977)
  10801. <=WM: (13753: N976 ^status complete)
  10802. <=WM: (13752: I3 ^predict-yes N976)
  10803. --- Firing Productions (IE) For State At Depth 1 ---
  10804. --- Inner Elaboration Phase, active level 1 (S1) ---
  10805. Firing monitor*world
  10806. -->
  10807. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10808. --- Change Working Memory (IE) ---
  10809. --- END Application Phase ---
  10810. --- Output Phase ---
  10811. ENV: Agent did: predict-no for direction U in state State-B
  10812. In State-B moving U
  10813. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10814. predict error 0
  10815. dir: dir isU
  10816. --- END Output Phase ---
  10817. -/|\--- Input Phase ---
  10818. =>WM: (13771: I2 ^dir U)
  10819. =>WM: (13770: I2 ^reward 1)
  10820. =>WM: (13769: I2 ^see 0)
  10821. =>WM: (13768: N977 ^status complete)
  10822. <=WM: (13756: I2 ^dir U)
  10823. <=WM: (13755: I2 ^reward 1)
  10824. <=WM: (13754: I2 ^see 1)
  10825. =>WM: (13772: I2 ^level-1 R1-root)
  10826. <=WM: (13757: I2 ^level-1 R1-root)
  10827. --- END Input Phase ---
  10828. --- Proposal Phase ---
  10829. --- Inner Elaboration Phase, active level 1 (S1) ---
  10830. Firing elaborate*copy-see-to-output-link
  10831. -->
  10832. (I3 ^see 0 +)
  10833. Firing elaborate*reward*based*on*reward
  10834. -->
  10835. (R981 ^value 1 +)
  10836. (R1 ^reward R981 +)
  10837. Firing propose*predict-yes
  10838. -->
  10839. (O1955 ^name predict-yes +)
  10840. (S1 ^operator O1955 +)
  10841. Firing propose*predict-no
  10842. -->
  10843. (O1956 ^name predict-no +)
  10844. (S1 ^operator O1956 +)
  10845. Firing rl*prefer*rvt*predict-no*H0*2
  10846. -->
  10847. (S1 ^operator O1954 = 1.)
  10848. Firing rl*prefer*rvt*predict-yes*H0*1
  10849. -->
  10850. (S1 ^operator O1953 = 0.)
  10851. Firing prefer*rvt*predict-yes*H0
  10852. -->
  10853. Firing prefer*rvt*predict-no*H0
  10854. -->
  10855. Firing elaborate*copy-dir-to-output-link
  10856. -->
  10857. (I3 ^dir U +)
  10858. inner elaboration loop at bottom goal.
  10859. Retracting elaborate*copy-see-to-output-link
  10860. -->
  10861. (I3 ^see 1 +)
  10862. Retracting propose*predict-no
  10863. -->
  10864. (O1954 ^name predict-no +)
  10865. (S1 ^operator O1954 +)
  10866. Retracting propose*predict-yes
  10867. -->
  10868. (O1953 ^name predict-yes +)
  10869. (S1 ^operator O1953 +)
  10870. Retracting elaborate*reward*based*on*reward
  10871. -->
  10872. (R980 ^value 1 +)
  10873. (R1 ^reward R980 +)
  10874. Retracting elaborate*copy-dir-to-output-link
  10875. -->
  10876. (I3 ^dir U +)
  10877. Retracting rl*prefer*rvt*predict-no*H0*2
  10878. -->
  10879. (S1 ^operator O1954 = 1.)
  10880. Retracting rl*prefer*rvt*predict-yes*H0*1
  10881. -->
  10882. (S1 ^operator O1953 = 0.)
  10883. =>WM: (13779: S1 ^operator O1956 +)
  10884. =>WM: (13778: S1 ^operator O1955 +)
  10885. =>WM: (13777: O1956 ^name predict-no)
  10886. =>WM: (13776: O1955 ^name predict-yes)
  10887. =>WM: (13775: R981 ^value 1)
  10888. =>WM: (13774: R1 ^reward R981)
  10889. =>WM: (13773: I3 ^see 0)
  10890. <=WM: (13764: S1 ^operator O1953 +)
  10891. <=WM: (13765: S1 ^operator O1954 +)
  10892. <=WM: (13766: S1 ^operator O1954)
  10893. <=WM: (13759: R1 ^reward R980)
  10894. <=WM: (13758: I3 ^see 1)
  10895. <=WM: (13762: O1954 ^name predict-no)
  10896. <=WM: (13761: O1953 ^name predict-yes)
  10897. <=WM: (13760: R980 ^value 1)
  10898. --- Inner Elaboration Phase, active level 1 (S1) ---
  10899. Firing prefer*rvt*predict-yes*H0
  10900. -->
  10901. Firing rl*prefer*rvt*predict-yes*H0*1
  10902. -->
  10903. (S1 ^operator O1955 = 0.)
  10904. Firing prefer*rvt*predict-no*H0
  10905. -->
  10906. Firing rl*prefer*rvt*predict-no*H0*2
  10907. -->
  10908. (S1 ^operator O1956 = 1.)
  10909. inner elaboration loop at bottom goal.
  10910. Retracting rl*prefer*rvt*predict-no*H0*2
  10911. -->
  10912. (S1 ^operator O1954 = 1.)
  10913. Retracting rl*prefer*rvt*predict-yes*H0*1
  10914. -->
  10915. (S1 ^operator O1953 = 0.)
  10916. --- END Proposal Phase ---
  10917. --- Decision Phase ---
  10918. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10919. =>WM: (13780: S1 ^operator O1956)
  10920. 978: O: O1956 (predict-no)
  10921. --- END Decision Phase ---
  10922. --- Application Phase ---
  10923. --- Firing Productions (PE) For State At Depth 1 ---
  10924. --- Inner Elaboration Phase, active level 1 (S1) ---
  10925. Firing apply*operator
  10926. -->
  10927. (I3 ^predict-no N978 + :O )
  10928. Firing apply*operator*complete
  10929. -->
  10930. (I3 ^predict-no N977 - :O )
  10931. inner elaboration loop at bottom goal.
  10932. --- Change Working Memory (PE) ---
  10933. =>WM: (13781: I3 ^predict-no N978)
  10934. <=WM: (13768: N977 ^status complete)
  10935. <=WM: (13767: I3 ^predict-no N977)
  10936. --- Firing Productions (IE) For State At Depth 1 ---
  10937. --- Inner Elaboration Phase, active level 1 (S1) ---
  10938. Firing monitor*world
  10939. -->
  10940. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10941. --- Change Working Memory (IE) ---
  10942. --- END Application Phase ---
  10943. --- Output Phase ---
  10944. ENV: Agent did: predict-no for direction U in state State-B
  10945. In State-B moving U
  10946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10947. predict error 0
  10948. dir: dir isR
  10949. --- END Output Phase ---
  10950. -/|--- Input Phase ---
  10951. =>WM: (13785: I2 ^dir R)
  10952. =>WM: (13784: I2 ^reward 1)
  10953. =>WM: (13783: I2 ^see 0)
  10954. =>WM: (13782: N978 ^status complete)
  10955. <=WM: (13771: I2 ^dir U)
  10956. <=WM: (13770: I2 ^reward 1)
  10957. <=WM: (13769: I2 ^see 0)
  10958. =>WM: (13786: I2 ^level-1 R1-root)
  10959. <=WM: (13772: I2 ^level-1 R1-root)
  10960. --- END Input Phase ---
  10961. --- Proposal Phase ---
  10962. --- Inner Elaboration Phase, active level 1 (S1) ---
  10963. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  10964. -->
  10965. (S1 ^operator O1955 = -0.1070236389116304)
  10966. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  10967. -->
  10968. (S1 ^operator O1956 = 0.6602468953107985)
  10969. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10970. -->
  10971. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10972. -->
  10973. Firing elaborate*copy-see-to-output-link
  10974. -->
  10975. (I3 ^see 0 +)
  10976. Firing elaborate*reward*based*on*reward
  10977. -->
  10978. (R982 ^value 1 +)
  10979. (R1 ^reward R982 +)
  10980. Firing propose*predict-yes
  10981. -->
  10982. (O1957 ^name predict-yes +)
  10983. (S1 ^operator O1957 +)
  10984. Firing propose*predict-no
  10985. -->
  10986. (O1958 ^name predict-no +)
  10987. (S1 ^operator O1958 +)
  10988. Firing rl*prefer*rvt*predict-no*H0*4
  10989. -->
  10990. (S1 ^operator O1956 = 0.339769731277316)
  10991. Firing rl*prefer*rvt*predict-yes*H0*3
  10992. -->
  10993. (S1 ^operator O1955 = 0.3377183053124619)
  10994. Firing prefer*rvt*predict-yes*H0
  10995. -->
  10996. Firing prefer*rvt*predict-no*H0
  10997. -->
  10998. Firing elaborate*copy-dir-to-output-link
  10999. -->
  11000. (I3 ^dir R +)
  11001. inner elaboration loop at bottom goal.
  11002. Retracting elaborate*copy-see-to-output-link
  11003. -->
  11004. (I3 ^see 0 +)
  11005. Retracting propose*predict-no
  11006. -->
  11007. (O1956 ^name predict-no +)
  11008. (S1 ^operator O1956 +)
  11009. Retracting propose*predict-yes
  11010. -->
  11011. (O1955 ^name predict-yes +)
  11012. (S1 ^operator O1955 +)
  11013. Retracting elaborate*reward*based*on*reward
  11014. -->
  11015. (R981 ^value 1 +)
  11016. (R1 ^reward R981 +)
  11017. Retracting elaborate*copy-dir-to-output-link
  11018. -->
  11019. (I3 ^dir U +)
  11020. Retracting rl*prefer*rvt*predict-no*H0*2
  11021. -->
  11022. (S1 ^operator O1956 = 1.)
  11023. Retracting rl*prefer*rvt*predict-yes*H0*1
  11024. -->
  11025. (S1 ^operator O1955 = 0.)
  11026. =>WM: (13793: S1 ^operator O1958 +)
  11027. =>WM: (13792: S1 ^operator O1957 +)
  11028. =>WM: (13791: I3 ^dir R)
  11029. =>WM: (13790: O1958 ^name predict-no)
  11030. =>WM: (13789: O1957 ^name predict-yes)
  11031. =>WM: (13788: R982 ^value 1)
  11032. =>WM: (13787: R1 ^reward R982)
  11033. <=WM: (13778: S1 ^operator O1955 +)
  11034. <=WM: (13779: S1 ^operator O1956 +)
  11035. <=WM: (13780: S1 ^operator O1956)
  11036. <=WM: (13763: I3 ^dir U)
  11037. <=WM: (13774: R1 ^reward R981)
  11038. <=WM: (13777: O1956 ^name predict-no)
  11039. <=WM: (13776: O1955 ^name predict-yes)
  11040. <=WM: (13775: R981 ^value 1)
  11041. --- Inner Elaboration Phase, active level 1 (S1) ---
  11042. Firing prefer*rvt*predict-yes*H0
  11043. -->
  11044. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  11045. -->
  11046. (S1 ^operator O1957 = -0.1070236389116304)
  11047. Firing rl*prefer*rvt*predict-yes*H0*3
  11048. -->
  11049. (S1 ^operator O1957 = 0.3377183053124619)
  11050. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11051. -->
  11052. Firing prefer*rvt*predict-no*H0
  11053. -->
  11054. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  11055. -->
  11056. (S1 ^operator O1958 = 0.6602468953107985)
  11057. Firing rl*prefer*rvt*predict-no*H0*4
  11058. -->
  11059. (S1 ^operator O1958 = 0.339769731277316)
  11060. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11061. -->
  11062. inner elaboration loop at bottom goal.
  11063. Retracting rl*prefer*rvt*predict-no*H0*4
  11064. -->
  11065. (S1 ^operator O1956 = 0.339769731277316)
  11066. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  11067. -->
  11068. (S1 ^operator O1956 = 0.6602468953107985)
  11069. Retracting rl*prefer*rvt*predict-yes*H0*3
  11070. -->
  11071. (S1 ^operator O1955 = 0.3377183053124619)
  11072. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  11073. -->
  11074. (S1 ^operator O1955 = -0.1070236389116304)
  11075. --- END Proposal Phase ---
  11076. --- Decision Phase ---
  11077. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11078. =>WM: (13794: S1 ^operator O1958)
  11079. 979: O: O1958 (predict-no)
  11080. --- END Decision Phase ---
  11081. --- Application Phase ---
  11082. --- Firing Productions (PE) For State At Depth 1 ---
  11083. --- Inner Elaboration Phase, active level 1 (S1) ---
  11084. Firing apply*operator
  11085. -->
  11086. (I3 ^predict-no N979 + :O )
  11087. Firing apply*operator*complete
  11088. -->
  11089. (I3 ^predict-no N978 - :O )
  11090. inner elaboration loop at bottom goal.
  11091. --- Change Working Memory (PE) ---
  11092. =>WM: (13795: I3 ^predict-no N979)
  11093. <=WM: (13782: N978 ^status complete)
  11094. <=WM: (13781: I3 ^predict-no N978)
  11095. --- Firing Productions (IE) For State At Depth 1 ---
  11096. --- Inner Elaboration Phase, active level 1 (S1) ---
  11097. Firing monitor*world
  11098. -->
  11099. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11100. --- Change Working Memory (IE) ---
  11101. --- END Application Phase ---
  11102. --- Output Phase ---
  11103. ENV: Agent did: predict-no for direction R in state State-B
  11104. In State-B moving R
  11105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11106. predict error 0
  11107. dir: dir isU
  11108. --- END Output Phase ---
  11109. \-/--- Input Phase ---
  11110. =>WM: (13799: I2 ^dir U)
  11111. =>WM: (13798: I2 ^reward 1)
  11112. =>WM: (13797: I2 ^see 0)
  11113. =>WM: (13796: N979 ^status complete)
  11114. <=WM: (13785: I2 ^dir R)
  11115. <=WM: (13784: I2 ^reward 1)
  11116. <=WM: (13783: I2 ^see 0)
  11117. =>WM: (13800: I2 ^level-1 R0-root)
  11118. <=WM: (13786: I2 ^level-1 R1-root)
  11119. --- END Input Phase ---
  11120. --- Proposal Phase ---
  11121. --- Inner Elaboration Phase, active level 1 (S1) ---
  11122. Firing elaborate*copy-see-to-output-link
  11123. -->
  11124. (I3 ^see 0 +)
  11125. Firing elaborate*reward*based*on*reward
  11126. -->
  11127. (R983 ^value 1 +)
  11128. (R1 ^reward R983 +)
  11129. Firing propose*predict-yes
  11130. -->
  11131. (O1959 ^name predict-yes +)
  11132. (S1 ^operator O1959 +)
  11133. Firing propose*predict-no
  11134. -->
  11135. (O1960 ^name predict-no +)
  11136. (S1 ^operator O1960 +)
  11137. Firing rl*prefer*rvt*predict-no*H0*2
  11138. -->
  11139. (S1 ^operator O1958 = 1.)
  11140. Firing rl*prefer*rvt*predict-yes*H0*1
  11141. -->
  11142. (S1 ^operator O1957 = 0.)
  11143. Firing prefer*rvt*predict-yes*H0
  11144. -->
  11145. Firing prefer*rvt*predict-no*H0
  11146. -->
  11147. Firing elaborate*copy-dir-to-output-link
  11148. -->
  11149. (I3 ^dir U +)
  11150. inner elaboration loop at bottom goal.
  11151. Retracting elaborate*copy-see-to-output-link
  11152. -->
  11153. (I3 ^see 0 +)
  11154. Retracting propose*predict-no
  11155. -->
  11156. (O1958 ^name predict-no +)
  11157. (S1 ^operator O1958 +)
  11158. Retracting propose*predict-yes
  11159. -->
  11160. (O1957 ^name predict-yes +)
  11161. (S1 ^operator O1957 +)
  11162. Retracting elaborate*reward*based*on*reward
  11163. -->
  11164. (R982 ^value 1 +)
  11165. (R1 ^reward R982 +)
  11166. Retracting elaborate*copy-dir-to-output-link
  11167. -->
  11168. (I3 ^dir R +)
  11169. Retracting rl*prefer*rvt*predict-no*H0*4
  11170. -->
  11171. (S1 ^operator O1958 = 0.339769731277316)
  11172. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  11173. -->
  11174. (S1 ^operator O1958 = 0.6602468953107985)
  11175. Retracting rl*prefer*rvt*predict-yes*H0*3
  11176. -->
  11177. (S1 ^operator O1957 = 0.3377183053124619)
  11178. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  11179. -->
  11180. (S1 ^operator O1957 = -0.1070236389116304)
  11181. =>WM: (13807: S1 ^operator O1960 +)
  11182. =>WM: (13806: S1 ^operator O1959 +)
  11183. =>WM: (13805: I3 ^dir U)
  11184. =>WM: (13804: O1960 ^name predict-no)
  11185. =>WM: (13803: O1959 ^name predict-yes)
  11186. =>WM: (13802: R983 ^value 1)
  11187. =>WM: (13801: R1 ^reward R983)
  11188. <=WM: (13792: S1 ^operator O1957 +)
  11189. <=WM: (13793: S1 ^operator O1958 +)
  11190. <=WM: (13794: S1 ^operator O1958)
  11191. <=WM: (13791: I3 ^dir R)
  11192. <=WM: (13787: R1 ^reward R982)
  11193. <=WM: (13790: O1958 ^name predict-no)
  11194. <=WM: (13789: O1957 ^name predict-yes)
  11195. <=WM: (13788: R982 ^value 1)
  11196. --- Inner Elaboration Phase, active level 1 (S1) ---
  11197. Firing prefer*rvt*predict-yes*H0
  11198. -->
  11199. Firing rl*prefer*rvt*predict-yes*H0*1
  11200. -->
  11201. (S1 ^operator O1959 = 0.)
  11202. Firing prefer*rvt*predict-no*H0
  11203. -->
  11204. Firing rl*prefer*rvt*predict-no*H0*2
  11205. -->
  11206. (S1 ^operator O1960 = 1.)
  11207. inner elaboration loop at bottom goal.
  11208. Retracting rl*prefer*rvt*predict-no*H0*2
  11209. -->
  11210. (S1 ^operator O1958 = 1.)
  11211. Retracting rl*prefer*rvt*predict-yes*H0*1
  11212. -->
  11213. (S1 ^operator O1957 = 0.)
  11214. --- END Proposal Phase ---
  11215. --- Decision Phase ---
  11216. RL update rl*prefer*rvt*predict-no*H0*4 0.570253 -0.230483 0.33977 -> 0.570252 -0.230483 0.339768(R,m,v=1,0.873494,0.111172)
  11217. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429764 0.230483 0.660247 -> 0.429763 0.230483 0.660245(R,m,v=1,1,0)
  11218. =>WM: (13808: S1 ^operator O1960)
  11219. 980: O: O1960 (predict-no)
  11220. --- END Decision Phase ---
  11221. --- Application Phase ---
  11222. --- Firing Productions (PE) For State At Depth 1 ---
  11223. --- Inner Elaboration Phase, active level 1 (S1) ---
  11224. Firing apply*operator
  11225. -->
  11226. (I3 ^predict-no N980 + :O )
  11227. Firing apply*operator*complete
  11228. -->
  11229. (I3 ^predict-no N979 - :O )
  11230. inner elaboration loop at bottom goal.
  11231. --- Change Working Memory (PE) ---
  11232. =>WM: (13809: I3 ^predict-no N980)
  11233. <=WM: (13796: N979 ^status complete)
  11234. <=WM: (13795: I3 ^predict-no N979)
  11235. --- Firing Productions (IE) For State At Depth 1 ---
  11236. --- Inner Elaboration Phase, active level 1 (S1) ---
  11237. Firing monitor*world
  11238. -->
  11239. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11240. --- Change Working Memory (IE) ---
  11241. --- END Application Phase ---
  11242. --- Output Phase ---
  11243. ENV: Agent did: predict-no for direction U in state State-B
  11244. In State-B moving U
  11245. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11246. predict error 0
  11247. dir: dir isU
  11248. --- END Output Phase ---
  11249. --- Input Phase ---
  11250. =>WM: (13813: I2 ^dir U)
  11251. =>WM: (13812: I2 ^reward 1)
  11252. =>WM: (13811: I2 ^see 0)
  11253. =>WM: (13810: N980 ^status complete)
  11254. <=WM: (13799: I2 ^dir U)
  11255. <=WM: (13798: I2 ^reward 1)
  11256. <=WM: (13797: I2 ^see 0)
  11257. =>WM: (13814: I2 ^level-1 R0-root)
  11258. <=WM: (13800: I2 ^level-1 R0-root)
  11259. --- END Input Phase ---
  11260. --- Proposal Phase ---
  11261. --- Inner Elaboration Phase, active level 1 (S1) ---
  11262. Firing elaborate*copy-see-to-output-link
  11263. -->
  11264. (I3 ^see 0 +)
  11265. Firing elaborate*reward*based*on*reward
  11266. -->
  11267. (R984 ^value 1 +)
  11268. (R1 ^reward R984 +)
  11269. Firing propose*predict-yes
  11270. -->
  11271. (O1961 ^name predict-yes +)
  11272. (S1 ^operator O1961 +)
  11273. Firing propose*predict-no
  11274. -->
  11275. (O1962 ^name predict-no +)
  11276. (S1 ^operator O1962 +)
  11277. Firing rl*prefer*rvt*predict-no*H0*2
  11278. -->
  11279. (S1 ^operator O1960 = 1.)
  11280. Firing rl*prefer*rvt*predict-yes*H0*1
  11281. -->
  11282. (S1 ^operator O1959 = 0.)
  11283. Firing prefer*rvt*predict-yes*H0
  11284. -->
  11285. Firing prefer*rvt*predict-no*H0
  11286. -->
  11287. Firing elaborate*copy-dir-to-output-link
  11288. -->
  11289. (I3 ^dir U +)
  11290. inner elaboration loop at bottom goal.
  11291. Retracting elaborate*copy-see-to-output-link
  11292. -->
  11293. (I3 ^see 0 +)
  11294. Retracting propose*predict-no
  11295. -->
  11296. (O1960 ^name predict-no +)
  11297. (S1 ^operator O1960 +)
  11298. Retracting propose*predict-yes
  11299. -->
  11300. (O1959 ^name predict-yes +)
  11301. (S1 ^operator O1959 +)
  11302. Retracting elaborate*reward*based*on*reward
  11303. -->
  11304. (R983 ^value 1 +)
  11305. (R1 ^reward R983 +)
  11306. Retracting elaborate*copy-dir-to-output-link
  11307. -->
  11308. (I3 ^dir U +)
  11309. Retracting rl*prefer*rvt*predict-no*H0*2
  11310. -->
  11311. (S1 ^operator O1960 = 1.)
  11312. Retracting rl*prefer*rvt*predict-yes*H0*1
  11313. -->
  11314. (S1 ^operator O1959 = 0.)
  11315. =>WM: (13820: S1 ^operator O1962 +)
  11316. =>WM: (13819: S1 ^operator O1961 +)
  11317. =>WM: (13818: O1962 ^name predict-no)
  11318. =>WM: (13817: O1961 ^name predict-yes)
  11319. =>WM: (13816: R984 ^value 1)
  11320. =>WM: (13815: R1 ^reward R984)
  11321. <=WM: (13806: S1 ^operator O1959 +)
  11322. <=WM: (13807: S1 ^operator O1960 +)
  11323. <=WM: (13808: S1 ^operator O1960)
  11324. <=WM: (13801: R1 ^reward R983)
  11325. <=WM: (13804: O1960 ^name predict-no)
  11326. <=WM: (13803: O1959 ^name predict-yes)
  11327. <=WM: (13802: R983 ^value 1)
  11328. --- Inner Elaboration Phase, active level 1 (S1) ---
  11329. Firing prefer*rvt*predict-yes*H0
  11330. -->
  11331. Firing rl*prefer*rvt*predict-yes*H0*1
  11332. -->
  11333. (S1 ^operator O1961 = 0.)
  11334. Firing prefer*rvt*predict-no*H0
  11335. -->
  11336. Firing rl*prefer*rvt*predict-no*H0*2
  11337. -->
  11338. (S1 ^operator O1962 = 1.)
  11339. inner elaboration loop at bottom goal.
  11340. Retracting rl*prefer*rvt*predict-no*H0*2
  11341. -->
  11342. (S1 ^operator O1960 = 1.)
  11343. Retracting rl*prefer*rvt*predict-yes*H0*1
  11344. -->
  11345. (S1 ^operator O1959 = 0.)
  11346. --- END Proposal Phase ---
  11347. --- Decision Phase ---
  11348. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11349. =>WM: (13821: S1 ^operator O1962)
  11350. 981: O: O1962 (predict-no)
  11351. --- END Decision Phase ---
  11352. --- Application Phase ---
  11353. --- Firing Productions (PE) For State At Depth 1 ---
  11354. --- Inner Elaboration Phase, active level 1 (S1) ---
  11355. Firing apply*operator
  11356. -->
  11357. (I3 ^predict-no N981 + :O )
  11358. Firing apply*operator*complete
  11359. -->
  11360. (I3 ^predict-no N980 - :O )
  11361. inner elaboration loop at bottom goal.
  11362. --- Change Working Memory (PE) ---
  11363. =>WM: (13822: I3 ^predict-no N981)
  11364. <=WM: (13810: N980 ^status complete)
  11365. <=WM: (13809: I3 ^predict-no N980)
  11366. --- Firing Productions (IE) For State At Depth 1 ---
  11367. --- Inner Elaboration Phase, active level 1 (S1) ---
  11368. Firing monitor*world
  11369. -->
  11370. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11371. --- Change Working Memory (IE) ---
  11372. --- END Application Phase ---
  11373. --- Output Phase ---
  11374. ENV: Agent did: predict-no for direction U in state State-B
  11375. In State-B moving U
  11376. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11377. predict error 0
  11378. dir: dir isL
  11379. --- END Output Phase ---
  11380. --- Input Phase ---
  11381. =>WM: (13826: I2 ^dir L)
  11382. =>WM: (13825: I2 ^reward 1)
  11383. =>WM: (13824: I2 ^see 0)
  11384. =>WM: (13823: N981 ^status complete)
  11385. <=WM: (13813: I2 ^dir U)
  11386. <=WM: (13812: I2 ^reward 1)
  11387. <=WM: (13811: I2 ^see 0)
  11388. =>WM: (13827: I2 ^level-1 R0-root)
  11389. <=WM: (13814: I2 ^level-1 R0-root)
  11390. --- END Input Phase ---
  11391. --- Proposal Phase ---
  11392. --- Inner Elaboration Phase, active level 1 (S1) ---
  11393. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11394. -->
  11395. (S1 ^operator O1961 = 0.7358289752034343)
  11396. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11397. -->
  11398. Firing elaborate*copy-see-to-output-link
  11399. -->
  11400. (I3 ^see 0 +)
  11401. Firing elaborate*reward*based*on*reward
  11402. -->
  11403. (R985 ^value 1 +)
  11404. (R1 ^reward R985 +)
  11405. Firing propose*predict-yes
  11406. -->
  11407. (O1963 ^name predict-yes +)
  11408. (S1 ^operator O1963 +)
  11409. Firing propose*predict-no
  11410. -->
  11411. (O1964 ^name predict-no +)
  11412. (S1 ^operator O1964 +)
  11413. Firing rl*prefer*rvt*predict-no*H0*6
  11414. -->
  11415. (S1 ^operator O1962 = 0.9997480945179411)
  11416. Firing rl*prefer*rvt*predict-yes*H0*5
  11417. -->
  11418. (S1 ^operator O1961 = 0.2640281357095451)
  11419. Firing prefer*rvt*predict-yes*H0
  11420. -->
  11421. Firing prefer*rvt*predict-no*H0
  11422. -->
  11423. Firing elaborate*copy-dir-to-output-link
  11424. -->
  11425. (I3 ^dir L +)
  11426. inner elaboration loop at bottom goal.
  11427. Retracting elaborate*copy-see-to-output-link
  11428. -->
  11429. (I3 ^see 0 +)
  11430. Retracting propose*predict-no
  11431. -->
  11432. (O1962 ^name predict-no +)
  11433. (S1 ^operator O1962 +)
  11434. Retracting propose*predict-yes
  11435. -->
  11436. (O1961 ^name predict-yes +)
  11437. (S1 ^operator O1961 +)
  11438. Retracting elaborate*reward*based*on*reward
  11439. -->
  11440. (R984 ^value 1 +)
  11441. (R1 ^reward R984 +)
  11442. Retracting elaborate*copy-dir-to-output-link
  11443. -->
  11444. (I3 ^dir U +)
  11445. Retracting rl*prefer*rvt*predict-no*H0*2
  11446. -->
  11447. (S1 ^operator O1962 = 1.)
  11448. Retracting rl*prefer*rvt*predict-yes*H0*1
  11449. -->
  11450. (S1 ^operator O1961 = 0.)
  11451. =>WM: (13834: S1 ^operator O1964 +)
  11452. =>WM: (13833: S1 ^operator O1963 +)
  11453. =>WM: (13832: I3 ^dir L)
  11454. =>WM: (13831: O1964 ^name predict-no)
  11455. =>WM: (13830: O1963 ^name predict-yes)
  11456. =>WM: (13829: R985 ^value 1)
  11457. =>WM: (13828: R1 ^reward R985)
  11458. <=WM: (13819: S1 ^operator O1961 +)
  11459. <=WM: (13820: S1 ^operator O1962 +)
  11460. <=WM: (13821: S1 ^operator O1962)
  11461. <=WM: (13805: I3 ^dir U)
  11462. <=WM: (13815: R1 ^reward R984)
  11463. <=WM: (13818: O1962 ^name predict-no)
  11464. <=WM: (13817: O1961 ^name predict-yes)
  11465. <=WM: (13816: R984 ^value 1)
  11466. --- Inner Elaboration Phase, active level 1 (S1) ---
  11467. Firing prefer*rvt*predict-yes*H0
  11468. -->
  11469. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11470. -->
  11471. (S1 ^operator O1963 = 0.7358289752034343)
  11472. Firing rl*prefer*rvt*predict-yes*H0*5
  11473. -->
  11474. (S1 ^operator O1963 = 0.2640281357095451)
  11475. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11476. -->
  11477. Firing prefer*rvt*predict-no*H0
  11478. -->
  11479. Firing rl*prefer*rvt*predict-no*H0*6
  11480. -->
  11481. (S1 ^operator O1964 = 0.9997480945179411)
  11482. inner elaboration loop at bottom goal.
  11483. Retracting rl*prefer*rvt*predict-no*H0*6
  11484. -->
  11485. (S1 ^operator O1962 = 0.9997480945179411)
  11486. Retracting rl*prefer*rvt*predict-yes*H0*5
  11487. -->
  11488. (S1 ^operator O1961 = 0.2640281357095451)
  11489. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11490. -->
  11491. (S1 ^operator O1961 = 0.7358289752034343)
  11492. --- END Proposal Phase ---
  11493. --- Decision Phase ---
  11494. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11495. =>WM: (13835: S1 ^operator O1963)
  11496. 982: O: O1963 (predict-yes)
  11497. --- END Decision Phase ---
  11498. --- Application Phase ---
  11499. --- Firing Productions (PE) For State At Depth 1 ---
  11500. --- Inner Elaboration Phase, active level 1 (S1) ---
  11501. Firing apply*operator
  11502. -->
  11503. (I3 ^predict-yes N982 + :O )
  11504. Firing apply*operator*complete
  11505. -->
  11506. (I3 ^predict-no N981 - :O )
  11507. inner elaboration loop at bottom goal.
  11508. --- Change Working Memory (PE) ---
  11509. =>WM: (13836: I3 ^predict-yes N982)
  11510. <=WM: (13823: N981 ^status complete)
  11511. <=WM: (13822: I3 ^predict-no N981)
  11512. --- Firing Productions (IE) For State At Depth 1 ---
  11513. --- Inner Elaboration Phase, active level 1 (S1) ---
  11514. Firing monitor*world
  11515. -->
  11516. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11517. --- Change Working Memory (IE) ---
  11518. --- END Application Phase ---
  11519. --- Output Phase ---
  11520. ENV: Agent did: predict-yes for direction L in state State-B
  11521. In State-B moving L
  11522. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11523. predict error 0
  11524. dir: dir isU
  11525. --- END Output Phase ---
  11526. --- Input Phase ---
  11527. =>WM: (13840: I2 ^dir U)
  11528. =>WM: (13839: I2 ^reward 1)
  11529. =>WM: (13838: I2 ^see 1)
  11530. =>WM: (13837: N982 ^status complete)
  11531. <=WM: (13826: I2 ^dir L)
  11532. <=WM: (13825: I2 ^reward 1)
  11533. <=WM: (13824: I2 ^see 0)
  11534. =>WM: (13841: I2 ^level-1 L1-root)
  11535. <=WM: (13827: I2 ^level-1 R0-root)
  11536. --- END Input Phase ---
  11537. --- Proposal Phase ---
  11538. --- Inner Elaboration Phase, active level 1 (S1) ---
  11539. Firing elaborate*copy-see-to-output-link
  11540. -->
  11541. (I3 ^see 1 +)
  11542. Firing elaborate*reward*based*on*reward
  11543. -->
  11544. (R986 ^value 1 +)
  11545. (R1 ^reward R986 +)
  11546. Firing propose*predict-yes
  11547. -->
  11548. (O1965 ^name predict-yes +)
  11549. (S1 ^operator O1965 +)
  11550. Firing propose*predict-no
  11551. -->
  11552. (O1966 ^name predict-no +)
  11553. (S1 ^operator O1966 +)
  11554. Firing rl*prefer*rvt*predict-no*H0*2
  11555. -->
  11556. (S1 ^operator O1964 = 1.)
  11557. Firing rl*prefer*rvt*predict-yes*H0*1
  11558. -->
  11559. (S1 ^operator O1963 = 0.)
  11560. Firing prefer*rvt*predict-yes*H0
  11561. -->
  11562. Firing prefer*rvt*predict-no*H0
  11563. -->
  11564. Firing elaborate*copy-dir-to-output-link
  11565. -->
  11566. (I3 ^dir U +)
  11567. inner elaboration loop at bottom goal.
  11568. Retracting elaborate*copy-see-to-output-link
  11569. -->
  11570. (I3 ^see 0 +)
  11571. Retracting propose*predict-no
  11572. -->
  11573. (O1964 ^name predict-no +)
  11574. (S1 ^operator O1964 +)
  11575. Retracting propose*predict-yes
  11576. -->
  11577. (O1963 ^name predict-yes +)
  11578. (S1 ^operator O1963 +)
  11579. Retracting elaborate*reward*based*on*reward
  11580. -->
  11581. (R985 ^value 1 +)
  11582. (R1 ^reward R985 +)
  11583. Retracting elaborate*copy-dir-to-output-link
  11584. -->
  11585. (I3 ^dir L +)
  11586. Retracting rl*prefer*rvt*predict-no*H0*6
  11587. -->
  11588. (S1 ^operator O1964 = 0.9997480945179411)
  11589. Retracting rl*prefer*rvt*predict-yes*H0*5
  11590. -->
  11591. (S1 ^operator O1963 = 0.2640281357095451)
  11592. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11593. -->
  11594. (S1 ^operator O1963 = 0.7358289752034343)
  11595. =>WM: (13849: S1 ^operator O1966 +)
  11596. =>WM: (13848: S1 ^operator O1965 +)
  11597. =>WM: (13847: I3 ^dir U)
  11598. =>WM: (13846: O1966 ^name predict-no)
  11599. =>WM: (13845: O1965 ^name predict-yes)
  11600. =>WM: (13844: R986 ^value 1)
  11601. =>WM: (13843: R1 ^reward R986)
  11602. =>WM: (13842: I3 ^see 1)
  11603. <=WM: (13833: S1 ^operator O1963 +)
  11604. <=WM: (13835: S1 ^operator O1963)
  11605. <=WM: (13834: S1 ^operator O1964 +)
  11606. <=WM: (13832: I3 ^dir L)
  11607. <=WM: (13828: R1 ^reward R985)
  11608. <=WM: (13773: I3 ^see 0)
  11609. <=WM: (13831: O1964 ^name predict-no)
  11610. <=WM: (13830: O1963 ^name predict-yes)
  11611. <=WM: (13829: R985 ^value 1)
  11612. --- Inner Elaboration Phase, active level 1 (S1) ---
  11613. Firing prefer*rvt*predict-yes*H0
  11614. -->
  11615. Firing rl*prefer*rvt*predict-yes*H0*1
  11616. -->
  11617. (S1 ^operator O1965 = 0.)
  11618. Firing prefer*rvt*predict-no*H0
  11619. -->
  11620. Firing rl*prefer*rvt*predict-no*H0*2
  11621. -->
  11622. (S1 ^operator O1966 = 1.)
  11623. inner elaboration loop at bottom goal.
  11624. Retracting rl*prefer*rvt*predict-no*H0*2
  11625. -->
  11626. (S1 ^operator O1964 = 1.)
  11627. Retracting rl*prefer*rvt*predict-yes*H0*1
  11628. -->
  11629. (S1 ^operator O1963 = 0.)
  11630. --- END Proposal Phase ---
  11631. --- Decision Phase ---
  11632. RL update rl*prefer*rvt*predict-yes*H0*5 0.554414 -0.290386 0.264028 -> 0.554425 -0.290385 0.26404(R,m,v=1,0.875706,0.109463)
  11633. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445446 0.290383 0.735829 -> 0.44546 0.290383 0.735843(R,m,v=1,1,0)
  11634. =>WM: (13850: S1 ^operator O1966)
  11635. 983: O: O1966 (predict-no)
  11636. --- END Decision Phase ---
  11637. --- Application Phase ---
  11638. --- Firing Productions (PE) For State At Depth 1 ---
  11639. --- Inner Elaboration Phase, active level 1 (S1) ---
  11640. Firing apply*operator
  11641. -->
  11642. (I3 ^predict-no N983 + :O )
  11643. Firing apply*operator*complete
  11644. -->
  11645. (I3 ^predict-yes N982 - :O )
  11646. inner elaboration loop at bottom goal.
  11647. --- Change Working Memory (PE) ---
  11648. =>WM: (13851: I3 ^predict-no N983)
  11649. <=WM: (13837: N982 ^status complete)
  11650. <=WM: (13836: I3 ^predict-yes N982)
  11651. --- Firing Productions (IE) For State At Depth 1 ---
  11652. --- Inner Elaboration Phase, active level 1 (S1) ---
  11653. Firing monitor*world
  11654. -->
  11655. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11656. --- Change Working Memory (IE) ---
  11657. --- END Application Phase ---
  11658. --- Output Phase ---
  11659. ENV: Agent did: predict-no for direction U in state State-A
  11660. In State-A moving U
  11661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11662. predict error 0
  11663. dir: dir isL
  11664. --- END Output Phase ---
  11665. --- Input Phase ---
  11666. =>WM: (13855: I2 ^dir L)
  11667. =>WM: (13854: I2 ^reward 1)
  11668. =>WM: (13853: I2 ^see 0)
  11669. =>WM: (13852: N983 ^status complete)
  11670. <=WM: (13840: I2 ^dir U)
  11671. <=WM: (13839: I2 ^reward 1)
  11672. <=WM: (13838: I2 ^see 1)
  11673. =>WM: (13856: I2 ^level-1 L1-root)
  11674. <=WM: (13841: I2 ^level-1 L1-root)
  11675. --- END Input Phase ---
  11676. --- Proposal Phase ---
  11677. --- Inner Elaboration Phase, active level 1 (S1) ---
  11678. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11679. -->
  11680. (S1 ^operator O1965 = -0.181727099742844)
  11681. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11682. -->
  11683. Firing elaborate*copy-see-to-output-link
  11684. -->
  11685. (I3 ^see 0 +)
  11686. Firing elaborate*reward*based*on*reward
  11687. -->
  11688. (R987 ^value 1 +)
  11689. (R1 ^reward R987 +)
  11690. Firing propose*predict-yes
  11691. -->
  11692. (O1967 ^name predict-yes +)
  11693. (S1 ^operator O1967 +)
  11694. Firing propose*predict-no
  11695. -->
  11696. (O1968 ^name predict-no +)
  11697. (S1 ^operator O1968 +)
  11698. Firing rl*prefer*rvt*predict-no*H0*6
  11699. -->
  11700. (S1 ^operator O1966 = 0.9997480945179411)
  11701. Firing rl*prefer*rvt*predict-yes*H0*5
  11702. -->
  11703. (S1 ^operator O1965 = 0.264039703522277)
  11704. Firing prefer*rvt*predict-yes*H0
  11705. -->
  11706. Firing prefer*rvt*predict-no*H0
  11707. -->
  11708. Firing elaborate*copy-dir-to-output-link
  11709. -->
  11710. (I3 ^dir L +)
  11711. inner elaboration loop at bottom goal.
  11712. Retracting elaborate*copy-see-to-output-link
  11713. -->
  11714. (I3 ^see 1 +)
  11715. Retracting propose*predict-no
  11716. -->
  11717. (O1966 ^name predict-no +)
  11718. (S1 ^operator O1966 +)
  11719. Retracting propose*predict-yes
  11720. -->
  11721. (O1965 ^name predict-yes +)
  11722. (S1 ^operator O1965 +)
  11723. Retracting elaborate*reward*based*on*reward
  11724. -->
  11725. (R986 ^value 1 +)
  11726. (R1 ^reward R986 +)
  11727. Retracting elaborate*copy-dir-to-output-link
  11728. -->
  11729. (I3 ^dir U +)
  11730. Retracting rl*prefer*rvt*predict-no*H0*2
  11731. -->
  11732. (S1 ^operator O1966 = 1.)
  11733. Retracting rl*prefer*rvt*predict-yes*H0*1
  11734. -->
  11735. (S1 ^operator O1965 = 0.)
  11736. =>WM: (13864: S1 ^operator O1968 +)
  11737. =>WM: (13863: S1 ^operator O1967 +)
  11738. =>WM: (13862: I3 ^dir L)
  11739. =>WM: (13861: O1968 ^name predict-no)
  11740. =>WM: (13860: O1967 ^name predict-yes)
  11741. =>WM: (13859: R987 ^value 1)
  11742. =>WM: (13858: R1 ^reward R987)
  11743. =>WM: (13857: I3 ^see 0)
  11744. <=WM: (13848: S1 ^operator O1965 +)
  11745. <=WM: (13849: S1 ^operator O1966 +)
  11746. <=WM: (13850: S1 ^operator O1966)
  11747. <=WM: (13847: I3 ^dir U)
  11748. <=WM: (13843: R1 ^reward R986)
  11749. <=WM: (13842: I3 ^see 1)
  11750. <=WM: (13846: O1966 ^name predict-no)
  11751. <=WM: (13845: O1965 ^name predict-yes)
  11752. <=WM: (13844: R986 ^value 1)
  11753. --- Inner Elaboration Phase, active level 1 (S1) ---
  11754. Firing prefer*rvt*predict-yes*H0
  11755. -->
  11756. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11757. -->
  11758. (S1 ^operator O1967 = -0.181727099742844)
  11759. Firing rl*prefer*rvt*predict-yes*H0*5
  11760. -->
  11761. (S1 ^operator O1967 = 0.264039703522277)
  11762. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11763. -->
  11764. Firing prefer*rvt*predict-no*H0
  11765. -->
  11766. Firing rl*prefer*rvt*predict-no*H0*6
  11767. -->
  11768. (S1 ^operator O1968 = 0.9997480945179411)
  11769. inner elaboration loop at bottom goal.
  11770. Retracting rl*prefer*rvt*predict-no*H0*6
  11771. -->
  11772. (S1 ^operator O1966 = 0.9997480945179411)
  11773. Retracting rl*prefer*rvt*predict-yes*H0*5
  11774. -->
  11775. (S1 ^operator O1965 = 0.264039703522277)
  11776. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11777. -->
  11778. (S1 ^operator O1965 = -0.181727099742844)
  11779. --- END Proposal Phase ---
  11780. --- Decision Phase ---
  11781. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11782. =>WM: (13865: S1 ^operator O1968)
  11783. 984: O: O1968 (predict-no)
  11784. --- END Decision Phase ---
  11785. --- Application Phase ---
  11786. --- Firing Productions (PE) For State At Depth 1 ---
  11787. --- Inner Elaboration Phase, active level 1 (S1) ---
  11788. Firing apply*operator
  11789. -->
  11790. (I3 ^predict-no N984 + :O )
  11791. Firing apply*operator*complete
  11792. -->
  11793. (I3 ^predict-no N983 - :O )
  11794. inner elaboration loop at bottom goal.
  11795. --- Change Working Memory (PE) ---
  11796. =>WM: (13866: I3 ^predict-no N984)
  11797. <=WM: (13852: N983 ^status complete)
  11798. <=WM: (13851: I3 ^predict-no N983)
  11799. --- Firing Productions (IE) For State At Depth 1 ---
  11800. --- Inner Elaboration Phase, active level 1 (S1) ---
  11801. Firing monitor*world
  11802. -->
  11803. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11804. --- Change Working Memory (IE) ---
  11805. --- END Application Phase ---
  11806. --- Output Phase ---
  11807. ENV: Agent did: predict-no for direction L in state State-A
  11808. In State-A moving L
  11809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11810. predict error 0
  11811. dir: dir isU
  11812. --- END Output Phase ---
  11813. --- Input Phase ---
  11814. =>WM: (13870: I2 ^dir U)
  11815. =>WM: (13869: I2 ^reward 1)
  11816. =>WM: (13868: I2 ^see 0)
  11817. =>WM: (13867: N984 ^status complete)
  11818. <=WM: (13855: I2 ^dir L)
  11819. <=WM: (13854: I2 ^reward 1)
  11820. <=WM: (13853: I2 ^see 0)
  11821. =>WM: (13871: I2 ^level-1 L0-root)
  11822. <=WM: (13856: I2 ^level-1 L1-root)
  11823. --- END Input Phase ---
  11824. --- Proposal Phase ---
  11825. --- Inner Elaboration Phase, active level 1 (S1) ---
  11826. Firing elaborate*copy-see-to-output-link
  11827. -->
  11828. (I3 ^see 0 +)
  11829. Firing elaborate*reward*based*on*reward
  11830. -->
  11831. (R988 ^value 1 +)
  11832. (R1 ^reward R988 +)
  11833. Firing propose*predict-yes
  11834. -->
  11835. (O1969 ^name predict-yes +)
  11836. (S1 ^operator O1969 +)
  11837. Firing propose*predict-no
  11838. -->
  11839. (O1970 ^name predict-no +)
  11840. (S1 ^operator O1970 +)
  11841. Firing rl*prefer*rvt*predict-no*H0*2
  11842. -->
  11843. (S1 ^operator O1968 = 1.)
  11844. Firing rl*prefer*rvt*predict-yes*H0*1
  11845. -->
  11846. (S1 ^operator O1967 = 0.)
  11847. Firing prefer*rvt*predict-yes*H0
  11848. -->
  11849. Firing prefer*rvt*predict-no*H0
  11850. -->
  11851. Firing elaborate*copy-dir-to-output-link
  11852. -->
  11853. (I3 ^dir U +)
  11854. inner elaboration loop at bottom goal.
  11855. Retracting elaborate*copy-see-to-output-link
  11856. -->
  11857. (I3 ^see 0 +)
  11858. Retracting propose*predict-no
  11859. -->
  11860. (O1968 ^name predict-no +)
  11861. (S1 ^operator O1968 +)
  11862. Retracting propose*predict-yes
  11863. -->
  11864. (O1967 ^name predict-yes +)
  11865. (S1 ^operator O1967 +)
  11866. Retracting elaborate*reward*based*on*reward
  11867. -->
  11868. (R987 ^value 1 +)
  11869. (R1 ^reward R987 +)
  11870. Retracting elaborate*copy-dir-to-output-link
  11871. -->
  11872. (I3 ^dir L +)
  11873. Retracting rl*prefer*rvt*predict-no*H0*6
  11874. -->
  11875. (S1 ^operator O1968 = 0.9997480945179411)
  11876. Retracting rl*prefer*rvt*predict-yes*H0*5
  11877. -->
  11878. (S1 ^operator O1967 = 0.264039703522277)
  11879. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11880. -->
  11881. (S1 ^operator O1967 = -0.181727099742844)
  11882. =>WM: (13878: S1 ^operator O1970 +)
  11883. =>WM: (13877: S1 ^operator O1969 +)
  11884. =>WM: (13876: I3 ^dir U)
  11885. =>WM: (13875: O1970 ^name predict-no)
  11886. =>WM: (13874: O1969 ^name predict-yes)
  11887. =>WM: (13873: R988 ^value 1)
  11888. =>WM: (13872: R1 ^reward R988)
  11889. <=WM: (13863: S1 ^operator O1967 +)
  11890. <=WM: (13864: S1 ^operator O1968 +)
  11891. <=WM: (13865: S1 ^operator O1968)
  11892. <=WM: (13862: I3 ^dir L)
  11893. <=WM: (13858: R1 ^reward R987)
  11894. <=WM: (13861: O1968 ^name predict-no)
  11895. <=WM: (13860: O1967 ^name predict-yes)
  11896. <=WM: (13859: R987 ^value 1)
  11897. --- Inner Elaboration Phase, active level 1 (S1) ---
  11898. Firing prefer*rvt*predict-yes*H0
  11899. -->
  11900. Firing rl*prefer*rvt*predict-yes*H0*1
  11901. -->
  11902. (S1 ^operator O1969 = 0.)
  11903. Firing prefer*rvt*predict-no*H0
  11904. -->
  11905. Firing rl*prefer*rvt*predict-no*H0*2
  11906. -->
  11907. (S1 ^operator O1970 = 1.)
  11908. inner elaboration loop at bottom goal.
  11909. Retracting rl*prefer*rvt*predict-no*H0*2
  11910. -->
  11911. (S1 ^operator O1968 = 1.)
  11912. Retracting rl*prefer*rvt*predict-yes*H0*1
  11913. -->
  11914. (S1 ^operator O1967 = 0.)
  11915. --- END Proposal Phase ---
  11916. --- Decision Phase ---
  11917. RL update rl*prefer*rvt*predict-no*H0*6 0.999748 0 0.999748 -> 0.99979 0 0.99979(R,m,v=1,0.904762,0.086758)
  11918. =>WM: (13879: S1 ^operator O1970)
  11919. 985: O: O1970 (predict-no)
  11920. --- END Decision Phase ---
  11921. --- Application Phase ---
  11922. --- Firing Productions (PE) For State At Depth 1 ---
  11923. --- Inner Elaboration Phase, active level 1 (S1) ---
  11924. Firing apply*operator
  11925. -->
  11926. (I3 ^predict-no N985 + :O )
  11927. Firing apply*operator*complete
  11928. -->
  11929. (I3 ^predict-no N984 - :O )
  11930. inner elaboration loop at bottom goal.
  11931. --- Change Working Memory (PE) ---
  11932. =>WM: (13880: I3 ^predict-no N985)
  11933. <=WM: (13867: N984 ^status complete)
  11934. <=WM: (13866: I3 ^predict-no N984)
  11935. --- Firing Productions (IE) For State At Depth 1 ---
  11936. --- Inner Elaboration Phase, active level 1 (S1) ---
  11937. Firing monitor*world
  11938. -->
  11939. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11940. --- Change Working Memory (IE) ---
  11941. --- END Application Phase ---
  11942. --- Output Phase ---
  11943. ENV: Agent did: predict-no for direction U in state State-A
  11944. In State-A moving U
  11945. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11946. predict error 0
  11947. dir: dir isR
  11948. --- END Output Phase ---
  11949. --- Input Phase ---
  11950. =>WM: (13884: I2 ^dir R)
  11951. =>WM: (13883: I2 ^reward 1)
  11952. =>WM: (13882: I2 ^see 0)
  11953. =>WM: (13881: N985 ^status complete)
  11954. <=WM: (13870: I2 ^dir U)
  11955. <=WM: (13869: I2 ^reward 1)
  11956. <=WM: (13868: I2 ^see 0)
  11957. =>WM: (13885: I2 ^level-1 L0-root)
  11958. <=WM: (13871: I2 ^level-1 L0-root)
  11959. --- END Input Phase ---
  11960. --- Proposal Phase ---
  11961. --- Inner Elaboration Phase, active level 1 (S1) ---
  11962. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11963. -->
  11964. (S1 ^operator O1970 = -0.2817060109291377)
  11965. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11966. -->
  11967. (S1 ^operator O1969 = 0.6623600134734193)
  11968. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11969. -->
  11970. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11971. -->
  11972. Firing elaborate*copy-see-to-output-link
  11973. -->
  11974. (I3 ^see 0 +)
  11975. Firing elaborate*reward*based*on*reward
  11976. -->
  11977. (R989 ^value 1 +)
  11978. (R1 ^reward R989 +)
  11979. Firing propose*predict-yes
  11980. -->
  11981. (O1971 ^name predict-yes +)
  11982. (S1 ^operator O1971 +)
  11983. Firing propose*predict-no
  11984. -->
  11985. (O1972 ^name predict-no +)
  11986. (S1 ^operator O1972 +)
  11987. Firing rl*prefer*rvt*predict-no*H0*4
  11988. -->
  11989. (S1 ^operator O1970 = 0.3397683711152304)
  11990. Firing rl*prefer*rvt*predict-yes*H0*3
  11991. -->
  11992. (S1 ^operator O1969 = 0.3377183053124619)
  11993. Firing prefer*rvt*predict-yes*H0
  11994. -->
  11995. Firing prefer*rvt*predict-no*H0
  11996. -->
  11997. Firing elaborate*copy-dir-to-output-link
  11998. -->
  11999. (I3 ^dir R +)
  12000. inner elaboration loop at bottom goal.
  12001. Retracting elaborate*copy-see-to-output-link
  12002. -->
  12003. (I3 ^see 0 +)
  12004. Retracting propose*predict-no
  12005. -->
  12006. (O1970 ^name predict-no +)
  12007. (S1 ^operator O1970 +)
  12008. Retracting propose*predict-yes
  12009. -->
  12010. (O1969 ^name predict-yes +)
  12011. (S1 ^operator O1969 +)
  12012. Retracting elaborate*reward*based*on*reward
  12013. -->
  12014. (R988 ^value 1 +)
  12015. (R1 ^reward R988 +)
  12016. Retracting elaborate*copy-dir-to-output-link
  12017. -->
  12018. (I3 ^dir U +)
  12019. Retracting rl*prefer*rvt*predict-no*H0*2
  12020. -->
  12021. (S1 ^operator O1970 = 1.)
  12022. Retracting rl*prefer*rvt*predict-yes*H0*1
  12023. -->
  12024. (S1 ^operator O1969 = 0.)
  12025. =>WM: (13892: S1 ^operator O1972 +)
  12026. =>WM: (13891: S1 ^operator O1971 +)
  12027. =>WM: (13890: I3 ^dir R)
  12028. =>WM: (13889: O1972 ^name predict-no)
  12029. =>WM: (13888: O1971 ^name predict-yes)
  12030. =>WM: (13887: R989 ^value 1)
  12031. =>WM: (13886: R1 ^reward R989)
  12032. <=WM: (13877: S1 ^operator O1969 +)
  12033. <=WM: (13878: S1 ^operator O1970 +)
  12034. <=WM: (13879: S1 ^operator O1970)
  12035. <=WM: (13876: I3 ^dir U)
  12036. <=WM: (13872: R1 ^reward R988)
  12037. <=WM: (13875: O1970 ^name predict-no)
  12038. <=WM: (13874: O1969 ^name predict-yes)
  12039. <=WM: (13873: R988 ^value 1)
  12040. --- Inner Elaboration Phase, active level 1 (S1) ---
  12041. Firing prefer*rvt*predict-yes*H0
  12042. -->
  12043. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  12044. -->
  12045. (S1 ^operator O1971 = 0.6623600134734193)
  12046. Firing rl*prefer*rvt*predict-yes*H0*3
  12047. -->
  12048. (S1 ^operator O1971 = 0.3377183053124619)
  12049. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12050. -->
  12051. Firing prefer*rvt*predict-no*H0
  12052. -->
  12053. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  12054. -->
  12055. (S1 ^operator O1972 = -0.2817060109291377)
  12056. Firing rl*prefer*rvt*predict-no*H0*4
  12057. -->
  12058. (S1 ^operator O1972 = 0.3397683711152304)
  12059. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12060. -->
  12061. inner elaboration loop at bottom goal.
  12062. Retracting rl*prefer*rvt*predict-no*H0*4
  12063. -->
  12064. (S1 ^operator O1970 = 0.3397683711152304)
  12065. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  12066. -->
  12067. (S1 ^operator O1970 = -0.2817060109291377)
  12068. Retracting rl*prefer*rvt*predict-yes*H0*3
  12069. -->
  12070. (S1 ^operator O1969 = 0.3377183053124619)
  12071. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  12072. -->
  12073. (S1 ^operator O1969 = 0.6623600134734193)
  12074. --- END Proposal Phase ---
  12075. --- Decision Phase ---
  12076. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12077. =>WM: (13893: S1 ^operator O1971)
  12078. 986: O: O1971 (predict-yes)
  12079. --- END Decision Phase ---
  12080. --- Application Phase ---
  12081. --- Firing Productions (PE) For State At Depth 1 ---
  12082. --- Inner Elaboration Phase, active level 1 (S1) ---
  12083. Firing apply*operator
  12084. -->
  12085. (I3 ^predict-yes N986 + :O )
  12086. Firing apply*operator*complete
  12087. -->
  12088. (I3 ^predict-no N985 - :O )
  12089. inner elaboration loop at bottom goal.
  12090. --- Change Working Memory (PE) ---
  12091. =>WM: (13894: I3 ^predict-yes N986)
  12092. <=WM: (13881: N985 ^status complete)
  12093. <=WM: (13880: I3 ^predict-no N985)
  12094. --- Firing Productions (IE) For State At Depth 1 ---
  12095. --- Inner Elaboration Phase, active level 1 (S1) ---
  12096. Firing monitor*world
  12097. -->
  12098. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12099. --- Change Working Memory (IE) ---
  12100. --- END Application Phase ---
  12101. --- Output Phase ---
  12102. ENV: Agent did: predict-yes for direction R in state State-A
  12103. In State-A moving R
  12104. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12105. predict error 0
  12106. dir: dir isU
  12107. --- END Output Phase ---
  12108. --- Input Phase ---
  12109. =>WM: (13898: I2 ^dir U)
  12110. =>WM: (13897: I2 ^reward 1)
  12111. =>WM: (13896: I2 ^see 1)
  12112. =>WM: (13895: N986 ^status complete)
  12113. <=WM: (13884: I2 ^dir R)
  12114. <=WM: (13883: I2 ^reward 1)
  12115. <=WM: (13882: I2 ^see 0)
  12116. =>WM: (13899: I2 ^level-1 R1-root)
  12117. <=WM: (13885: I2 ^level-1 L0-root)
  12118. --- END Input Phase ---
  12119. --- Proposal Phase ---
  12120. --- Inner Elaboration Phase, active level 1 (S1) ---
  12121. Firing elaborate*copy-see-to-output-link
  12122. -->
  12123. (I3 ^see 1 +)
  12124. Firing elaborate*reward*based*on*reward
  12125. -->
  12126. (R990 ^value 1 +)
  12127. (R1 ^reward R990 +)
  12128. Firing propose*predict-yes
  12129. -->
  12130. (O1973 ^name predict-yes +)
  12131. (S1 ^operator O1973 +)
  12132. Firing propose*predict-no
  12133. -->
  12134. (O1974 ^name predict-no +)
  12135. (S1 ^operator O1974 +)
  12136. Firing rl*prefer*rvt*predict-no*H0*2
  12137. -->
  12138. (S1 ^operator O1972 = 1.)
  12139. Firing rl*prefer*rvt*predict-yes*H0*1
  12140. -->
  12141. (S1 ^operator O1971 = 0.)
  12142. Firing prefer*rvt*predict-yes*H0
  12143. -->
  12144. Firing prefer*rvt*predict-no*H0
  12145. -->
  12146. Firing elaborate*copy-dir-to-output-link
  12147. -->
  12148. (I3 ^dir U +)
  12149. inner elaboration loop at bottom goal.
  12150. Retracting elaborate*copy-see-to-output-link
  12151. -->
  12152. (I3 ^see 0 +)
  12153. Retracting propose*predict-no
  12154. -->
  12155. (O1972 ^name predict-no +)
  12156. (S1 ^operator O1972 +)
  12157. Retracting propose*predict-yes
  12158. -->
  12159. (O1971 ^name predict-yes +)
  12160. (S1 ^operator O1971 +)
  12161. Retracting elaborate*reward*based*on*reward
  12162. -->
  12163. (R989 ^value 1 +)
  12164. (R1 ^reward R989 +)
  12165. Retracting elaborate*copy-dir-to-output-link
  12166. -->
  12167. (I3 ^dir R +)
  12168. Retracting rl*prefer*rvt*predict-no*H0*4
  12169. -->
  12170. (S1 ^operator O1972 = 0.3397683711152304)
  12171. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  12172. -->
  12173. (S1 ^operator O1972 = -0.2817060109291377)
  12174. Retracting rl*prefer*rvt*predict-yes*H0*3
  12175. -->
  12176. (S1 ^operator O1971 = 0.3377183053124619)
  12177. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  12178. -->
  12179. (S1 ^operator O1971 = 0.6623600134734193)
  12180. =>WM: (13907: S1 ^operator O1974 +)
  12181. =>WM: (13906: S1 ^operator O1973 +)
  12182. =>WM: (13905: I3 ^dir U)
  12183. =>WM: (13904: O1974 ^name predict-no)
  12184. =>WM: (13903: O1973 ^name predict-yes)
  12185. =>WM: (13902: R990 ^value 1)
  12186. =>WM: (13901: R1 ^reward R990)
  12187. =>WM: (13900: I3 ^see 1)
  12188. <=WM: (13891: S1 ^operator O1971 +)
  12189. <=WM: (13893: S1 ^operator O1971)
  12190. <=WM: (13892: S1 ^operator O1972 +)
  12191. <=WM: (13890: I3 ^dir R)
  12192. <=WM: (13886: R1 ^reward R989)
  12193. <=WM: (13857: I3 ^see 0)
  12194. <=WM: (13889: O1972 ^name predict-no)
  12195. <=WM: (13888: O1971 ^name predict-yes)
  12196. <=WM: (13887: R989 ^value 1)
  12197. --- Inner Elaboration Phase, active level 1 (S1) ---
  12198. Firing prefer*rvt*predict-yes*H0
  12199. -->
  12200. Firing rl*prefer*rvt*predict-yes*H0*1
  12201. -->
  12202. (S1 ^operator O1973 = 0.)
  12203. Firing prefer*rvt*predict-no*H0
  12204. -->
  12205. Firing rl*prefer*rvt*predict-no*H0*2
  12206. -->
  12207. (S1 ^operator O1974 = 1.)
  12208. inner elaboration loop at bottom goal.
  12209. Retracting rl*prefer*rvt*predict-no*H0*2
  12210. -->
  12211. (S1 ^operator O1972 = 1.)
  12212. Retracting rl*prefer*rvt*predict-yes*H0*1
  12213. -->
  12214. (S1 ^operator O1971 = 0.)
  12215. --- END Proposal Phase ---
  12216. --- Decision Phase ---
  12217. RL update rl*prefer*rvt*predict-yes*H0*3 0.59012 -0.252401 0.337718 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.89759,0.092479)
  12218. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409971 0.252389 0.66236 -> 0.409962 0.25239 0.662353(R,m,v=1,1,0)
  12219. =>WM: (13908: S1 ^operator O1974)
  12220. 987: O: O1974 (predict-no)
  12221. --- END Decision Phase ---
  12222. --- Application Phase ---
  12223. --- Firing Productions (PE) For State At Depth 1 ---
  12224. --- Inner Elaboration Phase, active level 1 (S1) ---
  12225. Firing apply*operator
  12226. -->
  12227. (I3 ^predict-no N987 + :O )
  12228. Firing apply*operator*complete
  12229. -->
  12230. (I3 ^predict-yes N986 - :O )
  12231. inner elaboration loop at bottom goal.
  12232. --- Change Working Memory (PE) ---
  12233. =>WM: (13909: I3 ^predict-no N987)
  12234. <=WM: (13895: N986 ^status complete)
  12235. <=WM: (13894: I3 ^predict-yes N986)
  12236. --- Firing Productions (IE) For State At Depth 1 ---
  12237. --- Inner Elaboration Phase, active level 1 (S1) ---
  12238. Firing monitor*world
  12239. -->
  12240. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12241. --- Change Working Memory (IE) ---
  12242. --- END Application Phase ---
  12243. --- Output Phase ---
  12244. ENV: Agent did: predict-no for direction U in state State-B
  12245. In State-B moving U
  12246. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12247. predict error 0
  12248. dir: dir isR
  12249. --- END Output Phase ---
  12250. --- Input Phase ---
  12251. =>WM: (13913: I2 ^dir R)
  12252. =>WM: (13912: I2 ^reward 1)
  12253. =>WM: (13911: I2 ^see 0)
  12254. =>WM: (13910: N987 ^status complete)
  12255. <=WM: (13898: I2 ^dir U)
  12256. <=WM: (13897: I2 ^reward 1)
  12257. <=WM: (13896: I2 ^see 1)
  12258. =>WM: (13914: I2 ^level-1 R1-root)
  12259. <=WM: (13899: I2 ^level-1 R1-root)
  12260. --- END Input Phase ---
  12261. --- Proposal Phase ---
  12262. --- Inner Elaboration Phase, active level 1 (S1) ---
  12263. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12264. -->
  12265. (S1 ^operator O1973 = -0.1070236389116304)
  12266. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12267. -->
  12268. (S1 ^operator O1974 = 0.6602453025755203)
  12269. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12270. -->
  12271. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12272. -->
  12273. Firing elaborate*copy-see-to-output-link
  12274. -->
  12275. (I3 ^see 0 +)
  12276. Firing elaborate*reward*based*on*reward
  12277. -->
  12278. (R991 ^value 1 +)
  12279. (R1 ^reward R991 +)
  12280. Firing propose*predict-yes
  12281. -->
  12282. (O1975 ^name predict-yes +)
  12283. (S1 ^operator O1975 +)
  12284. Firing propose*predict-no
  12285. -->
  12286. (O1976 ^name predict-no +)
  12287. (S1 ^operator O1976 +)
  12288. Firing rl*prefer*rvt*predict-no*H0*4
  12289. -->
  12290. (S1 ^operator O1974 = 0.3397683711152304)
  12291. Firing rl*prefer*rvt*predict-yes*H0*3
  12292. -->
  12293. (S1 ^operator O1973 = 0.3377118983309207)
  12294. Firing prefer*rvt*predict-yes*H0
  12295. -->
  12296. Firing prefer*rvt*predict-no*H0
  12297. -->
  12298. Firing elaborate*copy-dir-to-output-link
  12299. -->
  12300. (I3 ^dir R +)
  12301. inner elaboration loop at bottom goal.
  12302. Retracting elaborate*copy-see-to-output-link
  12303. -->
  12304. (I3 ^see 1 +)
  12305. Retracting propose*predict-no
  12306. -->
  12307. (O1974 ^name predict-no +)
  12308. (S1 ^operator O1974 +)
  12309. Retracting propose*predict-yes
  12310. -->
  12311. (O1973 ^name predict-yes +)
  12312. (S1 ^operator O1973 +)
  12313. Retracting elaborate*reward*based*on*reward
  12314. -->
  12315. (R990 ^value 1 +)
  12316. (R1 ^reward R990 +)
  12317. Retracting elaborate*copy-dir-to-output-link
  12318. -->
  12319. (I3 ^dir U +)
  12320. Retracting rl*prefer*rvt*predict-no*H0*2
  12321. -->
  12322. (S1 ^operator O1974 = 1.)
  12323. Retracting rl*prefer*rvt*predict-yes*H0*1
  12324. -->
  12325. (S1 ^operator O1973 = 0.)
  12326. =>WM: (13922: S1 ^operator O1976 +)
  12327. =>WM: (13921: S1 ^operator O1975 +)
  12328. =>WM: (13920: I3 ^dir R)
  12329. =>WM: (13919: O1976 ^name predict-no)
  12330. =>WM: (13918: O1975 ^name predict-yes)
  12331. =>WM: (13917: R991 ^value 1)
  12332. =>WM: (13916: R1 ^reward R991)
  12333. =>WM: (13915: I3 ^see 0)
  12334. <=WM: (13906: S1 ^operator O1973 +)
  12335. <=WM: (13907: S1 ^operator O1974 +)
  12336. <=WM: (13908: S1 ^operator O1974)
  12337. <=WM: (13905: I3 ^dir U)
  12338. <=WM: (13901: R1 ^reward R990)
  12339. <=WM: (13900: I3 ^see 1)
  12340. <=WM: (13904: O1974 ^name predict-no)
  12341. <=WM: (13903: O1973 ^name predict-yes)
  12342. <=WM: (13902: R990 ^value 1)
  12343. --- Inner Elaboration Phase, active level 1 (S1) ---
  12344. Firing prefer*rvt*predict-yes*H0
  12345. -->
  12346. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12347. -->
  12348. (S1 ^operator O1975 = -0.1070236389116304)
  12349. Firing rl*prefer*rvt*predict-yes*H0*3
  12350. -->
  12351. (S1 ^operator O1975 = 0.3377118983309207)
  12352. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12353. -->
  12354. Firing prefer*rvt*predict-no*H0
  12355. -->
  12356. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12357. -->
  12358. (S1 ^operator O1976 = 0.6602453025755203)
  12359. Firing rl*prefer*rvt*predict-no*H0*4
  12360. -->
  12361. (S1 ^operator O1976 = 0.3397683711152304)
  12362. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12363. -->
  12364. inner elaboration loop at bottom goal.
  12365. Retracting rl*prefer*rvt*predict-no*H0*4
  12366. -->
  12367. (S1 ^operator O1974 = 0.3397683711152304)
  12368. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12369. -->
  12370. (S1 ^operator O1974 = 0.6602453025755203)
  12371. Retracting rl*prefer*rvt*predict-yes*H0*3
  12372. -->
  12373. (S1 ^operator O1973 = 0.3377118983309207)
  12374. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12375. -->
  12376. (S1 ^operator O1973 = -0.1070236389116304)
  12377. --- END Proposal Phase ---
  12378. --- Decision Phase ---
  12379. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12380. =>WM: (13923: S1 ^operator O1976)
  12381. 988: O: O1976 (predict-no)
  12382. --- END Decision Phase ---
  12383. --- Application Phase ---
  12384. --- Firing Productions (PE) For State At Depth 1 ---
  12385. --- Inner Elaboration Phase, active level 1 (S1) ---
  12386. Firing apply*operator
  12387. -->
  12388. (I3 ^predict-no N988 + :O )
  12389. Firing apply*operator*complete
  12390. -->
  12391. (I3 ^predict-no N987 - :O )
  12392. inner elaboration loop at bottom goal.
  12393. --- Change Working Memory (PE) ---
  12394. =>WM: (13924: I3 ^predict-no N988)
  12395. <=WM: (13910: N987 ^status complete)
  12396. <=WM: (13909: I3 ^predict-no N987)
  12397. --- Firing Productions (IE) For State At Depth 1 ---
  12398. --- Inner Elaboration Phase, active level 1 (S1) ---
  12399. Firing monitor*world
  12400. -->
  12401. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12402. --- Change Working Memory (IE) ---
  12403. --- END Application Phase ---
  12404. --- Output Phase ---
  12405. ENV: Agent did: predict-no for direction R in state State-B
  12406. In State-B moving R
  12407. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12408. predict error 0
  12409. dir: dir isR
  12410. --- END Output Phase ---
  12411. --- Input Phase ---
  12412. =>WM: (13928: I2 ^dir R)
  12413. =>WM: (13927: I2 ^reward 1)
  12414. =>WM: (13926: I2 ^see 0)
  12415. =>WM: (13925: N988 ^status complete)
  12416. <=WM: (13913: I2 ^dir R)
  12417. <=WM: (13912: I2 ^reward 1)
  12418. <=WM: (13911: I2 ^see 0)
  12419. =>WM: (13929: I2 ^level-1 R0-root)
  12420. <=WM: (13914: I2 ^level-1 R1-root)
  12421. --- END Input Phase ---
  12422. --- Proposal Phase ---
  12423. --- Inner Elaboration Phase, active level 1 (S1) ---
  12424. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12425. -->
  12426. (S1 ^operator O1976 = 0.660152441867348)
  12427. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12428. -->
  12429. (S1 ^operator O1975 = -0.1028953566115423)
  12430. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12431. -->
  12432. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12433. -->
  12434. Firing elaborate*copy-see-to-output-link
  12435. -->
  12436. (I3 ^see 0 +)
  12437. Firing elaborate*reward*based*on*reward
  12438. -->
  12439. (R992 ^value 1 +)
  12440. (R1 ^reward R992 +)
  12441. Firing propose*predict-yes
  12442. -->
  12443. (O1977 ^name predict-yes +)
  12444. (S1 ^operator O1977 +)
  12445. Firing propose*predict-no
  12446. -->
  12447. (O1978 ^name predict-no +)
  12448. (S1 ^operator O1978 +)
  12449. Firing rl*prefer*rvt*predict-no*H0*4
  12450. -->
  12451. (S1 ^operator O1976 = 0.3397683711152304)
  12452. Firing rl*prefer*rvt*predict-yes*H0*3
  12453. -->
  12454. (S1 ^operator O1975 = 0.3377118983309207)
  12455. Firing prefer*rvt*predict-yes*H0
  12456. -->
  12457. Firing prefer*rvt*predict-no*H0
  12458. -->
  12459. Firing elaborate*copy-dir-to-output-link
  12460. -->
  12461. (I3 ^dir R +)
  12462. inner elaboration loop at bottom goal.
  12463. Retracting elaborate*copy-see-to-output-link
  12464. -->
  12465. (I3 ^see 0 +)
  12466. Retracting propose*predict-no
  12467. -->
  12468. (O1976 ^name predict-no +)
  12469. (S1 ^operator O1976 +)
  12470. Retracting propose*predict-yes
  12471. -->
  12472. (O1975 ^name predict-yes +)
  12473. (S1 ^operator O1975 +)
  12474. Retracting elaborate*reward*based*on*reward
  12475. -->
  12476. (R991 ^value 1 +)
  12477. (R1 ^reward R991 +)
  12478. Retracting elaborate*copy-dir-to-output-link
  12479. -->
  12480. (I3 ^dir R +)
  12481. Retracting rl*prefer*rvt*predict-no*H0*4
  12482. -->
  12483. (S1 ^operator O1976 = 0.3397683711152304)
  12484. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12485. -->
  12486. (S1 ^operator O1976 = 0.6602453025755203)
  12487. Retracting rl*prefer*rvt*predict-yes*H0*3
  12488. -->
  12489. (S1 ^operator O1975 = 0.3377118983309207)
  12490. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12491. -->
  12492. (S1 ^operator O1975 = -0.1070236389116304)
  12493. =>WM: (13935: S1 ^operator O1978 +)
  12494. =>WM: (13934: S1 ^operator O1977 +)
  12495. =>WM: (13933: O1978 ^name predict-no)
  12496. =>WM: (13932: O1977 ^name predict-yes)
  12497. =>WM: (13931: R992 ^value 1)
  12498. =>WM: (13930: R1 ^reward R992)
  12499. <=WM: (13921: S1 ^operator O1975 +)
  12500. <=WM: (13922: S1 ^operator O1976 +)
  12501. <=WM: (13923: S1 ^operator O1976)
  12502. <=WM: (13916: R1 ^reward R991)
  12503. <=WM: (13919: O1976 ^name predict-no)
  12504. <=WM: (13918: O1975 ^name predict-yes)
  12505. <=WM: (13917: R991 ^value 1)
  12506. --- Inner Elaboration Phase, active level 1 (S1) ---
  12507. Firing prefer*rvt*predict-yes*H0
  12508. -->
  12509. Firing rl*prefer*rvt*predict-yes*H0*3
  12510. -->
  12511. (S1 ^operator O1977 = 0.3377118983309207)
  12512. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12513. -->
  12514. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12515. -->
  12516. (S1 ^operator O1977 = -0.1028953566115423)
  12517. Firing prefer*rvt*predict-no*H0
  12518. -->
  12519. Firing rl*prefer*rvt*predict-no*H0*4
  12520. -->
  12521. (S1 ^operator O1978 = 0.3397683711152304)
  12522. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12523. -->
  12524. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12525. -->
  12526. (S1 ^operator O1978 = 0.660152441867348)
  12527. inner elaboration loop at bottom goal.
  12528. Retracting rl*prefer*rvt*predict-no*H0*4
  12529. -->
  12530. (S1 ^operator O1976 = 0.3397683711152304)
  12531. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12532. -->
  12533. (S1 ^operator O1976 = 0.660152441867348)
  12534. Retracting rl*prefer*rvt*predict-yes*H0*3
  12535. -->
  12536. (S1 ^operator O1975 = 0.3377118983309207)
  12537. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12538. -->
  12539. (S1 ^operator O1975 = -0.1028953566115423)
  12540. --- END Proposal Phase ---
  12541. --- Decision Phase ---
  12542. RL update rl*prefer*rvt*predict-no*H0*4 0.570252 -0.230483 0.339768 -> 0.570251 -0.230483 0.339767(R,m,v=1,0.874251,0.110598)
  12543. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429763 0.230483 0.660245 -> 0.429761 0.230483 0.660244(R,m,v=1,1,0)
  12544. =>WM: (13936: S1 ^operator O1978)
  12545. 989: O: O1978 (predict-no)
  12546. --- END Decision Phase ---
  12547. --- Application Phase ---
  12548. --- Firing Productions (PE) For State At Depth 1 ---
  12549. --- Inner Elaboration Phase, active level 1 (S1) ---
  12550. Firing apply*operator
  12551. -->
  12552. (I3 ^predict-no N989 + :O )
  12553. Firing apply*operator*complete
  12554. -->
  12555. (I3 ^predict-no N988 - :O )
  12556. inner elaboration loop at bottom goal.
  12557. --- Change Working Memory (PE) ---
  12558. =>WM: (13937: I3 ^predict-no N989)
  12559. <=WM: (13925: N988 ^status complete)
  12560. <=WM: (13924: I3 ^predict-no N988)
  12561. --- Firing Productions (IE) For State At Depth 1 ---
  12562. --- Inner Elaboration Phase, active level 1 (S1) ---
  12563. Firing monitor*world
  12564. -->
  12565. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12566. --- Change Working Memory (IE) ---
  12567. --- END Application Phase ---
  12568. --- Output Phase ---
  12569. ENV: Agent did: predict-no for direction R in state State-B
  12570. In State-B moving R
  12571. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12572. predict error 0
  12573. dir: dir isL
  12574. --- END Output Phase ---
  12575. --- Input Phase ---
  12576. =>WM: (13941: I2 ^dir L)
  12577. =>WM: (13940: I2 ^reward 1)
  12578. =>WM: (13939: I2 ^see 0)
  12579. =>WM: (13938: N989 ^status complete)
  12580. <=WM: (13928: I2 ^dir R)
  12581. <=WM: (13927: I2 ^reward 1)
  12582. <=WM: (13926: I2 ^see 0)
  12583. =>WM: (13942: I2 ^level-1 R0-root)
  12584. <=WM: (13929: I2 ^level-1 R0-root)
  12585. --- END Input Phase ---
  12586. --- Proposal Phase ---
  12587. --- Inner Elaboration Phase, active level 1 (S1) ---
  12588. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12589. -->
  12590. (S1 ^operator O1977 = 0.7358428664482317)
  12591. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12592. -->
  12593. Firing elaborate*copy-see-to-output-link
  12594. -->
  12595. (I3 ^see 0 +)
  12596. Firing elaborate*reward*based*on*reward
  12597. -->
  12598. (R993 ^value 1 +)
  12599. (R1 ^reward R993 +)
  12600. Firing propose*predict-yes
  12601. -->
  12602. (O1979 ^name predict-yes +)
  12603. (S1 ^operator O1979 +)
  12604. Firing propose*predict-no
  12605. -->
  12606. (O1980 ^name predict-no +)
  12607. (S1 ^operator O1980 +)
  12608. Firing rl*prefer*rvt*predict-no*H0*6
  12609. -->
  12610. (S1 ^operator O1978 = 0.999790145818646)
  12611. Firing rl*prefer*rvt*predict-yes*H0*5
  12612. -->
  12613. (S1 ^operator O1977 = 0.264039703522277)
  12614. Firing prefer*rvt*predict-yes*H0
  12615. -->
  12616. Firing prefer*rvt*predict-no*H0
  12617. -->
  12618. Firing elaborate*copy-dir-to-output-link
  12619. -->
  12620. (I3 ^dir L +)
  12621. inner elaboration loop at bottom goal.
  12622. Retracting elaborate*copy-see-to-output-link
  12623. -->
  12624. (I3 ^see 0 +)
  12625. Retracting propose*predict-no
  12626. -->
  12627. (O1978 ^name predict-no +)
  12628. (S1 ^operator O1978 +)
  12629. Retracting propose*predict-yes
  12630. -->
  12631. (O1977 ^name predict-yes +)
  12632. (S1 ^operator O1977 +)
  12633. Retracting elaborate*reward*based*on*reward
  12634. -->
  12635. (R992 ^value 1 +)
  12636. (R1 ^reward R992 +)
  12637. Retracting elaborate*copy-dir-to-output-link
  12638. -->
  12639. (I3 ^dir R +)
  12640. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12641. -->
  12642. (S1 ^operator O1978 = 0.660152441867348)
  12643. Retracting rl*prefer*rvt*predict-no*H0*4
  12644. -->
  12645. (S1 ^operator O1978 = 0.339767253617308)
  12646. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12647. -->
  12648. (S1 ^operator O1977 = -0.1028953566115423)
  12649. Retracting rl*prefer*rvt*predict-yes*H0*3
  12650. -->
  12651. (S1 ^operator O1977 = 0.3377118983309207)
  12652. =>WM: (13949: S1 ^operator O1980 +)
  12653. =>WM: (13948: S1 ^operator O1979 +)
  12654. =>WM: (13947: I3 ^dir L)
  12655. =>WM: (13946: O1980 ^name predict-no)
  12656. =>WM: (13945: O1979 ^name predict-yes)
  12657. =>WM: (13944: R993 ^value 1)
  12658. =>WM: (13943: R1 ^reward R993)
  12659. <=WM: (13934: S1 ^operator O1977 +)
  12660. <=WM: (13935: S1 ^operator O1978 +)
  12661. <=WM: (13936: S1 ^operator O1978)
  12662. <=WM: (13920: I3 ^dir R)
  12663. <=WM: (13930: R1 ^reward R992)
  12664. <=WM: (13933: O1978 ^name predict-no)
  12665. <=WM: (13932: O1977 ^name predict-yes)
  12666. <=WM: (13931: R992 ^value 1)
  12667. --- Inner Elaboration Phase, active level 1 (S1) ---
  12668. Firing prefer*rvt*predict-yes*H0
  12669. -->
  12670. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12671. -->
  12672. (S1 ^operator O1979 = 0.7358428664482317)
  12673. Firing rl*prefer*rvt*predict-yes*H0*5
  12674. -->
  12675. (S1 ^operator O1979 = 0.264039703522277)
  12676. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12677. -->
  12678. Firing prefer*rvt*predict-no*H0
  12679. -->
  12680. Firing rl*prefer*rvt*predict-no*H0*6
  12681. -->
  12682. (S1 ^operator O1980 = 0.999790145818646)
  12683. inner elaboration loop at bottom goal.
  12684. Retracting rl*prefer*rvt*predict-no*H0*6
  12685. -->
  12686. (S1 ^operator O1978 = 0.999790145818646)
  12687. Retracting rl*prefer*rvt*predict-yes*H0*5
  12688. -->
  12689. (S1 ^operator O1977 = 0.264039703522277)
  12690. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12691. -->
  12692. (S1 ^operator O1977 = 0.7358428664482317)
  12693. --- END Proposal Phase ---
  12694. --- Decision Phase ---
  12695. RL update rl*prefer*rvt*predict-no*H0*4 0.570251 -0.230483 0.339767 -> 0.570257 -0.230484 0.339774(R,m,v=1,0.875,0.11003)
  12696. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429665 0.230487 0.660152 -> 0.429673 0.230487 0.66016(R,m,v=1,1,0)
  12697. =>WM: (13950: S1 ^operator O1979)
  12698. 990: O: O1979 (predict-yes)
  12699. --- END Decision Phase ---
  12700. --- Application Phase ---
  12701. --- Firing Productions (PE) For State At Depth 1 ---
  12702. --- Inner Elaboration Phase, active level 1 (S1) ---
  12703. Firing apply*operator
  12704. -->
  12705. (I3 ^predict-yes N990 + :O )
  12706. Firing apply*operator*complete
  12707. -->
  12708. (I3 ^predict-no N989 - :O )
  12709. inner elaboration loop at bottom goal.
  12710. --- Change Working Memory (PE) ---
  12711. =>WM: (13951: I3 ^predict-yes N990)
  12712. <=WM: (13938: N989 ^status complete)
  12713. <=WM: (13937: I3 ^predict-no N989)
  12714. --- Firing Productions (IE) For State At Depth 1 ---
  12715. --- Inner Elaboration Phase, active level 1 (S1) ---
  12716. Firing monitor*world
  12717. -->
  12718. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12719. --- Change Working Memory (IE) ---
  12720. --- END Application Phase ---
  12721. --- Output Phase ---
  12722. ENV: Agent did: predict-yes for direction L in state State-B
  12723. In State-B moving L
  12724. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12725. predict error 0
  12726. dir: dir isU
  12727. --- END Output Phase ---
  12728. --- Input Phase ---
  12729. =>WM: (13955: I2 ^dir U)
  12730. =>WM: (13954: I2 ^reward 1)
  12731. =>WM: (13953: I2 ^see 1)
  12732. =>WM: (13952: N990 ^status complete)
  12733. <=WM: (13941: I2 ^dir L)
  12734. <=WM: (13940: I2 ^reward 1)
  12735. <=WM: (13939: I2 ^see 0)
  12736. =>WM: (13956: I2 ^level-1 L1-root)
  12737. <=WM: (13942: I2 ^level-1 R0-root)
  12738. --- END Input Phase ---
  12739. --- Proposal Phase ---
  12740. --- Inner Elaboration Phase, active level 1 (S1) ---
  12741. Firing elaborate*copy-see-to-output-link
  12742. -->
  12743. (I3 ^see 1 +)
  12744. Firing elaborate*reward*based*on*reward
  12745. -->
  12746. (R994 ^value 1 +)
  12747. (R1 ^reward R994 +)
  12748. Firing propose*predict-yes
  12749. -->
  12750. (O1981 ^name predict-yes +)
  12751. (S1 ^operator O1981 +)
  12752. Firing propose*predict-no
  12753. -->
  12754. (O1982 ^name predict-no +)
  12755. (S1 ^operator O1982 +)
  12756. Firing rl*prefer*rvt*predict-no*H0*2
  12757. -->
  12758. (S1 ^operator O1980 = 1.)
  12759. Firing rl*prefer*rvt*predict-yes*H0*1
  12760. -->
  12761. (S1 ^operator O1979 = 0.)
  12762. Firing prefer*rvt*predict-yes*H0
  12763. -->
  12764. Firing prefer*rvt*predict-no*H0
  12765. -->
  12766. Firing elaborate*copy-dir-to-output-link
  12767. -->
  12768. (I3 ^dir U +)
  12769. inner elaboration loop at bottom goal.
  12770. Retracting elaborate*copy-see-to-output-link
  12771. -->
  12772. (I3 ^see 0 +)
  12773. Retracting propose*predict-no
  12774. -->
  12775. (O1980 ^name predict-no +)
  12776. (S1 ^operator O1980 +)
  12777. Retracting propose*predict-yes
  12778. -->
  12779. (O1979 ^name predict-yes +)
  12780. (S1 ^operator O1979 +)
  12781. Retracting elaborate*reward*based*on*reward
  12782. -->
  12783. (R993 ^value 1 +)
  12784. (R1 ^reward R993 +)
  12785. Retracting elaborate*copy-dir-to-output-link
  12786. -->
  12787. (I3 ^dir L +)
  12788. Retracting rl*prefer*rvt*predict-no*H0*6
  12789. -->
  12790. (S1 ^operator O1980 = 0.999790145818646)
  12791. Retracting rl*prefer*rvt*predict-yes*H0*5
  12792. -->
  12793. (S1 ^operator O1979 = 0.264039703522277)
  12794. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12795. -->
  12796. (S1 ^operator O1979 = 0.7358428664482317)
  12797. =>WM: (13964: S1 ^operator O1982 +)
  12798. =>WM: (13963: S1 ^operator O1981 +)
  12799. =>WM: (13962: I3 ^dir U)
  12800. =>WM: (13961: O1982 ^name predict-no)
  12801. =>WM: (13960: O1981 ^name predict-yes)
  12802. =>WM: (13959: R994 ^value 1)
  12803. =>WM: (13958: R1 ^reward R994)
  12804. =>WM: (13957: I3 ^see 1)
  12805. <=WM: (13948: S1 ^operator O1979 +)
  12806. <=WM: (13950: S1 ^operator O1979)
  12807. <=WM: (13949: S1 ^operator O1980 +)
  12808. <=WM: (13947: I3 ^dir L)
  12809. <=WM: (13943: R1 ^reward R993)
  12810. <=WM: (13915: I3 ^see 0)
  12811. <=WM: (13946: O1980 ^name predict-no)
  12812. <=WM: (13945: O1979 ^name predict-yes)
  12813. <=WM: (13944: R993 ^value 1)
  12814. --- Inner Elaboration Phase, active level 1 (S1) ---
  12815. Firing prefer*rvt*predict-yes*H0
  12816. -->
  12817. Firing rl*prefer*rvt*predict-yes*H0*1
  12818. -->
  12819. (S1 ^operator O1981 = 0.)
  12820. Firing prefer*rvt*predict-no*H0
  12821. -->
  12822. Firing rl*prefer*rvt*predict-no*H0*2
  12823. -->
  12824. (S1 ^operator O1982 = 1.)
  12825. inner elaboration loop at bottom goal.
  12826. Retracting rl*prefer*rvt*predict-no*H0*2
  12827. -->
  12828. (S1 ^operator O1980 = 1.)
  12829. Retracting rl*prefer*rvt*predict-yes*H0*1
  12830. -->
  12831. (S1 ^operator O1979 = 0.)
  12832. --- END Proposal Phase ---
  12833. --- Decision Phase ---
  12834. RL update rl*prefer*rvt*predict-yes*H0*5 0.554425 -0.290385 0.26404 -> 0.554434 -0.290385 0.264049(R,m,v=1,0.876404,0.108932)
  12835. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.44546 0.290383 0.735843 -> 0.445471 0.290384 0.735854(R,m,v=1,1,0)
  12836. =>WM: (13965: S1 ^operator O1982)
  12837. 991: O: O1982 (predict-no)
  12838. --- END Decision Phase ---
  12839. --- Application Phase ---
  12840. --- Firing Productions (PE) For State At Depth 1 ---
  12841. --- Inner Elaboration Phase, active level 1 (S1) ---
  12842. Firing apply*operator
  12843. -->
  12844. (I3 ^predict-no N991 + :O )
  12845. Firing apply*operator*complete
  12846. -->
  12847. (I3 ^predict-yes N990 - :O )
  12848. inner elaboration loop at bottom goal.
  12849. --- Change Working Memory (PE) ---
  12850. =>WM: (13966: I3 ^predict-no N991)
  12851. <=WM: (13952: N990 ^status complete)
  12852. <=WM: (13951: I3 ^predict-yes N990)
  12853. --- Firing Productions (IE) For State At Depth 1 ---
  12854. --- Inner Elaboration Phase, active level 1 (S1) ---
  12855. Firing monitor*world
  12856. -->
  12857. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12858. --- Change Working Memory (IE) ---
  12859. --- END Application Phase ---
  12860. --- Output Phase ---
  12861. ENV: Agent did: predict-no for direction U in state State-A
  12862. In State-A moving U
  12863. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12864. predict error 0
  12865. dir: dir isR
  12866. --- END Output Phase ---
  12867. --- Input Phase ---
  12868. =>WM: (13970: I2 ^dir R)
  12869. =>WM: (13969: I2 ^reward 1)
  12870. =>WM: (13968: I2 ^see 0)
  12871. =>WM: (13967: N991 ^status complete)
  12872. <=WM: (13955: I2 ^dir U)
  12873. <=WM: (13954: I2 ^reward 1)
  12874. <=WM: (13953: I2 ^see 1)
  12875. =>WM: (13971: I2 ^level-1 L1-root)
  12876. <=WM: (13956: I2 ^level-1 L1-root)
  12877. --- END Input Phase ---
  12878. --- Proposal Phase ---
  12879. --- Inner Elaboration Phase, active level 1 (S1) ---
  12880. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12881. -->
  12882. (S1 ^operator O1982 = -0.2714224023553999)
  12883. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12884. -->
  12885. (S1 ^operator O1981 = 0.662219375073587)
  12886. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12887. -->
  12888. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12889. -->
  12890. Firing elaborate*copy-see-to-output-link
  12891. -->
  12892. (I3 ^see 0 +)
  12893. Firing elaborate*reward*based*on*reward
  12894. -->
  12895. (R995 ^value 1 +)
  12896. (R1 ^reward R995 +)
  12897. Firing propose*predict-yes
  12898. -->
  12899. (O1983 ^name predict-yes +)
  12900. (S1 ^operator O1983 +)
  12901. Firing propose*predict-no
  12902. -->
  12903. (O1984 ^name predict-no +)
  12904. (S1 ^operator O1984 +)
  12905. Firing rl*prefer*rvt*predict-no*H0*4
  12906. -->
  12907. (S1 ^operator O1982 = 0.339773810196969)
  12908. Firing rl*prefer*rvt*predict-yes*H0*3
  12909. -->
  12910. (S1 ^operator O1981 = 0.3377118983309207)
  12911. Firing prefer*rvt*predict-yes*H0
  12912. -->
  12913. Firing prefer*rvt*predict-no*H0
  12914. -->
  12915. Firing elaborate*copy-dir-to-output-link
  12916. -->
  12917. (I3 ^dir R +)
  12918. inner elaboration loop at bottom goal.
  12919. Retracting elaborate*copy-see-to-output-link
  12920. -->
  12921. (I3 ^see 1 +)
  12922. Retracting propose*predict-no
  12923. -->
  12924. (O1982 ^name predict-no +)
  12925. (S1 ^operator O1982 +)
  12926. Retracting propose*predict-yes
  12927. -->
  12928. (O1981 ^name predict-yes +)
  12929. (S1 ^operator O1981 +)
  12930. Retracting elaborate*reward*based*on*reward
  12931. -->
  12932. (R994 ^value 1 +)
  12933. (R1 ^reward R994 +)
  12934. Retracting elaborate*copy-dir-to-output-link
  12935. -->
  12936. (I3 ^dir U +)
  12937. Retracting rl*prefer*rvt*predict-no*H0*2
  12938. -->
  12939. (S1 ^operator O1982 = 1.)
  12940. Retracting rl*prefer*rvt*predict-yes*H0*1
  12941. -->
  12942. (S1 ^operator O1981 = 0.)
  12943. =>WM: (13979: S1 ^operator O1984 +)
  12944. =>WM: (13978: S1 ^operator O1983 +)
  12945. =>WM: (13977: I3 ^dir R)
  12946. =>WM: (13976: O1984 ^name predict-no)
  12947. =>WM: (13975: O1983 ^name predict-yes)
  12948. =>WM: (13974: R995 ^value 1)
  12949. =>WM: (13973: R1 ^reward R995)
  12950. =>WM: (13972: I3 ^see 0)
  12951. <=WM: (13963: S1 ^operator O1981 +)
  12952. <=WM: (13964: S1 ^operator O1982 +)
  12953. <=WM: (13965: S1 ^operator O1982)
  12954. <=WM: (13962: I3 ^dir U)
  12955. <=WM: (13958: R1 ^reward R994)
  12956. <=WM: (13957: I3 ^see 1)
  12957. <=WM: (13961: O1982 ^name predict-no)
  12958. <=WM: (13960: O1981 ^name predict-yes)
  12959. <=WM: (13959: R994 ^value 1)
  12960. --- Inner Elaboration Phase, active level 1 (S1) ---
  12961. Firing prefer*rvt*predict-yes*H0
  12962. -->
  12963. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12964. -->
  12965. (S1 ^operator O1983 = 0.662219375073587)
  12966. Firing rl*prefer*rvt*predict-yes*H0*3
  12967. -->
  12968. (S1 ^operator O1983 = 0.3377118983309207)
  12969. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12970. -->
  12971. Firing prefer*rvt*predict-no*H0
  12972. -->
  12973. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12974. -->
  12975. (S1 ^operator O1984 = -0.2714224023553999)
  12976. Firing rl*prefer*rvt*predict-no*H0*4
  12977. -->
  12978. (S1 ^operator O1984 = 0.339773810196969)
  12979. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12980. -->
  12981. inner elaboration loop at bottom goal.
  12982. Retracting rl*prefer*rvt*predict-no*H0*4
  12983. -->
  12984. (S1 ^operator O1982 = 0.339773810196969)
  12985. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12986. -->
  12987. (S1 ^operator O1982 = -0.2714224023553999)
  12988. Retracting rl*prefer*rvt*predict-yes*H0*3
  12989. -->
  12990. (S1 ^operator O1981 = 0.3377118983309207)
  12991. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12992. -->
  12993. (S1 ^operator O1981 = 0.662219375073587)
  12994. --- END Proposal Phase ---
  12995. --- Decision Phase ---
  12996. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12997. =>WM: (13980: S1 ^operator O1983)
  12998. 992: O: O1983 (predict-yes)
  12999. --- END Decision Phase ---
  13000. --- Application Phase ---
  13001. --- Firing Productions (PE) For State At Depth 1 ---
  13002. --- Inner Elaboration Phase, active level 1 (S1) ---
  13003. Firing apply*operator
  13004. -->
  13005. (I3 ^predict-yes N992 + :O )
  13006. Firing apply*operator*complete
  13007. -->
  13008. (I3 ^predict-no N991 - :O )
  13009. inner elaboration loop at bottom goal.
  13010. --- Change Working Memory (PE) ---
  13011. =>WM: (13981: I3 ^predict-yes N992)
  13012. <=WM: (13967: N991 ^status complete)
  13013. <=WM: (13966: I3 ^predict-no N991)
  13014. --- Firing Productions (IE) For State At Depth 1 ---
  13015. --- Inner Elaboration Phase, active level 1 (S1) ---
  13016. Firing monitor*world
  13017. -->
  13018. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13019. --- Change Working Memory (IE) ---
  13020. --- END Application Phase ---
  13021. --- Output Phase ---
  13022. ENV: Agent did: predict-yes for direction R in state State-A
  13023. In State-A moving R
  13024. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13025. predict error 0
  13026. dir: dir isU
  13027. --- END Output Phase ---
  13028. \-/--- Input Phase ---
  13029. =>WM: (13985: I2 ^dir U)
  13030. =>WM: (13984: I2 ^reward 1)
  13031. =>WM: (13983: I2 ^see 1)
  13032. =>WM: (13982: N992 ^status complete)
  13033. <=WM: (13970: I2 ^dir R)
  13034. <=WM: (13969: I2 ^reward 1)
  13035. <=WM: (13968: I2 ^see 0)
  13036. =>WM: (13986: I2 ^level-1 R1-root)
  13037. <=WM: (13971: I2 ^level-1 L1-root)
  13038. --- END Input Phase ---
  13039. --- Proposal Phase ---
  13040. --- Inner Elaboration Phase, active level 1 (S1) ---
  13041. Firing elaborate*copy-see-to-output-link
  13042. -->
  13043. (I3 ^see 1 +)
  13044. Firing elaborate*reward*based*on*reward
  13045. -->
  13046. (R996 ^value 1 +)
  13047. (R1 ^reward R996 +)
  13048. Firing propose*predict-yes
  13049. -->
  13050. (O1985 ^name predict-yes +)
  13051. (S1 ^operator O1985 +)
  13052. Firing propose*predict-no
  13053. -->
  13054. (O1986 ^name predict-no +)
  13055. (S1 ^operator O1986 +)
  13056. Firing rl*prefer*rvt*predict-no*H0*2
  13057. -->
  13058. (S1 ^operator O1984 = 1.)
  13059. Firing rl*prefer*rvt*predict-yes*H0*1
  13060. -->
  13061. (S1 ^operator O1983 = 0.)
  13062. Firing prefer*rvt*predict-yes*H0
  13063. -->
  13064. Firing prefer*rvt*predict-no*H0
  13065. -->
  13066. Firing elaborate*copy-dir-to-output-link
  13067. -->
  13068. (I3 ^dir U +)
  13069. inner elaboration loop at bottom goal.
  13070. Retracting elaborate*copy-see-to-output-link
  13071. -->
  13072. (I3 ^see 0 +)
  13073. Retracting propose*predict-no
  13074. -->
  13075. (O1984 ^name predict-no +)
  13076. (S1 ^operator O1984 +)
  13077. Retracting propose*predict-yes
  13078. -->
  13079. (O1983 ^name predict-yes +)
  13080. (S1 ^operator O1983 +)
  13081. Retracting elaborate*reward*based*on*reward
  13082. -->
  13083. (R995 ^value 1 +)
  13084. (R1 ^reward R995 +)
  13085. Retracting elaborate*copy-dir-to-output-link
  13086. -->
  13087. (I3 ^dir R +)
  13088. Retracting rl*prefer*rvt*predict-no*H0*4
  13089. -->
  13090. (S1 ^operator O1984 = 0.339773810196969)
  13091. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  13092. -->
  13093. (S1 ^operator O1984 = -0.2714224023553999)
  13094. Retracting rl*prefer*rvt*predict-yes*H0*3
  13095. -->
  13096. (S1 ^operator O1983 = 0.3377118983309207)
  13097. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  13098. -->
  13099. (S1 ^operator O1983 = 0.662219375073587)
  13100. =>WM: (13994: S1 ^operator O1986 +)
  13101. =>WM: (13993: S1 ^operator O1985 +)
  13102. =>WM: (13992: I3 ^dir U)
  13103. =>WM: (13991: O1986 ^name predict-no)
  13104. =>WM: (13990: O1985 ^name predict-yes)
  13105. =>WM: (13989: R996 ^value 1)
  13106. =>WM: (13988: R1 ^reward R996)
  13107. =>WM: (13987: I3 ^see 1)
  13108. <=WM: (13978: S1 ^operator O1983 +)
  13109. <=WM: (13980: S1 ^operator O1983)
  13110. <=WM: (13979: S1 ^operator O1984 +)
  13111. <=WM: (13977: I3 ^dir R)
  13112. <=WM: (13973: R1 ^reward R995)
  13113. <=WM: (13972: I3 ^see 0)
  13114. <=WM: (13976: O1984 ^name predict-no)
  13115. <=WM: (13975: O1983 ^name predict-yes)
  13116. <=WM: (13974: R995 ^value 1)
  13117. --- Inner Elaboration Phase, active level 1 (S1) ---
  13118. Firing prefer*rvt*predict-yes*H0
  13119. -->
  13120. Firing rl*prefer*rvt*predict-yes*H0*1
  13121. -->
  13122. (S1 ^operator O1985 = 0.)
  13123. Firing prefer*rvt*predict-no*H0
  13124. -->
  13125. Firing rl*prefer*rvt*predict-no*H0*2
  13126. -->
  13127. (S1 ^operator O1986 = 1.)
  13128. inner elaboration loop at bottom goal.
  13129. Retracting rl*prefer*rvt*predict-no*H0*2
  13130. -->
  13131. (S1 ^operator O1984 = 1.)
  13132. Retracting rl*prefer*rvt*predict-yes*H0*1
  13133. -->
  13134. (S1 ^operator O1983 = 0.)
  13135. --- END Proposal Phase ---
  13136. --- Decision Phase ---
  13137. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.590119 -0.252401 0.337718(R,m,v=1,0.898204,0.0919847)
  13138. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409809 0.252411 0.662219 -> 0.409816 0.25241 0.662226(R,m,v=1,1,0)
  13139. =>WM: (13995: S1 ^operator O1986)
  13140. 993: O: O1986 (predict-no)
  13141. --- END Decision Phase ---
  13142. --- Application Phase ---
  13143. --- Firing Productions (PE) For State At Depth 1 ---
  13144. --- Inner Elaboration Phase, active level 1 (S1) ---
  13145. Firing apply*operator
  13146. -->
  13147. (I3 ^predict-no N993 + :O )
  13148. Firing apply*operator*complete
  13149. -->
  13150. (I3 ^predict-yes N992 - :O )
  13151. inner elaboration loop at bottom goal.
  13152. --- Change Working Memory (PE) ---
  13153. =>WM: (13996: I3 ^predict-no N993)
  13154. <=WM: (13982: N992 ^status complete)
  13155. <=WM: (13981: I3 ^predict-yes N992)
  13156. --- Firing Productions (IE) For State At Depth 1 ---
  13157. --- Inner Elaboration Phase, active level 1 (S1) ---
  13158. Firing monitor*world
  13159. -->
  13160. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13161. --- Change Working Memory (IE) ---
  13162. --- END Application Phase ---
  13163. --- Output Phase ---
  13164. ENV: Agent did: predict-no for direction U in state State-B
  13165. In State-B moving U
  13166. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13167. predict error 0
  13168. dir: dir isL
  13169. --- END Output Phase ---
  13170. |\---- Input Phase ---
  13171. =>WM: (14000: I2 ^dir L)
  13172. =>WM: (13999: I2 ^reward 1)
  13173. =>WM: (13998: I2 ^see 0)
  13174. =>WM: (13997: N993 ^status complete)
  13175. <=WM: (13985: I2 ^dir U)
  13176. <=WM: (13984: I2 ^reward 1)
  13177. <=WM: (13983: I2 ^see 1)
  13178. =>WM: (14001: I2 ^level-1 R1-root)
  13179. <=WM: (13986: I2 ^level-1 R1-root)
  13180. --- END Input Phase ---
  13181. --- Proposal Phase ---
  13182. --- Inner Elaboration Phase, active level 1 (S1) ---
  13183. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13184. -->
  13185. (S1 ^operator O1985 = 0.7362544663116062)
  13186. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13187. -->
  13188. Firing elaborate*copy-see-to-output-link
  13189. -->
  13190. (I3 ^see 0 +)
  13191. Firing elaborate*reward*based*on*reward
  13192. -->
  13193. (R997 ^value 1 +)
  13194. (R1 ^reward R997 +)
  13195. Firing propose*predict-yes
  13196. -->
  13197. (O1987 ^name predict-yes +)
  13198. (S1 ^operator O1987 +)
  13199. Firing propose*predict-no
  13200. -->
  13201. (O1988 ^name predict-no +)
  13202. (S1 ^operator O1988 +)
  13203. Firing rl*prefer*rvt*predict-no*H0*6
  13204. -->
  13205. (S1 ^operator O1986 = 0.999790145818646)
  13206. Firing rl*prefer*rvt*predict-yes*H0*5
  13207. -->
  13208. (S1 ^operator O1985 = 0.2640492015925779)
  13209. Firing prefer*rvt*predict-yes*H0
  13210. -->
  13211. Firing prefer*rvt*predict-no*H0
  13212. -->
  13213. Firing elaborate*copy-dir-to-output-link
  13214. -->
  13215. (I3 ^dir L +)
  13216. inner elaboration loop at bottom goal.
  13217. Retracting elaborate*copy-see-to-output-link
  13218. -->
  13219. (I3 ^see 1 +)
  13220. Retracting propose*predict-no
  13221. -->
  13222. (O1986 ^name predict-no +)
  13223. (S1 ^operator O1986 +)
  13224. Retracting propose*predict-yes
  13225. -->
  13226. (O1985 ^name predict-yes +)
  13227. (S1 ^operator O1985 +)
  13228. Retracting elaborate*reward*based*on*reward
  13229. -->
  13230. (R996 ^value 1 +)
  13231. (R1 ^reward R996 +)
  13232. Retracting elaborate*copy-dir-to-output-link
  13233. -->
  13234. (I3 ^dir U +)
  13235. Retracting rl*prefer*rvt*predict-no*H0*2
  13236. -->
  13237. (S1 ^operator O1986 = 1.)
  13238. Retracting rl*prefer*rvt*predict-yes*H0*1
  13239. -->
  13240. (S1 ^operator O1985 = 0.)
  13241. =>WM: (14009: S1 ^operator O1988 +)
  13242. =>WM: (14008: S1 ^operator O1987 +)
  13243. =>WM: (14007: I3 ^dir L)
  13244. =>WM: (14006: O1988 ^name predict-no)
  13245. =>WM: (14005: O1987 ^name predict-yes)
  13246. =>WM: (14004: R997 ^value 1)
  13247. =>WM: (14003: R1 ^reward R997)
  13248. =>WM: (14002: I3 ^see 0)
  13249. <=WM: (13993: S1 ^operator O1985 +)
  13250. <=WM: (13994: S1 ^operator O1986 +)
  13251. <=WM: (13995: S1 ^operator O1986)
  13252. <=WM: (13992: I3 ^dir U)
  13253. <=WM: (13988: R1 ^reward R996)
  13254. <=WM: (13987: I3 ^see 1)
  13255. <=WM: (13991: O1986 ^name predict-no)
  13256. <=WM: (13990: O1985 ^name predict-yes)
  13257. <=WM: (13989: R996 ^value 1)
  13258. --- Inner Elaboration Phase, active level 1 (S1) ---
  13259. Firing prefer*rvt*predict-yes*H0
  13260. -->
  13261. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13262. -->
  13263. (S1 ^operator O1987 = 0.7362544663116062)
  13264. Firing rl*prefer*rvt*predict-yes*H0*5
  13265. -->
  13266. (S1 ^operator O1987 = 0.2640492015925779)
  13267. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13268. -->
  13269. Firing prefer*rvt*predict-no*H0
  13270. -->
  13271. Firing rl*prefer*rvt*predict-no*H0*6
  13272. -->
  13273. (S1 ^operator O1988 = 0.999790145818646)
  13274. inner elaboration loop at bottom goal.
  13275. Retracting rl*prefer*rvt*predict-no*H0*6
  13276. -->
  13277. (S1 ^operator O1986 = 0.999790145818646)
  13278. Retracting rl*prefer*rvt*predict-yes*H0*5
  13279. -->
  13280. (S1 ^operator O1985 = 0.2640492015925779)
  13281. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13282. -->
  13283. (S1 ^operator O1985 = 0.7362544663116062)
  13284. --- END Proposal Phase ---
  13285. --- Decision Phase ---
  13286. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13287. =>WM: (14010: S1 ^operator O1987)
  13288. 994: O: O1987 (predict-yes)
  13289. --- END Decision Phase ---
  13290. --- Application Phase ---
  13291. --- Firing Productions (PE) For State At Depth 1 ---
  13292. --- Inner Elaboration Phase, active level 1 (S1) ---
  13293. Firing apply*operator
  13294. -->
  13295. (I3 ^predict-yes N994 + :O )
  13296. Firing apply*operator*complete
  13297. -->
  13298. (I3 ^predict-no N993 - :O )
  13299. inner elaboration loop at bottom goal.
  13300. --- Change Working Memory (PE) ---
  13301. =>WM: (14011: I3 ^predict-yes N994)
  13302. <=WM: (13997: N993 ^status complete)
  13303. <=WM: (13996: I3 ^predict-no N993)
  13304. --- Firing Productions (IE) For State At Depth 1 ---
  13305. --- Inner Elaboration Phase, active level 1 (S1) ---
  13306. Firing monitor*world
  13307. -->
  13308. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13309. --- Change Working Memory (IE) ---
  13310. --- END Application Phase ---
  13311. --- Output Phase ---
  13312. ENV: Agent did: predict-yes for direction L in state State-B
  13313. In State-B moving L
  13314. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13315. predict error 0
  13316. dir: dir isL
  13317. --- END Output Phase ---
  13318. /|\--- Input Phase ---
  13319. =>WM: (14015: I2 ^dir L)
  13320. =>WM: (14014: I2 ^reward 1)
  13321. =>WM: (14013: I2 ^see 1)
  13322. =>WM: (14012: N994 ^status complete)
  13323. <=WM: (14000: I2 ^dir L)
  13324. <=WM: (13999: I2 ^reward 1)
  13325. <=WM: (13998: I2 ^see 0)
  13326. =>WM: (14016: I2 ^level-1 L1-root)
  13327. <=WM: (14001: I2 ^level-1 R1-root)
  13328. --- END Input Phase ---
  13329. --- Proposal Phase ---
  13330. --- Inner Elaboration Phase, active level 1 (S1) ---
  13331. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13332. -->
  13333. (S1 ^operator O1987 = -0.181727099742844)
  13334. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13335. -->
  13336. Firing elaborate*copy-see-to-output-link
  13337. -->
  13338. (I3 ^see 1 +)
  13339. Firing elaborate*reward*based*on*reward
  13340. -->
  13341. (R998 ^value 1 +)
  13342. (R1 ^reward R998 +)
  13343. Firing propose*predict-yes
  13344. -->
  13345. (O1989 ^name predict-yes +)
  13346. (S1 ^operator O1989 +)
  13347. Firing propose*predict-no
  13348. -->
  13349. (O1990 ^name predict-no +)
  13350. (S1 ^operator O1990 +)
  13351. Firing rl*prefer*rvt*predict-no*H0*6
  13352. -->
  13353. (S1 ^operator O1988 = 0.999790145818646)
  13354. Firing rl*prefer*rvt*predict-yes*H0*5
  13355. -->
  13356. (S1 ^operator O1987 = 0.2640492015925779)
  13357. Firing prefer*rvt*predict-yes*H0
  13358. -->
  13359. Firing prefer*rvt*predict-no*H0
  13360. -->
  13361. Firing elaborate*copy-dir-to-output-link
  13362. -->
  13363. (I3 ^dir L +)
  13364. inner elaboration loop at bottom goal.
  13365. Retracting elaborate*copy-see-to-output-link
  13366. -->
  13367. (I3 ^see 0 +)
  13368. Retracting propose*predict-no
  13369. -->
  13370. (O1988 ^name predict-no +)
  13371. (S1 ^operator O1988 +)
  13372. Retracting propose*predict-yes
  13373. -->
  13374. (O1987 ^name predict-yes +)
  13375. (S1 ^operator O1987 +)
  13376. Retracting elaborate*reward*based*on*reward
  13377. -->
  13378. (R997 ^value 1 +)
  13379. (R1 ^reward R997 +)
  13380. Retracting elaborate*copy-dir-to-output-link
  13381. -->
  13382. (I3 ^dir L +)
  13383. Retracting rl*prefer*rvt*predict-no*H0*6
  13384. -->
  13385. (S1 ^operator O1988 = 0.999790145818646)
  13386. Retracting rl*prefer*rvt*predict-yes*H0*5
  13387. -->
  13388. (S1 ^operator O1987 = 0.2640492015925779)
  13389. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13390. -->
  13391. (S1 ^operator O1987 = 0.7362544663116062)
  13392. =>WM: (14023: S1 ^operator O1990 +)
  13393. =>WM: (14022: S1 ^operator O1989 +)
  13394. =>WM: (14021: O1990 ^name predict-no)
  13395. =>WM: (14020: O1989 ^name predict-yes)
  13396. =>WM: (14019: R998 ^value 1)
  13397. =>WM: (14018: R1 ^reward R998)
  13398. =>WM: (14017: I3 ^see 1)
  13399. <=WM: (14008: S1 ^operator O1987 +)
  13400. <=WM: (14010: S1 ^operator O1987)
  13401. <=WM: (14009: S1 ^operator O1988 +)
  13402. <=WM: (14003: R1 ^reward R997)
  13403. <=WM: (14002: I3 ^see 0)
  13404. <=WM: (14006: O1988 ^name predict-no)
  13405. <=WM: (14005: O1987 ^name predict-yes)
  13406. <=WM: (14004: R997 ^value 1)
  13407. --- Inner Elaboration Phase, active level 1 (S1) ---
  13408. Firing prefer*rvt*predict-yes*H0
  13409. -->
  13410. Firing rl*prefer*rvt*predict-yes*H0*5
  13411. -->
  13412. (S1 ^operator O1989 = 0.2640492015925779)
  13413. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13414. -->
  13415. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13416. -->
  13417. (S1 ^operator O1989 = -0.181727099742844)
  13418. Firing prefer*rvt*predict-no*H0
  13419. -->
  13420. Firing rl*prefer*rvt*predict-no*H0*6
  13421. -->
  13422. (S1 ^operator O1990 = 0.999790145818646)
  13423. inner elaboration loop at bottom goal.
  13424. Retracting rl*prefer*rvt*predict-no*H0*6
  13425. -->
  13426. (S1 ^operator O1988 = 0.999790145818646)
  13427. Retracting rl*prefer*rvt*predict-yes*H0*5
  13428. -->
  13429. (S1 ^operator O1987 = 0.2640492015925779)
  13430. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13431. -->
  13432. (S1 ^operator O1987 = -0.181727099742844)
  13433. --- END Proposal Phase ---
  13434. --- Decision Phase ---
  13435. RL update rl*prefer*rvt*predict-yes*H0*5 0.554434 -0.290385 0.264049 -> 0.55441 -0.290386 0.264025(R,m,v=1,0.877095,0.108405)
  13436. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445864 0.29039 0.736254 -> 0.445836 0.29039 0.736226(R,m,v=1,1,0)
  13437. =>WM: (14024: S1 ^operator O1990)
  13438. 995: O: O1990 (predict-no)
  13439. --- END Decision Phase ---
  13440. --- Application Phase ---
  13441. --- Firing Productions (PE) For State At Depth 1 ---
  13442. --- Inner Elaboration Phase, active level 1 (S1) ---
  13443. Firing apply*operator
  13444. -->
  13445. (I3 ^predict-no N995 + :O )
  13446. Firing apply*operator*complete
  13447. -->
  13448. (I3 ^predict-yes N994 - :O )
  13449. inner elaboration loop at bottom goal.
  13450. --- Change Working Memory (PE) ---
  13451. =>WM: (14025: I3 ^predict-no N995)
  13452. <=WM: (14012: N994 ^status complete)
  13453. <=WM: (14011: I3 ^predict-yes N994)
  13454. --- Firing Productions (IE) For State At Depth 1 ---
  13455. --- Inner Elaboration Phase, active level 1 (S1) ---
  13456. Firing monitor*world
  13457. -->
  13458. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13459. --- Change Working Memory (IE) ---
  13460. --- END Application Phase ---
  13461. --- Output Phase ---
  13462. ENV: Agent did: predict-no for direction L in state State-A
  13463. In State-A moving L
  13464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13465. predict error 0
  13466. dir: dir isL
  13467. --- END Output Phase ---
  13468. -/|--- Input Phase ---
  13469. =>WM: (14029: I2 ^dir L)
  13470. =>WM: (14028: I2 ^reward 1)
  13471. =>WM: (14027: I2 ^see 0)
  13472. =>WM: (14026: N995 ^status complete)
  13473. <=WM: (14015: I2 ^dir L)
  13474. <=WM: (14014: I2 ^reward 1)
  13475. <=WM: (14013: I2 ^see 1)
  13476. =>WM: (14030: I2 ^level-1 L0-root)
  13477. <=WM: (14016: I2 ^level-1 L1-root)
  13478. --- END Input Phase ---
  13479. --- Proposal Phase ---
  13480. --- Inner Elaboration Phase, active level 1 (S1) ---
  13481. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13482. -->
  13483. (S1 ^operator O1989 = -0.1386470047172653)
  13484. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13485. -->
  13486. Firing elaborate*copy-see-to-output-link
  13487. -->
  13488. (I3 ^see 0 +)
  13489. Firing elaborate*reward*based*on*reward
  13490. -->
  13491. (R999 ^value 1 +)
  13492. (R1 ^reward R999 +)
  13493. Firing propose*predict-yes
  13494. -->
  13495. (O1991 ^name predict-yes +)
  13496. (S1 ^operator O1991 +)
  13497. Firing propose*predict-no
  13498. -->
  13499. (O1992 ^name predict-no +)
  13500. (S1 ^operator O1992 +)
  13501. Firing rl*prefer*rvt*predict-no*H0*6
  13502. -->
  13503. (S1 ^operator O1990 = 0.999790145818646)
  13504. Firing rl*prefer*rvt*predict-yes*H0*5
  13505. -->
  13506. (S1 ^operator O1989 = 0.2640246623191502)
  13507. Firing prefer*rvt*predict-yes*H0
  13508. -->
  13509. Firing prefer*rvt*predict-no*H0
  13510. -->
  13511. Firing elaborate*copy-dir-to-output-link
  13512. -->
  13513. (I3 ^dir L +)
  13514. inner elaboration loop at bottom goal.
  13515. Retracting elaborate*copy-see-to-output-link
  13516. -->
  13517. (I3 ^see 1 +)
  13518. Retracting propose*predict-no
  13519. -->
  13520. (O1990 ^name predict-no +)
  13521. (S1 ^operator O1990 +)
  13522. Retracting propose*predict-yes
  13523. -->
  13524. (O1989 ^name predict-yes +)
  13525. (S1 ^operator O1989 +)
  13526. Retracting elaborate*reward*based*on*reward
  13527. -->
  13528. (R998 ^value 1 +)
  13529. (R1 ^reward R998 +)
  13530. Retracting elaborate*copy-dir-to-output-link
  13531. -->
  13532. (I3 ^dir L +)
  13533. Retracting rl*prefer*rvt*predict-no*H0*6
  13534. -->
  13535. (S1 ^operator O1990 = 0.999790145818646)
  13536. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13537. -->
  13538. (S1 ^operator O1989 = -0.181727099742844)
  13539. Retracting rl*prefer*rvt*predict-yes*H0*5
  13540. -->
  13541. (S1 ^operator O1989 = 0.2640246623191502)
  13542. =>WM: (14037: S1 ^operator O1992 +)
  13543. =>WM: (14036: S1 ^operator O1991 +)
  13544. =>WM: (14035: O1992 ^name predict-no)
  13545. =>WM: (14034: O1991 ^name predict-yes)
  13546. =>WM: (14033: R999 ^value 1)
  13547. =>WM: (14032: R1 ^reward R999)
  13548. =>WM: (14031: I3 ^see 0)
  13549. <=WM: (14022: S1 ^operator O1989 +)
  13550. <=WM: (14023: S1 ^operator O1990 +)
  13551. <=WM: (14024: S1 ^operator O1990)
  13552. <=WM: (14018: R1 ^reward R998)
  13553. <=WM: (14017: I3 ^see 1)
  13554. <=WM: (14021: O1990 ^name predict-no)
  13555. <=WM: (14020: O1989 ^name predict-yes)
  13556. <=WM: (14019: R998 ^value 1)
  13557. --- Inner Elaboration Phase, active level 1 (S1) ---
  13558. Firing prefer*rvt*predict-yes*H0
  13559. -->
  13560. Firing rl*prefer*rvt*predict-yes*H0*5
  13561. -->
  13562. (S1 ^operator O1991 = 0.2640246623191502)
  13563. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13564. -->
  13565. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13566. -->
  13567. (S1 ^operator O1991 = -0.1386470047172653)
  13568. Firing prefer*rvt*predict-no*H0
  13569. -->
  13570. Firing rl*prefer*rvt*predict-no*H0*6
  13571. -->
  13572. (S1 ^operator O1992 = 0.999790145818646)
  13573. inner elaboration loop at bottom goal.
  13574. Retracting rl*prefer*rvt*predict-no*H0*6
  13575. -->
  13576. (S1 ^operator O1990 = 0.999790145818646)
  13577. Retracting rl*prefer*rvt*predict-yes*H0*5
  13578. -->
  13579. (S1 ^operator O1989 = 0.2640246623191502)
  13580. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13581. -->
  13582. (S1 ^operator O1989 = -0.1386470047172653)
  13583. --- END Proposal Phase ---
  13584. --- Decision Phase ---
  13585. RL update rl*prefer*rvt*predict-no*H0*6 0.99979 0 0.99979 -> 0.999825 0 0.999825(R,m,v=1,0.905405,0.0862291)
  13586. =>WM: (14038: S1 ^operator O1992)
  13587. 996: O: O1992 (predict-no)
  13588. --- END Decision Phase ---
  13589. --- Application Phase ---
  13590. --- Firing Productions (PE) For State At Depth 1 ---
  13591. --- Inner Elaboration Phase, active level 1 (S1) ---
  13592. Firing apply*operator
  13593. -->
  13594. (I3 ^predict-no N996 + :O )
  13595. Firing apply*operator*complete
  13596. -->
  13597. (I3 ^predict-no N995 - :O )
  13598. inner elaboration loop at bottom goal.
  13599. --- Change Working Memory (PE) ---
  13600. =>WM: (14039: I3 ^predict-no N996)
  13601. <=WM: (14026: N995 ^status complete)
  13602. <=WM: (14025: I3 ^predict-no N995)
  13603. --- Firing Productions (IE) For State At Depth 1 ---
  13604. --- Inner Elaboration Phase, active level 1 (S1) ---
  13605. Firing monitor*world
  13606. -->
  13607. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13608. --- Change Working Memory (IE) ---
  13609. --- END Application Phase ---
  13610. --- Output Phase ---
  13611. ENV: Agent did: predict-no for direction L in state State-A
  13612. In State-A moving L
  13613. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13614. predict error 0
  13615. dir: dir isL
  13616. --- END Output Phase ---
  13617. \-/--- Input Phase ---
  13618. =>WM: (14043: I2 ^dir L)
  13619. =>WM: (14042: I2 ^reward 1)
  13620. =>WM: (14041: I2 ^see 0)
  13621. =>WM: (14040: N996 ^status complete)
  13622. <=WM: (14029: I2 ^dir L)
  13623. <=WM: (14028: I2 ^reward 1)
  13624. <=WM: (14027: I2 ^see 0)
  13625. =>WM: (14044: I2 ^level-1 L0-root)
  13626. <=WM: (14030: I2 ^level-1 L0-root)
  13627. --- END Input Phase ---
  13628. --- Proposal Phase ---
  13629. --- Inner Elaboration Phase, active level 1 (S1) ---
  13630. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13631. -->
  13632. (S1 ^operator O1991 = -0.1386470047172653)
  13633. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13634. -->
  13635. Firing elaborate*copy-see-to-output-link
  13636. -->
  13637. (I3 ^see 0 +)
  13638. Firing elaborate*reward*based*on*reward
  13639. -->
  13640. (R1000 ^value 1 +)
  13641. (R1 ^reward R1000 +)
  13642. Firing propose*predict-yes
  13643. -->
  13644. (O1993 ^name predict-yes +)
  13645. (S1 ^operator O1993 +)
  13646. Firing propose*predict-no
  13647. -->
  13648. (O1994 ^name predict-no +)
  13649. (S1 ^operator O1994 +)
  13650. Firing rl*prefer*rvt*predict-no*H0*6
  13651. -->
  13652. (S1 ^operator O1992 = 0.9998251377735368)
  13653. Firing rl*prefer*rvt*predict-yes*H0*5
  13654. -->
  13655. (S1 ^operator O1991 = 0.2640246623191502)
  13656. Firing prefer*rvt*predict-yes*H0
  13657. -->
  13658. Firing prefer*rvt*predict-no*H0
  13659. -->
  13660. Firing elaborate*copy-dir-to-output-link
  13661. -->
  13662. (I3 ^dir L +)
  13663. inner elaboration loop at bottom goal.
  13664. Retracting elaborate*copy-see-to-output-link
  13665. -->
  13666. (I3 ^see 0 +)
  13667. Retracting propose*predict-no
  13668. -->
  13669. (O1992 ^name predict-no +)
  13670. (S1 ^operator O1992 +)
  13671. Retracting propose*predict-yes
  13672. -->
  13673. (O1991 ^name predict-yes +)
  13674. (S1 ^operator O1991 +)
  13675. Retracting elaborate*reward*based*on*reward
  13676. -->
  13677. (R999 ^value 1 +)
  13678. (R1 ^reward R999 +)
  13679. Retracting elaborate*copy-dir-to-output-link
  13680. -->
  13681. (I3 ^dir L +)
  13682. Retracting rl*prefer*rvt*predict-no*H0*6
  13683. -->
  13684. (S1 ^operator O1992 = 0.9998251377735368)
  13685. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13686. -->
  13687. (S1 ^operator O1991 = -0.1386470047172653)
  13688. Retracting rl*prefer*rvt*predict-yes*H0*5
  13689. -->
  13690. (S1 ^operator O1991 = 0.2640246623191502)
  13691. =>WM: (14050: S1 ^operator O1994 +)
  13692. =>WM: (14049: S1 ^operator O1993 +)
  13693. =>WM: (14048: O1994 ^name predict-no)
  13694. =>WM: (14047: O1993 ^name predict-yes)
  13695. =>WM: (14046: R1000 ^value 1)
  13696. =>WM: (14045: R1 ^reward R1000)
  13697. <=WM: (14036: S1 ^operator O1991 +)
  13698. <=WM: (14037: S1 ^operator O1992 +)
  13699. <=WM: (14038: S1 ^operator O1992)
  13700. <=WM: (14032: R1 ^reward R999)
  13701. <=WM: (14035: O1992 ^name predict-no)
  13702. <=WM: (14034: O1991 ^name predict-yes)
  13703. <=WM: (14033: R999 ^value 1)
  13704. --- Inner Elaboration Phase, active level 1 (S1) ---
  13705. Firing prefer*rvt*predict-yes*H0
  13706. -->
  13707. Firing rl*prefer*rvt*predict-yes*H0*5
  13708. -->
  13709. (S1 ^operator O1993 = 0.2640246623191502)
  13710. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13711. -->
  13712. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13713. -->
  13714. (S1 ^operator O1993 = -0.1386470047172653)
  13715. Firing prefer*rvt*predict-no*H0
  13716. -->
  13717. Firing rl*prefer*rvt*predict-no*H0*6
  13718. -->
  13719. (S1 ^operator O1994 = 0.9998251377735368)
  13720. inner elaboration loop at bottom goal.
  13721. Retracting rl*prefer*rvt*predict-no*H0*6
  13722. -->
  13723. (S1 ^operator O1992 = 0.9998251377735368)
  13724. Retracting rl*prefer*rvt*predict-yes*H0*5
  13725. -->
  13726. (S1 ^operator O1991 = 0.2640246623191502)
  13727. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13728. -->
  13729. (S1 ^operator O1991 = -0.1386470047172653)
  13730. --- END Proposal Phase ---
  13731. --- Decision Phase ---
  13732. RL update rl*prefer*rvt*predict-no*H0*6 0.999825 0 0.999825 -> 0.999854 0 0.999854(R,m,v=1,0.90604,0.0857065)
  13733. =>WM: (14051: S1 ^operator O1994)
  13734. 997: O: O1994 (predict-no)
  13735. --- END Decision Phase ---
  13736. --- Application Phase ---
  13737. --- Firing Productions (PE) For State At Depth 1 ---
  13738. --- Inner Elaboration Phase, active level 1 (S1) ---
  13739. Firing apply*operator
  13740. -->
  13741. (I3 ^predict-no N997 + :O )
  13742. Firing apply*operator*complete
  13743. -->
  13744. (I3 ^predict-no N996 - :O )
  13745. inner elaboration loop at bottom goal.
  13746. --- Change Working Memory (PE) ---
  13747. =>WM: (14052: I3 ^predict-no N997)
  13748. <=WM: (14040: N996 ^status complete)
  13749. <=WM: (14039: I3 ^predict-no N996)
  13750. --- Firing Productions (IE) For State At Depth 1 ---
  13751. --- Inner Elaboration Phase, active level 1 (S1) ---
  13752. Firing monitor*world
  13753. -->
  13754. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13755. --- Change Working Memory (IE) ---
  13756. --- END Application Phase ---
  13757. --- Output Phase ---
  13758. ENV: Agent did: predict-no for direction L in state State-A
  13759. In State-A moving L
  13760. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13761. predict error 0
  13762. dir: dir isU
  13763. --- END Output Phase ---
  13764. |\---- Input Phase ---
  13765. =>WM: (14056: I2 ^dir U)
  13766. =>WM: (14055: I2 ^reward 1)
  13767. =>WM: (14054: I2 ^see 0)
  13768. =>WM: (14053: N997 ^status complete)
  13769. <=WM: (14043: I2 ^dir L)
  13770. <=WM: (14042: I2 ^reward 1)
  13771. <=WM: (14041: I2 ^see 0)
  13772. =>WM: (14057: I2 ^level-1 L0-root)
  13773. <=WM: (14044: I2 ^level-1 L0-root)
  13774. --- END Input Phase ---
  13775. --- Proposal Phase ---
  13776. --- Inner Elaboration Phase, active level 1 (S1) ---
  13777. Firing elaborate*copy-see-to-output-link
  13778. -->
  13779. (I3 ^see 0 +)
  13780. Firing elaborate*reward*based*on*reward
  13781. -->
  13782. (R1001 ^value 1 +)
  13783. (R1 ^reward R1001 +)
  13784. Firing propose*predict-yes
  13785. -->
  13786. (O1995 ^name predict-yes +)
  13787. (S1 ^operator O1995 +)
  13788. Firing propose*predict-no
  13789. -->
  13790. (O1996 ^name predict-no +)
  13791. (S1 ^operator O1996 +)
  13792. Firing rl*prefer*rvt*predict-no*H0*2
  13793. -->
  13794. (S1 ^operator O1994 = 1.)
  13795. Firing rl*prefer*rvt*predict-yes*H0*1
  13796. -->
  13797. (S1 ^operator O1993 = 0.)
  13798. Firing prefer*rvt*predict-yes*H0
  13799. -->
  13800. Firing prefer*rvt*predict-no*H0
  13801. -->
  13802. Firing elaborate*copy-dir-to-output-link
  13803. -->
  13804. (I3 ^dir U +)
  13805. inner elaboration loop at bottom goal.
  13806. Retracting elaborate*copy-see-to-output-link
  13807. -->
  13808. (I3 ^see 0 +)
  13809. Retracting propose*predict-no
  13810. -->
  13811. (O1994 ^name predict-no +)
  13812. (S1 ^operator O1994 +)
  13813. Retracting propose*predict-yes
  13814. -->
  13815. (O1993 ^name predict-yes +)
  13816. (S1 ^operator O1993 +)
  13817. Retracting elaborate*reward*based*on*reward
  13818. -->
  13819. (R1000 ^value 1 +)
  13820. (R1 ^reward R1000 +)
  13821. Retracting elaborate*copy-dir-to-output-link
  13822. -->
  13823. (I3 ^dir L +)
  13824. Retracting rl*prefer*rvt*predict-no*H0*6
  13825. -->
  13826. (S1 ^operator O1994 = 0.9998542623222174)
  13827. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13828. -->
  13829. (S1 ^operator O1993 = -0.1386470047172653)
  13830. Retracting rl*prefer*rvt*predict-yes*H0*5
  13831. -->
  13832. (S1 ^operator O1993 = 0.2640246623191502)
  13833. =>WM: (14064: S1 ^operator O1996 +)
  13834. =>WM: (14063: S1 ^operator O1995 +)
  13835. =>WM: (14062: I3 ^dir U)
  13836. =>WM: (14061: O1996 ^name predict-no)
  13837. =>WM: (14060: O1995 ^name predict-yes)
  13838. =>WM: (14059: R1001 ^value 1)
  13839. =>WM: (14058: R1 ^reward R1001)
  13840. <=WM: (14049: S1 ^operator O1993 +)
  13841. <=WM: (14050: S1 ^operator O1994 +)
  13842. <=WM: (14051: S1 ^operator O1994)
  13843. <=WM: (14007: I3 ^dir L)
  13844. <=WM: (14045: R1 ^reward R1000)
  13845. <=WM: (14048: O1994 ^name predict-no)
  13846. <=WM: (14047: O1993 ^name predict-yes)
  13847. <=WM: (14046: R1000 ^value 1)
  13848. --- Inner Elaboration Phase, active level 1 (S1) ---
  13849. Firing prefer*rvt*predict-yes*H0
  13850. -->
  13851. Firing rl*prefer*rvt*predict-yes*H0*1
  13852. -->
  13853. (S1 ^operator O1995 = 0.)
  13854. Firing prefer*rvt*predict-no*H0
  13855. -->
  13856. Firing rl*prefer*rvt*predict-no*H0*2
  13857. -->
  13858. (S1 ^operator O1996 = 1.)
  13859. inner elaboration loop at bottom goal.
  13860. Retracting rl*prefer*rvt*predict-no*H0*2
  13861. -->
  13862. (S1 ^operator O1994 = 1.)
  13863. Retracting rl*prefer*rvt*predict-yes*H0*1
  13864. -->
  13865. (S1 ^operator O1993 = 0.)
  13866. --- END Proposal Phase ---
  13867. --- Decision Phase ---
  13868. RL update rl*prefer*rvt*predict-no*H0*6 0.999854 0 0.999854 -> 0.999879 0 0.999879(R,m,v=1,0.906667,0.0851902)
  13869. =>WM: (14065: S1 ^operator O1996)
  13870. 998: O: O1996 (predict-no)
  13871. --- END Decision Phase ---
  13872. --- Application Phase ---
  13873. --- Firing Productions (PE) For State At Depth 1 ---
  13874. --- Inner Elaboration Phase, active level 1 (S1) ---
  13875. Firing apply*operator
  13876. -->
  13877. (I3 ^predict-no N998 + :O )
  13878. Firing apply*operator*complete
  13879. -->
  13880. (I3 ^predict-no N997 - :O )
  13881. inner elaboration loop at bottom goal.
  13882. --- Change Working Memory (PE) ---
  13883. =>WM: (14066: I3 ^predict-no N998)
  13884. <=WM: (14053: N997 ^status complete)
  13885. <=WM: (14052: I3 ^predict-no N997)
  13886. --- Firing Productions (IE) For State At Depth 1 ---
  13887. --- Inner Elaboration Phase, active level 1 (S1) ---
  13888. Firing monitor*world
  13889. -->
  13890. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13891. --- Change Working Memory (IE) ---
  13892. --- END Application Phase ---
  13893. --- Output Phase ---
  13894. ENV: Agent did: predict-no for direction U in state State-A
  13895. In State-A moving U
  13896. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13897. predict error 0
  13898. dir: dir isU
  13899. --- END Output Phase ---
  13900. --- Input Phase ---
  13901. =>WM: (14070: I2 ^dir U)
  13902. =>WM: (14069: I2 ^reward 1)
  13903. =>WM: (14068: I2 ^see 0)
  13904. =>WM: (14067: N998 ^status complete)
  13905. <=WM: (14056: I2 ^dir U)
  13906. <=WM: (14055: I2 ^reward 1)
  13907. <=WM: (14054: I2 ^see 0)
  13908. =>WM: (14071: I2 ^level-1 L0-root)
  13909. <=WM: (14057: I2 ^level-1 L0-root)
  13910. --- END Input Phase ---
  13911. --- Proposal Phase ---
  13912. --- Inner Elaboration Phase, active level 1 (S1) ---
  13913. Firing elaborate*copy-see-to-output-link
  13914. -->
  13915. (I3 ^see 0 +)
  13916. Firing elaborate*reward*based*on*reward
  13917. -->
  13918. (R1002 ^value 1 +)
  13919. (R1 ^reward R1002 +)
  13920. Firing propose*predict-yes
  13921. -->
  13922. (O1997 ^name predict-yes +)
  13923. (S1 ^operator O1997 +)
  13924. Firing propose*predict-no
  13925. -->
  13926. (O1998 ^name predict-no +)
  13927. (S1 ^operator O1998 +)
  13928. Firing rl*prefer*rvt*predict-no*H0*2
  13929. -->
  13930. (S1 ^operator O1996 = 1.)
  13931. Firing rl*prefer*rvt*predict-yes*H0*1
  13932. -->
  13933. (S1 ^operator O1995 = 0.)
  13934. Firing prefer*rvt*predict-yes*H0
  13935. -->
  13936. Firing prefer*rvt*predict-no*H0
  13937. -->
  13938. Firing elaborate*copy-dir-to-output-link
  13939. -->
  13940. (I3 ^dir U +)
  13941. inner elaboration loop at bottom goal.
  13942. Retracting elaborate*copy-see-to-output-link
  13943. -->
  13944. (I3 ^see 0 +)
  13945. Retracting propose*predict-no
  13946. -->
  13947. (O1996 ^name predict-no +)
  13948. (S1 ^operator O1996 +)
  13949. Retracting propose*predict-yes
  13950. -->
  13951. (O1995 ^name predict-yes +)
  13952. (S1 ^operator O1995 +)
  13953. Retracting elaborate*reward*based*on*reward
  13954. -->
  13955. (R1001 ^value 1 +)
  13956. (R1 ^reward R1001 +)
  13957. Retracting elaborate*copy-dir-to-output-link
  13958. -->
  13959. (I3 ^dir U +)
  13960. Retracting rl*prefer*rvt*predict-no*H0*2
  13961. -->
  13962. (S1 ^operator O1996 = 1.)
  13963. Retracting rl*prefer*rvt*predict-yes*H0*1
  13964. -->
  13965. (S1 ^operator O1995 = 0.)
  13966. =>WM: (14077: S1 ^operator O1998 +)
  13967. =>WM: (14076: S1 ^operator O1997 +)
  13968. =>WM: (14075: O1998 ^name predict-no)
  13969. =>WM: (14074: O1997 ^name predict-yes)
  13970. =>WM: (14073: R1002 ^value 1)
  13971. =>WM: (14072: R1 ^reward R1002)
  13972. <=WM: (14063: S1 ^operator O1995 +)
  13973. <=WM: (14064: S1 ^operator O1996 +)
  13974. <=WM: (14065: S1 ^operator O1996)
  13975. <=WM: (14058: R1 ^reward R1001)
  13976. <=WM: (14061: O1996 ^name predict-no)
  13977. <=WM: (14060: O1995 ^name predict-yes)
  13978. <=WM: (14059: R1001 ^value 1)
  13979. --- Inner Elaboration Phase, active level 1 (S1) ---
  13980. Firing prefer*rvt*predict-yes*H0
  13981. -->
  13982. Firing rl*prefer*rvt*predict-yes*H0*1
  13983. -->
  13984. (S1 ^operator O1997 = 0.)
  13985. Firing prefer*rvt*predict-no*H0
  13986. -->
  13987. Firing rl*prefer*rvt*predict-no*H0*2
  13988. -->
  13989. (S1 ^operator O1998 = 1.)
  13990. inner elaboration loop at bottom goal.
  13991. Retracting rl*prefer*rvt*predict-no*H0*2
  13992. -->
  13993. (S1 ^operator O1996 = 1.)
  13994. Retracting rl*prefer*rvt*predict-yes*H0*1
  13995. -->
  13996. (S1 ^operator O1995 = 0.)
  13997. --- END Proposal Phase ---
  13998. --- Decision Phase ---
  13999. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14000. =>WM: (14078: S1 ^operator O1998)
  14001. 999: O: O1998 (predict-no)
  14002. --- END Decision Phase ---
  14003. --- Application Phase ---
  14004. --- Firing Productions (PE) For State At Depth 1 ---
  14005. --- Inner Elaboration Phase, active level 1 (S1) ---
  14006. Firing apply*operator
  14007. -->
  14008. (I3 ^predict-no N999 + :O )
  14009. Firing apply*operator*complete
  14010. -->
  14011. (I3 ^predict-no N998 - :O )
  14012. inner elaboration loop at bottom goal.
  14013. --- Change Working Memory (PE) ---
  14014. =>WM: (14079: I3 ^predict-no N999)
  14015. <=WM: (14067: N998 ^status complete)
  14016. <=WM: (14066: I3 ^predict-no N998)
  14017. --- Firing Productions (IE) For State At Depth 1 ---
  14018. --- Inner Elaboration Phase, active level 1 (S1) ---
  14019. Firing monitor*world
  14020. -->
  14021. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14022. --- Change Working Memory (IE) ---
  14023. --- END Application Phase ---
  14024. --- Output Phase ---
  14025. ENV: Agent did: predict-no for direction U in state State-A
  14026. In State-A moving U
  14027. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14028. predict error 0
  14029. dir: dir isR
  14030. --- END Output Phase ---
  14031. --- Input Phase ---
  14032. =>WM: (14083: I2 ^dir R)
  14033. =>WM: (14082: I2 ^reward 1)
  14034. =>WM: (14081: I2 ^see 0)
  14035. =>WM: (14080: N999 ^status complete)
  14036. <=WM: (14070: I2 ^dir U)
  14037. <=WM: (14069: I2 ^reward 1)
  14038. <=WM: (14068: I2 ^see 0)
  14039. =>WM: (14084: I2 ^level-1 L0-root)
  14040. <=WM: (14071: I2 ^level-1 L0-root)
  14041. --- END Input Phase ---
  14042. --- Proposal Phase ---
  14043. --- Inner Elaboration Phase, active level 1 (S1) ---
  14044. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14045. -->
  14046. (S1 ^operator O1998 = -0.2817060109291377)
  14047. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14048. -->
  14049. (S1 ^operator O1997 = 0.6623525109664488)
  14050. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14051. -->
  14052. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14053. -->
  14054. Firing elaborate*copy-see-to-output-link
  14055. -->
  14056. (I3 ^see 0 +)
  14057. Firing elaborate*reward*based*on*reward
  14058. -->
  14059. (R1003 ^value 1 +)
  14060. (R1 ^reward R1003 +)
  14061. Firing propose*predict-yes
  14062. -->
  14063. (O1999 ^name predict-yes +)
  14064. (S1 ^operator O1999 +)
  14065. Firing propose*predict-no
  14066. -->
  14067. (O2000 ^name predict-no +)
  14068. (S1 ^operator O2000 +)
  14069. Firing rl*prefer*rvt*predict-no*H0*4
  14070. -->
  14071. (S1 ^operator O1998 = 0.339773810196969)
  14072. Firing rl*prefer*rvt*predict-yes*H0*3
  14073. -->
  14074. (S1 ^operator O1997 = 0.337717515090074)
  14075. Firing prefer*rvt*predict-yes*H0
  14076. -->
  14077. Firing prefer*rvt*predict-no*H0
  14078. -->
  14079. Firing elaborate*copy-dir-to-output-link
  14080. -->
  14081. (I3 ^dir R +)
  14082. inner elaboration loop at bottom goal.
  14083. Retracting elaborate*copy-see-to-output-link
  14084. -->
  14085. (I3 ^see 0 +)
  14086. Retracting propose*predict-no
  14087. -->
  14088. (O1998 ^name predict-no +)
  14089. (S1 ^operator O1998 +)
  14090. Retracting propose*predict-yes
  14091. -->
  14092. (O1997 ^name predict-yes +)
  14093. (S1 ^operator O1997 +)
  14094. Retracting elaborate*reward*based*on*reward
  14095. -->
  14096. (R1002 ^value 1 +)
  14097. (R1 ^reward R1002 +)
  14098. Retracting elaborate*copy-dir-to-output-link
  14099. -->
  14100. (I3 ^dir U +)
  14101. Retracting rl*prefer*rvt*predict-no*H0*2
  14102. -->
  14103. (S1 ^operator O1998 = 1.)
  14104. Retracting rl*prefer*rvt*predict-yes*H0*1
  14105. -->
  14106. (S1 ^operator O1997 = 0.)
  14107. =>WM: (14091: S1 ^operator O2000 +)
  14108. =>WM: (14090: S1 ^operator O1999 +)
  14109. =>WM: (14089: I3 ^dir R)
  14110. =>WM: (14088: O2000 ^name predict-no)
  14111. =>WM: (14087: O1999 ^name predict-yes)
  14112. =>WM: (14086: R1003 ^value 1)
  14113. =>WM: (14085: R1 ^reward R1003)
  14114. <=WM: (14076: S1 ^operator O1997 +)
  14115. <=WM: (14077: S1 ^operator O1998 +)
  14116. <=WM: (14078: S1 ^operator O1998)
  14117. <=WM: (14062: I3 ^dir U)
  14118. <=WM: (14072: R1 ^reward R1002)
  14119. <=WM: (14075: O1998 ^name predict-no)
  14120. <=WM: (14074: O1997 ^name predict-yes)
  14121. <=WM: (14073: R1002 ^value 1)
  14122. --- Inner Elaboration Phase, active level 1 (S1) ---
  14123. Firing prefer*rvt*predict-yes*H0
  14124. -->
  14125. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14126. -->
  14127. (S1 ^operator O1999 = 0.6623525109664488)
  14128. Firing rl*prefer*rvt*predict-yes*H0*3
  14129. -->
  14130. (S1 ^operator O1999 = 0.337717515090074)
  14131. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14132. -->
  14133. Firing prefer*rvt*predict-no*H0
  14134. -->
  14135. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14136. -->
  14137. (S1 ^operator O2000 = -0.2817060109291377)
  14138. Firing rl*prefer*rvt*predict-no*H0*4
  14139. -->
  14140. (S1 ^operator O2000 = 0.339773810196969)
  14141. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14142. -->
  14143. inner elaboration loop at bottom goal.
  14144. Retracting rl*prefer*rvt*predict-no*H0*4
  14145. -->
  14146. (S1 ^operator O1998 = 0.339773810196969)
  14147. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14148. -->
  14149. (S1 ^operator O1998 = -0.2817060109291377)
  14150. Retracting rl*prefer*rvt*predict-yes*H0*3
  14151. -->
  14152. (S1 ^operator O1997 = 0.337717515090074)
  14153. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14154. -->
  14155. (S1 ^operator O1997 = 0.6623525109664488)
  14156. --- END Proposal Phase ---
  14157. --- Decision Phase ---
  14158. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14159. =>WM: (14092: S1 ^operator O1999)
  14160. 1000: O: O1999 (predict-yes)
  14161. --- END Decision Phase ---
  14162. --- Application Phase ---
  14163. --- Firing Productions (PE) For State At Depth 1 ---
  14164. --- Inner Elaboration Phase, active level 1 (S1) ---
  14165. Firing apply*operator
  14166. -->
  14167. (I3 ^predict-yes N1000 + :O )
  14168. Firing apply*operator*complete
  14169. -->
  14170. (I3 ^predict-no N999 - :O )
  14171. inner elaboration loop at bottom goal.
  14172. --- Change Working Memory (PE) ---
  14173. =>WM: (14093: I3 ^predict-yes N1000)
  14174. <=WM: (14080: N999 ^status complete)
  14175. <=WM: (14079: I3 ^predict-no N999)
  14176. --- Firing Productions (IE) For State At Depth 1 ---
  14177. --- Inner Elaboration Phase, active level 1 (S1) ---
  14178. Firing monitor*world
  14179. -->
  14180. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14181. --- Change Working Memory (IE) ---
  14182. --- END Application Phase ---
  14183. --- Output Phase ---
  14184. ENV: Agent did: predict-yes for direction R in state State-A
  14185. In State-A moving R
  14186. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14187. predict error 0
  14188. dir: dir isU
  14189. --- END Output Phase ---
  14190. --- Input Phase ---
  14191. =>WM: (14097: I2 ^dir U)
  14192. =>WM: (14096: I2 ^reward 1)
  14193. =>WM: (14095: I2 ^see 1)
  14194. =>WM: (14094: N1000 ^status complete)
  14195. <=WM: (14083: I2 ^dir R)
  14196. <=WM: (14082: I2 ^reward 1)
  14197. <=WM: (14081: I2 ^see 0)
  14198. =>WM: (14098: I2 ^level-1 R1-root)
  14199. <=WM: (14084: I2 ^level-1 L0-root)
  14200. --- END Input Phase ---
  14201. --- Proposal Phase ---
  14202. --- Inner Elaboration Phase, active level 1 (S1) ---
  14203. Firing elaborate*copy-see-to-output-link
  14204. -->
  14205. (I3 ^see 1 +)
  14206. Firing elaborate*reward*based*on*reward
  14207. -->
  14208. (R1004 ^value 1 +)
  14209. (R1 ^reward R1004 +)
  14210. Firing propose*predict-yes
  14211. -->
  14212. (O2001 ^name predict-yes +)
  14213. (S1 ^operator O2001 +)
  14214. Firing propose*predict-no
  14215. -->
  14216. (O2002 ^name predict-no +)
  14217. (S1 ^operator O2002 +)
  14218. Firing rl*prefer*rvt*predict-no*H0*2
  14219. -->
  14220. (S1 ^operator O2000 = 1.)
  14221. Firing rl*prefer*rvt*predict-yes*H0*1
  14222. -->
  14223. (S1 ^operator O1999 = 0.)
  14224. Firing prefer*rvt*predict-yes*H0
  14225. -->
  14226. Firing prefer*rvt*predict-no*H0
  14227. -->
  14228. Firing elaborate*copy-dir-to-output-link
  14229. -->
  14230. (I3 ^dir U +)
  14231. inner elaboration loop at bottom goal.
  14232. Retracting elaborate*copy-see-to-output-link
  14233. -->
  14234. (I3 ^see 0 +)
  14235. Retracting propose*predict-no
  14236. -->
  14237. (O2000 ^name predict-no +)
  14238. (S1 ^operator O2000 +)
  14239. Retracting propose*predict-yes
  14240. -->
  14241. (O1999 ^name predict-yes +)
  14242. (S1 ^operator O1999 +)
  14243. Retracting elaborate*reward*based*on*reward
  14244. -->
  14245. (R1003 ^value 1 +)
  14246. (R1 ^reward R1003 +)
  14247. Retracting elaborate*copy-dir-to-output-link
  14248. -->
  14249. (I3 ^dir R +)
  14250. Retracting rl*prefer*rvt*predict-no*H0*4
  14251. -->
  14252. (S1 ^operator O2000 = 0.339773810196969)
  14253. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14254. -->
  14255. (S1 ^operator O2000 = -0.2817060109291377)
  14256. Retracting rl*prefer*rvt*predict-yes*H0*3
  14257. -->
  14258. (S1 ^operator O1999 = 0.337717515090074)
  14259. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14260. -->
  14261. (S1 ^operator O1999 = 0.6623525109664488)
  14262. =>WM: (14106: S1 ^operator O2002 +)
  14263. =>WM: (14105: S1 ^operator O2001 +)
  14264. =>WM: (14104: I3 ^dir U)
  14265. =>WM: (14103: O2002 ^name predict-no)
  14266. =>WM: (14102: O2001 ^name predict-yes)
  14267. =>WM: (14101: R1004 ^value 1)
  14268. =>WM: (14100: R1 ^reward R1004)
  14269. =>WM: (14099: I3 ^see 1)
  14270. <=WM: (14090: S1 ^operator O1999 +)
  14271. <=WM: (14092: S1 ^operator O1999)
  14272. <=WM: (14091: S1 ^operator O2000 +)
  14273. <=WM: (14089: I3 ^dir R)
  14274. <=WM: (14085: R1 ^reward R1003)
  14275. <=WM: (14031: I3 ^see 0)
  14276. <=WM: (14088: O2000 ^name predict-no)
  14277. <=WM: (14087: O1999 ^name predict-yes)
  14278. <=WM: (14086: R1003 ^value 1)
  14279. --- Inner Elaboration Phase, active level 1 (S1) ---
  14280. Firing prefer*rvt*predict-yes*H0
  14281. -->
  14282. Firing rl*prefer*rvt*predict-yes*H0*1
  14283. -->
  14284. (S1 ^operator O2001 = 0.)
  14285. Firing prefer*rvt*predict-no*H0
  14286. -->
  14287. Firing rl*prefer*rvt*predict-no*H0*2
  14288. -->
  14289. (S1 ^operator O2002 = 1.)
  14290. inner elaboration loop at bottom goal.
  14291. Retracting rl*prefer*rvt*predict-no*H0*2
  14292. -->
  14293. (S1 ^operator O2000 = 1.)
  14294. Retracting rl*prefer*rvt*predict-yes*H0*1
  14295. -->
  14296. (S1 ^operator O1999 = 0.)
  14297. --- END Proposal Phase ---
  14298. --- Decision Phase ---
  14299. RL update rl*prefer*rvt*predict-yes*H0*3 0.590119 -0.252401 0.337718 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.89881,0.0914956)
  14300. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409962 0.25239 0.662353 -> 0.409954 0.252391 0.662346(R,m,v=1,1,0)
  14301. =>WM: (14107: S1 ^operator O2002)
  14302. 1001: O: O2002 (predict-no)
  14303. --- END Decision Phase ---
  14304. --- Application Phase ---
  14305. --- Firing Productions (PE) For State At Depth 1 ---
  14306. --- Inner Elaboration Phase, active level 1 (S1) ---
  14307. Firing apply*operator
  14308. -->
  14309. (I3 ^predict-no N1001 + :O )
  14310. Firing apply*operator*complete
  14311. -->
  14312. (I3 ^predict-yes N1000 - :O )
  14313. inner elaboration loop at bottom goal.
  14314. --- Change Working Memory (PE) ---
  14315. =>WM: (14108: I3 ^predict-no N1001)
  14316. <=WM: (14094: N1000 ^status complete)
  14317. <=WM: (14093: I3 ^predict-yes N1000)
  14318. --- Firing Productions (IE) For State At Depth 1 ---
  14319. --- Inner Elaboration Phase, active level 1 (S1) ---
  14320. Firing monitor*world
  14321. -->
  14322. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14323. --- Change Working Memory (IE) ---
  14324. --- END Application Phase ---
  14325. --- Output Phase ---
  14326. ENV: Agent did: predict-no for direction U in state State-B
  14327. In State-B moving U
  14328. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14329. predict error 0
  14330. dir: dir isU
  14331. --- END Output Phase ---
  14332. --- Input Phase ---
  14333. =>WM: (14112: I2 ^dir U)
  14334. =>WM: (14111: I2 ^reward 1)
  14335. =>WM: (14110: I2 ^see 0)
  14336. =>WM: (14109: N1001 ^status complete)
  14337. <=WM: (14097: I2 ^dir U)
  14338. <=WM: (14096: I2 ^reward 1)
  14339. <=WM: (14095: I2 ^see 1)
  14340. =>WM: (14113: I2 ^level-1 R1-root)
  14341. <=WM: (14098: I2 ^level-1 R1-root)
  14342. --- END Input Phase ---
  14343. --- Proposal Phase ---
  14344. --- Inner Elaboration Phase, active level 1 (S1) ---
  14345. Firing elaborate*copy-see-to-output-link
  14346. -->
  14347. (I3 ^see 0 +)
  14348. Firing elaborate*reward*based*on*reward
  14349. -->
  14350. (R1005 ^value 1 +)
  14351. (R1 ^reward R1005 +)
  14352. Firing propose*predict-yes
  14353. -->
  14354. (O2003 ^name predict-yes +)
  14355. (S1 ^operator O2003 +)
  14356. Firing propose*predict-no
  14357. -->
  14358. (O2004 ^name predict-no +)
  14359. (S1 ^operator O2004 +)
  14360. Firing rl*prefer*rvt*predict-no*H0*2
  14361. -->
  14362. (S1 ^operator O2002 = 1.)
  14363. Firing rl*prefer*rvt*predict-yes*H0*1
  14364. -->
  14365. (S1 ^operator O2001 = 0.)
  14366. Firing prefer*rvt*predict-yes*H0
  14367. -->
  14368. Firing prefer*rvt*predict-no*H0
  14369. -->
  14370. Firing elaborate*copy-dir-to-output-link
  14371. -->
  14372. (I3 ^dir U +)
  14373. inner elaboration loop at bottom goal.
  14374. Retracting elaborate*copy-see-to-output-link
  14375. -->
  14376. (I3 ^see 1 +)
  14377. Retracting propose*predict-no
  14378. -->
  14379. (O2002 ^name predict-no +)
  14380. (S1 ^operator O2002 +)
  14381. Retracting propose*predict-yes
  14382. -->
  14383. (O2001 ^name predict-yes +)
  14384. (S1 ^operator O2001 +)
  14385. Retracting elaborate*reward*based*on*reward
  14386. -->
  14387. (R1004 ^value 1 +)
  14388. (R1 ^reward R1004 +)
  14389. Retracting elaborate*copy-dir-to-output-link
  14390. -->
  14391. (I3 ^dir U +)
  14392. Retracting rl*prefer*rvt*predict-no*H0*2
  14393. -->
  14394. (S1 ^operator O2002 = 1.)
  14395. Retracting rl*prefer*rvt*predict-yes*H0*1
  14396. -->
  14397. (S1 ^operator O2001 = 0.)
  14398. =>WM: (14120: S1 ^operator O2004 +)
  14399. =>WM: (14119: S1 ^operator O2003 +)
  14400. =>WM: (14118: O2004 ^name predict-no)
  14401. =>WM: (14117: O2003 ^name predict-yes)
  14402. =>WM: (14116: R1005 ^value 1)
  14403. =>WM: (14115: R1 ^reward R1005)
  14404. =>WM: (14114: I3 ^see 0)
  14405. <=WM: (14105: S1 ^operator O2001 +)
  14406. <=WM: (14106: S1 ^operator O2002 +)
  14407. <=WM: (14107: S1 ^operator O2002)
  14408. <=WM: (14100: R1 ^reward R1004)
  14409. <=WM: (14099: I3 ^see 1)
  14410. <=WM: (14103: O2002 ^name predict-no)
  14411. <=WM: (14102: O2001 ^name predict-yes)
  14412. <=WM: (14101: R1004 ^value 1)
  14413. --- Inner Elaboration Phase, active level 1 (S1) ---
  14414. Firing prefer*rvt*predict-yes*H0
  14415. -->
  14416. Firing rl*prefer*rvt*predict-yes*H0*1
  14417. -->
  14418. (S1 ^operator O2003 = 0.)
  14419. Firing prefer*rvt*predict-no*H0
  14420. -->
  14421. Firing rl*prefer*rvt*predict-no*H0*2
  14422. -->
  14423. (S1 ^operator O2004 = 1.)
  14424. inner elaboration loop at bottom goal.
  14425. Retracting rl*prefer*rvt*predict-no*H0*2
  14426. -->
  14427. (S1 ^operator O2002 = 1.)
  14428. Retracting rl*prefer*rvt*predict-yes*H0*1
  14429. -->
  14430. (S1 ^operator O2001 = 0.)
  14431. --- END Proposal Phase ---
  14432. --- Decision Phase ---
  14433. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14434. =>WM: (14121: S1 ^operator O2004)
  14435. 1002: O: O2004 (predict-no)
  14436. --- END Decision Phase ---
  14437. --- Application Phase ---
  14438. --- Firing Productions (PE) For State At Depth 1 ---
  14439. --- Inner Elaboration Phase, active level 1 (S1) ---
  14440. Firing apply*operator
  14441. -->
  14442. (I3 ^predict-no N1002 + :O )
  14443. Firing apply*operator*complete
  14444. -->
  14445. (I3 ^predict-no N1001 - :O )
  14446. inner elaboration loop at bottom goal.
  14447. --- Change Working Memory (PE) ---
  14448. =>WM: (14122: I3 ^predict-no N1002)
  14449. <=WM: (14109: N1001 ^status complete)
  14450. <=WM: (14108: I3 ^predict-no N1001)
  14451. --- Firing Productions (IE) For State At Depth 1 ---
  14452. --- Inner Elaboration Phase, active level 1 (S1) ---
  14453. Firing monitor*world
  14454. -->
  14455. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14456. --- Change Working Memory (IE) ---
  14457. --- END Application Phase ---
  14458. --- Output Phase ---
  14459. ENV: Agent did: predict-no for direction U in state State-B
  14460. In State-B moving U
  14461. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14462. predict error 0
  14463. dir: dir isU
  14464. --- END Output Phase ---
  14465. --- Input Phase ---
  14466. =>WM: (14126: I2 ^dir U)
  14467. =>WM: (14125: I2 ^reward 1)
  14468. =>WM: (14124: I2 ^see 0)
  14469. =>WM: (14123: N1002 ^status complete)
  14470. <=WM: (14112: I2 ^dir U)
  14471. <=WM: (14111: I2 ^reward 1)
  14472. <=WM: (14110: I2 ^see 0)
  14473. =>WM: (14127: I2 ^level-1 R1-root)
  14474. <=WM: (14113: I2 ^level-1 R1-root)
  14475. --- END Input Phase ---
  14476. --- Proposal Phase ---
  14477. --- Inner Elaboration Phase, active level 1 (S1) ---
  14478. Firing elaborate*copy-see-to-output-link
  14479. -->
  14480. (I3 ^see 0 +)
  14481. Firing elaborate*reward*based*on*reward
  14482. -->
  14483. (R1006 ^value 1 +)
  14484. (R1 ^reward R1006 +)
  14485. Firing propose*predict-yes
  14486. -->
  14487. (O2005 ^name predict-yes +)
  14488. (S1 ^operator O2005 +)
  14489. Firing propose*predict-no
  14490. -->
  14491. (O2006 ^name predict-no +)
  14492. (S1 ^operator O2006 +)
  14493. Firing rl*prefer*rvt*predict-no*H0*2
  14494. -->
  14495. (S1 ^operator O2004 = 1.)
  14496. Firing rl*prefer*rvt*predict-yes*H0*1
  14497. -->
  14498. (S1 ^operator O2003 = 0.)
  14499. Firing prefer*rvt*predict-yes*H0
  14500. -->
  14501. Firing prefer*rvt*predict-no*H0
  14502. -->
  14503. Firing elaborate*copy-dir-to-output-link
  14504. -->
  14505. (I3 ^dir U +)
  14506. inner elaboration loop at bottom goal.
  14507. Retracting elaborate*copy-see-to-output-link
  14508. -->
  14509. (I3 ^see 0 +)
  14510. Retracting propose*predict-no
  14511. -->
  14512. (O2004 ^name predict-no +)
  14513. (S1 ^operator O2004 +)
  14514. Retracting propose*predict-yes
  14515. -->
  14516. (O2003 ^name predict-yes +)
  14517. (S1 ^operator O2003 +)
  14518. Retracting elaborate*reward*based*on*reward
  14519. -->
  14520. (R1005 ^value 1 +)
  14521. (R1 ^reward R1005 +)
  14522. Retracting elaborate*copy-dir-to-output-link
  14523. -->
  14524. (I3 ^dir U +)
  14525. Retracting rl*prefer*rvt*predict-no*H0*2
  14526. -->
  14527. (S1 ^operator O2004 = 1.)
  14528. Retracting rl*prefer*rvt*predict-yes*H0*1
  14529. -->
  14530. (S1 ^operator O2003 = 0.)
  14531. =>WM: (14133: S1 ^operator O2006 +)
  14532. =>WM: (14132: S1 ^operator O2005 +)
  14533. =>WM: (14131: O2006 ^name predict-no)
  14534. =>WM: (14130: O2005 ^name predict-yes)
  14535. =>WM: (14129: R1006 ^value 1)
  14536. =>WM: (14128: R1 ^reward R1006)
  14537. <=WM: (14119: S1 ^operator O2003 +)
  14538. <=WM: (14120: S1 ^operator O2004 +)
  14539. <=WM: (14121: S1 ^operator O2004)
  14540. <=WM: (14115: R1 ^reward R1005)
  14541. <=WM: (14118: O2004 ^name predict-no)
  14542. <=WM: (14117: O2003 ^name predict-yes)
  14543. <=WM: (14116: R1005 ^value 1)
  14544. --- Inner Elaboration Phase, active level 1 (S1) ---
  14545. Firing prefer*rvt*predict-yes*H0
  14546. -->
  14547. Firing rl*prefer*rvt*predict-yes*H0*1
  14548. -->
  14549. (S1 ^operator O2005 = 0.)
  14550. Firing prefer*rvt*predict-no*H0
  14551. -->
  14552. Firing rl*prefer*rvt*predict-no*H0*2
  14553. -->
  14554. (S1 ^operator O2006 = 1.)
  14555. inner elaboration loop at bottom goal.
  14556. Retracting rl*prefer*rvt*predict-no*H0*2
  14557. -->
  14558. (S1 ^operator O2004 = 1.)
  14559. Retracting rl*prefer*rvt*predict-yes*H0*1
  14560. -->
  14561. (S1 ^operator O2003 = 0.)
  14562. --- END Proposal Phase ---
  14563. --- Decision Phase ---
  14564. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14565. =>WM: (14134: S1 ^operator O2006)
  14566. 1003: O: O2006 (predict-no)
  14567. --- END Decision Phase ---
  14568. --- Application Phase ---
  14569. --- Firing Productions (PE) For State At Depth 1 ---
  14570. --- Inner Elaboration Phase, active level 1 (S1) ---
  14571. Firing apply*operator
  14572. -->
  14573. (I3 ^predict-no N1003 + :O )
  14574. Firing apply*operator*complete
  14575. -->
  14576. (I3 ^predict-no N1002 - :O )
  14577. inner elaboration loop at bottom goal.
  14578. --- Change Working Memory (PE) ---
  14579. =>WM: (14135: I3 ^predict-no N1003)
  14580. <=WM: (14123: N1002 ^status complete)
  14581. <=WM: (14122: I3 ^predict-no N1002)
  14582. --- Firing Productions (IE) For State At Depth 1 ---
  14583. --- Inner Elaboration Phase, active level 1 (S1) ---
  14584. Firing monitor*world
  14585. -->
  14586. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14587. --- Change Working Memory (IE) ---
  14588. --- END Application Phase ---
  14589. --- Output Phase ---
  14590. ENV: Agent did: predict-no for direction U in state State-B
  14591. In State-B moving U
  14592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14593. predict error 0
  14594. dir: dir isU
  14595. --- END Output Phase ---
  14596. --- Input Phase ---
  14597. =>WM: (14139: I2 ^dir U)
  14598. =>WM: (14138: I2 ^reward 1)
  14599. =>WM: (14137: I2 ^see 0)
  14600. =>WM: (14136: N1003 ^status complete)
  14601. <=WM: (14126: I2 ^dir U)
  14602. <=WM: (14125: I2 ^reward 1)
  14603. <=WM: (14124: I2 ^see 0)
  14604. =>WM: (14140: I2 ^level-1 R1-root)
  14605. <=WM: (14127: I2 ^level-1 R1-root)
  14606. --- END Input Phase ---
  14607. --- Proposal Phase ---
  14608. --- Inner Elaboration Phase, active level 1 (S1) ---
  14609. Firing elaborate*copy-see-to-output-link
  14610. -->
  14611. (I3 ^see 0 +)
  14612. Firing elaborate*reward*based*on*reward
  14613. -->
  14614. (R1007 ^value 1 +)
  14615. (R1 ^reward R1007 +)
  14616. Firing propose*predict-yes
  14617. -->
  14618. (O2007 ^name predict-yes +)
  14619. (S1 ^operator O2007 +)
  14620. Firing propose*predict-no
  14621. -->
  14622. (O2008 ^name predict-no +)
  14623. (S1 ^operator O2008 +)
  14624. Firing rl*prefer*rvt*predict-no*H0*2
  14625. -->
  14626. (S1 ^operator O2006 = 1.)
  14627. Firing rl*prefer*rvt*predict-yes*H0*1
  14628. -->
  14629. (S1 ^operator O2005 = 0.)
  14630. Firing prefer*rvt*predict-yes*H0
  14631. -->
  14632. Firing prefer*rvt*predict-no*H0
  14633. -->
  14634. Firing elaborate*copy-dir-to-output-link
  14635. -->
  14636. (I3 ^dir U +)
  14637. inner elaboration loop at bottom goal.
  14638. Retracting elaborate*copy-see-to-output-link
  14639. -->
  14640. (I3 ^see 0 +)
  14641. Retracting propose*predict-no
  14642. -->
  14643. (O2006 ^name predict-no +)
  14644. (S1 ^operator O2006 +)
  14645. Retracting propose*predict-yes
  14646. -->
  14647. (O2005 ^name predict-yes +)
  14648. (S1 ^operator O2005 +)
  14649. Retracting elaborate*reward*based*on*reward
  14650. -->
  14651. (R1006 ^value 1 +)
  14652. (R1 ^reward R1006 +)
  14653. Retracting elaborate*copy-dir-to-output-link
  14654. -->
  14655. (I3 ^dir U +)
  14656. Retracting rl*prefer*rvt*predict-no*H0*2
  14657. -->
  14658. (S1 ^operator O2006 = 1.)
  14659. Retracting rl*prefer*rvt*predict-yes*H0*1
  14660. -->
  14661. (S1 ^operator O2005 = 0.)
  14662. =>WM: (14146: S1 ^operator O2008 +)
  14663. =>WM: (14145: S1 ^operator O2007 +)
  14664. =>WM: (14144: O2008 ^name predict-no)
  14665. =>WM: (14143: O2007 ^name predict-yes)
  14666. =>WM: (14142: R1007 ^value 1)
  14667. =>WM: (14141: R1 ^reward R1007)
  14668. <=WM: (14132: S1 ^operator O2005 +)
  14669. <=WM: (14133: S1 ^operator O2006 +)
  14670. <=WM: (14134: S1 ^operator O2006)
  14671. <=WM: (14128: R1 ^reward R1006)
  14672. <=WM: (14131: O2006 ^name predict-no)
  14673. <=WM: (14130: O2005 ^name predict-yes)
  14674. <=WM: (14129: R1006 ^value 1)
  14675. --- Inner Elaboration Phase, active level 1 (S1) ---
  14676. Firing prefer*rvt*predict-yes*H0
  14677. -->
  14678. Firing rl*prefer*rvt*predict-yes*H0*1
  14679. -->
  14680. (S1 ^operator O2007 = 0.)
  14681. Firing prefer*rvt*predict-no*H0
  14682. -->
  14683. Firing rl*prefer*rvt*predict-no*H0*2
  14684. -->
  14685. (S1 ^operator O2008 = 1.)
  14686. inner elaboration loop at bottom goal.
  14687. Retracting rl*prefer*rvt*predict-no*H0*2
  14688. -->
  14689. (S1 ^operator O2006 = 1.)
  14690. Retracting rl*prefer*rvt*predict-yes*H0*1
  14691. -->
  14692. (S1 ^operator O2005 = 0.)
  14693. --- END Proposal Phase ---
  14694. --- Decision Phase ---
  14695. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14696. =>WM: (14147: S1 ^operator O2008)
  14697. 1004: O: O2008 (predict-no)
  14698. --- END Decision Phase ---
  14699. --- Application Phase ---
  14700. --- Firing Productions (PE) For State At Depth 1 ---
  14701. --- Inner Elaboration Phase, active level 1 (S1) ---
  14702. Firing apply*operator
  14703. -->
  14704. (I3 ^predict-no N1004 + :O )
  14705. Firing apply*operator*complete
  14706. -->
  14707. (I3 ^predict-no N1003 - :O )
  14708. inner elaboration loop at bottom goal.
  14709. --- Change Working Memory (PE) ---
  14710. =>WM: (14148: I3 ^predict-no N1004)
  14711. <=WM: (14136: N1003 ^status complete)
  14712. <=WM: (14135: I3 ^predict-no N1003)
  14713. --- Firing Productions (IE) For State At Depth 1 ---
  14714. --- Inner Elaboration Phase, active level 1 (S1) ---
  14715. Firing monitor*world
  14716. -->
  14717. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14718. --- Change Working Memory (IE) ---
  14719. --- END Application Phase ---
  14720. --- Output Phase ---
  14721. ENV: Agent did: predict-no for direction U in state State-B
  14722. In State-B moving U
  14723. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14724. predict error 0
  14725. dir: dir isL
  14726. --- END Output Phase ---
  14727. |\---- Input Phase ---
  14728. =>WM: (14152: I2 ^dir L)
  14729. =>WM: (14151: I2 ^reward 1)
  14730. =>WM: (14150: I2 ^see 0)
  14731. =>WM: (14149: N1004 ^status complete)
  14732. <=WM: (14139: I2 ^dir U)
  14733. <=WM: (14138: I2 ^reward 1)
  14734. <=WM: (14137: I2 ^see 0)
  14735. =>WM: (14153: I2 ^level-1 R1-root)
  14736. <=WM: (14140: I2 ^level-1 R1-root)
  14737. --- END Input Phase ---
  14738. --- Proposal Phase ---
  14739. --- Inner Elaboration Phase, active level 1 (S1) ---
  14740. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14741. -->
  14742. (S1 ^operator O2007 = 0.7362263199804909)
  14743. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14744. -->
  14745. Firing elaborate*copy-see-to-output-link
  14746. -->
  14747. (I3 ^see 0 +)
  14748. Firing elaborate*reward*based*on*reward
  14749. -->
  14750. (R1008 ^value 1 +)
  14751. (R1 ^reward R1008 +)
  14752. Firing propose*predict-yes
  14753. -->
  14754. (O2009 ^name predict-yes +)
  14755. (S1 ^operator O2009 +)
  14756. Firing propose*predict-no
  14757. -->
  14758. (O2010 ^name predict-no +)
  14759. (S1 ^operator O2010 +)
  14760. Firing rl*prefer*rvt*predict-no*H0*6
  14761. -->
  14762. (S1 ^operator O2008 = 0.9998785089568328)
  14763. Firing rl*prefer*rvt*predict-yes*H0*5
  14764. -->
  14765. (S1 ^operator O2007 = 0.2640246623191502)
  14766. Firing prefer*rvt*predict-yes*H0
  14767. -->
  14768. Firing prefer*rvt*predict-no*H0
  14769. -->
  14770. Firing elaborate*copy-dir-to-output-link
  14771. -->
  14772. (I3 ^dir L +)
  14773. inner elaboration loop at bottom goal.
  14774. Retracting elaborate*copy-see-to-output-link
  14775. -->
  14776. (I3 ^see 0 +)
  14777. Retracting propose*predict-no
  14778. -->
  14779. (O2008 ^name predict-no +)
  14780. (S1 ^operator O2008 +)
  14781. Retracting propose*predict-yes
  14782. -->
  14783. (O2007 ^name predict-yes +)
  14784. (S1 ^operator O2007 +)
  14785. Retracting elaborate*reward*based*on*reward
  14786. -->
  14787. (R1007 ^value 1 +)
  14788. (R1 ^reward R1007 +)
  14789. Retracting elaborate*copy-dir-to-output-link
  14790. -->
  14791. (I3 ^dir U +)
  14792. Retracting rl*prefer*rvt*predict-no*H0*2
  14793. -->
  14794. (S1 ^operator O2008 = 1.)
  14795. Retracting rl*prefer*rvt*predict-yes*H0*1
  14796. -->
  14797. (S1 ^operator O2007 = 0.)
  14798. =>WM: (14160: S1 ^operator O2010 +)
  14799. =>WM: (14159: S1 ^operator O2009 +)
  14800. =>WM: (14158: I3 ^dir L)
  14801. =>WM: (14157: O2010 ^name predict-no)
  14802. =>WM: (14156: O2009 ^name predict-yes)
  14803. =>WM: (14155: R1008 ^value 1)
  14804. =>WM: (14154: R1 ^reward R1008)
  14805. <=WM: (14145: S1 ^operator O2007 +)
  14806. <=WM: (14146: S1 ^operator O2008 +)
  14807. <=WM: (14147: S1 ^operator O2008)
  14808. <=WM: (14104: I3 ^dir U)
  14809. <=WM: (14141: R1 ^reward R1007)
  14810. <=WM: (14144: O2008 ^name predict-no)
  14811. <=WM: (14143: O2007 ^name predict-yes)
  14812. <=WM: (14142: R1007 ^value 1)
  14813. --- Inner Elaboration Phase, active level 1 (S1) ---
  14814. Firing prefer*rvt*predict-yes*H0
  14815. -->
  14816. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14817. -->
  14818. (S1 ^operator O2009 = 0.7362263199804909)
  14819. Firing rl*prefer*rvt*predict-yes*H0*5
  14820. -->
  14821. (S1 ^operator O2009 = 0.2640246623191502)
  14822. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14823. -->
  14824. Firing prefer*rvt*predict-no*H0
  14825. -->
  14826. Firing rl*prefer*rvt*predict-no*H0*6
  14827. -->
  14828. (S1 ^operator O2010 = 0.9998785089568328)
  14829. inner elaboration loop at bottom goal.
  14830. Retracting rl*prefer*rvt*predict-no*H0*6
  14831. -->
  14832. (S1 ^operator O2008 = 0.9998785089568328)
  14833. Retracting rl*prefer*rvt*predict-yes*H0*5
  14834. -->
  14835. (S1 ^operator O2007 = 0.2640246623191502)
  14836. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14837. -->
  14838. (S1 ^operator O2007 = 0.7362263199804909)
  14839. --- END Proposal Phase ---
  14840. --- Decision Phase ---
  14841. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14842. =>WM: (14161: S1 ^operator O2009)
  14843. 1005: O: O2009 (predict-yes)
  14844. --- END Decision Phase ---
  14845. --- Application Phase ---
  14846. --- Firing Productions (PE) For State At Depth 1 ---
  14847. --- Inner Elaboration Phase, active level 1 (S1) ---
  14848. Firing apply*operator
  14849. -->
  14850. (I3 ^predict-yes N1005 + :O )
  14851. Firing apply*operator*complete
  14852. -->
  14853. (I3 ^predict-no N1004 - :O )
  14854. inner elaboration loop at bottom goal.
  14855. --- Change Working Memory (PE) ---
  14856. =>WM: (14162: I3 ^predict-yes N1005)
  14857. <=WM: (14149: N1004 ^status complete)
  14858. <=WM: (14148: I3 ^predict-no N1004)
  14859. --- Firing Productions (IE) For State At Depth 1 ---
  14860. --- Inner Elaboration Phase, active level 1 (S1) ---
  14861. Firing monitor*world
  14862. -->
  14863. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14864. --- Change Working Memory (IE) ---
  14865. --- END Application Phase ---
  14866. --- Output Phase ---
  14867. ENV: Agent did: predict-yes for direction L in state State-B
  14868. In State-B moving L
  14869. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14870. predict error 0
  14871. dir: dir isR
  14872. --- END Output Phase ---
  14873. /|\--- Input Phase ---
  14874. =>WM: (14166: I2 ^dir R)
  14875. =>WM: (14165: I2 ^reward 1)
  14876. =>WM: (14164: I2 ^see 1)
  14877. =>WM: (14163: N1005 ^status complete)
  14878. <=WM: (14152: I2 ^dir L)
  14879. <=WM: (14151: I2 ^reward 1)
  14880. <=WM: (14150: I2 ^see 0)
  14881. =>WM: (14167: I2 ^level-1 L1-root)
  14882. <=WM: (14153: I2 ^level-1 R1-root)
  14883. --- END Input Phase ---
  14884. --- Proposal Phase ---
  14885. --- Inner Elaboration Phase, active level 1 (S1) ---
  14886. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14887. -->
  14888. (S1 ^operator O2010 = -0.2714224023553999)
  14889. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  14890. -->
  14891. (S1 ^operator O2009 = 0.6622259046932006)
  14892. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14893. -->
  14894. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14895. -->
  14896. Firing elaborate*copy-see-to-output-link
  14897. -->
  14898. (I3 ^see 1 +)
  14899. Firing elaborate*reward*based*on*reward
  14900. -->
  14901. (R1009 ^value 1 +)
  14902. (R1 ^reward R1009 +)
  14903. Firing propose*predict-yes
  14904. -->
  14905. (O2011 ^name predict-yes +)
  14906. (S1 ^operator O2011 +)
  14907. Firing propose*predict-no
  14908. -->
  14909. (O2012 ^name predict-no +)
  14910. (S1 ^operator O2012 +)
  14911. Firing rl*prefer*rvt*predict-no*H0*4
  14912. -->
  14913. (S1 ^operator O2010 = 0.339773810196969)
  14914. Firing rl*prefer*rvt*predict-yes*H0*3
  14915. -->
  14916. (S1 ^operator O2009 = 0.3377117977102235)
  14917. Firing prefer*rvt*predict-yes*H0
  14918. -->
  14919. Firing prefer*rvt*predict-no*H0
  14920. -->
  14921. Firing elaborate*copy-dir-to-output-link
  14922. -->
  14923. (I3 ^dir R +)
  14924. inner elaboration loop at bottom goal.
  14925. Retracting elaborate*copy-see-to-output-link
  14926. -->
  14927. (I3 ^see 0 +)
  14928. Retracting propose*predict-no
  14929. -->
  14930. (O2010 ^name predict-no +)
  14931. (S1 ^operator O2010 +)
  14932. Retracting propose*predict-yes
  14933. -->
  14934. (O2009 ^name predict-yes +)
  14935. (S1 ^operator O2009 +)
  14936. Retracting elaborate*reward*based*on*reward
  14937. -->
  14938. (R1008 ^value 1 +)
  14939. (R1 ^reward R1008 +)
  14940. Retracting elaborate*copy-dir-to-output-link
  14941. -->
  14942. (I3 ^dir L +)
  14943. Retracting rl*prefer*rvt*predict-no*H0*6
  14944. -->
  14945. (S1 ^operator O2010 = 0.9998785089568328)
  14946. Retracting rl*prefer*rvt*predict-yes*H0*5
  14947. -->
  14948. (S1 ^operator O2009 = 0.2640246623191502)
  14949. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14950. -->
  14951. (S1 ^operator O2009 = 0.7362263199804909)
  14952. =>WM: (14175: S1 ^operator O2012 +)
  14953. =>WM: (14174: S1 ^operator O2011 +)
  14954. =>WM: (14173: I3 ^dir R)
  14955. =>WM: (14172: O2012 ^name predict-no)
  14956. =>WM: (14171: O2011 ^name predict-yes)
  14957. =>WM: (14170: R1009 ^value 1)
  14958. =>WM: (14169: R1 ^reward R1009)
  14959. =>WM: (14168: I3 ^see 1)
  14960. <=WM: (14159: S1 ^operator O2009 +)
  14961. <=WM: (14161: S1 ^operator O2009)
  14962. <=WM: (14160: S1 ^operator O2010 +)
  14963. <=WM: (14158: I3 ^dir L)
  14964. <=WM: (14154: R1 ^reward R1008)
  14965. <=WM: (14114: I3 ^see 0)
  14966. <=WM: (14157: O2010 ^name predict-no)
  14967. <=WM: (14156: O2009 ^name predict-yes)
  14968. <=WM: (14155: R1008 ^value 1)
  14969. --- Inner Elaboration Phase, active level 1 (S1) ---
  14970. Firing prefer*rvt*predict-yes*H0
  14971. -->
  14972. Firing rl*prefer*rvt*predict-yes*H0*3
  14973. -->
  14974. (S1 ^operator O2011 = 0.3377117977102235)
  14975. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14976. -->
  14977. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  14978. -->
  14979. (S1 ^operator O2011 = 0.6622259046932006)
  14980. Firing prefer*rvt*predict-no*H0
  14981. -->
  14982. Firing rl*prefer*rvt*predict-no*H0*4
  14983. -->
  14984. (S1 ^operator O2012 = 0.339773810196969)
  14985. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14986. -->
  14987. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14988. -->
  14989. (S1 ^operator O2012 = -0.2714224023553999)
  14990. inner elaboration loop at bottom goal.
  14991. Retracting rl*prefer*rvt*predict-no*H0*4
  14992. -->
  14993. (S1 ^operator O2010 = 0.339773810196969)
  14994. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14995. -->
  14996. (S1 ^operator O2010 = -0.2714224023553999)
  14997. Retracting rl*prefer*rvt*predict-yes*H0*3
  14998. -->
  14999. (S1 ^operator O2009 = 0.3377117977102235)
  15000. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  15001. -->
  15002. (S1 ^operator O2009 = 0.6622259046932006)
  15003. --- END Proposal Phase ---
  15004. --- Decision Phase ---
  15005. RL update rl*prefer*rvt*predict-yes*H0*5 0.55441 -0.290386 0.264025 -> 0.55439 -0.290386 0.264004(R,m,v=1,0.877778,0.107883)
  15006. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445836 0.29039 0.736226 -> 0.445814 0.290389 0.736203(R,m,v=1,1,0)
  15007. =>WM: (14176: S1 ^operator O2011)
  15008. 1006: O: O2011 (predict-yes)
  15009. --- END Decision Phase ---
  15010. --- Application Phase ---
  15011. --- Firing Productions (PE) For State At Depth 1 ---
  15012. --- Inner Elaboration Phase, active level 1 (S1) ---
  15013. Firing apply*operator
  15014. -->
  15015. (I3 ^predict-yes N1006 + :O )
  15016. Firing apply*operator*complete
  15017. -->
  15018. (I3 ^predict-yes N1005 - :O )
  15019. inner elaboration loop at bottom goal.
  15020. --- Change Working Memory (PE) ---
  15021. =>WM: (14177: I3 ^predict-yes N1006)
  15022. <=WM: (14163: N1005 ^status complete)
  15023. <=WM: (14162: I3 ^predict-yes N1005)
  15024. --- Firing Productions (IE) For State At Depth 1 ---
  15025. --- Inner Elaboration Phase, active level 1 (S1) ---
  15026. Firing monitor*world
  15027. -->
  15028. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15029. --- Change Working Memory (IE) ---
  15030. --- END Application Phase ---
  15031. --- Output Phase ---
  15032. ENV: Agent did: predict-yes for direction R in state State-A
  15033. In State-A moving R
  15034. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15035. predict error 0
  15036. dir: dir isR
  15037. --- END Output Phase ---
  15038. -/|--- Input Phase ---
  15039. =>WM: (14181: I2 ^dir R)
  15040. =>WM: (14180: I2 ^reward 1)
  15041. =>WM: (14179: I2 ^see 1)
  15042. =>WM: (14178: N1006 ^status complete)
  15043. <=WM: (14166: I2 ^dir R)
  15044. <=WM: (14165: I2 ^reward 1)
  15045. <=WM: (14164: I2 ^see 1)
  15046. =>WM: (14182: I2 ^level-1 R1-root)
  15047. <=WM: (14167: I2 ^level-1 L1-root)
  15048. --- END Input Phase ---
  15049. --- Proposal Phase ---
  15050. --- Inner Elaboration Phase, active level 1 (S1) ---
  15051. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15052. -->
  15053. (S1 ^operator O2011 = -0.1070236389116304)
  15054. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15055. -->
  15056. (S1 ^operator O2012 = 0.6602439963649246)
  15057. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15058. -->
  15059. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15060. -->
  15061. Firing elaborate*copy-see-to-output-link
  15062. -->
  15063. (I3 ^see 1 +)
  15064. Firing elaborate*reward*based*on*reward
  15065. -->
  15066. (R1010 ^value 1 +)
  15067. (R1 ^reward R1010 +)
  15068. Firing propose*predict-yes
  15069. -->
  15070. (O2013 ^name predict-yes +)
  15071. (S1 ^operator O2013 +)
  15072. Firing propose*predict-no
  15073. -->
  15074. (O2014 ^name predict-no +)
  15075. (S1 ^operator O2014 +)
  15076. Firing rl*prefer*rvt*predict-no*H0*4
  15077. -->
  15078. (S1 ^operator O2012 = 0.339773810196969)
  15079. Firing rl*prefer*rvt*predict-yes*H0*3
  15080. -->
  15081. (S1 ^operator O2011 = 0.3377117977102235)
  15082. Firing prefer*rvt*predict-yes*H0
  15083. -->
  15084. Firing prefer*rvt*predict-no*H0
  15085. -->
  15086. Firing elaborate*copy-dir-to-output-link
  15087. -->
  15088. (I3 ^dir R +)
  15089. inner elaboration loop at bottom goal.
  15090. Retracting elaborate*copy-see-to-output-link
  15091. -->
  15092. (I3 ^see 1 +)
  15093. Retracting propose*predict-no
  15094. -->
  15095. (O2012 ^name predict-no +)
  15096. (S1 ^operator O2012 +)
  15097. Retracting propose*predict-yes
  15098. -->
  15099. (O2011 ^name predict-yes +)
  15100. (S1 ^operator O2011 +)
  15101. Retracting elaborate*reward*based*on*reward
  15102. -->
  15103. (R1009 ^value 1 +)
  15104. (R1 ^reward R1009 +)
  15105. Retracting elaborate*copy-dir-to-output-link
  15106. -->
  15107. (I3 ^dir R +)
  15108. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  15109. -->
  15110. (S1 ^operator O2012 = -0.2714224023553999)
  15111. Retracting rl*prefer*rvt*predict-no*H0*4
  15112. -->
  15113. (S1 ^operator O2012 = 0.339773810196969)
  15114. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  15115. -->
  15116. (S1 ^operator O2011 = 0.6622259046932006)
  15117. Retracting rl*prefer*rvt*predict-yes*H0*3
  15118. -->
  15119. (S1 ^operator O2011 = 0.3377117977102235)
  15120. =>WM: (14188: S1 ^operator O2014 +)
  15121. =>WM: (14187: S1 ^operator O2013 +)
  15122. =>WM: (14186: O2014 ^name predict-no)
  15123. =>WM: (14185: O2013 ^name predict-yes)
  15124. =>WM: (14184: R1010 ^value 1)
  15125. =>WM: (14183: R1 ^reward R1010)
  15126. <=WM: (14174: S1 ^operator O2011 +)
  15127. <=WM: (14176: S1 ^operator O2011)
  15128. <=WM: (14175: S1 ^operator O2012 +)
  15129. <=WM: (14169: R1 ^reward R1009)
  15130. <=WM: (14172: O2012 ^name predict-no)
  15131. <=WM: (14171: O2011 ^name predict-yes)
  15132. <=WM: (14170: R1009 ^value 1)
  15133. --- Inner Elaboration Phase, active level 1 (S1) ---
  15134. Firing prefer*rvt*predict-yes*H0
  15135. -->
  15136. Firing rl*prefer*rvt*predict-yes*H0*3
  15137. -->
  15138. (S1 ^operator O2013 = 0.3377117977102235)
  15139. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15140. -->
  15141. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15142. -->
  15143. (S1 ^operator O2013 = -0.1070236389116304)
  15144. Firing prefer*rvt*predict-no*H0
  15145. -->
  15146. Firing rl*prefer*rvt*predict-no*H0*4
  15147. -->
  15148. (S1 ^operator O2014 = 0.339773810196969)
  15149. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15150. -->
  15151. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15152. -->
  15153. (S1 ^operator O2014 = 0.6602439963649246)
  15154. inner elaboration loop at bottom goal.
  15155. Retracting rl*prefer*rvt*predict-no*H0*4
  15156. -->
  15157. (S1 ^operator O2012 = 0.339773810196969)
  15158. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15159. -->
  15160. (S1 ^operator O2012 = 0.6602439963649246)
  15161. Retracting rl*prefer*rvt*predict-yes*H0*3
  15162. -->
  15163. (S1 ^operator O2011 = 0.3377117977102235)
  15164. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15165. -->
  15166. (S1 ^operator O2011 = -0.1070236389116304)
  15167. --- END Proposal Phase ---
  15168. --- Decision Phase ---
  15169. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.590118 -0.252401 0.337717(R,m,v=1,0.899408,0.0910116)
  15170. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409816 0.25241 0.662226 -> 0.409823 0.252409 0.662232(R,m,v=1,1,0)
  15171. =>WM: (14189: S1 ^operator O2014)
  15172. 1007: O: O2014 (predict-no)
  15173. --- END Decision Phase ---
  15174. --- Application Phase ---
  15175. --- Firing Productions (PE) For State At Depth 1 ---
  15176. --- Inner Elaboration Phase, active level 1 (S1) ---
  15177. Firing apply*operator
  15178. -->
  15179. (I3 ^predict-no N1007 + :O )
  15180. Firing apply*operator*complete
  15181. -->
  15182. (I3 ^predict-yes N1006 - :O )
  15183. inner elaboration loop at bottom goal.
  15184. --- Change Working Memory (PE) ---
  15185. =>WM: (14190: I3 ^predict-no N1007)
  15186. <=WM: (14178: N1006 ^status complete)
  15187. <=WM: (14177: I3 ^predict-yes N1006)
  15188. --- Firing Productions (IE) For State At Depth 1 ---
  15189. --- Inner Elaboration Phase, active level 1 (S1) ---
  15190. Firing monitor*world
  15191. -->
  15192. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15193. --- Change Working Memory (IE) ---
  15194. --- END Application Phase ---
  15195. --- Output Phase ---
  15196. ENV: Agent did: predict-no for direction R in state State-B
  15197. In State-B moving R
  15198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15199. predict error 0
  15200. dir: dir isU
  15201. --- END Output Phase ---
  15202. \-/--- Input Phase ---
  15203. =>WM: (14194: I2 ^dir U)
  15204. =>WM: (14193: I2 ^reward 1)
  15205. =>WM: (14192: I2 ^see 0)
  15206. =>WM: (14191: N1007 ^status complete)
  15207. <=WM: (14181: I2 ^dir R)
  15208. <=WM: (14180: I2 ^reward 1)
  15209. <=WM: (14179: I2 ^see 1)
  15210. =>WM: (14195: I2 ^level-1 R0-root)
  15211. <=WM: (14182: I2 ^level-1 R1-root)
  15212. --- END Input Phase ---
  15213. --- Proposal Phase ---
  15214. --- Inner Elaboration Phase, active level 1 (S1) ---
  15215. Firing elaborate*copy-see-to-output-link
  15216. -->
  15217. (I3 ^see 0 +)
  15218. Firing elaborate*reward*based*on*reward
  15219. -->
  15220. (R1011 ^value 1 +)
  15221. (R1 ^reward R1011 +)
  15222. Firing propose*predict-yes
  15223. -->
  15224. (O2015 ^name predict-yes +)
  15225. (S1 ^operator O2015 +)
  15226. Firing propose*predict-no
  15227. -->
  15228. (O2016 ^name predict-no +)
  15229. (S1 ^operator O2016 +)
  15230. Firing rl*prefer*rvt*predict-no*H0*2
  15231. -->
  15232. (S1 ^operator O2014 = 1.)
  15233. Firing rl*prefer*rvt*predict-yes*H0*1
  15234. -->
  15235. (S1 ^operator O2013 = 0.)
  15236. Firing prefer*rvt*predict-yes*H0
  15237. -->
  15238. Firing prefer*rvt*predict-no*H0
  15239. -->
  15240. Firing elaborate*copy-dir-to-output-link
  15241. -->
  15242. (I3 ^dir U +)
  15243. inner elaboration loop at bottom goal.
  15244. Retracting elaborate*copy-see-to-output-link
  15245. -->
  15246. (I3 ^see 1 +)
  15247. Retracting propose*predict-no
  15248. -->
  15249. (O2014 ^name predict-no +)
  15250. (S1 ^operator O2014 +)
  15251. Retracting propose*predict-yes
  15252. -->
  15253. (O2013 ^name predict-yes +)
  15254. (S1 ^operator O2013 +)
  15255. Retracting elaborate*reward*based*on*reward
  15256. -->
  15257. (R1010 ^value 1 +)
  15258. (R1 ^reward R1010 +)
  15259. Retracting elaborate*copy-dir-to-output-link
  15260. -->
  15261. (I3 ^dir R +)
  15262. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15263. -->
  15264. (S1 ^operator O2014 = 0.6602439963649246)
  15265. Retracting rl*prefer*rvt*predict-no*H0*4
  15266. -->
  15267. (S1 ^operator O2014 = 0.339773810196969)
  15268. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15269. -->
  15270. (S1 ^operator O2013 = -0.1070236389116304)
  15271. Retracting rl*prefer*rvt*predict-yes*H0*3
  15272. -->
  15273. (S1 ^operator O2013 = 0.3377168791642142)
  15274. =>WM: (14203: S1 ^operator O2016 +)
  15275. =>WM: (14202: S1 ^operator O2015 +)
  15276. =>WM: (14201: I3 ^dir U)
  15277. =>WM: (14200: O2016 ^name predict-no)
  15278. =>WM: (14199: O2015 ^name predict-yes)
  15279. =>WM: (14198: R1011 ^value 1)
  15280. =>WM: (14197: R1 ^reward R1011)
  15281. =>WM: (14196: I3 ^see 0)
  15282. <=WM: (14187: S1 ^operator O2013 +)
  15283. <=WM: (14188: S1 ^operator O2014 +)
  15284. <=WM: (14189: S1 ^operator O2014)
  15285. <=WM: (14173: I3 ^dir R)
  15286. <=WM: (14183: R1 ^reward R1010)
  15287. <=WM: (14168: I3 ^see 1)
  15288. <=WM: (14186: O2014 ^name predict-no)
  15289. <=WM: (14185: O2013 ^name predict-yes)
  15290. <=WM: (14184: R1010 ^value 1)
  15291. --- Inner Elaboration Phase, active level 1 (S1) ---
  15292. Firing prefer*rvt*predict-yes*H0
  15293. -->
  15294. Firing rl*prefer*rvt*predict-yes*H0*1
  15295. -->
  15296. (S1 ^operator O2015 = 0.)
  15297. Firing prefer*rvt*predict-no*H0
  15298. -->
  15299. Firing rl*prefer*rvt*predict-no*H0*2
  15300. -->
  15301. (S1 ^operator O2016 = 1.)
  15302. inner elaboration loop at bottom goal.
  15303. Retracting rl*prefer*rvt*predict-no*H0*2
  15304. -->
  15305. (S1 ^operator O2014 = 1.)
  15306. Retracting rl*prefer*rvt*predict-yes*H0*1
  15307. -->
  15308. (S1 ^operator O2013 = 0.)
  15309. --- END Proposal Phase ---
  15310. --- Decision Phase ---
  15311. RL update rl*prefer*rvt*predict-no*H0*4 0.570257 -0.230484 0.339774 -> 0.570256 -0.230484 0.339772(R,m,v=1,0.87574,0.109467)
  15312. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429761 0.230483 0.660244 -> 0.429759 0.230483 0.660242(R,m,v=1,1,0)
  15313. =>WM: (14204: S1 ^operator O2016)
  15314. 1008: O: O2016 (predict-no)
  15315. --- END Decision Phase ---
  15316. --- Application Phase ---
  15317. --- Firing Productions (PE) For State At Depth 1 ---
  15318. --- Inner Elaboration Phase, active level 1 (S1) ---
  15319. Firing apply*operator
  15320. -->
  15321. (I3 ^predict-no N1008 + :O )
  15322. Firing apply*operator*complete
  15323. -->
  15324. (I3 ^predict-no N1007 - :O )
  15325. inner elaboration loop at bottom goal.
  15326. --- Change Working Memory (PE) ---
  15327. =>WM: (14205: I3 ^predict-no N1008)
  15328. <=WM: (14191: N1007 ^status complete)
  15329. <=WM: (14190: I3 ^predict-no N1007)
  15330. --- Firing Productions (IE) For State At Depth 1 ---
  15331. --- Inner Elaboration Phase, active level 1 (S1) ---
  15332. Firing monitor*world
  15333. -->
  15334. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15335. --- Change Working Memory (IE) ---
  15336. --- END Application Phase ---
  15337. --- Output Phase ---
  15338. ENV: Agent did: predict-no for direction U in state State-B
  15339. In State-B moving U
  15340. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15341. predict error 0
  15342. dir: dir isL
  15343. --- END Output Phase ---
  15344. |\---- Input Phase ---
  15345. =>WM: (14209: I2 ^dir L)
  15346. =>WM: (14208: I2 ^reward 1)
  15347. =>WM: (14207: I2 ^see 0)
  15348. =>WM: (14206: N1008 ^status complete)
  15349. <=WM: (14194: I2 ^dir U)
  15350. <=WM: (14193: I2 ^reward 1)
  15351. <=WM: (14192: I2 ^see 0)
  15352. =>WM: (14210: I2 ^level-1 R0-root)
  15353. <=WM: (14195: I2 ^level-1 R0-root)
  15354. --- END Input Phase ---
  15355. --- Proposal Phase ---
  15356. --- Inner Elaboration Phase, active level 1 (S1) ---
  15357. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15358. -->
  15359. (S1 ^operator O2015 = 0.7358542477906264)
  15360. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15361. -->
  15362. Firing elaborate*copy-see-to-output-link
  15363. -->
  15364. (I3 ^see 0 +)
  15365. Firing elaborate*reward*based*on*reward
  15366. -->
  15367. (R1012 ^value 1 +)
  15368. (R1 ^reward R1012 +)
  15369. Firing propose*predict-yes
  15370. -->
  15371. (O2017 ^name predict-yes +)
  15372. (S1 ^operator O2017 +)
  15373. Firing propose*predict-no
  15374. -->
  15375. (O2018 ^name predict-no +)
  15376. (S1 ^operator O2018 +)
  15377. Firing rl*prefer*rvt*predict-no*H0*6
  15378. -->
  15379. (S1 ^operator O2016 = 0.9998785089568328)
  15380. Firing rl*prefer*rvt*predict-yes*H0*5
  15381. -->
  15382. (S1 ^operator O2015 = 0.2640043987919141)
  15383. Firing prefer*rvt*predict-yes*H0
  15384. -->
  15385. Firing prefer*rvt*predict-no*H0
  15386. -->
  15387. Firing elaborate*copy-dir-to-output-link
  15388. -->
  15389. (I3 ^dir L +)
  15390. inner elaboration loop at bottom goal.
  15391. Retracting elaborate*copy-see-to-output-link
  15392. -->
  15393. (I3 ^see 0 +)
  15394. Retracting propose*predict-no
  15395. -->
  15396. (O2016 ^name predict-no +)
  15397. (S1 ^operator O2016 +)
  15398. Retracting propose*predict-yes
  15399. -->
  15400. (O2015 ^name predict-yes +)
  15401. (S1 ^operator O2015 +)
  15402. Retracting elaborate*reward*based*on*reward
  15403. -->
  15404. (R1011 ^value 1 +)
  15405. (R1 ^reward R1011 +)
  15406. Retracting elaborate*copy-dir-to-output-link
  15407. -->
  15408. (I3 ^dir U +)
  15409. Retracting rl*prefer*rvt*predict-no*H0*2
  15410. -->
  15411. (S1 ^operator O2016 = 1.)
  15412. Retracting rl*prefer*rvt*predict-yes*H0*1
  15413. -->
  15414. (S1 ^operator O2015 = 0.)
  15415. =>WM: (14217: S1 ^operator O2018 +)
  15416. =>WM: (14216: S1 ^operator O2017 +)
  15417. =>WM: (14215: I3 ^dir L)
  15418. =>WM: (14214: O2018 ^name predict-no)
  15419. =>WM: (14213: O2017 ^name predict-yes)
  15420. =>WM: (14212: R1012 ^value 1)
  15421. =>WM: (14211: R1 ^reward R1012)
  15422. <=WM: (14202: S1 ^operator O2015 +)
  15423. <=WM: (14203: S1 ^operator O2016 +)
  15424. <=WM: (14204: S1 ^operator O2016)
  15425. <=WM: (14201: I3 ^dir U)
  15426. <=WM: (14197: R1 ^reward R1011)
  15427. <=WM: (14200: O2016 ^name predict-no)
  15428. <=WM: (14199: O2015 ^name predict-yes)
  15429. <=WM: (14198: R1011 ^value 1)
  15430. --- Inner Elaboration Phase, active level 1 (S1) ---
  15431. Firing prefer*rvt*predict-yes*H0
  15432. -->
  15433. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15434. -->
  15435. (S1 ^operator O2017 = 0.7358542477906264)
  15436. Firing rl*prefer*rvt*predict-yes*H0*5
  15437. -->
  15438. (S1 ^operator O2017 = 0.2640043987919141)
  15439. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15440. -->
  15441. Firing prefer*rvt*predict-no*H0
  15442. -->
  15443. Firing rl*prefer*rvt*predict-no*H0*6
  15444. -->
  15445. (S1 ^operator O2018 = 0.9998785089568328)
  15446. inner elaboration loop at bottom goal.
  15447. Retracting rl*prefer*rvt*predict-no*H0*6
  15448. -->
  15449. (S1 ^operator O2016 = 0.9998785089568328)
  15450. Retracting rl*prefer*rvt*predict-yes*H0*5
  15451. -->
  15452. (S1 ^operator O2015 = 0.2640043987919141)
  15453. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15454. -->
  15455. (S1 ^operator O2015 = 0.7358542477906264)
  15456. --- END Proposal Phase ---
  15457. --- Decision Phase ---
  15458. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15459. =>WM: (14218: S1 ^operator O2018)
  15460. 1009: O: O2018 (predict-no)
  15461. --- END Decision Phase ---
  15462. --- Application Phase ---
  15463. --- Firing Productions (PE) For State At Depth 1 ---
  15464. --- Inner Elaboration Phase, active level 1 (S1) ---
  15465. Firing apply*operator
  15466. -->
  15467. (I3 ^predict-no N1009 + :O )
  15468. Firing apply*operator*complete
  15469. -->
  15470. (I3 ^predict-no N1008 - :O )
  15471. inner elaboration loop at bottom goal.
  15472. --- Change Working Memory (PE) ---
  15473. =>WM: (14219: I3 ^predict-no N1009)
  15474. <=WM: (14206: N1008 ^status complete)
  15475. <=WM: (14205: I3 ^predict-no N1008)
  15476. --- Firing Productions (IE) For State At Depth 1 ---
  15477. --- Inner Elaboration Phase, active level 1 (S1) ---
  15478. Firing monitor*world
  15479. -->
  15480. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15481. --- Change Working Memory (IE) ---
  15482. --- END Application Phase ---
  15483. --- Output Phase ---
  15484. ENV: Agent did: predict-no for direction L in state State-B
  15485. In State-B moving L
  15486. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  15487. predict error 1
  15488. dir: dir isL
  15489. --- END Output Phase ---
  15490. /|\--- Input Phase ---
  15491. =>WM: (14223: I2 ^dir L)
  15492. =>WM: (14222: I2 ^reward 0)
  15493. =>WM: (14221: I2 ^see 1)
  15494. =>WM: (14220: N1009 ^status complete)
  15495. <=WM: (14209: I2 ^dir L)
  15496. <=WM: (14208: I2 ^reward 1)
  15497. <=WM: (14207: I2 ^see 0)
  15498. =>WM: (14224: I2 ^level-1 L1-root)
  15499. <=WM: (14210: I2 ^level-1 R0-root)
  15500. --- END Input Phase ---
  15501. --- Proposal Phase ---
  15502. --- Inner Elaboration Phase, active level 1 (S1) ---
  15503. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15504. -->
  15505. (S1 ^operator O2017 = -0.181727099742844)
  15506. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15507. -->
  15508. Firing elaborate*copy-see-to-output-link
  15509. -->
  15510. (I3 ^see 1 +)
  15511. Firing elaborate*reward*based*on*reward
  15512. -->
  15513. (R1013 ^value 0 +)
  15514. (R1 ^reward R1013 +)
  15515. Firing propose*predict-yes
  15516. -->
  15517. (O2019 ^name predict-yes +)
  15518. (S1 ^operator O2019 +)
  15519. Firing propose*predict-no
  15520. -->
  15521. (O2020 ^name predict-no +)
  15522. (S1 ^operator O2020 +)
  15523. Firing rl*prefer*rvt*predict-no*H0*6
  15524. -->
  15525. (S1 ^operator O2018 = 0.9998785089568328)
  15526. Firing rl*prefer*rvt*predict-yes*H0*5
  15527. -->
  15528. (S1 ^operator O2017 = 0.2640043987919141)
  15529. Firing prefer*rvt*predict-yes*H0
  15530. -->
  15531. Firing prefer*rvt*predict-no*H0
  15532. -->
  15533. Firing elaborate*copy-dir-to-output-link
  15534. -->
  15535. (I3 ^dir L +)
  15536. inner elaboration loop at bottom goal.
  15537. Retracting elaborate*copy-see-to-output-link
  15538. -->
  15539. (I3 ^see 0 +)
  15540. Retracting propose*predict-no
  15541. -->
  15542. (O2018 ^name predict-no +)
  15543. (S1 ^operator O2018 +)
  15544. Retracting propose*predict-yes
  15545. -->
  15546. (O2017 ^name predict-yes +)
  15547. (S1 ^operator O2017 +)
  15548. Retracting elaborate*reward*based*on*reward
  15549. -->
  15550. (R1012 ^value 1 +)
  15551. (R1 ^reward R1012 +)
  15552. Retracting elaborate*copy-dir-to-output-link
  15553. -->
  15554. (I3 ^dir L +)
  15555. Retracting rl*prefer*rvt*predict-no*H0*6
  15556. -->
  15557. (S1 ^operator O2018 = 0.9998785089568328)
  15558. Retracting rl*prefer*rvt*predict-yes*H0*5
  15559. -->
  15560. (S1 ^operator O2017 = 0.2640043987919141)
  15561. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15562. -->
  15563. (S1 ^operator O2017 = 0.7358542477906264)
  15564. =>WM: (14231: S1 ^operator O2020 +)
  15565. =>WM: (14230: S1 ^operator O2019 +)
  15566. =>WM: (14229: O2020 ^name predict-no)
  15567. =>WM: (14228: O2019 ^name predict-yes)
  15568. =>WM: (14227: R1013 ^value 0)
  15569. =>WM: (14226: R1 ^reward R1013)
  15570. =>WM: (14225: I3 ^see 1)
  15571. <=WM: (14216: S1 ^operator O2017 +)
  15572. <=WM: (14217: S1 ^operator O2018 +)
  15573. <=WM: (14218: S1 ^operator O2018)
  15574. <=WM: (14211: R1 ^reward R1012)
  15575. <=WM: (14196: I3 ^see 0)
  15576. <=WM: (14214: O2018 ^name predict-no)
  15577. <=WM: (14213: O2017 ^name predict-yes)
  15578. <=WM: (14212: R1012 ^value 1)
  15579. --- Inner Elaboration Phase, active level 1 (S1) ---
  15580. Firing prefer*rvt*predict-yes*H0
  15581. -->
  15582. Firing rl*prefer*rvt*predict-yes*H0*5
  15583. -->
  15584. (S1 ^operator O2019 = 0.2640043987919141)
  15585. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15586. -->
  15587. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15588. -->
  15589. (S1 ^operator O2019 = -0.181727099742844)
  15590. Firing prefer*rvt*predict-no*H0
  15591. -->
  15592. Firing rl*prefer*rvt*predict-no*H0*6
  15593. -->
  15594. (S1 ^operator O2020 = 0.9998785089568328)
  15595. inner elaboration loop at bottom goal.
  15596. Retracting rl*prefer*rvt*predict-no*H0*6
  15597. -->
  15598. (S1 ^operator O2018 = 0.9998785089568328)
  15599. Retracting rl*prefer*rvt*predict-yes*H0*5
  15600. -->
  15601. (S1 ^operator O2017 = 0.2640043987919141)
  15602. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15603. -->
  15604. (S1 ^operator O2017 = -0.181727099742844)
  15605. --- END Proposal Phase ---
  15606. --- Decision Phase ---
  15607. RL update rl*prefer*rvt*predict-no*H0*6 0.999879 0 0.999879 -> 0.833711 0 0.833711(R,m,v=0,0.900662,0.0900662)
  15608. =>WM: (14232: S1 ^operator O2020)
  15609. 1010: O: O2020 (predict-no)
  15610. --- END Decision Phase ---
  15611. --- Application Phase ---
  15612. --- Firing Productions (PE) For State At Depth 1 ---
  15613. --- Inner Elaboration Phase, active level 1 (S1) ---
  15614. Firing apply*operator
  15615. -->
  15616. (I3 ^predict-no N1010 + :O )
  15617. Firing apply*operator*complete
  15618. -->
  15619. (I3 ^predict-no N1009 - :O )
  15620. inner elaboration loop at bottom goal.
  15621. --- Change Working Memory (PE) ---
  15622. =>WM: (14233: I3 ^predict-no N1010)
  15623. <=WM: (14220: N1009 ^status complete)
  15624. <=WM: (14219: I3 ^predict-no N1009)
  15625. --- Firing Productions (IE) For State At Depth 1 ---
  15626. --- Inner Elaboration Phase, active level 1 (S1) ---
  15627. Firing monitor*world
  15628. -->
  15629. I see 0 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15630. --- Change Working Memory (IE) ---
  15631. --- END Application Phase ---
  15632. --- Output Phase ---
  15633. ENV: Agent did: predict-no for direction L in state State-A
  15634. In State-A moving L
  15635. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15636. predict error 0
  15637. dir: dir isR
  15638. --- END Output Phase ---
  15639. -/|--- Input Phase ---
  15640. =>WM: (14237: I2 ^dir R)
  15641. =>WM: (14236: I2 ^reward 1)
  15642. =>WM: (14235: I2 ^see 0)
  15643. =>WM: (14234: N1010 ^status complete)
  15644. <=WM: (14223: I2 ^dir L)
  15645. <=WM: (14222: I2 ^reward 0)
  15646. <=WM: (14221: I2 ^see 1)
  15647. =>WM: (14238: I2 ^level-1 L0-root)
  15648. <=WM: (14224: I2 ^level-1 L1-root)
  15649. --- END Input Phase ---
  15650. --- Proposal Phase ---
  15651. --- Inner Elaboration Phase, active level 1 (S1) ---
  15652. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15653. -->
  15654. (S1 ^operator O2020 = -0.2817060109291377)
  15655. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15656. -->
  15657. (S1 ^operator O2019 = 0.6623458215671729)
  15658. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15659. -->
  15660. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15661. -->
  15662. Firing elaborate*copy-see-to-output-link
  15663. -->
  15664. (I3 ^see 0 +)
  15665. Firing elaborate*reward*based*on*reward
  15666. -->
  15667. (R1014 ^value 1 +)
  15668. (R1 ^reward R1014 +)
  15669. Firing propose*predict-yes
  15670. -->
  15671. (O2021 ^name predict-yes +)
  15672. (S1 ^operator O2021 +)
  15673. Firing propose*predict-no
  15674. -->
  15675. (O2022 ^name predict-no +)
  15676. (S1 ^operator O2022 +)
  15677. Firing rl*prefer*rvt*predict-no*H0*4
  15678. -->
  15679. (S1 ^operator O2020 = 0.3397723577617232)
  15680. Firing rl*prefer*rvt*predict-yes*H0*3
  15681. -->
  15682. (S1 ^operator O2019 = 0.3377168791642142)
  15683. Firing prefer*rvt*predict-yes*H0
  15684. -->
  15685. Firing prefer*rvt*predict-no*H0
  15686. -->
  15687. Firing elaborate*copy-dir-to-output-link
  15688. -->
  15689. (I3 ^dir R +)
  15690. inner elaboration loop at bottom goal.
  15691. Retracting elaborate*copy-see-to-output-link
  15692. -->
  15693. (I3 ^see 1 +)
  15694. Retracting propose*predict-no
  15695. -->
  15696. (O2020 ^name predict-no +)
  15697. (S1 ^operator O2020 +)
  15698. Retracting propose*predict-yes
  15699. -->
  15700. (O2019 ^name predict-yes +)
  15701. (S1 ^operator O2019 +)
  15702. Retracting elaborate*reward*based*on*reward
  15703. -->
  15704. (R1013 ^value 0 +)
  15705. (R1 ^reward R1013 +)
  15706. Retracting elaborate*copy-dir-to-output-link
  15707. -->
  15708. (I3 ^dir L +)
  15709. Retracting rl*prefer*rvt*predict-no*H0*6
  15710. -->
  15711. (S1 ^operator O2020 = 0.8337106497126315)
  15712. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15713. -->
  15714. (S1 ^operator O2019 = -0.181727099742844)
  15715. Retracting rl*prefer*rvt*predict-yes*H0*5
  15716. -->
  15717. (S1 ^operator O2019 = 0.2640043987919141)
  15718. =>WM: (14246: S1 ^operator O2022 +)
  15719. =>WM: (14245: S1 ^operator O2021 +)
  15720. =>WM: (14244: I3 ^dir R)
  15721. =>WM: (14243: O2022 ^name predict-no)
  15722. =>WM: (14242: O2021 ^name predict-yes)
  15723. =>WM: (14241: R1014 ^value 1)
  15724. =>WM: (14240: R1 ^reward R1014)
  15725. =>WM: (14239: I3 ^see 0)
  15726. <=WM: (14230: S1 ^operator O2019 +)
  15727. <=WM: (14231: S1 ^operator O2020 +)
  15728. <=WM: (14232: S1 ^operator O2020)
  15729. <=WM: (14215: I3 ^dir L)
  15730. <=WM: (14226: R1 ^reward R1013)
  15731. <=WM: (14225: I3 ^see 1)
  15732. <=WM: (14229: O2020 ^name predict-no)
  15733. <=WM: (14228: O2019 ^name predict-yes)
  15734. <=WM: (14227: R1013 ^value 0)
  15735. --- Inner Elaboration Phase, active level 1 (S1) ---
  15736. Firing prefer*rvt*predict-yes*H0
  15737. -->
  15738. Firing rl*prefer*rvt*predict-yes*H0*3
  15739. -->
  15740. (S1 ^operator O2021 = 0.3377168791642142)
  15741. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15742. -->
  15743. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15744. -->
  15745. (S1 ^operator O2021 = 0.6623458215671729)
  15746. Firing prefer*rvt*predict-no*H0
  15747. -->
  15748. Firing rl*prefer*rvt*predict-no*H0*4
  15749. -->
  15750. (S1 ^operator O2022 = 0.3397723577617232)
  15751. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15752. -->
  15753. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15754. -->
  15755. (S1 ^operator O2022 = -0.2817060109291377)
  15756. inner elaboration loop at bottom goal.
  15757. Retracting rl*prefer*rvt*predict-no*H0*4
  15758. -->
  15759. (S1 ^operator O2020 = 0.3397723577617232)
  15760. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15761. -->
  15762. (S1 ^operator O2020 = -0.2817060109291377)
  15763. Retracting rl*prefer*rvt*predict-yes*H0*3
  15764. -->
  15765. (S1 ^operator O2019 = 0.3377168791642142)
  15766. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15767. -->
  15768. (S1 ^operator O2019 = 0.6623458215671729)
  15769. --- END Proposal Phase ---
  15770. --- Decision Phase ---
  15771. RL update rl*prefer*rvt*predict-no*H0*6 0.833711 0 0.833711 -> 0.861316 0 0.861316(R,m,v=1,0.901316,0.0895347)
  15772. =>WM: (14247: S1 ^operator O2021)
  15773. 1011: O: O2021 (predict-yes)
  15774. --- END Decision Phase ---
  15775. --- Application Phase ---
  15776. --- Firing Productions (PE) For State At Depth 1 ---
  15777. --- Inner Elaboration Phase, active level 1 (S1) ---
  15778. Firing apply*operator
  15779. -->
  15780. (I3 ^predict-yes N1011 + :O )
  15781. Firing apply*operator*complete
  15782. -->
  15783. (I3 ^predict-no N1010 - :O )
  15784. inner elaboration loop at bottom goal.
  15785. --- Change Working Memory (PE) ---
  15786. =>WM: (14248: I3 ^predict-yes N1011)
  15787. <=WM: (14234: N1010 ^status complete)
  15788. <=WM: (14233: I3 ^predict-no N1010)