PageRenderTime 75ms CodeModel.GetById 26ms RepoModel.GetById 20ms app.codeStats 1ms

/flipv2/20121112-100543-2.5K-ReLST-Wallace/stdout-flip-2.5K_1.txt

https://bitbucket.org/evan13579b/soar-ziggurat
Plain Text | 16440 lines | 15705 code | 735 blank | 0 comment | 0 complexity | 65c053b700960c62ea6c2bac5dde26ac MD5 | raw file
Possible License(s): BSD-3-Clause
  1. Seeding... 1
  2. dir: dir isL
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 1 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_1.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\sleeping...
  20. -/|\-/1: O: O1 (predict-yes)
  21. I see 0 and I'm going to do: predict-yes
  22. ENV: Agent did: predict-yes for direction L in state State-A
  23. In State-A moving L
  24. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  25. predict error 1
  26. dir: dir isU
  27. rule alias: '*'
  28. rule alias: '*'
  29. |\-/|\-/2: O: O4 (predict-no)
  30. I see 0 and I'm going to do: predict-no
  31. ENV: Agent did: predict-no for direction U in state State-A
  32. In State-A moving U
  33. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  34. predict error 0
  35. dir: dir isU
  36. |\-3: O: O5 (predict-yes)
  37. I see 1 and I'm going to do: predict-yes
  38. ENV: Agent did: predict-yes for direction U in state State-A
  39. In State-A moving U
  40. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  41. predict error 1
  42. dir: dir isL
  43. /4: O: O7 (predict-yes)
  44. I see 0 and I'm going to do: predict-yes
  45. ENV: Agent did: predict-yes for direction L in state State-A
  46. In State-A moving L
  47. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  48. predict error 1
  49. dir: dir isR
  50. |\-5: O: O10 (predict-no)
  51. I see 0 and I'm going to do: predict-no
  52. ENV: Agent did: predict-no for direction R in state State-A
  53. In State-A moving R
  54. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  55. predict error 1
  56. dir: dir isR
  57. /|6: O: O11 (predict-yes)
  58. I see 0 and I'm going to do: predict-yes
  59. ENV: Agent did: predict-yes for direction R in state State-B
  60. In State-B moving R
  61. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  62. predict error 1
  63. dir: dir isR
  64. \-/|7: O: O13 (predict-yes)
  65. I see 0 and I'm going to do: predict-yes
  66. ENV: Agent did: predict-yes for direction R in state State-B
  67. In State-B moving R
  68. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  69. predict error 1
  70. dir: dir isU
  71. \-/|8: O: O16 (predict-no)
  72. I see 0 and I'm going to do: predict-no
  73. ENV: Agent did: predict-no for direction U in state State-B
  74. In State-B moving U
  75. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  76. predict error 0
  77. dir: dir isL
  78. \-9: O: O18 (predict-no)
  79. I see 1 and I'm going to do: predict-no
  80. ENV: Agent did: predict-no for direction L in state State-B
  81. In State-B moving L
  82. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  83. predict error 1
  84. dir: dir isL
  85. /|\10: O: O20 (predict-no)
  86. I see 0 and I'm going to do: predict-no
  87. ENV: Agent did: predict-no for direction L in state State-A
  88. In State-A moving L
  89. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  90. predict error 0
  91. dir: dir isU
  92. -/|11: O: O22 (predict-no)
  93. I see 1 and I'm going to do: predict-no
  94. ENV: Agent did: predict-no for direction U in state State-A
  95. In State-A moving U
  96. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  97. predict error 0
  98. dir: dir isR
  99. rule alias: '*'
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. \12: O: O23 (predict-yes)
  104. I see 1 and I'm going to do: predict-yes
  105. ENV: Agent did: predict-yes for direction R in state State-A
  106. In State-A moving R
  107. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  108. predict error 0
  109. dir: dir isU
  110. -/|13: O: O26 (predict-no)
  111. I see 1 and I'm going to do: predict-no
  112. ENV: Agent did: predict-no for direction U in state State-B
  113. In State-B moving U
  114. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  115. predict error 0
  116. dir: dir isL
  117. \-14: O: O28 (predict-no)
  118. I see 1 and I'm going to do: predict-no
  119. ENV: Agent did: predict-no for direction L in state State-B
  120. In State-B moving L
  121. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  122. predict error 1
  123. dir: dir isR
  124. /|15: O: O30 (predict-no)
  125. I see 0 and I'm going to do: predict-no
  126. ENV: Agent did: predict-no for direction R in state State-A
  127. In State-A moving R
  128. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  129. predict error 1
  130. dir: dir isU
  131. \-/16: O: O32 (predict-no)
  132. I see 0 and I'm going to do: predict-no
  133. ENV: Agent did: predict-no for direction U in state State-B
  134. In State-B moving U
  135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  136. predict error 0
  137. dir: dir isL
  138. |\-17: O: O33 (predict-yes)
  139. I see 1 and I'm going to do: predict-yes
  140. ENV: Agent did: predict-yes for direction L in state State-B
  141. In State-B moving L
  142. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  143. predict error 0
  144. dir: dir isU
  145. /|18: O: O36 (predict-no)
  146. I see 1 and I'm going to do: predict-no
  147. ENV: Agent did: predict-no for direction U in state State-A
  148. In State-A moving U
  149. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  150. predict error 0
  151. dir: dir isU
  152. \-/19: O: O38 (predict-no)
  153. I see 1 and I'm going to do: predict-no
  154. ENV: Agent did: predict-no for direction U in state State-A
  155. In State-A moving U
  156. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  157. predict error 0
  158. dir: dir isL
  159. |\-20: O: O39 (predict-yes)
  160. I see 1 and I'm going to do: predict-yes
  161. ENV: Agent did: predict-yes for direction L in state State-A
  162. In State-A moving L
  163. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  164. predict error 1
  165. dir: dir isL
  166. /|\21: O: O41 (predict-yes)
  167. I see 0 and I'm going to do: predict-yes
  168. ENV: Agent did: predict-yes for direction L in state State-A
  169. In State-A moving L
  170. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  171. predict error 1
  172. dir: dir isR
  173. -22: O: O43 (predict-yes)
  174. I see 0 and I'm going to do: predict-yes
  175. ENV: Agent did: predict-yes for direction R in state State-A
  176. In State-A moving R
  177. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  178. predict error 0
  179. dir: dir isU
  180. /|23: O: O46 (predict-no)
  181. I see 1 and I'm going to do: predict-no
  182. ENV: Agent did: predict-no for direction U in state State-B
  183. In State-B moving U
  184. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  185. predict error 0
  186. dir: dir isR
  187. \-/24: O: O47 (predict-yes)
  188. I see 1 and I'm going to do: predict-yes
  189. ENV: Agent did: predict-yes for direction R in state State-B
  190. In State-B moving R
  191. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  192. predict error 1
  193. dir: dir isL
  194. |\25: O: O50 (predict-no)
  195. I see 0 and I'm going to do: predict-no
  196. ENV: Agent did: predict-no for direction L in state State-B
  197. In State-B moving L
  198. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  199. predict error 1
  200. dir: dir isR
  201. -/|26: O: O52 (predict-no)
  202. I see 0 and I'm going to do: predict-no
  203. ENV: Agent did: predict-no for direction R in state State-A
  204. In State-A moving R
  205. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  206. predict error 1
  207. dir: dir isL
  208. \-27: O: O54 (predict-no)
  209. I see 0 and I'm going to do: predict-no
  210. ENV: Agent did: predict-no for direction L in state State-B
  211. In State-B moving L
  212. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  213. predict error 1
  214. dir: dir isL
  215. /|28: O: O56 (predict-no)
  216. I see 0 and I'm going to do: predict-no
  217. ENV: Agent did: predict-no for direction L in state State-A
  218. In State-A moving L
  219. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  220. predict error 0
  221. dir: dir isR
  222. \-/29: O: O57 (predict-yes)
  223. I see 1 and I'm going to do: predict-yes
  224. ENV: Agent did: predict-yes for direction R in state State-A
  225. In State-A moving R
  226. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  227. predict error 0
  228. dir: dir isR
  229. |\-30: O: O59 (predict-yes)
  230. I see 1 and I'm going to do: predict-yes
  231. ENV: Agent did: predict-yes for direction R in state State-B
  232. In State-B moving R
  233. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  234. predict error 1
  235. dir: dir isL
  236. /|\31: O: O62 (predict-no)
  237. I see 0 and I'm going to do: predict-no
  238. ENV: Agent did: predict-no for direction L in state State-B
  239. In State-B moving L
  240. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  241. predict error 1
  242. dir: dir isL
  243. -32: O: O64 (predict-no)
  244. I see 0 and I'm going to do: predict-no
  245. ENV: Agent did: predict-no for direction L in state State-A
  246. In State-A moving L
  247. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  248. predict error 0
  249. dir: dir isL
  250. /|\33: O: O66 (predict-no)
  251. I see 1 and I'm going to do: predict-no
  252. ENV: Agent did: predict-no for direction L in state State-A
  253. In State-A moving L
  254. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  255. predict error 0
  256. dir: dir isR
  257. -/|34: O: O67 (predict-yes)
  258. I see 1 and I'm going to do: predict-yes
  259. ENV: Agent did: predict-yes for direction R in state State-A
  260. In State-A moving R
  261. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  262. predict error 0
  263. dir: dir isL
  264. \-/35: O: O70 (predict-no)
  265. I see 1 and I'm going to do: predict-no
  266. ENV: Agent did: predict-no for direction L in state State-B
  267. In State-B moving L
  268. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  269. predict error 1
  270. dir: dir isL
  271. |\-/36: O: O72 (predict-no)
  272. I see 0 and I'm going to do: predict-no
  273. ENV: Agent did: predict-no for direction L in state State-A
  274. In State-A moving L
  275. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  276. predict error 0
  277. dir: dir isU
  278. |\-37: O: O74 (predict-no)
  279. I see 1 and I'm going to do: predict-no
  280. ENV: Agent did: predict-no for direction U in state State-A
  281. In State-A moving U
  282. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  283. predict error 0
  284. dir: dir isR
  285. /|\38: O: O76 (predict-no)
  286. I see 1 and I'm going to do: predict-no
  287. ENV: Agent did: predict-no for direction R in state State-A
  288. In State-A moving R
  289. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  290. predict error 1
  291. dir: dir isR
  292. -/|39: O: O77 (predict-yes)
  293. I see 0 and I'm going to do: predict-yes
  294. ENV: Agent did: predict-yes for direction R in state State-B
  295. In State-B moving R
  296. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  297. predict error 1
  298. dir: dir isL
  299. \-/40: O: O80 (predict-no)
  300. I see 0 and I'm going to do: predict-no
  301. ENV: Agent did: predict-no for direction L in state State-B
  302. In State-B moving L
  303. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  304. predict error 1
  305. dir: dir isU
  306. |\-41: O: O82 (predict-no)
  307. I see 0 and I'm going to do: predict-no
  308. ENV: Agent did: predict-no for direction U in state State-A
  309. In State-A moving U
  310. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  311. predict error 0
  312. dir: dir isU
  313. /42: O: O84 (predict-no)
  314. I see 1 and I'm going to do: predict-no
  315. ENV: Agent did: predict-no for direction U in state State-A
  316. In State-A moving U
  317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  318. predict error 0
  319. dir: dir isL
  320. |\43: O: O85 (predict-yes)
  321. I see 1 and I'm going to do: predict-yes
  322. ENV: Agent did: predict-yes for direction L in state State-A
  323. In State-A moving L
  324. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  325. predict error 1
  326. dir: dir isL
  327. -/|44: O: O88 (predict-no)
  328. I see 0 and I'm going to do: predict-no
  329. ENV: Agent did: predict-no for direction L in state State-A
  330. In State-A moving L
  331. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  332. predict error 0
  333. dir: dir isU
  334. \-/45: O: O90 (predict-no)
  335. I see 1 and I'm going to do: predict-no
  336. ENV: Agent did: predict-no for direction U in state State-A
  337. In State-A moving U
  338. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  339. predict error 0
  340. dir: dir isU
  341. |\-46: O: O92 (predict-no)
  342. I see 1 and I'm going to do: predict-no
  343. ENV: Agent did: predict-no for direction U in state State-A
  344. In State-A moving U
  345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  346. predict error 0
  347. dir: dir isU
  348. /|\47: O: O94 (predict-no)
  349. I see 1 and I'm going to do: predict-no
  350. ENV: Agent did: predict-no for direction U in state State-A
  351. In State-A moving U
  352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  353. predict error 0
  354. dir: dir isR
  355. -/48: O: O95 (predict-yes)
  356. I see 1 and I'm going to do: predict-yes
  357. ENV: Agent did: predict-yes for direction R in state State-A
  358. In State-A moving R
  359. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  360. predict error 0
  361. dir: dir isU
  362. |\-49: O: O98 (predict-no)
  363. I see 1 and I'm going to do: predict-no
  364. ENV: Agent did: predict-no for direction U in state State-B
  365. In State-B moving U
  366. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  367. predict error 0
  368. dir: dir isU
  369. /|\50: O: O100 (predict-no)
  370. I see 1 and I'm going to do: predict-no
  371. ENV: Agent did: predict-no for direction U in state State-B
  372. In State-B moving U
  373. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  374. predict error 0
  375. dir: dir isL
  376. -/|\-/|sleeping...
  377. \sleeping...
  378. -sleeping...
  379. /sleeping...
  380. |sleeping...
  381. \sleeping...
  382. -sleeping...
  383. /sleeping...
  384. |sleeping...
  385. \sleeping...
  386. -sleeping...
  387. /sleeping...
  388. |sleeping...
  389. \sleeping...
  390. -sleeping...
  391. /sleeping...
  392. |sleeping...
  393. \sleeping...
  394. -sleeping...
  395. /sleeping...
  396. |sleeping...
  397. \sleeping...
  398. -sleeping...
  399. /sleeping...
  400. |sleeping...
  401. \sleeping...
  402. -sleeping...
  403. /sleeping...
  404. |sleeping...
  405. \sleeping...
  406. -sleeping...
  407. /sleeping...
  408. |sleeping...
  409. \sleeping...
  410. -sleeping...
  411. /sleeping...
  412. |sleeping...
  413. \51: O: O102 (predict-no)
  414. I see 1 and I'm going to do: predict-no
  415. ENV: Agent did: predict-no for direction L in state State-B
  416. In State-B moving L
  417. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  418. predict error 1
  419. dir: dir isR
  420. rule alias: '*'
  421. rule alias: '*'
  422. -52: O: O104 (predict-no)
  423. I see 0 and I'm going to do: predict-no
  424. ENV: Agent did: predict-no for direction R in state State-A
  425. In State-A moving R
  426. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  427. predict error 1
  428. dir: dir isU
  429. /|53: O: O106 (predict-no)
  430. I see 0 and I'm going to do: predict-no
  431. ENV: Agent did: predict-no for direction U in state State-B
  432. In State-B moving U
  433. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  434. predict error 0
  435. dir: dir isU
  436. \-/54: O: O108 (predict-no)
  437. I see 1 and I'm going to do: predict-no
  438. ENV: Agent did: predict-no for direction U in state State-B
  439. In State-B moving U
  440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  441. predict error 0
  442. dir: dir isR
  443. |\55: O: O109 (predict-yes)
  444. I see 1 and I'm going to do: predict-yes
  445. ENV: Agent did: predict-yes for direction R in state State-B
  446. In State-B moving R
  447. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  448. predict error 1
  449. dir: dir isR
  450. -/|56: O: O111 (predict-yes)
  451. I see 0 and I'm going to do: predict-yes
  452. ENV: Agent did: predict-yes for direction R in state State-B
  453. In State-B moving R
  454. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  455. predict error 1
  456. dir: dir isL
  457. \-/57: O: O114 (predict-no)
  458. I see 0 and I'm going to do: predict-no
  459. ENV: Agent did: predict-no for direction L in state State-B
  460. In State-B moving L
  461. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  462. predict error 1
  463. dir: dir isL
  464. |\58: O: O116 (predict-no)
  465. I see 0 and I'm going to do: predict-no
  466. ENV: Agent did: predict-no for direction L in state State-A
  467. In State-A moving L
  468. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  469. predict error 0
  470. dir: dir isU
  471. -/|59: O: O118 (predict-no)
  472. I see 1 and I'm going to do: predict-no
  473. ENV: Agent did: predict-no for direction U in state State-A
  474. In State-A moving U
  475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  476. predict error 0
  477. dir: dir isR
  478. \-60: O: O119 (predict-yes)
  479. I see 1 and I'm going to do: predict-yes
  480. ENV: Agent did: predict-yes for direction R in state State-A
  481. In State-A moving R
  482. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  483. predict error 0
  484. dir: dir isL
  485. /61: O: O122 (predict-no)
  486. I see 1 and I'm going to do: predict-no
  487. ENV: Agent did: predict-no for direction L in state State-B
  488. In State-B moving L
  489. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  490. predict error 1
  491. dir: dir isR
  492. rule alias: '*'
  493. rule alias: '*'
  494. rule alias: '*'
  495. rule alias: '*'
  496. rule alias: '*'
  497. rule alias: '*'
  498. rule alias: '*'
  499. rule alias: '*'
  500. rule alias: '*'
  501. rule alias: '*'
  502. |62: O: O123 (predict-yes)
  503. I see 0 and I'm going to do: predict-yes
  504. ENV: Agent did: predict-yes for direction R in state State-A
  505. In State-A moving R
  506. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  507. predict error 0
  508. dir: dir isU
  509. \-/63: O: O126 (predict-no)
  510. I see 1 and I'm going to do: predict-no
  511. ENV: Agent did: predict-no for direction U in state State-B
  512. In State-B moving U
  513. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  514. predict error 0
  515. dir: dir isU
  516. |\-64: O: O128 (predict-no)
  517. I see 1 and I'm going to do: predict-no
  518. ENV: Agent did: predict-no for direction U in state State-B
  519. In State-B moving U
  520. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  521. predict error 0
  522. dir: dir isR
  523. /|65: O: O129 (predict-yes)
  524. I see 1 and I'm going to do: predict-yes
  525. ENV: Agent did: predict-yes for direction R in state State-B
  526. In State-B moving R
  527. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  528. predict error 1
  529. dir: dir isR
  530. \-/66: O: O132 (predict-no)
  531. I see 0 and I'm going to do: predict-no
  532. ENV: Agent did: predict-no for direction R in state State-B
  533. In State-B moving R
  534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  535. predict error 0
  536. dir: dir isR
  537. |\-67: O: O134 (predict-no)
  538. I see 1 and I'm going to do: predict-no
  539. ENV: Agent did: predict-no for direction R in state State-B
  540. In State-B moving R
  541. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  542. predict error 0
  543. dir: dir isU
  544. /|68: O: O136 (predict-no)
  545. I see 1 and I'm going to do: predict-no
  546. ENV: Agent did: predict-no for direction U in state State-B
  547. In State-B moving U
  548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  549. predict error 0
  550. dir: dir isR
  551. \-/69: O: O137 (predict-yes)
  552. I see 1 and I'm going to do: predict-yes
  553. ENV: Agent did: predict-yes for direction R in state State-B
  554. In State-B moving R
  555. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  556. predict error 1
  557. dir: dir isR
  558. |\-70: O: O139 (predict-yes)
  559. I see 0 and I'm going to do: predict-yes
  560. ENV: Agent did: predict-yes for direction R in state State-B
  561. In State-B moving R
  562. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  563. predict error 1
  564. dir: dir isR
  565. /71: O: O142 (predict-no)
  566. I see 0 and I'm going to do: predict-no
  567. ENV: Agent did: predict-no for direction R in state State-B
  568. In State-B moving R
  569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  570. predict error 0
  571. dir: dir isL
  572. rule alias: '*'
  573. |72: O: O144 (predict-no)
  574. I see 1 and I'm going to do: predict-no
  575. ENV: Agent did: predict-no for direction L in state State-B
  576. In State-B moving L
  577. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  578. predict error 1
  579. dir: dir isL
  580. \-/73: O: O146 (predict-no)
  581. I see 0 and I'm going to do: predict-no
  582. ENV: Agent did: predict-no for direction L in state State-A
  583. In State-A moving L
  584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  585. predict error 0
  586. dir: dir isU
  587. |\74: O: O148 (predict-no)
  588. I see 1 and I'm going to do: predict-no
  589. ENV: Agent did: predict-no for direction U in state State-A
  590. In State-A moving U
  591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  592. predict error 0
  593. dir: dir isU
  594. -/75: O: O149 (predict-yes)
  595. I see 1 and I'm going to do: predict-yes
  596. ENV: Agent did: predict-yes for direction U in state State-A
  597. In State-A moving U
  598. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  599. predict error 1
  600. dir: dir isR
  601. |\76: O: O152 (predict-no)
  602. I see 0 and I'm going to do: predict-no
  603. ENV: Agent did: predict-no for direction R in state State-A
  604. In State-A moving R
  605. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  606. predict error 1
  607. dir: dir isR
  608. -/|77: O: O153 (predict-yes)
  609. I see 0 and I'm going to do: predict-yes
  610. ENV: Agent did: predict-yes for direction R in state State-B
  611. In State-B moving R
  612. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  613. predict error 1
  614. dir: dir isL
  615. \-/78: O: O156 (predict-no)
  616. I see 0 and I'm going to do: predict-no
  617. ENV: Agent did: predict-no for direction L in state State-B
  618. In State-B moving L
  619. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  620. predict error 1
  621. dir: dir isR
  622. |\-79: O: O158 (predict-no)
  623. I see 0 and I'm going to do: predict-no
  624. ENV: Agent did: predict-no for direction R in state State-A
  625. In State-A moving R
  626. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  627. predict error 1
  628. dir: dir isU
  629. /|\80: O: O160 (predict-no)
  630. I see 0 and I'm going to do: predict-no
  631. ENV: Agent did: predict-no for direction U in state State-B
  632. In State-B moving U
  633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  634. predict error 0
  635. dir: dir isU
  636. -/81: O: O162 (predict-no)
  637. I see 1 and I'm going to do: predict-no
  638. ENV: Agent did: predict-no for direction U in state State-B
  639. In State-B moving U
  640. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  641. predict error 0
  642. dir: dir isR
  643. rule alias: '*'
  644. |82: O: O163 (predict-yes)
  645. I see 1 and I'm going to do: predict-yes
  646. ENV: Agent did: predict-yes for direction R in state State-B
  647. In State-B moving R
  648. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  649. predict error 1
  650. dir: dir isU
  651. \-/|83: O: O166 (predict-no)
  652. I see 0 and I'm going to do: predict-no
  653. ENV: Agent did: predict-no for direction U in state State-B
  654. In State-B moving U
  655. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  656. predict error 0
  657. dir: dir isL
  658. \-/84: O: O168 (predict-no)
  659. I see 1 and I'm going to do: predict-no
  660. ENV: Agent did: predict-no for direction L in state State-B
  661. In State-B moving L
  662. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  663. predict error 1
  664. dir: dir isR
  665. |\-85: O: O170 (predict-no)
  666. I see 0 and I'm going to do: predict-no
  667. ENV: Agent did: predict-no for direction R in state State-A
  668. In State-A moving R
  669. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  670. predict error 1
  671. dir: dir isU
  672. /|\86: O: O172 (predict-no)
  673. I see 0 and I'm going to do: predict-no
  674. ENV: Agent did: predict-no for direction U in state State-B
  675. In State-B moving U
  676. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  677. predict error 0
  678. dir: dir isR
  679. -/|87: O: O174 (predict-no)
  680. I see 1 and I'm going to do: predict-no
  681. ENV: Agent did: predict-no for direction R in state State-B
  682. In State-B moving R
  683. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  684. predict error 0
  685. dir: dir isR
  686. \-/88: O: O176 (predict-no)
  687. I see 1 and I'm going to do: predict-no
  688. ENV: Agent did: predict-no for direction R in state State-B
  689. In State-B moving R
  690. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  691. predict error 0
  692. dir: dir isL
  693. |\-89: O: O177 (predict-yes)
  694. I see 1 and I'm going to do: predict-yes
  695. ENV: Agent did: predict-yes for direction L in state State-B
  696. In State-B moving L
  697. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  698. predict error 0
  699. dir: dir isR
  700. /|\90: O: O179 (predict-yes)
  701. I see 1 and I'm going to do: predict-yes
  702. ENV: Agent did: predict-yes for direction R in state State-A
  703. In State-A moving R
  704. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  705. predict error 0
  706. dir: dir isU
  707. -/91: O: O182 (predict-no)
  708. I see 1 and I'm going to do: predict-no
  709. ENV: Agent did: predict-no for direction U in state State-B
  710. In State-B moving U
  711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  712. predict error 0
  713. dir: dir isL
  714. rule alias: '*'
  715. rule alias: '*'
  716. rule alias: '*'
  717. |92: O: O184 (predict-no)
  718. I see 1 and I'm going to do: predict-no
  719. ENV: Agent did: predict-no for direction L in state State-B
  720. In State-B moving L
  721. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  722. predict error 1
  723. dir: dir isU
  724. \-93: O: O186 (predict-no)
  725. I see 0 and I'm going to do: predict-no
  726. ENV: Agent did: predict-no for direction U in state State-A
  727. In State-A moving U
  728. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  729. predict error 0
  730. dir: dir isU
  731. /|94: O: O188 (predict-no)
  732. I see 1 and I'm going to do: predict-no
  733. ENV: Agent did: predict-no for direction U in state State-A
  734. In State-A moving U
  735. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  736. predict error 0
  737. dir: dir isU
  738. \-/95: O: O190 (predict-no)
  739. I see 1 and I'm going to do: predict-no
  740. ENV: Agent did: predict-no for direction U in state State-A
  741. In State-A moving U
  742. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  743. predict error 0
  744. dir: dir isU
  745. |\-96: O: O191 (predict-yes)
  746. I see 1 and I'm going to do: predict-yes
  747. ENV: Agent did: predict-yes for direction U in state State-A
  748. In State-A moving U
  749. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  750. predict error 1
  751. dir: dir isU
  752. /|\-97: O: O194 (predict-no)
  753. I see 0 and I'm going to do: predict-no
  754. ENV: Agent did: predict-no for direction U in state State-A
  755. In State-A moving U
  756. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  757. predict error 0
  758. dir: dir isR
  759. /|\98: O: O196 (predict-no)
  760. I see 1 and I'm going to do: predict-no
  761. ENV: Agent did: predict-no for direction R in state State-A
  762. In State-A moving R
  763. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  764. predict error 1
  765. dir: dir isR
  766. -/99: O: O198 (predict-no)
  767. I see 0 and I'm going to do: predict-no
  768. ENV: Agent did: predict-no for direction R in state State-B
  769. In State-B moving R
  770. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  771. predict error 0
  772. dir: dir isR
  773. |\-100: O: O200 (predict-no)
  774. I see 1 and I'm going to do: predict-no
  775. ENV: Agent did: predict-no for direction R in state State-B
  776. In State-B moving R
  777. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  778. predict error 0
  779. dir: dir isL
  780. /|\101: O: O201 (predict-yes)
  781. I see 1 and I'm going to do: predict-yes
  782. ENV: Agent did: predict-yes for direction L in state State-B
  783. In State-B moving L
  784. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  785. predict error 0
  786. dir: dir isU
  787. rule alias: '*'
  788. rule alias: '*'
  789. -/|\-/|\-/|\-/|\-/|\-/|\-/|\-sleeping...
  790. /sleeping...
  791. |sleeping...
  792. \sleeping...
  793. -sleeping...
  794. /sleeping...
  795. |sleeping...
  796. \sleeping...
  797. -sleeping...
  798. /sleeping...
  799. |sleeping...
  800. \sleeping...
  801. -sleeping...
  802. /sleeping...
  803. |sleeping...
  804. \sleeping...
  805. -sleeping...
  806. /sleeping...
  807. |102: O: O203 (predict-yes)
  808. I see 1 and I'm going to do: predict-yes
  809. ENV: Agent did: predict-yes for direction U in state State-A
  810. In State-A moving U
  811. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  812. predict error 1
  813. dir: dir isR
  814. \-/|103: O: O206 (predict-no)
  815. I see 0 and I'm going to do: predict-no
  816. ENV: Agent did: predict-no for direction R in state State-A
  817. In State-A moving R
  818. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  819. predict error 1
  820. dir: dir isL
  821. \-/104: O: O207 (predict-yes)
  822. I see 0 and I'm going to do: predict-yes
  823. ENV: Agent did: predict-yes for direction L in state State-B
  824. In State-B moving L
  825. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  826. predict error 0
  827. dir: dir isR
  828. |\105: O: O210 (predict-no)
  829. I see 1 and I'm going to do: predict-no
  830. ENV: Agent did: predict-no for direction R in state State-A
  831. In State-A moving R
  832. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  833. predict error 1
  834. dir: dir isR
  835. -/106: O: O211 (predict-yes)
  836. I see 0 and I'm going to do: predict-yes
  837. ENV: Agent did: predict-yes for direction R in state State-B
  838. In State-B moving R
  839. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  840. predict error 1
  841. dir: dir isR
  842. |\-107: O: O213 (predict-yes)
  843. I see 0 and I'm going to do: predict-yes
  844. ENV: Agent did: predict-yes for direction R in state State-B
  845. In State-B moving R
  846. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  847. predict error 1
  848. dir: dir isR
  849. /|\-sleeping...
  850. /108: O: O216 (predict-no)
  851. I see 0 and I'm going to do: predict-no
  852. ENV: Agent did: predict-no for direction R in state State-B
  853. In State-B moving R
  854. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  855. predict error 0
  856. dir: dir isR
  857. |\109: O: O218 (predict-no)
  858. I see 1 and I'm going to do: predict-no
  859. ENV: Agent did: predict-no for direction R in state State-B
  860. In State-B moving R
  861. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  862. predict error 0
  863. dir: dir isR
  864. -110: O: O220 (predict-no)
  865. I see 1 and I'm going to do: predict-no
  866. ENV: Agent did: predict-no for direction R in state State-B
  867. In State-B moving R
  868. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  869. predict error 0
  870. dir: dir isR
  871. /|\111: O: O222 (predict-no)
  872. I see 1 and I'm going to do: predict-no
  873. ENV: Agent did: predict-no for direction R in state State-B
  874. In State-B moving R
  875. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  876. predict error 0
  877. dir: dir isR
  878. rule alias: '*'
  879. rule alias: '*'
  880. rule alias: '*'
  881. rule alias: '*'
  882. rule alias: '*'
  883. rule alias: '*'
  884. rule alias: '*'
  885. rule alias: '*'
  886. -112: O: O223 (predict-yes)
  887. I see 1 and I'm going to do: predict-yes
  888. ENV: Agent did: predict-yes for direction R in state State-B
  889. In State-B moving R
  890. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  891. predict error 1
  892. dir: dir isL
  893. /|\113: O: O225 (predict-yes)
  894. I see 0 and I'm going to do: predict-yes
  895. ENV: Agent did: predict-yes for direction L in state State-B
  896. In State-B moving L
  897. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  898. predict error 0
  899. dir: dir isL
  900. -/|114: O: O227 (predict-yes)
  901. I see 1 and I'm going to do: predict-yes
  902. ENV: Agent did: predict-yes for direction L in state State-A
  903. In State-A moving L
  904. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  905. predict error 1
  906. dir: dir isL
  907. \-/115: O: O229 (predict-yes)
  908. I see 0 and I'm going to do: predict-yes
  909. ENV: Agent did: predict-yes for direction L in state State-A
  910. In State-A moving L
  911. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  912. predict error 1
  913. dir: dir isR
  914. |\-/116: O: O232 (predict-no)
  915. I see 0 and I'm going to do: predict-no
  916. ENV: Agent did: predict-no for direction R in state State-A
  917. In State-A moving R
  918. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  919. predict error 1
  920. dir: dir isU
  921. |\-117: O: O234 (predict-no)
  922. I see 0 and I'm going to do: predict-no
  923. ENV: Agent did: predict-no for direction U in state State-B
  924. In State-B moving U
  925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  926. predict error 0
  927. dir: dir isU
  928. /|118: O: O236 (predict-no)
  929. I see 1 and I'm going to do: predict-no
  930. ENV: Agent did: predict-no for direction U in state State-B
  931. In State-B moving U
  932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  933. predict error 0
  934. dir: dir isU
  935. \-/119: O: O238 (predict-no)
  936. I see 1 and I'm going to do: predict-no
  937. ENV: Agent did: predict-no for direction U in state State-B
  938. In State-B moving U
  939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  940. predict error 0
  941. dir: dir isU
  942. |\-120: O: O239 (predict-yes)
  943. I see 1 and I'm going to do: predict-yes
  944. ENV: Agent did: predict-yes for direction U in state State-B
  945. In State-B moving U
  946. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  947. predict error 1
  948. dir: dir isL
  949. /|\121: O: O241 (predict-yes)
  950. I see 0 and I'm going to do: predict-yes
  951. ENV: Agent did: predict-yes for direction L in state State-B
  952. In State-B moving L
  953. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  954. predict error 0
  955. dir: dir isU
  956. rule alias: '*'
  957. rule alias: '*'
  958. rule alias: '*'
  959. rule alias: '*'
  960. rule alias: '*'
  961. rule alias: '*'
  962. rule alias: '*'
  963. rule alias: '*'
  964. -122: O: O244 (predict-no)
  965. I see 1 and I'm going to do: predict-no
  966. ENV: Agent did: predict-no for direction U in state State-A
  967. In State-A moving U
  968. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  969. predict error 0
  970. dir: dir isU
  971. /|123: O: O246 (predict-no)
  972. I see 1 and I'm going to do: predict-no
  973. ENV: Agent did: predict-no for direction U in state State-A
  974. In State-A moving U
  975. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  976. predict error 0
  977. dir: dir isL
  978. \-124: O: O247 (predict-yes)
  979. I see 1 and I'm going to do: predict-yes
  980. ENV: Agent did: predict-yes for direction L in state State-A
  981. In State-A moving L
  982. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  983. predict error 1
  984. dir: dir isL
  985. /|\125: O: O249 (predict-yes)
  986. I see 0 and I'm going to do: predict-yes
  987. ENV: Agent did: predict-yes for direction L in state State-A
  988. In State-A moving L
  989. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  990. predict error 1
  991. dir: dir isL
  992. -/126: O: O251 (predict-yes)
  993. I see 0 and I'm going to do: predict-yes
  994. ENV: Agent did: predict-yes for direction L in state State-A
  995. In State-A moving L
  996. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  997. predict error 1
  998. dir: dir isU
  999. |\-127: O: O254 (predict-no)
  1000. I see 0 and I'm going to do: predict-no
  1001. ENV: Agent did: predict-no for direction U in state State-A
  1002. In State-A moving U
  1003. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1004. predict error 0
  1005. dir: dir isL
  1006. /|128: O: O255 (predict-yes)
  1007. I see 1 and I'm going to do: predict-yes
  1008. ENV: Agent did: predict-yes for direction L in state State-A
  1009. In State-A moving L
  1010. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1011. predict error 1
  1012. dir: dir isL
  1013. \-/129: O: O257 (predict-yes)
  1014. I see 0 and I'm going to do: predict-yes
  1015. ENV: Agent did: predict-yes for direction L in state State-A
  1016. In State-A moving L
  1017. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1018. predict error 1
  1019. dir: dir isR
  1020. |\-130: O: O260 (predict-no)
  1021. I see 0 and I'm going to do: predict-no
  1022. ENV: Agent did: predict-no for direction R in state State-A
  1023. In State-A moving R
  1024. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1025. predict error 1
  1026. dir: dir isR
  1027. /|\131: O: O262 (predict-no)
  1028. I see 0 and I'm going to do: predict-no
  1029. ENV: Agent did: predict-no for direction R in state State-B
  1030. In State-B moving R
  1031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1032. predict error 0
  1033. dir: dir isL
  1034. -132: O: O263 (predict-yes)
  1035. I see 1 and I'm going to do: predict-yes
  1036. ENV: Agent did: predict-yes for direction L in state State-B
  1037. In State-B moving L
  1038. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1039. predict error 0
  1040. dir: dir isL
  1041. /|133: O: O265 (predict-yes)
  1042. I see 1 and I'm going to do: predict-yes
  1043. ENV: Agent did: predict-yes for direction L in state State-A
  1044. In State-A moving L
  1045. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1046. predict error 1
  1047. dir: dir isR
  1048. \-134: O: O268 (predict-no)
  1049. I see 0 and I'm going to do: predict-no
  1050. ENV: Agent did: predict-no for direction R in state State-A
  1051. In State-A moving R
  1052. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1053. predict error 1
  1054. dir: dir isL
  1055. /|135: O: O270 (predict-no)
  1056. I see 0 and I'm going to do: predict-no
  1057. ENV: Agent did: predict-no for direction L in state State-B
  1058. In State-B moving L
  1059. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1060. predict error 1
  1061. dir: dir isL
  1062. \-/136: O: O271 (predict-yes)
  1063. I see 0 and I'm going to do: predict-yes
  1064. ENV: Agent did: predict-yes for direction L in state State-A
  1065. In State-A moving L
  1066. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1067. predict error 1
  1068. dir: dir isU
  1069. |137: O: O274 (predict-no)
  1070. I see 0 and I'm going to do: predict-no
  1071. ENV: Agent did: predict-no for direction U in state State-A
  1072. In State-A moving U
  1073. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1074. predict error 0
  1075. dir: dir isR
  1076. \-/138: O: O276 (predict-no)
  1077. I see 1 and I'm going to do: predict-no
  1078. ENV: Agent did: predict-no for direction R in state State-A
  1079. In State-A moving R
  1080. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1081. predict error 1
  1082. dir: dir isL
  1083. |\-139: O: O277 (predict-yes)
  1084. I see 0 and I'm going to do: predict-yes
  1085. ENV: Agent did: predict-yes for direction L in state State-B
  1086. In State-B moving L
  1087. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1088. predict error 0
  1089. dir: dir isR
  1090. /|140: O: O279 (predict-yes)
  1091. I see 1 and I'm going to do: predict-yes
  1092. ENV: Agent did: predict-yes for direction R in state State-A
  1093. In State-A moving R
  1094. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1095. predict error 0
  1096. dir: dir isL
  1097. \-141: O: O282 (predict-no)
  1098. I see 1 and I'm going to do: predict-no
  1099. ENV: Agent did: predict-no for direction L in state State-B
  1100. In State-B moving L
  1101. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1102. predict error 1
  1103. dir: dir isR
  1104. /142: O: O283 (predict-yes)
  1105. I see 0 and I'm going to do: predict-yes
  1106. ENV: Agent did: predict-yes for direction R in state State-A
  1107. In State-A moving R
  1108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1109. predict error 0
  1110. dir: dir isR
  1111. |\-143: O: O286 (predict-no)
  1112. I see 1 and I'm going to do: predict-no
  1113. ENV: Agent did: predict-no for direction R in state State-B
  1114. In State-B moving R
  1115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1116. predict error 0
  1117. dir: dir isL
  1118. /|144: O: O287 (predict-yes)
  1119. I see 1 and I'm going to do: predict-yes
  1120. ENV: Agent did: predict-yes for direction L in state State-B
  1121. In State-B moving L
  1122. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1123. predict error 0
  1124. dir: dir isL
  1125. \-/145: O: O289 (predict-yes)
  1126. I see 1 and I'm going to do: predict-yes
  1127. ENV: Agent did: predict-yes for direction L in state State-A
  1128. In State-A moving L
  1129. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1130. predict error 1
  1131. dir: dir isU
  1132. |\-146: O: O292 (predict-no)
  1133. I see 0 and I'm going to do: predict-no
  1134. ENV: Agent did: predict-no for direction U in state State-A
  1135. In State-A moving U
  1136. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1137. predict error 0
  1138. dir: dir isR
  1139. /|\147: O: O294 (predict-no)
  1140. I see 1 and I'm going to do: predict-no
  1141. ENV: Agent did: predict-no for direction R in state State-A
  1142. In State-A moving R
  1143. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1144. predict error 1
  1145. dir: dir isL
  1146. -148: O: O295 (predict-yes)
  1147. I see 0 and I'm going to do: predict-yes
  1148. ENV: Agent did: predict-yes for direction L in state State-B
  1149. In State-B moving L
  1150. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1151. predict error 0
  1152. dir: dir isR
  1153. /|\149: O: O297 (predict-yes)
  1154. I see 1 and I'm going to do: predict-yes
  1155. ENV: Agent did: predict-yes for direction R in state State-A
  1156. In State-A moving R
  1157. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1158. predict error 0
  1159. dir: dir isU
  1160. -/|150: O: O300 (predict-no)
  1161. I see 1 and I'm going to do: predict-no
  1162. ENV: Agent did: predict-no for direction U in state State-B
  1163. In State-B moving U
  1164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1165. predict error 0
  1166. dir: dir isL
  1167. \-/151: O: O301 (predict-yes)
  1168. I see 1 and I'm going to do: predict-yes
  1169. ENV: Agent did: predict-yes for direction L in state State-B
  1170. In State-B moving L
  1171. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1172. predict error 0
  1173. dir: dir isL
  1174. |152: O: O303 (predict-yes)
  1175. I see 1 and I'm going to do: predict-yes
  1176. ENV: Agent did: predict-yes for direction L in state State-A
  1177. In State-A moving L
  1178. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1179. predict error 1
  1180. dir: dir isL
  1181. \-153: O: O305 (predict-yes)
  1182. I see 0 and I'm going to do: predict-yes
  1183. ENV: Agent did: predict-yes for direction L in state State-A
  1184. In State-A moving L
  1185. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1186. predict error 1
  1187. dir: dir isU
  1188. /|\154: O: O308 (predict-no)
  1189. I see 0 and I'm going to do: predict-no
  1190. ENV: Agent did: predict-no for direction U in state State-A
  1191. In State-A moving U
  1192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1193. predict error 0
  1194. dir: dir isL
  1195. -/|155: O: O309 (predict-yes)
  1196. I see 1 and I'm going to do: predict-yes
  1197. ENV: Agent did: predict-yes for direction L in state State-A
  1198. In State-A moving L
  1199. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1200. predict error 1
  1201. dir: dir isU
  1202. \-156: O: O312 (predict-no)
  1203. I see 0 and I'm going to do: predict-no
  1204. ENV: Agent did: predict-no for direction U in state State-A
  1205. In State-A moving U
  1206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1207. predict error 0
  1208. dir: dir isU
  1209. /|157: O: O313 (predict-yes)
  1210. I see 1 and I'm going to do: predict-yes
  1211. ENV: Agent did: predict-yes for direction U in state State-A
  1212. In State-A moving U
  1213. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1214. predict error 1
  1215. dir: dir isR
  1216. \-158: O: O315 (predict-yes)
  1217. I see 0 and I'm going to do: predict-yes
  1218. ENV: Agent did: predict-yes for direction R in state State-A
  1219. In State-A moving R
  1220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1221. predict error 0
  1222. dir: dir isL
  1223. /159: O: O317 (predict-yes)
  1224. I see 1 and I'm going to do: predict-yes
  1225. ENV: Agent did: predict-yes for direction L in state State-B
  1226. In State-B moving L
  1227. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1228. predict error 0
  1229. dir: dir isU
  1230. |\-160: O: O320 (predict-no)
  1231. I see 1 and I'm going to do: predict-no
  1232. ENV: Agent did: predict-no for direction U in state State-A
  1233. In State-A moving U
  1234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1235. predict error 0
  1236. dir: dir isU
  1237. /|161: O: O322 (predict-no)
  1238. I see 1 and I'm going to do: predict-no
  1239. ENV: Agent did: predict-no for direction U in state State-A
  1240. In State-A moving U
  1241. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1242. predict error 0
  1243. dir: dir isR
  1244. \162: O: O323 (predict-yes)
  1245. I see 1 and I'm going to do: predict-yes
  1246. ENV: Agent did: predict-yes for direction R in state State-A
  1247. In State-A moving R
  1248. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1249. predict error 0
  1250. dir: dir isL
  1251. -/163: O: O325 (predict-yes)
  1252. I see 1 and I'm going to do: predict-yes
  1253. ENV: Agent did: predict-yes for direction L in state State-B
  1254. In State-B moving L
  1255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1256. predict error 0
  1257. dir: dir isR
  1258. |\-164: O: O327 (predict-yes)
  1259. I see 1 and I'm going to do: predict-yes
  1260. ENV: Agent did: predict-yes for direction R in state State-A
  1261. In State-A moving R
  1262. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1263. predict error 0
  1264. dir: dir isR
  1265. /|\165: O: O329 (predict-yes)
  1266. I see 1 and I'm going to do: predict-yes
  1267. ENV: Agent did: predict-yes for direction R in state State-B
  1268. In State-B moving R
  1269. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1270. predict error 1
  1271. dir: dir isR
  1272. -/166: O: O332 (predict-no)
  1273. I see 0 and I'm going to do: predict-no
  1274. ENV: Agent did: predict-no for direction R in state State-B
  1275. In State-B moving R
  1276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1277. predict error 0
  1278. dir: dir isL
  1279. |\-167: O: O333 (predict-yes)
  1280. I see 1 and I'm going to do: predict-yes
  1281. ENV: Agent did: predict-yes for direction L in state State-B
  1282. In State-B moving L
  1283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1284. predict error 0
  1285. dir: dir isR
  1286. /|168: O: O335 (predict-yes)
  1287. I see 1 and I'm going to do: predict-yes
  1288. ENV: Agent did: predict-yes for direction R in state State-A
  1289. In State-A moving R
  1290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1291. predict error 0
  1292. dir: dir isL
  1293. \-169: O: O337 (predict-yes)
  1294. I see 1 and I'm going to do: predict-yes
  1295. ENV: Agent did: predict-yes for direction L in state State-B
  1296. In State-B moving L
  1297. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1298. predict error 0
  1299. dir: dir isL
  1300. /|170: O: O339 (predict-yes)
  1301. I see 1 and I'm going to do: predict-yes
  1302. ENV: Agent did: predict-yes for direction L in state State-A
  1303. In State-A moving L
  1304. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1305. predict error 1
  1306. dir: dir isU
  1307. \-171: O: O341 (predict-yes)
  1308. I see 0 and I'm going to do: predict-yes
  1309. ENV: Agent did: predict-yes for direction U in state State-A
  1310. In State-A moving U
  1311. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1312. predict error 1
  1313. dir: dir isU
  1314. /172: O: O344 (predict-no)
  1315. I see 0 and I'm going to do: predict-no
  1316. ENV: Agent did: predict-no for direction U in state State-A
  1317. In State-A moving U
  1318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1319. predict error 0
  1320. dir: dir isL
  1321. |\173: O: O345 (predict-yes)
  1322. I see 1 and I'm going to do: predict-yes
  1323. ENV: Agent did: predict-yes for direction L in state State-A
  1324. In State-A moving L
  1325. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1326. predict error 1
  1327. dir: dir isU
  1328. -/|174: O: O348 (predict-no)
  1329. I see 0 and I'm going to do: predict-no
  1330. ENV: Agent did: predict-no for direction U in state State-A
  1331. In State-A moving U
  1332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1333. predict error 0
  1334. dir: dir isL
  1335. \-/175: O: O350 (predict-no)
  1336. I see 1 and I'm going to do: predict-no
  1337. ENV: Agent did: predict-no for direction L in state State-A
  1338. In State-A moving L
  1339. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1340. predict error 0
  1341. dir: dir isU
  1342. |\-/176: O: O352 (predict-no)
  1343. I see 1 and I'm going to do: predict-no
  1344. ENV: Agent did: predict-no for direction U in state State-A
  1345. In State-A moving U
  1346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1347. predict error 0
  1348. dir: dir isU
  1349. |\-177: O: O354 (predict-no)
  1350. I see 1 and I'm going to do: predict-no
  1351. ENV: Agent did: predict-no for direction U in state State-A
  1352. In State-A moving U
  1353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1354. predict error 0
  1355. dir: dir isR
  1356. /|\-178: O: O355 (predict-yes)
  1357. I see 1 and I'm going to do: predict-yes
  1358. ENV: Agent did: predict-yes for direction R in state State-A
  1359. In State-A moving R
  1360. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1361. predict error 0
  1362. dir: dir isL
  1363. /|\179: O: O357 (predict-yes)
  1364. I see 1 and I'm going to do: predict-yes
  1365. ENV: Agent did: predict-yes for direction L in state State-B
  1366. In State-B moving L
  1367. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1368. predict error 0
  1369. dir: dir isL
  1370. -/|180: O: O360 (predict-no)
  1371. I see 1 and I'm going to do: predict-no
  1372. ENV: Agent did: predict-no for direction L in state State-A
  1373. In State-A moving L
  1374. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1375. predict error 0
  1376. dir: dir isU
  1377. \-/181: O: O362 (predict-no)
  1378. I see 1 and I'm going to do: predict-no
  1379. ENV: Agent did: predict-no for direction U in state State-A
  1380. In State-A moving U
  1381. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1382. predict error 0
  1383. dir: dir isL
  1384. |182: O: O363 (predict-yes)
  1385. I see 1 and I'm going to do: predict-yes
  1386. ENV: Agent did: predict-yes for direction L in state State-A
  1387. In State-A moving L
  1388. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1389. predict error 1
  1390. dir: dir isU
  1391. \-183: O: O366 (predict-no)
  1392. I see 0 and I'm going to do: predict-no
  1393. ENV: Agent did: predict-no for direction U in state State-A
  1394. In State-A moving U
  1395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1396. predict error 0
  1397. dir: dir isU
  1398. /|\-184: O: O367 (predict-yes)
  1399. I see 1 and I'm going to do: predict-yes
  1400. ENV: Agent did: predict-yes for direction U in state State-A
  1401. In State-A moving U
  1402. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1403. predict error 1
  1404. dir: dir isR
  1405. /|\185: O: O370 (predict-no)
  1406. I see 0 and I'm going to do: predict-no
  1407. ENV: Agent did: predict-no for direction R in state State-A
  1408. In State-A moving R
  1409. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1410. predict error 1
  1411. dir: dir isL
  1412. -/|186: O: O372 (predict-no)
  1413. I see 0 and I'm going to do: predict-no
  1414. ENV: Agent did: predict-no for direction L in state State-B
  1415. In State-B moving L
  1416. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1417. predict error 1
  1418. dir: dir isU
  1419. \-/187: O: O374 (predict-no)
  1420. I see 0 and I'm going to do: predict-no
  1421. ENV: Agent did: predict-no for direction U in state State-A
  1422. In State-A moving U
  1423. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1424. predict error 0
  1425. dir: dir isU
  1426. |188: O: O376 (predict-no)
  1427. I see 1 and I'm going to do: predict-no
  1428. ENV: Agent did: predict-no for direction U in state State-A
  1429. In State-A moving U
  1430. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1431. predict error 0
  1432. dir: dir isU
  1433. \-189: O: O377 (predict-yes)
  1434. I see 1 and I'm going to do: predict-yes
  1435. ENV: Agent did: predict-yes for direction U in state State-A
  1436. In State-A moving U
  1437. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1438. predict error 1
  1439. dir: dir isR
  1440. /|190: O: O379 (predict-yes)
  1441. I see 0 and I'm going to do: predict-yes
  1442. ENV: Agent did: predict-yes for direction R in state State-A
  1443. In State-A moving R
  1444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1445. predict error 0
  1446. dir: dir isR
  1447. \-191: O: O382 (predict-no)
  1448. I see 1 and I'm going to do: predict-no
  1449. ENV: Agent did: predict-no for direction R in state State-B
  1450. In State-B moving R
  1451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1452. predict error 0
  1453. dir: dir isR
  1454. /192: O: O384 (predict-no)
  1455. I see 1 and I'm going to do: predict-no
  1456. ENV: Agent did: predict-no for direction R in state State-B
  1457. In State-B moving R
  1458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1459. predict error 0
  1460. dir: dir isL
  1461. |193: O: O385 (predict-yes)
  1462. I see 1 and I'm going to do: predict-yes
  1463. ENV: Agent did: predict-yes for direction L in state State-B
  1464. In State-B moving L
  1465. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1466. predict error 0
  1467. dir: dir isU
  1468. \-/194: O: O388 (predict-no)
  1469. I see 1 and I'm going to do: predict-no
  1470. ENV: Agent did: predict-no for direction U in state State-A
  1471. In State-A moving U
  1472. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1473. predict error 0
  1474. dir: dir isR
  1475. |\-195: O: O389 (predict-yes)
  1476. I see 1 and I'm going to do: predict-yes
  1477. ENV: Agent did: predict-yes for direction R in state State-A
  1478. In State-A moving R
  1479. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1480. predict error 0
  1481. dir: dir isL
  1482. /|\196: O: O391 (predict-yes)
  1483. I see 1 and I'm going to do: predict-yes
  1484. ENV: Agent did: predict-yes for direction L in state State-B
  1485. In State-B moving L
  1486. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1487. predict error 0
  1488. dir: dir isL
  1489. -197: O: O394 (predict-no)
  1490. I see 1 and I'm going to do: predict-no
  1491. ENV: Agent did: predict-no for direction L in state State-A
  1492. In State-A moving L
  1493. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1494. predict error 0
  1495. dir: dir isR
  1496. /|\198: O: O395 (predict-yes)
  1497. I see 1 and I'm going to do: predict-yes
  1498. ENV: Agent did: predict-yes for direction R in state State-A
  1499. In State-A moving R
  1500. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1501. predict error 0
  1502. dir: dir isL
  1503. -/|199: O: O397 (predict-yes)
  1504. I see 1 and I'm going to do: predict-yes
  1505. ENV: Agent did: predict-yes for direction L in state State-B
  1506. In State-B moving L
  1507. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1508. predict error 0
  1509. dir: dir isR
  1510. \-/200: O: O399 (predict-yes)
  1511. I see 1 and I'm going to do: predict-yes
  1512. ENV: Agent did: predict-yes for direction R in state State-A
  1513. In State-A moving R
  1514. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1515. predict error 0
  1516. dir: dir isL
  1517. |\-201: O: O401 (predict-yes)
  1518. I see 1 and I'm going to do: predict-yes
  1519. ENV: Agent did: predict-yes for direction L in state State-B
  1520. In State-B moving L
  1521. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1522. predict error 0
  1523. dir: dir isU
  1524. /|202: O: O404 (predict-no)
  1525. I see 1 and I'm going to do: predict-no
  1526. ENV: Agent did: predict-no for direction U in state State-A
  1527. In State-A moving U
  1528. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1529. predict error 0
  1530. dir: dir isU
  1531. \-203: O: O406 (predict-no)
  1532. I see 1 and I'm going to do: predict-no
  1533. ENV: Agent did: predict-no for direction U in state State-A
  1534. In State-A moving U
  1535. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1536. predict error 0
  1537. dir: dir isL
  1538. /|\204: O: O408 (predict-no)
  1539. I see 1 and I'm going to do: predict-no
  1540. ENV: Agent did: predict-no for direction L in state State-A
  1541. In State-A moving L
  1542. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1543. predict error 0
  1544. dir: dir isL
  1545. -205: O: O409 (predict-yes)
  1546. I see 1 and I'm going to do: predict-yes
  1547. ENV: Agent did: predict-yes for direction L in state State-A
  1548. In State-A moving L
  1549. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1550. predict error 1
  1551. dir: dir isL
  1552. /|\206: O: O412 (predict-no)
  1553. I see 0 and I'm going to do: predict-no
  1554. ENV: Agent did: predict-no for direction L in state State-A
  1555. In State-A moving L
  1556. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1557. predict error 0
  1558. dir: dir isU
  1559. -/|207: O: O414 (predict-no)
  1560. I see 1 and I'm going to do: predict-no
  1561. ENV: Agent did: predict-no for direction U in state State-A
  1562. In State-A moving U
  1563. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1564. predict error 0
  1565. dir: dir isU
  1566. \-/208: O: O416 (predict-no)
  1567. I see 1 and I'm going to do: predict-no
  1568. ENV: Agent did: predict-no for direction U in state State-A
  1569. In State-A moving U
  1570. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1571. predict error 0
  1572. dir: dir isR
  1573. |\209: O: O417 (predict-yes)
  1574. I see 1 and I'm going to do: predict-yes
  1575. ENV: Agent did: predict-yes for direction R in state State-A
  1576. In State-A moving R
  1577. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1578. predict error 0
  1579. dir: dir isL
  1580. -/|210: O: O419 (predict-yes)
  1581. I see 1 and I'm going to do: predict-yes
  1582. ENV: Agent did: predict-yes for direction L in state State-B
  1583. In State-B moving L
  1584. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1585. predict error 0
  1586. dir: dir isU
  1587. \-/211: O: O422 (predict-no)
  1588. I see 1 and I'm going to do: predict-no
  1589. ENV: Agent did: predict-no for direction U in state State-A
  1590. In State-A moving U
  1591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1592. predict error 0
  1593. dir: dir isU
  1594. |212: O: O424 (predict-no)
  1595. I see 1 and I'm going to do: predict-no
  1596. ENV: Agent did: predict-no for direction U in state State-A
  1597. In State-A moving U
  1598. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1599. predict error 0
  1600. dir: dir isU
  1601. \-/213: O: O426 (predict-no)
  1602. I see 1 and I'm going to do: predict-no
  1603. ENV: Agent did: predict-no for direction U in state State-A
  1604. In State-A moving U
  1605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1606. predict error 0
  1607. dir: dir isR
  1608. |\-214: O: O427 (predict-yes)
  1609. I see 1 and I'm going to do: predict-yes
  1610. ENV: Agent did: predict-yes for direction R in state State-A
  1611. In State-A moving R
  1612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1613. predict error 0
  1614. dir: dir isU
  1615. /|215: O: O430 (predict-no)
  1616. I see 1 and I'm going to do: predict-no
  1617. ENV: Agent did: predict-no for direction U in state State-B
  1618. In State-B moving U
  1619. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1620. predict error 0
  1621. dir: dir isU
  1622. \216: O: O432 (predict-no)
  1623. I see 1 and I'm going to do: predict-no
  1624. ENV: Agent did: predict-no for direction U in state State-B
  1625. In State-B moving U
  1626. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1627. predict error 0
  1628. dir: dir isR
  1629. -/|217: O: O434 (predict-no)
  1630. I see 1 and I'm going to do: predict-no
  1631. ENV: Agent did: predict-no for direction R in state State-B
  1632. In State-B moving R
  1633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1634. predict error 0
  1635. dir: dir isU
  1636. \-/218: O: O436 (predict-no)
  1637. I see 1 and I'm going to do: predict-no
  1638. ENV: Agent did: predict-no for direction U in state State-B
  1639. In State-B moving U
  1640. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1641. predict error 0
  1642. dir: dir isL
  1643. |\-219: O: O437 (predict-yes)
  1644. I see 1 and I'm going to do: predict-yes
  1645. ENV: Agent did: predict-yes for direction L in state State-B
  1646. In State-B moving L
  1647. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1648. predict error 0
  1649. dir: dir isU
  1650. /|220: O: O439 (predict-yes)
  1651. I see 1 and I'm going to do: predict-yes
  1652. ENV: Agent did: predict-yes for direction U in state State-A
  1653. In State-A moving U
  1654. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1655. predict error 1
  1656. dir: dir isL
  1657. \-/|221: O: O442 (predict-no)
  1658. I see 0 and I'm going to do: predict-no
  1659. ENV: Agent did: predict-no for direction L in state State-A
  1660. In State-A moving L
  1661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1662. predict error 0
  1663. dir: dir isL
  1664. \222: O: O444 (predict-no)
  1665. I see 1 and I'm going to do: predict-no
  1666. ENV: Agent did: predict-no for direction L in state State-A
  1667. In State-A moving L
  1668. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1669. predict error 0
  1670. dir: dir isU
  1671. -/|223: O: O445 (predict-yes)
  1672. I see 1 and I'm going to do: predict-yes
  1673. ENV: Agent did: predict-yes for direction U in state State-A
  1674. In State-A moving U
  1675. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1676. predict error 1
  1677. dir: dir isL
  1678. \-/|sleeping...
  1679. \224: O: O448 (predict-no)
  1680. I see 0 and I'm going to do: predict-no
  1681. ENV: Agent did: predict-no for direction L in state State-A
  1682. In State-A moving L
  1683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1684. predict error 0
  1685. dir: dir isU
  1686. -/|225: O: O450 (predict-no)
  1687. I see 1 and I'm going to do: predict-no
  1688. ENV: Agent did: predict-no for direction U in state State-A
  1689. In State-A moving U
  1690. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1691. predict error 0
  1692. dir: dir isR
  1693. \-/226: O: O451 (predict-yes)
  1694. I see 1 and I'm going to do: predict-yes
  1695. ENV: Agent did: predict-yes for direction R in state State-A
  1696. In State-A moving R
  1697. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1698. predict error 0
  1699. dir: dir isU
  1700. |\-/227: O: O454 (predict-no)
  1701. I see 1 and I'm going to do: predict-no
  1702. ENV: Agent did: predict-no for direction U in state State-B
  1703. In State-B moving U
  1704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1705. predict error 0
  1706. dir: dir isR
  1707. |\-/228: O: O455 (predict-yes)
  1708. I see 1 and I'm going to do: predict-yes
  1709. ENV: Agent did: predict-yes for direction R in state State-B
  1710. In State-B moving R
  1711. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1712. predict error 1
  1713. dir: dir isR
  1714. |\-229: O: O458 (predict-no)
  1715. I see 0 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction R in state State-B
  1717. In State-B moving R
  1718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1719. predict error 0
  1720. dir: dir isL
  1721. /|\230: O: O459 (predict-yes)
  1722. I see 1 and I'm going to do: predict-yes
  1723. ENV: Agent did: predict-yes for direction L in state State-B
  1724. In State-B moving L
  1725. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1726. predict error 0
  1727. dir: dir isU
  1728. -/231: O: O461 (predict-yes)
  1729. I see 1 and I'm going to do: predict-yes
  1730. ENV: Agent did: predict-yes for direction U in state State-A
  1731. In State-A moving U
  1732. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1733. predict error 1
  1734. dir: dir isR
  1735. |232: O: O463 (predict-yes)
  1736. I see 0 and I'm going to do: predict-yes
  1737. ENV: Agent did: predict-yes for direction R in state State-A
  1738. In State-A moving R
  1739. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1740. predict error 0
  1741. dir: dir isU
  1742. \-/233: O: O466 (predict-no)
  1743. I see 1 and I'm going to do: predict-no
  1744. ENV: Agent did: predict-no for direction U in state State-B
  1745. In State-B moving U
  1746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1747. predict error 0
  1748. dir: dir isU
  1749. |\-234: O: O468 (predict-no)
  1750. I see 1 and I'm going to do: predict-no
  1751. ENV: Agent did: predict-no for direction U in state State-B
  1752. In State-B moving U
  1753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1754. predict error 0
  1755. dir: dir isL
  1756. /|235: O: O469 (predict-yes)
  1757. I see 1 and I'm going to do: predict-yes
  1758. ENV: Agent did: predict-yes for direction L in state State-B
  1759. In State-B moving L
  1760. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1761. predict error 0
  1762. dir: dir isR
  1763. \-236: O: O471 (predict-yes)
  1764. I see 1 and I'm going to do: predict-yes
  1765. ENV: Agent did: predict-yes for direction R in state State-A
  1766. In State-A moving R
  1767. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1768. predict error 0
  1769. dir: dir isL
  1770. /|\237: O: O473 (predict-yes)
  1771. I see 1 and I'm going to do: predict-yes
  1772. ENV: Agent did: predict-yes for direction L in state State-B
  1773. In State-B moving L
  1774. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1775. predict error 0
  1776. dir: dir isL
  1777. -/238: O: O475 (predict-yes)
  1778. I see 1 and I'm going to do: predict-yes
  1779. ENV: Agent did: predict-yes for direction L in state State-A
  1780. In State-A moving L
  1781. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1782. predict error 1
  1783. dir: dir isL
  1784. |239: O: O478 (predict-no)
  1785. I see 0 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction L in state State-A
  1787. In State-A moving L
  1788. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1789. predict error 0
  1790. dir: dir isU
  1791. \-240: O: O480 (predict-no)
  1792. I see 1 and I'm going to do: predict-no
  1793. ENV: Agent did: predict-no for direction U in state State-A
  1794. In State-A moving U
  1795. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1796. predict error 0
  1797. dir: dir isU
  1798. /|\241: O: O482 (predict-no)
  1799. I see 1 and I'm going to do: predict-no
  1800. ENV: Agent did: predict-no for direction U in state State-A
  1801. In State-A moving U
  1802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1803. predict error 0
  1804. dir: dir isU
  1805. -242: O: O484 (predict-no)
  1806. I see 1 and I'm going to do: predict-no
  1807. ENV: Agent did: predict-no for direction U in state State-A
  1808. In State-A moving U
  1809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1810. predict error 0
  1811. dir: dir isR
  1812. /|\243: O: O485 (predict-yes)
  1813. I see 1 and I'm going to do: predict-yes
  1814. ENV: Agent did: predict-yes for direction R in state State-A
  1815. In State-A moving R
  1816. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1817. predict error 0
  1818. dir: dir isR
  1819. -/|244: O: O487 (predict-yes)
  1820. I see 1 and I'm going to do: predict-yes
  1821. ENV: Agent did: predict-yes for direction R in state State-B
  1822. In State-B moving R
  1823. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1824. predict error 1
  1825. dir: dir isU
  1826. \245: O: O490 (predict-no)
  1827. I see 0 and I'm going to do: predict-no
  1828. ENV: Agent did: predict-no for direction U in state State-B
  1829. In State-B moving U
  1830. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1831. predict error 0
  1832. dir: dir isR
  1833. -/|246: O: O492 (predict-no)
  1834. I see 1 and I'm going to do: predict-no
  1835. ENV: Agent did: predict-no for direction R in state State-B
  1836. In State-B moving R
  1837. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1838. predict error 0
  1839. dir: dir isR
  1840. \-/247: O: O494 (predict-no)
  1841. I see 1 and I'm going to do: predict-no
  1842. ENV: Agent did: predict-no for direction R in state State-B
  1843. In State-B moving R
  1844. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1845. predict error 0
  1846. dir: dir isL
  1847. |\248: O: O495 (predict-yes)
  1848. I see 1 and I'm going to do: predict-yes
  1849. ENV: Agent did: predict-yes for direction L in state State-B
  1850. In State-B moving L
  1851. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1852. predict error 0
  1853. dir: dir isL
  1854. -/|\249: O: O498 (predict-no)
  1855. I see 1 and I'm going to do: predict-no
  1856. ENV: Agent did: predict-no for direction L in state State-A
  1857. In State-A moving L
  1858. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1859. predict error 0
  1860. dir: dir isL
  1861. -/|250: O: O500 (predict-no)
  1862. I see 1 and I'm going to do: predict-no
  1863. ENV: Agent did: predict-no for direction L in state State-A
  1864. In State-A moving L
  1865. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1866. predict error 0
  1867. dir: dir isU
  1868. \-251: O: O502 (predict-no)
  1869. I see 1 and I'm going to do: predict-no
  1870. ENV: Agent did: predict-no for direction U in state State-A
  1871. In State-A moving U
  1872. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1873. predict error 0
  1874. dir: dir isR
  1875. /252: O: O503 (predict-yes)
  1876. I see 1 and I'm going to do: predict-yes
  1877. ENV: Agent did: predict-yes for direction R in state State-A
  1878. In State-A moving R
  1879. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1880. predict error 0
  1881. dir: dir isU
  1882. |\253: O: O506 (predict-no)
  1883. I see 1 and I'm going to do: predict-no
  1884. ENV: Agent did: predict-no for direction U in state State-B
  1885. In State-B moving U
  1886. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1887. predict error 0
  1888. dir: dir isR
  1889. -254: O: O507 (predict-yes)
  1890. I see 1 and I'm going to do: predict-yes
  1891. ENV: Agent did: predict-yes for direction R in state State-B
  1892. In State-B moving R
  1893. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1894. predict error 1
  1895. dir: dir isL
  1896. /|255: O: O510 (predict-no)
  1897. I see 0 and I'm going to do: predict-no
  1898. ENV: Agent did: predict-no for direction L in state State-B
  1899. In State-B moving L
  1900. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1901. predict error 1
  1902. dir: dir isU
  1903. \-/256: O: O511 (predict-yes)
  1904. I see 0 and I'm going to do: predict-yes
  1905. ENV: Agent did: predict-yes for direction U in state State-A
  1906. In State-A moving U
  1907. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1908. predict error 1
  1909. dir: dir isU
  1910. |\-257: O: O514 (predict-no)
  1911. I see 0 and I'm going to do: predict-no
  1912. ENV: Agent did: predict-no for direction U in state State-A
  1913. In State-A moving U
  1914. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1915. predict error 0
  1916. dir: dir isL
  1917. /|258: O: O516 (predict-no)
  1918. I see 1 and I'm going to do: predict-no
  1919. ENV: Agent did: predict-no for direction L in state State-A
  1920. In State-A moving L
  1921. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1922. predict error 0
  1923. dir: dir isU
  1924. \-/259: O: O518 (predict-no)
  1925. I see 1 and I'm going to do: predict-no
  1926. ENV: Agent did: predict-no for direction U in state State-A
  1927. In State-A moving U
  1928. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1929. predict error 0
  1930. dir: dir isL
  1931. |\-260: O: O520 (predict-no)
  1932. I see 1 and I'm going to do: predict-no
  1933. ENV: Agent did: predict-no for direction L in state State-A
  1934. In State-A moving L
  1935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1936. predict error 0
  1937. dir: dir isL
  1938. /|261: O: O522 (predict-no)
  1939. I see 1 and I'm going to do: predict-no
  1940. ENV: Agent did: predict-no for direction L in state State-A
  1941. In State-A moving L
  1942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1943. predict error 0
  1944. dir: dir isU
  1945. \262: O: O524 (predict-no)
  1946. I see 1 and I'm going to do: predict-no
  1947. ENV: Agent did: predict-no for direction U in state State-A
  1948. In State-A moving U
  1949. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1950. predict error 0
  1951. dir: dir isL
  1952. -/|263: O: O526 (predict-no)
  1953. I see 1 and I'm going to do: predict-no
  1954. ENV: Agent did: predict-no for direction L in state State-A
  1955. In State-A moving L
  1956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1957. predict error 0
  1958. dir: dir isL
  1959. \-/264: O: O528 (predict-no)
  1960. I see 1 and I'm going to do: predict-no
  1961. ENV: Agent did: predict-no for direction L in state State-A
  1962. In State-A moving L
  1963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1964. predict error 0
  1965. dir: dir isU
  1966. |\-265: O: O530 (predict-no)
  1967. I see 1 and I'm going to do: predict-no
  1968. ENV: Agent did: predict-no for direction U in state State-A
  1969. In State-A moving U
  1970. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1971. predict error 0
  1972. dir: dir isR
  1973. /|266: O: O532 (predict-no)
  1974. I see 1 and I'm going to do: predict-no
  1975. ENV: Agent did: predict-no for direction R in state State-A
  1976. In State-A moving R
  1977. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1978. predict error 1
  1979. dir: dir isL
  1980. \-/267: O: O534 (predict-no)
  1981. I see 0 and I'm going to do: predict-no
  1982. ENV: Agent did: predict-no for direction L in state State-B
  1983. In State-B moving L
  1984. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1985. predict error 1
  1986. dir: dir isL
  1987. |\-268: O: O536 (predict-no)
  1988. I see 0 and I'm going to do: predict-no
  1989. ENV: Agent did: predict-no for direction L in state State-A
  1990. In State-A moving L
  1991. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1992. predict error 0
  1993. dir: dir isL
  1994. /269: O: O538 (predict-no)
  1995. I see 1 and I'm going to do: predict-no
  1996. ENV: Agent did: predict-no for direction L in state State-A
  1997. In State-A moving L
  1998. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1999. predict error 0
  2000. dir: dir isU
  2001. |\270: O: O540 (predict-no)
  2002. I see 1 and I'm going to do: predict-no
  2003. ENV: Agent did: predict-no for direction U in state State-A
  2004. In State-A moving U
  2005. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2006. predict error 0
  2007. dir: dir isL
  2008. -/271: O: O542 (predict-no)
  2009. I see 1 and I'm going to do: predict-no
  2010. ENV: Agent did: predict-no for direction L in state State-A
  2011. In State-A moving L
  2012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2013. predict error 0
  2014. dir: dir isU
  2015. |272: O: O544 (predict-no)
  2016. I see 1 and I'm going to do: predict-no
  2017. ENV: Agent did: predict-no for direction U in state State-A
  2018. In State-A moving U
  2019. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2020. predict error 0
  2021. dir: dir isR
  2022. \-/273: O: O545 (predict-yes)
  2023. I see 1 and I'm going to do: predict-yes
  2024. ENV: Agent did: predict-yes for direction R in state State-A
  2025. In State-A moving R
  2026. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2027. predict error 0
  2028. dir: dir isU
  2029. |274: O: O548 (predict-no)
  2030. I see 1 and I'm going to do: predict-no
  2031. ENV: Agent did: predict-no for direction U in state State-B
  2032. In State-B moving U
  2033. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2034. predict error 0
  2035. dir: dir isU
  2036. \-275: O: O550 (predict-no)
  2037. I see 1 and I'm going to do: predict-no
  2038. ENV: Agent did: predict-no for direction U in state State-B
  2039. In State-B moving U
  2040. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2041. predict error 0
  2042. dir: dir isL
  2043. /|276: O: O551 (predict-yes)
  2044. I see 1 and I'm going to do: predict-yes
  2045. ENV: Agent did: predict-yes for direction L in state State-B
  2046. In State-B moving L
  2047. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2048. predict error 0
  2049. dir: dir isL
  2050. \-/277: O: O554 (predict-no)
  2051. I see 1 and I'm going to do: predict-no
  2052. ENV: Agent did: predict-no for direction L in state State-A
  2053. In State-A moving L
  2054. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2055. predict error 0
  2056. dir: dir isR
  2057. |\278: O: O555 (predict-yes)
  2058. I see 1 and I'm going to do: predict-yes
  2059. ENV: Agent did: predict-yes for direction R in state State-A
  2060. In State-A moving R
  2061. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2062. predict error 0
  2063. dir: dir isL
  2064. -/279: O: O557 (predict-yes)
  2065. I see 1 and I'm going to do: predict-yes
  2066. ENV: Agent did: predict-yes for direction L in state State-B
  2067. In State-B moving L
  2068. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2069. predict error 0
  2070. dir: dir isR
  2071. |\-280: O: O559 (predict-yes)
  2072. I see 1 and I'm going to do: predict-yes
  2073. ENV: Agent did: predict-yes for direction R in state State-A
  2074. In State-A moving R
  2075. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2076. predict error 0
  2077. dir: dir isL
  2078. /|281: O: O561 (predict-yes)
  2079. I see 1 and I'm going to do: predict-yes
  2080. ENV: Agent did: predict-yes for direction L in state State-B
  2081. In State-B moving L
  2082. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2083. predict error 0
  2084. dir: dir isL
  2085. \282: O: O563 (predict-yes)
  2086. I see 1 and I'm going to do: predict-yes
  2087. ENV: Agent did: predict-yes for direction L in state State-A
  2088. In State-A moving L
  2089. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2090. predict error 1
  2091. dir: dir isU
  2092. -/|283: O: O566 (predict-no)
  2093. I see 0 and I'm going to do: predict-no
  2094. ENV: Agent did: predict-no for direction U in state State-A
  2095. In State-A moving U
  2096. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2097. predict error 0
  2098. dir: dir isL
  2099. \-284: O: O568 (predict-no)
  2100. I see 1 and I'm going to do: predict-no
  2101. ENV: Agent did: predict-no for direction L in state State-A
  2102. In State-A moving L
  2103. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2104. predict error 0
  2105. dir: dir isR
  2106. /|285: O: O569 (predict-yes)
  2107. I see 1 and I'm going to do: predict-yes
  2108. ENV: Agent did: predict-yes for direction R in state State-A
  2109. In State-A moving R
  2110. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2111. predict error 0
  2112. dir: dir isR
  2113. \-/|286: O: O572 (predict-no)
  2114. I see 1 and I'm going to do: predict-no
  2115. ENV: Agent did: predict-no for direction R in state State-B
  2116. In State-B moving R
  2117. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2118. predict error 0
  2119. dir: dir isL
  2120. \-/287: O: O574 (predict-no)
  2121. I see 1 and I'm going to do: predict-no
  2122. ENV: Agent did: predict-no for direction L in state State-B
  2123. In State-B moving L
  2124. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2125. predict error 1
  2126. dir: dir isL
  2127. |\-288: O: O576 (predict-no)
  2128. I see 0 and I'm going to do: predict-no
  2129. ENV: Agent did: predict-no for direction L in state State-A
  2130. In State-A moving L
  2131. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2132. predict error 0
  2133. dir: dir isU
  2134. /|\289: O: O578 (predict-no)
  2135. I see 1 and I'm going to do: predict-no
  2136. ENV: Agent did: predict-no for direction U in state State-A
  2137. In State-A moving U
  2138. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2139. predict error 0
  2140. dir: dir isU
  2141. -/|290: O: O580 (predict-no)
  2142. I see 1 and I'm going to do: predict-no
  2143. ENV: Agent did: predict-no for direction U in state State-A
  2144. In State-A moving U
  2145. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2146. predict error 0
  2147. dir: dir isU
  2148. \-/291: O: O582 (predict-no)
  2149. I see 1 and I'm going to do: predict-no
  2150. ENV: Agent did: predict-no for direction U in state State-A
  2151. In State-A moving U
  2152. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2153. predict error 0
  2154. dir: dir isL
  2155. |292: O: O584 (predict-no)
  2156. I see 1 and I'm going to do: predict-no
  2157. ENV: Agent did: predict-no for direction L in state State-A
  2158. In State-A moving L
  2159. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2160. predict error 0
  2161. dir: dir isL
  2162. \-293: O: O586 (predict-no)
  2163. I see 1 and I'm going to do: predict-no
  2164. ENV: Agent did: predict-no for direction L in state State-A
  2165. In State-A moving L
  2166. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2167. predict error 0
  2168. dir: dir isR
  2169. /|\294: O: O587 (predict-yes)
  2170. I see 1 and I'm going to do: predict-yes
  2171. ENV: Agent did: predict-yes for direction R in state State-A
  2172. In State-A moving R
  2173. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2174. predict error 0
  2175. dir: dir isU
  2176. -/|295: O: O590 (predict-no)
  2177. I see 1 and I'm going to do: predict-no
  2178. ENV: Agent did: predict-no for direction U in state State-B
  2179. In State-B moving U
  2180. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2181. predict error 0
  2182. dir: dir isR
  2183. \296: O: O592 (predict-no)
  2184. I see 1 and I'm going to do: predict-no
  2185. ENV: Agent did: predict-no for direction R in state State-B
  2186. In State-B moving R
  2187. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2188. predict error 0
  2189. dir: dir isU
  2190. -/|297: O: O594 (predict-no)
  2191. I see 1 and I'm going to do: predict-no
  2192. ENV: Agent did: predict-no for direction U in state State-B
  2193. In State-B moving U
  2194. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2195. predict error 0
  2196. dir: dir isR
  2197. \-298: O: O596 (predict-no)
  2198. I see 1 and I'm going to do: predict-no
  2199. ENV: Agent did: predict-no for direction R in state State-B
  2200. In State-B moving R
  2201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2202. predict error 0
  2203. dir: dir isL
  2204. /|\299: O: O597 (predict-yes)
  2205. I see 1 and I'm going to do: predict-yes
  2206. ENV: Agent did: predict-yes for direction L in state State-B
  2207. In State-B moving L
  2208. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2209. predict error 0
  2210. dir: dir isR
  2211. -/|300: O: O599 (predict-yes)
  2212. I see 1 and I'm going to do: predict-yes
  2213. ENV: Agent did: predict-yes for direction R in state State-A
  2214. In State-A moving R
  2215. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2216. predict error 0
  2217. dir: dir isL
  2218. \-/|\-301: O: O601 (predict-yes)
  2219. I see 1 and I'm going to do: predict-yes
  2220. ENV: Agent did: predict-yes for direction L in state State-B
  2221. In State-B moving L
  2222. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2223. predict error 0
  2224. dir: dir isL
  2225. /302: O: O604 (predict-no)
  2226. I see 1 and I'm going to do: predict-no
  2227. ENV: Agent did: predict-no for direction L in state State-A
  2228. In State-A moving L
  2229. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2230. predict error 0
  2231. dir: dir isL
  2232. |\303: O: O606 (predict-no)
  2233. I see 1 and I'm going to do: predict-no
  2234. ENV: Agent did: predict-no for direction L in state State-A
  2235. In State-A moving L
  2236. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2237. predict error 0
  2238. dir: dir isL
  2239. -/|304: O: O608 (predict-no)
  2240. I see 1 and I'm going to do: predict-no
  2241. ENV: Agent did: predict-no for direction L in state State-A
  2242. In State-A moving L
  2243. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2244. predict error 0
  2245. dir: dir isU
  2246. \-/305: O: O610 (predict-no)
  2247. I see 1 and I'm going to do: predict-no
  2248. ENV: Agent did: predict-no for direction U in state State-A
  2249. In State-A moving U
  2250. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2251. predict error 0
  2252. dir: dir isR
  2253. |\-306: O: O611 (predict-yes)
  2254. I see 1 and I'm going to do: predict-yes
  2255. ENV: Agent did: predict-yes for direction R in state State-A
  2256. In State-A moving R
  2257. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2258. predict error 0
  2259. dir: dir isR
  2260. /|\307: O: O614 (predict-no)
  2261. I see 1 and I'm going to do: predict-no
  2262. ENV: Agent did: predict-no for direction R in state State-B
  2263. In State-B moving R
  2264. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2265. predict error 0
  2266. dir: dir isR
  2267. -/|308: O: O616 (predict-no)
  2268. I see 1 and I'm going to do: predict-no
  2269. ENV: Agent did: predict-no for direction R in state State-B
  2270. In State-B moving R
  2271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2272. predict error 0
  2273. dir: dir isU
  2274. \-/309: O: O618 (predict-no)
  2275. I see 1 and I'm going to do: predict-no
  2276. ENV: Agent did: predict-no for direction U in state State-B
  2277. In State-B moving U
  2278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2279. predict error 0
  2280. dir: dir isR
  2281. |\-310: O: O620 (predict-no)
  2282. I see 1 and I'm going to do: predict-no
  2283. ENV: Agent did: predict-no for direction R in state State-B
  2284. In State-B moving R
  2285. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2286. predict error 0
  2287. dir: dir isL
  2288. /|\311: O: O621 (predict-yes)
  2289. I see 1 and I'm going to do: predict-yes
  2290. ENV: Agent did: predict-yes for direction L in state State-B
  2291. In State-B moving L
  2292. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2293. predict error 0
  2294. dir: dir isL
  2295. -312: O: O624 (predict-no)
  2296. I see 1 and I'm going to do: predict-no
  2297. ENV: Agent did: predict-no for direction L in state State-A
  2298. In State-A moving L
  2299. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2300. predict error 0
  2301. dir: dir isL
  2302. /|\313: O: O626 (predict-no)
  2303. I see 1 and I'm going to do: predict-no
  2304. ENV: Agent did: predict-no for direction L in state State-A
  2305. In State-A moving L
  2306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2307. predict error 0
  2308. dir: dir isU
  2309. -/314: O: O628 (predict-no)
  2310. I see 1 and I'm going to do: predict-no
  2311. ENV: Agent did: predict-no for direction U in state State-A
  2312. In State-A moving U
  2313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2314. predict error 0
  2315. dir: dir isU
  2316. |\315: O: O630 (predict-no)
  2317. I see 1 and I'm going to do: predict-no
  2318. ENV: Agent did: predict-no for direction U in state State-A
  2319. In State-A moving U
  2320. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2321. predict error 0
  2322. dir: dir isL
  2323. -/316: O: O632 (predict-no)
  2324. I see 1 and I'm going to do: predict-no
  2325. ENV: Agent did: predict-no for direction L in state State-A
  2326. In State-A moving L
  2327. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2328. predict error 0
  2329. dir: dir isR
  2330. |\-317: O: O634 (predict-no)
  2331. I see 1 and I'm going to do: predict-no
  2332. ENV: Agent did: predict-no for direction R in state State-A
  2333. In State-A moving R
  2334. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2335. predict error 1
  2336. dir: dir isR
  2337. /|318: O: O636 (predict-no)
  2338. I see 0 and I'm going to do: predict-no
  2339. ENV: Agent did: predict-no for direction R in state State-B
  2340. In State-B moving R
  2341. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2342. predict error 0
  2343. dir: dir isR
  2344. \-/319: O: O638 (predict-no)
  2345. I see 1 and I'm going to do: predict-no
  2346. ENV: Agent did: predict-no for direction R in state State-B
  2347. In State-B moving R
  2348. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2349. predict error 0
  2350. dir: dir isR
  2351. |\-320: O: O640 (predict-no)
  2352. I see 1 and I'm going to do: predict-no
  2353. ENV: Agent did: predict-no for direction R in state State-B
  2354. In State-B moving R
  2355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2356. predict error 0
  2357. dir: dir isL
  2358. /|321: O: O641 (predict-yes)
  2359. I see 1 and I'm going to do: predict-yes
  2360. ENV: Agent did: predict-yes for direction L in state State-B
  2361. In State-B moving L
  2362. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2363. predict error 0
  2364. dir: dir isL
  2365. \322: O: O643 (predict-yes)
  2366. I see 1 and I'm going to do: predict-yes
  2367. ENV: Agent did: predict-yes for direction L in state State-A
  2368. In State-A moving L
  2369. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2370. predict error 1
  2371. dir: dir isL
  2372. -/|323: O: O645 (predict-yes)
  2373. I see 0 and I'm going to do: predict-yes
  2374. ENV: Agent did: predict-yes for direction L in state State-A
  2375. In State-A moving L
  2376. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2377. predict error 1
  2378. dir: dir isL
  2379. \-/324: O: O648 (predict-no)
  2380. I see 0 and I'm going to do: predict-no
  2381. ENV: Agent did: predict-no for direction L in state State-A
  2382. In State-A moving L
  2383. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2384. predict error 0
  2385. dir: dir isR
  2386. |\325: O: O649 (predict-yes)
  2387. I see 1 and I'm going to do: predict-yes
  2388. ENV: Agent did: predict-yes for direction R in state State-A
  2389. In State-A moving R
  2390. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2391. predict error 0
  2392. dir: dir isL
  2393. -/|326: O: O651 (predict-yes)
  2394. I see 1 and I'm going to do: predict-yes
  2395. ENV: Agent did: predict-yes for direction L in state State-B
  2396. In State-B moving L
  2397. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2398. predict error 0
  2399. dir: dir isL
  2400. \-/327: O: O654 (predict-no)
  2401. I see 1 and I'm going to do: predict-no
  2402. ENV: Agent did: predict-no for direction L in state State-A
  2403. In State-A moving L
  2404. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2405. predict error 0
  2406. dir: dir isR
  2407. |\-328: O: O655 (predict-yes)
  2408. I see 1 and I'm going to do: predict-yes
  2409. ENV: Agent did: predict-yes for direction R in state State-A
  2410. In State-A moving R
  2411. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2412. predict error 0
  2413. dir: dir isL
  2414. /|\329: O: O657 (predict-yes)
  2415. I see 1 and I'm going to do: predict-yes
  2416. ENV: Agent did: predict-yes for direction L in state State-B
  2417. In State-B moving L
  2418. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2419. predict error 0
  2420. dir: dir isU
  2421. -/|330: O: O660 (predict-no)
  2422. I see 1 and I'm going to do: predict-no
  2423. ENV: Agent did: predict-no for direction U in state State-A
  2424. In State-A moving U
  2425. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2426. predict error 0
  2427. dir: dir isR
  2428. \-331: O: O661 (predict-yes)
  2429. I see 1 and I'm going to do: predict-yes
  2430. ENV: Agent did: predict-yes for direction R in state State-A
  2431. In State-A moving R
  2432. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2433. predict error 0
  2434. dir: dir isU
  2435. /332: O: O663 (predict-yes)
  2436. I see 1 and I'm going to do: predict-yes
  2437. ENV: Agent did: predict-yes for direction U in state State-B
  2438. In State-B moving U
  2439. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2440. predict error 1
  2441. dir: dir isL
  2442. |\-333: O: O665 (predict-yes)
  2443. I see 0 and I'm going to do: predict-yes
  2444. ENV: Agent did: predict-yes for direction L in state State-B
  2445. In State-B moving L
  2446. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2447. predict error 0
  2448. dir: dir isR
  2449. /|334: O: O667 (predict-yes)
  2450. I see 1 and I'm going to do: predict-yes
  2451. ENV: Agent did: predict-yes for direction R in state State-A
  2452. In State-A moving R
  2453. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2454. predict error 0
  2455. dir: dir isU
  2456. \-/335: O: O670 (predict-no)
  2457. I see 1 and I'm going to do: predict-no
  2458. ENV: Agent did: predict-no for direction U in state State-B
  2459. In State-B moving U
  2460. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2461. predict error 0
  2462. dir: dir isL
  2463. |\-336: O: O671 (predict-yes)
  2464. I see 1 and I'm going to do: predict-yes
  2465. ENV: Agent did: predict-yes for direction L in state State-B
  2466. In State-B moving L
  2467. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2468. predict error 0
  2469. dir: dir isU
  2470. /|\337: O: O673 (predict-yes)
  2471. I see 1 and I'm going to do: predict-yes
  2472. ENV: Agent did: predict-yes for direction U in state State-A
  2473. In State-A moving U
  2474. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2475. predict error 1
  2476. dir: dir isL
  2477. -/338: O: O676 (predict-no)
  2478. I see 0 and I'm going to do: predict-no
  2479. ENV: Agent did: predict-no for direction L in state State-A
  2480. In State-A moving L
  2481. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2482. predict error 0
  2483. dir: dir isU
  2484. |\339: O: O678 (predict-no)
  2485. I see 1 and I'm going to do: predict-no
  2486. ENV: Agent did: predict-no for direction U in state State-A
  2487. In State-A moving U
  2488. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2489. predict error 0
  2490. dir: dir isU
  2491. -340: O: O680 (predict-no)
  2492. I see 1 and I'm going to do: predict-no
  2493. ENV: Agent did: predict-no for direction U in state State-A
  2494. In State-A moving U
  2495. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2496. predict error 0
  2497. dir: dir isU
  2498. /|341: O: O682 (predict-no)
  2499. I see 1 and I'm going to do: predict-no
  2500. ENV: Agent did: predict-no for direction U in state State-A
  2501. In State-A moving U
  2502. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2503. predict error 0
  2504. dir: dir isL
  2505. \342: O: O684 (predict-no)
  2506. I see 1 and I'm going to do: predict-no
  2507. ENV: Agent did: predict-no for direction L in state State-A
  2508. In State-A moving L
  2509. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2510. predict error 0
  2511. dir: dir isL
  2512. -/|343: O: O686 (predict-no)
  2513. I see 1 and I'm going to do: predict-no
  2514. ENV: Agent did: predict-no for direction L in state State-A
  2515. In State-A moving L
  2516. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2517. predict error 0
  2518. dir: dir isR
  2519. \-/344: O: O687 (predict-yes)
  2520. I see 1 and I'm going to do: predict-yes
  2521. ENV: Agent did: predict-yes for direction R in state State-A
  2522. In State-A moving R
  2523. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2524. predict error 0
  2525. dir: dir isU
  2526. |\-345: O: O689 (predict-yes)
  2527. I see 1 and I'm going to do: predict-yes
  2528. ENV: Agent did: predict-yes for direction U in state State-B
  2529. In State-B moving U
  2530. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2531. predict error 1
  2532. dir: dir isL
  2533. /|\-sleeping...
  2534. /346: O: O691 (predict-yes)
  2535. I see 0 and I'm going to do: predict-yes
  2536. ENV: Agent did: predict-yes for direction L in state State-B
  2537. In State-B moving L
  2538. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2539. predict error 0
  2540. dir: dir isU
  2541. |\-347: O: O693 (predict-yes)
  2542. I see 1 and I'm going to do: predict-yes
  2543. ENV: Agent did: predict-yes for direction U in state State-A
  2544. In State-A moving U
  2545. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2546. predict error 1
  2547. dir: dir isL
  2548. /|\348: O: O696 (predict-no)
  2549. I see 0 and I'm going to do: predict-no
  2550. ENV: Agent did: predict-no for direction L in state State-A
  2551. In State-A moving L
  2552. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2553. predict error 0
  2554. dir: dir isU
  2555. -/|349: O: O698 (predict-no)
  2556. I see 1 and I'm going to do: predict-no
  2557. ENV: Agent did: predict-no for direction U in state State-A
  2558. In State-A moving U
  2559. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2560. predict error 0
  2561. dir: dir isL
  2562. \-/350: O: O700 (predict-no)
  2563. I see 1 and I'm going to do: predict-no
  2564. ENV: Agent did: predict-no for direction L in state State-A
  2565. In State-A moving L
  2566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2567. predict error 0
  2568. dir: dir isL
  2569. |\-351: O: O702 (predict-no)
  2570. I see 1 and I'm going to do: predict-no
  2571. ENV: Agent did: predict-no for direction L in state State-A
  2572. In State-A moving L
  2573. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2574. predict error 0
  2575. dir: dir isU
  2576. /352: O: O704 (predict-no)
  2577. I see 1 and I'm going to do: predict-no
  2578. ENV: Agent did: predict-no for direction U in state State-A
  2579. In State-A moving U
  2580. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2581. predict error 0
  2582. dir: dir isU
  2583. |\353: O: O706 (predict-no)
  2584. I see 1 and I'm going to do: predict-no
  2585. ENV: Agent did: predict-no for direction U in state State-A
  2586. In State-A moving U
  2587. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2588. predict error 0
  2589. dir: dir isU
  2590. -/|354: O: O708 (predict-no)
  2591. I see 1 and I'm going to do: predict-no
  2592. ENV: Agent did: predict-no for direction U in state State-A
  2593. In State-A moving U
  2594. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2595. predict error 0
  2596. dir: dir isU
  2597. \-/355: O: O710 (predict-no)
  2598. I see 1 and I'm going to do: predict-no
  2599. ENV: Agent did: predict-no for direction U in state State-A
  2600. In State-A moving U
  2601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2602. predict error 0
  2603. dir: dir isU
  2604. |\-356: O: O712 (predict-no)
  2605. I see 1 and I'm going to do: predict-no
  2606. ENV: Agent did: predict-no for direction U in state State-A
  2607. In State-A moving U
  2608. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2609. predict error 0
  2610. dir: dir isU
  2611. /|\357: O: O714 (predict-no)
  2612. I see 1 and I'm going to do: predict-no
  2613. ENV: Agent did: predict-no for direction U in state State-A
  2614. In State-A moving U
  2615. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2616. predict error 0
  2617. dir: dir isL
  2618. -/|358: O: O716 (predict-no)
  2619. I see 1 and I'm going to do: predict-no
  2620. ENV: Agent did: predict-no for direction L in state State-A
  2621. In State-A moving L
  2622. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2623. predict error 0
  2624. dir: dir isR
  2625. \-/359: O: O718 (predict-no)
  2626. I see 1 and I'm going to do: predict-no
  2627. ENV: Agent did: predict-no for direction R in state State-A
  2628. In State-A moving R
  2629. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2630. predict error 1
  2631. dir: dir isL
  2632. |\360: O: O719 (predict-yes)
  2633. I see 0 and I'm going to do: predict-yes
  2634. ENV: Agent did: predict-yes for direction L in state State-B
  2635. In State-B moving L
  2636. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2637. predict error 0
  2638. dir: dir isU
  2639. -/|361: O: O722 (predict-no)
  2640. I see 1 and I'm going to do: predict-no
  2641. ENV: Agent did: predict-no for direction U in state State-A
  2642. In State-A moving U
  2643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2644. predict error 0
  2645. dir: dir isU
  2646. \362: O: O724 (predict-no)
  2647. I see 1 and I'm going to do: predict-no
  2648. ENV: Agent did: predict-no for direction U in state State-A
  2649. In State-A moving U
  2650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2651. predict error 0
  2652. dir: dir isL
  2653. -/|363: O: O726 (predict-no)
  2654. I see 1 and I'm going to do: predict-no
  2655. ENV: Agent did: predict-no for direction L in state State-A
  2656. In State-A moving L
  2657. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2658. predict error 0
  2659. dir: dir isL
  2660. \-/364: O: O728 (predict-no)
  2661. I see 1 and I'm going to do: predict-no
  2662. ENV: Agent did: predict-no for direction L in state State-A
  2663. In State-A moving L
  2664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2665. predict error 0
  2666. dir: dir isU
  2667. |\365: O: O730 (predict-no)
  2668. I see 1 and I'm going to do: predict-no
  2669. ENV: Agent did: predict-no for direction U in state State-A
  2670. In State-A moving U
  2671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2672. predict error 0
  2673. dir: dir isU
  2674. -/|366: O: O732 (predict-no)
  2675. I see 1 and I'm going to do: predict-no
  2676. ENV: Agent did: predict-no for direction U in state State-A
  2677. In State-A moving U
  2678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2679. predict error 0
  2680. dir: dir isR
  2681. \-/367: O: O733 (predict-yes)
  2682. I see 1 and I'm going to do: predict-yes
  2683. ENV: Agent did: predict-yes for direction R in state State-A
  2684. In State-A moving R
  2685. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2686. predict error 0
  2687. dir: dir isR
  2688. |\368: O: O735 (predict-yes)
  2689. I see 1 and I'm going to do: predict-yes
  2690. ENV: Agent did: predict-yes for direction R in state State-B
  2691. In State-B moving R
  2692. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2693. predict error 1
  2694. dir: dir isU
  2695. -/|369: O: O738 (predict-no)
  2696. I see 0 and I'm going to do: predict-no
  2697. ENV: Agent did: predict-no for direction U in state State-B
  2698. In State-B moving U
  2699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2700. predict error 0
  2701. dir: dir isR
  2702. \-/370: O: O740 (predict-no)
  2703. I see 1 and I'm going to do: predict-no
  2704. ENV: Agent did: predict-no for direction R in state State-B
  2705. In State-B moving R
  2706. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2707. predict error 0
  2708. dir: dir isR
  2709. |\371: O: O742 (predict-no)
  2710. I see 1 and I'm going to do: predict-no
  2711. ENV: Agent did: predict-no for direction R in state State-B
  2712. In State-B moving R
  2713. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2714. predict error 0
  2715. dir: dir isR
  2716. -372: O: O744 (predict-no)
  2717. I see 1 and I'm going to do: predict-no
  2718. ENV: Agent did: predict-no for direction R in state State-B
  2719. In State-B moving R
  2720. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2721. predict error 0
  2722. dir: dir isL
  2723. /|\373: O: O745 (predict-yes)
  2724. I see 1 and I'm going to do: predict-yes
  2725. ENV: Agent did: predict-yes for direction L in state State-B
  2726. In State-B moving L
  2727. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2728. predict error 0
  2729. dir: dir isL
  2730. -/374: O: O748 (predict-no)
  2731. I see 1 and I'm going to do: predict-no
  2732. ENV: Agent did: predict-no for direction L in state State-A
  2733. In State-A moving L
  2734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2735. predict error 0
  2736. dir: dir isR
  2737. |\-375: O: O749 (predict-yes)
  2738. I see 1 and I'm going to do: predict-yes
  2739. ENV: Agent did: predict-yes for direction R in state State-A
  2740. In State-A moving R
  2741. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2742. predict error 0
  2743. dir: dir isR
  2744. /|\376: O: O752 (predict-no)
  2745. I see 1 and I'm going to do: predict-no
  2746. ENV: Agent did: predict-no for direction R in state State-B
  2747. In State-B moving R
  2748. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2749. predict error 0
  2750. dir: dir isR
  2751. -/|377: O: O754 (predict-no)
  2752. I see 1 and I'm going to do: predict-no
  2753. ENV: Agent did: predict-no for direction R in state State-B
  2754. In State-B moving R
  2755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2756. predict error 0
  2757. dir: dir isL
  2758. \-378: O: O755 (predict-yes)
  2759. I see 1 and I'm going to do: predict-yes
  2760. ENV: Agent did: predict-yes for direction L in state State-B
  2761. In State-B moving L
  2762. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2763. predict error 0
  2764. dir: dir isR
  2765. /|\379: O: O757 (predict-yes)
  2766. I see 1 and I'm going to do: predict-yes
  2767. ENV: Agent did: predict-yes for direction R in state State-A
  2768. In State-A moving R
  2769. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2770. predict error 0
  2771. dir: dir isL
  2772. -/|380: O: O759 (predict-yes)
  2773. I see 1 and I'm going to do: predict-yes
  2774. ENV: Agent did: predict-yes for direction L in state State-B
  2775. In State-B moving L
  2776. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2777. predict error 0
  2778. dir: dir isL
  2779. \-/381: O: O762 (predict-no)
  2780. I see 1 and I'm going to do: predict-no
  2781. ENV: Agent did: predict-no for direction L in state State-A
  2782. In State-A moving L
  2783. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2784. predict error 0
  2785. dir: dir isL
  2786. |382: O: O764 (predict-no)
  2787. I see 1 and I'm going to do: predict-no
  2788. ENV: Agent did: predict-no for direction L in state State-A
  2789. In State-A moving L
  2790. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2791. predict error 0
  2792. dir: dir isU
  2793. \-/383: O: O766 (predict-no)
  2794. I see 1 and I'm going to do: predict-no
  2795. ENV: Agent did: predict-no for direction U in state State-A
  2796. In State-A moving U
  2797. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2798. predict error 0
  2799. dir: dir isR
  2800. |\384: O: O767 (predict-yes)
  2801. I see 1 and I'm going to do: predict-yes
  2802. ENV: Agent did: predict-yes for direction R in state State-A
  2803. In State-A moving R
  2804. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2805. predict error 0
  2806. dir: dir isR
  2807. -/|385: O: O770 (predict-no)
  2808. I see 1 and I'm going to do: predict-no
  2809. ENV: Agent did: predict-no for direction R in state State-B
  2810. In State-B moving R
  2811. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2812. predict error 0
  2813. dir: dir isR
  2814. \-386: O: O772 (predict-no)
  2815. I see 1 and I'm going to do: predict-no
  2816. ENV: Agent did: predict-no for direction R in state State-B
  2817. In State-B moving R
  2818. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2819. predict error 0
  2820. dir: dir isL
  2821. /|\387: O: O773 (predict-yes)
  2822. I see 1 and I'm going to do: predict-yes
  2823. ENV: Agent did: predict-yes for direction L in state State-B
  2824. In State-B moving L
  2825. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2826. predict error 0
  2827. dir: dir isL
  2828. -/|388: O: O776 (predict-no)
  2829. I see 1 and I'm going to do: predict-no
  2830. ENV: Agent did: predict-no for direction L in state State-A
  2831. In State-A moving L
  2832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2833. predict error 0
  2834. dir: dir isR
  2835. \-389: O: O777 (predict-yes)
  2836. I see 1 and I'm going to do: predict-yes
  2837. ENV: Agent did: predict-yes for direction R in state State-A
  2838. In State-A moving R
  2839. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2840. predict error 0
  2841. dir: dir isR
  2842. /|\390: O: O779 (predict-yes)
  2843. I see 1 and I'm going to do: predict-yes
  2844. ENV: Agent did: predict-yes for direction R in state State-B
  2845. In State-B moving R
  2846. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2847. predict error 1
  2848. dir: dir isR
  2849. -/|391: O: O782 (predict-no)
  2850. I see 0 and I'm going to do: predict-no
  2851. ENV: Agent did: predict-no for direction R in state State-B
  2852. In State-B moving R
  2853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2854. predict error 0
  2855. dir: dir isR
  2856. \392: O: O784 (predict-no)
  2857. I see 1 and I'm going to do: predict-no
  2858. ENV: Agent did: predict-no for direction R in state State-B
  2859. In State-B moving R
  2860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2861. predict error 0
  2862. dir: dir isU
  2863. -/|393: O: O786 (predict-no)
  2864. I see 1 and I'm going to do: predict-no
  2865. ENV: Agent did: predict-no for direction U in state State-B
  2866. In State-B moving U
  2867. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2868. predict error 0
  2869. dir: dir isU
  2870. \-/394: O: O788 (predict-no)
  2871. I see 1 and I'm going to do: predict-no
  2872. ENV: Agent did: predict-no for direction U in state State-B
  2873. In State-B moving U
  2874. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2875. predict error 0
  2876. dir: dir isL
  2877. |\395: O: O789 (predict-yes)
  2878. I see 1 and I'm going to do: predict-yes
  2879. ENV: Agent did: predict-yes for direction L in state State-B
  2880. In State-B moving L
  2881. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2882. predict error 0
  2883. dir: dir isR
  2884. -/|396: O: O791 (predict-yes)
  2885. I see 1 and I'm going to do: predict-yes
  2886. ENV: Agent did: predict-yes for direction R in state State-A
  2887. In State-A moving R
  2888. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2889. predict error 0
  2890. dir: dir isR
  2891. \-397: O: O794 (predict-no)
  2892. I see 1 and I'm going to do: predict-no
  2893. ENV: Agent did: predict-no for direction R in state State-B
  2894. In State-B moving R
  2895. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2896. predict error 0
  2897. dir: dir isL
  2898. /|\398: O: O795 (predict-yes)
  2899. I see 1 and I'm going to do: predict-yes
  2900. ENV: Agent did: predict-yes for direction L in state State-B
  2901. In State-B moving L
  2902. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2903. predict error 0
  2904. dir: dir isR
  2905. -/|399: O: O797 (predict-yes)
  2906. I see 1 and I'm going to do: predict-yes
  2907. ENV: Agent did: predict-yes for direction R in state State-A
  2908. In State-A moving R
  2909. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2910. predict error 0
  2911. dir: dir isR
  2912. \-/400: O: O800 (predict-no)
  2913. I see 1 and I'm going to do: predict-no
  2914. ENV: Agent did: predict-no for direction R in state State-B
  2915. In State-B moving R
  2916. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2917. predict error 0
  2918. dir: dir isU
  2919. |\-401: O: O802 (predict-no)
  2920. I see 1 and I'm going to do: predict-no
  2921. ENV: Agent did: predict-no for direction U in state State-B
  2922. In State-B moving U
  2923. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2924. predict error 0
  2925. dir: dir isU
  2926. /402: O: O804 (predict-no)
  2927. I see 1 and I'm going to do: predict-no
  2928. ENV: Agent did: predict-no for direction U in state State-B
  2929. In State-B moving U
  2930. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2931. predict error 0
  2932. dir: dir isL
  2933. |\403: O: O805 (predict-yes)
  2934. I see 1 and I'm going to do: predict-yes
  2935. ENV: Agent did: predict-yes for direction L in state State-B
  2936. In State-B moving L
  2937. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2938. predict error 0
  2939. dir: dir isR
  2940. -/404: O: O807 (predict-yes)
  2941. I see 1 and I'm going to do: predict-yes
  2942. ENV: Agent did: predict-yes for direction R in state State-A
  2943. In State-A moving R
  2944. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2945. predict error 0
  2946. dir: dir isL
  2947. |\-405: O: O809 (predict-yes)
  2948. I see 1 and I'm going to do: predict-yes
  2949. ENV: Agent did: predict-yes for direction L in state State-B
  2950. In State-B moving L
  2951. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2952. predict error 0
  2953. dir: dir isL
  2954. /|406: O: O812 (predict-no)
  2955. I see 1 and I'm going to do: predict-no
  2956. ENV: Agent did: predict-no for direction L in state State-A
  2957. In State-A moving L
  2958. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2959. predict error 0
  2960. dir: dir isR
  2961. \-407: O: O813 (predict-yes)
  2962. I see 1 and I'm going to do: predict-yes
  2963. ENV: Agent did: predict-yes for direction R in state State-A
  2964. In State-A moving R
  2965. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2966. predict error 0
  2967. dir: dir isU
  2968. /|\408: O: O816 (predict-no)
  2969. I see 1 and I'm going to do: predict-no
  2970. ENV: Agent did: predict-no for direction U in state State-B
  2971. In State-B moving U
  2972. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2973. predict error 0
  2974. dir: dir isL
  2975. -/409: O: O817 (predict-yes)
  2976. I see 1 and I'm going to do: predict-yes
  2977. ENV: Agent did: predict-yes for direction L in state State-B
  2978. In State-B moving L
  2979. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2980. predict error 0
  2981. dir: dir isU
  2982. |\-410: O: O820 (predict-no)
  2983. I see 1 and I'm going to do: predict-no
  2984. ENV: Agent did: predict-no for direction U in state State-A
  2985. In State-A moving U
  2986. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2987. predict error 0
  2988. dir: dir isU
  2989. /|\411: O: O822 (predict-no)
  2990. I see 1 and I'm going to do: predict-no
  2991. ENV: Agent did: predict-no for direction U in state State-A
  2992. In State-A moving U
  2993. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2994. predict error 0
  2995. dir: dir isL
  2996. -412: O: O824 (predict-no)
  2997. I see 1 and I'm going to do: predict-no
  2998. ENV: Agent did: predict-no for direction L in state State-A
  2999. In State-A moving L
  3000. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3001. predict error 0
  3002. dir: dir isU
  3003. /|413: O: O826 (predict-no)
  3004. I see 1 and I'm going to do: predict-no
  3005. ENV: Agent did: predict-no for direction U in state State-A
  3006. In State-A moving U
  3007. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3008. predict error 0
  3009. dir: dir isU
  3010. \-/414: O: O828 (predict-no)
  3011. I see 1 and I'm going to do: predict-no
  3012. ENV: Agent did: predict-no for direction U in state State-A
  3013. In State-A moving U
  3014. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3015. predict error 0
  3016. dir: dir isR
  3017. |\-415: O: O830 (predict-no)
  3018. I see 1 and I'm going to do: predict-no
  3019. ENV: Agent did: predict-no for direction R in state State-A
  3020. In State-A moving R
  3021. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3022. predict error 1
  3023. dir: dir isU
  3024. /|\416: O: O831 (predict-yes)
  3025. I see 0 and I'm going to do: predict-yes
  3026. ENV: Agent did: predict-yes for direction U in state State-B
  3027. In State-B moving U
  3028. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3029. predict error 1
  3030. dir: dir isU
  3031. -/417: O: O834 (predict-no)
  3032. I see 0 and I'm going to do: predict-no
  3033. ENV: Agent did: predict-no for direction U in state State-B
  3034. In State-B moving U
  3035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3036. predict error 0
  3037. dir: dir isR
  3038. |\-418: O: O836 (predict-no)
  3039. I see 1 and I'm going to do: predict-no
  3040. ENV: Agent did: predict-no for direction R in state State-B
  3041. In State-B moving R
  3042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3043. predict error 0
  3044. dir: dir isU
  3045. /|419: O: O838 (predict-no)
  3046. I see 1 and I'm going to do: predict-no
  3047. ENV: Agent did: predict-no for direction U in state State-B
  3048. In State-B moving U
  3049. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3050. predict error 0
  3051. dir: dir isU
  3052. \-420: O: O840 (predict-no)
  3053. I see 1 and I'm going to do: predict-no
  3054. ENV: Agent did: predict-no for direction U in state State-B
  3055. In State-B moving U
  3056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3057. predict error 0
  3058. dir: dir isU
  3059. /421: O: O841 (predict-yes)
  3060. I see 1 and I'm going to do: predict-yes
  3061. ENV: Agent did: predict-yes for direction U in state State-B
  3062. In State-B moving U
  3063. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3064. predict error 1
  3065. dir: dir isR
  3066. |422: O: O844 (predict-no)
  3067. I see 0 and I'm going to do: predict-no
  3068. ENV: Agent did: predict-no for direction R in state State-B
  3069. In State-B moving R
  3070. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3071. predict error 0
  3072. dir: dir isL
  3073. \-/423: O: O845 (predict-yes)
  3074. I see 1 and I'm going to do: predict-yes
  3075. ENV: Agent did: predict-yes for direction L in state State-B
  3076. In State-B moving L
  3077. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3078. predict error 0
  3079. dir: dir isL
  3080. |\-424: O: O848 (predict-no)
  3081. I see 1 and I'm going to do: predict-no
  3082. ENV: Agent did: predict-no for direction L in state State-A
  3083. In State-A moving L
  3084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3085. predict error 0
  3086. dir: dir isL
  3087. /|\425: O: O850 (predict-no)
  3088. I see 1 and I'm going to do: predict-no
  3089. ENV: Agent did: predict-no for direction L in state State-A
  3090. In State-A moving L
  3091. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3092. predict error 0
  3093. dir: dir isR
  3094. -/|426: O: O851 (predict-yes)
  3095. I see 1 and I'm going to do: predict-yes
  3096. ENV: Agent did: predict-yes for direction R in state State-A
  3097. In State-A moving R
  3098. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3099. predict error 0
  3100. dir: dir isU
  3101. \-/427: O: O854 (predict-no)
  3102. I see 1 and I'm going to do: predict-no
  3103. ENV: Agent did: predict-no for direction U in state State-B
  3104. In State-B moving U
  3105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3106. predict error 0
  3107. dir: dir isL
  3108. |\-428: O: O855 (predict-yes)
  3109. I see 1 and I'm going to do: predict-yes
  3110. ENV: Agent did: predict-yes for direction L in state State-B
  3111. In State-B moving L
  3112. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3113. predict error 0
  3114. dir: dir isU
  3115. /|\429: O: O858 (predict-no)
  3116. I see 1 and I'm going to do: predict-no
  3117. ENV: Agent did: predict-no for direction U in state State-A
  3118. In State-A moving U
  3119. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3120. predict error 0
  3121. dir: dir isU
  3122. -/|430: O: O860 (predict-no)
  3123. I see 1 and I'm going to do: predict-no
  3124. ENV: Agent did: predict-no for direction U in state State-A
  3125. In State-A moving U
  3126. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3127. predict error 0
  3128. dir: dir isR
  3129. \-/431: O: O861 (predict-yes)
  3130. I see 1 and I'm going to do: predict-yes
  3131. ENV: Agent did: predict-yes for direction R in state State-A
  3132. In State-A moving R
  3133. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3134. predict error 0
  3135. dir: dir isR
  3136. |432: O: O864 (predict-no)
  3137. I see 1 and I'm going to do: predict-no
  3138. ENV: Agent did: predict-no for direction R in state State-B
  3139. In State-B moving R
  3140. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3141. predict error 0
  3142. dir: dir isL
  3143. \-433: O: O865 (predict-yes)
  3144. I see 1 and I'm going to do: predict-yes
  3145. ENV: Agent did: predict-yes for direction L in state State-B
  3146. In State-B moving L
  3147. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3148. predict error 0
  3149. dir: dir isU
  3150. /|\434: O: O868 (predict-no)
  3151. I see 1 and I'm going to do: predict-no
  3152. ENV: Agent did: predict-no for direction U in state State-A
  3153. In State-A moving U
  3154. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3155. predict error 0
  3156. dir: dir isL
  3157. -435: O: O870 (predict-no)
  3158. I see 1 and I'm going to do: predict-no
  3159. ENV: Agent did: predict-no for direction L in state State-A
  3160. In State-A moving L
  3161. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3162. predict error 0
  3163. dir: dir isU
  3164. /|\436: O: O872 (predict-no)
  3165. I see 1 and I'm going to do: predict-no
  3166. ENV: Agent did: predict-no for direction U in state State-A
  3167. In State-A moving U
  3168. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3169. predict error 0
  3170. dir: dir isU
  3171. -/|437: O: O874 (predict-no)
  3172. I see 1 and I'm going to do: predict-no
  3173. ENV: Agent did: predict-no for direction U in state State-A
  3174. In State-A moving U
  3175. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3176. predict error 0
  3177. dir: dir isR
  3178. \-/438: O: O875 (predict-yes)
  3179. I see 1 and I'm going to do: predict-yes
  3180. ENV: Agent did: predict-yes for direction R in state State-A
  3181. In State-A moving R
  3182. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3183. predict error 0
  3184. dir: dir isL
  3185. |439: O: O877 (predict-yes)
  3186. I see 1 and I'm going to do: predict-yes
  3187. ENV: Agent did: predict-yes for direction L in state State-B
  3188. In State-B moving L
  3189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3190. predict error 0
  3191. dir: dir isU
  3192. \-440: O: O880 (predict-no)
  3193. I see 1 and I'm going to do: predict-no
  3194. ENV: Agent did: predict-no for direction U in state State-A
  3195. In State-A moving U
  3196. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3197. predict error 0
  3198. dir: dir isU
  3199. /|441: O: O882 (predict-no)
  3200. I see 1 and I'm going to do: predict-no
  3201. ENV: Agent did: predict-no for direction U in state State-A
  3202. In State-A moving U
  3203. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3204. predict error 0
  3205. dir: dir isL
  3206. \442: O: O884 (predict-no)
  3207. I see 1 and I'm going to do: predict-no
  3208. ENV: Agent did: predict-no for direction L in state State-A
  3209. In State-A moving L
  3210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3211. predict error 0
  3212. dir: dir isU
  3213. -/443: O: O886 (predict-no)
  3214. I see 1 and I'm going to do: predict-no
  3215. ENV: Agent did: predict-no for direction U in state State-A
  3216. In State-A moving U
  3217. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3218. predict error 0
  3219. dir: dir isU
  3220. |\444: O: O888 (predict-no)
  3221. I see 1 and I'm going to do: predict-no
  3222. ENV: Agent did: predict-no for direction U in state State-A
  3223. In State-A moving U
  3224. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3225. predict error 0
  3226. dir: dir isR
  3227. -/|445: O: O890 (predict-no)
  3228. I see 1 and I'm going to do: predict-no
  3229. ENV: Agent did: predict-no for direction R in state State-A
  3230. In State-A moving R
  3231. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3232. predict error 1
  3233. dir: dir isU
  3234. \-/446: O: O892 (predict-no)
  3235. I see 0 and I'm going to do: predict-no
  3236. ENV: Agent did: predict-no for direction U in state State-B
  3237. In State-B moving U
  3238. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3239. predict error 0
  3240. dir: dir isR
  3241. |\-447: O: O894 (predict-no)
  3242. I see 1 and I'm going to do: predict-no
  3243. ENV: Agent did: predict-no for direction R in state State-B
  3244. In State-B moving R
  3245. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3246. predict error 0
  3247. dir: dir isU
  3248. /|448: O: O895 (predict-yes)
  3249. I see 1 and I'm going to do: predict-yes
  3250. ENV: Agent did: predict-yes for direction U in state State-B
  3251. In State-B moving U
  3252. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3253. predict error 1
  3254. dir: dir isU
  3255. \-449: O: O898 (predict-no)
  3256. I see 0 and I'm going to do: predict-no
  3257. ENV: Agent did: predict-no for direction U in state State-B
  3258. In State-B moving U
  3259. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3260. predict error 0
  3261. dir: dir isR
  3262. /|450: O: O900 (predict-no)
  3263. I see 1 and I'm going to do: predict-no
  3264. ENV: Agent did: predict-no for direction R in state State-B
  3265. In State-B moving R
  3266. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3267. predict error 0
  3268. dir: dir isU
  3269. \-/|451: O: O902 (predict-no)
  3270. I see 1 and I'm going to do: predict-no
  3271. ENV: Agent did: predict-no for direction U in state State-B
  3272. In State-B moving U
  3273. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3274. predict error 0
  3275. dir: dir isR
  3276. \452: O: O904 (predict-no)
  3277. I see 1 and I'm going to do: predict-no
  3278. ENV: Agent did: predict-no for direction R in state State-B
  3279. In State-B moving R
  3280. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3281. predict error 0
  3282. dir: dir isL
  3283. -/|453: O: O905 (predict-yes)
  3284. I see 1 and I'm going to do: predict-yes
  3285. ENV: Agent did: predict-yes for direction L in state State-B
  3286. In State-B moving L
  3287. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3288. predict error 0
  3289. dir: dir isL
  3290. \-/454: O: O908 (predict-no)
  3291. I see 1 and I'm going to do: predict-no
  3292. ENV: Agent did: predict-no for direction L in state State-A
  3293. In State-A moving L
  3294. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3295. predict error 0
  3296. dir: dir isL
  3297. |\-455: O: O909 (predict-yes)
  3298. I see 1 and I'm going to do: predict-yes
  3299. ENV: Agent did: predict-yes for direction L in state State-A
  3300. In State-A moving L
  3301. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3302. predict error 1
  3303. dir: dir isU
  3304. /|456: O: O912 (predict-no)
  3305. I see 0 and I'm going to do: predict-no
  3306. ENV: Agent did: predict-no for direction U in state State-A
  3307. In State-A moving U
  3308. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3309. predict error 0
  3310. dir: dir isU
  3311. \-457: O: O914 (predict-no)
  3312. I see 1 and I'm going to do: predict-no
  3313. ENV: Agent did: predict-no for direction U in state State-A
  3314. In State-A moving U
  3315. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3316. predict error 0
  3317. dir: dir isL
  3318. /|\458: O: O916 (predict-no)
  3319. I see 1 and I'm going to do: predict-no
  3320. ENV: Agent did: predict-no for direction L in state State-A
  3321. In State-A moving L
  3322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3323. predict error 0
  3324. dir: dir isR
  3325. -/|459: O: O917 (predict-yes)
  3326. I see 1 and I'm going to do: predict-yes
  3327. ENV: Agent did: predict-yes for direction R in state State-A
  3328. In State-A moving R
  3329. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3330. predict error 0
  3331. dir: dir isR
  3332. \-/460: O: O920 (predict-no)
  3333. I see 1 and I'm going to do: predict-no
  3334. ENV: Agent did: predict-no for direction R in state State-B
  3335. In State-B moving R
  3336. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3337. predict error 0
  3338. dir: dir isL
  3339. |\-461: O: O921 (predict-yes)
  3340. I see 1 and I'm going to do: predict-yes
  3341. ENV: Agent did: predict-yes for direction L in state State-B
  3342. In State-B moving L
  3343. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3344. predict error 0
  3345. dir: dir isL
  3346. /462: O: O924 (predict-no)
  3347. I see 1 and I'm going to do: predict-no
  3348. ENV: Agent did: predict-no for direction L in state State-A
  3349. In State-A moving L
  3350. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3351. predict error 0
  3352. dir: dir isL
  3353. |\-463: O: O926 (predict-no)
  3354. I see 1 and I'm going to do: predict-no
  3355. ENV: Agent did: predict-no for direction L in state State-A
  3356. In State-A moving L
  3357. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3358. predict error 0
  3359. dir: dir isU
  3360. /|\464: O: O928 (predict-no)
  3361. I see 1 and I'm going to do: predict-no
  3362. ENV: Agent did: predict-no for direction U in state State-A
  3363. In State-A moving U
  3364. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3365. predict error 0
  3366. dir: dir isL
  3367. -/|465: O: O930 (predict-no)
  3368. I see 1 and I'm going to do: predict-no
  3369. ENV: Agent did: predict-no for direction L in state State-A
  3370. In State-A moving L
  3371. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3372. predict error 0
  3373. dir: dir isL
  3374. \-/466: O: O932 (predict-no)
  3375. I see 1 and I'm going to do: predict-no
  3376. ENV: Agent did: predict-no for direction L in state State-A
  3377. In State-A moving L
  3378. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3379. predict error 0
  3380. dir: dir isR
  3381. |\-467: O: O933 (predict-yes)
  3382. I see 1 and I'm going to do: predict-yes
  3383. ENV: Agent did: predict-yes for direction R in state State-A
  3384. In State-A moving R
  3385. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3386. predict error 0
  3387. dir: dir isL
  3388. /|468: O: O935 (predict-yes)
  3389. I see 1 and I'm going to do: predict-yes
  3390. ENV: Agent did: predict-yes for direction L in state State-B
  3391. In State-B moving L
  3392. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3393. predict error 0
  3394. dir: dir isR
  3395. \469: O: O938 (predict-no)
  3396. I see 1 and I'm going to do: predict-no
  3397. ENV: Agent did: predict-no for direction R in state State-A
  3398. In State-A moving R
  3399. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3400. predict error 1
  3401. dir: dir isR
  3402. -/470: O: O940 (predict-no)
  3403. I see 0 and I'm going to do: predict-no
  3404. ENV: Agent did: predict-no for direction R in state State-B
  3405. In State-B moving R
  3406. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3407. predict error 0
  3408. dir: dir isU
  3409. |\-471: O: O942 (predict-no)
  3410. I see 1 and I'm going to do: predict-no
  3411. ENV: Agent did: predict-no for direction U in state State-B
  3412. In State-B moving U
  3413. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3414. predict error 0
  3415. dir: dir isL
  3416. /472: O: O943 (predict-yes)
  3417. I see 1 and I'm going to do: predict-yes
  3418. ENV: Agent did: predict-yes for direction L in state State-B
  3419. In State-B moving L
  3420. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3421. predict error 0
  3422. dir: dir isL
  3423. |\473: O: O945 (predict-yes)
  3424. I see 1 and I'm going to do: predict-yes
  3425. ENV: Agent did: predict-yes for direction L in state State-A
  3426. In State-A moving L
  3427. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3428. predict error 1
  3429. dir: dir isR
  3430. -/|474: O: O947 (predict-yes)
  3431. I see 0 and I'm going to do: predict-yes
  3432. ENV: Agent did: predict-yes for direction R in state State-A
  3433. In State-A moving R
  3434. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3435. predict error 0
  3436. dir: dir isL
  3437. \-/475: O: O949 (predict-yes)
  3438. I see 1 and I'm going to do: predict-yes
  3439. ENV: Agent did: predict-yes for direction L in state State-B
  3440. In State-B moving L
  3441. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3442. predict error 0
  3443. dir: dir isR
  3444. |\-476: O: O952 (predict-no)
  3445. I see 1 and I'm going to do: predict-no
  3446. ENV: Agent did: predict-no for direction R in state State-A
  3447. In State-A moving R
  3448. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3449. predict error 1
  3450. dir: dir isL
  3451. /|\477: O: O953 (predict-yes)
  3452. I see 0 and I'm going to do: predict-yes
  3453. ENV: Agent did: predict-yes for direction L in state State-B
  3454. In State-B moving L
  3455. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3456. predict error 0
  3457. dir: dir isU
  3458. -/|478: O: O956 (predict-no)
  3459. I see 1 and I'm going to do: predict-no
  3460. ENV: Agent did: predict-no for direction U in state State-A
  3461. In State-A moving U
  3462. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3463. predict error 0
  3464. dir: dir isU
  3465. \-/479: O: O958 (predict-no)
  3466. I see 1 and I'm going to do: predict-no
  3467. ENV: Agent did: predict-no for direction U in state State-A
  3468. In State-A moving U
  3469. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3470. predict error 0
  3471. dir: dir isU
  3472. |\480: O: O960 (predict-no)
  3473. I see 1 and I'm going to do: predict-no
  3474. ENV: Agent did: predict-no for direction U in state State-A
  3475. In State-A moving U
  3476. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3477. predict error 0
  3478. dir: dir isU
  3479. -/|481: O: O962 (predict-no)
  3480. I see 1 and I'm going to do: predict-no
  3481. ENV: Agent did: predict-no for direction U in state State-A
  3482. In State-A moving U
  3483. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3484. predict error 0
  3485. dir: dir isR
  3486. \482: O: O963 (predict-yes)
  3487. I see 1 and I'm going to do: predict-yes
  3488. ENV: Agent did: predict-yes for direction R in state State-A
  3489. In State-A moving R
  3490. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3491. predict error 0
  3492. dir: dir isR
  3493. -/|483: O: O966 (predict-no)
  3494. I see 1 and I'm going to do: predict-no
  3495. ENV: Agent did: predict-no for direction R in state State-B
  3496. In State-B moving R
  3497. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3498. predict error 0
  3499. dir: dir isU
  3500. \-/484: O: O968 (predict-no)
  3501. I see 1 and I'm going to do: predict-no
  3502. ENV: Agent did: predict-no for direction U in state State-B
  3503. In State-B moving U
  3504. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3505. predict error 0
  3506. dir: dir isU
  3507. |\-485: O: O970 (predict-no)
  3508. I see 1 and I'm going to do: predict-no
  3509. ENV: Agent did: predict-no for direction U in state State-B
  3510. In State-B moving U
  3511. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3512. predict error 0
  3513. dir: dir isR
  3514. /|\486: O: O972 (predict-no)
  3515. I see 1 and I'm going to do: predict-no
  3516. ENV: Agent did: predict-no for direction R in state State-B
  3517. In State-B moving R
  3518. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3519. predict error 0
  3520. dir: dir isR
  3521. -/|\sleeping...
  3522. -487: O: O974 (predict-no)
  3523. I see 1 and I'm going to do: predict-no
  3524. ENV: Agent did: predict-no for direction R in state State-B
  3525. In State-B moving R
  3526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3527. predict error 0
  3528. dir: dir isL
  3529. /|488: O: O975 (predict-yes)
  3530. I see 1 and I'm going to do: predict-yes
  3531. ENV: Agent did: predict-yes for direction L in state State-B
  3532. In State-B moving L
  3533. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3534. predict error 0
  3535. dir: dir isL
  3536. \-489: O: O978 (predict-no)
  3537. I see 1 and I'm going to do: predict-no
  3538. ENV: Agent did: predict-no for direction L in state State-A
  3539. In State-A moving L
  3540. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3541. predict error 0
  3542. dir: dir isU
  3543. /|490: O: O980 (predict-no)
  3544. I see 1 and I'm going to do: predict-no
  3545. ENV: Agent did: predict-no for direction U in state State-A
  3546. In State-A moving U
  3547. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3548. predict error 0
  3549. dir: dir isL
  3550. \-/491: O: O982 (predict-no)
  3551. I see 1 and I'm going to do: predict-no
  3552. ENV: Agent did: predict-no for direction L in state State-A
  3553. In State-A moving L
  3554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3555. predict error 0
  3556. dir: dir isU
  3557. |492: O: O984 (predict-no)
  3558. I see 1 and I'm going to do: predict-no
  3559. ENV: Agent did: predict-no for direction U in state State-A
  3560. In State-A moving U
  3561. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3562. predict error 0
  3563. dir: dir isR
  3564. \-/493: O: O985 (predict-yes)
  3565. I see 1 and I'm going to do: predict-yes
  3566. ENV: Agent did: predict-yes for direction R in state State-A
  3567. In State-A moving R
  3568. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3569. predict error 0
  3570. dir: dir isU
  3571. |\494: O: O988 (predict-no)
  3572. I see 1 and I'm going to do: predict-no
  3573. ENV: Agent did: predict-no for direction U in state State-B
  3574. In State-B moving U
  3575. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3576. predict error 0
  3577. dir: dir isU
  3578. -/|495: O: O990 (predict-no)
  3579. I see 1 and I'm going to do: predict-no
  3580. ENV: Agent did: predict-no for direction U in state State-B
  3581. In State-B moving U
  3582. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3583. predict error 0
  3584. dir: dir isU
  3585. \-/496: O: O992 (predict-no)
  3586. I see 1 and I'm going to do: predict-no
  3587. ENV: Agent did: predict-no for direction U in state State-B
  3588. In State-B moving U
  3589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3590. predict error 0
  3591. dir: dir isL
  3592. |\-497: O: O993 (predict-yes)
  3593. I see 1 and I'm going to do: predict-yes
  3594. ENV: Agent did: predict-yes for direction L in state State-B
  3595. In State-B moving L
  3596. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3597. predict error 0
  3598. dir: dir isR
  3599. /|498: O: O995 (predict-yes)
  3600. I see 1 and I'm going to do: predict-yes
  3601. ENV: Agent did: predict-yes for direction R in state State-A
  3602. In State-A moving R
  3603. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3604. predict error 0
  3605. dir: dir isR
  3606. \-/499: O: O998 (predict-no)
  3607. I see 1 and I'm going to do: predict-no
  3608. ENV: Agent did: predict-no for direction R in state State-B
  3609. In State-B moving R
  3610. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3611. predict error 0
  3612. dir: dir isL
  3613. |\-500: O: O999 (predict-yes)
  3614. I see 1 and I'm going to do: predict-yes
  3615. ENV: Agent did: predict-yes for direction L in state State-B
  3616. In State-B moving L
  3617. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3618. predict error 0
  3619. dir: dir isR
  3620. /|\-/501: O: O1001 (predict-yes)
  3621. I see 1 and I'm going to do: predict-yes
  3622. ENV: Agent did: predict-yes for direction R in state State-A
  3623. In State-A moving R
  3624. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3625. predict error 0
  3626. dir: dir isR
  3627. |502: O: O1004 (predict-no)
  3628. I see 1 and I'm going to do: predict-no
  3629. ENV: Agent did: predict-no for direction R in state State-B
  3630. In State-B moving R
  3631. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3632. predict error 0
  3633. dir: dir isR
  3634. \-/503: O: O1006 (predict-no)
  3635. I see 1 and I'm going to do: predict-no
  3636. ENV: Agent did: predict-no for direction R in state State-B
  3637. In State-B moving R
  3638. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3639. predict error 0
  3640. dir: dir isL
  3641. |\504: O: O1007 (predict-yes)
  3642. I see 1 and I'm going to do: predict-yes
  3643. ENV: Agent did: predict-yes for direction L in state State-B
  3644. In State-B moving L
  3645. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3646. predict error 0
  3647. dir: dir isR
  3648. -505: O: O1009 (predict-yes)
  3649. I see 1 and I'm going to do: predict-yes
  3650. ENV: Agent did: predict-yes for direction R in state State-A
  3651. In State-A moving R
  3652. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3653. predict error 0
  3654. dir: dir isR
  3655. /|\506: O: O1012 (predict-no)
  3656. I see 1 and I'm going to do: predict-no
  3657. ENV: Agent did: predict-no for direction R in state State-B
  3658. In State-B moving R
  3659. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3660. predict error 0
  3661. dir: dir isL
  3662. -/507: O: O1013 (predict-yes)
  3663. I see 1 and I'm going to do: predict-yes
  3664. ENV: Agent did: predict-yes for direction L in state State-B
  3665. In State-B moving L
  3666. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3667. predict error 0
  3668. dir: dir isR
  3669. |\508: O: O1015 (predict-yes)
  3670. I see 1 and I'm going to do: predict-yes
  3671. ENV: Agent did: predict-yes for direction R in state State-A
  3672. In State-A moving R
  3673. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3674. predict error 0
  3675. dir: dir isU
  3676. -/|509: O: O1018 (predict-no)
  3677. I see 1 and I'm going to do: predict-no
  3678. ENV: Agent did: predict-no for direction U in state State-B
  3679. In State-B moving U
  3680. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3681. predict error 0
  3682. dir: dir isU
  3683. \-/510: O: O1020 (predict-no)
  3684. I see 1 and I'm going to do: predict-no
  3685. ENV: Agent did: predict-no for direction U in state State-B
  3686. In State-B moving U
  3687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3688. predict error 0
  3689. dir: dir isR
  3690. |\-511: O: O1022 (predict-no)
  3691. I see 1 and I'm going to do: predict-no
  3692. ENV: Agent did: predict-no for direction R in state State-B
  3693. In State-B moving R
  3694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3695. predict error 0
  3696. dir: dir isR
  3697. /512: O: O1023 (predict-yes)
  3698. I see 1 and I'm going to do: predict-yes
  3699. ENV: Agent did: predict-yes for direction R in state State-B
  3700. In State-B moving R
  3701. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3702. predict error 1
  3703. dir: dir isR
  3704. |\513: O: O1026 (predict-no)
  3705. I see 0 and I'm going to do: predict-no
  3706. ENV: Agent did: predict-no for direction R in state State-B
  3707. In State-B moving R
  3708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3709. predict error 0
  3710. dir: dir isL
  3711. -514: O: O1027 (predict-yes)
  3712. I see 1 and I'm going to do: predict-yes
  3713. ENV: Agent did: predict-yes for direction L in state State-B
  3714. In State-B moving L
  3715. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3716. predict error 0
  3717. dir: dir isL
  3718. /|\515: O: O1030 (predict-no)
  3719. I see 1 and I'm going to do: predict-no
  3720. ENV: Agent did: predict-no for direction L in state State-A
  3721. In State-A moving L
  3722. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3723. predict error 0
  3724. dir: dir isL
  3725. -/|516: O: O1032 (predict-no)
  3726. I see 1 and I'm going to do: predict-no
  3727. ENV: Agent did: predict-no for direction L in state State-A
  3728. In State-A moving L
  3729. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3730. predict error 0
  3731. dir: dir isR
  3732. \-517: O: O1034 (predict-no)
  3733. I see 1 and I'm going to do: predict-no
  3734. ENV: Agent did: predict-no for direction R in state State-A
  3735. In State-A moving R
  3736. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3737. predict error 1
  3738. dir: dir isU
  3739. /|\518: O: O1036 (predict-no)
  3740. I see 0 and I'm going to do: predict-no
  3741. ENV: Agent did: predict-no for direction U in state State-B
  3742. In State-B moving U
  3743. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3744. predict error 0
  3745. dir: dir isU
  3746. -/519: O: O1038 (predict-no)
  3747. I see 1 and I'm going to do: predict-no
  3748. ENV: Agent did: predict-no for direction U in state State-B
  3749. In State-B moving U
  3750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3751. predict error 0
  3752. dir: dir isR
  3753. |\-520: O: O1040 (predict-no)
  3754. I see 1 and I'm going to do: predict-no
  3755. ENV: Agent did: predict-no for direction R in state State-B
  3756. In State-B moving R
  3757. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3758. predict error 0
  3759. dir: dir isU
  3760. /|\521: O: O1042 (predict-no)
  3761. I see 1 and I'm going to do: predict-no
  3762. ENV: Agent did: predict-no for direction U in state State-B
  3763. In State-B moving U
  3764. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3765. predict error 0
  3766. dir: dir isR
  3767. -522: O: O1044 (predict-no)
  3768. I see 1 and I'm going to do: predict-no
  3769. ENV: Agent did: predict-no for direction R in state State-B
  3770. In State-B moving R
  3771. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3772. predict error 0
  3773. dir: dir isU
  3774. /|\523: O: O1046 (predict-no)
  3775. I see 1 and I'm going to do: predict-no
  3776. ENV: Agent did: predict-no for direction U in state State-B
  3777. In State-B moving U
  3778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3779. predict error 0
  3780. dir: dir isR
  3781. -/|524: O: O1048 (predict-no)
  3782. I see 1 and I'm going to do: predict-no
  3783. ENV: Agent did: predict-no for direction R in state State-B
  3784. In State-B moving R
  3785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3786. predict error 0
  3787. dir: dir isU
  3788. \-/525: O: O1050 (predict-no)
  3789. I see 1 and I'm going to do: predict-no
  3790. ENV: Agent did: predict-no for direction U in state State-B
  3791. In State-B moving U
  3792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3793. predict error 0
  3794. dir: dir isU
  3795. |\-526: O: O1052 (predict-no)
  3796. I see 1 and I'm going to do: predict-no
  3797. ENV: Agent did: predict-no for direction U in state State-B
  3798. In State-B moving U
  3799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3800. predict error 0
  3801. dir: dir isL
  3802. /|\527: O: O1053 (predict-yes)
  3803. I see 1 and I'm going to do: predict-yes
  3804. ENV: Agent did: predict-yes for direction L in state State-B
  3805. In State-B moving L
  3806. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3807. predict error 0
  3808. dir: dir isL
  3809. -/528: O: O1056 (predict-no)
  3810. I see 1 and I'm going to do: predict-no
  3811. ENV: Agent did: predict-no for direction L in state State-A
  3812. In State-A moving L
  3813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3814. predict error 0
  3815. dir: dir isR
  3816. |\-529: O: O1057 (predict-yes)
  3817. I see 1 and I'm going to do: predict-yes
  3818. ENV: Agent did: predict-yes for direction R in state State-A
  3819. In State-A moving R
  3820. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3821. predict error 0
  3822. dir: dir isR
  3823. /|\530: O: O1060 (predict-no)
  3824. I see 1 and I'm going to do: predict-no
  3825. ENV: Agent did: predict-no for direction R in state State-B
  3826. In State-B moving R
  3827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3828. predict error 0
  3829. dir: dir isR
  3830. -/|531: O: O1062 (predict-no)
  3831. I see 1 and I'm going to do: predict-no
  3832. ENV: Agent did: predict-no for direction R in state State-B
  3833. In State-B moving R
  3834. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3835. predict error 0
  3836. dir: dir isL
  3837. \532: O: O1063 (predict-yes)
  3838. I see 1 and I'm going to do: predict-yes
  3839. ENV: Agent did: predict-yes for direction L in state State-B
  3840. In State-B moving L
  3841. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3842. predict error 0
  3843. dir: dir isR
  3844. -/533: O: O1065 (predict-yes)
  3845. I see 1 and I'm going to do: predict-yes
  3846. ENV: Agent did: predict-yes for direction R in state State-A
  3847. In State-A moving R
  3848. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3849. predict error 0
  3850. dir: dir isR
  3851. |\-534: O: O1068 (predict-no)
  3852. I see 1 and I'm going to do: predict-no
  3853. ENV: Agent did: predict-no for direction R in state State-B
  3854. In State-B moving R
  3855. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3856. predict error 0
  3857. dir: dir isR
  3858. /|\535: O: O1070 (predict-no)
  3859. I see 1 and I'm going to do: predict-no
  3860. ENV: Agent did: predict-no for direction R in state State-B
  3861. In State-B moving R
  3862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3863. predict error 0
  3864. dir: dir isU
  3865. -/|536: O: O1072 (predict-no)
  3866. I see 1 and I'm going to do: predict-no
  3867. ENV: Agent did: predict-no for direction U in state State-B
  3868. In State-B moving U
  3869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3870. predict error 0
  3871. dir: dir isR
  3872. \-537: O: O1074 (predict-no)
  3873. I see 1 and I'm going to do: predict-no
  3874. ENV: Agent did: predict-no for direction R in state State-B
  3875. In State-B moving R
  3876. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3877. predict error 0
  3878. dir: dir isU
  3879. /|\538: O: O1076 (predict-no)
  3880. I see 1 and I'm going to do: predict-no
  3881. ENV: Agent did: predict-no for direction U in state State-B
  3882. In State-B moving U
  3883. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3884. predict error 0
  3885. dir: dir isU
  3886. -/|\539: O: O1078 (predict-no)
  3887. I see 1 and I'm going to do: predict-no
  3888. ENV: Agent did: predict-no for direction U in state State-B
  3889. In State-B moving U
  3890. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3891. predict error 0
  3892. dir: dir isU
  3893. -/|540: O: O1080 (predict-no)
  3894. I see 1 and I'm going to do: predict-no
  3895. ENV: Agent did: predict-no for direction U in state State-B
  3896. In State-B moving U
  3897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3898. predict error 0
  3899. dir: dir isR
  3900. \-541: O: O1082 (predict-no)
  3901. I see 1 and I'm going to do: predict-no
  3902. ENV: Agent did: predict-no for direction R in state State-B
  3903. In State-B moving R
  3904. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3905. predict error 0
  3906. dir: dir isU
  3907. /542: O: O1083 (predict-yes)
  3908. I see 1 and I'm going to do: predict-yes
  3909. ENV: Agent did: predict-yes for direction U in state State-B
  3910. In State-B moving U
  3911. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3912. predict error 1
  3913. dir: dir isR
  3914. |\-/543: O: O1086 (predict-no)
  3915. I see 0 and I'm going to do: predict-no
  3916. ENV: Agent did: predict-no for direction R in state State-B
  3917. In State-B moving R
  3918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3919. predict error 0
  3920. dir: dir isR
  3921. |\-544: O: O1088 (predict-no)
  3922. I see 1 and I'm going to do: predict-no
  3923. ENV: Agent did: predict-no for direction R in state State-B
  3924. In State-B moving R
  3925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3926. predict error 0
  3927. dir: dir isR
  3928. /|545: O: O1090 (predict-no)
  3929. I see 1 and I'm going to do: predict-no
  3930. ENV: Agent did: predict-no for direction R in state State-B
  3931. In State-B moving R
  3932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3933. predict error 0
  3934. dir: dir isR
  3935. \-/546: O: O1092 (predict-no)
  3936. I see 1 and I'm going to do: predict-no
  3937. ENV: Agent did: predict-no for direction R in state State-B
  3938. In State-B moving R
  3939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3940. predict error 0
  3941. dir: dir isR
  3942. |\547: O: O1094 (predict-no)
  3943. I see 1 and I'm going to do: predict-no
  3944. ENV: Agent did: predict-no for direction R in state State-B
  3945. In State-B moving R
  3946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3947. predict error 0
  3948. dir: dir isR
  3949. -/|548: O: O1096 (predict-no)
  3950. I see 1 and I'm going to do: predict-no
  3951. ENV: Agent did: predict-no for direction R in state State-B
  3952. In State-B moving R
  3953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3954. predict error 0
  3955. dir: dir isU
  3956. \-/549: O: O1098 (predict-no)
  3957. I see 1 and I'm going to do: predict-no
  3958. ENV: Agent did: predict-no for direction U in state State-B
  3959. In State-B moving U
  3960. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3961. predict error 0
  3962. dir: dir isU
  3963. |\550: O: O1099 (predict-yes)
  3964. I see 1 and I'm going to do: predict-yes
  3965. ENV: Agent did: predict-yes for direction U in state State-B
  3966. In State-B moving U
  3967. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3968. predict error 1
  3969. dir: dir isU
  3970. -/|551: O: O1102 (predict-no)
  3971. I see 0 and I'm going to do: predict-no
  3972. ENV: Agent did: predict-no for direction U in state State-B
  3973. In State-B moving U
  3974. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3975. predict error 0
  3976. dir: dir isU
  3977. \552: O: O1104 (predict-no)
  3978. I see 1 and I'm going to do: predict-no
  3979. ENV: Agent did: predict-no for direction U in state State-B
  3980. In State-B moving U
  3981. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3982. predict error 0
  3983. dir: dir isU
  3984. -/|553: O: O1105 (predict-yes)
  3985. I see 1 and I'm going to do: predict-yes
  3986. ENV: Agent did: predict-yes for direction U in state State-B
  3987. In State-B moving U
  3988. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3989. predict error 1
  3990. dir: dir isR
  3991. \-/554: O: O1108 (predict-no)
  3992. I see 0 and I'm going to do: predict-no
  3993. ENV: Agent did: predict-no for direction R in state State-B
  3994. In State-B moving R
  3995. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3996. predict error 0
  3997. dir: dir isR
  3998. |\-555: O: O1110 (predict-no)
  3999. I see 1 and I'm going to do: predict-no
  4000. ENV: Agent did: predict-no for direction R in state State-B
  4001. In State-B moving R
  4002. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4003. predict error 0
  4004. dir: dir isL
  4005. /|\556: O: O1111 (predict-yes)
  4006. I see 1 and I'm going to do: predict-yes
  4007. ENV: Agent did: predict-yes for direction L in state State-B
  4008. In State-B moving L
  4009. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4010. predict error 0
  4011. dir: dir isU
  4012. -/557: O: O1114 (predict-no)
  4013. I see 1 and I'm going to do: predict-no
  4014. ENV: Agent did: predict-no for direction U in state State-A
  4015. In State-A moving U
  4016. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4017. predict error 0
  4018. dir: dir isU
  4019. |\-558: O: O1116 (predict-no)
  4020. I see 1 and I'm going to do: predict-no
  4021. ENV: Agent did: predict-no for direction U in state State-A
  4022. In State-A moving U
  4023. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4024. predict error 0
  4025. dir: dir isR
  4026. /|\559: O: O1117 (predict-yes)
  4027. I see 1 and I'm going to do: predict-yes
  4028. ENV: Agent did: predict-yes for direction R in state State-A
  4029. In State-A moving R
  4030. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4031. predict error 0
  4032. dir: dir isL
  4033. -/|560: O: O1119 (predict-yes)
  4034. I see 1 and I'm going to do: predict-yes
  4035. ENV: Agent did: predict-yes for direction L in state State-B
  4036. In State-B moving L
  4037. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4038. predict error 0
  4039. dir: dir isU
  4040. \-/561: O: O1122 (predict-no)
  4041. I see 1 and I'm going to do: predict-no
  4042. ENV: Agent did: predict-no for direction U in state State-A
  4043. In State-A moving U
  4044. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4045. predict error 0
  4046. dir: dir isR
  4047. |562: O: O1124 (predict-no)
  4048. I see 1 and I'm going to do: predict-no
  4049. ENV: Agent did: predict-no for direction R in state State-A
  4050. In State-A moving R
  4051. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  4052. predict error 1
  4053. dir: dir isR
  4054. \-/563: O: O1126 (predict-no)
  4055. I see 0 and I'm going to do: predict-no
  4056. ENV: Agent did: predict-no for direction R in state State-B
  4057. In State-B moving R
  4058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4059. predict error 0
  4060. dir: dir isL
  4061. |\-564: O: O1127 (predict-yes)
  4062. I see 1 and I'm going to do: predict-yes
  4063. ENV: Agent did: predict-yes for direction L in state State-B
  4064. In State-B moving L
  4065. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4066. predict error 0
  4067. dir: dir isR
  4068. /|\565: O: O1129 (predict-yes)
  4069. I see 1 and I'm going to do: predict-yes
  4070. ENV: Agent did: predict-yes for direction R in state State-A
  4071. In State-A moving R
  4072. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4073. predict error 0
  4074. dir: dir isU
  4075. -/566: O: O1132 (predict-no)
  4076. I see 1 and I'm going to do: predict-no
  4077. ENV: Agent did: predict-no for direction U in state State-B
  4078. In State-B moving U
  4079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4080. predict error 0
  4081. dir: dir isR
  4082. |\-567: O: O1134 (predict-no)
  4083. I see 1 and I'm going to do: predict-no
  4084. ENV: Agent did: predict-no for direction R in state State-B
  4085. In State-B moving R
  4086. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4087. predict error 0
  4088. dir: dir isR
  4089. /|\568: O: O1136 (predict-no)
  4090. I see 1 and I'm going to do: predict-no
  4091. ENV: Agent did: predict-no for direction R in state State-B
  4092. In State-B moving R
  4093. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4094. predict error 0
  4095. dir: dir isR
  4096. -569: O: O1138 (predict-no)
  4097. I see 1 and I'm going to do: predict-no
  4098. ENV: Agent did: predict-no for direction R in state State-B
  4099. In State-B moving R
  4100. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4101. predict error 0
  4102. dir: dir isL
  4103. /|\570: O: O1139 (predict-yes)
  4104. I see 1 and I'm going to do: predict-yes
  4105. ENV: Agent did: predict-yes for direction L in state State-B
  4106. In State-B moving L
  4107. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4108. predict error 0
  4109. dir: dir isR
  4110. -/571: O: O1141 (predict-yes)
  4111. I see 1 and I'm going to do: predict-yes
  4112. ENV: Agent did: predict-yes for direction R in state State-A
  4113. In State-A moving R
  4114. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4115. predict error 0
  4116. dir: dir isU
  4117. |572: O: O1144 (predict-no)
  4118. I see 1 and I'm going to do: predict-no
  4119. ENV: Agent did: predict-no for direction U in state State-B
  4120. In State-B moving U
  4121. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4122. predict error 0
  4123. dir: dir isU
  4124. \-/573: O: O1146 (predict-no)
  4125. I see 1 and I'm going to do: predict-no
  4126. ENV: Agent did: predict-no for direction U in state State-B
  4127. In State-B moving U
  4128. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4129. predict error 0
  4130. dir: dir isR
  4131. |\-574: O: O1148 (predict-no)
  4132. I see 1 and I'm going to do: predict-no
  4133. ENV: Agent did: predict-no for direction R in state State-B
  4134. In State-B moving R
  4135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4136. predict error 0
  4137. dir: dir isU
  4138. /|\575: O: O1150 (predict-no)
  4139. I see 1 and I'm going to do: predict-no
  4140. ENV: Agent did: predict-no for direction U in state State-B
  4141. In State-B moving U
  4142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4143. predict error 0
  4144. dir: dir isR
  4145. -/|576: O: O1152 (predict-no)
  4146. I see 1 and I'm going to do: predict-no
  4147. ENV: Agent did: predict-no for direction R in state State-B
  4148. In State-B moving R
  4149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4150. predict error 0
  4151. dir: dir isL
  4152. \-/577: O: O1153 (predict-yes)
  4153. I see 1 and I'm going to do: predict-yes
  4154. ENV: Agent did: predict-yes for direction L in state State-B
  4155. In State-B moving L
  4156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4157. predict error 0
  4158. dir: dir isL
  4159. |\-578: O: O1156 (predict-no)
  4160. I see 1 and I'm going to do: predict-no
  4161. ENV: Agent did: predict-no for direction L in state State-A
  4162. In State-A moving L
  4163. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4164. predict error 0
  4165. dir: dir isU
  4166. /|\579: O: O1158 (predict-no)
  4167. I see 1 and I'm going to do: predict-no
  4168. ENV: Agent did: predict-no for direction U in state State-A
  4169. In State-A moving U
  4170. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4171. predict error 0
  4172. dir: dir isL
  4173. -/|580: O: O1160 (predict-no)
  4174. I see 1 and I'm going to do: predict-no
  4175. ENV: Agent did: predict-no for direction L in state State-A
  4176. In State-A moving L
  4177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4178. predict error 0
  4179. dir: dir isL
  4180. \-/|581: O: O1162 (predict-no)
  4181. I see 1 and I'm going to do: predict-no
  4182. ENV: Agent did: predict-no for direction L in state State-A
  4183. In State-A moving L
  4184. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4185. predict error 0
  4186. dir: dir isU
  4187. \582: O: O1164 (predict-no)
  4188. I see 1 and I'm going to do: predict-no
  4189. ENV: Agent did: predict-no for direction U in state State-A
  4190. In State-A moving U
  4191. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4192. predict error 0
  4193. dir: dir isR
  4194. -/583: O: O1165 (predict-yes)
  4195. I see 1 and I'm going to do: predict-yes
  4196. ENV: Agent did: predict-yes for direction R in state State-A
  4197. In State-A moving R
  4198. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4199. predict error 0
  4200. dir: dir isR
  4201. |\-584: O: O1168 (predict-no)
  4202. I see 1 and I'm going to do: predict-no
  4203. ENV: Agent did: predict-no for direction R in state State-B
  4204. In State-B moving R
  4205. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4206. predict error 0
  4207. dir: dir isR
  4208. /|585: O: O1170 (predict-no)
  4209. I see 1 and I'm going to do: predict-no
  4210. ENV: Agent did: predict-no for direction R in state State-B
  4211. In State-B moving R
  4212. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4213. predict error 0
  4214. dir: dir isU
  4215. \-586: O: O1172 (predict-no)
  4216. I see 1 and I'm going to do: predict-no
  4217. ENV: Agent did: predict-no for direction U in state State-B
  4218. In State-B moving U
  4219. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4220. predict error 0
  4221. dir: dir isL
  4222. /587: O: O1173 (predict-yes)
  4223. I see 1 and I'm going to do: predict-yes
  4224. ENV: Agent did: predict-yes for direction L in state State-B
  4225. In State-B moving L
  4226. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4227. predict error 0
  4228. dir: dir isR
  4229. |588: O: O1175 (predict-yes)
  4230. I see 1 and I'm going to do: predict-yes
  4231. ENV: Agent did: predict-yes for direction R in state State-A
  4232. In State-A moving R
  4233. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4234. predict error 0
  4235. dir: dir isU
  4236. \-/589: O: O1178 (predict-no)
  4237. I see 1 and I'm going to do: predict-no
  4238. ENV: Agent did: predict-no for direction U in state State-B
  4239. In State-B moving U
  4240. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4241. predict error 0
  4242. dir: dir isU
  4243. |\-590: O: O1180 (predict-no)
  4244. I see 1 and I'm going to do: predict-no
  4245. ENV: Agent did: predict-no for direction U in state State-B
  4246. In State-B moving U
  4247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4248. predict error 0
  4249. dir: dir isL
  4250. /|\591: O: O1181 (predict-yes)
  4251. I see 1 and I'm going to do: predict-yes
  4252. ENV: Agent did: predict-yes for direction L in state State-B
  4253. In State-B moving L
  4254. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4255. predict error 0
  4256. dir: dir isR
  4257. -592: O: O1183 (predict-yes)
  4258. I see 1 and I'm going to do: predict-yes
  4259. ENV: Agent did: predict-yes for direction R in state State-A
  4260. In State-A moving R
  4261. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4262. predict error 0
  4263. dir: dir isL
  4264. /|\593: O: O1185 (predict-yes)
  4265. I see 1 and I'm going to do: predict-yes
  4266. ENV: Agent did: predict-yes for direction L in state State-B
  4267. In State-B moving L
  4268. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4269. predict error 0
  4270. dir: dir isR
  4271. -/|594: O: O1187 (predict-yes)
  4272. I see 1 and I'm going to do: predict-yes
  4273. ENV: Agent did: predict-yes for direction R in state State-A
  4274. In State-A moving R
  4275. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4276. predict error 0
  4277. dir: dir isL
  4278. \-/595: O: O1189 (predict-yes)
  4279. I see 1 and I'm going to do: predict-yes
  4280. ENV: Agent did: predict-yes for direction L in state State-B
  4281. In State-B moving L
  4282. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4283. predict error 0
  4284. dir: dir isU
  4285. |\-596: O: O1192 (predict-no)
  4286. I see 1 and I'm going to do: predict-no
  4287. ENV: Agent did: predict-no for direction U in state State-A
  4288. In State-A moving U
  4289. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4290. predict error 0
  4291. dir: dir isU
  4292. /|\597: O: O1194 (predict-no)
  4293. I see 1 and I'm going to do: predict-no
  4294. ENV: Agent did: predict-no for direction U in state State-A
  4295. In State-A moving U
  4296. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4297. predict error 0
  4298. dir: dir isL
  4299. -/598: O: O1196 (predict-no)
  4300. I see 1 and I'm going to do: predict-no
  4301. ENV: Agent did: predict-no for direction L in state State-A
  4302. In State-A moving L
  4303. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4304. predict error 0
  4305. dir: dir isL
  4306. |\599: O: O1198 (predict-no)
  4307. I see 1 and I'm going to do: predict-no
  4308. ENV: Agent did: predict-no for direction L in state State-A
  4309. In State-A moving L
  4310. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4311. predict error 0
  4312. dir: dir isU
  4313. -/|600: O: O1200 (predict-no)
  4314. I see 1 and I'm going to do: predict-no
  4315. ENV: Agent did: predict-no for direction U in state State-A
  4316. In State-A moving U
  4317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4318. predict error 0
  4319. dir: dir isU
  4320. \-/601: O: O1202 (predict-no)
  4321. I see 1 and I'm going to do: predict-no
  4322. ENV: Agent did: predict-no for direction U in state State-A
  4323. In State-A moving U
  4324. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4325. predict error 0
  4326. dir: dir isU
  4327. |602: O: O1204 (predict-no)
  4328. I see 1 and I'm going to do: predict-no
  4329. ENV: Agent did: predict-no for direction U in state State-A
  4330. In State-A moving U
  4331. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4332. predict error 0
  4333. dir: dir isL
  4334. \-/603: O: O1206 (predict-no)
  4335. I see 1 and I'm going to do: predict-no
  4336. ENV: Agent did: predict-no for direction L in state State-A
  4337. In State-A moving L
  4338. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4339. predict error 0
  4340. dir: dir isU
  4341. |604: O: O1208 (predict-no)
  4342. I see 1 and I'm going to do: predict-no
  4343. ENV: Agent did: predict-no for direction U in state State-A
  4344. In State-A moving U
  4345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4346. predict error 0
  4347. dir: dir isR
  4348. \-605: O: O1209 (predict-yes)
  4349. I see 1 and I'm going to do: predict-yes
  4350. ENV: Agent did: predict-yes for direction R in state State-A
  4351. In State-A moving R
  4352. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4353. predict error 0
  4354. dir: dir isL
  4355. /606: O: O1211 (predict-yes)
  4356. I see 1 and I'm going to do: predict-yes
  4357. ENV: Agent did: predict-yes for direction L in state State-B
  4358. In State-B moving L
  4359. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4360. predict error 0
  4361. dir: dir isR
  4362. |\-607: O: O1213 (predict-yes)
  4363. I see 1 and I'm going to do: predict-yes
  4364. ENV: Agent did: predict-yes for direction R in state State-A
  4365. In State-A moving R
  4366. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4367. predict error 0
  4368. dir: dir isU
  4369. /|\608: O: O1216 (predict-no)
  4370. I see 1 and I'm going to do: predict-no
  4371. ENV: Agent did: predict-no for direction U in state State-B
  4372. In State-B moving U
  4373. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4374. predict error 0
  4375. dir: dir isU
  4376. -/|609: O: O1218 (predict-no)
  4377. I see 1 and I'm going to do: predict-no
  4378. ENV: Agent did: predict-no for direction U in state State-B
  4379. In State-B moving U
  4380. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4381. predict error 0
  4382. dir: dir isL
  4383. \610: O: O1219 (predict-yes)
  4384. I see 1 and I'm going to do: predict-yes
  4385. ENV: Agent did: predict-yes for direction L in state State-B
  4386. In State-B moving L
  4387. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4388. predict error 0
  4389. dir: dir isR
  4390. -/|611: O: O1221 (predict-yes)
  4391. I see 1 and I'm going to do: predict-yes
  4392. ENV: Agent did: predict-yes for direction R in state State-A
  4393. In State-A moving R
  4394. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4395. predict error 0
  4396. dir: dir isL
  4397. \612: O: O1224 (predict-no)
  4398. I see 1 and I'm going to do: predict-no
  4399. ENV: Agent did: predict-no for direction L in state State-B
  4400. In State-B moving L
  4401. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  4402. predict error 1
  4403. dir: dir isU
  4404. -/|613: O: O1226 (predict-no)
  4405. I see 0 and I'm going to do: predict-no
  4406. ENV: Agent did: predict-no for direction U in state State-A
  4407. In State-A moving U
  4408. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4409. predict error 0
  4410. dir: dir isR
  4411. \-/614: O: O1227 (predict-yes)
  4412. I see 1 and I'm going to do: predict-yes
  4413. ENV: Agent did: predict-yes for direction R in state State-A
  4414. In State-A moving R
  4415. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4416. predict error 0
  4417. dir: dir isU
  4418. |\615: O: O1230 (predict-no)
  4419. I see 1 and I'm going to do: predict-no
  4420. ENV: Agent did: predict-no for direction U in state State-B
  4421. In State-B moving U
  4422. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4423. predict error 0
  4424. dir: dir isU
  4425. -/|616: O: O1232 (predict-no)
  4426. I see 1 and I'm going to do: predict-no
  4427. ENV: Agent did: predict-no for direction U in state State-B
  4428. In State-B moving U
  4429. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4430. predict error 0
  4431. dir: dir isR
  4432. \-/617: O: O1233 (predict-yes)
  4433. I see 1 and I'm going to do: predict-yes
  4434. ENV: Agent did: predict-yes for direction R in state State-B
  4435. In State-B moving R
  4436. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4437. predict error 1
  4438. dir: dir isL
  4439. |\-618: O: O1235 (predict-yes)
  4440. I see 0 and I'm going to do: predict-yes
  4441. ENV: Agent did: predict-yes for direction L in state State-B
  4442. In State-B moving L
  4443. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4444. predict error 0
  4445. dir: dir isR
  4446. /|\619: O: O1237 (predict-yes)
  4447. I see 1 and I'm going to do: predict-yes
  4448. ENV: Agent did: predict-yes for direction R in state State-A
  4449. In State-A moving R
  4450. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4451. predict error 0
  4452. dir: dir isL
  4453. -/|620: O: O1239 (predict-yes)
  4454. I see 1 and I'm going to do: predict-yes
  4455. ENV: Agent did: predict-yes for direction L in state State-B
  4456. In State-B moving L
  4457. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4458. predict error 0
  4459. dir: dir isL
  4460. \621: O: O1242 (predict-no)
  4461. I see 1 and I'm going to do: predict-no
  4462. ENV: Agent did: predict-no for direction L in state State-A
  4463. In State-A moving L
  4464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4465. predict error 0
  4466. dir: dir isU
  4467. -622: O: O1244 (predict-no)
  4468. I see 1 and I'm going to do: predict-no
  4469. ENV: Agent did: predict-no for direction U in state State-A
  4470. In State-A moving U
  4471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4472. predict error 0
  4473. dir: dir isR
  4474. /|\623: O: O1245 (predict-yes)
  4475. I see 1 and I'm going to do: predict-yes
  4476. ENV: Agent did: predict-yes for direction R in state State-A
  4477. In State-A moving R
  4478. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4479. predict error 0
  4480. dir: dir isU
  4481. -/|624: O: O1248 (predict-no)
  4482. I see 1 and I'm going to do: predict-no
  4483. ENV: Agent did: predict-no for direction U in state State-B
  4484. In State-B moving U
  4485. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4486. predict error 0
  4487. dir: dir isL
  4488. \-/625: O: O1249 (predict-yes)
  4489. I see 1 and I'm going to do: predict-yes
  4490. ENV: Agent did: predict-yes for direction L in state State-B
  4491. In State-B moving L
  4492. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4493. predict error 0
  4494. dir: dir isU
  4495. |626: O: O1252 (predict-no)
  4496. I see 1 and I'm going to do: predict-no
  4497. ENV: Agent did: predict-no for direction U in state State-A
  4498. In State-A moving U
  4499. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4500. predict error 0
  4501. dir: dir isU
  4502. \-627: O: O1254 (predict-no)
  4503. I see 1 and I'm going to do: predict-no
  4504. ENV: Agent did: predict-no for direction U in state State-A
  4505. In State-A moving U
  4506. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4507. predict error 0
  4508. dir: dir isL
  4509. /|\628: O: O1256 (predict-no)
  4510. I see 1 and I'm going to do: predict-no
  4511. ENV: Agent did: predict-no for direction L in state State-A
  4512. In State-A moving L
  4513. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4514. predict error 0
  4515. dir: dir isL
  4516. -/|629: O: O1258 (predict-no)
  4517. I see 1 and I'm going to do: predict-no
  4518. ENV: Agent did: predict-no for direction L in state State-A
  4519. In State-A moving L
  4520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4521. predict error 0
  4522. dir: dir isR
  4523. \-/630: O: O1259 (predict-yes)
  4524. I see 1 and I'm going to do: predict-yes
  4525. ENV: Agent did: predict-yes for direction R in state State-A
  4526. In State-A moving R
  4527. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4528. predict error 0
  4529. dir: dir isR
  4530. |\631: O: O1262 (predict-no)
  4531. I see 1 and I'm going to do: predict-no
  4532. ENV: Agent did: predict-no for direction R in state State-B
  4533. In State-B moving R
  4534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4535. predict error 0
  4536. dir: dir isL
  4537. -632: O: O1263 (predict-yes)
  4538. I see 1 and I'm going to do: predict-yes
  4539. ENV: Agent did: predict-yes for direction L in state State-B
  4540. In State-B moving L
  4541. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4542. predict error 0
  4543. dir: dir isL
  4544. /|633: O: O1266 (predict-no)
  4545. I see 1 and I'm going to do: predict-no
  4546. ENV: Agent did: predict-no for direction L in state State-A
  4547. In State-A moving L
  4548. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4549. predict error 0
  4550. dir: dir isL
  4551. \-/634: O: O1268 (predict-no)
  4552. I see 1 and I'm going to do: predict-no
  4553. ENV: Agent did: predict-no for direction L in state State-A
  4554. In State-A moving L
  4555. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4556. predict error 0
  4557. dir: dir isR
  4558. |\-635: O: O1269 (predict-yes)
  4559. I see 1 and I'm going to do: predict-yes
  4560. ENV: Agent did: predict-yes for direction R in state State-A
  4561. In State-A moving R
  4562. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4563. predict error 0
  4564. dir: dir isU
  4565. /|\636: O: O1272 (predict-no)
  4566. I see 1 and I'm going to do: predict-no
  4567. ENV: Agent did: predict-no for direction U in state State-B
  4568. In State-B moving U
  4569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4570. predict error 0
  4571. dir: dir isL
  4572. -/|637: O: O1273 (predict-yes)
  4573. I see 1 and I'm going to do: predict-yes
  4574. ENV: Agent did: predict-yes for direction L in state State-B
  4575. In State-B moving L
  4576. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4577. predict error 0
  4578. dir: dir isL
  4579. \-/638: O: O1276 (predict-no)
  4580. I see 1 and I'm going to do: predict-no
  4581. ENV: Agent did: predict-no for direction L in state State-A
  4582. In State-A moving L
  4583. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4584. predict error 0
  4585. dir: dir isU
  4586. |\-639: O: O1278 (predict-no)
  4587. I see 1 and I'm going to do: predict-no
  4588. ENV: Agent did: predict-no for direction U in state State-A
  4589. In State-A moving U
  4590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4591. predict error 0
  4592. dir: dir isU
  4593. /|\640: O: O1280 (predict-no)
  4594. I see 1 and I'm going to do: predict-no
  4595. ENV: Agent did: predict-no for direction U in state State-A
  4596. In State-A moving U
  4597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4598. predict error 0
  4599. dir: dir isU
  4600. -/|641: O: O1282 (predict-no)
  4601. I see 1 and I'm going to do: predict-no
  4602. ENV: Agent did: predict-no for direction U in state State-A
  4603. In State-A moving U
  4604. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4605. predict error 0
  4606. dir: dir isR
  4607. \642: O: O1283 (predict-yes)
  4608. I see 1 and I'm going to do: predict-yes
  4609. ENV: Agent did: predict-yes for direction R in state State-A
  4610. In State-A moving R
  4611. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4612. predict error 0
  4613. dir: dir isR
  4614. -/643: O: O1286 (predict-no)
  4615. I see 1 and I'm going to do: predict-no
  4616. ENV: Agent did: predict-no for direction R in state State-B
  4617. In State-B moving R
  4618. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4619. predict error 0
  4620. dir: dir isU
  4621. |\644: O: O1288 (predict-no)
  4622. I see 1 and I'm going to do: predict-no
  4623. ENV: Agent did: predict-no for direction U in state State-B
  4624. In State-B moving U
  4625. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4626. predict error 0
  4627. dir: dir isL
  4628. -/645: O: O1289 (predict-yes)
  4629. I see 1 and I'm going to do: predict-yes
  4630. ENV: Agent did: predict-yes for direction L in state State-B
  4631. In State-B moving L
  4632. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4633. predict error 0
  4634. dir: dir isU
  4635. |\-646: O: O1292 (predict-no)
  4636. I see 1 and I'm going to do: predict-no
  4637. ENV: Agent did: predict-no for direction U in state State-A
  4638. In State-A moving U
  4639. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4640. predict error 0
  4641. dir: dir isL
  4642. /647: O: O1294 (predict-no)
  4643. I see 1 and I'm going to do: predict-no
  4644. ENV: Agent did: predict-no for direction L in state State-A
  4645. In State-A moving L
  4646. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4647. predict error 0
  4648. dir: dir isR
  4649. |\648: O: O1295 (predict-yes)
  4650. I see 1 and I'm going to do: predict-yes
  4651. ENV: Agent did: predict-yes for direction R in state State-A
  4652. In State-A moving R
  4653. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4654. predict error 0
  4655. dir: dir isR
  4656. -649: O: O1298 (predict-no)
  4657. I see 1 and I'm going to do: predict-no
  4658. ENV: Agent did: predict-no for direction R in state State-B
  4659. In State-B moving R
  4660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4661. predict error 0
  4662. dir: dir isR
  4663. /|\650: O: O1300 (predict-no)
  4664. I see 1 and I'm going to do: predict-no
  4665. ENV: Agent did: predict-no for direction R in state State-B
  4666. In State-B moving R
  4667. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4668. predict error 0
  4669. dir: dir isL
  4670. -/|651: O: O1301 (predict-yes)
  4671. I see 1 and I'm going to do: predict-yes
  4672. ENV: Agent did: predict-yes for direction L in state State-B
  4673. In State-B moving L
  4674. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4675. predict error 0
  4676. dir: dir isL
  4677. \652: O: O1304 (predict-no)
  4678. I see 1 and I'm going to do: predict-no
  4679. ENV: Agent did: predict-no for direction L in state State-A
  4680. In State-A moving L
  4681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4682. predict error 0
  4683. dir: dir isU
  4684. -/|\653: O: O1306 (predict-no)
  4685. I see 1 and I'm going to do: predict-no
  4686. ENV: Agent did: predict-no for direction U in state State-A
  4687. In State-A moving U
  4688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4689. predict error 0
  4690. dir: dir isR
  4691. -/|654: O: O1308 (predict-no)
  4692. I see 1 and I'm going to do: predict-no
  4693. ENV: Agent did: predict-no for direction R in state State-A
  4694. In State-A moving R
  4695. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  4696. predict error 1
  4697. dir: dir isR
  4698. \-/655: O: O1310 (predict-no)
  4699. I see 0 and I'm going to do: predict-no
  4700. ENV: Agent did: predict-no for direction R in state State-B
  4701. In State-B moving R
  4702. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4703. predict error 0
  4704. dir: dir isL
  4705. |\-656: O: O1311 (predict-yes)
  4706. I see 1 and I'm going to do: predict-yes
  4707. ENV: Agent did: predict-yes for direction L in state State-B
  4708. In State-B moving L
  4709. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4710. predict error 0
  4711. dir: dir isU
  4712. /|\657: O: O1314 (predict-no)
  4713. I see 1 and I'm going to do: predict-no
  4714. ENV: Agent did: predict-no for direction U in state State-A
  4715. In State-A moving U
  4716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4717. predict error 0
  4718. dir: dir isL
  4719. -/658: O: O1316 (predict-no)
  4720. I see 1 and I'm going to do: predict-no
  4721. ENV: Agent did: predict-no for direction L in state State-A
  4722. In State-A moving L
  4723. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4724. predict error 0
  4725. dir: dir isR
  4726. |\-659: O: O1317 (predict-yes)
  4727. I see 1 and I'm going to do: predict-yes
  4728. ENV: Agent did: predict-yes for direction R in state State-A
  4729. In State-A moving R
  4730. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4731. predict error 0
  4732. dir: dir isU
  4733. /|\660: O: O1320 (predict-no)
  4734. I see 1 and I'm going to do: predict-no
  4735. ENV: Agent did: predict-no for direction U in state State-B
  4736. In State-B moving U
  4737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4738. predict error 0
  4739. dir: dir isU
  4740. -/661: O: O1322 (predict-no)
  4741. I see 1 and I'm going to do: predict-no
  4742. ENV: Agent did: predict-no for direction U in state State-B
  4743. In State-B moving U
  4744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4745. predict error 0
  4746. dir: dir isL
  4747. |662: O: O1323 (predict-yes)
  4748. I see 1 and I'm going to do: predict-yes
  4749. ENV: Agent did: predict-yes for direction L in state State-B
  4750. In State-B moving L
  4751. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4752. predict error 0
  4753. dir: dir isU
  4754. \-/663: O: O1326 (predict-no)
  4755. I see 1 and I'm going to do: predict-no
  4756. ENV: Agent did: predict-no for direction U in state State-A
  4757. In State-A moving U
  4758. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4759. predict error 0
  4760. dir: dir isU
  4761. |\664: O: O1328 (predict-no)
  4762. I see 1 and I'm going to do: predict-no
  4763. ENV: Agent did: predict-no for direction U in state State-A
  4764. In State-A moving U
  4765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4766. predict error 0
  4767. dir: dir isL
  4768. -665: O: O1330 (predict-no)
  4769. I see 1 and I'm going to do: predict-no
  4770. ENV: Agent did: predict-no for direction L in state State-A
  4771. In State-A moving L
  4772. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4773. predict error 0
  4774. dir: dir isR
  4775. /|\666: O: O1331 (predict-yes)
  4776. I see 1 and I'm going to do: predict-yes
  4777. ENV: Agent did: predict-yes for direction R in state State-A
  4778. In State-A moving R
  4779. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4780. predict error 0
  4781. dir: dir isR
  4782. -667: O: O1334 (predict-no)
  4783. I see 1 and I'm going to do: predict-no
  4784. ENV: Agent did: predict-no for direction R in state State-B
  4785. In State-B moving R
  4786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4787. predict error 0
  4788. dir: dir isU
  4789. /|668: O: O1336 (predict-no)
  4790. I see 1 and I'm going to do: predict-no
  4791. ENV: Agent did: predict-no for direction U in state State-B
  4792. In State-B moving U
  4793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4794. predict error 0
  4795. dir: dir isR
  4796. \-/669: O: O1338 (predict-no)
  4797. I see 1 and I'm going to do: predict-no
  4798. ENV: Agent did: predict-no for direction R in state State-B
  4799. In State-B moving R
  4800. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4801. predict error 0
  4802. dir: dir isU
  4803. |\-670: O: O1340 (predict-no)
  4804. I see 1 and I'm going to do: predict-no
  4805. ENV: Agent did: predict-no for direction U in state State-B
  4806. In State-B moving U
  4807. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4808. predict error 0
  4809. dir: dir isU
  4810. /|\671: O: O1341 (predict-yes)
  4811. I see 1 and I'm going to do: predict-yes
  4812. ENV: Agent did: predict-yes for direction U in state State-B
  4813. In State-B moving U
  4814. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4815. predict error 1
  4816. dir: dir isL
  4817. -672: O: O1343 (predict-yes)
  4818. I see 0 and I'm going to do: predict-yes
  4819. ENV: Agent did: predict-yes for direction L in state State-B
  4820. In State-B moving L
  4821. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4822. predict error 0
  4823. dir: dir isU
  4824. /|673: O: O1346 (predict-no)
  4825. I see 1 and I'm going to do: predict-no
  4826. ENV: Agent did: predict-no for direction U in state State-A
  4827. In State-A moving U
  4828. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4829. predict error 0
  4830. dir: dir isL
  4831. \-/674: O: O1348 (predict-no)
  4832. I see 1 and I'm going to do: predict-no
  4833. ENV: Agent did: predict-no for direction L in state State-A
  4834. In State-A moving L
  4835. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4836. predict error 0
  4837. dir: dir isL
  4838. |\-675: O: O1350 (predict-no)
  4839. I see 1 and I'm going to do: predict-no
  4840. ENV: Agent did: predict-no for direction L in state State-A
  4841. In State-A moving L
  4842. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4843. predict error 0
  4844. dir: dir isR
  4845. /676: O: O1351 (predict-yes)
  4846. I see 1 and I'm going to do: predict-yes
  4847. ENV: Agent did: predict-yes for direction R in state State-A
  4848. In State-A moving R
  4849. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4850. predict error 0
  4851. dir: dir isL
  4852. |\-677: O: O1353 (predict-yes)
  4853. I see 1 and I'm going to do: predict-yes
  4854. ENV: Agent did: predict-yes for direction L in state State-B
  4855. In State-B moving L
  4856. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4857. predict error 0
  4858. dir: dir isR
  4859. /|678: O: O1355 (predict-yes)
  4860. I see 1 and I'm going to do: predict-yes
  4861. ENV: Agent did: predict-yes for direction R in state State-A
  4862. In State-A moving R
  4863. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4864. predict error 0
  4865. dir: dir isL
  4866. \-/679: O: O1357 (predict-yes)
  4867. I see 1 and I'm going to do: predict-yes
  4868. ENV: Agent did: predict-yes for direction L in state State-B
  4869. In State-B moving L
  4870. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4871. predict error 0
  4872. dir: dir isR
  4873. |680: O: O1359 (predict-yes)
  4874. I see 1 and I'm going to do: predict-yes
  4875. ENV: Agent did: predict-yes for direction R in state State-A
  4876. In State-A moving R
  4877. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4878. predict error 0
  4879. dir: dir isU
  4880. \-/681: O: O1362 (predict-no)
  4881. I see 1 and I'm going to do: predict-no
  4882. ENV: Agent did: predict-no for direction U in state State-B
  4883. In State-B moving U
  4884. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4885. predict error 0
  4886. dir: dir isU
  4887. |682: O: O1364 (predict-no)
  4888. I see 1 and I'm going to do: predict-no
  4889. ENV: Agent did: predict-no for direction U in state State-B
  4890. In State-B moving U
  4891. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4892. predict error 0
  4893. dir: dir isL
  4894. \-/683: O: O1365 (predict-yes)
  4895. I see 1 and I'm going to do: predict-yes
  4896. ENV: Agent did: predict-yes for direction L in state State-B
  4897. In State-B moving L
  4898. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4899. predict error 0
  4900. dir: dir isL
  4901. |\-684: O: O1368 (predict-no)
  4902. I see 1 and I'm going to do: predict-no
  4903. ENV: Agent did: predict-no for direction L in state State-A
  4904. In State-A moving L
  4905. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4906. predict error 0
  4907. dir: dir isU
  4908. /|\685: O: O1370 (predict-no)
  4909. I see 1 and I'm going to do: predict-no
  4910. ENV: Agent did: predict-no for direction U in state State-A
  4911. In State-A moving U
  4912. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4913. predict error 0
  4914. dir: dir isL
  4915. -/686: O: O1372 (predict-no)
  4916. I see 1 and I'm going to do: predict-no
  4917. ENV: Agent did: predict-no for direction L in state State-A
  4918. In State-A moving L
  4919. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4920. predict error 0
  4921. dir: dir isL
  4922. |\-687: O: O1374 (predict-no)
  4923. I see 1 and I'm going to do: predict-no
  4924. ENV: Agent did: predict-no for direction L in state State-A
  4925. In State-A moving L
  4926. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4927. predict error 0
  4928. dir: dir isL
  4929. /688: O: O1376 (predict-no)
  4930. I see 1 and I'm going to do: predict-no
  4931. ENV: Agent did: predict-no for direction L in state State-A
  4932. In State-A moving L
  4933. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4934. predict error 0
  4935. dir: dir isL
  4936. |\-689: O: O1378 (predict-no)
  4937. I see 1 and I'm going to do: predict-no
  4938. ENV: Agent did: predict-no for direction L in state State-A
  4939. In State-A moving L
  4940. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4941. predict error 0
  4942. dir: dir isL
  4943. /|\690: O: O1380 (predict-no)
  4944. I see 1 and I'm going to do: predict-no
  4945. ENV: Agent did: predict-no for direction L in state State-A
  4946. In State-A moving L
  4947. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4948. predict error 0
  4949. dir: dir isR
  4950. -/|691: O: O1381 (predict-yes)
  4951. I see 1 and I'm going to do: predict-yes
  4952. ENV: Agent did: predict-yes for direction R in state State-A
  4953. In State-A moving R
  4954. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4955. predict error 0
  4956. dir: dir isU
  4957. \692: O: O1384 (predict-no)
  4958. I see 1 and I'm going to do: predict-no
  4959. ENV: Agent did: predict-no for direction U in state State-B
  4960. In State-B moving U
  4961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4962. predict error 0
  4963. dir: dir isU
  4964. -/|\693: O: O1386 (predict-no)
  4965. I see 1 and I'm going to do: predict-no
  4966. ENV: Agent did: predict-no for direction U in state State-B
  4967. In State-B moving U
  4968. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4969. predict error 0
  4970. dir: dir isU
  4971. -/|694: O: O1388 (predict-no)
  4972. I see 1 and I'm going to do: predict-no
  4973. ENV: Agent did: predict-no for direction U in state State-B
  4974. In State-B moving U
  4975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4976. predict error 0
  4977. dir: dir isR
  4978. \-695: O: O1390 (predict-no)
  4979. I see 1 and I'm going to do: predict-no
  4980. ENV: Agent did: predict-no for direction R in state State-B
  4981. In State-B moving R
  4982. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4983. predict error 0
  4984. dir: dir isR
  4985. /|\696: O: O1392 (predict-no)
  4986. I see 1 and I'm going to do: predict-no
  4987. ENV: Agent did: predict-no for direction R in state State-B
  4988. In State-B moving R
  4989. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4990. predict error 0
  4991. dir: dir isR
  4992. -/697: O: O1394 (predict-no)
  4993. I see 1 and I'm going to do: predict-no
  4994. ENV: Agent did: predict-no for direction R in state State-B
  4995. In State-B moving R
  4996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4997. predict error 0
  4998. dir: dir isU
  4999. |\-698: O: O1396 (predict-no)
  5000. I see 1 and I'm going to do: predict-no
  5001. ENV: Agent did: predict-no for direction U in state State-B
  5002. In State-B moving U
  5003. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5004. predict error 0
  5005. dir: dir isR
  5006. /|\699: O: O1398 (predict-no)
  5007. I see 1 and I'm going to do: predict-no
  5008. ENV: Agent did: predict-no for direction R in state State-B
  5009. In State-B moving R
  5010. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5011. predict error 0
  5012. dir: dir isL
  5013. -/|700: O: O1399 (predict-yes)
  5014. I see 1 and I'm going to do: predict-yes
  5015. ENV: Agent did: predict-yes for direction L in state State-B
  5016. In State-B moving L
  5017. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5018. predict error 0
  5019. dir: dir isL
  5020. \-701: O: O1402 (predict-no)
  5021. I see 1 and I'm going to do: predict-no
  5022. ENV: Agent did: predict-no for direction L in state State-A
  5023. In State-A moving L
  5024. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5025. predict error 0
  5026. dir: dir isU
  5027. /702: O: O1404 (predict-no)
  5028. I see 1 and I'm going to do: predict-no
  5029. ENV: Agent did: predict-no for direction U in state State-A
  5030. In State-A moving U
  5031. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5032. predict error 0
  5033. dir: dir isR
  5034. |\703: O: O1405 (predict-yes)
  5035. I see 1 and I'm going to do: predict-yes
  5036. ENV: Agent did: predict-yes for direction R in state State-A
  5037. In State-A moving R
  5038. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5039. predict error 0
  5040. dir: dir isR
  5041. -/|704: O: O1408 (predict-no)
  5042. I see 1 and I'm going to do: predict-no
  5043. ENV: Agent did: predict-no for direction R in state State-B
  5044. In State-B moving R
  5045. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5046. predict error 0
  5047. dir: dir isR
  5048. \-/705: O: O1409 (predict-yes)
  5049. I see 1 and I'm going to do: predict-yes
  5050. ENV: Agent did: predict-yes for direction R in state State-B
  5051. In State-B moving R
  5052. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  5053. predict error 1
  5054. dir: dir isR
  5055. |\-706: O: O1412 (predict-no)
  5056. I see 0 and I'm going to do: predict-no
  5057. ENV: Agent did: predict-no for direction R in state State-B
  5058. In State-B moving R
  5059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5060. predict error 0
  5061. dir: dir isR
  5062. /|\707: O: O1414 (predict-no)
  5063. I see 1 and I'm going to do: predict-no
  5064. ENV: Agent did: predict-no for direction R in state State-B
  5065. In State-B moving R
  5066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5067. predict error 0
  5068. dir: dir isL
  5069. -708: O: O1415 (predict-yes)
  5070. I see 1 and I'm going to do: predict-yes
  5071. ENV: Agent did: predict-yes for direction L in state State-B
  5072. In State-B moving L
  5073. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5074. predict error 0
  5075. dir: dir isR
  5076. /|\709: O: O1417 (predict-yes)
  5077. I see 1 and I'm going to do: predict-yes
  5078. ENV: Agent did: predict-yes for direction R in state State-A
  5079. In State-A moving R
  5080. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5081. predict error 0
  5082. dir: dir isR
  5083. -710: O: O1420 (predict-no)
  5084. I see 1 and I'm going to do: predict-no
  5085. ENV: Agent did: predict-no for direction R in state State-B
  5086. In State-B moving R
  5087. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5088. predict error 0
  5089. dir: dir isL
  5090. /|\711: O: O1421 (predict-yes)
  5091. I see 1 and I'm going to do: predict-yes
  5092. ENV: Agent did: predict-yes for direction L in state State-B
  5093. In State-B moving L
  5094. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5095. predict error 0
  5096. dir: dir isU
  5097. -712: O: O1424 (predict-no)
  5098. I see 1 and I'm going to do: predict-no
  5099. ENV: Agent did: predict-no for direction U in state State-A
  5100. In State-A moving U
  5101. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5102. predict error 0
  5103. dir: dir isR
  5104. /|713: O: O1425 (predict-yes)
  5105. I see 1 and I'm going to do: predict-yes
  5106. ENV: Agent did: predict-yes for direction R in state State-A
  5107. In State-A moving R
  5108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5109. predict error 0
  5110. dir: dir isR
  5111. \-714: O: O1428 (predict-no)
  5112. I see 1 and I'm going to do: predict-no
  5113. ENV: Agent did: predict-no for direction R in state State-B
  5114. In State-B moving R
  5115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5116. predict error 0
  5117. dir: dir isU
  5118. /|\715: O: O1430 (predict-no)
  5119. I see 1 and I'm going to do: predict-no
  5120. ENV: Agent did: predict-no for direction U in state State-B
  5121. In State-B moving U
  5122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5123. predict error 0
  5124. dir: dir isU
  5125. -/|\716: O: O1432 (predict-no)
  5126. I see 1 and I'm going to do: predict-no
  5127. ENV: Agent did: predict-no for direction U in state State-B
  5128. In State-B moving U
  5129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5130. predict error 0
  5131. dir: dir isU
  5132. -/|\717: O: O1434 (predict-no)
  5133. I see 1 and I'm going to do: predict-no
  5134. ENV: Agent did: predict-no for direction U in state State-B
  5135. In State-B moving U
  5136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5137. predict error 0
  5138. dir: dir isU
  5139. -/|718: O: O1436 (predict-no)
  5140. I see 1 and I'm going to do: predict-no
  5141. ENV: Agent did: predict-no for direction U in state State-B
  5142. In State-B moving U
  5143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5144. predict error 0
  5145. dir: dir isL
  5146. \-719: O: O1437 (predict-yes)
  5147. I see 1 and I'm going to do: predict-yes
  5148. ENV: Agent did: predict-yes for direction L in state State-B
  5149. In State-B moving L
  5150. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5151. predict error 0
  5152. dir: dir isU
  5153. /|720: O: O1440 (predict-no)
  5154. I see 1 and I'm going to do: predict-no
  5155. ENV: Agent did: predict-no for direction U in state State-A
  5156. In State-A moving U
  5157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5158. predict error 0
  5159. dir: dir isL
  5160. \-721: O: O1442 (predict-no)
  5161. I see 1 and I'm going to do: predict-no
  5162. ENV: Agent did: predict-no for direction L in state State-A
  5163. In State-A moving L
  5164. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5165. predict error 0
  5166. dir: dir isU
  5167. /722: O: O1444 (predict-no)
  5168. I see 1 and I'm going to do: predict-no
  5169. ENV: Agent did: predict-no for direction U in state State-A
  5170. In State-A moving U
  5171. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5172. predict error 0
  5173. dir: dir isU
  5174. |\-723: O: O1446 (predict-no)
  5175. I see 1 and I'm going to do: predict-no
  5176. ENV: Agent did: predict-no for direction U in state State-A
  5177. In State-A moving U
  5178. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5179. predict error 0
  5180. dir: dir isU
  5181. /|\724: O: O1448 (predict-no)
  5182. I see 1 and I'm going to do: predict-no
  5183. ENV: Agent did: predict-no for direction U in state State-A
  5184. In State-A moving U
  5185. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5186. predict error 0
  5187. dir: dir isL
  5188. -/|725: O: O1450 (predict-no)
  5189. I see 1 and I'm going to do: predict-no
  5190. ENV: Agent did: predict-no for direction L in state State-A
  5191. In State-A moving L
  5192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5193. predict error 0
  5194. dir: dir isL
  5195. \-/|726: O: O1452 (predict-no)
  5196. I see 1 and I'm going to do: predict-no
  5197. ENV: Agent did: predict-no for direction L in state State-A
  5198. In State-A moving L
  5199. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5200. predict error 0
  5201. dir: dir isU
  5202. \-/727: O: O1454 (predict-no)
  5203. I see 1 and I'm going to do: predict-no
  5204. ENV: Agent did: predict-no for direction U in state State-A
  5205. In State-A moving U
  5206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5207. predict error 0
  5208. dir: dir isR
  5209. |\-728: O: O1455 (predict-yes)
  5210. I see 1 and I'm going to do: predict-yes
  5211. ENV: Agent did: predict-yes for direction R in state State-A
  5212. In State-A moving R
  5213. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5214. predict error 0
  5215. dir: dir isR
  5216. /|\729: O: O1458 (predict-no)
  5217. I see 1 and I'm going to do: predict-no
  5218. ENV: Agent did: predict-no for direction R in state State-B
  5219. In State-B moving R
  5220. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5221. predict error 0
  5222. dir: dir isU
  5223. -/730: O: O1460 (predict-no)
  5224. I see 1 and I'm going to do: predict-no
  5225. ENV: Agent did: predict-no for direction U in state State-B
  5226. In State-B moving U
  5227. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5228. predict error 0
  5229. dir: dir isL
  5230. |\-731: O: O1461 (predict-yes)
  5231. I see 1 and I'm going to do: predict-yes
  5232. ENV: Agent did: predict-yes for direction L in state State-B
  5233. In State-B moving L
  5234. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5235. predict error 0
  5236. dir: dir isR
  5237. /732: O: O1463 (predict-yes)
  5238. I see 1 and I'm going to do: predict-yes
  5239. ENV: Agent did: predict-yes for direction R in state State-A
  5240. In State-A moving R
  5241. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5242. predict error 0
  5243. dir: dir isR
  5244. |\733: O: O1466 (predict-no)
  5245. I see 1 and I'm going to do: predict-no
  5246. ENV: Agent did: predict-no for direction R in state State-B
  5247. In State-B moving R
  5248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5249. predict error 0
  5250. dir: dir isL
  5251. -/|734: O: O1467 (predict-yes)
  5252. I see 1 and I'm going to do: predict-yes
  5253. ENV: Agent did: predict-yes for direction L in state State-B
  5254. In State-B moving L
  5255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5256. predict error 0
  5257. dir: dir isR
  5258. \-/735: O: O1469 (predict-yes)
  5259. I see 1 and I'm going to do: predict-yes
  5260. ENV: Agent did: predict-yes for direction R in state State-A
  5261. In State-A moving R
  5262. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5263. predict error 0
  5264. dir: dir isU
  5265. |\-/736: O: O1472 (predict-no)
  5266. I see 1 and I'm going to do: predict-no
  5267. ENV: Agent did: predict-no for direction U in state State-B
  5268. In State-B moving U
  5269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5270. predict error 0
  5271. dir: dir isU
  5272. |\737: O: O1474 (predict-no)
  5273. I see 1 and I'm going to do: predict-no
  5274. ENV: Agent did: predict-no for direction U in state State-B
  5275. In State-B moving U
  5276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5277. predict error 0
  5278. dir: dir isL
  5279. -/738: O: O1475 (predict-yes)
  5280. I see 1 and I'm going to do: predict-yes
  5281. ENV: Agent did: predict-yes for direction L in state State-B
  5282. In State-B moving L
  5283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5284. predict error 0
  5285. dir: dir isR
  5286. |\-739: O: O1477 (predict-yes)
  5287. I see 1 and I'm going to do: predict-yes
  5288. ENV: Agent did: predict-yes for direction R in state State-A
  5289. In State-A moving R
  5290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5291. predict error 0
  5292. dir: dir isL
  5293. /|\740: O: O1479 (predict-yes)
  5294. I see 1 and I'm going to do: predict-yes
  5295. ENV: Agent did: predict-yes for direction L in state State-B
  5296. In State-B moving L
  5297. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5298. predict error 0
  5299. dir: dir isU
  5300. -/741: O: O1482 (predict-no)
  5301. I see 1 and I'm going to do: predict-no
  5302. ENV: Agent did: predict-no for direction U in state State-A
  5303. In State-A moving U
  5304. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5305. predict error 0
  5306. dir: dir isL
  5307. |742: O: O1484 (predict-no)
  5308. I see 1 and I'm going to do: predict-no
  5309. ENV: Agent did: predict-no for direction L in state State-A
  5310. In State-A moving L
  5311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5312. predict error 0
  5313. dir: dir isL
  5314. \-743: O: O1486 (predict-no)
  5315. I see 1 and I'm going to do: predict-no
  5316. ENV: Agent did: predict-no for direction L in state State-A
  5317. In State-A moving L
  5318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5319. predict error 0
  5320. dir: dir isR
  5321. /|\744: O: O1487 (predict-yes)
  5322. I see 1 and I'm going to do: predict-yes
  5323. ENV: Agent did: predict-yes for direction R in state State-A
  5324. In State-A moving R
  5325. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5326. predict error 0
  5327. dir: dir isU
  5328. -/|745: O: O1490 (predict-no)
  5329. I see 1 and I'm going to do: predict-no
  5330. ENV: Agent did: predict-no for direction U in state State-B
  5331. In State-B moving U
  5332. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5333. predict error 0
  5334. dir: dir isL
  5335. \-746: O: O1491 (predict-yes)
  5336. I see 1 and I'm going to do: predict-yes
  5337. ENV: Agent did: predict-yes for direction L in state State-B
  5338. In State-B moving L
  5339. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5340. predict error 0
  5341. dir: dir isL
  5342. /|\747: O: O1494 (predict-no)
  5343. I see 1 and I'm going to do: predict-no
  5344. ENV: Agent did: predict-no for direction L in state State-A
  5345. In State-A moving L
  5346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5347. predict error 0
  5348. dir: dir isU
  5349. -/|748: O: O1496 (predict-no)
  5350. I see 1 and I'm going to do: predict-no
  5351. ENV: Agent did: predict-no for direction U in state State-A
  5352. In State-A moving U
  5353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5354. predict error 0
  5355. dir: dir isU
  5356. \-/749: O: O1498 (predict-no)
  5357. I see 1 and I'm going to do: predict-no
  5358. ENV: Agent did: predict-no for direction U in state State-A
  5359. In State-A moving U
  5360. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5361. predict error 0
  5362. dir: dir isU
  5363. |\-750: O: O1500 (predict-no)
  5364. I see 1 and I'm going to do: predict-no
  5365. ENV: Agent did: predict-no for direction U in state State-A
  5366. In State-A moving U
  5367. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5368. predict error 0
  5369. dir: dir isL
  5370. /|\751: O: O1502 (predict-no)
  5371. I see 1 and I'm going to do: predict-no
  5372. ENV: Agent did: predict-no for direction L in state State-A
  5373. In State-A moving L
  5374. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5375. predict error 0
  5376. dir: dir isR
  5377. -752: O: O1503 (predict-yes)
  5378. I see 1 and I'm going to do: predict-yes
  5379. ENV: Agent did: predict-yes for direction R in state State-A
  5380. In State-A moving R
  5381. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5382. predict error 0
  5383. dir: dir isL
  5384. /|753: O: O1505 (predict-yes)
  5385. I see 1 and I'm going to do: predict-yes
  5386. ENV: Agent did: predict-yes for direction L in state State-B
  5387. In State-B moving L
  5388. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5389. predict error 0
  5390. dir: dir isR
  5391. \-/754: O: O1507 (predict-yes)
  5392. I see 1 and I'm going to do: predict-yes
  5393. ENV: Agent did: predict-yes for direction R in state State-A
  5394. In State-A moving R
  5395. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5396. predict error 0
  5397. dir: dir isL
  5398. |\-755: O: O1509 (predict-yes)
  5399. I see 1 and I'm going to do: predict-yes
  5400. ENV: Agent did: predict-yes for direction L in state State-B
  5401. In State-B moving L
  5402. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5403. predict error 0
  5404. dir: dir isR
  5405. /|\756: O: O1511 (predict-yes)
  5406. I see 1 and I'm going to do: predict-yes
  5407. ENV: Agent did: predict-yes for direction R in state State-A
  5408. In State-A moving R
  5409. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5410. predict error 0
  5411. dir: dir isU
  5412. -/|757: O: O1514 (predict-no)
  5413. I see 1 and I'm going to do: predict-no
  5414. ENV: Agent did: predict-no for direction U in state State-B
  5415. In State-B moving U
  5416. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5417. predict error 0
  5418. dir: dir isU
  5419. \-/758: O: O1516 (predict-no)
  5420. I see 1 and I'm going to do: predict-no
  5421. ENV: Agent did: predict-no for direction U in state State-B
  5422. In State-B moving U
  5423. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5424. predict error 0
  5425. dir: dir isR
  5426. |\-759: O: O1518 (predict-no)
  5427. I see 1 and I'm going to do: predict-no
  5428. ENV: Agent did: predict-no for direction R in state State-B
  5429. In State-B moving R
  5430. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5431. predict error 0
  5432. dir: dir isL
  5433. /|\760: O: O1519 (predict-yes)
  5434. I see 1 and I'm going to do: predict-yes
  5435. ENV: Agent did: predict-yes for direction L in state State-B
  5436. In State-B moving L
  5437. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5438. predict error 0
  5439. dir: dir isR
  5440. -/|761: O: O1521 (predict-yes)
  5441. I see 1 and I'm going to do: predict-yes
  5442. ENV: Agent did: predict-yes for direction R in state State-A
  5443. In State-A moving R
  5444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5445. predict error 0
  5446. dir: dir isR
  5447. \762: O: O1524 (predict-no)
  5448. I see 1 and I'm going to do: predict-no
  5449. ENV: Agent did: predict-no for direction R in state State-B
  5450. In State-B moving R
  5451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5452. predict error 0
  5453. dir: dir isU
  5454. -/|763: O: O1526 (predict-no)
  5455. I see 1 and I'm going to do: predict-no
  5456. ENV: Agent did: predict-no for direction U in state State-B
  5457. In State-B moving U
  5458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5459. predict error 0
  5460. dir: dir isU
  5461. \-/764: O: O1528 (predict-no)
  5462. I see 1 and I'm going to do: predict-no
  5463. ENV: Agent did: predict-no for direction U in state State-B
  5464. In State-B moving U
  5465. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5466. predict error 0
  5467. dir: dir isU
  5468. |\-765: O: O1530 (predict-no)
  5469. I see 1 and I'm going to do: predict-no
  5470. ENV: Agent did: predict-no for direction U in state State-B
  5471. In State-B moving U
  5472. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5473. predict error 0
  5474. dir: dir isU
  5475. /766: O: O1532 (predict-no)
  5476. I see 1 and I'm going to do: predict-no
  5477. ENV: Agent did: predict-no for direction U in state State-B
  5478. In State-B moving U
  5479. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5480. predict error 0
  5481. dir: dir isR
  5482. |767: O: O1534 (predict-no)
  5483. I see 1 and I'm going to do: predict-no
  5484. ENV: Agent did: predict-no for direction R in state State-B
  5485. In State-B moving R
  5486. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5487. predict error 0
  5488. dir: dir isU
  5489. \-/768: O: O1536 (predict-no)
  5490. I see 1 and I'm going to do: predict-no
  5491. ENV: Agent did: predict-no for direction U in state State-B
  5492. In State-B moving U
  5493. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5494. predict error 0
  5495. dir: dir isU
  5496. |\-769: O: O1538 (predict-no)
  5497. I see 1 and I'm going to do: predict-no
  5498. ENV: Agent did: predict-no for direction U in state State-B
  5499. In State-B moving U
  5500. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5501. predict error 0
  5502. dir: dir isL
  5503. /|770: O: O1539 (predict-yes)
  5504. I see 1 and I'm going to do: predict-yes
  5505. ENV: Agent did: predict-yes for direction L in state State-B
  5506. In State-B moving L
  5507. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5508. predict error 0
  5509. dir: dir isL
  5510. \771: O: O1542 (predict-no)
  5511. I see 1 and I'm going to do: predict-no
  5512. ENV: Agent did: predict-no for direction L in state State-A
  5513. In State-A moving L
  5514. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5515. predict error 0
  5516. dir: dir isR
  5517. -772: O: O1543 (predict-yes)
  5518. I see 1 and I'm going to do: predict-yes
  5519. ENV: Agent did: predict-yes for direction R in state State-A
  5520. In State-A moving R
  5521. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5522. predict error 0
  5523. dir: dir isR
  5524. /|773: O: O1546 (predict-no)
  5525. I see 1 and I'm going to do: predict-no
  5526. ENV: Agent did: predict-no for direction R in state State-B
  5527. In State-B moving R
  5528. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5529. predict error 0
  5530. dir: dir isL
  5531. \-/|774: O: O1547 (predict-yes)
  5532. I see 1 and I'm going to do: predict-yes
  5533. ENV: Agent did: predict-yes for direction L in state State-B
  5534. In State-B moving L
  5535. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5536. predict error 0
  5537. dir: dir isR
  5538. \-/775: O: O1549 (predict-yes)
  5539. I see 1 and I'm going to do: predict-yes
  5540. ENV: Agent did: predict-yes for direction R in state State-A
  5541. In State-A moving R
  5542. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5543. predict error 0
  5544. dir: dir isR
  5545. |\-776: O: O1552 (predict-no)
  5546. I see 1 and I'm going to do: predict-no
  5547. ENV: Agent did: predict-no for direction R in state State-B
  5548. In State-B moving R
  5549. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5550. predict error 0
  5551. dir: dir isL
  5552. /|\777: O: O1553 (predict-yes)
  5553. I see 1 and I'm going to do: predict-yes
  5554. ENV: Agent did: predict-yes for direction L in state State-B
  5555. In State-B moving L
  5556. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5557. predict error 0
  5558. dir: dir isU
  5559. -/778: O: O1556 (predict-no)
  5560. I see 1 and I'm going to do: predict-no
  5561. ENV: Agent did: predict-no for direction U in state State-A
  5562. In State-A moving U
  5563. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5564. predict error 0
  5565. dir: dir isR
  5566. |\-779: O: O1557 (predict-yes)
  5567. I see 1 and I'm going to do: predict-yes
  5568. ENV: Agent did: predict-yes for direction R in state State-A
  5569. In State-A moving R
  5570. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5571. predict error 0
  5572. dir: dir isL
  5573. /|\780: O: O1559 (predict-yes)
  5574. I see 1 and I'm going to do: predict-yes
  5575. ENV: Agent did: predict-yes for direction L in state State-B
  5576. In State-B moving L
  5577. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5578. predict error 0
  5579. dir: dir isL
  5580. -/|781: O: O1562 (predict-no)
  5581. I see 1 and I'm going to do: predict-no
  5582. ENV: Agent did: predict-no for direction L in state State-A
  5583. In State-A moving L
  5584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5585. predict error 0
  5586. dir: dir isR
  5587. \782: O: O1563 (predict-yes)
  5588. I see 1 and I'm going to do: predict-yes
  5589. ENV: Agent did: predict-yes for direction R in state State-A
  5590. In State-A moving R
  5591. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5592. predict error 0
  5593. dir: dir isL
  5594. -/783: O: O1565 (predict-yes)
  5595. I see 1 and I'm going to do: predict-yes
  5596. ENV: Agent did: predict-yes for direction L in state State-B
  5597. In State-B moving L
  5598. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5599. predict error 0
  5600. dir: dir isU
  5601. |\-784: O: O1568 (predict-no)
  5602. I see 1 and I'm going to do: predict-no
  5603. ENV: Agent did: predict-no for direction U in state State-A
  5604. In State-A moving U
  5605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5606. predict error 0
  5607. dir: dir isR
  5608. /|785: O: O1569 (predict-yes)
  5609. I see 1 and I'm going to do: predict-yes
  5610. ENV: Agent did: predict-yes for direction R in state State-A
  5611. In State-A moving R
  5612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5613. predict error 0
  5614. dir: dir isR
  5615. \786: O: O1572 (predict-no)
  5616. I see 1 and I'm going to do: predict-no
  5617. ENV: Agent did: predict-no for direction R in state State-B
  5618. In State-B moving R
  5619. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5620. predict error 0
  5621. dir: dir isL
  5622. -/787: O: O1573 (predict-yes)
  5623. I see 1 and I'm going to do: predict-yes
  5624. ENV: Agent did: predict-yes for direction L in state State-B
  5625. In State-B moving L
  5626. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5627. predict error 0
  5628. dir: dir isU
  5629. |\-788: O: O1576 (predict-no)
  5630. I see 1 and I'm going to do: predict-no
  5631. ENV: Agent did: predict-no for direction U in state State-A
  5632. In State-A moving U
  5633. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5634. predict error 0
  5635. dir: dir isL
  5636. /|\789: O: O1578 (predict-no)
  5637. I see 1 and I'm going to do: predict-no
  5638. ENV: Agent did: predict-no for direction L in state State-A
  5639. In State-A moving L
  5640. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5641. predict error 0
  5642. dir: dir isL
  5643. -/790: O: O1580 (predict-no)
  5644. I see 1 and I'm going to do: predict-no
  5645. ENV: Agent did: predict-no for direction L in state State-A
  5646. In State-A moving L
  5647. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5648. predict error 0
  5649. dir: dir isL
  5650. |\-791: O: O1582 (predict-no)
  5651. I see 1 and I'm going to do: predict-no
  5652. ENV: Agent did: predict-no for direction L in state State-A
  5653. In State-A moving L
  5654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5655. predict error 0
  5656. dir: dir isU
  5657. /792: O: O1584 (predict-no)
  5658. I see 1 and I'm going to do: predict-no
  5659. ENV: Agent did: predict-no for direction U in state State-A
  5660. In State-A moving U
  5661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5662. predict error 0
  5663. dir: dir isR
  5664. |\-793: O: O1585 (predict-yes)
  5665. I see 1 and I'm going to do: predict-yes
  5666. ENV: Agent did: predict-yes for direction R in state State-A
  5667. In State-A moving R
  5668. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5669. predict error 0
  5670. dir: dir isU
  5671. /|794: O: O1588 (predict-no)
  5672. I see 1 and I'm going to do: predict-no
  5673. ENV: Agent did: predict-no for direction U in state State-B
  5674. In State-B moving U
  5675. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5676. predict error 0
  5677. dir: dir isU
  5678. \-/795: O: O1590 (predict-no)
  5679. I see 1 and I'm going to do: predict-no
  5680. ENV: Agent did: predict-no for direction U in state State-B
  5681. In State-B moving U
  5682. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5683. predict error 0
  5684. dir: dir isU
  5685. |\-796: O: O1592 (predict-no)
  5686. I see 1 and I'm going to do: predict-no
  5687. ENV: Agent did: predict-no for direction U in state State-B
  5688. In State-B moving U
  5689. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5690. predict error 0
  5691. dir: dir isU
  5692. /|\797: O: O1594 (predict-no)
  5693. I see 1 and I'm going to do: predict-no
  5694. ENV: Agent did: predict-no for direction U in state State-B
  5695. In State-B moving U
  5696. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5697. predict error 0
  5698. dir: dir isU
  5699. -798: O: O1596 (predict-no)
  5700. I see 1 and I'm going to do: predict-no
  5701. ENV: Agent did: predict-no for direction U in state State-B
  5702. In State-B moving U
  5703. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5704. predict error 0
  5705. dir: dir isU
  5706. /|\799: O: O1598 (predict-no)
  5707. I see 1 and I'm going to do: predict-no
  5708. ENV: Agent did: predict-no for direction U in state State-B
  5709. In State-B moving U
  5710. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5711. predict error 0
  5712. dir: dir isU
  5713. -/|800: O: O1600 (predict-no)
  5714. I see 1 and I'm going to do: predict-no
  5715. ENV: Agent did: predict-no for direction U in state State-B
  5716. In State-B moving U
  5717. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5718. predict error 0
  5719. dir: dir isL
  5720. \-/801: O: O1601 (predict-yes)
  5721. I see 1 and I'm going to do: predict-yes
  5722. ENV: Agent did: predict-yes for direction L in state State-B
  5723. In State-B moving L
  5724. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5725. predict error 0
  5726. dir: dir isR
  5727. |802: O: O1603 (predict-yes)
  5728. I see 1 and I'm going to do: predict-yes
  5729. ENV: Agent did: predict-yes for direction R in state State-A
  5730. In State-A moving R
  5731. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5732. predict error 0
  5733. dir: dir isR
  5734. \-/803: O: O1606 (predict-no)
  5735. I see 1 and I'm going to do: predict-no
  5736. ENV: Agent did: predict-no for direction R in state State-B
  5737. In State-B moving R
  5738. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5739. predict error 0
  5740. dir: dir isU
  5741. |\-804: O: O1608 (predict-no)
  5742. I see 1 and I'm going to do: predict-no
  5743. ENV: Agent did: predict-no for direction U in state State-B
  5744. In State-B moving U
  5745. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5746. predict error 0
  5747. dir: dir isU
  5748. /|805: O: O1610 (predict-no)
  5749. I see 1 and I'm going to do: predict-no
  5750. ENV: Agent did: predict-no for direction U in state State-B
  5751. In State-B moving U
  5752. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5753. predict error 0
  5754. dir: dir isU
  5755. \-/806: O: O1612 (predict-no)
  5756. I see 1 and I'm going to do: predict-no
  5757. ENV: Agent did: predict-no for direction U in state State-B
  5758. In State-B moving U
  5759. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5760. predict error 0
  5761. dir: dir isU
  5762. |\-807: O: O1614 (predict-no)
  5763. I see 1 and I'm going to do: predict-no
  5764. ENV: Agent did: predict-no for direction U in state State-B
  5765. In State-B moving U
  5766. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5767. predict error 0
  5768. dir: dir isR
  5769. /|\808: O: O1616 (predict-no)
  5770. I see 1 and I'm going to do: predict-no
  5771. ENV: Agent did: predict-no for direction R in state State-B
  5772. In State-B moving R
  5773. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5774. predict error 0
  5775. dir: dir isU
  5776. -/|809: O: O1618 (predict-no)
  5777. I see 1 and I'm going to do: predict-no
  5778. ENV: Agent did: predict-no for direction U in state State-B
  5779. In State-B moving U
  5780. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5781. predict error 0
  5782. dir: dir isR
  5783. \-/810: O: O1620 (predict-no)
  5784. I see 1 and I'm going to do: predict-no
  5785. ENV: Agent did: predict-no for direction R in state State-B
  5786. In State-B moving R
  5787. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5788. predict error 0
  5789. dir: dir isR
  5790. |\-811: O: O1622 (predict-no)
  5791. I see 1 and I'm going to do: predict-no
  5792. ENV: Agent did: predict-no for direction R in state State-B
  5793. In State-B moving R
  5794. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5795. predict error 0
  5796. dir: dir isR
  5797. /812: O: O1624 (predict-no)
  5798. I see 1 and I'm going to do: predict-no
  5799. ENV: Agent did: predict-no for direction R in state State-B
  5800. In State-B moving R
  5801. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5802. predict error 0
  5803. dir: dir isU
  5804. |\-813: O: O1626 (predict-no)
  5805. I see 1 and I'm going to do: predict-no
  5806. ENV: Agent did: predict-no for direction U in state State-B
  5807. In State-B moving U
  5808. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5809. predict error 0
  5810. dir: dir isR
  5811. /|\814: O: O1628 (predict-no)
  5812. I see 1 and I'm going to do: predict-no
  5813. ENV: Agent did: predict-no for direction R in state State-B
  5814. In State-B moving R
  5815. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5816. predict error 0
  5817. dir: dir isL
  5818. -/|815: O: O1629 (predict-yes)
  5819. I see 1 and I'm going to do: predict-yes
  5820. ENV: Agent did: predict-yes for direction L in state State-B
  5821. In State-B moving L
  5822. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5823. predict error 0
  5824. dir: dir isL
  5825. \-/816: O: O1632 (predict-no)
  5826. I see 1 and I'm going to do: predict-no
  5827. ENV: Agent did: predict-no for direction L in state State-A
  5828. In State-A moving L
  5829. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5830. predict error 0
  5831. dir: dir isU
  5832. |\817: O: O1634 (predict-no)
  5833. I see 1 and I'm going to do: predict-no
  5834. ENV: Agent did: predict-no for direction U in state State-A
  5835. In State-A moving U
  5836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5837. predict error 0
  5838. dir: dir isR
  5839. -/|818: O: O1635 (predict-yes)
  5840. I see 1 and I'm going to do: predict-yes
  5841. ENV: Agent did: predict-yes for direction R in state State-A
  5842. In State-A moving R
  5843. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5844. predict error 0
  5845. dir: dir isU
  5846. \-/819: O: O1638 (predict-no)
  5847. I see 1 and I'm going to do: predict-no
  5848. ENV: Agent did: predict-no for direction U in state State-B
  5849. In State-B moving U
  5850. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5851. predict error 0
  5852. dir: dir isL
  5853. |\-820: O: O1639 (predict-yes)
  5854. I see 1 and I'm going to do: predict-yes
  5855. ENV: Agent did: predict-yes for direction L in state State-B
  5856. In State-B moving L
  5857. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5858. predict error 0
  5859. dir: dir isR
  5860. /|\821: O: O1641 (predict-yes)
  5861. I see 1 and I'm going to do: predict-yes
  5862. ENV: Agent did: predict-yes for direction R in state State-A
  5863. In State-A moving R
  5864. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5865. predict error 0
  5866. dir: dir isU
  5867. -822: O: O1644 (predict-no)
  5868. I see 1 and I'm going to do: predict-no
  5869. ENV: Agent did: predict-no for direction U in state State-B
  5870. In State-B moving U
  5871. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5872. predict error 0
  5873. dir: dir isL
  5874. /|\823: O: O1645 (predict-yes)
  5875. I see 1 and I'm going to do: predict-yes
  5876. ENV: Agent did: predict-yes for direction L in state State-B
  5877. In State-B moving L
  5878. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5879. predict error 0
  5880. dir: dir isL
  5881. -824: O: O1648 (predict-no)
  5882. I see 1 and I'm going to do: predict-no
  5883. ENV: Agent did: predict-no for direction L in state State-A
  5884. In State-A moving L
  5885. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5886. predict error 0
  5887. dir: dir isR
  5888. /|\825: O: O1649 (predict-yes)
  5889. I see 1 and I'm going to do: predict-yes
  5890. ENV: Agent did: predict-yes for direction R in state State-A
  5891. In State-A moving R
  5892. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5893. predict error 0
  5894. dir: dir isL
  5895. -/|826: O: O1651 (predict-yes)
  5896. I see 1 and I'm going to do: predict-yes
  5897. ENV: Agent did: predict-yes for direction L in state State-B
  5898. In State-B moving L
  5899. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5900. predict error 0
  5901. dir: dir isL
  5902. \-/827: O: O1654 (predict-no)
  5903. I see 1 and I'm going to do: predict-no
  5904. ENV: Agent did: predict-no for direction L in state State-A
  5905. In State-A moving L
  5906. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5907. predict error 0
  5908. dir: dir isL
  5909. |\-828: O: O1656 (predict-no)
  5910. I see 1 and I'm going to do: predict-no
  5911. ENV: Agent did: predict-no for direction L in state State-A
  5912. In State-A moving L
  5913. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5914. predict error 0
  5915. dir: dir isR
  5916. /|\829: O: O1657 (predict-yes)
  5917. I see 1 and I'm going to do: predict-yes
  5918. ENV: Agent did: predict-yes for direction R in state State-A
  5919. In State-A moving R
  5920. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5921. predict error 0
  5922. dir: dir isR
  5923. -/|830: O: O1660 (predict-no)
  5924. I see 1 and I'm going to do: predict-no
  5925. ENV: Agent did: predict-no for direction R in state State-B
  5926. In State-B moving R
  5927. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5928. predict error 0
  5929. dir: dir isL
  5930. \-/831: O: O1661 (predict-yes)
  5931. I see 1 and I'm going to do: predict-yes
  5932. ENV: Agent did: predict-yes for direction L in state State-B
  5933. In State-B moving L
  5934. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5935. predict error 0
  5936. dir: dir isL
  5937. |832: O: O1664 (predict-no)
  5938. I see 1 and I'm going to do: predict-no
  5939. ENV: Agent did: predict-no for direction L in state State-A
  5940. In State-A moving L
  5941. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5942. predict error 0
  5943. dir: dir isU
  5944. \-/833: O: O1666 (predict-no)
  5945. I see 1 and I'm going to do: predict-no
  5946. ENV: Agent did: predict-no for direction U in state State-A
  5947. In State-A moving U
  5948. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5949. predict error 0
  5950. dir: dir isR
  5951. |\-834: O: O1667 (predict-yes)
  5952. I see 1 and I'm going to do: predict-yes
  5953. ENV: Agent did: predict-yes for direction R in state State-A
  5954. In State-A moving R
  5955. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5956. predict error 0
  5957. dir: dir isL
  5958. /|\835: O: O1669 (predict-yes)
  5959. I see 1 and I'm going to do: predict-yes
  5960. ENV: Agent did: predict-yes for direction L in state State-B
  5961. In State-B moving L
  5962. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5963. predict error 0
  5964. dir: dir isU
  5965. -/836: O: O1672 (predict-no)
  5966. I see 1 and I'm going to do: predict-no
  5967. ENV: Agent did: predict-no for direction U in state State-A
  5968. In State-A moving U
  5969. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5970. predict error 0
  5971. dir: dir isL
  5972. |\-837: O: O1674 (predict-no)
  5973. I see 1 and I'm going to do: predict-no
  5974. ENV: Agent did: predict-no for direction L in state State-A
  5975. In State-A moving L
  5976. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5977. predict error 0
  5978. dir: dir isR
  5979. /|\838: O: O1675 (predict-yes)
  5980. I see 1 and I'm going to do: predict-yes
  5981. ENV: Agent did: predict-yes for direction R in state State-A
  5982. In State-A moving R
  5983. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5984. predict error 0
  5985. dir: dir isU
  5986. -/|839: O: O1678 (predict-no)
  5987. I see 1 and I'm going to do: predict-no
  5988. ENV: Agent did: predict-no for direction U in state State-B
  5989. In State-B moving U
  5990. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5991. predict error 0
  5992. dir: dir isU
  5993. \-/840: O: O1680 (predict-no)
  5994. I see 1 and I'm going to do: predict-no
  5995. ENV: Agent did: predict-no for direction U in state State-B
  5996. In State-B moving U
  5997. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5998. predict error 0
  5999. dir: dir isL
  6000. |\-841: O: O1681 (predict-yes)
  6001. I see 1 and I'm going to do: predict-yes
  6002. ENV: Agent did: predict-yes for direction L in state State-B
  6003. In State-B moving L
  6004. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6005. predict error 0
  6006. dir: dir isU
  6007. /842: O: O1684 (predict-no)
  6008. I see 1 and I'm going to do: predict-no
  6009. ENV: Agent did: predict-no for direction U in state State-A
  6010. In State-A moving U
  6011. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6012. predict error 0
  6013. dir: dir isR
  6014. |\843: O: O1685 (predict-yes)
  6015. I see 1 and I'm going to do: predict-yes
  6016. ENV: Agent did: predict-yes for direction R in state State-A
  6017. In State-A moving R
  6018. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6019. predict error 0
  6020. dir: dir isU
  6021. -/844: O: O1688 (predict-no)
  6022. I see 1 and I'm going to do: predict-no
  6023. ENV: Agent did: predict-no for direction U in state State-B
  6024. In State-B moving U
  6025. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6026. predict error 0
  6027. dir: dir isU
  6028. |\-845: O: O1690 (predict-no)
  6029. I see 1 and I'm going to do: predict-no
  6030. ENV: Agent did: predict-no for direction U in state State-B
  6031. In State-B moving U
  6032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6033. predict error 0
  6034. dir: dir isR
  6035. /|\846: O: O1692 (predict-no)
  6036. I see 1 and I'm going to do: predict-no
  6037. ENV: Agent did: predict-no for direction R in state State-B
  6038. In State-B moving R
  6039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6040. predict error 0
  6041. dir: dir isU
  6042. -/|847: O: O1694 (predict-no)
  6043. I see 1 and I'm going to do: predict-no
  6044. ENV: Agent did: predict-no for direction U in state State-B
  6045. In State-B moving U
  6046. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6047. predict error 0
  6048. dir: dir isR
  6049. \-/848: O: O1696 (predict-no)
  6050. I see 1 and I'm going to do: predict-no
  6051. ENV: Agent did: predict-no for direction R in state State-B
  6052. In State-B moving R
  6053. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6054. predict error 0
  6055. dir: dir isU
  6056. |849: O: O1698 (predict-no)
  6057. I see 1 and I'm going to do: predict-no
  6058. ENV: Agent did: predict-no for direction U in state State-B
  6059. In State-B moving U
  6060. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6061. predict error 0
  6062. dir: dir isU
  6063. \-/850: O: O1700 (predict-no)
  6064. I see 1 and I'm going to do: predict-no
  6065. ENV: Agent did: predict-no for direction U in state State-B
  6066. In State-B moving U
  6067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6068. predict error 0
  6069. dir: dir isU
  6070. |\-851: O: O1702 (predict-no)
  6071. I see 1 and I'm going to do: predict-no
  6072. ENV: Agent did: predict-no for direction U in state State-B
  6073. In State-B moving U
  6074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6075. predict error 0
  6076. dir: dir isU
  6077. /852: O: O1704 (predict-no)
  6078. I see 1 and I'm going to do: predict-no
  6079. ENV: Agent did: predict-no for direction U in state State-B
  6080. In State-B moving U
  6081. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6082. predict error 0
  6083. dir: dir isU
  6084. |\-853: O: O1706 (predict-no)
  6085. I see 1 and I'm going to do: predict-no
  6086. ENV: Agent did: predict-no for direction U in state State-B
  6087. In State-B moving U
  6088. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6089. predict error 0
  6090. dir: dir isL
  6091. /|\854: O: O1707 (predict-yes)
  6092. I see 1 and I'm going to do: predict-yes
  6093. ENV: Agent did: predict-yes for direction L in state State-B
  6094. In State-B moving L
  6095. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6096. predict error 0
  6097. dir: dir isL
  6098. -/|855: O: O1710 (predict-no)
  6099. I see 1 and I'm going to do: predict-no
  6100. ENV: Agent did: predict-no for direction L in state State-A
  6101. In State-A moving L
  6102. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6103. predict error 0
  6104. dir: dir isU
  6105. \-856: O: O1712 (predict-no)
  6106. I see 1 and I'm going to do: predict-no
  6107. ENV: Agent did: predict-no for direction U in state State-A
  6108. In State-A moving U
  6109. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6110. predict error 0
  6111. dir: dir isU
  6112. /|\857: O: O1714 (predict-no)
  6113. I see 1 and I'm going to do: predict-no
  6114. ENV: Agent did: predict-no for direction U in state State-A
  6115. In State-A moving U
  6116. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6117. predict error 0
  6118. dir: dir isR
  6119. -/|858: O: O1715 (predict-yes)
  6120. I see 1 and I'm going to do: predict-yes
  6121. ENV: Agent did: predict-yes for direction R in state State-A
  6122. In State-A moving R
  6123. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6124. predict error 0
  6125. dir: dir isR
  6126. \-/859: O: O1718 (predict-no)
  6127. I see 1 and I'm going to do: predict-no
  6128. ENV: Agent did: predict-no for direction R in state State-B
  6129. In State-B moving R
  6130. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6131. predict error 0
  6132. dir: dir isR
  6133. |860: O: O1720 (predict-no)
  6134. I see 1 and I'm going to do: predict-no
  6135. ENV: Agent did: predict-no for direction R in state State-B
  6136. In State-B moving R
  6137. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6138. predict error 0
  6139. dir: dir isU
  6140. \-861: O: O1722 (predict-no)
  6141. I see 1 and I'm going to do: predict-no
  6142. ENV: Agent did: predict-no for direction U in state State-B
  6143. In State-B moving U
  6144. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6145. predict error 0
  6146. dir: dir isU
  6147. /862: O: O1724 (predict-no)
  6148. I see 1 and I'm going to do: predict-no
  6149. ENV: Agent did: predict-no for direction U in state State-B
  6150. In State-B moving U
  6151. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6152. predict error 0
  6153. dir: dir isR
  6154. |\-/863: O: O1726 (predict-no)
  6155. I see 1 and I'm going to do: predict-no
  6156. ENV: Agent did: predict-no for direction R in state State-B
  6157. In State-B moving R
  6158. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6159. predict error 0
  6160. dir: dir isL
  6161. |\-864: O: O1727 (predict-yes)
  6162. I see 1 and I'm going to do: predict-yes
  6163. ENV: Agent did: predict-yes for direction L in state State-B
  6164. In State-B moving L
  6165. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6166. predict error 0
  6167. dir: dir isU
  6168. /865: O: O1730 (predict-no)
  6169. I see 1 and I'm going to do: predict-no
  6170. ENV: Agent did: predict-no for direction U in state State-A
  6171. In State-A moving U
  6172. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6173. predict error 0
  6174. dir: dir isR
  6175. |\-866: O: O1731 (predict-yes)
  6176. I see 1 and I'm going to do: predict-yes
  6177. ENV: Agent did: predict-yes for direction R in state State-A
  6178. In State-A moving R
  6179. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6180. predict error 0
  6181. dir: dir isL
  6182. /|\867: O: O1733 (predict-yes)
  6183. I see 1 and I'm going to do: predict-yes
  6184. ENV: Agent did: predict-yes for direction L in state State-B
  6185. In State-B moving L
  6186. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6187. predict error 0
  6188. dir: dir isL
  6189. -/|868: O: O1736 (predict-no)
  6190. I see 1 and I'm going to do: predict-no
  6191. ENV: Agent did: predict-no for direction L in state State-A
  6192. In State-A moving L
  6193. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6194. predict error 0
  6195. dir: dir isU
  6196. \-/869: O: O1738 (predict-no)
  6197. I see 1 and I'm going to do: predict-no
  6198. ENV: Agent did: predict-no for direction U in state State-A
  6199. In State-A moving U
  6200. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6201. predict error 0
  6202. dir: dir isL
  6203. |\-870: O: O1740 (predict-no)
  6204. I see 1 and I'm going to do: predict-no
  6205. ENV: Agent did: predict-no for direction L in state State-A
  6206. In State-A moving L
  6207. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6208. predict error 0
  6209. dir: dir isL
  6210. /|\-871: O: O1742 (predict-no)
  6211. I see 1 and I'm going to do: predict-no
  6212. ENV: Agent did: predict-no for direction L in state State-A
  6213. In State-A moving L
  6214. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6215. predict error 0
  6216. dir: dir isL
  6217. /872: O: O1744 (predict-no)
  6218. I see 1 and I'm going to do: predict-no
  6219. ENV: Agent did: predict-no for direction L in state State-A
  6220. In State-A moving L
  6221. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6222. predict error 0
  6223. dir: dir isU
  6224. |\-873: O: O1746 (predict-no)
  6225. I see 1 and I'm going to do: predict-no
  6226. ENV: Agent did: predict-no for direction U in state State-A
  6227. In State-A moving U
  6228. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6229. predict error 0
  6230. dir: dir isU
  6231. /|\874: O: O1748 (predict-no)
  6232. I see 1 and I'm going to do: predict-no
  6233. ENV: Agent did: predict-no for direction U in state State-A
  6234. In State-A moving U
  6235. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6236. predict error 0
  6237. dir: dir isU
  6238. -/875: O: O1750 (predict-no)
  6239. I see 1 and I'm going to do: predict-no
  6240. ENV: Agent did: predict-no for direction U in state State-A
  6241. In State-A moving U
  6242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6243. predict error 0
  6244. dir: dir isR
  6245. |\876: O: O1751 (predict-yes)
  6246. I see 1 and I'm going to do: predict-yes
  6247. ENV: Agent did: predict-yes for direction R in state State-A
  6248. In State-A moving R
  6249. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6250. predict error 0
  6251. dir: dir isR
  6252. -/|877: O: O1754 (predict-no)
  6253. I see 1 and I'm going to do: predict-no
  6254. ENV: Agent did: predict-no for direction R in state State-B
  6255. In State-B moving R
  6256. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6257. predict error 0
  6258. dir: dir isR
  6259. \878: O: O1756 (predict-no)
  6260. I see 1 and I'm going to do: predict-no
  6261. ENV: Agent did: predict-no for direction R in state State-B
  6262. In State-B moving R
  6263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6264. predict error 0
  6265. dir: dir isR
  6266. -/|879: O: O1758 (predict-no)
  6267. I see 1 and I'm going to do: predict-no
  6268. ENV: Agent did: predict-no for direction R in state State-B
  6269. In State-B moving R
  6270. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6271. predict error 0
  6272. dir: dir isR
  6273. \-/880: O: O1760 (predict-no)
  6274. I see 1 and I'm going to do: predict-no
  6275. ENV: Agent did: predict-no for direction R in state State-B
  6276. In State-B moving R
  6277. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6278. predict error 0
  6279. dir: dir isU
  6280. |\-881: O: O1762 (predict-no)
  6281. I see 1 and I'm going to do: predict-no
  6282. ENV: Agent did: predict-no for direction U in state State-B
  6283. In State-B moving U
  6284. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6285. predict error 0
  6286. dir: dir isU
  6287. /882: O: O1764 (predict-no)
  6288. I see 1 and I'm going to do: predict-no
  6289. ENV: Agent did: predict-no for direction U in state State-B
  6290. In State-B moving U
  6291. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6292. predict error 0
  6293. dir: dir isR
  6294. |\-883: O: O1766 (predict-no)
  6295. I see 1 and I'm going to do: predict-no
  6296. ENV: Agent did: predict-no for direction R in state State-B
  6297. In State-B moving R
  6298. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6299. predict error 0
  6300. dir: dir isR
  6301. /|\884: O: O1768 (predict-no)
  6302. I see 1 and I'm going to do: predict-no
  6303. ENV: Agent did: predict-no for direction R in state State-B
  6304. In State-B moving R
  6305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6306. predict error 0
  6307. dir: dir isL
  6308. -/|885: O: O1769 (predict-yes)
  6309. I see 1 and I'm going to do: predict-yes
  6310. ENV: Agent did: predict-yes for direction L in state State-B
  6311. In State-B moving L
  6312. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6313. predict error 0
  6314. dir: dir isL
  6315. \-/886: O: O1772 (predict-no)
  6316. I see 1 and I'm going to do: predict-no
  6317. ENV: Agent did: predict-no for direction L in state State-A
  6318. In State-A moving L
  6319. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6320. predict error 0
  6321. dir: dir isR
  6322. |\887: O: O1773 (predict-yes)
  6323. I see 1 and I'm going to do: predict-yes
  6324. ENV: Agent did: predict-yes for direction R in state State-A
  6325. In State-A moving R
  6326. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6327. predict error 0
  6328. dir: dir isR
  6329. -/|888: O: O1776 (predict-no)
  6330. I see 1 and I'm going to do: predict-no
  6331. ENV: Agent did: predict-no for direction R in state State-B
  6332. In State-B moving R
  6333. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6334. predict error 0
  6335. dir: dir isR
  6336. \-/889: O: O1778 (predict-no)
  6337. I see 1 and I'm going to do: predict-no
  6338. ENV: Agent did: predict-no for direction R in state State-B
  6339. In State-B moving R
  6340. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6341. predict error 0
  6342. dir: dir isU
  6343. |\-890: O: O1780 (predict-no)
  6344. I see 1 and I'm going to do: predict-no
  6345. ENV: Agent did: predict-no for direction U in state State-B
  6346. In State-B moving U
  6347. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6348. predict error 0
  6349. dir: dir isL
  6350. /|891: O: O1781 (predict-yes)
  6351. I see 1 and I'm going to do: predict-yes
  6352. ENV: Agent did: predict-yes for direction L in state State-B
  6353. In State-B moving L
  6354. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6355. predict error 0
  6356. dir: dir isR
  6357. \892: O: O1783 (predict-yes)
  6358. I see 1 and I'm going to do: predict-yes
  6359. ENV: Agent did: predict-yes for direction R in state State-A
  6360. In State-A moving R
  6361. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6362. predict error 0
  6363. dir: dir isU
  6364. -/|893: O: O1786 (predict-no)
  6365. I see 1 and I'm going to do: predict-no
  6366. ENV: Agent did: predict-no for direction U in state State-B
  6367. In State-B moving U
  6368. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6369. predict error 0
  6370. dir: dir isU
  6371. \894: O: O1788 (predict-no)
  6372. I see 1 and I'm going to do: predict-no
  6373. ENV: Agent did: predict-no for direction U in state State-B
  6374. In State-B moving U
  6375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6376. predict error 0
  6377. dir: dir isR
  6378. -/|895: O: O1790 (predict-no)
  6379. I see 1 and I'm going to do: predict-no
  6380. ENV: Agent did: predict-no for direction R in state State-B
  6381. In State-B moving R
  6382. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6383. predict error 0
  6384. dir: dir isR
  6385. \-/896: O: O1792 (predict-no)
  6386. I see 1 and I'm going to do: predict-no
  6387. ENV: Agent did: predict-no for direction R in state State-B
  6388. In State-B moving R
  6389. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6390. predict error 0
  6391. dir: dir isR
  6392. |\-897: O: O1794 (predict-no)
  6393. I see 1 and I'm going to do: predict-no
  6394. ENV: Agent did: predict-no for direction R in state State-B
  6395. In State-B moving R
  6396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6397. predict error 0
  6398. dir: dir isU
  6399. /|\898: O: O1796 (predict-no)
  6400. I see 1 and I'm going to do: predict-no
  6401. ENV: Agent did: predict-no for direction U in state State-B
  6402. In State-B moving U
  6403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6404. predict error 0
  6405. dir: dir isU
  6406. -/|899: O: O1798 (predict-no)
  6407. I see 1 and I'm going to do: predict-no
  6408. ENV: Agent did: predict-no for direction U in state State-B
  6409. In State-B moving U
  6410. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6411. predict error 0
  6412. dir: dir isU
  6413. \-/900: O: O1800 (predict-no)
  6414. I see 1 and I'm going to do: predict-no
  6415. ENV: Agent did: predict-no for direction U in state State-B
  6416. In State-B moving U
  6417. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6418. predict error 0
  6419. dir: dir isU
  6420. |\-901: O: O1802 (predict-no)
  6421. I see 1 and I'm going to do: predict-no
  6422. ENV: Agent did: predict-no for direction U in state State-B
  6423. In State-B moving U
  6424. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6425. predict error 0
  6426. dir: dir isU
  6427. /902: O: O1804 (predict-no)
  6428. I see 1 and I'm going to do: predict-no
  6429. ENV: Agent did: predict-no for direction U in state State-B
  6430. In State-B moving U
  6431. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6432. predict error 0
  6433. dir: dir isU
  6434. |\903: O: O1806 (predict-no)
  6435. I see 1 and I'm going to do: predict-no
  6436. ENV: Agent did: predict-no for direction U in state State-B
  6437. In State-B moving U
  6438. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6439. predict error 0
  6440. dir: dir isR
  6441. -/904: O: O1808 (predict-no)
  6442. I see 1 and I'm going to do: predict-no
  6443. ENV: Agent did: predict-no for direction R in state State-B
  6444. In State-B moving R
  6445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6446. predict error 0
  6447. dir: dir isR
  6448. |\-905: O: O1810 (predict-no)
  6449. I see 1 and I'm going to do: predict-no
  6450. ENV: Agent did: predict-no for direction R in state State-B
  6451. In State-B moving R
  6452. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6453. predict error 0
  6454. dir: dir isU
  6455. /|\906: O: O1812 (predict-no)
  6456. I see 1 and I'm going to do: predict-no
  6457. ENV: Agent did: predict-no for direction U in state State-B
  6458. In State-B moving U
  6459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6460. predict error 0
  6461. dir: dir isR
  6462. -/|907: O: O1814 (predict-no)
  6463. I see 1 and I'm going to do: predict-no
  6464. ENV: Agent did: predict-no for direction R in state State-B
  6465. In State-B moving R
  6466. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6467. predict error 0
  6468. dir: dir isU
  6469. \-/908: O: O1816 (predict-no)
  6470. I see 1 and I'm going to do: predict-no
  6471. ENV: Agent did: predict-no for direction U in state State-B
  6472. In State-B moving U
  6473. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6474. predict error 0
  6475. dir: dir isR
  6476. |\909: O: O1818 (predict-no)
  6477. I see 1 and I'm going to do: predict-no
  6478. ENV: Agent did: predict-no for direction R in state State-B
  6479. In State-B moving R
  6480. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6481. predict error 0
  6482. dir: dir isR
  6483. -/|910: O: O1820 (predict-no)
  6484. I see 1 and I'm going to do: predict-no
  6485. ENV: Agent did: predict-no for direction R in state State-B
  6486. In State-B moving R
  6487. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6488. predict error 0
  6489. dir: dir isR
  6490. \-/911: O: O1822 (predict-no)
  6491. I see 1 and I'm going to do: predict-no
  6492. ENV: Agent did: predict-no for direction R in state State-B
  6493. In State-B moving R
  6494. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6495. predict error 0
  6496. dir: dir isL
  6497. |912: O: O1823 (predict-yes)
  6498. I see 1 and I'm going to do: predict-yes
  6499. ENV: Agent did: predict-yes for direction L in state State-B
  6500. In State-B moving L
  6501. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6502. predict error 0
  6503. dir: dir isR
  6504. \913: O: O1825 (predict-yes)
  6505. I see 1 and I'm going to do: predict-yes
  6506. ENV: Agent did: predict-yes for direction R in state State-A
  6507. In State-A moving R
  6508. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6509. predict error 0
  6510. dir: dir isR
  6511. -/|914: O: O1828 (predict-no)
  6512. I see 1 and I'm going to do: predict-no
  6513. ENV: Agent did: predict-no for direction R in state State-B
  6514. In State-B moving R
  6515. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6516. predict error 0
  6517. dir: dir isL
  6518. \-/915: O: O1829 (predict-yes)
  6519. I see 1 and I'm going to do: predict-yes
  6520. ENV: Agent did: predict-yes for direction L in state State-B
  6521. In State-B moving L
  6522. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6523. predict error 0
  6524. dir: dir isL
  6525. |\-916: O: O1832 (predict-no)
  6526. I see 1 and I'm going to do: predict-no
  6527. ENV: Agent did: predict-no for direction L in state State-A
  6528. In State-A moving L
  6529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6530. predict error 0
  6531. dir: dir isL
  6532. /|\917: O: O1834 (predict-no)
  6533. I see 1 and I'm going to do: predict-no
  6534. ENV: Agent did: predict-no for direction L in state State-A
  6535. In State-A moving L
  6536. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6537. predict error 0
  6538. dir: dir isU
  6539. -/918: O: O1836 (predict-no)
  6540. I see 1 and I'm going to do: predict-no
  6541. ENV: Agent did: predict-no for direction U in state State-A
  6542. In State-A moving U
  6543. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6544. predict error 0
  6545. dir: dir isR
  6546. |\-919: O: O1837 (predict-yes)
  6547. I see 1 and I'm going to do: predict-yes
  6548. ENV: Agent did: predict-yes for direction R in state State-A
  6549. In State-A moving R
  6550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6551. predict error 0
  6552. dir: dir isL
  6553. /|\920: O: O1839 (predict-yes)
  6554. I see 1 and I'm going to do: predict-yes
  6555. ENV: Agent did: predict-yes for direction L in state State-B
  6556. In State-B moving L
  6557. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6558. predict error 0
  6559. dir: dir isU
  6560. -/|921: O: O1842 (predict-no)
  6561. I see 1 and I'm going to do: predict-no
  6562. ENV: Agent did: predict-no for direction U in state State-A
  6563. In State-A moving U
  6564. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6565. predict error 0
  6566. dir: dir isL
  6567. \922: O: O1844 (predict-no)
  6568. I see 1 and I'm going to do: predict-no
  6569. ENV: Agent did: predict-no for direction L in state State-A
  6570. In State-A moving L
  6571. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6572. predict error 0
  6573. dir: dir isR
  6574. -/923: O: O1845 (predict-yes)
  6575. I see 1 and I'm going to do: predict-yes
  6576. ENV: Agent did: predict-yes for direction R in state State-A
  6577. In State-A moving R
  6578. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6579. predict error 0
  6580. dir: dir isU
  6581. |\-924: O: O1848 (predict-no)
  6582. I see 1 and I'm going to do: predict-no
  6583. ENV: Agent did: predict-no for direction U in state State-B
  6584. In State-B moving U
  6585. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6586. predict error 0
  6587. dir: dir isU
  6588. /|\925: O: O1850 (predict-no)
  6589. I see 1 and I'm going to do: predict-no
  6590. ENV: Agent did: predict-no for direction U in state State-B
  6591. In State-B moving U
  6592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6593. predict error 0
  6594. dir: dir isR
  6595. -/|926: O: O1852 (predict-no)
  6596. I see 1 and I'm going to do: predict-no
  6597. ENV: Agent did: predict-no for direction R in state State-B
  6598. In State-B moving R
  6599. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6600. predict error 0
  6601. dir: dir isU
  6602. \-/927: O: O1854 (predict-no)
  6603. I see 1 and I'm going to do: predict-no
  6604. ENV: Agent did: predict-no for direction U in state State-B
  6605. In State-B moving U
  6606. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6607. predict error 0
  6608. dir: dir isR
  6609. |\-928: O: O1856 (predict-no)
  6610. I see 1 and I'm going to do: predict-no
  6611. ENV: Agent did: predict-no for direction R in state State-B
  6612. In State-B moving R
  6613. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6614. predict error 0
  6615. dir: dir isU
  6616. /|929: O: O1858 (predict-no)
  6617. I see 1 and I'm going to do: predict-no
  6618. ENV: Agent did: predict-no for direction U in state State-B
  6619. In State-B moving U
  6620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6621. predict error 0
  6622. dir: dir isR
  6623. \-/930: O: O1860 (predict-no)
  6624. I see 1 and I'm going to do: predict-no
  6625. ENV: Agent did: predict-no for direction R in state State-B
  6626. In State-B moving R
  6627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6628. predict error 0
  6629. dir: dir isU
  6630. |\931: O: O1862 (predict-no)
  6631. I see 1 and I'm going to do: predict-no
  6632. ENV: Agent did: predict-no for direction U in state State-B
  6633. In State-B moving U
  6634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6635. predict error 0
  6636. dir: dir isU
  6637. -932: O: O1864 (predict-no)
  6638. I see 1 and I'm going to do: predict-no
  6639. ENV: Agent did: predict-no for direction U in state State-B
  6640. In State-B moving U
  6641. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6642. predict error 0
  6643. dir: dir isL
  6644. /|\933: O: O1865 (predict-yes)
  6645. I see 1 and I'm going to do: predict-yes
  6646. ENV: Agent did: predict-yes for direction L in state State-B
  6647. In State-B moving L
  6648. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6649. predict error 0
  6650. dir: dir isL
  6651. -/|934: O: O1868 (predict-no)
  6652. I see 1 and I'm going to do: predict-no
  6653. ENV: Agent did: predict-no for direction L in state State-A
  6654. In State-A moving L
  6655. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6656. predict error 0
  6657. dir: dir isU
  6658. \-/935: O: O1870 (predict-no)
  6659. I see 1 and I'm going to do: predict-no
  6660. ENV: Agent did: predict-no for direction U in state State-A
  6661. In State-A moving U
  6662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6663. predict error 0
  6664. dir: dir isL
  6665. |\936: O: O1872 (predict-no)
  6666. I see 1 and I'm going to do: predict-no
  6667. ENV: Agent did: predict-no for direction L in state State-A
  6668. In State-A moving L
  6669. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6670. predict error 0
  6671. dir: dir isL
  6672. -/|937: O: O1874 (predict-no)
  6673. I see 1 and I'm going to do: predict-no
  6674. ENV: Agent did: predict-no for direction L in state State-A
  6675. In State-A moving L
  6676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6677. predict error 0
  6678. dir: dir isL
  6679. \-/938: O: O1876 (predict-no)
  6680. I see 1 and I'm going to do: predict-no
  6681. ENV: Agent did: predict-no for direction L in state State-A
  6682. In State-A moving L
  6683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6684. predict error 0
  6685. dir: dir isR
  6686. |939: O: O1877 (predict-yes)
  6687. I see 1 and I'm going to do: predict-yes
  6688. ENV: Agent did: predict-yes for direction R in state State-A
  6689. In State-A moving R
  6690. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6691. predict error 0
  6692. dir: dir isU
  6693. \-/940: O: O1880 (predict-no)
  6694. I see 1 and I'm going to do: predict-no
  6695. ENV: Agent did: predict-no for direction U in state State-B
  6696. In State-B moving U
  6697. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6698. predict error 0
  6699. dir: dir isR
  6700. |941: O: O1882 (predict-no)
  6701. I see 1 and I'm going to do: predict-no
  6702. ENV: Agent did: predict-no for direction R in state State-B
  6703. In State-B moving R
  6704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6705. predict error 0
  6706. dir: dir isU
  6707. \942: O: O1884 (predict-no)
  6708. I see 1 and I'm going to do: predict-no
  6709. ENV: Agent did: predict-no for direction U in state State-B
  6710. In State-B moving U
  6711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6712. predict error 0
  6713. dir: dir isU
  6714. -/|943: O: O1886 (predict-no)
  6715. I see 1 and I'm going to do: predict-no
  6716. ENV: Agent did: predict-no for direction U in state State-B
  6717. In State-B moving U
  6718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6719. predict error 0
  6720. dir: dir isL
  6721. \944: O: O1887 (predict-yes)
  6722. I see 1 and I'm going to do: predict-yes
  6723. ENV: Agent did: predict-yes for direction L in state State-B
  6724. In State-B moving L
  6725. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6726. predict error 0
  6727. dir: dir isR
  6728. -/|945: O: O1889 (predict-yes)
  6729. I see 1 and I'm going to do: predict-yes
  6730. ENV: Agent did: predict-yes for direction R in state State-A
  6731. In State-A moving R
  6732. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6733. predict error 0
  6734. dir: dir isU
  6735. \-946: O: O1892 (predict-no)
  6736. I see 1 and I'm going to do: predict-no
  6737. ENV: Agent did: predict-no for direction U in state State-B
  6738. In State-B moving U
  6739. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6740. predict error 0
  6741. dir: dir isR
  6742. /|\947: O: O1894 (predict-no)
  6743. I see 1 and I'm going to do: predict-no
  6744. ENV: Agent did: predict-no for direction R in state State-B
  6745. In State-B moving R
  6746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6747. predict error 0
  6748. dir: dir isR
  6749. -/|948: O: O1896 (predict-no)
  6750. I see 1 and I'm going to do: predict-no
  6751. ENV: Agent did: predict-no for direction R in state State-B
  6752. In State-B moving R
  6753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6754. predict error 0
  6755. dir: dir isR
  6756. \-/949: O: O1898 (predict-no)
  6757. I see 1 and I'm going to do: predict-no
  6758. ENV: Agent did: predict-no for direction R in state State-B
  6759. In State-B moving R
  6760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6761. predict error 0
  6762. dir: dir isU
  6763. |\-950: O: O1900 (predict-no)
  6764. I see 1 and I'm going to do: predict-no
  6765. ENV: Agent did: predict-no for direction U in state State-B
  6766. In State-B moving U
  6767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6768. predict error 0
  6769. dir: dir isU
  6770. /|\-/|\-/--- Input Phase ---
  6771. =>WM: (13307: I2 ^dir U)
  6772. =>WM: (13306: I2 ^reward 1)
  6773. =>WM: (13305: I2 ^see 0)
  6774. =>WM: (13304: N950 ^status complete)
  6775. <=WM: (13293: I2 ^dir U)
  6776. <=WM: (13292: I2 ^reward 1)
  6777. <=WM: (13291: I2 ^see 0)
  6778. =>WM: (13308: I2 ^level-1 R0-root)
  6779. <=WM: (13294: I2 ^level-1 R0-root)
  6780. --- END Input Phase ---
  6781. --- Proposal Phase ---
  6782. --- Inner Elaboration Phase, active level 1 (S1) ---
  6783. Firing elaborate*copy-see-to-output-link
  6784. -->
  6785. (I3 ^see 0 +)
  6786. Firing elaborate*reward*based*on*reward
  6787. -->
  6788. (R954 ^value 1 +)
  6789. (R1 ^reward R954 +)
  6790. Firing propose*predict-yes
  6791. -->
  6792. (O1901 ^name predict-yes +)
  6793. (S1 ^operator O1901 +)
  6794. Firing propose*predict-no
  6795. -->
  6796. (O1902 ^name predict-no +)
  6797. (S1 ^operator O1902 +)
  6798. Firing rl*prefer*rvt*predict-no*H0*4
  6799. -->
  6800. (S1 ^operator O1900 = 1.)
  6801. Firing rl*prefer*rvt*predict-yes*H0*3
  6802. -->
  6803. (S1 ^operator O1899 = 0.)
  6804. Firing prefer*rvt*predict-yes*H0
  6805. -->
  6806. Firing prefer*rvt*predict-no*H0
  6807. -->
  6808. Firing elaborate*copy-dir-to-output-link
  6809. -->
  6810. (I3 ^dir U +)
  6811. inner elaboration loop at bottom goal.
  6812. Retracting elaborate*copy-see-to-output-link
  6813. -->
  6814. (I3 ^see 0 +)
  6815. Retracting propose*predict-no
  6816. -->
  6817. (O1900 ^name predict-no +)
  6818. (S1 ^operator O1900 +)
  6819. Retracting propose*predict-yes
  6820. -->
  6821. (O1899 ^name predict-yes +)
  6822. (S1 ^operator O1899 +)
  6823. Retracting elaborate*reward*based*on*reward
  6824. -->
  6825. (R953 ^value 1 +)
  6826. (R1 ^reward R953 +)
  6827. Retracting elaborate*copy-dir-to-output-link
  6828. -->
  6829. (I3 ^dir U +)
  6830. Retracting rl*prefer*rvt*predict-no*H0*4
  6831. -->
  6832. (S1 ^operator O1900 = 1.)
  6833. Retracting rl*prefer*rvt*predict-yes*H0*3
  6834. -->
  6835. (S1 ^operator O1899 = 0.)
  6836. =>WM: (13314: S1 ^operator O1902 +)
  6837. =>WM: (13313: S1 ^operator O1901 +)
  6838. =>WM: (13312: O1902 ^name predict-no)
  6839. =>WM: (13311: O1901 ^name predict-yes)
  6840. =>WM: (13310: R954 ^value 1)
  6841. =>WM: (13309: R1 ^reward R954)
  6842. <=WM: (13300: S1 ^operator O1899 +)
  6843. <=WM: (13301: S1 ^operator O1900 +)
  6844. <=WM: (13302: S1 ^operator O1900)
  6845. <=WM: (13295: R1 ^reward R953)
  6846. <=WM: (13298: O1900 ^name predict-no)
  6847. <=WM: (13297: O1899 ^name predict-yes)
  6848. <=WM: (13296: R953 ^value 1)
  6849. --- Inner Elaboration Phase, active level 1 (S1) ---
  6850. Firing prefer*rvt*predict-yes*H0
  6851. -->
  6852. Firing rl*prefer*rvt*predict-yes*H0*3
  6853. -->
  6854. (S1 ^operator O1901 = 0.)
  6855. Firing prefer*rvt*predict-no*H0
  6856. -->
  6857. Firing rl*prefer*rvt*predict-no*H0*4
  6858. -->
  6859. (S1 ^operator O1902 = 1.)
  6860. inner elaboration loop at bottom goal.
  6861. Retracting rl*prefer*rvt*predict-no*H0*4
  6862. -->
  6863. (S1 ^operator O1900 = 1.)
  6864. Retracting rl*prefer*rvt*predict-yes*H0*3
  6865. -->
  6866. (S1 ^operator O1899 = 0.)
  6867. --- END Proposal Phase ---
  6868. --- Decision Phase ---
  6869. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6870. =>WM: (13315: S1 ^operator O1902)
  6871. 951: O: O1902 (predict-no)
  6872. --- END Decision Phase ---
  6873. --- Application Phase ---
  6874. --- Firing Productions (PE) For State At Depth 1 ---
  6875. --- Inner Elaboration Phase, active level 1 (S1) ---
  6876. Firing apply*operator
  6877. -->
  6878. (I3 ^predict-no N951 + :O )
  6879. Firing apply*operator*complete
  6880. -->
  6881. (I3 ^predict-no N950 - :O )
  6882. inner elaboration loop at bottom goal.
  6883. --- Change Working Memory (PE) ---
  6884. =>WM: (13316: I3 ^predict-no N951)
  6885. <=WM: (13304: N950 ^status complete)
  6886. <=WM: (13303: I3 ^predict-no N950)
  6887. --- Firing Productions (IE) For State At Depth 1 ---
  6888. --- Inner Elaboration Phase, active level 1 (S1) ---
  6889. Firing monitor*world
  6890. -->
  6891. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6892. --- Change Working Memory (IE) ---
  6893. --- END Application Phase ---
  6894. --- Output Phase ---
  6895. ENV: Agent did: predict-no for direction U in state State-B
  6896. In State-B moving U
  6897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6898. predict error 0
  6899. dir: dir isL
  6900. --- END Output Phase ---
  6901. |--- Input Phase ---
  6902. =>WM: (13320: I2 ^dir L)
  6903. =>WM: (13319: I2 ^reward 1)
  6904. =>WM: (13318: I2 ^see 0)
  6905. =>WM: (13317: N951 ^status complete)
  6906. <=WM: (13307: I2 ^dir U)
  6907. <=WM: (13306: I2 ^reward 1)
  6908. <=WM: (13305: I2 ^see 0)
  6909. =>WM: (13321: I2 ^level-1 R0-root)
  6910. <=WM: (13308: I2 ^level-1 R0-root)
  6911. --- END Input Phase ---
  6912. --- Proposal Phase ---
  6913. --- Inner Elaboration Phase, active level 1 (S1) ---
  6914. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  6915. -->
  6916. (S1 ^operator O1901 = 0.6195564468661043)
  6917. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  6918. -->
  6919. (S1 ^operator O1902 = -0.2190661556260421)
  6920. Firing prefer*rvt*predict-no*H0*2*v1*H1
  6921. -->
  6922. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  6923. -->
  6924. Firing elaborate*copy-see-to-output-link
  6925. -->
  6926. (I3 ^see 0 +)
  6927. Firing elaborate*reward*based*on*reward
  6928. -->
  6929. (R955 ^value 1 +)
  6930. (R1 ^reward R955 +)
  6931. Firing propose*predict-yes
  6932. -->
  6933. (O1903 ^name predict-yes +)
  6934. (S1 ^operator O1903 +)
  6935. Firing propose*predict-no
  6936. -->
  6937. (O1904 ^name predict-no +)
  6938. (S1 ^operator O1904 +)
  6939. Firing rl*prefer*rvt*predict-no*H0*2
  6940. -->
  6941. (S1 ^operator O1902 = 0.314040627026034)
  6942. Firing rl*prefer*rvt*predict-yes*H0*1
  6943. -->
  6944. (S1 ^operator O1901 = 0.3804224030022332)
  6945. Firing prefer*rvt*predict-yes*H0
  6946. -->
  6947. Firing prefer*rvt*predict-no*H0
  6948. -->
  6949. Firing elaborate*copy-dir-to-output-link
  6950. -->
  6951. (I3 ^dir L +)
  6952. inner elaboration loop at bottom goal.
  6953. Retracting elaborate*copy-see-to-output-link
  6954. -->
  6955. (I3 ^see 0 +)
  6956. Retracting propose*predict-no
  6957. -->
  6958. (O1902 ^name predict-no +)
  6959. (S1 ^operator O1902 +)
  6960. Retracting propose*predict-yes
  6961. -->
  6962. (O1901 ^name predict-yes +)
  6963. (S1 ^operator O1901 +)
  6964. Retracting elaborate*reward*based*on*reward
  6965. -->
  6966. (R954 ^value 1 +)
  6967. (R1 ^reward R954 +)
  6968. Retracting elaborate*copy-dir-to-output-link
  6969. -->
  6970. (I3 ^dir U +)
  6971. Retracting rl*prefer*rvt*predict-no*H0*4
  6972. -->
  6973. (S1 ^operator O1902 = 1.)
  6974. Retracting rl*prefer*rvt*predict-yes*H0*3
  6975. -->
  6976. (S1 ^operator O1901 = 0.)
  6977. =>WM: (13328: S1 ^operator O1904 +)
  6978. =>WM: (13327: S1 ^operator O1903 +)
  6979. =>WM: (13326: I3 ^dir L)
  6980. =>WM: (13325: O1904 ^name predict-no)
  6981. =>WM: (13324: O1903 ^name predict-yes)
  6982. =>WM: (13323: R955 ^value 1)
  6983. =>WM: (13322: R1 ^reward R955)
  6984. <=WM: (13313: S1 ^operator O1901 +)
  6985. <=WM: (13314: S1 ^operator O1902 +)
  6986. <=WM: (13315: S1 ^operator O1902)
  6987. <=WM: (13299: I3 ^dir U)
  6988. <=WM: (13309: R1 ^reward R954)
  6989. <=WM: (13312: O1902 ^name predict-no)
  6990. <=WM: (13311: O1901 ^name predict-yes)
  6991. <=WM: (13310: R954 ^value 1)
  6992. --- Inner Elaboration Phase, active level 1 (S1) ---
  6993. Firing prefer*rvt*predict-yes*H0
  6994. -->
  6995. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  6996. -->
  6997. (S1 ^operator O1903 = 0.6195564468661043)
  6998. Firing rl*prefer*rvt*predict-yes*H0*1
  6999. -->
  7000. (S1 ^operator O1903 = 0.3804224030022332)
  7001. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7002. -->
  7003. Firing prefer*rvt*predict-no*H0
  7004. -->
  7005. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7006. -->
  7007. (S1 ^operator O1904 = -0.2190661556260421)
  7008. Firing rl*prefer*rvt*predict-no*H0*2
  7009. -->
  7010. (S1 ^operator O1904 = 0.314040627026034)
  7011. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7012. -->
  7013. inner elaboration loop at bottom goal.
  7014. Retracting rl*prefer*rvt*predict-no*H0*2
  7015. -->
  7016. (S1 ^operator O1902 = 0.314040627026034)
  7017. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7018. -->
  7019. (S1 ^operator O1902 = -0.2190661556260421)
  7020. Retracting rl*prefer*rvt*predict-yes*H0*1
  7021. -->
  7022. (S1 ^operator O1901 = 0.3804224030022332)
  7023. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7024. -->
  7025. (S1 ^operator O1901 = 0.6195564468661043)
  7026. --- END Proposal Phase ---
  7027. --- Decision Phase ---
  7028. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7029. =>WM: (13329: S1 ^operator O1903)
  7030. 952: O: O1903 (predict-yes)
  7031. --- END Decision Phase ---
  7032. --- Application Phase ---
  7033. --- Firing Productions (PE) For State At Depth 1 ---
  7034. --- Inner Elaboration Phase, active level 1 (S1) ---
  7035. Firing apply*operator
  7036. -->
  7037. (I3 ^predict-yes N952 + :O )
  7038. Firing apply*operator*complete
  7039. -->
  7040. (I3 ^predict-no N951 - :O )
  7041. inner elaboration loop at bottom goal.
  7042. --- Change Working Memory (PE) ---
  7043. =>WM: (13330: I3 ^predict-yes N952)
  7044. <=WM: (13317: N951 ^status complete)
  7045. <=WM: (13316: I3 ^predict-no N951)
  7046. --- Firing Productions (IE) For State At Depth 1 ---
  7047. --- Inner Elaboration Phase, active level 1 (S1) ---
  7048. Firing monitor*world
  7049. -->
  7050. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7051. --- Change Working Memory (IE) ---
  7052. --- END Application Phase ---
  7053. --- Output Phase ---
  7054. ENV: Agent did: predict-yes for direction L in state State-B
  7055. In State-B moving L
  7056. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7057. predict error 0
  7058. dir: dir isR
  7059. --- END Output Phase ---
  7060. \-/--- Input Phase ---
  7061. =>WM: (13334: I2 ^dir R)
  7062. =>WM: (13333: I2 ^reward 1)
  7063. =>WM: (13332: I2 ^see 1)
  7064. =>WM: (13331: N952 ^status complete)
  7065. <=WM: (13320: I2 ^dir L)
  7066. <=WM: (13319: I2 ^reward 1)
  7067. <=WM: (13318: I2 ^see 0)
  7068. =>WM: (13335: I2 ^level-1 L1-root)
  7069. <=WM: (13321: I2 ^level-1 R0-root)
  7070. --- END Input Phase ---
  7071. --- Proposal Phase ---
  7072. --- Inner Elaboration Phase, active level 1 (S1) ---
  7073. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7074. -->
  7075. (S1 ^operator O1903 = 0.7066224695034091)
  7076. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7077. -->
  7078. (S1 ^operator O1904 = -0.1937987592593187)
  7079. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7080. -->
  7081. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7082. -->
  7083. Firing elaborate*copy-see-to-output-link
  7084. -->
  7085. (I3 ^see 1 +)
  7086. Firing elaborate*reward*based*on*reward
  7087. -->
  7088. (R956 ^value 1 +)
  7089. (R1 ^reward R956 +)
  7090. Firing propose*predict-yes
  7091. -->
  7092. (O1905 ^name predict-yes +)
  7093. (S1 ^operator O1905 +)
  7094. Firing propose*predict-no
  7095. -->
  7096. (O1906 ^name predict-no +)
  7097. (S1 ^operator O1906 +)
  7098. Firing rl*prefer*rvt*predict-no*H0*6
  7099. -->
  7100. (S1 ^operator O1904 = 0.2298785768141863)
  7101. Firing rl*prefer*rvt*predict-yes*H0*5
  7102. -->
  7103. (S1 ^operator O1903 = 0.2940444083423254)
  7104. Firing prefer*rvt*predict-yes*H0
  7105. -->
  7106. Firing prefer*rvt*predict-no*H0
  7107. -->
  7108. Firing elaborate*copy-dir-to-output-link
  7109. -->
  7110. (I3 ^dir R +)
  7111. inner elaboration loop at bottom goal.
  7112. Retracting elaborate*copy-see-to-output-link
  7113. -->
  7114. (I3 ^see 0 +)
  7115. Retracting propose*predict-no
  7116. -->
  7117. (O1904 ^name predict-no +)
  7118. (S1 ^operator O1904 +)
  7119. Retracting propose*predict-yes
  7120. -->
  7121. (O1903 ^name predict-yes +)
  7122. (S1 ^operator O1903 +)
  7123. Retracting elaborate*reward*based*on*reward
  7124. -->
  7125. (R955 ^value 1 +)
  7126. (R1 ^reward R955 +)
  7127. Retracting elaborate*copy-dir-to-output-link
  7128. -->
  7129. (I3 ^dir L +)
  7130. Retracting rl*prefer*rvt*predict-no*H0*2
  7131. -->
  7132. (S1 ^operator O1904 = 0.314040627026034)
  7133. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7134. -->
  7135. (S1 ^operator O1904 = -0.2190661556260421)
  7136. Retracting rl*prefer*rvt*predict-yes*H0*1
  7137. -->
  7138. (S1 ^operator O1903 = 0.3804224030022332)
  7139. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7140. -->
  7141. (S1 ^operator O1903 = 0.6195564468661043)
  7142. =>WM: (13343: S1 ^operator O1906 +)
  7143. =>WM: (13342: S1 ^operator O1905 +)
  7144. =>WM: (13341: I3 ^dir R)
  7145. =>WM: (13340: O1906 ^name predict-no)
  7146. =>WM: (13339: O1905 ^name predict-yes)
  7147. =>WM: (13338: R956 ^value 1)
  7148. =>WM: (13337: R1 ^reward R956)
  7149. =>WM: (13336: I3 ^see 1)
  7150. <=WM: (13327: S1 ^operator O1903 +)
  7151. <=WM: (13329: S1 ^operator O1903)
  7152. <=WM: (13328: S1 ^operator O1904 +)
  7153. <=WM: (13326: I3 ^dir L)
  7154. <=WM: (13322: R1 ^reward R955)
  7155. <=WM: (13254: I3 ^see 0)
  7156. <=WM: (13325: O1904 ^name predict-no)
  7157. <=WM: (13324: O1903 ^name predict-yes)
  7158. <=WM: (13323: R955 ^value 1)
  7159. --- Inner Elaboration Phase, active level 1 (S1) ---
  7160. Firing prefer*rvt*predict-yes*H0
  7161. -->
  7162. Firing rl*prefer*rvt*predict-yes*H0*5
  7163. -->
  7164. (S1 ^operator O1905 = 0.2940444083423254)
  7165. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7166. -->
  7167. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7168. -->
  7169. (S1 ^operator O1905 = 0.7066224695034091)
  7170. Firing prefer*rvt*predict-no*H0
  7171. -->
  7172. Firing rl*prefer*rvt*predict-no*H0*6
  7173. -->
  7174. (S1 ^operator O1906 = 0.2298785768141863)
  7175. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7176. -->
  7177. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7178. -->
  7179. (S1 ^operator O1906 = -0.1937987592593187)
  7180. inner elaboration loop at bottom goal.
  7181. Retracting rl*prefer*rvt*predict-no*H0*6
  7182. -->
  7183. (S1 ^operator O1904 = 0.2298785768141863)
  7184. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7185. -->
  7186. (S1 ^operator O1904 = -0.1937987592593187)
  7187. Retracting rl*prefer*rvt*predict-yes*H0*5
  7188. -->
  7189. (S1 ^operator O1903 = 0.2940444083423254)
  7190. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7191. -->
  7192. (S1 ^operator O1903 = 0.7066224695034091)
  7193. --- END Proposal Phase ---
  7194. --- Decision Phase ---
  7195. RL update rl*prefer*rvt*predict-yes*H0*1 0.521353 -0.140931 0.380422 -> 0.521355 -0.140931 0.380424(R,m,v=1,0.819355,0.148974)
  7196. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478624 0.140933 0.619556 -> 0.478626 0.140932 0.619559(R,m,v=1,1,0)
  7197. =>WM: (13344: S1 ^operator O1905)
  7198. 953: O: O1905 (predict-yes)
  7199. --- END Decision Phase ---
  7200. --- Application Phase ---
  7201. --- Firing Productions (PE) For State At Depth 1 ---
  7202. --- Inner Elaboration Phase, active level 1 (S1) ---
  7203. Firing apply*operator
  7204. -->
  7205. (I3 ^predict-yes N953 + :O )
  7206. Firing apply*operator*complete
  7207. -->
  7208. (I3 ^predict-yes N952 - :O )
  7209. inner elaboration loop at bottom goal.
  7210. --- Change Working Memory (PE) ---
  7211. =>WM: (13345: I3 ^predict-yes N953)
  7212. <=WM: (13331: N952 ^status complete)
  7213. <=WM: (13330: I3 ^predict-yes N952)
  7214. --- Firing Productions (IE) For State At Depth 1 ---
  7215. --- Inner Elaboration Phase, active level 1 (S1) ---
  7216. Firing monitor*world
  7217. -->
  7218. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7219. --- Change Working Memory (IE) ---
  7220. --- END Application Phase ---
  7221. --- Output Phase ---
  7222. ENV: Agent did: predict-yes for direction R in state State-A
  7223. In State-A moving R
  7224. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7225. predict error 0
  7226. dir: dir isR
  7227. --- END Output Phase ---
  7228. |\---- Input Phase ---
  7229. =>WM: (13349: I2 ^dir R)
  7230. =>WM: (13348: I2 ^reward 1)
  7231. =>WM: (13347: I2 ^see 1)
  7232. =>WM: (13346: N953 ^status complete)
  7233. <=WM: (13334: I2 ^dir R)
  7234. <=WM: (13333: I2 ^reward 1)
  7235. <=WM: (13332: I2 ^see 1)
  7236. =>WM: (13350: I2 ^level-1 R1-root)
  7237. <=WM: (13335: I2 ^level-1 L1-root)
  7238. --- END Input Phase ---
  7239. --- Proposal Phase ---
  7240. --- Inner Elaboration Phase, active level 1 (S1) ---
  7241. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7242. -->
  7243. (S1 ^operator O1905 = -0.252585164213872)
  7244. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7245. -->
  7246. (S1 ^operator O1906 = 0.7702047625716166)
  7247. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7248. -->
  7249. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7250. -->
  7251. Firing elaborate*copy-see-to-output-link
  7252. -->
  7253. (I3 ^see 1 +)
  7254. Firing elaborate*reward*based*on*reward
  7255. -->
  7256. (R957 ^value 1 +)
  7257. (R1 ^reward R957 +)
  7258. Firing propose*predict-yes
  7259. -->
  7260. (O1907 ^name predict-yes +)
  7261. (S1 ^operator O1907 +)
  7262. Firing propose*predict-no
  7263. -->
  7264. (O1908 ^name predict-no +)
  7265. (S1 ^operator O1908 +)
  7266. Firing rl*prefer*rvt*predict-no*H0*6
  7267. -->
  7268. (S1 ^operator O1906 = 0.2298785768141863)
  7269. Firing rl*prefer*rvt*predict-yes*H0*5
  7270. -->
  7271. (S1 ^operator O1905 = 0.2940444083423254)
  7272. Firing prefer*rvt*predict-yes*H0
  7273. -->
  7274. Firing prefer*rvt*predict-no*H0
  7275. -->
  7276. Firing elaborate*copy-dir-to-output-link
  7277. -->
  7278. (I3 ^dir R +)
  7279. inner elaboration loop at bottom goal.
  7280. Retracting elaborate*copy-see-to-output-link
  7281. -->
  7282. (I3 ^see 1 +)
  7283. Retracting propose*predict-no
  7284. -->
  7285. (O1906 ^name predict-no +)
  7286. (S1 ^operator O1906 +)
  7287. Retracting propose*predict-yes
  7288. -->
  7289. (O1905 ^name predict-yes +)
  7290. (S1 ^operator O1905 +)
  7291. Retracting elaborate*reward*based*on*reward
  7292. -->
  7293. (R956 ^value 1 +)
  7294. (R1 ^reward R956 +)
  7295. Retracting elaborate*copy-dir-to-output-link
  7296. -->
  7297. (I3 ^dir R +)
  7298. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7299. -->
  7300. (S1 ^operator O1906 = -0.1937987592593187)
  7301. Retracting rl*prefer*rvt*predict-no*H0*6
  7302. -->
  7303. (S1 ^operator O1906 = 0.2298785768141863)
  7304. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7305. -->
  7306. (S1 ^operator O1905 = 0.7066224695034091)
  7307. Retracting rl*prefer*rvt*predict-yes*H0*5
  7308. -->
  7309. (S1 ^operator O1905 = 0.2940444083423254)
  7310. =>WM: (13356: S1 ^operator O1908 +)
  7311. =>WM: (13355: S1 ^operator O1907 +)
  7312. =>WM: (13354: O1908 ^name predict-no)
  7313. =>WM: (13353: O1907 ^name predict-yes)
  7314. =>WM: (13352: R957 ^value 1)
  7315. =>WM: (13351: R1 ^reward R957)
  7316. <=WM: (13342: S1 ^operator O1905 +)
  7317. <=WM: (13344: S1 ^operator O1905)
  7318. <=WM: (13343: S1 ^operator O1906 +)
  7319. <=WM: (13337: R1 ^reward R956)
  7320. <=WM: (13340: O1906 ^name predict-no)
  7321. <=WM: (13339: O1905 ^name predict-yes)
  7322. <=WM: (13338: R956 ^value 1)
  7323. --- Inner Elaboration Phase, active level 1 (S1) ---
  7324. Firing prefer*rvt*predict-yes*H0
  7325. -->
  7326. Firing rl*prefer*rvt*predict-yes*H0*5
  7327. -->
  7328. (S1 ^operator O1907 = 0.2940444083423254)
  7329. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7330. -->
  7331. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7332. -->
  7333. (S1 ^operator O1907 = -0.252585164213872)
  7334. Firing prefer*rvt*predict-no*H0
  7335. -->
  7336. Firing rl*prefer*rvt*predict-no*H0*6
  7337. -->
  7338. (S1 ^operator O1908 = 0.2298785768141863)
  7339. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7340. -->
  7341. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7342. -->
  7343. (S1 ^operator O1908 = 0.7702047625716166)
  7344. inner elaboration loop at bottom goal.
  7345. Retracting rl*prefer*rvt*predict-no*H0*6
  7346. -->
  7347. (S1 ^operator O1906 = 0.2298785768141863)
  7348. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7349. -->
  7350. (S1 ^operator O1906 = 0.7702047625716166)
  7351. Retracting rl*prefer*rvt*predict-yes*H0*5
  7352. -->
  7353. (S1 ^operator O1905 = 0.2940444083423254)
  7354. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7355. -->
  7356. (S1 ^operator O1905 = -0.252585164213872)
  7357. --- END Proposal Phase ---
  7358. --- Decision Phase ---
  7359. RL update rl*prefer*rvt*predict-yes*H0*5 0.501112 -0.207068 0.294044 -> 0.501062 -0.207073 0.293989(R,m,v=1,0.835616,0.138309)
  7360. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499487 0.207136 0.706622 -> 0.499427 0.207129 0.706557(R,m,v=1,1,0)
  7361. =>WM: (13357: S1 ^operator O1908)
  7362. 954: O: O1908 (predict-no)
  7363. --- END Decision Phase ---
  7364. --- Application Phase ---
  7365. --- Firing Productions (PE) For State At Depth 1 ---
  7366. --- Inner Elaboration Phase, active level 1 (S1) ---
  7367. Firing apply*operator
  7368. -->
  7369. (I3 ^predict-no N954 + :O )
  7370. Firing apply*operator*complete
  7371. -->
  7372. (I3 ^predict-yes N953 - :O )
  7373. inner elaboration loop at bottom goal.
  7374. --- Change Working Memory (PE) ---
  7375. =>WM: (13358: I3 ^predict-no N954)
  7376. <=WM: (13346: N953 ^status complete)
  7377. <=WM: (13345: I3 ^predict-yes N953)
  7378. --- Firing Productions (IE) For State At Depth 1 ---
  7379. --- Inner Elaboration Phase, active level 1 (S1) ---
  7380. Firing monitor*world
  7381. -->
  7382. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7383. --- Change Working Memory (IE) ---
  7384. --- END Application Phase ---
  7385. --- Output Phase ---
  7386. ENV: Agent did: predict-no for direction R in state State-B
  7387. In State-B moving R
  7388. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7389. predict error 0
  7390. dir: dir isU
  7391. --- END Output Phase ---
  7392. /|\--- Input Phase ---
  7393. =>WM: (13362: I2 ^dir U)
  7394. =>WM: (13361: I2 ^reward 1)
  7395. =>WM: (13360: I2 ^see 0)
  7396. =>WM: (13359: N954 ^status complete)
  7397. <=WM: (13349: I2 ^dir R)
  7398. <=WM: (13348: I2 ^reward 1)
  7399. <=WM: (13347: I2 ^see 1)
  7400. =>WM: (13363: I2 ^level-1 R0-root)
  7401. <=WM: (13350: I2 ^level-1 R1-root)
  7402. --- END Input Phase ---
  7403. --- Proposal Phase ---
  7404. --- Inner Elaboration Phase, active level 1 (S1) ---
  7405. Firing elaborate*copy-see-to-output-link
  7406. -->
  7407. (I3 ^see 0 +)
  7408. Firing elaborate*reward*based*on*reward
  7409. -->
  7410. (R958 ^value 1 +)
  7411. (R1 ^reward R958 +)
  7412. Firing propose*predict-yes
  7413. -->
  7414. (O1909 ^name predict-yes +)
  7415. (S1 ^operator O1909 +)
  7416. Firing propose*predict-no
  7417. -->
  7418. (O1910 ^name predict-no +)
  7419. (S1 ^operator O1910 +)
  7420. Firing rl*prefer*rvt*predict-no*H0*4
  7421. -->
  7422. (S1 ^operator O1908 = 1.)
  7423. Firing rl*prefer*rvt*predict-yes*H0*3
  7424. -->
  7425. (S1 ^operator O1907 = 0.)
  7426. Firing prefer*rvt*predict-yes*H0
  7427. -->
  7428. Firing prefer*rvt*predict-no*H0
  7429. -->
  7430. Firing elaborate*copy-dir-to-output-link
  7431. -->
  7432. (I3 ^dir U +)
  7433. inner elaboration loop at bottom goal.
  7434. Retracting elaborate*copy-see-to-output-link
  7435. -->
  7436. (I3 ^see 1 +)
  7437. Retracting propose*predict-no
  7438. -->
  7439. (O1908 ^name predict-no +)
  7440. (S1 ^operator O1908 +)
  7441. Retracting propose*predict-yes
  7442. -->
  7443. (O1907 ^name predict-yes +)
  7444. (S1 ^operator O1907 +)
  7445. Retracting elaborate*reward*based*on*reward
  7446. -->
  7447. (R957 ^value 1 +)
  7448. (R1 ^reward R957 +)
  7449. Retracting elaborate*copy-dir-to-output-link
  7450. -->
  7451. (I3 ^dir R +)
  7452. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7453. -->
  7454. (S1 ^operator O1908 = 0.7702047625716166)
  7455. Retracting rl*prefer*rvt*predict-no*H0*6
  7456. -->
  7457. (S1 ^operator O1908 = 0.2298785768141863)
  7458. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7459. -->
  7460. (S1 ^operator O1907 = -0.252585164213872)
  7461. Retracting rl*prefer*rvt*predict-yes*H0*5
  7462. -->
  7463. (S1 ^operator O1907 = 0.2939886829338975)
  7464. =>WM: (13371: S1 ^operator O1910 +)
  7465. =>WM: (13370: S1 ^operator O1909 +)
  7466. =>WM: (13369: I3 ^dir U)
  7467. =>WM: (13368: O1910 ^name predict-no)
  7468. =>WM: (13367: O1909 ^name predict-yes)
  7469. =>WM: (13366: R958 ^value 1)
  7470. =>WM: (13365: R1 ^reward R958)
  7471. =>WM: (13364: I3 ^see 0)
  7472. <=WM: (13355: S1 ^operator O1907 +)
  7473. <=WM: (13356: S1 ^operator O1908 +)
  7474. <=WM: (13357: S1 ^operator O1908)
  7475. <=WM: (13341: I3 ^dir R)
  7476. <=WM: (13351: R1 ^reward R957)
  7477. <=WM: (13336: I3 ^see 1)
  7478. <=WM: (13354: O1908 ^name predict-no)
  7479. <=WM: (13353: O1907 ^name predict-yes)
  7480. <=WM: (13352: R957 ^value 1)
  7481. --- Inner Elaboration Phase, active level 1 (S1) ---
  7482. Firing prefer*rvt*predict-yes*H0
  7483. -->
  7484. Firing rl*prefer*rvt*predict-yes*H0*3
  7485. -->
  7486. (S1 ^operator O1909 = 0.)
  7487. Firing prefer*rvt*predict-no*H0
  7488. -->
  7489. Firing rl*prefer*rvt*predict-no*H0*4
  7490. -->
  7491. (S1 ^operator O1910 = 1.)
  7492. inner elaboration loop at bottom goal.
  7493. Retracting rl*prefer*rvt*predict-no*H0*4
  7494. -->
  7495. (S1 ^operator O1908 = 1.)
  7496. Retracting rl*prefer*rvt*predict-yes*H0*3
  7497. -->
  7498. (S1 ^operator O1907 = 0.)
  7499. --- END Proposal Phase ---
  7500. --- Decision Phase ---
  7501. RL update rl*prefer*rvt*predict-no*H0*6 0.611927 -0.382049 0.229879 -> 0.611922 -0.38205 0.229872(R,m,v=1,0.842105,0.133746)
  7502. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388141 0.382064 0.770205 -> 0.388134 0.382063 0.770196(R,m,v=1,1,0)
  7503. =>WM: (13372: S1 ^operator O1910)
  7504. 955: O: O1910 (predict-no)
  7505. --- END Decision Phase ---
  7506. --- Application Phase ---
  7507. --- Firing Productions (PE) For State At Depth 1 ---
  7508. --- Inner Elaboration Phase, active level 1 (S1) ---
  7509. Firing apply*operator
  7510. -->
  7511. (I3 ^predict-no N955 + :O )
  7512. Firing apply*operator*complete
  7513. -->
  7514. (I3 ^predict-no N954 - :O )
  7515. inner elaboration loop at bottom goal.
  7516. --- Change Working Memory (PE) ---
  7517. =>WM: (13373: I3 ^predict-no N955)
  7518. <=WM: (13359: N954 ^status complete)
  7519. <=WM: (13358: I3 ^predict-no N954)
  7520. --- Firing Productions (IE) For State At Depth 1 ---
  7521. --- Inner Elaboration Phase, active level 1 (S1) ---
  7522. Firing monitor*world
  7523. -->
  7524. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7525. --- Change Working Memory (IE) ---
  7526. --- END Application Phase ---
  7527. --- Output Phase ---
  7528. ENV: Agent did: predict-no for direction U in state State-B
  7529. In State-B moving U
  7530. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7531. predict error 0
  7532. dir: dir isL
  7533. --- END Output Phase ---
  7534. -/|--- Input Phase ---
  7535. =>WM: (13377: I2 ^dir L)
  7536. =>WM: (13376: I2 ^reward 1)
  7537. =>WM: (13375: I2 ^see 0)
  7538. =>WM: (13374: N955 ^status complete)
  7539. <=WM: (13362: I2 ^dir U)
  7540. <=WM: (13361: I2 ^reward 1)
  7541. <=WM: (13360: I2 ^see 0)
  7542. =>WM: (13378: I2 ^level-1 R0-root)
  7543. <=WM: (13363: I2 ^level-1 R0-root)
  7544. --- END Input Phase ---
  7545. --- Proposal Phase ---
  7546. --- Inner Elaboration Phase, active level 1 (S1) ---
  7547. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7548. -->
  7549. (S1 ^operator O1909 = 0.6195585094345952)
  7550. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7551. -->
  7552. (S1 ^operator O1910 = -0.2190661556260421)
  7553. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7554. -->
  7555. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7556. -->
  7557. Firing elaborate*copy-see-to-output-link
  7558. -->
  7559. (I3 ^see 0 +)
  7560. Firing elaborate*reward*based*on*reward
  7561. -->
  7562. (R959 ^value 1 +)
  7563. (R1 ^reward R959 +)
  7564. Firing propose*predict-yes
  7565. -->
  7566. (O1911 ^name predict-yes +)
  7567. (S1 ^operator O1911 +)
  7568. Firing propose*predict-no
  7569. -->
  7570. (O1912 ^name predict-no +)
  7571. (S1 ^operator O1912 +)
  7572. Firing rl*prefer*rvt*predict-no*H0*2
  7573. -->
  7574. (S1 ^operator O1910 = 0.314040627026034)
  7575. Firing rl*prefer*rvt*predict-yes*H0*1
  7576. -->
  7577. (S1 ^operator O1909 = 0.3804241528486575)
  7578. Firing prefer*rvt*predict-yes*H0
  7579. -->
  7580. Firing prefer*rvt*predict-no*H0
  7581. -->
  7582. Firing elaborate*copy-dir-to-output-link
  7583. -->
  7584. (I3 ^dir L +)
  7585. inner elaboration loop at bottom goal.
  7586. Retracting elaborate*copy-see-to-output-link
  7587. -->
  7588. (I3 ^see 0 +)
  7589. Retracting propose*predict-no
  7590. -->
  7591. (O1910 ^name predict-no +)
  7592. (S1 ^operator O1910 +)
  7593. Retracting propose*predict-yes
  7594. -->
  7595. (O1909 ^name predict-yes +)
  7596. (S1 ^operator O1909 +)
  7597. Retracting elaborate*reward*based*on*reward
  7598. -->
  7599. (R958 ^value 1 +)
  7600. (R1 ^reward R958 +)
  7601. Retracting elaborate*copy-dir-to-output-link
  7602. -->
  7603. (I3 ^dir U +)
  7604. Retracting rl*prefer*rvt*predict-no*H0*4
  7605. -->
  7606. (S1 ^operator O1910 = 1.)
  7607. Retracting rl*prefer*rvt*predict-yes*H0*3
  7608. -->
  7609. (S1 ^operator O1909 = 0.)
  7610. =>WM: (13385: S1 ^operator O1912 +)
  7611. =>WM: (13384: S1 ^operator O1911 +)
  7612. =>WM: (13383: I3 ^dir L)
  7613. =>WM: (13382: O1912 ^name predict-no)
  7614. =>WM: (13381: O1911 ^name predict-yes)
  7615. =>WM: (13380: R959 ^value 1)
  7616. =>WM: (13379: R1 ^reward R959)
  7617. <=WM: (13370: S1 ^operator O1909 +)
  7618. <=WM: (13371: S1 ^operator O1910 +)
  7619. <=WM: (13372: S1 ^operator O1910)
  7620. <=WM: (13369: I3 ^dir U)
  7621. <=WM: (13365: R1 ^reward R958)
  7622. <=WM: (13368: O1910 ^name predict-no)
  7623. <=WM: (13367: O1909 ^name predict-yes)
  7624. <=WM: (13366: R958 ^value 1)
  7625. --- Inner Elaboration Phase, active level 1 (S1) ---
  7626. Firing prefer*rvt*predict-yes*H0
  7627. -->
  7628. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7629. -->
  7630. (S1 ^operator O1911 = 0.6195585094345952)
  7631. Firing rl*prefer*rvt*predict-yes*H0*1
  7632. -->
  7633. (S1 ^operator O1911 = 0.3804241528486575)
  7634. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7635. -->
  7636. Firing prefer*rvt*predict-no*H0
  7637. -->
  7638. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7639. -->
  7640. (S1 ^operator O1912 = -0.2190661556260421)
  7641. Firing rl*prefer*rvt*predict-no*H0*2
  7642. -->
  7643. (S1 ^operator O1912 = 0.314040627026034)
  7644. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7645. -->
  7646. inner elaboration loop at bottom goal.
  7647. Retracting rl*prefer*rvt*predict-no*H0*2
  7648. -->
  7649. (S1 ^operator O1910 = 0.314040627026034)
  7650. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7651. -->
  7652. (S1 ^operator O1910 = -0.2190661556260421)
  7653. Retracting rl*prefer*rvt*predict-yes*H0*1
  7654. -->
  7655. (S1 ^operator O1909 = 0.3804241528486575)
  7656. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7657. -->
  7658. (S1 ^operator O1909 = 0.6195585094345952)
  7659. --- END Proposal Phase ---
  7660. --- Decision Phase ---
  7661. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7662. =>WM: (13386: S1 ^operator O1911)
  7663. 956: O: O1911 (predict-yes)
  7664. --- END Decision Phase ---
  7665. --- Application Phase ---
  7666. --- Firing Productions (PE) For State At Depth 1 ---
  7667. --- Inner Elaboration Phase, active level 1 (S1) ---
  7668. Firing apply*operator
  7669. -->
  7670. (I3 ^predict-yes N956 + :O )
  7671. Firing apply*operator*complete
  7672. -->
  7673. (I3 ^predict-no N955 - :O )
  7674. inner elaboration loop at bottom goal.
  7675. --- Change Working Memory (PE) ---
  7676. =>WM: (13387: I3 ^predict-yes N956)
  7677. <=WM: (13374: N955 ^status complete)
  7678. <=WM: (13373: I3 ^predict-no N955)
  7679. --- Firing Productions (IE) For State At Depth 1 ---
  7680. --- Inner Elaboration Phase, active level 1 (S1) ---
  7681. Firing monitor*world
  7682. -->
  7683. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7684. --- Change Working Memory (IE) ---
  7685. --- END Application Phase ---
  7686. --- Output Phase ---
  7687. ENV: Agent did: predict-yes for direction L in state State-B
  7688. In State-B moving L
  7689. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7690. predict error 0
  7691. dir: dir isL
  7692. --- END Output Phase ---
  7693. \-/--- Input Phase ---
  7694. =>WM: (13391: I2 ^dir L)
  7695. =>WM: (13390: I2 ^reward 1)
  7696. =>WM: (13389: I2 ^see 1)
  7697. =>WM: (13388: N956 ^status complete)
  7698. <=WM: (13377: I2 ^dir L)
  7699. <=WM: (13376: I2 ^reward 1)
  7700. <=WM: (13375: I2 ^see 0)
  7701. =>WM: (13392: I2 ^level-1 L1-root)
  7702. <=WM: (13378: I2 ^level-1 R0-root)
  7703. --- END Input Phase ---
  7704. --- Proposal Phase ---
  7705. --- Inner Elaboration Phase, active level 1 (S1) ---
  7706. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7707. -->
  7708. (S1 ^operator O1911 = -0.3470159027404986)
  7709. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7710. -->
  7711. (S1 ^operator O1912 = 0.6861879370801713)
  7712. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7713. -->
  7714. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7715. -->
  7716. Firing elaborate*copy-see-to-output-link
  7717. -->
  7718. (I3 ^see 1 +)
  7719. Firing elaborate*reward*based*on*reward
  7720. -->
  7721. (R960 ^value 1 +)
  7722. (R1 ^reward R960 +)
  7723. Firing propose*predict-yes
  7724. -->
  7725. (O1913 ^name predict-yes +)
  7726. (S1 ^operator O1913 +)
  7727. Firing propose*predict-no
  7728. -->
  7729. (O1914 ^name predict-no +)
  7730. (S1 ^operator O1914 +)
  7731. Firing rl*prefer*rvt*predict-no*H0*2
  7732. -->
  7733. (S1 ^operator O1912 = 0.314040627026034)
  7734. Firing rl*prefer*rvt*predict-yes*H0*1
  7735. -->
  7736. (S1 ^operator O1911 = 0.3804241528486575)
  7737. Firing prefer*rvt*predict-yes*H0
  7738. -->
  7739. Firing prefer*rvt*predict-no*H0
  7740. -->
  7741. Firing elaborate*copy-dir-to-output-link
  7742. -->
  7743. (I3 ^dir L +)
  7744. inner elaboration loop at bottom goal.
  7745. Retracting elaborate*copy-see-to-output-link
  7746. -->
  7747. (I3 ^see 0 +)
  7748. Retracting propose*predict-no
  7749. -->
  7750. (O1912 ^name predict-no +)
  7751. (S1 ^operator O1912 +)
  7752. Retracting propose*predict-yes
  7753. -->
  7754. (O1911 ^name predict-yes +)
  7755. (S1 ^operator O1911 +)
  7756. Retracting elaborate*reward*based*on*reward
  7757. -->
  7758. (R959 ^value 1 +)
  7759. (R1 ^reward R959 +)
  7760. Retracting elaborate*copy-dir-to-output-link
  7761. -->
  7762. (I3 ^dir L +)
  7763. Retracting rl*prefer*rvt*predict-no*H0*2
  7764. -->
  7765. (S1 ^operator O1912 = 0.314040627026034)
  7766. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7767. -->
  7768. (S1 ^operator O1912 = -0.2190661556260421)
  7769. Retracting rl*prefer*rvt*predict-yes*H0*1
  7770. -->
  7771. (S1 ^operator O1911 = 0.3804241528486575)
  7772. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7773. -->
  7774. (S1 ^operator O1911 = 0.6195585094345952)
  7775. =>WM: (13399: S1 ^operator O1914 +)
  7776. =>WM: (13398: S1 ^operator O1913 +)
  7777. =>WM: (13397: O1914 ^name predict-no)
  7778. =>WM: (13396: O1913 ^name predict-yes)
  7779. =>WM: (13395: R960 ^value 1)
  7780. =>WM: (13394: R1 ^reward R960)
  7781. =>WM: (13393: I3 ^see 1)
  7782. <=WM: (13384: S1 ^operator O1911 +)
  7783. <=WM: (13386: S1 ^operator O1911)
  7784. <=WM: (13385: S1 ^operator O1912 +)
  7785. <=WM: (13379: R1 ^reward R959)
  7786. <=WM: (13364: I3 ^see 0)
  7787. <=WM: (13382: O1912 ^name predict-no)
  7788. <=WM: (13381: O1911 ^name predict-yes)
  7789. <=WM: (13380: R959 ^value 1)
  7790. --- Inner Elaboration Phase, active level 1 (S1) ---
  7791. Firing prefer*rvt*predict-yes*H0
  7792. -->
  7793. Firing rl*prefer*rvt*predict-yes*H0*1
  7794. -->
  7795. (S1 ^operator O1913 = 0.3804241528486575)
  7796. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7797. -->
  7798. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7799. -->
  7800. (S1 ^operator O1913 = -0.3470159027404986)
  7801. Firing prefer*rvt*predict-no*H0
  7802. -->
  7803. Firing rl*prefer*rvt*predict-no*H0*2
  7804. -->
  7805. (S1 ^operator O1914 = 0.314040627026034)
  7806. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7807. -->
  7808. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7809. -->
  7810. (S1 ^operator O1914 = 0.6861879370801713)
  7811. inner elaboration loop at bottom goal.
  7812. Retracting rl*prefer*rvt*predict-no*H0*2
  7813. -->
  7814. (S1 ^operator O1912 = 0.314040627026034)
  7815. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7816. -->
  7817. (S1 ^operator O1912 = 0.6861879370801713)
  7818. Retracting rl*prefer*rvt*predict-yes*H0*1
  7819. -->
  7820. (S1 ^operator O1911 = 0.3804241528486575)
  7821. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7822. -->
  7823. (S1 ^operator O1911 = -0.3470159027404986)
  7824. --- END Proposal Phase ---
  7825. --- Decision Phase ---
  7826. RL update rl*prefer*rvt*predict-yes*H0*1 0.521355 -0.140931 0.380424 -> 0.521357 -0.140931 0.380426(R,m,v=1,0.820513,0.148222)
  7827. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478626 0.140932 0.619559 -> 0.478628 0.140932 0.61956(R,m,v=1,1,0)
  7828. =>WM: (13400: S1 ^operator O1914)
  7829. 957: O: O1914 (predict-no)
  7830. --- END Decision Phase ---
  7831. --- Application Phase ---
  7832. --- Firing Productions (PE) For State At Depth 1 ---
  7833. --- Inner Elaboration Phase, active level 1 (S1) ---
  7834. Firing apply*operator
  7835. -->
  7836. (I3 ^predict-no N957 + :O )
  7837. Firing apply*operator*complete
  7838. -->
  7839. (I3 ^predict-yes N956 - :O )
  7840. inner elaboration loop at bottom goal.
  7841. --- Change Working Memory (PE) ---
  7842. =>WM: (13401: I3 ^predict-no N957)
  7843. <=WM: (13388: N956 ^status complete)
  7844. <=WM: (13387: I3 ^predict-yes N956)
  7845. --- Firing Productions (IE) For State At Depth 1 ---
  7846. --- Inner Elaboration Phase, active level 1 (S1) ---
  7847. Firing monitor*world
  7848. -->
  7849. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7850. --- Change Working Memory (IE) ---
  7851. --- END Application Phase ---
  7852. --- Output Phase ---
  7853. ENV: Agent did: predict-no for direction L in state State-A
  7854. In State-A moving L
  7855. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7856. predict error 0
  7857. dir: dir isL
  7858. --- END Output Phase ---
  7859. |\---- Input Phase ---
  7860. =>WM: (13405: I2 ^dir L)
  7861. =>WM: (13404: I2 ^reward 1)
  7862. =>WM: (13403: I2 ^see 0)
  7863. =>WM: (13402: N957 ^status complete)
  7864. <=WM: (13391: I2 ^dir L)
  7865. <=WM: (13390: I2 ^reward 1)
  7866. <=WM: (13389: I2 ^see 1)
  7867. =>WM: (13406: I2 ^level-1 L0-root)
  7868. <=WM: (13392: I2 ^level-1 L1-root)
  7869. --- END Input Phase ---
  7870. --- Proposal Phase ---
  7871. --- Inner Elaboration Phase, active level 1 (S1) ---
  7872. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7873. -->
  7874. (S1 ^operator O1913 = -0.3332708974800781)
  7875. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7876. -->
  7877. (S1 ^operator O1914 = 0.6857507825115492)
  7878. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7879. -->
  7880. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7881. -->
  7882. Firing elaborate*copy-see-to-output-link
  7883. -->
  7884. (I3 ^see 0 +)
  7885. Firing elaborate*reward*based*on*reward
  7886. -->
  7887. (R961 ^value 1 +)
  7888. (R1 ^reward R961 +)
  7889. Firing propose*predict-yes
  7890. -->
  7891. (O1915 ^name predict-yes +)
  7892. (S1 ^operator O1915 +)
  7893. Firing propose*predict-no
  7894. -->
  7895. (O1916 ^name predict-no +)
  7896. (S1 ^operator O1916 +)
  7897. Firing rl*prefer*rvt*predict-no*H0*2
  7898. -->
  7899. (S1 ^operator O1914 = 0.314040627026034)
  7900. Firing rl*prefer*rvt*predict-yes*H0*1
  7901. -->
  7902. (S1 ^operator O1913 = 0.3804255857519139)
  7903. Firing prefer*rvt*predict-yes*H0
  7904. -->
  7905. Firing prefer*rvt*predict-no*H0
  7906. -->
  7907. Firing elaborate*copy-dir-to-output-link
  7908. -->
  7909. (I3 ^dir L +)
  7910. inner elaboration loop at bottom goal.
  7911. Retracting elaborate*copy-see-to-output-link
  7912. -->
  7913. (I3 ^see 1 +)
  7914. Retracting propose*predict-no
  7915. -->
  7916. (O1914 ^name predict-no +)
  7917. (S1 ^operator O1914 +)
  7918. Retracting propose*predict-yes
  7919. -->
  7920. (O1913 ^name predict-yes +)
  7921. (S1 ^operator O1913 +)
  7922. Retracting elaborate*reward*based*on*reward
  7923. -->
  7924. (R960 ^value 1 +)
  7925. (R1 ^reward R960 +)
  7926. Retracting elaborate*copy-dir-to-output-link
  7927. -->
  7928. (I3 ^dir L +)
  7929. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7930. -->
  7931. (S1 ^operator O1914 = 0.6861879370801713)
  7932. Retracting rl*prefer*rvt*predict-no*H0*2
  7933. -->
  7934. (S1 ^operator O1914 = 0.314040627026034)
  7935. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7936. -->
  7937. (S1 ^operator O1913 = -0.3470159027404986)
  7938. Retracting rl*prefer*rvt*predict-yes*H0*1
  7939. -->
  7940. (S1 ^operator O1913 = 0.3804255857519139)
  7941. =>WM: (13413: S1 ^operator O1916 +)
  7942. =>WM: (13412: S1 ^operator O1915 +)
  7943. =>WM: (13411: O1916 ^name predict-no)
  7944. =>WM: (13410: O1915 ^name predict-yes)
  7945. =>WM: (13409: R961 ^value 1)
  7946. =>WM: (13408: R1 ^reward R961)
  7947. =>WM: (13407: I3 ^see 0)
  7948. <=WM: (13398: S1 ^operator O1913 +)
  7949. <=WM: (13399: S1 ^operator O1914 +)
  7950. <=WM: (13400: S1 ^operator O1914)
  7951. <=WM: (13394: R1 ^reward R960)
  7952. <=WM: (13393: I3 ^see 1)
  7953. <=WM: (13397: O1914 ^name predict-no)
  7954. <=WM: (13396: O1913 ^name predict-yes)
  7955. <=WM: (13395: R960 ^value 1)
  7956. --- Inner Elaboration Phase, active level 1 (S1) ---
  7957. Firing prefer*rvt*predict-yes*H0
  7958. -->
  7959. Firing rl*prefer*rvt*predict-yes*H0*1
  7960. -->
  7961. (S1 ^operator O1915 = 0.3804255857519139)
  7962. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7963. -->
  7964. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7965. -->
  7966. (S1 ^operator O1915 = -0.3332708974800781)
  7967. Firing prefer*rvt*predict-no*H0
  7968. -->
  7969. Firing rl*prefer*rvt*predict-no*H0*2
  7970. -->
  7971. (S1 ^operator O1916 = 0.314040627026034)
  7972. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7973. -->
  7974. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7975. -->
  7976. (S1 ^operator O1916 = 0.6857507825115492)
  7977. inner elaboration loop at bottom goal.
  7978. Retracting rl*prefer*rvt*predict-no*H0*2
  7979. -->
  7980. (S1 ^operator O1914 = 0.314040627026034)
  7981. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7982. -->
  7983. (S1 ^operator O1914 = 0.6857507825115492)
  7984. Retracting rl*prefer*rvt*predict-yes*H0*1
  7985. -->
  7986. (S1 ^operator O1913 = 0.3804255857519139)
  7987. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7988. -->
  7989. (S1 ^operator O1913 = -0.3332708974800781)
  7990. --- END Proposal Phase ---
  7991. --- Decision Phase ---
  7992. RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485031 -0.17101 0.314022(R,m,v=1,0.858108,0.122587)
  7993. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515134 0.171054 0.686188 -> 0.515116 0.171049 0.686165(R,m,v=1,1,0)
  7994. =>WM: (13414: S1 ^operator O1916)
  7995. 958: O: O1916 (predict-no)
  7996. --- END Decision Phase ---
  7997. --- Application Phase ---
  7998. --- Firing Productions (PE) For State At Depth 1 ---
  7999. --- Inner Elaboration Phase, active level 1 (S1) ---
  8000. Firing apply*operator
  8001. -->
  8002. (I3 ^predict-no N958 + :O )
  8003. Firing apply*operator*complete
  8004. -->
  8005. (I3 ^predict-no N957 - :O )
  8006. inner elaboration loop at bottom goal.
  8007. --- Change Working Memory (PE) ---
  8008. =>WM: (13415: I3 ^predict-no N958)
  8009. <=WM: (13402: N957 ^status complete)
  8010. <=WM: (13401: I3 ^predict-no N957)
  8011. --- Firing Productions (IE) For State At Depth 1 ---
  8012. --- Inner Elaboration Phase, active level 1 (S1) ---
  8013. Firing monitor*world
  8014. -->
  8015. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8016. --- Change Working Memory (IE) ---
  8017. --- END Application Phase ---
  8018. --- Output Phase ---
  8019. ENV: Agent did: predict-no for direction L in state State-A
  8020. In State-A moving L
  8021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8022. predict error 0
  8023. dir: dir isR
  8024. --- END Output Phase ---
  8025. /|\--- Input Phase ---
  8026. =>WM: (13419: I2 ^dir R)
  8027. =>WM: (13418: I2 ^reward 1)
  8028. =>WM: (13417: I2 ^see 0)
  8029. =>WM: (13416: N958 ^status complete)
  8030. <=WM: (13405: I2 ^dir L)
  8031. <=WM: (13404: I2 ^reward 1)
  8032. <=WM: (13403: I2 ^see 0)
  8033. =>WM: (13420: I2 ^level-1 L0-root)
  8034. <=WM: (13406: I2 ^level-1 L0-root)
  8035. --- END Input Phase ---
  8036. --- Proposal Phase ---
  8037. --- Inner Elaboration Phase, active level 1 (S1) ---
  8038. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8039. -->
  8040. (S1 ^operator O1915 = 0.7053811599250611)
  8041. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8042. -->
  8043. (S1 ^operator O1916 = -0.2023211881870005)
  8044. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8045. -->
  8046. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8047. -->
  8048. Firing elaborate*copy-see-to-output-link
  8049. -->
  8050. (I3 ^see 0 +)
  8051. Firing elaborate*reward*based*on*reward
  8052. -->
  8053. (R962 ^value 1 +)
  8054. (R1 ^reward R962 +)
  8055. Firing propose*predict-yes
  8056. -->
  8057. (O1917 ^name predict-yes +)
  8058. (S1 ^operator O1917 +)
  8059. Firing propose*predict-no
  8060. -->
  8061. (O1918 ^name predict-no +)
  8062. (S1 ^operator O1918 +)
  8063. Firing rl*prefer*rvt*predict-no*H0*6
  8064. -->
  8065. (S1 ^operator O1916 = 0.2298717920574965)
  8066. Firing rl*prefer*rvt*predict-yes*H0*5
  8067. -->
  8068. (S1 ^operator O1915 = 0.2939886829338975)
  8069. Firing prefer*rvt*predict-yes*H0
  8070. -->
  8071. Firing prefer*rvt*predict-no*H0
  8072. -->
  8073. Firing elaborate*copy-dir-to-output-link
  8074. -->
  8075. (I3 ^dir R +)
  8076. inner elaboration loop at bottom goal.
  8077. Retracting elaborate*copy-see-to-output-link
  8078. -->
  8079. (I3 ^see 0 +)
  8080. Retracting propose*predict-no
  8081. -->
  8082. (O1916 ^name predict-no +)
  8083. (S1 ^operator O1916 +)
  8084. Retracting propose*predict-yes
  8085. -->
  8086. (O1915 ^name predict-yes +)
  8087. (S1 ^operator O1915 +)
  8088. Retracting elaborate*reward*based*on*reward
  8089. -->
  8090. (R961 ^value 1 +)
  8091. (R1 ^reward R961 +)
  8092. Retracting elaborate*copy-dir-to-output-link
  8093. -->
  8094. (I3 ^dir L +)
  8095. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  8096. -->
  8097. (S1 ^operator O1916 = 0.6857507825115492)
  8098. Retracting rl*prefer*rvt*predict-no*H0*2
  8099. -->
  8100. (S1 ^operator O1916 = 0.3140215711634288)
  8101. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  8102. -->
  8103. (S1 ^operator O1915 = -0.3332708974800781)
  8104. Retracting rl*prefer*rvt*predict-yes*H0*1
  8105. -->
  8106. (S1 ^operator O1915 = 0.3804255857519139)
  8107. =>WM: (13427: S1 ^operator O1918 +)
  8108. =>WM: (13426: S1 ^operator O1917 +)
  8109. =>WM: (13425: I3 ^dir R)
  8110. =>WM: (13424: O1918 ^name predict-no)
  8111. =>WM: (13423: O1917 ^name predict-yes)
  8112. =>WM: (13422: R962 ^value 1)
  8113. =>WM: (13421: R1 ^reward R962)
  8114. <=WM: (13412: S1 ^operator O1915 +)
  8115. <=WM: (13413: S1 ^operator O1916 +)
  8116. <=WM: (13414: S1 ^operator O1916)
  8117. <=WM: (13383: I3 ^dir L)
  8118. <=WM: (13408: R1 ^reward R961)
  8119. <=WM: (13411: O1916 ^name predict-no)
  8120. <=WM: (13410: O1915 ^name predict-yes)
  8121. <=WM: (13409: R961 ^value 1)
  8122. --- Inner Elaboration Phase, active level 1 (S1) ---
  8123. Firing prefer*rvt*predict-yes*H0
  8124. -->
  8125. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8126. -->
  8127. (S1 ^operator O1917 = 0.7053811599250611)
  8128. Firing rl*prefer*rvt*predict-yes*H0*5
  8129. -->
  8130. (S1 ^operator O1917 = 0.2939886829338975)
  8131. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8132. -->
  8133. Firing prefer*rvt*predict-no*H0
  8134. -->
  8135. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8136. -->
  8137. (S1 ^operator O1918 = -0.2023211881870005)
  8138. Firing rl*prefer*rvt*predict-no*H0*6
  8139. -->
  8140. (S1 ^operator O1918 = 0.2298717920574965)
  8141. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8142. -->
  8143. inner elaboration loop at bottom goal.
  8144. Retracting rl*prefer*rvt*predict-no*H0*6
  8145. -->
  8146. (S1 ^operator O1916 = 0.2298717920574965)
  8147. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8148. -->
  8149. (S1 ^operator O1916 = -0.2023211881870005)
  8150. Retracting rl*prefer*rvt*predict-yes*H0*5
  8151. -->
  8152. (S1 ^operator O1915 = 0.2939886829338975)
  8153. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8154. -->
  8155. (S1 ^operator O1915 = 0.7053811599250611)
  8156. --- END Proposal Phase ---
  8157. --- Decision Phase ---
  8158. RL update rl*prefer*rvt*predict-no*H0*2 0.485031 -0.17101 0.314022 -> 0.485046 -0.171006 0.314041(R,m,v=1,0.85906,0.121894)
  8159. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514789 0.170962 0.685751 -> 0.514806 0.170967 0.685773(R,m,v=1,1,0)
  8160. =>WM: (13428: S1 ^operator O1917)
  8161. 959: O: O1917 (predict-yes)
  8162. --- END Decision Phase ---
  8163. --- Application Phase ---
  8164. --- Firing Productions (PE) For State At Depth 1 ---
  8165. --- Inner Elaboration Phase, active level 1 (S1) ---
  8166. Firing apply*operator
  8167. -->
  8168. (I3 ^predict-yes N959 + :O )
  8169. Firing apply*operator*complete
  8170. -->
  8171. (I3 ^predict-no N958 - :O )
  8172. inner elaboration loop at bottom goal.
  8173. --- Change Working Memory (PE) ---
  8174. =>WM: (13429: I3 ^predict-yes N959)
  8175. <=WM: (13416: N958 ^status complete)
  8176. <=WM: (13415: I3 ^predict-no N958)
  8177. --- Firing Productions (IE) For State At Depth 1 ---
  8178. --- Inner Elaboration Phase, active level 1 (S1) ---
  8179. Firing monitor*world
  8180. -->
  8181. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8182. --- Change Working Memory (IE) ---
  8183. --- END Application Phase ---
  8184. --- Output Phase ---
  8185. ENV: Agent did: predict-yes for direction R in state State-A
  8186. In State-A moving R
  8187. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8188. predict error 0
  8189. dir: dir isU
  8190. --- END Output Phase ---
  8191. -/|--- Input Phase ---
  8192. =>WM: (13433: I2 ^dir U)
  8193. =>WM: (13432: I2 ^reward 1)
  8194. =>WM: (13431: I2 ^see 1)
  8195. =>WM: (13430: N959 ^status complete)
  8196. <=WM: (13419: I2 ^dir R)
  8197. <=WM: (13418: I2 ^reward 1)
  8198. <=WM: (13417: I2 ^see 0)
  8199. =>WM: (13434: I2 ^level-1 R1-root)
  8200. <=WM: (13420: I2 ^level-1 L0-root)
  8201. --- END Input Phase ---
  8202. --- Proposal Phase ---
  8203. --- Inner Elaboration Phase, active level 1 (S1) ---
  8204. Firing elaborate*copy-see-to-output-link
  8205. -->
  8206. (I3 ^see 1 +)
  8207. Firing elaborate*reward*based*on*reward
  8208. -->
  8209. (R963 ^value 1 +)
  8210. (R1 ^reward R963 +)
  8211. Firing propose*predict-yes
  8212. -->
  8213. (O1919 ^name predict-yes +)
  8214. (S1 ^operator O1919 +)
  8215. Firing propose*predict-no
  8216. -->
  8217. (O1920 ^name predict-no +)
  8218. (S1 ^operator O1920 +)
  8219. Firing rl*prefer*rvt*predict-no*H0*4
  8220. -->
  8221. (S1 ^operator O1918 = 1.)
  8222. Firing rl*prefer*rvt*predict-yes*H0*3
  8223. -->
  8224. (S1 ^operator O1917 = 0.)
  8225. Firing prefer*rvt*predict-yes*H0
  8226. -->
  8227. Firing prefer*rvt*predict-no*H0
  8228. -->
  8229. Firing elaborate*copy-dir-to-output-link
  8230. -->
  8231. (I3 ^dir U +)
  8232. inner elaboration loop at bottom goal.
  8233. Retracting elaborate*copy-see-to-output-link
  8234. -->
  8235. (I3 ^see 0 +)
  8236. Retracting propose*predict-no
  8237. -->
  8238. (O1918 ^name predict-no +)
  8239. (S1 ^operator O1918 +)
  8240. Retracting propose*predict-yes
  8241. -->
  8242. (O1917 ^name predict-yes +)
  8243. (S1 ^operator O1917 +)
  8244. Retracting elaborate*reward*based*on*reward
  8245. -->
  8246. (R962 ^value 1 +)
  8247. (R1 ^reward R962 +)
  8248. Retracting elaborate*copy-dir-to-output-link
  8249. -->
  8250. (I3 ^dir R +)
  8251. Retracting rl*prefer*rvt*predict-no*H0*6
  8252. -->
  8253. (S1 ^operator O1918 = 0.2298717920574965)
  8254. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8255. -->
  8256. (S1 ^operator O1918 = -0.2023211881870005)
  8257. Retracting rl*prefer*rvt*predict-yes*H0*5
  8258. -->
  8259. (S1 ^operator O1917 = 0.2939886829338975)
  8260. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8261. -->
  8262. (S1 ^operator O1917 = 0.7053811599250611)
  8263. =>WM: (13442: S1 ^operator O1920 +)
  8264. =>WM: (13441: S1 ^operator O1919 +)
  8265. =>WM: (13440: I3 ^dir U)
  8266. =>WM: (13439: O1920 ^name predict-no)
  8267. =>WM: (13438: O1919 ^name predict-yes)
  8268. =>WM: (13437: R963 ^value 1)
  8269. =>WM: (13436: R1 ^reward R963)
  8270. =>WM: (13435: I3 ^see 1)
  8271. <=WM: (13426: S1 ^operator O1917 +)
  8272. <=WM: (13428: S1 ^operator O1917)
  8273. <=WM: (13427: S1 ^operator O1918 +)
  8274. <=WM: (13425: I3 ^dir R)
  8275. <=WM: (13421: R1 ^reward R962)
  8276. <=WM: (13407: I3 ^see 0)
  8277. <=WM: (13424: O1918 ^name predict-no)
  8278. <=WM: (13423: O1917 ^name predict-yes)
  8279. <=WM: (13422: R962 ^value 1)
  8280. --- Inner Elaboration Phase, active level 1 (S1) ---
  8281. Firing prefer*rvt*predict-yes*H0
  8282. -->
  8283. Firing rl*prefer*rvt*predict-yes*H0*3
  8284. -->
  8285. (S1 ^operator O1919 = 0.)
  8286. Firing prefer*rvt*predict-no*H0
  8287. -->
  8288. Firing rl*prefer*rvt*predict-no*H0*4
  8289. -->
  8290. (S1 ^operator O1920 = 1.)
  8291. inner elaboration loop at bottom goal.
  8292. Retracting rl*prefer*rvt*predict-no*H0*4
  8293. -->
  8294. (S1 ^operator O1918 = 1.)
  8295. Retracting rl*prefer*rvt*predict-yes*H0*3
  8296. -->
  8297. (S1 ^operator O1917 = 0.)
  8298. --- END Proposal Phase ---
  8299. --- Decision Phase ---
  8300. RL update rl*prefer*rvt*predict-yes*H0*5 0.501062 -0.207073 0.293989 -> 0.50111 -0.207069 0.294041(R,m,v=1,0.836735,0.137545)
  8301. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498366 0.207015 0.705381 -> 0.498423 0.207021 0.705444(R,m,v=1,1,0)
  8302. =>WM: (13443: S1 ^operator O1920)
  8303. 960: O: O1920 (predict-no)
  8304. --- END Decision Phase ---
  8305. --- Application Phase ---
  8306. --- Firing Productions (PE) For State At Depth 1 ---
  8307. --- Inner Elaboration Phase, active level 1 (S1) ---
  8308. Firing apply*operator
  8309. -->
  8310. (I3 ^predict-no N960 + :O )
  8311. Firing apply*operator*complete
  8312. -->
  8313. (I3 ^predict-yes N959 - :O )
  8314. inner elaboration loop at bottom goal.
  8315. --- Change Working Memory (PE) ---
  8316. =>WM: (13444: I3 ^predict-no N960)
  8317. <=WM: (13430: N959 ^status complete)
  8318. <=WM: (13429: I3 ^predict-yes N959)
  8319. --- Firing Productions (IE) For State At Depth 1 ---
  8320. --- Inner Elaboration Phase, active level 1 (S1) ---
  8321. Firing monitor*world
  8322. -->
  8323. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8324. --- Change Working Memory (IE) ---
  8325. --- END Application Phase ---
  8326. --- Output Phase ---
  8327. ENV: Agent did: predict-no for direction U in state State-B
  8328. In State-B moving U
  8329. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8330. predict error 0
  8331. dir: dir isU
  8332. --- END Output Phase ---
  8333. \---- Input Phase ---
  8334. =>WM: (13448: I2 ^dir U)
  8335. =>WM: (13447: I2 ^reward 1)
  8336. =>WM: (13446: I2 ^see 0)
  8337. =>WM: (13445: N960 ^status complete)
  8338. <=WM: (13433: I2 ^dir U)
  8339. <=WM: (13432: I2 ^reward 1)
  8340. <=WM: (13431: I2 ^see 1)
  8341. =>WM: (13449: I2 ^level-1 R1-root)
  8342. <=WM: (13434: I2 ^level-1 R1-root)
  8343. --- END Input Phase ---
  8344. --- Proposal Phase ---
  8345. --- Inner Elaboration Phase, active level 1 (S1) ---
  8346. Firing elaborate*copy-see-to-output-link
  8347. -->
  8348. (I3 ^see 0 +)
  8349. Firing elaborate*reward*based*on*reward
  8350. -->
  8351. (R964 ^value 1 +)
  8352. (R1 ^reward R964 +)
  8353. Firing propose*predict-yes
  8354. -->
  8355. (O1921 ^name predict-yes +)
  8356. (S1 ^operator O1921 +)
  8357. Firing propose*predict-no
  8358. -->
  8359. (O1922 ^name predict-no +)
  8360. (S1 ^operator O1922 +)
  8361. Firing rl*prefer*rvt*predict-no*H0*4
  8362. -->
  8363. (S1 ^operator O1920 = 1.)
  8364. Firing rl*prefer*rvt*predict-yes*H0*3
  8365. -->
  8366. (S1 ^operator O1919 = 0.)
  8367. Firing prefer*rvt*predict-yes*H0
  8368. -->
  8369. Firing prefer*rvt*predict-no*H0
  8370. -->
  8371. Firing elaborate*copy-dir-to-output-link
  8372. -->
  8373. (I3 ^dir U +)
  8374. inner elaboration loop at bottom goal.
  8375. Retracting elaborate*copy-see-to-output-link
  8376. -->
  8377. (I3 ^see 1 +)
  8378. Retracting propose*predict-no
  8379. -->
  8380. (O1920 ^name predict-no +)
  8381. (S1 ^operator O1920 +)
  8382. Retracting propose*predict-yes
  8383. -->
  8384. (O1919 ^name predict-yes +)
  8385. (S1 ^operator O1919 +)
  8386. Retracting elaborate*reward*based*on*reward
  8387. -->
  8388. (R963 ^value 1 +)
  8389. (R1 ^reward R963 +)
  8390. Retracting elaborate*copy-dir-to-output-link
  8391. -->
  8392. (I3 ^dir U +)
  8393. Retracting rl*prefer*rvt*predict-no*H0*4
  8394. -->
  8395. (S1 ^operator O1920 = 1.)
  8396. Retracting rl*prefer*rvt*predict-yes*H0*3
  8397. -->
  8398. (S1 ^operator O1919 = 0.)
  8399. =>WM: (13456: S1 ^operator O1922 +)
  8400. =>WM: (13455: S1 ^operator O1921 +)
  8401. =>WM: (13454: O1922 ^name predict-no)
  8402. =>WM: (13453: O1921 ^name predict-yes)
  8403. =>WM: (13452: R964 ^value 1)
  8404. =>WM: (13451: R1 ^reward R964)
  8405. =>WM: (13450: I3 ^see 0)
  8406. <=WM: (13441: S1 ^operator O1919 +)
  8407. <=WM: (13442: S1 ^operator O1920 +)
  8408. <=WM: (13443: S1 ^operator O1920)
  8409. <=WM: (13436: R1 ^reward R963)
  8410. <=WM: (13435: I3 ^see 1)
  8411. <=WM: (13439: O1920 ^name predict-no)
  8412. <=WM: (13438: O1919 ^name predict-yes)
  8413. <=WM: (13437: R963 ^value 1)
  8414. --- Inner Elaboration Phase, active level 1 (S1) ---
  8415. Firing prefer*rvt*predict-yes*H0
  8416. -->
  8417. Firing rl*prefer*rvt*predict-yes*H0*3
  8418. -->
  8419. (S1 ^operator O1921 = 0.)
  8420. Firing prefer*rvt*predict-no*H0
  8421. -->
  8422. Firing rl*prefer*rvt*predict-no*H0*4
  8423. -->
  8424. (S1 ^operator O1922 = 1.)
  8425. inner elaboration loop at bottom goal.
  8426. Retracting rl*prefer*rvt*predict-no*H0*4
  8427. -->
  8428. (S1 ^operator O1920 = 1.)
  8429. Retracting rl*prefer*rvt*predict-yes*H0*3
  8430. -->
  8431. (S1 ^operator O1919 = 0.)
  8432. --- END Proposal Phase ---
  8433. --- Decision Phase ---
  8434. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8435. =>WM: (13457: S1 ^operator O1922)
  8436. 961: O: O1922 (predict-no)
  8437. --- END Decision Phase ---
  8438. --- Application Phase ---
  8439. --- Firing Productions (PE) For State At Depth 1 ---
  8440. --- Inner Elaboration Phase, active level 1 (S1) ---
  8441. Firing apply*operator
  8442. -->
  8443. (I3 ^predict-no N961 + :O )
  8444. Firing apply*operator*complete
  8445. -->
  8446. (I3 ^predict-no N960 - :O )
  8447. inner elaboration loop at bottom goal.
  8448. --- Change Working Memory (PE) ---
  8449. =>WM: (13458: I3 ^predict-no N961)
  8450. <=WM: (13445: N960 ^status complete)
  8451. <=WM: (13444: I3 ^predict-no N960)
  8452. --- Firing Productions (IE) For State At Depth 1 ---
  8453. --- Inner Elaboration Phase, active level 1 (S1) ---
  8454. Firing monitor*world
  8455. -->
  8456. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8457. --- Change Working Memory (IE) ---
  8458. --- END Application Phase ---
  8459. --- Output Phase ---
  8460. ENV: Agent did: predict-no for direction U in state State-B
  8461. In State-B moving U
  8462. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8463. predict error 0
  8464. dir: dir isU
  8465. --- END Output Phase ---
  8466. /--- Input Phase ---
  8467. =>WM: (13462: I2 ^dir U)
  8468. =>WM: (13461: I2 ^reward 1)
  8469. =>WM: (13460: I2 ^see 0)
  8470. =>WM: (13459: N961 ^status complete)
  8471. <=WM: (13448: I2 ^dir U)
  8472. <=WM: (13447: I2 ^reward 1)
  8473. <=WM: (13446: I2 ^see 0)
  8474. =>WM: (13463: I2 ^level-1 R1-root)
  8475. <=WM: (13449: I2 ^level-1 R1-root)
  8476. --- END Input Phase ---
  8477. --- Proposal Phase ---
  8478. --- Inner Elaboration Phase, active level 1 (S1) ---
  8479. Firing elaborate*copy-see-to-output-link
  8480. -->
  8481. (I3 ^see 0 +)
  8482. Firing elaborate*reward*based*on*reward
  8483. -->
  8484. (R965 ^value 1 +)
  8485. (R1 ^reward R965 +)
  8486. Firing propose*predict-yes
  8487. -->
  8488. (O1923 ^name predict-yes +)
  8489. (S1 ^operator O1923 +)
  8490. Firing propose*predict-no
  8491. -->
  8492. (O1924 ^name predict-no +)
  8493. (S1 ^operator O1924 +)
  8494. Firing rl*prefer*rvt*predict-no*H0*4
  8495. -->
  8496. (S1 ^operator O1922 = 1.)
  8497. Firing rl*prefer*rvt*predict-yes*H0*3
  8498. -->
  8499. (S1 ^operator O1921 = 0.)
  8500. Firing prefer*rvt*predict-yes*H0
  8501. -->
  8502. Firing prefer*rvt*predict-no*H0
  8503. -->
  8504. Firing elaborate*copy-dir-to-output-link
  8505. -->
  8506. (I3 ^dir U +)
  8507. inner elaboration loop at bottom goal.
  8508. Retracting elaborate*copy-see-to-output-link
  8509. -->
  8510. (I3 ^see 0 +)
  8511. Retracting propose*predict-no
  8512. -->
  8513. (O1922 ^name predict-no +)
  8514. (S1 ^operator O1922 +)
  8515. Retracting propose*predict-yes
  8516. -->
  8517. (O1921 ^name predict-yes +)
  8518. (S1 ^operator O1921 +)
  8519. Retracting elaborate*reward*based*on*reward
  8520. -->
  8521. (R964 ^value 1 +)
  8522. (R1 ^reward R964 +)
  8523. Retracting elaborate*copy-dir-to-output-link
  8524. -->
  8525. (I3 ^dir U +)
  8526. Retracting rl*prefer*rvt*predict-no*H0*4
  8527. -->
  8528. (S1 ^operator O1922 = 1.)
  8529. Retracting rl*prefer*rvt*predict-yes*H0*3
  8530. -->
  8531. (S1 ^operator O1921 = 0.)
  8532. =>WM: (13469: S1 ^operator O1924 +)
  8533. =>WM: (13468: S1 ^operator O1923 +)
  8534. =>WM: (13467: O1924 ^name predict-no)
  8535. =>WM: (13466: O1923 ^name predict-yes)
  8536. =>WM: (13465: R965 ^value 1)
  8537. =>WM: (13464: R1 ^reward R965)
  8538. <=WM: (13455: S1 ^operator O1921 +)
  8539. <=WM: (13456: S1 ^operator O1922 +)
  8540. <=WM: (13457: S1 ^operator O1922)
  8541. <=WM: (13451: R1 ^reward R964)
  8542. <=WM: (13454: O1922 ^name predict-no)
  8543. <=WM: (13453: O1921 ^name predict-yes)
  8544. <=WM: (13452: R964 ^value 1)
  8545. --- Inner Elaboration Phase, active level 1 (S1) ---
  8546. Firing prefer*rvt*predict-yes*H0
  8547. -->
  8548. Firing rl*prefer*rvt*predict-yes*H0*3
  8549. -->
  8550. (S1 ^operator O1923 = 0.)
  8551. Firing prefer*rvt*predict-no*H0
  8552. -->
  8553. Firing rl*prefer*rvt*predict-no*H0*4
  8554. -->
  8555. (S1 ^operator O1924 = 1.)
  8556. inner elaboration loop at bottom goal.
  8557. Retracting rl*prefer*rvt*predict-no*H0*4
  8558. -->
  8559. (S1 ^operator O1922 = 1.)
  8560. Retracting rl*prefer*rvt*predict-yes*H0*3
  8561. -->
  8562. (S1 ^operator O1921 = 0.)
  8563. --- END Proposal Phase ---
  8564. --- Decision Phase ---
  8565. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8566. =>WM: (13470: S1 ^operator O1924)
  8567. 962: O: O1924 (predict-no)
  8568. --- END Decision Phase ---
  8569. --- Application Phase ---
  8570. --- Firing Productions (PE) For State At Depth 1 ---
  8571. --- Inner Elaboration Phase, active level 1 (S1) ---
  8572. Firing apply*operator
  8573. -->
  8574. (I3 ^predict-no N962 + :O )
  8575. Firing apply*operator*complete
  8576. -->
  8577. (I3 ^predict-no N961 - :O )
  8578. inner elaboration loop at bottom goal.
  8579. --- Change Working Memory (PE) ---
  8580. =>WM: (13471: I3 ^predict-no N962)
  8581. <=WM: (13459: N961 ^status complete)
  8582. <=WM: (13458: I3 ^predict-no N961)
  8583. --- Firing Productions (IE) For State At Depth 1 ---
  8584. --- Inner Elaboration Phase, active level 1 (S1) ---
  8585. Firing monitor*world
  8586. -->
  8587. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8588. --- Change Working Memory (IE) ---
  8589. --- END Application Phase ---
  8590. --- Output Phase ---
  8591. ENV: Agent did: predict-no for direction U in state State-B
  8592. In State-B moving U
  8593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8594. predict error 0
  8595. dir: dir isU
  8596. --- END Output Phase ---
  8597. |\--- Input Phase ---
  8598. =>WM: (13475: I2 ^dir U)
  8599. =>WM: (13474: I2 ^reward 1)
  8600. =>WM: (13473: I2 ^see 0)
  8601. =>WM: (13472: N962 ^status complete)
  8602. <=WM: (13462: I2 ^dir U)
  8603. <=WM: (13461: I2 ^reward 1)
  8604. <=WM: (13460: I2 ^see 0)
  8605. =>WM: (13476: I2 ^level-1 R1-root)
  8606. <=WM: (13463: I2 ^level-1 R1-root)
  8607. --- END Input Phase ---
  8608. --- Proposal Phase ---
  8609. --- Inner Elaboration Phase, active level 1 (S1) ---
  8610. Firing elaborate*copy-see-to-output-link
  8611. -->
  8612. (I3 ^see 0 +)
  8613. Firing elaborate*reward*based*on*reward
  8614. -->
  8615. (R966 ^value 1 +)
  8616. (R1 ^reward R966 +)
  8617. Firing propose*predict-yes
  8618. -->
  8619. (O1925 ^name predict-yes +)
  8620. (S1 ^operator O1925 +)
  8621. Firing propose*predict-no
  8622. -->
  8623. (O1926 ^name predict-no +)
  8624. (S1 ^operator O1926 +)
  8625. Firing rl*prefer*rvt*predict-no*H0*4
  8626. -->
  8627. (S1 ^operator O1924 = 1.)
  8628. Firing rl*prefer*rvt*predict-yes*H0*3
  8629. -->
  8630. (S1 ^operator O1923 = 0.)
  8631. Firing prefer*rvt*predict-yes*H0
  8632. -->
  8633. Firing prefer*rvt*predict-no*H0
  8634. -->
  8635. Firing elaborate*copy-dir-to-output-link
  8636. -->
  8637. (I3 ^dir U +)
  8638. inner elaboration loop at bottom goal.
  8639. Retracting elaborate*copy-see-to-output-link
  8640. -->
  8641. (I3 ^see 0 +)
  8642. Retracting propose*predict-no
  8643. -->
  8644. (O1924 ^name predict-no +)
  8645. (S1 ^operator O1924 +)
  8646. Retracting propose*predict-yes
  8647. -->
  8648. (O1923 ^name predict-yes +)
  8649. (S1 ^operator O1923 +)
  8650. Retracting elaborate*reward*based*on*reward
  8651. -->
  8652. (R965 ^value 1 +)
  8653. (R1 ^reward R965 +)
  8654. Retracting elaborate*copy-dir-to-output-link
  8655. -->
  8656. (I3 ^dir U +)
  8657. Retracting rl*prefer*rvt*predict-no*H0*4
  8658. -->
  8659. (S1 ^operator O1924 = 1.)
  8660. Retracting rl*prefer*rvt*predict-yes*H0*3
  8661. -->
  8662. (S1 ^operator O1923 = 0.)
  8663. =>WM: (13482: S1 ^operator O1926 +)
  8664. =>WM: (13481: S1 ^operator O1925 +)
  8665. =>WM: (13480: O1926 ^name predict-no)
  8666. =>WM: (13479: O1925 ^name predict-yes)
  8667. =>WM: (13478: R966 ^value 1)
  8668. =>WM: (13477: R1 ^reward R966)
  8669. <=WM: (13468: S1 ^operator O1923 +)
  8670. <=WM: (13469: S1 ^operator O1924 +)
  8671. <=WM: (13470: S1 ^operator O1924)
  8672. <=WM: (13464: R1 ^reward R965)
  8673. <=WM: (13467: O1924 ^name predict-no)
  8674. <=WM: (13466: O1923 ^name predict-yes)
  8675. <=WM: (13465: R965 ^value 1)
  8676. --- Inner Elaboration Phase, active level 1 (S1) ---
  8677. Firing prefer*rvt*predict-yes*H0
  8678. -->
  8679. Firing rl*prefer*rvt*predict-yes*H0*3
  8680. -->
  8681. (S1 ^operator O1925 = 0.)
  8682. Firing prefer*rvt*predict-no*H0
  8683. -->
  8684. Firing rl*prefer*rvt*predict-no*H0*4
  8685. -->
  8686. (S1 ^operator O1926 = 1.)
  8687. inner elaboration loop at bottom goal.
  8688. Retracting rl*prefer*rvt*predict-no*H0*4
  8689. -->
  8690. (S1 ^operator O1924 = 1.)
  8691. Retracting rl*prefer*rvt*predict-yes*H0*3
  8692. -->
  8693. (S1 ^operator O1923 = 0.)
  8694. --- END Proposal Phase ---
  8695. --- Decision Phase ---
  8696. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8697. =>WM: (13483: S1 ^operator O1926)
  8698. 963: O: O1926 (predict-no)
  8699. --- END Decision Phase ---
  8700. --- Application Phase ---
  8701. --- Firing Productions (PE) For State At Depth 1 ---
  8702. --- Inner Elaboration Phase, active level 1 (S1) ---
  8703. Firing apply*operator
  8704. -->
  8705. (I3 ^predict-no N963 + :O )
  8706. Firing apply*operator*complete
  8707. -->
  8708. (I3 ^predict-no N962 - :O )
  8709. inner elaboration loop at bottom goal.
  8710. --- Change Working Memory (PE) ---
  8711. =>WM: (13484: I3 ^predict-no N963)
  8712. <=WM: (13472: N962 ^status complete)
  8713. <=WM: (13471: I3 ^predict-no N962)
  8714. --- Firing Productions (IE) For State At Depth 1 ---
  8715. --- Inner Elaboration Phase, active level 1 (S1) ---
  8716. Firing monitor*world
  8717. -->
  8718. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8719. --- Change Working Memory (IE) ---
  8720. --- END Application Phase ---
  8721. --- Output Phase ---
  8722. ENV: Agent did: predict-no for direction U in state State-B
  8723. In State-B moving U
  8724. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8725. predict error 0
  8726. dir: dir isL
  8727. --- END Output Phase ---
  8728. ---- Input Phase ---
  8729. =>WM: (13488: I2 ^dir L)
  8730. =>WM: (13487: I2 ^reward 1)
  8731. =>WM: (13486: I2 ^see 0)
  8732. =>WM: (13485: N963 ^status complete)
  8733. <=WM: (13475: I2 ^dir U)
  8734. <=WM: (13474: I2 ^reward 1)
  8735. <=WM: (13473: I2 ^see 0)
  8736. =>WM: (13489: I2 ^level-1 R1-root)
  8737. <=WM: (13476: I2 ^level-1 R1-root)
  8738. --- END Input Phase ---
  8739. --- Proposal Phase ---
  8740. --- Inner Elaboration Phase, active level 1 (S1) ---
  8741. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8742. -->
  8743. (S1 ^operator O1925 = 0.619629119351056)
  8744. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8745. -->
  8746. (S1 ^operator O1926 = -0.1479504104026684)
  8747. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8748. -->
  8749. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8750. -->
  8751. Firing elaborate*copy-see-to-output-link
  8752. -->
  8753. (I3 ^see 0 +)
  8754. Firing elaborate*reward*based*on*reward
  8755. -->
  8756. (R967 ^value 1 +)
  8757. (R1 ^reward R967 +)
  8758. Firing propose*predict-yes
  8759. -->
  8760. (O1927 ^name predict-yes +)
  8761. (S1 ^operator O1927 +)
  8762. Firing propose*predict-no
  8763. -->
  8764. (O1928 ^name predict-no +)
  8765. (S1 ^operator O1928 +)
  8766. Firing rl*prefer*rvt*predict-no*H0*2
  8767. -->
  8768. (S1 ^operator O1926 = 0.3140405292214645)
  8769. Firing rl*prefer*rvt*predict-yes*H0*1
  8770. -->
  8771. (S1 ^operator O1925 = 0.3804255857519139)
  8772. Firing prefer*rvt*predict-yes*H0
  8773. -->
  8774. Firing prefer*rvt*predict-no*H0
  8775. -->
  8776. Firing elaborate*copy-dir-to-output-link
  8777. -->
  8778. (I3 ^dir L +)
  8779. inner elaboration loop at bottom goal.
  8780. Retracting elaborate*copy-see-to-output-link
  8781. -->
  8782. (I3 ^see 0 +)
  8783. Retracting propose*predict-no
  8784. -->
  8785. (O1926 ^name predict-no +)
  8786. (S1 ^operator O1926 +)
  8787. Retracting propose*predict-yes
  8788. -->
  8789. (O1925 ^name predict-yes +)
  8790. (S1 ^operator O1925 +)
  8791. Retracting elaborate*reward*based*on*reward
  8792. -->
  8793. (R966 ^value 1 +)
  8794. (R1 ^reward R966 +)
  8795. Retracting elaborate*copy-dir-to-output-link
  8796. -->
  8797. (I3 ^dir U +)
  8798. Retracting rl*prefer*rvt*predict-no*H0*4
  8799. -->
  8800. (S1 ^operator O1926 = 1.)
  8801. Retracting rl*prefer*rvt*predict-yes*H0*3
  8802. -->
  8803. (S1 ^operator O1925 = 0.)
  8804. =>WM: (13496: S1 ^operator O1928 +)
  8805. =>WM: (13495: S1 ^operator O1927 +)
  8806. =>WM: (13494: I3 ^dir L)
  8807. =>WM: (13493: O1928 ^name predict-no)
  8808. =>WM: (13492: O1927 ^name predict-yes)
  8809. =>WM: (13491: R967 ^value 1)
  8810. =>WM: (13490: R1 ^reward R967)
  8811. <=WM: (13481: S1 ^operator O1925 +)
  8812. <=WM: (13482: S1 ^operator O1926 +)
  8813. <=WM: (13483: S1 ^operator O1926)
  8814. <=WM: (13440: I3 ^dir U)
  8815. <=WM: (13477: R1 ^reward R966)
  8816. <=WM: (13480: O1926 ^name predict-no)
  8817. <=WM: (13479: O1925 ^name predict-yes)
  8818. <=WM: (13478: R966 ^value 1)
  8819. --- Inner Elaboration Phase, active level 1 (S1) ---
  8820. Firing prefer*rvt*predict-yes*H0
  8821. -->
  8822. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8823. -->
  8824. (S1 ^operator O1927 = 0.619629119351056)
  8825. Firing rl*prefer*rvt*predict-yes*H0*1
  8826. -->
  8827. (S1 ^operator O1927 = 0.3804255857519139)
  8828. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8829. -->
  8830. Firing prefer*rvt*predict-no*H0
  8831. -->
  8832. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8833. -->
  8834. (S1 ^operator O1928 = -0.1479504104026684)
  8835. Firing rl*prefer*rvt*predict-no*H0*2
  8836. -->
  8837. (S1 ^operator O1928 = 0.3140405292214645)
  8838. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8839. -->
  8840. inner elaboration loop at bottom goal.
  8841. Retracting rl*prefer*rvt*predict-no*H0*2
  8842. -->
  8843. (S1 ^operator O1926 = 0.3140405292214645)
  8844. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8845. -->
  8846. (S1 ^operator O1926 = -0.1479504104026684)
  8847. Retracting rl*prefer*rvt*predict-yes*H0*1
  8848. -->
  8849. (S1 ^operator O1925 = 0.3804255857519139)
  8850. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8851. -->
  8852. (S1 ^operator O1925 = 0.619629119351056)
  8853. --- END Proposal Phase ---
  8854. --- Decision Phase ---
  8855. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8856. =>WM: (13497: S1 ^operator O1927)
  8857. 964: O: O1927 (predict-yes)
  8858. --- END Decision Phase ---
  8859. --- Application Phase ---
  8860. --- Firing Productions (PE) For State At Depth 1 ---
  8861. --- Inner Elaboration Phase, active level 1 (S1) ---
  8862. Firing apply*operator
  8863. -->
  8864. (I3 ^predict-yes N964 + :O )
  8865. Firing apply*operator*complete
  8866. -->
  8867. (I3 ^predict-no N963 - :O )
  8868. inner elaboration loop at bottom goal.
  8869. --- Change Working Memory (PE) ---
  8870. =>WM: (13498: I3 ^predict-yes N964)
  8871. <=WM: (13485: N963 ^status complete)
  8872. <=WM: (13484: I3 ^predict-no N963)
  8873. --- Firing Productions (IE) For State At Depth 1 ---
  8874. --- Inner Elaboration Phase, active level 1 (S1) ---
  8875. Firing monitor*world
  8876. -->
  8877. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8878. --- Change Working Memory (IE) ---
  8879. --- END Application Phase ---
  8880. --- Output Phase ---
  8881. ENV: Agent did: predict-yes for direction L in state State-B
  8882. In State-B moving L
  8883. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8884. predict error 0
  8885. dir: dir isR
  8886. --- END Output Phase ---
  8887. /|\--- Input Phase ---
  8888. =>WM: (13502: I2 ^dir R)
  8889. =>WM: (13501: I2 ^reward 1)
  8890. =>WM: (13500: I2 ^see 1)
  8891. =>WM: (13499: N964 ^status complete)
  8892. <=WM: (13488: I2 ^dir L)
  8893. <=WM: (13487: I2 ^reward 1)
  8894. <=WM: (13486: I2 ^see 0)
  8895. =>WM: (13503: I2 ^level-1 L1-root)
  8896. <=WM: (13489: I2 ^level-1 R1-root)
  8897. --- END Input Phase ---
  8898. --- Proposal Phase ---
  8899. --- Inner Elaboration Phase, active level 1 (S1) ---
  8900. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  8901. -->
  8902. (S1 ^operator O1927 = 0.7065565782519569)
  8903. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  8904. -->
  8905. (S1 ^operator O1928 = -0.1937987592593187)
  8906. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8907. -->
  8908. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8909. -->
  8910. Firing elaborate*copy-see-to-output-link
  8911. -->
  8912. (I3 ^see 1 +)
  8913. Firing elaborate*reward*based*on*reward
  8914. -->
  8915. (R968 ^value 1 +)
  8916. (R1 ^reward R968 +)
  8917. Firing propose*predict-yes
  8918. -->
  8919. (O1929 ^name predict-yes +)
  8920. (S1 ^operator O1929 +)
  8921. Firing propose*predict-no
  8922. -->
  8923. (O1930 ^name predict-no +)
  8924. (S1 ^operator O1930 +)
  8925. Firing rl*prefer*rvt*predict-no*H0*6
  8926. -->
  8927. (S1 ^operator O1928 = 0.2298717920574965)
  8928. Firing rl*prefer*rvt*predict-yes*H0*5
  8929. -->
  8930. (S1 ^operator O1927 = 0.2940412798984666)
  8931. Firing prefer*rvt*predict-yes*H0
  8932. -->
  8933. Firing prefer*rvt*predict-no*H0
  8934. -->
  8935. Firing elaborate*copy-dir-to-output-link
  8936. -->
  8937. (I3 ^dir R +)
  8938. inner elaboration loop at bottom goal.
  8939. Retracting elaborate*copy-see-to-output-link
  8940. -->
  8941. (I3 ^see 0 +)
  8942. Retracting propose*predict-no
  8943. -->
  8944. (O1928 ^name predict-no +)
  8945. (S1 ^operator O1928 +)
  8946. Retracting propose*predict-yes
  8947. -->
  8948. (O1927 ^name predict-yes +)
  8949. (S1 ^operator O1927 +)
  8950. Retracting elaborate*reward*based*on*reward
  8951. -->
  8952. (R967 ^value 1 +)
  8953. (R1 ^reward R967 +)
  8954. Retracting elaborate*copy-dir-to-output-link
  8955. -->
  8956. (I3 ^dir L +)
  8957. Retracting rl*prefer*rvt*predict-no*H0*2
  8958. -->
  8959. (S1 ^operator O1928 = 0.3140405292214645)
  8960. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8961. -->
  8962. (S1 ^operator O1928 = -0.1479504104026684)
  8963. Retracting rl*prefer*rvt*predict-yes*H0*1
  8964. -->
  8965. (S1 ^operator O1927 = 0.3804255857519139)
  8966. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8967. -->
  8968. (S1 ^operator O1927 = 0.619629119351056)
  8969. =>WM: (13511: S1 ^operator O1930 +)
  8970. =>WM: (13510: S1 ^operator O1929 +)
  8971. =>WM: (13509: I3 ^dir R)
  8972. =>WM: (13508: O1930 ^name predict-no)
  8973. =>WM: (13507: O1929 ^name predict-yes)
  8974. =>WM: (13506: R968 ^value 1)
  8975. =>WM: (13505: R1 ^reward R968)
  8976. =>WM: (13504: I3 ^see 1)
  8977. <=WM: (13495: S1 ^operator O1927 +)
  8978. <=WM: (13497: S1 ^operator O1927)
  8979. <=WM: (13496: S1 ^operator O1928 +)
  8980. <=WM: (13494: I3 ^dir L)
  8981. <=WM: (13490: R1 ^reward R967)
  8982. <=WM: (13450: I3 ^see 0)
  8983. <=WM: (13493: O1928 ^name predict-no)
  8984. <=WM: (13492: O1927 ^name predict-yes)
  8985. <=WM: (13491: R967 ^value 1)
  8986. --- Inner Elaboration Phase, active level 1 (S1) ---
  8987. Firing prefer*rvt*predict-yes*H0
  8988. -->
  8989. Firing rl*prefer*rvt*predict-yes*H0*5
  8990. -->
  8991. (S1 ^operator O1929 = 0.2940412798984666)
  8992. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8993. -->
  8994. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  8995. -->
  8996. (S1 ^operator O1929 = 0.7065565782519569)
  8997. Firing prefer*rvt*predict-no*H0
  8998. -->
  8999. Firing rl*prefer*rvt*predict-no*H0*6
  9000. -->
  9001. (S1 ^operator O1930 = 0.2298717920574965)
  9002. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9003. -->
  9004. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9005. -->
  9006. (S1 ^operator O1930 = -0.1937987592593187)
  9007. inner elaboration loop at bottom goal.
  9008. Retracting rl*prefer*rvt*predict-no*H0*6
  9009. -->
  9010. (S1 ^operator O1928 = 0.2298717920574965)
  9011. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9012. -->
  9013. (S1 ^operator O1928 = -0.1937987592593187)
  9014. Retracting rl*prefer*rvt*predict-yes*H0*5
  9015. -->
  9016. (S1 ^operator O1927 = 0.2940412798984666)
  9017. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9018. -->
  9019. (S1 ^operator O1927 = 0.7065565782519569)
  9020. --- END Proposal Phase ---
  9021. --- Decision Phase ---
  9022. RL update rl*prefer*rvt*predict-yes*H0*1 0.521357 -0.140931 0.380426 -> 0.521352 -0.140931 0.380421(R,m,v=1,0.821656,0.147477)
  9023. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478703 0.140926 0.619629 -> 0.478697 0.140926 0.619624(R,m,v=1,1,0)
  9024. =>WM: (13512: S1 ^operator O1929)
  9025. 965: O: O1929 (predict-yes)
  9026. --- END Decision Phase ---
  9027. --- Application Phase ---
  9028. --- Firing Productions (PE) For State At Depth 1 ---
  9029. --- Inner Elaboration Phase, active level 1 (S1) ---
  9030. Firing apply*operator
  9031. -->
  9032. (I3 ^predict-yes N965 + :O )
  9033. Firing apply*operator*complete
  9034. -->
  9035. (I3 ^predict-yes N964 - :O )
  9036. inner elaboration loop at bottom goal.
  9037. --- Change Working Memory (PE) ---
  9038. =>WM: (13513: I3 ^predict-yes N965)
  9039. <=WM: (13499: N964 ^status complete)
  9040. <=WM: (13498: I3 ^predict-yes N964)
  9041. --- Firing Productions (IE) For State At Depth 1 ---
  9042. --- Inner Elaboration Phase, active level 1 (S1) ---
  9043. Firing monitor*world
  9044. -->
  9045. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9046. --- Change Working Memory (IE) ---
  9047. --- END Application Phase ---
  9048. --- Output Phase ---
  9049. ENV: Agent did: predict-yes for direction R in state State-A
  9050. In State-A moving R
  9051. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9052. predict error 0
  9053. dir: dir isU
  9054. --- END Output Phase ---
  9055. -/|--- Input Phase ---
  9056. =>WM: (13517: I2 ^dir U)
  9057. =>WM: (13516: I2 ^reward 1)
  9058. =>WM: (13515: I2 ^see 1)
  9059. =>WM: (13514: N965 ^status complete)
  9060. <=WM: (13502: I2 ^dir R)
  9061. <=WM: (13501: I2 ^reward 1)
  9062. <=WM: (13500: I2 ^see 1)
  9063. =>WM: (13518: I2 ^level-1 R1-root)
  9064. <=WM: (13503: I2 ^level-1 L1-root)
  9065. --- END Input Phase ---
  9066. --- Proposal Phase ---
  9067. --- Inner Elaboration Phase, active level 1 (S1) ---
  9068. Firing elaborate*copy-see-to-output-link
  9069. -->
  9070. (I3 ^see 1 +)
  9071. Firing elaborate*reward*based*on*reward
  9072. -->
  9073. (R969 ^value 1 +)
  9074. (R1 ^reward R969 +)
  9075. Firing propose*predict-yes
  9076. -->
  9077. (O1931 ^name predict-yes +)
  9078. (S1 ^operator O1931 +)
  9079. Firing propose*predict-no
  9080. -->
  9081. (O1932 ^name predict-no +)
  9082. (S1 ^operator O1932 +)
  9083. Firing rl*prefer*rvt*predict-no*H0*4
  9084. -->
  9085. (S1 ^operator O1930 = 1.)
  9086. Firing rl*prefer*rvt*predict-yes*H0*3
  9087. -->
  9088. (S1 ^operator O1929 = 0.)
  9089. Firing prefer*rvt*predict-yes*H0
  9090. -->
  9091. Firing prefer*rvt*predict-no*H0
  9092. -->
  9093. Firing elaborate*copy-dir-to-output-link
  9094. -->
  9095. (I3 ^dir U +)
  9096. inner elaboration loop at bottom goal.
  9097. Retracting elaborate*copy-see-to-output-link
  9098. -->
  9099. (I3 ^see 1 +)
  9100. Retracting propose*predict-no
  9101. -->
  9102. (O1930 ^name predict-no +)
  9103. (S1 ^operator O1930 +)
  9104. Retracting propose*predict-yes
  9105. -->
  9106. (O1929 ^name predict-yes +)
  9107. (S1 ^operator O1929 +)
  9108. Retracting elaborate*reward*based*on*reward
  9109. -->
  9110. (R968 ^value 1 +)
  9111. (R1 ^reward R968 +)
  9112. Retracting elaborate*copy-dir-to-output-link
  9113. -->
  9114. (I3 ^dir R +)
  9115. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9116. -->
  9117. (S1 ^operator O1930 = -0.1937987592593187)
  9118. Retracting rl*prefer*rvt*predict-no*H0*6
  9119. -->
  9120. (S1 ^operator O1930 = 0.2298717920574965)
  9121. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9122. -->
  9123. (S1 ^operator O1929 = 0.7065565782519569)
  9124. Retracting rl*prefer*rvt*predict-yes*H0*5
  9125. -->
  9126. (S1 ^operator O1929 = 0.2940412798984666)
  9127. =>WM: (13525: S1 ^operator O1932 +)
  9128. =>WM: (13524: S1 ^operator O1931 +)
  9129. =>WM: (13523: I3 ^dir U)
  9130. =>WM: (13522: O1932 ^name predict-no)
  9131. =>WM: (13521: O1931 ^name predict-yes)
  9132. =>WM: (13520: R969 ^value 1)
  9133. =>WM: (13519: R1 ^reward R969)
  9134. <=WM: (13510: S1 ^operator O1929 +)
  9135. <=WM: (13512: S1 ^operator O1929)
  9136. <=WM: (13511: S1 ^operator O1930 +)
  9137. <=WM: (13509: I3 ^dir R)
  9138. <=WM: (13505: R1 ^reward R968)
  9139. <=WM: (13508: O1930 ^name predict-no)
  9140. <=WM: (13507: O1929 ^name predict-yes)
  9141. <=WM: (13506: R968 ^value 1)
  9142. --- Inner Elaboration Phase, active level 1 (S1) ---
  9143. Firing prefer*rvt*predict-yes*H0
  9144. -->
  9145. Firing rl*prefer*rvt*predict-yes*H0*3
  9146. -->
  9147. (S1 ^operator O1931 = 0.)
  9148. Firing prefer*rvt*predict-no*H0
  9149. -->
  9150. Firing rl*prefer*rvt*predict-no*H0*4
  9151. -->
  9152. (S1 ^operator O1932 = 1.)
  9153. inner elaboration loop at bottom goal.
  9154. Retracting rl*prefer*rvt*predict-no*H0*4
  9155. -->
  9156. (S1 ^operator O1930 = 1.)
  9157. Retracting rl*prefer*rvt*predict-yes*H0*3
  9158. -->
  9159. (S1 ^operator O1929 = 0.)
  9160. --- END Proposal Phase ---
  9161. --- Decision Phase ---
  9162. RL update rl*prefer*rvt*predict-yes*H0*5 0.50111 -0.207069 0.294041 -> 0.501065 -0.207074 0.293991(R,m,v=1,0.837838,0.13679)
  9163. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499427 0.207129 0.706557 -> 0.499374 0.207123 0.706498(R,m,v=1,1,0)
  9164. =>WM: (13526: S1 ^operator O1932)
  9165. 966: O: O1932 (predict-no)
  9166. --- END Decision Phase ---
  9167. --- Application Phase ---
  9168. --- Firing Productions (PE) For State At Depth 1 ---
  9169. --- Inner Elaboration Phase, active level 1 (S1) ---
  9170. Firing apply*operator
  9171. -->
  9172. (I3 ^predict-no N966 + :O )
  9173. Firing apply*operator*complete
  9174. -->
  9175. (I3 ^predict-yes N965 - :O )
  9176. inner elaboration loop at bottom goal.
  9177. --- Change Working Memory (PE) ---
  9178. =>WM: (13527: I3 ^predict-no N966)
  9179. <=WM: (13514: N965 ^status complete)
  9180. <=WM: (13513: I3 ^predict-yes N965)
  9181. --- Firing Productions (IE) For State At Depth 1 ---
  9182. --- Inner Elaboration Phase, active level 1 (S1) ---
  9183. Firing monitor*world
  9184. -->
  9185. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9186. --- Change Working Memory (IE) ---
  9187. --- END Application Phase ---
  9188. --- Output Phase ---
  9189. ENV: Agent did: predict-no for direction U in state State-B
  9190. In State-B moving U
  9191. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9192. predict error 0
  9193. dir: dir isL
  9194. --- END Output Phase ---
  9195. \-/--- Input Phase ---
  9196. =>WM: (13531: I2 ^dir L)
  9197. =>WM: (13530: I2 ^reward 1)
  9198. =>WM: (13529: I2 ^see 0)
  9199. =>WM: (13528: N966 ^status complete)
  9200. <=WM: (13517: I2 ^dir U)
  9201. <=WM: (13516: I2 ^reward 1)
  9202. <=WM: (13515: I2 ^see 1)
  9203. =>WM: (13532: I2 ^level-1 R1-root)
  9204. <=WM: (13518: I2 ^level-1 R1-root)
  9205. --- END Input Phase ---
  9206. --- Proposal Phase ---
  9207. --- Inner Elaboration Phase, active level 1 (S1) ---
  9208. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9209. -->
  9210. (S1 ^operator O1931 = 0.6196238010864294)
  9211. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9212. -->
  9213. (S1 ^operator O1932 = -0.1479504104026684)
  9214. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9215. -->
  9216. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9217. -->
  9218. Firing elaborate*copy-see-to-output-link
  9219. -->
  9220. (I3 ^see 0 +)
  9221. Firing elaborate*reward*based*on*reward
  9222. -->
  9223. (R970 ^value 1 +)
  9224. (R1 ^reward R970 +)
  9225. Firing propose*predict-yes
  9226. -->
  9227. (O1933 ^name predict-yes +)
  9228. (S1 ^operator O1933 +)
  9229. Firing propose*predict-no
  9230. -->
  9231. (O1934 ^name predict-no +)
  9232. (S1 ^operator O1934 +)
  9233. Firing rl*prefer*rvt*predict-no*H0*2
  9234. -->
  9235. (S1 ^operator O1932 = 0.3140405292214645)
  9236. Firing rl*prefer*rvt*predict-yes*H0*1
  9237. -->
  9238. (S1 ^operator O1931 = 0.380421069331616)
  9239. Firing prefer*rvt*predict-yes*H0
  9240. -->
  9241. Firing prefer*rvt*predict-no*H0
  9242. -->
  9243. Firing elaborate*copy-dir-to-output-link
  9244. -->
  9245. (I3 ^dir L +)
  9246. inner elaboration loop at bottom goal.
  9247. Retracting elaborate*copy-see-to-output-link
  9248. -->
  9249. (I3 ^see 1 +)
  9250. Retracting propose*predict-no
  9251. -->
  9252. (O1932 ^name predict-no +)
  9253. (S1 ^operator O1932 +)
  9254. Retracting propose*predict-yes
  9255. -->
  9256. (O1931 ^name predict-yes +)
  9257. (S1 ^operator O1931 +)
  9258. Retracting elaborate*reward*based*on*reward
  9259. -->
  9260. (R969 ^value 1 +)
  9261. (R1 ^reward R969 +)
  9262. Retracting elaborate*copy-dir-to-output-link
  9263. -->
  9264. (I3 ^dir U +)
  9265. Retracting rl*prefer*rvt*predict-no*H0*4
  9266. -->
  9267. (S1 ^operator O1932 = 1.)
  9268. Retracting rl*prefer*rvt*predict-yes*H0*3
  9269. -->
  9270. (S1 ^operator O1931 = 0.)
  9271. =>WM: (13540: S1 ^operator O1934 +)
  9272. =>WM: (13539: S1 ^operator O1933 +)
  9273. =>WM: (13538: I3 ^dir L)
  9274. =>WM: (13537: O1934 ^name predict-no)
  9275. =>WM: (13536: O1933 ^name predict-yes)
  9276. =>WM: (13535: R970 ^value 1)
  9277. =>WM: (13534: R1 ^reward R970)
  9278. =>WM: (13533: I3 ^see 0)
  9279. <=WM: (13524: S1 ^operator O1931 +)
  9280. <=WM: (13525: S1 ^operator O1932 +)
  9281. <=WM: (13526: S1 ^operator O1932)
  9282. <=WM: (13523: I3 ^dir U)
  9283. <=WM: (13519: R1 ^reward R969)
  9284. <=WM: (13504: I3 ^see 1)
  9285. <=WM: (13522: O1932 ^name predict-no)
  9286. <=WM: (13521: O1931 ^name predict-yes)
  9287. <=WM: (13520: R969 ^value 1)
  9288. --- Inner Elaboration Phase, active level 1 (S1) ---
  9289. Firing prefer*rvt*predict-yes*H0
  9290. -->
  9291. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9292. -->
  9293. (S1 ^operator O1933 = 0.6196238010864294)
  9294. Firing rl*prefer*rvt*predict-yes*H0*1
  9295. -->
  9296. (S1 ^operator O1933 = 0.380421069331616)
  9297. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9298. -->
  9299. Firing prefer*rvt*predict-no*H0
  9300. -->
  9301. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9302. -->
  9303. (S1 ^operator O1934 = -0.1479504104026684)
  9304. Firing rl*prefer*rvt*predict-no*H0*2
  9305. -->
  9306. (S1 ^operator O1934 = 0.3140405292214645)
  9307. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9308. -->
  9309. inner elaboration loop at bottom goal.
  9310. Retracting rl*prefer*rvt*predict-no*H0*2
  9311. -->
  9312. (S1 ^operator O1932 = 0.3140405292214645)
  9313. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9314. -->
  9315. (S1 ^operator O1932 = -0.1479504104026684)
  9316. Retracting rl*prefer*rvt*predict-yes*H0*1
  9317. -->
  9318. (S1 ^operator O1931 = 0.380421069331616)
  9319. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9320. -->
  9321. (S1 ^operator O1931 = 0.6196238010864294)
  9322. --- END Proposal Phase ---
  9323. --- Decision Phase ---
  9324. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9325. =>WM: (13541: S1 ^operator O1933)
  9326. 967: O: O1933 (predict-yes)
  9327. --- END Decision Phase ---
  9328. --- Application Phase ---
  9329. --- Firing Productions (PE) For State At Depth 1 ---
  9330. --- Inner Elaboration Phase, active level 1 (S1) ---
  9331. Firing apply*operator
  9332. -->
  9333. (I3 ^predict-yes N967 + :O )
  9334. Firing apply*operator*complete
  9335. -->
  9336. (I3 ^predict-no N966 - :O )
  9337. inner elaboration loop at bottom goal.
  9338. --- Change Working Memory (PE) ---
  9339. =>WM: (13542: I3 ^predict-yes N967)
  9340. <=WM: (13528: N966 ^status complete)
  9341. <=WM: (13527: I3 ^predict-no N966)
  9342. --- Firing Productions (IE) For State At Depth 1 ---
  9343. --- Inner Elaboration Phase, active level 1 (S1) ---
  9344. Firing monitor*world
  9345. -->
  9346. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9347. --- Change Working Memory (IE) ---
  9348. --- END Application Phase ---
  9349. --- Output Phase ---
  9350. ENV: Agent did: predict-yes for direction L in state State-B
  9351. In State-B moving L
  9352. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9353. predict error 0
  9354. dir: dir isR
  9355. --- END Output Phase ---
  9356. |\---- Input Phase ---
  9357. =>WM: (13546: I2 ^dir R)
  9358. =>WM: (13545: I2 ^reward 1)
  9359. =>WM: (13544: I2 ^see 1)
  9360. =>WM: (13543: N967 ^status complete)
  9361. <=WM: (13531: I2 ^dir L)
  9362. <=WM: (13530: I2 ^reward 1)
  9363. <=WM: (13529: I2 ^see 0)
  9364. =>WM: (13547: I2 ^level-1 L1-root)
  9365. <=WM: (13532: I2 ^level-1 R1-root)
  9366. --- END Input Phase ---
  9367. --- Proposal Phase ---
  9368. --- Inner Elaboration Phase, active level 1 (S1) ---
  9369. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9370. -->
  9371. (S1 ^operator O1933 = 0.7064977054068989)
  9372. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9373. -->
  9374. (S1 ^operator O1934 = -0.1937987592593187)
  9375. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9376. -->
  9377. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9378. -->
  9379. Firing elaborate*copy-see-to-output-link
  9380. -->
  9381. (I3 ^see 1 +)
  9382. Firing elaborate*reward*based*on*reward
  9383. -->
  9384. (R971 ^value 1 +)
  9385. (R1 ^reward R971 +)
  9386. Firing propose*predict-yes
  9387. -->
  9388. (O1935 ^name predict-yes +)
  9389. (S1 ^operator O1935 +)
  9390. Firing propose*predict-no
  9391. -->
  9392. (O1936 ^name predict-no +)
  9393. (S1 ^operator O1936 +)
  9394. Firing rl*prefer*rvt*predict-no*H0*6
  9395. -->
  9396. (S1 ^operator O1934 = 0.2298717920574965)
  9397. Firing rl*prefer*rvt*predict-yes*H0*5
  9398. -->
  9399. (S1 ^operator O1933 = 0.2939914352270483)
  9400. Firing prefer*rvt*predict-yes*H0
  9401. -->
  9402. Firing prefer*rvt*predict-no*H0
  9403. -->
  9404. Firing elaborate*copy-dir-to-output-link
  9405. -->
  9406. (I3 ^dir R +)
  9407. inner elaboration loop at bottom goal.
  9408. Retracting elaborate*copy-see-to-output-link
  9409. -->
  9410. (I3 ^see 0 +)
  9411. Retracting propose*predict-no
  9412. -->
  9413. (O1934 ^name predict-no +)
  9414. (S1 ^operator O1934 +)
  9415. Retracting propose*predict-yes
  9416. -->
  9417. (O1933 ^name predict-yes +)
  9418. (S1 ^operator O1933 +)
  9419. Retracting elaborate*reward*based*on*reward
  9420. -->
  9421. (R970 ^value 1 +)
  9422. (R1 ^reward R970 +)
  9423. Retracting elaborate*copy-dir-to-output-link
  9424. -->
  9425. (I3 ^dir L +)
  9426. Retracting rl*prefer*rvt*predict-no*H0*2
  9427. -->
  9428. (S1 ^operator O1934 = 0.3140405292214645)
  9429. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9430. -->
  9431. (S1 ^operator O1934 = -0.1479504104026684)
  9432. Retracting rl*prefer*rvt*predict-yes*H0*1
  9433. -->
  9434. (S1 ^operator O1933 = 0.380421069331616)
  9435. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9436. -->
  9437. (S1 ^operator O1933 = 0.6196238010864294)
  9438. =>WM: (13555: S1 ^operator O1936 +)
  9439. =>WM: (13554: S1 ^operator O1935 +)
  9440. =>WM: (13553: I3 ^dir R)
  9441. =>WM: (13552: O1936 ^name predict-no)
  9442. =>WM: (13551: O1935 ^name predict-yes)
  9443. =>WM: (13550: R971 ^value 1)
  9444. =>WM: (13549: R1 ^reward R971)
  9445. =>WM: (13548: I3 ^see 1)
  9446. <=WM: (13539: S1 ^operator O1933 +)
  9447. <=WM: (13541: S1 ^operator O1933)
  9448. <=WM: (13540: S1 ^operator O1934 +)
  9449. <=WM: (13538: I3 ^dir L)
  9450. <=WM: (13534: R1 ^reward R970)
  9451. <=WM: (13533: I3 ^see 0)
  9452. <=WM: (13537: O1934 ^name predict-no)
  9453. <=WM: (13536: O1933 ^name predict-yes)
  9454. <=WM: (13535: R970 ^value 1)
  9455. --- Inner Elaboration Phase, active level 1 (S1) ---
  9456. Firing prefer*rvt*predict-yes*H0
  9457. -->
  9458. Firing rl*prefer*rvt*predict-yes*H0*5
  9459. -->
  9460. (S1 ^operator O1935 = 0.2939914352270483)
  9461. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9462. -->
  9463. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9464. -->
  9465. (S1 ^operator O1935 = 0.7064977054068989)
  9466. Firing prefer*rvt*predict-no*H0
  9467. -->
  9468. Firing rl*prefer*rvt*predict-no*H0*6
  9469. -->
  9470. (S1 ^operator O1936 = 0.2298717920574965)
  9471. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9472. -->
  9473. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9474. -->
  9475. (S1 ^operator O1936 = -0.1937987592593187)
  9476. inner elaboration loop at bottom goal.
  9477. Retracting rl*prefer*rvt*predict-no*H0*6
  9478. -->
  9479. (S1 ^operator O1934 = 0.2298717920574965)
  9480. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9481. -->
  9482. (S1 ^operator O1934 = -0.1937987592593187)
  9483. Retracting rl*prefer*rvt*predict-yes*H0*5
  9484. -->
  9485. (S1 ^operator O1933 = 0.2939914352270483)
  9486. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9487. -->
  9488. (S1 ^operator O1933 = 0.7064977054068989)
  9489. --- END Proposal Phase ---
  9490. --- Decision Phase ---
  9491. RL update rl*prefer*rvt*predict-yes*H0*1 0.521352 -0.140931 0.380421 -> 0.521348 -0.14093 0.380417(R,m,v=1,0.822785,0.146739)
  9492. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478697 0.140926 0.619624 -> 0.478693 0.140927 0.619619(R,m,v=1,1,0)
  9493. =>WM: (13556: S1 ^operator O1935)
  9494. 968: O: O1935 (predict-yes)
  9495. --- END Decision Phase ---
  9496. --- Application Phase ---
  9497. --- Firing Productions (PE) For State At Depth 1 ---
  9498. --- Inner Elaboration Phase, active level 1 (S1) ---
  9499. Firing apply*operator
  9500. -->
  9501. (I3 ^predict-yes N968 + :O )
  9502. Firing apply*operator*complete
  9503. -->
  9504. (I3 ^predict-yes N967 - :O )
  9505. inner elaboration loop at bottom goal.
  9506. --- Change Working Memory (PE) ---
  9507. =>WM: (13557: I3 ^predict-yes N968)
  9508. <=WM: (13543: N967 ^status complete)
  9509. <=WM: (13542: I3 ^predict-yes N967)
  9510. --- Firing Productions (IE) For State At Depth 1 ---
  9511. --- Inner Elaboration Phase, active level 1 (S1) ---
  9512. Firing monitor*world
  9513. -->
  9514. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9515. --- Change Working Memory (IE) ---
  9516. --- END Application Phase ---
  9517. --- Output Phase ---
  9518. ENV: Agent did: predict-yes for direction R in state State-A
  9519. In State-A moving R
  9520. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9521. predict error 0
  9522. dir: dir isU
  9523. --- END Output Phase ---
  9524. /|\--- Input Phase ---
  9525. =>WM: (13561: I2 ^dir U)
  9526. =>WM: (13560: I2 ^reward 1)
  9527. =>WM: (13559: I2 ^see 1)
  9528. =>WM: (13558: N968 ^status complete)
  9529. <=WM: (13546: I2 ^dir R)
  9530. <=WM: (13545: I2 ^reward 1)
  9531. <=WM: (13544: I2 ^see 1)
  9532. =>WM: (13562: I2 ^level-1 R1-root)
  9533. <=WM: (13547: I2 ^level-1 L1-root)
  9534. --- END Input Phase ---
  9535. --- Proposal Phase ---
  9536. --- Inner Elaboration Phase, active level 1 (S1) ---
  9537. Firing elaborate*copy-see-to-output-link
  9538. -->
  9539. (I3 ^see 1 +)
  9540. Firing elaborate*reward*based*on*reward
  9541. -->
  9542. (R972 ^value 1 +)
  9543. (R1 ^reward R972 +)
  9544. Firing propose*predict-yes
  9545. -->
  9546. (O1937 ^name predict-yes +)
  9547. (S1 ^operator O1937 +)
  9548. Firing propose*predict-no
  9549. -->
  9550. (O1938 ^name predict-no +)
  9551. (S1 ^operator O1938 +)
  9552. Firing rl*prefer*rvt*predict-no*H0*4
  9553. -->
  9554. (S1 ^operator O1936 = 1.)
  9555. Firing rl*prefer*rvt*predict-yes*H0*3
  9556. -->
  9557. (S1 ^operator O1935 = 0.)
  9558. Firing prefer*rvt*predict-yes*H0
  9559. -->
  9560. Firing prefer*rvt*predict-no*H0
  9561. -->
  9562. Firing elaborate*copy-dir-to-output-link
  9563. -->
  9564. (I3 ^dir U +)
  9565. inner elaboration loop at bottom goal.
  9566. Retracting elaborate*copy-see-to-output-link
  9567. -->
  9568. (I3 ^see 1 +)
  9569. Retracting propose*predict-no
  9570. -->
  9571. (O1936 ^name predict-no +)
  9572. (S1 ^operator O1936 +)
  9573. Retracting propose*predict-yes
  9574. -->
  9575. (O1935 ^name predict-yes +)
  9576. (S1 ^operator O1935 +)
  9577. Retracting elaborate*reward*based*on*reward
  9578. -->
  9579. (R971 ^value 1 +)
  9580. (R1 ^reward R971 +)
  9581. Retracting elaborate*copy-dir-to-output-link
  9582. -->
  9583. (I3 ^dir R +)
  9584. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9585. -->
  9586. (S1 ^operator O1936 = -0.1937987592593187)
  9587. Retracting rl*prefer*rvt*predict-no*H0*6
  9588. -->
  9589. (S1 ^operator O1936 = 0.2298717920574965)
  9590. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9591. -->
  9592. (S1 ^operator O1935 = 0.7064977054068989)
  9593. Retracting rl*prefer*rvt*predict-yes*H0*5
  9594. -->
  9595. (S1 ^operator O1935 = 0.2939914352270483)
  9596. =>WM: (13569: S1 ^operator O1938 +)
  9597. =>WM: (13568: S1 ^operator O1937 +)
  9598. =>WM: (13567: I3 ^dir U)
  9599. =>WM: (13566: O1938 ^name predict-no)
  9600. =>WM: (13565: O1937 ^name predict-yes)
  9601. =>WM: (13564: R972 ^value 1)
  9602. =>WM: (13563: R1 ^reward R972)
  9603. <=WM: (13554: S1 ^operator O1935 +)
  9604. <=WM: (13556: S1 ^operator O1935)
  9605. <=WM: (13555: S1 ^operator O1936 +)
  9606. <=WM: (13553: I3 ^dir R)
  9607. <=WM: (13549: R1 ^reward R971)
  9608. <=WM: (13552: O1936 ^name predict-no)
  9609. <=WM: (13551: O1935 ^name predict-yes)
  9610. <=WM: (13550: R971 ^value 1)
  9611. --- Inner Elaboration Phase, active level 1 (S1) ---
  9612. Firing prefer*rvt*predict-yes*H0
  9613. -->
  9614. Firing rl*prefer*rvt*predict-yes*H0*3
  9615. -->
  9616. (S1 ^operator O1937 = 0.)
  9617. Firing prefer*rvt*predict-no*H0
  9618. -->
  9619. Firing rl*prefer*rvt*predict-no*H0*4
  9620. -->
  9621. (S1 ^operator O1938 = 1.)
  9622. inner elaboration loop at bottom goal.
  9623. Retracting rl*prefer*rvt*predict-no*H0*4
  9624. -->
  9625. (S1 ^operator O1936 = 1.)
  9626. Retracting rl*prefer*rvt*predict-yes*H0*3
  9627. -->
  9628. (S1 ^operator O1935 = 0.)
  9629. --- END Proposal Phase ---
  9630. --- Decision Phase ---
  9631. RL update rl*prefer*rvt*predict-yes*H0*5 0.501065 -0.207074 0.293991 -> 0.501028 -0.207078 0.293951(R,m,v=1,0.838926,0.136042)
  9632. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499374 0.207123 0.706498 -> 0.499331 0.207118 0.70645(R,m,v=1,1,0)
  9633. =>WM: (13570: S1 ^operator O1938)
  9634. 969: O: O1938 (predict-no)
  9635. --- END Decision Phase ---
  9636. --- Application Phase ---
  9637. --- Firing Productions (PE) For State At Depth 1 ---
  9638. --- Inner Elaboration Phase, active level 1 (S1) ---
  9639. Firing apply*operator
  9640. -->
  9641. (I3 ^predict-no N969 + :O )
  9642. Firing apply*operator*complete
  9643. -->
  9644. (I3 ^predict-yes N968 - :O )
  9645. inner elaboration loop at bottom goal.
  9646. --- Change Working Memory (PE) ---
  9647. =>WM: (13571: I3 ^predict-no N969)
  9648. <=WM: (13558: N968 ^status complete)
  9649. <=WM: (13557: I3 ^predict-yes N968)
  9650. --- Firing Productions (IE) For State At Depth 1 ---
  9651. --- Inner Elaboration Phase, active level 1 (S1) ---
  9652. Firing monitor*world
  9653. -->
  9654. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9655. --- Change Working Memory (IE) ---
  9656. --- END Application Phase ---
  9657. --- Output Phase ---
  9658. ENV: Agent did: predict-no for direction U in state State-B
  9659. In State-B moving U
  9660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9661. predict error 0
  9662. dir: dir isL
  9663. --- END Output Phase ---
  9664. -/|--- Input Phase ---
  9665. =>WM: (13575: I2 ^dir L)
  9666. =>WM: (13574: I2 ^reward 1)
  9667. =>WM: (13573: I2 ^see 0)
  9668. =>WM: (13572: N969 ^status complete)
  9669. <=WM: (13561: I2 ^dir U)
  9670. <=WM: (13560: I2 ^reward 1)
  9671. <=WM: (13559: I2 ^see 1)
  9672. =>WM: (13576: I2 ^level-1 R1-root)
  9673. <=WM: (13562: I2 ^level-1 R1-root)
  9674. --- END Input Phase ---
  9675. --- Proposal Phase ---
  9676. --- Inner Elaboration Phase, active level 1 (S1) ---
  9677. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9678. -->
  9679. (S1 ^operator O1937 = 0.6196194522363663)
  9680. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9681. -->
  9682. (S1 ^operator O1938 = -0.1479504104026684)
  9683. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9684. -->
  9685. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9686. -->
  9687. Firing elaborate*copy-see-to-output-link
  9688. -->
  9689. (I3 ^see 0 +)
  9690. Firing elaborate*reward*based*on*reward
  9691. -->
  9692. (R973 ^value 1 +)
  9693. (R1 ^reward R973 +)
  9694. Firing propose*predict-yes
  9695. -->
  9696. (O1939 ^name predict-yes +)
  9697. (S1 ^operator O1939 +)
  9698. Firing propose*predict-no
  9699. -->
  9700. (O1940 ^name predict-no +)
  9701. (S1 ^operator O1940 +)
  9702. Firing rl*prefer*rvt*predict-no*H0*2
  9703. -->
  9704. (S1 ^operator O1938 = 0.3140405292214645)
  9705. Firing rl*prefer*rvt*predict-yes*H0*1
  9706. -->
  9707. (S1 ^operator O1937 = 0.3804173687365902)
  9708. Firing prefer*rvt*predict-yes*H0
  9709. -->
  9710. Firing prefer*rvt*predict-no*H0
  9711. -->
  9712. Firing elaborate*copy-dir-to-output-link
  9713. -->
  9714. (I3 ^dir L +)
  9715. inner elaboration loop at bottom goal.
  9716. Retracting elaborate*copy-see-to-output-link
  9717. -->
  9718. (I3 ^see 1 +)
  9719. Retracting propose*predict-no
  9720. -->
  9721. (O1938 ^name predict-no +)
  9722. (S1 ^operator O1938 +)
  9723. Retracting propose*predict-yes
  9724. -->
  9725. (O1937 ^name predict-yes +)
  9726. (S1 ^operator O1937 +)
  9727. Retracting elaborate*reward*based*on*reward
  9728. -->
  9729. (R972 ^value 1 +)
  9730. (R1 ^reward R972 +)
  9731. Retracting elaborate*copy-dir-to-output-link
  9732. -->
  9733. (I3 ^dir U +)
  9734. Retracting rl*prefer*rvt*predict-no*H0*4
  9735. -->
  9736. (S1 ^operator O1938 = 1.)
  9737. Retracting rl*prefer*rvt*predict-yes*H0*3
  9738. -->
  9739. (S1 ^operator O1937 = 0.)
  9740. =>WM: (13584: S1 ^operator O1940 +)
  9741. =>WM: (13583: S1 ^operator O1939 +)
  9742. =>WM: (13582: I3 ^dir L)
  9743. =>WM: (13581: O1940 ^name predict-no)
  9744. =>WM: (13580: O1939 ^name predict-yes)
  9745. =>WM: (13579: R973 ^value 1)
  9746. =>WM: (13578: R1 ^reward R973)
  9747. =>WM: (13577: I3 ^see 0)
  9748. <=WM: (13568: S1 ^operator O1937 +)
  9749. <=WM: (13569: S1 ^operator O1938 +)
  9750. <=WM: (13570: S1 ^operator O1938)
  9751. <=WM: (13567: I3 ^dir U)
  9752. <=WM: (13563: R1 ^reward R972)
  9753. <=WM: (13548: I3 ^see 1)
  9754. <=WM: (13566: O1938 ^name predict-no)
  9755. <=WM: (13565: O1937 ^name predict-yes)
  9756. <=WM: (13564: R972 ^value 1)
  9757. --- Inner Elaboration Phase, active level 1 (S1) ---
  9758. Firing prefer*rvt*predict-yes*H0
  9759. -->
  9760. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9761. -->
  9762. (S1 ^operator O1939 = 0.6196194522363663)
  9763. Firing rl*prefer*rvt*predict-yes*H0*1
  9764. -->
  9765. (S1 ^operator O1939 = 0.3804173687365902)
  9766. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9767. -->
  9768. Firing prefer*rvt*predict-no*H0
  9769. -->
  9770. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9771. -->
  9772. (S1 ^operator O1940 = -0.1479504104026684)
  9773. Firing rl*prefer*rvt*predict-no*H0*2
  9774. -->
  9775. (S1 ^operator O1940 = 0.3140405292214645)
  9776. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9777. -->
  9778. inner elaboration loop at bottom goal.
  9779. Retracting rl*prefer*rvt*predict-no*H0*2
  9780. -->
  9781. (S1 ^operator O1938 = 0.3140405292214645)
  9782. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9783. -->
  9784. (S1 ^operator O1938 = -0.1479504104026684)
  9785. Retracting rl*prefer*rvt*predict-yes*H0*1
  9786. -->
  9787. (S1 ^operator O1937 = 0.3804173687365902)
  9788. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9789. -->
  9790. (S1 ^operator O1937 = 0.6196194522363663)
  9791. --- END Proposal Phase ---
  9792. --- Decision Phase ---
  9793. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9794. =>WM: (13585: S1 ^operator O1939)
  9795. 970: O: O1939 (predict-yes)
  9796. --- END Decision Phase ---
  9797. --- Application Phase ---
  9798. --- Firing Productions (PE) For State At Depth 1 ---
  9799. --- Inner Elaboration Phase, active level 1 (S1) ---
  9800. Firing apply*operator
  9801. -->
  9802. (I3 ^predict-yes N970 + :O )
  9803. Firing apply*operator*complete
  9804. -->
  9805. (I3 ^predict-no N969 - :O )
  9806. inner elaboration loop at bottom goal.
  9807. --- Change Working Memory (PE) ---
  9808. =>WM: (13586: I3 ^predict-yes N970)
  9809. <=WM: (13572: N969 ^status complete)
  9810. <=WM: (13571: I3 ^predict-no N969)
  9811. --- Firing Productions (IE) For State At Depth 1 ---
  9812. --- Inner Elaboration Phase, active level 1 (S1) ---
  9813. Firing monitor*world
  9814. -->
  9815. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9816. --- Change Working Memory (IE) ---
  9817. --- END Application Phase ---
  9818. --- Output Phase ---
  9819. ENV: Agent did: predict-yes for direction L in state State-B
  9820. In State-B moving L
  9821. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9822. predict error 0
  9823. dir: dir isU
  9824. --- END Output Phase ---
  9825. \-/--- Input Phase ---
  9826. =>WM: (13590: I2 ^dir U)
  9827. =>WM: (13589: I2 ^reward 1)
  9828. =>WM: (13588: I2 ^see 1)
  9829. =>WM: (13587: N970 ^status complete)
  9830. <=WM: (13575: I2 ^dir L)
  9831. <=WM: (13574: I2 ^reward 1)
  9832. <=WM: (13573: I2 ^see 0)
  9833. =>WM: (13591: I2 ^level-1 L1-root)
  9834. <=WM: (13576: I2 ^level-1 R1-root)
  9835. --- END Input Phase ---
  9836. --- Proposal Phase ---
  9837. --- Inner Elaboration Phase, active level 1 (S1) ---
  9838. Firing elaborate*copy-see-to-output-link
  9839. -->
  9840. (I3 ^see 1 +)
  9841. Firing elaborate*reward*based*on*reward
  9842. -->
  9843. (R974 ^value 1 +)
  9844. (R1 ^reward R974 +)
  9845. Firing propose*predict-yes
  9846. -->
  9847. (O1941 ^name predict-yes +)
  9848. (S1 ^operator O1941 +)
  9849. Firing propose*predict-no
  9850. -->
  9851. (O1942 ^name predict-no +)
  9852. (S1 ^operator O1942 +)
  9853. Firing rl*prefer*rvt*predict-no*H0*4
  9854. -->
  9855. (S1 ^operator O1940 = 1.)
  9856. Firing rl*prefer*rvt*predict-yes*H0*3
  9857. -->
  9858. (S1 ^operator O1939 = 0.)
  9859. Firing prefer*rvt*predict-yes*H0
  9860. -->
  9861. Firing prefer*rvt*predict-no*H0
  9862. -->
  9863. Firing elaborate*copy-dir-to-output-link
  9864. -->
  9865. (I3 ^dir U +)
  9866. inner elaboration loop at bottom goal.
  9867. Retracting elaborate*copy-see-to-output-link
  9868. -->
  9869. (I3 ^see 0 +)
  9870. Retracting propose*predict-no
  9871. -->
  9872. (O1940 ^name predict-no +)
  9873. (S1 ^operator O1940 +)
  9874. Retracting propose*predict-yes
  9875. -->
  9876. (O1939 ^name predict-yes +)
  9877. (S1 ^operator O1939 +)
  9878. Retracting elaborate*reward*based*on*reward
  9879. -->
  9880. (R973 ^value 1 +)
  9881. (R1 ^reward R973 +)
  9882. Retracting elaborate*copy-dir-to-output-link
  9883. -->
  9884. (I3 ^dir L +)
  9885. Retracting rl*prefer*rvt*predict-no*H0*2
  9886. -->
  9887. (S1 ^operator O1940 = 0.3140405292214645)
  9888. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9889. -->
  9890. (S1 ^operator O1940 = -0.1479504104026684)
  9891. Retracting rl*prefer*rvt*predict-yes*H0*1
  9892. -->
  9893. (S1 ^operator O1939 = 0.3804173687365902)
  9894. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9895. -->
  9896. (S1 ^operator O1939 = 0.6196194522363663)
  9897. =>WM: (13599: S1 ^operator O1942 +)
  9898. =>WM: (13598: S1 ^operator O1941 +)
  9899. =>WM: (13597: I3 ^dir U)
  9900. =>WM: (13596: O1942 ^name predict-no)
  9901. =>WM: (13595: O1941 ^name predict-yes)
  9902. =>WM: (13594: R974 ^value 1)
  9903. =>WM: (13593: R1 ^reward R974)
  9904. =>WM: (13592: I3 ^see 1)
  9905. <=WM: (13583: S1 ^operator O1939 +)
  9906. <=WM: (13585: S1 ^operator O1939)
  9907. <=WM: (13584: S1 ^operator O1940 +)
  9908. <=WM: (13582: I3 ^dir L)
  9909. <=WM: (13578: R1 ^reward R973)
  9910. <=WM: (13577: I3 ^see 0)
  9911. <=WM: (13581: O1940 ^name predict-no)
  9912. <=WM: (13580: O1939 ^name predict-yes)
  9913. <=WM: (13579: R973 ^value 1)
  9914. --- Inner Elaboration Phase, active level 1 (S1) ---
  9915. Firing prefer*rvt*predict-yes*H0
  9916. -->
  9917. Firing rl*prefer*rvt*predict-yes*H0*3
  9918. -->
  9919. (S1 ^operator O1941 = 0.)
  9920. Firing prefer*rvt*predict-no*H0
  9921. -->
  9922. Firing rl*prefer*rvt*predict-no*H0*4
  9923. -->
  9924. (S1 ^operator O1942 = 1.)
  9925. inner elaboration loop at bottom goal.
  9926. Retracting rl*prefer*rvt*predict-no*H0*4
  9927. -->
  9928. (S1 ^operator O1940 = 1.)
  9929. Retracting rl*prefer*rvt*predict-yes*H0*3
  9930. -->
  9931. (S1 ^operator O1939 = 0.)
  9932. --- END Proposal Phase ---
  9933. --- Decision Phase ---
  9934. RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.823899,0.146007)
  9935. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478693 0.140927 0.619619 -> 0.478689 0.140927 0.619616(R,m,v=1,1,0)
  9936. =>WM: (13600: S1 ^operator O1942)
  9937. 971: O: O1942 (predict-no)
  9938. --- END Decision Phase ---
  9939. --- Application Phase ---
  9940. --- Firing Productions (PE) For State At Depth 1 ---
  9941. --- Inner Elaboration Phase, active level 1 (S1) ---
  9942. Firing apply*operator
  9943. -->
  9944. (I3 ^predict-no N971 + :O )
  9945. Firing apply*operator*complete
  9946. -->
  9947. (I3 ^predict-yes N970 - :O )
  9948. inner elaboration loop at bottom goal.
  9949. --- Change Working Memory (PE) ---
  9950. =>WM: (13601: I3 ^predict-no N971)
  9951. <=WM: (13587: N970 ^status complete)
  9952. <=WM: (13586: I3 ^predict-yes N970)
  9953. --- Firing Productions (IE) For State At Depth 1 ---
  9954. --- Inner Elaboration Phase, active level 1 (S1) ---
  9955. Firing monitor*world
  9956. -->
  9957. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9958. --- Change Working Memory (IE) ---
  9959. --- END Application Phase ---
  9960. --- Output Phase ---
  9961. ENV: Agent did: predict-no for direction U in state State-A
  9962. In State-A moving U
  9963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9964. predict error 0
  9965. dir: dir isL
  9966. --- END Output Phase ---
  9967. |--- Input Phase ---
  9968. =>WM: (13605: I2 ^dir L)
  9969. =>WM: (13604: I2 ^reward 1)
  9970. =>WM: (13603: I2 ^see 0)
  9971. =>WM: (13602: N971 ^status complete)
  9972. <=WM: (13590: I2 ^dir U)
  9973. <=WM: (13589: I2 ^reward 1)
  9974. <=WM: (13588: I2 ^see 1)
  9975. =>WM: (13606: I2 ^level-1 L1-root)
  9976. <=WM: (13591: I2 ^level-1 L1-root)
  9977. --- END Input Phase ---
  9978. --- Proposal Phase ---
  9979. --- Inner Elaboration Phase, active level 1 (S1) ---
  9980. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  9981. -->
  9982. (S1 ^operator O1941 = -0.3470159027404986)
  9983. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  9984. -->
  9985. (S1 ^operator O1942 = 0.6861654297024582)
  9986. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9987. -->
  9988. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9989. -->
  9990. Firing elaborate*copy-see-to-output-link
  9991. -->
  9992. (I3 ^see 0 +)
  9993. Firing elaborate*reward*based*on*reward
  9994. -->
  9995. (R975 ^value 1 +)
  9996. (R1 ^reward R975 +)
  9997. Firing propose*predict-yes
  9998. -->
  9999. (O1943 ^name predict-yes +)
  10000. (S1 ^operator O1943 +)
  10001. Firing propose*predict-no
  10002. -->
  10003. (O1944 ^name predict-no +)
  10004. (S1 ^operator O1944 +)
  10005. Firing rl*prefer*rvt*predict-no*H0*2
  10006. -->
  10007. (S1 ^operator O1942 = 0.3140405292214645)
  10008. Firing rl*prefer*rvt*predict-yes*H0*1
  10009. -->
  10010. (S1 ^operator O1941 = 0.3804143351598744)
  10011. Firing prefer*rvt*predict-yes*H0
  10012. -->
  10013. Firing prefer*rvt*predict-no*H0
  10014. -->
  10015. Firing elaborate*copy-dir-to-output-link
  10016. -->
  10017. (I3 ^dir L +)
  10018. inner elaboration loop at bottom goal.
  10019. Retracting elaborate*copy-see-to-output-link
  10020. -->
  10021. (I3 ^see 1 +)
  10022. Retracting propose*predict-no
  10023. -->
  10024. (O1942 ^name predict-no +)
  10025. (S1 ^operator O1942 +)
  10026. Retracting propose*predict-yes
  10027. -->
  10028. (O1941 ^name predict-yes +)
  10029. (S1 ^operator O1941 +)
  10030. Retracting elaborate*reward*based*on*reward
  10031. -->
  10032. (R974 ^value 1 +)
  10033. (R1 ^reward R974 +)
  10034. Retracting elaborate*copy-dir-to-output-link
  10035. -->
  10036. (I3 ^dir U +)
  10037. Retracting rl*prefer*rvt*predict-no*H0*4
  10038. -->
  10039. (S1 ^operator O1942 = 1.)
  10040. Retracting rl*prefer*rvt*predict-yes*H0*3
  10041. -->
  10042. (S1 ^operator O1941 = 0.)
  10043. =>WM: (13614: S1 ^operator O1944 +)
  10044. =>WM: (13613: S1 ^operator O1943 +)
  10045. =>WM: (13612: I3 ^dir L)
  10046. =>WM: (13611: O1944 ^name predict-no)
  10047. =>WM: (13610: O1943 ^name predict-yes)
  10048. =>WM: (13609: R975 ^value 1)
  10049. =>WM: (13608: R1 ^reward R975)
  10050. =>WM: (13607: I3 ^see 0)
  10051. <=WM: (13598: S1 ^operator O1941 +)
  10052. <=WM: (13599: S1 ^operator O1942 +)
  10053. <=WM: (13600: S1 ^operator O1942)
  10054. <=WM: (13597: I3 ^dir U)
  10055. <=WM: (13593: R1 ^reward R974)
  10056. <=WM: (13592: I3 ^see 1)
  10057. <=WM: (13596: O1942 ^name predict-no)
  10058. <=WM: (13595: O1941 ^name predict-yes)
  10059. <=WM: (13594: R974 ^value 1)
  10060. --- Inner Elaboration Phase, active level 1 (S1) ---
  10061. Firing prefer*rvt*predict-yes*H0
  10062. -->
  10063. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10064. -->
  10065. (S1 ^operator O1943 = -0.3470159027404986)
  10066. Firing rl*prefer*rvt*predict-yes*H0*1
  10067. -->
  10068. (S1 ^operator O1943 = 0.3804143351598744)
  10069. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10070. -->
  10071. Firing prefer*rvt*predict-no*H0
  10072. -->
  10073. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10074. -->
  10075. (S1 ^operator O1944 = 0.6861654297024582)
  10076. Firing rl*prefer*rvt*predict-no*H0*2
  10077. -->
  10078. (S1 ^operator O1944 = 0.3140405292214645)
  10079. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10080. -->
  10081. inner elaboration loop at bottom goal.
  10082. Retracting rl*prefer*rvt*predict-no*H0*2
  10083. -->
  10084. (S1 ^operator O1942 = 0.3140405292214645)
  10085. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10086. -->
  10087. (S1 ^operator O1942 = 0.6861654297024582)
  10088. Retracting rl*prefer*rvt*predict-yes*H0*1
  10089. -->
  10090. (S1 ^operator O1941 = 0.3804143351598744)
  10091. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10092. -->
  10093. (S1 ^operator O1941 = -0.3470159027404986)
  10094. --- END Proposal Phase ---
  10095. --- Decision Phase ---
  10096. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10097. =>WM: (13615: S1 ^operator O1944)
  10098. 972: O: O1944 (predict-no)
  10099. --- END Decision Phase ---
  10100. --- Application Phase ---
  10101. --- Firing Productions (PE) For State At Depth 1 ---
  10102. --- Inner Elaboration Phase, active level 1 (S1) ---
  10103. Firing apply*operator
  10104. -->
  10105. (I3 ^predict-no N972 + :O )
  10106. Firing apply*operator*complete
  10107. -->
  10108. (I3 ^predict-no N971 - :O )
  10109. inner elaboration loop at bottom goal.
  10110. --- Change Working Memory (PE) ---
  10111. =>WM: (13616: I3 ^predict-no N972)
  10112. <=WM: (13602: N971 ^status complete)
  10113. <=WM: (13601: I3 ^predict-no N971)
  10114. --- Firing Productions (IE) For State At Depth 1 ---
  10115. --- Inner Elaboration Phase, active level 1 (S1) ---
  10116. Firing monitor*world
  10117. -->
  10118. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10119. --- Change Working Memory (IE) ---
  10120. --- END Application Phase ---
  10121. --- Output Phase ---
  10122. ENV: Agent did: predict-no for direction L in state State-A
  10123. In State-A moving L
  10124. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10125. predict error 0
  10126. dir: dir isR
  10127. --- END Output Phase ---
  10128. \-/--- Input Phase ---
  10129. =>WM: (13620: I2 ^dir R)
  10130. =>WM: (13619: I2 ^reward 1)
  10131. =>WM: (13618: I2 ^see 0)
  10132. =>WM: (13617: N972 ^status complete)
  10133. <=WM: (13605: I2 ^dir L)
  10134. <=WM: (13604: I2 ^reward 1)
  10135. <=WM: (13603: I2 ^see 0)
  10136. =>WM: (13621: I2 ^level-1 L0-root)
  10137. <=WM: (13606: I2 ^level-1 L1-root)
  10138. --- END Input Phase ---
  10139. --- Proposal Phase ---
  10140. --- Inner Elaboration Phase, active level 1 (S1) ---
  10141. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10142. -->
  10143. (S1 ^operator O1943 = 0.7054436376897688)
  10144. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10145. -->
  10146. (S1 ^operator O1944 = -0.2023211881870005)
  10147. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10148. -->
  10149. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10150. -->
  10151. Firing elaborate*copy-see-to-output-link
  10152. -->
  10153. (I3 ^see 0 +)
  10154. Firing elaborate*reward*based*on*reward
  10155. -->
  10156. (R976 ^value 1 +)
  10157. (R1 ^reward R976 +)
  10158. Firing propose*predict-yes
  10159. -->
  10160. (O1945 ^name predict-yes +)
  10161. (S1 ^operator O1945 +)
  10162. Firing propose*predict-no
  10163. -->
  10164. (O1946 ^name predict-no +)
  10165. (S1 ^operator O1946 +)
  10166. Firing rl*prefer*rvt*predict-no*H0*6
  10167. -->
  10168. (S1 ^operator O1944 = 0.2298717920574965)
  10169. Firing rl*prefer*rvt*predict-yes*H0*5
  10170. -->
  10171. (S1 ^operator O1943 = 0.2939507002996337)
  10172. Firing prefer*rvt*predict-yes*H0
  10173. -->
  10174. Firing prefer*rvt*predict-no*H0
  10175. -->
  10176. Firing elaborate*copy-dir-to-output-link
  10177. -->
  10178. (I3 ^dir R +)
  10179. inner elaboration loop at bottom goal.
  10180. Retracting elaborate*copy-see-to-output-link
  10181. -->
  10182. (I3 ^see 0 +)
  10183. Retracting propose*predict-no
  10184. -->
  10185. (O1944 ^name predict-no +)
  10186. (S1 ^operator O1944 +)
  10187. Retracting propose*predict-yes
  10188. -->
  10189. (O1943 ^name predict-yes +)
  10190. (S1 ^operator O1943 +)
  10191. Retracting elaborate*reward*based*on*reward
  10192. -->
  10193. (R975 ^value 1 +)
  10194. (R1 ^reward R975 +)
  10195. Retracting elaborate*copy-dir-to-output-link
  10196. -->
  10197. (I3 ^dir L +)
  10198. Retracting rl*prefer*rvt*predict-no*H0*2
  10199. -->
  10200. (S1 ^operator O1944 = 0.3140405292214645)
  10201. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10202. -->
  10203. (S1 ^operator O1944 = 0.6861654297024582)
  10204. Retracting rl*prefer*rvt*predict-yes*H0*1
  10205. -->
  10206. (S1 ^operator O1943 = 0.3804143351598744)
  10207. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10208. -->
  10209. (S1 ^operator O1943 = -0.3470159027404986)
  10210. =>WM: (13628: S1 ^operator O1946 +)
  10211. =>WM: (13627: S1 ^operator O1945 +)
  10212. =>WM: (13626: I3 ^dir R)
  10213. =>WM: (13625: O1946 ^name predict-no)
  10214. =>WM: (13624: O1945 ^name predict-yes)
  10215. =>WM: (13623: R976 ^value 1)
  10216. =>WM: (13622: R1 ^reward R976)
  10217. <=WM: (13613: S1 ^operator O1943 +)
  10218. <=WM: (13614: S1 ^operator O1944 +)
  10219. <=WM: (13615: S1 ^operator O1944)
  10220. <=WM: (13612: I3 ^dir L)
  10221. <=WM: (13608: R1 ^reward R975)
  10222. <=WM: (13611: O1944 ^name predict-no)
  10223. <=WM: (13610: O1943 ^name predict-yes)
  10224. <=WM: (13609: R975 ^value 1)
  10225. --- Inner Elaboration Phase, active level 1 (S1) ---
  10226. Firing prefer*rvt*predict-yes*H0
  10227. -->
  10228. Firing rl*prefer*rvt*predict-yes*H0*5
  10229. -->
  10230. (S1 ^operator O1945 = 0.2939507002996337)
  10231. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10232. -->
  10233. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10234. -->
  10235. (S1 ^operator O1945 = 0.7054436376897688)
  10236. Firing prefer*rvt*predict-no*H0
  10237. -->
  10238. Firing rl*prefer*rvt*predict-no*H0*6
  10239. -->
  10240. (S1 ^operator O1946 = 0.2298717920574965)
  10241. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10242. -->
  10243. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10244. -->
  10245. (S1 ^operator O1946 = -0.2023211881870005)
  10246. inner elaboration loop at bottom goal.
  10247. Retracting rl*prefer*rvt*predict-no*H0*6
  10248. -->
  10249. (S1 ^operator O1944 = 0.2298717920574965)
  10250. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10251. -->
  10252. (S1 ^operator O1944 = -0.2023211881870005)
  10253. Retracting rl*prefer*rvt*predict-yes*H0*5
  10254. -->
  10255. (S1 ^operator O1943 = 0.2939507002996337)
  10256. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10257. -->
  10258. (S1 ^operator O1943 = 0.7054436376897688)
  10259. --- END Proposal Phase ---
  10260. --- Decision Phase ---
  10261. RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485033 -0.171009 0.314023(R,m,v=1,0.86,0.121208)
  10262. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515116 0.171049 0.686165 -> 0.5151 0.171045 0.686145(R,m,v=1,1,0)
  10263. =>WM: (13629: S1 ^operator O1945)
  10264. 973: O: O1945 (predict-yes)
  10265. --- END Decision Phase ---
  10266. --- Application Phase ---
  10267. --- Firing Productions (PE) For State At Depth 1 ---
  10268. --- Inner Elaboration Phase, active level 1 (S1) ---
  10269. Firing apply*operator
  10270. -->
  10271. (I3 ^predict-yes N973 + :O )
  10272. Firing apply*operator*complete
  10273. -->
  10274. (I3 ^predict-no N972 - :O )
  10275. inner elaboration loop at bottom goal.
  10276. --- Change Working Memory (PE) ---
  10277. =>WM: (13630: I3 ^predict-yes N973)
  10278. <=WM: (13617: N972 ^status complete)
  10279. <=WM: (13616: I3 ^predict-no N972)
  10280. --- Firing Productions (IE) For State At Depth 1 ---
  10281. --- Inner Elaboration Phase, active level 1 (S1) ---
  10282. Firing monitor*world
  10283. -->
  10284. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10285. --- Change Working Memory (IE) ---
  10286. --- END Application Phase ---
  10287. --- Output Phase ---
  10288. ENV: Agent did: predict-yes for direction R in state State-A
  10289. In State-A moving R
  10290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10291. predict error 0
  10292. dir: dir isU
  10293. --- END Output Phase ---
  10294. |\---- Input Phase ---
  10295. =>WM: (13634: I2 ^dir U)
  10296. =>WM: (13633: I2 ^reward 1)
  10297. =>WM: (13632: I2 ^see 1)
  10298. =>WM: (13631: N973 ^status complete)
  10299. <=WM: (13620: I2 ^dir R)
  10300. <=WM: (13619: I2 ^reward 1)
  10301. <=WM: (13618: I2 ^see 0)
  10302. =>WM: (13635: I2 ^level-1 R1-root)
  10303. <=WM: (13621: I2 ^level-1 L0-root)
  10304. --- END Input Phase ---
  10305. --- Proposal Phase ---
  10306. --- Inner Elaboration Phase, active level 1 (S1) ---
  10307. Firing elaborate*copy-see-to-output-link
  10308. -->
  10309. (I3 ^see 1 +)
  10310. Firing elaborate*reward*based*on*reward
  10311. -->
  10312. (R977 ^value 1 +)
  10313. (R1 ^reward R977 +)
  10314. Firing propose*predict-yes
  10315. -->
  10316. (O1947 ^name predict-yes +)
  10317. (S1 ^operator O1947 +)
  10318. Firing propose*predict-no
  10319. -->
  10320. (O1948 ^name predict-no +)
  10321. (S1 ^operator O1948 +)
  10322. Firing rl*prefer*rvt*predict-no*H0*4
  10323. -->
  10324. (S1 ^operator O1946 = 1.)
  10325. Firing rl*prefer*rvt*predict-yes*H0*3
  10326. -->
  10327. (S1 ^operator O1945 = 0.)
  10328. Firing prefer*rvt*predict-yes*H0
  10329. -->
  10330. Firing prefer*rvt*predict-no*H0
  10331. -->
  10332. Firing elaborate*copy-dir-to-output-link
  10333. -->
  10334. (I3 ^dir U +)
  10335. inner elaboration loop at bottom goal.
  10336. Retracting elaborate*copy-see-to-output-link
  10337. -->
  10338. (I3 ^see 0 +)
  10339. Retracting propose*predict-no
  10340. -->
  10341. (O1946 ^name predict-no +)
  10342. (S1 ^operator O1946 +)
  10343. Retracting propose*predict-yes
  10344. -->
  10345. (O1945 ^name predict-yes +)
  10346. (S1 ^operator O1945 +)
  10347. Retracting elaborate*reward*based*on*reward
  10348. -->
  10349. (R976 ^value 1 +)
  10350. (R1 ^reward R976 +)
  10351. Retracting elaborate*copy-dir-to-output-link
  10352. -->
  10353. (I3 ^dir R +)
  10354. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10355. -->
  10356. (S1 ^operator O1946 = -0.2023211881870005)
  10357. Retracting rl*prefer*rvt*predict-no*H0*6
  10358. -->
  10359. (S1 ^operator O1946 = 0.2298717920574965)
  10360. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10361. -->
  10362. (S1 ^operator O1945 = 0.7054436376897688)
  10363. Retracting rl*prefer*rvt*predict-yes*H0*5
  10364. -->
  10365. (S1 ^operator O1945 = 0.2939507002996337)
  10366. =>WM: (13643: S1 ^operator O1948 +)
  10367. =>WM: (13642: S1 ^operator O1947 +)
  10368. =>WM: (13641: I3 ^dir U)
  10369. =>WM: (13640: O1948 ^name predict-no)
  10370. =>WM: (13639: O1947 ^name predict-yes)
  10371. =>WM: (13638: R977 ^value 1)
  10372. =>WM: (13637: R1 ^reward R977)
  10373. =>WM: (13636: I3 ^see 1)
  10374. <=WM: (13627: S1 ^operator O1945 +)
  10375. <=WM: (13629: S1 ^operator O1945)
  10376. <=WM: (13628: S1 ^operator O1946 +)
  10377. <=WM: (13626: I3 ^dir R)
  10378. <=WM: (13622: R1 ^reward R976)
  10379. <=WM: (13607: I3 ^see 0)
  10380. <=WM: (13625: O1946 ^name predict-no)
  10381. <=WM: (13624: O1945 ^name predict-yes)
  10382. <=WM: (13623: R976 ^value 1)
  10383. --- Inner Elaboration Phase, active level 1 (S1) ---
  10384. Firing prefer*rvt*predict-yes*H0
  10385. -->
  10386. Firing rl*prefer*rvt*predict-yes*H0*3
  10387. -->
  10388. (S1 ^operator O1947 = 0.)
  10389. Firing prefer*rvt*predict-no*H0
  10390. -->
  10391. Firing rl*prefer*rvt*predict-no*H0*4
  10392. -->
  10393. (S1 ^operator O1948 = 1.)
  10394. inner elaboration loop at bottom goal.
  10395. Retracting rl*prefer*rvt*predict-no*H0*4
  10396. -->
  10397. (S1 ^operator O1946 = 1.)
  10398. Retracting rl*prefer*rvt*predict-yes*H0*3
  10399. -->
  10400. (S1 ^operator O1945 = 0.)
  10401. --- END Proposal Phase ---
  10402. --- Decision Phase ---
  10403. RL update rl*prefer*rvt*predict-yes*H0*5 0.501028 -0.207078 0.293951 -> 0.501074 -0.207073 0.294001(R,m,v=1,0.84,0.135302)
  10404. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498423 0.207021 0.705444 -> 0.498477 0.207026 0.705503(R,m,v=1,1,0)
  10405. =>WM: (13644: S1 ^operator O1948)
  10406. 974: O: O1948 (predict-no)
  10407. --- END Decision Phase ---
  10408. --- Application Phase ---
  10409. --- Firing Productions (PE) For State At Depth 1 ---
  10410. --- Inner Elaboration Phase, active level 1 (S1) ---
  10411. Firing apply*operator
  10412. -->
  10413. (I3 ^predict-no N974 + :O )
  10414. Firing apply*operator*complete
  10415. -->
  10416. (I3 ^predict-yes N973 - :O )
  10417. inner elaboration loop at bottom goal.
  10418. --- Change Working Memory (PE) ---
  10419. =>WM: (13645: I3 ^predict-no N974)
  10420. <=WM: (13631: N973 ^status complete)
  10421. <=WM: (13630: I3 ^predict-yes N973)
  10422. --- Firing Productions (IE) For State At Depth 1 ---
  10423. --- Inner Elaboration Phase, active level 1 (S1) ---
  10424. Firing monitor*world
  10425. -->
  10426. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10427. --- Change Working Memory (IE) ---
  10428. --- END Application Phase ---
  10429. --- Output Phase ---
  10430. ENV: Agent did: predict-no for direction U in state State-B
  10431. In State-B moving U
  10432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10433. predict error 0
  10434. dir: dir isL
  10435. --- END Output Phase ---
  10436. /|\--- Input Phase ---
  10437. =>WM: (13649: I2 ^dir L)
  10438. =>WM: (13648: I2 ^reward 1)
  10439. =>WM: (13647: I2 ^see 0)
  10440. =>WM: (13646: N974 ^status complete)
  10441. <=WM: (13634: I2 ^dir U)
  10442. <=WM: (13633: I2 ^reward 1)
  10443. <=WM: (13632: I2 ^see 1)
  10444. =>WM: (13650: I2 ^level-1 R1-root)
  10445. <=WM: (13635: I2 ^level-1 R1-root)
  10446. --- END Input Phase ---
  10447. --- Proposal Phase ---
  10448. --- Inner Elaboration Phase, active level 1 (S1) ---
  10449. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10450. -->
  10451. (S1 ^operator O1947 = 0.6196158942331635)
  10452. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10453. -->
  10454. (S1 ^operator O1948 = -0.1479504104026684)
  10455. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10456. -->
  10457. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10458. -->
  10459. Firing elaborate*copy-see-to-output-link
  10460. -->
  10461. (I3 ^see 0 +)
  10462. Firing elaborate*reward*based*on*reward
  10463. -->
  10464. (R978 ^value 1 +)
  10465. (R1 ^reward R978 +)
  10466. Firing propose*predict-yes
  10467. -->
  10468. (O1949 ^name predict-yes +)
  10469. (S1 ^operator O1949 +)
  10470. Firing propose*predict-no
  10471. -->
  10472. (O1950 ^name predict-no +)
  10473. (S1 ^operator O1950 +)
  10474. Firing rl*prefer*rvt*predict-no*H0*2
  10475. -->
  10476. (S1 ^operator O1948 = 0.3140233963466647)
  10477. Firing rl*prefer*rvt*predict-yes*H0*1
  10478. -->
  10479. (S1 ^operator O1947 = 0.3804143351598744)
  10480. Firing prefer*rvt*predict-yes*H0
  10481. -->
  10482. Firing prefer*rvt*predict-no*H0
  10483. -->
  10484. Firing elaborate*copy-dir-to-output-link
  10485. -->
  10486. (I3 ^dir L +)
  10487. inner elaboration loop at bottom goal.
  10488. Retracting elaborate*copy-see-to-output-link
  10489. -->
  10490. (I3 ^see 1 +)
  10491. Retracting propose*predict-no
  10492. -->
  10493. (O1948 ^name predict-no +)
  10494. (S1 ^operator O1948 +)
  10495. Retracting propose*predict-yes
  10496. -->
  10497. (O1947 ^name predict-yes +)
  10498. (S1 ^operator O1947 +)
  10499. Retracting elaborate*reward*based*on*reward
  10500. -->
  10501. (R977 ^value 1 +)
  10502. (R1 ^reward R977 +)
  10503. Retracting elaborate*copy-dir-to-output-link
  10504. -->
  10505. (I3 ^dir U +)
  10506. Retracting rl*prefer*rvt*predict-no*H0*4
  10507. -->
  10508. (S1 ^operator O1948 = 1.)
  10509. Retracting rl*prefer*rvt*predict-yes*H0*3
  10510. -->
  10511. (S1 ^operator O1947 = 0.)
  10512. =>WM: (13658: S1 ^operator O1950 +)
  10513. =>WM: (13657: S1 ^operator O1949 +)
  10514. =>WM: (13656: I3 ^dir L)
  10515. =>WM: (13655: O1950 ^name predict-no)
  10516. =>WM: (13654: O1949 ^name predict-yes)
  10517. =>WM: (13653: R978 ^value 1)
  10518. =>WM: (13652: R1 ^reward R978)
  10519. =>WM: (13651: I3 ^see 0)
  10520. <=WM: (13642: S1 ^operator O1947 +)
  10521. <=WM: (13643: S1 ^operator O1948 +)
  10522. <=WM: (13644: S1 ^operator O1948)
  10523. <=WM: (13641: I3 ^dir U)
  10524. <=WM: (13637: R1 ^reward R977)
  10525. <=WM: (13636: I3 ^see 1)
  10526. <=WM: (13640: O1948 ^name predict-no)
  10527. <=WM: (13639: O1947 ^name predict-yes)
  10528. <=WM: (13638: R977 ^value 1)
  10529. --- Inner Elaboration Phase, active level 1 (S1) ---
  10530. Firing prefer*rvt*predict-yes*H0
  10531. -->
  10532. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10533. -->
  10534. (S1 ^operator O1949 = 0.6196158942331635)
  10535. Firing rl*prefer*rvt*predict-yes*H0*1
  10536. -->
  10537. (S1 ^operator O1949 = 0.3804143351598744)
  10538. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10539. -->
  10540. Firing prefer*rvt*predict-no*H0
  10541. -->
  10542. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10543. -->
  10544. (S1 ^operator O1950 = -0.1479504104026684)
  10545. Firing rl*prefer*rvt*predict-no*H0*2
  10546. -->
  10547. (S1 ^operator O1950 = 0.3140233963466647)
  10548. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10549. -->
  10550. inner elaboration loop at bottom goal.
  10551. Retracting rl*prefer*rvt*predict-no*H0*2
  10552. -->
  10553. (S1 ^operator O1948 = 0.3140233963466647)
  10554. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10555. -->
  10556. (S1 ^operator O1948 = -0.1479504104026684)
  10557. Retracting rl*prefer*rvt*predict-yes*H0*1
  10558. -->
  10559. (S1 ^operator O1947 = 0.3804143351598744)
  10560. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10561. -->
  10562. (S1 ^operator O1947 = 0.6196158942331635)
  10563. --- END Proposal Phase ---
  10564. --- Decision Phase ---
  10565. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10566. =>WM: (13659: S1 ^operator O1949)
  10567. 975: O: O1949 (predict-yes)
  10568. --- END Decision Phase ---
  10569. --- Application Phase ---
  10570. --- Firing Productions (PE) For State At Depth 1 ---
  10571. --- Inner Elaboration Phase, active level 1 (S1) ---
  10572. Firing apply*operator
  10573. -->
  10574. (I3 ^predict-yes N975 + :O )
  10575. Firing apply*operator*complete
  10576. -->
  10577. (I3 ^predict-no N974 - :O )
  10578. inner elaboration loop at bottom goal.
  10579. --- Change Working Memory (PE) ---
  10580. =>WM: (13660: I3 ^predict-yes N975)
  10581. <=WM: (13646: N974 ^status complete)
  10582. <=WM: (13645: I3 ^predict-no N974)
  10583. --- Firing Productions (IE) For State At Depth 1 ---
  10584. --- Inner Elaboration Phase, active level 1 (S1) ---
  10585. Firing monitor*world
  10586. -->
  10587. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10588. --- Change Working Memory (IE) ---
  10589. --- END Application Phase ---
  10590. --- Output Phase ---
  10591. ENV: Agent did: predict-yes for direction L in state State-B
  10592. In State-B moving L
  10593. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10594. predict error 0
  10595. dir: dir isR
  10596. --- END Output Phase ---
  10597. -/|--- Input Phase ---
  10598. =>WM: (13664: I2 ^dir R)
  10599. =>WM: (13663: I2 ^reward 1)
  10600. =>WM: (13662: I2 ^see 1)
  10601. =>WM: (13661: N975 ^status complete)
  10602. <=WM: (13649: I2 ^dir L)
  10603. <=WM: (13648: I2 ^reward 1)
  10604. <=WM: (13647: I2 ^see 0)
  10605. =>WM: (13665: I2 ^level-1 L1-root)
  10606. <=WM: (13650: I2 ^level-1 R1-root)
  10607. --- END Input Phase ---
  10608. --- Proposal Phase ---
  10609. --- Inner Elaboration Phase, active level 1 (S1) ---
  10610. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10611. -->
  10612. (S1 ^operator O1949 = 0.7064496972060428)
  10613. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10614. -->
  10615. (S1 ^operator O1950 = -0.1937987592593187)
  10616. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10617. -->
  10618. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10619. -->
  10620. Firing elaborate*copy-see-to-output-link
  10621. -->
  10622. (I3 ^see 1 +)
  10623. Firing elaborate*reward*based*on*reward
  10624. -->
  10625. (R979 ^value 1 +)
  10626. (R1 ^reward R979 +)
  10627. Firing propose*predict-yes
  10628. -->
  10629. (O1951 ^name predict-yes +)
  10630. (S1 ^operator O1951 +)
  10631. Firing propose*predict-no
  10632. -->
  10633. (O1952 ^name predict-no +)
  10634. (S1 ^operator O1952 +)
  10635. Firing rl*prefer*rvt*predict-no*H0*6
  10636. -->
  10637. (S1 ^operator O1950 = 0.2298717920574965)
  10638. Firing rl*prefer*rvt*predict-yes*H0*5
  10639. -->
  10640. (S1 ^operator O1949 = 0.2940010828283485)
  10641. Firing prefer*rvt*predict-yes*H0
  10642. -->
  10643. Firing prefer*rvt*predict-no*H0
  10644. -->
  10645. Firing elaborate*copy-dir-to-output-link
  10646. -->
  10647. (I3 ^dir R +)
  10648. inner elaboration loop at bottom goal.
  10649. Retracting elaborate*copy-see-to-output-link
  10650. -->
  10651. (I3 ^see 0 +)
  10652. Retracting propose*predict-no
  10653. -->
  10654. (O1950 ^name predict-no +)
  10655. (S1 ^operator O1950 +)
  10656. Retracting propose*predict-yes
  10657. -->
  10658. (O1949 ^name predict-yes +)
  10659. (S1 ^operator O1949 +)
  10660. Retracting elaborate*reward*based*on*reward
  10661. -->
  10662. (R978 ^value 1 +)
  10663. (R1 ^reward R978 +)
  10664. Retracting elaborate*copy-dir-to-output-link
  10665. -->
  10666. (I3 ^dir L +)
  10667. Retracting rl*prefer*rvt*predict-no*H0*2
  10668. -->
  10669. (S1 ^operator O1950 = 0.3140233963466647)
  10670. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10671. -->
  10672. (S1 ^operator O1950 = -0.1479504104026684)
  10673. Retracting rl*prefer*rvt*predict-yes*H0*1
  10674. -->
  10675. (S1 ^operator O1949 = 0.3804143351598744)
  10676. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10677. -->
  10678. (S1 ^operator O1949 = 0.6196158942331635)
  10679. =>WM: (13673: S1 ^operator O1952 +)
  10680. =>WM: (13672: S1 ^operator O1951 +)
  10681. =>WM: (13671: I3 ^dir R)
  10682. =>WM: (13670: O1952 ^name predict-no)
  10683. =>WM: (13669: O1951 ^name predict-yes)
  10684. =>WM: (13668: R979 ^value 1)
  10685. =>WM: (13667: R1 ^reward R979)
  10686. =>WM: (13666: I3 ^see 1)
  10687. <=WM: (13657: S1 ^operator O1949 +)
  10688. <=WM: (13659: S1 ^operator O1949)
  10689. <=WM: (13658: S1 ^operator O1950 +)
  10690. <=WM: (13656: I3 ^dir L)
  10691. <=WM: (13652: R1 ^reward R978)
  10692. <=WM: (13651: I3 ^see 0)
  10693. <=WM: (13655: O1950 ^name predict-no)
  10694. <=WM: (13654: O1949 ^name predict-yes)
  10695. <=WM: (13653: R978 ^value 1)
  10696. --- Inner Elaboration Phase, active level 1 (S1) ---
  10697. Firing prefer*rvt*predict-yes*H0
  10698. -->
  10699. Firing rl*prefer*rvt*predict-yes*H0*5
  10700. -->
  10701. (S1 ^operator O1951 = 0.2940010828283485)
  10702. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10703. -->
  10704. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10705. -->
  10706. (S1 ^operator O1951 = 0.7064496972060428)
  10707. Firing prefer*rvt*predict-no*H0
  10708. -->
  10709. Firing rl*prefer*rvt*predict-no*H0*6
  10710. -->
  10711. (S1 ^operator O1952 = 0.2298717920574965)
  10712. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10713. -->
  10714. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10715. -->
  10716. (S1 ^operator O1952 = -0.1937987592593187)
  10717. inner elaboration loop at bottom goal.
  10718. Retracting rl*prefer*rvt*predict-no*H0*6
  10719. -->
  10720. (S1 ^operator O1950 = 0.2298717920574965)
  10721. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10722. -->
  10723. (S1 ^operator O1950 = -0.1937987592593187)
  10724. Retracting rl*prefer*rvt*predict-yes*H0*5
  10725. -->
  10726. (S1 ^operator O1949 = 0.2940010828283485)
  10727. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10728. -->
  10729. (S1 ^operator O1949 = 0.7064496972060428)
  10730. --- END Proposal Phase ---
  10731. --- Decision Phase ---
  10732. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.825,0.145283)
  10733. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478689 0.140927 0.619616 -> 0.478686 0.140927 0.619613(R,m,v=1,1,0)
  10734. =>WM: (13674: S1 ^operator O1951)
  10735. 976: O: O1951 (predict-yes)
  10736. --- END Decision Phase ---
  10737. --- Application Phase ---
  10738. --- Firing Productions (PE) For State At Depth 1 ---
  10739. --- Inner Elaboration Phase, active level 1 (S1) ---
  10740. Firing apply*operator
  10741. -->
  10742. (I3 ^predict-yes N976 + :O )
  10743. Firing apply*operator*complete
  10744. -->
  10745. (I3 ^predict-yes N975 - :O )
  10746. inner elaboration loop at bottom goal.
  10747. --- Change Working Memory (PE) ---
  10748. =>WM: (13675: I3 ^predict-yes N976)
  10749. <=WM: (13661: N975 ^status complete)
  10750. <=WM: (13660: I3 ^predict-yes N975)
  10751. --- Firing Productions (IE) For State At Depth 1 ---
  10752. --- Inner Elaboration Phase, active level 1 (S1) ---
  10753. Firing monitor*world
  10754. -->
  10755. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10756. --- Change Working Memory (IE) ---
  10757. --- END Application Phase ---
  10758. --- Output Phase ---
  10759. ENV: Agent did: predict-yes for direction R in state State-A
  10760. In State-A moving R
  10761. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10762. predict error 0
  10763. dir: dir isR
  10764. --- END Output Phase ---
  10765. \-/--- Input Phase ---
  10766. =>WM: (13679: I2 ^dir R)
  10767. =>WM: (13678: I2 ^reward 1)
  10768. =>WM: (13677: I2 ^see 1)
  10769. =>WM: (13676: N976 ^status complete)
  10770. <=WM: (13664: I2 ^dir R)
  10771. <=WM: (13663: I2 ^reward 1)
  10772. <=WM: (13662: I2 ^see 1)
  10773. =>WM: (13680: I2 ^level-1 R1-root)
  10774. <=WM: (13665: I2 ^level-1 L1-root)
  10775. --- END Input Phase ---
  10776. --- Proposal Phase ---
  10777. --- Inner Elaboration Phase, active level 1 (S1) ---
  10778. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10779. -->
  10780. (S1 ^operator O1951 = -0.252585164213872)
  10781. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10782. -->
  10783. (S1 ^operator O1952 = 0.7701964997777864)
  10784. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10785. -->
  10786. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10787. -->
  10788. Firing elaborate*copy-see-to-output-link
  10789. -->
  10790. (I3 ^see 1 +)
  10791. Firing elaborate*reward*based*on*reward
  10792. -->
  10793. (R980 ^value 1 +)
  10794. (R1 ^reward R980 +)
  10795. Firing propose*predict-yes
  10796. -->
  10797. (O1953 ^name predict-yes +)
  10798. (S1 ^operator O1953 +)
  10799. Firing propose*predict-no
  10800. -->
  10801. (O1954 ^name predict-no +)
  10802. (S1 ^operator O1954 +)
  10803. Firing rl*prefer*rvt*predict-no*H0*6
  10804. -->
  10805. (S1 ^operator O1952 = 0.2298717920574965)
  10806. Firing rl*prefer*rvt*predict-yes*H0*5
  10807. -->
  10808. (S1 ^operator O1951 = 0.2940010828283485)
  10809. Firing prefer*rvt*predict-yes*H0
  10810. -->
  10811. Firing prefer*rvt*predict-no*H0
  10812. -->
  10813. Firing elaborate*copy-dir-to-output-link
  10814. -->
  10815. (I3 ^dir R +)
  10816. inner elaboration loop at bottom goal.
  10817. Retracting elaborate*copy-see-to-output-link
  10818. -->
  10819. (I3 ^see 1 +)
  10820. Retracting propose*predict-no
  10821. -->
  10822. (O1952 ^name predict-no +)
  10823. (S1 ^operator O1952 +)
  10824. Retracting propose*predict-yes
  10825. -->
  10826. (O1951 ^name predict-yes +)
  10827. (S1 ^operator O1951 +)
  10828. Retracting elaborate*reward*based*on*reward
  10829. -->
  10830. (R979 ^value 1 +)
  10831. (R1 ^reward R979 +)
  10832. Retracting elaborate*copy-dir-to-output-link
  10833. -->
  10834. (I3 ^dir R +)
  10835. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10836. -->
  10837. (S1 ^operator O1952 = -0.1937987592593187)
  10838. Retracting rl*prefer*rvt*predict-no*H0*6
  10839. -->
  10840. (S1 ^operator O1952 = 0.2298717920574965)
  10841. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10842. -->
  10843. (S1 ^operator O1951 = 0.7064496972060428)
  10844. Retracting rl*prefer*rvt*predict-yes*H0*5
  10845. -->
  10846. (S1 ^operator O1951 = 0.2940010828283485)
  10847. =>WM: (13686: S1 ^operator O1954 +)
  10848. =>WM: (13685: S1 ^operator O1953 +)
  10849. =>WM: (13684: O1954 ^name predict-no)
  10850. =>WM: (13683: O1953 ^name predict-yes)
  10851. =>WM: (13682: R980 ^value 1)
  10852. =>WM: (13681: R1 ^reward R980)
  10853. <=WM: (13672: S1 ^operator O1951 +)
  10854. <=WM: (13674: S1 ^operator O1951)
  10855. <=WM: (13673: S1 ^operator O1952 +)
  10856. <=WM: (13667: R1 ^reward R979)
  10857. <=WM: (13670: O1952 ^name predict-no)
  10858. <=WM: (13669: O1951 ^name predict-yes)
  10859. <=WM: (13668: R979 ^value 1)
  10860. --- Inner Elaboration Phase, active level 1 (S1) ---
  10861. Firing prefer*rvt*predict-yes*H0
  10862. -->
  10863. Firing rl*prefer*rvt*predict-yes*H0*5
  10864. -->
  10865. (S1 ^operator O1953 = 0.2940010828283485)
  10866. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10867. -->
  10868. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10869. -->
  10870. (S1 ^operator O1953 = -0.252585164213872)
  10871. Firing prefer*rvt*predict-no*H0
  10872. -->
  10873. Firing rl*prefer*rvt*predict-no*H0*6
  10874. -->
  10875. (S1 ^operator O1954 = 0.2298717920574965)
  10876. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10877. -->
  10878. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10879. -->
  10880. (S1 ^operator O1954 = 0.7701964997777864)
  10881. inner elaboration loop at bottom goal.
  10882. Retracting rl*prefer*rvt*predict-no*H0*6
  10883. -->
  10884. (S1 ^operator O1952 = 0.2298717920574965)
  10885. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10886. -->
  10887. (S1 ^operator O1952 = 0.7701964997777864)
  10888. Retracting rl*prefer*rvt*predict-yes*H0*5
  10889. -->
  10890. (S1 ^operator O1951 = 0.2940010828283485)
  10891. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10892. -->
  10893. (S1 ^operator O1951 = -0.252585164213872)
  10894. --- END Proposal Phase ---
  10895. --- Decision Phase ---
  10896. RL update rl*prefer*rvt*predict-yes*H0*5 0.501074 -0.207073 0.294001 -> 0.50104 -0.207077 0.293964(R,m,v=1,0.84106,0.13457)
  10897. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499331 0.207118 0.70645 -> 0.499292 0.207114 0.706406(R,m,v=1,1,0)
  10898. =>WM: (13687: S1 ^operator O1954)
  10899. 977: O: O1954 (predict-no)
  10900. --- END Decision Phase ---
  10901. --- Application Phase ---
  10902. --- Firing Productions (PE) For State At Depth 1 ---
  10903. --- Inner Elaboration Phase, active level 1 (S1) ---
  10904. Firing apply*operator
  10905. -->
  10906. (I3 ^predict-no N977 + :O )
  10907. Firing apply*operator*complete
  10908. -->
  10909. (I3 ^predict-yes N976 - :O )
  10910. inner elaboration loop at bottom goal.
  10911. --- Change Working Memory (PE) ---
  10912. =>WM: (13688: I3 ^predict-no N977)
  10913. <=WM: (13676: N976 ^status complete)
  10914. <=WM: (13675: I3 ^predict-yes N976)
  10915. --- Firing Productions (IE) For State At Depth 1 ---
  10916. --- Inner Elaboration Phase, active level 1 (S1) ---
  10917. Firing monitor*world
  10918. -->
  10919. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10920. --- Change Working Memory (IE) ---
  10921. --- END Application Phase ---
  10922. --- Output Phase ---
  10923. ENV: Agent did: predict-no for direction R in state State-B
  10924. In State-B moving R
  10925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10926. predict error 0
  10927. dir: dir isU
  10928. --- END Output Phase ---
  10929. |\--- Input Phase ---
  10930. =>WM: (13692: I2 ^dir U)
  10931. =>WM: (13691: I2 ^reward 1)
  10932. =>WM: (13690: I2 ^see 0)
  10933. =>WM: (13689: N977 ^status complete)
  10934. <=WM: (13679: I2 ^dir R)
  10935. <=WM: (13678: I2 ^reward 1)
  10936. <=WM: (13677: I2 ^see 1)
  10937. =>WM: (13693: I2 ^level-1 R0-root)
  10938. <=WM: (13680: I2 ^level-1 R1-root)
  10939. --- END Input Phase ---
  10940. --- Proposal Phase ---
  10941. --- Inner Elaboration Phase, active level 1 (S1) ---
  10942. Firing elaborate*copy-see-to-output-link
  10943. -->
  10944. (I3 ^see 0 +)
  10945. Firing elaborate*reward*based*on*reward
  10946. -->
  10947. (R981 ^value 1 +)
  10948. (R1 ^reward R981 +)
  10949. Firing propose*predict-yes
  10950. -->
  10951. (O1955 ^name predict-yes +)
  10952. (S1 ^operator O1955 +)
  10953. Firing propose*predict-no
  10954. -->
  10955. (O1956 ^name predict-no +)
  10956. (S1 ^operator O1956 +)
  10957. Firing rl*prefer*rvt*predict-no*H0*4
  10958. -->
  10959. (S1 ^operator O1954 = 1.)
  10960. Firing rl*prefer*rvt*predict-yes*H0*3
  10961. -->
  10962. (S1 ^operator O1953 = 0.)
  10963. Firing prefer*rvt*predict-yes*H0
  10964. -->
  10965. Firing prefer*rvt*predict-no*H0
  10966. -->
  10967. Firing elaborate*copy-dir-to-output-link
  10968. -->
  10969. (I3 ^dir U +)
  10970. inner elaboration loop at bottom goal.
  10971. Retracting elaborate*copy-see-to-output-link
  10972. -->
  10973. (I3 ^see 1 +)
  10974. Retracting propose*predict-no
  10975. -->
  10976. (O1954 ^name predict-no +)
  10977. (S1 ^operator O1954 +)
  10978. Retracting propose*predict-yes
  10979. -->
  10980. (O1953 ^name predict-yes +)
  10981. (S1 ^operator O1953 +)
  10982. Retracting elaborate*reward*based*on*reward
  10983. -->
  10984. (R980 ^value 1 +)
  10985. (R1 ^reward R980 +)
  10986. Retracting elaborate*copy-dir-to-output-link
  10987. -->
  10988. (I3 ^dir R +)
  10989. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10990. -->
  10991. (S1 ^operator O1954 = 0.7701964997777864)
  10992. Retracting rl*prefer*rvt*predict-no*H0*6
  10993. -->
  10994. (S1 ^operator O1954 = 0.2298717920574965)
  10995. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10996. -->
  10997. (S1 ^operator O1953 = -0.252585164213872)
  10998. Retracting rl*prefer*rvt*predict-yes*H0*5
  10999. -->
  11000. (S1 ^operator O1953 = 0.2939636257009906)
  11001. =>WM: (13701: S1 ^operator O1956 +)
  11002. =>WM: (13700: S1 ^operator O1955 +)
  11003. =>WM: (13699: I3 ^dir U)
  11004. =>WM: (13698: O1956 ^name predict-no)
  11005. =>WM: (13697: O1955 ^name predict-yes)
  11006. =>WM: (13696: R981 ^value 1)
  11007. =>WM: (13695: R1 ^reward R981)
  11008. =>WM: (13694: I3 ^see 0)
  11009. <=WM: (13685: S1 ^operator O1953 +)
  11010. <=WM: (13686: S1 ^operator O1954 +)
  11011. <=WM: (13687: S1 ^operator O1954)
  11012. <=WM: (13671: I3 ^dir R)
  11013. <=WM: (13681: R1 ^reward R980)
  11014. <=WM: (13666: I3 ^see 1)
  11015. <=WM: (13684: O1954 ^name predict-no)
  11016. <=WM: (13683: O1953 ^name predict-yes)
  11017. <=WM: (13682: R980 ^value 1)
  11018. --- Inner Elaboration Phase, active level 1 (S1) ---
  11019. Firing prefer*rvt*predict-yes*H0
  11020. -->
  11021. Firing rl*prefer*rvt*predict-yes*H0*3
  11022. -->
  11023. (S1 ^operator O1955 = 0.)
  11024. Firing prefer*rvt*predict-no*H0
  11025. -->
  11026. Firing rl*prefer*rvt*predict-no*H0*4
  11027. -->
  11028. (S1 ^operator O1956 = 1.)
  11029. inner elaboration loop at bottom goal.
  11030. Retracting rl*prefer*rvt*predict-no*H0*4
  11031. -->
  11032. (S1 ^operator O1954 = 1.)
  11033. Retracting rl*prefer*rvt*predict-yes*H0*3
  11034. -->
  11035. (S1 ^operator O1953 = 0.)
  11036. --- END Proposal Phase ---
  11037. --- Decision Phase ---
  11038. RL update rl*prefer*rvt*predict-no*H0*6 0.611922 -0.38205 0.229872 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.843023,0.133109)
  11039. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388134 0.382063 0.770196 -> 0.388128 0.382061 0.77019(R,m,v=1,1,0)
  11040. =>WM: (13702: S1 ^operator O1956)
  11041. 978: O: O1956 (predict-no)
  11042. --- END Decision Phase ---
  11043. --- Application Phase ---
  11044. --- Firing Productions (PE) For State At Depth 1 ---
  11045. --- Inner Elaboration Phase, active level 1 (S1) ---
  11046. Firing apply*operator
  11047. -->
  11048. (I3 ^predict-no N978 + :O )
  11049. Firing apply*operator*complete
  11050. -->
  11051. (I3 ^predict-no N977 - :O )
  11052. inner elaboration loop at bottom goal.
  11053. --- Change Working Memory (PE) ---
  11054. =>WM: (13703: I3 ^predict-no N978)
  11055. <=WM: (13689: N977 ^status complete)
  11056. <=WM: (13688: I3 ^predict-no N977)
  11057. --- Firing Productions (IE) For State At Depth 1 ---
  11058. --- Inner Elaboration Phase, active level 1 (S1) ---
  11059. Firing monitor*world
  11060. -->
  11061. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11062. --- Change Working Memory (IE) ---
  11063. --- END Application Phase ---
  11064. --- Output Phase ---
  11065. ENV: Agent did: predict-no for direction U in state State-B
  11066. In State-B moving U
  11067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11068. predict error 0
  11069. dir: dir isU
  11070. --- END Output Phase ---
  11071. -/|--- Input Phase ---
  11072. =>WM: (13707: I2 ^dir U)
  11073. =>WM: (13706: I2 ^reward 1)
  11074. =>WM: (13705: I2 ^see 0)
  11075. =>WM: (13704: N978 ^status complete)
  11076. <=WM: (13692: I2 ^dir U)
  11077. <=WM: (13691: I2 ^reward 1)
  11078. <=WM: (13690: I2 ^see 0)
  11079. =>WM: (13708: I2 ^level-1 R0-root)
  11080. <=WM: (13693: I2 ^level-1 R0-root)
  11081. --- END Input Phase ---
  11082. --- Proposal Phase ---
  11083. --- Inner Elaboration Phase, active level 1 (S1) ---
  11084. Firing elaborate*copy-see-to-output-link
  11085. -->
  11086. (I3 ^see 0 +)
  11087. Firing elaborate*reward*based*on*reward
  11088. -->
  11089. (R982 ^value 1 +)
  11090. (R1 ^reward R982 +)
  11091. Firing propose*predict-yes
  11092. -->
  11093. (O1957 ^name predict-yes +)
  11094. (S1 ^operator O1957 +)
  11095. Firing propose*predict-no
  11096. -->
  11097. (O1958 ^name predict-no +)
  11098. (S1 ^operator O1958 +)
  11099. Firing rl*prefer*rvt*predict-no*H0*4
  11100. -->
  11101. (S1 ^operator O1956 = 1.)
  11102. Firing rl*prefer*rvt*predict-yes*H0*3
  11103. -->
  11104. (S1 ^operator O1955 = 0.)
  11105. Firing prefer*rvt*predict-yes*H0
  11106. -->
  11107. Firing prefer*rvt*predict-no*H0
  11108. -->
  11109. Firing elaborate*copy-dir-to-output-link
  11110. -->
  11111. (I3 ^dir U +)
  11112. inner elaboration loop at bottom goal.
  11113. Retracting elaborate*copy-see-to-output-link
  11114. -->
  11115. (I3 ^see 0 +)
  11116. Retracting propose*predict-no
  11117. -->
  11118. (O1956 ^name predict-no +)
  11119. (S1 ^operator O1956 +)
  11120. Retracting propose*predict-yes
  11121. -->
  11122. (O1955 ^name predict-yes +)
  11123. (S1 ^operator O1955 +)
  11124. Retracting elaborate*reward*based*on*reward
  11125. -->
  11126. (R981 ^value 1 +)
  11127. (R1 ^reward R981 +)
  11128. Retracting elaborate*copy-dir-to-output-link
  11129. -->
  11130. (I3 ^dir U +)
  11131. Retracting rl*prefer*rvt*predict-no*H0*4
  11132. -->
  11133. (S1 ^operator O1956 = 1.)
  11134. Retracting rl*prefer*rvt*predict-yes*H0*3
  11135. -->
  11136. (S1 ^operator O1955 = 0.)
  11137. =>WM: (13714: S1 ^operator O1958 +)
  11138. =>WM: (13713: S1 ^operator O1957 +)
  11139. =>WM: (13712: O1958 ^name predict-no)
  11140. =>WM: (13711: O1957 ^name predict-yes)
  11141. =>WM: (13710: R982 ^value 1)
  11142. =>WM: (13709: R1 ^reward R982)
  11143. <=WM: (13700: S1 ^operator O1955 +)
  11144. <=WM: (13701: S1 ^operator O1956 +)
  11145. <=WM: (13702: S1 ^operator O1956)
  11146. <=WM: (13695: R1 ^reward R981)
  11147. <=WM: (13698: O1956 ^name predict-no)
  11148. <=WM: (13697: O1955 ^name predict-yes)
  11149. <=WM: (13696: R981 ^value 1)
  11150. --- Inner Elaboration Phase, active level 1 (S1) ---
  11151. Firing prefer*rvt*predict-yes*H0
  11152. -->
  11153. Firing rl*prefer*rvt*predict-yes*H0*3
  11154. -->
  11155. (S1 ^operator O1957 = 0.)
  11156. Firing prefer*rvt*predict-no*H0
  11157. -->
  11158. Firing rl*prefer*rvt*predict-no*H0*4
  11159. -->
  11160. (S1 ^operator O1958 = 1.)
  11161. inner elaboration loop at bottom goal.
  11162. Retracting rl*prefer*rvt*predict-no*H0*4
  11163. -->
  11164. (S1 ^operator O1956 = 1.)
  11165. Retracting rl*prefer*rvt*predict-yes*H0*3
  11166. -->
  11167. (S1 ^operator O1955 = 0.)
  11168. --- END Proposal Phase ---
  11169. --- Decision Phase ---
  11170. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11171. =>WM: (13715: S1 ^operator O1958)
  11172. 979: O: O1958 (predict-no)
  11173. --- END Decision Phase ---
  11174. --- Application Phase ---
  11175. --- Firing Productions (PE) For State At Depth 1 ---
  11176. --- Inner Elaboration Phase, active level 1 (S1) ---
  11177. Firing apply*operator
  11178. -->
  11179. (I3 ^predict-no N979 + :O )
  11180. Firing apply*operator*complete
  11181. -->
  11182. (I3 ^predict-no N978 - :O )
  11183. inner elaboration loop at bottom goal.
  11184. --- Change Working Memory (PE) ---
  11185. =>WM: (13716: I3 ^predict-no N979)
  11186. <=WM: (13704: N978 ^status complete)
  11187. <=WM: (13703: I3 ^predict-no N978)
  11188. --- Firing Productions (IE) For State At Depth 1 ---
  11189. --- Inner Elaboration Phase, active level 1 (S1) ---
  11190. Firing monitor*world
  11191. -->
  11192. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11193. --- Change Working Memory (IE) ---
  11194. --- END Application Phase ---
  11195. --- Output Phase ---
  11196. ENV: Agent did: predict-no for direction U in state State-B
  11197. In State-B moving U
  11198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11199. predict error 0
  11200. dir: dir isL
  11201. --- END Output Phase ---
  11202. \---- Input Phase ---
  11203. =>WM: (13720: I2 ^dir L)
  11204. =>WM: (13719: I2 ^reward 1)
  11205. =>WM: (13718: I2 ^see 0)
  11206. =>WM: (13717: N979 ^status complete)
  11207. <=WM: (13707: I2 ^dir U)
  11208. <=WM: (13706: I2 ^reward 1)
  11209. <=WM: (13705: I2 ^see 0)
  11210. =>WM: (13721: I2 ^level-1 R0-root)
  11211. <=WM: (13708: I2 ^level-1 R0-root)
  11212. --- END Input Phase ---
  11213. --- Proposal Phase ---
  11214. --- Inner Elaboration Phase, active level 1 (S1) ---
  11215. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11216. -->
  11217. (S1 ^operator O1957 = 0.6195601949549704)
  11218. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11219. -->
  11220. (S1 ^operator O1958 = -0.2190661556260421)
  11221. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11222. -->
  11223. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11224. -->
  11225. Firing elaborate*copy-see-to-output-link
  11226. -->
  11227. (I3 ^see 0 +)
  11228. Firing elaborate*reward*based*on*reward
  11229. -->
  11230. (R983 ^value 1 +)
  11231. (R1 ^reward R983 +)
  11232. Firing propose*predict-yes
  11233. -->
  11234. (O1959 ^name predict-yes +)
  11235. (S1 ^operator O1959 +)
  11236. Firing propose*predict-no
  11237. -->
  11238. (O1960 ^name predict-no +)
  11239. (S1 ^operator O1960 +)
  11240. Firing rl*prefer*rvt*predict-no*H0*2
  11241. -->
  11242. (S1 ^operator O1958 = 0.3140233963466647)
  11243. Firing rl*prefer*rvt*predict-yes*H0*1
  11244. -->
  11245. (S1 ^operator O1957 = 0.3804118472151704)
  11246. Firing prefer*rvt*predict-yes*H0
  11247. -->
  11248. Firing prefer*rvt*predict-no*H0
  11249. -->
  11250. Firing elaborate*copy-dir-to-output-link
  11251. -->
  11252. (I3 ^dir L +)
  11253. inner elaboration loop at bottom goal.
  11254. Retracting elaborate*copy-see-to-output-link
  11255. -->
  11256. (I3 ^see 0 +)
  11257. Retracting propose*predict-no
  11258. -->
  11259. (O1958 ^name predict-no +)
  11260. (S1 ^operator O1958 +)
  11261. Retracting propose*predict-yes
  11262. -->
  11263. (O1957 ^name predict-yes +)
  11264. (S1 ^operator O1957 +)
  11265. Retracting elaborate*reward*based*on*reward
  11266. -->
  11267. (R982 ^value 1 +)
  11268. (R1 ^reward R982 +)
  11269. Retracting elaborate*copy-dir-to-output-link
  11270. -->
  11271. (I3 ^dir U +)
  11272. Retracting rl*prefer*rvt*predict-no*H0*4
  11273. -->
  11274. (S1 ^operator O1958 = 1.)
  11275. Retracting rl*prefer*rvt*predict-yes*H0*3
  11276. -->
  11277. (S1 ^operator O1957 = 0.)
  11278. =>WM: (13728: S1 ^operator O1960 +)
  11279. =>WM: (13727: S1 ^operator O1959 +)
  11280. =>WM: (13726: I3 ^dir L)
  11281. =>WM: (13725: O1960 ^name predict-no)
  11282. =>WM: (13724: O1959 ^name predict-yes)
  11283. =>WM: (13723: R983 ^value 1)
  11284. =>WM: (13722: R1 ^reward R983)
  11285. <=WM: (13713: S1 ^operator O1957 +)
  11286. <=WM: (13714: S1 ^operator O1958 +)
  11287. <=WM: (13715: S1 ^operator O1958)
  11288. <=WM: (13699: I3 ^dir U)
  11289. <=WM: (13709: R1 ^reward R982)
  11290. <=WM: (13712: O1958 ^name predict-no)
  11291. <=WM: (13711: O1957 ^name predict-yes)
  11292. <=WM: (13710: R982 ^value 1)
  11293. --- Inner Elaboration Phase, active level 1 (S1) ---
  11294. Firing prefer*rvt*predict-yes*H0
  11295. -->
  11296. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11297. -->
  11298. (S1 ^operator O1959 = 0.6195601949549704)
  11299. Firing rl*prefer*rvt*predict-yes*H0*1
  11300. -->
  11301. (S1 ^operator O1959 = 0.3804118472151704)
  11302. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11303. -->
  11304. Firing prefer*rvt*predict-no*H0
  11305. -->
  11306. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11307. -->
  11308. (S1 ^operator O1960 = -0.2190661556260421)
  11309. Firing rl*prefer*rvt*predict-no*H0*2
  11310. -->
  11311. (S1 ^operator O1960 = 0.3140233963466647)
  11312. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11313. -->
  11314. inner elaboration loop at bottom goal.
  11315. Retracting rl*prefer*rvt*predict-no*H0*2
  11316. -->
  11317. (S1 ^operator O1958 = 0.3140233963466647)
  11318. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11319. -->
  11320. (S1 ^operator O1958 = -0.2190661556260421)
  11321. Retracting rl*prefer*rvt*predict-yes*H0*1
  11322. -->
  11323. (S1 ^operator O1957 = 0.3804118472151704)
  11324. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11325. -->
  11326. (S1 ^operator O1957 = 0.6195601949549704)
  11327. --- END Proposal Phase ---
  11328. --- Decision Phase ---
  11329. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11330. =>WM: (13729: S1 ^operator O1959)
  11331. 980: O: O1959 (predict-yes)
  11332. --- END Decision Phase ---
  11333. --- Application Phase ---
  11334. --- Firing Productions (PE) For State At Depth 1 ---
  11335. --- Inner Elaboration Phase, active level 1 (S1) ---
  11336. Firing apply*operator
  11337. -->
  11338. (I3 ^predict-yes N980 + :O )
  11339. Firing apply*operator*complete
  11340. -->
  11341. (I3 ^predict-no N979 - :O )
  11342. inner elaboration loop at bottom goal.
  11343. --- Change Working Memory (PE) ---
  11344. =>WM: (13730: I3 ^predict-yes N980)
  11345. <=WM: (13717: N979 ^status complete)
  11346. <=WM: (13716: I3 ^predict-no N979)
  11347. --- Firing Productions (IE) For State At Depth 1 ---
  11348. --- Inner Elaboration Phase, active level 1 (S1) ---
  11349. Firing monitor*world
  11350. -->
  11351. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11352. --- Change Working Memory (IE) ---
  11353. --- END Application Phase ---
  11354. --- Output Phase ---
  11355. ENV: Agent did: predict-yes for direction L in state State-B
  11356. In State-B moving L
  11357. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11358. predict error 0
  11359. dir: dir isR
  11360. --- END Output Phase ---
  11361. /|\--- Input Phase ---
  11362. =>WM: (13734: I2 ^dir R)
  11363. =>WM: (13733: I2 ^reward 1)
  11364. =>WM: (13732: I2 ^see 1)
  11365. =>WM: (13731: N980 ^status complete)
  11366. <=WM: (13720: I2 ^dir L)
  11367. <=WM: (13719: I2 ^reward 1)
  11368. <=WM: (13718: I2 ^see 0)
  11369. =>WM: (13735: I2 ^level-1 L1-root)
  11370. <=WM: (13721: I2 ^level-1 R0-root)
  11371. --- END Input Phase ---
  11372. --- Proposal Phase ---
  11373. --- Inner Elaboration Phase, active level 1 (S1) ---
  11374. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11375. -->
  11376. (S1 ^operator O1959 = 0.7064055971121673)
  11377. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11378. -->
  11379. (S1 ^operator O1960 = -0.1937987592593187)
  11380. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11381. -->
  11382. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11383. -->
  11384. Firing elaborate*copy-see-to-output-link
  11385. -->
  11386. (I3 ^see 1 +)
  11387. Firing elaborate*reward*based*on*reward
  11388. -->
  11389. (R984 ^value 1 +)
  11390. (R1 ^reward R984 +)
  11391. Firing propose*predict-yes
  11392. -->
  11393. (O1961 ^name predict-yes +)
  11394. (S1 ^operator O1961 +)
  11395. Firing propose*predict-no
  11396. -->
  11397. (O1962 ^name predict-no +)
  11398. (S1 ^operator O1962 +)
  11399. Firing rl*prefer*rvt*predict-no*H0*6
  11400. -->
  11401. (S1 ^operator O1960 = 0.2298662376128736)
  11402. Firing rl*prefer*rvt*predict-yes*H0*5
  11403. -->
  11404. (S1 ^operator O1959 = 0.2939636257009906)
  11405. Firing prefer*rvt*predict-yes*H0
  11406. -->
  11407. Firing prefer*rvt*predict-no*H0
  11408. -->
  11409. Firing elaborate*copy-dir-to-output-link
  11410. -->
  11411. (I3 ^dir R +)
  11412. inner elaboration loop at bottom goal.
  11413. Retracting elaborate*copy-see-to-output-link
  11414. -->
  11415. (I3 ^see 0 +)
  11416. Retracting propose*predict-no
  11417. -->
  11418. (O1960 ^name predict-no +)
  11419. (S1 ^operator O1960 +)
  11420. Retracting propose*predict-yes
  11421. -->
  11422. (O1959 ^name predict-yes +)
  11423. (S1 ^operator O1959 +)
  11424. Retracting elaborate*reward*based*on*reward
  11425. -->
  11426. (R983 ^value 1 +)
  11427. (R1 ^reward R983 +)
  11428. Retracting elaborate*copy-dir-to-output-link
  11429. -->
  11430. (I3 ^dir L +)
  11431. Retracting rl*prefer*rvt*predict-no*H0*2
  11432. -->
  11433. (S1 ^operator O1960 = 0.3140233963466647)
  11434. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11435. -->
  11436. (S1 ^operator O1960 = -0.2190661556260421)
  11437. Retracting rl*prefer*rvt*predict-yes*H0*1
  11438. -->
  11439. (S1 ^operator O1959 = 0.3804118472151704)
  11440. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11441. -->
  11442. (S1 ^operator O1959 = 0.6195601949549704)
  11443. =>WM: (13743: S1 ^operator O1962 +)
  11444. =>WM: (13742: S1 ^operator O1961 +)
  11445. =>WM: (13741: I3 ^dir R)
  11446. =>WM: (13740: O1962 ^name predict-no)
  11447. =>WM: (13739: O1961 ^name predict-yes)
  11448. =>WM: (13738: R984 ^value 1)
  11449. =>WM: (13737: R1 ^reward R984)
  11450. =>WM: (13736: I3 ^see 1)
  11451. <=WM: (13727: S1 ^operator O1959 +)
  11452. <=WM: (13729: S1 ^operator O1959)
  11453. <=WM: (13728: S1 ^operator O1960 +)
  11454. <=WM: (13726: I3 ^dir L)
  11455. <=WM: (13722: R1 ^reward R983)
  11456. <=WM: (13694: I3 ^see 0)
  11457. <=WM: (13725: O1960 ^name predict-no)
  11458. <=WM: (13724: O1959 ^name predict-yes)
  11459. <=WM: (13723: R983 ^value 1)
  11460. --- Inner Elaboration Phase, active level 1 (S1) ---
  11461. Firing prefer*rvt*predict-yes*H0
  11462. -->
  11463. Firing rl*prefer*rvt*predict-yes*H0*5
  11464. -->
  11465. (S1 ^operator O1961 = 0.2939636257009906)
  11466. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11467. -->
  11468. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11469. -->
  11470. (S1 ^operator O1961 = 0.7064055971121673)
  11471. Firing prefer*rvt*predict-no*H0
  11472. -->
  11473. Firing rl*prefer*rvt*predict-no*H0*6
  11474. -->
  11475. (S1 ^operator O1962 = 0.2298662376128736)
  11476. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11477. -->
  11478. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11479. -->
  11480. (S1 ^operator O1962 = -0.1937987592593187)
  11481. inner elaboration loop at bottom goal.
  11482. Retracting rl*prefer*rvt*predict-no*H0*6
  11483. -->
  11484. (S1 ^operator O1960 = 0.2298662376128736)
  11485. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11486. -->
  11487. (S1 ^operator O1960 = -0.1937987592593187)
  11488. Retracting rl*prefer*rvt*predict-yes*H0*5
  11489. -->
  11490. (S1 ^operator O1959 = 0.2939636257009906)
  11491. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11492. -->
  11493. (S1 ^operator O1959 = 0.7064055971121673)
  11494. --- END Proposal Phase ---
  11495. --- Decision Phase ---
  11496. RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.826087,0.144565)
  11497. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478628 0.140932 0.61956 -> 0.478631 0.140932 0.619563(R,m,v=1,1,0)
  11498. =>WM: (13744: S1 ^operator O1961)
  11499. 981: O: O1961 (predict-yes)
  11500. --- END Decision Phase ---
  11501. --- Application Phase ---
  11502. --- Firing Productions (PE) For State At Depth 1 ---
  11503. --- Inner Elaboration Phase, active level 1 (S1) ---
  11504. Firing apply*operator
  11505. -->
  11506. (I3 ^predict-yes N981 + :O )
  11507. Firing apply*operator*complete
  11508. -->
  11509. (I3 ^predict-yes N980 - :O )
  11510. inner elaboration loop at bottom goal.
  11511. --- Change Working Memory (PE) ---
  11512. =>WM: (13745: I3 ^predict-yes N981)
  11513. <=WM: (13731: N980 ^status complete)
  11514. <=WM: (13730: I3 ^predict-yes N980)
  11515. --- Firing Productions (IE) For State At Depth 1 ---
  11516. --- Inner Elaboration Phase, active level 1 (S1) ---
  11517. Firing monitor*world
  11518. -->
  11519. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11520. --- Change Working Memory (IE) ---
  11521. --- END Application Phase ---
  11522. --- Output Phase ---
  11523. ENV: Agent did: predict-yes for direction R in state State-A
  11524. In State-A moving R
  11525. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11526. predict error 0
  11527. dir: dir isU
  11528. --- END Output Phase ---
  11529. ---- Input Phase ---
  11530. =>WM: (13749: I2 ^dir U)
  11531. =>WM: (13748: I2 ^reward 1)
  11532. =>WM: (13747: I2 ^see 1)
  11533. =>WM: (13746: N981 ^status complete)
  11534. <=WM: (13734: I2 ^dir R)
  11535. <=WM: (13733: I2 ^reward 1)
  11536. <=WM: (13732: I2 ^see 1)
  11537. =>WM: (13750: I2 ^level-1 R1-root)
  11538. <=WM: (13735: I2 ^level-1 L1-root)
  11539. --- END Input Phase ---
  11540. --- Proposal Phase ---
  11541. --- Inner Elaboration Phase, active level 1 (S1) ---
  11542. Firing elaborate*copy-see-to-output-link
  11543. -->
  11544. (I3 ^see 1 +)
  11545. Firing elaborate*reward*based*on*reward
  11546. -->
  11547. (R985 ^value 1 +)
  11548. (R1 ^reward R985 +)
  11549. Firing propose*predict-yes
  11550. -->
  11551. (O1963 ^name predict-yes +)
  11552. (S1 ^operator O1963 +)
  11553. Firing propose*predict-no
  11554. -->
  11555. (O1964 ^name predict-no +)
  11556. (S1 ^operator O1964 +)
  11557. Firing rl*prefer*rvt*predict-no*H0*4
  11558. -->
  11559. (S1 ^operator O1962 = 1.)
  11560. Firing rl*prefer*rvt*predict-yes*H0*3
  11561. -->
  11562. (S1 ^operator O1961 = 0.)
  11563. Firing prefer*rvt*predict-yes*H0
  11564. -->
  11565. Firing prefer*rvt*predict-no*H0
  11566. -->
  11567. Firing elaborate*copy-dir-to-output-link
  11568. -->
  11569. (I3 ^dir U +)
  11570. inner elaboration loop at bottom goal.
  11571. Retracting elaborate*copy-see-to-output-link
  11572. -->
  11573. (I3 ^see 1 +)
  11574. Retracting propose*predict-no
  11575. -->
  11576. (O1962 ^name predict-no +)
  11577. (S1 ^operator O1962 +)
  11578. Retracting propose*predict-yes
  11579. -->
  11580. (O1961 ^name predict-yes +)
  11581. (S1 ^operator O1961 +)
  11582. Retracting elaborate*reward*based*on*reward
  11583. -->
  11584. (R984 ^value 1 +)
  11585. (R1 ^reward R984 +)
  11586. Retracting elaborate*copy-dir-to-output-link
  11587. -->
  11588. (I3 ^dir R +)
  11589. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11590. -->
  11591. (S1 ^operator O1962 = -0.1937987592593187)
  11592. Retracting rl*prefer*rvt*predict-no*H0*6
  11593. -->
  11594. (S1 ^operator O1962 = 0.2298662376128736)
  11595. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11596. -->
  11597. (S1 ^operator O1961 = 0.7064055971121673)
  11598. Retracting rl*prefer*rvt*predict-yes*H0*5
  11599. -->
  11600. (S1 ^operator O1961 = 0.2939636257009906)
  11601. =>WM: (13757: S1 ^operator O1964 +)
  11602. =>WM: (13756: S1 ^operator O1963 +)
  11603. =>WM: (13755: I3 ^dir U)
  11604. =>WM: (13754: O1964 ^name predict-no)
  11605. =>WM: (13753: O1963 ^name predict-yes)
  11606. =>WM: (13752: R985 ^value 1)
  11607. =>WM: (13751: R1 ^reward R985)
  11608. <=WM: (13742: S1 ^operator O1961 +)
  11609. <=WM: (13744: S1 ^operator O1961)
  11610. <=WM: (13743: S1 ^operator O1962 +)
  11611. <=WM: (13741: I3 ^dir R)
  11612. <=WM: (13737: R1 ^reward R984)
  11613. <=WM: (13740: O1962 ^name predict-no)
  11614. <=WM: (13739: O1961 ^name predict-yes)
  11615. <=WM: (13738: R984 ^value 1)
  11616. --- Inner Elaboration Phase, active level 1 (S1) ---
  11617. Firing prefer*rvt*predict-yes*H0
  11618. -->
  11619. Firing rl*prefer*rvt*predict-yes*H0*3
  11620. -->
  11621. (S1 ^operator O1963 = 0.)
  11622. Firing prefer*rvt*predict-no*H0
  11623. -->
  11624. Firing rl*prefer*rvt*predict-no*H0*4
  11625. -->
  11626. (S1 ^operator O1964 = 1.)
  11627. inner elaboration loop at bottom goal.
  11628. Retracting rl*prefer*rvt*predict-no*H0*4
  11629. -->
  11630. (S1 ^operator O1962 = 1.)
  11631. Retracting rl*prefer*rvt*predict-yes*H0*3
  11632. -->
  11633. (S1 ^operator O1961 = 0.)
  11634. --- END Proposal Phase ---
  11635. --- Decision Phase ---
  11636. RL update rl*prefer*rvt*predict-yes*H0*5 0.50104 -0.207077 0.293964 -> 0.501013 -0.20708 0.293933(R,m,v=1,0.842105,0.133845)
  11637. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499292 0.207114 0.706406 -> 0.499259 0.20711 0.70637(R,m,v=1,1,0)
  11638. =>WM: (13758: S1 ^operator O1964)
  11639. 982: O: O1964 (predict-no)
  11640. --- END Decision Phase ---
  11641. --- Application Phase ---
  11642. --- Firing Productions (PE) For State At Depth 1 ---
  11643. --- Inner Elaboration Phase, active level 1 (S1) ---
  11644. Firing apply*operator
  11645. -->
  11646. (I3 ^predict-no N982 + :O )
  11647. Firing apply*operator*complete
  11648. -->
  11649. (I3 ^predict-yes N981 - :O )
  11650. inner elaboration loop at bottom goal.
  11651. --- Change Working Memory (PE) ---
  11652. =>WM: (13759: I3 ^predict-no N982)
  11653. <=WM: (13746: N981 ^status complete)
  11654. <=WM: (13745: I3 ^predict-yes N981)
  11655. --- Firing Productions (IE) For State At Depth 1 ---
  11656. --- Inner Elaboration Phase, active level 1 (S1) ---
  11657. Firing monitor*world
  11658. -->
  11659. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11660. --- Change Working Memory (IE) ---
  11661. --- END Application Phase ---
  11662. --- Output Phase ---
  11663. ENV: Agent did: predict-no for direction U in state State-B
  11664. In State-B moving U
  11665. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11666. predict error 0
  11667. dir: dir isR
  11668. --- END Output Phase ---
  11669. /|--- Input Phase ---
  11670. =>WM: (13763: I2 ^dir R)
  11671. =>WM: (13762: I2 ^reward 1)
  11672. =>WM: (13761: I2 ^see 0)
  11673. =>WM: (13760: N982 ^status complete)
  11674. <=WM: (13749: I2 ^dir U)
  11675. <=WM: (13748: I2 ^reward 1)
  11676. <=WM: (13747: I2 ^see 1)
  11677. =>WM: (13764: I2 ^level-1 R1-root)
  11678. <=WM: (13750: I2 ^level-1 R1-root)
  11679. --- END Input Phase ---
  11680. --- Proposal Phase ---
  11681. --- Inner Elaboration Phase, active level 1 (S1) ---
  11682. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11683. -->
  11684. (S1 ^operator O1963 = -0.252585164213872)
  11685. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11686. -->
  11687. (S1 ^operator O1964 = 0.7701897521634826)
  11688. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11689. -->
  11690. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11691. -->
  11692. Firing elaborate*copy-see-to-output-link
  11693. -->
  11694. (I3 ^see 0 +)
  11695. Firing elaborate*reward*based*on*reward
  11696. -->
  11697. (R986 ^value 1 +)
  11698. (R1 ^reward R986 +)
  11699. Firing propose*predict-yes
  11700. -->
  11701. (O1965 ^name predict-yes +)
  11702. (S1 ^operator O1965 +)
  11703. Firing propose*predict-no
  11704. -->
  11705. (O1966 ^name predict-no +)
  11706. (S1 ^operator O1966 +)
  11707. Firing rl*prefer*rvt*predict-no*H0*6
  11708. -->
  11709. (S1 ^operator O1964 = 0.2298662376128736)
  11710. Firing rl*prefer*rvt*predict-yes*H0*5
  11711. -->
  11712. (S1 ^operator O1963 = 0.2939329791093226)
  11713. Firing prefer*rvt*predict-yes*H0
  11714. -->
  11715. Firing prefer*rvt*predict-no*H0
  11716. -->
  11717. Firing elaborate*copy-dir-to-output-link
  11718. -->
  11719. (I3 ^dir R +)
  11720. inner elaboration loop at bottom goal.
  11721. Retracting elaborate*copy-see-to-output-link
  11722. -->
  11723. (I3 ^see 1 +)
  11724. Retracting propose*predict-no
  11725. -->
  11726. (O1964 ^name predict-no +)
  11727. (S1 ^operator O1964 +)
  11728. Retracting propose*predict-yes
  11729. -->
  11730. (O1963 ^name predict-yes +)
  11731. (S1 ^operator O1963 +)
  11732. Retracting elaborate*reward*based*on*reward
  11733. -->
  11734. (R985 ^value 1 +)
  11735. (R1 ^reward R985 +)
  11736. Retracting elaborate*copy-dir-to-output-link
  11737. -->
  11738. (I3 ^dir U +)
  11739. Retracting rl*prefer*rvt*predict-no*H0*4
  11740. -->
  11741. (S1 ^operator O1964 = 1.)
  11742. Retracting rl*prefer*rvt*predict-yes*H0*3
  11743. -->
  11744. (S1 ^operator O1963 = 0.)
  11745. =>WM: (13772: S1 ^operator O1966 +)
  11746. =>WM: (13771: S1 ^operator O1965 +)
  11747. =>WM: (13770: I3 ^dir R)
  11748. =>WM: (13769: O1966 ^name predict-no)
  11749. =>WM: (13768: O1965 ^name predict-yes)
  11750. =>WM: (13767: R986 ^value 1)
  11751. =>WM: (13766: R1 ^reward R986)
  11752. =>WM: (13765: I3 ^see 0)
  11753. <=WM: (13756: S1 ^operator O1963 +)
  11754. <=WM: (13757: S1 ^operator O1964 +)
  11755. <=WM: (13758: S1 ^operator O1964)
  11756. <=WM: (13755: I3 ^dir U)
  11757. <=WM: (13751: R1 ^reward R985)
  11758. <=WM: (13736: I3 ^see 1)
  11759. <=WM: (13754: O1964 ^name predict-no)
  11760. <=WM: (13753: O1963 ^name predict-yes)
  11761. <=WM: (13752: R985 ^value 1)
  11762. --- Inner Elaboration Phase, active level 1 (S1) ---
  11763. Firing prefer*rvt*predict-yes*H0
  11764. -->
  11765. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11766. -->
  11767. (S1 ^operator O1965 = -0.252585164213872)
  11768. Firing rl*prefer*rvt*predict-yes*H0*5
  11769. -->
  11770. (S1 ^operator O1965 = 0.2939329791093226)
  11771. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11772. -->
  11773. Firing prefer*rvt*predict-no*H0
  11774. -->
  11775. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11776. -->
  11777. (S1 ^operator O1966 = 0.7701897521634826)
  11778. Firing rl*prefer*rvt*predict-no*H0*6
  11779. -->
  11780. (S1 ^operator O1966 = 0.2298662376128736)
  11781. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11782. -->
  11783. inner elaboration loop at bottom goal.
  11784. Retracting rl*prefer*rvt*predict-no*H0*6
  11785. -->
  11786. (S1 ^operator O1964 = 0.2298662376128736)
  11787. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11788. -->
  11789. (S1 ^operator O1964 = 0.7701897521634826)
  11790. Retracting rl*prefer*rvt*predict-yes*H0*5
  11791. -->
  11792. (S1 ^operator O1963 = 0.2939329791093226)
  11793. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11794. -->
  11795. (S1 ^operator O1963 = -0.252585164213872)
  11796. --- END Proposal Phase ---
  11797. --- Decision Phase ---
  11798. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11799. =>WM: (13773: S1 ^operator O1966)
  11800. 983: O: O1966 (predict-no)
  11801. --- END Decision Phase ---
  11802. --- Application Phase ---
  11803. --- Firing Productions (PE) For State At Depth 1 ---
  11804. --- Inner Elaboration Phase, active level 1 (S1) ---
  11805. Firing apply*operator
  11806. -->
  11807. (I3 ^predict-no N983 + :O )
  11808. Firing apply*operator*complete
  11809. -->
  11810. (I3 ^predict-no N982 - :O )
  11811. inner elaboration loop at bottom goal.
  11812. --- Change Working Memory (PE) ---
  11813. =>WM: (13774: I3 ^predict-no N983)
  11814. <=WM: (13760: N982 ^status complete)
  11815. <=WM: (13759: I3 ^predict-no N982)
  11816. --- Firing Productions (IE) For State At Depth 1 ---
  11817. --- Inner Elaboration Phase, active level 1 (S1) ---
  11818. Firing monitor*world
  11819. -->
  11820. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11821. --- Change Working Memory (IE) ---
  11822. --- END Application Phase ---
  11823. --- Output Phase ---
  11824. ENV: Agent did: predict-no for direction R in state State-B
  11825. In State-B moving R
  11826. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11827. predict error 0
  11828. dir: dir isL
  11829. --- END Output Phase ---
  11830. \---- Input Phase ---
  11831. =>WM: (13778: I2 ^dir L)
  11832. =>WM: (13777: I2 ^reward 1)
  11833. =>WM: (13776: I2 ^see 0)
  11834. =>WM: (13775: N983 ^status complete)
  11835. <=WM: (13763: I2 ^dir R)
  11836. <=WM: (13762: I2 ^reward 1)
  11837. <=WM: (13761: I2 ^see 0)
  11838. =>WM: (13779: I2 ^level-1 R0-root)
  11839. <=WM: (13764: I2 ^level-1 R1-root)
  11840. --- END Input Phase ---
  11841. --- Proposal Phase ---
  11842. --- Inner Elaboration Phase, active level 1 (S1) ---
  11843. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11844. -->
  11845. (S1 ^operator O1965 = 0.6195629046335391)
  11846. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11847. -->
  11848. (S1 ^operator O1966 = -0.2190661556260421)
  11849. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11850. -->
  11851. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11852. -->
  11853. Firing elaborate*copy-see-to-output-link
  11854. -->
  11855. (I3 ^see 0 +)
  11856. Firing elaborate*reward*based*on*reward
  11857. -->
  11858. (R987 ^value 1 +)
  11859. (R1 ^reward R987 +)
  11860. Firing propose*predict-yes
  11861. -->
  11862. (O1967 ^name predict-yes +)
  11863. (S1 ^operator O1967 +)
  11864. Firing propose*predict-no
  11865. -->
  11866. (O1968 ^name predict-no +)
  11867. (S1 ^operator O1968 +)
  11868. Firing rl*prefer*rvt*predict-no*H0*2
  11869. -->
  11870. (S1 ^operator O1966 = 0.3140233963466647)
  11871. Firing rl*prefer*rvt*predict-yes*H0*1
  11872. -->
  11873. (S1 ^operator O1965 = 0.3804141458478695)
  11874. Firing prefer*rvt*predict-yes*H0
  11875. -->
  11876. Firing prefer*rvt*predict-no*H0
  11877. -->
  11878. Firing elaborate*copy-dir-to-output-link
  11879. -->
  11880. (I3 ^dir L +)
  11881. inner elaboration loop at bottom goal.
  11882. Retracting elaborate*copy-see-to-output-link
  11883. -->
  11884. (I3 ^see 0 +)
  11885. Retracting propose*predict-no
  11886. -->
  11887. (O1966 ^name predict-no +)
  11888. (S1 ^operator O1966 +)
  11889. Retracting propose*predict-yes
  11890. -->
  11891. (O1965 ^name predict-yes +)
  11892. (S1 ^operator O1965 +)
  11893. Retracting elaborate*reward*based*on*reward
  11894. -->
  11895. (R986 ^value 1 +)
  11896. (R1 ^reward R986 +)
  11897. Retracting elaborate*copy-dir-to-output-link
  11898. -->
  11899. (I3 ^dir R +)
  11900. Retracting rl*prefer*rvt*predict-no*H0*6
  11901. -->
  11902. (S1 ^operator O1966 = 0.2298662376128736)
  11903. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11904. -->
  11905. (S1 ^operator O1966 = 0.7701897521634826)
  11906. Retracting rl*prefer*rvt*predict-yes*H0*5
  11907. -->
  11908. (S1 ^operator O1965 = 0.2939329791093226)
  11909. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11910. -->
  11911. (S1 ^operator O1965 = -0.252585164213872)
  11912. =>WM: (13786: S1 ^operator O1968 +)
  11913. =>WM: (13785: S1 ^operator O1967 +)
  11914. =>WM: (13784: I3 ^dir L)
  11915. =>WM: (13783: O1968 ^name predict-no)
  11916. =>WM: (13782: O1967 ^name predict-yes)
  11917. =>WM: (13781: R987 ^value 1)
  11918. =>WM: (13780: R1 ^reward R987)
  11919. <=WM: (13771: S1 ^operator O1965 +)
  11920. <=WM: (13772: S1 ^operator O1966 +)
  11921. <=WM: (13773: S1 ^operator O1966)
  11922. <=WM: (13770: I3 ^dir R)
  11923. <=WM: (13766: R1 ^reward R986)
  11924. <=WM: (13769: O1966 ^name predict-no)
  11925. <=WM: (13768: O1965 ^name predict-yes)
  11926. <=WM: (13767: R986 ^value 1)
  11927. --- Inner Elaboration Phase, active level 1 (S1) ---
  11928. Firing prefer*rvt*predict-yes*H0
  11929. -->
  11930. Firing rl*prefer*rvt*predict-yes*H0*1
  11931. -->
  11932. (S1 ^operator O1967 = 0.3804141458478695)
  11933. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11934. -->
  11935. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11936. -->
  11937. (S1 ^operator O1967 = 0.6195629046335391)
  11938. Firing prefer*rvt*predict-no*H0
  11939. -->
  11940. Firing rl*prefer*rvt*predict-no*H0*2
  11941. -->
  11942. (S1 ^operator O1968 = 0.3140233963466647)
  11943. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11944. -->
  11945. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11946. -->
  11947. (S1 ^operator O1968 = -0.2190661556260421)
  11948. inner elaboration loop at bottom goal.
  11949. Retracting rl*prefer*rvt*predict-no*H0*2
  11950. -->
  11951. (S1 ^operator O1966 = 0.3140233963466647)
  11952. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11953. -->
  11954. (S1 ^operator O1966 = -0.2190661556260421)
  11955. Retracting rl*prefer*rvt*predict-yes*H0*1
  11956. -->
  11957. (S1 ^operator O1965 = 0.3804141458478695)
  11958. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11959. -->
  11960. (S1 ^operator O1965 = 0.6195629046335391)
  11961. --- END Proposal Phase ---
  11962. --- Decision Phase ---
  11963. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611913 -0.382052 0.229862(R,m,v=1,0.843931,0.132477)
  11964. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388128 0.382061 0.77019 -> 0.388124 0.38206 0.770184(R,m,v=1,1,0)
  11965. =>WM: (13787: S1 ^operator O1967)
  11966. 984: O: O1967 (predict-yes)
  11967. --- END Decision Phase ---
  11968. --- Application Phase ---
  11969. --- Firing Productions (PE) For State At Depth 1 ---
  11970. --- Inner Elaboration Phase, active level 1 (S1) ---
  11971. Firing apply*operator
  11972. -->
  11973. (I3 ^predict-yes N984 + :O )
  11974. Firing apply*operator*complete
  11975. -->
  11976. (I3 ^predict-no N983 - :O )
  11977. inner elaboration loop at bottom goal.
  11978. --- Change Working Memory (PE) ---
  11979. =>WM: (13788: I3 ^predict-yes N984)
  11980. <=WM: (13775: N983 ^status complete)
  11981. <=WM: (13774: I3 ^predict-no N983)
  11982. --- Firing Productions (IE) For State At Depth 1 ---
  11983. --- Inner Elaboration Phase, active level 1 (S1) ---
  11984. Firing monitor*world
  11985. -->
  11986. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11987. --- Change Working Memory (IE) ---
  11988. --- END Application Phase ---
  11989. --- Output Phase ---
  11990. ENV: Agent did: predict-yes for direction L in state State-B
  11991. In State-B moving L
  11992. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11993. predict error 0
  11994. dir: dir isU
  11995. --- END Output Phase ---
  11996. /|\--- Input Phase ---
  11997. =>WM: (13792: I2 ^dir U)
  11998. =>WM: (13791: I2 ^reward 1)
  11999. =>WM: (13790: I2 ^see 1)
  12000. =>WM: (13789: N984 ^status complete)
  12001. <=WM: (13778: I2 ^dir L)
  12002. <=WM: (13777: I2 ^reward 1)
  12003. <=WM: (13776: I2 ^see 0)
  12004. =>WM: (13793: I2 ^level-1 L1-root)
  12005. <=WM: (13779: I2 ^level-1 R0-root)
  12006. --- END Input Phase ---
  12007. --- Proposal Phase ---
  12008. --- Inner Elaboration Phase, active level 1 (S1) ---
  12009. Firing elaborate*copy-see-to-output-link
  12010. -->
  12011. (I3 ^see 1 +)
  12012. Firing elaborate*reward*based*on*reward
  12013. -->
  12014. (R988 ^value 1 +)
  12015. (R1 ^reward R988 +)
  12016. Firing propose*predict-yes
  12017. -->
  12018. (O1969 ^name predict-yes +)
  12019. (S1 ^operator O1969 +)
  12020. Firing propose*predict-no
  12021. -->
  12022. (O1970 ^name predict-no +)
  12023. (S1 ^operator O1970 +)
  12024. Firing rl*prefer*rvt*predict-no*H0*4
  12025. -->
  12026. (S1 ^operator O1968 = 1.)
  12027. Firing rl*prefer*rvt*predict-yes*H0*3
  12028. -->
  12029. (S1 ^operator O1967 = 0.)
  12030. Firing prefer*rvt*predict-yes*H0
  12031. -->
  12032. Firing prefer*rvt*predict-no*H0
  12033. -->
  12034. Firing elaborate*copy-dir-to-output-link
  12035. -->
  12036. (I3 ^dir U +)
  12037. inner elaboration loop at bottom goal.
  12038. Retracting elaborate*copy-see-to-output-link
  12039. -->
  12040. (I3 ^see 0 +)
  12041. Retracting propose*predict-no
  12042. -->
  12043. (O1968 ^name predict-no +)
  12044. (S1 ^operator O1968 +)
  12045. Retracting propose*predict-yes
  12046. -->
  12047. (O1967 ^name predict-yes +)
  12048. (S1 ^operator O1967 +)
  12049. Retracting elaborate*reward*based*on*reward
  12050. -->
  12051. (R987 ^value 1 +)
  12052. (R1 ^reward R987 +)
  12053. Retracting elaborate*copy-dir-to-output-link
  12054. -->
  12055. (I3 ^dir L +)
  12056. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12057. -->
  12058. (S1 ^operator O1968 = -0.2190661556260421)
  12059. Retracting rl*prefer*rvt*predict-no*H0*2
  12060. -->
  12061. (S1 ^operator O1968 = 0.3140233963466647)
  12062. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12063. -->
  12064. (S1 ^operator O1967 = 0.6195629046335391)
  12065. Retracting rl*prefer*rvt*predict-yes*H0*1
  12066. -->
  12067. (S1 ^operator O1967 = 0.3804141458478695)
  12068. =>WM: (13801: S1 ^operator O1970 +)
  12069. =>WM: (13800: S1 ^operator O1969 +)
  12070. =>WM: (13799: I3 ^dir U)
  12071. =>WM: (13798: O1970 ^name predict-no)
  12072. =>WM: (13797: O1969 ^name predict-yes)
  12073. =>WM: (13796: R988 ^value 1)
  12074. =>WM: (13795: R1 ^reward R988)
  12075. =>WM: (13794: I3 ^see 1)
  12076. <=WM: (13785: S1 ^operator O1967 +)
  12077. <=WM: (13787: S1 ^operator O1967)
  12078. <=WM: (13786: S1 ^operator O1968 +)
  12079. <=WM: (13784: I3 ^dir L)
  12080. <=WM: (13780: R1 ^reward R987)
  12081. <=WM: (13765: I3 ^see 0)
  12082. <=WM: (13783: O1968 ^name predict-no)
  12083. <=WM: (13782: O1967 ^name predict-yes)
  12084. <=WM: (13781: R987 ^value 1)
  12085. --- Inner Elaboration Phase, active level 1 (S1) ---
  12086. Firing prefer*rvt*predict-yes*H0
  12087. -->
  12088. Firing rl*prefer*rvt*predict-yes*H0*3
  12089. -->
  12090. (S1 ^operator O1969 = 0.)
  12091. Firing prefer*rvt*predict-no*H0
  12092. -->
  12093. Firing rl*prefer*rvt*predict-no*H0*4
  12094. -->
  12095. (S1 ^operator O1970 = 1.)
  12096. inner elaboration loop at bottom goal.
  12097. Retracting rl*prefer*rvt*predict-no*H0*4
  12098. -->
  12099. (S1 ^operator O1968 = 1.)
  12100. Retracting rl*prefer*rvt*predict-yes*H0*3
  12101. -->
  12102. (S1 ^operator O1967 = 0.)
  12103. --- END Proposal Phase ---
  12104. --- Decision Phase ---
  12105. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521346 -0.14093 0.380416(R,m,v=1,0.82716,0.143854)
  12106. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478631 0.140932 0.619563 -> 0.478633 0.140932 0.619565(R,m,v=1,1,0)
  12107. =>WM: (13802: S1 ^operator O1970)
  12108. 985: O: O1970 (predict-no)
  12109. --- END Decision Phase ---
  12110. --- Application Phase ---
  12111. --- Firing Productions (PE) For State At Depth 1 ---
  12112. --- Inner Elaboration Phase, active level 1 (S1) ---
  12113. Firing apply*operator
  12114. -->
  12115. (I3 ^predict-no N985 + :O )
  12116. Firing apply*operator*complete
  12117. -->
  12118. (I3 ^predict-yes N984 - :O )
  12119. inner elaboration loop at bottom goal.
  12120. --- Change Working Memory (PE) ---
  12121. =>WM: (13803: I3 ^predict-no N985)
  12122. <=WM: (13789: N984 ^status complete)
  12123. <=WM: (13788: I3 ^predict-yes N984)
  12124. --- Firing Productions (IE) For State At Depth 1 ---
  12125. --- Inner Elaboration Phase, active level 1 (S1) ---
  12126. Firing monitor*world
  12127. -->
  12128. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12129. --- Change Working Memory (IE) ---
  12130. --- END Application Phase ---
  12131. --- Output Phase ---
  12132. ENV: Agent did: predict-no for direction U in state State-A
  12133. In State-A moving U
  12134. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12135. predict error 0
  12136. dir: dir isR
  12137. --- END Output Phase ---
  12138. -/|--- Input Phase ---
  12139. =>WM: (13807: I2 ^dir R)
  12140. =>WM: (13806: I2 ^reward 1)
  12141. =>WM: (13805: I2 ^see 0)
  12142. =>WM: (13804: N985 ^status complete)
  12143. <=WM: (13792: I2 ^dir U)
  12144. <=WM: (13791: I2 ^reward 1)
  12145. <=WM: (13790: I2 ^see 1)
  12146. =>WM: (13808: I2 ^level-1 L1-root)
  12147. <=WM: (13793: I2 ^level-1 L1-root)
  12148. --- END Input Phase ---
  12149. --- Proposal Phase ---
  12150. --- Inner Elaboration Phase, active level 1 (S1) ---
  12151. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12152. -->
  12153. (S1 ^operator O1969 = 0.7063695903698597)
  12154. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12155. -->
  12156. (S1 ^operator O1970 = -0.1937987592593187)
  12157. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12158. -->
  12159. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12160. -->
  12161. Firing elaborate*copy-see-to-output-link
  12162. -->
  12163. (I3 ^see 0 +)
  12164. Firing elaborate*reward*based*on*reward
  12165. -->
  12166. (R989 ^value 1 +)
  12167. (R1 ^reward R989 +)
  12168. Firing propose*predict-yes
  12169. -->
  12170. (O1971 ^name predict-yes +)
  12171. (S1 ^operator O1971 +)
  12172. Firing propose*predict-no
  12173. -->
  12174. (O1972 ^name predict-no +)
  12175. (S1 ^operator O1972 +)
  12176. Firing rl*prefer*rvt*predict-no*H0*6
  12177. -->
  12178. (S1 ^operator O1970 = 0.2298616880335552)
  12179. Firing rl*prefer*rvt*predict-yes*H0*5
  12180. -->
  12181. (S1 ^operator O1969 = 0.2939329791093226)
  12182. Firing prefer*rvt*predict-yes*H0
  12183. -->
  12184. Firing prefer*rvt*predict-no*H0
  12185. -->
  12186. Firing elaborate*copy-dir-to-output-link
  12187. -->
  12188. (I3 ^dir R +)
  12189. inner elaboration loop at bottom goal.
  12190. Retracting elaborate*copy-see-to-output-link
  12191. -->
  12192. (I3 ^see 1 +)
  12193. Retracting propose*predict-no
  12194. -->
  12195. (O1970 ^name predict-no +)
  12196. (S1 ^operator O1970 +)
  12197. Retracting propose*predict-yes
  12198. -->
  12199. (O1969 ^name predict-yes +)
  12200. (S1 ^operator O1969 +)
  12201. Retracting elaborate*reward*based*on*reward
  12202. -->
  12203. (R988 ^value 1 +)
  12204. (R1 ^reward R988 +)
  12205. Retracting elaborate*copy-dir-to-output-link
  12206. -->
  12207. (I3 ^dir U +)
  12208. Retracting rl*prefer*rvt*predict-no*H0*4
  12209. -->
  12210. (S1 ^operator O1970 = 1.)
  12211. Retracting rl*prefer*rvt*predict-yes*H0*3
  12212. -->
  12213. (S1 ^operator O1969 = 0.)
  12214. =>WM: (13816: S1 ^operator O1972 +)
  12215. =>WM: (13815: S1 ^operator O1971 +)
  12216. =>WM: (13814: I3 ^dir R)
  12217. =>WM: (13813: O1972 ^name predict-no)
  12218. =>WM: (13812: O1971 ^name predict-yes)
  12219. =>WM: (13811: R989 ^value 1)
  12220. =>WM: (13810: R1 ^reward R989)
  12221. =>WM: (13809: I3 ^see 0)
  12222. <=WM: (13800: S1 ^operator O1969 +)
  12223. <=WM: (13801: S1 ^operator O1970 +)
  12224. <=WM: (13802: S1 ^operator O1970)
  12225. <=WM: (13799: I3 ^dir U)
  12226. <=WM: (13795: R1 ^reward R988)
  12227. <=WM: (13794: I3 ^see 1)
  12228. <=WM: (13798: O1970 ^name predict-no)
  12229. <=WM: (13797: O1969 ^name predict-yes)
  12230. <=WM: (13796: R988 ^value 1)
  12231. --- Inner Elaboration Phase, active level 1 (S1) ---
  12232. Firing prefer*rvt*predict-yes*H0
  12233. -->
  12234. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12235. -->
  12236. (S1 ^operator O1971 = 0.7063695903698597)
  12237. Firing rl*prefer*rvt*predict-yes*H0*5
  12238. -->
  12239. (S1 ^operator O1971 = 0.2939329791093226)
  12240. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12241. -->
  12242. Firing prefer*rvt*predict-no*H0
  12243. -->
  12244. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12245. -->
  12246. (S1 ^operator O1972 = -0.1937987592593187)
  12247. Firing rl*prefer*rvt*predict-no*H0*6
  12248. -->
  12249. (S1 ^operator O1972 = 0.2298616880335552)
  12250. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12251. -->
  12252. inner elaboration loop at bottom goal.
  12253. Retracting rl*prefer*rvt*predict-no*H0*6
  12254. -->
  12255. (S1 ^operator O1970 = 0.2298616880335552)
  12256. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12257. -->
  12258. (S1 ^operator O1970 = -0.1937987592593187)
  12259. Retracting rl*prefer*rvt*predict-yes*H0*5
  12260. -->
  12261. (S1 ^operator O1969 = 0.2939329791093226)
  12262. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12263. -->
  12264. (S1 ^operator O1969 = 0.7063695903698597)
  12265. --- END Proposal Phase ---
  12266. --- Decision Phase ---
  12267. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12268. =>WM: (13817: S1 ^operator O1971)
  12269. 986: O: O1971 (predict-yes)
  12270. --- END Decision Phase ---
  12271. --- Application Phase ---
  12272. --- Firing Productions (PE) For State At Depth 1 ---
  12273. --- Inner Elaboration Phase, active level 1 (S1) ---
  12274. Firing apply*operator
  12275. -->
  12276. (I3 ^predict-yes N986 + :O )
  12277. Firing apply*operator*complete
  12278. -->
  12279. (I3 ^predict-no N985 - :O )
  12280. inner elaboration loop at bottom goal.
  12281. --- Change Working Memory (PE) ---
  12282. =>WM: (13818: I3 ^predict-yes N986)
  12283. <=WM: (13804: N985 ^status complete)
  12284. <=WM: (13803: I3 ^predict-no N985)
  12285. --- Firing Productions (IE) For State At Depth 1 ---
  12286. --- Inner Elaboration Phase, active level 1 (S1) ---
  12287. Firing monitor*world
  12288. -->
  12289. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12290. --- Change Working Memory (IE) ---
  12291. --- END Application Phase ---
  12292. --- Output Phase ---
  12293. ENV: Agent did: predict-yes for direction R in state State-A
  12294. In State-A moving R
  12295. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12296. predict error 0
  12297. dir: dir isR
  12298. --- END Output Phase ---
  12299. \-/--- Input Phase ---
  12300. =>WM: (13822: I2 ^dir R)
  12301. =>WM: (13821: I2 ^reward 1)
  12302. =>WM: (13820: I2 ^see 1)
  12303. =>WM: (13819: N986 ^status complete)
  12304. <=WM: (13807: I2 ^dir R)
  12305. <=WM: (13806: I2 ^reward 1)
  12306. <=WM: (13805: I2 ^see 0)
  12307. =>WM: (13823: I2 ^level-1 R1-root)
  12308. <=WM: (13808: I2 ^level-1 L1-root)
  12309. --- END Input Phase ---
  12310. --- Proposal Phase ---
  12311. --- Inner Elaboration Phase, active level 1 (S1) ---
  12312. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12313. -->
  12314. (S1 ^operator O1971 = -0.252585164213872)
  12315. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12316. -->
  12317. (S1 ^operator O1972 = 0.7701842386860367)
  12318. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12319. -->
  12320. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12321. -->
  12322. Firing elaborate*copy-see-to-output-link
  12323. -->
  12324. (I3 ^see 1 +)
  12325. Firing elaborate*reward*based*on*reward
  12326. -->
  12327. (R990 ^value 1 +)
  12328. (R1 ^reward R990 +)
  12329. Firing propose*predict-yes
  12330. -->
  12331. (O1973 ^name predict-yes +)
  12332. (S1 ^operator O1973 +)
  12333. Firing propose*predict-no
  12334. -->
  12335. (O1974 ^name predict-no +)
  12336. (S1 ^operator O1974 +)
  12337. Firing rl*prefer*rvt*predict-no*H0*6
  12338. -->
  12339. (S1 ^operator O1972 = 0.2298616880335552)
  12340. Firing rl*prefer*rvt*predict-yes*H0*5
  12341. -->
  12342. (S1 ^operator O1971 = 0.2939329791093226)
  12343. Firing prefer*rvt*predict-yes*H0
  12344. -->
  12345. Firing prefer*rvt*predict-no*H0
  12346. -->
  12347. Firing elaborate*copy-dir-to-output-link
  12348. -->
  12349. (I3 ^dir R +)
  12350. inner elaboration loop at bottom goal.
  12351. Retracting elaborate*copy-see-to-output-link
  12352. -->
  12353. (I3 ^see 0 +)
  12354. Retracting propose*predict-no
  12355. -->
  12356. (O1972 ^name predict-no +)
  12357. (S1 ^operator O1972 +)
  12358. Retracting propose*predict-yes
  12359. -->
  12360. (O1971 ^name predict-yes +)
  12361. (S1 ^operator O1971 +)
  12362. Retracting elaborate*reward*based*on*reward
  12363. -->
  12364. (R989 ^value 1 +)
  12365. (R1 ^reward R989 +)
  12366. Retracting elaborate*copy-dir-to-output-link
  12367. -->
  12368. (I3 ^dir R +)
  12369. Retracting rl*prefer*rvt*predict-no*H0*6
  12370. -->
  12371. (S1 ^operator O1972 = 0.2298616880335552)
  12372. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12373. -->
  12374. (S1 ^operator O1972 = -0.1937987592593187)
  12375. Retracting rl*prefer*rvt*predict-yes*H0*5
  12376. -->
  12377. (S1 ^operator O1971 = 0.2939329791093226)
  12378. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12379. -->
  12380. (S1 ^operator O1971 = 0.7063695903698597)
  12381. =>WM: (13830: S1 ^operator O1974 +)
  12382. =>WM: (13829: S1 ^operator O1973 +)
  12383. =>WM: (13828: O1974 ^name predict-no)
  12384. =>WM: (13827: O1973 ^name predict-yes)
  12385. =>WM: (13826: R990 ^value 1)
  12386. =>WM: (13825: R1 ^reward R990)
  12387. =>WM: (13824: I3 ^see 1)
  12388. <=WM: (13815: S1 ^operator O1971 +)
  12389. <=WM: (13817: S1 ^operator O1971)
  12390. <=WM: (13816: S1 ^operator O1972 +)
  12391. <=WM: (13810: R1 ^reward R989)
  12392. <=WM: (13809: I3 ^see 0)
  12393. <=WM: (13813: O1972 ^name predict-no)
  12394. <=WM: (13812: O1971 ^name predict-yes)
  12395. <=WM: (13811: R989 ^value 1)
  12396. --- Inner Elaboration Phase, active level 1 (S1) ---
  12397. Firing prefer*rvt*predict-yes*H0
  12398. -->
  12399. Firing rl*prefer*rvt*predict-yes*H0*5
  12400. -->
  12401. (S1 ^operator O1973 = 0.2939329791093226)
  12402. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12403. -->
  12404. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12405. -->
  12406. (S1 ^operator O1973 = -0.252585164213872)
  12407. Firing prefer*rvt*predict-no*H0
  12408. -->
  12409. Firing rl*prefer*rvt*predict-no*H0*6
  12410. -->
  12411. (S1 ^operator O1974 = 0.2298616880335552)
  12412. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12413. -->
  12414. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12415. -->
  12416. (S1 ^operator O1974 = 0.7701842386860367)
  12417. inner elaboration loop at bottom goal.
  12418. Retracting rl*prefer*rvt*predict-no*H0*6
  12419. -->
  12420. (S1 ^operator O1972 = 0.2298616880335552)
  12421. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12422. -->
  12423. (S1 ^operator O1972 = 0.7701842386860367)
  12424. Retracting rl*prefer*rvt*predict-yes*H0*5
  12425. -->
  12426. (S1 ^operator O1971 = 0.2939329791093226)
  12427. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12428. -->
  12429. (S1 ^operator O1971 = -0.252585164213872)
  12430. --- END Proposal Phase ---
  12431. --- Decision Phase ---
  12432. RL update rl*prefer*rvt*predict-yes*H0*5 0.501013 -0.20708 0.293933 -> 0.50099 -0.207082 0.293908(R,m,v=1,0.843137,0.133127)
  12433. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499259 0.20711 0.70637 -> 0.499233 0.207107 0.70634(R,m,v=1,1,0)
  12434. =>WM: (13831: S1 ^operator O1974)
  12435. 987: O: O1974 (predict-no)
  12436. --- END Decision Phase ---
  12437. --- Application Phase ---
  12438. --- Firing Productions (PE) For State At Depth 1 ---
  12439. --- Inner Elaboration Phase, active level 1 (S1) ---
  12440. Firing apply*operator
  12441. -->
  12442. (I3 ^predict-no N987 + :O )
  12443. Firing apply*operator*complete
  12444. -->
  12445. (I3 ^predict-yes N986 - :O )
  12446. inner elaboration loop at bottom goal.
  12447. --- Change Working Memory (PE) ---
  12448. =>WM: (13832: I3 ^predict-no N987)
  12449. <=WM: (13819: N986 ^status complete)
  12450. <=WM: (13818: I3 ^predict-yes N986)
  12451. --- Firing Productions (IE) For State At Depth 1 ---
  12452. --- Inner Elaboration Phase, active level 1 (S1) ---
  12453. Firing monitor*world
  12454. -->
  12455. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12456. --- Change Working Memory (IE) ---
  12457. --- END Application Phase ---
  12458. --- Output Phase ---
  12459. ENV: Agent did: predict-no for direction R in state State-B
  12460. In State-B moving R
  12461. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12462. predict error 0
  12463. dir: dir isL
  12464. --- END Output Phase ---
  12465. |\---- Input Phase ---
  12466. =>WM: (13836: I2 ^dir L)
  12467. =>WM: (13835: I2 ^reward 1)
  12468. =>WM: (13834: I2 ^see 0)
  12469. =>WM: (13833: N987 ^status complete)
  12470. <=WM: (13822: I2 ^dir R)
  12471. <=WM: (13821: I2 ^reward 1)
  12472. <=WM: (13820: I2 ^see 1)
  12473. =>WM: (13837: I2 ^level-1 R0-root)
  12474. <=WM: (13823: I2 ^level-1 R1-root)
  12475. --- END Input Phase ---
  12476. --- Proposal Phase ---
  12477. --- Inner Elaboration Phase, active level 1 (S1) ---
  12478. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12479. -->
  12480. (S1 ^operator O1973 = 0.6195651222408995)
  12481. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12482. -->
  12483. (S1 ^operator O1974 = -0.2190661556260421)
  12484. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12485. -->
  12486. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12487. -->
  12488. Firing elaborate*copy-see-to-output-link
  12489. -->
  12490. (I3 ^see 0 +)
  12491. Firing elaborate*reward*based*on*reward
  12492. -->
  12493. (R991 ^value 1 +)
  12494. (R1 ^reward R991 +)
  12495. Firing propose*predict-yes
  12496. -->
  12497. (O1975 ^name predict-yes +)
  12498. (S1 ^operator O1975 +)
  12499. Firing propose*predict-no
  12500. -->
  12501. (O1976 ^name predict-no +)
  12502. (S1 ^operator O1976 +)
  12503. Firing rl*prefer*rvt*predict-no*H0*2
  12504. -->
  12505. (S1 ^operator O1974 = 0.3140233963466647)
  12506. Firing rl*prefer*rvt*predict-yes*H0*1
  12507. -->
  12508. (S1 ^operator O1973 = 0.3804160307887663)
  12509. Firing prefer*rvt*predict-yes*H0
  12510. -->
  12511. Firing prefer*rvt*predict-no*H0
  12512. -->
  12513. Firing elaborate*copy-dir-to-output-link
  12514. -->
  12515. (I3 ^dir L +)
  12516. inner elaboration loop at bottom goal.
  12517. Retracting elaborate*copy-see-to-output-link
  12518. -->
  12519. (I3 ^see 1 +)
  12520. Retracting propose*predict-no
  12521. -->
  12522. (O1974 ^name predict-no +)
  12523. (S1 ^operator O1974 +)
  12524. Retracting propose*predict-yes
  12525. -->
  12526. (O1973 ^name predict-yes +)
  12527. (S1 ^operator O1973 +)
  12528. Retracting elaborate*reward*based*on*reward
  12529. -->
  12530. (R990 ^value 1 +)
  12531. (R1 ^reward R990 +)
  12532. Retracting elaborate*copy-dir-to-output-link
  12533. -->
  12534. (I3 ^dir R +)
  12535. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12536. -->
  12537. (S1 ^operator O1974 = 0.7701842386860367)
  12538. Retracting rl*prefer*rvt*predict-no*H0*6
  12539. -->
  12540. (S1 ^operator O1974 = 0.2298616880335552)
  12541. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12542. -->
  12543. (S1 ^operator O1973 = -0.252585164213872)
  12544. Retracting rl*prefer*rvt*predict-yes*H0*5
  12545. -->
  12546. (S1 ^operator O1973 = 0.2939078922513593)
  12547. =>WM: (13845: S1 ^operator O1976 +)
  12548. =>WM: (13844: S1 ^operator O1975 +)
  12549. =>WM: (13843: I3 ^dir L)
  12550. =>WM: (13842: O1976 ^name predict-no)
  12551. =>WM: (13841: O1975 ^name predict-yes)
  12552. =>WM: (13840: R991 ^value 1)
  12553. =>WM: (13839: R1 ^reward R991)
  12554. =>WM: (13838: I3 ^see 0)
  12555. <=WM: (13829: S1 ^operator O1973 +)
  12556. <=WM: (13830: S1 ^operator O1974 +)
  12557. <=WM: (13831: S1 ^operator O1974)
  12558. <=WM: (13814: I3 ^dir R)
  12559. <=WM: (13825: R1 ^reward R990)
  12560. <=WM: (13824: I3 ^see 1)
  12561. <=WM: (13828: O1974 ^name predict-no)
  12562. <=WM: (13827: O1973 ^name predict-yes)
  12563. <=WM: (13826: R990 ^value 1)
  12564. --- Inner Elaboration Phase, active level 1 (S1) ---
  12565. Firing prefer*rvt*predict-yes*H0
  12566. -->
  12567. Firing rl*prefer*rvt*predict-yes*H0*1
  12568. -->
  12569. (S1 ^operator O1975 = 0.3804160307887663)
  12570. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12571. -->
  12572. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12573. -->
  12574. (S1 ^operator O1975 = 0.6195651222408995)
  12575. Firing prefer*rvt*predict-no*H0
  12576. -->
  12577. Firing rl*prefer*rvt*predict-no*H0*2
  12578. -->
  12579. (S1 ^operator O1976 = 0.3140233963466647)
  12580. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12581. -->
  12582. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12583. -->
  12584. (S1 ^operator O1976 = -0.2190661556260421)
  12585. inner elaboration loop at bottom goal.
  12586. Retracting rl*prefer*rvt*predict-no*H0*2
  12587. -->
  12588. (S1 ^operator O1974 = 0.3140233963466647)
  12589. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12590. -->
  12591. (S1 ^operator O1974 = -0.2190661556260421)
  12592. Retracting rl*prefer*rvt*predict-yes*H0*1
  12593. -->
  12594. (S1 ^operator O1973 = 0.3804160307887663)
  12595. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12596. -->
  12597. (S1 ^operator O1973 = 0.6195651222408995)
  12598. --- END Proposal Phase ---
  12599. --- Decision Phase ---
  12600. RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229862 -> 0.61191 -0.382052 0.229858(R,m,v=1,0.844828,0.131852)
  12601. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388124 0.38206 0.770184 -> 0.38812 0.38206 0.77018(R,m,v=1,1,0)
  12602. =>WM: (13846: S1 ^operator O1975)
  12603. 988: O: O1975 (predict-yes)
  12604. --- END Decision Phase ---
  12605. --- Application Phase ---
  12606. --- Firing Productions (PE) For State At Depth 1 ---
  12607. --- Inner Elaboration Phase, active level 1 (S1) ---
  12608. Firing apply*operator
  12609. -->
  12610. (I3 ^predict-yes N988 + :O )
  12611. Firing apply*operator*complete
  12612. -->
  12613. (I3 ^predict-no N987 - :O )
  12614. inner elaboration loop at bottom goal.
  12615. --- Change Working Memory (PE) ---
  12616. =>WM: (13847: I3 ^predict-yes N988)
  12617. <=WM: (13833: N987 ^status complete)
  12618. <=WM: (13832: I3 ^predict-no N987)
  12619. --- Firing Productions (IE) For State At Depth 1 ---
  12620. --- Inner Elaboration Phase, active level 1 (S1) ---
  12621. Firing monitor*world
  12622. -->
  12623. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12624. --- Change Working Memory (IE) ---
  12625. --- END Application Phase ---
  12626. --- Output Phase ---
  12627. ENV: Agent did: predict-yes for direction L in state State-B
  12628. In State-B moving L
  12629. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12630. predict error 0
  12631. dir: dir isU
  12632. --- END Output Phase ---
  12633. /|\--- Input Phase ---
  12634. =>WM: (13851: I2 ^dir U)
  12635. =>WM: (13850: I2 ^reward 1)
  12636. =>WM: (13849: I2 ^see 1)
  12637. =>WM: (13848: N988 ^status complete)
  12638. <=WM: (13836: I2 ^dir L)
  12639. <=WM: (13835: I2 ^reward 1)
  12640. <=WM: (13834: I2 ^see 0)
  12641. =>WM: (13852: I2 ^level-1 L1-root)
  12642. <=WM: (13837: I2 ^level-1 R0-root)
  12643. --- END Input Phase ---
  12644. --- Proposal Phase ---
  12645. --- Inner Elaboration Phase, active level 1 (S1) ---
  12646. Firing elaborate*copy-see-to-output-link
  12647. -->
  12648. (I3 ^see 1 +)
  12649. Firing elaborate*reward*based*on*reward
  12650. -->
  12651. (R992 ^value 1 +)
  12652. (R1 ^reward R992 +)
  12653. Firing propose*predict-yes
  12654. -->
  12655. (O1977 ^name predict-yes +)
  12656. (S1 ^operator O1977 +)
  12657. Firing propose*predict-no
  12658. -->
  12659. (O1978 ^name predict-no +)
  12660. (S1 ^operator O1978 +)
  12661. Firing rl*prefer*rvt*predict-no*H0*4
  12662. -->
  12663. (S1 ^operator O1976 = 1.)
  12664. Firing rl*prefer*rvt*predict-yes*H0*3
  12665. -->
  12666. (S1 ^operator O1975 = 0.)
  12667. Firing prefer*rvt*predict-yes*H0
  12668. -->
  12669. Firing prefer*rvt*predict-no*H0
  12670. -->
  12671. Firing elaborate*copy-dir-to-output-link
  12672. -->
  12673. (I3 ^dir U +)
  12674. inner elaboration loop at bottom goal.
  12675. Retracting elaborate*copy-see-to-output-link
  12676. -->
  12677. (I3 ^see 0 +)
  12678. Retracting propose*predict-no
  12679. -->
  12680. (O1976 ^name predict-no +)
  12681. (S1 ^operator O1976 +)
  12682. Retracting propose*predict-yes
  12683. -->
  12684. (O1975 ^name predict-yes +)
  12685. (S1 ^operator O1975 +)
  12686. Retracting elaborate*reward*based*on*reward
  12687. -->
  12688. (R991 ^value 1 +)
  12689. (R1 ^reward R991 +)
  12690. Retracting elaborate*copy-dir-to-output-link
  12691. -->
  12692. (I3 ^dir L +)
  12693. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12694. -->
  12695. (S1 ^operator O1976 = -0.2190661556260421)
  12696. Retracting rl*prefer*rvt*predict-no*H0*2
  12697. -->
  12698. (S1 ^operator O1976 = 0.3140233963466647)
  12699. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12700. -->
  12701. (S1 ^operator O1975 = 0.6195651222408995)
  12702. Retracting rl*prefer*rvt*predict-yes*H0*1
  12703. -->
  12704. (S1 ^operator O1975 = 0.3804160307887663)
  12705. =>WM: (13860: S1 ^operator O1978 +)
  12706. =>WM: (13859: S1 ^operator O1977 +)
  12707. =>WM: (13858: I3 ^dir U)
  12708. =>WM: (13857: O1978 ^name predict-no)
  12709. =>WM: (13856: O1977 ^name predict-yes)
  12710. =>WM: (13855: R992 ^value 1)
  12711. =>WM: (13854: R1 ^reward R992)
  12712. =>WM: (13853: I3 ^see 1)
  12713. <=WM: (13844: S1 ^operator O1975 +)
  12714. <=WM: (13846: S1 ^operator O1975)
  12715. <=WM: (13845: S1 ^operator O1976 +)
  12716. <=WM: (13843: I3 ^dir L)
  12717. <=WM: (13839: R1 ^reward R991)
  12718. <=WM: (13838: I3 ^see 0)
  12719. <=WM: (13842: O1976 ^name predict-no)
  12720. <=WM: (13841: O1975 ^name predict-yes)
  12721. <=WM: (13840: R991 ^value 1)
  12722. --- Inner Elaboration Phase, active level 1 (S1) ---
  12723. Firing prefer*rvt*predict-yes*H0
  12724. -->
  12725. Firing rl*prefer*rvt*predict-yes*H0*3
  12726. -->
  12727. (S1 ^operator O1977 = 0.)
  12728. Firing prefer*rvt*predict-no*H0
  12729. -->
  12730. Firing rl*prefer*rvt*predict-no*H0*4
  12731. -->
  12732. (S1 ^operator O1978 = 1.)
  12733. inner elaboration loop at bottom goal.
  12734. Retracting rl*prefer*rvt*predict-no*H0*4
  12735. -->
  12736. (S1 ^operator O1976 = 1.)
  12737. Retracting rl*prefer*rvt*predict-yes*H0*3
  12738. -->
  12739. (S1 ^operator O1975 = 0.)
  12740. --- END Proposal Phase ---
  12741. --- Decision Phase ---
  12742. RL update rl*prefer*rvt*predict-yes*H0*1 0.521346 -0.14093 0.380416 -> 0.521348 -0.14093 0.380418(R,m,v=1,0.828221,0.143149)
  12743. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478633 0.140932 0.619565 -> 0.478635 0.140932 0.619567(R,m,v=1,1,0)
  12744. =>WM: (13861: S1 ^operator O1978)
  12745. 989: O: O1978 (predict-no)
  12746. --- END Decision Phase ---
  12747. --- Application Phase ---
  12748. --- Firing Productions (PE) For State At Depth 1 ---
  12749. --- Inner Elaboration Phase, active level 1 (S1) ---
  12750. Firing apply*operator
  12751. -->
  12752. (I3 ^predict-no N989 + :O )
  12753. Firing apply*operator*complete
  12754. -->
  12755. (I3 ^predict-yes N988 - :O )
  12756. inner elaboration loop at bottom goal.
  12757. --- Change Working Memory (PE) ---
  12758. =>WM: (13862: I3 ^predict-no N989)
  12759. <=WM: (13848: N988 ^status complete)
  12760. <=WM: (13847: I3 ^predict-yes N988)
  12761. --- Firing Productions (IE) For State At Depth 1 ---
  12762. --- Inner Elaboration Phase, active level 1 (S1) ---
  12763. Firing monitor*world
  12764. -->
  12765. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12766. --- Change Working Memory (IE) ---
  12767. --- END Application Phase ---
  12768. --- Output Phase ---
  12769. ENV: Agent did: predict-no for direction U in state State-A
  12770. In State-A moving U
  12771. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12772. predict error 0
  12773. dir: dir isR
  12774. --- END Output Phase ---
  12775. -/|--- Input Phase ---
  12776. =>WM: (13866: I2 ^dir R)
  12777. =>WM: (13865: I2 ^reward 1)
  12778. =>WM: (13864: I2 ^see 0)
  12779. =>WM: (13863: N989 ^status complete)
  12780. <=WM: (13851: I2 ^dir U)
  12781. <=WM: (13850: I2 ^reward 1)
  12782. <=WM: (13849: I2 ^see 1)
  12783. =>WM: (13867: I2 ^level-1 L1-root)
  12784. <=WM: (13852: I2 ^level-1 L1-root)
  12785. --- END Input Phase ---
  12786. --- Proposal Phase ---
  12787. --- Inner Elaboration Phase, active level 1 (S1) ---
  12788. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12789. -->
  12790. (S1 ^operator O1977 = 0.7063401754803731)
  12791. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12792. -->
  12793. (S1 ^operator O1978 = -0.1937987592593187)
  12794. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12795. -->
  12796. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12797. -->
  12798. Firing elaborate*copy-see-to-output-link
  12799. -->
  12800. (I3 ^see 0 +)
  12801. Firing elaborate*reward*based*on*reward
  12802. -->
  12803. (R993 ^value 1 +)
  12804. (R1 ^reward R993 +)
  12805. Firing propose*predict-yes
  12806. -->
  12807. (O1979 ^name predict-yes +)
  12808. (S1 ^operator O1979 +)
  12809. Firing propose*predict-no
  12810. -->
  12811. (O1980 ^name predict-no +)
  12812. (S1 ^operator O1980 +)
  12813. Firing rl*prefer*rvt*predict-no*H0*6
  12814. -->
  12815. (S1 ^operator O1978 = 0.2298579596436188)
  12816. Firing rl*prefer*rvt*predict-yes*H0*5
  12817. -->
  12818. (S1 ^operator O1977 = 0.2939078922513593)
  12819. Firing prefer*rvt*predict-yes*H0
  12820. -->
  12821. Firing prefer*rvt*predict-no*H0
  12822. -->
  12823. Firing elaborate*copy-dir-to-output-link
  12824. -->
  12825. (I3 ^dir R +)
  12826. inner elaboration loop at bottom goal.
  12827. Retracting elaborate*copy-see-to-output-link
  12828. -->
  12829. (I3 ^see 1 +)
  12830. Retracting propose*predict-no
  12831. -->
  12832. (O1978 ^name predict-no +)
  12833. (S1 ^operator O1978 +)
  12834. Retracting propose*predict-yes
  12835. -->
  12836. (O1977 ^name predict-yes +)
  12837. (S1 ^operator O1977 +)
  12838. Retracting elaborate*reward*based*on*reward
  12839. -->
  12840. (R992 ^value 1 +)
  12841. (R1 ^reward R992 +)
  12842. Retracting elaborate*copy-dir-to-output-link
  12843. -->
  12844. (I3 ^dir U +)
  12845. Retracting rl*prefer*rvt*predict-no*H0*4
  12846. -->
  12847. (S1 ^operator O1978 = 1.)
  12848. Retracting rl*prefer*rvt*predict-yes*H0*3
  12849. -->
  12850. (S1 ^operator O1977 = 0.)
  12851. =>WM: (13875: S1 ^operator O1980 +)
  12852. =>WM: (13874: S1 ^operator O1979 +)
  12853. =>WM: (13873: I3 ^dir R)
  12854. =>WM: (13872: O1980 ^name predict-no)
  12855. =>WM: (13871: O1979 ^name predict-yes)
  12856. =>WM: (13870: R993 ^value 1)
  12857. =>WM: (13869: R1 ^reward R993)
  12858. =>WM: (13868: I3 ^see 0)
  12859. <=WM: (13859: S1 ^operator O1977 +)
  12860. <=WM: (13860: S1 ^operator O1978 +)
  12861. <=WM: (13861: S1 ^operator O1978)
  12862. <=WM: (13858: I3 ^dir U)
  12863. <=WM: (13854: R1 ^reward R992)
  12864. <=WM: (13853: I3 ^see 1)
  12865. <=WM: (13857: O1978 ^name predict-no)
  12866. <=WM: (13856: O1977 ^name predict-yes)
  12867. <=WM: (13855: R992 ^value 1)
  12868. --- Inner Elaboration Phase, active level 1 (S1) ---
  12869. Firing prefer*rvt*predict-yes*H0
  12870. -->
  12871. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12872. -->
  12873. (S1 ^operator O1979 = 0.7063401754803731)
  12874. Firing rl*prefer*rvt*predict-yes*H0*5
  12875. -->
  12876. (S1 ^operator O1979 = 0.2939078922513593)
  12877. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12878. -->
  12879. Firing prefer*rvt*predict-no*H0
  12880. -->
  12881. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12882. -->
  12883. (S1 ^operator O1980 = -0.1937987592593187)
  12884. Firing rl*prefer*rvt*predict-no*H0*6
  12885. -->
  12886. (S1 ^operator O1980 = 0.2298579596436188)
  12887. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12888. -->
  12889. inner elaboration loop at bottom goal.
  12890. Retracting rl*prefer*rvt*predict-no*H0*6
  12891. -->
  12892. (S1 ^operator O1978 = 0.2298579596436188)
  12893. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12894. -->
  12895. (S1 ^operator O1978 = -0.1937987592593187)
  12896. Retracting rl*prefer*rvt*predict-yes*H0*5
  12897. -->
  12898. (S1 ^operator O1977 = 0.2939078922513593)
  12899. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12900. -->
  12901. (S1 ^operator O1977 = 0.7063401754803731)
  12902. --- END Proposal Phase ---
  12903. --- Decision Phase ---
  12904. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12905. =>WM: (13876: S1 ^operator O1979)
  12906. 990: O: O1979 (predict-yes)
  12907. --- END Decision Phase ---
  12908. --- Application Phase ---
  12909. --- Firing Productions (PE) For State At Depth 1 ---
  12910. --- Inner Elaboration Phase, active level 1 (S1) ---
  12911. Firing apply*operator
  12912. -->
  12913. (I3 ^predict-yes N990 + :O )
  12914. Firing apply*operator*complete
  12915. -->
  12916. (I3 ^predict-no N989 - :O )
  12917. inner elaboration loop at bottom goal.
  12918. --- Change Working Memory (PE) ---
  12919. =>WM: (13877: I3 ^predict-yes N990)
  12920. <=WM: (13863: N989 ^status complete)
  12921. <=WM: (13862: I3 ^predict-no N989)
  12922. --- Firing Productions (IE) For State At Depth 1 ---
  12923. --- Inner Elaboration Phase, active level 1 (S1) ---
  12924. Firing monitor*world
  12925. -->
  12926. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12927. --- Change Working Memory (IE) ---
  12928. --- END Application Phase ---
  12929. --- Output Phase ---
  12930. ENV: Agent did: predict-yes for direction R in state State-A
  12931. In State-A moving R
  12932. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12933. predict error 0
  12934. dir: dir isU
  12935. --- END Output Phase ---
  12936. \-/--- Input Phase ---
  12937. =>WM: (13881: I2 ^dir U)
  12938. =>WM: (13880: I2 ^reward 1)
  12939. =>WM: (13879: I2 ^see 1)
  12940. =>WM: (13878: N990 ^status complete)
  12941. <=WM: (13866: I2 ^dir R)
  12942. <=WM: (13865: I2 ^reward 1)
  12943. <=WM: (13864: I2 ^see 0)
  12944. =>WM: (13882: I2 ^level-1 R1-root)
  12945. <=WM: (13867: I2 ^level-1 L1-root)
  12946. --- END Input Phase ---
  12947. --- Proposal Phase ---
  12948. --- Inner Elaboration Phase, active level 1 (S1) ---
  12949. Firing elaborate*copy-see-to-output-link
  12950. -->
  12951. (I3 ^see 1 +)
  12952. Firing elaborate*reward*based*on*reward
  12953. -->
  12954. (R994 ^value 1 +)
  12955. (R1 ^reward R994 +)
  12956. Firing propose*predict-yes
  12957. -->
  12958. (O1981 ^name predict-yes +)
  12959. (S1 ^operator O1981 +)
  12960. Firing propose*predict-no
  12961. -->
  12962. (O1982 ^name predict-no +)
  12963. (S1 ^operator O1982 +)
  12964. Firing rl*prefer*rvt*predict-no*H0*4
  12965. -->
  12966. (S1 ^operator O1980 = 1.)
  12967. Firing rl*prefer*rvt*predict-yes*H0*3
  12968. -->
  12969. (S1 ^operator O1979 = 0.)
  12970. Firing prefer*rvt*predict-yes*H0
  12971. -->
  12972. Firing prefer*rvt*predict-no*H0
  12973. -->
  12974. Firing elaborate*copy-dir-to-output-link
  12975. -->
  12976. (I3 ^dir U +)
  12977. inner elaboration loop at bottom goal.
  12978. Retracting elaborate*copy-see-to-output-link
  12979. -->
  12980. (I3 ^see 0 +)
  12981. Retracting propose*predict-no
  12982. -->
  12983. (O1980 ^name predict-no +)
  12984. (S1 ^operator O1980 +)
  12985. Retracting propose*predict-yes
  12986. -->
  12987. (O1979 ^name predict-yes +)
  12988. (S1 ^operator O1979 +)
  12989. Retracting elaborate*reward*based*on*reward
  12990. -->
  12991. (R993 ^value 1 +)
  12992. (R1 ^reward R993 +)
  12993. Retracting elaborate*copy-dir-to-output-link
  12994. -->
  12995. (I3 ^dir R +)
  12996. Retracting rl*prefer*rvt*predict-no*H0*6
  12997. -->
  12998. (S1 ^operator O1980 = 0.2298579596436188)
  12999. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13000. -->
  13001. (S1 ^operator O1980 = -0.1937987592593187)
  13002. Retracting rl*prefer*rvt*predict-yes*H0*5
  13003. -->
  13004. (S1 ^operator O1979 = 0.2939078922513593)
  13005. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13006. -->
  13007. (S1 ^operator O1979 = 0.7063401754803731)
  13008. =>WM: (13890: S1 ^operator O1982 +)
  13009. =>WM: (13889: S1 ^operator O1981 +)
  13010. =>WM: (13888: I3 ^dir U)
  13011. =>WM: (13887: O1982 ^name predict-no)
  13012. =>WM: (13886: O1981 ^name predict-yes)
  13013. =>WM: (13885: R994 ^value 1)
  13014. =>WM: (13884: R1 ^reward R994)
  13015. =>WM: (13883: I3 ^see 1)
  13016. <=WM: (13874: S1 ^operator O1979 +)
  13017. <=WM: (13876: S1 ^operator O1979)
  13018. <=WM: (13875: S1 ^operator O1980 +)
  13019. <=WM: (13873: I3 ^dir R)
  13020. <=WM: (13869: R1 ^reward R993)
  13021. <=WM: (13868: I3 ^see 0)
  13022. <=WM: (13872: O1980 ^name predict-no)
  13023. <=WM: (13871: O1979 ^name predict-yes)
  13024. <=WM: (13870: R993 ^value 1)
  13025. --- Inner Elaboration Phase, active level 1 (S1) ---
  13026. Firing prefer*rvt*predict-yes*H0
  13027. -->
  13028. Firing rl*prefer*rvt*predict-yes*H0*3
  13029. -->
  13030. (S1 ^operator O1981 = 0.)
  13031. Firing prefer*rvt*predict-no*H0
  13032. -->
  13033. Firing rl*prefer*rvt*predict-no*H0*4
  13034. -->
  13035. (S1 ^operator O1982 = 1.)
  13036. inner elaboration loop at bottom goal.
  13037. Retracting rl*prefer*rvt*predict-no*H0*4
  13038. -->
  13039. (S1 ^operator O1980 = 1.)
  13040. Retracting rl*prefer*rvt*predict-yes*H0*3
  13041. -->
  13042. (S1 ^operator O1979 = 0.)
  13043. --- END Proposal Phase ---
  13044. --- Decision Phase ---
  13045. RL update rl*prefer*rvt*predict-yes*H0*5 0.50099 -0.207082 0.293908 -> 0.500972 -0.207084 0.293887(R,m,v=1,0.844156,0.132417)
  13046. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499233 0.207107 0.70634 -> 0.499211 0.207105 0.706316(R,m,v=1,1,0)
  13047. =>WM: (13891: S1 ^operator O1982)
  13048. 991: O: O1982 (predict-no)
  13049. --- END Decision Phase ---
  13050. --- Application Phase ---
  13051. --- Firing Productions (PE) For State At Depth 1 ---
  13052. --- Inner Elaboration Phase, active level 1 (S1) ---
  13053. Firing apply*operator
  13054. -->
  13055. (I3 ^predict-no N991 + :O )
  13056. Firing apply*operator*complete
  13057. -->
  13058. (I3 ^predict-yes N990 - :O )
  13059. inner elaboration loop at bottom goal.
  13060. --- Change Working Memory (PE) ---
  13061. =>WM: (13892: I3 ^predict-no N991)
  13062. <=WM: (13878: N990 ^status complete)
  13063. <=WM: (13877: I3 ^predict-yes N990)
  13064. --- Firing Productions (IE) For State At Depth 1 ---
  13065. --- Inner Elaboration Phase, active level 1 (S1) ---
  13066. Firing monitor*world
  13067. -->
  13068. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13069. --- Change Working Memory (IE) ---
  13070. --- END Application Phase ---
  13071. --- Output Phase ---
  13072. ENV: Agent did: predict-no for direction U in state State-B
  13073. In State-B moving U
  13074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13075. predict error 0
  13076. dir: dir isU
  13077. --- END Output Phase ---
  13078. |--- Input Phase ---
  13079. =>WM: (13896: I2 ^dir U)
  13080. =>WM: (13895: I2 ^reward 1)
  13081. =>WM: (13894: I2 ^see 0)
  13082. =>WM: (13893: N991 ^status complete)
  13083. <=WM: (13881: I2 ^dir U)
  13084. <=WM: (13880: I2 ^reward 1)
  13085. <=WM: (13879: I2 ^see 1)
  13086. =>WM: (13897: I2 ^level-1 R1-root)
  13087. <=WM: (13882: I2 ^level-1 R1-root)
  13088. --- END Input Phase ---
  13089. --- Proposal Phase ---
  13090. --- Inner Elaboration Phase, active level 1 (S1) ---
  13091. Firing elaborate*copy-see-to-output-link
  13092. -->
  13093. (I3 ^see 0 +)
  13094. Firing elaborate*reward*based*on*reward
  13095. -->
  13096. (R995 ^value 1 +)
  13097. (R1 ^reward R995 +)
  13098. Firing propose*predict-yes
  13099. -->
  13100. (O1983 ^name predict-yes +)
  13101. (S1 ^operator O1983 +)
  13102. Firing propose*predict-no
  13103. -->
  13104. (O1984 ^name predict-no +)
  13105. (S1 ^operator O1984 +)
  13106. Firing rl*prefer*rvt*predict-no*H0*4
  13107. -->
  13108. (S1 ^operator O1982 = 1.)
  13109. Firing rl*prefer*rvt*predict-yes*H0*3
  13110. -->
  13111. (S1 ^operator O1981 = 0.)
  13112. Firing prefer*rvt*predict-yes*H0
  13113. -->
  13114. Firing prefer*rvt*predict-no*H0
  13115. -->
  13116. Firing elaborate*copy-dir-to-output-link
  13117. -->
  13118. (I3 ^dir U +)
  13119. inner elaboration loop at bottom goal.
  13120. Retracting elaborate*copy-see-to-output-link
  13121. -->
  13122. (I3 ^see 1 +)
  13123. Retracting propose*predict-no
  13124. -->
  13125. (O1982 ^name predict-no +)
  13126. (S1 ^operator O1982 +)
  13127. Retracting propose*predict-yes
  13128. -->
  13129. (O1981 ^name predict-yes +)
  13130. (S1 ^operator O1981 +)
  13131. Retracting elaborate*reward*based*on*reward
  13132. -->
  13133. (R994 ^value 1 +)
  13134. (R1 ^reward R994 +)
  13135. Retracting elaborate*copy-dir-to-output-link
  13136. -->
  13137. (I3 ^dir U +)
  13138. Retracting rl*prefer*rvt*predict-no*H0*4
  13139. -->
  13140. (S1 ^operator O1982 = 1.)
  13141. Retracting rl*prefer*rvt*predict-yes*H0*3
  13142. -->
  13143. (S1 ^operator O1981 = 0.)
  13144. =>WM: (13904: S1 ^operator O1984 +)
  13145. =>WM: (13903: S1 ^operator O1983 +)
  13146. =>WM: (13902: O1984 ^name predict-no)
  13147. =>WM: (13901: O1983 ^name predict-yes)
  13148. =>WM: (13900: R995 ^value 1)
  13149. =>WM: (13899: R1 ^reward R995)
  13150. =>WM: (13898: I3 ^see 0)
  13151. <=WM: (13889: S1 ^operator O1981 +)
  13152. <=WM: (13890: S1 ^operator O1982 +)
  13153. <=WM: (13891: S1 ^operator O1982)
  13154. <=WM: (13884: R1 ^reward R994)
  13155. <=WM: (13883: I3 ^see 1)
  13156. <=WM: (13887: O1982 ^name predict-no)
  13157. <=WM: (13886: O1981 ^name predict-yes)
  13158. <=WM: (13885: R994 ^value 1)
  13159. --- Inner Elaboration Phase, active level 1 (S1) ---
  13160. Firing prefer*rvt*predict-yes*H0
  13161. -->
  13162. Firing rl*prefer*rvt*predict-yes*H0*3
  13163. -->
  13164. (S1 ^operator O1983 = 0.)
  13165. Firing prefer*rvt*predict-no*H0
  13166. -->
  13167. Firing rl*prefer*rvt*predict-no*H0*4
  13168. -->
  13169. (S1 ^operator O1984 = 1.)
  13170. inner elaboration loop at bottom goal.
  13171. Retracting rl*prefer*rvt*predict-no*H0*4
  13172. -->
  13173. (S1 ^operator O1982 = 1.)
  13174. Retracting rl*prefer*rvt*predict-yes*H0*3
  13175. -->
  13176. (S1 ^operator O1981 = 0.)
  13177. --- END Proposal Phase ---
  13178. --- Decision Phase ---
  13179. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13180. =>WM: (13905: S1 ^operator O1984)
  13181. 992: O: O1984 (predict-no)
  13182. --- END Decision Phase ---
  13183. --- Application Phase ---
  13184. --- Firing Productions (PE) For State At Depth 1 ---
  13185. --- Inner Elaboration Phase, active level 1 (S1) ---
  13186. Firing apply*operator
  13187. -->
  13188. (I3 ^predict-no N992 + :O )
  13189. Firing apply*operator*complete
  13190. -->
  13191. (I3 ^predict-no N991 - :O )
  13192. inner elaboration loop at bottom goal.
  13193. --- Change Working Memory (PE) ---
  13194. =>WM: (13906: I3 ^predict-no N992)
  13195. <=WM: (13893: N991 ^status complete)
  13196. <=WM: (13892: I3 ^predict-no N991)
  13197. --- Firing Productions (IE) For State At Depth 1 ---
  13198. --- Inner Elaboration Phase, active level 1 (S1) ---
  13199. Firing monitor*world
  13200. -->
  13201. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13202. --- Change Working Memory (IE) ---
  13203. --- END Application Phase ---
  13204. --- Output Phase ---
  13205. ENV: Agent did: predict-no for direction U in state State-B
  13206. In State-B moving U
  13207. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13208. predict error 0
  13209. dir: dir isL
  13210. --- END Output Phase ---
  13211. \-/--- Input Phase ---
  13212. =>WM: (13910: I2 ^dir L)
  13213. =>WM: (13909: I2 ^reward 1)
  13214. =>WM: (13908: I2 ^see 0)
  13215. =>WM: (13907: N992 ^status complete)
  13216. <=WM: (13896: I2 ^dir U)
  13217. <=WM: (13895: I2 ^reward 1)
  13218. <=WM: (13894: I2 ^see 0)
  13219. =>WM: (13911: I2 ^level-1 R1-root)
  13220. <=WM: (13897: I2 ^level-1 R1-root)
  13221. --- END Input Phase ---
  13222. --- Proposal Phase ---
  13223. --- Inner Elaboration Phase, active level 1 (S1) ---
  13224. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13225. -->
  13226. (S1 ^operator O1983 = 0.6196129817664832)
  13227. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13228. -->
  13229. (S1 ^operator O1984 = -0.1479504104026684)
  13230. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13231. -->
  13232. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13233. -->
  13234. Firing elaborate*copy-see-to-output-link
  13235. -->
  13236. (I3 ^see 0 +)
  13237. Firing elaborate*reward*based*on*reward
  13238. -->
  13239. (R996 ^value 1 +)
  13240. (R1 ^reward R996 +)
  13241. Firing propose*predict-yes
  13242. -->
  13243. (O1985 ^name predict-yes +)
  13244. (S1 ^operator O1985 +)
  13245. Firing propose*predict-no
  13246. -->
  13247. (O1986 ^name predict-no +)
  13248. (S1 ^operator O1986 +)
  13249. Firing rl*prefer*rvt*predict-no*H0*2
  13250. -->
  13251. (S1 ^operator O1984 = 0.3140233963466647)
  13252. Firing rl*prefer*rvt*predict-yes*H0*1
  13253. -->
  13254. (S1 ^operator O1983 = 0.380417577206794)
  13255. Firing prefer*rvt*predict-yes*H0
  13256. -->
  13257. Firing prefer*rvt*predict-no*H0
  13258. -->
  13259. Firing elaborate*copy-dir-to-output-link
  13260. -->
  13261. (I3 ^dir L +)
  13262. inner elaboration loop at bottom goal.
  13263. Retracting elaborate*copy-see-to-output-link
  13264. -->
  13265. (I3 ^see 0 +)
  13266. Retracting propose*predict-no
  13267. -->
  13268. (O1984 ^name predict-no +)
  13269. (S1 ^operator O1984 +)
  13270. Retracting propose*predict-yes
  13271. -->
  13272. (O1983 ^name predict-yes +)
  13273. (S1 ^operator O1983 +)
  13274. Retracting elaborate*reward*based*on*reward
  13275. -->
  13276. (R995 ^value 1 +)
  13277. (R1 ^reward R995 +)
  13278. Retracting elaborate*copy-dir-to-output-link
  13279. -->
  13280. (I3 ^dir U +)
  13281. Retracting rl*prefer*rvt*predict-no*H0*4
  13282. -->
  13283. (S1 ^operator O1984 = 1.)
  13284. Retracting rl*prefer*rvt*predict-yes*H0*3
  13285. -->
  13286. (S1 ^operator O1983 = 0.)
  13287. =>WM: (13918: S1 ^operator O1986 +)
  13288. =>WM: (13917: S1 ^operator O1985 +)
  13289. =>WM: (13916: I3 ^dir L)
  13290. =>WM: (13915: O1986 ^name predict-no)
  13291. =>WM: (13914: O1985 ^name predict-yes)
  13292. =>WM: (13913: R996 ^value 1)
  13293. =>WM: (13912: R1 ^reward R996)
  13294. <=WM: (13903: S1 ^operator O1983 +)
  13295. <=WM: (13904: S1 ^operator O1984 +)
  13296. <=WM: (13905: S1 ^operator O1984)
  13297. <=WM: (13888: I3 ^dir U)
  13298. <=WM: (13899: R1 ^reward R995)
  13299. <=WM: (13902: O1984 ^name predict-no)
  13300. <=WM: (13901: O1983 ^name predict-yes)
  13301. <=WM: (13900: R995 ^value 1)
  13302. --- Inner Elaboration Phase, active level 1 (S1) ---
  13303. Firing prefer*rvt*predict-yes*H0
  13304. -->
  13305. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13306. -->
  13307. (S1 ^operator O1985 = 0.6196129817664832)
  13308. Firing rl*prefer*rvt*predict-yes*H0*1
  13309. -->
  13310. (S1 ^operator O1985 = 0.380417577206794)
  13311. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13312. -->
  13313. Firing prefer*rvt*predict-no*H0
  13314. -->
  13315. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13316. -->
  13317. (S1 ^operator O1986 = -0.1479504104026684)
  13318. Firing rl*prefer*rvt*predict-no*H0*2
  13319. -->
  13320. (S1 ^operator O1986 = 0.3140233963466647)
  13321. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13322. -->
  13323. inner elaboration loop at bottom goal.
  13324. Retracting rl*prefer*rvt*predict-no*H0*2
  13325. -->
  13326. (S1 ^operator O1984 = 0.3140233963466647)
  13327. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13328. -->
  13329. (S1 ^operator O1984 = -0.1479504104026684)
  13330. Retracting rl*prefer*rvt*predict-yes*H0*1
  13331. -->
  13332. (S1 ^operator O1983 = 0.380417577206794)
  13333. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13334. -->
  13335. (S1 ^operator O1983 = 0.6196129817664832)
  13336. --- END Proposal Phase ---
  13337. --- Decision Phase ---
  13338. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13339. =>WM: (13919: S1 ^operator O1985)
  13340. 993: O: O1985 (predict-yes)
  13341. --- END Decision Phase ---
  13342. --- Application Phase ---
  13343. --- Firing Productions (PE) For State At Depth 1 ---
  13344. --- Inner Elaboration Phase, active level 1 (S1) ---
  13345. Firing apply*operator
  13346. -->
  13347. (I3 ^predict-yes N993 + :O )
  13348. Firing apply*operator*complete
  13349. -->
  13350. (I3 ^predict-no N992 - :O )
  13351. inner elaboration loop at bottom goal.
  13352. --- Change Working Memory (PE) ---
  13353. =>WM: (13920: I3 ^predict-yes N993)
  13354. <=WM: (13907: N992 ^status complete)
  13355. <=WM: (13906: I3 ^predict-no N992)
  13356. --- Firing Productions (IE) For State At Depth 1 ---
  13357. --- Inner Elaboration Phase, active level 1 (S1) ---
  13358. Firing monitor*world
  13359. -->
  13360. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13361. --- Change Working Memory (IE) ---
  13362. --- END Application Phase ---
  13363. --- Output Phase ---
  13364. ENV: Agent did: predict-yes for direction L in state State-B
  13365. In State-B moving L
  13366. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13367. predict error 0
  13368. dir: dir isR
  13369. --- END Output Phase ---
  13370. |\---- Input Phase ---
  13371. =>WM: (13924: I2 ^dir R)
  13372. =>WM: (13923: I2 ^reward 1)
  13373. =>WM: (13922: I2 ^see 1)
  13374. =>WM: (13921: N993 ^status complete)
  13375. <=WM: (13910: I2 ^dir L)
  13376. <=WM: (13909: I2 ^reward 1)
  13377. <=WM: (13908: I2 ^see 0)
  13378. =>WM: (13925: I2 ^level-1 L1-root)
  13379. <=WM: (13911: I2 ^level-1 R1-root)
  13380. --- END Input Phase ---
  13381. --- Proposal Phase ---
  13382. --- Inner Elaboration Phase, active level 1 (S1) ---
  13383. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13384. -->
  13385. (S1 ^operator O1985 = 0.7063161327052487)
  13386. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13387. -->
  13388. (S1 ^operator O1986 = -0.1937987592593187)
  13389. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13390. -->
  13391. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13392. -->
  13393. Firing elaborate*copy-see-to-output-link
  13394. -->
  13395. (I3 ^see 1 +)
  13396. Firing elaborate*reward*based*on*reward
  13397. -->
  13398. (R997 ^value 1 +)
  13399. (R1 ^reward R997 +)
  13400. Firing propose*predict-yes
  13401. -->
  13402. (O1987 ^name predict-yes +)
  13403. (S1 ^operator O1987 +)
  13404. Firing propose*predict-no
  13405. -->
  13406. (O1988 ^name predict-no +)
  13407. (S1 ^operator O1988 +)
  13408. Firing rl*prefer*rvt*predict-no*H0*6
  13409. -->
  13410. (S1 ^operator O1986 = 0.2298579596436188)
  13411. Firing rl*prefer*rvt*predict-yes*H0*5
  13412. -->
  13413. (S1 ^operator O1985 = 0.29388734647702)
  13414. Firing prefer*rvt*predict-yes*H0
  13415. -->
  13416. Firing prefer*rvt*predict-no*H0
  13417. -->
  13418. Firing elaborate*copy-dir-to-output-link
  13419. -->
  13420. (I3 ^dir R +)
  13421. inner elaboration loop at bottom goal.
  13422. Retracting elaborate*copy-see-to-output-link
  13423. -->
  13424. (I3 ^see 0 +)
  13425. Retracting propose*predict-no
  13426. -->
  13427. (O1986 ^name predict-no +)
  13428. (S1 ^operator O1986 +)
  13429. Retracting propose*predict-yes
  13430. -->
  13431. (O1985 ^name predict-yes +)
  13432. (S1 ^operator O1985 +)
  13433. Retracting elaborate*reward*based*on*reward
  13434. -->
  13435. (R996 ^value 1 +)
  13436. (R1 ^reward R996 +)
  13437. Retracting elaborate*copy-dir-to-output-link
  13438. -->
  13439. (I3 ^dir L +)
  13440. Retracting rl*prefer*rvt*predict-no*H0*2
  13441. -->
  13442. (S1 ^operator O1986 = 0.3140233963466647)
  13443. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13444. -->
  13445. (S1 ^operator O1986 = -0.1479504104026684)
  13446. Retracting rl*prefer*rvt*predict-yes*H0*1
  13447. -->
  13448. (S1 ^operator O1985 = 0.380417577206794)
  13449. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13450. -->
  13451. (S1 ^operator O1985 = 0.6196129817664832)
  13452. =>WM: (13933: S1 ^operator O1988 +)
  13453. =>WM: (13932: S1 ^operator O1987 +)
  13454. =>WM: (13931: I3 ^dir R)
  13455. =>WM: (13930: O1988 ^name predict-no)
  13456. =>WM: (13929: O1987 ^name predict-yes)
  13457. =>WM: (13928: R997 ^value 1)
  13458. =>WM: (13927: R1 ^reward R997)
  13459. =>WM: (13926: I3 ^see 1)
  13460. <=WM: (13917: S1 ^operator O1985 +)
  13461. <=WM: (13919: S1 ^operator O1985)
  13462. <=WM: (13918: S1 ^operator O1986 +)
  13463. <=WM: (13916: I3 ^dir L)
  13464. <=WM: (13912: R1 ^reward R996)
  13465. <=WM: (13898: I3 ^see 0)
  13466. <=WM: (13915: O1986 ^name predict-no)
  13467. <=WM: (13914: O1985 ^name predict-yes)
  13468. <=WM: (13913: R996 ^value 1)
  13469. --- Inner Elaboration Phase, active level 1 (S1) ---
  13470. Firing prefer*rvt*predict-yes*H0
  13471. -->
  13472. Firing rl*prefer*rvt*predict-yes*H0*5
  13473. -->
  13474. (S1 ^operator O1987 = 0.29388734647702)
  13475. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13476. -->
  13477. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13478. -->
  13479. (S1 ^operator O1987 = 0.7063161327052487)
  13480. Firing prefer*rvt*predict-no*H0
  13481. -->
  13482. Firing rl*prefer*rvt*predict-no*H0*6
  13483. -->
  13484. (S1 ^operator O1988 = 0.2298579596436188)
  13485. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13486. -->
  13487. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13488. -->
  13489. (S1 ^operator O1988 = -0.1937987592593187)
  13490. inner elaboration loop at bottom goal.
  13491. Retracting rl*prefer*rvt*predict-no*H0*6
  13492. -->
  13493. (S1 ^operator O1986 = 0.2298579596436188)
  13494. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13495. -->
  13496. (S1 ^operator O1986 = -0.1937987592593187)
  13497. Retracting rl*prefer*rvt*predict-yes*H0*5
  13498. -->
  13499. (S1 ^operator O1985 = 0.29388734647702)
  13500. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13501. -->
  13502. (S1 ^operator O1985 = 0.7063161327052487)
  13503. --- END Proposal Phase ---
  13504. --- Decision Phase ---
  13505. RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380418 -> 0.521345 -0.14093 0.380415(R,m,v=1,0.829268,0.142451)
  13506. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478686 0.140927 0.619613 -> 0.478682 0.140928 0.61961(R,m,v=1,1,0)
  13507. =>WM: (13934: S1 ^operator O1987)
  13508. 994: O: O1987 (predict-yes)
  13509. --- END Decision Phase ---
  13510. --- Application Phase ---
  13511. --- Firing Productions (PE) For State At Depth 1 ---
  13512. --- Inner Elaboration Phase, active level 1 (S1) ---
  13513. Firing apply*operator
  13514. -->
  13515. (I3 ^predict-yes N994 + :O )
  13516. Firing apply*operator*complete
  13517. -->
  13518. (I3 ^predict-yes N993 - :O )
  13519. inner elaboration loop at bottom goal.
  13520. --- Change Working Memory (PE) ---
  13521. =>WM: (13935: I3 ^predict-yes N994)
  13522. <=WM: (13921: N993 ^status complete)
  13523. <=WM: (13920: I3 ^predict-yes N993)
  13524. --- Firing Productions (IE) For State At Depth 1 ---
  13525. --- Inner Elaboration Phase, active level 1 (S1) ---
  13526. Firing monitor*world
  13527. -->
  13528. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13529. --- Change Working Memory (IE) ---
  13530. --- END Application Phase ---
  13531. --- Output Phase ---
  13532. ENV: Agent did: predict-yes for direction R in state State-A
  13533. In State-A moving R
  13534. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13535. predict error 0
  13536. dir: dir isR
  13537. --- END Output Phase ---
  13538. /|\--- Input Phase ---
  13539. =>WM: (13939: I2 ^dir R)
  13540. =>WM: (13938: I2 ^reward 1)
  13541. =>WM: (13937: I2 ^see 1)
  13542. =>WM: (13936: N994 ^status complete)
  13543. <=WM: (13924: I2 ^dir R)
  13544. <=WM: (13923: I2 ^reward 1)
  13545. <=WM: (13922: I2 ^see 1)
  13546. =>WM: (13940: I2 ^level-1 R1-root)
  13547. <=WM: (13925: I2 ^level-1 L1-root)
  13548. --- END Input Phase ---
  13549. --- Proposal Phase ---
  13550. --- Inner Elaboration Phase, active level 1 (S1) ---
  13551. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13552. -->
  13553. (S1 ^operator O1987 = -0.252585164213872)
  13554. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13555. -->
  13556. (S1 ^operator O1988 = 0.7701797310679288)
  13557. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13558. -->
  13559. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13560. -->
  13561. Firing elaborate*copy-see-to-output-link
  13562. -->
  13563. (I3 ^see 1 +)
  13564. Firing elaborate*reward*based*on*reward
  13565. -->
  13566. (R998 ^value 1 +)
  13567. (R1 ^reward R998 +)
  13568. Firing propose*predict-yes
  13569. -->
  13570. (O1989 ^name predict-yes +)
  13571. (S1 ^operator O1989 +)
  13572. Firing propose*predict-no
  13573. -->
  13574. (O1990 ^name predict-no +)
  13575. (S1 ^operator O1990 +)
  13576. Firing rl*prefer*rvt*predict-no*H0*6
  13577. -->
  13578. (S1 ^operator O1988 = 0.2298579596436188)
  13579. Firing rl*prefer*rvt*predict-yes*H0*5
  13580. -->
  13581. (S1 ^operator O1987 = 0.29388734647702)
  13582. Firing prefer*rvt*predict-yes*H0
  13583. -->
  13584. Firing prefer*rvt*predict-no*H0
  13585. -->
  13586. Firing elaborate*copy-dir-to-output-link
  13587. -->
  13588. (I3 ^dir R +)
  13589. inner elaboration loop at bottom goal.
  13590. Retracting elaborate*copy-see-to-output-link
  13591. -->
  13592. (I3 ^see 1 +)
  13593. Retracting propose*predict-no
  13594. -->
  13595. (O1988 ^name predict-no +)
  13596. (S1 ^operator O1988 +)
  13597. Retracting propose*predict-yes
  13598. -->
  13599. (O1987 ^name predict-yes +)
  13600. (S1 ^operator O1987 +)
  13601. Retracting elaborate*reward*based*on*reward
  13602. -->
  13603. (R997 ^value 1 +)
  13604. (R1 ^reward R997 +)
  13605. Retracting elaborate*copy-dir-to-output-link
  13606. -->
  13607. (I3 ^dir R +)
  13608. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13609. -->
  13610. (S1 ^operator O1988 = -0.1937987592593187)
  13611. Retracting rl*prefer*rvt*predict-no*H0*6
  13612. -->
  13613. (S1 ^operator O1988 = 0.2298579596436188)
  13614. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13615. -->
  13616. (S1 ^operator O1987 = 0.7063161327052487)
  13617. Retracting rl*prefer*rvt*predict-yes*H0*5
  13618. -->
  13619. (S1 ^operator O1987 = 0.29388734647702)
  13620. =>WM: (13946: S1 ^operator O1990 +)
  13621. =>WM: (13945: S1 ^operator O1989 +)
  13622. =>WM: (13944: O1990 ^name predict-no)
  13623. =>WM: (13943: O1989 ^name predict-yes)
  13624. =>WM: (13942: R998 ^value 1)
  13625. =>WM: (13941: R1 ^reward R998)
  13626. <=WM: (13932: S1 ^operator O1987 +)
  13627. <=WM: (13934: S1 ^operator O1987)
  13628. <=WM: (13933: S1 ^operator O1988 +)
  13629. <=WM: (13927: R1 ^reward R997)
  13630. <=WM: (13930: O1988 ^name predict-no)
  13631. <=WM: (13929: O1987 ^name predict-yes)
  13632. <=WM: (13928: R997 ^value 1)
  13633. --- Inner Elaboration Phase, active level 1 (S1) ---
  13634. Firing prefer*rvt*predict-yes*H0
  13635. -->
  13636. Firing rl*prefer*rvt*predict-yes*H0*5
  13637. -->
  13638. (S1 ^operator O1989 = 0.29388734647702)
  13639. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13640. -->
  13641. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13642. -->
  13643. (S1 ^operator O1989 = -0.252585164213872)
  13644. Firing prefer*rvt*predict-no*H0
  13645. -->
  13646. Firing rl*prefer*rvt*predict-no*H0*6
  13647. -->
  13648. (S1 ^operator O1990 = 0.2298579596436188)
  13649. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13650. -->
  13651. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13652. -->
  13653. (S1 ^operator O1990 = 0.7701797310679288)
  13654. inner elaboration loop at bottom goal.
  13655. Retracting rl*prefer*rvt*predict-no*H0*6
  13656. -->
  13657. (S1 ^operator O1988 = 0.2298579596436188)
  13658. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13659. -->
  13660. (S1 ^operator O1988 = 0.7701797310679288)
  13661. Retracting rl*prefer*rvt*predict-yes*H0*5
  13662. -->
  13663. (S1 ^operator O1987 = 0.29388734647702)
  13664. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13665. -->
  13666. (S1 ^operator O1987 = -0.252585164213872)
  13667. --- END Proposal Phase ---
  13668. --- Decision Phase ---
  13669. RL update rl*prefer*rvt*predict-yes*H0*5 0.500972 -0.207084 0.293887 -> 0.500957 -0.207086 0.293871(R,m,v=1,0.845161,0.131713)
  13670. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499211 0.207105 0.706316 -> 0.499194 0.207103 0.706296(R,m,v=1,1,0)
  13671. =>WM: (13947: S1 ^operator O1990)
  13672. 995: O: O1990 (predict-no)
  13673. --- END Decision Phase ---
  13674. --- Application Phase ---
  13675. --- Firing Productions (PE) For State At Depth 1 ---
  13676. --- Inner Elaboration Phase, active level 1 (S1) ---
  13677. Firing apply*operator
  13678. -->
  13679. (I3 ^predict-no N995 + :O )
  13680. Firing apply*operator*complete
  13681. -->
  13682. (I3 ^predict-yes N994 - :O )
  13683. inner elaboration loop at bottom goal.
  13684. --- Change Working Memory (PE) ---
  13685. =>WM: (13948: I3 ^predict-no N995)
  13686. <=WM: (13936: N994 ^status complete)
  13687. <=WM: (13935: I3 ^predict-yes N994)
  13688. --- Firing Productions (IE) For State At Depth 1 ---
  13689. --- Inner Elaboration Phase, active level 1 (S1) ---
  13690. Firing monitor*world
  13691. -->
  13692. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13693. --- Change Working Memory (IE) ---
  13694. --- END Application Phase ---
  13695. --- Output Phase ---
  13696. ENV: Agent did: predict-no for direction R in state State-B
  13697. In State-B moving R
  13698. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13699. predict error 0
  13700. dir: dir isU
  13701. --- END Output Phase ---
  13702. -/|--- Input Phase ---
  13703. =>WM: (13952: I2 ^dir U)
  13704. =>WM: (13951: I2 ^reward 1)
  13705. =>WM: (13950: I2 ^see 0)
  13706. =>WM: (13949: N995 ^status complete)
  13707. <=WM: (13939: I2 ^dir R)
  13708. <=WM: (13938: I2 ^reward 1)
  13709. <=WM: (13937: I2 ^see 1)
  13710. =>WM: (13953: I2 ^level-1 R0-root)
  13711. <=WM: (13940: I2 ^level-1 R1-root)
  13712. --- END Input Phase ---
  13713. --- Proposal Phase ---
  13714. --- Inner Elaboration Phase, active level 1 (S1) ---
  13715. Firing elaborate*copy-see-to-output-link
  13716. -->
  13717. (I3 ^see 0 +)
  13718. Firing elaborate*reward*based*on*reward
  13719. -->
  13720. (R999 ^value 1 +)
  13721. (R1 ^reward R999 +)
  13722. Firing propose*predict-yes
  13723. -->
  13724. (O1991 ^name predict-yes +)
  13725. (S1 ^operator O1991 +)
  13726. Firing propose*predict-no
  13727. -->
  13728. (O1992 ^name predict-no +)
  13729. (S1 ^operator O1992 +)
  13730. Firing rl*prefer*rvt*predict-no*H0*4
  13731. -->
  13732. (S1 ^operator O1990 = 1.)
  13733. Firing rl*prefer*rvt*predict-yes*H0*3
  13734. -->
  13735. (S1 ^operator O1989 = 0.)
  13736. Firing prefer*rvt*predict-yes*H0
  13737. -->
  13738. Firing prefer*rvt*predict-no*H0
  13739. -->
  13740. Firing elaborate*copy-dir-to-output-link
  13741. -->
  13742. (I3 ^dir U +)
  13743. inner elaboration loop at bottom goal.
  13744. Retracting elaborate*copy-see-to-output-link
  13745. -->
  13746. (I3 ^see 1 +)
  13747. Retracting propose*predict-no
  13748. -->
  13749. (O1990 ^name predict-no +)
  13750. (S1 ^operator O1990 +)
  13751. Retracting propose*predict-yes
  13752. -->
  13753. (O1989 ^name predict-yes +)
  13754. (S1 ^operator O1989 +)
  13755. Retracting elaborate*reward*based*on*reward
  13756. -->
  13757. (R998 ^value 1 +)
  13758. (R1 ^reward R998 +)
  13759. Retracting elaborate*copy-dir-to-output-link
  13760. -->
  13761. (I3 ^dir R +)
  13762. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13763. -->
  13764. (S1 ^operator O1990 = 0.7701797310679288)
  13765. Retracting rl*prefer*rvt*predict-no*H0*6
  13766. -->
  13767. (S1 ^operator O1990 = 0.2298579596436188)
  13768. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13769. -->
  13770. (S1 ^operator O1989 = -0.252585164213872)
  13771. Retracting rl*prefer*rvt*predict-yes*H0*5
  13772. -->
  13773. (S1 ^operator O1989 = 0.2938705117203769)
  13774. =>WM: (13961: S1 ^operator O1992 +)
  13775. =>WM: (13960: S1 ^operator O1991 +)
  13776. =>WM: (13959: I3 ^dir U)
  13777. =>WM: (13958: O1992 ^name predict-no)
  13778. =>WM: (13957: O1991 ^name predict-yes)
  13779. =>WM: (13956: R999 ^value 1)
  13780. =>WM: (13955: R1 ^reward R999)
  13781. =>WM: (13954: I3 ^see 0)
  13782. <=WM: (13945: S1 ^operator O1989 +)
  13783. <=WM: (13946: S1 ^operator O1990 +)
  13784. <=WM: (13947: S1 ^operator O1990)
  13785. <=WM: (13931: I3 ^dir R)
  13786. <=WM: (13941: R1 ^reward R998)
  13787. <=WM: (13926: I3 ^see 1)
  13788. <=WM: (13944: O1990 ^name predict-no)
  13789. <=WM: (13943: O1989 ^name predict-yes)
  13790. <=WM: (13942: R998 ^value 1)
  13791. --- Inner Elaboration Phase, active level 1 (S1) ---
  13792. Firing prefer*rvt*predict-yes*H0
  13793. -->
  13794. Firing rl*prefer*rvt*predict-yes*H0*3
  13795. -->
  13796. (S1 ^operator O1991 = 0.)
  13797. Firing prefer*rvt*predict-no*H0
  13798. -->
  13799. Firing rl*prefer*rvt*predict-no*H0*4
  13800. -->
  13801. (S1 ^operator O1992 = 1.)
  13802. inner elaboration loop at bottom goal.
  13803. Retracting rl*prefer*rvt*predict-no*H0*4
  13804. -->
  13805. (S1 ^operator O1990 = 1.)
  13806. Retracting rl*prefer*rvt*predict-yes*H0*3
  13807. -->
  13808. (S1 ^operator O1989 = 0.)
  13809. --- END Proposal Phase ---
  13810. --- Decision Phase ---
  13811. RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382052 0.229858 -> 0.611908 -0.382053 0.229855(R,m,v=1,0.845714,0.131232)
  13812. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.38812 0.38206 0.77018 -> 0.388117 0.382059 0.770176(R,m,v=1,1,0)
  13813. =>WM: (13962: S1 ^operator O1992)
  13814. 996: O: O1992 (predict-no)
  13815. --- END Decision Phase ---
  13816. --- Application Phase ---
  13817. --- Firing Productions (PE) For State At Depth 1 ---
  13818. --- Inner Elaboration Phase, active level 1 (S1) ---
  13819. Firing apply*operator
  13820. -->
  13821. (I3 ^predict-no N996 + :O )
  13822. Firing apply*operator*complete
  13823. -->
  13824. (I3 ^predict-no N995 - :O )
  13825. inner elaboration loop at bottom goal.
  13826. --- Change Working Memory (PE) ---
  13827. =>WM: (13963: I3 ^predict-no N996)
  13828. <=WM: (13949: N995 ^status complete)
  13829. <=WM: (13948: I3 ^predict-no N995)
  13830. --- Firing Productions (IE) For State At Depth 1 ---
  13831. --- Inner Elaboration Phase, active level 1 (S1) ---
  13832. Firing monitor*world
  13833. -->
  13834. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13835. --- Change Working Memory (IE) ---
  13836. --- END Application Phase ---
  13837. --- Output Phase ---
  13838. ENV: Agent did: predict-no for direction U in state State-B
  13839. In State-B moving U
  13840. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13841. predict error 0
  13842. dir: dir isU
  13843. --- END Output Phase ---
  13844. \-/--- Input Phase ---
  13845. =>WM: (13967: I2 ^dir U)
  13846. =>WM: (13966: I2 ^reward 1)
  13847. =>WM: (13965: I2 ^see 0)
  13848. =>WM: (13964: N996 ^status complete)
  13849. <=WM: (13952: I2 ^dir U)
  13850. <=WM: (13951: I2 ^reward 1)
  13851. <=WM: (13950: I2 ^see 0)
  13852. =>WM: (13968: I2 ^level-1 R0-root)
  13853. <=WM: (13953: I2 ^level-1 R0-root)
  13854. --- END Input Phase ---
  13855. --- Proposal Phase ---
  13856. --- Inner Elaboration Phase, active level 1 (S1) ---
  13857. Firing elaborate*copy-see-to-output-link
  13858. -->
  13859. (I3 ^see 0 +)
  13860. Firing elaborate*reward*based*on*reward
  13861. -->
  13862. (R1000 ^value 1 +)
  13863. (R1 ^reward R1000 +)
  13864. Firing propose*predict-yes
  13865. -->
  13866. (O1993 ^name predict-yes +)
  13867. (S1 ^operator O1993 +)
  13868. Firing propose*predict-no
  13869. -->
  13870. (O1994 ^name predict-no +)
  13871. (S1 ^operator O1994 +)
  13872. Firing rl*prefer*rvt*predict-no*H0*4
  13873. -->
  13874. (S1 ^operator O1992 = 1.)
  13875. Firing rl*prefer*rvt*predict-yes*H0*3
  13876. -->
  13877. (S1 ^operator O1991 = 0.)
  13878. Firing prefer*rvt*predict-yes*H0
  13879. -->
  13880. Firing prefer*rvt*predict-no*H0
  13881. -->
  13882. Firing elaborate*copy-dir-to-output-link
  13883. -->
  13884. (I3 ^dir U +)
  13885. inner elaboration loop at bottom goal.
  13886. Retracting elaborate*copy-see-to-output-link
  13887. -->
  13888. (I3 ^see 0 +)
  13889. Retracting propose*predict-no
  13890. -->
  13891. (O1992 ^name predict-no +)
  13892. (S1 ^operator O1992 +)
  13893. Retracting propose*predict-yes
  13894. -->
  13895. (O1991 ^name predict-yes +)
  13896. (S1 ^operator O1991 +)
  13897. Retracting elaborate*reward*based*on*reward
  13898. -->
  13899. (R999 ^value 1 +)
  13900. (R1 ^reward R999 +)
  13901. Retracting elaborate*copy-dir-to-output-link
  13902. -->
  13903. (I3 ^dir U +)
  13904. Retracting rl*prefer*rvt*predict-no*H0*4
  13905. -->
  13906. (S1 ^operator O1992 = 1.)
  13907. Retracting rl*prefer*rvt*predict-yes*H0*3
  13908. -->
  13909. (S1 ^operator O1991 = 0.)
  13910. =>WM: (13974: S1 ^operator O1994 +)
  13911. =>WM: (13973: S1 ^operator O1993 +)
  13912. =>WM: (13972: O1994 ^name predict-no)
  13913. =>WM: (13971: O1993 ^name predict-yes)
  13914. =>WM: (13970: R1000 ^value 1)
  13915. =>WM: (13969: R1 ^reward R1000)
  13916. <=WM: (13960: S1 ^operator O1991 +)
  13917. <=WM: (13961: S1 ^operator O1992 +)
  13918. <=WM: (13962: S1 ^operator O1992)
  13919. <=WM: (13955: R1 ^reward R999)
  13920. <=WM: (13958: O1992 ^name predict-no)
  13921. <=WM: (13957: O1991 ^name predict-yes)
  13922. <=WM: (13956: R999 ^value 1)
  13923. --- Inner Elaboration Phase, active level 1 (S1) ---
  13924. Firing prefer*rvt*predict-yes*H0
  13925. -->
  13926. Firing rl*prefer*rvt*predict-yes*H0*3
  13927. -->
  13928. (S1 ^operator O1993 = 0.)
  13929. Firing prefer*rvt*predict-no*H0
  13930. -->
  13931. Firing rl*prefer*rvt*predict-no*H0*4
  13932. -->
  13933. (S1 ^operator O1994 = 1.)
  13934. inner elaboration loop at bottom goal.
  13935. Retracting rl*prefer*rvt*predict-no*H0*4
  13936. -->
  13937. (S1 ^operator O1992 = 1.)
  13938. Retracting rl*prefer*rvt*predict-yes*H0*3
  13939. -->
  13940. (S1 ^operator O1991 = 0.)
  13941. --- END Proposal Phase ---
  13942. --- Decision Phase ---
  13943. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13944. =>WM: (13975: S1 ^operator O1994)
  13945. 997: O: O1994 (predict-no)
  13946. --- END Decision Phase ---
  13947. --- Application Phase ---
  13948. --- Firing Productions (PE) For State At Depth 1 ---
  13949. --- Inner Elaboration Phase, active level 1 (S1) ---
  13950. Firing apply*operator
  13951. -->
  13952. (I3 ^predict-no N997 + :O )
  13953. Firing apply*operator*complete
  13954. -->
  13955. (I3 ^predict-no N996 - :O )
  13956. inner elaboration loop at bottom goal.
  13957. --- Change Working Memory (PE) ---
  13958. =>WM: (13976: I3 ^predict-no N997)
  13959. <=WM: (13964: N996 ^status complete)
  13960. <=WM: (13963: I3 ^predict-no N996)
  13961. --- Firing Productions (IE) For State At Depth 1 ---
  13962. --- Inner Elaboration Phase, active level 1 (S1) ---
  13963. Firing monitor*world
  13964. -->
  13965. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13966. --- Change Working Memory (IE) ---
  13967. --- END Application Phase ---
  13968. --- Output Phase ---
  13969. ENV: Agent did: predict-no for direction U in state State-B
  13970. In State-B moving U
  13971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13972. predict error 0
  13973. dir: dir isL
  13974. --- END Output Phase ---
  13975. |\--- Input Phase ---
  13976. =>WM: (13980: I2 ^dir L)
  13977. =>WM: (13979: I2 ^reward 1)
  13978. =>WM: (13978: I2 ^see 0)
  13979. =>WM: (13977: N997 ^status complete)
  13980. <=WM: (13967: I2 ^dir U)
  13981. <=WM: (13966: I2 ^reward 1)
  13982. <=WM: (13965: I2 ^see 0)
  13983. =>WM: (13981: I2 ^level-1 R0-root)
  13984. <=WM: (13968: I2 ^level-1 R0-root)
  13985. --- END Input Phase ---
  13986. --- Proposal Phase ---
  13987. --- Inner Elaboration Phase, active level 1 (S1) ---
  13988. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  13989. -->
  13990. (S1 ^operator O1993 = 0.6195669380621123)
  13991. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  13992. -->
  13993. (S1 ^operator O1994 = -0.2190661556260421)
  13994. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13995. -->
  13996. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13997. -->
  13998. Firing elaborate*copy-see-to-output-link
  13999. -->
  14000. (I3 ^see 0 +)
  14001. Firing elaborate*reward*based*on*reward
  14002. -->
  14003. (R1001 ^value 1 +)
  14004. (R1 ^reward R1001 +)
  14005. Firing propose*predict-yes
  14006. -->
  14007. (O1995 ^name predict-yes +)
  14008. (S1 ^operator O1995 +)
  14009. Firing propose*predict-no
  14010. -->
  14011. (O1996 ^name predict-no +)
  14012. (S1 ^operator O1996 +)
  14013. Firing rl*prefer*rvt*predict-no*H0*2
  14014. -->
  14015. (S1 ^operator O1994 = 0.3140233963466647)
  14016. Firing rl*prefer*rvt*predict-yes*H0*1
  14017. -->
  14018. (S1 ^operator O1993 = 0.380415072318069)
  14019. Firing prefer*rvt*predict-yes*H0
  14020. -->
  14021. Firing prefer*rvt*predict-no*H0
  14022. -->
  14023. Firing elaborate*copy-dir-to-output-link
  14024. -->
  14025. (I3 ^dir L +)
  14026. inner elaboration loop at bottom goal.
  14027. Retracting elaborate*copy-see-to-output-link
  14028. -->
  14029. (I3 ^see 0 +)
  14030. Retracting propose*predict-no
  14031. -->
  14032. (O1994 ^name predict-no +)
  14033. (S1 ^operator O1994 +)
  14034. Retracting propose*predict-yes
  14035. -->
  14036. (O1993 ^name predict-yes +)
  14037. (S1 ^operator O1993 +)
  14038. Retracting elaborate*reward*based*on*reward
  14039. -->
  14040. (R1000 ^value 1 +)
  14041. (R1 ^reward R1000 +)
  14042. Retracting elaborate*copy-dir-to-output-link
  14043. -->
  14044. (I3 ^dir U +)
  14045. Retracting rl*prefer*rvt*predict-no*H0*4
  14046. -->
  14047. (S1 ^operator O1994 = 1.)
  14048. Retracting rl*prefer*rvt*predict-yes*H0*3
  14049. -->
  14050. (S1 ^operator O1993 = 0.)
  14051. =>WM: (13988: S1 ^operator O1996 +)
  14052. =>WM: (13987: S1 ^operator O1995 +)
  14053. =>WM: (13986: I3 ^dir L)
  14054. =>WM: (13985: O1996 ^name predict-no)
  14055. =>WM: (13984: O1995 ^name predict-yes)
  14056. =>WM: (13983: R1001 ^value 1)
  14057. =>WM: (13982: R1 ^reward R1001)
  14058. <=WM: (13973: S1 ^operator O1993 +)
  14059. <=WM: (13974: S1 ^operator O1994 +)
  14060. <=WM: (13975: S1 ^operator O1994)
  14061. <=WM: (13959: I3 ^dir U)
  14062. <=WM: (13969: R1 ^reward R1000)
  14063. <=WM: (13972: O1994 ^name predict-no)
  14064. <=WM: (13971: O1993 ^name predict-yes)
  14065. <=WM: (13970: R1000 ^value 1)
  14066. --- Inner Elaboration Phase, active level 1 (S1) ---
  14067. Firing prefer*rvt*predict-yes*H0
  14068. -->
  14069. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14070. -->
  14071. (S1 ^operator O1995 = 0.6195669380621123)
  14072. Firing rl*prefer*rvt*predict-yes*H0*1
  14073. -->
  14074. (S1 ^operator O1995 = 0.380415072318069)
  14075. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14076. -->
  14077. Firing prefer*rvt*predict-no*H0
  14078. -->
  14079. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14080. -->
  14081. (S1 ^operator O1996 = -0.2190661556260421)
  14082. Firing rl*prefer*rvt*predict-no*H0*2
  14083. -->
  14084. (S1 ^operator O1996 = 0.3140233963466647)
  14085. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14086. -->
  14087. inner elaboration loop at bottom goal.
  14088. Retracting rl*prefer*rvt*predict-no*H0*2
  14089. -->
  14090. (S1 ^operator O1994 = 0.3140233963466647)
  14091. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14092. -->
  14093. (S1 ^operator O1994 = -0.2190661556260421)
  14094. Retracting rl*prefer*rvt*predict-yes*H0*1
  14095. -->
  14096. (S1 ^operator O1993 = 0.380415072318069)
  14097. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14098. -->
  14099. (S1 ^operator O1993 = 0.6195669380621123)
  14100. --- END Proposal Phase ---
  14101. --- Decision Phase ---
  14102. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14103. =>WM: (13989: S1 ^operator O1995)
  14104. 998: O: O1995 (predict-yes)
  14105. --- END Decision Phase ---
  14106. --- Application Phase ---
  14107. --- Firing Productions (PE) For State At Depth 1 ---
  14108. --- Inner Elaboration Phase, active level 1 (S1) ---
  14109. Firing apply*operator
  14110. -->
  14111. (I3 ^predict-yes N998 + :O )
  14112. Firing apply*operator*complete
  14113. -->
  14114. (I3 ^predict-no N997 - :O )
  14115. inner elaboration loop at bottom goal.
  14116. --- Change Working Memory (PE) ---
  14117. =>WM: (13990: I3 ^predict-yes N998)
  14118. <=WM: (13977: N997 ^status complete)
  14119. <=WM: (13976: I3 ^predict-no N997)
  14120. --- Firing Productions (IE) For State At Depth 1 ---
  14121. --- Inner Elaboration Phase, active level 1 (S1) ---
  14122. Firing monitor*world
  14123. -->
  14124. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14125. --- Change Working Memory (IE) ---
  14126. --- END Application Phase ---
  14127. --- Output Phase ---
  14128. ENV: Agent did: predict-yes for direction L in state State-B
  14129. In State-B moving L
  14130. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14131. predict error 0
  14132. dir: dir isL
  14133. --- END Output Phase ---
  14134. -/|--- Input Phase ---
  14135. =>WM: (13994: I2 ^dir L)
  14136. =>WM: (13993: I2 ^reward 1)
  14137. =>WM: (13992: I2 ^see 1)
  14138. =>WM: (13991: N998 ^status complete)
  14139. <=WM: (13980: I2 ^dir L)
  14140. <=WM: (13979: I2 ^reward 1)
  14141. <=WM: (13978: I2 ^see 0)
  14142. =>WM: (13995: I2 ^level-1 L1-root)
  14143. <=WM: (13981: I2 ^level-1 R0-root)
  14144. --- END Input Phase ---
  14145. --- Proposal Phase ---
  14146. --- Inner Elaboration Phase, active level 1 (S1) ---
  14147. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14148. -->
  14149. (S1 ^operator O1995 = -0.3470159027404986)
  14150. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14151. -->
  14152. (S1 ^operator O1996 = 0.686145215235081)
  14153. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14154. -->
  14155. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14156. -->
  14157. Firing elaborate*copy-see-to-output-link
  14158. -->
  14159. (I3 ^see 1 +)
  14160. Firing elaborate*reward*based*on*reward
  14161. -->
  14162. (R1002 ^value 1 +)
  14163. (R1 ^reward R1002 +)
  14164. Firing propose*predict-yes
  14165. -->
  14166. (O1997 ^name predict-yes +)
  14167. (S1 ^operator O1997 +)
  14168. Firing propose*predict-no
  14169. -->
  14170. (O1998 ^name predict-no +)
  14171. (S1 ^operator O1998 +)
  14172. Firing rl*prefer*rvt*predict-no*H0*2
  14173. -->
  14174. (S1 ^operator O1996 = 0.3140233963466647)
  14175. Firing rl*prefer*rvt*predict-yes*H0*1
  14176. -->
  14177. (S1 ^operator O1995 = 0.380415072318069)
  14178. Firing prefer*rvt*predict-yes*H0
  14179. -->
  14180. Firing prefer*rvt*predict-no*H0
  14181. -->
  14182. Firing elaborate*copy-dir-to-output-link
  14183. -->
  14184. (I3 ^dir L +)
  14185. inner elaboration loop at bottom goal.
  14186. Retracting elaborate*copy-see-to-output-link
  14187. -->
  14188. (I3 ^see 0 +)
  14189. Retracting propose*predict-no
  14190. -->
  14191. (O1996 ^name predict-no +)
  14192. (S1 ^operator O1996 +)
  14193. Retracting propose*predict-yes
  14194. -->
  14195. (O1995 ^name predict-yes +)
  14196. (S1 ^operator O1995 +)
  14197. Retracting elaborate*reward*based*on*reward
  14198. -->
  14199. (R1001 ^value 1 +)
  14200. (R1 ^reward R1001 +)
  14201. Retracting elaborate*copy-dir-to-output-link
  14202. -->
  14203. (I3 ^dir L +)
  14204. Retracting rl*prefer*rvt*predict-no*H0*2
  14205. -->
  14206. (S1 ^operator O1996 = 0.3140233963466647)
  14207. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14208. -->
  14209. (S1 ^operator O1996 = -0.2190661556260421)
  14210. Retracting rl*prefer*rvt*predict-yes*H0*1
  14211. -->
  14212. (S1 ^operator O1995 = 0.380415072318069)
  14213. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14214. -->
  14215. (S1 ^operator O1995 = 0.6195669380621123)
  14216. =>WM: (14002: S1 ^operator O1998 +)
  14217. =>WM: (14001: S1 ^operator O1997 +)
  14218. =>WM: (14000: O1998 ^name predict-no)
  14219. =>WM: (13999: O1997 ^name predict-yes)
  14220. =>WM: (13998: R1002 ^value 1)
  14221. =>WM: (13997: R1 ^reward R1002)
  14222. =>WM: (13996: I3 ^see 1)
  14223. <=WM: (13987: S1 ^operator O1995 +)
  14224. <=WM: (13989: S1 ^operator O1995)
  14225. <=WM: (13988: S1 ^operator O1996 +)
  14226. <=WM: (13982: R1 ^reward R1001)
  14227. <=WM: (13954: I3 ^see 0)
  14228. <=WM: (13985: O1996 ^name predict-no)
  14229. <=WM: (13984: O1995 ^name predict-yes)
  14230. <=WM: (13983: R1001 ^value 1)
  14231. --- Inner Elaboration Phase, active level 1 (S1) ---
  14232. Firing prefer*rvt*predict-yes*H0
  14233. -->
  14234. Firing rl*prefer*rvt*predict-yes*H0*1
  14235. -->
  14236. (S1 ^operator O1997 = 0.380415072318069)
  14237. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14238. -->
  14239. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14240. -->
  14241. (S1 ^operator O1997 = -0.3470159027404986)
  14242. Firing prefer*rvt*predict-no*H0
  14243. -->
  14244. Firing rl*prefer*rvt*predict-no*H0*2
  14245. -->
  14246. (S1 ^operator O1998 = 0.3140233963466647)
  14247. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14248. -->
  14249. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14250. -->
  14251. (S1 ^operator O1998 = 0.686145215235081)
  14252. inner elaboration loop at bottom goal.
  14253. Retracting rl*prefer*rvt*predict-no*H0*2
  14254. -->
  14255. (S1 ^operator O1996 = 0.3140233963466647)
  14256. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14257. -->
  14258. (S1 ^operator O1996 = 0.686145215235081)
  14259. Retracting rl*prefer*rvt*predict-yes*H0*1
  14260. -->
  14261. (S1 ^operator O1995 = 0.380415072318069)
  14262. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14263. -->
  14264. (S1 ^operator O1995 = -0.3470159027404986)
  14265. --- END Proposal Phase ---
  14266. --- Decision Phase ---
  14267. RL update rl*prefer*rvt*predict-yes*H0*1 0.521345 -0.14093 0.380415 -> 0.521347 -0.14093 0.380417(R,m,v=1,0.830303,0.141759)
  14268. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478635 0.140932 0.619567 -> 0.478637 0.140932 0.619569(R,m,v=1,1,0)
  14269. =>WM: (14003: S1 ^operator O1998)
  14270. 999: O: O1998 (predict-no)
  14271. --- END Decision Phase ---
  14272. --- Application Phase ---
  14273. --- Firing Productions (PE) For State At Depth 1 ---
  14274. --- Inner Elaboration Phase, active level 1 (S1) ---
  14275. Firing apply*operator
  14276. -->
  14277. (I3 ^predict-no N999 + :O )
  14278. Firing apply*operator*complete
  14279. -->
  14280. (I3 ^predict-yes N998 - :O )
  14281. inner elaboration loop at bottom goal.
  14282. --- Change Working Memory (PE) ---
  14283. =>WM: (14004: I3 ^predict-no N999)
  14284. <=WM: (13991: N998 ^status complete)
  14285. <=WM: (13990: I3 ^predict-yes N998)
  14286. --- Firing Productions (IE) For State At Depth 1 ---
  14287. --- Inner Elaboration Phase, active level 1 (S1) ---
  14288. Firing monitor*world
  14289. -->
  14290. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14291. --- Change Working Memory (IE) ---
  14292. --- END Application Phase ---
  14293. --- Output Phase ---
  14294. ENV: Agent did: predict-no for direction L in state State-A
  14295. In State-A moving L
  14296. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14297. predict error 0
  14298. dir: dir isU
  14299. --- END Output Phase ---
  14300. \-/--- Input Phase ---
  14301. =>WM: (14008: I2 ^dir U)
  14302. =>WM: (14007: I2 ^reward 1)
  14303. =>WM: (14006: I2 ^see 0)
  14304. =>WM: (14005: N999 ^status complete)
  14305. <=WM: (13994: I2 ^dir L)
  14306. <=WM: (13993: I2 ^reward 1)
  14307. <=WM: (13992: I2 ^see 1)
  14308. =>WM: (14009: I2 ^level-1 L0-root)
  14309. <=WM: (13995: I2 ^level-1 L1-root)
  14310. --- END Input Phase ---
  14311. --- Proposal Phase ---
  14312. --- Inner Elaboration Phase, active level 1 (S1) ---
  14313. Firing elaborate*copy-see-to-output-link
  14314. -->
  14315. (I3 ^see 0 +)
  14316. Firing elaborate*reward*based*on*reward
  14317. -->
  14318. (R1003 ^value 1 +)
  14319. (R1 ^reward R1003 +)
  14320. Firing propose*predict-yes
  14321. -->
  14322. (O1999 ^name predict-yes +)
  14323. (S1 ^operator O1999 +)
  14324. Firing propose*predict-no
  14325. -->
  14326. (O2000 ^name predict-no +)
  14327. (S1 ^operator O2000 +)
  14328. Firing rl*prefer*rvt*predict-no*H0*4
  14329. -->
  14330. (S1 ^operator O1998 = 1.)
  14331. Firing rl*prefer*rvt*predict-yes*H0*3
  14332. -->
  14333. (S1 ^operator O1997 = 0.)
  14334. Firing prefer*rvt*predict-yes*H0
  14335. -->
  14336. Firing prefer*rvt*predict-no*H0
  14337. -->
  14338. Firing elaborate*copy-dir-to-output-link
  14339. -->
  14340. (I3 ^dir U +)
  14341. inner elaboration loop at bottom goal.
  14342. Retracting elaborate*copy-see-to-output-link
  14343. -->
  14344. (I3 ^see 1 +)
  14345. Retracting propose*predict-no
  14346. -->
  14347. (O1998 ^name predict-no +)
  14348. (S1 ^operator O1998 +)
  14349. Retracting propose*predict-yes
  14350. -->
  14351. (O1997 ^name predict-yes +)
  14352. (S1 ^operator O1997 +)
  14353. Retracting elaborate*reward*based*on*reward
  14354. -->
  14355. (R1002 ^value 1 +)
  14356. (R1 ^reward R1002 +)
  14357. Retracting elaborate*copy-dir-to-output-link
  14358. -->
  14359. (I3 ^dir L +)
  14360. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14361. -->
  14362. (S1 ^operator O1998 = 0.686145215235081)
  14363. Retracting rl*prefer*rvt*predict-no*H0*2
  14364. -->
  14365. (S1 ^operator O1998 = 0.3140233963466647)
  14366. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14367. -->
  14368. (S1 ^operator O1997 = -0.3470159027404986)
  14369. Retracting rl*prefer*rvt*predict-yes*H0*1
  14370. -->
  14371. (S1 ^operator O1997 = 0.3804165454412648)
  14372. =>WM: (14017: S1 ^operator O2000 +)
  14373. =>WM: (14016: S1 ^operator O1999 +)
  14374. =>WM: (14015: I3 ^dir U)
  14375. =>WM: (14014: O2000 ^name predict-no)
  14376. =>WM: (14013: O1999 ^name predict-yes)
  14377. =>WM: (14012: R1003 ^value 1)
  14378. =>WM: (14011: R1 ^reward R1003)
  14379. =>WM: (14010: I3 ^see 0)
  14380. <=WM: (14001: S1 ^operator O1997 +)
  14381. <=WM: (14002: S1 ^operator O1998 +)
  14382. <=WM: (14003: S1 ^operator O1998)
  14383. <=WM: (13986: I3 ^dir L)
  14384. <=WM: (13997: R1 ^reward R1002)
  14385. <=WM: (13996: I3 ^see 1)
  14386. <=WM: (14000: O1998 ^name predict-no)
  14387. <=WM: (13999: O1997 ^name predict-yes)
  14388. <=WM: (13998: R1002 ^value 1)
  14389. --- Inner Elaboration Phase, active level 1 (S1) ---
  14390. Firing prefer*rvt*predict-yes*H0
  14391. -->
  14392. Firing rl*prefer*rvt*predict-yes*H0*3
  14393. -->
  14394. (S1 ^operator O1999 = 0.)
  14395. Firing prefer*rvt*predict-no*H0
  14396. -->
  14397. Firing rl*prefer*rvt*predict-no*H0*4
  14398. -->
  14399. (S1 ^operator O2000 = 1.)
  14400. inner elaboration loop at bottom goal.
  14401. Retracting rl*prefer*rvt*predict-no*H0*4
  14402. -->
  14403. (S1 ^operator O1998 = 1.)
  14404. Retracting rl*prefer*rvt*predict-yes*H0*3
  14405. -->
  14406. (S1 ^operator O1997 = 0.)
  14407. --- END Proposal Phase ---
  14408. --- Decision Phase ---
  14409. RL update rl*prefer*rvt*predict-no*H0*2 0.485033 -0.171009 0.314023 -> 0.485022 -0.171012 0.314009(R,m,v=1,0.860927,0.12053)
  14410. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.5151 0.171045 0.686145 -> 0.515087 0.171042 0.686129(R,m,v=1,1,0)
  14411. =>WM: (14018: S1 ^operator O2000)
  14412. 1000: O: O2000 (predict-no)
  14413. --- END Decision Phase ---
  14414. --- Application Phase ---
  14415. --- Firing Productions (PE) For State At Depth 1 ---
  14416. --- Inner Elaboration Phase, active level 1 (S1) ---
  14417. Firing apply*operator
  14418. -->
  14419. (I3 ^predict-no N1000 + :O )
  14420. Firing apply*operator*complete
  14421. -->
  14422. (I3 ^predict-no N999 - :O )
  14423. inner elaboration loop at bottom goal.
  14424. --- Change Working Memory (PE) ---
  14425. =>WM: (14019: I3 ^predict-no N1000)
  14426. <=WM: (14005: N999 ^status complete)
  14427. <=WM: (14004: I3 ^predict-no N999)
  14428. --- Firing Productions (IE) For State At Depth 1 ---
  14429. --- Inner Elaboration Phase, active level 1 (S1) ---
  14430. Firing monitor*world
  14431. -->
  14432. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14433. --- Change Working Memory (IE) ---
  14434. --- END Application Phase ---
  14435. --- Output Phase ---
  14436. ENV: Agent did: predict-no for direction U in state State-A
  14437. In State-A moving U
  14438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14439. predict error 0
  14440. dir: dir isR
  14441. --- END Output Phase ---
  14442. |\-/|\-/|\--- Input Phase ---
  14443. =>WM: (14023: I2 ^dir R)
  14444. =>WM: (14022: I2 ^reward 1)
  14445. =>WM: (14021: I2 ^see 0)
  14446. =>WM: (14020: N1000 ^status complete)
  14447. <=WM: (14008: I2 ^dir U)
  14448. <=WM: (14007: I2 ^reward 1)
  14449. <=WM: (14006: I2 ^see 0)
  14450. =>WM: (14024: I2 ^level-1 L0-root)
  14451. <=WM: (14009: I2 ^level-1 L0-root)
  14452. --- END Input Phase ---
  14453. --- Proposal Phase ---
  14454. --- Inner Elaboration Phase, active level 1 (S1) ---
  14455. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14456. -->
  14457. (S1 ^operator O1999 = 0.7055034804752064)
  14458. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14459. -->
  14460. (S1 ^operator O2000 = -0.2023211881870005)
  14461. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14462. -->
  14463. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14464. -->
  14465. Firing elaborate*copy-see-to-output-link
  14466. -->
  14467. (I3 ^see 0 +)
  14468. Firing elaborate*reward*based*on*reward
  14469. -->
  14470. (R1004 ^value 1 +)
  14471. (R1 ^reward R1004 +)
  14472. Firing propose*predict-yes
  14473. -->
  14474. (O2001 ^name predict-yes +)
  14475. (S1 ^operator O2001 +)
  14476. Firing propose*predict-no
  14477. -->
  14478. (O2002 ^name predict-no +)
  14479. (S1 ^operator O2002 +)
  14480. Firing rl*prefer*rvt*predict-no*H0*6
  14481. -->
  14482. (S1 ^operator O2000 = 0.229854902707684)
  14483. Firing rl*prefer*rvt*predict-yes*H0*5
  14484. -->
  14485. (S1 ^operator O1999 = 0.2938705117203769)
  14486. Firing prefer*rvt*predict-yes*H0
  14487. -->
  14488. Firing prefer*rvt*predict-no*H0
  14489. -->
  14490. Firing elaborate*copy-dir-to-output-link
  14491. -->
  14492. (I3 ^dir R +)
  14493. inner elaboration loop at bottom goal.
  14494. Retracting elaborate*copy-see-to-output-link
  14495. -->
  14496. (I3 ^see 0 +)
  14497. Retracting propose*predict-no
  14498. -->
  14499. (O2000 ^name predict-no +)
  14500. (S1 ^operator O2000 +)
  14501. Retracting propose*predict-yes
  14502. -->
  14503. (O1999 ^name predict-yes +)
  14504. (S1 ^operator O1999 +)
  14505. Retracting elaborate*reward*based*on*reward
  14506. -->
  14507. (R1003 ^value 1 +)
  14508. (R1 ^reward R1003 +)
  14509. Retracting elaborate*copy-dir-to-output-link
  14510. -->
  14511. (I3 ^dir U +)
  14512. Retracting rl*prefer*rvt*predict-no*H0*4
  14513. -->
  14514. (S1 ^operator O2000 = 1.)
  14515. Retracting rl*prefer*rvt*predict-yes*H0*3
  14516. -->
  14517. (S1 ^operator O1999 = 0.)
  14518. =>WM: (14031: S1 ^operator O2002 +)
  14519. =>WM: (14030: S1 ^operator O2001 +)
  14520. =>WM: (14029: I3 ^dir R)
  14521. =>WM: (14028: O2002 ^name predict-no)
  14522. =>WM: (14027: O2001 ^name predict-yes)
  14523. =>WM: (14026: R1004 ^value 1)
  14524. =>WM: (14025: R1 ^reward R1004)
  14525. <=WM: (14016: S1 ^operator O1999 +)
  14526. <=WM: (14017: S1 ^operator O2000 +)
  14527. <=WM: (14018: S1 ^operator O2000)
  14528. <=WM: (14015: I3 ^dir U)
  14529. <=WM: (14011: R1 ^reward R1003)
  14530. <=WM: (14014: O2000 ^name predict-no)
  14531. <=WM: (14013: O1999 ^name predict-yes)
  14532. <=WM: (14012: R1003 ^value 1)
  14533. --- Inner Elaboration Phase, active level 1 (S1) ---
  14534. Firing prefer*rvt*predict-yes*H0
  14535. -->
  14536. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14537. -->
  14538. (S1 ^operator O2001 = 0.7055034804752064)
  14539. Firing rl*prefer*rvt*predict-yes*H0*5
  14540. -->
  14541. (S1 ^operator O2001 = 0.2938705117203769)
  14542. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14543. -->
  14544. Firing prefer*rvt*predict-no*H0
  14545. -->
  14546. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14547. -->
  14548. (S1 ^operator O2002 = -0.2023211881870005)
  14549. Firing rl*prefer*rvt*predict-no*H0*6
  14550. -->
  14551. (S1 ^operator O2002 = 0.229854902707684)
  14552. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14553. -->
  14554. inner elaboration loop at bottom goal.
  14555. Retracting rl*prefer*rvt*predict-no*H0*6
  14556. -->
  14557. (S1 ^operator O2000 = 0.229854902707684)
  14558. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14559. -->
  14560. (S1 ^operator O2000 = -0.2023211881870005)
  14561. Retracting rl*prefer*rvt*predict-yes*H0*5
  14562. -->
  14563. (S1 ^operator O1999 = 0.2938705117203769)
  14564. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14565. -->
  14566. (S1 ^operator O1999 = 0.7055034804752064)
  14567. --- END Proposal Phase ---
  14568. --- Decision Phase ---
  14569. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14570. =>WM: (14032: S1 ^operator O2001)
  14571. 1001: O: O2001 (predict-yes)
  14572. --- END Decision Phase ---
  14573. --- Application Phase ---
  14574. --- Firing Productions (PE) For State At Depth 1 ---
  14575. --- Inner Elaboration Phase, active level 1 (S1) ---
  14576. Firing apply*operator
  14577. -->
  14578. (I3 ^predict-yes N1001 + :O )
  14579. Firing apply*operator*complete
  14580. -->
  14581. (I3 ^predict-no N1000 - :O )
  14582. inner elaboration loop at bottom goal.
  14583. --- Change Working Memory (PE) ---
  14584. =>WM: (14033: I3 ^predict-yes N1001)
  14585. <=WM: (14020: N1000 ^status complete)
  14586. <=WM: (14019: I3 ^predict-no N1000)
  14587. --- Firing Productions (IE) For State At Depth 1 ---
  14588. --- Inner Elaboration Phase, active level 1 (S1) ---
  14589. Firing monitor*world
  14590. -->
  14591. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14592. --- Change Working Memory (IE) ---
  14593. --- END Application Phase ---
  14594. --- Output Phase ---
  14595. ENV: Agent did: predict-yes for direction R in state State-A
  14596. In State-A moving R
  14597. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14598. predict error 0
  14599. dir: dir isL
  14600. --- END Output Phase ---
  14601. ---- Input Phase ---
  14602. =>WM: (14037: I2 ^dir L)
  14603. =>WM: (14036: I2 ^reward 1)
  14604. =>WM: (14035: I2 ^see 1)
  14605. =>WM: (14034: N1001 ^status complete)
  14606. <=WM: (14023: I2 ^dir R)
  14607. <=WM: (14022: I2 ^reward 1)
  14608. <=WM: (14021: I2 ^see 0)
  14609. =>WM: (14038: I2 ^level-1 R1-root)
  14610. <=WM: (14024: I2 ^level-1 L0-root)
  14611. --- END Input Phase ---
  14612. --- Proposal Phase ---
  14613. --- Inner Elaboration Phase, active level 1 (S1) ---
  14614. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14615. -->
  14616. (S1 ^operator O2001 = 0.6196100460529347)
  14617. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14618. -->
  14619. (S1 ^operator O2002 = -0.1479504104026684)
  14620. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14621. -->
  14622. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14623. -->
  14624. Firing elaborate*copy-see-to-output-link
  14625. -->
  14626. (I3 ^see 1 +)
  14627. Firing elaborate*reward*based*on*reward
  14628. -->
  14629. (R1005 ^value 1 +)
  14630. (R1 ^reward R1005 +)
  14631. Firing propose*predict-yes
  14632. -->
  14633. (O2003 ^name predict-yes +)
  14634. (S1 ^operator O2003 +)
  14635. Firing propose*predict-no
  14636. -->
  14637. (O2004 ^name predict-no +)
  14638. (S1 ^operator O2004 +)
  14639. Firing rl*prefer*rvt*predict-no*H0*2
  14640. -->
  14641. (S1 ^operator O2002 = 0.3140093857317092)
  14642. Firing rl*prefer*rvt*predict-yes*H0*1
  14643. -->
  14644. (S1 ^operator O2001 = 0.3804165454412648)
  14645. Firing prefer*rvt*predict-yes*H0
  14646. -->
  14647. Firing prefer*rvt*predict-no*H0
  14648. -->
  14649. Firing elaborate*copy-dir-to-output-link
  14650. -->
  14651. (I3 ^dir L +)
  14652. inner elaboration loop at bottom goal.
  14653. Retracting elaborate*copy-see-to-output-link
  14654. -->
  14655. (I3 ^see 0 +)
  14656. Retracting propose*predict-no
  14657. -->
  14658. (O2002 ^name predict-no +)
  14659. (S1 ^operator O2002 +)
  14660. Retracting propose*predict-yes
  14661. -->
  14662. (O2001 ^name predict-yes +)
  14663. (S1 ^operator O2001 +)
  14664. Retracting elaborate*reward*based*on*reward
  14665. -->
  14666. (R1004 ^value 1 +)
  14667. (R1 ^reward R1004 +)
  14668. Retracting elaborate*copy-dir-to-output-link
  14669. -->
  14670. (I3 ^dir R +)
  14671. Retracting rl*prefer*rvt*predict-no*H0*6
  14672. -->
  14673. (S1 ^operator O2002 = 0.229854902707684)
  14674. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14675. -->
  14676. (S1 ^operator O2002 = -0.2023211881870005)
  14677. Retracting rl*prefer*rvt*predict-yes*H0*5
  14678. -->
  14679. (S1 ^operator O2001 = 0.2938705117203769)
  14680. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14681. -->
  14682. (S1 ^operator O2001 = 0.7055034804752064)
  14683. =>WM: (14046: S1 ^operator O2004 +)
  14684. =>WM: (14045: S1 ^operator O2003 +)
  14685. =>WM: (14044: I3 ^dir L)
  14686. =>WM: (14043: O2004 ^name predict-no)
  14687. =>WM: (14042: O2003 ^name predict-yes)
  14688. =>WM: (14041: R1005 ^value 1)
  14689. =>WM: (14040: R1 ^reward R1005)
  14690. =>WM: (14039: I3 ^see 1)
  14691. <=WM: (14030: S1 ^operator O2001 +)
  14692. <=WM: (14032: S1 ^operator O2001)
  14693. <=WM: (14031: S1 ^operator O2002 +)
  14694. <=WM: (14029: I3 ^dir R)
  14695. <=WM: (14025: R1 ^reward R1004)
  14696. <=WM: (14010: I3 ^see 0)
  14697. <=WM: (14028: O2002 ^name predict-no)
  14698. <=WM: (14027: O2001 ^name predict-yes)
  14699. <=WM: (14026: R1004 ^value 1)
  14700. --- Inner Elaboration Phase, active level 1 (S1) ---
  14701. Firing prefer*rvt*predict-yes*H0
  14702. -->
  14703. Firing rl*prefer*rvt*predict-yes*H0*1
  14704. -->
  14705. (S1 ^operator O2003 = 0.3804165454412648)
  14706. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14707. -->
  14708. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14709. -->
  14710. (S1 ^operator O2003 = 0.6196100460529347)
  14711. Firing prefer*rvt*predict-no*H0
  14712. -->
  14713. Firing rl*prefer*rvt*predict-no*H0*2
  14714. -->
  14715. (S1 ^operator O2004 = 0.3140093857317092)
  14716. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14717. -->
  14718. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14719. -->
  14720. (S1 ^operator O2004 = -0.1479504104026684)
  14721. inner elaboration loop at bottom goal.
  14722. Retracting rl*prefer*rvt*predict-no*H0*2
  14723. -->
  14724. (S1 ^operator O2002 = 0.3140093857317092)
  14725. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14726. -->
  14727. (S1 ^operator O2002 = -0.1479504104026684)
  14728. Retracting rl*prefer*rvt*predict-yes*H0*1
  14729. -->
  14730. (S1 ^operator O2001 = 0.3804165454412648)
  14731. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14732. -->
  14733. (S1 ^operator O2001 = 0.6196100460529347)
  14734. --- END Proposal Phase ---
  14735. --- Decision Phase ---
  14736. RL update rl*prefer*rvt*predict-yes*H0*5 0.500957 -0.207086 0.293871 -> 0.501003 -0.207081 0.293922(R,m,v=1,0.846154,0.131017)
  14737. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498477 0.207026 0.705503 -> 0.498533 0.207032 0.705565(R,m,v=1,1,0)
  14738. =>WM: (14047: S1 ^operator O2003)
  14739. 1002: O: O2003 (predict-yes)
  14740. --- END Decision Phase ---
  14741. --- Application Phase ---
  14742. --- Firing Productions (PE) For State At Depth 1 ---
  14743. --- Inner Elaboration Phase, active level 1 (S1) ---
  14744. Firing apply*operator
  14745. -->
  14746. (I3 ^predict-yes N1002 + :O )
  14747. Firing apply*operator*complete
  14748. -->
  14749. (I3 ^predict-yes N1001 - :O )
  14750. inner elaboration loop at bottom goal.
  14751. --- Change Working Memory (PE) ---
  14752. =>WM: (14048: I3 ^predict-yes N1002)
  14753. <=WM: (14034: N1001 ^status complete)
  14754. <=WM: (14033: I3 ^predict-yes N1001)
  14755. --- Firing Productions (IE) For State At Depth 1 ---
  14756. --- Inner Elaboration Phase, active level 1 (S1) ---
  14757. Firing monitor*world
  14758. -->
  14759. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14760. --- Change Working Memory (IE) ---
  14761. --- END Application Phase ---
  14762. --- Output Phase ---
  14763. ENV: Agent did: predict-yes for direction L in state State-B
  14764. In State-B moving L
  14765. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14766. predict error 0
  14767. dir: dir isL
  14768. --- END Output Phase ---
  14769. /|\--- Input Phase ---
  14770. =>WM: (14052: I2 ^dir L)
  14771. =>WM: (14051: I2 ^reward 1)
  14772. =>WM: (14050: I2 ^see 1)
  14773. =>WM: (14049: N1002 ^status complete)
  14774. <=WM: (14037: I2 ^dir L)
  14775. <=WM: (14036: I2 ^reward 1)
  14776. <=WM: (14035: I2 ^see 1)
  14777. =>WM: (14053: I2 ^level-1 L1-root)
  14778. <=WM: (14038: I2 ^level-1 R1-root)
  14779. --- END Input Phase ---
  14780. --- Proposal Phase ---
  14781. --- Inner Elaboration Phase, active level 1 (S1) ---
  14782. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14783. -->
  14784. (S1 ^operator O2003 = -0.3470159027404986)
  14785. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14786. -->
  14787. (S1 ^operator O2004 = 0.6861287198581429)
  14788. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14789. -->
  14790. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14791. -->
  14792. Firing elaborate*copy-see-to-output-link
  14793. -->
  14794. (I3 ^see 1 +)
  14795. Firing elaborate*reward*based*on*reward
  14796. -->
  14797. (R1006 ^value 1 +)
  14798. (R1 ^reward R1006 +)
  14799. Firing propose*predict-yes
  14800. -->
  14801. (O2005 ^name predict-yes +)
  14802. (S1 ^operator O2005 +)
  14803. Firing propose*predict-no
  14804. -->
  14805. (O2006 ^name predict-no +)
  14806. (S1 ^operator O2006 +)
  14807. Firing rl*prefer*rvt*predict-no*H0*2
  14808. -->
  14809. (S1 ^operator O2004 = 0.3140093857317092)
  14810. Firing rl*prefer*rvt*predict-yes*H0*1
  14811. -->
  14812. (S1 ^operator O2003 = 0.3804165454412648)
  14813. Firing prefer*rvt*predict-yes*H0
  14814. -->
  14815. Firing prefer*rvt*predict-no*H0
  14816. -->
  14817. Firing elaborate*copy-dir-to-output-link
  14818. -->
  14819. (I3 ^dir L +)
  14820. inner elaboration loop at bottom goal.
  14821. Retracting elaborate*copy-see-to-output-link
  14822. -->
  14823. (I3 ^see 1 +)
  14824. Retracting propose*predict-no
  14825. -->
  14826. (O2004 ^name predict-no +)
  14827. (S1 ^operator O2004 +)
  14828. Retracting propose*predict-yes
  14829. -->
  14830. (O2003 ^name predict-yes +)
  14831. (S1 ^operator O2003 +)
  14832. Retracting elaborate*reward*based*on*reward
  14833. -->
  14834. (R1005 ^value 1 +)
  14835. (R1 ^reward R1005 +)
  14836. Retracting elaborate*copy-dir-to-output-link
  14837. -->
  14838. (I3 ^dir L +)
  14839. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14840. -->
  14841. (S1 ^operator O2004 = -0.1479504104026684)
  14842. Retracting rl*prefer*rvt*predict-no*H0*2
  14843. -->
  14844. (S1 ^operator O2004 = 0.3140093857317092)
  14845. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14846. -->
  14847. (S1 ^operator O2003 = 0.6196100460529347)
  14848. Retracting rl*prefer*rvt*predict-yes*H0*1
  14849. -->
  14850. (S1 ^operator O2003 = 0.3804165454412648)
  14851. =>WM: (14059: S1 ^operator O2006 +)
  14852. =>WM: (14058: S1 ^operator O2005 +)
  14853. =>WM: (14057: O2006 ^name predict-no)
  14854. =>WM: (14056: O2005 ^name predict-yes)
  14855. =>WM: (14055: R1006 ^value 1)
  14856. =>WM: (14054: R1 ^reward R1006)
  14857. <=WM: (14045: S1 ^operator O2003 +)
  14858. <=WM: (14047: S1 ^operator O2003)
  14859. <=WM: (14046: S1 ^operator O2004 +)
  14860. <=WM: (14040: R1 ^reward R1005)
  14861. <=WM: (14043: O2004 ^name predict-no)
  14862. <=WM: (14042: O2003 ^name predict-yes)
  14863. <=WM: (14041: R1005 ^value 1)
  14864. --- Inner Elaboration Phase, active level 1 (S1) ---
  14865. Firing prefer*rvt*predict-yes*H0
  14866. -->
  14867. Firing rl*prefer*rvt*predict-yes*H0*1
  14868. -->
  14869. (S1 ^operator O2005 = 0.3804165454412648)
  14870. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14871. -->
  14872. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14873. -->
  14874. (S1 ^operator O2005 = -0.3470159027404986)
  14875. Firing prefer*rvt*predict-no*H0
  14876. -->
  14877. Firing rl*prefer*rvt*predict-no*H0*2
  14878. -->
  14879. (S1 ^operator O2006 = 0.3140093857317092)
  14880. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14881. -->
  14882. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14883. -->
  14884. (S1 ^operator O2006 = 0.6861287198581429)
  14885. inner elaboration loop at bottom goal.
  14886. Retracting rl*prefer*rvt*predict-no*H0*2
  14887. -->
  14888. (S1 ^operator O2004 = 0.3140093857317092)
  14889. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14890. -->
  14891. (S1 ^operator O2004 = 0.6861287198581429)
  14892. Retracting rl*prefer*rvt*predict-yes*H0*1
  14893. -->
  14894. (S1 ^operator O2003 = 0.3804165454412648)
  14895. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14896. -->
  14897. (S1 ^operator O2003 = -0.3470159027404986)
  14898. --- END Proposal Phase ---
  14899. --- Decision Phase ---
  14900. RL update rl*prefer*rvt*predict-yes*H0*1 0.521347 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.831325,0.141073)
  14901. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478682 0.140928 0.61961 -> 0.47868 0.140928 0.619607(R,m,v=1,1,0)
  14902. =>WM: (14060: S1 ^operator O2006)
  14903. 1003: O: O2006 (predict-no)
  14904. --- END Decision Phase ---
  14905. --- Application Phase ---
  14906. --- Firing Productions (PE) For State At Depth 1 ---
  14907. --- Inner Elaboration Phase, active level 1 (S1) ---
  14908. Firing apply*operator
  14909. -->
  14910. (I3 ^predict-no N1003 + :O )
  14911. Firing apply*operator*complete
  14912. -->
  14913. (I3 ^predict-yes N1002 - :O )
  14914. inner elaboration loop at bottom goal.
  14915. --- Change Working Memory (PE) ---
  14916. =>WM: (14061: I3 ^predict-no N1003)
  14917. <=WM: (14049: N1002 ^status complete)
  14918. <=WM: (14048: I3 ^predict-yes N1002)
  14919. --- Firing Productions (IE) For State At Depth 1 ---
  14920. --- Inner Elaboration Phase, active level 1 (S1) ---
  14921. Firing monitor*world
  14922. -->
  14923. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14924. --- Change Working Memory (IE) ---
  14925. --- END Application Phase ---
  14926. --- Output Phase ---
  14927. ENV: Agent did: predict-no for direction L in state State-A
  14928. In State-A moving L
  14929. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14930. predict error 0
  14931. dir: dir isR
  14932. --- END Output Phase ---
  14933. -/--- Input Phase ---
  14934. =>WM: (14065: I2 ^dir R)
  14935. =>WM: (14064: I2 ^reward 1)
  14936. =>WM: (14063: I2 ^see 0)
  14937. =>WM: (14062: N1003 ^status complete)
  14938. <=WM: (14052: I2 ^dir L)
  14939. <=WM: (14051: I2 ^reward 1)
  14940. <=WM: (14050: I2 ^see 1)
  14941. =>WM: (14066: I2 ^level-1 L0-root)
  14942. <=WM: (14053: I2 ^level-1 L1-root)
  14943. --- END Input Phase ---
  14944. --- Proposal Phase ---
  14945. --- Inner Elaboration Phase, active level 1 (S1) ---
  14946. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14947. -->
  14948. (S1 ^operator O2005 = 0.7055651252992311)
  14949. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14950. -->
  14951. (S1 ^operator O2006 = -0.2023211881870005)
  14952. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14953. -->
  14954. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14955. -->
  14956. Firing elaborate*copy-see-to-output-link
  14957. -->
  14958. (I3 ^see 0 +)
  14959. Firing elaborate*reward*based*on*reward
  14960. -->
  14961. (R1007 ^value 1 +)
  14962. (R1 ^reward R1007 +)
  14963. Firing propose*predict-yes
  14964. -->
  14965. (O2007 ^name predict-yes +)
  14966. (S1 ^operator O2007 +)
  14967. Firing propose*predict-no
  14968. -->
  14969. (O2008 ^name predict-no +)
  14970. (S1 ^operator O2008 +)
  14971. Firing rl*prefer*rvt*predict-no*H0*6
  14972. -->
  14973. (S1 ^operator O2006 = 0.229854902707684)
  14974. Firing rl*prefer*rvt*predict-yes*H0*5
  14975. -->
  14976. (S1 ^operator O2005 = 0.2939222491339341)
  14977. Firing prefer*rvt*predict-yes*H0
  14978. -->
  14979. Firing prefer*rvt*predict-no*H0
  14980. -->
  14981. Firing elaborate*copy-dir-to-output-link
  14982. -->
  14983. (I3 ^dir R +)
  14984. inner elaboration loop at bottom goal.
  14985. Retracting elaborate*copy-see-to-output-link
  14986. -->
  14987. (I3 ^see 1 +)
  14988. Retracting propose*predict-no
  14989. -->
  14990. (O2006 ^name predict-no +)
  14991. (S1 ^operator O2006 +)
  14992. Retracting propose*predict-yes
  14993. -->
  14994. (O2005 ^name predict-yes +)
  14995. (S1 ^operator O2005 +)
  14996. Retracting elaborate*reward*based*on*reward
  14997. -->
  14998. (R1006 ^value 1 +)
  14999. (R1 ^reward R1006 +)
  15000. Retracting elaborate*copy-dir-to-output-link
  15001. -->
  15002. (I3 ^dir L +)
  15003. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  15004. -->
  15005. (S1 ^operator O2006 = 0.6861287198581429)
  15006. Retracting rl*prefer*rvt*predict-no*H0*2
  15007. -->
  15008. (S1 ^operator O2006 = 0.3140093857317092)
  15009. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  15010. -->
  15011. (S1 ^operator O2005 = -0.3470159027404986)
  15012. Retracting rl*prefer*rvt*predict-yes*H0*1
  15013. -->
  15014. (S1 ^operator O2005 = 0.380414370085626)
  15015. =>WM: (14074: S1 ^operator O2008 +)
  15016. =>WM: (14073: S1 ^operator O2007 +)
  15017. =>WM: (14072: I3 ^dir R)
  15018. =>WM: (14071: O2008 ^name predict-no)
  15019. =>WM: (14070: O2007 ^name predict-yes)
  15020. =>WM: (14069: R1007 ^value 1)
  15021. =>WM: (14068: R1 ^reward R1007)
  15022. =>WM: (14067: I3 ^see 0)
  15023. <=WM: (14058: S1 ^operator O2005 +)
  15024. <=WM: (14059: S1 ^operator O2006 +)
  15025. <=WM: (14060: S1 ^operator O2006)
  15026. <=WM: (14044: I3 ^dir L)
  15027. <=WM: (14054: R1 ^reward R1006)
  15028. <=WM: (14039: I3 ^see 1)
  15029. <=WM: (14057: O2006 ^name predict-no)
  15030. <=WM: (14056: O2005 ^name predict-yes)
  15031. <=WM: (14055: R1006 ^value 1)
  15032. --- Inner Elaboration Phase, active level 1 (S1) ---
  15033. Firing prefer*rvt*predict-yes*H0
  15034. -->
  15035. Firing rl*prefer*rvt*predict-yes*H0*5
  15036. -->
  15037. (S1 ^operator O2007 = 0.2939222491339341)
  15038. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15039. -->
  15040. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15041. -->
  15042. (S1 ^operator O2007 = 0.7055651252992311)
  15043. Firing prefer*rvt*predict-no*H0
  15044. -->
  15045. Firing rl*prefer*rvt*predict-no*H0*6
  15046. -->
  15047. (S1 ^operator O2008 = 0.229854902707684)
  15048. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15049. -->
  15050. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15051. -->
  15052. (S1 ^operator O2008 = -0.2023211881870005)
  15053. inner elaboration loop at bottom goal.
  15054. Retracting rl*prefer*rvt*predict-no*H0*6
  15055. -->
  15056. (S1 ^operator O2006 = 0.229854902707684)
  15057. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15058. -->
  15059. (S1 ^operator O2006 = -0.2023211881870005)
  15060. Retracting rl*prefer*rvt*predict-yes*H0*5
  15061. -->
  15062. (S1 ^operator O2005 = 0.2939222491339341)
  15063. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15064. -->
  15065. (S1 ^operator O2005 = 0.7055651252992311)
  15066. --- END Proposal Phase ---
  15067. --- Decision Phase ---
  15068. RL update rl*prefer*rvt*predict-no*H0*2 0.485022 -0.171012 0.314009 -> 0.485013 -0.171015 0.313998(R,m,v=1,0.861842,0.119859)
  15069. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515087 0.171042 0.686129 -> 0.515077 0.171039 0.686115(R,m,v=1,1,0)
  15070. =>WM: (14075: S1 ^operator O2007)
  15071. 1004: O: O2007 (predict-yes)
  15072. --- END Decision Phase ---
  15073. --- Application Phase ---
  15074. --- Firing Productions (PE) For State At Depth 1 ---
  15075. --- Inner Elaboration Phase, active level 1 (S1) ---
  15076. Firing apply*operator
  15077. -->
  15078. (I3 ^predict-yes N1004 + :O )
  15079. Firing apply*operator*complete
  15080. -->
  15081. (I3 ^predict-no N1003 - :O )
  15082. inner elaboration loop at bottom goal.
  15083. --- Change Working Memory (PE) ---
  15084. =>WM: (14076: I3 ^predict-yes N1004)
  15085. <=WM: (14062: N1003 ^status complete)
  15086. <=WM: (14061: I3 ^predict-no N1003)
  15087. --- Firing Productions (IE) For State At Depth 1 ---
  15088. --- Inner Elaboration Phase, active level 1 (S1) ---
  15089. Firing monitor*world
  15090. -->
  15091. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15092. --- Change Working Memory (IE) ---
  15093. --- END Application Phase ---
  15094. --- Output Phase ---
  15095. ENV: Agent did: predict-yes for direction R in state State-A
  15096. In State-A moving R
  15097. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15098. predict error 0
  15099. dir: dir isR
  15100. --- END Output Phase ---
  15101. |\---- Input Phase ---
  15102. =>WM: (14080: I2 ^dir R)
  15103. =>WM: (14079: I2 ^reward 1)
  15104. =>WM: (14078: I2 ^see 1)
  15105. =>WM: (14077: N1004 ^status complete)
  15106. <=WM: (14065: I2 ^dir R)
  15107. <=WM: (14064: I2 ^reward 1)
  15108. <=WM: (14063: I2 ^see 0)
  15109. =>WM: (14081: I2 ^level-1 R1-root)
  15110. <=WM: (14066: I2 ^level-1 L0-root)
  15111. --- END Input Phase ---
  15112. --- Proposal Phase ---
  15113. --- Inner Elaboration Phase, active level 1 (S1) ---
  15114. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15115. -->
  15116. (S1 ^operator O2007 = -0.252585164213872)
  15117. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15118. -->
  15119. (S1 ^operator O2008 = 0.7701760437619466)
  15120. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15121. -->
  15122. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15123. -->
  15124. Firing elaborate*copy-see-to-output-link
  15125. -->
  15126. (I3 ^see 1 +)
  15127. Firing elaborate*reward*based*on*reward
  15128. -->
  15129. (R1008 ^value 1 +)
  15130. (R1 ^reward R1008 +)
  15131. Firing propose*predict-yes
  15132. -->
  15133. (O2009 ^name predict-yes +)
  15134. (S1 ^operator O2009 +)
  15135. Firing propose*predict-no
  15136. -->
  15137. (O2010 ^name predict-no +)
  15138. (S1 ^operator O2010 +)
  15139. Firing rl*prefer*rvt*predict-no*H0*6
  15140. -->
  15141. (S1 ^operator O2008 = 0.229854902707684)
  15142. Firing rl*prefer*rvt*predict-yes*H0*5
  15143. -->
  15144. (S1 ^operator O2007 = 0.2939222491339341)
  15145. Firing prefer*rvt*predict-yes*H0
  15146. -->
  15147. Firing prefer*rvt*predict-no*H0
  15148. -->
  15149. Firing elaborate*copy-dir-to-output-link
  15150. -->
  15151. (I3 ^dir R +)
  15152. inner elaboration loop at bottom goal.
  15153. Retracting elaborate*copy-see-to-output-link
  15154. -->
  15155. (I3 ^see 0 +)
  15156. Retracting propose*predict-no
  15157. -->
  15158. (O2008 ^name predict-no +)
  15159. (S1 ^operator O2008 +)
  15160. Retracting propose*predict-yes
  15161. -->
  15162. (O2007 ^name predict-yes +)
  15163. (S1 ^operator O2007 +)
  15164. Retracting elaborate*reward*based*on*reward
  15165. -->
  15166. (R1007 ^value 1 +)
  15167. (R1 ^reward R1007 +)
  15168. Retracting elaborate*copy-dir-to-output-link
  15169. -->
  15170. (I3 ^dir R +)
  15171. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15172. -->
  15173. (S1 ^operator O2008 = -0.2023211881870005)
  15174. Retracting rl*prefer*rvt*predict-no*H0*6
  15175. -->
  15176. (S1 ^operator O2008 = 0.229854902707684)
  15177. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15178. -->
  15179. (S1 ^operator O2007 = 0.7055651252992311)
  15180. Retracting rl*prefer*rvt*predict-yes*H0*5
  15181. -->
  15182. (S1 ^operator O2007 = 0.2939222491339341)
  15183. =>WM: (14088: S1 ^operator O2010 +)
  15184. =>WM: (14087: S1 ^operator O2009 +)
  15185. =>WM: (14086: O2010 ^name predict-no)
  15186. =>WM: (14085: O2009 ^name predict-yes)
  15187. =>WM: (14084: R1008 ^value 1)
  15188. =>WM: (14083: R1 ^reward R1008)
  15189. =>WM: (14082: I3 ^see 1)
  15190. <=WM: (14073: S1 ^operator O2007 +)
  15191. <=WM: (14075: S1 ^operator O2007)
  15192. <=WM: (14074: S1 ^operator O2008 +)
  15193. <=WM: (14068: R1 ^reward R1007)
  15194. <=WM: (14067: I3 ^see 0)
  15195. <=WM: (14071: O2008 ^name predict-no)
  15196. <=WM: (14070: O2007 ^name predict-yes)
  15197. <=WM: (14069: R1007 ^value 1)
  15198. --- Inner Elaboration Phase, active level 1 (S1) ---
  15199. Firing prefer*rvt*predict-yes*H0
  15200. -->
  15201. Firing rl*prefer*rvt*predict-yes*H0*5
  15202. -->
  15203. (S1 ^operator O2009 = 0.2939222491339341)
  15204. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15205. -->
  15206. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15207. -->
  15208. (S1 ^operator O2009 = -0.252585164213872)
  15209. Firing prefer*rvt*predict-no*H0
  15210. -->
  15211. Firing rl*prefer*rvt*predict-no*H0*6
  15212. -->
  15213. (S1 ^operator O2010 = 0.229854902707684)
  15214. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15215. -->
  15216. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15217. -->
  15218. (S1 ^operator O2010 = 0.7701760437619466)
  15219. inner elaboration loop at bottom goal.
  15220. Retracting rl*prefer*rvt*predict-no*H0*6
  15221. -->
  15222. (S1 ^operator O2008 = 0.229854902707684)
  15223. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15224. -->
  15225. (S1 ^operator O2008 = 0.7701760437619466)
  15226. Retracting rl*prefer*rvt*predict-yes*H0*5
  15227. -->
  15228. (S1 ^operator O2007 = 0.2939222491339341)
  15229. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15230. -->
  15231. (S1 ^operator O2007 = -0.252585164213872)
  15232. --- END Proposal Phase ---
  15233. --- Decision Phase ---
  15234. RL update rl*prefer*rvt*predict-yes*H0*5 0.501003 -0.207081 0.293922 -> 0.501042 -0.207077 0.293965(R,m,v=1,0.847134,0.130328)
  15235. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498533 0.207032 0.705565 -> 0.498578 0.207037 0.705615(R,m,v=1,1,0)
  15236. =>WM: (14089: S1 ^operator O2010)
  15237. 1005: O: O2010 (predict-no)
  15238. --- END Decision Phase ---
  15239. --- Application Phase ---
  15240. --- Firing Productions (PE) For State At Depth 1 ---
  15241. --- Inner Elaboration Phase, active level 1 (S1) ---
  15242. Firing apply*operator
  15243. -->
  15244. (I3 ^predict-no N1005 + :O )
  15245. Firing apply*operator*complete
  15246. -->
  15247. (I3 ^predict-yes N1004 - :O )
  15248. inner elaboration loop at bottom goal.
  15249. --- Change Working Memory (PE) ---
  15250. =>WM: (14090: I3 ^predict-no N1005)
  15251. <=WM: (14077: N1004 ^status complete)
  15252. <=WM: (14076: I3 ^predict-yes N1004)
  15253. --- Firing Productions (IE) For State At Depth 1 ---
  15254. --- Inner Elaboration Phase, active level 1 (S1) ---
  15255. Firing monitor*world
  15256. -->
  15257. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15258. --- Change Working Memory (IE) ---
  15259. --- END Application Phase ---
  15260. --- Output Phase ---
  15261. ENV: Agent did: predict-no for direction R in state State-B
  15262. In State-B moving R
  15263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15264. predict error 0
  15265. dir: dir isU
  15266. --- END Output Phase ---
  15267. /|--- Input Phase ---
  15268. =>WM: (14094: I2 ^dir U)
  15269. =>WM: (14093: I2 ^reward 1)
  15270. =>WM: (14092: I2 ^see 0)
  15271. =>WM: (14091: N1005 ^status complete)
  15272. <=WM: (14080: I2 ^dir R)
  15273. <=WM: (14079: I2 ^reward 1)
  15274. <=WM: (14078: I2 ^see 1)
  15275. =>WM: (14095: I2 ^level-1 R0-root)
  15276. <=WM: (14081: I2 ^level-1 R1-root)
  15277. --- END Input Phase ---
  15278. --- Proposal Phase ---
  15279. --- Inner Elaboration Phase, active level 1 (S1) ---
  15280. Firing elaborate*copy-see-to-output-link
  15281. -->
  15282. (I3 ^see 0 +)
  15283. Firing elaborate*reward*based*on*reward
  15284. -->
  15285. (R1009 ^value 1 +)
  15286. (R1 ^reward R1009 +)
  15287. Firing propose*predict-yes
  15288. -->
  15289. (O2011 ^name predict-yes +)
  15290. (S1 ^operator O2011 +)
  15291. Firing propose*predict-no
  15292. -->
  15293. (O2012 ^name predict-no +)
  15294. (S1 ^operator O2012 +)
  15295. Firing rl*prefer*rvt*predict-no*H0*4
  15296. -->
  15297. (S1 ^operator O2010 = 1.)
  15298. Firing rl*prefer*rvt*predict-yes*H0*3
  15299. -->
  15300. (S1 ^operator O2009 = 0.)
  15301. Firing prefer*rvt*predict-yes*H0
  15302. -->
  15303. Firing prefer*rvt*predict-no*H0
  15304. -->
  15305. Firing elaborate*copy-dir-to-output-link
  15306. -->
  15307. (I3 ^dir U +)
  15308. inner elaboration loop at bottom goal.
  15309. Retracting elaborate*copy-see-to-output-link
  15310. -->
  15311. (I3 ^see 1 +)
  15312. Retracting propose*predict-no
  15313. -->
  15314. (O2010 ^name predict-no +)
  15315. (S1 ^operator O2010 +)
  15316. Retracting propose*predict-yes
  15317. -->
  15318. (O2009 ^name predict-yes +)
  15319. (S1 ^operator O2009 +)
  15320. Retracting elaborate*reward*based*on*reward
  15321. -->
  15322. (R1008 ^value 1 +)
  15323. (R1 ^reward R1008 +)
  15324. Retracting elaborate*copy-dir-to-output-link
  15325. -->
  15326. (I3 ^dir R +)
  15327. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15328. -->
  15329. (S1 ^operator O2010 = 0.7701760437619466)
  15330. Retracting rl*prefer*rvt*predict-no*H0*6
  15331. -->
  15332. (S1 ^operator O2010 = 0.229854902707684)
  15333. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15334. -->
  15335. (S1 ^operator O2009 = -0.252585164213872)
  15336. Retracting rl*prefer*rvt*predict-yes*H0*5
  15337. -->
  15338. (S1 ^operator O2009 = 0.2939645711914686)
  15339. =>WM: (14103: S1 ^operator O2012 +)
  15340. =>WM: (14102: S1 ^operator O2011 +)
  15341. =>WM: (14101: I3 ^dir U)
  15342. =>WM: (14100: O2012 ^name predict-no)
  15343. =>WM: (14099: O2011 ^name predict-yes)
  15344. =>WM: (14098: R1009 ^value 1)
  15345. =>WM: (14097: R1 ^reward R1009)
  15346. =>WM: (14096: I3 ^see 0)
  15347. <=WM: (14087: S1 ^operator O2009 +)
  15348. <=WM: (14088: S1 ^operator O2010 +)
  15349. <=WM: (14089: S1 ^operator O2010)
  15350. <=WM: (14072: I3 ^dir R)
  15351. <=WM: (14083: R1 ^reward R1008)
  15352. <=WM: (14082: I3 ^see 1)
  15353. <=WM: (14086: O2010 ^name predict-no)
  15354. <=WM: (14085: O2009 ^name predict-yes)
  15355. <=WM: (14084: R1008 ^value 1)
  15356. --- Inner Elaboration Phase, active level 1 (S1) ---
  15357. Firing prefer*rvt*predict-yes*H0
  15358. -->
  15359. Firing rl*prefer*rvt*predict-yes*H0*3
  15360. -->
  15361. (S1 ^operator O2011 = 0.)
  15362. Firing prefer*rvt*predict-no*H0
  15363. -->
  15364. Firing rl*prefer*rvt*predict-no*H0*4
  15365. -->
  15366. (S1 ^operator O2012 = 1.)
  15367. inner elaboration loop at bottom goal.
  15368. Retracting rl*prefer*rvt*predict-no*H0*4
  15369. -->
  15370. (S1 ^operator O2010 = 1.)
  15371. Retracting rl*prefer*rvt*predict-yes*H0*3
  15372. -->
  15373. (S1 ^operator O2009 = 0.)
  15374. --- END Proposal Phase ---
  15375. --- Decision Phase ---
  15376. RL update rl*prefer*rvt*predict-no*H0*6 0.611908 -0.382053 0.229855 -> 0.611906 -0.382053 0.229852(R,m,v=1,0.846591,0.130617)
  15377. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388117 0.382059 0.770176 -> 0.388115 0.382058 0.770173(R,m,v=1,1,0)
  15378. =>WM: (14104: S1 ^operator O2012)
  15379. 1006: O: O2012 (predict-no)
  15380. --- END Decision Phase ---
  15381. --- Application Phase ---
  15382. --- Firing Productions (PE) For State At Depth 1 ---
  15383. --- Inner Elaboration Phase, active level 1 (S1) ---
  15384. Firing apply*operator
  15385. -->
  15386. (I3 ^predict-no N1006 + :O )
  15387. Firing apply*operator*complete
  15388. -->
  15389. (I3 ^predict-no N1005 - :O )
  15390. inner elaboration loop at bottom goal.
  15391. --- Change Working Memory (PE) ---
  15392. =>WM: (14105: I3 ^predict-no N1006)
  15393. <=WM: (14091: N1005 ^status complete)
  15394. <=WM: (14090: I3 ^predict-no N1005)
  15395. --- Firing Productions (IE) For State At Depth 1 ---
  15396. --- Inner Elaboration Phase, active level 1 (S1) ---
  15397. Firing monitor*world
  15398. -->
  15399. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15400. --- Change Working Memory (IE) ---
  15401. --- END Application Phase ---
  15402. --- Output Phase ---
  15403. ENV: Agent did: predict-no for direction U in state State-B
  15404. In State-B moving U
  15405. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15406. predict error 0
  15407. dir: dir isR
  15408. --- END Output Phase ---
  15409. \-/--- Input Phase ---
  15410. =>WM: (14109: I2 ^dir R)
  15411. =>WM: (14108: I2 ^reward 1)
  15412. =>WM: (14107: I2 ^see 0)
  15413. =>WM: (14106: N1006 ^status complete)
  15414. <=WM: (14094: I2 ^dir U)
  15415. <=WM: (14093: I2 ^reward 1)
  15416. <=WM: (14092: I2 ^see 0)
  15417. =>WM: (14110: I2 ^level-1 R0-root)
  15418. <=WM: (14095: I2 ^level-1 R0-root)
  15419. --- END Input Phase ---
  15420. --- Proposal Phase ---
  15421. --- Inner Elaboration Phase, active level 1 (S1) ---
  15422. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15423. -->
  15424. (S1 ^operator O2011 = -0.1254042659579056)
  15425. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15426. -->
  15427. (S1 ^operator O2012 = 0.7700907188039023)
  15428. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15429. -->
  15430. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15431. -->
  15432. Firing elaborate*copy-see-to-output-link
  15433. -->
  15434. (I3 ^see 0 +)
  15435. Firing elaborate*reward*based*on*reward
  15436. -->
  15437. (R1010 ^value 1 +)
  15438. (R1 ^reward R1010 +)
  15439. Firing propose*predict-yes
  15440. -->
  15441. (O2013 ^name predict-yes +)
  15442. (S1 ^operator O2013 +)
  15443. Firing propose*predict-no
  15444. -->
  15445. (O2014 ^name predict-no +)
  15446. (S1 ^operator O2014 +)
  15447. Firing rl*prefer*rvt*predict-no*H0*6
  15448. -->
  15449. (S1 ^operator O2012 = 0.2298523950867538)
  15450. Firing rl*prefer*rvt*predict-yes*H0*5
  15451. -->
  15452. (S1 ^operator O2011 = 0.2939645711914686)
  15453. Firing prefer*rvt*predict-yes*H0
  15454. -->
  15455. Firing prefer*rvt*predict-no*H0
  15456. -->
  15457. Firing elaborate*copy-dir-to-output-link
  15458. -->
  15459. (I3 ^dir R +)
  15460. inner elaboration loop at bottom goal.
  15461. Retracting elaborate*copy-see-to-output-link
  15462. -->
  15463. (I3 ^see 0 +)
  15464. Retracting propose*predict-no
  15465. -->
  15466. (O2012 ^name predict-no +)
  15467. (S1 ^operator O2012 +)
  15468. Retracting propose*predict-yes
  15469. -->
  15470. (O2011 ^name predict-yes +)
  15471. (S1 ^operator O2011 +)
  15472. Retracting elaborate*reward*based*on*reward
  15473. -->
  15474. (R1009 ^value 1 +)
  15475. (R1 ^reward R1009 +)
  15476. Retracting elaborate*copy-dir-to-output-link
  15477. -->
  15478. (I3 ^dir U +)
  15479. Retracting rl*prefer*rvt*predict-no*H0*4
  15480. -->
  15481. (S1 ^operator O2012 = 1.)
  15482. Retracting rl*prefer*rvt*predict-yes*H0*3
  15483. -->
  15484. (S1 ^operator O2011 = 0.)
  15485. =>WM: (14117: S1 ^operator O2014 +)
  15486. =>WM: (14116: S1 ^operator O2013 +)
  15487. =>WM: (14115: I3 ^dir R)
  15488. =>WM: (14114: O2014 ^name predict-no)
  15489. =>WM: (14113: O2013 ^name predict-yes)
  15490. =>WM: (14112: R1010 ^value 1)
  15491. =>WM: (14111: R1 ^reward R1010)
  15492. <=WM: (14102: S1 ^operator O2011 +)
  15493. <=WM: (14103: S1 ^operator O2012 +)
  15494. <=WM: (14104: S1 ^operator O2012)
  15495. <=WM: (14101: I3 ^dir U)
  15496. <=WM: (14097: R1 ^reward R1009)
  15497. <=WM: (14100: O2012 ^name predict-no)
  15498. <=WM: (14099: O2011 ^name predict-yes)
  15499. <=WM: (14098: R1009 ^value 1)
  15500. --- Inner Elaboration Phase, active level 1 (S1) ---
  15501. Firing prefer*rvt*predict-yes*H0
  15502. -->
  15503. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15504. -->
  15505. (S1 ^operator O2013 = -0.1254042659579056)
  15506. Firing rl*prefer*rvt*predict-yes*H0*5
  15507. -->
  15508. (S1 ^operator O2013 = 0.2939645711914686)
  15509. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15510. -->
  15511. Firing prefer*rvt*predict-no*H0
  15512. -->
  15513. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15514. -->
  15515. (S1 ^operator O2014 = 0.7700907188039023)
  15516. Firing rl*prefer*rvt*predict-no*H0*6
  15517. -->
  15518. (S1 ^operator O2014 = 0.2298523950867538)
  15519. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15520. -->
  15521. inner elaboration loop at bottom goal.
  15522. Retracting rl*prefer*rvt*predict-no*H0*6
  15523. -->
  15524. (S1 ^operator O2012 = 0.2298523950867538)
  15525. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15526. -->
  15527. (S1 ^operator O2012 = 0.7700907188039023)
  15528. Retracting rl*prefer*rvt*predict-yes*H0*5
  15529. -->
  15530. (S1 ^operator O2011 = 0.2939645711914686)
  15531. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15532. -->
  15533. (S1 ^operator O2011 = -0.1254042659579056)
  15534. --- END Proposal Phase ---
  15535. --- Decision Phase ---
  15536. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15537. =>WM: (14118: S1 ^operator O2014)
  15538. 1007: O: O2014 (predict-no)
  15539. --- END Decision Phase ---
  15540. --- Application Phase ---
  15541. --- Firing Productions (PE) For State At Depth 1 ---
  15542. --- Inner Elaboration Phase, active level 1 (S1) ---
  15543. Firing apply*operator
  15544. -->
  15545. (I3 ^predict-no N1007 + :O )
  15546. Firing apply*operator*complete
  15547. -->
  15548. (I3 ^predict-no N1006 - :O )
  15549. inner elaboration loop at bottom goal.
  15550. --- Change Working Memory (PE) ---
  15551. =>WM: (14119: I3 ^predict-no N1007)
  15552. <=WM: (14106: N1006 ^status complete)
  15553. <=WM: (14105: I3 ^predict-no N1006)
  15554. --- Firing Productions (IE) For State At Depth 1 ---
  15555. --- Inner Elaboration Phase, active level 1 (S1) ---
  15556. Firing monitor*world
  15557. -->
  15558. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15559. --- Change Working Memory (IE) ---
  15560. --- END Application Phase ---
  15561. --- Output Phase ---
  15562. ENV: Agent did: predict-no for direction R in state State-B
  15563. In State-B moving R
  15564. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15565. predict error 0
  15566. dir: dir isR
  15567. --- END Output Phase ---
  15568. |\---- Input Phase ---
  15569. =>WM: (14123: I2 ^dir R)
  15570. =>WM: (14122: I2 ^reward 1)
  15571. =>WM: (14121: I2 ^see 0)
  15572. =>WM: (14120: N1007 ^status complete)
  15573. <=WM: (14109: I2 ^dir R)
  15574. <=WM: (14108: I2 ^reward 1)
  15575. <=WM: (14107: I2 ^see 0)
  15576. =>WM: (14124: I2 ^level-1 R0-root)
  15577. <=WM: (14110: I2 ^level-1 R0-root)
  15578. --- END Input Phase ---
  15579. --- Proposal Phase ---
  15580. --- Inner Elaboration Phase, active level 1 (S1) ---
  15581. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15582. -->
  15583. (S1 ^operator O2013 = -0.1254042659579056)
  15584. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15585. -->
  15586. (S1 ^operator O2014 = 0.7700907188039023)
  15587. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15588. -->
  15589. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15590. -->
  15591. Firing elaborate*copy-see-to-output-link
  15592. -->
  15593. (I3 ^see 0 +)
  15594. Firing elaborate*reward*based*on*reward
  15595. -->
  15596. (R1011 ^value 1 +)
  15597. (R1 ^reward R1011 +)
  15598. Firing propose*predict-yes
  15599. -->
  15600. (O2015 ^name predict-yes +)
  15601. (S1 ^operator O2015 +)
  15602. Firing propose*predict-no
  15603. -->
  15604. (O2016 ^name predict-no +)
  15605. (S1 ^operator O2016 +)
  15606. Firing rl*prefer*rvt*predict-no*H0*6
  15607. -->
  15608. (S1 ^operator O2014 = 0.2298523950867538)
  15609. Firing rl*prefer*rvt*predict-yes*H0*5
  15610. -->
  15611. (S1 ^operator O2013 = 0.2939645711914686)
  15612. Firing prefer*rvt*predict-yes*H0
  15613. -->
  15614. Firing prefer*rvt*predict-no*H0
  15615. -->
  15616. Firing elaborate*copy-dir-to-output-link
  15617. -->
  15618. (I3 ^dir R +)
  15619. inner elaboration loop at bottom goal.
  15620. Retracting elaborate*copy-see-to-output-link
  15621. -->
  15622. (I3 ^see 0 +)
  15623. Retracting propose*predict-no
  15624. -->
  15625. (O2014 ^name predict-no +)
  15626. (S1 ^operator O2014 +)
  15627. Retracting propose*predict-yes
  15628. -->
  15629. (O2013 ^name predict-yes +)
  15630. (S1 ^operator O2013 +)
  15631. Retracting elaborate*reward*based*on*reward
  15632. -->
  15633. (R1010 ^value 1 +)
  15634. (R1 ^reward R1010 +)
  15635. Retracting elaborate*copy-dir-to-output-link
  15636. -->
  15637. (I3 ^dir R +)
  15638. Retracting rl*prefer*rvt*predict-no*H0*6
  15639. -->
  15640. (S1 ^operator O2014 = 0.2298523950867538)
  15641. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15642. -->
  15643. (S1 ^operator O2014 = 0.7700907188039023)
  15644. Retracting rl*prefer*rvt*predict-yes*H0*5
  15645. -->
  15646. (S1 ^operator O2013 = 0.2939645711914686)
  15647. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15648. -->
  15649. (S1 ^operator O2013 = -0.1254042659579056)
  15650. =>WM: (14130: S1 ^operator O2016 +)
  15651. =>WM: (14129: S1 ^operator O2015 +)
  15652. =>WM: (14128: O2016 ^name predict-no)
  15653. =>WM: (14127: O2015 ^name predict-yes)
  15654. =>WM: (14126: R1011 ^value 1)
  15655. =>WM: (14125: R1 ^reward R1011)
  15656. <=WM: (14116: S1 ^operator O2013 +)
  15657. <=WM: (14117: S1 ^operator O2014 +)
  15658. <=WM: (14118: S1 ^operator O2014)
  15659. <=WM: (14111: R1 ^reward R1010)
  15660. <=WM: (14114: O2014 ^name predict-no)
  15661. <=WM: (14113: O2013 ^name predict-yes)
  15662. <=WM: (14112: R1010 ^value 1)
  15663. --- Inner Elaboration Phase, active level 1 (S1) ---
  15664. Firing prefer*rvt*predict-yes*H0
  15665. -->
  15666. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15667. -->
  15668. (S1 ^operator O2015 = -0.1254042659579056)
  15669. Firing rl*prefer*rvt*predict-yes*H0*5
  15670. -->
  15671. (S1 ^operator O2015 = 0.2939645711914686)
  15672. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15673. -->
  15674. Firing prefer*rvt*predict-no*H0
  15675. -->
  15676. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15677. -->
  15678. (S1 ^operator O2016 = 0.7700907188039023)
  15679. Firing rl*prefer*rvt*predict-no*H0*6
  15680. -->
  15681. (S1 ^operator O2016 = 0.2298523950867538)
  15682. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15683. -->
  15684. inner elaboration loop at bottom goal.
  15685. Retracting rl*prefer*rvt*predict-no*H0*6
  15686. -->
  15687. (S1 ^operator O2014 = 0.2298523950867538)
  15688. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15689. -->
  15690. (S1 ^operator O2014 = 0.7700907188039023)
  15691. Retracting rl*prefer*rvt*predict-yes*H0*5
  15692. -->
  15693. (S1 ^operator O2013 = 0.2939645711914686)
  15694. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15695. -->
  15696. (S1 ^operator O2013 = -0.1254042659579056)
  15697. --- END Proposal Phase ---
  15698. --- Decision Phase ---
  15699. RL update rl*prefer*rvt*predict-no*H0*6 0.611906 -0.382053 0.229852 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.847458,0.130008)
  15700. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388048 0.382043 0.770091 -> 0.388052 0.382044 0.770096(R,m,v=1,1,0)
  15701. =>WM: (14131: S1 ^operator O2016)
  15702. 1008: O: O2016 (predict-no)
  15703. --- END Decision Phase ---
  15704. --- Application Phase ---
  15705. --- Firing Productions (PE) For State At Depth 1 ---