/flipv2/20121112-100543-2.5K-ReLST-Wallace/stdout-flip-2.5K_1.txt

https://bitbucket.org/evan13579b/soar-ziggurat · Plain Text · 34709 lines · 32683 code · 2026 blank · 0 comment · 0 complexity · 65c053b700960c62ea6c2bac5dde26ac MD5 · raw file

  1. Seeding... 1
  2. dir: dir isL
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 1 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_1.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\sleeping...
  20. -/|\-/1: O: O1 (predict-yes)
  21. I see 0 and I'm going to do: predict-yes
  22. ENV: Agent did: predict-yes for direction L in state State-A
  23. In State-A moving L
  24. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  25. predict error 1
  26. dir: dir isU
  27. rule alias: '*'
  28. rule alias: '*'
  29. |\-/|\-/2: O: O4 (predict-no)
  30. I see 0 and I'm going to do: predict-no
  31. ENV: Agent did: predict-no for direction U in state State-A
  32. In State-A moving U
  33. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  34. predict error 0
  35. dir: dir isU
  36. |\-3: O: O5 (predict-yes)
  37. I see 1 and I'm going to do: predict-yes
  38. ENV: Agent did: predict-yes for direction U in state State-A
  39. In State-A moving U
  40. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  41. predict error 1
  42. dir: dir isL
  43. /4: O: O7 (predict-yes)
  44. I see 0 and I'm going to do: predict-yes
  45. ENV: Agent did: predict-yes for direction L in state State-A
  46. In State-A moving L
  47. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  48. predict error 1
  49. dir: dir isR
  50. |\-5: O: O10 (predict-no)
  51. I see 0 and I'm going to do: predict-no
  52. ENV: Agent did: predict-no for direction R in state State-A
  53. In State-A moving R
  54. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  55. predict error 1
  56. dir: dir isR
  57. /|6: O: O11 (predict-yes)
  58. I see 0 and I'm going to do: predict-yes
  59. ENV: Agent did: predict-yes for direction R in state State-B
  60. In State-B moving R
  61. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  62. predict error 1
  63. dir: dir isR
  64. \-/|7: O: O13 (predict-yes)
  65. I see 0 and I'm going to do: predict-yes
  66. ENV: Agent did: predict-yes for direction R in state State-B
  67. In State-B moving R
  68. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  69. predict error 1
  70. dir: dir isU
  71. \-/|8: O: O16 (predict-no)
  72. I see 0 and I'm going to do: predict-no
  73. ENV: Agent did: predict-no for direction U in state State-B
  74. In State-B moving U
  75. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  76. predict error 0
  77. dir: dir isL
  78. \-9: O: O18 (predict-no)
  79. I see 1 and I'm going to do: predict-no
  80. ENV: Agent did: predict-no for direction L in state State-B
  81. In State-B moving L
  82. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  83. predict error 1
  84. dir: dir isL
  85. /|\10: O: O20 (predict-no)
  86. I see 0 and I'm going to do: predict-no
  87. ENV: Agent did: predict-no for direction L in state State-A
  88. In State-A moving L
  89. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  90. predict error 0
  91. dir: dir isU
  92. -/|11: O: O22 (predict-no)
  93. I see 1 and I'm going to do: predict-no
  94. ENV: Agent did: predict-no for direction U in state State-A
  95. In State-A moving U
  96. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  97. predict error 0
  98. dir: dir isR
  99. rule alias: '*'
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. \12: O: O23 (predict-yes)
  104. I see 1 and I'm going to do: predict-yes
  105. ENV: Agent did: predict-yes for direction R in state State-A
  106. In State-A moving R
  107. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  108. predict error 0
  109. dir: dir isU
  110. -/|13: O: O26 (predict-no)
  111. I see 1 and I'm going to do: predict-no
  112. ENV: Agent did: predict-no for direction U in state State-B
  113. In State-B moving U
  114. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  115. predict error 0
  116. dir: dir isL
  117. \-14: O: O28 (predict-no)
  118. I see 1 and I'm going to do: predict-no
  119. ENV: Agent did: predict-no for direction L in state State-B
  120. In State-B moving L
  121. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  122. predict error 1
  123. dir: dir isR
  124. /|15: O: O30 (predict-no)
  125. I see 0 and I'm going to do: predict-no
  126. ENV: Agent did: predict-no for direction R in state State-A
  127. In State-A moving R
  128. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  129. predict error 1
  130. dir: dir isU
  131. \-/16: O: O32 (predict-no)
  132. I see 0 and I'm going to do: predict-no
  133. ENV: Agent did: predict-no for direction U in state State-B
  134. In State-B moving U
  135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  136. predict error 0
  137. dir: dir isL
  138. |\-17: O: O33 (predict-yes)
  139. I see 1 and I'm going to do: predict-yes
  140. ENV: Agent did: predict-yes for direction L in state State-B
  141. In State-B moving L
  142. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  143. predict error 0
  144. dir: dir isU
  145. /|18: O: O36 (predict-no)
  146. I see 1 and I'm going to do: predict-no
  147. ENV: Agent did: predict-no for direction U in state State-A
  148. In State-A moving U
  149. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  150. predict error 0
  151. dir: dir isU
  152. \-/19: O: O38 (predict-no)
  153. I see 1 and I'm going to do: predict-no
  154. ENV: Agent did: predict-no for direction U in state State-A
  155. In State-A moving U
  156. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  157. predict error 0
  158. dir: dir isL
  159. |\-20: O: O39 (predict-yes)
  160. I see 1 and I'm going to do: predict-yes
  161. ENV: Agent did: predict-yes for direction L in state State-A
  162. In State-A moving L
  163. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  164. predict error 1
  165. dir: dir isL
  166. /|\21: O: O41 (predict-yes)
  167. I see 0 and I'm going to do: predict-yes
  168. ENV: Agent did: predict-yes for direction L in state State-A
  169. In State-A moving L
  170. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  171. predict error 1
  172. dir: dir isR
  173. -22: O: O43 (predict-yes)
  174. I see 0 and I'm going to do: predict-yes
  175. ENV: Agent did: predict-yes for direction R in state State-A
  176. In State-A moving R
  177. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  178. predict error 0
  179. dir: dir isU
  180. /|23: O: O46 (predict-no)
  181. I see 1 and I'm going to do: predict-no
  182. ENV: Agent did: predict-no for direction U in state State-B
  183. In State-B moving U
  184. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  185. predict error 0
  186. dir: dir isR
  187. \-/24: O: O47 (predict-yes)
  188. I see 1 and I'm going to do: predict-yes
  189. ENV: Agent did: predict-yes for direction R in state State-B
  190. In State-B moving R
  191. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  192. predict error 1
  193. dir: dir isL
  194. |\25: O: O50 (predict-no)
  195. I see 0 and I'm going to do: predict-no
  196. ENV: Agent did: predict-no for direction L in state State-B
  197. In State-B moving L
  198. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  199. predict error 1
  200. dir: dir isR
  201. -/|26: O: O52 (predict-no)
  202. I see 0 and I'm going to do: predict-no
  203. ENV: Agent did: predict-no for direction R in state State-A
  204. In State-A moving R
  205. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  206. predict error 1
  207. dir: dir isL
  208. \-27: O: O54 (predict-no)
  209. I see 0 and I'm going to do: predict-no
  210. ENV: Agent did: predict-no for direction L in state State-B
  211. In State-B moving L
  212. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  213. predict error 1
  214. dir: dir isL
  215. /|28: O: O56 (predict-no)
  216. I see 0 and I'm going to do: predict-no
  217. ENV: Agent did: predict-no for direction L in state State-A
  218. In State-A moving L
  219. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  220. predict error 0
  221. dir: dir isR
  222. \-/29: O: O57 (predict-yes)
  223. I see 1 and I'm going to do: predict-yes
  224. ENV: Agent did: predict-yes for direction R in state State-A
  225. In State-A moving R
  226. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  227. predict error 0
  228. dir: dir isR
  229. |\-30: O: O59 (predict-yes)
  230. I see 1 and I'm going to do: predict-yes
  231. ENV: Agent did: predict-yes for direction R in state State-B
  232. In State-B moving R
  233. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  234. predict error 1
  235. dir: dir isL
  236. /|\31: O: O62 (predict-no)
  237. I see 0 and I'm going to do: predict-no
  238. ENV: Agent did: predict-no for direction L in state State-B
  239. In State-B moving L
  240. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  241. predict error 1
  242. dir: dir isL
  243. -32: O: O64 (predict-no)
  244. I see 0 and I'm going to do: predict-no
  245. ENV: Agent did: predict-no for direction L in state State-A
  246. In State-A moving L
  247. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  248. predict error 0
  249. dir: dir isL
  250. /|\33: O: O66 (predict-no)
  251. I see 1 and I'm going to do: predict-no
  252. ENV: Agent did: predict-no for direction L in state State-A
  253. In State-A moving L
  254. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  255. predict error 0
  256. dir: dir isR
  257. -/|34: O: O67 (predict-yes)
  258. I see 1 and I'm going to do: predict-yes
  259. ENV: Agent did: predict-yes for direction R in state State-A
  260. In State-A moving R
  261. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  262. predict error 0
  263. dir: dir isL
  264. \-/35: O: O70 (predict-no)
  265. I see 1 and I'm going to do: predict-no
  266. ENV: Agent did: predict-no for direction L in state State-B
  267. In State-B moving L
  268. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  269. predict error 1
  270. dir: dir isL
  271. |\-/36: O: O72 (predict-no)
  272. I see 0 and I'm going to do: predict-no
  273. ENV: Agent did: predict-no for direction L in state State-A
  274. In State-A moving L
  275. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  276. predict error 0
  277. dir: dir isU
  278. |\-37: O: O74 (predict-no)
  279. I see 1 and I'm going to do: predict-no
  280. ENV: Agent did: predict-no for direction U in state State-A
  281. In State-A moving U
  282. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  283. predict error 0
  284. dir: dir isR
  285. /|\38: O: O76 (predict-no)
  286. I see 1 and I'm going to do: predict-no
  287. ENV: Agent did: predict-no for direction R in state State-A
  288. In State-A moving R
  289. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  290. predict error 1
  291. dir: dir isR
  292. -/|39: O: O77 (predict-yes)
  293. I see 0 and I'm going to do: predict-yes
  294. ENV: Agent did: predict-yes for direction R in state State-B
  295. In State-B moving R
  296. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  297. predict error 1
  298. dir: dir isL
  299. \-/40: O: O80 (predict-no)
  300. I see 0 and I'm going to do: predict-no
  301. ENV: Agent did: predict-no for direction L in state State-B
  302. In State-B moving L
  303. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  304. predict error 1
  305. dir: dir isU
  306. |\-41: O: O82 (predict-no)
  307. I see 0 and I'm going to do: predict-no
  308. ENV: Agent did: predict-no for direction U in state State-A
  309. In State-A moving U
  310. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  311. predict error 0
  312. dir: dir isU
  313. /42: O: O84 (predict-no)
  314. I see 1 and I'm going to do: predict-no
  315. ENV: Agent did: predict-no for direction U in state State-A
  316. In State-A moving U
  317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  318. predict error 0
  319. dir: dir isL
  320. |\43: O: O85 (predict-yes)
  321. I see 1 and I'm going to do: predict-yes
  322. ENV: Agent did: predict-yes for direction L in state State-A
  323. In State-A moving L
  324. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  325. predict error 1
  326. dir: dir isL
  327. -/|44: O: O88 (predict-no)
  328. I see 0 and I'm going to do: predict-no
  329. ENV: Agent did: predict-no for direction L in state State-A
  330. In State-A moving L
  331. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  332. predict error 0
  333. dir: dir isU
  334. \-/45: O: O90 (predict-no)
  335. I see 1 and I'm going to do: predict-no
  336. ENV: Agent did: predict-no for direction U in state State-A
  337. In State-A moving U
  338. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  339. predict error 0
  340. dir: dir isU
  341. |\-46: O: O92 (predict-no)
  342. I see 1 and I'm going to do: predict-no
  343. ENV: Agent did: predict-no for direction U in state State-A
  344. In State-A moving U
  345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  346. predict error 0
  347. dir: dir isU
  348. /|\47: O: O94 (predict-no)
  349. I see 1 and I'm going to do: predict-no
  350. ENV: Agent did: predict-no for direction U in state State-A
  351. In State-A moving U
  352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  353. predict error 0
  354. dir: dir isR
  355. -/48: O: O95 (predict-yes)
  356. I see 1 and I'm going to do: predict-yes
  357. ENV: Agent did: predict-yes for direction R in state State-A
  358. In State-A moving R
  359. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  360. predict error 0
  361. dir: dir isU
  362. |\-49: O: O98 (predict-no)
  363. I see 1 and I'm going to do: predict-no
  364. ENV: Agent did: predict-no for direction U in state State-B
  365. In State-B moving U
  366. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  367. predict error 0
  368. dir: dir isU
  369. /|\50: O: O100 (predict-no)
  370. I see 1 and I'm going to do: predict-no
  371. ENV: Agent did: predict-no for direction U in state State-B
  372. In State-B moving U
  373. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  374. predict error 0
  375. dir: dir isL
  376. -/|\-/|sleeping...
  377. \sleeping...
  378. -sleeping...
  379. /sleeping...
  380. |sleeping...
  381. \sleeping...
  382. -sleeping...
  383. /sleeping...
  384. |sleeping...
  385. \sleeping...
  386. -sleeping...
  387. /sleeping...
  388. |sleeping...
  389. \sleeping...
  390. -sleeping...
  391. /sleeping...
  392. |sleeping...
  393. \sleeping...
  394. -sleeping...
  395. /sleeping...
  396. |sleeping...
  397. \sleeping...
  398. -sleeping...
  399. /sleeping...
  400. |sleeping...
  401. \sleeping...
  402. -sleeping...
  403. /sleeping...
  404. |sleeping...
  405. \sleeping...
  406. -sleeping...
  407. /sleeping...
  408. |sleeping...
  409. \sleeping...
  410. -sleeping...
  411. /sleeping...
  412. |sleeping...
  413. \51: O: O102 (predict-no)
  414. I see 1 and I'm going to do: predict-no
  415. ENV: Agent did: predict-no for direction L in state State-B
  416. In State-B moving L
  417. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  418. predict error 1
  419. dir: dir isR
  420. rule alias: '*'
  421. rule alias: '*'
  422. -52: O: O104 (predict-no)
  423. I see 0 and I'm going to do: predict-no
  424. ENV: Agent did: predict-no for direction R in state State-A
  425. In State-A moving R
  426. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  427. predict error 1
  428. dir: dir isU
  429. /|53: O: O106 (predict-no)
  430. I see 0 and I'm going to do: predict-no
  431. ENV: Agent did: predict-no for direction U in state State-B
  432. In State-B moving U
  433. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  434. predict error 0
  435. dir: dir isU
  436. \-/54: O: O108 (predict-no)
  437. I see 1 and I'm going to do: predict-no
  438. ENV: Agent did: predict-no for direction U in state State-B
  439. In State-B moving U
  440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  441. predict error 0
  442. dir: dir isR
  443. |\55: O: O109 (predict-yes)
  444. I see 1 and I'm going to do: predict-yes
  445. ENV: Agent did: predict-yes for direction R in state State-B
  446. In State-B moving R
  447. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  448. predict error 1
  449. dir: dir isR
  450. -/|56: O: O111 (predict-yes)
  451. I see 0 and I'm going to do: predict-yes
  452. ENV: Agent did: predict-yes for direction R in state State-B
  453. In State-B moving R
  454. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  455. predict error 1
  456. dir: dir isL
  457. \-/57: O: O114 (predict-no)
  458. I see 0 and I'm going to do: predict-no
  459. ENV: Agent did: predict-no for direction L in state State-B
  460. In State-B moving L
  461. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  462. predict error 1
  463. dir: dir isL
  464. |\58: O: O116 (predict-no)
  465. I see 0 and I'm going to do: predict-no
  466. ENV: Agent did: predict-no for direction L in state State-A
  467. In State-A moving L
  468. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  469. predict error 0
  470. dir: dir isU
  471. -/|59: O: O118 (predict-no)
  472. I see 1 and I'm going to do: predict-no
  473. ENV: Agent did: predict-no for direction U in state State-A
  474. In State-A moving U
  475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  476. predict error 0
  477. dir: dir isR
  478. \-60: O: O119 (predict-yes)
  479. I see 1 and I'm going to do: predict-yes
  480. ENV: Agent did: predict-yes for direction R in state State-A
  481. In State-A moving R
  482. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  483. predict error 0
  484. dir: dir isL
  485. /61: O: O122 (predict-no)
  486. I see 1 and I'm going to do: predict-no
  487. ENV: Agent did: predict-no for direction L in state State-B
  488. In State-B moving L
  489. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  490. predict error 1
  491. dir: dir isR
  492. rule alias: '*'
  493. rule alias: '*'
  494. rule alias: '*'
  495. rule alias: '*'
  496. rule alias: '*'
  497. rule alias: '*'
  498. rule alias: '*'
  499. rule alias: '*'
  500. rule alias: '*'
  501. rule alias: '*'
  502. |62: O: O123 (predict-yes)
  503. I see 0 and I'm going to do: predict-yes
  504. ENV: Agent did: predict-yes for direction R in state State-A
  505. In State-A moving R
  506. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  507. predict error 0
  508. dir: dir isU
  509. \-/63: O: O126 (predict-no)
  510. I see 1 and I'm going to do: predict-no
  511. ENV: Agent did: predict-no for direction U in state State-B
  512. In State-B moving U
  513. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  514. predict error 0
  515. dir: dir isU
  516. |\-64: O: O128 (predict-no)
  517. I see 1 and I'm going to do: predict-no
  518. ENV: Agent did: predict-no for direction U in state State-B
  519. In State-B moving U
  520. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  521. predict error 0
  522. dir: dir isR
  523. /|65: O: O129 (predict-yes)
  524. I see 1 and I'm going to do: predict-yes
  525. ENV: Agent did: predict-yes for direction R in state State-B
  526. In State-B moving R
  527. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  528. predict error 1
  529. dir: dir isR
  530. \-/66: O: O132 (predict-no)
  531. I see 0 and I'm going to do: predict-no
  532. ENV: Agent did: predict-no for direction R in state State-B
  533. In State-B moving R
  534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  535. predict error 0
  536. dir: dir isR
  537. |\-67: O: O134 (predict-no)
  538. I see 1 and I'm going to do: predict-no
  539. ENV: Agent did: predict-no for direction R in state State-B
  540. In State-B moving R
  541. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  542. predict error 0
  543. dir: dir isU
  544. /|68: O: O136 (predict-no)
  545. I see 1 and I'm going to do: predict-no
  546. ENV: Agent did: predict-no for direction U in state State-B
  547. In State-B moving U
  548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  549. predict error 0
  550. dir: dir isR
  551. \-/69: O: O137 (predict-yes)
  552. I see 1 and I'm going to do: predict-yes
  553. ENV: Agent did: predict-yes for direction R in state State-B
  554. In State-B moving R
  555. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  556. predict error 1
  557. dir: dir isR
  558. |\-70: O: O139 (predict-yes)
  559. I see 0 and I'm going to do: predict-yes
  560. ENV: Agent did: predict-yes for direction R in state State-B
  561. In State-B moving R
  562. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  563. predict error 1
  564. dir: dir isR
  565. /71: O: O142 (predict-no)
  566. I see 0 and I'm going to do: predict-no
  567. ENV: Agent did: predict-no for direction R in state State-B
  568. In State-B moving R
  569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  570. predict error 0
  571. dir: dir isL
  572. rule alias: '*'
  573. |72: O: O144 (predict-no)
  574. I see 1 and I'm going to do: predict-no
  575. ENV: Agent did: predict-no for direction L in state State-B
  576. In State-B moving L
  577. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  578. predict error 1
  579. dir: dir isL
  580. \-/73: O: O146 (predict-no)
  581. I see 0 and I'm going to do: predict-no
  582. ENV: Agent did: predict-no for direction L in state State-A
  583. In State-A moving L
  584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  585. predict error 0
  586. dir: dir isU
  587. |\74: O: O148 (predict-no)
  588. I see 1 and I'm going to do: predict-no
  589. ENV: Agent did: predict-no for direction U in state State-A
  590. In State-A moving U
  591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  592. predict error 0
  593. dir: dir isU
  594. -/75: O: O149 (predict-yes)
  595. I see 1 and I'm going to do: predict-yes
  596. ENV: Agent did: predict-yes for direction U in state State-A
  597. In State-A moving U
  598. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  599. predict error 1
  600. dir: dir isR
  601. |\76: O: O152 (predict-no)
  602. I see 0 and I'm going to do: predict-no
  603. ENV: Agent did: predict-no for direction R in state State-A
  604. In State-A moving R
  605. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  606. predict error 1
  607. dir: dir isR
  608. -/|77: O: O153 (predict-yes)
  609. I see 0 and I'm going to do: predict-yes
  610. ENV: Agent did: predict-yes for direction R in state State-B
  611. In State-B moving R
  612. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  613. predict error 1
  614. dir: dir isL
  615. \-/78: O: O156 (predict-no)
  616. I see 0 and I'm going to do: predict-no
  617. ENV: Agent did: predict-no for direction L in state State-B
  618. In State-B moving L
  619. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  620. predict error 1
  621. dir: dir isR
  622. |\-79: O: O158 (predict-no)
  623. I see 0 and I'm going to do: predict-no
  624. ENV: Agent did: predict-no for direction R in state State-A
  625. In State-A moving R
  626. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  627. predict error 1
  628. dir: dir isU
  629. /|\80: O: O160 (predict-no)
  630. I see 0 and I'm going to do: predict-no
  631. ENV: Agent did: predict-no for direction U in state State-B
  632. In State-B moving U
  633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  634. predict error 0
  635. dir: dir isU
  636. -/81: O: O162 (predict-no)
  637. I see 1 and I'm going to do: predict-no
  638. ENV: Agent did: predict-no for direction U in state State-B
  639. In State-B moving U
  640. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  641. predict error 0
  642. dir: dir isR
  643. rule alias: '*'
  644. |82: O: O163 (predict-yes)
  645. I see 1 and I'm going to do: predict-yes
  646. ENV: Agent did: predict-yes for direction R in state State-B
  647. In State-B moving R
  648. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  649. predict error 1
  650. dir: dir isU
  651. \-/|83: O: O166 (predict-no)
  652. I see 0 and I'm going to do: predict-no
  653. ENV: Agent did: predict-no for direction U in state State-B
  654. In State-B moving U
  655. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  656. predict error 0
  657. dir: dir isL
  658. \-/84: O: O168 (predict-no)
  659. I see 1 and I'm going to do: predict-no
  660. ENV: Agent did: predict-no for direction L in state State-B
  661. In State-B moving L
  662. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  663. predict error 1
  664. dir: dir isR
  665. |\-85: O: O170 (predict-no)
  666. I see 0 and I'm going to do: predict-no
  667. ENV: Agent did: predict-no for direction R in state State-A
  668. In State-A moving R
  669. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  670. predict error 1
  671. dir: dir isU
  672. /|\86: O: O172 (predict-no)
  673. I see 0 and I'm going to do: predict-no
  674. ENV: Agent did: predict-no for direction U in state State-B
  675. In State-B moving U
  676. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  677. predict error 0
  678. dir: dir isR
  679. -/|87: O: O174 (predict-no)
  680. I see 1 and I'm going to do: predict-no
  681. ENV: Agent did: predict-no for direction R in state State-B
  682. In State-B moving R
  683. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  684. predict error 0
  685. dir: dir isR
  686. \-/88: O: O176 (predict-no)
  687. I see 1 and I'm going to do: predict-no
  688. ENV: Agent did: predict-no for direction R in state State-B
  689. In State-B moving R
  690. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  691. predict error 0
  692. dir: dir isL
  693. |\-89: O: O177 (predict-yes)
  694. I see 1 and I'm going to do: predict-yes
  695. ENV: Agent did: predict-yes for direction L in state State-B
  696. In State-B moving L
  697. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  698. predict error 0
  699. dir: dir isR
  700. /|\90: O: O179 (predict-yes)
  701. I see 1 and I'm going to do: predict-yes
  702. ENV: Agent did: predict-yes for direction R in state State-A
  703. In State-A moving R
  704. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  705. predict error 0
  706. dir: dir isU
  707. -/91: O: O182 (predict-no)
  708. I see 1 and I'm going to do: predict-no
  709. ENV: Agent did: predict-no for direction U in state State-B
  710. In State-B moving U
  711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  712. predict error 0
  713. dir: dir isL
  714. rule alias: '*'
  715. rule alias: '*'
  716. rule alias: '*'
  717. |92: O: O184 (predict-no)
  718. I see 1 and I'm going to do: predict-no
  719. ENV: Agent did: predict-no for direction L in state State-B
  720. In State-B moving L
  721. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  722. predict error 1
  723. dir: dir isU
  724. \-93: O: O186 (predict-no)
  725. I see 0 and I'm going to do: predict-no
  726. ENV: Agent did: predict-no for direction U in state State-A
  727. In State-A moving U
  728. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  729. predict error 0
  730. dir: dir isU
  731. /|94: O: O188 (predict-no)
  732. I see 1 and I'm going to do: predict-no
  733. ENV: Agent did: predict-no for direction U in state State-A
  734. In State-A moving U
  735. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  736. predict error 0
  737. dir: dir isU
  738. \-/95: O: O190 (predict-no)
  739. I see 1 and I'm going to do: predict-no
  740. ENV: Agent did: predict-no for direction U in state State-A
  741. In State-A moving U
  742. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  743. predict error 0
  744. dir: dir isU
  745. |\-96: O: O191 (predict-yes)
  746. I see 1 and I'm going to do: predict-yes
  747. ENV: Agent did: predict-yes for direction U in state State-A
  748. In State-A moving U
  749. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  750. predict error 1
  751. dir: dir isU
  752. /|\-97: O: O194 (predict-no)
  753. I see 0 and I'm going to do: predict-no
  754. ENV: Agent did: predict-no for direction U in state State-A
  755. In State-A moving U
  756. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  757. predict error 0
  758. dir: dir isR
  759. /|\98: O: O196 (predict-no)
  760. I see 1 and I'm going to do: predict-no
  761. ENV: Agent did: predict-no for direction R in state State-A
  762. In State-A moving R
  763. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  764. predict error 1
  765. dir: dir isR
  766. -/99: O: O198 (predict-no)
  767. I see 0 and I'm going to do: predict-no
  768. ENV: Agent did: predict-no for direction R in state State-B
  769. In State-B moving R
  770. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  771. predict error 0
  772. dir: dir isR
  773. |\-100: O: O200 (predict-no)
  774. I see 1 and I'm going to do: predict-no
  775. ENV: Agent did: predict-no for direction R in state State-B
  776. In State-B moving R
  777. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  778. predict error 0
  779. dir: dir isL
  780. /|\101: O: O201 (predict-yes)
  781. I see 1 and I'm going to do: predict-yes
  782. ENV: Agent did: predict-yes for direction L in state State-B
  783. In State-B moving L
  784. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  785. predict error 0
  786. dir: dir isU
  787. rule alias: '*'
  788. rule alias: '*'
  789. -/|\-/|\-/|\-/|\-/|\-/|\-/|\-sleeping...
  790. /sleeping...
  791. |sleeping...
  792. \sleeping...
  793. -sleeping...
  794. /sleeping...
  795. |sleeping...
  796. \sleeping...
  797. -sleeping...
  798. /sleeping...
  799. |sleeping...
  800. \sleeping...
  801. -sleeping...
  802. /sleeping...
  803. |sleeping...
  804. \sleeping...
  805. -sleeping...
  806. /sleeping...
  807. |102: O: O203 (predict-yes)
  808. I see 1 and I'm going to do: predict-yes
  809. ENV: Agent did: predict-yes for direction U in state State-A
  810. In State-A moving U
  811. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  812. predict error 1
  813. dir: dir isR
  814. \-/|103: O: O206 (predict-no)
  815. I see 0 and I'm going to do: predict-no
  816. ENV: Agent did: predict-no for direction R in state State-A
  817. In State-A moving R
  818. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  819. predict error 1
  820. dir: dir isL
  821. \-/104: O: O207 (predict-yes)
  822. I see 0 and I'm going to do: predict-yes
  823. ENV: Agent did: predict-yes for direction L in state State-B
  824. In State-B moving L
  825. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  826. predict error 0
  827. dir: dir isR
  828. |\105: O: O210 (predict-no)
  829. I see 1 and I'm going to do: predict-no
  830. ENV: Agent did: predict-no for direction R in state State-A
  831. In State-A moving R
  832. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  833. predict error 1
  834. dir: dir isR
  835. -/106: O: O211 (predict-yes)
  836. I see 0 and I'm going to do: predict-yes
  837. ENV: Agent did: predict-yes for direction R in state State-B
  838. In State-B moving R
  839. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  840. predict error 1
  841. dir: dir isR
  842. |\-107: O: O213 (predict-yes)
  843. I see 0 and I'm going to do: predict-yes
  844. ENV: Agent did: predict-yes for direction R in state State-B
  845. In State-B moving R
  846. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  847. predict error 1
  848. dir: dir isR
  849. /|\-sleeping...
  850. /108: O: O216 (predict-no)
  851. I see 0 and I'm going to do: predict-no
  852. ENV: Agent did: predict-no for direction R in state State-B
  853. In State-B moving R
  854. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  855. predict error 0
  856. dir: dir isR
  857. |\109: O: O218 (predict-no)
  858. I see 1 and I'm going to do: predict-no
  859. ENV: Agent did: predict-no for direction R in state State-B
  860. In State-B moving R
  861. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  862. predict error 0
  863. dir: dir isR
  864. -110: O: O220 (predict-no)
  865. I see 1 and I'm going to do: predict-no
  866. ENV: Agent did: predict-no for direction R in state State-B
  867. In State-B moving R
  868. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  869. predict error 0
  870. dir: dir isR
  871. /|\111: O: O222 (predict-no)
  872. I see 1 and I'm going to do: predict-no
  873. ENV: Agent did: predict-no for direction R in state State-B
  874. In State-B moving R
  875. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  876. predict error 0
  877. dir: dir isR
  878. rule alias: '*'
  879. rule alias: '*'
  880. rule alias: '*'
  881. rule alias: '*'
  882. rule alias: '*'
  883. rule alias: '*'
  884. rule alias: '*'
  885. rule alias: '*'
  886. -112: O: O223 (predict-yes)
  887. I see 1 and I'm going to do: predict-yes
  888. ENV: Agent did: predict-yes for direction R in state State-B
  889. In State-B moving R
  890. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  891. predict error 1
  892. dir: dir isL
  893. /|\113: O: O225 (predict-yes)
  894. I see 0 and I'm going to do: predict-yes
  895. ENV: Agent did: predict-yes for direction L in state State-B
  896. In State-B moving L
  897. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  898. predict error 0
  899. dir: dir isL
  900. -/|114: O: O227 (predict-yes)
  901. I see 1 and I'm going to do: predict-yes
  902. ENV: Agent did: predict-yes for direction L in state State-A
  903. In State-A moving L
  904. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  905. predict error 1
  906. dir: dir isL
  907. \-/115: O: O229 (predict-yes)
  908. I see 0 and I'm going to do: predict-yes
  909. ENV: Agent did: predict-yes for direction L in state State-A
  910. In State-A moving L
  911. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  912. predict error 1
  913. dir: dir isR
  914. |\-/116: O: O232 (predict-no)
  915. I see 0 and I'm going to do: predict-no
  916. ENV: Agent did: predict-no for direction R in state State-A
  917. In State-A moving R
  918. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  919. predict error 1
  920. dir: dir isU
  921. |\-117: O: O234 (predict-no)
  922. I see 0 and I'm going to do: predict-no
  923. ENV: Agent did: predict-no for direction U in state State-B
  924. In State-B moving U
  925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  926. predict error 0
  927. dir: dir isU
  928. /|118: O: O236 (predict-no)
  929. I see 1 and I'm going to do: predict-no
  930. ENV: Agent did: predict-no for direction U in state State-B
  931. In State-B moving U
  932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  933. predict error 0
  934. dir: dir isU
  935. \-/119: O: O238 (predict-no)
  936. I see 1 and I'm going to do: predict-no
  937. ENV: Agent did: predict-no for direction U in state State-B
  938. In State-B moving U
  939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  940. predict error 0
  941. dir: dir isU
  942. |\-120: O: O239 (predict-yes)
  943. I see 1 and I'm going to do: predict-yes
  944. ENV: Agent did: predict-yes for direction U in state State-B
  945. In State-B moving U
  946. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  947. predict error 1
  948. dir: dir isL
  949. /|\121: O: O241 (predict-yes)
  950. I see 0 and I'm going to do: predict-yes
  951. ENV: Agent did: predict-yes for direction L in state State-B
  952. In State-B moving L
  953. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  954. predict error 0
  955. dir: dir isU
  956. rule alias: '*'
  957. rule alias: '*'
  958. rule alias: '*'
  959. rule alias: '*'
  960. rule alias: '*'
  961. rule alias: '*'
  962. rule alias: '*'
  963. rule alias: '*'
  964. -122: O: O244 (predict-no)
  965. I see 1 and I'm going to do: predict-no
  966. ENV: Agent did: predict-no for direction U in state State-A
  967. In State-A moving U
  968. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  969. predict error 0
  970. dir: dir isU
  971. /|123: O: O246 (predict-no)
  972. I see 1 and I'm going to do: predict-no
  973. ENV: Agent did: predict-no for direction U in state State-A
  974. In State-A moving U
  975. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  976. predict error 0
  977. dir: dir isL
  978. \-124: O: O247 (predict-yes)
  979. I see 1 and I'm going to do: predict-yes
  980. ENV: Agent did: predict-yes for direction L in state State-A
  981. In State-A moving L
  982. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  983. predict error 1
  984. dir: dir isL
  985. /|\125: O: O249 (predict-yes)
  986. I see 0 and I'm going to do: predict-yes
  987. ENV: Agent did: predict-yes for direction L in state State-A
  988. In State-A moving L
  989. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  990. predict error 1
  991. dir: dir isL
  992. -/126: O: O251 (predict-yes)
  993. I see 0 and I'm going to do: predict-yes
  994. ENV: Agent did: predict-yes for direction L in state State-A
  995. In State-A moving L
  996. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  997. predict error 1
  998. dir: dir isU
  999. |\-127: O: O254 (predict-no)
  1000. I see 0 and I'm going to do: predict-no
  1001. ENV: Agent did: predict-no for direction U in state State-A
  1002. In State-A moving U
  1003. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1004. predict error 0
  1005. dir: dir isL
  1006. /|128: O: O255 (predict-yes)
  1007. I see 1 and I'm going to do: predict-yes
  1008. ENV: Agent did: predict-yes for direction L in state State-A
  1009. In State-A moving L
  1010. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1011. predict error 1
  1012. dir: dir isL
  1013. \-/129: O: O257 (predict-yes)
  1014. I see 0 and I'm going to do: predict-yes
  1015. ENV: Agent did: predict-yes for direction L in state State-A
  1016. In State-A moving L
  1017. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1018. predict error 1
  1019. dir: dir isR
  1020. |\-130: O: O260 (predict-no)
  1021. I see 0 and I'm going to do: predict-no
  1022. ENV: Agent did: predict-no for direction R in state State-A
  1023. In State-A moving R
  1024. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1025. predict error 1
  1026. dir: dir isR
  1027. /|\131: O: O262 (predict-no)
  1028. I see 0 and I'm going to do: predict-no
  1029. ENV: Agent did: predict-no for direction R in state State-B
  1030. In State-B moving R
  1031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1032. predict error 0
  1033. dir: dir isL
  1034. -132: O: O263 (predict-yes)
  1035. I see 1 and I'm going to do: predict-yes
  1036. ENV: Agent did: predict-yes for direction L in state State-B
  1037. In State-B moving L
  1038. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1039. predict error 0
  1040. dir: dir isL
  1041. /|133: O: O265 (predict-yes)
  1042. I see 1 and I'm going to do: predict-yes
  1043. ENV: Agent did: predict-yes for direction L in state State-A
  1044. In State-A moving L
  1045. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1046. predict error 1
  1047. dir: dir isR
  1048. \-134: O: O268 (predict-no)
  1049. I see 0 and I'm going to do: predict-no
  1050. ENV: Agent did: predict-no for direction R in state State-A
  1051. In State-A moving R
  1052. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1053. predict error 1
  1054. dir: dir isL
  1055. /|135: O: O270 (predict-no)
  1056. I see 0 and I'm going to do: predict-no
  1057. ENV: Agent did: predict-no for direction L in state State-B
  1058. In State-B moving L
  1059. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1060. predict error 1
  1061. dir: dir isL
  1062. \-/136: O: O271 (predict-yes)
  1063. I see 0 and I'm going to do: predict-yes
  1064. ENV: Agent did: predict-yes for direction L in state State-A
  1065. In State-A moving L
  1066. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1067. predict error 1
  1068. dir: dir isU
  1069. |137: O: O274 (predict-no)
  1070. I see 0 and I'm going to do: predict-no
  1071. ENV: Agent did: predict-no for direction U in state State-A
  1072. In State-A moving U
  1073. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1074. predict error 0
  1075. dir: dir isR
  1076. \-/138: O: O276 (predict-no)
  1077. I see 1 and I'm going to do: predict-no
  1078. ENV: Agent did: predict-no for direction R in state State-A
  1079. In State-A moving R
  1080. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1081. predict error 1
  1082. dir: dir isL
  1083. |\-139: O: O277 (predict-yes)
  1084. I see 0 and I'm going to do: predict-yes
  1085. ENV: Agent did: predict-yes for direction L in state State-B
  1086. In State-B moving L
  1087. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1088. predict error 0
  1089. dir: dir isR
  1090. /|140: O: O279 (predict-yes)
  1091. I see 1 and I'm going to do: predict-yes
  1092. ENV: Agent did: predict-yes for direction R in state State-A
  1093. In State-A moving R
  1094. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1095. predict error 0
  1096. dir: dir isL
  1097. \-141: O: O282 (predict-no)
  1098. I see 1 and I'm going to do: predict-no
  1099. ENV: Agent did: predict-no for direction L in state State-B
  1100. In State-B moving L
  1101. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1102. predict error 1
  1103. dir: dir isR
  1104. /142: O: O283 (predict-yes)
  1105. I see 0 and I'm going to do: predict-yes
  1106. ENV: Agent did: predict-yes for direction R in state State-A
  1107. In State-A moving R
  1108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1109. predict error 0
  1110. dir: dir isR
  1111. |\-143: O: O286 (predict-no)
  1112. I see 1 and I'm going to do: predict-no
  1113. ENV: Agent did: predict-no for direction R in state State-B
  1114. In State-B moving R
  1115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1116. predict error 0
  1117. dir: dir isL
  1118. /|144: O: O287 (predict-yes)
  1119. I see 1 and I'm going to do: predict-yes
  1120. ENV: Agent did: predict-yes for direction L in state State-B
  1121. In State-B moving L
  1122. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1123. predict error 0
  1124. dir: dir isL
  1125. \-/145: O: O289 (predict-yes)
  1126. I see 1 and I'm going to do: predict-yes
  1127. ENV: Agent did: predict-yes for direction L in state State-A
  1128. In State-A moving L
  1129. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1130. predict error 1
  1131. dir: dir isU
  1132. |\-146: O: O292 (predict-no)
  1133. I see 0 and I'm going to do: predict-no
  1134. ENV: Agent did: predict-no for direction U in state State-A
  1135. In State-A moving U
  1136. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1137. predict error 0
  1138. dir: dir isR
  1139. /|\147: O: O294 (predict-no)
  1140. I see 1 and I'm going to do: predict-no
  1141. ENV: Agent did: predict-no for direction R in state State-A
  1142. In State-A moving R
  1143. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1144. predict error 1
  1145. dir: dir isL
  1146. -148: O: O295 (predict-yes)
  1147. I see 0 and I'm going to do: predict-yes
  1148. ENV: Agent did: predict-yes for direction L in state State-B
  1149. In State-B moving L
  1150. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1151. predict error 0
  1152. dir: dir isR
  1153. /|\149: O: O297 (predict-yes)
  1154. I see 1 and I'm going to do: predict-yes
  1155. ENV: Agent did: predict-yes for direction R in state State-A
  1156. In State-A moving R
  1157. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1158. predict error 0
  1159. dir: dir isU
  1160. -/|150: O: O300 (predict-no)
  1161. I see 1 and I'm going to do: predict-no
  1162. ENV: Agent did: predict-no for direction U in state State-B
  1163. In State-B moving U
  1164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1165. predict error 0
  1166. dir: dir isL
  1167. \-/151: O: O301 (predict-yes)
  1168. I see 1 and I'm going to do: predict-yes
  1169. ENV: Agent did: predict-yes for direction L in state State-B
  1170. In State-B moving L
  1171. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1172. predict error 0
  1173. dir: dir isL
  1174. |152: O: O303 (predict-yes)
  1175. I see 1 and I'm going to do: predict-yes
  1176. ENV: Agent did: predict-yes for direction L in state State-A
  1177. In State-A moving L
  1178. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1179. predict error 1
  1180. dir: dir isL
  1181. \-153: O: O305 (predict-yes)
  1182. I see 0 and I'm going to do: predict-yes
  1183. ENV: Agent did: predict-yes for direction L in state State-A
  1184. In State-A moving L
  1185. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1186. predict error 1
  1187. dir: dir isU
  1188. /|\154: O: O308 (predict-no)
  1189. I see 0 and I'm going to do: predict-no
  1190. ENV: Agent did: predict-no for direction U in state State-A
  1191. In State-A moving U
  1192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1193. predict error 0
  1194. dir: dir isL
  1195. -/|155: O: O309 (predict-yes)
  1196. I see 1 and I'm going to do: predict-yes
  1197. ENV: Agent did: predict-yes for direction L in state State-A
  1198. In State-A moving L
  1199. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1200. predict error 1
  1201. dir: dir isU
  1202. \-156: O: O312 (predict-no)
  1203. I see 0 and I'm going to do: predict-no
  1204. ENV: Agent did: predict-no for direction U in state State-A
  1205. In State-A moving U
  1206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1207. predict error 0
  1208. dir: dir isU
  1209. /|157: O: O313 (predict-yes)
  1210. I see 1 and I'm going to do: predict-yes
  1211. ENV: Agent did: predict-yes for direction U in state State-A
  1212. In State-A moving U
  1213. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1214. predict error 1
  1215. dir: dir isR
  1216. \-158: O: O315 (predict-yes)
  1217. I see 0 and I'm going to do: predict-yes
  1218. ENV: Agent did: predict-yes for direction R in state State-A
  1219. In State-A moving R
  1220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1221. predict error 0
  1222. dir: dir isL
  1223. /159: O: O317 (predict-yes)
  1224. I see 1 and I'm going to do: predict-yes
  1225. ENV: Agent did: predict-yes for direction L in state State-B
  1226. In State-B moving L
  1227. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1228. predict error 0
  1229. dir: dir isU
  1230. |\-160: O: O320 (predict-no)
  1231. I see 1 and I'm going to do: predict-no
  1232. ENV: Agent did: predict-no for direction U in state State-A
  1233. In State-A moving U
  1234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1235. predict error 0
  1236. dir: dir isU
  1237. /|161: O: O322 (predict-no)
  1238. I see 1 and I'm going to do: predict-no
  1239. ENV: Agent did: predict-no for direction U in state State-A
  1240. In State-A moving U
  1241. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1242. predict error 0
  1243. dir: dir isR
  1244. \162: O: O323 (predict-yes)
  1245. I see 1 and I'm going to do: predict-yes
  1246. ENV: Agent did: predict-yes for direction R in state State-A
  1247. In State-A moving R
  1248. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1249. predict error 0
  1250. dir: dir isL
  1251. -/163: O: O325 (predict-yes)
  1252. I see 1 and I'm going to do: predict-yes
  1253. ENV: Agent did: predict-yes for direction L in state State-B
  1254. In State-B moving L
  1255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1256. predict error 0
  1257. dir: dir isR
  1258. |\-164: O: O327 (predict-yes)
  1259. I see 1 and I'm going to do: predict-yes
  1260. ENV: Agent did: predict-yes for direction R in state State-A
  1261. In State-A moving R
  1262. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1263. predict error 0
  1264. dir: dir isR
  1265. /|\165: O: O329 (predict-yes)
  1266. I see 1 and I'm going to do: predict-yes
  1267. ENV: Agent did: predict-yes for direction R in state State-B
  1268. In State-B moving R
  1269. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1270. predict error 1
  1271. dir: dir isR
  1272. -/166: O: O332 (predict-no)
  1273. I see 0 and I'm going to do: predict-no
  1274. ENV: Agent did: predict-no for direction R in state State-B
  1275. In State-B moving R
  1276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1277. predict error 0
  1278. dir: dir isL
  1279. |\-167: O: O333 (predict-yes)
  1280. I see 1 and I'm going to do: predict-yes
  1281. ENV: Agent did: predict-yes for direction L in state State-B
  1282. In State-B moving L
  1283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1284. predict error 0
  1285. dir: dir isR
  1286. /|168: O: O335 (predict-yes)
  1287. I see 1 and I'm going to do: predict-yes
  1288. ENV: Agent did: predict-yes for direction R in state State-A
  1289. In State-A moving R
  1290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1291. predict error 0
  1292. dir: dir isL
  1293. \-169: O: O337 (predict-yes)
  1294. I see 1 and I'm going to do: predict-yes
  1295. ENV: Agent did: predict-yes for direction L in state State-B
  1296. In State-B moving L
  1297. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1298. predict error 0
  1299. dir: dir isL
  1300. /|170: O: O339 (predict-yes)
  1301. I see 1 and I'm going to do: predict-yes
  1302. ENV: Agent did: predict-yes for direction L in state State-A
  1303. In State-A moving L
  1304. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1305. predict error 1
  1306. dir: dir isU
  1307. \-171: O: O341 (predict-yes)
  1308. I see 0 and I'm going to do: predict-yes
  1309. ENV: Agent did: predict-yes for direction U in state State-A
  1310. In State-A moving U
  1311. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1312. predict error 1
  1313. dir: dir isU
  1314. /172: O: O344 (predict-no)
  1315. I see 0 and I'm going to do: predict-no
  1316. ENV: Agent did: predict-no for direction U in state State-A
  1317. In State-A moving U
  1318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1319. predict error 0
  1320. dir: dir isL
  1321. |\173: O: O345 (predict-yes)
  1322. I see 1 and I'm going to do: predict-yes
  1323. ENV: Agent did: predict-yes for direction L in state State-A
  1324. In State-A moving L
  1325. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1326. predict error 1
  1327. dir: dir isU
  1328. -/|174: O: O348 (predict-no)
  1329. I see 0 and I'm going to do: predict-no
  1330. ENV: Agent did: predict-no for direction U in state State-A
  1331. In State-A moving U
  1332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1333. predict error 0
  1334. dir: dir isL
  1335. \-/175: O: O350 (predict-no)
  1336. I see 1 and I'm going to do: predict-no
  1337. ENV: Agent did: predict-no for direction L in state State-A
  1338. In State-A moving L
  1339. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1340. predict error 0
  1341. dir: dir isU
  1342. |\-/176: O: O352 (predict-no)
  1343. I see 1 and I'm going to do: predict-no
  1344. ENV: Agent did: predict-no for direction U in state State-A
  1345. In State-A moving U
  1346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1347. predict error 0
  1348. dir: dir isU
  1349. |\-177: O: O354 (predict-no)
  1350. I see 1 and I'm going to do: predict-no
  1351. ENV: Agent did: predict-no for direction U in state State-A
  1352. In State-A moving U
  1353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1354. predict error 0
  1355. dir: dir isR
  1356. /|\-178: O: O355 (predict-yes)
  1357. I see 1 and I'm going to do: predict-yes
  1358. ENV: Agent did: predict-yes for direction R in state State-A
  1359. In State-A moving R
  1360. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1361. predict error 0
  1362. dir: dir isL
  1363. /|\179: O: O357 (predict-yes)
  1364. I see 1 and I'm going to do: predict-yes
  1365. ENV: Agent did: predict-yes for direction L in state State-B
  1366. In State-B moving L
  1367. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1368. predict error 0
  1369. dir: dir isL
  1370. -/|180: O: O360 (predict-no)
  1371. I see 1 and I'm going to do: predict-no
  1372. ENV: Agent did: predict-no for direction L in state State-A
  1373. In State-A moving L
  1374. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1375. predict error 0
  1376. dir: dir isU
  1377. \-/181: O: O362 (predict-no)
  1378. I see 1 and I'm going to do: predict-no
  1379. ENV: Agent did: predict-no for direction U in state State-A
  1380. In State-A moving U
  1381. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1382. predict error 0
  1383. dir: dir isL
  1384. |182: O: O363 (predict-yes)
  1385. I see 1 and I'm going to do: predict-yes
  1386. ENV: Agent did: predict-yes for direction L in state State-A
  1387. In State-A moving L
  1388. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1389. predict error 1
  1390. dir: dir isU
  1391. \-183: O: O366 (predict-no)
  1392. I see 0 and I'm going to do: predict-no
  1393. ENV: Agent did: predict-no for direction U in state State-A
  1394. In State-A moving U
  1395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1396. predict error 0
  1397. dir: dir isU
  1398. /|\-184: O: O367 (predict-yes)
  1399. I see 1 and I'm going to do: predict-yes
  1400. ENV: Agent did: predict-yes for direction U in state State-A
  1401. In State-A moving U
  1402. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1403. predict error 1
  1404. dir: dir isR
  1405. /|\185: O: O370 (predict-no)
  1406. I see 0 and I'm going to do: predict-no
  1407. ENV: Agent did: predict-no for direction R in state State-A
  1408. In State-A moving R
  1409. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1410. predict error 1
  1411. dir: dir isL
  1412. -/|186: O: O372 (predict-no)
  1413. I see 0 and I'm going to do: predict-no
  1414. ENV: Agent did: predict-no for direction L in state State-B
  1415. In State-B moving L
  1416. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1417. predict error 1
  1418. dir: dir isU
  1419. \-/187: O: O374 (predict-no)
  1420. I see 0 and I'm going to do: predict-no
  1421. ENV: Agent did: predict-no for direction U in state State-A
  1422. In State-A moving U
  1423. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1424. predict error 0
  1425. dir: dir isU
  1426. |188: O: O376 (predict-no)
  1427. I see 1 and I'm going to do: predict-no
  1428. ENV: Agent did: predict-no for direction U in state State-A
  1429. In State-A moving U
  1430. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1431. predict error 0
  1432. dir: dir isU
  1433. \-189: O: O377 (predict-yes)
  1434. I see 1 and I'm going to do: predict-yes
  1435. ENV: Agent did: predict-yes for direction U in state State-A
  1436. In State-A moving U
  1437. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1438. predict error 1
  1439. dir: dir isR
  1440. /|190: O: O379 (predict-yes)
  1441. I see 0 and I'm going to do: predict-yes
  1442. ENV: Agent did: predict-yes for direction R in state State-A
  1443. In State-A moving R
  1444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1445. predict error 0
  1446. dir: dir isR
  1447. \-191: O: O382 (predict-no)
  1448. I see 1 and I'm going to do: predict-no
  1449. ENV: Agent did: predict-no for direction R in state State-B
  1450. In State-B moving R
  1451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1452. predict error 0
  1453. dir: dir isR
  1454. /192: O: O384 (predict-no)
  1455. I see 1 and I'm going to do: predict-no
  1456. ENV: Agent did: predict-no for direction R in state State-B
  1457. In State-B moving R
  1458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1459. predict error 0
  1460. dir: dir isL
  1461. |193: O: O385 (predict-yes)
  1462. I see 1 and I'm going to do: predict-yes
  1463. ENV: Agent did: predict-yes for direction L in state State-B
  1464. In State-B moving L
  1465. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1466. predict error 0
  1467. dir: dir isU
  1468. \-/194: O: O388 (predict-no)
  1469. I see 1 and I'm going to do: predict-no
  1470. ENV: Agent did: predict-no for direction U in state State-A
  1471. In State-A moving U
  1472. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1473. predict error 0
  1474. dir: dir isR
  1475. |\-195: O: O389 (predict-yes)
  1476. I see 1 and I'm going to do: predict-yes
  1477. ENV: Agent did: predict-yes for direction R in state State-A
  1478. In State-A moving R
  1479. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1480. predict error 0
  1481. dir: dir isL
  1482. /|\196: O: O391 (predict-yes)
  1483. I see 1 and I'm going to do: predict-yes
  1484. ENV: Agent did: predict-yes for direction L in state State-B
  1485. In State-B moving L
  1486. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1487. predict error 0
  1488. dir: dir isL
  1489. -197: O: O394 (predict-no)
  1490. I see 1 and I'm going to do: predict-no
  1491. ENV: Agent did: predict-no for direction L in state State-A
  1492. In State-A moving L
  1493. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1494. predict error 0
  1495. dir: dir isR
  1496. /|\198: O: O395 (predict-yes)
  1497. I see 1 and I'm going to do: predict-yes
  1498. ENV: Agent did: predict-yes for direction R in state State-A
  1499. In State-A moving R
  1500. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1501. predict error 0
  1502. dir: dir isL
  1503. -/|199: O: O397 (predict-yes)
  1504. I see 1 and I'm going to do: predict-yes
  1505. ENV: Agent did: predict-yes for direction L in state State-B
  1506. In State-B moving L
  1507. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1508. predict error 0
  1509. dir: dir isR
  1510. \-/200: O: O399 (predict-yes)
  1511. I see 1 and I'm going to do: predict-yes
  1512. ENV: Agent did: predict-yes for direction R in state State-A
  1513. In State-A moving R
  1514. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1515. predict error 0
  1516. dir: dir isL
  1517. |\-201: O: O401 (predict-yes)
  1518. I see 1 and I'm going to do: predict-yes
  1519. ENV: Agent did: predict-yes for direction L in state State-B
  1520. In State-B moving L
  1521. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1522. predict error 0
  1523. dir: dir isU
  1524. /|202: O: O404 (predict-no)
  1525. I see 1 and I'm going to do: predict-no
  1526. ENV: Agent did: predict-no for direction U in state State-A
  1527. In State-A moving U
  1528. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1529. predict error 0
  1530. dir: dir isU
  1531. \-203: O: O406 (predict-no)
  1532. I see 1 and I'm going to do: predict-no
  1533. ENV: Agent did: predict-no for direction U in state State-A
  1534. In State-A moving U
  1535. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1536. predict error 0
  1537. dir: dir isL
  1538. /|\204: O: O408 (predict-no)
  1539. I see 1 and I'm going to do: predict-no
  1540. ENV: Agent did: predict-no for direction L in state State-A
  1541. In State-A moving L
  1542. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1543. predict error 0
  1544. dir: dir isL
  1545. -205: O: O409 (predict-yes)
  1546. I see 1 and I'm going to do: predict-yes
  1547. ENV: Agent did: predict-yes for direction L in state State-A
  1548. In State-A moving L
  1549. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1550. predict error 1
  1551. dir: dir isL
  1552. /|\206: O: O412 (predict-no)
  1553. I see 0 and I'm going to do: predict-no
  1554. ENV: Agent did: predict-no for direction L in state State-A
  1555. In State-A moving L
  1556. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1557. predict error 0
  1558. dir: dir isU
  1559. -/|207: O: O414 (predict-no)
  1560. I see 1 and I'm going to do: predict-no
  1561. ENV: Agent did: predict-no for direction U in state State-A
  1562. In State-A moving U
  1563. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1564. predict error 0
  1565. dir: dir isU
  1566. \-/208: O: O416 (predict-no)
  1567. I see 1 and I'm going to do: predict-no
  1568. ENV: Agent did: predict-no for direction U in state State-A
  1569. In State-A moving U
  1570. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1571. predict error 0
  1572. dir: dir isR
  1573. |\209: O: O417 (predict-yes)
  1574. I see 1 and I'm going to do: predict-yes
  1575. ENV: Agent did: predict-yes for direction R in state State-A
  1576. In State-A moving R
  1577. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1578. predict error 0
  1579. dir: dir isL
  1580. -/|210: O: O419 (predict-yes)
  1581. I see 1 and I'm going to do: predict-yes
  1582. ENV: Agent did: predict-yes for direction L in state State-B
  1583. In State-B moving L
  1584. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1585. predict error 0
  1586. dir: dir isU
  1587. \-/211: O: O422 (predict-no)
  1588. I see 1 and I'm going to do: predict-no
  1589. ENV: Agent did: predict-no for direction U in state State-A
  1590. In State-A moving U
  1591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1592. predict error 0
  1593. dir: dir isU
  1594. |212: O: O424 (predict-no)
  1595. I see 1 and I'm going to do: predict-no
  1596. ENV: Agent did: predict-no for direction U in state State-A
  1597. In State-A moving U
  1598. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1599. predict error 0
  1600. dir: dir isU
  1601. \-/213: O: O426 (predict-no)
  1602. I see 1 and I'm going to do: predict-no
  1603. ENV: Agent did: predict-no for direction U in state State-A
  1604. In State-A moving U
  1605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1606. predict error 0
  1607. dir: dir isR
  1608. |\-214: O: O427 (predict-yes)
  1609. I see 1 and I'm going to do: predict-yes
  1610. ENV: Agent did: predict-yes for direction R in state State-A
  1611. In State-A moving R
  1612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1613. predict error 0
  1614. dir: dir isU
  1615. /|215: O: O430 (predict-no)
  1616. I see 1 and I'm going to do: predict-no
  1617. ENV: Agent did: predict-no for direction U in state State-B
  1618. In State-B moving U
  1619. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1620. predict error 0
  1621. dir: dir isU
  1622. \216: O: O432 (predict-no)
  1623. I see 1 and I'm going to do: predict-no
  1624. ENV: Agent did: predict-no for direction U in state State-B
  1625. In State-B moving U
  1626. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1627. predict error 0
  1628. dir: dir isR
  1629. -/|217: O: O434 (predict-no)
  1630. I see 1 and I'm going to do: predict-no
  1631. ENV: Agent did: predict-no for direction R in state State-B
  1632. In State-B moving R
  1633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1634. predict error 0
  1635. dir: dir isU
  1636. \-/218: O: O436 (predict-no)
  1637. I see 1 and I'm going to do: predict-no
  1638. ENV: Agent did: predict-no for direction U in state State-B
  1639. In State-B moving U
  1640. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1641. predict error 0
  1642. dir: dir isL
  1643. |\-219: O: O437 (predict-yes)
  1644. I see 1 and I'm going to do: predict-yes
  1645. ENV: Agent did: predict-yes for direction L in state State-B
  1646. In State-B moving L
  1647. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1648. predict error 0
  1649. dir: dir isU
  1650. /|220: O: O439 (predict-yes)
  1651. I see 1 and I'm going to do: predict-yes
  1652. ENV: Agent did: predict-yes for direction U in state State-A
  1653. In State-A moving U
  1654. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1655. predict error 1
  1656. dir: dir isL
  1657. \-/|221: O: O442 (predict-no)
  1658. I see 0 and I'm going to do: predict-no
  1659. ENV: Agent did: predict-no for direction L in state State-A
  1660. In State-A moving L
  1661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1662. predict error 0
  1663. dir: dir isL
  1664. \222: O: O444 (predict-no)
  1665. I see 1 and I'm going to do: predict-no
  1666. ENV: Agent did: predict-no for direction L in state State-A
  1667. In State-A moving L
  1668. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1669. predict error 0
  1670. dir: dir isU
  1671. -/|223: O: O445 (predict-yes)
  1672. I see 1 and I'm going to do: predict-yes
  1673. ENV: Agent did: predict-yes for direction U in state State-A
  1674. In State-A moving U
  1675. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1676. predict error 1
  1677. dir: dir isL
  1678. \-/|sleeping...
  1679. \224: O: O448 (predict-no)
  1680. I see 0 and I'm going to do: predict-no
  1681. ENV: Agent did: predict-no for direction L in state State-A
  1682. In State-A moving L
  1683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1684. predict error 0
  1685. dir: dir isU
  1686. -/|225: O: O450 (predict-no)
  1687. I see 1 and I'm going to do: predict-no
  1688. ENV: Agent did: predict-no for direction U in state State-A
  1689. In State-A moving U
  1690. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1691. predict error 0
  1692. dir: dir isR
  1693. \-/226: O: O451 (predict-yes)
  1694. I see 1 and I'm going to do: predict-yes
  1695. ENV: Agent did: predict-yes for direction R in state State-A
  1696. In State-A moving R
  1697. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1698. predict error 0
  1699. dir: dir isU
  1700. |\-/227: O: O454 (predict-no)
  1701. I see 1 and I'm going to do: predict-no
  1702. ENV: Agent did: predict-no for direction U in state State-B
  1703. In State-B moving U
  1704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1705. predict error 0
  1706. dir: dir isR
  1707. |\-/228: O: O455 (predict-yes)
  1708. I see 1 and I'm going to do: predict-yes
  1709. ENV: Agent did: predict-yes for direction R in state State-B
  1710. In State-B moving R
  1711. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1712. predict error 1
  1713. dir: dir isR
  1714. |\-229: O: O458 (predict-no)
  1715. I see 0 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction R in state State-B
  1717. In State-B moving R
  1718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1719. predict error 0
  1720. dir: dir isL
  1721. /|\230: O: O459 (predict-yes)
  1722. I see 1 and I'm going to do: predict-yes
  1723. ENV: Agent did: predict-yes for direction L in state State-B
  1724. In State-B moving L
  1725. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1726. predict error 0
  1727. dir: dir isU
  1728. -/231: O: O461 (predict-yes)
  1729. I see 1 and I'm going to do: predict-yes
  1730. ENV: Agent did: predict-yes for direction U in state State-A
  1731. In State-A moving U
  1732. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1733. predict error 1
  1734. dir: dir isR
  1735. |232: O: O463 (predict-yes)
  1736. I see 0 and I'm going to do: predict-yes
  1737. ENV: Agent did: predict-yes for direction R in state State-A
  1738. In State-A moving R
  1739. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1740. predict error 0
  1741. dir: dir isU
  1742. \-/233: O: O466 (predict-no)
  1743. I see 1 and I'm going to do: predict-no
  1744. ENV: Agent did: predict-no for direction U in state State-B
  1745. In State-B moving U
  1746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1747. predict error 0
  1748. dir: dir isU
  1749. |\-234: O: O468 (predict-no)
  1750. I see 1 and I'm going to do: predict-no
  1751. ENV: Agent did: predict-no for direction U in state State-B
  1752. In State-B moving U
  1753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1754. predict error 0
  1755. dir: dir isL
  1756. /|235: O: O469 (predict-yes)
  1757. I see 1 and I'm going to do: predict-yes
  1758. ENV: Agent did: predict-yes for direction L in state State-B
  1759. In State-B moving L
  1760. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1761. predict error 0
  1762. dir: dir isR
  1763. \-236: O: O471 (predict-yes)
  1764. I see 1 and I'm going to do: predict-yes
  1765. ENV: Agent did: predict-yes for direction R in state State-A
  1766. In State-A moving R
  1767. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1768. predict error 0
  1769. dir: dir isL
  1770. /|\237: O: O473 (predict-yes)
  1771. I see 1 and I'm going to do: predict-yes
  1772. ENV: Agent did: predict-yes for direction L in state State-B
  1773. In State-B moving L
  1774. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1775. predict error 0
  1776. dir: dir isL
  1777. -/238: O: O475 (predict-yes)
  1778. I see 1 and I'm going to do: predict-yes
  1779. ENV: Agent did: predict-yes for direction L in state State-A
  1780. In State-A moving L
  1781. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1782. predict error 1
  1783. dir: dir isL
  1784. |239: O: O478 (predict-no)
  1785. I see 0 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction L in state State-A
  1787. In State-A moving L
  1788. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1789. predict error 0
  1790. dir: dir isU
  1791. \-240: O: O480 (predict-no)
  1792. I see 1 and I'm going to do: predict-no
  1793. ENV: Agent did: predict-no for direction U in state State-A
  1794. In State-A moving U
  1795. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1796. predict error 0
  1797. dir: dir isU
  1798. /|\241: O: O482 (predict-no)
  1799. I see 1 and I'm going to do: predict-no
  1800. ENV: Agent did: predict-no for direction U in state State-A
  1801. In State-A moving U
  1802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1803. predict error 0
  1804. dir: dir isU
  1805. -242: O: O484 (predict-no)
  1806. I see 1 and I'm going to do: predict-no
  1807. ENV: Agent did: predict-no for direction U in state State-A
  1808. In State-A moving U
  1809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1810. predict error 0
  1811. dir: dir isR
  1812. /|\243: O: O485 (predict-yes)
  1813. I see 1 and I'm going to do: predict-yes
  1814. ENV: Agent did: predict-yes for direction R in state State-A
  1815. In State-A moving R
  1816. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1817. predict error 0
  1818. dir: dir isR
  1819. -/|244: O: O487 (predict-yes)
  1820. I see 1 and I'm going to do: predict-yes
  1821. ENV: Agent did: predict-yes for direction R in state State-B
  1822. In State-B moving R
  1823. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1824. predict error 1
  1825. dir: dir isU
  1826. \245: O: O490 (predict-no)
  1827. I see 0 and I'm going to do: predict-no
  1828. ENV: Agent did: predict-no for direction U in state State-B
  1829. In State-B moving U
  1830. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1831. predict error 0
  1832. dir: dir isR
  1833. -/|246: O: O492 (predict-no)
  1834. I see 1 and I'm going to do: predict-no
  1835. ENV: Agent did: predict-no for direction R in state State-B
  1836. In State-B moving R
  1837. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1838. predict error 0
  1839. dir: dir isR
  1840. \-/247: O: O494 (predict-no)
  1841. I see 1 and I'm going to do: predict-no
  1842. ENV: Agent did: predict-no for direction R in state State-B
  1843. In State-B moving R
  1844. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1845. predict error 0
  1846. dir: dir isL
  1847. |\248: O: O495 (predict-yes)
  1848. I see 1 and I'm going to do: predict-yes
  1849. ENV: Agent did: predict-yes for direction L in state State-B
  1850. In State-B moving L
  1851. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1852. predict error 0
  1853. dir: dir isL
  1854. -/|\249: O: O498 (predict-no)
  1855. I see 1 and I'm going to do: predict-no
  1856. ENV: Agent did: predict-no for direction L in state State-A
  1857. In State-A moving L
  1858. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1859. predict error 0
  1860. dir: dir isL
  1861. -/|250: O: O500 (predict-no)
  1862. I see 1 and I'm going to do: predict-no
  1863. ENV: Agent did: predict-no for direction L in state State-A
  1864. In State-A moving L
  1865. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1866. predict error 0
  1867. dir: dir isU
  1868. \-251: O: O502 (predict-no)
  1869. I see 1 and I'm going to do: predict-no
  1870. ENV: Agent did: predict-no for direction U in state State-A
  1871. In State-A moving U
  1872. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1873. predict error 0
  1874. dir: dir isR
  1875. /252: O: O503 (predict-yes)
  1876. I see 1 and I'm going to do: predict-yes
  1877. ENV: Agent did: predict-yes for direction R in state State-A
  1878. In State-A moving R
  1879. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1880. predict error 0
  1881. dir: dir isU
  1882. |\253: O: O506 (predict-no)
  1883. I see 1 and I'm going to do: predict-no
  1884. ENV: Agent did: predict-no for direction U in state State-B
  1885. In State-B moving U
  1886. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1887. predict error 0
  1888. dir: dir isR
  1889. -254: O: O507 (predict-yes)
  1890. I see 1 and I'm going to do: predict-yes
  1891. ENV: Agent did: predict-yes for direction R in state State-B
  1892. In State-B moving R
  1893. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1894. predict error 1
  1895. dir: dir isL
  1896. /|255: O: O510 (predict-no)
  1897. I see 0 and I'm going to do: predict-no
  1898. ENV: Agent did: predict-no for direction L in state State-B
  1899. In State-B moving L
  1900. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1901. predict error 1
  1902. dir: dir isU
  1903. \-/256: O: O511 (predict-yes)
  1904. I see 0 and I'm going to do: predict-yes
  1905. ENV: Agent did: predict-yes for direction U in state State-A
  1906. In State-A moving U
  1907. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1908. predict error 1
  1909. dir: dir isU
  1910. |\-257: O: O514 (predict-no)
  1911. I see 0 and I'm going to do: predict-no
  1912. ENV: Agent did: predict-no for direction U in state State-A
  1913. In State-A moving U
  1914. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1915. predict error 0
  1916. dir: dir isL
  1917. /|258: O: O516 (predict-no)
  1918. I see 1 and I'm going to do: predict-no
  1919. ENV: Agent did: predict-no for direction L in state State-A
  1920. In State-A moving L
  1921. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1922. predict error 0
  1923. dir: dir isU
  1924. \-/259: O: O518 (predict-no)
  1925. I see 1 and I'm going to do: predict-no
  1926. ENV: Agent did: predict-no for direction U in state State-A
  1927. In State-A moving U
  1928. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1929. predict error 0
  1930. dir: dir isL
  1931. |\-260: O: O520 (predict-no)
  1932. I see 1 and I'm going to do: predict-no
  1933. ENV: Agent did: predict-no for direction L in state State-A
  1934. In State-A moving L
  1935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1936. predict error 0
  1937. dir: dir isL
  1938. /|261: O: O522 (predict-no)
  1939. I see 1 and I'm going to do: predict-no
  1940. ENV: Agent did: predict-no for direction L in state State-A
  1941. In State-A moving L
  1942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1943. predict error 0
  1944. dir: dir isU
  1945. \262: O: O524 (predict-no)
  1946. I see 1 and I'm going to do: predict-no
  1947. ENV: Agent did: predict-no for direction U in state State-A
  1948. In State-A moving U
  1949. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1950. predict error 0
  1951. dir: dir isL
  1952. -/|263: O: O526 (predict-no)
  1953. I see 1 and I'm going to do: predict-no
  1954. ENV: Agent did: predict-no for direction L in state State-A
  1955. In State-A moving L
  1956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1957. predict error 0
  1958. dir: dir isL
  1959. \-/264: O: O528 (predict-no)
  1960. I see 1 and I'm going to do: predict-no
  1961. ENV: Agent did: predict-no for direction L in state State-A
  1962. In State-A moving L
  1963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1964. predict error 0
  1965. dir: dir isU
  1966. |\-265: O: O530 (predict-no)
  1967. I see 1 and I'm going to do: predict-no
  1968. ENV: Agent did: predict-no for direction U in state State-A
  1969. In State-A moving U
  1970. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1971. predict error 0
  1972. dir: dir isR
  1973. /|266: O: O532 (predict-no)
  1974. I see 1 and I'm going to do: predict-no
  1975. ENV: Agent did: predict-no for direction R in state State-A
  1976. In State-A moving R
  1977. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1978. predict error 1
  1979. dir: dir isL
  1980. \-/267: O: O534 (predict-no)
  1981. I see 0 and I'm going to do: predict-no
  1982. ENV: Agent did: predict-no for direction L in state State-B
  1983. In State-B moving L
  1984. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1985. predict error 1
  1986. dir: dir isL
  1987. |\-268: O: O536 (predict-no)
  1988. I see 0 and I'm going to do: predict-no
  1989. ENV: Agent did: predict-no for direction L in state State-A
  1990. In State-A moving L
  1991. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1992. predict error 0
  1993. dir: dir isL
  1994. /269: O: O538 (predict-no)
  1995. I see 1 and I'm going to do: predict-no
  1996. ENV: Agent did: predict-no for direction L in state State-A
  1997. In State-A moving L
  1998. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1999. predict error 0
  2000. dir: dir isU
  2001. |\270: O: O540 (predict-no)
  2002. I see 1 and I'm going to do: predict-no
  2003. ENV: Agent did: predict-no for direction U in state State-A
  2004. In State-A moving U
  2005. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2006. predict error 0
  2007. dir: dir isL
  2008. -/271: O: O542 (predict-no)
  2009. I see 1 and I'm going to do: predict-no
  2010. ENV: Agent did: predict-no for direction L in state State-A
  2011. In State-A moving L
  2012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2013. predict error 0
  2014. dir: dir isU
  2015. |272: O: O544 (predict-no)
  2016. I see 1 and I'm going to do: predict-no
  2017. ENV: Agent did: predict-no for direction U in state State-A
  2018. In State-A moving U
  2019. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2020. predict error 0
  2021. dir: dir isR
  2022. \-/273: O: O545 (predict-yes)
  2023. I see 1 and I'm going to do: predict-yes
  2024. ENV: Agent did: predict-yes for direction R in state State-A
  2025. In State-A moving R
  2026. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2027. predict error 0
  2028. dir: dir isU
  2029. |274: O: O548 (predict-no)
  2030. I see 1 and I'm going to do: predict-no
  2031. ENV: Agent did: predict-no for direction U in state State-B
  2032. In State-B moving U
  2033. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2034. predict error 0
  2035. dir: dir isU
  2036. \-275: O: O550 (predict-no)
  2037. I see 1 and I'm going to do: predict-no
  2038. ENV: Agent did: predict-no for direction U in state State-B
  2039. In State-B moving U
  2040. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2041. predict error 0
  2042. dir: dir isL
  2043. /|276: O: O551 (predict-yes)
  2044. I see 1 and I'm going to do: predict-yes
  2045. ENV: Agent did: predict-yes for direction L in state State-B
  2046. In State-B moving L
  2047. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2048. predict error 0
  2049. dir: dir isL
  2050. \-/277: O: O554 (predict-no)
  2051. I see 1 and I'm going to do: predict-no
  2052. ENV: Agent did: predict-no for direction L in state State-A
  2053. In State-A moving L
  2054. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2055. predict error 0
  2056. dir: dir isR
  2057. |\278: O: O555 (predict-yes)
  2058. I see 1 and I'm going to do: predict-yes
  2059. ENV: Agent did: predict-yes for direction R in state State-A
  2060. In State-A moving R
  2061. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2062. predict error 0
  2063. dir: dir isL
  2064. -/279: O: O557 (predict-yes)
  2065. I see 1 and I'm going to do: predict-yes
  2066. ENV: Agent did: predict-yes for direction L in state State-B
  2067. In State-B moving L
  2068. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2069. predict error 0
  2070. dir: dir isR
  2071. |\-280: O: O559 (predict-yes)
  2072. I see 1 and I'm going to do: predict-yes
  2073. ENV: Agent did: predict-yes for direction R in state State-A
  2074. In State-A moving R
  2075. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2076. predict error 0
  2077. dir: dir isL
  2078. /|281: O: O561 (predict-yes)
  2079. I see 1 and I'm going to do: predict-yes
  2080. ENV: Agent did: predict-yes for direction L in state State-B
  2081. In State-B moving L
  2082. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2083. predict error 0
  2084. dir: dir isL
  2085. \282: O: O563 (predict-yes)
  2086. I see 1 and I'm going to do: predict-yes
  2087. ENV: Agent did: predict-yes for direction L in state State-A
  2088. In State-A moving L
  2089. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2090. predict error 1
  2091. dir: dir isU
  2092. -/|283: O: O566 (predict-no)
  2093. I see 0 and I'm going to do: predict-no
  2094. ENV: Agent did: predict-no for direction U in state State-A
  2095. In State-A moving U
  2096. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2097. predict error 0
  2098. dir: dir isL
  2099. \-284: O: O568 (predict-no)
  2100. I see 1 and I'm going to do: predict-no
  2101. ENV: Agent did: predict-no for direction L in state State-A
  2102. In State-A moving L
  2103. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2104. predict error 0
  2105. dir: dir isR
  2106. /|285: O: O569 (predict-yes)
  2107. I see 1 and I'm going to do: predict-yes
  2108. ENV: Agent did: predict-yes for direction R in state State-A
  2109. In State-A moving R
  2110. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2111. predict error 0
  2112. dir: dir isR
  2113. \-/|286: O: O572 (predict-no)
  2114. I see 1 and I'm going to do: predict-no
  2115. ENV: Agent did: predict-no for direction R in state State-B
  2116. In State-B moving R
  2117. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2118. predict error 0
  2119. dir: dir isL
  2120. \-/287: O: O574 (predict-no)
  2121. I see 1 and I'm going to do: predict-no
  2122. ENV: Agent did: predict-no for direction L in state State-B
  2123. In State-B moving L
  2124. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2125. predict error 1
  2126. dir: dir isL
  2127. |\-288: O: O576 (predict-no)
  2128. I see 0 and I'm going to do: predict-no
  2129. ENV: Agent did: predict-no for direction L in state State-A
  2130. In State-A moving L
  2131. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2132. predict error 0
  2133. dir: dir isU
  2134. /|\289: O: O578 (predict-no)
  2135. I see 1 and I'm going to do: predict-no
  2136. ENV: Agent did: predict-no for direction U in state State-A
  2137. In State-A moving U
  2138. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2139. predict error 0
  2140. dir: dir isU
  2141. -/|290: O: O580 (predict-no)
  2142. I see 1 and I'm going to do: predict-no
  2143. ENV: Agent did: predict-no for direction U in state State-A
  2144. In State-A moving U
  2145. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2146. predict error 0
  2147. dir: dir isU
  2148. \-/291: O: O582 (predict-no)
  2149. I see 1 and I'm going to do: predict-no
  2150. ENV: Agent did: predict-no for direction U in state State-A
  2151. In State-A moving U
  2152. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2153. predict error 0
  2154. dir: dir isL
  2155. |292: O: O584 (predict-no)
  2156. I see 1 and I'm going to do: predict-no
  2157. ENV: Agent did: predict-no for direction L in state State-A
  2158. In State-A moving L
  2159. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2160. predict error 0
  2161. dir: dir isL
  2162. \-293: O: O586 (predict-no)
  2163. I see 1 and I'm going to do: predict-no
  2164. ENV: Agent did: predict-no for direction L in state State-A
  2165. In State-A moving L
  2166. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2167. predict error 0
  2168. dir: dir isR
  2169. /|\294: O: O587 (predict-yes)
  2170. I see 1 and I'm going to do: predict-yes
  2171. ENV: Agent did: predict-yes for direction R in state State-A
  2172. In State-A moving R
  2173. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2174. predict error 0
  2175. dir: dir isU
  2176. -/|295: O: O590 (predict-no)
  2177. I see 1 and I'm going to do: predict-no
  2178. ENV: Agent did: predict-no for direction U in state State-B
  2179. In State-B moving U
  2180. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2181. predict error 0
  2182. dir: dir isR
  2183. \296: O: O592 (predict-no)
  2184. I see 1 and I'm going to do: predict-no
  2185. ENV: Agent did: predict-no for direction R in state State-B
  2186. In State-B moving R
  2187. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2188. predict error 0
  2189. dir: dir isU
  2190. -/|297: O: O594 (predict-no)
  2191. I see 1 and I'm going to do: predict-no
  2192. ENV: Agent did: predict-no for direction U in state State-B
  2193. In State-B moving U
  2194. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2195. predict error 0
  2196. dir: dir isR
  2197. \-298: O: O596 (predict-no)
  2198. I see 1 and I'm going to do: predict-no
  2199. ENV: Agent did: predict-no for direction R in state State-B
  2200. In State-B moving R
  2201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2202. predict error 0
  2203. dir: dir isL
  2204. /|\299: O: O597 (predict-yes)
  2205. I see 1 and I'm going to do: predict-yes
  2206. ENV: Agent did: predict-yes for direction L in state State-B
  2207. In State-B moving L
  2208. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2209. predict error 0
  2210. dir: dir isR
  2211. -/|300: O: O599 (predict-yes)
  2212. I see 1 and I'm going to do: predict-yes
  2213. ENV: Agent did: predict-yes for direction R in state State-A
  2214. In State-A moving R
  2215. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2216. predict error 0
  2217. dir: dir isL
  2218. \-/|\-301: O: O601 (predict-yes)
  2219. I see 1 and I'm going to do: predict-yes
  2220. ENV: Agent did: predict-yes for direction L in state State-B
  2221. In State-B moving L
  2222. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2223. predict error 0
  2224. dir: dir isL
  2225. /302: O: O604 (predict-no)
  2226. I see 1 and I'm going to do: predict-no
  2227. ENV: Agent did: predict-no for direction L in state State-A
  2228. In State-A moving L
  2229. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2230. predict error 0
  2231. dir: dir isL
  2232. |\303: O: O606 (predict-no)
  2233. I see 1 and I'm going to do: predict-no
  2234. ENV: Agent did: predict-no for direction L in state State-A
  2235. In State-A moving L
  2236. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2237. predict error 0
  2238. dir: dir isL
  2239. -/|304: O: O608 (predict-no)
  2240. I see 1 and I'm going to do: predict-no
  2241. ENV: Agent did: predict-no for direction L in state State-A
  2242. In State-A moving L
  2243. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2244. predict error 0
  2245. dir: dir isU
  2246. \-/305: O: O610 (predict-no)
  2247. I see 1 and I'm going to do: predict-no
  2248. ENV: Agent did: predict-no for direction U in state State-A
  2249. In State-A moving U
  2250. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2251. predict error 0
  2252. dir: dir isR
  2253. |\-306: O: O611 (predict-yes)
  2254. I see 1 and I'm going to do: predict-yes
  2255. ENV: Agent did: predict-yes for direction R in state State-A
  2256. In State-A moving R
  2257. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2258. predict error 0
  2259. dir: dir isR
  2260. /|\307: O: O614 (predict-no)
  2261. I see 1 and I'm going to do: predict-no
  2262. ENV: Agent did: predict-no for direction R in state State-B
  2263. In State-B moving R
  2264. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2265. predict error 0
  2266. dir: dir isR
  2267. -/|308: O: O616 (predict-no)
  2268. I see 1 and I'm going to do: predict-no
  2269. ENV: Agent did: predict-no for direction R in state State-B
  2270. In State-B moving R
  2271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2272. predict error 0
  2273. dir: dir isU
  2274. \-/309: O: O618 (predict-no)
  2275. I see 1 and I'm going to do: predict-no
  2276. ENV: Agent did: predict-no for direction U in state State-B
  2277. In State-B moving U
  2278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2279. predict error 0
  2280. dir: dir isR
  2281. |\-310: O: O620 (predict-no)
  2282. I see 1 and I'm going to do: predict-no
  2283. ENV: Agent did: predict-no for direction R in state State-B
  2284. In State-B moving R
  2285. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2286. predict error 0
  2287. dir: dir isL
  2288. /|\311: O: O621 (predict-yes)
  2289. I see 1 and I'm going to do: predict-yes
  2290. ENV: Agent did: predict-yes for direction L in state State-B
  2291. In State-B moving L
  2292. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2293. predict error 0
  2294. dir: dir isL
  2295. -312: O: O624 (predict-no)
  2296. I see 1 and I'm going to do: predict-no
  2297. ENV: Agent did: predict-no for direction L in state State-A
  2298. In State-A moving L
  2299. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2300. predict error 0
  2301. dir: dir isL
  2302. /|\313: O: O626 (predict-no)
  2303. I see 1 and I'm going to do: predict-no
  2304. ENV: Agent did: predict-no for direction L in state State-A
  2305. In State-A moving L
  2306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2307. predict error 0
  2308. dir: dir isU
  2309. -/314: O: O628 (predict-no)
  2310. I see 1 and I'm going to do: predict-no
  2311. ENV: Agent did: predict-no for direction U in state State-A
  2312. In State-A moving U
  2313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2314. predict error 0
  2315. dir: dir isU
  2316. |\315: O: O630 (predict-no)
  2317. I see 1 and I'm going to do: predict-no
  2318. ENV: Agent did: predict-no for direction U in state State-A
  2319. In State-A moving U
  2320. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2321. predict error 0
  2322. dir: dir isL
  2323. -/316: O: O632 (predict-no)
  2324. I see 1 and I'm going to do: predict-no
  2325. ENV: Agent did: predict-no for direction L in state State-A
  2326. In State-A moving L
  2327. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2328. predict error 0
  2329. dir: dir isR
  2330. |\-317: O: O634 (predict-no)
  2331. I see 1 and I'm going to do: predict-no
  2332. ENV: Agent did: predict-no for direction R in state State-A
  2333. In State-A moving R
  2334. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2335. predict error 1
  2336. dir: dir isR
  2337. /|318: O: O636 (predict-no)
  2338. I see 0 and I'm going to do: predict-no
  2339. ENV: Agent did: predict-no for direction R in state State-B
  2340. In State-B moving R
  2341. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2342. predict error 0
  2343. dir: dir isR
  2344. \-/319: O: O638 (predict-no)
  2345. I see 1 and I'm going to do: predict-no
  2346. ENV: Agent did: predict-no for direction R in state State-B
  2347. In State-B moving R
  2348. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2349. predict error 0
  2350. dir: dir isR
  2351. |\-320: O: O640 (predict-no)
  2352. I see 1 and I'm going to do: predict-no
  2353. ENV: Agent did: predict-no for direction R in state State-B
  2354. In State-B moving R
  2355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2356. predict error 0
  2357. dir: dir isL
  2358. /|321: O: O641 (predict-yes)
  2359. I see 1 and I'm going to do: predict-yes
  2360. ENV: Agent did: predict-yes for direction L in state State-B
  2361. In State-B moving L
  2362. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2363. predict error 0
  2364. dir: dir isL
  2365. \322: O: O643 (predict-yes)
  2366. I see 1 and I'm going to do: predict-yes
  2367. ENV: Agent did: predict-yes for direction L in state State-A
  2368. In State-A moving L
  2369. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2370. predict error 1
  2371. dir: dir isL
  2372. -/|323: O: O645 (predict-yes)
  2373. I see 0 and I'm going to do: predict-yes
  2374. ENV: Agent did: predict-yes for direction L in state State-A
  2375. In State-A moving L
  2376. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2377. predict error 1
  2378. dir: dir isL
  2379. \-/324: O: O648 (predict-no)
  2380. I see 0 and I'm going to do: predict-no
  2381. ENV: Agent did: predict-no for direction L in state State-A
  2382. In State-A moving L
  2383. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2384. predict error 0
  2385. dir: dir isR
  2386. |\325: O: O649 (predict-yes)
  2387. I see 1 and I'm going to do: predict-yes
  2388. ENV: Agent did: predict-yes for direction R in state State-A
  2389. In State-A moving R
  2390. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2391. predict error 0
  2392. dir: dir isL
  2393. -/|326: O: O651 (predict-yes)
  2394. I see 1 and I'm going to do: predict-yes
  2395. ENV: Agent did: predict-yes for direction L in state State-B
  2396. In State-B moving L
  2397. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2398. predict error 0
  2399. dir: dir isL
  2400. \-/327: O: O654 (predict-no)
  2401. I see 1 and I'm going to do: predict-no
  2402. ENV: Agent did: predict-no for direction L in state State-A
  2403. In State-A moving L
  2404. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2405. predict error 0
  2406. dir: dir isR
  2407. |\-328: O: O655 (predict-yes)
  2408. I see 1 and I'm going to do: predict-yes
  2409. ENV: Agent did: predict-yes for direction R in state State-A
  2410. In State-A moving R
  2411. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2412. predict error 0
  2413. dir: dir isL
  2414. /|\329: O: O657 (predict-yes)
  2415. I see 1 and I'm going to do: predict-yes
  2416. ENV: Agent did: predict-yes for direction L in state State-B
  2417. In State-B moving L
  2418. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2419. predict error 0
  2420. dir: dir isU
  2421. -/|330: O: O660 (predict-no)
  2422. I see 1 and I'm going to do: predict-no
  2423. ENV: Agent did: predict-no for direction U in state State-A
  2424. In State-A moving U
  2425. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2426. predict error 0
  2427. dir: dir isR
  2428. \-331: O: O661 (predict-yes)
  2429. I see 1 and I'm going to do: predict-yes
  2430. ENV: Agent did: predict-yes for direction R in state State-A
  2431. In State-A moving R
  2432. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2433. predict error 0
  2434. dir: dir isU
  2435. /332: O: O663 (predict-yes)
  2436. I see 1 and I'm going to do: predict-yes
  2437. ENV: Agent did: predict-yes for direction U in state State-B
  2438. In State-B moving U
  2439. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2440. predict error 1
  2441. dir: dir isL
  2442. |\-333: O: O665 (predict-yes)
  2443. I see 0 and I'm going to do: predict-yes
  2444. ENV: Agent did: predict-yes for direction L in state State-B
  2445. In State-B moving L
  2446. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2447. predict error 0
  2448. dir: dir isR
  2449. /|334: O: O667 (predict-yes)
  2450. I see 1 and I'm going to do: predict-yes
  2451. ENV: Agent did: predict-yes for direction R in state State-A
  2452. In State-A moving R
  2453. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2454. predict error 0
  2455. dir: dir isU
  2456. \-/335: O: O670 (predict-no)
  2457. I see 1 and I'm going to do: predict-no
  2458. ENV: Agent did: predict-no for direction U in state State-B
  2459. In State-B moving U
  2460. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2461. predict error 0
  2462. dir: dir isL
  2463. |\-336: O: O671 (predict-yes)
  2464. I see 1 and I'm going to do: predict-yes
  2465. ENV: Agent did: predict-yes for direction L in state State-B
  2466. In State-B moving L
  2467. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2468. predict error 0
  2469. dir: dir isU
  2470. /|\337: O: O673 (predict-yes)
  2471. I see 1 and I'm going to do: predict-yes
  2472. ENV: Agent did: predict-yes for direction U in state State-A
  2473. In State-A moving U
  2474. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2475. predict error 1
  2476. dir: dir isL
  2477. -/338: O: O676 (predict-no)
  2478. I see 0 and I'm going to do: predict-no
  2479. ENV: Agent did: predict-no for direction L in state State-A
  2480. In State-A moving L
  2481. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2482. predict error 0
  2483. dir: dir isU
  2484. |\339: O: O678 (predict-no)
  2485. I see 1 and I'm going to do: predict-no
  2486. ENV: Agent did: predict-no for direction U in state State-A
  2487. In State-A moving U
  2488. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2489. predict error 0
  2490. dir: dir isU
  2491. -340: O: O680 (predict-no)
  2492. I see 1 and I'm going to do: predict-no
  2493. ENV: Agent did: predict-no for direction U in state State-A
  2494. In State-A moving U
  2495. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2496. predict error 0
  2497. dir: dir isU
  2498. /|341: O: O682 (predict-no)
  2499. I see 1 and I'm going to do: predict-no
  2500. ENV: Agent did: predict-no for direction U in state State-A
  2501. In State-A moving U
  2502. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2503. predict error 0
  2504. dir: dir isL
  2505. \342: O: O684 (predict-no)
  2506. I see 1 and I'm going to do: predict-no
  2507. ENV: Agent did: predict-no for direction L in state State-A
  2508. In State-A moving L
  2509. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2510. predict error 0
  2511. dir: dir isL
  2512. -/|343: O: O686 (predict-no)
  2513. I see 1 and I'm going to do: predict-no
  2514. ENV: Agent did: predict-no for direction L in state State-A
  2515. In State-A moving L
  2516. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2517. predict error 0
  2518. dir: dir isR
  2519. \-/344: O: O687 (predict-yes)
  2520. I see 1 and I'm going to do: predict-yes
  2521. ENV: Agent did: predict-yes for direction R in state State-A
  2522. In State-A moving R
  2523. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2524. predict error 0
  2525. dir: dir isU
  2526. |\-345: O: O689 (predict-yes)
  2527. I see 1 and I'm going to do: predict-yes
  2528. ENV: Agent did: predict-yes for direction U in state State-B
  2529. In State-B moving U
  2530. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2531. predict error 1
  2532. dir: dir isL
  2533. /|\-sleeping...
  2534. /346: O: O691 (predict-yes)
  2535. I see 0 and I'm going to do: predict-yes
  2536. ENV: Agent did: predict-yes for direction L in state State-B
  2537. In State-B moving L
  2538. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2539. predict error 0
  2540. dir: dir isU
  2541. |\-347: O: O693 (predict-yes)
  2542. I see 1 and I'm going to do: predict-yes
  2543. ENV: Agent did: predict-yes for direction U in state State-A
  2544. In State-A moving U
  2545. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2546. predict error 1
  2547. dir: dir isL
  2548. /|\348: O: O696 (predict-no)
  2549. I see 0 and I'm going to do: predict-no
  2550. ENV: Agent did: predict-no for direction L in state State-A
  2551. In State-A moving L
  2552. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2553. predict error 0
  2554. dir: dir isU
  2555. -/|349: O: O698 (predict-no)
  2556. I see 1 and I'm going to do: predict-no
  2557. ENV: Agent did: predict-no for direction U in state State-A
  2558. In State-A moving U
  2559. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2560. predict error 0
  2561. dir: dir isL
  2562. \-/350: O: O700 (predict-no)
  2563. I see 1 and I'm going to do: predict-no
  2564. ENV: Agent did: predict-no for direction L in state State-A
  2565. In State-A moving L
  2566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2567. predict error 0
  2568. dir: dir isL
  2569. |\-351: O: O702 (predict-no)
  2570. I see 1 and I'm going to do: predict-no
  2571. ENV: Agent did: predict-no for direction L in state State-A
  2572. In State-A moving L
  2573. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2574. predict error 0
  2575. dir: dir isU
  2576. /352: O: O704 (predict-no)
  2577. I see 1 and I'm going to do: predict-no
  2578. ENV: Agent did: predict-no for direction U in state State-A
  2579. In State-A moving U
  2580. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2581. predict error 0
  2582. dir: dir isU
  2583. |\353: O: O706 (predict-no)
  2584. I see 1 and I'm going to do: predict-no
  2585. ENV: Agent did: predict-no for direction U in state State-A
  2586. In State-A moving U
  2587. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2588. predict error 0
  2589. dir: dir isU
  2590. -/|354: O: O708 (predict-no)
  2591. I see 1 and I'm going to do: predict-no
  2592. ENV: Agent did: predict-no for direction U in state State-A
  2593. In State-A moving U
  2594. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2595. predict error 0
  2596. dir: dir isU
  2597. \-/355: O: O710 (predict-no)
  2598. I see 1 and I'm going to do: predict-no
  2599. ENV: Agent did: predict-no for direction U in state State-A
  2600. In State-A moving U
  2601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2602. predict error 0
  2603. dir: dir isU
  2604. |\-356: O: O712 (predict-no)
  2605. I see 1 and I'm going to do: predict-no
  2606. ENV: Agent did: predict-no for direction U in state State-A
  2607. In State-A moving U
  2608. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2609. predict error 0
  2610. dir: dir isU
  2611. /|\357: O: O714 (predict-no)
  2612. I see 1 and I'm going to do: predict-no
  2613. ENV: Agent did: predict-no for direction U in state State-A
  2614. In State-A moving U
  2615. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2616. predict error 0
  2617. dir: dir isL
  2618. -/|358: O: O716 (predict-no)
  2619. I see 1 and I'm going to do: predict-no
  2620. ENV: Agent did: predict-no for direction L in state State-A
  2621. In State-A moving L
  2622. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2623. predict error 0
  2624. dir: dir isR
  2625. \-/359: O: O718 (predict-no)
  2626. I see 1 and I'm going to do: predict-no
  2627. ENV: Agent did: predict-no for direction R in state State-A
  2628. In State-A moving R
  2629. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2630. predict error 1
  2631. dir: dir isL
  2632. |\360: O: O719 (predict-yes)
  2633. I see 0 and I'm going to do: predict-yes
  2634. ENV: Agent did: predict-yes for direction L in state State-B
  2635. In State-B moving L
  2636. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2637. predict error 0
  2638. dir: dir isU
  2639. -/|361: O: O722 (predict-no)
  2640. I see 1 and I'm going to do: predict-no
  2641. ENV: Agent did: predict-no for direction U in state State-A
  2642. In State-A moving U
  2643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2644. predict error 0
  2645. dir: dir isU
  2646. \362: O: O724 (predict-no)
  2647. I see 1 and I'm going to do: predict-no
  2648. ENV: Agent did: predict-no for direction U in state State-A
  2649. In State-A moving U
  2650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2651. predict error 0
  2652. dir: dir isL
  2653. -/|363: O: O726 (predict-no)
  2654. I see 1 and I'm going to do: predict-no
  2655. ENV: Agent did: predict-no for direction L in state State-A
  2656. In State-A moving L
  2657. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2658. predict error 0
  2659. dir: dir isL
  2660. \-/364: O: O728 (predict-no)
  2661. I see 1 and I'm going to do: predict-no
  2662. ENV: Agent did: predict-no for direction L in state State-A
  2663. In State-A moving L
  2664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2665. predict error 0
  2666. dir: dir isU
  2667. |\365: O: O730 (predict-no)
  2668. I see 1 and I'm going to do: predict-no
  2669. ENV: Agent did: predict-no for direction U in state State-A
  2670. In State-A moving U
  2671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2672. predict error 0
  2673. dir: dir isU
  2674. -/|366: O: O732 (predict-no)
  2675. I see 1 and I'm going to do: predict-no
  2676. ENV: Agent did: predict-no for direction U in state State-A
  2677. In State-A moving U
  2678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2679. predict error 0
  2680. dir: dir isR
  2681. \-/367: O: O733 (predict-yes)
  2682. I see 1 and I'm going to do: predict-yes
  2683. ENV: Agent did: predict-yes for direction R in state State-A
  2684. In State-A moving R
  2685. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2686. predict error 0
  2687. dir: dir isR
  2688. |\368: O: O735 (predict-yes)
  2689. I see 1 and I'm going to do: predict-yes
  2690. ENV: Agent did: predict-yes for direction R in state State-B
  2691. In State-B moving R
  2692. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2693. predict error 1
  2694. dir: dir isU
  2695. -/|369: O: O738 (predict-no)
  2696. I see 0 and I'm going to do: predict-no
  2697. ENV: Agent did: predict-no for direction U in state State-B
  2698. In State-B moving U
  2699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2700. predict error 0
  2701. dir: dir isR
  2702. \-/370: O: O740 (predict-no)
  2703. I see 1 and I'm going to do: predict-no
  2704. ENV: Agent did: predict-no for direction R in state State-B
  2705. In State-B moving R
  2706. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2707. predict error 0
  2708. dir: dir isR
  2709. |\371: O: O742 (predict-no)
  2710. I see 1 and I'm going to do: predict-no
  2711. ENV: Agent did: predict-no for direction R in state State-B
  2712. In State-B moving R
  2713. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2714. predict error 0
  2715. dir: dir isR
  2716. -372: O: O744 (predict-no)
  2717. I see 1 and I'm going to do: predict-no
  2718. ENV: Agent did: predict-no for direction R in state State-B
  2719. In State-B moving R
  2720. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2721. predict error 0
  2722. dir: dir isL
  2723. /|\373: O: O745 (predict-yes)
  2724. I see 1 and I'm going to do: predict-yes
  2725. ENV: Agent did: predict-yes for direction L in state State-B
  2726. In State-B moving L
  2727. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2728. predict error 0
  2729. dir: dir isL
  2730. -/374: O: O748 (predict-no)
  2731. I see 1 and I'm going to do: predict-no
  2732. ENV: Agent did: predict-no for direction L in state State-A
  2733. In State-A moving L
  2734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2735. predict error 0
  2736. dir: dir isR
  2737. |\-375: O: O749 (predict-yes)
  2738. I see 1 and I'm going to do: predict-yes
  2739. ENV: Agent did: predict-yes for direction R in state State-A
  2740. In State-A moving R
  2741. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2742. predict error 0
  2743. dir: dir isR
  2744. /|\376: O: O752 (predict-no)
  2745. I see 1 and I'm going to do: predict-no
  2746. ENV: Agent did: predict-no for direction R in state State-B
  2747. In State-B moving R
  2748. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2749. predict error 0
  2750. dir: dir isR
  2751. -/|377: O: O754 (predict-no)
  2752. I see 1 and I'm going to do: predict-no
  2753. ENV: Agent did: predict-no for direction R in state State-B
  2754. In State-B moving R
  2755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2756. predict error 0
  2757. dir: dir isL
  2758. \-378: O: O755 (predict-yes)
  2759. I see 1 and I'm going to do: predict-yes
  2760. ENV: Agent did: predict-yes for direction L in state State-B
  2761. In State-B moving L
  2762. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2763. predict error 0
  2764. dir: dir isR
  2765. /|\379: O: O757 (predict-yes)
  2766. I see 1 and I'm going to do: predict-yes
  2767. ENV: Agent did: predict-yes for direction R in state State-A
  2768. In State-A moving R
  2769. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2770. predict error 0
  2771. dir: dir isL
  2772. -/|380: O: O759 (predict-yes)
  2773. I see 1 and I'm going to do: predict-yes
  2774. ENV: Agent did: predict-yes for direction L in state State-B
  2775. In State-B moving L
  2776. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2777. predict error 0
  2778. dir: dir isL
  2779. \-/381: O: O762 (predict-no)
  2780. I see 1 and I'm going to do: predict-no
  2781. ENV: Agent did: predict-no for direction L in state State-A
  2782. In State-A moving L
  2783. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2784. predict error 0
  2785. dir: dir isL
  2786. |382: O: O764 (predict-no)
  2787. I see 1 and I'm going to do: predict-no
  2788. ENV: Agent did: predict-no for direction L in state State-A
  2789. In State-A moving L
  2790. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2791. predict error 0
  2792. dir: dir isU
  2793. \-/383: O: O766 (predict-no)
  2794. I see 1 and I'm going to do: predict-no
  2795. ENV: Agent did: predict-no for direction U in state State-A
  2796. In State-A moving U
  2797. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2798. predict error 0
  2799. dir: dir isR
  2800. |\384: O: O767 (predict-yes)
  2801. I see 1 and I'm going to do: predict-yes
  2802. ENV: Agent did: predict-yes for direction R in state State-A
  2803. In State-A moving R
  2804. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2805. predict error 0
  2806. dir: dir isR
  2807. -/|385: O: O770 (predict-no)
  2808. I see 1 and I'm going to do: predict-no
  2809. ENV: Agent did: predict-no for direction R in state State-B
  2810. In State-B moving R
  2811. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2812. predict error 0
  2813. dir: dir isR
  2814. \-386: O: O772 (predict-no)
  2815. I see 1 and I'm going to do: predict-no
  2816. ENV: Agent did: predict-no for direction R in state State-B
  2817. In State-B moving R
  2818. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2819. predict error 0
  2820. dir: dir isL
  2821. /|\387: O: O773 (predict-yes)
  2822. I see 1 and I'm going to do: predict-yes
  2823. ENV: Agent did: predict-yes for direction L in state State-B
  2824. In State-B moving L
  2825. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2826. predict error 0
  2827. dir: dir isL
  2828. -/|388: O: O776 (predict-no)
  2829. I see 1 and I'm going to do: predict-no
  2830. ENV: Agent did: predict-no for direction L in state State-A
  2831. In State-A moving L
  2832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2833. predict error 0
  2834. dir: dir isR
  2835. \-389: O: O777 (predict-yes)
  2836. I see 1 and I'm going to do: predict-yes
  2837. ENV: Agent did: predict-yes for direction R in state State-A
  2838. In State-A moving R
  2839. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2840. predict error 0
  2841. dir: dir isR
  2842. /|\390: O: O779 (predict-yes)
  2843. I see 1 and I'm going to do: predict-yes
  2844. ENV: Agent did: predict-yes for direction R in state State-B
  2845. In State-B moving R
  2846. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2847. predict error 1
  2848. dir: dir isR
  2849. -/|391: O: O782 (predict-no)
  2850. I see 0 and I'm going to do: predict-no
  2851. ENV: Agent did: predict-no for direction R in state State-B
  2852. In State-B moving R
  2853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2854. predict error 0
  2855. dir: dir isR
  2856. \392: O: O784 (predict-no)
  2857. I see 1 and I'm going to do: predict-no
  2858. ENV: Agent did: predict-no for direction R in state State-B
  2859. In State-B moving R
  2860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2861. predict error 0
  2862. dir: dir isU
  2863. -/|393: O: O786 (predict-no)
  2864. I see 1 and I'm going to do: predict-no
  2865. ENV: Agent did: predict-no for direction U in state State-B
  2866. In State-B moving U
  2867. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2868. predict error 0
  2869. dir: dir isU
  2870. \-/394: O: O788 (predict-no)
  2871. I see 1 and I'm going to do: predict-no
  2872. ENV: Agent did: predict-no for direction U in state State-B
  2873. In State-B moving U
  2874. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2875. predict error 0
  2876. dir: dir isL
  2877. |\395: O: O789 (predict-yes)
  2878. I see 1 and I'm going to do: predict-yes
  2879. ENV: Agent did: predict-yes for direction L in state State-B
  2880. In State-B moving L
  2881. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2882. predict error 0
  2883. dir: dir isR
  2884. -/|396: O: O791 (predict-yes)
  2885. I see 1 and I'm going to do: predict-yes
  2886. ENV: Agent did: predict-yes for direction R in state State-A
  2887. In State-A moving R
  2888. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2889. predict error 0
  2890. dir: dir isR
  2891. \-397: O: O794 (predict-no)
  2892. I see 1 and I'm going to do: predict-no
  2893. ENV: Agent did: predict-no for direction R in state State-B
  2894. In State-B moving R
  2895. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2896. predict error 0
  2897. dir: dir isL
  2898. /|\398: O: O795 (predict-yes)
  2899. I see 1 and I'm going to do: predict-yes
  2900. ENV: Agent did: predict-yes for direction L in state State-B
  2901. In State-B moving L
  2902. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2903. predict error 0
  2904. dir: dir isR
  2905. -/|399: O: O797 (predict-yes)
  2906. I see 1 and I'm going to do: predict-yes
  2907. ENV: Agent did: predict-yes for direction R in state State-A
  2908. In State-A moving R
  2909. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2910. predict error 0
  2911. dir: dir isR
  2912. \-/400: O: O800 (predict-no)
  2913. I see 1 and I'm going to do: predict-no
  2914. ENV: Agent did: predict-no for direction R in state State-B
  2915. In State-B moving R
  2916. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2917. predict error 0
  2918. dir: dir isU
  2919. |\-401: O: O802 (predict-no)
  2920. I see 1 and I'm going to do: predict-no
  2921. ENV: Agent did: predict-no for direction U in state State-B
  2922. In State-B moving U
  2923. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2924. predict error 0
  2925. dir: dir isU
  2926. /402: O: O804 (predict-no)
  2927. I see 1 and I'm going to do: predict-no
  2928. ENV: Agent did: predict-no for direction U in state State-B
  2929. In State-B moving U
  2930. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2931. predict error 0
  2932. dir: dir isL
  2933. |\403: O: O805 (predict-yes)
  2934. I see 1 and I'm going to do: predict-yes
  2935. ENV: Agent did: predict-yes for direction L in state State-B
  2936. In State-B moving L
  2937. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2938. predict error 0
  2939. dir: dir isR
  2940. -/404: O: O807 (predict-yes)
  2941. I see 1 and I'm going to do: predict-yes
  2942. ENV: Agent did: predict-yes for direction R in state State-A
  2943. In State-A moving R
  2944. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2945. predict error 0
  2946. dir: dir isL
  2947. |\-405: O: O809 (predict-yes)
  2948. I see 1 and I'm going to do: predict-yes
  2949. ENV: Agent did: predict-yes for direction L in state State-B
  2950. In State-B moving L
  2951. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2952. predict error 0
  2953. dir: dir isL
  2954. /|406: O: O812 (predict-no)
  2955. I see 1 and I'm going to do: predict-no
  2956. ENV: Agent did: predict-no for direction L in state State-A
  2957. In State-A moving L
  2958. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2959. predict error 0
  2960. dir: dir isR
  2961. \-407: O: O813 (predict-yes)
  2962. I see 1 and I'm going to do: predict-yes
  2963. ENV: Agent did: predict-yes for direction R in state State-A
  2964. In State-A moving R
  2965. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2966. predict error 0
  2967. dir: dir isU
  2968. /|\408: O: O816 (predict-no)
  2969. I see 1 and I'm going to do: predict-no
  2970. ENV: Agent did: predict-no for direction U in state State-B
  2971. In State-B moving U
  2972. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2973. predict error 0
  2974. dir: dir isL
  2975. -/409: O: O817 (predict-yes)
  2976. I see 1 and I'm going to do: predict-yes
  2977. ENV: Agent did: predict-yes for direction L in state State-B
  2978. In State-B moving L
  2979. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2980. predict error 0
  2981. dir: dir isU
  2982. |\-410: O: O820 (predict-no)
  2983. I see 1 and I'm going to do: predict-no
  2984. ENV: Agent did: predict-no for direction U in state State-A
  2985. In State-A moving U
  2986. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2987. predict error 0
  2988. dir: dir isU
  2989. /|\411: O: O822 (predict-no)
  2990. I see 1 and I'm going to do: predict-no
  2991. ENV: Agent did: predict-no for direction U in state State-A
  2992. In State-A moving U
  2993. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2994. predict error 0
  2995. dir: dir isL
  2996. -412: O: O824 (predict-no)
  2997. I see 1 and I'm going to do: predict-no
  2998. ENV: Agent did: predict-no for direction L in state State-A
  2999. In State-A moving L
  3000. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3001. predict error 0
  3002. dir: dir isU
  3003. /|413: O: O826 (predict-no)
  3004. I see 1 and I'm going to do: predict-no
  3005. ENV: Agent did: predict-no for direction U in state State-A
  3006. In State-A moving U
  3007. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3008. predict error 0
  3009. dir: dir isU
  3010. \-/414: O: O828 (predict-no)
  3011. I see 1 and I'm going to do: predict-no
  3012. ENV: Agent did: predict-no for direction U in state State-A
  3013. In State-A moving U
  3014. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3015. predict error 0
  3016. dir: dir isR
  3017. |\-415: O: O830 (predict-no)
  3018. I see 1 and I'm going to do: predict-no
  3019. ENV: Agent did: predict-no for direction R in state State-A
  3020. In State-A moving R
  3021. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3022. predict error 1
  3023. dir: dir isU
  3024. /|\416: O: O831 (predict-yes)
  3025. I see 0 and I'm going to do: predict-yes
  3026. ENV: Agent did: predict-yes for direction U in state State-B
  3027. In State-B moving U
  3028. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3029. predict error 1
  3030. dir: dir isU
  3031. -/417: O: O834 (predict-no)
  3032. I see 0 and I'm going to do: predict-no
  3033. ENV: Agent did: predict-no for direction U in state State-B
  3034. In State-B moving U
  3035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3036. predict error 0
  3037. dir: dir isR
  3038. |\-418: O: O836 (predict-no)
  3039. I see 1 and I'm going to do: predict-no
  3040. ENV: Agent did: predict-no for direction R in state State-B
  3041. In State-B moving R
  3042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3043. predict error 0
  3044. dir: dir isU
  3045. /|419: O: O838 (predict-no)
  3046. I see 1 and I'm going to do: predict-no
  3047. ENV: Agent did: predict-no for direction U in state State-B
  3048. In State-B moving U
  3049. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3050. predict error 0
  3051. dir: dir isU
  3052. \-420: O: O840 (predict-no)
  3053. I see 1 and I'm going to do: predict-no
  3054. ENV: Agent did: predict-no for direction U in state State-B
  3055. In State-B moving U
  3056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3057. predict error 0
  3058. dir: dir isU
  3059. /421: O: O841 (predict-yes)
  3060. I see 1 and I'm going to do: predict-yes
  3061. ENV: Agent did: predict-yes for direction U in state State-B
  3062. In State-B moving U
  3063. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3064. predict error 1
  3065. dir: dir isR
  3066. |422: O: O844 (predict-no)
  3067. I see 0 and I'm going to do: predict-no
  3068. ENV: Agent did: predict-no for direction R in state State-B
  3069. In State-B moving R
  3070. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3071. predict error 0
  3072. dir: dir isL
  3073. \-/423: O: O845 (predict-yes)
  3074. I see 1 and I'm going to do: predict-yes
  3075. ENV: Agent did: predict-yes for direction L in state State-B
  3076. In State-B moving L
  3077. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3078. predict error 0
  3079. dir: dir isL
  3080. |\-424: O: O848 (predict-no)
  3081. I see 1 and I'm going to do: predict-no
  3082. ENV: Agent did: predict-no for direction L in state State-A
  3083. In State-A moving L
  3084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3085. predict error 0
  3086. dir: dir isL
  3087. /|\425: O: O850 (predict-no)
  3088. I see 1 and I'm going to do: predict-no
  3089. ENV: Agent did: predict-no for direction L in state State-A
  3090. In State-A moving L
  3091. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3092. predict error 0
  3093. dir: dir isR
  3094. -/|426: O: O851 (predict-yes)
  3095. I see 1 and I'm going to do: predict-yes
  3096. ENV: Agent did: predict-yes for direction R in state State-A
  3097. In State-A moving R
  3098. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3099. predict error 0
  3100. dir: dir isU
  3101. \-/427: O: O854 (predict-no)
  3102. I see 1 and I'm going to do: predict-no
  3103. ENV: Agent did: predict-no for direction U in state State-B
  3104. In State-B moving U
  3105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3106. predict error 0
  3107. dir: dir isL
  3108. |\-428: O: O855 (predict-yes)
  3109. I see 1 and I'm going to do: predict-yes
  3110. ENV: Agent did: predict-yes for direction L in state State-B
  3111. In State-B moving L
  3112. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3113. predict error 0
  3114. dir: dir isU
  3115. /|\429: O: O858 (predict-no)
  3116. I see 1 and I'm going to do: predict-no
  3117. ENV: Agent did: predict-no for direction U in state State-A
  3118. In State-A moving U
  3119. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3120. predict error 0
  3121. dir: dir isU
  3122. -/|430: O: O860 (predict-no)
  3123. I see 1 and I'm going to do: predict-no
  3124. ENV: Agent did: predict-no for direction U in state State-A
  3125. In State-A moving U
  3126. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3127. predict error 0
  3128. dir: dir isR
  3129. \-/431: O: O861 (predict-yes)
  3130. I see 1 and I'm going to do: predict-yes
  3131. ENV: Agent did: predict-yes for direction R in state State-A
  3132. In State-A moving R
  3133. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3134. predict error 0
  3135. dir: dir isR
  3136. |432: O: O864 (predict-no)
  3137. I see 1 and I'm going to do: predict-no
  3138. ENV: Agent did: predict-no for direction R in state State-B
  3139. In State-B moving R
  3140. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3141. predict error 0
  3142. dir: dir isL
  3143. \-433: O: O865 (predict-yes)
  3144. I see 1 and I'm going to do: predict-yes
  3145. ENV: Agent did: predict-yes for direction L in state State-B
  3146. In State-B moving L
  3147. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3148. predict error 0
  3149. dir: dir isU
  3150. /|\434: O: O868 (predict-no)
  3151. I see 1 and I'm going to do: predict-no
  3152. ENV: Agent did: predict-no for direction U in state State-A
  3153. In State-A moving U
  3154. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3155. predict error 0
  3156. dir: dir isL
  3157. -435: O: O870 (predict-no)
  3158. I see 1 and I'm going to do: predict-no
  3159. ENV: Agent did: predict-no for direction L in state State-A
  3160. In State-A moving L
  3161. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3162. predict error 0
  3163. dir: dir isU
  3164. /|\436: O: O872 (predict-no)
  3165. I see 1 and I'm going to do: predict-no
  3166. ENV: Agent did: predict-no for direction U in state State-A
  3167. In State-A moving U
  3168. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3169. predict error 0
  3170. dir: dir isU
  3171. -/|437: O: O874 (predict-no)
  3172. I see 1 and I'm going to do: predict-no
  3173. ENV: Agent did: predict-no for direction U in state State-A
  3174. In State-A moving U
  3175. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3176. predict error 0
  3177. dir: dir isR
  3178. \-/438: O: O875 (predict-yes)
  3179. I see 1 and I'm going to do: predict-yes
  3180. ENV: Agent did: predict-yes for direction R in state State-A
  3181. In State-A moving R
  3182. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3183. predict error 0
  3184. dir: dir isL
  3185. |439: O: O877 (predict-yes)
  3186. I see 1 and I'm going to do: predict-yes
  3187. ENV: Agent did: predict-yes for direction L in state State-B
  3188. In State-B moving L
  3189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3190. predict error 0
  3191. dir: dir isU
  3192. \-440: O: O880 (predict-no)
  3193. I see 1 and I'm going to do: predict-no
  3194. ENV: Agent did: predict-no for direction U in state State-A
  3195. In State-A moving U
  3196. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3197. predict error 0
  3198. dir: dir isU
  3199. /|441: O: O882 (predict-no)
  3200. I see 1 and I'm going to do: predict-no
  3201. ENV: Agent did: predict-no for direction U in state State-A
  3202. In State-A moving U
  3203. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3204. predict error 0
  3205. dir: dir isL
  3206. \442: O: O884 (predict-no)
  3207. I see 1 and I'm going to do: predict-no
  3208. ENV: Agent did: predict-no for direction L in state State-A
  3209. In State-A moving L
  3210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3211. predict error 0
  3212. dir: dir isU
  3213. -/443: O: O886 (predict-no)
  3214. I see 1 and I'm going to do: predict-no
  3215. ENV: Agent did: predict-no for direction U in state State-A
  3216. In State-A moving U
  3217. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3218. predict error 0
  3219. dir: dir isU
  3220. |\444: O: O888 (predict-no)
  3221. I see 1 and I'm going to do: predict-no
  3222. ENV: Agent did: predict-no for direction U in state State-A
  3223. In State-A moving U
  3224. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3225. predict error 0
  3226. dir: dir isR
  3227. -/|445: O: O890 (predict-no)
  3228. I see 1 and I'm going to do: predict-no
  3229. ENV: Agent did: predict-no for direction R in state State-A
  3230. In State-A moving R
  3231. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3232. predict error 1
  3233. dir: dir isU
  3234. \-/446: O: O892 (predict-no)
  3235. I see 0 and I'm going to do: predict-no
  3236. ENV: Agent did: predict-no for direction U in state State-B
  3237. In State-B moving U
  3238. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3239. predict error 0
  3240. dir: dir isR
  3241. |\-447: O: O894 (predict-no)
  3242. I see 1 and I'm going to do: predict-no
  3243. ENV: Agent did: predict-no for direction R in state State-B
  3244. In State-B moving R
  3245. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3246. predict error 0
  3247. dir: dir isU
  3248. /|448: O: O895 (predict-yes)
  3249. I see 1 and I'm going to do: predict-yes
  3250. ENV: Agent did: predict-yes for direction U in state State-B
  3251. In State-B moving U
  3252. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3253. predict error 1
  3254. dir: dir isU
  3255. \-449: O: O898 (predict-no)
  3256. I see 0 and I'm going to do: predict-no
  3257. ENV: Agent did: predict-no for direction U in state State-B
  3258. In State-B moving U
  3259. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3260. predict error 0
  3261. dir: dir isR
  3262. /|450: O: O900 (predict-no)
  3263. I see 1 and I'm going to do: predict-no
  3264. ENV: Agent did: predict-no for direction R in state State-B
  3265. In State-B moving R
  3266. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3267. predict error 0
  3268. dir: dir isU
  3269. \-/|451: O: O902 (predict-no)
  3270. I see 1 and I'm going to do: predict-no
  3271. ENV: Agent did: predict-no for direction U in state State-B
  3272. In State-B moving U
  3273. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3274. predict error 0
  3275. dir: dir isR
  3276. \452: O: O904 (predict-no)
  3277. I see 1 and I'm going to do: predict-no
  3278. ENV: Agent did: predict-no for direction R in state State-B
  3279. In State-B moving R
  3280. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3281. predict error 0
  3282. dir: dir isL
  3283. -/|453: O: O905 (predict-yes)
  3284. I see 1 and I'm going to do: predict-yes
  3285. ENV: Agent did: predict-yes for direction L in state State-B
  3286. In State-B moving L
  3287. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3288. predict error 0
  3289. dir: dir isL
  3290. \-/454: O: O908 (predict-no)
  3291. I see 1 and I'm going to do: predict-no
  3292. ENV: Agent did: predict-no for direction L in state State-A
  3293. In State-A moving L
  3294. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3295. predict error 0
  3296. dir: dir isL
  3297. |\-455: O: O909 (predict-yes)
  3298. I see 1 and I'm going to do: predict-yes
  3299. ENV: Agent did: predict-yes for direction L in state State-A
  3300. In State-A moving L
  3301. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3302. predict error 1
  3303. dir: dir isU
  3304. /|456: O: O912 (predict-no)
  3305. I see 0 and I'm going to do: predict-no
  3306. ENV: Agent did: predict-no for direction U in state State-A
  3307. In State-A moving U
  3308. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3309. predict error 0
  3310. dir: dir isU
  3311. \-457: O: O914 (predict-no)
  3312. I see 1 and I'm going to do: predict-no
  3313. ENV: Agent did: predict-no for direction U in state State-A
  3314. In State-A moving U
  3315. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3316. predict error 0
  3317. dir: dir isL
  3318. /|\458: O: O916 (predict-no)
  3319. I see 1 and I'm going to do: predict-no
  3320. ENV: Agent did: predict-no for direction L in state State-A
  3321. In State-A moving L
  3322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3323. predict error 0
  3324. dir: dir isR
  3325. -/|459: O: O917 (predict-yes)
  3326. I see 1 and I'm going to do: predict-yes
  3327. ENV: Agent did: predict-yes for direction R in state State-A
  3328. In State-A moving R
  3329. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3330. predict error 0
  3331. dir: dir isR
  3332. \-/460: O: O920 (predict-no)
  3333. I see 1 and I'm going to do: predict-no
  3334. ENV: Agent did: predict-no for direction R in state State-B
  3335. In State-B moving R
  3336. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3337. predict error 0
  3338. dir: dir isL
  3339. |\-461: O: O921 (predict-yes)
  3340. I see 1 and I'm going to do: predict-yes
  3341. ENV: Agent did: predict-yes for direction L in state State-B
  3342. In State-B moving L
  3343. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3344. predict error 0
  3345. dir: dir isL
  3346. /462: O: O924 (predict-no)
  3347. I see 1 and I'm going to do: predict-no
  3348. ENV: Agent did: predict-no for direction L in state State-A
  3349. In State-A moving L
  3350. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3351. predict error 0
  3352. dir: dir isL
  3353. |\-463: O: O926 (predict-no)
  3354. I see 1 and I'm going to do: predict-no
  3355. ENV: Agent did: predict-no for direction L in state State-A
  3356. In State-A moving L
  3357. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3358. predict error 0
  3359. dir: dir isU
  3360. /|\464: O: O928 (predict-no)
  3361. I see 1 and I'm going to do: predict-no
  3362. ENV: Agent did: predict-no for direction U in state State-A
  3363. In State-A moving U
  3364. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3365. predict error 0
  3366. dir: dir isL
  3367. -/|465: O: O930 (predict-no)
  3368. I see 1 and I'm going to do: predict-no
  3369. ENV: Agent did: predict-no for direction L in state State-A
  3370. In State-A moving L
  3371. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3372. predict error 0
  3373. dir: dir isL
  3374. \-/466: O: O932 (predict-no)
  3375. I see 1 and I'm going to do: predict-no
  3376. ENV: Agent did: predict-no for direction L in state State-A
  3377. In State-A moving L
  3378. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3379. predict error 0
  3380. dir: dir isR
  3381. |\-467: O: O933 (predict-yes)
  3382. I see 1 and I'm going to do: predict-yes
  3383. ENV: Agent did: predict-yes for direction R in state State-A
  3384. In State-A moving R
  3385. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3386. predict error 0
  3387. dir: dir isL
  3388. /|468: O: O935 (predict-yes)
  3389. I see 1 and I'm going to do: predict-yes
  3390. ENV: Agent did: predict-yes for direction L in state State-B
  3391. In State-B moving L
  3392. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3393. predict error 0
  3394. dir: dir isR
  3395. \469: O: O938 (predict-no)
  3396. I see 1 and I'm going to do: predict-no
  3397. ENV: Agent did: predict-no for direction R in state State-A
  3398. In State-A moving R
  3399. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3400. predict error 1
  3401. dir: dir isR
  3402. -/470: O: O940 (predict-no)
  3403. I see 0 and I'm going to do: predict-no
  3404. ENV: Agent did: predict-no for direction R in state State-B
  3405. In State-B moving R
  3406. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3407. predict error 0
  3408. dir: dir isU
  3409. |\-471: O: O942 (predict-no)
  3410. I see 1 and I'm going to do: predict-no
  3411. ENV: Agent did: predict-no for direction U in state State-B
  3412. In State-B moving U
  3413. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3414. predict error 0
  3415. dir: dir isL
  3416. /472: O: O943 (predict-yes)
  3417. I see 1 and I'm going to do: predict-yes
  3418. ENV: Agent did: predict-yes for direction L in state State-B
  3419. In State-B moving L
  3420. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3421. predict error 0
  3422. dir: dir isL
  3423. |\473: O: O945 (predict-yes)
  3424. I see 1 and I'm going to do: predict-yes
  3425. ENV: Agent did: predict-yes for direction L in state State-A
  3426. In State-A moving L
  3427. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3428. predict error 1
  3429. dir: dir isR
  3430. -/|474: O: O947 (predict-yes)
  3431. I see 0 and I'm going to do: predict-yes
  3432. ENV: Agent did: predict-yes for direction R in state State-A
  3433. In State-A moving R
  3434. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3435. predict error 0
  3436. dir: dir isL
  3437. \-/475: O: O949 (predict-yes)
  3438. I see 1 and I'm going to do: predict-yes
  3439. ENV: Agent did: predict-yes for direction L in state State-B
  3440. In State-B moving L
  3441. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3442. predict error 0
  3443. dir: dir isR
  3444. |\-476: O: O952 (predict-no)
  3445. I see 1 and I'm going to do: predict-no
  3446. ENV: Agent did: predict-no for direction R in state State-A
  3447. In State-A moving R
  3448. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3449. predict error 1
  3450. dir: dir isL
  3451. /|\477: O: O953 (predict-yes)
  3452. I see 0 and I'm going to do: predict-yes
  3453. ENV: Agent did: predict-yes for direction L in state State-B
  3454. In State-B moving L
  3455. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3456. predict error 0
  3457. dir: dir isU
  3458. -/|478: O: O956 (predict-no)
  3459. I see 1 and I'm going to do: predict-no
  3460. ENV: Agent did: predict-no for direction U in state State-A
  3461. In State-A moving U
  3462. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3463. predict error 0
  3464. dir: dir isU
  3465. \-/479: O: O958 (predict-no)
  3466. I see 1 and I'm going to do: predict-no
  3467. ENV: Agent did: predict-no for direction U in state State-A
  3468. In State-A moving U
  3469. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3470. predict error 0
  3471. dir: dir isU
  3472. |\480: O: O960 (predict-no)
  3473. I see 1 and I'm going to do: predict-no
  3474. ENV: Agent did: predict-no for direction U in state State-A
  3475. In State-A moving U
  3476. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3477. predict error 0
  3478. dir: dir isU
  3479. -/|481: O: O962 (predict-no)
  3480. I see 1 and I'm going to do: predict-no
  3481. ENV: Agent did: predict-no for direction U in state State-A
  3482. In State-A moving U
  3483. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3484. predict error 0
  3485. dir: dir isR
  3486. \482: O: O963 (predict-yes)
  3487. I see 1 and I'm going to do: predict-yes
  3488. ENV: Agent did: predict-yes for direction R in state State-A
  3489. In State-A moving R
  3490. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3491. predict error 0
  3492. dir: dir isR
  3493. -/|483: O: O966 (predict-no)
  3494. I see 1 and I'm going to do: predict-no
  3495. ENV: Agent did: predict-no for direction R in state State-B
  3496. In State-B moving R
  3497. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3498. predict error 0
  3499. dir: dir isU
  3500. \-/484: O: O968 (predict-no)
  3501. I see 1 and I'm going to do: predict-no
  3502. ENV: Agent did: predict-no for direction U in state State-B
  3503. In State-B moving U
  3504. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3505. predict error 0
  3506. dir: dir isU
  3507. |\-485: O: O970 (predict-no)
  3508. I see 1 and I'm going to do: predict-no
  3509. ENV: Agent did: predict-no for direction U in state State-B
  3510. In State-B moving U
  3511. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3512. predict error 0
  3513. dir: dir isR
  3514. /|\486: O: O972 (predict-no)
  3515. I see 1 and I'm going to do: predict-no
  3516. ENV: Agent did: predict-no for direction R in state State-B
  3517. In State-B moving R
  3518. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3519. predict error 0
  3520. dir: dir isR
  3521. -/|\sleeping...
  3522. -487: O: O974 (predict-no)
  3523. I see 1 and I'm going to do: predict-no
  3524. ENV: Agent did: predict-no for direction R in state State-B
  3525. In State-B moving R
  3526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3527. predict error 0
  3528. dir: dir isL
  3529. /|488: O: O975 (predict-yes)
  3530. I see 1 and I'm going to do: predict-yes
  3531. ENV: Agent did: predict-yes for direction L in state State-B
  3532. In State-B moving L
  3533. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3534. predict error 0
  3535. dir: dir isL
  3536. \-489: O: O978 (predict-no)
  3537. I see 1 and I'm going to do: predict-no
  3538. ENV: Agent did: predict-no for direction L in state State-A
  3539. In State-A moving L
  3540. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3541. predict error 0
  3542. dir: dir isU
  3543. /|490: O: O980 (predict-no)
  3544. I see 1 and I'm going to do: predict-no
  3545. ENV: Agent did: predict-no for direction U in state State-A
  3546. In State-A moving U
  3547. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3548. predict error 0
  3549. dir: dir isL
  3550. \-/491: O: O982 (predict-no)
  3551. I see 1 and I'm going to do: predict-no
  3552. ENV: Agent did: predict-no for direction L in state State-A
  3553. In State-A moving L
  3554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3555. predict error 0
  3556. dir: dir isU
  3557. |492: O: O984 (predict-no)
  3558. I see 1 and I'm going to do: predict-no
  3559. ENV: Agent did: predict-no for direction U in state State-A
  3560. In State-A moving U
  3561. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3562. predict error 0
  3563. dir: dir isR
  3564. \-/493: O: O985 (predict-yes)
  3565. I see 1 and I'm going to do: predict-yes
  3566. ENV: Agent did: predict-yes for direction R in state State-A
  3567. In State-A moving R
  3568. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3569. predict error 0
  3570. dir: dir isU
  3571. |\494: O: O988 (predict-no)
  3572. I see 1 and I'm going to do: predict-no
  3573. ENV: Agent did: predict-no for direction U in state State-B
  3574. In State-B moving U
  3575. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3576. predict error 0
  3577. dir: dir isU
  3578. -/|495: O: O990 (predict-no)
  3579. I see 1 and I'm going to do: predict-no
  3580. ENV: Agent did: predict-no for direction U in state State-B
  3581. In State-B moving U
  3582. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3583. predict error 0
  3584. dir: dir isU
  3585. \-/496: O: O992 (predict-no)
  3586. I see 1 and I'm going to do: predict-no
  3587. ENV: Agent did: predict-no for direction U in state State-B
  3588. In State-B moving U
  3589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3590. predict error 0
  3591. dir: dir isL
  3592. |\-497: O: O993 (predict-yes)
  3593. I see 1 and I'm going to do: predict-yes
  3594. ENV: Agent did: predict-yes for direction L in state State-B
  3595. In State-B moving L
  3596. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3597. predict error 0
  3598. dir: dir isR
  3599. /|498: O: O995 (predict-yes)
  3600. I see 1 and I'm going to do: predict-yes
  3601. ENV: Agent did: predict-yes for direction R in state State-A
  3602. In State-A moving R
  3603. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3604. predict error 0
  3605. dir: dir isR
  3606. \-/499: O: O998 (predict-no)
  3607. I see 1 and I'm going to do: predict-no
  3608. ENV: Agent did: predict-no for direction R in state State-B
  3609. In State-B moving R
  3610. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3611. predict error 0
  3612. dir: dir isL
  3613. |\-500: O: O999 (predict-yes)
  3614. I see 1 and I'm going to do: predict-yes
  3615. ENV: Agent did: predict-yes for direction L in state State-B
  3616. In State-B moving L
  3617. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3618. predict error 0
  3619. dir: dir isR
  3620. /|\-/501: O: O1001 (predict-yes)
  3621. I see 1 and I'm going to do: predict-yes
  3622. ENV: Agent did: predict-yes for direction R in state State-A
  3623. In State-A moving R
  3624. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3625. predict error 0
  3626. dir: dir isR
  3627. |502: O: O1004 (predict-no)
  3628. I see 1 and I'm going to do: predict-no
  3629. ENV: Agent did: predict-no for direction R in state State-B
  3630. In State-B moving R
  3631. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3632. predict error 0
  3633. dir: dir isR
  3634. \-/503: O: O1006 (predict-no)
  3635. I see 1 and I'm going to do: predict-no
  3636. ENV: Agent did: predict-no for direction R in state State-B
  3637. In State-B moving R
  3638. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3639. predict error 0
  3640. dir: dir isL
  3641. |\504: O: O1007 (predict-yes)
  3642. I see 1 and I'm going to do: predict-yes
  3643. ENV: Agent did: predict-yes for direction L in state State-B
  3644. In State-B moving L
  3645. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3646. predict error 0
  3647. dir: dir isR
  3648. -505: O: O1009 (predict-yes)
  3649. I see 1 and I'm going to do: predict-yes
  3650. ENV: Agent did: predict-yes for direction R in state State-A
  3651. In State-A moving R
  3652. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3653. predict error 0
  3654. dir: dir isR
  3655. /|\506: O: O1012 (predict-no)
  3656. I see 1 and I'm going to do: predict-no
  3657. ENV: Agent did: predict-no for direction R in state State-B
  3658. In State-B moving R
  3659. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3660. predict error 0
  3661. dir: dir isL
  3662. -/507: O: O1013 (predict-yes)
  3663. I see 1 and I'm going to do: predict-yes
  3664. ENV: Agent did: predict-yes for direction L in state State-B
  3665. In State-B moving L
  3666. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3667. predict error 0
  3668. dir: dir isR
  3669. |\508: O: O1015 (predict-yes)
  3670. I see 1 and I'm going to do: predict-yes
  3671. ENV: Agent did: predict-yes for direction R in state State-A
  3672. In State-A moving R
  3673. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3674. predict error 0
  3675. dir: dir isU
  3676. -/|509: O: O1018 (predict-no)
  3677. I see 1 and I'm going to do: predict-no
  3678. ENV: Agent did: predict-no for direction U in state State-B
  3679. In State-B moving U
  3680. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3681. predict error 0
  3682. dir: dir isU
  3683. \-/510: O: O1020 (predict-no)
  3684. I see 1 and I'm going to do: predict-no
  3685. ENV: Agent did: predict-no for direction U in state State-B
  3686. In State-B moving U
  3687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3688. predict error 0
  3689. dir: dir isR
  3690. |\-511: O: O1022 (predict-no)
  3691. I see 1 and I'm going to do: predict-no
  3692. ENV: Agent did: predict-no for direction R in state State-B
  3693. In State-B moving R
  3694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3695. predict error 0
  3696. dir: dir isR
  3697. /512: O: O1023 (predict-yes)
  3698. I see 1 and I'm going to do: predict-yes
  3699. ENV: Agent did: predict-yes for direction R in state State-B
  3700. In State-B moving R
  3701. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3702. predict error 1
  3703. dir: dir isR
  3704. |\513: O: O1026 (predict-no)
  3705. I see 0 and I'm going to do: predict-no
  3706. ENV: Agent did: predict-no for direction R in state State-B
  3707. In State-B moving R
  3708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3709. predict error 0
  3710. dir: dir isL
  3711. -514: O: O1027 (predict-yes)
  3712. I see 1 and I'm going to do: predict-yes
  3713. ENV: Agent did: predict-yes for direction L in state State-B
  3714. In State-B moving L
  3715. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3716. predict error 0
  3717. dir: dir isL
  3718. /|\515: O: O1030 (predict-no)
  3719. I see 1 and I'm going to do: predict-no
  3720. ENV: Agent did: predict-no for direction L in state State-A
  3721. In State-A moving L
  3722. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3723. predict error 0
  3724. dir: dir isL
  3725. -/|516: O: O1032 (predict-no)
  3726. I see 1 and I'm going to do: predict-no
  3727. ENV: Agent did: predict-no for direction L in state State-A
  3728. In State-A moving L
  3729. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3730. predict error 0
  3731. dir: dir isR
  3732. \-517: O: O1034 (predict-no)
  3733. I see 1 and I'm going to do: predict-no
  3734. ENV: Agent did: predict-no for direction R in state State-A
  3735. In State-A moving R
  3736. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3737. predict error 1
  3738. dir: dir isU
  3739. /|\518: O: O1036 (predict-no)
  3740. I see 0 and I'm going to do: predict-no
  3741. ENV: Agent did: predict-no for direction U in state State-B
  3742. In State-B moving U
  3743. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3744. predict error 0
  3745. dir: dir isU
  3746. -/519: O: O1038 (predict-no)
  3747. I see 1 and I'm going to do: predict-no
  3748. ENV: Agent did: predict-no for direction U in state State-B
  3749. In State-B moving U
  3750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3751. predict error 0
  3752. dir: dir isR
  3753. |\-520: O: O1040 (predict-no)
  3754. I see 1 and I'm going to do: predict-no
  3755. ENV: Agent did: predict-no for direction R in state State-B
  3756. In State-B moving R
  3757. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3758. predict error 0
  3759. dir: dir isU
  3760. /|\521: O: O1042 (predict-no)
  3761. I see 1 and I'm going to do: predict-no
  3762. ENV: Agent did: predict-no for direction U in state State-B
  3763. In State-B moving U
  3764. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3765. predict error 0
  3766. dir: dir isR
  3767. -522: O: O1044 (predict-no)
  3768. I see 1 and I'm going to do: predict-no
  3769. ENV: Agent did: predict-no for direction R in state State-B
  3770. In State-B moving R
  3771. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3772. predict error 0
  3773. dir: dir isU
  3774. /|\523: O: O1046 (predict-no)
  3775. I see 1 and I'm going to do: predict-no
  3776. ENV: Agent did: predict-no for direction U in state State-B
  3777. In State-B moving U
  3778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3779. predict error 0
  3780. dir: dir isR
  3781. -/|524: O: O1048 (predict-no)
  3782. I see 1 and I'm going to do: predict-no
  3783. ENV: Agent did: predict-no for direction R in state State-B
  3784. In State-B moving R
  3785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3786. predict error 0
  3787. dir: dir isU
  3788. \-/525: O: O1050 (predict-no)
  3789. I see 1 and I'm going to do: predict-no
  3790. ENV: Agent did: predict-no for direction U in state State-B
  3791. In State-B moving U
  3792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3793. predict error 0
  3794. dir: dir isU
  3795. |\-526: O: O1052 (predict-no)
  3796. I see 1 and I'm going to do: predict-no
  3797. ENV: Agent did: predict-no for direction U in state State-B
  3798. In State-B moving U
  3799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3800. predict error 0
  3801. dir: dir isL
  3802. /|\527: O: O1053 (predict-yes)
  3803. I see 1 and I'm going to do: predict-yes
  3804. ENV: Agent did: predict-yes for direction L in state State-B
  3805. In State-B moving L
  3806. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3807. predict error 0
  3808. dir: dir isL
  3809. -/528: O: O1056 (predict-no)
  3810. I see 1 and I'm going to do: predict-no
  3811. ENV: Agent did: predict-no for direction L in state State-A
  3812. In State-A moving L
  3813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3814. predict error 0
  3815. dir: dir isR
  3816. |\-529: O: O1057 (predict-yes)
  3817. I see 1 and I'm going to do: predict-yes
  3818. ENV: Agent did: predict-yes for direction R in state State-A
  3819. In State-A moving R
  3820. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3821. predict error 0
  3822. dir: dir isR
  3823. /|\530: O: O1060 (predict-no)
  3824. I see 1 and I'm going to do: predict-no
  3825. ENV: Agent did: predict-no for direction R in state State-B
  3826. In State-B moving R
  3827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3828. predict error 0
  3829. dir: dir isR
  3830. -/|531: O: O1062 (predict-no)
  3831. I see 1 and I'm going to do: predict-no
  3832. ENV: Agent did: predict-no for direction R in state State-B
  3833. In State-B moving R
  3834. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3835. predict error 0
  3836. dir: dir isL
  3837. \532: O: O1063 (predict-yes)
  3838. I see 1 and I'm going to do: predict-yes
  3839. ENV: Agent did: predict-yes for direction L in state State-B
  3840. In State-B moving L
  3841. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3842. predict error 0
  3843. dir: dir isR
  3844. -/533: O: O1065 (predict-yes)
  3845. I see 1 and I'm going to do: predict-yes
  3846. ENV: Agent did: predict-yes for direction R in state State-A
  3847. In State-A moving R
  3848. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3849. predict error 0
  3850. dir: dir isR
  3851. |\-534: O: O1068 (predict-no)
  3852. I see 1 and I'm going to do: predict-no
  3853. ENV: Agent did: predict-no for direction R in state State-B
  3854. In State-B moving R
  3855. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3856. predict error 0
  3857. dir: dir isR
  3858. /|\535: O: O1070 (predict-no)
  3859. I see 1 and I'm going to do: predict-no
  3860. ENV: Agent did: predict-no for direction R in state State-B
  3861. In State-B moving R
  3862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3863. predict error 0
  3864. dir: dir isU
  3865. -/|536: O: O1072 (predict-no)
  3866. I see 1 and I'm going to do: predict-no
  3867. ENV: Agent did: predict-no for direction U in state State-B
  3868. In State-B moving U
  3869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3870. predict error 0
  3871. dir: dir isR
  3872. \-537: O: O1074 (predict-no)
  3873. I see 1 and I'm going to do: predict-no
  3874. ENV: Agent did: predict-no for direction R in state State-B
  3875. In State-B moving R
  3876. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3877. predict error 0
  3878. dir: dir isU
  3879. /|\538: O: O1076 (predict-no)
  3880. I see 1 and I'm going to do: predict-no
  3881. ENV: Agent did: predict-no for direction U in state State-B
  3882. In State-B moving U
  3883. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3884. predict error 0
  3885. dir: dir isU
  3886. -/|\539: O: O1078 (predict-no)
  3887. I see 1 and I'm going to do: predict-no
  3888. ENV: Agent did: predict-no for direction U in state State-B
  3889. In State-B moving U
  3890. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3891. predict error 0
  3892. dir: dir isU
  3893. -/|540: O: O1080 (predict-no)
  3894. I see 1 and I'm going to do: predict-no
  3895. ENV: Agent did: predict-no for direction U in state State-B
  3896. In State-B moving U
  3897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3898. predict error 0
  3899. dir: dir isR
  3900. \-541: O: O1082 (predict-no)
  3901. I see 1 and I'm going to do: predict-no
  3902. ENV: Agent did: predict-no for direction R in state State-B
  3903. In State-B moving R
  3904. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3905. predict error 0
  3906. dir: dir isU
  3907. /542: O: O1083 (predict-yes)
  3908. I see 1 and I'm going to do: predict-yes
  3909. ENV: Agent did: predict-yes for direction U in state State-B
  3910. In State-B moving U
  3911. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3912. predict error 1
  3913. dir: dir isR
  3914. |\-/543: O: O1086 (predict-no)
  3915. I see 0 and I'm going to do: predict-no
  3916. ENV: Agent did: predict-no for direction R in state State-B
  3917. In State-B moving R
  3918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3919. predict error 0
  3920. dir: dir isR
  3921. |\-544: O: O1088 (predict-no)
  3922. I see 1 and I'm going to do: predict-no
  3923. ENV: Agent did: predict-no for direction R in state State-B
  3924. In State-B moving R
  3925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3926. predict error 0
  3927. dir: dir isR
  3928. /|545: O: O1090 (predict-no)
  3929. I see 1 and I'm going to do: predict-no
  3930. ENV: Agent did: predict-no for direction R in state State-B
  3931. In State-B moving R
  3932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3933. predict error 0
  3934. dir: dir isR
  3935. \-/546: O: O1092 (predict-no)
  3936. I see 1 and I'm going to do: predict-no
  3937. ENV: Agent did: predict-no for direction R in state State-B
  3938. In State-B moving R
  3939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3940. predict error 0
  3941. dir: dir isR
  3942. |\547: O: O1094 (predict-no)
  3943. I see 1 and I'm going to do: predict-no
  3944. ENV: Agent did: predict-no for direction R in state State-B
  3945. In State-B moving R
  3946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3947. predict error 0
  3948. dir: dir isR
  3949. -/|548: O: O1096 (predict-no)
  3950. I see 1 and I'm going to do: predict-no
  3951. ENV: Agent did: predict-no for direction R in state State-B
  3952. In State-B moving R
  3953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3954. predict error 0
  3955. dir: dir isU
  3956. \-/549: O: O1098 (predict-no)
  3957. I see 1 and I'm going to do: predict-no
  3958. ENV: Agent did: predict-no for direction U in state State-B
  3959. In State-B moving U
  3960. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3961. predict error 0
  3962. dir: dir isU
  3963. |\550: O: O1099 (predict-yes)
  3964. I see 1 and I'm going to do: predict-yes
  3965. ENV: Agent did: predict-yes for direction U in state State-B
  3966. In State-B moving U
  3967. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3968. predict error 1
  3969. dir: dir isU
  3970. -/|551: O: O1102 (predict-no)
  3971. I see 0 and I'm going to do: predict-no
  3972. ENV: Agent did: predict-no for direction U in state State-B
  3973. In State-B moving U
  3974. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3975. predict error 0
  3976. dir: dir isU
  3977. \552: O: O1104 (predict-no)
  3978. I see 1 and I'm going to do: predict-no
  3979. ENV: Agent did: predict-no for direction U in state State-B
  3980. In State-B moving U
  3981. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3982. predict error 0
  3983. dir: dir isU
  3984. -/|553: O: O1105 (predict-yes)
  3985. I see 1 and I'm going to do: predict-yes
  3986. ENV: Agent did: predict-yes for direction U in state State-B
  3987. In State-B moving U
  3988. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3989. predict error 1
  3990. dir: dir isR
  3991. \-/554: O: O1108 (predict-no)
  3992. I see 0 and I'm going to do: predict-no
  3993. ENV: Agent did: predict-no for direction R in state State-B
  3994. In State-B moving R
  3995. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3996. predict error 0
  3997. dir: dir isR
  3998. |\-555: O: O1110 (predict-no)
  3999. I see 1 and I'm going to do: predict-no
  4000. ENV: Agent did: predict-no for direction R in state State-B
  4001. In State-B moving R
  4002. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4003. predict error 0
  4004. dir: dir isL
  4005. /|\556: O: O1111 (predict-yes)
  4006. I see 1 and I'm going to do: predict-yes
  4007. ENV: Agent did: predict-yes for direction L in state State-B
  4008. In State-B moving L
  4009. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4010. predict error 0
  4011. dir: dir isU
  4012. -/557: O: O1114 (predict-no)
  4013. I see 1 and I'm going to do: predict-no
  4014. ENV: Agent did: predict-no for direction U in state State-A
  4015. In State-A moving U
  4016. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4017. predict error 0
  4018. dir: dir isU
  4019. |\-558: O: O1116 (predict-no)
  4020. I see 1 and I'm going to do: predict-no
  4021. ENV: Agent did: predict-no for direction U in state State-A
  4022. In State-A moving U
  4023. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4024. predict error 0
  4025. dir: dir isR
  4026. /|\559: O: O1117 (predict-yes)
  4027. I see 1 and I'm going to do: predict-yes
  4028. ENV: Agent did: predict-yes for direction R in state State-A
  4029. In State-A moving R
  4030. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4031. predict error 0
  4032. dir: dir isL
  4033. -/|560: O: O1119 (predict-yes)
  4034. I see 1 and I'm going to do: predict-yes
  4035. ENV: Agent did: predict-yes for direction L in state State-B
  4036. In State-B moving L
  4037. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4038. predict error 0
  4039. dir: dir isU
  4040. \-/561: O: O1122 (predict-no)
  4041. I see 1 and I'm going to do: predict-no
  4042. ENV: Agent did: predict-no for direction U in state State-A
  4043. In State-A moving U
  4044. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4045. predict error 0
  4046. dir: dir isR
  4047. |562: O: O1124 (predict-no)
  4048. I see 1 and I'm going to do: predict-no
  4049. ENV: Agent did: predict-no for direction R in state State-A
  4050. In State-A moving R
  4051. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  4052. predict error 1
  4053. dir: dir isR
  4054. \-/563: O: O1126 (predict-no)
  4055. I see 0 and I'm going to do: predict-no
  4056. ENV: Agent did: predict-no for direction R in state State-B
  4057. In State-B moving R
  4058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4059. predict error 0
  4060. dir: dir isL
  4061. |\-564: O: O1127 (predict-yes)
  4062. I see 1 and I'm going to do: predict-yes
  4063. ENV: Agent did: predict-yes for direction L in state State-B
  4064. In State-B moving L
  4065. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4066. predict error 0
  4067. dir: dir isR
  4068. /|\565: O: O1129 (predict-yes)
  4069. I see 1 and I'm going to do: predict-yes
  4070. ENV: Agent did: predict-yes for direction R in state State-A
  4071. In State-A moving R
  4072. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4073. predict error 0
  4074. dir: dir isU
  4075. -/566: O: O1132 (predict-no)
  4076. I see 1 and I'm going to do: predict-no
  4077. ENV: Agent did: predict-no for direction U in state State-B
  4078. In State-B moving U
  4079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4080. predict error 0
  4081. dir: dir isR
  4082. |\-567: O: O1134 (predict-no)
  4083. I see 1 and I'm going to do: predict-no
  4084. ENV: Agent did: predict-no for direction R in state State-B
  4085. In State-B moving R
  4086. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4087. predict error 0
  4088. dir: dir isR
  4089. /|\568: O: O1136 (predict-no)
  4090. I see 1 and I'm going to do: predict-no
  4091. ENV: Agent did: predict-no for direction R in state State-B
  4092. In State-B moving R
  4093. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4094. predict error 0
  4095. dir: dir isR
  4096. -569: O: O1138 (predict-no)
  4097. I see 1 and I'm going to do: predict-no
  4098. ENV: Agent did: predict-no for direction R in state State-B
  4099. In State-B moving R
  4100. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4101. predict error 0
  4102. dir: dir isL
  4103. /|\570: O: O1139 (predict-yes)
  4104. I see 1 and I'm going to do: predict-yes
  4105. ENV: Agent did: predict-yes for direction L in state State-B
  4106. In State-B moving L
  4107. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4108. predict error 0
  4109. dir: dir isR
  4110. -/571: O: O1141 (predict-yes)
  4111. I see 1 and I'm going to do: predict-yes
  4112. ENV: Agent did: predict-yes for direction R in state State-A
  4113. In State-A moving R
  4114. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4115. predict error 0
  4116. dir: dir isU
  4117. |572: O: O1144 (predict-no)
  4118. I see 1 and I'm going to do: predict-no
  4119. ENV: Agent did: predict-no for direction U in state State-B
  4120. In State-B moving U
  4121. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4122. predict error 0
  4123. dir: dir isU
  4124. \-/573: O: O1146 (predict-no)
  4125. I see 1 and I'm going to do: predict-no
  4126. ENV: Agent did: predict-no for direction U in state State-B
  4127. In State-B moving U
  4128. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4129. predict error 0
  4130. dir: dir isR
  4131. |\-574: O: O1148 (predict-no)
  4132. I see 1 and I'm going to do: predict-no
  4133. ENV: Agent did: predict-no for direction R in state State-B
  4134. In State-B moving R
  4135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4136. predict error 0
  4137. dir: dir isU
  4138. /|\575: O: O1150 (predict-no)
  4139. I see 1 and I'm going to do: predict-no
  4140. ENV: Agent did: predict-no for direction U in state State-B
  4141. In State-B moving U
  4142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4143. predict error 0
  4144. dir: dir isR
  4145. -/|576: O: O1152 (predict-no)
  4146. I see 1 and I'm going to do: predict-no
  4147. ENV: Agent did: predict-no for direction R in state State-B
  4148. In State-B moving R
  4149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4150. predict error 0
  4151. dir: dir isL
  4152. \-/577: O: O1153 (predict-yes)
  4153. I see 1 and I'm going to do: predict-yes
  4154. ENV: Agent did: predict-yes for direction L in state State-B
  4155. In State-B moving L
  4156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4157. predict error 0
  4158. dir: dir isL
  4159. |\-578: O: O1156 (predict-no)
  4160. I see 1 and I'm going to do: predict-no
  4161. ENV: Agent did: predict-no for direction L in state State-A
  4162. In State-A moving L
  4163. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4164. predict error 0
  4165. dir: dir isU
  4166. /|\579: O: O1158 (predict-no)
  4167. I see 1 and I'm going to do: predict-no
  4168. ENV: Agent did: predict-no for direction U in state State-A
  4169. In State-A moving U
  4170. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4171. predict error 0
  4172. dir: dir isL
  4173. -/|580: O: O1160 (predict-no)
  4174. I see 1 and I'm going to do: predict-no
  4175. ENV: Agent did: predict-no for direction L in state State-A
  4176. In State-A moving L
  4177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4178. predict error 0
  4179. dir: dir isL
  4180. \-/|581: O: O1162 (predict-no)
  4181. I see 1 and I'm going to do: predict-no
  4182. ENV: Agent did: predict-no for direction L in state State-A
  4183. In State-A moving L
  4184. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4185. predict error 0
  4186. dir: dir isU
  4187. \582: O: O1164 (predict-no)
  4188. I see 1 and I'm going to do: predict-no
  4189. ENV: Agent did: predict-no for direction U in state State-A
  4190. In State-A moving U
  4191. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4192. predict error 0
  4193. dir: dir isR
  4194. -/583: O: O1165 (predict-yes)
  4195. I see 1 and I'm going to do: predict-yes
  4196. ENV: Agent did: predict-yes for direction R in state State-A
  4197. In State-A moving R
  4198. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4199. predict error 0
  4200. dir: dir isR
  4201. |\-584: O: O1168 (predict-no)
  4202. I see 1 and I'm going to do: predict-no
  4203. ENV: Agent did: predict-no for direction R in state State-B
  4204. In State-B moving R
  4205. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4206. predict error 0
  4207. dir: dir isR
  4208. /|585: O: O1170 (predict-no)
  4209. I see 1 and I'm going to do: predict-no
  4210. ENV: Agent did: predict-no for direction R in state State-B
  4211. In State-B moving R
  4212. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4213. predict error 0
  4214. dir: dir isU
  4215. \-586: O: O1172 (predict-no)
  4216. I see 1 and I'm going to do: predict-no
  4217. ENV: Agent did: predict-no for direction U in state State-B
  4218. In State-B moving U
  4219. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4220. predict error 0
  4221. dir: dir isL
  4222. /587: O: O1173 (predict-yes)
  4223. I see 1 and I'm going to do: predict-yes
  4224. ENV: Agent did: predict-yes for direction L in state State-B
  4225. In State-B moving L
  4226. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4227. predict error 0
  4228. dir: dir isR
  4229. |588: O: O1175 (predict-yes)
  4230. I see 1 and I'm going to do: predict-yes
  4231. ENV: Agent did: predict-yes for direction R in state State-A
  4232. In State-A moving R
  4233. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4234. predict error 0
  4235. dir: dir isU
  4236. \-/589: O: O1178 (predict-no)
  4237. I see 1 and I'm going to do: predict-no
  4238. ENV: Agent did: predict-no for direction U in state State-B
  4239. In State-B moving U
  4240. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4241. predict error 0
  4242. dir: dir isU
  4243. |\-590: O: O1180 (predict-no)
  4244. I see 1 and I'm going to do: predict-no
  4245. ENV: Agent did: predict-no for direction U in state State-B
  4246. In State-B moving U
  4247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4248. predict error 0
  4249. dir: dir isL
  4250. /|\591: O: O1181 (predict-yes)
  4251. I see 1 and I'm going to do: predict-yes
  4252. ENV: Agent did: predict-yes for direction L in state State-B
  4253. In State-B moving L
  4254. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4255. predict error 0
  4256. dir: dir isR
  4257. -592: O: O1183 (predict-yes)
  4258. I see 1 and I'm going to do: predict-yes
  4259. ENV: Agent did: predict-yes for direction R in state State-A
  4260. In State-A moving R
  4261. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4262. predict error 0
  4263. dir: dir isL
  4264. /|\593: O: O1185 (predict-yes)
  4265. I see 1 and I'm going to do: predict-yes
  4266. ENV: Agent did: predict-yes for direction L in state State-B
  4267. In State-B moving L
  4268. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4269. predict error 0
  4270. dir: dir isR
  4271. -/|594: O: O1187 (predict-yes)
  4272. I see 1 and I'm going to do: predict-yes
  4273. ENV: Agent did: predict-yes for direction R in state State-A
  4274. In State-A moving R
  4275. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4276. predict error 0
  4277. dir: dir isL
  4278. \-/595: O: O1189 (predict-yes)
  4279. I see 1 and I'm going to do: predict-yes
  4280. ENV: Agent did: predict-yes for direction L in state State-B
  4281. In State-B moving L
  4282. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4283. predict error 0
  4284. dir: dir isU
  4285. |\-596: O: O1192 (predict-no)
  4286. I see 1 and I'm going to do: predict-no
  4287. ENV: Agent did: predict-no for direction U in state State-A
  4288. In State-A moving U
  4289. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4290. predict error 0
  4291. dir: dir isU
  4292. /|\597: O: O1194 (predict-no)
  4293. I see 1 and I'm going to do: predict-no
  4294. ENV: Agent did: predict-no for direction U in state State-A
  4295. In State-A moving U
  4296. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4297. predict error 0
  4298. dir: dir isL
  4299. -/598: O: O1196 (predict-no)
  4300. I see 1 and I'm going to do: predict-no
  4301. ENV: Agent did: predict-no for direction L in state State-A
  4302. In State-A moving L
  4303. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4304. predict error 0
  4305. dir: dir isL
  4306. |\599: O: O1198 (predict-no)
  4307. I see 1 and I'm going to do: predict-no
  4308. ENV: Agent did: predict-no for direction L in state State-A
  4309. In State-A moving L
  4310. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4311. predict error 0
  4312. dir: dir isU
  4313. -/|600: O: O1200 (predict-no)
  4314. I see 1 and I'm going to do: predict-no
  4315. ENV: Agent did: predict-no for direction U in state State-A
  4316. In State-A moving U
  4317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4318. predict error 0
  4319. dir: dir isU
  4320. \-/601: O: O1202 (predict-no)
  4321. I see 1 and I'm going to do: predict-no
  4322. ENV: Agent did: predict-no for direction U in state State-A
  4323. In State-A moving U
  4324. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4325. predict error 0
  4326. dir: dir isU
  4327. |602: O: O1204 (predict-no)
  4328. I see 1 and I'm going to do: predict-no
  4329. ENV: Agent did: predict-no for direction U in state State-A
  4330. In State-A moving U
  4331. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4332. predict error 0
  4333. dir: dir isL
  4334. \-/603: O: O1206 (predict-no)
  4335. I see 1 and I'm going to do: predict-no
  4336. ENV: Agent did: predict-no for direction L in state State-A
  4337. In State-A moving L
  4338. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4339. predict error 0
  4340. dir: dir isU
  4341. |604: O: O1208 (predict-no)
  4342. I see 1 and I'm going to do: predict-no
  4343. ENV: Agent did: predict-no for direction U in state State-A
  4344. In State-A moving U
  4345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4346. predict error 0
  4347. dir: dir isR
  4348. \-605: O: O1209 (predict-yes)
  4349. I see 1 and I'm going to do: predict-yes
  4350. ENV: Agent did: predict-yes for direction R in state State-A
  4351. In State-A moving R
  4352. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4353. predict error 0
  4354. dir: dir isL
  4355. /606: O: O1211 (predict-yes)
  4356. I see 1 and I'm going to do: predict-yes
  4357. ENV: Agent did: predict-yes for direction L in state State-B
  4358. In State-B moving L
  4359. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4360. predict error 0
  4361. dir: dir isR
  4362. |\-607: O: O1213 (predict-yes)
  4363. I see 1 and I'm going to do: predict-yes
  4364. ENV: Agent did: predict-yes for direction R in state State-A
  4365. In State-A moving R
  4366. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4367. predict error 0
  4368. dir: dir isU
  4369. /|\608: O: O1216 (predict-no)
  4370. I see 1 and I'm going to do: predict-no
  4371. ENV: Agent did: predict-no for direction U in state State-B
  4372. In State-B moving U
  4373. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4374. predict error 0
  4375. dir: dir isU
  4376. -/|609: O: O1218 (predict-no)
  4377. I see 1 and I'm going to do: predict-no
  4378. ENV: Agent did: predict-no for direction U in state State-B
  4379. In State-B moving U
  4380. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4381. predict error 0
  4382. dir: dir isL
  4383. \610: O: O1219 (predict-yes)
  4384. I see 1 and I'm going to do: predict-yes
  4385. ENV: Agent did: predict-yes for direction L in state State-B
  4386. In State-B moving L
  4387. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4388. predict error 0
  4389. dir: dir isR
  4390. -/|611: O: O1221 (predict-yes)
  4391. I see 1 and I'm going to do: predict-yes
  4392. ENV: Agent did: predict-yes for direction R in state State-A
  4393. In State-A moving R
  4394. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4395. predict error 0
  4396. dir: dir isL
  4397. \612: O: O1224 (predict-no)
  4398. I see 1 and I'm going to do: predict-no
  4399. ENV: Agent did: predict-no for direction L in state State-B
  4400. In State-B moving L
  4401. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  4402. predict error 1
  4403. dir: dir isU
  4404. -/|613: O: O1226 (predict-no)
  4405. I see 0 and I'm going to do: predict-no
  4406. ENV: Agent did: predict-no for direction U in state State-A
  4407. In State-A moving U
  4408. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4409. predict error 0
  4410. dir: dir isR
  4411. \-/614: O: O1227 (predict-yes)
  4412. I see 1 and I'm going to do: predict-yes
  4413. ENV: Agent did: predict-yes for direction R in state State-A
  4414. In State-A moving R
  4415. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4416. predict error 0
  4417. dir: dir isU
  4418. |\615: O: O1230 (predict-no)
  4419. I see 1 and I'm going to do: predict-no
  4420. ENV: Agent did: predict-no for direction U in state State-B
  4421. In State-B moving U
  4422. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4423. predict error 0
  4424. dir: dir isU
  4425. -/|616: O: O1232 (predict-no)
  4426. I see 1 and I'm going to do: predict-no
  4427. ENV: Agent did: predict-no for direction U in state State-B
  4428. In State-B moving U
  4429. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4430. predict error 0
  4431. dir: dir isR
  4432. \-/617: O: O1233 (predict-yes)
  4433. I see 1 and I'm going to do: predict-yes
  4434. ENV: Agent did: predict-yes for direction R in state State-B
  4435. In State-B moving R
  4436. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4437. predict error 1
  4438. dir: dir isL
  4439. |\-618: O: O1235 (predict-yes)
  4440. I see 0 and I'm going to do: predict-yes
  4441. ENV: Agent did: predict-yes for direction L in state State-B
  4442. In State-B moving L
  4443. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4444. predict error 0
  4445. dir: dir isR
  4446. /|\619: O: O1237 (predict-yes)
  4447. I see 1 and I'm going to do: predict-yes
  4448. ENV: Agent did: predict-yes for direction R in state State-A
  4449. In State-A moving R
  4450. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4451. predict error 0
  4452. dir: dir isL
  4453. -/|620: O: O1239 (predict-yes)
  4454. I see 1 and I'm going to do: predict-yes
  4455. ENV: Agent did: predict-yes for direction L in state State-B
  4456. In State-B moving L
  4457. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4458. predict error 0
  4459. dir: dir isL
  4460. \621: O: O1242 (predict-no)
  4461. I see 1 and I'm going to do: predict-no
  4462. ENV: Agent did: predict-no for direction L in state State-A
  4463. In State-A moving L
  4464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4465. predict error 0
  4466. dir: dir isU
  4467. -622: O: O1244 (predict-no)
  4468. I see 1 and I'm going to do: predict-no
  4469. ENV: Agent did: predict-no for direction U in state State-A
  4470. In State-A moving U
  4471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4472. predict error 0
  4473. dir: dir isR
  4474. /|\623: O: O1245 (predict-yes)
  4475. I see 1 and I'm going to do: predict-yes
  4476. ENV: Agent did: predict-yes for direction R in state State-A
  4477. In State-A moving R
  4478. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4479. predict error 0
  4480. dir: dir isU
  4481. -/|624: O: O1248 (predict-no)
  4482. I see 1 and I'm going to do: predict-no
  4483. ENV: Agent did: predict-no for direction U in state State-B
  4484. In State-B moving U
  4485. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4486. predict error 0
  4487. dir: dir isL
  4488. \-/625: O: O1249 (predict-yes)
  4489. I see 1 and I'm going to do: predict-yes
  4490. ENV: Agent did: predict-yes for direction L in state State-B
  4491. In State-B moving L
  4492. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4493. predict error 0
  4494. dir: dir isU
  4495. |626: O: O1252 (predict-no)
  4496. I see 1 and I'm going to do: predict-no
  4497. ENV: Agent did: predict-no for direction U in state State-A
  4498. In State-A moving U
  4499. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4500. predict error 0
  4501. dir: dir isU
  4502. \-627: O: O1254 (predict-no)
  4503. I see 1 and I'm going to do: predict-no
  4504. ENV: Agent did: predict-no for direction U in state State-A
  4505. In State-A moving U
  4506. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4507. predict error 0
  4508. dir: dir isL
  4509. /|\628: O: O1256 (predict-no)
  4510. I see 1 and I'm going to do: predict-no
  4511. ENV: Agent did: predict-no for direction L in state State-A
  4512. In State-A moving L
  4513. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4514. predict error 0
  4515. dir: dir isL
  4516. -/|629: O: O1258 (predict-no)
  4517. I see 1 and I'm going to do: predict-no
  4518. ENV: Agent did: predict-no for direction L in state State-A
  4519. In State-A moving L
  4520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4521. predict error 0
  4522. dir: dir isR
  4523. \-/630: O: O1259 (predict-yes)
  4524. I see 1 and I'm going to do: predict-yes
  4525. ENV: Agent did: predict-yes for direction R in state State-A
  4526. In State-A moving R
  4527. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4528. predict error 0
  4529. dir: dir isR
  4530. |\631: O: O1262 (predict-no)
  4531. I see 1 and I'm going to do: predict-no
  4532. ENV: Agent did: predict-no for direction R in state State-B
  4533. In State-B moving R
  4534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4535. predict error 0
  4536. dir: dir isL
  4537. -632: O: O1263 (predict-yes)
  4538. I see 1 and I'm going to do: predict-yes
  4539. ENV: Agent did: predict-yes for direction L in state State-B
  4540. In State-B moving L
  4541. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4542. predict error 0
  4543. dir: dir isL
  4544. /|633: O: O1266 (predict-no)
  4545. I see 1 and I'm going to do: predict-no
  4546. ENV: Agent did: predict-no for direction L in state State-A
  4547. In State-A moving L
  4548. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4549. predict error 0
  4550. dir: dir isL
  4551. \-/634: O: O1268 (predict-no)
  4552. I see 1 and I'm going to do: predict-no
  4553. ENV: Agent did: predict-no for direction L in state State-A
  4554. In State-A moving L
  4555. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4556. predict error 0
  4557. dir: dir isR
  4558. |\-635: O: O1269 (predict-yes)
  4559. I see 1 and I'm going to do: predict-yes
  4560. ENV: Agent did: predict-yes for direction R in state State-A
  4561. In State-A moving R
  4562. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4563. predict error 0
  4564. dir: dir isU
  4565. /|\636: O: O1272 (predict-no)
  4566. I see 1 and I'm going to do: predict-no
  4567. ENV: Agent did: predict-no for direction U in state State-B
  4568. In State-B moving U
  4569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4570. predict error 0
  4571. dir: dir isL
  4572. -/|637: O: O1273 (predict-yes)
  4573. I see 1 and I'm going to do: predict-yes
  4574. ENV: Agent did: predict-yes for direction L in state State-B
  4575. In State-B moving L
  4576. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4577. predict error 0
  4578. dir: dir isL
  4579. \-/638: O: O1276 (predict-no)
  4580. I see 1 and I'm going to do: predict-no
  4581. ENV: Agent did: predict-no for direction L in state State-A
  4582. In State-A moving L
  4583. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4584. predict error 0
  4585. dir: dir isU
  4586. |\-639: O: O1278 (predict-no)
  4587. I see 1 and I'm going to do: predict-no
  4588. ENV: Agent did: predict-no for direction U in state State-A
  4589. In State-A moving U
  4590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4591. predict error 0
  4592. dir: dir isU
  4593. /|\640: O: O1280 (predict-no)
  4594. I see 1 and I'm going to do: predict-no
  4595. ENV: Agent did: predict-no for direction U in state State-A
  4596. In State-A moving U
  4597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4598. predict error 0
  4599. dir: dir isU
  4600. -/|641: O: O1282 (predict-no)
  4601. I see 1 and I'm going to do: predict-no
  4602. ENV: Agent did: predict-no for direction U in state State-A
  4603. In State-A moving U
  4604. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4605. predict error 0
  4606. dir: dir isR
  4607. \642: O: O1283 (predict-yes)
  4608. I see 1 and I'm going to do: predict-yes
  4609. ENV: Agent did: predict-yes for direction R in state State-A
  4610. In State-A moving R
  4611. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4612. predict error 0
  4613. dir: dir isR
  4614. -/643: O: O1286 (predict-no)
  4615. I see 1 and I'm going to do: predict-no
  4616. ENV: Agent did: predict-no for direction R in state State-B
  4617. In State-B moving R
  4618. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4619. predict error 0
  4620. dir: dir isU
  4621. |\644: O: O1288 (predict-no)
  4622. I see 1 and I'm going to do: predict-no
  4623. ENV: Agent did: predict-no for direction U in state State-B
  4624. In State-B moving U
  4625. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4626. predict error 0
  4627. dir: dir isL
  4628. -/645: O: O1289 (predict-yes)
  4629. I see 1 and I'm going to do: predict-yes
  4630. ENV: Agent did: predict-yes for direction L in state State-B
  4631. In State-B moving L
  4632. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4633. predict error 0
  4634. dir: dir isU
  4635. |\-646: O: O1292 (predict-no)
  4636. I see 1 and I'm going to do: predict-no
  4637. ENV: Agent did: predict-no for direction U in state State-A
  4638. In State-A moving U
  4639. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4640. predict error 0
  4641. dir: dir isL
  4642. /647: O: O1294 (predict-no)
  4643. I see 1 and I'm going to do: predict-no
  4644. ENV: Agent did: predict-no for direction L in state State-A
  4645. In State-A moving L
  4646. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4647. predict error 0
  4648. dir: dir isR
  4649. |\648: O: O1295 (predict-yes)
  4650. I see 1 and I'm going to do: predict-yes
  4651. ENV: Agent did: predict-yes for direction R in state State-A
  4652. In State-A moving R
  4653. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4654. predict error 0
  4655. dir: dir isR
  4656. -649: O: O1298 (predict-no)
  4657. I see 1 and I'm going to do: predict-no
  4658. ENV: Agent did: predict-no for direction R in state State-B
  4659. In State-B moving R
  4660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4661. predict error 0
  4662. dir: dir isR
  4663. /|\650: O: O1300 (predict-no)
  4664. I see 1 and I'm going to do: predict-no
  4665. ENV: Agent did: predict-no for direction R in state State-B
  4666. In State-B moving R
  4667. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4668. predict error 0
  4669. dir: dir isL
  4670. -/|651: O: O1301 (predict-yes)
  4671. I see 1 and I'm going to do: predict-yes
  4672. ENV: Agent did: predict-yes for direction L in state State-B
  4673. In State-B moving L
  4674. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4675. predict error 0
  4676. dir: dir isL
  4677. \652: O: O1304 (predict-no)
  4678. I see 1 and I'm going to do: predict-no
  4679. ENV: Agent did: predict-no for direction L in state State-A
  4680. In State-A moving L
  4681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4682. predict error 0
  4683. dir: dir isU
  4684. -/|\653: O: O1306 (predict-no)
  4685. I see 1 and I'm going to do: predict-no
  4686. ENV: Agent did: predict-no for direction U in state State-A
  4687. In State-A moving U
  4688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4689. predict error 0
  4690. dir: dir isR
  4691. -/|654: O: O1308 (predict-no)
  4692. I see 1 and I'm going to do: predict-no
  4693. ENV: Agent did: predict-no for direction R in state State-A
  4694. In State-A moving R
  4695. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  4696. predict error 1
  4697. dir: dir isR
  4698. \-/655: O: O1310 (predict-no)
  4699. I see 0 and I'm going to do: predict-no
  4700. ENV: Agent did: predict-no for direction R in state State-B
  4701. In State-B moving R
  4702. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4703. predict error 0
  4704. dir: dir isL
  4705. |\-656: O: O1311 (predict-yes)
  4706. I see 1 and I'm going to do: predict-yes
  4707. ENV: Agent did: predict-yes for direction L in state State-B
  4708. In State-B moving L
  4709. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4710. predict error 0
  4711. dir: dir isU
  4712. /|\657: O: O1314 (predict-no)
  4713. I see 1 and I'm going to do: predict-no
  4714. ENV: Agent did: predict-no for direction U in state State-A
  4715. In State-A moving U
  4716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4717. predict error 0
  4718. dir: dir isL
  4719. -/658: O: O1316 (predict-no)
  4720. I see 1 and I'm going to do: predict-no
  4721. ENV: Agent did: predict-no for direction L in state State-A
  4722. In State-A moving L
  4723. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4724. predict error 0
  4725. dir: dir isR
  4726. |\-659: O: O1317 (predict-yes)
  4727. I see 1 and I'm going to do: predict-yes
  4728. ENV: Agent did: predict-yes for direction R in state State-A
  4729. In State-A moving R
  4730. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4731. predict error 0
  4732. dir: dir isU
  4733. /|\660: O: O1320 (predict-no)
  4734. I see 1 and I'm going to do: predict-no
  4735. ENV: Agent did: predict-no for direction U in state State-B
  4736. In State-B moving U
  4737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4738. predict error 0
  4739. dir: dir isU
  4740. -/661: O: O1322 (predict-no)
  4741. I see 1 and I'm going to do: predict-no
  4742. ENV: Agent did: predict-no for direction U in state State-B
  4743. In State-B moving U
  4744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4745. predict error 0
  4746. dir: dir isL
  4747. |662: O: O1323 (predict-yes)
  4748. I see 1 and I'm going to do: predict-yes
  4749. ENV: Agent did: predict-yes for direction L in state State-B
  4750. In State-B moving L
  4751. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4752. predict error 0
  4753. dir: dir isU
  4754. \-/663: O: O1326 (predict-no)
  4755. I see 1 and I'm going to do: predict-no
  4756. ENV: Agent did: predict-no for direction U in state State-A
  4757. In State-A moving U
  4758. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4759. predict error 0
  4760. dir: dir isU
  4761. |\664: O: O1328 (predict-no)
  4762. I see 1 and I'm going to do: predict-no
  4763. ENV: Agent did: predict-no for direction U in state State-A
  4764. In State-A moving U
  4765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4766. predict error 0
  4767. dir: dir isL
  4768. -665: O: O1330 (predict-no)
  4769. I see 1 and I'm going to do: predict-no
  4770. ENV: Agent did: predict-no for direction L in state State-A
  4771. In State-A moving L
  4772. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4773. predict error 0
  4774. dir: dir isR
  4775. /|\666: O: O1331 (predict-yes)
  4776. I see 1 and I'm going to do: predict-yes
  4777. ENV: Agent did: predict-yes for direction R in state State-A
  4778. In State-A moving R
  4779. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4780. predict error 0
  4781. dir: dir isR
  4782. -667: O: O1334 (predict-no)
  4783. I see 1 and I'm going to do: predict-no
  4784. ENV: Agent did: predict-no for direction R in state State-B
  4785. In State-B moving R
  4786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4787. predict error 0
  4788. dir: dir isU
  4789. /|668: O: O1336 (predict-no)
  4790. I see 1 and I'm going to do: predict-no
  4791. ENV: Agent did: predict-no for direction U in state State-B
  4792. In State-B moving U
  4793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4794. predict error 0
  4795. dir: dir isR
  4796. \-/669: O: O1338 (predict-no)
  4797. I see 1 and I'm going to do: predict-no
  4798. ENV: Agent did: predict-no for direction R in state State-B
  4799. In State-B moving R
  4800. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4801. predict error 0
  4802. dir: dir isU
  4803. |\-670: O: O1340 (predict-no)
  4804. I see 1 and I'm going to do: predict-no
  4805. ENV: Agent did: predict-no for direction U in state State-B
  4806. In State-B moving U
  4807. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4808. predict error 0
  4809. dir: dir isU
  4810. /|\671: O: O1341 (predict-yes)
  4811. I see 1 and I'm going to do: predict-yes
  4812. ENV: Agent did: predict-yes for direction U in state State-B
  4813. In State-B moving U
  4814. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4815. predict error 1
  4816. dir: dir isL
  4817. -672: O: O1343 (predict-yes)
  4818. I see 0 and I'm going to do: predict-yes
  4819. ENV: Agent did: predict-yes for direction L in state State-B
  4820. In State-B moving L
  4821. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4822. predict error 0
  4823. dir: dir isU
  4824. /|673: O: O1346 (predict-no)
  4825. I see 1 and I'm going to do: predict-no
  4826. ENV: Agent did: predict-no for direction U in state State-A
  4827. In State-A moving U
  4828. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4829. predict error 0
  4830. dir: dir isL
  4831. \-/674: O: O1348 (predict-no)
  4832. I see 1 and I'm going to do: predict-no
  4833. ENV: Agent did: predict-no for direction L in state State-A
  4834. In State-A moving L
  4835. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4836. predict error 0
  4837. dir: dir isL
  4838. |\-675: O: O1350 (predict-no)
  4839. I see 1 and I'm going to do: predict-no
  4840. ENV: Agent did: predict-no for direction L in state State-A
  4841. In State-A moving L
  4842. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4843. predict error 0
  4844. dir: dir isR
  4845. /676: O: O1351 (predict-yes)
  4846. I see 1 and I'm going to do: predict-yes
  4847. ENV: Agent did: predict-yes for direction R in state State-A
  4848. In State-A moving R
  4849. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4850. predict error 0
  4851. dir: dir isL
  4852. |\-677: O: O1353 (predict-yes)
  4853. I see 1 and I'm going to do: predict-yes
  4854. ENV: Agent did: predict-yes for direction L in state State-B
  4855. In State-B moving L
  4856. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4857. predict error 0
  4858. dir: dir isR
  4859. /|678: O: O1355 (predict-yes)
  4860. I see 1 and I'm going to do: predict-yes
  4861. ENV: Agent did: predict-yes for direction R in state State-A
  4862. In State-A moving R
  4863. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4864. predict error 0
  4865. dir: dir isL
  4866. \-/679: O: O1357 (predict-yes)
  4867. I see 1 and I'm going to do: predict-yes
  4868. ENV: Agent did: predict-yes for direction L in state State-B
  4869. In State-B moving L
  4870. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4871. predict error 0
  4872. dir: dir isR
  4873. |680: O: O1359 (predict-yes)
  4874. I see 1 and I'm going to do: predict-yes
  4875. ENV: Agent did: predict-yes for direction R in state State-A
  4876. In State-A moving R
  4877. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4878. predict error 0
  4879. dir: dir isU
  4880. \-/681: O: O1362 (predict-no)
  4881. I see 1 and I'm going to do: predict-no
  4882. ENV: Agent did: predict-no for direction U in state State-B
  4883. In State-B moving U
  4884. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4885. predict error 0
  4886. dir: dir isU
  4887. |682: O: O1364 (predict-no)
  4888. I see 1 and I'm going to do: predict-no
  4889. ENV: Agent did: predict-no for direction U in state State-B
  4890. In State-B moving U
  4891. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4892. predict error 0
  4893. dir: dir isL
  4894. \-/683: O: O1365 (predict-yes)
  4895. I see 1 and I'm going to do: predict-yes
  4896. ENV: Agent did: predict-yes for direction L in state State-B
  4897. In State-B moving L
  4898. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4899. predict error 0
  4900. dir: dir isL
  4901. |\-684: O: O1368 (predict-no)
  4902. I see 1 and I'm going to do: predict-no
  4903. ENV: Agent did: predict-no for direction L in state State-A
  4904. In State-A moving L
  4905. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4906. predict error 0
  4907. dir: dir isU
  4908. /|\685: O: O1370 (predict-no)
  4909. I see 1 and I'm going to do: predict-no
  4910. ENV: Agent did: predict-no for direction U in state State-A
  4911. In State-A moving U
  4912. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4913. predict error 0
  4914. dir: dir isL
  4915. -/686: O: O1372 (predict-no)
  4916. I see 1 and I'm going to do: predict-no
  4917. ENV: Agent did: predict-no for direction L in state State-A
  4918. In State-A moving L
  4919. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4920. predict error 0
  4921. dir: dir isL
  4922. |\-687: O: O1374 (predict-no)
  4923. I see 1 and I'm going to do: predict-no
  4924. ENV: Agent did: predict-no for direction L in state State-A
  4925. In State-A moving L
  4926. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4927. predict error 0
  4928. dir: dir isL
  4929. /688: O: O1376 (predict-no)
  4930. I see 1 and I'm going to do: predict-no
  4931. ENV: Agent did: predict-no for direction L in state State-A
  4932. In State-A moving L
  4933. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4934. predict error 0
  4935. dir: dir isL
  4936. |\-689: O: O1378 (predict-no)
  4937. I see 1 and I'm going to do: predict-no
  4938. ENV: Agent did: predict-no for direction L in state State-A
  4939. In State-A moving L
  4940. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4941. predict error 0
  4942. dir: dir isL
  4943. /|\690: O: O1380 (predict-no)
  4944. I see 1 and I'm going to do: predict-no
  4945. ENV: Agent did: predict-no for direction L in state State-A
  4946. In State-A moving L
  4947. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4948. predict error 0
  4949. dir: dir isR
  4950. -/|691: O: O1381 (predict-yes)
  4951. I see 1 and I'm going to do: predict-yes
  4952. ENV: Agent did: predict-yes for direction R in state State-A
  4953. In State-A moving R
  4954. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4955. predict error 0
  4956. dir: dir isU
  4957. \692: O: O1384 (predict-no)
  4958. I see 1 and I'm going to do: predict-no
  4959. ENV: Agent did: predict-no for direction U in state State-B
  4960. In State-B moving U
  4961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4962. predict error 0
  4963. dir: dir isU
  4964. -/|\693: O: O1386 (predict-no)
  4965. I see 1 and I'm going to do: predict-no
  4966. ENV: Agent did: predict-no for direction U in state State-B
  4967. In State-B moving U
  4968. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4969. predict error 0
  4970. dir: dir isU
  4971. -/|694: O: O1388 (predict-no)
  4972. I see 1 and I'm going to do: predict-no
  4973. ENV: Agent did: predict-no for direction U in state State-B
  4974. In State-B moving U
  4975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4976. predict error 0
  4977. dir: dir isR
  4978. \-695: O: O1390 (predict-no)
  4979. I see 1 and I'm going to do: predict-no
  4980. ENV: Agent did: predict-no for direction R in state State-B
  4981. In State-B moving R
  4982. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4983. predict error 0
  4984. dir: dir isR
  4985. /|\696: O: O1392 (predict-no)
  4986. I see 1 and I'm going to do: predict-no
  4987. ENV: Agent did: predict-no for direction R in state State-B
  4988. In State-B moving R
  4989. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4990. predict error 0
  4991. dir: dir isR
  4992. -/697: O: O1394 (predict-no)
  4993. I see 1 and I'm going to do: predict-no
  4994. ENV: Agent did: predict-no for direction R in state State-B
  4995. In State-B moving R
  4996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4997. predict error 0
  4998. dir: dir isU
  4999. |\-698: O: O1396 (predict-no)
  5000. I see 1 and I'm going to do: predict-no
  5001. ENV: Agent did: predict-no for direction U in state State-B
  5002. In State-B moving U
  5003. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5004. predict error 0
  5005. dir: dir isR
  5006. /|\699: O: O1398 (predict-no)
  5007. I see 1 and I'm going to do: predict-no
  5008. ENV: Agent did: predict-no for direction R in state State-B
  5009. In State-B moving R
  5010. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5011. predict error 0
  5012. dir: dir isL
  5013. -/|700: O: O1399 (predict-yes)
  5014. I see 1 and I'm going to do: predict-yes
  5015. ENV: Agent did: predict-yes for direction L in state State-B
  5016. In State-B moving L
  5017. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5018. predict error 0
  5019. dir: dir isL
  5020. \-701: O: O1402 (predict-no)
  5021. I see 1 and I'm going to do: predict-no
  5022. ENV: Agent did: predict-no for direction L in state State-A
  5023. In State-A moving L
  5024. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5025. predict error 0
  5026. dir: dir isU
  5027. /702: O: O1404 (predict-no)
  5028. I see 1 and I'm going to do: predict-no
  5029. ENV: Agent did: predict-no for direction U in state State-A
  5030. In State-A moving U
  5031. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5032. predict error 0
  5033. dir: dir isR
  5034. |\703: O: O1405 (predict-yes)
  5035. I see 1 and I'm going to do: predict-yes
  5036. ENV: Agent did: predict-yes for direction R in state State-A
  5037. In State-A moving R
  5038. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5039. predict error 0
  5040. dir: dir isR
  5041. -/|704: O: O1408 (predict-no)
  5042. I see 1 and I'm going to do: predict-no
  5043. ENV: Agent did: predict-no for direction R in state State-B
  5044. In State-B moving R
  5045. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5046. predict error 0
  5047. dir: dir isR
  5048. \-/705: O: O1409 (predict-yes)
  5049. I see 1 and I'm going to do: predict-yes
  5050. ENV: Agent did: predict-yes for direction R in state State-B
  5051. In State-B moving R
  5052. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  5053. predict error 1
  5054. dir: dir isR
  5055. |\-706: O: O1412 (predict-no)
  5056. I see 0 and I'm going to do: predict-no
  5057. ENV: Agent did: predict-no for direction R in state State-B
  5058. In State-B moving R
  5059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5060. predict error 0
  5061. dir: dir isR
  5062. /|\707: O: O1414 (predict-no)
  5063. I see 1 and I'm going to do: predict-no
  5064. ENV: Agent did: predict-no for direction R in state State-B
  5065. In State-B moving R
  5066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5067. predict error 0
  5068. dir: dir isL
  5069. -708: O: O1415 (predict-yes)
  5070. I see 1 and I'm going to do: predict-yes
  5071. ENV: Agent did: predict-yes for direction L in state State-B
  5072. In State-B moving L
  5073. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5074. predict error 0
  5075. dir: dir isR
  5076. /|\709: O: O1417 (predict-yes)
  5077. I see 1 and I'm going to do: predict-yes
  5078. ENV: Agent did: predict-yes for direction R in state State-A
  5079. In State-A moving R
  5080. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5081. predict error 0
  5082. dir: dir isR
  5083. -710: O: O1420 (predict-no)
  5084. I see 1 and I'm going to do: predict-no
  5085. ENV: Agent did: predict-no for direction R in state State-B
  5086. In State-B moving R
  5087. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5088. predict error 0
  5089. dir: dir isL
  5090. /|\711: O: O1421 (predict-yes)
  5091. I see 1 and I'm going to do: predict-yes
  5092. ENV: Agent did: predict-yes for direction L in state State-B
  5093. In State-B moving L
  5094. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5095. predict error 0
  5096. dir: dir isU
  5097. -712: O: O1424 (predict-no)
  5098. I see 1 and I'm going to do: predict-no
  5099. ENV: Agent did: predict-no for direction U in state State-A
  5100. In State-A moving U
  5101. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5102. predict error 0
  5103. dir: dir isR
  5104. /|713: O: O1425 (predict-yes)
  5105. I see 1 and I'm going to do: predict-yes
  5106. ENV: Agent did: predict-yes for direction R in state State-A
  5107. In State-A moving R
  5108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5109. predict error 0
  5110. dir: dir isR
  5111. \-714: O: O1428 (predict-no)
  5112. I see 1 and I'm going to do: predict-no
  5113. ENV: Agent did: predict-no for direction R in state State-B
  5114. In State-B moving R
  5115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5116. predict error 0
  5117. dir: dir isU
  5118. /|\715: O: O1430 (predict-no)
  5119. I see 1 and I'm going to do: predict-no
  5120. ENV: Agent did: predict-no for direction U in state State-B
  5121. In State-B moving U
  5122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5123. predict error 0
  5124. dir: dir isU
  5125. -/|\716: O: O1432 (predict-no)
  5126. I see 1 and I'm going to do: predict-no
  5127. ENV: Agent did: predict-no for direction U in state State-B
  5128. In State-B moving U
  5129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5130. predict error 0
  5131. dir: dir isU
  5132. -/|\717: O: O1434 (predict-no)
  5133. I see 1 and I'm going to do: predict-no
  5134. ENV: Agent did: predict-no for direction U in state State-B
  5135. In State-B moving U
  5136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5137. predict error 0
  5138. dir: dir isU
  5139. -/|718: O: O1436 (predict-no)
  5140. I see 1 and I'm going to do: predict-no
  5141. ENV: Agent did: predict-no for direction U in state State-B
  5142. In State-B moving U
  5143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5144. predict error 0
  5145. dir: dir isL
  5146. \-719: O: O1437 (predict-yes)
  5147. I see 1 and I'm going to do: predict-yes
  5148. ENV: Agent did: predict-yes for direction L in state State-B
  5149. In State-B moving L
  5150. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5151. predict error 0
  5152. dir: dir isU
  5153. /|720: O: O1440 (predict-no)
  5154. I see 1 and I'm going to do: predict-no
  5155. ENV: Agent did: predict-no for direction U in state State-A
  5156. In State-A moving U
  5157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5158. predict error 0
  5159. dir: dir isL
  5160. \-721: O: O1442 (predict-no)
  5161. I see 1 and I'm going to do: predict-no
  5162. ENV: Agent did: predict-no for direction L in state State-A
  5163. In State-A moving L
  5164. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5165. predict error 0
  5166. dir: dir isU
  5167. /722: O: O1444 (predict-no)
  5168. I see 1 and I'm going to do: predict-no
  5169. ENV: Agent did: predict-no for direction U in state State-A
  5170. In State-A moving U
  5171. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5172. predict error 0
  5173. dir: dir isU
  5174. |\-723: O: O1446 (predict-no)
  5175. I see 1 and I'm going to do: predict-no
  5176. ENV: Agent did: predict-no for direction U in state State-A
  5177. In State-A moving U
  5178. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5179. predict error 0
  5180. dir: dir isU
  5181. /|\724: O: O1448 (predict-no)
  5182. I see 1 and I'm going to do: predict-no
  5183. ENV: Agent did: predict-no for direction U in state State-A
  5184. In State-A moving U
  5185. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5186. predict error 0
  5187. dir: dir isL
  5188. -/|725: O: O1450 (predict-no)
  5189. I see 1 and I'm going to do: predict-no
  5190. ENV: Agent did: predict-no for direction L in state State-A
  5191. In State-A moving L
  5192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5193. predict error 0
  5194. dir: dir isL
  5195. \-/|726: O: O1452 (predict-no)
  5196. I see 1 and I'm going to do: predict-no
  5197. ENV: Agent did: predict-no for direction L in state State-A
  5198. In State-A moving L
  5199. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5200. predict error 0
  5201. dir: dir isU
  5202. \-/727: O: O1454 (predict-no)
  5203. I see 1 and I'm going to do: predict-no
  5204. ENV: Agent did: predict-no for direction U in state State-A
  5205. In State-A moving U
  5206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5207. predict error 0
  5208. dir: dir isR
  5209. |\-728: O: O1455 (predict-yes)
  5210. I see 1 and I'm going to do: predict-yes
  5211. ENV: Agent did: predict-yes for direction R in state State-A
  5212. In State-A moving R
  5213. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5214. predict error 0
  5215. dir: dir isR
  5216. /|\729: O: O1458 (predict-no)
  5217. I see 1 and I'm going to do: predict-no
  5218. ENV: Agent did: predict-no for direction R in state State-B
  5219. In State-B moving R
  5220. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5221. predict error 0
  5222. dir: dir isU
  5223. -/730: O: O1460 (predict-no)
  5224. I see 1 and I'm going to do: predict-no
  5225. ENV: Agent did: predict-no for direction U in state State-B
  5226. In State-B moving U
  5227. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5228. predict error 0
  5229. dir: dir isL
  5230. |\-731: O: O1461 (predict-yes)
  5231. I see 1 and I'm going to do: predict-yes
  5232. ENV: Agent did: predict-yes for direction L in state State-B
  5233. In State-B moving L
  5234. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5235. predict error 0
  5236. dir: dir isR
  5237. /732: O: O1463 (predict-yes)
  5238. I see 1 and I'm going to do: predict-yes
  5239. ENV: Agent did: predict-yes for direction R in state State-A
  5240. In State-A moving R
  5241. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5242. predict error 0
  5243. dir: dir isR
  5244. |\733: O: O1466 (predict-no)
  5245. I see 1 and I'm going to do: predict-no
  5246. ENV: Agent did: predict-no for direction R in state State-B
  5247. In State-B moving R
  5248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5249. predict error 0
  5250. dir: dir isL
  5251. -/|734: O: O1467 (predict-yes)
  5252. I see 1 and I'm going to do: predict-yes
  5253. ENV: Agent did: predict-yes for direction L in state State-B
  5254. In State-B moving L
  5255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5256. predict error 0
  5257. dir: dir isR
  5258. \-/735: O: O1469 (predict-yes)
  5259. I see 1 and I'm going to do: predict-yes
  5260. ENV: Agent did: predict-yes for direction R in state State-A
  5261. In State-A moving R
  5262. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5263. predict error 0
  5264. dir: dir isU
  5265. |\-/736: O: O1472 (predict-no)
  5266. I see 1 and I'm going to do: predict-no
  5267. ENV: Agent did: predict-no for direction U in state State-B
  5268. In State-B moving U
  5269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5270. predict error 0
  5271. dir: dir isU
  5272. |\737: O: O1474 (predict-no)
  5273. I see 1 and I'm going to do: predict-no
  5274. ENV: Agent did: predict-no for direction U in state State-B
  5275. In State-B moving U
  5276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5277. predict error 0
  5278. dir: dir isL
  5279. -/738: O: O1475 (predict-yes)
  5280. I see 1 and I'm going to do: predict-yes
  5281. ENV: Agent did: predict-yes for direction L in state State-B
  5282. In State-B moving L
  5283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5284. predict error 0
  5285. dir: dir isR
  5286. |\-739: O: O1477 (predict-yes)
  5287. I see 1 and I'm going to do: predict-yes
  5288. ENV: Agent did: predict-yes for direction R in state State-A
  5289. In State-A moving R
  5290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5291. predict error 0
  5292. dir: dir isL
  5293. /|\740: O: O1479 (predict-yes)
  5294. I see 1 and I'm going to do: predict-yes
  5295. ENV: Agent did: predict-yes for direction L in state State-B
  5296. In State-B moving L
  5297. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5298. predict error 0
  5299. dir: dir isU
  5300. -/741: O: O1482 (predict-no)
  5301. I see 1 and I'm going to do: predict-no
  5302. ENV: Agent did: predict-no for direction U in state State-A
  5303. In State-A moving U
  5304. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5305. predict error 0
  5306. dir: dir isL
  5307. |742: O: O1484 (predict-no)
  5308. I see 1 and I'm going to do: predict-no
  5309. ENV: Agent did: predict-no for direction L in state State-A
  5310. In State-A moving L
  5311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5312. predict error 0
  5313. dir: dir isL
  5314. \-743: O: O1486 (predict-no)
  5315. I see 1 and I'm going to do: predict-no
  5316. ENV: Agent did: predict-no for direction L in state State-A
  5317. In State-A moving L
  5318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5319. predict error 0
  5320. dir: dir isR
  5321. /|\744: O: O1487 (predict-yes)
  5322. I see 1 and I'm going to do: predict-yes
  5323. ENV: Agent did: predict-yes for direction R in state State-A
  5324. In State-A moving R
  5325. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5326. predict error 0
  5327. dir: dir isU
  5328. -/|745: O: O1490 (predict-no)
  5329. I see 1 and I'm going to do: predict-no
  5330. ENV: Agent did: predict-no for direction U in state State-B
  5331. In State-B moving U
  5332. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5333. predict error 0
  5334. dir: dir isL
  5335. \-746: O: O1491 (predict-yes)
  5336. I see 1 and I'm going to do: predict-yes
  5337. ENV: Agent did: predict-yes for direction L in state State-B
  5338. In State-B moving L
  5339. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5340. predict error 0
  5341. dir: dir isL
  5342. /|\747: O: O1494 (predict-no)
  5343. I see 1 and I'm going to do: predict-no
  5344. ENV: Agent did: predict-no for direction L in state State-A
  5345. In State-A moving L
  5346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5347. predict error 0
  5348. dir: dir isU
  5349. -/|748: O: O1496 (predict-no)
  5350. I see 1 and I'm going to do: predict-no
  5351. ENV: Agent did: predict-no for direction U in state State-A
  5352. In State-A moving U
  5353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5354. predict error 0
  5355. dir: dir isU
  5356. \-/749: O: O1498 (predict-no)
  5357. I see 1 and I'm going to do: predict-no
  5358. ENV: Agent did: predict-no for direction U in state State-A
  5359. In State-A moving U
  5360. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5361. predict error 0
  5362. dir: dir isU
  5363. |\-750: O: O1500 (predict-no)
  5364. I see 1 and I'm going to do: predict-no
  5365. ENV: Agent did: predict-no for direction U in state State-A
  5366. In State-A moving U
  5367. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5368. predict error 0
  5369. dir: dir isL
  5370. /|\751: O: O1502 (predict-no)
  5371. I see 1 and I'm going to do: predict-no
  5372. ENV: Agent did: predict-no for direction L in state State-A
  5373. In State-A moving L
  5374. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5375. predict error 0
  5376. dir: dir isR
  5377. -752: O: O1503 (predict-yes)
  5378. I see 1 and I'm going to do: predict-yes
  5379. ENV: Agent did: predict-yes for direction R in state State-A
  5380. In State-A moving R
  5381. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5382. predict error 0
  5383. dir: dir isL
  5384. /|753: O: O1505 (predict-yes)
  5385. I see 1 and I'm going to do: predict-yes
  5386. ENV: Agent did: predict-yes for direction L in state State-B
  5387. In State-B moving L
  5388. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5389. predict error 0
  5390. dir: dir isR
  5391. \-/754: O: O1507 (predict-yes)
  5392. I see 1 and I'm going to do: predict-yes
  5393. ENV: Agent did: predict-yes for direction R in state State-A
  5394. In State-A moving R
  5395. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5396. predict error 0
  5397. dir: dir isL
  5398. |\-755: O: O1509 (predict-yes)
  5399. I see 1 and I'm going to do: predict-yes
  5400. ENV: Agent did: predict-yes for direction L in state State-B
  5401. In State-B moving L
  5402. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5403. predict error 0
  5404. dir: dir isR
  5405. /|\756: O: O1511 (predict-yes)
  5406. I see 1 and I'm going to do: predict-yes
  5407. ENV: Agent did: predict-yes for direction R in state State-A
  5408. In State-A moving R
  5409. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5410. predict error 0
  5411. dir: dir isU
  5412. -/|757: O: O1514 (predict-no)
  5413. I see 1 and I'm going to do: predict-no
  5414. ENV: Agent did: predict-no for direction U in state State-B
  5415. In State-B moving U
  5416. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5417. predict error 0
  5418. dir: dir isU
  5419. \-/758: O: O1516 (predict-no)
  5420. I see 1 and I'm going to do: predict-no
  5421. ENV: Agent did: predict-no for direction U in state State-B
  5422. In State-B moving U
  5423. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5424. predict error 0
  5425. dir: dir isR
  5426. |\-759: O: O1518 (predict-no)
  5427. I see 1 and I'm going to do: predict-no
  5428. ENV: Agent did: predict-no for direction R in state State-B
  5429. In State-B moving R
  5430. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5431. predict error 0
  5432. dir: dir isL
  5433. /|\760: O: O1519 (predict-yes)
  5434. I see 1 and I'm going to do: predict-yes
  5435. ENV: Agent did: predict-yes for direction L in state State-B
  5436. In State-B moving L
  5437. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5438. predict error 0
  5439. dir: dir isR
  5440. -/|761: O: O1521 (predict-yes)
  5441. I see 1 and I'm going to do: predict-yes
  5442. ENV: Agent did: predict-yes for direction R in state State-A
  5443. In State-A moving R
  5444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5445. predict error 0
  5446. dir: dir isR
  5447. \762: O: O1524 (predict-no)
  5448. I see 1 and I'm going to do: predict-no
  5449. ENV: Agent did: predict-no for direction R in state State-B
  5450. In State-B moving R
  5451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5452. predict error 0
  5453. dir: dir isU
  5454. -/|763: O: O1526 (predict-no)
  5455. I see 1 and I'm going to do: predict-no
  5456. ENV: Agent did: predict-no for direction U in state State-B
  5457. In State-B moving U
  5458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5459. predict error 0
  5460. dir: dir isU
  5461. \-/764: O: O1528 (predict-no)
  5462. I see 1 and I'm going to do: predict-no
  5463. ENV: Agent did: predict-no for direction U in state State-B
  5464. In State-B moving U
  5465. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5466. predict error 0
  5467. dir: dir isU
  5468. |\-765: O: O1530 (predict-no)
  5469. I see 1 and I'm going to do: predict-no
  5470. ENV: Agent did: predict-no for direction U in state State-B
  5471. In State-B moving U
  5472. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5473. predict error 0
  5474. dir: dir isU
  5475. /766: O: O1532 (predict-no)
  5476. I see 1 and I'm going to do: predict-no
  5477. ENV: Agent did: predict-no for direction U in state State-B
  5478. In State-B moving U
  5479. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5480. predict error 0
  5481. dir: dir isR
  5482. |767: O: O1534 (predict-no)
  5483. I see 1 and I'm going to do: predict-no
  5484. ENV: Agent did: predict-no for direction R in state State-B
  5485. In State-B moving R
  5486. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5487. predict error 0
  5488. dir: dir isU
  5489. \-/768: O: O1536 (predict-no)
  5490. I see 1 and I'm going to do: predict-no
  5491. ENV: Agent did: predict-no for direction U in state State-B
  5492. In State-B moving U
  5493. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5494. predict error 0
  5495. dir: dir isU
  5496. |\-769: O: O1538 (predict-no)
  5497. I see 1 and I'm going to do: predict-no
  5498. ENV: Agent did: predict-no for direction U in state State-B
  5499. In State-B moving U
  5500. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5501. predict error 0
  5502. dir: dir isL
  5503. /|770: O: O1539 (predict-yes)
  5504. I see 1 and I'm going to do: predict-yes
  5505. ENV: Agent did: predict-yes for direction L in state State-B
  5506. In State-B moving L
  5507. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5508. predict error 0
  5509. dir: dir isL
  5510. \771: O: O1542 (predict-no)
  5511. I see 1 and I'm going to do: predict-no
  5512. ENV: Agent did: predict-no for direction L in state State-A
  5513. In State-A moving L
  5514. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5515. predict error 0
  5516. dir: dir isR
  5517. -772: O: O1543 (predict-yes)
  5518. I see 1 and I'm going to do: predict-yes
  5519. ENV: Agent did: predict-yes for direction R in state State-A
  5520. In State-A moving R
  5521. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5522. predict error 0
  5523. dir: dir isR
  5524. /|773: O: O1546 (predict-no)
  5525. I see 1 and I'm going to do: predict-no
  5526. ENV: Agent did: predict-no for direction R in state State-B
  5527. In State-B moving R
  5528. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5529. predict error 0
  5530. dir: dir isL
  5531. \-/|774: O: O1547 (predict-yes)
  5532. I see 1 and I'm going to do: predict-yes
  5533. ENV: Agent did: predict-yes for direction L in state State-B
  5534. In State-B moving L
  5535. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5536. predict error 0
  5537. dir: dir isR
  5538. \-/775: O: O1549 (predict-yes)
  5539. I see 1 and I'm going to do: predict-yes
  5540. ENV: Agent did: predict-yes for direction R in state State-A
  5541. In State-A moving R
  5542. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5543. predict error 0
  5544. dir: dir isR
  5545. |\-776: O: O1552 (predict-no)
  5546. I see 1 and I'm going to do: predict-no
  5547. ENV: Agent did: predict-no for direction R in state State-B
  5548. In State-B moving R
  5549. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5550. predict error 0
  5551. dir: dir isL
  5552. /|\777: O: O1553 (predict-yes)
  5553. I see 1 and I'm going to do: predict-yes
  5554. ENV: Agent did: predict-yes for direction L in state State-B
  5555. In State-B moving L
  5556. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5557. predict error 0
  5558. dir: dir isU
  5559. -/778: O: O1556 (predict-no)
  5560. I see 1 and I'm going to do: predict-no
  5561. ENV: Agent did: predict-no for direction U in state State-A
  5562. In State-A moving U
  5563. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5564. predict error 0
  5565. dir: dir isR
  5566. |\-779: O: O1557 (predict-yes)
  5567. I see 1 and I'm going to do: predict-yes
  5568. ENV: Agent did: predict-yes for direction R in state State-A
  5569. In State-A moving R
  5570. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5571. predict error 0
  5572. dir: dir isL
  5573. /|\780: O: O1559 (predict-yes)
  5574. I see 1 and I'm going to do: predict-yes
  5575. ENV: Agent did: predict-yes for direction L in state State-B
  5576. In State-B moving L
  5577. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5578. predict error 0
  5579. dir: dir isL
  5580. -/|781: O: O1562 (predict-no)
  5581. I see 1 and I'm going to do: predict-no
  5582. ENV: Agent did: predict-no for direction L in state State-A
  5583. In State-A moving L
  5584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5585. predict error 0
  5586. dir: dir isR
  5587. \782: O: O1563 (predict-yes)
  5588. I see 1 and I'm going to do: predict-yes
  5589. ENV: Agent did: predict-yes for direction R in state State-A
  5590. In State-A moving R
  5591. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5592. predict error 0
  5593. dir: dir isL
  5594. -/783: O: O1565 (predict-yes)
  5595. I see 1 and I'm going to do: predict-yes
  5596. ENV: Agent did: predict-yes for direction L in state State-B
  5597. In State-B moving L
  5598. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5599. predict error 0
  5600. dir: dir isU
  5601. |\-784: O: O1568 (predict-no)
  5602. I see 1 and I'm going to do: predict-no
  5603. ENV: Agent did: predict-no for direction U in state State-A
  5604. In State-A moving U
  5605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5606. predict error 0
  5607. dir: dir isR
  5608. /|785: O: O1569 (predict-yes)
  5609. I see 1 and I'm going to do: predict-yes
  5610. ENV: Agent did: predict-yes for direction R in state State-A
  5611. In State-A moving R
  5612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5613. predict error 0
  5614. dir: dir isR
  5615. \786: O: O1572 (predict-no)
  5616. I see 1 and I'm going to do: predict-no
  5617. ENV: Agent did: predict-no for direction R in state State-B
  5618. In State-B moving R
  5619. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5620. predict error 0
  5621. dir: dir isL
  5622. -/787: O: O1573 (predict-yes)
  5623. I see 1 and I'm going to do: predict-yes
  5624. ENV: Agent did: predict-yes for direction L in state State-B
  5625. In State-B moving L
  5626. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5627. predict error 0
  5628. dir: dir isU
  5629. |\-788: O: O1576 (predict-no)
  5630. I see 1 and I'm going to do: predict-no
  5631. ENV: Agent did: predict-no for direction U in state State-A
  5632. In State-A moving U
  5633. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5634. predict error 0
  5635. dir: dir isL
  5636. /|\789: O: O1578 (predict-no)
  5637. I see 1 and I'm going to do: predict-no
  5638. ENV: Agent did: predict-no for direction L in state State-A
  5639. In State-A moving L
  5640. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5641. predict error 0
  5642. dir: dir isL
  5643. -/790: O: O1580 (predict-no)
  5644. I see 1 and I'm going to do: predict-no
  5645. ENV: Agent did: predict-no for direction L in state State-A
  5646. In State-A moving L
  5647. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5648. predict error 0
  5649. dir: dir isL
  5650. |\-791: O: O1582 (predict-no)
  5651. I see 1 and I'm going to do: predict-no
  5652. ENV: Agent did: predict-no for direction L in state State-A
  5653. In State-A moving L
  5654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5655. predict error 0
  5656. dir: dir isU
  5657. /792: O: O1584 (predict-no)
  5658. I see 1 and I'm going to do: predict-no
  5659. ENV: Agent did: predict-no for direction U in state State-A
  5660. In State-A moving U
  5661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5662. predict error 0
  5663. dir: dir isR
  5664. |\-793: O: O1585 (predict-yes)
  5665. I see 1 and I'm going to do: predict-yes
  5666. ENV: Agent did: predict-yes for direction R in state State-A
  5667. In State-A moving R
  5668. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5669. predict error 0
  5670. dir: dir isU
  5671. /|794: O: O1588 (predict-no)
  5672. I see 1 and I'm going to do: predict-no
  5673. ENV: Agent did: predict-no for direction U in state State-B
  5674. In State-B moving U
  5675. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5676. predict error 0
  5677. dir: dir isU
  5678. \-/795: O: O1590 (predict-no)
  5679. I see 1 and I'm going to do: predict-no
  5680. ENV: Agent did: predict-no for direction U in state State-B
  5681. In State-B moving U
  5682. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5683. predict error 0
  5684. dir: dir isU
  5685. |\-796: O: O1592 (predict-no)
  5686. I see 1 and I'm going to do: predict-no
  5687. ENV: Agent did: predict-no for direction U in state State-B
  5688. In State-B moving U
  5689. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5690. predict error 0
  5691. dir: dir isU
  5692. /|\797: O: O1594 (predict-no)
  5693. I see 1 and I'm going to do: predict-no
  5694. ENV: Agent did: predict-no for direction U in state State-B
  5695. In State-B moving U
  5696. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5697. predict error 0
  5698. dir: dir isU
  5699. -798: O: O1596 (predict-no)
  5700. I see 1 and I'm going to do: predict-no
  5701. ENV: Agent did: predict-no for direction U in state State-B
  5702. In State-B moving U
  5703. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5704. predict error 0
  5705. dir: dir isU
  5706. /|\799: O: O1598 (predict-no)
  5707. I see 1 and I'm going to do: predict-no
  5708. ENV: Agent did: predict-no for direction U in state State-B
  5709. In State-B moving U
  5710. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5711. predict error 0
  5712. dir: dir isU
  5713. -/|800: O: O1600 (predict-no)
  5714. I see 1 and I'm going to do: predict-no
  5715. ENV: Agent did: predict-no for direction U in state State-B
  5716. In State-B moving U
  5717. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5718. predict error 0
  5719. dir: dir isL
  5720. \-/801: O: O1601 (predict-yes)
  5721. I see 1 and I'm going to do: predict-yes
  5722. ENV: Agent did: predict-yes for direction L in state State-B
  5723. In State-B moving L
  5724. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5725. predict error 0
  5726. dir: dir isR
  5727. |802: O: O1603 (predict-yes)
  5728. I see 1 and I'm going to do: predict-yes
  5729. ENV: Agent did: predict-yes for direction R in state State-A
  5730. In State-A moving R
  5731. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5732. predict error 0
  5733. dir: dir isR
  5734. \-/803: O: O1606 (predict-no)
  5735. I see 1 and I'm going to do: predict-no
  5736. ENV: Agent did: predict-no for direction R in state State-B
  5737. In State-B moving R
  5738. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5739. predict error 0
  5740. dir: dir isU
  5741. |\-804: O: O1608 (predict-no)
  5742. I see 1 and I'm going to do: predict-no
  5743. ENV: Agent did: predict-no for direction U in state State-B
  5744. In State-B moving U
  5745. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5746. predict error 0
  5747. dir: dir isU
  5748. /|805: O: O1610 (predict-no)
  5749. I see 1 and I'm going to do: predict-no
  5750. ENV: Agent did: predict-no for direction U in state State-B
  5751. In State-B moving U
  5752. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5753. predict error 0
  5754. dir: dir isU
  5755. \-/806: O: O1612 (predict-no)
  5756. I see 1 and I'm going to do: predict-no
  5757. ENV: Agent did: predict-no for direction U in state State-B
  5758. In State-B moving U
  5759. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5760. predict error 0
  5761. dir: dir isU
  5762. |\-807: O: O1614 (predict-no)
  5763. I see 1 and I'm going to do: predict-no
  5764. ENV: Agent did: predict-no for direction U in state State-B
  5765. In State-B moving U
  5766. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5767. predict error 0
  5768. dir: dir isR
  5769. /|\808: O: O1616 (predict-no)
  5770. I see 1 and I'm going to do: predict-no
  5771. ENV: Agent did: predict-no for direction R in state State-B
  5772. In State-B moving R
  5773. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5774. predict error 0
  5775. dir: dir isU
  5776. -/|809: O: O1618 (predict-no)
  5777. I see 1 and I'm going to do: predict-no
  5778. ENV: Agent did: predict-no for direction U in state State-B
  5779. In State-B moving U
  5780. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5781. predict error 0
  5782. dir: dir isR
  5783. \-/810: O: O1620 (predict-no)
  5784. I see 1 and I'm going to do: predict-no
  5785. ENV: Agent did: predict-no for direction R in state State-B
  5786. In State-B moving R
  5787. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5788. predict error 0
  5789. dir: dir isR
  5790. |\-811: O: O1622 (predict-no)
  5791. I see 1 and I'm going to do: predict-no
  5792. ENV: Agent did: predict-no for direction R in state State-B
  5793. In State-B moving R
  5794. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5795. predict error 0
  5796. dir: dir isR
  5797. /812: O: O1624 (predict-no)
  5798. I see 1 and I'm going to do: predict-no
  5799. ENV: Agent did: predict-no for direction R in state State-B
  5800. In State-B moving R
  5801. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5802. predict error 0
  5803. dir: dir isU
  5804. |\-813: O: O1626 (predict-no)
  5805. I see 1 and I'm going to do: predict-no
  5806. ENV: Agent did: predict-no for direction U in state State-B
  5807. In State-B moving U
  5808. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5809. predict error 0
  5810. dir: dir isR
  5811. /|\814: O: O1628 (predict-no)
  5812. I see 1 and I'm going to do: predict-no
  5813. ENV: Agent did: predict-no for direction R in state State-B
  5814. In State-B moving R
  5815. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5816. predict error 0
  5817. dir: dir isL
  5818. -/|815: O: O1629 (predict-yes)
  5819. I see 1 and I'm going to do: predict-yes
  5820. ENV: Agent did: predict-yes for direction L in state State-B
  5821. In State-B moving L
  5822. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5823. predict error 0
  5824. dir: dir isL
  5825. \-/816: O: O1632 (predict-no)
  5826. I see 1 and I'm going to do: predict-no
  5827. ENV: Agent did: predict-no for direction L in state State-A
  5828. In State-A moving L
  5829. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5830. predict error 0
  5831. dir: dir isU
  5832. |\817: O: O1634 (predict-no)
  5833. I see 1 and I'm going to do: predict-no
  5834. ENV: Agent did: predict-no for direction U in state State-A
  5835. In State-A moving U
  5836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5837. predict error 0
  5838. dir: dir isR
  5839. -/|818: O: O1635 (predict-yes)
  5840. I see 1 and I'm going to do: predict-yes
  5841. ENV: Agent did: predict-yes for direction R in state State-A
  5842. In State-A moving R
  5843. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5844. predict error 0
  5845. dir: dir isU
  5846. \-/819: O: O1638 (predict-no)
  5847. I see 1 and I'm going to do: predict-no
  5848. ENV: Agent did: predict-no for direction U in state State-B
  5849. In State-B moving U
  5850. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5851. predict error 0
  5852. dir: dir isL
  5853. |\-820: O: O1639 (predict-yes)
  5854. I see 1 and I'm going to do: predict-yes
  5855. ENV: Agent did: predict-yes for direction L in state State-B
  5856. In State-B moving L
  5857. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5858. predict error 0
  5859. dir: dir isR
  5860. /|\821: O: O1641 (predict-yes)
  5861. I see 1 and I'm going to do: predict-yes
  5862. ENV: Agent did: predict-yes for direction R in state State-A
  5863. In State-A moving R
  5864. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5865. predict error 0
  5866. dir: dir isU
  5867. -822: O: O1644 (predict-no)
  5868. I see 1 and I'm going to do: predict-no
  5869. ENV: Agent did: predict-no for direction U in state State-B
  5870. In State-B moving U
  5871. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5872. predict error 0
  5873. dir: dir isL
  5874. /|\823: O: O1645 (predict-yes)
  5875. I see 1 and I'm going to do: predict-yes
  5876. ENV: Agent did: predict-yes for direction L in state State-B
  5877. In State-B moving L
  5878. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5879. predict error 0
  5880. dir: dir isL
  5881. -824: O: O1648 (predict-no)
  5882. I see 1 and I'm going to do: predict-no
  5883. ENV: Agent did: predict-no for direction L in state State-A
  5884. In State-A moving L
  5885. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5886. predict error 0
  5887. dir: dir isR
  5888. /|\825: O: O1649 (predict-yes)
  5889. I see 1 and I'm going to do: predict-yes
  5890. ENV: Agent did: predict-yes for direction R in state State-A
  5891. In State-A moving R
  5892. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5893. predict error 0
  5894. dir: dir isL
  5895. -/|826: O: O1651 (predict-yes)
  5896. I see 1 and I'm going to do: predict-yes
  5897. ENV: Agent did: predict-yes for direction L in state State-B
  5898. In State-B moving L
  5899. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5900. predict error 0
  5901. dir: dir isL
  5902. \-/827: O: O1654 (predict-no)
  5903. I see 1 and I'm going to do: predict-no
  5904. ENV: Agent did: predict-no for direction L in state State-A
  5905. In State-A moving L
  5906. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5907. predict error 0
  5908. dir: dir isL
  5909. |\-828: O: O1656 (predict-no)
  5910. I see 1 and I'm going to do: predict-no
  5911. ENV: Agent did: predict-no for direction L in state State-A
  5912. In State-A moving L
  5913. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5914. predict error 0
  5915. dir: dir isR
  5916. /|\829: O: O1657 (predict-yes)
  5917. I see 1 and I'm going to do: predict-yes
  5918. ENV: Agent did: predict-yes for direction R in state State-A
  5919. In State-A moving R
  5920. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5921. predict error 0
  5922. dir: dir isR
  5923. -/|830: O: O1660 (predict-no)
  5924. I see 1 and I'm going to do: predict-no
  5925. ENV: Agent did: predict-no for direction R in state State-B
  5926. In State-B moving R
  5927. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5928. predict error 0
  5929. dir: dir isL
  5930. \-/831: O: O1661 (predict-yes)
  5931. I see 1 and I'm going to do: predict-yes
  5932. ENV: Agent did: predict-yes for direction L in state State-B
  5933. In State-B moving L
  5934. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5935. predict error 0
  5936. dir: dir isL
  5937. |832: O: O1664 (predict-no)
  5938. I see 1 and I'm going to do: predict-no
  5939. ENV: Agent did: predict-no for direction L in state State-A
  5940. In State-A moving L
  5941. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5942. predict error 0
  5943. dir: dir isU
  5944. \-/833: O: O1666 (predict-no)
  5945. I see 1 and I'm going to do: predict-no
  5946. ENV: Agent did: predict-no for direction U in state State-A
  5947. In State-A moving U
  5948. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5949. predict error 0
  5950. dir: dir isR
  5951. |\-834: O: O1667 (predict-yes)
  5952. I see 1 and I'm going to do: predict-yes
  5953. ENV: Agent did: predict-yes for direction R in state State-A
  5954. In State-A moving R
  5955. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5956. predict error 0
  5957. dir: dir isL
  5958. /|\835: O: O1669 (predict-yes)
  5959. I see 1 and I'm going to do: predict-yes
  5960. ENV: Agent did: predict-yes for direction L in state State-B
  5961. In State-B moving L
  5962. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5963. predict error 0
  5964. dir: dir isU
  5965. -/836: O: O1672 (predict-no)
  5966. I see 1 and I'm going to do: predict-no
  5967. ENV: Agent did: predict-no for direction U in state State-A
  5968. In State-A moving U
  5969. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5970. predict error 0
  5971. dir: dir isL
  5972. |\-837: O: O1674 (predict-no)
  5973. I see 1 and I'm going to do: predict-no
  5974. ENV: Agent did: predict-no for direction L in state State-A
  5975. In State-A moving L
  5976. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5977. predict error 0
  5978. dir: dir isR
  5979. /|\838: O: O1675 (predict-yes)
  5980. I see 1 and I'm going to do: predict-yes
  5981. ENV: Agent did: predict-yes for direction R in state State-A
  5982. In State-A moving R
  5983. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5984. predict error 0
  5985. dir: dir isU
  5986. -/|839: O: O1678 (predict-no)
  5987. I see 1 and I'm going to do: predict-no
  5988. ENV: Agent did: predict-no for direction U in state State-B
  5989. In State-B moving U
  5990. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5991. predict error 0
  5992. dir: dir isU
  5993. \-/840: O: O1680 (predict-no)
  5994. I see 1 and I'm going to do: predict-no
  5995. ENV: Agent did: predict-no for direction U in state State-B
  5996. In State-B moving U
  5997. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5998. predict error 0
  5999. dir: dir isL
  6000. |\-841: O: O1681 (predict-yes)
  6001. I see 1 and I'm going to do: predict-yes
  6002. ENV: Agent did: predict-yes for direction L in state State-B
  6003. In State-B moving L
  6004. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6005. predict error 0
  6006. dir: dir isU
  6007. /842: O: O1684 (predict-no)
  6008. I see 1 and I'm going to do: predict-no
  6009. ENV: Agent did: predict-no for direction U in state State-A
  6010. In State-A moving U
  6011. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6012. predict error 0
  6013. dir: dir isR
  6014. |\843: O: O1685 (predict-yes)
  6015. I see 1 and I'm going to do: predict-yes
  6016. ENV: Agent did: predict-yes for direction R in state State-A
  6017. In State-A moving R
  6018. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6019. predict error 0
  6020. dir: dir isU
  6021. -/844: O: O1688 (predict-no)
  6022. I see 1 and I'm going to do: predict-no
  6023. ENV: Agent did: predict-no for direction U in state State-B
  6024. In State-B moving U
  6025. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6026. predict error 0
  6027. dir: dir isU
  6028. |\-845: O: O1690 (predict-no)
  6029. I see 1 and I'm going to do: predict-no
  6030. ENV: Agent did: predict-no for direction U in state State-B
  6031. In State-B moving U
  6032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6033. predict error 0
  6034. dir: dir isR
  6035. /|\846: O: O1692 (predict-no)
  6036. I see 1 and I'm going to do: predict-no
  6037. ENV: Agent did: predict-no for direction R in state State-B
  6038. In State-B moving R
  6039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6040. predict error 0
  6041. dir: dir isU
  6042. -/|847: O: O1694 (predict-no)
  6043. I see 1 and I'm going to do: predict-no
  6044. ENV: Agent did: predict-no for direction U in state State-B
  6045. In State-B moving U
  6046. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6047. predict error 0
  6048. dir: dir isR
  6049. \-/848: O: O1696 (predict-no)
  6050. I see 1 and I'm going to do: predict-no
  6051. ENV: Agent did: predict-no for direction R in state State-B
  6052. In State-B moving R
  6053. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6054. predict error 0
  6055. dir: dir isU
  6056. |849: O: O1698 (predict-no)
  6057. I see 1 and I'm going to do: predict-no
  6058. ENV: Agent did: predict-no for direction U in state State-B
  6059. In State-B moving U
  6060. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6061. predict error 0
  6062. dir: dir isU
  6063. \-/850: O: O1700 (predict-no)
  6064. I see 1 and I'm going to do: predict-no
  6065. ENV: Agent did: predict-no for direction U in state State-B
  6066. In State-B moving U
  6067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6068. predict error 0
  6069. dir: dir isU
  6070. |\-851: O: O1702 (predict-no)
  6071. I see 1 and I'm going to do: predict-no
  6072. ENV: Agent did: predict-no for direction U in state State-B
  6073. In State-B moving U
  6074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6075. predict error 0
  6076. dir: dir isU
  6077. /852: O: O1704 (predict-no)
  6078. I see 1 and I'm going to do: predict-no
  6079. ENV: Agent did: predict-no for direction U in state State-B
  6080. In State-B moving U
  6081. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6082. predict error 0
  6083. dir: dir isU
  6084. |\-853: O: O1706 (predict-no)
  6085. I see 1 and I'm going to do: predict-no
  6086. ENV: Agent did: predict-no for direction U in state State-B
  6087. In State-B moving U
  6088. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6089. predict error 0
  6090. dir: dir isL
  6091. /|\854: O: O1707 (predict-yes)
  6092. I see 1 and I'm going to do: predict-yes
  6093. ENV: Agent did: predict-yes for direction L in state State-B
  6094. In State-B moving L
  6095. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6096. predict error 0
  6097. dir: dir isL
  6098. -/|855: O: O1710 (predict-no)
  6099. I see 1 and I'm going to do: predict-no
  6100. ENV: Agent did: predict-no for direction L in state State-A
  6101. In State-A moving L
  6102. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6103. predict error 0
  6104. dir: dir isU
  6105. \-856: O: O1712 (predict-no)
  6106. I see 1 and I'm going to do: predict-no
  6107. ENV: Agent did: predict-no for direction U in state State-A
  6108. In State-A moving U
  6109. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6110. predict error 0
  6111. dir: dir isU
  6112. /|\857: O: O1714 (predict-no)
  6113. I see 1 and I'm going to do: predict-no
  6114. ENV: Agent did: predict-no for direction U in state State-A
  6115. In State-A moving U
  6116. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6117. predict error 0
  6118. dir: dir isR
  6119. -/|858: O: O1715 (predict-yes)
  6120. I see 1 and I'm going to do: predict-yes
  6121. ENV: Agent did: predict-yes for direction R in state State-A
  6122. In State-A moving R
  6123. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6124. predict error 0
  6125. dir: dir isR
  6126. \-/859: O: O1718 (predict-no)
  6127. I see 1 and I'm going to do: predict-no
  6128. ENV: Agent did: predict-no for direction R in state State-B
  6129. In State-B moving R
  6130. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6131. predict error 0
  6132. dir: dir isR
  6133. |860: O: O1720 (predict-no)
  6134. I see 1 and I'm going to do: predict-no
  6135. ENV: Agent did: predict-no for direction R in state State-B
  6136. In State-B moving R
  6137. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6138. predict error 0
  6139. dir: dir isU
  6140. \-861: O: O1722 (predict-no)
  6141. I see 1 and I'm going to do: predict-no
  6142. ENV: Agent did: predict-no for direction U in state State-B
  6143. In State-B moving U
  6144. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6145. predict error 0
  6146. dir: dir isU
  6147. /862: O: O1724 (predict-no)
  6148. I see 1 and I'm going to do: predict-no
  6149. ENV: Agent did: predict-no for direction U in state State-B
  6150. In State-B moving U
  6151. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6152. predict error 0
  6153. dir: dir isR
  6154. |\-/863: O: O1726 (predict-no)
  6155. I see 1 and I'm going to do: predict-no
  6156. ENV: Agent did: predict-no for direction R in state State-B
  6157. In State-B moving R
  6158. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6159. predict error 0
  6160. dir: dir isL
  6161. |\-864: O: O1727 (predict-yes)
  6162. I see 1 and I'm going to do: predict-yes
  6163. ENV: Agent did: predict-yes for direction L in state State-B
  6164. In State-B moving L
  6165. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6166. predict error 0
  6167. dir: dir isU
  6168. /865: O: O1730 (predict-no)
  6169. I see 1 and I'm going to do: predict-no
  6170. ENV: Agent did: predict-no for direction U in state State-A
  6171. In State-A moving U
  6172. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6173. predict error 0
  6174. dir: dir isR
  6175. |\-866: O: O1731 (predict-yes)
  6176. I see 1 and I'm going to do: predict-yes
  6177. ENV: Agent did: predict-yes for direction R in state State-A
  6178. In State-A moving R
  6179. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6180. predict error 0
  6181. dir: dir isL
  6182. /|\867: O: O1733 (predict-yes)
  6183. I see 1 and I'm going to do: predict-yes
  6184. ENV: Agent did: predict-yes for direction L in state State-B
  6185. In State-B moving L
  6186. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6187. predict error 0
  6188. dir: dir isL
  6189. -/|868: O: O1736 (predict-no)
  6190. I see 1 and I'm going to do: predict-no
  6191. ENV: Agent did: predict-no for direction L in state State-A
  6192. In State-A moving L
  6193. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6194. predict error 0
  6195. dir: dir isU
  6196. \-/869: O: O1738 (predict-no)
  6197. I see 1 and I'm going to do: predict-no
  6198. ENV: Agent did: predict-no for direction U in state State-A
  6199. In State-A moving U
  6200. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6201. predict error 0
  6202. dir: dir isL
  6203. |\-870: O: O1740 (predict-no)
  6204. I see 1 and I'm going to do: predict-no
  6205. ENV: Agent did: predict-no for direction L in state State-A
  6206. In State-A moving L
  6207. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6208. predict error 0
  6209. dir: dir isL
  6210. /|\-871: O: O1742 (predict-no)
  6211. I see 1 and I'm going to do: predict-no
  6212. ENV: Agent did: predict-no for direction L in state State-A
  6213. In State-A moving L
  6214. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6215. predict error 0
  6216. dir: dir isL
  6217. /872: O: O1744 (predict-no)
  6218. I see 1 and I'm going to do: predict-no
  6219. ENV: Agent did: predict-no for direction L in state State-A
  6220. In State-A moving L
  6221. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6222. predict error 0
  6223. dir: dir isU
  6224. |\-873: O: O1746 (predict-no)
  6225. I see 1 and I'm going to do: predict-no
  6226. ENV: Agent did: predict-no for direction U in state State-A
  6227. In State-A moving U
  6228. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6229. predict error 0
  6230. dir: dir isU
  6231. /|\874: O: O1748 (predict-no)
  6232. I see 1 and I'm going to do: predict-no
  6233. ENV: Agent did: predict-no for direction U in state State-A
  6234. In State-A moving U
  6235. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6236. predict error 0
  6237. dir: dir isU
  6238. -/875: O: O1750 (predict-no)
  6239. I see 1 and I'm going to do: predict-no
  6240. ENV: Agent did: predict-no for direction U in state State-A
  6241. In State-A moving U
  6242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6243. predict error 0
  6244. dir: dir isR
  6245. |\876: O: O1751 (predict-yes)
  6246. I see 1 and I'm going to do: predict-yes
  6247. ENV: Agent did: predict-yes for direction R in state State-A
  6248. In State-A moving R
  6249. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6250. predict error 0
  6251. dir: dir isR
  6252. -/|877: O: O1754 (predict-no)
  6253. I see 1 and I'm going to do: predict-no
  6254. ENV: Agent did: predict-no for direction R in state State-B
  6255. In State-B moving R
  6256. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6257. predict error 0
  6258. dir: dir isR
  6259. \878: O: O1756 (predict-no)
  6260. I see 1 and I'm going to do: predict-no
  6261. ENV: Agent did: predict-no for direction R in state State-B
  6262. In State-B moving R
  6263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6264. predict error 0
  6265. dir: dir isR
  6266. -/|879: O: O1758 (predict-no)
  6267. I see 1 and I'm going to do: predict-no
  6268. ENV: Agent did: predict-no for direction R in state State-B
  6269. In State-B moving R
  6270. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6271. predict error 0
  6272. dir: dir isR
  6273. \-/880: O: O1760 (predict-no)
  6274. I see 1 and I'm going to do: predict-no
  6275. ENV: Agent did: predict-no for direction R in state State-B
  6276. In State-B moving R
  6277. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6278. predict error 0
  6279. dir: dir isU
  6280. |\-881: O: O1762 (predict-no)
  6281. I see 1 and I'm going to do: predict-no
  6282. ENV: Agent did: predict-no for direction U in state State-B
  6283. In State-B moving U
  6284. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6285. predict error 0
  6286. dir: dir isU
  6287. /882: O: O1764 (predict-no)
  6288. I see 1 and I'm going to do: predict-no
  6289. ENV: Agent did: predict-no for direction U in state State-B
  6290. In State-B moving U
  6291. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6292. predict error 0
  6293. dir: dir isR
  6294. |\-883: O: O1766 (predict-no)
  6295. I see 1 and I'm going to do: predict-no
  6296. ENV: Agent did: predict-no for direction R in state State-B
  6297. In State-B moving R
  6298. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6299. predict error 0
  6300. dir: dir isR
  6301. /|\884: O: O1768 (predict-no)
  6302. I see 1 and I'm going to do: predict-no
  6303. ENV: Agent did: predict-no for direction R in state State-B
  6304. In State-B moving R
  6305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6306. predict error 0
  6307. dir: dir isL
  6308. -/|885: O: O1769 (predict-yes)
  6309. I see 1 and I'm going to do: predict-yes
  6310. ENV: Agent did: predict-yes for direction L in state State-B
  6311. In State-B moving L
  6312. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6313. predict error 0
  6314. dir: dir isL
  6315. \-/886: O: O1772 (predict-no)
  6316. I see 1 and I'm going to do: predict-no
  6317. ENV: Agent did: predict-no for direction L in state State-A
  6318. In State-A moving L
  6319. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6320. predict error 0
  6321. dir: dir isR
  6322. |\887: O: O1773 (predict-yes)
  6323. I see 1 and I'm going to do: predict-yes
  6324. ENV: Agent did: predict-yes for direction R in state State-A
  6325. In State-A moving R
  6326. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6327. predict error 0
  6328. dir: dir isR
  6329. -/|888: O: O1776 (predict-no)
  6330. I see 1 and I'm going to do: predict-no
  6331. ENV: Agent did: predict-no for direction R in state State-B
  6332. In State-B moving R
  6333. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6334. predict error 0
  6335. dir: dir isR
  6336. \-/889: O: O1778 (predict-no)
  6337. I see 1 and I'm going to do: predict-no
  6338. ENV: Agent did: predict-no for direction R in state State-B
  6339. In State-B moving R
  6340. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6341. predict error 0
  6342. dir: dir isU
  6343. |\-890: O: O1780 (predict-no)
  6344. I see 1 and I'm going to do: predict-no
  6345. ENV: Agent did: predict-no for direction U in state State-B
  6346. In State-B moving U
  6347. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6348. predict error 0
  6349. dir: dir isL
  6350. /|891: O: O1781 (predict-yes)
  6351. I see 1 and I'm going to do: predict-yes
  6352. ENV: Agent did: predict-yes for direction L in state State-B
  6353. In State-B moving L
  6354. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6355. predict error 0
  6356. dir: dir isR
  6357. \892: O: O1783 (predict-yes)
  6358. I see 1 and I'm going to do: predict-yes
  6359. ENV: Agent did: predict-yes for direction R in state State-A
  6360. In State-A moving R
  6361. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6362. predict error 0
  6363. dir: dir isU
  6364. -/|893: O: O1786 (predict-no)
  6365. I see 1 and I'm going to do: predict-no
  6366. ENV: Agent did: predict-no for direction U in state State-B
  6367. In State-B moving U
  6368. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6369. predict error 0
  6370. dir: dir isU
  6371. \894: O: O1788 (predict-no)
  6372. I see 1 and I'm going to do: predict-no
  6373. ENV: Agent did: predict-no for direction U in state State-B
  6374. In State-B moving U
  6375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6376. predict error 0
  6377. dir: dir isR
  6378. -/|895: O: O1790 (predict-no)
  6379. I see 1 and I'm going to do: predict-no
  6380. ENV: Agent did: predict-no for direction R in state State-B
  6381. In State-B moving R
  6382. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6383. predict error 0
  6384. dir: dir isR
  6385. \-/896: O: O1792 (predict-no)
  6386. I see 1 and I'm going to do: predict-no
  6387. ENV: Agent did: predict-no for direction R in state State-B
  6388. In State-B moving R
  6389. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6390. predict error 0
  6391. dir: dir isR
  6392. |\-897: O: O1794 (predict-no)
  6393. I see 1 and I'm going to do: predict-no
  6394. ENV: Agent did: predict-no for direction R in state State-B
  6395. In State-B moving R
  6396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6397. predict error 0
  6398. dir: dir isU
  6399. /|\898: O: O1796 (predict-no)
  6400. I see 1 and I'm going to do: predict-no
  6401. ENV: Agent did: predict-no for direction U in state State-B
  6402. In State-B moving U
  6403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6404. predict error 0
  6405. dir: dir isU
  6406. -/|899: O: O1798 (predict-no)
  6407. I see 1 and I'm going to do: predict-no
  6408. ENV: Agent did: predict-no for direction U in state State-B
  6409. In State-B moving U
  6410. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6411. predict error 0
  6412. dir: dir isU
  6413. \-/900: O: O1800 (predict-no)
  6414. I see 1 and I'm going to do: predict-no
  6415. ENV: Agent did: predict-no for direction U in state State-B
  6416. In State-B moving U
  6417. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6418. predict error 0
  6419. dir: dir isU
  6420. |\-901: O: O1802 (predict-no)
  6421. I see 1 and I'm going to do: predict-no
  6422. ENV: Agent did: predict-no for direction U in state State-B
  6423. In State-B moving U
  6424. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6425. predict error 0
  6426. dir: dir isU
  6427. /902: O: O1804 (predict-no)
  6428. I see 1 and I'm going to do: predict-no
  6429. ENV: Agent did: predict-no for direction U in state State-B
  6430. In State-B moving U
  6431. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6432. predict error 0
  6433. dir: dir isU
  6434. |\903: O: O1806 (predict-no)
  6435. I see 1 and I'm going to do: predict-no
  6436. ENV: Agent did: predict-no for direction U in state State-B
  6437. In State-B moving U
  6438. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6439. predict error 0
  6440. dir: dir isR
  6441. -/904: O: O1808 (predict-no)
  6442. I see 1 and I'm going to do: predict-no
  6443. ENV: Agent did: predict-no for direction R in state State-B
  6444. In State-B moving R
  6445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6446. predict error 0
  6447. dir: dir isR
  6448. |\-905: O: O1810 (predict-no)
  6449. I see 1 and I'm going to do: predict-no
  6450. ENV: Agent did: predict-no for direction R in state State-B
  6451. In State-B moving R
  6452. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6453. predict error 0
  6454. dir: dir isU
  6455. /|\906: O: O1812 (predict-no)
  6456. I see 1 and I'm going to do: predict-no
  6457. ENV: Agent did: predict-no for direction U in state State-B
  6458. In State-B moving U
  6459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6460. predict error 0
  6461. dir: dir isR
  6462. -/|907: O: O1814 (predict-no)
  6463. I see 1 and I'm going to do: predict-no
  6464. ENV: Agent did: predict-no for direction R in state State-B
  6465. In State-B moving R
  6466. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6467. predict error 0
  6468. dir: dir isU
  6469. \-/908: O: O1816 (predict-no)
  6470. I see 1 and I'm going to do: predict-no
  6471. ENV: Agent did: predict-no for direction U in state State-B
  6472. In State-B moving U
  6473. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6474. predict error 0
  6475. dir: dir isR
  6476. |\909: O: O1818 (predict-no)
  6477. I see 1 and I'm going to do: predict-no
  6478. ENV: Agent did: predict-no for direction R in state State-B
  6479. In State-B moving R
  6480. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6481. predict error 0
  6482. dir: dir isR
  6483. -/|910: O: O1820 (predict-no)
  6484. I see 1 and I'm going to do: predict-no
  6485. ENV: Agent did: predict-no for direction R in state State-B
  6486. In State-B moving R
  6487. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6488. predict error 0
  6489. dir: dir isR
  6490. \-/911: O: O1822 (predict-no)
  6491. I see 1 and I'm going to do: predict-no
  6492. ENV: Agent did: predict-no for direction R in state State-B
  6493. In State-B moving R
  6494. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6495. predict error 0
  6496. dir: dir isL
  6497. |912: O: O1823 (predict-yes)
  6498. I see 1 and I'm going to do: predict-yes
  6499. ENV: Agent did: predict-yes for direction L in state State-B
  6500. In State-B moving L
  6501. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6502. predict error 0
  6503. dir: dir isR
  6504. \913: O: O1825 (predict-yes)
  6505. I see 1 and I'm going to do: predict-yes
  6506. ENV: Agent did: predict-yes for direction R in state State-A
  6507. In State-A moving R
  6508. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6509. predict error 0
  6510. dir: dir isR
  6511. -/|914: O: O1828 (predict-no)
  6512. I see 1 and I'm going to do: predict-no
  6513. ENV: Agent did: predict-no for direction R in state State-B
  6514. In State-B moving R
  6515. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6516. predict error 0
  6517. dir: dir isL
  6518. \-/915: O: O1829 (predict-yes)
  6519. I see 1 and I'm going to do: predict-yes
  6520. ENV: Agent did: predict-yes for direction L in state State-B
  6521. In State-B moving L
  6522. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6523. predict error 0
  6524. dir: dir isL
  6525. |\-916: O: O1832 (predict-no)
  6526. I see 1 and I'm going to do: predict-no
  6527. ENV: Agent did: predict-no for direction L in state State-A
  6528. In State-A moving L
  6529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6530. predict error 0
  6531. dir: dir isL
  6532. /|\917: O: O1834 (predict-no)
  6533. I see 1 and I'm going to do: predict-no
  6534. ENV: Agent did: predict-no for direction L in state State-A
  6535. In State-A moving L
  6536. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6537. predict error 0
  6538. dir: dir isU
  6539. -/918: O: O1836 (predict-no)
  6540. I see 1 and I'm going to do: predict-no
  6541. ENV: Agent did: predict-no for direction U in state State-A
  6542. In State-A moving U
  6543. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6544. predict error 0
  6545. dir: dir isR
  6546. |\-919: O: O1837 (predict-yes)
  6547. I see 1 and I'm going to do: predict-yes
  6548. ENV: Agent did: predict-yes for direction R in state State-A
  6549. In State-A moving R
  6550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6551. predict error 0
  6552. dir: dir isL
  6553. /|\920: O: O1839 (predict-yes)
  6554. I see 1 and I'm going to do: predict-yes
  6555. ENV: Agent did: predict-yes for direction L in state State-B
  6556. In State-B moving L
  6557. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6558. predict error 0
  6559. dir: dir isU
  6560. -/|921: O: O1842 (predict-no)
  6561. I see 1 and I'm going to do: predict-no
  6562. ENV: Agent did: predict-no for direction U in state State-A
  6563. In State-A moving U
  6564. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6565. predict error 0
  6566. dir: dir isL
  6567. \922: O: O1844 (predict-no)
  6568. I see 1 and I'm going to do: predict-no
  6569. ENV: Agent did: predict-no for direction L in state State-A
  6570. In State-A moving L
  6571. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6572. predict error 0
  6573. dir: dir isR
  6574. -/923: O: O1845 (predict-yes)
  6575. I see 1 and I'm going to do: predict-yes
  6576. ENV: Agent did: predict-yes for direction R in state State-A
  6577. In State-A moving R
  6578. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6579. predict error 0
  6580. dir: dir isU
  6581. |\-924: O: O1848 (predict-no)
  6582. I see 1 and I'm going to do: predict-no
  6583. ENV: Agent did: predict-no for direction U in state State-B
  6584. In State-B moving U
  6585. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6586. predict error 0
  6587. dir: dir isU
  6588. /|\925: O: O1850 (predict-no)
  6589. I see 1 and I'm going to do: predict-no
  6590. ENV: Agent did: predict-no for direction U in state State-B
  6591. In State-B moving U
  6592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6593. predict error 0
  6594. dir: dir isR
  6595. -/|926: O: O1852 (predict-no)
  6596. I see 1 and I'm going to do: predict-no
  6597. ENV: Agent did: predict-no for direction R in state State-B
  6598. In State-B moving R
  6599. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6600. predict error 0
  6601. dir: dir isU
  6602. \-/927: O: O1854 (predict-no)
  6603. I see 1 and I'm going to do: predict-no
  6604. ENV: Agent did: predict-no for direction U in state State-B
  6605. In State-B moving U
  6606. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6607. predict error 0
  6608. dir: dir isR
  6609. |\-928: O: O1856 (predict-no)
  6610. I see 1 and I'm going to do: predict-no
  6611. ENV: Agent did: predict-no for direction R in state State-B
  6612. In State-B moving R
  6613. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6614. predict error 0
  6615. dir: dir isU
  6616. /|929: O: O1858 (predict-no)
  6617. I see 1 and I'm going to do: predict-no
  6618. ENV: Agent did: predict-no for direction U in state State-B
  6619. In State-B moving U
  6620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6621. predict error 0
  6622. dir: dir isR
  6623. \-/930: O: O1860 (predict-no)
  6624. I see 1 and I'm going to do: predict-no
  6625. ENV: Agent did: predict-no for direction R in state State-B
  6626. In State-B moving R
  6627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6628. predict error 0
  6629. dir: dir isU
  6630. |\931: O: O1862 (predict-no)
  6631. I see 1 and I'm going to do: predict-no
  6632. ENV: Agent did: predict-no for direction U in state State-B
  6633. In State-B moving U
  6634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6635. predict error 0
  6636. dir: dir isU
  6637. -932: O: O1864 (predict-no)
  6638. I see 1 and I'm going to do: predict-no
  6639. ENV: Agent did: predict-no for direction U in state State-B
  6640. In State-B moving U
  6641. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6642. predict error 0
  6643. dir: dir isL
  6644. /|\933: O: O1865 (predict-yes)
  6645. I see 1 and I'm going to do: predict-yes
  6646. ENV: Agent did: predict-yes for direction L in state State-B
  6647. In State-B moving L
  6648. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6649. predict error 0
  6650. dir: dir isL
  6651. -/|934: O: O1868 (predict-no)
  6652. I see 1 and I'm going to do: predict-no
  6653. ENV: Agent did: predict-no for direction L in state State-A
  6654. In State-A moving L
  6655. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6656. predict error 0
  6657. dir: dir isU
  6658. \-/935: O: O1870 (predict-no)
  6659. I see 1 and I'm going to do: predict-no
  6660. ENV: Agent did: predict-no for direction U in state State-A
  6661. In State-A moving U
  6662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6663. predict error 0
  6664. dir: dir isL
  6665. |\936: O: O1872 (predict-no)
  6666. I see 1 and I'm going to do: predict-no
  6667. ENV: Agent did: predict-no for direction L in state State-A
  6668. In State-A moving L
  6669. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6670. predict error 0
  6671. dir: dir isL
  6672. -/|937: O: O1874 (predict-no)
  6673. I see 1 and I'm going to do: predict-no
  6674. ENV: Agent did: predict-no for direction L in state State-A
  6675. In State-A moving L
  6676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6677. predict error 0
  6678. dir: dir isL
  6679. \-/938: O: O1876 (predict-no)
  6680. I see 1 and I'm going to do: predict-no
  6681. ENV: Agent did: predict-no for direction L in state State-A
  6682. In State-A moving L
  6683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6684. predict error 0
  6685. dir: dir isR
  6686. |939: O: O1877 (predict-yes)
  6687. I see 1 and I'm going to do: predict-yes
  6688. ENV: Agent did: predict-yes for direction R in state State-A
  6689. In State-A moving R
  6690. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6691. predict error 0
  6692. dir: dir isU
  6693. \-/940: O: O1880 (predict-no)
  6694. I see 1 and I'm going to do: predict-no
  6695. ENV: Agent did: predict-no for direction U in state State-B
  6696. In State-B moving U
  6697. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6698. predict error 0
  6699. dir: dir isR
  6700. |941: O: O1882 (predict-no)
  6701. I see 1 and I'm going to do: predict-no
  6702. ENV: Agent did: predict-no for direction R in state State-B
  6703. In State-B moving R
  6704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6705. predict error 0
  6706. dir: dir isU
  6707. \942: O: O1884 (predict-no)
  6708. I see 1 and I'm going to do: predict-no
  6709. ENV: Agent did: predict-no for direction U in state State-B
  6710. In State-B moving U
  6711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6712. predict error 0
  6713. dir: dir isU
  6714. -/|943: O: O1886 (predict-no)
  6715. I see 1 and I'm going to do: predict-no
  6716. ENV: Agent did: predict-no for direction U in state State-B
  6717. In State-B moving U
  6718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6719. predict error 0
  6720. dir: dir isL
  6721. \944: O: O1887 (predict-yes)
  6722. I see 1 and I'm going to do: predict-yes
  6723. ENV: Agent did: predict-yes for direction L in state State-B
  6724. In State-B moving L
  6725. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6726. predict error 0
  6727. dir: dir isR
  6728. -/|945: O: O1889 (predict-yes)
  6729. I see 1 and I'm going to do: predict-yes
  6730. ENV: Agent did: predict-yes for direction R in state State-A
  6731. In State-A moving R
  6732. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6733. predict error 0
  6734. dir: dir isU
  6735. \-946: O: O1892 (predict-no)
  6736. I see 1 and I'm going to do: predict-no
  6737. ENV: Agent did: predict-no for direction U in state State-B
  6738. In State-B moving U
  6739. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6740. predict error 0
  6741. dir: dir isR
  6742. /|\947: O: O1894 (predict-no)
  6743. I see 1 and I'm going to do: predict-no
  6744. ENV: Agent did: predict-no for direction R in state State-B
  6745. In State-B moving R
  6746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6747. predict error 0
  6748. dir: dir isR
  6749. -/|948: O: O1896 (predict-no)
  6750. I see 1 and I'm going to do: predict-no
  6751. ENV: Agent did: predict-no for direction R in state State-B
  6752. In State-B moving R
  6753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6754. predict error 0
  6755. dir: dir isR
  6756. \-/949: O: O1898 (predict-no)
  6757. I see 1 and I'm going to do: predict-no
  6758. ENV: Agent did: predict-no for direction R in state State-B
  6759. In State-B moving R
  6760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6761. predict error 0
  6762. dir: dir isU
  6763. |\-950: O: O1900 (predict-no)
  6764. I see 1 and I'm going to do: predict-no
  6765. ENV: Agent did: predict-no for direction U in state State-B
  6766. In State-B moving U
  6767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6768. predict error 0
  6769. dir: dir isU
  6770. /|\-/|\-/--- Input Phase ---
  6771. =>WM: (13307: I2 ^dir U)
  6772. =>WM: (13306: I2 ^reward 1)
  6773. =>WM: (13305: I2 ^see 0)
  6774. =>WM: (13304: N950 ^status complete)
  6775. <=WM: (13293: I2 ^dir U)
  6776. <=WM: (13292: I2 ^reward 1)
  6777. <=WM: (13291: I2 ^see 0)
  6778. =>WM: (13308: I2 ^level-1 R0-root)
  6779. <=WM: (13294: I2 ^level-1 R0-root)
  6780. --- END Input Phase ---
  6781. --- Proposal Phase ---
  6782. --- Inner Elaboration Phase, active level 1 (S1) ---
  6783. Firing elaborate*copy-see-to-output-link
  6784. -->
  6785. (I3 ^see 0 +)
  6786. Firing elaborate*reward*based*on*reward
  6787. -->
  6788. (R954 ^value 1 +)
  6789. (R1 ^reward R954 +)
  6790. Firing propose*predict-yes
  6791. -->
  6792. (O1901 ^name predict-yes +)
  6793. (S1 ^operator O1901 +)
  6794. Firing propose*predict-no
  6795. -->
  6796. (O1902 ^name predict-no +)
  6797. (S1 ^operator O1902 +)
  6798. Firing rl*prefer*rvt*predict-no*H0*4
  6799. -->
  6800. (S1 ^operator O1900 = 1.)
  6801. Firing rl*prefer*rvt*predict-yes*H0*3
  6802. -->
  6803. (S1 ^operator O1899 = 0.)
  6804. Firing prefer*rvt*predict-yes*H0
  6805. -->
  6806. Firing prefer*rvt*predict-no*H0
  6807. -->
  6808. Firing elaborate*copy-dir-to-output-link
  6809. -->
  6810. (I3 ^dir U +)
  6811. inner elaboration loop at bottom goal.
  6812. Retracting elaborate*copy-see-to-output-link
  6813. -->
  6814. (I3 ^see 0 +)
  6815. Retracting propose*predict-no
  6816. -->
  6817. (O1900 ^name predict-no +)
  6818. (S1 ^operator O1900 +)
  6819. Retracting propose*predict-yes
  6820. -->
  6821. (O1899 ^name predict-yes +)
  6822. (S1 ^operator O1899 +)
  6823. Retracting elaborate*reward*based*on*reward
  6824. -->
  6825. (R953 ^value 1 +)
  6826. (R1 ^reward R953 +)
  6827. Retracting elaborate*copy-dir-to-output-link
  6828. -->
  6829. (I3 ^dir U +)
  6830. Retracting rl*prefer*rvt*predict-no*H0*4
  6831. -->
  6832. (S1 ^operator O1900 = 1.)
  6833. Retracting rl*prefer*rvt*predict-yes*H0*3
  6834. -->
  6835. (S1 ^operator O1899 = 0.)
  6836. =>WM: (13314: S1 ^operator O1902 +)
  6837. =>WM: (13313: S1 ^operator O1901 +)
  6838. =>WM: (13312: O1902 ^name predict-no)
  6839. =>WM: (13311: O1901 ^name predict-yes)
  6840. =>WM: (13310: R954 ^value 1)
  6841. =>WM: (13309: R1 ^reward R954)
  6842. <=WM: (13300: S1 ^operator O1899 +)
  6843. <=WM: (13301: S1 ^operator O1900 +)
  6844. <=WM: (13302: S1 ^operator O1900)
  6845. <=WM: (13295: R1 ^reward R953)
  6846. <=WM: (13298: O1900 ^name predict-no)
  6847. <=WM: (13297: O1899 ^name predict-yes)
  6848. <=WM: (13296: R953 ^value 1)
  6849. --- Inner Elaboration Phase, active level 1 (S1) ---
  6850. Firing prefer*rvt*predict-yes*H0
  6851. -->
  6852. Firing rl*prefer*rvt*predict-yes*H0*3
  6853. -->
  6854. (S1 ^operator O1901 = 0.)
  6855. Firing prefer*rvt*predict-no*H0
  6856. -->
  6857. Firing rl*prefer*rvt*predict-no*H0*4
  6858. -->
  6859. (S1 ^operator O1902 = 1.)
  6860. inner elaboration loop at bottom goal.
  6861. Retracting rl*prefer*rvt*predict-no*H0*4
  6862. -->
  6863. (S1 ^operator O1900 = 1.)
  6864. Retracting rl*prefer*rvt*predict-yes*H0*3
  6865. -->
  6866. (S1 ^operator O1899 = 0.)
  6867. --- END Proposal Phase ---
  6868. --- Decision Phase ---
  6869. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6870. =>WM: (13315: S1 ^operator O1902)
  6871. 951: O: O1902 (predict-no)
  6872. --- END Decision Phase ---
  6873. --- Application Phase ---
  6874. --- Firing Productions (PE) For State At Depth 1 ---
  6875. --- Inner Elaboration Phase, active level 1 (S1) ---
  6876. Firing apply*operator
  6877. -->
  6878. (I3 ^predict-no N951 + :O )
  6879. Firing apply*operator*complete
  6880. -->
  6881. (I3 ^predict-no N950 - :O )
  6882. inner elaboration loop at bottom goal.
  6883. --- Change Working Memory (PE) ---
  6884. =>WM: (13316: I3 ^predict-no N951)
  6885. <=WM: (13304: N950 ^status complete)
  6886. <=WM: (13303: I3 ^predict-no N950)
  6887. --- Firing Productions (IE) For State At Depth 1 ---
  6888. --- Inner Elaboration Phase, active level 1 (S1) ---
  6889. Firing monitor*world
  6890. -->
  6891. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6892. --- Change Working Memory (IE) ---
  6893. --- END Application Phase ---
  6894. --- Output Phase ---
  6895. ENV: Agent did: predict-no for direction U in state State-B
  6896. In State-B moving U
  6897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6898. predict error 0
  6899. dir: dir isL
  6900. --- END Output Phase ---
  6901. |--- Input Phase ---
  6902. =>WM: (13320: I2 ^dir L)
  6903. =>WM: (13319: I2 ^reward 1)
  6904. =>WM: (13318: I2 ^see 0)
  6905. =>WM: (13317: N951 ^status complete)
  6906. <=WM: (13307: I2 ^dir U)
  6907. <=WM: (13306: I2 ^reward 1)
  6908. <=WM: (13305: I2 ^see 0)
  6909. =>WM: (13321: I2 ^level-1 R0-root)
  6910. <=WM: (13308: I2 ^level-1 R0-root)
  6911. --- END Input Phase ---
  6912. --- Proposal Phase ---
  6913. --- Inner Elaboration Phase, active level 1 (S1) ---
  6914. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  6915. -->
  6916. (S1 ^operator O1901 = 0.6195564468661043)
  6917. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  6918. -->
  6919. (S1 ^operator O1902 = -0.2190661556260421)
  6920. Firing prefer*rvt*predict-no*H0*2*v1*H1
  6921. -->
  6922. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  6923. -->
  6924. Firing elaborate*copy-see-to-output-link
  6925. -->
  6926. (I3 ^see 0 +)
  6927. Firing elaborate*reward*based*on*reward
  6928. -->
  6929. (R955 ^value 1 +)
  6930. (R1 ^reward R955 +)
  6931. Firing propose*predict-yes
  6932. -->
  6933. (O1903 ^name predict-yes +)
  6934. (S1 ^operator O1903 +)
  6935. Firing propose*predict-no
  6936. -->
  6937. (O1904 ^name predict-no +)
  6938. (S1 ^operator O1904 +)
  6939. Firing rl*prefer*rvt*predict-no*H0*2
  6940. -->
  6941. (S1 ^operator O1902 = 0.314040627026034)
  6942. Firing rl*prefer*rvt*predict-yes*H0*1
  6943. -->
  6944. (S1 ^operator O1901 = 0.3804224030022332)
  6945. Firing prefer*rvt*predict-yes*H0
  6946. -->
  6947. Firing prefer*rvt*predict-no*H0
  6948. -->
  6949. Firing elaborate*copy-dir-to-output-link
  6950. -->
  6951. (I3 ^dir L +)
  6952. inner elaboration loop at bottom goal.
  6953. Retracting elaborate*copy-see-to-output-link
  6954. -->
  6955. (I3 ^see 0 +)
  6956. Retracting propose*predict-no
  6957. -->
  6958. (O1902 ^name predict-no +)
  6959. (S1 ^operator O1902 +)
  6960. Retracting propose*predict-yes
  6961. -->
  6962. (O1901 ^name predict-yes +)
  6963. (S1 ^operator O1901 +)
  6964. Retracting elaborate*reward*based*on*reward
  6965. -->
  6966. (R954 ^value 1 +)
  6967. (R1 ^reward R954 +)
  6968. Retracting elaborate*copy-dir-to-output-link
  6969. -->
  6970. (I3 ^dir U +)
  6971. Retracting rl*prefer*rvt*predict-no*H0*4
  6972. -->
  6973. (S1 ^operator O1902 = 1.)
  6974. Retracting rl*prefer*rvt*predict-yes*H0*3
  6975. -->
  6976. (S1 ^operator O1901 = 0.)
  6977. =>WM: (13328: S1 ^operator O1904 +)
  6978. =>WM: (13327: S1 ^operator O1903 +)
  6979. =>WM: (13326: I3 ^dir L)
  6980. =>WM: (13325: O1904 ^name predict-no)
  6981. =>WM: (13324: O1903 ^name predict-yes)
  6982. =>WM: (13323: R955 ^value 1)
  6983. =>WM: (13322: R1 ^reward R955)
  6984. <=WM: (13313: S1 ^operator O1901 +)
  6985. <=WM: (13314: S1 ^operator O1902 +)
  6986. <=WM: (13315: S1 ^operator O1902)
  6987. <=WM: (13299: I3 ^dir U)
  6988. <=WM: (13309: R1 ^reward R954)
  6989. <=WM: (13312: O1902 ^name predict-no)
  6990. <=WM: (13311: O1901 ^name predict-yes)
  6991. <=WM: (13310: R954 ^value 1)
  6992. --- Inner Elaboration Phase, active level 1 (S1) ---
  6993. Firing prefer*rvt*predict-yes*H0
  6994. -->
  6995. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  6996. -->
  6997. (S1 ^operator O1903 = 0.6195564468661043)
  6998. Firing rl*prefer*rvt*predict-yes*H0*1
  6999. -->
  7000. (S1 ^operator O1903 = 0.3804224030022332)
  7001. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7002. -->
  7003. Firing prefer*rvt*predict-no*H0
  7004. -->
  7005. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7006. -->
  7007. (S1 ^operator O1904 = -0.2190661556260421)
  7008. Firing rl*prefer*rvt*predict-no*H0*2
  7009. -->
  7010. (S1 ^operator O1904 = 0.314040627026034)
  7011. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7012. -->
  7013. inner elaboration loop at bottom goal.
  7014. Retracting rl*prefer*rvt*predict-no*H0*2
  7015. -->
  7016. (S1 ^operator O1902 = 0.314040627026034)
  7017. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7018. -->
  7019. (S1 ^operator O1902 = -0.2190661556260421)
  7020. Retracting rl*prefer*rvt*predict-yes*H0*1
  7021. -->
  7022. (S1 ^operator O1901 = 0.3804224030022332)
  7023. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7024. -->
  7025. (S1 ^operator O1901 = 0.6195564468661043)
  7026. --- END Proposal Phase ---
  7027. --- Decision Phase ---
  7028. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7029. =>WM: (13329: S1 ^operator O1903)
  7030. 952: O: O1903 (predict-yes)
  7031. --- END Decision Phase ---
  7032. --- Application Phase ---
  7033. --- Firing Productions (PE) For State At Depth 1 ---
  7034. --- Inner Elaboration Phase, active level 1 (S1) ---
  7035. Firing apply*operator
  7036. -->
  7037. (I3 ^predict-yes N952 + :O )
  7038. Firing apply*operator*complete
  7039. -->
  7040. (I3 ^predict-no N951 - :O )
  7041. inner elaboration loop at bottom goal.
  7042. --- Change Working Memory (PE) ---
  7043. =>WM: (13330: I3 ^predict-yes N952)
  7044. <=WM: (13317: N951 ^status complete)
  7045. <=WM: (13316: I3 ^predict-no N951)
  7046. --- Firing Productions (IE) For State At Depth 1 ---
  7047. --- Inner Elaboration Phase, active level 1 (S1) ---
  7048. Firing monitor*world
  7049. -->
  7050. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7051. --- Change Working Memory (IE) ---
  7052. --- END Application Phase ---
  7053. --- Output Phase ---
  7054. ENV: Agent did: predict-yes for direction L in state State-B
  7055. In State-B moving L
  7056. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7057. predict error 0
  7058. dir: dir isR
  7059. --- END Output Phase ---
  7060. \-/--- Input Phase ---
  7061. =>WM: (13334: I2 ^dir R)
  7062. =>WM: (13333: I2 ^reward 1)
  7063. =>WM: (13332: I2 ^see 1)
  7064. =>WM: (13331: N952 ^status complete)
  7065. <=WM: (13320: I2 ^dir L)
  7066. <=WM: (13319: I2 ^reward 1)
  7067. <=WM: (13318: I2 ^see 0)
  7068. =>WM: (13335: I2 ^level-1 L1-root)
  7069. <=WM: (13321: I2 ^level-1 R0-root)
  7070. --- END Input Phase ---
  7071. --- Proposal Phase ---
  7072. --- Inner Elaboration Phase, active level 1 (S1) ---
  7073. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7074. -->
  7075. (S1 ^operator O1903 = 0.7066224695034091)
  7076. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7077. -->
  7078. (S1 ^operator O1904 = -0.1937987592593187)
  7079. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7080. -->
  7081. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7082. -->
  7083. Firing elaborate*copy-see-to-output-link
  7084. -->
  7085. (I3 ^see 1 +)
  7086. Firing elaborate*reward*based*on*reward
  7087. -->
  7088. (R956 ^value 1 +)
  7089. (R1 ^reward R956 +)
  7090. Firing propose*predict-yes
  7091. -->
  7092. (O1905 ^name predict-yes +)
  7093. (S1 ^operator O1905 +)
  7094. Firing propose*predict-no
  7095. -->
  7096. (O1906 ^name predict-no +)
  7097. (S1 ^operator O1906 +)
  7098. Firing rl*prefer*rvt*predict-no*H0*6
  7099. -->
  7100. (S1 ^operator O1904 = 0.2298785768141863)
  7101. Firing rl*prefer*rvt*predict-yes*H0*5
  7102. -->
  7103. (S1 ^operator O1903 = 0.2940444083423254)
  7104. Firing prefer*rvt*predict-yes*H0
  7105. -->
  7106. Firing prefer*rvt*predict-no*H0
  7107. -->
  7108. Firing elaborate*copy-dir-to-output-link
  7109. -->
  7110. (I3 ^dir R +)
  7111. inner elaboration loop at bottom goal.
  7112. Retracting elaborate*copy-see-to-output-link
  7113. -->
  7114. (I3 ^see 0 +)
  7115. Retracting propose*predict-no
  7116. -->
  7117. (O1904 ^name predict-no +)
  7118. (S1 ^operator O1904 +)
  7119. Retracting propose*predict-yes
  7120. -->
  7121. (O1903 ^name predict-yes +)
  7122. (S1 ^operator O1903 +)
  7123. Retracting elaborate*reward*based*on*reward
  7124. -->
  7125. (R955 ^value 1 +)
  7126. (R1 ^reward R955 +)
  7127. Retracting elaborate*copy-dir-to-output-link
  7128. -->
  7129. (I3 ^dir L +)
  7130. Retracting rl*prefer*rvt*predict-no*H0*2
  7131. -->
  7132. (S1 ^operator O1904 = 0.314040627026034)
  7133. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7134. -->
  7135. (S1 ^operator O1904 = -0.2190661556260421)
  7136. Retracting rl*prefer*rvt*predict-yes*H0*1
  7137. -->
  7138. (S1 ^operator O1903 = 0.3804224030022332)
  7139. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7140. -->
  7141. (S1 ^operator O1903 = 0.6195564468661043)
  7142. =>WM: (13343: S1 ^operator O1906 +)
  7143. =>WM: (13342: S1 ^operator O1905 +)
  7144. =>WM: (13341: I3 ^dir R)
  7145. =>WM: (13340: O1906 ^name predict-no)
  7146. =>WM: (13339: O1905 ^name predict-yes)
  7147. =>WM: (13338: R956 ^value 1)
  7148. =>WM: (13337: R1 ^reward R956)
  7149. =>WM: (13336: I3 ^see 1)
  7150. <=WM: (13327: S1 ^operator O1903 +)
  7151. <=WM: (13329: S1 ^operator O1903)
  7152. <=WM: (13328: S1 ^operator O1904 +)
  7153. <=WM: (13326: I3 ^dir L)
  7154. <=WM: (13322: R1 ^reward R955)
  7155. <=WM: (13254: I3 ^see 0)
  7156. <=WM: (13325: O1904 ^name predict-no)
  7157. <=WM: (13324: O1903 ^name predict-yes)
  7158. <=WM: (13323: R955 ^value 1)
  7159. --- Inner Elaboration Phase, active level 1 (S1) ---
  7160. Firing prefer*rvt*predict-yes*H0
  7161. -->
  7162. Firing rl*prefer*rvt*predict-yes*H0*5
  7163. -->
  7164. (S1 ^operator O1905 = 0.2940444083423254)
  7165. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7166. -->
  7167. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7168. -->
  7169. (S1 ^operator O1905 = 0.7066224695034091)
  7170. Firing prefer*rvt*predict-no*H0
  7171. -->
  7172. Firing rl*prefer*rvt*predict-no*H0*6
  7173. -->
  7174. (S1 ^operator O1906 = 0.2298785768141863)
  7175. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7176. -->
  7177. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7178. -->
  7179. (S1 ^operator O1906 = -0.1937987592593187)
  7180. inner elaboration loop at bottom goal.
  7181. Retracting rl*prefer*rvt*predict-no*H0*6
  7182. -->
  7183. (S1 ^operator O1904 = 0.2298785768141863)
  7184. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7185. -->
  7186. (S1 ^operator O1904 = -0.1937987592593187)
  7187. Retracting rl*prefer*rvt*predict-yes*H0*5
  7188. -->
  7189. (S1 ^operator O1903 = 0.2940444083423254)
  7190. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7191. -->
  7192. (S1 ^operator O1903 = 0.7066224695034091)
  7193. --- END Proposal Phase ---
  7194. --- Decision Phase ---
  7195. RL update rl*prefer*rvt*predict-yes*H0*1 0.521353 -0.140931 0.380422 -> 0.521355 -0.140931 0.380424(R,m,v=1,0.819355,0.148974)
  7196. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478624 0.140933 0.619556 -> 0.478626 0.140932 0.619559(R,m,v=1,1,0)
  7197. =>WM: (13344: S1 ^operator O1905)
  7198. 953: O: O1905 (predict-yes)
  7199. --- END Decision Phase ---
  7200. --- Application Phase ---
  7201. --- Firing Productions (PE) For State At Depth 1 ---
  7202. --- Inner Elaboration Phase, active level 1 (S1) ---
  7203. Firing apply*operator
  7204. -->
  7205. (I3 ^predict-yes N953 + :O )
  7206. Firing apply*operator*complete
  7207. -->
  7208. (I3 ^predict-yes N952 - :O )
  7209. inner elaboration loop at bottom goal.
  7210. --- Change Working Memory (PE) ---
  7211. =>WM: (13345: I3 ^predict-yes N953)
  7212. <=WM: (13331: N952 ^status complete)
  7213. <=WM: (13330: I3 ^predict-yes N952)
  7214. --- Firing Productions (IE) For State At Depth 1 ---
  7215. --- Inner Elaboration Phase, active level 1 (S1) ---
  7216. Firing monitor*world
  7217. -->
  7218. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7219. --- Change Working Memory (IE) ---
  7220. --- END Application Phase ---
  7221. --- Output Phase ---
  7222. ENV: Agent did: predict-yes for direction R in state State-A
  7223. In State-A moving R
  7224. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7225. predict error 0
  7226. dir: dir isR
  7227. --- END Output Phase ---
  7228. |\---- Input Phase ---
  7229. =>WM: (13349: I2 ^dir R)
  7230. =>WM: (13348: I2 ^reward 1)
  7231. =>WM: (13347: I2 ^see 1)
  7232. =>WM: (13346: N953 ^status complete)
  7233. <=WM: (13334: I2 ^dir R)
  7234. <=WM: (13333: I2 ^reward 1)
  7235. <=WM: (13332: I2 ^see 1)
  7236. =>WM: (13350: I2 ^level-1 R1-root)
  7237. <=WM: (13335: I2 ^level-1 L1-root)
  7238. --- END Input Phase ---
  7239. --- Proposal Phase ---
  7240. --- Inner Elaboration Phase, active level 1 (S1) ---
  7241. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7242. -->
  7243. (S1 ^operator O1905 = -0.252585164213872)
  7244. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7245. -->
  7246. (S1 ^operator O1906 = 0.7702047625716166)
  7247. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7248. -->
  7249. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7250. -->
  7251. Firing elaborate*copy-see-to-output-link
  7252. -->
  7253. (I3 ^see 1 +)
  7254. Firing elaborate*reward*based*on*reward
  7255. -->
  7256. (R957 ^value 1 +)
  7257. (R1 ^reward R957 +)
  7258. Firing propose*predict-yes
  7259. -->
  7260. (O1907 ^name predict-yes +)
  7261. (S1 ^operator O1907 +)
  7262. Firing propose*predict-no
  7263. -->
  7264. (O1908 ^name predict-no +)
  7265. (S1 ^operator O1908 +)
  7266. Firing rl*prefer*rvt*predict-no*H0*6
  7267. -->
  7268. (S1 ^operator O1906 = 0.2298785768141863)
  7269. Firing rl*prefer*rvt*predict-yes*H0*5
  7270. -->
  7271. (S1 ^operator O1905 = 0.2940444083423254)
  7272. Firing prefer*rvt*predict-yes*H0
  7273. -->
  7274. Firing prefer*rvt*predict-no*H0
  7275. -->
  7276. Firing elaborate*copy-dir-to-output-link
  7277. -->
  7278. (I3 ^dir R +)
  7279. inner elaboration loop at bottom goal.
  7280. Retracting elaborate*copy-see-to-output-link
  7281. -->
  7282. (I3 ^see 1 +)
  7283. Retracting propose*predict-no
  7284. -->
  7285. (O1906 ^name predict-no +)
  7286. (S1 ^operator O1906 +)
  7287. Retracting propose*predict-yes
  7288. -->
  7289. (O1905 ^name predict-yes +)
  7290. (S1 ^operator O1905 +)
  7291. Retracting elaborate*reward*based*on*reward
  7292. -->
  7293. (R956 ^value 1 +)
  7294. (R1 ^reward R956 +)
  7295. Retracting elaborate*copy-dir-to-output-link
  7296. -->
  7297. (I3 ^dir R +)
  7298. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  7299. -->
  7300. (S1 ^operator O1906 = -0.1937987592593187)
  7301. Retracting rl*prefer*rvt*predict-no*H0*6
  7302. -->
  7303. (S1 ^operator O1906 = 0.2298785768141863)
  7304. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  7305. -->
  7306. (S1 ^operator O1905 = 0.7066224695034091)
  7307. Retracting rl*prefer*rvt*predict-yes*H0*5
  7308. -->
  7309. (S1 ^operator O1905 = 0.2940444083423254)
  7310. =>WM: (13356: S1 ^operator O1908 +)
  7311. =>WM: (13355: S1 ^operator O1907 +)
  7312. =>WM: (13354: O1908 ^name predict-no)
  7313. =>WM: (13353: O1907 ^name predict-yes)
  7314. =>WM: (13352: R957 ^value 1)
  7315. =>WM: (13351: R1 ^reward R957)
  7316. <=WM: (13342: S1 ^operator O1905 +)
  7317. <=WM: (13344: S1 ^operator O1905)
  7318. <=WM: (13343: S1 ^operator O1906 +)
  7319. <=WM: (13337: R1 ^reward R956)
  7320. <=WM: (13340: O1906 ^name predict-no)
  7321. <=WM: (13339: O1905 ^name predict-yes)
  7322. <=WM: (13338: R956 ^value 1)
  7323. --- Inner Elaboration Phase, active level 1 (S1) ---
  7324. Firing prefer*rvt*predict-yes*H0
  7325. -->
  7326. Firing rl*prefer*rvt*predict-yes*H0*5
  7327. -->
  7328. (S1 ^operator O1907 = 0.2940444083423254)
  7329. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7330. -->
  7331. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7332. -->
  7333. (S1 ^operator O1907 = -0.252585164213872)
  7334. Firing prefer*rvt*predict-no*H0
  7335. -->
  7336. Firing rl*prefer*rvt*predict-no*H0*6
  7337. -->
  7338. (S1 ^operator O1908 = 0.2298785768141863)
  7339. Firing prefer*rvt*predict-no*H0*6*v1*H1
  7340. -->
  7341. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7342. -->
  7343. (S1 ^operator O1908 = 0.7702047625716166)
  7344. inner elaboration loop at bottom goal.
  7345. Retracting rl*prefer*rvt*predict-no*H0*6
  7346. -->
  7347. (S1 ^operator O1906 = 0.2298785768141863)
  7348. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7349. -->
  7350. (S1 ^operator O1906 = 0.7702047625716166)
  7351. Retracting rl*prefer*rvt*predict-yes*H0*5
  7352. -->
  7353. (S1 ^operator O1905 = 0.2940444083423254)
  7354. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7355. -->
  7356. (S1 ^operator O1905 = -0.252585164213872)
  7357. --- END Proposal Phase ---
  7358. --- Decision Phase ---
  7359. RL update rl*prefer*rvt*predict-yes*H0*5 0.501112 -0.207068 0.294044 -> 0.501062 -0.207073 0.293989(R,m,v=1,0.835616,0.138309)
  7360. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499487 0.207136 0.706622 -> 0.499427 0.207129 0.706557(R,m,v=1,1,0)
  7361. =>WM: (13357: S1 ^operator O1908)
  7362. 954: O: O1908 (predict-no)
  7363. --- END Decision Phase ---
  7364. --- Application Phase ---
  7365. --- Firing Productions (PE) For State At Depth 1 ---
  7366. --- Inner Elaboration Phase, active level 1 (S1) ---
  7367. Firing apply*operator
  7368. -->
  7369. (I3 ^predict-no N954 + :O )
  7370. Firing apply*operator*complete
  7371. -->
  7372. (I3 ^predict-yes N953 - :O )
  7373. inner elaboration loop at bottom goal.
  7374. --- Change Working Memory (PE) ---
  7375. =>WM: (13358: I3 ^predict-no N954)
  7376. <=WM: (13346: N953 ^status complete)
  7377. <=WM: (13345: I3 ^predict-yes N953)
  7378. --- Firing Productions (IE) For State At Depth 1 ---
  7379. --- Inner Elaboration Phase, active level 1 (S1) ---
  7380. Firing monitor*world
  7381. -->
  7382. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7383. --- Change Working Memory (IE) ---
  7384. --- END Application Phase ---
  7385. --- Output Phase ---
  7386. ENV: Agent did: predict-no for direction R in state State-B
  7387. In State-B moving R
  7388. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7389. predict error 0
  7390. dir: dir isU
  7391. --- END Output Phase ---
  7392. /|\--- Input Phase ---
  7393. =>WM: (13362: I2 ^dir U)
  7394. =>WM: (13361: I2 ^reward 1)
  7395. =>WM: (13360: I2 ^see 0)
  7396. =>WM: (13359: N954 ^status complete)
  7397. <=WM: (13349: I2 ^dir R)
  7398. <=WM: (13348: I2 ^reward 1)
  7399. <=WM: (13347: I2 ^see 1)
  7400. =>WM: (13363: I2 ^level-1 R0-root)
  7401. <=WM: (13350: I2 ^level-1 R1-root)
  7402. --- END Input Phase ---
  7403. --- Proposal Phase ---
  7404. --- Inner Elaboration Phase, active level 1 (S1) ---
  7405. Firing elaborate*copy-see-to-output-link
  7406. -->
  7407. (I3 ^see 0 +)
  7408. Firing elaborate*reward*based*on*reward
  7409. -->
  7410. (R958 ^value 1 +)
  7411. (R1 ^reward R958 +)
  7412. Firing propose*predict-yes
  7413. -->
  7414. (O1909 ^name predict-yes +)
  7415. (S1 ^operator O1909 +)
  7416. Firing propose*predict-no
  7417. -->
  7418. (O1910 ^name predict-no +)
  7419. (S1 ^operator O1910 +)
  7420. Firing rl*prefer*rvt*predict-no*H0*4
  7421. -->
  7422. (S1 ^operator O1908 = 1.)
  7423. Firing rl*prefer*rvt*predict-yes*H0*3
  7424. -->
  7425. (S1 ^operator O1907 = 0.)
  7426. Firing prefer*rvt*predict-yes*H0
  7427. -->
  7428. Firing prefer*rvt*predict-no*H0
  7429. -->
  7430. Firing elaborate*copy-dir-to-output-link
  7431. -->
  7432. (I3 ^dir U +)
  7433. inner elaboration loop at bottom goal.
  7434. Retracting elaborate*copy-see-to-output-link
  7435. -->
  7436. (I3 ^see 1 +)
  7437. Retracting propose*predict-no
  7438. -->
  7439. (O1908 ^name predict-no +)
  7440. (S1 ^operator O1908 +)
  7441. Retracting propose*predict-yes
  7442. -->
  7443. (O1907 ^name predict-yes +)
  7444. (S1 ^operator O1907 +)
  7445. Retracting elaborate*reward*based*on*reward
  7446. -->
  7447. (R957 ^value 1 +)
  7448. (R1 ^reward R957 +)
  7449. Retracting elaborate*copy-dir-to-output-link
  7450. -->
  7451. (I3 ^dir R +)
  7452. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  7453. -->
  7454. (S1 ^operator O1908 = 0.7702047625716166)
  7455. Retracting rl*prefer*rvt*predict-no*H0*6
  7456. -->
  7457. (S1 ^operator O1908 = 0.2298785768141863)
  7458. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  7459. -->
  7460. (S1 ^operator O1907 = -0.252585164213872)
  7461. Retracting rl*prefer*rvt*predict-yes*H0*5
  7462. -->
  7463. (S1 ^operator O1907 = 0.2939886829338975)
  7464. =>WM: (13371: S1 ^operator O1910 +)
  7465. =>WM: (13370: S1 ^operator O1909 +)
  7466. =>WM: (13369: I3 ^dir U)
  7467. =>WM: (13368: O1910 ^name predict-no)
  7468. =>WM: (13367: O1909 ^name predict-yes)
  7469. =>WM: (13366: R958 ^value 1)
  7470. =>WM: (13365: R1 ^reward R958)
  7471. =>WM: (13364: I3 ^see 0)
  7472. <=WM: (13355: S1 ^operator O1907 +)
  7473. <=WM: (13356: S1 ^operator O1908 +)
  7474. <=WM: (13357: S1 ^operator O1908)
  7475. <=WM: (13341: I3 ^dir R)
  7476. <=WM: (13351: R1 ^reward R957)
  7477. <=WM: (13336: I3 ^see 1)
  7478. <=WM: (13354: O1908 ^name predict-no)
  7479. <=WM: (13353: O1907 ^name predict-yes)
  7480. <=WM: (13352: R957 ^value 1)
  7481. --- Inner Elaboration Phase, active level 1 (S1) ---
  7482. Firing prefer*rvt*predict-yes*H0
  7483. -->
  7484. Firing rl*prefer*rvt*predict-yes*H0*3
  7485. -->
  7486. (S1 ^operator O1909 = 0.)
  7487. Firing prefer*rvt*predict-no*H0
  7488. -->
  7489. Firing rl*prefer*rvt*predict-no*H0*4
  7490. -->
  7491. (S1 ^operator O1910 = 1.)
  7492. inner elaboration loop at bottom goal.
  7493. Retracting rl*prefer*rvt*predict-no*H0*4
  7494. -->
  7495. (S1 ^operator O1908 = 1.)
  7496. Retracting rl*prefer*rvt*predict-yes*H0*3
  7497. -->
  7498. (S1 ^operator O1907 = 0.)
  7499. --- END Proposal Phase ---
  7500. --- Decision Phase ---
  7501. RL update rl*prefer*rvt*predict-no*H0*6 0.611927 -0.382049 0.229879 -> 0.611922 -0.38205 0.229872(R,m,v=1,0.842105,0.133746)
  7502. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388141 0.382064 0.770205 -> 0.388134 0.382063 0.770196(R,m,v=1,1,0)
  7503. =>WM: (13372: S1 ^operator O1910)
  7504. 955: O: O1910 (predict-no)
  7505. --- END Decision Phase ---
  7506. --- Application Phase ---
  7507. --- Firing Productions (PE) For State At Depth 1 ---
  7508. --- Inner Elaboration Phase, active level 1 (S1) ---
  7509. Firing apply*operator
  7510. -->
  7511. (I3 ^predict-no N955 + :O )
  7512. Firing apply*operator*complete
  7513. -->
  7514. (I3 ^predict-no N954 - :O )
  7515. inner elaboration loop at bottom goal.
  7516. --- Change Working Memory (PE) ---
  7517. =>WM: (13373: I3 ^predict-no N955)
  7518. <=WM: (13359: N954 ^status complete)
  7519. <=WM: (13358: I3 ^predict-no N954)
  7520. --- Firing Productions (IE) For State At Depth 1 ---
  7521. --- Inner Elaboration Phase, active level 1 (S1) ---
  7522. Firing monitor*world
  7523. -->
  7524. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7525. --- Change Working Memory (IE) ---
  7526. --- END Application Phase ---
  7527. --- Output Phase ---
  7528. ENV: Agent did: predict-no for direction U in state State-B
  7529. In State-B moving U
  7530. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7531. predict error 0
  7532. dir: dir isL
  7533. --- END Output Phase ---
  7534. -/|--- Input Phase ---
  7535. =>WM: (13377: I2 ^dir L)
  7536. =>WM: (13376: I2 ^reward 1)
  7537. =>WM: (13375: I2 ^see 0)
  7538. =>WM: (13374: N955 ^status complete)
  7539. <=WM: (13362: I2 ^dir U)
  7540. <=WM: (13361: I2 ^reward 1)
  7541. <=WM: (13360: I2 ^see 0)
  7542. =>WM: (13378: I2 ^level-1 R0-root)
  7543. <=WM: (13363: I2 ^level-1 R0-root)
  7544. --- END Input Phase ---
  7545. --- Proposal Phase ---
  7546. --- Inner Elaboration Phase, active level 1 (S1) ---
  7547. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7548. -->
  7549. (S1 ^operator O1909 = 0.6195585094345952)
  7550. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7551. -->
  7552. (S1 ^operator O1910 = -0.2190661556260421)
  7553. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7554. -->
  7555. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7556. -->
  7557. Firing elaborate*copy-see-to-output-link
  7558. -->
  7559. (I3 ^see 0 +)
  7560. Firing elaborate*reward*based*on*reward
  7561. -->
  7562. (R959 ^value 1 +)
  7563. (R1 ^reward R959 +)
  7564. Firing propose*predict-yes
  7565. -->
  7566. (O1911 ^name predict-yes +)
  7567. (S1 ^operator O1911 +)
  7568. Firing propose*predict-no
  7569. -->
  7570. (O1912 ^name predict-no +)
  7571. (S1 ^operator O1912 +)
  7572. Firing rl*prefer*rvt*predict-no*H0*2
  7573. -->
  7574. (S1 ^operator O1910 = 0.314040627026034)
  7575. Firing rl*prefer*rvt*predict-yes*H0*1
  7576. -->
  7577. (S1 ^operator O1909 = 0.3804241528486575)
  7578. Firing prefer*rvt*predict-yes*H0
  7579. -->
  7580. Firing prefer*rvt*predict-no*H0
  7581. -->
  7582. Firing elaborate*copy-dir-to-output-link
  7583. -->
  7584. (I3 ^dir L +)
  7585. inner elaboration loop at bottom goal.
  7586. Retracting elaborate*copy-see-to-output-link
  7587. -->
  7588. (I3 ^see 0 +)
  7589. Retracting propose*predict-no
  7590. -->
  7591. (O1910 ^name predict-no +)
  7592. (S1 ^operator O1910 +)
  7593. Retracting propose*predict-yes
  7594. -->
  7595. (O1909 ^name predict-yes +)
  7596. (S1 ^operator O1909 +)
  7597. Retracting elaborate*reward*based*on*reward
  7598. -->
  7599. (R958 ^value 1 +)
  7600. (R1 ^reward R958 +)
  7601. Retracting elaborate*copy-dir-to-output-link
  7602. -->
  7603. (I3 ^dir U +)
  7604. Retracting rl*prefer*rvt*predict-no*H0*4
  7605. -->
  7606. (S1 ^operator O1910 = 1.)
  7607. Retracting rl*prefer*rvt*predict-yes*H0*3
  7608. -->
  7609. (S1 ^operator O1909 = 0.)
  7610. =>WM: (13385: S1 ^operator O1912 +)
  7611. =>WM: (13384: S1 ^operator O1911 +)
  7612. =>WM: (13383: I3 ^dir L)
  7613. =>WM: (13382: O1912 ^name predict-no)
  7614. =>WM: (13381: O1911 ^name predict-yes)
  7615. =>WM: (13380: R959 ^value 1)
  7616. =>WM: (13379: R1 ^reward R959)
  7617. <=WM: (13370: S1 ^operator O1909 +)
  7618. <=WM: (13371: S1 ^operator O1910 +)
  7619. <=WM: (13372: S1 ^operator O1910)
  7620. <=WM: (13369: I3 ^dir U)
  7621. <=WM: (13365: R1 ^reward R958)
  7622. <=WM: (13368: O1910 ^name predict-no)
  7623. <=WM: (13367: O1909 ^name predict-yes)
  7624. <=WM: (13366: R958 ^value 1)
  7625. --- Inner Elaboration Phase, active level 1 (S1) ---
  7626. Firing prefer*rvt*predict-yes*H0
  7627. -->
  7628. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7629. -->
  7630. (S1 ^operator O1911 = 0.6195585094345952)
  7631. Firing rl*prefer*rvt*predict-yes*H0*1
  7632. -->
  7633. (S1 ^operator O1911 = 0.3804241528486575)
  7634. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7635. -->
  7636. Firing prefer*rvt*predict-no*H0
  7637. -->
  7638. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7639. -->
  7640. (S1 ^operator O1912 = -0.2190661556260421)
  7641. Firing rl*prefer*rvt*predict-no*H0*2
  7642. -->
  7643. (S1 ^operator O1912 = 0.314040627026034)
  7644. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7645. -->
  7646. inner elaboration loop at bottom goal.
  7647. Retracting rl*prefer*rvt*predict-no*H0*2
  7648. -->
  7649. (S1 ^operator O1910 = 0.314040627026034)
  7650. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7651. -->
  7652. (S1 ^operator O1910 = -0.2190661556260421)
  7653. Retracting rl*prefer*rvt*predict-yes*H0*1
  7654. -->
  7655. (S1 ^operator O1909 = 0.3804241528486575)
  7656. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7657. -->
  7658. (S1 ^operator O1909 = 0.6195585094345952)
  7659. --- END Proposal Phase ---
  7660. --- Decision Phase ---
  7661. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7662. =>WM: (13386: S1 ^operator O1911)
  7663. 956: O: O1911 (predict-yes)
  7664. --- END Decision Phase ---
  7665. --- Application Phase ---
  7666. --- Firing Productions (PE) For State At Depth 1 ---
  7667. --- Inner Elaboration Phase, active level 1 (S1) ---
  7668. Firing apply*operator
  7669. -->
  7670. (I3 ^predict-yes N956 + :O )
  7671. Firing apply*operator*complete
  7672. -->
  7673. (I3 ^predict-no N955 - :O )
  7674. inner elaboration loop at bottom goal.
  7675. --- Change Working Memory (PE) ---
  7676. =>WM: (13387: I3 ^predict-yes N956)
  7677. <=WM: (13374: N955 ^status complete)
  7678. <=WM: (13373: I3 ^predict-no N955)
  7679. --- Firing Productions (IE) For State At Depth 1 ---
  7680. --- Inner Elaboration Phase, active level 1 (S1) ---
  7681. Firing monitor*world
  7682. -->
  7683. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7684. --- Change Working Memory (IE) ---
  7685. --- END Application Phase ---
  7686. --- Output Phase ---
  7687. ENV: Agent did: predict-yes for direction L in state State-B
  7688. In State-B moving L
  7689. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7690. predict error 0
  7691. dir: dir isL
  7692. --- END Output Phase ---
  7693. \-/--- Input Phase ---
  7694. =>WM: (13391: I2 ^dir L)
  7695. =>WM: (13390: I2 ^reward 1)
  7696. =>WM: (13389: I2 ^see 1)
  7697. =>WM: (13388: N956 ^status complete)
  7698. <=WM: (13377: I2 ^dir L)
  7699. <=WM: (13376: I2 ^reward 1)
  7700. <=WM: (13375: I2 ^see 0)
  7701. =>WM: (13392: I2 ^level-1 L1-root)
  7702. <=WM: (13378: I2 ^level-1 R0-root)
  7703. --- END Input Phase ---
  7704. --- Proposal Phase ---
  7705. --- Inner Elaboration Phase, active level 1 (S1) ---
  7706. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7707. -->
  7708. (S1 ^operator O1911 = -0.3470159027404986)
  7709. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7710. -->
  7711. (S1 ^operator O1912 = 0.6861879370801713)
  7712. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7713. -->
  7714. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7715. -->
  7716. Firing elaborate*copy-see-to-output-link
  7717. -->
  7718. (I3 ^see 1 +)
  7719. Firing elaborate*reward*based*on*reward
  7720. -->
  7721. (R960 ^value 1 +)
  7722. (R1 ^reward R960 +)
  7723. Firing propose*predict-yes
  7724. -->
  7725. (O1913 ^name predict-yes +)
  7726. (S1 ^operator O1913 +)
  7727. Firing propose*predict-no
  7728. -->
  7729. (O1914 ^name predict-no +)
  7730. (S1 ^operator O1914 +)
  7731. Firing rl*prefer*rvt*predict-no*H0*2
  7732. -->
  7733. (S1 ^operator O1912 = 0.314040627026034)
  7734. Firing rl*prefer*rvt*predict-yes*H0*1
  7735. -->
  7736. (S1 ^operator O1911 = 0.3804241528486575)
  7737. Firing prefer*rvt*predict-yes*H0
  7738. -->
  7739. Firing prefer*rvt*predict-no*H0
  7740. -->
  7741. Firing elaborate*copy-dir-to-output-link
  7742. -->
  7743. (I3 ^dir L +)
  7744. inner elaboration loop at bottom goal.
  7745. Retracting elaborate*copy-see-to-output-link
  7746. -->
  7747. (I3 ^see 0 +)
  7748. Retracting propose*predict-no
  7749. -->
  7750. (O1912 ^name predict-no +)
  7751. (S1 ^operator O1912 +)
  7752. Retracting propose*predict-yes
  7753. -->
  7754. (O1911 ^name predict-yes +)
  7755. (S1 ^operator O1911 +)
  7756. Retracting elaborate*reward*based*on*reward
  7757. -->
  7758. (R959 ^value 1 +)
  7759. (R1 ^reward R959 +)
  7760. Retracting elaborate*copy-dir-to-output-link
  7761. -->
  7762. (I3 ^dir L +)
  7763. Retracting rl*prefer*rvt*predict-no*H0*2
  7764. -->
  7765. (S1 ^operator O1912 = 0.314040627026034)
  7766. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  7767. -->
  7768. (S1 ^operator O1912 = -0.2190661556260421)
  7769. Retracting rl*prefer*rvt*predict-yes*H0*1
  7770. -->
  7771. (S1 ^operator O1911 = 0.3804241528486575)
  7772. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  7773. -->
  7774. (S1 ^operator O1911 = 0.6195585094345952)
  7775. =>WM: (13399: S1 ^operator O1914 +)
  7776. =>WM: (13398: S1 ^operator O1913 +)
  7777. =>WM: (13397: O1914 ^name predict-no)
  7778. =>WM: (13396: O1913 ^name predict-yes)
  7779. =>WM: (13395: R960 ^value 1)
  7780. =>WM: (13394: R1 ^reward R960)
  7781. =>WM: (13393: I3 ^see 1)
  7782. <=WM: (13384: S1 ^operator O1911 +)
  7783. <=WM: (13386: S1 ^operator O1911)
  7784. <=WM: (13385: S1 ^operator O1912 +)
  7785. <=WM: (13379: R1 ^reward R959)
  7786. <=WM: (13364: I3 ^see 0)
  7787. <=WM: (13382: O1912 ^name predict-no)
  7788. <=WM: (13381: O1911 ^name predict-yes)
  7789. <=WM: (13380: R959 ^value 1)
  7790. --- Inner Elaboration Phase, active level 1 (S1) ---
  7791. Firing prefer*rvt*predict-yes*H0
  7792. -->
  7793. Firing rl*prefer*rvt*predict-yes*H0*1
  7794. -->
  7795. (S1 ^operator O1913 = 0.3804241528486575)
  7796. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7797. -->
  7798. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7799. -->
  7800. (S1 ^operator O1913 = -0.3470159027404986)
  7801. Firing prefer*rvt*predict-no*H0
  7802. -->
  7803. Firing rl*prefer*rvt*predict-no*H0*2
  7804. -->
  7805. (S1 ^operator O1914 = 0.314040627026034)
  7806. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7807. -->
  7808. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7809. -->
  7810. (S1 ^operator O1914 = 0.6861879370801713)
  7811. inner elaboration loop at bottom goal.
  7812. Retracting rl*prefer*rvt*predict-no*H0*2
  7813. -->
  7814. (S1 ^operator O1912 = 0.314040627026034)
  7815. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7816. -->
  7817. (S1 ^operator O1912 = 0.6861879370801713)
  7818. Retracting rl*prefer*rvt*predict-yes*H0*1
  7819. -->
  7820. (S1 ^operator O1911 = 0.3804241528486575)
  7821. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7822. -->
  7823. (S1 ^operator O1911 = -0.3470159027404986)
  7824. --- END Proposal Phase ---
  7825. --- Decision Phase ---
  7826. RL update rl*prefer*rvt*predict-yes*H0*1 0.521355 -0.140931 0.380424 -> 0.521357 -0.140931 0.380426(R,m,v=1,0.820513,0.148222)
  7827. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478626 0.140932 0.619559 -> 0.478628 0.140932 0.61956(R,m,v=1,1,0)
  7828. =>WM: (13400: S1 ^operator O1914)
  7829. 957: O: O1914 (predict-no)
  7830. --- END Decision Phase ---
  7831. --- Application Phase ---
  7832. --- Firing Productions (PE) For State At Depth 1 ---
  7833. --- Inner Elaboration Phase, active level 1 (S1) ---
  7834. Firing apply*operator
  7835. -->
  7836. (I3 ^predict-no N957 + :O )
  7837. Firing apply*operator*complete
  7838. -->
  7839. (I3 ^predict-yes N956 - :O )
  7840. inner elaboration loop at bottom goal.
  7841. --- Change Working Memory (PE) ---
  7842. =>WM: (13401: I3 ^predict-no N957)
  7843. <=WM: (13388: N956 ^status complete)
  7844. <=WM: (13387: I3 ^predict-yes N956)
  7845. --- Firing Productions (IE) For State At Depth 1 ---
  7846. --- Inner Elaboration Phase, active level 1 (S1) ---
  7847. Firing monitor*world
  7848. -->
  7849. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7850. --- Change Working Memory (IE) ---
  7851. --- END Application Phase ---
  7852. --- Output Phase ---
  7853. ENV: Agent did: predict-no for direction L in state State-A
  7854. In State-A moving L
  7855. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7856. predict error 0
  7857. dir: dir isL
  7858. --- END Output Phase ---
  7859. |\---- Input Phase ---
  7860. =>WM: (13405: I2 ^dir L)
  7861. =>WM: (13404: I2 ^reward 1)
  7862. =>WM: (13403: I2 ^see 0)
  7863. =>WM: (13402: N957 ^status complete)
  7864. <=WM: (13391: I2 ^dir L)
  7865. <=WM: (13390: I2 ^reward 1)
  7866. <=WM: (13389: I2 ^see 1)
  7867. =>WM: (13406: I2 ^level-1 L0-root)
  7868. <=WM: (13392: I2 ^level-1 L1-root)
  7869. --- END Input Phase ---
  7870. --- Proposal Phase ---
  7871. --- Inner Elaboration Phase, active level 1 (S1) ---
  7872. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7873. -->
  7874. (S1 ^operator O1913 = -0.3332708974800781)
  7875. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7876. -->
  7877. (S1 ^operator O1914 = 0.6857507825115492)
  7878. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7879. -->
  7880. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7881. -->
  7882. Firing elaborate*copy-see-to-output-link
  7883. -->
  7884. (I3 ^see 0 +)
  7885. Firing elaborate*reward*based*on*reward
  7886. -->
  7887. (R961 ^value 1 +)
  7888. (R1 ^reward R961 +)
  7889. Firing propose*predict-yes
  7890. -->
  7891. (O1915 ^name predict-yes +)
  7892. (S1 ^operator O1915 +)
  7893. Firing propose*predict-no
  7894. -->
  7895. (O1916 ^name predict-no +)
  7896. (S1 ^operator O1916 +)
  7897. Firing rl*prefer*rvt*predict-no*H0*2
  7898. -->
  7899. (S1 ^operator O1914 = 0.314040627026034)
  7900. Firing rl*prefer*rvt*predict-yes*H0*1
  7901. -->
  7902. (S1 ^operator O1913 = 0.3804255857519139)
  7903. Firing prefer*rvt*predict-yes*H0
  7904. -->
  7905. Firing prefer*rvt*predict-no*H0
  7906. -->
  7907. Firing elaborate*copy-dir-to-output-link
  7908. -->
  7909. (I3 ^dir L +)
  7910. inner elaboration loop at bottom goal.
  7911. Retracting elaborate*copy-see-to-output-link
  7912. -->
  7913. (I3 ^see 1 +)
  7914. Retracting propose*predict-no
  7915. -->
  7916. (O1914 ^name predict-no +)
  7917. (S1 ^operator O1914 +)
  7918. Retracting propose*predict-yes
  7919. -->
  7920. (O1913 ^name predict-yes +)
  7921. (S1 ^operator O1913 +)
  7922. Retracting elaborate*reward*based*on*reward
  7923. -->
  7924. (R960 ^value 1 +)
  7925. (R1 ^reward R960 +)
  7926. Retracting elaborate*copy-dir-to-output-link
  7927. -->
  7928. (I3 ^dir L +)
  7929. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  7930. -->
  7931. (S1 ^operator O1914 = 0.6861879370801713)
  7932. Retracting rl*prefer*rvt*predict-no*H0*2
  7933. -->
  7934. (S1 ^operator O1914 = 0.314040627026034)
  7935. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  7936. -->
  7937. (S1 ^operator O1913 = -0.3470159027404986)
  7938. Retracting rl*prefer*rvt*predict-yes*H0*1
  7939. -->
  7940. (S1 ^operator O1913 = 0.3804255857519139)
  7941. =>WM: (13413: S1 ^operator O1916 +)
  7942. =>WM: (13412: S1 ^operator O1915 +)
  7943. =>WM: (13411: O1916 ^name predict-no)
  7944. =>WM: (13410: O1915 ^name predict-yes)
  7945. =>WM: (13409: R961 ^value 1)
  7946. =>WM: (13408: R1 ^reward R961)
  7947. =>WM: (13407: I3 ^see 0)
  7948. <=WM: (13398: S1 ^operator O1913 +)
  7949. <=WM: (13399: S1 ^operator O1914 +)
  7950. <=WM: (13400: S1 ^operator O1914)
  7951. <=WM: (13394: R1 ^reward R960)
  7952. <=WM: (13393: I3 ^see 1)
  7953. <=WM: (13397: O1914 ^name predict-no)
  7954. <=WM: (13396: O1913 ^name predict-yes)
  7955. <=WM: (13395: R960 ^value 1)
  7956. --- Inner Elaboration Phase, active level 1 (S1) ---
  7957. Firing prefer*rvt*predict-yes*H0
  7958. -->
  7959. Firing rl*prefer*rvt*predict-yes*H0*1
  7960. -->
  7961. (S1 ^operator O1915 = 0.3804255857519139)
  7962. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7963. -->
  7964. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7965. -->
  7966. (S1 ^operator O1915 = -0.3332708974800781)
  7967. Firing prefer*rvt*predict-no*H0
  7968. -->
  7969. Firing rl*prefer*rvt*predict-no*H0*2
  7970. -->
  7971. (S1 ^operator O1916 = 0.314040627026034)
  7972. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7973. -->
  7974. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7975. -->
  7976. (S1 ^operator O1916 = 0.6857507825115492)
  7977. inner elaboration loop at bottom goal.
  7978. Retracting rl*prefer*rvt*predict-no*H0*2
  7979. -->
  7980. (S1 ^operator O1914 = 0.314040627026034)
  7981. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  7982. -->
  7983. (S1 ^operator O1914 = 0.6857507825115492)
  7984. Retracting rl*prefer*rvt*predict-yes*H0*1
  7985. -->
  7986. (S1 ^operator O1913 = 0.3804255857519139)
  7987. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  7988. -->
  7989. (S1 ^operator O1913 = -0.3332708974800781)
  7990. --- END Proposal Phase ---
  7991. --- Decision Phase ---
  7992. RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485031 -0.17101 0.314022(R,m,v=1,0.858108,0.122587)
  7993. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515134 0.171054 0.686188 -> 0.515116 0.171049 0.686165(R,m,v=1,1,0)
  7994. =>WM: (13414: S1 ^operator O1916)
  7995. 958: O: O1916 (predict-no)
  7996. --- END Decision Phase ---
  7997. --- Application Phase ---
  7998. --- Firing Productions (PE) For State At Depth 1 ---
  7999. --- Inner Elaboration Phase, active level 1 (S1) ---
  8000. Firing apply*operator
  8001. -->
  8002. (I3 ^predict-no N958 + :O )
  8003. Firing apply*operator*complete
  8004. -->
  8005. (I3 ^predict-no N957 - :O )
  8006. inner elaboration loop at bottom goal.
  8007. --- Change Working Memory (PE) ---
  8008. =>WM: (13415: I3 ^predict-no N958)
  8009. <=WM: (13402: N957 ^status complete)
  8010. <=WM: (13401: I3 ^predict-no N957)
  8011. --- Firing Productions (IE) For State At Depth 1 ---
  8012. --- Inner Elaboration Phase, active level 1 (S1) ---
  8013. Firing monitor*world
  8014. -->
  8015. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8016. --- Change Working Memory (IE) ---
  8017. --- END Application Phase ---
  8018. --- Output Phase ---
  8019. ENV: Agent did: predict-no for direction L in state State-A
  8020. In State-A moving L
  8021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8022. predict error 0
  8023. dir: dir isR
  8024. --- END Output Phase ---
  8025. /|\--- Input Phase ---
  8026. =>WM: (13419: I2 ^dir R)
  8027. =>WM: (13418: I2 ^reward 1)
  8028. =>WM: (13417: I2 ^see 0)
  8029. =>WM: (13416: N958 ^status complete)
  8030. <=WM: (13405: I2 ^dir L)
  8031. <=WM: (13404: I2 ^reward 1)
  8032. <=WM: (13403: I2 ^see 0)
  8033. =>WM: (13420: I2 ^level-1 L0-root)
  8034. <=WM: (13406: I2 ^level-1 L0-root)
  8035. --- END Input Phase ---
  8036. --- Proposal Phase ---
  8037. --- Inner Elaboration Phase, active level 1 (S1) ---
  8038. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8039. -->
  8040. (S1 ^operator O1915 = 0.7053811599250611)
  8041. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8042. -->
  8043. (S1 ^operator O1916 = -0.2023211881870005)
  8044. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8045. -->
  8046. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8047. -->
  8048. Firing elaborate*copy-see-to-output-link
  8049. -->
  8050. (I3 ^see 0 +)
  8051. Firing elaborate*reward*based*on*reward
  8052. -->
  8053. (R962 ^value 1 +)
  8054. (R1 ^reward R962 +)
  8055. Firing propose*predict-yes
  8056. -->
  8057. (O1917 ^name predict-yes +)
  8058. (S1 ^operator O1917 +)
  8059. Firing propose*predict-no
  8060. -->
  8061. (O1918 ^name predict-no +)
  8062. (S1 ^operator O1918 +)
  8063. Firing rl*prefer*rvt*predict-no*H0*6
  8064. -->
  8065. (S1 ^operator O1916 = 0.2298717920574965)
  8066. Firing rl*prefer*rvt*predict-yes*H0*5
  8067. -->
  8068. (S1 ^operator O1915 = 0.2939886829338975)
  8069. Firing prefer*rvt*predict-yes*H0
  8070. -->
  8071. Firing prefer*rvt*predict-no*H0
  8072. -->
  8073. Firing elaborate*copy-dir-to-output-link
  8074. -->
  8075. (I3 ^dir R +)
  8076. inner elaboration loop at bottom goal.
  8077. Retracting elaborate*copy-see-to-output-link
  8078. -->
  8079. (I3 ^see 0 +)
  8080. Retracting propose*predict-no
  8081. -->
  8082. (O1916 ^name predict-no +)
  8083. (S1 ^operator O1916 +)
  8084. Retracting propose*predict-yes
  8085. -->
  8086. (O1915 ^name predict-yes +)
  8087. (S1 ^operator O1915 +)
  8088. Retracting elaborate*reward*based*on*reward
  8089. -->
  8090. (R961 ^value 1 +)
  8091. (R1 ^reward R961 +)
  8092. Retracting elaborate*copy-dir-to-output-link
  8093. -->
  8094. (I3 ^dir L +)
  8095. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  8096. -->
  8097. (S1 ^operator O1916 = 0.6857507825115492)
  8098. Retracting rl*prefer*rvt*predict-no*H0*2
  8099. -->
  8100. (S1 ^operator O1916 = 0.3140215711634288)
  8101. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  8102. -->
  8103. (S1 ^operator O1915 = -0.3332708974800781)
  8104. Retracting rl*prefer*rvt*predict-yes*H0*1
  8105. -->
  8106. (S1 ^operator O1915 = 0.3804255857519139)
  8107. =>WM: (13427: S1 ^operator O1918 +)
  8108. =>WM: (13426: S1 ^operator O1917 +)
  8109. =>WM: (13425: I3 ^dir R)
  8110. =>WM: (13424: O1918 ^name predict-no)
  8111. =>WM: (13423: O1917 ^name predict-yes)
  8112. =>WM: (13422: R962 ^value 1)
  8113. =>WM: (13421: R1 ^reward R962)
  8114. <=WM: (13412: S1 ^operator O1915 +)
  8115. <=WM: (13413: S1 ^operator O1916 +)
  8116. <=WM: (13414: S1 ^operator O1916)
  8117. <=WM: (13383: I3 ^dir L)
  8118. <=WM: (13408: R1 ^reward R961)
  8119. <=WM: (13411: O1916 ^name predict-no)
  8120. <=WM: (13410: O1915 ^name predict-yes)
  8121. <=WM: (13409: R961 ^value 1)
  8122. --- Inner Elaboration Phase, active level 1 (S1) ---
  8123. Firing prefer*rvt*predict-yes*H0
  8124. -->
  8125. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8126. -->
  8127. (S1 ^operator O1917 = 0.7053811599250611)
  8128. Firing rl*prefer*rvt*predict-yes*H0*5
  8129. -->
  8130. (S1 ^operator O1917 = 0.2939886829338975)
  8131. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8132. -->
  8133. Firing prefer*rvt*predict-no*H0
  8134. -->
  8135. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8136. -->
  8137. (S1 ^operator O1918 = -0.2023211881870005)
  8138. Firing rl*prefer*rvt*predict-no*H0*6
  8139. -->
  8140. (S1 ^operator O1918 = 0.2298717920574965)
  8141. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8142. -->
  8143. inner elaboration loop at bottom goal.
  8144. Retracting rl*prefer*rvt*predict-no*H0*6
  8145. -->
  8146. (S1 ^operator O1916 = 0.2298717920574965)
  8147. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8148. -->
  8149. (S1 ^operator O1916 = -0.2023211881870005)
  8150. Retracting rl*prefer*rvt*predict-yes*H0*5
  8151. -->
  8152. (S1 ^operator O1915 = 0.2939886829338975)
  8153. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8154. -->
  8155. (S1 ^operator O1915 = 0.7053811599250611)
  8156. --- END Proposal Phase ---
  8157. --- Decision Phase ---
  8158. RL update rl*prefer*rvt*predict-no*H0*2 0.485031 -0.17101 0.314022 -> 0.485046 -0.171006 0.314041(R,m,v=1,0.85906,0.121894)
  8159. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514789 0.170962 0.685751 -> 0.514806 0.170967 0.685773(R,m,v=1,1,0)
  8160. =>WM: (13428: S1 ^operator O1917)
  8161. 959: O: O1917 (predict-yes)
  8162. --- END Decision Phase ---
  8163. --- Application Phase ---
  8164. --- Firing Productions (PE) For State At Depth 1 ---
  8165. --- Inner Elaboration Phase, active level 1 (S1) ---
  8166. Firing apply*operator
  8167. -->
  8168. (I3 ^predict-yes N959 + :O )
  8169. Firing apply*operator*complete
  8170. -->
  8171. (I3 ^predict-no N958 - :O )
  8172. inner elaboration loop at bottom goal.
  8173. --- Change Working Memory (PE) ---
  8174. =>WM: (13429: I3 ^predict-yes N959)
  8175. <=WM: (13416: N958 ^status complete)
  8176. <=WM: (13415: I3 ^predict-no N958)
  8177. --- Firing Productions (IE) For State At Depth 1 ---
  8178. --- Inner Elaboration Phase, active level 1 (S1) ---
  8179. Firing monitor*world
  8180. -->
  8181. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8182. --- Change Working Memory (IE) ---
  8183. --- END Application Phase ---
  8184. --- Output Phase ---
  8185. ENV: Agent did: predict-yes for direction R in state State-A
  8186. In State-A moving R
  8187. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8188. predict error 0
  8189. dir: dir isU
  8190. --- END Output Phase ---
  8191. -/|--- Input Phase ---
  8192. =>WM: (13433: I2 ^dir U)
  8193. =>WM: (13432: I2 ^reward 1)
  8194. =>WM: (13431: I2 ^see 1)
  8195. =>WM: (13430: N959 ^status complete)
  8196. <=WM: (13419: I2 ^dir R)
  8197. <=WM: (13418: I2 ^reward 1)
  8198. <=WM: (13417: I2 ^see 0)
  8199. =>WM: (13434: I2 ^level-1 R1-root)
  8200. <=WM: (13420: I2 ^level-1 L0-root)
  8201. --- END Input Phase ---
  8202. --- Proposal Phase ---
  8203. --- Inner Elaboration Phase, active level 1 (S1) ---
  8204. Firing elaborate*copy-see-to-output-link
  8205. -->
  8206. (I3 ^see 1 +)
  8207. Firing elaborate*reward*based*on*reward
  8208. -->
  8209. (R963 ^value 1 +)
  8210. (R1 ^reward R963 +)
  8211. Firing propose*predict-yes
  8212. -->
  8213. (O1919 ^name predict-yes +)
  8214. (S1 ^operator O1919 +)
  8215. Firing propose*predict-no
  8216. -->
  8217. (O1920 ^name predict-no +)
  8218. (S1 ^operator O1920 +)
  8219. Firing rl*prefer*rvt*predict-no*H0*4
  8220. -->
  8221. (S1 ^operator O1918 = 1.)
  8222. Firing rl*prefer*rvt*predict-yes*H0*3
  8223. -->
  8224. (S1 ^operator O1917 = 0.)
  8225. Firing prefer*rvt*predict-yes*H0
  8226. -->
  8227. Firing prefer*rvt*predict-no*H0
  8228. -->
  8229. Firing elaborate*copy-dir-to-output-link
  8230. -->
  8231. (I3 ^dir U +)
  8232. inner elaboration loop at bottom goal.
  8233. Retracting elaborate*copy-see-to-output-link
  8234. -->
  8235. (I3 ^see 0 +)
  8236. Retracting propose*predict-no
  8237. -->
  8238. (O1918 ^name predict-no +)
  8239. (S1 ^operator O1918 +)
  8240. Retracting propose*predict-yes
  8241. -->
  8242. (O1917 ^name predict-yes +)
  8243. (S1 ^operator O1917 +)
  8244. Retracting elaborate*reward*based*on*reward
  8245. -->
  8246. (R962 ^value 1 +)
  8247. (R1 ^reward R962 +)
  8248. Retracting elaborate*copy-dir-to-output-link
  8249. -->
  8250. (I3 ^dir R +)
  8251. Retracting rl*prefer*rvt*predict-no*H0*6
  8252. -->
  8253. (S1 ^operator O1918 = 0.2298717920574965)
  8254. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  8255. -->
  8256. (S1 ^operator O1918 = -0.2023211881870005)
  8257. Retracting rl*prefer*rvt*predict-yes*H0*5
  8258. -->
  8259. (S1 ^operator O1917 = 0.2939886829338975)
  8260. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8261. -->
  8262. (S1 ^operator O1917 = 0.7053811599250611)
  8263. =>WM: (13442: S1 ^operator O1920 +)
  8264. =>WM: (13441: S1 ^operator O1919 +)
  8265. =>WM: (13440: I3 ^dir U)
  8266. =>WM: (13439: O1920 ^name predict-no)
  8267. =>WM: (13438: O1919 ^name predict-yes)
  8268. =>WM: (13437: R963 ^value 1)
  8269. =>WM: (13436: R1 ^reward R963)
  8270. =>WM: (13435: I3 ^see 1)
  8271. <=WM: (13426: S1 ^operator O1917 +)
  8272. <=WM: (13428: S1 ^operator O1917)
  8273. <=WM: (13427: S1 ^operator O1918 +)
  8274. <=WM: (13425: I3 ^dir R)
  8275. <=WM: (13421: R1 ^reward R962)
  8276. <=WM: (13407: I3 ^see 0)
  8277. <=WM: (13424: O1918 ^name predict-no)
  8278. <=WM: (13423: O1917 ^name predict-yes)
  8279. <=WM: (13422: R962 ^value 1)
  8280. --- Inner Elaboration Phase, active level 1 (S1) ---
  8281. Firing prefer*rvt*predict-yes*H0
  8282. -->
  8283. Firing rl*prefer*rvt*predict-yes*H0*3
  8284. -->
  8285. (S1 ^operator O1919 = 0.)
  8286. Firing prefer*rvt*predict-no*H0
  8287. -->
  8288. Firing rl*prefer*rvt*predict-no*H0*4
  8289. -->
  8290. (S1 ^operator O1920 = 1.)
  8291. inner elaboration loop at bottom goal.
  8292. Retracting rl*prefer*rvt*predict-no*H0*4
  8293. -->
  8294. (S1 ^operator O1918 = 1.)
  8295. Retracting rl*prefer*rvt*predict-yes*H0*3
  8296. -->
  8297. (S1 ^operator O1917 = 0.)
  8298. --- END Proposal Phase ---
  8299. --- Decision Phase ---
  8300. RL update rl*prefer*rvt*predict-yes*H0*5 0.501062 -0.207073 0.293989 -> 0.50111 -0.207069 0.294041(R,m,v=1,0.836735,0.137545)
  8301. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498366 0.207015 0.705381 -> 0.498423 0.207021 0.705444(R,m,v=1,1,0)
  8302. =>WM: (13443: S1 ^operator O1920)
  8303. 960: O: O1920 (predict-no)
  8304. --- END Decision Phase ---
  8305. --- Application Phase ---
  8306. --- Firing Productions (PE) For State At Depth 1 ---
  8307. --- Inner Elaboration Phase, active level 1 (S1) ---
  8308. Firing apply*operator
  8309. -->
  8310. (I3 ^predict-no N960 + :O )
  8311. Firing apply*operator*complete
  8312. -->
  8313. (I3 ^predict-yes N959 - :O )
  8314. inner elaboration loop at bottom goal.
  8315. --- Change Working Memory (PE) ---
  8316. =>WM: (13444: I3 ^predict-no N960)
  8317. <=WM: (13430: N959 ^status complete)
  8318. <=WM: (13429: I3 ^predict-yes N959)
  8319. --- Firing Productions (IE) For State At Depth 1 ---
  8320. --- Inner Elaboration Phase, active level 1 (S1) ---
  8321. Firing monitor*world
  8322. -->
  8323. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8324. --- Change Working Memory (IE) ---
  8325. --- END Application Phase ---
  8326. --- Output Phase ---
  8327. ENV: Agent did: predict-no for direction U in state State-B
  8328. In State-B moving U
  8329. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8330. predict error 0
  8331. dir: dir isU
  8332. --- END Output Phase ---
  8333. \---- Input Phase ---
  8334. =>WM: (13448: I2 ^dir U)
  8335. =>WM: (13447: I2 ^reward 1)
  8336. =>WM: (13446: I2 ^see 0)
  8337. =>WM: (13445: N960 ^status complete)
  8338. <=WM: (13433: I2 ^dir U)
  8339. <=WM: (13432: I2 ^reward 1)
  8340. <=WM: (13431: I2 ^see 1)
  8341. =>WM: (13449: I2 ^level-1 R1-root)
  8342. <=WM: (13434: I2 ^level-1 R1-root)
  8343. --- END Input Phase ---
  8344. --- Proposal Phase ---
  8345. --- Inner Elaboration Phase, active level 1 (S1) ---
  8346. Firing elaborate*copy-see-to-output-link
  8347. -->
  8348. (I3 ^see 0 +)
  8349. Firing elaborate*reward*based*on*reward
  8350. -->
  8351. (R964 ^value 1 +)
  8352. (R1 ^reward R964 +)
  8353. Firing propose*predict-yes
  8354. -->
  8355. (O1921 ^name predict-yes +)
  8356. (S1 ^operator O1921 +)
  8357. Firing propose*predict-no
  8358. -->
  8359. (O1922 ^name predict-no +)
  8360. (S1 ^operator O1922 +)
  8361. Firing rl*prefer*rvt*predict-no*H0*4
  8362. -->
  8363. (S1 ^operator O1920 = 1.)
  8364. Firing rl*prefer*rvt*predict-yes*H0*3
  8365. -->
  8366. (S1 ^operator O1919 = 0.)
  8367. Firing prefer*rvt*predict-yes*H0
  8368. -->
  8369. Firing prefer*rvt*predict-no*H0
  8370. -->
  8371. Firing elaborate*copy-dir-to-output-link
  8372. -->
  8373. (I3 ^dir U +)
  8374. inner elaboration loop at bottom goal.
  8375. Retracting elaborate*copy-see-to-output-link
  8376. -->
  8377. (I3 ^see 1 +)
  8378. Retracting propose*predict-no
  8379. -->
  8380. (O1920 ^name predict-no +)
  8381. (S1 ^operator O1920 +)
  8382. Retracting propose*predict-yes
  8383. -->
  8384. (O1919 ^name predict-yes +)
  8385. (S1 ^operator O1919 +)
  8386. Retracting elaborate*reward*based*on*reward
  8387. -->
  8388. (R963 ^value 1 +)
  8389. (R1 ^reward R963 +)
  8390. Retracting elaborate*copy-dir-to-output-link
  8391. -->
  8392. (I3 ^dir U +)
  8393. Retracting rl*prefer*rvt*predict-no*H0*4
  8394. -->
  8395. (S1 ^operator O1920 = 1.)
  8396. Retracting rl*prefer*rvt*predict-yes*H0*3
  8397. -->
  8398. (S1 ^operator O1919 = 0.)
  8399. =>WM: (13456: S1 ^operator O1922 +)
  8400. =>WM: (13455: S1 ^operator O1921 +)
  8401. =>WM: (13454: O1922 ^name predict-no)
  8402. =>WM: (13453: O1921 ^name predict-yes)
  8403. =>WM: (13452: R964 ^value 1)
  8404. =>WM: (13451: R1 ^reward R964)
  8405. =>WM: (13450: I3 ^see 0)
  8406. <=WM: (13441: S1 ^operator O1919 +)
  8407. <=WM: (13442: S1 ^operator O1920 +)
  8408. <=WM: (13443: S1 ^operator O1920)
  8409. <=WM: (13436: R1 ^reward R963)
  8410. <=WM: (13435: I3 ^see 1)
  8411. <=WM: (13439: O1920 ^name predict-no)
  8412. <=WM: (13438: O1919 ^name predict-yes)
  8413. <=WM: (13437: R963 ^value 1)
  8414. --- Inner Elaboration Phase, active level 1 (S1) ---
  8415. Firing prefer*rvt*predict-yes*H0
  8416. -->
  8417. Firing rl*prefer*rvt*predict-yes*H0*3
  8418. -->
  8419. (S1 ^operator O1921 = 0.)
  8420. Firing prefer*rvt*predict-no*H0
  8421. -->
  8422. Firing rl*prefer*rvt*predict-no*H0*4
  8423. -->
  8424. (S1 ^operator O1922 = 1.)
  8425. inner elaboration loop at bottom goal.
  8426. Retracting rl*prefer*rvt*predict-no*H0*4
  8427. -->
  8428. (S1 ^operator O1920 = 1.)
  8429. Retracting rl*prefer*rvt*predict-yes*H0*3
  8430. -->
  8431. (S1 ^operator O1919 = 0.)
  8432. --- END Proposal Phase ---
  8433. --- Decision Phase ---
  8434. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8435. =>WM: (13457: S1 ^operator O1922)
  8436. 961: O: O1922 (predict-no)
  8437. --- END Decision Phase ---
  8438. --- Application Phase ---
  8439. --- Firing Productions (PE) For State At Depth 1 ---
  8440. --- Inner Elaboration Phase, active level 1 (S1) ---
  8441. Firing apply*operator
  8442. -->
  8443. (I3 ^predict-no N961 + :O )
  8444. Firing apply*operator*complete
  8445. -->
  8446. (I3 ^predict-no N960 - :O )
  8447. inner elaboration loop at bottom goal.
  8448. --- Change Working Memory (PE) ---
  8449. =>WM: (13458: I3 ^predict-no N961)
  8450. <=WM: (13445: N960 ^status complete)
  8451. <=WM: (13444: I3 ^predict-no N960)
  8452. --- Firing Productions (IE) For State At Depth 1 ---
  8453. --- Inner Elaboration Phase, active level 1 (S1) ---
  8454. Firing monitor*world
  8455. -->
  8456. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8457. --- Change Working Memory (IE) ---
  8458. --- END Application Phase ---
  8459. --- Output Phase ---
  8460. ENV: Agent did: predict-no for direction U in state State-B
  8461. In State-B moving U
  8462. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8463. predict error 0
  8464. dir: dir isU
  8465. --- END Output Phase ---
  8466. /--- Input Phase ---
  8467. =>WM: (13462: I2 ^dir U)
  8468. =>WM: (13461: I2 ^reward 1)
  8469. =>WM: (13460: I2 ^see 0)
  8470. =>WM: (13459: N961 ^status complete)
  8471. <=WM: (13448: I2 ^dir U)
  8472. <=WM: (13447: I2 ^reward 1)
  8473. <=WM: (13446: I2 ^see 0)
  8474. =>WM: (13463: I2 ^level-1 R1-root)
  8475. <=WM: (13449: I2 ^level-1 R1-root)
  8476. --- END Input Phase ---
  8477. --- Proposal Phase ---
  8478. --- Inner Elaboration Phase, active level 1 (S1) ---
  8479. Firing elaborate*copy-see-to-output-link
  8480. -->
  8481. (I3 ^see 0 +)
  8482. Firing elaborate*reward*based*on*reward
  8483. -->
  8484. (R965 ^value 1 +)
  8485. (R1 ^reward R965 +)
  8486. Firing propose*predict-yes
  8487. -->
  8488. (O1923 ^name predict-yes +)
  8489. (S1 ^operator O1923 +)
  8490. Firing propose*predict-no
  8491. -->
  8492. (O1924 ^name predict-no +)
  8493. (S1 ^operator O1924 +)
  8494. Firing rl*prefer*rvt*predict-no*H0*4
  8495. -->
  8496. (S1 ^operator O1922 = 1.)
  8497. Firing rl*prefer*rvt*predict-yes*H0*3
  8498. -->
  8499. (S1 ^operator O1921 = 0.)
  8500. Firing prefer*rvt*predict-yes*H0
  8501. -->
  8502. Firing prefer*rvt*predict-no*H0
  8503. -->
  8504. Firing elaborate*copy-dir-to-output-link
  8505. -->
  8506. (I3 ^dir U +)
  8507. inner elaboration loop at bottom goal.
  8508. Retracting elaborate*copy-see-to-output-link
  8509. -->
  8510. (I3 ^see 0 +)
  8511. Retracting propose*predict-no
  8512. -->
  8513. (O1922 ^name predict-no +)
  8514. (S1 ^operator O1922 +)
  8515. Retracting propose*predict-yes
  8516. -->
  8517. (O1921 ^name predict-yes +)
  8518. (S1 ^operator O1921 +)
  8519. Retracting elaborate*reward*based*on*reward
  8520. -->
  8521. (R964 ^value 1 +)
  8522. (R1 ^reward R964 +)
  8523. Retracting elaborate*copy-dir-to-output-link
  8524. -->
  8525. (I3 ^dir U +)
  8526. Retracting rl*prefer*rvt*predict-no*H0*4
  8527. -->
  8528. (S1 ^operator O1922 = 1.)
  8529. Retracting rl*prefer*rvt*predict-yes*H0*3
  8530. -->
  8531. (S1 ^operator O1921 = 0.)
  8532. =>WM: (13469: S1 ^operator O1924 +)
  8533. =>WM: (13468: S1 ^operator O1923 +)
  8534. =>WM: (13467: O1924 ^name predict-no)
  8535. =>WM: (13466: O1923 ^name predict-yes)
  8536. =>WM: (13465: R965 ^value 1)
  8537. =>WM: (13464: R1 ^reward R965)
  8538. <=WM: (13455: S1 ^operator O1921 +)
  8539. <=WM: (13456: S1 ^operator O1922 +)
  8540. <=WM: (13457: S1 ^operator O1922)
  8541. <=WM: (13451: R1 ^reward R964)
  8542. <=WM: (13454: O1922 ^name predict-no)
  8543. <=WM: (13453: O1921 ^name predict-yes)
  8544. <=WM: (13452: R964 ^value 1)
  8545. --- Inner Elaboration Phase, active level 1 (S1) ---
  8546. Firing prefer*rvt*predict-yes*H0
  8547. -->
  8548. Firing rl*prefer*rvt*predict-yes*H0*3
  8549. -->
  8550. (S1 ^operator O1923 = 0.)
  8551. Firing prefer*rvt*predict-no*H0
  8552. -->
  8553. Firing rl*prefer*rvt*predict-no*H0*4
  8554. -->
  8555. (S1 ^operator O1924 = 1.)
  8556. inner elaboration loop at bottom goal.
  8557. Retracting rl*prefer*rvt*predict-no*H0*4
  8558. -->
  8559. (S1 ^operator O1922 = 1.)
  8560. Retracting rl*prefer*rvt*predict-yes*H0*3
  8561. -->
  8562. (S1 ^operator O1921 = 0.)
  8563. --- END Proposal Phase ---
  8564. --- Decision Phase ---
  8565. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8566. =>WM: (13470: S1 ^operator O1924)
  8567. 962: O: O1924 (predict-no)
  8568. --- END Decision Phase ---
  8569. --- Application Phase ---
  8570. --- Firing Productions (PE) For State At Depth 1 ---
  8571. --- Inner Elaboration Phase, active level 1 (S1) ---
  8572. Firing apply*operator
  8573. -->
  8574. (I3 ^predict-no N962 + :O )
  8575. Firing apply*operator*complete
  8576. -->
  8577. (I3 ^predict-no N961 - :O )
  8578. inner elaboration loop at bottom goal.
  8579. --- Change Working Memory (PE) ---
  8580. =>WM: (13471: I3 ^predict-no N962)
  8581. <=WM: (13459: N961 ^status complete)
  8582. <=WM: (13458: I3 ^predict-no N961)
  8583. --- Firing Productions (IE) For State At Depth 1 ---
  8584. --- Inner Elaboration Phase, active level 1 (S1) ---
  8585. Firing monitor*world
  8586. -->
  8587. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8588. --- Change Working Memory (IE) ---
  8589. --- END Application Phase ---
  8590. --- Output Phase ---
  8591. ENV: Agent did: predict-no for direction U in state State-B
  8592. In State-B moving U
  8593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8594. predict error 0
  8595. dir: dir isU
  8596. --- END Output Phase ---
  8597. |\--- Input Phase ---
  8598. =>WM: (13475: I2 ^dir U)
  8599. =>WM: (13474: I2 ^reward 1)
  8600. =>WM: (13473: I2 ^see 0)
  8601. =>WM: (13472: N962 ^status complete)
  8602. <=WM: (13462: I2 ^dir U)
  8603. <=WM: (13461: I2 ^reward 1)
  8604. <=WM: (13460: I2 ^see 0)
  8605. =>WM: (13476: I2 ^level-1 R1-root)
  8606. <=WM: (13463: I2 ^level-1 R1-root)
  8607. --- END Input Phase ---
  8608. --- Proposal Phase ---
  8609. --- Inner Elaboration Phase, active level 1 (S1) ---
  8610. Firing elaborate*copy-see-to-output-link
  8611. -->
  8612. (I3 ^see 0 +)
  8613. Firing elaborate*reward*based*on*reward
  8614. -->
  8615. (R966 ^value 1 +)
  8616. (R1 ^reward R966 +)
  8617. Firing propose*predict-yes
  8618. -->
  8619. (O1925 ^name predict-yes +)
  8620. (S1 ^operator O1925 +)
  8621. Firing propose*predict-no
  8622. -->
  8623. (O1926 ^name predict-no +)
  8624. (S1 ^operator O1926 +)
  8625. Firing rl*prefer*rvt*predict-no*H0*4
  8626. -->
  8627. (S1 ^operator O1924 = 1.)
  8628. Firing rl*prefer*rvt*predict-yes*H0*3
  8629. -->
  8630. (S1 ^operator O1923 = 0.)
  8631. Firing prefer*rvt*predict-yes*H0
  8632. -->
  8633. Firing prefer*rvt*predict-no*H0
  8634. -->
  8635. Firing elaborate*copy-dir-to-output-link
  8636. -->
  8637. (I3 ^dir U +)
  8638. inner elaboration loop at bottom goal.
  8639. Retracting elaborate*copy-see-to-output-link
  8640. -->
  8641. (I3 ^see 0 +)
  8642. Retracting propose*predict-no
  8643. -->
  8644. (O1924 ^name predict-no +)
  8645. (S1 ^operator O1924 +)
  8646. Retracting propose*predict-yes
  8647. -->
  8648. (O1923 ^name predict-yes +)
  8649. (S1 ^operator O1923 +)
  8650. Retracting elaborate*reward*based*on*reward
  8651. -->
  8652. (R965 ^value 1 +)
  8653. (R1 ^reward R965 +)
  8654. Retracting elaborate*copy-dir-to-output-link
  8655. -->
  8656. (I3 ^dir U +)
  8657. Retracting rl*prefer*rvt*predict-no*H0*4
  8658. -->
  8659. (S1 ^operator O1924 = 1.)
  8660. Retracting rl*prefer*rvt*predict-yes*H0*3
  8661. -->
  8662. (S1 ^operator O1923 = 0.)
  8663. =>WM: (13482: S1 ^operator O1926 +)
  8664. =>WM: (13481: S1 ^operator O1925 +)
  8665. =>WM: (13480: O1926 ^name predict-no)
  8666. =>WM: (13479: O1925 ^name predict-yes)
  8667. =>WM: (13478: R966 ^value 1)
  8668. =>WM: (13477: R1 ^reward R966)
  8669. <=WM: (13468: S1 ^operator O1923 +)
  8670. <=WM: (13469: S1 ^operator O1924 +)
  8671. <=WM: (13470: S1 ^operator O1924)
  8672. <=WM: (13464: R1 ^reward R965)
  8673. <=WM: (13467: O1924 ^name predict-no)
  8674. <=WM: (13466: O1923 ^name predict-yes)
  8675. <=WM: (13465: R965 ^value 1)
  8676. --- Inner Elaboration Phase, active level 1 (S1) ---
  8677. Firing prefer*rvt*predict-yes*H0
  8678. -->
  8679. Firing rl*prefer*rvt*predict-yes*H0*3
  8680. -->
  8681. (S1 ^operator O1925 = 0.)
  8682. Firing prefer*rvt*predict-no*H0
  8683. -->
  8684. Firing rl*prefer*rvt*predict-no*H0*4
  8685. -->
  8686. (S1 ^operator O1926 = 1.)
  8687. inner elaboration loop at bottom goal.
  8688. Retracting rl*prefer*rvt*predict-no*H0*4
  8689. -->
  8690. (S1 ^operator O1924 = 1.)
  8691. Retracting rl*prefer*rvt*predict-yes*H0*3
  8692. -->
  8693. (S1 ^operator O1923 = 0.)
  8694. --- END Proposal Phase ---
  8695. --- Decision Phase ---
  8696. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8697. =>WM: (13483: S1 ^operator O1926)
  8698. 963: O: O1926 (predict-no)
  8699. --- END Decision Phase ---
  8700. --- Application Phase ---
  8701. --- Firing Productions (PE) For State At Depth 1 ---
  8702. --- Inner Elaboration Phase, active level 1 (S1) ---
  8703. Firing apply*operator
  8704. -->
  8705. (I3 ^predict-no N963 + :O )
  8706. Firing apply*operator*complete
  8707. -->
  8708. (I3 ^predict-no N962 - :O )
  8709. inner elaboration loop at bottom goal.
  8710. --- Change Working Memory (PE) ---
  8711. =>WM: (13484: I3 ^predict-no N963)
  8712. <=WM: (13472: N962 ^status complete)
  8713. <=WM: (13471: I3 ^predict-no N962)
  8714. --- Firing Productions (IE) For State At Depth 1 ---
  8715. --- Inner Elaboration Phase, active level 1 (S1) ---
  8716. Firing monitor*world
  8717. -->
  8718. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8719. --- Change Working Memory (IE) ---
  8720. --- END Application Phase ---
  8721. --- Output Phase ---
  8722. ENV: Agent did: predict-no for direction U in state State-B
  8723. In State-B moving U
  8724. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8725. predict error 0
  8726. dir: dir isL
  8727. --- END Output Phase ---
  8728. ---- Input Phase ---
  8729. =>WM: (13488: I2 ^dir L)
  8730. =>WM: (13487: I2 ^reward 1)
  8731. =>WM: (13486: I2 ^see 0)
  8732. =>WM: (13485: N963 ^status complete)
  8733. <=WM: (13475: I2 ^dir U)
  8734. <=WM: (13474: I2 ^reward 1)
  8735. <=WM: (13473: I2 ^see 0)
  8736. =>WM: (13489: I2 ^level-1 R1-root)
  8737. <=WM: (13476: I2 ^level-1 R1-root)
  8738. --- END Input Phase ---
  8739. --- Proposal Phase ---
  8740. --- Inner Elaboration Phase, active level 1 (S1) ---
  8741. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8742. -->
  8743. (S1 ^operator O1925 = 0.619629119351056)
  8744. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8745. -->
  8746. (S1 ^operator O1926 = -0.1479504104026684)
  8747. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8748. -->
  8749. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8750. -->
  8751. Firing elaborate*copy-see-to-output-link
  8752. -->
  8753. (I3 ^see 0 +)
  8754. Firing elaborate*reward*based*on*reward
  8755. -->
  8756. (R967 ^value 1 +)
  8757. (R1 ^reward R967 +)
  8758. Firing propose*predict-yes
  8759. -->
  8760. (O1927 ^name predict-yes +)
  8761. (S1 ^operator O1927 +)
  8762. Firing propose*predict-no
  8763. -->
  8764. (O1928 ^name predict-no +)
  8765. (S1 ^operator O1928 +)
  8766. Firing rl*prefer*rvt*predict-no*H0*2
  8767. -->
  8768. (S1 ^operator O1926 = 0.3140405292214645)
  8769. Firing rl*prefer*rvt*predict-yes*H0*1
  8770. -->
  8771. (S1 ^operator O1925 = 0.3804255857519139)
  8772. Firing prefer*rvt*predict-yes*H0
  8773. -->
  8774. Firing prefer*rvt*predict-no*H0
  8775. -->
  8776. Firing elaborate*copy-dir-to-output-link
  8777. -->
  8778. (I3 ^dir L +)
  8779. inner elaboration loop at bottom goal.
  8780. Retracting elaborate*copy-see-to-output-link
  8781. -->
  8782. (I3 ^see 0 +)
  8783. Retracting propose*predict-no
  8784. -->
  8785. (O1926 ^name predict-no +)
  8786. (S1 ^operator O1926 +)
  8787. Retracting propose*predict-yes
  8788. -->
  8789. (O1925 ^name predict-yes +)
  8790. (S1 ^operator O1925 +)
  8791. Retracting elaborate*reward*based*on*reward
  8792. -->
  8793. (R966 ^value 1 +)
  8794. (R1 ^reward R966 +)
  8795. Retracting elaborate*copy-dir-to-output-link
  8796. -->
  8797. (I3 ^dir U +)
  8798. Retracting rl*prefer*rvt*predict-no*H0*4
  8799. -->
  8800. (S1 ^operator O1926 = 1.)
  8801. Retracting rl*prefer*rvt*predict-yes*H0*3
  8802. -->
  8803. (S1 ^operator O1925 = 0.)
  8804. =>WM: (13496: S1 ^operator O1928 +)
  8805. =>WM: (13495: S1 ^operator O1927 +)
  8806. =>WM: (13494: I3 ^dir L)
  8807. =>WM: (13493: O1928 ^name predict-no)
  8808. =>WM: (13492: O1927 ^name predict-yes)
  8809. =>WM: (13491: R967 ^value 1)
  8810. =>WM: (13490: R1 ^reward R967)
  8811. <=WM: (13481: S1 ^operator O1925 +)
  8812. <=WM: (13482: S1 ^operator O1926 +)
  8813. <=WM: (13483: S1 ^operator O1926)
  8814. <=WM: (13440: I3 ^dir U)
  8815. <=WM: (13477: R1 ^reward R966)
  8816. <=WM: (13480: O1926 ^name predict-no)
  8817. <=WM: (13479: O1925 ^name predict-yes)
  8818. <=WM: (13478: R966 ^value 1)
  8819. --- Inner Elaboration Phase, active level 1 (S1) ---
  8820. Firing prefer*rvt*predict-yes*H0
  8821. -->
  8822. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8823. -->
  8824. (S1 ^operator O1927 = 0.619629119351056)
  8825. Firing rl*prefer*rvt*predict-yes*H0*1
  8826. -->
  8827. (S1 ^operator O1927 = 0.3804255857519139)
  8828. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8829. -->
  8830. Firing prefer*rvt*predict-no*H0
  8831. -->
  8832. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8833. -->
  8834. (S1 ^operator O1928 = -0.1479504104026684)
  8835. Firing rl*prefer*rvt*predict-no*H0*2
  8836. -->
  8837. (S1 ^operator O1928 = 0.3140405292214645)
  8838. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8839. -->
  8840. inner elaboration loop at bottom goal.
  8841. Retracting rl*prefer*rvt*predict-no*H0*2
  8842. -->
  8843. (S1 ^operator O1926 = 0.3140405292214645)
  8844. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8845. -->
  8846. (S1 ^operator O1926 = -0.1479504104026684)
  8847. Retracting rl*prefer*rvt*predict-yes*H0*1
  8848. -->
  8849. (S1 ^operator O1925 = 0.3804255857519139)
  8850. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8851. -->
  8852. (S1 ^operator O1925 = 0.619629119351056)
  8853. --- END Proposal Phase ---
  8854. --- Decision Phase ---
  8855. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8856. =>WM: (13497: S1 ^operator O1927)
  8857. 964: O: O1927 (predict-yes)
  8858. --- END Decision Phase ---
  8859. --- Application Phase ---
  8860. --- Firing Productions (PE) For State At Depth 1 ---
  8861. --- Inner Elaboration Phase, active level 1 (S1) ---
  8862. Firing apply*operator
  8863. -->
  8864. (I3 ^predict-yes N964 + :O )
  8865. Firing apply*operator*complete
  8866. -->
  8867. (I3 ^predict-no N963 - :O )
  8868. inner elaboration loop at bottom goal.
  8869. --- Change Working Memory (PE) ---
  8870. =>WM: (13498: I3 ^predict-yes N964)
  8871. <=WM: (13485: N963 ^status complete)
  8872. <=WM: (13484: I3 ^predict-no N963)
  8873. --- Firing Productions (IE) For State At Depth 1 ---
  8874. --- Inner Elaboration Phase, active level 1 (S1) ---
  8875. Firing monitor*world
  8876. -->
  8877. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8878. --- Change Working Memory (IE) ---
  8879. --- END Application Phase ---
  8880. --- Output Phase ---
  8881. ENV: Agent did: predict-yes for direction L in state State-B
  8882. In State-B moving L
  8883. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8884. predict error 0
  8885. dir: dir isR
  8886. --- END Output Phase ---
  8887. /|\--- Input Phase ---
  8888. =>WM: (13502: I2 ^dir R)
  8889. =>WM: (13501: I2 ^reward 1)
  8890. =>WM: (13500: I2 ^see 1)
  8891. =>WM: (13499: N964 ^status complete)
  8892. <=WM: (13488: I2 ^dir L)
  8893. <=WM: (13487: I2 ^reward 1)
  8894. <=WM: (13486: I2 ^see 0)
  8895. =>WM: (13503: I2 ^level-1 L1-root)
  8896. <=WM: (13489: I2 ^level-1 R1-root)
  8897. --- END Input Phase ---
  8898. --- Proposal Phase ---
  8899. --- Inner Elaboration Phase, active level 1 (S1) ---
  8900. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  8901. -->
  8902. (S1 ^operator O1927 = 0.7065565782519569)
  8903. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  8904. -->
  8905. (S1 ^operator O1928 = -0.1937987592593187)
  8906. Firing prefer*rvt*predict-no*H0*6*v1*H1
  8907. -->
  8908. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8909. -->
  8910. Firing elaborate*copy-see-to-output-link
  8911. -->
  8912. (I3 ^see 1 +)
  8913. Firing elaborate*reward*based*on*reward
  8914. -->
  8915. (R968 ^value 1 +)
  8916. (R1 ^reward R968 +)
  8917. Firing propose*predict-yes
  8918. -->
  8919. (O1929 ^name predict-yes +)
  8920. (S1 ^operator O1929 +)
  8921. Firing propose*predict-no
  8922. -->
  8923. (O1930 ^name predict-no +)
  8924. (S1 ^operator O1930 +)
  8925. Firing rl*prefer*rvt*predict-no*H0*6
  8926. -->
  8927. (S1 ^operator O1928 = 0.2298717920574965)
  8928. Firing rl*prefer*rvt*predict-yes*H0*5
  8929. -->
  8930. (S1 ^operator O1927 = 0.2940412798984666)
  8931. Firing prefer*rvt*predict-yes*H0
  8932. -->
  8933. Firing prefer*rvt*predict-no*H0
  8934. -->
  8935. Firing elaborate*copy-dir-to-output-link
  8936. -->
  8937. (I3 ^dir R +)
  8938. inner elaboration loop at bottom goal.
  8939. Retracting elaborate*copy-see-to-output-link
  8940. -->
  8941. (I3 ^see 0 +)
  8942. Retracting propose*predict-no
  8943. -->
  8944. (O1928 ^name predict-no +)
  8945. (S1 ^operator O1928 +)
  8946. Retracting propose*predict-yes
  8947. -->
  8948. (O1927 ^name predict-yes +)
  8949. (S1 ^operator O1927 +)
  8950. Retracting elaborate*reward*based*on*reward
  8951. -->
  8952. (R967 ^value 1 +)
  8953. (R1 ^reward R967 +)
  8954. Retracting elaborate*copy-dir-to-output-link
  8955. -->
  8956. (I3 ^dir L +)
  8957. Retracting rl*prefer*rvt*predict-no*H0*2
  8958. -->
  8959. (S1 ^operator O1928 = 0.3140405292214645)
  8960. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  8961. -->
  8962. (S1 ^operator O1928 = -0.1479504104026684)
  8963. Retracting rl*prefer*rvt*predict-yes*H0*1
  8964. -->
  8965. (S1 ^operator O1927 = 0.3804255857519139)
  8966. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  8967. -->
  8968. (S1 ^operator O1927 = 0.619629119351056)
  8969. =>WM: (13511: S1 ^operator O1930 +)
  8970. =>WM: (13510: S1 ^operator O1929 +)
  8971. =>WM: (13509: I3 ^dir R)
  8972. =>WM: (13508: O1930 ^name predict-no)
  8973. =>WM: (13507: O1929 ^name predict-yes)
  8974. =>WM: (13506: R968 ^value 1)
  8975. =>WM: (13505: R1 ^reward R968)
  8976. =>WM: (13504: I3 ^see 1)
  8977. <=WM: (13495: S1 ^operator O1927 +)
  8978. <=WM: (13497: S1 ^operator O1927)
  8979. <=WM: (13496: S1 ^operator O1928 +)
  8980. <=WM: (13494: I3 ^dir L)
  8981. <=WM: (13490: R1 ^reward R967)
  8982. <=WM: (13450: I3 ^see 0)
  8983. <=WM: (13493: O1928 ^name predict-no)
  8984. <=WM: (13492: O1927 ^name predict-yes)
  8985. <=WM: (13491: R967 ^value 1)
  8986. --- Inner Elaboration Phase, active level 1 (S1) ---
  8987. Firing prefer*rvt*predict-yes*H0
  8988. -->
  8989. Firing rl*prefer*rvt*predict-yes*H0*5
  8990. -->
  8991. (S1 ^operator O1929 = 0.2940412798984666)
  8992. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8993. -->
  8994. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  8995. -->
  8996. (S1 ^operator O1929 = 0.7065565782519569)
  8997. Firing prefer*rvt*predict-no*H0
  8998. -->
  8999. Firing rl*prefer*rvt*predict-no*H0*6
  9000. -->
  9001. (S1 ^operator O1930 = 0.2298717920574965)
  9002. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9003. -->
  9004. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9005. -->
  9006. (S1 ^operator O1930 = -0.1937987592593187)
  9007. inner elaboration loop at bottom goal.
  9008. Retracting rl*prefer*rvt*predict-no*H0*6
  9009. -->
  9010. (S1 ^operator O1928 = 0.2298717920574965)
  9011. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9012. -->
  9013. (S1 ^operator O1928 = -0.1937987592593187)
  9014. Retracting rl*prefer*rvt*predict-yes*H0*5
  9015. -->
  9016. (S1 ^operator O1927 = 0.2940412798984666)
  9017. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9018. -->
  9019. (S1 ^operator O1927 = 0.7065565782519569)
  9020. --- END Proposal Phase ---
  9021. --- Decision Phase ---
  9022. RL update rl*prefer*rvt*predict-yes*H0*1 0.521357 -0.140931 0.380426 -> 0.521352 -0.140931 0.380421(R,m,v=1,0.821656,0.147477)
  9023. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478703 0.140926 0.619629 -> 0.478697 0.140926 0.619624(R,m,v=1,1,0)
  9024. =>WM: (13512: S1 ^operator O1929)
  9025. 965: O: O1929 (predict-yes)
  9026. --- END Decision Phase ---
  9027. --- Application Phase ---
  9028. --- Firing Productions (PE) For State At Depth 1 ---
  9029. --- Inner Elaboration Phase, active level 1 (S1) ---
  9030. Firing apply*operator
  9031. -->
  9032. (I3 ^predict-yes N965 + :O )
  9033. Firing apply*operator*complete
  9034. -->
  9035. (I3 ^predict-yes N964 - :O )
  9036. inner elaboration loop at bottom goal.
  9037. --- Change Working Memory (PE) ---
  9038. =>WM: (13513: I3 ^predict-yes N965)
  9039. <=WM: (13499: N964 ^status complete)
  9040. <=WM: (13498: I3 ^predict-yes N964)
  9041. --- Firing Productions (IE) For State At Depth 1 ---
  9042. --- Inner Elaboration Phase, active level 1 (S1) ---
  9043. Firing monitor*world
  9044. -->
  9045. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9046. --- Change Working Memory (IE) ---
  9047. --- END Application Phase ---
  9048. --- Output Phase ---
  9049. ENV: Agent did: predict-yes for direction R in state State-A
  9050. In State-A moving R
  9051. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9052. predict error 0
  9053. dir: dir isU
  9054. --- END Output Phase ---
  9055. -/|--- Input Phase ---
  9056. =>WM: (13517: I2 ^dir U)
  9057. =>WM: (13516: I2 ^reward 1)
  9058. =>WM: (13515: I2 ^see 1)
  9059. =>WM: (13514: N965 ^status complete)
  9060. <=WM: (13502: I2 ^dir R)
  9061. <=WM: (13501: I2 ^reward 1)
  9062. <=WM: (13500: I2 ^see 1)
  9063. =>WM: (13518: I2 ^level-1 R1-root)
  9064. <=WM: (13503: I2 ^level-1 L1-root)
  9065. --- END Input Phase ---
  9066. --- Proposal Phase ---
  9067. --- Inner Elaboration Phase, active level 1 (S1) ---
  9068. Firing elaborate*copy-see-to-output-link
  9069. -->
  9070. (I3 ^see 1 +)
  9071. Firing elaborate*reward*based*on*reward
  9072. -->
  9073. (R969 ^value 1 +)
  9074. (R1 ^reward R969 +)
  9075. Firing propose*predict-yes
  9076. -->
  9077. (O1931 ^name predict-yes +)
  9078. (S1 ^operator O1931 +)
  9079. Firing propose*predict-no
  9080. -->
  9081. (O1932 ^name predict-no +)
  9082. (S1 ^operator O1932 +)
  9083. Firing rl*prefer*rvt*predict-no*H0*4
  9084. -->
  9085. (S1 ^operator O1930 = 1.)
  9086. Firing rl*prefer*rvt*predict-yes*H0*3
  9087. -->
  9088. (S1 ^operator O1929 = 0.)
  9089. Firing prefer*rvt*predict-yes*H0
  9090. -->
  9091. Firing prefer*rvt*predict-no*H0
  9092. -->
  9093. Firing elaborate*copy-dir-to-output-link
  9094. -->
  9095. (I3 ^dir U +)
  9096. inner elaboration loop at bottom goal.
  9097. Retracting elaborate*copy-see-to-output-link
  9098. -->
  9099. (I3 ^see 1 +)
  9100. Retracting propose*predict-no
  9101. -->
  9102. (O1930 ^name predict-no +)
  9103. (S1 ^operator O1930 +)
  9104. Retracting propose*predict-yes
  9105. -->
  9106. (O1929 ^name predict-yes +)
  9107. (S1 ^operator O1929 +)
  9108. Retracting elaborate*reward*based*on*reward
  9109. -->
  9110. (R968 ^value 1 +)
  9111. (R1 ^reward R968 +)
  9112. Retracting elaborate*copy-dir-to-output-link
  9113. -->
  9114. (I3 ^dir R +)
  9115. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9116. -->
  9117. (S1 ^operator O1930 = -0.1937987592593187)
  9118. Retracting rl*prefer*rvt*predict-no*H0*6
  9119. -->
  9120. (S1 ^operator O1930 = 0.2298717920574965)
  9121. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9122. -->
  9123. (S1 ^operator O1929 = 0.7065565782519569)
  9124. Retracting rl*prefer*rvt*predict-yes*H0*5
  9125. -->
  9126. (S1 ^operator O1929 = 0.2940412798984666)
  9127. =>WM: (13525: S1 ^operator O1932 +)
  9128. =>WM: (13524: S1 ^operator O1931 +)
  9129. =>WM: (13523: I3 ^dir U)
  9130. =>WM: (13522: O1932 ^name predict-no)
  9131. =>WM: (13521: O1931 ^name predict-yes)
  9132. =>WM: (13520: R969 ^value 1)
  9133. =>WM: (13519: R1 ^reward R969)
  9134. <=WM: (13510: S1 ^operator O1929 +)
  9135. <=WM: (13512: S1 ^operator O1929)
  9136. <=WM: (13511: S1 ^operator O1930 +)
  9137. <=WM: (13509: I3 ^dir R)
  9138. <=WM: (13505: R1 ^reward R968)
  9139. <=WM: (13508: O1930 ^name predict-no)
  9140. <=WM: (13507: O1929 ^name predict-yes)
  9141. <=WM: (13506: R968 ^value 1)
  9142. --- Inner Elaboration Phase, active level 1 (S1) ---
  9143. Firing prefer*rvt*predict-yes*H0
  9144. -->
  9145. Firing rl*prefer*rvt*predict-yes*H0*3
  9146. -->
  9147. (S1 ^operator O1931 = 0.)
  9148. Firing prefer*rvt*predict-no*H0
  9149. -->
  9150. Firing rl*prefer*rvt*predict-no*H0*4
  9151. -->
  9152. (S1 ^operator O1932 = 1.)
  9153. inner elaboration loop at bottom goal.
  9154. Retracting rl*prefer*rvt*predict-no*H0*4
  9155. -->
  9156. (S1 ^operator O1930 = 1.)
  9157. Retracting rl*prefer*rvt*predict-yes*H0*3
  9158. -->
  9159. (S1 ^operator O1929 = 0.)
  9160. --- END Proposal Phase ---
  9161. --- Decision Phase ---
  9162. RL update rl*prefer*rvt*predict-yes*H0*5 0.50111 -0.207069 0.294041 -> 0.501065 -0.207074 0.293991(R,m,v=1,0.837838,0.13679)
  9163. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499427 0.207129 0.706557 -> 0.499374 0.207123 0.706498(R,m,v=1,1,0)
  9164. =>WM: (13526: S1 ^operator O1932)
  9165. 966: O: O1932 (predict-no)
  9166. --- END Decision Phase ---
  9167. --- Application Phase ---
  9168. --- Firing Productions (PE) For State At Depth 1 ---
  9169. --- Inner Elaboration Phase, active level 1 (S1) ---
  9170. Firing apply*operator
  9171. -->
  9172. (I3 ^predict-no N966 + :O )
  9173. Firing apply*operator*complete
  9174. -->
  9175. (I3 ^predict-yes N965 - :O )
  9176. inner elaboration loop at bottom goal.
  9177. --- Change Working Memory (PE) ---
  9178. =>WM: (13527: I3 ^predict-no N966)
  9179. <=WM: (13514: N965 ^status complete)
  9180. <=WM: (13513: I3 ^predict-yes N965)
  9181. --- Firing Productions (IE) For State At Depth 1 ---
  9182. --- Inner Elaboration Phase, active level 1 (S1) ---
  9183. Firing monitor*world
  9184. -->
  9185. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9186. --- Change Working Memory (IE) ---
  9187. --- END Application Phase ---
  9188. --- Output Phase ---
  9189. ENV: Agent did: predict-no for direction U in state State-B
  9190. In State-B moving U
  9191. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9192. predict error 0
  9193. dir: dir isL
  9194. --- END Output Phase ---
  9195. \-/--- Input Phase ---
  9196. =>WM: (13531: I2 ^dir L)
  9197. =>WM: (13530: I2 ^reward 1)
  9198. =>WM: (13529: I2 ^see 0)
  9199. =>WM: (13528: N966 ^status complete)
  9200. <=WM: (13517: I2 ^dir U)
  9201. <=WM: (13516: I2 ^reward 1)
  9202. <=WM: (13515: I2 ^see 1)
  9203. =>WM: (13532: I2 ^level-1 R1-root)
  9204. <=WM: (13518: I2 ^level-1 R1-root)
  9205. --- END Input Phase ---
  9206. --- Proposal Phase ---
  9207. --- Inner Elaboration Phase, active level 1 (S1) ---
  9208. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9209. -->
  9210. (S1 ^operator O1931 = 0.6196238010864294)
  9211. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9212. -->
  9213. (S1 ^operator O1932 = -0.1479504104026684)
  9214. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9215. -->
  9216. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9217. -->
  9218. Firing elaborate*copy-see-to-output-link
  9219. -->
  9220. (I3 ^see 0 +)
  9221. Firing elaborate*reward*based*on*reward
  9222. -->
  9223. (R970 ^value 1 +)
  9224. (R1 ^reward R970 +)
  9225. Firing propose*predict-yes
  9226. -->
  9227. (O1933 ^name predict-yes +)
  9228. (S1 ^operator O1933 +)
  9229. Firing propose*predict-no
  9230. -->
  9231. (O1934 ^name predict-no +)
  9232. (S1 ^operator O1934 +)
  9233. Firing rl*prefer*rvt*predict-no*H0*2
  9234. -->
  9235. (S1 ^operator O1932 = 0.3140405292214645)
  9236. Firing rl*prefer*rvt*predict-yes*H0*1
  9237. -->
  9238. (S1 ^operator O1931 = 0.380421069331616)
  9239. Firing prefer*rvt*predict-yes*H0
  9240. -->
  9241. Firing prefer*rvt*predict-no*H0
  9242. -->
  9243. Firing elaborate*copy-dir-to-output-link
  9244. -->
  9245. (I3 ^dir L +)
  9246. inner elaboration loop at bottom goal.
  9247. Retracting elaborate*copy-see-to-output-link
  9248. -->
  9249. (I3 ^see 1 +)
  9250. Retracting propose*predict-no
  9251. -->
  9252. (O1932 ^name predict-no +)
  9253. (S1 ^operator O1932 +)
  9254. Retracting propose*predict-yes
  9255. -->
  9256. (O1931 ^name predict-yes +)
  9257. (S1 ^operator O1931 +)
  9258. Retracting elaborate*reward*based*on*reward
  9259. -->
  9260. (R969 ^value 1 +)
  9261. (R1 ^reward R969 +)
  9262. Retracting elaborate*copy-dir-to-output-link
  9263. -->
  9264. (I3 ^dir U +)
  9265. Retracting rl*prefer*rvt*predict-no*H0*4
  9266. -->
  9267. (S1 ^operator O1932 = 1.)
  9268. Retracting rl*prefer*rvt*predict-yes*H0*3
  9269. -->
  9270. (S1 ^operator O1931 = 0.)
  9271. =>WM: (13540: S1 ^operator O1934 +)
  9272. =>WM: (13539: S1 ^operator O1933 +)
  9273. =>WM: (13538: I3 ^dir L)
  9274. =>WM: (13537: O1934 ^name predict-no)
  9275. =>WM: (13536: O1933 ^name predict-yes)
  9276. =>WM: (13535: R970 ^value 1)
  9277. =>WM: (13534: R1 ^reward R970)
  9278. =>WM: (13533: I3 ^see 0)
  9279. <=WM: (13524: S1 ^operator O1931 +)
  9280. <=WM: (13525: S1 ^operator O1932 +)
  9281. <=WM: (13526: S1 ^operator O1932)
  9282. <=WM: (13523: I3 ^dir U)
  9283. <=WM: (13519: R1 ^reward R969)
  9284. <=WM: (13504: I3 ^see 1)
  9285. <=WM: (13522: O1932 ^name predict-no)
  9286. <=WM: (13521: O1931 ^name predict-yes)
  9287. <=WM: (13520: R969 ^value 1)
  9288. --- Inner Elaboration Phase, active level 1 (S1) ---
  9289. Firing prefer*rvt*predict-yes*H0
  9290. -->
  9291. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9292. -->
  9293. (S1 ^operator O1933 = 0.6196238010864294)
  9294. Firing rl*prefer*rvt*predict-yes*H0*1
  9295. -->
  9296. (S1 ^operator O1933 = 0.380421069331616)
  9297. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9298. -->
  9299. Firing prefer*rvt*predict-no*H0
  9300. -->
  9301. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9302. -->
  9303. (S1 ^operator O1934 = -0.1479504104026684)
  9304. Firing rl*prefer*rvt*predict-no*H0*2
  9305. -->
  9306. (S1 ^operator O1934 = 0.3140405292214645)
  9307. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9308. -->
  9309. inner elaboration loop at bottom goal.
  9310. Retracting rl*prefer*rvt*predict-no*H0*2
  9311. -->
  9312. (S1 ^operator O1932 = 0.3140405292214645)
  9313. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9314. -->
  9315. (S1 ^operator O1932 = -0.1479504104026684)
  9316. Retracting rl*prefer*rvt*predict-yes*H0*1
  9317. -->
  9318. (S1 ^operator O1931 = 0.380421069331616)
  9319. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9320. -->
  9321. (S1 ^operator O1931 = 0.6196238010864294)
  9322. --- END Proposal Phase ---
  9323. --- Decision Phase ---
  9324. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9325. =>WM: (13541: S1 ^operator O1933)
  9326. 967: O: O1933 (predict-yes)
  9327. --- END Decision Phase ---
  9328. --- Application Phase ---
  9329. --- Firing Productions (PE) For State At Depth 1 ---
  9330. --- Inner Elaboration Phase, active level 1 (S1) ---
  9331. Firing apply*operator
  9332. -->
  9333. (I3 ^predict-yes N967 + :O )
  9334. Firing apply*operator*complete
  9335. -->
  9336. (I3 ^predict-no N966 - :O )
  9337. inner elaboration loop at bottom goal.
  9338. --- Change Working Memory (PE) ---
  9339. =>WM: (13542: I3 ^predict-yes N967)
  9340. <=WM: (13528: N966 ^status complete)
  9341. <=WM: (13527: I3 ^predict-no N966)
  9342. --- Firing Productions (IE) For State At Depth 1 ---
  9343. --- Inner Elaboration Phase, active level 1 (S1) ---
  9344. Firing monitor*world
  9345. -->
  9346. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9347. --- Change Working Memory (IE) ---
  9348. --- END Application Phase ---
  9349. --- Output Phase ---
  9350. ENV: Agent did: predict-yes for direction L in state State-B
  9351. In State-B moving L
  9352. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9353. predict error 0
  9354. dir: dir isR
  9355. --- END Output Phase ---
  9356. |\---- Input Phase ---
  9357. =>WM: (13546: I2 ^dir R)
  9358. =>WM: (13545: I2 ^reward 1)
  9359. =>WM: (13544: I2 ^see 1)
  9360. =>WM: (13543: N967 ^status complete)
  9361. <=WM: (13531: I2 ^dir L)
  9362. <=WM: (13530: I2 ^reward 1)
  9363. <=WM: (13529: I2 ^see 0)
  9364. =>WM: (13547: I2 ^level-1 L1-root)
  9365. <=WM: (13532: I2 ^level-1 R1-root)
  9366. --- END Input Phase ---
  9367. --- Proposal Phase ---
  9368. --- Inner Elaboration Phase, active level 1 (S1) ---
  9369. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9370. -->
  9371. (S1 ^operator O1933 = 0.7064977054068989)
  9372. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9373. -->
  9374. (S1 ^operator O1934 = -0.1937987592593187)
  9375. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9376. -->
  9377. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9378. -->
  9379. Firing elaborate*copy-see-to-output-link
  9380. -->
  9381. (I3 ^see 1 +)
  9382. Firing elaborate*reward*based*on*reward
  9383. -->
  9384. (R971 ^value 1 +)
  9385. (R1 ^reward R971 +)
  9386. Firing propose*predict-yes
  9387. -->
  9388. (O1935 ^name predict-yes +)
  9389. (S1 ^operator O1935 +)
  9390. Firing propose*predict-no
  9391. -->
  9392. (O1936 ^name predict-no +)
  9393. (S1 ^operator O1936 +)
  9394. Firing rl*prefer*rvt*predict-no*H0*6
  9395. -->
  9396. (S1 ^operator O1934 = 0.2298717920574965)
  9397. Firing rl*prefer*rvt*predict-yes*H0*5
  9398. -->
  9399. (S1 ^operator O1933 = 0.2939914352270483)
  9400. Firing prefer*rvt*predict-yes*H0
  9401. -->
  9402. Firing prefer*rvt*predict-no*H0
  9403. -->
  9404. Firing elaborate*copy-dir-to-output-link
  9405. -->
  9406. (I3 ^dir R +)
  9407. inner elaboration loop at bottom goal.
  9408. Retracting elaborate*copy-see-to-output-link
  9409. -->
  9410. (I3 ^see 0 +)
  9411. Retracting propose*predict-no
  9412. -->
  9413. (O1934 ^name predict-no +)
  9414. (S1 ^operator O1934 +)
  9415. Retracting propose*predict-yes
  9416. -->
  9417. (O1933 ^name predict-yes +)
  9418. (S1 ^operator O1933 +)
  9419. Retracting elaborate*reward*based*on*reward
  9420. -->
  9421. (R970 ^value 1 +)
  9422. (R1 ^reward R970 +)
  9423. Retracting elaborate*copy-dir-to-output-link
  9424. -->
  9425. (I3 ^dir L +)
  9426. Retracting rl*prefer*rvt*predict-no*H0*2
  9427. -->
  9428. (S1 ^operator O1934 = 0.3140405292214645)
  9429. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9430. -->
  9431. (S1 ^operator O1934 = -0.1479504104026684)
  9432. Retracting rl*prefer*rvt*predict-yes*H0*1
  9433. -->
  9434. (S1 ^operator O1933 = 0.380421069331616)
  9435. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9436. -->
  9437. (S1 ^operator O1933 = 0.6196238010864294)
  9438. =>WM: (13555: S1 ^operator O1936 +)
  9439. =>WM: (13554: S1 ^operator O1935 +)
  9440. =>WM: (13553: I3 ^dir R)
  9441. =>WM: (13552: O1936 ^name predict-no)
  9442. =>WM: (13551: O1935 ^name predict-yes)
  9443. =>WM: (13550: R971 ^value 1)
  9444. =>WM: (13549: R1 ^reward R971)
  9445. =>WM: (13548: I3 ^see 1)
  9446. <=WM: (13539: S1 ^operator O1933 +)
  9447. <=WM: (13541: S1 ^operator O1933)
  9448. <=WM: (13540: S1 ^operator O1934 +)
  9449. <=WM: (13538: I3 ^dir L)
  9450. <=WM: (13534: R1 ^reward R970)
  9451. <=WM: (13533: I3 ^see 0)
  9452. <=WM: (13537: O1934 ^name predict-no)
  9453. <=WM: (13536: O1933 ^name predict-yes)
  9454. <=WM: (13535: R970 ^value 1)
  9455. --- Inner Elaboration Phase, active level 1 (S1) ---
  9456. Firing prefer*rvt*predict-yes*H0
  9457. -->
  9458. Firing rl*prefer*rvt*predict-yes*H0*5
  9459. -->
  9460. (S1 ^operator O1935 = 0.2939914352270483)
  9461. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9462. -->
  9463. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9464. -->
  9465. (S1 ^operator O1935 = 0.7064977054068989)
  9466. Firing prefer*rvt*predict-no*H0
  9467. -->
  9468. Firing rl*prefer*rvt*predict-no*H0*6
  9469. -->
  9470. (S1 ^operator O1936 = 0.2298717920574965)
  9471. Firing prefer*rvt*predict-no*H0*6*v1*H1
  9472. -->
  9473. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9474. -->
  9475. (S1 ^operator O1936 = -0.1937987592593187)
  9476. inner elaboration loop at bottom goal.
  9477. Retracting rl*prefer*rvt*predict-no*H0*6
  9478. -->
  9479. (S1 ^operator O1934 = 0.2298717920574965)
  9480. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9481. -->
  9482. (S1 ^operator O1934 = -0.1937987592593187)
  9483. Retracting rl*prefer*rvt*predict-yes*H0*5
  9484. -->
  9485. (S1 ^operator O1933 = 0.2939914352270483)
  9486. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9487. -->
  9488. (S1 ^operator O1933 = 0.7064977054068989)
  9489. --- END Proposal Phase ---
  9490. --- Decision Phase ---
  9491. RL update rl*prefer*rvt*predict-yes*H0*1 0.521352 -0.140931 0.380421 -> 0.521348 -0.14093 0.380417(R,m,v=1,0.822785,0.146739)
  9492. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478697 0.140926 0.619624 -> 0.478693 0.140927 0.619619(R,m,v=1,1,0)
  9493. =>WM: (13556: S1 ^operator O1935)
  9494. 968: O: O1935 (predict-yes)
  9495. --- END Decision Phase ---
  9496. --- Application Phase ---
  9497. --- Firing Productions (PE) For State At Depth 1 ---
  9498. --- Inner Elaboration Phase, active level 1 (S1) ---
  9499. Firing apply*operator
  9500. -->
  9501. (I3 ^predict-yes N968 + :O )
  9502. Firing apply*operator*complete
  9503. -->
  9504. (I3 ^predict-yes N967 - :O )
  9505. inner elaboration loop at bottom goal.
  9506. --- Change Working Memory (PE) ---
  9507. =>WM: (13557: I3 ^predict-yes N968)
  9508. <=WM: (13543: N967 ^status complete)
  9509. <=WM: (13542: I3 ^predict-yes N967)
  9510. --- Firing Productions (IE) For State At Depth 1 ---
  9511. --- Inner Elaboration Phase, active level 1 (S1) ---
  9512. Firing monitor*world
  9513. -->
  9514. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9515. --- Change Working Memory (IE) ---
  9516. --- END Application Phase ---
  9517. --- Output Phase ---
  9518. ENV: Agent did: predict-yes for direction R in state State-A
  9519. In State-A moving R
  9520. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9521. predict error 0
  9522. dir: dir isU
  9523. --- END Output Phase ---
  9524. /|\--- Input Phase ---
  9525. =>WM: (13561: I2 ^dir U)
  9526. =>WM: (13560: I2 ^reward 1)
  9527. =>WM: (13559: I2 ^see 1)
  9528. =>WM: (13558: N968 ^status complete)
  9529. <=WM: (13546: I2 ^dir R)
  9530. <=WM: (13545: I2 ^reward 1)
  9531. <=WM: (13544: I2 ^see 1)
  9532. =>WM: (13562: I2 ^level-1 R1-root)
  9533. <=WM: (13547: I2 ^level-1 L1-root)
  9534. --- END Input Phase ---
  9535. --- Proposal Phase ---
  9536. --- Inner Elaboration Phase, active level 1 (S1) ---
  9537. Firing elaborate*copy-see-to-output-link
  9538. -->
  9539. (I3 ^see 1 +)
  9540. Firing elaborate*reward*based*on*reward
  9541. -->
  9542. (R972 ^value 1 +)
  9543. (R1 ^reward R972 +)
  9544. Firing propose*predict-yes
  9545. -->
  9546. (O1937 ^name predict-yes +)
  9547. (S1 ^operator O1937 +)
  9548. Firing propose*predict-no
  9549. -->
  9550. (O1938 ^name predict-no +)
  9551. (S1 ^operator O1938 +)
  9552. Firing rl*prefer*rvt*predict-no*H0*4
  9553. -->
  9554. (S1 ^operator O1936 = 1.)
  9555. Firing rl*prefer*rvt*predict-yes*H0*3
  9556. -->
  9557. (S1 ^operator O1935 = 0.)
  9558. Firing prefer*rvt*predict-yes*H0
  9559. -->
  9560. Firing prefer*rvt*predict-no*H0
  9561. -->
  9562. Firing elaborate*copy-dir-to-output-link
  9563. -->
  9564. (I3 ^dir U +)
  9565. inner elaboration loop at bottom goal.
  9566. Retracting elaborate*copy-see-to-output-link
  9567. -->
  9568. (I3 ^see 1 +)
  9569. Retracting propose*predict-no
  9570. -->
  9571. (O1936 ^name predict-no +)
  9572. (S1 ^operator O1936 +)
  9573. Retracting propose*predict-yes
  9574. -->
  9575. (O1935 ^name predict-yes +)
  9576. (S1 ^operator O1935 +)
  9577. Retracting elaborate*reward*based*on*reward
  9578. -->
  9579. (R971 ^value 1 +)
  9580. (R1 ^reward R971 +)
  9581. Retracting elaborate*copy-dir-to-output-link
  9582. -->
  9583. (I3 ^dir R +)
  9584. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  9585. -->
  9586. (S1 ^operator O1936 = -0.1937987592593187)
  9587. Retracting rl*prefer*rvt*predict-no*H0*6
  9588. -->
  9589. (S1 ^operator O1936 = 0.2298717920574965)
  9590. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  9591. -->
  9592. (S1 ^operator O1935 = 0.7064977054068989)
  9593. Retracting rl*prefer*rvt*predict-yes*H0*5
  9594. -->
  9595. (S1 ^operator O1935 = 0.2939914352270483)
  9596. =>WM: (13569: S1 ^operator O1938 +)
  9597. =>WM: (13568: S1 ^operator O1937 +)
  9598. =>WM: (13567: I3 ^dir U)
  9599. =>WM: (13566: O1938 ^name predict-no)
  9600. =>WM: (13565: O1937 ^name predict-yes)
  9601. =>WM: (13564: R972 ^value 1)
  9602. =>WM: (13563: R1 ^reward R972)
  9603. <=WM: (13554: S1 ^operator O1935 +)
  9604. <=WM: (13556: S1 ^operator O1935)
  9605. <=WM: (13555: S1 ^operator O1936 +)
  9606. <=WM: (13553: I3 ^dir R)
  9607. <=WM: (13549: R1 ^reward R971)
  9608. <=WM: (13552: O1936 ^name predict-no)
  9609. <=WM: (13551: O1935 ^name predict-yes)
  9610. <=WM: (13550: R971 ^value 1)
  9611. --- Inner Elaboration Phase, active level 1 (S1) ---
  9612. Firing prefer*rvt*predict-yes*H0
  9613. -->
  9614. Firing rl*prefer*rvt*predict-yes*H0*3
  9615. -->
  9616. (S1 ^operator O1937 = 0.)
  9617. Firing prefer*rvt*predict-no*H0
  9618. -->
  9619. Firing rl*prefer*rvt*predict-no*H0*4
  9620. -->
  9621. (S1 ^operator O1938 = 1.)
  9622. inner elaboration loop at bottom goal.
  9623. Retracting rl*prefer*rvt*predict-no*H0*4
  9624. -->
  9625. (S1 ^operator O1936 = 1.)
  9626. Retracting rl*prefer*rvt*predict-yes*H0*3
  9627. -->
  9628. (S1 ^operator O1935 = 0.)
  9629. --- END Proposal Phase ---
  9630. --- Decision Phase ---
  9631. RL update rl*prefer*rvt*predict-yes*H0*5 0.501065 -0.207074 0.293991 -> 0.501028 -0.207078 0.293951(R,m,v=1,0.838926,0.136042)
  9632. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499374 0.207123 0.706498 -> 0.499331 0.207118 0.70645(R,m,v=1,1,0)
  9633. =>WM: (13570: S1 ^operator O1938)
  9634. 969: O: O1938 (predict-no)
  9635. --- END Decision Phase ---
  9636. --- Application Phase ---
  9637. --- Firing Productions (PE) For State At Depth 1 ---
  9638. --- Inner Elaboration Phase, active level 1 (S1) ---
  9639. Firing apply*operator
  9640. -->
  9641. (I3 ^predict-no N969 + :O )
  9642. Firing apply*operator*complete
  9643. -->
  9644. (I3 ^predict-yes N968 - :O )
  9645. inner elaboration loop at bottom goal.
  9646. --- Change Working Memory (PE) ---
  9647. =>WM: (13571: I3 ^predict-no N969)
  9648. <=WM: (13558: N968 ^status complete)
  9649. <=WM: (13557: I3 ^predict-yes N968)
  9650. --- Firing Productions (IE) For State At Depth 1 ---
  9651. --- Inner Elaboration Phase, active level 1 (S1) ---
  9652. Firing monitor*world
  9653. -->
  9654. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9655. --- Change Working Memory (IE) ---
  9656. --- END Application Phase ---
  9657. --- Output Phase ---
  9658. ENV: Agent did: predict-no for direction U in state State-B
  9659. In State-B moving U
  9660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9661. predict error 0
  9662. dir: dir isL
  9663. --- END Output Phase ---
  9664. -/|--- Input Phase ---
  9665. =>WM: (13575: I2 ^dir L)
  9666. =>WM: (13574: I2 ^reward 1)
  9667. =>WM: (13573: I2 ^see 0)
  9668. =>WM: (13572: N969 ^status complete)
  9669. <=WM: (13561: I2 ^dir U)
  9670. <=WM: (13560: I2 ^reward 1)
  9671. <=WM: (13559: I2 ^see 1)
  9672. =>WM: (13576: I2 ^level-1 R1-root)
  9673. <=WM: (13562: I2 ^level-1 R1-root)
  9674. --- END Input Phase ---
  9675. --- Proposal Phase ---
  9676. --- Inner Elaboration Phase, active level 1 (S1) ---
  9677. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9678. -->
  9679. (S1 ^operator O1937 = 0.6196194522363663)
  9680. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9681. -->
  9682. (S1 ^operator O1938 = -0.1479504104026684)
  9683. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9684. -->
  9685. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9686. -->
  9687. Firing elaborate*copy-see-to-output-link
  9688. -->
  9689. (I3 ^see 0 +)
  9690. Firing elaborate*reward*based*on*reward
  9691. -->
  9692. (R973 ^value 1 +)
  9693. (R1 ^reward R973 +)
  9694. Firing propose*predict-yes
  9695. -->
  9696. (O1939 ^name predict-yes +)
  9697. (S1 ^operator O1939 +)
  9698. Firing propose*predict-no
  9699. -->
  9700. (O1940 ^name predict-no +)
  9701. (S1 ^operator O1940 +)
  9702. Firing rl*prefer*rvt*predict-no*H0*2
  9703. -->
  9704. (S1 ^operator O1938 = 0.3140405292214645)
  9705. Firing rl*prefer*rvt*predict-yes*H0*1
  9706. -->
  9707. (S1 ^operator O1937 = 0.3804173687365902)
  9708. Firing prefer*rvt*predict-yes*H0
  9709. -->
  9710. Firing prefer*rvt*predict-no*H0
  9711. -->
  9712. Firing elaborate*copy-dir-to-output-link
  9713. -->
  9714. (I3 ^dir L +)
  9715. inner elaboration loop at bottom goal.
  9716. Retracting elaborate*copy-see-to-output-link
  9717. -->
  9718. (I3 ^see 1 +)
  9719. Retracting propose*predict-no
  9720. -->
  9721. (O1938 ^name predict-no +)
  9722. (S1 ^operator O1938 +)
  9723. Retracting propose*predict-yes
  9724. -->
  9725. (O1937 ^name predict-yes +)
  9726. (S1 ^operator O1937 +)
  9727. Retracting elaborate*reward*based*on*reward
  9728. -->
  9729. (R972 ^value 1 +)
  9730. (R1 ^reward R972 +)
  9731. Retracting elaborate*copy-dir-to-output-link
  9732. -->
  9733. (I3 ^dir U +)
  9734. Retracting rl*prefer*rvt*predict-no*H0*4
  9735. -->
  9736. (S1 ^operator O1938 = 1.)
  9737. Retracting rl*prefer*rvt*predict-yes*H0*3
  9738. -->
  9739. (S1 ^operator O1937 = 0.)
  9740. =>WM: (13584: S1 ^operator O1940 +)
  9741. =>WM: (13583: S1 ^operator O1939 +)
  9742. =>WM: (13582: I3 ^dir L)
  9743. =>WM: (13581: O1940 ^name predict-no)
  9744. =>WM: (13580: O1939 ^name predict-yes)
  9745. =>WM: (13579: R973 ^value 1)
  9746. =>WM: (13578: R1 ^reward R973)
  9747. =>WM: (13577: I3 ^see 0)
  9748. <=WM: (13568: S1 ^operator O1937 +)
  9749. <=WM: (13569: S1 ^operator O1938 +)
  9750. <=WM: (13570: S1 ^operator O1938)
  9751. <=WM: (13567: I3 ^dir U)
  9752. <=WM: (13563: R1 ^reward R972)
  9753. <=WM: (13548: I3 ^see 1)
  9754. <=WM: (13566: O1938 ^name predict-no)
  9755. <=WM: (13565: O1937 ^name predict-yes)
  9756. <=WM: (13564: R972 ^value 1)
  9757. --- Inner Elaboration Phase, active level 1 (S1) ---
  9758. Firing prefer*rvt*predict-yes*H0
  9759. -->
  9760. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9761. -->
  9762. (S1 ^operator O1939 = 0.6196194522363663)
  9763. Firing rl*prefer*rvt*predict-yes*H0*1
  9764. -->
  9765. (S1 ^operator O1939 = 0.3804173687365902)
  9766. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9767. -->
  9768. Firing prefer*rvt*predict-no*H0
  9769. -->
  9770. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9771. -->
  9772. (S1 ^operator O1940 = -0.1479504104026684)
  9773. Firing rl*prefer*rvt*predict-no*H0*2
  9774. -->
  9775. (S1 ^operator O1940 = 0.3140405292214645)
  9776. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9777. -->
  9778. inner elaboration loop at bottom goal.
  9779. Retracting rl*prefer*rvt*predict-no*H0*2
  9780. -->
  9781. (S1 ^operator O1938 = 0.3140405292214645)
  9782. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9783. -->
  9784. (S1 ^operator O1938 = -0.1479504104026684)
  9785. Retracting rl*prefer*rvt*predict-yes*H0*1
  9786. -->
  9787. (S1 ^operator O1937 = 0.3804173687365902)
  9788. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9789. -->
  9790. (S1 ^operator O1937 = 0.6196194522363663)
  9791. --- END Proposal Phase ---
  9792. --- Decision Phase ---
  9793. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9794. =>WM: (13585: S1 ^operator O1939)
  9795. 970: O: O1939 (predict-yes)
  9796. --- END Decision Phase ---
  9797. --- Application Phase ---
  9798. --- Firing Productions (PE) For State At Depth 1 ---
  9799. --- Inner Elaboration Phase, active level 1 (S1) ---
  9800. Firing apply*operator
  9801. -->
  9802. (I3 ^predict-yes N970 + :O )
  9803. Firing apply*operator*complete
  9804. -->
  9805. (I3 ^predict-no N969 - :O )
  9806. inner elaboration loop at bottom goal.
  9807. --- Change Working Memory (PE) ---
  9808. =>WM: (13586: I3 ^predict-yes N970)
  9809. <=WM: (13572: N969 ^status complete)
  9810. <=WM: (13571: I3 ^predict-no N969)
  9811. --- Firing Productions (IE) For State At Depth 1 ---
  9812. --- Inner Elaboration Phase, active level 1 (S1) ---
  9813. Firing monitor*world
  9814. -->
  9815. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9816. --- Change Working Memory (IE) ---
  9817. --- END Application Phase ---
  9818. --- Output Phase ---
  9819. ENV: Agent did: predict-yes for direction L in state State-B
  9820. In State-B moving L
  9821. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9822. predict error 0
  9823. dir: dir isU
  9824. --- END Output Phase ---
  9825. \-/--- Input Phase ---
  9826. =>WM: (13590: I2 ^dir U)
  9827. =>WM: (13589: I2 ^reward 1)
  9828. =>WM: (13588: I2 ^see 1)
  9829. =>WM: (13587: N970 ^status complete)
  9830. <=WM: (13575: I2 ^dir L)
  9831. <=WM: (13574: I2 ^reward 1)
  9832. <=WM: (13573: I2 ^see 0)
  9833. =>WM: (13591: I2 ^level-1 L1-root)
  9834. <=WM: (13576: I2 ^level-1 R1-root)
  9835. --- END Input Phase ---
  9836. --- Proposal Phase ---
  9837. --- Inner Elaboration Phase, active level 1 (S1) ---
  9838. Firing elaborate*copy-see-to-output-link
  9839. -->
  9840. (I3 ^see 1 +)
  9841. Firing elaborate*reward*based*on*reward
  9842. -->
  9843. (R974 ^value 1 +)
  9844. (R1 ^reward R974 +)
  9845. Firing propose*predict-yes
  9846. -->
  9847. (O1941 ^name predict-yes +)
  9848. (S1 ^operator O1941 +)
  9849. Firing propose*predict-no
  9850. -->
  9851. (O1942 ^name predict-no +)
  9852. (S1 ^operator O1942 +)
  9853. Firing rl*prefer*rvt*predict-no*H0*4
  9854. -->
  9855. (S1 ^operator O1940 = 1.)
  9856. Firing rl*prefer*rvt*predict-yes*H0*3
  9857. -->
  9858. (S1 ^operator O1939 = 0.)
  9859. Firing prefer*rvt*predict-yes*H0
  9860. -->
  9861. Firing prefer*rvt*predict-no*H0
  9862. -->
  9863. Firing elaborate*copy-dir-to-output-link
  9864. -->
  9865. (I3 ^dir U +)
  9866. inner elaboration loop at bottom goal.
  9867. Retracting elaborate*copy-see-to-output-link
  9868. -->
  9869. (I3 ^see 0 +)
  9870. Retracting propose*predict-no
  9871. -->
  9872. (O1940 ^name predict-no +)
  9873. (S1 ^operator O1940 +)
  9874. Retracting propose*predict-yes
  9875. -->
  9876. (O1939 ^name predict-yes +)
  9877. (S1 ^operator O1939 +)
  9878. Retracting elaborate*reward*based*on*reward
  9879. -->
  9880. (R973 ^value 1 +)
  9881. (R1 ^reward R973 +)
  9882. Retracting elaborate*copy-dir-to-output-link
  9883. -->
  9884. (I3 ^dir L +)
  9885. Retracting rl*prefer*rvt*predict-no*H0*2
  9886. -->
  9887. (S1 ^operator O1940 = 0.3140405292214645)
  9888. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  9889. -->
  9890. (S1 ^operator O1940 = -0.1479504104026684)
  9891. Retracting rl*prefer*rvt*predict-yes*H0*1
  9892. -->
  9893. (S1 ^operator O1939 = 0.3804173687365902)
  9894. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  9895. -->
  9896. (S1 ^operator O1939 = 0.6196194522363663)
  9897. =>WM: (13599: S1 ^operator O1942 +)
  9898. =>WM: (13598: S1 ^operator O1941 +)
  9899. =>WM: (13597: I3 ^dir U)
  9900. =>WM: (13596: O1942 ^name predict-no)
  9901. =>WM: (13595: O1941 ^name predict-yes)
  9902. =>WM: (13594: R974 ^value 1)
  9903. =>WM: (13593: R1 ^reward R974)
  9904. =>WM: (13592: I3 ^see 1)
  9905. <=WM: (13583: S1 ^operator O1939 +)
  9906. <=WM: (13585: S1 ^operator O1939)
  9907. <=WM: (13584: S1 ^operator O1940 +)
  9908. <=WM: (13582: I3 ^dir L)
  9909. <=WM: (13578: R1 ^reward R973)
  9910. <=WM: (13577: I3 ^see 0)
  9911. <=WM: (13581: O1940 ^name predict-no)
  9912. <=WM: (13580: O1939 ^name predict-yes)
  9913. <=WM: (13579: R973 ^value 1)
  9914. --- Inner Elaboration Phase, active level 1 (S1) ---
  9915. Firing prefer*rvt*predict-yes*H0
  9916. -->
  9917. Firing rl*prefer*rvt*predict-yes*H0*3
  9918. -->
  9919. (S1 ^operator O1941 = 0.)
  9920. Firing prefer*rvt*predict-no*H0
  9921. -->
  9922. Firing rl*prefer*rvt*predict-no*H0*4
  9923. -->
  9924. (S1 ^operator O1942 = 1.)
  9925. inner elaboration loop at bottom goal.
  9926. Retracting rl*prefer*rvt*predict-no*H0*4
  9927. -->
  9928. (S1 ^operator O1940 = 1.)
  9929. Retracting rl*prefer*rvt*predict-yes*H0*3
  9930. -->
  9931. (S1 ^operator O1939 = 0.)
  9932. --- END Proposal Phase ---
  9933. --- Decision Phase ---
  9934. RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.823899,0.146007)
  9935. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478693 0.140927 0.619619 -> 0.478689 0.140927 0.619616(R,m,v=1,1,0)
  9936. =>WM: (13600: S1 ^operator O1942)
  9937. 971: O: O1942 (predict-no)
  9938. --- END Decision Phase ---
  9939. --- Application Phase ---
  9940. --- Firing Productions (PE) For State At Depth 1 ---
  9941. --- Inner Elaboration Phase, active level 1 (S1) ---
  9942. Firing apply*operator
  9943. -->
  9944. (I3 ^predict-no N971 + :O )
  9945. Firing apply*operator*complete
  9946. -->
  9947. (I3 ^predict-yes N970 - :O )
  9948. inner elaboration loop at bottom goal.
  9949. --- Change Working Memory (PE) ---
  9950. =>WM: (13601: I3 ^predict-no N971)
  9951. <=WM: (13587: N970 ^status complete)
  9952. <=WM: (13586: I3 ^predict-yes N970)
  9953. --- Firing Productions (IE) For State At Depth 1 ---
  9954. --- Inner Elaboration Phase, active level 1 (S1) ---
  9955. Firing monitor*world
  9956. -->
  9957. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9958. --- Change Working Memory (IE) ---
  9959. --- END Application Phase ---
  9960. --- Output Phase ---
  9961. ENV: Agent did: predict-no for direction U in state State-A
  9962. In State-A moving U
  9963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9964. predict error 0
  9965. dir: dir isL
  9966. --- END Output Phase ---
  9967. |--- Input Phase ---
  9968. =>WM: (13605: I2 ^dir L)
  9969. =>WM: (13604: I2 ^reward 1)
  9970. =>WM: (13603: I2 ^see 0)
  9971. =>WM: (13602: N971 ^status complete)
  9972. <=WM: (13590: I2 ^dir U)
  9973. <=WM: (13589: I2 ^reward 1)
  9974. <=WM: (13588: I2 ^see 1)
  9975. =>WM: (13606: I2 ^level-1 L1-root)
  9976. <=WM: (13591: I2 ^level-1 L1-root)
  9977. --- END Input Phase ---
  9978. --- Proposal Phase ---
  9979. --- Inner Elaboration Phase, active level 1 (S1) ---
  9980. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  9981. -->
  9982. (S1 ^operator O1941 = -0.3470159027404986)
  9983. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  9984. -->
  9985. (S1 ^operator O1942 = 0.6861654297024582)
  9986. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9987. -->
  9988. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9989. -->
  9990. Firing elaborate*copy-see-to-output-link
  9991. -->
  9992. (I3 ^see 0 +)
  9993. Firing elaborate*reward*based*on*reward
  9994. -->
  9995. (R975 ^value 1 +)
  9996. (R1 ^reward R975 +)
  9997. Firing propose*predict-yes
  9998. -->
  9999. (O1943 ^name predict-yes +)
  10000. (S1 ^operator O1943 +)
  10001. Firing propose*predict-no
  10002. -->
  10003. (O1944 ^name predict-no +)
  10004. (S1 ^operator O1944 +)
  10005. Firing rl*prefer*rvt*predict-no*H0*2
  10006. -->
  10007. (S1 ^operator O1942 = 0.3140405292214645)
  10008. Firing rl*prefer*rvt*predict-yes*H0*1
  10009. -->
  10010. (S1 ^operator O1941 = 0.3804143351598744)
  10011. Firing prefer*rvt*predict-yes*H0
  10012. -->
  10013. Firing prefer*rvt*predict-no*H0
  10014. -->
  10015. Firing elaborate*copy-dir-to-output-link
  10016. -->
  10017. (I3 ^dir L +)
  10018. inner elaboration loop at bottom goal.
  10019. Retracting elaborate*copy-see-to-output-link
  10020. -->
  10021. (I3 ^see 1 +)
  10022. Retracting propose*predict-no
  10023. -->
  10024. (O1942 ^name predict-no +)
  10025. (S1 ^operator O1942 +)
  10026. Retracting propose*predict-yes
  10027. -->
  10028. (O1941 ^name predict-yes +)
  10029. (S1 ^operator O1941 +)
  10030. Retracting elaborate*reward*based*on*reward
  10031. -->
  10032. (R974 ^value 1 +)
  10033. (R1 ^reward R974 +)
  10034. Retracting elaborate*copy-dir-to-output-link
  10035. -->
  10036. (I3 ^dir U +)
  10037. Retracting rl*prefer*rvt*predict-no*H0*4
  10038. -->
  10039. (S1 ^operator O1942 = 1.)
  10040. Retracting rl*prefer*rvt*predict-yes*H0*3
  10041. -->
  10042. (S1 ^operator O1941 = 0.)
  10043. =>WM: (13614: S1 ^operator O1944 +)
  10044. =>WM: (13613: S1 ^operator O1943 +)
  10045. =>WM: (13612: I3 ^dir L)
  10046. =>WM: (13611: O1944 ^name predict-no)
  10047. =>WM: (13610: O1943 ^name predict-yes)
  10048. =>WM: (13609: R975 ^value 1)
  10049. =>WM: (13608: R1 ^reward R975)
  10050. =>WM: (13607: I3 ^see 0)
  10051. <=WM: (13598: S1 ^operator O1941 +)
  10052. <=WM: (13599: S1 ^operator O1942 +)
  10053. <=WM: (13600: S1 ^operator O1942)
  10054. <=WM: (13597: I3 ^dir U)
  10055. <=WM: (13593: R1 ^reward R974)
  10056. <=WM: (13592: I3 ^see 1)
  10057. <=WM: (13596: O1942 ^name predict-no)
  10058. <=WM: (13595: O1941 ^name predict-yes)
  10059. <=WM: (13594: R974 ^value 1)
  10060. --- Inner Elaboration Phase, active level 1 (S1) ---
  10061. Firing prefer*rvt*predict-yes*H0
  10062. -->
  10063. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10064. -->
  10065. (S1 ^operator O1943 = -0.3470159027404986)
  10066. Firing rl*prefer*rvt*predict-yes*H0*1
  10067. -->
  10068. (S1 ^operator O1943 = 0.3804143351598744)
  10069. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10070. -->
  10071. Firing prefer*rvt*predict-no*H0
  10072. -->
  10073. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10074. -->
  10075. (S1 ^operator O1944 = 0.6861654297024582)
  10076. Firing rl*prefer*rvt*predict-no*H0*2
  10077. -->
  10078. (S1 ^operator O1944 = 0.3140405292214645)
  10079. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10080. -->
  10081. inner elaboration loop at bottom goal.
  10082. Retracting rl*prefer*rvt*predict-no*H0*2
  10083. -->
  10084. (S1 ^operator O1942 = 0.3140405292214645)
  10085. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10086. -->
  10087. (S1 ^operator O1942 = 0.6861654297024582)
  10088. Retracting rl*prefer*rvt*predict-yes*H0*1
  10089. -->
  10090. (S1 ^operator O1941 = 0.3804143351598744)
  10091. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10092. -->
  10093. (S1 ^operator O1941 = -0.3470159027404986)
  10094. --- END Proposal Phase ---
  10095. --- Decision Phase ---
  10096. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10097. =>WM: (13615: S1 ^operator O1944)
  10098. 972: O: O1944 (predict-no)
  10099. --- END Decision Phase ---
  10100. --- Application Phase ---
  10101. --- Firing Productions (PE) For State At Depth 1 ---
  10102. --- Inner Elaboration Phase, active level 1 (S1) ---
  10103. Firing apply*operator
  10104. -->
  10105. (I3 ^predict-no N972 + :O )
  10106. Firing apply*operator*complete
  10107. -->
  10108. (I3 ^predict-no N971 - :O )
  10109. inner elaboration loop at bottom goal.
  10110. --- Change Working Memory (PE) ---
  10111. =>WM: (13616: I3 ^predict-no N972)
  10112. <=WM: (13602: N971 ^status complete)
  10113. <=WM: (13601: I3 ^predict-no N971)
  10114. --- Firing Productions (IE) For State At Depth 1 ---
  10115. --- Inner Elaboration Phase, active level 1 (S1) ---
  10116. Firing monitor*world
  10117. -->
  10118. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10119. --- Change Working Memory (IE) ---
  10120. --- END Application Phase ---
  10121. --- Output Phase ---
  10122. ENV: Agent did: predict-no for direction L in state State-A
  10123. In State-A moving L
  10124. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10125. predict error 0
  10126. dir: dir isR
  10127. --- END Output Phase ---
  10128. \-/--- Input Phase ---
  10129. =>WM: (13620: I2 ^dir R)
  10130. =>WM: (13619: I2 ^reward 1)
  10131. =>WM: (13618: I2 ^see 0)
  10132. =>WM: (13617: N972 ^status complete)
  10133. <=WM: (13605: I2 ^dir L)
  10134. <=WM: (13604: I2 ^reward 1)
  10135. <=WM: (13603: I2 ^see 0)
  10136. =>WM: (13621: I2 ^level-1 L0-root)
  10137. <=WM: (13606: I2 ^level-1 L1-root)
  10138. --- END Input Phase ---
  10139. --- Proposal Phase ---
  10140. --- Inner Elaboration Phase, active level 1 (S1) ---
  10141. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10142. -->
  10143. (S1 ^operator O1943 = 0.7054436376897688)
  10144. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10145. -->
  10146. (S1 ^operator O1944 = -0.2023211881870005)
  10147. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10148. -->
  10149. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10150. -->
  10151. Firing elaborate*copy-see-to-output-link
  10152. -->
  10153. (I3 ^see 0 +)
  10154. Firing elaborate*reward*based*on*reward
  10155. -->
  10156. (R976 ^value 1 +)
  10157. (R1 ^reward R976 +)
  10158. Firing propose*predict-yes
  10159. -->
  10160. (O1945 ^name predict-yes +)
  10161. (S1 ^operator O1945 +)
  10162. Firing propose*predict-no
  10163. -->
  10164. (O1946 ^name predict-no +)
  10165. (S1 ^operator O1946 +)
  10166. Firing rl*prefer*rvt*predict-no*H0*6
  10167. -->
  10168. (S1 ^operator O1944 = 0.2298717920574965)
  10169. Firing rl*prefer*rvt*predict-yes*H0*5
  10170. -->
  10171. (S1 ^operator O1943 = 0.2939507002996337)
  10172. Firing prefer*rvt*predict-yes*H0
  10173. -->
  10174. Firing prefer*rvt*predict-no*H0
  10175. -->
  10176. Firing elaborate*copy-dir-to-output-link
  10177. -->
  10178. (I3 ^dir R +)
  10179. inner elaboration loop at bottom goal.
  10180. Retracting elaborate*copy-see-to-output-link
  10181. -->
  10182. (I3 ^see 0 +)
  10183. Retracting propose*predict-no
  10184. -->
  10185. (O1944 ^name predict-no +)
  10186. (S1 ^operator O1944 +)
  10187. Retracting propose*predict-yes
  10188. -->
  10189. (O1943 ^name predict-yes +)
  10190. (S1 ^operator O1943 +)
  10191. Retracting elaborate*reward*based*on*reward
  10192. -->
  10193. (R975 ^value 1 +)
  10194. (R1 ^reward R975 +)
  10195. Retracting elaborate*copy-dir-to-output-link
  10196. -->
  10197. (I3 ^dir L +)
  10198. Retracting rl*prefer*rvt*predict-no*H0*2
  10199. -->
  10200. (S1 ^operator O1944 = 0.3140405292214645)
  10201. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  10202. -->
  10203. (S1 ^operator O1944 = 0.6861654297024582)
  10204. Retracting rl*prefer*rvt*predict-yes*H0*1
  10205. -->
  10206. (S1 ^operator O1943 = 0.3804143351598744)
  10207. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  10208. -->
  10209. (S1 ^operator O1943 = -0.3470159027404986)
  10210. =>WM: (13628: S1 ^operator O1946 +)
  10211. =>WM: (13627: S1 ^operator O1945 +)
  10212. =>WM: (13626: I3 ^dir R)
  10213. =>WM: (13625: O1946 ^name predict-no)
  10214. =>WM: (13624: O1945 ^name predict-yes)
  10215. =>WM: (13623: R976 ^value 1)
  10216. =>WM: (13622: R1 ^reward R976)
  10217. <=WM: (13613: S1 ^operator O1943 +)
  10218. <=WM: (13614: S1 ^operator O1944 +)
  10219. <=WM: (13615: S1 ^operator O1944)
  10220. <=WM: (13612: I3 ^dir L)
  10221. <=WM: (13608: R1 ^reward R975)
  10222. <=WM: (13611: O1944 ^name predict-no)
  10223. <=WM: (13610: O1943 ^name predict-yes)
  10224. <=WM: (13609: R975 ^value 1)
  10225. --- Inner Elaboration Phase, active level 1 (S1) ---
  10226. Firing prefer*rvt*predict-yes*H0
  10227. -->
  10228. Firing rl*prefer*rvt*predict-yes*H0*5
  10229. -->
  10230. (S1 ^operator O1945 = 0.2939507002996337)
  10231. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10232. -->
  10233. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10234. -->
  10235. (S1 ^operator O1945 = 0.7054436376897688)
  10236. Firing prefer*rvt*predict-no*H0
  10237. -->
  10238. Firing rl*prefer*rvt*predict-no*H0*6
  10239. -->
  10240. (S1 ^operator O1946 = 0.2298717920574965)
  10241. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10242. -->
  10243. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10244. -->
  10245. (S1 ^operator O1946 = -0.2023211881870005)
  10246. inner elaboration loop at bottom goal.
  10247. Retracting rl*prefer*rvt*predict-no*H0*6
  10248. -->
  10249. (S1 ^operator O1944 = 0.2298717920574965)
  10250. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10251. -->
  10252. (S1 ^operator O1944 = -0.2023211881870005)
  10253. Retracting rl*prefer*rvt*predict-yes*H0*5
  10254. -->
  10255. (S1 ^operator O1943 = 0.2939507002996337)
  10256. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10257. -->
  10258. (S1 ^operator O1943 = 0.7054436376897688)
  10259. --- END Proposal Phase ---
  10260. --- Decision Phase ---
  10261. RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.314041 -> 0.485033 -0.171009 0.314023(R,m,v=1,0.86,0.121208)
  10262. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515116 0.171049 0.686165 -> 0.5151 0.171045 0.686145(R,m,v=1,1,0)
  10263. =>WM: (13629: S1 ^operator O1945)
  10264. 973: O: O1945 (predict-yes)
  10265. --- END Decision Phase ---
  10266. --- Application Phase ---
  10267. --- Firing Productions (PE) For State At Depth 1 ---
  10268. --- Inner Elaboration Phase, active level 1 (S1) ---
  10269. Firing apply*operator
  10270. -->
  10271. (I3 ^predict-yes N973 + :O )
  10272. Firing apply*operator*complete
  10273. -->
  10274. (I3 ^predict-no N972 - :O )
  10275. inner elaboration loop at bottom goal.
  10276. --- Change Working Memory (PE) ---
  10277. =>WM: (13630: I3 ^predict-yes N973)
  10278. <=WM: (13617: N972 ^status complete)
  10279. <=WM: (13616: I3 ^predict-no N972)
  10280. --- Firing Productions (IE) For State At Depth 1 ---
  10281. --- Inner Elaboration Phase, active level 1 (S1) ---
  10282. Firing monitor*world
  10283. -->
  10284. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10285. --- Change Working Memory (IE) ---
  10286. --- END Application Phase ---
  10287. --- Output Phase ---
  10288. ENV: Agent did: predict-yes for direction R in state State-A
  10289. In State-A moving R
  10290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10291. predict error 0
  10292. dir: dir isU
  10293. --- END Output Phase ---
  10294. |\---- Input Phase ---
  10295. =>WM: (13634: I2 ^dir U)
  10296. =>WM: (13633: I2 ^reward 1)
  10297. =>WM: (13632: I2 ^see 1)
  10298. =>WM: (13631: N973 ^status complete)
  10299. <=WM: (13620: I2 ^dir R)
  10300. <=WM: (13619: I2 ^reward 1)
  10301. <=WM: (13618: I2 ^see 0)
  10302. =>WM: (13635: I2 ^level-1 R1-root)
  10303. <=WM: (13621: I2 ^level-1 L0-root)
  10304. --- END Input Phase ---
  10305. --- Proposal Phase ---
  10306. --- Inner Elaboration Phase, active level 1 (S1) ---
  10307. Firing elaborate*copy-see-to-output-link
  10308. -->
  10309. (I3 ^see 1 +)
  10310. Firing elaborate*reward*based*on*reward
  10311. -->
  10312. (R977 ^value 1 +)
  10313. (R1 ^reward R977 +)
  10314. Firing propose*predict-yes
  10315. -->
  10316. (O1947 ^name predict-yes +)
  10317. (S1 ^operator O1947 +)
  10318. Firing propose*predict-no
  10319. -->
  10320. (O1948 ^name predict-no +)
  10321. (S1 ^operator O1948 +)
  10322. Firing rl*prefer*rvt*predict-no*H0*4
  10323. -->
  10324. (S1 ^operator O1946 = 1.)
  10325. Firing rl*prefer*rvt*predict-yes*H0*3
  10326. -->
  10327. (S1 ^operator O1945 = 0.)
  10328. Firing prefer*rvt*predict-yes*H0
  10329. -->
  10330. Firing prefer*rvt*predict-no*H0
  10331. -->
  10332. Firing elaborate*copy-dir-to-output-link
  10333. -->
  10334. (I3 ^dir U +)
  10335. inner elaboration loop at bottom goal.
  10336. Retracting elaborate*copy-see-to-output-link
  10337. -->
  10338. (I3 ^see 0 +)
  10339. Retracting propose*predict-no
  10340. -->
  10341. (O1946 ^name predict-no +)
  10342. (S1 ^operator O1946 +)
  10343. Retracting propose*predict-yes
  10344. -->
  10345. (O1945 ^name predict-yes +)
  10346. (S1 ^operator O1945 +)
  10347. Retracting elaborate*reward*based*on*reward
  10348. -->
  10349. (R976 ^value 1 +)
  10350. (R1 ^reward R976 +)
  10351. Retracting elaborate*copy-dir-to-output-link
  10352. -->
  10353. (I3 ^dir R +)
  10354. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  10355. -->
  10356. (S1 ^operator O1946 = -0.2023211881870005)
  10357. Retracting rl*prefer*rvt*predict-no*H0*6
  10358. -->
  10359. (S1 ^operator O1946 = 0.2298717920574965)
  10360. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10361. -->
  10362. (S1 ^operator O1945 = 0.7054436376897688)
  10363. Retracting rl*prefer*rvt*predict-yes*H0*5
  10364. -->
  10365. (S1 ^operator O1945 = 0.2939507002996337)
  10366. =>WM: (13643: S1 ^operator O1948 +)
  10367. =>WM: (13642: S1 ^operator O1947 +)
  10368. =>WM: (13641: I3 ^dir U)
  10369. =>WM: (13640: O1948 ^name predict-no)
  10370. =>WM: (13639: O1947 ^name predict-yes)
  10371. =>WM: (13638: R977 ^value 1)
  10372. =>WM: (13637: R1 ^reward R977)
  10373. =>WM: (13636: I3 ^see 1)
  10374. <=WM: (13627: S1 ^operator O1945 +)
  10375. <=WM: (13629: S1 ^operator O1945)
  10376. <=WM: (13628: S1 ^operator O1946 +)
  10377. <=WM: (13626: I3 ^dir R)
  10378. <=WM: (13622: R1 ^reward R976)
  10379. <=WM: (13607: I3 ^see 0)
  10380. <=WM: (13625: O1946 ^name predict-no)
  10381. <=WM: (13624: O1945 ^name predict-yes)
  10382. <=WM: (13623: R976 ^value 1)
  10383. --- Inner Elaboration Phase, active level 1 (S1) ---
  10384. Firing prefer*rvt*predict-yes*H0
  10385. -->
  10386. Firing rl*prefer*rvt*predict-yes*H0*3
  10387. -->
  10388. (S1 ^operator O1947 = 0.)
  10389. Firing prefer*rvt*predict-no*H0
  10390. -->
  10391. Firing rl*prefer*rvt*predict-no*H0*4
  10392. -->
  10393. (S1 ^operator O1948 = 1.)
  10394. inner elaboration loop at bottom goal.
  10395. Retracting rl*prefer*rvt*predict-no*H0*4
  10396. -->
  10397. (S1 ^operator O1946 = 1.)
  10398. Retracting rl*prefer*rvt*predict-yes*H0*3
  10399. -->
  10400. (S1 ^operator O1945 = 0.)
  10401. --- END Proposal Phase ---
  10402. --- Decision Phase ---
  10403. RL update rl*prefer*rvt*predict-yes*H0*5 0.501028 -0.207078 0.293951 -> 0.501074 -0.207073 0.294001(R,m,v=1,0.84,0.135302)
  10404. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498423 0.207021 0.705444 -> 0.498477 0.207026 0.705503(R,m,v=1,1,0)
  10405. =>WM: (13644: S1 ^operator O1948)
  10406. 974: O: O1948 (predict-no)
  10407. --- END Decision Phase ---
  10408. --- Application Phase ---
  10409. --- Firing Productions (PE) For State At Depth 1 ---
  10410. --- Inner Elaboration Phase, active level 1 (S1) ---
  10411. Firing apply*operator
  10412. -->
  10413. (I3 ^predict-no N974 + :O )
  10414. Firing apply*operator*complete
  10415. -->
  10416. (I3 ^predict-yes N973 - :O )
  10417. inner elaboration loop at bottom goal.
  10418. --- Change Working Memory (PE) ---
  10419. =>WM: (13645: I3 ^predict-no N974)
  10420. <=WM: (13631: N973 ^status complete)
  10421. <=WM: (13630: I3 ^predict-yes N973)
  10422. --- Firing Productions (IE) For State At Depth 1 ---
  10423. --- Inner Elaboration Phase, active level 1 (S1) ---
  10424. Firing monitor*world
  10425. -->
  10426. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10427. --- Change Working Memory (IE) ---
  10428. --- END Application Phase ---
  10429. --- Output Phase ---
  10430. ENV: Agent did: predict-no for direction U in state State-B
  10431. In State-B moving U
  10432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10433. predict error 0
  10434. dir: dir isL
  10435. --- END Output Phase ---
  10436. /|\--- Input Phase ---
  10437. =>WM: (13649: I2 ^dir L)
  10438. =>WM: (13648: I2 ^reward 1)
  10439. =>WM: (13647: I2 ^see 0)
  10440. =>WM: (13646: N974 ^status complete)
  10441. <=WM: (13634: I2 ^dir U)
  10442. <=WM: (13633: I2 ^reward 1)
  10443. <=WM: (13632: I2 ^see 1)
  10444. =>WM: (13650: I2 ^level-1 R1-root)
  10445. <=WM: (13635: I2 ^level-1 R1-root)
  10446. --- END Input Phase ---
  10447. --- Proposal Phase ---
  10448. --- Inner Elaboration Phase, active level 1 (S1) ---
  10449. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10450. -->
  10451. (S1 ^operator O1947 = 0.6196158942331635)
  10452. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10453. -->
  10454. (S1 ^operator O1948 = -0.1479504104026684)
  10455. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10456. -->
  10457. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10458. -->
  10459. Firing elaborate*copy-see-to-output-link
  10460. -->
  10461. (I3 ^see 0 +)
  10462. Firing elaborate*reward*based*on*reward
  10463. -->
  10464. (R978 ^value 1 +)
  10465. (R1 ^reward R978 +)
  10466. Firing propose*predict-yes
  10467. -->
  10468. (O1949 ^name predict-yes +)
  10469. (S1 ^operator O1949 +)
  10470. Firing propose*predict-no
  10471. -->
  10472. (O1950 ^name predict-no +)
  10473. (S1 ^operator O1950 +)
  10474. Firing rl*prefer*rvt*predict-no*H0*2
  10475. -->
  10476. (S1 ^operator O1948 = 0.3140233963466647)
  10477. Firing rl*prefer*rvt*predict-yes*H0*1
  10478. -->
  10479. (S1 ^operator O1947 = 0.3804143351598744)
  10480. Firing prefer*rvt*predict-yes*H0
  10481. -->
  10482. Firing prefer*rvt*predict-no*H0
  10483. -->
  10484. Firing elaborate*copy-dir-to-output-link
  10485. -->
  10486. (I3 ^dir L +)
  10487. inner elaboration loop at bottom goal.
  10488. Retracting elaborate*copy-see-to-output-link
  10489. -->
  10490. (I3 ^see 1 +)
  10491. Retracting propose*predict-no
  10492. -->
  10493. (O1948 ^name predict-no +)
  10494. (S1 ^operator O1948 +)
  10495. Retracting propose*predict-yes
  10496. -->
  10497. (O1947 ^name predict-yes +)
  10498. (S1 ^operator O1947 +)
  10499. Retracting elaborate*reward*based*on*reward
  10500. -->
  10501. (R977 ^value 1 +)
  10502. (R1 ^reward R977 +)
  10503. Retracting elaborate*copy-dir-to-output-link
  10504. -->
  10505. (I3 ^dir U +)
  10506. Retracting rl*prefer*rvt*predict-no*H0*4
  10507. -->
  10508. (S1 ^operator O1948 = 1.)
  10509. Retracting rl*prefer*rvt*predict-yes*H0*3
  10510. -->
  10511. (S1 ^operator O1947 = 0.)
  10512. =>WM: (13658: S1 ^operator O1950 +)
  10513. =>WM: (13657: S1 ^operator O1949 +)
  10514. =>WM: (13656: I3 ^dir L)
  10515. =>WM: (13655: O1950 ^name predict-no)
  10516. =>WM: (13654: O1949 ^name predict-yes)
  10517. =>WM: (13653: R978 ^value 1)
  10518. =>WM: (13652: R1 ^reward R978)
  10519. =>WM: (13651: I3 ^see 0)
  10520. <=WM: (13642: S1 ^operator O1947 +)
  10521. <=WM: (13643: S1 ^operator O1948 +)
  10522. <=WM: (13644: S1 ^operator O1948)
  10523. <=WM: (13641: I3 ^dir U)
  10524. <=WM: (13637: R1 ^reward R977)
  10525. <=WM: (13636: I3 ^see 1)
  10526. <=WM: (13640: O1948 ^name predict-no)
  10527. <=WM: (13639: O1947 ^name predict-yes)
  10528. <=WM: (13638: R977 ^value 1)
  10529. --- Inner Elaboration Phase, active level 1 (S1) ---
  10530. Firing prefer*rvt*predict-yes*H0
  10531. -->
  10532. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10533. -->
  10534. (S1 ^operator O1949 = 0.6196158942331635)
  10535. Firing rl*prefer*rvt*predict-yes*H0*1
  10536. -->
  10537. (S1 ^operator O1949 = 0.3804143351598744)
  10538. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10539. -->
  10540. Firing prefer*rvt*predict-no*H0
  10541. -->
  10542. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10543. -->
  10544. (S1 ^operator O1950 = -0.1479504104026684)
  10545. Firing rl*prefer*rvt*predict-no*H0*2
  10546. -->
  10547. (S1 ^operator O1950 = 0.3140233963466647)
  10548. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10549. -->
  10550. inner elaboration loop at bottom goal.
  10551. Retracting rl*prefer*rvt*predict-no*H0*2
  10552. -->
  10553. (S1 ^operator O1948 = 0.3140233963466647)
  10554. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10555. -->
  10556. (S1 ^operator O1948 = -0.1479504104026684)
  10557. Retracting rl*prefer*rvt*predict-yes*H0*1
  10558. -->
  10559. (S1 ^operator O1947 = 0.3804143351598744)
  10560. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10561. -->
  10562. (S1 ^operator O1947 = 0.6196158942331635)
  10563. --- END Proposal Phase ---
  10564. --- Decision Phase ---
  10565. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10566. =>WM: (13659: S1 ^operator O1949)
  10567. 975: O: O1949 (predict-yes)
  10568. --- END Decision Phase ---
  10569. --- Application Phase ---
  10570. --- Firing Productions (PE) For State At Depth 1 ---
  10571. --- Inner Elaboration Phase, active level 1 (S1) ---
  10572. Firing apply*operator
  10573. -->
  10574. (I3 ^predict-yes N975 + :O )
  10575. Firing apply*operator*complete
  10576. -->
  10577. (I3 ^predict-no N974 - :O )
  10578. inner elaboration loop at bottom goal.
  10579. --- Change Working Memory (PE) ---
  10580. =>WM: (13660: I3 ^predict-yes N975)
  10581. <=WM: (13646: N974 ^status complete)
  10582. <=WM: (13645: I3 ^predict-no N974)
  10583. --- Firing Productions (IE) For State At Depth 1 ---
  10584. --- Inner Elaboration Phase, active level 1 (S1) ---
  10585. Firing monitor*world
  10586. -->
  10587. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10588. --- Change Working Memory (IE) ---
  10589. --- END Application Phase ---
  10590. --- Output Phase ---
  10591. ENV: Agent did: predict-yes for direction L in state State-B
  10592. In State-B moving L
  10593. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10594. predict error 0
  10595. dir: dir isR
  10596. --- END Output Phase ---
  10597. -/|--- Input Phase ---
  10598. =>WM: (13664: I2 ^dir R)
  10599. =>WM: (13663: I2 ^reward 1)
  10600. =>WM: (13662: I2 ^see 1)
  10601. =>WM: (13661: N975 ^status complete)
  10602. <=WM: (13649: I2 ^dir L)
  10603. <=WM: (13648: I2 ^reward 1)
  10604. <=WM: (13647: I2 ^see 0)
  10605. =>WM: (13665: I2 ^level-1 L1-root)
  10606. <=WM: (13650: I2 ^level-1 R1-root)
  10607. --- END Input Phase ---
  10608. --- Proposal Phase ---
  10609. --- Inner Elaboration Phase, active level 1 (S1) ---
  10610. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10611. -->
  10612. (S1 ^operator O1949 = 0.7064496972060428)
  10613. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10614. -->
  10615. (S1 ^operator O1950 = -0.1937987592593187)
  10616. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10617. -->
  10618. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10619. -->
  10620. Firing elaborate*copy-see-to-output-link
  10621. -->
  10622. (I3 ^see 1 +)
  10623. Firing elaborate*reward*based*on*reward
  10624. -->
  10625. (R979 ^value 1 +)
  10626. (R1 ^reward R979 +)
  10627. Firing propose*predict-yes
  10628. -->
  10629. (O1951 ^name predict-yes +)
  10630. (S1 ^operator O1951 +)
  10631. Firing propose*predict-no
  10632. -->
  10633. (O1952 ^name predict-no +)
  10634. (S1 ^operator O1952 +)
  10635. Firing rl*prefer*rvt*predict-no*H0*6
  10636. -->
  10637. (S1 ^operator O1950 = 0.2298717920574965)
  10638. Firing rl*prefer*rvt*predict-yes*H0*5
  10639. -->
  10640. (S1 ^operator O1949 = 0.2940010828283485)
  10641. Firing prefer*rvt*predict-yes*H0
  10642. -->
  10643. Firing prefer*rvt*predict-no*H0
  10644. -->
  10645. Firing elaborate*copy-dir-to-output-link
  10646. -->
  10647. (I3 ^dir R +)
  10648. inner elaboration loop at bottom goal.
  10649. Retracting elaborate*copy-see-to-output-link
  10650. -->
  10651. (I3 ^see 0 +)
  10652. Retracting propose*predict-no
  10653. -->
  10654. (O1950 ^name predict-no +)
  10655. (S1 ^operator O1950 +)
  10656. Retracting propose*predict-yes
  10657. -->
  10658. (O1949 ^name predict-yes +)
  10659. (S1 ^operator O1949 +)
  10660. Retracting elaborate*reward*based*on*reward
  10661. -->
  10662. (R978 ^value 1 +)
  10663. (R1 ^reward R978 +)
  10664. Retracting elaborate*copy-dir-to-output-link
  10665. -->
  10666. (I3 ^dir L +)
  10667. Retracting rl*prefer*rvt*predict-no*H0*2
  10668. -->
  10669. (S1 ^operator O1950 = 0.3140233963466647)
  10670. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  10671. -->
  10672. (S1 ^operator O1950 = -0.1479504104026684)
  10673. Retracting rl*prefer*rvt*predict-yes*H0*1
  10674. -->
  10675. (S1 ^operator O1949 = 0.3804143351598744)
  10676. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  10677. -->
  10678. (S1 ^operator O1949 = 0.6196158942331635)
  10679. =>WM: (13673: S1 ^operator O1952 +)
  10680. =>WM: (13672: S1 ^operator O1951 +)
  10681. =>WM: (13671: I3 ^dir R)
  10682. =>WM: (13670: O1952 ^name predict-no)
  10683. =>WM: (13669: O1951 ^name predict-yes)
  10684. =>WM: (13668: R979 ^value 1)
  10685. =>WM: (13667: R1 ^reward R979)
  10686. =>WM: (13666: I3 ^see 1)
  10687. <=WM: (13657: S1 ^operator O1949 +)
  10688. <=WM: (13659: S1 ^operator O1949)
  10689. <=WM: (13658: S1 ^operator O1950 +)
  10690. <=WM: (13656: I3 ^dir L)
  10691. <=WM: (13652: R1 ^reward R978)
  10692. <=WM: (13651: I3 ^see 0)
  10693. <=WM: (13655: O1950 ^name predict-no)
  10694. <=WM: (13654: O1949 ^name predict-yes)
  10695. <=WM: (13653: R978 ^value 1)
  10696. --- Inner Elaboration Phase, active level 1 (S1) ---
  10697. Firing prefer*rvt*predict-yes*H0
  10698. -->
  10699. Firing rl*prefer*rvt*predict-yes*H0*5
  10700. -->
  10701. (S1 ^operator O1951 = 0.2940010828283485)
  10702. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10703. -->
  10704. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10705. -->
  10706. (S1 ^operator O1951 = 0.7064496972060428)
  10707. Firing prefer*rvt*predict-no*H0
  10708. -->
  10709. Firing rl*prefer*rvt*predict-no*H0*6
  10710. -->
  10711. (S1 ^operator O1952 = 0.2298717920574965)
  10712. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10713. -->
  10714. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10715. -->
  10716. (S1 ^operator O1952 = -0.1937987592593187)
  10717. inner elaboration loop at bottom goal.
  10718. Retracting rl*prefer*rvt*predict-no*H0*6
  10719. -->
  10720. (S1 ^operator O1950 = 0.2298717920574965)
  10721. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10722. -->
  10723. (S1 ^operator O1950 = -0.1937987592593187)
  10724. Retracting rl*prefer*rvt*predict-yes*H0*5
  10725. -->
  10726. (S1 ^operator O1949 = 0.2940010828283485)
  10727. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10728. -->
  10729. (S1 ^operator O1949 = 0.7064496972060428)
  10730. --- END Proposal Phase ---
  10731. --- Decision Phase ---
  10732. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.825,0.145283)
  10733. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478689 0.140927 0.619616 -> 0.478686 0.140927 0.619613(R,m,v=1,1,0)
  10734. =>WM: (13674: S1 ^operator O1951)
  10735. 976: O: O1951 (predict-yes)
  10736. --- END Decision Phase ---
  10737. --- Application Phase ---
  10738. --- Firing Productions (PE) For State At Depth 1 ---
  10739. --- Inner Elaboration Phase, active level 1 (S1) ---
  10740. Firing apply*operator
  10741. -->
  10742. (I3 ^predict-yes N976 + :O )
  10743. Firing apply*operator*complete
  10744. -->
  10745. (I3 ^predict-yes N975 - :O )
  10746. inner elaboration loop at bottom goal.
  10747. --- Change Working Memory (PE) ---
  10748. =>WM: (13675: I3 ^predict-yes N976)
  10749. <=WM: (13661: N975 ^status complete)
  10750. <=WM: (13660: I3 ^predict-yes N975)
  10751. --- Firing Productions (IE) For State At Depth 1 ---
  10752. --- Inner Elaboration Phase, active level 1 (S1) ---
  10753. Firing monitor*world
  10754. -->
  10755. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10756. --- Change Working Memory (IE) ---
  10757. --- END Application Phase ---
  10758. --- Output Phase ---
  10759. ENV: Agent did: predict-yes for direction R in state State-A
  10760. In State-A moving R
  10761. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10762. predict error 0
  10763. dir: dir isR
  10764. --- END Output Phase ---
  10765. \-/--- Input Phase ---
  10766. =>WM: (13679: I2 ^dir R)
  10767. =>WM: (13678: I2 ^reward 1)
  10768. =>WM: (13677: I2 ^see 1)
  10769. =>WM: (13676: N976 ^status complete)
  10770. <=WM: (13664: I2 ^dir R)
  10771. <=WM: (13663: I2 ^reward 1)
  10772. <=WM: (13662: I2 ^see 1)
  10773. =>WM: (13680: I2 ^level-1 R1-root)
  10774. <=WM: (13665: I2 ^level-1 L1-root)
  10775. --- END Input Phase ---
  10776. --- Proposal Phase ---
  10777. --- Inner Elaboration Phase, active level 1 (S1) ---
  10778. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10779. -->
  10780. (S1 ^operator O1951 = -0.252585164213872)
  10781. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10782. -->
  10783. (S1 ^operator O1952 = 0.7701964997777864)
  10784. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10785. -->
  10786. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10787. -->
  10788. Firing elaborate*copy-see-to-output-link
  10789. -->
  10790. (I3 ^see 1 +)
  10791. Firing elaborate*reward*based*on*reward
  10792. -->
  10793. (R980 ^value 1 +)
  10794. (R1 ^reward R980 +)
  10795. Firing propose*predict-yes
  10796. -->
  10797. (O1953 ^name predict-yes +)
  10798. (S1 ^operator O1953 +)
  10799. Firing propose*predict-no
  10800. -->
  10801. (O1954 ^name predict-no +)
  10802. (S1 ^operator O1954 +)
  10803. Firing rl*prefer*rvt*predict-no*H0*6
  10804. -->
  10805. (S1 ^operator O1952 = 0.2298717920574965)
  10806. Firing rl*prefer*rvt*predict-yes*H0*5
  10807. -->
  10808. (S1 ^operator O1951 = 0.2940010828283485)
  10809. Firing prefer*rvt*predict-yes*H0
  10810. -->
  10811. Firing prefer*rvt*predict-no*H0
  10812. -->
  10813. Firing elaborate*copy-dir-to-output-link
  10814. -->
  10815. (I3 ^dir R +)
  10816. inner elaboration loop at bottom goal.
  10817. Retracting elaborate*copy-see-to-output-link
  10818. -->
  10819. (I3 ^see 1 +)
  10820. Retracting propose*predict-no
  10821. -->
  10822. (O1952 ^name predict-no +)
  10823. (S1 ^operator O1952 +)
  10824. Retracting propose*predict-yes
  10825. -->
  10826. (O1951 ^name predict-yes +)
  10827. (S1 ^operator O1951 +)
  10828. Retracting elaborate*reward*based*on*reward
  10829. -->
  10830. (R979 ^value 1 +)
  10831. (R1 ^reward R979 +)
  10832. Retracting elaborate*copy-dir-to-output-link
  10833. -->
  10834. (I3 ^dir R +)
  10835. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  10836. -->
  10837. (S1 ^operator O1952 = -0.1937987592593187)
  10838. Retracting rl*prefer*rvt*predict-no*H0*6
  10839. -->
  10840. (S1 ^operator O1952 = 0.2298717920574965)
  10841. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  10842. -->
  10843. (S1 ^operator O1951 = 0.7064496972060428)
  10844. Retracting rl*prefer*rvt*predict-yes*H0*5
  10845. -->
  10846. (S1 ^operator O1951 = 0.2940010828283485)
  10847. =>WM: (13686: S1 ^operator O1954 +)
  10848. =>WM: (13685: S1 ^operator O1953 +)
  10849. =>WM: (13684: O1954 ^name predict-no)
  10850. =>WM: (13683: O1953 ^name predict-yes)
  10851. =>WM: (13682: R980 ^value 1)
  10852. =>WM: (13681: R1 ^reward R980)
  10853. <=WM: (13672: S1 ^operator O1951 +)
  10854. <=WM: (13674: S1 ^operator O1951)
  10855. <=WM: (13673: S1 ^operator O1952 +)
  10856. <=WM: (13667: R1 ^reward R979)
  10857. <=WM: (13670: O1952 ^name predict-no)
  10858. <=WM: (13669: O1951 ^name predict-yes)
  10859. <=WM: (13668: R979 ^value 1)
  10860. --- Inner Elaboration Phase, active level 1 (S1) ---
  10861. Firing prefer*rvt*predict-yes*H0
  10862. -->
  10863. Firing rl*prefer*rvt*predict-yes*H0*5
  10864. -->
  10865. (S1 ^operator O1953 = 0.2940010828283485)
  10866. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10867. -->
  10868. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10869. -->
  10870. (S1 ^operator O1953 = -0.252585164213872)
  10871. Firing prefer*rvt*predict-no*H0
  10872. -->
  10873. Firing rl*prefer*rvt*predict-no*H0*6
  10874. -->
  10875. (S1 ^operator O1954 = 0.2298717920574965)
  10876. Firing prefer*rvt*predict-no*H0*6*v1*H1
  10877. -->
  10878. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10879. -->
  10880. (S1 ^operator O1954 = 0.7701964997777864)
  10881. inner elaboration loop at bottom goal.
  10882. Retracting rl*prefer*rvt*predict-no*H0*6
  10883. -->
  10884. (S1 ^operator O1952 = 0.2298717920574965)
  10885. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10886. -->
  10887. (S1 ^operator O1952 = 0.7701964997777864)
  10888. Retracting rl*prefer*rvt*predict-yes*H0*5
  10889. -->
  10890. (S1 ^operator O1951 = 0.2940010828283485)
  10891. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10892. -->
  10893. (S1 ^operator O1951 = -0.252585164213872)
  10894. --- END Proposal Phase ---
  10895. --- Decision Phase ---
  10896. RL update rl*prefer*rvt*predict-yes*H0*5 0.501074 -0.207073 0.294001 -> 0.50104 -0.207077 0.293964(R,m,v=1,0.84106,0.13457)
  10897. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499331 0.207118 0.70645 -> 0.499292 0.207114 0.706406(R,m,v=1,1,0)
  10898. =>WM: (13687: S1 ^operator O1954)
  10899. 977: O: O1954 (predict-no)
  10900. --- END Decision Phase ---
  10901. --- Application Phase ---
  10902. --- Firing Productions (PE) For State At Depth 1 ---
  10903. --- Inner Elaboration Phase, active level 1 (S1) ---
  10904. Firing apply*operator
  10905. -->
  10906. (I3 ^predict-no N977 + :O )
  10907. Firing apply*operator*complete
  10908. -->
  10909. (I3 ^predict-yes N976 - :O )
  10910. inner elaboration loop at bottom goal.
  10911. --- Change Working Memory (PE) ---
  10912. =>WM: (13688: I3 ^predict-no N977)
  10913. <=WM: (13676: N976 ^status complete)
  10914. <=WM: (13675: I3 ^predict-yes N976)
  10915. --- Firing Productions (IE) For State At Depth 1 ---
  10916. --- Inner Elaboration Phase, active level 1 (S1) ---
  10917. Firing monitor*world
  10918. -->
  10919. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10920. --- Change Working Memory (IE) ---
  10921. --- END Application Phase ---
  10922. --- Output Phase ---
  10923. ENV: Agent did: predict-no for direction R in state State-B
  10924. In State-B moving R
  10925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10926. predict error 0
  10927. dir: dir isU
  10928. --- END Output Phase ---
  10929. |\--- Input Phase ---
  10930. =>WM: (13692: I2 ^dir U)
  10931. =>WM: (13691: I2 ^reward 1)
  10932. =>WM: (13690: I2 ^see 0)
  10933. =>WM: (13689: N977 ^status complete)
  10934. <=WM: (13679: I2 ^dir R)
  10935. <=WM: (13678: I2 ^reward 1)
  10936. <=WM: (13677: I2 ^see 1)
  10937. =>WM: (13693: I2 ^level-1 R0-root)
  10938. <=WM: (13680: I2 ^level-1 R1-root)
  10939. --- END Input Phase ---
  10940. --- Proposal Phase ---
  10941. --- Inner Elaboration Phase, active level 1 (S1) ---
  10942. Firing elaborate*copy-see-to-output-link
  10943. -->
  10944. (I3 ^see 0 +)
  10945. Firing elaborate*reward*based*on*reward
  10946. -->
  10947. (R981 ^value 1 +)
  10948. (R1 ^reward R981 +)
  10949. Firing propose*predict-yes
  10950. -->
  10951. (O1955 ^name predict-yes +)
  10952. (S1 ^operator O1955 +)
  10953. Firing propose*predict-no
  10954. -->
  10955. (O1956 ^name predict-no +)
  10956. (S1 ^operator O1956 +)
  10957. Firing rl*prefer*rvt*predict-no*H0*4
  10958. -->
  10959. (S1 ^operator O1954 = 1.)
  10960. Firing rl*prefer*rvt*predict-yes*H0*3
  10961. -->
  10962. (S1 ^operator O1953 = 0.)
  10963. Firing prefer*rvt*predict-yes*H0
  10964. -->
  10965. Firing prefer*rvt*predict-no*H0
  10966. -->
  10967. Firing elaborate*copy-dir-to-output-link
  10968. -->
  10969. (I3 ^dir U +)
  10970. inner elaboration loop at bottom goal.
  10971. Retracting elaborate*copy-see-to-output-link
  10972. -->
  10973. (I3 ^see 1 +)
  10974. Retracting propose*predict-no
  10975. -->
  10976. (O1954 ^name predict-no +)
  10977. (S1 ^operator O1954 +)
  10978. Retracting propose*predict-yes
  10979. -->
  10980. (O1953 ^name predict-yes +)
  10981. (S1 ^operator O1953 +)
  10982. Retracting elaborate*reward*based*on*reward
  10983. -->
  10984. (R980 ^value 1 +)
  10985. (R1 ^reward R980 +)
  10986. Retracting elaborate*copy-dir-to-output-link
  10987. -->
  10988. (I3 ^dir R +)
  10989. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  10990. -->
  10991. (S1 ^operator O1954 = 0.7701964997777864)
  10992. Retracting rl*prefer*rvt*predict-no*H0*6
  10993. -->
  10994. (S1 ^operator O1954 = 0.2298717920574965)
  10995. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  10996. -->
  10997. (S1 ^operator O1953 = -0.252585164213872)
  10998. Retracting rl*prefer*rvt*predict-yes*H0*5
  10999. -->
  11000. (S1 ^operator O1953 = 0.2939636257009906)
  11001. =>WM: (13701: S1 ^operator O1956 +)
  11002. =>WM: (13700: S1 ^operator O1955 +)
  11003. =>WM: (13699: I3 ^dir U)
  11004. =>WM: (13698: O1956 ^name predict-no)
  11005. =>WM: (13697: O1955 ^name predict-yes)
  11006. =>WM: (13696: R981 ^value 1)
  11007. =>WM: (13695: R1 ^reward R981)
  11008. =>WM: (13694: I3 ^see 0)
  11009. <=WM: (13685: S1 ^operator O1953 +)
  11010. <=WM: (13686: S1 ^operator O1954 +)
  11011. <=WM: (13687: S1 ^operator O1954)
  11012. <=WM: (13671: I3 ^dir R)
  11013. <=WM: (13681: R1 ^reward R980)
  11014. <=WM: (13666: I3 ^see 1)
  11015. <=WM: (13684: O1954 ^name predict-no)
  11016. <=WM: (13683: O1953 ^name predict-yes)
  11017. <=WM: (13682: R980 ^value 1)
  11018. --- Inner Elaboration Phase, active level 1 (S1) ---
  11019. Firing prefer*rvt*predict-yes*H0
  11020. -->
  11021. Firing rl*prefer*rvt*predict-yes*H0*3
  11022. -->
  11023. (S1 ^operator O1955 = 0.)
  11024. Firing prefer*rvt*predict-no*H0
  11025. -->
  11026. Firing rl*prefer*rvt*predict-no*H0*4
  11027. -->
  11028. (S1 ^operator O1956 = 1.)
  11029. inner elaboration loop at bottom goal.
  11030. Retracting rl*prefer*rvt*predict-no*H0*4
  11031. -->
  11032. (S1 ^operator O1954 = 1.)
  11033. Retracting rl*prefer*rvt*predict-yes*H0*3
  11034. -->
  11035. (S1 ^operator O1953 = 0.)
  11036. --- END Proposal Phase ---
  11037. --- Decision Phase ---
  11038. RL update rl*prefer*rvt*predict-no*H0*6 0.611922 -0.38205 0.229872 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.843023,0.133109)
  11039. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388134 0.382063 0.770196 -> 0.388128 0.382061 0.77019(R,m,v=1,1,0)
  11040. =>WM: (13702: S1 ^operator O1956)
  11041. 978: O: O1956 (predict-no)
  11042. --- END Decision Phase ---
  11043. --- Application Phase ---
  11044. --- Firing Productions (PE) For State At Depth 1 ---
  11045. --- Inner Elaboration Phase, active level 1 (S1) ---
  11046. Firing apply*operator
  11047. -->
  11048. (I3 ^predict-no N978 + :O )
  11049. Firing apply*operator*complete
  11050. -->
  11051. (I3 ^predict-no N977 - :O )
  11052. inner elaboration loop at bottom goal.
  11053. --- Change Working Memory (PE) ---
  11054. =>WM: (13703: I3 ^predict-no N978)
  11055. <=WM: (13689: N977 ^status complete)
  11056. <=WM: (13688: I3 ^predict-no N977)
  11057. --- Firing Productions (IE) For State At Depth 1 ---
  11058. --- Inner Elaboration Phase, active level 1 (S1) ---
  11059. Firing monitor*world
  11060. -->
  11061. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11062. --- Change Working Memory (IE) ---
  11063. --- END Application Phase ---
  11064. --- Output Phase ---
  11065. ENV: Agent did: predict-no for direction U in state State-B
  11066. In State-B moving U
  11067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11068. predict error 0
  11069. dir: dir isU
  11070. --- END Output Phase ---
  11071. -/|--- Input Phase ---
  11072. =>WM: (13707: I2 ^dir U)
  11073. =>WM: (13706: I2 ^reward 1)
  11074. =>WM: (13705: I2 ^see 0)
  11075. =>WM: (13704: N978 ^status complete)
  11076. <=WM: (13692: I2 ^dir U)
  11077. <=WM: (13691: I2 ^reward 1)
  11078. <=WM: (13690: I2 ^see 0)
  11079. =>WM: (13708: I2 ^level-1 R0-root)
  11080. <=WM: (13693: I2 ^level-1 R0-root)
  11081. --- END Input Phase ---
  11082. --- Proposal Phase ---
  11083. --- Inner Elaboration Phase, active level 1 (S1) ---
  11084. Firing elaborate*copy-see-to-output-link
  11085. -->
  11086. (I3 ^see 0 +)
  11087. Firing elaborate*reward*based*on*reward
  11088. -->
  11089. (R982 ^value 1 +)
  11090. (R1 ^reward R982 +)
  11091. Firing propose*predict-yes
  11092. -->
  11093. (O1957 ^name predict-yes +)
  11094. (S1 ^operator O1957 +)
  11095. Firing propose*predict-no
  11096. -->
  11097. (O1958 ^name predict-no +)
  11098. (S1 ^operator O1958 +)
  11099. Firing rl*prefer*rvt*predict-no*H0*4
  11100. -->
  11101. (S1 ^operator O1956 = 1.)
  11102. Firing rl*prefer*rvt*predict-yes*H0*3
  11103. -->
  11104. (S1 ^operator O1955 = 0.)
  11105. Firing prefer*rvt*predict-yes*H0
  11106. -->
  11107. Firing prefer*rvt*predict-no*H0
  11108. -->
  11109. Firing elaborate*copy-dir-to-output-link
  11110. -->
  11111. (I3 ^dir U +)
  11112. inner elaboration loop at bottom goal.
  11113. Retracting elaborate*copy-see-to-output-link
  11114. -->
  11115. (I3 ^see 0 +)
  11116. Retracting propose*predict-no
  11117. -->
  11118. (O1956 ^name predict-no +)
  11119. (S1 ^operator O1956 +)
  11120. Retracting propose*predict-yes
  11121. -->
  11122. (O1955 ^name predict-yes +)
  11123. (S1 ^operator O1955 +)
  11124. Retracting elaborate*reward*based*on*reward
  11125. -->
  11126. (R981 ^value 1 +)
  11127. (R1 ^reward R981 +)
  11128. Retracting elaborate*copy-dir-to-output-link
  11129. -->
  11130. (I3 ^dir U +)
  11131. Retracting rl*prefer*rvt*predict-no*H0*4
  11132. -->
  11133. (S1 ^operator O1956 = 1.)
  11134. Retracting rl*prefer*rvt*predict-yes*H0*3
  11135. -->
  11136. (S1 ^operator O1955 = 0.)
  11137. =>WM: (13714: S1 ^operator O1958 +)
  11138. =>WM: (13713: S1 ^operator O1957 +)
  11139. =>WM: (13712: O1958 ^name predict-no)
  11140. =>WM: (13711: O1957 ^name predict-yes)
  11141. =>WM: (13710: R982 ^value 1)
  11142. =>WM: (13709: R1 ^reward R982)
  11143. <=WM: (13700: S1 ^operator O1955 +)
  11144. <=WM: (13701: S1 ^operator O1956 +)
  11145. <=WM: (13702: S1 ^operator O1956)
  11146. <=WM: (13695: R1 ^reward R981)
  11147. <=WM: (13698: O1956 ^name predict-no)
  11148. <=WM: (13697: O1955 ^name predict-yes)
  11149. <=WM: (13696: R981 ^value 1)
  11150. --- Inner Elaboration Phase, active level 1 (S1) ---
  11151. Firing prefer*rvt*predict-yes*H0
  11152. -->
  11153. Firing rl*prefer*rvt*predict-yes*H0*3
  11154. -->
  11155. (S1 ^operator O1957 = 0.)
  11156. Firing prefer*rvt*predict-no*H0
  11157. -->
  11158. Firing rl*prefer*rvt*predict-no*H0*4
  11159. -->
  11160. (S1 ^operator O1958 = 1.)
  11161. inner elaboration loop at bottom goal.
  11162. Retracting rl*prefer*rvt*predict-no*H0*4
  11163. -->
  11164. (S1 ^operator O1956 = 1.)
  11165. Retracting rl*prefer*rvt*predict-yes*H0*3
  11166. -->
  11167. (S1 ^operator O1955 = 0.)
  11168. --- END Proposal Phase ---
  11169. --- Decision Phase ---
  11170. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11171. =>WM: (13715: S1 ^operator O1958)
  11172. 979: O: O1958 (predict-no)
  11173. --- END Decision Phase ---
  11174. --- Application Phase ---
  11175. --- Firing Productions (PE) For State At Depth 1 ---
  11176. --- Inner Elaboration Phase, active level 1 (S1) ---
  11177. Firing apply*operator
  11178. -->
  11179. (I3 ^predict-no N979 + :O )
  11180. Firing apply*operator*complete
  11181. -->
  11182. (I3 ^predict-no N978 - :O )
  11183. inner elaboration loop at bottom goal.
  11184. --- Change Working Memory (PE) ---
  11185. =>WM: (13716: I3 ^predict-no N979)
  11186. <=WM: (13704: N978 ^status complete)
  11187. <=WM: (13703: I3 ^predict-no N978)
  11188. --- Firing Productions (IE) For State At Depth 1 ---
  11189. --- Inner Elaboration Phase, active level 1 (S1) ---
  11190. Firing monitor*world
  11191. -->
  11192. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11193. --- Change Working Memory (IE) ---
  11194. --- END Application Phase ---
  11195. --- Output Phase ---
  11196. ENV: Agent did: predict-no for direction U in state State-B
  11197. In State-B moving U
  11198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11199. predict error 0
  11200. dir: dir isL
  11201. --- END Output Phase ---
  11202. \---- Input Phase ---
  11203. =>WM: (13720: I2 ^dir L)
  11204. =>WM: (13719: I2 ^reward 1)
  11205. =>WM: (13718: I2 ^see 0)
  11206. =>WM: (13717: N979 ^status complete)
  11207. <=WM: (13707: I2 ^dir U)
  11208. <=WM: (13706: I2 ^reward 1)
  11209. <=WM: (13705: I2 ^see 0)
  11210. =>WM: (13721: I2 ^level-1 R0-root)
  11211. <=WM: (13708: I2 ^level-1 R0-root)
  11212. --- END Input Phase ---
  11213. --- Proposal Phase ---
  11214. --- Inner Elaboration Phase, active level 1 (S1) ---
  11215. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11216. -->
  11217. (S1 ^operator O1957 = 0.6195601949549704)
  11218. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11219. -->
  11220. (S1 ^operator O1958 = -0.2190661556260421)
  11221. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11222. -->
  11223. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11224. -->
  11225. Firing elaborate*copy-see-to-output-link
  11226. -->
  11227. (I3 ^see 0 +)
  11228. Firing elaborate*reward*based*on*reward
  11229. -->
  11230. (R983 ^value 1 +)
  11231. (R1 ^reward R983 +)
  11232. Firing propose*predict-yes
  11233. -->
  11234. (O1959 ^name predict-yes +)
  11235. (S1 ^operator O1959 +)
  11236. Firing propose*predict-no
  11237. -->
  11238. (O1960 ^name predict-no +)
  11239. (S1 ^operator O1960 +)
  11240. Firing rl*prefer*rvt*predict-no*H0*2
  11241. -->
  11242. (S1 ^operator O1958 = 0.3140233963466647)
  11243. Firing rl*prefer*rvt*predict-yes*H0*1
  11244. -->
  11245. (S1 ^operator O1957 = 0.3804118472151704)
  11246. Firing prefer*rvt*predict-yes*H0
  11247. -->
  11248. Firing prefer*rvt*predict-no*H0
  11249. -->
  11250. Firing elaborate*copy-dir-to-output-link
  11251. -->
  11252. (I3 ^dir L +)
  11253. inner elaboration loop at bottom goal.
  11254. Retracting elaborate*copy-see-to-output-link
  11255. -->
  11256. (I3 ^see 0 +)
  11257. Retracting propose*predict-no
  11258. -->
  11259. (O1958 ^name predict-no +)
  11260. (S1 ^operator O1958 +)
  11261. Retracting propose*predict-yes
  11262. -->
  11263. (O1957 ^name predict-yes +)
  11264. (S1 ^operator O1957 +)
  11265. Retracting elaborate*reward*based*on*reward
  11266. -->
  11267. (R982 ^value 1 +)
  11268. (R1 ^reward R982 +)
  11269. Retracting elaborate*copy-dir-to-output-link
  11270. -->
  11271. (I3 ^dir U +)
  11272. Retracting rl*prefer*rvt*predict-no*H0*4
  11273. -->
  11274. (S1 ^operator O1958 = 1.)
  11275. Retracting rl*prefer*rvt*predict-yes*H0*3
  11276. -->
  11277. (S1 ^operator O1957 = 0.)
  11278. =>WM: (13728: S1 ^operator O1960 +)
  11279. =>WM: (13727: S1 ^operator O1959 +)
  11280. =>WM: (13726: I3 ^dir L)
  11281. =>WM: (13725: O1960 ^name predict-no)
  11282. =>WM: (13724: O1959 ^name predict-yes)
  11283. =>WM: (13723: R983 ^value 1)
  11284. =>WM: (13722: R1 ^reward R983)
  11285. <=WM: (13713: S1 ^operator O1957 +)
  11286. <=WM: (13714: S1 ^operator O1958 +)
  11287. <=WM: (13715: S1 ^operator O1958)
  11288. <=WM: (13699: I3 ^dir U)
  11289. <=WM: (13709: R1 ^reward R982)
  11290. <=WM: (13712: O1958 ^name predict-no)
  11291. <=WM: (13711: O1957 ^name predict-yes)
  11292. <=WM: (13710: R982 ^value 1)
  11293. --- Inner Elaboration Phase, active level 1 (S1) ---
  11294. Firing prefer*rvt*predict-yes*H0
  11295. -->
  11296. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11297. -->
  11298. (S1 ^operator O1959 = 0.6195601949549704)
  11299. Firing rl*prefer*rvt*predict-yes*H0*1
  11300. -->
  11301. (S1 ^operator O1959 = 0.3804118472151704)
  11302. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11303. -->
  11304. Firing prefer*rvt*predict-no*H0
  11305. -->
  11306. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11307. -->
  11308. (S1 ^operator O1960 = -0.2190661556260421)
  11309. Firing rl*prefer*rvt*predict-no*H0*2
  11310. -->
  11311. (S1 ^operator O1960 = 0.3140233963466647)
  11312. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11313. -->
  11314. inner elaboration loop at bottom goal.
  11315. Retracting rl*prefer*rvt*predict-no*H0*2
  11316. -->
  11317. (S1 ^operator O1958 = 0.3140233963466647)
  11318. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11319. -->
  11320. (S1 ^operator O1958 = -0.2190661556260421)
  11321. Retracting rl*prefer*rvt*predict-yes*H0*1
  11322. -->
  11323. (S1 ^operator O1957 = 0.3804118472151704)
  11324. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11325. -->
  11326. (S1 ^operator O1957 = 0.6195601949549704)
  11327. --- END Proposal Phase ---
  11328. --- Decision Phase ---
  11329. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11330. =>WM: (13729: S1 ^operator O1959)
  11331. 980: O: O1959 (predict-yes)
  11332. --- END Decision Phase ---
  11333. --- Application Phase ---
  11334. --- Firing Productions (PE) For State At Depth 1 ---
  11335. --- Inner Elaboration Phase, active level 1 (S1) ---
  11336. Firing apply*operator
  11337. -->
  11338. (I3 ^predict-yes N980 + :O )
  11339. Firing apply*operator*complete
  11340. -->
  11341. (I3 ^predict-no N979 - :O )
  11342. inner elaboration loop at bottom goal.
  11343. --- Change Working Memory (PE) ---
  11344. =>WM: (13730: I3 ^predict-yes N980)
  11345. <=WM: (13717: N979 ^status complete)
  11346. <=WM: (13716: I3 ^predict-no N979)
  11347. --- Firing Productions (IE) For State At Depth 1 ---
  11348. --- Inner Elaboration Phase, active level 1 (S1) ---
  11349. Firing monitor*world
  11350. -->
  11351. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11352. --- Change Working Memory (IE) ---
  11353. --- END Application Phase ---
  11354. --- Output Phase ---
  11355. ENV: Agent did: predict-yes for direction L in state State-B
  11356. In State-B moving L
  11357. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11358. predict error 0
  11359. dir: dir isR
  11360. --- END Output Phase ---
  11361. /|\--- Input Phase ---
  11362. =>WM: (13734: I2 ^dir R)
  11363. =>WM: (13733: I2 ^reward 1)
  11364. =>WM: (13732: I2 ^see 1)
  11365. =>WM: (13731: N980 ^status complete)
  11366. <=WM: (13720: I2 ^dir L)
  11367. <=WM: (13719: I2 ^reward 1)
  11368. <=WM: (13718: I2 ^see 0)
  11369. =>WM: (13735: I2 ^level-1 L1-root)
  11370. <=WM: (13721: I2 ^level-1 R0-root)
  11371. --- END Input Phase ---
  11372. --- Proposal Phase ---
  11373. --- Inner Elaboration Phase, active level 1 (S1) ---
  11374. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11375. -->
  11376. (S1 ^operator O1959 = 0.7064055971121673)
  11377. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11378. -->
  11379. (S1 ^operator O1960 = -0.1937987592593187)
  11380. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11381. -->
  11382. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11383. -->
  11384. Firing elaborate*copy-see-to-output-link
  11385. -->
  11386. (I3 ^see 1 +)
  11387. Firing elaborate*reward*based*on*reward
  11388. -->
  11389. (R984 ^value 1 +)
  11390. (R1 ^reward R984 +)
  11391. Firing propose*predict-yes
  11392. -->
  11393. (O1961 ^name predict-yes +)
  11394. (S1 ^operator O1961 +)
  11395. Firing propose*predict-no
  11396. -->
  11397. (O1962 ^name predict-no +)
  11398. (S1 ^operator O1962 +)
  11399. Firing rl*prefer*rvt*predict-no*H0*6
  11400. -->
  11401. (S1 ^operator O1960 = 0.2298662376128736)
  11402. Firing rl*prefer*rvt*predict-yes*H0*5
  11403. -->
  11404. (S1 ^operator O1959 = 0.2939636257009906)
  11405. Firing prefer*rvt*predict-yes*H0
  11406. -->
  11407. Firing prefer*rvt*predict-no*H0
  11408. -->
  11409. Firing elaborate*copy-dir-to-output-link
  11410. -->
  11411. (I3 ^dir R +)
  11412. inner elaboration loop at bottom goal.
  11413. Retracting elaborate*copy-see-to-output-link
  11414. -->
  11415. (I3 ^see 0 +)
  11416. Retracting propose*predict-no
  11417. -->
  11418. (O1960 ^name predict-no +)
  11419. (S1 ^operator O1960 +)
  11420. Retracting propose*predict-yes
  11421. -->
  11422. (O1959 ^name predict-yes +)
  11423. (S1 ^operator O1959 +)
  11424. Retracting elaborate*reward*based*on*reward
  11425. -->
  11426. (R983 ^value 1 +)
  11427. (R1 ^reward R983 +)
  11428. Retracting elaborate*copy-dir-to-output-link
  11429. -->
  11430. (I3 ^dir L +)
  11431. Retracting rl*prefer*rvt*predict-no*H0*2
  11432. -->
  11433. (S1 ^operator O1960 = 0.3140233963466647)
  11434. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11435. -->
  11436. (S1 ^operator O1960 = -0.2190661556260421)
  11437. Retracting rl*prefer*rvt*predict-yes*H0*1
  11438. -->
  11439. (S1 ^operator O1959 = 0.3804118472151704)
  11440. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11441. -->
  11442. (S1 ^operator O1959 = 0.6195601949549704)
  11443. =>WM: (13743: S1 ^operator O1962 +)
  11444. =>WM: (13742: S1 ^operator O1961 +)
  11445. =>WM: (13741: I3 ^dir R)
  11446. =>WM: (13740: O1962 ^name predict-no)
  11447. =>WM: (13739: O1961 ^name predict-yes)
  11448. =>WM: (13738: R984 ^value 1)
  11449. =>WM: (13737: R1 ^reward R984)
  11450. =>WM: (13736: I3 ^see 1)
  11451. <=WM: (13727: S1 ^operator O1959 +)
  11452. <=WM: (13729: S1 ^operator O1959)
  11453. <=WM: (13728: S1 ^operator O1960 +)
  11454. <=WM: (13726: I3 ^dir L)
  11455. <=WM: (13722: R1 ^reward R983)
  11456. <=WM: (13694: I3 ^see 0)
  11457. <=WM: (13725: O1960 ^name predict-no)
  11458. <=WM: (13724: O1959 ^name predict-yes)
  11459. <=WM: (13723: R983 ^value 1)
  11460. --- Inner Elaboration Phase, active level 1 (S1) ---
  11461. Firing prefer*rvt*predict-yes*H0
  11462. -->
  11463. Firing rl*prefer*rvt*predict-yes*H0*5
  11464. -->
  11465. (S1 ^operator O1961 = 0.2939636257009906)
  11466. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11467. -->
  11468. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11469. -->
  11470. (S1 ^operator O1961 = 0.7064055971121673)
  11471. Firing prefer*rvt*predict-no*H0
  11472. -->
  11473. Firing rl*prefer*rvt*predict-no*H0*6
  11474. -->
  11475. (S1 ^operator O1962 = 0.2298662376128736)
  11476. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11477. -->
  11478. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11479. -->
  11480. (S1 ^operator O1962 = -0.1937987592593187)
  11481. inner elaboration loop at bottom goal.
  11482. Retracting rl*prefer*rvt*predict-no*H0*6
  11483. -->
  11484. (S1 ^operator O1960 = 0.2298662376128736)
  11485. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11486. -->
  11487. (S1 ^operator O1960 = -0.1937987592593187)
  11488. Retracting rl*prefer*rvt*predict-yes*H0*5
  11489. -->
  11490. (S1 ^operator O1959 = 0.2939636257009906)
  11491. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11492. -->
  11493. (S1 ^operator O1959 = 0.7064055971121673)
  11494. --- END Proposal Phase ---
  11495. --- Decision Phase ---
  11496. RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.826087,0.144565)
  11497. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478628 0.140932 0.61956 -> 0.478631 0.140932 0.619563(R,m,v=1,1,0)
  11498. =>WM: (13744: S1 ^operator O1961)
  11499. 981: O: O1961 (predict-yes)
  11500. --- END Decision Phase ---
  11501. --- Application Phase ---
  11502. --- Firing Productions (PE) For State At Depth 1 ---
  11503. --- Inner Elaboration Phase, active level 1 (S1) ---
  11504. Firing apply*operator
  11505. -->
  11506. (I3 ^predict-yes N981 + :O )
  11507. Firing apply*operator*complete
  11508. -->
  11509. (I3 ^predict-yes N980 - :O )
  11510. inner elaboration loop at bottom goal.
  11511. --- Change Working Memory (PE) ---
  11512. =>WM: (13745: I3 ^predict-yes N981)
  11513. <=WM: (13731: N980 ^status complete)
  11514. <=WM: (13730: I3 ^predict-yes N980)
  11515. --- Firing Productions (IE) For State At Depth 1 ---
  11516. --- Inner Elaboration Phase, active level 1 (S1) ---
  11517. Firing monitor*world
  11518. -->
  11519. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11520. --- Change Working Memory (IE) ---
  11521. --- END Application Phase ---
  11522. --- Output Phase ---
  11523. ENV: Agent did: predict-yes for direction R in state State-A
  11524. In State-A moving R
  11525. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11526. predict error 0
  11527. dir: dir isU
  11528. --- END Output Phase ---
  11529. ---- Input Phase ---
  11530. =>WM: (13749: I2 ^dir U)
  11531. =>WM: (13748: I2 ^reward 1)
  11532. =>WM: (13747: I2 ^see 1)
  11533. =>WM: (13746: N981 ^status complete)
  11534. <=WM: (13734: I2 ^dir R)
  11535. <=WM: (13733: I2 ^reward 1)
  11536. <=WM: (13732: I2 ^see 1)
  11537. =>WM: (13750: I2 ^level-1 R1-root)
  11538. <=WM: (13735: I2 ^level-1 L1-root)
  11539. --- END Input Phase ---
  11540. --- Proposal Phase ---
  11541. --- Inner Elaboration Phase, active level 1 (S1) ---
  11542. Firing elaborate*copy-see-to-output-link
  11543. -->
  11544. (I3 ^see 1 +)
  11545. Firing elaborate*reward*based*on*reward
  11546. -->
  11547. (R985 ^value 1 +)
  11548. (R1 ^reward R985 +)
  11549. Firing propose*predict-yes
  11550. -->
  11551. (O1963 ^name predict-yes +)
  11552. (S1 ^operator O1963 +)
  11553. Firing propose*predict-no
  11554. -->
  11555. (O1964 ^name predict-no +)
  11556. (S1 ^operator O1964 +)
  11557. Firing rl*prefer*rvt*predict-no*H0*4
  11558. -->
  11559. (S1 ^operator O1962 = 1.)
  11560. Firing rl*prefer*rvt*predict-yes*H0*3
  11561. -->
  11562. (S1 ^operator O1961 = 0.)
  11563. Firing prefer*rvt*predict-yes*H0
  11564. -->
  11565. Firing prefer*rvt*predict-no*H0
  11566. -->
  11567. Firing elaborate*copy-dir-to-output-link
  11568. -->
  11569. (I3 ^dir U +)
  11570. inner elaboration loop at bottom goal.
  11571. Retracting elaborate*copy-see-to-output-link
  11572. -->
  11573. (I3 ^see 1 +)
  11574. Retracting propose*predict-no
  11575. -->
  11576. (O1962 ^name predict-no +)
  11577. (S1 ^operator O1962 +)
  11578. Retracting propose*predict-yes
  11579. -->
  11580. (O1961 ^name predict-yes +)
  11581. (S1 ^operator O1961 +)
  11582. Retracting elaborate*reward*based*on*reward
  11583. -->
  11584. (R984 ^value 1 +)
  11585. (R1 ^reward R984 +)
  11586. Retracting elaborate*copy-dir-to-output-link
  11587. -->
  11588. (I3 ^dir R +)
  11589. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  11590. -->
  11591. (S1 ^operator O1962 = -0.1937987592593187)
  11592. Retracting rl*prefer*rvt*predict-no*H0*6
  11593. -->
  11594. (S1 ^operator O1962 = 0.2298662376128736)
  11595. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  11596. -->
  11597. (S1 ^operator O1961 = 0.7064055971121673)
  11598. Retracting rl*prefer*rvt*predict-yes*H0*5
  11599. -->
  11600. (S1 ^operator O1961 = 0.2939636257009906)
  11601. =>WM: (13757: S1 ^operator O1964 +)
  11602. =>WM: (13756: S1 ^operator O1963 +)
  11603. =>WM: (13755: I3 ^dir U)
  11604. =>WM: (13754: O1964 ^name predict-no)
  11605. =>WM: (13753: O1963 ^name predict-yes)
  11606. =>WM: (13752: R985 ^value 1)
  11607. =>WM: (13751: R1 ^reward R985)
  11608. <=WM: (13742: S1 ^operator O1961 +)
  11609. <=WM: (13744: S1 ^operator O1961)
  11610. <=WM: (13743: S1 ^operator O1962 +)
  11611. <=WM: (13741: I3 ^dir R)
  11612. <=WM: (13737: R1 ^reward R984)
  11613. <=WM: (13740: O1962 ^name predict-no)
  11614. <=WM: (13739: O1961 ^name predict-yes)
  11615. <=WM: (13738: R984 ^value 1)
  11616. --- Inner Elaboration Phase, active level 1 (S1) ---
  11617. Firing prefer*rvt*predict-yes*H0
  11618. -->
  11619. Firing rl*prefer*rvt*predict-yes*H0*3
  11620. -->
  11621. (S1 ^operator O1963 = 0.)
  11622. Firing prefer*rvt*predict-no*H0
  11623. -->
  11624. Firing rl*prefer*rvt*predict-no*H0*4
  11625. -->
  11626. (S1 ^operator O1964 = 1.)
  11627. inner elaboration loop at bottom goal.
  11628. Retracting rl*prefer*rvt*predict-no*H0*4
  11629. -->
  11630. (S1 ^operator O1962 = 1.)
  11631. Retracting rl*prefer*rvt*predict-yes*H0*3
  11632. -->
  11633. (S1 ^operator O1961 = 0.)
  11634. --- END Proposal Phase ---
  11635. --- Decision Phase ---
  11636. RL update rl*prefer*rvt*predict-yes*H0*5 0.50104 -0.207077 0.293964 -> 0.501013 -0.20708 0.293933(R,m,v=1,0.842105,0.133845)
  11637. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499292 0.207114 0.706406 -> 0.499259 0.20711 0.70637(R,m,v=1,1,0)
  11638. =>WM: (13758: S1 ^operator O1964)
  11639. 982: O: O1964 (predict-no)
  11640. --- END Decision Phase ---
  11641. --- Application Phase ---
  11642. --- Firing Productions (PE) For State At Depth 1 ---
  11643. --- Inner Elaboration Phase, active level 1 (S1) ---
  11644. Firing apply*operator
  11645. -->
  11646. (I3 ^predict-no N982 + :O )
  11647. Firing apply*operator*complete
  11648. -->
  11649. (I3 ^predict-yes N981 - :O )
  11650. inner elaboration loop at bottom goal.
  11651. --- Change Working Memory (PE) ---
  11652. =>WM: (13759: I3 ^predict-no N982)
  11653. <=WM: (13746: N981 ^status complete)
  11654. <=WM: (13745: I3 ^predict-yes N981)
  11655. --- Firing Productions (IE) For State At Depth 1 ---
  11656. --- Inner Elaboration Phase, active level 1 (S1) ---
  11657. Firing monitor*world
  11658. -->
  11659. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11660. --- Change Working Memory (IE) ---
  11661. --- END Application Phase ---
  11662. --- Output Phase ---
  11663. ENV: Agent did: predict-no for direction U in state State-B
  11664. In State-B moving U
  11665. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11666. predict error 0
  11667. dir: dir isR
  11668. --- END Output Phase ---
  11669. /|--- Input Phase ---
  11670. =>WM: (13763: I2 ^dir R)
  11671. =>WM: (13762: I2 ^reward 1)
  11672. =>WM: (13761: I2 ^see 0)
  11673. =>WM: (13760: N982 ^status complete)
  11674. <=WM: (13749: I2 ^dir U)
  11675. <=WM: (13748: I2 ^reward 1)
  11676. <=WM: (13747: I2 ^see 1)
  11677. =>WM: (13764: I2 ^level-1 R1-root)
  11678. <=WM: (13750: I2 ^level-1 R1-root)
  11679. --- END Input Phase ---
  11680. --- Proposal Phase ---
  11681. --- Inner Elaboration Phase, active level 1 (S1) ---
  11682. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11683. -->
  11684. (S1 ^operator O1963 = -0.252585164213872)
  11685. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11686. -->
  11687. (S1 ^operator O1964 = 0.7701897521634826)
  11688. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11689. -->
  11690. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11691. -->
  11692. Firing elaborate*copy-see-to-output-link
  11693. -->
  11694. (I3 ^see 0 +)
  11695. Firing elaborate*reward*based*on*reward
  11696. -->
  11697. (R986 ^value 1 +)
  11698. (R1 ^reward R986 +)
  11699. Firing propose*predict-yes
  11700. -->
  11701. (O1965 ^name predict-yes +)
  11702. (S1 ^operator O1965 +)
  11703. Firing propose*predict-no
  11704. -->
  11705. (O1966 ^name predict-no +)
  11706. (S1 ^operator O1966 +)
  11707. Firing rl*prefer*rvt*predict-no*H0*6
  11708. -->
  11709. (S1 ^operator O1964 = 0.2298662376128736)
  11710. Firing rl*prefer*rvt*predict-yes*H0*5
  11711. -->
  11712. (S1 ^operator O1963 = 0.2939329791093226)
  11713. Firing prefer*rvt*predict-yes*H0
  11714. -->
  11715. Firing prefer*rvt*predict-no*H0
  11716. -->
  11717. Firing elaborate*copy-dir-to-output-link
  11718. -->
  11719. (I3 ^dir R +)
  11720. inner elaboration loop at bottom goal.
  11721. Retracting elaborate*copy-see-to-output-link
  11722. -->
  11723. (I3 ^see 1 +)
  11724. Retracting propose*predict-no
  11725. -->
  11726. (O1964 ^name predict-no +)
  11727. (S1 ^operator O1964 +)
  11728. Retracting propose*predict-yes
  11729. -->
  11730. (O1963 ^name predict-yes +)
  11731. (S1 ^operator O1963 +)
  11732. Retracting elaborate*reward*based*on*reward
  11733. -->
  11734. (R985 ^value 1 +)
  11735. (R1 ^reward R985 +)
  11736. Retracting elaborate*copy-dir-to-output-link
  11737. -->
  11738. (I3 ^dir U +)
  11739. Retracting rl*prefer*rvt*predict-no*H0*4
  11740. -->
  11741. (S1 ^operator O1964 = 1.)
  11742. Retracting rl*prefer*rvt*predict-yes*H0*3
  11743. -->
  11744. (S1 ^operator O1963 = 0.)
  11745. =>WM: (13772: S1 ^operator O1966 +)
  11746. =>WM: (13771: S1 ^operator O1965 +)
  11747. =>WM: (13770: I3 ^dir R)
  11748. =>WM: (13769: O1966 ^name predict-no)
  11749. =>WM: (13768: O1965 ^name predict-yes)
  11750. =>WM: (13767: R986 ^value 1)
  11751. =>WM: (13766: R1 ^reward R986)
  11752. =>WM: (13765: I3 ^see 0)
  11753. <=WM: (13756: S1 ^operator O1963 +)
  11754. <=WM: (13757: S1 ^operator O1964 +)
  11755. <=WM: (13758: S1 ^operator O1964)
  11756. <=WM: (13755: I3 ^dir U)
  11757. <=WM: (13751: R1 ^reward R985)
  11758. <=WM: (13736: I3 ^see 1)
  11759. <=WM: (13754: O1964 ^name predict-no)
  11760. <=WM: (13753: O1963 ^name predict-yes)
  11761. <=WM: (13752: R985 ^value 1)
  11762. --- Inner Elaboration Phase, active level 1 (S1) ---
  11763. Firing prefer*rvt*predict-yes*H0
  11764. -->
  11765. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11766. -->
  11767. (S1 ^operator O1965 = -0.252585164213872)
  11768. Firing rl*prefer*rvt*predict-yes*H0*5
  11769. -->
  11770. (S1 ^operator O1965 = 0.2939329791093226)
  11771. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11772. -->
  11773. Firing prefer*rvt*predict-no*H0
  11774. -->
  11775. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11776. -->
  11777. (S1 ^operator O1966 = 0.7701897521634826)
  11778. Firing rl*prefer*rvt*predict-no*H0*6
  11779. -->
  11780. (S1 ^operator O1966 = 0.2298662376128736)
  11781. Firing prefer*rvt*predict-no*H0*6*v1*H1
  11782. -->
  11783. inner elaboration loop at bottom goal.
  11784. Retracting rl*prefer*rvt*predict-no*H0*6
  11785. -->
  11786. (S1 ^operator O1964 = 0.2298662376128736)
  11787. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11788. -->
  11789. (S1 ^operator O1964 = 0.7701897521634826)
  11790. Retracting rl*prefer*rvt*predict-yes*H0*5
  11791. -->
  11792. (S1 ^operator O1963 = 0.2939329791093226)
  11793. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11794. -->
  11795. (S1 ^operator O1963 = -0.252585164213872)
  11796. --- END Proposal Phase ---
  11797. --- Decision Phase ---
  11798. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11799. =>WM: (13773: S1 ^operator O1966)
  11800. 983: O: O1966 (predict-no)
  11801. --- END Decision Phase ---
  11802. --- Application Phase ---
  11803. --- Firing Productions (PE) For State At Depth 1 ---
  11804. --- Inner Elaboration Phase, active level 1 (S1) ---
  11805. Firing apply*operator
  11806. -->
  11807. (I3 ^predict-no N983 + :O )
  11808. Firing apply*operator*complete
  11809. -->
  11810. (I3 ^predict-no N982 - :O )
  11811. inner elaboration loop at bottom goal.
  11812. --- Change Working Memory (PE) ---
  11813. =>WM: (13774: I3 ^predict-no N983)
  11814. <=WM: (13760: N982 ^status complete)
  11815. <=WM: (13759: I3 ^predict-no N982)
  11816. --- Firing Productions (IE) For State At Depth 1 ---
  11817. --- Inner Elaboration Phase, active level 1 (S1) ---
  11818. Firing monitor*world
  11819. -->
  11820. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11821. --- Change Working Memory (IE) ---
  11822. --- END Application Phase ---
  11823. --- Output Phase ---
  11824. ENV: Agent did: predict-no for direction R in state State-B
  11825. In State-B moving R
  11826. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11827. predict error 0
  11828. dir: dir isL
  11829. --- END Output Phase ---
  11830. \---- Input Phase ---
  11831. =>WM: (13778: I2 ^dir L)
  11832. =>WM: (13777: I2 ^reward 1)
  11833. =>WM: (13776: I2 ^see 0)
  11834. =>WM: (13775: N983 ^status complete)
  11835. <=WM: (13763: I2 ^dir R)
  11836. <=WM: (13762: I2 ^reward 1)
  11837. <=WM: (13761: I2 ^see 0)
  11838. =>WM: (13779: I2 ^level-1 R0-root)
  11839. <=WM: (13764: I2 ^level-1 R1-root)
  11840. --- END Input Phase ---
  11841. --- Proposal Phase ---
  11842. --- Inner Elaboration Phase, active level 1 (S1) ---
  11843. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11844. -->
  11845. (S1 ^operator O1965 = 0.6195629046335391)
  11846. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11847. -->
  11848. (S1 ^operator O1966 = -0.2190661556260421)
  11849. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11850. -->
  11851. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11852. -->
  11853. Firing elaborate*copy-see-to-output-link
  11854. -->
  11855. (I3 ^see 0 +)
  11856. Firing elaborate*reward*based*on*reward
  11857. -->
  11858. (R987 ^value 1 +)
  11859. (R1 ^reward R987 +)
  11860. Firing propose*predict-yes
  11861. -->
  11862. (O1967 ^name predict-yes +)
  11863. (S1 ^operator O1967 +)
  11864. Firing propose*predict-no
  11865. -->
  11866. (O1968 ^name predict-no +)
  11867. (S1 ^operator O1968 +)
  11868. Firing rl*prefer*rvt*predict-no*H0*2
  11869. -->
  11870. (S1 ^operator O1966 = 0.3140233963466647)
  11871. Firing rl*prefer*rvt*predict-yes*H0*1
  11872. -->
  11873. (S1 ^operator O1965 = 0.3804141458478695)
  11874. Firing prefer*rvt*predict-yes*H0
  11875. -->
  11876. Firing prefer*rvt*predict-no*H0
  11877. -->
  11878. Firing elaborate*copy-dir-to-output-link
  11879. -->
  11880. (I3 ^dir L +)
  11881. inner elaboration loop at bottom goal.
  11882. Retracting elaborate*copy-see-to-output-link
  11883. -->
  11884. (I3 ^see 0 +)
  11885. Retracting propose*predict-no
  11886. -->
  11887. (O1966 ^name predict-no +)
  11888. (S1 ^operator O1966 +)
  11889. Retracting propose*predict-yes
  11890. -->
  11891. (O1965 ^name predict-yes +)
  11892. (S1 ^operator O1965 +)
  11893. Retracting elaborate*reward*based*on*reward
  11894. -->
  11895. (R986 ^value 1 +)
  11896. (R1 ^reward R986 +)
  11897. Retracting elaborate*copy-dir-to-output-link
  11898. -->
  11899. (I3 ^dir R +)
  11900. Retracting rl*prefer*rvt*predict-no*H0*6
  11901. -->
  11902. (S1 ^operator O1966 = 0.2298662376128736)
  11903. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  11904. -->
  11905. (S1 ^operator O1966 = 0.7701897521634826)
  11906. Retracting rl*prefer*rvt*predict-yes*H0*5
  11907. -->
  11908. (S1 ^operator O1965 = 0.2939329791093226)
  11909. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11910. -->
  11911. (S1 ^operator O1965 = -0.252585164213872)
  11912. =>WM: (13786: S1 ^operator O1968 +)
  11913. =>WM: (13785: S1 ^operator O1967 +)
  11914. =>WM: (13784: I3 ^dir L)
  11915. =>WM: (13783: O1968 ^name predict-no)
  11916. =>WM: (13782: O1967 ^name predict-yes)
  11917. =>WM: (13781: R987 ^value 1)
  11918. =>WM: (13780: R1 ^reward R987)
  11919. <=WM: (13771: S1 ^operator O1965 +)
  11920. <=WM: (13772: S1 ^operator O1966 +)
  11921. <=WM: (13773: S1 ^operator O1966)
  11922. <=WM: (13770: I3 ^dir R)
  11923. <=WM: (13766: R1 ^reward R986)
  11924. <=WM: (13769: O1966 ^name predict-no)
  11925. <=WM: (13768: O1965 ^name predict-yes)
  11926. <=WM: (13767: R986 ^value 1)
  11927. --- Inner Elaboration Phase, active level 1 (S1) ---
  11928. Firing prefer*rvt*predict-yes*H0
  11929. -->
  11930. Firing rl*prefer*rvt*predict-yes*H0*1
  11931. -->
  11932. (S1 ^operator O1967 = 0.3804141458478695)
  11933. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  11934. -->
  11935. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11936. -->
  11937. (S1 ^operator O1967 = 0.6195629046335391)
  11938. Firing prefer*rvt*predict-no*H0
  11939. -->
  11940. Firing rl*prefer*rvt*predict-no*H0*2
  11941. -->
  11942. (S1 ^operator O1968 = 0.3140233963466647)
  11943. Firing prefer*rvt*predict-no*H0*2*v1*H1
  11944. -->
  11945. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11946. -->
  11947. (S1 ^operator O1968 = -0.2190661556260421)
  11948. inner elaboration loop at bottom goal.
  11949. Retracting rl*prefer*rvt*predict-no*H0*2
  11950. -->
  11951. (S1 ^operator O1966 = 0.3140233963466647)
  11952. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  11953. -->
  11954. (S1 ^operator O1966 = -0.2190661556260421)
  11955. Retracting rl*prefer*rvt*predict-yes*H0*1
  11956. -->
  11957. (S1 ^operator O1965 = 0.3804141458478695)
  11958. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  11959. -->
  11960. (S1 ^operator O1965 = 0.6195629046335391)
  11961. --- END Proposal Phase ---
  11962. --- Decision Phase ---
  11963. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611913 -0.382052 0.229862(R,m,v=1,0.843931,0.132477)
  11964. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388128 0.382061 0.77019 -> 0.388124 0.38206 0.770184(R,m,v=1,1,0)
  11965. =>WM: (13787: S1 ^operator O1967)
  11966. 984: O: O1967 (predict-yes)
  11967. --- END Decision Phase ---
  11968. --- Application Phase ---
  11969. --- Firing Productions (PE) For State At Depth 1 ---
  11970. --- Inner Elaboration Phase, active level 1 (S1) ---
  11971. Firing apply*operator
  11972. -->
  11973. (I3 ^predict-yes N984 + :O )
  11974. Firing apply*operator*complete
  11975. -->
  11976. (I3 ^predict-no N983 - :O )
  11977. inner elaboration loop at bottom goal.
  11978. --- Change Working Memory (PE) ---
  11979. =>WM: (13788: I3 ^predict-yes N984)
  11980. <=WM: (13775: N983 ^status complete)
  11981. <=WM: (13774: I3 ^predict-no N983)
  11982. --- Firing Productions (IE) For State At Depth 1 ---
  11983. --- Inner Elaboration Phase, active level 1 (S1) ---
  11984. Firing monitor*world
  11985. -->
  11986. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11987. --- Change Working Memory (IE) ---
  11988. --- END Application Phase ---
  11989. --- Output Phase ---
  11990. ENV: Agent did: predict-yes for direction L in state State-B
  11991. In State-B moving L
  11992. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11993. predict error 0
  11994. dir: dir isU
  11995. --- END Output Phase ---
  11996. /|\--- Input Phase ---
  11997. =>WM: (13792: I2 ^dir U)
  11998. =>WM: (13791: I2 ^reward 1)
  11999. =>WM: (13790: I2 ^see 1)
  12000. =>WM: (13789: N984 ^status complete)
  12001. <=WM: (13778: I2 ^dir L)
  12002. <=WM: (13777: I2 ^reward 1)
  12003. <=WM: (13776: I2 ^see 0)
  12004. =>WM: (13793: I2 ^level-1 L1-root)
  12005. <=WM: (13779: I2 ^level-1 R0-root)
  12006. --- END Input Phase ---
  12007. --- Proposal Phase ---
  12008. --- Inner Elaboration Phase, active level 1 (S1) ---
  12009. Firing elaborate*copy-see-to-output-link
  12010. -->
  12011. (I3 ^see 1 +)
  12012. Firing elaborate*reward*based*on*reward
  12013. -->
  12014. (R988 ^value 1 +)
  12015. (R1 ^reward R988 +)
  12016. Firing propose*predict-yes
  12017. -->
  12018. (O1969 ^name predict-yes +)
  12019. (S1 ^operator O1969 +)
  12020. Firing propose*predict-no
  12021. -->
  12022. (O1970 ^name predict-no +)
  12023. (S1 ^operator O1970 +)
  12024. Firing rl*prefer*rvt*predict-no*H0*4
  12025. -->
  12026. (S1 ^operator O1968 = 1.)
  12027. Firing rl*prefer*rvt*predict-yes*H0*3
  12028. -->
  12029. (S1 ^operator O1967 = 0.)
  12030. Firing prefer*rvt*predict-yes*H0
  12031. -->
  12032. Firing prefer*rvt*predict-no*H0
  12033. -->
  12034. Firing elaborate*copy-dir-to-output-link
  12035. -->
  12036. (I3 ^dir U +)
  12037. inner elaboration loop at bottom goal.
  12038. Retracting elaborate*copy-see-to-output-link
  12039. -->
  12040. (I3 ^see 0 +)
  12041. Retracting propose*predict-no
  12042. -->
  12043. (O1968 ^name predict-no +)
  12044. (S1 ^operator O1968 +)
  12045. Retracting propose*predict-yes
  12046. -->
  12047. (O1967 ^name predict-yes +)
  12048. (S1 ^operator O1967 +)
  12049. Retracting elaborate*reward*based*on*reward
  12050. -->
  12051. (R987 ^value 1 +)
  12052. (R1 ^reward R987 +)
  12053. Retracting elaborate*copy-dir-to-output-link
  12054. -->
  12055. (I3 ^dir L +)
  12056. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12057. -->
  12058. (S1 ^operator O1968 = -0.2190661556260421)
  12059. Retracting rl*prefer*rvt*predict-no*H0*2
  12060. -->
  12061. (S1 ^operator O1968 = 0.3140233963466647)
  12062. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12063. -->
  12064. (S1 ^operator O1967 = 0.6195629046335391)
  12065. Retracting rl*prefer*rvt*predict-yes*H0*1
  12066. -->
  12067. (S1 ^operator O1967 = 0.3804141458478695)
  12068. =>WM: (13801: S1 ^operator O1970 +)
  12069. =>WM: (13800: S1 ^operator O1969 +)
  12070. =>WM: (13799: I3 ^dir U)
  12071. =>WM: (13798: O1970 ^name predict-no)
  12072. =>WM: (13797: O1969 ^name predict-yes)
  12073. =>WM: (13796: R988 ^value 1)
  12074. =>WM: (13795: R1 ^reward R988)
  12075. =>WM: (13794: I3 ^see 1)
  12076. <=WM: (13785: S1 ^operator O1967 +)
  12077. <=WM: (13787: S1 ^operator O1967)
  12078. <=WM: (13786: S1 ^operator O1968 +)
  12079. <=WM: (13784: I3 ^dir L)
  12080. <=WM: (13780: R1 ^reward R987)
  12081. <=WM: (13765: I3 ^see 0)
  12082. <=WM: (13783: O1968 ^name predict-no)
  12083. <=WM: (13782: O1967 ^name predict-yes)
  12084. <=WM: (13781: R987 ^value 1)
  12085. --- Inner Elaboration Phase, active level 1 (S1) ---
  12086. Firing prefer*rvt*predict-yes*H0
  12087. -->
  12088. Firing rl*prefer*rvt*predict-yes*H0*3
  12089. -->
  12090. (S1 ^operator O1969 = 0.)
  12091. Firing prefer*rvt*predict-no*H0
  12092. -->
  12093. Firing rl*prefer*rvt*predict-no*H0*4
  12094. -->
  12095. (S1 ^operator O1970 = 1.)
  12096. inner elaboration loop at bottom goal.
  12097. Retracting rl*prefer*rvt*predict-no*H0*4
  12098. -->
  12099. (S1 ^operator O1968 = 1.)
  12100. Retracting rl*prefer*rvt*predict-yes*H0*3
  12101. -->
  12102. (S1 ^operator O1967 = 0.)
  12103. --- END Proposal Phase ---
  12104. --- Decision Phase ---
  12105. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521346 -0.14093 0.380416(R,m,v=1,0.82716,0.143854)
  12106. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478631 0.140932 0.619563 -> 0.478633 0.140932 0.619565(R,m,v=1,1,0)
  12107. =>WM: (13802: S1 ^operator O1970)
  12108. 985: O: O1970 (predict-no)
  12109. --- END Decision Phase ---
  12110. --- Application Phase ---
  12111. --- Firing Productions (PE) For State At Depth 1 ---
  12112. --- Inner Elaboration Phase, active level 1 (S1) ---
  12113. Firing apply*operator
  12114. -->
  12115. (I3 ^predict-no N985 + :O )
  12116. Firing apply*operator*complete
  12117. -->
  12118. (I3 ^predict-yes N984 - :O )
  12119. inner elaboration loop at bottom goal.
  12120. --- Change Working Memory (PE) ---
  12121. =>WM: (13803: I3 ^predict-no N985)
  12122. <=WM: (13789: N984 ^status complete)
  12123. <=WM: (13788: I3 ^predict-yes N984)
  12124. --- Firing Productions (IE) For State At Depth 1 ---
  12125. --- Inner Elaboration Phase, active level 1 (S1) ---
  12126. Firing monitor*world
  12127. -->
  12128. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12129. --- Change Working Memory (IE) ---
  12130. --- END Application Phase ---
  12131. --- Output Phase ---
  12132. ENV: Agent did: predict-no for direction U in state State-A
  12133. In State-A moving U
  12134. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12135. predict error 0
  12136. dir: dir isR
  12137. --- END Output Phase ---
  12138. -/|--- Input Phase ---
  12139. =>WM: (13807: I2 ^dir R)
  12140. =>WM: (13806: I2 ^reward 1)
  12141. =>WM: (13805: I2 ^see 0)
  12142. =>WM: (13804: N985 ^status complete)
  12143. <=WM: (13792: I2 ^dir U)
  12144. <=WM: (13791: I2 ^reward 1)
  12145. <=WM: (13790: I2 ^see 1)
  12146. =>WM: (13808: I2 ^level-1 L1-root)
  12147. <=WM: (13793: I2 ^level-1 L1-root)
  12148. --- END Input Phase ---
  12149. --- Proposal Phase ---
  12150. --- Inner Elaboration Phase, active level 1 (S1) ---
  12151. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12152. -->
  12153. (S1 ^operator O1969 = 0.7063695903698597)
  12154. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12155. -->
  12156. (S1 ^operator O1970 = -0.1937987592593187)
  12157. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12158. -->
  12159. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12160. -->
  12161. Firing elaborate*copy-see-to-output-link
  12162. -->
  12163. (I3 ^see 0 +)
  12164. Firing elaborate*reward*based*on*reward
  12165. -->
  12166. (R989 ^value 1 +)
  12167. (R1 ^reward R989 +)
  12168. Firing propose*predict-yes
  12169. -->
  12170. (O1971 ^name predict-yes +)
  12171. (S1 ^operator O1971 +)
  12172. Firing propose*predict-no
  12173. -->
  12174. (O1972 ^name predict-no +)
  12175. (S1 ^operator O1972 +)
  12176. Firing rl*prefer*rvt*predict-no*H0*6
  12177. -->
  12178. (S1 ^operator O1970 = 0.2298616880335552)
  12179. Firing rl*prefer*rvt*predict-yes*H0*5
  12180. -->
  12181. (S1 ^operator O1969 = 0.2939329791093226)
  12182. Firing prefer*rvt*predict-yes*H0
  12183. -->
  12184. Firing prefer*rvt*predict-no*H0
  12185. -->
  12186. Firing elaborate*copy-dir-to-output-link
  12187. -->
  12188. (I3 ^dir R +)
  12189. inner elaboration loop at bottom goal.
  12190. Retracting elaborate*copy-see-to-output-link
  12191. -->
  12192. (I3 ^see 1 +)
  12193. Retracting propose*predict-no
  12194. -->
  12195. (O1970 ^name predict-no +)
  12196. (S1 ^operator O1970 +)
  12197. Retracting propose*predict-yes
  12198. -->
  12199. (O1969 ^name predict-yes +)
  12200. (S1 ^operator O1969 +)
  12201. Retracting elaborate*reward*based*on*reward
  12202. -->
  12203. (R988 ^value 1 +)
  12204. (R1 ^reward R988 +)
  12205. Retracting elaborate*copy-dir-to-output-link
  12206. -->
  12207. (I3 ^dir U +)
  12208. Retracting rl*prefer*rvt*predict-no*H0*4
  12209. -->
  12210. (S1 ^operator O1970 = 1.)
  12211. Retracting rl*prefer*rvt*predict-yes*H0*3
  12212. -->
  12213. (S1 ^operator O1969 = 0.)
  12214. =>WM: (13816: S1 ^operator O1972 +)
  12215. =>WM: (13815: S1 ^operator O1971 +)
  12216. =>WM: (13814: I3 ^dir R)
  12217. =>WM: (13813: O1972 ^name predict-no)
  12218. =>WM: (13812: O1971 ^name predict-yes)
  12219. =>WM: (13811: R989 ^value 1)
  12220. =>WM: (13810: R1 ^reward R989)
  12221. =>WM: (13809: I3 ^see 0)
  12222. <=WM: (13800: S1 ^operator O1969 +)
  12223. <=WM: (13801: S1 ^operator O1970 +)
  12224. <=WM: (13802: S1 ^operator O1970)
  12225. <=WM: (13799: I3 ^dir U)
  12226. <=WM: (13795: R1 ^reward R988)
  12227. <=WM: (13794: I3 ^see 1)
  12228. <=WM: (13798: O1970 ^name predict-no)
  12229. <=WM: (13797: O1969 ^name predict-yes)
  12230. <=WM: (13796: R988 ^value 1)
  12231. --- Inner Elaboration Phase, active level 1 (S1) ---
  12232. Firing prefer*rvt*predict-yes*H0
  12233. -->
  12234. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12235. -->
  12236. (S1 ^operator O1971 = 0.7063695903698597)
  12237. Firing rl*prefer*rvt*predict-yes*H0*5
  12238. -->
  12239. (S1 ^operator O1971 = 0.2939329791093226)
  12240. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12241. -->
  12242. Firing prefer*rvt*predict-no*H0
  12243. -->
  12244. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12245. -->
  12246. (S1 ^operator O1972 = -0.1937987592593187)
  12247. Firing rl*prefer*rvt*predict-no*H0*6
  12248. -->
  12249. (S1 ^operator O1972 = 0.2298616880335552)
  12250. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12251. -->
  12252. inner elaboration loop at bottom goal.
  12253. Retracting rl*prefer*rvt*predict-no*H0*6
  12254. -->
  12255. (S1 ^operator O1970 = 0.2298616880335552)
  12256. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12257. -->
  12258. (S1 ^operator O1970 = -0.1937987592593187)
  12259. Retracting rl*prefer*rvt*predict-yes*H0*5
  12260. -->
  12261. (S1 ^operator O1969 = 0.2939329791093226)
  12262. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12263. -->
  12264. (S1 ^operator O1969 = 0.7063695903698597)
  12265. --- END Proposal Phase ---
  12266. --- Decision Phase ---
  12267. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12268. =>WM: (13817: S1 ^operator O1971)
  12269. 986: O: O1971 (predict-yes)
  12270. --- END Decision Phase ---
  12271. --- Application Phase ---
  12272. --- Firing Productions (PE) For State At Depth 1 ---
  12273. --- Inner Elaboration Phase, active level 1 (S1) ---
  12274. Firing apply*operator
  12275. -->
  12276. (I3 ^predict-yes N986 + :O )
  12277. Firing apply*operator*complete
  12278. -->
  12279. (I3 ^predict-no N985 - :O )
  12280. inner elaboration loop at bottom goal.
  12281. --- Change Working Memory (PE) ---
  12282. =>WM: (13818: I3 ^predict-yes N986)
  12283. <=WM: (13804: N985 ^status complete)
  12284. <=WM: (13803: I3 ^predict-no N985)
  12285. --- Firing Productions (IE) For State At Depth 1 ---
  12286. --- Inner Elaboration Phase, active level 1 (S1) ---
  12287. Firing monitor*world
  12288. -->
  12289. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12290. --- Change Working Memory (IE) ---
  12291. --- END Application Phase ---
  12292. --- Output Phase ---
  12293. ENV: Agent did: predict-yes for direction R in state State-A
  12294. In State-A moving R
  12295. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12296. predict error 0
  12297. dir: dir isR
  12298. --- END Output Phase ---
  12299. \-/--- Input Phase ---
  12300. =>WM: (13822: I2 ^dir R)
  12301. =>WM: (13821: I2 ^reward 1)
  12302. =>WM: (13820: I2 ^see 1)
  12303. =>WM: (13819: N986 ^status complete)
  12304. <=WM: (13807: I2 ^dir R)
  12305. <=WM: (13806: I2 ^reward 1)
  12306. <=WM: (13805: I2 ^see 0)
  12307. =>WM: (13823: I2 ^level-1 R1-root)
  12308. <=WM: (13808: I2 ^level-1 L1-root)
  12309. --- END Input Phase ---
  12310. --- Proposal Phase ---
  12311. --- Inner Elaboration Phase, active level 1 (S1) ---
  12312. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12313. -->
  12314. (S1 ^operator O1971 = -0.252585164213872)
  12315. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12316. -->
  12317. (S1 ^operator O1972 = 0.7701842386860367)
  12318. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12319. -->
  12320. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12321. -->
  12322. Firing elaborate*copy-see-to-output-link
  12323. -->
  12324. (I3 ^see 1 +)
  12325. Firing elaborate*reward*based*on*reward
  12326. -->
  12327. (R990 ^value 1 +)
  12328. (R1 ^reward R990 +)
  12329. Firing propose*predict-yes
  12330. -->
  12331. (O1973 ^name predict-yes +)
  12332. (S1 ^operator O1973 +)
  12333. Firing propose*predict-no
  12334. -->
  12335. (O1974 ^name predict-no +)
  12336. (S1 ^operator O1974 +)
  12337. Firing rl*prefer*rvt*predict-no*H0*6
  12338. -->
  12339. (S1 ^operator O1972 = 0.2298616880335552)
  12340. Firing rl*prefer*rvt*predict-yes*H0*5
  12341. -->
  12342. (S1 ^operator O1971 = 0.2939329791093226)
  12343. Firing prefer*rvt*predict-yes*H0
  12344. -->
  12345. Firing prefer*rvt*predict-no*H0
  12346. -->
  12347. Firing elaborate*copy-dir-to-output-link
  12348. -->
  12349. (I3 ^dir R +)
  12350. inner elaboration loop at bottom goal.
  12351. Retracting elaborate*copy-see-to-output-link
  12352. -->
  12353. (I3 ^see 0 +)
  12354. Retracting propose*predict-no
  12355. -->
  12356. (O1972 ^name predict-no +)
  12357. (S1 ^operator O1972 +)
  12358. Retracting propose*predict-yes
  12359. -->
  12360. (O1971 ^name predict-yes +)
  12361. (S1 ^operator O1971 +)
  12362. Retracting elaborate*reward*based*on*reward
  12363. -->
  12364. (R989 ^value 1 +)
  12365. (R1 ^reward R989 +)
  12366. Retracting elaborate*copy-dir-to-output-link
  12367. -->
  12368. (I3 ^dir R +)
  12369. Retracting rl*prefer*rvt*predict-no*H0*6
  12370. -->
  12371. (S1 ^operator O1972 = 0.2298616880335552)
  12372. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12373. -->
  12374. (S1 ^operator O1972 = -0.1937987592593187)
  12375. Retracting rl*prefer*rvt*predict-yes*H0*5
  12376. -->
  12377. (S1 ^operator O1971 = 0.2939329791093226)
  12378. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12379. -->
  12380. (S1 ^operator O1971 = 0.7063695903698597)
  12381. =>WM: (13830: S1 ^operator O1974 +)
  12382. =>WM: (13829: S1 ^operator O1973 +)
  12383. =>WM: (13828: O1974 ^name predict-no)
  12384. =>WM: (13827: O1973 ^name predict-yes)
  12385. =>WM: (13826: R990 ^value 1)
  12386. =>WM: (13825: R1 ^reward R990)
  12387. =>WM: (13824: I3 ^see 1)
  12388. <=WM: (13815: S1 ^operator O1971 +)
  12389. <=WM: (13817: S1 ^operator O1971)
  12390. <=WM: (13816: S1 ^operator O1972 +)
  12391. <=WM: (13810: R1 ^reward R989)
  12392. <=WM: (13809: I3 ^see 0)
  12393. <=WM: (13813: O1972 ^name predict-no)
  12394. <=WM: (13812: O1971 ^name predict-yes)
  12395. <=WM: (13811: R989 ^value 1)
  12396. --- Inner Elaboration Phase, active level 1 (S1) ---
  12397. Firing prefer*rvt*predict-yes*H0
  12398. -->
  12399. Firing rl*prefer*rvt*predict-yes*H0*5
  12400. -->
  12401. (S1 ^operator O1973 = 0.2939329791093226)
  12402. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12403. -->
  12404. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12405. -->
  12406. (S1 ^operator O1973 = -0.252585164213872)
  12407. Firing prefer*rvt*predict-no*H0
  12408. -->
  12409. Firing rl*prefer*rvt*predict-no*H0*6
  12410. -->
  12411. (S1 ^operator O1974 = 0.2298616880335552)
  12412. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12413. -->
  12414. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12415. -->
  12416. (S1 ^operator O1974 = 0.7701842386860367)
  12417. inner elaboration loop at bottom goal.
  12418. Retracting rl*prefer*rvt*predict-no*H0*6
  12419. -->
  12420. (S1 ^operator O1972 = 0.2298616880335552)
  12421. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12422. -->
  12423. (S1 ^operator O1972 = 0.7701842386860367)
  12424. Retracting rl*prefer*rvt*predict-yes*H0*5
  12425. -->
  12426. (S1 ^operator O1971 = 0.2939329791093226)
  12427. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12428. -->
  12429. (S1 ^operator O1971 = -0.252585164213872)
  12430. --- END Proposal Phase ---
  12431. --- Decision Phase ---
  12432. RL update rl*prefer*rvt*predict-yes*H0*5 0.501013 -0.20708 0.293933 -> 0.50099 -0.207082 0.293908(R,m,v=1,0.843137,0.133127)
  12433. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499259 0.20711 0.70637 -> 0.499233 0.207107 0.70634(R,m,v=1,1,0)
  12434. =>WM: (13831: S1 ^operator O1974)
  12435. 987: O: O1974 (predict-no)
  12436. --- END Decision Phase ---
  12437. --- Application Phase ---
  12438. --- Firing Productions (PE) For State At Depth 1 ---
  12439. --- Inner Elaboration Phase, active level 1 (S1) ---
  12440. Firing apply*operator
  12441. -->
  12442. (I3 ^predict-no N987 + :O )
  12443. Firing apply*operator*complete
  12444. -->
  12445. (I3 ^predict-yes N986 - :O )
  12446. inner elaboration loop at bottom goal.
  12447. --- Change Working Memory (PE) ---
  12448. =>WM: (13832: I3 ^predict-no N987)
  12449. <=WM: (13819: N986 ^status complete)
  12450. <=WM: (13818: I3 ^predict-yes N986)
  12451. --- Firing Productions (IE) For State At Depth 1 ---
  12452. --- Inner Elaboration Phase, active level 1 (S1) ---
  12453. Firing monitor*world
  12454. -->
  12455. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12456. --- Change Working Memory (IE) ---
  12457. --- END Application Phase ---
  12458. --- Output Phase ---
  12459. ENV: Agent did: predict-no for direction R in state State-B
  12460. In State-B moving R
  12461. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12462. predict error 0
  12463. dir: dir isL
  12464. --- END Output Phase ---
  12465. |\---- Input Phase ---
  12466. =>WM: (13836: I2 ^dir L)
  12467. =>WM: (13835: I2 ^reward 1)
  12468. =>WM: (13834: I2 ^see 0)
  12469. =>WM: (13833: N987 ^status complete)
  12470. <=WM: (13822: I2 ^dir R)
  12471. <=WM: (13821: I2 ^reward 1)
  12472. <=WM: (13820: I2 ^see 1)
  12473. =>WM: (13837: I2 ^level-1 R0-root)
  12474. <=WM: (13823: I2 ^level-1 R1-root)
  12475. --- END Input Phase ---
  12476. --- Proposal Phase ---
  12477. --- Inner Elaboration Phase, active level 1 (S1) ---
  12478. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12479. -->
  12480. (S1 ^operator O1973 = 0.6195651222408995)
  12481. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12482. -->
  12483. (S1 ^operator O1974 = -0.2190661556260421)
  12484. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12485. -->
  12486. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12487. -->
  12488. Firing elaborate*copy-see-to-output-link
  12489. -->
  12490. (I3 ^see 0 +)
  12491. Firing elaborate*reward*based*on*reward
  12492. -->
  12493. (R991 ^value 1 +)
  12494. (R1 ^reward R991 +)
  12495. Firing propose*predict-yes
  12496. -->
  12497. (O1975 ^name predict-yes +)
  12498. (S1 ^operator O1975 +)
  12499. Firing propose*predict-no
  12500. -->
  12501. (O1976 ^name predict-no +)
  12502. (S1 ^operator O1976 +)
  12503. Firing rl*prefer*rvt*predict-no*H0*2
  12504. -->
  12505. (S1 ^operator O1974 = 0.3140233963466647)
  12506. Firing rl*prefer*rvt*predict-yes*H0*1
  12507. -->
  12508. (S1 ^operator O1973 = 0.3804160307887663)
  12509. Firing prefer*rvt*predict-yes*H0
  12510. -->
  12511. Firing prefer*rvt*predict-no*H0
  12512. -->
  12513. Firing elaborate*copy-dir-to-output-link
  12514. -->
  12515. (I3 ^dir L +)
  12516. inner elaboration loop at bottom goal.
  12517. Retracting elaborate*copy-see-to-output-link
  12518. -->
  12519. (I3 ^see 1 +)
  12520. Retracting propose*predict-no
  12521. -->
  12522. (O1974 ^name predict-no +)
  12523. (S1 ^operator O1974 +)
  12524. Retracting propose*predict-yes
  12525. -->
  12526. (O1973 ^name predict-yes +)
  12527. (S1 ^operator O1973 +)
  12528. Retracting elaborate*reward*based*on*reward
  12529. -->
  12530. (R990 ^value 1 +)
  12531. (R1 ^reward R990 +)
  12532. Retracting elaborate*copy-dir-to-output-link
  12533. -->
  12534. (I3 ^dir R +)
  12535. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  12536. -->
  12537. (S1 ^operator O1974 = 0.7701842386860367)
  12538. Retracting rl*prefer*rvt*predict-no*H0*6
  12539. -->
  12540. (S1 ^operator O1974 = 0.2298616880335552)
  12541. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  12542. -->
  12543. (S1 ^operator O1973 = -0.252585164213872)
  12544. Retracting rl*prefer*rvt*predict-yes*H0*5
  12545. -->
  12546. (S1 ^operator O1973 = 0.2939078922513593)
  12547. =>WM: (13845: S1 ^operator O1976 +)
  12548. =>WM: (13844: S1 ^operator O1975 +)
  12549. =>WM: (13843: I3 ^dir L)
  12550. =>WM: (13842: O1976 ^name predict-no)
  12551. =>WM: (13841: O1975 ^name predict-yes)
  12552. =>WM: (13840: R991 ^value 1)
  12553. =>WM: (13839: R1 ^reward R991)
  12554. =>WM: (13838: I3 ^see 0)
  12555. <=WM: (13829: S1 ^operator O1973 +)
  12556. <=WM: (13830: S1 ^operator O1974 +)
  12557. <=WM: (13831: S1 ^operator O1974)
  12558. <=WM: (13814: I3 ^dir R)
  12559. <=WM: (13825: R1 ^reward R990)
  12560. <=WM: (13824: I3 ^see 1)
  12561. <=WM: (13828: O1974 ^name predict-no)
  12562. <=WM: (13827: O1973 ^name predict-yes)
  12563. <=WM: (13826: R990 ^value 1)
  12564. --- Inner Elaboration Phase, active level 1 (S1) ---
  12565. Firing prefer*rvt*predict-yes*H0
  12566. -->
  12567. Firing rl*prefer*rvt*predict-yes*H0*1
  12568. -->
  12569. (S1 ^operator O1975 = 0.3804160307887663)
  12570. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12571. -->
  12572. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12573. -->
  12574. (S1 ^operator O1975 = 0.6195651222408995)
  12575. Firing prefer*rvt*predict-no*H0
  12576. -->
  12577. Firing rl*prefer*rvt*predict-no*H0*2
  12578. -->
  12579. (S1 ^operator O1976 = 0.3140233963466647)
  12580. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12581. -->
  12582. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12583. -->
  12584. (S1 ^operator O1976 = -0.2190661556260421)
  12585. inner elaboration loop at bottom goal.
  12586. Retracting rl*prefer*rvt*predict-no*H0*2
  12587. -->
  12588. (S1 ^operator O1974 = 0.3140233963466647)
  12589. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12590. -->
  12591. (S1 ^operator O1974 = -0.2190661556260421)
  12592. Retracting rl*prefer*rvt*predict-yes*H0*1
  12593. -->
  12594. (S1 ^operator O1973 = 0.3804160307887663)
  12595. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12596. -->
  12597. (S1 ^operator O1973 = 0.6195651222408995)
  12598. --- END Proposal Phase ---
  12599. --- Decision Phase ---
  12600. RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229862 -> 0.61191 -0.382052 0.229858(R,m,v=1,0.844828,0.131852)
  12601. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388124 0.38206 0.770184 -> 0.38812 0.38206 0.77018(R,m,v=1,1,0)
  12602. =>WM: (13846: S1 ^operator O1975)
  12603. 988: O: O1975 (predict-yes)
  12604. --- END Decision Phase ---
  12605. --- Application Phase ---
  12606. --- Firing Productions (PE) For State At Depth 1 ---
  12607. --- Inner Elaboration Phase, active level 1 (S1) ---
  12608. Firing apply*operator
  12609. -->
  12610. (I3 ^predict-yes N988 + :O )
  12611. Firing apply*operator*complete
  12612. -->
  12613. (I3 ^predict-no N987 - :O )
  12614. inner elaboration loop at bottom goal.
  12615. --- Change Working Memory (PE) ---
  12616. =>WM: (13847: I3 ^predict-yes N988)
  12617. <=WM: (13833: N987 ^status complete)
  12618. <=WM: (13832: I3 ^predict-no N987)
  12619. --- Firing Productions (IE) For State At Depth 1 ---
  12620. --- Inner Elaboration Phase, active level 1 (S1) ---
  12621. Firing monitor*world
  12622. -->
  12623. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12624. --- Change Working Memory (IE) ---
  12625. --- END Application Phase ---
  12626. --- Output Phase ---
  12627. ENV: Agent did: predict-yes for direction L in state State-B
  12628. In State-B moving L
  12629. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12630. predict error 0
  12631. dir: dir isU
  12632. --- END Output Phase ---
  12633. /|\--- Input Phase ---
  12634. =>WM: (13851: I2 ^dir U)
  12635. =>WM: (13850: I2 ^reward 1)
  12636. =>WM: (13849: I2 ^see 1)
  12637. =>WM: (13848: N988 ^status complete)
  12638. <=WM: (13836: I2 ^dir L)
  12639. <=WM: (13835: I2 ^reward 1)
  12640. <=WM: (13834: I2 ^see 0)
  12641. =>WM: (13852: I2 ^level-1 L1-root)
  12642. <=WM: (13837: I2 ^level-1 R0-root)
  12643. --- END Input Phase ---
  12644. --- Proposal Phase ---
  12645. --- Inner Elaboration Phase, active level 1 (S1) ---
  12646. Firing elaborate*copy-see-to-output-link
  12647. -->
  12648. (I3 ^see 1 +)
  12649. Firing elaborate*reward*based*on*reward
  12650. -->
  12651. (R992 ^value 1 +)
  12652. (R1 ^reward R992 +)
  12653. Firing propose*predict-yes
  12654. -->
  12655. (O1977 ^name predict-yes +)
  12656. (S1 ^operator O1977 +)
  12657. Firing propose*predict-no
  12658. -->
  12659. (O1978 ^name predict-no +)
  12660. (S1 ^operator O1978 +)
  12661. Firing rl*prefer*rvt*predict-no*H0*4
  12662. -->
  12663. (S1 ^operator O1976 = 1.)
  12664. Firing rl*prefer*rvt*predict-yes*H0*3
  12665. -->
  12666. (S1 ^operator O1975 = 0.)
  12667. Firing prefer*rvt*predict-yes*H0
  12668. -->
  12669. Firing prefer*rvt*predict-no*H0
  12670. -->
  12671. Firing elaborate*copy-dir-to-output-link
  12672. -->
  12673. (I3 ^dir U +)
  12674. inner elaboration loop at bottom goal.
  12675. Retracting elaborate*copy-see-to-output-link
  12676. -->
  12677. (I3 ^see 0 +)
  12678. Retracting propose*predict-no
  12679. -->
  12680. (O1976 ^name predict-no +)
  12681. (S1 ^operator O1976 +)
  12682. Retracting propose*predict-yes
  12683. -->
  12684. (O1975 ^name predict-yes +)
  12685. (S1 ^operator O1975 +)
  12686. Retracting elaborate*reward*based*on*reward
  12687. -->
  12688. (R991 ^value 1 +)
  12689. (R1 ^reward R991 +)
  12690. Retracting elaborate*copy-dir-to-output-link
  12691. -->
  12692. (I3 ^dir L +)
  12693. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  12694. -->
  12695. (S1 ^operator O1976 = -0.2190661556260421)
  12696. Retracting rl*prefer*rvt*predict-no*H0*2
  12697. -->
  12698. (S1 ^operator O1976 = 0.3140233963466647)
  12699. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  12700. -->
  12701. (S1 ^operator O1975 = 0.6195651222408995)
  12702. Retracting rl*prefer*rvt*predict-yes*H0*1
  12703. -->
  12704. (S1 ^operator O1975 = 0.3804160307887663)
  12705. =>WM: (13860: S1 ^operator O1978 +)
  12706. =>WM: (13859: S1 ^operator O1977 +)
  12707. =>WM: (13858: I3 ^dir U)
  12708. =>WM: (13857: O1978 ^name predict-no)
  12709. =>WM: (13856: O1977 ^name predict-yes)
  12710. =>WM: (13855: R992 ^value 1)
  12711. =>WM: (13854: R1 ^reward R992)
  12712. =>WM: (13853: I3 ^see 1)
  12713. <=WM: (13844: S1 ^operator O1975 +)
  12714. <=WM: (13846: S1 ^operator O1975)
  12715. <=WM: (13845: S1 ^operator O1976 +)
  12716. <=WM: (13843: I3 ^dir L)
  12717. <=WM: (13839: R1 ^reward R991)
  12718. <=WM: (13838: I3 ^see 0)
  12719. <=WM: (13842: O1976 ^name predict-no)
  12720. <=WM: (13841: O1975 ^name predict-yes)
  12721. <=WM: (13840: R991 ^value 1)
  12722. --- Inner Elaboration Phase, active level 1 (S1) ---
  12723. Firing prefer*rvt*predict-yes*H0
  12724. -->
  12725. Firing rl*prefer*rvt*predict-yes*H0*3
  12726. -->
  12727. (S1 ^operator O1977 = 0.)
  12728. Firing prefer*rvt*predict-no*H0
  12729. -->
  12730. Firing rl*prefer*rvt*predict-no*H0*4
  12731. -->
  12732. (S1 ^operator O1978 = 1.)
  12733. inner elaboration loop at bottom goal.
  12734. Retracting rl*prefer*rvt*predict-no*H0*4
  12735. -->
  12736. (S1 ^operator O1976 = 1.)
  12737. Retracting rl*prefer*rvt*predict-yes*H0*3
  12738. -->
  12739. (S1 ^operator O1975 = 0.)
  12740. --- END Proposal Phase ---
  12741. --- Decision Phase ---
  12742. RL update rl*prefer*rvt*predict-yes*H0*1 0.521346 -0.14093 0.380416 -> 0.521348 -0.14093 0.380418(R,m,v=1,0.828221,0.143149)
  12743. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478633 0.140932 0.619565 -> 0.478635 0.140932 0.619567(R,m,v=1,1,0)
  12744. =>WM: (13861: S1 ^operator O1978)
  12745. 989: O: O1978 (predict-no)
  12746. --- END Decision Phase ---
  12747. --- Application Phase ---
  12748. --- Firing Productions (PE) For State At Depth 1 ---
  12749. --- Inner Elaboration Phase, active level 1 (S1) ---
  12750. Firing apply*operator
  12751. -->
  12752. (I3 ^predict-no N989 + :O )
  12753. Firing apply*operator*complete
  12754. -->
  12755. (I3 ^predict-yes N988 - :O )
  12756. inner elaboration loop at bottom goal.
  12757. --- Change Working Memory (PE) ---
  12758. =>WM: (13862: I3 ^predict-no N989)
  12759. <=WM: (13848: N988 ^status complete)
  12760. <=WM: (13847: I3 ^predict-yes N988)
  12761. --- Firing Productions (IE) For State At Depth 1 ---
  12762. --- Inner Elaboration Phase, active level 1 (S1) ---
  12763. Firing monitor*world
  12764. -->
  12765. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12766. --- Change Working Memory (IE) ---
  12767. --- END Application Phase ---
  12768. --- Output Phase ---
  12769. ENV: Agent did: predict-no for direction U in state State-A
  12770. In State-A moving U
  12771. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12772. predict error 0
  12773. dir: dir isR
  12774. --- END Output Phase ---
  12775. -/|--- Input Phase ---
  12776. =>WM: (13866: I2 ^dir R)
  12777. =>WM: (13865: I2 ^reward 1)
  12778. =>WM: (13864: I2 ^see 0)
  12779. =>WM: (13863: N989 ^status complete)
  12780. <=WM: (13851: I2 ^dir U)
  12781. <=WM: (13850: I2 ^reward 1)
  12782. <=WM: (13849: I2 ^see 1)
  12783. =>WM: (13867: I2 ^level-1 L1-root)
  12784. <=WM: (13852: I2 ^level-1 L1-root)
  12785. --- END Input Phase ---
  12786. --- Proposal Phase ---
  12787. --- Inner Elaboration Phase, active level 1 (S1) ---
  12788. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12789. -->
  12790. (S1 ^operator O1977 = 0.7063401754803731)
  12791. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12792. -->
  12793. (S1 ^operator O1978 = -0.1937987592593187)
  12794. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12795. -->
  12796. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12797. -->
  12798. Firing elaborate*copy-see-to-output-link
  12799. -->
  12800. (I3 ^see 0 +)
  12801. Firing elaborate*reward*based*on*reward
  12802. -->
  12803. (R993 ^value 1 +)
  12804. (R1 ^reward R993 +)
  12805. Firing propose*predict-yes
  12806. -->
  12807. (O1979 ^name predict-yes +)
  12808. (S1 ^operator O1979 +)
  12809. Firing propose*predict-no
  12810. -->
  12811. (O1980 ^name predict-no +)
  12812. (S1 ^operator O1980 +)
  12813. Firing rl*prefer*rvt*predict-no*H0*6
  12814. -->
  12815. (S1 ^operator O1978 = 0.2298579596436188)
  12816. Firing rl*prefer*rvt*predict-yes*H0*5
  12817. -->
  12818. (S1 ^operator O1977 = 0.2939078922513593)
  12819. Firing prefer*rvt*predict-yes*H0
  12820. -->
  12821. Firing prefer*rvt*predict-no*H0
  12822. -->
  12823. Firing elaborate*copy-dir-to-output-link
  12824. -->
  12825. (I3 ^dir R +)
  12826. inner elaboration loop at bottom goal.
  12827. Retracting elaborate*copy-see-to-output-link
  12828. -->
  12829. (I3 ^see 1 +)
  12830. Retracting propose*predict-no
  12831. -->
  12832. (O1978 ^name predict-no +)
  12833. (S1 ^operator O1978 +)
  12834. Retracting propose*predict-yes
  12835. -->
  12836. (O1977 ^name predict-yes +)
  12837. (S1 ^operator O1977 +)
  12838. Retracting elaborate*reward*based*on*reward
  12839. -->
  12840. (R992 ^value 1 +)
  12841. (R1 ^reward R992 +)
  12842. Retracting elaborate*copy-dir-to-output-link
  12843. -->
  12844. (I3 ^dir U +)
  12845. Retracting rl*prefer*rvt*predict-no*H0*4
  12846. -->
  12847. (S1 ^operator O1978 = 1.)
  12848. Retracting rl*prefer*rvt*predict-yes*H0*3
  12849. -->
  12850. (S1 ^operator O1977 = 0.)
  12851. =>WM: (13875: S1 ^operator O1980 +)
  12852. =>WM: (13874: S1 ^operator O1979 +)
  12853. =>WM: (13873: I3 ^dir R)
  12854. =>WM: (13872: O1980 ^name predict-no)
  12855. =>WM: (13871: O1979 ^name predict-yes)
  12856. =>WM: (13870: R993 ^value 1)
  12857. =>WM: (13869: R1 ^reward R993)
  12858. =>WM: (13868: I3 ^see 0)
  12859. <=WM: (13859: S1 ^operator O1977 +)
  12860. <=WM: (13860: S1 ^operator O1978 +)
  12861. <=WM: (13861: S1 ^operator O1978)
  12862. <=WM: (13858: I3 ^dir U)
  12863. <=WM: (13854: R1 ^reward R992)
  12864. <=WM: (13853: I3 ^see 1)
  12865. <=WM: (13857: O1978 ^name predict-no)
  12866. <=WM: (13856: O1977 ^name predict-yes)
  12867. <=WM: (13855: R992 ^value 1)
  12868. --- Inner Elaboration Phase, active level 1 (S1) ---
  12869. Firing prefer*rvt*predict-yes*H0
  12870. -->
  12871. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12872. -->
  12873. (S1 ^operator O1979 = 0.7063401754803731)
  12874. Firing rl*prefer*rvt*predict-yes*H0*5
  12875. -->
  12876. (S1 ^operator O1979 = 0.2939078922513593)
  12877. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12878. -->
  12879. Firing prefer*rvt*predict-no*H0
  12880. -->
  12881. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12882. -->
  12883. (S1 ^operator O1980 = -0.1937987592593187)
  12884. Firing rl*prefer*rvt*predict-no*H0*6
  12885. -->
  12886. (S1 ^operator O1980 = 0.2298579596436188)
  12887. Firing prefer*rvt*predict-no*H0*6*v1*H1
  12888. -->
  12889. inner elaboration loop at bottom goal.
  12890. Retracting rl*prefer*rvt*predict-no*H0*6
  12891. -->
  12892. (S1 ^operator O1978 = 0.2298579596436188)
  12893. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  12894. -->
  12895. (S1 ^operator O1978 = -0.1937987592593187)
  12896. Retracting rl*prefer*rvt*predict-yes*H0*5
  12897. -->
  12898. (S1 ^operator O1977 = 0.2939078922513593)
  12899. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  12900. -->
  12901. (S1 ^operator O1977 = 0.7063401754803731)
  12902. --- END Proposal Phase ---
  12903. --- Decision Phase ---
  12904. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12905. =>WM: (13876: S1 ^operator O1979)
  12906. 990: O: O1979 (predict-yes)
  12907. --- END Decision Phase ---
  12908. --- Application Phase ---
  12909. --- Firing Productions (PE) For State At Depth 1 ---
  12910. --- Inner Elaboration Phase, active level 1 (S1) ---
  12911. Firing apply*operator
  12912. -->
  12913. (I3 ^predict-yes N990 + :O )
  12914. Firing apply*operator*complete
  12915. -->
  12916. (I3 ^predict-no N989 - :O )
  12917. inner elaboration loop at bottom goal.
  12918. --- Change Working Memory (PE) ---
  12919. =>WM: (13877: I3 ^predict-yes N990)
  12920. <=WM: (13863: N989 ^status complete)
  12921. <=WM: (13862: I3 ^predict-no N989)
  12922. --- Firing Productions (IE) For State At Depth 1 ---
  12923. --- Inner Elaboration Phase, active level 1 (S1) ---
  12924. Firing monitor*world
  12925. -->
  12926. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12927. --- Change Working Memory (IE) ---
  12928. --- END Application Phase ---
  12929. --- Output Phase ---
  12930. ENV: Agent did: predict-yes for direction R in state State-A
  12931. In State-A moving R
  12932. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12933. predict error 0
  12934. dir: dir isU
  12935. --- END Output Phase ---
  12936. \-/--- Input Phase ---
  12937. =>WM: (13881: I2 ^dir U)
  12938. =>WM: (13880: I2 ^reward 1)
  12939. =>WM: (13879: I2 ^see 1)
  12940. =>WM: (13878: N990 ^status complete)
  12941. <=WM: (13866: I2 ^dir R)
  12942. <=WM: (13865: I2 ^reward 1)
  12943. <=WM: (13864: I2 ^see 0)
  12944. =>WM: (13882: I2 ^level-1 R1-root)
  12945. <=WM: (13867: I2 ^level-1 L1-root)
  12946. --- END Input Phase ---
  12947. --- Proposal Phase ---
  12948. --- Inner Elaboration Phase, active level 1 (S1) ---
  12949. Firing elaborate*copy-see-to-output-link
  12950. -->
  12951. (I3 ^see 1 +)
  12952. Firing elaborate*reward*based*on*reward
  12953. -->
  12954. (R994 ^value 1 +)
  12955. (R1 ^reward R994 +)
  12956. Firing propose*predict-yes
  12957. -->
  12958. (O1981 ^name predict-yes +)
  12959. (S1 ^operator O1981 +)
  12960. Firing propose*predict-no
  12961. -->
  12962. (O1982 ^name predict-no +)
  12963. (S1 ^operator O1982 +)
  12964. Firing rl*prefer*rvt*predict-no*H0*4
  12965. -->
  12966. (S1 ^operator O1980 = 1.)
  12967. Firing rl*prefer*rvt*predict-yes*H0*3
  12968. -->
  12969. (S1 ^operator O1979 = 0.)
  12970. Firing prefer*rvt*predict-yes*H0
  12971. -->
  12972. Firing prefer*rvt*predict-no*H0
  12973. -->
  12974. Firing elaborate*copy-dir-to-output-link
  12975. -->
  12976. (I3 ^dir U +)
  12977. inner elaboration loop at bottom goal.
  12978. Retracting elaborate*copy-see-to-output-link
  12979. -->
  12980. (I3 ^see 0 +)
  12981. Retracting propose*predict-no
  12982. -->
  12983. (O1980 ^name predict-no +)
  12984. (S1 ^operator O1980 +)
  12985. Retracting propose*predict-yes
  12986. -->
  12987. (O1979 ^name predict-yes +)
  12988. (S1 ^operator O1979 +)
  12989. Retracting elaborate*reward*based*on*reward
  12990. -->
  12991. (R993 ^value 1 +)
  12992. (R1 ^reward R993 +)
  12993. Retracting elaborate*copy-dir-to-output-link
  12994. -->
  12995. (I3 ^dir R +)
  12996. Retracting rl*prefer*rvt*predict-no*H0*6
  12997. -->
  12998. (S1 ^operator O1980 = 0.2298579596436188)
  12999. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13000. -->
  13001. (S1 ^operator O1980 = -0.1937987592593187)
  13002. Retracting rl*prefer*rvt*predict-yes*H0*5
  13003. -->
  13004. (S1 ^operator O1979 = 0.2939078922513593)
  13005. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13006. -->
  13007. (S1 ^operator O1979 = 0.7063401754803731)
  13008. =>WM: (13890: S1 ^operator O1982 +)
  13009. =>WM: (13889: S1 ^operator O1981 +)
  13010. =>WM: (13888: I3 ^dir U)
  13011. =>WM: (13887: O1982 ^name predict-no)
  13012. =>WM: (13886: O1981 ^name predict-yes)
  13013. =>WM: (13885: R994 ^value 1)
  13014. =>WM: (13884: R1 ^reward R994)
  13015. =>WM: (13883: I3 ^see 1)
  13016. <=WM: (13874: S1 ^operator O1979 +)
  13017. <=WM: (13876: S1 ^operator O1979)
  13018. <=WM: (13875: S1 ^operator O1980 +)
  13019. <=WM: (13873: I3 ^dir R)
  13020. <=WM: (13869: R1 ^reward R993)
  13021. <=WM: (13868: I3 ^see 0)
  13022. <=WM: (13872: O1980 ^name predict-no)
  13023. <=WM: (13871: O1979 ^name predict-yes)
  13024. <=WM: (13870: R993 ^value 1)
  13025. --- Inner Elaboration Phase, active level 1 (S1) ---
  13026. Firing prefer*rvt*predict-yes*H0
  13027. -->
  13028. Firing rl*prefer*rvt*predict-yes*H0*3
  13029. -->
  13030. (S1 ^operator O1981 = 0.)
  13031. Firing prefer*rvt*predict-no*H0
  13032. -->
  13033. Firing rl*prefer*rvt*predict-no*H0*4
  13034. -->
  13035. (S1 ^operator O1982 = 1.)
  13036. inner elaboration loop at bottom goal.
  13037. Retracting rl*prefer*rvt*predict-no*H0*4
  13038. -->
  13039. (S1 ^operator O1980 = 1.)
  13040. Retracting rl*prefer*rvt*predict-yes*H0*3
  13041. -->
  13042. (S1 ^operator O1979 = 0.)
  13043. --- END Proposal Phase ---
  13044. --- Decision Phase ---
  13045. RL update rl*prefer*rvt*predict-yes*H0*5 0.50099 -0.207082 0.293908 -> 0.500972 -0.207084 0.293887(R,m,v=1,0.844156,0.132417)
  13046. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499233 0.207107 0.70634 -> 0.499211 0.207105 0.706316(R,m,v=1,1,0)
  13047. =>WM: (13891: S1 ^operator O1982)
  13048. 991: O: O1982 (predict-no)
  13049. --- END Decision Phase ---
  13050. --- Application Phase ---
  13051. --- Firing Productions (PE) For State At Depth 1 ---
  13052. --- Inner Elaboration Phase, active level 1 (S1) ---
  13053. Firing apply*operator
  13054. -->
  13055. (I3 ^predict-no N991 + :O )
  13056. Firing apply*operator*complete
  13057. -->
  13058. (I3 ^predict-yes N990 - :O )
  13059. inner elaboration loop at bottom goal.
  13060. --- Change Working Memory (PE) ---
  13061. =>WM: (13892: I3 ^predict-no N991)
  13062. <=WM: (13878: N990 ^status complete)
  13063. <=WM: (13877: I3 ^predict-yes N990)
  13064. --- Firing Productions (IE) For State At Depth 1 ---
  13065. --- Inner Elaboration Phase, active level 1 (S1) ---
  13066. Firing monitor*world
  13067. -->
  13068. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13069. --- Change Working Memory (IE) ---
  13070. --- END Application Phase ---
  13071. --- Output Phase ---
  13072. ENV: Agent did: predict-no for direction U in state State-B
  13073. In State-B moving U
  13074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13075. predict error 0
  13076. dir: dir isU
  13077. --- END Output Phase ---
  13078. |--- Input Phase ---
  13079. =>WM: (13896: I2 ^dir U)
  13080. =>WM: (13895: I2 ^reward 1)
  13081. =>WM: (13894: I2 ^see 0)
  13082. =>WM: (13893: N991 ^status complete)
  13083. <=WM: (13881: I2 ^dir U)
  13084. <=WM: (13880: I2 ^reward 1)
  13085. <=WM: (13879: I2 ^see 1)
  13086. =>WM: (13897: I2 ^level-1 R1-root)
  13087. <=WM: (13882: I2 ^level-1 R1-root)
  13088. --- END Input Phase ---
  13089. --- Proposal Phase ---
  13090. --- Inner Elaboration Phase, active level 1 (S1) ---
  13091. Firing elaborate*copy-see-to-output-link
  13092. -->
  13093. (I3 ^see 0 +)
  13094. Firing elaborate*reward*based*on*reward
  13095. -->
  13096. (R995 ^value 1 +)
  13097. (R1 ^reward R995 +)
  13098. Firing propose*predict-yes
  13099. -->
  13100. (O1983 ^name predict-yes +)
  13101. (S1 ^operator O1983 +)
  13102. Firing propose*predict-no
  13103. -->
  13104. (O1984 ^name predict-no +)
  13105. (S1 ^operator O1984 +)
  13106. Firing rl*prefer*rvt*predict-no*H0*4
  13107. -->
  13108. (S1 ^operator O1982 = 1.)
  13109. Firing rl*prefer*rvt*predict-yes*H0*3
  13110. -->
  13111. (S1 ^operator O1981 = 0.)
  13112. Firing prefer*rvt*predict-yes*H0
  13113. -->
  13114. Firing prefer*rvt*predict-no*H0
  13115. -->
  13116. Firing elaborate*copy-dir-to-output-link
  13117. -->
  13118. (I3 ^dir U +)
  13119. inner elaboration loop at bottom goal.
  13120. Retracting elaborate*copy-see-to-output-link
  13121. -->
  13122. (I3 ^see 1 +)
  13123. Retracting propose*predict-no
  13124. -->
  13125. (O1982 ^name predict-no +)
  13126. (S1 ^operator O1982 +)
  13127. Retracting propose*predict-yes
  13128. -->
  13129. (O1981 ^name predict-yes +)
  13130. (S1 ^operator O1981 +)
  13131. Retracting elaborate*reward*based*on*reward
  13132. -->
  13133. (R994 ^value 1 +)
  13134. (R1 ^reward R994 +)
  13135. Retracting elaborate*copy-dir-to-output-link
  13136. -->
  13137. (I3 ^dir U +)
  13138. Retracting rl*prefer*rvt*predict-no*H0*4
  13139. -->
  13140. (S1 ^operator O1982 = 1.)
  13141. Retracting rl*prefer*rvt*predict-yes*H0*3
  13142. -->
  13143. (S1 ^operator O1981 = 0.)
  13144. =>WM: (13904: S1 ^operator O1984 +)
  13145. =>WM: (13903: S1 ^operator O1983 +)
  13146. =>WM: (13902: O1984 ^name predict-no)
  13147. =>WM: (13901: O1983 ^name predict-yes)
  13148. =>WM: (13900: R995 ^value 1)
  13149. =>WM: (13899: R1 ^reward R995)
  13150. =>WM: (13898: I3 ^see 0)
  13151. <=WM: (13889: S1 ^operator O1981 +)
  13152. <=WM: (13890: S1 ^operator O1982 +)
  13153. <=WM: (13891: S1 ^operator O1982)
  13154. <=WM: (13884: R1 ^reward R994)
  13155. <=WM: (13883: I3 ^see 1)
  13156. <=WM: (13887: O1982 ^name predict-no)
  13157. <=WM: (13886: O1981 ^name predict-yes)
  13158. <=WM: (13885: R994 ^value 1)
  13159. --- Inner Elaboration Phase, active level 1 (S1) ---
  13160. Firing prefer*rvt*predict-yes*H0
  13161. -->
  13162. Firing rl*prefer*rvt*predict-yes*H0*3
  13163. -->
  13164. (S1 ^operator O1983 = 0.)
  13165. Firing prefer*rvt*predict-no*H0
  13166. -->
  13167. Firing rl*prefer*rvt*predict-no*H0*4
  13168. -->
  13169. (S1 ^operator O1984 = 1.)
  13170. inner elaboration loop at bottom goal.
  13171. Retracting rl*prefer*rvt*predict-no*H0*4
  13172. -->
  13173. (S1 ^operator O1982 = 1.)
  13174. Retracting rl*prefer*rvt*predict-yes*H0*3
  13175. -->
  13176. (S1 ^operator O1981 = 0.)
  13177. --- END Proposal Phase ---
  13178. --- Decision Phase ---
  13179. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13180. =>WM: (13905: S1 ^operator O1984)
  13181. 992: O: O1984 (predict-no)
  13182. --- END Decision Phase ---
  13183. --- Application Phase ---
  13184. --- Firing Productions (PE) For State At Depth 1 ---
  13185. --- Inner Elaboration Phase, active level 1 (S1) ---
  13186. Firing apply*operator
  13187. -->
  13188. (I3 ^predict-no N992 + :O )
  13189. Firing apply*operator*complete
  13190. -->
  13191. (I3 ^predict-no N991 - :O )
  13192. inner elaboration loop at bottom goal.
  13193. --- Change Working Memory (PE) ---
  13194. =>WM: (13906: I3 ^predict-no N992)
  13195. <=WM: (13893: N991 ^status complete)
  13196. <=WM: (13892: I3 ^predict-no N991)
  13197. --- Firing Productions (IE) For State At Depth 1 ---
  13198. --- Inner Elaboration Phase, active level 1 (S1) ---
  13199. Firing monitor*world
  13200. -->
  13201. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13202. --- Change Working Memory (IE) ---
  13203. --- END Application Phase ---
  13204. --- Output Phase ---
  13205. ENV: Agent did: predict-no for direction U in state State-B
  13206. In State-B moving U
  13207. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13208. predict error 0
  13209. dir: dir isL
  13210. --- END Output Phase ---
  13211. \-/--- Input Phase ---
  13212. =>WM: (13910: I2 ^dir L)
  13213. =>WM: (13909: I2 ^reward 1)
  13214. =>WM: (13908: I2 ^see 0)
  13215. =>WM: (13907: N992 ^status complete)
  13216. <=WM: (13896: I2 ^dir U)
  13217. <=WM: (13895: I2 ^reward 1)
  13218. <=WM: (13894: I2 ^see 0)
  13219. =>WM: (13911: I2 ^level-1 R1-root)
  13220. <=WM: (13897: I2 ^level-1 R1-root)
  13221. --- END Input Phase ---
  13222. --- Proposal Phase ---
  13223. --- Inner Elaboration Phase, active level 1 (S1) ---
  13224. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13225. -->
  13226. (S1 ^operator O1983 = 0.6196129817664832)
  13227. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13228. -->
  13229. (S1 ^operator O1984 = -0.1479504104026684)
  13230. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13231. -->
  13232. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13233. -->
  13234. Firing elaborate*copy-see-to-output-link
  13235. -->
  13236. (I3 ^see 0 +)
  13237. Firing elaborate*reward*based*on*reward
  13238. -->
  13239. (R996 ^value 1 +)
  13240. (R1 ^reward R996 +)
  13241. Firing propose*predict-yes
  13242. -->
  13243. (O1985 ^name predict-yes +)
  13244. (S1 ^operator O1985 +)
  13245. Firing propose*predict-no
  13246. -->
  13247. (O1986 ^name predict-no +)
  13248. (S1 ^operator O1986 +)
  13249. Firing rl*prefer*rvt*predict-no*H0*2
  13250. -->
  13251. (S1 ^operator O1984 = 0.3140233963466647)
  13252. Firing rl*prefer*rvt*predict-yes*H0*1
  13253. -->
  13254. (S1 ^operator O1983 = 0.380417577206794)
  13255. Firing prefer*rvt*predict-yes*H0
  13256. -->
  13257. Firing prefer*rvt*predict-no*H0
  13258. -->
  13259. Firing elaborate*copy-dir-to-output-link
  13260. -->
  13261. (I3 ^dir L +)
  13262. inner elaboration loop at bottom goal.
  13263. Retracting elaborate*copy-see-to-output-link
  13264. -->
  13265. (I3 ^see 0 +)
  13266. Retracting propose*predict-no
  13267. -->
  13268. (O1984 ^name predict-no +)
  13269. (S1 ^operator O1984 +)
  13270. Retracting propose*predict-yes
  13271. -->
  13272. (O1983 ^name predict-yes +)
  13273. (S1 ^operator O1983 +)
  13274. Retracting elaborate*reward*based*on*reward
  13275. -->
  13276. (R995 ^value 1 +)
  13277. (R1 ^reward R995 +)
  13278. Retracting elaborate*copy-dir-to-output-link
  13279. -->
  13280. (I3 ^dir U +)
  13281. Retracting rl*prefer*rvt*predict-no*H0*4
  13282. -->
  13283. (S1 ^operator O1984 = 1.)
  13284. Retracting rl*prefer*rvt*predict-yes*H0*3
  13285. -->
  13286. (S1 ^operator O1983 = 0.)
  13287. =>WM: (13918: S1 ^operator O1986 +)
  13288. =>WM: (13917: S1 ^operator O1985 +)
  13289. =>WM: (13916: I3 ^dir L)
  13290. =>WM: (13915: O1986 ^name predict-no)
  13291. =>WM: (13914: O1985 ^name predict-yes)
  13292. =>WM: (13913: R996 ^value 1)
  13293. =>WM: (13912: R1 ^reward R996)
  13294. <=WM: (13903: S1 ^operator O1983 +)
  13295. <=WM: (13904: S1 ^operator O1984 +)
  13296. <=WM: (13905: S1 ^operator O1984)
  13297. <=WM: (13888: I3 ^dir U)
  13298. <=WM: (13899: R1 ^reward R995)
  13299. <=WM: (13902: O1984 ^name predict-no)
  13300. <=WM: (13901: O1983 ^name predict-yes)
  13301. <=WM: (13900: R995 ^value 1)
  13302. --- Inner Elaboration Phase, active level 1 (S1) ---
  13303. Firing prefer*rvt*predict-yes*H0
  13304. -->
  13305. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13306. -->
  13307. (S1 ^operator O1985 = 0.6196129817664832)
  13308. Firing rl*prefer*rvt*predict-yes*H0*1
  13309. -->
  13310. (S1 ^operator O1985 = 0.380417577206794)
  13311. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13312. -->
  13313. Firing prefer*rvt*predict-no*H0
  13314. -->
  13315. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13316. -->
  13317. (S1 ^operator O1986 = -0.1479504104026684)
  13318. Firing rl*prefer*rvt*predict-no*H0*2
  13319. -->
  13320. (S1 ^operator O1986 = 0.3140233963466647)
  13321. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13322. -->
  13323. inner elaboration loop at bottom goal.
  13324. Retracting rl*prefer*rvt*predict-no*H0*2
  13325. -->
  13326. (S1 ^operator O1984 = 0.3140233963466647)
  13327. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13328. -->
  13329. (S1 ^operator O1984 = -0.1479504104026684)
  13330. Retracting rl*prefer*rvt*predict-yes*H0*1
  13331. -->
  13332. (S1 ^operator O1983 = 0.380417577206794)
  13333. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13334. -->
  13335. (S1 ^operator O1983 = 0.6196129817664832)
  13336. --- END Proposal Phase ---
  13337. --- Decision Phase ---
  13338. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13339. =>WM: (13919: S1 ^operator O1985)
  13340. 993: O: O1985 (predict-yes)
  13341. --- END Decision Phase ---
  13342. --- Application Phase ---
  13343. --- Firing Productions (PE) For State At Depth 1 ---
  13344. --- Inner Elaboration Phase, active level 1 (S1) ---
  13345. Firing apply*operator
  13346. -->
  13347. (I3 ^predict-yes N993 + :O )
  13348. Firing apply*operator*complete
  13349. -->
  13350. (I3 ^predict-no N992 - :O )
  13351. inner elaboration loop at bottom goal.
  13352. --- Change Working Memory (PE) ---
  13353. =>WM: (13920: I3 ^predict-yes N993)
  13354. <=WM: (13907: N992 ^status complete)
  13355. <=WM: (13906: I3 ^predict-no N992)
  13356. --- Firing Productions (IE) For State At Depth 1 ---
  13357. --- Inner Elaboration Phase, active level 1 (S1) ---
  13358. Firing monitor*world
  13359. -->
  13360. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13361. --- Change Working Memory (IE) ---
  13362. --- END Application Phase ---
  13363. --- Output Phase ---
  13364. ENV: Agent did: predict-yes for direction L in state State-B
  13365. In State-B moving L
  13366. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13367. predict error 0
  13368. dir: dir isR
  13369. --- END Output Phase ---
  13370. |\---- Input Phase ---
  13371. =>WM: (13924: I2 ^dir R)
  13372. =>WM: (13923: I2 ^reward 1)
  13373. =>WM: (13922: I2 ^see 1)
  13374. =>WM: (13921: N993 ^status complete)
  13375. <=WM: (13910: I2 ^dir L)
  13376. <=WM: (13909: I2 ^reward 1)
  13377. <=WM: (13908: I2 ^see 0)
  13378. =>WM: (13925: I2 ^level-1 L1-root)
  13379. <=WM: (13911: I2 ^level-1 R1-root)
  13380. --- END Input Phase ---
  13381. --- Proposal Phase ---
  13382. --- Inner Elaboration Phase, active level 1 (S1) ---
  13383. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13384. -->
  13385. (S1 ^operator O1985 = 0.7063161327052487)
  13386. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13387. -->
  13388. (S1 ^operator O1986 = -0.1937987592593187)
  13389. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13390. -->
  13391. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13392. -->
  13393. Firing elaborate*copy-see-to-output-link
  13394. -->
  13395. (I3 ^see 1 +)
  13396. Firing elaborate*reward*based*on*reward
  13397. -->
  13398. (R997 ^value 1 +)
  13399. (R1 ^reward R997 +)
  13400. Firing propose*predict-yes
  13401. -->
  13402. (O1987 ^name predict-yes +)
  13403. (S1 ^operator O1987 +)
  13404. Firing propose*predict-no
  13405. -->
  13406. (O1988 ^name predict-no +)
  13407. (S1 ^operator O1988 +)
  13408. Firing rl*prefer*rvt*predict-no*H0*6
  13409. -->
  13410. (S1 ^operator O1986 = 0.2298579596436188)
  13411. Firing rl*prefer*rvt*predict-yes*H0*5
  13412. -->
  13413. (S1 ^operator O1985 = 0.29388734647702)
  13414. Firing prefer*rvt*predict-yes*H0
  13415. -->
  13416. Firing prefer*rvt*predict-no*H0
  13417. -->
  13418. Firing elaborate*copy-dir-to-output-link
  13419. -->
  13420. (I3 ^dir R +)
  13421. inner elaboration loop at bottom goal.
  13422. Retracting elaborate*copy-see-to-output-link
  13423. -->
  13424. (I3 ^see 0 +)
  13425. Retracting propose*predict-no
  13426. -->
  13427. (O1986 ^name predict-no +)
  13428. (S1 ^operator O1986 +)
  13429. Retracting propose*predict-yes
  13430. -->
  13431. (O1985 ^name predict-yes +)
  13432. (S1 ^operator O1985 +)
  13433. Retracting elaborate*reward*based*on*reward
  13434. -->
  13435. (R996 ^value 1 +)
  13436. (R1 ^reward R996 +)
  13437. Retracting elaborate*copy-dir-to-output-link
  13438. -->
  13439. (I3 ^dir L +)
  13440. Retracting rl*prefer*rvt*predict-no*H0*2
  13441. -->
  13442. (S1 ^operator O1986 = 0.3140233963466647)
  13443. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  13444. -->
  13445. (S1 ^operator O1986 = -0.1479504104026684)
  13446. Retracting rl*prefer*rvt*predict-yes*H0*1
  13447. -->
  13448. (S1 ^operator O1985 = 0.380417577206794)
  13449. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  13450. -->
  13451. (S1 ^operator O1985 = 0.6196129817664832)
  13452. =>WM: (13933: S1 ^operator O1988 +)
  13453. =>WM: (13932: S1 ^operator O1987 +)
  13454. =>WM: (13931: I3 ^dir R)
  13455. =>WM: (13930: O1988 ^name predict-no)
  13456. =>WM: (13929: O1987 ^name predict-yes)
  13457. =>WM: (13928: R997 ^value 1)
  13458. =>WM: (13927: R1 ^reward R997)
  13459. =>WM: (13926: I3 ^see 1)
  13460. <=WM: (13917: S1 ^operator O1985 +)
  13461. <=WM: (13919: S1 ^operator O1985)
  13462. <=WM: (13918: S1 ^operator O1986 +)
  13463. <=WM: (13916: I3 ^dir L)
  13464. <=WM: (13912: R1 ^reward R996)
  13465. <=WM: (13898: I3 ^see 0)
  13466. <=WM: (13915: O1986 ^name predict-no)
  13467. <=WM: (13914: O1985 ^name predict-yes)
  13468. <=WM: (13913: R996 ^value 1)
  13469. --- Inner Elaboration Phase, active level 1 (S1) ---
  13470. Firing prefer*rvt*predict-yes*H0
  13471. -->
  13472. Firing rl*prefer*rvt*predict-yes*H0*5
  13473. -->
  13474. (S1 ^operator O1987 = 0.29388734647702)
  13475. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13476. -->
  13477. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13478. -->
  13479. (S1 ^operator O1987 = 0.7063161327052487)
  13480. Firing prefer*rvt*predict-no*H0
  13481. -->
  13482. Firing rl*prefer*rvt*predict-no*H0*6
  13483. -->
  13484. (S1 ^operator O1988 = 0.2298579596436188)
  13485. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13486. -->
  13487. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13488. -->
  13489. (S1 ^operator O1988 = -0.1937987592593187)
  13490. inner elaboration loop at bottom goal.
  13491. Retracting rl*prefer*rvt*predict-no*H0*6
  13492. -->
  13493. (S1 ^operator O1986 = 0.2298579596436188)
  13494. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13495. -->
  13496. (S1 ^operator O1986 = -0.1937987592593187)
  13497. Retracting rl*prefer*rvt*predict-yes*H0*5
  13498. -->
  13499. (S1 ^operator O1985 = 0.29388734647702)
  13500. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13501. -->
  13502. (S1 ^operator O1985 = 0.7063161327052487)
  13503. --- END Proposal Phase ---
  13504. --- Decision Phase ---
  13505. RL update rl*prefer*rvt*predict-yes*H0*1 0.521348 -0.14093 0.380418 -> 0.521345 -0.14093 0.380415(R,m,v=1,0.829268,0.142451)
  13506. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478686 0.140927 0.619613 -> 0.478682 0.140928 0.61961(R,m,v=1,1,0)
  13507. =>WM: (13934: S1 ^operator O1987)
  13508. 994: O: O1987 (predict-yes)
  13509. --- END Decision Phase ---
  13510. --- Application Phase ---
  13511. --- Firing Productions (PE) For State At Depth 1 ---
  13512. --- Inner Elaboration Phase, active level 1 (S1) ---
  13513. Firing apply*operator
  13514. -->
  13515. (I3 ^predict-yes N994 + :O )
  13516. Firing apply*operator*complete
  13517. -->
  13518. (I3 ^predict-yes N993 - :O )
  13519. inner elaboration loop at bottom goal.
  13520. --- Change Working Memory (PE) ---
  13521. =>WM: (13935: I3 ^predict-yes N994)
  13522. <=WM: (13921: N993 ^status complete)
  13523. <=WM: (13920: I3 ^predict-yes N993)
  13524. --- Firing Productions (IE) For State At Depth 1 ---
  13525. --- Inner Elaboration Phase, active level 1 (S1) ---
  13526. Firing monitor*world
  13527. -->
  13528. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13529. --- Change Working Memory (IE) ---
  13530. --- END Application Phase ---
  13531. --- Output Phase ---
  13532. ENV: Agent did: predict-yes for direction R in state State-A
  13533. In State-A moving R
  13534. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13535. predict error 0
  13536. dir: dir isR
  13537. --- END Output Phase ---
  13538. /|\--- Input Phase ---
  13539. =>WM: (13939: I2 ^dir R)
  13540. =>WM: (13938: I2 ^reward 1)
  13541. =>WM: (13937: I2 ^see 1)
  13542. =>WM: (13936: N994 ^status complete)
  13543. <=WM: (13924: I2 ^dir R)
  13544. <=WM: (13923: I2 ^reward 1)
  13545. <=WM: (13922: I2 ^see 1)
  13546. =>WM: (13940: I2 ^level-1 R1-root)
  13547. <=WM: (13925: I2 ^level-1 L1-root)
  13548. --- END Input Phase ---
  13549. --- Proposal Phase ---
  13550. --- Inner Elaboration Phase, active level 1 (S1) ---
  13551. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13552. -->
  13553. (S1 ^operator O1987 = -0.252585164213872)
  13554. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13555. -->
  13556. (S1 ^operator O1988 = 0.7701797310679288)
  13557. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13558. -->
  13559. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13560. -->
  13561. Firing elaborate*copy-see-to-output-link
  13562. -->
  13563. (I3 ^see 1 +)
  13564. Firing elaborate*reward*based*on*reward
  13565. -->
  13566. (R998 ^value 1 +)
  13567. (R1 ^reward R998 +)
  13568. Firing propose*predict-yes
  13569. -->
  13570. (O1989 ^name predict-yes +)
  13571. (S1 ^operator O1989 +)
  13572. Firing propose*predict-no
  13573. -->
  13574. (O1990 ^name predict-no +)
  13575. (S1 ^operator O1990 +)
  13576. Firing rl*prefer*rvt*predict-no*H0*6
  13577. -->
  13578. (S1 ^operator O1988 = 0.2298579596436188)
  13579. Firing rl*prefer*rvt*predict-yes*H0*5
  13580. -->
  13581. (S1 ^operator O1987 = 0.29388734647702)
  13582. Firing prefer*rvt*predict-yes*H0
  13583. -->
  13584. Firing prefer*rvt*predict-no*H0
  13585. -->
  13586. Firing elaborate*copy-dir-to-output-link
  13587. -->
  13588. (I3 ^dir R +)
  13589. inner elaboration loop at bottom goal.
  13590. Retracting elaborate*copy-see-to-output-link
  13591. -->
  13592. (I3 ^see 1 +)
  13593. Retracting propose*predict-no
  13594. -->
  13595. (O1988 ^name predict-no +)
  13596. (S1 ^operator O1988 +)
  13597. Retracting propose*predict-yes
  13598. -->
  13599. (O1987 ^name predict-yes +)
  13600. (S1 ^operator O1987 +)
  13601. Retracting elaborate*reward*based*on*reward
  13602. -->
  13603. (R997 ^value 1 +)
  13604. (R1 ^reward R997 +)
  13605. Retracting elaborate*copy-dir-to-output-link
  13606. -->
  13607. (I3 ^dir R +)
  13608. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  13609. -->
  13610. (S1 ^operator O1988 = -0.1937987592593187)
  13611. Retracting rl*prefer*rvt*predict-no*H0*6
  13612. -->
  13613. (S1 ^operator O1988 = 0.2298579596436188)
  13614. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  13615. -->
  13616. (S1 ^operator O1987 = 0.7063161327052487)
  13617. Retracting rl*prefer*rvt*predict-yes*H0*5
  13618. -->
  13619. (S1 ^operator O1987 = 0.29388734647702)
  13620. =>WM: (13946: S1 ^operator O1990 +)
  13621. =>WM: (13945: S1 ^operator O1989 +)
  13622. =>WM: (13944: O1990 ^name predict-no)
  13623. =>WM: (13943: O1989 ^name predict-yes)
  13624. =>WM: (13942: R998 ^value 1)
  13625. =>WM: (13941: R1 ^reward R998)
  13626. <=WM: (13932: S1 ^operator O1987 +)
  13627. <=WM: (13934: S1 ^operator O1987)
  13628. <=WM: (13933: S1 ^operator O1988 +)
  13629. <=WM: (13927: R1 ^reward R997)
  13630. <=WM: (13930: O1988 ^name predict-no)
  13631. <=WM: (13929: O1987 ^name predict-yes)
  13632. <=WM: (13928: R997 ^value 1)
  13633. --- Inner Elaboration Phase, active level 1 (S1) ---
  13634. Firing prefer*rvt*predict-yes*H0
  13635. -->
  13636. Firing rl*prefer*rvt*predict-yes*H0*5
  13637. -->
  13638. (S1 ^operator O1989 = 0.29388734647702)
  13639. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13640. -->
  13641. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13642. -->
  13643. (S1 ^operator O1989 = -0.252585164213872)
  13644. Firing prefer*rvt*predict-no*H0
  13645. -->
  13646. Firing rl*prefer*rvt*predict-no*H0*6
  13647. -->
  13648. (S1 ^operator O1990 = 0.2298579596436188)
  13649. Firing prefer*rvt*predict-no*H0*6*v1*H1
  13650. -->
  13651. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13652. -->
  13653. (S1 ^operator O1990 = 0.7701797310679288)
  13654. inner elaboration loop at bottom goal.
  13655. Retracting rl*prefer*rvt*predict-no*H0*6
  13656. -->
  13657. (S1 ^operator O1988 = 0.2298579596436188)
  13658. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13659. -->
  13660. (S1 ^operator O1988 = 0.7701797310679288)
  13661. Retracting rl*prefer*rvt*predict-yes*H0*5
  13662. -->
  13663. (S1 ^operator O1987 = 0.29388734647702)
  13664. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13665. -->
  13666. (S1 ^operator O1987 = -0.252585164213872)
  13667. --- END Proposal Phase ---
  13668. --- Decision Phase ---
  13669. RL update rl*prefer*rvt*predict-yes*H0*5 0.500972 -0.207084 0.293887 -> 0.500957 -0.207086 0.293871(R,m,v=1,0.845161,0.131713)
  13670. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499211 0.207105 0.706316 -> 0.499194 0.207103 0.706296(R,m,v=1,1,0)
  13671. =>WM: (13947: S1 ^operator O1990)
  13672. 995: O: O1990 (predict-no)
  13673. --- END Decision Phase ---
  13674. --- Application Phase ---
  13675. --- Firing Productions (PE) For State At Depth 1 ---
  13676. --- Inner Elaboration Phase, active level 1 (S1) ---
  13677. Firing apply*operator
  13678. -->
  13679. (I3 ^predict-no N995 + :O )
  13680. Firing apply*operator*complete
  13681. -->
  13682. (I3 ^predict-yes N994 - :O )
  13683. inner elaboration loop at bottom goal.
  13684. --- Change Working Memory (PE) ---
  13685. =>WM: (13948: I3 ^predict-no N995)
  13686. <=WM: (13936: N994 ^status complete)
  13687. <=WM: (13935: I3 ^predict-yes N994)
  13688. --- Firing Productions (IE) For State At Depth 1 ---
  13689. --- Inner Elaboration Phase, active level 1 (S1) ---
  13690. Firing monitor*world
  13691. -->
  13692. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13693. --- Change Working Memory (IE) ---
  13694. --- END Application Phase ---
  13695. --- Output Phase ---
  13696. ENV: Agent did: predict-no for direction R in state State-B
  13697. In State-B moving R
  13698. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13699. predict error 0
  13700. dir: dir isU
  13701. --- END Output Phase ---
  13702. -/|--- Input Phase ---
  13703. =>WM: (13952: I2 ^dir U)
  13704. =>WM: (13951: I2 ^reward 1)
  13705. =>WM: (13950: I2 ^see 0)
  13706. =>WM: (13949: N995 ^status complete)
  13707. <=WM: (13939: I2 ^dir R)
  13708. <=WM: (13938: I2 ^reward 1)
  13709. <=WM: (13937: I2 ^see 1)
  13710. =>WM: (13953: I2 ^level-1 R0-root)
  13711. <=WM: (13940: I2 ^level-1 R1-root)
  13712. --- END Input Phase ---
  13713. --- Proposal Phase ---
  13714. --- Inner Elaboration Phase, active level 1 (S1) ---
  13715. Firing elaborate*copy-see-to-output-link
  13716. -->
  13717. (I3 ^see 0 +)
  13718. Firing elaborate*reward*based*on*reward
  13719. -->
  13720. (R999 ^value 1 +)
  13721. (R1 ^reward R999 +)
  13722. Firing propose*predict-yes
  13723. -->
  13724. (O1991 ^name predict-yes +)
  13725. (S1 ^operator O1991 +)
  13726. Firing propose*predict-no
  13727. -->
  13728. (O1992 ^name predict-no +)
  13729. (S1 ^operator O1992 +)
  13730. Firing rl*prefer*rvt*predict-no*H0*4
  13731. -->
  13732. (S1 ^operator O1990 = 1.)
  13733. Firing rl*prefer*rvt*predict-yes*H0*3
  13734. -->
  13735. (S1 ^operator O1989 = 0.)
  13736. Firing prefer*rvt*predict-yes*H0
  13737. -->
  13738. Firing prefer*rvt*predict-no*H0
  13739. -->
  13740. Firing elaborate*copy-dir-to-output-link
  13741. -->
  13742. (I3 ^dir U +)
  13743. inner elaboration loop at bottom goal.
  13744. Retracting elaborate*copy-see-to-output-link
  13745. -->
  13746. (I3 ^see 1 +)
  13747. Retracting propose*predict-no
  13748. -->
  13749. (O1990 ^name predict-no +)
  13750. (S1 ^operator O1990 +)
  13751. Retracting propose*predict-yes
  13752. -->
  13753. (O1989 ^name predict-yes +)
  13754. (S1 ^operator O1989 +)
  13755. Retracting elaborate*reward*based*on*reward
  13756. -->
  13757. (R998 ^value 1 +)
  13758. (R1 ^reward R998 +)
  13759. Retracting elaborate*copy-dir-to-output-link
  13760. -->
  13761. (I3 ^dir R +)
  13762. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  13763. -->
  13764. (S1 ^operator O1990 = 0.7701797310679288)
  13765. Retracting rl*prefer*rvt*predict-no*H0*6
  13766. -->
  13767. (S1 ^operator O1990 = 0.2298579596436188)
  13768. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13769. -->
  13770. (S1 ^operator O1989 = -0.252585164213872)
  13771. Retracting rl*prefer*rvt*predict-yes*H0*5
  13772. -->
  13773. (S1 ^operator O1989 = 0.2938705117203769)
  13774. =>WM: (13961: S1 ^operator O1992 +)
  13775. =>WM: (13960: S1 ^operator O1991 +)
  13776. =>WM: (13959: I3 ^dir U)
  13777. =>WM: (13958: O1992 ^name predict-no)
  13778. =>WM: (13957: O1991 ^name predict-yes)
  13779. =>WM: (13956: R999 ^value 1)
  13780. =>WM: (13955: R1 ^reward R999)
  13781. =>WM: (13954: I3 ^see 0)
  13782. <=WM: (13945: S1 ^operator O1989 +)
  13783. <=WM: (13946: S1 ^operator O1990 +)
  13784. <=WM: (13947: S1 ^operator O1990)
  13785. <=WM: (13931: I3 ^dir R)
  13786. <=WM: (13941: R1 ^reward R998)
  13787. <=WM: (13926: I3 ^see 1)
  13788. <=WM: (13944: O1990 ^name predict-no)
  13789. <=WM: (13943: O1989 ^name predict-yes)
  13790. <=WM: (13942: R998 ^value 1)
  13791. --- Inner Elaboration Phase, active level 1 (S1) ---
  13792. Firing prefer*rvt*predict-yes*H0
  13793. -->
  13794. Firing rl*prefer*rvt*predict-yes*H0*3
  13795. -->
  13796. (S1 ^operator O1991 = 0.)
  13797. Firing prefer*rvt*predict-no*H0
  13798. -->
  13799. Firing rl*prefer*rvt*predict-no*H0*4
  13800. -->
  13801. (S1 ^operator O1992 = 1.)
  13802. inner elaboration loop at bottom goal.
  13803. Retracting rl*prefer*rvt*predict-no*H0*4
  13804. -->
  13805. (S1 ^operator O1990 = 1.)
  13806. Retracting rl*prefer*rvt*predict-yes*H0*3
  13807. -->
  13808. (S1 ^operator O1989 = 0.)
  13809. --- END Proposal Phase ---
  13810. --- Decision Phase ---
  13811. RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382052 0.229858 -> 0.611908 -0.382053 0.229855(R,m,v=1,0.845714,0.131232)
  13812. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.38812 0.38206 0.77018 -> 0.388117 0.382059 0.770176(R,m,v=1,1,0)
  13813. =>WM: (13962: S1 ^operator O1992)
  13814. 996: O: O1992 (predict-no)
  13815. --- END Decision Phase ---
  13816. --- Application Phase ---
  13817. --- Firing Productions (PE) For State At Depth 1 ---
  13818. --- Inner Elaboration Phase, active level 1 (S1) ---
  13819. Firing apply*operator
  13820. -->
  13821. (I3 ^predict-no N996 + :O )
  13822. Firing apply*operator*complete
  13823. -->
  13824. (I3 ^predict-no N995 - :O )
  13825. inner elaboration loop at bottom goal.
  13826. --- Change Working Memory (PE) ---
  13827. =>WM: (13963: I3 ^predict-no N996)
  13828. <=WM: (13949: N995 ^status complete)
  13829. <=WM: (13948: I3 ^predict-no N995)
  13830. --- Firing Productions (IE) For State At Depth 1 ---
  13831. --- Inner Elaboration Phase, active level 1 (S1) ---
  13832. Firing monitor*world
  13833. -->
  13834. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13835. --- Change Working Memory (IE) ---
  13836. --- END Application Phase ---
  13837. --- Output Phase ---
  13838. ENV: Agent did: predict-no for direction U in state State-B
  13839. In State-B moving U
  13840. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13841. predict error 0
  13842. dir: dir isU
  13843. --- END Output Phase ---
  13844. \-/--- Input Phase ---
  13845. =>WM: (13967: I2 ^dir U)
  13846. =>WM: (13966: I2 ^reward 1)
  13847. =>WM: (13965: I2 ^see 0)
  13848. =>WM: (13964: N996 ^status complete)
  13849. <=WM: (13952: I2 ^dir U)
  13850. <=WM: (13951: I2 ^reward 1)
  13851. <=WM: (13950: I2 ^see 0)
  13852. =>WM: (13968: I2 ^level-1 R0-root)
  13853. <=WM: (13953: I2 ^level-1 R0-root)
  13854. --- END Input Phase ---
  13855. --- Proposal Phase ---
  13856. --- Inner Elaboration Phase, active level 1 (S1) ---
  13857. Firing elaborate*copy-see-to-output-link
  13858. -->
  13859. (I3 ^see 0 +)
  13860. Firing elaborate*reward*based*on*reward
  13861. -->
  13862. (R1000 ^value 1 +)
  13863. (R1 ^reward R1000 +)
  13864. Firing propose*predict-yes
  13865. -->
  13866. (O1993 ^name predict-yes +)
  13867. (S1 ^operator O1993 +)
  13868. Firing propose*predict-no
  13869. -->
  13870. (O1994 ^name predict-no +)
  13871. (S1 ^operator O1994 +)
  13872. Firing rl*prefer*rvt*predict-no*H0*4
  13873. -->
  13874. (S1 ^operator O1992 = 1.)
  13875. Firing rl*prefer*rvt*predict-yes*H0*3
  13876. -->
  13877. (S1 ^operator O1991 = 0.)
  13878. Firing prefer*rvt*predict-yes*H0
  13879. -->
  13880. Firing prefer*rvt*predict-no*H0
  13881. -->
  13882. Firing elaborate*copy-dir-to-output-link
  13883. -->
  13884. (I3 ^dir U +)
  13885. inner elaboration loop at bottom goal.
  13886. Retracting elaborate*copy-see-to-output-link
  13887. -->
  13888. (I3 ^see 0 +)
  13889. Retracting propose*predict-no
  13890. -->
  13891. (O1992 ^name predict-no +)
  13892. (S1 ^operator O1992 +)
  13893. Retracting propose*predict-yes
  13894. -->
  13895. (O1991 ^name predict-yes +)
  13896. (S1 ^operator O1991 +)
  13897. Retracting elaborate*reward*based*on*reward
  13898. -->
  13899. (R999 ^value 1 +)
  13900. (R1 ^reward R999 +)
  13901. Retracting elaborate*copy-dir-to-output-link
  13902. -->
  13903. (I3 ^dir U +)
  13904. Retracting rl*prefer*rvt*predict-no*H0*4
  13905. -->
  13906. (S1 ^operator O1992 = 1.)
  13907. Retracting rl*prefer*rvt*predict-yes*H0*3
  13908. -->
  13909. (S1 ^operator O1991 = 0.)
  13910. =>WM: (13974: S1 ^operator O1994 +)
  13911. =>WM: (13973: S1 ^operator O1993 +)
  13912. =>WM: (13972: O1994 ^name predict-no)
  13913. =>WM: (13971: O1993 ^name predict-yes)
  13914. =>WM: (13970: R1000 ^value 1)
  13915. =>WM: (13969: R1 ^reward R1000)
  13916. <=WM: (13960: S1 ^operator O1991 +)
  13917. <=WM: (13961: S1 ^operator O1992 +)
  13918. <=WM: (13962: S1 ^operator O1992)
  13919. <=WM: (13955: R1 ^reward R999)
  13920. <=WM: (13958: O1992 ^name predict-no)
  13921. <=WM: (13957: O1991 ^name predict-yes)
  13922. <=WM: (13956: R999 ^value 1)
  13923. --- Inner Elaboration Phase, active level 1 (S1) ---
  13924. Firing prefer*rvt*predict-yes*H0
  13925. -->
  13926. Firing rl*prefer*rvt*predict-yes*H0*3
  13927. -->
  13928. (S1 ^operator O1993 = 0.)
  13929. Firing prefer*rvt*predict-no*H0
  13930. -->
  13931. Firing rl*prefer*rvt*predict-no*H0*4
  13932. -->
  13933. (S1 ^operator O1994 = 1.)
  13934. inner elaboration loop at bottom goal.
  13935. Retracting rl*prefer*rvt*predict-no*H0*4
  13936. -->
  13937. (S1 ^operator O1992 = 1.)
  13938. Retracting rl*prefer*rvt*predict-yes*H0*3
  13939. -->
  13940. (S1 ^operator O1991 = 0.)
  13941. --- END Proposal Phase ---
  13942. --- Decision Phase ---
  13943. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13944. =>WM: (13975: S1 ^operator O1994)
  13945. 997: O: O1994 (predict-no)
  13946. --- END Decision Phase ---
  13947. --- Application Phase ---
  13948. --- Firing Productions (PE) For State At Depth 1 ---
  13949. --- Inner Elaboration Phase, active level 1 (S1) ---
  13950. Firing apply*operator
  13951. -->
  13952. (I3 ^predict-no N997 + :O )
  13953. Firing apply*operator*complete
  13954. -->
  13955. (I3 ^predict-no N996 - :O )
  13956. inner elaboration loop at bottom goal.
  13957. --- Change Working Memory (PE) ---
  13958. =>WM: (13976: I3 ^predict-no N997)
  13959. <=WM: (13964: N996 ^status complete)
  13960. <=WM: (13963: I3 ^predict-no N996)
  13961. --- Firing Productions (IE) For State At Depth 1 ---
  13962. --- Inner Elaboration Phase, active level 1 (S1) ---
  13963. Firing monitor*world
  13964. -->
  13965. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13966. --- Change Working Memory (IE) ---
  13967. --- END Application Phase ---
  13968. --- Output Phase ---
  13969. ENV: Agent did: predict-no for direction U in state State-B
  13970. In State-B moving U
  13971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13972. predict error 0
  13973. dir: dir isL
  13974. --- END Output Phase ---
  13975. |\--- Input Phase ---
  13976. =>WM: (13980: I2 ^dir L)
  13977. =>WM: (13979: I2 ^reward 1)
  13978. =>WM: (13978: I2 ^see 0)
  13979. =>WM: (13977: N997 ^status complete)
  13980. <=WM: (13967: I2 ^dir U)
  13981. <=WM: (13966: I2 ^reward 1)
  13982. <=WM: (13965: I2 ^see 0)
  13983. =>WM: (13981: I2 ^level-1 R0-root)
  13984. <=WM: (13968: I2 ^level-1 R0-root)
  13985. --- END Input Phase ---
  13986. --- Proposal Phase ---
  13987. --- Inner Elaboration Phase, active level 1 (S1) ---
  13988. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  13989. -->
  13990. (S1 ^operator O1993 = 0.6195669380621123)
  13991. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  13992. -->
  13993. (S1 ^operator O1994 = -0.2190661556260421)
  13994. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13995. -->
  13996. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13997. -->
  13998. Firing elaborate*copy-see-to-output-link
  13999. -->
  14000. (I3 ^see 0 +)
  14001. Firing elaborate*reward*based*on*reward
  14002. -->
  14003. (R1001 ^value 1 +)
  14004. (R1 ^reward R1001 +)
  14005. Firing propose*predict-yes
  14006. -->
  14007. (O1995 ^name predict-yes +)
  14008. (S1 ^operator O1995 +)
  14009. Firing propose*predict-no
  14010. -->
  14011. (O1996 ^name predict-no +)
  14012. (S1 ^operator O1996 +)
  14013. Firing rl*prefer*rvt*predict-no*H0*2
  14014. -->
  14015. (S1 ^operator O1994 = 0.3140233963466647)
  14016. Firing rl*prefer*rvt*predict-yes*H0*1
  14017. -->
  14018. (S1 ^operator O1993 = 0.380415072318069)
  14019. Firing prefer*rvt*predict-yes*H0
  14020. -->
  14021. Firing prefer*rvt*predict-no*H0
  14022. -->
  14023. Firing elaborate*copy-dir-to-output-link
  14024. -->
  14025. (I3 ^dir L +)
  14026. inner elaboration loop at bottom goal.
  14027. Retracting elaborate*copy-see-to-output-link
  14028. -->
  14029. (I3 ^see 0 +)
  14030. Retracting propose*predict-no
  14031. -->
  14032. (O1994 ^name predict-no +)
  14033. (S1 ^operator O1994 +)
  14034. Retracting propose*predict-yes
  14035. -->
  14036. (O1993 ^name predict-yes +)
  14037. (S1 ^operator O1993 +)
  14038. Retracting elaborate*reward*based*on*reward
  14039. -->
  14040. (R1000 ^value 1 +)
  14041. (R1 ^reward R1000 +)
  14042. Retracting elaborate*copy-dir-to-output-link
  14043. -->
  14044. (I3 ^dir U +)
  14045. Retracting rl*prefer*rvt*predict-no*H0*4
  14046. -->
  14047. (S1 ^operator O1994 = 1.)
  14048. Retracting rl*prefer*rvt*predict-yes*H0*3
  14049. -->
  14050. (S1 ^operator O1993 = 0.)
  14051. =>WM: (13988: S1 ^operator O1996 +)
  14052. =>WM: (13987: S1 ^operator O1995 +)
  14053. =>WM: (13986: I3 ^dir L)
  14054. =>WM: (13985: O1996 ^name predict-no)
  14055. =>WM: (13984: O1995 ^name predict-yes)
  14056. =>WM: (13983: R1001 ^value 1)
  14057. =>WM: (13982: R1 ^reward R1001)
  14058. <=WM: (13973: S1 ^operator O1993 +)
  14059. <=WM: (13974: S1 ^operator O1994 +)
  14060. <=WM: (13975: S1 ^operator O1994)
  14061. <=WM: (13959: I3 ^dir U)
  14062. <=WM: (13969: R1 ^reward R1000)
  14063. <=WM: (13972: O1994 ^name predict-no)
  14064. <=WM: (13971: O1993 ^name predict-yes)
  14065. <=WM: (13970: R1000 ^value 1)
  14066. --- Inner Elaboration Phase, active level 1 (S1) ---
  14067. Firing prefer*rvt*predict-yes*H0
  14068. -->
  14069. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14070. -->
  14071. (S1 ^operator O1995 = 0.6195669380621123)
  14072. Firing rl*prefer*rvt*predict-yes*H0*1
  14073. -->
  14074. (S1 ^operator O1995 = 0.380415072318069)
  14075. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14076. -->
  14077. Firing prefer*rvt*predict-no*H0
  14078. -->
  14079. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14080. -->
  14081. (S1 ^operator O1996 = -0.2190661556260421)
  14082. Firing rl*prefer*rvt*predict-no*H0*2
  14083. -->
  14084. (S1 ^operator O1996 = 0.3140233963466647)
  14085. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14086. -->
  14087. inner elaboration loop at bottom goal.
  14088. Retracting rl*prefer*rvt*predict-no*H0*2
  14089. -->
  14090. (S1 ^operator O1994 = 0.3140233963466647)
  14091. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14092. -->
  14093. (S1 ^operator O1994 = -0.2190661556260421)
  14094. Retracting rl*prefer*rvt*predict-yes*H0*1
  14095. -->
  14096. (S1 ^operator O1993 = 0.380415072318069)
  14097. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14098. -->
  14099. (S1 ^operator O1993 = 0.6195669380621123)
  14100. --- END Proposal Phase ---
  14101. --- Decision Phase ---
  14102. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14103. =>WM: (13989: S1 ^operator O1995)
  14104. 998: O: O1995 (predict-yes)
  14105. --- END Decision Phase ---
  14106. --- Application Phase ---
  14107. --- Firing Productions (PE) For State At Depth 1 ---
  14108. --- Inner Elaboration Phase, active level 1 (S1) ---
  14109. Firing apply*operator
  14110. -->
  14111. (I3 ^predict-yes N998 + :O )
  14112. Firing apply*operator*complete
  14113. -->
  14114. (I3 ^predict-no N997 - :O )
  14115. inner elaboration loop at bottom goal.
  14116. --- Change Working Memory (PE) ---
  14117. =>WM: (13990: I3 ^predict-yes N998)
  14118. <=WM: (13977: N997 ^status complete)
  14119. <=WM: (13976: I3 ^predict-no N997)
  14120. --- Firing Productions (IE) For State At Depth 1 ---
  14121. --- Inner Elaboration Phase, active level 1 (S1) ---
  14122. Firing monitor*world
  14123. -->
  14124. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14125. --- Change Working Memory (IE) ---
  14126. --- END Application Phase ---
  14127. --- Output Phase ---
  14128. ENV: Agent did: predict-yes for direction L in state State-B
  14129. In State-B moving L
  14130. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14131. predict error 0
  14132. dir: dir isL
  14133. --- END Output Phase ---
  14134. -/|--- Input Phase ---
  14135. =>WM: (13994: I2 ^dir L)
  14136. =>WM: (13993: I2 ^reward 1)
  14137. =>WM: (13992: I2 ^see 1)
  14138. =>WM: (13991: N998 ^status complete)
  14139. <=WM: (13980: I2 ^dir L)
  14140. <=WM: (13979: I2 ^reward 1)
  14141. <=WM: (13978: I2 ^see 0)
  14142. =>WM: (13995: I2 ^level-1 L1-root)
  14143. <=WM: (13981: I2 ^level-1 R0-root)
  14144. --- END Input Phase ---
  14145. --- Proposal Phase ---
  14146. --- Inner Elaboration Phase, active level 1 (S1) ---
  14147. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14148. -->
  14149. (S1 ^operator O1995 = -0.3470159027404986)
  14150. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14151. -->
  14152. (S1 ^operator O1996 = 0.686145215235081)
  14153. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14154. -->
  14155. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14156. -->
  14157. Firing elaborate*copy-see-to-output-link
  14158. -->
  14159. (I3 ^see 1 +)
  14160. Firing elaborate*reward*based*on*reward
  14161. -->
  14162. (R1002 ^value 1 +)
  14163. (R1 ^reward R1002 +)
  14164. Firing propose*predict-yes
  14165. -->
  14166. (O1997 ^name predict-yes +)
  14167. (S1 ^operator O1997 +)
  14168. Firing propose*predict-no
  14169. -->
  14170. (O1998 ^name predict-no +)
  14171. (S1 ^operator O1998 +)
  14172. Firing rl*prefer*rvt*predict-no*H0*2
  14173. -->
  14174. (S1 ^operator O1996 = 0.3140233963466647)
  14175. Firing rl*prefer*rvt*predict-yes*H0*1
  14176. -->
  14177. (S1 ^operator O1995 = 0.380415072318069)
  14178. Firing prefer*rvt*predict-yes*H0
  14179. -->
  14180. Firing prefer*rvt*predict-no*H0
  14181. -->
  14182. Firing elaborate*copy-dir-to-output-link
  14183. -->
  14184. (I3 ^dir L +)
  14185. inner elaboration loop at bottom goal.
  14186. Retracting elaborate*copy-see-to-output-link
  14187. -->
  14188. (I3 ^see 0 +)
  14189. Retracting propose*predict-no
  14190. -->
  14191. (O1996 ^name predict-no +)
  14192. (S1 ^operator O1996 +)
  14193. Retracting propose*predict-yes
  14194. -->
  14195. (O1995 ^name predict-yes +)
  14196. (S1 ^operator O1995 +)
  14197. Retracting elaborate*reward*based*on*reward
  14198. -->
  14199. (R1001 ^value 1 +)
  14200. (R1 ^reward R1001 +)
  14201. Retracting elaborate*copy-dir-to-output-link
  14202. -->
  14203. (I3 ^dir L +)
  14204. Retracting rl*prefer*rvt*predict-no*H0*2
  14205. -->
  14206. (S1 ^operator O1996 = 0.3140233963466647)
  14207. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  14208. -->
  14209. (S1 ^operator O1996 = -0.2190661556260421)
  14210. Retracting rl*prefer*rvt*predict-yes*H0*1
  14211. -->
  14212. (S1 ^operator O1995 = 0.380415072318069)
  14213. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  14214. -->
  14215. (S1 ^operator O1995 = 0.6195669380621123)
  14216. =>WM: (14002: S1 ^operator O1998 +)
  14217. =>WM: (14001: S1 ^operator O1997 +)
  14218. =>WM: (14000: O1998 ^name predict-no)
  14219. =>WM: (13999: O1997 ^name predict-yes)
  14220. =>WM: (13998: R1002 ^value 1)
  14221. =>WM: (13997: R1 ^reward R1002)
  14222. =>WM: (13996: I3 ^see 1)
  14223. <=WM: (13987: S1 ^operator O1995 +)
  14224. <=WM: (13989: S1 ^operator O1995)
  14225. <=WM: (13988: S1 ^operator O1996 +)
  14226. <=WM: (13982: R1 ^reward R1001)
  14227. <=WM: (13954: I3 ^see 0)
  14228. <=WM: (13985: O1996 ^name predict-no)
  14229. <=WM: (13984: O1995 ^name predict-yes)
  14230. <=WM: (13983: R1001 ^value 1)
  14231. --- Inner Elaboration Phase, active level 1 (S1) ---
  14232. Firing prefer*rvt*predict-yes*H0
  14233. -->
  14234. Firing rl*prefer*rvt*predict-yes*H0*1
  14235. -->
  14236. (S1 ^operator O1997 = 0.380415072318069)
  14237. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14238. -->
  14239. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14240. -->
  14241. (S1 ^operator O1997 = -0.3470159027404986)
  14242. Firing prefer*rvt*predict-no*H0
  14243. -->
  14244. Firing rl*prefer*rvt*predict-no*H0*2
  14245. -->
  14246. (S1 ^operator O1998 = 0.3140233963466647)
  14247. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14248. -->
  14249. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14250. -->
  14251. (S1 ^operator O1998 = 0.686145215235081)
  14252. inner elaboration loop at bottom goal.
  14253. Retracting rl*prefer*rvt*predict-no*H0*2
  14254. -->
  14255. (S1 ^operator O1996 = 0.3140233963466647)
  14256. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14257. -->
  14258. (S1 ^operator O1996 = 0.686145215235081)
  14259. Retracting rl*prefer*rvt*predict-yes*H0*1
  14260. -->
  14261. (S1 ^operator O1995 = 0.380415072318069)
  14262. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14263. -->
  14264. (S1 ^operator O1995 = -0.3470159027404986)
  14265. --- END Proposal Phase ---
  14266. --- Decision Phase ---
  14267. RL update rl*prefer*rvt*predict-yes*H0*1 0.521345 -0.14093 0.380415 -> 0.521347 -0.14093 0.380417(R,m,v=1,0.830303,0.141759)
  14268. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478635 0.140932 0.619567 -> 0.478637 0.140932 0.619569(R,m,v=1,1,0)
  14269. =>WM: (14003: S1 ^operator O1998)
  14270. 999: O: O1998 (predict-no)
  14271. --- END Decision Phase ---
  14272. --- Application Phase ---
  14273. --- Firing Productions (PE) For State At Depth 1 ---
  14274. --- Inner Elaboration Phase, active level 1 (S1) ---
  14275. Firing apply*operator
  14276. -->
  14277. (I3 ^predict-no N999 + :O )
  14278. Firing apply*operator*complete
  14279. -->
  14280. (I3 ^predict-yes N998 - :O )
  14281. inner elaboration loop at bottom goal.
  14282. --- Change Working Memory (PE) ---
  14283. =>WM: (14004: I3 ^predict-no N999)
  14284. <=WM: (13991: N998 ^status complete)
  14285. <=WM: (13990: I3 ^predict-yes N998)
  14286. --- Firing Productions (IE) For State At Depth 1 ---
  14287. --- Inner Elaboration Phase, active level 1 (S1) ---
  14288. Firing monitor*world
  14289. -->
  14290. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14291. --- Change Working Memory (IE) ---
  14292. --- END Application Phase ---
  14293. --- Output Phase ---
  14294. ENV: Agent did: predict-no for direction L in state State-A
  14295. In State-A moving L
  14296. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14297. predict error 0
  14298. dir: dir isU
  14299. --- END Output Phase ---
  14300. \-/--- Input Phase ---
  14301. =>WM: (14008: I2 ^dir U)
  14302. =>WM: (14007: I2 ^reward 1)
  14303. =>WM: (14006: I2 ^see 0)
  14304. =>WM: (14005: N999 ^status complete)
  14305. <=WM: (13994: I2 ^dir L)
  14306. <=WM: (13993: I2 ^reward 1)
  14307. <=WM: (13992: I2 ^see 1)
  14308. =>WM: (14009: I2 ^level-1 L0-root)
  14309. <=WM: (13995: I2 ^level-1 L1-root)
  14310. --- END Input Phase ---
  14311. --- Proposal Phase ---
  14312. --- Inner Elaboration Phase, active level 1 (S1) ---
  14313. Firing elaborate*copy-see-to-output-link
  14314. -->
  14315. (I3 ^see 0 +)
  14316. Firing elaborate*reward*based*on*reward
  14317. -->
  14318. (R1003 ^value 1 +)
  14319. (R1 ^reward R1003 +)
  14320. Firing propose*predict-yes
  14321. -->
  14322. (O1999 ^name predict-yes +)
  14323. (S1 ^operator O1999 +)
  14324. Firing propose*predict-no
  14325. -->
  14326. (O2000 ^name predict-no +)
  14327. (S1 ^operator O2000 +)
  14328. Firing rl*prefer*rvt*predict-no*H0*4
  14329. -->
  14330. (S1 ^operator O1998 = 1.)
  14331. Firing rl*prefer*rvt*predict-yes*H0*3
  14332. -->
  14333. (S1 ^operator O1997 = 0.)
  14334. Firing prefer*rvt*predict-yes*H0
  14335. -->
  14336. Firing prefer*rvt*predict-no*H0
  14337. -->
  14338. Firing elaborate*copy-dir-to-output-link
  14339. -->
  14340. (I3 ^dir U +)
  14341. inner elaboration loop at bottom goal.
  14342. Retracting elaborate*copy-see-to-output-link
  14343. -->
  14344. (I3 ^see 1 +)
  14345. Retracting propose*predict-no
  14346. -->
  14347. (O1998 ^name predict-no +)
  14348. (S1 ^operator O1998 +)
  14349. Retracting propose*predict-yes
  14350. -->
  14351. (O1997 ^name predict-yes +)
  14352. (S1 ^operator O1997 +)
  14353. Retracting elaborate*reward*based*on*reward
  14354. -->
  14355. (R1002 ^value 1 +)
  14356. (R1 ^reward R1002 +)
  14357. Retracting elaborate*copy-dir-to-output-link
  14358. -->
  14359. (I3 ^dir L +)
  14360. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14361. -->
  14362. (S1 ^operator O1998 = 0.686145215235081)
  14363. Retracting rl*prefer*rvt*predict-no*H0*2
  14364. -->
  14365. (S1 ^operator O1998 = 0.3140233963466647)
  14366. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14367. -->
  14368. (S1 ^operator O1997 = -0.3470159027404986)
  14369. Retracting rl*prefer*rvt*predict-yes*H0*1
  14370. -->
  14371. (S1 ^operator O1997 = 0.3804165454412648)
  14372. =>WM: (14017: S1 ^operator O2000 +)
  14373. =>WM: (14016: S1 ^operator O1999 +)
  14374. =>WM: (14015: I3 ^dir U)
  14375. =>WM: (14014: O2000 ^name predict-no)
  14376. =>WM: (14013: O1999 ^name predict-yes)
  14377. =>WM: (14012: R1003 ^value 1)
  14378. =>WM: (14011: R1 ^reward R1003)
  14379. =>WM: (14010: I3 ^see 0)
  14380. <=WM: (14001: S1 ^operator O1997 +)
  14381. <=WM: (14002: S1 ^operator O1998 +)
  14382. <=WM: (14003: S1 ^operator O1998)
  14383. <=WM: (13986: I3 ^dir L)
  14384. <=WM: (13997: R1 ^reward R1002)
  14385. <=WM: (13996: I3 ^see 1)
  14386. <=WM: (14000: O1998 ^name predict-no)
  14387. <=WM: (13999: O1997 ^name predict-yes)
  14388. <=WM: (13998: R1002 ^value 1)
  14389. --- Inner Elaboration Phase, active level 1 (S1) ---
  14390. Firing prefer*rvt*predict-yes*H0
  14391. -->
  14392. Firing rl*prefer*rvt*predict-yes*H0*3
  14393. -->
  14394. (S1 ^operator O1999 = 0.)
  14395. Firing prefer*rvt*predict-no*H0
  14396. -->
  14397. Firing rl*prefer*rvt*predict-no*H0*4
  14398. -->
  14399. (S1 ^operator O2000 = 1.)
  14400. inner elaboration loop at bottom goal.
  14401. Retracting rl*prefer*rvt*predict-no*H0*4
  14402. -->
  14403. (S1 ^operator O1998 = 1.)
  14404. Retracting rl*prefer*rvt*predict-yes*H0*3
  14405. -->
  14406. (S1 ^operator O1997 = 0.)
  14407. --- END Proposal Phase ---
  14408. --- Decision Phase ---
  14409. RL update rl*prefer*rvt*predict-no*H0*2 0.485033 -0.171009 0.314023 -> 0.485022 -0.171012 0.314009(R,m,v=1,0.860927,0.12053)
  14410. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.5151 0.171045 0.686145 -> 0.515087 0.171042 0.686129(R,m,v=1,1,0)
  14411. =>WM: (14018: S1 ^operator O2000)
  14412. 1000: O: O2000 (predict-no)
  14413. --- END Decision Phase ---
  14414. --- Application Phase ---
  14415. --- Firing Productions (PE) For State At Depth 1 ---
  14416. --- Inner Elaboration Phase, active level 1 (S1) ---
  14417. Firing apply*operator
  14418. -->
  14419. (I3 ^predict-no N1000 + :O )
  14420. Firing apply*operator*complete
  14421. -->
  14422. (I3 ^predict-no N999 - :O )
  14423. inner elaboration loop at bottom goal.
  14424. --- Change Working Memory (PE) ---
  14425. =>WM: (14019: I3 ^predict-no N1000)
  14426. <=WM: (14005: N999 ^status complete)
  14427. <=WM: (14004: I3 ^predict-no N999)
  14428. --- Firing Productions (IE) For State At Depth 1 ---
  14429. --- Inner Elaboration Phase, active level 1 (S1) ---
  14430. Firing monitor*world
  14431. -->
  14432. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14433. --- Change Working Memory (IE) ---
  14434. --- END Application Phase ---
  14435. --- Output Phase ---
  14436. ENV: Agent did: predict-no for direction U in state State-A
  14437. In State-A moving U
  14438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14439. predict error 0
  14440. dir: dir isR
  14441. --- END Output Phase ---
  14442. |\-/|\-/|\--- Input Phase ---
  14443. =>WM: (14023: I2 ^dir R)
  14444. =>WM: (14022: I2 ^reward 1)
  14445. =>WM: (14021: I2 ^see 0)
  14446. =>WM: (14020: N1000 ^status complete)
  14447. <=WM: (14008: I2 ^dir U)
  14448. <=WM: (14007: I2 ^reward 1)
  14449. <=WM: (14006: I2 ^see 0)
  14450. =>WM: (14024: I2 ^level-1 L0-root)
  14451. <=WM: (14009: I2 ^level-1 L0-root)
  14452. --- END Input Phase ---
  14453. --- Proposal Phase ---
  14454. --- Inner Elaboration Phase, active level 1 (S1) ---
  14455. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14456. -->
  14457. (S1 ^operator O1999 = 0.7055034804752064)
  14458. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14459. -->
  14460. (S1 ^operator O2000 = -0.2023211881870005)
  14461. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14462. -->
  14463. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14464. -->
  14465. Firing elaborate*copy-see-to-output-link
  14466. -->
  14467. (I3 ^see 0 +)
  14468. Firing elaborate*reward*based*on*reward
  14469. -->
  14470. (R1004 ^value 1 +)
  14471. (R1 ^reward R1004 +)
  14472. Firing propose*predict-yes
  14473. -->
  14474. (O2001 ^name predict-yes +)
  14475. (S1 ^operator O2001 +)
  14476. Firing propose*predict-no
  14477. -->
  14478. (O2002 ^name predict-no +)
  14479. (S1 ^operator O2002 +)
  14480. Firing rl*prefer*rvt*predict-no*H0*6
  14481. -->
  14482. (S1 ^operator O2000 = 0.229854902707684)
  14483. Firing rl*prefer*rvt*predict-yes*H0*5
  14484. -->
  14485. (S1 ^operator O1999 = 0.2938705117203769)
  14486. Firing prefer*rvt*predict-yes*H0
  14487. -->
  14488. Firing prefer*rvt*predict-no*H0
  14489. -->
  14490. Firing elaborate*copy-dir-to-output-link
  14491. -->
  14492. (I3 ^dir R +)
  14493. inner elaboration loop at bottom goal.
  14494. Retracting elaborate*copy-see-to-output-link
  14495. -->
  14496. (I3 ^see 0 +)
  14497. Retracting propose*predict-no
  14498. -->
  14499. (O2000 ^name predict-no +)
  14500. (S1 ^operator O2000 +)
  14501. Retracting propose*predict-yes
  14502. -->
  14503. (O1999 ^name predict-yes +)
  14504. (S1 ^operator O1999 +)
  14505. Retracting elaborate*reward*based*on*reward
  14506. -->
  14507. (R1003 ^value 1 +)
  14508. (R1 ^reward R1003 +)
  14509. Retracting elaborate*copy-dir-to-output-link
  14510. -->
  14511. (I3 ^dir U +)
  14512. Retracting rl*prefer*rvt*predict-no*H0*4
  14513. -->
  14514. (S1 ^operator O2000 = 1.)
  14515. Retracting rl*prefer*rvt*predict-yes*H0*3
  14516. -->
  14517. (S1 ^operator O1999 = 0.)
  14518. =>WM: (14031: S1 ^operator O2002 +)
  14519. =>WM: (14030: S1 ^operator O2001 +)
  14520. =>WM: (14029: I3 ^dir R)
  14521. =>WM: (14028: O2002 ^name predict-no)
  14522. =>WM: (14027: O2001 ^name predict-yes)
  14523. =>WM: (14026: R1004 ^value 1)
  14524. =>WM: (14025: R1 ^reward R1004)
  14525. <=WM: (14016: S1 ^operator O1999 +)
  14526. <=WM: (14017: S1 ^operator O2000 +)
  14527. <=WM: (14018: S1 ^operator O2000)
  14528. <=WM: (14015: I3 ^dir U)
  14529. <=WM: (14011: R1 ^reward R1003)
  14530. <=WM: (14014: O2000 ^name predict-no)
  14531. <=WM: (14013: O1999 ^name predict-yes)
  14532. <=WM: (14012: R1003 ^value 1)
  14533. --- Inner Elaboration Phase, active level 1 (S1) ---
  14534. Firing prefer*rvt*predict-yes*H0
  14535. -->
  14536. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14537. -->
  14538. (S1 ^operator O2001 = 0.7055034804752064)
  14539. Firing rl*prefer*rvt*predict-yes*H0*5
  14540. -->
  14541. (S1 ^operator O2001 = 0.2938705117203769)
  14542. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14543. -->
  14544. Firing prefer*rvt*predict-no*H0
  14545. -->
  14546. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14547. -->
  14548. (S1 ^operator O2002 = -0.2023211881870005)
  14549. Firing rl*prefer*rvt*predict-no*H0*6
  14550. -->
  14551. (S1 ^operator O2002 = 0.229854902707684)
  14552. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14553. -->
  14554. inner elaboration loop at bottom goal.
  14555. Retracting rl*prefer*rvt*predict-no*H0*6
  14556. -->
  14557. (S1 ^operator O2000 = 0.229854902707684)
  14558. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14559. -->
  14560. (S1 ^operator O2000 = -0.2023211881870005)
  14561. Retracting rl*prefer*rvt*predict-yes*H0*5
  14562. -->
  14563. (S1 ^operator O1999 = 0.2938705117203769)
  14564. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14565. -->
  14566. (S1 ^operator O1999 = 0.7055034804752064)
  14567. --- END Proposal Phase ---
  14568. --- Decision Phase ---
  14569. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14570. =>WM: (14032: S1 ^operator O2001)
  14571. 1001: O: O2001 (predict-yes)
  14572. --- END Decision Phase ---
  14573. --- Application Phase ---
  14574. --- Firing Productions (PE) For State At Depth 1 ---
  14575. --- Inner Elaboration Phase, active level 1 (S1) ---
  14576. Firing apply*operator
  14577. -->
  14578. (I3 ^predict-yes N1001 + :O )
  14579. Firing apply*operator*complete
  14580. -->
  14581. (I3 ^predict-no N1000 - :O )
  14582. inner elaboration loop at bottom goal.
  14583. --- Change Working Memory (PE) ---
  14584. =>WM: (14033: I3 ^predict-yes N1001)
  14585. <=WM: (14020: N1000 ^status complete)
  14586. <=WM: (14019: I3 ^predict-no N1000)
  14587. --- Firing Productions (IE) For State At Depth 1 ---
  14588. --- Inner Elaboration Phase, active level 1 (S1) ---
  14589. Firing monitor*world
  14590. -->
  14591. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14592. --- Change Working Memory (IE) ---
  14593. --- END Application Phase ---
  14594. --- Output Phase ---
  14595. ENV: Agent did: predict-yes for direction R in state State-A
  14596. In State-A moving R
  14597. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14598. predict error 0
  14599. dir: dir isL
  14600. --- END Output Phase ---
  14601. ---- Input Phase ---
  14602. =>WM: (14037: I2 ^dir L)
  14603. =>WM: (14036: I2 ^reward 1)
  14604. =>WM: (14035: I2 ^see 1)
  14605. =>WM: (14034: N1001 ^status complete)
  14606. <=WM: (14023: I2 ^dir R)
  14607. <=WM: (14022: I2 ^reward 1)
  14608. <=WM: (14021: I2 ^see 0)
  14609. =>WM: (14038: I2 ^level-1 R1-root)
  14610. <=WM: (14024: I2 ^level-1 L0-root)
  14611. --- END Input Phase ---
  14612. --- Proposal Phase ---
  14613. --- Inner Elaboration Phase, active level 1 (S1) ---
  14614. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14615. -->
  14616. (S1 ^operator O2001 = 0.6196100460529347)
  14617. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14618. -->
  14619. (S1 ^operator O2002 = -0.1479504104026684)
  14620. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14621. -->
  14622. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14623. -->
  14624. Firing elaborate*copy-see-to-output-link
  14625. -->
  14626. (I3 ^see 1 +)
  14627. Firing elaborate*reward*based*on*reward
  14628. -->
  14629. (R1005 ^value 1 +)
  14630. (R1 ^reward R1005 +)
  14631. Firing propose*predict-yes
  14632. -->
  14633. (O2003 ^name predict-yes +)
  14634. (S1 ^operator O2003 +)
  14635. Firing propose*predict-no
  14636. -->
  14637. (O2004 ^name predict-no +)
  14638. (S1 ^operator O2004 +)
  14639. Firing rl*prefer*rvt*predict-no*H0*2
  14640. -->
  14641. (S1 ^operator O2002 = 0.3140093857317092)
  14642. Firing rl*prefer*rvt*predict-yes*H0*1
  14643. -->
  14644. (S1 ^operator O2001 = 0.3804165454412648)
  14645. Firing prefer*rvt*predict-yes*H0
  14646. -->
  14647. Firing prefer*rvt*predict-no*H0
  14648. -->
  14649. Firing elaborate*copy-dir-to-output-link
  14650. -->
  14651. (I3 ^dir L +)
  14652. inner elaboration loop at bottom goal.
  14653. Retracting elaborate*copy-see-to-output-link
  14654. -->
  14655. (I3 ^see 0 +)
  14656. Retracting propose*predict-no
  14657. -->
  14658. (O2002 ^name predict-no +)
  14659. (S1 ^operator O2002 +)
  14660. Retracting propose*predict-yes
  14661. -->
  14662. (O2001 ^name predict-yes +)
  14663. (S1 ^operator O2001 +)
  14664. Retracting elaborate*reward*based*on*reward
  14665. -->
  14666. (R1004 ^value 1 +)
  14667. (R1 ^reward R1004 +)
  14668. Retracting elaborate*copy-dir-to-output-link
  14669. -->
  14670. (I3 ^dir R +)
  14671. Retracting rl*prefer*rvt*predict-no*H0*6
  14672. -->
  14673. (S1 ^operator O2002 = 0.229854902707684)
  14674. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14675. -->
  14676. (S1 ^operator O2002 = -0.2023211881870005)
  14677. Retracting rl*prefer*rvt*predict-yes*H0*5
  14678. -->
  14679. (S1 ^operator O2001 = 0.2938705117203769)
  14680. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14681. -->
  14682. (S1 ^operator O2001 = 0.7055034804752064)
  14683. =>WM: (14046: S1 ^operator O2004 +)
  14684. =>WM: (14045: S1 ^operator O2003 +)
  14685. =>WM: (14044: I3 ^dir L)
  14686. =>WM: (14043: O2004 ^name predict-no)
  14687. =>WM: (14042: O2003 ^name predict-yes)
  14688. =>WM: (14041: R1005 ^value 1)
  14689. =>WM: (14040: R1 ^reward R1005)
  14690. =>WM: (14039: I3 ^see 1)
  14691. <=WM: (14030: S1 ^operator O2001 +)
  14692. <=WM: (14032: S1 ^operator O2001)
  14693. <=WM: (14031: S1 ^operator O2002 +)
  14694. <=WM: (14029: I3 ^dir R)
  14695. <=WM: (14025: R1 ^reward R1004)
  14696. <=WM: (14010: I3 ^see 0)
  14697. <=WM: (14028: O2002 ^name predict-no)
  14698. <=WM: (14027: O2001 ^name predict-yes)
  14699. <=WM: (14026: R1004 ^value 1)
  14700. --- Inner Elaboration Phase, active level 1 (S1) ---
  14701. Firing prefer*rvt*predict-yes*H0
  14702. -->
  14703. Firing rl*prefer*rvt*predict-yes*H0*1
  14704. -->
  14705. (S1 ^operator O2003 = 0.3804165454412648)
  14706. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14707. -->
  14708. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14709. -->
  14710. (S1 ^operator O2003 = 0.6196100460529347)
  14711. Firing prefer*rvt*predict-no*H0
  14712. -->
  14713. Firing rl*prefer*rvt*predict-no*H0*2
  14714. -->
  14715. (S1 ^operator O2004 = 0.3140093857317092)
  14716. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14717. -->
  14718. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14719. -->
  14720. (S1 ^operator O2004 = -0.1479504104026684)
  14721. inner elaboration loop at bottom goal.
  14722. Retracting rl*prefer*rvt*predict-no*H0*2
  14723. -->
  14724. (S1 ^operator O2002 = 0.3140093857317092)
  14725. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14726. -->
  14727. (S1 ^operator O2002 = -0.1479504104026684)
  14728. Retracting rl*prefer*rvt*predict-yes*H0*1
  14729. -->
  14730. (S1 ^operator O2001 = 0.3804165454412648)
  14731. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14732. -->
  14733. (S1 ^operator O2001 = 0.6196100460529347)
  14734. --- END Proposal Phase ---
  14735. --- Decision Phase ---
  14736. RL update rl*prefer*rvt*predict-yes*H0*5 0.500957 -0.207086 0.293871 -> 0.501003 -0.207081 0.293922(R,m,v=1,0.846154,0.131017)
  14737. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498477 0.207026 0.705503 -> 0.498533 0.207032 0.705565(R,m,v=1,1,0)
  14738. =>WM: (14047: S1 ^operator O2003)
  14739. 1002: O: O2003 (predict-yes)
  14740. --- END Decision Phase ---
  14741. --- Application Phase ---
  14742. --- Firing Productions (PE) For State At Depth 1 ---
  14743. --- Inner Elaboration Phase, active level 1 (S1) ---
  14744. Firing apply*operator
  14745. -->
  14746. (I3 ^predict-yes N1002 + :O )
  14747. Firing apply*operator*complete
  14748. -->
  14749. (I3 ^predict-yes N1001 - :O )
  14750. inner elaboration loop at bottom goal.
  14751. --- Change Working Memory (PE) ---
  14752. =>WM: (14048: I3 ^predict-yes N1002)
  14753. <=WM: (14034: N1001 ^status complete)
  14754. <=WM: (14033: I3 ^predict-yes N1001)
  14755. --- Firing Productions (IE) For State At Depth 1 ---
  14756. --- Inner Elaboration Phase, active level 1 (S1) ---
  14757. Firing monitor*world
  14758. -->
  14759. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14760. --- Change Working Memory (IE) ---
  14761. --- END Application Phase ---
  14762. --- Output Phase ---
  14763. ENV: Agent did: predict-yes for direction L in state State-B
  14764. In State-B moving L
  14765. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14766. predict error 0
  14767. dir: dir isL
  14768. --- END Output Phase ---
  14769. /|\--- Input Phase ---
  14770. =>WM: (14052: I2 ^dir L)
  14771. =>WM: (14051: I2 ^reward 1)
  14772. =>WM: (14050: I2 ^see 1)
  14773. =>WM: (14049: N1002 ^status complete)
  14774. <=WM: (14037: I2 ^dir L)
  14775. <=WM: (14036: I2 ^reward 1)
  14776. <=WM: (14035: I2 ^see 1)
  14777. =>WM: (14053: I2 ^level-1 L1-root)
  14778. <=WM: (14038: I2 ^level-1 R1-root)
  14779. --- END Input Phase ---
  14780. --- Proposal Phase ---
  14781. --- Inner Elaboration Phase, active level 1 (S1) ---
  14782. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14783. -->
  14784. (S1 ^operator O2003 = -0.3470159027404986)
  14785. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14786. -->
  14787. (S1 ^operator O2004 = 0.6861287198581429)
  14788. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14789. -->
  14790. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14791. -->
  14792. Firing elaborate*copy-see-to-output-link
  14793. -->
  14794. (I3 ^see 1 +)
  14795. Firing elaborate*reward*based*on*reward
  14796. -->
  14797. (R1006 ^value 1 +)
  14798. (R1 ^reward R1006 +)
  14799. Firing propose*predict-yes
  14800. -->
  14801. (O2005 ^name predict-yes +)
  14802. (S1 ^operator O2005 +)
  14803. Firing propose*predict-no
  14804. -->
  14805. (O2006 ^name predict-no +)
  14806. (S1 ^operator O2006 +)
  14807. Firing rl*prefer*rvt*predict-no*H0*2
  14808. -->
  14809. (S1 ^operator O2004 = 0.3140093857317092)
  14810. Firing rl*prefer*rvt*predict-yes*H0*1
  14811. -->
  14812. (S1 ^operator O2003 = 0.3804165454412648)
  14813. Firing prefer*rvt*predict-yes*H0
  14814. -->
  14815. Firing prefer*rvt*predict-no*H0
  14816. -->
  14817. Firing elaborate*copy-dir-to-output-link
  14818. -->
  14819. (I3 ^dir L +)
  14820. inner elaboration loop at bottom goal.
  14821. Retracting elaborate*copy-see-to-output-link
  14822. -->
  14823. (I3 ^see 1 +)
  14824. Retracting propose*predict-no
  14825. -->
  14826. (O2004 ^name predict-no +)
  14827. (S1 ^operator O2004 +)
  14828. Retracting propose*predict-yes
  14829. -->
  14830. (O2003 ^name predict-yes +)
  14831. (S1 ^operator O2003 +)
  14832. Retracting elaborate*reward*based*on*reward
  14833. -->
  14834. (R1005 ^value 1 +)
  14835. (R1 ^reward R1005 +)
  14836. Retracting elaborate*copy-dir-to-output-link
  14837. -->
  14838. (I3 ^dir L +)
  14839. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  14840. -->
  14841. (S1 ^operator O2004 = -0.1479504104026684)
  14842. Retracting rl*prefer*rvt*predict-no*H0*2
  14843. -->
  14844. (S1 ^operator O2004 = 0.3140093857317092)
  14845. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  14846. -->
  14847. (S1 ^operator O2003 = 0.6196100460529347)
  14848. Retracting rl*prefer*rvt*predict-yes*H0*1
  14849. -->
  14850. (S1 ^operator O2003 = 0.3804165454412648)
  14851. =>WM: (14059: S1 ^operator O2006 +)
  14852. =>WM: (14058: S1 ^operator O2005 +)
  14853. =>WM: (14057: O2006 ^name predict-no)
  14854. =>WM: (14056: O2005 ^name predict-yes)
  14855. =>WM: (14055: R1006 ^value 1)
  14856. =>WM: (14054: R1 ^reward R1006)
  14857. <=WM: (14045: S1 ^operator O2003 +)
  14858. <=WM: (14047: S1 ^operator O2003)
  14859. <=WM: (14046: S1 ^operator O2004 +)
  14860. <=WM: (14040: R1 ^reward R1005)
  14861. <=WM: (14043: O2004 ^name predict-no)
  14862. <=WM: (14042: O2003 ^name predict-yes)
  14863. <=WM: (14041: R1005 ^value 1)
  14864. --- Inner Elaboration Phase, active level 1 (S1) ---
  14865. Firing prefer*rvt*predict-yes*H0
  14866. -->
  14867. Firing rl*prefer*rvt*predict-yes*H0*1
  14868. -->
  14869. (S1 ^operator O2005 = 0.3804165454412648)
  14870. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14871. -->
  14872. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14873. -->
  14874. (S1 ^operator O2005 = -0.3470159027404986)
  14875. Firing prefer*rvt*predict-no*H0
  14876. -->
  14877. Firing rl*prefer*rvt*predict-no*H0*2
  14878. -->
  14879. (S1 ^operator O2006 = 0.3140093857317092)
  14880. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14881. -->
  14882. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14883. -->
  14884. (S1 ^operator O2006 = 0.6861287198581429)
  14885. inner elaboration loop at bottom goal.
  14886. Retracting rl*prefer*rvt*predict-no*H0*2
  14887. -->
  14888. (S1 ^operator O2004 = 0.3140093857317092)
  14889. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  14890. -->
  14891. (S1 ^operator O2004 = 0.6861287198581429)
  14892. Retracting rl*prefer*rvt*predict-yes*H0*1
  14893. -->
  14894. (S1 ^operator O2003 = 0.3804165454412648)
  14895. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  14896. -->
  14897. (S1 ^operator O2003 = -0.3470159027404986)
  14898. --- END Proposal Phase ---
  14899. --- Decision Phase ---
  14900. RL update rl*prefer*rvt*predict-yes*H0*1 0.521347 -0.14093 0.380417 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.831325,0.141073)
  14901. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478682 0.140928 0.61961 -> 0.47868 0.140928 0.619607(R,m,v=1,1,0)
  14902. =>WM: (14060: S1 ^operator O2006)
  14903. 1003: O: O2006 (predict-no)
  14904. --- END Decision Phase ---
  14905. --- Application Phase ---
  14906. --- Firing Productions (PE) For State At Depth 1 ---
  14907. --- Inner Elaboration Phase, active level 1 (S1) ---
  14908. Firing apply*operator
  14909. -->
  14910. (I3 ^predict-no N1003 + :O )
  14911. Firing apply*operator*complete
  14912. -->
  14913. (I3 ^predict-yes N1002 - :O )
  14914. inner elaboration loop at bottom goal.
  14915. --- Change Working Memory (PE) ---
  14916. =>WM: (14061: I3 ^predict-no N1003)
  14917. <=WM: (14049: N1002 ^status complete)
  14918. <=WM: (14048: I3 ^predict-yes N1002)
  14919. --- Firing Productions (IE) For State At Depth 1 ---
  14920. --- Inner Elaboration Phase, active level 1 (S1) ---
  14921. Firing monitor*world
  14922. -->
  14923. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14924. --- Change Working Memory (IE) ---
  14925. --- END Application Phase ---
  14926. --- Output Phase ---
  14927. ENV: Agent did: predict-no for direction L in state State-A
  14928. In State-A moving L
  14929. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14930. predict error 0
  14931. dir: dir isR
  14932. --- END Output Phase ---
  14933. -/--- Input Phase ---
  14934. =>WM: (14065: I2 ^dir R)
  14935. =>WM: (14064: I2 ^reward 1)
  14936. =>WM: (14063: I2 ^see 0)
  14937. =>WM: (14062: N1003 ^status complete)
  14938. <=WM: (14052: I2 ^dir L)
  14939. <=WM: (14051: I2 ^reward 1)
  14940. <=WM: (14050: I2 ^see 1)
  14941. =>WM: (14066: I2 ^level-1 L0-root)
  14942. <=WM: (14053: I2 ^level-1 L1-root)
  14943. --- END Input Phase ---
  14944. --- Proposal Phase ---
  14945. --- Inner Elaboration Phase, active level 1 (S1) ---
  14946. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14947. -->
  14948. (S1 ^operator O2005 = 0.7055651252992311)
  14949. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  14950. -->
  14951. (S1 ^operator O2006 = -0.2023211881870005)
  14952. Firing prefer*rvt*predict-no*H0*6*v1*H1
  14953. -->
  14954. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14955. -->
  14956. Firing elaborate*copy-see-to-output-link
  14957. -->
  14958. (I3 ^see 0 +)
  14959. Firing elaborate*reward*based*on*reward
  14960. -->
  14961. (R1007 ^value 1 +)
  14962. (R1 ^reward R1007 +)
  14963. Firing propose*predict-yes
  14964. -->
  14965. (O2007 ^name predict-yes +)
  14966. (S1 ^operator O2007 +)
  14967. Firing propose*predict-no
  14968. -->
  14969. (O2008 ^name predict-no +)
  14970. (S1 ^operator O2008 +)
  14971. Firing rl*prefer*rvt*predict-no*H0*6
  14972. -->
  14973. (S1 ^operator O2006 = 0.229854902707684)
  14974. Firing rl*prefer*rvt*predict-yes*H0*5
  14975. -->
  14976. (S1 ^operator O2005 = 0.2939222491339341)
  14977. Firing prefer*rvt*predict-yes*H0
  14978. -->
  14979. Firing prefer*rvt*predict-no*H0
  14980. -->
  14981. Firing elaborate*copy-dir-to-output-link
  14982. -->
  14983. (I3 ^dir R +)
  14984. inner elaboration loop at bottom goal.
  14985. Retracting elaborate*copy-see-to-output-link
  14986. -->
  14987. (I3 ^see 1 +)
  14988. Retracting propose*predict-no
  14989. -->
  14990. (O2006 ^name predict-no +)
  14991. (S1 ^operator O2006 +)
  14992. Retracting propose*predict-yes
  14993. -->
  14994. (O2005 ^name predict-yes +)
  14995. (S1 ^operator O2005 +)
  14996. Retracting elaborate*reward*based*on*reward
  14997. -->
  14998. (R1006 ^value 1 +)
  14999. (R1 ^reward R1006 +)
  15000. Retracting elaborate*copy-dir-to-output-link
  15001. -->
  15002. (I3 ^dir L +)
  15003. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  15004. -->
  15005. (S1 ^operator O2006 = 0.6861287198581429)
  15006. Retracting rl*prefer*rvt*predict-no*H0*2
  15007. -->
  15008. (S1 ^operator O2006 = 0.3140093857317092)
  15009. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  15010. -->
  15011. (S1 ^operator O2005 = -0.3470159027404986)
  15012. Retracting rl*prefer*rvt*predict-yes*H0*1
  15013. -->
  15014. (S1 ^operator O2005 = 0.380414370085626)
  15015. =>WM: (14074: S1 ^operator O2008 +)
  15016. =>WM: (14073: S1 ^operator O2007 +)
  15017. =>WM: (14072: I3 ^dir R)
  15018. =>WM: (14071: O2008 ^name predict-no)
  15019. =>WM: (14070: O2007 ^name predict-yes)
  15020. =>WM: (14069: R1007 ^value 1)
  15021. =>WM: (14068: R1 ^reward R1007)
  15022. =>WM: (14067: I3 ^see 0)
  15023. <=WM: (14058: S1 ^operator O2005 +)
  15024. <=WM: (14059: S1 ^operator O2006 +)
  15025. <=WM: (14060: S1 ^operator O2006)
  15026. <=WM: (14044: I3 ^dir L)
  15027. <=WM: (14054: R1 ^reward R1006)
  15028. <=WM: (14039: I3 ^see 1)
  15029. <=WM: (14057: O2006 ^name predict-no)
  15030. <=WM: (14056: O2005 ^name predict-yes)
  15031. <=WM: (14055: R1006 ^value 1)
  15032. --- Inner Elaboration Phase, active level 1 (S1) ---
  15033. Firing prefer*rvt*predict-yes*H0
  15034. -->
  15035. Firing rl*prefer*rvt*predict-yes*H0*5
  15036. -->
  15037. (S1 ^operator O2007 = 0.2939222491339341)
  15038. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15039. -->
  15040. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15041. -->
  15042. (S1 ^operator O2007 = 0.7055651252992311)
  15043. Firing prefer*rvt*predict-no*H0
  15044. -->
  15045. Firing rl*prefer*rvt*predict-no*H0*6
  15046. -->
  15047. (S1 ^operator O2008 = 0.229854902707684)
  15048. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15049. -->
  15050. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15051. -->
  15052. (S1 ^operator O2008 = -0.2023211881870005)
  15053. inner elaboration loop at bottom goal.
  15054. Retracting rl*prefer*rvt*predict-no*H0*6
  15055. -->
  15056. (S1 ^operator O2006 = 0.229854902707684)
  15057. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15058. -->
  15059. (S1 ^operator O2006 = -0.2023211881870005)
  15060. Retracting rl*prefer*rvt*predict-yes*H0*5
  15061. -->
  15062. (S1 ^operator O2005 = 0.2939222491339341)
  15063. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15064. -->
  15065. (S1 ^operator O2005 = 0.7055651252992311)
  15066. --- END Proposal Phase ---
  15067. --- Decision Phase ---
  15068. RL update rl*prefer*rvt*predict-no*H0*2 0.485022 -0.171012 0.314009 -> 0.485013 -0.171015 0.313998(R,m,v=1,0.861842,0.119859)
  15069. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515087 0.171042 0.686129 -> 0.515077 0.171039 0.686115(R,m,v=1,1,0)
  15070. =>WM: (14075: S1 ^operator O2007)
  15071. 1004: O: O2007 (predict-yes)
  15072. --- END Decision Phase ---
  15073. --- Application Phase ---
  15074. --- Firing Productions (PE) For State At Depth 1 ---
  15075. --- Inner Elaboration Phase, active level 1 (S1) ---
  15076. Firing apply*operator
  15077. -->
  15078. (I3 ^predict-yes N1004 + :O )
  15079. Firing apply*operator*complete
  15080. -->
  15081. (I3 ^predict-no N1003 - :O )
  15082. inner elaboration loop at bottom goal.
  15083. --- Change Working Memory (PE) ---
  15084. =>WM: (14076: I3 ^predict-yes N1004)
  15085. <=WM: (14062: N1003 ^status complete)
  15086. <=WM: (14061: I3 ^predict-no N1003)
  15087. --- Firing Productions (IE) For State At Depth 1 ---
  15088. --- Inner Elaboration Phase, active level 1 (S1) ---
  15089. Firing monitor*world
  15090. -->
  15091. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15092. --- Change Working Memory (IE) ---
  15093. --- END Application Phase ---
  15094. --- Output Phase ---
  15095. ENV: Agent did: predict-yes for direction R in state State-A
  15096. In State-A moving R
  15097. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15098. predict error 0
  15099. dir: dir isR
  15100. --- END Output Phase ---
  15101. |\---- Input Phase ---
  15102. =>WM: (14080: I2 ^dir R)
  15103. =>WM: (14079: I2 ^reward 1)
  15104. =>WM: (14078: I2 ^see 1)
  15105. =>WM: (14077: N1004 ^status complete)
  15106. <=WM: (14065: I2 ^dir R)
  15107. <=WM: (14064: I2 ^reward 1)
  15108. <=WM: (14063: I2 ^see 0)
  15109. =>WM: (14081: I2 ^level-1 R1-root)
  15110. <=WM: (14066: I2 ^level-1 L0-root)
  15111. --- END Input Phase ---
  15112. --- Proposal Phase ---
  15113. --- Inner Elaboration Phase, active level 1 (S1) ---
  15114. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15115. -->
  15116. (S1 ^operator O2007 = -0.252585164213872)
  15117. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15118. -->
  15119. (S1 ^operator O2008 = 0.7701760437619466)
  15120. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15121. -->
  15122. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15123. -->
  15124. Firing elaborate*copy-see-to-output-link
  15125. -->
  15126. (I3 ^see 1 +)
  15127. Firing elaborate*reward*based*on*reward
  15128. -->
  15129. (R1008 ^value 1 +)
  15130. (R1 ^reward R1008 +)
  15131. Firing propose*predict-yes
  15132. -->
  15133. (O2009 ^name predict-yes +)
  15134. (S1 ^operator O2009 +)
  15135. Firing propose*predict-no
  15136. -->
  15137. (O2010 ^name predict-no +)
  15138. (S1 ^operator O2010 +)
  15139. Firing rl*prefer*rvt*predict-no*H0*6
  15140. -->
  15141. (S1 ^operator O2008 = 0.229854902707684)
  15142. Firing rl*prefer*rvt*predict-yes*H0*5
  15143. -->
  15144. (S1 ^operator O2007 = 0.2939222491339341)
  15145. Firing prefer*rvt*predict-yes*H0
  15146. -->
  15147. Firing prefer*rvt*predict-no*H0
  15148. -->
  15149. Firing elaborate*copy-dir-to-output-link
  15150. -->
  15151. (I3 ^dir R +)
  15152. inner elaboration loop at bottom goal.
  15153. Retracting elaborate*copy-see-to-output-link
  15154. -->
  15155. (I3 ^see 0 +)
  15156. Retracting propose*predict-no
  15157. -->
  15158. (O2008 ^name predict-no +)
  15159. (S1 ^operator O2008 +)
  15160. Retracting propose*predict-yes
  15161. -->
  15162. (O2007 ^name predict-yes +)
  15163. (S1 ^operator O2007 +)
  15164. Retracting elaborate*reward*based*on*reward
  15165. -->
  15166. (R1007 ^value 1 +)
  15167. (R1 ^reward R1007 +)
  15168. Retracting elaborate*copy-dir-to-output-link
  15169. -->
  15170. (I3 ^dir R +)
  15171. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  15172. -->
  15173. (S1 ^operator O2008 = -0.2023211881870005)
  15174. Retracting rl*prefer*rvt*predict-no*H0*6
  15175. -->
  15176. (S1 ^operator O2008 = 0.229854902707684)
  15177. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15178. -->
  15179. (S1 ^operator O2007 = 0.7055651252992311)
  15180. Retracting rl*prefer*rvt*predict-yes*H0*5
  15181. -->
  15182. (S1 ^operator O2007 = 0.2939222491339341)
  15183. =>WM: (14088: S1 ^operator O2010 +)
  15184. =>WM: (14087: S1 ^operator O2009 +)
  15185. =>WM: (14086: O2010 ^name predict-no)
  15186. =>WM: (14085: O2009 ^name predict-yes)
  15187. =>WM: (14084: R1008 ^value 1)
  15188. =>WM: (14083: R1 ^reward R1008)
  15189. =>WM: (14082: I3 ^see 1)
  15190. <=WM: (14073: S1 ^operator O2007 +)
  15191. <=WM: (14075: S1 ^operator O2007)
  15192. <=WM: (14074: S1 ^operator O2008 +)
  15193. <=WM: (14068: R1 ^reward R1007)
  15194. <=WM: (14067: I3 ^see 0)
  15195. <=WM: (14071: O2008 ^name predict-no)
  15196. <=WM: (14070: O2007 ^name predict-yes)
  15197. <=WM: (14069: R1007 ^value 1)
  15198. --- Inner Elaboration Phase, active level 1 (S1) ---
  15199. Firing prefer*rvt*predict-yes*H0
  15200. -->
  15201. Firing rl*prefer*rvt*predict-yes*H0*5
  15202. -->
  15203. (S1 ^operator O2009 = 0.2939222491339341)
  15204. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15205. -->
  15206. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15207. -->
  15208. (S1 ^operator O2009 = -0.252585164213872)
  15209. Firing prefer*rvt*predict-no*H0
  15210. -->
  15211. Firing rl*prefer*rvt*predict-no*H0*6
  15212. -->
  15213. (S1 ^operator O2010 = 0.229854902707684)
  15214. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15215. -->
  15216. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15217. -->
  15218. (S1 ^operator O2010 = 0.7701760437619466)
  15219. inner elaboration loop at bottom goal.
  15220. Retracting rl*prefer*rvt*predict-no*H0*6
  15221. -->
  15222. (S1 ^operator O2008 = 0.229854902707684)
  15223. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15224. -->
  15225. (S1 ^operator O2008 = 0.7701760437619466)
  15226. Retracting rl*prefer*rvt*predict-yes*H0*5
  15227. -->
  15228. (S1 ^operator O2007 = 0.2939222491339341)
  15229. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15230. -->
  15231. (S1 ^operator O2007 = -0.252585164213872)
  15232. --- END Proposal Phase ---
  15233. --- Decision Phase ---
  15234. RL update rl*prefer*rvt*predict-yes*H0*5 0.501003 -0.207081 0.293922 -> 0.501042 -0.207077 0.293965(R,m,v=1,0.847134,0.130328)
  15235. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498533 0.207032 0.705565 -> 0.498578 0.207037 0.705615(R,m,v=1,1,0)
  15236. =>WM: (14089: S1 ^operator O2010)
  15237. 1005: O: O2010 (predict-no)
  15238. --- END Decision Phase ---
  15239. --- Application Phase ---
  15240. --- Firing Productions (PE) For State At Depth 1 ---
  15241. --- Inner Elaboration Phase, active level 1 (S1) ---
  15242. Firing apply*operator
  15243. -->
  15244. (I3 ^predict-no N1005 + :O )
  15245. Firing apply*operator*complete
  15246. -->
  15247. (I3 ^predict-yes N1004 - :O )
  15248. inner elaboration loop at bottom goal.
  15249. --- Change Working Memory (PE) ---
  15250. =>WM: (14090: I3 ^predict-no N1005)
  15251. <=WM: (14077: N1004 ^status complete)
  15252. <=WM: (14076: I3 ^predict-yes N1004)
  15253. --- Firing Productions (IE) For State At Depth 1 ---
  15254. --- Inner Elaboration Phase, active level 1 (S1) ---
  15255. Firing monitor*world
  15256. -->
  15257. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15258. --- Change Working Memory (IE) ---
  15259. --- END Application Phase ---
  15260. --- Output Phase ---
  15261. ENV: Agent did: predict-no for direction R in state State-B
  15262. In State-B moving R
  15263. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15264. predict error 0
  15265. dir: dir isU
  15266. --- END Output Phase ---
  15267. /|--- Input Phase ---
  15268. =>WM: (14094: I2 ^dir U)
  15269. =>WM: (14093: I2 ^reward 1)
  15270. =>WM: (14092: I2 ^see 0)
  15271. =>WM: (14091: N1005 ^status complete)
  15272. <=WM: (14080: I2 ^dir R)
  15273. <=WM: (14079: I2 ^reward 1)
  15274. <=WM: (14078: I2 ^see 1)
  15275. =>WM: (14095: I2 ^level-1 R0-root)
  15276. <=WM: (14081: I2 ^level-1 R1-root)
  15277. --- END Input Phase ---
  15278. --- Proposal Phase ---
  15279. --- Inner Elaboration Phase, active level 1 (S1) ---
  15280. Firing elaborate*copy-see-to-output-link
  15281. -->
  15282. (I3 ^see 0 +)
  15283. Firing elaborate*reward*based*on*reward
  15284. -->
  15285. (R1009 ^value 1 +)
  15286. (R1 ^reward R1009 +)
  15287. Firing propose*predict-yes
  15288. -->
  15289. (O2011 ^name predict-yes +)
  15290. (S1 ^operator O2011 +)
  15291. Firing propose*predict-no
  15292. -->
  15293. (O2012 ^name predict-no +)
  15294. (S1 ^operator O2012 +)
  15295. Firing rl*prefer*rvt*predict-no*H0*4
  15296. -->
  15297. (S1 ^operator O2010 = 1.)
  15298. Firing rl*prefer*rvt*predict-yes*H0*3
  15299. -->
  15300. (S1 ^operator O2009 = 0.)
  15301. Firing prefer*rvt*predict-yes*H0
  15302. -->
  15303. Firing prefer*rvt*predict-no*H0
  15304. -->
  15305. Firing elaborate*copy-dir-to-output-link
  15306. -->
  15307. (I3 ^dir U +)
  15308. inner elaboration loop at bottom goal.
  15309. Retracting elaborate*copy-see-to-output-link
  15310. -->
  15311. (I3 ^see 1 +)
  15312. Retracting propose*predict-no
  15313. -->
  15314. (O2010 ^name predict-no +)
  15315. (S1 ^operator O2010 +)
  15316. Retracting propose*predict-yes
  15317. -->
  15318. (O2009 ^name predict-yes +)
  15319. (S1 ^operator O2009 +)
  15320. Retracting elaborate*reward*based*on*reward
  15321. -->
  15322. (R1008 ^value 1 +)
  15323. (R1 ^reward R1008 +)
  15324. Retracting elaborate*copy-dir-to-output-link
  15325. -->
  15326. (I3 ^dir R +)
  15327. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  15328. -->
  15329. (S1 ^operator O2010 = 0.7701760437619466)
  15330. Retracting rl*prefer*rvt*predict-no*H0*6
  15331. -->
  15332. (S1 ^operator O2010 = 0.229854902707684)
  15333. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15334. -->
  15335. (S1 ^operator O2009 = -0.252585164213872)
  15336. Retracting rl*prefer*rvt*predict-yes*H0*5
  15337. -->
  15338. (S1 ^operator O2009 = 0.2939645711914686)
  15339. =>WM: (14103: S1 ^operator O2012 +)
  15340. =>WM: (14102: S1 ^operator O2011 +)
  15341. =>WM: (14101: I3 ^dir U)
  15342. =>WM: (14100: O2012 ^name predict-no)
  15343. =>WM: (14099: O2011 ^name predict-yes)
  15344. =>WM: (14098: R1009 ^value 1)
  15345. =>WM: (14097: R1 ^reward R1009)
  15346. =>WM: (14096: I3 ^see 0)
  15347. <=WM: (14087: S1 ^operator O2009 +)
  15348. <=WM: (14088: S1 ^operator O2010 +)
  15349. <=WM: (14089: S1 ^operator O2010)
  15350. <=WM: (14072: I3 ^dir R)
  15351. <=WM: (14083: R1 ^reward R1008)
  15352. <=WM: (14082: I3 ^see 1)
  15353. <=WM: (14086: O2010 ^name predict-no)
  15354. <=WM: (14085: O2009 ^name predict-yes)
  15355. <=WM: (14084: R1008 ^value 1)
  15356. --- Inner Elaboration Phase, active level 1 (S1) ---
  15357. Firing prefer*rvt*predict-yes*H0
  15358. -->
  15359. Firing rl*prefer*rvt*predict-yes*H0*3
  15360. -->
  15361. (S1 ^operator O2011 = 0.)
  15362. Firing prefer*rvt*predict-no*H0
  15363. -->
  15364. Firing rl*prefer*rvt*predict-no*H0*4
  15365. -->
  15366. (S1 ^operator O2012 = 1.)
  15367. inner elaboration loop at bottom goal.
  15368. Retracting rl*prefer*rvt*predict-no*H0*4
  15369. -->
  15370. (S1 ^operator O2010 = 1.)
  15371. Retracting rl*prefer*rvt*predict-yes*H0*3
  15372. -->
  15373. (S1 ^operator O2009 = 0.)
  15374. --- END Proposal Phase ---
  15375. --- Decision Phase ---
  15376. RL update rl*prefer*rvt*predict-no*H0*6 0.611908 -0.382053 0.229855 -> 0.611906 -0.382053 0.229852(R,m,v=1,0.846591,0.130617)
  15377. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388117 0.382059 0.770176 -> 0.388115 0.382058 0.770173(R,m,v=1,1,0)
  15378. =>WM: (14104: S1 ^operator O2012)
  15379. 1006: O: O2012 (predict-no)
  15380. --- END Decision Phase ---
  15381. --- Application Phase ---
  15382. --- Firing Productions (PE) For State At Depth 1 ---
  15383. --- Inner Elaboration Phase, active level 1 (S1) ---
  15384. Firing apply*operator
  15385. -->
  15386. (I3 ^predict-no N1006 + :O )
  15387. Firing apply*operator*complete
  15388. -->
  15389. (I3 ^predict-no N1005 - :O )
  15390. inner elaboration loop at bottom goal.
  15391. --- Change Working Memory (PE) ---
  15392. =>WM: (14105: I3 ^predict-no N1006)
  15393. <=WM: (14091: N1005 ^status complete)
  15394. <=WM: (14090: I3 ^predict-no N1005)
  15395. --- Firing Productions (IE) For State At Depth 1 ---
  15396. --- Inner Elaboration Phase, active level 1 (S1) ---
  15397. Firing monitor*world
  15398. -->
  15399. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15400. --- Change Working Memory (IE) ---
  15401. --- END Application Phase ---
  15402. --- Output Phase ---
  15403. ENV: Agent did: predict-no for direction U in state State-B
  15404. In State-B moving U
  15405. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15406. predict error 0
  15407. dir: dir isR
  15408. --- END Output Phase ---
  15409. \-/--- Input Phase ---
  15410. =>WM: (14109: I2 ^dir R)
  15411. =>WM: (14108: I2 ^reward 1)
  15412. =>WM: (14107: I2 ^see 0)
  15413. =>WM: (14106: N1006 ^status complete)
  15414. <=WM: (14094: I2 ^dir U)
  15415. <=WM: (14093: I2 ^reward 1)
  15416. <=WM: (14092: I2 ^see 0)
  15417. =>WM: (14110: I2 ^level-1 R0-root)
  15418. <=WM: (14095: I2 ^level-1 R0-root)
  15419. --- END Input Phase ---
  15420. --- Proposal Phase ---
  15421. --- Inner Elaboration Phase, active level 1 (S1) ---
  15422. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15423. -->
  15424. (S1 ^operator O2011 = -0.1254042659579056)
  15425. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15426. -->
  15427. (S1 ^operator O2012 = 0.7700907188039023)
  15428. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15429. -->
  15430. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15431. -->
  15432. Firing elaborate*copy-see-to-output-link
  15433. -->
  15434. (I3 ^see 0 +)
  15435. Firing elaborate*reward*based*on*reward
  15436. -->
  15437. (R1010 ^value 1 +)
  15438. (R1 ^reward R1010 +)
  15439. Firing propose*predict-yes
  15440. -->
  15441. (O2013 ^name predict-yes +)
  15442. (S1 ^operator O2013 +)
  15443. Firing propose*predict-no
  15444. -->
  15445. (O2014 ^name predict-no +)
  15446. (S1 ^operator O2014 +)
  15447. Firing rl*prefer*rvt*predict-no*H0*6
  15448. -->
  15449. (S1 ^operator O2012 = 0.2298523950867538)
  15450. Firing rl*prefer*rvt*predict-yes*H0*5
  15451. -->
  15452. (S1 ^operator O2011 = 0.2939645711914686)
  15453. Firing prefer*rvt*predict-yes*H0
  15454. -->
  15455. Firing prefer*rvt*predict-no*H0
  15456. -->
  15457. Firing elaborate*copy-dir-to-output-link
  15458. -->
  15459. (I3 ^dir R +)
  15460. inner elaboration loop at bottom goal.
  15461. Retracting elaborate*copy-see-to-output-link
  15462. -->
  15463. (I3 ^see 0 +)
  15464. Retracting propose*predict-no
  15465. -->
  15466. (O2012 ^name predict-no +)
  15467. (S1 ^operator O2012 +)
  15468. Retracting propose*predict-yes
  15469. -->
  15470. (O2011 ^name predict-yes +)
  15471. (S1 ^operator O2011 +)
  15472. Retracting elaborate*reward*based*on*reward
  15473. -->
  15474. (R1009 ^value 1 +)
  15475. (R1 ^reward R1009 +)
  15476. Retracting elaborate*copy-dir-to-output-link
  15477. -->
  15478. (I3 ^dir U +)
  15479. Retracting rl*prefer*rvt*predict-no*H0*4
  15480. -->
  15481. (S1 ^operator O2012 = 1.)
  15482. Retracting rl*prefer*rvt*predict-yes*H0*3
  15483. -->
  15484. (S1 ^operator O2011 = 0.)
  15485. =>WM: (14117: S1 ^operator O2014 +)
  15486. =>WM: (14116: S1 ^operator O2013 +)
  15487. =>WM: (14115: I3 ^dir R)
  15488. =>WM: (14114: O2014 ^name predict-no)
  15489. =>WM: (14113: O2013 ^name predict-yes)
  15490. =>WM: (14112: R1010 ^value 1)
  15491. =>WM: (14111: R1 ^reward R1010)
  15492. <=WM: (14102: S1 ^operator O2011 +)
  15493. <=WM: (14103: S1 ^operator O2012 +)
  15494. <=WM: (14104: S1 ^operator O2012)
  15495. <=WM: (14101: I3 ^dir U)
  15496. <=WM: (14097: R1 ^reward R1009)
  15497. <=WM: (14100: O2012 ^name predict-no)
  15498. <=WM: (14099: O2011 ^name predict-yes)
  15499. <=WM: (14098: R1009 ^value 1)
  15500. --- Inner Elaboration Phase, active level 1 (S1) ---
  15501. Firing prefer*rvt*predict-yes*H0
  15502. -->
  15503. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15504. -->
  15505. (S1 ^operator O2013 = -0.1254042659579056)
  15506. Firing rl*prefer*rvt*predict-yes*H0*5
  15507. -->
  15508. (S1 ^operator O2013 = 0.2939645711914686)
  15509. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15510. -->
  15511. Firing prefer*rvt*predict-no*H0
  15512. -->
  15513. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15514. -->
  15515. (S1 ^operator O2014 = 0.7700907188039023)
  15516. Firing rl*prefer*rvt*predict-no*H0*6
  15517. -->
  15518. (S1 ^operator O2014 = 0.2298523950867538)
  15519. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15520. -->
  15521. inner elaboration loop at bottom goal.
  15522. Retracting rl*prefer*rvt*predict-no*H0*6
  15523. -->
  15524. (S1 ^operator O2012 = 0.2298523950867538)
  15525. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15526. -->
  15527. (S1 ^operator O2012 = 0.7700907188039023)
  15528. Retracting rl*prefer*rvt*predict-yes*H0*5
  15529. -->
  15530. (S1 ^operator O2011 = 0.2939645711914686)
  15531. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15532. -->
  15533. (S1 ^operator O2011 = -0.1254042659579056)
  15534. --- END Proposal Phase ---
  15535. --- Decision Phase ---
  15536. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15537. =>WM: (14118: S1 ^operator O2014)
  15538. 1007: O: O2014 (predict-no)
  15539. --- END Decision Phase ---
  15540. --- Application Phase ---
  15541. --- Firing Productions (PE) For State At Depth 1 ---
  15542. --- Inner Elaboration Phase, active level 1 (S1) ---
  15543. Firing apply*operator
  15544. -->
  15545. (I3 ^predict-no N1007 + :O )
  15546. Firing apply*operator*complete
  15547. -->
  15548. (I3 ^predict-no N1006 - :O )
  15549. inner elaboration loop at bottom goal.
  15550. --- Change Working Memory (PE) ---
  15551. =>WM: (14119: I3 ^predict-no N1007)
  15552. <=WM: (14106: N1006 ^status complete)
  15553. <=WM: (14105: I3 ^predict-no N1006)
  15554. --- Firing Productions (IE) For State At Depth 1 ---
  15555. --- Inner Elaboration Phase, active level 1 (S1) ---
  15556. Firing monitor*world
  15557. -->
  15558. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15559. --- Change Working Memory (IE) ---
  15560. --- END Application Phase ---
  15561. --- Output Phase ---
  15562. ENV: Agent did: predict-no for direction R in state State-B
  15563. In State-B moving R
  15564. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15565. predict error 0
  15566. dir: dir isR
  15567. --- END Output Phase ---
  15568. |\---- Input Phase ---
  15569. =>WM: (14123: I2 ^dir R)
  15570. =>WM: (14122: I2 ^reward 1)
  15571. =>WM: (14121: I2 ^see 0)
  15572. =>WM: (14120: N1007 ^status complete)
  15573. <=WM: (14109: I2 ^dir R)
  15574. <=WM: (14108: I2 ^reward 1)
  15575. <=WM: (14107: I2 ^see 0)
  15576. =>WM: (14124: I2 ^level-1 R0-root)
  15577. <=WM: (14110: I2 ^level-1 R0-root)
  15578. --- END Input Phase ---
  15579. --- Proposal Phase ---
  15580. --- Inner Elaboration Phase, active level 1 (S1) ---
  15581. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15582. -->
  15583. (S1 ^operator O2013 = -0.1254042659579056)
  15584. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15585. -->
  15586. (S1 ^operator O2014 = 0.7700907188039023)
  15587. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15588. -->
  15589. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15590. -->
  15591. Firing elaborate*copy-see-to-output-link
  15592. -->
  15593. (I3 ^see 0 +)
  15594. Firing elaborate*reward*based*on*reward
  15595. -->
  15596. (R1011 ^value 1 +)
  15597. (R1 ^reward R1011 +)
  15598. Firing propose*predict-yes
  15599. -->
  15600. (O2015 ^name predict-yes +)
  15601. (S1 ^operator O2015 +)
  15602. Firing propose*predict-no
  15603. -->
  15604. (O2016 ^name predict-no +)
  15605. (S1 ^operator O2016 +)
  15606. Firing rl*prefer*rvt*predict-no*H0*6
  15607. -->
  15608. (S1 ^operator O2014 = 0.2298523950867538)
  15609. Firing rl*prefer*rvt*predict-yes*H0*5
  15610. -->
  15611. (S1 ^operator O2013 = 0.2939645711914686)
  15612. Firing prefer*rvt*predict-yes*H0
  15613. -->
  15614. Firing prefer*rvt*predict-no*H0
  15615. -->
  15616. Firing elaborate*copy-dir-to-output-link
  15617. -->
  15618. (I3 ^dir R +)
  15619. inner elaboration loop at bottom goal.
  15620. Retracting elaborate*copy-see-to-output-link
  15621. -->
  15622. (I3 ^see 0 +)
  15623. Retracting propose*predict-no
  15624. -->
  15625. (O2014 ^name predict-no +)
  15626. (S1 ^operator O2014 +)
  15627. Retracting propose*predict-yes
  15628. -->
  15629. (O2013 ^name predict-yes +)
  15630. (S1 ^operator O2013 +)
  15631. Retracting elaborate*reward*based*on*reward
  15632. -->
  15633. (R1010 ^value 1 +)
  15634. (R1 ^reward R1010 +)
  15635. Retracting elaborate*copy-dir-to-output-link
  15636. -->
  15637. (I3 ^dir R +)
  15638. Retracting rl*prefer*rvt*predict-no*H0*6
  15639. -->
  15640. (S1 ^operator O2014 = 0.2298523950867538)
  15641. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15642. -->
  15643. (S1 ^operator O2014 = 0.7700907188039023)
  15644. Retracting rl*prefer*rvt*predict-yes*H0*5
  15645. -->
  15646. (S1 ^operator O2013 = 0.2939645711914686)
  15647. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15648. -->
  15649. (S1 ^operator O2013 = -0.1254042659579056)
  15650. =>WM: (14130: S1 ^operator O2016 +)
  15651. =>WM: (14129: S1 ^operator O2015 +)
  15652. =>WM: (14128: O2016 ^name predict-no)
  15653. =>WM: (14127: O2015 ^name predict-yes)
  15654. =>WM: (14126: R1011 ^value 1)
  15655. =>WM: (14125: R1 ^reward R1011)
  15656. <=WM: (14116: S1 ^operator O2013 +)
  15657. <=WM: (14117: S1 ^operator O2014 +)
  15658. <=WM: (14118: S1 ^operator O2014)
  15659. <=WM: (14111: R1 ^reward R1010)
  15660. <=WM: (14114: O2014 ^name predict-no)
  15661. <=WM: (14113: O2013 ^name predict-yes)
  15662. <=WM: (14112: R1010 ^value 1)
  15663. --- Inner Elaboration Phase, active level 1 (S1) ---
  15664. Firing prefer*rvt*predict-yes*H0
  15665. -->
  15666. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15667. -->
  15668. (S1 ^operator O2015 = -0.1254042659579056)
  15669. Firing rl*prefer*rvt*predict-yes*H0*5
  15670. -->
  15671. (S1 ^operator O2015 = 0.2939645711914686)
  15672. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15673. -->
  15674. Firing prefer*rvt*predict-no*H0
  15675. -->
  15676. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15677. -->
  15678. (S1 ^operator O2016 = 0.7700907188039023)
  15679. Firing rl*prefer*rvt*predict-no*H0*6
  15680. -->
  15681. (S1 ^operator O2016 = 0.2298523950867538)
  15682. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15683. -->
  15684. inner elaboration loop at bottom goal.
  15685. Retracting rl*prefer*rvt*predict-no*H0*6
  15686. -->
  15687. (S1 ^operator O2014 = 0.2298523950867538)
  15688. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15689. -->
  15690. (S1 ^operator O2014 = 0.7700907188039023)
  15691. Retracting rl*prefer*rvt*predict-yes*H0*5
  15692. -->
  15693. (S1 ^operator O2013 = 0.2939645711914686)
  15694. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15695. -->
  15696. (S1 ^operator O2013 = -0.1254042659579056)
  15697. --- END Proposal Phase ---
  15698. --- Decision Phase ---
  15699. RL update rl*prefer*rvt*predict-no*H0*6 0.611906 -0.382053 0.229852 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.847458,0.130008)
  15700. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388048 0.382043 0.770091 -> 0.388052 0.382044 0.770096(R,m,v=1,1,0)
  15701. =>WM: (14131: S1 ^operator O2016)
  15702. 1008: O: O2016 (predict-no)
  15703. --- END Decision Phase ---
  15704. --- Application Phase ---
  15705. --- Firing Productions (PE) For State At Depth 1 ---
  15706. --- Inner Elaboration Phase, active level 1 (S1) ---
  15707. Firing apply*operator
  15708. -->
  15709. (I3 ^predict-no N1008 + :O )
  15710. Firing apply*operator*complete
  15711. -->
  15712. (I3 ^predict-no N1007 - :O )
  15713. inner elaboration loop at bottom goal.
  15714. --- Change Working Memory (PE) ---
  15715. =>WM: (14132: I3 ^predict-no N1008)
  15716. <=WM: (14120: N1007 ^status complete)
  15717. <=WM: (14119: I3 ^predict-no N1007)
  15718. --- Firing Productions (IE) For State At Depth 1 ---
  15719. --- Inner Elaboration Phase, active level 1 (S1) ---
  15720. Firing monitor*world
  15721. -->
  15722. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15723. --- Change Working Memory (IE) ---
  15724. --- END Application Phase ---
  15725. --- Output Phase ---
  15726. ENV: Agent did: predict-no for direction R in state State-B
  15727. In State-B moving R
  15728. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15729. predict error 0
  15730. dir: dir isL
  15731. --- END Output Phase ---
  15732. /|\--- Input Phase ---
  15733. =>WM: (14136: I2 ^dir L)
  15734. =>WM: (14135: I2 ^reward 1)
  15735. =>WM: (14134: I2 ^see 0)
  15736. =>WM: (14133: N1008 ^status complete)
  15737. <=WM: (14123: I2 ^dir R)
  15738. <=WM: (14122: I2 ^reward 1)
  15739. <=WM: (14121: I2 ^see 0)
  15740. =>WM: (14137: I2 ^level-1 R0-root)
  15741. <=WM: (14124: I2 ^level-1 R0-root)
  15742. --- END Input Phase ---
  15743. --- Proposal Phase ---
  15744. --- Inner Elaboration Phase, active level 1 (S1) ---
  15745. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  15746. -->
  15747. (S1 ^operator O2015 = 0.6195686662736642)
  15748. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  15749. -->
  15750. (S1 ^operator O2016 = -0.2190661556260421)
  15751. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15752. -->
  15753. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15754. -->
  15755. Firing elaborate*copy-see-to-output-link
  15756. -->
  15757. (I3 ^see 0 +)
  15758. Firing elaborate*reward*based*on*reward
  15759. -->
  15760. (R1012 ^value 1 +)
  15761. (R1 ^reward R1012 +)
  15762. Firing propose*predict-yes
  15763. -->
  15764. (O2017 ^name predict-yes +)
  15765. (S1 ^operator O2017 +)
  15766. Firing propose*predict-no
  15767. -->
  15768. (O2018 ^name predict-no +)
  15769. (S1 ^operator O2018 +)
  15770. Firing rl*prefer*rvt*predict-no*H0*2
  15771. -->
  15772. (S1 ^operator O2016 = 0.3139979225569853)
  15773. Firing rl*prefer*rvt*predict-yes*H0*1
  15774. -->
  15775. (S1 ^operator O2015 = 0.380414370085626)
  15776. Firing prefer*rvt*predict-yes*H0
  15777. -->
  15778. Firing prefer*rvt*predict-no*H0
  15779. -->
  15780. Firing elaborate*copy-dir-to-output-link
  15781. -->
  15782. (I3 ^dir L +)
  15783. inner elaboration loop at bottom goal.
  15784. Retracting elaborate*copy-see-to-output-link
  15785. -->
  15786. (I3 ^see 0 +)
  15787. Retracting propose*predict-no
  15788. -->
  15789. (O2016 ^name predict-no +)
  15790. (S1 ^operator O2016 +)
  15791. Retracting propose*predict-yes
  15792. -->
  15793. (O2015 ^name predict-yes +)
  15794. (S1 ^operator O2015 +)
  15795. Retracting elaborate*reward*based*on*reward
  15796. -->
  15797. (R1011 ^value 1 +)
  15798. (R1 ^reward R1011 +)
  15799. Retracting elaborate*copy-dir-to-output-link
  15800. -->
  15801. (I3 ^dir R +)
  15802. Retracting rl*prefer*rvt*predict-no*H0*6
  15803. -->
  15804. (S1 ^operator O2016 = 0.229857000391985)
  15805. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  15806. -->
  15807. (S1 ^operator O2016 = 0.7700959914561893)
  15808. Retracting rl*prefer*rvt*predict-yes*H0*5
  15809. -->
  15810. (S1 ^operator O2015 = 0.2939645711914686)
  15811. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  15812. -->
  15813. (S1 ^operator O2015 = -0.1254042659579056)
  15814. =>WM: (14144: S1 ^operator O2018 +)
  15815. =>WM: (14143: S1 ^operator O2017 +)
  15816. =>WM: (14142: I3 ^dir L)
  15817. =>WM: (14141: O2018 ^name predict-no)
  15818. =>WM: (14140: O2017 ^name predict-yes)
  15819. =>WM: (14139: R1012 ^value 1)
  15820. =>WM: (14138: R1 ^reward R1012)
  15821. <=WM: (14129: S1 ^operator O2015 +)
  15822. <=WM: (14130: S1 ^operator O2016 +)
  15823. <=WM: (14131: S1 ^operator O2016)
  15824. <=WM: (14115: I3 ^dir R)
  15825. <=WM: (14125: R1 ^reward R1011)
  15826. <=WM: (14128: O2016 ^name predict-no)
  15827. <=WM: (14127: O2015 ^name predict-yes)
  15828. <=WM: (14126: R1011 ^value 1)
  15829. --- Inner Elaboration Phase, active level 1 (S1) ---
  15830. Firing prefer*rvt*predict-yes*H0
  15831. -->
  15832. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  15833. -->
  15834. (S1 ^operator O2017 = 0.6195686662736642)
  15835. Firing rl*prefer*rvt*predict-yes*H0*1
  15836. -->
  15837. (S1 ^operator O2017 = 0.380414370085626)
  15838. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15839. -->
  15840. Firing prefer*rvt*predict-no*H0
  15841. -->
  15842. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  15843. -->
  15844. (S1 ^operator O2018 = -0.2190661556260421)
  15845. Firing rl*prefer*rvt*predict-no*H0*2
  15846. -->
  15847. (S1 ^operator O2018 = 0.3139979225569853)
  15848. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15849. -->
  15850. inner elaboration loop at bottom goal.
  15851. Retracting rl*prefer*rvt*predict-no*H0*2
  15852. -->
  15853. (S1 ^operator O2016 = 0.3139979225569853)
  15854. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  15855. -->
  15856. (S1 ^operator O2016 = -0.2190661556260421)
  15857. Retracting rl*prefer*rvt*predict-yes*H0*1
  15858. -->
  15859. (S1 ^operator O2015 = 0.380414370085626)
  15860. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  15861. -->
  15862. (S1 ^operator O2015 = 0.6195686662736642)
  15863. --- END Proposal Phase ---
  15864. --- Decision Phase ---
  15865. RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.848315,0.129404)
  15866. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388052 0.382044 0.770096 -> 0.388056 0.382045 0.7701(R,m,v=1,1,0)
  15867. =>WM: (14145: S1 ^operator O2017)
  15868. 1009: O: O2017 (predict-yes)
  15869. --- END Decision Phase ---
  15870. --- Application Phase ---
  15871. --- Firing Productions (PE) For State At Depth 1 ---
  15872. --- Inner Elaboration Phase, active level 1 (S1) ---
  15873. Firing apply*operator
  15874. -->
  15875. (I3 ^predict-yes N1009 + :O )
  15876. Firing apply*operator*complete
  15877. -->
  15878. (I3 ^predict-no N1008 - :O )
  15879. inner elaboration loop at bottom goal.
  15880. --- Change Working Memory (PE) ---
  15881. =>WM: (14146: I3 ^predict-yes N1009)
  15882. <=WM: (14133: N1008 ^status complete)
  15883. <=WM: (14132: I3 ^predict-no N1008)
  15884. --- Firing Productions (IE) For State At Depth 1 ---
  15885. --- Inner Elaboration Phase, active level 1 (S1) ---
  15886. Firing monitor*world
  15887. -->
  15888. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15889. --- Change Working Memory (IE) ---
  15890. --- END Application Phase ---
  15891. --- Output Phase ---
  15892. ENV: Agent did: predict-yes for direction L in state State-B
  15893. In State-B moving L
  15894. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15895. predict error 0
  15896. dir: dir isR
  15897. --- END Output Phase ---
  15898. -/|--- Input Phase ---
  15899. =>WM: (14150: I2 ^dir R)
  15900. =>WM: (14149: I2 ^reward 1)
  15901. =>WM: (14148: I2 ^see 1)
  15902. =>WM: (14147: N1009 ^status complete)
  15903. <=WM: (14136: I2 ^dir L)
  15904. <=WM: (14135: I2 ^reward 1)
  15905. <=WM: (14134: I2 ^see 0)
  15906. =>WM: (14151: I2 ^level-1 L1-root)
  15907. <=WM: (14137: I2 ^level-1 R0-root)
  15908. --- END Input Phase ---
  15909. --- Proposal Phase ---
  15910. --- Inner Elaboration Phase, active level 1 (S1) ---
  15911. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  15912. -->
  15913. (S1 ^operator O2017 = 0.7062964705528377)
  15914. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  15915. -->
  15916. (S1 ^operator O2018 = -0.1937987592593187)
  15917. Firing prefer*rvt*predict-no*H0*6*v1*H1
  15918. -->
  15919. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15920. -->
  15921. Firing elaborate*copy-see-to-output-link
  15922. -->
  15923. (I3 ^see 1 +)
  15924. Firing elaborate*reward*based*on*reward
  15925. -->
  15926. (R1013 ^value 1 +)
  15927. (R1 ^reward R1013 +)
  15928. Firing propose*predict-yes
  15929. -->
  15930. (O2019 ^name predict-yes +)
  15931. (S1 ^operator O2019 +)
  15932. Firing propose*predict-no
  15933. -->
  15934. (O2020 ^name predict-no +)
  15935. (S1 ^operator O2020 +)
  15936. Firing rl*prefer*rvt*predict-no*H0*6
  15937. -->
  15938. (S1 ^operator O2018 = 0.2298608025432123)
  15939. Firing rl*prefer*rvt*predict-yes*H0*5
  15940. -->
  15941. (S1 ^operator O2017 = 0.2939645711914686)
  15942. Firing prefer*rvt*predict-yes*H0
  15943. -->
  15944. Firing prefer*rvt*predict-no*H0
  15945. -->
  15946. Firing elaborate*copy-dir-to-output-link
  15947. -->
  15948. (I3 ^dir R +)
  15949. inner elaboration loop at bottom goal.
  15950. Retracting elaborate*copy-see-to-output-link
  15951. -->
  15952. (I3 ^see 0 +)
  15953. Retracting propose*predict-no
  15954. -->
  15955. (O2018 ^name predict-no +)
  15956. (S1 ^operator O2018 +)
  15957. Retracting propose*predict-yes
  15958. -->
  15959. (O2017 ^name predict-yes +)
  15960. (S1 ^operator O2017 +)
  15961. Retracting elaborate*reward*based*on*reward
  15962. -->
  15963. (R1012 ^value 1 +)
  15964. (R1 ^reward R1012 +)
  15965. Retracting elaborate*copy-dir-to-output-link
  15966. -->
  15967. (I3 ^dir L +)
  15968. Retracting rl*prefer*rvt*predict-no*H0*2
  15969. -->
  15970. (S1 ^operator O2018 = 0.3139979225569853)
  15971. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  15972. -->
  15973. (S1 ^operator O2018 = -0.2190661556260421)
  15974. Retracting rl*prefer*rvt*predict-yes*H0*1
  15975. -->
  15976. (S1 ^operator O2017 = 0.380414370085626)
  15977. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  15978. -->
  15979. (S1 ^operator O2017 = 0.6195686662736642)
  15980. =>WM: (14159: S1 ^operator O2020 +)
  15981. =>WM: (14158: S1 ^operator O2019 +)
  15982. =>WM: (14157: I3 ^dir R)
  15983. =>WM: (14156: O2020 ^name predict-no)
  15984. =>WM: (14155: O2019 ^name predict-yes)
  15985. =>WM: (14154: R1013 ^value 1)
  15986. =>WM: (14153: R1 ^reward R1013)
  15987. =>WM: (14152: I3 ^see 1)
  15988. <=WM: (14143: S1 ^operator O2017 +)
  15989. <=WM: (14145: S1 ^operator O2017)
  15990. <=WM: (14144: S1 ^operator O2018 +)
  15991. <=WM: (14142: I3 ^dir L)
  15992. <=WM: (14138: R1 ^reward R1012)
  15993. <=WM: (14096: I3 ^see 0)
  15994. <=WM: (14141: O2018 ^name predict-no)
  15995. <=WM: (14140: O2017 ^name predict-yes)
  15996. <=WM: (14139: R1012 ^value 1)
  15997. --- Inner Elaboration Phase, active level 1 (S1) ---
  15998. Firing prefer*rvt*predict-yes*H0
  15999. -->
  16000. Firing rl*prefer*rvt*predict-yes*H0*5
  16001. -->
  16002. (S1 ^operator O2019 = 0.2939645711914686)
  16003. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16004. -->
  16005. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  16006. -->
  16007. (S1 ^operator O2019 = 0.7062964705528377)
  16008. Firing prefer*rvt*predict-no*H0
  16009. -->
  16010. Firing rl*prefer*rvt*predict-no*H0*6
  16011. -->
  16012. (S1 ^operator O2020 = 0.2298608025432123)
  16013. Firing prefer*rvt*predict-no*H0*6*v1*H1
  16014. -->
  16015. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  16016. -->
  16017. (S1 ^operator O2020 = -0.1937987592593187)
  16018. inner elaboration loop at bottom goal.
  16019. Retracting rl*prefer*rvt*predict-no*H0*6
  16020. -->
  16021. (S1 ^operator O2018 = 0.2298608025432123)
  16022. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  16023. -->
  16024. (S1 ^operator O2018 = -0.1937987592593187)
  16025. Retracting rl*prefer*rvt*predict-yes*H0*5
  16026. -->
  16027. (S1 ^operator O2017 = 0.2939645711914686)
  16028. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  16029. -->
  16030. (S1 ^operator O2017 = 0.7062964705528377)
  16031. --- END Proposal Phase ---
  16032. --- Decision Phase ---
  16033. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521346 -0.14093 0.380416(R,m,v=1,0.832335,0.140394)
  16034. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478637 0.140932 0.619569 -> 0.478639 0.140931 0.61957(R,m,v=1,1,0)
  16035. =>WM: (14160: S1 ^operator O2019)
  16036. 1010: O: O2019 (predict-yes)
  16037. --- END Decision Phase ---
  16038. --- Application Phase ---
  16039. --- Firing Productions (PE) For State At Depth 1 ---
  16040. --- Inner Elaboration Phase, active level 1 (S1) ---
  16041. Firing apply*operator
  16042. -->
  16043. (I3 ^predict-yes N1010 + :O )
  16044. Firing apply*operator*complete
  16045. -->
  16046. (I3 ^predict-yes N1009 - :O )
  16047. inner elaboration loop at bottom goal.
  16048. --- Change Working Memory (PE) ---
  16049. =>WM: (14161: I3 ^predict-yes N1010)
  16050. <=WM: (14147: N1009 ^status complete)
  16051. <=WM: (14146: I3 ^predict-yes N1009)
  16052. --- Firing Productions (IE) For State At Depth 1 ---
  16053. --- Inner Elaboration Phase, active level 1 (S1) ---
  16054. Firing monitor*world
  16055. -->
  16056. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16057. --- Change Working Memory (IE) ---
  16058. --- END Application Phase ---
  16059. --- Output Phase ---
  16060. ENV: Agent did: predict-yes for direction R in state State-A
  16061. In State-A moving R
  16062. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16063. predict error 0
  16064. dir: dir isL
  16065. --- END Output Phase ---
  16066. \-/--- Input Phase ---
  16067. =>WM: (14165: I2 ^dir L)
  16068. =>WM: (14164: I2 ^reward 1)
  16069. =>WM: (14163: I2 ^see 1)
  16070. =>WM: (14162: N1010 ^status complete)
  16071. <=WM: (14150: I2 ^dir R)
  16072. <=WM: (14149: I2 ^reward 1)
  16073. <=WM: (14148: I2 ^see 1)
  16074. =>WM: (14166: I2 ^level-1 R1-root)
  16075. <=WM: (14151: I2 ^level-1 L1-root)
  16076. --- END Input Phase ---
  16077. --- Proposal Phase ---
  16078. --- Inner Elaboration Phase, active level 1 (S1) ---
  16079. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  16080. -->
  16081. (S1 ^operator O2019 = 0.6196074987347102)
  16082. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  16083. -->
  16084. (S1 ^operator O2020 = -0.1479504104026684)
  16085. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16086. -->
  16087. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16088. -->
  16089. Firing elaborate*copy-see-to-output-link
  16090. -->
  16091. (I3 ^see 1 +)
  16092. Firing elaborate*reward*based*on*reward
  16093. -->
  16094. (R1014 ^value 1 +)
  16095. (R1 ^reward R1014 +)
  16096. Firing propose*predict-yes
  16097. -->
  16098. (O2021 ^name predict-yes +)
  16099. (S1 ^operator O2021 +)
  16100. Firing propose*predict-no
  16101. -->
  16102. (O2022 ^name predict-no +)
  16103. (S1 ^operator O2022 +)
  16104. Firing rl*prefer*rvt*predict-no*H0*2
  16105. -->
  16106. (S1 ^operator O2020 = 0.3139979225569853)
  16107. Firing rl*prefer*rvt*predict-yes*H0*1
  16108. -->
  16109. (S1 ^operator O2019 = 0.3804157564584494)
  16110. Firing prefer*rvt*predict-yes*H0
  16111. -->
  16112. Firing prefer*rvt*predict-no*H0
  16113. -->
  16114. Firing elaborate*copy-dir-to-output-link
  16115. -->
  16116. (I3 ^dir L +)
  16117. inner elaboration loop at bottom goal.
  16118. Retracting elaborate*copy-see-to-output-link
  16119. -->
  16120. (I3 ^see 1 +)
  16121. Retracting propose*predict-no
  16122. -->
  16123. (O2020 ^name predict-no +)
  16124. (S1 ^operator O2020 +)
  16125. Retracting propose*predict-yes
  16126. -->
  16127. (O2019 ^name predict-yes +)
  16128. (S1 ^operator O2019 +)
  16129. Retracting elaborate*reward*based*on*reward
  16130. -->
  16131. (R1013 ^value 1 +)
  16132. (R1 ^reward R1013 +)
  16133. Retracting elaborate*copy-dir-to-output-link
  16134. -->
  16135. (I3 ^dir R +)
  16136. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  16137. -->
  16138. (S1 ^operator O2020 = -0.1937987592593187)
  16139. Retracting rl*prefer*rvt*predict-no*H0*6
  16140. -->
  16141. (S1 ^operator O2020 = 0.2298608025432123)
  16142. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  16143. -->
  16144. (S1 ^operator O2019 = 0.7062964705528377)
  16145. Retracting rl*prefer*rvt*predict-yes*H0*5
  16146. -->
  16147. (S1 ^operator O2019 = 0.2939645711914686)
  16148. =>WM: (14173: S1 ^operator O2022 +)
  16149. =>WM: (14172: S1 ^operator O2021 +)
  16150. =>WM: (14171: I3 ^dir L)
  16151. =>WM: (14170: O2022 ^name predict-no)
  16152. =>WM: (14169: O2021 ^name predict-yes)
  16153. =>WM: (14168: R1014 ^value 1)
  16154. =>WM: (14167: R1 ^reward R1014)
  16155. <=WM: (14158: S1 ^operator O2019 +)
  16156. <=WM: (14160: S1 ^operator O2019)
  16157. <=WM: (14159: S1 ^operator O2020 +)
  16158. <=WM: (14157: I3 ^dir R)
  16159. <=WM: (14153: R1 ^reward R1013)
  16160. <=WM: (14156: O2020 ^name predict-no)
  16161. <=WM: (14155: O2019 ^name predict-yes)
  16162. <=WM: (14154: R1013 ^value 1)
  16163. --- Inner Elaboration Phase, active level 1 (S1) ---
  16164. Firing prefer*rvt*predict-yes*H0
  16165. -->
  16166. Firing rl*prefer*rvt*predict-yes*H0*1
  16167. -->
  16168. (S1 ^operator O2021 = 0.3804157564584494)
  16169. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16170. -->
  16171. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  16172. -->
  16173. (S1 ^operator O2021 = 0.6196074987347102)
  16174. Firing prefer*rvt*predict-no*H0
  16175. -->
  16176. Firing rl*prefer*rvt*predict-no*H0*2
  16177. -->
  16178. (S1 ^operator O2022 = 0.3139979225569853)
  16179. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16180. -->
  16181. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  16182. -->
  16183. (S1 ^operator O2022 = -0.1479504104026684)
  16184. inner elaboration loop at bottom goal.
  16185. Retracting rl*prefer*rvt*predict-no*H0*2
  16186. -->
  16187. (S1 ^operator O2020 = 0.3139979225569853)
  16188. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  16189. -->
  16190. (S1 ^operator O2020 = -0.1479504104026684)
  16191. Retracting rl*prefer*rvt*predict-yes*H0*1
  16192. -->
  16193. (S1 ^operator O2019 = 0.3804157564584494)
  16194. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  16195. -->
  16196. (S1 ^operator O2019 = 0.6196074987347102)
  16197. --- END Proposal Phase ---
  16198. --- Decision Phase ---
  16199. RL update rl*prefer*rvt*predict-yes*H0*5 0.501042 -0.207077 0.293965 -> 0.501022 -0.207079 0.293943(R,m,v=1,0.848101,0.129646)
  16200. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499194 0.207103 0.706296 -> 0.499171 0.2071 0.706271(R,m,v=1,1,0)
  16201. =>WM: (14174: S1 ^operator O2021)
  16202. 1011: O: O2021 (predict-yes)
  16203. --- END Decision Phase ---
  16204. --- Application Phase ---
  16205. --- Firing Productions (PE) For State At Depth 1 ---
  16206. --- Inner Elaboration Phase, active level 1 (S1) ---
  16207. Firing apply*operator
  16208. -->
  16209. (I3 ^predict-yes N1011 + :O )
  16210. Firing apply*operator*complete
  16211. -->
  16212. (I3 ^predict-yes N1010 - :O )
  16213. inner elaboration loop at bottom goal.
  16214. --- Change Working Memory (PE) ---
  16215. =>WM: (14175: I3 ^predict-yes N1011)
  16216. <=WM: (14162: N1010 ^status complete)
  16217. <=WM: (14161: I3 ^predict-yes N1010)
  16218. --- Firing Productions (IE) For State At Depth 1 ---
  16219. --- Inner Elaboration Phase, active level 1 (S1) ---
  16220. Firing monitor*world
  16221. -->
  16222. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16223. --- Change Working Memory (IE) ---
  16224. --- END Application Phase ---
  16225. --- Output Phase ---
  16226. ENV: Agent did: predict-yes for direction L in state State-B
  16227. In State-B moving L
  16228. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  16229. predict error 0
  16230. dir: dir isU
  16231. --- END Output Phase ---
  16232. |--- Input Phase ---
  16233. =>WM: (14179: I2 ^dir U)
  16234. =>WM: (14178: I2 ^reward 1)
  16235. =>WM: (14177: I2 ^see 1)
  16236. =>WM: (14176: N1011 ^status complete)
  16237. <=WM: (14165: I2 ^dir L)
  16238. <=WM: (14164: I2 ^reward 1)
  16239. <=WM: (14163: I2 ^see 1)
  16240. =>WM: (14180: I2 ^level-1 L1-root)
  16241. <=WM: (14166: I2 ^level-1 R1-root)
  16242. --- END Input Phase ---
  16243. --- Proposal Phase ---
  16244. --- Inner Elaboration Phase, active level 1 (S1) ---
  16245. Firing elaborate*copy-see-to-output-link
  16246. -->
  16247. (I3 ^see 1 +)
  16248. Firing elaborate*reward*based*on*reward
  16249. -->
  16250. (R1015 ^value 1 +)
  16251. (R1 ^reward R1015 +)
  16252. Firing propose*predict-yes
  16253. -->
  16254. (O2023 ^name predict-yes +)
  16255. (S1 ^operator O2023 +)
  16256. Firing propose*predict-no
  16257. -->
  16258. (O2024 ^name predict-no +)
  16259. (S1 ^operator O2024 +)
  16260. Firing rl*prefer*rvt*predict-no*H0*4
  16261. -->
  16262. (S1 ^operator O2022 = 1.)
  16263. Firing rl*prefer*rvt*predict-yes*H0*3
  16264. -->
  16265. (S1 ^operator O2021 = 0.)
  16266. Firing prefer*rvt*predict-yes*H0
  16267. -->
  16268. Firing prefer*rvt*predict-no*H0
  16269. -->
  16270. Firing elaborate*copy-dir-to-output-link
  16271. -->
  16272. (I3 ^dir U +)
  16273. inner elaboration loop at bottom goal.
  16274. Retracting elaborate*copy-see-to-output-link
  16275. -->
  16276. (I3 ^see 1 +)
  16277. Retracting propose*predict-no
  16278. -->
  16279. (O2022 ^name predict-no +)
  16280. (S1 ^operator O2022 +)
  16281. Retracting propose*predict-yes
  16282. -->
  16283. (O2021 ^name predict-yes +)
  16284. (S1 ^operator O2021 +)
  16285. Retracting elaborate*reward*based*on*reward
  16286. -->
  16287. (R1014 ^value 1 +)
  16288. (R1 ^reward R1014 +)
  16289. Retracting elaborate*copy-dir-to-output-link
  16290. -->
  16291. (I3 ^dir L +)
  16292. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  16293. -->
  16294. (S1 ^operator O2022 = -0.1479504104026684)
  16295. Retracting rl*prefer*rvt*predict-no*H0*2
  16296. -->
  16297. (S1 ^operator O2022 = 0.3139979225569853)
  16298. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  16299. -->
  16300. (S1 ^operator O2021 = 0.6196074987347102)
  16301. Retracting rl*prefer*rvt*predict-yes*H0*1
  16302. -->
  16303. (S1 ^operator O2021 = 0.3804157564584494)
  16304. =>WM: (14187: S1 ^operator O2024 +)
  16305. =>WM: (14186: S1 ^operator O2023 +)
  16306. =>WM: (14185: I3 ^dir U)
  16307. =>WM: (14184: O2024 ^name predict-no)
  16308. =>WM: (14183: O2023 ^name predict-yes)
  16309. =>WM: (14182: R1015 ^value 1)
  16310. =>WM: (14181: R1 ^reward R1015)
  16311. <=WM: (14172: S1 ^operator O2021 +)
  16312. <=WM: (14174: S1 ^operator O2021)
  16313. <=WM: (14173: S1 ^operator O2022 +)
  16314. <=WM: (14171: I3 ^dir L)
  16315. <=WM: (14167: R1 ^reward R1014)
  16316. <=WM: (14170: O2022 ^name predict-no)
  16317. <=WM: (14169: O2021 ^name predict-yes)
  16318. <=WM: (14168: R1014 ^value 1)
  16319. --- Inner Elaboration Phase, active level 1 (S1) ---
  16320. Firing prefer*rvt*predict-yes*H0
  16321. -->
  16322. Firing rl*prefer*rvt*predict-yes*H0*3
  16323. -->
  16324. (S1 ^operator O2023 = 0.)
  16325. Firing prefer*rvt*predict-no*H0
  16326. -->
  16327. Firing rl*prefer*rvt*predict-no*H0*4
  16328. -->
  16329. (S1 ^operator O2024 = 1.)
  16330. inner elaboration loop at bottom goal.
  16331. Retracting rl*prefer*rvt*predict-no*H0*4
  16332. -->
  16333. (S1 ^operator O2022 = 1.)
  16334. Retracting rl*prefer*rvt*predict-yes*H0*3
  16335. -->
  16336. (S1 ^operator O2021 = 0.)
  16337. --- END Proposal Phase ---
  16338. --- Decision Phase ---
  16339. RL update rl*prefer*rvt*predict-yes*H0*1 0.521346 -0.14093 0.380416 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.833333,0.139721)
  16340. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.47868 0.140928 0.619607 -> 0.478677 0.140928 0.619605(R,m,v=1,1,0)
  16341. =>WM: (14188: S1 ^operator O2024)
  16342. 1012: O: O2024 (predict-no)
  16343. --- END Decision Phase ---
  16344. --- Application Phase ---
  16345. --- Firing Productions (PE) For State At Depth 1 ---
  16346. --- Inner Elaboration Phase, active level 1 (S1) ---
  16347. Firing apply*operator
  16348. -->
  16349. (I3 ^predict-no N1012 + :O )
  16350. Firing apply*operator*complete
  16351. -->
  16352. (I3 ^predict-yes N1011 - :O )
  16353. inner elaboration loop at bottom goal.
  16354. --- Change Working Memory (PE) ---
  16355. =>WM: (14189: I3 ^predict-no N1012)
  16356. <=WM: (14176: N1011 ^status complete)
  16357. <=WM: (14175: I3 ^predict-yes N1011)
  16358. --- Firing Productions (IE) For State At Depth 1 ---
  16359. --- Inner Elaboration Phase, active level 1 (S1) ---
  16360. Firing monitor*world
  16361. -->
  16362. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16363. --- Change Working Memory (IE) ---
  16364. --- END Application Phase ---
  16365. --- Output Phase ---
  16366. ENV: Agent did: predict-no for direction U in state State-A
  16367. In State-A moving U
  16368. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16369. predict error 0
  16370. dir: dir isL
  16371. --- END Output Phase ---
  16372. \---- Input Phase ---
  16373. =>WM: (14193: I2 ^dir L)
  16374. =>WM: (14192: I2 ^reward 1)
  16375. =>WM: (14191: I2 ^see 0)
  16376. =>WM: (14190: N1012 ^status complete)
  16377. <=WM: (14179: I2 ^dir U)
  16378. <=WM: (14178: I2 ^reward 1)
  16379. <=WM: (14177: I2 ^see 1)
  16380. =>WM: (14194: I2 ^level-1 L1-root)
  16381. <=WM: (14180: I2 ^level-1 L1-root)
  16382. --- END Input Phase ---
  16383. --- Proposal Phase ---
  16384. --- Inner Elaboration Phase, active level 1 (S1) ---
  16385. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  16386. -->
  16387. (S1 ^operator O2023 = -0.3470159027404986)
  16388. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  16389. -->
  16390. (S1 ^operator O2024 = 0.68611525175106)
  16391. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16392. -->
  16393. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16394. -->
  16395. Firing elaborate*copy-see-to-output-link
  16396. -->
  16397. (I3 ^see 0 +)
  16398. Firing elaborate*reward*based*on*reward
  16399. -->
  16400. (R1016 ^value 1 +)
  16401. (R1 ^reward R1016 +)
  16402. Firing propose*predict-yes
  16403. -->
  16404. (O2025 ^name predict-yes +)
  16405. (S1 ^operator O2025 +)
  16406. Firing propose*predict-no
  16407. -->
  16408. (O2026 ^name predict-no +)
  16409. (S1 ^operator O2026 +)
  16410. Firing rl*prefer*rvt*predict-no*H0*2
  16411. -->
  16412. (S1 ^operator O2024 = 0.3139979225569853)
  16413. Firing rl*prefer*rvt*predict-yes*H0*1
  16414. -->
  16415. (S1 ^operator O2023 = 0.3804138577541756)
  16416. Firing prefer*rvt*predict-yes*H0
  16417. -->
  16418. Firing prefer*rvt*predict-no*H0
  16419. -->
  16420. Firing elaborate*copy-dir-to-output-link
  16421. -->
  16422. (I3 ^dir L +)
  16423. inner elaboration loop at bottom goal.
  16424. Retracting elaborate*copy-see-to-output-link
  16425. -->
  16426. (I3 ^see 1 +)
  16427. Retracting propose*predict-no
  16428. -->
  16429. (O2024 ^name predict-no +)
  16430. (S1 ^operator O2024 +)
  16431. Retracting propose*predict-yes
  16432. -->
  16433. (O2023 ^name predict-yes +)
  16434. (S1 ^operator O2023 +)
  16435. Retracting elaborate*reward*based*on*reward
  16436. -->
  16437. (R1015 ^value 1 +)
  16438. (R1 ^reward R1015 +)
  16439. Retracting elaborate*copy-dir-to-output-link
  16440. -->
  16441. (I3 ^dir U +)
  16442. Retracting rl*prefer*rvt*predict-no*H0*4
  16443. -->
  16444. (S1 ^operator O2024 = 1.)
  16445. Retracting rl*prefer*rvt*predict-yes*H0*3
  16446. -->
  16447. (S1 ^operator O2023 = 0.)
  16448. =>WM: (14202: S1 ^operator O2026 +)
  16449. =>WM: (14201: S1 ^operator O2025 +)
  16450. =>WM: (14200: I3 ^dir L)
  16451. =>WM: (14199: O2026 ^name predict-no)
  16452. =>WM: (14198: O2025 ^name predict-yes)
  16453. =>WM: (14197: R1016 ^value 1)
  16454. =>WM: (14196: R1 ^reward R1016)
  16455. =>WM: (14195: I3 ^see 0)
  16456. <=WM: (14186: S1 ^operator O2023 +)
  16457. <=WM: (14187: S1 ^operator O2024 +)
  16458. <=WM: (14188: S1 ^operator O2024)
  16459. <=WM: (14185: I3 ^dir U)
  16460. <=WM: (14181: R1 ^reward R1015)
  16461. <=WM: (14152: I3 ^see 1)
  16462. <=WM: (14184: O2024 ^name predict-no)
  16463. <=WM: (14183: O2023 ^name predict-yes)
  16464. <=WM: (14182: R1015 ^value 1)
  16465. --- Inner Elaboration Phase, active level 1 (S1) ---
  16466. Firing prefer*rvt*predict-yes*H0
  16467. -->
  16468. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  16469. -->
  16470. (S1 ^operator O2025 = -0.3470159027404986)
  16471. Firing rl*prefer*rvt*predict-yes*H0*1
  16472. -->
  16473. (S1 ^operator O2025 = 0.3804138577541756)
  16474. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16475. -->
  16476. Firing prefer*rvt*predict-no*H0
  16477. -->
  16478. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  16479. -->
  16480. (S1 ^operator O2026 = 0.68611525175106)
  16481. Firing rl*prefer*rvt*predict-no*H0*2
  16482. -->
  16483. (S1 ^operator O2026 = 0.3139979225569853)
  16484. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16485. -->
  16486. inner elaboration loop at bottom goal.
  16487. Retracting rl*prefer*rvt*predict-no*H0*2
  16488. -->
  16489. (S1 ^operator O2024 = 0.3139979225569853)
  16490. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  16491. -->
  16492. (S1 ^operator O2024 = 0.68611525175106)
  16493. Retracting rl*prefer*rvt*predict-yes*H0*1
  16494. -->
  16495. (S1 ^operator O2023 = 0.3804138577541756)
  16496. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  16497. -->
  16498. (S1 ^operator O2023 = -0.3470159027404986)
  16499. --- END Proposal Phase ---
  16500. --- Decision Phase ---
  16501. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16502. =>WM: (14203: S1 ^operator O2026)
  16503. 1013: O: O2026 (predict-no)
  16504. --- END Decision Phase ---
  16505. --- Application Phase ---
  16506. --- Firing Productions (PE) For State At Depth 1 ---
  16507. --- Inner Elaboration Phase, active level 1 (S1) ---
  16508. Firing apply*operator
  16509. -->
  16510. (I3 ^predict-no N1013 + :O )
  16511. Firing apply*operator*complete
  16512. -->
  16513. (I3 ^predict-no N1012 - :O )
  16514. inner elaboration loop at bottom goal.
  16515. --- Change Working Memory (PE) ---
  16516. =>WM: (14204: I3 ^predict-no N1013)
  16517. <=WM: (14190: N1012 ^status complete)
  16518. <=WM: (14189: I3 ^predict-no N1012)
  16519. --- Firing Productions (IE) For State At Depth 1 ---
  16520. --- Inner Elaboration Phase, active level 1 (S1) ---
  16521. Firing monitor*world
  16522. -->
  16523. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16524. --- Change Working Memory (IE) ---
  16525. --- END Application Phase ---
  16526. --- Output Phase ---
  16527. ENV: Agent did: predict-no for direction L in state State-A
  16528. In State-A moving L
  16529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16530. predict error 0
  16531. dir: dir isL
  16532. --- END Output Phase ---
  16533. /|\--- Input Phase ---
  16534. =>WM: (14208: I2 ^dir L)
  16535. =>WM: (14207: I2 ^reward 1)
  16536. =>WM: (14206: I2 ^see 0)
  16537. =>WM: (14205: N1013 ^status complete)
  16538. <=WM: (14193: I2 ^dir L)
  16539. <=WM: (14192: I2 ^reward 1)
  16540. <=WM: (14191: I2 ^see 0)
  16541. =>WM: (14209: I2 ^level-1 L0-root)
  16542. <=WM: (14194: I2 ^level-1 L1-root)
  16543. --- END Input Phase ---
  16544. --- Proposal Phase ---
  16545. --- Inner Elaboration Phase, active level 1 (S1) ---
  16546. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  16547. -->
  16548. (S1 ^operator O2025 = -0.3332708974800781)
  16549. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  16550. -->
  16551. (S1 ^operator O2026 = 0.6857730532944987)
  16552. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16553. -->
  16554. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16555. -->
  16556. Firing elaborate*copy-see-to-output-link
  16557. -->
  16558. (I3 ^see 0 +)
  16559. Firing elaborate*reward*based*on*reward
  16560. -->
  16561. (R1017 ^value 1 +)
  16562. (R1 ^reward R1017 +)
  16563. Firing propose*predict-yes
  16564. -->
  16565. (O2027 ^name predict-yes +)
  16566. (S1 ^operator O2027 +)
  16567. Firing propose*predict-no
  16568. -->
  16569. (O2028 ^name predict-no +)
  16570. (S1 ^operator O2028 +)
  16571. Firing rl*prefer*rvt*predict-no*H0*2
  16572. -->
  16573. (S1 ^operator O2026 = 0.3139979225569853)
  16574. Firing rl*prefer*rvt*predict-yes*H0*1
  16575. -->
  16576. (S1 ^operator O2025 = 0.3804138577541756)
  16577. Firing prefer*rvt*predict-yes*H0
  16578. -->
  16579. Firing prefer*rvt*predict-no*H0
  16580. -->
  16581. Firing elaborate*copy-dir-to-output-link
  16582. -->
  16583. (I3 ^dir L +)
  16584. inner elaboration loop at bottom goal.
  16585. Retracting elaborate*copy-see-to-output-link
  16586. -->
  16587. (I3 ^see 0 +)
  16588. Retracting propose*predict-no
  16589. -->
  16590. (O2026 ^name predict-no +)
  16591. (S1 ^operator O2026 +)
  16592. Retracting propose*predict-yes
  16593. -->
  16594. (O2025 ^name predict-yes +)
  16595. (S1 ^operator O2025 +)
  16596. Retracting elaborate*reward*based*on*reward
  16597. -->
  16598. (R1016 ^value 1 +)
  16599. (R1 ^reward R1016 +)
  16600. Retracting elaborate*copy-dir-to-output-link
  16601. -->
  16602. (I3 ^dir L +)
  16603. Retracting rl*prefer*rvt*predict-no*H0*2
  16604. -->
  16605. (S1 ^operator O2026 = 0.3139979225569853)
  16606. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  16607. -->
  16608. (S1 ^operator O2026 = 0.68611525175106)
  16609. Retracting rl*prefer*rvt*predict-yes*H0*1
  16610. -->
  16611. (S1 ^operator O2025 = 0.3804138577541756)
  16612. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  16613. -->
  16614. (S1 ^operator O2025 = -0.3470159027404986)
  16615. =>WM: (14215: S1 ^operator O2028 +)
  16616. =>WM: (14214: S1 ^operator O2027 +)
  16617. =>WM: (14213: O2028 ^name predict-no)
  16618. =>WM: (14212: O2027 ^name predict-yes)
  16619. =>WM: (14211: R1017 ^value 1)
  16620. =>WM: (14210: R1 ^reward R1017)
  16621. <=WM: (14201: S1 ^operator O2025 +)
  16622. <=WM: (14202: S1 ^operator O2026 +)
  16623. <=WM: (14203: S1 ^operator O2026)
  16624. <=WM: (14196: R1 ^reward R1016)
  16625. <=WM: (14199: O2026 ^name predict-no)
  16626. <=WM: (14198: O2025 ^name predict-yes)
  16627. <=WM: (14197: R1016 ^value 1)
  16628. --- Inner Elaboration Phase, active level 1 (S1) ---
  16629. Firing prefer*rvt*predict-yes*H0
  16630. -->
  16631. Firing rl*prefer*rvt*predict-yes*H0*1
  16632. -->
  16633. (S1 ^operator O2027 = 0.3804138577541756)
  16634. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16635. -->
  16636. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  16637. -->
  16638. (S1 ^operator O2027 = -0.3332708974800781)
  16639. Firing prefer*rvt*predict-no*H0
  16640. -->
  16641. Firing rl*prefer*rvt*predict-no*H0*2
  16642. -->
  16643. (S1 ^operator O2028 = 0.3139979225569853)
  16644. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16645. -->
  16646. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  16647. -->
  16648. (S1 ^operator O2028 = 0.6857730532944987)
  16649. inner elaboration loop at bottom goal.
  16650. Retracting rl*prefer*rvt*predict-no*H0*2
  16651. -->
  16652. (S1 ^operator O2026 = 0.3139979225569853)
  16653. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  16654. -->
  16655. (S1 ^operator O2026 = 0.6857730532944987)
  16656. Retracting rl*prefer*rvt*predict-yes*H0*1
  16657. -->
  16658. (S1 ^operator O2025 = 0.3804138577541756)
  16659. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  16660. -->
  16661. (S1 ^operator O2025 = -0.3332708974800781)
  16662. --- END Proposal Phase ---
  16663. --- Decision Phase ---
  16664. RL update rl*prefer*rvt*predict-no*H0*2 0.485013 -0.171015 0.313998 -> 0.485005 -0.171017 0.313989(R,m,v=1,0.862745,0.119195)
  16665. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515077 0.171039 0.686115 -> 0.515068 0.171036 0.686104(R,m,v=1,1,0)
  16666. =>WM: (14216: S1 ^operator O2028)
  16667. 1014: O: O2028 (predict-no)
  16668. --- END Decision Phase ---
  16669. --- Application Phase ---
  16670. --- Firing Productions (PE) For State At Depth 1 ---
  16671. --- Inner Elaboration Phase, active level 1 (S1) ---
  16672. Firing apply*operator
  16673. -->
  16674. (I3 ^predict-no N1014 + :O )
  16675. Firing apply*operator*complete
  16676. -->
  16677. (I3 ^predict-no N1013 - :O )
  16678. inner elaboration loop at bottom goal.
  16679. --- Change Working Memory (PE) ---
  16680. =>WM: (14217: I3 ^predict-no N1014)
  16681. <=WM: (14205: N1013 ^status complete)
  16682. <=WM: (14204: I3 ^predict-no N1013)
  16683. --- Firing Productions (IE) For State At Depth 1 ---
  16684. --- Inner Elaboration Phase, active level 1 (S1) ---
  16685. Firing monitor*world
  16686. -->
  16687. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16688. --- Change Working Memory (IE) ---
  16689. --- END Application Phase ---
  16690. --- Output Phase ---
  16691. ENV: Agent did: predict-no for direction L in state State-A
  16692. In State-A moving L
  16693. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16694. predict error 0
  16695. dir: dir isR
  16696. --- END Output Phase ---
  16697. -/|--- Input Phase ---
  16698. =>WM: (14221: I2 ^dir R)
  16699. =>WM: (14220: I2 ^reward 1)
  16700. =>WM: (14219: I2 ^see 0)
  16701. =>WM: (14218: N1014 ^status complete)
  16702. <=WM: (14208: I2 ^dir L)
  16703. <=WM: (14207: I2 ^reward 1)
  16704. <=WM: (14206: I2 ^see 0)
  16705. =>WM: (14222: I2 ^level-1 L0-root)
  16706. <=WM: (14209: I2 ^level-1 L0-root)
  16707. --- END Input Phase ---
  16708. --- Proposal Phase ---
  16709. --- Inner Elaboration Phase, active level 1 (S1) ---
  16710. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  16711. -->
  16712. (S1 ^operator O2027 = 0.7056154385005245)
  16713. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  16714. -->
  16715. (S1 ^operator O2028 = -0.2023211881870005)
  16716. Firing prefer*rvt*predict-no*H0*6*v1*H1
  16717. -->
  16718. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16719. -->
  16720. Firing elaborate*copy-see-to-output-link
  16721. -->
  16722. (I3 ^see 0 +)
  16723. Firing elaborate*reward*based*on*reward
  16724. -->
  16725. (R1018 ^value 1 +)
  16726. (R1 ^reward R1018 +)
  16727. Firing propose*predict-yes
  16728. -->
  16729. (O2029 ^name predict-yes +)
  16730. (S1 ^operator O2029 +)
  16731. Firing propose*predict-no
  16732. -->
  16733. (O2030 ^name predict-no +)
  16734. (S1 ^operator O2030 +)
  16735. Firing rl*prefer*rvt*predict-no*H0*6
  16736. -->
  16737. (S1 ^operator O2028 = 0.2298608025432123)
  16738. Firing rl*prefer*rvt*predict-yes*H0*5
  16739. -->
  16740. (S1 ^operator O2027 = 0.2939430423129205)
  16741. Firing prefer*rvt*predict-yes*H0
  16742. -->
  16743. Firing prefer*rvt*predict-no*H0
  16744. -->
  16745. Firing elaborate*copy-dir-to-output-link
  16746. -->
  16747. (I3 ^dir R +)
  16748. inner elaboration loop at bottom goal.
  16749. Retracting elaborate*copy-see-to-output-link
  16750. -->
  16751. (I3 ^see 0 +)
  16752. Retracting propose*predict-no
  16753. -->
  16754. (O2028 ^name predict-no +)
  16755. (S1 ^operator O2028 +)
  16756. Retracting propose*predict-yes
  16757. -->
  16758. (O2027 ^name predict-yes +)
  16759. (S1 ^operator O2027 +)
  16760. Retracting elaborate*reward*based*on*reward
  16761. -->
  16762. (R1017 ^value 1 +)
  16763. (R1 ^reward R1017 +)
  16764. Retracting elaborate*copy-dir-to-output-link
  16765. -->
  16766. (I3 ^dir L +)
  16767. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  16768. -->
  16769. (S1 ^operator O2028 = 0.6857730532944987)
  16770. Retracting rl*prefer*rvt*predict-no*H0*2
  16771. -->
  16772. (S1 ^operator O2028 = 0.3139885389674749)
  16773. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  16774. -->
  16775. (S1 ^operator O2027 = -0.3332708974800781)
  16776. Retracting rl*prefer*rvt*predict-yes*H0*1
  16777. -->
  16778. (S1 ^operator O2027 = 0.3804138577541756)
  16779. =>WM: (14229: S1 ^operator O2030 +)
  16780. =>WM: (14228: S1 ^operator O2029 +)
  16781. =>WM: (14227: I3 ^dir R)
  16782. =>WM: (14226: O2030 ^name predict-no)
  16783. =>WM: (14225: O2029 ^name predict-yes)
  16784. =>WM: (14224: R1018 ^value 1)
  16785. =>WM: (14223: R1 ^reward R1018)
  16786. <=WM: (14214: S1 ^operator O2027 +)
  16787. <=WM: (14215: S1 ^operator O2028 +)
  16788. <=WM: (14216: S1 ^operator O2028)
  16789. <=WM: (14200: I3 ^dir L)
  16790. <=WM: (14210: R1 ^reward R1017)
  16791. <=WM: (14213: O2028 ^name predict-no)
  16792. <=WM: (14212: O2027 ^name predict-yes)
  16793. <=WM: (14211: R1017 ^value 1)
  16794. --- Inner Elaboration Phase, active level 1 (S1) ---
  16795. Firing prefer*rvt*predict-yes*H0
  16796. -->
  16797. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  16798. -->
  16799. (S1 ^operator O2029 = 0.7056154385005245)
  16800. Firing rl*prefer*rvt*predict-yes*H0*5
  16801. -->
  16802. (S1 ^operator O2029 = 0.2939430423129205)
  16803. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16804. -->
  16805. Firing prefer*rvt*predict-no*H0
  16806. -->
  16807. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  16808. -->
  16809. (S1 ^operator O2030 = -0.2023211881870005)
  16810. Firing rl*prefer*rvt*predict-no*H0*6
  16811. -->
  16812. (S1 ^operator O2030 = 0.2298608025432123)
  16813. Firing prefer*rvt*predict-no*H0*6*v1*H1
  16814. -->
  16815. inner elaboration loop at bottom goal.
  16816. Retracting rl*prefer*rvt*predict-no*H0*6
  16817. -->
  16818. (S1 ^operator O2028 = 0.2298608025432123)
  16819. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  16820. -->
  16821. (S1 ^operator O2028 = -0.2023211881870005)
  16822. Retracting rl*prefer*rvt*predict-yes*H0*5
  16823. -->
  16824. (S1 ^operator O2027 = 0.2939430423129205)
  16825. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  16826. -->
  16827. (S1 ^operator O2027 = 0.7056154385005245)
  16828. --- END Proposal Phase ---
  16829. --- Decision Phase ---
  16830. RL update rl*prefer*rvt*predict-no*H0*2 0.485005 -0.171017 0.313989 -> 0.485021 -0.171013 0.314008(R,m,v=1,0.863636,0.118538)
  16831. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514806 0.170967 0.685773 -> 0.514825 0.170972 0.685796(R,m,v=1,1,0)
  16832. =>WM: (14230: S1 ^operator O2029)
  16833. 1015: O: O2029 (predict-yes)
  16834. --- END Decision Phase ---
  16835. --- Application Phase ---
  16836. --- Firing Productions (PE) For State At Depth 1 ---
  16837. --- Inner Elaboration Phase, active level 1 (S1) ---
  16838. Firing apply*operator
  16839. -->
  16840. (I3 ^predict-yes N1015 + :O )
  16841. Firing apply*operator*complete
  16842. -->
  16843. (I3 ^predict-no N1014 - :O )
  16844. inner elaboration loop at bottom goal.
  16845. --- Change Working Memory (PE) ---
  16846. =>WM: (14231: I3 ^predict-yes N1015)
  16847. <=WM: (14218: N1014 ^status complete)
  16848. <=WM: (14217: I3 ^predict-no N1014)
  16849. --- Firing Productions (IE) For State At Depth 1 ---
  16850. --- Inner Elaboration Phase, active level 1 (S1) ---
  16851. Firing monitor*world
  16852. -->
  16853. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16854. --- Change Working Memory (IE) ---
  16855. --- END Application Phase ---
  16856. --- Output Phase ---
  16857. ENV: Agent did: predict-yes for direction R in state State-A
  16858. In State-A moving R
  16859. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16860. predict error 0
  16861. dir: dir isR
  16862. --- END Output Phase ---
  16863. \-/--- Input Phase ---
  16864. =>WM: (14235: I2 ^dir R)
  16865. =>WM: (14234: I2 ^reward 1)
  16866. =>WM: (14233: I2 ^see 1)
  16867. =>WM: (14232: N1015 ^status complete)
  16868. <=WM: (14221: I2 ^dir R)
  16869. <=WM: (14220: I2 ^reward 1)
  16870. <=WM: (14219: I2 ^see 0)
  16871. =>WM: (14236: I2 ^level-1 R1-root)
  16872. <=WM: (14222: I2 ^level-1 L0-root)
  16873. --- END Input Phase ---
  16874. --- Proposal Phase ---
  16875. --- Inner Elaboration Phase, active level 1 (S1) ---
  16876. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16877. -->
  16878. (S1 ^operator O2029 = -0.252585164213872)
  16879. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  16880. -->
  16881. (S1 ^operator O2030 = 0.7701730258510331)
  16882. Firing prefer*rvt*predict-no*H0*6*v1*H1
  16883. -->
  16884. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16885. -->
  16886. Firing elaborate*copy-see-to-output-link
  16887. -->
  16888. (I3 ^see 1 +)
  16889. Firing elaborate*reward*based*on*reward
  16890. -->
  16891. (R1019 ^value 1 +)
  16892. (R1 ^reward R1019 +)
  16893. Firing propose*predict-yes
  16894. -->
  16895. (O2031 ^name predict-yes +)
  16896. (S1 ^operator O2031 +)
  16897. Firing propose*predict-no
  16898. -->
  16899. (O2032 ^name predict-no +)
  16900. (S1 ^operator O2032 +)
  16901. Firing rl*prefer*rvt*predict-no*H0*6
  16902. -->
  16903. (S1 ^operator O2030 = 0.2298608025432123)
  16904. Firing rl*prefer*rvt*predict-yes*H0*5
  16905. -->
  16906. (S1 ^operator O2029 = 0.2939430423129205)
  16907. Firing prefer*rvt*predict-yes*H0
  16908. -->
  16909. Firing prefer*rvt*predict-no*H0
  16910. -->
  16911. Firing elaborate*copy-dir-to-output-link
  16912. -->
  16913. (I3 ^dir R +)
  16914. inner elaboration loop at bottom goal.
  16915. Retracting elaborate*copy-see-to-output-link
  16916. -->
  16917. (I3 ^see 0 +)
  16918. Retracting propose*predict-no
  16919. -->
  16920. (O2030 ^name predict-no +)
  16921. (S1 ^operator O2030 +)
  16922. Retracting propose*predict-yes
  16923. -->
  16924. (O2029 ^name predict-yes +)
  16925. (S1 ^operator O2029 +)
  16926. Retracting elaborate*reward*based*on*reward
  16927. -->
  16928. (R1018 ^value 1 +)
  16929. (R1 ^reward R1018 +)
  16930. Retracting elaborate*copy-dir-to-output-link
  16931. -->
  16932. (I3 ^dir R +)
  16933. Retracting rl*prefer*rvt*predict-no*H0*6
  16934. -->
  16935. (S1 ^operator O2030 = 0.2298608025432123)
  16936. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  16937. -->
  16938. (S1 ^operator O2030 = -0.2023211881870005)
  16939. Retracting rl*prefer*rvt*predict-yes*H0*5
  16940. -->
  16941. (S1 ^operator O2029 = 0.2939430423129205)
  16942. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  16943. -->
  16944. (S1 ^operator O2029 = 0.7056154385005245)
  16945. =>WM: (14243: S1 ^operator O2032 +)
  16946. =>WM: (14242: S1 ^operator O2031 +)
  16947. =>WM: (14241: O2032 ^name predict-no)
  16948. =>WM: (14240: O2031 ^name predict-yes)
  16949. =>WM: (14239: R1019 ^value 1)
  16950. =>WM: (14238: R1 ^reward R1019)
  16951. =>WM: (14237: I3 ^see 1)
  16952. <=WM: (14228: S1 ^operator O2029 +)
  16953. <=WM: (14230: S1 ^operator O2029)
  16954. <=WM: (14229: S1 ^operator O2030 +)
  16955. <=WM: (14223: R1 ^reward R1018)
  16956. <=WM: (14195: I3 ^see 0)
  16957. <=WM: (14226: O2030 ^name predict-no)
  16958. <=WM: (14225: O2029 ^name predict-yes)
  16959. <=WM: (14224: R1018 ^value 1)
  16960. --- Inner Elaboration Phase, active level 1 (S1) ---
  16961. Firing prefer*rvt*predict-yes*H0
  16962. -->
  16963. Firing rl*prefer*rvt*predict-yes*H0*5
  16964. -->
  16965. (S1 ^operator O2031 = 0.2939430423129205)
  16966. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16967. -->
  16968. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16969. -->
  16970. (S1 ^operator O2031 = -0.252585164213872)
  16971. Firing prefer*rvt*predict-no*H0
  16972. -->
  16973. Firing rl*prefer*rvt*predict-no*H0*6
  16974. -->
  16975. (S1 ^operator O2032 = 0.2298608025432123)
  16976. Firing prefer*rvt*predict-no*H0*6*v1*H1
  16977. -->
  16978. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  16979. -->
  16980. (S1 ^operator O2032 = 0.7701730258510331)
  16981. inner elaboration loop at bottom goal.
  16982. Retracting rl*prefer*rvt*predict-no*H0*6
  16983. -->
  16984. (S1 ^operator O2030 = 0.2298608025432123)
  16985. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  16986. -->
  16987. (S1 ^operator O2030 = 0.7701730258510331)
  16988. Retracting rl*prefer*rvt*predict-yes*H0*5
  16989. -->
  16990. (S1 ^operator O2029 = 0.2939430423129205)
  16991. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16992. -->
  16993. (S1 ^operator O2029 = -0.252585164213872)
  16994. --- END Proposal Phase ---
  16995. --- Decision Phase ---
  16996. RL update rl*prefer*rvt*predict-yes*H0*5 0.501022 -0.207079 0.293943 -> 0.501055 -0.207076 0.293979(R,m,v=1,0.849057,0.128971)
  16997. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498578 0.207037 0.705615 -> 0.498617 0.207041 0.705659(R,m,v=1,1,0)
  16998. =>WM: (14244: S1 ^operator O2032)
  16999. 1016: O: O2032 (predict-no)
  17000. --- END Decision Phase ---
  17001. --- Application Phase ---
  17002. --- Firing Productions (PE) For State At Depth 1 ---
  17003. --- Inner Elaboration Phase, active level 1 (S1) ---
  17004. Firing apply*operator
  17005. -->
  17006. (I3 ^predict-no N1016 + :O )
  17007. Firing apply*operator*complete
  17008. -->
  17009. (I3 ^predict-yes N1015 - :O )
  17010. inner elaboration loop at bottom goal.
  17011. --- Change Working Memory (PE) ---
  17012. =>WM: (14245: I3 ^predict-no N1016)
  17013. <=WM: (14232: N1015 ^status complete)
  17014. <=WM: (14231: I3 ^predict-yes N1015)
  17015. --- Firing Productions (IE) For State At Depth 1 ---
  17016. --- Inner Elaboration Phase, active level 1 (S1) ---
  17017. Firing monitor*world
  17018. -->
  17019. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17020. --- Change Working Memory (IE) ---
  17021. --- END Application Phase ---
  17022. --- Output Phase ---
  17023. ENV: Agent did: predict-no for direction R in state State-B
  17024. In State-B moving R
  17025. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17026. predict error 0
  17027. dir: dir isR
  17028. --- END Output Phase ---
  17029. |\---- Input Phase ---
  17030. =>WM: (14249: I2 ^dir R)
  17031. =>WM: (14248: I2 ^reward 1)
  17032. =>WM: (14247: I2 ^see 0)
  17033. =>WM: (14246: N1016 ^status complete)
  17034. <=WM: (14235: I2 ^dir R)
  17035. <=WM: (14234: I2 ^reward 1)
  17036. <=WM: (14233: I2 ^see 1)
  17037. =>WM: (14250: I2 ^level-1 R0-root)
  17038. <=WM: (14236: I2 ^level-1 R1-root)
  17039. --- END Input Phase ---
  17040. --- Proposal Phase ---
  17041. --- Inner Elaboration Phase, active level 1 (S1) ---
  17042. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17043. -->
  17044. (S1 ^operator O2031 = -0.1254042659579056)
  17045. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17046. -->
  17047. (S1 ^operator O2032 = 0.7701003386536001)
  17048. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17049. -->
  17050. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17051. -->
  17052. Firing elaborate*copy-see-to-output-link
  17053. -->
  17054. (I3 ^see 0 +)
  17055. Firing elaborate*reward*based*on*reward
  17056. -->
  17057. (R1020 ^value 1 +)
  17058. (R1 ^reward R1020 +)
  17059. Firing propose*predict-yes
  17060. -->
  17061. (O2033 ^name predict-yes +)
  17062. (S1 ^operator O2033 +)
  17063. Firing propose*predict-no
  17064. -->
  17065. (O2034 ^name predict-no +)
  17066. (S1 ^operator O2034 +)
  17067. Firing rl*prefer*rvt*predict-no*H0*6
  17068. -->
  17069. (S1 ^operator O2032 = 0.2298608025432123)
  17070. Firing rl*prefer*rvt*predict-yes*H0*5
  17071. -->
  17072. (S1 ^operator O2031 = 0.2939794178406799)
  17073. Firing prefer*rvt*predict-yes*H0
  17074. -->
  17075. Firing prefer*rvt*predict-no*H0
  17076. -->
  17077. Firing elaborate*copy-dir-to-output-link
  17078. -->
  17079. (I3 ^dir R +)
  17080. inner elaboration loop at bottom goal.
  17081. Retracting elaborate*copy-see-to-output-link
  17082. -->
  17083. (I3 ^see 1 +)
  17084. Retracting propose*predict-no
  17085. -->
  17086. (O2032 ^name predict-no +)
  17087. (S1 ^operator O2032 +)
  17088. Retracting propose*predict-yes
  17089. -->
  17090. (O2031 ^name predict-yes +)
  17091. (S1 ^operator O2031 +)
  17092. Retracting elaborate*reward*based*on*reward
  17093. -->
  17094. (R1019 ^value 1 +)
  17095. (R1 ^reward R1019 +)
  17096. Retracting elaborate*copy-dir-to-output-link
  17097. -->
  17098. (I3 ^dir R +)
  17099. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  17100. -->
  17101. (S1 ^operator O2032 = 0.7701730258510331)
  17102. Retracting rl*prefer*rvt*predict-no*H0*6
  17103. -->
  17104. (S1 ^operator O2032 = 0.2298608025432123)
  17105. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  17106. -->
  17107. (S1 ^operator O2031 = -0.252585164213872)
  17108. Retracting rl*prefer*rvt*predict-yes*H0*5
  17109. -->
  17110. (S1 ^operator O2031 = 0.2939794178406799)
  17111. =>WM: (14257: S1 ^operator O2034 +)
  17112. =>WM: (14256: S1 ^operator O2033 +)
  17113. =>WM: (14255: O2034 ^name predict-no)
  17114. =>WM: (14254: O2033 ^name predict-yes)
  17115. =>WM: (14253: R1020 ^value 1)
  17116. =>WM: (14252: R1 ^reward R1020)
  17117. =>WM: (14251: I3 ^see 0)
  17118. <=WM: (14242: S1 ^operator O2031 +)
  17119. <=WM: (14243: S1 ^operator O2032 +)
  17120. <=WM: (14244: S1 ^operator O2032)
  17121. <=WM: (14238: R1 ^reward R1019)
  17122. <=WM: (14237: I3 ^see 1)
  17123. <=WM: (14241: O2032 ^name predict-no)
  17124. <=WM: (14240: O2031 ^name predict-yes)
  17125. <=WM: (14239: R1019 ^value 1)
  17126. --- Inner Elaboration Phase, active level 1 (S1) ---
  17127. Firing prefer*rvt*predict-yes*H0
  17128. -->
  17129. Firing rl*prefer*rvt*predict-yes*H0*5
  17130. -->
  17131. (S1 ^operator O2033 = 0.2939794178406799)
  17132. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17133. -->
  17134. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17135. -->
  17136. (S1 ^operator O2033 = -0.1254042659579056)
  17137. Firing prefer*rvt*predict-no*H0
  17138. -->
  17139. Firing rl*prefer*rvt*predict-no*H0*6
  17140. -->
  17141. (S1 ^operator O2034 = 0.2298608025432123)
  17142. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17143. -->
  17144. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17145. -->
  17146. (S1 ^operator O2034 = 0.7701003386536001)
  17147. inner elaboration loop at bottom goal.
  17148. Retracting rl*prefer*rvt*predict-no*H0*6
  17149. -->
  17150. (S1 ^operator O2032 = 0.2298608025432123)
  17151. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17152. -->
  17153. (S1 ^operator O2032 = 0.7701003386536001)
  17154. Retracting rl*prefer*rvt*predict-yes*H0*5
  17155. -->
  17156. (S1 ^operator O2031 = 0.2939794178406799)
  17157. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17158. -->
  17159. (S1 ^operator O2031 = -0.1254042659579056)
  17160. --- END Proposal Phase ---
  17161. --- Decision Phase ---
  17162. RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611911 -0.382052 0.229858(R,m,v=1,0.849162,0.128805)
  17163. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388115 0.382058 0.770173 -> 0.388112 0.382058 0.77017(R,m,v=1,1,0)
  17164. =>WM: (14258: S1 ^operator O2034)
  17165. 1017: O: O2034 (predict-no)
  17166. --- END Decision Phase ---
  17167. --- Application Phase ---
  17168. --- Firing Productions (PE) For State At Depth 1 ---
  17169. --- Inner Elaboration Phase, active level 1 (S1) ---
  17170. Firing apply*operator
  17171. -->
  17172. (I3 ^predict-no N1017 + :O )
  17173. Firing apply*operator*complete
  17174. -->
  17175. (I3 ^predict-no N1016 - :O )
  17176. inner elaboration loop at bottom goal.
  17177. --- Change Working Memory (PE) ---
  17178. =>WM: (14259: I3 ^predict-no N1017)
  17179. <=WM: (14246: N1016 ^status complete)
  17180. <=WM: (14245: I3 ^predict-no N1016)
  17181. --- Firing Productions (IE) For State At Depth 1 ---
  17182. --- Inner Elaboration Phase, active level 1 (S1) ---
  17183. Firing monitor*world
  17184. -->
  17185. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17186. --- Change Working Memory (IE) ---
  17187. --- END Application Phase ---
  17188. --- Output Phase ---
  17189. ENV: Agent did: predict-no for direction R in state State-B
  17190. In State-B moving R
  17191. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17192. predict error 0
  17193. dir: dir isR
  17194. --- END Output Phase ---
  17195. /|\--- Input Phase ---
  17196. =>WM: (14263: I2 ^dir R)
  17197. =>WM: (14262: I2 ^reward 1)
  17198. =>WM: (14261: I2 ^see 0)
  17199. =>WM: (14260: N1017 ^status complete)
  17200. <=WM: (14249: I2 ^dir R)
  17201. <=WM: (14248: I2 ^reward 1)
  17202. <=WM: (14247: I2 ^see 0)
  17203. =>WM: (14264: I2 ^level-1 R0-root)
  17204. <=WM: (14250: I2 ^level-1 R0-root)
  17205. --- END Input Phase ---
  17206. --- Proposal Phase ---
  17207. --- Inner Elaboration Phase, active level 1 (S1) ---
  17208. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17209. -->
  17210. (S1 ^operator O2033 = -0.1254042659579056)
  17211. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17212. -->
  17213. (S1 ^operator O2034 = 0.7701003386536001)
  17214. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17215. -->
  17216. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17217. -->
  17218. Firing elaborate*copy-see-to-output-link
  17219. -->
  17220. (I3 ^see 0 +)
  17221. Firing elaborate*reward*based*on*reward
  17222. -->
  17223. (R1021 ^value 1 +)
  17224. (R1 ^reward R1021 +)
  17225. Firing propose*predict-yes
  17226. -->
  17227. (O2035 ^name predict-yes +)
  17228. (S1 ^operator O2035 +)
  17229. Firing propose*predict-no
  17230. -->
  17231. (O2036 ^name predict-no +)
  17232. (S1 ^operator O2036 +)
  17233. Firing rl*prefer*rvt*predict-no*H0*6
  17234. -->
  17235. (S1 ^operator O2034 = 0.2298580688851452)
  17236. Firing rl*prefer*rvt*predict-yes*H0*5
  17237. -->
  17238. (S1 ^operator O2033 = 0.2939794178406799)
  17239. Firing prefer*rvt*predict-yes*H0
  17240. -->
  17241. Firing prefer*rvt*predict-no*H0
  17242. -->
  17243. Firing elaborate*copy-dir-to-output-link
  17244. -->
  17245. (I3 ^dir R +)
  17246. inner elaboration loop at bottom goal.
  17247. Retracting elaborate*copy-see-to-output-link
  17248. -->
  17249. (I3 ^see 0 +)
  17250. Retracting propose*predict-no
  17251. -->
  17252. (O2034 ^name predict-no +)
  17253. (S1 ^operator O2034 +)
  17254. Retracting propose*predict-yes
  17255. -->
  17256. (O2033 ^name predict-yes +)
  17257. (S1 ^operator O2033 +)
  17258. Retracting elaborate*reward*based*on*reward
  17259. -->
  17260. (R1020 ^value 1 +)
  17261. (R1 ^reward R1020 +)
  17262. Retracting elaborate*copy-dir-to-output-link
  17263. -->
  17264. (I3 ^dir R +)
  17265. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17266. -->
  17267. (S1 ^operator O2034 = 0.7701003386536001)
  17268. Retracting rl*prefer*rvt*predict-no*H0*6
  17269. -->
  17270. (S1 ^operator O2034 = 0.2298580688851452)
  17271. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17272. -->
  17273. (S1 ^operator O2033 = -0.1254042659579056)
  17274. Retracting rl*prefer*rvt*predict-yes*H0*5
  17275. -->
  17276. (S1 ^operator O2033 = 0.2939794178406799)
  17277. =>WM: (14270: S1 ^operator O2036 +)
  17278. =>WM: (14269: S1 ^operator O2035 +)
  17279. =>WM: (14268: O2036 ^name predict-no)
  17280. =>WM: (14267: O2035 ^name predict-yes)
  17281. =>WM: (14266: R1021 ^value 1)
  17282. =>WM: (14265: R1 ^reward R1021)
  17283. <=WM: (14256: S1 ^operator O2033 +)
  17284. <=WM: (14257: S1 ^operator O2034 +)
  17285. <=WM: (14258: S1 ^operator O2034)
  17286. <=WM: (14252: R1 ^reward R1020)
  17287. <=WM: (14255: O2034 ^name predict-no)
  17288. <=WM: (14254: O2033 ^name predict-yes)
  17289. <=WM: (14253: R1020 ^value 1)
  17290. --- Inner Elaboration Phase, active level 1 (S1) ---
  17291. Firing prefer*rvt*predict-yes*H0
  17292. -->
  17293. Firing rl*prefer*rvt*predict-yes*H0*5
  17294. -->
  17295. (S1 ^operator O2035 = 0.2939794178406799)
  17296. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17297. -->
  17298. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17299. -->
  17300. (S1 ^operator O2035 = -0.1254042659579056)
  17301. Firing prefer*rvt*predict-no*H0
  17302. -->
  17303. Firing rl*prefer*rvt*predict-no*H0*6
  17304. -->
  17305. (S1 ^operator O2036 = 0.2298580688851452)
  17306. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17307. -->
  17308. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17309. -->
  17310. (S1 ^operator O2036 = 0.7701003386536001)
  17311. inner elaboration loop at bottom goal.
  17312. Retracting rl*prefer*rvt*predict-no*H0*6
  17313. -->
  17314. (S1 ^operator O2034 = 0.2298580688851452)
  17315. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17316. -->
  17317. (S1 ^operator O2034 = 0.7701003386536001)
  17318. Retracting rl*prefer*rvt*predict-yes*H0*5
  17319. -->
  17320. (S1 ^operator O2033 = 0.2939794178406799)
  17321. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17322. -->
  17323. (S1 ^operator O2033 = -0.1254042659579056)
  17324. --- END Proposal Phase ---
  17325. --- Decision Phase ---
  17326. RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229858 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.85,0.128212)
  17327. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388056 0.382045 0.7701 -> 0.388059 0.382045 0.770104(R,m,v=1,1,0)
  17328. =>WM: (14271: S1 ^operator O2036)
  17329. 1018: O: O2036 (predict-no)
  17330. --- END Decision Phase ---
  17331. --- Application Phase ---
  17332. --- Firing Productions (PE) For State At Depth 1 ---
  17333. --- Inner Elaboration Phase, active level 1 (S1) ---
  17334. Firing apply*operator
  17335. -->
  17336. (I3 ^predict-no N1018 + :O )
  17337. Firing apply*operator*complete
  17338. -->
  17339. (I3 ^predict-no N1017 - :O )
  17340. inner elaboration loop at bottom goal.
  17341. --- Change Working Memory (PE) ---
  17342. =>WM: (14272: I3 ^predict-no N1018)
  17343. <=WM: (14260: N1017 ^status complete)
  17344. <=WM: (14259: I3 ^predict-no N1017)
  17345. --- Firing Productions (IE) For State At Depth 1 ---
  17346. --- Inner Elaboration Phase, active level 1 (S1) ---
  17347. Firing monitor*world
  17348. -->
  17349. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17350. --- Change Working Memory (IE) ---
  17351. --- END Application Phase ---
  17352. --- Output Phase ---
  17353. ENV: Agent did: predict-no for direction R in state State-B
  17354. In State-B moving R
  17355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17356. predict error 0
  17357. dir: dir isL
  17358. --- END Output Phase ---
  17359. ---- Input Phase ---
  17360. =>WM: (14276: I2 ^dir L)
  17361. =>WM: (14275: I2 ^reward 1)
  17362. =>WM: (14274: I2 ^see 0)
  17363. =>WM: (14273: N1018 ^status complete)
  17364. <=WM: (14263: I2 ^dir R)
  17365. <=WM: (14262: I2 ^reward 1)
  17366. <=WM: (14261: I2 ^see 0)
  17367. =>WM: (14277: I2 ^level-1 R0-root)
  17368. <=WM: (14264: I2 ^level-1 R0-root)
  17369. --- END Input Phase ---
  17370. --- Proposal Phase ---
  17371. --- Inner Elaboration Phase, active level 1 (S1) ---
  17372. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  17373. -->
  17374. (S1 ^operator O2035 = 0.6195702912967189)
  17375. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  17376. -->
  17377. (S1 ^operator O2036 = -0.2190661556260421)
  17378. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17379. -->
  17380. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17381. -->
  17382. Firing elaborate*copy-see-to-output-link
  17383. -->
  17384. (I3 ^see 0 +)
  17385. Firing elaborate*reward*based*on*reward
  17386. -->
  17387. (R1022 ^value 1 +)
  17388. (R1 ^reward R1022 +)
  17389. Firing propose*predict-yes
  17390. -->
  17391. (O2037 ^name predict-yes +)
  17392. (S1 ^operator O2037 +)
  17393. Firing propose*predict-no
  17394. -->
  17395. (O2038 ^name predict-no +)
  17396. (S1 ^operator O2038 +)
  17397. Firing rl*prefer*rvt*predict-no*H0*2
  17398. -->
  17399. (S1 ^operator O2036 = 0.3140082846697959)
  17400. Firing rl*prefer*rvt*predict-yes*H0*1
  17401. -->
  17402. (S1 ^operator O2035 = 0.3804138577541756)
  17403. Firing prefer*rvt*predict-yes*H0
  17404. -->
  17405. Firing prefer*rvt*predict-no*H0
  17406. -->
  17407. Firing elaborate*copy-dir-to-output-link
  17408. -->
  17409. (I3 ^dir L +)
  17410. inner elaboration loop at bottom goal.
  17411. Retracting elaborate*copy-see-to-output-link
  17412. -->
  17413. (I3 ^see 0 +)
  17414. Retracting propose*predict-no
  17415. -->
  17416. (O2036 ^name predict-no +)
  17417. (S1 ^operator O2036 +)
  17418. Retracting propose*predict-yes
  17419. -->
  17420. (O2035 ^name predict-yes +)
  17421. (S1 ^operator O2035 +)
  17422. Retracting elaborate*reward*based*on*reward
  17423. -->
  17424. (R1021 ^value 1 +)
  17425. (R1 ^reward R1021 +)
  17426. Retracting elaborate*copy-dir-to-output-link
  17427. -->
  17428. (I3 ^dir R +)
  17429. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  17430. -->
  17431. (S1 ^operator O2036 = 0.7701041764174563)
  17432. Retracting rl*prefer*rvt*predict-no*H0*6
  17433. -->
  17434. (S1 ^operator O2036 = 0.2298614269306036)
  17435. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  17436. -->
  17437. (S1 ^operator O2035 = -0.1254042659579056)
  17438. Retracting rl*prefer*rvt*predict-yes*H0*5
  17439. -->
  17440. (S1 ^operator O2035 = 0.2939794178406799)
  17441. =>WM: (14284: S1 ^operator O2038 +)
  17442. =>WM: (14283: S1 ^operator O2037 +)
  17443. =>WM: (14282: I3 ^dir L)
  17444. =>WM: (14281: O2038 ^name predict-no)
  17445. =>WM: (14280: O2037 ^name predict-yes)
  17446. =>WM: (14279: R1022 ^value 1)
  17447. =>WM: (14278: R1 ^reward R1022)
  17448. <=WM: (14269: S1 ^operator O2035 +)
  17449. <=WM: (14270: S1 ^operator O2036 +)
  17450. <=WM: (14271: S1 ^operator O2036)
  17451. <=WM: (14227: I3 ^dir R)
  17452. <=WM: (14265: R1 ^reward R1021)
  17453. <=WM: (14268: O2036 ^name predict-no)
  17454. <=WM: (14267: O2035 ^name predict-yes)
  17455. <=WM: (14266: R1021 ^value 1)
  17456. --- Inner Elaboration Phase, active level 1 (S1) ---
  17457. Firing prefer*rvt*predict-yes*H0
  17458. -->
  17459. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  17460. -->
  17461. (S1 ^operator O2037 = 0.6195702912967189)
  17462. Firing rl*prefer*rvt*predict-yes*H0*1
  17463. -->
  17464. (S1 ^operator O2037 = 0.3804138577541756)
  17465. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17466. -->
  17467. Firing prefer*rvt*predict-no*H0
  17468. -->
  17469. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  17470. -->
  17471. (S1 ^operator O2038 = -0.2190661556260421)
  17472. Firing rl*prefer*rvt*predict-no*H0*2
  17473. -->
  17474. (S1 ^operator O2038 = 0.3140082846697959)
  17475. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17476. -->
  17477. inner elaboration loop at bottom goal.
  17478. Retracting rl*prefer*rvt*predict-no*H0*2
  17479. -->
  17480. (S1 ^operator O2036 = 0.3140082846697959)
  17481. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  17482. -->
  17483. (S1 ^operator O2036 = -0.2190661556260421)
  17484. Retracting rl*prefer*rvt*predict-yes*H0*1
  17485. -->
  17486. (S1 ^operator O2035 = 0.3804138577541756)
  17487. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  17488. -->
  17489. (S1 ^operator O2035 = 0.6195702912967189)
  17490. --- END Proposal Phase ---
  17491. --- Decision Phase ---
  17492. RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611915 -0.382051 0.229864(R,m,v=1,0.850829,0.127624)
  17493. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388059 0.382045 0.770104 -> 0.388061 0.382046 0.770107(R,m,v=1,1,0)
  17494. =>WM: (14285: S1 ^operator O2037)
  17495. 1019: O: O2037 (predict-yes)
  17496. --- END Decision Phase ---
  17497. --- Application Phase ---
  17498. --- Firing Productions (PE) For State At Depth 1 ---
  17499. --- Inner Elaboration Phase, active level 1 (S1) ---
  17500. Firing apply*operator
  17501. -->
  17502. (I3 ^predict-yes N1019 + :O )
  17503. Firing apply*operator*complete
  17504. -->
  17505. (I3 ^predict-no N1018 - :O )
  17506. inner elaboration loop at bottom goal.
  17507. --- Change Working Memory (PE) ---
  17508. =>WM: (14286: I3 ^predict-yes N1019)
  17509. <=WM: (14273: N1018 ^status complete)
  17510. <=WM: (14272: I3 ^predict-no N1018)
  17511. --- Firing Productions (IE) For State At Depth 1 ---
  17512. --- Inner Elaboration Phase, active level 1 (S1) ---
  17513. Firing monitor*world
  17514. -->
  17515. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17516. --- Change Working Memory (IE) ---
  17517. --- END Application Phase ---
  17518. --- Output Phase ---
  17519. ENV: Agent did: predict-yes for direction L in state State-B
  17520. In State-B moving L
  17521. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17522. predict error 0
  17523. dir: dir isU
  17524. --- END Output Phase ---
  17525. /|\--- Input Phase ---
  17526. =>WM: (14290: I2 ^dir U)
  17527. =>WM: (14289: I2 ^reward 1)
  17528. =>WM: (14288: I2 ^see 1)
  17529. =>WM: (14287: N1019 ^status complete)
  17530. <=WM: (14276: I2 ^dir L)
  17531. <=WM: (14275: I2 ^reward 1)
  17532. <=WM: (14274: I2 ^see 0)
  17533. =>WM: (14291: I2 ^level-1 L1-root)
  17534. <=WM: (14277: I2 ^level-1 R0-root)
  17535. --- END Input Phase ---
  17536. --- Proposal Phase ---
  17537. --- Inner Elaboration Phase, active level 1 (S1) ---
  17538. Firing elaborate*copy-see-to-output-link
  17539. -->
  17540. (I3 ^see 1 +)
  17541. Firing elaborate*reward*based*on*reward
  17542. -->
  17543. (R1023 ^value 1 +)
  17544. (R1 ^reward R1023 +)
  17545. Firing propose*predict-yes
  17546. -->
  17547. (O2039 ^name predict-yes +)
  17548. (S1 ^operator O2039 +)
  17549. Firing propose*predict-no
  17550. -->
  17551. (O2040 ^name predict-no +)
  17552. (S1 ^operator O2040 +)
  17553. Firing rl*prefer*rvt*predict-no*H0*4
  17554. -->
  17555. (S1 ^operator O2038 = 1.)
  17556. Firing rl*prefer*rvt*predict-yes*H0*3
  17557. -->
  17558. (S1 ^operator O2037 = 0.)
  17559. Firing prefer*rvt*predict-yes*H0
  17560. -->
  17561. Firing prefer*rvt*predict-no*H0
  17562. -->
  17563. Firing elaborate*copy-dir-to-output-link
  17564. -->
  17565. (I3 ^dir U +)
  17566. inner elaboration loop at bottom goal.
  17567. Retracting elaborate*copy-see-to-output-link
  17568. -->
  17569. (I3 ^see 0 +)
  17570. Retracting propose*predict-no
  17571. -->
  17572. (O2038 ^name predict-no +)
  17573. (S1 ^operator O2038 +)
  17574. Retracting propose*predict-yes
  17575. -->
  17576. (O2037 ^name predict-yes +)
  17577. (S1 ^operator O2037 +)
  17578. Retracting elaborate*reward*based*on*reward
  17579. -->
  17580. (R1022 ^value 1 +)
  17581. (R1 ^reward R1022 +)
  17582. Retracting elaborate*copy-dir-to-output-link
  17583. -->
  17584. (I3 ^dir L +)
  17585. Retracting rl*prefer*rvt*predict-no*H0*2
  17586. -->
  17587. (S1 ^operator O2038 = 0.3140082846697959)
  17588. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  17589. -->
  17590. (S1 ^operator O2038 = -0.2190661556260421)
  17591. Retracting rl*prefer*rvt*predict-yes*H0*1
  17592. -->
  17593. (S1 ^operator O2037 = 0.3804138577541756)
  17594. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  17595. -->
  17596. (S1 ^operator O2037 = 0.6195702912967189)
  17597. =>WM: (14299: S1 ^operator O2040 +)
  17598. =>WM: (14298: S1 ^operator O2039 +)
  17599. =>WM: (14297: I3 ^dir U)
  17600. =>WM: (14296: O2040 ^name predict-no)
  17601. =>WM: (14295: O2039 ^name predict-yes)
  17602. =>WM: (14294: R1023 ^value 1)
  17603. =>WM: (14293: R1 ^reward R1023)
  17604. =>WM: (14292: I3 ^see 1)
  17605. <=WM: (14283: S1 ^operator O2037 +)
  17606. <=WM: (14285: S1 ^operator O2037)
  17607. <=WM: (14284: S1 ^operator O2038 +)
  17608. <=WM: (14282: I3 ^dir L)
  17609. <=WM: (14278: R1 ^reward R1022)
  17610. <=WM: (14251: I3 ^see 0)
  17611. <=WM: (14281: O2038 ^name predict-no)
  17612. <=WM: (14280: O2037 ^name predict-yes)
  17613. <=WM: (14279: R1022 ^value 1)
  17614. --- Inner Elaboration Phase, active level 1 (S1) ---
  17615. Firing prefer*rvt*predict-yes*H0
  17616. -->
  17617. Firing rl*prefer*rvt*predict-yes*H0*3
  17618. -->
  17619. (S1 ^operator O2039 = 0.)
  17620. Firing prefer*rvt*predict-no*H0
  17621. -->
  17622. Firing rl*prefer*rvt*predict-no*H0*4
  17623. -->
  17624. (S1 ^operator O2040 = 1.)
  17625. inner elaboration loop at bottom goal.
  17626. Retracting rl*prefer*rvt*predict-no*H0*4
  17627. -->
  17628. (S1 ^operator O2038 = 1.)
  17629. Retracting rl*prefer*rvt*predict-yes*H0*3
  17630. -->
  17631. (S1 ^operator O2037 = 0.)
  17632. --- END Proposal Phase ---
  17633. --- Decision Phase ---
  17634. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521345 -0.14093 0.380415(R,m,v=1,0.83432,0.139053)
  17635. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478639 0.140931 0.61957 -> 0.478641 0.140931 0.619572(R,m,v=1,1,0)
  17636. =>WM: (14300: S1 ^operator O2040)
  17637. 1020: O: O2040 (predict-no)
  17638. --- END Decision Phase ---
  17639. --- Application Phase ---
  17640. --- Firing Productions (PE) For State At Depth 1 ---
  17641. --- Inner Elaboration Phase, active level 1 (S1) ---
  17642. Firing apply*operator
  17643. -->
  17644. (I3 ^predict-no N1020 + :O )
  17645. Firing apply*operator*complete
  17646. -->
  17647. (I3 ^predict-yes N1019 - :O )
  17648. inner elaboration loop at bottom goal.
  17649. --- Change Working Memory (PE) ---
  17650. =>WM: (14301: I3 ^predict-no N1020)
  17651. <=WM: (14287: N1019 ^status complete)
  17652. <=WM: (14286: I3 ^predict-yes N1019)
  17653. --- Firing Productions (IE) For State At Depth 1 ---
  17654. --- Inner Elaboration Phase, active level 1 (S1) ---
  17655. Firing monitor*world
  17656. -->
  17657. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17658. --- Change Working Memory (IE) ---
  17659. --- END Application Phase ---
  17660. --- Output Phase ---
  17661. ENV: Agent did: predict-no for direction U in state State-A
  17662. In State-A moving U
  17663. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17664. predict error 0
  17665. dir: dir isU
  17666. --- END Output Phase ---
  17667. -/|--- Input Phase ---
  17668. =>WM: (14305: I2 ^dir U)
  17669. =>WM: (14304: I2 ^reward 1)
  17670. =>WM: (14303: I2 ^see 0)
  17671. =>WM: (14302: N1020 ^status complete)
  17672. <=WM: (14290: I2 ^dir U)
  17673. <=WM: (14289: I2 ^reward 1)
  17674. <=WM: (14288: I2 ^see 1)
  17675. =>WM: (14306: I2 ^level-1 L1-root)
  17676. <=WM: (14291: I2 ^level-1 L1-root)
  17677. --- END Input Phase ---
  17678. --- Proposal Phase ---
  17679. --- Inner Elaboration Phase, active level 1 (S1) ---
  17680. Firing elaborate*copy-see-to-output-link
  17681. -->
  17682. (I3 ^see 0 +)
  17683. Firing elaborate*reward*based*on*reward
  17684. -->
  17685. (R1024 ^value 1 +)
  17686. (R1 ^reward R1024 +)
  17687. Firing propose*predict-yes
  17688. -->
  17689. (O2041 ^name predict-yes +)
  17690. (S1 ^operator O2041 +)
  17691. Firing propose*predict-no
  17692. -->
  17693. (O2042 ^name predict-no +)
  17694. (S1 ^operator O2042 +)
  17695. Firing rl*prefer*rvt*predict-no*H0*4
  17696. -->
  17697. (S1 ^operator O2040 = 1.)
  17698. Firing rl*prefer*rvt*predict-yes*H0*3
  17699. -->
  17700. (S1 ^operator O2039 = 0.)
  17701. Firing prefer*rvt*predict-yes*H0
  17702. -->
  17703. Firing prefer*rvt*predict-no*H0
  17704. -->
  17705. Firing elaborate*copy-dir-to-output-link
  17706. -->
  17707. (I3 ^dir U +)
  17708. inner elaboration loop at bottom goal.
  17709. Retracting elaborate*copy-see-to-output-link
  17710. -->
  17711. (I3 ^see 1 +)
  17712. Retracting propose*predict-no
  17713. -->
  17714. (O2040 ^name predict-no +)
  17715. (S1 ^operator O2040 +)
  17716. Retracting propose*predict-yes
  17717. -->
  17718. (O2039 ^name predict-yes +)
  17719. (S1 ^operator O2039 +)
  17720. Retracting elaborate*reward*based*on*reward
  17721. -->
  17722. (R1023 ^value 1 +)
  17723. (R1 ^reward R1023 +)
  17724. Retracting elaborate*copy-dir-to-output-link
  17725. -->
  17726. (I3 ^dir U +)
  17727. Retracting rl*prefer*rvt*predict-no*H0*4
  17728. -->
  17729. (S1 ^operator O2040 = 1.)
  17730. Retracting rl*prefer*rvt*predict-yes*H0*3
  17731. -->
  17732. (S1 ^operator O2039 = 0.)
  17733. =>WM: (14313: S1 ^operator O2042 +)
  17734. =>WM: (14312: S1 ^operator O2041 +)
  17735. =>WM: (14311: O2042 ^name predict-no)
  17736. =>WM: (14310: O2041 ^name predict-yes)
  17737. =>WM: (14309: R1024 ^value 1)
  17738. =>WM: (14308: R1 ^reward R1024)
  17739. =>WM: (14307: I3 ^see 0)
  17740. <=WM: (14298: S1 ^operator O2039 +)
  17741. <=WM: (14299: S1 ^operator O2040 +)
  17742. <=WM: (14300: S1 ^operator O2040)
  17743. <=WM: (14293: R1 ^reward R1023)
  17744. <=WM: (14292: I3 ^see 1)
  17745. <=WM: (14296: O2040 ^name predict-no)
  17746. <=WM: (14295: O2039 ^name predict-yes)
  17747. <=WM: (14294: R1023 ^value 1)
  17748. --- Inner Elaboration Phase, active level 1 (S1) ---
  17749. Firing prefer*rvt*predict-yes*H0
  17750. -->
  17751. Firing rl*prefer*rvt*predict-yes*H0*3
  17752. -->
  17753. (S1 ^operator O2041 = 0.)
  17754. Firing prefer*rvt*predict-no*H0
  17755. -->
  17756. Firing rl*prefer*rvt*predict-no*H0*4
  17757. -->
  17758. (S1 ^operator O2042 = 1.)
  17759. inner elaboration loop at bottom goal.
  17760. Retracting rl*prefer*rvt*predict-no*H0*4
  17761. -->
  17762. (S1 ^operator O2040 = 1.)
  17763. Retracting rl*prefer*rvt*predict-yes*H0*3
  17764. -->
  17765. (S1 ^operator O2039 = 0.)
  17766. --- END Proposal Phase ---
  17767. --- Decision Phase ---
  17768. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17769. =>WM: (14314: S1 ^operator O2042)
  17770. 1021: O: O2042 (predict-no)
  17771. --- END Decision Phase ---
  17772. --- Application Phase ---
  17773. --- Firing Productions (PE) For State At Depth 1 ---
  17774. --- Inner Elaboration Phase, active level 1 (S1) ---
  17775. Firing apply*operator
  17776. -->
  17777. (I3 ^predict-no N1021 + :O )
  17778. Firing apply*operator*complete
  17779. -->
  17780. (I3 ^predict-no N1020 - :O )
  17781. inner elaboration loop at bottom goal.
  17782. --- Change Working Memory (PE) ---
  17783. =>WM: (14315: I3 ^predict-no N1021)
  17784. <=WM: (14302: N1020 ^status complete)
  17785. <=WM: (14301: I3 ^predict-no N1020)
  17786. --- Firing Productions (IE) For State At Depth 1 ---
  17787. --- Inner Elaboration Phase, active level 1 (S1) ---
  17788. Firing monitor*world
  17789. -->
  17790. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17791. --- Change Working Memory (IE) ---
  17792. --- END Application Phase ---
  17793. --- Output Phase ---
  17794. ENV: Agent did: predict-no for direction U in state State-A
  17795. In State-A moving U
  17796. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17797. predict error 0
  17798. dir: dir isR
  17799. --- END Output Phase ---
  17800. \--- Input Phase ---
  17801. =>WM: (14319: I2 ^dir R)
  17802. =>WM: (14318: I2 ^reward 1)
  17803. =>WM: (14317: I2 ^see 0)
  17804. =>WM: (14316: N1021 ^status complete)
  17805. <=WM: (14305: I2 ^dir U)
  17806. <=WM: (14304: I2 ^reward 1)
  17807. <=WM: (14303: I2 ^see 0)
  17808. =>WM: (14320: I2 ^level-1 L1-root)
  17809. <=WM: (14306: I2 ^level-1 L1-root)
  17810. --- END Input Phase ---
  17811. --- Proposal Phase ---
  17812. --- Inner Elaboration Phase, active level 1 (S1) ---
  17813. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  17814. -->
  17815. (S1 ^operator O2041 = 0.7062713203494733)
  17816. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  17817. -->
  17818. (S1 ^operator O2042 = -0.1937987592593187)
  17819. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17820. -->
  17821. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17822. -->
  17823. Firing elaborate*copy-see-to-output-link
  17824. -->
  17825. (I3 ^see 0 +)
  17826. Firing elaborate*reward*based*on*reward
  17827. -->
  17828. (R1025 ^value 1 +)
  17829. (R1 ^reward R1025 +)
  17830. Firing propose*predict-yes
  17831. -->
  17832. (O2043 ^name predict-yes +)
  17833. (S1 ^operator O2043 +)
  17834. Firing propose*predict-no
  17835. -->
  17836. (O2044 ^name predict-no +)
  17837. (S1 ^operator O2044 +)
  17838. Firing rl*prefer*rvt*predict-no*H0*6
  17839. -->
  17840. (S1 ^operator O2042 = 0.229864201526749)
  17841. Firing rl*prefer*rvt*predict-yes*H0*5
  17842. -->
  17843. (S1 ^operator O2041 = 0.2939794178406799)
  17844. Firing prefer*rvt*predict-yes*H0
  17845. -->
  17846. Firing prefer*rvt*predict-no*H0
  17847. -->
  17848. Firing elaborate*copy-dir-to-output-link
  17849. -->
  17850. (I3 ^dir R +)
  17851. inner elaboration loop at bottom goal.
  17852. Retracting elaborate*copy-see-to-output-link
  17853. -->
  17854. (I3 ^see 0 +)
  17855. Retracting propose*predict-no
  17856. -->
  17857. (O2042 ^name predict-no +)
  17858. (S1 ^operator O2042 +)
  17859. Retracting propose*predict-yes
  17860. -->
  17861. (O2041 ^name predict-yes +)
  17862. (S1 ^operator O2041 +)
  17863. Retracting elaborate*reward*based*on*reward
  17864. -->
  17865. (R1024 ^value 1 +)
  17866. (R1 ^reward R1024 +)
  17867. Retracting elaborate*copy-dir-to-output-link
  17868. -->
  17869. (I3 ^dir U +)
  17870. Retracting rl*prefer*rvt*predict-no*H0*4
  17871. -->
  17872. (S1 ^operator O2042 = 1.)
  17873. Retracting rl*prefer*rvt*predict-yes*H0*3
  17874. -->
  17875. (S1 ^operator O2041 = 0.)
  17876. =>WM: (14327: S1 ^operator O2044 +)
  17877. =>WM: (14326: S1 ^operator O2043 +)
  17878. =>WM: (14325: I3 ^dir R)
  17879. =>WM: (14324: O2044 ^name predict-no)
  17880. =>WM: (14323: O2043 ^name predict-yes)
  17881. =>WM: (14322: R1025 ^value 1)
  17882. =>WM: (14321: R1 ^reward R1025)
  17883. <=WM: (14312: S1 ^operator O2041 +)
  17884. <=WM: (14313: S1 ^operator O2042 +)
  17885. <=WM: (14314: S1 ^operator O2042)
  17886. <=WM: (14297: I3 ^dir U)
  17887. <=WM: (14308: R1 ^reward R1024)
  17888. <=WM: (14311: O2042 ^name predict-no)
  17889. <=WM: (14310: O2041 ^name predict-yes)
  17890. <=WM: (14309: R1024 ^value 1)
  17891. --- Inner Elaboration Phase, active level 1 (S1) ---
  17892. Firing prefer*rvt*predict-yes*H0
  17893. -->
  17894. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  17895. -->
  17896. (S1 ^operator O2043 = 0.7062713203494733)
  17897. Firing rl*prefer*rvt*predict-yes*H0*5
  17898. -->
  17899. (S1 ^operator O2043 = 0.2939794178406799)
  17900. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17901. -->
  17902. Firing prefer*rvt*predict-no*H0
  17903. -->
  17904. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  17905. -->
  17906. (S1 ^operator O2044 = -0.1937987592593187)
  17907. Firing rl*prefer*rvt*predict-no*H0*6
  17908. -->
  17909. (S1 ^operator O2044 = 0.229864201526749)
  17910. Firing prefer*rvt*predict-no*H0*6*v1*H1
  17911. -->
  17912. inner elaboration loop at bottom goal.
  17913. Retracting rl*prefer*rvt*predict-no*H0*6
  17914. -->
  17915. (S1 ^operator O2042 = 0.229864201526749)
  17916. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  17917. -->
  17918. (S1 ^operator O2042 = -0.1937987592593187)
  17919. Retracting rl*prefer*rvt*predict-yes*H0*5
  17920. -->
  17921. (S1 ^operator O2041 = 0.2939794178406799)
  17922. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  17923. -->
  17924. (S1 ^operator O2041 = 0.7062713203494733)
  17925. --- END Proposal Phase ---
  17926. --- Decision Phase ---
  17927. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17928. =>WM: (14328: S1 ^operator O2043)
  17929. 1022: O: O2043 (predict-yes)
  17930. --- END Decision Phase ---
  17931. --- Application Phase ---
  17932. --- Firing Productions (PE) For State At Depth 1 ---
  17933. --- Inner Elaboration Phase, active level 1 (S1) ---
  17934. Firing apply*operator
  17935. -->
  17936. (I3 ^predict-yes N1022 + :O )
  17937. Firing apply*operator*complete
  17938. -->
  17939. (I3 ^predict-no N1021 - :O )
  17940. inner elaboration loop at bottom goal.
  17941. --- Change Working Memory (PE) ---
  17942. =>WM: (14329: I3 ^predict-yes N1022)
  17943. <=WM: (14316: N1021 ^status complete)
  17944. <=WM: (14315: I3 ^predict-no N1021)
  17945. --- Firing Productions (IE) For State At Depth 1 ---
  17946. --- Inner Elaboration Phase, active level 1 (S1) ---
  17947. Firing monitor*world
  17948. -->
  17949. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17950. --- Change Working Memory (IE) ---
  17951. --- END Application Phase ---
  17952. --- Output Phase ---
  17953. ENV: Agent did: predict-yes for direction R in state State-A
  17954. In State-A moving R
  17955. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17956. predict error 0
  17957. dir: dir isL
  17958. --- END Output Phase ---
  17959. -/|--- Input Phase ---
  17960. =>WM: (14333: I2 ^dir L)
  17961. =>WM: (14332: I2 ^reward 1)
  17962. =>WM: (14331: I2 ^see 1)
  17963. =>WM: (14330: N1022 ^status complete)
  17964. <=WM: (14319: I2 ^dir R)
  17965. <=WM: (14318: I2 ^reward 1)
  17966. <=WM: (14317: I2 ^see 0)
  17967. =>WM: (14334: I2 ^level-1 R1-root)
  17968. <=WM: (14320: I2 ^level-1 L1-root)
  17969. --- END Input Phase ---
  17970. --- Proposal Phase ---
  17971. --- Inner Elaboration Phase, active level 1 (S1) ---
  17972. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  17973. -->
  17974. (S1 ^operator O2043 = 0.6196052772291735)
  17975. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  17976. -->
  17977. (S1 ^operator O2044 = -0.1479504104026684)
  17978. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17979. -->
  17980. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17981. -->
  17982. Firing elaborate*copy-see-to-output-link
  17983. -->
  17984. (I3 ^see 1 +)
  17985. Firing elaborate*reward*based*on*reward
  17986. -->
  17987. (R1026 ^value 1 +)
  17988. (R1 ^reward R1026 +)
  17989. Firing propose*predict-yes
  17990. -->
  17991. (O2045 ^name predict-yes +)
  17992. (S1 ^operator O2045 +)
  17993. Firing propose*predict-no
  17994. -->
  17995. (O2046 ^name predict-no +)
  17996. (S1 ^operator O2046 +)
  17997. Firing rl*prefer*rvt*predict-no*H0*2
  17998. -->
  17999. (S1 ^operator O2044 = 0.3140082846697959)
  18000. Firing rl*prefer*rvt*predict-yes*H0*1
  18001. -->
  18002. (S1 ^operator O2043 = 0.3804151506751392)
  18003. Firing prefer*rvt*predict-yes*H0
  18004. -->
  18005. Firing prefer*rvt*predict-no*H0
  18006. -->
  18007. Firing elaborate*copy-dir-to-output-link
  18008. -->
  18009. (I3 ^dir L +)
  18010. inner elaboration loop at bottom goal.
  18011. Retracting elaborate*copy-see-to-output-link
  18012. -->
  18013. (I3 ^see 0 +)
  18014. Retracting propose*predict-no
  18015. -->
  18016. (O2044 ^name predict-no +)
  18017. (S1 ^operator O2044 +)
  18018. Retracting propose*predict-yes
  18019. -->
  18020. (O2043 ^name predict-yes +)
  18021. (S1 ^operator O2043 +)
  18022. Retracting elaborate*reward*based*on*reward
  18023. -->
  18024. (R1025 ^value 1 +)
  18025. (R1 ^reward R1025 +)
  18026. Retracting elaborate*copy-dir-to-output-link
  18027. -->
  18028. (I3 ^dir R +)
  18029. Retracting rl*prefer*rvt*predict-no*H0*6
  18030. -->
  18031. (S1 ^operator O2044 = 0.229864201526749)
  18032. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  18033. -->
  18034. (S1 ^operator O2044 = -0.1937987592593187)
  18035. Retracting rl*prefer*rvt*predict-yes*H0*5
  18036. -->
  18037. (S1 ^operator O2043 = 0.2939794178406799)
  18038. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  18039. -->
  18040. (S1 ^operator O2043 = 0.7062713203494733)
  18041. =>WM: (14342: S1 ^operator O2046 +)
  18042. =>WM: (14341: S1 ^operator O2045 +)
  18043. =>WM: (14340: I3 ^dir L)
  18044. =>WM: (14339: O2046 ^name predict-no)
  18045. =>WM: (14338: O2045 ^name predict-yes)
  18046. =>WM: (14337: R1026 ^value 1)
  18047. =>WM: (14336: R1 ^reward R1026)
  18048. =>WM: (14335: I3 ^see 1)
  18049. <=WM: (14326: S1 ^operator O2043 +)
  18050. <=WM: (14328: S1 ^operator O2043)
  18051. <=WM: (14327: S1 ^operator O2044 +)
  18052. <=WM: (14325: I3 ^dir R)
  18053. <=WM: (14321: R1 ^reward R1025)
  18054. <=WM: (14307: I3 ^see 0)
  18055. <=WM: (14324: O2044 ^name predict-no)
  18056. <=WM: (14323: O2043 ^name predict-yes)
  18057. <=WM: (14322: R1025 ^value 1)
  18058. --- Inner Elaboration Phase, active level 1 (S1) ---
  18059. Firing prefer*rvt*predict-yes*H0
  18060. -->
  18061. Firing rl*prefer*rvt*predict-yes*H0*1
  18062. -->
  18063. (S1 ^operator O2045 = 0.3804151506751392)
  18064. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18065. -->
  18066. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18067. -->
  18068. (S1 ^operator O2045 = 0.6196052772291735)
  18069. Firing prefer*rvt*predict-no*H0
  18070. -->
  18071. Firing rl*prefer*rvt*predict-no*H0*2
  18072. -->
  18073. (S1 ^operator O2046 = 0.3140082846697959)
  18074. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18075. -->
  18076. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18077. -->
  18078. (S1 ^operator O2046 = -0.1479504104026684)
  18079. inner elaboration loop at bottom goal.
  18080. Retracting rl*prefer*rvt*predict-no*H0*2
  18081. -->
  18082. (S1 ^operator O2044 = 0.3140082846697959)
  18083. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18084. -->
  18085. (S1 ^operator O2044 = -0.1479504104026684)
  18086. Retracting rl*prefer*rvt*predict-yes*H0*1
  18087. -->
  18088. (S1 ^operator O2043 = 0.3804151506751392)
  18089. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18090. -->
  18091. (S1 ^operator O2043 = 0.6196052772291735)
  18092. --- END Proposal Phase ---
  18093. --- Decision Phase ---
  18094. RL update rl*prefer*rvt*predict-yes*H0*5 0.501055 -0.207076 0.293979 -> 0.501037 -0.207078 0.293959(R,m,v=1,0.85,0.128302)
  18095. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499171 0.2071 0.706271 -> 0.499149 0.207098 0.706247(R,m,v=1,1,0)
  18096. =>WM: (14343: S1 ^operator O2045)
  18097. 1023: O: O2045 (predict-yes)
  18098. --- END Decision Phase ---
  18099. --- Application Phase ---
  18100. --- Firing Productions (PE) For State At Depth 1 ---
  18101. --- Inner Elaboration Phase, active level 1 (S1) ---
  18102. Firing apply*operator
  18103. -->
  18104. (I3 ^predict-yes N1023 + :O )
  18105. Firing apply*operator*complete
  18106. -->
  18107. (I3 ^predict-yes N1022 - :O )
  18108. inner elaboration loop at bottom goal.
  18109. --- Change Working Memory (PE) ---
  18110. =>WM: (14344: I3 ^predict-yes N1023)
  18111. <=WM: (14330: N1022 ^status complete)
  18112. <=WM: (14329: I3 ^predict-yes N1022)
  18113. --- Firing Productions (IE) For State At Depth 1 ---
  18114. --- Inner Elaboration Phase, active level 1 (S1) ---
  18115. Firing monitor*world
  18116. -->
  18117. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18118. --- Change Working Memory (IE) ---
  18119. --- END Application Phase ---
  18120. --- Output Phase ---
  18121. ENV: Agent did: predict-yes for direction L in state State-B
  18122. In State-B moving L
  18123. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  18124. predict error 0
  18125. dir: dir isL
  18126. --- END Output Phase ---
  18127. \-/--- Input Phase ---
  18128. =>WM: (14348: I2 ^dir L)
  18129. =>WM: (14347: I2 ^reward 1)
  18130. =>WM: (14346: I2 ^see 1)
  18131. =>WM: (14345: N1023 ^status complete)
  18132. <=WM: (14333: I2 ^dir L)
  18133. <=WM: (14332: I2 ^reward 1)
  18134. <=WM: (14331: I2 ^see 1)
  18135. =>WM: (14349: I2 ^level-1 L1-root)
  18136. <=WM: (14334: I2 ^level-1 R1-root)
  18137. --- END Input Phase ---
  18138. --- Proposal Phase ---
  18139. --- Inner Elaboration Phase, active level 1 (S1) ---
  18140. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  18141. -->
  18142. (S1 ^operator O2045 = -0.3470159027404986)
  18143. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  18144. -->
  18145. (S1 ^operator O2046 = 0.6861042492871868)
  18146. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18147. -->
  18148. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18149. -->
  18150. Firing elaborate*copy-see-to-output-link
  18151. -->
  18152. (I3 ^see 1 +)
  18153. Firing elaborate*reward*based*on*reward
  18154. -->
  18155. (R1027 ^value 1 +)
  18156. (R1 ^reward R1027 +)
  18157. Firing propose*predict-yes
  18158. -->
  18159. (O2047 ^name predict-yes +)
  18160. (S1 ^operator O2047 +)
  18161. Firing propose*predict-no
  18162. -->
  18163. (O2048 ^name predict-no +)
  18164. (S1 ^operator O2048 +)
  18165. Firing rl*prefer*rvt*predict-no*H0*2
  18166. -->
  18167. (S1 ^operator O2046 = 0.3140082846697959)
  18168. Firing rl*prefer*rvt*predict-yes*H0*1
  18169. -->
  18170. (S1 ^operator O2045 = 0.3804151506751392)
  18171. Firing prefer*rvt*predict-yes*H0
  18172. -->
  18173. Firing prefer*rvt*predict-no*H0
  18174. -->
  18175. Firing elaborate*copy-dir-to-output-link
  18176. -->
  18177. (I3 ^dir L +)
  18178. inner elaboration loop at bottom goal.
  18179. Retracting elaborate*copy-see-to-output-link
  18180. -->
  18181. (I3 ^see 1 +)
  18182. Retracting propose*predict-no
  18183. -->
  18184. (O2046 ^name predict-no +)
  18185. (S1 ^operator O2046 +)
  18186. Retracting propose*predict-yes
  18187. -->
  18188. (O2045 ^name predict-yes +)
  18189. (S1 ^operator O2045 +)
  18190. Retracting elaborate*reward*based*on*reward
  18191. -->
  18192. (R1026 ^value 1 +)
  18193. (R1 ^reward R1026 +)
  18194. Retracting elaborate*copy-dir-to-output-link
  18195. -->
  18196. (I3 ^dir L +)
  18197. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18198. -->
  18199. (S1 ^operator O2046 = -0.1479504104026684)
  18200. Retracting rl*prefer*rvt*predict-no*H0*2
  18201. -->
  18202. (S1 ^operator O2046 = 0.3140082846697959)
  18203. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18204. -->
  18205. (S1 ^operator O2045 = 0.6196052772291735)
  18206. Retracting rl*prefer*rvt*predict-yes*H0*1
  18207. -->
  18208. (S1 ^operator O2045 = 0.3804151506751392)
  18209. =>WM: (14355: S1 ^operator O2048 +)
  18210. =>WM: (14354: S1 ^operator O2047 +)
  18211. =>WM: (14353: O2048 ^name predict-no)
  18212. =>WM: (14352: O2047 ^name predict-yes)
  18213. =>WM: (14351: R1027 ^value 1)
  18214. =>WM: (14350: R1 ^reward R1027)
  18215. <=WM: (14341: S1 ^operator O2045 +)
  18216. <=WM: (14343: S1 ^operator O2045)
  18217. <=WM: (14342: S1 ^operator O2046 +)
  18218. <=WM: (14336: R1 ^reward R1026)
  18219. <=WM: (14339: O2046 ^name predict-no)
  18220. <=WM: (14338: O2045 ^name predict-yes)
  18221. <=WM: (14337: R1026 ^value 1)
  18222. --- Inner Elaboration Phase, active level 1 (S1) ---
  18223. Firing prefer*rvt*predict-yes*H0
  18224. -->
  18225. Firing rl*prefer*rvt*predict-yes*H0*1
  18226. -->
  18227. (S1 ^operator O2047 = 0.3804151506751392)
  18228. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18229. -->
  18230. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  18231. -->
  18232. (S1 ^operator O2047 = -0.3470159027404986)
  18233. Firing prefer*rvt*predict-no*H0
  18234. -->
  18235. Firing rl*prefer*rvt*predict-no*H0*2
  18236. -->
  18237. (S1 ^operator O2048 = 0.3140082846697959)
  18238. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18239. -->
  18240. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  18241. -->
  18242. (S1 ^operator O2048 = 0.6861042492871868)
  18243. inner elaboration loop at bottom goal.
  18244. Retracting rl*prefer*rvt*predict-no*H0*2
  18245. -->
  18246. (S1 ^operator O2046 = 0.3140082846697959)
  18247. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  18248. -->
  18249. (S1 ^operator O2046 = 0.6861042492871868)
  18250. Retracting rl*prefer*rvt*predict-yes*H0*1
  18251. -->
  18252. (S1 ^operator O2045 = 0.3804151506751392)
  18253. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  18254. -->
  18255. (S1 ^operator O2045 = -0.3470159027404986)
  18256. --- END Proposal Phase ---
  18257. --- Decision Phase ---
  18258. RL update rl*prefer*rvt*predict-yes*H0*1 0.521345 -0.14093 0.380415 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.835294,0.138392)
  18259. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478677 0.140928 0.619605 -> 0.478675 0.140928 0.619603(R,m,v=1,1,0)
  18260. =>WM: (14356: S1 ^operator O2048)
  18261. 1024: O: O2048 (predict-no)
  18262. --- END Decision Phase ---
  18263. --- Application Phase ---
  18264. --- Firing Productions (PE) For State At Depth 1 ---
  18265. --- Inner Elaboration Phase, active level 1 (S1) ---
  18266. Firing apply*operator
  18267. -->
  18268. (I3 ^predict-no N1024 + :O )
  18269. Firing apply*operator*complete
  18270. -->
  18271. (I3 ^predict-yes N1023 - :O )
  18272. inner elaboration loop at bottom goal.
  18273. --- Change Working Memory (PE) ---
  18274. =>WM: (14357: I3 ^predict-no N1024)
  18275. <=WM: (14345: N1023 ^status complete)
  18276. <=WM: (14344: I3 ^predict-yes N1023)
  18277. --- Firing Productions (IE) For State At Depth 1 ---
  18278. --- Inner Elaboration Phase, active level 1 (S1) ---
  18279. Firing monitor*world
  18280. -->
  18281. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18282. --- Change Working Memory (IE) ---
  18283. --- END Application Phase ---
  18284. --- Output Phase ---
  18285. ENV: Agent did: predict-no for direction L in state State-A
  18286. In State-A moving L
  18287. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18288. predict error 0
  18289. dir: dir isU
  18290. --- END Output Phase ---
  18291. |\---- Input Phase ---
  18292. =>WM: (14361: I2 ^dir U)
  18293. =>WM: (14360: I2 ^reward 1)
  18294. =>WM: (14359: I2 ^see 0)
  18295. =>WM: (14358: N1024 ^status complete)
  18296. <=WM: (14348: I2 ^dir L)
  18297. <=WM: (14347: I2 ^reward 1)
  18298. <=WM: (14346: I2 ^see 1)
  18299. =>WM: (14362: I2 ^level-1 L0-root)
  18300. <=WM: (14349: I2 ^level-1 L1-root)
  18301. --- END Input Phase ---
  18302. --- Proposal Phase ---
  18303. --- Inner Elaboration Phase, active level 1 (S1) ---
  18304. Firing elaborate*copy-see-to-output-link
  18305. -->
  18306. (I3 ^see 0 +)
  18307. Firing elaborate*reward*based*on*reward
  18308. -->
  18309. (R1028 ^value 1 +)
  18310. (R1 ^reward R1028 +)
  18311. Firing propose*predict-yes
  18312. -->
  18313. (O2049 ^name predict-yes +)
  18314. (S1 ^operator O2049 +)
  18315. Firing propose*predict-no
  18316. -->
  18317. (O2050 ^name predict-no +)
  18318. (S1 ^operator O2050 +)
  18319. Firing rl*prefer*rvt*predict-no*H0*4
  18320. -->
  18321. (S1 ^operator O2048 = 1.)
  18322. Firing rl*prefer*rvt*predict-yes*H0*3
  18323. -->
  18324. (S1 ^operator O2047 = 0.)
  18325. Firing prefer*rvt*predict-yes*H0
  18326. -->
  18327. Firing prefer*rvt*predict-no*H0
  18328. -->
  18329. Firing elaborate*copy-dir-to-output-link
  18330. -->
  18331. (I3 ^dir U +)
  18332. inner elaboration loop at bottom goal.
  18333. Retracting elaborate*copy-see-to-output-link
  18334. -->
  18335. (I3 ^see 1 +)
  18336. Retracting propose*predict-no
  18337. -->
  18338. (O2048 ^name predict-no +)
  18339. (S1 ^operator O2048 +)
  18340. Retracting propose*predict-yes
  18341. -->
  18342. (O2047 ^name predict-yes +)
  18343. (S1 ^operator O2047 +)
  18344. Retracting elaborate*reward*based*on*reward
  18345. -->
  18346. (R1027 ^value 1 +)
  18347. (R1 ^reward R1027 +)
  18348. Retracting elaborate*copy-dir-to-output-link
  18349. -->
  18350. (I3 ^dir L +)
  18351. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  18352. -->
  18353. (S1 ^operator O2048 = 0.6861042492871868)
  18354. Retracting rl*prefer*rvt*predict-no*H0*2
  18355. -->
  18356. (S1 ^operator O2048 = 0.3140082846697959)
  18357. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  18358. -->
  18359. (S1 ^operator O2047 = -0.3470159027404986)
  18360. Retracting rl*prefer*rvt*predict-yes*H0*1
  18361. -->
  18362. (S1 ^operator O2047 = 0.3804134860259072)
  18363. =>WM: (14370: S1 ^operator O2050 +)
  18364. =>WM: (14369: S1 ^operator O2049 +)
  18365. =>WM: (14368: I3 ^dir U)
  18366. =>WM: (14367: O2050 ^name predict-no)
  18367. =>WM: (14366: O2049 ^name predict-yes)
  18368. =>WM: (14365: R1028 ^value 1)
  18369. =>WM: (14364: R1 ^reward R1028)
  18370. =>WM: (14363: I3 ^see 0)
  18371. <=WM: (14354: S1 ^operator O2047 +)
  18372. <=WM: (14355: S1 ^operator O2048 +)
  18373. <=WM: (14356: S1 ^operator O2048)
  18374. <=WM: (14340: I3 ^dir L)
  18375. <=WM: (14350: R1 ^reward R1027)
  18376. <=WM: (14335: I3 ^see 1)
  18377. <=WM: (14353: O2048 ^name predict-no)
  18378. <=WM: (14352: O2047 ^name predict-yes)
  18379. <=WM: (14351: R1027 ^value 1)
  18380. --- Inner Elaboration Phase, active level 1 (S1) ---
  18381. Firing prefer*rvt*predict-yes*H0
  18382. -->
  18383. Firing rl*prefer*rvt*predict-yes*H0*3
  18384. -->
  18385. (S1 ^operator O2049 = 0.)
  18386. Firing prefer*rvt*predict-no*H0
  18387. -->
  18388. Firing rl*prefer*rvt*predict-no*H0*4
  18389. -->
  18390. (S1 ^operator O2050 = 1.)
  18391. inner elaboration loop at bottom goal.
  18392. Retracting rl*prefer*rvt*predict-no*H0*4
  18393. -->
  18394. (S1 ^operator O2048 = 1.)
  18395. Retracting rl*prefer*rvt*predict-yes*H0*3
  18396. -->
  18397. (S1 ^operator O2047 = 0.)
  18398. --- END Proposal Phase ---
  18399. --- Decision Phase ---
  18400. RL update rl*prefer*rvt*predict-no*H0*2 0.485021 -0.171013 0.314008 -> 0.485014 -0.171015 0.313999(R,m,v=1,0.864516,0.117889)
  18401. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515068 0.171036 0.686104 -> 0.515059 0.171034 0.686093(R,m,v=1,1,0)
  18402. =>WM: (14371: S1 ^operator O2050)
  18403. 1025: O: O2050 (predict-no)
  18404. --- END Decision Phase ---
  18405. --- Application Phase ---
  18406. --- Firing Productions (PE) For State At Depth 1 ---
  18407. --- Inner Elaboration Phase, active level 1 (S1) ---
  18408. Firing apply*operator
  18409. -->
  18410. (I3 ^predict-no N1025 + :O )
  18411. Firing apply*operator*complete
  18412. -->
  18413. (I3 ^predict-no N1024 - :O )
  18414. inner elaboration loop at bottom goal.
  18415. --- Change Working Memory (PE) ---
  18416. =>WM: (14372: I3 ^predict-no N1025)
  18417. <=WM: (14358: N1024 ^status complete)
  18418. <=WM: (14357: I3 ^predict-no N1024)
  18419. --- Firing Productions (IE) For State At Depth 1 ---
  18420. --- Inner Elaboration Phase, active level 1 (S1) ---
  18421. Firing monitor*world
  18422. -->
  18423. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18424. --- Change Working Memory (IE) ---
  18425. --- END Application Phase ---
  18426. --- Output Phase ---
  18427. ENV: Agent did: predict-no for direction U in state State-A
  18428. In State-A moving U
  18429. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18430. predict error 0
  18431. dir: dir isR
  18432. --- END Output Phase ---
  18433. /|\--- Input Phase ---
  18434. =>WM: (14376: I2 ^dir R)
  18435. =>WM: (14375: I2 ^reward 1)
  18436. =>WM: (14374: I2 ^see 0)
  18437. =>WM: (14373: N1025 ^status complete)
  18438. <=WM: (14361: I2 ^dir U)
  18439. <=WM: (14360: I2 ^reward 1)
  18440. <=WM: (14359: I2 ^see 0)
  18441. =>WM: (14377: I2 ^level-1 L0-root)
  18442. <=WM: (14362: I2 ^level-1 L0-root)
  18443. --- END Input Phase ---
  18444. --- Proposal Phase ---
  18445. --- Inner Elaboration Phase, active level 1 (S1) ---
  18446. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  18447. -->
  18448. (S1 ^operator O2049 = 0.70565863259984)
  18449. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  18450. -->
  18451. (S1 ^operator O2050 = -0.2023211881870005)
  18452. Firing prefer*rvt*predict-no*H0*6*v1*H1
  18453. -->
  18454. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  18455. -->
  18456. Firing elaborate*copy-see-to-output-link
  18457. -->
  18458. (I3 ^see 0 +)
  18459. Firing elaborate*reward*based*on*reward
  18460. -->
  18461. (R1029 ^value 1 +)
  18462. (R1 ^reward R1029 +)
  18463. Firing propose*predict-yes
  18464. -->
  18465. (O2051 ^name predict-yes +)
  18466. (S1 ^operator O2051 +)
  18467. Firing propose*predict-no
  18468. -->
  18469. (O2052 ^name predict-no +)
  18470. (S1 ^operator O2052 +)
  18471. Firing rl*prefer*rvt*predict-no*H0*6
  18472. -->
  18473. (S1 ^operator O2050 = 0.229864201526749)
  18474. Firing rl*prefer*rvt*predict-yes*H0*5
  18475. -->
  18476. (S1 ^operator O2049 = 0.2939587815430382)
  18477. Firing prefer*rvt*predict-yes*H0
  18478. -->
  18479. Firing prefer*rvt*predict-no*H0
  18480. -->
  18481. Firing elaborate*copy-dir-to-output-link
  18482. -->
  18483. (I3 ^dir R +)
  18484. inner elaboration loop at bottom goal.
  18485. Retracting elaborate*copy-see-to-output-link
  18486. -->
  18487. (I3 ^see 0 +)
  18488. Retracting propose*predict-no
  18489. -->
  18490. (O2050 ^name predict-no +)
  18491. (S1 ^operator O2050 +)
  18492. Retracting propose*predict-yes
  18493. -->
  18494. (O2049 ^name predict-yes +)
  18495. (S1 ^operator O2049 +)
  18496. Retracting elaborate*reward*based*on*reward
  18497. -->
  18498. (R1028 ^value 1 +)
  18499. (R1 ^reward R1028 +)
  18500. Retracting elaborate*copy-dir-to-output-link
  18501. -->
  18502. (I3 ^dir U +)
  18503. Retracting rl*prefer*rvt*predict-no*H0*4
  18504. -->
  18505. (S1 ^operator O2050 = 1.)
  18506. Retracting rl*prefer*rvt*predict-yes*H0*3
  18507. -->
  18508. (S1 ^operator O2049 = 0.)
  18509. =>WM: (14384: S1 ^operator O2052 +)
  18510. =>WM: (14383: S1 ^operator O2051 +)
  18511. =>WM: (14382: I3 ^dir R)
  18512. =>WM: (14381: O2052 ^name predict-no)
  18513. =>WM: (14380: O2051 ^name predict-yes)
  18514. =>WM: (14379: R1029 ^value 1)
  18515. =>WM: (14378: R1 ^reward R1029)
  18516. <=WM: (14369: S1 ^operator O2049 +)
  18517. <=WM: (14370: S1 ^operator O2050 +)
  18518. <=WM: (14371: S1 ^operator O2050)
  18519. <=WM: (14368: I3 ^dir U)
  18520. <=WM: (14364: R1 ^reward R1028)
  18521. <=WM: (14367: O2050 ^name predict-no)
  18522. <=WM: (14366: O2049 ^name predict-yes)
  18523. <=WM: (14365: R1028 ^value 1)
  18524. --- Inner Elaboration Phase, active level 1 (S1) ---
  18525. Firing prefer*rvt*predict-yes*H0
  18526. -->
  18527. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  18528. -->
  18529. (S1 ^operator O2051 = 0.70565863259984)
  18530. Firing rl*prefer*rvt*predict-yes*H0*5
  18531. -->
  18532. (S1 ^operator O2051 = 0.2939587815430382)
  18533. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  18534. -->
  18535. Firing prefer*rvt*predict-no*H0
  18536. -->
  18537. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  18538. -->
  18539. (S1 ^operator O2052 = -0.2023211881870005)
  18540. Firing rl*prefer*rvt*predict-no*H0*6
  18541. -->
  18542. (S1 ^operator O2052 = 0.229864201526749)
  18543. Firing prefer*rvt*predict-no*H0*6*v1*H1
  18544. -->
  18545. inner elaboration loop at bottom goal.
  18546. Retracting rl*prefer*rvt*predict-no*H0*6
  18547. -->
  18548. (S1 ^operator O2050 = 0.229864201526749)
  18549. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  18550. -->
  18551. (S1 ^operator O2050 = -0.2023211881870005)
  18552. Retracting rl*prefer*rvt*predict-yes*H0*5
  18553. -->
  18554. (S1 ^operator O2049 = 0.2939587815430382)
  18555. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  18556. -->
  18557. (S1 ^operator O2049 = 0.70565863259984)
  18558. --- END Proposal Phase ---
  18559. --- Decision Phase ---
  18560. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18561. =>WM: (14385: S1 ^operator O2051)
  18562. 1026: O: O2051 (predict-yes)
  18563. --- END Decision Phase ---
  18564. --- Application Phase ---
  18565. --- Firing Productions (PE) For State At Depth 1 ---
  18566. --- Inner Elaboration Phase, active level 1 (S1) ---
  18567. Firing apply*operator
  18568. -->
  18569. (I3 ^predict-yes N1026 + :O )
  18570. Firing apply*operator*complete
  18571. -->
  18572. (I3 ^predict-no N1025 - :O )
  18573. inner elaboration loop at bottom goal.
  18574. --- Change Working Memory (PE) ---
  18575. =>WM: (14386: I3 ^predict-yes N1026)
  18576. <=WM: (14373: N1025 ^status complete)
  18577. <=WM: (14372: I3 ^predict-no N1025)
  18578. --- Firing Productions (IE) For State At Depth 1 ---
  18579. --- Inner Elaboration Phase, active level 1 (S1) ---
  18580. Firing monitor*world
  18581. -->
  18582. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18583. --- Change Working Memory (IE) ---
  18584. --- END Application Phase ---
  18585. --- Output Phase ---
  18586. ENV: Agent did: predict-yes for direction R in state State-A
  18587. In State-A moving R
  18588. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18589. predict error 0
  18590. dir: dir isU
  18591. --- END Output Phase ---
  18592. -/|--- Input Phase ---
  18593. =>WM: (14390: I2 ^dir U)
  18594. =>WM: (14389: I2 ^reward 1)
  18595. =>WM: (14388: I2 ^see 1)
  18596. =>WM: (14387: N1026 ^status complete)
  18597. <=WM: (14376: I2 ^dir R)
  18598. <=WM: (14375: I2 ^reward 1)
  18599. <=WM: (14374: I2 ^see 0)
  18600. =>WM: (14391: I2 ^level-1 R1-root)
  18601. <=WM: (14377: I2 ^level-1 L0-root)
  18602. --- END Input Phase ---
  18603. --- Proposal Phase ---
  18604. --- Inner Elaboration Phase, active level 1 (S1) ---
  18605. Firing elaborate*copy-see-to-output-link
  18606. -->
  18607. (I3 ^see 1 +)
  18608. Firing elaborate*reward*based*on*reward
  18609. -->
  18610. (R1030 ^value 1 +)
  18611. (R1 ^reward R1030 +)
  18612. Firing propose*predict-yes
  18613. -->
  18614. (O2053 ^name predict-yes +)
  18615. (S1 ^operator O2053 +)
  18616. Firing propose*predict-no
  18617. -->
  18618. (O2054 ^name predict-no +)
  18619. (S1 ^operator O2054 +)
  18620. Firing rl*prefer*rvt*predict-no*H0*4
  18621. -->
  18622. (S1 ^operator O2052 = 1.)
  18623. Firing rl*prefer*rvt*predict-yes*H0*3
  18624. -->
  18625. (S1 ^operator O2051 = 0.)
  18626. Firing prefer*rvt*predict-yes*H0
  18627. -->
  18628. Firing prefer*rvt*predict-no*H0
  18629. -->
  18630. Firing elaborate*copy-dir-to-output-link
  18631. -->
  18632. (I3 ^dir U +)
  18633. inner elaboration loop at bottom goal.
  18634. Retracting elaborate*copy-see-to-output-link
  18635. -->
  18636. (I3 ^see 0 +)
  18637. Retracting propose*predict-no
  18638. -->
  18639. (O2052 ^name predict-no +)
  18640. (S1 ^operator O2052 +)
  18641. Retracting propose*predict-yes
  18642. -->
  18643. (O2051 ^name predict-yes +)
  18644. (S1 ^operator O2051 +)
  18645. Retracting elaborate*reward*based*on*reward
  18646. -->
  18647. (R1029 ^value 1 +)
  18648. (R1 ^reward R1029 +)
  18649. Retracting elaborate*copy-dir-to-output-link
  18650. -->
  18651. (I3 ^dir R +)
  18652. Retracting rl*prefer*rvt*predict-no*H0*6
  18653. -->
  18654. (S1 ^operator O2052 = 0.229864201526749)
  18655. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  18656. -->
  18657. (S1 ^operator O2052 = -0.2023211881870005)
  18658. Retracting rl*prefer*rvt*predict-yes*H0*5
  18659. -->
  18660. (S1 ^operator O2051 = 0.2939587815430382)
  18661. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  18662. -->
  18663. (S1 ^operator O2051 = 0.70565863259984)
  18664. =>WM: (14399: S1 ^operator O2054 +)
  18665. =>WM: (14398: S1 ^operator O2053 +)
  18666. =>WM: (14397: I3 ^dir U)
  18667. =>WM: (14396: O2054 ^name predict-no)
  18668. =>WM: (14395: O2053 ^name predict-yes)
  18669. =>WM: (14394: R1030 ^value 1)
  18670. =>WM: (14393: R1 ^reward R1030)
  18671. =>WM: (14392: I3 ^see 1)
  18672. <=WM: (14383: S1 ^operator O2051 +)
  18673. <=WM: (14385: S1 ^operator O2051)
  18674. <=WM: (14384: S1 ^operator O2052 +)
  18675. <=WM: (14382: I3 ^dir R)
  18676. <=WM: (14378: R1 ^reward R1029)
  18677. <=WM: (14363: I3 ^see 0)
  18678. <=WM: (14381: O2052 ^name predict-no)
  18679. <=WM: (14380: O2051 ^name predict-yes)
  18680. <=WM: (14379: R1029 ^value 1)
  18681. --- Inner Elaboration Phase, active level 1 (S1) ---
  18682. Firing prefer*rvt*predict-yes*H0
  18683. -->
  18684. Firing rl*prefer*rvt*predict-yes*H0*3
  18685. -->
  18686. (S1 ^operator O2053 = 0.)
  18687. Firing prefer*rvt*predict-no*H0
  18688. -->
  18689. Firing rl*prefer*rvt*predict-no*H0*4
  18690. -->
  18691. (S1 ^operator O2054 = 1.)
  18692. inner elaboration loop at bottom goal.
  18693. Retracting rl*prefer*rvt*predict-no*H0*4
  18694. -->
  18695. (S1 ^operator O2052 = 1.)
  18696. Retracting rl*prefer*rvt*predict-yes*H0*3
  18697. -->
  18698. (S1 ^operator O2051 = 0.)
  18699. --- END Proposal Phase ---
  18700. --- Decision Phase ---
  18701. RL update rl*prefer*rvt*predict-yes*H0*5 0.501037 -0.207078 0.293959 -> 0.501065 -0.207075 0.29399(R,m,v=1,0.850932,0.12764)
  18702. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498617 0.207041 0.705659 -> 0.498651 0.207045 0.705696(R,m,v=1,1,0)
  18703. =>WM: (14400: S1 ^operator O2054)
  18704. 1027: O: O2054 (predict-no)
  18705. --- END Decision Phase ---
  18706. --- Application Phase ---
  18707. --- Firing Productions (PE) For State At Depth 1 ---
  18708. --- Inner Elaboration Phase, active level 1 (S1) ---
  18709. Firing apply*operator
  18710. -->
  18711. (I3 ^predict-no N1027 + :O )
  18712. Firing apply*operator*complete
  18713. -->
  18714. (I3 ^predict-yes N1026 - :O )
  18715. inner elaboration loop at bottom goal.
  18716. --- Change Working Memory (PE) ---
  18717. =>WM: (14401: I3 ^predict-no N1027)
  18718. <=WM: (14387: N1026 ^status complete)
  18719. <=WM: (14386: I3 ^predict-yes N1026)
  18720. --- Firing Productions (IE) For State At Depth 1 ---
  18721. --- Inner Elaboration Phase, active level 1 (S1) ---
  18722. Firing monitor*world
  18723. -->
  18724. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18725. --- Change Working Memory (IE) ---
  18726. --- END Application Phase ---
  18727. --- Output Phase ---
  18728. ENV: Agent did: predict-no for direction U in state State-B
  18729. In State-B moving U
  18730. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18731. predict error 0
  18732. dir: dir isU
  18733. --- END Output Phase ---
  18734. \-/--- Input Phase ---
  18735. =>WM: (14405: I2 ^dir U)
  18736. =>WM: (14404: I2 ^reward 1)
  18737. =>WM: (14403: I2 ^see 0)
  18738. =>WM: (14402: N1027 ^status complete)
  18739. <=WM: (14390: I2 ^dir U)
  18740. <=WM: (14389: I2 ^reward 1)
  18741. <=WM: (14388: I2 ^see 1)
  18742. =>WM: (14406: I2 ^level-1 R1-root)
  18743. <=WM: (14391: I2 ^level-1 R1-root)
  18744. --- END Input Phase ---
  18745. --- Proposal Phase ---
  18746. --- Inner Elaboration Phase, active level 1 (S1) ---
  18747. Firing elaborate*copy-see-to-output-link
  18748. -->
  18749. (I3 ^see 0 +)
  18750. Firing elaborate*reward*based*on*reward
  18751. -->
  18752. (R1031 ^value 1 +)
  18753. (R1 ^reward R1031 +)
  18754. Firing propose*predict-yes
  18755. -->
  18756. (O2055 ^name predict-yes +)
  18757. (S1 ^operator O2055 +)
  18758. Firing propose*predict-no
  18759. -->
  18760. (O2056 ^name predict-no +)
  18761. (S1 ^operator O2056 +)
  18762. Firing rl*prefer*rvt*predict-no*H0*4
  18763. -->
  18764. (S1 ^operator O2054 = 1.)
  18765. Firing rl*prefer*rvt*predict-yes*H0*3
  18766. -->
  18767. (S1 ^operator O2053 = 0.)
  18768. Firing prefer*rvt*predict-yes*H0
  18769. -->
  18770. Firing prefer*rvt*predict-no*H0
  18771. -->
  18772. Firing elaborate*copy-dir-to-output-link
  18773. -->
  18774. (I3 ^dir U +)
  18775. inner elaboration loop at bottom goal.
  18776. Retracting elaborate*copy-see-to-output-link
  18777. -->
  18778. (I3 ^see 1 +)
  18779. Retracting propose*predict-no
  18780. -->
  18781. (O2054 ^name predict-no +)
  18782. (S1 ^operator O2054 +)
  18783. Retracting propose*predict-yes
  18784. -->
  18785. (O2053 ^name predict-yes +)
  18786. (S1 ^operator O2053 +)
  18787. Retracting elaborate*reward*based*on*reward
  18788. -->
  18789. (R1030 ^value 1 +)
  18790. (R1 ^reward R1030 +)
  18791. Retracting elaborate*copy-dir-to-output-link
  18792. -->
  18793. (I3 ^dir U +)
  18794. Retracting rl*prefer*rvt*predict-no*H0*4
  18795. -->
  18796. (S1 ^operator O2054 = 1.)
  18797. Retracting rl*prefer*rvt*predict-yes*H0*3
  18798. -->
  18799. (S1 ^operator O2053 = 0.)
  18800. =>WM: (14413: S1 ^operator O2056 +)
  18801. =>WM: (14412: S1 ^operator O2055 +)
  18802. =>WM: (14411: O2056 ^name predict-no)
  18803. =>WM: (14410: O2055 ^name predict-yes)
  18804. =>WM: (14409: R1031 ^value 1)
  18805. =>WM: (14408: R1 ^reward R1031)
  18806. =>WM: (14407: I3 ^see 0)
  18807. <=WM: (14398: S1 ^operator O2053 +)
  18808. <=WM: (14399: S1 ^operator O2054 +)
  18809. <=WM: (14400: S1 ^operator O2054)
  18810. <=WM: (14393: R1 ^reward R1030)
  18811. <=WM: (14392: I3 ^see 1)
  18812. <=WM: (14396: O2054 ^name predict-no)
  18813. <=WM: (14395: O2053 ^name predict-yes)
  18814. <=WM: (14394: R1030 ^value 1)
  18815. --- Inner Elaboration Phase, active level 1 (S1) ---
  18816. Firing prefer*rvt*predict-yes*H0
  18817. -->
  18818. Firing rl*prefer*rvt*predict-yes*H0*3
  18819. -->
  18820. (S1 ^operator O2055 = 0.)
  18821. Firing prefer*rvt*predict-no*H0
  18822. -->
  18823. Firing rl*prefer*rvt*predict-no*H0*4
  18824. -->
  18825. (S1 ^operator O2056 = 1.)
  18826. inner elaboration loop at bottom goal.
  18827. Retracting rl*prefer*rvt*predict-no*H0*4
  18828. -->
  18829. (S1 ^operator O2054 = 1.)
  18830. Retracting rl*prefer*rvt*predict-yes*H0*3
  18831. -->
  18832. (S1 ^operator O2053 = 0.)
  18833. --- END Proposal Phase ---
  18834. --- Decision Phase ---
  18835. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18836. =>WM: (14414: S1 ^operator O2056)
  18837. 1028: O: O2056 (predict-no)
  18838. --- END Decision Phase ---
  18839. --- Application Phase ---
  18840. --- Firing Productions (PE) For State At Depth 1 ---
  18841. --- Inner Elaboration Phase, active level 1 (S1) ---
  18842. Firing apply*operator
  18843. -->
  18844. (I3 ^predict-no N1028 + :O )
  18845. Firing apply*operator*complete
  18846. -->
  18847. (I3 ^predict-no N1027 - :O )
  18848. inner elaboration loop at bottom goal.
  18849. --- Change Working Memory (PE) ---
  18850. =>WM: (14415: I3 ^predict-no N1028)
  18851. <=WM: (14402: N1027 ^status complete)
  18852. <=WM: (14401: I3 ^predict-no N1027)
  18853. --- Firing Productions (IE) For State At Depth 1 ---
  18854. --- Inner Elaboration Phase, active level 1 (S1) ---
  18855. Firing monitor*world
  18856. -->
  18857. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18858. --- Change Working Memory (IE) ---
  18859. --- END Application Phase ---
  18860. --- Output Phase ---
  18861. ENV: Agent did: predict-no for direction U in state State-B
  18862. In State-B moving U
  18863. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18864. predict error 0
  18865. dir: dir isL
  18866. --- END Output Phase ---
  18867. |\---- Input Phase ---
  18868. =>WM: (14419: I2 ^dir L)
  18869. =>WM: (14418: I2 ^reward 1)
  18870. =>WM: (14417: I2 ^see 0)
  18871. =>WM: (14416: N1028 ^status complete)
  18872. <=WM: (14405: I2 ^dir U)
  18873. <=WM: (14404: I2 ^reward 1)
  18874. <=WM: (14403: I2 ^see 0)
  18875. =>WM: (14420: I2 ^level-1 R1-root)
  18876. <=WM: (14406: I2 ^level-1 R1-root)
  18877. --- END Input Phase ---
  18878. --- Proposal Phase ---
  18879. --- Inner Elaboration Phase, active level 1 (S1) ---
  18880. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18881. -->
  18882. (S1 ^operator O2055 = 0.6196033311566926)
  18883. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18884. -->
  18885. (S1 ^operator O2056 = -0.1479504104026684)
  18886. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18887. -->
  18888. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18889. -->
  18890. Firing elaborate*copy-see-to-output-link
  18891. -->
  18892. (I3 ^see 0 +)
  18893. Firing elaborate*reward*based*on*reward
  18894. -->
  18895. (R1032 ^value 1 +)
  18896. (R1 ^reward R1032 +)
  18897. Firing propose*predict-yes
  18898. -->
  18899. (O2057 ^name predict-yes +)
  18900. (S1 ^operator O2057 +)
  18901. Firing propose*predict-no
  18902. -->
  18903. (O2058 ^name predict-no +)
  18904. (S1 ^operator O2058 +)
  18905. Firing rl*prefer*rvt*predict-no*H0*2
  18906. -->
  18907. (S1 ^operator O2056 = 0.313998974224576)
  18908. Firing rl*prefer*rvt*predict-yes*H0*1
  18909. -->
  18910. (S1 ^operator O2055 = 0.3804134860259072)
  18911. Firing prefer*rvt*predict-yes*H0
  18912. -->
  18913. Firing prefer*rvt*predict-no*H0
  18914. -->
  18915. Firing elaborate*copy-dir-to-output-link
  18916. -->
  18917. (I3 ^dir L +)
  18918. inner elaboration loop at bottom goal.
  18919. Retracting elaborate*copy-see-to-output-link
  18920. -->
  18921. (I3 ^see 0 +)
  18922. Retracting propose*predict-no
  18923. -->
  18924. (O2056 ^name predict-no +)
  18925. (S1 ^operator O2056 +)
  18926. Retracting propose*predict-yes
  18927. -->
  18928. (O2055 ^name predict-yes +)
  18929. (S1 ^operator O2055 +)
  18930. Retracting elaborate*reward*based*on*reward
  18931. -->
  18932. (R1031 ^value 1 +)
  18933. (R1 ^reward R1031 +)
  18934. Retracting elaborate*copy-dir-to-output-link
  18935. -->
  18936. (I3 ^dir U +)
  18937. Retracting rl*prefer*rvt*predict-no*H0*4
  18938. -->
  18939. (S1 ^operator O2056 = 1.)
  18940. Retracting rl*prefer*rvt*predict-yes*H0*3
  18941. -->
  18942. (S1 ^operator O2055 = 0.)
  18943. =>WM: (14427: S1 ^operator O2058 +)
  18944. =>WM: (14426: S1 ^operator O2057 +)
  18945. =>WM: (14425: I3 ^dir L)
  18946. =>WM: (14424: O2058 ^name predict-no)
  18947. =>WM: (14423: O2057 ^name predict-yes)
  18948. =>WM: (14422: R1032 ^value 1)
  18949. =>WM: (14421: R1 ^reward R1032)
  18950. <=WM: (14412: S1 ^operator O2055 +)
  18951. <=WM: (14413: S1 ^operator O2056 +)
  18952. <=WM: (14414: S1 ^operator O2056)
  18953. <=WM: (14397: I3 ^dir U)
  18954. <=WM: (14408: R1 ^reward R1031)
  18955. <=WM: (14411: O2056 ^name predict-no)
  18956. <=WM: (14410: O2055 ^name predict-yes)
  18957. <=WM: (14409: R1031 ^value 1)
  18958. --- Inner Elaboration Phase, active level 1 (S1) ---
  18959. Firing prefer*rvt*predict-yes*H0
  18960. -->
  18961. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18962. -->
  18963. (S1 ^operator O2057 = 0.6196033311566926)
  18964. Firing rl*prefer*rvt*predict-yes*H0*1
  18965. -->
  18966. (S1 ^operator O2057 = 0.3804134860259072)
  18967. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18968. -->
  18969. Firing prefer*rvt*predict-no*H0
  18970. -->
  18971. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18972. -->
  18973. (S1 ^operator O2058 = -0.1479504104026684)
  18974. Firing rl*prefer*rvt*predict-no*H0*2
  18975. -->
  18976. (S1 ^operator O2058 = 0.313998974224576)
  18977. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18978. -->
  18979. inner elaboration loop at bottom goal.
  18980. Retracting rl*prefer*rvt*predict-no*H0*2
  18981. -->
  18982. (S1 ^operator O2056 = 0.313998974224576)
  18983. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  18984. -->
  18985. (S1 ^operator O2056 = -0.1479504104026684)
  18986. Retracting rl*prefer*rvt*predict-yes*H0*1
  18987. -->
  18988. (S1 ^operator O2055 = 0.3804134860259072)
  18989. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  18990. -->
  18991. (S1 ^operator O2055 = 0.6196033311566926)
  18992. --- END Proposal Phase ---
  18993. --- Decision Phase ---
  18994. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18995. =>WM: (14428: S1 ^operator O2057)
  18996. 1029: O: O2057 (predict-yes)
  18997. --- END Decision Phase ---
  18998. --- Application Phase ---
  18999. --- Firing Productions (PE) For State At Depth 1 ---
  19000. --- Inner Elaboration Phase, active level 1 (S1) ---
  19001. Firing apply*operator
  19002. -->
  19003. (I3 ^predict-yes N1029 + :O )
  19004. Firing apply*operator*complete
  19005. -->
  19006. (I3 ^predict-no N1028 - :O )
  19007. inner elaboration loop at bottom goal.
  19008. --- Change Working Memory (PE) ---
  19009. =>WM: (14429: I3 ^predict-yes N1029)
  19010. <=WM: (14416: N1028 ^status complete)
  19011. <=WM: (14415: I3 ^predict-no N1028)
  19012. --- Firing Productions (IE) For State At Depth 1 ---
  19013. --- Inner Elaboration Phase, active level 1 (S1) ---
  19014. Firing monitor*world
  19015. -->
  19016. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19017. --- Change Working Memory (IE) ---
  19018. --- END Application Phase ---
  19019. --- Output Phase ---
  19020. ENV: Agent did: predict-yes for direction L in state State-B
  19021. In State-B moving L
  19022. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  19023. predict error 0
  19024. dir: dir isR
  19025. --- END Output Phase ---
  19026. /|\--- Input Phase ---
  19027. =>WM: (14433: I2 ^dir R)
  19028. =>WM: (14432: I2 ^reward 1)
  19029. =>WM: (14431: I2 ^see 1)
  19030. =>WM: (14430: N1029 ^status complete)
  19031. <=WM: (14419: I2 ^dir L)
  19032. <=WM: (14418: I2 ^reward 1)
  19033. <=WM: (14417: I2 ^see 0)
  19034. =>WM: (14434: I2 ^level-1 L1-root)
  19035. <=WM: (14420: I2 ^level-1 R1-root)
  19036. --- END Input Phase ---
  19037. --- Proposal Phase ---
  19038. --- Inner Elaboration Phase, active level 1 (S1) ---
  19039. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  19040. -->
  19041. (S1 ^operator O2057 = 0.7062472326455022)
  19042. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  19043. -->
  19044. (S1 ^operator O2058 = -0.1937987592593187)
  19045. Firing prefer*rvt*predict-no*H0*6*v1*H1
  19046. -->
  19047. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  19048. -->
  19049. Firing elaborate*copy-see-to-output-link
  19050. -->
  19051. (I3 ^see 1 +)
  19052. Firing elaborate*reward*based*on*reward
  19053. -->
  19054. (R1033 ^value 1 +)
  19055. (R1 ^reward R1033 +)
  19056. Firing propose*predict-yes
  19057. -->
  19058. (O2059 ^name predict-yes +)
  19059. (S1 ^operator O2059 +)
  19060. Firing propose*predict-no
  19061. -->
  19062. (O2060 ^name predict-no +)
  19063. (S1 ^operator O2060 +)
  19064. Firing rl*prefer*rvt*predict-no*H0*6
  19065. -->
  19066. (S1 ^operator O2058 = 0.229864201526749)
  19067. Firing rl*prefer*rvt*predict-yes*H0*5
  19068. -->
  19069. (S1 ^operator O2057 = 0.2939902369301627)
  19070. Firing prefer*rvt*predict-yes*H0
  19071. -->
  19072. Firing prefer*rvt*predict-no*H0
  19073. -->
  19074. Firing elaborate*copy-dir-to-output-link
  19075. -->
  19076. (I3 ^dir R +)
  19077. inner elaboration loop at bottom goal.
  19078. Retracting elaborate*copy-see-to-output-link
  19079. -->
  19080. (I3 ^see 0 +)
  19081. Retracting propose*predict-no
  19082. -->
  19083. (O2058 ^name predict-no +)
  19084. (S1 ^operator O2058 +)
  19085. Retracting propose*predict-yes
  19086. -->
  19087. (O2057 ^name predict-yes +)
  19088. (S1 ^operator O2057 +)
  19089. Retracting elaborate*reward*based*on*reward
  19090. -->
  19091. (R1032 ^value 1 +)
  19092. (R1 ^reward R1032 +)
  19093. Retracting elaborate*copy-dir-to-output-link
  19094. -->
  19095. (I3 ^dir L +)
  19096. Retracting rl*prefer*rvt*predict-no*H0*2
  19097. -->
  19098. (S1 ^operator O2058 = 0.313998974224576)
  19099. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  19100. -->
  19101. (S1 ^operator O2058 = -0.1479504104026684)
  19102. Retracting rl*prefer*rvt*predict-yes*H0*1
  19103. -->
  19104. (S1 ^operator O2057 = 0.3804134860259072)
  19105. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  19106. -->
  19107. (S1 ^operator O2057 = 0.6196033311566926)
  19108. =>WM: (14442: S1 ^operator O2060 +)
  19109. =>WM: (14441: S1 ^operator O2059 +)
  19110. =>WM: (14440: I3 ^dir R)
  19111. =>WM: (14439: O2060 ^name predict-no)
  19112. =>WM: (14438: O2059 ^name predict-yes)
  19113. =>WM: (14437: R1033 ^value 1)
  19114. =>WM: (14436: R1 ^reward R1033)
  19115. =>WM: (14435: I3 ^see 1)
  19116. <=WM: (14426: S1 ^operator O2057 +)
  19117. <=WM: (14428: S1 ^operator O2057)
  19118. <=WM: (14427: S1 ^operator O2058 +)
  19119. <=WM: (14425: I3 ^dir L)
  19120. <=WM: (14421: R1 ^reward R1032)
  19121. <=WM: (14407: I3 ^see 0)
  19122. <=WM: (14424: O2058 ^name predict-no)
  19123. <=WM: (14423: O2057 ^name predict-yes)
  19124. <=WM: (14422: R1032 ^value 1)
  19125. --- Inner Elaboration Phase, active level 1 (S1) ---
  19126. Firing prefer*rvt*predict-yes*H0
  19127. -->
  19128. Firing rl*prefer*rvt*predict-yes*H0*5
  19129. -->
  19130. (S1 ^operator O2059 = 0.2939902369301627)
  19131. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  19132. -->
  19133. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  19134. -->
  19135. (S1 ^operator O2059 = 0.7062472326455022)
  19136. Firing prefer*rvt*predict-no*H0
  19137. -->
  19138. Firing rl*prefer*rvt*predict-no*H0*6
  19139. -->
  19140. (S1 ^operator O2060 = 0.229864201526749)
  19141. Firing prefer*rvt*predict-no*H0*6*v1*H1
  19142. -->
  19143. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  19144. -->
  19145. (S1 ^operator O2060 = -0.1937987592593187)
  19146. inner elaboration loop at bottom goal.
  19147. Retracting rl*prefer*rvt*predict-no*H0*6
  19148. -->
  19149. (S1 ^operator O2058 = 0.229864201526749)
  19150. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  19151. -->
  19152. (S1 ^operator O2058 = -0.1937987592593187)
  19153. Retracting rl*prefer*rvt*predict-yes*H0*5
  19154. -->
  19155. (S1 ^operator O2057 = 0.2939902369301627)
  19156. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  19157. -->
  19158. (S1 ^operator O2057 = 0.7062472326455022)
  19159. --- END Proposal Phase ---
  19160. --- Decision Phase ---
  19161. RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.836257,0.137736)
  19162. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478675 0.140928 0.619603 -> 0.478673 0.140928 0.619602(R,m,v=1,1,0)
  19163. =>WM: (14443: S1 ^operator O2059)
  19164. 1030: O: O2059 (predict-yes)
  19165. --- END Decision Phase ---
  19166. --- Application Phase ---
  19167. --- Firing Productions (PE) For State At Depth 1 ---
  19168. --- Inner Elaboration Phase, active level 1 (S1) ---
  19169. Firing apply*operator
  19170. -->
  19171. (I3 ^predict-yes N1030 + :O )
  19172. Firing apply*operator*complete
  19173. -->
  19174. (I3 ^predict-yes N1029 - :O )
  19175. inner elaboration loop at bottom goal.
  19176. --- Change Working Memory (PE) ---
  19177. =>WM: (14444: I3 ^predict-yes N1030)
  19178. <=WM: (14430: N1029 ^status complete)
  19179. <=WM: (14429: I3 ^predict-yes N1029)
  19180. --- Firing Productions (IE) For State At Depth 1 ---
  19181. --- Inner Elaboration Phase, active level 1 (S1) ---
  19182. Firing monitor*world
  19183. -->
  19184. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19185. --- Change Working Memory (IE) ---
  19186. --- END Application Phase ---
  19187. --- Output Phase ---
  19188. ENV: Agent did: predict-yes for direction R in state State-A
  19189. In State-A moving R
  19190. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  19191. predict error 0
  19192. dir: dir isL
  19193. --- END Output Phase ---
  19194. -/|--- Input Phase ---
  19195. =>WM: (14448: I2 ^dir L)
  19196. =>WM: (14447: I2 ^reward 1)
  19197. =>WM: (14446: I2 ^see 1)
  19198. =>WM: (14445: N1030 ^status complete)
  19199. <=WM: (14433: I2 ^dir R)
  19200. <=WM: (14432: I2 ^reward 1)
  19201. <=WM: (14431: I2 ^see 1)
  19202. =>WM: (14449: I2 ^level-1 R1-root)
  19203. <=WM: (14434: I2 ^level-1 L1-root)
  19204. --- END Input Phase ---
  19205. --- Proposal Phase ---
  19206. --- Inner Elaboration Phase, active level 1 (S1) ---
  19207. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  19208. -->
  19209. (S1 ^operator O2059 = 0.6196017333792301)
  19210. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  19211. -->
  19212. (S1 ^operator O2060 = -0.1479504104026684)
  19213. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19214. -->
  19215. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19216. -->
  19217. Firing elaborate*copy-see-to-output-link
  19218. -->
  19219. (I3 ^see 1 +)
  19220. Firing elaborate*reward*based*on*reward
  19221. -->
  19222. (R1034 ^value 1 +)
  19223. (R1 ^reward R1034 +)
  19224. Firing propose*predict-yes
  19225. -->
  19226. (O2061 ^name predict-yes +)
  19227. (S1 ^operator O2061 +)
  19228. Firing propose*predict-no
  19229. -->
  19230. (O2062 ^name predict-no +)
  19231. (S1 ^operator O2062 +)
  19232. Firing rl*prefer*rvt*predict-no*H0*2
  19233. -->
  19234. (S1 ^operator O2060 = 0.313998974224576)
  19235. Firing rl*prefer*rvt*predict-yes*H0*1
  19236. -->
  19237. (S1 ^operator O2059 = 0.380412116919439)
  19238. Firing prefer*rvt*predict-yes*H0
  19239. -->
  19240. Firing prefer*rvt*predict-no*H0
  19241. -->
  19242. Firing elaborate*copy-dir-to-output-link
  19243. -->
  19244. (I3 ^dir L +)
  19245. inner elaboration loop at bottom goal.
  19246. Retracting elaborate*copy-see-to-output-link
  19247. -->
  19248. (I3 ^see 1 +)
  19249. Retracting propose*predict-no
  19250. -->
  19251. (O2060 ^name predict-no +)
  19252. (S1 ^operator O2060 +)
  19253. Retracting propose*predict-yes
  19254. -->
  19255. (O2059 ^name predict-yes +)
  19256. (S1 ^operator O2059 +)
  19257. Retracting elaborate*reward*based*on*reward
  19258. -->
  19259. (R1033 ^value 1 +)
  19260. (R1 ^reward R1033 +)
  19261. Retracting elaborate*copy-dir-to-output-link
  19262. -->
  19263. (I3 ^dir R +)
  19264. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  19265. -->
  19266. (S1 ^operator O2060 = -0.1937987592593187)
  19267. Retracting rl*prefer*rvt*predict-no*H0*6
  19268. -->
  19269. (S1 ^operator O2060 = 0.229864201526749)
  19270. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  19271. -->
  19272. (S1 ^operator O2059 = 0.7062472326455022)
  19273. Retracting rl*prefer*rvt*predict-yes*H0*5
  19274. -->
  19275. (S1 ^operator O2059 = 0.2939902369301627)
  19276. =>WM: (14456: S1 ^operator O2062 +)
  19277. =>WM: (14455: S1 ^operator O2061 +)
  19278. =>WM: (14454: I3 ^dir L)
  19279. =>WM: (14453: O2062 ^name predict-no)
  19280. =>WM: (14452: O2061 ^name predict-yes)
  19281. =>WM: (14451: R1034 ^value 1)
  19282. =>WM: (14450: R1 ^reward R1034)
  19283. <=WM: (14441: S1 ^operator O2059 +)
  19284. <=WM: (14443: S1 ^operator O2059)
  19285. <=WM: (14442: S1 ^operator O2060 +)
  19286. <=WM: (14440: I3 ^dir R)
  19287. <=WM: (14436: R1 ^reward R1033)
  19288. <=WM: (14439: O2060 ^name predict-no)
  19289. <=WM: (14438: O2059 ^name predict-yes)
  19290. <=WM: (14437: R1033 ^value 1)
  19291. --- Inner Elaboration Phase, active level 1 (S1) ---
  19292. Firing prefer*rvt*predict-yes*H0
  19293. -->
  19294. Firing rl*prefer*rvt*predict-yes*H0*1
  19295. -->
  19296. (S1 ^operator O2061 = 0.380412116919439)
  19297. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19298. -->
  19299. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  19300. -->
  19301. (S1 ^operator O2061 = 0.6196017333792301)
  19302. Firing prefer*rvt*predict-no*H0
  19303. -->
  19304. Firing rl*prefer*rvt*predict-no*H0*2
  19305. -->
  19306. (S1 ^operator O2062 = 0.313998974224576)
  19307. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19308. -->
  19309. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  19310. -->
  19311. (S1 ^operator O2062 = -0.1479504104026684)
  19312. inner elaboration loop at bottom goal.
  19313. Retracting rl*prefer*rvt*predict-no*H0*2
  19314. -->
  19315. (S1 ^operator O2060 = 0.313998974224576)
  19316. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  19317. -->
  19318. (S1 ^operator O2060 = -0.1479504104026684)
  19319. Retracting rl*prefer*rvt*predict-yes*H0*1
  19320. -->
  19321. (S1 ^operator O2059 = 0.380412116919439)
  19322. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  19323. -->
  19324. (S1 ^operator O2059 = 0.6196017333792301)
  19325. --- END Proposal Phase ---
  19326. --- Decision Phase ---
  19327. RL update rl*prefer*rvt*predict-yes*H0*5 0.501065 -0.207075 0.29399 -> 0.501047 -0.207077 0.293971(R,m,v=1,0.851852,0.126984)
  19328. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499149 0.207098 0.706247 -> 0.499129 0.207096 0.706224(R,m,v=1,1,0)
  19329. =>WM: (14457: S1 ^operator O2061)
  19330. 1031: O: O2061 (predict-yes)
  19331. --- END Decision Phase ---
  19332. --- Application Phase ---
  19333. --- Firing Productions (PE) For State At Depth 1 ---
  19334. --- Inner Elaboration Phase, active level 1 (S1) ---
  19335. Firing apply*operator
  19336. -->
  19337. (I3 ^predict-yes N1031 + :O )
  19338. Firing apply*operator*complete
  19339. -->
  19340. (I3 ^predict-yes N1030 - :O )
  19341. inner elaboration loop at bottom goal.
  19342. --- Change Working Memory (PE) ---
  19343. =>WM: (14458: I3 ^predict-yes N1031)
  19344. <=WM: (14445: N1030 ^status complete)
  19345. <=WM: (14444: I3 ^predict-yes N1030)
  19346. --- Firing Productions (IE) For State At Depth 1 ---
  19347. --- Inner Elaboration Phase, active level 1 (S1) ---
  19348. Firing monitor*world
  19349. -->
  19350. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19351. --- Change Working Memory (IE) ---
  19352. --- END Application Phase ---
  19353. --- Output Phase ---
  19354. ENV: Agent did: predict-yes for direction L in state State-B
  19355. In State-B moving L
  19356. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  19357. predict error 0
  19358. dir: dir isL
  19359. --- END Output Phase ---
  19360. \--- Input Phase ---
  19361. =>WM: (14462: I2 ^dir L)
  19362. =>WM: (14461: I2 ^reward 1)
  19363. =>WM: (14460: I2 ^see 1)
  19364. =>WM: (14459: N1031 ^status complete)
  19365. <=WM: (14448: I2 ^dir L)
  19366. <=WM: (14447: I2 ^reward 1)
  19367. <=WM: (14446: I2 ^see 1)
  19368. =>WM: (14463: I2 ^level-1 L1-root)
  19369. <=WM: (14449: I2 ^level-1 R1-root)
  19370. --- END Input Phase ---
  19371. --- Proposal Phase ---
  19372. --- Inner Elaboration Phase, active level 1 (S1) ---
  19373. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  19374. -->
  19375. (S1 ^operator O2061 = -0.3470159027404986)
  19376. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  19377. -->
  19378. (S1 ^operator O2062 = 0.6860933424731377)
  19379. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19380. -->
  19381. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19382. -->
  19383. Firing elaborate*copy-see-to-output-link
  19384. -->
  19385. (I3 ^see 1 +)
  19386. Firing elaborate*reward*based*on*reward
  19387. -->
  19388. (R1035 ^value 1 +)
  19389. (R1 ^reward R1035 +)
  19390. Firing propose*predict-yes
  19391. -->
  19392. (O2063 ^name predict-yes +)
  19393. (S1 ^operator O2063 +)
  19394. Firing propose*predict-no
  19395. -->
  19396. (O2064 ^name predict-no +)
  19397. (S1 ^operator O2064 +)
  19398. Firing rl*prefer*rvt*predict-no*H0*2
  19399. -->
  19400. (S1 ^operator O2062 = 0.313998974224576)
  19401. Firing rl*prefer*rvt*predict-yes*H0*1
  19402. -->
  19403. (S1 ^operator O2061 = 0.380412116919439)
  19404. Firing prefer*rvt*predict-yes*H0
  19405. -->
  19406. Firing prefer*rvt*predict-no*H0
  19407. -->
  19408. Firing elaborate*copy-dir-to-output-link
  19409. -->
  19410. (I3 ^dir L +)
  19411. inner elaboration loop at bottom goal.
  19412. Retracting elaborate*copy-see-to-output-link
  19413. -->
  19414. (I3 ^see 1 +)
  19415. Retracting propose*predict-no
  19416. -->
  19417. (O2062 ^name predict-no +)
  19418. (S1 ^operator O2062 +)
  19419. Retracting propose*predict-yes
  19420. -->
  19421. (O2061 ^name predict-yes +)
  19422. (S1 ^operator O2061 +)
  19423. Retracting elaborate*reward*based*on*reward
  19424. -->
  19425. (R1034 ^value 1 +)
  19426. (R1 ^reward R1034 +)
  19427. Retracting elaborate*copy-dir-to-output-link
  19428. -->
  19429. (I3 ^dir L +)
  19430. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  19431. -->
  19432. (S1 ^operator O2062 = -0.1479504104026684)
  19433. Retracting rl*prefer*rvt*predict-no*H0*2
  19434. -->
  19435. (S1 ^operator O2062 = 0.313998974224576)
  19436. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  19437. -->
  19438. (S1 ^operator O2061 = 0.6196017333792301)
  19439. Retracting rl*prefer*rvt*predict-yes*H0*1
  19440. -->
  19441. (S1 ^operator O2061 = 0.380412116919439)
  19442. =>WM: (14469: S1 ^operator O2064 +)
  19443. =>WM: (14468: S1 ^operator O2063 +)
  19444. =>WM: (14467: O2064 ^name predict-no)
  19445. =>WM: (14466: O2063 ^name predict-yes)
  19446. =>WM: (14465: R1035 ^value 1)
  19447. =>WM: (14464: R1 ^reward R1035)
  19448. <=WM: (14455: S1 ^operator O2061 +)
  19449. <=WM: (14457: S1 ^operator O2061)
  19450. <=WM: (14456: S1 ^operator O2062 +)
  19451. <=WM: (14450: R1 ^reward R1034)
  19452. <=WM: (14453: O2062 ^name predict-no)
  19453. <=WM: (14452: O2061 ^name predict-yes)
  19454. <=WM: (14451: R1034 ^value 1)
  19455. --- Inner Elaboration Phase, active level 1 (S1) ---
  19456. Firing prefer*rvt*predict-yes*H0
  19457. -->
  19458. Firing rl*prefer*rvt*predict-yes*H0*1
  19459. -->
  19460. (S1 ^operator O2063 = 0.380412116919439)
  19461. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19462. -->
  19463. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  19464. -->
  19465. (S1 ^operator O2063 = -0.3470159027404986)
  19466. Firing prefer*rvt*predict-no*H0
  19467. -->
  19468. Firing rl*prefer*rvt*predict-no*H0*2
  19469. -->
  19470. (S1 ^operator O2064 = 0.313998974224576)
  19471. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19472. -->
  19473. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  19474. -->
  19475. (S1 ^operator O2064 = 0.6860933424731377)
  19476. inner elaboration loop at bottom goal.
  19477. Retracting rl*prefer*rvt*predict-no*H0*2
  19478. -->
  19479. (S1 ^operator O2062 = 0.313998974224576)
  19480. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  19481. -->
  19482. (S1 ^operator O2062 = 0.6860933424731377)
  19483. Retracting rl*prefer*rvt*predict-yes*H0*1
  19484. -->
  19485. (S1 ^operator O2061 = 0.380412116919439)
  19486. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  19487. -->
  19488. (S1 ^operator O2061 = -0.3470159027404986)
  19489. --- END Proposal Phase ---
  19490. --- Decision Phase ---
  19491. RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521341 -0.14093 0.380411(R,m,v=1,0.837209,0.137087)
  19492. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478673 0.140928 0.619602 -> 0.478672 0.140929 0.6196(R,m,v=1,1,0)
  19493. =>WM: (14470: S1 ^operator O2064)
  19494. 1032: O: O2064 (predict-no)
  19495. --- END Decision Phase ---
  19496. --- Application Phase ---
  19497. --- Firing Productions (PE) For State At Depth 1 ---
  19498. --- Inner Elaboration Phase, active level 1 (S1) ---
  19499. Firing apply*operator
  19500. -->
  19501. (I3 ^predict-no N1032 + :O )
  19502. Firing apply*operator*complete
  19503. -->
  19504. (I3 ^predict-yes N1031 - :O )
  19505. inner elaboration loop at bottom goal.
  19506. --- Change Working Memory (PE) ---
  19507. =>WM: (14471: I3 ^predict-no N1032)
  19508. <=WM: (14459: N1031 ^status complete)
  19509. <=WM: (14458: I3 ^predict-yes N1031)
  19510. --- Firing Productions (IE) For State At Depth 1 ---
  19511. --- Inner Elaboration Phase, active level 1 (S1) ---
  19512. Firing monitor*world
  19513. -->
  19514. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19515. --- Change Working Memory (IE) ---
  19516. --- END Application Phase ---
  19517. --- Output Phase ---
  19518. ENV: Agent did: predict-no for direction L in state State-A
  19519. In State-A moving L
  19520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19521. predict error 0
  19522. dir: dir isL
  19523. --- END Output Phase ---
  19524. -/|--- Input Phase ---
  19525. =>WM: (14475: I2 ^dir L)
  19526. =>WM: (14474: I2 ^reward 1)
  19527. =>WM: (14473: I2 ^see 0)
  19528. =>WM: (14472: N1032 ^status complete)
  19529. <=WM: (14462: I2 ^dir L)
  19530. <=WM: (14461: I2 ^reward 1)
  19531. <=WM: (14460: I2 ^see 1)
  19532. =>WM: (14476: I2 ^level-1 L0-root)
  19533. <=WM: (14463: I2 ^level-1 L1-root)
  19534. --- END Input Phase ---
  19535. --- Proposal Phase ---
  19536. --- Inner Elaboration Phase, active level 1 (S1) ---
  19537. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19538. -->
  19539. (S1 ^operator O2063 = -0.3332708974800781)
  19540. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19541. -->
  19542. (S1 ^operator O2064 = 0.6857963029033564)
  19543. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19544. -->
  19545. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19546. -->
  19547. Firing elaborate*copy-see-to-output-link
  19548. -->
  19549. (I3 ^see 0 +)
  19550. Firing elaborate*reward*based*on*reward
  19551. -->
  19552. (R1036 ^value 1 +)
  19553. (R1 ^reward R1036 +)
  19554. Firing propose*predict-yes
  19555. -->
  19556. (O2065 ^name predict-yes +)
  19557. (S1 ^operator O2065 +)
  19558. Firing propose*predict-no
  19559. -->
  19560. (O2066 ^name predict-no +)
  19561. (S1 ^operator O2066 +)
  19562. Firing rl*prefer*rvt*predict-no*H0*2
  19563. -->
  19564. (S1 ^operator O2064 = 0.313998974224576)
  19565. Firing rl*prefer*rvt*predict-yes*H0*1
  19566. -->
  19567. (S1 ^operator O2063 = 0.3804109904199586)
  19568. Firing prefer*rvt*predict-yes*H0
  19569. -->
  19570. Firing prefer*rvt*predict-no*H0
  19571. -->
  19572. Firing elaborate*copy-dir-to-output-link
  19573. -->
  19574. (I3 ^dir L +)
  19575. inner elaboration loop at bottom goal.
  19576. Retracting elaborate*copy-see-to-output-link
  19577. -->
  19578. (I3 ^see 1 +)
  19579. Retracting propose*predict-no
  19580. -->
  19581. (O2064 ^name predict-no +)
  19582. (S1 ^operator O2064 +)
  19583. Retracting propose*predict-yes
  19584. -->
  19585. (O2063 ^name predict-yes +)
  19586. (S1 ^operator O2063 +)
  19587. Retracting elaborate*reward*based*on*reward
  19588. -->
  19589. (R1035 ^value 1 +)
  19590. (R1 ^reward R1035 +)
  19591. Retracting elaborate*copy-dir-to-output-link
  19592. -->
  19593. (I3 ^dir L +)
  19594. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  19595. -->
  19596. (S1 ^operator O2064 = 0.6860933424731377)
  19597. Retracting rl*prefer*rvt*predict-no*H0*2
  19598. -->
  19599. (S1 ^operator O2064 = 0.313998974224576)
  19600. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  19601. -->
  19602. (S1 ^operator O2063 = -0.3470159027404986)
  19603. Retracting rl*prefer*rvt*predict-yes*H0*1
  19604. -->
  19605. (S1 ^operator O2063 = 0.3804109904199586)
  19606. =>WM: (14483: S1 ^operator O2066 +)
  19607. =>WM: (14482: S1 ^operator O2065 +)
  19608. =>WM: (14481: O2066 ^name predict-no)
  19609. =>WM: (14480: O2065 ^name predict-yes)
  19610. =>WM: (14479: R1036 ^value 1)
  19611. =>WM: (14478: R1 ^reward R1036)
  19612. =>WM: (14477: I3 ^see 0)
  19613. <=WM: (14468: S1 ^operator O2063 +)
  19614. <=WM: (14469: S1 ^operator O2064 +)
  19615. <=WM: (14470: S1 ^operator O2064)
  19616. <=WM: (14464: R1 ^reward R1035)
  19617. <=WM: (14435: I3 ^see 1)
  19618. <=WM: (14467: O2064 ^name predict-no)
  19619. <=WM: (14466: O2063 ^name predict-yes)
  19620. <=WM: (14465: R1035 ^value 1)
  19621. --- Inner Elaboration Phase, active level 1 (S1) ---
  19622. Firing prefer*rvt*predict-yes*H0
  19623. -->
  19624. Firing rl*prefer*rvt*predict-yes*H0*1
  19625. -->
  19626. (S1 ^operator O2065 = 0.3804109904199586)
  19627. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19628. -->
  19629. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19630. -->
  19631. (S1 ^operator O2065 = -0.3332708974800781)
  19632. Firing prefer*rvt*predict-no*H0
  19633. -->
  19634. Firing rl*prefer*rvt*predict-no*H0*2
  19635. -->
  19636. (S1 ^operator O2066 = 0.313998974224576)
  19637. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19638. -->
  19639. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19640. -->
  19641. (S1 ^operator O2066 = 0.6857963029033564)
  19642. inner elaboration loop at bottom goal.
  19643. Retracting rl*prefer*rvt*predict-no*H0*2
  19644. -->
  19645. (S1 ^operator O2064 = 0.313998974224576)
  19646. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19647. -->
  19648. (S1 ^operator O2064 = 0.6857963029033564)
  19649. Retracting rl*prefer*rvt*predict-yes*H0*1
  19650. -->
  19651. (S1 ^operator O2063 = 0.3804109904199586)
  19652. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19653. -->
  19654. (S1 ^operator O2063 = -0.3332708974800781)
  19655. --- END Proposal Phase ---
  19656. --- Decision Phase ---
  19657. RL update rl*prefer*rvt*predict-no*H0*2 0.485014 -0.171015 0.313999 -> 0.485008 -0.171016 0.313991(R,m,v=1,0.865385,0.117246)
  19658. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515059 0.171034 0.686093 -> 0.515052 0.171032 0.686084(R,m,v=1,1,0)
  19659. =>WM: (14484: S1 ^operator O2066)
  19660. 1033: O: O2066 (predict-no)
  19661. --- END Decision Phase ---
  19662. --- Application Phase ---
  19663. --- Firing Productions (PE) For State At Depth 1 ---
  19664. --- Inner Elaboration Phase, active level 1 (S1) ---
  19665. Firing apply*operator
  19666. -->
  19667. (I3 ^predict-no N1033 + :O )
  19668. Firing apply*operator*complete
  19669. -->
  19670. (I3 ^predict-no N1032 - :O )
  19671. inner elaboration loop at bottom goal.
  19672. --- Change Working Memory (PE) ---
  19673. =>WM: (14485: I3 ^predict-no N1033)
  19674. <=WM: (14472: N1032 ^status complete)
  19675. <=WM: (14471: I3 ^predict-no N1032)
  19676. --- Firing Productions (IE) For State At Depth 1 ---
  19677. --- Inner Elaboration Phase, active level 1 (S1) ---
  19678. Firing monitor*world
  19679. -->
  19680. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19681. --- Change Working Memory (IE) ---
  19682. --- END Application Phase ---
  19683. --- Output Phase ---
  19684. ENV: Agent did: predict-no for direction L in state State-A
  19685. In State-A moving L
  19686. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19687. predict error 0
  19688. dir: dir isL
  19689. --- END Output Phase ---
  19690. \-/--- Input Phase ---
  19691. =>WM: (14489: I2 ^dir L)
  19692. =>WM: (14488: I2 ^reward 1)
  19693. =>WM: (14487: I2 ^see 0)
  19694. =>WM: (14486: N1033 ^status complete)
  19695. <=WM: (14475: I2 ^dir L)
  19696. <=WM: (14474: I2 ^reward 1)
  19697. <=WM: (14473: I2 ^see 0)
  19698. =>WM: (14490: I2 ^level-1 L0-root)
  19699. <=WM: (14476: I2 ^level-1 L0-root)
  19700. --- END Input Phase ---
  19701. --- Proposal Phase ---
  19702. --- Inner Elaboration Phase, active level 1 (S1) ---
  19703. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19704. -->
  19705. (S1 ^operator O2065 = -0.3332708974800781)
  19706. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19707. -->
  19708. (S1 ^operator O2066 = 0.6857963029033564)
  19709. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19710. -->
  19711. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19712. -->
  19713. Firing elaborate*copy-see-to-output-link
  19714. -->
  19715. (I3 ^see 0 +)
  19716. Firing elaborate*reward*based*on*reward
  19717. -->
  19718. (R1037 ^value 1 +)
  19719. (R1 ^reward R1037 +)
  19720. Firing propose*predict-yes
  19721. -->
  19722. (O2067 ^name predict-yes +)
  19723. (S1 ^operator O2067 +)
  19724. Firing propose*predict-no
  19725. -->
  19726. (O2068 ^name predict-no +)
  19727. (S1 ^operator O2068 +)
  19728. Firing rl*prefer*rvt*predict-no*H0*2
  19729. -->
  19730. (S1 ^operator O2066 = 0.3139913445638368)
  19731. Firing rl*prefer*rvt*predict-yes*H0*1
  19732. -->
  19733. (S1 ^operator O2065 = 0.3804109904199586)
  19734. Firing prefer*rvt*predict-yes*H0
  19735. -->
  19736. Firing prefer*rvt*predict-no*H0
  19737. -->
  19738. Firing elaborate*copy-dir-to-output-link
  19739. -->
  19740. (I3 ^dir L +)
  19741. inner elaboration loop at bottom goal.
  19742. Retracting elaborate*copy-see-to-output-link
  19743. -->
  19744. (I3 ^see 0 +)
  19745. Retracting propose*predict-no
  19746. -->
  19747. (O2066 ^name predict-no +)
  19748. (S1 ^operator O2066 +)
  19749. Retracting propose*predict-yes
  19750. -->
  19751. (O2065 ^name predict-yes +)
  19752. (S1 ^operator O2065 +)
  19753. Retracting elaborate*reward*based*on*reward
  19754. -->
  19755. (R1036 ^value 1 +)
  19756. (R1 ^reward R1036 +)
  19757. Retracting elaborate*copy-dir-to-output-link
  19758. -->
  19759. (I3 ^dir L +)
  19760. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19761. -->
  19762. (S1 ^operator O2066 = 0.6857963029033564)
  19763. Retracting rl*prefer*rvt*predict-no*H0*2
  19764. -->
  19765. (S1 ^operator O2066 = 0.3139913445638368)
  19766. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19767. -->
  19768. (S1 ^operator O2065 = -0.3332708974800781)
  19769. Retracting rl*prefer*rvt*predict-yes*H0*1
  19770. -->
  19771. (S1 ^operator O2065 = 0.3804109904199586)
  19772. =>WM: (14496: S1 ^operator O2068 +)
  19773. =>WM: (14495: S1 ^operator O2067 +)
  19774. =>WM: (14494: O2068 ^name predict-no)
  19775. =>WM: (14493: O2067 ^name predict-yes)
  19776. =>WM: (14492: R1037 ^value 1)
  19777. =>WM: (14491: R1 ^reward R1037)
  19778. <=WM: (14482: S1 ^operator O2065 +)
  19779. <=WM: (14483: S1 ^operator O2066 +)
  19780. <=WM: (14484: S1 ^operator O2066)
  19781. <=WM: (14478: R1 ^reward R1036)
  19782. <=WM: (14481: O2066 ^name predict-no)
  19783. <=WM: (14480: O2065 ^name predict-yes)
  19784. <=WM: (14479: R1036 ^value 1)
  19785. --- Inner Elaboration Phase, active level 1 (S1) ---
  19786. Firing prefer*rvt*predict-yes*H0
  19787. -->
  19788. Firing rl*prefer*rvt*predict-yes*H0*1
  19789. -->
  19790. (S1 ^operator O2067 = 0.3804109904199586)
  19791. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19792. -->
  19793. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19794. -->
  19795. (S1 ^operator O2067 = -0.3332708974800781)
  19796. Firing prefer*rvt*predict-no*H0
  19797. -->
  19798. Firing rl*prefer*rvt*predict-no*H0*2
  19799. -->
  19800. (S1 ^operator O2068 = 0.3139913445638368)
  19801. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19802. -->
  19803. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19804. -->
  19805. (S1 ^operator O2068 = 0.6857963029033564)
  19806. inner elaboration loop at bottom goal.
  19807. Retracting rl*prefer*rvt*predict-no*H0*2
  19808. -->
  19809. (S1 ^operator O2066 = 0.3139913445638368)
  19810. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19811. -->
  19812. (S1 ^operator O2066 = 0.6857963029033564)
  19813. Retracting rl*prefer*rvt*predict-yes*H0*1
  19814. -->
  19815. (S1 ^operator O2065 = 0.3804109904199586)
  19816. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19817. -->
  19818. (S1 ^operator O2065 = -0.3332708974800781)
  19819. --- END Proposal Phase ---
  19820. --- Decision Phase ---
  19821. RL update rl*prefer*rvt*predict-no*H0*2 0.485008 -0.171016 0.313991 -> 0.485021 -0.171012 0.314009(R,m,v=1,0.866242,0.11661)
  19822. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514825 0.170972 0.685796 -> 0.514841 0.170976 0.685817(R,m,v=1,1,0)
  19823. =>WM: (14497: S1 ^operator O2068)
  19824. 1034: O: O2068 (predict-no)
  19825. --- END Decision Phase ---
  19826. --- Application Phase ---
  19827. --- Firing Productions (PE) For State At Depth 1 ---
  19828. --- Inner Elaboration Phase, active level 1 (S1) ---
  19829. Firing apply*operator
  19830. -->
  19831. (I3 ^predict-no N1034 + :O )
  19832. Firing apply*operator*complete
  19833. -->
  19834. (I3 ^predict-no N1033 - :O )
  19835. inner elaboration loop at bottom goal.
  19836. --- Change Working Memory (PE) ---
  19837. =>WM: (14498: I3 ^predict-no N1034)
  19838. <=WM: (14486: N1033 ^status complete)
  19839. <=WM: (14485: I3 ^predict-no N1033)
  19840. --- Firing Productions (IE) For State At Depth 1 ---
  19841. --- Inner Elaboration Phase, active level 1 (S1) ---
  19842. Firing monitor*world
  19843. -->
  19844. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19845. --- Change Working Memory (IE) ---
  19846. --- END Application Phase ---
  19847. --- Output Phase ---
  19848. ENV: Agent did: predict-no for direction L in state State-A
  19849. In State-A moving L
  19850. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19851. predict error 0
  19852. dir: dir isL
  19853. --- END Output Phase ---
  19854. |\---- Input Phase ---
  19855. =>WM: (14502: I2 ^dir L)
  19856. =>WM: (14501: I2 ^reward 1)
  19857. =>WM: (14500: I2 ^see 0)
  19858. =>WM: (14499: N1034 ^status complete)
  19859. <=WM: (14489: I2 ^dir L)
  19860. <=WM: (14488: I2 ^reward 1)
  19861. <=WM: (14487: I2 ^see 0)
  19862. =>WM: (14503: I2 ^level-1 L0-root)
  19863. <=WM: (14490: I2 ^level-1 L0-root)
  19864. --- END Input Phase ---
  19865. --- Proposal Phase ---
  19866. --- Inner Elaboration Phase, active level 1 (S1) ---
  19867. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19868. -->
  19869. (S1 ^operator O2067 = -0.3332708974800781)
  19870. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19871. -->
  19872. (S1 ^operator O2068 = 0.6858169471742246)
  19873. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19874. -->
  19875. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19876. -->
  19877. Firing elaborate*copy-see-to-output-link
  19878. -->
  19879. (I3 ^see 0 +)
  19880. Firing elaborate*reward*based*on*reward
  19881. -->
  19882. (R1038 ^value 1 +)
  19883. (R1 ^reward R1038 +)
  19884. Firing propose*predict-yes
  19885. -->
  19886. (O2069 ^name predict-yes +)
  19887. (S1 ^operator O2069 +)
  19888. Firing propose*predict-no
  19889. -->
  19890. (O2070 ^name predict-no +)
  19891. (S1 ^operator O2070 +)
  19892. Firing rl*prefer*rvt*predict-no*H0*2
  19893. -->
  19894. (S1 ^operator O2068 = 0.3140088762608346)
  19895. Firing rl*prefer*rvt*predict-yes*H0*1
  19896. -->
  19897. (S1 ^operator O2067 = 0.3804109904199586)
  19898. Firing prefer*rvt*predict-yes*H0
  19899. -->
  19900. Firing prefer*rvt*predict-no*H0
  19901. -->
  19902. Firing elaborate*copy-dir-to-output-link
  19903. -->
  19904. (I3 ^dir L +)
  19905. inner elaboration loop at bottom goal.
  19906. Retracting elaborate*copy-see-to-output-link
  19907. -->
  19908. (I3 ^see 0 +)
  19909. Retracting propose*predict-no
  19910. -->
  19911. (O2068 ^name predict-no +)
  19912. (S1 ^operator O2068 +)
  19913. Retracting propose*predict-yes
  19914. -->
  19915. (O2067 ^name predict-yes +)
  19916. (S1 ^operator O2067 +)
  19917. Retracting elaborate*reward*based*on*reward
  19918. -->
  19919. (R1037 ^value 1 +)
  19920. (R1 ^reward R1037 +)
  19921. Retracting elaborate*copy-dir-to-output-link
  19922. -->
  19923. (I3 ^dir L +)
  19924. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19925. -->
  19926. (S1 ^operator O2068 = 0.6858169471742246)
  19927. Retracting rl*prefer*rvt*predict-no*H0*2
  19928. -->
  19929. (S1 ^operator O2068 = 0.3140088762608346)
  19930. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19931. -->
  19932. (S1 ^operator O2067 = -0.3332708974800781)
  19933. Retracting rl*prefer*rvt*predict-yes*H0*1
  19934. -->
  19935. (S1 ^operator O2067 = 0.3804109904199586)
  19936. =>WM: (14509: S1 ^operator O2070 +)
  19937. =>WM: (14508: S1 ^operator O2069 +)
  19938. =>WM: (14507: O2070 ^name predict-no)
  19939. =>WM: (14506: O2069 ^name predict-yes)
  19940. =>WM: (14505: R1038 ^value 1)
  19941. =>WM: (14504: R1 ^reward R1038)
  19942. <=WM: (14495: S1 ^operator O2067 +)
  19943. <=WM: (14496: S1 ^operator O2068 +)
  19944. <=WM: (14497: S1 ^operator O2068)
  19945. <=WM: (14491: R1 ^reward R1037)
  19946. <=WM: (14494: O2068 ^name predict-no)
  19947. <=WM: (14493: O2067 ^name predict-yes)
  19948. <=WM: (14492: R1037 ^value 1)
  19949. --- Inner Elaboration Phase, active level 1 (S1) ---
  19950. Firing prefer*rvt*predict-yes*H0
  19951. -->
  19952. Firing rl*prefer*rvt*predict-yes*H0*1
  19953. -->
  19954. (S1 ^operator O2069 = 0.3804109904199586)
  19955. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19956. -->
  19957. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19958. -->
  19959. (S1 ^operator O2069 = -0.3332708974800781)
  19960. Firing prefer*rvt*predict-no*H0
  19961. -->
  19962. Firing rl*prefer*rvt*predict-no*H0*2
  19963. -->
  19964. (S1 ^operator O2070 = 0.3140088762608346)
  19965. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19966. -->
  19967. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19968. -->
  19969. (S1 ^operator O2070 = 0.6858169471742246)
  19970. inner elaboration loop at bottom goal.
  19971. Retracting rl*prefer*rvt*predict-no*H0*2
  19972. -->
  19973. (S1 ^operator O2068 = 0.3140088762608346)
  19974. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  19975. -->
  19976. (S1 ^operator O2068 = 0.6858169471742246)
  19977. Retracting rl*prefer*rvt*predict-yes*H0*1
  19978. -->
  19979. (S1 ^operator O2067 = 0.3804109904199586)
  19980. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  19981. -->
  19982. (S1 ^operator O2067 = -0.3332708974800781)
  19983. --- END Proposal Phase ---
  19984. --- Decision Phase ---
  19985. RL update rl*prefer*rvt*predict-no*H0*2 0.485021 -0.171012 0.314009 -> 0.485033 -0.171009 0.314023(R,m,v=1,0.867089,0.11598)
  19986. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514841 0.170976 0.685817 -> 0.514854 0.170979 0.685834(R,m,v=1,1,0)
  19987. =>WM: (14510: S1 ^operator O2070)
  19988. 1035: O: O2070 (predict-no)
  19989. --- END Decision Phase ---
  19990. --- Application Phase ---
  19991. --- Firing Productions (PE) For State At Depth 1 ---
  19992. --- Inner Elaboration Phase, active level 1 (S1) ---
  19993. Firing apply*operator
  19994. -->
  19995. (I3 ^predict-no N1035 + :O )
  19996. Firing apply*operator*complete
  19997. -->
  19998. (I3 ^predict-no N1034 - :O )
  19999. inner elaboration loop at bottom goal.
  20000. --- Change Working Memory (PE) ---
  20001. =>WM: (14511: I3 ^predict-no N1035)
  20002. <=WM: (14499: N1034 ^status complete)
  20003. <=WM: (14498: I3 ^predict-no N1034)
  20004. --- Firing Productions (IE) For State At Depth 1 ---
  20005. --- Inner Elaboration Phase, active level 1 (S1) ---
  20006. Firing monitor*world
  20007. -->
  20008. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20009. --- Change Working Memory (IE) ---
  20010. --- END Application Phase ---
  20011. --- Output Phase ---
  20012. ENV: Agent did: predict-no for direction L in state State-A
  20013. In State-A moving L
  20014. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20015. predict error 0
  20016. dir: dir isR
  20017. --- END Output Phase ---
  20018. /|--- Input Phase ---
  20019. =>WM: (14515: I2 ^dir R)
  20020. =>WM: (14514: I2 ^reward 1)
  20021. =>WM: (14513: I2 ^see 0)
  20022. =>WM: (14512: N1035 ^status complete)
  20023. <=WM: (14502: I2 ^dir L)
  20024. <=WM: (14501: I2 ^reward 1)
  20025. <=WM: (14500: I2 ^see 0)
  20026. =>WM: (14516: I2 ^level-1 L0-root)
  20027. <=WM: (14503: I2 ^level-1 L0-root)
  20028. --- END Input Phase ---
  20029. --- Proposal Phase ---
  20030. --- Inner Elaboration Phase, active level 1 (S1) ---
  20031. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  20032. -->
  20033. (S1 ^operator O2069 = 0.7056959425110291)
  20034. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  20035. -->
  20036. (S1 ^operator O2070 = -0.2023211881870005)
  20037. Firing prefer*rvt*predict-no*H0*6*v1*H1
  20038. -->
  20039. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20040. -->
  20041. Firing elaborate*copy-see-to-output-link
  20042. -->
  20043. (I3 ^see 0 +)
  20044. Firing elaborate*reward*based*on*reward
  20045. -->
  20046. (R1039 ^value 1 +)
  20047. (R1 ^reward R1039 +)
  20048. Firing propose*predict-yes
  20049. -->
  20050. (O2071 ^name predict-yes +)
  20051. (S1 ^operator O2071 +)
  20052. Firing propose*predict-no
  20053. -->
  20054. (O2072 ^name predict-no +)
  20055. (S1 ^operator O2072 +)
  20056. Firing rl*prefer*rvt*predict-no*H0*6
  20057. -->
  20058. (S1 ^operator O2070 = 0.229864201526749)
  20059. Firing rl*prefer*rvt*predict-yes*H0*5
  20060. -->
  20061. (S1 ^operator O2069 = 0.2939707325508816)
  20062. Firing prefer*rvt*predict-yes*H0
  20063. -->
  20064. Firing prefer*rvt*predict-no*H0
  20065. -->
  20066. Firing elaborate*copy-dir-to-output-link
  20067. -->
  20068. (I3 ^dir R +)
  20069. inner elaboration loop at bottom goal.
  20070. Retracting elaborate*copy-see-to-output-link
  20071. -->
  20072. (I3 ^see 0 +)
  20073. Retracting propose*predict-no
  20074. -->
  20075. (O2070 ^name predict-no +)
  20076. (S1 ^operator O2070 +)
  20077. Retracting propose*predict-yes
  20078. -->
  20079. (O2069 ^name predict-yes +)
  20080. (S1 ^operator O2069 +)
  20081. Retracting elaborate*reward*based*on*reward
  20082. -->
  20083. (R1038 ^value 1 +)
  20084. (R1 ^reward R1038 +)
  20085. Retracting elaborate*copy-dir-to-output-link
  20086. -->
  20087. (I3 ^dir L +)
  20088. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  20089. -->
  20090. (S1 ^operator O2070 = 0.6858338284024019)
  20091. Retracting rl*prefer*rvt*predict-no*H0*2
  20092. -->
  20093. (S1 ^operator O2070 = 0.3140232411131785)
  20094. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  20095. -->
  20096. (S1 ^operator O2069 = -0.3332708974800781)
  20097. Retracting rl*prefer*rvt*predict-yes*H0*1
  20098. -->
  20099. (S1 ^operator O2069 = 0.3804109904199586)
  20100. =>WM: (14523: S1 ^operator O2072 +)
  20101. =>WM: (14522: S1 ^operator O2071 +)
  20102. =>WM: (14521: I3 ^dir R)
  20103. =>WM: (14520: O2072 ^name predict-no)
  20104. =>WM: (14519: O2071 ^name predict-yes)
  20105. =>WM: (14518: R1039 ^value 1)
  20106. =>WM: (14517: R1 ^reward R1039)
  20107. <=WM: (14508: S1 ^operator O2069 +)
  20108. <=WM: (14509: S1 ^operator O2070 +)
  20109. <=WM: (14510: S1 ^operator O2070)
  20110. <=WM: (14454: I3 ^dir L)
  20111. <=WM: (14504: R1 ^reward R1038)
  20112. <=WM: (14507: O2070 ^name predict-no)
  20113. <=WM: (14506: O2069 ^name predict-yes)
  20114. <=WM: (14505: R1038 ^value 1)
  20115. --- Inner Elaboration Phase, active level 1 (S1) ---
  20116. Firing prefer*rvt*predict-yes*H0
  20117. -->
  20118. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  20119. -->
  20120. (S1 ^operator O2071 = 0.7056959425110291)
  20121. Firing rl*prefer*rvt*predict-yes*H0*5
  20122. -->
  20123. (S1 ^operator O2071 = 0.2939707325508816)
  20124. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20125. -->
  20126. Firing prefer*rvt*predict-no*H0
  20127. -->
  20128. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  20129. -->
  20130. (S1 ^operator O2072 = -0.2023211881870005)
  20131. Firing rl*prefer*rvt*predict-no*H0*6
  20132. -->
  20133. (S1 ^operator O2072 = 0.229864201526749)
  20134. Firing prefer*rvt*predict-no*H0*6*v1*H1
  20135. -->
  20136. inner elaboration loop at bottom goal.
  20137. Retracting rl*prefer*rvt*predict-no*H0*6
  20138. -->
  20139. (S1 ^operator O2070 = 0.229864201526749)
  20140. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  20141. -->
  20142. (S1 ^operator O2070 = -0.2023211881870005)
  20143. Retracting rl*prefer*rvt*predict-yes*H0*5
  20144. -->
  20145. (S1 ^operator O2069 = 0.2939707325508816)
  20146. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  20147. -->
  20148. (S1 ^operator O2069 = 0.7056959425110291)
  20149. --- END Proposal Phase ---
  20150. --- Decision Phase ---
  20151. RL update rl*prefer*rvt*predict-no*H0*2 0.485033 -0.171009 0.314023 -> 0.485042 -0.171007 0.314035(R,m,v=1,0.867925,0.115357)
  20152. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514854 0.170979 0.685834 -> 0.514865 0.170982 0.685848(R,m,v=1,1,0)
  20153. =>WM: (14524: S1 ^operator O2071)
  20154. 1036: O: O2071 (predict-yes)
  20155. --- END Decision Phase ---
  20156. --- Application Phase ---
  20157. --- Firing Productions (PE) For State At Depth 1 ---
  20158. --- Inner Elaboration Phase, active level 1 (S1) ---
  20159. Firing apply*operator
  20160. -->
  20161. (I3 ^predict-yes N1036 + :O )
  20162. Firing apply*operator*complete
  20163. -->
  20164. (I3 ^predict-no N1035 - :O )
  20165. inner elaboration loop at bottom goal.
  20166. --- Change Working Memory (PE) ---
  20167. =>WM: (14525: I3 ^predict-yes N1036)
  20168. <=WM: (14512: N1035 ^status complete)
  20169. <=WM: (14511: I3 ^predict-no N1035)
  20170. --- Firing Productions (IE) For State At Depth 1 ---
  20171. --- Inner Elaboration Phase, active level 1 (S1) ---
  20172. Firing monitor*world
  20173. -->
  20174. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20175. --- Change Working Memory (IE) ---
  20176. --- END Application Phase ---
  20177. --- Output Phase ---
  20178. ENV: Agent did: predict-yes for direction R in state State-A
  20179. In State-A moving R
  20180. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20181. predict error 0
  20182. dir: dir isU
  20183. --- END Output Phase ---
  20184. \-/--- Input Phase ---
  20185. =>WM: (14529: I2 ^dir U)
  20186. =>WM: (14528: I2 ^reward 1)
  20187. =>WM: (14527: I2 ^see 1)
  20188. =>WM: (14526: N1036 ^status complete)
  20189. <=WM: (14515: I2 ^dir R)
  20190. <=WM: (14514: I2 ^reward 1)
  20191. <=WM: (14513: I2 ^see 0)
  20192. =>WM: (14530: I2 ^level-1 R1-root)
  20193. <=WM: (14516: I2 ^level-1 L0-root)
  20194. --- END Input Phase ---
  20195. --- Proposal Phase ---
  20196. --- Inner Elaboration Phase, active level 1 (S1) ---
  20197. Firing elaborate*copy-see-to-output-link
  20198. -->
  20199. (I3 ^see 1 +)
  20200. Firing elaborate*reward*based*on*reward
  20201. -->
  20202. (R1040 ^value 1 +)
  20203. (R1 ^reward R1040 +)
  20204. Firing propose*predict-yes
  20205. -->
  20206. (O2073 ^name predict-yes +)
  20207. (S1 ^operator O2073 +)
  20208. Firing propose*predict-no
  20209. -->
  20210. (O2074 ^name predict-no +)
  20211. (S1 ^operator O2074 +)
  20212. Firing rl*prefer*rvt*predict-no*H0*4
  20213. -->
  20214. (S1 ^operator O2072 = 1.)
  20215. Firing rl*prefer*rvt*predict-yes*H0*3
  20216. -->
  20217. (S1 ^operator O2071 = 0.)
  20218. Firing prefer*rvt*predict-yes*H0
  20219. -->
  20220. Firing prefer*rvt*predict-no*H0
  20221. -->
  20222. Firing elaborate*copy-dir-to-output-link
  20223. -->
  20224. (I3 ^dir U +)
  20225. inner elaboration loop at bottom goal.
  20226. Retracting elaborate*copy-see-to-output-link
  20227. -->
  20228. (I3 ^see 0 +)
  20229. Retracting propose*predict-no
  20230. -->
  20231. (O2072 ^name predict-no +)
  20232. (S1 ^operator O2072 +)
  20233. Retracting propose*predict-yes
  20234. -->
  20235. (O2071 ^name predict-yes +)
  20236. (S1 ^operator O2071 +)
  20237. Retracting elaborate*reward*based*on*reward
  20238. -->
  20239. (R1039 ^value 1 +)
  20240. (R1 ^reward R1039 +)
  20241. Retracting elaborate*copy-dir-to-output-link
  20242. -->
  20243. (I3 ^dir R +)
  20244. Retracting rl*prefer*rvt*predict-no*H0*6
  20245. -->
  20246. (S1 ^operator O2072 = 0.229864201526749)
  20247. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  20248. -->
  20249. (S1 ^operator O2072 = -0.2023211881870005)
  20250. Retracting rl*prefer*rvt*predict-yes*H0*5
  20251. -->
  20252. (S1 ^operator O2071 = 0.2939707325508816)
  20253. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  20254. -->
  20255. (S1 ^operator O2071 = 0.7056959425110291)
  20256. =>WM: (14538: S1 ^operator O2074 +)
  20257. =>WM: (14537: S1 ^operator O2073 +)
  20258. =>WM: (14536: I3 ^dir U)
  20259. =>WM: (14535: O2074 ^name predict-no)
  20260. =>WM: (14534: O2073 ^name predict-yes)
  20261. =>WM: (14533: R1040 ^value 1)
  20262. =>WM: (14532: R1 ^reward R1040)
  20263. =>WM: (14531: I3 ^see 1)
  20264. <=WM: (14522: S1 ^operator O2071 +)
  20265. <=WM: (14524: S1 ^operator O2071)
  20266. <=WM: (14523: S1 ^operator O2072 +)
  20267. <=WM: (14521: I3 ^dir R)
  20268. <=WM: (14517: R1 ^reward R1039)
  20269. <=WM: (14477: I3 ^see 0)
  20270. <=WM: (14520: O2072 ^name predict-no)
  20271. <=WM: (14519: O2071 ^name predict-yes)
  20272. <=WM: (14518: R1039 ^value 1)
  20273. --- Inner Elaboration Phase, active level 1 (S1) ---
  20274. Firing prefer*rvt*predict-yes*H0
  20275. -->
  20276. Firing rl*prefer*rvt*predict-yes*H0*3
  20277. -->
  20278. (S1 ^operator O2073 = 0.)
  20279. Firing prefer*rvt*predict-no*H0
  20280. -->
  20281. Firing rl*prefer*rvt*predict-no*H0*4
  20282. -->
  20283. (S1 ^operator O2074 = 1.)
  20284. inner elaboration loop at bottom goal.
  20285. Retracting rl*prefer*rvt*predict-no*H0*4
  20286. -->
  20287. (S1 ^operator O2072 = 1.)
  20288. Retracting rl*prefer*rvt*predict-yes*H0*3
  20289. -->
  20290. (S1 ^operator O2071 = 0.)
  20291. --- END Proposal Phase ---
  20292. --- Decision Phase ---
  20293. RL update rl*prefer*rvt*predict-yes*H0*5 0.501047 -0.207077 0.293971 -> 0.501072 -0.207074 0.293998(R,m,v=1,0.852761,0.126335)
  20294. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498651 0.207045 0.705696 -> 0.49868 0.207048 0.705728(R,m,v=1,1,0)
  20295. =>WM: (14539: S1 ^operator O2074)
  20296. 1037: O: O2074 (predict-no)
  20297. --- END Decision Phase ---
  20298. --- Application Phase ---
  20299. --- Firing Productions (PE) For State At Depth 1 ---
  20300. --- Inner Elaboration Phase, active level 1 (S1) ---
  20301. Firing apply*operator
  20302. -->
  20303. (I3 ^predict-no N1037 + :O )
  20304. Firing apply*operator*complete
  20305. -->
  20306. (I3 ^predict-yes N1036 - :O )
  20307. inner elaboration loop at bottom goal.
  20308. --- Change Working Memory (PE) ---
  20309. =>WM: (14540: I3 ^predict-no N1037)
  20310. <=WM: (14526: N1036 ^status complete)
  20311. <=WM: (14525: I3 ^predict-yes N1036)
  20312. --- Firing Productions (IE) For State At Depth 1 ---
  20313. --- Inner Elaboration Phase, active level 1 (S1) ---
  20314. Firing monitor*world
  20315. -->
  20316. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20317. --- Change Working Memory (IE) ---
  20318. --- END Application Phase ---
  20319. --- Output Phase ---
  20320. ENV: Agent did: predict-no for direction U in state State-B
  20321. In State-B moving U
  20322. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20323. predict error 0
  20324. dir: dir isR
  20325. --- END Output Phase ---
  20326. |\---- Input Phase ---
  20327. =>WM: (14544: I2 ^dir R)
  20328. =>WM: (14543: I2 ^reward 1)
  20329. =>WM: (14542: I2 ^see 0)
  20330. =>WM: (14541: N1037 ^status complete)
  20331. <=WM: (14529: I2 ^dir U)
  20332. <=WM: (14528: I2 ^reward 1)
  20333. <=WM: (14527: I2 ^see 1)
  20334. =>WM: (14545: I2 ^level-1 R1-root)
  20335. <=WM: (14530: I2 ^level-1 R1-root)
  20336. --- END Input Phase ---
  20337. --- Proposal Phase ---
  20338. --- Inner Elaboration Phase, active level 1 (S1) ---
  20339. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20340. -->
  20341. (S1 ^operator O2073 = -0.252585164213872)
  20342. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  20343. -->
  20344. (S1 ^operator O2074 = 0.7701697371568763)
  20345. Firing prefer*rvt*predict-no*H0*6*v1*H1
  20346. -->
  20347. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20348. -->
  20349. Firing elaborate*copy-see-to-output-link
  20350. -->
  20351. (I3 ^see 0 +)
  20352. Firing elaborate*reward*based*on*reward
  20353. -->
  20354. (R1041 ^value 1 +)
  20355. (R1 ^reward R1041 +)
  20356. Firing propose*predict-yes
  20357. -->
  20358. (O2075 ^name predict-yes +)
  20359. (S1 ^operator O2075 +)
  20360. Firing propose*predict-no
  20361. -->
  20362. (O2076 ^name predict-no +)
  20363. (S1 ^operator O2076 +)
  20364. Firing rl*prefer*rvt*predict-no*H0*6
  20365. -->
  20366. (S1 ^operator O2074 = 0.229864201526749)
  20367. Firing rl*prefer*rvt*predict-yes*H0*5
  20368. -->
  20369. (S1 ^operator O2073 = 0.2939980822884902)
  20370. Firing prefer*rvt*predict-yes*H0
  20371. -->
  20372. Firing prefer*rvt*predict-no*H0
  20373. -->
  20374. Firing elaborate*copy-dir-to-output-link
  20375. -->
  20376. (I3 ^dir R +)
  20377. inner elaboration loop at bottom goal.
  20378. Retracting elaborate*copy-see-to-output-link
  20379. -->
  20380. (I3 ^see 1 +)
  20381. Retracting propose*predict-no
  20382. -->
  20383. (O2074 ^name predict-no +)
  20384. (S1 ^operator O2074 +)
  20385. Retracting propose*predict-yes
  20386. -->
  20387. (O2073 ^name predict-yes +)
  20388. (S1 ^operator O2073 +)
  20389. Retracting elaborate*reward*based*on*reward
  20390. -->
  20391. (R1040 ^value 1 +)
  20392. (R1 ^reward R1040 +)
  20393. Retracting elaborate*copy-dir-to-output-link
  20394. -->
  20395. (I3 ^dir U +)
  20396. Retracting rl*prefer*rvt*predict-no*H0*4
  20397. -->
  20398. (S1 ^operator O2074 = 1.)
  20399. Retracting rl*prefer*rvt*predict-yes*H0*3
  20400. -->
  20401. (S1 ^operator O2073 = 0.)
  20402. =>WM: (14553: S1 ^operator O2076 +)
  20403. =>WM: (14552: S1 ^operator O2075 +)
  20404. =>WM: (14551: I3 ^dir R)
  20405. =>WM: (14550: O2076 ^name predict-no)
  20406. =>WM: (14549: O2075 ^name predict-yes)
  20407. =>WM: (14548: R1041 ^value 1)
  20408. =>WM: (14547: R1 ^reward R1041)
  20409. =>WM: (14546: I3 ^see 0)
  20410. <=WM: (14537: S1 ^operator O2073 +)
  20411. <=WM: (14538: S1 ^operator O2074 +)
  20412. <=WM: (14539: S1 ^operator O2074)
  20413. <=WM: (14536: I3 ^dir U)
  20414. <=WM: (14532: R1 ^reward R1040)
  20415. <=WM: (14531: I3 ^see 1)
  20416. <=WM: (14535: O2074 ^name predict-no)
  20417. <=WM: (14534: O2073 ^name predict-yes)
  20418. <=WM: (14533: R1040 ^value 1)
  20419. --- Inner Elaboration Phase, active level 1 (S1) ---
  20420. Firing prefer*rvt*predict-yes*H0
  20421. -->
  20422. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20423. -->
  20424. (S1 ^operator O2075 = -0.252585164213872)
  20425. Firing rl*prefer*rvt*predict-yes*H0*5
  20426. -->
  20427. (S1 ^operator O2075 = 0.2939980822884902)
  20428. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20429. -->
  20430. Firing prefer*rvt*predict-no*H0
  20431. -->
  20432. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  20433. -->
  20434. (S1 ^operator O2076 = 0.7701697371568763)
  20435. Firing rl*prefer*rvt*predict-no*H0*6
  20436. -->
  20437. (S1 ^operator O2076 = 0.229864201526749)
  20438. Firing prefer*rvt*predict-no*H0*6*v1*H1
  20439. -->
  20440. inner elaboration loop at bottom goal.
  20441. Retracting rl*prefer*rvt*predict-no*H0*6
  20442. -->
  20443. (S1 ^operator O2074 = 0.229864201526749)
  20444. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  20445. -->
  20446. (S1 ^operator O2074 = 0.7701697371568763)
  20447. Retracting rl*prefer*rvt*predict-yes*H0*5
  20448. -->
  20449. (S1 ^operator O2073 = 0.2939980822884902)
  20450. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20451. -->
  20452. (S1 ^operator O2073 = -0.252585164213872)
  20453. --- END Proposal Phase ---
  20454. --- Decision Phase ---
  20455. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20456. =>WM: (14554: S1 ^operator O2076)
  20457. 1038: O: O2076 (predict-no)
  20458. --- END Decision Phase ---
  20459. --- Application Phase ---
  20460. --- Firing Productions (PE) For State At Depth 1 ---
  20461. --- Inner Elaboration Phase, active level 1 (S1) ---
  20462. Firing apply*operator
  20463. -->
  20464. (I3 ^predict-no N1038 + :O )
  20465. Firing apply*operator*complete
  20466. -->
  20467. (I3 ^predict-no N1037 - :O )
  20468. inner elaboration loop at bottom goal.
  20469. --- Change Working Memory (PE) ---
  20470. =>WM: (14555: I3 ^predict-no N1038)
  20471. <=WM: (14541: N1037 ^status complete)
  20472. <=WM: (14540: I3 ^predict-no N1037)
  20473. --- Firing Productions (IE) For State At Depth 1 ---
  20474. --- Inner Elaboration Phase, active level 1 (S1) ---
  20475. Firing monitor*world
  20476. -->
  20477. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20478. --- Change Working Memory (IE) ---
  20479. --- END Application Phase ---
  20480. --- Output Phase ---
  20481. ENV: Agent did: predict-no for direction R in state State-B
  20482. In State-B moving R
  20483. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20484. predict error 0
  20485. dir: dir isU
  20486. --- END Output Phase ---
  20487. /|--- Input Phase ---
  20488. =>WM: (14559: I2 ^dir U)
  20489. =>WM: (14558: I2 ^reward 1)
  20490. =>WM: (14557: I2 ^see 0)
  20491. =>WM: (14556: N1038 ^status complete)
  20492. <=WM: (14544: I2 ^dir R)
  20493. <=WM: (14543: I2 ^reward 1)
  20494. <=WM: (14542: I2 ^see 0)
  20495. =>WM: (14560: I2 ^level-1 R0-root)
  20496. <=WM: (14545: I2 ^level-1 R1-root)
  20497. --- END Input Phase ---
  20498. --- Proposal Phase ---
  20499. --- Inner Elaboration Phase, active level 1 (S1) ---
  20500. Firing elaborate*copy-see-to-output-link
  20501. -->
  20502. (I3 ^see 0 +)
  20503. Firing elaborate*reward*based*on*reward
  20504. -->
  20505. (R1042 ^value 1 +)
  20506. (R1 ^reward R1042 +)
  20507. Firing propose*predict-yes
  20508. -->
  20509. (O2077 ^name predict-yes +)
  20510. (S1 ^operator O2077 +)
  20511. Firing propose*predict-no
  20512. -->
  20513. (O2078 ^name predict-no +)
  20514. (S1 ^operator O2078 +)
  20515. Firing rl*prefer*rvt*predict-no*H0*4
  20516. -->
  20517. (S1 ^operator O2076 = 1.)
  20518. Firing rl*prefer*rvt*predict-yes*H0*3
  20519. -->
  20520. (S1 ^operator O2075 = 0.)
  20521. Firing prefer*rvt*predict-yes*H0
  20522. -->
  20523. Firing prefer*rvt*predict-no*H0
  20524. -->
  20525. Firing elaborate*copy-dir-to-output-link
  20526. -->
  20527. (I3 ^dir U +)
  20528. inner elaboration loop at bottom goal.
  20529. Retracting elaborate*copy-see-to-output-link
  20530. -->
  20531. (I3 ^see 0 +)
  20532. Retracting propose*predict-no
  20533. -->
  20534. (O2076 ^name predict-no +)
  20535. (S1 ^operator O2076 +)
  20536. Retracting propose*predict-yes
  20537. -->
  20538. (O2075 ^name predict-yes +)
  20539. (S1 ^operator O2075 +)
  20540. Retracting elaborate*reward*based*on*reward
  20541. -->
  20542. (R1041 ^value 1 +)
  20543. (R1 ^reward R1041 +)
  20544. Retracting elaborate*copy-dir-to-output-link
  20545. -->
  20546. (I3 ^dir R +)
  20547. Retracting rl*prefer*rvt*predict-no*H0*6
  20548. -->
  20549. (S1 ^operator O2076 = 0.229864201526749)
  20550. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  20551. -->
  20552. (S1 ^operator O2076 = 0.7701697371568763)
  20553. Retracting rl*prefer*rvt*predict-yes*H0*5
  20554. -->
  20555. (S1 ^operator O2075 = 0.2939980822884902)
  20556. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20557. -->
  20558. (S1 ^operator O2075 = -0.252585164213872)
  20559. =>WM: (14567: S1 ^operator O2078 +)
  20560. =>WM: (14566: S1 ^operator O2077 +)
  20561. =>WM: (14565: I3 ^dir U)
  20562. =>WM: (14564: O2078 ^name predict-no)
  20563. =>WM: (14563: O2077 ^name predict-yes)
  20564. =>WM: (14562: R1042 ^value 1)
  20565. =>WM: (14561: R1 ^reward R1042)
  20566. <=WM: (14552: S1 ^operator O2075 +)
  20567. <=WM: (14553: S1 ^operator O2076 +)
  20568. <=WM: (14554: S1 ^operator O2076)
  20569. <=WM: (14551: I3 ^dir R)
  20570. <=WM: (14547: R1 ^reward R1041)
  20571. <=WM: (14550: O2076 ^name predict-no)
  20572. <=WM: (14549: O2075 ^name predict-yes)
  20573. <=WM: (14548: R1041 ^value 1)
  20574. --- Inner Elaboration Phase, active level 1 (S1) ---
  20575. Firing prefer*rvt*predict-yes*H0
  20576. -->
  20577. Firing rl*prefer*rvt*predict-yes*H0*3
  20578. -->
  20579. (S1 ^operator O2077 = 0.)
  20580. Firing prefer*rvt*predict-no*H0
  20581. -->
  20582. Firing rl*prefer*rvt*predict-no*H0*4
  20583. -->
  20584. (S1 ^operator O2078 = 1.)
  20585. inner elaboration loop at bottom goal.
  20586. Retracting rl*prefer*rvt*predict-no*H0*4
  20587. -->
  20588. (S1 ^operator O2076 = 1.)
  20589. Retracting rl*prefer*rvt*predict-yes*H0*3
  20590. -->
  20591. (S1 ^operator O2075 = 0.)
  20592. --- END Proposal Phase ---
  20593. --- Decision Phase ---
  20594. RL update rl*prefer*rvt*predict-no*H0*6 0.611915 -0.382051 0.229864 -> 0.611913 -0.382052 0.229861(R,m,v=1,0.851648,0.127041)
  20595. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388112 0.382058 0.77017 -> 0.388109 0.382057 0.770166(R,m,v=1,1,0)
  20596. =>WM: (14568: S1 ^operator O2078)
  20597. 1039: O: O2078 (predict-no)
  20598. --- END Decision Phase ---
  20599. --- Application Phase ---
  20600. --- Firing Productions (PE) For State At Depth 1 ---
  20601. --- Inner Elaboration Phase, active level 1 (S1) ---
  20602. Firing apply*operator
  20603. -->
  20604. (I3 ^predict-no N1039 + :O )
  20605. Firing apply*operator*complete
  20606. -->
  20607. (I3 ^predict-no N1038 - :O )
  20608. inner elaboration loop at bottom goal.
  20609. --- Change Working Memory (PE) ---
  20610. =>WM: (14569: I3 ^predict-no N1039)
  20611. <=WM: (14556: N1038 ^status complete)
  20612. <=WM: (14555: I3 ^predict-no N1038)
  20613. --- Firing Productions (IE) For State At Depth 1 ---
  20614. --- Inner Elaboration Phase, active level 1 (S1) ---
  20615. Firing monitor*world
  20616. -->
  20617. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20618. --- Change Working Memory (IE) ---
  20619. --- END Application Phase ---
  20620. --- Output Phase ---
  20621. ENV: Agent did: predict-no for direction U in state State-B
  20622. In State-B moving U
  20623. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20624. predict error 0
  20625. dir: dir isL
  20626. --- END Output Phase ---
  20627. \-/--- Input Phase ---
  20628. =>WM: (14573: I2 ^dir L)
  20629. =>WM: (14572: I2 ^reward 1)
  20630. =>WM: (14571: I2 ^see 0)
  20631. =>WM: (14570: N1039 ^status complete)
  20632. <=WM: (14559: I2 ^dir U)
  20633. <=WM: (14558: I2 ^reward 1)
  20634. <=WM: (14557: I2 ^see 0)
  20635. =>WM: (14574: I2 ^level-1 R0-root)
  20636. <=WM: (14560: I2 ^level-1 R0-root)
  20637. --- END Input Phase ---
  20638. --- Proposal Phase ---
  20639. --- Inner Elaboration Phase, active level 1 (S1) ---
  20640. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  20641. -->
  20642. (S1 ^operator O2077 = 0.6195718054949008)
  20643. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  20644. -->
  20645. (S1 ^operator O2078 = -0.2190661556260421)
  20646. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20647. -->
  20648. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20649. -->
  20650. Firing elaborate*copy-see-to-output-link
  20651. -->
  20652. (I3 ^see 0 +)
  20653. Firing elaborate*reward*based*on*reward
  20654. -->
  20655. (R1043 ^value 1 +)
  20656. (R1 ^reward R1043 +)
  20657. Firing propose*predict-yes
  20658. -->
  20659. (O2079 ^name predict-yes +)
  20660. (S1 ^operator O2079 +)
  20661. Firing propose*predict-no
  20662. -->
  20663. (O2080 ^name predict-no +)
  20664. (S1 ^operator O2080 +)
  20665. Firing rl*prefer*rvt*predict-no*H0*2
  20666. -->
  20667. (S1 ^operator O2078 = 0.3140350167550124)
  20668. Firing rl*prefer*rvt*predict-yes*H0*1
  20669. -->
  20670. (S1 ^operator O2077 = 0.3804109904199586)
  20671. Firing prefer*rvt*predict-yes*H0
  20672. -->
  20673. Firing prefer*rvt*predict-no*H0
  20674. -->
  20675. Firing elaborate*copy-dir-to-output-link
  20676. -->
  20677. (I3 ^dir L +)
  20678. inner elaboration loop at bottom goal.
  20679. Retracting elaborate*copy-see-to-output-link
  20680. -->
  20681. (I3 ^see 0 +)
  20682. Retracting propose*predict-no
  20683. -->
  20684. (O2078 ^name predict-no +)
  20685. (S1 ^operator O2078 +)
  20686. Retracting propose*predict-yes
  20687. -->
  20688. (O2077 ^name predict-yes +)
  20689. (S1 ^operator O2077 +)
  20690. Retracting elaborate*reward*based*on*reward
  20691. -->
  20692. (R1042 ^value 1 +)
  20693. (R1 ^reward R1042 +)
  20694. Retracting elaborate*copy-dir-to-output-link
  20695. -->
  20696. (I3 ^dir U +)
  20697. Retracting rl*prefer*rvt*predict-no*H0*4
  20698. -->
  20699. (S1 ^operator O2078 = 1.)
  20700. Retracting rl*prefer*rvt*predict-yes*H0*3
  20701. -->
  20702. (S1 ^operator O2077 = 0.)
  20703. =>WM: (14581: S1 ^operator O2080 +)
  20704. =>WM: (14580: S1 ^operator O2079 +)
  20705. =>WM: (14579: I3 ^dir L)
  20706. =>WM: (14578: O2080 ^name predict-no)
  20707. =>WM: (14577: O2079 ^name predict-yes)
  20708. =>WM: (14576: R1043 ^value 1)
  20709. =>WM: (14575: R1 ^reward R1043)
  20710. <=WM: (14566: S1 ^operator O2077 +)
  20711. <=WM: (14567: S1 ^operator O2078 +)
  20712. <=WM: (14568: S1 ^operator O2078)
  20713. <=WM: (14565: I3 ^dir U)
  20714. <=WM: (14561: R1 ^reward R1042)
  20715. <=WM: (14564: O2078 ^name predict-no)
  20716. <=WM: (14563: O2077 ^name predict-yes)
  20717. <=WM: (14562: R1042 ^value 1)
  20718. --- Inner Elaboration Phase, active level 1 (S1) ---
  20719. Firing prefer*rvt*predict-yes*H0
  20720. -->
  20721. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  20722. -->
  20723. (S1 ^operator O2079 = 0.6195718054949008)
  20724. Firing rl*prefer*rvt*predict-yes*H0*1
  20725. -->
  20726. (S1 ^operator O2079 = 0.3804109904199586)
  20727. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20728. -->
  20729. Firing prefer*rvt*predict-no*H0
  20730. -->
  20731. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  20732. -->
  20733. (S1 ^operator O2080 = -0.2190661556260421)
  20734. Firing rl*prefer*rvt*predict-no*H0*2
  20735. -->
  20736. (S1 ^operator O2080 = 0.3140350167550124)
  20737. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20738. -->
  20739. inner elaboration loop at bottom goal.
  20740. Retracting rl*prefer*rvt*predict-no*H0*2
  20741. -->
  20742. (S1 ^operator O2078 = 0.3140350167550124)
  20743. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  20744. -->
  20745. (S1 ^operator O2078 = -0.2190661556260421)
  20746. Retracting rl*prefer*rvt*predict-yes*H0*1
  20747. -->
  20748. (S1 ^operator O2077 = 0.3804109904199586)
  20749. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  20750. -->
  20751. (S1 ^operator O2077 = 0.6195718054949008)
  20752. --- END Proposal Phase ---
  20753. --- Decision Phase ---
  20754. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20755. =>WM: (14582: S1 ^operator O2079)
  20756. 1040: O: O2079 (predict-yes)
  20757. --- END Decision Phase ---
  20758. --- Application Phase ---
  20759. --- Firing Productions (PE) For State At Depth 1 ---
  20760. --- Inner Elaboration Phase, active level 1 (S1) ---
  20761. Firing apply*operator
  20762. -->
  20763. (I3 ^predict-yes N1040 + :O )
  20764. Firing apply*operator*complete
  20765. -->
  20766. (I3 ^predict-no N1039 - :O )
  20767. inner elaboration loop at bottom goal.
  20768. --- Change Working Memory (PE) ---
  20769. =>WM: (14583: I3 ^predict-yes N1040)
  20770. <=WM: (14570: N1039 ^status complete)
  20771. <=WM: (14569: I3 ^predict-no N1039)
  20772. --- Firing Productions (IE) For State At Depth 1 ---
  20773. --- Inner Elaboration Phase, active level 1 (S1) ---
  20774. Firing monitor*world
  20775. -->
  20776. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20777. --- Change Working Memory (IE) ---
  20778. --- END Application Phase ---
  20779. --- Output Phase ---
  20780. ENV: Agent did: predict-yes for direction L in state State-B
  20781. In State-B moving L
  20782. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  20783. predict error 0
  20784. dir: dir isL
  20785. --- END Output Phase ---
  20786. |\--- Input Phase ---
  20787. =>WM: (14587: I2 ^dir L)
  20788. =>WM: (14586: I2 ^reward 1)
  20789. =>WM: (14585: I2 ^see 1)
  20790. =>WM: (14584: N1040 ^status complete)
  20791. <=WM: (14573: I2 ^dir L)
  20792. <=WM: (14572: I2 ^reward 1)
  20793. <=WM: (14571: I2 ^see 0)
  20794. =>WM: (14588: I2 ^level-1 L1-root)
  20795. <=WM: (14574: I2 ^level-1 R0-root)
  20796. --- END Input Phase ---
  20797. --- Proposal Phase ---
  20798. --- Inner Elaboration Phase, active level 1 (S1) ---
  20799. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  20800. -->
  20801. (S1 ^operator O2079 = -0.3470159027404986)
  20802. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  20803. -->
  20804. (S1 ^operator O2080 = 0.686084421929226)
  20805. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20806. -->
  20807. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20808. -->
  20809. Firing elaborate*copy-see-to-output-link
  20810. -->
  20811. (I3 ^see 1 +)
  20812. Firing elaborate*reward*based*on*reward
  20813. -->
  20814. (R1044 ^value 1 +)
  20815. (R1 ^reward R1044 +)
  20816. Firing propose*predict-yes
  20817. -->
  20818. (O2081 ^name predict-yes +)
  20819. (S1 ^operator O2081 +)
  20820. Firing propose*predict-no
  20821. -->
  20822. (O2082 ^name predict-no +)
  20823. (S1 ^operator O2082 +)
  20824. Firing rl*prefer*rvt*predict-no*H0*2
  20825. -->
  20826. (S1 ^operator O2080 = 0.3140350167550124)
  20827. Firing rl*prefer*rvt*predict-yes*H0*1
  20828. -->
  20829. (S1 ^operator O2079 = 0.3804109904199586)
  20830. Firing prefer*rvt*predict-yes*H0
  20831. -->
  20832. Firing prefer*rvt*predict-no*H0
  20833. -->
  20834. Firing elaborate*copy-dir-to-output-link
  20835. -->
  20836. (I3 ^dir L +)
  20837. inner elaboration loop at bottom goal.
  20838. Retracting elaborate*copy-see-to-output-link
  20839. -->
  20840. (I3 ^see 0 +)
  20841. Retracting propose*predict-no
  20842. -->
  20843. (O2080 ^name predict-no +)
  20844. (S1 ^operator O2080 +)
  20845. Retracting propose*predict-yes
  20846. -->
  20847. (O2079 ^name predict-yes +)
  20848. (S1 ^operator O2079 +)
  20849. Retracting elaborate*reward*based*on*reward
  20850. -->
  20851. (R1043 ^value 1 +)
  20852. (R1 ^reward R1043 +)
  20853. Retracting elaborate*copy-dir-to-output-link
  20854. -->
  20855. (I3 ^dir L +)
  20856. Retracting rl*prefer*rvt*predict-no*H0*2
  20857. -->
  20858. (S1 ^operator O2080 = 0.3140350167550124)
  20859. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  20860. -->
  20861. (S1 ^operator O2080 = -0.2190661556260421)
  20862. Retracting rl*prefer*rvt*predict-yes*H0*1
  20863. -->
  20864. (S1 ^operator O2079 = 0.3804109904199586)
  20865. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  20866. -->
  20867. (S1 ^operator O2079 = 0.6195718054949008)
  20868. =>WM: (14595: S1 ^operator O2082 +)
  20869. =>WM: (14594: S1 ^operator O2081 +)
  20870. =>WM: (14593: O2082 ^name predict-no)
  20871. =>WM: (14592: O2081 ^name predict-yes)
  20872. =>WM: (14591: R1044 ^value 1)
  20873. =>WM: (14590: R1 ^reward R1044)
  20874. =>WM: (14589: I3 ^see 1)
  20875. <=WM: (14580: S1 ^operator O2079 +)
  20876. <=WM: (14582: S1 ^operator O2079)
  20877. <=WM: (14581: S1 ^operator O2080 +)
  20878. <=WM: (14575: R1 ^reward R1043)
  20879. <=WM: (14546: I3 ^see 0)
  20880. <=WM: (14578: O2080 ^name predict-no)
  20881. <=WM: (14577: O2079 ^name predict-yes)
  20882. <=WM: (14576: R1043 ^value 1)
  20883. --- Inner Elaboration Phase, active level 1 (S1) ---
  20884. Firing prefer*rvt*predict-yes*H0
  20885. -->
  20886. Firing rl*prefer*rvt*predict-yes*H0*1
  20887. -->
  20888. (S1 ^operator O2081 = 0.3804109904199586)
  20889. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20890. -->
  20891. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  20892. -->
  20893. (S1 ^operator O2081 = -0.3470159027404986)
  20894. Firing prefer*rvt*predict-no*H0
  20895. -->
  20896. Firing rl*prefer*rvt*predict-no*H0*2
  20897. -->
  20898. (S1 ^operator O2082 = 0.3140350167550124)
  20899. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20900. -->
  20901. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  20902. -->
  20903. (S1 ^operator O2082 = 0.686084421929226)
  20904. inner elaboration loop at bottom goal.
  20905. Retracting rl*prefer*rvt*predict-no*H0*2
  20906. -->
  20907. (S1 ^operator O2080 = 0.3140350167550124)
  20908. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  20909. -->
  20910. (S1 ^operator O2080 = 0.686084421929226)
  20911. Retracting rl*prefer*rvt*predict-yes*H0*1
  20912. -->
  20913. (S1 ^operator O2079 = 0.3804109904199586)
  20914. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  20915. -->
  20916. (S1 ^operator O2079 = -0.3470159027404986)
  20917. --- END Proposal Phase ---
  20918. --- Decision Phase ---
  20919. RL update rl*prefer*rvt*predict-yes*H0*1 0.521341 -0.14093 0.380411 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.83815,0.136443)
  20920. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478641 0.140931 0.619572 -> 0.478642 0.140931 0.619573(R,m,v=1,1,0)
  20921. =>WM: (14596: S1 ^operator O2082)
  20922. 1041: O: O2082 (predict-no)
  20923. --- END Decision Phase ---
  20924. --- Application Phase ---
  20925. --- Firing Productions (PE) For State At Depth 1 ---
  20926. --- Inner Elaboration Phase, active level 1 (S1) ---
  20927. Firing apply*operator
  20928. -->
  20929. (I3 ^predict-no N1041 + :O )
  20930. Firing apply*operator*complete
  20931. -->
  20932. (I3 ^predict-yes N1040 - :O )
  20933. inner elaboration loop at bottom goal.
  20934. --- Change Working Memory (PE) ---
  20935. =>WM: (14597: I3 ^predict-no N1041)
  20936. <=WM: (14584: N1040 ^status complete)
  20937. <=WM: (14583: I3 ^predict-yes N1040)
  20938. --- Firing Productions (IE) For State At Depth 1 ---
  20939. --- Inner Elaboration Phase, active level 1 (S1) ---
  20940. Firing monitor*world
  20941. -->
  20942. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20943. --- Change Working Memory (IE) ---
  20944. --- END Application Phase ---
  20945. --- Output Phase ---
  20946. ENV: Agent did: predict-no for direction L in state State-A
  20947. In State-A moving L
  20948. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20949. predict error 0
  20950. dir: dir isR
  20951. --- END Output Phase ---
  20952. ---- Input Phase ---
  20953. =>WM: (14601: I2 ^dir R)
  20954. =>WM: (14600: I2 ^reward 1)
  20955. =>WM: (14599: I2 ^see 0)
  20956. =>WM: (14598: N1041 ^status complete)
  20957. <=WM: (14587: I2 ^dir L)
  20958. <=WM: (14586: I2 ^reward 1)
  20959. <=WM: (14585: I2 ^see 1)
  20960. =>WM: (14602: I2 ^level-1 L0-root)
  20961. <=WM: (14588: I2 ^level-1 L1-root)
  20962. --- END Input Phase ---
  20963. --- Proposal Phase ---
  20964. --- Inner Elaboration Phase, active level 1 (S1) ---
  20965. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  20966. -->
  20967. (S1 ^operator O2081 = 0.7057283473531946)
  20968. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  20969. -->
  20970. (S1 ^operator O2082 = -0.2023211881870005)
  20971. Firing prefer*rvt*predict-no*H0*6*v1*H1
  20972. -->
  20973. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20974. -->
  20975. Firing elaborate*copy-see-to-output-link
  20976. -->
  20977. (I3 ^see 0 +)
  20978. Firing elaborate*reward*based*on*reward
  20979. -->
  20980. (R1045 ^value 1 +)
  20981. (R1 ^reward R1045 +)
  20982. Firing propose*predict-yes
  20983. -->
  20984. (O2083 ^name predict-yes +)
  20985. (S1 ^operator O2083 +)
  20986. Firing propose*predict-no
  20987. -->
  20988. (O2084 ^name predict-no +)
  20989. (S1 ^operator O2084 +)
  20990. Firing rl*prefer*rvt*predict-no*H0*6
  20991. -->
  20992. (S1 ^operator O2082 = 0.2298614663037441)
  20993. Firing rl*prefer*rvt*predict-yes*H0*5
  20994. -->
  20995. (S1 ^operator O2081 = 0.2939980822884902)
  20996. Firing prefer*rvt*predict-yes*H0
  20997. -->
  20998. Firing prefer*rvt*predict-no*H0
  20999. -->
  21000. Firing elaborate*copy-dir-to-output-link
  21001. -->
  21002. (I3 ^dir R +)
  21003. inner elaboration loop at bottom goal.
  21004. Retracting elaborate*copy-see-to-output-link
  21005. -->
  21006. (I3 ^see 1 +)
  21007. Retracting propose*predict-no
  21008. -->
  21009. (O2082 ^name predict-no +)
  21010. (S1 ^operator O2082 +)
  21011. Retracting propose*predict-yes
  21012. -->
  21013. (O2081 ^name predict-yes +)
  21014. (S1 ^operator O2081 +)
  21015. Retracting elaborate*reward*based*on*reward
  21016. -->
  21017. (R1044 ^value 1 +)
  21018. (R1 ^reward R1044 +)
  21019. Retracting elaborate*copy-dir-to-output-link
  21020. -->
  21021. (I3 ^dir L +)
  21022. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  21023. -->
  21024. (S1 ^operator O2082 = 0.686084421929226)
  21025. Retracting rl*prefer*rvt*predict-no*H0*2
  21026. -->
  21027. (S1 ^operator O2082 = 0.3140350167550124)
  21028. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  21029. -->
  21030. (S1 ^operator O2081 = -0.3470159027404986)
  21031. Retracting rl*prefer*rvt*predict-yes*H0*1
  21032. -->
  21033. (S1 ^operator O2081 = 0.3804123883778544)
  21034. =>WM: (14610: S1 ^operator O2084 +)
  21035. =>WM: (14609: S1 ^operator O2083 +)
  21036. =>WM: (14608: I3 ^dir R)
  21037. =>WM: (14607: O2084 ^name predict-no)
  21038. =>WM: (14606: O2083 ^name predict-yes)
  21039. =>WM: (14605: R1045 ^value 1)
  21040. =>WM: (14604: R1 ^reward R1045)
  21041. =>WM: (14603: I3 ^see 0)
  21042. <=WM: (14594: S1 ^operator O2081 +)
  21043. <=WM: (14595: S1 ^operator O2082 +)
  21044. <=WM: (14596: S1 ^operator O2082)
  21045. <=WM: (14579: I3 ^dir L)
  21046. <=WM: (14590: R1 ^reward R1044)
  21047. <=WM: (14589: I3 ^see 1)
  21048. <=WM: (14593: O2082 ^name predict-no)
  21049. <=WM: (14592: O2081 ^name predict-yes)
  21050. <=WM: (14591: R1044 ^value 1)
  21051. --- Inner Elaboration Phase, active level 1 (S1) ---
  21052. Firing prefer*rvt*predict-yes*H0
  21053. -->
  21054. Firing rl*prefer*rvt*predict-yes*H0*5
  21055. -->
  21056. (S1 ^operator O2083 = 0.2939980822884902)
  21057. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21058. -->
  21059. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  21060. -->
  21061. (S1 ^operator O2083 = 0.7057283473531946)
  21062. Firing prefer*rvt*predict-no*H0
  21063. -->
  21064. Firing rl*prefer*rvt*predict-no*H0*6
  21065. -->
  21066. (S1 ^operator O2084 = 0.2298614663037441)
  21067. Firing prefer*rvt*predict-no*H0*6*v1*H1
  21068. -->
  21069. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  21070. -->
  21071. (S1 ^operator O2084 = -0.2023211881870005)
  21072. inner elaboration loop at bottom goal.
  21073. Retracting rl*prefer*rvt*predict-no*H0*6
  21074. -->
  21075. (S1 ^operator O2082 = 0.2298614663037441)
  21076. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  21077. -->
  21078. (S1 ^operator O2082 = -0.2023211881870005)
  21079. Retracting rl*prefer*rvt*predict-yes*H0*5
  21080. -->
  21081. (S1 ^operator O2081 = 0.2939980822884902)
  21082. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  21083. -->
  21084. (S1 ^operator O2081 = 0.7057283473531946)
  21085. --- END Proposal Phase ---
  21086. --- Decision Phase ---
  21087. RL update rl*prefer*rvt*predict-no*H0*2 0.485042 -0.171007 0.314035 -> 0.485034 -0.171009 0.314025(R,m,v=1,0.86875,0.114741)
  21088. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515052 0.171032 0.686084 -> 0.515043 0.17103 0.686073(R,m,v=1,1,0)
  21089. =>WM: (14611: S1 ^operator O2083)
  21090. 1042: O: O2083 (predict-yes)
  21091. --- END Decision Phase ---
  21092. --- Application Phase ---
  21093. --- Firing Productions (PE) For State At Depth 1 ---
  21094. --- Inner Elaboration Phase, active level 1 (S1) ---
  21095. Firing apply*operator
  21096. -->
  21097. (I3 ^predict-yes N1042 + :O )
  21098. Firing apply*operator*complete
  21099. -->
  21100. (I3 ^predict-no N1041 - :O )
  21101. inner elaboration loop at bottom goal.
  21102. --- Change Working Memory (PE) ---
  21103. =>WM: (14612: I3 ^predict-yes N1042)
  21104. <=WM: (14598: N1041 ^status complete)
  21105. <=WM: (14597: I3 ^predict-no N1041)
  21106. --- Firing Productions (IE) For State At Depth 1 ---
  21107. --- Inner Elaboration Phase, active level 1 (S1) ---
  21108. Firing monitor*world
  21109. -->
  21110. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21111. --- Change Working Memory (IE) ---
  21112. --- END Application Phase ---
  21113. --- Output Phase ---
  21114. ENV: Agent did: predict-yes for direction R in state State-A
  21115. In State-A moving R
  21116. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  21117. predict error 0
  21118. dir: dir isU
  21119. --- END Output Phase ---
  21120. /|--- Input Phase ---
  21121. =>WM: (14616: I2 ^dir U)
  21122. =>WM: (14615: I2 ^reward 1)
  21123. =>WM: (14614: I2 ^see 1)
  21124. =>WM: (14613: N1042 ^status complete)
  21125. <=WM: (14601: I2 ^dir R)
  21126. <=WM: (14600: I2 ^reward 1)
  21127. <=WM: (14599: I2 ^see 0)
  21128. =>WM: (14617: I2 ^level-1 R1-root)
  21129. <=WM: (14602: I2 ^level-1 L0-root)
  21130. --- END Input Phase ---
  21131. --- Proposal Phase ---
  21132. --- Inner Elaboration Phase, active level 1 (S1) ---
  21133. Firing elaborate*copy-see-to-output-link
  21134. -->
  21135. (I3 ^see 1 +)
  21136. Firing elaborate*reward*based*on*reward
  21137. -->
  21138. (R1046 ^value 1 +)
  21139. (R1 ^reward R1046 +)
  21140. Firing propose*predict-yes
  21141. -->
  21142. (O2085 ^name predict-yes +)
  21143. (S1 ^operator O2085 +)
  21144. Firing propose*predict-no
  21145. -->
  21146. (O2086 ^name predict-no +)
  21147. (S1 ^operator O2086 +)
  21148. Firing rl*prefer*rvt*predict-no*H0*4
  21149. -->
  21150. (S1 ^operator O2084 = 1.)
  21151. Firing rl*prefer*rvt*predict-yes*H0*3
  21152. -->
  21153. (S1 ^operator O2083 = 0.)
  21154. Firing prefer*rvt*predict-yes*H0
  21155. -->
  21156. Firing prefer*rvt*predict-no*H0
  21157. -->
  21158. Firing elaborate*copy-dir-to-output-link
  21159. -->
  21160. (I3 ^dir U +)
  21161. inner elaboration loop at bottom goal.
  21162. Retracting elaborate*copy-see-to-output-link
  21163. -->
  21164. (I3 ^see 0 +)
  21165. Retracting propose*predict-no
  21166. -->
  21167. (O2084 ^name predict-no +)
  21168. (S1 ^operator O2084 +)
  21169. Retracting propose*predict-yes
  21170. -->
  21171. (O2083 ^name predict-yes +)
  21172. (S1 ^operator O2083 +)
  21173. Retracting elaborate*reward*based*on*reward
  21174. -->
  21175. (R1045 ^value 1 +)
  21176. (R1 ^reward R1045 +)
  21177. Retracting elaborate*copy-dir-to-output-link
  21178. -->
  21179. (I3 ^dir R +)
  21180. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  21181. -->
  21182. (S1 ^operator O2084 = -0.2023211881870005)
  21183. Retracting rl*prefer*rvt*predict-no*H0*6
  21184. -->
  21185. (S1 ^operator O2084 = 0.2298614663037441)
  21186. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  21187. -->
  21188. (S1 ^operator O2083 = 0.7057283473531946)
  21189. Retracting rl*prefer*rvt*predict-yes*H0*5
  21190. -->
  21191. (S1 ^operator O2083 = 0.2939980822884902)
  21192. =>WM: (14625: S1 ^operator O2086 +)
  21193. =>WM: (14624: S1 ^operator O2085 +)
  21194. =>WM: (14623: I3 ^dir U)
  21195. =>WM: (14622: O2086 ^name predict-no)
  21196. =>WM: (14621: O2085 ^name predict-yes)
  21197. =>WM: (14620: R1046 ^value 1)
  21198. =>WM: (14619: R1 ^reward R1046)
  21199. =>WM: (14618: I3 ^see 1)
  21200. <=WM: (14609: S1 ^operator O2083 +)
  21201. <=WM: (14611: S1 ^operator O2083)
  21202. <=WM: (14610: S1 ^operator O2084 +)
  21203. <=WM: (14608: I3 ^dir R)
  21204. <=WM: (14604: R1 ^reward R1045)
  21205. <=WM: (14603: I3 ^see 0)
  21206. <=WM: (14607: O2084 ^name predict-no)
  21207. <=WM: (14606: O2083 ^name predict-yes)
  21208. <=WM: (14605: R1045 ^value 1)
  21209. --- Inner Elaboration Phase, active level 1 (S1) ---
  21210. Firing prefer*rvt*predict-yes*H0
  21211. -->
  21212. Firing rl*prefer*rvt*predict-yes*H0*3
  21213. -->
  21214. (S1 ^operator O2085 = 0.)
  21215. Firing prefer*rvt*predict-no*H0
  21216. -->
  21217. Firing rl*prefer*rvt*predict-no*H0*4
  21218. -->
  21219. (S1 ^operator O2086 = 1.)
  21220. inner elaboration loop at bottom goal.
  21221. Retracting rl*prefer*rvt*predict-no*H0*4
  21222. -->
  21223. (S1 ^operator O2084 = 1.)
  21224. Retracting rl*prefer*rvt*predict-yes*H0*3
  21225. -->
  21226. (S1 ^operator O2083 = 0.)
  21227. --- END Proposal Phase ---
  21228. --- Decision Phase ---
  21229. RL update rl*prefer*rvt*predict-yes*H0*5 0.501072 -0.207074 0.293998 -> 0.501092 -0.207072 0.294021(R,m,v=1,0.853659,0.125692)
  21230. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.49868 0.207048 0.705728 -> 0.498704 0.20705 0.705755(R,m,v=1,1,0)
  21231. =>WM: (14626: S1 ^operator O2086)
  21232. 1043: O: O2086 (predict-no)
  21233. --- END Decision Phase ---
  21234. --- Application Phase ---
  21235. --- Firing Productions (PE) For State At Depth 1 ---
  21236. --- Inner Elaboration Phase, active level 1 (S1) ---
  21237. Firing apply*operator
  21238. -->
  21239. (I3 ^predict-no N1043 + :O )
  21240. Firing apply*operator*complete
  21241. -->
  21242. (I3 ^predict-yes N1042 - :O )
  21243. inner elaboration loop at bottom goal.
  21244. --- Change Working Memory (PE) ---
  21245. =>WM: (14627: I3 ^predict-no N1043)
  21246. <=WM: (14613: N1042 ^status complete)
  21247. <=WM: (14612: I3 ^predict-yes N1042)
  21248. --- Firing Productions (IE) For State At Depth 1 ---
  21249. --- Inner Elaboration Phase, active level 1 (S1) ---
  21250. Firing monitor*world
  21251. -->
  21252. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21253. --- Change Working Memory (IE) ---
  21254. --- END Application Phase ---
  21255. --- Output Phase ---
  21256. ENV: Agent did: predict-no for direction U in state State-B
  21257. In State-B moving U
  21258. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21259. predict error 0
  21260. dir: dir isR
  21261. --- END Output Phase ---
  21262. \-/--- Input Phase ---
  21263. =>WM: (14631: I2 ^dir R)
  21264. =>WM: (14630: I2 ^reward 1)
  21265. =>WM: (14629: I2 ^see 0)
  21266. =>WM: (14628: N1043 ^status complete)
  21267. <=WM: (14616: I2 ^dir U)
  21268. <=WM: (14615: I2 ^reward 1)
  21269. <=WM: (14614: I2 ^see 1)
  21270. =>WM: (14632: I2 ^level-1 R1-root)
  21271. <=WM: (14617: I2 ^level-1 R1-root)
  21272. --- END Input Phase ---
  21273. --- Proposal Phase ---
  21274. --- Inner Elaboration Phase, active level 1 (S1) ---
  21275. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21276. -->
  21277. (S1 ^operator O2085 = -0.252585164213872)
  21278. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  21279. -->
  21280. (S1 ^operator O2086 = 0.7701664478127415)
  21281. Firing prefer*rvt*predict-no*H0*6*v1*H1
  21282. -->
  21283. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21284. -->
  21285. Firing elaborate*copy-see-to-output-link
  21286. -->
  21287. (I3 ^see 0 +)
  21288. Firing elaborate*reward*based*on*reward
  21289. -->
  21290. (R1047 ^value 1 +)
  21291. (R1 ^reward R1047 +)
  21292. Firing propose*predict-yes
  21293. -->
  21294. (O2087 ^name predict-yes +)
  21295. (S1 ^operator O2087 +)
  21296. Firing propose*predict-no
  21297. -->
  21298. (O2088 ^name predict-no +)
  21299. (S1 ^operator O2088 +)
  21300. Firing rl*prefer*rvt*predict-no*H0*6
  21301. -->
  21302. (S1 ^operator O2086 = 0.2298614663037441)
  21303. Firing rl*prefer*rvt*predict-yes*H0*5
  21304. -->
  21305. (S1 ^operator O2085 = 0.2940205065793785)
  21306. Firing prefer*rvt*predict-yes*H0
  21307. -->
  21308. Firing prefer*rvt*predict-no*H0
  21309. -->
  21310. Firing elaborate*copy-dir-to-output-link
  21311. -->
  21312. (I3 ^dir R +)
  21313. inner elaboration loop at bottom goal.
  21314. Retracting elaborate*copy-see-to-output-link
  21315. -->
  21316. (I3 ^see 1 +)
  21317. Retracting propose*predict-no
  21318. -->
  21319. (O2086 ^name predict-no +)
  21320. (S1 ^operator O2086 +)
  21321. Retracting propose*predict-yes
  21322. -->
  21323. (O2085 ^name predict-yes +)
  21324. (S1 ^operator O2085 +)
  21325. Retracting elaborate*reward*based*on*reward
  21326. -->
  21327. (R1046 ^value 1 +)
  21328. (R1 ^reward R1046 +)
  21329. Retracting elaborate*copy-dir-to-output-link
  21330. -->
  21331. (I3 ^dir U +)
  21332. Retracting rl*prefer*rvt*predict-no*H0*4
  21333. -->
  21334. (S1 ^operator O2086 = 1.)
  21335. Retracting rl*prefer*rvt*predict-yes*H0*3
  21336. -->
  21337. (S1 ^operator O2085 = 0.)
  21338. =>WM: (14640: S1 ^operator O2088 +)
  21339. =>WM: (14639: S1 ^operator O2087 +)
  21340. =>WM: (14638: I3 ^dir R)
  21341. =>WM: (14637: O2088 ^name predict-no)
  21342. =>WM: (14636: O2087 ^name predict-yes)
  21343. =>WM: (14635: R1047 ^value 1)
  21344. =>WM: (14634: R1 ^reward R1047)
  21345. =>WM: (14633: I3 ^see 0)
  21346. <=WM: (14624: S1 ^operator O2085 +)
  21347. <=WM: (14625: S1 ^operator O2086 +)
  21348. <=WM: (14626: S1 ^operator O2086)
  21349. <=WM: (14623: I3 ^dir U)
  21350. <=WM: (14619: R1 ^reward R1046)
  21351. <=WM: (14618: I3 ^see 1)
  21352. <=WM: (14622: O2086 ^name predict-no)
  21353. <=WM: (14621: O2085 ^name predict-yes)
  21354. <=WM: (14620: R1046 ^value 1)
  21355. --- Inner Elaboration Phase, active level 1 (S1) ---
  21356. Firing prefer*rvt*predict-yes*H0
  21357. -->
  21358. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21359. -->
  21360. (S1 ^operator O2087 = -0.252585164213872)
  21361. Firing rl*prefer*rvt*predict-yes*H0*5
  21362. -->
  21363. (S1 ^operator O2087 = 0.2940205065793785)
  21364. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21365. -->
  21366. Firing prefer*rvt*predict-no*H0
  21367. -->
  21368. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  21369. -->
  21370. (S1 ^operator O2088 = 0.7701664478127415)
  21371. Firing rl*prefer*rvt*predict-no*H0*6
  21372. -->
  21373. (S1 ^operator O2088 = 0.2298614663037441)
  21374. Firing prefer*rvt*predict-no*H0*6*v1*H1
  21375. -->
  21376. inner elaboration loop at bottom goal.
  21377. Retracting rl*prefer*rvt*predict-no*H0*6
  21378. -->
  21379. (S1 ^operator O2086 = 0.2298614663037441)
  21380. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  21381. -->
  21382. (S1 ^operator O2086 = 0.7701664478127415)
  21383. Retracting rl*prefer*rvt*predict-yes*H0*5
  21384. -->
  21385. (S1 ^operator O2085 = 0.2940205065793785)
  21386. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21387. -->
  21388. (S1 ^operator O2085 = -0.252585164213872)
  21389. --- END Proposal Phase ---
  21390. --- Decision Phase ---
  21391. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21392. =>WM: (14641: S1 ^operator O2088)
  21393. 1044: O: O2088 (predict-no)
  21394. --- END Decision Phase ---
  21395. --- Application Phase ---
  21396. --- Firing Productions (PE) For State At Depth 1 ---
  21397. --- Inner Elaboration Phase, active level 1 (S1) ---
  21398. Firing apply*operator
  21399. -->
  21400. (I3 ^predict-no N1044 + :O )
  21401. Firing apply*operator*complete
  21402. -->
  21403. (I3 ^predict-no N1043 - :O )
  21404. inner elaboration loop at bottom goal.
  21405. --- Change Working Memory (PE) ---
  21406. =>WM: (14642: I3 ^predict-no N1044)
  21407. <=WM: (14628: N1043 ^status complete)
  21408. <=WM: (14627: I3 ^predict-no N1043)
  21409. --- Firing Productions (IE) For State At Depth 1 ---
  21410. --- Inner Elaboration Phase, active level 1 (S1) ---
  21411. Firing monitor*world
  21412. -->
  21413. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21414. --- Change Working Memory (IE) ---
  21415. --- END Application Phase ---
  21416. --- Output Phase ---
  21417. ENV: Agent did: predict-no for direction R in state State-B
  21418. In State-B moving R
  21419. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21420. predict error 0
  21421. dir: dir isL
  21422. --- END Output Phase ---
  21423. |\---- Input Phase ---
  21424. =>WM: (14646: I2 ^dir L)
  21425. =>WM: (14645: I2 ^reward 1)
  21426. =>WM: (14644: I2 ^see 0)
  21427. =>WM: (14643: N1044 ^status complete)
  21428. <=WM: (14631: I2 ^dir R)
  21429. <=WM: (14630: I2 ^reward 1)
  21430. <=WM: (14629: I2 ^see 0)
  21431. =>WM: (14647: I2 ^level-1 R0-root)
  21432. <=WM: (14632: I2 ^level-1 R1-root)
  21433. --- END Input Phase ---
  21434. --- Proposal Phase ---
  21435. --- Inner Elaboration Phase, active level 1 (S1) ---
  21436. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  21437. -->
  21438. (S1 ^operator O2087 = 0.6195734444489578)
  21439. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  21440. -->
  21441. (S1 ^operator O2088 = -0.2190661556260421)
  21442. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21443. -->
  21444. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21445. -->
  21446. Firing elaborate*copy-see-to-output-link
  21447. -->
  21448. (I3 ^see 0 +)
  21449. Firing elaborate*reward*based*on*reward
  21450. -->
  21451. (R1048 ^value 1 +)
  21452. (R1 ^reward R1048 +)
  21453. Firing propose*predict-yes
  21454. -->
  21455. (O2089 ^name predict-yes +)
  21456. (S1 ^operator O2089 +)
  21457. Firing propose*predict-no
  21458. -->
  21459. (O2090 ^name predict-no +)
  21460. (S1 ^operator O2090 +)
  21461. Firing rl*prefer*rvt*predict-no*H0*2
  21462. -->
  21463. (S1 ^operator O2088 = 0.3140251866918842)
  21464. Firing rl*prefer*rvt*predict-yes*H0*1
  21465. -->
  21466. (S1 ^operator O2087 = 0.3804123883778544)
  21467. Firing prefer*rvt*predict-yes*H0
  21468. -->
  21469. Firing prefer*rvt*predict-no*H0
  21470. -->
  21471. Firing elaborate*copy-dir-to-output-link
  21472. -->
  21473. (I3 ^dir L +)
  21474. inner elaboration loop at bottom goal.
  21475. Retracting elaborate*copy-see-to-output-link
  21476. -->
  21477. (I3 ^see 0 +)
  21478. Retracting propose*predict-no
  21479. -->
  21480. (O2088 ^name predict-no +)
  21481. (S1 ^operator O2088 +)
  21482. Retracting propose*predict-yes
  21483. -->
  21484. (O2087 ^name predict-yes +)
  21485. (S1 ^operator O2087 +)
  21486. Retracting elaborate*reward*based*on*reward
  21487. -->
  21488. (R1047 ^value 1 +)
  21489. (R1 ^reward R1047 +)
  21490. Retracting elaborate*copy-dir-to-output-link
  21491. -->
  21492. (I3 ^dir R +)
  21493. Retracting rl*prefer*rvt*predict-no*H0*6
  21494. -->
  21495. (S1 ^operator O2088 = 0.2298614663037441)
  21496. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  21497. -->
  21498. (S1 ^operator O2088 = 0.7701664478127415)
  21499. Retracting rl*prefer*rvt*predict-yes*H0*5
  21500. -->
  21501. (S1 ^operator O2087 = 0.2940205065793785)
  21502. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21503. -->
  21504. (S1 ^operator O2087 = -0.252585164213872)
  21505. =>WM: (14654: S1 ^operator O2090 +)
  21506. =>WM: (14653: S1 ^operator O2089 +)
  21507. =>WM: (14652: I3 ^dir L)
  21508. =>WM: (14651: O2090 ^name predict-no)
  21509. =>WM: (14650: O2089 ^name predict-yes)
  21510. =>WM: (14649: R1048 ^value 1)
  21511. =>WM: (14648: R1 ^reward R1048)
  21512. <=WM: (14639: S1 ^operator O2087 +)
  21513. <=WM: (14640: S1 ^operator O2088 +)
  21514. <=WM: (14641: S1 ^operator O2088)
  21515. <=WM: (14638: I3 ^dir R)
  21516. <=WM: (14634: R1 ^reward R1047)
  21517. <=WM: (14637: O2088 ^name predict-no)
  21518. <=WM: (14636: O2087 ^name predict-yes)
  21519. <=WM: (14635: R1047 ^value 1)
  21520. --- Inner Elaboration Phase, active level 1 (S1) ---
  21521. Firing prefer*rvt*predict-yes*H0
  21522. -->
  21523. Firing rl*prefer*rvt*predict-yes*H0*1
  21524. -->
  21525. (S1 ^operator O2089 = 0.3804123883778544)
  21526. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21527. -->
  21528. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  21529. -->
  21530. (S1 ^operator O2089 = 0.6195734444489578)
  21531. Firing prefer*rvt*predict-no*H0
  21532. -->
  21533. Firing rl*prefer*rvt*predict-no*H0*2
  21534. -->
  21535. (S1 ^operator O2090 = 0.3140251866918842)
  21536. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21537. -->
  21538. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  21539. -->
  21540. (S1 ^operator O2090 = -0.2190661556260421)
  21541. inner elaboration loop at bottom goal.
  21542. Retracting rl*prefer*rvt*predict-no*H0*2
  21543. -->
  21544. (S1 ^operator O2088 = 0.3140251866918842)
  21545. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  21546. -->
  21547. (S1 ^operator O2088 = -0.2190661556260421)
  21548. Retracting rl*prefer*rvt*predict-yes*H0*1
  21549. -->
  21550. (S1 ^operator O2087 = 0.3804123883778544)
  21551. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  21552. -->
  21553. (S1 ^operator O2087 = 0.6195734444489578)
  21554. --- END Proposal Phase ---
  21555. --- Decision Phase ---
  21556. RL update rl*prefer*rvt*predict-no*H0*6 0.611913 -0.382052 0.229861 -> 0.611911 -0.382052 0.229859(R,m,v=1,0.852459,0.126464)
  21557. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388109 0.382057 0.770166 -> 0.388107 0.382057 0.770164(R,m,v=1,1,0)
  21558. =>WM: (14655: S1 ^operator O2089)
  21559. 1045: O: O2089 (predict-yes)
  21560. --- END Decision Phase ---
  21561. --- Application Phase ---
  21562. --- Firing Productions (PE) For State At Depth 1 ---
  21563. --- Inner Elaboration Phase, active level 1 (S1) ---
  21564. Firing apply*operator
  21565. -->
  21566. (I3 ^predict-yes N1045 + :O )
  21567. Firing apply*operator*complete
  21568. -->
  21569. (I3 ^predict-no N1044 - :O )
  21570. inner elaboration loop at bottom goal.
  21571. --- Change Working Memory (PE) ---
  21572. =>WM: (14656: I3 ^predict-yes N1045)
  21573. <=WM: (14643: N1044 ^status complete)
  21574. <=WM: (14642: I3 ^predict-no N1044)
  21575. --- Firing Productions (IE) For State At Depth 1 ---
  21576. --- Inner Elaboration Phase, active level 1 (S1) ---
  21577. Firing monitor*world
  21578. -->
  21579. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21580. --- Change Working Memory (IE) ---
  21581. --- END Application Phase ---
  21582. --- Output Phase ---
  21583. ENV: Agent did: predict-yes for direction L in state State-B
  21584. In State-B moving L
  21585. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  21586. predict error 0
  21587. dir: dir isL
  21588. --- END Output Phase ---
  21589. /|\--- Input Phase ---
  21590. =>WM: (14660: I2 ^dir L)
  21591. =>WM: (14659: I2 ^reward 1)
  21592. =>WM: (14658: I2 ^see 1)
  21593. =>WM: (14657: N1045 ^status complete)
  21594. <=WM: (14646: I2 ^dir L)
  21595. <=WM: (14645: I2 ^reward 1)
  21596. <=WM: (14644: I2 ^see 0)
  21597. =>WM: (14661: I2 ^level-1 L1-root)
  21598. <=WM: (14647: I2 ^level-1 R0-root)
  21599. --- END Input Phase ---
  21600. --- Proposal Phase ---
  21601. --- Inner Elaboration Phase, active level 1 (S1) ---
  21602. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  21603. -->
  21604. (S1 ^operator O2089 = -0.3470159027404986)
  21605. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  21606. -->
  21607. (S1 ^operator O2090 = 0.6860729145467337)
  21608. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21609. -->
  21610. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21611. -->
  21612. Firing elaborate*copy-see-to-output-link
  21613. -->
  21614. (I3 ^see 1 +)
  21615. Firing elaborate*reward*based*on*reward
  21616. -->
  21617. (R1049 ^value 1 +)
  21618. (R1 ^reward R1049 +)
  21619. Firing propose*predict-yes
  21620. -->
  21621. (O2091 ^name predict-yes +)
  21622. (S1 ^operator O2091 +)
  21623. Firing propose*predict-no
  21624. -->
  21625. (O2092 ^name predict-no +)
  21626. (S1 ^operator O2092 +)
  21627. Firing rl*prefer*rvt*predict-no*H0*2
  21628. -->
  21629. (S1 ^operator O2090 = 0.3140251866918842)
  21630. Firing rl*prefer*rvt*predict-yes*H0*1
  21631. -->
  21632. (S1 ^operator O2089 = 0.3804123883778544)
  21633. Firing prefer*rvt*predict-yes*H0
  21634. -->
  21635. Firing prefer*rvt*predict-no*H0
  21636. -->
  21637. Firing elaborate*copy-dir-to-output-link
  21638. -->
  21639. (I3 ^dir L +)
  21640. inner elaboration loop at bottom goal.
  21641. Retracting elaborate*copy-see-to-output-link
  21642. -->
  21643. (I3 ^see 0 +)
  21644. Retracting propose*predict-no
  21645. -->
  21646. (O2090 ^name predict-no +)
  21647. (S1 ^operator O2090 +)
  21648. Retracting propose*predict-yes
  21649. -->
  21650. (O2089 ^name predict-yes +)
  21651. (S1 ^operator O2089 +)
  21652. Retracting elaborate*reward*based*on*reward
  21653. -->
  21654. (R1048 ^value 1 +)
  21655. (R1 ^reward R1048 +)
  21656. Retracting elaborate*copy-dir-to-output-link
  21657. -->
  21658. (I3 ^dir L +)
  21659. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  21660. -->
  21661. (S1 ^operator O2090 = -0.2190661556260421)
  21662. Retracting rl*prefer*rvt*predict-no*H0*2
  21663. -->
  21664. (S1 ^operator O2090 = 0.3140251866918842)
  21665. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  21666. -->
  21667. (S1 ^operator O2089 = 0.6195734444489578)
  21668. Retracting rl*prefer*rvt*predict-yes*H0*1
  21669. -->
  21670. (S1 ^operator O2089 = 0.3804123883778544)
  21671. =>WM: (14668: S1 ^operator O2092 +)
  21672. =>WM: (14667: S1 ^operator O2091 +)
  21673. =>WM: (14666: O2092 ^name predict-no)
  21674. =>WM: (14665: O2091 ^name predict-yes)
  21675. =>WM: (14664: R1049 ^value 1)
  21676. =>WM: (14663: R1 ^reward R1049)
  21677. =>WM: (14662: I3 ^see 1)
  21678. <=WM: (14653: S1 ^operator O2089 +)
  21679. <=WM: (14655: S1 ^operator O2089)
  21680. <=WM: (14654: S1 ^operator O2090 +)
  21681. <=WM: (14648: R1 ^reward R1048)
  21682. <=WM: (14633: I3 ^see 0)
  21683. <=WM: (14651: O2090 ^name predict-no)
  21684. <=WM: (14650: O2089 ^name predict-yes)
  21685. <=WM: (14649: R1048 ^value 1)
  21686. --- Inner Elaboration Phase, active level 1 (S1) ---
  21687. Firing prefer*rvt*predict-yes*H0
  21688. -->
  21689. Firing rl*prefer*rvt*predict-yes*H0*1
  21690. -->
  21691. (S1 ^operator O2091 = 0.3804123883778544)
  21692. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21693. -->
  21694. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  21695. -->
  21696. (S1 ^operator O2091 = -0.3470159027404986)
  21697. Firing prefer*rvt*predict-no*H0
  21698. -->
  21699. Firing rl*prefer*rvt*predict-no*H0*2
  21700. -->
  21701. (S1 ^operator O2092 = 0.3140251866918842)
  21702. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21703. -->
  21704. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  21705. -->
  21706. (S1 ^operator O2092 = 0.6860729145467337)
  21707. inner elaboration loop at bottom goal.
  21708. Retracting rl*prefer*rvt*predict-no*H0*2
  21709. -->
  21710. (S1 ^operator O2090 = 0.3140251866918842)
  21711. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  21712. -->
  21713. (S1 ^operator O2090 = 0.6860729145467337)
  21714. Retracting rl*prefer*rvt*predict-yes*H0*1
  21715. -->
  21716. (S1 ^operator O2089 = 0.3804123883778544)
  21717. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  21718. -->
  21719. (S1 ^operator O2089 = -0.3470159027404986)
  21720. --- END Proposal Phase ---
  21721. --- Decision Phase ---
  21722. RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521343 -0.14093 0.380414(R,m,v=1,0.83908,0.135805)
  21723. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478642 0.140931 0.619573 -> 0.478644 0.140931 0.619575(R,m,v=1,1,0)
  21724. =>WM: (14669: S1 ^operator O2092)
  21725. 1046: O: O2092 (predict-no)
  21726. --- END Decision Phase ---
  21727. --- Application Phase ---
  21728. --- Firing Productions (PE) For State At Depth 1 ---
  21729. --- Inner Elaboration Phase, active level 1 (S1) ---
  21730. Firing apply*operator
  21731. -->
  21732. (I3 ^predict-no N1046 + :O )
  21733. Firing apply*operator*complete
  21734. -->
  21735. (I3 ^predict-yes N1045 - :O )
  21736. inner elaboration loop at bottom goal.
  21737. --- Change Working Memory (PE) ---
  21738. =>WM: (14670: I3 ^predict-no N1046)
  21739. <=WM: (14657: N1045 ^status complete)
  21740. <=WM: (14656: I3 ^predict-yes N1045)
  21741. --- Firing Productions (IE) For State At Depth 1 ---
  21742. --- Inner Elaboration Phase, active level 1 (S1) ---
  21743. Firing monitor*world
  21744. -->
  21745. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21746. --- Change Working Memory (IE) ---
  21747. --- END Application Phase ---
  21748. --- Output Phase ---
  21749. ENV: Agent did: predict-no for direction L in state State-A
  21750. In State-A moving L
  21751. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21752. predict error 0
  21753. dir: dir isL
  21754. --- END Output Phase ---
  21755. -/|--- Input Phase ---
  21756. =>WM: (14674: I2 ^dir L)
  21757. =>WM: (14673: I2 ^reward 1)
  21758. =>WM: (14672: I2 ^see 0)
  21759. =>WM: (14671: N1046 ^status complete)
  21760. <=WM: (14660: I2 ^dir L)
  21761. <=WM: (14659: I2 ^reward 1)
  21762. <=WM: (14658: I2 ^see 1)
  21763. =>WM: (14675: I2 ^level-1 L0-root)
  21764. <=WM: (14661: I2 ^level-1 L1-root)
  21765. --- END Input Phase ---
  21766. --- Proposal Phase ---
  21767. --- Inner Elaboration Phase, active level 1 (S1) ---
  21768. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  21769. -->
  21770. (S1 ^operator O2091 = -0.3332708974800781)
  21771. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  21772. -->
  21773. (S1 ^operator O2092 = 0.6858476397463316)
  21774. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21775. -->
  21776. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21777. -->
  21778. Firing elaborate*copy-see-to-output-link
  21779. -->
  21780. (I3 ^see 0 +)
  21781. Firing elaborate*reward*based*on*reward
  21782. -->
  21783. (R1050 ^value 1 +)
  21784. (R1 ^reward R1050 +)
  21785. Firing propose*predict-yes
  21786. -->
  21787. (O2093 ^name predict-yes +)
  21788. (S1 ^operator O2093 +)
  21789. Firing propose*predict-no
  21790. -->
  21791. (O2094 ^name predict-no +)
  21792. (S1 ^operator O2094 +)
  21793. Firing rl*prefer*rvt*predict-no*H0*2
  21794. -->
  21795. (S1 ^operator O2092 = 0.3140251866918842)
  21796. Firing rl*prefer*rvt*predict-yes*H0*1
  21797. -->
  21798. (S1 ^operator O2091 = 0.3804135384871243)
  21799. Firing prefer*rvt*predict-yes*H0
  21800. -->
  21801. Firing prefer*rvt*predict-no*H0
  21802. -->
  21803. Firing elaborate*copy-dir-to-output-link
  21804. -->
  21805. (I3 ^dir L +)
  21806. inner elaboration loop at bottom goal.
  21807. Retracting elaborate*copy-see-to-output-link
  21808. -->
  21809. (I3 ^see 1 +)
  21810. Retracting propose*predict-no
  21811. -->
  21812. (O2092 ^name predict-no +)
  21813. (S1 ^operator O2092 +)
  21814. Retracting propose*predict-yes
  21815. -->
  21816. (O2091 ^name predict-yes +)
  21817. (S1 ^operator O2091 +)
  21818. Retracting elaborate*reward*based*on*reward
  21819. -->
  21820. (R1049 ^value 1 +)
  21821. (R1 ^reward R1049 +)
  21822. Retracting elaborate*copy-dir-to-output-link
  21823. -->
  21824. (I3 ^dir L +)
  21825. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  21826. -->
  21827. (S1 ^operator O2092 = 0.6860729145467337)
  21828. Retracting rl*prefer*rvt*predict-no*H0*2
  21829. -->
  21830. (S1 ^operator O2092 = 0.3140251866918842)
  21831. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  21832. -->
  21833. (S1 ^operator O2091 = -0.3470159027404986)
  21834. Retracting rl*prefer*rvt*predict-yes*H0*1
  21835. -->
  21836. (S1 ^operator O2091 = 0.3804135384871243)
  21837. =>WM: (14682: S1 ^operator O2094 +)
  21838. =>WM: (14681: S1 ^operator O2093 +)
  21839. =>WM: (14680: O2094 ^name predict-no)
  21840. =>WM: (14679: O2093 ^name predict-yes)
  21841. =>WM: (14678: R1050 ^value 1)
  21842. =>WM: (14677: R1 ^reward R1050)
  21843. =>WM: (14676: I3 ^see 0)
  21844. <=WM: (14667: S1 ^operator O2091 +)
  21845. <=WM: (14668: S1 ^operator O2092 +)
  21846. <=WM: (14669: S1 ^operator O2092)
  21847. <=WM: (14663: R1 ^reward R1049)
  21848. <=WM: (14662: I3 ^see 1)
  21849. <=WM: (14666: O2092 ^name predict-no)
  21850. <=WM: (14665: O2091 ^name predict-yes)
  21851. <=WM: (14664: R1049 ^value 1)
  21852. --- Inner Elaboration Phase, active level 1 (S1) ---
  21853. Firing prefer*rvt*predict-yes*H0
  21854. -->
  21855. Firing rl*prefer*rvt*predict-yes*H0*1
  21856. -->
  21857. (S1 ^operator O2093 = 0.3804135384871243)
  21858. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21859. -->
  21860. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  21861. -->
  21862. (S1 ^operator O2093 = -0.3332708974800781)
  21863. Firing prefer*rvt*predict-no*H0
  21864. -->
  21865. Firing rl*prefer*rvt*predict-no*H0*2
  21866. -->
  21867. (S1 ^operator O2094 = 0.3140251866918842)
  21868. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21869. -->
  21870. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  21871. -->
  21872. (S1 ^operator O2094 = 0.6858476397463316)
  21873. inner elaboration loop at bottom goal.
  21874. Retracting rl*prefer*rvt*predict-no*H0*2
  21875. -->
  21876. (S1 ^operator O2092 = 0.3140251866918842)
  21877. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  21878. -->
  21879. (S1 ^operator O2092 = 0.6858476397463316)
  21880. Retracting rl*prefer*rvt*predict-yes*H0*1
  21881. -->
  21882. (S1 ^operator O2091 = 0.3804135384871243)
  21883. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  21884. -->
  21885. (S1 ^operator O2091 = -0.3332708974800781)
  21886. --- END Proposal Phase ---
  21887. --- Decision Phase ---
  21888. RL update rl*prefer*rvt*predict-no*H0*2 0.485034 -0.171009 0.314025 -> 0.485028 -0.171011 0.314017(R,m,v=1,0.869565,0.11413)
  21889. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515043 0.17103 0.686073 -> 0.515036 0.171028 0.686063(R,m,v=1,1,0)
  21890. =>WM: (14683: S1 ^operator O2094)
  21891. 1047: O: O2094 (predict-no)
  21892. --- END Decision Phase ---
  21893. --- Application Phase ---
  21894. --- Firing Productions (PE) For State At Depth 1 ---
  21895. --- Inner Elaboration Phase, active level 1 (S1) ---
  21896. Firing apply*operator
  21897. -->
  21898. (I3 ^predict-no N1047 + :O )
  21899. Firing apply*operator*complete
  21900. -->
  21901. (I3 ^predict-no N1046 - :O )
  21902. inner elaboration loop at bottom goal.
  21903. --- Change Working Memory (PE) ---
  21904. =>WM: (14684: I3 ^predict-no N1047)
  21905. <=WM: (14671: N1046 ^status complete)
  21906. <=WM: (14670: I3 ^predict-no N1046)
  21907. --- Firing Productions (IE) For State At Depth 1 ---
  21908. --- Inner Elaboration Phase, active level 1 (S1) ---
  21909. Firing monitor*world
  21910. -->
  21911. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21912. --- Change Working Memory (IE) ---
  21913. --- END Application Phase ---
  21914. --- Output Phase ---
  21915. ENV: Agent did: predict-no for direction L in state State-A
  21916. In State-A moving L
  21917. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21918. predict error 0
  21919. dir: dir isR
  21920. --- END Output Phase ---
  21921. \-/--- Input Phase ---
  21922. =>WM: (14688: I2 ^dir R)
  21923. =>WM: (14687: I2 ^reward 1)
  21924. =>WM: (14686: I2 ^see 0)
  21925. =>WM: (14685: N1047 ^status complete)
  21926. <=WM: (14674: I2 ^dir L)
  21927. <=WM: (14673: I2 ^reward 1)
  21928. <=WM: (14672: I2 ^see 0)
  21929. =>WM: (14689: I2 ^level-1 L0-root)
  21930. <=WM: (14675: I2 ^level-1 L0-root)
  21931. --- END Input Phase ---
  21932. --- Proposal Phase ---
  21933. --- Inner Elaboration Phase, active level 1 (S1) ---
  21934. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  21935. -->
  21936. (S1 ^operator O2093 = 0.7057548618480857)
  21937. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  21938. -->
  21939. (S1 ^operator O2094 = -0.2023211881870005)
  21940. Firing prefer*rvt*predict-no*H0*6*v1*H1
  21941. -->
  21942. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21943. -->
  21944. Firing elaborate*copy-see-to-output-link
  21945. -->
  21946. (I3 ^see 0 +)
  21947. Firing elaborate*reward*based*on*reward
  21948. -->
  21949. (R1051 ^value 1 +)
  21950. (R1 ^reward R1051 +)
  21951. Firing propose*predict-yes
  21952. -->
  21953. (O2095 ^name predict-yes +)
  21954. (S1 ^operator O2095 +)
  21955. Firing propose*predict-no
  21956. -->
  21957. (O2096 ^name predict-no +)
  21958. (S1 ^operator O2096 +)
  21959. Firing rl*prefer*rvt*predict-no*H0*6
  21960. -->
  21961. (S1 ^operator O2094 = 0.2298592186043533)
  21962. Firing rl*prefer*rvt*predict-yes*H0*5
  21963. -->
  21964. (S1 ^operator O2093 = 0.2940205065793785)
  21965. Firing prefer*rvt*predict-yes*H0
  21966. -->
  21967. Firing prefer*rvt*predict-no*H0
  21968. -->
  21969. Firing elaborate*copy-dir-to-output-link
  21970. -->
  21971. (I3 ^dir R +)
  21972. inner elaboration loop at bottom goal.
  21973. Retracting elaborate*copy-see-to-output-link
  21974. -->
  21975. (I3 ^see 0 +)
  21976. Retracting propose*predict-no
  21977. -->
  21978. (O2094 ^name predict-no +)
  21979. (S1 ^operator O2094 +)
  21980. Retracting propose*predict-yes
  21981. -->
  21982. (O2093 ^name predict-yes +)
  21983. (S1 ^operator O2093 +)
  21984. Retracting elaborate*reward*based*on*reward
  21985. -->
  21986. (R1050 ^value 1 +)
  21987. (R1 ^reward R1050 +)
  21988. Retracting elaborate*copy-dir-to-output-link
  21989. -->
  21990. (I3 ^dir L +)
  21991. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  21992. -->
  21993. (S1 ^operator O2094 = 0.6858476397463316)
  21994. Retracting rl*prefer*rvt*predict-no*H0*2
  21995. -->
  21996. (S1 ^operator O2094 = 0.3140171210188315)
  21997. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  21998. -->
  21999. (S1 ^operator O2093 = -0.3332708974800781)
  22000. Retracting rl*prefer*rvt*predict-yes*H0*1
  22001. -->
  22002. (S1 ^operator O2093 = 0.3804135384871243)
  22003. =>WM: (14696: S1 ^operator O2096 +)
  22004. =>WM: (14695: S1 ^operator O2095 +)
  22005. =>WM: (14694: I3 ^dir R)
  22006. =>WM: (14693: O2096 ^name predict-no)
  22007. =>WM: (14692: O2095 ^name predict-yes)
  22008. =>WM: (14691: R1051 ^value 1)
  22009. =>WM: (14690: R1 ^reward R1051)
  22010. <=WM: (14681: S1 ^operator O2093 +)
  22011. <=WM: (14682: S1 ^operator O2094 +)
  22012. <=WM: (14683: S1 ^operator O2094)
  22013. <=WM: (14652: I3 ^dir L)
  22014. <=WM: (14677: R1 ^reward R1050)
  22015. <=WM: (14680: O2094 ^name predict-no)
  22016. <=WM: (14679: O2093 ^name predict-yes)
  22017. <=WM: (14678: R1050 ^value 1)
  22018. --- Inner Elaboration Phase, active level 1 (S1) ---
  22019. Firing prefer*rvt*predict-yes*H0
  22020. -->
  22021. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22022. -->
  22023. (S1 ^operator O2095 = 0.7057548618480857)
  22024. Firing rl*prefer*rvt*predict-yes*H0*5
  22025. -->
  22026. (S1 ^operator O2095 = 0.2940205065793785)
  22027. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22028. -->
  22029. Firing prefer*rvt*predict-no*H0
  22030. -->
  22031. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22032. -->
  22033. (S1 ^operator O2096 = -0.2023211881870005)
  22034. Firing rl*prefer*rvt*predict-no*H0*6
  22035. -->
  22036. (S1 ^operator O2096 = 0.2298592186043533)
  22037. Firing prefer*rvt*predict-no*H0*6*v1*H1
  22038. -->
  22039. inner elaboration loop at bottom goal.
  22040. Retracting rl*prefer*rvt*predict-no*H0*6
  22041. -->
  22042. (S1 ^operator O2094 = 0.2298592186043533)
  22043. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22044. -->
  22045. (S1 ^operator O2094 = -0.2023211881870005)
  22046. Retracting rl*prefer*rvt*predict-yes*H0*5
  22047. -->
  22048. (S1 ^operator O2093 = 0.2940205065793785)
  22049. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22050. -->
  22051. (S1 ^operator O2093 = 0.7057548618480857)
  22052. --- END Proposal Phase ---
  22053. --- Decision Phase ---
  22054. RL update rl*prefer*rvt*predict-no*H0*2 0.485028 -0.171011 0.314017 -> 0.485037 -0.171008 0.314028(R,m,v=1,0.87037,0.113527)
  22055. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514865 0.170982 0.685848 -> 0.514876 0.170985 0.685861(R,m,v=1,1,0)
  22056. =>WM: (14697: S1 ^operator O2095)
  22057. 1048: O: O2095 (predict-yes)
  22058. --- END Decision Phase ---
  22059. --- Application Phase ---
  22060. --- Firing Productions (PE) For State At Depth 1 ---
  22061. --- Inner Elaboration Phase, active level 1 (S1) ---
  22062. Firing apply*operator
  22063. -->
  22064. (I3 ^predict-yes N1048 + :O )
  22065. Firing apply*operator*complete
  22066. -->
  22067. (I3 ^predict-no N1047 - :O )
  22068. inner elaboration loop at bottom goal.
  22069. --- Change Working Memory (PE) ---
  22070. =>WM: (14698: I3 ^predict-yes N1048)
  22071. <=WM: (14685: N1047 ^status complete)
  22072. <=WM: (14684: I3 ^predict-no N1047)
  22073. --- Firing Productions (IE) For State At Depth 1 ---
  22074. --- Inner Elaboration Phase, active level 1 (S1) ---
  22075. Firing monitor*world
  22076. -->
  22077. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22078. --- Change Working Memory (IE) ---
  22079. --- END Application Phase ---
  22080. --- Output Phase ---
  22081. ENV: Agent did: predict-yes for direction R in state State-A
  22082. In State-A moving R
  22083. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22084. predict error 0
  22085. dir: dir isL
  22086. --- END Output Phase ---
  22087. |\---- Input Phase ---
  22088. =>WM: (14702: I2 ^dir L)
  22089. =>WM: (14701: I2 ^reward 1)
  22090. =>WM: (14700: I2 ^see 1)
  22091. =>WM: (14699: N1048 ^status complete)
  22092. <=WM: (14688: I2 ^dir R)
  22093. <=WM: (14687: I2 ^reward 1)
  22094. <=WM: (14686: I2 ^see 0)
  22095. =>WM: (14703: I2 ^level-1 R1-root)
  22096. <=WM: (14689: I2 ^level-1 L0-root)
  22097. --- END Input Phase ---
  22098. --- Proposal Phase ---
  22099. --- Inner Elaboration Phase, active level 1 (S1) ---
  22100. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  22101. -->
  22102. (S1 ^operator O2095 = 0.619600420969239)
  22103. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  22104. -->
  22105. (S1 ^operator O2096 = -0.1479504104026684)
  22106. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22107. -->
  22108. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22109. -->
  22110. Firing elaborate*copy-see-to-output-link
  22111. -->
  22112. (I3 ^see 1 +)
  22113. Firing elaborate*reward*based*on*reward
  22114. -->
  22115. (R1052 ^value 1 +)
  22116. (R1 ^reward R1052 +)
  22117. Firing propose*predict-yes
  22118. -->
  22119. (O2097 ^name predict-yes +)
  22120. (S1 ^operator O2097 +)
  22121. Firing propose*predict-no
  22122. -->
  22123. (O2098 ^name predict-no +)
  22124. (S1 ^operator O2098 +)
  22125. Firing rl*prefer*rvt*predict-no*H0*2
  22126. -->
  22127. (S1 ^operator O2096 = 0.3140282287884166)
  22128. Firing rl*prefer*rvt*predict-yes*H0*1
  22129. -->
  22130. (S1 ^operator O2095 = 0.3804135384871243)
  22131. Firing prefer*rvt*predict-yes*H0
  22132. -->
  22133. Firing prefer*rvt*predict-no*H0
  22134. -->
  22135. Firing elaborate*copy-dir-to-output-link
  22136. -->
  22137. (I3 ^dir L +)
  22138. inner elaboration loop at bottom goal.
  22139. Retracting elaborate*copy-see-to-output-link
  22140. -->
  22141. (I3 ^see 0 +)
  22142. Retracting propose*predict-no
  22143. -->
  22144. (O2096 ^name predict-no +)
  22145. (S1 ^operator O2096 +)
  22146. Retracting propose*predict-yes
  22147. -->
  22148. (O2095 ^name predict-yes +)
  22149. (S1 ^operator O2095 +)
  22150. Retracting elaborate*reward*based*on*reward
  22151. -->
  22152. (R1051 ^value 1 +)
  22153. (R1 ^reward R1051 +)
  22154. Retracting elaborate*copy-dir-to-output-link
  22155. -->
  22156. (I3 ^dir R +)
  22157. Retracting rl*prefer*rvt*predict-no*H0*6
  22158. -->
  22159. (S1 ^operator O2096 = 0.2298592186043533)
  22160. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22161. -->
  22162. (S1 ^operator O2096 = -0.2023211881870005)
  22163. Retracting rl*prefer*rvt*predict-yes*H0*5
  22164. -->
  22165. (S1 ^operator O2095 = 0.2940205065793785)
  22166. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22167. -->
  22168. (S1 ^operator O2095 = 0.7057548618480857)
  22169. =>WM: (14711: S1 ^operator O2098 +)
  22170. =>WM: (14710: S1 ^operator O2097 +)
  22171. =>WM: (14709: I3 ^dir L)
  22172. =>WM: (14708: O2098 ^name predict-no)
  22173. =>WM: (14707: O2097 ^name predict-yes)
  22174. =>WM: (14706: R1052 ^value 1)
  22175. =>WM: (14705: R1 ^reward R1052)
  22176. =>WM: (14704: I3 ^see 1)
  22177. <=WM: (14695: S1 ^operator O2095 +)
  22178. <=WM: (14697: S1 ^operator O2095)
  22179. <=WM: (14696: S1 ^operator O2096 +)
  22180. <=WM: (14694: I3 ^dir R)
  22181. <=WM: (14690: R1 ^reward R1051)
  22182. <=WM: (14676: I3 ^see 0)
  22183. <=WM: (14693: O2096 ^name predict-no)
  22184. <=WM: (14692: O2095 ^name predict-yes)
  22185. <=WM: (14691: R1051 ^value 1)
  22186. --- Inner Elaboration Phase, active level 1 (S1) ---
  22187. Firing prefer*rvt*predict-yes*H0
  22188. -->
  22189. Firing rl*prefer*rvt*predict-yes*H0*1
  22190. -->
  22191. (S1 ^operator O2097 = 0.3804135384871243)
  22192. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22193. -->
  22194. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  22195. -->
  22196. (S1 ^operator O2097 = 0.619600420969239)
  22197. Firing prefer*rvt*predict-no*H0
  22198. -->
  22199. Firing rl*prefer*rvt*predict-no*H0*2
  22200. -->
  22201. (S1 ^operator O2098 = 0.3140282287884166)
  22202. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22203. -->
  22204. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  22205. -->
  22206. (S1 ^operator O2098 = -0.1479504104026684)
  22207. inner elaboration loop at bottom goal.
  22208. Retracting rl*prefer*rvt*predict-no*H0*2
  22209. -->
  22210. (S1 ^operator O2096 = 0.3140282287884166)
  22211. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  22212. -->
  22213. (S1 ^operator O2096 = -0.1479504104026684)
  22214. Retracting rl*prefer*rvt*predict-yes*H0*1
  22215. -->
  22216. (S1 ^operator O2095 = 0.3804135384871243)
  22217. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  22218. -->
  22219. (S1 ^operator O2095 = 0.619600420969239)
  22220. --- END Proposal Phase ---
  22221. --- Decision Phase ---
  22222. RL update rl*prefer*rvt*predict-yes*H0*5 0.501092 -0.207072 0.294021 -> 0.501109 -0.20707 0.294039(R,m,v=1,0.854545,0.125055)
  22223. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498704 0.20705 0.705755 -> 0.498724 0.207053 0.705777(R,m,v=1,1,0)
  22224. =>WM: (14712: S1 ^operator O2097)
  22225. 1049: O: O2097 (predict-yes)
  22226. --- END Decision Phase ---
  22227. --- Application Phase ---
  22228. --- Firing Productions (PE) For State At Depth 1 ---
  22229. --- Inner Elaboration Phase, active level 1 (S1) ---
  22230. Firing apply*operator
  22231. -->
  22232. (I3 ^predict-yes N1049 + :O )
  22233. Firing apply*operator*complete
  22234. -->
  22235. (I3 ^predict-yes N1048 - :O )
  22236. inner elaboration loop at bottom goal.
  22237. --- Change Working Memory (PE) ---
  22238. =>WM: (14713: I3 ^predict-yes N1049)
  22239. <=WM: (14699: N1048 ^status complete)
  22240. <=WM: (14698: I3 ^predict-yes N1048)
  22241. --- Firing Productions (IE) For State At Depth 1 ---
  22242. --- Inner Elaboration Phase, active level 1 (S1) ---
  22243. Firing monitor*world
  22244. -->
  22245. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22246. --- Change Working Memory (IE) ---
  22247. --- END Application Phase ---
  22248. --- Output Phase ---
  22249. ENV: Agent did: predict-yes for direction L in state State-B
  22250. In State-B moving L
  22251. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  22252. predict error 0
  22253. dir: dir isL
  22254. --- END Output Phase ---
  22255. /|\--- Input Phase ---
  22256. =>WM: (14717: I2 ^dir L)
  22257. =>WM: (14716: I2 ^reward 1)
  22258. =>WM: (14715: I2 ^see 1)
  22259. =>WM: (14714: N1049 ^status complete)
  22260. <=WM: (14702: I2 ^dir L)
  22261. <=WM: (14701: I2 ^reward 1)
  22262. <=WM: (14700: I2 ^see 1)
  22263. =>WM: (14718: I2 ^level-1 L1-root)
  22264. <=WM: (14703: I2 ^level-1 R1-root)
  22265. --- END Input Phase ---
  22266. --- Proposal Phase ---
  22267. --- Inner Elaboration Phase, active level 1 (S1) ---
  22268. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  22269. -->
  22270. (S1 ^operator O2097 = -0.3470159027404986)
  22271. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  22272. -->
  22273. (S1 ^operator O2098 = 0.6860634902400752)
  22274. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22275. -->
  22276. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22277. -->
  22278. Firing elaborate*copy-see-to-output-link
  22279. -->
  22280. (I3 ^see 1 +)
  22281. Firing elaborate*reward*based*on*reward
  22282. -->
  22283. (R1053 ^value 1 +)
  22284. (R1 ^reward R1053 +)
  22285. Firing propose*predict-yes
  22286. -->
  22287. (O2099 ^name predict-yes +)
  22288. (S1 ^operator O2099 +)
  22289. Firing propose*predict-no
  22290. -->
  22291. (O2100 ^name predict-no +)
  22292. (S1 ^operator O2100 +)
  22293. Firing rl*prefer*rvt*predict-no*H0*2
  22294. -->
  22295. (S1 ^operator O2098 = 0.3140282287884166)
  22296. Firing rl*prefer*rvt*predict-yes*H0*1
  22297. -->
  22298. (S1 ^operator O2097 = 0.3804135384871243)
  22299. Firing prefer*rvt*predict-yes*H0
  22300. -->
  22301. Firing prefer*rvt*predict-no*H0
  22302. -->
  22303. Firing elaborate*copy-dir-to-output-link
  22304. -->
  22305. (I3 ^dir L +)
  22306. inner elaboration loop at bottom goal.
  22307. Retracting elaborate*copy-see-to-output-link
  22308. -->
  22309. (I3 ^see 1 +)
  22310. Retracting propose*predict-no
  22311. -->
  22312. (O2098 ^name predict-no +)
  22313. (S1 ^operator O2098 +)
  22314. Retracting propose*predict-yes
  22315. -->
  22316. (O2097 ^name predict-yes +)
  22317. (S1 ^operator O2097 +)
  22318. Retracting elaborate*reward*based*on*reward
  22319. -->
  22320. (R1052 ^value 1 +)
  22321. (R1 ^reward R1052 +)
  22322. Retracting elaborate*copy-dir-to-output-link
  22323. -->
  22324. (I3 ^dir L +)
  22325. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  22326. -->
  22327. (S1 ^operator O2098 = -0.1479504104026684)
  22328. Retracting rl*prefer*rvt*predict-no*H0*2
  22329. -->
  22330. (S1 ^operator O2098 = 0.3140282287884166)
  22331. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  22332. -->
  22333. (S1 ^operator O2097 = 0.619600420969239)
  22334. Retracting rl*prefer*rvt*predict-yes*H0*1
  22335. -->
  22336. (S1 ^operator O2097 = 0.3804135384871243)
  22337. =>WM: (14724: S1 ^operator O2100 +)
  22338. =>WM: (14723: S1 ^operator O2099 +)
  22339. =>WM: (14722: O2100 ^name predict-no)
  22340. =>WM: (14721: O2099 ^name predict-yes)
  22341. =>WM: (14720: R1053 ^value 1)
  22342. =>WM: (14719: R1 ^reward R1053)
  22343. <=WM: (14710: S1 ^operator O2097 +)
  22344. <=WM: (14712: S1 ^operator O2097)
  22345. <=WM: (14711: S1 ^operator O2098 +)
  22346. <=WM: (14705: R1 ^reward R1052)
  22347. <=WM: (14708: O2098 ^name predict-no)
  22348. <=WM: (14707: O2097 ^name predict-yes)
  22349. <=WM: (14706: R1052 ^value 1)
  22350. --- Inner Elaboration Phase, active level 1 (S1) ---
  22351. Firing prefer*rvt*predict-yes*H0
  22352. -->
  22353. Firing rl*prefer*rvt*predict-yes*H0*1
  22354. -->
  22355. (S1 ^operator O2099 = 0.3804135384871243)
  22356. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22357. -->
  22358. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  22359. -->
  22360. (S1 ^operator O2099 = -0.3470159027404986)
  22361. Firing prefer*rvt*predict-no*H0
  22362. -->
  22363. Firing rl*prefer*rvt*predict-no*H0*2
  22364. -->
  22365. (S1 ^operator O2100 = 0.3140282287884166)
  22366. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22367. -->
  22368. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  22369. -->
  22370. (S1 ^operator O2100 = 0.6860634902400752)
  22371. inner elaboration loop at bottom goal.
  22372. Retracting rl*prefer*rvt*predict-no*H0*2
  22373. -->
  22374. (S1 ^operator O2098 = 0.3140282287884166)
  22375. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  22376. -->
  22377. (S1 ^operator O2098 = 0.6860634902400752)
  22378. Retracting rl*prefer*rvt*predict-yes*H0*1
  22379. -->
  22380. (S1 ^operator O2097 = 0.3804135384871243)
  22381. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  22382. -->
  22383. (S1 ^operator O2097 = -0.3470159027404986)
  22384. --- END Proposal Phase ---
  22385. --- Decision Phase ---
  22386. RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380414 -> 0.521342 -0.14093 0.380412(R,m,v=1,0.84,0.135172)
  22387. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478672 0.140929 0.6196 -> 0.47867 0.140929 0.619599(R,m,v=1,1,0)
  22388. =>WM: (14725: S1 ^operator O2100)
  22389. 1050: O: O2100 (predict-no)
  22390. --- END Decision Phase ---
  22391. --- Application Phase ---
  22392. --- Firing Productions (PE) For State At Depth 1 ---
  22393. --- Inner Elaboration Phase, active level 1 (S1) ---
  22394. Firing apply*operator
  22395. -->
  22396. (I3 ^predict-no N1050 + :O )
  22397. Firing apply*operator*complete
  22398. -->
  22399. (I3 ^predict-yes N1049 - :O )
  22400. inner elaboration loop at bottom goal.
  22401. --- Change Working Memory (PE) ---
  22402. =>WM: (14726: I3 ^predict-no N1050)
  22403. <=WM: (14714: N1049 ^status complete)
  22404. <=WM: (14713: I3 ^predict-yes N1049)
  22405. --- Firing Productions (IE) For State At Depth 1 ---
  22406. --- Inner Elaboration Phase, active level 1 (S1) ---
  22407. Firing monitor*world
  22408. -->
  22409. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22410. --- Change Working Memory (IE) ---
  22411. --- END Application Phase ---
  22412. --- Output Phase ---
  22413. ENV: Agent did: predict-no for direction L in state State-A
  22414. In State-A moving L
  22415. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22416. predict error 0
  22417. dir: dir isU
  22418. --- END Output Phase ---
  22419. -/|--- Input Phase ---
  22420. =>WM: (14730: I2 ^dir U)
  22421. =>WM: (14729: I2 ^reward 1)
  22422. =>WM: (14728: I2 ^see 0)
  22423. =>WM: (14727: N1050 ^status complete)
  22424. <=WM: (14717: I2 ^dir L)
  22425. <=WM: (14716: I2 ^reward 1)
  22426. <=WM: (14715: I2 ^see 1)
  22427. =>WM: (14731: I2 ^level-1 L0-root)
  22428. <=WM: (14718: I2 ^level-1 L1-root)
  22429. --- END Input Phase ---
  22430. --- Proposal Phase ---
  22431. --- Inner Elaboration Phase, active level 1 (S1) ---
  22432. Firing elaborate*copy-see-to-output-link
  22433. -->
  22434. (I3 ^see 0 +)
  22435. Firing elaborate*reward*based*on*reward
  22436. -->
  22437. (R1054 ^value 1 +)
  22438. (R1 ^reward R1054 +)
  22439. Firing propose*predict-yes
  22440. -->
  22441. (O2101 ^name predict-yes +)
  22442. (S1 ^operator O2101 +)
  22443. Firing propose*predict-no
  22444. -->
  22445. (O2102 ^name predict-no +)
  22446. (S1 ^operator O2102 +)
  22447. Firing rl*prefer*rvt*predict-no*H0*4
  22448. -->
  22449. (S1 ^operator O2100 = 1.)
  22450. Firing rl*prefer*rvt*predict-yes*H0*3
  22451. -->
  22452. (S1 ^operator O2099 = 0.)
  22453. Firing prefer*rvt*predict-yes*H0
  22454. -->
  22455. Firing prefer*rvt*predict-no*H0
  22456. -->
  22457. Firing elaborate*copy-dir-to-output-link
  22458. -->
  22459. (I3 ^dir U +)
  22460. inner elaboration loop at bottom goal.
  22461. Retracting elaborate*copy-see-to-output-link
  22462. -->
  22463. (I3 ^see 1 +)
  22464. Retracting propose*predict-no
  22465. -->
  22466. (O2100 ^name predict-no +)
  22467. (S1 ^operator O2100 +)
  22468. Retracting propose*predict-yes
  22469. -->
  22470. (O2099 ^name predict-yes +)
  22471. (S1 ^operator O2099 +)
  22472. Retracting elaborate*reward*based*on*reward
  22473. -->
  22474. (R1053 ^value 1 +)
  22475. (R1 ^reward R1053 +)
  22476. Retracting elaborate*copy-dir-to-output-link
  22477. -->
  22478. (I3 ^dir L +)
  22479. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  22480. -->
  22481. (S1 ^operator O2100 = 0.6860634902400752)
  22482. Retracting rl*prefer*rvt*predict-no*H0*2
  22483. -->
  22484. (S1 ^operator O2100 = 0.3140282287884166)
  22485. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  22486. -->
  22487. (S1 ^operator O2099 = -0.3470159027404986)
  22488. Retracting rl*prefer*rvt*predict-yes*H0*1
  22489. -->
  22490. (S1 ^operator O2099 = 0.3804124062940181)
  22491. =>WM: (14739: S1 ^operator O2102 +)
  22492. =>WM: (14738: S1 ^operator O2101 +)
  22493. =>WM: (14737: I3 ^dir U)
  22494. =>WM: (14736: O2102 ^name predict-no)
  22495. =>WM: (14735: O2101 ^name predict-yes)
  22496. =>WM: (14734: R1054 ^value 1)
  22497. =>WM: (14733: R1 ^reward R1054)
  22498. =>WM: (14732: I3 ^see 0)
  22499. <=WM: (14723: S1 ^operator O2099 +)
  22500. <=WM: (14724: S1 ^operator O2100 +)
  22501. <=WM: (14725: S1 ^operator O2100)
  22502. <=WM: (14709: I3 ^dir L)
  22503. <=WM: (14719: R1 ^reward R1053)
  22504. <=WM: (14704: I3 ^see 1)
  22505. <=WM: (14722: O2100 ^name predict-no)
  22506. <=WM: (14721: O2099 ^name predict-yes)
  22507. <=WM: (14720: R1053 ^value 1)
  22508. --- Inner Elaboration Phase, active level 1 (S1) ---
  22509. Firing prefer*rvt*predict-yes*H0
  22510. -->
  22511. Firing rl*prefer*rvt*predict-yes*H0*3
  22512. -->
  22513. (S1 ^operator O2101 = 0.)
  22514. Firing prefer*rvt*predict-no*H0
  22515. -->
  22516. Firing rl*prefer*rvt*predict-no*H0*4
  22517. -->
  22518. (S1 ^operator O2102 = 1.)
  22519. inner elaboration loop at bottom goal.
  22520. Retracting rl*prefer*rvt*predict-no*H0*4
  22521. -->
  22522. (S1 ^operator O2100 = 1.)
  22523. Retracting rl*prefer*rvt*predict-yes*H0*3
  22524. -->
  22525. (S1 ^operator O2099 = 0.)
  22526. --- END Proposal Phase ---
  22527. --- Decision Phase ---
  22528. RL update rl*prefer*rvt*predict-no*H0*2 0.485037 -0.171008 0.314028 -> 0.485031 -0.17101 0.314021(R,m,v=1,0.871166,0.112929)
  22529. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515036 0.171028 0.686063 -> 0.515029 0.171026 0.686055(R,m,v=1,1,0)
  22530. =>WM: (14740: S1 ^operator O2102)
  22531. 1051: O: O2102 (predict-no)
  22532. --- END Decision Phase ---
  22533. --- Application Phase ---
  22534. --- Firing Productions (PE) For State At Depth 1 ---
  22535. --- Inner Elaboration Phase, active level 1 (S1) ---
  22536. Firing apply*operator
  22537. -->
  22538. (I3 ^predict-no N1051 + :O )
  22539. Firing apply*operator*complete
  22540. -->
  22541. (I3 ^predict-no N1050 - :O )
  22542. inner elaboration loop at bottom goal.
  22543. --- Change Working Memory (PE) ---
  22544. =>WM: (14741: I3 ^predict-no N1051)
  22545. <=WM: (14727: N1050 ^status complete)
  22546. <=WM: (14726: I3 ^predict-no N1050)
  22547. --- Firing Productions (IE) For State At Depth 1 ---
  22548. --- Inner Elaboration Phase, active level 1 (S1) ---
  22549. Firing monitor*world
  22550. -->
  22551. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22552. --- Change Working Memory (IE) ---
  22553. --- END Application Phase ---
  22554. --- Output Phase ---
  22555. ENV: Agent did: predict-no for direction U in state State-A
  22556. In State-A moving U
  22557. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22558. predict error 0
  22559. dir: dir isU
  22560. --- END Output Phase ---
  22561. \--- Input Phase ---
  22562. =>WM: (14745: I2 ^dir U)
  22563. =>WM: (14744: I2 ^reward 1)
  22564. =>WM: (14743: I2 ^see 0)
  22565. =>WM: (14742: N1051 ^status complete)
  22566. <=WM: (14730: I2 ^dir U)
  22567. <=WM: (14729: I2 ^reward 1)
  22568. <=WM: (14728: I2 ^see 0)
  22569. =>WM: (14746: I2 ^level-1 L0-root)
  22570. <=WM: (14731: I2 ^level-1 L0-root)
  22571. --- END Input Phase ---
  22572. --- Proposal Phase ---
  22573. --- Inner Elaboration Phase, active level 1 (S1) ---
  22574. Firing elaborate*copy-see-to-output-link
  22575. -->
  22576. (I3 ^see 0 +)
  22577. Firing elaborate*reward*based*on*reward
  22578. -->
  22579. (R1055 ^value 1 +)
  22580. (R1 ^reward R1055 +)
  22581. Firing propose*predict-yes
  22582. -->
  22583. (O2103 ^name predict-yes +)
  22584. (S1 ^operator O2103 +)
  22585. Firing propose*predict-no
  22586. -->
  22587. (O2104 ^name predict-no +)
  22588. (S1 ^operator O2104 +)
  22589. Firing rl*prefer*rvt*predict-no*H0*4
  22590. -->
  22591. (S1 ^operator O2102 = 1.)
  22592. Firing rl*prefer*rvt*predict-yes*H0*3
  22593. -->
  22594. (S1 ^operator O2101 = 0.)
  22595. Firing prefer*rvt*predict-yes*H0
  22596. -->
  22597. Firing prefer*rvt*predict-no*H0
  22598. -->
  22599. Firing elaborate*copy-dir-to-output-link
  22600. -->
  22601. (I3 ^dir U +)
  22602. inner elaboration loop at bottom goal.
  22603. Retracting elaborate*copy-see-to-output-link
  22604. -->
  22605. (I3 ^see 0 +)
  22606. Retracting propose*predict-no
  22607. -->
  22608. (O2102 ^name predict-no +)
  22609. (S1 ^operator O2102 +)
  22610. Retracting propose*predict-yes
  22611. -->
  22612. (O2101 ^name predict-yes +)
  22613. (S1 ^operator O2101 +)
  22614. Retracting elaborate*reward*based*on*reward
  22615. -->
  22616. (R1054 ^value 1 +)
  22617. (R1 ^reward R1054 +)
  22618. Retracting elaborate*copy-dir-to-output-link
  22619. -->
  22620. (I3 ^dir U +)
  22621. Retracting rl*prefer*rvt*predict-no*H0*4
  22622. -->
  22623. (S1 ^operator O2102 = 1.)
  22624. Retracting rl*prefer*rvt*predict-yes*H0*3
  22625. -->
  22626. (S1 ^operator O2101 = 0.)
  22627. =>WM: (14752: S1 ^operator O2104 +)
  22628. =>WM: (14751: S1 ^operator O2103 +)
  22629. =>WM: (14750: O2104 ^name predict-no)
  22630. =>WM: (14749: O2103 ^name predict-yes)
  22631. =>WM: (14748: R1055 ^value 1)
  22632. =>WM: (14747: R1 ^reward R1055)
  22633. <=WM: (14738: S1 ^operator O2101 +)
  22634. <=WM: (14739: S1 ^operator O2102 +)
  22635. <=WM: (14740: S1 ^operator O2102)
  22636. <=WM: (14733: R1 ^reward R1054)
  22637. <=WM: (14736: O2102 ^name predict-no)
  22638. <=WM: (14735: O2101 ^name predict-yes)
  22639. <=WM: (14734: R1054 ^value 1)
  22640. --- Inner Elaboration Phase, active level 1 (S1) ---
  22641. Firing prefer*rvt*predict-yes*H0
  22642. -->
  22643. Firing rl*prefer*rvt*predict-yes*H0*3
  22644. -->
  22645. (S1 ^operator O2103 = 0.)
  22646. Firing prefer*rvt*predict-no*H0
  22647. -->
  22648. Firing rl*prefer*rvt*predict-no*H0*4
  22649. -->
  22650. (S1 ^operator O2104 = 1.)
  22651. inner elaboration loop at bottom goal.
  22652. Retracting rl*prefer*rvt*predict-no*H0*4
  22653. -->
  22654. (S1 ^operator O2102 = 1.)
  22655. Retracting rl*prefer*rvt*predict-yes*H0*3
  22656. -->
  22657. (S1 ^operator O2101 = 0.)
  22658. --- END Proposal Phase ---
  22659. --- Decision Phase ---
  22660. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22661. =>WM: (14753: S1 ^operator O2104)
  22662. 1052: O: O2104 (predict-no)
  22663. --- END Decision Phase ---
  22664. --- Application Phase ---
  22665. --- Firing Productions (PE) For State At Depth 1 ---
  22666. --- Inner Elaboration Phase, active level 1 (S1) ---
  22667. Firing apply*operator
  22668. -->
  22669. (I3 ^predict-no N1052 + :O )
  22670. Firing apply*operator*complete
  22671. -->
  22672. (I3 ^predict-no N1051 - :O )
  22673. inner elaboration loop at bottom goal.
  22674. --- Change Working Memory (PE) ---
  22675. =>WM: (14754: I3 ^predict-no N1052)
  22676. <=WM: (14742: N1051 ^status complete)
  22677. <=WM: (14741: I3 ^predict-no N1051)
  22678. --- Firing Productions (IE) For State At Depth 1 ---
  22679. --- Inner Elaboration Phase, active level 1 (S1) ---
  22680. Firing monitor*world
  22681. -->
  22682. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22683. --- Change Working Memory (IE) ---
  22684. --- END Application Phase ---
  22685. --- Output Phase ---
  22686. ENV: Agent did: predict-no for direction U in state State-A
  22687. In State-A moving U
  22688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22689. predict error 0
  22690. dir: dir isR
  22691. --- END Output Phase ---
  22692. -/|--- Input Phase ---
  22693. =>WM: (14758: I2 ^dir R)
  22694. =>WM: (14757: I2 ^reward 1)
  22695. =>WM: (14756: I2 ^see 0)
  22696. =>WM: (14755: N1052 ^status complete)
  22697. <=WM: (14745: I2 ^dir U)
  22698. <=WM: (14744: I2 ^reward 1)
  22699. <=WM: (14743: I2 ^see 0)
  22700. =>WM: (14759: I2 ^level-1 L0-root)
  22701. <=WM: (14746: I2 ^level-1 L0-root)
  22702. --- END Input Phase ---
  22703. --- Proposal Phase ---
  22704. --- Inner Elaboration Phase, active level 1 (S1) ---
  22705. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22706. -->
  22707. (S1 ^operator O2103 = 0.7057765679517091)
  22708. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22709. -->
  22710. (S1 ^operator O2104 = -0.2023211881870005)
  22711. Firing prefer*rvt*predict-no*H0*6*v1*H1
  22712. -->
  22713. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22714. -->
  22715. Firing elaborate*copy-see-to-output-link
  22716. -->
  22717. (I3 ^see 0 +)
  22718. Firing elaborate*reward*based*on*reward
  22719. -->
  22720. (R1056 ^value 1 +)
  22721. (R1 ^reward R1056 +)
  22722. Firing propose*predict-yes
  22723. -->
  22724. (O2105 ^name predict-yes +)
  22725. (S1 ^operator O2105 +)
  22726. Firing propose*predict-no
  22727. -->
  22728. (O2106 ^name predict-no +)
  22729. (S1 ^operator O2106 +)
  22730. Firing rl*prefer*rvt*predict-no*H0*6
  22731. -->
  22732. (S1 ^operator O2104 = 0.2298592186043533)
  22733. Firing rl*prefer*rvt*predict-yes*H0*5
  22734. -->
  22735. (S1 ^operator O2103 = 0.2940389010748334)
  22736. Firing prefer*rvt*predict-yes*H0
  22737. -->
  22738. Firing prefer*rvt*predict-no*H0
  22739. -->
  22740. Firing elaborate*copy-dir-to-output-link
  22741. -->
  22742. (I3 ^dir R +)
  22743. inner elaboration loop at bottom goal.
  22744. Retracting elaborate*copy-see-to-output-link
  22745. -->
  22746. (I3 ^see 0 +)
  22747. Retracting propose*predict-no
  22748. -->
  22749. (O2104 ^name predict-no +)
  22750. (S1 ^operator O2104 +)
  22751. Retracting propose*predict-yes
  22752. -->
  22753. (O2103 ^name predict-yes +)
  22754. (S1 ^operator O2103 +)
  22755. Retracting elaborate*reward*based*on*reward
  22756. -->
  22757. (R1055 ^value 1 +)
  22758. (R1 ^reward R1055 +)
  22759. Retracting elaborate*copy-dir-to-output-link
  22760. -->
  22761. (I3 ^dir U +)
  22762. Retracting rl*prefer*rvt*predict-no*H0*4
  22763. -->
  22764. (S1 ^operator O2104 = 1.)
  22765. Retracting rl*prefer*rvt*predict-yes*H0*3
  22766. -->
  22767. (S1 ^operator O2103 = 0.)
  22768. =>WM: (14766: S1 ^operator O2106 +)
  22769. =>WM: (14765: S1 ^operator O2105 +)
  22770. =>WM: (14764: I3 ^dir R)
  22771. =>WM: (14763: O2106 ^name predict-no)
  22772. =>WM: (14762: O2105 ^name predict-yes)
  22773. =>WM: (14761: R1056 ^value 1)
  22774. =>WM: (14760: R1 ^reward R1056)
  22775. <=WM: (14751: S1 ^operator O2103 +)
  22776. <=WM: (14752: S1 ^operator O2104 +)
  22777. <=WM: (14753: S1 ^operator O2104)
  22778. <=WM: (14737: I3 ^dir U)
  22779. <=WM: (14747: R1 ^reward R1055)
  22780. <=WM: (14750: O2104 ^name predict-no)
  22781. <=WM: (14749: O2103 ^name predict-yes)
  22782. <=WM: (14748: R1055 ^value 1)
  22783. --- Inner Elaboration Phase, active level 1 (S1) ---
  22784. Firing prefer*rvt*predict-yes*H0
  22785. -->
  22786. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22787. -->
  22788. (S1 ^operator O2105 = 0.7057765679517091)
  22789. Firing rl*prefer*rvt*predict-yes*H0*5
  22790. -->
  22791. (S1 ^operator O2105 = 0.2940389010748334)
  22792. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22793. -->
  22794. Firing prefer*rvt*predict-no*H0
  22795. -->
  22796. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22797. -->
  22798. (S1 ^operator O2106 = -0.2023211881870005)
  22799. Firing rl*prefer*rvt*predict-no*H0*6
  22800. -->
  22801. (S1 ^operator O2106 = 0.2298592186043533)
  22802. Firing prefer*rvt*predict-no*H0*6*v1*H1
  22803. -->
  22804. inner elaboration loop at bottom goal.
  22805. Retracting rl*prefer*rvt*predict-no*H0*6
  22806. -->
  22807. (S1 ^operator O2104 = 0.2298592186043533)
  22808. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22809. -->
  22810. (S1 ^operator O2104 = -0.2023211881870005)
  22811. Retracting rl*prefer*rvt*predict-yes*H0*5
  22812. -->
  22813. (S1 ^operator O2103 = 0.2940389010748334)
  22814. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22815. -->
  22816. (S1 ^operator O2103 = 0.7057765679517091)
  22817. --- END Proposal Phase ---
  22818. --- Decision Phase ---
  22819. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22820. =>WM: (14767: S1 ^operator O2105)
  22821. 1053: O: O2105 (predict-yes)
  22822. --- END Decision Phase ---
  22823. --- Application Phase ---
  22824. --- Firing Productions (PE) For State At Depth 1 ---
  22825. --- Inner Elaboration Phase, active level 1 (S1) ---
  22826. Firing apply*operator
  22827. -->
  22828. (I3 ^predict-yes N1053 + :O )
  22829. Firing apply*operator*complete
  22830. -->
  22831. (I3 ^predict-no N1052 - :O )
  22832. inner elaboration loop at bottom goal.
  22833. --- Change Working Memory (PE) ---
  22834. =>WM: (14768: I3 ^predict-yes N1053)
  22835. <=WM: (14755: N1052 ^status complete)
  22836. <=WM: (14754: I3 ^predict-no N1052)
  22837. --- Firing Productions (IE) For State At Depth 1 ---
  22838. --- Inner Elaboration Phase, active level 1 (S1) ---
  22839. Firing monitor*world
  22840. -->
  22841. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22842. --- Change Working Memory (IE) ---
  22843. --- END Application Phase ---
  22844. --- Output Phase ---
  22845. ENV: Agent did: predict-yes for direction R in state State-A
  22846. In State-A moving R
  22847. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22848. predict error 0
  22849. dir: dir isR
  22850. --- END Output Phase ---
  22851. \-/--- Input Phase ---
  22852. =>WM: (14772: I2 ^dir R)
  22853. =>WM: (14771: I2 ^reward 1)
  22854. =>WM: (14770: I2 ^see 1)
  22855. =>WM: (14769: N1053 ^status complete)
  22856. <=WM: (14758: I2 ^dir R)
  22857. <=WM: (14757: I2 ^reward 1)
  22858. <=WM: (14756: I2 ^see 0)
  22859. =>WM: (14773: I2 ^level-1 R1-root)
  22860. <=WM: (14759: I2 ^level-1 L0-root)
  22861. --- END Input Phase ---
  22862. --- Proposal Phase ---
  22863. --- Inner Elaboration Phase, active level 1 (S1) ---
  22864. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  22865. -->
  22866. (S1 ^operator O2105 = -0.252585164213872)
  22867. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  22868. -->
  22869. (S1 ^operator O2106 = 0.770163750477286)
  22870. Firing prefer*rvt*predict-no*H0*6*v1*H1
  22871. -->
  22872. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22873. -->
  22874. Firing elaborate*copy-see-to-output-link
  22875. -->
  22876. (I3 ^see 1 +)
  22877. Firing elaborate*reward*based*on*reward
  22878. -->
  22879. (R1057 ^value 1 +)
  22880. (R1 ^reward R1057 +)
  22881. Firing propose*predict-yes
  22882. -->
  22883. (O2107 ^name predict-yes +)
  22884. (S1 ^operator O2107 +)
  22885. Firing propose*predict-no
  22886. -->
  22887. (O2108 ^name predict-no +)
  22888. (S1 ^operator O2108 +)
  22889. Firing rl*prefer*rvt*predict-no*H0*6
  22890. -->
  22891. (S1 ^operator O2106 = 0.2298592186043533)
  22892. Firing rl*prefer*rvt*predict-yes*H0*5
  22893. -->
  22894. (S1 ^operator O2105 = 0.2940389010748334)
  22895. Firing prefer*rvt*predict-yes*H0
  22896. -->
  22897. Firing prefer*rvt*predict-no*H0
  22898. -->
  22899. Firing elaborate*copy-dir-to-output-link
  22900. -->
  22901. (I3 ^dir R +)
  22902. inner elaboration loop at bottom goal.
  22903. Retracting elaborate*copy-see-to-output-link
  22904. -->
  22905. (I3 ^see 0 +)
  22906. Retracting propose*predict-no
  22907. -->
  22908. (O2106 ^name predict-no +)
  22909. (S1 ^operator O2106 +)
  22910. Retracting propose*predict-yes
  22911. -->
  22912. (O2105 ^name predict-yes +)
  22913. (S1 ^operator O2105 +)
  22914. Retracting elaborate*reward*based*on*reward
  22915. -->
  22916. (R1056 ^value 1 +)
  22917. (R1 ^reward R1056 +)
  22918. Retracting elaborate*copy-dir-to-output-link
  22919. -->
  22920. (I3 ^dir R +)
  22921. Retracting rl*prefer*rvt*predict-no*H0*6
  22922. -->
  22923. (S1 ^operator O2106 = 0.2298592186043533)
  22924. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  22925. -->
  22926. (S1 ^operator O2106 = -0.2023211881870005)
  22927. Retracting rl*prefer*rvt*predict-yes*H0*5
  22928. -->
  22929. (S1 ^operator O2105 = 0.2940389010748334)
  22930. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  22931. -->
  22932. (S1 ^operator O2105 = 0.7057765679517091)
  22933. =>WM: (14780: S1 ^operator O2108 +)
  22934. =>WM: (14779: S1 ^operator O2107 +)
  22935. =>WM: (14778: O2108 ^name predict-no)
  22936. =>WM: (14777: O2107 ^name predict-yes)
  22937. =>WM: (14776: R1057 ^value 1)
  22938. =>WM: (14775: R1 ^reward R1057)
  22939. =>WM: (14774: I3 ^see 1)
  22940. <=WM: (14765: S1 ^operator O2105 +)
  22941. <=WM: (14767: S1 ^operator O2105)
  22942. <=WM: (14766: S1 ^operator O2106 +)
  22943. <=WM: (14760: R1 ^reward R1056)
  22944. <=WM: (14732: I3 ^see 0)
  22945. <=WM: (14763: O2106 ^name predict-no)
  22946. <=WM: (14762: O2105 ^name predict-yes)
  22947. <=WM: (14761: R1056 ^value 1)
  22948. --- Inner Elaboration Phase, active level 1 (S1) ---
  22949. Firing prefer*rvt*predict-yes*H0
  22950. -->
  22951. Firing rl*prefer*rvt*predict-yes*H0*5
  22952. -->
  22953. (S1 ^operator O2107 = 0.2940389010748334)
  22954. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22955. -->
  22956. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  22957. -->
  22958. (S1 ^operator O2107 = -0.252585164213872)
  22959. Firing prefer*rvt*predict-no*H0
  22960. -->
  22961. Firing rl*prefer*rvt*predict-no*H0*6
  22962. -->
  22963. (S1 ^operator O2108 = 0.2298592186043533)
  22964. Firing prefer*rvt*predict-no*H0*6*v1*H1
  22965. -->
  22966. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  22967. -->
  22968. (S1 ^operator O2108 = 0.770163750477286)
  22969. inner elaboration loop at bottom goal.
  22970. Retracting rl*prefer*rvt*predict-no*H0*6
  22971. -->
  22972. (S1 ^operator O2106 = 0.2298592186043533)
  22973. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  22974. -->
  22975. (S1 ^operator O2106 = 0.770163750477286)
  22976. Retracting rl*prefer*rvt*predict-yes*H0*5
  22977. -->
  22978. (S1 ^operator O2105 = 0.2940389010748334)
  22979. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  22980. -->
  22981. (S1 ^operator O2105 = -0.252585164213872)
  22982. --- END Proposal Phase ---
  22983. --- Decision Phase ---
  22984. RL update rl*prefer*rvt*predict-yes*H0*5 0.501109 -0.20707 0.294039 -> 0.501123 -0.207069 0.294054(R,m,v=1,0.855422,0.124425)
  22985. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498724 0.207053 0.705777 -> 0.49874 0.207054 0.705794(R,m,v=1,1,0)
  22986. =>WM: (14781: S1 ^operator O2108)
  22987. 1054: O: O2108 (predict-no)
  22988. --- END Decision Phase ---
  22989. --- Application Phase ---
  22990. --- Firing Productions (PE) For State At Depth 1 ---
  22991. --- Inner Elaboration Phase, active level 1 (S1) ---
  22992. Firing apply*operator
  22993. -->
  22994. (I3 ^predict-no N1054 + :O )
  22995. Firing apply*operator*complete
  22996. -->
  22997. (I3 ^predict-yes N1053 - :O )
  22998. inner elaboration loop at bottom goal.
  22999. --- Change Working Memory (PE) ---
  23000. =>WM: (14782: I3 ^predict-no N1054)
  23001. <=WM: (14769: N1053 ^status complete)
  23002. <=WM: (14768: I3 ^predict-yes N1053)
  23003. --- Firing Productions (IE) For State At Depth 1 ---
  23004. --- Inner Elaboration Phase, active level 1 (S1) ---
  23005. Firing monitor*world
  23006. -->
  23007. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23008. --- Change Working Memory (IE) ---
  23009. --- END Application Phase ---
  23010. --- Output Phase ---
  23011. ENV: Agent did: predict-no for direction R in state State-B
  23012. In State-B moving R
  23013. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23014. predict error 0
  23015. dir: dir isR
  23016. --- END Output Phase ---
  23017. |\---- Input Phase ---
  23018. =>WM: (14786: I2 ^dir R)
  23019. =>WM: (14785: I2 ^reward 1)
  23020. =>WM: (14784: I2 ^see 0)
  23021. =>WM: (14783: N1054 ^status complete)
  23022. <=WM: (14772: I2 ^dir R)
  23023. <=WM: (14771: I2 ^reward 1)
  23024. <=WM: (14770: I2 ^see 1)
  23025. =>WM: (14787: I2 ^level-1 R0-root)
  23026. <=WM: (14773: I2 ^level-1 R1-root)
  23027. --- END Input Phase ---
  23028. --- Proposal Phase ---
  23029. --- Inner Elaboration Phase, active level 1 (S1) ---
  23030. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  23031. -->
  23032. (S1 ^operator O2107 = -0.1254042659579056)
  23033. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  23034. -->
  23035. (S1 ^operator O2108 = 0.7701073432202794)
  23036. Firing prefer*rvt*predict-no*H0*6*v1*H1
  23037. -->
  23038. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  23039. -->
  23040. Firing elaborate*copy-see-to-output-link
  23041. -->
  23042. (I3 ^see 0 +)
  23043. Firing elaborate*reward*based*on*reward
  23044. -->
  23045. (R1058 ^value 1 +)
  23046. (R1 ^reward R1058 +)
  23047. Firing propose*predict-yes
  23048. -->
  23049. (O2109 ^name predict-yes +)
  23050. (S1 ^operator O2109 +)
  23051. Firing propose*predict-no
  23052. -->
  23053. (O2110 ^name predict-no +)
  23054. (S1 ^operator O2110 +)
  23055. Firing rl*prefer*rvt*predict-no*H0*6
  23056. -->
  23057. (S1 ^operator O2108 = 0.2298592186043533)
  23058. Firing rl*prefer*rvt*predict-yes*H0*5
  23059. -->
  23060. (S1 ^operator O2107 = 0.2940539968979803)
  23061. Firing prefer*rvt*predict-yes*H0
  23062. -->
  23063. Firing prefer*rvt*predict-no*H0
  23064. -->
  23065. Firing elaborate*copy-dir-to-output-link
  23066. -->
  23067. (I3 ^dir R +)
  23068. inner elaboration loop at bottom goal.
  23069. Retracting elaborate*copy-see-to-output-link
  23070. -->
  23071. (I3 ^see 1 +)
  23072. Retracting propose*predict-no
  23073. -->
  23074. (O2108 ^name predict-no +)
  23075. (S1 ^operator O2108 +)
  23076. Retracting propose*predict-yes
  23077. -->
  23078. (O2107 ^name predict-yes +)
  23079. (S1 ^operator O2107 +)
  23080. Retracting elaborate*reward*based*on*reward
  23081. -->
  23082. (R1057 ^value 1 +)
  23083. (R1 ^reward R1057 +)
  23084. Retracting elaborate*copy-dir-to-output-link
  23085. -->
  23086. (I3 ^dir R +)
  23087. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  23088. -->
  23089. (S1 ^operator O2108 = 0.770163750477286)
  23090. Retracting rl*prefer*rvt*predict-no*H0*6
  23091. -->
  23092. (S1 ^operator O2108 = 0.2298592186043533)
  23093. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  23094. -->
  23095. (S1 ^operator O2107 = -0.252585164213872)
  23096. Retracting rl*prefer*rvt*predict-yes*H0*5
  23097. -->
  23098. (S1 ^operator O2107 = 0.2940539968979803)
  23099. =>WM: (14794: S1 ^operator O2110 +)
  23100. =>WM: (14793: S1 ^operator O2109 +)
  23101. =>WM: (14792: O2110 ^name predict-no)
  23102. =>WM: (14791: O2109 ^name predict-yes)
  23103. =>WM: (14790: R1058 ^value 1)
  23104. =>WM: (14789: R1 ^reward R1058)
  23105. =>WM: (14788: I3 ^see 0)
  23106. <=WM: (14779: S1 ^operator O2107 +)
  23107. <=WM: (14780: S1 ^operator O2108 +)
  23108. <=WM: (14781: S1 ^operator O2108)
  23109. <=WM: (14775: R1 ^reward R1057)
  23110. <=WM: (14774: I3 ^see 1)
  23111. <=WM: (14778: O2108 ^name predict-no)
  23112. <=WM: (14777: O2107 ^name predict-yes)
  23113. <=WM: (14776: R1057 ^value 1)
  23114. --- Inner Elaboration Phase, active level 1 (S1) ---
  23115. Firing prefer*rvt*predict-yes*H0
  23116. -->
  23117. Firing rl*prefer*rvt*predict-yes*H0*5
  23118. -->
  23119. (S1 ^operator O2109 = 0.2940539968979803)
  23120. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  23121. -->
  23122. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  23123. -->
  23124. (S1 ^operator O2109 = -0.1254042659579056)
  23125. Firing prefer*rvt*predict-no*H0
  23126. -->
  23127. Firing rl*prefer*rvt*predict-no*H0*6
  23128. -->
  23129. (S1 ^operator O2110 = 0.2298592186043533)
  23130. Firing prefer*rvt*predict-no*H0*6*v1*H1
  23131. -->
  23132. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  23133. -->
  23134. (S1 ^operator O2110 = 0.7701073432202794)
  23135. inner elaboration loop at bottom goal.
  23136. Retracting rl*prefer*rvt*predict-no*H0*6
  23137. -->
  23138. (S1 ^operator O2108 = 0.2298592186043533)
  23139. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  23140. -->
  23141. (S1 ^operator O2108 = 0.7701073432202794)
  23142. Retracting rl*prefer*rvt*predict-yes*H0*5
  23143. -->
  23144. (S1 ^operator O2107 = 0.2940539968979803)
  23145. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  23146. -->
  23147. (S1 ^operator O2107 = -0.1254042659579056)
  23148. --- END Proposal Phase ---
  23149. --- Decision Phase ---
  23150. RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229859 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.853261,0.125891)
  23151. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388107 0.382057 0.770164 -> 0.388105 0.382056 0.770162(R,m,v=1,1,0)
  23152. =>WM: (14795: S1 ^operator O2110)
  23153. 1055: O: O2110 (predict-no)
  23154. --- END Decision Phase ---
  23155. --- Application Phase ---
  23156. --- Firing Productions (PE) For State At Depth 1 ---
  23157. --- Inner Elaboration Phase, active level 1 (S1) ---
  23158. Firing apply*operator
  23159. -->
  23160. (I3 ^predict-no N1055 + :O )
  23161. Firing apply*operator*complete
  23162. -->
  23163. (I3 ^predict-no N1054 - :O )
  23164. inner elaboration loop at bottom goal.
  23165. --- Change Working Memory (PE) ---
  23166. =>WM: (14796: I3 ^predict-no N1055)
  23167. <=WM: (14783: N1054 ^status complete)
  23168. <=WM: (14782: I3 ^predict-no N1054)
  23169. --- Firing Productions (IE) For State At Depth 1 ---
  23170. --- Inner Elaboration Phase, active level 1 (S1) ---
  23171. Firing monitor*world
  23172. -->
  23173. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23174. --- Change Working Memory (IE) ---
  23175. --- END Application Phase ---
  23176. --- Output Phase ---
  23177. ENV: Agent did: predict-no for direction R in state State-B
  23178. In State-B moving R
  23179. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23180. predict error 0
  23181. dir: dir isL
  23182. --- END Output Phase ---
  23183. /|\--- Input Phase ---
  23184. =>WM: (14800: I2 ^dir L)
  23185. =>WM: (14799: I2 ^reward 1)
  23186. =>WM: (14798: I2 ^see 0)
  23187. =>WM: (14797: N1055 ^status complete)
  23188. <=WM: (14786: I2 ^dir R)
  23189. <=WM: (14785: I2 ^reward 1)
  23190. <=WM: (14784: I2 ^see 0)
  23191. =>WM: (14801: I2 ^level-1 R0-root)
  23192. <=WM: (14787: I2 ^level-1 R0-root)
  23193. --- END Input Phase ---
  23194. --- Proposal Phase ---
  23195. --- Inner Elaboration Phase, active level 1 (S1) ---
  23196. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  23197. -->
  23198. (S1 ^operator O2109 = 0.6195747904526593)
  23199. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  23200. -->
  23201. (S1 ^operator O2110 = -0.2190661556260421)
  23202. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23203. -->
  23204. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23205. -->
  23206. Firing elaborate*copy-see-to-output-link
  23207. -->
  23208. (I3 ^see 0 +)
  23209. Firing elaborate*reward*based*on*reward
  23210. -->
  23211. (R1059 ^value 1 +)
  23212. (R1 ^reward R1059 +)
  23213. Firing propose*predict-yes
  23214. -->
  23215. (O2111 ^name predict-yes +)
  23216. (S1 ^operator O2111 +)
  23217. Firing propose*predict-no
  23218. -->
  23219. (O2112 ^name predict-no +)
  23220. (S1 ^operator O2112 +)
  23221. Firing rl*prefer*rvt*predict-no*H0*2
  23222. -->
  23223. (S1 ^operator O2110 = 0.3140207031247883)
  23224. Firing rl*prefer*rvt*predict-yes*H0*1
  23225. -->
  23226. (S1 ^operator O2109 = 0.3804124062940181)
  23227. Firing prefer*rvt*predict-yes*H0
  23228. -->
  23229. Firing prefer*rvt*predict-no*H0
  23230. -->
  23231. Firing elaborate*copy-dir-to-output-link
  23232. -->
  23233. (I3 ^dir L +)
  23234. inner elaboration loop at bottom goal.
  23235. Retracting elaborate*copy-see-to-output-link
  23236. -->
  23237. (I3 ^see 0 +)
  23238. Retracting propose*predict-no
  23239. -->
  23240. (O2110 ^name predict-no +)
  23241. (S1 ^operator O2110 +)
  23242. Retracting propose*predict-yes
  23243. -->
  23244. (O2109 ^name predict-yes +)
  23245. (S1 ^operator O2109 +)
  23246. Retracting elaborate*reward*based*on*reward
  23247. -->
  23248. (R1058 ^value 1 +)
  23249. (R1 ^reward R1058 +)
  23250. Retracting elaborate*copy-dir-to-output-link
  23251. -->
  23252. (I3 ^dir R +)
  23253. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  23254. -->
  23255. (S1 ^operator O2110 = 0.7701073432202794)
  23256. Retracting rl*prefer*rvt*predict-no*H0*6
  23257. -->
  23258. (S1 ^operator O2110 = 0.2298573707106232)
  23259. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  23260. -->
  23261. (S1 ^operator O2109 = -0.1254042659579056)
  23262. Retracting rl*prefer*rvt*predict-yes*H0*5
  23263. -->
  23264. (S1 ^operator O2109 = 0.2940539968979803)
  23265. =>WM: (14808: S1 ^operator O2112 +)
  23266. =>WM: (14807: S1 ^operator O2111 +)
  23267. =>WM: (14806: I3 ^dir L)
  23268. =>WM: (14805: O2112 ^name predict-no)
  23269. =>WM: (14804: O2111 ^name predict-yes)
  23270. =>WM: (14803: R1059 ^value 1)
  23271. =>WM: (14802: R1 ^reward R1059)
  23272. <=WM: (14793: S1 ^operator O2109 +)
  23273. <=WM: (14794: S1 ^operator O2110 +)
  23274. <=WM: (14795: S1 ^operator O2110)
  23275. <=WM: (14764: I3 ^dir R)
  23276. <=WM: (14789: R1 ^reward R1058)
  23277. <=WM: (14792: O2110 ^name predict-no)
  23278. <=WM: (14791: O2109 ^name predict-yes)
  23279. <=WM: (14790: R1058 ^value 1)
  23280. --- Inner Elaboration Phase, active level 1 (S1) ---
  23281. Firing prefer*rvt*predict-yes*H0
  23282. -->
  23283. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  23284. -->
  23285. (S1 ^operator O2111 = 0.6195747904526593)
  23286. Firing rl*prefer*rvt*predict-yes*H0*1
  23287. -->
  23288. (S1 ^operator O2111 = 0.3804124062940181)
  23289. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23290. -->
  23291. Firing prefer*rvt*predict-no*H0
  23292. -->
  23293. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  23294. -->
  23295. (S1 ^operator O2112 = -0.2190661556260421)
  23296. Firing rl*prefer*rvt*predict-no*H0*2
  23297. -->
  23298. (S1 ^operator O2112 = 0.3140207031247883)
  23299. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23300. -->
  23301. inner elaboration loop at bottom goal.
  23302. Retracting rl*prefer*rvt*predict-no*H0*2
  23303. -->
  23304. (S1 ^operator O2110 = 0.3140207031247883)
  23305. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  23306. -->
  23307. (S1 ^operator O2110 = -0.2190661556260421)
  23308. Retracting rl*prefer*rvt*predict-yes*H0*1
  23309. -->
  23310. (S1 ^operator O2109 = 0.3804124062940181)
  23311. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  23312. -->
  23313. (S1 ^operator O2109 = 0.6195747904526593)
  23314. --- END Proposal Phase ---
  23315. --- Decision Phase ---
  23316. RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611912 -0.382052 0.22986(R,m,v=1,0.854054,0.125323)
  23317. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388061 0.382046 0.770107 -> 0.388064 0.382047 0.770111(R,m,v=1,1,0)
  23318. =>WM: (14809: S1 ^operator O2111)
  23319. 1056: O: O2111 (predict-yes)
  23320. --- END Decision Phase ---
  23321. --- Application Phase ---
  23322. --- Firing Productions (PE) For State At Depth 1 ---
  23323. --- Inner Elaboration Phase, active level 1 (S1) ---
  23324. Firing apply*operator
  23325. -->
  23326. (I3 ^predict-yes N1056 + :O )
  23327. Firing apply*operator*complete
  23328. -->
  23329. (I3 ^predict-no N1055 - :O )
  23330. inner elaboration loop at bottom goal.
  23331. --- Change Working Memory (PE) ---
  23332. =>WM: (14810: I3 ^predict-yes N1056)
  23333. <=WM: (14797: N1055 ^status complete)
  23334. <=WM: (14796: I3 ^predict-no N1055)
  23335. --- Firing Productions (IE) For State At Depth 1 ---
  23336. --- Inner Elaboration Phase, active level 1 (S1) ---
  23337. Firing monitor*world
  23338. -->
  23339. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23340. --- Change Working Memory (IE) ---
  23341. --- END Application Phase ---
  23342. --- Output Phase ---
  23343. ENV: Agent did: predict-yes for direction L in state State-B
  23344. In State-B moving L
  23345. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  23346. predict error 0
  23347. dir: dir isL
  23348. --- END Output Phase ---
  23349. -/--- Input Phase ---
  23350. =>WM: (14814: I2 ^dir L)
  23351. =>WM: (14813: I2 ^reward 1)
  23352. =>WM: (14812: I2 ^see 1)
  23353. =>WM: (14811: N1056 ^status complete)
  23354. <=WM: (14800: I2 ^dir L)
  23355. <=WM: (14799: I2 ^reward 1)
  23356. <=WM: (14798: I2 ^see 0)
  23357. =>WM: (14815: I2 ^level-1 L1-root)
  23358. <=WM: (14801: I2 ^level-1 R0-root)
  23359. --- END Input Phase ---
  23360. --- Proposal Phase ---
  23361. --- Inner Elaboration Phase, active level 1 (S1) ---
  23362. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  23363. -->
  23364. (S1 ^operator O2111 = -0.3470159027404986)
  23365. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  23366. -->
  23367. (S1 ^operator O2112 = 0.6860547040638999)
  23368. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23369. -->
  23370. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23371. -->
  23372. Firing elaborate*copy-see-to-output-link
  23373. -->
  23374. (I3 ^see 1 +)
  23375. Firing elaborate*reward*based*on*reward
  23376. -->
  23377. (R1060 ^value 1 +)
  23378. (R1 ^reward R1060 +)
  23379. Firing propose*predict-yes
  23380. -->
  23381. (O2113 ^name predict-yes +)
  23382. (S1 ^operator O2113 +)
  23383. Firing propose*predict-no
  23384. -->
  23385. (O2114 ^name predict-no +)
  23386. (S1 ^operator O2114 +)
  23387. Firing rl*prefer*rvt*predict-no*H0*2
  23388. -->
  23389. (S1 ^operator O2112 = 0.3140207031247883)
  23390. Firing rl*prefer*rvt*predict-yes*H0*1
  23391. -->
  23392. (S1 ^operator O2111 = 0.3804124062940181)
  23393. Firing prefer*rvt*predict-yes*H0
  23394. -->
  23395. Firing prefer*rvt*predict-no*H0
  23396. -->
  23397. Firing elaborate*copy-dir-to-output-link
  23398. -->
  23399. (I3 ^dir L +)
  23400. inner elaboration loop at bottom goal.
  23401. Retracting elaborate*copy-see-to-output-link
  23402. -->
  23403. (I3 ^see 0 +)
  23404. Retracting propose*predict-no
  23405. -->
  23406. (O2112 ^name predict-no +)
  23407. (S1 ^operator O2112 +)
  23408. Retracting propose*predict-yes
  23409. -->
  23410. (O2111 ^name predict-yes +)
  23411. (S1 ^operator O2111 +)
  23412. Retracting elaborate*reward*based*on*reward
  23413. -->
  23414. (R1059 ^value 1 +)
  23415. (R1 ^reward R1059 +)
  23416. Retracting elaborate*copy-dir-to-output-link
  23417. -->
  23418. (I3 ^dir L +)
  23419. Retracting rl*prefer*rvt*predict-no*H0*2
  23420. -->
  23421. (S1 ^operator O2112 = 0.3140207031247883)
  23422. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  23423. -->
  23424. (S1 ^operator O2112 = -0.2190661556260421)
  23425. Retracting rl*prefer*rvt*predict-yes*H0*1
  23426. -->
  23427. (S1 ^operator O2111 = 0.3804124062940181)
  23428. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  23429. -->
  23430. (S1 ^operator O2111 = 0.6195747904526593)
  23431. =>WM: (14822: S1 ^operator O2114 +)
  23432. =>WM: (14821: S1 ^operator O2113 +)
  23433. =>WM: (14820: O2114 ^name predict-no)
  23434. =>WM: (14819: O2113 ^name predict-yes)
  23435. =>WM: (14818: R1060 ^value 1)
  23436. =>WM: (14817: R1 ^reward R1060)
  23437. =>WM: (14816: I3 ^see 1)
  23438. <=WM: (14807: S1 ^operator O2111 +)
  23439. <=WM: (14809: S1 ^operator O2111)
  23440. <=WM: (14808: S1 ^operator O2112 +)
  23441. <=WM: (14802: R1 ^reward R1059)
  23442. <=WM: (14788: I3 ^see 0)
  23443. <=WM: (14805: O2112 ^name predict-no)
  23444. <=WM: (14804: O2111 ^name predict-yes)
  23445. <=WM: (14803: R1059 ^value 1)
  23446. --- Inner Elaboration Phase, active level 1 (S1) ---
  23447. Firing prefer*rvt*predict-yes*H0
  23448. -->
  23449. Firing rl*prefer*rvt*predict-yes*H0*1
  23450. -->
  23451. (S1 ^operator O2113 = 0.3804124062940181)
  23452. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23453. -->
  23454. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  23455. -->
  23456. (S1 ^operator O2113 = -0.3470159027404986)
  23457. Firing prefer*rvt*predict-no*H0
  23458. -->
  23459. Firing rl*prefer*rvt*predict-no*H0*2
  23460. -->
  23461. (S1 ^operator O2114 = 0.3140207031247883)
  23462. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23463. -->
  23464. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  23465. -->
  23466. (S1 ^operator O2114 = 0.6860547040638999)
  23467. inner elaboration loop at bottom goal.
  23468. Retracting rl*prefer*rvt*predict-no*H0*2
  23469. -->
  23470. (S1 ^operator O2112 = 0.3140207031247883)
  23471. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  23472. -->
  23473. (S1 ^operator O2112 = 0.6860547040638999)
  23474. Retracting rl*prefer*rvt*predict-yes*H0*1
  23475. -->
  23476. (S1 ^operator O2111 = 0.3804124062940181)
  23477. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  23478. -->
  23479. (S1 ^operator O2111 = -0.3470159027404986)
  23480. --- END Proposal Phase ---
  23481. --- Decision Phase ---
  23482. RL update rl*prefer*rvt*predict-yes*H0*1 0.521342 -0.14093 0.380412 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.840909,0.134545)
  23483. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478644 0.140931 0.619575 -> 0.478645 0.140931 0.619576(R,m,v=1,1,0)
  23484. =>WM: (14823: S1 ^operator O2114)
  23485. 1057: O: O2114 (predict-no)
  23486. --- END Decision Phase ---
  23487. --- Application Phase ---
  23488. --- Firing Productions (PE) For State At Depth 1 ---
  23489. --- Inner Elaboration Phase, active level 1 (S1) ---
  23490. Firing apply*operator
  23491. -->
  23492. (I3 ^predict-no N1057 + :O )
  23493. Firing apply*operator*complete
  23494. -->
  23495. (I3 ^predict-yes N1056 - :O )
  23496. inner elaboration loop at bottom goal.
  23497. --- Change Working Memory (PE) ---
  23498. =>WM: (14824: I3 ^predict-no N1057)
  23499. <=WM: (14811: N1056 ^status complete)
  23500. <=WM: (14810: I3 ^predict-yes N1056)
  23501. --- Firing Productions (IE) For State At Depth 1 ---
  23502. --- Inner Elaboration Phase, active level 1 (S1) ---
  23503. Firing monitor*world
  23504. -->
  23505. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23506. --- Change Working Memory (IE) ---
  23507. --- END Application Phase ---
  23508. --- Output Phase ---
  23509. ENV: Agent did: predict-no for direction L in state State-A
  23510. In State-A moving L
  23511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23512. predict error 0
  23513. dir: dir isL
  23514. --- END Output Phase ---
  23515. |\-/--- Input Phase ---
  23516. =>WM: (14828: I2 ^dir L)
  23517. =>WM: (14827: I2 ^reward 1)
  23518. =>WM: (14826: I2 ^see 0)
  23519. =>WM: (14825: N1057 ^status complete)
  23520. <=WM: (14814: I2 ^dir L)
  23521. <=WM: (14813: I2 ^reward 1)
  23522. <=WM: (14812: I2 ^see 1)
  23523. =>WM: (14829: I2 ^level-1 L0-root)
  23524. <=WM: (14815: I2 ^level-1 L1-root)
  23525. --- END Input Phase ---
  23526. --- Proposal Phase ---
  23527. --- Inner Elaboration Phase, active level 1 (S1) ---
  23528. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23529. -->
  23530. (S1 ^operator O2113 = -0.3332708974800781)
  23531. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23532. -->
  23533. (S1 ^operator O2114 = 0.685860669441134)
  23534. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23535. -->
  23536. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23537. -->
  23538. Firing elaborate*copy-see-to-output-link
  23539. -->
  23540. (I3 ^see 0 +)
  23541. Firing elaborate*reward*based*on*reward
  23542. -->
  23543. (R1061 ^value 1 +)
  23544. (R1 ^reward R1061 +)
  23545. Firing propose*predict-yes
  23546. -->
  23547. (O2115 ^name predict-yes +)
  23548. (S1 ^operator O2115 +)
  23549. Firing propose*predict-no
  23550. -->
  23551. (O2116 ^name predict-no +)
  23552. (S1 ^operator O2116 +)
  23553. Firing rl*prefer*rvt*predict-no*H0*2
  23554. -->
  23555. (S1 ^operator O2114 = 0.3140207031247883)
  23556. Firing rl*prefer*rvt*predict-yes*H0*1
  23557. -->
  23558. (S1 ^operator O2113 = 0.3804134437534242)
  23559. Firing prefer*rvt*predict-yes*H0
  23560. -->
  23561. Firing prefer*rvt*predict-no*H0
  23562. -->
  23563. Firing elaborate*copy-dir-to-output-link
  23564. -->
  23565. (I3 ^dir L +)
  23566. inner elaboration loop at bottom goal.
  23567. Retracting elaborate*copy-see-to-output-link
  23568. -->
  23569. (I3 ^see 1 +)
  23570. Retracting propose*predict-no
  23571. -->
  23572. (O2114 ^name predict-no +)
  23573. (S1 ^operator O2114 +)
  23574. Retracting propose*predict-yes
  23575. -->
  23576. (O2113 ^name predict-yes +)
  23577. (S1 ^operator O2113 +)
  23578. Retracting elaborate*reward*based*on*reward
  23579. -->
  23580. (R1060 ^value 1 +)
  23581. (R1 ^reward R1060 +)
  23582. Retracting elaborate*copy-dir-to-output-link
  23583. -->
  23584. (I3 ^dir L +)
  23585. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  23586. -->
  23587. (S1 ^operator O2114 = 0.6860547040638999)
  23588. Retracting rl*prefer*rvt*predict-no*H0*2
  23589. -->
  23590. (S1 ^operator O2114 = 0.3140207031247883)
  23591. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  23592. -->
  23593. (S1 ^operator O2113 = -0.3470159027404986)
  23594. Retracting rl*prefer*rvt*predict-yes*H0*1
  23595. -->
  23596. (S1 ^operator O2113 = 0.3804134437534242)
  23597. =>WM: (14836: S1 ^operator O2116 +)
  23598. =>WM: (14835: S1 ^operator O2115 +)
  23599. =>WM: (14834: O2116 ^name predict-no)
  23600. =>WM: (14833: O2115 ^name predict-yes)
  23601. =>WM: (14832: R1061 ^value 1)
  23602. =>WM: (14831: R1 ^reward R1061)
  23603. =>WM: (14830: I3 ^see 0)
  23604. <=WM: (14821: S1 ^operator O2113 +)
  23605. <=WM: (14822: S1 ^operator O2114 +)
  23606. <=WM: (14823: S1 ^operator O2114)
  23607. <=WM: (14817: R1 ^reward R1060)
  23608. <=WM: (14816: I3 ^see 1)
  23609. <=WM: (14820: O2114 ^name predict-no)
  23610. <=WM: (14819: O2113 ^name predict-yes)
  23611. <=WM: (14818: R1060 ^value 1)
  23612. --- Inner Elaboration Phase, active level 1 (S1) ---
  23613. Firing prefer*rvt*predict-yes*H0
  23614. -->
  23615. Firing rl*prefer*rvt*predict-yes*H0*1
  23616. -->
  23617. (S1 ^operator O2115 = 0.3804134437534242)
  23618. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23619. -->
  23620. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23621. -->
  23622. (S1 ^operator O2115 = -0.3332708974800781)
  23623. Firing prefer*rvt*predict-no*H0
  23624. -->
  23625. Firing rl*prefer*rvt*predict-no*H0*2
  23626. -->
  23627. (S1 ^operator O2116 = 0.3140207031247883)
  23628. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23629. -->
  23630. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23631. -->
  23632. (S1 ^operator O2116 = 0.685860669441134)
  23633. inner elaboration loop at bottom goal.
  23634. Retracting rl*prefer*rvt*predict-no*H0*2
  23635. -->
  23636. (S1 ^operator O2114 = 0.3140207031247883)
  23637. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23638. -->
  23639. (S1 ^operator O2114 = 0.685860669441134)
  23640. Retracting rl*prefer*rvt*predict-yes*H0*1
  23641. -->
  23642. (S1 ^operator O2113 = 0.3804134437534242)
  23643. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23644. -->
  23645. (S1 ^operator O2113 = -0.3332708974800781)
  23646. --- END Proposal Phase ---
  23647. --- Decision Phase ---
  23648. RL update rl*prefer*rvt*predict-no*H0*2 0.485031 -0.17101 0.314021 -> 0.485026 -0.171011 0.314015(R,m,v=1,0.871951,0.112337)
  23649. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515029 0.171026 0.686055 -> 0.515023 0.171024 0.686048(R,m,v=1,1,0)
  23650. =>WM: (14837: S1 ^operator O2116)
  23651. 1058: O: O2116 (predict-no)
  23652. --- END Decision Phase ---
  23653. --- Application Phase ---
  23654. --- Firing Productions (PE) For State At Depth 1 ---
  23655. --- Inner Elaboration Phase, active level 1 (S1) ---
  23656. Firing apply*operator
  23657. -->
  23658. (I3 ^predict-no N1058 + :O )
  23659. Firing apply*operator*complete
  23660. -->
  23661. (I3 ^predict-no N1057 - :O )
  23662. inner elaboration loop at bottom goal.
  23663. --- Change Working Memory (PE) ---
  23664. =>WM: (14838: I3 ^predict-no N1058)
  23665. <=WM: (14825: N1057 ^status complete)
  23666. <=WM: (14824: I3 ^predict-no N1057)
  23667. --- Firing Productions (IE) For State At Depth 1 ---
  23668. --- Inner Elaboration Phase, active level 1 (S1) ---
  23669. Firing monitor*world
  23670. -->
  23671. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23672. --- Change Working Memory (IE) ---
  23673. --- END Application Phase ---
  23674. --- Output Phase ---
  23675. ENV: Agent did: predict-no for direction L in state State-A
  23676. In State-A moving L
  23677. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23678. predict error 0
  23679. dir: dir isL
  23680. --- END Output Phase ---
  23681. |\---- Input Phase ---
  23682. =>WM: (14842: I2 ^dir L)
  23683. =>WM: (14841: I2 ^reward 1)
  23684. =>WM: (14840: I2 ^see 0)
  23685. =>WM: (14839: N1058 ^status complete)
  23686. <=WM: (14828: I2 ^dir L)
  23687. <=WM: (14827: I2 ^reward 1)
  23688. <=WM: (14826: I2 ^see 0)
  23689. =>WM: (14843: I2 ^level-1 L0-root)
  23690. <=WM: (14829: I2 ^level-1 L0-root)
  23691. --- END Input Phase ---
  23692. --- Proposal Phase ---
  23693. --- Inner Elaboration Phase, active level 1 (S1) ---
  23694. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23695. -->
  23696. (S1 ^operator O2115 = -0.3332708974800781)
  23697. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23698. -->
  23699. (S1 ^operator O2116 = 0.685860669441134)
  23700. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23701. -->
  23702. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23703. -->
  23704. Firing elaborate*copy-see-to-output-link
  23705. -->
  23706. (I3 ^see 0 +)
  23707. Firing elaborate*reward*based*on*reward
  23708. -->
  23709. (R1062 ^value 1 +)
  23710. (R1 ^reward R1062 +)
  23711. Firing propose*predict-yes
  23712. -->
  23713. (O2117 ^name predict-yes +)
  23714. (S1 ^operator O2117 +)
  23715. Firing propose*predict-no
  23716. -->
  23717. (O2118 ^name predict-no +)
  23718. (S1 ^operator O2118 +)
  23719. Firing rl*prefer*rvt*predict-no*H0*2
  23720. -->
  23721. (S1 ^operator O2116 = 0.3140145220723357)
  23722. Firing rl*prefer*rvt*predict-yes*H0*1
  23723. -->
  23724. (S1 ^operator O2115 = 0.3804134437534242)
  23725. Firing prefer*rvt*predict-yes*H0
  23726. -->
  23727. Firing prefer*rvt*predict-no*H0
  23728. -->
  23729. Firing elaborate*copy-dir-to-output-link
  23730. -->
  23731. (I3 ^dir L +)
  23732. inner elaboration loop at bottom goal.
  23733. Retracting elaborate*copy-see-to-output-link
  23734. -->
  23735. (I3 ^see 0 +)
  23736. Retracting propose*predict-no
  23737. -->
  23738. (O2116 ^name predict-no +)
  23739. (S1 ^operator O2116 +)
  23740. Retracting propose*predict-yes
  23741. -->
  23742. (O2115 ^name predict-yes +)
  23743. (S1 ^operator O2115 +)
  23744. Retracting elaborate*reward*based*on*reward
  23745. -->
  23746. (R1061 ^value 1 +)
  23747. (R1 ^reward R1061 +)
  23748. Retracting elaborate*copy-dir-to-output-link
  23749. -->
  23750. (I3 ^dir L +)
  23751. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23752. -->
  23753. (S1 ^operator O2116 = 0.685860669441134)
  23754. Retracting rl*prefer*rvt*predict-no*H0*2
  23755. -->
  23756. (S1 ^operator O2116 = 0.3140145220723357)
  23757. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23758. -->
  23759. (S1 ^operator O2115 = -0.3332708974800781)
  23760. Retracting rl*prefer*rvt*predict-yes*H0*1
  23761. -->
  23762. (S1 ^operator O2115 = 0.3804134437534242)
  23763. =>WM: (14849: S1 ^operator O2118 +)
  23764. =>WM: (14848: S1 ^operator O2117 +)
  23765. =>WM: (14847: O2118 ^name predict-no)
  23766. =>WM: (14846: O2117 ^name predict-yes)
  23767. =>WM: (14845: R1062 ^value 1)
  23768. =>WM: (14844: R1 ^reward R1062)
  23769. <=WM: (14835: S1 ^operator O2115 +)
  23770. <=WM: (14836: S1 ^operator O2116 +)
  23771. <=WM: (14837: S1 ^operator O2116)
  23772. <=WM: (14831: R1 ^reward R1061)
  23773. <=WM: (14834: O2116 ^name predict-no)
  23774. <=WM: (14833: O2115 ^name predict-yes)
  23775. <=WM: (14832: R1061 ^value 1)
  23776. --- Inner Elaboration Phase, active level 1 (S1) ---
  23777. Firing prefer*rvt*predict-yes*H0
  23778. -->
  23779. Firing rl*prefer*rvt*predict-yes*H0*1
  23780. -->
  23781. (S1 ^operator O2117 = 0.3804134437534242)
  23782. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  23783. -->
  23784. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23785. -->
  23786. (S1 ^operator O2117 = -0.3332708974800781)
  23787. Firing prefer*rvt*predict-no*H0
  23788. -->
  23789. Firing rl*prefer*rvt*predict-no*H0*2
  23790. -->
  23791. (S1 ^operator O2118 = 0.3140145220723357)
  23792. Firing prefer*rvt*predict-no*H0*2*v1*H1
  23793. -->
  23794. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23795. -->
  23796. (S1 ^operator O2118 = 0.685860669441134)
  23797. inner elaboration loop at bottom goal.
  23798. Retracting rl*prefer*rvt*predict-no*H0*2
  23799. -->
  23800. (S1 ^operator O2116 = 0.3140145220723357)
  23801. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23802. -->
  23803. (S1 ^operator O2116 = 0.685860669441134)
  23804. Retracting rl*prefer*rvt*predict-yes*H0*1
  23805. -->
  23806. (S1 ^operator O2115 = 0.3804134437534242)
  23807. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23808. -->
  23809. (S1 ^operator O2115 = -0.3332708974800781)
  23810. --- END Proposal Phase ---
  23811. --- Decision Phase ---
  23812. RL update rl*prefer*rvt*predict-no*H0*2 0.485026 -0.171011 0.314015 -> 0.485034 -0.171009 0.314025(R,m,v=1,0.872727,0.111752)
  23813. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514876 0.170985 0.685861 -> 0.514885 0.170988 0.685873(R,m,v=1,1,0)
  23814. =>WM: (14850: S1 ^operator O2118)
  23815. 1059: O: O2118 (predict-no)
  23816. --- END Decision Phase ---
  23817. --- Application Phase ---
  23818. --- Firing Productions (PE) For State At Depth 1 ---
  23819. --- Inner Elaboration Phase, active level 1 (S1) ---
  23820. Firing apply*operator
  23821. -->
  23822. (I3 ^predict-no N1059 + :O )
  23823. Firing apply*operator*complete
  23824. -->
  23825. (I3 ^predict-no N1058 - :O )
  23826. inner elaboration loop at bottom goal.
  23827. --- Change Working Memory (PE) ---
  23828. =>WM: (14851: I3 ^predict-no N1059)
  23829. <=WM: (14839: N1058 ^status complete)
  23830. <=WM: (14838: I3 ^predict-no N1058)
  23831. --- Firing Productions (IE) For State At Depth 1 ---
  23832. --- Inner Elaboration Phase, active level 1 (S1) ---
  23833. Firing monitor*world
  23834. -->
  23835. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23836. --- Change Working Memory (IE) ---
  23837. --- END Application Phase ---
  23838. --- Output Phase ---
  23839. ENV: Agent did: predict-no for direction L in state State-A
  23840. In State-A moving L
  23841. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23842. predict error 0
  23843. dir: dir isU
  23844. --- END Output Phase ---
  23845. /|\--- Input Phase ---
  23846. =>WM: (14855: I2 ^dir U)
  23847. =>WM: (14854: I2 ^reward 1)
  23848. =>WM: (14853: I2 ^see 0)
  23849. =>WM: (14852: N1059 ^status complete)
  23850. <=WM: (14842: I2 ^dir L)
  23851. <=WM: (14841: I2 ^reward 1)
  23852. <=WM: (14840: I2 ^see 0)
  23853. =>WM: (14856: I2 ^level-1 L0-root)
  23854. <=WM: (14843: I2 ^level-1 L0-root)
  23855. --- END Input Phase ---
  23856. --- Proposal Phase ---
  23857. --- Inner Elaboration Phase, active level 1 (S1) ---
  23858. Firing elaborate*copy-see-to-output-link
  23859. -->
  23860. (I3 ^see 0 +)
  23861. Firing elaborate*reward*based*on*reward
  23862. -->
  23863. (R1063 ^value 1 +)
  23864. (R1 ^reward R1063 +)
  23865. Firing propose*predict-yes
  23866. -->
  23867. (O2119 ^name predict-yes +)
  23868. (S1 ^operator O2119 +)
  23869. Firing propose*predict-no
  23870. -->
  23871. (O2120 ^name predict-no +)
  23872. (S1 ^operator O2120 +)
  23873. Firing rl*prefer*rvt*predict-no*H0*4
  23874. -->
  23875. (S1 ^operator O2118 = 1.)
  23876. Firing rl*prefer*rvt*predict-yes*H0*3
  23877. -->
  23878. (S1 ^operator O2117 = 0.)
  23879. Firing prefer*rvt*predict-yes*H0
  23880. -->
  23881. Firing prefer*rvt*predict-no*H0
  23882. -->
  23883. Firing elaborate*copy-dir-to-output-link
  23884. -->
  23885. (I3 ^dir U +)
  23886. inner elaboration loop at bottom goal.
  23887. Retracting elaborate*copy-see-to-output-link
  23888. -->
  23889. (I3 ^see 0 +)
  23890. Retracting propose*predict-no
  23891. -->
  23892. (O2118 ^name predict-no +)
  23893. (S1 ^operator O2118 +)
  23894. Retracting propose*predict-yes
  23895. -->
  23896. (O2117 ^name predict-yes +)
  23897. (S1 ^operator O2117 +)
  23898. Retracting elaborate*reward*based*on*reward
  23899. -->
  23900. (R1062 ^value 1 +)
  23901. (R1 ^reward R1062 +)
  23902. Retracting elaborate*copy-dir-to-output-link
  23903. -->
  23904. (I3 ^dir L +)
  23905. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  23906. -->
  23907. (S1 ^operator O2118 = 0.6858726594370528)
  23908. Retracting rl*prefer*rvt*predict-no*H0*2
  23909. -->
  23910. (S1 ^operator O2118 = 0.3140247423148079)
  23911. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  23912. -->
  23913. (S1 ^operator O2117 = -0.3332708974800781)
  23914. Retracting rl*prefer*rvt*predict-yes*H0*1
  23915. -->
  23916. (S1 ^operator O2117 = 0.3804134437534242)
  23917. =>WM: (14863: S1 ^operator O2120 +)
  23918. =>WM: (14862: S1 ^operator O2119 +)
  23919. =>WM: (14861: I3 ^dir U)
  23920. =>WM: (14860: O2120 ^name predict-no)
  23921. =>WM: (14859: O2119 ^name predict-yes)
  23922. =>WM: (14858: R1063 ^value 1)
  23923. =>WM: (14857: R1 ^reward R1063)
  23924. <=WM: (14848: S1 ^operator O2117 +)
  23925. <=WM: (14849: S1 ^operator O2118 +)
  23926. <=WM: (14850: S1 ^operator O2118)
  23927. <=WM: (14806: I3 ^dir L)
  23928. <=WM: (14844: R1 ^reward R1062)
  23929. <=WM: (14847: O2118 ^name predict-no)
  23930. <=WM: (14846: O2117 ^name predict-yes)
  23931. <=WM: (14845: R1062 ^value 1)
  23932. --- Inner Elaboration Phase, active level 1 (S1) ---
  23933. Firing prefer*rvt*predict-yes*H0
  23934. -->
  23935. Firing rl*prefer*rvt*predict-yes*H0*3
  23936. -->
  23937. (S1 ^operator O2119 = 0.)
  23938. Firing prefer*rvt*predict-no*H0
  23939. -->
  23940. Firing rl*prefer*rvt*predict-no*H0*4
  23941. -->
  23942. (S1 ^operator O2120 = 1.)
  23943. inner elaboration loop at bottom goal.
  23944. Retracting rl*prefer*rvt*predict-no*H0*4
  23945. -->
  23946. (S1 ^operator O2118 = 1.)
  23947. Retracting rl*prefer*rvt*predict-yes*H0*3
  23948. -->
  23949. (S1 ^operator O2117 = 0.)
  23950. --- END Proposal Phase ---
  23951. --- Decision Phase ---
  23952. RL update rl*prefer*rvt*predict-no*H0*2 0.485034 -0.171009 0.314025 -> 0.485041 -0.171007 0.314033(R,m,v=1,0.873494,0.111172)
  23953. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514885 0.170988 0.685873 -> 0.514893 0.17099 0.685882(R,m,v=1,1,0)
  23954. =>WM: (14864: S1 ^operator O2120)
  23955. 1060: O: O2120 (predict-no)
  23956. --- END Decision Phase ---
  23957. --- Application Phase ---
  23958. --- Firing Productions (PE) For State At Depth 1 ---
  23959. --- Inner Elaboration Phase, active level 1 (S1) ---
  23960. Firing apply*operator
  23961. -->
  23962. (I3 ^predict-no N1060 + :O )
  23963. Firing apply*operator*complete
  23964. -->
  23965. (I3 ^predict-no N1059 - :O )
  23966. inner elaboration loop at bottom goal.
  23967. --- Change Working Memory (PE) ---
  23968. =>WM: (14865: I3 ^predict-no N1060)
  23969. <=WM: (14852: N1059 ^status complete)
  23970. <=WM: (14851: I3 ^predict-no N1059)
  23971. --- Firing Productions (IE) For State At Depth 1 ---
  23972. --- Inner Elaboration Phase, active level 1 (S1) ---
  23973. Firing monitor*world
  23974. -->
  23975. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23976. --- Change Working Memory (IE) ---
  23977. --- END Application Phase ---
  23978. --- Output Phase ---
  23979. ENV: Agent did: predict-no for direction U in state State-A
  23980. In State-A moving U
  23981. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23982. predict error 0
  23983. dir: dir isU
  23984. --- END Output Phase ---
  23985. -/|--- Input Phase ---
  23986. =>WM: (14869: I2 ^dir U)
  23987. =>WM: (14868: I2 ^reward 1)
  23988. =>WM: (14867: I2 ^see 0)
  23989. =>WM: (14866: N1060 ^status complete)
  23990. <=WM: (14855: I2 ^dir U)
  23991. <=WM: (14854: I2 ^reward 1)
  23992. <=WM: (14853: I2 ^see 0)
  23993. =>WM: (14870: I2 ^level-1 L0-root)
  23994. <=WM: (14856: I2 ^level-1 L0-root)
  23995. --- END Input Phase ---
  23996. --- Proposal Phase ---
  23997. --- Inner Elaboration Phase, active level 1 (S1) ---
  23998. Firing elaborate*copy-see-to-output-link
  23999. -->
  24000. (I3 ^see 0 +)
  24001. Firing elaborate*reward*based*on*reward
  24002. -->
  24003. (R1064 ^value 1 +)
  24004. (R1 ^reward R1064 +)
  24005. Firing propose*predict-yes
  24006. -->
  24007. (O2121 ^name predict-yes +)
  24008. (S1 ^operator O2121 +)
  24009. Firing propose*predict-no
  24010. -->
  24011. (O2122 ^name predict-no +)
  24012. (S1 ^operator O2122 +)
  24013. Firing rl*prefer*rvt*predict-no*H0*4
  24014. -->
  24015. (S1 ^operator O2120 = 1.)
  24016. Firing rl*prefer*rvt*predict-yes*H0*3
  24017. -->
  24018. (S1 ^operator O2119 = 0.)
  24019. Firing prefer*rvt*predict-yes*H0
  24020. -->
  24021. Firing prefer*rvt*predict-no*H0
  24022. -->
  24023. Firing elaborate*copy-dir-to-output-link
  24024. -->
  24025. (I3 ^dir U +)
  24026. inner elaboration loop at bottom goal.
  24027. Retracting elaborate*copy-see-to-output-link
  24028. -->
  24029. (I3 ^see 0 +)
  24030. Retracting propose*predict-no
  24031. -->
  24032. (O2120 ^name predict-no +)
  24033. (S1 ^operator O2120 +)
  24034. Retracting propose*predict-yes
  24035. -->
  24036. (O2119 ^name predict-yes +)
  24037. (S1 ^operator O2119 +)
  24038. Retracting elaborate*reward*based*on*reward
  24039. -->
  24040. (R1063 ^value 1 +)
  24041. (R1 ^reward R1063 +)
  24042. Retracting elaborate*copy-dir-to-output-link
  24043. -->
  24044. (I3 ^dir U +)
  24045. Retracting rl*prefer*rvt*predict-no*H0*4
  24046. -->
  24047. (S1 ^operator O2120 = 1.)
  24048. Retracting rl*prefer*rvt*predict-yes*H0*3
  24049. -->
  24050. (S1 ^operator O2119 = 0.)
  24051. =>WM: (14876: S1 ^operator O2122 +)
  24052. =>WM: (14875: S1 ^operator O2121 +)
  24053. =>WM: (14874: O2122 ^name predict-no)
  24054. =>WM: (14873: O2121 ^name predict-yes)
  24055. =>WM: (14872: R1064 ^value 1)
  24056. =>WM: (14871: R1 ^reward R1064)
  24057. <=WM: (14862: S1 ^operator O2119 +)
  24058. <=WM: (14863: S1 ^operator O2120 +)
  24059. <=WM: (14864: S1 ^operator O2120)
  24060. <=WM: (14857: R1 ^reward R1063)
  24061. <=WM: (14860: O2120 ^name predict-no)
  24062. <=WM: (14859: O2119 ^name predict-yes)
  24063. <=WM: (14858: R1063 ^value 1)
  24064. --- Inner Elaboration Phase, active level 1 (S1) ---
  24065. Firing prefer*rvt*predict-yes*H0
  24066. -->
  24067. Firing rl*prefer*rvt*predict-yes*H0*3
  24068. -->
  24069. (S1 ^operator O2121 = 0.)
  24070. Firing prefer*rvt*predict-no*H0
  24071. -->
  24072. Firing rl*prefer*rvt*predict-no*H0*4
  24073. -->
  24074. (S1 ^operator O2122 = 1.)
  24075. inner elaboration loop at bottom goal.
  24076. Retracting rl*prefer*rvt*predict-no*H0*4
  24077. -->
  24078. (S1 ^operator O2120 = 1.)
  24079. Retracting rl*prefer*rvt*predict-yes*H0*3
  24080. -->
  24081. (S1 ^operator O2119 = 0.)
  24082. --- END Proposal Phase ---
  24083. --- Decision Phase ---
  24084. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24085. =>WM: (14877: S1 ^operator O2122)
  24086. 1061: O: O2122 (predict-no)
  24087. --- END Decision Phase ---
  24088. --- Application Phase ---
  24089. --- Firing Productions (PE) For State At Depth 1 ---
  24090. --- Inner Elaboration Phase, active level 1 (S1) ---
  24091. Firing apply*operator
  24092. -->
  24093. (I3 ^predict-no N1061 + :O )
  24094. Firing apply*operator*complete
  24095. -->
  24096. (I3 ^predict-no N1060 - :O )
  24097. inner elaboration loop at bottom goal.
  24098. --- Change Working Memory (PE) ---
  24099. =>WM: (14878: I3 ^predict-no N1061)
  24100. <=WM: (14866: N1060 ^status complete)
  24101. <=WM: (14865: I3 ^predict-no N1060)
  24102. --- Firing Productions (IE) For State At Depth 1 ---
  24103. --- Inner Elaboration Phase, active level 1 (S1) ---
  24104. Firing monitor*world
  24105. -->
  24106. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24107. --- Change Working Memory (IE) ---
  24108. --- END Application Phase ---
  24109. --- Output Phase ---
  24110. ENV: Agent did: predict-no for direction U in state State-A
  24111. In State-A moving U
  24112. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24113. predict error 0
  24114. dir: dir isL
  24115. --- END Output Phase ---
  24116. \--- Input Phase ---
  24117. =>WM: (14882: I2 ^dir L)
  24118. =>WM: (14881: I2 ^reward 1)
  24119. =>WM: (14880: I2 ^see 0)
  24120. =>WM: (14879: N1061 ^status complete)
  24121. <=WM: (14869: I2 ^dir U)
  24122. <=WM: (14868: I2 ^reward 1)
  24123. <=WM: (14867: I2 ^see 0)
  24124. =>WM: (14883: I2 ^level-1 L0-root)
  24125. <=WM: (14870: I2 ^level-1 L0-root)
  24126. --- END Input Phase ---
  24127. --- Proposal Phase ---
  24128. --- Inner Elaboration Phase, active level 1 (S1) ---
  24129. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24130. -->
  24131. (S1 ^operator O2121 = -0.3332708974800781)
  24132. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24133. -->
  24134. (S1 ^operator O2122 = 0.6858824877823619)
  24135. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24136. -->
  24137. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24138. -->
  24139. Firing elaborate*copy-see-to-output-link
  24140. -->
  24141. (I3 ^see 0 +)
  24142. Firing elaborate*reward*based*on*reward
  24143. -->
  24144. (R1065 ^value 1 +)
  24145. (R1 ^reward R1065 +)
  24146. Firing propose*predict-yes
  24147. -->
  24148. (O2123 ^name predict-yes +)
  24149. (S1 ^operator O2123 +)
  24150. Firing propose*predict-no
  24151. -->
  24152. (O2124 ^name predict-no +)
  24153. (S1 ^operator O2124 +)
  24154. Firing rl*prefer*rvt*predict-no*H0*2
  24155. -->
  24156. (S1 ^operator O2122 = 0.3140331355128715)
  24157. Firing rl*prefer*rvt*predict-yes*H0*1
  24158. -->
  24159. (S1 ^operator O2121 = 0.3804134437534242)
  24160. Firing prefer*rvt*predict-yes*H0
  24161. -->
  24162. Firing prefer*rvt*predict-no*H0
  24163. -->
  24164. Firing elaborate*copy-dir-to-output-link
  24165. -->
  24166. (I3 ^dir L +)
  24167. inner elaboration loop at bottom goal.
  24168. Retracting elaborate*copy-see-to-output-link
  24169. -->
  24170. (I3 ^see 0 +)
  24171. Retracting propose*predict-no
  24172. -->
  24173. (O2122 ^name predict-no +)
  24174. (S1 ^operator O2122 +)
  24175. Retracting propose*predict-yes
  24176. -->
  24177. (O2121 ^name predict-yes +)
  24178. (S1 ^operator O2121 +)
  24179. Retracting elaborate*reward*based*on*reward
  24180. -->
  24181. (R1064 ^value 1 +)
  24182. (R1 ^reward R1064 +)
  24183. Retracting elaborate*copy-dir-to-output-link
  24184. -->
  24185. (I3 ^dir U +)
  24186. Retracting rl*prefer*rvt*predict-no*H0*4
  24187. -->
  24188. (S1 ^operator O2122 = 1.)
  24189. Retracting rl*prefer*rvt*predict-yes*H0*3
  24190. -->
  24191. (S1 ^operator O2121 = 0.)
  24192. =>WM: (14890: S1 ^operator O2124 +)
  24193. =>WM: (14889: S1 ^operator O2123 +)
  24194. =>WM: (14888: I3 ^dir L)
  24195. =>WM: (14887: O2124 ^name predict-no)
  24196. =>WM: (14886: O2123 ^name predict-yes)
  24197. =>WM: (14885: R1065 ^value 1)
  24198. =>WM: (14884: R1 ^reward R1065)
  24199. <=WM: (14875: S1 ^operator O2121 +)
  24200. <=WM: (14876: S1 ^operator O2122 +)
  24201. <=WM: (14877: S1 ^operator O2122)
  24202. <=WM: (14861: I3 ^dir U)
  24203. <=WM: (14871: R1 ^reward R1064)
  24204. <=WM: (14874: O2122 ^name predict-no)
  24205. <=WM: (14873: O2121 ^name predict-yes)
  24206. <=WM: (14872: R1064 ^value 1)
  24207. --- Inner Elaboration Phase, active level 1 (S1) ---
  24208. Firing prefer*rvt*predict-yes*H0
  24209. -->
  24210. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24211. -->
  24212. (S1 ^operator O2123 = -0.3332708974800781)
  24213. Firing rl*prefer*rvt*predict-yes*H0*1
  24214. -->
  24215. (S1 ^operator O2123 = 0.3804134437534242)
  24216. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24217. -->
  24218. Firing prefer*rvt*predict-no*H0
  24219. -->
  24220. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24221. -->
  24222. (S1 ^operator O2124 = 0.6858824877823619)
  24223. Firing rl*prefer*rvt*predict-no*H0*2
  24224. -->
  24225. (S1 ^operator O2124 = 0.3140331355128715)
  24226. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24227. -->
  24228. inner elaboration loop at bottom goal.
  24229. Retracting rl*prefer*rvt*predict-no*H0*2
  24230. -->
  24231. (S1 ^operator O2122 = 0.3140331355128715)
  24232. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24233. -->
  24234. (S1 ^operator O2122 = 0.6858824877823619)
  24235. Retracting rl*prefer*rvt*predict-yes*H0*1
  24236. -->
  24237. (S1 ^operator O2121 = 0.3804134437534242)
  24238. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24239. -->
  24240. (S1 ^operator O2121 = -0.3332708974800781)
  24241. --- END Proposal Phase ---
  24242. --- Decision Phase ---
  24243. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24244. =>WM: (14891: S1 ^operator O2124)
  24245. 1062: O: O2124 (predict-no)
  24246. --- END Decision Phase ---
  24247. --- Application Phase ---
  24248. --- Firing Productions (PE) For State At Depth 1 ---
  24249. --- Inner Elaboration Phase, active level 1 (S1) ---
  24250. Firing apply*operator
  24251. -->
  24252. (I3 ^predict-no N1062 + :O )
  24253. Firing apply*operator*complete
  24254. -->
  24255. (I3 ^predict-no N1061 - :O )
  24256. inner elaboration loop at bottom goal.
  24257. --- Change Working Memory (PE) ---
  24258. =>WM: (14892: I3 ^predict-no N1062)
  24259. <=WM: (14879: N1061 ^status complete)
  24260. <=WM: (14878: I3 ^predict-no N1061)
  24261. --- Firing Productions (IE) For State At Depth 1 ---
  24262. --- Inner Elaboration Phase, active level 1 (S1) ---
  24263. Firing monitor*world
  24264. -->
  24265. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24266. --- Change Working Memory (IE) ---
  24267. --- END Application Phase ---
  24268. --- Output Phase ---
  24269. ENV: Agent did: predict-no for direction L in state State-A
  24270. In State-A moving L
  24271. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24272. predict error 0
  24273. dir: dir isL
  24274. --- END Output Phase ---
  24275. -/--- Input Phase ---
  24276. =>WM: (14896: I2 ^dir L)
  24277. =>WM: (14895: I2 ^reward 1)
  24278. =>WM: (14894: I2 ^see 0)
  24279. =>WM: (14893: N1062 ^status complete)
  24280. <=WM: (14882: I2 ^dir L)
  24281. <=WM: (14881: I2 ^reward 1)
  24282. <=WM: (14880: I2 ^see 0)
  24283. =>WM: (14897: I2 ^level-1 L0-root)
  24284. <=WM: (14883: I2 ^level-1 L0-root)
  24285. --- END Input Phase ---
  24286. --- Proposal Phase ---
  24287. --- Inner Elaboration Phase, active level 1 (S1) ---
  24288. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24289. -->
  24290. (S1 ^operator O2123 = -0.3332708974800781)
  24291. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24292. -->
  24293. (S1 ^operator O2124 = 0.6858824877823619)
  24294. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24295. -->
  24296. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24297. -->
  24298. Firing elaborate*copy-see-to-output-link
  24299. -->
  24300. (I3 ^see 0 +)
  24301. Firing elaborate*reward*based*on*reward
  24302. -->
  24303. (R1066 ^value 1 +)
  24304. (R1 ^reward R1066 +)
  24305. Firing propose*predict-yes
  24306. -->
  24307. (O2125 ^name predict-yes +)
  24308. (S1 ^operator O2125 +)
  24309. Firing propose*predict-no
  24310. -->
  24311. (O2126 ^name predict-no +)
  24312. (S1 ^operator O2126 +)
  24313. Firing rl*prefer*rvt*predict-no*H0*2
  24314. -->
  24315. (S1 ^operator O2124 = 0.3140331355128715)
  24316. Firing rl*prefer*rvt*predict-yes*H0*1
  24317. -->
  24318. (S1 ^operator O2123 = 0.3804134437534242)
  24319. Firing prefer*rvt*predict-yes*H0
  24320. -->
  24321. Firing prefer*rvt*predict-no*H0
  24322. -->
  24323. Firing elaborate*copy-dir-to-output-link
  24324. -->
  24325. (I3 ^dir L +)
  24326. inner elaboration loop at bottom goal.
  24327. Retracting elaborate*copy-see-to-output-link
  24328. -->
  24329. (I3 ^see 0 +)
  24330. Retracting propose*predict-no
  24331. -->
  24332. (O2124 ^name predict-no +)
  24333. (S1 ^operator O2124 +)
  24334. Retracting propose*predict-yes
  24335. -->
  24336. (O2123 ^name predict-yes +)
  24337. (S1 ^operator O2123 +)
  24338. Retracting elaborate*reward*based*on*reward
  24339. -->
  24340. (R1065 ^value 1 +)
  24341. (R1 ^reward R1065 +)
  24342. Retracting elaborate*copy-dir-to-output-link
  24343. -->
  24344. (I3 ^dir L +)
  24345. Retracting rl*prefer*rvt*predict-no*H0*2
  24346. -->
  24347. (S1 ^operator O2124 = 0.3140331355128715)
  24348. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24349. -->
  24350. (S1 ^operator O2124 = 0.6858824877823619)
  24351. Retracting rl*prefer*rvt*predict-yes*H0*1
  24352. -->
  24353. (S1 ^operator O2123 = 0.3804134437534242)
  24354. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24355. -->
  24356. (S1 ^operator O2123 = -0.3332708974800781)
  24357. =>WM: (14903: S1 ^operator O2126 +)
  24358. =>WM: (14902: S1 ^operator O2125 +)
  24359. =>WM: (14901: O2126 ^name predict-no)
  24360. =>WM: (14900: O2125 ^name predict-yes)
  24361. =>WM: (14899: R1066 ^value 1)
  24362. =>WM: (14898: R1 ^reward R1066)
  24363. <=WM: (14889: S1 ^operator O2123 +)
  24364. <=WM: (14890: S1 ^operator O2124 +)
  24365. <=WM: (14891: S1 ^operator O2124)
  24366. <=WM: (14884: R1 ^reward R1065)
  24367. <=WM: (14887: O2124 ^name predict-no)
  24368. <=WM: (14886: O2123 ^name predict-yes)
  24369. <=WM: (14885: R1065 ^value 1)
  24370. --- Inner Elaboration Phase, active level 1 (S1) ---
  24371. Firing prefer*rvt*predict-yes*H0
  24372. -->
  24373. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24374. -->
  24375. (S1 ^operator O2125 = -0.3332708974800781)
  24376. Firing rl*prefer*rvt*predict-yes*H0*1
  24377. -->
  24378. (S1 ^operator O2125 = 0.3804134437534242)
  24379. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24380. -->
  24381. Firing prefer*rvt*predict-no*H0
  24382. -->
  24383. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24384. -->
  24385. (S1 ^operator O2126 = 0.6858824877823619)
  24386. Firing rl*prefer*rvt*predict-no*H0*2
  24387. -->
  24388. (S1 ^operator O2126 = 0.3140331355128715)
  24389. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24390. -->
  24391. inner elaboration loop at bottom goal.
  24392. Retracting rl*prefer*rvt*predict-no*H0*2
  24393. -->
  24394. (S1 ^operator O2124 = 0.3140331355128715)
  24395. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24396. -->
  24397. (S1 ^operator O2124 = 0.6858824877823619)
  24398. Retracting rl*prefer*rvt*predict-yes*H0*1
  24399. -->
  24400. (S1 ^operator O2123 = 0.3804134437534242)
  24401. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24402. -->
  24403. (S1 ^operator O2123 = -0.3332708974800781)
  24404. --- END Proposal Phase ---
  24405. --- Decision Phase ---
  24406. RL update rl*prefer*rvt*predict-no*H0*2 0.485041 -0.171007 0.314033 -> 0.485046 -0.171006 0.31404(R,m,v=1,0.874251,0.110598)
  24407. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514893 0.17099 0.685882 -> 0.514899 0.170991 0.685891(R,m,v=1,1,0)
  24408. =>WM: (14904: S1 ^operator O2126)
  24409. 1063: O: O2126 (predict-no)
  24410. --- END Decision Phase ---
  24411. --- Application Phase ---
  24412. --- Firing Productions (PE) For State At Depth 1 ---
  24413. --- Inner Elaboration Phase, active level 1 (S1) ---
  24414. Firing apply*operator
  24415. -->
  24416. (I3 ^predict-no N1063 + :O )
  24417. Firing apply*operator*complete
  24418. -->
  24419. (I3 ^predict-no N1062 - :O )
  24420. inner elaboration loop at bottom goal.
  24421. --- Change Working Memory (PE) ---
  24422. =>WM: (14905: I3 ^predict-no N1063)
  24423. <=WM: (14893: N1062 ^status complete)
  24424. <=WM: (14892: I3 ^predict-no N1062)
  24425. --- Firing Productions (IE) For State At Depth 1 ---
  24426. --- Inner Elaboration Phase, active level 1 (S1) ---
  24427. Firing monitor*world
  24428. -->
  24429. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24430. --- Change Working Memory (IE) ---
  24431. --- END Application Phase ---
  24432. --- Output Phase ---
  24433. ENV: Agent did: predict-no for direction L in state State-A
  24434. In State-A moving L
  24435. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24436. predict error 0
  24437. dir: dir isU
  24438. --- END Output Phase ---
  24439. |\--- Input Phase ---
  24440. =>WM: (14909: I2 ^dir U)
  24441. =>WM: (14908: I2 ^reward 1)
  24442. =>WM: (14907: I2 ^see 0)
  24443. =>WM: (14906: N1063 ^status complete)
  24444. <=WM: (14896: I2 ^dir L)
  24445. <=WM: (14895: I2 ^reward 1)
  24446. <=WM: (14894: I2 ^see 0)
  24447. =>WM: (14910: I2 ^level-1 L0-root)
  24448. <=WM: (14897: I2 ^level-1 L0-root)
  24449. --- END Input Phase ---
  24450. --- Proposal Phase ---
  24451. --- Inner Elaboration Phase, active level 1 (S1) ---
  24452. Firing elaborate*copy-see-to-output-link
  24453. -->
  24454. (I3 ^see 0 +)
  24455. Firing elaborate*reward*based*on*reward
  24456. -->
  24457. (R1067 ^value 1 +)
  24458. (R1 ^reward R1067 +)
  24459. Firing propose*predict-yes
  24460. -->
  24461. (O2127 ^name predict-yes +)
  24462. (S1 ^operator O2127 +)
  24463. Firing propose*predict-no
  24464. -->
  24465. (O2128 ^name predict-no +)
  24466. (S1 ^operator O2128 +)
  24467. Firing rl*prefer*rvt*predict-no*H0*4
  24468. -->
  24469. (S1 ^operator O2126 = 1.)
  24470. Firing rl*prefer*rvt*predict-yes*H0*3
  24471. -->
  24472. (S1 ^operator O2125 = 0.)
  24473. Firing prefer*rvt*predict-yes*H0
  24474. -->
  24475. Firing prefer*rvt*predict-no*H0
  24476. -->
  24477. Firing elaborate*copy-dir-to-output-link
  24478. -->
  24479. (I3 ^dir U +)
  24480. inner elaboration loop at bottom goal.
  24481. Retracting elaborate*copy-see-to-output-link
  24482. -->
  24483. (I3 ^see 0 +)
  24484. Retracting propose*predict-no
  24485. -->
  24486. (O2126 ^name predict-no +)
  24487. (S1 ^operator O2126 +)
  24488. Retracting propose*predict-yes
  24489. -->
  24490. (O2125 ^name predict-yes +)
  24491. (S1 ^operator O2125 +)
  24492. Retracting elaborate*reward*based*on*reward
  24493. -->
  24494. (R1066 ^value 1 +)
  24495. (R1 ^reward R1066 +)
  24496. Retracting elaborate*copy-dir-to-output-link
  24497. -->
  24498. (I3 ^dir L +)
  24499. Retracting rl*prefer*rvt*predict-no*H0*2
  24500. -->
  24501. (S1 ^operator O2126 = 0.3140400312949982)
  24502. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24503. -->
  24504. (S1 ^operator O2126 = 0.6858905480601469)
  24505. Retracting rl*prefer*rvt*predict-yes*H0*1
  24506. -->
  24507. (S1 ^operator O2125 = 0.3804134437534242)
  24508. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24509. -->
  24510. (S1 ^operator O2125 = -0.3332708974800781)
  24511. =>WM: (14917: S1 ^operator O2128 +)
  24512. =>WM: (14916: S1 ^operator O2127 +)
  24513. =>WM: (14915: I3 ^dir U)
  24514. =>WM: (14914: O2128 ^name predict-no)
  24515. =>WM: (14913: O2127 ^name predict-yes)
  24516. =>WM: (14912: R1067 ^value 1)
  24517. =>WM: (14911: R1 ^reward R1067)
  24518. <=WM: (14902: S1 ^operator O2125 +)
  24519. <=WM: (14903: S1 ^operator O2126 +)
  24520. <=WM: (14904: S1 ^operator O2126)
  24521. <=WM: (14888: I3 ^dir L)
  24522. <=WM: (14898: R1 ^reward R1066)
  24523. <=WM: (14901: O2126 ^name predict-no)
  24524. <=WM: (14900: O2125 ^name predict-yes)
  24525. <=WM: (14899: R1066 ^value 1)
  24526. --- Inner Elaboration Phase, active level 1 (S1) ---
  24527. Firing prefer*rvt*predict-yes*H0
  24528. -->
  24529. Firing rl*prefer*rvt*predict-yes*H0*3
  24530. -->
  24531. (S1 ^operator O2127 = 0.)
  24532. Firing prefer*rvt*predict-no*H0
  24533. -->
  24534. Firing rl*prefer*rvt*predict-no*H0*4
  24535. -->
  24536. (S1 ^operator O2128 = 1.)
  24537. inner elaboration loop at bottom goal.
  24538. Retracting rl*prefer*rvt*predict-no*H0*4
  24539. -->
  24540. (S1 ^operator O2126 = 1.)
  24541. Retracting rl*prefer*rvt*predict-yes*H0*3
  24542. -->
  24543. (S1 ^operator O2125 = 0.)
  24544. --- END Proposal Phase ---
  24545. --- Decision Phase ---
  24546. RL update rl*prefer*rvt*predict-no*H0*2 0.485046 -0.171006 0.31404 -> 0.48505 -0.171005 0.314046(R,m,v=1,0.875,0.11003)
  24547. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514899 0.170991 0.685891 -> 0.514904 0.170993 0.685897(R,m,v=1,1,0)
  24548. =>WM: (14918: S1 ^operator O2128)
  24549. 1064: O: O2128 (predict-no)
  24550. --- END Decision Phase ---
  24551. --- Application Phase ---
  24552. --- Firing Productions (PE) For State At Depth 1 ---
  24553. --- Inner Elaboration Phase, active level 1 (S1) ---
  24554. Firing apply*operator
  24555. -->
  24556. (I3 ^predict-no N1064 + :O )
  24557. Firing apply*operator*complete
  24558. -->
  24559. (I3 ^predict-no N1063 - :O )
  24560. inner elaboration loop at bottom goal.
  24561. --- Change Working Memory (PE) ---
  24562. =>WM: (14919: I3 ^predict-no N1064)
  24563. <=WM: (14906: N1063 ^status complete)
  24564. <=WM: (14905: I3 ^predict-no N1063)
  24565. --- Firing Productions (IE) For State At Depth 1 ---
  24566. --- Inner Elaboration Phase, active level 1 (S1) ---
  24567. Firing monitor*world
  24568. -->
  24569. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24570. --- Change Working Memory (IE) ---
  24571. --- END Application Phase ---
  24572. --- Output Phase ---
  24573. ENV: Agent did: predict-no for direction U in state State-A
  24574. In State-A moving U
  24575. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24576. predict error 0
  24577. dir: dir isL
  24578. --- END Output Phase ---
  24579. -/|--- Input Phase ---
  24580. =>WM: (14923: I2 ^dir L)
  24581. =>WM: (14922: I2 ^reward 1)
  24582. =>WM: (14921: I2 ^see 0)
  24583. =>WM: (14920: N1064 ^status complete)
  24584. <=WM: (14909: I2 ^dir U)
  24585. <=WM: (14908: I2 ^reward 1)
  24586. <=WM: (14907: I2 ^see 0)
  24587. =>WM: (14924: I2 ^level-1 L0-root)
  24588. <=WM: (14910: I2 ^level-1 L0-root)
  24589. --- END Input Phase ---
  24590. --- Proposal Phase ---
  24591. --- Inner Elaboration Phase, active level 1 (S1) ---
  24592. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24593. -->
  24594. (S1 ^operator O2127 = -0.3332708974800781)
  24595. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24596. -->
  24597. (S1 ^operator O2128 = 0.6858971614456655)
  24598. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24599. -->
  24600. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24601. -->
  24602. Firing elaborate*copy-see-to-output-link
  24603. -->
  24604. (I3 ^see 0 +)
  24605. Firing elaborate*reward*based*on*reward
  24606. -->
  24607. (R1068 ^value 1 +)
  24608. (R1 ^reward R1068 +)
  24609. Firing propose*predict-yes
  24610. -->
  24611. (O2129 ^name predict-yes +)
  24612. (S1 ^operator O2129 +)
  24613. Firing propose*predict-no
  24614. -->
  24615. (O2130 ^name predict-no +)
  24616. (S1 ^operator O2130 +)
  24617. Firing rl*prefer*rvt*predict-no*H0*2
  24618. -->
  24619. (S1 ^operator O2128 = 0.3140456992451273)
  24620. Firing rl*prefer*rvt*predict-yes*H0*1
  24621. -->
  24622. (S1 ^operator O2127 = 0.3804134437534242)
  24623. Firing prefer*rvt*predict-yes*H0
  24624. -->
  24625. Firing prefer*rvt*predict-no*H0
  24626. -->
  24627. Firing elaborate*copy-dir-to-output-link
  24628. -->
  24629. (I3 ^dir L +)
  24630. inner elaboration loop at bottom goal.
  24631. Retracting elaborate*copy-see-to-output-link
  24632. -->
  24633. (I3 ^see 0 +)
  24634. Retracting propose*predict-no
  24635. -->
  24636. (O2128 ^name predict-no +)
  24637. (S1 ^operator O2128 +)
  24638. Retracting propose*predict-yes
  24639. -->
  24640. (O2127 ^name predict-yes +)
  24641. (S1 ^operator O2127 +)
  24642. Retracting elaborate*reward*based*on*reward
  24643. -->
  24644. (R1067 ^value 1 +)
  24645. (R1 ^reward R1067 +)
  24646. Retracting elaborate*copy-dir-to-output-link
  24647. -->
  24648. (I3 ^dir U +)
  24649. Retracting rl*prefer*rvt*predict-no*H0*4
  24650. -->
  24651. (S1 ^operator O2128 = 1.)
  24652. Retracting rl*prefer*rvt*predict-yes*H0*3
  24653. -->
  24654. (S1 ^operator O2127 = 0.)
  24655. =>WM: (14931: S1 ^operator O2130 +)
  24656. =>WM: (14930: S1 ^operator O2129 +)
  24657. =>WM: (14929: I3 ^dir L)
  24658. =>WM: (14928: O2130 ^name predict-no)
  24659. =>WM: (14927: O2129 ^name predict-yes)
  24660. =>WM: (14926: R1068 ^value 1)
  24661. =>WM: (14925: R1 ^reward R1068)
  24662. <=WM: (14916: S1 ^operator O2127 +)
  24663. <=WM: (14917: S1 ^operator O2128 +)
  24664. <=WM: (14918: S1 ^operator O2128)
  24665. <=WM: (14915: I3 ^dir U)
  24666. <=WM: (14911: R1 ^reward R1067)
  24667. <=WM: (14914: O2128 ^name predict-no)
  24668. <=WM: (14913: O2127 ^name predict-yes)
  24669. <=WM: (14912: R1067 ^value 1)
  24670. --- Inner Elaboration Phase, active level 1 (S1) ---
  24671. Firing prefer*rvt*predict-yes*H0
  24672. -->
  24673. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24674. -->
  24675. (S1 ^operator O2129 = -0.3332708974800781)
  24676. Firing rl*prefer*rvt*predict-yes*H0*1
  24677. -->
  24678. (S1 ^operator O2129 = 0.3804134437534242)
  24679. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24680. -->
  24681. Firing prefer*rvt*predict-no*H0
  24682. -->
  24683. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24684. -->
  24685. (S1 ^operator O2130 = 0.6858971614456655)
  24686. Firing rl*prefer*rvt*predict-no*H0*2
  24687. -->
  24688. (S1 ^operator O2130 = 0.3140456992451273)
  24689. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24690. -->
  24691. inner elaboration loop at bottom goal.
  24692. Retracting rl*prefer*rvt*predict-no*H0*2
  24693. -->
  24694. (S1 ^operator O2128 = 0.3140456992451273)
  24695. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24696. -->
  24697. (S1 ^operator O2128 = 0.6858971614456655)
  24698. Retracting rl*prefer*rvt*predict-yes*H0*1
  24699. -->
  24700. (S1 ^operator O2127 = 0.3804134437534242)
  24701. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24702. -->
  24703. (S1 ^operator O2127 = -0.3332708974800781)
  24704. --- END Proposal Phase ---
  24705. --- Decision Phase ---
  24706. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24707. =>WM: (14932: S1 ^operator O2130)
  24708. 1065: O: O2130 (predict-no)
  24709. --- END Decision Phase ---
  24710. --- Application Phase ---
  24711. --- Firing Productions (PE) For State At Depth 1 ---
  24712. --- Inner Elaboration Phase, active level 1 (S1) ---
  24713. Firing apply*operator
  24714. -->
  24715. (I3 ^predict-no N1065 + :O )
  24716. Firing apply*operator*complete
  24717. -->
  24718. (I3 ^predict-no N1064 - :O )
  24719. inner elaboration loop at bottom goal.
  24720. --- Change Working Memory (PE) ---
  24721. =>WM: (14933: I3 ^predict-no N1065)
  24722. <=WM: (14920: N1064 ^status complete)
  24723. <=WM: (14919: I3 ^predict-no N1064)
  24724. --- Firing Productions (IE) For State At Depth 1 ---
  24725. --- Inner Elaboration Phase, active level 1 (S1) ---
  24726. Firing monitor*world
  24727. -->
  24728. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24729. --- Change Working Memory (IE) ---
  24730. --- END Application Phase ---
  24731. --- Output Phase ---
  24732. ENV: Agent did: predict-no for direction L in state State-A
  24733. In State-A moving L
  24734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24735. predict error 0
  24736. dir: dir isL
  24737. --- END Output Phase ---
  24738. \-/--- Input Phase ---
  24739. =>WM: (14937: I2 ^dir L)
  24740. =>WM: (14936: I2 ^reward 1)
  24741. =>WM: (14935: I2 ^see 0)
  24742. =>WM: (14934: N1065 ^status complete)
  24743. <=WM: (14923: I2 ^dir L)
  24744. <=WM: (14922: I2 ^reward 1)
  24745. <=WM: (14921: I2 ^see 0)
  24746. =>WM: (14938: I2 ^level-1 L0-root)
  24747. <=WM: (14924: I2 ^level-1 L0-root)
  24748. --- END Input Phase ---
  24749. --- Proposal Phase ---
  24750. --- Inner Elaboration Phase, active level 1 (S1) ---
  24751. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24752. -->
  24753. (S1 ^operator O2129 = -0.3332708974800781)
  24754. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24755. -->
  24756. (S1 ^operator O2130 = 0.6858971614456655)
  24757. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24758. -->
  24759. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24760. -->
  24761. Firing elaborate*copy-see-to-output-link
  24762. -->
  24763. (I3 ^see 0 +)
  24764. Firing elaborate*reward*based*on*reward
  24765. -->
  24766. (R1069 ^value 1 +)
  24767. (R1 ^reward R1069 +)
  24768. Firing propose*predict-yes
  24769. -->
  24770. (O2131 ^name predict-yes +)
  24771. (S1 ^operator O2131 +)
  24772. Firing propose*predict-no
  24773. -->
  24774. (O2132 ^name predict-no +)
  24775. (S1 ^operator O2132 +)
  24776. Firing rl*prefer*rvt*predict-no*H0*2
  24777. -->
  24778. (S1 ^operator O2130 = 0.3140456992451273)
  24779. Firing rl*prefer*rvt*predict-yes*H0*1
  24780. -->
  24781. (S1 ^operator O2129 = 0.3804134437534242)
  24782. Firing prefer*rvt*predict-yes*H0
  24783. -->
  24784. Firing prefer*rvt*predict-no*H0
  24785. -->
  24786. Firing elaborate*copy-dir-to-output-link
  24787. -->
  24788. (I3 ^dir L +)
  24789. inner elaboration loop at bottom goal.
  24790. Retracting elaborate*copy-see-to-output-link
  24791. -->
  24792. (I3 ^see 0 +)
  24793. Retracting propose*predict-no
  24794. -->
  24795. (O2130 ^name predict-no +)
  24796. (S1 ^operator O2130 +)
  24797. Retracting propose*predict-yes
  24798. -->
  24799. (O2129 ^name predict-yes +)
  24800. (S1 ^operator O2129 +)
  24801. Retracting elaborate*reward*based*on*reward
  24802. -->
  24803. (R1068 ^value 1 +)
  24804. (R1 ^reward R1068 +)
  24805. Retracting elaborate*copy-dir-to-output-link
  24806. -->
  24807. (I3 ^dir L +)
  24808. Retracting rl*prefer*rvt*predict-no*H0*2
  24809. -->
  24810. (S1 ^operator O2130 = 0.3140456992451273)
  24811. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24812. -->
  24813. (S1 ^operator O2130 = 0.6858971614456655)
  24814. Retracting rl*prefer*rvt*predict-yes*H0*1
  24815. -->
  24816. (S1 ^operator O2129 = 0.3804134437534242)
  24817. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24818. -->
  24819. (S1 ^operator O2129 = -0.3332708974800781)
  24820. =>WM: (14944: S1 ^operator O2132 +)
  24821. =>WM: (14943: S1 ^operator O2131 +)
  24822. =>WM: (14942: O2132 ^name predict-no)
  24823. =>WM: (14941: O2131 ^name predict-yes)
  24824. =>WM: (14940: R1069 ^value 1)
  24825. =>WM: (14939: R1 ^reward R1069)
  24826. <=WM: (14930: S1 ^operator O2129 +)
  24827. <=WM: (14931: S1 ^operator O2130 +)
  24828. <=WM: (14932: S1 ^operator O2130)
  24829. <=WM: (14925: R1 ^reward R1068)
  24830. <=WM: (14928: O2130 ^name predict-no)
  24831. <=WM: (14927: O2129 ^name predict-yes)
  24832. <=WM: (14926: R1068 ^value 1)
  24833. --- Inner Elaboration Phase, active level 1 (S1) ---
  24834. Firing prefer*rvt*predict-yes*H0
  24835. -->
  24836. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24837. -->
  24838. (S1 ^operator O2131 = -0.3332708974800781)
  24839. Firing rl*prefer*rvt*predict-yes*H0*1
  24840. -->
  24841. (S1 ^operator O2131 = 0.3804134437534242)
  24842. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24843. -->
  24844. Firing prefer*rvt*predict-no*H0
  24845. -->
  24846. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24847. -->
  24848. (S1 ^operator O2132 = 0.6858971614456655)
  24849. Firing rl*prefer*rvt*predict-no*H0*2
  24850. -->
  24851. (S1 ^operator O2132 = 0.3140456992451273)
  24852. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24853. -->
  24854. inner elaboration loop at bottom goal.
  24855. Retracting rl*prefer*rvt*predict-no*H0*2
  24856. -->
  24857. (S1 ^operator O2130 = 0.3140456992451273)
  24858. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24859. -->
  24860. (S1 ^operator O2130 = 0.6858971614456655)
  24861. Retracting rl*prefer*rvt*predict-yes*H0*1
  24862. -->
  24863. (S1 ^operator O2129 = 0.3804134437534242)
  24864. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24865. -->
  24866. (S1 ^operator O2129 = -0.3332708974800781)
  24867. --- END Proposal Phase ---
  24868. --- Decision Phase ---
  24869. RL update rl*prefer*rvt*predict-no*H0*2 0.48505 -0.171005 0.314046 -> 0.485054 -0.171004 0.31405(R,m,v=1,0.87574,0.109467)
  24870. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514904 0.170993 0.685897 -> 0.514909 0.170994 0.685903(R,m,v=1,1,0)
  24871. =>WM: (14945: S1 ^operator O2132)
  24872. 1066: O: O2132 (predict-no)
  24873. --- END Decision Phase ---
  24874. --- Application Phase ---
  24875. --- Firing Productions (PE) For State At Depth 1 ---
  24876. --- Inner Elaboration Phase, active level 1 (S1) ---
  24877. Firing apply*operator
  24878. -->
  24879. (I3 ^predict-no N1066 + :O )
  24880. Firing apply*operator*complete
  24881. -->
  24882. (I3 ^predict-no N1065 - :O )
  24883. inner elaboration loop at bottom goal.
  24884. --- Change Working Memory (PE) ---
  24885. =>WM: (14946: I3 ^predict-no N1066)
  24886. <=WM: (14934: N1065 ^status complete)
  24887. <=WM: (14933: I3 ^predict-no N1065)
  24888. --- Firing Productions (IE) For State At Depth 1 ---
  24889. --- Inner Elaboration Phase, active level 1 (S1) ---
  24890. Firing monitor*world
  24891. -->
  24892. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24893. --- Change Working Memory (IE) ---
  24894. --- END Application Phase ---
  24895. --- Output Phase ---
  24896. ENV: Agent did: predict-no for direction L in state State-A
  24897. In State-A moving L
  24898. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24899. predict error 0
  24900. dir: dir isL
  24901. --- END Output Phase ---
  24902. |\---- Input Phase ---
  24903. =>WM: (14950: I2 ^dir L)
  24904. =>WM: (14949: I2 ^reward 1)
  24905. =>WM: (14948: I2 ^see 0)
  24906. =>WM: (14947: N1066 ^status complete)
  24907. <=WM: (14937: I2 ^dir L)
  24908. <=WM: (14936: I2 ^reward 1)
  24909. <=WM: (14935: I2 ^see 0)
  24910. =>WM: (14951: I2 ^level-1 L0-root)
  24911. <=WM: (14938: I2 ^level-1 L0-root)
  24912. --- END Input Phase ---
  24913. --- Proposal Phase ---
  24914. --- Inner Elaboration Phase, active level 1 (S1) ---
  24915. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24916. -->
  24917. (S1 ^operator O2131 = -0.3332708974800781)
  24918. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24919. -->
  24920. (S1 ^operator O2132 = 0.6859025901730954)
  24921. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24922. -->
  24923. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24924. -->
  24925. Firing elaborate*copy-see-to-output-link
  24926. -->
  24927. (I3 ^see 0 +)
  24928. Firing elaborate*reward*based*on*reward
  24929. -->
  24930. (R1070 ^value 1 +)
  24931. (R1 ^reward R1070 +)
  24932. Firing propose*predict-yes
  24933. -->
  24934. (O2133 ^name predict-yes +)
  24935. (S1 ^operator O2133 +)
  24936. Firing propose*predict-no
  24937. -->
  24938. (O2134 ^name predict-no +)
  24939. (S1 ^operator O2134 +)
  24940. Firing rl*prefer*rvt*predict-no*H0*2
  24941. -->
  24942. (S1 ^operator O2132 = 0.3140503599509452)
  24943. Firing rl*prefer*rvt*predict-yes*H0*1
  24944. -->
  24945. (S1 ^operator O2131 = 0.3804134437534242)
  24946. Firing prefer*rvt*predict-yes*H0
  24947. -->
  24948. Firing prefer*rvt*predict-no*H0
  24949. -->
  24950. Firing elaborate*copy-dir-to-output-link
  24951. -->
  24952. (I3 ^dir L +)
  24953. inner elaboration loop at bottom goal.
  24954. Retracting elaborate*copy-see-to-output-link
  24955. -->
  24956. (I3 ^see 0 +)
  24957. Retracting propose*predict-no
  24958. -->
  24959. (O2132 ^name predict-no +)
  24960. (S1 ^operator O2132 +)
  24961. Retracting propose*predict-yes
  24962. -->
  24963. (O2131 ^name predict-yes +)
  24964. (S1 ^operator O2131 +)
  24965. Retracting elaborate*reward*based*on*reward
  24966. -->
  24967. (R1069 ^value 1 +)
  24968. (R1 ^reward R1069 +)
  24969. Retracting elaborate*copy-dir-to-output-link
  24970. -->
  24971. (I3 ^dir L +)
  24972. Retracting rl*prefer*rvt*predict-no*H0*2
  24973. -->
  24974. (S1 ^operator O2132 = 0.3140503599509452)
  24975. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  24976. -->
  24977. (S1 ^operator O2132 = 0.6859025901730954)
  24978. Retracting rl*prefer*rvt*predict-yes*H0*1
  24979. -->
  24980. (S1 ^operator O2131 = 0.3804134437534242)
  24981. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  24982. -->
  24983. (S1 ^operator O2131 = -0.3332708974800781)
  24984. =>WM: (14957: S1 ^operator O2134 +)
  24985. =>WM: (14956: S1 ^operator O2133 +)
  24986. =>WM: (14955: O2134 ^name predict-no)
  24987. =>WM: (14954: O2133 ^name predict-yes)
  24988. =>WM: (14953: R1070 ^value 1)
  24989. =>WM: (14952: R1 ^reward R1070)
  24990. <=WM: (14943: S1 ^operator O2131 +)
  24991. <=WM: (14944: S1 ^operator O2132 +)
  24992. <=WM: (14945: S1 ^operator O2132)
  24993. <=WM: (14939: R1 ^reward R1069)
  24994. <=WM: (14942: O2132 ^name predict-no)
  24995. <=WM: (14941: O2131 ^name predict-yes)
  24996. <=WM: (14940: R1069 ^value 1)
  24997. --- Inner Elaboration Phase, active level 1 (S1) ---
  24998. Firing prefer*rvt*predict-yes*H0
  24999. -->
  25000. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25001. -->
  25002. (S1 ^operator O2133 = -0.3332708974800781)
  25003. Firing rl*prefer*rvt*predict-yes*H0*1
  25004. -->
  25005. (S1 ^operator O2133 = 0.3804134437534242)
  25006. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25007. -->
  25008. Firing prefer*rvt*predict-no*H0
  25009. -->
  25010. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25011. -->
  25012. (S1 ^operator O2134 = 0.6859025901730954)
  25013. Firing rl*prefer*rvt*predict-no*H0*2
  25014. -->
  25015. (S1 ^operator O2134 = 0.3140503599509452)
  25016. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25017. -->
  25018. inner elaboration loop at bottom goal.
  25019. Retracting rl*prefer*rvt*predict-no*H0*2
  25020. -->
  25021. (S1 ^operator O2132 = 0.3140503599509452)
  25022. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25023. -->
  25024. (S1 ^operator O2132 = 0.6859025901730954)
  25025. Retracting rl*prefer*rvt*predict-yes*H0*1
  25026. -->
  25027. (S1 ^operator O2131 = 0.3804134437534242)
  25028. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25029. -->
  25030. (S1 ^operator O2131 = -0.3332708974800781)
  25031. --- END Proposal Phase ---
  25032. --- Decision Phase ---
  25033. RL update rl*prefer*rvt*predict-no*H0*2 0.485054 -0.171004 0.31405 -> 0.485057 -0.171003 0.314054(R,m,v=1,0.876471,0.108911)
  25034. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514909 0.170994 0.685903 -> 0.514912 0.170995 0.685907(R,m,v=1,1,0)
  25035. =>WM: (14958: S1 ^operator O2134)
  25036. 1067: O: O2134 (predict-no)
  25037. --- END Decision Phase ---
  25038. --- Application Phase ---
  25039. --- Firing Productions (PE) For State At Depth 1 ---
  25040. --- Inner Elaboration Phase, active level 1 (S1) ---
  25041. Firing apply*operator
  25042. -->
  25043. (I3 ^predict-no N1067 + :O )
  25044. Firing apply*operator*complete
  25045. -->
  25046. (I3 ^predict-no N1066 - :O )
  25047. inner elaboration loop at bottom goal.
  25048. --- Change Working Memory (PE) ---
  25049. =>WM: (14959: I3 ^predict-no N1067)
  25050. <=WM: (14947: N1066 ^status complete)
  25051. <=WM: (14946: I3 ^predict-no N1066)
  25052. --- Firing Productions (IE) For State At Depth 1 ---
  25053. --- Inner Elaboration Phase, active level 1 (S1) ---
  25054. Firing monitor*world
  25055. -->
  25056. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25057. --- Change Working Memory (IE) ---
  25058. --- END Application Phase ---
  25059. --- Output Phase ---
  25060. ENV: Agent did: predict-no for direction L in state State-A
  25061. In State-A moving L
  25062. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25063. predict error 0
  25064. dir: dir isL
  25065. --- END Output Phase ---
  25066. /|\--- Input Phase ---
  25067. =>WM: (14963: I2 ^dir L)
  25068. =>WM: (14962: I2 ^reward 1)
  25069. =>WM: (14961: I2 ^see 0)
  25070. =>WM: (14960: N1067 ^status complete)
  25071. <=WM: (14950: I2 ^dir L)
  25072. <=WM: (14949: I2 ^reward 1)
  25073. <=WM: (14948: I2 ^see 0)
  25074. =>WM: (14964: I2 ^level-1 L0-root)
  25075. <=WM: (14951: I2 ^level-1 L0-root)
  25076. --- END Input Phase ---
  25077. --- Proposal Phase ---
  25078. --- Inner Elaboration Phase, active level 1 (S1) ---
  25079. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25080. -->
  25081. (S1 ^operator O2133 = -0.3332708974800781)
  25082. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25083. -->
  25084. (S1 ^operator O2134 = 0.6859070484688164)
  25085. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25086. -->
  25087. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25088. -->
  25089. Firing elaborate*copy-see-to-output-link
  25090. -->
  25091. (I3 ^see 0 +)
  25092. Firing elaborate*reward*based*on*reward
  25093. -->
  25094. (R1071 ^value 1 +)
  25095. (R1 ^reward R1071 +)
  25096. Firing propose*predict-yes
  25097. -->
  25098. (O2135 ^name predict-yes +)
  25099. (S1 ^operator O2135 +)
  25100. Firing propose*predict-no
  25101. -->
  25102. (O2136 ^name predict-no +)
  25103. (S1 ^operator O2136 +)
  25104. Firing rl*prefer*rvt*predict-no*H0*2
  25105. -->
  25106. (S1 ^operator O2134 = 0.3140541939976826)
  25107. Firing rl*prefer*rvt*predict-yes*H0*1
  25108. -->
  25109. (S1 ^operator O2133 = 0.3804134437534242)
  25110. Firing prefer*rvt*predict-yes*H0
  25111. -->
  25112. Firing prefer*rvt*predict-no*H0
  25113. -->
  25114. Firing elaborate*copy-dir-to-output-link
  25115. -->
  25116. (I3 ^dir L +)
  25117. inner elaboration loop at bottom goal.
  25118. Retracting elaborate*copy-see-to-output-link
  25119. -->
  25120. (I3 ^see 0 +)
  25121. Retracting propose*predict-no
  25122. -->
  25123. (O2134 ^name predict-no +)
  25124. (S1 ^operator O2134 +)
  25125. Retracting propose*predict-yes
  25126. -->
  25127. (O2133 ^name predict-yes +)
  25128. (S1 ^operator O2133 +)
  25129. Retracting elaborate*reward*based*on*reward
  25130. -->
  25131. (R1070 ^value 1 +)
  25132. (R1 ^reward R1070 +)
  25133. Retracting elaborate*copy-dir-to-output-link
  25134. -->
  25135. (I3 ^dir L +)
  25136. Retracting rl*prefer*rvt*predict-no*H0*2
  25137. -->
  25138. (S1 ^operator O2134 = 0.3140541939976826)
  25139. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25140. -->
  25141. (S1 ^operator O2134 = 0.6859070484688164)
  25142. Retracting rl*prefer*rvt*predict-yes*H0*1
  25143. -->
  25144. (S1 ^operator O2133 = 0.3804134437534242)
  25145. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25146. -->
  25147. (S1 ^operator O2133 = -0.3332708974800781)
  25148. =>WM: (14970: S1 ^operator O2136 +)
  25149. =>WM: (14969: S1 ^operator O2135 +)
  25150. =>WM: (14968: O2136 ^name predict-no)
  25151. =>WM: (14967: O2135 ^name predict-yes)
  25152. =>WM: (14966: R1071 ^value 1)
  25153. =>WM: (14965: R1 ^reward R1071)
  25154. <=WM: (14956: S1 ^operator O2133 +)
  25155. <=WM: (14957: S1 ^operator O2134 +)
  25156. <=WM: (14958: S1 ^operator O2134)
  25157. <=WM: (14952: R1 ^reward R1070)
  25158. <=WM: (14955: O2134 ^name predict-no)
  25159. <=WM: (14954: O2133 ^name predict-yes)
  25160. <=WM: (14953: R1070 ^value 1)
  25161. --- Inner Elaboration Phase, active level 1 (S1) ---
  25162. Firing prefer*rvt*predict-yes*H0
  25163. -->
  25164. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25165. -->
  25166. (S1 ^operator O2135 = -0.3332708974800781)
  25167. Firing rl*prefer*rvt*predict-yes*H0*1
  25168. -->
  25169. (S1 ^operator O2135 = 0.3804134437534242)
  25170. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25171. -->
  25172. Firing prefer*rvt*predict-no*H0
  25173. -->
  25174. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25175. -->
  25176. (S1 ^operator O2136 = 0.6859070484688164)
  25177. Firing rl*prefer*rvt*predict-no*H0*2
  25178. -->
  25179. (S1 ^operator O2136 = 0.3140541939976826)
  25180. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25181. -->
  25182. inner elaboration loop at bottom goal.
  25183. Retracting rl*prefer*rvt*predict-no*H0*2
  25184. -->
  25185. (S1 ^operator O2134 = 0.3140541939976826)
  25186. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25187. -->
  25188. (S1 ^operator O2134 = 0.6859070484688164)
  25189. Retracting rl*prefer*rvt*predict-yes*H0*1
  25190. -->
  25191. (S1 ^operator O2133 = 0.3804134437534242)
  25192. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25193. -->
  25194. (S1 ^operator O2133 = -0.3332708974800781)
  25195. --- END Proposal Phase ---
  25196. --- Decision Phase ---
  25197. RL update rl*prefer*rvt*predict-no*H0*2 0.485057 -0.171003 0.314054 -> 0.48506 -0.171002 0.314057(R,m,v=1,0.877193,0.108359)
  25198. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514912 0.170995 0.685907 -> 0.514915 0.170996 0.685911(R,m,v=1,1,0)
  25199. =>WM: (14971: S1 ^operator O2136)
  25200. 1068: O: O2136 (predict-no)
  25201. --- END Decision Phase ---
  25202. --- Application Phase ---
  25203. --- Firing Productions (PE) For State At Depth 1 ---
  25204. --- Inner Elaboration Phase, active level 1 (S1) ---
  25205. Firing apply*operator
  25206. -->
  25207. (I3 ^predict-no N1068 + :O )
  25208. Firing apply*operator*complete
  25209. -->
  25210. (I3 ^predict-no N1067 - :O )
  25211. inner elaboration loop at bottom goal.
  25212. --- Change Working Memory (PE) ---
  25213. =>WM: (14972: I3 ^predict-no N1068)
  25214. <=WM: (14960: N1067 ^status complete)
  25215. <=WM: (14959: I3 ^predict-no N1067)
  25216. --- Firing Productions (IE) For State At Depth 1 ---
  25217. --- Inner Elaboration Phase, active level 1 (S1) ---
  25218. Firing monitor*world
  25219. -->
  25220. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25221. --- Change Working Memory (IE) ---
  25222. --- END Application Phase ---
  25223. --- Output Phase ---
  25224. ENV: Agent did: predict-no for direction L in state State-A
  25225. In State-A moving L
  25226. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25227. predict error 0
  25228. dir: dir isL
  25229. --- END Output Phase ---
  25230. -/--- Input Phase ---
  25231. =>WM: (14976: I2 ^dir L)
  25232. =>WM: (14975: I2 ^reward 1)
  25233. =>WM: (14974: I2 ^see 0)
  25234. =>WM: (14973: N1068 ^status complete)
  25235. <=WM: (14963: I2 ^dir L)
  25236. <=WM: (14962: I2 ^reward 1)
  25237. <=WM: (14961: I2 ^see 0)
  25238. =>WM: (14977: I2 ^level-1 L0-root)
  25239. <=WM: (14964: I2 ^level-1 L0-root)
  25240. --- END Input Phase ---
  25241. --- Proposal Phase ---
  25242. --- Inner Elaboration Phase, active level 1 (S1) ---
  25243. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25244. -->
  25245. (S1 ^operator O2135 = -0.3332708974800781)
  25246. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25247. -->
  25248. (S1 ^operator O2136 = 0.6859107114336244)
  25249. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25250. -->
  25251. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25252. -->
  25253. Firing elaborate*copy-see-to-output-link
  25254. -->
  25255. (I3 ^see 0 +)
  25256. Firing elaborate*reward*based*on*reward
  25257. -->
  25258. (R1072 ^value 1 +)
  25259. (R1 ^reward R1072 +)
  25260. Firing propose*predict-yes
  25261. -->
  25262. (O2137 ^name predict-yes +)
  25263. (S1 ^operator O2137 +)
  25264. Firing propose*predict-no
  25265. -->
  25266. (O2138 ^name predict-no +)
  25267. (S1 ^operator O2138 +)
  25268. Firing rl*prefer*rvt*predict-no*H0*2
  25269. -->
  25270. (S1 ^operator O2136 = 0.3140573492937311)
  25271. Firing rl*prefer*rvt*predict-yes*H0*1
  25272. -->
  25273. (S1 ^operator O2135 = 0.3804134437534242)
  25274. Firing prefer*rvt*predict-yes*H0
  25275. -->
  25276. Firing prefer*rvt*predict-no*H0
  25277. -->
  25278. Firing elaborate*copy-dir-to-output-link
  25279. -->
  25280. (I3 ^dir L +)
  25281. inner elaboration loop at bottom goal.
  25282. Retracting elaborate*copy-see-to-output-link
  25283. -->
  25284. (I3 ^see 0 +)
  25285. Retracting propose*predict-no
  25286. -->
  25287. (O2136 ^name predict-no +)
  25288. (S1 ^operator O2136 +)
  25289. Retracting propose*predict-yes
  25290. -->
  25291. (O2135 ^name predict-yes +)
  25292. (S1 ^operator O2135 +)
  25293. Retracting elaborate*reward*based*on*reward
  25294. -->
  25295. (R1071 ^value 1 +)
  25296. (R1 ^reward R1071 +)
  25297. Retracting elaborate*copy-dir-to-output-link
  25298. -->
  25299. (I3 ^dir L +)
  25300. Retracting rl*prefer*rvt*predict-no*H0*2
  25301. -->
  25302. (S1 ^operator O2136 = 0.3140573492937311)
  25303. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25304. -->
  25305. (S1 ^operator O2136 = 0.6859107114336244)
  25306. Retracting rl*prefer*rvt*predict-yes*H0*1
  25307. -->
  25308. (S1 ^operator O2135 = 0.3804134437534242)
  25309. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25310. -->
  25311. (S1 ^operator O2135 = -0.3332708974800781)
  25312. =>WM: (14983: S1 ^operator O2138 +)
  25313. =>WM: (14982: S1 ^operator O2137 +)
  25314. =>WM: (14981: O2138 ^name predict-no)
  25315. =>WM: (14980: O2137 ^name predict-yes)
  25316. =>WM: (14979: R1072 ^value 1)
  25317. =>WM: (14978: R1 ^reward R1072)
  25318. <=WM: (14969: S1 ^operator O2135 +)
  25319. <=WM: (14970: S1 ^operator O2136 +)
  25320. <=WM: (14971: S1 ^operator O2136)
  25321. <=WM: (14965: R1 ^reward R1071)
  25322. <=WM: (14968: O2136 ^name predict-no)
  25323. <=WM: (14967: O2135 ^name predict-yes)
  25324. <=WM: (14966: R1071 ^value 1)
  25325. --- Inner Elaboration Phase, active level 1 (S1) ---
  25326. Firing prefer*rvt*predict-yes*H0
  25327. -->
  25328. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25329. -->
  25330. (S1 ^operator O2137 = -0.3332708974800781)
  25331. Firing rl*prefer*rvt*predict-yes*H0*1
  25332. -->
  25333. (S1 ^operator O2137 = 0.3804134437534242)
  25334. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25335. -->
  25336. Firing prefer*rvt*predict-no*H0
  25337. -->
  25338. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25339. -->
  25340. (S1 ^operator O2138 = 0.6859107114336244)
  25341. Firing rl*prefer*rvt*predict-no*H0*2
  25342. -->
  25343. (S1 ^operator O2138 = 0.3140573492937311)
  25344. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25345. -->
  25346. inner elaboration loop at bottom goal.
  25347. Retracting rl*prefer*rvt*predict-no*H0*2
  25348. -->
  25349. (S1 ^operator O2136 = 0.3140573492937311)
  25350. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25351. -->
  25352. (S1 ^operator O2136 = 0.6859107114336244)
  25353. Retracting rl*prefer*rvt*predict-yes*H0*1
  25354. -->
  25355. (S1 ^operator O2135 = 0.3804134437534242)
  25356. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25357. -->
  25358. (S1 ^operator O2135 = -0.3332708974800781)
  25359. --- END Proposal Phase ---
  25360. --- Decision Phase ---
  25361. RL update rl*prefer*rvt*predict-no*H0*2 0.48506 -0.171002 0.314057 -> 0.485062 -0.171002 0.31406(R,m,v=1,0.877907,0.107813)
  25362. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514915 0.170996 0.685911 -> 0.514917 0.170996 0.685914(R,m,v=1,1,0)
  25363. =>WM: (14984: S1 ^operator O2138)
  25364. 1069: O: O2138 (predict-no)
  25365. --- END Decision Phase ---
  25366. --- Application Phase ---
  25367. --- Firing Productions (PE) For State At Depth 1 ---
  25368. --- Inner Elaboration Phase, active level 1 (S1) ---
  25369. Firing apply*operator
  25370. -->
  25371. (I3 ^predict-no N1069 + :O )
  25372. Firing apply*operator*complete
  25373. -->
  25374. (I3 ^predict-no N1068 - :O )
  25375. inner elaboration loop at bottom goal.
  25376. --- Change Working Memory (PE) ---
  25377. =>WM: (14985: I3 ^predict-no N1069)
  25378. <=WM: (14973: N1068 ^status complete)
  25379. <=WM: (14972: I3 ^predict-no N1068)
  25380. --- Firing Productions (IE) For State At Depth 1 ---
  25381. --- Inner Elaboration Phase, active level 1 (S1) ---
  25382. Firing monitor*world
  25383. -->
  25384. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25385. --- Change Working Memory (IE) ---
  25386. --- END Application Phase ---
  25387. --- Output Phase ---
  25388. ENV: Agent did: predict-no for direction L in state State-A
  25389. In State-A moving L
  25390. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25391. predict error 0
  25392. dir: dir isL
  25393. --- END Output Phase ---
  25394. |\---- Input Phase ---
  25395. =>WM: (14989: I2 ^dir L)
  25396. =>WM: (14988: I2 ^reward 1)
  25397. =>WM: (14987: I2 ^see 0)
  25398. =>WM: (14986: N1069 ^status complete)
  25399. <=WM: (14976: I2 ^dir L)
  25400. <=WM: (14975: I2 ^reward 1)
  25401. <=WM: (14974: I2 ^see 0)
  25402. =>WM: (14990: I2 ^level-1 L0-root)
  25403. <=WM: (14977: I2 ^level-1 L0-root)
  25404. --- END Input Phase ---
  25405. --- Proposal Phase ---
  25406. --- Inner Elaboration Phase, active level 1 (S1) ---
  25407. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25408. -->
  25409. (S1 ^operator O2137 = -0.3332708974800781)
  25410. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25411. -->
  25412. (S1 ^operator O2138 = 0.6859137222632506)
  25413. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25414. -->
  25415. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25416. -->
  25417. Firing elaborate*copy-see-to-output-link
  25418. -->
  25419. (I3 ^see 0 +)
  25420. Firing elaborate*reward*based*on*reward
  25421. -->
  25422. (R1073 ^value 1 +)
  25423. (R1 ^reward R1073 +)
  25424. Firing propose*predict-yes
  25425. -->
  25426. (O2139 ^name predict-yes +)
  25427. (S1 ^operator O2139 +)
  25428. Firing propose*predict-no
  25429. -->
  25430. (O2140 ^name predict-no +)
  25431. (S1 ^operator O2140 +)
  25432. Firing rl*prefer*rvt*predict-no*H0*2
  25433. -->
  25434. (S1 ^operator O2138 = 0.3140599470408917)
  25435. Firing rl*prefer*rvt*predict-yes*H0*1
  25436. -->
  25437. (S1 ^operator O2137 = 0.3804134437534242)
  25438. Firing prefer*rvt*predict-yes*H0
  25439. -->
  25440. Firing prefer*rvt*predict-no*H0
  25441. -->
  25442. Firing elaborate*copy-dir-to-output-link
  25443. -->
  25444. (I3 ^dir L +)
  25445. inner elaboration loop at bottom goal.
  25446. Retracting elaborate*copy-see-to-output-link
  25447. -->
  25448. (I3 ^see 0 +)
  25449. Retracting propose*predict-no
  25450. -->
  25451. (O2138 ^name predict-no +)
  25452. (S1 ^operator O2138 +)
  25453. Retracting propose*predict-yes
  25454. -->
  25455. (O2137 ^name predict-yes +)
  25456. (S1 ^operator O2137 +)
  25457. Retracting elaborate*reward*based*on*reward
  25458. -->
  25459. (R1072 ^value 1 +)
  25460. (R1 ^reward R1072 +)
  25461. Retracting elaborate*copy-dir-to-output-link
  25462. -->
  25463. (I3 ^dir L +)
  25464. Retracting rl*prefer*rvt*predict-no*H0*2
  25465. -->
  25466. (S1 ^operator O2138 = 0.3140599470408917)
  25467. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25468. -->
  25469. (S1 ^operator O2138 = 0.6859137222632506)
  25470. Retracting rl*prefer*rvt*predict-yes*H0*1
  25471. -->
  25472. (S1 ^operator O2137 = 0.3804134437534242)
  25473. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25474. -->
  25475. (S1 ^operator O2137 = -0.3332708974800781)
  25476. =>WM: (14996: S1 ^operator O2140 +)
  25477. =>WM: (14995: S1 ^operator O2139 +)
  25478. =>WM: (14994: O2140 ^name predict-no)
  25479. =>WM: (14993: O2139 ^name predict-yes)
  25480. =>WM: (14992: R1073 ^value 1)
  25481. =>WM: (14991: R1 ^reward R1073)
  25482. <=WM: (14982: S1 ^operator O2137 +)
  25483. <=WM: (14983: S1 ^operator O2138 +)
  25484. <=WM: (14984: S1 ^operator O2138)
  25485. <=WM: (14978: R1 ^reward R1072)
  25486. <=WM: (14981: O2138 ^name predict-no)
  25487. <=WM: (14980: O2137 ^name predict-yes)
  25488. <=WM: (14979: R1072 ^value 1)
  25489. --- Inner Elaboration Phase, active level 1 (S1) ---
  25490. Firing prefer*rvt*predict-yes*H0
  25491. -->
  25492. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25493. -->
  25494. (S1 ^operator O2139 = -0.3332708974800781)
  25495. Firing rl*prefer*rvt*predict-yes*H0*1
  25496. -->
  25497. (S1 ^operator O2139 = 0.3804134437534242)
  25498. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25499. -->
  25500. Firing prefer*rvt*predict-no*H0
  25501. -->
  25502. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25503. -->
  25504. (S1 ^operator O2140 = 0.6859137222632506)
  25505. Firing rl*prefer*rvt*predict-no*H0*2
  25506. -->
  25507. (S1 ^operator O2140 = 0.3140599470408917)
  25508. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25509. -->
  25510. inner elaboration loop at bottom goal.
  25511. Retracting rl*prefer*rvt*predict-no*H0*2
  25512. -->
  25513. (S1 ^operator O2138 = 0.3140599470408917)
  25514. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25515. -->
  25516. (S1 ^operator O2138 = 0.6859137222632506)
  25517. Retracting rl*prefer*rvt*predict-yes*H0*1
  25518. -->
  25519. (S1 ^operator O2137 = 0.3804134437534242)
  25520. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25521. -->
  25522. (S1 ^operator O2137 = -0.3332708974800781)
  25523. --- END Proposal Phase ---
  25524. --- Decision Phase ---
  25525. RL update rl*prefer*rvt*predict-no*H0*2 0.485062 -0.171002 0.31406 -> 0.485063 -0.171001 0.314062(R,m,v=1,0.878613,0.107272)
  25526. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514917 0.170996 0.685914 -> 0.514919 0.170997 0.685916(R,m,v=1,1,0)
  25527. =>WM: (14997: S1 ^operator O2140)
  25528. 1070: O: O2140 (predict-no)
  25529. --- END Decision Phase ---
  25530. --- Application Phase ---
  25531. --- Firing Productions (PE) For State At Depth 1 ---
  25532. --- Inner Elaboration Phase, active level 1 (S1) ---
  25533. Firing apply*operator
  25534. -->
  25535. (I3 ^predict-no N1070 + :O )
  25536. Firing apply*operator*complete
  25537. -->
  25538. (I3 ^predict-no N1069 - :O )
  25539. inner elaboration loop at bottom goal.
  25540. --- Change Working Memory (PE) ---
  25541. =>WM: (14998: I3 ^predict-no N1070)
  25542. <=WM: (14986: N1069 ^status complete)
  25543. <=WM: (14985: I3 ^predict-no N1069)
  25544. --- Firing Productions (IE) For State At Depth 1 ---
  25545. --- Inner Elaboration Phase, active level 1 (S1) ---
  25546. Firing monitor*world
  25547. -->
  25548. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25549. --- Change Working Memory (IE) ---
  25550. --- END Application Phase ---
  25551. --- Output Phase ---
  25552. ENV: Agent did: predict-no for direction L in state State-A
  25553. In State-A moving L
  25554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25555. predict error 0
  25556. dir: dir isR
  25557. --- END Output Phase ---
  25558. /|\--- Input Phase ---
  25559. =>WM: (15002: I2 ^dir R)
  25560. =>WM: (15001: I2 ^reward 1)
  25561. =>WM: (15000: I2 ^see 0)
  25562. =>WM: (14999: N1070 ^status complete)
  25563. <=WM: (14989: I2 ^dir L)
  25564. <=WM: (14988: I2 ^reward 1)
  25565. <=WM: (14987: I2 ^see 0)
  25566. =>WM: (15003: I2 ^level-1 L0-root)
  25567. <=WM: (14990: I2 ^level-1 L0-root)
  25568. --- END Input Phase ---
  25569. --- Proposal Phase ---
  25570. --- Inner Elaboration Phase, active level 1 (S1) ---
  25571. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  25572. -->
  25573. (S1 ^operator O2139 = 0.7057943466848455)
  25574. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  25575. -->
  25576. (S1 ^operator O2140 = -0.2023211881870005)
  25577. Firing prefer*rvt*predict-no*H0*6*v1*H1
  25578. -->
  25579. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25580. -->
  25581. Firing elaborate*copy-see-to-output-link
  25582. -->
  25583. (I3 ^see 0 +)
  25584. Firing elaborate*reward*based*on*reward
  25585. -->
  25586. (R1074 ^value 1 +)
  25587. (R1 ^reward R1074 +)
  25588. Firing propose*predict-yes
  25589. -->
  25590. (O2141 ^name predict-yes +)
  25591. (S1 ^operator O2141 +)
  25592. Firing propose*predict-no
  25593. -->
  25594. (O2142 ^name predict-no +)
  25595. (S1 ^operator O2142 +)
  25596. Firing rl*prefer*rvt*predict-no*H0*6
  25597. -->
  25598. (S1 ^operator O2140 = 0.2298602070490972)
  25599. Firing rl*prefer*rvt*predict-yes*H0*5
  25600. -->
  25601. (S1 ^operator O2139 = 0.2940539968979803)
  25602. Firing prefer*rvt*predict-yes*H0
  25603. -->
  25604. Firing prefer*rvt*predict-no*H0
  25605. -->
  25606. Firing elaborate*copy-dir-to-output-link
  25607. -->
  25608. (I3 ^dir R +)
  25609. inner elaboration loop at bottom goal.
  25610. Retracting elaborate*copy-see-to-output-link
  25611. -->
  25612. (I3 ^see 0 +)
  25613. Retracting propose*predict-no
  25614. -->
  25615. (O2140 ^name predict-no +)
  25616. (S1 ^operator O2140 +)
  25617. Retracting propose*predict-yes
  25618. -->
  25619. (O2139 ^name predict-yes +)
  25620. (S1 ^operator O2139 +)
  25621. Retracting elaborate*reward*based*on*reward
  25622. -->
  25623. (R1073 ^value 1 +)
  25624. (R1 ^reward R1073 +)
  25625. Retracting elaborate*copy-dir-to-output-link
  25626. -->
  25627. (I3 ^dir L +)
  25628. Retracting rl*prefer*rvt*predict-no*H0*2
  25629. -->
  25630. (S1 ^operator O2140 = 0.3140620866027386)
  25631. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*38
  25632. -->
  25633. (S1 ^operator O2140 = 0.6859161981217101)
  25634. Retracting rl*prefer*rvt*predict-yes*H0*1
  25635. -->
  25636. (S1 ^operator O2139 = 0.3804134437534242)
  25637. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*39
  25638. -->
  25639. (S1 ^operator O2139 = -0.3332708974800781)
  25640. =>WM: (15010: S1 ^operator O2142 +)
  25641. =>WM: (15009: S1 ^operator O2141 +)
  25642. =>WM: (15008: I3 ^dir R)
  25643. =>WM: (15007: O2142 ^name predict-no)
  25644. =>WM: (15006: O2141 ^name predict-yes)
  25645. =>WM: (15005: R1074 ^value 1)
  25646. =>WM: (15004: R1 ^reward R1074)
  25647. <=WM: (14995: S1 ^operator O2139 +)
  25648. <=WM: (14996: S1 ^operator O2140 +)
  25649. <=WM: (14997: S1 ^operator O2140)
  25650. <=WM: (14929: I3 ^dir L)
  25651. <=WM: (14991: R1 ^reward R1073)
  25652. <=WM: (14994: O2140 ^name predict-no)
  25653. <=WM: (14993: O2139 ^name predict-yes)
  25654. <=WM: (14992: R1073 ^value 1)
  25655. --- Inner Elaboration Phase, active level 1 (S1) ---
  25656. Firing prefer*rvt*predict-yes*H0
  25657. -->
  25658. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  25659. -->
  25660. (S1 ^operator O2141 = 0.7057943466848455)
  25661. Firing rl*prefer*rvt*predict-yes*H0*5
  25662. -->
  25663. (S1 ^operator O2141 = 0.2940539968979803)
  25664. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25665. -->
  25666. Firing prefer*rvt*predict-no*H0
  25667. -->
  25668. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  25669. -->
  25670. (S1 ^operator O2142 = -0.2023211881870005)
  25671. Firing rl*prefer*rvt*predict-no*H0*6
  25672. -->
  25673. (S1 ^operator O2142 = 0.2298602070490972)
  25674. Firing prefer*rvt*predict-no*H0*6*v1*H1
  25675. -->
  25676. inner elaboration loop at bottom goal.
  25677. Retracting rl*prefer*rvt*predict-no*H0*6
  25678. -->
  25679. (S1 ^operator O2140 = 0.2298602070490972)
  25680. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  25681. -->
  25682. (S1 ^operator O2140 = -0.2023211881870005)
  25683. Retracting rl*prefer*rvt*predict-yes*H0*5
  25684. -->
  25685. (S1 ^operator O2139 = 0.2940539968979803)
  25686. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  25687. -->
  25688. (S1 ^operator O2139 = 0.7057943466848455)
  25689. --- END Proposal Phase ---
  25690. --- Decision Phase ---
  25691. RL update rl*prefer*rvt*predict-no*H0*2 0.485063 -0.171001 0.314062 -> 0.485065 -0.171001 0.314064(R,m,v=1,0.87931,0.106737)
  25692. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*38 0.514919 0.170997 0.685916 -> 0.514921 0.170997 0.685918(R,m,v=1,1,0)
  25693. =>WM: (15011: S1 ^operator O2141)
  25694. 1071: O: O2141 (predict-yes)
  25695. --- END Decision Phase ---
  25696. --- Application Phase ---
  25697. --- Firing Productions (PE) For State At Depth 1 ---
  25698. --- Inner Elaboration Phase, active level 1 (S1) ---
  25699. Firing apply*operator
  25700. -->
  25701. (I3 ^predict-yes N1071 + :O )
  25702. Firing apply*operator*complete
  25703. -->
  25704. (I3 ^predict-no N1070 - :O )
  25705. inner elaboration loop at bottom goal.
  25706. --- Change Working Memory (PE) ---
  25707. =>WM: (15012: I3 ^predict-yes N1071)
  25708. <=WM: (14999: N1070 ^status complete)
  25709. <=WM: (14998: I3 ^predict-no N1070)
  25710. --- Firing Productions (IE) For State At Depth 1 ---
  25711. --- Inner Elaboration Phase, active level 1 (S1) ---
  25712. Firing monitor*world
  25713. -->
  25714. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25715. --- Change Working Memory (IE) ---
  25716. --- END Application Phase ---
  25717. --- Output Phase ---
  25718. ENV: Agent did: predict-yes for direction R in state State-A
  25719. In State-A moving R
  25720. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25721. predict error 0
  25722. dir: dir isU
  25723. --- END Output Phase ---
  25724. ---- Input Phase ---
  25725. =>WM: (15016: I2 ^dir U)
  25726. =>WM: (15015: I2 ^reward 1)
  25727. =>WM: (15014: I2 ^see 1)
  25728. =>WM: (15013: N1071 ^status complete)
  25729. <=WM: (15002: I2 ^dir R)
  25730. <=WM: (15001: I2 ^reward 1)
  25731. <=WM: (15000: I2 ^see 0)
  25732. =>WM: (15017: I2 ^level-1 R1-root)
  25733. <=WM: (15003: I2 ^level-1 L0-root)
  25734. --- END Input Phase ---
  25735. --- Proposal Phase ---
  25736. --- Inner Elaboration Phase, active level 1 (S1) ---
  25737. Firing elaborate*copy-see-to-output-link
  25738. -->
  25739. (I3 ^see 1 +)
  25740. Firing elaborate*reward*based*on*reward
  25741. -->
  25742. (R1075 ^value 1 +)
  25743. (R1 ^reward R1075 +)
  25744. Firing propose*predict-yes
  25745. -->
  25746. (O2143 ^name predict-yes +)
  25747. (S1 ^operator O2143 +)
  25748. Firing propose*predict-no
  25749. -->
  25750. (O2144 ^name predict-no +)
  25751. (S1 ^operator O2144 +)
  25752. Firing rl*prefer*rvt*predict-no*H0*4
  25753. -->
  25754. (S1 ^operator O2142 = 1.)
  25755. Firing rl*prefer*rvt*predict-yes*H0*3
  25756. -->
  25757. (S1 ^operator O2141 = 0.)
  25758. Firing prefer*rvt*predict-yes*H0
  25759. -->
  25760. Firing prefer*rvt*predict-no*H0
  25761. -->
  25762. Firing elaborate*copy-dir-to-output-link
  25763. -->
  25764. (I3 ^dir U +)
  25765. inner elaboration loop at bottom goal.
  25766. Retracting elaborate*copy-see-to-output-link
  25767. -->
  25768. (I3 ^see 0 +)
  25769. Retracting propose*predict-no
  25770. -->
  25771. (O2142 ^name predict-no +)
  25772. (S1 ^operator O2142 +)
  25773. Retracting propose*predict-yes
  25774. -->
  25775. (O2141 ^name predict-yes +)
  25776. (S1 ^operator O2141 +)
  25777. Retracting elaborate*reward*based*on*reward
  25778. -->
  25779. (R1074 ^value 1 +)
  25780. (R1 ^reward R1074 +)
  25781. Retracting elaborate*copy-dir-to-output-link
  25782. -->
  25783. (I3 ^dir R +)
  25784. Retracting rl*prefer*rvt*predict-no*H0*6
  25785. -->
  25786. (S1 ^operator O2142 = 0.2298602070490972)
  25787. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  25788. -->
  25789. (S1 ^operator O2142 = -0.2023211881870005)
  25790. Retracting rl*prefer*rvt*predict-yes*H0*5
  25791. -->
  25792. (S1 ^operator O2141 = 0.2940539968979803)
  25793. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  25794. -->
  25795. (S1 ^operator O2141 = 0.7057943466848455)
  25796. =>WM: (15025: S1 ^operator O2144 +)
  25797. =>WM: (15024: S1 ^operator O2143 +)
  25798. =>WM: (15023: I3 ^dir U)
  25799. =>WM: (15022: O2144 ^name predict-no)
  25800. =>WM: (15021: O2143 ^name predict-yes)
  25801. =>WM: (15020: R1075 ^value 1)
  25802. =>WM: (15019: R1 ^reward R1075)
  25803. =>WM: (15018: I3 ^see 1)
  25804. <=WM: (15009: S1 ^operator O2141 +)
  25805. <=WM: (15011: S1 ^operator O2141)
  25806. <=WM: (15010: S1 ^operator O2142 +)
  25807. <=WM: (15008: I3 ^dir R)
  25808. <=WM: (15004: R1 ^reward R1074)
  25809. <=WM: (14830: I3 ^see 0)
  25810. <=WM: (15007: O2142 ^name predict-no)
  25811. <=WM: (15006: O2141 ^name predict-yes)
  25812. <=WM: (15005: R1074 ^value 1)
  25813. --- Inner Elaboration Phase, active level 1 (S1) ---
  25814. Firing prefer*rvt*predict-yes*H0
  25815. -->
  25816. Firing rl*prefer*rvt*predict-yes*H0*3
  25817. -->
  25818. (S1 ^operator O2143 = 0.)
  25819. Firing prefer*rvt*predict-no*H0
  25820. -->
  25821. Firing rl*prefer*rvt*predict-no*H0*4
  25822. -->
  25823. (S1 ^operator O2144 = 1.)
  25824. inner elaboration loop at bottom goal.
  25825. Retracting rl*prefer*rvt*predict-no*H0*4
  25826. -->
  25827. (S1 ^operator O2142 = 1.)
  25828. Retracting rl*prefer*rvt*predict-yes*H0*3
  25829. -->
  25830. (S1 ^operator O2141 = 0.)
  25831. --- END Proposal Phase ---
  25832. --- Decision Phase ---
  25833. RL update rl*prefer*rvt*predict-yes*H0*5 0.501123 -0.207069 0.294054 -> 0.501134 -0.207068 0.294066(R,m,v=1,0.856287,0.123801)
  25834. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.49874 0.207054 0.705794 -> 0.498753 0.207056 0.705809(R,m,v=1,1,0)
  25835. =>WM: (15026: S1 ^operator O2144)
  25836. 1072: O: O2144 (predict-no)
  25837. --- END Decision Phase ---
  25838. --- Application Phase ---
  25839. --- Firing Productions (PE) For State At Depth 1 ---
  25840. --- Inner Elaboration Phase, active level 1 (S1) ---
  25841. Firing apply*operator
  25842. -->
  25843. (I3 ^predict-no N1072 + :O )
  25844. Firing apply*operator*complete
  25845. -->
  25846. (I3 ^predict-yes N1071 - :O )
  25847. inner elaboration loop at bottom goal.
  25848. --- Change Working Memory (PE) ---
  25849. =>WM: (15027: I3 ^predict-no N1072)
  25850. <=WM: (15013: N1071 ^status complete)
  25851. <=WM: (15012: I3 ^predict-yes N1071)
  25852. --- Firing Productions (IE) For State At Depth 1 ---
  25853. --- Inner Elaboration Phase, active level 1 (S1) ---
  25854. Firing monitor*world
  25855. -->
  25856. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25857. --- Change Working Memory (IE) ---
  25858. --- END Application Phase ---
  25859. --- Output Phase ---
  25860. ENV: Agent did: predict-no for direction U in state State-B
  25861. In State-B moving U
  25862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25863. predict error 0
  25864. dir: dir isR
  25865. --- END Output Phase ---
  25866. /|\--- Input Phase ---
  25867. =>WM: (15031: I2 ^dir R)
  25868. =>WM: (15030: I2 ^reward 1)
  25869. =>WM: (15029: I2 ^see 0)
  25870. =>WM: (15028: N1072 ^status complete)
  25871. <=WM: (15016: I2 ^dir U)
  25872. <=WM: (15015: I2 ^reward 1)
  25873. <=WM: (15014: I2 ^see 1)
  25874. =>WM: (15032: I2 ^level-1 R1-root)
  25875. <=WM: (15017: I2 ^level-1 R1-root)
  25876. --- END Input Phase ---
  25877. --- Proposal Phase ---
  25878. --- Inner Elaboration Phase, active level 1 (S1) ---
  25879. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  25880. -->
  25881. (S1 ^operator O2143 = -0.252585164213872)
  25882. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  25883. -->
  25884. (S1 ^operator O2144 = 0.770161537509104)
  25885. Firing prefer*rvt*predict-no*H0*6*v1*H1
  25886. -->
  25887. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25888. -->
  25889. Firing elaborate*copy-see-to-output-link
  25890. -->
  25891. (I3 ^see 0 +)
  25892. Firing elaborate*reward*based*on*reward
  25893. -->
  25894. (R1076 ^value 1 +)
  25895. (R1 ^reward R1076 +)
  25896. Firing propose*predict-yes
  25897. -->
  25898. (O2145 ^name predict-yes +)
  25899. (S1 ^operator O2145 +)
  25900. Firing propose*predict-no
  25901. -->
  25902. (O2146 ^name predict-no +)
  25903. (S1 ^operator O2146 +)
  25904. Firing rl*prefer*rvt*predict-no*H0*6
  25905. -->
  25906. (S1 ^operator O2144 = 0.2298602070490972)
  25907. Firing rl*prefer*rvt*predict-yes*H0*5
  25908. -->
  25909. (S1 ^operator O2143 = 0.2940663911910953)
  25910. Firing prefer*rvt*predict-yes*H0
  25911. -->
  25912. Firing prefer*rvt*predict-no*H0
  25913. -->
  25914. Firing elaborate*copy-dir-to-output-link
  25915. -->
  25916. (I3 ^dir R +)
  25917. inner elaboration loop at bottom goal.
  25918. Retracting elaborate*copy-see-to-output-link
  25919. -->
  25920. (I3 ^see 1 +)
  25921. Retracting propose*predict-no
  25922. -->
  25923. (O2144 ^name predict-no +)
  25924. (S1 ^operator O2144 +)
  25925. Retracting propose*predict-yes
  25926. -->
  25927. (O2143 ^name predict-yes +)
  25928. (S1 ^operator O2143 +)
  25929. Retracting elaborate*reward*based*on*reward
  25930. -->
  25931. (R1075 ^value 1 +)
  25932. (R1 ^reward R1075 +)
  25933. Retracting elaborate*copy-dir-to-output-link
  25934. -->
  25935. (I3 ^dir U +)
  25936. Retracting rl*prefer*rvt*predict-no*H0*4
  25937. -->
  25938. (S1 ^operator O2144 = 1.)
  25939. Retracting rl*prefer*rvt*predict-yes*H0*3
  25940. -->
  25941. (S1 ^operator O2143 = 0.)
  25942. =>WM: (15040: S1 ^operator O2146 +)
  25943. =>WM: (15039: S1 ^operator O2145 +)
  25944. =>WM: (15038: I3 ^dir R)
  25945. =>WM: (15037: O2146 ^name predict-no)
  25946. =>WM: (15036: O2145 ^name predict-yes)
  25947. =>WM: (15035: R1076 ^value 1)
  25948. =>WM: (15034: R1 ^reward R1076)
  25949. =>WM: (15033: I3 ^see 0)
  25950. <=WM: (15024: S1 ^operator O2143 +)
  25951. <=WM: (15025: S1 ^operator O2144 +)
  25952. <=WM: (15026: S1 ^operator O2144)
  25953. <=WM: (15023: I3 ^dir U)
  25954. <=WM: (15019: R1 ^reward R1075)
  25955. <=WM: (15018: I3 ^see 1)
  25956. <=WM: (15022: O2144 ^name predict-no)
  25957. <=WM: (15021: O2143 ^name predict-yes)
  25958. <=WM: (15020: R1075 ^value 1)
  25959. --- Inner Elaboration Phase, active level 1 (S1) ---
  25960. Firing prefer*rvt*predict-yes*H0
  25961. -->
  25962. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  25963. -->
  25964. (S1 ^operator O2145 = -0.252585164213872)
  25965. Firing rl*prefer*rvt*predict-yes*H0*5
  25966. -->
  25967. (S1 ^operator O2145 = 0.2940663911910953)
  25968. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25969. -->
  25970. Firing prefer*rvt*predict-no*H0
  25971. -->
  25972. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  25973. -->
  25974. (S1 ^operator O2146 = 0.770161537509104)
  25975. Firing rl*prefer*rvt*predict-no*H0*6
  25976. -->
  25977. (S1 ^operator O2146 = 0.2298602070490972)
  25978. Firing prefer*rvt*predict-no*H0*6*v1*H1
  25979. -->
  25980. inner elaboration loop at bottom goal.
  25981. Retracting rl*prefer*rvt*predict-no*H0*6
  25982. -->
  25983. (S1 ^operator O2144 = 0.2298602070490972)
  25984. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  25985. -->
  25986. (S1 ^operator O2144 = 0.770161537509104)
  25987. Retracting rl*prefer*rvt*predict-yes*H0*5
  25988. -->
  25989. (S1 ^operator O2143 = 0.2940663911910953)
  25990. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  25991. -->
  25992. (S1 ^operator O2143 = -0.252585164213872)
  25993. --- END Proposal Phase ---
  25994. --- Decision Phase ---
  25995. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  25996. =>WM: (15041: S1 ^operator O2146)
  25997. 1073: O: O2146 (predict-no)
  25998. --- END Decision Phase ---
  25999. --- Application Phase ---
  26000. --- Firing Productions (PE) For State At Depth 1 ---
  26001. --- Inner Elaboration Phase, active level 1 (S1) ---
  26002. Firing apply*operator
  26003. -->
  26004. (I3 ^predict-no N1073 + :O )
  26005. Firing apply*operator*complete
  26006. -->
  26007. (I3 ^predict-no N1072 - :O )
  26008. inner elaboration loop at bottom goal.
  26009. --- Change Working Memory (PE) ---
  26010. =>WM: (15042: I3 ^predict-no N1073)
  26011. <=WM: (15028: N1072 ^status complete)
  26012. <=WM: (15027: I3 ^predict-no N1072)
  26013. --- Firing Productions (IE) For State At Depth 1 ---
  26014. --- Inner Elaboration Phase, active level 1 (S1) ---
  26015. Firing monitor*world
  26016. -->
  26017. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26018. --- Change Working Memory (IE) ---
  26019. --- END Application Phase ---
  26020. --- Output Phase ---
  26021. ENV: Agent did: predict-no for direction R in state State-B
  26022. In State-B moving R
  26023. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26024. predict error 0
  26025. dir: dir isL
  26026. --- END Output Phase ---
  26027. -/|--- Input Phase ---
  26028. =>WM: (15046: I2 ^dir L)
  26029. =>WM: (15045: I2 ^reward 1)
  26030. =>WM: (15044: I2 ^see 0)
  26031. =>WM: (15043: N1073 ^status complete)
  26032. <=WM: (15031: I2 ^dir R)
  26033. <=WM: (15030: I2 ^reward 1)
  26034. <=WM: (15029: I2 ^see 0)
  26035. =>WM: (15047: I2 ^level-1 R0-root)
  26036. <=WM: (15032: I2 ^level-1 R1-root)
  26037. --- END Input Phase ---
  26038. --- Proposal Phase ---
  26039. --- Inner Elaboration Phase, active level 1 (S1) ---
  26040. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  26041. -->
  26042. (S1 ^operator O2145 = 0.6195760036479832)
  26043. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  26044. -->
  26045. (S1 ^operator O2146 = -0.2190661556260421)
  26046. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26047. -->
  26048. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26049. -->
  26050. Firing elaborate*copy-see-to-output-link
  26051. -->
  26052. (I3 ^see 0 +)
  26053. Firing elaborate*reward*based*on*reward
  26054. -->
  26055. (R1077 ^value 1 +)
  26056. (R1 ^reward R1077 +)
  26057. Firing propose*predict-yes
  26058. -->
  26059. (O2147 ^name predict-yes +)
  26060. (S1 ^operator O2147 +)
  26061. Firing propose*predict-no
  26062. -->
  26063. (O2148 ^name predict-no +)
  26064. (S1 ^operator O2148 +)
  26065. Firing rl*prefer*rvt*predict-no*H0*2
  26066. -->
  26067. (S1 ^operator O2146 = 0.3140638494766289)
  26068. Firing rl*prefer*rvt*predict-yes*H0*1
  26069. -->
  26070. (S1 ^operator O2145 = 0.3804134437534242)
  26071. Firing prefer*rvt*predict-yes*H0
  26072. -->
  26073. Firing prefer*rvt*predict-no*H0
  26074. -->
  26075. Firing elaborate*copy-dir-to-output-link
  26076. -->
  26077. (I3 ^dir L +)
  26078. inner elaboration loop at bottom goal.
  26079. Retracting elaborate*copy-see-to-output-link
  26080. -->
  26081. (I3 ^see 0 +)
  26082. Retracting propose*predict-no
  26083. -->
  26084. (O2146 ^name predict-no +)
  26085. (S1 ^operator O2146 +)
  26086. Retracting propose*predict-yes
  26087. -->
  26088. (O2145 ^name predict-yes +)
  26089. (S1 ^operator O2145 +)
  26090. Retracting elaborate*reward*based*on*reward
  26091. -->
  26092. (R1076 ^value 1 +)
  26093. (R1 ^reward R1076 +)
  26094. Retracting elaborate*copy-dir-to-output-link
  26095. -->
  26096. (I3 ^dir R +)
  26097. Retracting rl*prefer*rvt*predict-no*H0*6
  26098. -->
  26099. (S1 ^operator O2146 = 0.2298602070490972)
  26100. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  26101. -->
  26102. (S1 ^operator O2146 = 0.770161537509104)
  26103. Retracting rl*prefer*rvt*predict-yes*H0*5
  26104. -->
  26105. (S1 ^operator O2145 = 0.2940663911910953)
  26106. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  26107. -->
  26108. (S1 ^operator O2145 = -0.252585164213872)
  26109. =>WM: (15054: S1 ^operator O2148 +)
  26110. =>WM: (15053: S1 ^operator O2147 +)
  26111. =>WM: (15052: I3 ^dir L)
  26112. =>WM: (15051: O2148 ^name predict-no)
  26113. =>WM: (15050: O2147 ^name predict-yes)
  26114. =>WM: (15049: R1077 ^value 1)
  26115. =>WM: (15048: R1 ^reward R1077)
  26116. <=WM: (15039: S1 ^operator O2145 +)
  26117. <=WM: (15040: S1 ^operator O2146 +)
  26118. <=WM: (15041: S1 ^operator O2146)
  26119. <=WM: (15038: I3 ^dir R)
  26120. <=WM: (15034: R1 ^reward R1076)
  26121. <=WM: (15037: O2146 ^name predict-no)
  26122. <=WM: (15036: O2145 ^name predict-yes)
  26123. <=WM: (15035: R1076 ^value 1)
  26124. --- Inner Elaboration Phase, active level 1 (S1) ---
  26125. Firing prefer*rvt*predict-yes*H0
  26126. -->
  26127. Firing rl*prefer*rvt*predict-yes*H0*1
  26128. -->
  26129. (S1 ^operator O2147 = 0.3804134437534242)
  26130. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26131. -->
  26132. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  26133. -->
  26134. (S1 ^operator O2147 = 0.6195760036479832)
  26135. Firing prefer*rvt*predict-no*H0
  26136. -->
  26137. Firing rl*prefer*rvt*predict-no*H0*2
  26138. -->
  26139. (S1 ^operator O2148 = 0.3140638494766289)
  26140. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26141. -->
  26142. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  26143. -->
  26144. (S1 ^operator O2148 = -0.2190661556260421)
  26145. inner elaboration loop at bottom goal.
  26146. Retracting rl*prefer*rvt*predict-no*H0*2
  26147. -->
  26148. (S1 ^operator O2146 = 0.3140638494766289)
  26149. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  26150. -->
  26151. (S1 ^operator O2146 = -0.2190661556260421)
  26152. Retracting rl*prefer*rvt*predict-yes*H0*1
  26153. -->
  26154. (S1 ^operator O2145 = 0.3804134437534242)
  26155. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  26156. -->
  26157. (S1 ^operator O2145 = 0.6195760036479832)
  26158. --- END Proposal Phase ---
  26159. --- Decision Phase ---
  26160. RL update rl*prefer*rvt*predict-no*H0*6 0.611912 -0.382052 0.22986 -> 0.611911 -0.382052 0.229858(R,m,v=1,0.854839,0.12476)
  26161. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388105 0.382056 0.770162 -> 0.388104 0.382056 0.770159(R,m,v=1,1,0)
  26162. =>WM: (15055: S1 ^operator O2147)
  26163. 1074: O: O2147 (predict-yes)
  26164. --- END Decision Phase ---
  26165. --- Application Phase ---
  26166. --- Firing Productions (PE) For State At Depth 1 ---
  26167. --- Inner Elaboration Phase, active level 1 (S1) ---
  26168. Firing apply*operator
  26169. -->
  26170. (I3 ^predict-yes N1074 + :O )
  26171. Firing apply*operator*complete
  26172. -->
  26173. (I3 ^predict-no N1073 - :O )
  26174. inner elaboration loop at bottom goal.
  26175. --- Change Working Memory (PE) ---
  26176. =>WM: (15056: I3 ^predict-yes N1074)
  26177. <=WM: (15043: N1073 ^status complete)
  26178. <=WM: (15042: I3 ^predict-no N1073)
  26179. --- Firing Productions (IE) For State At Depth 1 ---
  26180. --- Inner Elaboration Phase, active level 1 (S1) ---
  26181. Firing monitor*world
  26182. -->
  26183. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26184. --- Change Working Memory (IE) ---
  26185. --- END Application Phase ---
  26186. --- Output Phase ---
  26187. ENV: Agent did: predict-yes for direction L in state State-B
  26188. In State-B moving L
  26189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26190. predict error 0
  26191. dir: dir isU
  26192. --- END Output Phase ---
  26193. \-/--- Input Phase ---
  26194. =>WM: (15060: I2 ^dir U)
  26195. =>WM: (15059: I2 ^reward 1)
  26196. =>WM: (15058: I2 ^see 1)
  26197. =>WM: (15057: N1074 ^status complete)
  26198. <=WM: (15046: I2 ^dir L)
  26199. <=WM: (15045: I2 ^reward 1)
  26200. <=WM: (15044: I2 ^see 0)
  26201. =>WM: (15061: I2 ^level-1 L1-root)
  26202. <=WM: (15047: I2 ^level-1 R0-root)
  26203. --- END Input Phase ---
  26204. --- Proposal Phase ---
  26205. --- Inner Elaboration Phase, active level 1 (S1) ---
  26206. Firing elaborate*copy-see-to-output-link
  26207. -->
  26208. (I3 ^see 1 +)
  26209. Firing elaborate*reward*based*on*reward
  26210. -->
  26211. (R1078 ^value 1 +)
  26212. (R1 ^reward R1078 +)
  26213. Firing propose*predict-yes
  26214. -->
  26215. (O2149 ^name predict-yes +)
  26216. (S1 ^operator O2149 +)
  26217. Firing propose*predict-no
  26218. -->
  26219. (O2150 ^name predict-no +)
  26220. (S1 ^operator O2150 +)
  26221. Firing rl*prefer*rvt*predict-no*H0*4
  26222. -->
  26223. (S1 ^operator O2148 = 1.)
  26224. Firing rl*prefer*rvt*predict-yes*H0*3
  26225. -->
  26226. (S1 ^operator O2147 = 0.)
  26227. Firing prefer*rvt*predict-yes*H0
  26228. -->
  26229. Firing prefer*rvt*predict-no*H0
  26230. -->
  26231. Firing elaborate*copy-dir-to-output-link
  26232. -->
  26233. (I3 ^dir U +)
  26234. inner elaboration loop at bottom goal.
  26235. Retracting elaborate*copy-see-to-output-link
  26236. -->
  26237. (I3 ^see 0 +)
  26238. Retracting propose*predict-no
  26239. -->
  26240. (O2148 ^name predict-no +)
  26241. (S1 ^operator O2148 +)
  26242. Retracting propose*predict-yes
  26243. -->
  26244. (O2147 ^name predict-yes +)
  26245. (S1 ^operator O2147 +)
  26246. Retracting elaborate*reward*based*on*reward
  26247. -->
  26248. (R1077 ^value 1 +)
  26249. (R1 ^reward R1077 +)
  26250. Retracting elaborate*copy-dir-to-output-link
  26251. -->
  26252. (I3 ^dir L +)
  26253. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  26254. -->
  26255. (S1 ^operator O2148 = -0.2190661556260421)
  26256. Retracting rl*prefer*rvt*predict-no*H0*2
  26257. -->
  26258. (S1 ^operator O2148 = 0.3140638494766289)
  26259. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  26260. -->
  26261. (S1 ^operator O2147 = 0.6195760036479832)
  26262. Retracting rl*prefer*rvt*predict-yes*H0*1
  26263. -->
  26264. (S1 ^operator O2147 = 0.3804134437534242)
  26265. =>WM: (15069: S1 ^operator O2150 +)
  26266. =>WM: (15068: S1 ^operator O2149 +)
  26267. =>WM: (15067: I3 ^dir U)
  26268. =>WM: (15066: O2150 ^name predict-no)
  26269. =>WM: (15065: O2149 ^name predict-yes)
  26270. =>WM: (15064: R1078 ^value 1)
  26271. =>WM: (15063: R1 ^reward R1078)
  26272. =>WM: (15062: I3 ^see 1)
  26273. <=WM: (15053: S1 ^operator O2147 +)
  26274. <=WM: (15055: S1 ^operator O2147)
  26275. <=WM: (15054: S1 ^operator O2148 +)
  26276. <=WM: (15052: I3 ^dir L)
  26277. <=WM: (15048: R1 ^reward R1077)
  26278. <=WM: (15033: I3 ^see 0)
  26279. <=WM: (15051: O2148 ^name predict-no)
  26280. <=WM: (15050: O2147 ^name predict-yes)
  26281. <=WM: (15049: R1077 ^value 1)
  26282. --- Inner Elaboration Phase, active level 1 (S1) ---
  26283. Firing prefer*rvt*predict-yes*H0
  26284. -->
  26285. Firing rl*prefer*rvt*predict-yes*H0*3
  26286. -->
  26287. (S1 ^operator O2149 = 0.)
  26288. Firing prefer*rvt*predict-no*H0
  26289. -->
  26290. Firing rl*prefer*rvt*predict-no*H0*4
  26291. -->
  26292. (S1 ^operator O2150 = 1.)
  26293. inner elaboration loop at bottom goal.
  26294. Retracting rl*prefer*rvt*predict-no*H0*4
  26295. -->
  26296. (S1 ^operator O2148 = 1.)
  26297. Retracting rl*prefer*rvt*predict-yes*H0*3
  26298. -->
  26299. (S1 ^operator O2147 = 0.)
  26300. --- END Proposal Phase ---
  26301. --- Decision Phase ---
  26302. RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.841808,0.133924)
  26303. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478645 0.140931 0.619576 -> 0.478646 0.140931 0.619577(R,m,v=1,1,0)
  26304. =>WM: (15070: S1 ^operator O2150)
  26305. 1075: O: O2150 (predict-no)
  26306. --- END Decision Phase ---
  26307. --- Application Phase ---
  26308. --- Firing Productions (PE) For State At Depth 1 ---
  26309. --- Inner Elaboration Phase, active level 1 (S1) ---
  26310. Firing apply*operator
  26311. -->
  26312. (I3 ^predict-no N1075 + :O )
  26313. Firing apply*operator*complete
  26314. -->
  26315. (I3 ^predict-yes N1074 - :O )
  26316. inner elaboration loop at bottom goal.
  26317. --- Change Working Memory (PE) ---
  26318. =>WM: (15071: I3 ^predict-no N1075)
  26319. <=WM: (15057: N1074 ^status complete)
  26320. <=WM: (15056: I3 ^predict-yes N1074)
  26321. --- Firing Productions (IE) For State At Depth 1 ---
  26322. --- Inner Elaboration Phase, active level 1 (S1) ---
  26323. Firing monitor*world
  26324. -->
  26325. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26326. --- Change Working Memory (IE) ---
  26327. --- END Application Phase ---
  26328. --- Output Phase ---
  26329. ENV: Agent did: predict-no for direction U in state State-A
  26330. In State-A moving U
  26331. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26332. predict error 0
  26333. dir: dir isL
  26334. --- END Output Phase ---
  26335. |\---- Input Phase ---
  26336. =>WM: (15075: I2 ^dir L)
  26337. =>WM: (15074: I2 ^reward 1)
  26338. =>WM: (15073: I2 ^see 0)
  26339. =>WM: (15072: N1075 ^status complete)
  26340. <=WM: (15060: I2 ^dir U)
  26341. <=WM: (15059: I2 ^reward 1)
  26342. <=WM: (15058: I2 ^see 1)
  26343. =>WM: (15076: I2 ^level-1 L1-root)
  26344. <=WM: (15061: I2 ^level-1 L1-root)
  26345. --- END Input Phase ---
  26346. --- Proposal Phase ---
  26347. --- Inner Elaboration Phase, active level 1 (S1) ---
  26348. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  26349. -->
  26350. (S1 ^operator O2149 = -0.3470159027404986)
  26351. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  26352. -->
  26353. (S1 ^operator O2150 = 0.6860475006196615)
  26354. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26355. -->
  26356. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26357. -->
  26358. Firing elaborate*copy-see-to-output-link
  26359. -->
  26360. (I3 ^see 0 +)
  26361. Firing elaborate*reward*based*on*reward
  26362. -->
  26363. (R1079 ^value 1 +)
  26364. (R1 ^reward R1079 +)
  26365. Firing propose*predict-yes
  26366. -->
  26367. (O2151 ^name predict-yes +)
  26368. (S1 ^operator O2151 +)
  26369. Firing propose*predict-no
  26370. -->
  26371. (O2152 ^name predict-no +)
  26372. (S1 ^operator O2152 +)
  26373. Firing rl*prefer*rvt*predict-no*H0*2
  26374. -->
  26375. (S1 ^operator O2150 = 0.3140638494766289)
  26376. Firing rl*prefer*rvt*predict-yes*H0*1
  26377. -->
  26378. (S1 ^operator O2149 = 0.3804142980557849)
  26379. Firing prefer*rvt*predict-yes*H0
  26380. -->
  26381. Firing prefer*rvt*predict-no*H0
  26382. -->
  26383. Firing elaborate*copy-dir-to-output-link
  26384. -->
  26385. (I3 ^dir L +)
  26386. inner elaboration loop at bottom goal.
  26387. Retracting elaborate*copy-see-to-output-link
  26388. -->
  26389. (I3 ^see 1 +)
  26390. Retracting propose*predict-no
  26391. -->
  26392. (O2150 ^name predict-no +)
  26393. (S1 ^operator O2150 +)
  26394. Retracting propose*predict-yes
  26395. -->
  26396. (O2149 ^name predict-yes +)
  26397. (S1 ^operator O2149 +)
  26398. Retracting elaborate*reward*based*on*reward
  26399. -->
  26400. (R1078 ^value 1 +)
  26401. (R1 ^reward R1078 +)
  26402. Retracting elaborate*copy-dir-to-output-link
  26403. -->
  26404. (I3 ^dir U +)
  26405. Retracting rl*prefer*rvt*predict-no*H0*4
  26406. -->
  26407. (S1 ^operator O2150 = 1.)
  26408. Retracting rl*prefer*rvt*predict-yes*H0*3
  26409. -->
  26410. (S1 ^operator O2149 = 0.)
  26411. =>WM: (15084: S1 ^operator O2152 +)
  26412. =>WM: (15083: S1 ^operator O2151 +)
  26413. =>WM: (15082: I3 ^dir L)
  26414. =>WM: (15081: O2152 ^name predict-no)
  26415. =>WM: (15080: O2151 ^name predict-yes)
  26416. =>WM: (15079: R1079 ^value 1)
  26417. =>WM: (15078: R1 ^reward R1079)
  26418. =>WM: (15077: I3 ^see 0)
  26419. <=WM: (15068: S1 ^operator O2149 +)
  26420. <=WM: (15069: S1 ^operator O2150 +)
  26421. <=WM: (15070: S1 ^operator O2150)
  26422. <=WM: (15067: I3 ^dir U)
  26423. <=WM: (15063: R1 ^reward R1078)
  26424. <=WM: (15062: I3 ^see 1)
  26425. <=WM: (15066: O2150 ^name predict-no)
  26426. <=WM: (15065: O2149 ^name predict-yes)
  26427. <=WM: (15064: R1078 ^value 1)
  26428. --- Inner Elaboration Phase, active level 1 (S1) ---
  26429. Firing prefer*rvt*predict-yes*H0
  26430. -->
  26431. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  26432. -->
  26433. (S1 ^operator O2151 = -0.3470159027404986)
  26434. Firing rl*prefer*rvt*predict-yes*H0*1
  26435. -->
  26436. (S1 ^operator O2151 = 0.3804142980557849)
  26437. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26438. -->
  26439. Firing prefer*rvt*predict-no*H0
  26440. -->
  26441. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  26442. -->
  26443. (S1 ^operator O2152 = 0.6860475006196615)
  26444. Firing rl*prefer*rvt*predict-no*H0*2
  26445. -->
  26446. (S1 ^operator O2152 = 0.3140638494766289)
  26447. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26448. -->
  26449. inner elaboration loop at bottom goal.
  26450. Retracting rl*prefer*rvt*predict-no*H0*2
  26451. -->
  26452. (S1 ^operator O2150 = 0.3140638494766289)
  26453. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  26454. -->
  26455. (S1 ^operator O2150 = 0.6860475006196615)
  26456. Retracting rl*prefer*rvt*predict-yes*H0*1
  26457. -->
  26458. (S1 ^operator O2149 = 0.3804142980557849)
  26459. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  26460. -->
  26461. (S1 ^operator O2149 = -0.3470159027404986)
  26462. --- END Proposal Phase ---
  26463. --- Decision Phase ---
  26464. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  26465. =>WM: (15085: S1 ^operator O2152)
  26466. 1076: O: O2152 (predict-no)
  26467. --- END Decision Phase ---
  26468. --- Application Phase ---
  26469. --- Firing Productions (PE) For State At Depth 1 ---
  26470. --- Inner Elaboration Phase, active level 1 (S1) ---
  26471. Firing apply*operator
  26472. -->
  26473. (I3 ^predict-no N1076 + :O )
  26474. Firing apply*operator*complete
  26475. -->
  26476. (I3 ^predict-no N1075 - :O )
  26477. inner elaboration loop at bottom goal.
  26478. --- Change Working Memory (PE) ---
  26479. =>WM: (15086: I3 ^predict-no N1076)
  26480. <=WM: (15072: N1075 ^status complete)
  26481. <=WM: (15071: I3 ^predict-no N1075)
  26482. --- Firing Productions (IE) For State At Depth 1 ---
  26483. --- Inner Elaboration Phase, active level 1 (S1) ---
  26484. Firing monitor*world
  26485. -->
  26486. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26487. --- Change Working Memory (IE) ---
  26488. --- END Application Phase ---
  26489. --- Output Phase ---
  26490. ENV: Agent did: predict-no for direction L in state State-A
  26491. In State-A moving L
  26492. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26493. predict error 0
  26494. dir: dir isR
  26495. --- END Output Phase ---
  26496. /|--- Input Phase ---
  26497. =>WM: (15090: I2 ^dir R)
  26498. =>WM: (15089: I2 ^reward 1)
  26499. =>WM: (15088: I2 ^see 0)
  26500. =>WM: (15087: N1076 ^status complete)
  26501. <=WM: (15075: I2 ^dir L)
  26502. <=WM: (15074: I2 ^reward 1)
  26503. <=WM: (15073: I2 ^see 0)
  26504. =>WM: (15091: I2 ^level-1 L0-root)
  26505. <=WM: (15076: I2 ^level-1 L1-root)
  26506. --- END Input Phase ---
  26507. --- Proposal Phase ---
  26508. --- Inner Elaboration Phase, active level 1 (S1) ---
  26509. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26510. -->
  26511. (S1 ^operator O2151 = 0.7058089158850139)
  26512. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  26513. -->
  26514. (S1 ^operator O2152 = -0.2023211881870005)
  26515. Firing prefer*rvt*predict-no*H0*6*v1*H1
  26516. -->
  26517. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26518. -->
  26519. Firing elaborate*copy-see-to-output-link
  26520. -->
  26521. (I3 ^see 0 +)
  26522. Firing elaborate*reward*based*on*reward
  26523. -->
  26524. (R1080 ^value 1 +)
  26525. (R1 ^reward R1080 +)
  26526. Firing propose*predict-yes
  26527. -->
  26528. (O2153 ^name predict-yes +)
  26529. (S1 ^operator O2153 +)
  26530. Firing propose*predict-no
  26531. -->
  26532. (O2154 ^name predict-no +)
  26533. (S1 ^operator O2154 +)
  26534. Firing rl*prefer*rvt*predict-no*H0*6
  26535. -->
  26536. (S1 ^operator O2152 = 0.229858460707707)
  26537. Firing rl*prefer*rvt*predict-yes*H0*5
  26538. -->
  26539. (S1 ^operator O2151 = 0.2940663911910953)
  26540. Firing prefer*rvt*predict-yes*H0
  26541. -->
  26542. Firing prefer*rvt*predict-no*H0
  26543. -->
  26544. Firing elaborate*copy-dir-to-output-link
  26545. -->
  26546. (I3 ^dir R +)
  26547. inner elaboration loop at bottom goal.
  26548. Retracting elaborate*copy-see-to-output-link
  26549. -->
  26550. (I3 ^see 0 +)
  26551. Retracting propose*predict-no
  26552. -->
  26553. (O2152 ^name predict-no +)
  26554. (S1 ^operator O2152 +)
  26555. Retracting propose*predict-yes
  26556. -->
  26557. (O2151 ^name predict-yes +)
  26558. (S1 ^operator O2151 +)
  26559. Retracting elaborate*reward*based*on*reward
  26560. -->
  26561. (R1079 ^value 1 +)
  26562. (R1 ^reward R1079 +)
  26563. Retracting elaborate*copy-dir-to-output-link
  26564. -->
  26565. (I3 ^dir L +)
  26566. Retracting rl*prefer*rvt*predict-no*H0*2
  26567. -->
  26568. (S1 ^operator O2152 = 0.3140638494766289)
  26569. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  26570. -->
  26571. (S1 ^operator O2152 = 0.6860475006196615)
  26572. Retracting rl*prefer*rvt*predict-yes*H0*1
  26573. -->
  26574. (S1 ^operator O2151 = 0.3804142980557849)
  26575. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  26576. -->
  26577. (S1 ^operator O2151 = -0.3470159027404986)
  26578. =>WM: (15098: S1 ^operator O2154 +)
  26579. =>WM: (15097: S1 ^operator O2153 +)
  26580. =>WM: (15096: I3 ^dir R)
  26581. =>WM: (15095: O2154 ^name predict-no)
  26582. =>WM: (15094: O2153 ^name predict-yes)
  26583. =>WM: (15093: R1080 ^value 1)
  26584. =>WM: (15092: R1 ^reward R1080)
  26585. <=WM: (15083: S1 ^operator O2151 +)
  26586. <=WM: (15084: S1 ^operator O2152 +)
  26587. <=WM: (15085: S1 ^operator O2152)
  26588. <=WM: (15082: I3 ^dir L)
  26589. <=WM: (15078: R1 ^reward R1079)
  26590. <=WM: (15081: O2152 ^name predict-no)
  26591. <=WM: (15080: O2151 ^name predict-yes)
  26592. <=WM: (15079: R1079 ^value 1)
  26593. --- Inner Elaboration Phase, active level 1 (S1) ---
  26594. Firing prefer*rvt*predict-yes*H0
  26595. -->
  26596. Firing rl*prefer*rvt*predict-yes*H0*5
  26597. -->
  26598. (S1 ^operator O2153 = 0.2940663911910953)
  26599. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26600. -->
  26601. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26602. -->
  26603. (S1 ^operator O2153 = 0.7058089158850139)
  26604. Firing prefer*rvt*predict-no*H0
  26605. -->
  26606. Firing rl*prefer*rvt*predict-no*H0*6
  26607. -->
  26608. (S1 ^operator O2154 = 0.229858460707707)
  26609. Firing prefer*rvt*predict-no*H0*6*v1*H1
  26610. -->
  26611. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  26612. -->
  26613. (S1 ^operator O2154 = -0.2023211881870005)
  26614. inner elaboration loop at bottom goal.
  26615. Retracting rl*prefer*rvt*predict-no*H0*6
  26616. -->
  26617. (S1 ^operator O2152 = 0.229858460707707)
  26618. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  26619. -->
  26620. (S1 ^operator O2152 = -0.2023211881870005)
  26621. Retracting rl*prefer*rvt*predict-yes*H0*5
  26622. -->
  26623. (S1 ^operator O2151 = 0.2940663911910953)
  26624. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26625. -->
  26626. (S1 ^operator O2151 = 0.7058089158850139)
  26627. --- END Proposal Phase ---
  26628. --- Decision Phase ---
  26629. RL update rl*prefer*rvt*predict-no*H0*2 0.485065 -0.171001 0.314064 -> 0.485058 -0.171003 0.314055(R,m,v=1,0.88,0.106207)
  26630. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515023 0.171024 0.686048 -> 0.515015 0.171022 0.686037(R,m,v=1,1,0)
  26631. =>WM: (15099: S1 ^operator O2153)
  26632. 1077: O: O2153 (predict-yes)
  26633. --- END Decision Phase ---
  26634. --- Application Phase ---
  26635. --- Firing Productions (PE) For State At Depth 1 ---
  26636. --- Inner Elaboration Phase, active level 1 (S1) ---
  26637. Firing apply*operator
  26638. -->
  26639. (I3 ^predict-yes N1077 + :O )
  26640. Firing apply*operator*complete
  26641. -->
  26642. (I3 ^predict-no N1076 - :O )
  26643. inner elaboration loop at bottom goal.
  26644. --- Change Working Memory (PE) ---
  26645. =>WM: (15100: I3 ^predict-yes N1077)
  26646. <=WM: (15087: N1076 ^status complete)
  26647. <=WM: (15086: I3 ^predict-no N1076)
  26648. --- Firing Productions (IE) For State At Depth 1 ---
  26649. --- Inner Elaboration Phase, active level 1 (S1) ---
  26650. Firing monitor*world
  26651. -->
  26652. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26653. --- Change Working Memory (IE) ---
  26654. --- END Application Phase ---
  26655. --- Output Phase ---
  26656. ENV: Agent did: predict-yes for direction R in state State-A
  26657. In State-A moving R
  26658. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  26659. predict error 0
  26660. dir: dir isL
  26661. --- END Output Phase ---
  26662. \-/--- Input Phase ---
  26663. =>WM: (15104: I2 ^dir L)
  26664. =>WM: (15103: I2 ^reward 1)
  26665. =>WM: (15102: I2 ^see 1)
  26666. =>WM: (15101: N1077 ^status complete)
  26667. <=WM: (15090: I2 ^dir R)
  26668. <=WM: (15089: I2 ^reward 1)
  26669. <=WM: (15088: I2 ^see 0)
  26670. =>WM: (15105: I2 ^level-1 R1-root)
  26671. <=WM: (15091: I2 ^level-1 L0-root)
  26672. --- END Input Phase ---
  26673. --- Proposal Phase ---
  26674. --- Inner Elaboration Phase, active level 1 (S1) ---
  26675. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  26676. -->
  26677. (S1 ^operator O2153 = 0.6195991016645057)
  26678. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  26679. -->
  26680. (S1 ^operator O2154 = -0.1479504104026684)
  26681. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26682. -->
  26683. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26684. -->
  26685. Firing elaborate*copy-see-to-output-link
  26686. -->
  26687. (I3 ^see 1 +)
  26688. Firing elaborate*reward*based*on*reward
  26689. -->
  26690. (R1081 ^value 1 +)
  26691. (R1 ^reward R1081 +)
  26692. Firing propose*predict-yes
  26693. -->
  26694. (O2155 ^name predict-yes +)
  26695. (S1 ^operator O2155 +)
  26696. Firing propose*predict-no
  26697. -->
  26698. (O2156 ^name predict-no +)
  26699. (S1 ^operator O2156 +)
  26700. Firing rl*prefer*rvt*predict-no*H0*2
  26701. -->
  26702. (S1 ^operator O2154 = 0.3140548183361512)
  26703. Firing rl*prefer*rvt*predict-yes*H0*1
  26704. -->
  26705. (S1 ^operator O2153 = 0.3804142980557849)
  26706. Firing prefer*rvt*predict-yes*H0
  26707. -->
  26708. Firing prefer*rvt*predict-no*H0
  26709. -->
  26710. Firing elaborate*copy-dir-to-output-link
  26711. -->
  26712. (I3 ^dir L +)
  26713. inner elaboration loop at bottom goal.
  26714. Retracting elaborate*copy-see-to-output-link
  26715. -->
  26716. (I3 ^see 0 +)
  26717. Retracting propose*predict-no
  26718. -->
  26719. (O2154 ^name predict-no +)
  26720. (S1 ^operator O2154 +)
  26721. Retracting propose*predict-yes
  26722. -->
  26723. (O2153 ^name predict-yes +)
  26724. (S1 ^operator O2153 +)
  26725. Retracting elaborate*reward*based*on*reward
  26726. -->
  26727. (R1080 ^value 1 +)
  26728. (R1 ^reward R1080 +)
  26729. Retracting elaborate*copy-dir-to-output-link
  26730. -->
  26731. (I3 ^dir R +)
  26732. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  26733. -->
  26734. (S1 ^operator O2154 = -0.2023211881870005)
  26735. Retracting rl*prefer*rvt*predict-no*H0*6
  26736. -->
  26737. (S1 ^operator O2154 = 0.229858460707707)
  26738. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26739. -->
  26740. (S1 ^operator O2153 = 0.7058089158850139)
  26741. Retracting rl*prefer*rvt*predict-yes*H0*5
  26742. -->
  26743. (S1 ^operator O2153 = 0.2940663911910953)
  26744. =>WM: (15113: S1 ^operator O2156 +)
  26745. =>WM: (15112: S1 ^operator O2155 +)
  26746. =>WM: (15111: I3 ^dir L)
  26747. =>WM: (15110: O2156 ^name predict-no)
  26748. =>WM: (15109: O2155 ^name predict-yes)
  26749. =>WM: (15108: R1081 ^value 1)
  26750. =>WM: (15107: R1 ^reward R1081)
  26751. =>WM: (15106: I3 ^see 1)
  26752. <=WM: (15097: S1 ^operator O2153 +)
  26753. <=WM: (15099: S1 ^operator O2153)
  26754. <=WM: (15098: S1 ^operator O2154 +)
  26755. <=WM: (15096: I3 ^dir R)
  26756. <=WM: (15092: R1 ^reward R1080)
  26757. <=WM: (15077: I3 ^see 0)
  26758. <=WM: (15095: O2154 ^name predict-no)
  26759. <=WM: (15094: O2153 ^name predict-yes)
  26760. <=WM: (15093: R1080 ^value 1)
  26761. --- Inner Elaboration Phase, active level 1 (S1) ---
  26762. Firing prefer*rvt*predict-yes*H0
  26763. -->
  26764. Firing rl*prefer*rvt*predict-yes*H0*1
  26765. -->
  26766. (S1 ^operator O2155 = 0.3804142980557849)
  26767. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26768. -->
  26769. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  26770. -->
  26771. (S1 ^operator O2155 = 0.6195991016645057)
  26772. Firing prefer*rvt*predict-no*H0
  26773. -->
  26774. Firing rl*prefer*rvt*predict-no*H0*2
  26775. -->
  26776. (S1 ^operator O2156 = 0.3140548183361512)
  26777. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26778. -->
  26779. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  26780. -->
  26781. (S1 ^operator O2156 = -0.1479504104026684)
  26782. inner elaboration loop at bottom goal.
  26783. Retracting rl*prefer*rvt*predict-no*H0*2
  26784. -->
  26785. (S1 ^operator O2154 = 0.3140548183361512)
  26786. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  26787. -->
  26788. (S1 ^operator O2154 = -0.1479504104026684)
  26789. Retracting rl*prefer*rvt*predict-yes*H0*1
  26790. -->
  26791. (S1 ^operator O2153 = 0.3804142980557849)
  26792. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  26793. -->
  26794. (S1 ^operator O2153 = 0.6195991016645057)
  26795. --- END Proposal Phase ---
  26796. --- Decision Phase ---
  26797. RL update rl*prefer*rvt*predict-yes*H0*5 0.501134 -0.207068 0.294066 -> 0.501143 -0.207067 0.294077(R,m,v=1,0.857143,0.123182)
  26798. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498753 0.207056 0.705809 -> 0.498764 0.207057 0.705821(R,m,v=1,1,0)
  26799. =>WM: (15114: S1 ^operator O2155)
  26800. 1078: O: O2155 (predict-yes)
  26801. --- END Decision Phase ---
  26802. --- Application Phase ---
  26803. --- Firing Productions (PE) For State At Depth 1 ---
  26804. --- Inner Elaboration Phase, active level 1 (S1) ---
  26805. Firing apply*operator
  26806. -->
  26807. (I3 ^predict-yes N1078 + :O )
  26808. Firing apply*operator*complete
  26809. -->
  26810. (I3 ^predict-yes N1077 - :O )
  26811. inner elaboration loop at bottom goal.
  26812. --- Change Working Memory (PE) ---
  26813. =>WM: (15115: I3 ^predict-yes N1078)
  26814. <=WM: (15101: N1077 ^status complete)
  26815. <=WM: (15100: I3 ^predict-yes N1077)
  26816. --- Firing Productions (IE) For State At Depth 1 ---
  26817. --- Inner Elaboration Phase, active level 1 (S1) ---
  26818. Firing monitor*world
  26819. -->
  26820. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26821. --- Change Working Memory (IE) ---
  26822. --- END Application Phase ---
  26823. --- Output Phase ---
  26824. ENV: Agent did: predict-yes for direction L in state State-B
  26825. In State-B moving L
  26826. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26827. predict error 0
  26828. dir: dir isU
  26829. --- END Output Phase ---
  26830. |\---- Input Phase ---
  26831. =>WM: (15119: I2 ^dir U)
  26832. =>WM: (15118: I2 ^reward 1)
  26833. =>WM: (15117: I2 ^see 1)
  26834. =>WM: (15116: N1078 ^status complete)
  26835. <=WM: (15104: I2 ^dir L)
  26836. <=WM: (15103: I2 ^reward 1)
  26837. <=WM: (15102: I2 ^see 1)
  26838. =>WM: (15120: I2 ^level-1 L1-root)
  26839. <=WM: (15105: I2 ^level-1 R1-root)
  26840. --- END Input Phase ---
  26841. --- Proposal Phase ---
  26842. --- Inner Elaboration Phase, active level 1 (S1) ---
  26843. Firing elaborate*copy-see-to-output-link
  26844. -->
  26845. (I3 ^see 1 +)
  26846. Firing elaborate*reward*based*on*reward
  26847. -->
  26848. (R1082 ^value 1 +)
  26849. (R1 ^reward R1082 +)
  26850. Firing propose*predict-yes
  26851. -->
  26852. (O2157 ^name predict-yes +)
  26853. (S1 ^operator O2157 +)
  26854. Firing propose*predict-no
  26855. -->
  26856. (O2158 ^name predict-no +)
  26857. (S1 ^operator O2158 +)
  26858. Firing rl*prefer*rvt*predict-no*H0*4
  26859. -->
  26860. (S1 ^operator O2156 = 1.)
  26861. Firing rl*prefer*rvt*predict-yes*H0*3
  26862. -->
  26863. (S1 ^operator O2155 = 0.)
  26864. Firing prefer*rvt*predict-yes*H0
  26865. -->
  26866. Firing prefer*rvt*predict-no*H0
  26867. -->
  26868. Firing elaborate*copy-dir-to-output-link
  26869. -->
  26870. (I3 ^dir U +)
  26871. inner elaboration loop at bottom goal.
  26872. Retracting elaborate*copy-see-to-output-link
  26873. -->
  26874. (I3 ^see 1 +)
  26875. Retracting propose*predict-no
  26876. -->
  26877. (O2156 ^name predict-no +)
  26878. (S1 ^operator O2156 +)
  26879. Retracting propose*predict-yes
  26880. -->
  26881. (O2155 ^name predict-yes +)
  26882. (S1 ^operator O2155 +)
  26883. Retracting elaborate*reward*based*on*reward
  26884. -->
  26885. (R1081 ^value 1 +)
  26886. (R1 ^reward R1081 +)
  26887. Retracting elaborate*copy-dir-to-output-link
  26888. -->
  26889. (I3 ^dir L +)
  26890. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  26891. -->
  26892. (S1 ^operator O2156 = -0.1479504104026684)
  26893. Retracting rl*prefer*rvt*predict-no*H0*2
  26894. -->
  26895. (S1 ^operator O2156 = 0.3140548183361512)
  26896. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  26897. -->
  26898. (S1 ^operator O2155 = 0.6195991016645057)
  26899. Retracting rl*prefer*rvt*predict-yes*H0*1
  26900. -->
  26901. (S1 ^operator O2155 = 0.3804142980557849)
  26902. =>WM: (15127: S1 ^operator O2158 +)
  26903. =>WM: (15126: S1 ^operator O2157 +)
  26904. =>WM: (15125: I3 ^dir U)
  26905. =>WM: (15124: O2158 ^name predict-no)
  26906. =>WM: (15123: O2157 ^name predict-yes)
  26907. =>WM: (15122: R1082 ^value 1)
  26908. =>WM: (15121: R1 ^reward R1082)
  26909. <=WM: (15112: S1 ^operator O2155 +)
  26910. <=WM: (15114: S1 ^operator O2155)
  26911. <=WM: (15113: S1 ^operator O2156 +)
  26912. <=WM: (15111: I3 ^dir L)
  26913. <=WM: (15107: R1 ^reward R1081)
  26914. <=WM: (15110: O2156 ^name predict-no)
  26915. <=WM: (15109: O2155 ^name predict-yes)
  26916. <=WM: (15108: R1081 ^value 1)
  26917. --- Inner Elaboration Phase, active level 1 (S1) ---
  26918. Firing prefer*rvt*predict-yes*H0
  26919. -->
  26920. Firing rl*prefer*rvt*predict-yes*H0*3
  26921. -->
  26922. (S1 ^operator O2157 = 0.)
  26923. Firing prefer*rvt*predict-no*H0
  26924. -->
  26925. Firing rl*prefer*rvt*predict-no*H0*4
  26926. -->
  26927. (S1 ^operator O2158 = 1.)
  26928. inner elaboration loop at bottom goal.
  26929. Retracting rl*prefer*rvt*predict-no*H0*4
  26930. -->
  26931. (S1 ^operator O2156 = 1.)
  26932. Retracting rl*prefer*rvt*predict-yes*H0*3
  26933. -->
  26934. (S1 ^operator O2155 = 0.)
  26935. --- END Proposal Phase ---
  26936. --- Decision Phase ---
  26937. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.842697,0.133308)
  26938. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.47867 0.140929 0.619599 -> 0.478669 0.140929 0.619598(R,m,v=1,1,0)
  26939. =>WM: (15128: S1 ^operator O2158)
  26940. 1079: O: O2158 (predict-no)
  26941. --- END Decision Phase ---
  26942. --- Application Phase ---
  26943. --- Firing Productions (PE) For State At Depth 1 ---
  26944. --- Inner Elaboration Phase, active level 1 (S1) ---
  26945. Firing apply*operator
  26946. -->
  26947. (I3 ^predict-no N1079 + :O )
  26948. Firing apply*operator*complete
  26949. -->
  26950. (I3 ^predict-yes N1078 - :O )
  26951. inner elaboration loop at bottom goal.
  26952. --- Change Working Memory (PE) ---
  26953. =>WM: (15129: I3 ^predict-no N1079)
  26954. <=WM: (15116: N1078 ^status complete)
  26955. <=WM: (15115: I3 ^predict-yes N1078)
  26956. --- Firing Productions (IE) For State At Depth 1 ---
  26957. --- Inner Elaboration Phase, active level 1 (S1) ---
  26958. Firing monitor*world
  26959. -->
  26960. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26961. --- Change Working Memory (IE) ---
  26962. --- END Application Phase ---
  26963. --- Output Phase ---
  26964. ENV: Agent did: predict-no for direction U in state State-A
  26965. In State-A moving U
  26966. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26967. predict error 0
  26968. dir: dir isR
  26969. --- END Output Phase ---
  26970. /|\--- Input Phase ---
  26971. =>WM: (15133: I2 ^dir R)
  26972. =>WM: (15132: I2 ^reward 1)
  26973. =>WM: (15131: I2 ^see 0)
  26974. =>WM: (15130: N1079 ^status complete)
  26975. <=WM: (15119: I2 ^dir U)
  26976. <=WM: (15118: I2 ^reward 1)
  26977. <=WM: (15117: I2 ^see 1)
  26978. =>WM: (15134: I2 ^level-1 L1-root)
  26979. <=WM: (15120: I2 ^level-1 L1-root)
  26980. --- END Input Phase ---
  26981. --- Proposal Phase ---
  26982. --- Inner Elaboration Phase, active level 1 (S1) ---
  26983. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  26984. -->
  26985. (S1 ^operator O2157 = 0.70622448437219)
  26986. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  26987. -->
  26988. (S1 ^operator O2158 = -0.1937987592593187)
  26989. Firing prefer*rvt*predict-no*H0*6*v1*H1
  26990. -->
  26991. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26992. -->
  26993. Firing elaborate*copy-see-to-output-link
  26994. -->
  26995. (I3 ^see 0 +)
  26996. Firing elaborate*reward*based*on*reward
  26997. -->
  26998. (R1083 ^value 1 +)
  26999. (R1 ^reward R1083 +)
  27000. Firing propose*predict-yes
  27001. -->
  27002. (O2159 ^name predict-yes +)
  27003. (S1 ^operator O2159 +)
  27004. Firing propose*predict-no
  27005. -->
  27006. (O2160 ^name predict-no +)
  27007. (S1 ^operator O2160 +)
  27008. Firing rl*prefer*rvt*predict-no*H0*6
  27009. -->
  27010. (S1 ^operator O2158 = 0.229858460707707)
  27011. Firing rl*prefer*rvt*predict-yes*H0*5
  27012. -->
  27013. (S1 ^operator O2157 = 0.2940765719273235)
  27014. Firing prefer*rvt*predict-yes*H0
  27015. -->
  27016. Firing prefer*rvt*predict-no*H0
  27017. -->
  27018. Firing elaborate*copy-dir-to-output-link
  27019. -->
  27020. (I3 ^dir R +)
  27021. inner elaboration loop at bottom goal.
  27022. Retracting elaborate*copy-see-to-output-link
  27023. -->
  27024. (I3 ^see 1 +)
  27025. Retracting propose*predict-no
  27026. -->
  27027. (O2158 ^name predict-no +)
  27028. (S1 ^operator O2158 +)
  27029. Retracting propose*predict-yes
  27030. -->
  27031. (O2157 ^name predict-yes +)
  27032. (S1 ^operator O2157 +)
  27033. Retracting elaborate*reward*based*on*reward
  27034. -->
  27035. (R1082 ^value 1 +)
  27036. (R1 ^reward R1082 +)
  27037. Retracting elaborate*copy-dir-to-output-link
  27038. -->
  27039. (I3 ^dir U +)
  27040. Retracting rl*prefer*rvt*predict-no*H0*4
  27041. -->
  27042. (S1 ^operator O2158 = 1.)
  27043. Retracting rl*prefer*rvt*predict-yes*H0*3
  27044. -->
  27045. (S1 ^operator O2157 = 0.)
  27046. =>WM: (15142: S1 ^operator O2160 +)
  27047. =>WM: (15141: S1 ^operator O2159 +)
  27048. =>WM: (15140: I3 ^dir R)
  27049. =>WM: (15139: O2160 ^name predict-no)
  27050. =>WM: (15138: O2159 ^name predict-yes)
  27051. =>WM: (15137: R1083 ^value 1)
  27052. =>WM: (15136: R1 ^reward R1083)
  27053. =>WM: (15135: I3 ^see 0)
  27054. <=WM: (15126: S1 ^operator O2157 +)
  27055. <=WM: (15127: S1 ^operator O2158 +)
  27056. <=WM: (15128: S1 ^operator O2158)
  27057. <=WM: (15125: I3 ^dir U)
  27058. <=WM: (15121: R1 ^reward R1082)
  27059. <=WM: (15106: I3 ^see 1)
  27060. <=WM: (15124: O2158 ^name predict-no)
  27061. <=WM: (15123: O2157 ^name predict-yes)
  27062. <=WM: (15122: R1082 ^value 1)
  27063. --- Inner Elaboration Phase, active level 1 (S1) ---
  27064. Firing prefer*rvt*predict-yes*H0
  27065. -->
  27066. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  27067. -->
  27068. (S1 ^operator O2159 = 0.70622448437219)
  27069. Firing rl*prefer*rvt*predict-yes*H0*5
  27070. -->
  27071. (S1 ^operator O2159 = 0.2940765719273235)
  27072. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27073. -->
  27074. Firing prefer*rvt*predict-no*H0
  27075. -->
  27076. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  27077. -->
  27078. (S1 ^operator O2160 = -0.1937987592593187)
  27079. Firing rl*prefer*rvt*predict-no*H0*6
  27080. -->
  27081. (S1 ^operator O2160 = 0.229858460707707)
  27082. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27083. -->
  27084. inner elaboration loop at bottom goal.
  27085. Retracting rl*prefer*rvt*predict-no*H0*6
  27086. -->
  27087. (S1 ^operator O2158 = 0.229858460707707)
  27088. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  27089. -->
  27090. (S1 ^operator O2158 = -0.1937987592593187)
  27091. Retracting rl*prefer*rvt*predict-yes*H0*5
  27092. -->
  27093. (S1 ^operator O2157 = 0.2940765719273235)
  27094. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  27095. -->
  27096. (S1 ^operator O2157 = 0.70622448437219)
  27097. --- END Proposal Phase ---
  27098. --- Decision Phase ---
  27099. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27100. =>WM: (15143: S1 ^operator O2159)
  27101. 1080: O: O2159 (predict-yes)
  27102. --- END Decision Phase ---
  27103. --- Application Phase ---
  27104. --- Firing Productions (PE) For State At Depth 1 ---
  27105. --- Inner Elaboration Phase, active level 1 (S1) ---
  27106. Firing apply*operator
  27107. -->
  27108. (I3 ^predict-yes N1080 + :O )
  27109. Firing apply*operator*complete
  27110. -->
  27111. (I3 ^predict-no N1079 - :O )
  27112. inner elaboration loop at bottom goal.
  27113. --- Change Working Memory (PE) ---
  27114. =>WM: (15144: I3 ^predict-yes N1080)
  27115. <=WM: (15130: N1079 ^status complete)
  27116. <=WM: (15129: I3 ^predict-no N1079)
  27117. --- Firing Productions (IE) For State At Depth 1 ---
  27118. --- Inner Elaboration Phase, active level 1 (S1) ---
  27119. Firing monitor*world
  27120. -->
  27121. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  27122. --- Change Working Memory (IE) ---
  27123. --- END Application Phase ---
  27124. --- Output Phase ---
  27125. ENV: Agent did: predict-yes for direction R in state State-A
  27126. In State-A moving R
  27127. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  27128. predict error 0
  27129. dir: dir isU
  27130. --- END Output Phase ---
  27131. -/|--- Input Phase ---
  27132. =>WM: (15148: I2 ^dir U)
  27133. =>WM: (15147: I2 ^reward 1)
  27134. =>WM: (15146: I2 ^see 1)
  27135. =>WM: (15145: N1080 ^status complete)
  27136. <=WM: (15133: I2 ^dir R)
  27137. <=WM: (15132: I2 ^reward 1)
  27138. <=WM: (15131: I2 ^see 0)
  27139. =>WM: (15149: I2 ^level-1 R1-root)
  27140. <=WM: (15134: I2 ^level-1 L1-root)
  27141. --- END Input Phase ---
  27142. --- Proposal Phase ---
  27143. --- Inner Elaboration Phase, active level 1 (S1) ---
  27144. Firing elaborate*copy-see-to-output-link
  27145. -->
  27146. (I3 ^see 1 +)
  27147. Firing elaborate*reward*based*on*reward
  27148. -->
  27149. (R1084 ^value 1 +)
  27150. (R1 ^reward R1084 +)
  27151. Firing propose*predict-yes
  27152. -->
  27153. (O2161 ^name predict-yes +)
  27154. (S1 ^operator O2161 +)
  27155. Firing propose*predict-no
  27156. -->
  27157. (O2162 ^name predict-no +)
  27158. (S1 ^operator O2162 +)
  27159. Firing rl*prefer*rvt*predict-no*H0*4
  27160. -->
  27161. (S1 ^operator O2160 = 1.)
  27162. Firing rl*prefer*rvt*predict-yes*H0*3
  27163. -->
  27164. (S1 ^operator O2159 = 0.)
  27165. Firing prefer*rvt*predict-yes*H0
  27166. -->
  27167. Firing prefer*rvt*predict-no*H0
  27168. -->
  27169. Firing elaborate*copy-dir-to-output-link
  27170. -->
  27171. (I3 ^dir U +)
  27172. inner elaboration loop at bottom goal.
  27173. Retracting elaborate*copy-see-to-output-link
  27174. -->
  27175. (I3 ^see 0 +)
  27176. Retracting propose*predict-no
  27177. -->
  27178. (O2160 ^name predict-no +)
  27179. (S1 ^operator O2160 +)
  27180. Retracting propose*predict-yes
  27181. -->
  27182. (O2159 ^name predict-yes +)
  27183. (S1 ^operator O2159 +)
  27184. Retracting elaborate*reward*based*on*reward
  27185. -->
  27186. (R1083 ^value 1 +)
  27187. (R1 ^reward R1083 +)
  27188. Retracting elaborate*copy-dir-to-output-link
  27189. -->
  27190. (I3 ^dir R +)
  27191. Retracting rl*prefer*rvt*predict-no*H0*6
  27192. -->
  27193. (S1 ^operator O2160 = 0.229858460707707)
  27194. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  27195. -->
  27196. (S1 ^operator O2160 = -0.1937987592593187)
  27197. Retracting rl*prefer*rvt*predict-yes*H0*5
  27198. -->
  27199. (S1 ^operator O2159 = 0.2940765719273235)
  27200. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  27201. -->
  27202. (S1 ^operator O2159 = 0.70622448437219)
  27203. =>WM: (15157: S1 ^operator O2162 +)
  27204. =>WM: (15156: S1 ^operator O2161 +)
  27205. =>WM: (15155: I3 ^dir U)
  27206. =>WM: (15154: O2162 ^name predict-no)
  27207. =>WM: (15153: O2161 ^name predict-yes)
  27208. =>WM: (15152: R1084 ^value 1)
  27209. =>WM: (15151: R1 ^reward R1084)
  27210. =>WM: (15150: I3 ^see 1)
  27211. <=WM: (15141: S1 ^operator O2159 +)
  27212. <=WM: (15143: S1 ^operator O2159)
  27213. <=WM: (15142: S1 ^operator O2160 +)
  27214. <=WM: (15140: I3 ^dir R)
  27215. <=WM: (15136: R1 ^reward R1083)
  27216. <=WM: (15135: I3 ^see 0)
  27217. <=WM: (15139: O2160 ^name predict-no)
  27218. <=WM: (15138: O2159 ^name predict-yes)
  27219. <=WM: (15137: R1083 ^value 1)
  27220. --- Inner Elaboration Phase, active level 1 (S1) ---
  27221. Firing prefer*rvt*predict-yes*H0
  27222. -->
  27223. Firing rl*prefer*rvt*predict-yes*H0*3
  27224. -->
  27225. (S1 ^operator O2161 = 0.)
  27226. Firing prefer*rvt*predict-no*H0
  27227. -->
  27228. Firing rl*prefer*rvt*predict-no*H0*4
  27229. -->
  27230. (S1 ^operator O2162 = 1.)
  27231. inner elaboration loop at bottom goal.
  27232. Retracting rl*prefer*rvt*predict-no*H0*4
  27233. -->
  27234. (S1 ^operator O2160 = 1.)
  27235. Retracting rl*prefer*rvt*predict-yes*H0*3
  27236. -->
  27237. (S1 ^operator O2159 = 0.)
  27238. --- END Proposal Phase ---
  27239. --- Decision Phase ---
  27240. RL update rl*prefer*rvt*predict-yes*H0*5 0.501143 -0.207067 0.294077 -> 0.501121 -0.207069 0.294052(R,m,v=1,0.857988,0.12257)
  27241. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499129 0.207096 0.706224 -> 0.499103 0.207093 0.706196(R,m,v=1,1,0)
  27242. =>WM: (15158: S1 ^operator O2162)
  27243. 1081: O: O2162 (predict-no)
  27244. --- END Decision Phase ---
  27245. --- Application Phase ---
  27246. --- Firing Productions (PE) For State At Depth 1 ---
  27247. --- Inner Elaboration Phase, active level 1 (S1) ---
  27248. Firing apply*operator
  27249. -->
  27250. (I3 ^predict-no N1081 + :O )
  27251. Firing apply*operator*complete
  27252. -->
  27253. (I3 ^predict-yes N1080 - :O )
  27254. inner elaboration loop at bottom goal.
  27255. --- Change Working Memory (PE) ---
  27256. =>WM: (15159: I3 ^predict-no N1081)
  27257. <=WM: (15145: N1080 ^status complete)
  27258. <=WM: (15144: I3 ^predict-yes N1080)
  27259. --- Firing Productions (IE) For State At Depth 1 ---
  27260. --- Inner Elaboration Phase, active level 1 (S1) ---
  27261. Firing monitor*world
  27262. -->
  27263. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27264. --- Change Working Memory (IE) ---
  27265. --- END Application Phase ---
  27266. --- Output Phase ---
  27267. ENV: Agent did: predict-no for direction U in state State-B
  27268. In State-B moving U
  27269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27270. predict error 0
  27271. dir: dir isR
  27272. --- END Output Phase ---
  27273. \--- Input Phase ---
  27274. =>WM: (15163: I2 ^dir R)
  27275. =>WM: (15162: I2 ^reward 1)
  27276. =>WM: (15161: I2 ^see 0)
  27277. =>WM: (15160: N1081 ^status complete)
  27278. <=WM: (15148: I2 ^dir U)
  27279. <=WM: (15147: I2 ^reward 1)
  27280. <=WM: (15146: I2 ^see 1)
  27281. =>WM: (15164: I2 ^level-1 R1-root)
  27282. <=WM: (15149: I2 ^level-1 R1-root)
  27283. --- END Input Phase ---
  27284. --- Proposal Phase ---
  27285. --- Inner Elaboration Phase, active level 1 (S1) ---
  27286. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27287. -->
  27288. (S1 ^operator O2161 = -0.252585164213872)
  27289. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  27290. -->
  27291. (S1 ^operator O2162 = 0.7701594485713136)
  27292. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27293. -->
  27294. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27295. -->
  27296. Firing elaborate*copy-see-to-output-link
  27297. -->
  27298. (I3 ^see 0 +)
  27299. Firing elaborate*reward*based*on*reward
  27300. -->
  27301. (R1085 ^value 1 +)
  27302. (R1 ^reward R1085 +)
  27303. Firing propose*predict-yes
  27304. -->
  27305. (O2163 ^name predict-yes +)
  27306. (S1 ^operator O2163 +)
  27307. Firing propose*predict-no
  27308. -->
  27309. (O2164 ^name predict-no +)
  27310. (S1 ^operator O2164 +)
  27311. Firing rl*prefer*rvt*predict-no*H0*6
  27312. -->
  27313. (S1 ^operator O2162 = 0.229858460707707)
  27314. Firing rl*prefer*rvt*predict-yes*H0*5
  27315. -->
  27316. (S1 ^operator O2161 = 0.2940520155428289)
  27317. Firing prefer*rvt*predict-yes*H0
  27318. -->
  27319. Firing prefer*rvt*predict-no*H0
  27320. -->
  27321. Firing elaborate*copy-dir-to-output-link
  27322. -->
  27323. (I3 ^dir R +)
  27324. inner elaboration loop at bottom goal.
  27325. Retracting elaborate*copy-see-to-output-link
  27326. -->
  27327. (I3 ^see 1 +)
  27328. Retracting propose*predict-no
  27329. -->
  27330. (O2162 ^name predict-no +)
  27331. (S1 ^operator O2162 +)
  27332. Retracting propose*predict-yes
  27333. -->
  27334. (O2161 ^name predict-yes +)
  27335. (S1 ^operator O2161 +)
  27336. Retracting elaborate*reward*based*on*reward
  27337. -->
  27338. (R1084 ^value 1 +)
  27339. (R1 ^reward R1084 +)
  27340. Retracting elaborate*copy-dir-to-output-link
  27341. -->
  27342. (I3 ^dir U +)
  27343. Retracting rl*prefer*rvt*predict-no*H0*4
  27344. -->
  27345. (S1 ^operator O2162 = 1.)
  27346. Retracting rl*prefer*rvt*predict-yes*H0*3
  27347. -->
  27348. (S1 ^operator O2161 = 0.)
  27349. =>WM: (15172: S1 ^operator O2164 +)
  27350. =>WM: (15171: S1 ^operator O2163 +)
  27351. =>WM: (15170: I3 ^dir R)
  27352. =>WM: (15169: O2164 ^name predict-no)
  27353. =>WM: (15168: O2163 ^name predict-yes)
  27354. =>WM: (15167: R1085 ^value 1)
  27355. =>WM: (15166: R1 ^reward R1085)
  27356. =>WM: (15165: I3 ^see 0)
  27357. <=WM: (15156: S1 ^operator O2161 +)
  27358. <=WM: (15157: S1 ^operator O2162 +)
  27359. <=WM: (15158: S1 ^operator O2162)
  27360. <=WM: (15155: I3 ^dir U)
  27361. <=WM: (15151: R1 ^reward R1084)
  27362. <=WM: (15150: I3 ^see 1)
  27363. <=WM: (15154: O2162 ^name predict-no)
  27364. <=WM: (15153: O2161 ^name predict-yes)
  27365. <=WM: (15152: R1084 ^value 1)
  27366. --- Inner Elaboration Phase, active level 1 (S1) ---
  27367. Firing prefer*rvt*predict-yes*H0
  27368. -->
  27369. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27370. -->
  27371. (S1 ^operator O2163 = -0.252585164213872)
  27372. Firing rl*prefer*rvt*predict-yes*H0*5
  27373. -->
  27374. (S1 ^operator O2163 = 0.2940520155428289)
  27375. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27376. -->
  27377. Firing prefer*rvt*predict-no*H0
  27378. -->
  27379. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  27380. -->
  27381. (S1 ^operator O2164 = 0.7701594485713136)
  27382. Firing rl*prefer*rvt*predict-no*H0*6
  27383. -->
  27384. (S1 ^operator O2164 = 0.229858460707707)
  27385. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27386. -->
  27387. inner elaboration loop at bottom goal.
  27388. Retracting rl*prefer*rvt*predict-no*H0*6
  27389. -->
  27390. (S1 ^operator O2162 = 0.229858460707707)
  27391. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  27392. -->
  27393. (S1 ^operator O2162 = 0.7701594485713136)
  27394. Retracting rl*prefer*rvt*predict-yes*H0*5
  27395. -->
  27396. (S1 ^operator O2161 = 0.2940520155428289)
  27397. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27398. -->
  27399. (S1 ^operator O2161 = -0.252585164213872)
  27400. --- END Proposal Phase ---
  27401. --- Decision Phase ---
  27402. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27403. =>WM: (15173: S1 ^operator O2164)
  27404. 1082: O: O2164 (predict-no)
  27405. --- END Decision Phase ---
  27406. --- Application Phase ---
  27407. --- Firing Productions (PE) For State At Depth 1 ---
  27408. --- Inner Elaboration Phase, active level 1 (S1) ---
  27409. Firing apply*operator
  27410. -->
  27411. (I3 ^predict-no N1082 + :O )
  27412. Firing apply*operator*complete
  27413. -->
  27414. (I3 ^predict-no N1081 - :O )
  27415. inner elaboration loop at bottom goal.
  27416. --- Change Working Memory (PE) ---
  27417. =>WM: (15174: I3 ^predict-no N1082)
  27418. <=WM: (15160: N1081 ^status complete)
  27419. <=WM: (15159: I3 ^predict-no N1081)
  27420. --- Firing Productions (IE) For State At Depth 1 ---
  27421. --- Inner Elaboration Phase, active level 1 (S1) ---
  27422. Firing monitor*world
  27423. -->
  27424. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27425. --- Change Working Memory (IE) ---
  27426. --- END Application Phase ---
  27427. --- Output Phase ---
  27428. ENV: Agent did: predict-no for direction R in state State-B
  27429. In State-B moving R
  27430. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27431. predict error 0
  27432. dir: dir isU
  27433. --- END Output Phase ---
  27434. -/|--- Input Phase ---
  27435. =>WM: (15178: I2 ^dir U)
  27436. =>WM: (15177: I2 ^reward 1)
  27437. =>WM: (15176: I2 ^see 0)
  27438. =>WM: (15175: N1082 ^status complete)
  27439. <=WM: (15163: I2 ^dir R)
  27440. <=WM: (15162: I2 ^reward 1)
  27441. <=WM: (15161: I2 ^see 0)
  27442. =>WM: (15179: I2 ^level-1 R0-root)
  27443. <=WM: (15164: I2 ^level-1 R1-root)
  27444. --- END Input Phase ---
  27445. --- Proposal Phase ---
  27446. --- Inner Elaboration Phase, active level 1 (S1) ---
  27447. Firing elaborate*copy-see-to-output-link
  27448. -->
  27449. (I3 ^see 0 +)
  27450. Firing elaborate*reward*based*on*reward
  27451. -->
  27452. (R1086 ^value 1 +)
  27453. (R1 ^reward R1086 +)
  27454. Firing propose*predict-yes
  27455. -->
  27456. (O2165 ^name predict-yes +)
  27457. (S1 ^operator O2165 +)
  27458. Firing propose*predict-no
  27459. -->
  27460. (O2166 ^name predict-no +)
  27461. (S1 ^operator O2166 +)
  27462. Firing rl*prefer*rvt*predict-no*H0*4
  27463. -->
  27464. (S1 ^operator O2164 = 1.)
  27465. Firing rl*prefer*rvt*predict-yes*H0*3
  27466. -->
  27467. (S1 ^operator O2163 = 0.)
  27468. Firing prefer*rvt*predict-yes*H0
  27469. -->
  27470. Firing prefer*rvt*predict-no*H0
  27471. -->
  27472. Firing elaborate*copy-dir-to-output-link
  27473. -->
  27474. (I3 ^dir U +)
  27475. inner elaboration loop at bottom goal.
  27476. Retracting elaborate*copy-see-to-output-link
  27477. -->
  27478. (I3 ^see 0 +)
  27479. Retracting propose*predict-no
  27480. -->
  27481. (O2164 ^name predict-no +)
  27482. (S1 ^operator O2164 +)
  27483. Retracting propose*predict-yes
  27484. -->
  27485. (O2163 ^name predict-yes +)
  27486. (S1 ^operator O2163 +)
  27487. Retracting elaborate*reward*based*on*reward
  27488. -->
  27489. (R1085 ^value 1 +)
  27490. (R1 ^reward R1085 +)
  27491. Retracting elaborate*copy-dir-to-output-link
  27492. -->
  27493. (I3 ^dir R +)
  27494. Retracting rl*prefer*rvt*predict-no*H0*6
  27495. -->
  27496. (S1 ^operator O2164 = 0.229858460707707)
  27497. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  27498. -->
  27499. (S1 ^operator O2164 = 0.7701594485713136)
  27500. Retracting rl*prefer*rvt*predict-yes*H0*5
  27501. -->
  27502. (S1 ^operator O2163 = 0.2940520155428289)
  27503. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27504. -->
  27505. (S1 ^operator O2163 = -0.252585164213872)
  27506. =>WM: (15186: S1 ^operator O2166 +)
  27507. =>WM: (15185: S1 ^operator O2165 +)
  27508. =>WM: (15184: I3 ^dir U)
  27509. =>WM: (15183: O2166 ^name predict-no)
  27510. =>WM: (15182: O2165 ^name predict-yes)
  27511. =>WM: (15181: R1086 ^value 1)
  27512. =>WM: (15180: R1 ^reward R1086)
  27513. <=WM: (15171: S1 ^operator O2163 +)
  27514. <=WM: (15172: S1 ^operator O2164 +)
  27515. <=WM: (15173: S1 ^operator O2164)
  27516. <=WM: (15170: I3 ^dir R)
  27517. <=WM: (15166: R1 ^reward R1085)
  27518. <=WM: (15169: O2164 ^name predict-no)
  27519. <=WM: (15168: O2163 ^name predict-yes)
  27520. <=WM: (15167: R1085 ^value 1)
  27521. --- Inner Elaboration Phase, active level 1 (S1) ---
  27522. Firing prefer*rvt*predict-yes*H0
  27523. -->
  27524. Firing rl*prefer*rvt*predict-yes*H0*3
  27525. -->
  27526. (S1 ^operator O2165 = 0.)
  27527. Firing prefer*rvt*predict-no*H0
  27528. -->
  27529. Firing rl*prefer*rvt*predict-no*H0*4
  27530. -->
  27531. (S1 ^operator O2166 = 1.)
  27532. inner elaboration loop at bottom goal.
  27533. Retracting rl*prefer*rvt*predict-no*H0*4
  27534. -->
  27535. (S1 ^operator O2164 = 1.)
  27536. Retracting rl*prefer*rvt*predict-yes*H0*3
  27537. -->
  27538. (S1 ^operator O2163 = 0.)
  27539. --- END Proposal Phase ---
  27540. --- Decision Phase ---
  27541. RL update rl*prefer*rvt*predict-no*H0*6 0.611911 -0.382052 0.229858 -> 0.61191 -0.382053 0.229857(R,m,v=1,0.855615,0.124202)
  27542. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388104 0.382056 0.770159 -> 0.388102 0.382055 0.770158(R,m,v=1,1,0)
  27543. =>WM: (15187: S1 ^operator O2166)
  27544. 1083: O: O2166 (predict-no)
  27545. --- END Decision Phase ---
  27546. --- Application Phase ---
  27547. --- Firing Productions (PE) For State At Depth 1 ---
  27548. --- Inner Elaboration Phase, active level 1 (S1) ---
  27549. Firing apply*operator
  27550. -->
  27551. (I3 ^predict-no N1083 + :O )
  27552. Firing apply*operator*complete
  27553. -->
  27554. (I3 ^predict-no N1082 - :O )
  27555. inner elaboration loop at bottom goal.
  27556. --- Change Working Memory (PE) ---
  27557. =>WM: (15188: I3 ^predict-no N1083)
  27558. <=WM: (15175: N1082 ^status complete)
  27559. <=WM: (15174: I3 ^predict-no N1082)
  27560. --- Firing Productions (IE) For State At Depth 1 ---
  27561. --- Inner Elaboration Phase, active level 1 (S1) ---
  27562. Firing monitor*world
  27563. -->
  27564. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27565. --- Change Working Memory (IE) ---
  27566. --- END Application Phase ---
  27567. --- Output Phase ---
  27568. ENV: Agent did: predict-no for direction U in state State-B
  27569. In State-B moving U
  27570. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27571. predict error 0
  27572. dir: dir isR
  27573. --- END Output Phase ---
  27574. \-/--- Input Phase ---
  27575. =>WM: (15192: I2 ^dir R)
  27576. =>WM: (15191: I2 ^reward 1)
  27577. =>WM: (15190: I2 ^see 0)
  27578. =>WM: (15189: N1083 ^status complete)
  27579. <=WM: (15178: I2 ^dir U)
  27580. <=WM: (15177: I2 ^reward 1)
  27581. <=WM: (15176: I2 ^see 0)
  27582. =>WM: (15193: I2 ^level-1 R0-root)
  27583. <=WM: (15179: I2 ^level-1 R0-root)
  27584. --- END Input Phase ---
  27585. --- Proposal Phase ---
  27586. --- Inner Elaboration Phase, active level 1 (S1) ---
  27587. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27588. -->
  27589. (S1 ^operator O2165 = -0.1254042659579056)
  27590. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27591. -->
  27592. (S1 ^operator O2166 = 0.7701105848453105)
  27593. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27594. -->
  27595. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27596. -->
  27597. Firing elaborate*copy-see-to-output-link
  27598. -->
  27599. (I3 ^see 0 +)
  27600. Firing elaborate*reward*based*on*reward
  27601. -->
  27602. (R1087 ^value 1 +)
  27603. (R1 ^reward R1087 +)
  27604. Firing propose*predict-yes
  27605. -->
  27606. (O2167 ^name predict-yes +)
  27607. (S1 ^operator O2167 +)
  27608. Firing propose*predict-no
  27609. -->
  27610. (O2168 ^name predict-no +)
  27611. (S1 ^operator O2168 +)
  27612. Firing rl*prefer*rvt*predict-no*H0*6
  27613. -->
  27614. (S1 ^operator O2166 = 0.2298570236216184)
  27615. Firing rl*prefer*rvt*predict-yes*H0*5
  27616. -->
  27617. (S1 ^operator O2165 = 0.2940520155428289)
  27618. Firing prefer*rvt*predict-yes*H0
  27619. -->
  27620. Firing prefer*rvt*predict-no*H0
  27621. -->
  27622. Firing elaborate*copy-dir-to-output-link
  27623. -->
  27624. (I3 ^dir R +)
  27625. inner elaboration loop at bottom goal.
  27626. Retracting elaborate*copy-see-to-output-link
  27627. -->
  27628. (I3 ^see 0 +)
  27629. Retracting propose*predict-no
  27630. -->
  27631. (O2166 ^name predict-no +)
  27632. (S1 ^operator O2166 +)
  27633. Retracting propose*predict-yes
  27634. -->
  27635. (O2165 ^name predict-yes +)
  27636. (S1 ^operator O2165 +)
  27637. Retracting elaborate*reward*based*on*reward
  27638. -->
  27639. (R1086 ^value 1 +)
  27640. (R1 ^reward R1086 +)
  27641. Retracting elaborate*copy-dir-to-output-link
  27642. -->
  27643. (I3 ^dir U +)
  27644. Retracting rl*prefer*rvt*predict-no*H0*4
  27645. -->
  27646. (S1 ^operator O2166 = 1.)
  27647. Retracting rl*prefer*rvt*predict-yes*H0*3
  27648. -->
  27649. (S1 ^operator O2165 = 0.)
  27650. =>WM: (15200: S1 ^operator O2168 +)
  27651. =>WM: (15199: S1 ^operator O2167 +)
  27652. =>WM: (15198: I3 ^dir R)
  27653. =>WM: (15197: O2168 ^name predict-no)
  27654. =>WM: (15196: O2167 ^name predict-yes)
  27655. =>WM: (15195: R1087 ^value 1)
  27656. =>WM: (15194: R1 ^reward R1087)
  27657. <=WM: (15185: S1 ^operator O2165 +)
  27658. <=WM: (15186: S1 ^operator O2166 +)
  27659. <=WM: (15187: S1 ^operator O2166)
  27660. <=WM: (15184: I3 ^dir U)
  27661. <=WM: (15180: R1 ^reward R1086)
  27662. <=WM: (15183: O2166 ^name predict-no)
  27663. <=WM: (15182: O2165 ^name predict-yes)
  27664. <=WM: (15181: R1086 ^value 1)
  27665. --- Inner Elaboration Phase, active level 1 (S1) ---
  27666. Firing prefer*rvt*predict-yes*H0
  27667. -->
  27668. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27669. -->
  27670. (S1 ^operator O2167 = -0.1254042659579056)
  27671. Firing rl*prefer*rvt*predict-yes*H0*5
  27672. -->
  27673. (S1 ^operator O2167 = 0.2940520155428289)
  27674. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27675. -->
  27676. Firing prefer*rvt*predict-no*H0
  27677. -->
  27678. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27679. -->
  27680. (S1 ^operator O2168 = 0.7701105848453105)
  27681. Firing rl*prefer*rvt*predict-no*H0*6
  27682. -->
  27683. (S1 ^operator O2168 = 0.2298570236216184)
  27684. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27685. -->
  27686. inner elaboration loop at bottom goal.
  27687. Retracting rl*prefer*rvt*predict-no*H0*6
  27688. -->
  27689. (S1 ^operator O2166 = 0.2298570236216184)
  27690. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27691. -->
  27692. (S1 ^operator O2166 = 0.7701105848453105)
  27693. Retracting rl*prefer*rvt*predict-yes*H0*5
  27694. -->
  27695. (S1 ^operator O2165 = 0.2940520155428289)
  27696. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27697. -->
  27698. (S1 ^operator O2165 = -0.1254042659579056)
  27699. --- END Proposal Phase ---
  27700. --- Decision Phase ---
  27701. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27702. =>WM: (15201: S1 ^operator O2168)
  27703. 1084: O: O2168 (predict-no)
  27704. --- END Decision Phase ---
  27705. --- Application Phase ---
  27706. --- Firing Productions (PE) For State At Depth 1 ---
  27707. --- Inner Elaboration Phase, active level 1 (S1) ---
  27708. Firing apply*operator
  27709. -->
  27710. (I3 ^predict-no N1084 + :O )
  27711. Firing apply*operator*complete
  27712. -->
  27713. (I3 ^predict-no N1083 - :O )
  27714. inner elaboration loop at bottom goal.
  27715. --- Change Working Memory (PE) ---
  27716. =>WM: (15202: I3 ^predict-no N1084)
  27717. <=WM: (15189: N1083 ^status complete)
  27718. <=WM: (15188: I3 ^predict-no N1083)
  27719. --- Firing Productions (IE) For State At Depth 1 ---
  27720. --- Inner Elaboration Phase, active level 1 (S1) ---
  27721. Firing monitor*world
  27722. -->
  27723. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27724. --- Change Working Memory (IE) ---
  27725. --- END Application Phase ---
  27726. --- Output Phase ---
  27727. ENV: Agent did: predict-no for direction R in state State-B
  27728. In State-B moving R
  27729. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27730. predict error 0
  27731. dir: dir isU
  27732. --- END Output Phase ---
  27733. |\---- Input Phase ---
  27734. =>WM: (15206: I2 ^dir U)
  27735. =>WM: (15205: I2 ^reward 1)
  27736. =>WM: (15204: I2 ^see 0)
  27737. =>WM: (15203: N1084 ^status complete)
  27738. <=WM: (15192: I2 ^dir R)
  27739. <=WM: (15191: I2 ^reward 1)
  27740. <=WM: (15190: I2 ^see 0)
  27741. =>WM: (15207: I2 ^level-1 R0-root)
  27742. <=WM: (15193: I2 ^level-1 R0-root)
  27743. --- END Input Phase ---
  27744. --- Proposal Phase ---
  27745. --- Inner Elaboration Phase, active level 1 (S1) ---
  27746. Firing elaborate*copy-see-to-output-link
  27747. -->
  27748. (I3 ^see 0 +)
  27749. Firing elaborate*reward*based*on*reward
  27750. -->
  27751. (R1088 ^value 1 +)
  27752. (R1 ^reward R1088 +)
  27753. Firing propose*predict-yes
  27754. -->
  27755. (O2169 ^name predict-yes +)
  27756. (S1 ^operator O2169 +)
  27757. Firing propose*predict-no
  27758. -->
  27759. (O2170 ^name predict-no +)
  27760. (S1 ^operator O2170 +)
  27761. Firing rl*prefer*rvt*predict-no*H0*4
  27762. -->
  27763. (S1 ^operator O2168 = 1.)
  27764. Firing rl*prefer*rvt*predict-yes*H0*3
  27765. -->
  27766. (S1 ^operator O2167 = 0.)
  27767. Firing prefer*rvt*predict-yes*H0
  27768. -->
  27769. Firing prefer*rvt*predict-no*H0
  27770. -->
  27771. Firing elaborate*copy-dir-to-output-link
  27772. -->
  27773. (I3 ^dir U +)
  27774. inner elaboration loop at bottom goal.
  27775. Retracting elaborate*copy-see-to-output-link
  27776. -->
  27777. (I3 ^see 0 +)
  27778. Retracting propose*predict-no
  27779. -->
  27780. (O2168 ^name predict-no +)
  27781. (S1 ^operator O2168 +)
  27782. Retracting propose*predict-yes
  27783. -->
  27784. (O2167 ^name predict-yes +)
  27785. (S1 ^operator O2167 +)
  27786. Retracting elaborate*reward*based*on*reward
  27787. -->
  27788. (R1087 ^value 1 +)
  27789. (R1 ^reward R1087 +)
  27790. Retracting elaborate*copy-dir-to-output-link
  27791. -->
  27792. (I3 ^dir R +)
  27793. Retracting rl*prefer*rvt*predict-no*H0*6
  27794. -->
  27795. (S1 ^operator O2168 = 0.2298570236216184)
  27796. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27797. -->
  27798. (S1 ^operator O2168 = 0.7701105848453105)
  27799. Retracting rl*prefer*rvt*predict-yes*H0*5
  27800. -->
  27801. (S1 ^operator O2167 = 0.2940520155428289)
  27802. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27803. -->
  27804. (S1 ^operator O2167 = -0.1254042659579056)
  27805. =>WM: (15214: S1 ^operator O2170 +)
  27806. =>WM: (15213: S1 ^operator O2169 +)
  27807. =>WM: (15212: I3 ^dir U)
  27808. =>WM: (15211: O2170 ^name predict-no)
  27809. =>WM: (15210: O2169 ^name predict-yes)
  27810. =>WM: (15209: R1088 ^value 1)
  27811. =>WM: (15208: R1 ^reward R1088)
  27812. <=WM: (15199: S1 ^operator O2167 +)
  27813. <=WM: (15200: S1 ^operator O2168 +)
  27814. <=WM: (15201: S1 ^operator O2168)
  27815. <=WM: (15198: I3 ^dir R)
  27816. <=WM: (15194: R1 ^reward R1087)
  27817. <=WM: (15197: O2168 ^name predict-no)
  27818. <=WM: (15196: O2167 ^name predict-yes)
  27819. <=WM: (15195: R1087 ^value 1)
  27820. --- Inner Elaboration Phase, active level 1 (S1) ---
  27821. Firing prefer*rvt*predict-yes*H0
  27822. -->
  27823. Firing rl*prefer*rvt*predict-yes*H0*3
  27824. -->
  27825. (S1 ^operator O2169 = 0.)
  27826. Firing prefer*rvt*predict-no*H0
  27827. -->
  27828. Firing rl*prefer*rvt*predict-no*H0*4
  27829. -->
  27830. (S1 ^operator O2170 = 1.)
  27831. inner elaboration loop at bottom goal.
  27832. Retracting rl*prefer*rvt*predict-no*H0*4
  27833. -->
  27834. (S1 ^operator O2168 = 1.)
  27835. Retracting rl*prefer*rvt*predict-yes*H0*3
  27836. -->
  27837. (S1 ^operator O2167 = 0.)
  27838. --- END Proposal Phase ---
  27839. --- Decision Phase ---
  27840. RL update rl*prefer*rvt*predict-no*H0*6 0.61191 -0.382053 0.229857 -> 0.611912 -0.382052 0.22986(R,m,v=1,0.856383,0.123649)
  27841. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388064 0.382047 0.770111 -> 0.388066 0.382047 0.770114(R,m,v=1,1,0)
  27842. =>WM: (15215: S1 ^operator O2170)
  27843. 1085: O: O2170 (predict-no)
  27844. --- END Decision Phase ---
  27845. --- Application Phase ---
  27846. --- Firing Productions (PE) For State At Depth 1 ---
  27847. --- Inner Elaboration Phase, active level 1 (S1) ---
  27848. Firing apply*operator
  27849. -->
  27850. (I3 ^predict-no N1085 + :O )
  27851. Firing apply*operator*complete
  27852. -->
  27853. (I3 ^predict-no N1084 - :O )
  27854. inner elaboration loop at bottom goal.
  27855. --- Change Working Memory (PE) ---
  27856. =>WM: (15216: I3 ^predict-no N1085)
  27857. <=WM: (15203: N1084 ^status complete)
  27858. <=WM: (15202: I3 ^predict-no N1084)
  27859. --- Firing Productions (IE) For State At Depth 1 ---
  27860. --- Inner Elaboration Phase, active level 1 (S1) ---
  27861. Firing monitor*world
  27862. -->
  27863. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27864. --- Change Working Memory (IE) ---
  27865. --- END Application Phase ---
  27866. --- Output Phase ---
  27867. ENV: Agent did: predict-no for direction U in state State-B
  27868. In State-B moving U
  27869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27870. predict error 0
  27871. dir: dir isR
  27872. --- END Output Phase ---
  27873. /|\--- Input Phase ---
  27874. =>WM: (15220: I2 ^dir R)
  27875. =>WM: (15219: I2 ^reward 1)
  27876. =>WM: (15218: I2 ^see 0)
  27877. =>WM: (15217: N1085 ^status complete)
  27878. <=WM: (15206: I2 ^dir U)
  27879. <=WM: (15205: I2 ^reward 1)
  27880. <=WM: (15204: I2 ^see 0)
  27881. =>WM: (15221: I2 ^level-1 R0-root)
  27882. <=WM: (15207: I2 ^level-1 R0-root)
  27883. --- END Input Phase ---
  27884. --- Proposal Phase ---
  27885. --- Inner Elaboration Phase, active level 1 (S1) ---
  27886. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27887. -->
  27888. (S1 ^operator O2169 = -0.1254042659579056)
  27889. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27890. -->
  27891. (S1 ^operator O2170 = 0.7701135541770483)
  27892. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27893. -->
  27894. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27895. -->
  27896. Firing elaborate*copy-see-to-output-link
  27897. -->
  27898. (I3 ^see 0 +)
  27899. Firing elaborate*reward*based*on*reward
  27900. -->
  27901. (R1089 ^value 1 +)
  27902. (R1 ^reward R1089 +)
  27903. Firing propose*predict-yes
  27904. -->
  27905. (O2171 ^name predict-yes +)
  27906. (S1 ^operator O2171 +)
  27907. Firing propose*predict-no
  27908. -->
  27909. (O2172 ^name predict-no +)
  27910. (S1 ^operator O2172 +)
  27911. Firing rl*prefer*rvt*predict-no*H0*6
  27912. -->
  27913. (S1 ^operator O2170 = 0.2298596205778046)
  27914. Firing rl*prefer*rvt*predict-yes*H0*5
  27915. -->
  27916. (S1 ^operator O2169 = 0.2940520155428289)
  27917. Firing prefer*rvt*predict-yes*H0
  27918. -->
  27919. Firing prefer*rvt*predict-no*H0
  27920. -->
  27921. Firing elaborate*copy-dir-to-output-link
  27922. -->
  27923. (I3 ^dir R +)
  27924. inner elaboration loop at bottom goal.
  27925. Retracting elaborate*copy-see-to-output-link
  27926. -->
  27927. (I3 ^see 0 +)
  27928. Retracting propose*predict-no
  27929. -->
  27930. (O2170 ^name predict-no +)
  27931. (S1 ^operator O2170 +)
  27932. Retracting propose*predict-yes
  27933. -->
  27934. (O2169 ^name predict-yes +)
  27935. (S1 ^operator O2169 +)
  27936. Retracting elaborate*reward*based*on*reward
  27937. -->
  27938. (R1088 ^value 1 +)
  27939. (R1 ^reward R1088 +)
  27940. Retracting elaborate*copy-dir-to-output-link
  27941. -->
  27942. (I3 ^dir U +)
  27943. Retracting rl*prefer*rvt*predict-no*H0*4
  27944. -->
  27945. (S1 ^operator O2170 = 1.)
  27946. Retracting rl*prefer*rvt*predict-yes*H0*3
  27947. -->
  27948. (S1 ^operator O2169 = 0.)
  27949. =>WM: (15228: S1 ^operator O2172 +)
  27950. =>WM: (15227: S1 ^operator O2171 +)
  27951. =>WM: (15226: I3 ^dir R)
  27952. =>WM: (15225: O2172 ^name predict-no)
  27953. =>WM: (15224: O2171 ^name predict-yes)
  27954. =>WM: (15223: R1089 ^value 1)
  27955. =>WM: (15222: R1 ^reward R1089)
  27956. <=WM: (15213: S1 ^operator O2169 +)
  27957. <=WM: (15214: S1 ^operator O2170 +)
  27958. <=WM: (15215: S1 ^operator O2170)
  27959. <=WM: (15212: I3 ^dir U)
  27960. <=WM: (15208: R1 ^reward R1088)
  27961. <=WM: (15211: O2170 ^name predict-no)
  27962. <=WM: (15210: O2169 ^name predict-yes)
  27963. <=WM: (15209: R1088 ^value 1)
  27964. --- Inner Elaboration Phase, active level 1 (S1) ---
  27965. Firing prefer*rvt*predict-yes*H0
  27966. -->
  27967. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27968. -->
  27969. (S1 ^operator O2171 = -0.1254042659579056)
  27970. Firing rl*prefer*rvt*predict-yes*H0*5
  27971. -->
  27972. (S1 ^operator O2171 = 0.2940520155428289)
  27973. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27974. -->
  27975. Firing prefer*rvt*predict-no*H0
  27976. -->
  27977. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27978. -->
  27979. (S1 ^operator O2172 = 0.7701135541770483)
  27980. Firing rl*prefer*rvt*predict-no*H0*6
  27981. -->
  27982. (S1 ^operator O2172 = 0.2298596205778046)
  27983. Firing prefer*rvt*predict-no*H0*6*v1*H1
  27984. -->
  27985. inner elaboration loop at bottom goal.
  27986. Retracting rl*prefer*rvt*predict-no*H0*6
  27987. -->
  27988. (S1 ^operator O2170 = 0.2298596205778046)
  27989. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  27990. -->
  27991. (S1 ^operator O2170 = 0.7701135541770483)
  27992. Retracting rl*prefer*rvt*predict-yes*H0*5
  27993. -->
  27994. (S1 ^operator O2169 = 0.2940520155428289)
  27995. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  27996. -->
  27997. (S1 ^operator O2169 = -0.1254042659579056)
  27998. --- END Proposal Phase ---
  27999. --- Decision Phase ---
  28000. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28001. =>WM: (15229: S1 ^operator O2172)
  28002. 1086: O: O2172 (predict-no)
  28003. --- END Decision Phase ---
  28004. --- Application Phase ---
  28005. --- Firing Productions (PE) For State At Depth 1 ---
  28006. --- Inner Elaboration Phase, active level 1 (S1) ---
  28007. Firing apply*operator
  28008. -->
  28009. (I3 ^predict-no N1086 + :O )
  28010. Firing apply*operator*complete
  28011. -->
  28012. (I3 ^predict-no N1085 - :O )
  28013. inner elaboration loop at bottom goal.
  28014. --- Change Working Memory (PE) ---
  28015. =>WM: (15230: I3 ^predict-no N1086)
  28016. <=WM: (15217: N1085 ^status complete)
  28017. <=WM: (15216: I3 ^predict-no N1085)
  28018. --- Firing Productions (IE) For State At Depth 1 ---
  28019. --- Inner Elaboration Phase, active level 1 (S1) ---
  28020. Firing monitor*world
  28021. -->
  28022. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28023. --- Change Working Memory (IE) ---
  28024. --- END Application Phase ---
  28025. --- Output Phase ---
  28026. ENV: Agent did: predict-no for direction R in state State-B
  28027. In State-B moving R
  28028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28029. predict error 0
  28030. dir: dir isU
  28031. --- END Output Phase ---
  28032. -/--- Input Phase ---
  28033. =>WM: (15234: I2 ^dir U)
  28034. =>WM: (15233: I2 ^reward 1)
  28035. =>WM: (15232: I2 ^see 0)
  28036. =>WM: (15231: N1086 ^status complete)
  28037. <=WM: (15220: I2 ^dir R)
  28038. <=WM: (15219: I2 ^reward 1)
  28039. <=WM: (15218: I2 ^see 0)
  28040. =>WM: (15235: I2 ^level-1 R0-root)
  28041. <=WM: (15221: I2 ^level-1 R0-root)
  28042. --- END Input Phase ---
  28043. --- Proposal Phase ---
  28044. --- Inner Elaboration Phase, active level 1 (S1) ---
  28045. Firing elaborate*copy-see-to-output-link
  28046. -->
  28047. (I3 ^see 0 +)
  28048. Firing elaborate*reward*based*on*reward
  28049. -->
  28050. (R1090 ^value 1 +)
  28051. (R1 ^reward R1090 +)
  28052. Firing propose*predict-yes
  28053. -->
  28054. (O2173 ^name predict-yes +)
  28055. (S1 ^operator O2173 +)
  28056. Firing propose*predict-no
  28057. -->
  28058. (O2174 ^name predict-no +)
  28059. (S1 ^operator O2174 +)
  28060. Firing rl*prefer*rvt*predict-no*H0*4
  28061. -->
  28062. (S1 ^operator O2172 = 1.)
  28063. Firing rl*prefer*rvt*predict-yes*H0*3
  28064. -->
  28065. (S1 ^operator O2171 = 0.)
  28066. Firing prefer*rvt*predict-yes*H0
  28067. -->
  28068. Firing prefer*rvt*predict-no*H0
  28069. -->
  28070. Firing elaborate*copy-dir-to-output-link
  28071. -->
  28072. (I3 ^dir U +)
  28073. inner elaboration loop at bottom goal.
  28074. Retracting elaborate*copy-see-to-output-link
  28075. -->
  28076. (I3 ^see 0 +)
  28077. Retracting propose*predict-no
  28078. -->
  28079. (O2172 ^name predict-no +)
  28080. (S1 ^operator O2172 +)
  28081. Retracting propose*predict-yes
  28082. -->
  28083. (O2171 ^name predict-yes +)
  28084. (S1 ^operator O2171 +)
  28085. Retracting elaborate*reward*based*on*reward
  28086. -->
  28087. (R1089 ^value 1 +)
  28088. (R1 ^reward R1089 +)
  28089. Retracting elaborate*copy-dir-to-output-link
  28090. -->
  28091. (I3 ^dir R +)
  28092. Retracting rl*prefer*rvt*predict-no*H0*6
  28093. -->
  28094. (S1 ^operator O2172 = 0.2298596205778046)
  28095. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28096. -->
  28097. (S1 ^operator O2172 = 0.7701135541770483)
  28098. Retracting rl*prefer*rvt*predict-yes*H0*5
  28099. -->
  28100. (S1 ^operator O2171 = 0.2940520155428289)
  28101. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28102. -->
  28103. (S1 ^operator O2171 = -0.1254042659579056)
  28104. =>WM: (15242: S1 ^operator O2174 +)
  28105. =>WM: (15241: S1 ^operator O2173 +)
  28106. =>WM: (15240: I3 ^dir U)
  28107. =>WM: (15239: O2174 ^name predict-no)
  28108. =>WM: (15238: O2173 ^name predict-yes)
  28109. =>WM: (15237: R1090 ^value 1)
  28110. =>WM: (15236: R1 ^reward R1090)
  28111. <=WM: (15227: S1 ^operator O2171 +)
  28112. <=WM: (15228: S1 ^operator O2172 +)
  28113. <=WM: (15229: S1 ^operator O2172)
  28114. <=WM: (15226: I3 ^dir R)
  28115. <=WM: (15222: R1 ^reward R1089)
  28116. <=WM: (15225: O2172 ^name predict-no)
  28117. <=WM: (15224: O2171 ^name predict-yes)
  28118. <=WM: (15223: R1089 ^value 1)
  28119. --- Inner Elaboration Phase, active level 1 (S1) ---
  28120. Firing prefer*rvt*predict-yes*H0
  28121. -->
  28122. Firing rl*prefer*rvt*predict-yes*H0*3
  28123. -->
  28124. (S1 ^operator O2173 = 0.)
  28125. Firing prefer*rvt*predict-no*H0
  28126. -->
  28127. Firing rl*prefer*rvt*predict-no*H0*4
  28128. -->
  28129. (S1 ^operator O2174 = 1.)
  28130. inner elaboration loop at bottom goal.
  28131. Retracting rl*prefer*rvt*predict-no*H0*4
  28132. -->
  28133. (S1 ^operator O2172 = 1.)
  28134. Retracting rl*prefer*rvt*predict-yes*H0*3
  28135. -->
  28136. (S1 ^operator O2171 = 0.)
  28137. --- END Proposal Phase ---
  28138. --- Decision Phase ---
  28139. RL update rl*prefer*rvt*predict-no*H0*6 0.611912 -0.382052 0.22986 -> 0.611914 -0.382052 0.229862(R,m,v=1,0.857143,0.1231)
  28140. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388066 0.382047 0.770114 -> 0.388068 0.382048 0.770116(R,m,v=1,1,0)
  28141. =>WM: (15243: S1 ^operator O2174)
  28142. 1087: O: O2174 (predict-no)
  28143. --- END Decision Phase ---
  28144. --- Application Phase ---
  28145. --- Firing Productions (PE) For State At Depth 1 ---
  28146. --- Inner Elaboration Phase, active level 1 (S1) ---
  28147. Firing apply*operator
  28148. -->
  28149. (I3 ^predict-no N1087 + :O )
  28150. Firing apply*operator*complete
  28151. -->
  28152. (I3 ^predict-no N1086 - :O )
  28153. inner elaboration loop at bottom goal.
  28154. --- Change Working Memory (PE) ---
  28155. =>WM: (15244: I3 ^predict-no N1087)
  28156. <=WM: (15231: N1086 ^status complete)
  28157. <=WM: (15230: I3 ^predict-no N1086)
  28158. --- Firing Productions (IE) For State At Depth 1 ---
  28159. --- Inner Elaboration Phase, active level 1 (S1) ---
  28160. Firing monitor*world
  28161. -->
  28162. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28163. --- Change Working Memory (IE) ---
  28164. --- END Application Phase ---
  28165. --- Output Phase ---
  28166. ENV: Agent did: predict-no for direction U in state State-B
  28167. In State-B moving U
  28168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28169. predict error 0
  28170. dir: dir isR
  28171. --- END Output Phase ---
  28172. |\--- Input Phase ---
  28173. =>WM: (15248: I2 ^dir R)
  28174. =>WM: (15247: I2 ^reward 1)
  28175. =>WM: (15246: I2 ^see 0)
  28176. =>WM: (15245: N1087 ^status complete)
  28177. <=WM: (15234: I2 ^dir U)
  28178. <=WM: (15233: I2 ^reward 1)
  28179. <=WM: (15232: I2 ^see 0)
  28180. =>WM: (15249: I2 ^level-1 R0-root)
  28181. <=WM: (15235: I2 ^level-1 R0-root)
  28182. --- END Input Phase ---
  28183. --- Proposal Phase ---
  28184. --- Inner Elaboration Phase, active level 1 (S1) ---
  28185. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28186. -->
  28187. (S1 ^operator O2173 = -0.1254042659579056)
  28188. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28189. -->
  28190. (S1 ^operator O2174 = 0.7701160080460637)
  28191. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28192. -->
  28193. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28194. -->
  28195. Firing elaborate*copy-see-to-output-link
  28196. -->
  28197. (I3 ^see 0 +)
  28198. Firing elaborate*reward*based*on*reward
  28199. -->
  28200. (R1091 ^value 1 +)
  28201. (R1 ^reward R1091 +)
  28202. Firing propose*predict-yes
  28203. -->
  28204. (O2175 ^name predict-yes +)
  28205. (S1 ^operator O2175 +)
  28206. Firing propose*predict-no
  28207. -->
  28208. (O2176 ^name predict-no +)
  28209. (S1 ^operator O2176 +)
  28210. Firing rl*prefer*rvt*predict-no*H0*6
  28211. -->
  28212. (S1 ^operator O2174 = 0.229861769434934)
  28213. Firing rl*prefer*rvt*predict-yes*H0*5
  28214. -->
  28215. (S1 ^operator O2173 = 0.2940520155428289)
  28216. Firing prefer*rvt*predict-yes*H0
  28217. -->
  28218. Firing prefer*rvt*predict-no*H0
  28219. -->
  28220. Firing elaborate*copy-dir-to-output-link
  28221. -->
  28222. (I3 ^dir R +)
  28223. inner elaboration loop at bottom goal.
  28224. Retracting elaborate*copy-see-to-output-link
  28225. -->
  28226. (I3 ^see 0 +)
  28227. Retracting propose*predict-no
  28228. -->
  28229. (O2174 ^name predict-no +)
  28230. (S1 ^operator O2174 +)
  28231. Retracting propose*predict-yes
  28232. -->
  28233. (O2173 ^name predict-yes +)
  28234. (S1 ^operator O2173 +)
  28235. Retracting elaborate*reward*based*on*reward
  28236. -->
  28237. (R1090 ^value 1 +)
  28238. (R1 ^reward R1090 +)
  28239. Retracting elaborate*copy-dir-to-output-link
  28240. -->
  28241. (I3 ^dir U +)
  28242. Retracting rl*prefer*rvt*predict-no*H0*4
  28243. -->
  28244. (S1 ^operator O2174 = 1.)
  28245. Retracting rl*prefer*rvt*predict-yes*H0*3
  28246. -->
  28247. (S1 ^operator O2173 = 0.)
  28248. =>WM: (15256: S1 ^operator O2176 +)
  28249. =>WM: (15255: S1 ^operator O2175 +)
  28250. =>WM: (15254: I3 ^dir R)
  28251. =>WM: (15253: O2176 ^name predict-no)
  28252. =>WM: (15252: O2175 ^name predict-yes)
  28253. =>WM: (15251: R1091 ^value 1)
  28254. =>WM: (15250: R1 ^reward R1091)
  28255. <=WM: (15241: S1 ^operator O2173 +)
  28256. <=WM: (15242: S1 ^operator O2174 +)
  28257. <=WM: (15243: S1 ^operator O2174)
  28258. <=WM: (15240: I3 ^dir U)
  28259. <=WM: (15236: R1 ^reward R1090)
  28260. <=WM: (15239: O2174 ^name predict-no)
  28261. <=WM: (15238: O2173 ^name predict-yes)
  28262. <=WM: (15237: R1090 ^value 1)
  28263. --- Inner Elaboration Phase, active level 1 (S1) ---
  28264. Firing prefer*rvt*predict-yes*H0
  28265. -->
  28266. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28267. -->
  28268. (S1 ^operator O2175 = -0.1254042659579056)
  28269. Firing rl*prefer*rvt*predict-yes*H0*5
  28270. -->
  28271. (S1 ^operator O2175 = 0.2940520155428289)
  28272. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28273. -->
  28274. Firing prefer*rvt*predict-no*H0
  28275. -->
  28276. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28277. -->
  28278. (S1 ^operator O2176 = 0.7701160080460637)
  28279. Firing rl*prefer*rvt*predict-no*H0*6
  28280. -->
  28281. (S1 ^operator O2176 = 0.229861769434934)
  28282. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28283. -->
  28284. inner elaboration loop at bottom goal.
  28285. Retracting rl*prefer*rvt*predict-no*H0*6
  28286. -->
  28287. (S1 ^operator O2174 = 0.229861769434934)
  28288. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28289. -->
  28290. (S1 ^operator O2174 = 0.7701160080460637)
  28291. Retracting rl*prefer*rvt*predict-yes*H0*5
  28292. -->
  28293. (S1 ^operator O2173 = 0.2940520155428289)
  28294. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28295. -->
  28296. (S1 ^operator O2173 = -0.1254042659579056)
  28297. --- END Proposal Phase ---
  28298. --- Decision Phase ---
  28299. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28300. =>WM: (15257: S1 ^operator O2176)
  28301. 1088: O: O2176 (predict-no)
  28302. --- END Decision Phase ---
  28303. --- Application Phase ---
  28304. --- Firing Productions (PE) For State At Depth 1 ---
  28305. --- Inner Elaboration Phase, active level 1 (S1) ---
  28306. Firing apply*operator
  28307. -->
  28308. (I3 ^predict-no N1088 + :O )
  28309. Firing apply*operator*complete
  28310. -->
  28311. (I3 ^predict-no N1087 - :O )
  28312. inner elaboration loop at bottom goal.
  28313. --- Change Working Memory (PE) ---
  28314. =>WM: (15258: I3 ^predict-no N1088)
  28315. <=WM: (15245: N1087 ^status complete)
  28316. <=WM: (15244: I3 ^predict-no N1087)
  28317. --- Firing Productions (IE) For State At Depth 1 ---
  28318. --- Inner Elaboration Phase, active level 1 (S1) ---
  28319. Firing monitor*world
  28320. -->
  28321. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28322. --- Change Working Memory (IE) ---
  28323. --- END Application Phase ---
  28324. --- Output Phase ---
  28325. ENV: Agent did: predict-no for direction R in state State-B
  28326. In State-B moving R
  28327. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28328. predict error 0
  28329. dir: dir isR
  28330. --- END Output Phase ---
  28331. -/|--- Input Phase ---
  28332. =>WM: (15262: I2 ^dir R)
  28333. =>WM: (15261: I2 ^reward 1)
  28334. =>WM: (15260: I2 ^see 0)
  28335. =>WM: (15259: N1088 ^status complete)
  28336. <=WM: (15248: I2 ^dir R)
  28337. <=WM: (15247: I2 ^reward 1)
  28338. <=WM: (15246: I2 ^see 0)
  28339. =>WM: (15263: I2 ^level-1 R0-root)
  28340. <=WM: (15249: I2 ^level-1 R0-root)
  28341. --- END Input Phase ---
  28342. --- Proposal Phase ---
  28343. --- Inner Elaboration Phase, active level 1 (S1) ---
  28344. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28345. -->
  28346. (S1 ^operator O2175 = -0.1254042659579056)
  28347. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28348. -->
  28349. (S1 ^operator O2176 = 0.7701160080460637)
  28350. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28351. -->
  28352. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28353. -->
  28354. Firing elaborate*copy-see-to-output-link
  28355. -->
  28356. (I3 ^see 0 +)
  28357. Firing elaborate*reward*based*on*reward
  28358. -->
  28359. (R1092 ^value 1 +)
  28360. (R1 ^reward R1092 +)
  28361. Firing propose*predict-yes
  28362. -->
  28363. (O2177 ^name predict-yes +)
  28364. (S1 ^operator O2177 +)
  28365. Firing propose*predict-no
  28366. -->
  28367. (O2178 ^name predict-no +)
  28368. (S1 ^operator O2178 +)
  28369. Firing rl*prefer*rvt*predict-no*H0*6
  28370. -->
  28371. (S1 ^operator O2176 = 0.229861769434934)
  28372. Firing rl*prefer*rvt*predict-yes*H0*5
  28373. -->
  28374. (S1 ^operator O2175 = 0.2940520155428289)
  28375. Firing prefer*rvt*predict-yes*H0
  28376. -->
  28377. Firing prefer*rvt*predict-no*H0
  28378. -->
  28379. Firing elaborate*copy-dir-to-output-link
  28380. -->
  28381. (I3 ^dir R +)
  28382. inner elaboration loop at bottom goal.
  28383. Retracting elaborate*copy-see-to-output-link
  28384. -->
  28385. (I3 ^see 0 +)
  28386. Retracting propose*predict-no
  28387. -->
  28388. (O2176 ^name predict-no +)
  28389. (S1 ^operator O2176 +)
  28390. Retracting propose*predict-yes
  28391. -->
  28392. (O2175 ^name predict-yes +)
  28393. (S1 ^operator O2175 +)
  28394. Retracting elaborate*reward*based*on*reward
  28395. -->
  28396. (R1091 ^value 1 +)
  28397. (R1 ^reward R1091 +)
  28398. Retracting elaborate*copy-dir-to-output-link
  28399. -->
  28400. (I3 ^dir R +)
  28401. Retracting rl*prefer*rvt*predict-no*H0*6
  28402. -->
  28403. (S1 ^operator O2176 = 0.229861769434934)
  28404. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28405. -->
  28406. (S1 ^operator O2176 = 0.7701160080460637)
  28407. Retracting rl*prefer*rvt*predict-yes*H0*5
  28408. -->
  28409. (S1 ^operator O2175 = 0.2940520155428289)
  28410. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28411. -->
  28412. (S1 ^operator O2175 = -0.1254042659579056)
  28413. =>WM: (15269: S1 ^operator O2178 +)
  28414. =>WM: (15268: S1 ^operator O2177 +)
  28415. =>WM: (15267: O2178 ^name predict-no)
  28416. =>WM: (15266: O2177 ^name predict-yes)
  28417. =>WM: (15265: R1092 ^value 1)
  28418. =>WM: (15264: R1 ^reward R1092)
  28419. <=WM: (15255: S1 ^operator O2175 +)
  28420. <=WM: (15256: S1 ^operator O2176 +)
  28421. <=WM: (15257: S1 ^operator O2176)
  28422. <=WM: (15250: R1 ^reward R1091)
  28423. <=WM: (15253: O2176 ^name predict-no)
  28424. <=WM: (15252: O2175 ^name predict-yes)
  28425. <=WM: (15251: R1091 ^value 1)
  28426. --- Inner Elaboration Phase, active level 1 (S1) ---
  28427. Firing prefer*rvt*predict-yes*H0
  28428. -->
  28429. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28430. -->
  28431. (S1 ^operator O2177 = -0.1254042659579056)
  28432. Firing rl*prefer*rvt*predict-yes*H0*5
  28433. -->
  28434. (S1 ^operator O2177 = 0.2940520155428289)
  28435. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28436. -->
  28437. Firing prefer*rvt*predict-no*H0
  28438. -->
  28439. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28440. -->
  28441. (S1 ^operator O2178 = 0.7701160080460637)
  28442. Firing rl*prefer*rvt*predict-no*H0*6
  28443. -->
  28444. (S1 ^operator O2178 = 0.229861769434934)
  28445. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28446. -->
  28447. inner elaboration loop at bottom goal.
  28448. Retracting rl*prefer*rvt*predict-no*H0*6
  28449. -->
  28450. (S1 ^operator O2176 = 0.229861769434934)
  28451. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28452. -->
  28453. (S1 ^operator O2176 = 0.7701160080460637)
  28454. Retracting rl*prefer*rvt*predict-yes*H0*5
  28455. -->
  28456. (S1 ^operator O2175 = 0.2940520155428289)
  28457. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28458. -->
  28459. (S1 ^operator O2175 = -0.1254042659579056)
  28460. --- END Proposal Phase ---
  28461. --- Decision Phase ---
  28462. RL update rl*prefer*rvt*predict-no*H0*6 0.611914 -0.382052 0.229862 -> 0.611915 -0.382051 0.229864(R,m,v=1,0.857895,0.122556)
  28463. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388068 0.382048 0.770116 -> 0.38807 0.382048 0.770118(R,m,v=1,1,0)
  28464. =>WM: (15270: S1 ^operator O2178)
  28465. 1089: O: O2178 (predict-no)
  28466. --- END Decision Phase ---
  28467. --- Application Phase ---
  28468. --- Firing Productions (PE) For State At Depth 1 ---
  28469. --- Inner Elaboration Phase, active level 1 (S1) ---
  28470. Firing apply*operator
  28471. -->
  28472. (I3 ^predict-no N1089 + :O )
  28473. Firing apply*operator*complete
  28474. -->
  28475. (I3 ^predict-no N1088 - :O )
  28476. inner elaboration loop at bottom goal.
  28477. --- Change Working Memory (PE) ---
  28478. =>WM: (15271: I3 ^predict-no N1089)
  28479. <=WM: (15259: N1088 ^status complete)
  28480. <=WM: (15258: I3 ^predict-no N1088)
  28481. --- Firing Productions (IE) For State At Depth 1 ---
  28482. --- Inner Elaboration Phase, active level 1 (S1) ---
  28483. Firing monitor*world
  28484. -->
  28485. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28486. --- Change Working Memory (IE) ---
  28487. --- END Application Phase ---
  28488. --- Output Phase ---
  28489. ENV: Agent did: predict-no for direction R in state State-B
  28490. In State-B moving R
  28491. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28492. predict error 0
  28493. dir: dir isR
  28494. --- END Output Phase ---
  28495. \---- Input Phase ---
  28496. =>WM: (15275: I2 ^dir R)
  28497. =>WM: (15274: I2 ^reward 1)
  28498. =>WM: (15273: I2 ^see 0)
  28499. =>WM: (15272: N1089 ^status complete)
  28500. <=WM: (15262: I2 ^dir R)
  28501. <=WM: (15261: I2 ^reward 1)
  28502. <=WM: (15260: I2 ^see 0)
  28503. =>WM: (15276: I2 ^level-1 R0-root)
  28504. <=WM: (15263: I2 ^level-1 R0-root)
  28505. --- END Input Phase ---
  28506. --- Proposal Phase ---
  28507. --- Inner Elaboration Phase, active level 1 (S1) ---
  28508. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28509. -->
  28510. (S1 ^operator O2177 = -0.1254042659579056)
  28511. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28512. -->
  28513. (S1 ^operator O2178 = 0.7701180366340212)
  28514. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28515. -->
  28516. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28517. -->
  28518. Firing elaborate*copy-see-to-output-link
  28519. -->
  28520. (I3 ^see 0 +)
  28521. Firing elaborate*reward*based*on*reward
  28522. -->
  28523. (R1093 ^value 1 +)
  28524. (R1 ^reward R1093 +)
  28525. Firing propose*predict-yes
  28526. -->
  28527. (O2179 ^name predict-yes +)
  28528. (S1 ^operator O2179 +)
  28529. Firing propose*predict-no
  28530. -->
  28531. (O2180 ^name predict-no +)
  28532. (S1 ^operator O2180 +)
  28533. Firing rl*prefer*rvt*predict-no*H0*6
  28534. -->
  28535. (S1 ^operator O2178 = 0.229863548083355)
  28536. Firing rl*prefer*rvt*predict-yes*H0*5
  28537. -->
  28538. (S1 ^operator O2177 = 0.2940520155428289)
  28539. Firing prefer*rvt*predict-yes*H0
  28540. -->
  28541. Firing prefer*rvt*predict-no*H0
  28542. -->
  28543. Firing elaborate*copy-dir-to-output-link
  28544. -->
  28545. (I3 ^dir R +)
  28546. inner elaboration loop at bottom goal.
  28547. Retracting elaborate*copy-see-to-output-link
  28548. -->
  28549. (I3 ^see 0 +)
  28550. Retracting propose*predict-no
  28551. -->
  28552. (O2178 ^name predict-no +)
  28553. (S1 ^operator O2178 +)
  28554. Retracting propose*predict-yes
  28555. -->
  28556. (O2177 ^name predict-yes +)
  28557. (S1 ^operator O2177 +)
  28558. Retracting elaborate*reward*based*on*reward
  28559. -->
  28560. (R1092 ^value 1 +)
  28561. (R1 ^reward R1092 +)
  28562. Retracting elaborate*copy-dir-to-output-link
  28563. -->
  28564. (I3 ^dir R +)
  28565. Retracting rl*prefer*rvt*predict-no*H0*6
  28566. -->
  28567. (S1 ^operator O2178 = 0.229863548083355)
  28568. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28569. -->
  28570. (S1 ^operator O2178 = 0.7701180366340212)
  28571. Retracting rl*prefer*rvt*predict-yes*H0*5
  28572. -->
  28573. (S1 ^operator O2177 = 0.2940520155428289)
  28574. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28575. -->
  28576. (S1 ^operator O2177 = -0.1254042659579056)
  28577. =>WM: (15282: S1 ^operator O2180 +)
  28578. =>WM: (15281: S1 ^operator O2179 +)
  28579. =>WM: (15280: O2180 ^name predict-no)
  28580. =>WM: (15279: O2179 ^name predict-yes)
  28581. =>WM: (15278: R1093 ^value 1)
  28582. =>WM: (15277: R1 ^reward R1093)
  28583. <=WM: (15268: S1 ^operator O2177 +)
  28584. <=WM: (15269: S1 ^operator O2178 +)
  28585. <=WM: (15270: S1 ^operator O2178)
  28586. <=WM: (15264: R1 ^reward R1092)
  28587. <=WM: (15267: O2178 ^name predict-no)
  28588. <=WM: (15266: O2177 ^name predict-yes)
  28589. <=WM: (15265: R1092 ^value 1)
  28590. --- Inner Elaboration Phase, active level 1 (S1) ---
  28591. Firing prefer*rvt*predict-yes*H0
  28592. -->
  28593. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28594. -->
  28595. (S1 ^operator O2179 = -0.1254042659579056)
  28596. Firing rl*prefer*rvt*predict-yes*H0*5
  28597. -->
  28598. (S1 ^operator O2179 = 0.2940520155428289)
  28599. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28600. -->
  28601. Firing prefer*rvt*predict-no*H0
  28602. -->
  28603. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28604. -->
  28605. (S1 ^operator O2180 = 0.7701180366340212)
  28606. Firing rl*prefer*rvt*predict-no*H0*6
  28607. -->
  28608. (S1 ^operator O2180 = 0.229863548083355)
  28609. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28610. -->
  28611. inner elaboration loop at bottom goal.
  28612. Retracting rl*prefer*rvt*predict-no*H0*6
  28613. -->
  28614. (S1 ^operator O2178 = 0.229863548083355)
  28615. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28616. -->
  28617. (S1 ^operator O2178 = 0.7701180366340212)
  28618. Retracting rl*prefer*rvt*predict-yes*H0*5
  28619. -->
  28620. (S1 ^operator O2177 = 0.2940520155428289)
  28621. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28622. -->
  28623. (S1 ^operator O2177 = -0.1254042659579056)
  28624. --- END Proposal Phase ---
  28625. --- Decision Phase ---
  28626. RL update rl*prefer*rvt*predict-no*H0*6 0.611915 -0.382051 0.229864 -> 0.611916 -0.382051 0.229865(R,m,v=1,0.858639,0.122017)
  28627. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.38807 0.382048 0.770118 -> 0.388071 0.382048 0.77012(R,m,v=1,1,0)
  28628. =>WM: (15283: S1 ^operator O2180)
  28629. 1090: O: O2180 (predict-no)
  28630. --- END Decision Phase ---
  28631. --- Application Phase ---
  28632. --- Firing Productions (PE) For State At Depth 1 ---
  28633. --- Inner Elaboration Phase, active level 1 (S1) ---
  28634. Firing apply*operator
  28635. -->
  28636. (I3 ^predict-no N1090 + :O )
  28637. Firing apply*operator*complete
  28638. -->
  28639. (I3 ^predict-no N1089 - :O )
  28640. inner elaboration loop at bottom goal.
  28641. --- Change Working Memory (PE) ---
  28642. =>WM: (15284: I3 ^predict-no N1090)
  28643. <=WM: (15272: N1089 ^status complete)
  28644. <=WM: (15271: I3 ^predict-no N1089)
  28645. --- Firing Productions (IE) For State At Depth 1 ---
  28646. --- Inner Elaboration Phase, active level 1 (S1) ---
  28647. Firing monitor*world
  28648. -->
  28649. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28650. --- Change Working Memory (IE) ---
  28651. --- END Application Phase ---
  28652. --- Output Phase ---
  28653. ENV: Agent did: predict-no for direction R in state State-B
  28654. In State-B moving R
  28655. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28656. predict error 0
  28657. dir: dir isR
  28658. --- END Output Phase ---
  28659. /|\--- Input Phase ---
  28660. =>WM: (15288: I2 ^dir R)
  28661. =>WM: (15287: I2 ^reward 1)
  28662. =>WM: (15286: I2 ^see 0)
  28663. =>WM: (15285: N1090 ^status complete)
  28664. <=WM: (15275: I2 ^dir R)
  28665. <=WM: (15274: I2 ^reward 1)
  28666. <=WM: (15273: I2 ^see 0)
  28667. =>WM: (15289: I2 ^level-1 R0-root)
  28668. <=WM: (15276: I2 ^level-1 R0-root)
  28669. --- END Input Phase ---
  28670. --- Proposal Phase ---
  28671. --- Inner Elaboration Phase, active level 1 (S1) ---
  28672. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28673. -->
  28674. (S1 ^operator O2179 = -0.1254042659579056)
  28675. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28676. -->
  28677. (S1 ^operator O2180 = 0.7701197142167014)
  28678. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28679. -->
  28680. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28681. -->
  28682. Firing elaborate*copy-see-to-output-link
  28683. -->
  28684. (I3 ^see 0 +)
  28685. Firing elaborate*reward*based*on*reward
  28686. -->
  28687. (R1094 ^value 1 +)
  28688. (R1 ^reward R1094 +)
  28689. Firing propose*predict-yes
  28690. -->
  28691. (O2181 ^name predict-yes +)
  28692. (S1 ^operator O2181 +)
  28693. Firing propose*predict-no
  28694. -->
  28695. (O2182 ^name predict-no +)
  28696. (S1 ^operator O2182 +)
  28697. Firing rl*prefer*rvt*predict-no*H0*6
  28698. -->
  28699. (S1 ^operator O2180 = 0.2298650207702772)
  28700. Firing rl*prefer*rvt*predict-yes*H0*5
  28701. -->
  28702. (S1 ^operator O2179 = 0.2940520155428289)
  28703. Firing prefer*rvt*predict-yes*H0
  28704. -->
  28705. Firing prefer*rvt*predict-no*H0
  28706. -->
  28707. Firing elaborate*copy-dir-to-output-link
  28708. -->
  28709. (I3 ^dir R +)
  28710. inner elaboration loop at bottom goal.
  28711. Retracting elaborate*copy-see-to-output-link
  28712. -->
  28713. (I3 ^see 0 +)
  28714. Retracting propose*predict-no
  28715. -->
  28716. (O2180 ^name predict-no +)
  28717. (S1 ^operator O2180 +)
  28718. Retracting propose*predict-yes
  28719. -->
  28720. (O2179 ^name predict-yes +)
  28721. (S1 ^operator O2179 +)
  28722. Retracting elaborate*reward*based*on*reward
  28723. -->
  28724. (R1093 ^value 1 +)
  28725. (R1 ^reward R1093 +)
  28726. Retracting elaborate*copy-dir-to-output-link
  28727. -->
  28728. (I3 ^dir R +)
  28729. Retracting rl*prefer*rvt*predict-no*H0*6
  28730. -->
  28731. (S1 ^operator O2180 = 0.2298650207702772)
  28732. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28733. -->
  28734. (S1 ^operator O2180 = 0.7701197142167014)
  28735. Retracting rl*prefer*rvt*predict-yes*H0*5
  28736. -->
  28737. (S1 ^operator O2179 = 0.2940520155428289)
  28738. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28739. -->
  28740. (S1 ^operator O2179 = -0.1254042659579056)
  28741. =>WM: (15295: S1 ^operator O2182 +)
  28742. =>WM: (15294: S1 ^operator O2181 +)
  28743. =>WM: (15293: O2182 ^name predict-no)
  28744. =>WM: (15292: O2181 ^name predict-yes)
  28745. =>WM: (15291: R1094 ^value 1)
  28746. =>WM: (15290: R1 ^reward R1094)
  28747. <=WM: (15281: S1 ^operator O2179 +)
  28748. <=WM: (15282: S1 ^operator O2180 +)
  28749. <=WM: (15283: S1 ^operator O2180)
  28750. <=WM: (15277: R1 ^reward R1093)
  28751. <=WM: (15280: O2180 ^name predict-no)
  28752. <=WM: (15279: O2179 ^name predict-yes)
  28753. <=WM: (15278: R1093 ^value 1)
  28754. --- Inner Elaboration Phase, active level 1 (S1) ---
  28755. Firing prefer*rvt*predict-yes*H0
  28756. -->
  28757. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28758. -->
  28759. (S1 ^operator O2181 = -0.1254042659579056)
  28760. Firing rl*prefer*rvt*predict-yes*H0*5
  28761. -->
  28762. (S1 ^operator O2181 = 0.2940520155428289)
  28763. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28764. -->
  28765. Firing prefer*rvt*predict-no*H0
  28766. -->
  28767. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28768. -->
  28769. (S1 ^operator O2182 = 0.7701197142167014)
  28770. Firing rl*prefer*rvt*predict-no*H0*6
  28771. -->
  28772. (S1 ^operator O2182 = 0.2298650207702772)
  28773. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28774. -->
  28775. inner elaboration loop at bottom goal.
  28776. Retracting rl*prefer*rvt*predict-no*H0*6
  28777. -->
  28778. (S1 ^operator O2180 = 0.2298650207702772)
  28779. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28780. -->
  28781. (S1 ^operator O2180 = 0.7701197142167014)
  28782. Retracting rl*prefer*rvt*predict-yes*H0*5
  28783. -->
  28784. (S1 ^operator O2179 = 0.2940520155428289)
  28785. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28786. -->
  28787. (S1 ^operator O2179 = -0.1254042659579056)
  28788. --- END Proposal Phase ---
  28789. --- Decision Phase ---
  28790. RL update rl*prefer*rvt*predict-no*H0*6 0.611916 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.859375,0.121482)
  28791. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388071 0.382048 0.77012 -> 0.388073 0.382049 0.770121(R,m,v=1,1,0)
  28792. =>WM: (15296: S1 ^operator O2182)
  28793. 1091: O: O2182 (predict-no)
  28794. --- END Decision Phase ---
  28795. --- Application Phase ---
  28796. --- Firing Productions (PE) For State At Depth 1 ---
  28797. --- Inner Elaboration Phase, active level 1 (S1) ---
  28798. Firing apply*operator
  28799. -->
  28800. (I3 ^predict-no N1091 + :O )
  28801. Firing apply*operator*complete
  28802. -->
  28803. (I3 ^predict-no N1090 - :O )
  28804. inner elaboration loop at bottom goal.
  28805. --- Change Working Memory (PE) ---
  28806. =>WM: (15297: I3 ^predict-no N1091)
  28807. <=WM: (15285: N1090 ^status complete)
  28808. <=WM: (15284: I3 ^predict-no N1090)
  28809. --- Firing Productions (IE) For State At Depth 1 ---
  28810. --- Inner Elaboration Phase, active level 1 (S1) ---
  28811. Firing monitor*world
  28812. -->
  28813. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28814. --- Change Working Memory (IE) ---
  28815. --- END Application Phase ---
  28816. --- Output Phase ---
  28817. ENV: Agent did: predict-no for direction R in state State-B
  28818. In State-B moving R
  28819. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28820. predict error 0
  28821. dir: dir isR
  28822. --- END Output Phase ---
  28823. ---- Input Phase ---
  28824. =>WM: (15301: I2 ^dir R)
  28825. =>WM: (15300: I2 ^reward 1)
  28826. =>WM: (15299: I2 ^see 0)
  28827. =>WM: (15298: N1091 ^status complete)
  28828. <=WM: (15288: I2 ^dir R)
  28829. <=WM: (15287: I2 ^reward 1)
  28830. <=WM: (15286: I2 ^see 0)
  28831. =>WM: (15302: I2 ^level-1 R0-root)
  28832. <=WM: (15289: I2 ^level-1 R0-root)
  28833. --- END Input Phase ---
  28834. --- Proposal Phase ---
  28835. --- Inner Elaboration Phase, active level 1 (S1) ---
  28836. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28837. -->
  28838. (S1 ^operator O2181 = -0.1254042659579056)
  28839. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28840. -->
  28841. (S1 ^operator O2182 = 0.7701211019931825)
  28842. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28843. -->
  28844. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28845. -->
  28846. Firing elaborate*copy-see-to-output-link
  28847. -->
  28848. (I3 ^see 0 +)
  28849. Firing elaborate*reward*based*on*reward
  28850. -->
  28851. (R1095 ^value 1 +)
  28852. (R1 ^reward R1095 +)
  28853. Firing propose*predict-yes
  28854. -->
  28855. (O2183 ^name predict-yes +)
  28856. (S1 ^operator O2183 +)
  28857. Firing propose*predict-no
  28858. -->
  28859. (O2184 ^name predict-no +)
  28860. (S1 ^operator O2184 +)
  28861. Firing rl*prefer*rvt*predict-no*H0*6
  28862. -->
  28863. (S1 ^operator O2182 = 0.2298662405085362)
  28864. Firing rl*prefer*rvt*predict-yes*H0*5
  28865. -->
  28866. (S1 ^operator O2181 = 0.2940520155428289)
  28867. Firing prefer*rvt*predict-yes*H0
  28868. -->
  28869. Firing prefer*rvt*predict-no*H0
  28870. -->
  28871. Firing elaborate*copy-dir-to-output-link
  28872. -->
  28873. (I3 ^dir R +)
  28874. inner elaboration loop at bottom goal.
  28875. Retracting elaborate*copy-see-to-output-link
  28876. -->
  28877. (I3 ^see 0 +)
  28878. Retracting propose*predict-no
  28879. -->
  28880. (O2182 ^name predict-no +)
  28881. (S1 ^operator O2182 +)
  28882. Retracting propose*predict-yes
  28883. -->
  28884. (O2181 ^name predict-yes +)
  28885. (S1 ^operator O2181 +)
  28886. Retracting elaborate*reward*based*on*reward
  28887. -->
  28888. (R1094 ^value 1 +)
  28889. (R1 ^reward R1094 +)
  28890. Retracting elaborate*copy-dir-to-output-link
  28891. -->
  28892. (I3 ^dir R +)
  28893. Retracting rl*prefer*rvt*predict-no*H0*6
  28894. -->
  28895. (S1 ^operator O2182 = 0.2298662405085362)
  28896. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28897. -->
  28898. (S1 ^operator O2182 = 0.7701211019931825)
  28899. Retracting rl*prefer*rvt*predict-yes*H0*5
  28900. -->
  28901. (S1 ^operator O2181 = 0.2940520155428289)
  28902. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28903. -->
  28904. (S1 ^operator O2181 = -0.1254042659579056)
  28905. =>WM: (15308: S1 ^operator O2184 +)
  28906. =>WM: (15307: S1 ^operator O2183 +)
  28907. =>WM: (15306: O2184 ^name predict-no)
  28908. =>WM: (15305: O2183 ^name predict-yes)
  28909. =>WM: (15304: R1095 ^value 1)
  28910. =>WM: (15303: R1 ^reward R1095)
  28911. <=WM: (15294: S1 ^operator O2181 +)
  28912. <=WM: (15295: S1 ^operator O2182 +)
  28913. <=WM: (15296: S1 ^operator O2182)
  28914. <=WM: (15290: R1 ^reward R1094)
  28915. <=WM: (15293: O2182 ^name predict-no)
  28916. <=WM: (15292: O2181 ^name predict-yes)
  28917. <=WM: (15291: R1094 ^value 1)
  28918. --- Inner Elaboration Phase, active level 1 (S1) ---
  28919. Firing prefer*rvt*predict-yes*H0
  28920. -->
  28921. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28922. -->
  28923. (S1 ^operator O2183 = -0.1254042659579056)
  28924. Firing rl*prefer*rvt*predict-yes*H0*5
  28925. -->
  28926. (S1 ^operator O2183 = 0.2940520155428289)
  28927. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28928. -->
  28929. Firing prefer*rvt*predict-no*H0
  28930. -->
  28931. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28932. -->
  28933. (S1 ^operator O2184 = 0.7701211019931825)
  28934. Firing rl*prefer*rvt*predict-no*H0*6
  28935. -->
  28936. (S1 ^operator O2184 = 0.2298662405085362)
  28937. Firing prefer*rvt*predict-no*H0*6*v1*H1
  28938. -->
  28939. inner elaboration loop at bottom goal.
  28940. Retracting rl*prefer*rvt*predict-no*H0*6
  28941. -->
  28942. (S1 ^operator O2182 = 0.2298662405085362)
  28943. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  28944. -->
  28945. (S1 ^operator O2182 = 0.7701211019931825)
  28946. Retracting rl*prefer*rvt*predict-yes*H0*5
  28947. -->
  28948. (S1 ^operator O2181 = 0.2940520155428289)
  28949. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  28950. -->
  28951. (S1 ^operator O2181 = -0.1254042659579056)
  28952. --- END Proposal Phase ---
  28953. --- Decision Phase ---
  28954. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611918 -0.382051 0.229867(R,m,v=1,0.860104,0.120952)
  28955. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388073 0.382049 0.770121 -> 0.388073 0.382049 0.770122(R,m,v=1,1,0)
  28956. =>WM: (15309: S1 ^operator O2184)
  28957. 1092: O: O2184 (predict-no)
  28958. --- END Decision Phase ---
  28959. --- Application Phase ---
  28960. --- Firing Productions (PE) For State At Depth 1 ---
  28961. --- Inner Elaboration Phase, active level 1 (S1) ---
  28962. Firing apply*operator
  28963. -->
  28964. (I3 ^predict-no N1092 + :O )
  28965. Firing apply*operator*complete
  28966. -->
  28967. (I3 ^predict-no N1091 - :O )
  28968. inner elaboration loop at bottom goal.
  28969. --- Change Working Memory (PE) ---
  28970. =>WM: (15310: I3 ^predict-no N1092)
  28971. <=WM: (15298: N1091 ^status complete)
  28972. <=WM: (15297: I3 ^predict-no N1091)
  28973. --- Firing Productions (IE) For State At Depth 1 ---
  28974. --- Inner Elaboration Phase, active level 1 (S1) ---
  28975. Firing monitor*world
  28976. -->
  28977. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28978. --- Change Working Memory (IE) ---
  28979. --- END Application Phase ---
  28980. --- Output Phase ---
  28981. ENV: Agent did: predict-no for direction R in state State-B
  28982. In State-B moving R
  28983. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28984. predict error 0
  28985. dir: dir isU
  28986. --- END Output Phase ---
  28987. /|\--- Input Phase ---
  28988. =>WM: (15314: I2 ^dir U)
  28989. =>WM: (15313: I2 ^reward 1)
  28990. =>WM: (15312: I2 ^see 0)
  28991. =>WM: (15311: N1092 ^status complete)
  28992. <=WM: (15301: I2 ^dir R)
  28993. <=WM: (15300: I2 ^reward 1)
  28994. <=WM: (15299: I2 ^see 0)
  28995. =>WM: (15315: I2 ^level-1 R0-root)
  28996. <=WM: (15302: I2 ^level-1 R0-root)
  28997. --- END Input Phase ---
  28998. --- Proposal Phase ---
  28999. --- Inner Elaboration Phase, active level 1 (S1) ---
  29000. Firing elaborate*copy-see-to-output-link
  29001. -->
  29002. (I3 ^see 0 +)
  29003. Firing elaborate*reward*based*on*reward
  29004. -->
  29005. (R1096 ^value 1 +)
  29006. (R1 ^reward R1096 +)
  29007. Firing propose*predict-yes
  29008. -->
  29009. (O2185 ^name predict-yes +)
  29010. (S1 ^operator O2185 +)
  29011. Firing propose*predict-no
  29012. -->
  29013. (O2186 ^name predict-no +)
  29014. (S1 ^operator O2186 +)
  29015. Firing rl*prefer*rvt*predict-no*H0*4
  29016. -->
  29017. (S1 ^operator O2184 = 1.)
  29018. Firing rl*prefer*rvt*predict-yes*H0*3
  29019. -->
  29020. (S1 ^operator O2183 = 0.)
  29021. Firing prefer*rvt*predict-yes*H0
  29022. -->
  29023. Firing prefer*rvt*predict-no*H0
  29024. -->
  29025. Firing elaborate*copy-dir-to-output-link
  29026. -->
  29027. (I3 ^dir U +)
  29028. inner elaboration loop at bottom goal.
  29029. Retracting elaborate*copy-see-to-output-link
  29030. -->
  29031. (I3 ^see 0 +)
  29032. Retracting propose*predict-no
  29033. -->
  29034. (O2184 ^name predict-no +)
  29035. (S1 ^operator O2184 +)
  29036. Retracting propose*predict-yes
  29037. -->
  29038. (O2183 ^name predict-yes +)
  29039. (S1 ^operator O2183 +)
  29040. Retracting elaborate*reward*based*on*reward
  29041. -->
  29042. (R1095 ^value 1 +)
  29043. (R1 ^reward R1095 +)
  29044. Retracting elaborate*copy-dir-to-output-link
  29045. -->
  29046. (I3 ^dir R +)
  29047. Retracting rl*prefer*rvt*predict-no*H0*6
  29048. -->
  29049. (S1 ^operator O2184 = 0.2298672510565515)
  29050. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29051. -->
  29052. (S1 ^operator O2184 = 0.7701222504073515)
  29053. Retracting rl*prefer*rvt*predict-yes*H0*5
  29054. -->
  29055. (S1 ^operator O2183 = 0.2940520155428289)
  29056. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29057. -->
  29058. (S1 ^operator O2183 = -0.1254042659579056)
  29059. =>WM: (15322: S1 ^operator O2186 +)
  29060. =>WM: (15321: S1 ^operator O2185 +)
  29061. =>WM: (15320: I3 ^dir U)
  29062. =>WM: (15319: O2186 ^name predict-no)
  29063. =>WM: (15318: O2185 ^name predict-yes)
  29064. =>WM: (15317: R1096 ^value 1)
  29065. =>WM: (15316: R1 ^reward R1096)
  29066. <=WM: (15307: S1 ^operator O2183 +)
  29067. <=WM: (15308: S1 ^operator O2184 +)
  29068. <=WM: (15309: S1 ^operator O2184)
  29069. <=WM: (15254: I3 ^dir R)
  29070. <=WM: (15303: R1 ^reward R1095)
  29071. <=WM: (15306: O2184 ^name predict-no)
  29072. <=WM: (15305: O2183 ^name predict-yes)
  29073. <=WM: (15304: R1095 ^value 1)
  29074. --- Inner Elaboration Phase, active level 1 (S1) ---
  29075. Firing prefer*rvt*predict-yes*H0
  29076. -->
  29077. Firing rl*prefer*rvt*predict-yes*H0*3
  29078. -->
  29079. (S1 ^operator O2185 = 0.)
  29080. Firing prefer*rvt*predict-no*H0
  29081. -->
  29082. Firing rl*prefer*rvt*predict-no*H0*4
  29083. -->
  29084. (S1 ^operator O2186 = 1.)
  29085. inner elaboration loop at bottom goal.
  29086. Retracting rl*prefer*rvt*predict-no*H0*4
  29087. -->
  29088. (S1 ^operator O2184 = 1.)
  29089. Retracting rl*prefer*rvt*predict-yes*H0*3
  29090. -->
  29091. (S1 ^operator O2183 = 0.)
  29092. --- END Proposal Phase ---
  29093. --- Decision Phase ---
  29094. RL update rl*prefer*rvt*predict-no*H0*6 0.611918 -0.382051 0.229867 -> 0.611919 -0.382051 0.229868(R,m,v=1,0.860825,0.120426)
  29095. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388073 0.382049 0.770122 -> 0.388074 0.382049 0.770123(R,m,v=1,1,0)
  29096. =>WM: (15323: S1 ^operator O2186)
  29097. 1093: O: O2186 (predict-no)
  29098. --- END Decision Phase ---
  29099. --- Application Phase ---
  29100. --- Firing Productions (PE) For State At Depth 1 ---
  29101. --- Inner Elaboration Phase, active level 1 (S1) ---
  29102. Firing apply*operator
  29103. -->
  29104. (I3 ^predict-no N1093 + :O )
  29105. Firing apply*operator*complete
  29106. -->
  29107. (I3 ^predict-no N1092 - :O )
  29108. inner elaboration loop at bottom goal.
  29109. --- Change Working Memory (PE) ---
  29110. =>WM: (15324: I3 ^predict-no N1093)
  29111. <=WM: (15311: N1092 ^status complete)
  29112. <=WM: (15310: I3 ^predict-no N1092)
  29113. --- Firing Productions (IE) For State At Depth 1 ---
  29114. --- Inner Elaboration Phase, active level 1 (S1) ---
  29115. Firing monitor*world
  29116. -->
  29117. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29118. --- Change Working Memory (IE) ---
  29119. --- END Application Phase ---
  29120. --- Output Phase ---
  29121. ENV: Agent did: predict-no for direction U in state State-B
  29122. In State-B moving U
  29123. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29124. predict error 0
  29125. dir: dir isR
  29126. --- END Output Phase ---
  29127. -/--- Input Phase ---
  29128. =>WM: (15328: I2 ^dir R)
  29129. =>WM: (15327: I2 ^reward 1)
  29130. =>WM: (15326: I2 ^see 0)
  29131. =>WM: (15325: N1093 ^status complete)
  29132. <=WM: (15314: I2 ^dir U)
  29133. <=WM: (15313: I2 ^reward 1)
  29134. <=WM: (15312: I2 ^see 0)
  29135. =>WM: (15329: I2 ^level-1 R0-root)
  29136. <=WM: (15315: I2 ^level-1 R0-root)
  29137. --- END Input Phase ---
  29138. --- Proposal Phase ---
  29139. --- Inner Elaboration Phase, active level 1 (S1) ---
  29140. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29141. -->
  29142. (S1 ^operator O2185 = -0.1254042659579056)
  29143. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29144. -->
  29145. (S1 ^operator O2186 = 0.770123201053682)
  29146. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29147. -->
  29148. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29149. -->
  29150. Firing elaborate*copy-see-to-output-link
  29151. -->
  29152. (I3 ^see 0 +)
  29153. Firing elaborate*reward*based*on*reward
  29154. -->
  29155. (R1097 ^value 1 +)
  29156. (R1 ^reward R1097 +)
  29157. Firing propose*predict-yes
  29158. -->
  29159. (O2187 ^name predict-yes +)
  29160. (S1 ^operator O2187 +)
  29161. Firing propose*predict-no
  29162. -->
  29163. (O2188 ^name predict-no +)
  29164. (S1 ^operator O2188 +)
  29165. Firing rl*prefer*rvt*predict-no*H0*6
  29166. -->
  29167. (S1 ^operator O2186 = 0.2298680885464747)
  29168. Firing rl*prefer*rvt*predict-yes*H0*5
  29169. -->
  29170. (S1 ^operator O2185 = 0.2940520155428289)
  29171. Firing prefer*rvt*predict-yes*H0
  29172. -->
  29173. Firing prefer*rvt*predict-no*H0
  29174. -->
  29175. Firing elaborate*copy-dir-to-output-link
  29176. -->
  29177. (I3 ^dir R +)
  29178. inner elaboration loop at bottom goal.
  29179. Retracting elaborate*copy-see-to-output-link
  29180. -->
  29181. (I3 ^see 0 +)
  29182. Retracting propose*predict-no
  29183. -->
  29184. (O2186 ^name predict-no +)
  29185. (S1 ^operator O2186 +)
  29186. Retracting propose*predict-yes
  29187. -->
  29188. (O2185 ^name predict-yes +)
  29189. (S1 ^operator O2185 +)
  29190. Retracting elaborate*reward*based*on*reward
  29191. -->
  29192. (R1096 ^value 1 +)
  29193. (R1 ^reward R1096 +)
  29194. Retracting elaborate*copy-dir-to-output-link
  29195. -->
  29196. (I3 ^dir U +)
  29197. Retracting rl*prefer*rvt*predict-no*H0*4
  29198. -->
  29199. (S1 ^operator O2186 = 1.)
  29200. Retracting rl*prefer*rvt*predict-yes*H0*3
  29201. -->
  29202. (S1 ^operator O2185 = 0.)
  29203. =>WM: (15336: S1 ^operator O2188 +)
  29204. =>WM: (15335: S1 ^operator O2187 +)
  29205. =>WM: (15334: I3 ^dir R)
  29206. =>WM: (15333: O2188 ^name predict-no)
  29207. =>WM: (15332: O2187 ^name predict-yes)
  29208. =>WM: (15331: R1097 ^value 1)
  29209. =>WM: (15330: R1 ^reward R1097)
  29210. <=WM: (15321: S1 ^operator O2185 +)
  29211. <=WM: (15322: S1 ^operator O2186 +)
  29212. <=WM: (15323: S1 ^operator O2186)
  29213. <=WM: (15320: I3 ^dir U)
  29214. <=WM: (15316: R1 ^reward R1096)
  29215. <=WM: (15319: O2186 ^name predict-no)
  29216. <=WM: (15318: O2185 ^name predict-yes)
  29217. <=WM: (15317: R1096 ^value 1)
  29218. --- Inner Elaboration Phase, active level 1 (S1) ---
  29219. Firing prefer*rvt*predict-yes*H0
  29220. -->
  29221. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29222. -->
  29223. (S1 ^operator O2187 = -0.1254042659579056)
  29224. Firing rl*prefer*rvt*predict-yes*H0*5
  29225. -->
  29226. (S1 ^operator O2187 = 0.2940520155428289)
  29227. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29228. -->
  29229. Firing prefer*rvt*predict-no*H0
  29230. -->
  29231. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29232. -->
  29233. (S1 ^operator O2188 = 0.770123201053682)
  29234. Firing rl*prefer*rvt*predict-no*H0*6
  29235. -->
  29236. (S1 ^operator O2188 = 0.2298680885464747)
  29237. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29238. -->
  29239. inner elaboration loop at bottom goal.
  29240. Retracting rl*prefer*rvt*predict-no*H0*6
  29241. -->
  29242. (S1 ^operator O2186 = 0.2298680885464747)
  29243. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29244. -->
  29245. (S1 ^operator O2186 = 0.770123201053682)
  29246. Retracting rl*prefer*rvt*predict-yes*H0*5
  29247. -->
  29248. (S1 ^operator O2185 = 0.2940520155428289)
  29249. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29250. -->
  29251. (S1 ^operator O2185 = -0.1254042659579056)
  29252. --- END Proposal Phase ---
  29253. --- Decision Phase ---
  29254. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29255. =>WM: (15337: S1 ^operator O2188)
  29256. 1094: O: O2188 (predict-no)
  29257. --- END Decision Phase ---
  29258. --- Application Phase ---
  29259. --- Firing Productions (PE) For State At Depth 1 ---
  29260. --- Inner Elaboration Phase, active level 1 (S1) ---
  29261. Firing apply*operator
  29262. -->
  29263. (I3 ^predict-no N1094 + :O )
  29264. Firing apply*operator*complete
  29265. -->
  29266. (I3 ^predict-no N1093 - :O )
  29267. inner elaboration loop at bottom goal.
  29268. --- Change Working Memory (PE) ---
  29269. =>WM: (15338: I3 ^predict-no N1094)
  29270. <=WM: (15325: N1093 ^status complete)
  29271. <=WM: (15324: I3 ^predict-no N1093)
  29272. --- Firing Productions (IE) For State At Depth 1 ---
  29273. --- Inner Elaboration Phase, active level 1 (S1) ---
  29274. Firing monitor*world
  29275. -->
  29276. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29277. --- Change Working Memory (IE) ---
  29278. --- END Application Phase ---
  29279. --- Output Phase ---
  29280. ENV: Agent did: predict-no for direction R in state State-B
  29281. In State-B moving R
  29282. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29283. predict error 0
  29284. dir: dir isR
  29285. --- END Output Phase ---
  29286. |\---- Input Phase ---
  29287. =>WM: (15342: I2 ^dir R)
  29288. =>WM: (15341: I2 ^reward 1)
  29289. =>WM: (15340: I2 ^see 0)
  29290. =>WM: (15339: N1094 ^status complete)
  29291. <=WM: (15328: I2 ^dir R)
  29292. <=WM: (15327: I2 ^reward 1)
  29293. <=WM: (15326: I2 ^see 0)
  29294. =>WM: (15343: I2 ^level-1 R0-root)
  29295. <=WM: (15329: I2 ^level-1 R0-root)
  29296. --- END Input Phase ---
  29297. --- Proposal Phase ---
  29298. --- Inner Elaboration Phase, active level 1 (S1) ---
  29299. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29300. -->
  29301. (S1 ^operator O2187 = -0.1254042659579056)
  29302. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29303. -->
  29304. (S1 ^operator O2188 = 0.770123201053682)
  29305. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29306. -->
  29307. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29308. -->
  29309. Firing elaborate*copy-see-to-output-link
  29310. -->
  29311. (I3 ^see 0 +)
  29312. Firing elaborate*reward*based*on*reward
  29313. -->
  29314. (R1098 ^value 1 +)
  29315. (R1 ^reward R1098 +)
  29316. Firing propose*predict-yes
  29317. -->
  29318. (O2189 ^name predict-yes +)
  29319. (S1 ^operator O2189 +)
  29320. Firing propose*predict-no
  29321. -->
  29322. (O2190 ^name predict-no +)
  29323. (S1 ^operator O2190 +)
  29324. Firing rl*prefer*rvt*predict-no*H0*6
  29325. -->
  29326. (S1 ^operator O2188 = 0.2298680885464747)
  29327. Firing rl*prefer*rvt*predict-yes*H0*5
  29328. -->
  29329. (S1 ^operator O2187 = 0.2940520155428289)
  29330. Firing prefer*rvt*predict-yes*H0
  29331. -->
  29332. Firing prefer*rvt*predict-no*H0
  29333. -->
  29334. Firing elaborate*copy-dir-to-output-link
  29335. -->
  29336. (I3 ^dir R +)
  29337. inner elaboration loop at bottom goal.
  29338. Retracting elaborate*copy-see-to-output-link
  29339. -->
  29340. (I3 ^see 0 +)
  29341. Retracting propose*predict-no
  29342. -->
  29343. (O2188 ^name predict-no +)
  29344. (S1 ^operator O2188 +)
  29345. Retracting propose*predict-yes
  29346. -->
  29347. (O2187 ^name predict-yes +)
  29348. (S1 ^operator O2187 +)
  29349. Retracting elaborate*reward*based*on*reward
  29350. -->
  29351. (R1097 ^value 1 +)
  29352. (R1 ^reward R1097 +)
  29353. Retracting elaborate*copy-dir-to-output-link
  29354. -->
  29355. (I3 ^dir R +)
  29356. Retracting rl*prefer*rvt*predict-no*H0*6
  29357. -->
  29358. (S1 ^operator O2188 = 0.2298680885464747)
  29359. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29360. -->
  29361. (S1 ^operator O2188 = 0.770123201053682)
  29362. Retracting rl*prefer*rvt*predict-yes*H0*5
  29363. -->
  29364. (S1 ^operator O2187 = 0.2940520155428289)
  29365. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29366. -->
  29367. (S1 ^operator O2187 = -0.1254042659579056)
  29368. =>WM: (15349: S1 ^operator O2190 +)
  29369. =>WM: (15348: S1 ^operator O2189 +)
  29370. =>WM: (15347: O2190 ^name predict-no)
  29371. =>WM: (15346: O2189 ^name predict-yes)
  29372. =>WM: (15345: R1098 ^value 1)
  29373. =>WM: (15344: R1 ^reward R1098)
  29374. <=WM: (15335: S1 ^operator O2187 +)
  29375. <=WM: (15336: S1 ^operator O2188 +)
  29376. <=WM: (15337: S1 ^operator O2188)
  29377. <=WM: (15330: R1 ^reward R1097)
  29378. <=WM: (15333: O2188 ^name predict-no)
  29379. <=WM: (15332: O2187 ^name predict-yes)
  29380. <=WM: (15331: R1097 ^value 1)
  29381. --- Inner Elaboration Phase, active level 1 (S1) ---
  29382. Firing prefer*rvt*predict-yes*H0
  29383. -->
  29384. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29385. -->
  29386. (S1 ^operator O2189 = -0.1254042659579056)
  29387. Firing rl*prefer*rvt*predict-yes*H0*5
  29388. -->
  29389. (S1 ^operator O2189 = 0.2940520155428289)
  29390. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29391. -->
  29392. Firing prefer*rvt*predict-no*H0
  29393. -->
  29394. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29395. -->
  29396. (S1 ^operator O2190 = 0.770123201053682)
  29397. Firing rl*prefer*rvt*predict-no*H0*6
  29398. -->
  29399. (S1 ^operator O2190 = 0.2298680885464747)
  29400. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29401. -->
  29402. inner elaboration loop at bottom goal.
  29403. Retracting rl*prefer*rvt*predict-no*H0*6
  29404. -->
  29405. (S1 ^operator O2188 = 0.2298680885464747)
  29406. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29407. -->
  29408. (S1 ^operator O2188 = 0.770123201053682)
  29409. Retracting rl*prefer*rvt*predict-yes*H0*5
  29410. -->
  29411. (S1 ^operator O2187 = 0.2940520155428289)
  29412. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29413. -->
  29414. (S1 ^operator O2187 = -0.1254042659579056)
  29415. --- END Proposal Phase ---
  29416. --- Decision Phase ---
  29417. RL update rl*prefer*rvt*predict-no*H0*6 0.611919 -0.382051 0.229868 -> 0.611919 -0.38205 0.229869(R,m,v=1,0.861538,0.119905)
  29418. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388074 0.382049 0.770123 -> 0.388075 0.382049 0.770124(R,m,v=1,1,0)
  29419. =>WM: (15350: S1 ^operator O2190)
  29420. 1095: O: O2190 (predict-no)
  29421. --- END Decision Phase ---
  29422. --- Application Phase ---
  29423. --- Firing Productions (PE) For State At Depth 1 ---
  29424. --- Inner Elaboration Phase, active level 1 (S1) ---
  29425. Firing apply*operator
  29426. -->
  29427. (I3 ^predict-no N1095 + :O )
  29428. Firing apply*operator*complete
  29429. -->
  29430. (I3 ^predict-no N1094 - :O )
  29431. inner elaboration loop at bottom goal.
  29432. --- Change Working Memory (PE) ---
  29433. =>WM: (15351: I3 ^predict-no N1095)
  29434. <=WM: (15339: N1094 ^status complete)
  29435. <=WM: (15338: I3 ^predict-no N1094)
  29436. --- Firing Productions (IE) For State At Depth 1 ---
  29437. --- Inner Elaboration Phase, active level 1 (S1) ---
  29438. Firing monitor*world
  29439. -->
  29440. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29441. --- Change Working Memory (IE) ---
  29442. --- END Application Phase ---
  29443. --- Output Phase ---
  29444. ENV: Agent did: predict-no for direction R in state State-B
  29445. In State-B moving R
  29446. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29447. predict error 0
  29448. dir: dir isL
  29449. --- END Output Phase ---
  29450. /|--- Input Phase ---
  29451. =>WM: (15355: I2 ^dir L)
  29452. =>WM: (15354: I2 ^reward 1)
  29453. =>WM: (15353: I2 ^see 0)
  29454. =>WM: (15352: N1095 ^status complete)
  29455. <=WM: (15342: I2 ^dir R)
  29456. <=WM: (15341: I2 ^reward 1)
  29457. <=WM: (15340: I2 ^see 0)
  29458. =>WM: (15356: I2 ^level-1 R0-root)
  29459. <=WM: (15343: I2 ^level-1 R0-root)
  29460. --- END Input Phase ---
  29461. --- Proposal Phase ---
  29462. --- Inner Elaboration Phase, active level 1 (S1) ---
  29463. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  29464. -->
  29465. (S1 ^operator O2189 = 0.6195770009714396)
  29466. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  29467. -->
  29468. (S1 ^operator O2190 = -0.2190661556260421)
  29469. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29470. -->
  29471. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29472. -->
  29473. Firing elaborate*copy-see-to-output-link
  29474. -->
  29475. (I3 ^see 0 +)
  29476. Firing elaborate*reward*based*on*reward
  29477. -->
  29478. (R1099 ^value 1 +)
  29479. (R1 ^reward R1099 +)
  29480. Firing propose*predict-yes
  29481. -->
  29482. (O2191 ^name predict-yes +)
  29483. (S1 ^operator O2191 +)
  29484. Firing propose*predict-no
  29485. -->
  29486. (O2192 ^name predict-no +)
  29487. (S1 ^operator O2192 +)
  29488. Firing rl*prefer*rvt*predict-no*H0*2
  29489. -->
  29490. (S1 ^operator O2190 = 0.3140548183361512)
  29491. Firing rl*prefer*rvt*predict-yes*H0*1
  29492. -->
  29493. (S1 ^operator O2189 = 0.3804132142488074)
  29494. Firing prefer*rvt*predict-yes*H0
  29495. -->
  29496. Firing prefer*rvt*predict-no*H0
  29497. -->
  29498. Firing elaborate*copy-dir-to-output-link
  29499. -->
  29500. (I3 ^dir L +)
  29501. inner elaboration loop at bottom goal.
  29502. Retracting elaborate*copy-see-to-output-link
  29503. -->
  29504. (I3 ^see 0 +)
  29505. Retracting propose*predict-no
  29506. -->
  29507. (O2190 ^name predict-no +)
  29508. (S1 ^operator O2190 +)
  29509. Retracting propose*predict-yes
  29510. -->
  29511. (O2189 ^name predict-yes +)
  29512. (S1 ^operator O2189 +)
  29513. Retracting elaborate*reward*based*on*reward
  29514. -->
  29515. (R1098 ^value 1 +)
  29516. (R1 ^reward R1098 +)
  29517. Retracting elaborate*copy-dir-to-output-link
  29518. -->
  29519. (I3 ^dir R +)
  29520. Retracting rl*prefer*rvt*predict-no*H0*6
  29521. -->
  29522. (S1 ^operator O2190 = 0.2298687828235715)
  29523. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  29524. -->
  29525. (S1 ^operator O2190 = 0.7701239882424035)
  29526. Retracting rl*prefer*rvt*predict-yes*H0*5
  29527. -->
  29528. (S1 ^operator O2189 = 0.2940520155428289)
  29529. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  29530. -->
  29531. (S1 ^operator O2189 = -0.1254042659579056)
  29532. =>WM: (15363: S1 ^operator O2192 +)
  29533. =>WM: (15362: S1 ^operator O2191 +)
  29534. =>WM: (15361: I3 ^dir L)
  29535. =>WM: (15360: O2192 ^name predict-no)
  29536. =>WM: (15359: O2191 ^name predict-yes)
  29537. =>WM: (15358: R1099 ^value 1)
  29538. =>WM: (15357: R1 ^reward R1099)
  29539. <=WM: (15348: S1 ^operator O2189 +)
  29540. <=WM: (15349: S1 ^operator O2190 +)
  29541. <=WM: (15350: S1 ^operator O2190)
  29542. <=WM: (15334: I3 ^dir R)
  29543. <=WM: (15344: R1 ^reward R1098)
  29544. <=WM: (15347: O2190 ^name predict-no)
  29545. <=WM: (15346: O2189 ^name predict-yes)
  29546. <=WM: (15345: R1098 ^value 1)
  29547. --- Inner Elaboration Phase, active level 1 (S1) ---
  29548. Firing prefer*rvt*predict-yes*H0
  29549. -->
  29550. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  29551. -->
  29552. (S1 ^operator O2191 = 0.6195770009714396)
  29553. Firing rl*prefer*rvt*predict-yes*H0*1
  29554. -->
  29555. (S1 ^operator O2191 = 0.3804132142488074)
  29556. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29557. -->
  29558. Firing prefer*rvt*predict-no*H0
  29559. -->
  29560. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  29561. -->
  29562. (S1 ^operator O2192 = -0.2190661556260421)
  29563. Firing rl*prefer*rvt*predict-no*H0*2
  29564. -->
  29565. (S1 ^operator O2192 = 0.3140548183361512)
  29566. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29567. -->
  29568. inner elaboration loop at bottom goal.
  29569. Retracting rl*prefer*rvt*predict-no*H0*2
  29570. -->
  29571. (S1 ^operator O2190 = 0.3140548183361512)
  29572. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  29573. -->
  29574. (S1 ^operator O2190 = -0.2190661556260421)
  29575. Retracting rl*prefer*rvt*predict-yes*H0*1
  29576. -->
  29577. (S1 ^operator O2189 = 0.3804132142488074)
  29578. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  29579. -->
  29580. (S1 ^operator O2189 = 0.6195770009714396)
  29581. --- END Proposal Phase ---
  29582. --- Decision Phase ---
  29583. RL update rl*prefer*rvt*predict-no*H0*6 0.611919 -0.38205 0.229869 -> 0.61192 -0.38205 0.229869(R,m,v=1,0.862245,0.119388)
  29584. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388075 0.382049 0.770124 -> 0.388075 0.382049 0.770125(R,m,v=1,1,0)
  29585. =>WM: (15364: S1 ^operator O2191)
  29586. 1096: O: O2191 (predict-yes)
  29587. --- END Decision Phase ---
  29588. --- Application Phase ---
  29589. --- Firing Productions (PE) For State At Depth 1 ---
  29590. --- Inner Elaboration Phase, active level 1 (S1) ---
  29591. Firing apply*operator
  29592. -->
  29593. (I3 ^predict-yes N1096 + :O )
  29594. Firing apply*operator*complete
  29595. -->
  29596. (I3 ^predict-no N1095 - :O )
  29597. inner elaboration loop at bottom goal.
  29598. --- Change Working Memory (PE) ---
  29599. =>WM: (15365: I3 ^predict-yes N1096)
  29600. <=WM: (15352: N1095 ^status complete)
  29601. <=WM: (15351: I3 ^predict-no N1095)
  29602. --- Firing Productions (IE) For State At Depth 1 ---
  29603. --- Inner Elaboration Phase, active level 1 (S1) ---
  29604. Firing monitor*world
  29605. -->
  29606. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29607. --- Change Working Memory (IE) ---
  29608. --- END Application Phase ---
  29609. --- Output Phase ---
  29610. ENV: Agent did: predict-yes for direction L in state State-B
  29611. In State-B moving L
  29612. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29613. predict error 0
  29614. dir: dir isR
  29615. --- END Output Phase ---
  29616. \-/--- Input Phase ---
  29617. =>WM: (15369: I2 ^dir R)
  29618. =>WM: (15368: I2 ^reward 1)
  29619. =>WM: (15367: I2 ^see 1)
  29620. =>WM: (15366: N1096 ^status complete)
  29621. <=WM: (15355: I2 ^dir L)
  29622. <=WM: (15354: I2 ^reward 1)
  29623. <=WM: (15353: I2 ^see 0)
  29624. =>WM: (15370: I2 ^level-1 L1-root)
  29625. <=WM: (15356: I2 ^level-1 R0-root)
  29626. --- END Input Phase ---
  29627. --- Proposal Phase ---
  29628. --- Inner Elaboration Phase, active level 1 (S1) ---
  29629. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  29630. -->
  29631. (S1 ^operator O2191 = 0.7061957252803326)
  29632. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  29633. -->
  29634. (S1 ^operator O2192 = -0.1937987592593187)
  29635. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29636. -->
  29637. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29638. -->
  29639. Firing elaborate*copy-see-to-output-link
  29640. -->
  29641. (I3 ^see 1 +)
  29642. Firing elaborate*reward*based*on*reward
  29643. -->
  29644. (R1100 ^value 1 +)
  29645. (R1 ^reward R1100 +)
  29646. Firing propose*predict-yes
  29647. -->
  29648. (O2193 ^name predict-yes +)
  29649. (S1 ^operator O2193 +)
  29650. Firing propose*predict-no
  29651. -->
  29652. (O2194 ^name predict-no +)
  29653. (S1 ^operator O2194 +)
  29654. Firing rl*prefer*rvt*predict-no*H0*6
  29655. -->
  29656. (S1 ^operator O2192 = 0.2298693585484839)
  29657. Firing rl*prefer*rvt*predict-yes*H0*5
  29658. -->
  29659. (S1 ^operator O2191 = 0.2940520155428289)
  29660. Firing prefer*rvt*predict-yes*H0
  29661. -->
  29662. Firing prefer*rvt*predict-no*H0
  29663. -->
  29664. Firing elaborate*copy-dir-to-output-link
  29665. -->
  29666. (I3 ^dir R +)
  29667. inner elaboration loop at bottom goal.
  29668. Retracting elaborate*copy-see-to-output-link
  29669. -->
  29670. (I3 ^see 0 +)
  29671. Retracting propose*predict-no
  29672. -->
  29673. (O2192 ^name predict-no +)
  29674. (S1 ^operator O2192 +)
  29675. Retracting propose*predict-yes
  29676. -->
  29677. (O2191 ^name predict-yes +)
  29678. (S1 ^operator O2191 +)
  29679. Retracting elaborate*reward*based*on*reward
  29680. -->
  29681. (R1099 ^value 1 +)
  29682. (R1 ^reward R1099 +)
  29683. Retracting elaborate*copy-dir-to-output-link
  29684. -->
  29685. (I3 ^dir L +)
  29686. Retracting rl*prefer*rvt*predict-no*H0*2
  29687. -->
  29688. (S1 ^operator O2192 = 0.3140548183361512)
  29689. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  29690. -->
  29691. (S1 ^operator O2192 = -0.2190661556260421)
  29692. Retracting rl*prefer*rvt*predict-yes*H0*1
  29693. -->
  29694. (S1 ^operator O2191 = 0.3804132142488074)
  29695. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  29696. -->
  29697. (S1 ^operator O2191 = 0.6195770009714396)
  29698. =>WM: (15378: S1 ^operator O2194 +)
  29699. =>WM: (15377: S1 ^operator O2193 +)
  29700. =>WM: (15376: I3 ^dir R)
  29701. =>WM: (15375: O2194 ^name predict-no)
  29702. =>WM: (15374: O2193 ^name predict-yes)
  29703. =>WM: (15373: R1100 ^value 1)
  29704. =>WM: (15372: R1 ^reward R1100)
  29705. =>WM: (15371: I3 ^see 1)
  29706. <=WM: (15362: S1 ^operator O2191 +)
  29707. <=WM: (15364: S1 ^operator O2191)
  29708. <=WM: (15363: S1 ^operator O2192 +)
  29709. <=WM: (15361: I3 ^dir L)
  29710. <=WM: (15357: R1 ^reward R1099)
  29711. <=WM: (15165: I3 ^see 0)
  29712. <=WM: (15360: O2192 ^name predict-no)
  29713. <=WM: (15359: O2191 ^name predict-yes)
  29714. <=WM: (15358: R1099 ^value 1)
  29715. --- Inner Elaboration Phase, active level 1 (S1) ---
  29716. Firing prefer*rvt*predict-yes*H0
  29717. -->
  29718. Firing rl*prefer*rvt*predict-yes*H0*5
  29719. -->
  29720. (S1 ^operator O2193 = 0.2940520155428289)
  29721. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29722. -->
  29723. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  29724. -->
  29725. (S1 ^operator O2193 = 0.7061957252803326)
  29726. Firing prefer*rvt*predict-no*H0
  29727. -->
  29728. Firing rl*prefer*rvt*predict-no*H0*6
  29729. -->
  29730. (S1 ^operator O2194 = 0.2298693585484839)
  29731. Firing prefer*rvt*predict-no*H0*6*v1*H1
  29732. -->
  29733. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  29734. -->
  29735. (S1 ^operator O2194 = -0.1937987592593187)
  29736. inner elaboration loop at bottom goal.
  29737. Retracting rl*prefer*rvt*predict-no*H0*6
  29738. -->
  29739. (S1 ^operator O2192 = 0.2298693585484839)
  29740. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  29741. -->
  29742. (S1 ^operator O2192 = -0.1937987592593187)
  29743. Retracting rl*prefer*rvt*predict-yes*H0*5
  29744. -->
  29745. (S1 ^operator O2191 = 0.2940520155428289)
  29746. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  29747. -->
  29748. (S1 ^operator O2191 = 0.7061957252803326)
  29749. --- END Proposal Phase ---
  29750. --- Decision Phase ---
  29751. RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.843575,0.132697)
  29752. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478646 0.140931 0.619577 -> 0.478647 0.140931 0.619578(R,m,v=1,1,0)
  29753. =>WM: (15379: S1 ^operator O2193)
  29754. 1097: O: O2193 (predict-yes)
  29755. --- END Decision Phase ---
  29756. --- Application Phase ---
  29757. --- Firing Productions (PE) For State At Depth 1 ---
  29758. --- Inner Elaboration Phase, active level 1 (S1) ---
  29759. Firing apply*operator
  29760. -->
  29761. (I3 ^predict-yes N1097 + :O )
  29762. Firing apply*operator*complete
  29763. -->
  29764. (I3 ^predict-yes N1096 - :O )
  29765. inner elaboration loop at bottom goal.
  29766. --- Change Working Memory (PE) ---
  29767. =>WM: (15380: I3 ^predict-yes N1097)
  29768. <=WM: (15366: N1096 ^status complete)
  29769. <=WM: (15365: I3 ^predict-yes N1096)
  29770. --- Firing Productions (IE) For State At Depth 1 ---
  29771. --- Inner Elaboration Phase, active level 1 (S1) ---
  29772. Firing monitor*world
  29773. -->
  29774. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29775. --- Change Working Memory (IE) ---
  29776. --- END Application Phase ---
  29777. --- Output Phase ---
  29778. ENV: Agent did: predict-yes for direction R in state State-A
  29779. In State-A moving R
  29780. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  29781. predict error 0
  29782. dir: dir isL
  29783. --- END Output Phase ---
  29784. |\--- Input Phase ---
  29785. =>WM: (15384: I2 ^dir L)
  29786. =>WM: (15383: I2 ^reward 1)
  29787. =>WM: (15382: I2 ^see 1)
  29788. =>WM: (15381: N1097 ^status complete)
  29789. <=WM: (15369: I2 ^dir R)
  29790. <=WM: (15368: I2 ^reward 1)
  29791. <=WM: (15367: I2 ^see 1)
  29792. =>WM: (15385: I2 ^level-1 R1-root)
  29793. <=WM: (15370: I2 ^level-1 L1-root)
  29794. --- END Input Phase ---
  29795. --- Proposal Phase ---
  29796. --- Inner Elaboration Phase, active level 1 (S1) ---
  29797. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  29798. -->
  29799. (S1 ^operator O2193 = 0.6195978385087889)
  29800. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  29801. -->
  29802. (S1 ^operator O2194 = -0.1479504104026684)
  29803. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29804. -->
  29805. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29806. -->
  29807. Firing elaborate*copy-see-to-output-link
  29808. -->
  29809. (I3 ^see 1 +)
  29810. Firing elaborate*reward*based*on*reward
  29811. -->
  29812. (R1101 ^value 1 +)
  29813. (R1 ^reward R1101 +)
  29814. Firing propose*predict-yes
  29815. -->
  29816. (O2195 ^name predict-yes +)
  29817. (S1 ^operator O2195 +)
  29818. Firing propose*predict-no
  29819. -->
  29820. (O2196 ^name predict-no +)
  29821. (S1 ^operator O2196 +)
  29822. Firing rl*prefer*rvt*predict-no*H0*2
  29823. -->
  29824. (S1 ^operator O2194 = 0.3140548183361512)
  29825. Firing rl*prefer*rvt*predict-yes*H0*1
  29826. -->
  29827. (S1 ^operator O2193 = 0.3804140049526733)
  29828. Firing prefer*rvt*predict-yes*H0
  29829. -->
  29830. Firing prefer*rvt*predict-no*H0
  29831. -->
  29832. Firing elaborate*copy-dir-to-output-link
  29833. -->
  29834. (I3 ^dir L +)
  29835. inner elaboration loop at bottom goal.
  29836. Retracting elaborate*copy-see-to-output-link
  29837. -->
  29838. (I3 ^see 1 +)
  29839. Retracting propose*predict-no
  29840. -->
  29841. (O2194 ^name predict-no +)
  29842. (S1 ^operator O2194 +)
  29843. Retracting propose*predict-yes
  29844. -->
  29845. (O2193 ^name predict-yes +)
  29846. (S1 ^operator O2193 +)
  29847. Retracting elaborate*reward*based*on*reward
  29848. -->
  29849. (R1100 ^value 1 +)
  29850. (R1 ^reward R1100 +)
  29851. Retracting elaborate*copy-dir-to-output-link
  29852. -->
  29853. (I3 ^dir R +)
  29854. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  29855. -->
  29856. (S1 ^operator O2194 = -0.1937987592593187)
  29857. Retracting rl*prefer*rvt*predict-no*H0*6
  29858. -->
  29859. (S1 ^operator O2194 = 0.2298693585484839)
  29860. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  29861. -->
  29862. (S1 ^operator O2193 = 0.7061957252803326)
  29863. Retracting rl*prefer*rvt*predict-yes*H0*5
  29864. -->
  29865. (S1 ^operator O2193 = 0.2940520155428289)
  29866. =>WM: (15392: S1 ^operator O2196 +)
  29867. =>WM: (15391: S1 ^operator O2195 +)
  29868. =>WM: (15390: I3 ^dir L)
  29869. =>WM: (15389: O2196 ^name predict-no)
  29870. =>WM: (15388: O2195 ^name predict-yes)
  29871. =>WM: (15387: R1101 ^value 1)
  29872. =>WM: (15386: R1 ^reward R1101)
  29873. <=WM: (15377: S1 ^operator O2193 +)
  29874. <=WM: (15379: S1 ^operator O2193)
  29875. <=WM: (15378: S1 ^operator O2194 +)
  29876. <=WM: (15376: I3 ^dir R)
  29877. <=WM: (15372: R1 ^reward R1100)
  29878. <=WM: (15375: O2194 ^name predict-no)
  29879. <=WM: (15374: O2193 ^name predict-yes)
  29880. <=WM: (15373: R1100 ^value 1)
  29881. --- Inner Elaboration Phase, active level 1 (S1) ---
  29882. Firing prefer*rvt*predict-yes*H0
  29883. -->
  29884. Firing rl*prefer*rvt*predict-yes*H0*1
  29885. -->
  29886. (S1 ^operator O2195 = 0.3804140049526733)
  29887. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29888. -->
  29889. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  29890. -->
  29891. (S1 ^operator O2195 = 0.6195978385087889)
  29892. Firing prefer*rvt*predict-no*H0
  29893. -->
  29894. Firing rl*prefer*rvt*predict-no*H0*2
  29895. -->
  29896. (S1 ^operator O2196 = 0.3140548183361512)
  29897. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29898. -->
  29899. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  29900. -->
  29901. (S1 ^operator O2196 = -0.1479504104026684)
  29902. inner elaboration loop at bottom goal.
  29903. Retracting rl*prefer*rvt*predict-no*H0*2
  29904. -->
  29905. (S1 ^operator O2194 = 0.3140548183361512)
  29906. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  29907. -->
  29908. (S1 ^operator O2194 = -0.1479504104026684)
  29909. Retracting rl*prefer*rvt*predict-yes*H0*1
  29910. -->
  29911. (S1 ^operator O2193 = 0.3804140049526733)
  29912. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  29913. -->
  29914. (S1 ^operator O2193 = 0.6195978385087889)
  29915. --- END Proposal Phase ---
  29916. --- Decision Phase ---
  29917. RL update rl*prefer*rvt*predict-yes*H0*5 0.501121 -0.207069 0.294052 -> 0.501103 -0.207071 0.294032(R,m,v=1,0.858824,0.121963)
  29918. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499103 0.207093 0.706196 -> 0.499081 0.207091 0.706172(R,m,v=1,1,0)
  29919. =>WM: (15393: S1 ^operator O2195)
  29920. 1098: O: O2195 (predict-yes)
  29921. --- END Decision Phase ---
  29922. --- Application Phase ---
  29923. --- Firing Productions (PE) For State At Depth 1 ---
  29924. --- Inner Elaboration Phase, active level 1 (S1) ---
  29925. Firing apply*operator
  29926. -->
  29927. (I3 ^predict-yes N1098 + :O )
  29928. Firing apply*operator*complete
  29929. -->
  29930. (I3 ^predict-yes N1097 - :O )
  29931. inner elaboration loop at bottom goal.
  29932. --- Change Working Memory (PE) ---
  29933. =>WM: (15394: I3 ^predict-yes N1098)
  29934. <=WM: (15381: N1097 ^status complete)
  29935. <=WM: (15380: I3 ^predict-yes N1097)
  29936. --- Firing Productions (IE) For State At Depth 1 ---
  29937. --- Inner Elaboration Phase, active level 1 (S1) ---
  29938. Firing monitor*world
  29939. -->
  29940. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29941. --- Change Working Memory (IE) ---
  29942. --- END Application Phase ---
  29943. --- Output Phase ---
  29944. ENV: Agent did: predict-yes for direction L in state State-B
  29945. In State-B moving L
  29946. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29947. predict error 0
  29948. dir: dir isL
  29949. --- END Output Phase ---
  29950. -/|--- Input Phase ---
  29951. =>WM: (15398: I2 ^dir L)
  29952. =>WM: (15397: I2 ^reward 1)
  29953. =>WM: (15396: I2 ^see 1)
  29954. =>WM: (15395: N1098 ^status complete)
  29955. <=WM: (15384: I2 ^dir L)
  29956. <=WM: (15383: I2 ^reward 1)
  29957. <=WM: (15382: I2 ^see 1)
  29958. =>WM: (15399: I2 ^level-1 L1-root)
  29959. <=WM: (15385: I2 ^level-1 R1-root)
  29960. --- END Input Phase ---
  29961. --- Proposal Phase ---
  29962. --- Inner Elaboration Phase, active level 1 (S1) ---
  29963. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  29964. -->
  29965. (S1 ^operator O2195 = -0.3470159027404986)
  29966. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  29967. -->
  29968. (S1 ^operator O2196 = 0.6860368928081693)
  29969. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29970. -->
  29971. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29972. -->
  29973. Firing elaborate*copy-see-to-output-link
  29974. -->
  29975. (I3 ^see 1 +)
  29976. Firing elaborate*reward*based*on*reward
  29977. -->
  29978. (R1102 ^value 1 +)
  29979. (R1 ^reward R1102 +)
  29980. Firing propose*predict-yes
  29981. -->
  29982. (O2197 ^name predict-yes +)
  29983. (S1 ^operator O2197 +)
  29984. Firing propose*predict-no
  29985. -->
  29986. (O2198 ^name predict-no +)
  29987. (S1 ^operator O2198 +)
  29988. Firing rl*prefer*rvt*predict-no*H0*2
  29989. -->
  29990. (S1 ^operator O2196 = 0.3140548183361512)
  29991. Firing rl*prefer*rvt*predict-yes*H0*1
  29992. -->
  29993. (S1 ^operator O2195 = 0.3804140049526733)
  29994. Firing prefer*rvt*predict-yes*H0
  29995. -->
  29996. Firing prefer*rvt*predict-no*H0
  29997. -->
  29998. Firing elaborate*copy-dir-to-output-link
  29999. -->
  30000. (I3 ^dir L +)
  30001. inner elaboration loop at bottom goal.
  30002. Retracting elaborate*copy-see-to-output-link
  30003. -->
  30004. (I3 ^see 1 +)
  30005. Retracting propose*predict-no
  30006. -->
  30007. (O2196 ^name predict-no +)
  30008. (S1 ^operator O2196 +)
  30009. Retracting propose*predict-yes
  30010. -->
  30011. (O2195 ^name predict-yes +)
  30012. (S1 ^operator O2195 +)
  30013. Retracting elaborate*reward*based*on*reward
  30014. -->
  30015. (R1101 ^value 1 +)
  30016. (R1 ^reward R1101 +)
  30017. Retracting elaborate*copy-dir-to-output-link
  30018. -->
  30019. (I3 ^dir L +)
  30020. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*28
  30021. -->
  30022. (S1 ^operator O2196 = -0.1479504104026684)
  30023. Retracting rl*prefer*rvt*predict-no*H0*2
  30024. -->
  30025. (S1 ^operator O2196 = 0.3140548183361512)
  30026. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*29
  30027. -->
  30028. (S1 ^operator O2195 = 0.6195978385087889)
  30029. Retracting rl*prefer*rvt*predict-yes*H0*1
  30030. -->
  30031. (S1 ^operator O2195 = 0.3804140049526733)
  30032. =>WM: (15405: S1 ^operator O2198 +)
  30033. =>WM: (15404: S1 ^operator O2197 +)
  30034. =>WM: (15403: O2198 ^name predict-no)
  30035. =>WM: (15402: O2197 ^name predict-yes)
  30036. =>WM: (15401: R1102 ^value 1)
  30037. =>WM: (15400: R1 ^reward R1102)
  30038. <=WM: (15391: S1 ^operator O2195 +)
  30039. <=WM: (15393: S1 ^operator O2195)
  30040. <=WM: (15392: S1 ^operator O2196 +)
  30041. <=WM: (15386: R1 ^reward R1101)
  30042. <=WM: (15389: O2196 ^name predict-no)
  30043. <=WM: (15388: O2195 ^name predict-yes)
  30044. <=WM: (15387: R1101 ^value 1)
  30045. --- Inner Elaboration Phase, active level 1 (S1) ---
  30046. Firing prefer*rvt*predict-yes*H0
  30047. -->
  30048. Firing rl*prefer*rvt*predict-yes*H0*1
  30049. -->
  30050. (S1 ^operator O2197 = 0.3804140049526733)
  30051. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30052. -->
  30053. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30054. -->
  30055. (S1 ^operator O2197 = -0.3470159027404986)
  30056. Firing prefer*rvt*predict-no*H0
  30057. -->
  30058. Firing rl*prefer*rvt*predict-no*H0*2
  30059. -->
  30060. (S1 ^operator O2198 = 0.3140548183361512)
  30061. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30062. -->
  30063. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30064. -->
  30065. (S1 ^operator O2198 = 0.6860368928081693)
  30066. inner elaboration loop at bottom goal.
  30067. Retracting rl*prefer*rvt*predict-no*H0*2
  30068. -->
  30069. (S1 ^operator O2196 = 0.3140548183361512)
  30070. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30071. -->
  30072. (S1 ^operator O2196 = 0.6860368928081693)
  30073. Retracting rl*prefer*rvt*predict-yes*H0*1
  30074. -->
  30075. (S1 ^operator O2195 = 0.3804140049526733)
  30076. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30077. -->
  30078. (S1 ^operator O2195 = -0.3470159027404986)
  30079. --- END Proposal Phase ---
  30080. --- Decision Phase ---
  30081. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521343 -0.14093 0.380413(R,m,v=1,0.844444,0.132092)
  30082. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*29 0.478669 0.140929 0.619598 -> 0.478668 0.140929 0.619597(R,m,v=1,1,0)
  30083. =>WM: (15406: S1 ^operator O2198)
  30084. 1099: O: O2198 (predict-no)
  30085. --- END Decision Phase ---
  30086. --- Application Phase ---
  30087. --- Firing Productions (PE) For State At Depth 1 ---
  30088. --- Inner Elaboration Phase, active level 1 (S1) ---
  30089. Firing apply*operator
  30090. -->
  30091. (I3 ^predict-no N1099 + :O )
  30092. Firing apply*operator*complete
  30093. -->
  30094. (I3 ^predict-yes N1098 - :O )
  30095. inner elaboration loop at bottom goal.
  30096. --- Change Working Memory (PE) ---
  30097. =>WM: (15407: I3 ^predict-no N1099)
  30098. <=WM: (15395: N1098 ^status complete)
  30099. <=WM: (15394: I3 ^predict-yes N1098)
  30100. --- Firing Productions (IE) For State At Depth 1 ---
  30101. --- Inner Elaboration Phase, active level 1 (S1) ---
  30102. Firing monitor*world
  30103. -->
  30104. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30105. --- Change Working Memory (IE) ---
  30106. --- END Application Phase ---
  30107. --- Output Phase ---
  30108. ENV: Agent did: predict-no for direction L in state State-A
  30109. In State-A moving L
  30110. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30111. predict error 0
  30112. dir: dir isR
  30113. --- END Output Phase ---
  30114. \-/--- Input Phase ---
  30115. =>WM: (15411: I2 ^dir R)
  30116. =>WM: (15410: I2 ^reward 1)
  30117. =>WM: (15409: I2 ^see 0)
  30118. =>WM: (15408: N1099 ^status complete)
  30119. <=WM: (15398: I2 ^dir L)
  30120. <=WM: (15397: I2 ^reward 1)
  30121. <=WM: (15396: I2 ^see 1)
  30122. =>WM: (15412: I2 ^level-1 L0-root)
  30123. <=WM: (15399: I2 ^level-1 L1-root)
  30124. --- END Input Phase ---
  30125. --- Proposal Phase ---
  30126. --- Inner Elaboration Phase, active level 1 (S1) ---
  30127. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30128. -->
  30129. (S1 ^operator O2197 = 0.7058208607781853)
  30130. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30131. -->
  30132. (S1 ^operator O2198 = -0.2023211881870005)
  30133. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30134. -->
  30135. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30136. -->
  30137. Firing elaborate*copy-see-to-output-link
  30138. -->
  30139. (I3 ^see 0 +)
  30140. Firing elaborate*reward*based*on*reward
  30141. -->
  30142. (R1103 ^value 1 +)
  30143. (R1 ^reward R1103 +)
  30144. Firing propose*predict-yes
  30145. -->
  30146. (O2199 ^name predict-yes +)
  30147. (S1 ^operator O2199 +)
  30148. Firing propose*predict-no
  30149. -->
  30150. (O2200 ^name predict-no +)
  30151. (S1 ^operator O2200 +)
  30152. Firing rl*prefer*rvt*predict-no*H0*6
  30153. -->
  30154. (S1 ^operator O2198 = 0.2298693585484839)
  30155. Firing rl*prefer*rvt*predict-yes*H0*5
  30156. -->
  30157. (S1 ^operator O2197 = 0.2940318273940734)
  30158. Firing prefer*rvt*predict-yes*H0
  30159. -->
  30160. Firing prefer*rvt*predict-no*H0
  30161. -->
  30162. Firing elaborate*copy-dir-to-output-link
  30163. -->
  30164. (I3 ^dir R +)
  30165. inner elaboration loop at bottom goal.
  30166. Retracting elaborate*copy-see-to-output-link
  30167. -->
  30168. (I3 ^see 1 +)
  30169. Retracting propose*predict-no
  30170. -->
  30171. (O2198 ^name predict-no +)
  30172. (S1 ^operator O2198 +)
  30173. Retracting propose*predict-yes
  30174. -->
  30175. (O2197 ^name predict-yes +)
  30176. (S1 ^operator O2197 +)
  30177. Retracting elaborate*reward*based*on*reward
  30178. -->
  30179. (R1102 ^value 1 +)
  30180. (R1 ^reward R1102 +)
  30181. Retracting elaborate*copy-dir-to-output-link
  30182. -->
  30183. (I3 ^dir L +)
  30184. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30185. -->
  30186. (S1 ^operator O2198 = 0.6860368928081693)
  30187. Retracting rl*prefer*rvt*predict-no*H0*2
  30188. -->
  30189. (S1 ^operator O2198 = 0.3140548183361512)
  30190. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30191. -->
  30192. (S1 ^operator O2197 = -0.3470159027404986)
  30193. Retracting rl*prefer*rvt*predict-yes*H0*1
  30194. -->
  30195. (S1 ^operator O2197 = 0.3804130487485735)
  30196. =>WM: (15420: S1 ^operator O2200 +)
  30197. =>WM: (15419: S1 ^operator O2199 +)
  30198. =>WM: (15418: I3 ^dir R)
  30199. =>WM: (15417: O2200 ^name predict-no)
  30200. =>WM: (15416: O2199 ^name predict-yes)
  30201. =>WM: (15415: R1103 ^value 1)
  30202. =>WM: (15414: R1 ^reward R1103)
  30203. =>WM: (15413: I3 ^see 0)
  30204. <=WM: (15404: S1 ^operator O2197 +)
  30205. <=WM: (15405: S1 ^operator O2198 +)
  30206. <=WM: (15406: S1 ^operator O2198)
  30207. <=WM: (15390: I3 ^dir L)
  30208. <=WM: (15400: R1 ^reward R1102)
  30209. <=WM: (15371: I3 ^see 1)
  30210. <=WM: (15403: O2198 ^name predict-no)
  30211. <=WM: (15402: O2197 ^name predict-yes)
  30212. <=WM: (15401: R1102 ^value 1)
  30213. --- Inner Elaboration Phase, active level 1 (S1) ---
  30214. Firing prefer*rvt*predict-yes*H0
  30215. -->
  30216. Firing rl*prefer*rvt*predict-yes*H0*5
  30217. -->
  30218. (S1 ^operator O2199 = 0.2940318273940734)
  30219. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30220. -->
  30221. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30222. -->
  30223. (S1 ^operator O2199 = 0.7058208607781853)
  30224. Firing prefer*rvt*predict-no*H0
  30225. -->
  30226. Firing rl*prefer*rvt*predict-no*H0*6
  30227. -->
  30228. (S1 ^operator O2200 = 0.2298693585484839)
  30229. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30230. -->
  30231. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30232. -->
  30233. (S1 ^operator O2200 = -0.2023211881870005)
  30234. inner elaboration loop at bottom goal.
  30235. Retracting rl*prefer*rvt*predict-no*H0*6
  30236. -->
  30237. (S1 ^operator O2198 = 0.2298693585484839)
  30238. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30239. -->
  30240. (S1 ^operator O2198 = -0.2023211881870005)
  30241. Retracting rl*prefer*rvt*predict-yes*H0*5
  30242. -->
  30243. (S1 ^operator O2197 = 0.2940318273940734)
  30244. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30245. -->
  30246. (S1 ^operator O2197 = 0.7058208607781853)
  30247. --- END Proposal Phase ---
  30248. --- Decision Phase ---
  30249. RL update rl*prefer*rvt*predict-no*H0*2 0.485058 -0.171003 0.314055 -> 0.485052 -0.171004 0.314047(R,m,v=1,0.880682,0.105682)
  30250. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515015 0.171022 0.686037 -> 0.515008 0.17102 0.686028(R,m,v=1,1,0)
  30251. =>WM: (15421: S1 ^operator O2199)
  30252. 1100: O: O2199 (predict-yes)
  30253. --- END Decision Phase ---
  30254. --- Application Phase ---
  30255. --- Firing Productions (PE) For State At Depth 1 ---
  30256. --- Inner Elaboration Phase, active level 1 (S1) ---
  30257. Firing apply*operator
  30258. -->
  30259. (I3 ^predict-yes N1100 + :O )
  30260. Firing apply*operator*complete
  30261. -->
  30262. (I3 ^predict-no N1099 - :O )
  30263. inner elaboration loop at bottom goal.
  30264. --- Change Working Memory (PE) ---
  30265. =>WM: (15422: I3 ^predict-yes N1100)
  30266. <=WM: (15408: N1099 ^status complete)
  30267. <=WM: (15407: I3 ^predict-no N1099)
  30268. --- Firing Productions (IE) For State At Depth 1 ---
  30269. --- Inner Elaboration Phase, active level 1 (S1) ---
  30270. Firing monitor*world
  30271. -->
  30272. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30273. --- Change Working Memory (IE) ---
  30274. --- END Application Phase ---
  30275. --- Output Phase ---
  30276. ENV: Agent did: predict-yes for direction R in state State-A
  30277. In State-A moving R
  30278. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30279. predict error 0
  30280. dir: dir isR
  30281. --- END Output Phase ---
  30282. |\---- Input Phase ---
  30283. =>WM: (15426: I2 ^dir R)
  30284. =>WM: (15425: I2 ^reward 1)
  30285. =>WM: (15424: I2 ^see 1)
  30286. =>WM: (15423: N1100 ^status complete)
  30287. <=WM: (15411: I2 ^dir R)
  30288. <=WM: (15410: I2 ^reward 1)
  30289. <=WM: (15409: I2 ^see 0)
  30290. =>WM: (15427: I2 ^level-1 R1-root)
  30291. <=WM: (15412: I2 ^level-1 L0-root)
  30292. --- END Input Phase ---
  30293. --- Proposal Phase ---
  30294. --- Inner Elaboration Phase, active level 1 (S1) ---
  30295. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30296. -->
  30297. (S1 ^operator O2199 = -0.252585164213872)
  30298. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  30299. -->
  30300. (S1 ^operator O2200 = 0.7701577329613335)
  30301. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30302. -->
  30303. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30304. -->
  30305. Firing elaborate*copy-see-to-output-link
  30306. -->
  30307. (I3 ^see 1 +)
  30308. Firing elaborate*reward*based*on*reward
  30309. -->
  30310. (R1104 ^value 1 +)
  30311. (R1 ^reward R1104 +)
  30312. Firing propose*predict-yes
  30313. -->
  30314. (O2201 ^name predict-yes +)
  30315. (S1 ^operator O2201 +)
  30316. Firing propose*predict-no
  30317. -->
  30318. (O2202 ^name predict-no +)
  30319. (S1 ^operator O2202 +)
  30320. Firing rl*prefer*rvt*predict-no*H0*6
  30321. -->
  30322. (S1 ^operator O2200 = 0.2298693585484839)
  30323. Firing rl*prefer*rvt*predict-yes*H0*5
  30324. -->
  30325. (S1 ^operator O2199 = 0.2940318273940734)
  30326. Firing prefer*rvt*predict-yes*H0
  30327. -->
  30328. Firing prefer*rvt*predict-no*H0
  30329. -->
  30330. Firing elaborate*copy-dir-to-output-link
  30331. -->
  30332. (I3 ^dir R +)
  30333. inner elaboration loop at bottom goal.
  30334. Retracting elaborate*copy-see-to-output-link
  30335. -->
  30336. (I3 ^see 0 +)
  30337. Retracting propose*predict-no
  30338. -->
  30339. (O2200 ^name predict-no +)
  30340. (S1 ^operator O2200 +)
  30341. Retracting propose*predict-yes
  30342. -->
  30343. (O2199 ^name predict-yes +)
  30344. (S1 ^operator O2199 +)
  30345. Retracting elaborate*reward*based*on*reward
  30346. -->
  30347. (R1103 ^value 1 +)
  30348. (R1 ^reward R1103 +)
  30349. Retracting elaborate*copy-dir-to-output-link
  30350. -->
  30351. (I3 ^dir R +)
  30352. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30353. -->
  30354. (S1 ^operator O2200 = -0.2023211881870005)
  30355. Retracting rl*prefer*rvt*predict-no*H0*6
  30356. -->
  30357. (S1 ^operator O2200 = 0.2298693585484839)
  30358. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30359. -->
  30360. (S1 ^operator O2199 = 0.7058208607781853)
  30361. Retracting rl*prefer*rvt*predict-yes*H0*5
  30362. -->
  30363. (S1 ^operator O2199 = 0.2940318273940734)
  30364. =>WM: (15434: S1 ^operator O2202 +)
  30365. =>WM: (15433: S1 ^operator O2201 +)
  30366. =>WM: (15432: O2202 ^name predict-no)
  30367. =>WM: (15431: O2201 ^name predict-yes)
  30368. =>WM: (15430: R1104 ^value 1)
  30369. =>WM: (15429: R1 ^reward R1104)
  30370. =>WM: (15428: I3 ^see 1)
  30371. <=WM: (15419: S1 ^operator O2199 +)
  30372. <=WM: (15421: S1 ^operator O2199)
  30373. <=WM: (15420: S1 ^operator O2200 +)
  30374. <=WM: (15414: R1 ^reward R1103)
  30375. <=WM: (15413: I3 ^see 0)
  30376. <=WM: (15417: O2200 ^name predict-no)
  30377. <=WM: (15416: O2199 ^name predict-yes)
  30378. <=WM: (15415: R1103 ^value 1)
  30379. --- Inner Elaboration Phase, active level 1 (S1) ---
  30380. Firing prefer*rvt*predict-yes*H0
  30381. -->
  30382. Firing rl*prefer*rvt*predict-yes*H0*5
  30383. -->
  30384. (S1 ^operator O2201 = 0.2940318273940734)
  30385. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30386. -->
  30387. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30388. -->
  30389. (S1 ^operator O2201 = -0.252585164213872)
  30390. Firing prefer*rvt*predict-no*H0
  30391. -->
  30392. Firing rl*prefer*rvt*predict-no*H0*6
  30393. -->
  30394. (S1 ^operator O2202 = 0.2298693585484839)
  30395. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30396. -->
  30397. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  30398. -->
  30399. (S1 ^operator O2202 = 0.7701577329613335)
  30400. inner elaboration loop at bottom goal.
  30401. Retracting rl*prefer*rvt*predict-no*H0*6
  30402. -->
  30403. (S1 ^operator O2200 = 0.2298693585484839)
  30404. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  30405. -->
  30406. (S1 ^operator O2200 = 0.7701577329613335)
  30407. Retracting rl*prefer*rvt*predict-yes*H0*5
  30408. -->
  30409. (S1 ^operator O2199 = 0.2940318273940734)
  30410. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30411. -->
  30412. (S1 ^operator O2199 = -0.252585164213872)
  30413. --- END Proposal Phase ---
  30414. --- Decision Phase ---
  30415. RL update rl*prefer*rvt*predict-yes*H0*5 0.501103 -0.207071 0.294032 -> 0.501114 -0.20707 0.294044(R,m,v=1,0.859649,0.121362)
  30416. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498764 0.207057 0.705821 -> 0.498777 0.207058 0.705835(R,m,v=1,1,0)
  30417. =>WM: (15435: S1 ^operator O2202)
  30418. 1101: O: O2202 (predict-no)
  30419. --- END Decision Phase ---
  30420. --- Application Phase ---
  30421. --- Firing Productions (PE) For State At Depth 1 ---
  30422. --- Inner Elaboration Phase, active level 1 (S1) ---
  30423. Firing apply*operator
  30424. -->
  30425. (I3 ^predict-no N1101 + :O )
  30426. Firing apply*operator*complete
  30427. -->
  30428. (I3 ^predict-yes N1100 - :O )
  30429. inner elaboration loop at bottom goal.
  30430. --- Change Working Memory (PE) ---
  30431. =>WM: (15436: I3 ^predict-no N1101)
  30432. <=WM: (15423: N1100 ^status complete)
  30433. <=WM: (15422: I3 ^predict-yes N1100)
  30434. --- Firing Productions (IE) For State At Depth 1 ---
  30435. --- Inner Elaboration Phase, active level 1 (S1) ---
  30436. Firing monitor*world
  30437. -->
  30438. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30439. --- Change Working Memory (IE) ---
  30440. --- END Application Phase ---
  30441. --- Output Phase ---
  30442. ENV: Agent did: predict-no for direction R in state State-B
  30443. In State-B moving R
  30444. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30445. predict error 0
  30446. dir: dir isL
  30447. --- END Output Phase ---
  30448. /--- Input Phase ---
  30449. =>WM: (15440: I2 ^dir L)
  30450. =>WM: (15439: I2 ^reward 1)
  30451. =>WM: (15438: I2 ^see 0)
  30452. =>WM: (15437: N1101 ^status complete)
  30453. <=WM: (15426: I2 ^dir R)
  30454. <=WM: (15425: I2 ^reward 1)
  30455. <=WM: (15424: I2 ^see 1)
  30456. =>WM: (15441: I2 ^level-1 R0-root)
  30457. <=WM: (15427: I2 ^level-1 R1-root)
  30458. --- END Input Phase ---
  30459. --- Proposal Phase ---
  30460. --- Inner Elaboration Phase, active level 1 (S1) ---
  30461. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  30462. -->
  30463. (S1 ^operator O2201 = 0.6195779233564012)
  30464. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  30465. -->
  30466. (S1 ^operator O2202 = -0.2190661556260421)
  30467. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30468. -->
  30469. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30470. -->
  30471. Firing elaborate*copy-see-to-output-link
  30472. -->
  30473. (I3 ^see 0 +)
  30474. Firing elaborate*reward*based*on*reward
  30475. -->
  30476. (R1105 ^value 1 +)
  30477. (R1 ^reward R1105 +)
  30478. Firing propose*predict-yes
  30479. -->
  30480. (O2203 ^name predict-yes +)
  30481. (S1 ^operator O2203 +)
  30482. Firing propose*predict-no
  30483. -->
  30484. (O2204 ^name predict-no +)
  30485. (S1 ^operator O2204 +)
  30486. Firing rl*prefer*rvt*predict-no*H0*2
  30487. -->
  30488. (S1 ^operator O2202 = 0.3140473868976779)
  30489. Firing rl*prefer*rvt*predict-yes*H0*1
  30490. -->
  30491. (S1 ^operator O2201 = 0.3804130487485735)
  30492. Firing prefer*rvt*predict-yes*H0
  30493. -->
  30494. Firing prefer*rvt*predict-no*H0
  30495. -->
  30496. Firing elaborate*copy-dir-to-output-link
  30497. -->
  30498. (I3 ^dir L +)
  30499. inner elaboration loop at bottom goal.
  30500. Retracting elaborate*copy-see-to-output-link
  30501. -->
  30502. (I3 ^see 1 +)
  30503. Retracting propose*predict-no
  30504. -->
  30505. (O2202 ^name predict-no +)
  30506. (S1 ^operator O2202 +)
  30507. Retracting propose*predict-yes
  30508. -->
  30509. (O2201 ^name predict-yes +)
  30510. (S1 ^operator O2201 +)
  30511. Retracting elaborate*reward*based*on*reward
  30512. -->
  30513. (R1104 ^value 1 +)
  30514. (R1 ^reward R1104 +)
  30515. Retracting elaborate*copy-dir-to-output-link
  30516. -->
  30517. (I3 ^dir R +)
  30518. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  30519. -->
  30520. (S1 ^operator O2202 = 0.7701577329613335)
  30521. Retracting rl*prefer*rvt*predict-no*H0*6
  30522. -->
  30523. (S1 ^operator O2202 = 0.2298693585484839)
  30524. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30525. -->
  30526. (S1 ^operator O2201 = -0.252585164213872)
  30527. Retracting rl*prefer*rvt*predict-yes*H0*5
  30528. -->
  30529. (S1 ^operator O2201 = 0.2940438202219438)
  30530. =>WM: (15449: S1 ^operator O2204 +)
  30531. =>WM: (15448: S1 ^operator O2203 +)
  30532. =>WM: (15447: I3 ^dir L)
  30533. =>WM: (15446: O2204 ^name predict-no)
  30534. =>WM: (15445: O2203 ^name predict-yes)
  30535. =>WM: (15444: R1105 ^value 1)
  30536. =>WM: (15443: R1 ^reward R1105)
  30537. =>WM: (15442: I3 ^see 0)
  30538. <=WM: (15433: S1 ^operator O2201 +)
  30539. <=WM: (15434: S1 ^operator O2202 +)
  30540. <=WM: (15435: S1 ^operator O2202)
  30541. <=WM: (15418: I3 ^dir R)
  30542. <=WM: (15429: R1 ^reward R1104)
  30543. <=WM: (15428: I3 ^see 1)
  30544. <=WM: (15432: O2202 ^name predict-no)
  30545. <=WM: (15431: O2201 ^name predict-yes)
  30546. <=WM: (15430: R1104 ^value 1)
  30547. --- Inner Elaboration Phase, active level 1 (S1) ---
  30548. Firing prefer*rvt*predict-yes*H0
  30549. -->
  30550. Firing rl*prefer*rvt*predict-yes*H0*1
  30551. -->
  30552. (S1 ^operator O2203 = 0.3804130487485735)
  30553. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30554. -->
  30555. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  30556. -->
  30557. (S1 ^operator O2203 = 0.6195779233564012)
  30558. Firing prefer*rvt*predict-no*H0
  30559. -->
  30560. Firing rl*prefer*rvt*predict-no*H0*2
  30561. -->
  30562. (S1 ^operator O2204 = 0.3140473868976779)
  30563. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30564. -->
  30565. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  30566. -->
  30567. (S1 ^operator O2204 = -0.2190661556260421)
  30568. inner elaboration loop at bottom goal.
  30569. Retracting rl*prefer*rvt*predict-no*H0*2
  30570. -->
  30571. (S1 ^operator O2202 = 0.3140473868976779)
  30572. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  30573. -->
  30574. (S1 ^operator O2202 = -0.2190661556260421)
  30575. Retracting rl*prefer*rvt*predict-yes*H0*1
  30576. -->
  30577. (S1 ^operator O2201 = 0.3804130487485735)
  30578. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  30579. -->
  30580. (S1 ^operator O2201 = 0.6195779233564012)
  30581. --- END Proposal Phase ---
  30582. --- Decision Phase ---
  30583. RL update rl*prefer*rvt*predict-no*H0*6 0.61192 -0.38205 0.229869 -> 0.611918 -0.382051 0.229867(R,m,v=1,0.862944,0.118875)
  30584. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388102 0.382055 0.770158 -> 0.3881 0.382055 0.770155(R,m,v=1,1,0)
  30585. =>WM: (15450: S1 ^operator O2203)
  30586. 1102: O: O2203 (predict-yes)
  30587. --- END Decision Phase ---
  30588. --- Application Phase ---
  30589. --- Firing Productions (PE) For State At Depth 1 ---
  30590. --- Inner Elaboration Phase, active level 1 (S1) ---
  30591. Firing apply*operator
  30592. -->
  30593. (I3 ^predict-yes N1102 + :O )
  30594. Firing apply*operator*complete
  30595. -->
  30596. (I3 ^predict-no N1101 - :O )
  30597. inner elaboration loop at bottom goal.
  30598. --- Change Working Memory (PE) ---
  30599. =>WM: (15451: I3 ^predict-yes N1102)
  30600. <=WM: (15437: N1101 ^status complete)
  30601. <=WM: (15436: I3 ^predict-no N1101)
  30602. --- Firing Productions (IE) For State At Depth 1 ---
  30603. --- Inner Elaboration Phase, active level 1 (S1) ---
  30604. Firing monitor*world
  30605. -->
  30606. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30607. --- Change Working Memory (IE) ---
  30608. --- END Application Phase ---
  30609. --- Output Phase ---
  30610. ENV: Agent did: predict-yes for direction L in state State-B
  30611. In State-B moving L
  30612. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  30613. predict error 0
  30614. dir: dir isL
  30615. --- END Output Phase ---
  30616. |\---- Input Phase ---
  30617. =>WM: (15455: I2 ^dir L)
  30618. =>WM: (15454: I2 ^reward 1)
  30619. =>WM: (15453: I2 ^see 1)
  30620. =>WM: (15452: N1102 ^status complete)
  30621. <=WM: (15440: I2 ^dir L)
  30622. <=WM: (15439: I2 ^reward 1)
  30623. <=WM: (15438: I2 ^see 0)
  30624. =>WM: (15456: I2 ^level-1 L1-root)
  30625. <=WM: (15441: I2 ^level-1 R0-root)
  30626. --- END Input Phase ---
  30627. --- Proposal Phase ---
  30628. --- Inner Elaboration Phase, active level 1 (S1) ---
  30629. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30630. -->
  30631. (S1 ^operator O2203 = -0.3470159027404986)
  30632. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30633. -->
  30634. (S1 ^operator O2204 = 0.686028179458083)
  30635. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30636. -->
  30637. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30638. -->
  30639. Firing elaborate*copy-see-to-output-link
  30640. -->
  30641. (I3 ^see 1 +)
  30642. Firing elaborate*reward*based*on*reward
  30643. -->
  30644. (R1106 ^value 1 +)
  30645. (R1 ^reward R1106 +)
  30646. Firing propose*predict-yes
  30647. -->
  30648. (O2205 ^name predict-yes +)
  30649. (S1 ^operator O2205 +)
  30650. Firing propose*predict-no
  30651. -->
  30652. (O2206 ^name predict-no +)
  30653. (S1 ^operator O2206 +)
  30654. Firing rl*prefer*rvt*predict-no*H0*2
  30655. -->
  30656. (S1 ^operator O2204 = 0.3140473868976779)
  30657. Firing rl*prefer*rvt*predict-yes*H0*1
  30658. -->
  30659. (S1 ^operator O2203 = 0.3804130487485735)
  30660. Firing prefer*rvt*predict-yes*H0
  30661. -->
  30662. Firing prefer*rvt*predict-no*H0
  30663. -->
  30664. Firing elaborate*copy-dir-to-output-link
  30665. -->
  30666. (I3 ^dir L +)
  30667. inner elaboration loop at bottom goal.
  30668. Retracting elaborate*copy-see-to-output-link
  30669. -->
  30670. (I3 ^see 0 +)
  30671. Retracting propose*predict-no
  30672. -->
  30673. (O2204 ^name predict-no +)
  30674. (S1 ^operator O2204 +)
  30675. Retracting propose*predict-yes
  30676. -->
  30677. (O2203 ^name predict-yes +)
  30678. (S1 ^operator O2203 +)
  30679. Retracting elaborate*reward*based*on*reward
  30680. -->
  30681. (R1105 ^value 1 +)
  30682. (R1 ^reward R1105 +)
  30683. Retracting elaborate*copy-dir-to-output-link
  30684. -->
  30685. (I3 ^dir L +)
  30686. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  30687. -->
  30688. (S1 ^operator O2204 = -0.2190661556260421)
  30689. Retracting rl*prefer*rvt*predict-no*H0*2
  30690. -->
  30691. (S1 ^operator O2204 = 0.3140473868976779)
  30692. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  30693. -->
  30694. (S1 ^operator O2203 = 0.6195779233564012)
  30695. Retracting rl*prefer*rvt*predict-yes*H0*1
  30696. -->
  30697. (S1 ^operator O2203 = 0.3804130487485735)
  30698. =>WM: (15463: S1 ^operator O2206 +)
  30699. =>WM: (15462: S1 ^operator O2205 +)
  30700. =>WM: (15461: O2206 ^name predict-no)
  30701. =>WM: (15460: O2205 ^name predict-yes)
  30702. =>WM: (15459: R1106 ^value 1)
  30703. =>WM: (15458: R1 ^reward R1106)
  30704. =>WM: (15457: I3 ^see 1)
  30705. <=WM: (15448: S1 ^operator O2203 +)
  30706. <=WM: (15450: S1 ^operator O2203)
  30707. <=WM: (15449: S1 ^operator O2204 +)
  30708. <=WM: (15443: R1 ^reward R1105)
  30709. <=WM: (15442: I3 ^see 0)
  30710. <=WM: (15446: O2204 ^name predict-no)
  30711. <=WM: (15445: O2203 ^name predict-yes)
  30712. <=WM: (15444: R1105 ^value 1)
  30713. --- Inner Elaboration Phase, active level 1 (S1) ---
  30714. Firing prefer*rvt*predict-yes*H0
  30715. -->
  30716. Firing rl*prefer*rvt*predict-yes*H0*1
  30717. -->
  30718. (S1 ^operator O2205 = 0.3804130487485735)
  30719. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30720. -->
  30721. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30722. -->
  30723. (S1 ^operator O2205 = -0.3470159027404986)
  30724. Firing prefer*rvt*predict-no*H0
  30725. -->
  30726. Firing rl*prefer*rvt*predict-no*H0*2
  30727. -->
  30728. (S1 ^operator O2206 = 0.3140473868976779)
  30729. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30730. -->
  30731. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30732. -->
  30733. (S1 ^operator O2206 = 0.686028179458083)
  30734. inner elaboration loop at bottom goal.
  30735. Retracting rl*prefer*rvt*predict-no*H0*2
  30736. -->
  30737. (S1 ^operator O2204 = 0.3140473868976779)
  30738. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30739. -->
  30740. (S1 ^operator O2204 = 0.686028179458083)
  30741. Retracting rl*prefer*rvt*predict-yes*H0*1
  30742. -->
  30743. (S1 ^operator O2203 = 0.3804130487485735)
  30744. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30745. -->
  30746. (S1 ^operator O2203 = -0.3470159027404986)
  30747. --- END Proposal Phase ---
  30748. --- Decision Phase ---
  30749. RL update rl*prefer*rvt*predict-yes*H0*1 0.521343 -0.14093 0.380413 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.845304,0.131492)
  30750. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478647 0.140931 0.619578 -> 0.478648 0.140931 0.619579(R,m,v=1,1,0)
  30751. =>WM: (15464: S1 ^operator O2206)
  30752. 1103: O: O2206 (predict-no)
  30753. --- END Decision Phase ---
  30754. --- Application Phase ---
  30755. --- Firing Productions (PE) For State At Depth 1 ---
  30756. --- Inner Elaboration Phase, active level 1 (S1) ---
  30757. Firing apply*operator
  30758. -->
  30759. (I3 ^predict-no N1103 + :O )
  30760. Firing apply*operator*complete
  30761. -->
  30762. (I3 ^predict-yes N1102 - :O )
  30763. inner elaboration loop at bottom goal.
  30764. --- Change Working Memory (PE) ---
  30765. =>WM: (15465: I3 ^predict-no N1103)
  30766. <=WM: (15452: N1102 ^status complete)
  30767. <=WM: (15451: I3 ^predict-yes N1102)
  30768. --- Firing Productions (IE) For State At Depth 1 ---
  30769. --- Inner Elaboration Phase, active level 1 (S1) ---
  30770. Firing monitor*world
  30771. -->
  30772. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30773. --- Change Working Memory (IE) ---
  30774. --- END Application Phase ---
  30775. --- Output Phase ---
  30776. ENV: Agent did: predict-no for direction L in state State-A
  30777. In State-A moving L
  30778. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30779. predict error 0
  30780. dir: dir isR
  30781. --- END Output Phase ---
  30782. /|\--- Input Phase ---
  30783. =>WM: (15469: I2 ^dir R)
  30784. =>WM: (15468: I2 ^reward 1)
  30785. =>WM: (15467: I2 ^see 0)
  30786. =>WM: (15466: N1103 ^status complete)
  30787. <=WM: (15455: I2 ^dir L)
  30788. <=WM: (15454: I2 ^reward 1)
  30789. <=WM: (15453: I2 ^see 1)
  30790. =>WM: (15470: I2 ^level-1 L0-root)
  30791. <=WM: (15456: I2 ^level-1 L1-root)
  30792. --- END Input Phase ---
  30793. --- Proposal Phase ---
  30794. --- Inner Elaboration Phase, active level 1 (S1) ---
  30795. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30796. -->
  30797. (S1 ^operator O2205 = 0.7058349330775942)
  30798. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30799. -->
  30800. (S1 ^operator O2206 = -0.2023211881870005)
  30801. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30802. -->
  30803. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30804. -->
  30805. Firing elaborate*copy-see-to-output-link
  30806. -->
  30807. (I3 ^see 0 +)
  30808. Firing elaborate*reward*based*on*reward
  30809. -->
  30810. (R1107 ^value 1 +)
  30811. (R1 ^reward R1107 +)
  30812. Firing propose*predict-yes
  30813. -->
  30814. (O2207 ^name predict-yes +)
  30815. (S1 ^operator O2207 +)
  30816. Firing propose*predict-no
  30817. -->
  30818. (O2208 ^name predict-no +)
  30819. (S1 ^operator O2208 +)
  30820. Firing rl*prefer*rvt*predict-no*H0*6
  30821. -->
  30822. (S1 ^operator O2206 = 0.2298672026809531)
  30823. Firing rl*prefer*rvt*predict-yes*H0*5
  30824. -->
  30825. (S1 ^operator O2205 = 0.2940438202219438)
  30826. Firing prefer*rvt*predict-yes*H0
  30827. -->
  30828. Firing prefer*rvt*predict-no*H0
  30829. -->
  30830. Firing elaborate*copy-dir-to-output-link
  30831. -->
  30832. (I3 ^dir R +)
  30833. inner elaboration loop at bottom goal.
  30834. Retracting elaborate*copy-see-to-output-link
  30835. -->
  30836. (I3 ^see 1 +)
  30837. Retracting propose*predict-no
  30838. -->
  30839. (O2206 ^name predict-no +)
  30840. (S1 ^operator O2206 +)
  30841. Retracting propose*predict-yes
  30842. -->
  30843. (O2205 ^name predict-yes +)
  30844. (S1 ^operator O2205 +)
  30845. Retracting elaborate*reward*based*on*reward
  30846. -->
  30847. (R1106 ^value 1 +)
  30848. (R1 ^reward R1106 +)
  30849. Retracting elaborate*copy-dir-to-output-link
  30850. -->
  30851. (I3 ^dir L +)
  30852. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*36
  30853. -->
  30854. (S1 ^operator O2206 = 0.686028179458083)
  30855. Retracting rl*prefer*rvt*predict-no*H0*2
  30856. -->
  30857. (S1 ^operator O2206 = 0.3140473868976779)
  30858. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*37
  30859. -->
  30860. (S1 ^operator O2205 = -0.3470159027404986)
  30861. Retracting rl*prefer*rvt*predict-yes*H0*1
  30862. -->
  30863. (S1 ^operator O2205 = 0.3804137769811579)
  30864. =>WM: (15478: S1 ^operator O2208 +)
  30865. =>WM: (15477: S1 ^operator O2207 +)
  30866. =>WM: (15476: I3 ^dir R)
  30867. =>WM: (15475: O2208 ^name predict-no)
  30868. =>WM: (15474: O2207 ^name predict-yes)
  30869. =>WM: (15473: R1107 ^value 1)
  30870. =>WM: (15472: R1 ^reward R1107)
  30871. =>WM: (15471: I3 ^see 0)
  30872. <=WM: (15462: S1 ^operator O2205 +)
  30873. <=WM: (15463: S1 ^operator O2206 +)
  30874. <=WM: (15464: S1 ^operator O2206)
  30875. <=WM: (15447: I3 ^dir L)
  30876. <=WM: (15458: R1 ^reward R1106)
  30877. <=WM: (15457: I3 ^see 1)
  30878. <=WM: (15461: O2206 ^name predict-no)
  30879. <=WM: (15460: O2205 ^name predict-yes)
  30880. <=WM: (15459: R1106 ^value 1)
  30881. --- Inner Elaboration Phase, active level 1 (S1) ---
  30882. Firing prefer*rvt*predict-yes*H0
  30883. -->
  30884. Firing rl*prefer*rvt*predict-yes*H0*5
  30885. -->
  30886. (S1 ^operator O2207 = 0.2940438202219438)
  30887. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30888. -->
  30889. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30890. -->
  30891. (S1 ^operator O2207 = 0.7058349330775942)
  30892. Firing prefer*rvt*predict-no*H0
  30893. -->
  30894. Firing rl*prefer*rvt*predict-no*H0*6
  30895. -->
  30896. (S1 ^operator O2208 = 0.2298672026809531)
  30897. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30898. -->
  30899. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30900. -->
  30901. (S1 ^operator O2208 = -0.2023211881870005)
  30902. inner elaboration loop at bottom goal.
  30903. Retracting rl*prefer*rvt*predict-no*H0*6
  30904. -->
  30905. (S1 ^operator O2206 = 0.2298672026809531)
  30906. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  30907. -->
  30908. (S1 ^operator O2206 = -0.2023211881870005)
  30909. Retracting rl*prefer*rvt*predict-yes*H0*5
  30910. -->
  30911. (S1 ^operator O2205 = 0.2940438202219438)
  30912. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  30913. -->
  30914. (S1 ^operator O2205 = 0.7058349330775942)
  30915. --- END Proposal Phase ---
  30916. --- Decision Phase ---
  30917. RL update rl*prefer*rvt*predict-no*H0*2 0.485052 -0.171004 0.314047 -> 0.485047 -0.171006 0.314041(R,m,v=1,0.881356,0.105162)
  30918. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*36 0.515008 0.17102 0.686028 -> 0.515002 0.171019 0.686021(R,m,v=1,1,0)
  30919. =>WM: (15479: S1 ^operator O2207)
  30920. 1104: O: O2207 (predict-yes)
  30921. --- END Decision Phase ---
  30922. --- Application Phase ---
  30923. --- Firing Productions (PE) For State At Depth 1 ---
  30924. --- Inner Elaboration Phase, active level 1 (S1) ---
  30925. Firing apply*operator
  30926. -->
  30927. (I3 ^predict-yes N1104 + :O )
  30928. Firing apply*operator*complete
  30929. -->
  30930. (I3 ^predict-no N1103 - :O )
  30931. inner elaboration loop at bottom goal.
  30932. --- Change Working Memory (PE) ---
  30933. =>WM: (15480: I3 ^predict-yes N1104)
  30934. <=WM: (15466: N1103 ^status complete)
  30935. <=WM: (15465: I3 ^predict-no N1103)
  30936. --- Firing Productions (IE) For State At Depth 1 ---
  30937. --- Inner Elaboration Phase, active level 1 (S1) ---
  30938. Firing monitor*world
  30939. -->
  30940. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30941. --- Change Working Memory (IE) ---
  30942. --- END Application Phase ---
  30943. --- Output Phase ---
  30944. ENV: Agent did: predict-yes for direction R in state State-A
  30945. In State-A moving R
  30946. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30947. predict error 0
  30948. dir: dir isR
  30949. --- END Output Phase ---
  30950. -/|--- Input Phase ---
  30951. =>WM: (15484: I2 ^dir R)
  30952. =>WM: (15483: I2 ^reward 1)
  30953. =>WM: (15482: I2 ^see 1)
  30954. =>WM: (15481: N1104 ^status complete)
  30955. <=WM: (15469: I2 ^dir R)
  30956. <=WM: (15468: I2 ^reward 1)
  30957. <=WM: (15467: I2 ^see 0)
  30958. =>WM: (15485: I2 ^level-1 R1-root)
  30959. <=WM: (15470: I2 ^level-1 L0-root)
  30960. --- END Input Phase ---
  30961. --- Proposal Phase ---
  30962. --- Inner Elaboration Phase, active level 1 (S1) ---
  30963. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30964. -->
  30965. (S1 ^operator O2207 = -0.252585164213872)
  30966. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  30967. -->
  30968. (S1 ^operator O2208 = 0.7701551449828702)
  30969. Firing prefer*rvt*predict-no*H0*6*v1*H1
  30970. -->
  30971. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30972. -->
  30973. Firing elaborate*copy-see-to-output-link
  30974. -->
  30975. (I3 ^see 1 +)
  30976. Firing elaborate*reward*based*on*reward
  30977. -->
  30978. (R1108 ^value 1 +)
  30979. (R1 ^reward R1108 +)
  30980. Firing propose*predict-yes
  30981. -->
  30982. (O2209 ^name predict-yes +)
  30983. (S1 ^operator O2209 +)
  30984. Firing propose*predict-no
  30985. -->
  30986. (O2210 ^name predict-no +)
  30987. (S1 ^operator O2210 +)
  30988. Firing rl*prefer*rvt*predict-no*H0*6
  30989. -->
  30990. (S1 ^operator O2208 = 0.2298672026809531)
  30991. Firing rl*prefer*rvt*predict-yes*H0*5
  30992. -->
  30993. (S1 ^operator O2207 = 0.2940438202219438)
  30994. Firing prefer*rvt*predict-yes*H0
  30995. -->
  30996. Firing prefer*rvt*predict-no*H0
  30997. -->
  30998. Firing elaborate*copy-dir-to-output-link
  30999. -->
  31000. (I3 ^dir R +)
  31001. inner elaboration loop at bottom goal.
  31002. Retracting elaborate*copy-see-to-output-link
  31003. -->
  31004. (I3 ^see 0 +)
  31005. Retracting propose*predict-no
  31006. -->
  31007. (O2208 ^name predict-no +)
  31008. (S1 ^operator O2208 +)
  31009. Retracting propose*predict-yes
  31010. -->
  31011. (O2207 ^name predict-yes +)
  31012. (S1 ^operator O2207 +)
  31013. Retracting elaborate*reward*based*on*reward
  31014. -->
  31015. (R1107 ^value 1 +)
  31016. (R1 ^reward R1107 +)
  31017. Retracting elaborate*copy-dir-to-output-link
  31018. -->
  31019. (I3 ^dir R +)
  31020. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*40
  31021. -->
  31022. (S1 ^operator O2208 = -0.2023211881870005)
  31023. Retracting rl*prefer*rvt*predict-no*H0*6
  31024. -->
  31025. (S1 ^operator O2208 = 0.2298672026809531)
  31026. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  31027. -->
  31028. (S1 ^operator O2207 = 0.7058349330775942)
  31029. Retracting rl*prefer*rvt*predict-yes*H0*5
  31030. -->
  31031. (S1 ^operator O2207 = 0.2940438202219438)
  31032. =>WM: (15492: S1 ^operator O2210 +)
  31033. =>WM: (15491: S1 ^operator O2209 +)
  31034. =>WM: (15490: O2210 ^name predict-no)
  31035. =>WM: (15489: O2209 ^name predict-yes)
  31036. =>WM: (15488: R1108 ^value 1)
  31037. =>WM: (15487: R1 ^reward R1108)
  31038. =>WM: (15486: I3 ^see 1)
  31039. <=WM: (15477: S1 ^operator O2207 +)
  31040. <=WM: (15479: S1 ^operator O2207)
  31041. <=WM: (15478: S1 ^operator O2208 +)
  31042. <=WM: (15472: R1 ^reward R1107)
  31043. <=WM: (15471: I3 ^see 0)
  31044. <=WM: (15475: O2208 ^name predict-no)
  31045. <=WM: (15474: O2207 ^name predict-yes)
  31046. <=WM: (15473: R1107 ^value 1)
  31047. --- Inner Elaboration Phase, active level 1 (S1) ---
  31048. Firing prefer*rvt*predict-yes*H0
  31049. -->
  31050. Firing rl*prefer*rvt*predict-yes*H0*5
  31051. -->
  31052. (S1 ^operator O2209 = 0.2940438202219438)
  31053. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31054. -->
  31055. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31056. -->
  31057. (S1 ^operator O2209 = -0.252585164213872)
  31058. Firing prefer*rvt*predict-no*H0
  31059. -->
  31060. Firing rl*prefer*rvt*predict-no*H0*6
  31061. -->
  31062. (S1 ^operator O2210 = 0.2298672026809531)
  31063. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31064. -->
  31065. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31066. -->
  31067. (S1 ^operator O2210 = 0.7701551449828702)
  31068. inner elaboration loop at bottom goal.
  31069. Retracting rl*prefer*rvt*predict-no*H0*6
  31070. -->
  31071. (S1 ^operator O2208 = 0.2298672026809531)
  31072. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31073. -->
  31074. (S1 ^operator O2208 = 0.7701551449828702)
  31075. Retracting rl*prefer*rvt*predict-yes*H0*5
  31076. -->
  31077. (S1 ^operator O2207 = 0.2940438202219438)
  31078. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31079. -->
  31080. (S1 ^operator O2207 = -0.252585164213872)
  31081. --- END Proposal Phase ---
  31082. --- Decision Phase ---
  31083. RL update rl*prefer*rvt*predict-yes*H0*5 0.501114 -0.20707 0.294044 -> 0.501123 -0.207069 0.294054(R,m,v=1,0.860465,0.120767)
  31084. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.498777 0.207058 0.705835 -> 0.498787 0.207059 0.705846(R,m,v=1,1,0)
  31085. =>WM: (15493: S1 ^operator O2210)
  31086. 1105: O: O2210 (predict-no)
  31087. --- END Decision Phase ---
  31088. --- Application Phase ---
  31089. --- Firing Productions (PE) For State At Depth 1 ---
  31090. --- Inner Elaboration Phase, active level 1 (S1) ---
  31091. Firing apply*operator
  31092. -->
  31093. (I3 ^predict-no N1105 + :O )
  31094. Firing apply*operator*complete
  31095. -->
  31096. (I3 ^predict-yes N1104 - :O )
  31097. inner elaboration loop at bottom goal.
  31098. --- Change Working Memory (PE) ---
  31099. =>WM: (15494: I3 ^predict-no N1105)
  31100. <=WM: (15481: N1104 ^status complete)
  31101. <=WM: (15480: I3 ^predict-yes N1104)
  31102. --- Firing Productions (IE) For State At Depth 1 ---
  31103. --- Inner Elaboration Phase, active level 1 (S1) ---
  31104. Firing monitor*world
  31105. -->
  31106. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31107. --- Change Working Memory (IE) ---
  31108. --- END Application Phase ---
  31109. --- Output Phase ---
  31110. ENV: Agent did: predict-no for direction R in state State-B
  31111. In State-B moving R
  31112. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31113. predict error 0
  31114. dir: dir isR
  31115. --- END Output Phase ---
  31116. \-/--- Input Phase ---
  31117. =>WM: (15498: I2 ^dir R)
  31118. =>WM: (15497: I2 ^reward 1)
  31119. =>WM: (15496: I2 ^see 0)
  31120. =>WM: (15495: N1105 ^status complete)
  31121. <=WM: (15484: I2 ^dir R)
  31122. <=WM: (15483: I2 ^reward 1)
  31123. <=WM: (15482: I2 ^see 1)
  31124. =>WM: (15499: I2 ^level-1 R0-root)
  31125. <=WM: (15485: I2 ^level-1 R1-root)
  31126. --- END Input Phase ---
  31127. --- Proposal Phase ---
  31128. --- Inner Elaboration Phase, active level 1 (S1) ---
  31129. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31130. -->
  31131. (S1 ^operator O2209 = -0.1254042659579056)
  31132. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31133. -->
  31134. (S1 ^operator O2210 = 0.7701246402854851)
  31135. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31136. -->
  31137. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31138. -->
  31139. Firing elaborate*copy-see-to-output-link
  31140. -->
  31141. (I3 ^see 0 +)
  31142. Firing elaborate*reward*based*on*reward
  31143. -->
  31144. (R1109 ^value 1 +)
  31145. (R1 ^reward R1109 +)
  31146. Firing propose*predict-yes
  31147. -->
  31148. (O2211 ^name predict-yes +)
  31149. (S1 ^operator O2211 +)
  31150. Firing propose*predict-no
  31151. -->
  31152. (O2212 ^name predict-no +)
  31153. (S1 ^operator O2212 +)
  31154. Firing rl*prefer*rvt*predict-no*H0*6
  31155. -->
  31156. (S1 ^operator O2210 = 0.2298672026809531)
  31157. Firing rl*prefer*rvt*predict-yes*H0*5
  31158. -->
  31159. (S1 ^operator O2209 = 0.2940536816948511)
  31160. Firing prefer*rvt*predict-yes*H0
  31161. -->
  31162. Firing prefer*rvt*predict-no*H0
  31163. -->
  31164. Firing elaborate*copy-dir-to-output-link
  31165. -->
  31166. (I3 ^dir R +)
  31167. inner elaboration loop at bottom goal.
  31168. Retracting elaborate*copy-see-to-output-link
  31169. -->
  31170. (I3 ^see 1 +)
  31171. Retracting propose*predict-no
  31172. -->
  31173. (O2210 ^name predict-no +)
  31174. (S1 ^operator O2210 +)
  31175. Retracting propose*predict-yes
  31176. -->
  31177. (O2209 ^name predict-yes +)
  31178. (S1 ^operator O2209 +)
  31179. Retracting elaborate*reward*based*on*reward
  31180. -->
  31181. (R1108 ^value 1 +)
  31182. (R1 ^reward R1108 +)
  31183. Retracting elaborate*copy-dir-to-output-link
  31184. -->
  31185. (I3 ^dir R +)
  31186. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31187. -->
  31188. (S1 ^operator O2210 = 0.7701551449828702)
  31189. Retracting rl*prefer*rvt*predict-no*H0*6
  31190. -->
  31191. (S1 ^operator O2210 = 0.2298672026809531)
  31192. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31193. -->
  31194. (S1 ^operator O2209 = -0.252585164213872)
  31195. Retracting rl*prefer*rvt*predict-yes*H0*5
  31196. -->
  31197. (S1 ^operator O2209 = 0.2940536816948511)
  31198. =>WM: (15506: S1 ^operator O2212 +)
  31199. =>WM: (15505: S1 ^operator O2211 +)
  31200. =>WM: (15504: O2212 ^name predict-no)
  31201. =>WM: (15503: O2211 ^name predict-yes)
  31202. =>WM: (15502: R1109 ^value 1)
  31203. =>WM: (15501: R1 ^reward R1109)
  31204. =>WM: (15500: I3 ^see 0)
  31205. <=WM: (15491: S1 ^operator O2209 +)
  31206. <=WM: (15492: S1 ^operator O2210 +)
  31207. <=WM: (15493: S1 ^operator O2210)
  31208. <=WM: (15487: R1 ^reward R1108)
  31209. <=WM: (15486: I3 ^see 1)
  31210. <=WM: (15490: O2210 ^name predict-no)
  31211. <=WM: (15489: O2209 ^name predict-yes)
  31212. <=WM: (15488: R1108 ^value 1)
  31213. --- Inner Elaboration Phase, active level 1 (S1) ---
  31214. Firing prefer*rvt*predict-yes*H0
  31215. -->
  31216. Firing rl*prefer*rvt*predict-yes*H0*5
  31217. -->
  31218. (S1 ^operator O2211 = 0.2940536816948511)
  31219. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31220. -->
  31221. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31222. -->
  31223. (S1 ^operator O2211 = -0.1254042659579056)
  31224. Firing prefer*rvt*predict-no*H0
  31225. -->
  31226. Firing rl*prefer*rvt*predict-no*H0*6
  31227. -->
  31228. (S1 ^operator O2212 = 0.2298672026809531)
  31229. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31230. -->
  31231. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31232. -->
  31233. (S1 ^operator O2212 = 0.7701246402854851)
  31234. inner elaboration loop at bottom goal.
  31235. Retracting rl*prefer*rvt*predict-no*H0*6
  31236. -->
  31237. (S1 ^operator O2210 = 0.2298672026809531)
  31238. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31239. -->
  31240. (S1 ^operator O2210 = 0.7701246402854851)
  31241. Retracting rl*prefer*rvt*predict-yes*H0*5
  31242. -->
  31243. (S1 ^operator O2209 = 0.2940536816948511)
  31244. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31245. -->
  31246. (S1 ^operator O2209 = -0.1254042659579056)
  31247. --- END Proposal Phase ---
  31248. --- Decision Phase ---
  31249. RL update rl*prefer*rvt*predict-no*H0*6 0.611918 -0.382051 0.229867 -> 0.611917 -0.382051 0.229865(R,m,v=1,0.863636,0.118366)
  31250. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.3881 0.382055 0.770155 -> 0.388098 0.382055 0.770153(R,m,v=1,1,0)
  31251. =>WM: (15507: S1 ^operator O2212)
  31252. 1106: O: O2212 (predict-no)
  31253. --- END Decision Phase ---
  31254. --- Application Phase ---
  31255. --- Firing Productions (PE) For State At Depth 1 ---
  31256. --- Inner Elaboration Phase, active level 1 (S1) ---
  31257. Firing apply*operator
  31258. -->
  31259. (I3 ^predict-no N1106 + :O )
  31260. Firing apply*operator*complete
  31261. -->
  31262. (I3 ^predict-no N1105 - :O )
  31263. inner elaboration loop at bottom goal.
  31264. --- Change Working Memory (PE) ---
  31265. =>WM: (15508: I3 ^predict-no N1106)
  31266. <=WM: (15495: N1105 ^status complete)
  31267. <=WM: (15494: I3 ^predict-no N1105)
  31268. --- Firing Productions (IE) For State At Depth 1 ---
  31269. --- Inner Elaboration Phase, active level 1 (S1) ---
  31270. Firing monitor*world
  31271. -->
  31272. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31273. --- Change Working Memory (IE) ---
  31274. --- END Application Phase ---
  31275. --- Output Phase ---
  31276. ENV: Agent did: predict-no for direction R in state State-B
  31277. In State-B moving R
  31278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31279. predict error 0
  31280. dir: dir isL
  31281. --- END Output Phase ---
  31282. |\--- Input Phase ---
  31283. =>WM: (15512: I2 ^dir L)
  31284. =>WM: (15511: I2 ^reward 1)
  31285. =>WM: (15510: I2 ^see 0)
  31286. =>WM: (15509: N1106 ^status complete)
  31287. <=WM: (15498: I2 ^dir R)
  31288. <=WM: (15497: I2 ^reward 1)
  31289. <=WM: (15496: I2 ^see 0)
  31290. =>WM: (15513: I2 ^level-1 R0-root)
  31291. <=WM: (15499: I2 ^level-1 R0-root)
  31292. --- END Input Phase ---
  31293. --- Proposal Phase ---
  31294. --- Inner Elaboration Phase, active level 1 (S1) ---
  31295. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  31296. -->
  31297. (S1 ^operator O2211 = 0.6195787722435855)
  31298. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  31299. -->
  31300. (S1 ^operator O2212 = -0.2190661556260421)
  31301. Firing prefer*rvt*predict-no*H0*2*v1*H1
  31302. -->
  31303. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  31304. -->
  31305. Firing elaborate*copy-see-to-output-link
  31306. -->
  31307. (I3 ^see 0 +)
  31308. Firing elaborate*reward*based*on*reward
  31309. -->
  31310. (R1110 ^value 1 +)
  31311. (R1 ^reward R1110 +)
  31312. Firing propose*predict-yes
  31313. -->
  31314. (O2213 ^name predict-yes +)
  31315. (S1 ^operator O2213 +)
  31316. Firing propose*predict-no
  31317. -->
  31318. (O2214 ^name predict-no +)
  31319. (S1 ^operator O2214 +)
  31320. Firing rl*prefer*rvt*predict-no*H0*2
  31321. -->
  31322. (S1 ^operator O2212 = 0.314041269303462)
  31323. Firing rl*prefer*rvt*predict-yes*H0*1
  31324. -->
  31325. (S1 ^operator O2211 = 0.3804137769811579)
  31326. Firing prefer*rvt*predict-yes*H0
  31327. -->
  31328. Firing prefer*rvt*predict-no*H0
  31329. -->
  31330. Firing elaborate*copy-dir-to-output-link
  31331. -->
  31332. (I3 ^dir L +)
  31333. inner elaboration loop at bottom goal.
  31334. Retracting elaborate*copy-see-to-output-link
  31335. -->
  31336. (I3 ^see 0 +)
  31337. Retracting propose*predict-no
  31338. -->
  31339. (O2212 ^name predict-no +)
  31340. (S1 ^operator O2212 +)
  31341. Retracting propose*predict-yes
  31342. -->
  31343. (O2211 ^name predict-yes +)
  31344. (S1 ^operator O2211 +)
  31345. Retracting elaborate*reward*based*on*reward
  31346. -->
  31347. (R1109 ^value 1 +)
  31348. (R1 ^reward R1109 +)
  31349. Retracting elaborate*copy-dir-to-output-link
  31350. -->
  31351. (I3 ^dir R +)
  31352. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31353. -->
  31354. (S1 ^operator O2212 = 0.7701246402854851)
  31355. Retracting rl*prefer*rvt*predict-no*H0*6
  31356. -->
  31357. (S1 ^operator O2212 = 0.2298654257475218)
  31358. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31359. -->
  31360. (S1 ^operator O2211 = -0.1254042659579056)
  31361. Retracting rl*prefer*rvt*predict-yes*H0*5
  31362. -->
  31363. (S1 ^operator O2211 = 0.2940536816948511)
  31364. =>WM: (15520: S1 ^operator O2214 +)
  31365. =>WM: (15519: S1 ^operator O2213 +)
  31366. =>WM: (15518: I3 ^dir L)
  31367. =>WM: (15517: O2214 ^name predict-no)
  31368. =>WM: (15516: O2213 ^name predict-yes)
  31369. =>WM: (15515: R1110 ^value 1)
  31370. =>WM: (15514: R1 ^reward R1110)
  31371. <=WM: (15505: S1 ^operator O2211 +)
  31372. <=WM: (15506: S1 ^operator O2212 +)
  31373. <=WM: (15507: S1 ^operator O2212)
  31374. <=WM: (15476: I3 ^dir R)
  31375. <=WM: (15501: R1 ^reward R1109)
  31376. <=WM: (15504: O2212 ^name predict-no)
  31377. <=WM: (15503: O2211 ^name predict-yes)
  31378. <=WM: (15502: R1109 ^value 1)
  31379. --- Inner Elaboration Phase, active level 1 (S1) ---
  31380. Firing prefer*rvt*predict-yes*H0
  31381. -->
  31382. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  31383. -->
  31384. (S1 ^operator O2213 = 0.6195787722435855)
  31385. Firing rl*prefer*rvt*predict-yes*H0*1
  31386. -->
  31387. (S1 ^operator O2213 = 0.3804137769811579)
  31388. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  31389. -->
  31390. Firing prefer*rvt*predict-no*H0
  31391. -->
  31392. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  31393. -->
  31394. (S1 ^operator O2214 = -0.2190661556260421)
  31395. Firing rl*prefer*rvt*predict-no*H0*2
  31396. -->
  31397. (S1 ^operator O2214 = 0.314041269303462)
  31398. Firing prefer*rvt*predict-no*H0*2*v1*H1
  31399. -->
  31400. inner elaboration loop at bottom goal.
  31401. Retracting rl*prefer*rvt*predict-no*H0*2
  31402. -->
  31403. (S1 ^operator O2212 = 0.314041269303462)
  31404. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  31405. -->
  31406. (S1 ^operator O2212 = -0.2190661556260421)
  31407. Retracting rl*prefer*rvt*predict-yes*H0*1
  31408. -->
  31409. (S1 ^operator O2211 = 0.3804137769811579)
  31410. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  31411. -->
  31412. (S1 ^operator O2211 = 0.6195787722435855)
  31413. --- END Proposal Phase ---
  31414. --- Decision Phase ---
  31415. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.864322,0.117862)
  31416. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388075 0.382049 0.770125 -> 0.388076 0.382049 0.770126(R,m,v=1,1,0)
  31417. =>WM: (15521: S1 ^operator O2213)
  31418. 1107: O: O2213 (predict-yes)
  31419. --- END Decision Phase ---
  31420. --- Application Phase ---
  31421. --- Firing Productions (PE) For State At Depth 1 ---
  31422. --- Inner Elaboration Phase, active level 1 (S1) ---
  31423. Firing apply*operator
  31424. -->
  31425. (I3 ^predict-yes N1107 + :O )
  31426. Firing apply*operator*complete
  31427. -->
  31428. (I3 ^predict-no N1106 - :O )
  31429. inner elaboration loop at bottom goal.
  31430. --- Change Working Memory (PE) ---
  31431. =>WM: (15522: I3 ^predict-yes N1107)
  31432. <=WM: (15509: N1106 ^status complete)
  31433. <=WM: (15508: I3 ^predict-no N1106)
  31434. --- Firing Productions (IE) For State At Depth 1 ---
  31435. --- Inner Elaboration Phase, active level 1 (S1) ---
  31436. Firing monitor*world
  31437. -->
  31438. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31439. --- Change Working Memory (IE) ---
  31440. --- END Application Phase ---
  31441. --- Output Phase ---
  31442. ENV: Agent did: predict-yes for direction L in state State-B
  31443. In State-B moving L
  31444. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  31445. predict error 0
  31446. dir: dir isR
  31447. --- END Output Phase ---
  31448. -/|--- Input Phase ---
  31449. =>WM: (15526: I2 ^dir R)
  31450. =>WM: (15525: I2 ^reward 1)
  31451. =>WM: (15524: I2 ^see 1)
  31452. =>WM: (15523: N1107 ^status complete)
  31453. <=WM: (15512: I2 ^dir L)
  31454. <=WM: (15511: I2 ^reward 1)
  31455. <=WM: (15510: I2 ^see 0)
  31456. =>WM: (15527: I2 ^level-1 L1-root)
  31457. <=WM: (15513: I2 ^level-1 R0-root)
  31458. --- END Input Phase ---
  31459. --- Proposal Phase ---
  31460. --- Inner Elaboration Phase, active level 1 (S1) ---
  31461. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  31462. -->
  31463. (S1 ^operator O2213 = 0.7061721241516533)
  31464. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  31465. -->
  31466. (S1 ^operator O2214 = -0.1937987592593187)
  31467. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31468. -->
  31469. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31470. -->
  31471. Firing elaborate*copy-see-to-output-link
  31472. -->
  31473. (I3 ^see 1 +)
  31474. Firing elaborate*reward*based*on*reward
  31475. -->
  31476. (R1111 ^value 1 +)
  31477. (R1 ^reward R1111 +)
  31478. Firing propose*predict-yes
  31479. -->
  31480. (O2215 ^name predict-yes +)
  31481. (S1 ^operator O2215 +)
  31482. Firing propose*predict-no
  31483. -->
  31484. (O2216 ^name predict-no +)
  31485. (S1 ^operator O2216 +)
  31486. Firing rl*prefer*rvt*predict-no*H0*6
  31487. -->
  31488. (S1 ^operator O2214 = 0.2298662149963561)
  31489. Firing rl*prefer*rvt*predict-yes*H0*5
  31490. -->
  31491. (S1 ^operator O2213 = 0.2940536816948511)
  31492. Firing prefer*rvt*predict-yes*H0
  31493. -->
  31494. Firing prefer*rvt*predict-no*H0
  31495. -->
  31496. Firing elaborate*copy-dir-to-output-link
  31497. -->
  31498. (I3 ^dir R +)
  31499. inner elaboration loop at bottom goal.
  31500. Retracting elaborate*copy-see-to-output-link
  31501. -->
  31502. (I3 ^see 0 +)
  31503. Retracting propose*predict-no
  31504. -->
  31505. (O2214 ^name predict-no +)
  31506. (S1 ^operator O2214 +)
  31507. Retracting propose*predict-yes
  31508. -->
  31509. (O2213 ^name predict-yes +)
  31510. (S1 ^operator O2213 +)
  31511. Retracting elaborate*reward*based*on*reward
  31512. -->
  31513. (R1110 ^value 1 +)
  31514. (R1 ^reward R1110 +)
  31515. Retracting elaborate*copy-dir-to-output-link
  31516. -->
  31517. (I3 ^dir L +)
  31518. Retracting rl*prefer*rvt*predict-no*H0*2
  31519. -->
  31520. (S1 ^operator O2214 = 0.314041269303462)
  31521. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  31522. -->
  31523. (S1 ^operator O2214 = -0.2190661556260421)
  31524. Retracting rl*prefer*rvt*predict-yes*H0*1
  31525. -->
  31526. (S1 ^operator O2213 = 0.3804137769811579)
  31527. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  31528. -->
  31529. (S1 ^operator O2213 = 0.6195787722435855)
  31530. =>WM: (15535: S1 ^operator O2216 +)
  31531. =>WM: (15534: S1 ^operator O2215 +)
  31532. =>WM: (15533: I3 ^dir R)
  31533. =>WM: (15532: O2216 ^name predict-no)
  31534. =>WM: (15531: O2215 ^name predict-yes)
  31535. =>WM: (15530: R1111 ^value 1)
  31536. =>WM: (15529: R1 ^reward R1111)
  31537. =>WM: (15528: I3 ^see 1)
  31538. <=WM: (15519: S1 ^operator O2213 +)
  31539. <=WM: (15521: S1 ^operator O2213)
  31540. <=WM: (15520: S1 ^operator O2214 +)
  31541. <=WM: (15518: I3 ^dir L)
  31542. <=WM: (15514: R1 ^reward R1110)
  31543. <=WM: (15500: I3 ^see 0)
  31544. <=WM: (15517: O2214 ^name predict-no)
  31545. <=WM: (15516: O2213 ^name predict-yes)
  31546. <=WM: (15515: R1110 ^value 1)
  31547. --- Inner Elaboration Phase, active level 1 (S1) ---
  31548. Firing prefer*rvt*predict-yes*H0
  31549. -->
  31550. Firing rl*prefer*rvt*predict-yes*H0*5
  31551. -->
  31552. (S1 ^operator O2215 = 0.2940536816948511)
  31553. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31554. -->
  31555. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  31556. -->
  31557. (S1 ^operator O2215 = 0.7061721241516533)
  31558. Firing prefer*rvt*predict-no*H0
  31559. -->
  31560. Firing rl*prefer*rvt*predict-no*H0*6
  31561. -->
  31562. (S1 ^operator O2216 = 0.2298662149963561)
  31563. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31564. -->
  31565. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  31566. -->
  31567. (S1 ^operator O2216 = -0.1937987592593187)
  31568. inner elaboration loop at bottom goal.
  31569. Retracting rl*prefer*rvt*predict-no*H0*6
  31570. -->
  31571. (S1 ^operator O2214 = 0.2298662149963561)
  31572. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  31573. -->
  31574. (S1 ^operator O2214 = -0.1937987592593187)
  31575. Retracting rl*prefer*rvt*predict-yes*H0*5
  31576. -->
  31577. (S1 ^operator O2213 = 0.2940536816948511)
  31578. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  31579. -->
  31580. (S1 ^operator O2213 = 0.7061721241516533)
  31581. --- END Proposal Phase ---
  31582. --- Decision Phase ---
  31583. RL update rl*prefer*rvt*predict-yes*H0*1 0.521344 -0.14093 0.380414 -> 0.521344 -0.14093 0.380414(R,m,v=1,0.846154,0.130897)
  31584. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*35 0.478648 0.140931 0.619579 -> 0.478649 0.140931 0.619579(R,m,v=1,1,0)
  31585. =>WM: (15536: S1 ^operator O2215)
  31586. 1108: O: O2215 (predict-yes)
  31587. --- END Decision Phase ---
  31588. --- Application Phase ---
  31589. --- Firing Productions (PE) For State At Depth 1 ---
  31590. --- Inner Elaboration Phase, active level 1 (S1) ---
  31591. Firing apply*operator
  31592. -->
  31593. (I3 ^predict-yes N1108 + :O )
  31594. Firing apply*operator*complete
  31595. -->
  31596. (I3 ^predict-yes N1107 - :O )
  31597. inner elaboration loop at bottom goal.
  31598. --- Change Working Memory (PE) ---
  31599. =>WM: (15537: I3 ^predict-yes N1108)
  31600. <=WM: (15523: N1107 ^status complete)
  31601. <=WM: (15522: I3 ^predict-yes N1107)
  31602. --- Firing Productions (IE) For State At Depth 1 ---
  31603. --- Inner Elaboration Phase, active level 1 (S1) ---
  31604. Firing monitor*world
  31605. -->
  31606. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31607. --- Change Working Memory (IE) ---
  31608. --- END Application Phase ---
  31609. --- Output Phase ---
  31610. ENV: Agent did: predict-yes for direction R in state State-A
  31611. In State-A moving R
  31612. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  31613. predict error 0
  31614. dir: dir isR
  31615. --- END Output Phase ---
  31616. \-/--- Input Phase ---
  31617. =>WM: (15541: I2 ^dir R)
  31618. =>WM: (15540: I2 ^reward 1)
  31619. =>WM: (15539: I2 ^see 1)
  31620. =>WM: (15538: N1108 ^status complete)
  31621. <=WM: (15526: I2 ^dir R)
  31622. <=WM: (15525: I2 ^reward 1)
  31623. <=WM: (15524: I2 ^see 1)
  31624. =>WM: (15542: I2 ^level-1 R1-root)
  31625. <=WM: (15527: I2 ^level-1 L1-root)
  31626. --- END Input Phase ---
  31627. --- Proposal Phase ---
  31628. --- Inner Elaboration Phase, active level 1 (S1) ---
  31629. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31630. -->
  31631. (S1 ^operator O2215 = -0.252585164213872)
  31632. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31633. -->
  31634. (S1 ^operator O2216 = 0.7701530160237312)
  31635. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31636. -->
  31637. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31638. -->
  31639. Firing elaborate*copy-see-to-output-link
  31640. -->
  31641. (I3 ^see 1 +)
  31642. Firing elaborate*reward*based*on*reward
  31643. -->
  31644. (R1112 ^value 1 +)
  31645. (R1 ^reward R1112 +)
  31646. Firing propose*predict-yes
  31647. -->
  31648. (O2217 ^name predict-yes +)
  31649. (S1 ^operator O2217 +)
  31650. Firing propose*predict-no
  31651. -->
  31652. (O2218 ^name predict-no +)
  31653. (S1 ^operator O2218 +)
  31654. Firing rl*prefer*rvt*predict-no*H0*6
  31655. -->
  31656. (S1 ^operator O2216 = 0.2298662149963561)
  31657. Firing rl*prefer*rvt*predict-yes*H0*5
  31658. -->
  31659. (S1 ^operator O2215 = 0.2940536816948511)
  31660. Firing prefer*rvt*predict-yes*H0
  31661. -->
  31662. Firing prefer*rvt*predict-no*H0
  31663. -->
  31664. Firing elaborate*copy-dir-to-output-link
  31665. -->
  31666. (I3 ^dir R +)
  31667. inner elaboration loop at bottom goal.
  31668. Retracting elaborate*copy-see-to-output-link
  31669. -->
  31670. (I3 ^see 1 +)
  31671. Retracting propose*predict-no
  31672. -->
  31673. (O2216 ^name predict-no +)
  31674. (S1 ^operator O2216 +)
  31675. Retracting propose*predict-yes
  31676. -->
  31677. (O2215 ^name predict-yes +)
  31678. (S1 ^operator O2215 +)
  31679. Retracting elaborate*reward*based*on*reward
  31680. -->
  31681. (R1111 ^value 1 +)
  31682. (R1 ^reward R1111 +)
  31683. Retracting elaborate*copy-dir-to-output-link
  31684. -->
  31685. (I3 ^dir R +)
  31686. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*26
  31687. -->
  31688. (S1 ^operator O2216 = -0.1937987592593187)
  31689. Retracting rl*prefer*rvt*predict-no*H0*6
  31690. -->
  31691. (S1 ^operator O2216 = 0.2298662149963561)
  31692. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*27
  31693. -->
  31694. (S1 ^operator O2215 = 0.7061721241516533)
  31695. Retracting rl*prefer*rvt*predict-yes*H0*5
  31696. -->
  31697. (S1 ^operator O2215 = 0.2940536816948511)
  31698. =>WM: (15548: S1 ^operator O2218 +)
  31699. =>WM: (15547: S1 ^operator O2217 +)
  31700. =>WM: (15546: O2218 ^name predict-no)
  31701. =>WM: (15545: O2217 ^name predict-yes)
  31702. =>WM: (15544: R1112 ^value 1)
  31703. =>WM: (15543: R1 ^reward R1112)
  31704. <=WM: (15534: S1 ^operator O2215 +)
  31705. <=WM: (15536: S1 ^operator O2215)
  31706. <=WM: (15535: S1 ^operator O2216 +)
  31707. <=WM: (15529: R1 ^reward R1111)
  31708. <=WM: (15532: O2216 ^name predict-no)
  31709. <=WM: (15531: O2215 ^name predict-yes)
  31710. <=WM: (15530: R1111 ^value 1)
  31711. --- Inner Elaboration Phase, active level 1 (S1) ---
  31712. Firing prefer*rvt*predict-yes*H0
  31713. -->
  31714. Firing rl*prefer*rvt*predict-yes*H0*5
  31715. -->
  31716. (S1 ^operator O2217 = 0.2940536816948511)
  31717. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31718. -->
  31719. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31720. -->
  31721. (S1 ^operator O2217 = -0.252585164213872)
  31722. Firing prefer*rvt*predict-no*H0
  31723. -->
  31724. Firing rl*prefer*rvt*predict-no*H0*6
  31725. -->
  31726. (S1 ^operator O2218 = 0.2298662149963561)
  31727. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31728. -->
  31729. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31730. -->
  31731. (S1 ^operator O2218 = 0.7701530160237312)
  31732. inner elaboration loop at bottom goal.
  31733. Retracting rl*prefer*rvt*predict-no*H0*6
  31734. -->
  31735. (S1 ^operator O2216 = 0.2298662149963561)
  31736. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31737. -->
  31738. (S1 ^operator O2216 = 0.7701530160237312)
  31739. Retracting rl*prefer*rvt*predict-yes*H0*5
  31740. -->
  31741. (S1 ^operator O2215 = 0.2940536816948511)
  31742. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31743. -->
  31744. (S1 ^operator O2215 = -0.252585164213872)
  31745. --- END Proposal Phase ---
  31746. --- Decision Phase ---
  31747. RL update rl*prefer*rvt*predict-yes*H0*5 0.501123 -0.207069 0.294054 -> 0.501106 -0.207071 0.294035(R,m,v=1,0.861272,0.120177)
  31748. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*27 0.499081 0.207091 0.706172 -> 0.499062 0.207089 0.706151(R,m,v=1,1,0)
  31749. =>WM: (15549: S1 ^operator O2218)
  31750. 1109: O: O2218 (predict-no)
  31751. --- END Decision Phase ---
  31752. --- Application Phase ---
  31753. --- Firing Productions (PE) For State At Depth 1 ---
  31754. --- Inner Elaboration Phase, active level 1 (S1) ---
  31755. Firing apply*operator
  31756. -->
  31757. (I3 ^predict-no N1109 + :O )
  31758. Firing apply*operator*complete
  31759. -->
  31760. (I3 ^predict-yes N1108 - :O )
  31761. inner elaboration loop at bottom goal.
  31762. --- Change Working Memory (PE) ---
  31763. =>WM: (15550: I3 ^predict-no N1109)
  31764. <=WM: (15538: N1108 ^status complete)
  31765. <=WM: (15537: I3 ^predict-yes N1108)
  31766. --- Firing Productions (IE) For State At Depth 1 ---
  31767. --- Inner Elaboration Phase, active level 1 (S1) ---
  31768. Firing monitor*world
  31769. -->
  31770. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31771. --- Change Working Memory (IE) ---
  31772. --- END Application Phase ---
  31773. --- Output Phase ---
  31774. ENV: Agent did: predict-no for direction R in state State-B
  31775. In State-B moving R
  31776. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31777. predict error 0
  31778. dir: dir isR
  31779. --- END Output Phase ---
  31780. |\---- Input Phase ---
  31781. =>WM: (15554: I2 ^dir R)
  31782. =>WM: (15553: I2 ^reward 1)
  31783. =>WM: (15552: I2 ^see 0)
  31784. =>WM: (15551: N1109 ^status complete)
  31785. <=WM: (15541: I2 ^dir R)
  31786. <=WM: (15540: I2 ^reward 1)
  31787. <=WM: (15539: I2 ^see 1)
  31788. =>WM: (15555: I2 ^level-1 R0-root)
  31789. <=WM: (15542: I2 ^level-1 R1-root)
  31790. --- END Input Phase ---
  31791. --- Proposal Phase ---
  31792. --- Inner Elaboration Phase, active level 1 (S1) ---
  31793. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31794. -->
  31795. (S1 ^operator O2217 = -0.1254042659579056)
  31796. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31797. -->
  31798. (S1 ^operator O2218 = 0.770125534612744)
  31799. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31800. -->
  31801. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31802. -->
  31803. Firing elaborate*copy-see-to-output-link
  31804. -->
  31805. (I3 ^see 0 +)
  31806. Firing elaborate*reward*based*on*reward
  31807. -->
  31808. (R1113 ^value 1 +)
  31809. (R1 ^reward R1113 +)
  31810. Firing propose*predict-yes
  31811. -->
  31812. (O2219 ^name predict-yes +)
  31813. (S1 ^operator O2219 +)
  31814. Firing propose*predict-no
  31815. -->
  31816. (O2220 ^name predict-no +)
  31817. (S1 ^operator O2220 +)
  31818. Firing rl*prefer*rvt*predict-no*H0*6
  31819. -->
  31820. (S1 ^operator O2218 = 0.2298662149963561)
  31821. Firing rl*prefer*rvt*predict-yes*H0*5
  31822. -->
  31823. (S1 ^operator O2217 = 0.2940353333163421)
  31824. Firing prefer*rvt*predict-yes*H0
  31825. -->
  31826. Firing prefer*rvt*predict-no*H0
  31827. -->
  31828. Firing elaborate*copy-dir-to-output-link
  31829. -->
  31830. (I3 ^dir R +)
  31831. inner elaboration loop at bottom goal.
  31832. Retracting elaborate*copy-see-to-output-link
  31833. -->
  31834. (I3 ^see 1 +)
  31835. Retracting propose*predict-no
  31836. -->
  31837. (O2218 ^name predict-no +)
  31838. (S1 ^operator O2218 +)
  31839. Retracting propose*predict-yes
  31840. -->
  31841. (O2217 ^name predict-yes +)
  31842. (S1 ^operator O2217 +)
  31843. Retracting elaborate*reward*based*on*reward
  31844. -->
  31845. (R1112 ^value 1 +)
  31846. (R1 ^reward R1112 +)
  31847. Retracting elaborate*copy-dir-to-output-link
  31848. -->
  31849. (I3 ^dir R +)
  31850. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*30
  31851. -->
  31852. (S1 ^operator O2218 = 0.7701530160237312)
  31853. Retracting rl*prefer*rvt*predict-no*H0*6
  31854. -->
  31855. (S1 ^operator O2218 = 0.2298662149963561)
  31856. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31857. -->
  31858. (S1 ^operator O2217 = -0.252585164213872)
  31859. Retracting rl*prefer*rvt*predict-yes*H0*5
  31860. -->
  31861. (S1 ^operator O2217 = 0.2940353333163421)
  31862. =>WM: (15562: S1 ^operator O2220 +)
  31863. =>WM: (15561: S1 ^operator O2219 +)
  31864. =>WM: (15560: O2220 ^name predict-no)
  31865. =>WM: (15559: O2219 ^name predict-yes)
  31866. =>WM: (15558: R1113 ^value 1)
  31867. =>WM: (15557: R1 ^reward R1113)
  31868. =>WM: (15556: I3 ^see 0)
  31869. <=WM: (15547: S1 ^operator O2217 +)
  31870. <=WM: (15548: S1 ^operator O2218 +)
  31871. <=WM: (15549: S1 ^operator O2218)
  31872. <=WM: (15543: R1 ^reward R1112)
  31873. <=WM: (15528: I3 ^see 1)
  31874. <=WM: (15546: O2218 ^name predict-no)
  31875. <=WM: (15545: O2217 ^name predict-yes)
  31876. <=WM: (15544: R1112 ^value 1)
  31877. --- Inner Elaboration Phase, active level 1 (S1) ---
  31878. Firing prefer*rvt*predict-yes*H0
  31879. -->
  31880. Firing rl*prefer*rvt*predict-yes*H0*5
  31881. -->
  31882. (S1 ^operator O2219 = 0.2940353333163421)
  31883. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31884. -->
  31885. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31886. -->
  31887. (S1 ^operator O2219 = -0.1254042659579056)
  31888. Firing prefer*rvt*predict-no*H0
  31889. -->
  31890. Firing rl*prefer*rvt*predict-no*H0*6
  31891. -->
  31892. (S1 ^operator O2220 = 0.2298662149963561)
  31893. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31894. -->
  31895. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31896. -->
  31897. (S1 ^operator O2220 = 0.770125534612744)
  31898. inner elaboration loop at bottom goal.
  31899. Retracting rl*prefer*rvt*predict-no*H0*6
  31900. -->
  31901. (S1 ^operator O2218 = 0.2298662149963561)
  31902. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31903. -->
  31904. (S1 ^operator O2218 = 0.770125534612744)
  31905. Retracting rl*prefer*rvt*predict-yes*H0*5
  31906. -->
  31907. (S1 ^operator O2217 = 0.2940353333163421)
  31908. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31909. -->
  31910. (S1 ^operator O2217 = -0.1254042659579056)
  31911. --- END Proposal Phase ---
  31912. --- Decision Phase ---
  31913. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229866 -> 0.611916 -0.382051 0.229865(R,m,v=1,0.865,0.117362)
  31914. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*30 0.388098 0.382055 0.770153 -> 0.388097 0.382054 0.770151(R,m,v=1,1,0)
  31915. =>WM: (15563: S1 ^operator O2220)
  31916. 1110: O: O2220 (predict-no)
  31917. --- END Decision Phase ---
  31918. --- Application Phase ---
  31919. --- Firing Productions (PE) For State At Depth 1 ---
  31920. --- Inner Elaboration Phase, active level 1 (S1) ---
  31921. Firing apply*operator
  31922. -->
  31923. (I3 ^predict-no N1110 + :O )
  31924. Firing apply*operator*complete
  31925. -->
  31926. (I3 ^predict-no N1109 - :O )
  31927. inner elaboration loop at bottom goal.
  31928. --- Change Working Memory (PE) ---
  31929. =>WM: (15564: I3 ^predict-no N1110)
  31930. <=WM: (15551: N1109 ^status complete)
  31931. <=WM: (15550: I3 ^predict-no N1109)
  31932. --- Firing Productions (IE) For State At Depth 1 ---
  31933. --- Inner Elaboration Phase, active level 1 (S1) ---
  31934. Firing monitor*world
  31935. -->
  31936. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31937. --- Change Working Memory (IE) ---
  31938. --- END Application Phase ---
  31939. --- Output Phase ---
  31940. ENV: Agent did: predict-no for direction R in state State-B
  31941. In State-B moving R
  31942. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31943. predict error 0
  31944. dir: dir isR
  31945. --- END Output Phase ---
  31946. /|\--- Input Phase ---
  31947. =>WM: (15568: I2 ^dir R)
  31948. =>WM: (15567: I2 ^reward 1)
  31949. =>WM: (15566: I2 ^see 0)
  31950. =>WM: (15565: N1110 ^status complete)
  31951. <=WM: (15554: I2 ^dir R)
  31952. <=WM: (15553: I2 ^reward 1)
  31953. <=WM: (15552: I2 ^see 0)
  31954. =>WM: (15569: I2 ^level-1 R0-root)
  31955. <=WM: (15555: I2 ^level-1 R0-root)
  31956. --- END Input Phase ---
  31957. --- Proposal Phase ---
  31958. --- Inner Elaboration Phase, active level 1 (S1) ---
  31959. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  31960. -->
  31961. (S1 ^operator O2219 = -0.1254042659579056)
  31962. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  31963. -->
  31964. (S1 ^operator O2220 = 0.770125534612744)
  31965. Firing prefer*rvt*predict-no*H0*6*v1*H1
  31966. -->
  31967. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31968. -->
  31969. Firing elaborate*copy-see-to-output-link
  31970. -->
  31971. (I3 ^see 0 +)
  31972. Firing elaborate*reward*based*on*reward
  31973. -->
  31974. (R1114 ^value 1 +)
  31975. (R1 ^reward R1114 +)
  31976. Firing propose*predict-yes
  31977. -->
  31978. (O2221 ^name predict-yes +)
  31979. (S1 ^operator O2221 +)
  31980. Firing propose*predict-no
  31981. -->
  31982. (O2222 ^name predict-no +)
  31983. (S1 ^operator O2222 +)
  31984. Firing rl*prefer*rvt*predict-no*H0*6
  31985. -->
  31986. (S1 ^operator O2220 = 0.2298646883171679)
  31987. Firing rl*prefer*rvt*predict-yes*H0*5
  31988. -->
  31989. (S1 ^operator O2219 = 0.2940353333163421)
  31990. Firing prefer*rvt*predict-yes*H0
  31991. -->
  31992. Firing prefer*rvt*predict-no*H0
  31993. -->
  31994. Firing elaborate*copy-dir-to-output-link
  31995. -->
  31996. (I3 ^dir R +)
  31997. inner elaboration loop at bottom goal.
  31998. Retracting elaborate*copy-see-to-output-link
  31999. -->
  32000. (I3 ^see 0 +)
  32001. Retracting propose*predict-no
  32002. -->
  32003. (O2220 ^name predict-no +)
  32004. (S1 ^operator O2220 +)
  32005. Retracting propose*predict-yes
  32006. -->
  32007. (O2219 ^name predict-yes +)
  32008. (S1 ^operator O2219 +)
  32009. Retracting elaborate*reward*based*on*reward
  32010. -->
  32011. (R1113 ^value 1 +)
  32012. (R1 ^reward R1113 +)
  32013. Retracting elaborate*copy-dir-to-output-link
  32014. -->
  32015. (I3 ^dir R +)
  32016. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  32017. -->
  32018. (S1 ^operator O2220 = 0.770125534612744)
  32019. Retracting rl*prefer*rvt*predict-no*H0*6
  32020. -->
  32021. (S1 ^operator O2220 = 0.2298646883171679)
  32022. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  32023. -->
  32024. (S1 ^operator O2219 = -0.1254042659579056)
  32025. Retracting rl*prefer*rvt*predict-yes*H0*5
  32026. -->
  32027. (S1 ^operator O2219 = 0.2940353333163421)
  32028. =>WM: (15575: S1 ^operator O2222 +)
  32029. =>WM: (15574: S1 ^operator O2221 +)
  32030. =>WM: (15573: O2222 ^name predict-no)
  32031. =>WM: (15572: O2221 ^name predict-yes)
  32032. =>WM: (15571: R1114 ^value 1)
  32033. =>WM: (15570: R1 ^reward R1114)
  32034. <=WM: (15561: S1 ^operator O2219 +)
  32035. <=WM: (15562: S1 ^operator O2220 +)
  32036. <=WM: (15563: S1 ^operator O2220)
  32037. <=WM: (15557: R1 ^reward R1113)
  32038. <=WM: (15560: O2220 ^name predict-no)
  32039. <=WM: (15559: O2219 ^name predict-yes)
  32040. <=WM: (15558: R1113 ^value 1)
  32041. --- Inner Elaboration Phase, active level 1 (S1) ---
  32042. Firing prefer*rvt*predict-yes*H0
  32043. -->
  32044. Firing rl*prefer*rvt*predict-yes*H0*5
  32045. -->
  32046. (S1 ^operator O2221 = 0.2940353333163421)
  32047. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  32048. -->
  32049. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  32050. -->
  32051. (S1 ^operator O2221 = -0.1254042659579056)
  32052. Firing prefer*rvt*predict-no*H0
  32053. -->
  32054. Firing rl*prefer*rvt*predict-no*H0*6
  32055. -->
  32056. (S1 ^operator O2222 = 0.2298646883171679)
  32057. Firing prefer*rvt*predict-no*H0*6*v1*H1
  32058. -->
  32059. Firing rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  32060. -->
  32061. (S1 ^operator O2222 = 0.770125534612744)
  32062. inner elaboration loop at bottom goal.
  32063. Retracting rl*prefer*rvt*predict-no*H0*6
  32064. -->
  32065. (S1 ^operator O2220 = 0.2298646883171679)
  32066. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  32067. -->
  32068. (S1 ^operator O2220 = 0.770125534612744)
  32069. Retracting rl*prefer*rvt*predict-yes*H0*5
  32070. -->
  32071. (S1 ^operator O2219 = 0.2940353333163421)
  32072. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  32073. -->
  32074. (S1 ^operator O2219 = -0.1254042659579056)
  32075. --- END Proposal Phase ---
  32076. --- Decision Phase ---
  32077. RL update rl*prefer*rvt*predict-no*H0*6 0.611916 -0.382051 0.229865 -> 0.611917 -0.382051 0.229865(R,m,v=1,0.865672,0.116866)
  32078. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388076 0.382049 0.770126 -> 0.388077 0.38205 0.770126(R,m,v=1,1,0)
  32079. =>WM: (15576: S1 ^operator O2222)
  32080. 1111: O: O2222 (predict-no)
  32081. --- END Decision Phase ---
  32082. --- Application Phase ---
  32083. --- Firing Productions (PE) For State At Depth 1 ---
  32084. --- Inner Elaboration Phase, active level 1 (S1) ---
  32085. Firing apply*operator
  32086. -->
  32087. (I3 ^predict-no N1111 + :O )
  32088. Firing apply*operator*complete
  32089. -->
  32090. (I3 ^predict-no N1110 - :O )
  32091. inner elaboration loop at bottom goal.
  32092. --- Change Working Memory (PE) ---
  32093. =>WM: (15577: I3 ^predict-no N1111)
  32094. <=WM: (15565: N1110 ^status complete)
  32095. <=WM: (15564: I3 ^predict-no N1110)
  32096. --- Firing Productions (IE) For State At Depth 1 ---
  32097. --- Inner Elaboration Phase, active level 1 (S1) ---
  32098. Firing monitor*world
  32099. -->
  32100. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32101. --- Change Working Memory (IE) ---
  32102. --- END Application Phase ---
  32103. --- Output Phase ---
  32104. ENV: Agent did: predict-no for direction R in state State-B
  32105. In State-B moving R
  32106. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32107. predict error 0
  32108. dir: dir isU
  32109. --- END Output Phase ---
  32110. ---- Input Phase ---
  32111. =>WM: (15581: I2 ^dir U)
  32112. =>WM: (15580: I2 ^reward 1)
  32113. =>WM: (15579: I2 ^see 0)
  32114. =>WM: (15578: N1111 ^status complete)
  32115. <=WM: (15568: I2 ^dir R)
  32116. <=WM: (15567: I2 ^reward 1)
  32117. <=WM: (15566: I2 ^see 0)
  32118. =>WM: (15582: I2 ^level-1 R0-root)
  32119. <=WM: (15569: I2 ^level-1 R0-root)
  32120. --- END Input Phase ---
  32121. --- Proposal Phase ---
  32122. --- Inner Elaboration Phase, active level 1 (S1) ---
  32123. Firing elaborate*copy-see-to-output-link
  32124. -->
  32125. (I3 ^see 0 +)
  32126. Firing elaborate*reward*based*on*reward
  32127. -->
  32128. (R1115 ^value 1 +)
  32129. (R1 ^reward R1115 +)
  32130. Firing propose*predict-yes
  32131. -->
  32132. (O2223 ^name predict-yes +)
  32133. (S1 ^operator O2223 +)
  32134. Firing propose*predict-no
  32135. -->
  32136. (O2224 ^name predict-no +)
  32137. (S1 ^operator O2224 +)
  32138. Firing rl*prefer*rvt*predict-no*H0*4
  32139. -->
  32140. (S1 ^operator O2222 = 1.)
  32141. Firing rl*prefer*rvt*predict-yes*H0*3
  32142. -->
  32143. (S1 ^operator O2221 = 0.)
  32144. Firing prefer*rvt*predict-yes*H0
  32145. -->
  32146. Firing prefer*rvt*predict-no*H0
  32147. -->
  32148. Firing elaborate*copy-dir-to-output-link
  32149. -->
  32150. (I3 ^dir U +)
  32151. inner elaboration loop at bottom goal.
  32152. Retracting elaborate*copy-see-to-output-link
  32153. -->
  32154. (I3 ^see 0 +)
  32155. Retracting propose*predict-no
  32156. -->
  32157. (O2222 ^name predict-no +)
  32158. (S1 ^operator O2222 +)
  32159. Retracting propose*predict-yes
  32160. -->
  32161. (O2221 ^name predict-yes +)
  32162. (S1 ^operator O2221 +)
  32163. Retracting elaborate*reward*based*on*reward
  32164. -->
  32165. (R1114 ^value 1 +)
  32166. (R1 ^reward R1114 +)
  32167. Retracting elaborate*copy-dir-to-output-link
  32168. -->
  32169. (I3 ^dir R +)
  32170. Retracting rl*prefer*rvt*predict-no*H0*6*v1*H1*32
  32171. -->
  32172. (S1 ^operator O2222 = 0.7701264131585999)
  32173. Retracting rl*prefer*rvt*predict-no*H0*6
  32174. -->
  32175. (S1 ^operator O2222 = 0.2298654638682661)
  32176. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*33
  32177. -->
  32178. (S1 ^operator O2221 = -0.1254042659579056)
  32179. Retracting rl*prefer*rvt*predict-yes*H0*5
  32180. -->
  32181. (S1 ^operator O2221 = 0.2940353333163421)
  32182. =>WM: (15589: S1 ^operator O2224 +)
  32183. =>WM: (15588: S1 ^operator O2223 +)
  32184. =>WM: (15587: I3 ^dir U)
  32185. =>WM: (15586: O2224 ^name predict-no)
  32186. =>WM: (15585: O2223 ^name predict-yes)
  32187. =>WM: (15584: R1115 ^value 1)
  32188. =>WM: (15583: R1 ^reward R1115)
  32189. <=WM: (15574: S1 ^operator O2221 +)
  32190. <=WM: (15575: S1 ^operator O2222 +)
  32191. <=WM: (15576: S1 ^operator O2222)
  32192. <=WM: (15533: I3 ^dir R)
  32193. <=WM: (15570: R1 ^reward R1114)
  32194. <=WM: (15573: O2222 ^name predict-no)
  32195. <=WM: (15572: O2221 ^name predict-yes)
  32196. <=WM: (15571: R1114 ^value 1)
  32197. --- Inner Elaboration Phase, active level 1 (S1) ---
  32198. Firing prefer*rvt*predict-yes*H0
  32199. -->
  32200. Firing rl*prefer*rvt*predict-yes*H0*3
  32201. -->
  32202. (S1 ^operator O2223 = 0.)
  32203. Firing prefer*rvt*predict-no*H0
  32204. -->
  32205. Firing rl*prefer*rvt*predict-no*H0*4
  32206. -->
  32207. (S1 ^operator O2224 = 1.)
  32208. inner elaboration loop at bottom goal.
  32209. Retracting rl*prefer*rvt*predict-no*H0*4
  32210. -->
  32211. (S1 ^operator O2222 = 1.)
  32212. Retracting rl*prefer*rvt*predict-yes*H0*3
  32213. -->
  32214. (S1 ^operator O2221 = 0.)
  32215. --- END Proposal Phase ---
  32216. --- Decision Phase ---
  32217. RL update rl*prefer*rvt*predict-no*H0*6 0.611917 -0.382051 0.229865 -> 0.611917 -0.382051 0.229866(R,m,v=1,0.866337,0.116374)
  32218. RL update rl*prefer*rvt*predict-no*H0*6*v1*H1*32 0.388077 0.38205 0.770126 -> 0.388077 0.38205 0.770127(R,m,v=1,1,0)
  32219. =>WM: (15590: S1 ^operator O2224)
  32220. 1112: O: O2224 (predict-no)
  32221. --- END Decision Phase ---
  32222. --- Application Phase ---
  32223. --- Firing Productions (PE) For State At Depth 1 ---
  32224. --- Inner Elaboration Phase, active level 1 (S1) ---
  32225. Firing apply*operator
  32226. -->
  32227. (I3 ^predict-no N1112 + :O )
  32228. Firing apply*operator*complete
  32229. -->
  32230. (I3 ^predict-no N1111 - :O )
  32231. inner elaboration loop at bottom goal.
  32232. --- Change Working Memory (PE) ---
  32233. =>WM: (15591: I3 ^predict-no N1112)
  32234. <=WM: (15578: N1111 ^status complete)
  32235. <=WM: (15577: I3 ^predict-no N1111)
  32236. --- Firing Productions (IE) For State At Depth 1 ---
  32237. --- Inner Elaboration Phase, active level 1 (S1) ---
  32238. Firing monitor*world
  32239. -->
  32240. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32241. --- Change Working Memory (IE) ---
  32242. --- END Application Phase ---
  32243. --- Output Phase ---
  32244. ENV: Agent did: predict-no for direction U in state State-B
  32245. In State-B moving U
  32246. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32247. predict error 0
  32248. dir: dir isU
  32249. --- END Output Phase ---
  32250. /|--- Input Phase ---
  32251. =>WM: (15595: I2 ^dir U)
  32252. =>WM: (15594: I2 ^reward 1)
  32253. =>WM: (15593: I2 ^see 0)
  32254. =>WM: (15592: N1112 ^status complete)
  32255. <=WM: (15581: I2 ^dir U)
  32256. <=WM: (15580: I2 ^reward 1)
  32257. <=WM: (15579: I2 ^see 0)
  32258. =>WM: (15596: I2 ^level-1 R0-root)
  32259. <=WM: (15582: I2 ^level-1 R0-root)
  32260. --- END Input Phase ---
  32261. --- Proposal Phase ---
  32262. --- Inner Elaboration Phase, active level 1 (S1) ---
  32263. Firing elaborate*copy-see-to-output-link
  32264. -->
  32265. (I3 ^see 0 +)
  32266. Firing elaborate*reward*based*on*reward
  32267. -->
  32268. (R1116 ^value 1 +)
  32269. (R1 ^reward R1116 +)
  32270. Firing propose*predict-yes
  32271. -->
  32272. (O2225 ^name predict-yes +)
  32273. (S1 ^operator O2225 +)
  32274. Firing propose*predict-no
  32275. -->
  32276. (O2226 ^name predict-no +)
  32277. (S1 ^operator O2226 +)
  32278. Firing rl*prefer*rvt*predict-no*H0*4
  32279. -->
  32280. (S1 ^operator O2224 = 1.)
  32281. Firing rl*prefer*rvt*predict-yes*H0*3
  32282. -->
  32283. (S1 ^operator O2223 = 0.)
  32284. Firing prefer*rvt*predict-yes*H0
  32285. -->
  32286. Firing prefer*rvt*predict-no*H0
  32287. -->
  32288. Firing elaborate*copy-dir-to-output-link
  32289. -->
  32290. (I3 ^dir U +)
  32291. inner elaboration loop at bottom goal.
  32292. Retracting elaborate*copy-see-to-output-link
  32293. -->
  32294. (I3 ^see 0 +)
  32295. Retracting propose*predict-no
  32296. -->
  32297. (O2224 ^name predict-no +)
  32298. (S1 ^operator O2224 +)
  32299. Retracting propose*predict-yes
  32300. -->
  32301. (O2223 ^name predict-yes +)
  32302. (S1 ^operator O2223 +)
  32303. Retracting elaborate*reward*based*on*reward
  32304. -->
  32305. (R1115 ^value 1 +)
  32306. (R1 ^reward R1115 +)
  32307. Retracting elaborate*copy-dir-to-output-link
  32308. -->
  32309. (I3 ^dir U +)
  32310. Retracting rl*prefer*rvt*predict-no*H0*4
  32311. -->
  32312. (S1 ^operator O2224 = 1.)
  32313. Retracting rl*prefer*rvt*predict-yes*H0*3
  32314. -->
  32315. (S1 ^operator O2223 = 0.)
  32316. =>WM: (15602: S1 ^operator O2226 +)
  32317. =>WM: (15601: S1 ^operator O2225 +)
  32318. =>WM: (15600: O2226 ^name predict-no)
  32319. =>WM: (15599: O2225 ^name predict-yes)
  32320. =>WM: (15598: R1116 ^value 1)
  32321. =>WM: (15597: R1 ^reward R1116)
  32322. <=WM: (15588: S1 ^operator O2223 +)
  32323. <=WM: (15589: S1 ^operator O2224 +)
  32324. <=WM: (15590: S1 ^operator O2224)
  32325. <=WM: (15583: R1 ^reward R1115)
  32326. <=WM: (15586: O2224 ^name predict-no)
  32327. <=WM: (15585: O2223 ^name predict-yes)
  32328. <=WM: (15584: R1115 ^value 1)
  32329. --- Inner Elaboration Phase, active level 1 (S1) ---
  32330. Firing prefer*rvt*predict-yes*H0
  32331. -->
  32332. Firing rl*prefer*rvt*predict-yes*H0*3
  32333. -->
  32334. (S1 ^operator O2225 = 0.)
  32335. Firing prefer*rvt*predict-no*H0
  32336. -->
  32337. Firing rl*prefer*rvt*predict-no*H0*4
  32338. -->
  32339. (S1 ^operator O2226 = 1.)
  32340. inner elaboration loop at bottom goal.
  32341. Retracting rl*prefer*rvt*predict-no*H0*4
  32342. -->
  32343. (S1 ^operator O2224 = 1.)
  32344. Retracting rl*prefer*rvt*predict-yes*H0*3
  32345. -->
  32346. (S1 ^operator O2223 = 0.)
  32347. --- END Proposal Phase ---
  32348. --- Decision Phase ---
  32349. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32350. =>WM: (15603: S1 ^operator O2226)
  32351. 1113: O: O2226 (predict-no)
  32352. --- END Decision Phase ---
  32353. --- Application Phase ---
  32354. --- Firing Productions (PE) For State At Depth 1 ---
  32355. --- Inner Elaboration Phase, active level 1 (S1) ---
  32356. Firing apply*operator
  32357. -->
  32358. (I3 ^predict-no N1113 + :O )
  32359. Firing apply*operator*complete
  32360. -->
  32361. (I3 ^predict-no N1112 - :O )
  32362. inner elaboration loop at bottom goal.
  32363. --- Change Working Memory (PE) ---
  32364. =>WM: (15604: I3 ^predict-no N1113)
  32365. <=WM: (15592: N1112 ^status complete)
  32366. <=WM: (15591: I3 ^predict-no N1112)
  32367. --- Firing Productions (IE) For State At Depth 1 ---
  32368. --- Inner Elaboration Phase, active level 1 (S1) ---
  32369. Firing monitor*world
  32370. -->
  32371. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32372. --- Change Working Memory (IE) ---
  32373. --- END Application Phase ---
  32374. --- Output Phase ---
  32375. ENV: Agent did: predict-no for direction U in state State-B
  32376. In State-B moving U
  32377. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32378. predict error 0
  32379. dir: dir isU
  32380. --- END Output Phase ---
  32381. \-/--- Input Phase ---
  32382. =>WM: (15608: I2 ^dir U)
  32383. =>WM: (15607: I2 ^reward 1)
  32384. =>WM: (15606: I2 ^see 0)
  32385. =>WM: (15605: N1113 ^status complete)
  32386. <=WM: (15595: I2 ^dir U)
  32387. <=WM: (15594: I2 ^reward 1)
  32388. <=WM: (15593: I2 ^see 0)
  32389. =>WM: (15609: I2 ^level-1 R0-root)
  32390. <=WM: (15596: I2 ^level-1 R0-root)
  32391. --- END Input Phase ---
  32392. --- Proposal Phase ---
  32393. --- Inner Elaboration Phase, active level 1 (S1) ---
  32394. Firing elaborate*copy-see-to-output-link
  32395. -->
  32396. (I3 ^see 0 +)
  32397. Firing elaborate*reward*based*on*reward
  32398. -->
  32399. (R1117 ^value 1 +)
  32400. (R1 ^reward R1117 +)
  32401. Firing propose*predict-yes
  32402. -->
  32403. (O2227 ^name predict-yes +)
  32404. (S1 ^operator O2227 +)
  32405. Firing propose*predict-no
  32406. -->
  32407. (O2228 ^name predict-no +)
  32408. (S1 ^operator O2228 +)
  32409. Firing rl*prefer*rvt*predict-no*H0*4
  32410. -->
  32411. (S1 ^operator O2226 = 1.)
  32412. Firing rl*prefer*rvt*predict-yes*H0*3
  32413. -->
  32414. (S1 ^operator O2225 = 0.)
  32415. Firing prefer*rvt*predict-yes*H0
  32416. -->
  32417. Firing prefer*rvt*predict-no*H0
  32418. -->
  32419. Firing elaborate*copy-dir-to-output-link
  32420. -->
  32421. (I3 ^dir U +)
  32422. inner elaboration loop at bottom goal.
  32423. Retracting elaborate*copy-see-to-output-link
  32424. -->
  32425. (I3 ^see 0 +)
  32426. Retracting propose*predict-no
  32427. -->
  32428. (O2226 ^name predict-no +)
  32429. (S1 ^operator O2226 +)
  32430. Retracting propose*predict-yes
  32431. -->
  32432. (O2225 ^name predict-yes +)
  32433. (S1 ^operator O2225 +)
  32434. Retracting elaborate*reward*based*on*reward
  32435. -->
  32436. (R1116 ^value 1 +)
  32437. (R1 ^reward R1116 +)
  32438. Retracting elaborate*copy-dir-to-output-link
  32439. -->
  32440. (I3 ^dir U +)
  32441. Retracting rl*prefer*rvt*predict-no*H0*4
  32442. -->
  32443. (S1 ^operator O2226 = 1.)
  32444. Retracting rl*prefer*rvt*predict-yes*H0*3
  32445. -->
  32446. (S1 ^operator O2225 = 0.)
  32447. =>WM: (15615: S1 ^operator O2228 +)
  32448. =>WM: (15614: S1 ^operator O2227 +)
  32449. =>WM: (15613: O2228 ^name predict-no)
  32450. =>WM: (15612: O2227 ^name predict-yes)
  32451. =>WM: (15611: R1117 ^value 1)
  32452. =>WM: (15610: R1 ^reward R1117)
  32453. <=WM: (15601: S1 ^operator O2225 +)
  32454. <=WM: (15602: S1 ^operator O2226 +)
  32455. <=WM: (15603: S1 ^operator O2226)
  32456. <=WM: (15597: R1 ^reward R1116)
  32457. <=WM: (15600: O2226 ^name predict-no)
  32458. <=WM: (15599: O2225 ^name predict-yes)
  32459. <=WM: (15598: R1116 ^value 1)
  32460. --- Inner Elaboration Phase, active level 1 (S1) ---
  32461. Firing prefer*rvt*predict-yes*H0
  32462. -->
  32463. Firing rl*prefer*rvt*predict-yes*H0*3
  32464. -->
  32465. (S1 ^operator O2227 = 0.)
  32466. Firing prefer*rvt*predict-no*H0
  32467. -->
  32468. Firing rl*prefer*rvt*predict-no*H0*4
  32469. -->
  32470. (S1 ^operator O2228 = 1.)
  32471. inner elaboration loop at bottom goal.
  32472. Retracting rl*prefer*rvt*predict-no*H0*4
  32473. -->
  32474. (S1 ^operator O2226 = 1.)
  32475. Retracting rl*prefer*rvt*predict-yes*H0*3
  32476. -->
  32477. (S1 ^operator O2225 = 0.)
  32478. --- END Proposal Phase ---
  32479. --- Decision Phase ---
  32480. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32481. =>WM: (15616: S1 ^operator O2228)
  32482. 1114: O: O2228 (predict-no)
  32483. --- END Decision Phase ---
  32484. --- Application Phase ---
  32485. --- Firing Productions (PE) For State At Depth 1 ---
  32486. --- Inner Elaboration Phase, active level 1 (S1) ---
  32487. Firing apply*operator
  32488. -->
  32489. (I3 ^predict-no N1114 + :O )
  32490. Firing apply*operator*complete
  32491. -->
  32492. (I3 ^predict-no N1113 - :O )
  32493. inner elaboration loop at bottom goal.
  32494. --- Change Working Memory (PE) ---
  32495. =>WM: (15617: I3 ^predict-no N1114)
  32496. <=WM: (15605: N1113 ^status complete)
  32497. <=WM: (15604: I3 ^predict-no N1113)
  32498. --- Firing Productions (IE) For State At Depth 1 ---
  32499. --- Inner Elaboration Phase, active level 1 (S1) ---
  32500. Firing monitor*world
  32501. -->
  32502. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32503. --- Change Working Memory (IE) ---
  32504. --- END Application Phase ---
  32505. --- Output Phase ---
  32506. ENV: Agent did: predict-no for direction U in state State-B
  32507. In State-B moving U
  32508. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32509. predict error 0
  32510. dir: dir isL
  32511. --- END Output Phase ---
  32512. |\---- Input Phase ---
  32513. =>WM: (15621: I2 ^dir L)
  32514. =>WM: (15620: I2 ^reward 1)
  32515. =>WM: (15619: I2 ^see 0)
  32516. =>WM: (15618: N1114 ^status complete)
  32517. <=WM: (15608: I2 ^dir U)
  32518. <=WM: (15607: I2 ^reward 1)
  32519. <=WM: (15606: I2 ^see 0)
  32520. =>WM: (15622: I2 ^level-1 R0-root)
  32521. <=WM: (15609: I2 ^level-1 R0-root)
  32522. --- END Input Phase ---
  32523. --- Proposal Phase ---
  32524. --- Inner Elaboration Phase, active level 1 (S1) ---
  32525. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  32526. -->
  32527. (S1 ^operator O2227 = 0.6195794710944548)
  32528. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  32529. -->
  32530. (S1 ^operator O2228 = -0.2190661556260421)
  32531. Firing prefer*rvt*predict-no*H0*2*v1*H1
  32532. -->
  32533. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  32534. -->
  32535. Firing elaborate*copy-see-to-output-link
  32536. -->
  32537. (I3 ^see 0 +)
  32538. Firing elaborate*reward*based*on*reward
  32539. -->
  32540. (R1118 ^value 1 +)
  32541. (R1 ^reward R1118 +)
  32542. Firing propose*predict-yes
  32543. -->
  32544. (O2229 ^name predict-yes +)
  32545. (S1 ^operator O2229 +)
  32546. Firing propose*predict-no
  32547. -->
  32548. (O2230 ^name predict-no +)
  32549. (S1 ^operator O2230 +)
  32550. Firing rl*prefer*rvt*predict-no*H0*2
  32551. -->
  32552. (S1 ^operator O2228 = 0.314041269303462)
  32553. Firing rl*prefer*rvt*predict-yes*H0*1
  32554. -->
  32555. (S1 ^operator O2227 = 0.3804143774620755)
  32556. Firing prefer*rvt*predict-yes*H0
  32557. -->
  32558. Firing prefer*rvt*predict-no*H0
  32559. -->
  32560. Firing elaborate*copy-dir-to-output-link
  32561. -->
  32562. (I3 ^dir L +)
  32563. inner elaboration loop at bottom goal.
  32564. Retracting elaborate*copy-see-to-output-link
  32565. -->
  32566. (I3 ^see 0 +)
  32567. Retracting propose*predict-no
  32568. -->
  32569. (O2228 ^name predict-no +)
  32570. (S1 ^operator O2228 +)
  32571. Retracting propose*predict-yes
  32572. -->
  32573. (O2227 ^name predict-yes +)
  32574. (S1 ^operator O2227 +)
  32575. Retracting elaborate*reward*based*on*reward
  32576. -->
  32577. (R1117 ^value 1 +)
  32578. (R1 ^reward R1117 +)
  32579. Retracting elaborate*copy-dir-to-output-link
  32580. -->
  32581. (I3 ^dir U +)
  32582. Retracting rl*prefer*rvt*predict-no*H0*4
  32583. -->
  32584. (S1 ^operator O2228 = 1.)
  32585. Retracting rl*prefer*rvt*predict-yes*H0*3
  32586. -->
  32587. (S1 ^operator O2227 = 0.)
  32588. =>WM: (15629: S1 ^operator O2230 +)
  32589. =>WM: (15628: S1 ^operator O2229 +)
  32590. =>WM: (15627: I3 ^dir L)
  32591. =>WM: (15626: O2230 ^name predict-no)
  32592. =>WM: (15625: O2229 ^name predict-yes)
  32593. =>WM: (15624: R1118 ^value 1)
  32594. =>WM: (15623: R1 ^reward R1118)
  32595. <=WM: (15614: S1 ^operator O2227 +)
  32596. <=WM: (15615: S1 ^operator O2228 +)
  32597. <=WM: (15616: S1 ^operator O2228)
  32598. <=WM: (15587: I3 ^dir U)
  32599. <=WM: (15610: R1 ^reward R1117)
  32600. <=WM: (15613: O2228 ^name predict-no)
  32601. <=WM: (15612: O2227 ^name predict-yes)
  32602. <=WM: (15611: R1117 ^value 1)
  32603. --- Inner Elaboration Phase, active level 1 (S1) ---
  32604. Firing prefer*rvt*predict-yes*H0
  32605. -->
  32606. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  32607. -->
  32608. (S1 ^operator O2229 = 0.6195794710944548)
  32609. Firing rl*prefer*rvt*predict-yes*H0*1
  32610. -->
  32611. (S1 ^operator O2229 = 0.3804143774620755)
  32612. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  32613. -->
  32614. Firing prefer*rvt*predict-no*H0
  32615. -->
  32616. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  32617. -->
  32618. (S1 ^operator O2230 = -0.2190661556260421)
  32619. Firing rl*prefer*rvt*predict-no*H0*2
  32620. -->
  32621. (S1 ^operator O2230 = 0.314041269303462)
  32622. Firing prefer*rvt*predict-no*H0*2*v1*H1
  32623. -->
  32624. inner elaboration loop at bottom goal.
  32625. Retracting rl*prefer*rvt*predict-no*H0*2
  32626. -->
  32627. (S1 ^operator O2228 = 0.314041269303462)
  32628. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*34
  32629. -->
  32630. (S1 ^operator O2228 = -0.2190661556260421)
  32631. Retracting rl*prefer*rvt*predict-yes*H0*1
  32632. -->
  32633. (S1 ^operator O2227 = 0.3804143774620755)
  32634. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*35
  32635. -->
  32636. (S1 ^operator O2227 = 0.6195794710944548)
  32637. --- END Proposal Phase ---
  32638. --- Decision Phase ---
  32639. RL update rl*prefer*rvt*predict-no*H0*4 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32640. =>WM: (15630: S1 ^operator O2229)
  32641. 1115: O: O2229 (predict-yes)
  32642. --- END Decision Phase ---
  32643. --- Application Phase ---
  32644. --- Firing Productions (PE) For State At Depth 1 ---
  32645. --- Inner Elaboration Phase, active level 1 (S1) ---
  32646. Firing apply*operator
  32647. -->
  32648. (I3 ^predict-yes N1115 + :O )
  32649. Firing apply*operator*complete
  32650. -->
  32651. (I3 ^predict-no N1114 - :O )
  32652. inner elaboration loop at bottom goal.
  32653. --- Change Working Memory (PE) ---
  32654. =>WM: (15631: I3 ^predict-yes N1115)
  32655. <=WM: (15618: N1114 ^status complete)
  32656. <=WM: (15617: I3 ^predict-no N1114)
  32657. --- Firing Productions (IE) For State At Depth 1 ---
  32658. --- Inner Elaboration Phase, active level 1 (S1) ---
  32659. Firing monitor*world
  32660. -->
  32661. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32662. --- Change Working Memory (IE) ---
  32663. --- END Application Phase ---
  32664. --- Output Phase ---
  32665. ENV: Agent did: predict-yes for direction L in state State-B
  32666. In State-B moving L
  32667. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  32668. predict error 0
  32669. dir: dir isL
  32670. --- END Output Phase ---
  32671. /|--- Input Phase ---
  32672. =>WM: (15635: I2 ^dir L)
  32673. =>WM: (15634: I2 ^reward 1)
  32674. =>WM: (15633: I2 ^see 1)
  32675. =>WM: (15632: N1115 ^status complete)
  32676. <=WM: (15621: I2 ^dir L)
  32677. <=WM: (15620: I2 ^reward 1)
  32678. <=WM: (15619: I2 ^see 0)
  32679. =>WM: (15636: I2 ^level-1 L1-root)
  32680. <=WM: (15622: I2 ^level-1 R0-root)
  32681. --- END Input Phase ---
  32682. --- Proposal Phase ---
  32683. --- Inner Elaboration Phase, ac