/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_4.txt

https://bitbucket.org/evan13579b/soar-ziggurat

  1. Seeding... 4
  2. dir: dir isL
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 4 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_4.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-/|sleeping...
  20. \-/|\-/sleeping...
  21. |1: O: O2 (predict-no)
  22. I see 0 and I'm going to do: predict-no
  23. ENV: Agent did: predict-no for direction L in state State-A
  24. In State-A moving L
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26. predict error 0
  27. dir: dir isL
  28. rule alias: '*'
  29. rule alias: '*'
  30. \-/|\-/2: O: O3 (predict-yes)
  31. I see 1 and I'm going to do: predict-yes
  32. ENV: Agent did: predict-yes for direction L in state State-A
  33. In State-A moving L
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  35. predict error 1
  36. dir: dir isR
  37. |\-3: O: O6 (predict-no)
  38. I see 0 and I'm going to do: predict-no
  39. ENV: Agent did: predict-no for direction R in state State-A
  40. In State-A moving R
  41. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  42. predict error 1
  43. dir: dir isL
  44. /|\4: O: O8 (predict-no)
  45. I see 0 and I'm going to do: predict-no
  46. ENV: Agent did: predict-no for direction L in state State-B
  47. In State-B moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  49. predict error 1
  50. dir: dir isL
  51. -/5: O: O10 (predict-no)
  52. I see 0 and I'm going to do: predict-no
  53. ENV: Agent did: predict-no for direction L in state State-A
  54. In State-A moving L
  55. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  56. predict error 0
  57. dir: dir isR
  58. |\-6: O: O11 (predict-yes)
  59. I see 1 and I'm going to do: predict-yes
  60. ENV: Agent did: predict-yes for direction R in state State-A
  61. In State-A moving R
  62. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  63. predict error 0
  64. dir: dir isU
  65. /|7: O: O14 (predict-no)
  66. I see 1 and I'm going to do: predict-no
  67. ENV: Agent did: predict-no for direction U in state State-B
  68. In State-B moving U
  69. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  70. predict error 0
  71. dir: dir isU
  72. \-/8: O: O15 (predict-yes)
  73. I see 1 and I'm going to do: predict-yes
  74. ENV: Agent did: predict-yes for direction U in state State-B
  75. In State-B moving U
  76. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  77. predict error 1
  78. dir: dir isU
  79. |\-9: O: O17 (predict-yes)
  80. I see 0 and I'm going to do: predict-yes
  81. ENV: Agent did: predict-yes for direction U in state State-B
  82. In State-B moving U
  83. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  84. predict error 1
  85. dir: dir isL
  86. /|\-10: O: O19 (predict-yes)
  87. I see 0 and I'm going to do: predict-yes
  88. ENV: Agent did: predict-yes for direction L in state State-B
  89. In State-B moving L
  90. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  91. predict error 0
  92. dir: dir isR
  93. /|\11: O: O21 (predict-yes)
  94. I see 1 and I'm going to do: predict-yes
  95. ENV: Agent did: predict-yes for direction R in state State-A
  96. In State-A moving R
  97. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  98. predict error 0
  99. dir: dir isL
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. -12: O: O24 (predict-no)
  105. I see 1 and I'm going to do: predict-no
  106. ENV: Agent did: predict-no for direction L in state State-B
  107. In State-B moving L
  108. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  109. predict error 1
  110. dir: dir isL
  111. /|\13: O: O26 (predict-no)
  112. I see 0 and I'm going to do: predict-no
  113. ENV: Agent did: predict-no for direction L in state State-A
  114. In State-A moving L
  115. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  116. predict error 0
  117. dir: dir isL
  118. -/|14: O: O28 (predict-no)
  119. I see 1 and I'm going to do: predict-no
  120. ENV: Agent did: predict-no for direction L in state State-A
  121. In State-A moving L
  122. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  123. predict error 0
  124. dir: dir isL
  125. \-/15: O: O30 (predict-no)
  126. I see 1 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction L in state State-A
  128. In State-A moving L
  129. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  130. predict error 0
  131. dir: dir isU
  132. |\-16: O: O32 (predict-no)
  133. I see 1 and I'm going to do: predict-no
  134. ENV: Agent did: predict-no for direction U in state State-A
  135. In State-A moving U
  136. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  137. predict error 0
  138. dir: dir isU
  139. /|17: O: O34 (predict-no)
  140. I see 1 and I'm going to do: predict-no
  141. ENV: Agent did: predict-no for direction U in state State-A
  142. In State-A moving U
  143. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  144. predict error 0
  145. dir: dir isU
  146. \-18: O: O36 (predict-no)
  147. I see 1 and I'm going to do: predict-no
  148. ENV: Agent did: predict-no for direction U in state State-A
  149. In State-A moving U
  150. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  151. predict error 0
  152. dir: dir isU
  153. /|\19: O: O38 (predict-no)
  154. I see 1 and I'm going to do: predict-no
  155. ENV: Agent did: predict-no for direction U in state State-A
  156. In State-A moving U
  157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  158. predict error 0
  159. dir: dir isL
  160. -/|20: O: O39 (predict-yes)
  161. I see 1 and I'm going to do: predict-yes
  162. ENV: Agent did: predict-yes for direction L in state State-A
  163. In State-A moving L
  164. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  165. predict error 1
  166. dir: dir isL
  167. \-/|sleeping...
  168. \21: O: O42 (predict-no)
  169. I see 0 and I'm going to do: predict-no
  170. ENV: Agent did: predict-no for direction L in state State-A
  171. In State-A moving L
  172. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  173. predict error 0
  174. dir: dir isR
  175. -22: O: O43 (predict-yes)
  176. I see 1 and I'm going to do: predict-yes
  177. ENV: Agent did: predict-yes for direction R in state State-A
  178. In State-A moving R
  179. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  180. predict error 0
  181. dir: dir isU
  182. /|23: O: O45 (predict-yes)
  183. I see 1 and I'm going to do: predict-yes
  184. ENV: Agent did: predict-yes for direction U in state State-B
  185. In State-B moving U
  186. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  187. predict error 1
  188. dir: dir isU
  189. \-/24: O: O48 (predict-no)
  190. I see 0 and I'm going to do: predict-no
  191. ENV: Agent did: predict-no for direction U in state State-B
  192. In State-B moving U
  193. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  194. predict error 0
  195. dir: dir isU
  196. |\-25: O: O50 (predict-no)
  197. I see 1 and I'm going to do: predict-no
  198. ENV: Agent did: predict-no for direction U in state State-B
  199. In State-B moving U
  200. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  201. predict error 0
  202. dir: dir isL
  203. /|\-26: O: O52 (predict-no)
  204. I see 1 and I'm going to do: predict-no
  205. ENV: Agent did: predict-no for direction L in state State-B
  206. In State-B moving L
  207. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  208. predict error 1
  209. dir: dir isR
  210. /|27: O: O53 (predict-yes)
  211. I see 0 and I'm going to do: predict-yes
  212. ENV: Agent did: predict-yes for direction R in state State-A
  213. In State-A moving R
  214. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  215. predict error 0
  216. dir: dir isU
  217. \-/28: O: O56 (predict-no)
  218. I see 1 and I'm going to do: predict-no
  219. ENV: Agent did: predict-no for direction U in state State-B
  220. In State-B moving U
  221. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  222. predict error 0
  223. dir: dir isR
  224. |\-29: O: O57 (predict-yes)
  225. I see 1 and I'm going to do: predict-yes
  226. ENV: Agent did: predict-yes for direction R in state State-B
  227. In State-B moving R
  228. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  229. predict error 1
  230. dir: dir isL
  231. /|\30: O: O60 (predict-no)
  232. I see 0 and I'm going to do: predict-no
  233. ENV: Agent did: predict-no for direction L in state State-B
  234. In State-B moving L
  235. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  236. predict error 1
  237. dir: dir isR
  238. -31: O: O61 (predict-yes)
  239. I see 0 and I'm going to do: predict-yes
  240. ENV: Agent did: predict-yes for direction R in state State-A
  241. In State-A moving R
  242. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  243. predict error 0
  244. dir: dir isL
  245. /32: O: O64 (predict-no)
  246. I see 1 and I'm going to do: predict-no
  247. ENV: Agent did: predict-no for direction L in state State-B
  248. In State-B moving L
  249. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  250. predict error 1
  251. dir: dir isU
  252. |\-33: O: O66 (predict-no)
  253. I see 0 and I'm going to do: predict-no
  254. ENV: Agent did: predict-no for direction U in state State-A
  255. In State-A moving U
  256. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  257. predict error 0
  258. dir: dir isU
  259. /|\34: O: O68 (predict-no)
  260. I see 1 and I'm going to do: predict-no
  261. ENV: Agent did: predict-no for direction U in state State-A
  262. In State-A moving U
  263. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  264. predict error 0
  265. dir: dir isR
  266. -/|35: O: O69 (predict-yes)
  267. I see 1 and I'm going to do: predict-yes
  268. ENV: Agent did: predict-yes for direction R in state State-A
  269. In State-A moving R
  270. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  271. predict error 0
  272. dir: dir isL
  273. \-36: O: O72 (predict-no)
  274. I see 1 and I'm going to do: predict-no
  275. ENV: Agent did: predict-no for direction L in state State-B
  276. In State-B moving L
  277. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  278. predict error 1
  279. dir: dir isU
  280. /|\37: O: O74 (predict-no)
  281. I see 0 and I'm going to do: predict-no
  282. ENV: Agent did: predict-no for direction U in state State-A
  283. In State-A moving U
  284. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  285. predict error 0
  286. dir: dir isR
  287. -/|38: O: O76 (predict-no)
  288. I see 1 and I'm going to do: predict-no
  289. ENV: Agent did: predict-no for direction R in state State-A
  290. In State-A moving R
  291. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  292. predict error 1
  293. dir: dir isU
  294. \-/39: O: O78 (predict-no)
  295. I see 0 and I'm going to do: predict-no
  296. ENV: Agent did: predict-no for direction U in state State-B
  297. In State-B moving U
  298. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  299. predict error 0
  300. dir: dir isU
  301. |\-40: O: O80 (predict-no)
  302. I see 1 and I'm going to do: predict-no
  303. ENV: Agent did: predict-no for direction U in state State-B
  304. In State-B moving U
  305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  306. predict error 0
  307. dir: dir isR
  308. /|\-41: O: O81 (predict-yes)
  309. I see 1 and I'm going to do: predict-yes
  310. ENV: Agent did: predict-yes for direction R in state State-B
  311. In State-B moving R
  312. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  313. predict error 1
  314. dir: dir isR
  315. /42: O: O83 (predict-yes)
  316. I see 0 and I'm going to do: predict-yes
  317. ENV: Agent did: predict-yes for direction R in state State-B
  318. In State-B moving R
  319. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  320. predict error 1
  321. dir: dir isR
  322. |\-/43: O: O85 (predict-yes)
  323. I see 0 and I'm going to do: predict-yes
  324. ENV: Agent did: predict-yes for direction R in state State-B
  325. In State-B moving R
  326. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  327. predict error 1
  328. dir: dir isR
  329. |\-44: O: O87 (predict-yes)
  330. I see 0 and I'm going to do: predict-yes
  331. ENV: Agent did: predict-yes for direction R in state State-B
  332. In State-B moving R
  333. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  334. predict error 1
  335. dir: dir isL
  336. /|\45: O: O89 (predict-yes)
  337. I see 0 and I'm going to do: predict-yes
  338. ENV: Agent did: predict-yes for direction L in state State-B
  339. In State-B moving L
  340. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  341. predict error 0
  342. dir: dir isL
  343. -/|46: O: O91 (predict-yes)
  344. I see 1 and I'm going to do: predict-yes
  345. ENV: Agent did: predict-yes for direction L in state State-A
  346. In State-A moving L
  347. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  348. predict error 1
  349. dir: dir isU
  350. \-/47: O: O94 (predict-no)
  351. I see 0 and I'm going to do: predict-no
  352. ENV: Agent did: predict-no for direction U in state State-A
  353. In State-A moving U
  354. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  355. predict error 0
  356. dir: dir isL
  357. |\48: O: O95 (predict-yes)
  358. I see 1 and I'm going to do: predict-yes
  359. ENV: Agent did: predict-yes for direction L in state State-A
  360. In State-A moving L
  361. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  362. predict error 1
  363. dir: dir isL
  364. -/49: O: O97 (predict-yes)
  365. I see 0 and I'm going to do: predict-yes
  366. ENV: Agent did: predict-yes for direction L in state State-A
  367. In State-A moving L
  368. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  369. predict error 1
  370. dir: dir isR
  371. |\-/50: O: O99 (predict-yes)
  372. I see 0 and I'm going to do: predict-yes
  373. ENV: Agent did: predict-yes for direction R in state State-A
  374. In State-A moving R
  375. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  376. predict error 0
  377. dir: dir isL
  378. |\-/|\-sleeping...
  379. /sleeping...
  380. |51: O: O102 (predict-no)
  381. I see 1 and I'm going to do: predict-no
  382. ENV: Agent did: predict-no for direction L in state State-B
  383. In State-B moving L
  384. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  385. predict error 1
  386. dir: dir isR
  387. rule alias: '*'
  388. rule alias: '*'
  389. \52: O: O103 (predict-yes)
  390. I see 0 and I'm going to do: predict-yes
  391. ENV: Agent did: predict-yes for direction R in state State-A
  392. In State-A moving R
  393. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  394. predict error 0
  395. dir: dir isR
  396. -/53: O: O106 (predict-no)
  397. I see 1 and I'm going to do: predict-no
  398. ENV: Agent did: predict-no for direction R in state State-B
  399. In State-B moving R
  400. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  401. predict error 0
  402. dir: dir isR
  403. |\-54: O: O107 (predict-yes)
  404. I see 1 and I'm going to do: predict-yes
  405. ENV: Agent did: predict-yes for direction R in state State-B
  406. In State-B moving R
  407. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  408. predict error 1
  409. dir: dir isU
  410. /|\55: O: O110 (predict-no)
  411. I see 0 and I'm going to do: predict-no
  412. ENV: Agent did: predict-no for direction U in state State-B
  413. In State-B moving U
  414. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  415. predict error 0
  416. dir: dir isL
  417. -/|\sleeping...
  418. -56: O: O112 (predict-no)
  419. I see 1 and I'm going to do: predict-no
  420. ENV: Agent did: predict-no for direction L in state State-B
  421. In State-B moving L
  422. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  423. predict error 1
  424. dir: dir isR
  425. /|57: O: O113 (predict-yes)
  426. I see 0 and I'm going to do: predict-yes
  427. ENV: Agent did: predict-yes for direction R in state State-A
  428. In State-A moving R
  429. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  430. predict error 0
  431. dir: dir isL
  432. \-58: O: O115 (predict-yes)
  433. I see 1 and I'm going to do: predict-yes
  434. ENV: Agent did: predict-yes for direction L in state State-B
  435. In State-B moving L
  436. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  437. predict error 0
  438. dir: dir isR
  439. /|59: O: O117 (predict-yes)
  440. I see 1 and I'm going to do: predict-yes
  441. ENV: Agent did: predict-yes for direction R in state State-A
  442. In State-A moving R
  443. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  444. predict error 0
  445. dir: dir isL
  446. \-/60: O: O119 (predict-yes)
  447. I see 1 and I'm going to do: predict-yes
  448. ENV: Agent did: predict-yes for direction L in state State-B
  449. In State-B moving L
  450. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  451. predict error 0
  452. dir: dir isR
  453. |\-/61: O: O121 (predict-yes)
  454. I see 1 and I'm going to do: predict-yes
  455. ENV: Agent did: predict-yes for direction R in state State-A
  456. In State-A moving R
  457. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  458. predict error 0
  459. dir: dir isU
  460. rule alias: '*'
  461. rule alias: '*'
  462. rule alias: '*'
  463. rule alias: '*'
  464. rule alias: '*'
  465. rule alias: '*'
  466. |62: O: O124 (predict-no)
  467. I see 1 and I'm going to do: predict-no
  468. ENV: Agent did: predict-no for direction U in state State-B
  469. In State-B moving U
  470. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  471. predict error 0
  472. dir: dir isL
  473. \-/63: O: O125 (predict-yes)
  474. I see 1 and I'm going to do: predict-yes
  475. ENV: Agent did: predict-yes for direction L in state State-B
  476. In State-B moving L
  477. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  478. predict error 0
  479. dir: dir isR
  480. |\-64: O: O127 (predict-yes)
  481. I see 1 and I'm going to do: predict-yes
  482. ENV: Agent did: predict-yes for direction R in state State-A
  483. In State-A moving R
  484. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  485. predict error 0
  486. dir: dir isU
  487. /|65: O: O130 (predict-no)
  488. I see 1 and I'm going to do: predict-no
  489. ENV: Agent did: predict-no for direction U in state State-B
  490. In State-B moving U
  491. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  492. predict error 0
  493. dir: dir isL
  494. \-/66: O: O131 (predict-yes)
  495. I see 1 and I'm going to do: predict-yes
  496. ENV: Agent did: predict-yes for direction L in state State-B
  497. In State-B moving L
  498. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  499. predict error 0
  500. dir: dir isL
  501. |\-67: O: O133 (predict-yes)
  502. I see 1 and I'm going to do: predict-yes
  503. ENV: Agent did: predict-yes for direction L in state State-A
  504. In State-A moving L
  505. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  506. predict error 1
  507. dir: dir isL
  508. /|\68: O: O135 (predict-yes)
  509. I see 0 and I'm going to do: predict-yes
  510. ENV: Agent did: predict-yes for direction L in state State-A
  511. In State-A moving L
  512. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  513. predict error 1
  514. dir: dir isU
  515. -/|69: O: O138 (predict-no)
  516. I see 0 and I'm going to do: predict-no
  517. ENV: Agent did: predict-no for direction U in state State-A
  518. In State-A moving U
  519. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  520. predict error 0
  521. dir: dir isR
  522. \-70: O: O140 (predict-no)
  523. I see 1 and I'm going to do: predict-no
  524. ENV: Agent did: predict-no for direction R in state State-A
  525. In State-A moving R
  526. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  527. predict error 1
  528. dir: dir isL
  529. /|\-71: O: O141 (predict-yes)
  530. I see 0 and I'm going to do: predict-yes
  531. ENV: Agent did: predict-yes for direction L in state State-B
  532. In State-B moving L
  533. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  534. predict error 0
  535. dir: dir isL
  536. rule alias: '*'
  537. rule alias: '*'
  538. rule alias: '*'
  539. rule alias: '*'
  540. rule alias: '*'
  541. rule alias: '*'
  542. rule alias: '*'
  543. rule alias: '*'
  544. /72: O: O143 (predict-yes)
  545. I see 1 and I'm going to do: predict-yes
  546. ENV: Agent did: predict-yes for direction L in state State-A
  547. In State-A moving L
  548. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  549. predict error 1
  550. dir: dir isL
  551. |\-73: O: O145 (predict-yes)
  552. I see 0 and I'm going to do: predict-yes
  553. ENV: Agent did: predict-yes for direction L in state State-A
  554. In State-A moving L
  555. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  556. predict error 1
  557. dir: dir isR
  558. /|\74: O: O147 (predict-yes)
  559. I see 0 and I'm going to do: predict-yes
  560. ENV: Agent did: predict-yes for direction R in state State-A
  561. In State-A moving R
  562. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  563. predict error 0
  564. dir: dir isL
  565. -/75: O: O149 (predict-yes)
  566. I see 1 and I'm going to do: predict-yes
  567. ENV: Agent did: predict-yes for direction L in state State-B
  568. In State-B moving L
  569. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  570. predict error 0
  571. dir: dir isU
  572. |\-76: O: O151 (predict-yes)
  573. I see 1 and I'm going to do: predict-yes
  574. ENV: Agent did: predict-yes for direction U in state State-A
  575. In State-A moving U
  576. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  577. predict error 1
  578. dir: dir isU
  579. /|\77: O: O154 (predict-no)
  580. I see 0 and I'm going to do: predict-no
  581. ENV: Agent did: predict-no for direction U in state State-A
  582. In State-A moving U
  583. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  584. predict error 0
  585. dir: dir isU
  586. -/|\78: O: O156 (predict-no)
  587. I see 1 and I'm going to do: predict-no
  588. ENV: Agent did: predict-no for direction U in state State-A
  589. In State-A moving U
  590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  591. predict error 0
  592. dir: dir isU
  593. -79: O: O158 (predict-no)
  594. I see 1 and I'm going to do: predict-no
  595. ENV: Agent did: predict-no for direction U in state State-A
  596. In State-A moving U
  597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  598. predict error 0
  599. dir: dir isU
  600. /|\80: O: O159 (predict-yes)
  601. I see 1 and I'm going to do: predict-yes
  602. ENV: Agent did: predict-yes for direction U in state State-A
  603. In State-A moving U
  604. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  605. predict error 1
  606. dir: dir isL
  607. -/|\81: O: O161 (predict-yes)
  608. I see 0 and I'm going to do: predict-yes
  609. ENV: Agent did: predict-yes for direction L in state State-A
  610. In State-A moving L
  611. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  612. predict error 1
  613. dir: dir isL
  614. rule alias: '*'
  615. rule alias: '*'
  616. rule alias: '*'
  617. rule alias: '*'
  618. -82: O: O163 (predict-yes)
  619. I see 0 and I'm going to do: predict-yes
  620. ENV: Agent did: predict-yes for direction L in state State-A
  621. In State-A moving L
  622. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  623. predict error 1
  624. dir: dir isU
  625. /|\83: O: O166 (predict-no)
  626. I see 0 and I'm going to do: predict-no
  627. ENV: Agent did: predict-no for direction U in state State-A
  628. In State-A moving U
  629. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  630. predict error 0
  631. dir: dir isU
  632. -/|\84: O: O168 (predict-no)
  633. I see 1 and I'm going to do: predict-no
  634. ENV: Agent did: predict-no for direction U in state State-A
  635. In State-A moving U
  636. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  637. predict error 0
  638. dir: dir isR
  639. -/85: O: O169 (predict-yes)
  640. I see 1 and I'm going to do: predict-yes
  641. ENV: Agent did: predict-yes for direction R in state State-A
  642. In State-A moving R
  643. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  644. predict error 0
  645. dir: dir isU
  646. |\-/86: O: O172 (predict-no)
  647. I see 1 and I'm going to do: predict-no
  648. ENV: Agent did: predict-no for direction U in state State-B
  649. In State-B moving U
  650. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  651. predict error 0
  652. dir: dir isR
  653. |\87: O: O173 (predict-yes)
  654. I see 1 and I'm going to do: predict-yes
  655. ENV: Agent did: predict-yes for direction R in state State-B
  656. In State-B moving R
  657. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  658. predict error 1
  659. dir: dir isU
  660. -/88: O: O176 (predict-no)
  661. I see 0 and I'm going to do: predict-no
  662. ENV: Agent did: predict-no for direction U in state State-B
  663. In State-B moving U
  664. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  665. predict error 0
  666. dir: dir isL
  667. |\-89: O: O177 (predict-yes)
  668. I see 1 and I'm going to do: predict-yes
  669. ENV: Agent did: predict-yes for direction L in state State-B
  670. In State-B moving L
  671. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  672. predict error 0
  673. dir: dir isL
  674. /|90: O: O180 (predict-no)
  675. I see 1 and I'm going to do: predict-no
  676. ENV: Agent did: predict-no for direction L in state State-A
  677. In State-A moving L
  678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  679. predict error 0
  680. dir: dir isR
  681. \-/91: O: O181 (predict-yes)
  682. I see 1 and I'm going to do: predict-yes
  683. ENV: Agent did: predict-yes for direction R in state State-A
  684. In State-A moving R
  685. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  686. predict error 0
  687. dir: dir isL
  688. rule alias: '*'
  689. rule alias: '*'
  690. rule alias: '*'
  691. rule alias: '*'
  692. |92: O: O183 (predict-yes)
  693. I see 1 and I'm going to do: predict-yes
  694. ENV: Agent did: predict-yes for direction L in state State-B
  695. In State-B moving L
  696. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  697. predict error 0
  698. dir: dir isR
  699. \-/93: O: O185 (predict-yes)
  700. I see 1 and I'm going to do: predict-yes
  701. ENV: Agent did: predict-yes for direction R in state State-A
  702. In State-A moving R
  703. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  704. predict error 0
  705. dir: dir isU
  706. |\-94: O: O188 (predict-no)
  707. I see 1 and I'm going to do: predict-no
  708. ENV: Agent did: predict-no for direction U in state State-B
  709. In State-B moving U
  710. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  711. predict error 0
  712. dir: dir isL
  713. /|\95: O: O189 (predict-yes)
  714. I see 1 and I'm going to do: predict-yes
  715. ENV: Agent did: predict-yes for direction L in state State-B
  716. In State-B moving L
  717. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  718. predict error 0
  719. dir: dir isL
  720. -/|\96: O: O192 (predict-no)
  721. I see 1 and I'm going to do: predict-no
  722. ENV: Agent did: predict-no for direction L in state State-A
  723. In State-A moving L
  724. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  725. predict error 0
  726. dir: dir isL
  727. -/|97: O: O194 (predict-no)
  728. I see 1 and I'm going to do: predict-no
  729. ENV: Agent did: predict-no for direction L in state State-A
  730. In State-A moving L
  731. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  732. predict error 0
  733. dir: dir isL
  734. \-/98: O: O196 (predict-no)
  735. I see 1 and I'm going to do: predict-no
  736. ENV: Agent did: predict-no for direction L in state State-A
  737. In State-A moving L
  738. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  739. predict error 0
  740. dir: dir isU
  741. |\99: O: O198 (predict-no)
  742. I see 1 and I'm going to do: predict-no
  743. ENV: Agent did: predict-no for direction U in state State-A
  744. In State-A moving U
  745. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  746. predict error 0
  747. dir: dir isR
  748. -100: O: O199 (predict-yes)
  749. I see 1 and I'm going to do: predict-yes
  750. ENV: Agent did: predict-yes for direction R in state State-A
  751. In State-A moving R
  752. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  753. predict error 0
  754. dir: dir isL
  755. /|\101: O: O201 (predict-yes)
  756. I see 1 and I'm going to do: predict-yes
  757. ENV: Agent did: predict-yes for direction L in state State-B
  758. In State-B moving L
  759. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  760. predict error 0
  761. dir: dir isL
  762. -/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|sleeping...
  763. \102: O: O204 (predict-no)
  764. I see 1 and I'm going to do: predict-no
  765. ENV: Agent did: predict-no for direction L in state State-A
  766. In State-A moving L
  767. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  768. predict error 0
  769. dir: dir isU
  770. -/|\103: O: O206 (predict-no)
  771. I see 1 and I'm going to do: predict-no
  772. ENV: Agent did: predict-no for direction U in state State-A
  773. In State-A moving U
  774. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  775. predict error 0
  776. dir: dir isR
  777. -/|104: O: O208 (predict-no)
  778. I see 1 and I'm going to do: predict-no
  779. ENV: Agent did: predict-no for direction R in state State-A
  780. In State-A moving R
  781. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  782. predict error 1
  783. dir: dir isL
  784. \-105: O: O209 (predict-yes)
  785. I see 0 and I'm going to do: predict-yes
  786. ENV: Agent did: predict-yes for direction L in state State-B
  787. In State-B moving L
  788. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  789. predict error 0
  790. dir: dir isL
  791. /106: O: O211 (predict-yes)
  792. I see 1 and I'm going to do: predict-yes
  793. ENV: Agent did: predict-yes for direction L in state State-A
  794. In State-A moving L
  795. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  796. predict error 1
  797. dir: dir isL
  798. |\107: O: O214 (predict-no)
  799. I see 0 and I'm going to do: predict-no
  800. ENV: Agent did: predict-no for direction L in state State-A
  801. In State-A moving L
  802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  803. predict error 0
  804. dir: dir isL
  805. -/108: O: O216 (predict-no)
  806. I see 1 and I'm going to do: predict-no
  807. ENV: Agent did: predict-no for direction L in state State-A
  808. In State-A moving L
  809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  810. predict error 0
  811. dir: dir isU
  812. |\-109: O: O218 (predict-no)
  813. I see 1 and I'm going to do: predict-no
  814. ENV: Agent did: predict-no for direction U in state State-A
  815. In State-A moving U
  816. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  817. predict error 0
  818. dir: dir isL
  819. /|110: O: O220 (predict-no)
  820. I see 1 and I'm going to do: predict-no
  821. ENV: Agent did: predict-no for direction L in state State-A
  822. In State-A moving L
  823. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  824. predict error 0
  825. dir: dir isL
  826. \-111: O: O221 (predict-yes)
  827. I see 1 and I'm going to do: predict-yes
  828. ENV: Agent did: predict-yes for direction L in state State-A
  829. In State-A moving L
  830. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  831. predict error 1
  832. dir: dir isR
  833. rule alias: '*'
  834. rule alias: '*'
  835. rule alias: '*'
  836. rule alias: '*'
  837. rule alias: '*'
  838. rule alias: '*'
  839. rule alias: '*'
  840. rule alias: '*'
  841. /112: O: O223 (predict-yes)
  842. I see 0 and I'm going to do: predict-yes
  843. ENV: Agent did: predict-yes for direction R in state State-A
  844. In State-A moving R
  845. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  846. predict error 0
  847. dir: dir isL
  848. |\-113: O: O226 (predict-no)
  849. I see 1 and I'm going to do: predict-no
  850. ENV: Agent did: predict-no for direction L in state State-B
  851. In State-B moving L
  852. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  853. predict error 1
  854. dir: dir isU
  855. /|\114: O: O228 (predict-no)
  856. I see 0 and I'm going to do: predict-no
  857. ENV: Agent did: predict-no for direction U in state State-A
  858. In State-A moving U
  859. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  860. predict error 0
  861. dir: dir isL
  862. -/115: O: O229 (predict-yes)
  863. I see 1 and I'm going to do: predict-yes
  864. ENV: Agent did: predict-yes for direction L in state State-A
  865. In State-A moving L
  866. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  867. predict error 1
  868. dir: dir isR
  869. |\-116: O: O231 (predict-yes)
  870. I see 0 and I'm going to do: predict-yes
  871. ENV: Agent did: predict-yes for direction R in state State-A
  872. In State-A moving R
  873. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  874. predict error 0
  875. dir: dir isU
  876. /|\117: O: O234 (predict-no)
  877. I see 1 and I'm going to do: predict-no
  878. ENV: Agent did: predict-no for direction U in state State-B
  879. In State-B moving U
  880. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  881. predict error 0
  882. dir: dir isR
  883. -/|118: O: O235 (predict-yes)
  884. I see 1 and I'm going to do: predict-yes
  885. ENV: Agent did: predict-yes for direction R in state State-B
  886. In State-B moving R
  887. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  888. predict error 1
  889. dir: dir isU
  890. \-/119: O: O238 (predict-no)
  891. I see 0 and I'm going to do: predict-no
  892. ENV: Agent did: predict-no for direction U in state State-B
  893. In State-B moving U
  894. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  895. predict error 0
  896. dir: dir isR
  897. |\-120: O: O239 (predict-yes)
  898. I see 1 and I'm going to do: predict-yes
  899. ENV: Agent did: predict-yes for direction R in state State-B
  900. In State-B moving R
  901. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  902. predict error 1
  903. dir: dir isL
  904. /|\121: O: O242 (predict-no)
  905. I see 0 and I'm going to do: predict-no
  906. ENV: Agent did: predict-no for direction L in state State-B
  907. In State-B moving L
  908. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  909. predict error 1
  910. dir: dir isR
  911. rule alias: '*'
  912. rule alias: '*'
  913. rule alias: '*'
  914. rule alias: '*'
  915. rule alias: '*'
  916. rule alias: '*'
  917. -122: O: O243 (predict-yes)
  918. I see 0 and I'm going to do: predict-yes
  919. ENV: Agent did: predict-yes for direction R in state State-A
  920. In State-A moving R
  921. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  922. predict error 0
  923. dir: dir isL
  924. /|\-123: O: O245 (predict-yes)
  925. I see 1 and I'm going to do: predict-yes
  926. ENV: Agent did: predict-yes for direction L in state State-B
  927. In State-B moving L
  928. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  929. predict error 0
  930. dir: dir isR
  931. /|\124: O: O248 (predict-no)
  932. I see 1 and I'm going to do: predict-no
  933. ENV: Agent did: predict-no for direction R in state State-A
  934. In State-A moving R
  935. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  936. predict error 1
  937. dir: dir isL
  938. -125: O: O249 (predict-yes)
  939. I see 0 and I'm going to do: predict-yes
  940. ENV: Agent did: predict-yes for direction L in state State-B
  941. In State-B moving L
  942. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  943. predict error 0
  944. dir: dir isU
  945. /|\126: O: O252 (predict-no)
  946. I see 1 and I'm going to do: predict-no
  947. ENV: Agent did: predict-no for direction U in state State-A
  948. In State-A moving U
  949. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  950. predict error 0
  951. dir: dir isU
  952. -/|\127: O: O254 (predict-no)
  953. I see 1 and I'm going to do: predict-no
  954. ENV: Agent did: predict-no for direction U in state State-A
  955. In State-A moving U
  956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  957. predict error 0
  958. dir: dir isR
  959. -/|\128: O: O255 (predict-yes)
  960. I see 1 and I'm going to do: predict-yes
  961. ENV: Agent did: predict-yes for direction R in state State-A
  962. In State-A moving R
  963. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  964. predict error 0
  965. dir: dir isL
  966. -/129: O: O257 (predict-yes)
  967. I see 1 and I'm going to do: predict-yes
  968. ENV: Agent did: predict-yes for direction L in state State-B
  969. In State-B moving L
  970. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  971. predict error 0
  972. dir: dir isL
  973. |\-130: O: O259 (predict-yes)
  974. I see 1 and I'm going to do: predict-yes
  975. ENV: Agent did: predict-yes for direction L in state State-A
  976. In State-A moving L
  977. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  978. predict error 1
  979. dir: dir isL
  980. /|131: O: O262 (predict-no)
  981. I see 0 and I'm going to do: predict-no
  982. ENV: Agent did: predict-no for direction L in state State-A
  983. In State-A moving L
  984. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  985. predict error 0
  986. dir: dir isL
  987. rule alias: '*'
  988. rule alias: '*'
  989. \132: O: O264 (predict-no)
  990. I see 1 and I'm going to do: predict-no
  991. ENV: Agent did: predict-no for direction L in state State-A
  992. In State-A moving L
  993. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  994. predict error 0
  995. dir: dir isL
  996. -/133: O: O266 (predict-no)
  997. I see 1 and I'm going to do: predict-no
  998. ENV: Agent did: predict-no for direction L in state State-A
  999. In State-A moving L
  1000. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1001. predict error 0
  1002. dir: dir isU
  1003. |\-134: O: O268 (predict-no)
  1004. I see 1 and I'm going to do: predict-no
  1005. ENV: Agent did: predict-no for direction U in state State-A
  1006. In State-A moving U
  1007. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1008. predict error 0
  1009. dir: dir isL
  1010. /|135: O: O270 (predict-no)
  1011. I see 1 and I'm going to do: predict-no
  1012. ENV: Agent did: predict-no for direction L in state State-A
  1013. In State-A moving L
  1014. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1015. predict error 0
  1016. dir: dir isL
  1017. \-/|136: O: O272 (predict-no)
  1018. I see 1 and I'm going to do: predict-no
  1019. ENV: Agent did: predict-no for direction L in state State-A
  1020. In State-A moving L
  1021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1022. predict error 0
  1023. dir: dir isU
  1024. \-137: O: O274 (predict-no)
  1025. I see 1 and I'm going to do: predict-no
  1026. ENV: Agent did: predict-no for direction U in state State-A
  1027. In State-A moving U
  1028. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1029. predict error 0
  1030. dir: dir isR
  1031. /|\138: O: O275 (predict-yes)
  1032. I see 1 and I'm going to do: predict-yes
  1033. ENV: Agent did: predict-yes for direction R in state State-A
  1034. In State-A moving R
  1035. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1036. predict error 0
  1037. dir: dir isU
  1038. -/|\139: O: O278 (predict-no)
  1039. I see 1 and I'm going to do: predict-no
  1040. ENV: Agent did: predict-no for direction U in state State-B
  1041. In State-B moving U
  1042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1043. predict error 0
  1044. dir: dir isR
  1045. -/140: O: O279 (predict-yes)
  1046. I see 1 and I'm going to do: predict-yes
  1047. ENV: Agent did: predict-yes for direction R in state State-B
  1048. In State-B moving R
  1049. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1050. predict error 1
  1051. dir: dir isU
  1052. |\141: O: O282 (predict-no)
  1053. I see 0 and I'm going to do: predict-no
  1054. ENV: Agent did: predict-no for direction U in state State-B
  1055. In State-B moving U
  1056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1057. predict error 0
  1058. dir: dir isR
  1059. -142: O: O283 (predict-yes)
  1060. I see 1 and I'm going to do: predict-yes
  1061. ENV: Agent did: predict-yes for direction R in state State-B
  1062. In State-B moving R
  1063. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1064. predict error 1
  1065. dir: dir isU
  1066. /|\143: O: O286 (predict-no)
  1067. I see 0 and I'm going to do: predict-no
  1068. ENV: Agent did: predict-no for direction U in state State-B
  1069. In State-B moving U
  1070. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1071. predict error 0
  1072. dir: dir isU
  1073. -/144: O: O288 (predict-no)
  1074. I see 1 and I'm going to do: predict-no
  1075. ENV: Agent did: predict-no for direction U in state State-B
  1076. In State-B moving U
  1077. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1078. predict error 0
  1079. dir: dir isR
  1080. |\-145: O: O289 (predict-yes)
  1081. I see 1 and I'm going to do: predict-yes
  1082. ENV: Agent did: predict-yes for direction R in state State-B
  1083. In State-B moving R
  1084. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1085. predict error 1
  1086. dir: dir isL
  1087. /|146: O: O291 (predict-yes)
  1088. I see 0 and I'm going to do: predict-yes
  1089. ENV: Agent did: predict-yes for direction L in state State-B
  1090. In State-B moving L
  1091. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1092. predict error 0
  1093. dir: dir isL
  1094. \-/|147: O: O293 (predict-yes)
  1095. I see 1 and I'm going to do: predict-yes
  1096. ENV: Agent did: predict-yes for direction L in state State-A
  1097. In State-A moving L
  1098. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1099. predict error 1
  1100. dir: dir isU
  1101. \-/|148: O: O296 (predict-no)
  1102. I see 0 and I'm going to do: predict-no
  1103. ENV: Agent did: predict-no for direction U in state State-A
  1104. In State-A moving U
  1105. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1106. predict error 0
  1107. dir: dir isU
  1108. \-149: O: O298 (predict-no)
  1109. I see 1 and I'm going to do: predict-no
  1110. ENV: Agent did: predict-no for direction U in state State-A
  1111. In State-A moving U
  1112. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1113. predict error 0
  1114. dir: dir isU
  1115. /|150: O: O300 (predict-no)
  1116. I see 1 and I'm going to do: predict-no
  1117. ENV: Agent did: predict-no for direction U in state State-A
  1118. In State-A moving U
  1119. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1120. predict error 0
  1121. dir: dir isR
  1122. \-151: O: O301 (predict-yes)
  1123. I see 1 and I'm going to do: predict-yes
  1124. ENV: Agent did: predict-yes for direction R in state State-A
  1125. In State-A moving R
  1126. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1127. predict error 0
  1128. dir: dir isR
  1129. /152: O: O303 (predict-yes)
  1130. I see 1 and I'm going to do: predict-yes
  1131. ENV: Agent did: predict-yes for direction R in state State-B
  1132. In State-B moving R
  1133. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1134. predict error 1
  1135. dir: dir isU
  1136. |\-/153: O: O306 (predict-no)
  1137. I see 0 and I'm going to do: predict-no
  1138. ENV: Agent did: predict-no for direction U in state State-B
  1139. In State-B moving U
  1140. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1141. predict error 0
  1142. dir: dir isR
  1143. |\-/154: O: O307 (predict-yes)
  1144. I see 1 and I'm going to do: predict-yes
  1145. ENV: Agent did: predict-yes for direction R in state State-B
  1146. In State-B moving R
  1147. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1148. predict error 1
  1149. dir: dir isU
  1150. |\-/sleeping...
  1151. |155: O: O310 (predict-no)
  1152. I see 0 and I'm going to do: predict-no
  1153. ENV: Agent did: predict-no for direction U in state State-B
  1154. In State-B moving U
  1155. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1156. predict error 0
  1157. dir: dir isR
  1158. \-/156: O: O312 (predict-no)
  1159. I see 1 and I'm going to do: predict-no
  1160. ENV: Agent did: predict-no for direction R in state State-B
  1161. In State-B moving R
  1162. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1163. predict error 0
  1164. dir: dir isR
  1165. |\157: O: O314 (predict-no)
  1166. I see 1 and I'm going to do: predict-no
  1167. ENV: Agent did: predict-no for direction R in state State-B
  1168. In State-B moving R
  1169. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1170. predict error 0
  1171. dir: dir isU
  1172. -/|158: O: O315 (predict-yes)
  1173. I see 1 and I'm going to do: predict-yes
  1174. ENV: Agent did: predict-yes for direction U in state State-B
  1175. In State-B moving U
  1176. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1177. predict error 1
  1178. dir: dir isL
  1179. \-/159: O: O318 (predict-no)
  1180. I see 0 and I'm going to do: predict-no
  1181. ENV: Agent did: predict-no for direction L in state State-B
  1182. In State-B moving L
  1183. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1184. predict error 1
  1185. dir: dir isU
  1186. |\-/160: O: O319 (predict-yes)
  1187. I see 0 and I'm going to do: predict-yes
  1188. ENV: Agent did: predict-yes for direction U in state State-A
  1189. In State-A moving U
  1190. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1191. predict error 1
  1192. dir: dir isU
  1193. |\-161: O: O321 (predict-yes)
  1194. I see 0 and I'm going to do: predict-yes
  1195. ENV: Agent did: predict-yes for direction U in state State-A
  1196. In State-A moving U
  1197. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1198. predict error 1
  1199. dir: dir isL
  1200. /162: O: O323 (predict-yes)
  1201. I see 0 and I'm going to do: predict-yes
  1202. ENV: Agent did: predict-yes for direction L in state State-A
  1203. In State-A moving L
  1204. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1205. predict error 1
  1206. dir: dir isU
  1207. |\-163: O: O326 (predict-no)
  1208. I see 0 and I'm going to do: predict-no
  1209. ENV: Agent did: predict-no for direction U in state State-A
  1210. In State-A moving U
  1211. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1212. predict error 0
  1213. dir: dir isU
  1214. /|\-164: O: O327 (predict-yes)
  1215. I see 1 and I'm going to do: predict-yes
  1216. ENV: Agent did: predict-yes for direction U in state State-A
  1217. In State-A moving U
  1218. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1219. predict error 1
  1220. dir: dir isU
  1221. /|\165: O: O330 (predict-no)
  1222. I see 0 and I'm going to do: predict-no
  1223. ENV: Agent did: predict-no for direction U in state State-A
  1224. In State-A moving U
  1225. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1226. predict error 0
  1227. dir: dir isL
  1228. -/166: O: O332 (predict-no)
  1229. I see 1 and I'm going to do: predict-no
  1230. ENV: Agent did: predict-no for direction L in state State-A
  1231. In State-A moving L
  1232. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1233. predict error 0
  1234. dir: dir isU
  1235. |\167: O: O334 (predict-no)
  1236. I see 1 and I'm going to do: predict-no
  1237. ENV: Agent did: predict-no for direction U in state State-A
  1238. In State-A moving U
  1239. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1240. predict error 0
  1241. dir: dir isL
  1242. -/|168: O: O336 (predict-no)
  1243. I see 1 and I'm going to do: predict-no
  1244. ENV: Agent did: predict-no for direction L in state State-A
  1245. In State-A moving L
  1246. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1247. predict error 0
  1248. dir: dir isU
  1249. \-/169: O: O338 (predict-no)
  1250. I see 1 and I'm going to do: predict-no
  1251. ENV: Agent did: predict-no for direction U in state State-A
  1252. In State-A moving U
  1253. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1254. predict error 0
  1255. dir: dir isU
  1256. |\-170: O: O340 (predict-no)
  1257. I see 1 and I'm going to do: predict-no
  1258. ENV: Agent did: predict-no for direction U in state State-A
  1259. In State-A moving U
  1260. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1261. predict error 0
  1262. dir: dir isL
  1263. /|\-171: O: O342 (predict-no)
  1264. I see 1 and I'm going to do: predict-no
  1265. ENV: Agent did: predict-no for direction L in state State-A
  1266. In State-A moving L
  1267. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1268. predict error 0
  1269. dir: dir isL
  1270. /172: O: O344 (predict-no)
  1271. I see 1 and I'm going to do: predict-no
  1272. ENV: Agent did: predict-no for direction L in state State-A
  1273. In State-A moving L
  1274. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1275. predict error 0
  1276. dir: dir isR
  1277. |\173: O: O345 (predict-yes)
  1278. I see 1 and I'm going to do: predict-yes
  1279. ENV: Agent did: predict-yes for direction R in state State-A
  1280. In State-A moving R
  1281. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1282. predict error 0
  1283. dir: dir isR
  1284. -/174: O: O348 (predict-no)
  1285. I see 1 and I'm going to do: predict-no
  1286. ENV: Agent did: predict-no for direction R in state State-B
  1287. In State-B moving R
  1288. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1289. predict error 0
  1290. dir: dir isU
  1291. |\-175: O: O350 (predict-no)
  1292. I see 1 and I'm going to do: predict-no
  1293. ENV: Agent did: predict-no for direction U in state State-B
  1294. In State-B moving U
  1295. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1296. predict error 0
  1297. dir: dir isU
  1298. /|176: O: O352 (predict-no)
  1299. I see 1 and I'm going to do: predict-no
  1300. ENV: Agent did: predict-no for direction U in state State-B
  1301. In State-B moving U
  1302. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1303. predict error 0
  1304. dir: dir isL
  1305. \-/177: O: O353 (predict-yes)
  1306. I see 1 and I'm going to do: predict-yes
  1307. ENV: Agent did: predict-yes for direction L in state State-B
  1308. In State-B moving L
  1309. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1310. predict error 0
  1311. dir: dir isL
  1312. |\178: O: O356 (predict-no)
  1313. I see 1 and I'm going to do: predict-no
  1314. ENV: Agent did: predict-no for direction L in state State-A
  1315. In State-A moving L
  1316. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1317. predict error 0
  1318. dir: dir isU
  1319. -/|179: O: O358 (predict-no)
  1320. I see 1 and I'm going to do: predict-no
  1321. ENV: Agent did: predict-no for direction U in state State-A
  1322. In State-A moving U
  1323. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1324. predict error 0
  1325. dir: dir isL
  1326. \-/180: O: O360 (predict-no)
  1327. I see 1 and I'm going to do: predict-no
  1328. ENV: Agent did: predict-no for direction L in state State-A
  1329. In State-A moving L
  1330. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1331. predict error 0
  1332. dir: dir isU
  1333. |\-181: O: O362 (predict-no)
  1334. I see 1 and I'm going to do: predict-no
  1335. ENV: Agent did: predict-no for direction U in state State-A
  1336. In State-A moving U
  1337. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1338. predict error 0
  1339. dir: dir isL
  1340. /182: O: O364 (predict-no)
  1341. I see 1 and I'm going to do: predict-no
  1342. ENV: Agent did: predict-no for direction L in state State-A
  1343. In State-A moving L
  1344. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1345. predict error 0
  1346. dir: dir isU
  1347. |\-183: O: O366 (predict-no)
  1348. I see 1 and I'm going to do: predict-no
  1349. ENV: Agent did: predict-no for direction U in state State-A
  1350. In State-A moving U
  1351. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1352. predict error 0
  1353. dir: dir isU
  1354. /|184: O: O368 (predict-no)
  1355. I see 1 and I'm going to do: predict-no
  1356. ENV: Agent did: predict-no for direction U in state State-A
  1357. In State-A moving U
  1358. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1359. predict error 0
  1360. dir: dir isU
  1361. \-/185: O: O370 (predict-no)
  1362. I see 1 and I'm going to do: predict-no
  1363. ENV: Agent did: predict-no for direction U in state State-A
  1364. In State-A moving U
  1365. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1366. predict error 0
  1367. dir: dir isR
  1368. |\186: O: O371 (predict-yes)
  1369. I see 1 and I'm going to do: predict-yes
  1370. ENV: Agent did: predict-yes for direction R in state State-A
  1371. In State-A moving R
  1372. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1373. predict error 0
  1374. dir: dir isU
  1375. -/|187: O: O374 (predict-no)
  1376. I see 1 and I'm going to do: predict-no
  1377. ENV: Agent did: predict-no for direction U in state State-B
  1378. In State-B moving U
  1379. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1380. predict error 0
  1381. dir: dir isL
  1382. \-188: O: O375 (predict-yes)
  1383. I see 1 and I'm going to do: predict-yes
  1384. ENV: Agent did: predict-yes for direction L in state State-B
  1385. In State-B moving L
  1386. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1387. predict error 0
  1388. dir: dir isU
  1389. /|\189: O: O378 (predict-no)
  1390. I see 1 and I'm going to do: predict-no
  1391. ENV: Agent did: predict-no for direction U in state State-A
  1392. In State-A moving U
  1393. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1394. predict error 0
  1395. dir: dir isL
  1396. -/|190: O: O380 (predict-no)
  1397. I see 1 and I'm going to do: predict-no
  1398. ENV: Agent did: predict-no for direction L in state State-A
  1399. In State-A moving L
  1400. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1401. predict error 0
  1402. dir: dir isU
  1403. \-191: O: O381 (predict-yes)
  1404. I see 1 and I'm going to do: predict-yes
  1405. ENV: Agent did: predict-yes for direction U in state State-A
  1406. In State-A moving U
  1407. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1408. predict error 1
  1409. dir: dir isU
  1410. /192: O: O384 (predict-no)
  1411. I see 0 and I'm going to do: predict-no
  1412. ENV: Agent did: predict-no for direction U in state State-A
  1413. In State-A moving U
  1414. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1415. predict error 0
  1416. dir: dir isR
  1417. |\193: O: O385 (predict-yes)
  1418. I see 1 and I'm going to do: predict-yes
  1419. ENV: Agent did: predict-yes for direction R in state State-A
  1420. In State-A moving R
  1421. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1422. predict error 0
  1423. dir: dir isR
  1424. -/|194: O: O387 (predict-yes)
  1425. I see 1 and I'm going to do: predict-yes
  1426. ENV: Agent did: predict-yes for direction R in state State-B
  1427. In State-B moving R
  1428. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1429. predict error 1
  1430. dir: dir isL
  1431. \-/|sleeping...
  1432. \195: O: O390 (predict-no)
  1433. I see 0 and I'm going to do: predict-no
  1434. ENV: Agent did: predict-no for direction L in state State-B
  1435. In State-B moving L
  1436. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1437. predict error 1
  1438. dir: dir isL
  1439. -/|\196: O: O392 (predict-no)
  1440. I see 0 and I'm going to do: predict-no
  1441. ENV: Agent did: predict-no for direction L in state State-A
  1442. In State-A moving L
  1443. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1444. predict error 0
  1445. dir: dir isR
  1446. -/|197: O: O394 (predict-no)
  1447. I see 1 and I'm going to do: predict-no
  1448. ENV: Agent did: predict-no for direction R in state State-A
  1449. In State-A moving R
  1450. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1451. predict error 1
  1452. dir: dir isR
  1453. \-/198: O: O396 (predict-no)
  1454. I see 0 and I'm going to do: predict-no
  1455. ENV: Agent did: predict-no for direction R in state State-B
  1456. In State-B moving R
  1457. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1458. predict error 0
  1459. dir: dir isU
  1460. |\-199: O: O398 (predict-no)
  1461. I see 1 and I'm going to do: predict-no
  1462. ENV: Agent did: predict-no for direction U in state State-B
  1463. In State-B moving U
  1464. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1465. predict error 0
  1466. dir: dir isR
  1467. /|\200: O: O400 (predict-no)
  1468. I see 1 and I'm going to do: predict-no
  1469. ENV: Agent did: predict-no for direction R in state State-B
  1470. In State-B moving R
  1471. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1472. predict error 0
  1473. dir: dir isL
  1474. -/|201: O: O401 (predict-yes)
  1475. I see 1 and I'm going to do: predict-yes
  1476. ENV: Agent did: predict-yes for direction L in state State-B
  1477. In State-B moving L
  1478. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1479. predict error 0
  1480. dir: dir isR
  1481. \-202: O: O403 (predict-yes)
  1482. I see 1 and I'm going to do: predict-yes
  1483. ENV: Agent did: predict-yes for direction R in state State-A
  1484. In State-A moving R
  1485. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1486. predict error 0
  1487. dir: dir isU
  1488. /|\203: O: O406 (predict-no)
  1489. I see 1 and I'm going to do: predict-no
  1490. ENV: Agent did: predict-no for direction U in state State-B
  1491. In State-B moving U
  1492. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1493. predict error 0
  1494. dir: dir isR
  1495. -/|204: O: O408 (predict-no)
  1496. I see 1 and I'm going to do: predict-no
  1497. ENV: Agent did: predict-no for direction R in state State-B
  1498. In State-B moving R
  1499. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1500. predict error 0
  1501. dir: dir isL
  1502. \-205: O: O409 (predict-yes)
  1503. I see 1 and I'm going to do: predict-yes
  1504. ENV: Agent did: predict-yes for direction L in state State-B
  1505. In State-B moving L
  1506. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1507. predict error 0
  1508. dir: dir isU
  1509. /|\-206: O: O412 (predict-no)
  1510. I see 1 and I'm going to do: predict-no
  1511. ENV: Agent did: predict-no for direction U in state State-A
  1512. In State-A moving U
  1513. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1514. predict error 0
  1515. dir: dir isU
  1516. /|\207: O: O414 (predict-no)
  1517. I see 1 and I'm going to do: predict-no
  1518. ENV: Agent did: predict-no for direction U in state State-A
  1519. In State-A moving U
  1520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1521. predict error 0
  1522. dir: dir isL
  1523. -208: O: O416 (predict-no)
  1524. I see 1 and I'm going to do: predict-no
  1525. ENV: Agent did: predict-no for direction L in state State-A
  1526. In State-A moving L
  1527. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1528. predict error 0
  1529. dir: dir isR
  1530. /|\209: O: O417 (predict-yes)
  1531. I see 1 and I'm going to do: predict-yes
  1532. ENV: Agent did: predict-yes for direction R in state State-A
  1533. In State-A moving R
  1534. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1535. predict error 0
  1536. dir: dir isR
  1537. -/|210: O: O420 (predict-no)
  1538. I see 1 and I'm going to do: predict-no
  1539. ENV: Agent did: predict-no for direction R in state State-B
  1540. In State-B moving R
  1541. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1542. predict error 0
  1543. dir: dir isU
  1544. \-/211: O: O422 (predict-no)
  1545. I see 1 and I'm going to do: predict-no
  1546. ENV: Agent did: predict-no for direction U in state State-B
  1547. In State-B moving U
  1548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1549. predict error 0
  1550. dir: dir isL
  1551. |212: O: O423 (predict-yes)
  1552. I see 1 and I'm going to do: predict-yes
  1553. ENV: Agent did: predict-yes for direction L in state State-B
  1554. In State-B moving L
  1555. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1556. predict error 0
  1557. dir: dir isL
  1558. \-/213: O: O426 (predict-no)
  1559. I see 1 and I'm going to do: predict-no
  1560. ENV: Agent did: predict-no for direction L in state State-A
  1561. In State-A moving L
  1562. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1563. predict error 0
  1564. dir: dir isR
  1565. |\-214: O: O428 (predict-no)
  1566. I see 1 and I'm going to do: predict-no
  1567. ENV: Agent did: predict-no for direction R in state State-A
  1568. In State-A moving R
  1569. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1570. predict error 1
  1571. dir: dir isU
  1572. /|\215: O: O430 (predict-no)
  1573. I see 0 and I'm going to do: predict-no
  1574. ENV: Agent did: predict-no for direction U in state State-B
  1575. In State-B moving U
  1576. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1577. predict error 0
  1578. dir: dir isL
  1579. -/216: O: O431 (predict-yes)
  1580. I see 1 and I'm going to do: predict-yes
  1581. ENV: Agent did: predict-yes for direction L in state State-B
  1582. In State-B moving L
  1583. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1584. predict error 0
  1585. dir: dir isL
  1586. |\-217: O: O434 (predict-no)
  1587. I see 1 and I'm going to do: predict-no
  1588. ENV: Agent did: predict-no for direction L in state State-A
  1589. In State-A moving L
  1590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1591. predict error 0
  1592. dir: dir isR
  1593. /|\218: O: O435 (predict-yes)
  1594. I see 1 and I'm going to do: predict-yes
  1595. ENV: Agent did: predict-yes for direction R in state State-A
  1596. In State-A moving R
  1597. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1598. predict error 0
  1599. dir: dir isU
  1600. -/|219: O: O438 (predict-no)
  1601. I see 1 and I'm going to do: predict-no
  1602. ENV: Agent did: predict-no for direction U in state State-B
  1603. In State-B moving U
  1604. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1605. predict error 0
  1606. dir: dir isR
  1607. \-/220: O: O440 (predict-no)
  1608. I see 1 and I'm going to do: predict-no
  1609. ENV: Agent did: predict-no for direction R in state State-B
  1610. In State-B moving R
  1611. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1612. predict error 0
  1613. dir: dir isR
  1614. |\221: O: O442 (predict-no)
  1615. I see 1 and I'm going to do: predict-no
  1616. ENV: Agent did: predict-no for direction R in state State-B
  1617. In State-B moving R
  1618. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1619. predict error 0
  1620. dir: dir isU
  1621. -222: O: O444 (predict-no)
  1622. I see 1 and I'm going to do: predict-no
  1623. ENV: Agent did: predict-no for direction U in state State-B
  1624. In State-B moving U
  1625. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1626. predict error 0
  1627. dir: dir isU
  1628. /|\223: O: O446 (predict-no)
  1629. I see 1 and I'm going to do: predict-no
  1630. ENV: Agent did: predict-no for direction U in state State-B
  1631. In State-B moving U
  1632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1633. predict error 0
  1634. dir: dir isL
  1635. -/224: O: O447 (predict-yes)
  1636. I see 1 and I'm going to do: predict-yes
  1637. ENV: Agent did: predict-yes for direction L in state State-B
  1638. In State-B moving L
  1639. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1640. predict error 0
  1641. dir: dir isU
  1642. |225: O: O450 (predict-no)
  1643. I see 1 and I'm going to do: predict-no
  1644. ENV: Agent did: predict-no for direction U in state State-A
  1645. In State-A moving U
  1646. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1647. predict error 0
  1648. dir: dir isL
  1649. \-226: O: O452 (predict-no)
  1650. I see 1 and I'm going to do: predict-no
  1651. ENV: Agent did: predict-no for direction L in state State-A
  1652. In State-A moving L
  1653. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1654. predict error 0
  1655. dir: dir isL
  1656. /|\227: O: O454 (predict-no)
  1657. I see 1 and I'm going to do: predict-no
  1658. ENV: Agent did: predict-no for direction L in state State-A
  1659. In State-A moving L
  1660. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1661. predict error 0
  1662. dir: dir isU
  1663. -/|228: O: O456 (predict-no)
  1664. I see 1 and I'm going to do: predict-no
  1665. ENV: Agent did: predict-no for direction U in state State-A
  1666. In State-A moving U
  1667. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1668. predict error 0
  1669. dir: dir isR
  1670. \-229: O: O457 (predict-yes)
  1671. I see 1 and I'm going to do: predict-yes
  1672. ENV: Agent did: predict-yes for direction R in state State-A
  1673. In State-A moving R
  1674. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1675. predict error 0
  1676. dir: dir isU
  1677. /|\230: O: O460 (predict-no)
  1678. I see 1 and I'm going to do: predict-no
  1679. ENV: Agent did: predict-no for direction U in state State-B
  1680. In State-B moving U
  1681. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1682. predict error 0
  1683. dir: dir isL
  1684. -/|\231: O: O461 (predict-yes)
  1685. I see 1 and I'm going to do: predict-yes
  1686. ENV: Agent did: predict-yes for direction L in state State-B
  1687. In State-B moving L
  1688. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1689. predict error 0
  1690. dir: dir isU
  1691. -232: O: O463 (predict-yes)
  1692. I see 1 and I'm going to do: predict-yes
  1693. ENV: Agent did: predict-yes for direction U in state State-A
  1694. In State-A moving U
  1695. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1696. predict error 1
  1697. dir: dir isL
  1698. /|\233: O: O466 (predict-no)
  1699. I see 0 and I'm going to do: predict-no
  1700. ENV: Agent did: predict-no for direction L in state State-A
  1701. In State-A moving L
  1702. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1703. predict error 0
  1704. dir: dir isL
  1705. -/|234: O: O468 (predict-no)
  1706. I see 1 and I'm going to do: predict-no
  1707. ENV: Agent did: predict-no for direction L in state State-A
  1708. In State-A moving L
  1709. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1710. predict error 0
  1711. dir: dir isR
  1712. \-/|235: O: O469 (predict-yes)
  1713. I see 1 and I'm going to do: predict-yes
  1714. ENV: Agent did: predict-yes for direction R in state State-A
  1715. In State-A moving R
  1716. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1717. predict error 0
  1718. dir: dir isR
  1719. \-/236: O: O472 (predict-no)
  1720. I see 1 and I'm going to do: predict-no
  1721. ENV: Agent did: predict-no for direction R in state State-B
  1722. In State-B moving R
  1723. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1724. predict error 0
  1725. dir: dir isR
  1726. |\-/237: O: O474 (predict-no)
  1727. I see 1 and I'm going to do: predict-no
  1728. ENV: Agent did: predict-no for direction R in state State-B
  1729. In State-B moving R
  1730. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1731. predict error 0
  1732. dir: dir isU
  1733. |\-238: O: O475 (predict-yes)
  1734. I see 1 and I'm going to do: predict-yes
  1735. ENV: Agent did: predict-yes for direction U in state State-B
  1736. In State-B moving U
  1737. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1738. predict error 1
  1739. dir: dir isU
  1740. /|\-239: O: O477 (predict-yes)
  1741. I see 0 and I'm going to do: predict-yes
  1742. ENV: Agent did: predict-yes for direction U in state State-B
  1743. In State-B moving U
  1744. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1745. predict error 1
  1746. dir: dir isL
  1747. /|\240: O: O479 (predict-yes)
  1748. I see 0 and I'm going to do: predict-yes
  1749. ENV: Agent did: predict-yes for direction L in state State-B
  1750. In State-B moving L
  1751. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1752. predict error 0
  1753. dir: dir isL
  1754. -/241: O: O482 (predict-no)
  1755. I see 1 and I'm going to do: predict-no
  1756. ENV: Agent did: predict-no for direction L in state State-A
  1757. In State-A moving L
  1758. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1759. predict error 0
  1760. dir: dir isR
  1761. |242: O: O483 (predict-yes)
  1762. I see 1 and I'm going to do: predict-yes
  1763. ENV: Agent did: predict-yes for direction R in state State-A
  1764. In State-A moving R
  1765. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1766. predict error 0
  1767. dir: dir isR
  1768. \-/243: O: O485 (predict-yes)
  1769. I see 1 and I'm going to do: predict-yes
  1770. ENV: Agent did: predict-yes for direction R in state State-B
  1771. In State-B moving R
  1772. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1773. predict error 1
  1774. dir: dir isU
  1775. |\244: O: O488 (predict-no)
  1776. I see 0 and I'm going to do: predict-no
  1777. ENV: Agent did: predict-no for direction U in state State-B
  1778. In State-B moving U
  1779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1780. predict error 0
  1781. dir: dir isR
  1782. -/|\245: O: O490 (predict-no)
  1783. I see 1 and I'm going to do: predict-no
  1784. ENV: Agent did: predict-no for direction R in state State-B
  1785. In State-B moving R
  1786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1787. predict error 0
  1788. dir: dir isL
  1789. -/|246: O: O491 (predict-yes)
  1790. I see 1 and I'm going to do: predict-yes
  1791. ENV: Agent did: predict-yes for direction L in state State-B
  1792. In State-B moving L
  1793. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1794. predict error 0
  1795. dir: dir isL
  1796. \-/|247: O: O494 (predict-no)
  1797. I see 1 and I'm going to do: predict-no
  1798. ENV: Agent did: predict-no for direction L in state State-A
  1799. In State-A moving L
  1800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1801. predict error 0
  1802. dir: dir isU
  1803. \-248: O: O496 (predict-no)
  1804. I see 1 and I'm going to do: predict-no
  1805. ENV: Agent did: predict-no for direction U in state State-A
  1806. In State-A moving U
  1807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1808. predict error 0
  1809. dir: dir isR
  1810. /|249: O: O497 (predict-yes)
  1811. I see 1 and I'm going to do: predict-yes
  1812. ENV: Agent did: predict-yes for direction R in state State-A
  1813. In State-A moving R
  1814. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1815. predict error 0
  1816. dir: dir isU
  1817. \-/250: O: O500 (predict-no)
  1818. I see 1 and I'm going to do: predict-no
  1819. ENV: Agent did: predict-no for direction U in state State-B
  1820. In State-B moving U
  1821. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1822. predict error 0
  1823. dir: dir isU
  1824. |\-/251: O: O502 (predict-no)
  1825. I see 1 and I'm going to do: predict-no
  1826. ENV: Agent did: predict-no for direction U in state State-B
  1827. In State-B moving U
  1828. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1829. predict error 0
  1830. dir: dir isR
  1831. |252: O: O504 (predict-no)
  1832. I see 1 and I'm going to do: predict-no
  1833. ENV: Agent did: predict-no for direction R in state State-B
  1834. In State-B moving R
  1835. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1836. predict error 0
  1837. dir: dir isU
  1838. \-/253: O: O506 (predict-no)
  1839. I see 1 and I'm going to do: predict-no
  1840. ENV: Agent did: predict-no for direction U in state State-B
  1841. In State-B moving U
  1842. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1843. predict error 0
  1844. dir: dir isR
  1845. |\-254: O: O508 (predict-no)
  1846. I see 1 and I'm going to do: predict-no
  1847. ENV: Agent did: predict-no for direction R in state State-B
  1848. In State-B moving R
  1849. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1850. predict error 0
  1851. dir: dir isL
  1852. /|\-255: O: O509 (predict-yes)
  1853. I see 1 and I'm going to do: predict-yes
  1854. ENV: Agent did: predict-yes for direction L in state State-B
  1855. In State-B moving L
  1856. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1857. predict error 0
  1858. dir: dir isU
  1859. /|\256: O: O512 (predict-no)
  1860. I see 1 and I'm going to do: predict-no
  1861. ENV: Agent did: predict-no for direction U in state State-A
  1862. In State-A moving U
  1863. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1864. predict error 0
  1865. dir: dir isR
  1866. -/|257: O: O513 (predict-yes)
  1867. I see 1 and I'm going to do: predict-yes
  1868. ENV: Agent did: predict-yes for direction R in state State-A
  1869. In State-A moving R
  1870. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1871. predict error 0
  1872. dir: dir isU
  1873. \-/|258: O: O516 (predict-no)
  1874. I see 1 and I'm going to do: predict-no
  1875. ENV: Agent did: predict-no for direction U in state State-B
  1876. In State-B moving U
  1877. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1878. predict error 0
  1879. dir: dir isL
  1880. \-/259: O: O517 (predict-yes)
  1881. I see 1 and I'm going to do: predict-yes
  1882. ENV: Agent did: predict-yes for direction L in state State-B
  1883. In State-B moving L
  1884. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1885. predict error 0
  1886. dir: dir isU
  1887. |\-260: O: O520 (predict-no)
  1888. I see 1 and I'm going to do: predict-no
  1889. ENV: Agent did: predict-no for direction U in state State-A
  1890. In State-A moving U
  1891. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1892. predict error 0
  1893. dir: dir isU
  1894. /|\261: O: O522 (predict-no)
  1895. I see 1 and I'm going to do: predict-no
  1896. ENV: Agent did: predict-no for direction U in state State-A
  1897. In State-A moving U
  1898. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1899. predict error 0
  1900. dir: dir isR
  1901. -262: O: O523 (predict-yes)
  1902. I see 1 and I'm going to do: predict-yes
  1903. ENV: Agent did: predict-yes for direction R in state State-A
  1904. In State-A moving R
  1905. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1906. predict error 0
  1907. dir: dir isR
  1908. /|263: O: O526 (predict-no)
  1909. I see 1 and I'm going to do: predict-no
  1910. ENV: Agent did: predict-no for direction R in state State-B
  1911. In State-B moving R
  1912. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1913. predict error 0
  1914. dir: dir isL
  1915. \-/264: O: O527 (predict-yes)
  1916. I see 1 and I'm going to do: predict-yes
  1917. ENV: Agent did: predict-yes for direction L in state State-B
  1918. In State-B moving L
  1919. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1920. predict error 0
  1921. dir: dir isU
  1922. |\-/265: O: O530 (predict-no)
  1923. I see 1 and I'm going to do: predict-no
  1924. ENV: Agent did: predict-no for direction U in state State-A
  1925. In State-A moving U
  1926. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1927. predict error 0
  1928. dir: dir isR
  1929. |\-266: O: O531 (predict-yes)
  1930. I see 1 and I'm going to do: predict-yes
  1931. ENV: Agent did: predict-yes for direction R in state State-A
  1932. In State-A moving R
  1933. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1934. predict error 0
  1935. dir: dir isR
  1936. /267: O: O534 (predict-no)
  1937. I see 1 and I'm going to do: predict-no
  1938. ENV: Agent did: predict-no for direction R in state State-B
  1939. In State-B moving R
  1940. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1941. predict error 0
  1942. dir: dir isL
  1943. |268: O: O535 (predict-yes)
  1944. I see 1 and I'm going to do: predict-yes
  1945. ENV: Agent did: predict-yes for direction L in state State-B
  1946. In State-B moving L
  1947. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1948. predict error 0
  1949. dir: dir isL
  1950. \-/|269: O: O538 (predict-no)
  1951. I see 1 and I'm going to do: predict-no
  1952. ENV: Agent did: predict-no for direction L in state State-A
  1953. In State-A moving L
  1954. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1955. predict error 0
  1956. dir: dir isL
  1957. \-/270: O: O540 (predict-no)
  1958. I see 1 and I'm going to do: predict-no
  1959. ENV: Agent did: predict-no for direction L in state State-A
  1960. In State-A moving L
  1961. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1962. predict error 0
  1963. dir: dir isU
  1964. |\271: O: O542 (predict-no)
  1965. I see 1 and I'm going to do: predict-no
  1966. ENV: Agent did: predict-no for direction U in state State-A
  1967. In State-A moving U
  1968. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1969. predict error 0
  1970. dir: dir isL
  1971. -272: O: O543 (predict-yes)
  1972. I see 1 and I'm going to do: predict-yes
  1973. ENV: Agent did: predict-yes for direction L in state State-A
  1974. In State-A moving L
  1975. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1976. predict error 1
  1977. dir: dir isU
  1978. /|\273: O: O546 (predict-no)
  1979. I see 0 and I'm going to do: predict-no
  1980. ENV: Agent did: predict-no for direction U in state State-A
  1981. In State-A moving U
  1982. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1983. predict error 0
  1984. dir: dir isR
  1985. -/|274: O: O547 (predict-yes)
  1986. I see 1 and I'm going to do: predict-yes
  1987. ENV: Agent did: predict-yes for direction R in state State-A
  1988. In State-A moving R
  1989. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1990. predict error 0
  1991. dir: dir isR
  1992. \-/275: O: O550 (predict-no)
  1993. I see 1 and I'm going to do: predict-no
  1994. ENV: Agent did: predict-no for direction R in state State-B
  1995. In State-B moving R
  1996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1997. predict error 0
  1998. dir: dir isR
  1999. |\276: O: O552 (predict-no)
  2000. I see 1 and I'm going to do: predict-no
  2001. ENV: Agent did: predict-no for direction R in state State-B
  2002. In State-B moving R
  2003. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2004. predict error 0
  2005. dir: dir isR
  2006. -/|277: O: O554 (predict-no)
  2007. I see 1 and I'm going to do: predict-no
  2008. ENV: Agent did: predict-no for direction R in state State-B
  2009. In State-B moving R
  2010. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2011. predict error 0
  2012. dir: dir isL
  2013. \-/278: O: O555 (predict-yes)
  2014. I see 1 and I'm going to do: predict-yes
  2015. ENV: Agent did: predict-yes for direction L in state State-B
  2016. In State-B moving L
  2017. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2018. predict error 0
  2019. dir: dir isR
  2020. |\-279: O: O557 (predict-yes)
  2021. I see 1 and I'm going to do: predict-yes
  2022. ENV: Agent did: predict-yes for direction R in state State-A
  2023. In State-A moving R
  2024. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2025. predict error 0
  2026. dir: dir isU
  2027. /|\280: O: O560 (predict-no)
  2028. I see 1 and I'm going to do: predict-no
  2029. ENV: Agent did: predict-no for direction U in state State-B
  2030. In State-B moving U
  2031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2032. predict error 0
  2033. dir: dir isL
  2034. -/|281: O: O561 (predict-yes)
  2035. I see 1 and I'm going to do: predict-yes
  2036. ENV: Agent did: predict-yes for direction L in state State-B
  2037. In State-B moving L
  2038. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2039. predict error 0
  2040. dir: dir isR
  2041. \282: O: O564 (predict-no)
  2042. I see 1 and I'm going to do: predict-no
  2043. ENV: Agent did: predict-no for direction R in state State-A
  2044. In State-A moving R
  2045. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2046. predict error 1
  2047. dir: dir isL
  2048. -/|283: O: O565 (predict-yes)
  2049. I see 0 and I'm going to do: predict-yes
  2050. ENV: Agent did: predict-yes for direction L in state State-B
  2051. In State-B moving L
  2052. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2053. predict error 0
  2054. dir: dir isL
  2055. \-/284: O: O568 (predict-no)
  2056. I see 1 and I'm going to do: predict-no
  2057. ENV: Agent did: predict-no for direction L in state State-A
  2058. In State-A moving L
  2059. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2060. predict error 0
  2061. dir: dir isL
  2062. |\285: O: O570 (predict-no)
  2063. I see 1 and I'm going to do: predict-no
  2064. ENV: Agent did: predict-no for direction L in state State-A
  2065. In State-A moving L
  2066. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2067. predict error 0
  2068. dir: dir isL
  2069. -/|286: O: O572 (predict-no)
  2070. I see 1 and I'm going to do: predict-no
  2071. ENV: Agent did: predict-no for direction L in state State-A
  2072. In State-A moving L
  2073. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2074. predict error 0
  2075. dir: dir isU
  2076. \-/287: O: O574 (predict-no)
  2077. I see 1 and I'm going to do: predict-no
  2078. ENV: Agent did: predict-no for direction U in state State-A
  2079. In State-A moving U
  2080. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2081. predict error 0
  2082. dir: dir isU
  2083. |\288: O: O576 (predict-no)
  2084. I see 1 and I'm going to do: predict-no
  2085. ENV: Agent did: predict-no for direction U in state State-A
  2086. In State-A moving U
  2087. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2088. predict error 0
  2089. dir: dir isU
  2090. -/|289: O: O577 (predict-yes)
  2091. I see 1 and I'm going to do: predict-yes
  2092. ENV: Agent did: predict-yes for direction U in state State-A
  2093. In State-A moving U
  2094. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2095. predict error 1
  2096. dir: dir isU
  2097. \290: O: O579 (predict-yes)
  2098. I see 0 and I'm going to do: predict-yes
  2099. ENV: Agent did: predict-yes for direction U in state State-A
  2100. In State-A moving U
  2101. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2102. predict error 1
  2103. dir: dir isU
  2104. -/291: O: O582 (predict-no)
  2105. I see 0 and I'm going to do: predict-no
  2106. ENV: Agent did: predict-no for direction U in state State-A
  2107. In State-A moving U
  2108. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2109. predict error 0
  2110. dir: dir isL
  2111. |292: O: O584 (predict-no)
  2112. I see 1 and I'm going to do: predict-no
  2113. ENV: Agent did: predict-no for direction L in state State-A
  2114. In State-A moving L
  2115. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2116. predict error 0
  2117. dir: dir isR
  2118. \-/293: O: O585 (predict-yes)
  2119. I see 1 and I'm going to do: predict-yes
  2120. ENV: Agent did: predict-yes for direction R in state State-A
  2121. In State-A moving R
  2122. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2123. predict error 0
  2124. dir: dir isR
  2125. |\-294: O: O588 (predict-no)
  2126. I see 1 and I'm going to do: predict-no
  2127. ENV: Agent did: predict-no for direction R in state State-B
  2128. In State-B moving R
  2129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2130. predict error 0
  2131. dir: dir isR
  2132. /|295: O: O590 (predict-no)
  2133. I see 1 and I'm going to do: predict-no
  2134. ENV: Agent did: predict-no for direction R in state State-B
  2135. In State-B moving R
  2136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2137. predict error 0
  2138. dir: dir isU
  2139. \-296: O: O592 (predict-no)
  2140. I see 1 and I'm going to do: predict-no
  2141. ENV: Agent did: predict-no for direction U in state State-B
  2142. In State-B moving U
  2143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2144. predict error 0
  2145. dir: dir isU
  2146. /|297: O: O594 (predict-no)
  2147. I see 1 and I'm going to do: predict-no
  2148. ENV: Agent did: predict-no for direction U in state State-B
  2149. In State-B moving U
  2150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2151. predict error 0
  2152. dir: dir isU
  2153. \-/298: O: O596 (predict-no)
  2154. I see 1 and I'm going to do: predict-no
  2155. ENV: Agent did: predict-no for direction U in state State-B
  2156. In State-B moving U
  2157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2158. predict error 0
  2159. dir: dir isL
  2160. |\-/299: O: O597 (predict-yes)
  2161. I see 1 and I'm going to do: predict-yes
  2162. ENV: Agent did: predict-yes for direction L in state State-B
  2163. In State-B moving L
  2164. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2165. predict error 0
  2166. dir: dir isR
  2167. |\-300: O: O599 (predict-yes)
  2168. I see 1 and I'm going to do: predict-yes
  2169. ENV: Agent did: predict-yes for direction R in state State-A
  2170. In State-A moving R
  2171. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2172. predict error 0
  2173. dir: dir isU
  2174. /|\-/|301: O: O602 (predict-no)
  2175. I see 1 and I'm going to do: predict-no
  2176. ENV: Agent did: predict-no for direction U in state State-B
  2177. In State-B moving U
  2178. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2179. predict error 0
  2180. dir: dir isU
  2181. \302: O: O604 (predict-no)
  2182. I see 1 and I'm going to do: predict-no
  2183. ENV: Agent did: predict-no for direction U in state State-B
  2184. In State-B moving U
  2185. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2186. predict error 0
  2187. dir: dir isR
  2188. -303: O: O606 (predict-no)
  2189. I see 1 and I'm going to do: predict-no
  2190. ENV: Agent did: predict-no for direction R in state State-B
  2191. In State-B moving R
  2192. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2193. predict error 0
  2194. dir: dir isU
  2195. /|\304: O: O608 (predict-no)
  2196. I see 1 and I'm going to do: predict-no
  2197. ENV: Agent did: predict-no for direction U in state State-B
  2198. In State-B moving U
  2199. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2200. predict error 0
  2201. dir: dir isL
  2202. -/|305: O: O609 (predict-yes)
  2203. I see 1 and I'm going to do: predict-yes
  2204. ENV: Agent did: predict-yes for direction L in state State-B
  2205. In State-B moving L
  2206. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2207. predict error 0
  2208. dir: dir isU
  2209. \-/|306: O: O612 (predict-no)
  2210. I see 1 and I'm going to do: predict-no
  2211. ENV: Agent did: predict-no for direction U in state State-A
  2212. In State-A moving U
  2213. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2214. predict error 0
  2215. dir: dir isR
  2216. \-/307: O: O613 (predict-yes)
  2217. I see 1 and I'm going to do: predict-yes
  2218. ENV: Agent did: predict-yes for direction R in state State-A
  2219. In State-A moving R
  2220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2221. predict error 0
  2222. dir: dir isR
  2223. |\-308: O: O616 (predict-no)
  2224. I see 1 and I'm going to do: predict-no
  2225. ENV: Agent did: predict-no for direction R in state State-B
  2226. In State-B moving R
  2227. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2228. predict error 0
  2229. dir: dir isU
  2230. /|\309: O: O618 (predict-no)
  2231. I see 1 and I'm going to do: predict-no
  2232. ENV: Agent did: predict-no for direction U in state State-B
  2233. In State-B moving U
  2234. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2235. predict error 0
  2236. dir: dir isU
  2237. -/|310: O: O620 (predict-no)
  2238. I see 1 and I'm going to do: predict-no
  2239. ENV: Agent did: predict-no for direction U in state State-B
  2240. In State-B moving U
  2241. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2242. predict error 0
  2243. dir: dir isL
  2244. \-311: O: O621 (predict-yes)
  2245. I see 1 and I'm going to do: predict-yes
  2246. ENV: Agent did: predict-yes for direction L in state State-B
  2247. In State-B moving L
  2248. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2249. predict error 0
  2250. dir: dir isL
  2251. /312: O: O624 (predict-no)
  2252. I see 1 and I'm going to do: predict-no
  2253. ENV: Agent did: predict-no for direction L in state State-A
  2254. In State-A moving L
  2255. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2256. predict error 0
  2257. dir: dir isL
  2258. |\313: O: O626 (predict-no)
  2259. I see 1 and I'm going to do: predict-no
  2260. ENV: Agent did: predict-no for direction L in state State-A
  2261. In State-A moving L
  2262. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2263. predict error 0
  2264. dir: dir isR
  2265. -/|314: O: O627 (predict-yes)
  2266. I see 1 and I'm going to do: predict-yes
  2267. ENV: Agent did: predict-yes for direction R in state State-A
  2268. In State-A moving R
  2269. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2270. predict error 0
  2271. dir: dir isU
  2272. \-315: O: O630 (predict-no)
  2273. I see 1 and I'm going to do: predict-no
  2274. ENV: Agent did: predict-no for direction U in state State-B
  2275. In State-B moving U
  2276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2277. predict error 0
  2278. dir: dir isU
  2279. /|\316: O: O632 (predict-no)
  2280. I see 1 and I'm going to do: predict-no
  2281. ENV: Agent did: predict-no for direction U in state State-B
  2282. In State-B moving U
  2283. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2284. predict error 0
  2285. dir: dir isR
  2286. -/|317: O: O634 (predict-no)
  2287. I see 1 and I'm going to do: predict-no
  2288. ENV: Agent did: predict-no for direction R in state State-B
  2289. In State-B moving R
  2290. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2291. predict error 0
  2292. dir: dir isR
  2293. \-/318: O: O636 (predict-no)
  2294. I see 1 and I'm going to do: predict-no
  2295. ENV: Agent did: predict-no for direction R in state State-B
  2296. In State-B moving R
  2297. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2298. predict error 0
  2299. dir: dir isU
  2300. |\-319: O: O638 (predict-no)
  2301. I see 1 and I'm going to do: predict-no
  2302. ENV: Agent did: predict-no for direction U in state State-B
  2303. In State-B moving U
  2304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2305. predict error 0
  2306. dir: dir isL
  2307. /|320: O: O639 (predict-yes)
  2308. I see 1 and I'm going to do: predict-yes
  2309. ENV: Agent did: predict-yes for direction L in state State-B
  2310. In State-B moving L
  2311. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2312. predict error 0
  2313. dir: dir isR
  2314. \-/|321: O: O641 (predict-yes)
  2315. I see 1 and I'm going to do: predict-yes
  2316. ENV: Agent did: predict-yes for direction R in state State-A
  2317. In State-A moving R
  2318. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2319. predict error 0
  2320. dir: dir isL
  2321. \322: O: O643 (predict-yes)
  2322. I see 1 and I'm going to do: predict-yes
  2323. ENV: Agent did: predict-yes for direction L in state State-B
  2324. In State-B moving L
  2325. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2326. predict error 0
  2327. dir: dir isL
  2328. -323: O: O646 (predict-no)
  2329. I see 1 and I'm going to do: predict-no
  2330. ENV: Agent did: predict-no for direction L in state State-A
  2331. In State-A moving L
  2332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2333. predict error 0
  2334. dir: dir isR
  2335. /|\324: O: O647 (predict-yes)
  2336. I see 1 and I'm going to do: predict-yes
  2337. ENV: Agent did: predict-yes for direction R in state State-A
  2338. In State-A moving R
  2339. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2340. predict error 0
  2341. dir: dir isU
  2342. -/|325: O: O650 (predict-no)
  2343. I see 1 and I'm going to do: predict-no
  2344. ENV: Agent did: predict-no for direction U in state State-B
  2345. In State-B moving U
  2346. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2347. predict error 0
  2348. dir: dir isR
  2349. \-/326: O: O652 (predict-no)
  2350. I see 1 and I'm going to do: predict-no
  2351. ENV: Agent did: predict-no for direction R in state State-B
  2352. In State-B moving R
  2353. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2354. predict error 0
  2355. dir: dir isR
  2356. |\-/327: O: O653 (predict-yes)
  2357. I see 1 and I'm going to do: predict-yes
  2358. ENV: Agent did: predict-yes for direction R in state State-B
  2359. In State-B moving R
  2360. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2361. predict error 1
  2362. dir: dir isU
  2363. |\-328: O: O656 (predict-no)
  2364. I see 0 and I'm going to do: predict-no
  2365. ENV: Agent did: predict-no for direction U in state State-B
  2366. In State-B moving U
  2367. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2368. predict error 0
  2369. dir: dir isR
  2370. /|\329: O: O658 (predict-no)
  2371. I see 1 and I'm going to do: predict-no
  2372. ENV: Agent did: predict-no for direction R in state State-B
  2373. In State-B moving R
  2374. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2375. predict error 0
  2376. dir: dir isL
  2377. -/330: O: O659 (predict-yes)
  2378. I see 1 and I'm going to do: predict-yes
  2379. ENV: Agent did: predict-yes for direction L in state State-B
  2380. In State-B moving L
  2381. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2382. predict error 0
  2383. dir: dir isL
  2384. |\331: O: O662 (predict-no)
  2385. I see 1 and I'm going to do: predict-no
  2386. ENV: Agent did: predict-no for direction L in state State-A
  2387. In State-A moving L
  2388. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2389. predict error 0
  2390. dir: dir isU
  2391. -332: O: O664 (predict-no)
  2392. I see 1 and I'm going to do: predict-no
  2393. ENV: Agent did: predict-no for direction U in state State-A
  2394. In State-A moving U
  2395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2396. predict error 0
  2397. dir: dir isU
  2398. /|\-333: O: O666 (predict-no)
  2399. I see 1 and I'm going to do: predict-no
  2400. ENV: Agent did: predict-no for direction U in state State-A
  2401. In State-A moving U
  2402. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2403. predict error 0
  2404. dir: dir isU
  2405. /|\334: O: O668 (predict-no)
  2406. I see 1 and I'm going to do: predict-no
  2407. ENV: Agent did: predict-no for direction U in state State-A
  2408. In State-A moving U
  2409. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2410. predict error 0
  2411. dir: dir isR
  2412. -/|335: O: O669 (predict-yes)
  2413. I see 1 and I'm going to do: predict-yes
  2414. ENV: Agent did: predict-yes for direction R in state State-A
  2415. In State-A moving R
  2416. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2417. predict error 0
  2418. dir: dir isL
  2419. \-/336: O: O671 (predict-yes)
  2420. I see 1 and I'm going to do: predict-yes
  2421. ENV: Agent did: predict-yes for direction L in state State-B
  2422. In State-B moving L
  2423. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2424. predict error 0
  2425. dir: dir isL
  2426. |\-/337: O: O674 (predict-no)
  2427. I see 1 and I'm going to do: predict-no
  2428. ENV: Agent did: predict-no for direction L in state State-A
  2429. In State-A moving L
  2430. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2431. predict error 0
  2432. dir: dir isL
  2433. |\-338: O: O676 (predict-no)
  2434. I see 1 and I'm going to do: predict-no
  2435. ENV: Agent did: predict-no for direction L in state State-A
  2436. In State-A moving L
  2437. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2438. predict error 0
  2439. dir: dir isL
  2440. /|\339: O: O678 (predict-no)
  2441. I see 1 and I'm going to do: predict-no
  2442. ENV: Agent did: predict-no for direction L in state State-A
  2443. In State-A moving L
  2444. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2445. predict error 0
  2446. dir: dir isU
  2447. -/340: O: O680 (predict-no)
  2448. I see 1 and I'm going to do: predict-no
  2449. ENV: Agent did: predict-no for direction U in state State-A
  2450. In State-A moving U
  2451. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2452. predict error 0
  2453. dir: dir isU
  2454. |\-/341: O: O682 (predict-no)
  2455. I see 1 and I'm going to do: predict-no
  2456. ENV: Agent did: predict-no for direction U in state State-A
  2457. In State-A moving U
  2458. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2459. predict error 0
  2460. dir: dir isU
  2461. |342: O: O684 (predict-no)
  2462. I see 1 and I'm going to do: predict-no
  2463. ENV: Agent did: predict-no for direction U in state State-A
  2464. In State-A moving U
  2465. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2466. predict error 0
  2467. dir: dir isL
  2468. \-/|343: O: O686 (predict-no)
  2469. I see 1 and I'm going to do: predict-no
  2470. ENV: Agent did: predict-no for direction L in state State-A
  2471. In State-A moving L
  2472. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2473. predict error 0
  2474. dir: dir isL
  2475. \-/344: O: O688 (predict-no)
  2476. I see 1 and I'm going to do: predict-no
  2477. ENV: Agent did: predict-no for direction L in state State-A
  2478. In State-A moving L
  2479. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2480. predict error 0
  2481. dir: dir isL
  2482. |\-/345: O: O690 (predict-no)
  2483. I see 1 and I'm going to do: predict-no
  2484. ENV: Agent did: predict-no for direction L in state State-A
  2485. In State-A moving L
  2486. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2487. predict error 0
  2488. dir: dir isU
  2489. |\-346: O: O691 (predict-yes)
  2490. I see 1 and I'm going to do: predict-yes
  2491. ENV: Agent did: predict-yes for direction U in state State-A
  2492. In State-A moving U
  2493. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2494. predict error 1
  2495. dir: dir isL
  2496. /|347: O: O694 (predict-no)
  2497. I see 0 and I'm going to do: predict-no
  2498. ENV: Agent did: predict-no for direction L in state State-A
  2499. In State-A moving L
  2500. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2501. predict error 0
  2502. dir: dir isU
  2503. \-348: O: O696 (predict-no)
  2504. I see 1 and I'm going to do: predict-no
  2505. ENV: Agent did: predict-no for direction U in state State-A
  2506. In State-A moving U
  2507. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2508. predict error 0
  2509. dir: dir isU
  2510. /|\-sleeping...
  2511. /349: O: O698 (predict-no)
  2512. I see 1 and I'm going to do: predict-no
  2513. ENV: Agent did: predict-no for direction U in state State-A
  2514. In State-A moving U
  2515. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2516. predict error 0
  2517. dir: dir isR
  2518. |\-/350: O: O699 (predict-yes)
  2519. I see 1 and I'm going to do: predict-yes
  2520. ENV: Agent did: predict-yes for direction R in state State-A
  2521. In State-A moving R
  2522. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2523. predict error 0
  2524. dir: dir isU
  2525. |\351: O: O702 (predict-no)
  2526. I see 1 and I'm going to do: predict-no
  2527. ENV: Agent did: predict-no for direction U in state State-B
  2528. In State-B moving U
  2529. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2530. predict error 0
  2531. dir: dir isL
  2532. -352: O: O703 (predict-yes)
  2533. I see 1 and I'm going to do: predict-yes
  2534. ENV: Agent did: predict-yes for direction L in state State-B
  2535. In State-B moving L
  2536. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2537. predict error 0
  2538. dir: dir isR
  2539. /|353: O: O705 (predict-yes)
  2540. I see 1 and I'm going to do: predict-yes
  2541. ENV: Agent did: predict-yes for direction R in state State-A
  2542. In State-A moving R
  2543. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2544. predict error 0
  2545. dir: dir isL
  2546. \-354: O: O707 (predict-yes)
  2547. I see 1 and I'm going to do: predict-yes
  2548. ENV: Agent did: predict-yes for direction L in state State-B
  2549. In State-B moving L
  2550. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2551. predict error 0
  2552. dir: dir isL
  2553. /|\355: O: O710 (predict-no)
  2554. I see 1 and I'm going to do: predict-no
  2555. ENV: Agent did: predict-no for direction L in state State-A
  2556. In State-A moving L
  2557. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2558. predict error 0
  2559. dir: dir isL
  2560. -/|\356: O: O712 (predict-no)
  2561. I see 1 and I'm going to do: predict-no
  2562. ENV: Agent did: predict-no for direction L in state State-A
  2563. In State-A moving L
  2564. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2565. predict error 0
  2566. dir: dir isL
  2567. -/|\sleeping...
  2568. -357: O: O714 (predict-no)
  2569. I see 1 and I'm going to do: predict-no
  2570. ENV: Agent did: predict-no for direction L in state State-A
  2571. In State-A moving L
  2572. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2573. predict error 0
  2574. dir: dir isL
  2575. /|\-358: O: O716 (predict-no)
  2576. I see 1 and I'm going to do: predict-no
  2577. ENV: Agent did: predict-no for direction L in state State-A
  2578. In State-A moving L
  2579. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2580. predict error 0
  2581. dir: dir isL
  2582. /|\359: O: O718 (predict-no)
  2583. I see 1 and I'm going to do: predict-no
  2584. ENV: Agent did: predict-no for direction L in state State-A
  2585. In State-A moving L
  2586. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2587. predict error 0
  2588. dir: dir isU
  2589. -/|\360: O: O720 (predict-no)
  2590. I see 1 and I'm going to do: predict-no
  2591. ENV: Agent did: predict-no for direction U in state State-A
  2592. In State-A moving U
  2593. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2594. predict error 0
  2595. dir: dir isL
  2596. -/|361: O: O722 (predict-no)
  2597. I see 1 and I'm going to do: predict-no
  2598. ENV: Agent did: predict-no for direction L in state State-A
  2599. In State-A moving L
  2600. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2601. predict error 0
  2602. dir: dir isU
  2603. \362: O: O724 (predict-no)
  2604. I see 1 and I'm going to do: predict-no
  2605. ENV: Agent did: predict-no for direction U in state State-A
  2606. In State-A moving U
  2607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2608. predict error 0
  2609. dir: dir isU
  2610. -/|363: O: O726 (predict-no)
  2611. I see 1 and I'm going to do: predict-no
  2612. ENV: Agent did: predict-no for direction U in state State-A
  2613. In State-A moving U
  2614. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2615. predict error 0
  2616. dir: dir isU
  2617. \-364: O: O728 (predict-no)
  2618. I see 1 and I'm going to do: predict-no
  2619. ENV: Agent did: predict-no for direction U in state State-A
  2620. In State-A moving U
  2621. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2622. predict error 0
  2623. dir: dir isR
  2624. /|\365: O: O729 (predict-yes)
  2625. I see 1 and I'm going to do: predict-yes
  2626. ENV: Agent did: predict-yes for direction R in state State-A
  2627. In State-A moving R
  2628. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2629. predict error 0
  2630. dir: dir isU
  2631. -/|366: O: O732 (predict-no)
  2632. I see 1 and I'm going to do: predict-no
  2633. ENV: Agent did: predict-no for direction U in state State-B
  2634. In State-B moving U
  2635. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2636. predict error 0
  2637. dir: dir isU
  2638. \-/367: O: O734 (predict-no)
  2639. I see 1 and I'm going to do: predict-no
  2640. ENV: Agent did: predict-no for direction U in state State-B
  2641. In State-B moving U
  2642. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2643. predict error 0
  2644. dir: dir isU
  2645. |\-/368: O: O736 (predict-no)
  2646. I see 1 and I'm going to do: predict-no
  2647. ENV: Agent did: predict-no for direction U in state State-B
  2648. In State-B moving U
  2649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2650. predict error 0
  2651. dir: dir isR
  2652. |\-369: O: O738 (predict-no)
  2653. I see 1 and I'm going to do: predict-no
  2654. ENV: Agent did: predict-no for direction R in state State-B
  2655. In State-B moving R
  2656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2657. predict error 0
  2658. dir: dir isU
  2659. /|\370: O: O740 (predict-no)
  2660. I see 1 and I'm going to do: predict-no
  2661. ENV: Agent did: predict-no for direction U in state State-B
  2662. In State-B moving U
  2663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2664. predict error 0
  2665. dir: dir isU
  2666. -/|371: O: O742 (predict-no)
  2667. I see 1 and I'm going to do: predict-no
  2668. ENV: Agent did: predict-no for direction U in state State-B
  2669. In State-B moving U
  2670. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2671. predict error 0
  2672. dir: dir isR
  2673. \372: O: O744 (predict-no)
  2674. I see 1 and I'm going to do: predict-no
  2675. ENV: Agent did: predict-no for direction R in state State-B
  2676. In State-B moving R
  2677. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2678. predict error 0
  2679. dir: dir isR
  2680. -/373: O: O746 (predict-no)
  2681. I see 1 and I'm going to do: predict-no
  2682. ENV: Agent did: predict-no for direction R in state State-B
  2683. In State-B moving R
  2684. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2685. predict error 0
  2686. dir: dir isL
  2687. |\-374: O: O747 (predict-yes)
  2688. I see 1 and I'm going to do: predict-yes
  2689. ENV: Agent did: predict-yes for direction L in state State-B
  2690. In State-B moving L
  2691. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2692. predict error 0
  2693. dir: dir isU
  2694. /|\375: O: O750 (predict-no)
  2695. I see 1 and I'm going to do: predict-no
  2696. ENV: Agent did: predict-no for direction U in state State-A
  2697. In State-A moving U
  2698. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2699. predict error 0
  2700. dir: dir isR
  2701. -/|376: O: O751 (predict-yes)
  2702. I see 1 and I'm going to do: predict-yes
  2703. ENV: Agent did: predict-yes for direction R in state State-A
  2704. In State-A moving R
  2705. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2706. predict error 0
  2707. dir: dir isL
  2708. \-377: O: O753 (predict-yes)
  2709. I see 1 and I'm going to do: predict-yes
  2710. ENV: Agent did: predict-yes for direction L in state State-B
  2711. In State-B moving L
  2712. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2713. predict error 0
  2714. dir: dir isL
  2715. /|\378: O: O756 (predict-no)
  2716. I see 1 and I'm going to do: predict-no
  2717. ENV: Agent did: predict-no for direction L in state State-A
  2718. In State-A moving L
  2719. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2720. predict error 0
  2721. dir: dir isU
  2722. -/|\379: O: O758 (predict-no)
  2723. I see 1 and I'm going to do: predict-no
  2724. ENV: Agent did: predict-no for direction U in state State-A
  2725. In State-A moving U
  2726. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2727. predict error 0
  2728. dir: dir isL
  2729. -/|380: O: O760 (predict-no)
  2730. I see 1 and I'm going to do: predict-no
  2731. ENV: Agent did: predict-no for direction L in state State-A
  2732. In State-A moving L
  2733. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2734. predict error 0
  2735. dir: dir isR
  2736. \-381: O: O761 (predict-yes)
  2737. I see 1 and I'm going to do: predict-yes
  2738. ENV: Agent did: predict-yes for direction R in state State-A
  2739. In State-A moving R
  2740. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2741. predict error 0
  2742. dir: dir isU
  2743. /382: O: O764 (predict-no)
  2744. I see 1 and I'm going to do: predict-no
  2745. ENV: Agent did: predict-no for direction U in state State-B
  2746. In State-B moving U
  2747. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2748. predict error 0
  2749. dir: dir isU
  2750. |\-383: O: O766 (predict-no)
  2751. I see 1 and I'm going to do: predict-no
  2752. ENV: Agent did: predict-no for direction U in state State-B
  2753. In State-B moving U
  2754. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2755. predict error 0
  2756. dir: dir isL
  2757. /|\-384: O: O767 (predict-yes)
  2758. I see 1 and I'm going to do: predict-yes
  2759. ENV: Agent did: predict-yes for direction L in state State-B
  2760. In State-B moving L
  2761. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2762. predict error 0
  2763. dir: dir isR
  2764. /|\385: O: O769 (predict-yes)
  2765. I see 1 and I'm going to do: predict-yes
  2766. ENV: Agent did: predict-yes for direction R in state State-A
  2767. In State-A moving R
  2768. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2769. predict error 0
  2770. dir: dir isR
  2771. -/|\386: O: O772 (predict-no)
  2772. I see 1 and I'm going to do: predict-no
  2773. ENV: Agent did: predict-no for direction R in state State-B
  2774. In State-B moving R
  2775. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2776. predict error 0
  2777. dir: dir isL
  2778. -/387: O: O773 (predict-yes)
  2779. I see 1 and I'm going to do: predict-yes
  2780. ENV: Agent did: predict-yes for direction L in state State-B
  2781. In State-B moving L
  2782. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2783. predict error 0
  2784. dir: dir isU
  2785. |\-388: O: O776 (predict-no)
  2786. I see 1 and I'm going to do: predict-no
  2787. ENV: Agent did: predict-no for direction U in state State-A
  2788. In State-A moving U
  2789. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2790. predict error 0
  2791. dir: dir isL
  2792. /|\389: O: O778 (predict-no)
  2793. I see 1 and I'm going to do: predict-no
  2794. ENV: Agent did: predict-no for direction L in state State-A
  2795. In State-A moving L
  2796. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2797. predict error 0
  2798. dir: dir isU
  2799. -/|390: O: O780 (predict-no)
  2800. I see 1 and I'm going to do: predict-no
  2801. ENV: Agent did: predict-no for direction U in state State-A
  2802. In State-A moving U
  2803. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2804. predict error 0
  2805. dir: dir isL
  2806. \-/|391: O: O782 (predict-no)
  2807. I see 1 and I'm going to do: predict-no
  2808. ENV: Agent did: predict-no for direction L in state State-A
  2809. In State-A moving L
  2810. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2811. predict error 0
  2812. dir: dir isL
  2813. \392: O: O784 (predict-no)
  2814. I see 1 and I'm going to do: predict-no
  2815. ENV: Agent did: predict-no for direction L in state State-A
  2816. In State-A moving L
  2817. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2818. predict error 0
  2819. dir: dir isU
  2820. -/|393: O: O786 (predict-no)
  2821. I see 1 and I'm going to do: predict-no
  2822. ENV: Agent did: predict-no for direction U in state State-A
  2823. In State-A moving U
  2824. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2825. predict error 0
  2826. dir: dir isU
  2827. \-/|394: O: O787 (predict-yes)
  2828. I see 1 and I'm going to do: predict-yes
  2829. ENV: Agent did: predict-yes for direction U in state State-A
  2830. In State-A moving U
  2831. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2832. predict error 1
  2833. dir: dir isL
  2834. \-/395: O: O790 (predict-no)
  2835. I see 0 and I'm going to do: predict-no
  2836. ENV: Agent did: predict-no for direction L in state State-A
  2837. In State-A moving L
  2838. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2839. predict error 0
  2840. dir: dir isR
  2841. |\-396: O: O791 (predict-yes)
  2842. I see 1 and I'm going to do: predict-yes
  2843. ENV: Agent did: predict-yes for direction R in state State-A
  2844. In State-A moving R
  2845. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2846. predict error 0
  2847. dir: dir isR
  2848. /|\397: O: O794 (predict-no)
  2849. I see 1 and I'm going to do: predict-no
  2850. ENV: Agent did: predict-no for direction R in state State-B
  2851. In State-B moving R
  2852. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2853. predict error 0
  2854. dir: dir isU
  2855. -/|398: O: O795 (predict-yes)
  2856. I see 1 and I'm going to do: predict-yes
  2857. ENV: Agent did: predict-yes for direction U in state State-B
  2858. In State-B moving U
  2859. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2860. predict error 1
  2861. dir: dir isL
  2862. \-/399: O: O797 (predict-yes)
  2863. I see 0 and I'm going to do: predict-yes
  2864. ENV: Agent did: predict-yes for direction L in state State-B
  2865. In State-B moving L
  2866. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2867. predict error 0
  2868. dir: dir isU
  2869. |\400: O: O800 (predict-no)
  2870. I see 1 and I'm going to do: predict-no
  2871. ENV: Agent did: predict-no for direction U in state State-A
  2872. In State-A moving U
  2873. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2874. predict error 0
  2875. dir: dir isU
  2876. -/|\401: O: O802 (predict-no)
  2877. I see 1 and I'm going to do: predict-no
  2878. ENV: Agent did: predict-no for direction U in state State-A
  2879. In State-A moving U
  2880. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2881. predict error 0
  2882. dir: dir isU
  2883. -402: O: O804 (predict-no)
  2884. I see 1 and I'm going to do: predict-no
  2885. ENV: Agent did: predict-no for direction U in state State-A
  2886. In State-A moving U
  2887. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2888. predict error 0
  2889. dir: dir isU
  2890. /|\403: O: O806 (predict-no)
  2891. I see 1 and I'm going to do: predict-no
  2892. ENV: Agent did: predict-no for direction U in state State-A
  2893. In State-A moving U
  2894. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2895. predict error 0
  2896. dir: dir isU
  2897. -/|404: O: O808 (predict-no)
  2898. I see 1 and I'm going to do: predict-no
  2899. ENV: Agent did: predict-no for direction U in state State-A
  2900. In State-A moving U
  2901. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2902. predict error 0
  2903. dir: dir isR
  2904. \-/405: O: O809 (predict-yes)
  2905. I see 1 and I'm going to do: predict-yes
  2906. ENV: Agent did: predict-yes for direction R in state State-A
  2907. In State-A moving R
  2908. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2909. predict error 0
  2910. dir: dir isU
  2911. |\-406: O: O812 (predict-no)
  2912. I see 1 and I'm going to do: predict-no
  2913. ENV: Agent did: predict-no for direction U in state State-B
  2914. In State-B moving U
  2915. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2916. predict error 0
  2917. dir: dir isU
  2918. /407: O: O814 (predict-no)
  2919. I see 1 and I'm going to do: predict-no
  2920. ENV: Agent did: predict-no for direction U in state State-B
  2921. In State-B moving U
  2922. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2923. predict error 0
  2924. dir: dir isR
  2925. |\-408: O: O816 (predict-no)
  2926. I see 1 and I'm going to do: predict-no
  2927. ENV: Agent did: predict-no for direction R in state State-B
  2928. In State-B moving R
  2929. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2930. predict error 0
  2931. dir: dir isL
  2932. /|\409: O: O817 (predict-yes)
  2933. I see 1 and I'm going to do: predict-yes
  2934. ENV: Agent did: predict-yes for direction L in state State-B
  2935. In State-B moving L
  2936. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2937. predict error 0
  2938. dir: dir isR
  2939. -/410: O: O819 (predict-yes)
  2940. I see 1 and I'm going to do: predict-yes
  2941. ENV: Agent did: predict-yes for direction R in state State-A
  2942. In State-A moving R
  2943. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2944. predict error 0
  2945. dir: dir isU
  2946. |\-411: O: O822 (predict-no)
  2947. I see 1 and I'm going to do: predict-no
  2948. ENV: Agent did: predict-no for direction U in state State-B
  2949. In State-B moving U
  2950. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2951. predict error 0
  2952. dir: dir isL
  2953. /412: O: O823 (predict-yes)
  2954. I see 1 and I'm going to do: predict-yes
  2955. ENV: Agent did: predict-yes for direction L in state State-B
  2956. In State-B moving L
  2957. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2958. predict error 0
  2959. dir: dir isR
  2960. |\-413: O: O825 (predict-yes)
  2961. I see 1 and I'm going to do: predict-yes
  2962. ENV: Agent did: predict-yes for direction R in state State-A
  2963. In State-A moving R
  2964. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2965. predict error 0
  2966. dir: dir isU
  2967. /|\414: O: O828 (predict-no)
  2968. I see 1 and I'm going to do: predict-no
  2969. ENV: Agent did: predict-no for direction U in state State-B
  2970. In State-B moving U
  2971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2972. predict error 0
  2973. dir: dir isR
  2974. -/|415: O: O830 (predict-no)
  2975. I see 1 and I'm going to do: predict-no
  2976. ENV: Agent did: predict-no for direction R in state State-B
  2977. In State-B moving R
  2978. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2979. predict error 0
  2980. dir: dir isL
  2981. \-/416: O: O831 (predict-yes)
  2982. I see 1 and I'm going to do: predict-yes
  2983. ENV: Agent did: predict-yes for direction L in state State-B
  2984. In State-B moving L
  2985. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2986. predict error 0
  2987. dir: dir isR
  2988. |\-417: O: O833 (predict-yes)
  2989. I see 1 and I'm going to do: predict-yes
  2990. ENV: Agent did: predict-yes for direction R in state State-A
  2991. In State-A moving R
  2992. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2993. predict error 0
  2994. dir: dir isL
  2995. /|\418: O: O835 (predict-yes)
  2996. I see 1 and I'm going to do: predict-yes
  2997. ENV: Agent did: predict-yes for direction L in state State-B
  2998. In State-B moving L
  2999. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3000. predict error 0
  3001. dir: dir isU
  3002. -/|419: O: O838 (predict-no)
  3003. I see 1 and I'm going to do: predict-no
  3004. ENV: Agent did: predict-no for direction U in state State-A
  3005. In State-A moving U
  3006. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3007. predict error 0
  3008. dir: dir isR
  3009. \-420: O: O839 (predict-yes)
  3010. I see 1 and I'm going to do: predict-yes
  3011. ENV: Agent did: predict-yes for direction R in state State-A
  3012. In State-A moving R
  3013. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3014. predict error 0
  3015. dir: dir isL
  3016. /|\421: O: O841 (predict-yes)
  3017. I see 1 and I'm going to do: predict-yes
  3018. ENV: Agent did: predict-yes for direction L in state State-B
  3019. In State-B moving L
  3020. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3021. predict error 0
  3022. dir: dir isU
  3023. -422: O: O844 (predict-no)
  3024. I see 1 and I'm going to do: predict-no
  3025. ENV: Agent did: predict-no for direction U in state State-A
  3026. In State-A moving U
  3027. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3028. predict error 0
  3029. dir: dir isR
  3030. /|\423: O: O846 (predict-no)
  3031. I see 1 and I'm going to do: predict-no
  3032. ENV: Agent did: predict-no for direction R in state State-A
  3033. In State-A moving R
  3034. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3035. predict error 1
  3036. dir: dir isR
  3037. -/|\424: O: O848 (predict-no)
  3038. I see 0 and I'm going to do: predict-no
  3039. ENV: Agent did: predict-no for direction R in state State-B
  3040. In State-B moving R
  3041. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3042. predict error 0
  3043. dir: dir isU
  3044. -/|\425: O: O850 (predict-no)
  3045. I see 1 and I'm going to do: predict-no
  3046. ENV: Agent did: predict-no for direction U in state State-B
  3047. In State-B moving U
  3048. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3049. predict error 0
  3050. dir: dir isU
  3051. -/426: O: O852 (predict-no)
  3052. I see 1 and I'm going to do: predict-no
  3053. ENV: Agent did: predict-no for direction U in state State-B
  3054. In State-B moving U
  3055. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3056. predict error 0
  3057. dir: dir isU
  3058. |\-427: O: O854 (predict-no)
  3059. I see 1 and I'm going to do: predict-no
  3060. ENV: Agent did: predict-no for direction U in state State-B
  3061. In State-B moving U
  3062. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3063. predict error 0
  3064. dir: dir isU
  3065. /|\428: O: O856 (predict-no)
  3066. I see 1 and I'm going to do: predict-no
  3067. ENV: Agent did: predict-no for direction U in state State-B
  3068. In State-B moving U
  3069. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3070. predict error 0
  3071. dir: dir isR
  3072. -/|\429: O: O858 (predict-no)
  3073. I see 1 and I'm going to do: predict-no
  3074. ENV: Agent did: predict-no for direction R in state State-B
  3075. In State-B moving R
  3076. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3077. predict error 0
  3078. dir: dir isU
  3079. -/430: O: O860 (predict-no)
  3080. I see 1 and I'm going to do: predict-no
  3081. ENV: Agent did: predict-no for direction U in state State-B
  3082. In State-B moving U
  3083. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3084. predict error 0
  3085. dir: dir isR
  3086. |\431: O: O862 (predict-no)
  3087. I see 1 and I'm going to do: predict-no
  3088. ENV: Agent did: predict-no for direction R in state State-B
  3089. In State-B moving R
  3090. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3091. predict error 0
  3092. dir: dir isR
  3093. -432: O: O864 (predict-no)
  3094. I see 1 and I'm going to do: predict-no
  3095. ENV: Agent did: predict-no for direction R in state State-B
  3096. In State-B moving R
  3097. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3098. predict error 0
  3099. dir: dir isU
  3100. /|\-433: O: O866 (predict-no)
  3101. I see 1 and I'm going to do: predict-no
  3102. ENV: Agent did: predict-no for direction U in state State-B
  3103. In State-B moving U
  3104. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3105. predict error 0
  3106. dir: dir isR
  3107. /|\434: O: O868 (predict-no)
  3108. I see 1 and I'm going to do: predict-no
  3109. ENV: Agent did: predict-no for direction R in state State-B
  3110. In State-B moving R
  3111. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3112. predict error 0
  3113. dir: dir isL
  3114. -/|\435: O: O869 (predict-yes)
  3115. I see 1 and I'm going to do: predict-yes
  3116. ENV: Agent did: predict-yes for direction L in state State-B
  3117. In State-B moving L
  3118. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3119. predict error 0
  3120. dir: dir isU
  3121. -/|436: O: O872 (predict-no)
  3122. I see 1 and I'm going to do: predict-no
  3123. ENV: Agent did: predict-no for direction U in state State-A
  3124. In State-A moving U
  3125. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3126. predict error 0
  3127. dir: dir isR
  3128. \-437: O: O873 (predict-yes)
  3129. I see 1 and I'm going to do: predict-yes
  3130. ENV: Agent did: predict-yes for direction R in state State-A
  3131. In State-A moving R
  3132. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3133. predict error 0
  3134. dir: dir isR
  3135. /|\438: O: O876 (predict-no)
  3136. I see 1 and I'm going to do: predict-no
  3137. ENV: Agent did: predict-no for direction R in state State-B
  3138. In State-B moving R
  3139. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3140. predict error 0
  3141. dir: dir isR
  3142. -/|\439: O: O878 (predict-no)
  3143. I see 1 and I'm going to do: predict-no
  3144. ENV: Agent did: predict-no for direction R in state State-B
  3145. In State-B moving R
  3146. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3147. predict error 0
  3148. dir: dir isL
  3149. -/|\440: O: O879 (predict-yes)
  3150. I see 1 and I'm going to do: predict-yes
  3151. ENV: Agent did: predict-yes for direction L in state State-B
  3152. In State-B moving L
  3153. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3154. predict error 0
  3155. dir: dir isL
  3156. -/|441: O: O882 (predict-no)
  3157. I see 1 and I'm going to do: predict-no
  3158. ENV: Agent did: predict-no for direction L in state State-A
  3159. In State-A moving L
  3160. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3161. predict error 0
  3162. dir: dir isL
  3163. \442: O: O884 (predict-no)
  3164. I see 1 and I'm going to do: predict-no
  3165. ENV: Agent did: predict-no for direction L in state State-A
  3166. In State-A moving L
  3167. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3168. predict error 0
  3169. dir: dir isU
  3170. -/443: O: O886 (predict-no)
  3171. I see 1 and I'm going to do: predict-no
  3172. ENV: Agent did: predict-no for direction U in state State-A
  3173. In State-A moving U
  3174. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3175. predict error 0
  3176. dir: dir isU
  3177. |\444: O: O888 (predict-no)
  3178. I see 1 and I'm going to do: predict-no
  3179. ENV: Agent did: predict-no for direction U in state State-A
  3180. In State-A moving U
  3181. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3182. predict error 0
  3183. dir: dir isU
  3184. -445: O: O890 (predict-no)
  3185. I see 1 and I'm going to do: predict-no
  3186. ENV: Agent did: predict-no for direction U in state State-A
  3187. In State-A moving U
  3188. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3189. predict error 0
  3190. dir: dir isL
  3191. /|\446: O: O892 (predict-no)
  3192. I see 1 and I'm going to do: predict-no
  3193. ENV: Agent did: predict-no for direction L in state State-A
  3194. In State-A moving L
  3195. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3196. predict error 0
  3197. dir: dir isU
  3198. -/447: O: O894 (predict-no)
  3199. I see 1 and I'm going to do: predict-no
  3200. ENV: Agent did: predict-no for direction U in state State-A
  3201. In State-A moving U
  3202. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3203. predict error 0
  3204. dir: dir isU
  3205. |\-448: O: O896 (predict-no)
  3206. I see 1 and I'm going to do: predict-no
  3207. ENV: Agent did: predict-no for direction U in state State-A
  3208. In State-A moving U
  3209. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3210. predict error 0
  3211. dir: dir isR
  3212. /|\449: O: O897 (predict-yes)
  3213. I see 1 and I'm going to do: predict-yes
  3214. ENV: Agent did: predict-yes for direction R in state State-A
  3215. In State-A moving R
  3216. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3217. predict error 0
  3218. dir: dir isL
  3219. -/|\450: O: O899 (predict-yes)
  3220. I see 1 and I'm going to do: predict-yes
  3221. ENV: Agent did: predict-yes for direction L in state State-B
  3222. In State-B moving L
  3223. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3224. predict error 0
  3225. dir: dir isU
  3226. -/|451: O: O902 (predict-no)
  3227. I see 1 and I'm going to do: predict-no
  3228. ENV: Agent did: predict-no for direction U in state State-A
  3229. In State-A moving U
  3230. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3231. predict error 0
  3232. dir: dir isR
  3233. \452: O: O903 (predict-yes)
  3234. I see 1 and I'm going to do: predict-yes
  3235. ENV: Agent did: predict-yes for direction R in state State-A
  3236. In State-A moving R
  3237. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3238. predict error 0
  3239. dir: dir isR
  3240. -/453: O: O906 (predict-no)
  3241. I see 1 and I'm going to do: predict-no
  3242. ENV: Agent did: predict-no for direction R in state State-B
  3243. In State-B moving R
  3244. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3245. predict error 0
  3246. dir: dir isL
  3247. |\-454: O: O907 (predict-yes)
  3248. I see 1 and I'm going to do: predict-yes
  3249. ENV: Agent did: predict-yes for direction L in state State-B
  3250. In State-B moving L
  3251. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3252. predict error 0
  3253. dir: dir isU
  3254. /|\-455: O: O910 (predict-no)
  3255. I see 1 and I'm going to do: predict-no
  3256. ENV: Agent did: predict-no for direction U in state State-A
  3257. In State-A moving U
  3258. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3259. predict error 0
  3260. dir: dir isL
  3261. /|456: O: O912 (predict-no)
  3262. I see 1 and I'm going to do: predict-no
  3263. ENV: Agent did: predict-no for direction L in state State-A
  3264. In State-A moving L
  3265. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3266. predict error 0
  3267. dir: dir isR
  3268. \-457: O: O913 (predict-yes)
  3269. I see 1 and I'm going to do: predict-yes
  3270. ENV: Agent did: predict-yes for direction R in state State-A
  3271. In State-A moving R
  3272. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3273. predict error 0
  3274. dir: dir isL
  3275. /|458: O: O915 (predict-yes)
  3276. I see 1 and I'm going to do: predict-yes
  3277. ENV: Agent did: predict-yes for direction L in state State-B
  3278. In State-B moving L
  3279. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3280. predict error 0
  3281. dir: dir isR
  3282. \-/459: O: O917 (predict-yes)
  3283. I see 1 and I'm going to do: predict-yes
  3284. ENV: Agent did: predict-yes for direction R in state State-A
  3285. In State-A moving R
  3286. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3287. predict error 0
  3288. dir: dir isU
  3289. |\460: O: O920 (predict-no)
  3290. I see 1 and I'm going to do: predict-no
  3291. ENV: Agent did: predict-no for direction U in state State-B
  3292. In State-B moving U
  3293. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3294. predict error 0
  3295. dir: dir isU
  3296. -/461: O: O921 (predict-yes)
  3297. I see 1 and I'm going to do: predict-yes
  3298. ENV: Agent did: predict-yes for direction U in state State-B
  3299. In State-B moving U
  3300. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3301. predict error 1
  3302. dir: dir isU
  3303. |462: O: O924 (predict-no)
  3304. I see 0 and I'm going to do: predict-no
  3305. ENV: Agent did: predict-no for direction U in state State-B
  3306. In State-B moving U
  3307. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3308. predict error 0
  3309. dir: dir isL
  3310. \-/463: O: O925 (predict-yes)
  3311. I see 1 and I'm going to do: predict-yes
  3312. ENV: Agent did: predict-yes for direction L in state State-B
  3313. In State-B moving L
  3314. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3315. predict error 0
  3316. dir: dir isR
  3317. |\-464: O: O927 (predict-yes)
  3318. I see 1 and I'm going to do: predict-yes
  3319. ENV: Agent did: predict-yes for direction R in state State-A
  3320. In State-A moving R
  3321. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3322. predict error 0
  3323. dir: dir isR
  3324. /|465: O: O930 (predict-no)
  3325. I see 1 and I'm going to do: predict-no
  3326. ENV: Agent did: predict-no for direction R in state State-B
  3327. In State-B moving R
  3328. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3329. predict error 0
  3330. dir: dir isL
  3331. \-466: O: O931 (predict-yes)
  3332. I see 1 and I'm going to do: predict-yes
  3333. ENV: Agent did: predict-yes for direction L in state State-B
  3334. In State-B moving L
  3335. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3336. predict error 0
  3337. dir: dir isR
  3338. /|\467: O: O933 (predict-yes)
  3339. I see 1 and I'm going to do: predict-yes
  3340. ENV: Agent did: predict-yes for direction R in state State-A
  3341. In State-A moving R
  3342. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3343. predict error 0
  3344. dir: dir isU
  3345. -/|468: O: O936 (predict-no)
  3346. I see 1 and I'm going to do: predict-no
  3347. ENV: Agent did: predict-no for direction U in state State-B
  3348. In State-B moving U
  3349. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3350. predict error 0
  3351. dir: dir isU
  3352. \-469: O: O938 (predict-no)
  3353. I see 1 and I'm going to do: predict-no
  3354. ENV: Agent did: predict-no for direction U in state State-B
  3355. In State-B moving U
  3356. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3357. predict error 0
  3358. dir: dir isU
  3359. /|470: O: O940 (predict-no)
  3360. I see 1 and I'm going to do: predict-no
  3361. ENV: Agent did: predict-no for direction U in state State-B
  3362. In State-B moving U
  3363. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3364. predict error 0
  3365. dir: dir isL
  3366. \-471: O: O941 (predict-yes)
  3367. I see 1 and I'm going to do: predict-yes
  3368. ENV: Agent did: predict-yes for direction L in state State-B
  3369. In State-B moving L
  3370. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3371. predict error 0
  3372. dir: dir isU
  3373. /472: O: O944 (predict-no)
  3374. I see 1 and I'm going to do: predict-no
  3375. ENV: Agent did: predict-no for direction U in state State-A
  3376. In State-A moving U
  3377. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3378. predict error 0
  3379. dir: dir isU
  3380. |\473: O: O946 (predict-no)
  3381. I see 1 and I'm going to do: predict-no
  3382. ENV: Agent did: predict-no for direction U in state State-A
  3383. In State-A moving U
  3384. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3385. predict error 0
  3386. dir: dir isU
  3387. -/|474: O: O948 (predict-no)
  3388. I see 1 and I'm going to do: predict-no
  3389. ENV: Agent did: predict-no for direction U in state State-A
  3390. In State-A moving U
  3391. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3392. predict error 0
  3393. dir: dir isR
  3394. \-/|475: O: O949 (predict-yes)
  3395. I see 1 and I'm going to do: predict-yes
  3396. ENV: Agent did: predict-yes for direction R in state State-A
  3397. In State-A moving R
  3398. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3399. predict error 0
  3400. dir: dir isL
  3401. \-/476: O: O951 (predict-yes)
  3402. I see 1 and I'm going to do: predict-yes
  3403. ENV: Agent did: predict-yes for direction L in state State-B
  3404. In State-B moving L
  3405. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3406. predict error 0
  3407. dir: dir isL
  3408. |\477: O: O954 (predict-no)
  3409. I see 1 and I'm going to do: predict-no
  3410. ENV: Agent did: predict-no for direction L in state State-A
  3411. In State-A moving L
  3412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3413. predict error 0
  3414. dir: dir isU
  3415. -/|478: O: O956 (predict-no)
  3416. I see 1 and I'm going to do: predict-no
  3417. ENV: Agent did: predict-no for direction U in state State-A
  3418. In State-A moving U
  3419. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3420. predict error 0
  3421. dir: dir isU
  3422. \-/479: O: O958 (predict-no)
  3423. I see 1 and I'm going to do: predict-no
  3424. ENV: Agent did: predict-no for direction U in state State-A
  3425. In State-A moving U
  3426. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3427. predict error 0
  3428. dir: dir isL
  3429. |\-480: O: O960 (predict-no)
  3430. I see 1 and I'm going to do: predict-no
  3431. ENV: Agent did: predict-no for direction L in state State-A
  3432. In State-A moving L
  3433. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3434. predict error 0
  3435. dir: dir isU
  3436. /|\481: O: O962 (predict-no)
  3437. I see 1 and I'm going to do: predict-no
  3438. ENV: Agent did: predict-no for direction U in state State-A
  3439. In State-A moving U
  3440. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3441. predict error 0
  3442. dir: dir isR
  3443. -482: O: O963 (predict-yes)
  3444. I see 1 and I'm going to do: predict-yes
  3445. ENV: Agent did: predict-yes for direction R in state State-A
  3446. In State-A moving R
  3447. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3448. predict error 0
  3449. dir: dir isR
  3450. /|\-sleeping...
  3451. /483: O: O966 (predict-no)
  3452. I see 1 and I'm going to do: predict-no
  3453. ENV: Agent did: predict-no for direction R in state State-B
  3454. In State-B moving R
  3455. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3456. predict error 0
  3457. dir: dir isU
  3458. |\-484: O: O968 (predict-no)
  3459. I see 1 and I'm going to do: predict-no
  3460. ENV: Agent did: predict-no for direction U in state State-B
  3461. In State-B moving U
  3462. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3463. predict error 0
  3464. dir: dir isL
  3465. /|485: O: O969 (predict-yes)
  3466. I see 1 and I'm going to do: predict-yes
  3467. ENV: Agent did: predict-yes for direction L in state State-B
  3468. In State-B moving L
  3469. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3470. predict error 0
  3471. dir: dir isR
  3472. \-/486: O: O971 (predict-yes)
  3473. I see 1 and I'm going to do: predict-yes
  3474. ENV: Agent did: predict-yes for direction R in state State-A
  3475. In State-A moving R
  3476. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3477. predict error 0
  3478. dir: dir isL
  3479. |\-487: O: O973 (predict-yes)
  3480. I see 1 and I'm going to do: predict-yes
  3481. ENV: Agent did: predict-yes for direction L in state State-B
  3482. In State-B moving L
  3483. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3484. predict error 0
  3485. dir: dir isR
  3486. /|\488: O: O975 (predict-yes)
  3487. I see 1 and I'm going to do: predict-yes
  3488. ENV: Agent did: predict-yes for direction R in state State-A
  3489. In State-A moving R
  3490. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3491. predict error 0
  3492. dir: dir isL
  3493. -/|489: O: O977 (predict-yes)
  3494. I see 1 and I'm going to do: predict-yes
  3495. ENV: Agent did: predict-yes for direction L in state State-B
  3496. In State-B moving L
  3497. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3498. predict error 0
  3499. dir: dir isL
  3500. \-490: O: O980 (predict-no)
  3501. I see 1 and I'm going to do: predict-no
  3502. ENV: Agent did: predict-no for direction L in state State-A
  3503. In State-A moving L
  3504. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3505. predict error 0
  3506. dir: dir isL
  3507. /|491: O: O982 (predict-no)
  3508. I see 1 and I'm going to do: predict-no
  3509. ENV: Agent did: predict-no for direction L in state State-A
  3510. In State-A moving L
  3511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3512. predict error 0
  3513. dir: dir isU
  3514. \492: O: O984 (predict-no)
  3515. I see 1 and I'm going to do: predict-no
  3516. ENV: Agent did: predict-no for direction U in state State-A
  3517. In State-A moving U
  3518. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3519. predict error 0
  3520. dir: dir isL
  3521. -/493: O: O986 (predict-no)
  3522. I see 1 and I'm going to do: predict-no
  3523. ENV: Agent did: predict-no for direction L in state State-A
  3524. In State-A moving L
  3525. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3526. predict error 0
  3527. dir: dir isU
  3528. |\-494: O: O988 (predict-no)
  3529. I see 1 and I'm going to do: predict-no
  3530. ENV: Agent did: predict-no for direction U in state State-A
  3531. In State-A moving U
  3532. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3533. predict error 0
  3534. dir: dir isL
  3535. /|\495: O: O990 (predict-no)
  3536. I see 1 and I'm going to do: predict-no
  3537. ENV: Agent did: predict-no for direction L in state State-A
  3538. In State-A moving L
  3539. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3540. predict error 0
  3541. dir: dir isU
  3542. -/|496: O: O992 (predict-no)
  3543. I see 1 and I'm going to do: predict-no
  3544. ENV: Agent did: predict-no for direction U in state State-A
  3545. In State-A moving U
  3546. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3547. predict error 0
  3548. dir: dir isL
  3549. \-/|497: O: O994 (predict-no)
  3550. I see 1 and I'm going to do: predict-no
  3551. ENV: Agent did: predict-no for direction L in state State-A
  3552. In State-A moving L
  3553. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3554. predict error 0
  3555. dir: dir isL
  3556. \-498: O: O996 (predict-no)
  3557. I see 1 and I'm going to do: predict-no
  3558. ENV: Agent did: predict-no for direction L in state State-A
  3559. In State-A moving L
  3560. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3561. predict error 0
  3562. dir: dir isL
  3563. /|\499: O: O998 (predict-no)
  3564. I see 1 and I'm going to do: predict-no
  3565. ENV: Agent did: predict-no for direction L in state State-A
  3566. In State-A moving L
  3567. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3568. predict error 0
  3569. dir: dir isU
  3570. -/|500: O: O1000 (predict-no)
  3571. I see 1 and I'm going to do: predict-no
  3572. ENV: Agent did: predict-no for direction U in state State-A
  3573. In State-A moving U
  3574. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3575. predict error 0
  3576. dir: dir isU
  3577. \-/|\-501: O: O1002 (predict-no)
  3578. I see 1 and I'm going to do: predict-no
  3579. ENV: Agent did: predict-no for direction U in state State-A
  3580. In State-A moving U
  3581. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3582. predict error 0
  3583. dir: dir isU
  3584. /502: O: O1004 (predict-no)
  3585. I see 1 and I'm going to do: predict-no
  3586. ENV: Agent did: predict-no for direction U in state State-A
  3587. In State-A moving U
  3588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3589. predict error 0
  3590. dir: dir isR
  3591. |\-503: O: O1005 (predict-yes)
  3592. I see 1 and I'm going to do: predict-yes
  3593. ENV: Agent did: predict-yes for direction R in state State-A
  3594. In State-A moving R
  3595. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3596. predict error 0
  3597. dir: dir isL
  3598. /|\504: O: O1007 (predict-yes)
  3599. I see 1 and I'm going to do: predict-yes
  3600. ENV: Agent did: predict-yes for direction L in state State-B
  3601. In State-B moving L
  3602. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3603. predict error 0
  3604. dir: dir isR
  3605. -505: O: O1009 (predict-yes)
  3606. I see 1 and I'm going to do: predict-yes
  3607. ENV: Agent did: predict-yes for direction R in state State-A
  3608. In State-A moving R
  3609. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3610. predict error 0
  3611. dir: dir isR
  3612. /|\506: O: O1012 (predict-no)
  3613. I see 1 and I'm going to do: predict-no
  3614. ENV: Agent did: predict-no for direction R in state State-B
  3615. In State-B moving R
  3616. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3617. predict error 0
  3618. dir: dir isU
  3619. -/|507: O: O1014 (predict-no)
  3620. I see 1 and I'm going to do: predict-no
  3621. ENV: Agent did: predict-no for direction U in state State-B
  3622. In State-B moving U
  3623. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3624. predict error 0
  3625. dir: dir isL
  3626. \-508: O: O1015 (predict-yes)
  3627. I see 1 and I'm going to do: predict-yes
  3628. ENV: Agent did: predict-yes for direction L in state State-B
  3629. In State-B moving L
  3630. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3631. predict error 0
  3632. dir: dir isL
  3633. /|509: O: O1018 (predict-no)
  3634. I see 1 and I'm going to do: predict-no
  3635. ENV: Agent did: predict-no for direction L in state State-A
  3636. In State-A moving L
  3637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3638. predict error 0
  3639. dir: dir isU
  3640. \-/510: O: O1020 (predict-no)
  3641. I see 1 and I'm going to do: predict-no
  3642. ENV: Agent did: predict-no for direction U in state State-A
  3643. In State-A moving U
  3644. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3645. predict error 0
  3646. dir: dir isL
  3647. |\-511: O: O1022 (predict-no)
  3648. I see 1 and I'm going to do: predict-no
  3649. ENV: Agent did: predict-no for direction L in state State-A
  3650. In State-A moving L
  3651. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3652. predict error 0
  3653. dir: dir isU
  3654. /512: O: O1024 (predict-no)
  3655. I see 1 and I'm going to do: predict-no
  3656. ENV: Agent did: predict-no for direction U in state State-A
  3657. In State-A moving U
  3658. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3659. predict error 0
  3660. dir: dir isL
  3661. |\-513: O: O1026 (predict-no)
  3662. I see 1 and I'm going to do: predict-no
  3663. ENV: Agent did: predict-no for direction L in state State-A
  3664. In State-A moving L
  3665. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3666. predict error 0
  3667. dir: dir isR
  3668. /|\514: O: O1027 (predict-yes)
  3669. I see 1 and I'm going to do: predict-yes
  3670. ENV: Agent did: predict-yes for direction R in state State-A
  3671. In State-A moving R
  3672. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3673. predict error 0
  3674. dir: dir isR
  3675. -/515: O: O1030 (predict-no)
  3676. I see 1 and I'm going to do: predict-no
  3677. ENV: Agent did: predict-no for direction R in state State-B
  3678. In State-B moving R
  3679. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3680. predict error 0
  3681. dir: dir isU
  3682. |\-/516: O: O1032 (predict-no)
  3683. I see 1 and I'm going to do: predict-no
  3684. ENV: Agent did: predict-no for direction U in state State-B
  3685. In State-B moving U
  3686. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3687. predict error 0
  3688. dir: dir isL
  3689. |\517: O: O1033 (predict-yes)
  3690. I see 1 and I'm going to do: predict-yes
  3691. ENV: Agent did: predict-yes for direction L in state State-B
  3692. In State-B moving L
  3693. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3694. predict error 0
  3695. dir: dir isU
  3696. -/|518: O: O1036 (predict-no)
  3697. I see 1 and I'm going to do: predict-no
  3698. ENV: Agent did: predict-no for direction U in state State-A
  3699. In State-A moving U
  3700. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3701. predict error 0
  3702. dir: dir isL
  3703. \-519: O: O1038 (predict-no)
  3704. I see 1 and I'm going to do: predict-no
  3705. ENV: Agent did: predict-no for direction L in state State-A
  3706. In State-A moving L
  3707. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3708. predict error 0
  3709. dir: dir isU
  3710. /|\520: O: O1040 (predict-no)
  3711. I see 1 and I'm going to do: predict-no
  3712. ENV: Agent did: predict-no for direction U in state State-A
  3713. In State-A moving U
  3714. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3715. predict error 0
  3716. dir: dir isL
  3717. -/521: O: O1042 (predict-no)
  3718. I see 1 and I'm going to do: predict-no
  3719. ENV: Agent did: predict-no for direction L in state State-A
  3720. In State-A moving L
  3721. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3722. predict error 0
  3723. dir: dir isU
  3724. |522: O: O1044 (predict-no)
  3725. I see 1 and I'm going to do: predict-no
  3726. ENV: Agent did: predict-no for direction U in state State-A
  3727. In State-A moving U
  3728. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3729. predict error 0
  3730. dir: dir isL
  3731. \-/523: O: O1046 (predict-no)
  3732. I see 1 and I'm going to do: predict-no
  3733. ENV: Agent did: predict-no for direction L in state State-A
  3734. In State-A moving L
  3735. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3736. predict error 0
  3737. dir: dir isL
  3738. |\-/524: O: O1048 (predict-no)
  3739. I see 1 and I'm going to do: predict-no
  3740. ENV: Agent did: predict-no for direction L in state State-A
  3741. In State-A moving L
  3742. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3743. predict error 0
  3744. dir: dir isL
  3745. |\-525: O: O1050 (predict-no)
  3746. I see 1 and I'm going to do: predict-no
  3747. ENV: Agent did: predict-no for direction L in state State-A
  3748. In State-A moving L
  3749. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3750. predict error 0
  3751. dir: dir isL
  3752. /|\-526: O: O1052 (predict-no)
  3753. I see 1 and I'm going to do: predict-no
  3754. ENV: Agent did: predict-no for direction L in state State-A
  3755. In State-A moving L
  3756. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3757. predict error 0
  3758. dir: dir isL
  3759. /|\527: O: O1054 (predict-no)
  3760. I see 1 and I'm going to do: predict-no
  3761. ENV: Agent did: predict-no for direction L in state State-A
  3762. In State-A moving L
  3763. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3764. predict error 0
  3765. dir: dir isU
  3766. -/528: O: O1056 (predict-no)
  3767. I see 1 and I'm going to do: predict-no
  3768. ENV: Agent did: predict-no for direction U in state State-A
  3769. In State-A moving U
  3770. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3771. predict error 0
  3772. dir: dir isR
  3773. |\-/529: O: O1057 (predict-yes)
  3774. I see 1 and I'm going to do: predict-yes
  3775. ENV: Agent did: predict-yes for direction R in state State-A
  3776. In State-A moving R
  3777. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3778. predict error 0
  3779. dir: dir isR
  3780. |\-530: O: O1060 (predict-no)
  3781. I see 1 and I'm going to do: predict-no
  3782. ENV: Agent did: predict-no for direction R in state State-B
  3783. In State-B moving R
  3784. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3785. predict error 0
  3786. dir: dir isU
  3787. /|\-531: O: O1062 (predict-no)
  3788. I see 1 and I'm going to do: predict-no
  3789. ENV: Agent did: predict-no for direction U in state State-B
  3790. In State-B moving U
  3791. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3792. predict error 0
  3793. dir: dir isL
  3794. /532: O: O1063 (predict-yes)
  3795. I see 1 and I'm going to do: predict-yes
  3796. ENV: Agent did: predict-yes for direction L in state State-B
  3797. In State-B moving L
  3798. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3799. predict error 0
  3800. dir: dir isL
  3801. |533: O: O1066 (predict-no)
  3802. I see 1 and I'm going to do: predict-no
  3803. ENV: Agent did: predict-no for direction L in state State-A
  3804. In State-A moving L
  3805. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3806. predict error 0
  3807. dir: dir isU
  3808. \-/534: O: O1068 (predict-no)
  3809. I see 1 and I'm going to do: predict-no
  3810. ENV: Agent did: predict-no for direction U in state State-A
  3811. In State-A moving U
  3812. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3813. predict error 0
  3814. dir: dir isU
  3815. |\535: O: O1070 (predict-no)
  3816. I see 1 and I'm going to do: predict-no
  3817. ENV: Agent did: predict-no for direction U in state State-A
  3818. In State-A moving U
  3819. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3820. predict error 0
  3821. dir: dir isU
  3822. -/|536: O: O1072 (predict-no)
  3823. I see 1 and I'm going to do: predict-no
  3824. ENV: Agent did: predict-no for direction U in state State-A
  3825. In State-A moving U
  3826. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3827. predict error 0
  3828. dir: dir isU
  3829. \-537: O: O1074 (predict-no)
  3830. I see 1 and I'm going to do: predict-no
  3831. ENV: Agent did: predict-no for direction U in state State-A
  3832. In State-A moving U
  3833. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3834. predict error 0
  3835. dir: dir isL
  3836. /|538: O: O1076 (predict-no)
  3837. I see 1 and I'm going to do: predict-no
  3838. ENV: Agent did: predict-no for direction L in state State-A
  3839. In State-A moving L
  3840. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3841. predict error 0
  3842. dir: dir isU
  3843. \-/539: O: O1078 (predict-no)
  3844. I see 1 and I'm going to do: predict-no
  3845. ENV: Agent did: predict-no for direction U in state State-A
  3846. In State-A moving U
  3847. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3848. predict error 0
  3849. dir: dir isU
  3850. |\-540: O: O1080 (predict-no)
  3851. I see 1 and I'm going to do: predict-no
  3852. ENV: Agent did: predict-no for direction U in state State-A
  3853. In State-A moving U
  3854. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3855. predict error 0
  3856. dir: dir isU
  3857. /|\541: O: O1082 (predict-no)
  3858. I see 1 and I'm going to do: predict-no
  3859. ENV: Agent did: predict-no for direction U in state State-A
  3860. In State-A moving U
  3861. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3862. predict error 0
  3863. dir: dir isU
  3864. -542: O: O1084 (predict-no)
  3865. I see 1 and I'm going to do: predict-no
  3866. ENV: Agent did: predict-no for direction U in state State-A
  3867. In State-A moving U
  3868. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3869. predict error 0
  3870. dir: dir isL
  3871. /|\-543: O: O1086 (predict-no)
  3872. I see 1 and I'm going to do: predict-no
  3873. ENV: Agent did: predict-no for direction L in state State-A
  3874. In State-A moving L
  3875. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3876. predict error 0
  3877. dir: dir isL
  3878. /|\-544: O: O1088 (predict-no)
  3879. I see 1 and I'm going to do: predict-no
  3880. ENV: Agent did: predict-no for direction L in state State-A
  3881. In State-A moving L
  3882. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3883. predict error 0
  3884. dir: dir isL
  3885. /|545: O: O1090 (predict-no)
  3886. I see 1 and I'm going to do: predict-no
  3887. ENV: Agent did: predict-no for direction L in state State-A
  3888. In State-A moving L
  3889. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3890. predict error 0
  3891. dir: dir isR
  3892. \-/546: O: O1091 (predict-yes)
  3893. I see 1 and I'm going to do: predict-yes
  3894. ENV: Agent did: predict-yes for direction R in state State-A
  3895. In State-A moving R
  3896. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3897. predict error 0
  3898. dir: dir isR
  3899. |\-547: O: O1094 (predict-no)
  3900. I see 1 and I'm going to do: predict-no
  3901. ENV: Agent did: predict-no for direction R in state State-B
  3902. In State-B moving R
  3903. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3904. predict error 0
  3905. dir: dir isR
  3906. /|\548: O: O1096 (predict-no)
  3907. I see 1 and I'm going to do: predict-no
  3908. ENV: Agent did: predict-no for direction R in state State-B
  3909. In State-B moving R
  3910. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3911. predict error 0
  3912. dir: dir isR
  3913. -/|549: O: O1098 (predict-no)
  3914. I see 1 and I'm going to do: predict-no
  3915. ENV: Agent did: predict-no for direction R in state State-B
  3916. In State-B moving R
  3917. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3918. predict error 0
  3919. dir: dir isU
  3920. \-/550: O: O1100 (predict-no)
  3921. I see 1 and I'm going to do: predict-no
  3922. ENV: Agent did: predict-no for direction U in state State-B
  3923. In State-B moving U
  3924. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3925. predict error 0
  3926. dir: dir isU
  3927. |\-551: O: O1102 (predict-no)
  3928. I see 1 and I'm going to do: predict-no
  3929. ENV: Agent did: predict-no for direction U in state State-B
  3930. In State-B moving U
  3931. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3932. predict error 0
  3933. dir: dir isU
  3934. /552: O: O1104 (predict-no)
  3935. I see 1 and I'm going to do: predict-no
  3936. ENV: Agent did: predict-no for direction U in state State-B
  3937. In State-B moving U
  3938. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3939. predict error 0
  3940. dir: dir isL
  3941. |\553: O: O1105 (predict-yes)
  3942. I see 1 and I'm going to do: predict-yes
  3943. ENV: Agent did: predict-yes for direction L in state State-B
  3944. In State-B moving L
  3945. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3946. predict error 0
  3947. dir: dir isL
  3948. -/|554: O: O1108 (predict-no)
  3949. I see 1 and I'm going to do: predict-no
  3950. ENV: Agent did: predict-no for direction L in state State-A
  3951. In State-A moving L
  3952. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3953. predict error 0
  3954. dir: dir isL
  3955. \-/|555: O: O1110 (predict-no)
  3956. I see 1 and I'm going to do: predict-no
  3957. ENV: Agent did: predict-no for direction L in state State-A
  3958. In State-A moving L
  3959. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3960. predict error 0
  3961. dir: dir isL
  3962. \-/556: O: O1112 (predict-no)
  3963. I see 1 and I'm going to do: predict-no
  3964. ENV: Agent did: predict-no for direction L in state State-A
  3965. In State-A moving L
  3966. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3967. predict error 0
  3968. dir: dir isR
  3969. |\-557: O: O1113 (predict-yes)
  3970. I see 1 and I'm going to do: predict-yes
  3971. ENV: Agent did: predict-yes for direction R in state State-A
  3972. In State-A moving R
  3973. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3974. predict error 0
  3975. dir: dir isL
  3976. /|\558: O: O1115 (predict-yes)
  3977. I see 1 and I'm going to do: predict-yes
  3978. ENV: Agent did: predict-yes for direction L in state State-B
  3979. In State-B moving L
  3980. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3981. predict error 0
  3982. dir: dir isR
  3983. -/559: O: O1117 (predict-yes)
  3984. I see 1 and I'm going to do: predict-yes
  3985. ENV: Agent did: predict-yes for direction R in state State-A
  3986. In State-A moving R
  3987. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3988. predict error 0
  3989. dir: dir isR
  3990. |\-/560: O: O1120 (predict-no)
  3991. I see 1 and I'm going to do: predict-no
  3992. ENV: Agent did: predict-no for direction R in state State-B
  3993. In State-B moving R
  3994. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3995. predict error 0
  3996. dir: dir isR
  3997. |\-561: O: O1122 (predict-no)
  3998. I see 1 and I'm going to do: predict-no
  3999. ENV: Agent did: predict-no for direction R in state State-B
  4000. In State-B moving R
  4001. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4002. predict error 0
  4003. dir: dir isU
  4004. /562: O: O1124 (predict-no)
  4005. I see 1 and I'm going to do: predict-no
  4006. ENV: Agent did: predict-no for direction U in state State-B
  4007. In State-B moving U
  4008. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4009. predict error 0
  4010. dir: dir isU
  4011. |\-563: O: O1126 (predict-no)
  4012. I see 1 and I'm going to do: predict-no
  4013. ENV: Agent did: predict-no for direction U in state State-B
  4014. In State-B moving U
  4015. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4016. predict error 0
  4017. dir: dir isL
  4018. /|\564: O: O1127 (predict-yes)
  4019. I see 1 and I'm going to do: predict-yes
  4020. ENV: Agent did: predict-yes for direction L in state State-B
  4021. In State-B moving L
  4022. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4023. predict error 0
  4024. dir: dir isR
  4025. -/|565: O: O1129 (predict-yes)
  4026. I see 1 and I'm going to do: predict-yes
  4027. ENV: Agent did: predict-yes for direction R in state State-A
  4028. In State-A moving R
  4029. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4030. predict error 0
  4031. dir: dir isU
  4032. \-566: O: O1132 (predict-no)
  4033. I see 1 and I'm going to do: predict-no
  4034. ENV: Agent did: predict-no for direction U in state State-B
  4035. In State-B moving U
  4036. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4037. predict error 0
  4038. dir: dir isL
  4039. /|\567: O: O1133 (predict-yes)
  4040. I see 1 and I'm going to do: predict-yes
  4041. ENV: Agent did: predict-yes for direction L in state State-B
  4042. In State-B moving L
  4043. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4044. predict error 0
  4045. dir: dir isL
  4046. -/568: O: O1136 (predict-no)
  4047. I see 1 and I'm going to do: predict-no
  4048. ENV: Agent did: predict-no for direction L in state State-A
  4049. In State-A moving L
  4050. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4051. predict error 0
  4052. dir: dir isL
  4053. |\569: O: O1138 (predict-no)
  4054. I see 1 and I'm going to do: predict-no
  4055. ENV: Agent did: predict-no for direction L in state State-A
  4056. In State-A moving L
  4057. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4058. predict error 0
  4059. dir: dir isU
  4060. -/|570: O: O1140 (predict-no)
  4061. I see 1 and I'm going to do: predict-no
  4062. ENV: Agent did: predict-no for direction U in state State-A
  4063. In State-A moving U
  4064. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4065. predict error 0
  4066. dir: dir isU
  4067. \-/571: O: O1142 (predict-no)
  4068. I see 1 and I'm going to do: predict-no
  4069. ENV: Agent did: predict-no for direction U in state State-A
  4070. In State-A moving U
  4071. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4072. predict error 0
  4073. dir: dir isU
  4074. |572: O: O1144 (predict-no)
  4075. I see 1 and I'm going to do: predict-no
  4076. ENV: Agent did: predict-no for direction U in state State-A
  4077. In State-A moving U
  4078. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4079. predict error 0
  4080. dir: dir isR
  4081. \-/573: O: O1145 (predict-yes)
  4082. I see 1 and I'm going to do: predict-yes
  4083. ENV: Agent did: predict-yes for direction R in state State-A
  4084. In State-A moving R
  4085. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4086. predict error 0
  4087. dir: dir isL
  4088. |\-574: O: O1147 (predict-yes)
  4089. I see 1 and I'm going to do: predict-yes
  4090. ENV: Agent did: predict-yes for direction L in state State-B
  4091. In State-B moving L
  4092. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4093. predict error 0
  4094. dir: dir isL
  4095. /|\575: O: O1150 (predict-no)
  4096. I see 1 and I'm going to do: predict-no
  4097. ENV: Agent did: predict-no for direction L in state State-A
  4098. In State-A moving L
  4099. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4100. predict error 0
  4101. dir: dir isU
  4102. -/|576: O: O1152 (predict-no)
  4103. I see 1 and I'm going to do: predict-no
  4104. ENV: Agent did: predict-no for direction U in state State-A
  4105. In State-A moving U
  4106. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4107. predict error 0
  4108. dir: dir isR
  4109. \-/|577: O: O1153 (predict-yes)
  4110. I see 1 and I'm going to do: predict-yes
  4111. ENV: Agent did: predict-yes for direction R in state State-A
  4112. In State-A moving R
  4113. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4114. predict error 0
  4115. dir: dir isL
  4116. \-/|578: O: O1155 (predict-yes)
  4117. I see 1 and I'm going to do: predict-yes
  4118. ENV: Agent did: predict-yes for direction L in state State-B
  4119. In State-B moving L
  4120. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4121. predict error 0
  4122. dir: dir isR
  4123. \-/579: O: O1157 (predict-yes)
  4124. I see 1 and I'm going to do: predict-yes
  4125. ENV: Agent did: predict-yes for direction R in state State-A
  4126. In State-A moving R
  4127. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4128. predict error 0
  4129. dir: dir isR
  4130. |\580: O: O1160 (predict-no)
  4131. I see 1 and I'm going to do: predict-no
  4132. ENV: Agent did: predict-no for direction R in state State-B
  4133. In State-B moving R
  4134. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4135. predict error 0
  4136. dir: dir isR
  4137. -/581: O: O1162 (predict-no)
  4138. I see 1 and I'm going to do: predict-no
  4139. ENV: Agent did: predict-no for direction R in state State-B
  4140. In State-B moving R
  4141. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4142. predict error 0
  4143. dir: dir isR
  4144. |582: O: O1164 (predict-no)
  4145. I see 1 and I'm going to do: predict-no
  4146. ENV: Agent did: predict-no for direction R in state State-B
  4147. In State-B moving R
  4148. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4149. predict error 0
  4150. dir: dir isL
  4151. \-583: O: O1165 (predict-yes)
  4152. I see 1 and I'm going to do: predict-yes
  4153. ENV: Agent did: predict-yes for direction L in state State-B
  4154. In State-B moving L
  4155. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4156. predict error 0
  4157. dir: dir isL
  4158. /|\584: O: O1168 (predict-no)
  4159. I see 1 and I'm going to do: predict-no
  4160. ENV: Agent did: predict-no for direction L in state State-A
  4161. In State-A moving L
  4162. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4163. predict error 0
  4164. dir: dir isU
  4165. -/585: O: O1170 (predict-no)
  4166. I see 1 and I'm going to do: predict-no
  4167. ENV: Agent did: predict-no for direction U in state State-A
  4168. In State-A moving U
  4169. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4170. predict error 0
  4171. dir: dir isU
  4172. |\-586: O: O1172 (predict-no)
  4173. I see 1 and I'm going to do: predict-no
  4174. ENV: Agent did: predict-no for direction U in state State-A
  4175. In State-A moving U
  4176. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4177. predict error 0
  4178. dir: dir isR
  4179. /|587: O: O1173 (predict-yes)
  4180. I see 1 and I'm going to do: predict-yes
  4181. ENV: Agent did: predict-yes for direction R in state State-A
  4182. In State-A moving R
  4183. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4184. predict error 0
  4185. dir: dir isL
  4186. \-/588: O: O1175 (predict-yes)
  4187. I see 1 and I'm going to do: predict-yes
  4188. ENV: Agent did: predict-yes for direction L in state State-B
  4189. In State-B moving L
  4190. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4191. predict error 0
  4192. dir: dir isL
  4193. |\-589: O: O1178 (predict-no)
  4194. I see 1 and I'm going to do: predict-no
  4195. ENV: Agent did: predict-no for direction L in state State-A
  4196. In State-A moving L
  4197. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4198. predict error 0
  4199. dir: dir isR
  4200. /|\590: O: O1179 (predict-yes)
  4201. I see 1 and I'm going to do: predict-yes
  4202. ENV: Agent did: predict-yes for direction R in state State-A
  4203. In State-A moving R
  4204. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4205. predict error 0
  4206. dir: dir isU
  4207. -/|591: O: O1182 (predict-no)
  4208. I see 1 and I'm going to do: predict-no
  4209. ENV: Agent did: predict-no for direction U in state State-B
  4210. In State-B moving U
  4211. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4212. predict error 0
  4213. dir: dir isR
  4214. \592: O: O1184 (predict-no)
  4215. I see 1 and I'm going to do: predict-no
  4216. ENV: Agent did: predict-no for direction R in state State-B
  4217. In State-B moving R
  4218. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4219. predict error 0
  4220. dir: dir isU
  4221. -/593: O: O1186 (predict-no)
  4222. I see 1 and I'm going to do: predict-no
  4223. ENV: Agent did: predict-no for direction U in state State-B
  4224. In State-B moving U
  4225. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4226. predict error 0
  4227. dir: dir isR
  4228. |\-594: O: O1188 (predict-no)
  4229. I see 1 and I'm going to do: predict-no
  4230. ENV: Agent did: predict-no for direction R in state State-B
  4231. In State-B moving R
  4232. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4233. predict error 0
  4234. dir: dir isL
  4235. /|\-595: O: O1189 (predict-yes)
  4236. I see 1 and I'm going to do: predict-yes
  4237. ENV: Agent did: predict-yes for direction L in state State-B
  4238. In State-B moving L
  4239. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4240. predict error 0
  4241. dir: dir isR
  4242. /|\596: O: O1191 (predict-yes)
  4243. I see 1 and I'm going to do: predict-yes
  4244. ENV: Agent did: predict-yes for direction R in state State-A
  4245. In State-A moving R
  4246. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4247. predict error 0
  4248. dir: dir isR
  4249. -/|597: O: O1194 (predict-no)
  4250. I see 1 and I'm going to do: predict-no
  4251. ENV: Agent did: predict-no for direction R in state State-B
  4252. In State-B moving R
  4253. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4254. predict error 0
  4255. dir: dir isR
  4256. \-598: O: O1196 (predict-no)
  4257. I see 1 and I'm going to do: predict-no
  4258. ENV: Agent did: predict-no for direction R in state State-B
  4259. In State-B moving R
  4260. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4261. predict error 0
  4262. dir: dir isU
  4263. /|\599: O: O1198 (predict-no)
  4264. I see 1 and I'm going to do: predict-no
  4265. ENV: Agent did: predict-no for direction U in state State-B
  4266. In State-B moving U
  4267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4268. predict error 0
  4269. dir: dir isL
  4270. -/|600: O: O1199 (predict-yes)
  4271. I see 1 and I'm going to do: predict-yes
  4272. ENV: Agent did: predict-yes for direction L in state State-B
  4273. In State-B moving L
  4274. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4275. predict error 0
  4276. dir: dir isR
  4277. \-/601: O: O1201 (predict-yes)
  4278. I see 1 and I'm going to do: predict-yes
  4279. ENV: Agent did: predict-yes for direction R in state State-A
  4280. In State-A moving R
  4281. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4282. predict error 0
  4283. dir: dir isU
  4284. |602: O: O1204 (predict-no)
  4285. I see 1 and I'm going to do: predict-no
  4286. ENV: Agent did: predict-no for direction U in state State-B
  4287. In State-B moving U
  4288. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4289. predict error 0
  4290. dir: dir isL
  4291. \-/603: O: O1205 (predict-yes)
  4292. I see 1 and I'm going to do: predict-yes
  4293. ENV: Agent did: predict-yes for direction L in state State-B
  4294. In State-B moving L
  4295. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4296. predict error 0
  4297. dir: dir isL
  4298. |\604: O: O1208 (predict-no)
  4299. I see 1 and I'm going to do: predict-no
  4300. ENV: Agent did: predict-no for direction L in state State-A
  4301. In State-A moving L
  4302. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4303. predict error 0
  4304. dir: dir isL
  4305. -/|605: O: O1210 (predict-no)
  4306. I see 1 and I'm going to do: predict-no
  4307. ENV: Agent did: predict-no for direction L in state State-A
  4308. In State-A moving L
  4309. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4310. predict error 0
  4311. dir: dir isL
  4312. \-606: O: O1212 (predict-no)
  4313. I see 1 and I'm going to do: predict-no
  4314. ENV: Agent did: predict-no for direction L in state State-A
  4315. In State-A moving L
  4316. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4317. predict error 0
  4318. dir: dir isR
  4319. /|\-607: O: O1213 (predict-yes)
  4320. I see 1 and I'm going to do: predict-yes
  4321. ENV: Agent did: predict-yes for direction R in state State-A
  4322. In State-A moving R
  4323. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4324. predict error 0
  4325. dir: dir isL
  4326. /|\608: O: O1215 (predict-yes)
  4327. I see 1 and I'm going to do: predict-yes
  4328. ENV: Agent did: predict-yes for direction L in state State-B
  4329. In State-B moving L
  4330. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4331. predict error 0
  4332. dir: dir isR
  4333. -/609: O: O1217 (predict-yes)
  4334. I see 1 and I'm going to do: predict-yes
  4335. ENV: Agent did: predict-yes for direction R in state State-A
  4336. In State-A moving R
  4337. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4338. predict error 0
  4339. dir: dir isL
  4340. |\610: O: O1219 (predict-yes)
  4341. I see 1 and I'm going to do: predict-yes
  4342. ENV: Agent did: predict-yes for direction L in state State-B
  4343. In State-B moving L
  4344. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4345. predict error 0
  4346. dir: dir isR
  4347. -/|\611: O: O1221 (predict-yes)
  4348. I see 1 and I'm going to do: predict-yes
  4349. ENV: Agent did: predict-yes for direction R in state State-A
  4350. In State-A moving R
  4351. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4352. predict error 0
  4353. dir: dir isU
  4354. -612: O: O1224 (predict-no)
  4355. I see 1 and I'm going to do: predict-no
  4356. ENV: Agent did: predict-no for direction U in state State-B
  4357. In State-B moving U
  4358. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4359. predict error 0
  4360. dir: dir isR
  4361. /|\-613: O: O1226 (predict-no)
  4362. I see 1 and I'm going to do: predict-no
  4363. ENV: Agent did: predict-no for direction R in state State-B
  4364. In State-B moving R
  4365. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4366. predict error 0
  4367. dir: dir isU
  4368. /|\614: O: O1228 (predict-no)
  4369. I see 1 and I'm going to do: predict-no
  4370. ENV: Agent did: predict-no for direction U in state State-B
  4371. In State-B moving U
  4372. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4373. predict error 0
  4374. dir: dir isU
  4375. -/|615: O: O1230 (predict-no)
  4376. I see 1 and I'm going to do: predict-no
  4377. ENV: Agent did: predict-no for direction U in state State-B
  4378. In State-B moving U
  4379. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4380. predict error 0
  4381. dir: dir isL
  4382. \-616: O: O1231 (predict-yes)
  4383. I see 1 and I'm going to do: predict-yes
  4384. ENV: Agent did: predict-yes for direction L in state State-B
  4385. In State-B moving L
  4386. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4387. predict error 0
  4388. dir: dir isR
  4389. /|\-617: O: O1233 (predict-yes)
  4390. I see 1 and I'm going to do: predict-yes
  4391. ENV: Agent did: predict-yes for direction R in state State-A
  4392. In State-A moving R
  4393. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4394. predict error 0
  4395. dir: dir isR
  4396. /|\618: O: O1236 (predict-no)
  4397. I see 1 and I'm going to do: predict-no
  4398. ENV: Agent did: predict-no for direction R in state State-B
  4399. In State-B moving R
  4400. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4401. predict error 0
  4402. dir: dir isR
  4403. -/619: O: O1238 (predict-no)
  4404. I see 1 and I'm going to do: predict-no
  4405. ENV: Agent did: predict-no for direction R in state State-B
  4406. In State-B moving R
  4407. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4408. predict error 0
  4409. dir: dir isR
  4410. |\-620: O: O1240 (predict-no)
  4411. I see 1 and I'm going to do: predict-no
  4412. ENV: Agent did: predict-no for direction R in state State-B
  4413. In State-B moving R
  4414. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4415. predict error 0
  4416. dir: dir isU
  4417. /|\621: O: O1242 (predict-no)
  4418. I see 1 and I'm going to do: predict-no
  4419. ENV: Agent did: predict-no for direction U in state State-B
  4420. In State-B moving U
  4421. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4422. predict error 0
  4423. dir: dir isL
  4424. -622: O: O1243 (predict-yes)
  4425. I see 1 and I'm going to do: predict-yes
  4426. ENV: Agent did: predict-yes for direction L in state State-B
  4427. In State-B moving L
  4428. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4429. predict error 0
  4430. dir: dir isR
  4431. /|623: O: O1245 (predict-yes)
  4432. I see 1 and I'm going to do: predict-yes
  4433. ENV: Agent did: predict-yes for direction R in state State-A
  4434. In State-A moving R
  4435. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4436. predict error 0
  4437. dir: dir isL
  4438. \-/624: O: O1247 (predict-yes)
  4439. I see 1 and I'm going to do: predict-yes
  4440. ENV: Agent did: predict-yes for direction L in state State-B
  4441. In State-B moving L
  4442. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4443. predict error 0
  4444. dir: dir isU
  4445. |\-/625: O: O1250 (predict-no)
  4446. I see 1 and I'm going to do: predict-no
  4447. ENV: Agent did: predict-no for direction U in state State-A
  4448. In State-A moving U
  4449. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4450. predict error 0
  4451. dir: dir isR
  4452. |\626: O: O1251 (predict-yes)
  4453. I see 1 and I'm going to do: predict-yes
  4454. ENV: Agent did: predict-yes for direction R in state State-A
  4455. In State-A moving R
  4456. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4457. predict error 0
  4458. dir: dir isU
  4459. -/|627: O: O1254 (predict-no)
  4460. I see 1 and I'm going to do: predict-no
  4461. ENV: Agent did: predict-no for direction U in state State-B
  4462. In State-B moving U
  4463. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4464. predict error 0
  4465. dir: dir isL
  4466. \-/|628: O: O1255 (predict-yes)
  4467. I see 1 and I'm going to do: predict-yes
  4468. ENV: Agent did: predict-yes for direction L in state State-B
  4469. In State-B moving L
  4470. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4471. predict error 0
  4472. dir: dir isU
  4473. \-/629: O: O1258 (predict-no)
  4474. I see 1 and I'm going to do: predict-no
  4475. ENV: Agent did: predict-no for direction U in state State-A
  4476. In State-A moving U
  4477. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4478. predict error 0
  4479. dir: dir isR
  4480. |\-630: O: O1259 (predict-yes)
  4481. I see 1 and I'm going to do: predict-yes
  4482. ENV: Agent did: predict-yes for direction R in state State-A
  4483. In State-A moving R
  4484. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4485. predict error 0
  4486. dir: dir isL
  4487. /|\631: O: O1261 (predict-yes)
  4488. I see 1 and I'm going to do: predict-yes
  4489. ENV: Agent did: predict-yes for direction L in state State-B
  4490. In State-B moving L
  4491. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4492. predict error 0
  4493. dir: dir isU
  4494. -632: O: O1264 (predict-no)
  4495. I see 1 and I'm going to do: predict-no
  4496. ENV: Agent did: predict-no for direction U in state State-A
  4497. In State-A moving U
  4498. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4499. predict error 0
  4500. dir: dir isU
  4501. /|\633: O: O1266 (predict-no)
  4502. I see 1 and I'm going to do: predict-no
  4503. ENV: Agent did: predict-no for direction U in state State-A
  4504. In State-A moving U
  4505. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4506. predict error 0
  4507. dir: dir isL
  4508. -/|634: O: O1268 (predict-no)
  4509. I see 1 and I'm going to do: predict-no
  4510. ENV: Agent did: predict-no for direction L in state State-A
  4511. In State-A moving L
  4512. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4513. predict error 0
  4514. dir: dir isU
  4515. \-/635: O: O1270 (predict-no)
  4516. I see 1 and I'm going to do: predict-no
  4517. ENV: Agent did: predict-no for direction U in state State-A
  4518. In State-A moving U
  4519. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4520. predict error 0
  4521. dir: dir isU
  4522. |\-636: O: O1272 (predict-no)
  4523. I see 1 and I'm going to do: predict-no
  4524. ENV: Agent did: predict-no for direction U in state State-A
  4525. In State-A moving U
  4526. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4527. predict error 0
  4528. dir: dir isR
  4529. /|637: O: O1273 (predict-yes)
  4530. I see 1 and I'm going to do: predict-yes
  4531. ENV: Agent did: predict-yes for direction R in state State-A
  4532. In State-A moving R
  4533. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4534. predict error 0
  4535. dir: dir isR
  4536. \-/638: O: O1276 (predict-no)
  4537. I see 1 and I'm going to do: predict-no
  4538. ENV: Agent did: predict-no for direction R in state State-B
  4539. In State-B moving R
  4540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4541. predict error 0
  4542. dir: dir isL
  4543. |\-/639: O: O1277 (predict-yes)
  4544. I see 1 and I'm going to do: predict-yes
  4545. ENV: Agent did: predict-yes for direction L in state State-B
  4546. In State-B moving L
  4547. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4548. predict error 0
  4549. dir: dir isL
  4550. |\-640: O: O1280 (predict-no)
  4551. I see 1 and I'm going to do: predict-no
  4552. ENV: Agent did: predict-no for direction L in state State-A
  4553. In State-A moving L
  4554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4555. predict error 0
  4556. dir: dir isR
  4557. /|641: O: O1281 (predict-yes)
  4558. I see 1 and I'm going to do: predict-yes
  4559. ENV: Agent did: predict-yes for direction R in state State-A
  4560. In State-A moving R
  4561. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4562. predict error 0
  4563. dir: dir isU
  4564. \642: O: O1284 (predict-no)
  4565. I see 1 and I'm going to do: predict-no
  4566. ENV: Agent did: predict-no for direction U in state State-B
  4567. In State-B moving U
  4568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4569. predict error 0
  4570. dir: dir isL
  4571. -/|643: O: O1285 (predict-yes)
  4572. I see 1 and I'm going to do: predict-yes
  4573. ENV: Agent did: predict-yes for direction L in state State-B
  4574. In State-B moving L
  4575. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4576. predict error 0
  4577. dir: dir isR
  4578. \-644: O: O1287 (predict-yes)
  4579. I see 1 and I'm going to do: predict-yes
  4580. ENV: Agent did: predict-yes for direction R in state State-A
  4581. In State-A moving R
  4582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4583. predict error 0
  4584. dir: dir isL
  4585. /|\645: O: O1289 (predict-yes)
  4586. I see 1 and I'm going to do: predict-yes
  4587. ENV: Agent did: predict-yes for direction L in state State-B
  4588. In State-B moving L
  4589. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4590. predict error 0
  4591. dir: dir isU
  4592. -/|\646: O: O1292 (predict-no)
  4593. I see 1 and I'm going to do: predict-no
  4594. ENV: Agent did: predict-no for direction U in state State-A
  4595. In State-A moving U
  4596. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4597. predict error 0
  4598. dir: dir isU
  4599. -/647: O: O1294 (predict-no)
  4600. I see 1 and I'm going to do: predict-no
  4601. ENV: Agent did: predict-no for direction U in state State-A
  4602. In State-A moving U
  4603. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4604. predict error 0
  4605. dir: dir isU
  4606. |\-/648: O: O1296 (predict-no)
  4607. I see 1 and I'm going to do: predict-no
  4608. ENV: Agent did: predict-no for direction U in state State-A
  4609. In State-A moving U
  4610. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4611. predict error 0
  4612. dir: dir isL
  4613. |\649: O: O1298 (predict-no)
  4614. I see 1 and I'm going to do: predict-no
  4615. ENV: Agent did: predict-no for direction L in state State-A
  4616. In State-A moving L
  4617. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4618. predict error 0
  4619. dir: dir isR
  4620. -/|\650: O: O1299 (predict-yes)
  4621. I see 1 and I'm going to do: predict-yes
  4622. ENV: Agent did: predict-yes for direction R in state State-A
  4623. In State-A moving R
  4624. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4625. predict error 0
  4626. dir: dir isL
  4627. -/651: O: O1301 (predict-yes)
  4628. I see 1 and I'm going to do: predict-yes
  4629. ENV: Agent did: predict-yes for direction L in state State-B
  4630. In State-B moving L
  4631. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4632. predict error 0
  4633. dir: dir isR
  4634. |652: O: O1303 (predict-yes)
  4635. I see 1 and I'm going to do: predict-yes
  4636. ENV: Agent did: predict-yes for direction R in state State-A
  4637. In State-A moving R
  4638. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4639. predict error 0
  4640. dir: dir isU
  4641. \-653: O: O1306 (predict-no)
  4642. I see 1 and I'm going to do: predict-no
  4643. ENV: Agent did: predict-no for direction U in state State-B
  4644. In State-B moving U
  4645. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4646. predict error 0
  4647. dir: dir isU
  4648. /|654: O: O1308 (predict-no)
  4649. I see 1 and I'm going to do: predict-no
  4650. ENV: Agent did: predict-no for direction U in state State-B
  4651. In State-B moving U
  4652. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4653. predict error 0
  4654. dir: dir isL
  4655. \-655: O: O1309 (predict-yes)
  4656. I see 1 and I'm going to do: predict-yes
  4657. ENV: Agent did: predict-yes for direction L in state State-B
  4658. In State-B moving L
  4659. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4660. predict error 0
  4661. dir: dir isR
  4662. /|\656: O: O1311 (predict-yes)
  4663. I see 1 and I'm going to do: predict-yes
  4664. ENV: Agent did: predict-yes for direction R in state State-A
  4665. In State-A moving R
  4666. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4667. predict error 0
  4668. dir: dir isL
  4669. -/657: O: O1313 (predict-yes)
  4670. I see 1 and I'm going to do: predict-yes
  4671. ENV: Agent did: predict-yes for direction L in state State-B
  4672. In State-B moving L
  4673. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4674. predict error 0
  4675. dir: dir isR
  4676. |\-658: O: O1315 (predict-yes)
  4677. I see 1 and I'm going to do: predict-yes
  4678. ENV: Agent did: predict-yes for direction R in state State-A
  4679. In State-A moving R
  4680. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4681. predict error 0
  4682. dir: dir isU
  4683. /|\659: O: O1318 (predict-no)
  4684. I see 1 and I'm going to do: predict-no
  4685. ENV: Agent did: predict-no for direction U in state State-B
  4686. In State-B moving U
  4687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4688. predict error 0
  4689. dir: dir isL
  4690. -/|660: O: O1319 (predict-yes)
  4691. I see 1 and I'm going to do: predict-yes
  4692. ENV: Agent did: predict-yes for direction L in state State-B
  4693. In State-B moving L
  4694. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4695. predict error 0
  4696. dir: dir isU
  4697. \-/661: O: O1322 (predict-no)
  4698. I see 1 and I'm going to do: predict-no
  4699. ENV: Agent did: predict-no for direction U in state State-A
  4700. In State-A moving U
  4701. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4702. predict error 0
  4703. dir: dir isU
  4704. |662: O: O1324 (predict-no)
  4705. I see 1 and I'm going to do: predict-no
  4706. ENV: Agent did: predict-no for direction U in state State-A
  4707. In State-A moving U
  4708. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4709. predict error 0
  4710. dir: dir isU
  4711. \-/663: O: O1326 (predict-no)
  4712. I see 1 and I'm going to do: predict-no
  4713. ENV: Agent did: predict-no for direction U in state State-A
  4714. In State-A moving U
  4715. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4716. predict error 0
  4717. dir: dir isL
  4718. |\664: O: O1328 (predict-no)
  4719. I see 1 and I'm going to do: predict-no
  4720. ENV: Agent did: predict-no for direction L in state State-A
  4721. In State-A moving L
  4722. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4723. predict error 0
  4724. dir: dir isU
  4725. -665: O: O1330 (predict-no)
  4726. I see 1 and I'm going to do: predict-no
  4727. ENV: Agent did: predict-no for direction U in state State-A
  4728. In State-A moving U
  4729. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4730. predict error 0
  4731. dir: dir isU
  4732. /|666: O: O1332 (predict-no)
  4733. I see 1 and I'm going to do: predict-no
  4734. ENV: Agent did: predict-no for direction U in state State-A
  4735. In State-A moving U
  4736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4737. predict error 0
  4738. dir: dir isU
  4739. \-/667: O: O1334 (predict-no)
  4740. I see 1 and I'm going to do: predict-no
  4741. ENV: Agent did: predict-no for direction U in state State-A
  4742. In State-A moving U
  4743. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4744. predict error 0
  4745. dir: dir isL
  4746. |\-668: O: O1336 (predict-no)
  4747. I see 1 and I'm going to do: predict-no
  4748. ENV: Agent did: predict-no for direction L in state State-A
  4749. In State-A moving L
  4750. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4751. predict error 0
  4752. dir: dir isL
  4753. /|669: O: O1338 (predict-no)
  4754. I see 1 and I'm going to do: predict-no
  4755. ENV: Agent did: predict-no for direction L in state State-A
  4756. In State-A moving L
  4757. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4758. predict error 0
  4759. dir: dir isL
  4760. \-/|670: O: O1340 (predict-no)
  4761. I see 1 and I'm going to do: predict-no
  4762. ENV: Agent did: predict-no for direction L in state State-A
  4763. In State-A moving L
  4764. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4765. predict error 0
  4766. dir: dir isR
  4767. \-/671: O: O1341 (predict-yes)
  4768. I see 1 and I'm going to do: predict-yes
  4769. ENV: Agent did: predict-yes for direction R in state State-A
  4770. In State-A moving R
  4771. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4772. predict error 0
  4773. dir: dir isR
  4774. |672: O: O1344 (predict-no)
  4775. I see 1 and I'm going to do: predict-no
  4776. ENV: Agent did: predict-no for direction R in state State-B
  4777. In State-B moving R
  4778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4779. predict error 0
  4780. dir: dir isL
  4781. \-673: O: O1345 (predict-yes)
  4782. I see 1 and I'm going to do: predict-yes
  4783. ENV: Agent did: predict-yes for direction L in state State-B
  4784. In State-B moving L
  4785. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4786. predict error 0
  4787. dir: dir isR
  4788. /|\674: O: O1347 (predict-yes)
  4789. I see 1 and I'm going to do: predict-yes
  4790. ENV: Agent did: predict-yes for direction R in state State-A
  4791. In State-A moving R
  4792. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4793. predict error 0
  4794. dir: dir isU
  4795. -/|675: O: O1350 (predict-no)
  4796. I see 1 and I'm going to do: predict-no
  4797. ENV: Agent did: predict-no for direction U in state State-B
  4798. In State-B moving U
  4799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4800. predict error 0
  4801. dir: dir isR
  4802. \-/676: O: O1352 (predict-no)
  4803. I see 1 and I'm going to do: predict-no
  4804. ENV: Agent did: predict-no for direction R in state State-B
  4805. In State-B moving R
  4806. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4807. predict error 0
  4808. dir: dir isR
  4809. |\-677: O: O1354 (predict-no)
  4810. I see 1 and I'm going to do: predict-no
  4811. ENV: Agent did: predict-no for direction R in state State-B
  4812. In State-B moving R
  4813. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4814. predict error 0
  4815. dir: dir isR
  4816. /|\678: O: O1356 (predict-no)
  4817. I see 1 and I'm going to do: predict-no
  4818. ENV: Agent did: predict-no for direction R in state State-B
  4819. In State-B moving R
  4820. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4821. predict error 0
  4822. dir: dir isU
  4823. -/679: O: O1358 (predict-no)
  4824. I see 1 and I'm going to do: predict-no
  4825. ENV: Agent did: predict-no for direction U in state State-B
  4826. In State-B moving U
  4827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4828. predict error 0
  4829. dir: dir isL
  4830. |\-/680: O: O1359 (predict-yes)
  4831. I see 1 and I'm going to do: predict-yes
  4832. ENV: Agent did: predict-yes for direction L in state State-B
  4833. In State-B moving L
  4834. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4835. predict error 0
  4836. dir: dir isR
  4837. |\681: O: O1361 (predict-yes)
  4838. I see 1 and I'm going to do: predict-yes
  4839. ENV: Agent did: predict-yes for direction R in state State-A
  4840. In State-A moving R
  4841. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4842. predict error 0
  4843. dir: dir isL
  4844. -682: O: O1363 (predict-yes)
  4845. I see 1 and I'm going to do: predict-yes
  4846. ENV: Agent did: predict-yes for direction L in state State-B
  4847. In State-B moving L
  4848. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4849. predict error 0
  4850. dir: dir isL
  4851. /|\-683: O: O1366 (predict-no)
  4852. I see 1 and I'm going to do: predict-no
  4853. ENV: Agent did: predict-no for direction L in state State-A
  4854. In State-A moving L
  4855. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4856. predict error 0
  4857. dir: dir isU
  4858. /|\684: O: O1368 (predict-no)
  4859. I see 1 and I'm going to do: predict-no
  4860. ENV: Agent did: predict-no for direction U in state State-A
  4861. In State-A moving U
  4862. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4863. predict error 0
  4864. dir: dir isL
  4865. -/|685: O: O1370 (predict-no)
  4866. I see 1 and I'm going to do: predict-no
  4867. ENV: Agent did: predict-no for direction L in state State-A
  4868. In State-A moving L
  4869. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4870. predict error 0
  4871. dir: dir isR
  4872. \-/686: O: O1371 (predict-yes)
  4873. I see 1 and I'm going to do: predict-yes
  4874. ENV: Agent did: predict-yes for direction R in state State-A
  4875. In State-A moving R
  4876. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4877. predict error 0
  4878. dir: dir isL
  4879. |\-687: O: O1373 (predict-yes)
  4880. I see 1 and I'm going to do: predict-yes
  4881. ENV: Agent did: predict-yes for direction L in state State-B
  4882. In State-B moving L
  4883. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4884. predict error 0
  4885. dir: dir isR
  4886. /|688: O: O1375 (predict-yes)
  4887. I see 1 and I'm going to do: predict-yes
  4888. ENV: Agent did: predict-yes for direction R in state State-A
  4889. In State-A moving R
  4890. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4891. predict error 0
  4892. dir: dir isL
  4893. \-689: O: O1377 (predict-yes)
  4894. I see 1 and I'm going to do: predict-yes
  4895. ENV: Agent did: predict-yes for direction L in state State-B
  4896. In State-B moving L
  4897. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4898. predict error 0
  4899. dir: dir isR
  4900. /|\690: O: O1379 (predict-yes)
  4901. I see 1 and I'm going to do: predict-yes
  4902. ENV: Agent did: predict-yes for direction R in state State-A
  4903. In State-A moving R
  4904. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4905. predict error 0
  4906. dir: dir isL
  4907. -/|691: O: O1381 (predict-yes)
  4908. I see 1 and I'm going to do: predict-yes
  4909. ENV: Agent did: predict-yes for direction L in state State-B
  4910. In State-B moving L
  4911. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4912. predict error 0
  4913. dir: dir isU
  4914. \692: O: O1384 (predict-no)
  4915. I see 1 and I'm going to do: predict-no
  4916. ENV: Agent did: predict-no for direction U in state State-A
  4917. In State-A moving U
  4918. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4919. predict error 0
  4920. dir: dir isL
  4921. -/693: O: O1386 (predict-no)
  4922. I see 1 and I'm going to do: predict-no
  4923. ENV: Agent did: predict-no for direction L in state State-A
  4924. In State-A moving L
  4925. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4926. predict error 0
  4927. dir: dir isU
  4928. |\-694: O: O1388 (predict-no)
  4929. I see 1 and I'm going to do: predict-no
  4930. ENV: Agent did: predict-no for direction U in state State-A
  4931. In State-A moving U
  4932. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4933. predict error 0
  4934. dir: dir isR
  4935. /|\-695: O: O1389 (predict-yes)
  4936. I see 1 and I'm going to do: predict-yes
  4937. ENV: Agent did: predict-yes for direction R in state State-A
  4938. In State-A moving R
  4939. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4940. predict error 0
  4941. dir: dir isR
  4942. /|\696: O: O1392 (predict-no)
  4943. I see 1 and I'm going to do: predict-no
  4944. ENV: Agent did: predict-no for direction R in state State-B
  4945. In State-B moving R
  4946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4947. predict error 0
  4948. dir: dir isL
  4949. -/|697: O: O1393 (predict-yes)
  4950. I see 1 and I'm going to do: predict-yes
  4951. ENV: Agent did: predict-yes for direction L in state State-B
  4952. In State-B moving L
  4953. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4954. predict error 0
  4955. dir: dir isR
  4956. \-698: O: O1395 (predict-yes)
  4957. I see 1 and I'm going to do: predict-yes
  4958. ENV: Agent did: predict-yes for direction R in state State-A
  4959. In State-A moving R
  4960. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4961. predict error 0
  4962. dir: dir isU
  4963. /|\699: O: O1398 (predict-no)
  4964. I see 1 and I'm going to do: predict-no
  4965. ENV: Agent did: predict-no for direction U in state State-B
  4966. In State-B moving U
  4967. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4968. predict error 0
  4969. dir: dir isR
  4970. -/|\700: O: O1400 (predict-no)
  4971. I see 1 and I'm going to do: predict-no
  4972. ENV: Agent did: predict-no for direction R in state State-B
  4973. In State-B moving R
  4974. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4975. predict error 0
  4976. dir: dir isR
  4977. -/|701: O: O1402 (predict-no)
  4978. I see 1 and I'm going to do: predict-no
  4979. ENV: Agent did: predict-no for direction R in state State-B
  4980. In State-B moving R
  4981. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4982. predict error 0
  4983. dir: dir isL
  4984. \702: O: O1403 (predict-yes)
  4985. I see 1 and I'm going to do: predict-yes
  4986. ENV: Agent did: predict-yes for direction L in state State-B
  4987. In State-B moving L
  4988. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4989. predict error 0
  4990. dir: dir isR
  4991. -/703: O: O1405 (predict-yes)
  4992. I see 1 and I'm going to do: predict-yes
  4993. ENV: Agent did: predict-yes for direction R in state State-A
  4994. In State-A moving R
  4995. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4996. predict error 0
  4997. dir: dir isL
  4998. |\704: O: O1407 (predict-yes)
  4999. I see 1 and I'm going to do: predict-yes
  5000. ENV: Agent did: predict-yes for direction L in state State-B
  5001. In State-B moving L
  5002. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5003. predict error 0
  5004. dir: dir isR
  5005. -/|705: O: O1409 (predict-yes)
  5006. I see 1 and I'm going to do: predict-yes
  5007. ENV: Agent did: predict-yes for direction R in state State-A
  5008. In State-A moving R
  5009. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5010. predict error 0
  5011. dir: dir isR
  5012. \-/706: O: O1412 (predict-no)
  5013. I see 1 and I'm going to do: predict-no
  5014. ENV: Agent did: predict-no for direction R in state State-B
  5015. In State-B moving R
  5016. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5017. predict error 0
  5018. dir: dir isR
  5019. |\707: O: O1414 (predict-no)
  5020. I see 1 and I'm going to do: predict-no
  5021. ENV: Agent did: predict-no for direction R in state State-B
  5022. In State-B moving R
  5023. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5024. predict error 0
  5025. dir: dir isR
  5026. -/|708: O: O1416 (predict-no)
  5027. I see 1 and I'm going to do: predict-no
  5028. ENV: Agent did: predict-no for direction R in state State-B
  5029. In State-B moving R
  5030. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5031. predict error 0
  5032. dir: dir isR
  5033. \-/709: O: O1418 (predict-no)
  5034. I see 1 and I'm going to do: predict-no
  5035. ENV: Agent did: predict-no for direction R in state State-B
  5036. In State-B moving R
  5037. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5038. predict error 0
  5039. dir: dir isL
  5040. |\-710: O: O1419 (predict-yes)
  5041. I see 1 and I'm going to do: predict-yes
  5042. ENV: Agent did: predict-yes for direction L in state State-B
  5043. In State-B moving L
  5044. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5045. predict error 0
  5046. dir: dir isU
  5047. /|\-711: O: O1422 (predict-no)
  5048. I see 1 and I'm going to do: predict-no
  5049. ENV: Agent did: predict-no for direction U in state State-A
  5050. In State-A moving U
  5051. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5052. predict error 0
  5053. dir: dir isU
  5054. /712: O: O1424 (predict-no)
  5055. I see 1 and I'm going to do: predict-no
  5056. ENV: Agent did: predict-no for direction U in state State-A
  5057. In State-A moving U
  5058. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5059. predict error 0
  5060. dir: dir isU
  5061. |\-/713: O: O1426 (predict-no)
  5062. I see 1 and I'm going to do: predict-no
  5063. ENV: Agent did: predict-no for direction U in state State-A
  5064. In State-A moving U
  5065. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5066. predict error 0
  5067. dir: dir isU
  5068. |714: O: O1428 (predict-no)
  5069. I see 1 and I'm going to do: predict-no
  5070. ENV: Agent did: predict-no for direction U in state State-A
  5071. In State-A moving U
  5072. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5073. predict error 0
  5074. dir: dir isR
  5075. \-/|715: O: O1429 (predict-yes)
  5076. I see 1 and I'm going to do: predict-yes
  5077. ENV: Agent did: predict-yes for direction R in state State-A
  5078. In State-A moving R
  5079. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5080. predict error 0
  5081. dir: dir isL
  5082. \-/716: O: O1431 (predict-yes)
  5083. I see 1 and I'm going to do: predict-yes
  5084. ENV: Agent did: predict-yes for direction L in state State-B
  5085. In State-B moving L
  5086. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5087. predict error 0
  5088. dir: dir isL
  5089. |\-717: O: O1434 (predict-no)
  5090. I see 1 and I'm going to do: predict-no
  5091. ENV: Agent did: predict-no for direction L in state State-A
  5092. In State-A moving L
  5093. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5094. predict error 0
  5095. dir: dir isL
  5096. /|718: O: O1436 (predict-no)
  5097. I see 1 and I'm going to do: predict-no
  5098. ENV: Agent did: predict-no for direction L in state State-A
  5099. In State-A moving L
  5100. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5101. predict error 0
  5102. dir: dir isL
  5103. \-/719: O: O1438 (predict-no)
  5104. I see 1 and I'm going to do: predict-no
  5105. ENV: Agent did: predict-no for direction L in state State-A
  5106. In State-A moving L
  5107. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5108. predict error 0
  5109. dir: dir isL
  5110. |720: O: O1440 (predict-no)
  5111. I see 1 and I'm going to do: predict-no
  5112. ENV: Agent did: predict-no for direction L in state State-A
  5113. In State-A moving L
  5114. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5115. predict error 0
  5116. dir: dir isL
  5117. \-/721: O: O1442 (predict-no)
  5118. I see 1 and I'm going to do: predict-no
  5119. ENV: Agent did: predict-no for direction L in state State-A
  5120. In State-A moving L
  5121. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5122. predict error 0
  5123. dir: dir isR
  5124. |722: O: O1443 (predict-yes)
  5125. I see 1 and I'm going to do: predict-yes
  5126. ENV: Agent did: predict-yes for direction R in state State-A
  5127. In State-A moving R
  5128. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5129. predict error 0
  5130. dir: dir isL
  5131. \-723: O: O1445 (predict-yes)
  5132. I see 1 and I'm going to do: predict-yes
  5133. ENV: Agent did: predict-yes for direction L in state State-B
  5134. In State-B moving L
  5135. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5136. predict error 0
  5137. dir: dir isR
  5138. /|724: O: O1447 (predict-yes)
  5139. I see 1 and I'm going to do: predict-yes
  5140. ENV: Agent did: predict-yes for direction R in state State-A
  5141. In State-A moving R
  5142. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5143. predict error 0
  5144. dir: dir isR
  5145. \-/725: O: O1450 (predict-no)
  5146. I see 1 and I'm going to do: predict-no
  5147. ENV: Agent did: predict-no for direction R in state State-B
  5148. In State-B moving R
  5149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5150. predict error 0
  5151. dir: dir isL
  5152. |\726: O: O1451 (predict-yes)
  5153. I see 1 and I'm going to do: predict-yes
  5154. ENV: Agent did: predict-yes for direction L in state State-B
  5155. In State-B moving L
  5156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5157. predict error 0
  5158. dir: dir isU
  5159. -/|\727: O: O1454 (predict-no)
  5160. I see 1 and I'm going to do: predict-no
  5161. ENV: Agent did: predict-no for direction U in state State-A
  5162. In State-A moving U
  5163. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5164. predict error 0
  5165. dir: dir isU
  5166. -/|728: O: O1456 (predict-no)
  5167. I see 1 and I'm going to do: predict-no
  5168. ENV: Agent did: predict-no for direction U in state State-A
  5169. In State-A moving U
  5170. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5171. predict error 0
  5172. dir: dir isL
  5173. \-729: O: O1458 (predict-no)
  5174. I see 1 and I'm going to do: predict-no
  5175. ENV: Agent did: predict-no for direction L in state State-A
  5176. In State-A moving L
  5177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5178. predict error 0
  5179. dir: dir isU
  5180. /|730: O: O1460 (predict-no)
  5181. I see 1 and I'm going to do: predict-no
  5182. ENV: Agent did: predict-no for direction U in state State-A
  5183. In State-A moving U
  5184. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5185. predict error 0
  5186. dir: dir isL
  5187. \731: O: O1462 (predict-no)
  5188. I see 1 and I'm going to do: predict-no
  5189. ENV: Agent did: predict-no for direction L in state State-A
  5190. In State-A moving L
  5191. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5192. predict error 0
  5193. dir: dir isL
  5194. -732: O: O1464 (predict-no)
  5195. I see 1 and I'm going to do: predict-no
  5196. ENV: Agent did: predict-no for direction L in state State-A
  5197. In State-A moving L
  5198. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5199. predict error 0
  5200. dir: dir isU
  5201. /|733: O: O1466 (predict-no)
  5202. I see 1 and I'm going to do: predict-no
  5203. ENV: Agent did: predict-no for direction U in state State-A
  5204. In State-A moving U
  5205. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5206. predict error 0
  5207. dir: dir isL
  5208. \-/734: O: O1468 (predict-no)
  5209. I see 1 and I'm going to do: predict-no
  5210. ENV: Agent did: predict-no for direction L in state State-A
  5211. In State-A moving L
  5212. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5213. predict error 0
  5214. dir: dir isL
  5215. |\735: O: O1470 (predict-no)
  5216. I see 1 and I'm going to do: predict-no
  5217. ENV: Agent did: predict-no for direction L in state State-A
  5218. In State-A moving L
  5219. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5220. predict error 0
  5221. dir: dir isU
  5222. -/|736: O: O1472 (predict-no)
  5223. I see 1 and I'm going to do: predict-no
  5224. ENV: Agent did: predict-no for direction U in state State-A
  5225. In State-A moving U
  5226. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5227. predict error 0
  5228. dir: dir isL
  5229. \-/737: O: O1474 (predict-no)
  5230. I see 1 and I'm going to do: predict-no
  5231. ENV: Agent did: predict-no for direction L in state State-A
  5232. In State-A moving L
  5233. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5234. predict error 0
  5235. dir: dir isR
  5236. |\738: O: O1475 (predict-yes)
  5237. I see 1 and I'm going to do: predict-yes
  5238. ENV: Agent did: predict-yes for direction R in state State-A
  5239. In State-A moving R
  5240. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5241. predict error 0
  5242. dir: dir isR
  5243. -/739: O: O1478 (predict-no)
  5244. I see 1 and I'm going to do: predict-no
  5245. ENV: Agent did: predict-no for direction R in state State-B
  5246. In State-B moving R
  5247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5248. predict error 0
  5249. dir: dir isL
  5250. |\-/sleeping...
  5251. |740: O: O1479 (predict-yes)
  5252. I see 1 and I'm going to do: predict-yes
  5253. ENV: Agent did: predict-yes for direction L in state State-B
  5254. In State-B moving L
  5255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5256. predict error 0
  5257. dir: dir isL
  5258. \-741: O: O1482 (predict-no)
  5259. I see 1 and I'm going to do: predict-no
  5260. ENV: Agent did: predict-no for direction L in state State-A
  5261. In State-A moving L
  5262. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5263. predict error 0
  5264. dir: dir isU
  5265. /742: O: O1484 (predict-no)
  5266. I see 1 and I'm going to do: predict-no
  5267. ENV: Agent did: predict-no for direction U in state State-A
  5268. In State-A moving U
  5269. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5270. predict error 0
  5271. dir: dir isR
  5272. |\743: O: O1485 (predict-yes)
  5273. I see 1 and I'm going to do: predict-yes
  5274. ENV: Agent did: predict-yes for direction R in state State-A
  5275. In State-A moving R
  5276. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5277. predict error 0
  5278. dir: dir isL
  5279. -/|744: O: O1487 (predict-yes)
  5280. I see 1 and I'm going to do: predict-yes
  5281. ENV: Agent did: predict-yes for direction L in state State-B
  5282. In State-B moving L
  5283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5284. predict error 0
  5285. dir: dir isR
  5286. \-/|745: O: O1489 (predict-yes)
  5287. I see 1 and I'm going to do: predict-yes
  5288. ENV: Agent did: predict-yes for direction R in state State-A
  5289. In State-A moving R
  5290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5291. predict error 0
  5292. dir: dir isU
  5293. \-/746: O: O1492 (predict-no)
  5294. I see 1 and I'm going to do: predict-no
  5295. ENV: Agent did: predict-no for direction U in state State-B
  5296. In State-B moving U
  5297. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5298. predict error 0
  5299. dir: dir isL
  5300. |\747: O: O1493 (predict-yes)
  5301. I see 1 and I'm going to do: predict-yes
  5302. ENV: Agent did: predict-yes for direction L in state State-B
  5303. In State-B moving L
  5304. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5305. predict error 0
  5306. dir: dir isL
  5307. -/|\748: O: O1496 (predict-no)
  5308. I see 1 and I'm going to do: predict-no
  5309. ENV: Agent did: predict-no for direction L in state State-A
  5310. In State-A moving L
  5311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5312. predict error 0
  5313. dir: dir isU
  5314. -/749: O: O1498 (predict-no)
  5315. I see 1 and I'm going to do: predict-no
  5316. ENV: Agent did: predict-no for direction U in state State-A
  5317. In State-A moving U
  5318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5319. predict error 0
  5320. dir: dir isL
  5321. |\750: O: O1500 (predict-no)
  5322. I see 1 and I'm going to do: predict-no
  5323. ENV: Agent did: predict-no for direction L in state State-A
  5324. In State-A moving L
  5325. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5326. predict error 0
  5327. dir: dir isL
  5328. -/|\751: O: O1502 (predict-no)
  5329. I see 1 and I'm going to do: predict-no
  5330. ENV: Agent did: predict-no for direction L in state State-A
  5331. In State-A moving L
  5332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5333. predict error 0
  5334. dir: dir isL
  5335. -752: O: O1504 (predict-no)
  5336. I see 1 and I'm going to do: predict-no
  5337. ENV: Agent did: predict-no for direction L in state State-A
  5338. In State-A moving L
  5339. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5340. predict error 0
  5341. dir: dir isU
  5342. /|\753: O: O1506 (predict-no)
  5343. I see 1 and I'm going to do: predict-no
  5344. ENV: Agent did: predict-no for direction U in state State-A
  5345. In State-A moving U
  5346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5347. predict error 0
  5348. dir: dir isR
  5349. -/|754: O: O1507 (predict-yes)
  5350. I see 1 and I'm going to do: predict-yes
  5351. ENV: Agent did: predict-yes for direction R in state State-A
  5352. In State-A moving R
  5353. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5354. predict error 0
  5355. dir: dir isU
  5356. \-/755: O: O1510 (predict-no)
  5357. I see 1 and I'm going to do: predict-no
  5358. ENV: Agent did: predict-no for direction U in state State-B
  5359. In State-B moving U
  5360. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5361. predict error 0
  5362. dir: dir isR
  5363. |\-/756: O: O1512 (predict-no)
  5364. I see 1 and I'm going to do: predict-no
  5365. ENV: Agent did: predict-no for direction R in state State-B
  5366. In State-B moving R
  5367. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5368. predict error 0
  5369. dir: dir isU
  5370. |\757: O: O1514 (predict-no)
  5371. I see 1 and I'm going to do: predict-no
  5372. ENV: Agent did: predict-no for direction U in state State-B
  5373. In State-B moving U
  5374. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5375. predict error 0
  5376. dir: dir isR
  5377. -/|758: O: O1516 (predict-no)
  5378. I see 1 and I'm going to do: predict-no
  5379. ENV: Agent did: predict-no for direction R in state State-B
  5380. In State-B moving R
  5381. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5382. predict error 0
  5383. dir: dir isR
  5384. \-/759: O: O1518 (predict-no)
  5385. I see 1 and I'm going to do: predict-no
  5386. ENV: Agent did: predict-no for direction R in state State-B
  5387. In State-B moving R
  5388. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5389. predict error 0
  5390. dir: dir isU
  5391. |\760: O: O1520 (predict-no)
  5392. I see 1 and I'm going to do: predict-no
  5393. ENV: Agent did: predict-no for direction U in state State-B
  5394. In State-B moving U
  5395. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5396. predict error 0
  5397. dir: dir isL
  5398. -761: O: O1521 (predict-yes)
  5399. I see 1 and I'm going to do: predict-yes
  5400. ENV: Agent did: predict-yes for direction L in state State-B
  5401. In State-B moving L
  5402. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5403. predict error 0
  5404. dir: dir isR
  5405. /762: O: O1523 (predict-yes)
  5406. I see 1 and I'm going to do: predict-yes
  5407. ENV: Agent did: predict-yes for direction R in state State-A
  5408. In State-A moving R
  5409. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5410. predict error 0
  5411. dir: dir isU
  5412. |\-763: O: O1526 (predict-no)
  5413. I see 1 and I'm going to do: predict-no
  5414. ENV: Agent did: predict-no for direction U in state State-B
  5415. In State-B moving U
  5416. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5417. predict error 0
  5418. dir: dir isR
  5419. /|\-764: O: O1528 (predict-no)
  5420. I see 1 and I'm going to do: predict-no
  5421. ENV: Agent did: predict-no for direction R in state State-B
  5422. In State-B moving R
  5423. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5424. predict error 0
  5425. dir: dir isR
  5426. /|\765: O: O1530 (predict-no)
  5427. I see 1 and I'm going to do: predict-no
  5428. ENV: Agent did: predict-no for direction R in state State-B
  5429. In State-B moving R
  5430. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5431. predict error 0
  5432. dir: dir isU
  5433. -/|766: O: O1532 (predict-no)
  5434. I see 1 and I'm going to do: predict-no
  5435. ENV: Agent did: predict-no for direction U in state State-B
  5436. In State-B moving U
  5437. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5438. predict error 0
  5439. dir: dir isR
  5440. \-767: O: O1534 (predict-no)
  5441. I see 1 and I'm going to do: predict-no
  5442. ENV: Agent did: predict-no for direction R in state State-B
  5443. In State-B moving R
  5444. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5445. predict error 0
  5446. dir: dir isU
  5447. /|\768: O: O1536 (predict-no)
  5448. I see 1 and I'm going to do: predict-no
  5449. ENV: Agent did: predict-no for direction U in state State-B
  5450. In State-B moving U
  5451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5452. predict error 0
  5453. dir: dir isR
  5454. -/769: O: O1538 (predict-no)
  5455. I see 1 and I'm going to do: predict-no
  5456. ENV: Agent did: predict-no for direction R in state State-B
  5457. In State-B moving R
  5458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5459. predict error 0
  5460. dir: dir isL
  5461. |\770: O: O1539 (predict-yes)
  5462. I see 1 and I'm going to do: predict-yes
  5463. ENV: Agent did: predict-yes for direction L in state State-B
  5464. In State-B moving L
  5465. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5466. predict error 0
  5467. dir: dir isR
  5468. -/|771: O: O1541 (predict-yes)
  5469. I see 1 and I'm going to do: predict-yes
  5470. ENV: Agent did: predict-yes for direction R in state State-A
  5471. In State-A moving R
  5472. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5473. predict error 0
  5474. dir: dir isL
  5475. \772: O: O1543 (predict-yes)
  5476. I see 1 and I'm going to do: predict-yes
  5477. ENV: Agent did: predict-yes for direction L in state State-B
  5478. In State-B moving L
  5479. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5480. predict error 0
  5481. dir: dir isL
  5482. -/773: O: O1546 (predict-no)
  5483. I see 1 and I'm going to do: predict-no
  5484. ENV: Agent did: predict-no for direction L in state State-A
  5485. In State-A moving L
  5486. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5487. predict error 0
  5488. dir: dir isL
  5489. |\-774: O: O1548 (predict-no)
  5490. I see 1 and I'm going to do: predict-no
  5491. ENV: Agent did: predict-no for direction L in state State-A
  5492. In State-A moving L
  5493. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5494. predict error 0
  5495. dir: dir isL
  5496. /|775: O: O1550 (predict-no)
  5497. I see 1 and I'm going to do: predict-no
  5498. ENV: Agent did: predict-no for direction L in state State-A
  5499. In State-A moving L
  5500. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5501. predict error 0
  5502. dir: dir isR
  5503. \-/776: O: O1551 (predict-yes)
  5504. I see 1 and I'm going to do: predict-yes
  5505. ENV: Agent did: predict-yes for direction R in state State-A
  5506. In State-A moving R
  5507. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5508. predict error 0
  5509. dir: dir isU
  5510. |\777: O: O1554 (predict-no)
  5511. I see 1 and I'm going to do: predict-no
  5512. ENV: Agent did: predict-no for direction U in state State-B
  5513. In State-B moving U
  5514. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5515. predict error 0
  5516. dir: dir isU
  5517. -/|\778: O: O1556 (predict-no)
  5518. I see 1 and I'm going to do: predict-no
  5519. ENV: Agent did: predict-no for direction U in state State-B
  5520. In State-B moving U
  5521. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5522. predict error 0
  5523. dir: dir isU
  5524. -/|779: O: O1558 (predict-no)
  5525. I see 1 and I'm going to do: predict-no
  5526. ENV: Agent did: predict-no for direction U in state State-B
  5527. In State-B moving U
  5528. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5529. predict error 0
  5530. dir: dir isL
  5531. \-/780: O: O1559 (predict-yes)
  5532. I see 1 and I'm going to do: predict-yes
  5533. ENV: Agent did: predict-yes for direction L in state State-B
  5534. In State-B moving L
  5535. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5536. predict error 0
  5537. dir: dir isU
  5538. |\781: O: O1562 (predict-no)
  5539. I see 1 and I'm going to do: predict-no
  5540. ENV: Agent did: predict-no for direction U in state State-A
  5541. In State-A moving U
  5542. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5543. predict error 0
  5544. dir: dir isU
  5545. -782: O: O1564 (predict-no)
  5546. I see 1 and I'm going to do: predict-no
  5547. ENV: Agent did: predict-no for direction U in state State-A
  5548. In State-A moving U
  5549. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5550. predict error 0
  5551. dir: dir isR
  5552. /|\783: O: O1565 (predict-yes)
  5553. I see 1 and I'm going to do: predict-yes
  5554. ENV: Agent did: predict-yes for direction R in state State-A
  5555. In State-A moving R
  5556. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5557. predict error 0
  5558. dir: dir isU
  5559. -/|784: O: O1568 (predict-no)
  5560. I see 1 and I'm going to do: predict-no
  5561. ENV: Agent did: predict-no for direction U in state State-B
  5562. In State-B moving U
  5563. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5564. predict error 0
  5565. dir: dir isL
  5566. \-/785: O: O1569 (predict-yes)
  5567. I see 1 and I'm going to do: predict-yes
  5568. ENV: Agent did: predict-yes for direction L in state State-B
  5569. In State-B moving L
  5570. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5571. predict error 0
  5572. dir: dir isU
  5573. |\-786: O: O1572 (predict-no)
  5574. I see 1 and I'm going to do: predict-no
  5575. ENV: Agent did: predict-no for direction U in state State-A
  5576. In State-A moving U
  5577. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5578. predict error 0
  5579. dir: dir isU
  5580. /|787: O: O1574 (predict-no)
  5581. I see 1 and I'm going to do: predict-no
  5582. ENV: Agent did: predict-no for direction U in state State-A
  5583. In State-A moving U
  5584. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5585. predict error 0
  5586. dir: dir isU
  5587. \-788: O: O1576 (predict-no)
  5588. I see 1 and I'm going to do: predict-no
  5589. ENV: Agent did: predict-no for direction U in state State-A
  5590. In State-A moving U
  5591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5592. predict error 0
  5593. dir: dir isU
  5594. /|789: O: O1578 (predict-no)
  5595. I see 1 and I'm going to do: predict-no
  5596. ENV: Agent did: predict-no for direction U in state State-A
  5597. In State-A moving U
  5598. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5599. predict error 0
  5600. dir: dir isU
  5601. \-790: O: O1580 (predict-no)
  5602. I see 1 and I'm going to do: predict-no
  5603. ENV: Agent did: predict-no for direction U in state State-A
  5604. In State-A moving U
  5605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5606. predict error 0
  5607. dir: dir isR
  5608. /|\-sleeping...
  5609. /791: O: O1581 (predict-yes)
  5610. I see 1 and I'm going to do: predict-yes
  5611. ENV: Agent did: predict-yes for direction R in state State-A
  5612. In State-A moving R
  5613. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5614. predict error 0
  5615. dir: dir isU
  5616. |792: O: O1584 (predict-no)
  5617. I see 1 and I'm going to do: predict-no
  5618. ENV: Agent did: predict-no for direction U in state State-B
  5619. In State-B moving U
  5620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5621. predict error 0
  5622. dir: dir isU
  5623. \-/793: O: O1586 (predict-no)
  5624. I see 1 and I'm going to do: predict-no
  5625. ENV: Agent did: predict-no for direction U in state State-B
  5626. In State-B moving U
  5627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5628. predict error 0
  5629. dir: dir isR
  5630. |\-794: O: O1588 (predict-no)
  5631. I see 1 and I'm going to do: predict-no
  5632. ENV: Agent did: predict-no for direction R in state State-B
  5633. In State-B moving R
  5634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5635. predict error 0
  5636. dir: dir isR
  5637. /|\-795: O: O1590 (predict-no)
  5638. I see 1 and I'm going to do: predict-no
  5639. ENV: Agent did: predict-no for direction R in state State-B
  5640. In State-B moving R
  5641. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5642. predict error 0
  5643. dir: dir isU
  5644. /|\796: O: O1592 (predict-no)
  5645. I see 1 and I'm going to do: predict-no
  5646. ENV: Agent did: predict-no for direction U in state State-B
  5647. In State-B moving U
  5648. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5649. predict error 0
  5650. dir: dir isL
  5651. -/|797: O: O1593 (predict-yes)
  5652. I see 1 and I'm going to do: predict-yes
  5653. ENV: Agent did: predict-yes for direction L in state State-B
  5654. In State-B moving L
  5655. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5656. predict error 0
  5657. dir: dir isR
  5658. \-/798: O: O1595 (predict-yes)
  5659. I see 1 and I'm going to do: predict-yes
  5660. ENV: Agent did: predict-yes for direction R in state State-A
  5661. In State-A moving R
  5662. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5663. predict error 0
  5664. dir: dir isL
  5665. |\-799: O: O1597 (predict-yes)
  5666. I see 1 and I'm going to do: predict-yes
  5667. ENV: Agent did: predict-yes for direction L in state State-B
  5668. In State-B moving L
  5669. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5670. predict error 0
  5671. dir: dir isU
  5672. /|\800: O: O1600 (predict-no)
  5673. I see 1 and I'm going to do: predict-no
  5674. ENV: Agent did: predict-no for direction U in state State-A
  5675. In State-A moving U
  5676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5677. predict error 0
  5678. dir: dir isR
  5679. -/|801: O: O1601 (predict-yes)
  5680. I see 1 and I'm going to do: predict-yes
  5681. ENV: Agent did: predict-yes for direction R in state State-A
  5682. In State-A moving R
  5683. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5684. predict error 0
  5685. dir: dir isU
  5686. \802: O: O1604 (predict-no)
  5687. I see 1 and I'm going to do: predict-no
  5688. ENV: Agent did: predict-no for direction U in state State-B
  5689. In State-B moving U
  5690. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5691. predict error 0
  5692. dir: dir isL
  5693. -/|803: O: O1605 (predict-yes)
  5694. I see 1 and I'm going to do: predict-yes
  5695. ENV: Agent did: predict-yes for direction L in state State-B
  5696. In State-B moving L
  5697. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5698. predict error 0
  5699. dir: dir isL
  5700. \-/804: O: O1608 (predict-no)
  5701. I see 1 and I'm going to do: predict-no
  5702. ENV: Agent did: predict-no for direction L in state State-A
  5703. In State-A moving L
  5704. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5705. predict error 0
  5706. dir: dir isR
  5707. |\805: O: O1609 (predict-yes)
  5708. I see 1 and I'm going to do: predict-yes
  5709. ENV: Agent did: predict-yes for direction R in state State-A
  5710. In State-A moving R
  5711. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5712. predict error 0
  5713. dir: dir isU
  5714. -/|806: O: O1612 (predict-no)
  5715. I see 1 and I'm going to do: predict-no
  5716. ENV: Agent did: predict-no for direction U in state State-B
  5717. In State-B moving U
  5718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5719. predict error 0
  5720. dir: dir isR
  5721. \-/807: O: O1614 (predict-no)
  5722. I see 1 and I'm going to do: predict-no
  5723. ENV: Agent did: predict-no for direction R in state State-B
  5724. In State-B moving R
  5725. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5726. predict error 0
  5727. dir: dir isR
  5728. |\808: O: O1616 (predict-no)
  5729. I see 1 and I'm going to do: predict-no
  5730. ENV: Agent did: predict-no for direction R in state State-B
  5731. In State-B moving R
  5732. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5733. predict error 0
  5734. dir: dir isR
  5735. -/|809: O: O1618 (predict-no)
  5736. I see 1 and I'm going to do: predict-no
  5737. ENV: Agent did: predict-no for direction R in state State-B
  5738. In State-B moving R
  5739. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5740. predict error 0
  5741. dir: dir isU
  5742. \-/810: O: O1620 (predict-no)
  5743. I see 1 and I'm going to do: predict-no
  5744. ENV: Agent did: predict-no for direction U in state State-B
  5745. In State-B moving U
  5746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5747. predict error 0
  5748. dir: dir isL
  5749. |\-811: O: O1621 (predict-yes)
  5750. I see 1 and I'm going to do: predict-yes
  5751. ENV: Agent did: predict-yes for direction L in state State-B
  5752. In State-B moving L
  5753. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5754. predict error 0
  5755. dir: dir isU
  5756. /812: O: O1624 (predict-no)
  5757. I see 1 and I'm going to do: predict-no
  5758. ENV: Agent did: predict-no for direction U in state State-A
  5759. In State-A moving U
  5760. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5761. predict error 0
  5762. dir: dir isL
  5763. |\-/813: O: O1626 (predict-no)
  5764. I see 1 and I'm going to do: predict-no
  5765. ENV: Agent did: predict-no for direction L in state State-A
  5766. In State-A moving L
  5767. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5768. predict error 0
  5769. dir: dir isU
  5770. |\-814: O: O1628 (predict-no)
  5771. I see 1 and I'm going to do: predict-no
  5772. ENV: Agent did: predict-no for direction U in state State-A
  5773. In State-A moving U
  5774. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5775. predict error 0
  5776. dir: dir isU
  5777. /|\815: O: O1630 (predict-no)
  5778. I see 1 and I'm going to do: predict-no
  5779. ENV: Agent did: predict-no for direction U in state State-A
  5780. In State-A moving U
  5781. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5782. predict error 0
  5783. dir: dir isR
  5784. -/|816: O: O1631 (predict-yes)
  5785. I see 1 and I'm going to do: predict-yes
  5786. ENV: Agent did: predict-yes for direction R in state State-A
  5787. In State-A moving R
  5788. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5789. predict error 0
  5790. dir: dir isU
  5791. \-/|817: O: O1634 (predict-no)
  5792. I see 1 and I'm going to do: predict-no
  5793. ENV: Agent did: predict-no for direction U in state State-B
  5794. In State-B moving U
  5795. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5796. predict error 0
  5797. dir: dir isL
  5798. \-/818: O: O1635 (predict-yes)
  5799. I see 1 and I'm going to do: predict-yes
  5800. ENV: Agent did: predict-yes for direction L in state State-B
  5801. In State-B moving L
  5802. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5803. predict error 0
  5804. dir: dir isL
  5805. |\-819: O: O1638 (predict-no)
  5806. I see 1 and I'm going to do: predict-no
  5807. ENV: Agent did: predict-no for direction L in state State-A
  5808. In State-A moving L
  5809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5810. predict error 0
  5811. dir: dir isR
  5812. /|\-820: O: O1639 (predict-yes)
  5813. I see 1 and I'm going to do: predict-yes
  5814. ENV: Agent did: predict-yes for direction R in state State-A
  5815. In State-A moving R
  5816. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5817. predict error 0
  5818. dir: dir isL
  5819. /|\821: O: O1641 (predict-yes)
  5820. I see 1 and I'm going to do: predict-yes
  5821. ENV: Agent did: predict-yes for direction L in state State-B
  5822. In State-B moving L
  5823. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5824. predict error 0
  5825. dir: dir isU
  5826. -822: O: O1644 (predict-no)
  5827. I see 1 and I'm going to do: predict-no
  5828. ENV: Agent did: predict-no for direction U in state State-A
  5829. In State-A moving U
  5830. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5831. predict error 0
  5832. dir: dir isR
  5833. /|823: O: O1645 (predict-yes)
  5834. I see 1 and I'm going to do: predict-yes
  5835. ENV: Agent did: predict-yes for direction R in state State-A
  5836. In State-A moving R
  5837. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5838. predict error 0
  5839. dir: dir isR
  5840. \-/|824: O: O1648 (predict-no)
  5841. I see 1 and I'm going to do: predict-no
  5842. ENV: Agent did: predict-no for direction R in state State-B
  5843. In State-B moving R
  5844. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5845. predict error 0
  5846. dir: dir isU
  5847. \-/825: O: O1650 (predict-no)
  5848. I see 1 and I'm going to do: predict-no
  5849. ENV: Agent did: predict-no for direction U in state State-B
  5850. In State-B moving U
  5851. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5852. predict error 0
  5853. dir: dir isU
  5854. |\826: O: O1652 (predict-no)
  5855. I see 1 and I'm going to do: predict-no
  5856. ENV: Agent did: predict-no for direction U in state State-B
  5857. In State-B moving U
  5858. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5859. predict error 0
  5860. dir: dir isU
  5861. -/|\827: O: O1654 (predict-no)
  5862. I see 1 and I'm going to do: predict-no
  5863. ENV: Agent did: predict-no for direction U in state State-B
  5864. In State-B moving U
  5865. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5866. predict error 0
  5867. dir: dir isU
  5868. -828: O: O1656 (predict-no)
  5869. I see 1 and I'm going to do: predict-no
  5870. ENV: Agent did: predict-no for direction U in state State-B
  5871. In State-B moving U
  5872. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5873. predict error 0
  5874. dir: dir isU
  5875. /|829: O: O1658 (predict-no)
  5876. I see 1 and I'm going to do: predict-no
  5877. ENV: Agent did: predict-no for direction U in state State-B
  5878. In State-B moving U
  5879. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5880. predict error 0
  5881. dir: dir isU
  5882. \-/|830: O: O1660 (predict-no)
  5883. I see 1 and I'm going to do: predict-no
  5884. ENV: Agent did: predict-no for direction U in state State-B
  5885. In State-B moving U
  5886. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5887. predict error 0
  5888. dir: dir isR
  5889. \-/831: O: O1662 (predict-no)
  5890. I see 1 and I'm going to do: predict-no
  5891. ENV: Agent did: predict-no for direction R in state State-B
  5892. In State-B moving R
  5893. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5894. predict error 0
  5895. dir: dir isR
  5896. |832: O: O1664 (predict-no)
  5897. I see 1 and I'm going to do: predict-no
  5898. ENV: Agent did: predict-no for direction R in state State-B
  5899. In State-B moving R
  5900. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5901. predict error 0
  5902. dir: dir isU
  5903. \-/|833: O: O1666 (predict-no)
  5904. I see 1 and I'm going to do: predict-no
  5905. ENV: Agent did: predict-no for direction U in state State-B
  5906. In State-B moving U
  5907. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5908. predict error 0
  5909. dir: dir isU
  5910. \-/834: O: O1668 (predict-no)
  5911. I see 1 and I'm going to do: predict-no
  5912. ENV: Agent did: predict-no for direction U in state State-B
  5913. In State-B moving U
  5914. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5915. predict error 0
  5916. dir: dir isU
  5917. |\-835: O: O1670 (predict-no)
  5918. I see 1 and I'm going to do: predict-no
  5919. ENV: Agent did: predict-no for direction U in state State-B
  5920. In State-B moving U
  5921. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5922. predict error 0
  5923. dir: dir isU
  5924. /|836: O: O1672 (predict-no)
  5925. I see 1 and I'm going to do: predict-no
  5926. ENV: Agent did: predict-no for direction U in state State-B
  5927. In State-B moving U
  5928. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5929. predict error 0
  5930. dir: dir isU
  5931. \-/837: O: O1674 (predict-no)
  5932. I see 1 and I'm going to do: predict-no
  5933. ENV: Agent did: predict-no for direction U in state State-B
  5934. In State-B moving U
  5935. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5936. predict error 0
  5937. dir: dir isL
  5938. |\-838: O: O1675 (predict-yes)
  5939. I see 1 and I'm going to do: predict-yes
  5940. ENV: Agent did: predict-yes for direction L in state State-B
  5941. In State-B moving L
  5942. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5943. predict error 0
  5944. dir: dir isR
  5945. /|839: O: O1677 (predict-yes)
  5946. I see 1 and I'm going to do: predict-yes
  5947. ENV: Agent did: predict-yes for direction R in state State-A
  5948. In State-A moving R
  5949. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5950. predict error 0
  5951. dir: dir isL
  5952. \-840: O: O1679 (predict-yes)
  5953. I see 1 and I'm going to do: predict-yes
  5954. ENV: Agent did: predict-yes for direction L in state State-B
  5955. In State-B moving L
  5956. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5957. predict error 0
  5958. dir: dir isR
  5959. /|\841: O: O1681 (predict-yes)
  5960. I see 1 and I'm going to do: predict-yes
  5961. ENV: Agent did: predict-yes for direction R in state State-A
  5962. In State-A moving R
  5963. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5964. predict error 0
  5965. dir: dir isL
  5966. -842: O: O1683 (predict-yes)
  5967. I see 1 and I'm going to do: predict-yes
  5968. ENV: Agent did: predict-yes for direction L in state State-B
  5969. In State-B moving L
  5970. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5971. predict error 0
  5972. dir: dir isL
  5973. /|\-843: O: O1686 (predict-no)
  5974. I see 1 and I'm going to do: predict-no
  5975. ENV: Agent did: predict-no for direction L in state State-A
  5976. In State-A moving L
  5977. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5978. predict error 0
  5979. dir: dir isU
  5980. /|844: O: O1688 (predict-no)
  5981. I see 1 and I'm going to do: predict-no
  5982. ENV: Agent did: predict-no for direction U in state State-A
  5983. In State-A moving U
  5984. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5985. predict error 0
  5986. dir: dir isU
  5987. \-/845: O: O1690 (predict-no)
  5988. I see 1 and I'm going to do: predict-no
  5989. ENV: Agent did: predict-no for direction U in state State-A
  5990. In State-A moving U
  5991. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5992. predict error 0
  5993. dir: dir isL
  5994. |\846: O: O1692 (predict-no)
  5995. I see 1 and I'm going to do: predict-no
  5996. ENV: Agent did: predict-no for direction L in state State-A
  5997. In State-A moving L
  5998. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5999. predict error 0
  6000. dir: dir isL
  6001. -/847: O: O1694 (predict-no)
  6002. I see 1 and I'm going to do: predict-no
  6003. ENV: Agent did: predict-no for direction L in state State-A
  6004. In State-A moving L
  6005. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6006. predict error 0
  6007. dir: dir isU
  6008. |\848: O: O1696 (predict-no)
  6009. I see 1 and I'm going to do: predict-no
  6010. ENV: Agent did: predict-no for direction U in state State-A
  6011. In State-A moving U
  6012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6013. predict error 0
  6014. dir: dir isU
  6015. -/|849: O: O1698 (predict-no)
  6016. I see 1 and I'm going to do: predict-no
  6017. ENV: Agent did: predict-no for direction U in state State-A
  6018. In State-A moving U
  6019. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6020. predict error 0
  6021. dir: dir isL
  6022. \-/850: O: O1700 (predict-no)
  6023. I see 1 and I'm going to do: predict-no
  6024. ENV: Agent did: predict-no for direction L in state State-A
  6025. In State-A moving L
  6026. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6027. predict error 0
  6028. dir: dir isU
  6029. |851: O: O1702 (predict-no)
  6030. I see 1 and I'm going to do: predict-no
  6031. ENV: Agent did: predict-no for direction U in state State-A
  6032. In State-A moving U
  6033. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6034. predict error 0
  6035. dir: dir isU
  6036. \852: O: O1704 (predict-no)
  6037. I see 1 and I'm going to do: predict-no
  6038. ENV: Agent did: predict-no for direction U in state State-A
  6039. In State-A moving U
  6040. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6041. predict error 0
  6042. dir: dir isR
  6043. -/|\853: O: O1705 (predict-yes)
  6044. I see 1 and I'm going to do: predict-yes
  6045. ENV: Agent did: predict-yes for direction R in state State-A
  6046. In State-A moving R
  6047. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6048. predict error 0
  6049. dir: dir isL
  6050. -/|854: O: O1707 (predict-yes)
  6051. I see 1 and I'm going to do: predict-yes
  6052. ENV: Agent did: predict-yes for direction L in state State-B
  6053. In State-B moving L
  6054. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6055. predict error 0
  6056. dir: dir isL
  6057. \-/855: O: O1710 (predict-no)
  6058. I see 1 and I'm going to do: predict-no
  6059. ENV: Agent did: predict-no for direction L in state State-A
  6060. In State-A moving L
  6061. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6062. predict error 0
  6063. dir: dir isL
  6064. |\856: O: O1712 (predict-no)
  6065. I see 1 and I'm going to do: predict-no
  6066. ENV: Agent did: predict-no for direction L in state State-A
  6067. In State-A moving L
  6068. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6069. predict error 0
  6070. dir: dir isR
  6071. -/857: O: O1713 (predict-yes)
  6072. I see 1 and I'm going to do: predict-yes
  6073. ENV: Agent did: predict-yes for direction R in state State-A
  6074. In State-A moving R
  6075. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6076. predict error 0
  6077. dir: dir isU
  6078. |\-858: O: O1716 (predict-no)
  6079. I see 1 and I'm going to do: predict-no
  6080. ENV: Agent did: predict-no for direction U in state State-B
  6081. In State-B moving U
  6082. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6083. predict error 0
  6084. dir: dir isR
  6085. /|859: O: O1718 (predict-no)
  6086. I see 1 and I'm going to do: predict-no
  6087. ENV: Agent did: predict-no for direction R in state State-B
  6088. In State-B moving R
  6089. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6090. predict error 0
  6091. dir: dir isL
  6092. \-860: O: O1719 (predict-yes)
  6093. I see 1 and I'm going to do: predict-yes
  6094. ENV: Agent did: predict-yes for direction L in state State-B
  6095. In State-B moving L
  6096. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6097. predict error 0
  6098. dir: dir isU
  6099. /|\861: O: O1722 (predict-no)
  6100. I see 1 and I'm going to do: predict-no
  6101. ENV: Agent did: predict-no for direction U in state State-A
  6102. In State-A moving U
  6103. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6104. predict error 0
  6105. dir: dir isL
  6106. -862: O: O1724 (predict-no)
  6107. I see 1 and I'm going to do: predict-no
  6108. ENV: Agent did: predict-no for direction L in state State-A
  6109. In State-A moving L
  6110. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6111. predict error 0
  6112. dir: dir isR
  6113. /|863: O: O1725 (predict-yes)
  6114. I see 1 and I'm going to do: predict-yes
  6115. ENV: Agent did: predict-yes for direction R in state State-A
  6116. In State-A moving R
  6117. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6118. predict error 0
  6119. dir: dir isL
  6120. \-/|864: O: O1727 (predict-yes)
  6121. I see 1 and I'm going to do: predict-yes
  6122. ENV: Agent did: predict-yes for direction L in state State-B
  6123. In State-B moving L
  6124. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6125. predict error 0
  6126. dir: dir isL
  6127. \-/865: O: O1730 (predict-no)
  6128. I see 1 and I'm going to do: predict-no
  6129. ENV: Agent did: predict-no for direction L in state State-A
  6130. In State-A moving L
  6131. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6132. predict error 0
  6133. dir: dir isL
  6134. |\-/866: O: O1732 (predict-no)
  6135. I see 1 and I'm going to do: predict-no
  6136. ENV: Agent did: predict-no for direction L in state State-A
  6137. In State-A moving L
  6138. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6139. predict error 0
  6140. dir: dir isU
  6141. |\867: O: O1734 (predict-no)
  6142. I see 1 and I'm going to do: predict-no
  6143. ENV: Agent did: predict-no for direction U in state State-A
  6144. In State-A moving U
  6145. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6146. predict error 0
  6147. dir: dir isL
  6148. -/|868: O: O1736 (predict-no)
  6149. I see 1 and I'm going to do: predict-no
  6150. ENV: Agent did: predict-no for direction L in state State-A
  6151. In State-A moving L
  6152. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6153. predict error 0
  6154. dir: dir isL
  6155. \-869: O: O1738 (predict-no)
  6156. I see 1 and I'm going to do: predict-no
  6157. ENV: Agent did: predict-no for direction L in state State-A
  6158. In State-A moving L
  6159. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6160. predict error 0
  6161. dir: dir isR
  6162. /|870: O: O1739 (predict-yes)
  6163. I see 1 and I'm going to do: predict-yes
  6164. ENV: Agent did: predict-yes for direction R in state State-A
  6165. In State-A moving R
  6166. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6167. predict error 0
  6168. dir: dir isR
  6169. \-/871: O: O1742 (predict-no)
  6170. I see 1 and I'm going to do: predict-no
  6171. ENV: Agent did: predict-no for direction R in state State-B
  6172. In State-B moving R
  6173. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6174. predict error 0
  6175. dir: dir isR
  6176. |872: O: O1744 (predict-no)
  6177. I see 1 and I'm going to do: predict-no
  6178. ENV: Agent did: predict-no for direction R in state State-B
  6179. In State-B moving R
  6180. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6181. predict error 0
  6182. dir: dir isR
  6183. \-873: O: O1746 (predict-no)
  6184. I see 1 and I'm going to do: predict-no
  6185. ENV: Agent did: predict-no for direction R in state State-B
  6186. In State-B moving R
  6187. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6188. predict error 0
  6189. dir: dir isU
  6190. /|874: O: O1748 (predict-no)
  6191. I see 1 and I'm going to do: predict-no
  6192. ENV: Agent did: predict-no for direction U in state State-B
  6193. In State-B moving U
  6194. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6195. predict error 0
  6196. dir: dir isU
  6197. \-/875: O: O1750 (predict-no)
  6198. I see 1 and I'm going to do: predict-no
  6199. ENV: Agent did: predict-no for direction U in state State-B
  6200. In State-B moving U
  6201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6202. predict error 0
  6203. dir: dir isR
  6204. |\-876: O: O1752 (predict-no)
  6205. I see 1 and I'm going to do: predict-no
  6206. ENV: Agent did: predict-no for direction R in state State-B
  6207. In State-B moving R
  6208. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6209. predict error 0
  6210. dir: dir isR
  6211. /|877: O: O1754 (predict-no)
  6212. I see 1 and I'm going to do: predict-no
  6213. ENV: Agent did: predict-no for direction R in state State-B
  6214. In State-B moving R
  6215. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6216. predict error 0
  6217. dir: dir isL
  6218. \-878: O: O1755 (predict-yes)
  6219. I see 1 and I'm going to do: predict-yes
  6220. ENV: Agent did: predict-yes for direction L in state State-B
  6221. In State-B moving L
  6222. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6223. predict error 0
  6224. dir: dir isL
  6225. /|\879: O: O1758 (predict-no)
  6226. I see 1 and I'm going to do: predict-no
  6227. ENV: Agent did: predict-no for direction L in state State-A
  6228. In State-A moving L
  6229. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6230. predict error 0
  6231. dir: dir isL
  6232. -/|880: O: O1760 (predict-no)
  6233. I see 1 and I'm going to do: predict-no
  6234. ENV: Agent did: predict-no for direction L in state State-A
  6235. In State-A moving L
  6236. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6237. predict error 0
  6238. dir: dir isU
  6239. \-881: O: O1762 (predict-no)
  6240. I see 1 and I'm going to do: predict-no
  6241. ENV: Agent did: predict-no for direction U in state State-A
  6242. In State-A moving U
  6243. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6244. predict error 0
  6245. dir: dir isR
  6246. /882: O: O1763 (predict-yes)
  6247. I see 1 and I'm going to do: predict-yes
  6248. ENV: Agent did: predict-yes for direction R in state State-A
  6249. In State-A moving R
  6250. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6251. predict error 0
  6252. dir: dir isU
  6253. |\-883: O: O1766 (predict-no)
  6254. I see 1 and I'm going to do: predict-no
  6255. ENV: Agent did: predict-no for direction U in state State-B
  6256. In State-B moving U
  6257. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6258. predict error 0
  6259. dir: dir isU
  6260. /|884: O: O1768 (predict-no)
  6261. I see 1 and I'm going to do: predict-no
  6262. ENV: Agent did: predict-no for direction U in state State-B
  6263. In State-B moving U
  6264. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6265. predict error 0
  6266. dir: dir isR
  6267. \-/885: O: O1770 (predict-no)
  6268. I see 1 and I'm going to do: predict-no
  6269. ENV: Agent did: predict-no for direction R in state State-B
  6270. In State-B moving R
  6271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6272. predict error 0
  6273. dir: dir isL
  6274. |\-/886: O: O1771 (predict-yes)
  6275. I see 1 and I'm going to do: predict-yes
  6276. ENV: Agent did: predict-yes for direction L in state State-B
  6277. In State-B moving L
  6278. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6279. predict error 0
  6280. dir: dir isL
  6281. |\-887: O: O1774 (predict-no)
  6282. I see 1 and I'm going to do: predict-no
  6283. ENV: Agent did: predict-no for direction L in state State-A
  6284. In State-A moving L
  6285. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6286. predict error 0
  6287. dir: dir isR
  6288. /|\-888: O: O1775 (predict-yes)
  6289. I see 1 and I'm going to do: predict-yes
  6290. ENV: Agent did: predict-yes for direction R in state State-A
  6291. In State-A moving R
  6292. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6293. predict error 0
  6294. dir: dir isR
  6295. /|\889: O: O1778 (predict-no)
  6296. I see 1 and I'm going to do: predict-no
  6297. ENV: Agent did: predict-no for direction R in state State-B
  6298. In State-B moving R
  6299. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6300. predict error 0
  6301. dir: dir isL
  6302. -/|890: O: O1779 (predict-yes)
  6303. I see 1 and I'm going to do: predict-yes
  6304. ENV: Agent did: predict-yes for direction L in state State-B
  6305. In State-B moving L
  6306. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6307. predict error 0
  6308. dir: dir isU
  6309. \-891: O: O1782 (predict-no)
  6310. I see 1 and I'm going to do: predict-no
  6311. ENV: Agent did: predict-no for direction U in state State-A
  6312. In State-A moving U
  6313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6314. predict error 0
  6315. dir: dir isR
  6316. /892: O: O1783 (predict-yes)
  6317. I see 1 and I'm going to do: predict-yes
  6318. ENV: Agent did: predict-yes for direction R in state State-A
  6319. In State-A moving R
  6320. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6321. predict error 0
  6322. dir: dir isU
  6323. |\893: O: O1786 (predict-no)
  6324. I see 1 and I'm going to do: predict-no
  6325. ENV: Agent did: predict-no for direction U in state State-B
  6326. In State-B moving U
  6327. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6328. predict error 0
  6329. dir: dir isU
  6330. -/894: O: O1788 (predict-no)
  6331. I see 1 and I'm going to do: predict-no
  6332. ENV: Agent did: predict-no for direction U in state State-B
  6333. In State-B moving U
  6334. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6335. predict error 0
  6336. dir: dir isU
  6337. |\895: O: O1790 (predict-no)
  6338. I see 1 and I'm going to do: predict-no
  6339. ENV: Agent did: predict-no for direction U in state State-B
  6340. In State-B moving U
  6341. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6342. predict error 0
  6343. dir: dir isU
  6344. -/896: O: O1792 (predict-no)
  6345. I see 1 and I'm going to do: predict-no
  6346. ENV: Agent did: predict-no for direction U in state State-B
  6347. In State-B moving U
  6348. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6349. predict error 0
  6350. dir: dir isU
  6351. |\897: O: O1794 (predict-no)
  6352. I see 1 and I'm going to do: predict-no
  6353. ENV: Agent did: predict-no for direction U in state State-B
  6354. In State-B moving U
  6355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6356. predict error 0
  6357. dir: dir isR
  6358. -/898: O: O1796 (predict-no)
  6359. I see 1 and I'm going to do: predict-no
  6360. ENV: Agent did: predict-no for direction R in state State-B
  6361. In State-B moving R
  6362. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6363. predict error 0
  6364. dir: dir isU
  6365. |\899: O: O1798 (predict-no)
  6366. I see 1 and I'm going to do: predict-no
  6367. ENV: Agent did: predict-no for direction U in state State-B
  6368. In State-B moving U
  6369. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6370. predict error 0
  6371. dir: dir isU
  6372. -/|900: O: O1800 (predict-no)
  6373. I see 1 and I'm going to do: predict-no
  6374. ENV: Agent did: predict-no for direction U in state State-B
  6375. In State-B moving U
  6376. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6377. predict error 0
  6378. dir: dir isL
  6379. \-901: O: O1801 (predict-yes)
  6380. I see 1 and I'm going to do: predict-yes
  6381. ENV: Agent did: predict-yes for direction L in state State-B
  6382. In State-B moving L
  6383. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6384. predict error 0
  6385. dir: dir isL
  6386. /902: O: O1804 (predict-no)
  6387. I see 1 and I'm going to do: predict-no
  6388. ENV: Agent did: predict-no for direction L in state State-A
  6389. In State-A moving L
  6390. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6391. predict error 0
  6392. dir: dir isL
  6393. |\-903: O: O1806 (predict-no)
  6394. I see 1 and I'm going to do: predict-no
  6395. ENV: Agent did: predict-no for direction L in state State-A
  6396. In State-A moving L
  6397. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6398. predict error 0
  6399. dir: dir isR
  6400. /|\904: O: O1807 (predict-yes)
  6401. I see 1 and I'm going to do: predict-yes
  6402. ENV: Agent did: predict-yes for direction R in state State-A
  6403. In State-A moving R
  6404. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6405. predict error 0
  6406. dir: dir isU
  6407. -/|905: O: O1810 (predict-no)
  6408. I see 1 and I'm going to do: predict-no
  6409. ENV: Agent did: predict-no for direction U in state State-B
  6410. In State-B moving U
  6411. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6412. predict error 0
  6413. dir: dir isR
  6414. \-/|906: O: O1812 (predict-no)
  6415. I see 1 and I'm going to do: predict-no
  6416. ENV: Agent did: predict-no for direction R in state State-B
  6417. In State-B moving R
  6418. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6419. predict error 0
  6420. dir: dir isU
  6421. \-907: O: O1814 (predict-no)
  6422. I see 1 and I'm going to do: predict-no
  6423. ENV: Agent did: predict-no for direction U in state State-B
  6424. In State-B moving U
  6425. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6426. predict error 0
  6427. dir: dir isR
  6428. /|\908: O: O1816 (predict-no)
  6429. I see 1 and I'm going to do: predict-no
  6430. ENV: Agent did: predict-no for direction R in state State-B
  6431. In State-B moving R
  6432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6433. predict error 0
  6434. dir: dir isR
  6435. -/909: O: O1818 (predict-no)
  6436. I see 1 and I'm going to do: predict-no
  6437. ENV: Agent did: predict-no for direction R in state State-B
  6438. In State-B moving R
  6439. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6440. predict error 0
  6441. dir: dir isL
  6442. |\910: O: O1819 (predict-yes)
  6443. I see 1 and I'm going to do: predict-yes
  6444. ENV: Agent did: predict-yes for direction L in state State-B
  6445. In State-B moving L
  6446. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6447. predict error 0
  6448. dir: dir isR
  6449. -/|911: O: O1821 (predict-yes)
  6450. I see 1 and I'm going to do: predict-yes
  6451. ENV: Agent did: predict-yes for direction R in state State-A
  6452. In State-A moving R
  6453. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6454. predict error 0
  6455. dir: dir isL
  6456. \912: O: O1823 (predict-yes)
  6457. I see 1 and I'm going to do: predict-yes
  6458. ENV: Agent did: predict-yes for direction L in state State-B
  6459. In State-B moving L
  6460. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6461. predict error 0
  6462. dir: dir isL
  6463. -/|\913: O: O1826 (predict-no)
  6464. I see 1 and I'm going to do: predict-no
  6465. ENV: Agent did: predict-no for direction L in state State-A
  6466. In State-A moving L
  6467. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6468. predict error 0
  6469. dir: dir isR
  6470. -914: O: O1827 (predict-yes)
  6471. I see 1 and I'm going to do: predict-yes
  6472. ENV: Agent did: predict-yes for direction R in state State-A
  6473. In State-A moving R
  6474. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6475. predict error 0
  6476. dir: dir isR
  6477. /|915: O: O1830 (predict-no)
  6478. I see 1 and I'm going to do: predict-no
  6479. ENV: Agent did: predict-no for direction R in state State-B
  6480. In State-B moving R
  6481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6482. predict error 0
  6483. dir: dir isU
  6484. \-/916: O: O1832 (predict-no)
  6485. I see 1 and I'm going to do: predict-no
  6486. ENV: Agent did: predict-no for direction U in state State-B
  6487. In State-B moving U
  6488. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6489. predict error 0
  6490. dir: dir isL
  6491. |\917: O: O1833 (predict-yes)
  6492. I see 1 and I'm going to do: predict-yes
  6493. ENV: Agent did: predict-yes for direction L in state State-B
  6494. In State-B moving L
  6495. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6496. predict error 0
  6497. dir: dir isU
  6498. -/918: O: O1836 (predict-no)
  6499. I see 1 and I'm going to do: predict-no
  6500. ENV: Agent did: predict-no for direction U in state State-A
  6501. In State-A moving U
  6502. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6503. predict error 0
  6504. dir: dir isU
  6505. |\-/919: O: O1838 (predict-no)
  6506. I see 1 and I'm going to do: predict-no
  6507. ENV: Agent did: predict-no for direction U in state State-A
  6508. In State-A moving U
  6509. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6510. predict error 0
  6511. dir: dir isL
  6512. |\-920: O: O1840 (predict-no)
  6513. I see 1 and I'm going to do: predict-no
  6514. ENV: Agent did: predict-no for direction L in state State-A
  6515. In State-A moving L
  6516. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6517. predict error 0
  6518. dir: dir isU
  6519. /|\921: O: O1842 (predict-no)
  6520. I see 1 and I'm going to do: predict-no
  6521. ENV: Agent did: predict-no for direction U in state State-A
  6522. In State-A moving U
  6523. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6524. predict error 0
  6525. dir: dir isU
  6526. -922: O: O1844 (predict-no)
  6527. I see 1 and I'm going to do: predict-no
  6528. ENV: Agent did: predict-no for direction U in state State-A
  6529. In State-A moving U
  6530. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6531. predict error 0
  6532. dir: dir isU
  6533. /|\923: O: O1846 (predict-no)
  6534. I see 1 and I'm going to do: predict-no
  6535. ENV: Agent did: predict-no for direction U in state State-A
  6536. In State-A moving U
  6537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6538. predict error 0
  6539. dir: dir isU
  6540. -/924: O: O1848 (predict-no)
  6541. I see 1 and I'm going to do: predict-no
  6542. ENV: Agent did: predict-no for direction U in state State-A
  6543. In State-A moving U
  6544. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6545. predict error 0
  6546. dir: dir isR
  6547. |\925: O: O1849 (predict-yes)
  6548. I see 1 and I'm going to do: predict-yes
  6549. ENV: Agent did: predict-yes for direction R in state State-A
  6550. In State-A moving R
  6551. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6552. predict error 0
  6553. dir: dir isR
  6554. -926: O: O1852 (predict-no)
  6555. I see 1 and I'm going to do: predict-no
  6556. ENV: Agent did: predict-no for direction R in state State-B
  6557. In State-B moving R
  6558. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6559. predict error 0
  6560. dir: dir isR
  6561. /|\927: O: O1854 (predict-no)
  6562. I see 1 and I'm going to do: predict-no
  6563. ENV: Agent did: predict-no for direction R in state State-B
  6564. In State-B moving R
  6565. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6566. predict error 0
  6567. dir: dir isR
  6568. -/|928: O: O1856 (predict-no)
  6569. I see 1 and I'm going to do: predict-no
  6570. ENV: Agent did: predict-no for direction R in state State-B
  6571. In State-B moving R
  6572. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6573. predict error 0
  6574. dir: dir isL
  6575. \-/929: O: O1857 (predict-yes)
  6576. I see 1 and I'm going to do: predict-yes
  6577. ENV: Agent did: predict-yes for direction L in state State-B
  6578. In State-B moving L
  6579. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6580. predict error 0
  6581. dir: dir isU
  6582. |\930: O: O1860 (predict-no)
  6583. I see 1 and I'm going to do: predict-no
  6584. ENV: Agent did: predict-no for direction U in state State-A
  6585. In State-A moving U
  6586. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6587. predict error 0
  6588. dir: dir isR
  6589. -/931: O: O1861 (predict-yes)
  6590. I see 1 and I'm going to do: predict-yes
  6591. ENV: Agent did: predict-yes for direction R in state State-A
  6592. In State-A moving R
  6593. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6594. predict error 0
  6595. dir: dir isL
  6596. |932: O: O1863 (predict-yes)
  6597. I see 1 and I'm going to do: predict-yes
  6598. ENV: Agent did: predict-yes for direction L in state State-B
  6599. In State-B moving L
  6600. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6601. predict error 0
  6602. dir: dir isL
  6603. \-933: O: O1866 (predict-no)
  6604. I see 1 and I'm going to do: predict-no
  6605. ENV: Agent did: predict-no for direction L in state State-A
  6606. In State-A moving L
  6607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6608. predict error 0
  6609. dir: dir isL
  6610. /|\934: O: O1868 (predict-no)
  6611. I see 1 and I'm going to do: predict-no
  6612. ENV: Agent did: predict-no for direction L in state State-A
  6613. In State-A moving L
  6614. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6615. predict error 0
  6616. dir: dir isR
  6617. -/|935: O: O1869 (predict-yes)
  6618. I see 1 and I'm going to do: predict-yes
  6619. ENV: Agent did: predict-yes for direction R in state State-A
  6620. In State-A moving R
  6621. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6622. predict error 0
  6623. dir: dir isL
  6624. \-/936: O: O1871 (predict-yes)
  6625. I see 1 and I'm going to do: predict-yes
  6626. ENV: Agent did: predict-yes for direction L in state State-B
  6627. In State-B moving L
  6628. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6629. predict error 0
  6630. dir: dir isR
  6631. |\937: O: O1873 (predict-yes)
  6632. I see 1 and I'm going to do: predict-yes
  6633. ENV: Agent did: predict-yes for direction R in state State-A
  6634. In State-A moving R
  6635. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6636. predict error 0
  6637. dir: dir isL
  6638. -/|938: O: O1875 (predict-yes)
  6639. I see 1 and I'm going to do: predict-yes
  6640. ENV: Agent did: predict-yes for direction L in state State-B
  6641. In State-B moving L
  6642. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6643. predict error 0
  6644. dir: dir isU
  6645. \-/939: O: O1878 (predict-no)
  6646. I see 1 and I'm going to do: predict-no
  6647. ENV: Agent did: predict-no for direction U in state State-A
  6648. In State-A moving U
  6649. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6650. predict error 0
  6651. dir: dir isU
  6652. |\-940: O: O1880 (predict-no)
  6653. I see 1 and I'm going to do: predict-no
  6654. ENV: Agent did: predict-no for direction U in state State-A
  6655. In State-A moving U
  6656. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6657. predict error 0
  6658. dir: dir isL
  6659. /|\-sleeping...
  6660. /941: O: O1882 (predict-no)
  6661. I see 1 and I'm going to do: predict-no
  6662. ENV: Agent did: predict-no for direction L in state State-A
  6663. In State-A moving L
  6664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6665. predict error 0
  6666. dir: dir isU
  6667. |942: O: O1884 (predict-no)
  6668. I see 1 and I'm going to do: predict-no
  6669. ENV: Agent did: predict-no for direction U in state State-A
  6670. In State-A moving U
  6671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6672. predict error 0
  6673. dir: dir isR
  6674. \-/943: O: O1885 (predict-yes)
  6675. I see 1 and I'm going to do: predict-yes
  6676. ENV: Agent did: predict-yes for direction R in state State-A
  6677. In State-A moving R
  6678. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6679. predict error 0
  6680. dir: dir isL
  6681. |\-944: O: O1887 (predict-yes)
  6682. I see 1 and I'm going to do: predict-yes
  6683. ENV: Agent did: predict-yes for direction L in state State-B
  6684. In State-B moving L
  6685. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6686. predict error 0
  6687. dir: dir isU
  6688. /|\-945: O: O1890 (predict-no)
  6689. I see 1 and I'm going to do: predict-no
  6690. ENV: Agent did: predict-no for direction U in state State-A
  6691. In State-A moving U
  6692. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6693. predict error 0
  6694. dir: dir isL
  6695. /|\946: O: O1892 (predict-no)
  6696. I see 1 and I'm going to do: predict-no
  6697. ENV: Agent did: predict-no for direction L in state State-A
  6698. In State-A moving L
  6699. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6700. predict error 0
  6701. dir: dir isR
  6702. -/|947: O: O1893 (predict-yes)
  6703. I see 1 and I'm going to do: predict-yes
  6704. ENV: Agent did: predict-yes for direction R in state State-A
  6705. In State-A moving R
  6706. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6707. predict error 0
  6708. dir: dir isR
  6709. \-948: O: O1896 (predict-no)
  6710. I see 1 and I'm going to do: predict-no
  6711. ENV: Agent did: predict-no for direction R in state State-B
  6712. In State-B moving R
  6713. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6714. predict error 0
  6715. dir: dir isL
  6716. /|949: O: O1897 (predict-yes)
  6717. I see 1 and I'm going to do: predict-yes
  6718. ENV: Agent did: predict-yes for direction L in state State-B
  6719. In State-B moving L
  6720. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6721. predict error 0
  6722. dir: dir isR
  6723. \-/950: O: O1899 (predict-yes)
  6724. I see 1 and I'm going to do: predict-yes
  6725. ENV: Agent did: predict-yes for direction R in state State-A
  6726. In State-A moving R
  6727. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6728. predict error 0
  6729. dir: dir isU
  6730. |\-/|\-/|--- Input Phase ---
  6731. =>WM: (13313: I2 ^dir U)
  6732. =>WM: (13312: I2 ^reward 1)
  6733. =>WM: (13311: I2 ^see 1)
  6734. =>WM: (13310: N950 ^status complete)
  6735. <=WM: (13298: I2 ^dir R)
  6736. <=WM: (13297: I2 ^reward 1)
  6737. <=WM: (13296: I2 ^see 1)
  6738. =>WM: (13314: I2 ^level-1 R1-root)
  6739. <=WM: (13299: I2 ^level-1 L1-root)
  6740. --- END Input Phase ---
  6741. --- Proposal Phase ---
  6742. --- Inner Elaboration Phase, active level 1 (S1) ---
  6743. Firing elaborate*copy-see-to-output-link
  6744. -->
  6745. (I3 ^see 1 +)
  6746. Firing elaborate*reward*based*on*reward
  6747. -->
  6748. (R954 ^value 1 +)
  6749. (R1 ^reward R954 +)
  6750. Firing propose*predict-yes
  6751. -->
  6752. (O1901 ^name predict-yes +)
  6753. (S1 ^operator O1901 +)
  6754. Firing propose*predict-no
  6755. -->
  6756. (O1902 ^name predict-no +)
  6757. (S1 ^operator O1902 +)
  6758. Firing rl*prefer*rvt*predict-no*H0*6
  6759. -->
  6760. (S1 ^operator O1900 = 0.9999999999999999)
  6761. Firing rl*prefer*rvt*predict-yes*H0*5
  6762. -->
  6763. (S1 ^operator O1899 = 0.)
  6764. Firing prefer*rvt*predict-yes*H0
  6765. -->
  6766. Firing prefer*rvt*predict-no*H0
  6767. -->
  6768. Firing elaborate*copy-dir-to-output-link
  6769. -->
  6770. (I3 ^dir U +)
  6771. inner elaboration loop at bottom goal.
  6772. Retracting elaborate*copy-see-to-output-link
  6773. -->
  6774. (I3 ^see 1 +)
  6775. Retracting propose*predict-no
  6776. -->
  6777. (O1900 ^name predict-no +)
  6778. (S1 ^operator O1900 +)
  6779. Retracting propose*predict-yes
  6780. -->
  6781. (O1899 ^name predict-yes +)
  6782. (S1 ^operator O1899 +)
  6783. Retracting elaborate*reward*based*on*reward
  6784. -->
  6785. (R953 ^value 1 +)
  6786. (R1 ^reward R953 +)
  6787. Retracting elaborate*copy-dir-to-output-link
  6788. -->
  6789. (I3 ^dir R +)
  6790. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  6791. -->
  6792. (S1 ^operator O1900 = -0.02155734064455064)
  6793. Retracting rl*prefer*rvt*predict-no*H0*4
  6794. -->
  6795. (S1 ^operator O1900 = 0.4476192676183378)
  6796. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  6797. -->
  6798. (S1 ^operator O1899 = 0.8155729125006117)
  6799. Retracting rl*prefer*rvt*predict-yes*H0*3
  6800. -->
  6801. (S1 ^operator O1899 = 0.1844075378173239)
  6802. =>WM: (13321: S1 ^operator O1902 +)
  6803. =>WM: (13320: S1 ^operator O1901 +)
  6804. =>WM: (13319: I3 ^dir U)
  6805. =>WM: (13318: O1902 ^name predict-no)
  6806. =>WM: (13317: O1901 ^name predict-yes)
  6807. =>WM: (13316: R954 ^value 1)
  6808. =>WM: (13315: R1 ^reward R954)
  6809. <=WM: (13306: S1 ^operator O1899 +)
  6810. <=WM: (13308: S1 ^operator O1899)
  6811. <=WM: (13307: S1 ^operator O1900 +)
  6812. <=WM: (13305: I3 ^dir R)
  6813. <=WM: (13301: R1 ^reward R953)
  6814. <=WM: (13304: O1900 ^name predict-no)
  6815. <=WM: (13303: O1899 ^name predict-yes)
  6816. <=WM: (13302: R953 ^value 1)
  6817. --- Inner Elaboration Phase, active level 1 (S1) ---
  6818. Firing prefer*rvt*predict-yes*H0
  6819. -->
  6820. Firing rl*prefer*rvt*predict-yes*H0*5
  6821. -->
  6822. (S1 ^operator O1901 = 0.)
  6823. Firing prefer*rvt*predict-no*H0
  6824. -->
  6825. Firing rl*prefer*rvt*predict-no*H0*6
  6826. -->
  6827. (S1 ^operator O1902 = 0.9999999999999999)
  6828. inner elaboration loop at bottom goal.
  6829. Retracting rl*prefer*rvt*predict-no*H0*6
  6830. -->
  6831. (S1 ^operator O1900 = 0.9999999999999999)
  6832. Retracting rl*prefer*rvt*predict-yes*H0*5
  6833. -->
  6834. (S1 ^operator O1899 = 0.)
  6835. --- END Proposal Phase ---
  6836. --- Decision Phase ---
  6837. RL update rl*prefer*rvt*predict-yes*H0*3 0.675409 -0.491002 0.184408 -> 0.675413 -0.491002 0.18441(R,m,v=1,0.89441,0.0950311)
  6838. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324566 0.491007 0.815573 -> 0.324569 0.491006 0.815576(R,m,v=1,1,0)
  6839. =>WM: (13322: S1 ^operator O1902)
  6840. 951: O: O1902 (predict-no)
  6841. --- END Decision Phase ---
  6842. --- Application Phase ---
  6843. --- Firing Productions (PE) For State At Depth 1 ---
  6844. --- Inner Elaboration Phase, active level 1 (S1) ---
  6845. Firing apply*operator
  6846. -->
  6847. (I3 ^predict-no N951 + :O )
  6848. Firing apply*operator*complete
  6849. -->
  6850. (I3 ^predict-yes N950 - :O )
  6851. inner elaboration loop at bottom goal.
  6852. --- Change Working Memory (PE) ---
  6853. =>WM: (13323: I3 ^predict-no N951)
  6854. <=WM: (13310: N950 ^status complete)
  6855. <=WM: (13309: I3 ^predict-yes N950)
  6856. --- Firing Productions (IE) For State At Depth 1 ---
  6857. --- Inner Elaboration Phase, active level 1 (S1) ---
  6858. Firing monitor*world
  6859. -->
  6860. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6861. --- Change Working Memory (IE) ---
  6862. --- END Application Phase ---
  6863. --- Output Phase ---
  6864. ENV: Agent did: predict-no for direction U in state State-B
  6865. In State-B moving U
  6866. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6867. predict error 0
  6868. dir: dir isR
  6869. --- END Output Phase ---
  6870. \--- Input Phase ---
  6871. =>WM: (13327: I2 ^dir R)
  6872. =>WM: (13326: I2 ^reward 1)
  6873. =>WM: (13325: I2 ^see 0)
  6874. =>WM: (13324: N951 ^status complete)
  6875. <=WM: (13313: I2 ^dir U)
  6876. <=WM: (13312: I2 ^reward 1)
  6877. <=WM: (13311: I2 ^see 1)
  6878. =>WM: (13328: I2 ^level-1 R1-root)
  6879. <=WM: (13314: I2 ^level-1 R1-root)
  6880. --- END Input Phase ---
  6881. --- Proposal Phase ---
  6882. --- Inner Elaboration Phase, active level 1 (S1) ---
  6883. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  6884. -->
  6885. (S1 ^operator O1901 = 0.1398795999120246)
  6886. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  6887. -->
  6888. (S1 ^operator O1902 = 0.5523833737960075)
  6889. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6890. -->
  6891. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6892. -->
  6893. Firing elaborate*copy-see-to-output-link
  6894. -->
  6895. (I3 ^see 0 +)
  6896. Firing elaborate*reward*based*on*reward
  6897. -->
  6898. (R955 ^value 1 +)
  6899. (R1 ^reward R955 +)
  6900. Firing propose*predict-yes
  6901. -->
  6902. (O1903 ^name predict-yes +)
  6903. (S1 ^operator O1903 +)
  6904. Firing propose*predict-no
  6905. -->
  6906. (O1904 ^name predict-no +)
  6907. (S1 ^operator O1904 +)
  6908. Firing rl*prefer*rvt*predict-no*H0*4
  6909. -->
  6910. (S1 ^operator O1902 = 0.4476192676183378)
  6911. Firing rl*prefer*rvt*predict-yes*H0*3
  6912. -->
  6913. (S1 ^operator O1901 = 0.1844104702696336)
  6914. Firing prefer*rvt*predict-yes*H0
  6915. -->
  6916. Firing prefer*rvt*predict-no*H0
  6917. -->
  6918. Firing elaborate*copy-dir-to-output-link
  6919. -->
  6920. (I3 ^dir R +)
  6921. inner elaboration loop at bottom goal.
  6922. Retracting elaborate*copy-see-to-output-link
  6923. -->
  6924. (I3 ^see 1 +)
  6925. Retracting propose*predict-no
  6926. -->
  6927. (O1902 ^name predict-no +)
  6928. (S1 ^operator O1902 +)
  6929. Retracting propose*predict-yes
  6930. -->
  6931. (O1901 ^name predict-yes +)
  6932. (S1 ^operator O1901 +)
  6933. Retracting elaborate*reward*based*on*reward
  6934. -->
  6935. (R954 ^value 1 +)
  6936. (R1 ^reward R954 +)
  6937. Retracting elaborate*copy-dir-to-output-link
  6938. -->
  6939. (I3 ^dir U +)
  6940. Retracting rl*prefer*rvt*predict-no*H0*6
  6941. -->
  6942. (S1 ^operator O1902 = 0.9999999999999999)
  6943. Retracting rl*prefer*rvt*predict-yes*H0*5
  6944. -->
  6945. (S1 ^operator O1901 = 0.)
  6946. =>WM: (13336: S1 ^operator O1904 +)
  6947. =>WM: (13335: S1 ^operator O1903 +)
  6948. =>WM: (13334: I3 ^dir R)
  6949. =>WM: (13333: O1904 ^name predict-no)
  6950. =>WM: (13332: O1903 ^name predict-yes)
  6951. =>WM: (13331: R955 ^value 1)
  6952. =>WM: (13330: R1 ^reward R955)
  6953. =>WM: (13329: I3 ^see 0)
  6954. <=WM: (13320: S1 ^operator O1901 +)
  6955. <=WM: (13321: S1 ^operator O1902 +)
  6956. <=WM: (13322: S1 ^operator O1902)
  6957. <=WM: (13319: I3 ^dir U)
  6958. <=WM: (13315: R1 ^reward R954)
  6959. <=WM: (13300: I3 ^see 1)
  6960. <=WM: (13318: O1902 ^name predict-no)
  6961. <=WM: (13317: O1901 ^name predict-yes)
  6962. <=WM: (13316: R954 ^value 1)
  6963. --- Inner Elaboration Phase, active level 1 (S1) ---
  6964. Firing prefer*rvt*predict-yes*H0
  6965. -->
  6966. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  6967. -->
  6968. (S1 ^operator O1903 = 0.1398795999120246)
  6969. Firing rl*prefer*rvt*predict-yes*H0*3
  6970. -->
  6971. (S1 ^operator O1903 = 0.1844104702696336)
  6972. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6973. -->
  6974. Firing prefer*rvt*predict-no*H0
  6975. -->
  6976. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  6977. -->
  6978. (S1 ^operator O1904 = 0.5523833737960075)
  6979. Firing rl*prefer*rvt*predict-no*H0*4
  6980. -->
  6981. (S1 ^operator O1904 = 0.4476192676183378)
  6982. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6983. -->
  6984. inner elaboration loop at bottom goal.
  6985. Retracting rl*prefer*rvt*predict-no*H0*4
  6986. -->
  6987. (S1 ^operator O1902 = 0.4476192676183378)
  6988. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  6989. -->
  6990. (S1 ^operator O1902 = 0.5523833737960075)
  6991. Retracting rl*prefer*rvt*predict-yes*H0*3
  6992. -->
  6993. (S1 ^operator O1901 = 0.1844104702696336)
  6994. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  6995. -->
  6996. (S1 ^operator O1901 = 0.1398795999120246)
  6997. --- END Proposal Phase ---
  6998. --- Decision Phase ---
  6999. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7000. =>WM: (13337: S1 ^operator O1904)
  7001. 952: O: O1904 (predict-no)
  7002. --- END Decision Phase ---
  7003. --- Application Phase ---
  7004. --- Firing Productions (PE) For State At Depth 1 ---
  7005. --- Inner Elaboration Phase, active level 1 (S1) ---
  7006. Firing apply*operator
  7007. -->
  7008. (I3 ^predict-no N952 + :O )
  7009. Firing apply*operator*complete
  7010. -->
  7011. (I3 ^predict-no N951 - :O )
  7012. inner elaboration loop at bottom goal.
  7013. --- Change Working Memory (PE) ---
  7014. =>WM: (13338: I3 ^predict-no N952)
  7015. <=WM: (13324: N951 ^status complete)
  7016. <=WM: (13323: I3 ^predict-no N951)
  7017. --- Firing Productions (IE) For State At Depth 1 ---
  7018. --- Inner Elaboration Phase, active level 1 (S1) ---
  7019. Firing monitor*world
  7020. -->
  7021. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7022. --- Change Working Memory (IE) ---
  7023. --- END Application Phase ---
  7024. --- Output Phase ---
  7025. ENV: Agent did: predict-no for direction R in state State-B
  7026. In State-B moving R
  7027. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7028. predict error 0
  7029. dir: dir isU
  7030. --- END Output Phase ---
  7031. -/|--- Input Phase ---
  7032. =>WM: (13342: I2 ^dir U)
  7033. =>WM: (13341: I2 ^reward 1)
  7034. =>WM: (13340: I2 ^see 0)
  7035. =>WM: (13339: N952 ^status complete)
  7036. <=WM: (13327: I2 ^dir R)
  7037. <=WM: (13326: I2 ^reward 1)
  7038. <=WM: (13325: I2 ^see 0)
  7039. =>WM: (13343: I2 ^level-1 R0-root)
  7040. <=WM: (13328: I2 ^level-1 R1-root)
  7041. --- END Input Phase ---
  7042. --- Proposal Phase ---
  7043. --- Inner Elaboration Phase, active level 1 (S1) ---
  7044. Firing elaborate*copy-see-to-output-link
  7045. -->
  7046. (I3 ^see 0 +)
  7047. Firing elaborate*reward*based*on*reward
  7048. -->
  7049. (R956 ^value 1 +)
  7050. (R1 ^reward R956 +)
  7051. Firing propose*predict-yes
  7052. -->
  7053. (O1905 ^name predict-yes +)
  7054. (S1 ^operator O1905 +)
  7055. Firing propose*predict-no
  7056. -->
  7057. (O1906 ^name predict-no +)
  7058. (S1 ^operator O1906 +)
  7059. Firing rl*prefer*rvt*predict-no*H0*6
  7060. -->
  7061. (S1 ^operator O1904 = 0.9999999999999999)
  7062. Firing rl*prefer*rvt*predict-yes*H0*5
  7063. -->
  7064. (S1 ^operator O1903 = 0.)
  7065. Firing prefer*rvt*predict-yes*H0
  7066. -->
  7067. Firing prefer*rvt*predict-no*H0
  7068. -->
  7069. Firing elaborate*copy-dir-to-output-link
  7070. -->
  7071. (I3 ^dir U +)
  7072. inner elaboration loop at bottom goal.
  7073. Retracting elaborate*copy-see-to-output-link
  7074. -->
  7075. (I3 ^see 0 +)
  7076. Retracting propose*predict-no
  7077. -->
  7078. (O1904 ^name predict-no +)
  7079. (S1 ^operator O1904 +)
  7080. Retracting propose*predict-yes
  7081. -->
  7082. (O1903 ^name predict-yes +)
  7083. (S1 ^operator O1903 +)
  7084. Retracting elaborate*reward*based*on*reward
  7085. -->
  7086. (R955 ^value 1 +)
  7087. (R1 ^reward R955 +)
  7088. Retracting elaborate*copy-dir-to-output-link
  7089. -->
  7090. (I3 ^dir R +)
  7091. Retracting rl*prefer*rvt*predict-no*H0*4
  7092. -->
  7093. (S1 ^operator O1904 = 0.4476192676183378)
  7094. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7095. -->
  7096. (S1 ^operator O1904 = 0.5523833737960075)
  7097. Retracting rl*prefer*rvt*predict-yes*H0*3
  7098. -->
  7099. (S1 ^operator O1903 = 0.1844104702696336)
  7100. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7101. -->
  7102. (S1 ^operator O1903 = 0.1398795999120246)
  7103. =>WM: (13350: S1 ^operator O1906 +)
  7104. =>WM: (13349: S1 ^operator O1905 +)
  7105. =>WM: (13348: I3 ^dir U)
  7106. =>WM: (13347: O1906 ^name predict-no)
  7107. =>WM: (13346: O1905 ^name predict-yes)
  7108. =>WM: (13345: R956 ^value 1)
  7109. =>WM: (13344: R1 ^reward R956)
  7110. <=WM: (13335: S1 ^operator O1903 +)
  7111. <=WM: (13336: S1 ^operator O1904 +)
  7112. <=WM: (13337: S1 ^operator O1904)
  7113. <=WM: (13334: I3 ^dir R)
  7114. <=WM: (13330: R1 ^reward R955)
  7115. <=WM: (13333: O1904 ^name predict-no)
  7116. <=WM: (13332: O1903 ^name predict-yes)
  7117. <=WM: (13331: R955 ^value 1)
  7118. --- Inner Elaboration Phase, active level 1 (S1) ---
  7119. Firing prefer*rvt*predict-yes*H0
  7120. -->
  7121. Firing rl*prefer*rvt*predict-yes*H0*5
  7122. -->
  7123. (S1 ^operator O1905 = 0.)
  7124. Firing prefer*rvt*predict-no*H0
  7125. -->
  7126. Firing rl*prefer*rvt*predict-no*H0*6
  7127. -->
  7128. (S1 ^operator O1906 = 0.9999999999999999)
  7129. inner elaboration loop at bottom goal.
  7130. Retracting rl*prefer*rvt*predict-no*H0*6
  7131. -->
  7132. (S1 ^operator O1904 = 0.9999999999999999)
  7133. Retracting rl*prefer*rvt*predict-yes*H0*5
  7134. -->
  7135. (S1 ^operator O1903 = 0.)
  7136. --- END Proposal Phase ---
  7137. --- Decision Phase ---
  7138. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.925,0.069958)
  7139. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
  7140. =>WM: (13351: S1 ^operator O1906)
  7141. 953: O: O1906 (predict-no)
  7142. --- END Decision Phase ---
  7143. --- Application Phase ---
  7144. --- Firing Productions (PE) For State At Depth 1 ---
  7145. --- Inner Elaboration Phase, active level 1 (S1) ---
  7146. Firing apply*operator
  7147. -->
  7148. (I3 ^predict-no N953 + :O )
  7149. Firing apply*operator*complete
  7150. -->
  7151. (I3 ^predict-no N952 - :O )
  7152. inner elaboration loop at bottom goal.
  7153. --- Change Working Memory (PE) ---
  7154. =>WM: (13352: I3 ^predict-no N953)
  7155. <=WM: (13339: N952 ^status complete)
  7156. <=WM: (13338: I3 ^predict-no N952)
  7157. --- Firing Productions (IE) For State At Depth 1 ---
  7158. --- Inner Elaboration Phase, active level 1 (S1) ---
  7159. Firing monitor*world
  7160. -->
  7161. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7162. --- Change Working Memory (IE) ---
  7163. --- END Application Phase ---
  7164. --- Output Phase ---
  7165. ENV: Agent did: predict-no for direction U in state State-B
  7166. In State-B moving U
  7167. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7168. predict error 0
  7169. dir: dir isL
  7170. --- END Output Phase ---
  7171. \-/--- Input Phase ---
  7172. =>WM: (13356: I2 ^dir L)
  7173. =>WM: (13355: I2 ^reward 1)
  7174. =>WM: (13354: I2 ^see 0)
  7175. =>WM: (13353: N953 ^status complete)
  7176. <=WM: (13342: I2 ^dir U)
  7177. <=WM: (13341: I2 ^reward 1)
  7178. <=WM: (13340: I2 ^see 0)
  7179. =>WM: (13357: I2 ^level-1 R0-root)
  7180. <=WM: (13343: I2 ^level-1 R0-root)
  7181. --- END Input Phase ---
  7182. --- Proposal Phase ---
  7183. --- Inner Elaboration Phase, active level 1 (S1) ---
  7184. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  7185. -->
  7186. (S1 ^operator O1905 = 0.6104621686166466)
  7187. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  7188. -->
  7189. (S1 ^operator O1906 = 0.1063475139796038)
  7190. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7191. -->
  7192. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7193. -->
  7194. Firing elaborate*copy-see-to-output-link
  7195. -->
  7196. (I3 ^see 0 +)
  7197. Firing elaborate*reward*based*on*reward
  7198. -->
  7199. (R957 ^value 1 +)
  7200. (R1 ^reward R957 +)
  7201. Firing propose*predict-yes
  7202. -->
  7203. (O1907 ^name predict-yes +)
  7204. (S1 ^operator O1907 +)
  7205. Firing propose*predict-no
  7206. -->
  7207. (O1908 ^name predict-no +)
  7208. (S1 ^operator O1908 +)
  7209. Firing rl*prefer*rvt*predict-no*H0*2
  7210. -->
  7211. (S1 ^operator O1906 = 0.3873365065796835)
  7212. Firing rl*prefer*rvt*predict-yes*H0*1
  7213. -->
  7214. (S1 ^operator O1905 = 0.3895397770301633)
  7215. Firing prefer*rvt*predict-yes*H0
  7216. -->
  7217. Firing prefer*rvt*predict-no*H0
  7218. -->
  7219. Firing elaborate*copy-dir-to-output-link
  7220. -->
  7221. (I3 ^dir L +)
  7222. inner elaboration loop at bottom goal.
  7223. Retracting elaborate*copy-see-to-output-link
  7224. -->
  7225. (I3 ^see 0 +)
  7226. Retracting propose*predict-no
  7227. -->
  7228. (O1906 ^name predict-no +)
  7229. (S1 ^operator O1906 +)
  7230. Retracting propose*predict-yes
  7231. -->
  7232. (O1905 ^name predict-yes +)
  7233. (S1 ^operator O1905 +)
  7234. Retracting elaborate*reward*based*on*reward
  7235. -->
  7236. (R956 ^value 1 +)
  7237. (R1 ^reward R956 +)
  7238. Retracting elaborate*copy-dir-to-output-link
  7239. -->
  7240. (I3 ^dir U +)
  7241. Retracting rl*prefer*rvt*predict-no*H0*6
  7242. -->
  7243. (S1 ^operator O1906 = 0.9999999999999999)
  7244. Retracting rl*prefer*rvt*predict-yes*H0*5
  7245. -->
  7246. (S1 ^operator O1905 = 0.)
  7247. =>WM: (13364: S1 ^operator O1908 +)
  7248. =>WM: (13363: S1 ^operator O1907 +)
  7249. =>WM: (13362: I3 ^dir L)
  7250. =>WM: (13361: O1908 ^name predict-no)
  7251. =>WM: (13360: O1907 ^name predict-yes)
  7252. =>WM: (13359: R957 ^value 1)
  7253. =>WM: (13358: R1 ^reward R957)
  7254. <=WM: (13349: S1 ^operator O1905 +)
  7255. <=WM: (13350: S1 ^operator O1906 +)
  7256. <=WM: (13351: S1 ^operator O1906)
  7257. <=WM: (13348: I3 ^dir U)
  7258. <=WM: (13344: R1 ^reward R956)
  7259. <=WM: (13347: O1906 ^name predict-no)
  7260. <=WM: (13346: O1905 ^name predict-yes)
  7261. <=WM: (13345: R956 ^value 1)
  7262. --- Inner Elaboration Phase, active level 1 (S1) ---
  7263. Firing prefer*rvt*predict-yes*H0
  7264. -->
  7265. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  7266. -->
  7267. (S1 ^operator O1907 = 0.6104621686166466)
  7268. Firing rl*prefer*rvt*predict-yes*H0*1
  7269. -->
  7270. (S1 ^operator O1907 = 0.3895397770301633)
  7271. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7272. -->
  7273. Firing prefer*rvt*predict-no*H0
  7274. -->
  7275. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  7276. -->
  7277. (S1 ^operator O1908 = 0.1063475139796038)
  7278. Firing rl*prefer*rvt*predict-no*H0*2
  7279. -->
  7280. (S1 ^operator O1908 = 0.3873365065796835)
  7281. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7282. -->
  7283. inner elaboration loop at bottom goal.
  7284. Retracting rl*prefer*rvt*predict-no*H0*2
  7285. -->
  7286. (S1 ^operator O1906 = 0.3873365065796835)
  7287. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  7288. -->
  7289. (S1 ^operator O1906 = 0.1063475139796038)
  7290. Retracting rl*prefer*rvt*predict-yes*H0*1
  7291. -->
  7292. (S1 ^operator O1905 = 0.3895397770301633)
  7293. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  7294. -->
  7295. (S1 ^operator O1905 = 0.6104621686166466)
  7296. --- END Proposal Phase ---
  7297. --- Decision Phase ---
  7298. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7299. =>WM: (13365: S1 ^operator O1907)
  7300. 954: O: O1907 (predict-yes)
  7301. --- END Decision Phase ---
  7302. --- Application Phase ---
  7303. --- Firing Productions (PE) For State At Depth 1 ---
  7304. --- Inner Elaboration Phase, active level 1 (S1) ---
  7305. Firing apply*operator
  7306. -->
  7307. (I3 ^predict-yes N954 + :O )
  7308. Firing apply*operator*complete
  7309. -->
  7310. (I3 ^predict-no N953 - :O )
  7311. inner elaboration loop at bottom goal.
  7312. --- Change Working Memory (PE) ---
  7313. =>WM: (13366: I3 ^predict-yes N954)
  7314. <=WM: (13353: N953 ^status complete)
  7315. <=WM: (13352: I3 ^predict-no N953)
  7316. --- Firing Productions (IE) For State At Depth 1 ---
  7317. --- Inner Elaboration Phase, active level 1 (S1) ---
  7318. Firing monitor*world
  7319. -->
  7320. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7321. --- Change Working Memory (IE) ---
  7322. --- END Application Phase ---
  7323. --- Output Phase ---
  7324. ENV: Agent did: predict-yes for direction L in state State-B
  7325. In State-B moving L
  7326. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7327. predict error 0
  7328. dir: dir isU
  7329. --- END Output Phase ---
  7330. |\--- Input Phase ---
  7331. =>WM: (13370: I2 ^dir U)
  7332. =>WM: (13369: I2 ^reward 1)
  7333. =>WM: (13368: I2 ^see 1)
  7334. =>WM: (13367: N954 ^status complete)
  7335. <=WM: (13356: I2 ^dir L)
  7336. <=WM: (13355: I2 ^reward 1)
  7337. <=WM: (13354: I2 ^see 0)
  7338. =>WM: (13371: I2 ^level-1 L1-root)
  7339. <=WM: (13357: I2 ^level-1 R0-root)
  7340. --- END Input Phase ---
  7341. --- Proposal Phase ---
  7342. --- Inner Elaboration Phase, active level 1 (S1) ---
  7343. Firing elaborate*copy-see-to-output-link
  7344. -->
  7345. (I3 ^see 1 +)
  7346. Firing elaborate*reward*based*on*reward
  7347. -->
  7348. (R958 ^value 1 +)
  7349. (R1 ^reward R958 +)
  7350. Firing propose*predict-yes
  7351. -->
  7352. (O1909 ^name predict-yes +)
  7353. (S1 ^operator O1909 +)
  7354. Firing propose*predict-no
  7355. -->
  7356. (O1910 ^name predict-no +)
  7357. (S1 ^operator O1910 +)
  7358. Firing rl*prefer*rvt*predict-no*H0*6
  7359. -->
  7360. (S1 ^operator O1908 = 0.9999999999999999)
  7361. Firing rl*prefer*rvt*predict-yes*H0*5
  7362. -->
  7363. (S1 ^operator O1907 = 0.)
  7364. Firing prefer*rvt*predict-yes*H0
  7365. -->
  7366. Firing prefer*rvt*predict-no*H0
  7367. -->
  7368. Firing elaborate*copy-dir-to-output-link
  7369. -->
  7370. (I3 ^dir U +)
  7371. inner elaboration loop at bottom goal.
  7372. Retracting elaborate*copy-see-to-output-link
  7373. -->
  7374. (I3 ^see 0 +)
  7375. Retracting propose*predict-no
  7376. -->
  7377. (O1908 ^name predict-no +)
  7378. (S1 ^operator O1908 +)
  7379. Retracting propose*predict-yes
  7380. -->
  7381. (O1907 ^name predict-yes +)
  7382. (S1 ^operator O1907 +)
  7383. Retracting elaborate*reward*based*on*reward
  7384. -->
  7385. (R957 ^value 1 +)
  7386. (R1 ^reward R957 +)
  7387. Retracting elaborate*copy-dir-to-output-link
  7388. -->
  7389. (I3 ^dir L +)
  7390. Retracting rl*prefer*rvt*predict-no*H0*2
  7391. -->
  7392. (S1 ^operator O1908 = 0.3873365065796835)
  7393. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  7394. -->
  7395. (S1 ^operator O1908 = 0.1063475139796038)
  7396. Retracting rl*prefer*rvt*predict-yes*H0*1
  7397. -->
  7398. (S1 ^operator O1907 = 0.3895397770301633)
  7399. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  7400. -->
  7401. (S1 ^operator O1907 = 0.6104621686166466)
  7402. =>WM: (13379: S1 ^operator O1910 +)
  7403. =>WM: (13378: S1 ^operator O1909 +)
  7404. =>WM: (13377: I3 ^dir U)
  7405. =>WM: (13376: O1910 ^name predict-no)
  7406. =>WM: (13375: O1909 ^name predict-yes)
  7407. =>WM: (13374: R958 ^value 1)
  7408. =>WM: (13373: R1 ^reward R958)
  7409. =>WM: (13372: I3 ^see 1)
  7410. <=WM: (13363: S1 ^operator O1907 +)
  7411. <=WM: (13365: S1 ^operator O1907)
  7412. <=WM: (13364: S1 ^operator O1908 +)
  7413. <=WM: (13362: I3 ^dir L)
  7414. <=WM: (13358: R1 ^reward R957)
  7415. <=WM: (13329: I3 ^see 0)
  7416. <=WM: (13361: O1908 ^name predict-no)
  7417. <=WM: (13360: O1907 ^name predict-yes)
  7418. <=WM: (13359: R957 ^value 1)
  7419. --- Inner Elaboration Phase, active level 1 (S1) ---
  7420. Firing prefer*rvt*predict-yes*H0
  7421. -->
  7422. Firing rl*prefer*rvt*predict-yes*H0*5
  7423. -->
  7424. (S1 ^operator O1909 = 0.)
  7425. Firing prefer*rvt*predict-no*H0
  7426. -->
  7427. Firing rl*prefer*rvt*predict-no*H0*6
  7428. -->
  7429. (S1 ^operator O1910 = 0.9999999999999999)
  7430. inner elaboration loop at bottom goal.
  7431. Retracting rl*prefer*rvt*predict-no*H0*6
  7432. -->
  7433. (S1 ^operator O1908 = 0.9999999999999999)
  7434. Retracting rl*prefer*rvt*predict-yes*H0*5
  7435. -->
  7436. (S1 ^operator O1907 = 0.)
  7437. --- END Proposal Phase ---
  7438. --- Decision Phase ---
  7439. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.886792,0.101027)
  7440. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610462(R,m,v=1,1,0)
  7441. =>WM: (13380: S1 ^operator O1910)
  7442. 955: O: O1910 (predict-no)
  7443. --- END Decision Phase ---
  7444. --- Application Phase ---
  7445. --- Firing Productions (PE) For State At Depth 1 ---
  7446. --- Inner Elaboration Phase, active level 1 (S1) ---
  7447. Firing apply*operator
  7448. -->
  7449. (I3 ^predict-no N955 + :O )
  7450. Firing apply*operator*complete
  7451. -->
  7452. (I3 ^predict-yes N954 - :O )
  7453. inner elaboration loop at bottom goal.
  7454. --- Change Working Memory (PE) ---
  7455. =>WM: (13381: I3 ^predict-no N955)
  7456. <=WM: (13367: N954 ^status complete)
  7457. <=WM: (13366: I3 ^predict-yes N954)
  7458. --- Firing Productions (IE) For State At Depth 1 ---
  7459. --- Inner Elaboration Phase, active level 1 (S1) ---
  7460. Firing monitor*world
  7461. -->
  7462. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7463. --- Change Working Memory (IE) ---
  7464. --- END Application Phase ---
  7465. --- Output Phase ---
  7466. ENV: Agent did: predict-no for direction U in state State-A
  7467. In State-A moving U
  7468. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7469. predict error 0
  7470. dir: dir isU
  7471. --- END Output Phase ---
  7472. -/--- Input Phase ---
  7473. =>WM: (13385: I2 ^dir U)
  7474. =>WM: (13384: I2 ^reward 1)
  7475. =>WM: (13383: I2 ^see 0)
  7476. =>WM: (13382: N955 ^status complete)
  7477. <=WM: (13370: I2 ^dir U)
  7478. <=WM: (13369: I2 ^reward 1)
  7479. <=WM: (13368: I2 ^see 1)
  7480. =>WM: (13386: I2 ^level-1 L1-root)
  7481. <=WM: (13371: I2 ^level-1 L1-root)
  7482. --- END Input Phase ---
  7483. --- Proposal Phase ---
  7484. --- Inner Elaboration Phase, active level 1 (S1) ---
  7485. Firing elaborate*copy-see-to-output-link
  7486. -->
  7487. (I3 ^see 0 +)
  7488. Firing elaborate*reward*based*on*reward
  7489. -->
  7490. (R959 ^value 1 +)
  7491. (R1 ^reward R959 +)
  7492. Firing propose*predict-yes
  7493. -->
  7494. (O1911 ^name predict-yes +)
  7495. (S1 ^operator O1911 +)
  7496. Firing propose*predict-no
  7497. -->
  7498. (O1912 ^name predict-no +)
  7499. (S1 ^operator O1912 +)
  7500. Firing rl*prefer*rvt*predict-no*H0*6
  7501. -->
  7502. (S1 ^operator O1910 = 0.9999999999999999)
  7503. Firing rl*prefer*rvt*predict-yes*H0*5
  7504. -->
  7505. (S1 ^operator O1909 = 0.)
  7506. Firing prefer*rvt*predict-yes*H0
  7507. -->
  7508. Firing prefer*rvt*predict-no*H0
  7509. -->
  7510. Firing elaborate*copy-dir-to-output-link
  7511. -->
  7512. (I3 ^dir U +)
  7513. inner elaboration loop at bottom goal.
  7514. Retracting elaborate*copy-see-to-output-link
  7515. -->
  7516. (I3 ^see 1 +)
  7517. Retracting propose*predict-no
  7518. -->
  7519. (O1910 ^name predict-no +)
  7520. (S1 ^operator O1910 +)
  7521. Retracting propose*predict-yes
  7522. -->
  7523. (O1909 ^name predict-yes +)
  7524. (S1 ^operator O1909 +)
  7525. Retracting elaborate*reward*based*on*reward
  7526. -->
  7527. (R958 ^value 1 +)
  7528. (R1 ^reward R958 +)
  7529. Retracting elaborate*copy-dir-to-output-link
  7530. -->
  7531. (I3 ^dir U +)
  7532. Retracting rl*prefer*rvt*predict-no*H0*6
  7533. -->
  7534. (S1 ^operator O1910 = 0.9999999999999999)
  7535. Retracting rl*prefer*rvt*predict-yes*H0*5
  7536. -->
  7537. (S1 ^operator O1909 = 0.)
  7538. =>WM: (13393: S1 ^operator O1912 +)
  7539. =>WM: (13392: S1 ^operator O1911 +)
  7540. =>WM: (13391: O1912 ^name predict-no)
  7541. =>WM: (13390: O1911 ^name predict-yes)
  7542. =>WM: (13389: R959 ^value 1)
  7543. =>WM: (13388: R1 ^reward R959)
  7544. =>WM: (13387: I3 ^see 0)
  7545. <=WM: (13378: S1 ^operator O1909 +)
  7546. <=WM: (13379: S1 ^operator O1910 +)
  7547. <=WM: (13380: S1 ^operator O1910)
  7548. <=WM: (13373: R1 ^reward R958)
  7549. <=WM: (13372: I3 ^see 1)
  7550. <=WM: (13376: O1910 ^name predict-no)
  7551. <=WM: (13375: O1909 ^name predict-yes)
  7552. <=WM: (13374: R958 ^value 1)
  7553. --- Inner Elaboration Phase, active level 1 (S1) ---
  7554. Firing prefer*rvt*predict-yes*H0
  7555. -->
  7556. Firing rl*prefer*rvt*predict-yes*H0*5
  7557. -->
  7558. (S1 ^operator O1911 = 0.)
  7559. Firing prefer*rvt*predict-no*H0
  7560. -->
  7561. Firing rl*prefer*rvt*predict-no*H0*6
  7562. -->
  7563. (S1 ^operator O1912 = 0.9999999999999999)
  7564. inner elaboration loop at bottom goal.
  7565. Retracting rl*prefer*rvt*predict-no*H0*6
  7566. -->
  7567. (S1 ^operator O1910 = 0.9999999999999999)
  7568. Retracting rl*prefer*rvt*predict-yes*H0*5
  7569. -->
  7570. (S1 ^operator O1909 = 0.)
  7571. --- END Proposal Phase ---
  7572. --- Decision Phase ---
  7573. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7574. =>WM: (13394: S1 ^operator O1912)
  7575. 956: O: O1912 (predict-no)
  7576. --- END Decision Phase ---
  7577. --- Application Phase ---
  7578. --- Firing Productions (PE) For State At Depth 1 ---
  7579. --- Inner Elaboration Phase, active level 1 (S1) ---
  7580. Firing apply*operator
  7581. -->
  7582. (I3 ^predict-no N956 + :O )
  7583. Firing apply*operator*complete
  7584. -->
  7585. (I3 ^predict-no N955 - :O )
  7586. inner elaboration loop at bottom goal.
  7587. --- Change Working Memory (PE) ---
  7588. =>WM: (13395: I3 ^predict-no N956)
  7589. <=WM: (13382: N955 ^status complete)
  7590. <=WM: (13381: I3 ^predict-no N955)
  7591. --- Firing Productions (IE) For State At Depth 1 ---
  7592. --- Inner Elaboration Phase, active level 1 (S1) ---
  7593. Firing monitor*world
  7594. -->
  7595. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7596. --- Change Working Memory (IE) ---
  7597. --- END Application Phase ---
  7598. --- Output Phase ---
  7599. ENV: Agent did: predict-no for direction U in state State-A
  7600. In State-A moving U
  7601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7602. predict error 0
  7603. dir: dir isL
  7604. --- END Output Phase ---
  7605. |\---- Input Phase ---
  7606. =>WM: (13399: I2 ^dir L)
  7607. =>WM: (13398: I2 ^reward 1)
  7608. =>WM: (13397: I2 ^see 0)
  7609. =>WM: (13396: N956 ^status complete)
  7610. <=WM: (13385: I2 ^dir U)
  7611. <=WM: (13384: I2 ^reward 1)
  7612. <=WM: (13383: I2 ^see 0)
  7613. =>WM: (13400: I2 ^level-1 L1-root)
  7614. <=WM: (13386: I2 ^level-1 L1-root)
  7615. --- END Input Phase ---
  7616. --- Proposal Phase ---
  7617. --- Inner Elaboration Phase, active level 1 (S1) ---
  7618. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  7619. -->
  7620. (S1 ^operator O1912 = 0.6126622914849755)
  7621. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  7622. -->
  7623. (S1 ^operator O1911 = -0.02274740735326741)
  7624. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7625. -->
  7626. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7627. -->
  7628. Firing elaborate*copy-see-to-output-link
  7629. -->
  7630. (I3 ^see 0 +)
  7631. Firing elaborate*reward*based*on*reward
  7632. -->
  7633. (R960 ^value 1 +)
  7634. (R1 ^reward R960 +)
  7635. Firing propose*predict-yes
  7636. -->
  7637. (O1913 ^name predict-yes +)
  7638. (S1 ^operator O1913 +)
  7639. Firing propose*predict-no
  7640. -->
  7641. (O1914 ^name predict-no +)
  7642. (S1 ^operator O1914 +)
  7643. Firing rl*prefer*rvt*predict-no*H0*2
  7644. -->
  7645. (S1 ^operator O1912 = 0.3873365065796835)
  7646. Firing rl*prefer*rvt*predict-yes*H0*1
  7647. -->
  7648. (S1 ^operator O1911 = 0.3895394851831418)
  7649. Firing prefer*rvt*predict-yes*H0
  7650. -->
  7651. Firing prefer*rvt*predict-no*H0
  7652. -->
  7653. Firing elaborate*copy-dir-to-output-link
  7654. -->
  7655. (I3 ^dir L +)
  7656. inner elaboration loop at bottom goal.
  7657. Retracting elaborate*copy-see-to-output-link
  7658. -->
  7659. (I3 ^see 0 +)
  7660. Retracting propose*predict-no
  7661. -->
  7662. (O1912 ^name predict-no +)
  7663. (S1 ^operator O1912 +)
  7664. Retracting propose*predict-yes
  7665. -->
  7666. (O1911 ^name predict-yes +)
  7667. (S1 ^operator O1911 +)
  7668. Retracting elaborate*reward*based*on*reward
  7669. -->
  7670. (R959 ^value 1 +)
  7671. (R1 ^reward R959 +)
  7672. Retracting elaborate*copy-dir-to-output-link
  7673. -->
  7674. (I3 ^dir U +)
  7675. Retracting rl*prefer*rvt*predict-no*H0*6
  7676. -->
  7677. (S1 ^operator O1912 = 0.9999999999999999)
  7678. Retracting rl*prefer*rvt*predict-yes*H0*5
  7679. -->
  7680. (S1 ^operator O1911 = 0.)
  7681. =>WM: (13407: S1 ^operator O1914 +)
  7682. =>WM: (13406: S1 ^operator O1913 +)
  7683. =>WM: (13405: I3 ^dir L)
  7684. =>WM: (13404: O1914 ^name predict-no)
  7685. =>WM: (13403: O1913 ^name predict-yes)
  7686. =>WM: (13402: R960 ^value 1)
  7687. =>WM: (13401: R1 ^reward R960)
  7688. <=WM: (13392: S1 ^operator O1911 +)
  7689. <=WM: (13393: S1 ^operator O1912 +)
  7690. <=WM: (13394: S1 ^operator O1912)
  7691. <=WM: (13377: I3 ^dir U)
  7692. <=WM: (13388: R1 ^reward R959)
  7693. <=WM: (13391: O1912 ^name predict-no)
  7694. <=WM: (13390: O1911 ^name predict-yes)
  7695. <=WM: (13389: R959 ^value 1)
  7696. --- Inner Elaboration Phase, active level 1 (S1) ---
  7697. Firing prefer*rvt*predict-yes*H0
  7698. -->
  7699. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  7700. -->
  7701. (S1 ^operator O1913 = -0.02274740735326741)
  7702. Firing rl*prefer*rvt*predict-yes*H0*1
  7703. -->
  7704. (S1 ^operator O1913 = 0.3895394851831418)
  7705. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  7706. -->
  7707. Firing prefer*rvt*predict-no*H0
  7708. -->
  7709. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  7710. -->
  7711. (S1 ^operator O1914 = 0.6126622914849755)
  7712. Firing rl*prefer*rvt*predict-no*H0*2
  7713. -->
  7714. (S1 ^operator O1914 = 0.3873365065796835)
  7715. Firing prefer*rvt*predict-no*H0*2*v1*H1
  7716. -->
  7717. inner elaboration loop at bottom goal.
  7718. Retracting rl*prefer*rvt*predict-no*H0*2
  7719. -->
  7720. (S1 ^operator O1912 = 0.3873365065796835)
  7721. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  7722. -->
  7723. (S1 ^operator O1912 = 0.6126622914849755)
  7724. Retracting rl*prefer*rvt*predict-yes*H0*1
  7725. -->
  7726. (S1 ^operator O1911 = 0.3895394851831418)
  7727. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  7728. -->
  7729. (S1 ^operator O1911 = -0.02274740735326741)
  7730. --- END Proposal Phase ---
  7731. --- Decision Phase ---
  7732. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7733. =>WM: (13408: S1 ^operator O1914)
  7734. 957: O: O1914 (predict-no)
  7735. --- END Decision Phase ---
  7736. --- Application Phase ---
  7737. --- Firing Productions (PE) For State At Depth 1 ---
  7738. --- Inner Elaboration Phase, active level 1 (S1) ---
  7739. Firing apply*operator
  7740. -->
  7741. (I3 ^predict-no N957 + :O )
  7742. Firing apply*operator*complete
  7743. -->
  7744. (I3 ^predict-no N956 - :O )
  7745. inner elaboration loop at bottom goal.
  7746. --- Change Working Memory (PE) ---
  7747. =>WM: (13409: I3 ^predict-no N957)
  7748. <=WM: (13396: N956 ^status complete)
  7749. <=WM: (13395: I3 ^predict-no N956)
  7750. --- Firing Productions (IE) For State At Depth 1 ---
  7751. --- Inner Elaboration Phase, active level 1 (S1) ---
  7752. Firing monitor*world
  7753. -->
  7754. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7755. --- Change Working Memory (IE) ---
  7756. --- END Application Phase ---
  7757. --- Output Phase ---
  7758. ENV: Agent did: predict-no for direction L in state State-A
  7759. In State-A moving L
  7760. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7761. predict error 0
  7762. dir: dir isU
  7763. --- END Output Phase ---
  7764. --- Input Phase ---
  7765. =>WM: (13413: I2 ^dir U)
  7766. =>WM: (13412: I2 ^reward 1)
  7767. =>WM: (13411: I2 ^see 0)
  7768. =>WM: (13410: N957 ^status complete)
  7769. <=WM: (13399: I2 ^dir L)
  7770. <=WM: (13398: I2 ^reward 1)
  7771. <=WM: (13397: I2 ^see 0)
  7772. =>WM: (13414: I2 ^level-1 L0-root)
  7773. <=WM: (13400: I2 ^level-1 L1-root)
  7774. --- END Input Phase ---
  7775. --- Proposal Phase ---
  7776. --- Inner Elaboration Phase, active level 1 (S1) ---
  7777. Firing elaborate*copy-see-to-output-link
  7778. -->
  7779. (I3 ^see 0 +)
  7780. Firing elaborate*reward*based*on*reward
  7781. -->
  7782. (R961 ^value 1 +)
  7783. (R1 ^reward R961 +)
  7784. Firing propose*predict-yes
  7785. -->
  7786. (O1915 ^name predict-yes +)
  7787. (S1 ^operator O1915 +)
  7788. Firing propose*predict-no
  7789. -->
  7790. (O1916 ^name predict-no +)
  7791. (S1 ^operator O1916 +)
  7792. Firing rl*prefer*rvt*predict-no*H0*6
  7793. -->
  7794. (S1 ^operator O1914 = 0.9999999999999999)
  7795. Firing rl*prefer*rvt*predict-yes*H0*5
  7796. -->
  7797. (S1 ^operator O1913 = 0.)
  7798. Firing prefer*rvt*predict-yes*H0
  7799. -->
  7800. Firing prefer*rvt*predict-no*H0
  7801. -->
  7802. Firing elaborate*copy-dir-to-output-link
  7803. -->
  7804. (I3 ^dir U +)
  7805. inner elaboration loop at bottom goal.
  7806. Retracting elaborate*copy-see-to-output-link
  7807. -->
  7808. (I3 ^see 0 +)
  7809. Retracting propose*predict-no
  7810. -->
  7811. (O1914 ^name predict-no +)
  7812. (S1 ^operator O1914 +)
  7813. Retracting propose*predict-yes
  7814. -->
  7815. (O1913 ^name predict-yes +)
  7816. (S1 ^operator O1913 +)
  7817. Retracting elaborate*reward*based*on*reward
  7818. -->
  7819. (R960 ^value 1 +)
  7820. (R1 ^reward R960 +)
  7821. Retracting elaborate*copy-dir-to-output-link
  7822. -->
  7823. (I3 ^dir L +)
  7824. Retracting rl*prefer*rvt*predict-no*H0*2
  7825. -->
  7826. (S1 ^operator O1914 = 0.3873365065796835)
  7827. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  7828. -->
  7829. (S1 ^operator O1914 = 0.6126622914849755)
  7830. Retracting rl*prefer*rvt*predict-yes*H0*1
  7831. -->
  7832. (S1 ^operator O1913 = 0.3895394851831418)
  7833. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  7834. -->
  7835. (S1 ^operator O1913 = -0.02274740735326741)
  7836. =>WM: (13421: S1 ^operator O1916 +)
  7837. =>WM: (13420: S1 ^operator O1915 +)
  7838. =>WM: (13419: I3 ^dir U)
  7839. =>WM: (13418: O1916 ^name predict-no)
  7840. =>WM: (13417: O1915 ^name predict-yes)
  7841. =>WM: (13416: R961 ^value 1)
  7842. =>WM: (13415: R1 ^reward R961)
  7843. <=WM: (13406: S1 ^operator O1913 +)
  7844. <=WM: (13407: S1 ^operator O1914 +)
  7845. <=WM: (13408: S1 ^operator O1914)
  7846. <=WM: (13405: I3 ^dir L)
  7847. <=WM: (13401: R1 ^reward R960)
  7848. <=WM: (13404: O1914 ^name predict-no)
  7849. <=WM: (13403: O1913 ^name predict-yes)
  7850. <=WM: (13402: R960 ^value 1)
  7851. --- Inner Elaboration Phase, active level 1 (S1) ---
  7852. Firing prefer*rvt*predict-yes*H0
  7853. -->
  7854. Firing rl*prefer*rvt*predict-yes*H0*5
  7855. -->
  7856. (S1 ^operator O1915 = 0.)
  7857. Firing prefer*rvt*predict-no*H0
  7858. -->
  7859. Firing rl*prefer*rvt*predict-no*H0*6
  7860. -->
  7861. (S1 ^operator O1916 = 0.9999999999999999)
  7862. inner elaboration loop at bottom goal.
  7863. Retracting rl*prefer*rvt*predict-no*H0*6
  7864. -->
  7865. (S1 ^operator O1914 = 0.9999999999999999)
  7866. Retracting rl*prefer*rvt*predict-yes*H0*5
  7867. -->
  7868. (S1 ^operator O1913 = 0.)
  7869. --- END Proposal Phase ---
  7870. --- Decision Phase ---
  7871. RL update rl*prefer*rvt*predict-no*H0*2 0.71908 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.930233,0.0652795)
  7872. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612662 -> 0.280918 0.331744 0.612662(R,m,v=1,1,0)
  7873. =>WM: (13422: S1 ^operator O1916)
  7874. 958: O: O1916 (predict-no)
  7875. --- END Decision Phase ---
  7876. --- Application Phase ---
  7877. --- Firing Productions (PE) For State At Depth 1 ---
  7878. --- Inner Elaboration Phase, active level 1 (S1) ---
  7879. Firing apply*operator
  7880. -->
  7881. (I3 ^predict-no N958 + :O )
  7882. Firing apply*operator*complete
  7883. -->
  7884. (I3 ^predict-no N957 - :O )
  7885. inner elaboration loop at bottom goal.
  7886. --- Change Working Memory (PE) ---
  7887. =>WM: (13423: I3 ^predict-no N958)
  7888. <=WM: (13410: N957 ^status complete)
  7889. <=WM: (13409: I3 ^predict-no N957)
  7890. --- Firing Productions (IE) For State At Depth 1 ---
  7891. --- Inner Elaboration Phase, active level 1 (S1) ---
  7892. Firing monitor*world
  7893. -->
  7894. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7895. --- Change Working Memory (IE) ---
  7896. --- END Application Phase ---
  7897. --- Output Phase ---
  7898. ENV: Agent did: predict-no for direction U in state State-A
  7899. In State-A moving U
  7900. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7901. predict error 0
  7902. dir: dir isR
  7903. --- END Output Phase ---
  7904. --- Input Phase ---
  7905. =>WM: (13427: I2 ^dir R)
  7906. =>WM: (13426: I2 ^reward 1)
  7907. =>WM: (13425: I2 ^see 0)
  7908. =>WM: (13424: N958 ^status complete)
  7909. <=WM: (13413: I2 ^dir U)
  7910. <=WM: (13412: I2 ^reward 1)
  7911. <=WM: (13411: I2 ^see 0)
  7912. =>WM: (13428: I2 ^level-1 L0-root)
  7913. <=WM: (13414: I2 ^level-1 L0-root)
  7914. --- END Input Phase ---
  7915. --- Proposal Phase ---
  7916. --- Inner Elaboration Phase, active level 1 (S1) ---
  7917. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  7918. -->
  7919. (S1 ^operator O1915 = 0.8155985324859676)
  7920. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  7921. -->
  7922. (S1 ^operator O1916 = -0.00558448899823713)
  7923. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7924. -->
  7925. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7926. -->
  7927. Firing elaborate*copy-see-to-output-link
  7928. -->
  7929. (I3 ^see 0 +)
  7930. Firing elaborate*reward*based*on*reward
  7931. -->
  7932. (R962 ^value 1 +)
  7933. (R1 ^reward R962 +)
  7934. Firing propose*predict-yes
  7935. -->
  7936. (O1917 ^name predict-yes +)
  7937. (S1 ^operator O1917 +)
  7938. Firing propose*predict-no
  7939. -->
  7940. (O1918 ^name predict-no +)
  7941. (S1 ^operator O1918 +)
  7942. Firing rl*prefer*rvt*predict-no*H0*4
  7943. -->
  7944. (S1 ^operator O1916 = 0.4476188714061859)
  7945. Firing rl*prefer*rvt*predict-yes*H0*3
  7946. -->
  7947. (S1 ^operator O1915 = 0.1844104702696336)
  7948. Firing prefer*rvt*predict-yes*H0
  7949. -->
  7950. Firing prefer*rvt*predict-no*H0
  7951. -->
  7952. Firing elaborate*copy-dir-to-output-link
  7953. -->
  7954. (I3 ^dir R +)
  7955. inner elaboration loop at bottom goal.
  7956. Retracting elaborate*copy-see-to-output-link
  7957. -->
  7958. (I3 ^see 0 +)
  7959. Retracting propose*predict-no
  7960. -->
  7961. (O1916 ^name predict-no +)
  7962. (S1 ^operator O1916 +)
  7963. Retracting propose*predict-yes
  7964. -->
  7965. (O1915 ^name predict-yes +)
  7966. (S1 ^operator O1915 +)
  7967. Retracting elaborate*reward*based*on*reward
  7968. -->
  7969. (R961 ^value 1 +)
  7970. (R1 ^reward R961 +)
  7971. Retracting elaborate*copy-dir-to-output-link
  7972. -->
  7973. (I3 ^dir U +)
  7974. Retracting rl*prefer*rvt*predict-no*H0*6
  7975. -->
  7976. (S1 ^operator O1916 = 0.9999999999999999)
  7977. Retracting rl*prefer*rvt*predict-yes*H0*5
  7978. -->
  7979. (S1 ^operator O1915 = 0.)
  7980. =>WM: (13435: S1 ^operator O1918 +)
  7981. =>WM: (13434: S1 ^operator O1917 +)
  7982. =>WM: (13433: I3 ^dir R)
  7983. =>WM: (13432: O1918 ^name predict-no)
  7984. =>WM: (13431: O1917 ^name predict-yes)
  7985. =>WM: (13430: R962 ^value 1)
  7986. =>WM: (13429: R1 ^reward R962)
  7987. <=WM: (13420: S1 ^operator O1915 +)
  7988. <=WM: (13421: S1 ^operator O1916 +)
  7989. <=WM: (13422: S1 ^operator O1916)
  7990. <=WM: (13419: I3 ^dir U)
  7991. <=WM: (13415: R1 ^reward R961)
  7992. <=WM: (13418: O1916 ^name predict-no)
  7993. <=WM: (13417: O1915 ^name predict-yes)
  7994. <=WM: (13416: R961 ^value 1)
  7995. --- Inner Elaboration Phase, active level 1 (S1) ---
  7996. Firing prefer*rvt*predict-yes*H0
  7997. -->
  7998. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  7999. -->
  8000. (S1 ^operator O1917 = 0.8155985324859676)
  8001. Firing rl*prefer*rvt*predict-yes*H0*3
  8002. -->
  8003. (S1 ^operator O1917 = 0.1844104702696336)
  8004. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8005. -->
  8006. Firing prefer*rvt*predict-no*H0
  8007. -->
  8008. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8009. -->
  8010. (S1 ^operator O1918 = -0.00558448899823713)
  8011. Firing rl*prefer*rvt*predict-no*H0*4
  8012. -->
  8013. (S1 ^operator O1918 = 0.4476188714061859)
  8014. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8015. -->
  8016. inner elaboration loop at bottom goal.
  8017. Retracting rl*prefer*rvt*predict-no*H0*4
  8018. -->
  8019. (S1 ^operator O1916 = 0.4476188714061859)
  8020. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8021. -->
  8022. (S1 ^operator O1916 = -0.00558448899823713)
  8023. Retracting rl*prefer*rvt*predict-yes*H0*3
  8024. -->
  8025. (S1 ^operator O1915 = 0.1844104702696336)
  8026. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8027. -->
  8028. (S1 ^operator O1915 = 0.8155985324859676)
  8029. --- END Proposal Phase ---
  8030. --- Decision Phase ---
  8031. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8032. =>WM: (13436: S1 ^operator O1917)
  8033. 959: O: O1917 (predict-yes)
  8034. --- END Decision Phase ---
  8035. --- Application Phase ---
  8036. --- Firing Productions (PE) For State At Depth 1 ---
  8037. --- Inner Elaboration Phase, active level 1 (S1) ---
  8038. Firing apply*operator
  8039. -->
  8040. (I3 ^predict-yes N959 + :O )
  8041. Firing apply*operator*complete
  8042. -->
  8043. (I3 ^predict-no N958 - :O )
  8044. inner elaboration loop at bottom goal.
  8045. --- Change Working Memory (PE) ---
  8046. =>WM: (13437: I3 ^predict-yes N959)
  8047. <=WM: (13424: N958 ^status complete)
  8048. <=WM: (13423: I3 ^predict-no N958)
  8049. --- Firing Productions (IE) For State At Depth 1 ---
  8050. --- Inner Elaboration Phase, active level 1 (S1) ---
  8051. Firing monitor*world
  8052. -->
  8053. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8054. --- Change Working Memory (IE) ---
  8055. --- END Application Phase ---
  8056. --- Output Phase ---
  8057. ENV: Agent did: predict-yes for direction R in state State-A
  8058. In State-A moving R
  8059. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8060. predict error 0
  8061. dir: dir isL
  8062. --- END Output Phase ---
  8063. --- Input Phase ---
  8064. =>WM: (13441: I2 ^dir L)
  8065. =>WM: (13440: I2 ^reward 1)
  8066. =>WM: (13439: I2 ^see 1)
  8067. =>WM: (13438: N959 ^status complete)
  8068. <=WM: (13427: I2 ^dir R)
  8069. <=WM: (13426: I2 ^reward 1)
  8070. <=WM: (13425: I2 ^see 0)
  8071. =>WM: (13442: I2 ^level-1 R1-root)
  8072. <=WM: (13428: I2 ^level-1 L0-root)
  8073. --- END Input Phase ---
  8074. --- Proposal Phase ---
  8075. --- Inner Elaboration Phase, active level 1 (S1) ---
  8076. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  8077. -->
  8078. (S1 ^operator O1917 = 0.6104587229728515)
  8079. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  8080. -->
  8081. (S1 ^operator O1918 = 0.2714993082286609)
  8082. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8083. -->
  8084. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8085. -->
  8086. Firing elaborate*copy-see-to-output-link
  8087. -->
  8088. (I3 ^see 1 +)
  8089. Firing elaborate*reward*based*on*reward
  8090. -->
  8091. (R963 ^value 1 +)
  8092. (R1 ^reward R963 +)
  8093. Firing propose*predict-yes
  8094. -->
  8095. (O1919 ^name predict-yes +)
  8096. (S1 ^operator O1919 +)
  8097. Firing propose*predict-no
  8098. -->
  8099. (O1920 ^name predict-no +)
  8100. (S1 ^operator O1920 +)
  8101. Firing rl*prefer*rvt*predict-no*H0*2
  8102. -->
  8103. (S1 ^operator O1918 = 0.3873366868699847)
  8104. Firing rl*prefer*rvt*predict-yes*H0*1
  8105. -->
  8106. (S1 ^operator O1917 = 0.3895394851831418)
  8107. Firing prefer*rvt*predict-yes*H0
  8108. -->
  8109. Firing prefer*rvt*predict-no*H0
  8110. -->
  8111. Firing elaborate*copy-dir-to-output-link
  8112. -->
  8113. (I3 ^dir L +)
  8114. inner elaboration loop at bottom goal.
  8115. Retracting elaborate*copy-see-to-output-link
  8116. -->
  8117. (I3 ^see 0 +)
  8118. Retracting propose*predict-no
  8119. -->
  8120. (O1918 ^name predict-no +)
  8121. (S1 ^operator O1918 +)
  8122. Retracting propose*predict-yes
  8123. -->
  8124. (O1917 ^name predict-yes +)
  8125. (S1 ^operator O1917 +)
  8126. Retracting elaborate*reward*based*on*reward
  8127. -->
  8128. (R962 ^value 1 +)
  8129. (R1 ^reward R962 +)
  8130. Retracting elaborate*copy-dir-to-output-link
  8131. -->
  8132. (I3 ^dir R +)
  8133. Retracting rl*prefer*rvt*predict-no*H0*4
  8134. -->
  8135. (S1 ^operator O1918 = 0.4476188714061859)
  8136. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8137. -->
  8138. (S1 ^operator O1918 = -0.00558448899823713)
  8139. Retracting rl*prefer*rvt*predict-yes*H0*3
  8140. -->
  8141. (S1 ^operator O1917 = 0.1844104702696336)
  8142. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8143. -->
  8144. (S1 ^operator O1917 = 0.8155985324859676)
  8145. =>WM: (13450: S1 ^operator O1920 +)
  8146. =>WM: (13449: S1 ^operator O1919 +)
  8147. =>WM: (13448: I3 ^dir L)
  8148. =>WM: (13447: O1920 ^name predict-no)
  8149. =>WM: (13446: O1919 ^name predict-yes)
  8150. =>WM: (13445: R963 ^value 1)
  8151. =>WM: (13444: R1 ^reward R963)
  8152. =>WM: (13443: I3 ^see 1)
  8153. <=WM: (13434: S1 ^operator O1917 +)
  8154. <=WM: (13436: S1 ^operator O1917)
  8155. <=WM: (13435: S1 ^operator O1918 +)
  8156. <=WM: (13433: I3 ^dir R)
  8157. <=WM: (13429: R1 ^reward R962)
  8158. <=WM: (13387: I3 ^see 0)
  8159. <=WM: (13432: O1918 ^name predict-no)
  8160. <=WM: (13431: O1917 ^name predict-yes)
  8161. <=WM: (13430: R962 ^value 1)
  8162. --- Inner Elaboration Phase, active level 1 (S1) ---
  8163. Firing prefer*rvt*predict-yes*H0
  8164. -->
  8165. Firing rl*prefer*rvt*predict-yes*H0*1
  8166. -->
  8167. (S1 ^operator O1919 = 0.3895394851831418)
  8168. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8169. -->
  8170. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  8171. -->
  8172. (S1 ^operator O1919 = 0.6104587229728515)
  8173. Firing prefer*rvt*predict-no*H0
  8174. -->
  8175. Firing rl*prefer*rvt*predict-no*H0*2
  8176. -->
  8177. (S1 ^operator O1920 = 0.3873366868699847)
  8178. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8179. -->
  8180. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  8181. -->
  8182. (S1 ^operator O1920 = 0.2714993082286609)
  8183. inner elaboration loop at bottom goal.
  8184. Retracting rl*prefer*rvt*predict-no*H0*2
  8185. -->
  8186. (S1 ^operator O1918 = 0.3873366868699847)
  8187. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  8188. -->
  8189. (S1 ^operator O1918 = 0.2714993082286609)
  8190. Retracting rl*prefer*rvt*predict-yes*H0*1
  8191. -->
  8192. (S1 ^operator O1917 = 0.3895394851831418)
  8193. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  8194. -->
  8195. (S1 ^operator O1917 = 0.6104587229728515)
  8196. --- END Proposal Phase ---
  8197. --- Decision Phase ---
  8198. RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491002 0.18441 -> 0.675411 -0.491002 0.184409(R,m,v=1,0.895062,0.0945096)
  8199. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324599 0.491 0.815599 -> 0.324597 0.491 0.815597(R,m,v=1,1,0)
  8200. =>WM: (13451: S1 ^operator O1919)
  8201. 960: O: O1919 (predict-yes)
  8202. --- END Decision Phase ---
  8203. --- Application Phase ---
  8204. --- Firing Productions (PE) For State At Depth 1 ---
  8205. --- Inner Elaboration Phase, active level 1 (S1) ---
  8206. Firing apply*operator
  8207. -->
  8208. (I3 ^predict-yes N960 + :O )
  8209. Firing apply*operator*complete
  8210. -->
  8211. (I3 ^predict-yes N959 - :O )
  8212. inner elaboration loop at bottom goal.
  8213. --- Change Working Memory (PE) ---
  8214. =>WM: (13452: I3 ^predict-yes N960)
  8215. <=WM: (13438: N959 ^status complete)
  8216. <=WM: (13437: I3 ^predict-yes N959)
  8217. --- Firing Productions (IE) For State At Depth 1 ---
  8218. --- Inner Elaboration Phase, active level 1 (S1) ---
  8219. Firing monitor*world
  8220. -->
  8221. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8222. --- Change Working Memory (IE) ---
  8223. --- END Application Phase ---
  8224. --- Output Phase ---
  8225. ENV: Agent did: predict-yes for direction L in state State-B
  8226. In State-B moving L
  8227. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8228. predict error 0
  8229. dir: dir isL
  8230. --- END Output Phase ---
  8231. --- Input Phase ---
  8232. =>WM: (13456: I2 ^dir L)
  8233. =>WM: (13455: I2 ^reward 1)
  8234. =>WM: (13454: I2 ^see 1)
  8235. =>WM: (13453: N960 ^status complete)
  8236. <=WM: (13441: I2 ^dir L)
  8237. <=WM: (13440: I2 ^reward 1)
  8238. <=WM: (13439: I2 ^see 1)
  8239. =>WM: (13457: I2 ^level-1 L1-root)
  8240. <=WM: (13442: I2 ^level-1 R1-root)
  8241. --- END Input Phase ---
  8242. --- Proposal Phase ---
  8243. --- Inner Elaboration Phase, active level 1 (S1) ---
  8244. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  8245. -->
  8246. (S1 ^operator O1920 = 0.6126624717752767)
  8247. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  8248. -->
  8249. (S1 ^operator O1919 = -0.02274740735326741)
  8250. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8251. -->
  8252. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8253. -->
  8254. Firing elaborate*copy-see-to-output-link
  8255. -->
  8256. (I3 ^see 1 +)
  8257. Firing elaborate*reward*based*on*reward
  8258. -->
  8259. (R964 ^value 1 +)
  8260. (R1 ^reward R964 +)
  8261. Firing propose*predict-yes
  8262. -->
  8263. (O1921 ^name predict-yes +)
  8264. (S1 ^operator O1921 +)
  8265. Firing propose*predict-no
  8266. -->
  8267. (O1922 ^name predict-no +)
  8268. (S1 ^operator O1922 +)
  8269. Firing rl*prefer*rvt*predict-no*H0*2
  8270. -->
  8271. (S1 ^operator O1920 = 0.3873366868699847)
  8272. Firing rl*prefer*rvt*predict-yes*H0*1
  8273. -->
  8274. (S1 ^operator O1919 = 0.3895394851831418)
  8275. Firing prefer*rvt*predict-yes*H0
  8276. -->
  8277. Firing prefer*rvt*predict-no*H0
  8278. -->
  8279. Firing elaborate*copy-dir-to-output-link
  8280. -->
  8281. (I3 ^dir L +)
  8282. inner elaboration loop at bottom goal.
  8283. Retracting elaborate*copy-see-to-output-link
  8284. -->
  8285. (I3 ^see 1 +)
  8286. Retracting propose*predict-no
  8287. -->
  8288. (O1920 ^name predict-no +)
  8289. (S1 ^operator O1920 +)
  8290. Retracting propose*predict-yes
  8291. -->
  8292. (O1919 ^name predict-yes +)
  8293. (S1 ^operator O1919 +)
  8294. Retracting elaborate*reward*based*on*reward
  8295. -->
  8296. (R963 ^value 1 +)
  8297. (R1 ^reward R963 +)
  8298. Retracting elaborate*copy-dir-to-output-link
  8299. -->
  8300. (I3 ^dir L +)
  8301. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  8302. -->
  8303. (S1 ^operator O1920 = 0.2714993082286609)
  8304. Retracting rl*prefer*rvt*predict-no*H0*2
  8305. -->
  8306. (S1 ^operator O1920 = 0.3873366868699847)
  8307. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  8308. -->
  8309. (S1 ^operator O1919 = 0.6104587229728515)
  8310. Retracting rl*prefer*rvt*predict-yes*H0*1
  8311. -->
  8312. (S1 ^operator O1919 = 0.3895394851831418)
  8313. =>WM: (13463: S1 ^operator O1922 +)
  8314. =>WM: (13462: S1 ^operator O1921 +)
  8315. =>WM: (13461: O1922 ^name predict-no)
  8316. =>WM: (13460: O1921 ^name predict-yes)
  8317. =>WM: (13459: R964 ^value 1)
  8318. =>WM: (13458: R1 ^reward R964)
  8319. <=WM: (13449: S1 ^operator O1919 +)
  8320. <=WM: (13451: S1 ^operator O1919)
  8321. <=WM: (13450: S1 ^operator O1920 +)
  8322. <=WM: (13444: R1 ^reward R963)
  8323. <=WM: (13447: O1920 ^name predict-no)
  8324. <=WM: (13446: O1919 ^name predict-yes)
  8325. <=WM: (13445: R963 ^value 1)
  8326. --- Inner Elaboration Phase, active level 1 (S1) ---
  8327. Firing prefer*rvt*predict-yes*H0
  8328. -->
  8329. Firing rl*prefer*rvt*predict-yes*H0*1
  8330. -->
  8331. (S1 ^operator O1921 = 0.3895394851831418)
  8332. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8333. -->
  8334. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  8335. -->
  8336. (S1 ^operator O1921 = -0.02274740735326741)
  8337. Firing prefer*rvt*predict-no*H0
  8338. -->
  8339. Firing rl*prefer*rvt*predict-no*H0*2
  8340. -->
  8341. (S1 ^operator O1922 = 0.3873366868699847)
  8342. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8343. -->
  8344. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  8345. -->
  8346. (S1 ^operator O1922 = 0.6126624717752767)
  8347. inner elaboration loop at bottom goal.
  8348. Retracting rl*prefer*rvt*predict-no*H0*2
  8349. -->
  8350. (S1 ^operator O1920 = 0.3873366868699847)
  8351. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  8352. -->
  8353. (S1 ^operator O1920 = 0.6126624717752767)
  8354. Retracting rl*prefer*rvt*predict-yes*H0*1
  8355. -->
  8356. (S1 ^operator O1919 = 0.3895394851831418)
  8357. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  8358. -->
  8359. (S1 ^operator O1919 = -0.02274740735326741)
  8360. --- END Proposal Phase ---
  8361. --- Decision Phase ---
  8362. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.8875,0.100472)
  8363. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.32241 0.610459 -> 0.288049 0.32241 0.610459(R,m,v=1,1,0)
  8364. =>WM: (13464: S1 ^operator O1922)
  8365. 961: O: O1922 (predict-no)
  8366. --- END Decision Phase ---
  8367. --- Application Phase ---
  8368. --- Firing Productions (PE) For State At Depth 1 ---
  8369. --- Inner Elaboration Phase, active level 1 (S1) ---
  8370. Firing apply*operator
  8371. -->
  8372. (I3 ^predict-no N961 + :O )
  8373. Firing apply*operator*complete
  8374. -->
  8375. (I3 ^predict-yes N960 - :O )
  8376. inner elaboration loop at bottom goal.
  8377. --- Change Working Memory (PE) ---
  8378. =>WM: (13465: I3 ^predict-no N961)
  8379. <=WM: (13453: N960 ^status complete)
  8380. <=WM: (13452: I3 ^predict-yes N960)
  8381. --- Firing Productions (IE) For State At Depth 1 ---
  8382. --- Inner Elaboration Phase, active level 1 (S1) ---
  8383. Firing monitor*world
  8384. -->
  8385. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8386. --- Change Working Memory (IE) ---
  8387. --- END Application Phase ---
  8388. --- Output Phase ---
  8389. ENV: Agent did: predict-no for direction L in state State-A
  8390. In State-A moving L
  8391. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8392. predict error 0
  8393. dir: dir isU
  8394. --- END Output Phase ---
  8395. --- Input Phase ---
  8396. =>WM: (13469: I2 ^dir U)
  8397. =>WM: (13468: I2 ^reward 1)
  8398. =>WM: (13467: I2 ^see 0)
  8399. =>WM: (13466: N961 ^status complete)
  8400. <=WM: (13456: I2 ^dir L)
  8401. <=WM: (13455: I2 ^reward 1)
  8402. <=WM: (13454: I2 ^see 1)
  8403. =>WM: (13470: I2 ^level-1 L0-root)
  8404. <=WM: (13457: I2 ^level-1 L1-root)
  8405. --- END Input Phase ---
  8406. --- Proposal Phase ---
  8407. --- Inner Elaboration Phase, active level 1 (S1) ---
  8408. Firing elaborate*copy-see-to-output-link
  8409. -->
  8410. (I3 ^see 0 +)
  8411. Firing elaborate*reward*based*on*reward
  8412. -->
  8413. (R965 ^value 1 +)
  8414. (R1 ^reward R965 +)
  8415. Firing propose*predict-yes
  8416. -->
  8417. (O1923 ^name predict-yes +)
  8418. (S1 ^operator O1923 +)
  8419. Firing propose*predict-no
  8420. -->
  8421. (O1924 ^name predict-no +)
  8422. (S1 ^operator O1924 +)
  8423. Firing rl*prefer*rvt*predict-no*H0*6
  8424. -->
  8425. (S1 ^operator O1922 = 0.9999999999999999)
  8426. Firing rl*prefer*rvt*predict-yes*H0*5
  8427. -->
  8428. (S1 ^operator O1921 = 0.)
  8429. Firing prefer*rvt*predict-yes*H0
  8430. -->
  8431. Firing prefer*rvt*predict-no*H0
  8432. -->
  8433. Firing elaborate*copy-dir-to-output-link
  8434. -->
  8435. (I3 ^dir U +)
  8436. inner elaboration loop at bottom goal.
  8437. Retracting elaborate*copy-see-to-output-link
  8438. -->
  8439. (I3 ^see 1 +)
  8440. Retracting propose*predict-no
  8441. -->
  8442. (O1922 ^name predict-no +)
  8443. (S1 ^operator O1922 +)
  8444. Retracting propose*predict-yes
  8445. -->
  8446. (O1921 ^name predict-yes +)
  8447. (S1 ^operator O1921 +)
  8448. Retracting elaborate*reward*based*on*reward
  8449. -->
  8450. (R964 ^value 1 +)
  8451. (R1 ^reward R964 +)
  8452. Retracting elaborate*copy-dir-to-output-link
  8453. -->
  8454. (I3 ^dir L +)
  8455. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  8456. -->
  8457. (S1 ^operator O1922 = 0.6126624717752767)
  8458. Retracting rl*prefer*rvt*predict-no*H0*2
  8459. -->
  8460. (S1 ^operator O1922 = 0.3873366868699847)
  8461. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  8462. -->
  8463. (S1 ^operator O1921 = -0.02274740735326741)
  8464. Retracting rl*prefer*rvt*predict-yes*H0*1
  8465. -->
  8466. (S1 ^operator O1921 = 0.3895397539597428)
  8467. =>WM: (13478: S1 ^operator O1924 +)
  8468. =>WM: (13477: S1 ^operator O1923 +)
  8469. =>WM: (13476: I3 ^dir U)
  8470. =>WM: (13475: O1924 ^name predict-no)
  8471. =>WM: (13474: O1923 ^name predict-yes)
  8472. =>WM: (13473: R965 ^value 1)
  8473. =>WM: (13472: R1 ^reward R965)
  8474. =>WM: (13471: I3 ^see 0)
  8475. <=WM: (13462: S1 ^operator O1921 +)
  8476. <=WM: (13463: S1 ^operator O1922 +)
  8477. <=WM: (13464: S1 ^operator O1922)
  8478. <=WM: (13448: I3 ^dir L)
  8479. <=WM: (13458: R1 ^reward R964)
  8480. <=WM: (13443: I3 ^see 1)
  8481. <=WM: (13461: O1922 ^name predict-no)
  8482. <=WM: (13460: O1921 ^name predict-yes)
  8483. <=WM: (13459: R964 ^value 1)
  8484. --- Inner Elaboration Phase, active level 1 (S1) ---
  8485. Firing prefer*rvt*predict-yes*H0
  8486. -->
  8487. Firing rl*prefer*rvt*predict-yes*H0*5
  8488. -->
  8489. (S1 ^operator O1923 = 0.)
  8490. Firing prefer*rvt*predict-no*H0
  8491. -->
  8492. Firing rl*prefer*rvt*predict-no*H0*6
  8493. -->
  8494. (S1 ^operator O1924 = 0.9999999999999999)
  8495. inner elaboration loop at bottom goal.
  8496. Retracting rl*prefer*rvt*predict-no*H0*6
  8497. -->
  8498. (S1 ^operator O1922 = 0.9999999999999999)
  8499. Retracting rl*prefer*rvt*predict-yes*H0*5
  8500. -->
  8501. (S1 ^operator O1921 = 0.)
  8502. --- END Proposal Phase ---
  8503. --- Decision Phase ---
  8504. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.930636,0.0649281)
  8505. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612662 -> 0.280918 0.331744 0.612663(R,m,v=1,1,0)
  8506. =>WM: (13479: S1 ^operator O1924)
  8507. 962: O: O1924 (predict-no)
  8508. --- END Decision Phase ---
  8509. --- Application Phase ---
  8510. --- Firing Productions (PE) For State At Depth 1 ---
  8511. --- Inner Elaboration Phase, active level 1 (S1) ---
  8512. Firing apply*operator
  8513. -->
  8514. (I3 ^predict-no N962 + :O )
  8515. Firing apply*operator*complete
  8516. -->
  8517. (I3 ^predict-no N961 - :O )
  8518. inner elaboration loop at bottom goal.
  8519. --- Change Working Memory (PE) ---
  8520. =>WM: (13480: I3 ^predict-no N962)
  8521. <=WM: (13466: N961 ^status complete)
  8522. <=WM: (13465: I3 ^predict-no N961)
  8523. --- Firing Productions (IE) For State At Depth 1 ---
  8524. --- Inner Elaboration Phase, active level 1 (S1) ---
  8525. Firing monitor*world
  8526. -->
  8527. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8528. --- Change Working Memory (IE) ---
  8529. --- END Application Phase ---
  8530. --- Output Phase ---
  8531. ENV: Agent did: predict-no for direction U in state State-A
  8532. In State-A moving U
  8533. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8534. predict error 0
  8535. dir: dir isR
  8536. --- END Output Phase ---
  8537. --- Input Phase ---
  8538. =>WM: (13484: I2 ^dir R)
  8539. =>WM: (13483: I2 ^reward 1)
  8540. =>WM: (13482: I2 ^see 0)
  8541. =>WM: (13481: N962 ^status complete)
  8542. <=WM: (13469: I2 ^dir U)
  8543. <=WM: (13468: I2 ^reward 1)
  8544. <=WM: (13467: I2 ^see 0)
  8545. =>WM: (13485: I2 ^level-1 L0-root)
  8546. <=WM: (13470: I2 ^level-1 L0-root)
  8547. --- END Input Phase ---
  8548. --- Proposal Phase ---
  8549. --- Inner Elaboration Phase, active level 1 (S1) ---
  8550. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8551. -->
  8552. (S1 ^operator O1923 = 0.8155971820726273)
  8553. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8554. -->
  8555. (S1 ^operator O1924 = -0.00558448899823713)
  8556. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8557. -->
  8558. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8559. -->
  8560. Firing elaborate*copy-see-to-output-link
  8561. -->
  8562. (I3 ^see 0 +)
  8563. Firing elaborate*reward*based*on*reward
  8564. -->
  8565. (R966 ^value 1 +)
  8566. (R1 ^reward R966 +)
  8567. Firing propose*predict-yes
  8568. -->
  8569. (O1925 ^name predict-yes +)
  8570. (S1 ^operator O1925 +)
  8571. Firing propose*predict-no
  8572. -->
  8573. (O1926 ^name predict-no +)
  8574. (S1 ^operator O1926 +)
  8575. Firing rl*prefer*rvt*predict-no*H0*4
  8576. -->
  8577. (S1 ^operator O1924 = 0.4476188714061859)
  8578. Firing rl*prefer*rvt*predict-yes*H0*3
  8579. -->
  8580. (S1 ^operator O1923 = 0.1844091198562935)
  8581. Firing prefer*rvt*predict-yes*H0
  8582. -->
  8583. Firing prefer*rvt*predict-no*H0
  8584. -->
  8585. Firing elaborate*copy-dir-to-output-link
  8586. -->
  8587. (I3 ^dir R +)
  8588. inner elaboration loop at bottom goal.
  8589. Retracting elaborate*copy-see-to-output-link
  8590. -->
  8591. (I3 ^see 0 +)
  8592. Retracting propose*predict-no
  8593. -->
  8594. (O1924 ^name predict-no +)
  8595. (S1 ^operator O1924 +)
  8596. Retracting propose*predict-yes
  8597. -->
  8598. (O1923 ^name predict-yes +)
  8599. (S1 ^operator O1923 +)
  8600. Retracting elaborate*reward*based*on*reward
  8601. -->
  8602. (R965 ^value 1 +)
  8603. (R1 ^reward R965 +)
  8604. Retracting elaborate*copy-dir-to-output-link
  8605. -->
  8606. (I3 ^dir U +)
  8607. Retracting rl*prefer*rvt*predict-no*H0*6
  8608. -->
  8609. (S1 ^operator O1924 = 0.9999999999999999)
  8610. Retracting rl*prefer*rvt*predict-yes*H0*5
  8611. -->
  8612. (S1 ^operator O1923 = 0.)
  8613. =>WM: (13492: S1 ^operator O1926 +)
  8614. =>WM: (13491: S1 ^operator O1925 +)
  8615. =>WM: (13490: I3 ^dir R)
  8616. =>WM: (13489: O1926 ^name predict-no)
  8617. =>WM: (13488: O1925 ^name predict-yes)
  8618. =>WM: (13487: R966 ^value 1)
  8619. =>WM: (13486: R1 ^reward R966)
  8620. <=WM: (13477: S1 ^operator O1923 +)
  8621. <=WM: (13478: S1 ^operator O1924 +)
  8622. <=WM: (13479: S1 ^operator O1924)
  8623. <=WM: (13476: I3 ^dir U)
  8624. <=WM: (13472: R1 ^reward R965)
  8625. <=WM: (13475: O1924 ^name predict-no)
  8626. <=WM: (13474: O1923 ^name predict-yes)
  8627. <=WM: (13473: R965 ^value 1)
  8628. --- Inner Elaboration Phase, active level 1 (S1) ---
  8629. Firing prefer*rvt*predict-yes*H0
  8630. -->
  8631. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8632. -->
  8633. (S1 ^operator O1925 = 0.8155971820726273)
  8634. Firing rl*prefer*rvt*predict-yes*H0*3
  8635. -->
  8636. (S1 ^operator O1925 = 0.1844091198562935)
  8637. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8638. -->
  8639. Firing prefer*rvt*predict-no*H0
  8640. -->
  8641. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8642. -->
  8643. (S1 ^operator O1926 = -0.00558448899823713)
  8644. Firing rl*prefer*rvt*predict-no*H0*4
  8645. -->
  8646. (S1 ^operator O1926 = 0.4476188714061859)
  8647. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8648. -->
  8649. inner elaboration loop at bottom goal.
  8650. Retracting rl*prefer*rvt*predict-no*H0*4
  8651. -->
  8652. (S1 ^operator O1924 = 0.4476188714061859)
  8653. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8654. -->
  8655. (S1 ^operator O1924 = -0.00558448899823713)
  8656. Retracting rl*prefer*rvt*predict-yes*H0*3
  8657. -->
  8658. (S1 ^operator O1923 = 0.1844091198562935)
  8659. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8660. -->
  8661. (S1 ^operator O1923 = 0.8155971820726273)
  8662. --- END Proposal Phase ---
  8663. --- Decision Phase ---
  8664. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8665. =>WM: (13493: S1 ^operator O1925)
  8666. 963: O: O1925 (predict-yes)
  8667. --- END Decision Phase ---
  8668. --- Application Phase ---
  8669. --- Firing Productions (PE) For State At Depth 1 ---
  8670. --- Inner Elaboration Phase, active level 1 (S1) ---
  8671. Firing apply*operator
  8672. -->
  8673. (I3 ^predict-yes N963 + :O )
  8674. Firing apply*operator*complete
  8675. -->
  8676. (I3 ^predict-no N962 - :O )
  8677. inner elaboration loop at bottom goal.
  8678. --- Change Working Memory (PE) ---
  8679. =>WM: (13494: I3 ^predict-yes N963)
  8680. <=WM: (13481: N962 ^status complete)
  8681. <=WM: (13480: I3 ^predict-no N962)
  8682. --- Firing Productions (IE) For State At Depth 1 ---
  8683. --- Inner Elaboration Phase, active level 1 (S1) ---
  8684. Firing monitor*world
  8685. -->
  8686. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8687. --- Change Working Memory (IE) ---
  8688. --- END Application Phase ---
  8689. --- Output Phase ---
  8690. ENV: Agent did: predict-yes for direction R in state State-A
  8691. In State-A moving R
  8692. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8693. predict error 0
  8694. dir: dir isR
  8695. --- END Output Phase ---
  8696. --- Input Phase ---
  8697. =>WM: (13498: I2 ^dir R)
  8698. =>WM: (13497: I2 ^reward 1)
  8699. =>WM: (13496: I2 ^see 1)
  8700. =>WM: (13495: N963 ^status complete)
  8701. <=WM: (13484: I2 ^dir R)
  8702. <=WM: (13483: I2 ^reward 1)
  8703. <=WM: (13482: I2 ^see 0)
  8704. =>WM: (13499: I2 ^level-1 R1-root)
  8705. <=WM: (13485: I2 ^level-1 L0-root)
  8706. --- END Input Phase ---
  8707. --- Proposal Phase ---
  8708. --- Inner Elaboration Phase, active level 1 (S1) ---
  8709. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  8710. -->
  8711. (S1 ^operator O1925 = 0.1398795999120246)
  8712. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  8713. -->
  8714. (S1 ^operator O1926 = 0.5523829775838558)
  8715. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8716. -->
  8717. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8718. -->
  8719. Firing elaborate*copy-see-to-output-link
  8720. -->
  8721. (I3 ^see 1 +)
  8722. Firing elaborate*reward*based*on*reward
  8723. -->
  8724. (R967 ^value 1 +)
  8725. (R1 ^reward R967 +)
  8726. Firing propose*predict-yes
  8727. -->
  8728. (O1927 ^name predict-yes +)
  8729. (S1 ^operator O1927 +)
  8730. Firing propose*predict-no
  8731. -->
  8732. (O1928 ^name predict-no +)
  8733. (S1 ^operator O1928 +)
  8734. Firing rl*prefer*rvt*predict-no*H0*4
  8735. -->
  8736. (S1 ^operator O1926 = 0.4476188714061859)
  8737. Firing rl*prefer*rvt*predict-yes*H0*3
  8738. -->
  8739. (S1 ^operator O1925 = 0.1844091198562935)
  8740. Firing prefer*rvt*predict-yes*H0
  8741. -->
  8742. Firing prefer*rvt*predict-no*H0
  8743. -->
  8744. Firing elaborate*copy-dir-to-output-link
  8745. -->
  8746. (I3 ^dir R +)
  8747. inner elaboration loop at bottom goal.
  8748. Retracting elaborate*copy-see-to-output-link
  8749. -->
  8750. (I3 ^see 0 +)
  8751. Retracting propose*predict-no
  8752. -->
  8753. (O1926 ^name predict-no +)
  8754. (S1 ^operator O1926 +)
  8755. Retracting propose*predict-yes
  8756. -->
  8757. (O1925 ^name predict-yes +)
  8758. (S1 ^operator O1925 +)
  8759. Retracting elaborate*reward*based*on*reward
  8760. -->
  8761. (R966 ^value 1 +)
  8762. (R1 ^reward R966 +)
  8763. Retracting elaborate*copy-dir-to-output-link
  8764. -->
  8765. (I3 ^dir R +)
  8766. Retracting rl*prefer*rvt*predict-no*H0*4
  8767. -->
  8768. (S1 ^operator O1926 = 0.4476188714061859)
  8769. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8770. -->
  8771. (S1 ^operator O1926 = -0.00558448899823713)
  8772. Retracting rl*prefer*rvt*predict-yes*H0*3
  8773. -->
  8774. (S1 ^operator O1925 = 0.1844091198562935)
  8775. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8776. -->
  8777. (S1 ^operator O1925 = 0.8155971820726273)
  8778. =>WM: (13506: S1 ^operator O1928 +)
  8779. =>WM: (13505: S1 ^operator O1927 +)
  8780. =>WM: (13504: O1928 ^name predict-no)
  8781. =>WM: (13503: O1927 ^name predict-yes)
  8782. =>WM: (13502: R967 ^value 1)
  8783. =>WM: (13501: R1 ^reward R967)
  8784. =>WM: (13500: I3 ^see 1)
  8785. <=WM: (13491: S1 ^operator O1925 +)
  8786. <=WM: (13493: S1 ^operator O1925)
  8787. <=WM: (13492: S1 ^operator O1926 +)
  8788. <=WM: (13486: R1 ^reward R966)
  8789. <=WM: (13471: I3 ^see 0)
  8790. <=WM: (13489: O1926 ^name predict-no)
  8791. <=WM: (13488: O1925 ^name predict-yes)
  8792. <=WM: (13487: R966 ^value 1)
  8793. --- Inner Elaboration Phase, active level 1 (S1) ---
  8794. Firing prefer*rvt*predict-yes*H0
  8795. -->
  8796. Firing rl*prefer*rvt*predict-yes*H0*3
  8797. -->
  8798. (S1 ^operator O1927 = 0.1844091198562935)
  8799. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8800. -->
  8801. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  8802. -->
  8803. (S1 ^operator O1927 = 0.1398795999120246)
  8804. Firing prefer*rvt*predict-no*H0
  8805. -->
  8806. Firing rl*prefer*rvt*predict-no*H0*4
  8807. -->
  8808. (S1 ^operator O1928 = 0.4476188714061859)
  8809. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8810. -->
  8811. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  8812. -->
  8813. (S1 ^operator O1928 = 0.5523829775838558)
  8814. inner elaboration loop at bottom goal.
  8815. Retracting rl*prefer*rvt*predict-no*H0*4
  8816. -->
  8817. (S1 ^operator O1926 = 0.4476188714061859)
  8818. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  8819. -->
  8820. (S1 ^operator O1926 = 0.5523829775838558)
  8821. Retracting rl*prefer*rvt*predict-yes*H0*3
  8822. -->
  8823. (S1 ^operator O1925 = 0.1844091198562935)
  8824. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  8825. -->
  8826. (S1 ^operator O1925 = 0.1398795999120246)
  8827. --- END Proposal Phase ---
  8828. --- Decision Phase ---
  8829. RL update rl*prefer*rvt*predict-yes*H0*3 0.675411 -0.491002 0.184409 -> 0.67541 -0.491002 0.184408(R,m,v=1,0.895706,0.0939938)
  8830. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324597 0.491 0.815597 -> 0.324596 0.491 0.815596(R,m,v=1,1,0)
  8831. =>WM: (13507: S1 ^operator O1928)
  8832. 964: O: O1928 (predict-no)
  8833. --- END Decision Phase ---
  8834. --- Application Phase ---
  8835. --- Firing Productions (PE) For State At Depth 1 ---
  8836. --- Inner Elaboration Phase, active level 1 (S1) ---
  8837. Firing apply*operator
  8838. -->
  8839. (I3 ^predict-no N964 + :O )
  8840. Firing apply*operator*complete
  8841. -->
  8842. (I3 ^predict-yes N963 - :O )
  8843. inner elaboration loop at bottom goal.
  8844. --- Change Working Memory (PE) ---
  8845. =>WM: (13508: I3 ^predict-no N964)
  8846. <=WM: (13495: N963 ^status complete)
  8847. <=WM: (13494: I3 ^predict-yes N963)
  8848. --- Firing Productions (IE) For State At Depth 1 ---
  8849. --- Inner Elaboration Phase, active level 1 (S1) ---
  8850. Firing monitor*world
  8851. -->
  8852. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8853. --- Change Working Memory (IE) ---
  8854. --- END Application Phase ---
  8855. --- Output Phase ---
  8856. ENV: Agent did: predict-no for direction R in state State-B
  8857. In State-B moving R
  8858. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8859. predict error 0
  8860. dir: dir isL
  8861. --- END Output Phase ---
  8862. --- Input Phase ---
  8863. =>WM: (13512: I2 ^dir L)
  8864. =>WM: (13511: I2 ^reward 1)
  8865. =>WM: (13510: I2 ^see 0)
  8866. =>WM: (13509: N964 ^status complete)
  8867. <=WM: (13498: I2 ^dir R)
  8868. <=WM: (13497: I2 ^reward 1)
  8869. <=WM: (13496: I2 ^see 1)
  8870. =>WM: (13513: I2 ^level-1 R0-root)
  8871. <=WM: (13499: I2 ^level-1 R1-root)
  8872. --- END Input Phase ---
  8873. --- Proposal Phase ---
  8874. --- Inner Elaboration Phase, active level 1 (S1) ---
  8875. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  8876. -->
  8877. (S1 ^operator O1927 = 0.6104618767696252)
  8878. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  8879. -->
  8880. (S1 ^operator O1928 = 0.1063475139796038)
  8881. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8882. -->
  8883. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8884. -->
  8885. Firing elaborate*copy-see-to-output-link
  8886. -->
  8887. (I3 ^see 0 +)
  8888. Firing elaborate*reward*based*on*reward
  8889. -->
  8890. (R968 ^value 1 +)
  8891. (R1 ^reward R968 +)
  8892. Firing propose*predict-yes
  8893. -->
  8894. (O1929 ^name predict-yes +)
  8895. (S1 ^operator O1929 +)
  8896. Firing propose*predict-no
  8897. -->
  8898. (O1930 ^name predict-no +)
  8899. (S1 ^operator O1930 +)
  8900. Firing rl*prefer*rvt*predict-no*H0*2
  8901. -->
  8902. (S1 ^operator O1928 = 0.3873368130731955)
  8903. Firing rl*prefer*rvt*predict-yes*H0*1
  8904. -->
  8905. (S1 ^operator O1927 = 0.3895397539597428)
  8906. Firing prefer*rvt*predict-yes*H0
  8907. -->
  8908. Firing prefer*rvt*predict-no*H0
  8909. -->
  8910. Firing elaborate*copy-dir-to-output-link
  8911. -->
  8912. (I3 ^dir L +)
  8913. inner elaboration loop at bottom goal.
  8914. Retracting elaborate*copy-see-to-output-link
  8915. -->
  8916. (I3 ^see 1 +)
  8917. Retracting propose*predict-no
  8918. -->
  8919. (O1928 ^name predict-no +)
  8920. (S1 ^operator O1928 +)
  8921. Retracting propose*predict-yes
  8922. -->
  8923. (O1927 ^name predict-yes +)
  8924. (S1 ^operator O1927 +)
  8925. Retracting elaborate*reward*based*on*reward
  8926. -->
  8927. (R967 ^value 1 +)
  8928. (R1 ^reward R967 +)
  8929. Retracting elaborate*copy-dir-to-output-link
  8930. -->
  8931. (I3 ^dir R +)
  8932. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  8933. -->
  8934. (S1 ^operator O1928 = 0.5523829775838558)
  8935. Retracting rl*prefer*rvt*predict-no*H0*4
  8936. -->
  8937. (S1 ^operator O1928 = 0.4476188714061859)
  8938. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  8939. -->
  8940. (S1 ^operator O1927 = 0.1398795999120246)
  8941. Retracting rl*prefer*rvt*predict-yes*H0*3
  8942. -->
  8943. (S1 ^operator O1927 = 0.1844081745669553)
  8944. =>WM: (13521: S1 ^operator O1930 +)
  8945. =>WM: (13520: S1 ^operator O1929 +)
  8946. =>WM: (13519: I3 ^dir L)
  8947. =>WM: (13518: O1930 ^name predict-no)
  8948. =>WM: (13517: O1929 ^name predict-yes)
  8949. =>WM: (13516: R968 ^value 1)
  8950. =>WM: (13515: R1 ^reward R968)
  8951. =>WM: (13514: I3 ^see 0)
  8952. <=WM: (13505: S1 ^operator O1927 +)
  8953. <=WM: (13506: S1 ^operator O1928 +)
  8954. <=WM: (13507: S1 ^operator O1928)
  8955. <=WM: (13490: I3 ^dir R)
  8956. <=WM: (13501: R1 ^reward R967)
  8957. <=WM: (13500: I3 ^see 1)
  8958. <=WM: (13504: O1928 ^name predict-no)
  8959. <=WM: (13503: O1927 ^name predict-yes)
  8960. <=WM: (13502: R967 ^value 1)
  8961. --- Inner Elaboration Phase, active level 1 (S1) ---
  8962. Firing prefer*rvt*predict-yes*H0
  8963. -->
  8964. Firing rl*prefer*rvt*predict-yes*H0*1
  8965. -->
  8966. (S1 ^operator O1929 = 0.3895397539597428)
  8967. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  8968. -->
  8969. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  8970. -->
  8971. (S1 ^operator O1929 = 0.6104618767696252)
  8972. Firing prefer*rvt*predict-no*H0
  8973. -->
  8974. Firing rl*prefer*rvt*predict-no*H0*2
  8975. -->
  8976. (S1 ^operator O1930 = 0.3873368130731955)
  8977. Firing prefer*rvt*predict-no*H0*2*v1*H1
  8978. -->
  8979. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  8980. -->
  8981. (S1 ^operator O1930 = 0.1063475139796038)
  8982. inner elaboration loop at bottom goal.
  8983. Retracting rl*prefer*rvt*predict-no*H0*2
  8984. -->
  8985. (S1 ^operator O1928 = 0.3873368130731955)
  8986. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  8987. -->
  8988. (S1 ^operator O1928 = 0.1063475139796038)
  8989. Retracting rl*prefer*rvt*predict-yes*H0*1
  8990. -->
  8991. (S1 ^operator O1927 = 0.3895397539597428)
  8992. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  8993. -->
  8994. (S1 ^operator O1927 = 0.6104618767696252)
  8995. --- END Proposal Phase ---
  8996. --- Decision Phase ---
  8997. RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.92562,0.0694215)
  8998. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
  8999. =>WM: (13522: S1 ^operator O1929)
  9000. 965: O: O1929 (predict-yes)
  9001. --- END Decision Phase ---
  9002. --- Application Phase ---
  9003. --- Firing Productions (PE) For State At Depth 1 ---
  9004. --- Inner Elaboration Phase, active level 1 (S1) ---
  9005. Firing apply*operator
  9006. -->
  9007. (I3 ^predict-yes N965 + :O )
  9008. Firing apply*operator*complete
  9009. -->
  9010. (I3 ^predict-no N964 - :O )
  9011. inner elaboration loop at bottom goal.
  9012. --- Change Working Memory (PE) ---
  9013. =>WM: (13523: I3 ^predict-yes N965)
  9014. <=WM: (13509: N964 ^status complete)
  9015. <=WM: (13508: I3 ^predict-no N964)
  9016. --- Firing Productions (IE) For State At Depth 1 ---
  9017. --- Inner Elaboration Phase, active level 1 (S1) ---
  9018. Firing monitor*world
  9019. -->
  9020. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9021. --- Change Working Memory (IE) ---
  9022. --- END Application Phase ---
  9023. --- Output Phase ---
  9024. ENV: Agent did: predict-yes for direction L in state State-B
  9025. In State-B moving L
  9026. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9027. predict error 0
  9028. dir: dir isL
  9029. --- END Output Phase ---
  9030. --- Input Phase ---
  9031. =>WM: (13527: I2 ^dir L)
  9032. =>WM: (13526: I2 ^reward 1)
  9033. =>WM: (13525: I2 ^see 1)
  9034. =>WM: (13524: N965 ^status complete)
  9035. <=WM: (13512: I2 ^dir L)
  9036. <=WM: (13511: I2 ^reward 1)
  9037. <=WM: (13510: I2 ^see 0)
  9038. =>WM: (13528: I2 ^level-1 L1-root)
  9039. <=WM: (13513: I2 ^level-1 R0-root)
  9040. --- END Input Phase ---
  9041. --- Proposal Phase ---
  9042. --- Inner Elaboration Phase, active level 1 (S1) ---
  9043. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  9044. -->
  9045. (S1 ^operator O1930 = 0.6126625979784875)
  9046. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  9047. -->
  9048. (S1 ^operator O1929 = -0.02274740735326741)
  9049. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9050. -->
  9051. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9052. -->
  9053. Firing elaborate*copy-see-to-output-link
  9054. -->
  9055. (I3 ^see 1 +)
  9056. Firing elaborate*reward*based*on*reward
  9057. -->
  9058. (R969 ^value 1 +)
  9059. (R1 ^reward R969 +)
  9060. Firing propose*predict-yes
  9061. -->
  9062. (O1931 ^name predict-yes +)
  9063. (S1 ^operator O1931 +)
  9064. Firing propose*predict-no
  9065. -->
  9066. (O1932 ^name predict-no +)
  9067. (S1 ^operator O1932 +)
  9068. Firing rl*prefer*rvt*predict-no*H0*2
  9069. -->
  9070. (S1 ^operator O1930 = 0.3873368130731955)
  9071. Firing rl*prefer*rvt*predict-yes*H0*1
  9072. -->
  9073. (S1 ^operator O1929 = 0.3895397539597428)
  9074. Firing prefer*rvt*predict-yes*H0
  9075. -->
  9076. Firing prefer*rvt*predict-no*H0
  9077. -->
  9078. Firing elaborate*copy-dir-to-output-link
  9079. -->
  9080. (I3 ^dir L +)
  9081. inner elaboration loop at bottom goal.
  9082. Retracting elaborate*copy-see-to-output-link
  9083. -->
  9084. (I3 ^see 0 +)
  9085. Retracting propose*predict-no
  9086. -->
  9087. (O1930 ^name predict-no +)
  9088. (S1 ^operator O1930 +)
  9089. Retracting propose*predict-yes
  9090. -->
  9091. (O1929 ^name predict-yes +)
  9092. (S1 ^operator O1929 +)
  9093. Retracting elaborate*reward*based*on*reward
  9094. -->
  9095. (R968 ^value 1 +)
  9096. (R1 ^reward R968 +)
  9097. Retracting elaborate*copy-dir-to-output-link
  9098. -->
  9099. (I3 ^dir L +)
  9100. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  9101. -->
  9102. (S1 ^operator O1930 = 0.1063475139796038)
  9103. Retracting rl*prefer*rvt*predict-no*H0*2
  9104. -->
  9105. (S1 ^operator O1930 = 0.3873368130731955)
  9106. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  9107. -->
  9108. (S1 ^operator O1929 = 0.6104618767696252)
  9109. Retracting rl*prefer*rvt*predict-yes*H0*1
  9110. -->
  9111. (S1 ^operator O1929 = 0.3895397539597428)
  9112. =>WM: (13535: S1 ^operator O1932 +)
  9113. =>WM: (13534: S1 ^operator O1931 +)
  9114. =>WM: (13533: O1932 ^name predict-no)
  9115. =>WM: (13532: O1931 ^name predict-yes)
  9116. =>WM: (13531: R969 ^value 1)
  9117. =>WM: (13530: R1 ^reward R969)
  9118. =>WM: (13529: I3 ^see 1)
  9119. <=WM: (13520: S1 ^operator O1929 +)
  9120. <=WM: (13522: S1 ^operator O1929)
  9121. <=WM: (13521: S1 ^operator O1930 +)
  9122. <=WM: (13515: R1 ^reward R968)
  9123. <=WM: (13514: I3 ^see 0)
  9124. <=WM: (13518: O1930 ^name predict-no)
  9125. <=WM: (13517: O1929 ^name predict-yes)
  9126. <=WM: (13516: R968 ^value 1)
  9127. --- Inner Elaboration Phase, active level 1 (S1) ---
  9128. Firing prefer*rvt*predict-yes*H0
  9129. -->
  9130. Firing rl*prefer*rvt*predict-yes*H0*1
  9131. -->
  9132. (S1 ^operator O1931 = 0.3895397539597428)
  9133. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9134. -->
  9135. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  9136. -->
  9137. (S1 ^operator O1931 = -0.02274740735326741)
  9138. Firing prefer*rvt*predict-no*H0
  9139. -->
  9140. Firing rl*prefer*rvt*predict-no*H0*2
  9141. -->
  9142. (S1 ^operator O1932 = 0.3873368130731955)
  9143. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9144. -->
  9145. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  9146. -->
  9147. (S1 ^operator O1932 = 0.6126625979784875)
  9148. inner elaboration loop at bottom goal.
  9149. Retracting rl*prefer*rvt*predict-no*H0*2
  9150. -->
  9151. (S1 ^operator O1930 = 0.3873368130731955)
  9152. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  9153. -->
  9154. (S1 ^operator O1930 = 0.6126625979784875)
  9155. Retracting rl*prefer*rvt*predict-yes*H0*1
  9156. -->
  9157. (S1 ^operator O1929 = 0.3895397539597428)
  9158. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  9159. -->
  9160. (S1 ^operator O1929 = -0.02274740735326741)
  9161. --- END Proposal Phase ---
  9162. --- Decision Phase ---
  9163. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.888199,0.0999224)
  9164. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610462(R,m,v=1,1,0)
  9165. =>WM: (13536: S1 ^operator O1932)
  9166. 966: O: O1932 (predict-no)
  9167. --- END Decision Phase ---
  9168. --- Application Phase ---
  9169. --- Firing Productions (PE) For State At Depth 1 ---
  9170. --- Inner Elaboration Phase, active level 1 (S1) ---
  9171. Firing apply*operator
  9172. -->
  9173. (I3 ^predict-no N966 + :O )
  9174. Firing apply*operator*complete
  9175. -->
  9176. (I3 ^predict-yes N965 - :O )
  9177. inner elaboration loop at bottom goal.
  9178. --- Change Working Memory (PE) ---
  9179. =>WM: (13537: I3 ^predict-no N966)
  9180. <=WM: (13524: N965 ^status complete)
  9181. <=WM: (13523: I3 ^predict-yes N965)
  9182. --- Firing Productions (IE) For State At Depth 1 ---
  9183. --- Inner Elaboration Phase, active level 1 (S1) ---
  9184. Firing monitor*world
  9185. -->
  9186. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9187. --- Change Working Memory (IE) ---
  9188. --- END Application Phase ---
  9189. --- Output Phase ---
  9190. ENV: Agent did: predict-no for direction L in state State-A
  9191. In State-A moving L
  9192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9193. predict error 0
  9194. dir: dir isR
  9195. --- END Output Phase ---
  9196. --- Input Phase ---
  9197. =>WM: (13541: I2 ^dir R)
  9198. =>WM: (13540: I2 ^reward 1)
  9199. =>WM: (13539: I2 ^see 0)
  9200. =>WM: (13538: N966 ^status complete)
  9201. <=WM: (13527: I2 ^dir L)
  9202. <=WM: (13526: I2 ^reward 1)
  9203. <=WM: (13525: I2 ^see 1)
  9204. =>WM: (13542: I2 ^level-1 L0-root)
  9205. <=WM: (13528: I2 ^level-1 L1-root)
  9206. --- END Input Phase ---
  9207. --- Proposal Phase ---
  9208. --- Inner Elaboration Phase, active level 1 (S1) ---
  9209. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9210. -->
  9211. (S1 ^operator O1931 = 0.8155962367832892)
  9212. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9213. -->
  9214. (S1 ^operator O1932 = -0.00558448899823713)
  9215. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9216. -->
  9217. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9218. -->
  9219. Firing elaborate*copy-see-to-output-link
  9220. -->
  9221. (I3 ^see 0 +)
  9222. Firing elaborate*reward*based*on*reward
  9223. -->
  9224. (R970 ^value 1 +)
  9225. (R1 ^reward R970 +)
  9226. Firing propose*predict-yes
  9227. -->
  9228. (O1933 ^name predict-yes +)
  9229. (S1 ^operator O1933 +)
  9230. Firing propose*predict-no
  9231. -->
  9232. (O1934 ^name predict-no +)
  9233. (S1 ^operator O1934 +)
  9234. Firing rl*prefer*rvt*predict-no*H0*4
  9235. -->
  9236. (S1 ^operator O1932 = 0.4476185940576797)
  9237. Firing rl*prefer*rvt*predict-yes*H0*3
  9238. -->
  9239. (S1 ^operator O1931 = 0.1844081745669553)
  9240. Firing prefer*rvt*predict-yes*H0
  9241. -->
  9242. Firing prefer*rvt*predict-no*H0
  9243. -->
  9244. Firing elaborate*copy-dir-to-output-link
  9245. -->
  9246. (I3 ^dir R +)
  9247. inner elaboration loop at bottom goal.
  9248. Retracting elaborate*copy-see-to-output-link
  9249. -->
  9250. (I3 ^see 1 +)
  9251. Retracting propose*predict-no
  9252. -->
  9253. (O1932 ^name predict-no +)
  9254. (S1 ^operator O1932 +)
  9255. Retracting propose*predict-yes
  9256. -->
  9257. (O1931 ^name predict-yes +)
  9258. (S1 ^operator O1931 +)
  9259. Retracting elaborate*reward*based*on*reward
  9260. -->
  9261. (R969 ^value 1 +)
  9262. (R1 ^reward R969 +)
  9263. Retracting elaborate*copy-dir-to-output-link
  9264. -->
  9265. (I3 ^dir L +)
  9266. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  9267. -->
  9268. (S1 ^operator O1932 = 0.6126625979784875)
  9269. Retracting rl*prefer*rvt*predict-no*H0*2
  9270. -->
  9271. (S1 ^operator O1932 = 0.3873368130731955)
  9272. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  9273. -->
  9274. (S1 ^operator O1931 = -0.02274740735326741)
  9275. Retracting rl*prefer*rvt*predict-yes*H0*1
  9276. -->
  9277. (S1 ^operator O1931 = 0.3895395093503376)
  9278. =>WM: (13550: S1 ^operator O1934 +)
  9279. =>WM: (13549: S1 ^operator O1933 +)
  9280. =>WM: (13548: I3 ^dir R)
  9281. =>WM: (13547: O1934 ^name predict-no)
  9282. =>WM: (13546: O1933 ^name predict-yes)
  9283. =>WM: (13545: R970 ^value 1)
  9284. =>WM: (13544: R1 ^reward R970)
  9285. =>WM: (13543: I3 ^see 0)
  9286. <=WM: (13534: S1 ^operator O1931 +)
  9287. <=WM: (13535: S1 ^operator O1932 +)
  9288. <=WM: (13536: S1 ^operator O1932)
  9289. <=WM: (13519: I3 ^dir L)
  9290. <=WM: (13530: R1 ^reward R969)
  9291. <=WM: (13529: I3 ^see 1)
  9292. <=WM: (13533: O1932 ^name predict-no)
  9293. <=WM: (13532: O1931 ^name predict-yes)
  9294. <=WM: (13531: R969 ^value 1)
  9295. --- Inner Elaboration Phase, active level 1 (S1) ---
  9296. Firing prefer*rvt*predict-yes*H0
  9297. -->
  9298. Firing rl*prefer*rvt*predict-yes*H0*3
  9299. -->
  9300. (S1 ^operator O1933 = 0.1844081745669553)
  9301. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9302. -->
  9303. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9304. -->
  9305. (S1 ^operator O1933 = 0.8155962367832892)
  9306. Firing prefer*rvt*predict-no*H0
  9307. -->
  9308. Firing rl*prefer*rvt*predict-no*H0*4
  9309. -->
  9310. (S1 ^operator O1934 = 0.4476185940576797)
  9311. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9312. -->
  9313. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9314. -->
  9315. (S1 ^operator O1934 = -0.00558448899823713)
  9316. inner elaboration loop at bottom goal.
  9317. Retracting rl*prefer*rvt*predict-no*H0*4
  9318. -->
  9319. (S1 ^operator O1932 = 0.4476185940576797)
  9320. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9321. -->
  9322. (S1 ^operator O1932 = -0.00558448899823713)
  9323. Retracting rl*prefer*rvt*predict-yes*H0*3
  9324. -->
  9325. (S1 ^operator O1931 = 0.1844081745669553)
  9326. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9327. -->
  9328. (S1 ^operator O1931 = 0.8155962367832892)
  9329. --- END Proposal Phase ---
  9330. --- Decision Phase ---
  9331. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931034,0.0645804)
  9332. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280918 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  9333. =>WM: (13551: S1 ^operator O1933)
  9334. 967: O: O1933 (predict-yes)
  9335. --- END Decision Phase ---
  9336. --- Application Phase ---
  9337. --- Firing Productions (PE) For State At Depth 1 ---
  9338. --- Inner Elaboration Phase, active level 1 (S1) ---
  9339. Firing apply*operator
  9340. -->
  9341. (I3 ^predict-yes N967 + :O )
  9342. Firing apply*operator*complete
  9343. -->
  9344. (I3 ^predict-no N966 - :O )
  9345. inner elaboration loop at bottom goal.
  9346. --- Change Working Memory (PE) ---
  9347. =>WM: (13552: I3 ^predict-yes N967)
  9348. <=WM: (13538: N966 ^status complete)
  9349. <=WM: (13537: I3 ^predict-no N966)
  9350. --- Firing Productions (IE) For State At Depth 1 ---
  9351. --- Inner Elaboration Phase, active level 1 (S1) ---
  9352. Firing monitor*world
  9353. -->
  9354. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9355. --- Change Working Memory (IE) ---
  9356. --- END Application Phase ---
  9357. --- Output Phase ---
  9358. ENV: Agent did: predict-yes for direction R in state State-A
  9359. In State-A moving R
  9360. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9361. predict error 0
  9362. dir: dir isR
  9363. --- END Output Phase ---
  9364. --- Input Phase ---
  9365. =>WM: (13556: I2 ^dir R)
  9366. =>WM: (13555: I2 ^reward 1)
  9367. =>WM: (13554: I2 ^see 1)
  9368. =>WM: (13553: N967 ^status complete)
  9369. <=WM: (13541: I2 ^dir R)
  9370. <=WM: (13540: I2 ^reward 1)
  9371. <=WM: (13539: I2 ^see 0)
  9372. =>WM: (13557: I2 ^level-1 R1-root)
  9373. <=WM: (13542: I2 ^level-1 L0-root)
  9374. --- END Input Phase ---
  9375. --- Proposal Phase ---
  9376. --- Inner Elaboration Phase, active level 1 (S1) ---
  9377. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  9378. -->
  9379. (S1 ^operator O1933 = 0.1398795999120246)
  9380. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  9381. -->
  9382. (S1 ^operator O1934 = 0.5523827002353495)
  9383. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9384. -->
  9385. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9386. -->
  9387. Firing elaborate*copy-see-to-output-link
  9388. -->
  9389. (I3 ^see 1 +)
  9390. Firing elaborate*reward*based*on*reward
  9391. -->
  9392. (R971 ^value 1 +)
  9393. (R1 ^reward R971 +)
  9394. Firing propose*predict-yes
  9395. -->
  9396. (O1935 ^name predict-yes +)
  9397. (S1 ^operator O1935 +)
  9398. Firing propose*predict-no
  9399. -->
  9400. (O1936 ^name predict-no +)
  9401. (S1 ^operator O1936 +)
  9402. Firing rl*prefer*rvt*predict-no*H0*4
  9403. -->
  9404. (S1 ^operator O1934 = 0.4476185940576797)
  9405. Firing rl*prefer*rvt*predict-yes*H0*3
  9406. -->
  9407. (S1 ^operator O1933 = 0.1844081745669553)
  9408. Firing prefer*rvt*predict-yes*H0
  9409. -->
  9410. Firing prefer*rvt*predict-no*H0
  9411. -->
  9412. Firing elaborate*copy-dir-to-output-link
  9413. -->
  9414. (I3 ^dir R +)
  9415. inner elaboration loop at bottom goal.
  9416. Retracting elaborate*copy-see-to-output-link
  9417. -->
  9418. (I3 ^see 0 +)
  9419. Retracting propose*predict-no
  9420. -->
  9421. (O1934 ^name predict-no +)
  9422. (S1 ^operator O1934 +)
  9423. Retracting propose*predict-yes
  9424. -->
  9425. (O1933 ^name predict-yes +)
  9426. (S1 ^operator O1933 +)
  9427. Retracting elaborate*reward*based*on*reward
  9428. -->
  9429. (R970 ^value 1 +)
  9430. (R1 ^reward R970 +)
  9431. Retracting elaborate*copy-dir-to-output-link
  9432. -->
  9433. (I3 ^dir R +)
  9434. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9435. -->
  9436. (S1 ^operator O1934 = -0.00558448899823713)
  9437. Retracting rl*prefer*rvt*predict-no*H0*4
  9438. -->
  9439. (S1 ^operator O1934 = 0.4476185940576797)
  9440. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9441. -->
  9442. (S1 ^operator O1933 = 0.8155962367832892)
  9443. Retracting rl*prefer*rvt*predict-yes*H0*3
  9444. -->
  9445. (S1 ^operator O1933 = 0.1844081745669553)
  9446. =>WM: (13564: S1 ^operator O1936 +)
  9447. =>WM: (13563: S1 ^operator O1935 +)
  9448. =>WM: (13562: O1936 ^name predict-no)
  9449. =>WM: (13561: O1935 ^name predict-yes)
  9450. =>WM: (13560: R971 ^value 1)
  9451. =>WM: (13559: R1 ^reward R971)
  9452. =>WM: (13558: I3 ^see 1)
  9453. <=WM: (13549: S1 ^operator O1933 +)
  9454. <=WM: (13551: S1 ^operator O1933)
  9455. <=WM: (13550: S1 ^operator O1934 +)
  9456. <=WM: (13544: R1 ^reward R970)
  9457. <=WM: (13543: I3 ^see 0)
  9458. <=WM: (13547: O1934 ^name predict-no)
  9459. <=WM: (13546: O1933 ^name predict-yes)
  9460. <=WM: (13545: R970 ^value 1)
  9461. --- Inner Elaboration Phase, active level 1 (S1) ---
  9462. Firing prefer*rvt*predict-yes*H0
  9463. -->
  9464. Firing rl*prefer*rvt*predict-yes*H0*3
  9465. -->
  9466. (S1 ^operator O1935 = 0.1844081745669553)
  9467. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9468. -->
  9469. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  9470. -->
  9471. (S1 ^operator O1935 = 0.1398795999120246)
  9472. Firing prefer*rvt*predict-no*H0
  9473. -->
  9474. Firing rl*prefer*rvt*predict-no*H0*4
  9475. -->
  9476. (S1 ^operator O1936 = 0.4476185940576797)
  9477. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9478. -->
  9479. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  9480. -->
  9481. (S1 ^operator O1936 = 0.5523827002353495)
  9482. inner elaboration loop at bottom goal.
  9483. Retracting rl*prefer*rvt*predict-no*H0*4
  9484. -->
  9485. (S1 ^operator O1934 = 0.4476185940576797)
  9486. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  9487. -->
  9488. (S1 ^operator O1934 = 0.5523827002353495)
  9489. Retracting rl*prefer*rvt*predict-yes*H0*3
  9490. -->
  9491. (S1 ^operator O1933 = 0.1844081745669553)
  9492. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  9493. -->
  9494. (S1 ^operator O1933 = 0.1398795999120246)
  9495. --- END Proposal Phase ---
  9496. --- Decision Phase ---
  9497. RL update rl*prefer*rvt*predict-yes*H0*3 0.67541 -0.491002 0.184408 -> 0.675409 -0.491002 0.184408(R,m,v=1,0.896341,0.0934835)
  9498. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324596 0.491 0.815596 -> 0.324595 0.491001 0.815596(R,m,v=1,1,0)
  9499. =>WM: (13565: S1 ^operator O1936)
  9500. 968: O: O1936 (predict-no)
  9501. --- END Decision Phase ---
  9502. --- Application Phase ---
  9503. --- Firing Productions (PE) For State At Depth 1 ---
  9504. --- Inner Elaboration Phase, active level 1 (S1) ---
  9505. Firing apply*operator
  9506. -->
  9507. (I3 ^predict-no N968 + :O )
  9508. Firing apply*operator*complete
  9509. -->
  9510. (I3 ^predict-yes N967 - :O )
  9511. inner elaboration loop at bottom goal.
  9512. --- Change Working Memory (PE) ---
  9513. =>WM: (13566: I3 ^predict-no N968)
  9514. <=WM: (13553: N967 ^status complete)
  9515. <=WM: (13552: I3 ^predict-yes N967)
  9516. --- Firing Productions (IE) For State At Depth 1 ---
  9517. --- Inner Elaboration Phase, active level 1 (S1) ---
  9518. Firing monitor*world
  9519. -->
  9520. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9521. --- Change Working Memory (IE) ---
  9522. --- END Application Phase ---
  9523. --- Output Phase ---
  9524. ENV: Agent did: predict-no for direction R in state State-B
  9525. In State-B moving R
  9526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9527. predict error 0
  9528. dir: dir isU
  9529. --- END Output Phase ---
  9530. --- Input Phase ---
  9531. =>WM: (13570: I2 ^dir U)
  9532. =>WM: (13569: I2 ^reward 1)
  9533. =>WM: (13568: I2 ^see 0)
  9534. =>WM: (13567: N968 ^status complete)
  9535. <=WM: (13556: I2 ^dir R)
  9536. <=WM: (13555: I2 ^reward 1)
  9537. <=WM: (13554: I2 ^see 1)
  9538. =>WM: (13571: I2 ^level-1 R0-root)
  9539. <=WM: (13557: I2 ^level-1 R1-root)
  9540. --- END Input Phase ---
  9541. --- Proposal Phase ---
  9542. --- Inner Elaboration Phase, active level 1 (S1) ---
  9543. Firing elaborate*copy-see-to-output-link
  9544. -->
  9545. (I3 ^see 0 +)
  9546. Firing elaborate*reward*based*on*reward
  9547. -->
  9548. (R972 ^value 1 +)
  9549. (R1 ^reward R972 +)
  9550. Firing propose*predict-yes
  9551. -->
  9552. (O1937 ^name predict-yes +)
  9553. (S1 ^operator O1937 +)
  9554. Firing propose*predict-no
  9555. -->
  9556. (O1938 ^name predict-no +)
  9557. (S1 ^operator O1938 +)
  9558. Firing rl*prefer*rvt*predict-no*H0*6
  9559. -->
  9560. (S1 ^operator O1936 = 0.9999999999999999)
  9561. Firing rl*prefer*rvt*predict-yes*H0*5
  9562. -->
  9563. (S1 ^operator O1935 = 0.)
  9564. Firing prefer*rvt*predict-yes*H0
  9565. -->
  9566. Firing prefer*rvt*predict-no*H0
  9567. -->
  9568. Firing elaborate*copy-dir-to-output-link
  9569. -->
  9570. (I3 ^dir U +)
  9571. inner elaboration loop at bottom goal.
  9572. Retracting elaborate*copy-see-to-output-link
  9573. -->
  9574. (I3 ^see 1 +)
  9575. Retracting propose*predict-no
  9576. -->
  9577. (O1936 ^name predict-no +)
  9578. (S1 ^operator O1936 +)
  9579. Retracting propose*predict-yes
  9580. -->
  9581. (O1935 ^name predict-yes +)
  9582. (S1 ^operator O1935 +)
  9583. Retracting elaborate*reward*based*on*reward
  9584. -->
  9585. (R971 ^value 1 +)
  9586. (R1 ^reward R971 +)
  9587. Retracting elaborate*copy-dir-to-output-link
  9588. -->
  9589. (I3 ^dir R +)
  9590. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  9591. -->
  9592. (S1 ^operator O1936 = 0.5523827002353495)
  9593. Retracting rl*prefer*rvt*predict-no*H0*4
  9594. -->
  9595. (S1 ^operator O1936 = 0.4476185940576797)
  9596. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  9597. -->
  9598. (S1 ^operator O1935 = 0.1398795999120246)
  9599. Retracting rl*prefer*rvt*predict-yes*H0*3
  9600. -->
  9601. (S1 ^operator O1935 = 0.1844075128644186)
  9602. =>WM: (13579: S1 ^operator O1938 +)
  9603. =>WM: (13578: S1 ^operator O1937 +)
  9604. =>WM: (13577: I3 ^dir U)
  9605. =>WM: (13576: O1938 ^name predict-no)
  9606. =>WM: (13575: O1937 ^name predict-yes)
  9607. =>WM: (13574: R972 ^value 1)
  9608. =>WM: (13573: R1 ^reward R972)
  9609. =>WM: (13572: I3 ^see 0)
  9610. <=WM: (13563: S1 ^operator O1935 +)
  9611. <=WM: (13564: S1 ^operator O1936 +)
  9612. <=WM: (13565: S1 ^operator O1936)
  9613. <=WM: (13548: I3 ^dir R)
  9614. <=WM: (13559: R1 ^reward R971)
  9615. <=WM: (13558: I3 ^see 1)
  9616. <=WM: (13562: O1936 ^name predict-no)
  9617. <=WM: (13561: O1935 ^name predict-yes)
  9618. <=WM: (13560: R971 ^value 1)
  9619. --- Inner Elaboration Phase, active level 1 (S1) ---
  9620. Firing prefer*rvt*predict-yes*H0
  9621. -->
  9622. Firing rl*prefer*rvt*predict-yes*H0*5
  9623. -->
  9624. (S1 ^operator O1937 = 0.)
  9625. Firing prefer*rvt*predict-no*H0
  9626. -->
  9627. Firing rl*prefer*rvt*predict-no*H0*6
  9628. -->
  9629. (S1 ^operator O1938 = 0.9999999999999999)
  9630. inner elaboration loop at bottom goal.
  9631. Retracting rl*prefer*rvt*predict-no*H0*6
  9632. -->
  9633. (S1 ^operator O1936 = 0.9999999999999999)
  9634. Retracting rl*prefer*rvt*predict-yes*H0*5
  9635. -->
  9636. (S1 ^operator O1935 = 0.)
  9637. --- END Proposal Phase ---
  9638. --- Decision Phase ---
  9639. RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622532 -0.174914 0.447618(R,m,v=1,0.92623,0.0688931)
  9640. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377469 0.174914 0.552383(R,m,v=1,1,0)
  9641. =>WM: (13580: S1 ^operator O1938)
  9642. 969: O: O1938 (predict-no)
  9643. --- END Decision Phase ---
  9644. --- Application Phase ---
  9645. --- Firing Productions (PE) For State At Depth 1 ---
  9646. --- Inner Elaboration Phase, active level 1 (S1) ---
  9647. Firing apply*operator
  9648. -->
  9649. (I3 ^predict-no N969 + :O )
  9650. Firing apply*operator*complete
  9651. -->
  9652. (I3 ^predict-no N968 - :O )
  9653. inner elaboration loop at bottom goal.
  9654. --- Change Working Memory (PE) ---
  9655. =>WM: (13581: I3 ^predict-no N969)
  9656. <=WM: (13567: N968 ^status complete)
  9657. <=WM: (13566: I3 ^predict-no N968)
  9658. --- Firing Productions (IE) For State At Depth 1 ---
  9659. --- Inner Elaboration Phase, active level 1 (S1) ---
  9660. Firing monitor*world
  9661. -->
  9662. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9663. --- Change Working Memory (IE) ---
  9664. --- END Application Phase ---
  9665. --- Output Phase ---
  9666. ENV: Agent did: predict-no for direction U in state State-B
  9667. In State-B moving U
  9668. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9669. predict error 0
  9670. dir: dir isR
  9671. --- END Output Phase ---
  9672. --- Input Phase ---
  9673. =>WM: (13585: I2 ^dir R)
  9674. =>WM: (13584: I2 ^reward 1)
  9675. =>WM: (13583: I2 ^see 0)
  9676. =>WM: (13582: N969 ^status complete)
  9677. <=WM: (13570: I2 ^dir U)
  9678. <=WM: (13569: I2 ^reward 1)
  9679. <=WM: (13568: I2 ^see 0)
  9680. =>WM: (13586: I2 ^level-1 R0-root)
  9681. <=WM: (13571: I2 ^level-1 R0-root)
  9682. --- END Input Phase ---
  9683. --- Proposal Phase ---
  9684. --- Inner Elaboration Phase, active level 1 (S1) ---
  9685. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  9686. -->
  9687. (S1 ^operator O1937 = 0.1664311307472832)
  9688. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  9689. -->
  9690. (S1 ^operator O1938 = 0.5523777234651187)
  9691. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9692. -->
  9693. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9694. -->
  9695. Firing elaborate*copy-see-to-output-link
  9696. -->
  9697. (I3 ^see 0 +)
  9698. Firing elaborate*reward*based*on*reward
  9699. -->
  9700. (R973 ^value 1 +)
  9701. (R1 ^reward R973 +)
  9702. Firing propose*predict-yes
  9703. -->
  9704. (O1939 ^name predict-yes +)
  9705. (S1 ^operator O1939 +)
  9706. Firing propose*predict-no
  9707. -->
  9708. (O1940 ^name predict-no +)
  9709. (S1 ^operator O1940 +)
  9710. Firing rl*prefer*rvt*predict-no*H0*4
  9711. -->
  9712. (S1 ^operator O1938 = 0.4476183999137253)
  9713. Firing rl*prefer*rvt*predict-yes*H0*3
  9714. -->
  9715. (S1 ^operator O1937 = 0.1844075128644186)
  9716. Firing prefer*rvt*predict-yes*H0
  9717. -->
  9718. Firing prefer*rvt*predict-no*H0
  9719. -->
  9720. Firing elaborate*copy-dir-to-output-link
  9721. -->
  9722. (I3 ^dir R +)
  9723. inner elaboration loop at bottom goal.
  9724. Retracting elaborate*copy-see-to-output-link
  9725. -->
  9726. (I3 ^see 0 +)
  9727. Retracting propose*predict-no
  9728. -->
  9729. (O1938 ^name predict-no +)
  9730. (S1 ^operator O1938 +)
  9731. Retracting propose*predict-yes
  9732. -->
  9733. (O1937 ^name predict-yes +)
  9734. (S1 ^operator O1937 +)
  9735. Retracting elaborate*reward*based*on*reward
  9736. -->
  9737. (R972 ^value 1 +)
  9738. (R1 ^reward R972 +)
  9739. Retracting elaborate*copy-dir-to-output-link
  9740. -->
  9741. (I3 ^dir U +)
  9742. Retracting rl*prefer*rvt*predict-no*H0*6
  9743. -->
  9744. (S1 ^operator O1938 = 0.9999999999999999)
  9745. Retracting rl*prefer*rvt*predict-yes*H0*5
  9746. -->
  9747. (S1 ^operator O1937 = 0.)
  9748. =>WM: (13593: S1 ^operator O1940 +)
  9749. =>WM: (13592: S1 ^operator O1939 +)
  9750. =>WM: (13591: I3 ^dir R)
  9751. =>WM: (13590: O1940 ^name predict-no)
  9752. =>WM: (13589: O1939 ^name predict-yes)
  9753. =>WM: (13588: R973 ^value 1)
  9754. =>WM: (13587: R1 ^reward R973)
  9755. <=WM: (13578: S1 ^operator O1937 +)
  9756. <=WM: (13579: S1 ^operator O1938 +)
  9757. <=WM: (13580: S1 ^operator O1938)
  9758. <=WM: (13577: I3 ^dir U)
  9759. <=WM: (13573: R1 ^reward R972)
  9760. <=WM: (13576: O1938 ^name predict-no)
  9761. <=WM: (13575: O1937 ^name predict-yes)
  9762. <=WM: (13574: R972 ^value 1)
  9763. --- Inner Elaboration Phase, active level 1 (S1) ---
  9764. Firing prefer*rvt*predict-yes*H0
  9765. -->
  9766. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  9767. -->
  9768. (S1 ^operator O1939 = 0.1664311307472832)
  9769. Firing rl*prefer*rvt*predict-yes*H0*3
  9770. -->
  9771. (S1 ^operator O1939 = 0.1844075128644186)
  9772. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9773. -->
  9774. Firing prefer*rvt*predict-no*H0
  9775. -->
  9776. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  9777. -->
  9778. (S1 ^operator O1940 = 0.5523777234651187)
  9779. Firing rl*prefer*rvt*predict-no*H0*4
  9780. -->
  9781. (S1 ^operator O1940 = 0.4476183999137253)
  9782. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9783. -->
  9784. inner elaboration loop at bottom goal.
  9785. Retracting rl*prefer*rvt*predict-no*H0*4
  9786. -->
  9787. (S1 ^operator O1938 = 0.4476183999137253)
  9788. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  9789. -->
  9790. (S1 ^operator O1938 = 0.5523777234651187)
  9791. Retracting rl*prefer*rvt*predict-yes*H0*3
  9792. -->
  9793. (S1 ^operator O1937 = 0.1844075128644186)
  9794. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  9795. -->
  9796. (S1 ^operator O1937 = 0.1664311307472832)
  9797. --- END Proposal Phase ---
  9798. --- Decision Phase ---
  9799. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9800. =>WM: (13594: S1 ^operator O1940)
  9801. 970: O: O1940 (predict-no)
  9802. --- END Decision Phase ---
  9803. --- Application Phase ---
  9804. --- Firing Productions (PE) For State At Depth 1 ---
  9805. --- Inner Elaboration Phase, active level 1 (S1) ---
  9806. Firing apply*operator
  9807. -->
  9808. (I3 ^predict-no N970 + :O )
  9809. Firing apply*operator*complete
  9810. -->
  9811. (I3 ^predict-no N969 - :O )
  9812. inner elaboration loop at bottom goal.
  9813. --- Change Working Memory (PE) ---
  9814. =>WM: (13595: I3 ^predict-no N970)
  9815. <=WM: (13582: N969 ^status complete)
  9816. <=WM: (13581: I3 ^predict-no N969)
  9817. --- Firing Productions (IE) For State At Depth 1 ---
  9818. --- Inner Elaboration Phase, active level 1 (S1) ---
  9819. Firing monitor*world
  9820. -->
  9821. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9822. --- Change Working Memory (IE) ---
  9823. --- END Application Phase ---
  9824. --- Output Phase ---
  9825. ENV: Agent did: predict-no for direction R in state State-B
  9826. In State-B moving R
  9827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9828. predict error 0
  9829. dir: dir isL
  9830. --- END Output Phase ---
  9831. --- Input Phase ---
  9832. =>WM: (13599: I2 ^dir L)
  9833. =>WM: (13598: I2 ^reward 1)
  9834. =>WM: (13597: I2 ^see 0)
  9835. =>WM: (13596: N970 ^status complete)
  9836. <=WM: (13585: I2 ^dir R)
  9837. <=WM: (13584: I2 ^reward 1)
  9838. <=WM: (13583: I2 ^see 0)
  9839. =>WM: (13600: I2 ^level-1 R0-root)
  9840. <=WM: (13586: I2 ^level-1 R0-root)
  9841. --- END Input Phase ---
  9842. --- Proposal Phase ---
  9843. --- Inner Elaboration Phase, active level 1 (S1) ---
  9844. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  9845. -->
  9846. (S1 ^operator O1939 = 0.61046163216022)
  9847. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  9848. -->
  9849. (S1 ^operator O1940 = 0.1063475139796038)
  9850. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9851. -->
  9852. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9853. -->
  9854. Firing elaborate*copy-see-to-output-link
  9855. -->
  9856. (I3 ^see 0 +)
  9857. Firing elaborate*reward*based*on*reward
  9858. -->
  9859. (R974 ^value 1 +)
  9860. (R1 ^reward R974 +)
  9861. Firing propose*predict-yes
  9862. -->
  9863. (O1941 ^name predict-yes +)
  9864. (S1 ^operator O1941 +)
  9865. Firing propose*predict-no
  9866. -->
  9867. (O1942 ^name predict-no +)
  9868. (S1 ^operator O1942 +)
  9869. Firing rl*prefer*rvt*predict-no*H0*2
  9870. -->
  9871. (S1 ^operator O1940 = 0.387336901415443)
  9872. Firing rl*prefer*rvt*predict-yes*H0*1
  9873. -->
  9874. (S1 ^operator O1939 = 0.3895395093503376)
  9875. Firing prefer*rvt*predict-yes*H0
  9876. -->
  9877. Firing prefer*rvt*predict-no*H0
  9878. -->
  9879. Firing elaborate*copy-dir-to-output-link
  9880. -->
  9881. (I3 ^dir L +)
  9882. inner elaboration loop at bottom goal.
  9883. Retracting elaborate*copy-see-to-output-link
  9884. -->
  9885. (I3 ^see 0 +)
  9886. Retracting propose*predict-no
  9887. -->
  9888. (O1940 ^name predict-no +)
  9889. (S1 ^operator O1940 +)
  9890. Retracting propose*predict-yes
  9891. -->
  9892. (O1939 ^name predict-yes +)
  9893. (S1 ^operator O1939 +)
  9894. Retracting elaborate*reward*based*on*reward
  9895. -->
  9896. (R973 ^value 1 +)
  9897. (R1 ^reward R973 +)
  9898. Retracting elaborate*copy-dir-to-output-link
  9899. -->
  9900. (I3 ^dir R +)
  9901. Retracting rl*prefer*rvt*predict-no*H0*4
  9902. -->
  9903. (S1 ^operator O1940 = 0.4476183999137253)
  9904. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  9905. -->
  9906. (S1 ^operator O1940 = 0.5523777234651187)
  9907. Retracting rl*prefer*rvt*predict-yes*H0*3
  9908. -->
  9909. (S1 ^operator O1939 = 0.1844075128644186)
  9910. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  9911. -->
  9912. (S1 ^operator O1939 = 0.1664311307472832)
  9913. =>WM: (13607: S1 ^operator O1942 +)
  9914. =>WM: (13606: S1 ^operator O1941 +)
  9915. =>WM: (13605: I3 ^dir L)
  9916. =>WM: (13604: O1942 ^name predict-no)
  9917. =>WM: (13603: O1941 ^name predict-yes)
  9918. =>WM: (13602: R974 ^value 1)
  9919. =>WM: (13601: R1 ^reward R974)
  9920. <=WM: (13592: S1 ^operator O1939 +)
  9921. <=WM: (13593: S1 ^operator O1940 +)
  9922. <=WM: (13594: S1 ^operator O1940)
  9923. <=WM: (13591: I3 ^dir R)
  9924. <=WM: (13587: R1 ^reward R973)
  9925. <=WM: (13590: O1940 ^name predict-no)
  9926. <=WM: (13589: O1939 ^name predict-yes)
  9927. <=WM: (13588: R973 ^value 1)
  9928. --- Inner Elaboration Phase, active level 1 (S1) ---
  9929. Firing prefer*rvt*predict-yes*H0
  9930. -->
  9931. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  9932. -->
  9933. (S1 ^operator O1941 = 0.61046163216022)
  9934. Firing rl*prefer*rvt*predict-yes*H0*1
  9935. -->
  9936. (S1 ^operator O1941 = 0.3895395093503376)
  9937. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  9938. -->
  9939. Firing prefer*rvt*predict-no*H0
  9940. -->
  9941. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  9942. -->
  9943. (S1 ^operator O1942 = 0.1063475139796038)
  9944. Firing rl*prefer*rvt*predict-no*H0*2
  9945. -->
  9946. (S1 ^operator O1942 = 0.387336901415443)
  9947. Firing prefer*rvt*predict-no*H0*2*v1*H1
  9948. -->
  9949. inner elaboration loop at bottom goal.
  9950. Retracting rl*prefer*rvt*predict-no*H0*2
  9951. -->
  9952. (S1 ^operator O1940 = 0.387336901415443)
  9953. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  9954. -->
  9955. (S1 ^operator O1940 = 0.1063475139796038)
  9956. Retracting rl*prefer*rvt*predict-yes*H0*1
  9957. -->
  9958. (S1 ^operator O1939 = 0.3895395093503376)
  9959. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  9960. -->
  9961. (S1 ^operator O1939 = 0.61046163216022)
  9962. --- END Proposal Phase ---
  9963. --- Decision Phase ---
  9964. RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447618 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.926829,0.0683727)
  9965. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377465 0.174913 0.552378 -> 0.377465 0.174913 0.552378(R,m,v=1,1,0)
  9966. =>WM: (13608: S1 ^operator O1941)
  9967. 971: O: O1941 (predict-yes)
  9968. --- END Decision Phase ---
  9969. --- Application Phase ---
  9970. --- Firing Productions (PE) For State At Depth 1 ---
  9971. --- Inner Elaboration Phase, active level 1 (S1) ---
  9972. Firing apply*operator
  9973. -->
  9974. (I3 ^predict-yes N971 + :O )
  9975. Firing apply*operator*complete
  9976. -->
  9977. (I3 ^predict-no N970 - :O )
  9978. inner elaboration loop at bottom goal.
  9979. --- Change Working Memory (PE) ---
  9980. =>WM: (13609: I3 ^predict-yes N971)
  9981. <=WM: (13596: N970 ^status complete)
  9982. <=WM: (13595: I3 ^predict-no N970)
  9983. --- Firing Productions (IE) For State At Depth 1 ---
  9984. --- Inner Elaboration Phase, active level 1 (S1) ---
  9985. Firing monitor*world
  9986. -->
  9987. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9988. --- Change Working Memory (IE) ---
  9989. --- END Application Phase ---
  9990. --- Output Phase ---
  9991. ENV: Agent did: predict-yes for direction L in state State-B
  9992. In State-B moving L
  9993. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9994. predict error 0
  9995. dir: dir isU
  9996. --- END Output Phase ---
  9997. --- Input Phase ---
  9998. =>WM: (13613: I2 ^dir U)
  9999. =>WM: (13612: I2 ^reward 1)
  10000. =>WM: (13611: I2 ^see 1)
  10001. =>WM: (13610: N971 ^status complete)
  10002. <=WM: (13599: I2 ^dir L)
  10003. <=WM: (13598: I2 ^reward 1)
  10004. <=WM: (13597: I2 ^see 0)
  10005. =>WM: (13614: I2 ^level-1 L1-root)
  10006. <=WM: (13600: I2 ^level-1 R0-root)
  10007. --- END Input Phase ---
  10008. --- Proposal Phase ---
  10009. --- Inner Elaboration Phase, active level 1 (S1) ---
  10010. Firing elaborate*copy-see-to-output-link
  10011. -->
  10012. (I3 ^see 1 +)
  10013. Firing elaborate*reward*based*on*reward
  10014. -->
  10015. (R975 ^value 1 +)
  10016. (R1 ^reward R975 +)
  10017. Firing propose*predict-yes
  10018. -->
  10019. (O1943 ^name predict-yes +)
  10020. (S1 ^operator O1943 +)
  10021. Firing propose*predict-no
  10022. -->
  10023. (O1944 ^name predict-no +)
  10024. (S1 ^operator O1944 +)
  10025. Firing rl*prefer*rvt*predict-no*H0*6
  10026. -->
  10027. (S1 ^operator O1942 = 0.9999999999999999)
  10028. Firing rl*prefer*rvt*predict-yes*H0*5
  10029. -->
  10030. (S1 ^operator O1941 = 0.)
  10031. Firing prefer*rvt*predict-yes*H0
  10032. -->
  10033. Firing prefer*rvt*predict-no*H0
  10034. -->
  10035. Firing elaborate*copy-dir-to-output-link
  10036. -->
  10037. (I3 ^dir U +)
  10038. inner elaboration loop at bottom goal.
  10039. Retracting elaborate*copy-see-to-output-link
  10040. -->
  10041. (I3 ^see 0 +)
  10042. Retracting propose*predict-no
  10043. -->
  10044. (O1942 ^name predict-no +)
  10045. (S1 ^operator O1942 +)
  10046. Retracting propose*predict-yes
  10047. -->
  10048. (O1941 ^name predict-yes +)
  10049. (S1 ^operator O1941 +)
  10050. Retracting elaborate*reward*based*on*reward
  10051. -->
  10052. (R974 ^value 1 +)
  10053. (R1 ^reward R974 +)
  10054. Retracting elaborate*copy-dir-to-output-link
  10055. -->
  10056. (I3 ^dir L +)
  10057. Retracting rl*prefer*rvt*predict-no*H0*2
  10058. -->
  10059. (S1 ^operator O1942 = 0.387336901415443)
  10060. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  10061. -->
  10062. (S1 ^operator O1942 = 0.1063475139796038)
  10063. Retracting rl*prefer*rvt*predict-yes*H0*1
  10064. -->
  10065. (S1 ^operator O1941 = 0.3895395093503376)
  10066. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  10067. -->
  10068. (S1 ^operator O1941 = 0.61046163216022)
  10069. =>WM: (13622: S1 ^operator O1944 +)
  10070. =>WM: (13621: S1 ^operator O1943 +)
  10071. =>WM: (13620: I3 ^dir U)
  10072. =>WM: (13619: O1944 ^name predict-no)
  10073. =>WM: (13618: O1943 ^name predict-yes)
  10074. =>WM: (13617: R975 ^value 1)
  10075. =>WM: (13616: R1 ^reward R975)
  10076. =>WM: (13615: I3 ^see 1)
  10077. <=WM: (13606: S1 ^operator O1941 +)
  10078. <=WM: (13608: S1 ^operator O1941)
  10079. <=WM: (13607: S1 ^operator O1942 +)
  10080. <=WM: (13605: I3 ^dir L)
  10081. <=WM: (13601: R1 ^reward R974)
  10082. <=WM: (13572: I3 ^see 0)
  10083. <=WM: (13604: O1942 ^name predict-no)
  10084. <=WM: (13603: O1941 ^name predict-yes)
  10085. <=WM: (13602: R974 ^value 1)
  10086. --- Inner Elaboration Phase, active level 1 (S1) ---
  10087. Firing prefer*rvt*predict-yes*H0
  10088. -->
  10089. Firing rl*prefer*rvt*predict-yes*H0*5
  10090. -->
  10091. (S1 ^operator O1943 = 0.)
  10092. Firing prefer*rvt*predict-no*H0
  10093. -->
  10094. Firing rl*prefer*rvt*predict-no*H0*6
  10095. -->
  10096. (S1 ^operator O1944 = 0.9999999999999999)
  10097. inner elaboration loop at bottom goal.
  10098. Retracting rl*prefer*rvt*predict-no*H0*6
  10099. -->
  10100. (S1 ^operator O1942 = 0.9999999999999999)
  10101. Retracting rl*prefer*rvt*predict-yes*H0*5
  10102. -->
  10103. (S1 ^operator O1941 = 0.)
  10104. --- END Proposal Phase ---
  10105. --- Decision Phase ---
  10106. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.888889,0.0993789)
  10107. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610462 -> 0.288049 0.322413 0.610461(R,m,v=1,1,0)
  10108. =>WM: (13623: S1 ^operator O1944)
  10109. 972: O: O1944 (predict-no)
  10110. --- END Decision Phase ---
  10111. --- Application Phase ---
  10112. --- Firing Productions (PE) For State At Depth 1 ---
  10113. --- Inner Elaboration Phase, active level 1 (S1) ---
  10114. Firing apply*operator
  10115. -->
  10116. (I3 ^predict-no N972 + :O )
  10117. Firing apply*operator*complete
  10118. -->
  10119. (I3 ^predict-yes N971 - :O )
  10120. inner elaboration loop at bottom goal.
  10121. --- Change Working Memory (PE) ---
  10122. =>WM: (13624: I3 ^predict-no N972)
  10123. <=WM: (13610: N971 ^status complete)
  10124. <=WM: (13609: I3 ^predict-yes N971)
  10125. --- Firing Productions (IE) For State At Depth 1 ---
  10126. --- Inner Elaboration Phase, active level 1 (S1) ---
  10127. Firing monitor*world
  10128. -->
  10129. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10130. --- Change Working Memory (IE) ---
  10131. --- END Application Phase ---
  10132. --- Output Phase ---
  10133. ENV: Agent did: predict-no for direction U in state State-A
  10134. In State-A moving U
  10135. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10136. predict error 0
  10137. dir: dir isU
  10138. --- END Output Phase ---
  10139. --- Input Phase ---
  10140. =>WM: (13628: I2 ^dir U)
  10141. =>WM: (13627: I2 ^reward 1)
  10142. =>WM: (13626: I2 ^see 0)
  10143. =>WM: (13625: N972 ^status complete)
  10144. <=WM: (13613: I2 ^dir U)
  10145. <=WM: (13612: I2 ^reward 1)
  10146. <=WM: (13611: I2 ^see 1)
  10147. =>WM: (13629: I2 ^level-1 L1-root)
  10148. <=WM: (13614: I2 ^level-1 L1-root)
  10149. --- END Input Phase ---
  10150. --- Proposal Phase ---
  10151. --- Inner Elaboration Phase, active level 1 (S1) ---
  10152. Firing elaborate*copy-see-to-output-link
  10153. -->
  10154. (I3 ^see 0 +)
  10155. Firing elaborate*reward*based*on*reward
  10156. -->
  10157. (R976 ^value 1 +)
  10158. (R1 ^reward R976 +)
  10159. Firing propose*predict-yes
  10160. -->
  10161. (O1945 ^name predict-yes +)
  10162. (S1 ^operator O1945 +)
  10163. Firing propose*predict-no
  10164. -->
  10165. (O1946 ^name predict-no +)
  10166. (S1 ^operator O1946 +)
  10167. Firing rl*prefer*rvt*predict-no*H0*6
  10168. -->
  10169. (S1 ^operator O1944 = 0.9999999999999999)
  10170. Firing rl*prefer*rvt*predict-yes*H0*5
  10171. -->
  10172. (S1 ^operator O1943 = 0.)
  10173. Firing prefer*rvt*predict-yes*H0
  10174. -->
  10175. Firing prefer*rvt*predict-no*H0
  10176. -->
  10177. Firing elaborate*copy-dir-to-output-link
  10178. -->
  10179. (I3 ^dir U +)
  10180. inner elaboration loop at bottom goal.
  10181. Retracting elaborate*copy-see-to-output-link
  10182. -->
  10183. (I3 ^see 1 +)
  10184. Retracting propose*predict-no
  10185. -->
  10186. (O1944 ^name predict-no +)
  10187. (S1 ^operator O1944 +)
  10188. Retracting propose*predict-yes
  10189. -->
  10190. (O1943 ^name predict-yes +)
  10191. (S1 ^operator O1943 +)
  10192. Retracting elaborate*reward*based*on*reward
  10193. -->
  10194. (R975 ^value 1 +)
  10195. (R1 ^reward R975 +)
  10196. Retracting elaborate*copy-dir-to-output-link
  10197. -->
  10198. (I3 ^dir U +)
  10199. Retracting rl*prefer*rvt*predict-no*H0*6
  10200. -->
  10201. (S1 ^operator O1944 = 0.9999999999999999)
  10202. Retracting rl*prefer*rvt*predict-yes*H0*5
  10203. -->
  10204. (S1 ^operator O1943 = 0.)
  10205. =>WM: (13636: S1 ^operator O1946 +)
  10206. =>WM: (13635: S1 ^operator O1945 +)
  10207. =>WM: (13634: O1946 ^name predict-no)
  10208. =>WM: (13633: O1945 ^name predict-yes)
  10209. =>WM: (13632: R976 ^value 1)
  10210. =>WM: (13631: R1 ^reward R976)
  10211. =>WM: (13630: I3 ^see 0)
  10212. <=WM: (13621: S1 ^operator O1943 +)
  10213. <=WM: (13622: S1 ^operator O1944 +)
  10214. <=WM: (13623: S1 ^operator O1944)
  10215. <=WM: (13616: R1 ^reward R975)
  10216. <=WM: (13615: I3 ^see 1)
  10217. <=WM: (13619: O1944 ^name predict-no)
  10218. <=WM: (13618: O1943 ^name predict-yes)
  10219. <=WM: (13617: R975 ^value 1)
  10220. --- Inner Elaboration Phase, active level 1 (S1) ---
  10221. Firing prefer*rvt*predict-yes*H0
  10222. -->
  10223. Firing rl*prefer*rvt*predict-yes*H0*5
  10224. -->
  10225. (S1 ^operator O1945 = 0.)
  10226. Firing prefer*rvt*predict-no*H0
  10227. -->
  10228. Firing rl*prefer*rvt*predict-no*H0*6
  10229. -->
  10230. (S1 ^operator O1946 = 0.9999999999999999)
  10231. inner elaboration loop at bottom goal.
  10232. Retracting rl*prefer*rvt*predict-no*H0*6
  10233. -->
  10234. (S1 ^operator O1944 = 0.9999999999999999)
  10235. Retracting rl*prefer*rvt*predict-yes*H0*5
  10236. -->
  10237. (S1 ^operator O1943 = 0.)
  10238. --- END Proposal Phase ---
  10239. --- Decision Phase ---
  10240. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10241. =>WM: (13637: S1 ^operator O1946)
  10242. 973: O: O1946 (predict-no)
  10243. --- END Decision Phase ---
  10244. --- Application Phase ---
  10245. --- Firing Productions (PE) For State At Depth 1 ---
  10246. --- Inner Elaboration Phase, active level 1 (S1) ---
  10247. Firing apply*operator
  10248. -->
  10249. (I3 ^predict-no N973 + :O )
  10250. Firing apply*operator*complete
  10251. -->
  10252. (I3 ^predict-no N972 - :O )
  10253. inner elaboration loop at bottom goal.
  10254. --- Change Working Memory (PE) ---
  10255. =>WM: (13638: I3 ^predict-no N973)
  10256. <=WM: (13625: N972 ^status complete)
  10257. <=WM: (13624: I3 ^predict-no N972)
  10258. --- Firing Productions (IE) For State At Depth 1 ---
  10259. --- Inner Elaboration Phase, active level 1 (S1) ---
  10260. Firing monitor*world
  10261. -->
  10262. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10263. --- Change Working Memory (IE) ---
  10264. --- END Application Phase ---
  10265. --- Output Phase ---
  10266. ENV: Agent did: predict-no for direction U in state State-A
  10267. In State-A moving U
  10268. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10269. predict error 0
  10270. dir: dir isU
  10271. --- END Output Phase ---
  10272. --- Input Phase ---
  10273. =>WM: (13642: I2 ^dir U)
  10274. =>WM: (13641: I2 ^reward 1)
  10275. =>WM: (13640: I2 ^see 0)
  10276. =>WM: (13639: N973 ^status complete)
  10277. <=WM: (13628: I2 ^dir U)
  10278. <=WM: (13627: I2 ^reward 1)
  10279. <=WM: (13626: I2 ^see 0)
  10280. =>WM: (13643: I2 ^level-1 L1-root)
  10281. <=WM: (13629: I2 ^level-1 L1-root)
  10282. --- END Input Phase ---
  10283. --- Proposal Phase ---
  10284. --- Inner Elaboration Phase, active level 1 (S1) ---
  10285. Firing elaborate*copy-see-to-output-link
  10286. -->
  10287. (I3 ^see 0 +)
  10288. Firing elaborate*reward*based*on*reward
  10289. -->
  10290. (R977 ^value 1 +)
  10291. (R1 ^reward R977 +)
  10292. Firing propose*predict-yes
  10293. -->
  10294. (O1947 ^name predict-yes +)
  10295. (S1 ^operator O1947 +)
  10296. Firing propose*predict-no
  10297. -->
  10298. (O1948 ^name predict-no +)
  10299. (S1 ^operator O1948 +)
  10300. Firing rl*prefer*rvt*predict-no*H0*6
  10301. -->
  10302. (S1 ^operator O1946 = 0.9999999999999999)
  10303. Firing rl*prefer*rvt*predict-yes*H0*5
  10304. -->
  10305. (S1 ^operator O1945 = 0.)
  10306. Firing prefer*rvt*predict-yes*H0
  10307. -->
  10308. Firing prefer*rvt*predict-no*H0
  10309. -->
  10310. Firing elaborate*copy-dir-to-output-link
  10311. -->
  10312. (I3 ^dir U +)
  10313. inner elaboration loop at bottom goal.
  10314. Retracting elaborate*copy-see-to-output-link
  10315. -->
  10316. (I3 ^see 0 +)
  10317. Retracting propose*predict-no
  10318. -->
  10319. (O1946 ^name predict-no +)
  10320. (S1 ^operator O1946 +)
  10321. Retracting propose*predict-yes
  10322. -->
  10323. (O1945 ^name predict-yes +)
  10324. (S1 ^operator O1945 +)
  10325. Retracting elaborate*reward*based*on*reward
  10326. -->
  10327. (R976 ^value 1 +)
  10328. (R1 ^reward R976 +)
  10329. Retracting elaborate*copy-dir-to-output-link
  10330. -->
  10331. (I3 ^dir U +)
  10332. Retracting rl*prefer*rvt*predict-no*H0*6
  10333. -->
  10334. (S1 ^operator O1946 = 0.9999999999999999)
  10335. Retracting rl*prefer*rvt*predict-yes*H0*5
  10336. -->
  10337. (S1 ^operator O1945 = 0.)
  10338. =>WM: (13649: S1 ^operator O1948 +)
  10339. =>WM: (13648: S1 ^operator O1947 +)
  10340. =>WM: (13647: O1948 ^name predict-no)
  10341. =>WM: (13646: O1947 ^name predict-yes)
  10342. =>WM: (13645: R977 ^value 1)
  10343. =>WM: (13644: R1 ^reward R977)
  10344. <=WM: (13635: S1 ^operator O1945 +)
  10345. <=WM: (13636: S1 ^operator O1946 +)
  10346. <=WM: (13637: S1 ^operator O1946)
  10347. <=WM: (13631: R1 ^reward R976)
  10348. <=WM: (13634: O1946 ^name predict-no)
  10349. <=WM: (13633: O1945 ^name predict-yes)
  10350. <=WM: (13632: R976 ^value 1)
  10351. --- Inner Elaboration Phase, active level 1 (S1) ---
  10352. Firing prefer*rvt*predict-yes*H0
  10353. -->
  10354. Firing rl*prefer*rvt*predict-yes*H0*5
  10355. -->
  10356. (S1 ^operator O1947 = 0.)
  10357. Firing prefer*rvt*predict-no*H0
  10358. -->
  10359. Firing rl*prefer*rvt*predict-no*H0*6
  10360. -->
  10361. (S1 ^operator O1948 = 0.9999999999999999)
  10362. inner elaboration loop at bottom goal.
  10363. Retracting rl*prefer*rvt*predict-no*H0*6
  10364. -->
  10365. (S1 ^operator O1946 = 0.9999999999999999)
  10366. Retracting rl*prefer*rvt*predict-yes*H0*5
  10367. -->
  10368. (S1 ^operator O1945 = 0.)
  10369. --- END Proposal Phase ---
  10370. --- Decision Phase ---
  10371. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10372. =>WM: (13650: S1 ^operator O1948)
  10373. 974: O: O1948 (predict-no)
  10374. --- END Decision Phase ---
  10375. --- Application Phase ---
  10376. --- Firing Productions (PE) For State At Depth 1 ---
  10377. --- Inner Elaboration Phase, active level 1 (S1) ---
  10378. Firing apply*operator
  10379. -->
  10380. (I3 ^predict-no N974 + :O )
  10381. Firing apply*operator*complete
  10382. -->
  10383. (I3 ^predict-no N973 - :O )
  10384. inner elaboration loop at bottom goal.
  10385. --- Change Working Memory (PE) ---
  10386. =>WM: (13651: I3 ^predict-no N974)
  10387. <=WM: (13639: N973 ^status complete)
  10388. <=WM: (13638: I3 ^predict-no N973)
  10389. --- Firing Productions (IE) For State At Depth 1 ---
  10390. --- Inner Elaboration Phase, active level 1 (S1) ---
  10391. Firing monitor*world
  10392. -->
  10393. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10394. --- Change Working Memory (IE) ---
  10395. --- END Application Phase ---
  10396. --- Output Phase ---
  10397. ENV: Agent did: predict-no for direction U in state State-A
  10398. In State-A moving U
  10399. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10400. predict error 0
  10401. dir: dir isU
  10402. --- END Output Phase ---
  10403. --- Input Phase ---
  10404. =>WM: (13655: I2 ^dir U)
  10405. =>WM: (13654: I2 ^reward 1)
  10406. =>WM: (13653: I2 ^see 0)
  10407. =>WM: (13652: N974 ^status complete)
  10408. <=WM: (13642: I2 ^dir U)
  10409. <=WM: (13641: I2 ^reward 1)
  10410. <=WM: (13640: I2 ^see 0)
  10411. =>WM: (13656: I2 ^level-1 L1-root)
  10412. <=WM: (13643: I2 ^level-1 L1-root)
  10413. --- END Input Phase ---
  10414. --- Proposal Phase ---
  10415. --- Inner Elaboration Phase, active level 1 (S1) ---
  10416. Firing elaborate*copy-see-to-output-link
  10417. -->
  10418. (I3 ^see 0 +)
  10419. Firing elaborate*reward*based*on*reward
  10420. -->
  10421. (R978 ^value 1 +)
  10422. (R1 ^reward R978 +)
  10423. Firing propose*predict-yes
  10424. -->
  10425. (O1949 ^name predict-yes +)
  10426. (S1 ^operator O1949 +)
  10427. Firing propose*predict-no
  10428. -->
  10429. (O1950 ^name predict-no +)
  10430. (S1 ^operator O1950 +)
  10431. Firing rl*prefer*rvt*predict-no*H0*6
  10432. -->
  10433. (S1 ^operator O1948 = 0.9999999999999999)
  10434. Firing rl*prefer*rvt*predict-yes*H0*5
  10435. -->
  10436. (S1 ^operator O1947 = 0.)
  10437. Firing prefer*rvt*predict-yes*H0
  10438. -->
  10439. Firing prefer*rvt*predict-no*H0
  10440. -->
  10441. Firing elaborate*copy-dir-to-output-link
  10442. -->
  10443. (I3 ^dir U +)
  10444. inner elaboration loop at bottom goal.
  10445. Retracting elaborate*copy-see-to-output-link
  10446. -->
  10447. (I3 ^see 0 +)
  10448. Retracting propose*predict-no
  10449. -->
  10450. (O1948 ^name predict-no +)
  10451. (S1 ^operator O1948 +)
  10452. Retracting propose*predict-yes
  10453. -->
  10454. (O1947 ^name predict-yes +)
  10455. (S1 ^operator O1947 +)
  10456. Retracting elaborate*reward*based*on*reward
  10457. -->
  10458. (R977 ^value 1 +)
  10459. (R1 ^reward R977 +)
  10460. Retracting elaborate*copy-dir-to-output-link
  10461. -->
  10462. (I3 ^dir U +)
  10463. Retracting rl*prefer*rvt*predict-no*H0*6
  10464. -->
  10465. (S1 ^operator O1948 = 0.9999999999999999)
  10466. Retracting rl*prefer*rvt*predict-yes*H0*5
  10467. -->
  10468. (S1 ^operator O1947 = 0.)
  10469. =>WM: (13662: S1 ^operator O1950 +)
  10470. =>WM: (13661: S1 ^operator O1949 +)
  10471. =>WM: (13660: O1950 ^name predict-no)
  10472. =>WM: (13659: O1949 ^name predict-yes)
  10473. =>WM: (13658: R978 ^value 1)
  10474. =>WM: (13657: R1 ^reward R978)
  10475. <=WM: (13648: S1 ^operator O1947 +)
  10476. <=WM: (13649: S1 ^operator O1948 +)
  10477. <=WM: (13650: S1 ^operator O1948)
  10478. <=WM: (13644: R1 ^reward R977)
  10479. <=WM: (13647: O1948 ^name predict-no)
  10480. <=WM: (13646: O1947 ^name predict-yes)
  10481. <=WM: (13645: R977 ^value 1)
  10482. --- Inner Elaboration Phase, active level 1 (S1) ---
  10483. Firing prefer*rvt*predict-yes*H0
  10484. -->
  10485. Firing rl*prefer*rvt*predict-yes*H0*5
  10486. -->
  10487. (S1 ^operator O1949 = 0.)
  10488. Firing prefer*rvt*predict-no*H0
  10489. -->
  10490. Firing rl*prefer*rvt*predict-no*H0*6
  10491. -->
  10492. (S1 ^operator O1950 = 0.9999999999999999)
  10493. inner elaboration loop at bottom goal.
  10494. Retracting rl*prefer*rvt*predict-no*H0*6
  10495. -->
  10496. (S1 ^operator O1948 = 0.9999999999999999)
  10497. Retracting rl*prefer*rvt*predict-yes*H0*5
  10498. -->
  10499. (S1 ^operator O1947 = 0.)
  10500. --- END Proposal Phase ---
  10501. --- Decision Phase ---
  10502. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10503. =>WM: (13663: S1 ^operator O1950)
  10504. 975: O: O1950 (predict-no)
  10505. --- END Decision Phase ---
  10506. --- Application Phase ---
  10507. --- Firing Productions (PE) For State At Depth 1 ---
  10508. --- Inner Elaboration Phase, active level 1 (S1) ---
  10509. Firing apply*operator
  10510. -->
  10511. (I3 ^predict-no N975 + :O )
  10512. Firing apply*operator*complete
  10513. -->
  10514. (I3 ^predict-no N974 - :O )
  10515. inner elaboration loop at bottom goal.
  10516. --- Change Working Memory (PE) ---
  10517. =>WM: (13664: I3 ^predict-no N975)
  10518. <=WM: (13652: N974 ^status complete)
  10519. <=WM: (13651: I3 ^predict-no N974)
  10520. --- Firing Productions (IE) For State At Depth 1 ---
  10521. --- Inner Elaboration Phase, active level 1 (S1) ---
  10522. Firing monitor*world
  10523. -->
  10524. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10525. --- Change Working Memory (IE) ---
  10526. --- END Application Phase ---
  10527. --- Output Phase ---
  10528. ENV: Agent did: predict-no for direction U in state State-A
  10529. In State-A moving U
  10530. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10531. predict error 0
  10532. dir: dir isR
  10533. --- END Output Phase ---
  10534. --- Input Phase ---
  10535. =>WM: (13668: I2 ^dir R)
  10536. =>WM: (13667: I2 ^reward 1)
  10537. =>WM: (13666: I2 ^see 0)
  10538. =>WM: (13665: N975 ^status complete)
  10539. <=WM: (13655: I2 ^dir U)
  10540. <=WM: (13654: I2 ^reward 1)
  10541. <=WM: (13653: I2 ^see 0)
  10542. =>WM: (13669: I2 ^level-1 L1-root)
  10543. <=WM: (13656: I2 ^level-1 L1-root)
  10544. --- END Input Phase ---
  10545. --- Proposal Phase ---
  10546. --- Inner Elaboration Phase, active level 1 (S1) ---
  10547. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  10548. -->
  10549. (S1 ^operator O1950 = -0.02155734064455064)
  10550. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  10551. -->
  10552. (S1 ^operator O1949 = 0.8155758449529213)
  10553. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10554. -->
  10555. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10556. -->
  10557. Firing elaborate*copy-see-to-output-link
  10558. -->
  10559. (I3 ^see 0 +)
  10560. Firing elaborate*reward*based*on*reward
  10561. -->
  10562. (R979 ^value 1 +)
  10563. (R1 ^reward R979 +)
  10564. Firing propose*predict-yes
  10565. -->
  10566. (O1951 ^name predict-yes +)
  10567. (S1 ^operator O1951 +)
  10568. Firing propose*predict-no
  10569. -->
  10570. (O1952 ^name predict-no +)
  10571. (S1 ^operator O1952 +)
  10572. Firing rl*prefer*rvt*predict-no*H0*4
  10573. -->
  10574. (S1 ^operator O1950 = 0.4476189814068987)
  10575. Firing rl*prefer*rvt*predict-yes*H0*3
  10576. -->
  10577. (S1 ^operator O1949 = 0.1844075128644186)
  10578. Firing prefer*rvt*predict-yes*H0
  10579. -->
  10580. Firing prefer*rvt*predict-no*H0
  10581. -->
  10582. Firing elaborate*copy-dir-to-output-link
  10583. -->
  10584. (I3 ^dir R +)
  10585. inner elaboration loop at bottom goal.
  10586. Retracting elaborate*copy-see-to-output-link
  10587. -->
  10588. (I3 ^see 0 +)
  10589. Retracting propose*predict-no
  10590. -->
  10591. (O1950 ^name predict-no +)
  10592. (S1 ^operator O1950 +)
  10593. Retracting propose*predict-yes
  10594. -->
  10595. (O1949 ^name predict-yes +)
  10596. (S1 ^operator O1949 +)
  10597. Retracting elaborate*reward*based*on*reward
  10598. -->
  10599. (R978 ^value 1 +)
  10600. (R1 ^reward R978 +)
  10601. Retracting elaborate*copy-dir-to-output-link
  10602. -->
  10603. (I3 ^dir U +)
  10604. Retracting rl*prefer*rvt*predict-no*H0*6
  10605. -->
  10606. (S1 ^operator O1950 = 0.9999999999999999)
  10607. Retracting rl*prefer*rvt*predict-yes*H0*5
  10608. -->
  10609. (S1 ^operator O1949 = 0.)
  10610. =>WM: (13676: S1 ^operator O1952 +)
  10611. =>WM: (13675: S1 ^operator O1951 +)
  10612. =>WM: (13674: I3 ^dir R)
  10613. =>WM: (13673: O1952 ^name predict-no)
  10614. =>WM: (13672: O1951 ^name predict-yes)
  10615. =>WM: (13671: R979 ^value 1)
  10616. =>WM: (13670: R1 ^reward R979)
  10617. <=WM: (13661: S1 ^operator O1949 +)
  10618. <=WM: (13662: S1 ^operator O1950 +)
  10619. <=WM: (13663: S1 ^operator O1950)
  10620. <=WM: (13620: I3 ^dir U)
  10621. <=WM: (13657: R1 ^reward R978)
  10622. <=WM: (13660: O1950 ^name predict-no)
  10623. <=WM: (13659: O1949 ^name predict-yes)
  10624. <=WM: (13658: R978 ^value 1)
  10625. --- Inner Elaboration Phase, active level 1 (S1) ---
  10626. Firing prefer*rvt*predict-yes*H0
  10627. -->
  10628. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  10629. -->
  10630. (S1 ^operator O1951 = 0.8155758449529213)
  10631. Firing rl*prefer*rvt*predict-yes*H0*3
  10632. -->
  10633. (S1 ^operator O1951 = 0.1844075128644186)
  10634. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10635. -->
  10636. Firing prefer*rvt*predict-no*H0
  10637. -->
  10638. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  10639. -->
  10640. (S1 ^operator O1952 = -0.02155734064455064)
  10641. Firing rl*prefer*rvt*predict-no*H0*4
  10642. -->
  10643. (S1 ^operator O1952 = 0.4476189814068987)
  10644. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10645. -->
  10646. inner elaboration loop at bottom goal.
  10647. Retracting rl*prefer*rvt*predict-no*H0*4
  10648. -->
  10649. (S1 ^operator O1950 = 0.4476189814068987)
  10650. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  10651. -->
  10652. (S1 ^operator O1950 = -0.02155734064455064)
  10653. Retracting rl*prefer*rvt*predict-yes*H0*3
  10654. -->
  10655. (S1 ^operator O1949 = 0.1844075128644186)
  10656. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  10657. -->
  10658. (S1 ^operator O1949 = 0.8155758449529213)
  10659. --- END Proposal Phase ---
  10660. --- Decision Phase ---
  10661. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10662. =>WM: (13677: S1 ^operator O1951)
  10663. 976: O: O1951 (predict-yes)
  10664. --- END Decision Phase ---
  10665. --- Application Phase ---
  10666. --- Firing Productions (PE) For State At Depth 1 ---
  10667. --- Inner Elaboration Phase, active level 1 (S1) ---
  10668. Firing apply*operator
  10669. -->
  10670. (I3 ^predict-yes N976 + :O )
  10671. Firing apply*operator*complete
  10672. -->
  10673. (I3 ^predict-no N975 - :O )
  10674. inner elaboration loop at bottom goal.
  10675. --- Change Working Memory (PE) ---
  10676. =>WM: (13678: I3 ^predict-yes N976)
  10677. <=WM: (13665: N975 ^status complete)
  10678. <=WM: (13664: I3 ^predict-no N975)
  10679. --- Firing Productions (IE) For State At Depth 1 ---
  10680. --- Inner Elaboration Phase, active level 1 (S1) ---
  10681. Firing monitor*world
  10682. -->
  10683. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10684. --- Change Working Memory (IE) ---
  10685. --- END Application Phase ---
  10686. --- Output Phase ---
  10687. ENV: Agent did: predict-yes for direction R in state State-A
  10688. In State-A moving R
  10689. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10690. predict error 0
  10691. dir: dir isL
  10692. --- END Output Phase ---
  10693. --- Input Phase ---
  10694. =>WM: (13682: I2 ^dir L)
  10695. =>WM: (13681: I2 ^reward 1)
  10696. =>WM: (13680: I2 ^see 1)
  10697. =>WM: (13679: N976 ^status complete)
  10698. <=WM: (13668: I2 ^dir R)
  10699. <=WM: (13667: I2 ^reward 1)
  10700. <=WM: (13666: I2 ^see 0)
  10701. =>WM: (13683: I2 ^level-1 R1-root)
  10702. <=WM: (13669: I2 ^level-1 L1-root)
  10703. --- END Input Phase ---
  10704. --- Proposal Phase ---
  10705. --- Inner Elaboration Phase, active level 1 (S1) ---
  10706. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  10707. -->
  10708. (S1 ^operator O1951 = 0.6104589917494525)
  10709. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  10710. -->
  10711. (S1 ^operator O1952 = 0.2714993082286609)
  10712. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10713. -->
  10714. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10715. -->
  10716. Firing elaborate*copy-see-to-output-link
  10717. -->
  10718. (I3 ^see 1 +)
  10719. Firing elaborate*reward*based*on*reward
  10720. -->
  10721. (R980 ^value 1 +)
  10722. (R1 ^reward R980 +)
  10723. Firing propose*predict-yes
  10724. -->
  10725. (O1953 ^name predict-yes +)
  10726. (S1 ^operator O1953 +)
  10727. Firing propose*predict-no
  10728. -->
  10729. (O1954 ^name predict-no +)
  10730. (S1 ^operator O1954 +)
  10731. Firing rl*prefer*rvt*predict-no*H0*2
  10732. -->
  10733. (S1 ^operator O1952 = 0.387336901415443)
  10734. Firing rl*prefer*rvt*predict-yes*H0*1
  10735. -->
  10736. (S1 ^operator O1951 = 0.389539338123754)
  10737. Firing prefer*rvt*predict-yes*H0
  10738. -->
  10739. Firing prefer*rvt*predict-no*H0
  10740. -->
  10741. Firing elaborate*copy-dir-to-output-link
  10742. -->
  10743. (I3 ^dir L +)
  10744. inner elaboration loop at bottom goal.
  10745. Retracting elaborate*copy-see-to-output-link
  10746. -->
  10747. (I3 ^see 0 +)
  10748. Retracting propose*predict-no
  10749. -->
  10750. (O1952 ^name predict-no +)
  10751. (S1 ^operator O1952 +)
  10752. Retracting propose*predict-yes
  10753. -->
  10754. (O1951 ^name predict-yes +)
  10755. (S1 ^operator O1951 +)
  10756. Retracting elaborate*reward*based*on*reward
  10757. -->
  10758. (R979 ^value 1 +)
  10759. (R1 ^reward R979 +)
  10760. Retracting elaborate*copy-dir-to-output-link
  10761. -->
  10762. (I3 ^dir R +)
  10763. Retracting rl*prefer*rvt*predict-no*H0*4
  10764. -->
  10765. (S1 ^operator O1952 = 0.4476189814068987)
  10766. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  10767. -->
  10768. (S1 ^operator O1952 = -0.02155734064455064)
  10769. Retracting rl*prefer*rvt*predict-yes*H0*3
  10770. -->
  10771. (S1 ^operator O1951 = 0.1844075128644186)
  10772. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  10773. -->
  10774. (S1 ^operator O1951 = 0.8155758449529213)
  10775. =>WM: (13691: S1 ^operator O1954 +)
  10776. =>WM: (13690: S1 ^operator O1953 +)
  10777. =>WM: (13689: I3 ^dir L)
  10778. =>WM: (13688: O1954 ^name predict-no)
  10779. =>WM: (13687: O1953 ^name predict-yes)
  10780. =>WM: (13686: R980 ^value 1)
  10781. =>WM: (13685: R1 ^reward R980)
  10782. =>WM: (13684: I3 ^see 1)
  10783. <=WM: (13675: S1 ^operator O1951 +)
  10784. <=WM: (13677: S1 ^operator O1951)
  10785. <=WM: (13676: S1 ^operator O1952 +)
  10786. <=WM: (13674: I3 ^dir R)
  10787. <=WM: (13670: R1 ^reward R979)
  10788. <=WM: (13630: I3 ^see 0)
  10789. <=WM: (13673: O1952 ^name predict-no)
  10790. <=WM: (13672: O1951 ^name predict-yes)
  10791. <=WM: (13671: R979 ^value 1)
  10792. --- Inner Elaboration Phase, active level 1 (S1) ---
  10793. Firing prefer*rvt*predict-yes*H0
  10794. -->
  10795. Firing rl*prefer*rvt*predict-yes*H0*1
  10796. -->
  10797. (S1 ^operator O1953 = 0.389539338123754)
  10798. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10799. -->
  10800. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  10801. -->
  10802. (S1 ^operator O1953 = 0.6104589917494525)
  10803. Firing prefer*rvt*predict-no*H0
  10804. -->
  10805. Firing rl*prefer*rvt*predict-no*H0*2
  10806. -->
  10807. (S1 ^operator O1954 = 0.387336901415443)
  10808. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10809. -->
  10810. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  10811. -->
  10812. (S1 ^operator O1954 = 0.2714993082286609)
  10813. inner elaboration loop at bottom goal.
  10814. Retracting rl*prefer*rvt*predict-no*H0*2
  10815. -->
  10816. (S1 ^operator O1952 = 0.387336901415443)
  10817. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  10818. -->
  10819. (S1 ^operator O1952 = 0.2714993082286609)
  10820. Retracting rl*prefer*rvt*predict-yes*H0*1
  10821. -->
  10822. (S1 ^operator O1951 = 0.389539338123754)
  10823. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  10824. -->
  10825. (S1 ^operator O1951 = 0.6104589917494525)
  10826. --- END Proposal Phase ---
  10827. --- Decision Phase ---
  10828. RL update rl*prefer*rvt*predict-yes*H0*3 0.675409 -0.491002 0.184408 -> 0.675412 -0.491002 0.18441(R,m,v=1,0.89697,0.0929786)
  10829. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324569 0.491006 0.815576 -> 0.324573 0.491006 0.815578(R,m,v=1,1,0)
  10830. =>WM: (13692: S1 ^operator O1953)
  10831. 977: O: O1953 (predict-yes)
  10832. --- END Decision Phase ---
  10833. --- Application Phase ---
  10834. --- Firing Productions (PE) For State At Depth 1 ---
  10835. --- Inner Elaboration Phase, active level 1 (S1) ---
  10836. Firing apply*operator
  10837. -->
  10838. (I3 ^predict-yes N977 + :O )
  10839. Firing apply*operator*complete
  10840. -->
  10841. (I3 ^predict-yes N976 - :O )
  10842. inner elaboration loop at bottom goal.
  10843. --- Change Working Memory (PE) ---
  10844. =>WM: (13693: I3 ^predict-yes N977)
  10845. <=WM: (13679: N976 ^status complete)
  10846. <=WM: (13678: I3 ^predict-yes N976)
  10847. --- Firing Productions (IE) For State At Depth 1 ---
  10848. --- Inner Elaboration Phase, active level 1 (S1) ---
  10849. Firing monitor*world
  10850. -->
  10851. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10852. --- Change Working Memory (IE) ---
  10853. --- END Application Phase ---
  10854. --- Output Phase ---
  10855. ENV: Agent did: predict-yes for direction L in state State-B
  10856. In State-B moving L
  10857. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10858. predict error 0
  10859. dir: dir isL
  10860. --- END Output Phase ---
  10861. --- Input Phase ---
  10862. =>WM: (13697: I2 ^dir L)
  10863. =>WM: (13696: I2 ^reward 1)
  10864. =>WM: (13695: I2 ^see 1)
  10865. =>WM: (13694: N977 ^status complete)
  10866. <=WM: (13682: I2 ^dir L)
  10867. <=WM: (13681: I2 ^reward 1)
  10868. <=WM: (13680: I2 ^see 1)
  10869. =>WM: (13698: I2 ^level-1 L1-root)
  10870. <=WM: (13683: I2 ^level-1 R1-root)
  10871. --- END Input Phase ---
  10872. --- Proposal Phase ---
  10873. --- Inner Elaboration Phase, active level 1 (S1) ---
  10874. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  10875. -->
  10876. (S1 ^operator O1954 = 0.6126626863207351)
  10877. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  10878. -->
  10879. (S1 ^operator O1953 = -0.02274740735326741)
  10880. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10881. -->
  10882. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10883. -->
  10884. Firing elaborate*copy-see-to-output-link
  10885. -->
  10886. (I3 ^see 1 +)
  10887. Firing elaborate*reward*based*on*reward
  10888. -->
  10889. (R981 ^value 1 +)
  10890. (R1 ^reward R981 +)
  10891. Firing propose*predict-yes
  10892. -->
  10893. (O1955 ^name predict-yes +)
  10894. (S1 ^operator O1955 +)
  10895. Firing propose*predict-no
  10896. -->
  10897. (O1956 ^name predict-no +)
  10898. (S1 ^operator O1956 +)
  10899. Firing rl*prefer*rvt*predict-no*H0*2
  10900. -->
  10901. (S1 ^operator O1954 = 0.387336901415443)
  10902. Firing rl*prefer*rvt*predict-yes*H0*1
  10903. -->
  10904. (S1 ^operator O1953 = 0.389539338123754)
  10905. Firing prefer*rvt*predict-yes*H0
  10906. -->
  10907. Firing prefer*rvt*predict-no*H0
  10908. -->
  10909. Firing elaborate*copy-dir-to-output-link
  10910. -->
  10911. (I3 ^dir L +)
  10912. inner elaboration loop at bottom goal.
  10913. Retracting elaborate*copy-see-to-output-link
  10914. -->
  10915. (I3 ^see 1 +)
  10916. Retracting propose*predict-no
  10917. -->
  10918. (O1954 ^name predict-no +)
  10919. (S1 ^operator O1954 +)
  10920. Retracting propose*predict-yes
  10921. -->
  10922. (O1953 ^name predict-yes +)
  10923. (S1 ^operator O1953 +)
  10924. Retracting elaborate*reward*based*on*reward
  10925. -->
  10926. (R980 ^value 1 +)
  10927. (R1 ^reward R980 +)
  10928. Retracting elaborate*copy-dir-to-output-link
  10929. -->
  10930. (I3 ^dir L +)
  10931. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  10932. -->
  10933. (S1 ^operator O1954 = 0.2714993082286609)
  10934. Retracting rl*prefer*rvt*predict-no*H0*2
  10935. -->
  10936. (S1 ^operator O1954 = 0.387336901415443)
  10937. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  10938. -->
  10939. (S1 ^operator O1953 = 0.6104589917494525)
  10940. Retracting rl*prefer*rvt*predict-yes*H0*1
  10941. -->
  10942. (S1 ^operator O1953 = 0.389539338123754)
  10943. =>WM: (13704: S1 ^operator O1956 +)
  10944. =>WM: (13703: S1 ^operator O1955 +)
  10945. =>WM: (13702: O1956 ^name predict-no)
  10946. =>WM: (13701: O1955 ^name predict-yes)
  10947. =>WM: (13700: R981 ^value 1)
  10948. =>WM: (13699: R1 ^reward R981)
  10949. <=WM: (13690: S1 ^operator O1953 +)
  10950. <=WM: (13692: S1 ^operator O1953)
  10951. <=WM: (13691: S1 ^operator O1954 +)
  10952. <=WM: (13685: R1 ^reward R980)
  10953. <=WM: (13688: O1954 ^name predict-no)
  10954. <=WM: (13687: O1953 ^name predict-yes)
  10955. <=WM: (13686: R980 ^value 1)
  10956. --- Inner Elaboration Phase, active level 1 (S1) ---
  10957. Firing prefer*rvt*predict-yes*H0
  10958. -->
  10959. Firing rl*prefer*rvt*predict-yes*H0*1
  10960. -->
  10961. (S1 ^operator O1955 = 0.389539338123754)
  10962. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  10963. -->
  10964. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  10965. -->
  10966. (S1 ^operator O1955 = -0.02274740735326741)
  10967. Firing prefer*rvt*predict-no*H0
  10968. -->
  10969. Firing rl*prefer*rvt*predict-no*H0*2
  10970. -->
  10971. (S1 ^operator O1956 = 0.387336901415443)
  10972. Firing prefer*rvt*predict-no*H0*2*v1*H1
  10973. -->
  10974. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  10975. -->
  10976. (S1 ^operator O1956 = 0.6126626863207351)
  10977. inner elaboration loop at bottom goal.
  10978. Retracting rl*prefer*rvt*predict-no*H0*2
  10979. -->
  10980. (S1 ^operator O1954 = 0.387336901415443)
  10981. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  10982. -->
  10983. (S1 ^operator O1954 = 0.6126626863207351)
  10984. Retracting rl*prefer*rvt*predict-yes*H0*1
  10985. -->
  10986. (S1 ^operator O1953 = 0.389539338123754)
  10987. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  10988. -->
  10989. (S1 ^operator O1953 = -0.02274740735326741)
  10990. --- END Proposal Phase ---
  10991. --- Decision Phase ---
  10992. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.889571,0.0988412)
  10993. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.32241 0.610459 -> 0.288049 0.322411 0.610459(R,m,v=1,1,0)
  10994. =>WM: (13705: S1 ^operator O1956)
  10995. 978: O: O1956 (predict-no)
  10996. --- END Decision Phase ---
  10997. --- Application Phase ---
  10998. --- Firing Productions (PE) For State At Depth 1 ---
  10999. --- Inner Elaboration Phase, active level 1 (S1) ---
  11000. Firing apply*operator
  11001. -->
  11002. (I3 ^predict-no N978 + :O )
  11003. Firing apply*operator*complete
  11004. -->
  11005. (I3 ^predict-yes N977 - :O )
  11006. inner elaboration loop at bottom goal.
  11007. --- Change Working Memory (PE) ---
  11008. =>WM: (13706: I3 ^predict-no N978)
  11009. <=WM: (13694: N977 ^status complete)
  11010. <=WM: (13693: I3 ^predict-yes N977)
  11011. --- Firing Productions (IE) For State At Depth 1 ---
  11012. --- Inner Elaboration Phase, active level 1 (S1) ---
  11013. Firing monitor*world
  11014. -->
  11015. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11016. --- Change Working Memory (IE) ---
  11017. --- END Application Phase ---
  11018. --- Output Phase ---
  11019. ENV: Agent did: predict-no for direction L in state State-A
  11020. In State-A moving L
  11021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11022. predict error 0
  11023. dir: dir isR
  11024. --- END Output Phase ---
  11025. --- Input Phase ---
  11026. =>WM: (13710: I2 ^dir R)
  11027. =>WM: (13709: I2 ^reward 1)
  11028. =>WM: (13708: I2 ^see 0)
  11029. =>WM: (13707: N978 ^status complete)
  11030. <=WM: (13697: I2 ^dir L)
  11031. <=WM: (13696: I2 ^reward 1)
  11032. <=WM: (13695: I2 ^see 1)
  11033. =>WM: (13711: I2 ^level-1 L0-root)
  11034. <=WM: (13698: I2 ^level-1 L1-root)
  11035. --- END Input Phase ---
  11036. --- Proposal Phase ---
  11037. --- Inner Elaboration Phase, active level 1 (S1) ---
  11038. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11039. -->
  11040. (S1 ^operator O1955 = 0.8155955750807526)
  11041. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11042. -->
  11043. (S1 ^operator O1956 = -0.00558448899823713)
  11044. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11045. -->
  11046. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11047. -->
  11048. Firing elaborate*copy-see-to-output-link
  11049. -->
  11050. (I3 ^see 0 +)
  11051. Firing elaborate*reward*based*on*reward
  11052. -->
  11053. (R982 ^value 1 +)
  11054. (R1 ^reward R982 +)
  11055. Firing propose*predict-yes
  11056. -->
  11057. (O1957 ^name predict-yes +)
  11058. (S1 ^operator O1957 +)
  11059. Firing propose*predict-no
  11060. -->
  11061. (O1958 ^name predict-no +)
  11062. (S1 ^operator O1958 +)
  11063. Firing rl*prefer*rvt*predict-no*H0*4
  11064. -->
  11065. (S1 ^operator O1956 = 0.4476189814068987)
  11066. Firing rl*prefer*rvt*predict-yes*H0*3
  11067. -->
  11068. (S1 ^operator O1955 = 0.1844100091918176)
  11069. Firing prefer*rvt*predict-yes*H0
  11070. -->
  11071. Firing prefer*rvt*predict-no*H0
  11072. -->
  11073. Firing elaborate*copy-dir-to-output-link
  11074. -->
  11075. (I3 ^dir R +)
  11076. inner elaboration loop at bottom goal.
  11077. Retracting elaborate*copy-see-to-output-link
  11078. -->
  11079. (I3 ^see 1 +)
  11080. Retracting propose*predict-no
  11081. -->
  11082. (O1956 ^name predict-no +)
  11083. (S1 ^operator O1956 +)
  11084. Retracting propose*predict-yes
  11085. -->
  11086. (O1955 ^name predict-yes +)
  11087. (S1 ^operator O1955 +)
  11088. Retracting elaborate*reward*based*on*reward
  11089. -->
  11090. (R981 ^value 1 +)
  11091. (R1 ^reward R981 +)
  11092. Retracting elaborate*copy-dir-to-output-link
  11093. -->
  11094. (I3 ^dir L +)
  11095. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  11096. -->
  11097. (S1 ^operator O1956 = 0.6126626863207351)
  11098. Retracting rl*prefer*rvt*predict-no*H0*2
  11099. -->
  11100. (S1 ^operator O1956 = 0.387336901415443)
  11101. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  11102. -->
  11103. (S1 ^operator O1955 = -0.02274740735326741)
  11104. Retracting rl*prefer*rvt*predict-yes*H0*1
  11105. -->
  11106. (S1 ^operator O1955 = 0.389539588642773)
  11107. =>WM: (13719: S1 ^operator O1958 +)
  11108. =>WM: (13718: S1 ^operator O1957 +)
  11109. =>WM: (13717: I3 ^dir R)
  11110. =>WM: (13716: O1958 ^name predict-no)
  11111. =>WM: (13715: O1957 ^name predict-yes)
  11112. =>WM: (13714: R982 ^value 1)
  11113. =>WM: (13713: R1 ^reward R982)
  11114. =>WM: (13712: I3 ^see 0)
  11115. <=WM: (13703: S1 ^operator O1955 +)
  11116. <=WM: (13704: S1 ^operator O1956 +)
  11117. <=WM: (13705: S1 ^operator O1956)
  11118. <=WM: (13689: I3 ^dir L)
  11119. <=WM: (13699: R1 ^reward R981)
  11120. <=WM: (13684: I3 ^see 1)
  11121. <=WM: (13702: O1956 ^name predict-no)
  11122. <=WM: (13701: O1955 ^name predict-yes)
  11123. <=WM: (13700: R981 ^value 1)
  11124. --- Inner Elaboration Phase, active level 1 (S1) ---
  11125. Firing prefer*rvt*predict-yes*H0
  11126. -->
  11127. Firing rl*prefer*rvt*predict-yes*H0*3
  11128. -->
  11129. (S1 ^operator O1957 = 0.1844100091918176)
  11130. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11131. -->
  11132. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11133. -->
  11134. (S1 ^operator O1957 = 0.8155955750807526)
  11135. Firing prefer*rvt*predict-no*H0
  11136. -->
  11137. Firing rl*prefer*rvt*predict-no*H0*4
  11138. -->
  11139. (S1 ^operator O1958 = 0.4476189814068987)
  11140. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11141. -->
  11142. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11143. -->
  11144. (S1 ^operator O1958 = -0.00558448899823713)
  11145. inner elaboration loop at bottom goal.
  11146. Retracting rl*prefer*rvt*predict-no*H0*4
  11147. -->
  11148. (S1 ^operator O1956 = 0.4476189814068987)
  11149. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11150. -->
  11151. (S1 ^operator O1956 = -0.00558448899823713)
  11152. Retracting rl*prefer*rvt*predict-yes*H0*3
  11153. -->
  11154. (S1 ^operator O1955 = 0.1844100091918176)
  11155. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11156. -->
  11157. (S1 ^operator O1955 = 0.8155955750807526)
  11158. --- END Proposal Phase ---
  11159. --- Decision Phase ---
  11160. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931429,0.0642365)
  11161. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  11162. =>WM: (13720: S1 ^operator O1957)
  11163. 979: O: O1957 (predict-yes)
  11164. --- END Decision Phase ---
  11165. --- Application Phase ---
  11166. --- Firing Productions (PE) For State At Depth 1 ---
  11167. --- Inner Elaboration Phase, active level 1 (S1) ---
  11168. Firing apply*operator
  11169. -->
  11170. (I3 ^predict-yes N979 + :O )
  11171. Firing apply*operator*complete
  11172. -->
  11173. (I3 ^predict-no N978 - :O )
  11174. inner elaboration loop at bottom goal.
  11175. --- Change Working Memory (PE) ---
  11176. =>WM: (13721: I3 ^predict-yes N979)
  11177. <=WM: (13707: N978 ^status complete)
  11178. <=WM: (13706: I3 ^predict-no N978)
  11179. --- Firing Productions (IE) For State At Depth 1 ---
  11180. --- Inner Elaboration Phase, active level 1 (S1) ---
  11181. Firing monitor*world
  11182. -->
  11183. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11184. --- Change Working Memory (IE) ---
  11185. --- END Application Phase ---
  11186. --- Output Phase ---
  11187. ENV: Agent did: predict-yes for direction R in state State-A
  11188. In State-A moving R
  11189. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11190. predict error 0
  11191. dir: dir isU
  11192. --- END Output Phase ---
  11193. /|\--- Input Phase ---
  11194. =>WM: (13725: I2 ^dir U)
  11195. =>WM: (13724: I2 ^reward 1)
  11196. =>WM: (13723: I2 ^see 1)
  11197. =>WM: (13722: N979 ^status complete)
  11198. <=WM: (13710: I2 ^dir R)
  11199. <=WM: (13709: I2 ^reward 1)
  11200. <=WM: (13708: I2 ^see 0)
  11201. =>WM: (13726: I2 ^level-1 R1-root)
  11202. <=WM: (13711: I2 ^level-1 L0-root)
  11203. --- END Input Phase ---
  11204. --- Proposal Phase ---
  11205. --- Inner Elaboration Phase, active level 1 (S1) ---
  11206. Firing elaborate*copy-see-to-output-link
  11207. -->
  11208. (I3 ^see 1 +)
  11209. Firing elaborate*reward*based*on*reward
  11210. -->
  11211. (R983 ^value 1 +)
  11212. (R1 ^reward R983 +)
  11213. Firing propose*predict-yes
  11214. -->
  11215. (O1959 ^name predict-yes +)
  11216. (S1 ^operator O1959 +)
  11217. Firing propose*predict-no
  11218. -->
  11219. (O1960 ^name predict-no +)
  11220. (S1 ^operator O1960 +)
  11221. Firing rl*prefer*rvt*predict-no*H0*6
  11222. -->
  11223. (S1 ^operator O1958 = 0.9999999999999999)
  11224. Firing rl*prefer*rvt*predict-yes*H0*5
  11225. -->
  11226. (S1 ^operator O1957 = 0.)
  11227. Firing prefer*rvt*predict-yes*H0
  11228. -->
  11229. Firing prefer*rvt*predict-no*H0
  11230. -->
  11231. Firing elaborate*copy-dir-to-output-link
  11232. -->
  11233. (I3 ^dir U +)
  11234. inner elaboration loop at bottom goal.
  11235. Retracting elaborate*copy-see-to-output-link
  11236. -->
  11237. (I3 ^see 0 +)
  11238. Retracting propose*predict-no
  11239. -->
  11240. (O1958 ^name predict-no +)
  11241. (S1 ^operator O1958 +)
  11242. Retracting propose*predict-yes
  11243. -->
  11244. (O1957 ^name predict-yes +)
  11245. (S1 ^operator O1957 +)
  11246. Retracting elaborate*reward*based*on*reward
  11247. -->
  11248. (R982 ^value 1 +)
  11249. (R1 ^reward R982 +)
  11250. Retracting elaborate*copy-dir-to-output-link
  11251. -->
  11252. (I3 ^dir R +)
  11253. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11254. -->
  11255. (S1 ^operator O1958 = -0.00558448899823713)
  11256. Retracting rl*prefer*rvt*predict-no*H0*4
  11257. -->
  11258. (S1 ^operator O1958 = 0.4476189814068987)
  11259. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11260. -->
  11261. (S1 ^operator O1957 = 0.8155955750807526)
  11262. Retracting rl*prefer*rvt*predict-yes*H0*3
  11263. -->
  11264. (S1 ^operator O1957 = 0.1844100091918176)
  11265. =>WM: (13734: S1 ^operator O1960 +)
  11266. =>WM: (13733: S1 ^operator O1959 +)
  11267. =>WM: (13732: I3 ^dir U)
  11268. =>WM: (13731: O1960 ^name predict-no)
  11269. =>WM: (13730: O1959 ^name predict-yes)
  11270. =>WM: (13729: R983 ^value 1)
  11271. =>WM: (13728: R1 ^reward R983)
  11272. =>WM: (13727: I3 ^see 1)
  11273. <=WM: (13718: S1 ^operator O1957 +)
  11274. <=WM: (13720: S1 ^operator O1957)
  11275. <=WM: (13719: S1 ^operator O1958 +)
  11276. <=WM: (13717: I3 ^dir R)
  11277. <=WM: (13713: R1 ^reward R982)
  11278. <=WM: (13712: I3 ^see 0)
  11279. <=WM: (13716: O1958 ^name predict-no)
  11280. <=WM: (13715: O1957 ^name predict-yes)
  11281. <=WM: (13714: R982 ^value 1)
  11282. --- Inner Elaboration Phase, active level 1 (S1) ---
  11283. Firing prefer*rvt*predict-yes*H0
  11284. -->
  11285. Firing rl*prefer*rvt*predict-yes*H0*5
  11286. -->
  11287. (S1 ^operator O1959 = 0.)
  11288. Firing prefer*rvt*predict-no*H0
  11289. -->
  11290. Firing rl*prefer*rvt*predict-no*H0*6
  11291. -->
  11292. (S1 ^operator O1960 = 0.9999999999999999)
  11293. inner elaboration loop at bottom goal.
  11294. Retracting rl*prefer*rvt*predict-no*H0*6
  11295. -->
  11296. (S1 ^operator O1958 = 0.9999999999999999)
  11297. Retracting rl*prefer*rvt*predict-yes*H0*5
  11298. -->
  11299. (S1 ^operator O1957 = 0.)
  11300. --- END Proposal Phase ---
  11301. --- Decision Phase ---
  11302. RL update rl*prefer*rvt*predict-yes*H0*3 0.675412 -0.491002 0.18441 -> 0.675411 -0.491002 0.184409(R,m,v=1,0.89759,0.092479)
  11303. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324595 0.491001 0.815596 -> 0.324594 0.491001 0.815595(R,m,v=1,1,0)
  11304. =>WM: (13735: S1 ^operator O1960)
  11305. 980: O: O1960 (predict-no)
  11306. --- END Decision Phase ---
  11307. --- Application Phase ---
  11308. --- Firing Productions (PE) For State At Depth 1 ---
  11309. --- Inner Elaboration Phase, active level 1 (S1) ---
  11310. Firing apply*operator
  11311. -->
  11312. (I3 ^predict-no N980 + :O )
  11313. Firing apply*operator*complete
  11314. -->
  11315. (I3 ^predict-yes N979 - :O )
  11316. inner elaboration loop at bottom goal.
  11317. --- Change Working Memory (PE) ---
  11318. =>WM: (13736: I3 ^predict-no N980)
  11319. <=WM: (13722: N979 ^status complete)
  11320. <=WM: (13721: I3 ^predict-yes N979)
  11321. --- Firing Productions (IE) For State At Depth 1 ---
  11322. --- Inner Elaboration Phase, active level 1 (S1) ---
  11323. Firing monitor*world
  11324. -->
  11325. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11326. --- Change Working Memory (IE) ---
  11327. --- END Application Phase ---
  11328. --- Output Phase ---
  11329. ENV: Agent did: predict-no for direction U in state State-B
  11330. In State-B moving U
  11331. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11332. predict error 0
  11333. dir: dir isU
  11334. --- END Output Phase ---
  11335. -/|--- Input Phase ---
  11336. =>WM: (13740: I2 ^dir U)
  11337. =>WM: (13739: I2 ^reward 1)
  11338. =>WM: (13738: I2 ^see 0)
  11339. =>WM: (13737: N980 ^status complete)
  11340. <=WM: (13725: I2 ^dir U)
  11341. <=WM: (13724: I2 ^reward 1)
  11342. <=WM: (13723: I2 ^see 1)
  11343. =>WM: (13741: I2 ^level-1 R1-root)
  11344. <=WM: (13726: I2 ^level-1 R1-root)
  11345. --- END Input Phase ---
  11346. --- Proposal Phase ---
  11347. --- Inner Elaboration Phase, active level 1 (S1) ---
  11348. Firing elaborate*copy-see-to-output-link
  11349. -->
  11350. (I3 ^see 0 +)
  11351. Firing elaborate*reward*based*on*reward
  11352. -->
  11353. (R984 ^value 1 +)
  11354. (R1 ^reward R984 +)
  11355. Firing propose*predict-yes
  11356. -->
  11357. (O1961 ^name predict-yes +)
  11358. (S1 ^operator O1961 +)
  11359. Firing propose*predict-no
  11360. -->
  11361. (O1962 ^name predict-no +)
  11362. (S1 ^operator O1962 +)
  11363. Firing rl*prefer*rvt*predict-no*H0*6
  11364. -->
  11365. (S1 ^operator O1960 = 0.9999999999999999)
  11366. Firing rl*prefer*rvt*predict-yes*H0*5
  11367. -->
  11368. (S1 ^operator O1959 = 0.)
  11369. Firing prefer*rvt*predict-yes*H0
  11370. -->
  11371. Firing prefer*rvt*predict-no*H0
  11372. -->
  11373. Firing elaborate*copy-dir-to-output-link
  11374. -->
  11375. (I3 ^dir U +)
  11376. inner elaboration loop at bottom goal.
  11377. Retracting elaborate*copy-see-to-output-link
  11378. -->
  11379. (I3 ^see 1 +)
  11380. Retracting propose*predict-no
  11381. -->
  11382. (O1960 ^name predict-no +)
  11383. (S1 ^operator O1960 +)
  11384. Retracting propose*predict-yes
  11385. -->
  11386. (O1959 ^name predict-yes +)
  11387. (S1 ^operator O1959 +)
  11388. Retracting elaborate*reward*based*on*reward
  11389. -->
  11390. (R983 ^value 1 +)
  11391. (R1 ^reward R983 +)
  11392. Retracting elaborate*copy-dir-to-output-link
  11393. -->
  11394. (I3 ^dir U +)
  11395. Retracting rl*prefer*rvt*predict-no*H0*6
  11396. -->
  11397. (S1 ^operator O1960 = 0.9999999999999999)
  11398. Retracting rl*prefer*rvt*predict-yes*H0*5
  11399. -->
  11400. (S1 ^operator O1959 = 0.)
  11401. =>WM: (13748: S1 ^operator O1962 +)
  11402. =>WM: (13747: S1 ^operator O1961 +)
  11403. =>WM: (13746: O1962 ^name predict-no)
  11404. =>WM: (13745: O1961 ^name predict-yes)
  11405. =>WM: (13744: R984 ^value 1)
  11406. =>WM: (13743: R1 ^reward R984)
  11407. =>WM: (13742: I3 ^see 0)
  11408. <=WM: (13733: S1 ^operator O1959 +)
  11409. <=WM: (13734: S1 ^operator O1960 +)
  11410. <=WM: (13735: S1 ^operator O1960)
  11411. <=WM: (13728: R1 ^reward R983)
  11412. <=WM: (13727: I3 ^see 1)
  11413. <=WM: (13731: O1960 ^name predict-no)
  11414. <=WM: (13730: O1959 ^name predict-yes)
  11415. <=WM: (13729: R983 ^value 1)
  11416. --- Inner Elaboration Phase, active level 1 (S1) ---
  11417. Firing prefer*rvt*predict-yes*H0
  11418. -->
  11419. Firing rl*prefer*rvt*predict-yes*H0*5
  11420. -->
  11421. (S1 ^operator O1961 = 0.)
  11422. Firing prefer*rvt*predict-no*H0
  11423. -->
  11424. Firing rl*prefer*rvt*predict-no*H0*6
  11425. -->
  11426. (S1 ^operator O1962 = 0.9999999999999999)
  11427. inner elaboration loop at bottom goal.
  11428. Retracting rl*prefer*rvt*predict-no*H0*6
  11429. -->
  11430. (S1 ^operator O1960 = 0.9999999999999999)
  11431. Retracting rl*prefer*rvt*predict-yes*H0*5
  11432. -->
  11433. (S1 ^operator O1959 = 0.)
  11434. --- END Proposal Phase ---
  11435. --- Decision Phase ---
  11436. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11437. =>WM: (13749: S1 ^operator O1962)
  11438. 981: O: O1962 (predict-no)
  11439. --- END Decision Phase ---
  11440. --- Application Phase ---
  11441. --- Firing Productions (PE) For State At Depth 1 ---
  11442. --- Inner Elaboration Phase, active level 1 (S1) ---
  11443. Firing apply*operator
  11444. -->
  11445. (I3 ^predict-no N981 + :O )
  11446. Firing apply*operator*complete
  11447. -->
  11448. (I3 ^predict-no N980 - :O )
  11449. inner elaboration loop at bottom goal.
  11450. --- Change Working Memory (PE) ---
  11451. =>WM: (13750: I3 ^predict-no N981)
  11452. <=WM: (13737: N980 ^status complete)
  11453. <=WM: (13736: I3 ^predict-no N980)
  11454. --- Firing Productions (IE) For State At Depth 1 ---
  11455. --- Inner Elaboration Phase, active level 1 (S1) ---
  11456. Firing monitor*world
  11457. -->
  11458. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11459. --- Change Working Memory (IE) ---
  11460. --- END Application Phase ---
  11461. --- Output Phase ---
  11462. ENV: Agent did: predict-no for direction U in state State-B
  11463. In State-B moving U
  11464. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11465. predict error 0
  11466. dir: dir isU
  11467. --- END Output Phase ---
  11468. \--- Input Phase ---
  11469. =>WM: (13754: I2 ^dir U)
  11470. =>WM: (13753: I2 ^reward 1)
  11471. =>WM: (13752: I2 ^see 0)
  11472. =>WM: (13751: N981 ^status complete)
  11473. <=WM: (13740: I2 ^dir U)
  11474. <=WM: (13739: I2 ^reward 1)
  11475. <=WM: (13738: I2 ^see 0)
  11476. =>WM: (13755: I2 ^level-1 R1-root)
  11477. <=WM: (13741: I2 ^level-1 R1-root)
  11478. --- END Input Phase ---
  11479. --- Proposal Phase ---
  11480. --- Inner Elaboration Phase, active level 1 (S1) ---
  11481. Firing elaborate*copy-see-to-output-link
  11482. -->
  11483. (I3 ^see 0 +)
  11484. Firing elaborate*reward*based*on*reward
  11485. -->
  11486. (R985 ^value 1 +)
  11487. (R1 ^reward R985 +)
  11488. Firing propose*predict-yes
  11489. -->
  11490. (O1963 ^name predict-yes +)
  11491. (S1 ^operator O1963 +)
  11492. Firing propose*predict-no
  11493. -->
  11494. (O1964 ^name predict-no +)
  11495. (S1 ^operator O1964 +)
  11496. Firing rl*prefer*rvt*predict-no*H0*6
  11497. -->
  11498. (S1 ^operator O1962 = 0.9999999999999999)
  11499. Firing rl*prefer*rvt*predict-yes*H0*5
  11500. -->
  11501. (S1 ^operator O1961 = 0.)
  11502. Firing prefer*rvt*predict-yes*H0
  11503. -->
  11504. Firing prefer*rvt*predict-no*H0
  11505. -->
  11506. Firing elaborate*copy-dir-to-output-link
  11507. -->
  11508. (I3 ^dir U +)
  11509. inner elaboration loop at bottom goal.
  11510. Retracting elaborate*copy-see-to-output-link
  11511. -->
  11512. (I3 ^see 0 +)
  11513. Retracting propose*predict-no
  11514. -->
  11515. (O1962 ^name predict-no +)
  11516. (S1 ^operator O1962 +)
  11517. Retracting propose*predict-yes
  11518. -->
  11519. (O1961 ^name predict-yes +)
  11520. (S1 ^operator O1961 +)
  11521. Retracting elaborate*reward*based*on*reward
  11522. -->
  11523. (R984 ^value 1 +)
  11524. (R1 ^reward R984 +)
  11525. Retracting elaborate*copy-dir-to-output-link
  11526. -->
  11527. (I3 ^dir U +)
  11528. Retracting rl*prefer*rvt*predict-no*H0*6
  11529. -->
  11530. (S1 ^operator O1962 = 0.9999999999999999)
  11531. Retracting rl*prefer*rvt*predict-yes*H0*5
  11532. -->
  11533. (S1 ^operator O1961 = 0.)
  11534. =>WM: (13761: S1 ^operator O1964 +)
  11535. =>WM: (13760: S1 ^operator O1963 +)
  11536. =>WM: (13759: O1964 ^name predict-no)
  11537. =>WM: (13758: O1963 ^name predict-yes)
  11538. =>WM: (13757: R985 ^value 1)
  11539. =>WM: (13756: R1 ^reward R985)
  11540. <=WM: (13747: S1 ^operator O1961 +)
  11541. <=WM: (13748: S1 ^operator O1962 +)
  11542. <=WM: (13749: S1 ^operator O1962)
  11543. <=WM: (13743: R1 ^reward R984)
  11544. <=WM: (13746: O1962 ^name predict-no)
  11545. <=WM: (13745: O1961 ^name predict-yes)
  11546. <=WM: (13744: R984 ^value 1)
  11547. --- Inner Elaboration Phase, active level 1 (S1) ---
  11548. Firing prefer*rvt*predict-yes*H0
  11549. -->
  11550. Firing rl*prefer*rvt*predict-yes*H0*5
  11551. -->
  11552. (S1 ^operator O1963 = 0.)
  11553. Firing prefer*rvt*predict-no*H0
  11554. -->
  11555. Firing rl*prefer*rvt*predict-no*H0*6
  11556. -->
  11557. (S1 ^operator O1964 = 0.9999999999999999)
  11558. inner elaboration loop at bottom goal.
  11559. Retracting rl*prefer*rvt*predict-no*H0*6
  11560. -->
  11561. (S1 ^operator O1962 = 0.9999999999999999)
  11562. Retracting rl*prefer*rvt*predict-yes*H0*5
  11563. -->
  11564. (S1 ^operator O1961 = 0.)
  11565. --- END Proposal Phase ---
  11566. --- Decision Phase ---
  11567. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11568. =>WM: (13762: S1 ^operator O1964)
  11569. 982: O: O1964 (predict-no)
  11570. --- END Decision Phase ---
  11571. --- Application Phase ---
  11572. --- Firing Productions (PE) For State At Depth 1 ---
  11573. --- Inner Elaboration Phase, active level 1 (S1) ---
  11574. Firing apply*operator
  11575. -->
  11576. (I3 ^predict-no N982 + :O )
  11577. Firing apply*operator*complete
  11578. -->
  11579. (I3 ^predict-no N981 - :O )
  11580. inner elaboration loop at bottom goal.
  11581. --- Change Working Memory (PE) ---
  11582. =>WM: (13763: I3 ^predict-no N982)
  11583. <=WM: (13751: N981 ^status complete)
  11584. <=WM: (13750: I3 ^predict-no N981)
  11585. --- Firing Productions (IE) For State At Depth 1 ---
  11586. --- Inner Elaboration Phase, active level 1 (S1) ---
  11587. Firing monitor*world
  11588. -->
  11589. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11590. --- Change Working Memory (IE) ---
  11591. --- END Application Phase ---
  11592. --- Output Phase ---
  11593. ENV: Agent did: predict-no for direction U in state State-B
  11594. In State-B moving U
  11595. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11596. predict error 0
  11597. dir: dir isR
  11598. --- END Output Phase ---
  11599. -/|--- Input Phase ---
  11600. =>WM: (13767: I2 ^dir R)
  11601. =>WM: (13766: I2 ^reward 1)
  11602. =>WM: (13765: I2 ^see 0)
  11603. =>WM: (13764: N982 ^status complete)
  11604. <=WM: (13754: I2 ^dir U)
  11605. <=WM: (13753: I2 ^reward 1)
  11606. <=WM: (13752: I2 ^see 0)
  11607. =>WM: (13768: I2 ^level-1 R1-root)
  11608. <=WM: (13755: I2 ^level-1 R1-root)
  11609. --- END Input Phase ---
  11610. --- Proposal Phase ---
  11611. --- Inner Elaboration Phase, active level 1 (S1) ---
  11612. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  11613. -->
  11614. (S1 ^operator O1963 = 0.1398795999120246)
  11615. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  11616. -->
  11617. (S1 ^operator O1964 = 0.5523825060913952)
  11618. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11619. -->
  11620. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11621. -->
  11622. Firing elaborate*copy-see-to-output-link
  11623. -->
  11624. (I3 ^see 0 +)
  11625. Firing elaborate*reward*based*on*reward
  11626. -->
  11627. (R986 ^value 1 +)
  11628. (R1 ^reward R986 +)
  11629. Firing propose*predict-yes
  11630. -->
  11631. (O1965 ^name predict-yes +)
  11632. (S1 ^operator O1965 +)
  11633. Firing propose*predict-no
  11634. -->
  11635. (O1966 ^name predict-no +)
  11636. (S1 ^operator O1966 +)
  11637. Firing rl*prefer*rvt*predict-no*H0*4
  11638. -->
  11639. (S1 ^operator O1964 = 0.4476189814068987)
  11640. Firing rl*prefer*rvt*predict-yes*H0*3
  11641. -->
  11642. (S1 ^operator O1963 = 0.1844091715509321)
  11643. Firing prefer*rvt*predict-yes*H0
  11644. -->
  11645. Firing prefer*rvt*predict-no*H0
  11646. -->
  11647. Firing elaborate*copy-dir-to-output-link
  11648. -->
  11649. (I3 ^dir R +)
  11650. inner elaboration loop at bottom goal.
  11651. Retracting elaborate*copy-see-to-output-link
  11652. -->
  11653. (I3 ^see 0 +)
  11654. Retracting propose*predict-no
  11655. -->
  11656. (O1964 ^name predict-no +)
  11657. (S1 ^operator O1964 +)
  11658. Retracting propose*predict-yes
  11659. -->
  11660. (O1963 ^name predict-yes +)
  11661. (S1 ^operator O1963 +)
  11662. Retracting elaborate*reward*based*on*reward
  11663. -->
  11664. (R985 ^value 1 +)
  11665. (R1 ^reward R985 +)
  11666. Retracting elaborate*copy-dir-to-output-link
  11667. -->
  11668. (I3 ^dir U +)
  11669. Retracting rl*prefer*rvt*predict-no*H0*6
  11670. -->
  11671. (S1 ^operator O1964 = 0.9999999999999999)
  11672. Retracting rl*prefer*rvt*predict-yes*H0*5
  11673. -->
  11674. (S1 ^operator O1963 = 0.)
  11675. =>WM: (13775: S1 ^operator O1966 +)
  11676. =>WM: (13774: S1 ^operator O1965 +)
  11677. =>WM: (13773: I3 ^dir R)
  11678. =>WM: (13772: O1966 ^name predict-no)
  11679. =>WM: (13771: O1965 ^name predict-yes)
  11680. =>WM: (13770: R986 ^value 1)
  11681. =>WM: (13769: R1 ^reward R986)
  11682. <=WM: (13760: S1 ^operator O1963 +)
  11683. <=WM: (13761: S1 ^operator O1964 +)
  11684. <=WM: (13762: S1 ^operator O1964)
  11685. <=WM: (13732: I3 ^dir U)
  11686. <=WM: (13756: R1 ^reward R985)
  11687. <=WM: (13759: O1964 ^name predict-no)
  11688. <=WM: (13758: O1963 ^name predict-yes)
  11689. <=WM: (13757: R985 ^value 1)
  11690. --- Inner Elaboration Phase, active level 1 (S1) ---
  11691. Firing prefer*rvt*predict-yes*H0
  11692. -->
  11693. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  11694. -->
  11695. (S1 ^operator O1965 = 0.1398795999120246)
  11696. Firing rl*prefer*rvt*predict-yes*H0*3
  11697. -->
  11698. (S1 ^operator O1965 = 0.1844091715509321)
  11699. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11700. -->
  11701. Firing prefer*rvt*predict-no*H0
  11702. -->
  11703. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  11704. -->
  11705. (S1 ^operator O1966 = 0.5523825060913952)
  11706. Firing rl*prefer*rvt*predict-no*H0*4
  11707. -->
  11708. (S1 ^operator O1966 = 0.4476189814068987)
  11709. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11710. -->
  11711. inner elaboration loop at bottom goal.
  11712. Retracting rl*prefer*rvt*predict-no*H0*4
  11713. -->
  11714. (S1 ^operator O1964 = 0.4476189814068987)
  11715. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  11716. -->
  11717. (S1 ^operator O1964 = 0.5523825060913952)
  11718. Retracting rl*prefer*rvt*predict-yes*H0*3
  11719. -->
  11720. (S1 ^operator O1963 = 0.1844091715509321)
  11721. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  11722. -->
  11723. (S1 ^operator O1963 = 0.1398795999120246)
  11724. --- END Proposal Phase ---
  11725. --- Decision Phase ---
  11726. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11727. =>WM: (13776: S1 ^operator O1966)
  11728. 983: O: O1966 (predict-no)
  11729. --- END Decision Phase ---
  11730. --- Application Phase ---
  11731. --- Firing Productions (PE) For State At Depth 1 ---
  11732. --- Inner Elaboration Phase, active level 1 (S1) ---
  11733. Firing apply*operator
  11734. -->
  11735. (I3 ^predict-no N983 + :O )
  11736. Firing apply*operator*complete
  11737. -->
  11738. (I3 ^predict-no N982 - :O )
  11739. inner elaboration loop at bottom goal.
  11740. --- Change Working Memory (PE) ---
  11741. =>WM: (13777: I3 ^predict-no N983)
  11742. <=WM: (13764: N982 ^status complete)
  11743. <=WM: (13763: I3 ^predict-no N982)
  11744. --- Firing Productions (IE) For State At Depth 1 ---
  11745. --- Inner Elaboration Phase, active level 1 (S1) ---
  11746. Firing monitor*world
  11747. -->
  11748. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11749. --- Change Working Memory (IE) ---
  11750. --- END Application Phase ---
  11751. --- Output Phase ---
  11752. ENV: Agent did: predict-no for direction R in state State-B
  11753. In State-B moving R
  11754. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11755. predict error 0
  11756. dir: dir isR
  11757. --- END Output Phase ---
  11758. \-/|--- Input Phase ---
  11759. =>WM: (13781: I2 ^dir R)
  11760. =>WM: (13780: I2 ^reward 1)
  11761. =>WM: (13779: I2 ^see 0)
  11762. =>WM: (13778: N983 ^status complete)
  11763. <=WM: (13767: I2 ^dir R)
  11764. <=WM: (13766: I2 ^reward 1)
  11765. <=WM: (13765: I2 ^see 0)
  11766. =>WM: (13782: I2 ^level-1 R0-root)
  11767. <=WM: (13768: I2 ^level-1 R1-root)
  11768. --- END Input Phase ---
  11769. --- Proposal Phase ---
  11770. --- Inner Elaboration Phase, active level 1 (S1) ---
  11771. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  11772. -->
  11773. (S1 ^operator O1965 = 0.1664311307472832)
  11774. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  11775. -->
  11776. (S1 ^operator O1966 = 0.5523783049582921)
  11777. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11778. -->
  11779. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11780. -->
  11781. Firing elaborate*copy-see-to-output-link
  11782. -->
  11783. (I3 ^see 0 +)
  11784. Firing elaborate*reward*based*on*reward
  11785. -->
  11786. (R987 ^value 1 +)
  11787. (R1 ^reward R987 +)
  11788. Firing propose*predict-yes
  11789. -->
  11790. (O1967 ^name predict-yes +)
  11791. (S1 ^operator O1967 +)
  11792. Firing propose*predict-no
  11793. -->
  11794. (O1968 ^name predict-no +)
  11795. (S1 ^operator O1968 +)
  11796. Firing rl*prefer*rvt*predict-no*H0*4
  11797. -->
  11798. (S1 ^operator O1966 = 0.4476189814068987)
  11799. Firing rl*prefer*rvt*predict-yes*H0*3
  11800. -->
  11801. (S1 ^operator O1965 = 0.1844091715509321)
  11802. Firing prefer*rvt*predict-yes*H0
  11803. -->
  11804. Firing prefer*rvt*predict-no*H0
  11805. -->
  11806. Firing elaborate*copy-dir-to-output-link
  11807. -->
  11808. (I3 ^dir R +)
  11809. inner elaboration loop at bottom goal.
  11810. Retracting elaborate*copy-see-to-output-link
  11811. -->
  11812. (I3 ^see 0 +)
  11813. Retracting propose*predict-no
  11814. -->
  11815. (O1966 ^name predict-no +)
  11816. (S1 ^operator O1966 +)
  11817. Retracting propose*predict-yes
  11818. -->
  11819. (O1965 ^name predict-yes +)
  11820. (S1 ^operator O1965 +)
  11821. Retracting elaborate*reward*based*on*reward
  11822. -->
  11823. (R986 ^value 1 +)
  11824. (R1 ^reward R986 +)
  11825. Retracting elaborate*copy-dir-to-output-link
  11826. -->
  11827. (I3 ^dir R +)
  11828. Retracting rl*prefer*rvt*predict-no*H0*4
  11829. -->
  11830. (S1 ^operator O1966 = 0.4476189814068987)
  11831. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  11832. -->
  11833. (S1 ^operator O1966 = 0.5523825060913952)
  11834. Retracting rl*prefer*rvt*predict-yes*H0*3
  11835. -->
  11836. (S1 ^operator O1965 = 0.1844091715509321)
  11837. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  11838. -->
  11839. (S1 ^operator O1965 = 0.1398795999120246)
  11840. =>WM: (13788: S1 ^operator O1968 +)
  11841. =>WM: (13787: S1 ^operator O1967 +)
  11842. =>WM: (13786: O1968 ^name predict-no)
  11843. =>WM: (13785: O1967 ^name predict-yes)
  11844. =>WM: (13784: R987 ^value 1)
  11845. =>WM: (13783: R1 ^reward R987)
  11846. <=WM: (13774: S1 ^operator O1965 +)
  11847. <=WM: (13775: S1 ^operator O1966 +)
  11848. <=WM: (13776: S1 ^operator O1966)
  11849. <=WM: (13769: R1 ^reward R986)
  11850. <=WM: (13772: O1966 ^name predict-no)
  11851. <=WM: (13771: O1965 ^name predict-yes)
  11852. <=WM: (13770: R986 ^value 1)
  11853. --- Inner Elaboration Phase, active level 1 (S1) ---
  11854. Firing prefer*rvt*predict-yes*H0
  11855. -->
  11856. Firing rl*prefer*rvt*predict-yes*H0*3
  11857. -->
  11858. (S1 ^operator O1967 = 0.1844091715509321)
  11859. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11860. -->
  11861. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  11862. -->
  11863. (S1 ^operator O1967 = 0.1664311307472832)
  11864. Firing prefer*rvt*predict-no*H0
  11865. -->
  11866. Firing rl*prefer*rvt*predict-no*H0*4
  11867. -->
  11868. (S1 ^operator O1968 = 0.4476189814068987)
  11869. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11870. -->
  11871. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  11872. -->
  11873. (S1 ^operator O1968 = 0.5523783049582921)
  11874. inner elaboration loop at bottom goal.
  11875. Retracting rl*prefer*rvt*predict-no*H0*4
  11876. -->
  11877. (S1 ^operator O1966 = 0.4476189814068987)
  11878. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  11879. -->
  11880. (S1 ^operator O1966 = 0.5523783049582921)
  11881. Retracting rl*prefer*rvt*predict-yes*H0*3
  11882. -->
  11883. (S1 ^operator O1965 = 0.1844091715509321)
  11884. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  11885. -->
  11886. (S1 ^operator O1965 = 0.1664311307472832)
  11887. --- END Proposal Phase ---
  11888. --- Decision Phase ---
  11889. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622532 -0.174914 0.447619(R,m,v=1,0.927419,0.06786)
  11890. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377469 0.174914 0.552383 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
  11891. =>WM: (13789: S1 ^operator O1968)
  11892. 984: O: O1968 (predict-no)
  11893. --- END Decision Phase ---
  11894. --- Application Phase ---
  11895. --- Firing Productions (PE) For State At Depth 1 ---
  11896. --- Inner Elaboration Phase, active level 1 (S1) ---
  11897. Firing apply*operator
  11898. -->
  11899. (I3 ^predict-no N984 + :O )
  11900. Firing apply*operator*complete
  11901. -->
  11902. (I3 ^predict-no N983 - :O )
  11903. inner elaboration loop at bottom goal.
  11904. --- Change Working Memory (PE) ---
  11905. =>WM: (13790: I3 ^predict-no N984)
  11906. <=WM: (13778: N983 ^status complete)
  11907. <=WM: (13777: I3 ^predict-no N983)
  11908. --- Firing Productions (IE) For State At Depth 1 ---
  11909. --- Inner Elaboration Phase, active level 1 (S1) ---
  11910. Firing monitor*world
  11911. -->
  11912. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11913. --- Change Working Memory (IE) ---
  11914. --- END Application Phase ---
  11915. --- Output Phase ---
  11916. ENV: Agent did: predict-no for direction R in state State-B
  11917. In State-B moving R
  11918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11919. predict error 0
  11920. dir: dir isU
  11921. --- END Output Phase ---
  11922. \-/--- Input Phase ---
  11923. =>WM: (13794: I2 ^dir U)
  11924. =>WM: (13793: I2 ^reward 1)
  11925. =>WM: (13792: I2 ^see 0)
  11926. =>WM: (13791: N984 ^status complete)
  11927. <=WM: (13781: I2 ^dir R)
  11928. <=WM: (13780: I2 ^reward 1)
  11929. <=WM: (13779: I2 ^see 0)
  11930. =>WM: (13795: I2 ^level-1 R0-root)
  11931. <=WM: (13782: I2 ^level-1 R0-root)
  11932. --- END Input Phase ---
  11933. --- Proposal Phase ---
  11934. --- Inner Elaboration Phase, active level 1 (S1) ---
  11935. Firing elaborate*copy-see-to-output-link
  11936. -->
  11937. (I3 ^see 0 +)
  11938. Firing elaborate*reward*based*on*reward
  11939. -->
  11940. (R988 ^value 1 +)
  11941. (R1 ^reward R988 +)
  11942. Firing propose*predict-yes
  11943. -->
  11944. (O1969 ^name predict-yes +)
  11945. (S1 ^operator O1969 +)
  11946. Firing propose*predict-no
  11947. -->
  11948. (O1970 ^name predict-no +)
  11949. (S1 ^operator O1970 +)
  11950. Firing rl*prefer*rvt*predict-no*H0*6
  11951. -->
  11952. (S1 ^operator O1968 = 0.9999999999999999)
  11953. Firing rl*prefer*rvt*predict-yes*H0*5
  11954. -->
  11955. (S1 ^operator O1967 = 0.)
  11956. Firing prefer*rvt*predict-yes*H0
  11957. -->
  11958. Firing prefer*rvt*predict-no*H0
  11959. -->
  11960. Firing elaborate*copy-dir-to-output-link
  11961. -->
  11962. (I3 ^dir U +)
  11963. inner elaboration loop at bottom goal.
  11964. Retracting elaborate*copy-see-to-output-link
  11965. -->
  11966. (I3 ^see 0 +)
  11967. Retracting propose*predict-no
  11968. -->
  11969. (O1968 ^name predict-no +)
  11970. (S1 ^operator O1968 +)
  11971. Retracting propose*predict-yes
  11972. -->
  11973. (O1967 ^name predict-yes +)
  11974. (S1 ^operator O1967 +)
  11975. Retracting elaborate*reward*based*on*reward
  11976. -->
  11977. (R987 ^value 1 +)
  11978. (R1 ^reward R987 +)
  11979. Retracting elaborate*copy-dir-to-output-link
  11980. -->
  11981. (I3 ^dir R +)
  11982. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  11983. -->
  11984. (S1 ^operator O1968 = 0.5523783049582921)
  11985. Retracting rl*prefer*rvt*predict-no*H0*4
  11986. -->
  11987. (S1 ^operator O1968 = 0.4476187582821546)
  11988. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  11989. -->
  11990. (S1 ^operator O1967 = 0.1664311307472832)
  11991. Retracting rl*prefer*rvt*predict-yes*H0*3
  11992. -->
  11993. (S1 ^operator O1967 = 0.1844091715509321)
  11994. =>WM: (13802: S1 ^operator O1970 +)
  11995. =>WM: (13801: S1 ^operator O1969 +)
  11996. =>WM: (13800: I3 ^dir U)
  11997. =>WM: (13799: O1970 ^name predict-no)
  11998. =>WM: (13798: O1969 ^name predict-yes)
  11999. =>WM: (13797: R988 ^value 1)
  12000. =>WM: (13796: R1 ^reward R988)
  12001. <=WM: (13787: S1 ^operator O1967 +)
  12002. <=WM: (13788: S1 ^operator O1968 +)
  12003. <=WM: (13789: S1 ^operator O1968)
  12004. <=WM: (13773: I3 ^dir R)
  12005. <=WM: (13783: R1 ^reward R987)
  12006. <=WM: (13786: O1968 ^name predict-no)
  12007. <=WM: (13785: O1967 ^name predict-yes)
  12008. <=WM: (13784: R987 ^value 1)
  12009. --- Inner Elaboration Phase, active level 1 (S1) ---
  12010. Firing prefer*rvt*predict-yes*H0
  12011. -->
  12012. Firing rl*prefer*rvt*predict-yes*H0*5
  12013. -->
  12014. (S1 ^operator O1969 = 0.)
  12015. Firing prefer*rvt*predict-no*H0
  12016. -->
  12017. Firing rl*prefer*rvt*predict-no*H0*6
  12018. -->
  12019. (S1 ^operator O1970 = 0.9999999999999999)
  12020. inner elaboration loop at bottom goal.
  12021. Retracting rl*prefer*rvt*predict-no*H0*6
  12022. -->
  12023. (S1 ^operator O1968 = 0.9999999999999999)
  12024. Retracting rl*prefer*rvt*predict-yes*H0*5
  12025. -->
  12026. (S1 ^operator O1967 = 0.)
  12027. --- END Proposal Phase ---
  12028. --- Decision Phase ---
  12029. RL update rl*prefer*rvt*predict-no*H0*4 0.622532 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.928,0.0673548)
  12030. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377465 0.174913 0.552378 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
  12031. =>WM: (13803: S1 ^operator O1970)
  12032. 985: O: O1970 (predict-no)
  12033. --- END Decision Phase ---
  12034. --- Application Phase ---
  12035. --- Firing Productions (PE) For State At Depth 1 ---
  12036. --- Inner Elaboration Phase, active level 1 (S1) ---
  12037. Firing apply*operator
  12038. -->
  12039. (I3 ^predict-no N985 + :O )
  12040. Firing apply*operator*complete
  12041. -->
  12042. (I3 ^predict-no N984 - :O )
  12043. inner elaboration loop at bottom goal.
  12044. --- Change Working Memory (PE) ---
  12045. =>WM: (13804: I3 ^predict-no N985)
  12046. <=WM: (13791: N984 ^status complete)
  12047. <=WM: (13790: I3 ^predict-no N984)
  12048. --- Firing Productions (IE) For State At Depth 1 ---
  12049. --- Inner Elaboration Phase, active level 1 (S1) ---
  12050. Firing monitor*world
  12051. -->
  12052. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12053. --- Change Working Memory (IE) ---
  12054. --- END Application Phase ---
  12055. --- Output Phase ---
  12056. ENV: Agent did: predict-no for direction U in state State-B
  12057. In State-B moving U
  12058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12059. predict error 0
  12060. dir: dir isL
  12061. --- END Output Phase ---
  12062. |\-/--- Input Phase ---
  12063. =>WM: (13808: I2 ^dir L)
  12064. =>WM: (13807: I2 ^reward 1)
  12065. =>WM: (13806: I2 ^see 0)
  12066. =>WM: (13805: N985 ^status complete)
  12067. <=WM: (13794: I2 ^dir U)
  12068. <=WM: (13793: I2 ^reward 1)
  12069. <=WM: (13792: I2 ^see 0)
  12070. =>WM: (13809: I2 ^level-1 R0-root)
  12071. <=WM: (13795: I2 ^level-1 R0-root)
  12072. --- END Input Phase ---
  12073. --- Proposal Phase ---
  12074. --- Inner Elaboration Phase, active level 1 (S1) ---
  12075. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12076. -->
  12077. (S1 ^operator O1969 = 0.6104614609336363)
  12078. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12079. -->
  12080. (S1 ^operator O1970 = 0.1063475139796038)
  12081. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12082. -->
  12083. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12084. -->
  12085. Firing elaborate*copy-see-to-output-link
  12086. -->
  12087. (I3 ^see 0 +)
  12088. Firing elaborate*reward*based*on*reward
  12089. -->
  12090. (R989 ^value 1 +)
  12091. (R1 ^reward R989 +)
  12092. Firing propose*predict-yes
  12093. -->
  12094. (O1971 ^name predict-yes +)
  12095. (S1 ^operator O1971 +)
  12096. Firing propose*predict-no
  12097. -->
  12098. (O1972 ^name predict-no +)
  12099. (S1 ^operator O1972 +)
  12100. Firing rl*prefer*rvt*predict-no*H0*2
  12101. -->
  12102. (S1 ^operator O1970 = 0.3873369632550164)
  12103. Firing rl*prefer*rvt*predict-yes*H0*1
  12104. -->
  12105. (S1 ^operator O1969 = 0.389539588642773)
  12106. Firing prefer*rvt*predict-yes*H0
  12107. -->
  12108. Firing prefer*rvt*predict-no*H0
  12109. -->
  12110. Firing elaborate*copy-dir-to-output-link
  12111. -->
  12112. (I3 ^dir L +)
  12113. inner elaboration loop at bottom goal.
  12114. Retracting elaborate*copy-see-to-output-link
  12115. -->
  12116. (I3 ^see 0 +)
  12117. Retracting propose*predict-no
  12118. -->
  12119. (O1970 ^name predict-no +)
  12120. (S1 ^operator O1970 +)
  12121. Retracting propose*predict-yes
  12122. -->
  12123. (O1969 ^name predict-yes +)
  12124. (S1 ^operator O1969 +)
  12125. Retracting elaborate*reward*based*on*reward
  12126. -->
  12127. (R988 ^value 1 +)
  12128. (R1 ^reward R988 +)
  12129. Retracting elaborate*copy-dir-to-output-link
  12130. -->
  12131. (I3 ^dir U +)
  12132. Retracting rl*prefer*rvt*predict-no*H0*6
  12133. -->
  12134. (S1 ^operator O1970 = 0.9999999999999999)
  12135. Retracting rl*prefer*rvt*predict-yes*H0*5
  12136. -->
  12137. (S1 ^operator O1969 = 0.)
  12138. =>WM: (13816: S1 ^operator O1972 +)
  12139. =>WM: (13815: S1 ^operator O1971 +)
  12140. =>WM: (13814: I3 ^dir L)
  12141. =>WM: (13813: O1972 ^name predict-no)
  12142. =>WM: (13812: O1971 ^name predict-yes)
  12143. =>WM: (13811: R989 ^value 1)
  12144. =>WM: (13810: R1 ^reward R989)
  12145. <=WM: (13801: S1 ^operator O1969 +)
  12146. <=WM: (13802: S1 ^operator O1970 +)
  12147. <=WM: (13803: S1 ^operator O1970)
  12148. <=WM: (13800: I3 ^dir U)
  12149. <=WM: (13796: R1 ^reward R988)
  12150. <=WM: (13799: O1970 ^name predict-no)
  12151. <=WM: (13798: O1969 ^name predict-yes)
  12152. <=WM: (13797: R988 ^value 1)
  12153. --- Inner Elaboration Phase, active level 1 (S1) ---
  12154. Firing prefer*rvt*predict-yes*H0
  12155. -->
  12156. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12157. -->
  12158. (S1 ^operator O1971 = 0.6104614609336363)
  12159. Firing rl*prefer*rvt*predict-yes*H0*1
  12160. -->
  12161. (S1 ^operator O1971 = 0.389539588642773)
  12162. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12163. -->
  12164. Firing prefer*rvt*predict-no*H0
  12165. -->
  12166. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12167. -->
  12168. (S1 ^operator O1972 = 0.1063475139796038)
  12169. Firing rl*prefer*rvt*predict-no*H0*2
  12170. -->
  12171. (S1 ^operator O1972 = 0.3873369632550164)
  12172. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12173. -->
  12174. inner elaboration loop at bottom goal.
  12175. Retracting rl*prefer*rvt*predict-no*H0*2
  12176. -->
  12177. (S1 ^operator O1970 = 0.3873369632550164)
  12178. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12179. -->
  12180. (S1 ^operator O1970 = 0.1063475139796038)
  12181. Retracting rl*prefer*rvt*predict-yes*H0*1
  12182. -->
  12183. (S1 ^operator O1969 = 0.389539588642773)
  12184. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12185. -->
  12186. (S1 ^operator O1969 = 0.6104614609336363)
  12187. --- END Proposal Phase ---
  12188. --- Decision Phase ---
  12189. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12190. =>WM: (13817: S1 ^operator O1971)
  12191. 986: O: O1971 (predict-yes)
  12192. --- END Decision Phase ---
  12193. --- Application Phase ---
  12194. --- Firing Productions (PE) For State At Depth 1 ---
  12195. --- Inner Elaboration Phase, active level 1 (S1) ---
  12196. Firing apply*operator
  12197. -->
  12198. (I3 ^predict-yes N986 + :O )
  12199. Firing apply*operator*complete
  12200. -->
  12201. (I3 ^predict-no N985 - :O )
  12202. inner elaboration loop at bottom goal.
  12203. --- Change Working Memory (PE) ---
  12204. =>WM: (13818: I3 ^predict-yes N986)
  12205. <=WM: (13805: N985 ^status complete)
  12206. <=WM: (13804: I3 ^predict-no N985)
  12207. --- Firing Productions (IE) For State At Depth 1 ---
  12208. --- Inner Elaboration Phase, active level 1 (S1) ---
  12209. Firing monitor*world
  12210. -->
  12211. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12212. --- Change Working Memory (IE) ---
  12213. --- END Application Phase ---
  12214. --- Output Phase ---
  12215. ENV: Agent did: predict-yes for direction L in state State-B
  12216. In State-B moving L
  12217. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12218. predict error 0
  12219. dir: dir isR
  12220. --- END Output Phase ---
  12221. |\---- Input Phase ---
  12222. =>WM: (13822: I2 ^dir R)
  12223. =>WM: (13821: I2 ^reward 1)
  12224. =>WM: (13820: I2 ^see 1)
  12225. =>WM: (13819: N986 ^status complete)
  12226. <=WM: (13808: I2 ^dir L)
  12227. <=WM: (13807: I2 ^reward 1)
  12228. <=WM: (13806: I2 ^see 0)
  12229. =>WM: (13823: I2 ^level-1 L1-root)
  12230. <=WM: (13809: I2 ^level-1 R0-root)
  12231. --- END Input Phase ---
  12232. --- Proposal Phase ---
  12233. --- Inner Elaboration Phase, active level 1 (S1) ---
  12234. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  12235. -->
  12236. (S1 ^operator O1972 = -0.02155734064455064)
  12237. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  12238. -->
  12239. (S1 ^operator O1971 = 0.8155783412803204)
  12240. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12241. -->
  12242. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12243. -->
  12244. Firing elaborate*copy-see-to-output-link
  12245. -->
  12246. (I3 ^see 1 +)
  12247. Firing elaborate*reward*based*on*reward
  12248. -->
  12249. (R990 ^value 1 +)
  12250. (R1 ^reward R990 +)
  12251. Firing propose*predict-yes
  12252. -->
  12253. (O1973 ^name predict-yes +)
  12254. (S1 ^operator O1973 +)
  12255. Firing propose*predict-no
  12256. -->
  12257. (O1974 ^name predict-no +)
  12258. (S1 ^operator O1974 +)
  12259. Firing rl*prefer*rvt*predict-no*H0*4
  12260. -->
  12261. (S1 ^operator O1972 = 0.4476191987960876)
  12262. Firing rl*prefer*rvt*predict-yes*H0*3
  12263. -->
  12264. (S1 ^operator O1971 = 0.1844091715509321)
  12265. Firing prefer*rvt*predict-yes*H0
  12266. -->
  12267. Firing prefer*rvt*predict-no*H0
  12268. -->
  12269. Firing elaborate*copy-dir-to-output-link
  12270. -->
  12271. (I3 ^dir R +)
  12272. inner elaboration loop at bottom goal.
  12273. Retracting elaborate*copy-see-to-output-link
  12274. -->
  12275. (I3 ^see 0 +)
  12276. Retracting propose*predict-no
  12277. -->
  12278. (O1972 ^name predict-no +)
  12279. (S1 ^operator O1972 +)
  12280. Retracting propose*predict-yes
  12281. -->
  12282. (O1971 ^name predict-yes +)
  12283. (S1 ^operator O1971 +)
  12284. Retracting elaborate*reward*based*on*reward
  12285. -->
  12286. (R989 ^value 1 +)
  12287. (R1 ^reward R989 +)
  12288. Retracting elaborate*copy-dir-to-output-link
  12289. -->
  12290. (I3 ^dir L +)
  12291. Retracting rl*prefer*rvt*predict-no*H0*2
  12292. -->
  12293. (S1 ^operator O1972 = 0.3873369632550164)
  12294. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12295. -->
  12296. (S1 ^operator O1972 = 0.1063475139796038)
  12297. Retracting rl*prefer*rvt*predict-yes*H0*1
  12298. -->
  12299. (S1 ^operator O1971 = 0.389539588642773)
  12300. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12301. -->
  12302. (S1 ^operator O1971 = 0.6104614609336363)
  12303. =>WM: (13831: S1 ^operator O1974 +)
  12304. =>WM: (13830: S1 ^operator O1973 +)
  12305. =>WM: (13829: I3 ^dir R)
  12306. =>WM: (13828: O1974 ^name predict-no)
  12307. =>WM: (13827: O1973 ^name predict-yes)
  12308. =>WM: (13826: R990 ^value 1)
  12309. =>WM: (13825: R1 ^reward R990)
  12310. =>WM: (13824: I3 ^see 1)
  12311. <=WM: (13815: S1 ^operator O1971 +)
  12312. <=WM: (13817: S1 ^operator O1971)
  12313. <=WM: (13816: S1 ^operator O1972 +)
  12314. <=WM: (13814: I3 ^dir L)
  12315. <=WM: (13810: R1 ^reward R989)
  12316. <=WM: (13742: I3 ^see 0)
  12317. <=WM: (13813: O1972 ^name predict-no)
  12318. <=WM: (13812: O1971 ^name predict-yes)
  12319. <=WM: (13811: R989 ^value 1)
  12320. --- Inner Elaboration Phase, active level 1 (S1) ---
  12321. Firing prefer*rvt*predict-yes*H0
  12322. -->
  12323. Firing rl*prefer*rvt*predict-yes*H0*3
  12324. -->
  12325. (S1 ^operator O1973 = 0.1844091715509321)
  12326. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12327. -->
  12328. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  12329. -->
  12330. (S1 ^operator O1973 = 0.8155783412803204)
  12331. Firing prefer*rvt*predict-no*H0
  12332. -->
  12333. Firing rl*prefer*rvt*predict-no*H0*4
  12334. -->
  12335. (S1 ^operator O1974 = 0.4476191987960876)
  12336. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12337. -->
  12338. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  12339. -->
  12340. (S1 ^operator O1974 = -0.02155734064455064)
  12341. inner elaboration loop at bottom goal.
  12342. Retracting rl*prefer*rvt*predict-no*H0*4
  12343. -->
  12344. (S1 ^operator O1972 = 0.4476191987960876)
  12345. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  12346. -->
  12347. (S1 ^operator O1972 = -0.02155734064455064)
  12348. Retracting rl*prefer*rvt*predict-yes*H0*3
  12349. -->
  12350. (S1 ^operator O1971 = 0.1844091715509321)
  12351. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  12352. -->
  12353. (S1 ^operator O1971 = 0.8155783412803204)
  12354. --- END Proposal Phase ---
  12355. --- Decision Phase ---
  12356. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.890244,0.0983091)
  12357. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322413 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  12358. =>WM: (13832: S1 ^operator O1973)
  12359. 987: O: O1973 (predict-yes)
  12360. --- END Decision Phase ---
  12361. --- Application Phase ---
  12362. --- Firing Productions (PE) For State At Depth 1 ---
  12363. --- Inner Elaboration Phase, active level 1 (S1) ---
  12364. Firing apply*operator
  12365. -->
  12366. (I3 ^predict-yes N987 + :O )
  12367. Firing apply*operator*complete
  12368. -->
  12369. (I3 ^predict-yes N986 - :O )
  12370. inner elaboration loop at bottom goal.
  12371. --- Change Working Memory (PE) ---
  12372. =>WM: (13833: I3 ^predict-yes N987)
  12373. <=WM: (13819: N986 ^status complete)
  12374. <=WM: (13818: I3 ^predict-yes N986)
  12375. --- Firing Productions (IE) For State At Depth 1 ---
  12376. --- Inner Elaboration Phase, active level 1 (S1) ---
  12377. Firing monitor*world
  12378. -->
  12379. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12380. --- Change Working Memory (IE) ---
  12381. --- END Application Phase ---
  12382. --- Output Phase ---
  12383. ENV: Agent did: predict-yes for direction R in state State-A
  12384. In State-A moving R
  12385. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12386. predict error 0
  12387. dir: dir isR
  12388. --- END Output Phase ---
  12389. /|--- Input Phase ---
  12390. =>WM: (13837: I2 ^dir R)
  12391. =>WM: (13836: I2 ^reward 1)
  12392. =>WM: (13835: I2 ^see 1)
  12393. =>WM: (13834: N987 ^status complete)
  12394. <=WM: (13822: I2 ^dir R)
  12395. <=WM: (13821: I2 ^reward 1)
  12396. <=WM: (13820: I2 ^see 1)
  12397. =>WM: (13838: I2 ^level-1 R1-root)
  12398. <=WM: (13823: I2 ^level-1 L1-root)
  12399. --- END Input Phase ---
  12400. --- Proposal Phase ---
  12401. --- Inner Elaboration Phase, active level 1 (S1) ---
  12402. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12403. -->
  12404. (S1 ^operator O1973 = 0.1398795999120246)
  12405. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12406. -->
  12407. (S1 ^operator O1974 = 0.552382282966651)
  12408. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12409. -->
  12410. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12411. -->
  12412. Firing elaborate*copy-see-to-output-link
  12413. -->
  12414. (I3 ^see 1 +)
  12415. Firing elaborate*reward*based*on*reward
  12416. -->
  12417. (R991 ^value 1 +)
  12418. (R1 ^reward R991 +)
  12419. Firing propose*predict-yes
  12420. -->
  12421. (O1975 ^name predict-yes +)
  12422. (S1 ^operator O1975 +)
  12423. Firing propose*predict-no
  12424. -->
  12425. (O1976 ^name predict-no +)
  12426. (S1 ^operator O1976 +)
  12427. Firing rl*prefer*rvt*predict-no*H0*4
  12428. -->
  12429. (S1 ^operator O1974 = 0.4476191987960876)
  12430. Firing rl*prefer*rvt*predict-yes*H0*3
  12431. -->
  12432. (S1 ^operator O1973 = 0.1844091715509321)
  12433. Firing prefer*rvt*predict-yes*H0
  12434. -->
  12435. Firing prefer*rvt*predict-no*H0
  12436. -->
  12437. Firing elaborate*copy-dir-to-output-link
  12438. -->
  12439. (I3 ^dir R +)
  12440. inner elaboration loop at bottom goal.
  12441. Retracting elaborate*copy-see-to-output-link
  12442. -->
  12443. (I3 ^see 1 +)
  12444. Retracting propose*predict-no
  12445. -->
  12446. (O1974 ^name predict-no +)
  12447. (S1 ^operator O1974 +)
  12448. Retracting propose*predict-yes
  12449. -->
  12450. (O1973 ^name predict-yes +)
  12451. (S1 ^operator O1973 +)
  12452. Retracting elaborate*reward*based*on*reward
  12453. -->
  12454. (R990 ^value 1 +)
  12455. (R1 ^reward R990 +)
  12456. Retracting elaborate*copy-dir-to-output-link
  12457. -->
  12458. (I3 ^dir R +)
  12459. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  12460. -->
  12461. (S1 ^operator O1974 = -0.02155734064455064)
  12462. Retracting rl*prefer*rvt*predict-no*H0*4
  12463. -->
  12464. (S1 ^operator O1974 = 0.4476191987960876)
  12465. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  12466. -->
  12467. (S1 ^operator O1973 = 0.8155783412803204)
  12468. Retracting rl*prefer*rvt*predict-yes*H0*3
  12469. -->
  12470. (S1 ^operator O1973 = 0.1844091715509321)
  12471. =>WM: (13844: S1 ^operator O1976 +)
  12472. =>WM: (13843: S1 ^operator O1975 +)
  12473. =>WM: (13842: O1976 ^name predict-no)
  12474. =>WM: (13841: O1975 ^name predict-yes)
  12475. =>WM: (13840: R991 ^value 1)
  12476. =>WM: (13839: R1 ^reward R991)
  12477. <=WM: (13830: S1 ^operator O1973 +)
  12478. <=WM: (13832: S1 ^operator O1973)
  12479. <=WM: (13831: S1 ^operator O1974 +)
  12480. <=WM: (13825: R1 ^reward R990)
  12481. <=WM: (13828: O1974 ^name predict-no)
  12482. <=WM: (13827: O1973 ^name predict-yes)
  12483. <=WM: (13826: R990 ^value 1)
  12484. --- Inner Elaboration Phase, active level 1 (S1) ---
  12485. Firing prefer*rvt*predict-yes*H0
  12486. -->
  12487. Firing rl*prefer*rvt*predict-yes*H0*3
  12488. -->
  12489. (S1 ^operator O1975 = 0.1844091715509321)
  12490. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12491. -->
  12492. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12493. -->
  12494. (S1 ^operator O1975 = 0.1398795999120246)
  12495. Firing prefer*rvt*predict-no*H0
  12496. -->
  12497. Firing rl*prefer*rvt*predict-no*H0*4
  12498. -->
  12499. (S1 ^operator O1976 = 0.4476191987960876)
  12500. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12501. -->
  12502. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12503. -->
  12504. (S1 ^operator O1976 = 0.552382282966651)
  12505. inner elaboration loop at bottom goal.
  12506. Retracting rl*prefer*rvt*predict-no*H0*4
  12507. -->
  12508. (S1 ^operator O1974 = 0.4476191987960876)
  12509. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12510. -->
  12511. (S1 ^operator O1974 = 0.552382282966651)
  12512. Retracting rl*prefer*rvt*predict-yes*H0*3
  12513. -->
  12514. (S1 ^operator O1973 = 0.1844091715509321)
  12515. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12516. -->
  12517. (S1 ^operator O1973 = 0.1398795999120246)
  12518. --- END Proposal Phase ---
  12519. --- Decision Phase ---
  12520. RL update rl*prefer*rvt*predict-yes*H0*3 0.675411 -0.491002 0.184409 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.898204,0.0919847)
  12521. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324573 0.491006 0.815578 -> 0.324575 0.491005 0.81558(R,m,v=1,1,0)
  12522. =>WM: (13845: S1 ^operator O1976)
  12523. 988: O: O1976 (predict-no)
  12524. --- END Decision Phase ---
  12525. --- Application Phase ---
  12526. --- Firing Productions (PE) For State At Depth 1 ---
  12527. --- Inner Elaboration Phase, active level 1 (S1) ---
  12528. Firing apply*operator
  12529. -->
  12530. (I3 ^predict-no N988 + :O )
  12531. Firing apply*operator*complete
  12532. -->
  12533. (I3 ^predict-yes N987 - :O )
  12534. inner elaboration loop at bottom goal.
  12535. --- Change Working Memory (PE) ---
  12536. =>WM: (13846: I3 ^predict-no N988)
  12537. <=WM: (13834: N987 ^status complete)
  12538. <=WM: (13833: I3 ^predict-yes N987)
  12539. --- Firing Productions (IE) For State At Depth 1 ---
  12540. --- Inner Elaboration Phase, active level 1 (S1) ---
  12541. Firing monitor*world
  12542. -->
  12543. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12544. --- Change Working Memory (IE) ---
  12545. --- END Application Phase ---
  12546. --- Output Phase ---
  12547. ENV: Agent did: predict-no for direction R in state State-B
  12548. In State-B moving R
  12549. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12550. predict error 0
  12551. dir: dir isR
  12552. --- END Output Phase ---
  12553. \---- Input Phase ---
  12554. =>WM: (13850: I2 ^dir R)
  12555. =>WM: (13849: I2 ^reward 1)
  12556. =>WM: (13848: I2 ^see 0)
  12557. =>WM: (13847: N988 ^status complete)
  12558. <=WM: (13837: I2 ^dir R)
  12559. <=WM: (13836: I2 ^reward 1)
  12560. <=WM: (13835: I2 ^see 1)
  12561. =>WM: (13851: I2 ^level-1 R0-root)
  12562. <=WM: (13838: I2 ^level-1 R1-root)
  12563. --- END Input Phase ---
  12564. --- Proposal Phase ---
  12565. --- Inner Elaboration Phase, active level 1 (S1) ---
  12566. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12567. -->
  12568. (S1 ^operator O1975 = 0.1664311307472832)
  12569. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12570. -->
  12571. (S1 ^operator O1976 = 0.5523787454722251)
  12572. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12573. -->
  12574. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12575. -->
  12576. Firing elaborate*copy-see-to-output-link
  12577. -->
  12578. (I3 ^see 0 +)
  12579. Firing elaborate*reward*based*on*reward
  12580. -->
  12581. (R992 ^value 1 +)
  12582. (R1 ^reward R992 +)
  12583. Firing propose*predict-yes
  12584. -->
  12585. (O1977 ^name predict-yes +)
  12586. (S1 ^operator O1977 +)
  12587. Firing propose*predict-no
  12588. -->
  12589. (O1978 ^name predict-no +)
  12590. (S1 ^operator O1978 +)
  12591. Firing rl*prefer*rvt*predict-no*H0*4
  12592. -->
  12593. (S1 ^operator O1976 = 0.4476191987960876)
  12594. Firing rl*prefer*rvt*predict-yes*H0*3
  12595. -->
  12596. (S1 ^operator O1975 = 0.1844110446262441)
  12597. Firing prefer*rvt*predict-yes*H0
  12598. -->
  12599. Firing prefer*rvt*predict-no*H0
  12600. -->
  12601. Firing elaborate*copy-dir-to-output-link
  12602. -->
  12603. (I3 ^dir R +)
  12604. inner elaboration loop at bottom goal.
  12605. Retracting elaborate*copy-see-to-output-link
  12606. -->
  12607. (I3 ^see 1 +)
  12608. Retracting propose*predict-no
  12609. -->
  12610. (O1976 ^name predict-no +)
  12611. (S1 ^operator O1976 +)
  12612. Retracting propose*predict-yes
  12613. -->
  12614. (O1975 ^name predict-yes +)
  12615. (S1 ^operator O1975 +)
  12616. Retracting elaborate*reward*based*on*reward
  12617. -->
  12618. (R991 ^value 1 +)
  12619. (R1 ^reward R991 +)
  12620. Retracting elaborate*copy-dir-to-output-link
  12621. -->
  12622. (I3 ^dir R +)
  12623. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12624. -->
  12625. (S1 ^operator O1976 = 0.552382282966651)
  12626. Retracting rl*prefer*rvt*predict-no*H0*4
  12627. -->
  12628. (S1 ^operator O1976 = 0.4476191987960876)
  12629. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12630. -->
  12631. (S1 ^operator O1975 = 0.1398795999120246)
  12632. Retracting rl*prefer*rvt*predict-yes*H0*3
  12633. -->
  12634. (S1 ^operator O1975 = 0.1844110446262441)
  12635. =>WM: (13858: S1 ^operator O1978 +)
  12636. =>WM: (13857: S1 ^operator O1977 +)
  12637. =>WM: (13856: O1978 ^name predict-no)
  12638. =>WM: (13855: O1977 ^name predict-yes)
  12639. =>WM: (13854: R992 ^value 1)
  12640. =>WM: (13853: R1 ^reward R992)
  12641. =>WM: (13852: I3 ^see 0)
  12642. <=WM: (13843: S1 ^operator O1975 +)
  12643. <=WM: (13844: S1 ^operator O1976 +)
  12644. <=WM: (13845: S1 ^operator O1976)
  12645. <=WM: (13839: R1 ^reward R991)
  12646. <=WM: (13824: I3 ^see 1)
  12647. <=WM: (13842: O1976 ^name predict-no)
  12648. <=WM: (13841: O1975 ^name predict-yes)
  12649. <=WM: (13840: R991 ^value 1)
  12650. --- Inner Elaboration Phase, active level 1 (S1) ---
  12651. Firing prefer*rvt*predict-yes*H0
  12652. -->
  12653. Firing rl*prefer*rvt*predict-yes*H0*3
  12654. -->
  12655. (S1 ^operator O1977 = 0.1844110446262441)
  12656. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12657. -->
  12658. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12659. -->
  12660. (S1 ^operator O1977 = 0.1664311307472832)
  12661. Firing prefer*rvt*predict-no*H0
  12662. -->
  12663. Firing rl*prefer*rvt*predict-no*H0*4
  12664. -->
  12665. (S1 ^operator O1978 = 0.4476191987960876)
  12666. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12667. -->
  12668. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12669. -->
  12670. (S1 ^operator O1978 = 0.5523787454722251)
  12671. inner elaboration loop at bottom goal.
  12672. Retracting rl*prefer*rvt*predict-no*H0*4
  12673. -->
  12674. (S1 ^operator O1976 = 0.4476191987960876)
  12675. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12676. -->
  12677. (S1 ^operator O1976 = 0.5523787454722251)
  12678. Retracting rl*prefer*rvt*predict-yes*H0*3
  12679. -->
  12680. (S1 ^operator O1975 = 0.1844110446262441)
  12681. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12682. -->
  12683. (S1 ^operator O1975 = 0.1664311307472832)
  12684. --- END Proposal Phase ---
  12685. --- Decision Phase ---
  12686. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.928571,0.0668571)
  12687. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
  12688. =>WM: (13859: S1 ^operator O1978)
  12689. 989: O: O1978 (predict-no)
  12690. --- END Decision Phase ---
  12691. --- Application Phase ---
  12692. --- Firing Productions (PE) For State At Depth 1 ---
  12693. --- Inner Elaboration Phase, active level 1 (S1) ---
  12694. Firing apply*operator
  12695. -->
  12696. (I3 ^predict-no N989 + :O )
  12697. Firing apply*operator*complete
  12698. -->
  12699. (I3 ^predict-no N988 - :O )
  12700. inner elaboration loop at bottom goal.
  12701. --- Change Working Memory (PE) ---
  12702. =>WM: (13860: I3 ^predict-no N989)
  12703. <=WM: (13847: N988 ^status complete)
  12704. <=WM: (13846: I3 ^predict-no N988)
  12705. --- Firing Productions (IE) For State At Depth 1 ---
  12706. --- Inner Elaboration Phase, active level 1 (S1) ---
  12707. Firing monitor*world
  12708. -->
  12709. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12710. --- Change Working Memory (IE) ---
  12711. --- END Application Phase ---
  12712. --- Output Phase ---
  12713. ENV: Agent did: predict-no for direction R in state State-B
  12714. In State-B moving R
  12715. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12716. predict error 0
  12717. dir: dir isR
  12718. --- END Output Phase ---
  12719. /|\--- Input Phase ---
  12720. =>WM: (13864: I2 ^dir R)
  12721. =>WM: (13863: I2 ^reward 1)
  12722. =>WM: (13862: I2 ^see 0)
  12723. =>WM: (13861: N989 ^status complete)
  12724. <=WM: (13850: I2 ^dir R)
  12725. <=WM: (13849: I2 ^reward 1)
  12726. <=WM: (13848: I2 ^see 0)
  12727. =>WM: (13865: I2 ^level-1 R0-root)
  12728. <=WM: (13851: I2 ^level-1 R0-root)
  12729. --- END Input Phase ---
  12730. --- Proposal Phase ---
  12731. --- Inner Elaboration Phase, active level 1 (S1) ---
  12732. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12733. -->
  12734. (S1 ^operator O1977 = 0.1664311307472832)
  12735. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12736. -->
  12737. (S1 ^operator O1978 = 0.5523787454722251)
  12738. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12739. -->
  12740. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12741. -->
  12742. Firing elaborate*copy-see-to-output-link
  12743. -->
  12744. (I3 ^see 0 +)
  12745. Firing elaborate*reward*based*on*reward
  12746. -->
  12747. (R993 ^value 1 +)
  12748. (R1 ^reward R993 +)
  12749. Firing propose*predict-yes
  12750. -->
  12751. (O1979 ^name predict-yes +)
  12752. (S1 ^operator O1979 +)
  12753. Firing propose*predict-no
  12754. -->
  12755. (O1980 ^name predict-no +)
  12756. (S1 ^operator O1980 +)
  12757. Firing rl*prefer*rvt*predict-no*H0*4
  12758. -->
  12759. (S1 ^operator O1978 = 0.4476189765316768)
  12760. Firing rl*prefer*rvt*predict-yes*H0*3
  12761. -->
  12762. (S1 ^operator O1977 = 0.1844110446262441)
  12763. Firing prefer*rvt*predict-yes*H0
  12764. -->
  12765. Firing prefer*rvt*predict-no*H0
  12766. -->
  12767. Firing elaborate*copy-dir-to-output-link
  12768. -->
  12769. (I3 ^dir R +)
  12770. inner elaboration loop at bottom goal.
  12771. Retracting elaborate*copy-see-to-output-link
  12772. -->
  12773. (I3 ^see 0 +)
  12774. Retracting propose*predict-no
  12775. -->
  12776. (O1978 ^name predict-no +)
  12777. (S1 ^operator O1978 +)
  12778. Retracting propose*predict-yes
  12779. -->
  12780. (O1977 ^name predict-yes +)
  12781. (S1 ^operator O1977 +)
  12782. Retracting elaborate*reward*based*on*reward
  12783. -->
  12784. (R992 ^value 1 +)
  12785. (R1 ^reward R992 +)
  12786. Retracting elaborate*copy-dir-to-output-link
  12787. -->
  12788. (I3 ^dir R +)
  12789. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12790. -->
  12791. (S1 ^operator O1978 = 0.5523787454722251)
  12792. Retracting rl*prefer*rvt*predict-no*H0*4
  12793. -->
  12794. (S1 ^operator O1978 = 0.4476189765316768)
  12795. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12796. -->
  12797. (S1 ^operator O1977 = 0.1664311307472832)
  12798. Retracting rl*prefer*rvt*predict-yes*H0*3
  12799. -->
  12800. (S1 ^operator O1977 = 0.1844110446262441)
  12801. =>WM: (13871: S1 ^operator O1980 +)
  12802. =>WM: (13870: S1 ^operator O1979 +)
  12803. =>WM: (13869: O1980 ^name predict-no)
  12804. =>WM: (13868: O1979 ^name predict-yes)
  12805. =>WM: (13867: R993 ^value 1)
  12806. =>WM: (13866: R1 ^reward R993)
  12807. <=WM: (13857: S1 ^operator O1977 +)
  12808. <=WM: (13858: S1 ^operator O1978 +)
  12809. <=WM: (13859: S1 ^operator O1978)
  12810. <=WM: (13853: R1 ^reward R992)
  12811. <=WM: (13856: O1978 ^name predict-no)
  12812. <=WM: (13855: O1977 ^name predict-yes)
  12813. <=WM: (13854: R992 ^value 1)
  12814. --- Inner Elaboration Phase, active level 1 (S1) ---
  12815. Firing prefer*rvt*predict-yes*H0
  12816. -->
  12817. Firing rl*prefer*rvt*predict-yes*H0*3
  12818. -->
  12819. (S1 ^operator O1979 = 0.1844110446262441)
  12820. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12821. -->
  12822. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12823. -->
  12824. (S1 ^operator O1979 = 0.1664311307472832)
  12825. Firing prefer*rvt*predict-no*H0
  12826. -->
  12827. Firing rl*prefer*rvt*predict-no*H0*4
  12828. -->
  12829. (S1 ^operator O1980 = 0.4476189765316768)
  12830. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12831. -->
  12832. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12833. -->
  12834. (S1 ^operator O1980 = 0.5523787454722251)
  12835. inner elaboration loop at bottom goal.
  12836. Retracting rl*prefer*rvt*predict-no*H0*4
  12837. -->
  12838. (S1 ^operator O1978 = 0.4476189765316768)
  12839. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12840. -->
  12841. (S1 ^operator O1978 = 0.5523787454722251)
  12842. Retracting rl*prefer*rvt*predict-yes*H0*3
  12843. -->
  12844. (S1 ^operator O1977 = 0.1844110446262441)
  12845. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12846. -->
  12847. (S1 ^operator O1977 = 0.1664311307472832)
  12848. --- END Proposal Phase ---
  12849. --- Decision Phase ---
  12850. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.929134,0.0663667)
  12851. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.552379 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
  12852. =>WM: (13872: S1 ^operator O1980)
  12853. 990: O: O1980 (predict-no)
  12854. --- END Decision Phase ---
  12855. --- Application Phase ---
  12856. --- Firing Productions (PE) For State At Depth 1 ---
  12857. --- Inner Elaboration Phase, active level 1 (S1) ---
  12858. Firing apply*operator
  12859. -->
  12860. (I3 ^predict-no N990 + :O )
  12861. Firing apply*operator*complete
  12862. -->
  12863. (I3 ^predict-no N989 - :O )
  12864. inner elaboration loop at bottom goal.
  12865. --- Change Working Memory (PE) ---
  12866. =>WM: (13873: I3 ^predict-no N990)
  12867. <=WM: (13861: N989 ^status complete)
  12868. <=WM: (13860: I3 ^predict-no N989)
  12869. --- Firing Productions (IE) For State At Depth 1 ---
  12870. --- Inner Elaboration Phase, active level 1 (S1) ---
  12871. Firing monitor*world
  12872. -->
  12873. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12874. --- Change Working Memory (IE) ---
  12875. --- END Application Phase ---
  12876. --- Output Phase ---
  12877. ENV: Agent did: predict-no for direction R in state State-B
  12878. In State-B moving R
  12879. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12880. predict error 0
  12881. dir: dir isL
  12882. --- END Output Phase ---
  12883. -/|\--- Input Phase ---
  12884. =>WM: (13877: I2 ^dir L)
  12885. =>WM: (13876: I2 ^reward 1)
  12886. =>WM: (13875: I2 ^see 0)
  12887. =>WM: (13874: N990 ^status complete)
  12888. <=WM: (13864: I2 ^dir R)
  12889. <=WM: (13863: I2 ^reward 1)
  12890. <=WM: (13862: I2 ^see 0)
  12891. =>WM: (13878: I2 ^level-1 R0-root)
  12892. <=WM: (13865: I2 ^level-1 R0-root)
  12893. --- END Input Phase ---
  12894. --- Proposal Phase ---
  12895. --- Inner Elaboration Phase, active level 1 (S1) ---
  12896. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12897. -->
  12898. (S1 ^operator O1979 = 0.6104613034971749)
  12899. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12900. -->
  12901. (S1 ^operator O1980 = 0.1063475139796038)
  12902. Firing prefer*rvt*predict-no*H0*2*v1*H1
  12903. -->
  12904. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12905. -->
  12906. Firing elaborate*copy-see-to-output-link
  12907. -->
  12908. (I3 ^see 0 +)
  12909. Firing elaborate*reward*based*on*reward
  12910. -->
  12911. (R994 ^value 1 +)
  12912. (R1 ^reward R994 +)
  12913. Firing propose*predict-yes
  12914. -->
  12915. (O1981 ^name predict-yes +)
  12916. (S1 ^operator O1981 +)
  12917. Firing propose*predict-no
  12918. -->
  12919. (O1982 ^name predict-no +)
  12920. (S1 ^operator O1982 +)
  12921. Firing rl*prefer*rvt*predict-no*H0*2
  12922. -->
  12923. (S1 ^operator O1980 = 0.3873369632550164)
  12924. Firing rl*prefer*rvt*predict-yes*H0*1
  12925. -->
  12926. (S1 ^operator O1979 = 0.3895394312063116)
  12927. Firing prefer*rvt*predict-yes*H0
  12928. -->
  12929. Firing prefer*rvt*predict-no*H0
  12930. -->
  12931. Firing elaborate*copy-dir-to-output-link
  12932. -->
  12933. (I3 ^dir L +)
  12934. inner elaboration loop at bottom goal.
  12935. Retracting elaborate*copy-see-to-output-link
  12936. -->
  12937. (I3 ^see 0 +)
  12938. Retracting propose*predict-no
  12939. -->
  12940. (O1980 ^name predict-no +)
  12941. (S1 ^operator O1980 +)
  12942. Retracting propose*predict-yes
  12943. -->
  12944. (O1979 ^name predict-yes +)
  12945. (S1 ^operator O1979 +)
  12946. Retracting elaborate*reward*based*on*reward
  12947. -->
  12948. (R993 ^value 1 +)
  12949. (R1 ^reward R993 +)
  12950. Retracting elaborate*copy-dir-to-output-link
  12951. -->
  12952. (I3 ^dir R +)
  12953. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  12954. -->
  12955. (S1 ^operator O1980 = 0.5523790871716397)
  12956. Retracting rl*prefer*rvt*predict-no*H0*4
  12957. -->
  12958. (S1 ^operator O1980 = 0.4476193182310915)
  12959. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  12960. -->
  12961. (S1 ^operator O1979 = 0.1664311307472832)
  12962. Retracting rl*prefer*rvt*predict-yes*H0*3
  12963. -->
  12964. (S1 ^operator O1979 = 0.1844110446262441)
  12965. =>WM: (13885: S1 ^operator O1982 +)
  12966. =>WM: (13884: S1 ^operator O1981 +)
  12967. =>WM: (13883: I3 ^dir L)
  12968. =>WM: (13882: O1982 ^name predict-no)
  12969. =>WM: (13881: O1981 ^name predict-yes)
  12970. =>WM: (13880: R994 ^value 1)
  12971. =>WM: (13879: R1 ^reward R994)
  12972. <=WM: (13870: S1 ^operator O1979 +)
  12973. <=WM: (13871: S1 ^operator O1980 +)
  12974. <=WM: (13872: S1 ^operator O1980)
  12975. <=WM: (13829: I3 ^dir R)
  12976. <=WM: (13866: R1 ^reward R993)
  12977. <=WM: (13869: O1980 ^name predict-no)
  12978. <=WM: (13868: O1979 ^name predict-yes)
  12979. <=WM: (13867: R993 ^value 1)
  12980. --- Inner Elaboration Phase, active level 1 (S1) ---
  12981. Firing prefer*rvt*predict-yes*H0
  12982. -->
  12983. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  12984. -->
  12985. (S1 ^operator O1981 = 0.6104613034971749)
  12986. Firing rl*prefer*rvt*predict-yes*H0*1
  12987. -->
  12988. (S1 ^operator O1981 = 0.3895394312063116)
  12989. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  12990. -->
  12991. Firing prefer*rvt*predict-no*H0
  12992. -->
  12993. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  12994. -->
  12995. (S1 ^operator O1982 = 0.1063475139796038)
  12996. Firing rl*prefer*rvt*predict-no*H0*2
  12997. -->
  12998. (S1 ^operator O1982 = 0.3873369632550164)
  12999. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13000. -->
  13001. inner elaboration loop at bottom goal.
  13002. Retracting rl*prefer*rvt*predict-no*H0*2
  13003. -->
  13004. (S1 ^operator O1980 = 0.3873369632550164)
  13005. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  13006. -->
  13007. (S1 ^operator O1980 = 0.1063475139796038)
  13008. Retracting rl*prefer*rvt*predict-yes*H0*1
  13009. -->
  13010. (S1 ^operator O1979 = 0.3895394312063116)
  13011. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  13012. -->
  13013. (S1 ^operator O1979 = 0.6104613034971749)
  13014. --- END Proposal Phase ---
  13015. --- Decision Phase ---
  13016. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174913 0.44762(R,m,v=1,0.929687,0.0658834)
  13017. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.552379 -> 0.377466 0.174913 0.552379(R,m,v=1,1,0)
  13018. =>WM: (13886: S1 ^operator O1981)
  13019. 991: O: O1981 (predict-yes)
  13020. --- END Decision Phase ---
  13021. --- Application Phase ---
  13022. --- Firing Productions (PE) For State At Depth 1 ---
  13023. --- Inner Elaboration Phase, active level 1 (S1) ---
  13024. Firing apply*operator
  13025. -->
  13026. (I3 ^predict-yes N991 + :O )
  13027. Firing apply*operator*complete
  13028. -->
  13029. (I3 ^predict-no N990 - :O )
  13030. inner elaboration loop at bottom goal.
  13031. --- Change Working Memory (PE) ---
  13032. =>WM: (13887: I3 ^predict-yes N991)
  13033. <=WM: (13874: N990 ^status complete)
  13034. <=WM: (13873: I3 ^predict-no N990)
  13035. --- Firing Productions (IE) For State At Depth 1 ---
  13036. --- Inner Elaboration Phase, active level 1 (S1) ---
  13037. Firing monitor*world
  13038. -->
  13039. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13040. --- Change Working Memory (IE) ---
  13041. --- END Application Phase ---
  13042. --- Output Phase ---
  13043. ENV: Agent did: predict-yes for direction L in state State-B
  13044. In State-B moving L
  13045. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13046. predict error 0
  13047. dir: dir isR
  13048. --- END Output Phase ---
  13049. ---- Input Phase ---
  13050. =>WM: (13891: I2 ^dir R)
  13051. =>WM: (13890: I2 ^reward 1)
  13052. =>WM: (13889: I2 ^see 1)
  13053. =>WM: (13888: N991 ^status complete)
  13054. <=WM: (13877: I2 ^dir L)
  13055. <=WM: (13876: I2 ^reward 1)
  13056. <=WM: (13875: I2 ^see 0)
  13057. =>WM: (13892: I2 ^level-1 L1-root)
  13058. <=WM: (13878: I2 ^level-1 R0-root)
  13059. --- END Input Phase ---
  13060. --- Proposal Phase ---
  13061. --- Inner Elaboration Phase, active level 1 (S1) ---
  13062. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13063. -->
  13064. (S1 ^operator O1982 = -0.02155734064455064)
  13065. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13066. -->
  13067. (S1 ^operator O1981 = 0.8155802143556325)
  13068. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13069. -->
  13070. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13071. -->
  13072. Firing elaborate*copy-see-to-output-link
  13073. -->
  13074. (I3 ^see 1 +)
  13075. Firing elaborate*reward*based*on*reward
  13076. -->
  13077. (R995 ^value 1 +)
  13078. (R1 ^reward R995 +)
  13079. Firing propose*predict-yes
  13080. -->
  13081. (O1983 ^name predict-yes +)
  13082. (S1 ^operator O1983 +)
  13083. Firing propose*predict-no
  13084. -->
  13085. (O1984 ^name predict-no +)
  13086. (S1 ^operator O1984 +)
  13087. Firing rl*prefer*rvt*predict-no*H0*4
  13088. -->
  13089. (S1 ^operator O1982 = 0.4476195574206818)
  13090. Firing rl*prefer*rvt*predict-yes*H0*3
  13091. -->
  13092. (S1 ^operator O1981 = 0.1844110446262441)
  13093. Firing prefer*rvt*predict-yes*H0
  13094. -->
  13095. Firing prefer*rvt*predict-no*H0
  13096. -->
  13097. Firing elaborate*copy-dir-to-output-link
  13098. -->
  13099. (I3 ^dir R +)
  13100. inner elaboration loop at bottom goal.
  13101. Retracting elaborate*copy-see-to-output-link
  13102. -->
  13103. (I3 ^see 0 +)
  13104. Retracting propose*predict-no
  13105. -->
  13106. (O1982 ^name predict-no +)
  13107. (S1 ^operator O1982 +)
  13108. Retracting propose*predict-yes
  13109. -->
  13110. (O1981 ^name predict-yes +)
  13111. (S1 ^operator O1981 +)
  13112. Retracting elaborate*reward*based*on*reward
  13113. -->
  13114. (R994 ^value 1 +)
  13115. (R1 ^reward R994 +)
  13116. Retracting elaborate*copy-dir-to-output-link
  13117. -->
  13118. (I3 ^dir L +)
  13119. Retracting rl*prefer*rvt*predict-no*H0*2
  13120. -->
  13121. (S1 ^operator O1982 = 0.3873369632550164)
  13122. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  13123. -->
  13124. (S1 ^operator O1982 = 0.1063475139796038)
  13125. Retracting rl*prefer*rvt*predict-yes*H0*1
  13126. -->
  13127. (S1 ^operator O1981 = 0.3895394312063116)
  13128. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  13129. -->
  13130. (S1 ^operator O1981 = 0.6104613034971749)
  13131. =>WM: (13900: S1 ^operator O1984 +)
  13132. =>WM: (13899: S1 ^operator O1983 +)
  13133. =>WM: (13898: I3 ^dir R)
  13134. =>WM: (13897: O1984 ^name predict-no)
  13135. =>WM: (13896: O1983 ^name predict-yes)
  13136. =>WM: (13895: R995 ^value 1)
  13137. =>WM: (13894: R1 ^reward R995)
  13138. =>WM: (13893: I3 ^see 1)
  13139. <=WM: (13884: S1 ^operator O1981 +)
  13140. <=WM: (13886: S1 ^operator O1981)
  13141. <=WM: (13885: S1 ^operator O1982 +)
  13142. <=WM: (13883: I3 ^dir L)
  13143. <=WM: (13879: R1 ^reward R994)
  13144. <=WM: (13852: I3 ^see 0)
  13145. <=WM: (13882: O1982 ^name predict-no)
  13146. <=WM: (13881: O1981 ^name predict-yes)
  13147. <=WM: (13880: R994 ^value 1)
  13148. --- Inner Elaboration Phase, active level 1 (S1) ---
  13149. Firing prefer*rvt*predict-yes*H0
  13150. -->
  13151. Firing rl*prefer*rvt*predict-yes*H0*3
  13152. -->
  13153. (S1 ^operator O1983 = 0.1844110446262441)
  13154. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13155. -->
  13156. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13157. -->
  13158. (S1 ^operator O1983 = 0.8155802143556325)
  13159. Firing prefer*rvt*predict-no*H0
  13160. -->
  13161. Firing rl*prefer*rvt*predict-no*H0*4
  13162. -->
  13163. (S1 ^operator O1984 = 0.4476195574206818)
  13164. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13165. -->
  13166. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13167. -->
  13168. (S1 ^operator O1984 = -0.02155734064455064)
  13169. inner elaboration loop at bottom goal.
  13170. Retracting rl*prefer*rvt*predict-no*H0*4
  13171. -->
  13172. (S1 ^operator O1982 = 0.4476195574206818)
  13173. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13174. -->
  13175. (S1 ^operator O1982 = -0.02155734064455064)
  13176. Retracting rl*prefer*rvt*predict-yes*H0*3
  13177. -->
  13178. (S1 ^operator O1981 = 0.1844110446262441)
  13179. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13180. -->
  13181. (S1 ^operator O1981 = 0.8155802143556325)
  13182. --- END Proposal Phase ---
  13183. --- Decision Phase ---
  13184. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.890909,0.0977827)
  13185. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  13186. =>WM: (13901: S1 ^operator O1983)
  13187. 992: O: O1983 (predict-yes)
  13188. --- END Decision Phase ---
  13189. --- Application Phase ---
  13190. --- Firing Productions (PE) For State At Depth 1 ---
  13191. --- Inner Elaboration Phase, active level 1 (S1) ---
  13192. Firing apply*operator
  13193. -->
  13194. (I3 ^predict-yes N992 + :O )
  13195. Firing apply*operator*complete
  13196. -->
  13197. (I3 ^predict-yes N991 - :O )
  13198. inner elaboration loop at bottom goal.
  13199. --- Change Working Memory (PE) ---
  13200. =>WM: (13902: I3 ^predict-yes N992)
  13201. <=WM: (13888: N991 ^status complete)
  13202. <=WM: (13887: I3 ^predict-yes N991)
  13203. --- Firing Productions (IE) For State At Depth 1 ---
  13204. --- Inner Elaboration Phase, active level 1 (S1) ---
  13205. Firing monitor*world
  13206. -->
  13207. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13208. --- Change Working Memory (IE) ---
  13209. --- END Application Phase ---
  13210. --- Output Phase ---
  13211. ENV: Agent did: predict-yes for direction R in state State-A
  13212. In State-A moving R
  13213. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13214. predict error 0
  13215. dir: dir isL
  13216. --- END Output Phase ---
  13217. /|--- Input Phase ---
  13218. =>WM: (13906: I2 ^dir L)
  13219. =>WM: (13905: I2 ^reward 1)
  13220. =>WM: (13904: I2 ^see 1)
  13221. =>WM: (13903: N992 ^status complete)
  13222. <=WM: (13891: I2 ^dir R)
  13223. <=WM: (13890: I2 ^reward 1)
  13224. <=WM: (13889: I2 ^see 1)
  13225. =>WM: (13907: I2 ^level-1 R1-root)
  13226. <=WM: (13892: I2 ^level-1 L1-root)
  13227. --- END Input Phase ---
  13228. --- Proposal Phase ---
  13229. --- Inner Elaboration Phase, active level 1 (S1) ---
  13230. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13231. -->
  13232. (S1 ^operator O1983 = 0.6104592422684716)
  13233. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13234. -->
  13235. (S1 ^operator O1984 = 0.2714993082286609)
  13236. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13237. -->
  13238. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13239. -->
  13240. Firing elaborate*copy-see-to-output-link
  13241. -->
  13242. (I3 ^see 1 +)
  13243. Firing elaborate*reward*based*on*reward
  13244. -->
  13245. (R996 ^value 1 +)
  13246. (R1 ^reward R996 +)
  13247. Firing propose*predict-yes
  13248. -->
  13249. (O1985 ^name predict-yes +)
  13250. (S1 ^operator O1985 +)
  13251. Firing propose*predict-no
  13252. -->
  13253. (O1986 ^name predict-no +)
  13254. (S1 ^operator O1986 +)
  13255. Firing rl*prefer*rvt*predict-no*H0*2
  13256. -->
  13257. (S1 ^operator O1984 = 0.3873369632550164)
  13258. Firing rl*prefer*rvt*predict-yes*H0*1
  13259. -->
  13260. (S1 ^operator O1983 = 0.3895393210007886)
  13261. Firing prefer*rvt*predict-yes*H0
  13262. -->
  13263. Firing prefer*rvt*predict-no*H0
  13264. -->
  13265. Firing elaborate*copy-dir-to-output-link
  13266. -->
  13267. (I3 ^dir L +)
  13268. inner elaboration loop at bottom goal.
  13269. Retracting elaborate*copy-see-to-output-link
  13270. -->
  13271. (I3 ^see 1 +)
  13272. Retracting propose*predict-no
  13273. -->
  13274. (O1984 ^name predict-no +)
  13275. (S1 ^operator O1984 +)
  13276. Retracting propose*predict-yes
  13277. -->
  13278. (O1983 ^name predict-yes +)
  13279. (S1 ^operator O1983 +)
  13280. Retracting elaborate*reward*based*on*reward
  13281. -->
  13282. (R995 ^value 1 +)
  13283. (R1 ^reward R995 +)
  13284. Retracting elaborate*copy-dir-to-output-link
  13285. -->
  13286. (I3 ^dir R +)
  13287. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13288. -->
  13289. (S1 ^operator O1984 = -0.02155734064455064)
  13290. Retracting rl*prefer*rvt*predict-no*H0*4
  13291. -->
  13292. (S1 ^operator O1984 = 0.4476195574206818)
  13293. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13294. -->
  13295. (S1 ^operator O1983 = 0.8155802143556325)
  13296. Retracting rl*prefer*rvt*predict-yes*H0*3
  13297. -->
  13298. (S1 ^operator O1983 = 0.1844110446262441)
  13299. =>WM: (13914: S1 ^operator O1986 +)
  13300. =>WM: (13913: S1 ^operator O1985 +)
  13301. =>WM: (13912: I3 ^dir L)
  13302. =>WM: (13911: O1986 ^name predict-no)
  13303. =>WM: (13910: O1985 ^name predict-yes)
  13304. =>WM: (13909: R996 ^value 1)
  13305. =>WM: (13908: R1 ^reward R996)
  13306. <=WM: (13899: S1 ^operator O1983 +)
  13307. <=WM: (13901: S1 ^operator O1983)
  13308. <=WM: (13900: S1 ^operator O1984 +)
  13309. <=WM: (13898: I3 ^dir R)
  13310. <=WM: (13894: R1 ^reward R995)
  13311. <=WM: (13897: O1984 ^name predict-no)
  13312. <=WM: (13896: O1983 ^name predict-yes)
  13313. <=WM: (13895: R995 ^value 1)
  13314. --- Inner Elaboration Phase, active level 1 (S1) ---
  13315. Firing prefer*rvt*predict-yes*H0
  13316. -->
  13317. Firing rl*prefer*rvt*predict-yes*H0*1
  13318. -->
  13319. (S1 ^operator O1985 = 0.3895393210007886)
  13320. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13321. -->
  13322. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13323. -->
  13324. (S1 ^operator O1985 = 0.6104592422684716)
  13325. Firing prefer*rvt*predict-no*H0
  13326. -->
  13327. Firing rl*prefer*rvt*predict-no*H0*2
  13328. -->
  13329. (S1 ^operator O1986 = 0.3873369632550164)
  13330. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13331. -->
  13332. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13333. -->
  13334. (S1 ^operator O1986 = 0.2714993082286609)
  13335. inner elaboration loop at bottom goal.
  13336. Retracting rl*prefer*rvt*predict-no*H0*2
  13337. -->
  13338. (S1 ^operator O1984 = 0.3873369632550164)
  13339. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13340. -->
  13341. (S1 ^operator O1984 = 0.2714993082286609)
  13342. Retracting rl*prefer*rvt*predict-yes*H0*1
  13343. -->
  13344. (S1 ^operator O1983 = 0.3895393210007886)
  13345. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13346. -->
  13347. (S1 ^operator O1983 = 0.6104592422684716)
  13348. --- END Proposal Phase ---
  13349. --- Decision Phase ---
  13350. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.89881,0.0914956)
  13351. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324575 0.491005 0.81558 -> 0.324577 0.491005 0.815582(R,m,v=1,1,0)
  13352. =>WM: (13915: S1 ^operator O1985)
  13353. 993: O: O1985 (predict-yes)
  13354. --- END Decision Phase ---
  13355. --- Application Phase ---
  13356. --- Firing Productions (PE) For State At Depth 1 ---
  13357. --- Inner Elaboration Phase, active level 1 (S1) ---
  13358. Firing apply*operator
  13359. -->
  13360. (I3 ^predict-yes N993 + :O )
  13361. Firing apply*operator*complete
  13362. -->
  13363. (I3 ^predict-yes N992 - :O )
  13364. inner elaboration loop at bottom goal.
  13365. --- Change Working Memory (PE) ---
  13366. =>WM: (13916: I3 ^predict-yes N993)
  13367. <=WM: (13903: N992 ^status complete)
  13368. <=WM: (13902: I3 ^predict-yes N992)
  13369. --- Firing Productions (IE) For State At Depth 1 ---
  13370. --- Inner Elaboration Phase, active level 1 (S1) ---
  13371. Firing monitor*world
  13372. -->
  13373. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13374. --- Change Working Memory (IE) ---
  13375. --- END Application Phase ---
  13376. --- Output Phase ---
  13377. ENV: Agent did: predict-yes for direction L in state State-B
  13378. In State-B moving L
  13379. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13380. predict error 0
  13381. dir: dir isR
  13382. --- END Output Phase ---
  13383. \--- Input Phase ---
  13384. =>WM: (13920: I2 ^dir R)
  13385. =>WM: (13919: I2 ^reward 1)
  13386. =>WM: (13918: I2 ^see 1)
  13387. =>WM: (13917: N993 ^status complete)
  13388. <=WM: (13906: I2 ^dir L)
  13389. <=WM: (13905: I2 ^reward 1)
  13390. <=WM: (13904: I2 ^see 1)
  13391. =>WM: (13921: I2 ^level-1 L1-root)
  13392. <=WM: (13907: I2 ^level-1 R1-root)
  13393. --- END Input Phase ---
  13394. --- Proposal Phase ---
  13395. --- Inner Elaboration Phase, active level 1 (S1) ---
  13396. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13397. -->
  13398. (S1 ^operator O1986 = -0.02155734064455064)
  13399. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13400. -->
  13401. (S1 ^operator O1985 = 0.8155815255083509)
  13402. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13403. -->
  13404. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13405. -->
  13406. Firing elaborate*copy-see-to-output-link
  13407. -->
  13408. (I3 ^see 1 +)
  13409. Firing elaborate*reward*based*on*reward
  13410. -->
  13411. (R997 ^value 1 +)
  13412. (R1 ^reward R997 +)
  13413. Firing propose*predict-yes
  13414. -->
  13415. (O1987 ^name predict-yes +)
  13416. (S1 ^operator O1987 +)
  13417. Firing propose*predict-no
  13418. -->
  13419. (O1988 ^name predict-no +)
  13420. (S1 ^operator O1988 +)
  13421. Firing rl*prefer*rvt*predict-no*H0*4
  13422. -->
  13423. (S1 ^operator O1986 = 0.4476195574206818)
  13424. Firing rl*prefer*rvt*predict-yes*H0*3
  13425. -->
  13426. (S1 ^operator O1985 = 0.1844123557789626)
  13427. Firing prefer*rvt*predict-yes*H0
  13428. -->
  13429. Firing prefer*rvt*predict-no*H0
  13430. -->
  13431. Firing elaborate*copy-dir-to-output-link
  13432. -->
  13433. (I3 ^dir R +)
  13434. inner elaboration loop at bottom goal.
  13435. Retracting elaborate*copy-see-to-output-link
  13436. -->
  13437. (I3 ^see 1 +)
  13438. Retracting propose*predict-no
  13439. -->
  13440. (O1986 ^name predict-no +)
  13441. (S1 ^operator O1986 +)
  13442. Retracting propose*predict-yes
  13443. -->
  13444. (O1985 ^name predict-yes +)
  13445. (S1 ^operator O1985 +)
  13446. Retracting elaborate*reward*based*on*reward
  13447. -->
  13448. (R996 ^value 1 +)
  13449. (R1 ^reward R996 +)
  13450. Retracting elaborate*copy-dir-to-output-link
  13451. -->
  13452. (I3 ^dir L +)
  13453. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13454. -->
  13455. (S1 ^operator O1986 = 0.2714993082286609)
  13456. Retracting rl*prefer*rvt*predict-no*H0*2
  13457. -->
  13458. (S1 ^operator O1986 = 0.3873369632550164)
  13459. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13460. -->
  13461. (S1 ^operator O1985 = 0.6104592422684716)
  13462. Retracting rl*prefer*rvt*predict-yes*H0*1
  13463. -->
  13464. (S1 ^operator O1985 = 0.3895393210007886)
  13465. =>WM: (13928: S1 ^operator O1988 +)
  13466. =>WM: (13927: S1 ^operator O1987 +)
  13467. =>WM: (13926: I3 ^dir R)
  13468. =>WM: (13925: O1988 ^name predict-no)
  13469. =>WM: (13924: O1987 ^name predict-yes)
  13470. =>WM: (13923: R997 ^value 1)
  13471. =>WM: (13922: R1 ^reward R997)
  13472. <=WM: (13913: S1 ^operator O1985 +)
  13473. <=WM: (13915: S1 ^operator O1985)
  13474. <=WM: (13914: S1 ^operator O1986 +)
  13475. <=WM: (13912: I3 ^dir L)
  13476. <=WM: (13908: R1 ^reward R996)
  13477. <=WM: (13911: O1986 ^name predict-no)
  13478. <=WM: (13910: O1985 ^name predict-yes)
  13479. <=WM: (13909: R996 ^value 1)
  13480. --- Inner Elaboration Phase, active level 1 (S1) ---
  13481. Firing prefer*rvt*predict-yes*H0
  13482. -->
  13483. Firing rl*prefer*rvt*predict-yes*H0*3
  13484. -->
  13485. (S1 ^operator O1987 = 0.1844123557789626)
  13486. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13487. -->
  13488. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13489. -->
  13490. (S1 ^operator O1987 = 0.8155815255083509)
  13491. Firing prefer*rvt*predict-no*H0
  13492. -->
  13493. Firing rl*prefer*rvt*predict-no*H0*4
  13494. -->
  13495. (S1 ^operator O1988 = 0.4476195574206818)
  13496. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13497. -->
  13498. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13499. -->
  13500. (S1 ^operator O1988 = -0.02155734064455064)
  13501. inner elaboration loop at bottom goal.
  13502. Retracting rl*prefer*rvt*predict-no*H0*4
  13503. -->
  13504. (S1 ^operator O1986 = 0.4476195574206818)
  13505. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13506. -->
  13507. (S1 ^operator O1986 = -0.02155734064455064)
  13508. Retracting rl*prefer*rvt*predict-yes*H0*3
  13509. -->
  13510. (S1 ^operator O1985 = 0.1844123557789626)
  13511. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13512. -->
  13513. (S1 ^operator O1985 = 0.8155815255083509)
  13514. --- END Proposal Phase ---
  13515. --- Decision Phase ---
  13516. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.891566,0.0972618)
  13517. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.610459 -> 0.288049 0.322411 0.610459(R,m,v=1,1,0)
  13518. =>WM: (13929: S1 ^operator O1987)
  13519. 994: O: O1987 (predict-yes)
  13520. --- END Decision Phase ---
  13521. --- Application Phase ---
  13522. --- Firing Productions (PE) For State At Depth 1 ---
  13523. --- Inner Elaboration Phase, active level 1 (S1) ---
  13524. Firing apply*operator
  13525. -->
  13526. (I3 ^predict-yes N994 + :O )
  13527. Firing apply*operator*complete
  13528. -->
  13529. (I3 ^predict-yes N993 - :O )
  13530. inner elaboration loop at bottom goal.
  13531. --- Change Working Memory (PE) ---
  13532. =>WM: (13930: I3 ^predict-yes N994)
  13533. <=WM: (13917: N993 ^status complete)
  13534. <=WM: (13916: I3 ^predict-yes N993)
  13535. --- Firing Productions (IE) For State At Depth 1 ---
  13536. --- Inner Elaboration Phase, active level 1 (S1) ---
  13537. Firing monitor*world
  13538. -->
  13539. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13540. --- Change Working Memory (IE) ---
  13541. --- END Application Phase ---
  13542. --- Output Phase ---
  13543. ENV: Agent did: predict-yes for direction R in state State-A
  13544. In State-A moving R
  13545. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13546. predict error 0
  13547. dir: dir isL
  13548. --- END Output Phase ---
  13549. -/|--- Input Phase ---
  13550. =>WM: (13934: I2 ^dir L)
  13551. =>WM: (13933: I2 ^reward 1)
  13552. =>WM: (13932: I2 ^see 1)
  13553. =>WM: (13931: N994 ^status complete)
  13554. <=WM: (13920: I2 ^dir R)
  13555. <=WM: (13919: I2 ^reward 1)
  13556. <=WM: (13918: I2 ^see 1)
  13557. =>WM: (13935: I2 ^level-1 R1-root)
  13558. <=WM: (13921: I2 ^level-1 L1-root)
  13559. --- END Input Phase ---
  13560. --- Proposal Phase ---
  13561. --- Inner Elaboration Phase, active level 1 (S1) ---
  13562. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13563. -->
  13564. (S1 ^operator O1987 = 0.6104594577780825)
  13565. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13566. -->
  13567. (S1 ^operator O1988 = 0.2714993082286609)
  13568. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13569. -->
  13570. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13571. -->
  13572. Firing elaborate*copy-see-to-output-link
  13573. -->
  13574. (I3 ^see 1 +)
  13575. Firing elaborate*reward*based*on*reward
  13576. -->
  13577. (R998 ^value 1 +)
  13578. (R1 ^reward R998 +)
  13579. Firing propose*predict-yes
  13580. -->
  13581. (O1989 ^name predict-yes +)
  13582. (S1 ^operator O1989 +)
  13583. Firing propose*predict-no
  13584. -->
  13585. (O1990 ^name predict-no +)
  13586. (S1 ^operator O1990 +)
  13587. Firing rl*prefer*rvt*predict-no*H0*2
  13588. -->
  13589. (S1 ^operator O1988 = 0.3873369632550164)
  13590. Firing rl*prefer*rvt*predict-yes*H0*1
  13591. -->
  13592. (S1 ^operator O1987 = 0.3895395365103996)
  13593. Firing prefer*rvt*predict-yes*H0
  13594. -->
  13595. Firing prefer*rvt*predict-no*H0
  13596. -->
  13597. Firing elaborate*copy-dir-to-output-link
  13598. -->
  13599. (I3 ^dir L +)
  13600. inner elaboration loop at bottom goal.
  13601. Retracting elaborate*copy-see-to-output-link
  13602. -->
  13603. (I3 ^see 1 +)
  13604. Retracting propose*predict-no
  13605. -->
  13606. (O1988 ^name predict-no +)
  13607. (S1 ^operator O1988 +)
  13608. Retracting propose*predict-yes
  13609. -->
  13610. (O1987 ^name predict-yes +)
  13611. (S1 ^operator O1987 +)
  13612. Retracting elaborate*reward*based*on*reward
  13613. -->
  13614. (R997 ^value 1 +)
  13615. (R1 ^reward R997 +)
  13616. Retracting elaborate*copy-dir-to-output-link
  13617. -->
  13618. (I3 ^dir R +)
  13619. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  13620. -->
  13621. (S1 ^operator O1988 = -0.02155734064455064)
  13622. Retracting rl*prefer*rvt*predict-no*H0*4
  13623. -->
  13624. (S1 ^operator O1988 = 0.4476195574206818)
  13625. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  13626. -->
  13627. (S1 ^operator O1987 = 0.8155815255083509)
  13628. Retracting rl*prefer*rvt*predict-yes*H0*3
  13629. -->
  13630. (S1 ^operator O1987 = 0.1844123557789626)
  13631. =>WM: (13942: S1 ^operator O1990 +)
  13632. =>WM: (13941: S1 ^operator O1989 +)
  13633. =>WM: (13940: I3 ^dir L)
  13634. =>WM: (13939: O1990 ^name predict-no)
  13635. =>WM: (13938: O1989 ^name predict-yes)
  13636. =>WM: (13937: R998 ^value 1)
  13637. =>WM: (13936: R1 ^reward R998)
  13638. <=WM: (13927: S1 ^operator O1987 +)
  13639. <=WM: (13929: S1 ^operator O1987)
  13640. <=WM: (13928: S1 ^operator O1988 +)
  13641. <=WM: (13926: I3 ^dir R)
  13642. <=WM: (13922: R1 ^reward R997)
  13643. <=WM: (13925: O1988 ^name predict-no)
  13644. <=WM: (13924: O1987 ^name predict-yes)
  13645. <=WM: (13923: R997 ^value 1)
  13646. --- Inner Elaboration Phase, active level 1 (S1) ---
  13647. Firing prefer*rvt*predict-yes*H0
  13648. -->
  13649. Firing rl*prefer*rvt*predict-yes*H0*1
  13650. -->
  13651. (S1 ^operator O1989 = 0.3895395365103996)
  13652. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13653. -->
  13654. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13655. -->
  13656. (S1 ^operator O1989 = 0.6104594577780825)
  13657. Firing prefer*rvt*predict-no*H0
  13658. -->
  13659. Firing rl*prefer*rvt*predict-no*H0*2
  13660. -->
  13661. (S1 ^operator O1990 = 0.3873369632550164)
  13662. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13663. -->
  13664. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13665. -->
  13666. (S1 ^operator O1990 = 0.2714993082286609)
  13667. inner elaboration loop at bottom goal.
  13668. Retracting rl*prefer*rvt*predict-no*H0*2
  13669. -->
  13670. (S1 ^operator O1988 = 0.3873369632550164)
  13671. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13672. -->
  13673. (S1 ^operator O1988 = 0.2714993082286609)
  13674. Retracting rl*prefer*rvt*predict-yes*H0*1
  13675. -->
  13676. (S1 ^operator O1987 = 0.3895395365103996)
  13677. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13678. -->
  13679. (S1 ^operator O1987 = 0.6104594577780825)
  13680. --- END Proposal Phase ---
  13681. --- Decision Phase ---
  13682. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675417 -0.491003 0.184413(R,m,v=1,0.899408,0.0910116)
  13683. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324577 0.491005 0.815582 -> 0.324578 0.491005 0.815582(R,m,v=1,1,0)
  13684. =>WM: (13943: S1 ^operator O1989)
  13685. 995: O: O1989 (predict-yes)
  13686. --- END Decision Phase ---
  13687. --- Application Phase ---
  13688. --- Firing Productions (PE) For State At Depth 1 ---
  13689. --- Inner Elaboration Phase, active level 1 (S1) ---
  13690. Firing apply*operator
  13691. -->
  13692. (I3 ^predict-yes N995 + :O )
  13693. Firing apply*operator*complete
  13694. -->
  13695. (I3 ^predict-yes N994 - :O )
  13696. inner elaboration loop at bottom goal.
  13697. --- Change Working Memory (PE) ---
  13698. =>WM: (13944: I3 ^predict-yes N995)
  13699. <=WM: (13931: N994 ^status complete)
  13700. <=WM: (13930: I3 ^predict-yes N994)
  13701. --- Firing Productions (IE) For State At Depth 1 ---
  13702. --- Inner Elaboration Phase, active level 1 (S1) ---
  13703. Firing monitor*world
  13704. -->
  13705. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13706. --- Change Working Memory (IE) ---
  13707. --- END Application Phase ---
  13708. --- Output Phase ---
  13709. ENV: Agent did: predict-yes for direction L in state State-B
  13710. In State-B moving L
  13711. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13712. predict error 0
  13713. dir: dir isL
  13714. --- END Output Phase ---
  13715. --- Input Phase ---
  13716. =>WM: (13948: I2 ^dir L)
  13717. =>WM: (13947: I2 ^reward 1)
  13718. =>WM: (13946: I2 ^see 1)
  13719. =>WM: (13945: N995 ^status complete)
  13720. <=WM: (13934: I2 ^dir L)
  13721. <=WM: (13933: I2 ^reward 1)
  13722. <=WM: (13932: I2 ^see 1)
  13723. =>WM: (13949: I2 ^level-1 L1-root)
  13724. <=WM: (13935: I2 ^level-1 R1-root)
  13725. --- END Input Phase ---
  13726. --- Proposal Phase ---
  13727. --- Inner Elaboration Phase, active level 1 (S1) ---
  13728. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  13729. -->
  13730. (S1 ^operator O1990 = 0.6126627481603084)
  13731. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  13732. -->
  13733. (S1 ^operator O1989 = -0.02274740735326741)
  13734. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13735. -->
  13736. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13737. -->
  13738. Firing elaborate*copy-see-to-output-link
  13739. -->
  13740. (I3 ^see 1 +)
  13741. Firing elaborate*reward*based*on*reward
  13742. -->
  13743. (R999 ^value 1 +)
  13744. (R1 ^reward R999 +)
  13745. Firing propose*predict-yes
  13746. -->
  13747. (O1991 ^name predict-yes +)
  13748. (S1 ^operator O1991 +)
  13749. Firing propose*predict-no
  13750. -->
  13751. (O1992 ^name predict-no +)
  13752. (S1 ^operator O1992 +)
  13753. Firing rl*prefer*rvt*predict-no*H0*2
  13754. -->
  13755. (S1 ^operator O1990 = 0.3873369632550164)
  13756. Firing rl*prefer*rvt*predict-yes*H0*1
  13757. -->
  13758. (S1 ^operator O1989 = 0.3895395365103996)
  13759. Firing prefer*rvt*predict-yes*H0
  13760. -->
  13761. Firing prefer*rvt*predict-no*H0
  13762. -->
  13763. Firing elaborate*copy-dir-to-output-link
  13764. -->
  13765. (I3 ^dir L +)
  13766. inner elaboration loop at bottom goal.
  13767. Retracting elaborate*copy-see-to-output-link
  13768. -->
  13769. (I3 ^see 1 +)
  13770. Retracting propose*predict-no
  13771. -->
  13772. (O1990 ^name predict-no +)
  13773. (S1 ^operator O1990 +)
  13774. Retracting propose*predict-yes
  13775. -->
  13776. (O1989 ^name predict-yes +)
  13777. (S1 ^operator O1989 +)
  13778. Retracting elaborate*reward*based*on*reward
  13779. -->
  13780. (R998 ^value 1 +)
  13781. (R1 ^reward R998 +)
  13782. Retracting elaborate*copy-dir-to-output-link
  13783. -->
  13784. (I3 ^dir L +)
  13785. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  13786. -->
  13787. (S1 ^operator O1990 = 0.2714993082286609)
  13788. Retracting rl*prefer*rvt*predict-no*H0*2
  13789. -->
  13790. (S1 ^operator O1990 = 0.3873369632550164)
  13791. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  13792. -->
  13793. (S1 ^operator O1989 = 0.6104594577780825)
  13794. Retracting rl*prefer*rvt*predict-yes*H0*1
  13795. -->
  13796. (S1 ^operator O1989 = 0.3895395365103996)
  13797. =>WM: (13955: S1 ^operator O1992 +)
  13798. =>WM: (13954: S1 ^operator O1991 +)
  13799. =>WM: (13953: O1992 ^name predict-no)
  13800. =>WM: (13952: O1991 ^name predict-yes)
  13801. =>WM: (13951: R999 ^value 1)
  13802. =>WM: (13950: R1 ^reward R999)
  13803. <=WM: (13941: S1 ^operator O1989 +)
  13804. <=WM: (13943: S1 ^operator O1989)
  13805. <=WM: (13942: S1 ^operator O1990 +)
  13806. <=WM: (13936: R1 ^reward R998)
  13807. <=WM: (13939: O1990 ^name predict-no)
  13808. <=WM: (13938: O1989 ^name predict-yes)
  13809. <=WM: (13937: R998 ^value 1)
  13810. --- Inner Elaboration Phase, active level 1 (S1) ---
  13811. Firing prefer*rvt*predict-yes*H0
  13812. -->
  13813. Firing rl*prefer*rvt*predict-yes*H0*1
  13814. -->
  13815. (S1 ^operator O1991 = 0.3895395365103996)
  13816. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  13817. -->
  13818. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  13819. -->
  13820. (S1 ^operator O1991 = -0.02274740735326741)
  13821. Firing prefer*rvt*predict-no*H0
  13822. -->
  13823. Firing rl*prefer*rvt*predict-no*H0*2
  13824. -->
  13825. (S1 ^operator O1992 = 0.3873369632550164)
  13826. Firing prefer*rvt*predict-no*H0*2*v1*H1
  13827. -->
  13828. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  13829. -->
  13830. (S1 ^operator O1992 = 0.6126627481603084)
  13831. inner elaboration loop at bottom goal.
  13832. Retracting rl*prefer*rvt*predict-no*H0*2
  13833. -->
  13834. (S1 ^operator O1990 = 0.3873369632550164)
  13835. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  13836. -->
  13837. (S1 ^operator O1990 = 0.6126627481603084)
  13838. Retracting rl*prefer*rvt*predict-yes*H0*1
  13839. -->
  13840. (S1 ^operator O1989 = 0.3895395365103996)
  13841. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  13842. -->
  13843. (S1 ^operator O1989 = -0.02274740735326741)
  13844. --- END Proposal Phase ---
  13845. --- Decision Phase ---
  13846. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.892216,0.0967463)
  13847. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.610459 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  13848. =>WM: (13956: S1 ^operator O1992)
  13849. 996: O: O1992 (predict-no)
  13850. --- END Decision Phase ---
  13851. --- Application Phase ---
  13852. --- Firing Productions (PE) For State At Depth 1 ---
  13853. --- Inner Elaboration Phase, active level 1 (S1) ---
  13854. Firing apply*operator
  13855. -->
  13856. (I3 ^predict-no N996 + :O )
  13857. Firing apply*operator*complete
  13858. -->
  13859. (I3 ^predict-yes N995 - :O )
  13860. inner elaboration loop at bottom goal.
  13861. --- Change Working Memory (PE) ---
  13862. =>WM: (13957: I3 ^predict-no N996)
  13863. <=WM: (13945: N995 ^status complete)
  13864. <=WM: (13944: I3 ^predict-yes N995)
  13865. --- Firing Productions (IE) For State At Depth 1 ---
  13866. --- Inner Elaboration Phase, active level 1 (S1) ---
  13867. Firing monitor*world
  13868. -->
  13869. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13870. --- Change Working Memory (IE) ---
  13871. --- END Application Phase ---
  13872. --- Output Phase ---
  13873. ENV: Agent did: predict-no for direction L in state State-A
  13874. In State-A moving L
  13875. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13876. predict error 0
  13877. dir: dir isR
  13878. --- END Output Phase ---
  13879. --- Input Phase ---
  13880. =>WM: (13961: I2 ^dir R)
  13881. =>WM: (13960: I2 ^reward 1)
  13882. =>WM: (13959: I2 ^see 0)
  13883. =>WM: (13958: N996 ^status complete)
  13884. <=WM: (13948: I2 ^dir L)
  13885. <=WM: (13947: I2 ^reward 1)
  13886. <=WM: (13946: I2 ^see 1)
  13887. =>WM: (13962: I2 ^level-1 L0-root)
  13888. <=WM: (13949: I2 ^level-1 L1-root)
  13889. --- END Input Phase ---
  13890. --- Proposal Phase ---
  13891. --- Inner Elaboration Phase, active level 1 (S1) ---
  13892. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  13893. -->
  13894. (S1 ^operator O1991 = 0.8155947374398671)
  13895. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  13896. -->
  13897. (S1 ^operator O1992 = -0.00558448899823713)
  13898. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13899. -->
  13900. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13901. -->
  13902. Firing elaborate*copy-see-to-output-link
  13903. -->
  13904. (I3 ^see 0 +)
  13905. Firing elaborate*reward*based*on*reward
  13906. -->
  13907. (R1000 ^value 1 +)
  13908. (R1 ^reward R1000 +)
  13909. Firing propose*predict-yes
  13910. -->
  13911. (O1993 ^name predict-yes +)
  13912. (S1 ^operator O1993 +)
  13913. Firing propose*predict-no
  13914. -->
  13915. (O1994 ^name predict-no +)
  13916. (S1 ^operator O1994 +)
  13917. Firing rl*prefer*rvt*predict-no*H0*4
  13918. -->
  13919. (S1 ^operator O1992 = 0.4476195574206818)
  13920. Firing rl*prefer*rvt*predict-yes*H0*3
  13921. -->
  13922. (S1 ^operator O1991 = 0.1844132735858656)
  13923. Firing prefer*rvt*predict-yes*H0
  13924. -->
  13925. Firing prefer*rvt*predict-no*H0
  13926. -->
  13927. Firing elaborate*copy-dir-to-output-link
  13928. -->
  13929. (I3 ^dir R +)
  13930. inner elaboration loop at bottom goal.
  13931. Retracting elaborate*copy-see-to-output-link
  13932. -->
  13933. (I3 ^see 1 +)
  13934. Retracting propose*predict-no
  13935. -->
  13936. (O1992 ^name predict-no +)
  13937. (S1 ^operator O1992 +)
  13938. Retracting propose*predict-yes
  13939. -->
  13940. (O1991 ^name predict-yes +)
  13941. (S1 ^operator O1991 +)
  13942. Retracting elaborate*reward*based*on*reward
  13943. -->
  13944. (R999 ^value 1 +)
  13945. (R1 ^reward R999 +)
  13946. Retracting elaborate*copy-dir-to-output-link
  13947. -->
  13948. (I3 ^dir L +)
  13949. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  13950. -->
  13951. (S1 ^operator O1992 = 0.6126627481603084)
  13952. Retracting rl*prefer*rvt*predict-no*H0*2
  13953. -->
  13954. (S1 ^operator O1992 = 0.3873369632550164)
  13955. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  13956. -->
  13957. (S1 ^operator O1991 = -0.02274740735326741)
  13958. Retracting rl*prefer*rvt*predict-yes*H0*1
  13959. -->
  13960. (S1 ^operator O1991 = 0.3895396873671274)
  13961. =>WM: (13970: S1 ^operator O1994 +)
  13962. =>WM: (13969: S1 ^operator O1993 +)
  13963. =>WM: (13968: I3 ^dir R)
  13964. =>WM: (13967: O1994 ^name predict-no)
  13965. =>WM: (13966: O1993 ^name predict-yes)
  13966. =>WM: (13965: R1000 ^value 1)
  13967. =>WM: (13964: R1 ^reward R1000)
  13968. =>WM: (13963: I3 ^see 0)
  13969. <=WM: (13954: S1 ^operator O1991 +)
  13970. <=WM: (13955: S1 ^operator O1992 +)
  13971. <=WM: (13956: S1 ^operator O1992)
  13972. <=WM: (13940: I3 ^dir L)
  13973. <=WM: (13950: R1 ^reward R999)
  13974. <=WM: (13893: I3 ^see 1)
  13975. <=WM: (13953: O1992 ^name predict-no)
  13976. <=WM: (13952: O1991 ^name predict-yes)
  13977. <=WM: (13951: R999 ^value 1)
  13978. --- Inner Elaboration Phase, active level 1 (S1) ---
  13979. Firing prefer*rvt*predict-yes*H0
  13980. -->
  13981. Firing rl*prefer*rvt*predict-yes*H0*3
  13982. -->
  13983. (S1 ^operator O1993 = 0.1844132735858656)
  13984. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13985. -->
  13986. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  13987. -->
  13988. (S1 ^operator O1993 = 0.8155947374398671)
  13989. Firing prefer*rvt*predict-no*H0
  13990. -->
  13991. Firing rl*prefer*rvt*predict-no*H0*4
  13992. -->
  13993. (S1 ^operator O1994 = 0.4476195574206818)
  13994. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13995. -->
  13996. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  13997. -->
  13998. (S1 ^operator O1994 = -0.00558448899823713)
  13999. inner elaboration loop at bottom goal.
  14000. Retracting rl*prefer*rvt*predict-no*H0*4
  14001. -->
  14002. (S1 ^operator O1992 = 0.4476195574206818)
  14003. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14004. -->
  14005. (S1 ^operator O1992 = -0.00558448899823713)
  14006. Retracting rl*prefer*rvt*predict-yes*H0*3
  14007. -->
  14008. (S1 ^operator O1991 = 0.1844132735858656)
  14009. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14010. -->
  14011. (S1 ^operator O1991 = 0.8155947374398671)
  14012. --- END Proposal Phase ---
  14013. --- Decision Phase ---
  14014. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.931818,0.0638961)
  14015. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  14016. =>WM: (13971: S1 ^operator O1993)
  14017. 997: O: O1993 (predict-yes)
  14018. --- END Decision Phase ---
  14019. --- Application Phase ---
  14020. --- Firing Productions (PE) For State At Depth 1 ---
  14021. --- Inner Elaboration Phase, active level 1 (S1) ---
  14022. Firing apply*operator
  14023. -->
  14024. (I3 ^predict-yes N997 + :O )
  14025. Firing apply*operator*complete
  14026. -->
  14027. (I3 ^predict-no N996 - :O )
  14028. inner elaboration loop at bottom goal.
  14029. --- Change Working Memory (PE) ---
  14030. =>WM: (13972: I3 ^predict-yes N997)
  14031. <=WM: (13958: N996 ^status complete)
  14032. <=WM: (13957: I3 ^predict-no N996)
  14033. --- Firing Productions (IE) For State At Depth 1 ---
  14034. --- Inner Elaboration Phase, active level 1 (S1) ---
  14035. Firing monitor*world
  14036. -->
  14037. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14038. --- Change Working Memory (IE) ---
  14039. --- END Application Phase ---
  14040. --- Output Phase ---
  14041. ENV: Agent did: predict-yes for direction R in state State-A
  14042. In State-A moving R
  14043. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14044. predict error 0
  14045. dir: dir isR
  14046. --- END Output Phase ---
  14047. --- Input Phase ---
  14048. =>WM: (13976: I2 ^dir R)
  14049. =>WM: (13975: I2 ^reward 1)
  14050. =>WM: (13974: I2 ^see 1)
  14051. =>WM: (13973: N997 ^status complete)
  14052. <=WM: (13961: I2 ^dir R)
  14053. <=WM: (13960: I2 ^reward 1)
  14054. <=WM: (13959: I2 ^see 0)
  14055. =>WM: (13977: I2 ^level-1 R1-root)
  14056. <=WM: (13962: I2 ^level-1 L0-root)
  14057. --- END Input Phase ---
  14058. --- Proposal Phase ---
  14059. --- Inner Elaboration Phase, active level 1 (S1) ---
  14060. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  14061. -->
  14062. (S1 ^operator O1993 = 0.1398795999120246)
  14063. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  14064. -->
  14065. (S1 ^operator O1994 = 0.5523820607022403)
  14066. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14067. -->
  14068. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14069. -->
  14070. Firing elaborate*copy-see-to-output-link
  14071. -->
  14072. (I3 ^see 1 +)
  14073. Firing elaborate*reward*based*on*reward
  14074. -->
  14075. (R1001 ^value 1 +)
  14076. (R1 ^reward R1001 +)
  14077. Firing propose*predict-yes
  14078. -->
  14079. (O1995 ^name predict-yes +)
  14080. (S1 ^operator O1995 +)
  14081. Firing propose*predict-no
  14082. -->
  14083. (O1996 ^name predict-no +)
  14084. (S1 ^operator O1996 +)
  14085. Firing rl*prefer*rvt*predict-no*H0*4
  14086. -->
  14087. (S1 ^operator O1994 = 0.4476195574206818)
  14088. Firing rl*prefer*rvt*predict-yes*H0*3
  14089. -->
  14090. (S1 ^operator O1993 = 0.1844132735858656)
  14091. Firing prefer*rvt*predict-yes*H0
  14092. -->
  14093. Firing prefer*rvt*predict-no*H0
  14094. -->
  14095. Firing elaborate*copy-dir-to-output-link
  14096. -->
  14097. (I3 ^dir R +)
  14098. inner elaboration loop at bottom goal.
  14099. Retracting elaborate*copy-see-to-output-link
  14100. -->
  14101. (I3 ^see 0 +)
  14102. Retracting propose*predict-no
  14103. -->
  14104. (O1994 ^name predict-no +)
  14105. (S1 ^operator O1994 +)
  14106. Retracting propose*predict-yes
  14107. -->
  14108. (O1993 ^name predict-yes +)
  14109. (S1 ^operator O1993 +)
  14110. Retracting elaborate*reward*based*on*reward
  14111. -->
  14112. (R1000 ^value 1 +)
  14113. (R1 ^reward R1000 +)
  14114. Retracting elaborate*copy-dir-to-output-link
  14115. -->
  14116. (I3 ^dir R +)
  14117. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14118. -->
  14119. (S1 ^operator O1994 = -0.00558448899823713)
  14120. Retracting rl*prefer*rvt*predict-no*H0*4
  14121. -->
  14122. (S1 ^operator O1994 = 0.4476195574206818)
  14123. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14124. -->
  14125. (S1 ^operator O1993 = 0.8155947374398671)
  14126. Retracting rl*prefer*rvt*predict-yes*H0*3
  14127. -->
  14128. (S1 ^operator O1993 = 0.1844132735858656)
  14129. =>WM: (13984: S1 ^operator O1996 +)
  14130. =>WM: (13983: S1 ^operator O1995 +)
  14131. =>WM: (13982: O1996 ^name predict-no)
  14132. =>WM: (13981: O1995 ^name predict-yes)
  14133. =>WM: (13980: R1001 ^value 1)
  14134. =>WM: (13979: R1 ^reward R1001)
  14135. =>WM: (13978: I3 ^see 1)
  14136. <=WM: (13969: S1 ^operator O1993 +)
  14137. <=WM: (13971: S1 ^operator O1993)
  14138. <=WM: (13970: S1 ^operator O1994 +)
  14139. <=WM: (13964: R1 ^reward R1000)
  14140. <=WM: (13963: I3 ^see 0)
  14141. <=WM: (13967: O1994 ^name predict-no)
  14142. <=WM: (13966: O1993 ^name predict-yes)
  14143. <=WM: (13965: R1000 ^value 1)
  14144. --- Inner Elaboration Phase, active level 1 (S1) ---
  14145. Firing prefer*rvt*predict-yes*H0
  14146. -->
  14147. Firing rl*prefer*rvt*predict-yes*H0*3
  14148. -->
  14149. (S1 ^operator O1995 = 0.1844132735858656)
  14150. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14151. -->
  14152. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  14153. -->
  14154. (S1 ^operator O1995 = 0.1398795999120246)
  14155. Firing prefer*rvt*predict-no*H0
  14156. -->
  14157. Firing rl*prefer*rvt*predict-no*H0*4
  14158. -->
  14159. (S1 ^operator O1996 = 0.4476195574206818)
  14160. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14161. -->
  14162. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  14163. -->
  14164. (S1 ^operator O1996 = 0.5523820607022403)
  14165. inner elaboration loop at bottom goal.
  14166. Retracting rl*prefer*rvt*predict-no*H0*4
  14167. -->
  14168. (S1 ^operator O1994 = 0.4476195574206818)
  14169. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  14170. -->
  14171. (S1 ^operator O1994 = 0.5523820607022403)
  14172. Retracting rl*prefer*rvt*predict-yes*H0*3
  14173. -->
  14174. (S1 ^operator O1993 = 0.1844132735858656)
  14175. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  14176. -->
  14177. (S1 ^operator O1993 = 0.1398795999120246)
  14178. --- END Proposal Phase ---
  14179. --- Decision Phase ---
  14180. RL update rl*prefer*rvt*predict-yes*H0*3 0.675417 -0.491003 0.184413 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.9,0.0905325)
  14181. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324594 0.491001 0.815595 -> 0.324592 0.491001 0.815594(R,m,v=1,1,0)
  14182. =>WM: (13985: S1 ^operator O1996)
  14183. 998: O: O1996 (predict-no)
  14184. --- END Decision Phase ---
  14185. --- Application Phase ---
  14186. --- Firing Productions (PE) For State At Depth 1 ---
  14187. --- Inner Elaboration Phase, active level 1 (S1) ---
  14188. Firing apply*operator
  14189. -->
  14190. (I3 ^predict-no N998 + :O )
  14191. Firing apply*operator*complete
  14192. -->
  14193. (I3 ^predict-yes N997 - :O )
  14194. inner elaboration loop at bottom goal.
  14195. --- Change Working Memory (PE) ---
  14196. =>WM: (13986: I3 ^predict-no N998)
  14197. <=WM: (13973: N997 ^status complete)
  14198. <=WM: (13972: I3 ^predict-yes N997)
  14199. --- Firing Productions (IE) For State At Depth 1 ---
  14200. --- Inner Elaboration Phase, active level 1 (S1) ---
  14201. Firing monitor*world
  14202. -->
  14203. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14204. --- Change Working Memory (IE) ---
  14205. --- END Application Phase ---
  14206. --- Output Phase ---
  14207. ENV: Agent did: predict-no for direction R in state State-B
  14208. In State-B moving R
  14209. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14210. predict error 0
  14211. dir: dir isL
  14212. --- END Output Phase ---
  14213. --- Input Phase ---
  14214. =>WM: (13990: I2 ^dir L)
  14215. =>WM: (13989: I2 ^reward 1)
  14216. =>WM: (13988: I2 ^see 0)
  14217. =>WM: (13987: N998 ^status complete)
  14218. <=WM: (13976: I2 ^dir R)
  14219. <=WM: (13975: I2 ^reward 1)
  14220. <=WM: (13974: I2 ^see 1)
  14221. =>WM: (13991: I2 ^level-1 R0-root)
  14222. <=WM: (13977: I2 ^level-1 R1-root)
  14223. --- END Input Phase ---
  14224. --- Proposal Phase ---
  14225. --- Inner Elaboration Phase, active level 1 (S1) ---
  14226. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  14227. -->
  14228. (S1 ^operator O1995 = 0.6104611932916519)
  14229. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  14230. -->
  14231. (S1 ^operator O1996 = 0.1063475139796038)
  14232. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14233. -->
  14234. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14235. -->
  14236. Firing elaborate*copy-see-to-output-link
  14237. -->
  14238. (I3 ^see 0 +)
  14239. Firing elaborate*reward*based*on*reward
  14240. -->
  14241. (R1002 ^value 1 +)
  14242. (R1 ^reward R1002 +)
  14243. Firing propose*predict-yes
  14244. -->
  14245. (O1997 ^name predict-yes +)
  14246. (S1 ^operator O1997 +)
  14247. Firing propose*predict-no
  14248. -->
  14249. (O1998 ^name predict-no +)
  14250. (S1 ^operator O1998 +)
  14251. Firing rl*prefer*rvt*predict-no*H0*2
  14252. -->
  14253. (S1 ^operator O1996 = 0.3873370065427176)
  14254. Firing rl*prefer*rvt*predict-yes*H0*1
  14255. -->
  14256. (S1 ^operator O1995 = 0.3895396873671274)
  14257. Firing prefer*rvt*predict-yes*H0
  14258. -->
  14259. Firing prefer*rvt*predict-no*H0
  14260. -->
  14261. Firing elaborate*copy-dir-to-output-link
  14262. -->
  14263. (I3 ^dir L +)
  14264. inner elaboration loop at bottom goal.
  14265. Retracting elaborate*copy-see-to-output-link
  14266. -->
  14267. (I3 ^see 1 +)
  14268. Retracting propose*predict-no
  14269. -->
  14270. (O1996 ^name predict-no +)
  14271. (S1 ^operator O1996 +)
  14272. Retracting propose*predict-yes
  14273. -->
  14274. (O1995 ^name predict-yes +)
  14275. (S1 ^operator O1995 +)
  14276. Retracting elaborate*reward*based*on*reward
  14277. -->
  14278. (R1001 ^value 1 +)
  14279. (R1 ^reward R1001 +)
  14280. Retracting elaborate*copy-dir-to-output-link
  14281. -->
  14282. (I3 ^dir R +)
  14283. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  14284. -->
  14285. (S1 ^operator O1996 = 0.5523820607022403)
  14286. Retracting rl*prefer*rvt*predict-no*H0*4
  14287. -->
  14288. (S1 ^operator O1996 = 0.4476195574206818)
  14289. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  14290. -->
  14291. (S1 ^operator O1995 = 0.1398795999120246)
  14292. Retracting rl*prefer*rvt*predict-yes*H0*3
  14293. -->
  14294. (S1 ^operator O1995 = 0.1844120719320057)
  14295. =>WM: (13999: S1 ^operator O1998 +)
  14296. =>WM: (13998: S1 ^operator O1997 +)
  14297. =>WM: (13997: I3 ^dir L)
  14298. =>WM: (13996: O1998 ^name predict-no)
  14299. =>WM: (13995: O1997 ^name predict-yes)
  14300. =>WM: (13994: R1002 ^value 1)
  14301. =>WM: (13993: R1 ^reward R1002)
  14302. =>WM: (13992: I3 ^see 0)
  14303. <=WM: (13983: S1 ^operator O1995 +)
  14304. <=WM: (13984: S1 ^operator O1996 +)
  14305. <=WM: (13985: S1 ^operator O1996)
  14306. <=WM: (13968: I3 ^dir R)
  14307. <=WM: (13979: R1 ^reward R1001)
  14308. <=WM: (13978: I3 ^see 1)
  14309. <=WM: (13982: O1996 ^name predict-no)
  14310. <=WM: (13981: O1995 ^name predict-yes)
  14311. <=WM: (13980: R1001 ^value 1)
  14312. --- Inner Elaboration Phase, active level 1 (S1) ---
  14313. Firing prefer*rvt*predict-yes*H0
  14314. -->
  14315. Firing rl*prefer*rvt*predict-yes*H0*1
  14316. -->
  14317. (S1 ^operator O1997 = 0.3895396873671274)
  14318. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14319. -->
  14320. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  14321. -->
  14322. (S1 ^operator O1997 = 0.6104611932916519)
  14323. Firing prefer*rvt*predict-no*H0
  14324. -->
  14325. Firing rl*prefer*rvt*predict-no*H0*2
  14326. -->
  14327. (S1 ^operator O1998 = 0.3873370065427176)
  14328. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14329. -->
  14330. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  14331. -->
  14332. (S1 ^operator O1998 = 0.1063475139796038)
  14333. inner elaboration loop at bottom goal.
  14334. Retracting rl*prefer*rvt*predict-no*H0*2
  14335. -->
  14336. (S1 ^operator O1996 = 0.3873370065427176)
  14337. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  14338. -->
  14339. (S1 ^operator O1996 = 0.1063475139796038)
  14340. Retracting rl*prefer*rvt*predict-yes*H0*1
  14341. -->
  14342. (S1 ^operator O1995 = 0.3895396873671274)
  14343. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  14344. -->
  14345. (S1 ^operator O1995 = 0.6104611932916519)
  14346. --- END Proposal Phase ---
  14347. --- Decision Phase ---
  14348. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174913 0.44762 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.930233,0.065407)
  14349. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
  14350. =>WM: (14000: S1 ^operator O1997)
  14351. 999: O: O1997 (predict-yes)
  14352. --- END Decision Phase ---
  14353. --- Application Phase ---
  14354. --- Firing Productions (PE) For State At Depth 1 ---
  14355. --- Inner Elaboration Phase, active level 1 (S1) ---
  14356. Firing apply*operator
  14357. -->
  14358. (I3 ^predict-yes N999 + :O )
  14359. Firing apply*operator*complete
  14360. -->
  14361. (I3 ^predict-no N998 - :O )
  14362. inner elaboration loop at bottom goal.
  14363. --- Change Working Memory (PE) ---
  14364. =>WM: (14001: I3 ^predict-yes N999)
  14365. <=WM: (13987: N998 ^status complete)
  14366. <=WM: (13986: I3 ^predict-no N998)
  14367. --- Firing Productions (IE) For State At Depth 1 ---
  14368. --- Inner Elaboration Phase, active level 1 (S1) ---
  14369. Firing monitor*world
  14370. -->
  14371. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14372. --- Change Working Memory (IE) ---
  14373. --- END Application Phase ---
  14374. --- Output Phase ---
  14375. ENV: Agent did: predict-yes for direction L in state State-B
  14376. In State-B moving L
  14377. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14378. predict error 0
  14379. dir: dir isR
  14380. --- END Output Phase ---
  14381. --- Input Phase ---
  14382. =>WM: (14005: I2 ^dir R)
  14383. =>WM: (14004: I2 ^reward 1)
  14384. =>WM: (14003: I2 ^see 1)
  14385. =>WM: (14002: N999 ^status complete)
  14386. <=WM: (13990: I2 ^dir L)
  14387. <=WM: (13989: I2 ^reward 1)
  14388. <=WM: (13988: I2 ^see 0)
  14389. =>WM: (14006: I2 ^level-1 L1-root)
  14390. <=WM: (13991: I2 ^level-1 R0-root)
  14391. --- END Input Phase ---
  14392. --- Proposal Phase ---
  14393. --- Inner Elaboration Phase, active level 1 (S1) ---
  14394. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  14395. -->
  14396. (S1 ^operator O1998 = -0.02155734064455064)
  14397. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  14398. -->
  14399. (S1 ^operator O1997 = 0.815582443315254)
  14400. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14401. -->
  14402. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14403. -->
  14404. Firing elaborate*copy-see-to-output-link
  14405. -->
  14406. (I3 ^see 1 +)
  14407. Firing elaborate*reward*based*on*reward
  14408. -->
  14409. (R1003 ^value 1 +)
  14410. (R1 ^reward R1003 +)
  14411. Firing propose*predict-yes
  14412. -->
  14413. (O1999 ^name predict-yes +)
  14414. (S1 ^operator O1999 +)
  14415. Firing propose*predict-no
  14416. -->
  14417. (O2000 ^name predict-no +)
  14418. (S1 ^operator O2000 +)
  14419. Firing rl*prefer*rvt*predict-no*H0*4
  14420. -->
  14421. (S1 ^operator O1998 = 0.4476193147022436)
  14422. Firing rl*prefer*rvt*predict-yes*H0*3
  14423. -->
  14424. (S1 ^operator O1997 = 0.1844120719320057)
  14425. Firing prefer*rvt*predict-yes*H0
  14426. -->
  14427. Firing prefer*rvt*predict-no*H0
  14428. -->
  14429. Firing elaborate*copy-dir-to-output-link
  14430. -->
  14431. (I3 ^dir R +)
  14432. inner elaboration loop at bottom goal.
  14433. Retracting elaborate*copy-see-to-output-link
  14434. -->
  14435. (I3 ^see 0 +)
  14436. Retracting propose*predict-no
  14437. -->
  14438. (O1998 ^name predict-no +)
  14439. (S1 ^operator O1998 +)
  14440. Retracting propose*predict-yes
  14441. -->
  14442. (O1997 ^name predict-yes +)
  14443. (S1 ^operator O1997 +)
  14444. Retracting elaborate*reward*based*on*reward
  14445. -->
  14446. (R1002 ^value 1 +)
  14447. (R1 ^reward R1002 +)
  14448. Retracting elaborate*copy-dir-to-output-link
  14449. -->
  14450. (I3 ^dir L +)
  14451. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  14452. -->
  14453. (S1 ^operator O1998 = 0.1063475139796038)
  14454. Retracting rl*prefer*rvt*predict-no*H0*2
  14455. -->
  14456. (S1 ^operator O1998 = 0.3873370065427176)
  14457. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  14458. -->
  14459. (S1 ^operator O1997 = 0.6104611932916519)
  14460. Retracting rl*prefer*rvt*predict-yes*H0*1
  14461. -->
  14462. (S1 ^operator O1997 = 0.3895396873671274)
  14463. =>WM: (14014: S1 ^operator O2000 +)
  14464. =>WM: (14013: S1 ^operator O1999 +)
  14465. =>WM: (14012: I3 ^dir R)
  14466. =>WM: (14011: O2000 ^name predict-no)
  14467. =>WM: (14010: O1999 ^name predict-yes)
  14468. =>WM: (14009: R1003 ^value 1)
  14469. =>WM: (14008: R1 ^reward R1003)
  14470. =>WM: (14007: I3 ^see 1)
  14471. <=WM: (13998: S1 ^operator O1997 +)
  14472. <=WM: (14000: S1 ^operator O1997)
  14473. <=WM: (13999: S1 ^operator O1998 +)
  14474. <=WM: (13997: I3 ^dir L)
  14475. <=WM: (13993: R1 ^reward R1002)
  14476. <=WM: (13992: I3 ^see 0)
  14477. <=WM: (13996: O1998 ^name predict-no)
  14478. <=WM: (13995: O1997 ^name predict-yes)
  14479. <=WM: (13994: R1002 ^value 1)
  14480. --- Inner Elaboration Phase, active level 1 (S1) ---
  14481. Firing prefer*rvt*predict-yes*H0
  14482. -->
  14483. Firing rl*prefer*rvt*predict-yes*H0*3
  14484. -->
  14485. (S1 ^operator O1999 = 0.1844120719320057)
  14486. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14487. -->
  14488. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  14489. -->
  14490. (S1 ^operator O1999 = 0.815582443315254)
  14491. Firing prefer*rvt*predict-no*H0
  14492. -->
  14493. Firing rl*prefer*rvt*predict-no*H0*4
  14494. -->
  14495. (S1 ^operator O2000 = 0.4476193147022436)
  14496. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14497. -->
  14498. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  14499. -->
  14500. (S1 ^operator O2000 = -0.02155734064455064)
  14501. inner elaboration loop at bottom goal.
  14502. Retracting rl*prefer*rvt*predict-no*H0*4
  14503. -->
  14504. (S1 ^operator O1998 = 0.4476193147022436)
  14505. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  14506. -->
  14507. (S1 ^operator O1998 = -0.02155734064455064)
  14508. Retracting rl*prefer*rvt*predict-yes*H0*3
  14509. -->
  14510. (S1 ^operator O1997 = 0.1844120719320057)
  14511. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  14512. -->
  14513. (S1 ^operator O1997 = 0.815582443315254)
  14514. --- END Proposal Phase ---
  14515. --- Decision Phase ---
  14516. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.892857,0.0962361)
  14517. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  14518. =>WM: (14015: S1 ^operator O1999)
  14519. 1000: O: O1999 (predict-yes)
  14520. --- END Decision Phase ---
  14521. --- Application Phase ---
  14522. --- Firing Productions (PE) For State At Depth 1 ---
  14523. --- Inner Elaboration Phase, active level 1 (S1) ---
  14524. Firing apply*operator
  14525. -->
  14526. (I3 ^predict-yes N1000 + :O )
  14527. Firing apply*operator*complete
  14528. -->
  14529. (I3 ^predict-yes N999 - :O )
  14530. inner elaboration loop at bottom goal.
  14531. --- Change Working Memory (PE) ---
  14532. =>WM: (14016: I3 ^predict-yes N1000)
  14533. <=WM: (14002: N999 ^status complete)
  14534. <=WM: (14001: I3 ^predict-yes N999)
  14535. --- Firing Productions (IE) For State At Depth 1 ---
  14536. --- Inner Elaboration Phase, active level 1 (S1) ---
  14537. Firing monitor*world
  14538. -->
  14539. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14540. --- Change Working Memory (IE) ---
  14541. --- END Application Phase ---
  14542. --- Output Phase ---
  14543. ENV: Agent did: predict-yes for direction R in state State-A
  14544. In State-A moving R
  14545. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14546. predict error 0
  14547. dir: dir isU
  14548. --- END Output Phase ---
  14549. |\-/|\-/|\---- Input Phase ---
  14550. =>WM: (14020: I2 ^dir U)
  14551. =>WM: (14019: I2 ^reward 1)
  14552. =>WM: (14018: I2 ^see 1)
  14553. =>WM: (14017: N1000 ^status complete)
  14554. <=WM: (14005: I2 ^dir R)
  14555. <=WM: (14004: I2 ^reward 1)
  14556. <=WM: (14003: I2 ^see 1)
  14557. =>WM: (14021: I2 ^level-1 R1-root)
  14558. <=WM: (14006: I2 ^level-1 L1-root)
  14559. --- END Input Phase ---
  14560. --- Proposal Phase ---
  14561. --- Inner Elaboration Phase, active level 1 (S1) ---
  14562. Firing elaborate*copy-see-to-output-link
  14563. -->
  14564. (I3 ^see 1 +)
  14565. Firing elaborate*reward*based*on*reward
  14566. -->
  14567. (R1004 ^value 1 +)
  14568. (R1 ^reward R1004 +)
  14569. Firing propose*predict-yes
  14570. -->
  14571. (O2001 ^name predict-yes +)
  14572. (S1 ^operator O2001 +)
  14573. Firing propose*predict-no
  14574. -->
  14575. (O2002 ^name predict-no +)
  14576. (S1 ^operator O2002 +)
  14577. Firing rl*prefer*rvt*predict-no*H0*6
  14578. -->
  14579. (S1 ^operator O2000 = 0.9999999999999999)
  14580. Firing rl*prefer*rvt*predict-yes*H0*5
  14581. -->
  14582. (S1 ^operator O1999 = 0.)
  14583. Firing prefer*rvt*predict-yes*H0
  14584. -->
  14585. Firing prefer*rvt*predict-no*H0
  14586. -->
  14587. Firing elaborate*copy-dir-to-output-link
  14588. -->
  14589. (I3 ^dir U +)
  14590. inner elaboration loop at bottom goal.
  14591. Retracting elaborate*copy-see-to-output-link
  14592. -->
  14593. (I3 ^see 1 +)
  14594. Retracting propose*predict-no
  14595. -->
  14596. (O2000 ^name predict-no +)
  14597. (S1 ^operator O2000 +)
  14598. Retracting propose*predict-yes
  14599. -->
  14600. (O1999 ^name predict-yes +)
  14601. (S1 ^operator O1999 +)
  14602. Retracting elaborate*reward*based*on*reward
  14603. -->
  14604. (R1003 ^value 1 +)
  14605. (R1 ^reward R1003 +)
  14606. Retracting elaborate*copy-dir-to-output-link
  14607. -->
  14608. (I3 ^dir R +)
  14609. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  14610. -->
  14611. (S1 ^operator O2000 = -0.02155734064455064)
  14612. Retracting rl*prefer*rvt*predict-no*H0*4
  14613. -->
  14614. (S1 ^operator O2000 = 0.4476193147022436)
  14615. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  14616. -->
  14617. (S1 ^operator O1999 = 0.815582443315254)
  14618. Retracting rl*prefer*rvt*predict-yes*H0*3
  14619. -->
  14620. (S1 ^operator O1999 = 0.1844120719320057)
  14621. =>WM: (14028: S1 ^operator O2002 +)
  14622. =>WM: (14027: S1 ^operator O2001 +)
  14623. =>WM: (14026: I3 ^dir U)
  14624. =>WM: (14025: O2002 ^name predict-no)
  14625. =>WM: (14024: O2001 ^name predict-yes)
  14626. =>WM: (14023: R1004 ^value 1)
  14627. =>WM: (14022: R1 ^reward R1004)
  14628. <=WM: (14013: S1 ^operator O1999 +)
  14629. <=WM: (14015: S1 ^operator O1999)
  14630. <=WM: (14014: S1 ^operator O2000 +)
  14631. <=WM: (14012: I3 ^dir R)
  14632. <=WM: (14008: R1 ^reward R1003)
  14633. <=WM: (14011: O2000 ^name predict-no)
  14634. <=WM: (14010: O1999 ^name predict-yes)
  14635. <=WM: (14009: R1003 ^value 1)
  14636. --- Inner Elaboration Phase, active level 1 (S1) ---
  14637. Firing prefer*rvt*predict-yes*H0
  14638. -->
  14639. Firing rl*prefer*rvt*predict-yes*H0*5
  14640. -->
  14641. (S1 ^operator O2001 = 0.)
  14642. Firing prefer*rvt*predict-no*H0
  14643. -->
  14644. Firing rl*prefer*rvt*predict-no*H0*6
  14645. -->
  14646. (S1 ^operator O2002 = 0.9999999999999999)
  14647. inner elaboration loop at bottom goal.
  14648. Retracting rl*prefer*rvt*predict-no*H0*6
  14649. -->
  14650. (S1 ^operator O2000 = 0.9999999999999999)
  14651. Retracting rl*prefer*rvt*predict-yes*H0*5
  14652. -->
  14653. (S1 ^operator O1999 = 0.)
  14654. --- END Proposal Phase ---
  14655. --- Decision Phase ---
  14656. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675416 -0.491003 0.184413(R,m,v=1,0.900585,0.0900585)
  14657. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324578 0.491005 0.815582 -> 0.324579 0.491004 0.815583(R,m,v=1,1,0)
  14658. =>WM: (14029: S1 ^operator O2002)
  14659. 1001: O: O2002 (predict-no)
  14660. --- END Decision Phase ---
  14661. --- Application Phase ---
  14662. --- Firing Productions (PE) For State At Depth 1 ---
  14663. --- Inner Elaboration Phase, active level 1 (S1) ---
  14664. Firing apply*operator
  14665. -->
  14666. (I3 ^predict-no N1001 + :O )
  14667. Firing apply*operator*complete
  14668. -->
  14669. (I3 ^predict-yes N1000 - :O )
  14670. inner elaboration loop at bottom goal.
  14671. --- Change Working Memory (PE) ---
  14672. =>WM: (14030: I3 ^predict-no N1001)
  14673. <=WM: (14017: N1000 ^status complete)
  14674. <=WM: (14016: I3 ^predict-yes N1000)
  14675. --- Firing Productions (IE) For State At Depth 1 ---
  14676. --- Inner Elaboration Phase, active level 1 (S1) ---
  14677. Firing monitor*world
  14678. -->
  14679. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14680. --- Change Working Memory (IE) ---
  14681. --- END Application Phase ---
  14682. --- Output Phase ---
  14683. ENV: Agent did: predict-no for direction U in state State-B
  14684. In State-B moving U
  14685. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14686. predict error 0
  14687. dir: dir isL
  14688. --- END Output Phase ---
  14689. /--- Input Phase ---
  14690. =>WM: (14034: I2 ^dir L)
  14691. =>WM: (14033: I2 ^reward 1)
  14692. =>WM: (14032: I2 ^see 0)
  14693. =>WM: (14031: N1001 ^status complete)
  14694. <=WM: (14020: I2 ^dir U)
  14695. <=WM: (14019: I2 ^reward 1)
  14696. <=WM: (14018: I2 ^see 1)
  14697. =>WM: (14035: I2 ^level-1 R1-root)
  14698. <=WM: (14021: I2 ^level-1 R1-root)
  14699. --- END Input Phase ---
  14700. --- Proposal Phase ---
  14701. --- Inner Elaboration Phase, active level 1 (S1) ---
  14702. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  14703. -->
  14704. (S1 ^operator O2001 = 0.6104596086348102)
  14705. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  14706. -->
  14707. (S1 ^operator O2002 = 0.2714993082286609)
  14708. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14709. -->
  14710. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14711. -->
  14712. Firing elaborate*copy-see-to-output-link
  14713. -->
  14714. (I3 ^see 0 +)
  14715. Firing elaborate*reward*based*on*reward
  14716. -->
  14717. (R1005 ^value 1 +)
  14718. (R1 ^reward R1005 +)
  14719. Firing propose*predict-yes
  14720. -->
  14721. (O2003 ^name predict-yes +)
  14722. (S1 ^operator O2003 +)
  14723. Firing propose*predict-no
  14724. -->
  14725. (O2004 ^name predict-no +)
  14726. (S1 ^operator O2004 +)
  14727. Firing rl*prefer*rvt*predict-no*H0*2
  14728. -->
  14729. (S1 ^operator O2002 = 0.3873370065427176)
  14730. Firing rl*prefer*rvt*predict-yes*H0*1
  14731. -->
  14732. (S1 ^operator O2001 = 0.3895395552683104)
  14733. Firing prefer*rvt*predict-yes*H0
  14734. -->
  14735. Firing prefer*rvt*predict-no*H0
  14736. -->
  14737. Firing elaborate*copy-dir-to-output-link
  14738. -->
  14739. (I3 ^dir L +)
  14740. inner elaboration loop at bottom goal.
  14741. Retracting elaborate*copy-see-to-output-link
  14742. -->
  14743. (I3 ^see 1 +)
  14744. Retracting propose*predict-no
  14745. -->
  14746. (O2002 ^name predict-no +)
  14747. (S1 ^operator O2002 +)
  14748. Retracting propose*predict-yes
  14749. -->
  14750. (O2001 ^name predict-yes +)
  14751. (S1 ^operator O2001 +)
  14752. Retracting elaborate*reward*based*on*reward
  14753. -->
  14754. (R1004 ^value 1 +)
  14755. (R1 ^reward R1004 +)
  14756. Retracting elaborate*copy-dir-to-output-link
  14757. -->
  14758. (I3 ^dir U +)
  14759. Retracting rl*prefer*rvt*predict-no*H0*6
  14760. -->
  14761. (S1 ^operator O2002 = 0.9999999999999999)
  14762. Retracting rl*prefer*rvt*predict-yes*H0*5
  14763. -->
  14764. (S1 ^operator O2001 = 0.)
  14765. =>WM: (14043: S1 ^operator O2004 +)
  14766. =>WM: (14042: S1 ^operator O2003 +)
  14767. =>WM: (14041: I3 ^dir L)
  14768. =>WM: (14040: O2004 ^name predict-no)
  14769. =>WM: (14039: O2003 ^name predict-yes)
  14770. =>WM: (14038: R1005 ^value 1)
  14771. =>WM: (14037: R1 ^reward R1005)
  14772. =>WM: (14036: I3 ^see 0)
  14773. <=WM: (14027: S1 ^operator O2001 +)
  14774. <=WM: (14028: S1 ^operator O2002 +)
  14775. <=WM: (14029: S1 ^operator O2002)
  14776. <=WM: (14026: I3 ^dir U)
  14777. <=WM: (14022: R1 ^reward R1004)
  14778. <=WM: (14007: I3 ^see 1)
  14779. <=WM: (14025: O2002 ^name predict-no)
  14780. <=WM: (14024: O2001 ^name predict-yes)
  14781. <=WM: (14023: R1004 ^value 1)
  14782. --- Inner Elaboration Phase, active level 1 (S1) ---
  14783. Firing prefer*rvt*predict-yes*H0
  14784. -->
  14785. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  14786. -->
  14787. (S1 ^operator O2003 = 0.6104596086348102)
  14788. Firing rl*prefer*rvt*predict-yes*H0*1
  14789. -->
  14790. (S1 ^operator O2003 = 0.3895395552683104)
  14791. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14792. -->
  14793. Firing prefer*rvt*predict-no*H0
  14794. -->
  14795. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  14796. -->
  14797. (S1 ^operator O2004 = 0.2714993082286609)
  14798. Firing rl*prefer*rvt*predict-no*H0*2
  14799. -->
  14800. (S1 ^operator O2004 = 0.3873370065427176)
  14801. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14802. -->
  14803. inner elaboration loop at bottom goal.
  14804. Retracting rl*prefer*rvt*predict-no*H0*2
  14805. -->
  14806. (S1 ^operator O2002 = 0.3873370065427176)
  14807. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  14808. -->
  14809. (S1 ^operator O2002 = 0.2714993082286609)
  14810. Retracting rl*prefer*rvt*predict-yes*H0*1
  14811. -->
  14812. (S1 ^operator O2001 = 0.3895395552683104)
  14813. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  14814. -->
  14815. (S1 ^operator O2001 = 0.6104596086348102)
  14816. --- END Proposal Phase ---
  14817. --- Decision Phase ---
  14818. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14819. =>WM: (14044: S1 ^operator O2003)
  14820. 1002: O: O2003 (predict-yes)
  14821. --- END Decision Phase ---
  14822. --- Application Phase ---
  14823. --- Firing Productions (PE) For State At Depth 1 ---
  14824. --- Inner Elaboration Phase, active level 1 (S1) ---
  14825. Firing apply*operator
  14826. -->
  14827. (I3 ^predict-yes N1002 + :O )
  14828. Firing apply*operator*complete
  14829. -->
  14830. (I3 ^predict-no N1001 - :O )
  14831. inner elaboration loop at bottom goal.
  14832. --- Change Working Memory (PE) ---
  14833. =>WM: (14045: I3 ^predict-yes N1002)
  14834. <=WM: (14031: N1001 ^status complete)
  14835. <=WM: (14030: I3 ^predict-no N1001)
  14836. --- Firing Productions (IE) For State At Depth 1 ---
  14837. --- Inner Elaboration Phase, active level 1 (S1) ---
  14838. Firing monitor*world
  14839. -->
  14840. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14841. --- Change Working Memory (IE) ---
  14842. --- END Application Phase ---
  14843. --- Output Phase ---
  14844. ENV: Agent did: predict-yes for direction L in state State-B
  14845. In State-B moving L
  14846. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14847. predict error 0
  14848. dir: dir isL
  14849. --- END Output Phase ---
  14850. |\-/--- Input Phase ---
  14851. =>WM: (14049: I2 ^dir L)
  14852. =>WM: (14048: I2 ^reward 1)
  14853. =>WM: (14047: I2 ^see 1)
  14854. =>WM: (14046: N1002 ^status complete)
  14855. <=WM: (14034: I2 ^dir L)
  14856. <=WM: (14033: I2 ^reward 1)
  14857. <=WM: (14032: I2 ^see 0)
  14858. =>WM: (14050: I2 ^level-1 L1-root)
  14859. <=WM: (14035: I2 ^level-1 R1-root)
  14860. --- END Input Phase ---
  14861. --- Proposal Phase ---
  14862. --- Inner Elaboration Phase, active level 1 (S1) ---
  14863. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  14864. -->
  14865. (S1 ^operator O2004 = 0.6126627914480096)
  14866. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  14867. -->
  14868. (S1 ^operator O2003 = -0.02274740735326741)
  14869. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14870. -->
  14871. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14872. -->
  14873. Firing elaborate*copy-see-to-output-link
  14874. -->
  14875. (I3 ^see 1 +)
  14876. Firing elaborate*reward*based*on*reward
  14877. -->
  14878. (R1006 ^value 1 +)
  14879. (R1 ^reward R1006 +)
  14880. Firing propose*predict-yes
  14881. -->
  14882. (O2005 ^name predict-yes +)
  14883. (S1 ^operator O2005 +)
  14884. Firing propose*predict-no
  14885. -->
  14886. (O2006 ^name predict-no +)
  14887. (S1 ^operator O2006 +)
  14888. Firing rl*prefer*rvt*predict-no*H0*2
  14889. -->
  14890. (S1 ^operator O2004 = 0.3873370065427176)
  14891. Firing rl*prefer*rvt*predict-yes*H0*1
  14892. -->
  14893. (S1 ^operator O2003 = 0.3895395552683104)
  14894. Firing prefer*rvt*predict-yes*H0
  14895. -->
  14896. Firing prefer*rvt*predict-no*H0
  14897. -->
  14898. Firing elaborate*copy-dir-to-output-link
  14899. -->
  14900. (I3 ^dir L +)
  14901. inner elaboration loop at bottom goal.
  14902. Retracting elaborate*copy-see-to-output-link
  14903. -->
  14904. (I3 ^see 0 +)
  14905. Retracting propose*predict-no
  14906. -->
  14907. (O2004 ^name predict-no +)
  14908. (S1 ^operator O2004 +)
  14909. Retracting propose*predict-yes
  14910. -->
  14911. (O2003 ^name predict-yes +)
  14912. (S1 ^operator O2003 +)
  14913. Retracting elaborate*reward*based*on*reward
  14914. -->
  14915. (R1005 ^value 1 +)
  14916. (R1 ^reward R1005 +)
  14917. Retracting elaborate*copy-dir-to-output-link
  14918. -->
  14919. (I3 ^dir L +)
  14920. Retracting rl*prefer*rvt*predict-no*H0*2
  14921. -->
  14922. (S1 ^operator O2004 = 0.3873370065427176)
  14923. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  14924. -->
  14925. (S1 ^operator O2004 = 0.2714993082286609)
  14926. Retracting rl*prefer*rvt*predict-yes*H0*1
  14927. -->
  14928. (S1 ^operator O2003 = 0.3895395552683104)
  14929. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  14930. -->
  14931. (S1 ^operator O2003 = 0.6104596086348102)
  14932. =>WM: (14057: S1 ^operator O2006 +)
  14933. =>WM: (14056: S1 ^operator O2005 +)
  14934. =>WM: (14055: O2006 ^name predict-no)
  14935. =>WM: (14054: O2005 ^name predict-yes)
  14936. =>WM: (14053: R1006 ^value 1)
  14937. =>WM: (14052: R1 ^reward R1006)
  14938. =>WM: (14051: I3 ^see 1)
  14939. <=WM: (14042: S1 ^operator O2003 +)
  14940. <=WM: (14044: S1 ^operator O2003)
  14941. <=WM: (14043: S1 ^operator O2004 +)
  14942. <=WM: (14037: R1 ^reward R1005)
  14943. <=WM: (14036: I3 ^see 0)
  14944. <=WM: (14040: O2004 ^name predict-no)
  14945. <=WM: (14039: O2003 ^name predict-yes)
  14946. <=WM: (14038: R1005 ^value 1)
  14947. --- Inner Elaboration Phase, active level 1 (S1) ---
  14948. Firing prefer*rvt*predict-yes*H0
  14949. -->
  14950. Firing rl*prefer*rvt*predict-yes*H0*1
  14951. -->
  14952. (S1 ^operator O2005 = 0.3895395552683104)
  14953. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  14954. -->
  14955. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  14956. -->
  14957. (S1 ^operator O2005 = -0.02274740735326741)
  14958. Firing prefer*rvt*predict-no*H0
  14959. -->
  14960. Firing rl*prefer*rvt*predict-no*H0*2
  14961. -->
  14962. (S1 ^operator O2006 = 0.3873370065427176)
  14963. Firing prefer*rvt*predict-no*H0*2*v1*H1
  14964. -->
  14965. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  14966. -->
  14967. (S1 ^operator O2006 = 0.6126627914480096)
  14968. inner elaboration loop at bottom goal.
  14969. Retracting rl*prefer*rvt*predict-no*H0*2
  14970. -->
  14971. (S1 ^operator O2004 = 0.3873370065427176)
  14972. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  14973. -->
  14974. (S1 ^operator O2004 = 0.6126627914480096)
  14975. Retracting rl*prefer*rvt*predict-yes*H0*1
  14976. -->
  14977. (S1 ^operator O2003 = 0.3895395552683104)
  14978. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  14979. -->
  14980. (S1 ^operator O2003 = -0.02274740735326741)
  14981. --- END Proposal Phase ---
  14982. --- Decision Phase ---
  14983. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.893491,0.0957312)
  14984. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  14985. =>WM: (14058: S1 ^operator O2006)
  14986. 1003: O: O2006 (predict-no)
  14987. --- END Decision Phase ---
  14988. --- Application Phase ---
  14989. --- Firing Productions (PE) For State At Depth 1 ---
  14990. --- Inner Elaboration Phase, active level 1 (S1) ---
  14991. Firing apply*operator
  14992. -->
  14993. (I3 ^predict-no N1003 + :O )
  14994. Firing apply*operator*complete
  14995. -->
  14996. (I3 ^predict-yes N1002 - :O )
  14997. inner elaboration loop at bottom goal.
  14998. --- Change Working Memory (PE) ---
  14999. =>WM: (14059: I3 ^predict-no N1003)
  15000. <=WM: (14046: N1002 ^status complete)
  15001. <=WM: (14045: I3 ^predict-yes N1002)
  15002. --- Firing Productions (IE) For State At Depth 1 ---
  15003. --- Inner Elaboration Phase, active level 1 (S1) ---
  15004. Firing monitor*world
  15005. -->
  15006. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15007. --- Change Working Memory (IE) ---
  15008. --- END Application Phase ---
  15009. --- Output Phase ---
  15010. ENV: Agent did: predict-no for direction L in state State-A
  15011. In State-A moving L
  15012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15013. predict error 0
  15014. dir: dir isR
  15015. --- END Output Phase ---
  15016. |\---- Input Phase ---
  15017. =>WM: (14063: I2 ^dir R)
  15018. =>WM: (14062: I2 ^reward 1)
  15019. =>WM: (14061: I2 ^see 0)
  15020. =>WM: (14060: N1003 ^status complete)
  15021. <=WM: (14049: I2 ^dir L)
  15022. <=WM: (14048: I2 ^reward 1)
  15023. <=WM: (14047: I2 ^see 1)
  15024. =>WM: (14064: I2 ^level-1 L0-root)
  15025. <=WM: (14050: I2 ^level-1 L1-root)
  15026. --- END Input Phase ---
  15027. --- Proposal Phase ---
  15028. --- Inner Elaboration Phase, active level 1 (S1) ---
  15029. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15030. -->
  15031. (S1 ^operator O2005 = 0.8155935357860071)
  15032. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15033. -->
  15034. (S1 ^operator O2006 = -0.00558448899823713)
  15035. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15036. -->
  15037. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15038. -->
  15039. Firing elaborate*copy-see-to-output-link
  15040. -->
  15041. (I3 ^see 0 +)
  15042. Firing elaborate*reward*based*on*reward
  15043. -->
  15044. (R1007 ^value 1 +)
  15045. (R1 ^reward R1007 +)
  15046. Firing propose*predict-yes
  15047. -->
  15048. (O2007 ^name predict-yes +)
  15049. (S1 ^operator O2007 +)
  15050. Firing propose*predict-no
  15051. -->
  15052. (O2008 ^name predict-no +)
  15053. (S1 ^operator O2008 +)
  15054. Firing rl*prefer*rvt*predict-no*H0*4
  15055. -->
  15056. (S1 ^operator O2006 = 0.4476193147022436)
  15057. Firing rl*prefer*rvt*predict-yes*H0*3
  15058. -->
  15059. (S1 ^operator O2005 = 0.1844128946449167)
  15060. Firing prefer*rvt*predict-yes*H0
  15061. -->
  15062. Firing prefer*rvt*predict-no*H0
  15063. -->
  15064. Firing elaborate*copy-dir-to-output-link
  15065. -->
  15066. (I3 ^dir R +)
  15067. inner elaboration loop at bottom goal.
  15068. Retracting elaborate*copy-see-to-output-link
  15069. -->
  15070. (I3 ^see 1 +)
  15071. Retracting propose*predict-no
  15072. -->
  15073. (O2006 ^name predict-no +)
  15074. (S1 ^operator O2006 +)
  15075. Retracting propose*predict-yes
  15076. -->
  15077. (O2005 ^name predict-yes +)
  15078. (S1 ^operator O2005 +)
  15079. Retracting elaborate*reward*based*on*reward
  15080. -->
  15081. (R1006 ^value 1 +)
  15082. (R1 ^reward R1006 +)
  15083. Retracting elaborate*copy-dir-to-output-link
  15084. -->
  15085. (I3 ^dir L +)
  15086. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  15087. -->
  15088. (S1 ^operator O2006 = 0.6126627914480096)
  15089. Retracting rl*prefer*rvt*predict-no*H0*2
  15090. -->
  15091. (S1 ^operator O2006 = 0.3873370065427176)
  15092. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  15093. -->
  15094. (S1 ^operator O2005 = -0.02274740735326741)
  15095. Retracting rl*prefer*rvt*predict-yes*H0*1
  15096. -->
  15097. (S1 ^operator O2005 = 0.3895396806828423)
  15098. =>WM: (14072: S1 ^operator O2008 +)
  15099. =>WM: (14071: S1 ^operator O2007 +)
  15100. =>WM: (14070: I3 ^dir R)
  15101. =>WM: (14069: O2008 ^name predict-no)
  15102. =>WM: (14068: O2007 ^name predict-yes)
  15103. =>WM: (14067: R1007 ^value 1)
  15104. =>WM: (14066: R1 ^reward R1007)
  15105. =>WM: (14065: I3 ^see 0)
  15106. <=WM: (14056: S1 ^operator O2005 +)
  15107. <=WM: (14057: S1 ^operator O2006 +)
  15108. <=WM: (14058: S1 ^operator O2006)
  15109. <=WM: (14041: I3 ^dir L)
  15110. <=WM: (14052: R1 ^reward R1006)
  15111. <=WM: (14051: I3 ^see 1)
  15112. <=WM: (14055: O2006 ^name predict-no)
  15113. <=WM: (14054: O2005 ^name predict-yes)
  15114. <=WM: (14053: R1006 ^value 1)
  15115. --- Inner Elaboration Phase, active level 1 (S1) ---
  15116. Firing prefer*rvt*predict-yes*H0
  15117. -->
  15118. Firing rl*prefer*rvt*predict-yes*H0*3
  15119. -->
  15120. (S1 ^operator O2007 = 0.1844128946449167)
  15121. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15122. -->
  15123. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15124. -->
  15125. (S1 ^operator O2007 = 0.8155935357860071)
  15126. Firing prefer*rvt*predict-no*H0
  15127. -->
  15128. Firing rl*prefer*rvt*predict-no*H0*4
  15129. -->
  15130. (S1 ^operator O2008 = 0.4476193147022436)
  15131. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15132. -->
  15133. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15134. -->
  15135. (S1 ^operator O2008 = -0.00558448899823713)
  15136. inner elaboration loop at bottom goal.
  15137. Retracting rl*prefer*rvt*predict-no*H0*4
  15138. -->
  15139. (S1 ^operator O2006 = 0.4476193147022436)
  15140. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15141. -->
  15142. (S1 ^operator O2006 = -0.00558448899823713)
  15143. Retracting rl*prefer*rvt*predict-yes*H0*3
  15144. -->
  15145. (S1 ^operator O2005 = 0.1844128946449167)
  15146. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15147. -->
  15148. (S1 ^operator O2005 = 0.8155935357860071)
  15149. --- END Proposal Phase ---
  15150. --- Decision Phase ---
  15151. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.932203,0.0635593)
  15152. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  15153. =>WM: (14073: S1 ^operator O2007)
  15154. 1004: O: O2007 (predict-yes)
  15155. --- END Decision Phase ---
  15156. --- Application Phase ---
  15157. --- Firing Productions (PE) For State At Depth 1 ---
  15158. --- Inner Elaboration Phase, active level 1 (S1) ---
  15159. Firing apply*operator
  15160. -->
  15161. (I3 ^predict-yes N1004 + :O )
  15162. Firing apply*operator*complete
  15163. -->
  15164. (I3 ^predict-no N1003 - :O )
  15165. inner elaboration loop at bottom goal.
  15166. --- Change Working Memory (PE) ---
  15167. =>WM: (14074: I3 ^predict-yes N1004)
  15168. <=WM: (14060: N1003 ^status complete)
  15169. <=WM: (14059: I3 ^predict-no N1003)
  15170. --- Firing Productions (IE) For State At Depth 1 ---
  15171. --- Inner Elaboration Phase, active level 1 (S1) ---
  15172. Firing monitor*world
  15173. -->
  15174. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15175. --- Change Working Memory (IE) ---
  15176. --- END Application Phase ---
  15177. --- Output Phase ---
  15178. ENV: Agent did: predict-yes for direction R in state State-A
  15179. In State-A moving R
  15180. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15181. predict error 0
  15182. dir: dir isU
  15183. --- END Output Phase ---
  15184. /|--- Input Phase ---
  15185. =>WM: (14078: I2 ^dir U)
  15186. =>WM: (14077: I2 ^reward 1)
  15187. =>WM: (14076: I2 ^see 1)
  15188. =>WM: (14075: N1004 ^status complete)
  15189. <=WM: (14063: I2 ^dir R)
  15190. <=WM: (14062: I2 ^reward 1)
  15191. <=WM: (14061: I2 ^see 0)
  15192. =>WM: (14079: I2 ^level-1 R1-root)
  15193. <=WM: (14064: I2 ^level-1 L0-root)
  15194. --- END Input Phase ---
  15195. --- Proposal Phase ---
  15196. --- Inner Elaboration Phase, active level 1 (S1) ---
  15197. Firing elaborate*copy-see-to-output-link
  15198. -->
  15199. (I3 ^see 1 +)
  15200. Firing elaborate*reward*based*on*reward
  15201. -->
  15202. (R1008 ^value 1 +)
  15203. (R1 ^reward R1008 +)
  15204. Firing propose*predict-yes
  15205. -->
  15206. (O2009 ^name predict-yes +)
  15207. (S1 ^operator O2009 +)
  15208. Firing propose*predict-no
  15209. -->
  15210. (O2010 ^name predict-no +)
  15211. (S1 ^operator O2010 +)
  15212. Firing rl*prefer*rvt*predict-no*H0*6
  15213. -->
  15214. (S1 ^operator O2008 = 0.9999999999999999)
  15215. Firing rl*prefer*rvt*predict-yes*H0*5
  15216. -->
  15217. (S1 ^operator O2007 = 0.)
  15218. Firing prefer*rvt*predict-yes*H0
  15219. -->
  15220. Firing prefer*rvt*predict-no*H0
  15221. -->
  15222. Firing elaborate*copy-dir-to-output-link
  15223. -->
  15224. (I3 ^dir U +)
  15225. inner elaboration loop at bottom goal.
  15226. Retracting elaborate*copy-see-to-output-link
  15227. -->
  15228. (I3 ^see 0 +)
  15229. Retracting propose*predict-no
  15230. -->
  15231. (O2008 ^name predict-no +)
  15232. (S1 ^operator O2008 +)
  15233. Retracting propose*predict-yes
  15234. -->
  15235. (O2007 ^name predict-yes +)
  15236. (S1 ^operator O2007 +)
  15237. Retracting elaborate*reward*based*on*reward
  15238. -->
  15239. (R1007 ^value 1 +)
  15240. (R1 ^reward R1007 +)
  15241. Retracting elaborate*copy-dir-to-output-link
  15242. -->
  15243. (I3 ^dir R +)
  15244. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15245. -->
  15246. (S1 ^operator O2008 = -0.00558448899823713)
  15247. Retracting rl*prefer*rvt*predict-no*H0*4
  15248. -->
  15249. (S1 ^operator O2008 = 0.4476193147022436)
  15250. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15251. -->
  15252. (S1 ^operator O2007 = 0.8155935357860071)
  15253. Retracting rl*prefer*rvt*predict-yes*H0*3
  15254. -->
  15255. (S1 ^operator O2007 = 0.1844128946449167)
  15256. =>WM: (14087: S1 ^operator O2010 +)
  15257. =>WM: (14086: S1 ^operator O2009 +)
  15258. =>WM: (14085: I3 ^dir U)
  15259. =>WM: (14084: O2010 ^name predict-no)
  15260. =>WM: (14083: O2009 ^name predict-yes)
  15261. =>WM: (14082: R1008 ^value 1)
  15262. =>WM: (14081: R1 ^reward R1008)
  15263. =>WM: (14080: I3 ^see 1)
  15264. <=WM: (14071: S1 ^operator O2007 +)
  15265. <=WM: (14073: S1 ^operator O2007)
  15266. <=WM: (14072: S1 ^operator O2008 +)
  15267. <=WM: (14070: I3 ^dir R)
  15268. <=WM: (14066: R1 ^reward R1007)
  15269. <=WM: (14065: I3 ^see 0)
  15270. <=WM: (14069: O2008 ^name predict-no)
  15271. <=WM: (14068: O2007 ^name predict-yes)
  15272. <=WM: (14067: R1007 ^value 1)
  15273. --- Inner Elaboration Phase, active level 1 (S1) ---
  15274. Firing prefer*rvt*predict-yes*H0
  15275. -->
  15276. Firing rl*prefer*rvt*predict-yes*H0*5
  15277. -->
  15278. (S1 ^operator O2009 = 0.)
  15279. Firing prefer*rvt*predict-no*H0
  15280. -->
  15281. Firing rl*prefer*rvt*predict-no*H0*6
  15282. -->
  15283. (S1 ^operator O2010 = 0.9999999999999999)
  15284. inner elaboration loop at bottom goal.
  15285. Retracting rl*prefer*rvt*predict-no*H0*6
  15286. -->
  15287. (S1 ^operator O2008 = 0.9999999999999999)
  15288. Retracting rl*prefer*rvt*predict-yes*H0*5
  15289. -->
  15290. (S1 ^operator O2007 = 0.)
  15291. --- END Proposal Phase ---
  15292. --- Decision Phase ---
  15293. RL update rl*prefer*rvt*predict-yes*H0*3 0.675416 -0.491003 0.184413 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.901163,0.0895893)
  15294. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324592 0.491001 0.815594 -> 0.324591 0.491001 0.815593(R,m,v=1,1,0)
  15295. =>WM: (14088: S1 ^operator O2010)
  15296. 1005: O: O2010 (predict-no)
  15297. --- END Decision Phase ---
  15298. --- Application Phase ---
  15299. --- Firing Productions (PE) For State At Depth 1 ---
  15300. --- Inner Elaboration Phase, active level 1 (S1) ---
  15301. Firing apply*operator
  15302. -->
  15303. (I3 ^predict-no N1005 + :O )
  15304. Firing apply*operator*complete
  15305. -->
  15306. (I3 ^predict-yes N1004 - :O )
  15307. inner elaboration loop at bottom goal.
  15308. --- Change Working Memory (PE) ---
  15309. =>WM: (14089: I3 ^predict-no N1005)
  15310. <=WM: (14075: N1004 ^status complete)
  15311. <=WM: (14074: I3 ^predict-yes N1004)
  15312. --- Firing Productions (IE) For State At Depth 1 ---
  15313. --- Inner Elaboration Phase, active level 1 (S1) ---
  15314. Firing monitor*world
  15315. -->
  15316. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15317. --- Change Working Memory (IE) ---
  15318. --- END Application Phase ---
  15319. --- Output Phase ---
  15320. ENV: Agent did: predict-no for direction U in state State-B
  15321. In State-B moving U
  15322. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15323. predict error 0
  15324. dir: dir isL
  15325. --- END Output Phase ---
  15326. \-/|--- Input Phase ---
  15327. =>WM: (14093: I2 ^dir L)
  15328. =>WM: (14092: I2 ^reward 1)
  15329. =>WM: (14091: I2 ^see 0)
  15330. =>WM: (14090: N1005 ^status complete)
  15331. <=WM: (14078: I2 ^dir U)
  15332. <=WM: (14077: I2 ^reward 1)
  15333. <=WM: (14076: I2 ^see 1)
  15334. =>WM: (14094: I2 ^level-1 R1-root)
  15335. <=WM: (14079: I2 ^level-1 R1-root)
  15336. --- END Input Phase ---
  15337. --- Proposal Phase ---
  15338. --- Inner Elaboration Phase, active level 1 (S1) ---
  15339. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  15340. -->
  15341. (S1 ^operator O2009 = 0.6104597340493421)
  15342. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  15343. -->
  15344. (S1 ^operator O2010 = 0.2714993082286609)
  15345. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15346. -->
  15347. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15348. -->
  15349. Firing elaborate*copy-see-to-output-link
  15350. -->
  15351. (I3 ^see 0 +)
  15352. Firing elaborate*reward*based*on*reward
  15353. -->
  15354. (R1009 ^value 1 +)
  15355. (R1 ^reward R1009 +)
  15356. Firing propose*predict-yes
  15357. -->
  15358. (O2011 ^name predict-yes +)
  15359. (S1 ^operator O2011 +)
  15360. Firing propose*predict-no
  15361. -->
  15362. (O2012 ^name predict-no +)
  15363. (S1 ^operator O2012 +)
  15364. Firing rl*prefer*rvt*predict-no*H0*2
  15365. -->
  15366. (S1 ^operator O2010 = 0.3873370368441085)
  15367. Firing rl*prefer*rvt*predict-yes*H0*1
  15368. -->
  15369. (S1 ^operator O2009 = 0.3895396806828423)
  15370. Firing prefer*rvt*predict-yes*H0
  15371. -->
  15372. Firing prefer*rvt*predict-no*H0
  15373. -->
  15374. Firing elaborate*copy-dir-to-output-link
  15375. -->
  15376. (I3 ^dir L +)
  15377. inner elaboration loop at bottom goal.
  15378. Retracting elaborate*copy-see-to-output-link
  15379. -->
  15380. (I3 ^see 1 +)
  15381. Retracting propose*predict-no
  15382. -->
  15383. (O2010 ^name predict-no +)
  15384. (S1 ^operator O2010 +)
  15385. Retracting propose*predict-yes
  15386. -->
  15387. (O2009 ^name predict-yes +)
  15388. (S1 ^operator O2009 +)
  15389. Retracting elaborate*reward*based*on*reward
  15390. -->
  15391. (R1008 ^value 1 +)
  15392. (R1 ^reward R1008 +)
  15393. Retracting elaborate*copy-dir-to-output-link
  15394. -->
  15395. (I3 ^dir U +)
  15396. Retracting rl*prefer*rvt*predict-no*H0*6
  15397. -->
  15398. (S1 ^operator O2010 = 0.9999999999999999)
  15399. Retracting rl*prefer*rvt*predict-yes*H0*5
  15400. -->
  15401. (S1 ^operator O2009 = 0.)
  15402. =>WM: (14102: S1 ^operator O2012 +)
  15403. =>WM: (14101: S1 ^operator O2011 +)
  15404. =>WM: (14100: I3 ^dir L)
  15405. =>WM: (14099: O2012 ^name predict-no)
  15406. =>WM: (14098: O2011 ^name predict-yes)
  15407. =>WM: (14097: R1009 ^value 1)
  15408. =>WM: (14096: R1 ^reward R1009)
  15409. =>WM: (14095: I3 ^see 0)
  15410. <=WM: (14086: S1 ^operator O2009 +)
  15411. <=WM: (14087: S1 ^operator O2010 +)
  15412. <=WM: (14088: S1 ^operator O2010)
  15413. <=WM: (14085: I3 ^dir U)
  15414. <=WM: (14081: R1 ^reward R1008)
  15415. <=WM: (14080: I3 ^see 1)
  15416. <=WM: (14084: O2010 ^name predict-no)
  15417. <=WM: (14083: O2009 ^name predict-yes)
  15418. <=WM: (14082: R1008 ^value 1)
  15419. --- Inner Elaboration Phase, active level 1 (S1) ---
  15420. Firing prefer*rvt*predict-yes*H0
  15421. -->
  15422. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  15423. -->
  15424. (S1 ^operator O2011 = 0.6104597340493421)
  15425. Firing rl*prefer*rvt*predict-yes*H0*1
  15426. -->
  15427. (S1 ^operator O2011 = 0.3895396806828423)
  15428. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15429. -->
  15430. Firing prefer*rvt*predict-no*H0
  15431. -->
  15432. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  15433. -->
  15434. (S1 ^operator O2012 = 0.2714993082286609)
  15435. Firing rl*prefer*rvt*predict-no*H0*2
  15436. -->
  15437. (S1 ^operator O2012 = 0.3873370368441085)
  15438. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15439. -->
  15440. inner elaboration loop at bottom goal.
  15441. Retracting rl*prefer*rvt*predict-no*H0*2
  15442. -->
  15443. (S1 ^operator O2010 = 0.3873370368441085)
  15444. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  15445. -->
  15446. (S1 ^operator O2010 = 0.2714993082286609)
  15447. Retracting rl*prefer*rvt*predict-yes*H0*1
  15448. -->
  15449. (S1 ^operator O2009 = 0.3895396806828423)
  15450. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  15451. -->
  15452. (S1 ^operator O2009 = 0.6104597340493421)
  15453. --- END Proposal Phase ---
  15454. --- Decision Phase ---
  15455. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15456. =>WM: (14103: S1 ^operator O2011)
  15457. 1006: O: O2011 (predict-yes)
  15458. --- END Decision Phase ---
  15459. --- Application Phase ---
  15460. --- Firing Productions (PE) For State At Depth 1 ---
  15461. --- Inner Elaboration Phase, active level 1 (S1) ---
  15462. Firing apply*operator
  15463. -->
  15464. (I3 ^predict-yes N1006 + :O )
  15465. Firing apply*operator*complete
  15466. -->
  15467. (I3 ^predict-no N1005 - :O )
  15468. inner elaboration loop at bottom goal.
  15469. --- Change Working Memory (PE) ---
  15470. =>WM: (14104: I3 ^predict-yes N1006)
  15471. <=WM: (14090: N1005 ^status complete)
  15472. <=WM: (14089: I3 ^predict-no N1005)
  15473. --- Firing Productions (IE) For State At Depth 1 ---
  15474. --- Inner Elaboration Phase, active level 1 (S1) ---
  15475. Firing monitor*world
  15476. -->
  15477. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15478. --- Change Working Memory (IE) ---
  15479. --- END Application Phase ---
  15480. --- Output Phase ---
  15481. ENV: Agent did: predict-yes for direction L in state State-B
  15482. In State-B moving L
  15483. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15484. predict error 0
  15485. dir: dir isU
  15486. --- END Output Phase ---
  15487. \-/--- Input Phase ---
  15488. =>WM: (14108: I2 ^dir U)
  15489. =>WM: (14107: I2 ^reward 1)
  15490. =>WM: (14106: I2 ^see 1)
  15491. =>WM: (14105: N1006 ^status complete)
  15492. <=WM: (14093: I2 ^dir L)
  15493. <=WM: (14092: I2 ^reward 1)
  15494. <=WM: (14091: I2 ^see 0)
  15495. =>WM: (14109: I2 ^level-1 L1-root)
  15496. <=WM: (14094: I2 ^level-1 R1-root)
  15497. --- END Input Phase ---
  15498. --- Proposal Phase ---
  15499. --- Inner Elaboration Phase, active level 1 (S1) ---
  15500. Firing elaborate*copy-see-to-output-link
  15501. -->
  15502. (I3 ^see 1 +)
  15503. Firing elaborate*reward*based*on*reward
  15504. -->
  15505. (R1010 ^value 1 +)
  15506. (R1 ^reward R1010 +)
  15507. Firing propose*predict-yes
  15508. -->
  15509. (O2013 ^name predict-yes +)
  15510. (S1 ^operator O2013 +)
  15511. Firing propose*predict-no
  15512. -->
  15513. (O2014 ^name predict-no +)
  15514. (S1 ^operator O2014 +)
  15515. Firing rl*prefer*rvt*predict-no*H0*6
  15516. -->
  15517. (S1 ^operator O2012 = 0.9999999999999999)
  15518. Firing rl*prefer*rvt*predict-yes*H0*5
  15519. -->
  15520. (S1 ^operator O2011 = 0.)
  15521. Firing prefer*rvt*predict-yes*H0
  15522. -->
  15523. Firing prefer*rvt*predict-no*H0
  15524. -->
  15525. Firing elaborate*copy-dir-to-output-link
  15526. -->
  15527. (I3 ^dir U +)
  15528. inner elaboration loop at bottom goal.
  15529. Retracting elaborate*copy-see-to-output-link
  15530. -->
  15531. (I3 ^see 0 +)
  15532. Retracting propose*predict-no
  15533. -->
  15534. (O2012 ^name predict-no +)
  15535. (S1 ^operator O2012 +)
  15536. Retracting propose*predict-yes
  15537. -->
  15538. (O2011 ^name predict-yes +)
  15539. (S1 ^operator O2011 +)
  15540. Retracting elaborate*reward*based*on*reward
  15541. -->
  15542. (R1009 ^value 1 +)
  15543. (R1 ^reward R1009 +)
  15544. Retracting elaborate*copy-dir-to-output-link
  15545. -->
  15546. (I3 ^dir L +)
  15547. Retracting rl*prefer*rvt*predict-no*H0*2
  15548. -->
  15549. (S1 ^operator O2012 = 0.3873370368441085)
  15550. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  15551. -->
  15552. (S1 ^operator O2012 = 0.2714993082286609)
  15553. Retracting rl*prefer*rvt*predict-yes*H0*1
  15554. -->
  15555. (S1 ^operator O2011 = 0.3895396806828423)
  15556. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  15557. -->
  15558. (S1 ^operator O2011 = 0.6104597340493421)
  15559. =>WM: (14117: S1 ^operator O2014 +)
  15560. =>WM: (14116: S1 ^operator O2013 +)
  15561. =>WM: (14115: I3 ^dir U)
  15562. =>WM: (14114: O2014 ^name predict-no)
  15563. =>WM: (14113: O2013 ^name predict-yes)
  15564. =>WM: (14112: R1010 ^value 1)
  15565. =>WM: (14111: R1 ^reward R1010)
  15566. =>WM: (14110: I3 ^see 1)
  15567. <=WM: (14101: S1 ^operator O2011 +)
  15568. <=WM: (14103: S1 ^operator O2011)
  15569. <=WM: (14102: S1 ^operator O2012 +)
  15570. <=WM: (14100: I3 ^dir L)
  15571. <=WM: (14096: R1 ^reward R1009)
  15572. <=WM: (14095: I3 ^see 0)
  15573. <=WM: (14099: O2012 ^name predict-no)
  15574. <=WM: (14098: O2011 ^name predict-yes)
  15575. <=WM: (14097: R1009 ^value 1)
  15576. --- Inner Elaboration Phase, active level 1 (S1) ---
  15577. Firing prefer*rvt*predict-yes*H0
  15578. -->
  15579. Firing rl*prefer*rvt*predict-yes*H0*5
  15580. -->
  15581. (S1 ^operator O2013 = 0.)
  15582. Firing prefer*rvt*predict-no*H0
  15583. -->
  15584. Firing rl*prefer*rvt*predict-no*H0*6
  15585. -->
  15586. (S1 ^operator O2014 = 0.9999999999999999)
  15587. inner elaboration loop at bottom goal.
  15588. Retracting rl*prefer*rvt*predict-no*H0*6
  15589. -->
  15590. (S1 ^operator O2012 = 0.9999999999999999)
  15591. Retracting rl*prefer*rvt*predict-yes*H0*5
  15592. -->
  15593. (S1 ^operator O2011 = 0.)
  15594. --- END Proposal Phase ---
  15595. --- Decision Phase ---
  15596. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.894118,0.0952315)
  15597. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  15598. =>WM: (14118: S1 ^operator O2014)
  15599. 1007: O: O2014 (predict-no)
  15600. --- END Decision Phase ---
  15601. --- Application Phase ---
  15602. --- Firing Productions (PE) For State At Depth 1 ---
  15603. --- Inner Elaboration Phase, active level 1 (S1) ---
  15604. Firing apply*operator
  15605. -->
  15606. (I3 ^predict-no N1007 + :O )
  15607. Firing apply*operator*complete
  15608. -->
  15609. (I3 ^predict-yes N1006 - :O )
  15610. inner elaboration loop at bottom goal.
  15611. --- Change Working Memory (PE) ---
  15612. =>WM: (14119: I3 ^predict-no N1007)
  15613. <=WM: (14105: N1006 ^status complete)
  15614. <=WM: (14104: I3 ^predict-yes N1006)
  15615. --- Firing Productions (IE) For State At Depth 1 ---
  15616. --- Inner Elaboration Phase, active level 1 (S1) ---
  15617. Firing monitor*world
  15618. -->
  15619. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15620. --- Change Working Memory (IE) ---
  15621. --- END Application Phase ---
  15622. --- Output Phase ---
  15623. ENV: Agent did: predict-no for direction U in state State-A
  15624. In State-A moving U
  15625. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15626. predict error 0
  15627. dir: dir isL
  15628. --- END Output Phase ---
  15629. |\---- Input Phase ---
  15630. =>WM: (14123: I2 ^dir L)
  15631. =>WM: (14122: I2 ^reward 1)
  15632. =>WM: (14121: I2 ^see 0)
  15633. =>WM: (14120: N1007 ^status complete)
  15634. <=WM: (14108: I2 ^dir U)
  15635. <=WM: (14107: I2 ^reward 1)
  15636. <=WM: (14106: I2 ^see 1)
  15637. =>WM: (14124: I2 ^level-1 L1-root)
  15638. <=WM: (14109: I2 ^level-1 L1-root)
  15639. --- END Input Phase ---
  15640. --- Proposal Phase ---
  15641. --- Inner Elaboration Phase, active level 1 (S1) ---
  15642. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  15643. -->
  15644. (S1 ^operator O2014 = 0.6126628217494006)
  15645. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  15646. -->
  15647. (S1 ^operator O2013 = -0.02274740735326741)
  15648. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15649. -->
  15650. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15651. -->
  15652. Firing elaborate*copy-see-to-output-link
  15653. -->
  15654. (I3 ^see 0 +)
  15655. Firing elaborate*reward*based*on*reward
  15656. -->
  15657. (R1011 ^value 1 +)
  15658. (R1 ^reward R1011 +)
  15659. Firing propose*predict-yes
  15660. -->
  15661. (O2015 ^name predict-yes +)
  15662. (S1 ^operator O2015 +)
  15663. Firing propose*predict-no
  15664. -->
  15665. (O2016 ^name predict-no +)
  15666. (S1 ^operator O2016 +)
  15667. Firing rl*prefer*rvt*predict-no*H0*2
  15668. -->
  15669. (S1 ^operator O2014 = 0.3873370368441085)
  15670. Firing rl*prefer*rvt*predict-yes*H0*1
  15671. -->
  15672. (S1 ^operator O2013 = 0.3895397684730147)
  15673. Firing prefer*rvt*predict-yes*H0
  15674. -->
  15675. Firing prefer*rvt*predict-no*H0
  15676. -->
  15677. Firing elaborate*copy-dir-to-output-link
  15678. -->
  15679. (I3 ^dir L +)
  15680. inner elaboration loop at bottom goal.
  15681. Retracting elaborate*copy-see-to-output-link
  15682. -->
  15683. (I3 ^see 1 +)
  15684. Retracting propose*predict-no
  15685. -->
  15686. (O2014 ^name predict-no +)
  15687. (S1 ^operator O2014 +)
  15688. Retracting propose*predict-yes
  15689. -->
  15690. (O2013 ^name predict-yes +)
  15691. (S1 ^operator O2013 +)
  15692. Retracting elaborate*reward*based*on*reward
  15693. -->
  15694. (R1010 ^value 1 +)
  15695. (R1 ^reward R1010 +)
  15696. Retracting elaborate*copy-dir-to-output-link
  15697. -->
  15698. (I3 ^dir U +)
  15699. Retracting rl*prefer*rvt*predict-no*H0*6
  15700. -->
  15701. (S1 ^operator O2014 = 0.9999999999999999)
  15702. Retracting rl*prefer*rvt*predict-yes*H0*5
  15703. -->
  15704. (S1 ^operator O2013 = 0.)
  15705. =>WM: (14132: S1 ^operator O2016 +)
  15706. =>WM: (14131: S1 ^operator O2015 +)
  15707. =>WM: (14130: I3 ^dir L)
  15708. =>WM: (14129: O2016 ^name predict-no)
  15709. =>WM: (14128: O2015 ^name predict-yes)
  15710. =>WM: (14127: R1011 ^value 1)
  15711. =>WM: (14126: R1 ^reward R1011)
  15712. =>WM: (14125: I3 ^see 0)
  15713. <=WM: (14116: S1 ^operator O2013 +)
  15714. <=WM: (14117: S1 ^operator O2014 +)
  15715. <=WM: (14118: S1 ^operator O2014)
  15716. <=WM: (14115: I3 ^dir U)
  15717. <=WM: (14111: R1 ^reward R1010)
  15718. <=WM: (14110: I3 ^see 1)
  15719. <=WM: (14114: O2014 ^name predict-no)
  15720. <=WM: (14113: O2013 ^name predict-yes)
  15721. <=WM: (14112: R1010 ^value 1)
  15722. --- Inner Elaboration Phase, active level 1 (S1) ---
  15723. Firing prefer*rvt*predict-yes*H0
  15724. -->
  15725. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  15726. -->
  15727. (S1 ^operator O2015 = -0.02274740735326741)
  15728. Firing rl*prefer*rvt*predict-yes*H0*1
  15729. -->
  15730. (S1 ^operator O2015 = 0.3895397684730147)
  15731. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15732. -->
  15733. Firing prefer*rvt*predict-no*H0
  15734. -->
  15735. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  15736. -->
  15737. (S1 ^operator O2016 = 0.6126628217494006)
  15738. Firing rl*prefer*rvt*predict-no*H0*2
  15739. -->
  15740. (S1 ^operator O2016 = 0.3873370368441085)
  15741. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15742. -->
  15743. inner elaboration loop at bottom goal.
  15744. Retracting rl*prefer*rvt*predict-no*H0*2
  15745. -->
  15746. (S1 ^operator O2014 = 0.3873370368441085)
  15747. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  15748. -->
  15749. (S1 ^operator O2014 = 0.6126628217494006)
  15750. Retracting rl*prefer*rvt*predict-yes*H0*1
  15751. -->
  15752. (S1 ^operator O2013 = 0.3895397684730147)
  15753. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  15754. -->
  15755. (S1 ^operator O2013 = -0.02274740735326741)
  15756. --- END Proposal Phase ---
  15757. --- Decision Phase ---
  15758. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15759. =>WM: (14133: S1 ^operator O2016)
  15760. 1008: O: O2016 (predict-no)
  15761. --- END Decision Phase ---
  15762. --- Application Phase ---
  15763. --- Firing Productions (PE) For State At Depth 1 ---
  15764. --- Inner Elaboration Phase, active level 1 (S1) ---
  15765. Firing apply*operator
  15766. -->
  15767. (I3 ^predict-no N1008 + :O )
  15768. Firing apply*operator*complete
  15769. -->
  15770. (I3 ^predict-no N1007 - :O )
  15771. inner elaboration loop at bottom goal.
  15772. --- Change Working Memory (PE) ---
  15773. =>WM: (14134: I3 ^predict-no N1008)
  15774. <=WM: (14120: N1007 ^status complete)
  15775. <=WM: (14119: I3 ^predict-no N1007)
  15776. --- Firing Productions (IE) For State At Depth 1 ---
  15777. --- Inner Elaboration Phase, active level 1 (S1) ---
  15778. Firing monitor*world
  15779. -->
  15780. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15781. --- Change Working Memory (IE) ---
  15782. --- END Application Phase ---
  15783. --- Output Phase ---
  15784. ENV: Agent did: predict-no for direction L in state State-A
  15785. In State-A moving L
  15786. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15787. predict error 0
  15788. dir: dir isU
  15789. --- END Output Phase ---
  15790. /|\--- Input Phase ---
  15791. =>WM: (14138: I2 ^dir U)
  15792. =>WM: (14137: I2 ^reward 1)
  15793. =>WM: (14136: I2 ^see 0)
  15794. =>WM: (14135: N1008 ^status complete)
  15795. <=WM: (14123: I2 ^dir L)
  15796. <=WM: (14122: I2 ^reward 1)
  15797. <=WM: (14121: I2 ^see 0)
  15798. =>WM: (14139: I2 ^level-1 L0-root)
  15799. <=WM: (14124: I2 ^level-1 L1-root)
  15800. --- END Input Phase ---
  15801. --- Proposal Phase ---
  15802. --- Inner Elaboration Phase, active level 1 (S1) ---
  15803. Firing elaborate*copy-see-to-output-link
  15804. -->
  15805. (I3 ^see 0 +)
  15806. Firing elaborate*reward*based*on*reward
  15807. -->
  15808. (R1012 ^value 1 +)
  15809. (R1 ^reward R1012 +)
  15810. Firing propose*predict-yes
  15811. -->
  15812. (O2017 ^name predict-yes +)
  15813. (S1 ^operator O2017 +)
  15814. Firing propose*predict-no
  15815. -->
  15816. (O2018 ^name predict-no +)
  15817. (S1 ^operator O2018 +)
  15818. Firing rl*prefer*rvt*predict-no*H0*6
  15819. -->
  15820. (S1 ^operator O2016 = 0.9999999999999999)
  15821. Firing rl*prefer*rvt*predict-yes*H0*5
  15822. -->
  15823. (S1 ^operator O2015 = 0.)
  15824. Firing prefer*rvt*predict-yes*H0
  15825. -->
  15826. Firing prefer*rvt*predict-no*H0
  15827. -->
  15828. Firing elaborate*copy-dir-to-output-link
  15829. -->
  15830. (I3 ^dir U +)
  15831. inner elaboration loop at bottom goal.
  15832. Retracting elaborate*copy-see-to-output-link
  15833. -->
  15834. (I3 ^see 0 +)
  15835. Retracting propose*predict-no
  15836. -->
  15837. (O2016 ^name predict-no +)
  15838. (S1 ^operator O2016 +)
  15839. Retracting propose*predict-yes
  15840. -->
  15841. (O2015 ^name predict-yes +)
  15842. (S1 ^operator O2015 +)
  15843. Retracting elaborate*reward*based*on*reward
  15844. -->
  15845. (R1011 ^value 1 +)
  15846. (R1 ^reward R1011 +)
  15847. Retracting elaborate*copy-dir-to-output-link
  15848. -->
  15849. (I3 ^dir L +)
  15850. Retracting rl*prefer*rvt*predict-no*H0*2
  15851. -->
  15852. (S1 ^operator O2016 = 0.3873370368441085)
  15853. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  15854. -->
  15855. (S1 ^operator O2016 = 0.6126628217494006)
  15856. Retracting rl*prefer*rvt*predict-yes*H0*1
  15857. -->
  15858. (S1 ^operator O2015 = 0.3895397684730147)
  15859. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  15860. -->
  15861. (S1 ^operator O2015 = -0.02274740735326741)
  15862. =>WM: (14146: S1 ^operator O2018 +)
  15863. =>WM: (14145: S1 ^operator O2017 +)
  15864. =>WM: (14144: I3 ^dir U)
  15865. =>WM: (14143: O2018 ^name predict-no)
  15866. =>WM: (14142: O2017 ^name predict-yes)
  15867. =>WM: (14141: R1012 ^value 1)
  15868. =>WM: (14140: R1 ^reward R1012)
  15869. <=WM: (14131: S1 ^operator O2015 +)
  15870. <=WM: (14132: S1 ^operator O2016 +)
  15871. <=WM: (14133: S1 ^operator O2016)
  15872. <=WM: (14130: I3 ^dir L)
  15873. <=WM: (14126: R1 ^reward R1011)
  15874. <=WM: (14129: O2016 ^name predict-no)
  15875. <=WM: (14128: O2015 ^name predict-yes)
  15876. <=WM: (14127: R1011 ^value 1)
  15877. --- Inner Elaboration Phase, active level 1 (S1) ---
  15878. Firing prefer*rvt*predict-yes*H0
  15879. -->
  15880. Firing rl*prefer*rvt*predict-yes*H0*5
  15881. -->
  15882. (S1 ^operator O2017 = 0.)
  15883. Firing prefer*rvt*predict-no*H0
  15884. -->
  15885. Firing rl*prefer*rvt*predict-no*H0*6
  15886. -->
  15887. (S1 ^operator O2018 = 0.9999999999999999)
  15888. inner elaboration loop at bottom goal.
  15889. Retracting rl*prefer*rvt*predict-no*H0*6
  15890. -->
  15891. (S1 ^operator O2016 = 0.9999999999999999)
  15892. Retracting rl*prefer*rvt*predict-yes*H0*5
  15893. -->
  15894. (S1 ^operator O2015 = 0.)
  15895. --- END Proposal Phase ---
  15896. --- Decision Phase ---
  15897. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.719081 -0.331744 0.387337(R,m,v=1,0.932584,0.0632261)
  15898. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  15899. =>WM: (14147: S1 ^operator O2018)
  15900. 1009: O: O2018 (predict-no)
  15901. --- END Decision Phase ---
  15902. --- Application Phase ---
  15903. --- Firing Productions (PE) For State At Depth 1 ---
  15904. --- Inner Elaboration Phase, active level 1 (S1) ---
  15905. Firing apply*operator
  15906. -->
  15907. (I3 ^predict-no N1009 + :O )
  15908. Firing apply*operator*complete
  15909. -->
  15910. (I3 ^predict-no N1008 - :O )
  15911. inner elaboration loop at bottom goal.
  15912. --- Change Working Memory (PE) ---
  15913. =>WM: (14148: I3 ^predict-no N1009)
  15914. <=WM: (14135: N1008 ^status complete)
  15915. <=WM: (14134: I3 ^predict-no N1008)
  15916. --- Firing Productions (IE) For State At Depth 1 ---
  15917. --- Inner Elaboration Phase, active level 1 (S1) ---
  15918. Firing monitor*world
  15919. -->
  15920. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15921. --- Change Working Memory (IE) ---
  15922. --- END Application Phase ---
  15923. --- Output Phase ---
  15924. ENV: Agent did: predict-no for direction U in state State-A
  15925. In State-A moving U
  15926. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15927. predict error 0
  15928. dir: dir isL
  15929. --- END Output Phase ---
  15930. -/|\--- Input Phase ---
  15931. =>WM: (14152: I2 ^dir L)
  15932. =>WM: (14151: I2 ^reward 1)
  15933. =>WM: (14150: I2 ^see 0)
  15934. =>WM: (14149: N1009 ^status complete)
  15935. <=WM: (14138: I2 ^dir U)
  15936. <=WM: (14137: I2 ^reward 1)
  15937. <=WM: (14136: I2 ^see 0)
  15938. =>WM: (14153: I2 ^level-1 L0-root)
  15939. <=WM: (14139: I2 ^level-1 L0-root)
  15940. --- END Input Phase ---
  15941. --- Proposal Phase ---
  15942. --- Inner Elaboration Phase, active level 1 (S1) ---
  15943. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  15944. -->
  15945. (S1 ^operator O2017 = 0.1599599085218832)
  15946. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  15947. -->
  15948. (S1 ^operator O2018 = 0.6126679931585133)
  15949. Firing prefer*rvt*predict-no*H0*2*v1*H1
  15950. -->
  15951. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  15952. -->
  15953. Firing elaborate*copy-see-to-output-link
  15954. -->
  15955. (I3 ^see 0 +)
  15956. Firing elaborate*reward*based*on*reward
  15957. -->
  15958. (R1013 ^value 1 +)
  15959. (R1 ^reward R1013 +)
  15960. Firing propose*predict-yes
  15961. -->
  15962. (O2019 ^name predict-yes +)
  15963. (S1 ^operator O2019 +)
  15964. Firing propose*predict-no
  15965. -->
  15966. (O2020 ^name predict-no +)
  15967. (S1 ^operator O2020 +)
  15968. Firing rl*prefer*rvt*predict-no*H0*2
  15969. -->
  15970. (S1 ^operator O2018 = 0.3873370580550821)
  15971. Firing rl*prefer*rvt*predict-yes*H0*1
  15972. -->
  15973. (S1 ^operator O2017 = 0.3895397684730147)
  15974. Firing prefer*rvt*predict-yes*H0
  15975. -->
  15976. Firing prefer*rvt*predict-no*H0
  15977. -->
  15978. Firing elaborate*copy-dir-to-output-link
  15979. -->
  15980. (I3 ^dir L +)
  15981. inner elaboration loop at bottom goal.
  15982. Retracting elaborate*copy-see-to-output-link
  15983. -->
  15984. (I3 ^see 0 +)
  15985. Retracting propose*predict-no
  15986. -->
  15987. (O2018 ^name predict-no +)
  15988. (S1 ^operator O2018 +)
  15989. Retracting propose*predict-yes
  15990. -->
  15991. (O2017 ^name predict-yes +)
  15992. (S1 ^operator O2017 +)
  15993. Retracting elaborate*reward*based*on*reward
  15994. -->
  15995. (R1012 ^value 1 +)
  15996. (R1 ^reward R1012 +)
  15997. Retracting elaborate*copy-dir-to-output-link
  15998. -->
  15999. (I3 ^dir U +)
  16000. Retracting rl*prefer*rvt*predict-no*H0*6
  16001. -->
  16002. (S1 ^operator O2018 = 0.9999999999999999)
  16003. Retracting rl*prefer*rvt*predict-yes*H0*5
  16004. -->
  16005. (S1 ^operator O2017 = 0.)
  16006. =>WM: (14160: S1 ^operator O2020 +)
  16007. =>WM: (14159: S1 ^operator O2019 +)
  16008. =>WM: (14158: I3 ^dir L)
  16009. =>WM: (14157: O2020 ^name predict-no)
  16010. =>WM: (14156: O2019 ^name predict-yes)
  16011. =>WM: (14155: R1013 ^value 1)
  16012. =>WM: (14154: R1 ^reward R1013)
  16013. <=WM: (14145: S1 ^operator O2017 +)
  16014. <=WM: (14146: S1 ^operator O2018 +)
  16015. <=WM: (14147: S1 ^operator O2018)
  16016. <=WM: (14144: I3 ^dir U)
  16017. <=WM: (14140: R1 ^reward R1012)
  16018. <=WM: (14143: O2018 ^name predict-no)
  16019. <=WM: (14142: O2017 ^name predict-yes)
  16020. <=WM: (14141: R1012 ^value 1)
  16021. --- Inner Elaboration Phase, active level 1 (S1) ---
  16022. Firing prefer*rvt*predict-yes*H0
  16023. -->
  16024. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16025. -->
  16026. (S1 ^operator O2019 = 0.1599599085218832)
  16027. Firing rl*prefer*rvt*predict-yes*H0*1
  16028. -->
  16029. (S1 ^operator O2019 = 0.3895397684730147)
  16030. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16031. -->
  16032. Firing prefer*rvt*predict-no*H0
  16033. -->
  16034. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16035. -->
  16036. (S1 ^operator O2020 = 0.6126679931585133)
  16037. Firing rl*prefer*rvt*predict-no*H0*2
  16038. -->
  16039. (S1 ^operator O2020 = 0.3873370580550821)
  16040. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16041. -->
  16042. inner elaboration loop at bottom goal.
  16043. Retracting rl*prefer*rvt*predict-no*H0*2
  16044. -->
  16045. (S1 ^operator O2018 = 0.3873370580550821)
  16046. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16047. -->
  16048. (S1 ^operator O2018 = 0.6126679931585133)
  16049. Retracting rl*prefer*rvt*predict-yes*H0*1
  16050. -->
  16051. (S1 ^operator O2017 = 0.3895397684730147)
  16052. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16053. -->
  16054. (S1 ^operator O2017 = 0.1599599085218832)
  16055. --- END Proposal Phase ---
  16056. --- Decision Phase ---
  16057. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16058. =>WM: (14161: S1 ^operator O2020)
  16059. 1010: O: O2020 (predict-no)
  16060. --- END Decision Phase ---
  16061. --- Application Phase ---
  16062. --- Firing Productions (PE) For State At Depth 1 ---
  16063. --- Inner Elaboration Phase, active level 1 (S1) ---
  16064. Firing apply*operator
  16065. -->
  16066. (I3 ^predict-no N1010 + :O )
  16067. Firing apply*operator*complete
  16068. -->
  16069. (I3 ^predict-no N1009 - :O )
  16070. inner elaboration loop at bottom goal.
  16071. --- Change Working Memory (PE) ---
  16072. =>WM: (14162: I3 ^predict-no N1010)
  16073. <=WM: (14149: N1009 ^status complete)
  16074. <=WM: (14148: I3 ^predict-no N1009)
  16075. --- Firing Productions (IE) For State At Depth 1 ---
  16076. --- Inner Elaboration Phase, active level 1 (S1) ---
  16077. Firing monitor*world
  16078. -->
  16079. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16080. --- Change Working Memory (IE) ---
  16081. --- END Application Phase ---
  16082. --- Output Phase ---
  16083. ENV: Agent did: predict-no for direction L in state State-A
  16084. In State-A moving L
  16085. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16086. predict error 0
  16087. dir: dir isU
  16088. --- END Output Phase ---
  16089. -/|--- Input Phase ---
  16090. =>WM: (14166: I2 ^dir U)
  16091. =>WM: (14165: I2 ^reward 1)
  16092. =>WM: (14164: I2 ^see 0)
  16093. =>WM: (14163: N1010 ^status complete)
  16094. <=WM: (14152: I2 ^dir L)
  16095. <=WM: (14151: I2 ^reward 1)
  16096. <=WM: (14150: I2 ^see 0)
  16097. =>WM: (14167: I2 ^level-1 L0-root)
  16098. <=WM: (14153: I2 ^level-1 L0-root)
  16099. --- END Input Phase ---
  16100. --- Proposal Phase ---
  16101. --- Inner Elaboration Phase, active level 1 (S1) ---
  16102. Firing elaborate*copy-see-to-output-link
  16103. -->
  16104. (I3 ^see 0 +)
  16105. Firing elaborate*reward*based*on*reward
  16106. -->
  16107. (R1014 ^value 1 +)
  16108. (R1 ^reward R1014 +)
  16109. Firing propose*predict-yes
  16110. -->
  16111. (O2021 ^name predict-yes +)
  16112. (S1 ^operator O2021 +)
  16113. Firing propose*predict-no
  16114. -->
  16115. (O2022 ^name predict-no +)
  16116. (S1 ^operator O2022 +)
  16117. Firing rl*prefer*rvt*predict-no*H0*6
  16118. -->
  16119. (S1 ^operator O2020 = 0.9999999999999999)
  16120. Firing rl*prefer*rvt*predict-yes*H0*5
  16121. -->
  16122. (S1 ^operator O2019 = 0.)
  16123. Firing prefer*rvt*predict-yes*H0
  16124. -->
  16125. Firing prefer*rvt*predict-no*H0
  16126. -->
  16127. Firing elaborate*copy-dir-to-output-link
  16128. -->
  16129. (I3 ^dir U +)
  16130. inner elaboration loop at bottom goal.
  16131. Retracting elaborate*copy-see-to-output-link
  16132. -->
  16133. (I3 ^see 0 +)
  16134. Retracting propose*predict-no
  16135. -->
  16136. (O2020 ^name predict-no +)
  16137. (S1 ^operator O2020 +)
  16138. Retracting propose*predict-yes
  16139. -->
  16140. (O2019 ^name predict-yes +)
  16141. (S1 ^operator O2019 +)
  16142. Retracting elaborate*reward*based*on*reward
  16143. -->
  16144. (R1013 ^value 1 +)
  16145. (R1 ^reward R1013 +)
  16146. Retracting elaborate*copy-dir-to-output-link
  16147. -->
  16148. (I3 ^dir L +)
  16149. Retracting rl*prefer*rvt*predict-no*H0*2
  16150. -->
  16151. (S1 ^operator O2020 = 0.3873370580550821)
  16152. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16153. -->
  16154. (S1 ^operator O2020 = 0.6126679931585133)
  16155. Retracting rl*prefer*rvt*predict-yes*H0*1
  16156. -->
  16157. (S1 ^operator O2019 = 0.3895397684730147)
  16158. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16159. -->
  16160. (S1 ^operator O2019 = 0.1599599085218832)
  16161. =>WM: (14174: S1 ^operator O2022 +)
  16162. =>WM: (14173: S1 ^operator O2021 +)
  16163. =>WM: (14172: I3 ^dir U)
  16164. =>WM: (14171: O2022 ^name predict-no)
  16165. =>WM: (14170: O2021 ^name predict-yes)
  16166. =>WM: (14169: R1014 ^value 1)
  16167. =>WM: (14168: R1 ^reward R1014)
  16168. <=WM: (14159: S1 ^operator O2019 +)
  16169. <=WM: (14160: S1 ^operator O2020 +)
  16170. <=WM: (14161: S1 ^operator O2020)
  16171. <=WM: (14158: I3 ^dir L)
  16172. <=WM: (14154: R1 ^reward R1013)
  16173. <=WM: (14157: O2020 ^name predict-no)
  16174. <=WM: (14156: O2019 ^name predict-yes)
  16175. <=WM: (14155: R1013 ^value 1)
  16176. --- Inner Elaboration Phase, active level 1 (S1) ---
  16177. Firing prefer*rvt*predict-yes*H0
  16178. -->
  16179. Firing rl*prefer*rvt*predict-yes*H0*5
  16180. -->
  16181. (S1 ^operator O2021 = 0.)
  16182. Firing prefer*rvt*predict-no*H0
  16183. -->
  16184. Firing rl*prefer*rvt*predict-no*H0*6
  16185. -->
  16186. (S1 ^operator O2022 = 0.9999999999999999)
  16187. inner elaboration loop at bottom goal.
  16188. Retracting rl*prefer*rvt*predict-no*H0*6
  16189. -->
  16190. (S1 ^operator O2020 = 0.9999999999999999)
  16191. Retracting rl*prefer*rvt*predict-yes*H0*5
  16192. -->
  16193. (S1 ^operator O2019 = 0.)
  16194. --- END Proposal Phase ---
  16195. --- Decision Phase ---
  16196. RL update rl*prefer*rvt*predict-no*H0*2 0.719081 -0.331744 0.387337 -> 0.71908 -0.331744 0.387336(R,m,v=1,0.932961,0.0628962)
  16197. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280926 0.331742 0.612668 -> 0.280925 0.331742 0.612667(R,m,v=1,1,0)
  16198. =>WM: (14175: S1 ^operator O2022)
  16199. 1011: O: O2022 (predict-no)
  16200. --- END Decision Phase ---
  16201. --- Application Phase ---
  16202. --- Firing Productions (PE) For State At Depth 1 ---
  16203. --- Inner Elaboration Phase, active level 1 (S1) ---
  16204. Firing apply*operator
  16205. -->
  16206. (I3 ^predict-no N1011 + :O )
  16207. Firing apply*operator*complete
  16208. -->
  16209. (I3 ^predict-no N1010 - :O )
  16210. inner elaboration loop at bottom goal.
  16211. --- Change Working Memory (PE) ---
  16212. =>WM: (14176: I3 ^predict-no N1011)
  16213. <=WM: (14163: N1010 ^status complete)
  16214. <=WM: (14162: I3 ^predict-no N1010)
  16215. --- Firing Productions (IE) For State At Depth 1 ---
  16216. --- Inner Elaboration Phase, active level 1 (S1) ---
  16217. Firing monitor*world
  16218. -->
  16219. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16220. --- Change Working Memory (IE) ---
  16221. --- END Application Phase ---
  16222. --- Output Phase ---
  16223. ENV: Agent did: predict-no for direction U in state State-A
  16224. In State-A moving U
  16225. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16226. predict error 0
  16227. dir: dir isL
  16228. --- END Output Phase ---
  16229. \--- Input Phase ---
  16230. =>WM: (14180: I2 ^dir L)
  16231. =>WM: (14179: I2 ^reward 1)
  16232. =>WM: (14178: I2 ^see 0)
  16233. =>WM: (14177: N1011 ^status complete)
  16234. <=WM: (14166: I2 ^dir U)
  16235. <=WM: (14165: I2 ^reward 1)
  16236. <=WM: (14164: I2 ^see 0)
  16237. =>WM: (14181: I2 ^level-1 L0-root)
  16238. <=WM: (14167: I2 ^level-1 L0-root)
  16239. --- END Input Phase ---
  16240. --- Proposal Phase ---
  16241. --- Inner Elaboration Phase, active level 1 (S1) ---
  16242. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16243. -->
  16244. (S1 ^operator O2021 = 0.1599599085218832)
  16245. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16246. -->
  16247. (S1 ^operator O2022 = 0.6126672354764739)
  16248. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16249. -->
  16250. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16251. -->
  16252. Firing elaborate*copy-see-to-output-link
  16253. -->
  16254. (I3 ^see 0 +)
  16255. Firing elaborate*reward*based*on*reward
  16256. -->
  16257. (R1015 ^value 1 +)
  16258. (R1 ^reward R1015 +)
  16259. Firing propose*predict-yes
  16260. -->
  16261. (O2023 ^name predict-yes +)
  16262. (S1 ^operator O2023 +)
  16263. Firing propose*predict-no
  16264. -->
  16265. (O2024 ^name predict-no +)
  16266. (S1 ^operator O2024 +)
  16267. Firing rl*prefer*rvt*predict-no*H0*2
  16268. -->
  16269. (S1 ^operator O2022 = 0.3873363003730427)
  16270. Firing rl*prefer*rvt*predict-yes*H0*1
  16271. -->
  16272. (S1 ^operator O2021 = 0.3895397684730147)
  16273. Firing prefer*rvt*predict-yes*H0
  16274. -->
  16275. Firing prefer*rvt*predict-no*H0
  16276. -->
  16277. Firing elaborate*copy-dir-to-output-link
  16278. -->
  16279. (I3 ^dir L +)
  16280. inner elaboration loop at bottom goal.
  16281. Retracting elaborate*copy-see-to-output-link
  16282. -->
  16283. (I3 ^see 0 +)
  16284. Retracting propose*predict-no
  16285. -->
  16286. (O2022 ^name predict-no +)
  16287. (S1 ^operator O2022 +)
  16288. Retracting propose*predict-yes
  16289. -->
  16290. (O2021 ^name predict-yes +)
  16291. (S1 ^operator O2021 +)
  16292. Retracting elaborate*reward*based*on*reward
  16293. -->
  16294. (R1014 ^value 1 +)
  16295. (R1 ^reward R1014 +)
  16296. Retracting elaborate*copy-dir-to-output-link
  16297. -->
  16298. (I3 ^dir U +)
  16299. Retracting rl*prefer*rvt*predict-no*H0*6
  16300. -->
  16301. (S1 ^operator O2022 = 0.9999999999999999)
  16302. Retracting rl*prefer*rvt*predict-yes*H0*5
  16303. -->
  16304. (S1 ^operator O2021 = 0.)
  16305. =>WM: (14188: S1 ^operator O2024 +)
  16306. =>WM: (14187: S1 ^operator O2023 +)
  16307. =>WM: (14186: I3 ^dir L)
  16308. =>WM: (14185: O2024 ^name predict-no)
  16309. =>WM: (14184: O2023 ^name predict-yes)
  16310. =>WM: (14183: R1015 ^value 1)
  16311. =>WM: (14182: R1 ^reward R1015)
  16312. <=WM: (14173: S1 ^operator O2021 +)
  16313. <=WM: (14174: S1 ^operator O2022 +)
  16314. <=WM: (14175: S1 ^operator O2022)
  16315. <=WM: (14172: I3 ^dir U)
  16316. <=WM: (14168: R1 ^reward R1014)
  16317. <=WM: (14171: O2022 ^name predict-no)
  16318. <=WM: (14170: O2021 ^name predict-yes)
  16319. <=WM: (14169: R1014 ^value 1)
  16320. --- Inner Elaboration Phase, active level 1 (S1) ---
  16321. Firing prefer*rvt*predict-yes*H0
  16322. -->
  16323. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16324. -->
  16325. (S1 ^operator O2023 = 0.1599599085218832)
  16326. Firing rl*prefer*rvt*predict-yes*H0*1
  16327. -->
  16328. (S1 ^operator O2023 = 0.3895397684730147)
  16329. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16330. -->
  16331. Firing prefer*rvt*predict-no*H0
  16332. -->
  16333. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16334. -->
  16335. (S1 ^operator O2024 = 0.6126672354764739)
  16336. Firing rl*prefer*rvt*predict-no*H0*2
  16337. -->
  16338. (S1 ^operator O2024 = 0.3873363003730427)
  16339. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16340. -->
  16341. inner elaboration loop at bottom goal.
  16342. Retracting rl*prefer*rvt*predict-no*H0*2
  16343. -->
  16344. (S1 ^operator O2022 = 0.3873363003730427)
  16345. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16346. -->
  16347. (S1 ^operator O2022 = 0.6126672354764739)
  16348. Retracting rl*prefer*rvt*predict-yes*H0*1
  16349. -->
  16350. (S1 ^operator O2021 = 0.3895397684730147)
  16351. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16352. -->
  16353. (S1 ^operator O2021 = 0.1599599085218832)
  16354. --- END Proposal Phase ---
  16355. --- Decision Phase ---
  16356. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16357. =>WM: (14189: S1 ^operator O2024)
  16358. 1012: O: O2024 (predict-no)
  16359. --- END Decision Phase ---
  16360. --- Application Phase ---
  16361. --- Firing Productions (PE) For State At Depth 1 ---
  16362. --- Inner Elaboration Phase, active level 1 (S1) ---
  16363. Firing apply*operator
  16364. -->
  16365. (I3 ^predict-no N1012 + :O )
  16366. Firing apply*operator*complete
  16367. -->
  16368. (I3 ^predict-no N1011 - :O )
  16369. inner elaboration loop at bottom goal.
  16370. --- Change Working Memory (PE) ---
  16371. =>WM: (14190: I3 ^predict-no N1012)
  16372. <=WM: (14177: N1011 ^status complete)
  16373. <=WM: (14176: I3 ^predict-no N1011)
  16374. --- Firing Productions (IE) For State At Depth 1 ---
  16375. --- Inner Elaboration Phase, active level 1 (S1) ---
  16376. Firing monitor*world
  16377. -->
  16378. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16379. --- Change Working Memory (IE) ---
  16380. --- END Application Phase ---
  16381. --- Output Phase ---
  16382. ENV: Agent did: predict-no for direction L in state State-A
  16383. In State-A moving L
  16384. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16385. predict error 0
  16386. dir: dir isR
  16387. --- END Output Phase ---
  16388. -/|--- Input Phase ---
  16389. =>WM: (14194: I2 ^dir R)
  16390. =>WM: (14193: I2 ^reward 1)
  16391. =>WM: (14192: I2 ^see 0)
  16392. =>WM: (14191: N1012 ^status complete)
  16393. <=WM: (14180: I2 ^dir L)
  16394. <=WM: (14179: I2 ^reward 1)
  16395. <=WM: (14178: I2 ^see 0)
  16396. =>WM: (14195: I2 ^level-1 L0-root)
  16397. <=WM: (14181: I2 ^level-1 L0-root)
  16398. --- END Input Phase ---
  16399. --- Proposal Phase ---
  16400. --- Inner Elaboration Phase, active level 1 (S1) ---
  16401. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16402. -->
  16403. (S1 ^operator O2023 = 0.8155925712213685)
  16404. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16405. -->
  16406. (S1 ^operator O2024 = -0.00558448899823713)
  16407. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16408. -->
  16409. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16410. -->
  16411. Firing elaborate*copy-see-to-output-link
  16412. -->
  16413. (I3 ^see 0 +)
  16414. Firing elaborate*reward*based*on*reward
  16415. -->
  16416. (R1016 ^value 1 +)
  16417. (R1 ^reward R1016 +)
  16418. Firing propose*predict-yes
  16419. -->
  16420. (O2025 ^name predict-yes +)
  16421. (S1 ^operator O2025 +)
  16422. Firing propose*predict-no
  16423. -->
  16424. (O2026 ^name predict-no +)
  16425. (S1 ^operator O2026 +)
  16426. Firing rl*prefer*rvt*predict-no*H0*4
  16427. -->
  16428. (S1 ^operator O2024 = 0.4476193147022436)
  16429. Firing rl*prefer*rvt*predict-yes*H0*3
  16430. -->
  16431. (S1 ^operator O2023 = 0.1844119300802781)
  16432. Firing prefer*rvt*predict-yes*H0
  16433. -->
  16434. Firing prefer*rvt*predict-no*H0
  16435. -->
  16436. Firing elaborate*copy-dir-to-output-link
  16437. -->
  16438. (I3 ^dir R +)
  16439. inner elaboration loop at bottom goal.
  16440. Retracting elaborate*copy-see-to-output-link
  16441. -->
  16442. (I3 ^see 0 +)
  16443. Retracting propose*predict-no
  16444. -->
  16445. (O2024 ^name predict-no +)
  16446. (S1 ^operator O2024 +)
  16447. Retracting propose*predict-yes
  16448. -->
  16449. (O2023 ^name predict-yes +)
  16450. (S1 ^operator O2023 +)
  16451. Retracting elaborate*reward*based*on*reward
  16452. -->
  16453. (R1015 ^value 1 +)
  16454. (R1 ^reward R1015 +)
  16455. Retracting elaborate*copy-dir-to-output-link
  16456. -->
  16457. (I3 ^dir L +)
  16458. Retracting rl*prefer*rvt*predict-no*H0*2
  16459. -->
  16460. (S1 ^operator O2024 = 0.3873363003730427)
  16461. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  16462. -->
  16463. (S1 ^operator O2024 = 0.6126672354764739)
  16464. Retracting rl*prefer*rvt*predict-yes*H0*1
  16465. -->
  16466. (S1 ^operator O2023 = 0.3895397684730147)
  16467. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  16468. -->
  16469. (S1 ^operator O2023 = 0.1599599085218832)
  16470. =>WM: (14202: S1 ^operator O2026 +)
  16471. =>WM: (14201: S1 ^operator O2025 +)
  16472. =>WM: (14200: I3 ^dir R)
  16473. =>WM: (14199: O2026 ^name predict-no)
  16474. =>WM: (14198: O2025 ^name predict-yes)
  16475. =>WM: (14197: R1016 ^value 1)
  16476. =>WM: (14196: R1 ^reward R1016)
  16477. <=WM: (14187: S1 ^operator O2023 +)
  16478. <=WM: (14188: S1 ^operator O2024 +)
  16479. <=WM: (14189: S1 ^operator O2024)
  16480. <=WM: (14186: I3 ^dir L)
  16481. <=WM: (14182: R1 ^reward R1015)
  16482. <=WM: (14185: O2024 ^name predict-no)
  16483. <=WM: (14184: O2023 ^name predict-yes)
  16484. <=WM: (14183: R1015 ^value 1)
  16485. --- Inner Elaboration Phase, active level 1 (S1) ---
  16486. Firing prefer*rvt*predict-yes*H0
  16487. -->
  16488. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16489. -->
  16490. (S1 ^operator O2025 = 0.8155925712213685)
  16491. Firing rl*prefer*rvt*predict-yes*H0*3
  16492. -->
  16493. (S1 ^operator O2025 = 0.1844119300802781)
  16494. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16495. -->
  16496. Firing prefer*rvt*predict-no*H0
  16497. -->
  16498. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16499. -->
  16500. (S1 ^operator O2026 = -0.00558448899823713)
  16501. Firing rl*prefer*rvt*predict-no*H0*4
  16502. -->
  16503. (S1 ^operator O2026 = 0.4476193147022436)
  16504. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16505. -->
  16506. inner elaboration loop at bottom goal.
  16507. Retracting rl*prefer*rvt*predict-no*H0*4
  16508. -->
  16509. (S1 ^operator O2024 = 0.4476193147022436)
  16510. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16511. -->
  16512. (S1 ^operator O2024 = -0.00558448899823713)
  16513. Retracting rl*prefer*rvt*predict-yes*H0*3
  16514. -->
  16515. (S1 ^operator O2023 = 0.1844119300802781)
  16516. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16517. -->
  16518. (S1 ^operator O2023 = 0.8155925712213685)
  16519. --- END Proposal Phase ---
  16520. --- Decision Phase ---
  16521. RL update rl*prefer*rvt*predict-no*H0*2 0.71908 -0.331744 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.933333,0.0625698)
  16522. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280925 0.331742 0.612667 -> 0.280924 0.331742 0.612667(R,m,v=1,1,0)
  16523. =>WM: (14203: S1 ^operator O2025)
  16524. 1013: O: O2025 (predict-yes)
  16525. --- END Decision Phase ---
  16526. --- Application Phase ---
  16527. --- Firing Productions (PE) For State At Depth 1 ---
  16528. --- Inner Elaboration Phase, active level 1 (S1) ---
  16529. Firing apply*operator
  16530. -->
  16531. (I3 ^predict-yes N1013 + :O )
  16532. Firing apply*operator*complete
  16533. -->
  16534. (I3 ^predict-no N1012 - :O )
  16535. inner elaboration loop at bottom goal.
  16536. --- Change Working Memory (PE) ---
  16537. =>WM: (14204: I3 ^predict-yes N1013)
  16538. <=WM: (14191: N1012 ^status complete)
  16539. <=WM: (14190: I3 ^predict-no N1012)
  16540. --- Firing Productions (IE) For State At Depth 1 ---
  16541. --- Inner Elaboration Phase, active level 1 (S1) ---
  16542. Firing monitor*world
  16543. -->
  16544. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16545. --- Change Working Memory (IE) ---
  16546. --- END Application Phase ---
  16547. --- Output Phase ---
  16548. ENV: Agent did: predict-yes for direction R in state State-A
  16549. In State-A moving R
  16550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16551. predict error 0
  16552. dir: dir isL
  16553. --- END Output Phase ---
  16554. \-/--- Input Phase ---
  16555. =>WM: (14208: I2 ^dir L)
  16556. =>WM: (14207: I2 ^reward 1)
  16557. =>WM: (14206: I2 ^see 1)
  16558. =>WM: (14205: N1013 ^status complete)
  16559. <=WM: (14194: I2 ^dir R)
  16560. <=WM: (14193: I2 ^reward 1)
  16561. <=WM: (14192: I2 ^see 0)
  16562. =>WM: (14209: I2 ^level-1 R1-root)
  16563. <=WM: (14195: I2 ^level-1 L0-root)
  16564. --- END Input Phase ---
  16565. --- Proposal Phase ---
  16566. --- Inner Elaboration Phase, active level 1 (S1) ---
  16567. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  16568. -->
  16569. (S1 ^operator O2025 = 0.6104598218395145)
  16570. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  16571. -->
  16572. (S1 ^operator O2026 = 0.2714993082286609)
  16573. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16574. -->
  16575. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16576. -->
  16577. Firing elaborate*copy-see-to-output-link
  16578. -->
  16579. (I3 ^see 1 +)
  16580. Firing elaborate*reward*based*on*reward
  16581. -->
  16582. (R1017 ^value 1 +)
  16583. (R1 ^reward R1017 +)
  16584. Firing propose*predict-yes
  16585. -->
  16586. (O2027 ^name predict-yes +)
  16587. (S1 ^operator O2027 +)
  16588. Firing propose*predict-no
  16589. -->
  16590. (O2028 ^name predict-no +)
  16591. (S1 ^operator O2028 +)
  16592. Firing rl*prefer*rvt*predict-no*H0*2
  16593. -->
  16594. (S1 ^operator O2026 = 0.3873357699956153)
  16595. Firing rl*prefer*rvt*predict-yes*H0*1
  16596. -->
  16597. (S1 ^operator O2025 = 0.3895397684730147)
  16598. Firing prefer*rvt*predict-yes*H0
  16599. -->
  16600. Firing prefer*rvt*predict-no*H0
  16601. -->
  16602. Firing elaborate*copy-dir-to-output-link
  16603. -->
  16604. (I3 ^dir L +)
  16605. inner elaboration loop at bottom goal.
  16606. Retracting elaborate*copy-see-to-output-link
  16607. -->
  16608. (I3 ^see 0 +)
  16609. Retracting propose*predict-no
  16610. -->
  16611. (O2026 ^name predict-no +)
  16612. (S1 ^operator O2026 +)
  16613. Retracting propose*predict-yes
  16614. -->
  16615. (O2025 ^name predict-yes +)
  16616. (S1 ^operator O2025 +)
  16617. Retracting elaborate*reward*based*on*reward
  16618. -->
  16619. (R1016 ^value 1 +)
  16620. (R1 ^reward R1016 +)
  16621. Retracting elaborate*copy-dir-to-output-link
  16622. -->
  16623. (I3 ^dir R +)
  16624. Retracting rl*prefer*rvt*predict-no*H0*4
  16625. -->
  16626. (S1 ^operator O2026 = 0.4476193147022436)
  16627. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16628. -->
  16629. (S1 ^operator O2026 = -0.00558448899823713)
  16630. Retracting rl*prefer*rvt*predict-yes*H0*3
  16631. -->
  16632. (S1 ^operator O2025 = 0.1844119300802781)
  16633. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16634. -->
  16635. (S1 ^operator O2025 = 0.8155925712213685)
  16636. =>WM: (14217: S1 ^operator O2028 +)
  16637. =>WM: (14216: S1 ^operator O2027 +)
  16638. =>WM: (14215: I3 ^dir L)
  16639. =>WM: (14214: O2028 ^name predict-no)
  16640. =>WM: (14213: O2027 ^name predict-yes)
  16641. =>WM: (14212: R1017 ^value 1)
  16642. =>WM: (14211: R1 ^reward R1017)
  16643. =>WM: (14210: I3 ^see 1)
  16644. <=WM: (14201: S1 ^operator O2025 +)
  16645. <=WM: (14203: S1 ^operator O2025)
  16646. <=WM: (14202: S1 ^operator O2026 +)
  16647. <=WM: (14200: I3 ^dir R)
  16648. <=WM: (14196: R1 ^reward R1016)
  16649. <=WM: (14125: I3 ^see 0)
  16650. <=WM: (14199: O2026 ^name predict-no)
  16651. <=WM: (14198: O2025 ^name predict-yes)
  16652. <=WM: (14197: R1016 ^value 1)
  16653. --- Inner Elaboration Phase, active level 1 (S1) ---
  16654. Firing prefer*rvt*predict-yes*H0
  16655. -->
  16656. Firing rl*prefer*rvt*predict-yes*H0*1
  16657. -->
  16658. (S1 ^operator O2027 = 0.3895397684730147)
  16659. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  16660. -->
  16661. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  16662. -->
  16663. (S1 ^operator O2027 = 0.6104598218395145)
  16664. Firing prefer*rvt*predict-no*H0
  16665. -->
  16666. Firing rl*prefer*rvt*predict-no*H0*2
  16667. -->
  16668. (S1 ^operator O2028 = 0.3873357699956153)
  16669. Firing prefer*rvt*predict-no*H0*2*v1*H1
  16670. -->
  16671. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  16672. -->
  16673. (S1 ^operator O2028 = 0.2714993082286609)
  16674. inner elaboration loop at bottom goal.
  16675. Retracting rl*prefer*rvt*predict-no*H0*2
  16676. -->
  16677. (S1 ^operator O2026 = 0.3873357699956153)
  16678. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  16679. -->
  16680. (S1 ^operator O2026 = 0.2714993082286609)
  16681. Retracting rl*prefer*rvt*predict-yes*H0*1
  16682. -->
  16683. (S1 ^operator O2025 = 0.3895397684730147)
  16684. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  16685. -->
  16686. (S1 ^operator O2025 = 0.6104598218395145)
  16687. --- END Proposal Phase ---
  16688. --- Decision Phase ---
  16689. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.901734,0.0891249)
  16690. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324591 0.491001 0.815593 -> 0.32459 0.491002 0.815592(R,m,v=1,1,0)
  16691. =>WM: (14218: S1 ^operator O2027)
  16692. 1014: O: O2027 (predict-yes)
  16693. --- END Decision Phase ---
  16694. --- Application Phase ---
  16695. --- Firing Productions (PE) For State At Depth 1 ---
  16696. --- Inner Elaboration Phase, active level 1 (S1) ---
  16697. Firing apply*operator
  16698. -->
  16699. (I3 ^predict-yes N1014 + :O )
  16700. Firing apply*operator*complete
  16701. -->
  16702. (I3 ^predict-yes N1013 - :O )
  16703. inner elaboration loop at bottom goal.
  16704. --- Change Working Memory (PE) ---
  16705. =>WM: (14219: I3 ^predict-yes N1014)
  16706. <=WM: (14205: N1013 ^status complete)
  16707. <=WM: (14204: I3 ^predict-yes N1013)
  16708. --- Firing Productions (IE) For State At Depth 1 ---
  16709. --- Inner Elaboration Phase, active level 1 (S1) ---
  16710. Firing monitor*world
  16711. -->
  16712. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16713. --- Change Working Memory (IE) ---
  16714. --- END Application Phase ---
  16715. --- Output Phase ---
  16716. ENV: Agent did: predict-yes for direction L in state State-B
  16717. In State-B moving L
  16718. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  16719. predict error 0
  16720. dir: dir isU
  16721. --- END Output Phase ---
  16722. |\---- Input Phase ---
  16723. =>WM: (14223: I2 ^dir U)
  16724. =>WM: (14222: I2 ^reward 1)
  16725. =>WM: (14221: I2 ^see 1)
  16726. =>WM: (14220: N1014 ^status complete)
  16727. <=WM: (14208: I2 ^dir L)
  16728. <=WM: (14207: I2 ^reward 1)
  16729. <=WM: (14206: I2 ^see 1)
  16730. =>WM: (14224: I2 ^level-1 L1-root)
  16731. <=WM: (14209: I2 ^level-1 R1-root)
  16732. --- END Input Phase ---
  16733. --- Proposal Phase ---
  16734. --- Inner Elaboration Phase, active level 1 (S1) ---
  16735. Firing elaborate*copy-see-to-output-link
  16736. -->
  16737. (I3 ^see 1 +)
  16738. Firing elaborate*reward*based*on*reward
  16739. -->
  16740. (R1018 ^value 1 +)
  16741. (R1 ^reward R1018 +)
  16742. Firing propose*predict-yes
  16743. -->
  16744. (O2029 ^name predict-yes +)
  16745. (S1 ^operator O2029 +)
  16746. Firing propose*predict-no
  16747. -->
  16748. (O2030 ^name predict-no +)
  16749. (S1 ^operator O2030 +)
  16750. Firing rl*prefer*rvt*predict-no*H0*6
  16751. -->
  16752. (S1 ^operator O2028 = 0.9999999999999999)
  16753. Firing rl*prefer*rvt*predict-yes*H0*5
  16754. -->
  16755. (S1 ^operator O2027 = 0.)
  16756. Firing prefer*rvt*predict-yes*H0
  16757. -->
  16758. Firing prefer*rvt*predict-no*H0
  16759. -->
  16760. Firing elaborate*copy-dir-to-output-link
  16761. -->
  16762. (I3 ^dir U +)
  16763. inner elaboration loop at bottom goal.
  16764. Retracting elaborate*copy-see-to-output-link
  16765. -->
  16766. (I3 ^see 1 +)
  16767. Retracting propose*predict-no
  16768. -->
  16769. (O2028 ^name predict-no +)
  16770. (S1 ^operator O2028 +)
  16771. Retracting propose*predict-yes
  16772. -->
  16773. (O2027 ^name predict-yes +)
  16774. (S1 ^operator O2027 +)
  16775. Retracting elaborate*reward*based*on*reward
  16776. -->
  16777. (R1017 ^value 1 +)
  16778. (R1 ^reward R1017 +)
  16779. Retracting elaborate*copy-dir-to-output-link
  16780. -->
  16781. (I3 ^dir L +)
  16782. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  16783. -->
  16784. (S1 ^operator O2028 = 0.2714993082286609)
  16785. Retracting rl*prefer*rvt*predict-no*H0*2
  16786. -->
  16787. (S1 ^operator O2028 = 0.3873357699956153)
  16788. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  16789. -->
  16790. (S1 ^operator O2027 = 0.6104598218395145)
  16791. Retracting rl*prefer*rvt*predict-yes*H0*1
  16792. -->
  16793. (S1 ^operator O2027 = 0.3895397684730147)
  16794. =>WM: (14231: S1 ^operator O2030 +)
  16795. =>WM: (14230: S1 ^operator O2029 +)
  16796. =>WM: (14229: I3 ^dir U)
  16797. =>WM: (14228: O2030 ^name predict-no)
  16798. =>WM: (14227: O2029 ^name predict-yes)
  16799. =>WM: (14226: R1018 ^value 1)
  16800. =>WM: (14225: R1 ^reward R1018)
  16801. <=WM: (14216: S1 ^operator O2027 +)
  16802. <=WM: (14218: S1 ^operator O2027)
  16803. <=WM: (14217: S1 ^operator O2028 +)
  16804. <=WM: (14215: I3 ^dir L)
  16805. <=WM: (14211: R1 ^reward R1017)
  16806. <=WM: (14214: O2028 ^name predict-no)
  16807. <=WM: (14213: O2027 ^name predict-yes)
  16808. <=WM: (14212: R1017 ^value 1)
  16809. --- Inner Elaboration Phase, active level 1 (S1) ---
  16810. Firing prefer*rvt*predict-yes*H0
  16811. -->
  16812. Firing rl*prefer*rvt*predict-yes*H0*5
  16813. -->
  16814. (S1 ^operator O2029 = 0.)
  16815. Firing prefer*rvt*predict-no*H0
  16816. -->
  16817. Firing rl*prefer*rvt*predict-no*H0*6
  16818. -->
  16819. (S1 ^operator O2030 = 0.9999999999999999)
  16820. inner elaboration loop at bottom goal.
  16821. Retracting rl*prefer*rvt*predict-no*H0*6
  16822. -->
  16823. (S1 ^operator O2028 = 0.9999999999999999)
  16824. Retracting rl*prefer*rvt*predict-yes*H0*5
  16825. -->
  16826. (S1 ^operator O2027 = 0.)
  16827. --- END Proposal Phase ---
  16828. --- Decision Phase ---
  16829. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322411 0.38954(R,m,v=1,0.894737,0.0947368)
  16830. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  16831. =>WM: (14232: S1 ^operator O2030)
  16832. 1015: O: O2030 (predict-no)
  16833. --- END Decision Phase ---
  16834. --- Application Phase ---
  16835. --- Firing Productions (PE) For State At Depth 1 ---
  16836. --- Inner Elaboration Phase, active level 1 (S1) ---
  16837. Firing apply*operator
  16838. -->
  16839. (I3 ^predict-no N1015 + :O )
  16840. Firing apply*operator*complete
  16841. -->
  16842. (I3 ^predict-yes N1014 - :O )
  16843. inner elaboration loop at bottom goal.
  16844. --- Change Working Memory (PE) ---
  16845. =>WM: (14233: I3 ^predict-no N1015)
  16846. <=WM: (14220: N1014 ^status complete)
  16847. <=WM: (14219: I3 ^predict-yes N1014)
  16848. --- Firing Productions (IE) For State At Depth 1 ---
  16849. --- Inner Elaboration Phase, active level 1 (S1) ---
  16850. Firing monitor*world
  16851. -->
  16852. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16853. --- Change Working Memory (IE) ---
  16854. --- END Application Phase ---
  16855. --- Output Phase ---
  16856. ENV: Agent did: predict-no for direction U in state State-A
  16857. In State-A moving U
  16858. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16859. predict error 0
  16860. dir: dir isU
  16861. --- END Output Phase ---
  16862. /|\--- Input Phase ---
  16863. =>WM: (14237: I2 ^dir U)
  16864. =>WM: (14236: I2 ^reward 1)
  16865. =>WM: (14235: I2 ^see 0)
  16866. =>WM: (14234: N1015 ^status complete)
  16867. <=WM: (14223: I2 ^dir U)
  16868. <=WM: (14222: I2 ^reward 1)
  16869. <=WM: (14221: I2 ^see 1)
  16870. =>WM: (14238: I2 ^level-1 L1-root)
  16871. <=WM: (14224: I2 ^level-1 L1-root)
  16872. --- END Input Phase ---
  16873. --- Proposal Phase ---
  16874. --- Inner Elaboration Phase, active level 1 (S1) ---
  16875. Firing elaborate*copy-see-to-output-link
  16876. -->
  16877. (I3 ^see 0 +)
  16878. Firing elaborate*reward*based*on*reward
  16879. -->
  16880. (R1019 ^value 1 +)
  16881. (R1 ^reward R1019 +)
  16882. Firing propose*predict-yes
  16883. -->
  16884. (O2031 ^name predict-yes +)
  16885. (S1 ^operator O2031 +)
  16886. Firing propose*predict-no
  16887. -->
  16888. (O2032 ^name predict-no +)
  16889. (S1 ^operator O2032 +)
  16890. Firing rl*prefer*rvt*predict-no*H0*6
  16891. -->
  16892. (S1 ^operator O2030 = 0.9999999999999999)
  16893. Firing rl*prefer*rvt*predict-yes*H0*5
  16894. -->
  16895. (S1 ^operator O2029 = 0.)
  16896. Firing prefer*rvt*predict-yes*H0
  16897. -->
  16898. Firing prefer*rvt*predict-no*H0
  16899. -->
  16900. Firing elaborate*copy-dir-to-output-link
  16901. -->
  16902. (I3 ^dir U +)
  16903. inner elaboration loop at bottom goal.
  16904. Retracting elaborate*copy-see-to-output-link
  16905. -->
  16906. (I3 ^see 1 +)
  16907. Retracting propose*predict-no
  16908. -->
  16909. (O2030 ^name predict-no +)
  16910. (S1 ^operator O2030 +)
  16911. Retracting propose*predict-yes
  16912. -->
  16913. (O2029 ^name predict-yes +)
  16914. (S1 ^operator O2029 +)
  16915. Retracting elaborate*reward*based*on*reward
  16916. -->
  16917. (R1018 ^value 1 +)
  16918. (R1 ^reward R1018 +)
  16919. Retracting elaborate*copy-dir-to-output-link
  16920. -->
  16921. (I3 ^dir U +)
  16922. Retracting rl*prefer*rvt*predict-no*H0*6
  16923. -->
  16924. (S1 ^operator O2030 = 0.9999999999999999)
  16925. Retracting rl*prefer*rvt*predict-yes*H0*5
  16926. -->
  16927. (S1 ^operator O2029 = 0.)
  16928. =>WM: (14245: S1 ^operator O2032 +)
  16929. =>WM: (14244: S1 ^operator O2031 +)
  16930. =>WM: (14243: O2032 ^name predict-no)
  16931. =>WM: (14242: O2031 ^name predict-yes)
  16932. =>WM: (14241: R1019 ^value 1)
  16933. =>WM: (14240: R1 ^reward R1019)
  16934. =>WM: (14239: I3 ^see 0)
  16935. <=WM: (14230: S1 ^operator O2029 +)
  16936. <=WM: (14231: S1 ^operator O2030 +)
  16937. <=WM: (14232: S1 ^operator O2030)
  16938. <=WM: (14225: R1 ^reward R1018)
  16939. <=WM: (14210: I3 ^see 1)
  16940. <=WM: (14228: O2030 ^name predict-no)
  16941. <=WM: (14227: O2029 ^name predict-yes)
  16942. <=WM: (14226: R1018 ^value 1)
  16943. --- Inner Elaboration Phase, active level 1 (S1) ---
  16944. Firing prefer*rvt*predict-yes*H0
  16945. -->
  16946. Firing rl*prefer*rvt*predict-yes*H0*5
  16947. -->
  16948. (S1 ^operator O2031 = 0.)
  16949. Firing prefer*rvt*predict-no*H0
  16950. -->
  16951. Firing rl*prefer*rvt*predict-no*H0*6
  16952. -->
  16953. (S1 ^operator O2032 = 0.9999999999999999)
  16954. inner elaboration loop at bottom goal.
  16955. Retracting rl*prefer*rvt*predict-no*H0*6
  16956. -->
  16957. (S1 ^operator O2030 = 0.9999999999999999)
  16958. Retracting rl*prefer*rvt*predict-yes*H0*5
  16959. -->
  16960. (S1 ^operator O2029 = 0.)
  16961. --- END Proposal Phase ---
  16962. --- Decision Phase ---
  16963. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16964. =>WM: (14246: S1 ^operator O2032)
  16965. 1016: O: O2032 (predict-no)
  16966. --- END Decision Phase ---
  16967. --- Application Phase ---
  16968. --- Firing Productions (PE) For State At Depth 1 ---
  16969. --- Inner Elaboration Phase, active level 1 (S1) ---
  16970. Firing apply*operator
  16971. -->
  16972. (I3 ^predict-no N1016 + :O )
  16973. Firing apply*operator*complete
  16974. -->
  16975. (I3 ^predict-no N1015 - :O )
  16976. inner elaboration loop at bottom goal.
  16977. --- Change Working Memory (PE) ---
  16978. =>WM: (14247: I3 ^predict-no N1016)
  16979. <=WM: (14234: N1015 ^status complete)
  16980. <=WM: (14233: I3 ^predict-no N1015)
  16981. --- Firing Productions (IE) For State At Depth 1 ---
  16982. --- Inner Elaboration Phase, active level 1 (S1) ---
  16983. Firing monitor*world
  16984. -->
  16985. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16986. --- Change Working Memory (IE) ---
  16987. --- END Application Phase ---
  16988. --- Output Phase ---
  16989. ENV: Agent did: predict-no for direction U in state State-A
  16990. In State-A moving U
  16991. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16992. predict error 0
  16993. dir: dir isL
  16994. --- END Output Phase ---
  16995. -/|--- Input Phase ---
  16996. =>WM: (14251: I2 ^dir L)
  16997. =>WM: (14250: I2 ^reward 1)
  16998. =>WM: (14249: I2 ^see 0)
  16999. =>WM: (14248: N1016 ^status complete)
  17000. <=WM: (14237: I2 ^dir U)
  17001. <=WM: (14236: I2 ^reward 1)
  17002. <=WM: (14235: I2 ^see 0)
  17003. =>WM: (14252: I2 ^level-1 L1-root)
  17004. <=WM: (14238: I2 ^level-1 L1-root)
  17005. --- END Input Phase ---
  17006. --- Proposal Phase ---
  17007. --- Inner Elaboration Phase, active level 1 (S1) ---
  17008. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  17009. -->
  17010. (S1 ^operator O2032 = 0.6126628429603742)
  17011. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  17012. -->
  17013. (S1 ^operator O2031 = -0.02274740735326741)
  17014. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17015. -->
  17016. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17017. -->
  17018. Firing elaborate*copy-see-to-output-link
  17019. -->
  17020. (I3 ^see 0 +)
  17021. Firing elaborate*reward*based*on*reward
  17022. -->
  17023. (R1020 ^value 1 +)
  17024. (R1 ^reward R1020 +)
  17025. Firing propose*predict-yes
  17026. -->
  17027. (O2033 ^name predict-yes +)
  17028. (S1 ^operator O2033 +)
  17029. Firing propose*predict-no
  17030. -->
  17031. (O2034 ^name predict-no +)
  17032. (S1 ^operator O2034 +)
  17033. Firing rl*prefer*rvt*predict-no*H0*2
  17034. -->
  17035. (S1 ^operator O2032 = 0.3873357699956153)
  17036. Firing rl*prefer*rvt*predict-yes*H0*1
  17037. -->
  17038. (S1 ^operator O2031 = 0.3895398299261354)
  17039. Firing prefer*rvt*predict-yes*H0
  17040. -->
  17041. Firing prefer*rvt*predict-no*H0
  17042. -->
  17043. Firing elaborate*copy-dir-to-output-link
  17044. -->
  17045. (I3 ^dir L +)
  17046. inner elaboration loop at bottom goal.
  17047. Retracting elaborate*copy-see-to-output-link
  17048. -->
  17049. (I3 ^see 0 +)
  17050. Retracting propose*predict-no
  17051. -->
  17052. (O2032 ^name predict-no +)
  17053. (S1 ^operator O2032 +)
  17054. Retracting propose*predict-yes
  17055. -->
  17056. (O2031 ^name predict-yes +)
  17057. (S1 ^operator O2031 +)
  17058. Retracting elaborate*reward*based*on*reward
  17059. -->
  17060. (R1019 ^value 1 +)
  17061. (R1 ^reward R1019 +)
  17062. Retracting elaborate*copy-dir-to-output-link
  17063. -->
  17064. (I3 ^dir U +)
  17065. Retracting rl*prefer*rvt*predict-no*H0*6
  17066. -->
  17067. (S1 ^operator O2032 = 0.9999999999999999)
  17068. Retracting rl*prefer*rvt*predict-yes*H0*5
  17069. -->
  17070. (S1 ^operator O2031 = 0.)
  17071. =>WM: (14259: S1 ^operator O2034 +)
  17072. =>WM: (14258: S1 ^operator O2033 +)
  17073. =>WM: (14257: I3 ^dir L)
  17074. =>WM: (14256: O2034 ^name predict-no)
  17075. =>WM: (14255: O2033 ^name predict-yes)
  17076. =>WM: (14254: R1020 ^value 1)
  17077. =>WM: (14253: R1 ^reward R1020)
  17078. <=WM: (14244: S1 ^operator O2031 +)
  17079. <=WM: (14245: S1 ^operator O2032 +)
  17080. <=WM: (14246: S1 ^operator O2032)
  17081. <=WM: (14229: I3 ^dir U)
  17082. <=WM: (14240: R1 ^reward R1019)
  17083. <=WM: (14243: O2032 ^name predict-no)
  17084. <=WM: (14242: O2031 ^name predict-yes)
  17085. <=WM: (14241: R1019 ^value 1)
  17086. --- Inner Elaboration Phase, active level 1 (S1) ---
  17087. Firing prefer*rvt*predict-yes*H0
  17088. -->
  17089. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  17090. -->
  17091. (S1 ^operator O2033 = -0.02274740735326741)
  17092. Firing rl*prefer*rvt*predict-yes*H0*1
  17093. -->
  17094. (S1 ^operator O2033 = 0.3895398299261354)
  17095. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17096. -->
  17097. Firing prefer*rvt*predict-no*H0
  17098. -->
  17099. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  17100. -->
  17101. (S1 ^operator O2034 = 0.6126628429603742)
  17102. Firing rl*prefer*rvt*predict-no*H0*2
  17103. -->
  17104. (S1 ^operator O2034 = 0.3873357699956153)
  17105. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17106. -->
  17107. inner elaboration loop at bottom goal.
  17108. Retracting rl*prefer*rvt*predict-no*H0*2
  17109. -->
  17110. (S1 ^operator O2032 = 0.3873357699956153)
  17111. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  17112. -->
  17113. (S1 ^operator O2032 = 0.6126628429603742)
  17114. Retracting rl*prefer*rvt*predict-yes*H0*1
  17115. -->
  17116. (S1 ^operator O2031 = 0.3895398299261354)
  17117. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  17118. -->
  17119. (S1 ^operator O2031 = -0.02274740735326741)
  17120. --- END Proposal Phase ---
  17121. --- Decision Phase ---
  17122. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17123. =>WM: (14260: S1 ^operator O2034)
  17124. 1017: O: O2034 (predict-no)
  17125. --- END Decision Phase ---
  17126. --- Application Phase ---
  17127. --- Firing Productions (PE) For State At Depth 1 ---
  17128. --- Inner Elaboration Phase, active level 1 (S1) ---
  17129. Firing apply*operator
  17130. -->
  17131. (I3 ^predict-no N1017 + :O )
  17132. Firing apply*operator*complete
  17133. -->
  17134. (I3 ^predict-no N1016 - :O )
  17135. inner elaboration loop at bottom goal.
  17136. --- Change Working Memory (PE) ---
  17137. =>WM: (14261: I3 ^predict-no N1017)
  17138. <=WM: (14248: N1016 ^status complete)
  17139. <=WM: (14247: I3 ^predict-no N1016)
  17140. --- Firing Productions (IE) For State At Depth 1 ---
  17141. --- Inner Elaboration Phase, active level 1 (S1) ---
  17142. Firing monitor*world
  17143. -->
  17144. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17145. --- Change Working Memory (IE) ---
  17146. --- END Application Phase ---
  17147. --- Output Phase ---
  17148. ENV: Agent did: predict-no for direction L in state State-A
  17149. In State-A moving L
  17150. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17151. predict error 0
  17152. dir: dir isL
  17153. --- END Output Phase ---
  17154. \-/--- Input Phase ---
  17155. =>WM: (14265: I2 ^dir L)
  17156. =>WM: (14264: I2 ^reward 1)
  17157. =>WM: (14263: I2 ^see 0)
  17158. =>WM: (14262: N1017 ^status complete)
  17159. <=WM: (14251: I2 ^dir L)
  17160. <=WM: (14250: I2 ^reward 1)
  17161. <=WM: (14249: I2 ^see 0)
  17162. =>WM: (14266: I2 ^level-1 L0-root)
  17163. <=WM: (14252: I2 ^level-1 L1-root)
  17164. --- END Input Phase ---
  17165. --- Proposal Phase ---
  17166. --- Inner Elaboration Phase, active level 1 (S1) ---
  17167. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  17168. -->
  17169. (S1 ^operator O2033 = 0.1599599085218832)
  17170. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  17171. -->
  17172. (S1 ^operator O2034 = 0.6126667050990464)
  17173. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17174. -->
  17175. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17176. -->
  17177. Firing elaborate*copy-see-to-output-link
  17178. -->
  17179. (I3 ^see 0 +)
  17180. Firing elaborate*reward*based*on*reward
  17181. -->
  17182. (R1021 ^value 1 +)
  17183. (R1 ^reward R1021 +)
  17184. Firing propose*predict-yes
  17185. -->
  17186. (O2035 ^name predict-yes +)
  17187. (S1 ^operator O2035 +)
  17188. Firing propose*predict-no
  17189. -->
  17190. (O2036 ^name predict-no +)
  17191. (S1 ^operator O2036 +)
  17192. Firing rl*prefer*rvt*predict-no*H0*2
  17193. -->
  17194. (S1 ^operator O2034 = 0.3873357699956153)
  17195. Firing rl*prefer*rvt*predict-yes*H0*1
  17196. -->
  17197. (S1 ^operator O2033 = 0.3895398299261354)
  17198. Firing prefer*rvt*predict-yes*H0
  17199. -->
  17200. Firing prefer*rvt*predict-no*H0
  17201. -->
  17202. Firing elaborate*copy-dir-to-output-link
  17203. -->
  17204. (I3 ^dir L +)
  17205. inner elaboration loop at bottom goal.
  17206. Retracting elaborate*copy-see-to-output-link
  17207. -->
  17208. (I3 ^see 0 +)
  17209. Retracting propose*predict-no
  17210. -->
  17211. (O2034 ^name predict-no +)
  17212. (S1 ^operator O2034 +)
  17213. Retracting propose*predict-yes
  17214. -->
  17215. (O2033 ^name predict-yes +)
  17216. (S1 ^operator O2033 +)
  17217. Retracting elaborate*reward*based*on*reward
  17218. -->
  17219. (R1020 ^value 1 +)
  17220. (R1 ^reward R1020 +)
  17221. Retracting elaborate*copy-dir-to-output-link
  17222. -->
  17223. (I3 ^dir L +)
  17224. Retracting rl*prefer*rvt*predict-no*H0*2
  17225. -->
  17226. (S1 ^operator O2034 = 0.3873357699956153)
  17227. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  17228. -->
  17229. (S1 ^operator O2034 = 0.6126628429603742)
  17230. Retracting rl*prefer*rvt*predict-yes*H0*1
  17231. -->
  17232. (S1 ^operator O2033 = 0.3895398299261354)
  17233. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  17234. -->
  17235. (S1 ^operator O2033 = -0.02274740735326741)
  17236. =>WM: (14272: S1 ^operator O2036 +)
  17237. =>WM: (14271: S1 ^operator O2035 +)
  17238. =>WM: (14270: O2036 ^name predict-no)
  17239. =>WM: (14269: O2035 ^name predict-yes)
  17240. =>WM: (14268: R1021 ^value 1)
  17241. =>WM: (14267: R1 ^reward R1021)
  17242. <=WM: (14258: S1 ^operator O2033 +)
  17243. <=WM: (14259: S1 ^operator O2034 +)
  17244. <=WM: (14260: S1 ^operator O2034)
  17245. <=WM: (14253: R1 ^reward R1020)
  17246. <=WM: (14256: O2034 ^name predict-no)
  17247. <=WM: (14255: O2033 ^name predict-yes)
  17248. <=WM: (14254: R1020 ^value 1)
  17249. --- Inner Elaboration Phase, active level 1 (S1) ---
  17250. Firing prefer*rvt*predict-yes*H0
  17251. -->
  17252. Firing rl*prefer*rvt*predict-yes*H0*1
  17253. -->
  17254. (S1 ^operator O2035 = 0.3895398299261354)
  17255. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  17256. -->
  17257. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  17258. -->
  17259. (S1 ^operator O2035 = 0.1599599085218832)
  17260. Firing prefer*rvt*predict-no*H0
  17261. -->
  17262. Firing rl*prefer*rvt*predict-no*H0*2
  17263. -->
  17264. (S1 ^operator O2036 = 0.3873357699956153)
  17265. Firing prefer*rvt*predict-no*H0*2*v1*H1
  17266. -->
  17267. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  17268. -->
  17269. (S1 ^operator O2036 = 0.6126667050990464)
  17270. inner elaboration loop at bottom goal.
  17271. Retracting rl*prefer*rvt*predict-no*H0*2
  17272. -->
  17273. (S1 ^operator O2034 = 0.3873357699956153)
  17274. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  17275. -->
  17276. (S1 ^operator O2034 = 0.6126667050990464)
  17277. Retracting rl*prefer*rvt*predict-yes*H0*1
  17278. -->
  17279. (S1 ^operator O2033 = 0.3895398299261354)
  17280. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  17281. -->
  17282. (S1 ^operator O2033 = 0.1599599085218832)
  17283. --- END Proposal Phase ---
  17284. --- Decision Phase ---
  17285. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.71908 -0.331744 0.387336(R,m,v=1,0.933702,0.0622468)
  17286. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  17287. =>WM: (14273: S1 ^operator O2036)
  17288. 1018: O: O2036 (predict-no)
  17289. --- END Decision Phase ---
  17290. --- Application Phase ---
  17291. --- Firing Productions (PE) For State At Depth 1 ---
  17292. --- Inner Elaboration Phase, active level 1 (S1) ---
  17293. Firing apply*operator
  17294. -->
  17295. (I3 ^predict-no N1018 + :O )
  17296. Firing apply*operator*complete
  17297. -->
  17298. (I3 ^predict-no N1017 - :O )
  17299. inner elaboration loop at bottom goal.
  17300. --- Change Working Memory (PE) ---
  17301. =>WM: (14274: I3 ^predict-no N1018)
  17302. <=WM: (14262: N1017 ^status complete)
  17303. <=WM: (14261: I3 ^predict-no N1017)
  17304. --- Firing Productions (IE) For State At Depth 1 ---
  17305. --- Inner Elaboration Phase, active level 1 (S1) ---
  17306. Firing monitor*world
  17307. -->
  17308. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17309. --- Change Working Memory (IE) ---
  17310. --- END Application Phase ---
  17311. --- Output Phase ---
  17312. ENV: Agent did: predict-no for direction L in state State-A
  17313. In State-A moving L
  17314. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17315. predict error 0
  17316. dir: dir isU
  17317. --- END Output Phase ---
  17318. |\--- Input Phase ---
  17319. =>WM: (14278: I2 ^dir U)
  17320. =>WM: (14277: I2 ^reward 1)
  17321. =>WM: (14276: I2 ^see 0)
  17322. =>WM: (14275: N1018 ^status complete)
  17323. <=WM: (14265: I2 ^dir L)
  17324. <=WM: (14264: I2 ^reward 1)
  17325. <=WM: (14263: I2 ^see 0)
  17326. =>WM: (14279: I2 ^level-1 L0-root)
  17327. <=WM: (14266: I2 ^level-1 L0-root)
  17328. --- END Input Phase ---
  17329. --- Proposal Phase ---
  17330. --- Inner Elaboration Phase, active level 1 (S1) ---
  17331. Firing elaborate*copy-see-to-output-link
  17332. -->
  17333. (I3 ^see 0 +)
  17334. Firing elaborate*reward*based*on*reward
  17335. -->
  17336. (R1022 ^value 1 +)
  17337. (R1 ^reward R1022 +)
  17338. Firing propose*predict-yes
  17339. -->
  17340. (O2037 ^name predict-yes +)
  17341. (S1 ^operator O2037 +)
  17342. Firing propose*predict-no
  17343. -->
  17344. (O2038 ^name predict-no +)
  17345. (S1 ^operator O2038 +)
  17346. Firing rl*prefer*rvt*predict-no*H0*6
  17347. -->
  17348. (S1 ^operator O2036 = 0.9999999999999999)
  17349. Firing rl*prefer*rvt*predict-yes*H0*5
  17350. -->
  17351. (S1 ^operator O2035 = 0.)
  17352. Firing prefer*rvt*predict-yes*H0
  17353. -->
  17354. Firing prefer*rvt*predict-no*H0
  17355. -->
  17356. Firing elaborate*copy-dir-to-output-link
  17357. -->
  17358. (I3 ^dir U +)
  17359. inner elaboration loop at bottom goal.
  17360. Retracting elaborate*copy-see-to-output-link
  17361. -->
  17362. (I3 ^see 0 +)
  17363. Retracting propose*predict-no
  17364. -->
  17365. (O2036 ^name predict-no +)
  17366. (S1 ^operator O2036 +)
  17367. Retracting propose*predict-yes
  17368. -->
  17369. (O2035 ^name predict-yes +)
  17370. (S1 ^operator O2035 +)
  17371. Retracting elaborate*reward*based*on*reward
  17372. -->
  17373. (R1021 ^value 1 +)
  17374. (R1 ^reward R1021 +)
  17375. Retracting elaborate*copy-dir-to-output-link
  17376. -->
  17377. (I3 ^dir L +)
  17378. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  17379. -->
  17380. (S1 ^operator O2036 = 0.6126667050990464)
  17381. Retracting rl*prefer*rvt*predict-no*H0*2
  17382. -->
  17383. (S1 ^operator O2036 = 0.3873359780522169)
  17384. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  17385. -->
  17386. (S1 ^operator O2035 = 0.1599599085218832)
  17387. Retracting rl*prefer*rvt*predict-yes*H0*1
  17388. -->
  17389. (S1 ^operator O2035 = 0.3895398299261354)
  17390. =>WM: (14286: S1 ^operator O2038 +)
  17391. =>WM: (14285: S1 ^operator O2037 +)
  17392. =>WM: (14284: I3 ^dir U)
  17393. =>WM: (14283: O2038 ^name predict-no)
  17394. =>WM: (14282: O2037 ^name predict-yes)
  17395. =>WM: (14281: R1022 ^value 1)
  17396. =>WM: (14280: R1 ^reward R1022)
  17397. <=WM: (14271: S1 ^operator O2035 +)
  17398. <=WM: (14272: S1 ^operator O2036 +)
  17399. <=WM: (14273: S1 ^operator O2036)
  17400. <=WM: (14257: I3 ^dir L)
  17401. <=WM: (14267: R1 ^reward R1021)
  17402. <=WM: (14270: O2036 ^name predict-no)
  17403. <=WM: (14269: O2035 ^name predict-yes)
  17404. <=WM: (14268: R1021 ^value 1)
  17405. --- Inner Elaboration Phase, active level 1 (S1) ---
  17406. Firing prefer*rvt*predict-yes*H0
  17407. -->
  17408. Firing rl*prefer*rvt*predict-yes*H0*5
  17409. -->
  17410. (S1 ^operator O2037 = 0.)
  17411. Firing prefer*rvt*predict-no*H0
  17412. -->
  17413. Firing rl*prefer*rvt*predict-no*H0*6
  17414. -->
  17415. (S1 ^operator O2038 = 0.9999999999999999)
  17416. inner elaboration loop at bottom goal.
  17417. Retracting rl*prefer*rvt*predict-no*H0*6
  17418. -->
  17419. (S1 ^operator O2036 = 0.9999999999999999)
  17420. Retracting rl*prefer*rvt*predict-yes*H0*5
  17421. -->
  17422. (S1 ^operator O2035 = 0.)
  17423. --- END Proposal Phase ---
  17424. --- Decision Phase ---
  17425. RL update rl*prefer*rvt*predict-no*H0*2 0.71908 -0.331744 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.934066,0.061927)
  17426. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280924 0.331742 0.612667 -> 0.280924 0.331742 0.612666(R,m,v=1,1,0)
  17427. =>WM: (14287: S1 ^operator O2038)
  17428. 1019: O: O2038 (predict-no)
  17429. --- END Decision Phase ---
  17430. --- Application Phase ---
  17431. --- Firing Productions (PE) For State At Depth 1 ---
  17432. --- Inner Elaboration Phase, active level 1 (S1) ---
  17433. Firing apply*operator
  17434. -->
  17435. (I3 ^predict-no N1019 + :O )
  17436. Firing apply*operator*complete
  17437. -->
  17438. (I3 ^predict-no N1018 - :O )
  17439. inner elaboration loop at bottom goal.
  17440. --- Change Working Memory (PE) ---
  17441. =>WM: (14288: I3 ^predict-no N1019)
  17442. <=WM: (14275: N1018 ^status complete)
  17443. <=WM: (14274: I3 ^predict-no N1018)
  17444. --- Firing Productions (IE) For State At Depth 1 ---
  17445. --- Inner Elaboration Phase, active level 1 (S1) ---
  17446. Firing monitor*world
  17447. -->
  17448. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17449. --- Change Working Memory (IE) ---
  17450. --- END Application Phase ---
  17451. --- Output Phase ---
  17452. ENV: Agent did: predict-no for direction U in state State-A
  17453. In State-A moving U
  17454. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17455. predict error 0
  17456. dir: dir isR
  17457. --- END Output Phase ---
  17458. -/|--- Input Phase ---
  17459. =>WM: (14292: I2 ^dir R)
  17460. =>WM: (14291: I2 ^reward 1)
  17461. =>WM: (14290: I2 ^see 0)
  17462. =>WM: (14289: N1019 ^status complete)
  17463. <=WM: (14278: I2 ^dir U)
  17464. <=WM: (14277: I2 ^reward 1)
  17465. <=WM: (14276: I2 ^see 0)
  17466. =>WM: (14293: I2 ^level-1 L0-root)
  17467. <=WM: (14279: I2 ^level-1 L0-root)
  17468. --- END Input Phase ---
  17469. --- Proposal Phase ---
  17470. --- Inner Elaboration Phase, active level 1 (S1) ---
  17471. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17472. -->
  17473. (S1 ^operator O2037 = 0.8155918960261216)
  17474. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17475. -->
  17476. (S1 ^operator O2038 = -0.00558448899823713)
  17477. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17478. -->
  17479. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17480. -->
  17481. Firing elaborate*copy-see-to-output-link
  17482. -->
  17483. (I3 ^see 0 +)
  17484. Firing elaborate*reward*based*on*reward
  17485. -->
  17486. (R1023 ^value 1 +)
  17487. (R1 ^reward R1023 +)
  17488. Firing propose*predict-yes
  17489. -->
  17490. (O2039 ^name predict-yes +)
  17491. (S1 ^operator O2039 +)
  17492. Firing propose*predict-no
  17493. -->
  17494. (O2040 ^name predict-no +)
  17495. (S1 ^operator O2040 +)
  17496. Firing rl*prefer*rvt*predict-no*H0*4
  17497. -->
  17498. (S1 ^operator O2038 = 0.4476193147022436)
  17499. Firing rl*prefer*rvt*predict-yes*H0*3
  17500. -->
  17501. (S1 ^operator O2037 = 0.1844112548850312)
  17502. Firing prefer*rvt*predict-yes*H0
  17503. -->
  17504. Firing prefer*rvt*predict-no*H0
  17505. -->
  17506. Firing elaborate*copy-dir-to-output-link
  17507. -->
  17508. (I3 ^dir R +)
  17509. inner elaboration loop at bottom goal.
  17510. Retracting elaborate*copy-see-to-output-link
  17511. -->
  17512. (I3 ^see 0 +)
  17513. Retracting propose*predict-no
  17514. -->
  17515. (O2038 ^name predict-no +)
  17516. (S1 ^operator O2038 +)
  17517. Retracting propose*predict-yes
  17518. -->
  17519. (O2037 ^name predict-yes +)
  17520. (S1 ^operator O2037 +)
  17521. Retracting elaborate*reward*based*on*reward
  17522. -->
  17523. (R1022 ^value 1 +)
  17524. (R1 ^reward R1022 +)
  17525. Retracting elaborate*copy-dir-to-output-link
  17526. -->
  17527. (I3 ^dir U +)
  17528. Retracting rl*prefer*rvt*predict-no*H0*6
  17529. -->
  17530. (S1 ^operator O2038 = 0.9999999999999999)
  17531. Retracting rl*prefer*rvt*predict-yes*H0*5
  17532. -->
  17533. (S1 ^operator O2037 = 0.)
  17534. =>WM: (14300: S1 ^operator O2040 +)
  17535. =>WM: (14299: S1 ^operator O2039 +)
  17536. =>WM: (14298: I3 ^dir R)
  17537. =>WM: (14297: O2040 ^name predict-no)
  17538. =>WM: (14296: O2039 ^name predict-yes)
  17539. =>WM: (14295: R1023 ^value 1)
  17540. =>WM: (14294: R1 ^reward R1023)
  17541. <=WM: (14285: S1 ^operator O2037 +)
  17542. <=WM: (14286: S1 ^operator O2038 +)
  17543. <=WM: (14287: S1 ^operator O2038)
  17544. <=WM: (14284: I3 ^dir U)
  17545. <=WM: (14280: R1 ^reward R1022)
  17546. <=WM: (14283: O2038 ^name predict-no)
  17547. <=WM: (14282: O2037 ^name predict-yes)
  17548. <=WM: (14281: R1022 ^value 1)
  17549. --- Inner Elaboration Phase, active level 1 (S1) ---
  17550. Firing prefer*rvt*predict-yes*H0
  17551. -->
  17552. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17553. -->
  17554. (S1 ^operator O2039 = 0.8155918960261216)
  17555. Firing rl*prefer*rvt*predict-yes*H0*3
  17556. -->
  17557. (S1 ^operator O2039 = 0.1844112548850312)
  17558. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17559. -->
  17560. Firing prefer*rvt*predict-no*H0
  17561. -->
  17562. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17563. -->
  17564. (S1 ^operator O2040 = -0.00558448899823713)
  17565. Firing rl*prefer*rvt*predict-no*H0*4
  17566. -->
  17567. (S1 ^operator O2040 = 0.4476193147022436)
  17568. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17569. -->
  17570. inner elaboration loop at bottom goal.
  17571. Retracting rl*prefer*rvt*predict-no*H0*4
  17572. -->
  17573. (S1 ^operator O2038 = 0.4476193147022436)
  17574. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17575. -->
  17576. (S1 ^operator O2038 = -0.00558448899823713)
  17577. Retracting rl*prefer*rvt*predict-yes*H0*3
  17578. -->
  17579. (S1 ^operator O2037 = 0.1844112548850312)
  17580. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17581. -->
  17582. (S1 ^operator O2037 = 0.8155918960261216)
  17583. --- END Proposal Phase ---
  17584. --- Decision Phase ---
  17585. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17586. =>WM: (14301: S1 ^operator O2039)
  17587. 1020: O: O2039 (predict-yes)
  17588. --- END Decision Phase ---
  17589. --- Application Phase ---
  17590. --- Firing Productions (PE) For State At Depth 1 ---
  17591. --- Inner Elaboration Phase, active level 1 (S1) ---
  17592. Firing apply*operator
  17593. -->
  17594. (I3 ^predict-yes N1020 + :O )
  17595. Firing apply*operator*complete
  17596. -->
  17597. (I3 ^predict-no N1019 - :O )
  17598. inner elaboration loop at bottom goal.
  17599. --- Change Working Memory (PE) ---
  17600. =>WM: (14302: I3 ^predict-yes N1020)
  17601. <=WM: (14289: N1019 ^status complete)
  17602. <=WM: (14288: I3 ^predict-no N1019)
  17603. --- Firing Productions (IE) For State At Depth 1 ---
  17604. --- Inner Elaboration Phase, active level 1 (S1) ---
  17605. Firing monitor*world
  17606. -->
  17607. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17608. --- Change Working Memory (IE) ---
  17609. --- END Application Phase ---
  17610. --- Output Phase ---
  17611. ENV: Agent did: predict-yes for direction R in state State-A
  17612. In State-A moving R
  17613. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17614. predict error 0
  17615. dir: dir isR
  17616. --- END Output Phase ---
  17617. \---- Input Phase ---
  17618. =>WM: (14306: I2 ^dir R)
  17619. =>WM: (14305: I2 ^reward 1)
  17620. =>WM: (14304: I2 ^see 1)
  17621. =>WM: (14303: N1020 ^status complete)
  17622. <=WM: (14292: I2 ^dir R)
  17623. <=WM: (14291: I2 ^reward 1)
  17624. <=WM: (14290: I2 ^see 0)
  17625. =>WM: (14307: I2 ^level-1 R1-root)
  17626. <=WM: (14293: I2 ^level-1 L0-root)
  17627. --- END Input Phase ---
  17628. --- Proposal Phase ---
  17629. --- Inner Elaboration Phase, active level 1 (S1) ---
  17630. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17631. -->
  17632. (S1 ^operator O2039 = 0.1398795999120246)
  17633. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17634. -->
  17635. (S1 ^operator O2040 = 0.5523818179838019)
  17636. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17637. -->
  17638. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17639. -->
  17640. Firing elaborate*copy-see-to-output-link
  17641. -->
  17642. (I3 ^see 1 +)
  17643. Firing elaborate*reward*based*on*reward
  17644. -->
  17645. (R1024 ^value 1 +)
  17646. (R1 ^reward R1024 +)
  17647. Firing propose*predict-yes
  17648. -->
  17649. (O2041 ^name predict-yes +)
  17650. (S1 ^operator O2041 +)
  17651. Firing propose*predict-no
  17652. -->
  17653. (O2042 ^name predict-no +)
  17654. (S1 ^operator O2042 +)
  17655. Firing rl*prefer*rvt*predict-no*H0*4
  17656. -->
  17657. (S1 ^operator O2040 = 0.4476193147022436)
  17658. Firing rl*prefer*rvt*predict-yes*H0*3
  17659. -->
  17660. (S1 ^operator O2039 = 0.1844112548850312)
  17661. Firing prefer*rvt*predict-yes*H0
  17662. -->
  17663. Firing prefer*rvt*predict-no*H0
  17664. -->
  17665. Firing elaborate*copy-dir-to-output-link
  17666. -->
  17667. (I3 ^dir R +)
  17668. inner elaboration loop at bottom goal.
  17669. Retracting elaborate*copy-see-to-output-link
  17670. -->
  17671. (I3 ^see 0 +)
  17672. Retracting propose*predict-no
  17673. -->
  17674. (O2040 ^name predict-no +)
  17675. (S1 ^operator O2040 +)
  17676. Retracting propose*predict-yes
  17677. -->
  17678. (O2039 ^name predict-yes +)
  17679. (S1 ^operator O2039 +)
  17680. Retracting elaborate*reward*based*on*reward
  17681. -->
  17682. (R1023 ^value 1 +)
  17683. (R1 ^reward R1023 +)
  17684. Retracting elaborate*copy-dir-to-output-link
  17685. -->
  17686. (I3 ^dir R +)
  17687. Retracting rl*prefer*rvt*predict-no*H0*4
  17688. -->
  17689. (S1 ^operator O2040 = 0.4476193147022436)
  17690. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17691. -->
  17692. (S1 ^operator O2040 = -0.00558448899823713)
  17693. Retracting rl*prefer*rvt*predict-yes*H0*3
  17694. -->
  17695. (S1 ^operator O2039 = 0.1844112548850312)
  17696. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17697. -->
  17698. (S1 ^operator O2039 = 0.8155918960261216)
  17699. =>WM: (14314: S1 ^operator O2042 +)
  17700. =>WM: (14313: S1 ^operator O2041 +)
  17701. =>WM: (14312: O2042 ^name predict-no)
  17702. =>WM: (14311: O2041 ^name predict-yes)
  17703. =>WM: (14310: R1024 ^value 1)
  17704. =>WM: (14309: R1 ^reward R1024)
  17705. =>WM: (14308: I3 ^see 1)
  17706. <=WM: (14299: S1 ^operator O2039 +)
  17707. <=WM: (14301: S1 ^operator O2039)
  17708. <=WM: (14300: S1 ^operator O2040 +)
  17709. <=WM: (14294: R1 ^reward R1023)
  17710. <=WM: (14239: I3 ^see 0)
  17711. <=WM: (14297: O2040 ^name predict-no)
  17712. <=WM: (14296: O2039 ^name predict-yes)
  17713. <=WM: (14295: R1023 ^value 1)
  17714. --- Inner Elaboration Phase, active level 1 (S1) ---
  17715. Firing prefer*rvt*predict-yes*H0
  17716. -->
  17717. Firing rl*prefer*rvt*predict-yes*H0*3
  17718. -->
  17719. (S1 ^operator O2041 = 0.1844112548850312)
  17720. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17721. -->
  17722. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17723. -->
  17724. (S1 ^operator O2041 = 0.1398795999120246)
  17725. Firing prefer*rvt*predict-no*H0
  17726. -->
  17727. Firing rl*prefer*rvt*predict-no*H0*4
  17728. -->
  17729. (S1 ^operator O2042 = 0.4476193147022436)
  17730. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17731. -->
  17732. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17733. -->
  17734. (S1 ^operator O2042 = 0.5523818179838019)
  17735. inner elaboration loop at bottom goal.
  17736. Retracting rl*prefer*rvt*predict-no*H0*4
  17737. -->
  17738. (S1 ^operator O2040 = 0.4476193147022436)
  17739. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17740. -->
  17741. (S1 ^operator O2040 = 0.5523818179838019)
  17742. Retracting rl*prefer*rvt*predict-yes*H0*3
  17743. -->
  17744. (S1 ^operator O2039 = 0.1844112548850312)
  17745. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17746. -->
  17747. (S1 ^operator O2039 = 0.1398795999120246)
  17748. --- END Proposal Phase ---
  17749. --- Decision Phase ---
  17750. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675413 -0.491002 0.184411(R,m,v=1,0.902299,0.0886652)
  17751. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.32459 0.491002 0.815592 -> 0.32459 0.491002 0.815591(R,m,v=1,1,0)
  17752. =>WM: (14315: S1 ^operator O2042)
  17753. 1021: O: O2042 (predict-no)
  17754. --- END Decision Phase ---
  17755. --- Application Phase ---
  17756. --- Firing Productions (PE) For State At Depth 1 ---
  17757. --- Inner Elaboration Phase, active level 1 (S1) ---
  17758. Firing apply*operator
  17759. -->
  17760. (I3 ^predict-no N1021 + :O )
  17761. Firing apply*operator*complete
  17762. -->
  17763. (I3 ^predict-yes N1020 - :O )
  17764. inner elaboration loop at bottom goal.
  17765. --- Change Working Memory (PE) ---
  17766. =>WM: (14316: I3 ^predict-no N1021)
  17767. <=WM: (14303: N1020 ^status complete)
  17768. <=WM: (14302: I3 ^predict-yes N1020)
  17769. --- Firing Productions (IE) For State At Depth 1 ---
  17770. --- Inner Elaboration Phase, active level 1 (S1) ---
  17771. Firing monitor*world
  17772. -->
  17773. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17774. --- Change Working Memory (IE) ---
  17775. --- END Application Phase ---
  17776. --- Output Phase ---
  17777. ENV: Agent did: predict-no for direction R in state State-B
  17778. In State-B moving R
  17779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17780. predict error 0
  17781. dir: dir isR
  17782. --- END Output Phase ---
  17783. /--- Input Phase ---
  17784. =>WM: (14320: I2 ^dir R)
  17785. =>WM: (14319: I2 ^reward 1)
  17786. =>WM: (14318: I2 ^see 0)
  17787. =>WM: (14317: N1021 ^status complete)
  17788. <=WM: (14306: I2 ^dir R)
  17789. <=WM: (14305: I2 ^reward 1)
  17790. <=WM: (14304: I2 ^see 1)
  17791. =>WM: (14321: I2 ^level-1 R0-root)
  17792. <=WM: (14307: I2 ^level-1 R1-root)
  17793. --- END Input Phase ---
  17794. --- Proposal Phase ---
  17795. --- Inner Elaboration Phase, active level 1 (S1) ---
  17796. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  17797. -->
  17798. (S1 ^operator O2041 = 0.1664311307472832)
  17799. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  17800. -->
  17801. (S1 ^operator O2042 = 0.5523793263612301)
  17802. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17803. -->
  17804. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17805. -->
  17806. Firing elaborate*copy-see-to-output-link
  17807. -->
  17808. (I3 ^see 0 +)
  17809. Firing elaborate*reward*based*on*reward
  17810. -->
  17811. (R1025 ^value 1 +)
  17812. (R1 ^reward R1025 +)
  17813. Firing propose*predict-yes
  17814. -->
  17815. (O2043 ^name predict-yes +)
  17816. (S1 ^operator O2043 +)
  17817. Firing propose*predict-no
  17818. -->
  17819. (O2044 ^name predict-no +)
  17820. (S1 ^operator O2044 +)
  17821. Firing rl*prefer*rvt*predict-no*H0*4
  17822. -->
  17823. (S1 ^operator O2042 = 0.4476193147022436)
  17824. Firing rl*prefer*rvt*predict-yes*H0*3
  17825. -->
  17826. (S1 ^operator O2041 = 0.1844107822483583)
  17827. Firing prefer*rvt*predict-yes*H0
  17828. -->
  17829. Firing prefer*rvt*predict-no*H0
  17830. -->
  17831. Firing elaborate*copy-dir-to-output-link
  17832. -->
  17833. (I3 ^dir R +)
  17834. inner elaboration loop at bottom goal.
  17835. Retracting elaborate*copy-see-to-output-link
  17836. -->
  17837. (I3 ^see 1 +)
  17838. Retracting propose*predict-no
  17839. -->
  17840. (O2042 ^name predict-no +)
  17841. (S1 ^operator O2042 +)
  17842. Retracting propose*predict-yes
  17843. -->
  17844. (O2041 ^name predict-yes +)
  17845. (S1 ^operator O2041 +)
  17846. Retracting elaborate*reward*based*on*reward
  17847. -->
  17848. (R1024 ^value 1 +)
  17849. (R1 ^reward R1024 +)
  17850. Retracting elaborate*copy-dir-to-output-link
  17851. -->
  17852. (I3 ^dir R +)
  17853. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17854. -->
  17855. (S1 ^operator O2042 = 0.5523818179838019)
  17856. Retracting rl*prefer*rvt*predict-no*H0*4
  17857. -->
  17858. (S1 ^operator O2042 = 0.4476193147022436)
  17859. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17860. -->
  17861. (S1 ^operator O2041 = 0.1398795999120246)
  17862. Retracting rl*prefer*rvt*predict-yes*H0*3
  17863. -->
  17864. (S1 ^operator O2041 = 0.1844107822483583)
  17865. =>WM: (14328: S1 ^operator O2044 +)
  17866. =>WM: (14327: S1 ^operator O2043 +)
  17867. =>WM: (14326: O2044 ^name predict-no)
  17868. =>WM: (14325: O2043 ^name predict-yes)
  17869. =>WM: (14324: R1025 ^value 1)
  17870. =>WM: (14323: R1 ^reward R1025)
  17871. =>WM: (14322: I3 ^see 0)
  17872. <=WM: (14313: S1 ^operator O2041 +)
  17873. <=WM: (14314: S1 ^operator O2042 +)
  17874. <=WM: (14315: S1 ^operator O2042)
  17875. <=WM: (14309: R1 ^reward R1024)
  17876. <=WM: (14308: I3 ^see 1)
  17877. <=WM: (14312: O2042 ^name predict-no)
  17878. <=WM: (14311: O2041 ^name predict-yes)
  17879. <=WM: (14310: R1024 ^value 1)
  17880. --- Inner Elaboration Phase, active level 1 (S1) ---
  17881. Firing prefer*rvt*predict-yes*H0
  17882. -->
  17883. Firing rl*prefer*rvt*predict-yes*H0*3
  17884. -->
  17885. (S1 ^operator O2043 = 0.1844107822483583)
  17886. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17887. -->
  17888. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  17889. -->
  17890. (S1 ^operator O2043 = 0.1664311307472832)
  17891. Firing prefer*rvt*predict-no*H0
  17892. -->
  17893. Firing rl*prefer*rvt*predict-no*H0*4
  17894. -->
  17895. (S1 ^operator O2044 = 0.4476193147022436)
  17896. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17897. -->
  17898. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  17899. -->
  17900. (S1 ^operator O2044 = 0.5523793263612301)
  17901. inner elaboration loop at bottom goal.
  17902. Retracting rl*prefer*rvt*predict-no*H0*4
  17903. -->
  17904. (S1 ^operator O2042 = 0.4476193147022436)
  17905. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  17906. -->
  17907. (S1 ^operator O2042 = 0.5523793263612301)
  17908. Retracting rl*prefer*rvt*predict-yes*H0*3
  17909. -->
  17910. (S1 ^operator O2041 = 0.1844107822483583)
  17911. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  17912. -->
  17913. (S1 ^operator O2041 = 0.1664311307472832)
  17914. --- END Proposal Phase ---
  17915. --- Decision Phase ---
  17916. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.930769,0.0649374)
  17917. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552382(R,m,v=1,1,0)
  17918. =>WM: (14329: S1 ^operator O2044)
  17919. 1022: O: O2044 (predict-no)
  17920. --- END Decision Phase ---
  17921. --- Application Phase ---
  17922. --- Firing Productions (PE) For State At Depth 1 ---
  17923. --- Inner Elaboration Phase, active level 1 (S1) ---
  17924. Firing apply*operator
  17925. -->
  17926. (I3 ^predict-no N1022 + :O )
  17927. Firing apply*operator*complete
  17928. -->
  17929. (I3 ^predict-no N1021 - :O )
  17930. inner elaboration loop at bottom goal.
  17931. --- Change Working Memory (PE) ---
  17932. =>WM: (14330: I3 ^predict-no N1022)
  17933. <=WM: (14317: N1021 ^status complete)
  17934. <=WM: (14316: I3 ^predict-no N1021)
  17935. --- Firing Productions (IE) For State At Depth 1 ---
  17936. --- Inner Elaboration Phase, active level 1 (S1) ---
  17937. Firing monitor*world
  17938. -->
  17939. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17940. --- Change Working Memory (IE) ---
  17941. --- END Application Phase ---
  17942. --- Output Phase ---
  17943. ENV: Agent did: predict-no for direction R in state State-B
  17944. In State-B moving R
  17945. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17946. predict error 0
  17947. dir: dir isR
  17948. --- END Output Phase ---
  17949. |\-/--- Input Phase ---
  17950. =>WM: (14334: I2 ^dir R)
  17951. =>WM: (14333: I2 ^reward 1)
  17952. =>WM: (14332: I2 ^see 0)
  17953. =>WM: (14331: N1022 ^status complete)
  17954. <=WM: (14320: I2 ^dir R)
  17955. <=WM: (14319: I2 ^reward 1)
  17956. <=WM: (14318: I2 ^see 0)
  17957. =>WM: (14335: I2 ^level-1 R0-root)
  17958. <=WM: (14321: I2 ^level-1 R0-root)
  17959. --- END Input Phase ---
  17960. --- Proposal Phase ---
  17961. --- Inner Elaboration Phase, active level 1 (S1) ---
  17962. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  17963. -->
  17964. (S1 ^operator O2043 = 0.1664311307472832)
  17965. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  17966. -->
  17967. (S1 ^operator O2044 = 0.5523793263612301)
  17968. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17969. -->
  17970. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17971. -->
  17972. Firing elaborate*copy-see-to-output-link
  17973. -->
  17974. (I3 ^see 0 +)
  17975. Firing elaborate*reward*based*on*reward
  17976. -->
  17977. (R1026 ^value 1 +)
  17978. (R1 ^reward R1026 +)
  17979. Firing propose*predict-yes
  17980. -->
  17981. (O2045 ^name predict-yes +)
  17982. (S1 ^operator O2045 +)
  17983. Firing propose*predict-no
  17984. -->
  17985. (O2046 ^name predict-no +)
  17986. (S1 ^operator O2046 +)
  17987. Firing rl*prefer*rvt*predict-no*H0*4
  17988. -->
  17989. (S1 ^operator O2044 = 0.4476191447993367)
  17990. Firing rl*prefer*rvt*predict-yes*H0*3
  17991. -->
  17992. (S1 ^operator O2043 = 0.1844107822483583)
  17993. Firing prefer*rvt*predict-yes*H0
  17994. -->
  17995. Firing prefer*rvt*predict-no*H0
  17996. -->
  17997. Firing elaborate*copy-dir-to-output-link
  17998. -->
  17999. (I3 ^dir R +)
  18000. inner elaboration loop at bottom goal.
  18001. Retracting elaborate*copy-see-to-output-link
  18002. -->
  18003. (I3 ^see 0 +)
  18004. Retracting propose*predict-no
  18005. -->
  18006. (O2044 ^name predict-no +)
  18007. (S1 ^operator O2044 +)
  18008. Retracting propose*predict-yes
  18009. -->
  18010. (O2043 ^name predict-yes +)
  18011. (S1 ^operator O2043 +)
  18012. Retracting elaborate*reward*based*on*reward
  18013. -->
  18014. (R1025 ^value 1 +)
  18015. (R1 ^reward R1025 +)
  18016. Retracting elaborate*copy-dir-to-output-link
  18017. -->
  18018. (I3 ^dir R +)
  18019. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18020. -->
  18021. (S1 ^operator O2044 = 0.5523793263612301)
  18022. Retracting rl*prefer*rvt*predict-no*H0*4
  18023. -->
  18024. (S1 ^operator O2044 = 0.4476191447993367)
  18025. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18026. -->
  18027. (S1 ^operator O2043 = 0.1664311307472832)
  18028. Retracting rl*prefer*rvt*predict-yes*H0*3
  18029. -->
  18030. (S1 ^operator O2043 = 0.1844107822483583)
  18031. =>WM: (14341: S1 ^operator O2046 +)
  18032. =>WM: (14340: S1 ^operator O2045 +)
  18033. =>WM: (14339: O2046 ^name predict-no)
  18034. =>WM: (14338: O2045 ^name predict-yes)
  18035. =>WM: (14337: R1026 ^value 1)
  18036. =>WM: (14336: R1 ^reward R1026)
  18037. <=WM: (14327: S1 ^operator O2043 +)
  18038. <=WM: (14328: S1 ^operator O2044 +)
  18039. <=WM: (14329: S1 ^operator O2044)
  18040. <=WM: (14323: R1 ^reward R1025)
  18041. <=WM: (14326: O2044 ^name predict-no)
  18042. <=WM: (14325: O2043 ^name predict-yes)
  18043. <=WM: (14324: R1025 ^value 1)
  18044. --- Inner Elaboration Phase, active level 1 (S1) ---
  18045. Firing prefer*rvt*predict-yes*H0
  18046. -->
  18047. Firing rl*prefer*rvt*predict-yes*H0*3
  18048. -->
  18049. (S1 ^operator O2045 = 0.1844107822483583)
  18050. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18051. -->
  18052. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18053. -->
  18054. (S1 ^operator O2045 = 0.1664311307472832)
  18055. Firing prefer*rvt*predict-no*H0
  18056. -->
  18057. Firing rl*prefer*rvt*predict-no*H0*4
  18058. -->
  18059. (S1 ^operator O2046 = 0.4476191447993367)
  18060. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18061. -->
  18062. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18063. -->
  18064. (S1 ^operator O2046 = 0.5523793263612301)
  18065. inner elaboration loop at bottom goal.
  18066. Retracting rl*prefer*rvt*predict-no*H0*4
  18067. -->
  18068. (S1 ^operator O2044 = 0.4476191447993367)
  18069. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18070. -->
  18071. (S1 ^operator O2044 = 0.5523793263612301)
  18072. Retracting rl*prefer*rvt*predict-yes*H0*3
  18073. -->
  18074. (S1 ^operator O2043 = 0.1844107822483583)
  18075. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18076. -->
  18077. (S1 ^operator O2043 = 0.1664311307472832)
  18078. --- END Proposal Phase ---
  18079. --- Decision Phase ---
  18080. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.931298,0.0644745)
  18081. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.552379 -> 0.377466 0.174913 0.55238(R,m,v=1,1,0)
  18082. =>WM: (14342: S1 ^operator O2046)
  18083. 1023: O: O2046 (predict-no)
  18084. --- END Decision Phase ---
  18085. --- Application Phase ---
  18086. --- Firing Productions (PE) For State At Depth 1 ---
  18087. --- Inner Elaboration Phase, active level 1 (S1) ---
  18088. Firing apply*operator
  18089. -->
  18090. (I3 ^predict-no N1023 + :O )
  18091. Firing apply*operator*complete
  18092. -->
  18093. (I3 ^predict-no N1022 - :O )
  18094. inner elaboration loop at bottom goal.
  18095. --- Change Working Memory (PE) ---
  18096. =>WM: (14343: I3 ^predict-no N1023)
  18097. <=WM: (14331: N1022 ^status complete)
  18098. <=WM: (14330: I3 ^predict-no N1022)
  18099. --- Firing Productions (IE) For State At Depth 1 ---
  18100. --- Inner Elaboration Phase, active level 1 (S1) ---
  18101. Firing monitor*world
  18102. -->
  18103. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18104. --- Change Working Memory (IE) ---
  18105. --- END Application Phase ---
  18106. --- Output Phase ---
  18107. ENV: Agent did: predict-no for direction R in state State-B
  18108. In State-B moving R
  18109. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18110. predict error 0
  18111. dir: dir isR
  18112. --- END Output Phase ---
  18113. |\---- Input Phase ---
  18114. =>WM: (14347: I2 ^dir R)
  18115. =>WM: (14346: I2 ^reward 1)
  18116. =>WM: (14345: I2 ^see 0)
  18117. =>WM: (14344: N1023 ^status complete)
  18118. <=WM: (14334: I2 ^dir R)
  18119. <=WM: (14333: I2 ^reward 1)
  18120. <=WM: (14332: I2 ^see 0)
  18121. =>WM: (14348: I2 ^level-1 R0-root)
  18122. <=WM: (14335: I2 ^level-1 R0-root)
  18123. --- END Input Phase ---
  18124. --- Proposal Phase ---
  18125. --- Inner Elaboration Phase, active level 1 (S1) ---
  18126. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18127. -->
  18128. (S1 ^operator O2045 = 0.1664311307472832)
  18129. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18130. -->
  18131. (S1 ^operator O2046 = 0.552379555687145)
  18132. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18133. -->
  18134. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18135. -->
  18136. Firing elaborate*copy-see-to-output-link
  18137. -->
  18138. (I3 ^see 0 +)
  18139. Firing elaborate*reward*based*on*reward
  18140. -->
  18141. (R1027 ^value 1 +)
  18142. (R1 ^reward R1027 +)
  18143. Firing propose*predict-yes
  18144. -->
  18145. (O2047 ^name predict-yes +)
  18146. (S1 ^operator O2047 +)
  18147. Firing propose*predict-no
  18148. -->
  18149. (O2048 ^name predict-no +)
  18150. (S1 ^operator O2048 +)
  18151. Firing rl*prefer*rvt*predict-no*H0*4
  18152. -->
  18153. (S1 ^operator O2046 = 0.4476193741252518)
  18154. Firing rl*prefer*rvt*predict-yes*H0*3
  18155. -->
  18156. (S1 ^operator O2045 = 0.1844107822483583)
  18157. Firing prefer*rvt*predict-yes*H0
  18158. -->
  18159. Firing prefer*rvt*predict-no*H0
  18160. -->
  18161. Firing elaborate*copy-dir-to-output-link
  18162. -->
  18163. (I3 ^dir R +)
  18164. inner elaboration loop at bottom goal.
  18165. Retracting elaborate*copy-see-to-output-link
  18166. -->
  18167. (I3 ^see 0 +)
  18168. Retracting propose*predict-no
  18169. -->
  18170. (O2046 ^name predict-no +)
  18171. (S1 ^operator O2046 +)
  18172. Retracting propose*predict-yes
  18173. -->
  18174. (O2045 ^name predict-yes +)
  18175. (S1 ^operator O2045 +)
  18176. Retracting elaborate*reward*based*on*reward
  18177. -->
  18178. (R1026 ^value 1 +)
  18179. (R1 ^reward R1026 +)
  18180. Retracting elaborate*copy-dir-to-output-link
  18181. -->
  18182. (I3 ^dir R +)
  18183. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18184. -->
  18185. (S1 ^operator O2046 = 0.552379555687145)
  18186. Retracting rl*prefer*rvt*predict-no*H0*4
  18187. -->
  18188. (S1 ^operator O2046 = 0.4476193741252518)
  18189. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18190. -->
  18191. (S1 ^operator O2045 = 0.1664311307472832)
  18192. Retracting rl*prefer*rvt*predict-yes*H0*3
  18193. -->
  18194. (S1 ^operator O2045 = 0.1844107822483583)
  18195. =>WM: (14354: S1 ^operator O2048 +)
  18196. =>WM: (14353: S1 ^operator O2047 +)
  18197. =>WM: (14352: O2048 ^name predict-no)
  18198. =>WM: (14351: O2047 ^name predict-yes)
  18199. =>WM: (14350: R1027 ^value 1)
  18200. =>WM: (14349: R1 ^reward R1027)
  18201. <=WM: (14340: S1 ^operator O2045 +)
  18202. <=WM: (14341: S1 ^operator O2046 +)
  18203. <=WM: (14342: S1 ^operator O2046)
  18204. <=WM: (14336: R1 ^reward R1026)
  18205. <=WM: (14339: O2046 ^name predict-no)
  18206. <=WM: (14338: O2045 ^name predict-yes)
  18207. <=WM: (14337: R1026 ^value 1)
  18208. --- Inner Elaboration Phase, active level 1 (S1) ---
  18209. Firing prefer*rvt*predict-yes*H0
  18210. -->
  18211. Firing rl*prefer*rvt*predict-yes*H0*3
  18212. -->
  18213. (S1 ^operator O2047 = 0.1844107822483583)
  18214. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18215. -->
  18216. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18217. -->
  18218. (S1 ^operator O2047 = 0.1664311307472832)
  18219. Firing prefer*rvt*predict-no*H0
  18220. -->
  18221. Firing rl*prefer*rvt*predict-no*H0*4
  18222. -->
  18223. (S1 ^operator O2048 = 0.4476193741252518)
  18224. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18225. -->
  18226. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18227. -->
  18228. (S1 ^operator O2048 = 0.552379555687145)
  18229. inner elaboration loop at bottom goal.
  18230. Retracting rl*prefer*rvt*predict-no*H0*4
  18231. -->
  18232. (S1 ^operator O2046 = 0.4476193741252518)
  18233. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18234. -->
  18235. (S1 ^operator O2046 = 0.552379555687145)
  18236. Retracting rl*prefer*rvt*predict-yes*H0*3
  18237. -->
  18238. (S1 ^operator O2045 = 0.1844107822483583)
  18239. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18240. -->
  18241. (S1 ^operator O2045 = 0.1664311307472832)
  18242. --- END Proposal Phase ---
  18243. --- Decision Phase ---
  18244. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.44762(R,m,v=1,0.931818,0.064018)
  18245. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.55238 -> 0.377466 0.174913 0.55238(R,m,v=1,1,0)
  18246. =>WM: (14355: S1 ^operator O2048)
  18247. 1024: O: O2048 (predict-no)
  18248. --- END Decision Phase ---
  18249. --- Application Phase ---
  18250. --- Firing Productions (PE) For State At Depth 1 ---
  18251. --- Inner Elaboration Phase, active level 1 (S1) ---
  18252. Firing apply*operator
  18253. -->
  18254. (I3 ^predict-no N1024 + :O )
  18255. Firing apply*operator*complete
  18256. -->
  18257. (I3 ^predict-no N1023 - :O )
  18258. inner elaboration loop at bottom goal.
  18259. --- Change Working Memory (PE) ---
  18260. =>WM: (14356: I3 ^predict-no N1024)
  18261. <=WM: (14344: N1023 ^status complete)
  18262. <=WM: (14343: I3 ^predict-no N1023)
  18263. --- Firing Productions (IE) For State At Depth 1 ---
  18264. --- Inner Elaboration Phase, active level 1 (S1) ---
  18265. Firing monitor*world
  18266. -->
  18267. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18268. --- Change Working Memory (IE) ---
  18269. --- END Application Phase ---
  18270. --- Output Phase ---
  18271. ENV: Agent did: predict-no for direction R in state State-B
  18272. In State-B moving R
  18273. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18274. predict error 0
  18275. dir: dir isR
  18276. --- END Output Phase ---
  18277. /|\---- Input Phase ---
  18278. =>WM: (14360: I2 ^dir R)
  18279. =>WM: (14359: I2 ^reward 1)
  18280. =>WM: (14358: I2 ^see 0)
  18281. =>WM: (14357: N1024 ^status complete)
  18282. <=WM: (14347: I2 ^dir R)
  18283. <=WM: (14346: I2 ^reward 1)
  18284. <=WM: (14345: I2 ^see 0)
  18285. =>WM: (14361: I2 ^level-1 R0-root)
  18286. <=WM: (14348: I2 ^level-1 R0-root)
  18287. --- END Input Phase ---
  18288. --- Proposal Phase ---
  18289. --- Inner Elaboration Phase, active level 1 (S1) ---
  18290. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18291. -->
  18292. (S1 ^operator O2047 = 0.1664311307472832)
  18293. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18294. -->
  18295. (S1 ^operator O2048 = 0.5523797162152855)
  18296. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18297. -->
  18298. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18299. -->
  18300. Firing elaborate*copy-see-to-output-link
  18301. -->
  18302. (I3 ^see 0 +)
  18303. Firing elaborate*reward*based*on*reward
  18304. -->
  18305. (R1028 ^value 1 +)
  18306. (R1 ^reward R1028 +)
  18307. Firing propose*predict-yes
  18308. -->
  18309. (O2049 ^name predict-yes +)
  18310. (S1 ^operator O2049 +)
  18311. Firing propose*predict-no
  18312. -->
  18313. (O2050 ^name predict-no +)
  18314. (S1 ^operator O2050 +)
  18315. Firing rl*prefer*rvt*predict-no*H0*4
  18316. -->
  18317. (S1 ^operator O2048 = 0.4476195346533923)
  18318. Firing rl*prefer*rvt*predict-yes*H0*3
  18319. -->
  18320. (S1 ^operator O2047 = 0.1844107822483583)
  18321. Firing prefer*rvt*predict-yes*H0
  18322. -->
  18323. Firing prefer*rvt*predict-no*H0
  18324. -->
  18325. Firing elaborate*copy-dir-to-output-link
  18326. -->
  18327. (I3 ^dir R +)
  18328. inner elaboration loop at bottom goal.
  18329. Retracting elaborate*copy-see-to-output-link
  18330. -->
  18331. (I3 ^see 0 +)
  18332. Retracting propose*predict-no
  18333. -->
  18334. (O2048 ^name predict-no +)
  18335. (S1 ^operator O2048 +)
  18336. Retracting propose*predict-yes
  18337. -->
  18338. (O2047 ^name predict-yes +)
  18339. (S1 ^operator O2047 +)
  18340. Retracting elaborate*reward*based*on*reward
  18341. -->
  18342. (R1027 ^value 1 +)
  18343. (R1 ^reward R1027 +)
  18344. Retracting elaborate*copy-dir-to-output-link
  18345. -->
  18346. (I3 ^dir R +)
  18347. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18348. -->
  18349. (S1 ^operator O2048 = 0.5523797162152855)
  18350. Retracting rl*prefer*rvt*predict-no*H0*4
  18351. -->
  18352. (S1 ^operator O2048 = 0.4476195346533923)
  18353. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18354. -->
  18355. (S1 ^operator O2047 = 0.1664311307472832)
  18356. Retracting rl*prefer*rvt*predict-yes*H0*3
  18357. -->
  18358. (S1 ^operator O2047 = 0.1844107822483583)
  18359. =>WM: (14367: S1 ^operator O2050 +)
  18360. =>WM: (14366: S1 ^operator O2049 +)
  18361. =>WM: (14365: O2050 ^name predict-no)
  18362. =>WM: (14364: O2049 ^name predict-yes)
  18363. =>WM: (14363: R1028 ^value 1)
  18364. =>WM: (14362: R1 ^reward R1028)
  18365. <=WM: (14353: S1 ^operator O2047 +)
  18366. <=WM: (14354: S1 ^operator O2048 +)
  18367. <=WM: (14355: S1 ^operator O2048)
  18368. <=WM: (14349: R1 ^reward R1027)
  18369. <=WM: (14352: O2048 ^name predict-no)
  18370. <=WM: (14351: O2047 ^name predict-yes)
  18371. <=WM: (14350: R1027 ^value 1)
  18372. --- Inner Elaboration Phase, active level 1 (S1) ---
  18373. Firing prefer*rvt*predict-yes*H0
  18374. -->
  18375. Firing rl*prefer*rvt*predict-yes*H0*3
  18376. -->
  18377. (S1 ^operator O2049 = 0.1844107822483583)
  18378. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18379. -->
  18380. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18381. -->
  18382. (S1 ^operator O2049 = 0.1664311307472832)
  18383. Firing prefer*rvt*predict-no*H0
  18384. -->
  18385. Firing rl*prefer*rvt*predict-no*H0*4
  18386. -->
  18387. (S1 ^operator O2050 = 0.4476195346533923)
  18388. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18389. -->
  18390. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18391. -->
  18392. (S1 ^operator O2050 = 0.5523797162152855)
  18393. inner elaboration loop at bottom goal.
  18394. Retracting rl*prefer*rvt*predict-no*H0*4
  18395. -->
  18396. (S1 ^operator O2048 = 0.4476195346533923)
  18397. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18398. -->
  18399. (S1 ^operator O2048 = 0.5523797162152855)
  18400. Retracting rl*prefer*rvt*predict-yes*H0*3
  18401. -->
  18402. (S1 ^operator O2047 = 0.1844107822483583)
  18403. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18404. -->
  18405. (S1 ^operator O2047 = 0.1664311307472832)
  18406. --- END Proposal Phase ---
  18407. --- Decision Phase ---
  18408. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.44762 -> 0.622533 -0.174913 0.44762(R,m,v=1,0.932331,0.063568)
  18409. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.55238 -> 0.377466 0.174913 0.55238(R,m,v=1,1,0)
  18410. =>WM: (14368: S1 ^operator O2050)
  18411. 1025: O: O2050 (predict-no)
  18412. --- END Decision Phase ---
  18413. --- Application Phase ---
  18414. --- Firing Productions (PE) For State At Depth 1 ---
  18415. --- Inner Elaboration Phase, active level 1 (S1) ---
  18416. Firing apply*operator
  18417. -->
  18418. (I3 ^predict-no N1025 + :O )
  18419. Firing apply*operator*complete
  18420. -->
  18421. (I3 ^predict-no N1024 - :O )
  18422. inner elaboration loop at bottom goal.
  18423. --- Change Working Memory (PE) ---
  18424. =>WM: (14369: I3 ^predict-no N1025)
  18425. <=WM: (14357: N1024 ^status complete)
  18426. <=WM: (14356: I3 ^predict-no N1024)
  18427. --- Firing Productions (IE) For State At Depth 1 ---
  18428. --- Inner Elaboration Phase, active level 1 (S1) ---
  18429. Firing monitor*world
  18430. -->
  18431. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18432. --- Change Working Memory (IE) ---
  18433. --- END Application Phase ---
  18434. --- Output Phase ---
  18435. ENV: Agent did: predict-no for direction R in state State-B
  18436. In State-B moving R
  18437. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18438. predict error 0
  18439. dir: dir isL
  18440. --- END Output Phase ---
  18441. /|--- Input Phase ---
  18442. =>WM: (14373: I2 ^dir L)
  18443. =>WM: (14372: I2 ^reward 1)
  18444. =>WM: (14371: I2 ^see 0)
  18445. =>WM: (14370: N1025 ^status complete)
  18446. <=WM: (14360: I2 ^dir R)
  18447. <=WM: (14359: I2 ^reward 1)
  18448. <=WM: (14358: I2 ^see 0)
  18449. =>WM: (14374: I2 ^level-1 R0-root)
  18450. <=WM: (14361: I2 ^level-1 R0-root)
  18451. --- END Input Phase ---
  18452. --- Proposal Phase ---
  18453. --- Inner Elaboration Phase, active level 1 (S1) ---
  18454. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  18455. -->
  18456. (S1 ^operator O2049 = 0.6104610611928351)
  18457. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  18458. -->
  18459. (S1 ^operator O2050 = 0.1063475139796038)
  18460. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18461. -->
  18462. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18463. -->
  18464. Firing elaborate*copy-see-to-output-link
  18465. -->
  18466. (I3 ^see 0 +)
  18467. Firing elaborate*reward*based*on*reward
  18468. -->
  18469. (R1029 ^value 1 +)
  18470. (R1 ^reward R1029 +)
  18471. Firing propose*predict-yes
  18472. -->
  18473. (O2051 ^name predict-yes +)
  18474. (S1 ^operator O2051 +)
  18475. Firing propose*predict-no
  18476. -->
  18477. (O2052 ^name predict-no +)
  18478. (S1 ^operator O2052 +)
  18479. Firing rl*prefer*rvt*predict-no*H0*2
  18480. -->
  18481. (S1 ^operator O2050 = 0.3873355755795274)
  18482. Firing rl*prefer*rvt*predict-yes*H0*1
  18483. -->
  18484. (S1 ^operator O2049 = 0.3895398299261354)
  18485. Firing prefer*rvt*predict-yes*H0
  18486. -->
  18487. Firing prefer*rvt*predict-no*H0
  18488. -->
  18489. Firing elaborate*copy-dir-to-output-link
  18490. -->
  18491. (I3 ^dir L +)
  18492. inner elaboration loop at bottom goal.
  18493. Retracting elaborate*copy-see-to-output-link
  18494. -->
  18495. (I3 ^see 0 +)
  18496. Retracting propose*predict-no
  18497. -->
  18498. (O2050 ^name predict-no +)
  18499. (S1 ^operator O2050 +)
  18500. Retracting propose*predict-yes
  18501. -->
  18502. (O2049 ^name predict-yes +)
  18503. (S1 ^operator O2049 +)
  18504. Retracting elaborate*reward*based*on*reward
  18505. -->
  18506. (R1028 ^value 1 +)
  18507. (R1 ^reward R1028 +)
  18508. Retracting elaborate*copy-dir-to-output-link
  18509. -->
  18510. (I3 ^dir R +)
  18511. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  18512. -->
  18513. (S1 ^operator O2050 = 0.5523798285849839)
  18514. Retracting rl*prefer*rvt*predict-no*H0*4
  18515. -->
  18516. (S1 ^operator O2050 = 0.4476196470230906)
  18517. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  18518. -->
  18519. (S1 ^operator O2049 = 0.1664311307472832)
  18520. Retracting rl*prefer*rvt*predict-yes*H0*3
  18521. -->
  18522. (S1 ^operator O2049 = 0.1844107822483583)
  18523. =>WM: (14381: S1 ^operator O2052 +)
  18524. =>WM: (14380: S1 ^operator O2051 +)
  18525. =>WM: (14379: I3 ^dir L)
  18526. =>WM: (14378: O2052 ^name predict-no)
  18527. =>WM: (14377: O2051 ^name predict-yes)
  18528. =>WM: (14376: R1029 ^value 1)
  18529. =>WM: (14375: R1 ^reward R1029)
  18530. <=WM: (14366: S1 ^operator O2049 +)
  18531. <=WM: (14367: S1 ^operator O2050 +)
  18532. <=WM: (14368: S1 ^operator O2050)
  18533. <=WM: (14298: I3 ^dir R)
  18534. <=WM: (14362: R1 ^reward R1028)
  18535. <=WM: (14365: O2050 ^name predict-no)
  18536. <=WM: (14364: O2049 ^name predict-yes)
  18537. <=WM: (14363: R1028 ^value 1)
  18538. --- Inner Elaboration Phase, active level 1 (S1) ---
  18539. Firing prefer*rvt*predict-yes*H0
  18540. -->
  18541. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  18542. -->
  18543. (S1 ^operator O2051 = 0.6104610611928351)
  18544. Firing rl*prefer*rvt*predict-yes*H0*1
  18545. -->
  18546. (S1 ^operator O2051 = 0.3895398299261354)
  18547. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  18548. -->
  18549. Firing prefer*rvt*predict-no*H0
  18550. -->
  18551. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  18552. -->
  18553. (S1 ^operator O2052 = 0.1063475139796038)
  18554. Firing rl*prefer*rvt*predict-no*H0*2
  18555. -->
  18556. (S1 ^operator O2052 = 0.3873355755795274)
  18557. Firing prefer*rvt*predict-no*H0*2*v1*H1
  18558. -->
  18559. inner elaboration loop at bottom goal.
  18560. Retracting rl*prefer*rvt*predict-no*H0*2
  18561. -->
  18562. (S1 ^operator O2050 = 0.3873355755795274)
  18563. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  18564. -->
  18565. (S1 ^operator O2050 = 0.1063475139796038)
  18566. Retracting rl*prefer*rvt*predict-yes*H0*1
  18567. -->
  18568. (S1 ^operator O2049 = 0.3895398299261354)
  18569. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  18570. -->
  18571. (S1 ^operator O2049 = 0.6104610611928351)
  18572. --- END Proposal Phase ---
  18573. --- Decision Phase ---
  18574. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174913 0.44762 -> 0.622533 -0.174913 0.44762(R,m,v=1,0.932836,0.0631242)
  18575. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.55238 -> 0.377466 0.174913 0.55238(R,m,v=1,1,0)
  18576. =>WM: (14382: S1 ^operator O2051)
  18577. 1026: O: O2051 (predict-yes)
  18578. --- END Decision Phase ---
  18579. --- Application Phase ---
  18580. --- Firing Productions (PE) For State At Depth 1 ---
  18581. --- Inner Elaboration Phase, active level 1 (S1) ---
  18582. Firing apply*operator
  18583. -->
  18584. (I3 ^predict-yes N1026 + :O )
  18585. Firing apply*operator*complete
  18586. -->
  18587. (I3 ^predict-no N1025 - :O )
  18588. inner elaboration loop at bottom goal.
  18589. --- Change Working Memory (PE) ---
  18590. =>WM: (14383: I3 ^predict-yes N1026)
  18591. <=WM: (14370: N1025 ^status complete)
  18592. <=WM: (14369: I3 ^predict-no N1025)
  18593. --- Firing Productions (IE) For State At Depth 1 ---
  18594. --- Inner Elaboration Phase, active level 1 (S1) ---
  18595. Firing monitor*world
  18596. -->
  18597. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18598. --- Change Working Memory (IE) ---
  18599. --- END Application Phase ---
  18600. --- Output Phase ---
  18601. ENV: Agent did: predict-yes for direction L in state State-B
  18602. In State-B moving L
  18603. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  18604. predict error 0
  18605. dir: dir isR
  18606. --- END Output Phase ---
  18607. \-/--- Input Phase ---
  18608. =>WM: (14387: I2 ^dir R)
  18609. =>WM: (14386: I2 ^reward 1)
  18610. =>WM: (14385: I2 ^see 1)
  18611. =>WM: (14384: N1026 ^status complete)
  18612. <=WM: (14373: I2 ^dir L)
  18613. <=WM: (14372: I2 ^reward 1)
  18614. <=WM: (14371: I2 ^see 0)
  18615. =>WM: (14388: I2 ^level-1 L1-root)
  18616. <=WM: (14374: I2 ^level-1 R0-root)
  18617. --- END Input Phase ---
  18618. --- Proposal Phase ---
  18619. --- Inner Elaboration Phase, active level 1 (S1) ---
  18620. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  18621. -->
  18622. (S1 ^operator O2052 = -0.02155734064455064)
  18623. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  18624. -->
  18625. (S1 ^operator O2051 = 0.815583266028165)
  18626. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18627. -->
  18628. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18629. -->
  18630. Firing elaborate*copy-see-to-output-link
  18631. -->
  18632. (I3 ^see 1 +)
  18633. Firing elaborate*reward*based*on*reward
  18634. -->
  18635. (R1030 ^value 1 +)
  18636. (R1 ^reward R1030 +)
  18637. Firing propose*predict-yes
  18638. -->
  18639. (O2053 ^name predict-yes +)
  18640. (S1 ^operator O2053 +)
  18641. Firing propose*predict-no
  18642. -->
  18643. (O2054 ^name predict-no +)
  18644. (S1 ^operator O2054 +)
  18645. Firing rl*prefer*rvt*predict-no*H0*4
  18646. -->
  18647. (S1 ^operator O2052 = 0.4476197256818795)
  18648. Firing rl*prefer*rvt*predict-yes*H0*3
  18649. -->
  18650. (S1 ^operator O2051 = 0.1844107822483583)
  18651. Firing prefer*rvt*predict-yes*H0
  18652. -->
  18653. Firing prefer*rvt*predict-no*H0
  18654. -->
  18655. Firing elaborate*copy-dir-to-output-link
  18656. -->
  18657. (I3 ^dir R +)
  18658. inner elaboration loop at bottom goal.
  18659. Retracting elaborate*copy-see-to-output-link
  18660. -->
  18661. (I3 ^see 0 +)
  18662. Retracting propose*predict-no
  18663. -->
  18664. (O2052 ^name predict-no +)
  18665. (S1 ^operator O2052 +)
  18666. Retracting propose*predict-yes
  18667. -->
  18668. (O2051 ^name predict-yes +)
  18669. (S1 ^operator O2051 +)
  18670. Retracting elaborate*reward*based*on*reward
  18671. -->
  18672. (R1029 ^value 1 +)
  18673. (R1 ^reward R1029 +)
  18674. Retracting elaborate*copy-dir-to-output-link
  18675. -->
  18676. (I3 ^dir L +)
  18677. Retracting rl*prefer*rvt*predict-no*H0*2
  18678. -->
  18679. (S1 ^operator O2052 = 0.3873355755795274)
  18680. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  18681. -->
  18682. (S1 ^operator O2052 = 0.1063475139796038)
  18683. Retracting rl*prefer*rvt*predict-yes*H0*1
  18684. -->
  18685. (S1 ^operator O2051 = 0.3895398299261354)
  18686. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  18687. -->
  18688. (S1 ^operator O2051 = 0.6104610611928351)
  18689. =>WM: (14396: S1 ^operator O2054 +)
  18690. =>WM: (14395: S1 ^operator O2053 +)
  18691. =>WM: (14394: I3 ^dir R)
  18692. =>WM: (14393: O2054 ^name predict-no)
  18693. =>WM: (14392: O2053 ^name predict-yes)
  18694. =>WM: (14391: R1030 ^value 1)
  18695. =>WM: (14390: R1 ^reward R1030)
  18696. =>WM: (14389: I3 ^see 1)
  18697. <=WM: (14380: S1 ^operator O2051 +)
  18698. <=WM: (14382: S1 ^operator O2051)
  18699. <=WM: (14381: S1 ^operator O2052 +)
  18700. <=WM: (14379: I3 ^dir L)
  18701. <=WM: (14375: R1 ^reward R1029)
  18702. <=WM: (14322: I3 ^see 0)
  18703. <=WM: (14378: O2052 ^name predict-no)
  18704. <=WM: (14377: O2051 ^name predict-yes)
  18705. <=WM: (14376: R1029 ^value 1)
  18706. --- Inner Elaboration Phase, active level 1 (S1) ---
  18707. Firing prefer*rvt*predict-yes*H0
  18708. -->
  18709. Firing rl*prefer*rvt*predict-yes*H0*3
  18710. -->
  18711. (S1 ^operator O2053 = 0.1844107822483583)
  18712. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18713. -->
  18714. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  18715. -->
  18716. (S1 ^operator O2053 = 0.815583266028165)
  18717. Firing prefer*rvt*predict-no*H0
  18718. -->
  18719. Firing rl*prefer*rvt*predict-no*H0*4
  18720. -->
  18721. (S1 ^operator O2054 = 0.4476197256818795)
  18722. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18723. -->
  18724. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  18725. -->
  18726. (S1 ^operator O2054 = -0.02155734064455064)
  18727. inner elaboration loop at bottom goal.
  18728. Retracting rl*prefer*rvt*predict-no*H0*4
  18729. -->
  18730. (S1 ^operator O2052 = 0.4476197256818795)
  18731. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  18732. -->
  18733. (S1 ^operator O2052 = -0.02155734064455064)
  18734. Retracting rl*prefer*rvt*predict-yes*H0*3
  18735. -->
  18736. (S1 ^operator O2051 = 0.1844107822483583)
  18737. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  18738. -->
  18739. (S1 ^operator O2051 = 0.815583266028165)
  18740. --- END Proposal Phase ---
  18741. --- Decision Phase ---
  18742. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322411 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.895349,0.0942472)
  18743. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  18744. =>WM: (14397: S1 ^operator O2053)
  18745. 1027: O: O2053 (predict-yes)
  18746. --- END Decision Phase ---
  18747. --- Application Phase ---
  18748. --- Firing Productions (PE) For State At Depth 1 ---
  18749. --- Inner Elaboration Phase, active level 1 (S1) ---
  18750. Firing apply*operator
  18751. -->
  18752. (I3 ^predict-yes N1027 + :O )
  18753. Firing apply*operator*complete
  18754. -->
  18755. (I3 ^predict-yes N1026 - :O )
  18756. inner elaboration loop at bottom goal.
  18757. --- Change Working Memory (PE) ---
  18758. =>WM: (14398: I3 ^predict-yes N1027)
  18759. <=WM: (14384: N1026 ^status complete)
  18760. <=WM: (14383: I3 ^predict-yes N1026)
  18761. --- Firing Productions (IE) For State At Depth 1 ---
  18762. --- Inner Elaboration Phase, active level 1 (S1) ---
  18763. Firing monitor*world
  18764. -->
  18765. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18766. --- Change Working Memory (IE) ---
  18767. --- END Application Phase ---
  18768. --- Output Phase ---
  18769. ENV: Agent did: predict-yes for direction R in state State-A
  18770. In State-A moving R
  18771. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18772. predict error 0
  18773. dir: dir isU
  18774. --- END Output Phase ---
  18775. |\---- Input Phase ---
  18776. =>WM: (14402: I2 ^dir U)
  18777. =>WM: (14401: I2 ^reward 1)
  18778. =>WM: (14400: I2 ^see 1)
  18779. =>WM: (14399: N1027 ^status complete)
  18780. <=WM: (14387: I2 ^dir R)
  18781. <=WM: (14386: I2 ^reward 1)
  18782. <=WM: (14385: I2 ^see 1)
  18783. =>WM: (14403: I2 ^level-1 R1-root)
  18784. <=WM: (14388: I2 ^level-1 L1-root)
  18785. --- END Input Phase ---
  18786. --- Proposal Phase ---
  18787. --- Inner Elaboration Phase, active level 1 (S1) ---
  18788. Firing elaborate*copy-see-to-output-link
  18789. -->
  18790. (I3 ^see 1 +)
  18791. Firing elaborate*reward*based*on*reward
  18792. -->
  18793. (R1031 ^value 1 +)
  18794. (R1 ^reward R1031 +)
  18795. Firing propose*predict-yes
  18796. -->
  18797. (O2055 ^name predict-yes +)
  18798. (S1 ^operator O2055 +)
  18799. Firing propose*predict-no
  18800. -->
  18801. (O2056 ^name predict-no +)
  18802. (S1 ^operator O2056 +)
  18803. Firing rl*prefer*rvt*predict-no*H0*6
  18804. -->
  18805. (S1 ^operator O2054 = 0.9999999999999999)
  18806. Firing rl*prefer*rvt*predict-yes*H0*5
  18807. -->
  18808. (S1 ^operator O2053 = 0.)
  18809. Firing prefer*rvt*predict-yes*H0
  18810. -->
  18811. Firing prefer*rvt*predict-no*H0
  18812. -->
  18813. Firing elaborate*copy-dir-to-output-link
  18814. -->
  18815. (I3 ^dir U +)
  18816. inner elaboration loop at bottom goal.
  18817. Retracting elaborate*copy-see-to-output-link
  18818. -->
  18819. (I3 ^see 1 +)
  18820. Retracting propose*predict-no
  18821. -->
  18822. (O2054 ^name predict-no +)
  18823. (S1 ^operator O2054 +)
  18824. Retracting propose*predict-yes
  18825. -->
  18826. (O2053 ^name predict-yes +)
  18827. (S1 ^operator O2053 +)
  18828. Retracting elaborate*reward*based*on*reward
  18829. -->
  18830. (R1030 ^value 1 +)
  18831. (R1 ^reward R1030 +)
  18832. Retracting elaborate*copy-dir-to-output-link
  18833. -->
  18834. (I3 ^dir R +)
  18835. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  18836. -->
  18837. (S1 ^operator O2054 = -0.02155734064455064)
  18838. Retracting rl*prefer*rvt*predict-no*H0*4
  18839. -->
  18840. (S1 ^operator O2054 = 0.4476197256818795)
  18841. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  18842. -->
  18843. (S1 ^operator O2053 = 0.815583266028165)
  18844. Retracting rl*prefer*rvt*predict-yes*H0*3
  18845. -->
  18846. (S1 ^operator O2053 = 0.1844107822483583)
  18847. =>WM: (14410: S1 ^operator O2056 +)
  18848. =>WM: (14409: S1 ^operator O2055 +)
  18849. =>WM: (14408: I3 ^dir U)
  18850. =>WM: (14407: O2056 ^name predict-no)
  18851. =>WM: (14406: O2055 ^name predict-yes)
  18852. =>WM: (14405: R1031 ^value 1)
  18853. =>WM: (14404: R1 ^reward R1031)
  18854. <=WM: (14395: S1 ^operator O2053 +)
  18855. <=WM: (14397: S1 ^operator O2053)
  18856. <=WM: (14396: S1 ^operator O2054 +)
  18857. <=WM: (14394: I3 ^dir R)
  18858. <=WM: (14390: R1 ^reward R1030)
  18859. <=WM: (14393: O2054 ^name predict-no)
  18860. <=WM: (14392: O2053 ^name predict-yes)
  18861. <=WM: (14391: R1030 ^value 1)
  18862. --- Inner Elaboration Phase, active level 1 (S1) ---
  18863. Firing prefer*rvt*predict-yes*H0
  18864. -->
  18865. Firing rl*prefer*rvt*predict-yes*H0*5
  18866. -->
  18867. (S1 ^operator O2055 = 0.)
  18868. Firing prefer*rvt*predict-no*H0
  18869. -->
  18870. Firing rl*prefer*rvt*predict-no*H0*6
  18871. -->
  18872. (S1 ^operator O2056 = 0.9999999999999999)
  18873. inner elaboration loop at bottom goal.
  18874. Retracting rl*prefer*rvt*predict-no*H0*6
  18875. -->
  18876. (S1 ^operator O2054 = 0.9999999999999999)
  18877. Retracting rl*prefer*rvt*predict-yes*H0*5
  18878. -->
  18879. (S1 ^operator O2053 = 0.)
  18880. --- END Proposal Phase ---
  18881. --- Decision Phase ---
  18882. RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491002 0.184411 -> 0.675414 -0.491003 0.184412(R,m,v=1,0.902857,0.0882102)
  18883. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324579 0.491004 0.815583 -> 0.32458 0.491004 0.815584(R,m,v=1,1,0)
  18884. =>WM: (14411: S1 ^operator O2056)
  18885. 1028: O: O2056 (predict-no)
  18886. --- END Decision Phase ---
  18887. --- Application Phase ---
  18888. --- Firing Productions (PE) For State At Depth 1 ---
  18889. --- Inner Elaboration Phase, active level 1 (S1) ---
  18890. Firing apply*operator
  18891. -->
  18892. (I3 ^predict-no N1028 + :O )
  18893. Firing apply*operator*complete
  18894. -->
  18895. (I3 ^predict-yes N1027 - :O )
  18896. inner elaboration loop at bottom goal.
  18897. --- Change Working Memory (PE) ---
  18898. =>WM: (14412: I3 ^predict-no N1028)
  18899. <=WM: (14399: N1027 ^status complete)
  18900. <=WM: (14398: I3 ^predict-yes N1027)
  18901. --- Firing Productions (IE) For State At Depth 1 ---
  18902. --- Inner Elaboration Phase, active level 1 (S1) ---
  18903. Firing monitor*world
  18904. -->
  18905. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18906. --- Change Working Memory (IE) ---
  18907. --- END Application Phase ---
  18908. --- Output Phase ---
  18909. ENV: Agent did: predict-no for direction U in state State-B
  18910. In State-B moving U
  18911. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18912. predict error 0
  18913. dir: dir isU
  18914. --- END Output Phase ---
  18915. /|\--- Input Phase ---
  18916. =>WM: (14416: I2 ^dir U)
  18917. =>WM: (14415: I2 ^reward 1)
  18918. =>WM: (14414: I2 ^see 0)
  18919. =>WM: (14413: N1028 ^status complete)
  18920. <=WM: (14402: I2 ^dir U)
  18921. <=WM: (14401: I2 ^reward 1)
  18922. <=WM: (14400: I2 ^see 1)
  18923. =>WM: (14417: I2 ^level-1 R1-root)
  18924. <=WM: (14403: I2 ^level-1 R1-root)
  18925. --- END Input Phase ---
  18926. --- Proposal Phase ---
  18927. --- Inner Elaboration Phase, active level 1 (S1) ---
  18928. Firing elaborate*copy-see-to-output-link
  18929. -->
  18930. (I3 ^see 0 +)
  18931. Firing elaborate*reward*based*on*reward
  18932. -->
  18933. (R1032 ^value 1 +)
  18934. (R1 ^reward R1032 +)
  18935. Firing propose*predict-yes
  18936. -->
  18937. (O2057 ^name predict-yes +)
  18938. (S1 ^operator O2057 +)
  18939. Firing propose*predict-no
  18940. -->
  18941. (O2058 ^name predict-no +)
  18942. (S1 ^operator O2058 +)
  18943. Firing rl*prefer*rvt*predict-no*H0*6
  18944. -->
  18945. (S1 ^operator O2056 = 0.9999999999999999)
  18946. Firing rl*prefer*rvt*predict-yes*H0*5
  18947. -->
  18948. (S1 ^operator O2055 = 0.)
  18949. Firing prefer*rvt*predict-yes*H0
  18950. -->
  18951. Firing prefer*rvt*predict-no*H0
  18952. -->
  18953. Firing elaborate*copy-dir-to-output-link
  18954. -->
  18955. (I3 ^dir U +)
  18956. inner elaboration loop at bottom goal.
  18957. Retracting elaborate*copy-see-to-output-link
  18958. -->
  18959. (I3 ^see 1 +)
  18960. Retracting propose*predict-no
  18961. -->
  18962. (O2056 ^name predict-no +)
  18963. (S1 ^operator O2056 +)
  18964. Retracting propose*predict-yes
  18965. -->
  18966. (O2055 ^name predict-yes +)
  18967. (S1 ^operator O2055 +)
  18968. Retracting elaborate*reward*based*on*reward
  18969. -->
  18970. (R1031 ^value 1 +)
  18971. (R1 ^reward R1031 +)
  18972. Retracting elaborate*copy-dir-to-output-link
  18973. -->
  18974. (I3 ^dir U +)
  18975. Retracting rl*prefer*rvt*predict-no*H0*6
  18976. -->
  18977. (S1 ^operator O2056 = 0.9999999999999999)
  18978. Retracting rl*prefer*rvt*predict-yes*H0*5
  18979. -->
  18980. (S1 ^operator O2055 = 0.)
  18981. =>WM: (14424: S1 ^operator O2058 +)
  18982. =>WM: (14423: S1 ^operator O2057 +)
  18983. =>WM: (14422: O2058 ^name predict-no)
  18984. =>WM: (14421: O2057 ^name predict-yes)
  18985. =>WM: (14420: R1032 ^value 1)
  18986. =>WM: (14419: R1 ^reward R1032)
  18987. =>WM: (14418: I3 ^see 0)
  18988. <=WM: (14409: S1 ^operator O2055 +)
  18989. <=WM: (14410: S1 ^operator O2056 +)
  18990. <=WM: (14411: S1 ^operator O2056)
  18991. <=WM: (14404: R1 ^reward R1031)
  18992. <=WM: (14389: I3 ^see 1)
  18993. <=WM: (14407: O2056 ^name predict-no)
  18994. <=WM: (14406: O2055 ^name predict-yes)
  18995. <=WM: (14405: R1031 ^value 1)
  18996. --- Inner Elaboration Phase, active level 1 (S1) ---
  18997. Firing prefer*rvt*predict-yes*H0
  18998. -->
  18999. Firing rl*prefer*rvt*predict-yes*H0*5
  19000. -->
  19001. (S1 ^operator O2057 = 0.)
  19002. Firing prefer*rvt*predict-no*H0
  19003. -->
  19004. Firing rl*prefer*rvt*predict-no*H0*6
  19005. -->
  19006. (S1 ^operator O2058 = 0.9999999999999999)
  19007. inner elaboration loop at bottom goal.
  19008. Retracting rl*prefer*rvt*predict-no*H0*6
  19009. -->
  19010. (S1 ^operator O2056 = 0.9999999999999999)
  19011. Retracting rl*prefer*rvt*predict-yes*H0*5
  19012. -->
  19013. (S1 ^operator O2055 = 0.)
  19014. --- END Proposal Phase ---
  19015. --- Decision Phase ---
  19016. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19017. =>WM: (14425: S1 ^operator O2058)
  19018. 1029: O: O2058 (predict-no)
  19019. --- END Decision Phase ---
  19020. --- Application Phase ---
  19021. --- Firing Productions (PE) For State At Depth 1 ---
  19022. --- Inner Elaboration Phase, active level 1 (S1) ---
  19023. Firing apply*operator
  19024. -->
  19025. (I3 ^predict-no N1029 + :O )
  19026. Firing apply*operator*complete
  19027. -->
  19028. (I3 ^predict-no N1028 - :O )
  19029. inner elaboration loop at bottom goal.
  19030. --- Change Working Memory (PE) ---
  19031. =>WM: (14426: I3 ^predict-no N1029)
  19032. <=WM: (14413: N1028 ^status complete)
  19033. <=WM: (14412: I3 ^predict-no N1028)
  19034. --- Firing Productions (IE) For State At Depth 1 ---
  19035. --- Inner Elaboration Phase, active level 1 (S1) ---
  19036. Firing monitor*world
  19037. -->
  19038. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19039. --- Change Working Memory (IE) ---
  19040. --- END Application Phase ---
  19041. --- Output Phase ---
  19042. ENV: Agent did: predict-no for direction U in state State-B
  19043. In State-B moving U
  19044. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19045. predict error 0
  19046. dir: dir isU
  19047. --- END Output Phase ---
  19048. -/|--- Input Phase ---
  19049. =>WM: (14430: I2 ^dir U)
  19050. =>WM: (14429: I2 ^reward 1)
  19051. =>WM: (14428: I2 ^see 0)
  19052. =>WM: (14427: N1029 ^status complete)
  19053. <=WM: (14416: I2 ^dir U)
  19054. <=WM: (14415: I2 ^reward 1)
  19055. <=WM: (14414: I2 ^see 0)
  19056. =>WM: (14431: I2 ^level-1 R1-root)
  19057. <=WM: (14417: I2 ^level-1 R1-root)
  19058. --- END Input Phase ---
  19059. --- Proposal Phase ---
  19060. --- Inner Elaboration Phase, active level 1 (S1) ---
  19061. Firing elaborate*copy-see-to-output-link
  19062. -->
  19063. (I3 ^see 0 +)
  19064. Firing elaborate*reward*based*on*reward
  19065. -->
  19066. (R1033 ^value 1 +)
  19067. (R1 ^reward R1033 +)
  19068. Firing propose*predict-yes
  19069. -->
  19070. (O2059 ^name predict-yes +)
  19071. (S1 ^operator O2059 +)
  19072. Firing propose*predict-no
  19073. -->
  19074. (O2060 ^name predict-no +)
  19075. (S1 ^operator O2060 +)
  19076. Firing rl*prefer*rvt*predict-no*H0*6
  19077. -->
  19078. (S1 ^operator O2058 = 0.9999999999999999)
  19079. Firing rl*prefer*rvt*predict-yes*H0*5
  19080. -->
  19081. (S1 ^operator O2057 = 0.)
  19082. Firing prefer*rvt*predict-yes*H0
  19083. -->
  19084. Firing prefer*rvt*predict-no*H0
  19085. -->
  19086. Firing elaborate*copy-dir-to-output-link
  19087. -->
  19088. (I3 ^dir U +)
  19089. inner elaboration loop at bottom goal.
  19090. Retracting elaborate*copy-see-to-output-link
  19091. -->
  19092. (I3 ^see 0 +)
  19093. Retracting propose*predict-no
  19094. -->
  19095. (O2058 ^name predict-no +)
  19096. (S1 ^operator O2058 +)
  19097. Retracting propose*predict-yes
  19098. -->
  19099. (O2057 ^name predict-yes +)
  19100. (S1 ^operator O2057 +)
  19101. Retracting elaborate*reward*based*on*reward
  19102. -->
  19103. (R1032 ^value 1 +)
  19104. (R1 ^reward R1032 +)
  19105. Retracting elaborate*copy-dir-to-output-link
  19106. -->
  19107. (I3 ^dir U +)
  19108. Retracting rl*prefer*rvt*predict-no*H0*6
  19109. -->
  19110. (S1 ^operator O2058 = 0.9999999999999999)
  19111. Retracting rl*prefer*rvt*predict-yes*H0*5
  19112. -->
  19113. (S1 ^operator O2057 = 0.)
  19114. =>WM: (14437: S1 ^operator O2060 +)
  19115. =>WM: (14436: S1 ^operator O2059 +)
  19116. =>WM: (14435: O2060 ^name predict-no)
  19117. =>WM: (14434: O2059 ^name predict-yes)
  19118. =>WM: (14433: R1033 ^value 1)
  19119. =>WM: (14432: R1 ^reward R1033)
  19120. <=WM: (14423: S1 ^operator O2057 +)
  19121. <=WM: (14424: S1 ^operator O2058 +)
  19122. <=WM: (14425: S1 ^operator O2058)
  19123. <=WM: (14419: R1 ^reward R1032)
  19124. <=WM: (14422: O2058 ^name predict-no)
  19125. <=WM: (14421: O2057 ^name predict-yes)
  19126. <=WM: (14420: R1032 ^value 1)
  19127. --- Inner Elaboration Phase, active level 1 (S1) ---
  19128. Firing prefer*rvt*predict-yes*H0
  19129. -->
  19130. Firing rl*prefer*rvt*predict-yes*H0*5
  19131. -->
  19132. (S1 ^operator O2059 = 0.)
  19133. Firing prefer*rvt*predict-no*H0
  19134. -->
  19135. Firing rl*prefer*rvt*predict-no*H0*6
  19136. -->
  19137. (S1 ^operator O2060 = 0.9999999999999999)
  19138. inner elaboration loop at bottom goal.
  19139. Retracting rl*prefer*rvt*predict-no*H0*6
  19140. -->
  19141. (S1 ^operator O2058 = 0.9999999999999999)
  19142. Retracting rl*prefer*rvt*predict-yes*H0*5
  19143. -->
  19144. (S1 ^operator O2057 = 0.)
  19145. --- END Proposal Phase ---
  19146. --- Decision Phase ---
  19147. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19148. =>WM: (14438: S1 ^operator O2060)
  19149. 1030: O: O2060 (predict-no)
  19150. --- END Decision Phase ---
  19151. --- Application Phase ---
  19152. --- Firing Productions (PE) For State At Depth 1 ---
  19153. --- Inner Elaboration Phase, active level 1 (S1) ---
  19154. Firing apply*operator
  19155. -->
  19156. (I3 ^predict-no N1030 + :O )
  19157. Firing apply*operator*complete
  19158. -->
  19159. (I3 ^predict-no N1029 - :O )
  19160. inner elaboration loop at bottom goal.
  19161. --- Change Working Memory (PE) ---
  19162. =>WM: (14439: I3 ^predict-no N1030)
  19163. <=WM: (14427: N1029 ^status complete)
  19164. <=WM: (14426: I3 ^predict-no N1029)
  19165. --- Firing Productions (IE) For State At Depth 1 ---
  19166. --- Inner Elaboration Phase, active level 1 (S1) ---
  19167. Firing monitor*world
  19168. -->
  19169. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19170. --- Change Working Memory (IE) ---
  19171. --- END Application Phase ---
  19172. --- Output Phase ---
  19173. ENV: Agent did: predict-no for direction U in state State-B
  19174. In State-B moving U
  19175. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19176. predict error 0
  19177. dir: dir isU
  19178. --- END Output Phase ---
  19179. \---- Input Phase ---
  19180. =>WM: (14443: I2 ^dir U)
  19181. =>WM: (14442: I2 ^reward 1)
  19182. =>WM: (14441: I2 ^see 0)
  19183. =>WM: (14440: N1030 ^status complete)
  19184. <=WM: (14430: I2 ^dir U)
  19185. <=WM: (14429: I2 ^reward 1)
  19186. <=WM: (14428: I2 ^see 0)
  19187. =>WM: (14444: I2 ^level-1 R1-root)
  19188. <=WM: (14431: I2 ^level-1 R1-root)
  19189. --- END Input Phase ---
  19190. --- Proposal Phase ---
  19191. --- Inner Elaboration Phase, active level 1 (S1) ---
  19192. Firing elaborate*copy-see-to-output-link
  19193. -->
  19194. (I3 ^see 0 +)
  19195. Firing elaborate*reward*based*on*reward
  19196. -->
  19197. (R1034 ^value 1 +)
  19198. (R1 ^reward R1034 +)
  19199. Firing propose*predict-yes
  19200. -->
  19201. (O2061 ^name predict-yes +)
  19202. (S1 ^operator O2061 +)
  19203. Firing propose*predict-no
  19204. -->
  19205. (O2062 ^name predict-no +)
  19206. (S1 ^operator O2062 +)
  19207. Firing rl*prefer*rvt*predict-no*H0*6
  19208. -->
  19209. (S1 ^operator O2060 = 0.9999999999999999)
  19210. Firing rl*prefer*rvt*predict-yes*H0*5
  19211. -->
  19212. (S1 ^operator O2059 = 0.)
  19213. Firing prefer*rvt*predict-yes*H0
  19214. -->
  19215. Firing prefer*rvt*predict-no*H0
  19216. -->
  19217. Firing elaborate*copy-dir-to-output-link
  19218. -->
  19219. (I3 ^dir U +)
  19220. inner elaboration loop at bottom goal.
  19221. Retracting elaborate*copy-see-to-output-link
  19222. -->
  19223. (I3 ^see 0 +)
  19224. Retracting propose*predict-no
  19225. -->
  19226. (O2060 ^name predict-no +)
  19227. (S1 ^operator O2060 +)
  19228. Retracting propose*predict-yes
  19229. -->
  19230. (O2059 ^name predict-yes +)
  19231. (S1 ^operator O2059 +)
  19232. Retracting elaborate*reward*based*on*reward
  19233. -->
  19234. (R1033 ^value 1 +)
  19235. (R1 ^reward R1033 +)
  19236. Retracting elaborate*copy-dir-to-output-link
  19237. -->
  19238. (I3 ^dir U +)
  19239. Retracting rl*prefer*rvt*predict-no*H0*6
  19240. -->
  19241. (S1 ^operator O2060 = 0.9999999999999999)
  19242. Retracting rl*prefer*rvt*predict-yes*H0*5
  19243. -->
  19244. (S1 ^operator O2059 = 0.)
  19245. =>WM: (14450: S1 ^operator O2062 +)
  19246. =>WM: (14449: S1 ^operator O2061 +)
  19247. =>WM: (14448: O2062 ^name predict-no)
  19248. =>WM: (14447: O2061 ^name predict-yes)
  19249. =>WM: (14446: R1034 ^value 1)
  19250. =>WM: (14445: R1 ^reward R1034)
  19251. <=WM: (14436: S1 ^operator O2059 +)
  19252. <=WM: (14437: S1 ^operator O2060 +)
  19253. <=WM: (14438: S1 ^operator O2060)
  19254. <=WM: (14432: R1 ^reward R1033)
  19255. <=WM: (14435: O2060 ^name predict-no)
  19256. <=WM: (14434: O2059 ^name predict-yes)
  19257. <=WM: (14433: R1033 ^value 1)
  19258. --- Inner Elaboration Phase, active level 1 (S1) ---
  19259. Firing prefer*rvt*predict-yes*H0
  19260. -->
  19261. Firing rl*prefer*rvt*predict-yes*H0*5
  19262. -->
  19263. (S1 ^operator O2061 = 0.)
  19264. Firing prefer*rvt*predict-no*H0
  19265. -->
  19266. Firing rl*prefer*rvt*predict-no*H0*6
  19267. -->
  19268. (S1 ^operator O2062 = 0.9999999999999999)
  19269. inner elaboration loop at bottom goal.
  19270. Retracting rl*prefer*rvt*predict-no*H0*6
  19271. -->
  19272. (S1 ^operator O2060 = 0.9999999999999999)
  19273. Retracting rl*prefer*rvt*predict-yes*H0*5
  19274. -->
  19275. (S1 ^operator O2059 = 0.)
  19276. --- END Proposal Phase ---
  19277. --- Decision Phase ---
  19278. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19279. =>WM: (14451: S1 ^operator O2062)
  19280. 1031: O: O2062 (predict-no)
  19281. --- END Decision Phase ---
  19282. --- Application Phase ---
  19283. --- Firing Productions (PE) For State At Depth 1 ---
  19284. --- Inner Elaboration Phase, active level 1 (S1) ---
  19285. Firing apply*operator
  19286. -->
  19287. (I3 ^predict-no N1031 + :O )
  19288. Firing apply*operator*complete
  19289. -->
  19290. (I3 ^predict-no N1030 - :O )
  19291. inner elaboration loop at bottom goal.
  19292. --- Change Working Memory (PE) ---
  19293. =>WM: (14452: I3 ^predict-no N1031)
  19294. <=WM: (14440: N1030 ^status complete)
  19295. <=WM: (14439: I3 ^predict-no N1030)
  19296. --- Firing Productions (IE) For State At Depth 1 ---
  19297. --- Inner Elaboration Phase, active level 1 (S1) ---
  19298. Firing monitor*world
  19299. -->
  19300. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19301. --- Change Working Memory (IE) ---
  19302. --- END Application Phase ---
  19303. --- Output Phase ---
  19304. ENV: Agent did: predict-no for direction U in state State-B
  19305. In State-B moving U
  19306. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19307. predict error 0
  19308. dir: dir isR
  19309. --- END Output Phase ---
  19310. /--- Input Phase ---
  19311. =>WM: (14456: I2 ^dir R)
  19312. =>WM: (14455: I2 ^reward 1)
  19313. =>WM: (14454: I2 ^see 0)
  19314. =>WM: (14453: N1031 ^status complete)
  19315. <=WM: (14443: I2 ^dir U)
  19316. <=WM: (14442: I2 ^reward 1)
  19317. <=WM: (14441: I2 ^see 0)
  19318. =>WM: (14457: I2 ^level-1 R1-root)
  19319. <=WM: (14444: I2 ^level-1 R1-root)
  19320. --- END Input Phase ---
  19321. --- Proposal Phase ---
  19322. --- Inner Elaboration Phase, active level 1 (S1) ---
  19323. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19324. -->
  19325. (S1 ^operator O2061 = 0.1398795999120246)
  19326. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19327. -->
  19328. (S1 ^operator O2062 = 0.5523816480808952)
  19329. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19330. -->
  19331. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19332. -->
  19333. Firing elaborate*copy-see-to-output-link
  19334. -->
  19335. (I3 ^see 0 +)
  19336. Firing elaborate*reward*based*on*reward
  19337. -->
  19338. (R1035 ^value 1 +)
  19339. (R1 ^reward R1035 +)
  19340. Firing propose*predict-yes
  19341. -->
  19342. (O2063 ^name predict-yes +)
  19343. (S1 ^operator O2063 +)
  19344. Firing propose*predict-no
  19345. -->
  19346. (O2064 ^name predict-no +)
  19347. (S1 ^operator O2064 +)
  19348. Firing rl*prefer*rvt*predict-no*H0*4
  19349. -->
  19350. (S1 ^operator O2062 = 0.4476197256818795)
  19351. Firing rl*prefer*rvt*predict-yes*H0*3
  19352. -->
  19353. (S1 ^operator O2061 = 0.1844116750068798)
  19354. Firing prefer*rvt*predict-yes*H0
  19355. -->
  19356. Firing prefer*rvt*predict-no*H0
  19357. -->
  19358. Firing elaborate*copy-dir-to-output-link
  19359. -->
  19360. (I3 ^dir R +)
  19361. inner elaboration loop at bottom goal.
  19362. Retracting elaborate*copy-see-to-output-link
  19363. -->
  19364. (I3 ^see 0 +)
  19365. Retracting propose*predict-no
  19366. -->
  19367. (O2062 ^name predict-no +)
  19368. (S1 ^operator O2062 +)
  19369. Retracting propose*predict-yes
  19370. -->
  19371. (O2061 ^name predict-yes +)
  19372. (S1 ^operator O2061 +)
  19373. Retracting elaborate*reward*based*on*reward
  19374. -->
  19375. (R1034 ^value 1 +)
  19376. (R1 ^reward R1034 +)
  19377. Retracting elaborate*copy-dir-to-output-link
  19378. -->
  19379. (I3 ^dir U +)
  19380. Retracting rl*prefer*rvt*predict-no*H0*6
  19381. -->
  19382. (S1 ^operator O2062 = 0.9999999999999999)
  19383. Retracting rl*prefer*rvt*predict-yes*H0*5
  19384. -->
  19385. (S1 ^operator O2061 = 0.)
  19386. =>WM: (14464: S1 ^operator O2064 +)
  19387. =>WM: (14463: S1 ^operator O2063 +)
  19388. =>WM: (14462: I3 ^dir R)
  19389. =>WM: (14461: O2064 ^name predict-no)
  19390. =>WM: (14460: O2063 ^name predict-yes)
  19391. =>WM: (14459: R1035 ^value 1)
  19392. =>WM: (14458: R1 ^reward R1035)
  19393. <=WM: (14449: S1 ^operator O2061 +)
  19394. <=WM: (14450: S1 ^operator O2062 +)
  19395. <=WM: (14451: S1 ^operator O2062)
  19396. <=WM: (14408: I3 ^dir U)
  19397. <=WM: (14445: R1 ^reward R1034)
  19398. <=WM: (14448: O2062 ^name predict-no)
  19399. <=WM: (14447: O2061 ^name predict-yes)
  19400. <=WM: (14446: R1034 ^value 1)
  19401. --- Inner Elaboration Phase, active level 1 (S1) ---
  19402. Firing prefer*rvt*predict-yes*H0
  19403. -->
  19404. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19405. -->
  19406. (S1 ^operator O2063 = 0.1398795999120246)
  19407. Firing rl*prefer*rvt*predict-yes*H0*3
  19408. -->
  19409. (S1 ^operator O2063 = 0.1844116750068798)
  19410. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19411. -->
  19412. Firing prefer*rvt*predict-no*H0
  19413. -->
  19414. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19415. -->
  19416. (S1 ^operator O2064 = 0.5523816480808952)
  19417. Firing rl*prefer*rvt*predict-no*H0*4
  19418. -->
  19419. (S1 ^operator O2064 = 0.4476197256818795)
  19420. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19421. -->
  19422. inner elaboration loop at bottom goal.
  19423. Retracting rl*prefer*rvt*predict-no*H0*4
  19424. -->
  19425. (S1 ^operator O2062 = 0.4476197256818795)
  19426. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19427. -->
  19428. (S1 ^operator O2062 = 0.5523816480808952)
  19429. Retracting rl*prefer*rvt*predict-yes*H0*3
  19430. -->
  19431. (S1 ^operator O2061 = 0.1844116750068798)
  19432. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19433. -->
  19434. (S1 ^operator O2061 = 0.1398795999120246)
  19435. --- END Proposal Phase ---
  19436. --- Decision Phase ---
  19437. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19438. =>WM: (14465: S1 ^operator O2064)
  19439. 1032: O: O2064 (predict-no)
  19440. --- END Decision Phase ---
  19441. --- Application Phase ---
  19442. --- Firing Productions (PE) For State At Depth 1 ---
  19443. --- Inner Elaboration Phase, active level 1 (S1) ---
  19444. Firing apply*operator
  19445. -->
  19446. (I3 ^predict-no N1032 + :O )
  19447. Firing apply*operator*complete
  19448. -->
  19449. (I3 ^predict-no N1031 - :O )
  19450. inner elaboration loop at bottom goal.
  19451. --- Change Working Memory (PE) ---
  19452. =>WM: (14466: I3 ^predict-no N1032)
  19453. <=WM: (14453: N1031 ^status complete)
  19454. <=WM: (14452: I3 ^predict-no N1031)
  19455. --- Firing Productions (IE) For State At Depth 1 ---
  19456. --- Inner Elaboration Phase, active level 1 (S1) ---
  19457. Firing monitor*world
  19458. -->
  19459. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19460. --- Change Working Memory (IE) ---
  19461. --- END Application Phase ---
  19462. --- Output Phase ---
  19463. ENV: Agent did: predict-no for direction R in state State-B
  19464. In State-B moving R
  19465. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19466. predict error 0
  19467. dir: dir isR
  19468. --- END Output Phase ---
  19469. |\---- Input Phase ---
  19470. =>WM: (14470: I2 ^dir R)
  19471. =>WM: (14469: I2 ^reward 1)
  19472. =>WM: (14468: I2 ^see 0)
  19473. =>WM: (14467: N1032 ^status complete)
  19474. <=WM: (14456: I2 ^dir R)
  19475. <=WM: (14455: I2 ^reward 1)
  19476. <=WM: (14454: I2 ^see 0)
  19477. =>WM: (14471: I2 ^level-1 R0-root)
  19478. <=WM: (14457: I2 ^level-1 R1-root)
  19479. --- END Input Phase ---
  19480. --- Proposal Phase ---
  19481. --- Inner Elaboration Phase, active level 1 (S1) ---
  19482. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  19483. -->
  19484. (S1 ^operator O2063 = 0.1664311307472832)
  19485. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  19486. -->
  19487. (S1 ^operator O2064 = 0.5523799072437727)
  19488. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19489. -->
  19490. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19491. -->
  19492. Firing elaborate*copy-see-to-output-link
  19493. -->
  19494. (I3 ^see 0 +)
  19495. Firing elaborate*reward*based*on*reward
  19496. -->
  19497. (R1036 ^value 1 +)
  19498. (R1 ^reward R1036 +)
  19499. Firing propose*predict-yes
  19500. -->
  19501. (O2065 ^name predict-yes +)
  19502. (S1 ^operator O2065 +)
  19503. Firing propose*predict-no
  19504. -->
  19505. (O2066 ^name predict-no +)
  19506. (S1 ^operator O2066 +)
  19507. Firing rl*prefer*rvt*predict-no*H0*4
  19508. -->
  19509. (S1 ^operator O2064 = 0.4476197256818795)
  19510. Firing rl*prefer*rvt*predict-yes*H0*3
  19511. -->
  19512. (S1 ^operator O2063 = 0.1844116750068798)
  19513. Firing prefer*rvt*predict-yes*H0
  19514. -->
  19515. Firing prefer*rvt*predict-no*H0
  19516. -->
  19517. Firing elaborate*copy-dir-to-output-link
  19518. -->
  19519. (I3 ^dir R +)
  19520. inner elaboration loop at bottom goal.
  19521. Retracting elaborate*copy-see-to-output-link
  19522. -->
  19523. (I3 ^see 0 +)
  19524. Retracting propose*predict-no
  19525. -->
  19526. (O2064 ^name predict-no +)
  19527. (S1 ^operator O2064 +)
  19528. Retracting propose*predict-yes
  19529. -->
  19530. (O2063 ^name predict-yes +)
  19531. (S1 ^operator O2063 +)
  19532. Retracting elaborate*reward*based*on*reward
  19533. -->
  19534. (R1035 ^value 1 +)
  19535. (R1 ^reward R1035 +)
  19536. Retracting elaborate*copy-dir-to-output-link
  19537. -->
  19538. (I3 ^dir R +)
  19539. Retracting rl*prefer*rvt*predict-no*H0*4
  19540. -->
  19541. (S1 ^operator O2064 = 0.4476197256818795)
  19542. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19543. -->
  19544. (S1 ^operator O2064 = 0.5523816480808952)
  19545. Retracting rl*prefer*rvt*predict-yes*H0*3
  19546. -->
  19547. (S1 ^operator O2063 = 0.1844116750068798)
  19548. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19549. -->
  19550. (S1 ^operator O2063 = 0.1398795999120246)
  19551. =>WM: (14477: S1 ^operator O2066 +)
  19552. =>WM: (14476: S1 ^operator O2065 +)
  19553. =>WM: (14475: O2066 ^name predict-no)
  19554. =>WM: (14474: O2065 ^name predict-yes)
  19555. =>WM: (14473: R1036 ^value 1)
  19556. =>WM: (14472: R1 ^reward R1036)
  19557. <=WM: (14463: S1 ^operator O2063 +)
  19558. <=WM: (14464: S1 ^operator O2064 +)
  19559. <=WM: (14465: S1 ^operator O2064)
  19560. <=WM: (14458: R1 ^reward R1035)
  19561. <=WM: (14461: O2064 ^name predict-no)
  19562. <=WM: (14460: O2063 ^name predict-yes)
  19563. <=WM: (14459: R1035 ^value 1)
  19564. --- Inner Elaboration Phase, active level 1 (S1) ---
  19565. Firing prefer*rvt*predict-yes*H0
  19566. -->
  19567. Firing rl*prefer*rvt*predict-yes*H0*3
  19568. -->
  19569. (S1 ^operator O2065 = 0.1844116750068798)
  19570. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19571. -->
  19572. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  19573. -->
  19574. (S1 ^operator O2065 = 0.1664311307472832)
  19575. Firing prefer*rvt*predict-no*H0
  19576. -->
  19577. Firing rl*prefer*rvt*predict-no*H0*4
  19578. -->
  19579. (S1 ^operator O2066 = 0.4476197256818795)
  19580. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19581. -->
  19582. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  19583. -->
  19584. (S1 ^operator O2066 = 0.5523799072437727)
  19585. inner elaboration loop at bottom goal.
  19586. Retracting rl*prefer*rvt*predict-no*H0*4
  19587. -->
  19588. (S1 ^operator O2064 = 0.4476197256818795)
  19589. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  19590. -->
  19591. (S1 ^operator O2064 = 0.5523799072437727)
  19592. Retracting rl*prefer*rvt*predict-yes*H0*3
  19593. -->
  19594. (S1 ^operator O2063 = 0.1844116750068798)
  19595. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  19596. -->
  19597. (S1 ^operator O2063 = 0.1664311307472832)
  19598. --- END Proposal Phase ---
  19599. --- Decision Phase ---
  19600. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174913 0.44762 -> 0.622533 -0.174914 0.44762(R,m,v=1,0.933333,0.0626866)
  19601. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552382 -> 0.377468 0.174914 0.552381(R,m,v=1,1,0)
  19602. =>WM: (14478: S1 ^operator O2066)
  19603. 1033: O: O2066 (predict-no)
  19604. --- END Decision Phase ---
  19605. --- Application Phase ---
  19606. --- Firing Productions (PE) For State At Depth 1 ---
  19607. --- Inner Elaboration Phase, active level 1 (S1) ---
  19608. Firing apply*operator
  19609. -->
  19610. (I3 ^predict-no N1033 + :O )
  19611. Firing apply*operator*complete
  19612. -->
  19613. (I3 ^predict-no N1032 - :O )
  19614. inner elaboration loop at bottom goal.
  19615. --- Change Working Memory (PE) ---
  19616. =>WM: (14479: I3 ^predict-no N1033)
  19617. <=WM: (14467: N1032 ^status complete)
  19618. <=WM: (14466: I3 ^predict-no N1032)
  19619. --- Firing Productions (IE) For State At Depth 1 ---
  19620. --- Inner Elaboration Phase, active level 1 (S1) ---
  19621. Firing monitor*world
  19622. -->
  19623. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19624. --- Change Working Memory (IE) ---
  19625. --- END Application Phase ---
  19626. --- Output Phase ---
  19627. ENV: Agent did: predict-no for direction R in state State-B
  19628. In State-B moving R
  19629. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19630. predict error 0
  19631. dir: dir isU
  19632. --- END Output Phase ---
  19633. --- Input Phase ---
  19634. =>WM: (14483: I2 ^dir U)
  19635. =>WM: (14482: I2 ^reward 1)
  19636. =>WM: (14481: I2 ^see 0)
  19637. =>WM: (14480: N1033 ^status complete)
  19638. <=WM: (14470: I2 ^dir R)
  19639. <=WM: (14469: I2 ^reward 1)
  19640. <=WM: (14468: I2 ^see 0)
  19641. =>WM: (14484: I2 ^level-1 R0-root)
  19642. <=WM: (14471: I2 ^level-1 R0-root)
  19643. --- END Input Phase ---
  19644. --- Proposal Phase ---
  19645. --- Inner Elaboration Phase, active level 1 (S1) ---
  19646. Firing elaborate*copy-see-to-output-link
  19647. -->
  19648. (I3 ^see 0 +)
  19649. Firing elaborate*reward*based*on*reward
  19650. -->
  19651. (R1037 ^value 1 +)
  19652. (R1 ^reward R1037 +)
  19653. Firing propose*predict-yes
  19654. -->
  19655. (O2067 ^name predict-yes +)
  19656. (S1 ^operator O2067 +)
  19657. Firing propose*predict-no
  19658. -->
  19659. (O2068 ^name predict-no +)
  19660. (S1 ^operator O2068 +)
  19661. Firing rl*prefer*rvt*predict-no*H0*6
  19662. -->
  19663. (S1 ^operator O2066 = 0.9999999999999999)
  19664. Firing rl*prefer*rvt*predict-yes*H0*5
  19665. -->
  19666. (S1 ^operator O2065 = 0.)
  19667. Firing prefer*rvt*predict-yes*H0
  19668. -->
  19669. Firing prefer*rvt*predict-no*H0
  19670. -->
  19671. Firing elaborate*copy-dir-to-output-link
  19672. -->
  19673. (I3 ^dir U +)
  19674. inner elaboration loop at bottom goal.
  19675. Retracting elaborate*copy-see-to-output-link
  19676. -->
  19677. (I3 ^see 0 +)
  19678. Retracting propose*predict-no
  19679. -->
  19680. (O2066 ^name predict-no +)
  19681. (S1 ^operator O2066 +)
  19682. Retracting propose*predict-yes
  19683. -->
  19684. (O2065 ^name predict-yes +)
  19685. (S1 ^operator O2065 +)
  19686. Retracting elaborate*reward*based*on*reward
  19687. -->
  19688. (R1036 ^value 1 +)
  19689. (R1 ^reward R1036 +)
  19690. Retracting elaborate*copy-dir-to-output-link
  19691. -->
  19692. (I3 ^dir R +)
  19693. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  19694. -->
  19695. (S1 ^operator O2066 = 0.5523799072437727)
  19696. Retracting rl*prefer*rvt*predict-no*H0*4
  19697. -->
  19698. (S1 ^operator O2066 = 0.4476195196174632)
  19699. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  19700. -->
  19701. (S1 ^operator O2065 = 0.1664311307472832)
  19702. Retracting rl*prefer*rvt*predict-yes*H0*3
  19703. -->
  19704. (S1 ^operator O2065 = 0.1844116750068798)
  19705. =>WM: (14491: S1 ^operator O2068 +)
  19706. =>WM: (14490: S1 ^operator O2067 +)
  19707. =>WM: (14489: I3 ^dir U)
  19708. =>WM: (14488: O2068 ^name predict-no)
  19709. =>WM: (14487: O2067 ^name predict-yes)
  19710. =>WM: (14486: R1037 ^value 1)
  19711. =>WM: (14485: R1 ^reward R1037)
  19712. <=WM: (14476: S1 ^operator O2065 +)
  19713. <=WM: (14477: S1 ^operator O2066 +)
  19714. <=WM: (14478: S1 ^operator O2066)
  19715. <=WM: (14462: I3 ^dir R)
  19716. <=WM: (14472: R1 ^reward R1036)
  19717. <=WM: (14475: O2066 ^name predict-no)
  19718. <=WM: (14474: O2065 ^name predict-yes)
  19719. <=WM: (14473: R1036 ^value 1)
  19720. --- Inner Elaboration Phase, active level 1 (S1) ---
  19721. Firing prefer*rvt*predict-yes*H0
  19722. -->
  19723. Firing rl*prefer*rvt*predict-yes*H0*5
  19724. -->
  19725. (S1 ^operator O2067 = 0.)
  19726. Firing prefer*rvt*predict-no*H0
  19727. -->
  19728. Firing rl*prefer*rvt*predict-no*H0*6
  19729. -->
  19730. (S1 ^operator O2068 = 0.9999999999999999)
  19731. inner elaboration loop at bottom goal.
  19732. Retracting rl*prefer*rvt*predict-no*H0*6
  19733. -->
  19734. (S1 ^operator O2066 = 0.9999999999999999)
  19735. Retracting rl*prefer*rvt*predict-yes*H0*5
  19736. -->
  19737. (S1 ^operator O2065 = 0.)
  19738. --- END Proposal Phase ---
  19739. --- Decision Phase ---
  19740. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.44762 -> 0.622533 -0.174913 0.44762(R,m,v=1,0.933824,0.0622549)
  19741. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377466 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  19742. =>WM: (14492: S1 ^operator O2068)
  19743. 1034: O: O2068 (predict-no)
  19744. --- END Decision Phase ---
  19745. --- Application Phase ---
  19746. --- Firing Productions (PE) For State At Depth 1 ---
  19747. --- Inner Elaboration Phase, active level 1 (S1) ---
  19748. Firing apply*operator
  19749. -->
  19750. (I3 ^predict-no N1034 + :O )
  19751. Firing apply*operator*complete
  19752. -->
  19753. (I3 ^predict-no N1033 - :O )
  19754. inner elaboration loop at bottom goal.
  19755. --- Change Working Memory (PE) ---
  19756. =>WM: (14493: I3 ^predict-no N1034)
  19757. <=WM: (14480: N1033 ^status complete)
  19758. <=WM: (14479: I3 ^predict-no N1033)
  19759. --- Firing Productions (IE) For State At Depth 1 ---
  19760. --- Inner Elaboration Phase, active level 1 (S1) ---
  19761. Firing monitor*world
  19762. -->
  19763. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19764. --- Change Working Memory (IE) ---
  19765. --- END Application Phase ---
  19766. --- Output Phase ---
  19767. ENV: Agent did: predict-no for direction U in state State-B
  19768. In State-B moving U
  19769. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19770. predict error 0
  19771. dir: dir isL
  19772. --- END Output Phase ---
  19773. --- Input Phase ---
  19774. =>WM: (14497: I2 ^dir L)
  19775. =>WM: (14496: I2 ^reward 1)
  19776. =>WM: (14495: I2 ^see 0)
  19777. =>WM: (14494: N1034 ^status complete)
  19778. <=WM: (14483: I2 ^dir U)
  19779. <=WM: (14482: I2 ^reward 1)
  19780. <=WM: (14481: I2 ^see 0)
  19781. =>WM: (14498: I2 ^level-1 R0-root)
  19782. <=WM: (14484: I2 ^level-1 R0-root)
  19783. --- END Input Phase ---
  19784. --- Proposal Phase ---
  19785. --- Inner Elaboration Phase, active level 1 (S1) ---
  19786. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  19787. -->
  19788. (S1 ^operator O2067 = 0.6104609275249895)
  19789. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  19790. -->
  19791. (S1 ^operator O2068 = 0.1063475139796038)
  19792. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19793. -->
  19794. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19795. -->
  19796. Firing elaborate*copy-see-to-output-link
  19797. -->
  19798. (I3 ^see 0 +)
  19799. Firing elaborate*reward*based*on*reward
  19800. -->
  19801. (R1038 ^value 1 +)
  19802. (R1 ^reward R1038 +)
  19803. Firing propose*predict-yes
  19804. -->
  19805. (O2069 ^name predict-yes +)
  19806. (S1 ^operator O2069 +)
  19807. Firing propose*predict-no
  19808. -->
  19809. (O2070 ^name predict-no +)
  19810. (S1 ^operator O2070 +)
  19811. Firing rl*prefer*rvt*predict-no*H0*2
  19812. -->
  19813. (S1 ^operator O2068 = 0.3873355755795274)
  19814. Firing rl*prefer*rvt*predict-yes*H0*1
  19815. -->
  19816. (S1 ^operator O2067 = 0.3895396962582899)
  19817. Firing prefer*rvt*predict-yes*H0
  19818. -->
  19819. Firing prefer*rvt*predict-no*H0
  19820. -->
  19821. Firing elaborate*copy-dir-to-output-link
  19822. -->
  19823. (I3 ^dir L +)
  19824. inner elaboration loop at bottom goal.
  19825. Retracting elaborate*copy-see-to-output-link
  19826. -->
  19827. (I3 ^see 0 +)
  19828. Retracting propose*predict-no
  19829. -->
  19830. (O2068 ^name predict-no +)
  19831. (S1 ^operator O2068 +)
  19832. Retracting propose*predict-yes
  19833. -->
  19834. (O2067 ^name predict-yes +)
  19835. (S1 ^operator O2067 +)
  19836. Retracting elaborate*reward*based*on*reward
  19837. -->
  19838. (R1037 ^value 1 +)
  19839. (R1 ^reward R1037 +)
  19840. Retracting elaborate*copy-dir-to-output-link
  19841. -->
  19842. (I3 ^dir U +)
  19843. Retracting rl*prefer*rvt*predict-no*H0*6
  19844. -->
  19845. (S1 ^operator O2068 = 0.9999999999999999)
  19846. Retracting rl*prefer*rvt*predict-yes*H0*5
  19847. -->
  19848. (S1 ^operator O2067 = 0.)
  19849. =>WM: (14505: S1 ^operator O2070 +)
  19850. =>WM: (14504: S1 ^operator O2069 +)
  19851. =>WM: (14503: I3 ^dir L)
  19852. =>WM: (14502: O2070 ^name predict-no)
  19853. =>WM: (14501: O2069 ^name predict-yes)
  19854. =>WM: (14500: R1038 ^value 1)
  19855. =>WM: (14499: R1 ^reward R1038)
  19856. <=WM: (14490: S1 ^operator O2067 +)
  19857. <=WM: (14491: S1 ^operator O2068 +)
  19858. <=WM: (14492: S1 ^operator O2068)
  19859. <=WM: (14489: I3 ^dir U)
  19860. <=WM: (14485: R1 ^reward R1037)
  19861. <=WM: (14488: O2068 ^name predict-no)
  19862. <=WM: (14487: O2067 ^name predict-yes)
  19863. <=WM: (14486: R1037 ^value 1)
  19864. --- Inner Elaboration Phase, active level 1 (S1) ---
  19865. Firing prefer*rvt*predict-yes*H0
  19866. -->
  19867. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  19868. -->
  19869. (S1 ^operator O2069 = 0.6104609275249895)
  19870. Firing rl*prefer*rvt*predict-yes*H0*1
  19871. -->
  19872. (S1 ^operator O2069 = 0.3895396962582899)
  19873. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  19874. -->
  19875. Firing prefer*rvt*predict-no*H0
  19876. -->
  19877. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  19878. -->
  19879. (S1 ^operator O2070 = 0.1063475139796038)
  19880. Firing rl*prefer*rvt*predict-no*H0*2
  19881. -->
  19882. (S1 ^operator O2070 = 0.3873355755795274)
  19883. Firing prefer*rvt*predict-no*H0*2*v1*H1
  19884. -->
  19885. inner elaboration loop at bottom goal.
  19886. Retracting rl*prefer*rvt*predict-no*H0*2
  19887. -->
  19888. (S1 ^operator O2068 = 0.3873355755795274)
  19889. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  19890. -->
  19891. (S1 ^operator O2068 = 0.1063475139796038)
  19892. Retracting rl*prefer*rvt*predict-yes*H0*1
  19893. -->
  19894. (S1 ^operator O2067 = 0.3895396962582899)
  19895. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  19896. -->
  19897. (S1 ^operator O2067 = 0.6104609275249895)
  19898. --- END Proposal Phase ---
  19899. --- Decision Phase ---
  19900. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19901. =>WM: (14506: S1 ^operator O2069)
  19902. 1035: O: O2069 (predict-yes)
  19903. --- END Decision Phase ---
  19904. --- Application Phase ---
  19905. --- Firing Productions (PE) For State At Depth 1 ---
  19906. --- Inner Elaboration Phase, active level 1 (S1) ---
  19907. Firing apply*operator
  19908. -->
  19909. (I3 ^predict-yes N1035 + :O )
  19910. Firing apply*operator*complete
  19911. -->
  19912. (I3 ^predict-no N1034 - :O )
  19913. inner elaboration loop at bottom goal.
  19914. --- Change Working Memory (PE) ---
  19915. =>WM: (14507: I3 ^predict-yes N1035)
  19916. <=WM: (14494: N1034 ^status complete)
  19917. <=WM: (14493: I3 ^predict-no N1034)
  19918. --- Firing Productions (IE) For State At Depth 1 ---
  19919. --- Inner Elaboration Phase, active level 1 (S1) ---
  19920. Firing monitor*world
  19921. -->
  19922. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19923. --- Change Working Memory (IE) ---
  19924. --- END Application Phase ---
  19925. --- Output Phase ---
  19926. ENV: Agent did: predict-yes for direction L in state State-B
  19927. In State-B moving L
  19928. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  19929. predict error 0
  19930. dir: dir isU
  19931. --- END Output Phase ---
  19932. --- Input Phase ---
  19933. =>WM: (14511: I2 ^dir U)
  19934. =>WM: (14510: I2 ^reward 1)
  19935. =>WM: (14509: I2 ^see 1)
  19936. =>WM: (14508: N1035 ^status complete)
  19937. <=WM: (14497: I2 ^dir L)
  19938. <=WM: (14496: I2 ^reward 1)
  19939. <=WM: (14495: I2 ^see 0)
  19940. =>WM: (14512: I2 ^level-1 L1-root)
  19941. <=WM: (14498: I2 ^level-1 R0-root)
  19942. --- END Input Phase ---
  19943. --- Proposal Phase ---
  19944. --- Inner Elaboration Phase, active level 1 (S1) ---
  19945. Firing elaborate*copy-see-to-output-link
  19946. -->
  19947. (I3 ^see 1 +)
  19948. Firing elaborate*reward*based*on*reward
  19949. -->
  19950. (R1039 ^value 1 +)
  19951. (R1 ^reward R1039 +)
  19952. Firing propose*predict-yes
  19953. -->
  19954. (O2071 ^name predict-yes +)
  19955. (S1 ^operator O2071 +)
  19956. Firing propose*predict-no
  19957. -->
  19958. (O2072 ^name predict-no +)
  19959. (S1 ^operator O2072 +)
  19960. Firing rl*prefer*rvt*predict-no*H0*6
  19961. -->
  19962. (S1 ^operator O2070 = 0.9999999999999999)
  19963. Firing rl*prefer*rvt*predict-yes*H0*5
  19964. -->
  19965. (S1 ^operator O2069 = 0.)
  19966. Firing prefer*rvt*predict-yes*H0
  19967. -->
  19968. Firing prefer*rvt*predict-no*H0
  19969. -->
  19970. Firing elaborate*copy-dir-to-output-link
  19971. -->
  19972. (I3 ^dir U +)
  19973. inner elaboration loop at bottom goal.
  19974. Retracting elaborate*copy-see-to-output-link
  19975. -->
  19976. (I3 ^see 0 +)
  19977. Retracting propose*predict-no
  19978. -->
  19979. (O2070 ^name predict-no +)
  19980. (S1 ^operator O2070 +)
  19981. Retracting propose*predict-yes
  19982. -->
  19983. (O2069 ^name predict-yes +)
  19984. (S1 ^operator O2069 +)
  19985. Retracting elaborate*reward*based*on*reward
  19986. -->
  19987. (R1038 ^value 1 +)
  19988. (R1 ^reward R1038 +)
  19989. Retracting elaborate*copy-dir-to-output-link
  19990. -->
  19991. (I3 ^dir L +)
  19992. Retracting rl*prefer*rvt*predict-no*H0*2
  19993. -->
  19994. (S1 ^operator O2070 = 0.3873355755795274)
  19995. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  19996. -->
  19997. (S1 ^operator O2070 = 0.1063475139796038)
  19998. Retracting rl*prefer*rvt*predict-yes*H0*1
  19999. -->
  20000. (S1 ^operator O2069 = 0.3895396962582899)
  20001. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  20002. -->
  20003. (S1 ^operator O2069 = 0.6104609275249895)
  20004. =>WM: (14520: S1 ^operator O2072 +)
  20005. =>WM: (14519: S1 ^operator O2071 +)
  20006. =>WM: (14518: I3 ^dir U)
  20007. =>WM: (14517: O2072 ^name predict-no)
  20008. =>WM: (14516: O2071 ^name predict-yes)
  20009. =>WM: (14515: R1039 ^value 1)
  20010. =>WM: (14514: R1 ^reward R1039)
  20011. =>WM: (14513: I3 ^see 1)
  20012. <=WM: (14504: S1 ^operator O2069 +)
  20013. <=WM: (14506: S1 ^operator O2069)
  20014. <=WM: (14505: S1 ^operator O2070 +)
  20015. <=WM: (14503: I3 ^dir L)
  20016. <=WM: (14499: R1 ^reward R1038)
  20017. <=WM: (14418: I3 ^see 0)
  20018. <=WM: (14502: O2070 ^name predict-no)
  20019. <=WM: (14501: O2069 ^name predict-yes)
  20020. <=WM: (14500: R1038 ^value 1)
  20021. --- Inner Elaboration Phase, active level 1 (S1) ---
  20022. Firing prefer*rvt*predict-yes*H0
  20023. -->
  20024. Firing rl*prefer*rvt*predict-yes*H0*5
  20025. -->
  20026. (S1 ^operator O2071 = 0.)
  20027. Firing prefer*rvt*predict-no*H0
  20028. -->
  20029. Firing rl*prefer*rvt*predict-no*H0*6
  20030. -->
  20031. (S1 ^operator O2072 = 0.9999999999999999)
  20032. inner elaboration loop at bottom goal.
  20033. Retracting rl*prefer*rvt*predict-no*H0*6
  20034. -->
  20035. (S1 ^operator O2070 = 0.9999999999999999)
  20036. Retracting rl*prefer*rvt*predict-yes*H0*5
  20037. -->
  20038. (S1 ^operator O2069 = 0.)
  20039. --- END Proposal Phase ---
  20040. --- Decision Phase ---
  20041. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.895954,0.0937626)
  20042. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  20043. =>WM: (14521: S1 ^operator O2072)
  20044. 1036: O: O2072 (predict-no)
  20045. --- END Decision Phase ---
  20046. --- Application Phase ---
  20047. --- Firing Productions (PE) For State At Depth 1 ---
  20048. --- Inner Elaboration Phase, active level 1 (S1) ---
  20049. Firing apply*operator
  20050. -->
  20051. (I3 ^predict-no N1036 + :O )
  20052. Firing apply*operator*complete
  20053. -->
  20054. (I3 ^predict-yes N1035 - :O )
  20055. inner elaboration loop at bottom goal.
  20056. --- Change Working Memory (PE) ---
  20057. =>WM: (14522: I3 ^predict-no N1036)
  20058. <=WM: (14508: N1035 ^status complete)
  20059. <=WM: (14507: I3 ^predict-yes N1035)
  20060. --- Firing Productions (IE) For State At Depth 1 ---
  20061. --- Inner Elaboration Phase, active level 1 (S1) ---
  20062. Firing monitor*world
  20063. -->
  20064. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20065. --- Change Working Memory (IE) ---
  20066. --- END Application Phase ---
  20067. --- Output Phase ---
  20068. ENV: Agent did: predict-no for direction U in state State-A
  20069. In State-A moving U
  20070. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20071. predict error 0
  20072. dir: dir isL
  20073. --- END Output Phase ---
  20074. --- Input Phase ---
  20075. =>WM: (14526: I2 ^dir L)
  20076. =>WM: (14525: I2 ^reward 1)
  20077. =>WM: (14524: I2 ^see 0)
  20078. =>WM: (14523: N1036 ^status complete)
  20079. <=WM: (14511: I2 ^dir U)
  20080. <=WM: (14510: I2 ^reward 1)
  20081. <=WM: (14509: I2 ^see 1)
  20082. =>WM: (14527: I2 ^level-1 L1-root)
  20083. <=WM: (14512: I2 ^level-1 L1-root)
  20084. --- END Input Phase ---
  20085. --- Proposal Phase ---
  20086. --- Inner Elaboration Phase, active level 1 (S1) ---
  20087. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  20088. -->
  20089. (S1 ^operator O2072 = 0.6126630510169757)
  20090. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  20091. -->
  20092. (S1 ^operator O2071 = -0.02274740735326741)
  20093. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20094. -->
  20095. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20096. -->
  20097. Firing elaborate*copy-see-to-output-link
  20098. -->
  20099. (I3 ^see 0 +)
  20100. Firing elaborate*reward*based*on*reward
  20101. -->
  20102. (R1040 ^value 1 +)
  20103. (R1 ^reward R1040 +)
  20104. Firing propose*predict-yes
  20105. -->
  20106. (O2073 ^name predict-yes +)
  20107. (S1 ^operator O2073 +)
  20108. Firing propose*predict-no
  20109. -->
  20110. (O2074 ^name predict-no +)
  20111. (S1 ^operator O2074 +)
  20112. Firing rl*prefer*rvt*predict-no*H0*2
  20113. -->
  20114. (S1 ^operator O2072 = 0.3873355755795274)
  20115. Firing rl*prefer*rvt*predict-yes*H0*1
  20116. -->
  20117. (S1 ^operator O2071 = 0.389539602690798)
  20118. Firing prefer*rvt*predict-yes*H0
  20119. -->
  20120. Firing prefer*rvt*predict-no*H0
  20121. -->
  20122. Firing elaborate*copy-dir-to-output-link
  20123. -->
  20124. (I3 ^dir L +)
  20125. inner elaboration loop at bottom goal.
  20126. Retracting elaborate*copy-see-to-output-link
  20127. -->
  20128. (I3 ^see 1 +)
  20129. Retracting propose*predict-no
  20130. -->
  20131. (O2072 ^name predict-no +)
  20132. (S1 ^operator O2072 +)
  20133. Retracting propose*predict-yes
  20134. -->
  20135. (O2071 ^name predict-yes +)
  20136. (S1 ^operator O2071 +)
  20137. Retracting elaborate*reward*based*on*reward
  20138. -->
  20139. (R1039 ^value 1 +)
  20140. (R1 ^reward R1039 +)
  20141. Retracting elaborate*copy-dir-to-output-link
  20142. -->
  20143. (I3 ^dir U +)
  20144. Retracting rl*prefer*rvt*predict-no*H0*6
  20145. -->
  20146. (S1 ^operator O2072 = 0.9999999999999999)
  20147. Retracting rl*prefer*rvt*predict-yes*H0*5
  20148. -->
  20149. (S1 ^operator O2071 = 0.)
  20150. =>WM: (14535: S1 ^operator O2074 +)
  20151. =>WM: (14534: S1 ^operator O2073 +)
  20152. =>WM: (14533: I3 ^dir L)
  20153. =>WM: (14532: O2074 ^name predict-no)
  20154. =>WM: (14531: O2073 ^name predict-yes)
  20155. =>WM: (14530: R1040 ^value 1)
  20156. =>WM: (14529: R1 ^reward R1040)
  20157. =>WM: (14528: I3 ^see 0)
  20158. <=WM: (14519: S1 ^operator O2071 +)
  20159. <=WM: (14520: S1 ^operator O2072 +)
  20160. <=WM: (14521: S1 ^operator O2072)
  20161. <=WM: (14518: I3 ^dir U)
  20162. <=WM: (14514: R1 ^reward R1039)
  20163. <=WM: (14513: I3 ^see 1)
  20164. <=WM: (14517: O2072 ^name predict-no)
  20165. <=WM: (14516: O2071 ^name predict-yes)
  20166. <=WM: (14515: R1039 ^value 1)
  20167. --- Inner Elaboration Phase, active level 1 (S1) ---
  20168. Firing prefer*rvt*predict-yes*H0
  20169. -->
  20170. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  20171. -->
  20172. (S1 ^operator O2073 = -0.02274740735326741)
  20173. Firing rl*prefer*rvt*predict-yes*H0*1
  20174. -->
  20175. (S1 ^operator O2073 = 0.389539602690798)
  20176. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20177. -->
  20178. Firing prefer*rvt*predict-no*H0
  20179. -->
  20180. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  20181. -->
  20182. (S1 ^operator O2074 = 0.6126630510169757)
  20183. Firing rl*prefer*rvt*predict-no*H0*2
  20184. -->
  20185. (S1 ^operator O2074 = 0.3873355755795274)
  20186. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20187. -->
  20188. inner elaboration loop at bottom goal.
  20189. Retracting rl*prefer*rvt*predict-no*H0*2
  20190. -->
  20191. (S1 ^operator O2072 = 0.3873355755795274)
  20192. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  20193. -->
  20194. (S1 ^operator O2072 = 0.6126630510169757)
  20195. Retracting rl*prefer*rvt*predict-yes*H0*1
  20196. -->
  20197. (S1 ^operator O2071 = 0.389539602690798)
  20198. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  20199. -->
  20200. (S1 ^operator O2071 = -0.02274740735326741)
  20201. --- END Proposal Phase ---
  20202. --- Decision Phase ---
  20203. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20204. =>WM: (14536: S1 ^operator O2074)
  20205. 1037: O: O2074 (predict-no)
  20206. --- END Decision Phase ---
  20207. --- Application Phase ---
  20208. --- Firing Productions (PE) For State At Depth 1 ---
  20209. --- Inner Elaboration Phase, active level 1 (S1) ---
  20210. Firing apply*operator
  20211. -->
  20212. (I3 ^predict-no N1037 + :O )
  20213. Firing apply*operator*complete
  20214. -->
  20215. (I3 ^predict-no N1036 - :O )
  20216. inner elaboration loop at bottom goal.
  20217. --- Change Working Memory (PE) ---
  20218. =>WM: (14537: I3 ^predict-no N1037)
  20219. <=WM: (14523: N1036 ^status complete)
  20220. <=WM: (14522: I3 ^predict-no N1036)
  20221. --- Firing Productions (IE) For State At Depth 1 ---
  20222. --- Inner Elaboration Phase, active level 1 (S1) ---
  20223. Firing monitor*world
  20224. -->
  20225. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20226. --- Change Working Memory (IE) ---
  20227. --- END Application Phase ---
  20228. --- Output Phase ---
  20229. ENV: Agent did: predict-no for direction L in state State-A
  20230. In State-A moving L
  20231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20232. predict error 0
  20233. dir: dir isR
  20234. --- END Output Phase ---
  20235. --- Input Phase ---
  20236. =>WM: (14541: I2 ^dir R)
  20237. =>WM: (14540: I2 ^reward 1)
  20238. =>WM: (14539: I2 ^see 0)
  20239. =>WM: (14538: N1037 ^status complete)
  20240. <=WM: (14526: I2 ^dir L)
  20241. <=WM: (14525: I2 ^reward 1)
  20242. <=WM: (14524: I2 ^see 0)
  20243. =>WM: (14542: I2 ^level-1 L0-root)
  20244. <=WM: (14527: I2 ^level-1 L1-root)
  20245. --- END Input Phase ---
  20246. --- Proposal Phase ---
  20247. --- Inner Elaboration Phase, active level 1 (S1) ---
  20248. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20249. -->
  20250. (S1 ^operator O2073 = 0.8155914233894487)
  20251. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20252. -->
  20253. (S1 ^operator O2074 = -0.00558448899823713)
  20254. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20255. -->
  20256. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20257. -->
  20258. Firing elaborate*copy-see-to-output-link
  20259. -->
  20260. (I3 ^see 0 +)
  20261. Firing elaborate*reward*based*on*reward
  20262. -->
  20263. (R1041 ^value 1 +)
  20264. (R1 ^reward R1041 +)
  20265. Firing propose*predict-yes
  20266. -->
  20267. (O2075 ^name predict-yes +)
  20268. (S1 ^operator O2075 +)
  20269. Firing propose*predict-no
  20270. -->
  20271. (O2076 ^name predict-no +)
  20272. (S1 ^operator O2076 +)
  20273. Firing rl*prefer*rvt*predict-no*H0*4
  20274. -->
  20275. (S1 ^operator O2074 = 0.4476196055882778)
  20276. Firing rl*prefer*rvt*predict-yes*H0*3
  20277. -->
  20278. (S1 ^operator O2073 = 0.1844116750068798)
  20279. Firing prefer*rvt*predict-yes*H0
  20280. -->
  20281. Firing prefer*rvt*predict-no*H0
  20282. -->
  20283. Firing elaborate*copy-dir-to-output-link
  20284. -->
  20285. (I3 ^dir R +)
  20286. inner elaboration loop at bottom goal.
  20287. Retracting elaborate*copy-see-to-output-link
  20288. -->
  20289. (I3 ^see 0 +)
  20290. Retracting propose*predict-no
  20291. -->
  20292. (O2074 ^name predict-no +)
  20293. (S1 ^operator O2074 +)
  20294. Retracting propose*predict-yes
  20295. -->
  20296. (O2073 ^name predict-yes +)
  20297. (S1 ^operator O2073 +)
  20298. Retracting elaborate*reward*based*on*reward
  20299. -->
  20300. (R1040 ^value 1 +)
  20301. (R1 ^reward R1040 +)
  20302. Retracting elaborate*copy-dir-to-output-link
  20303. -->
  20304. (I3 ^dir L +)
  20305. Retracting rl*prefer*rvt*predict-no*H0*2
  20306. -->
  20307. (S1 ^operator O2074 = 0.3873355755795274)
  20308. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  20309. -->
  20310. (S1 ^operator O2074 = 0.6126630510169757)
  20311. Retracting rl*prefer*rvt*predict-yes*H0*1
  20312. -->
  20313. (S1 ^operator O2073 = 0.389539602690798)
  20314. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  20315. -->
  20316. (S1 ^operator O2073 = -0.02274740735326741)
  20317. =>WM: (14549: S1 ^operator O2076 +)
  20318. =>WM: (14548: S1 ^operator O2075 +)
  20319. =>WM: (14547: I3 ^dir R)
  20320. =>WM: (14546: O2076 ^name predict-no)
  20321. =>WM: (14545: O2075 ^name predict-yes)
  20322. =>WM: (14544: R1041 ^value 1)
  20323. =>WM: (14543: R1 ^reward R1041)
  20324. <=WM: (14534: S1 ^operator O2073 +)
  20325. <=WM: (14535: S1 ^operator O2074 +)
  20326. <=WM: (14536: S1 ^operator O2074)
  20327. <=WM: (14533: I3 ^dir L)
  20328. <=WM: (14529: R1 ^reward R1040)
  20329. <=WM: (14532: O2074 ^name predict-no)
  20330. <=WM: (14531: O2073 ^name predict-yes)
  20331. <=WM: (14530: R1040 ^value 1)
  20332. --- Inner Elaboration Phase, active level 1 (S1) ---
  20333. Firing prefer*rvt*predict-yes*H0
  20334. -->
  20335. Firing rl*prefer*rvt*predict-yes*H0*3
  20336. -->
  20337. (S1 ^operator O2075 = 0.1844116750068798)
  20338. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20339. -->
  20340. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20341. -->
  20342. (S1 ^operator O2075 = 0.8155914233894487)
  20343. Firing prefer*rvt*predict-no*H0
  20344. -->
  20345. Firing rl*prefer*rvt*predict-no*H0*4
  20346. -->
  20347. (S1 ^operator O2076 = 0.4476196055882778)
  20348. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20349. -->
  20350. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20351. -->
  20352. (S1 ^operator O2076 = -0.00558448899823713)
  20353. inner elaboration loop at bottom goal.
  20354. Retracting rl*prefer*rvt*predict-no*H0*4
  20355. -->
  20356. (S1 ^operator O2074 = 0.4476196055882778)
  20357. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20358. -->
  20359. (S1 ^operator O2074 = -0.00558448899823713)
  20360. Retracting rl*prefer*rvt*predict-yes*H0*3
  20361. -->
  20362. (S1 ^operator O2073 = 0.1844116750068798)
  20363. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20364. -->
  20365. (S1 ^operator O2073 = 0.8155914233894487)
  20366. --- END Proposal Phase ---
  20367. --- Decision Phase ---
  20368. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.934426,0.0616105)
  20369. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.280919 0.331744 0.612663(R,m,v=1,1,0)
  20370. =>WM: (14550: S1 ^operator O2075)
  20371. 1038: O: O2075 (predict-yes)
  20372. --- END Decision Phase ---
  20373. --- Application Phase ---
  20374. --- Firing Productions (PE) For State At Depth 1 ---
  20375. --- Inner Elaboration Phase, active level 1 (S1) ---
  20376. Firing apply*operator
  20377. -->
  20378. (I3 ^predict-yes N1038 + :O )
  20379. Firing apply*operator*complete
  20380. -->
  20381. (I3 ^predict-no N1037 - :O )
  20382. inner elaboration loop at bottom goal.
  20383. --- Change Working Memory (PE) ---
  20384. =>WM: (14551: I3 ^predict-yes N1038)
  20385. <=WM: (14538: N1037 ^status complete)
  20386. <=WM: (14537: I3 ^predict-no N1037)
  20387. --- Firing Productions (IE) For State At Depth 1 ---
  20388. --- Inner Elaboration Phase, active level 1 (S1) ---
  20389. Firing monitor*world
  20390. -->
  20391. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20392. --- Change Working Memory (IE) ---
  20393. --- END Application Phase ---
  20394. --- Output Phase ---
  20395. ENV: Agent did: predict-yes for direction R in state State-A
  20396. In State-A moving R
  20397. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20398. predict error 0
  20399. dir: dir isR
  20400. --- END Output Phase ---
  20401. --- Input Phase ---
  20402. =>WM: (14555: I2 ^dir R)
  20403. =>WM: (14554: I2 ^reward 1)
  20404. =>WM: (14553: I2 ^see 1)
  20405. =>WM: (14552: N1038 ^status complete)
  20406. <=WM: (14541: I2 ^dir R)
  20407. <=WM: (14540: I2 ^reward 1)
  20408. <=WM: (14539: I2 ^see 0)
  20409. =>WM: (14556: I2 ^level-1 R1-root)
  20410. <=WM: (14542: I2 ^level-1 L0-root)
  20411. --- END Input Phase ---
  20412. --- Proposal Phase ---
  20413. --- Inner Elaboration Phase, active level 1 (S1) ---
  20414. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20415. -->
  20416. (S1 ^operator O2075 = 0.1398795999120246)
  20417. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20418. -->
  20419. (S1 ^operator O2076 = 0.552381442016479)
  20420. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20421. -->
  20422. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20423. -->
  20424. Firing elaborate*copy-see-to-output-link
  20425. -->
  20426. (I3 ^see 1 +)
  20427. Firing elaborate*reward*based*on*reward
  20428. -->
  20429. (R1042 ^value 1 +)
  20430. (R1 ^reward R1042 +)
  20431. Firing propose*predict-yes
  20432. -->
  20433. (O2077 ^name predict-yes +)
  20434. (S1 ^operator O2077 +)
  20435. Firing propose*predict-no
  20436. -->
  20437. (O2078 ^name predict-no +)
  20438. (S1 ^operator O2078 +)
  20439. Firing rl*prefer*rvt*predict-no*H0*4
  20440. -->
  20441. (S1 ^operator O2076 = 0.4476196055882778)
  20442. Firing rl*prefer*rvt*predict-yes*H0*3
  20443. -->
  20444. (S1 ^operator O2075 = 0.1844116750068798)
  20445. Firing prefer*rvt*predict-yes*H0
  20446. -->
  20447. Firing prefer*rvt*predict-no*H0
  20448. -->
  20449. Firing elaborate*copy-dir-to-output-link
  20450. -->
  20451. (I3 ^dir R +)
  20452. inner elaboration loop at bottom goal.
  20453. Retracting elaborate*copy-see-to-output-link
  20454. -->
  20455. (I3 ^see 0 +)
  20456. Retracting propose*predict-no
  20457. -->
  20458. (O2076 ^name predict-no +)
  20459. (S1 ^operator O2076 +)
  20460. Retracting propose*predict-yes
  20461. -->
  20462. (O2075 ^name predict-yes +)
  20463. (S1 ^operator O2075 +)
  20464. Retracting elaborate*reward*based*on*reward
  20465. -->
  20466. (R1041 ^value 1 +)
  20467. (R1 ^reward R1041 +)
  20468. Retracting elaborate*copy-dir-to-output-link
  20469. -->
  20470. (I3 ^dir R +)
  20471. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20472. -->
  20473. (S1 ^operator O2076 = -0.00558448899823713)
  20474. Retracting rl*prefer*rvt*predict-no*H0*4
  20475. -->
  20476. (S1 ^operator O2076 = 0.4476196055882778)
  20477. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20478. -->
  20479. (S1 ^operator O2075 = 0.8155914233894487)
  20480. Retracting rl*prefer*rvt*predict-yes*H0*3
  20481. -->
  20482. (S1 ^operator O2075 = 0.1844116750068798)
  20483. =>WM: (14563: S1 ^operator O2078 +)
  20484. =>WM: (14562: S1 ^operator O2077 +)
  20485. =>WM: (14561: O2078 ^name predict-no)
  20486. =>WM: (14560: O2077 ^name predict-yes)
  20487. =>WM: (14559: R1042 ^value 1)
  20488. =>WM: (14558: R1 ^reward R1042)
  20489. =>WM: (14557: I3 ^see 1)
  20490. <=WM: (14548: S1 ^operator O2075 +)
  20491. <=WM: (14550: S1 ^operator O2075)
  20492. <=WM: (14549: S1 ^operator O2076 +)
  20493. <=WM: (14543: R1 ^reward R1041)
  20494. <=WM: (14528: I3 ^see 0)
  20495. <=WM: (14546: O2076 ^name predict-no)
  20496. <=WM: (14545: O2075 ^name predict-yes)
  20497. <=WM: (14544: R1041 ^value 1)
  20498. --- Inner Elaboration Phase, active level 1 (S1) ---
  20499. Firing prefer*rvt*predict-yes*H0
  20500. -->
  20501. Firing rl*prefer*rvt*predict-yes*H0*3
  20502. -->
  20503. (S1 ^operator O2077 = 0.1844116750068798)
  20504. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20505. -->
  20506. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20507. -->
  20508. (S1 ^operator O2077 = 0.1398795999120246)
  20509. Firing prefer*rvt*predict-no*H0
  20510. -->
  20511. Firing rl*prefer*rvt*predict-no*H0*4
  20512. -->
  20513. (S1 ^operator O2078 = 0.4476196055882778)
  20514. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20515. -->
  20516. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20517. -->
  20518. (S1 ^operator O2078 = 0.552381442016479)
  20519. inner elaboration loop at bottom goal.
  20520. Retracting rl*prefer*rvt*predict-no*H0*4
  20521. -->
  20522. (S1 ^operator O2076 = 0.4476196055882778)
  20523. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20524. -->
  20525. (S1 ^operator O2076 = 0.552381442016479)
  20526. Retracting rl*prefer*rvt*predict-yes*H0*3
  20527. -->
  20528. (S1 ^operator O2075 = 0.1844116750068798)
  20529. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20530. -->
  20531. (S1 ^operator O2075 = 0.1398795999120246)
  20532. --- END Proposal Phase ---
  20533. --- Decision Phase ---
  20534. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184412 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.903409,0.0877597)
  20535. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.32459 0.491002 0.815591 -> 0.324589 0.491002 0.815591(R,m,v=1,1,0)
  20536. =>WM: (14564: S1 ^operator O2078)
  20537. 1039: O: O2078 (predict-no)
  20538. --- END Decision Phase ---
  20539. --- Application Phase ---
  20540. --- Firing Productions (PE) For State At Depth 1 ---
  20541. --- Inner Elaboration Phase, active level 1 (S1) ---
  20542. Firing apply*operator
  20543. -->
  20544. (I3 ^predict-no N1039 + :O )
  20545. Firing apply*operator*complete
  20546. -->
  20547. (I3 ^predict-yes N1038 - :O )
  20548. inner elaboration loop at bottom goal.
  20549. --- Change Working Memory (PE) ---
  20550. =>WM: (14565: I3 ^predict-no N1039)
  20551. <=WM: (14552: N1038 ^status complete)
  20552. <=WM: (14551: I3 ^predict-yes N1038)
  20553. --- Firing Productions (IE) For State At Depth 1 ---
  20554. --- Inner Elaboration Phase, active level 1 (S1) ---
  20555. Firing monitor*world
  20556. -->
  20557. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20558. --- Change Working Memory (IE) ---
  20559. --- END Application Phase ---
  20560. --- Output Phase ---
  20561. ENV: Agent did: predict-no for direction R in state State-B
  20562. In State-B moving R
  20563. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20564. predict error 0
  20565. dir: dir isU
  20566. --- END Output Phase ---
  20567. ---- Input Phase ---
  20568. =>WM: (14569: I2 ^dir U)
  20569. =>WM: (14568: I2 ^reward 1)
  20570. =>WM: (14567: I2 ^see 0)
  20571. =>WM: (14566: N1039 ^status complete)
  20572. <=WM: (14555: I2 ^dir R)
  20573. <=WM: (14554: I2 ^reward 1)
  20574. <=WM: (14553: I2 ^see 1)
  20575. =>WM: (14570: I2 ^level-1 R0-root)
  20576. <=WM: (14556: I2 ^level-1 R1-root)
  20577. --- END Input Phase ---
  20578. --- Proposal Phase ---
  20579. --- Inner Elaboration Phase, active level 1 (S1) ---
  20580. Firing elaborate*copy-see-to-output-link
  20581. -->
  20582. (I3 ^see 0 +)
  20583. Firing elaborate*reward*based*on*reward
  20584. -->
  20585. (R1043 ^value 1 +)
  20586. (R1 ^reward R1043 +)
  20587. Firing propose*predict-yes
  20588. -->
  20589. (O2079 ^name predict-yes +)
  20590. (S1 ^operator O2079 +)
  20591. Firing propose*predict-no
  20592. -->
  20593. (O2080 ^name predict-no +)
  20594. (S1 ^operator O2080 +)
  20595. Firing rl*prefer*rvt*predict-no*H0*6
  20596. -->
  20597. (S1 ^operator O2078 = 0.9999999999999999)
  20598. Firing rl*prefer*rvt*predict-yes*H0*5
  20599. -->
  20600. (S1 ^operator O2077 = 0.)
  20601. Firing prefer*rvt*predict-yes*H0
  20602. -->
  20603. Firing prefer*rvt*predict-no*H0
  20604. -->
  20605. Firing elaborate*copy-dir-to-output-link
  20606. -->
  20607. (I3 ^dir U +)
  20608. inner elaboration loop at bottom goal.
  20609. Retracting elaborate*copy-see-to-output-link
  20610. -->
  20611. (I3 ^see 1 +)
  20612. Retracting propose*predict-no
  20613. -->
  20614. (O2078 ^name predict-no +)
  20615. (S1 ^operator O2078 +)
  20616. Retracting propose*predict-yes
  20617. -->
  20618. (O2077 ^name predict-yes +)
  20619. (S1 ^operator O2077 +)
  20620. Retracting elaborate*reward*based*on*reward
  20621. -->
  20622. (R1042 ^value 1 +)
  20623. (R1 ^reward R1042 +)
  20624. Retracting elaborate*copy-dir-to-output-link
  20625. -->
  20626. (I3 ^dir R +)
  20627. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20628. -->
  20629. (S1 ^operator O2078 = 0.552381442016479)
  20630. Retracting rl*prefer*rvt*predict-no*H0*4
  20631. -->
  20632. (S1 ^operator O2078 = 0.4476196055882778)
  20633. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20634. -->
  20635. (S1 ^operator O2077 = 0.1398795999120246)
  20636. Retracting rl*prefer*rvt*predict-yes*H0*3
  20637. -->
  20638. (S1 ^operator O2077 = 0.1844112102474305)
  20639. =>WM: (14578: S1 ^operator O2080 +)
  20640. =>WM: (14577: S1 ^operator O2079 +)
  20641. =>WM: (14576: I3 ^dir U)
  20642. =>WM: (14575: O2080 ^name predict-no)
  20643. =>WM: (14574: O2079 ^name predict-yes)
  20644. =>WM: (14573: R1043 ^value 1)
  20645. =>WM: (14572: R1 ^reward R1043)
  20646. =>WM: (14571: I3 ^see 0)
  20647. <=WM: (14562: S1 ^operator O2077 +)
  20648. <=WM: (14563: S1 ^operator O2078 +)
  20649. <=WM: (14564: S1 ^operator O2078)
  20650. <=WM: (14547: I3 ^dir R)
  20651. <=WM: (14558: R1 ^reward R1042)
  20652. <=WM: (14557: I3 ^see 1)
  20653. <=WM: (14561: O2078 ^name predict-no)
  20654. <=WM: (14560: O2077 ^name predict-yes)
  20655. <=WM: (14559: R1042 ^value 1)
  20656. --- Inner Elaboration Phase, active level 1 (S1) ---
  20657. Firing prefer*rvt*predict-yes*H0
  20658. -->
  20659. Firing rl*prefer*rvt*predict-yes*H0*5
  20660. -->
  20661. (S1 ^operator O2079 = 0.)
  20662. Firing prefer*rvt*predict-no*H0
  20663. -->
  20664. Firing rl*prefer*rvt*predict-no*H0*6
  20665. -->
  20666. (S1 ^operator O2080 = 0.9999999999999999)
  20667. inner elaboration loop at bottom goal.
  20668. Retracting rl*prefer*rvt*predict-no*H0*6
  20669. -->
  20670. (S1 ^operator O2078 = 0.9999999999999999)
  20671. Retracting rl*prefer*rvt*predict-yes*H0*5
  20672. -->
  20673. (S1 ^operator O2077 = 0.)
  20674. --- END Proposal Phase ---
  20675. --- Decision Phase ---
  20676. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174913 0.44762 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.934307,0.0618291)
  20677. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552381 -> 0.377468 0.174914 0.552381(R,m,v=1,1,0)
  20678. =>WM: (14579: S1 ^operator O2080)
  20679. 1040: O: O2080 (predict-no)
  20680. --- END Decision Phase ---
  20681. --- Application Phase ---
  20682. --- Firing Productions (PE) For State At Depth 1 ---
  20683. --- Inner Elaboration Phase, active level 1 (S1) ---
  20684. Firing apply*operator
  20685. -->
  20686. (I3 ^predict-no N1040 + :O )
  20687. Firing apply*operator*complete
  20688. -->
  20689. (I3 ^predict-no N1039 - :O )
  20690. inner elaboration loop at bottom goal.
  20691. --- Change Working Memory (PE) ---
  20692. =>WM: (14580: I3 ^predict-no N1040)
  20693. <=WM: (14566: N1039 ^status complete)
  20694. <=WM: (14565: I3 ^predict-no N1039)
  20695. --- Firing Productions (IE) For State At Depth 1 ---
  20696. --- Inner Elaboration Phase, active level 1 (S1) ---
  20697. Firing monitor*world
  20698. -->
  20699. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20700. --- Change Working Memory (IE) ---
  20701. --- END Application Phase ---
  20702. --- Output Phase ---
  20703. ENV: Agent did: predict-no for direction U in state State-B
  20704. In State-B moving U
  20705. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20706. predict error 0
  20707. dir: dir isL
  20708. --- END Output Phase ---
  20709. /|\--- Input Phase ---
  20710. =>WM: (14584: I2 ^dir L)
  20711. =>WM: (14583: I2 ^reward 1)
  20712. =>WM: (14582: I2 ^see 0)
  20713. =>WM: (14581: N1040 ^status complete)
  20714. <=WM: (14569: I2 ^dir U)
  20715. <=WM: (14568: I2 ^reward 1)
  20716. <=WM: (14567: I2 ^see 0)
  20717. =>WM: (14585: I2 ^level-1 R0-root)
  20718. <=WM: (14570: I2 ^level-1 R0-root)
  20719. --- END Input Phase ---
  20720. --- Proposal Phase ---
  20721. --- Inner Elaboration Phase, active level 1 (S1) ---
  20722. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  20723. -->
  20724. (S1 ^operator O2079 = 0.6104608339574975)
  20725. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  20726. -->
  20727. (S1 ^operator O2080 = 0.1063475139796038)
  20728. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20729. -->
  20730. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20731. -->
  20732. Firing elaborate*copy-see-to-output-link
  20733. -->
  20734. (I3 ^see 0 +)
  20735. Firing elaborate*reward*based*on*reward
  20736. -->
  20737. (R1044 ^value 1 +)
  20738. (R1 ^reward R1044 +)
  20739. Firing propose*predict-yes
  20740. -->
  20741. (O2081 ^name predict-yes +)
  20742. (S1 ^operator O2081 +)
  20743. Firing propose*predict-no
  20744. -->
  20745. (O2082 ^name predict-no +)
  20746. (S1 ^operator O2082 +)
  20747. Firing rl*prefer*rvt*predict-no*H0*2
  20748. -->
  20749. (S1 ^operator O2080 = 0.3873357815900519)
  20750. Firing rl*prefer*rvt*predict-yes*H0*1
  20751. -->
  20752. (S1 ^operator O2079 = 0.389539602690798)
  20753. Firing prefer*rvt*predict-yes*H0
  20754. -->
  20755. Firing prefer*rvt*predict-no*H0
  20756. -->
  20757. Firing elaborate*copy-dir-to-output-link
  20758. -->
  20759. (I3 ^dir L +)
  20760. inner elaboration loop at bottom goal.
  20761. Retracting elaborate*copy-see-to-output-link
  20762. -->
  20763. (I3 ^see 0 +)
  20764. Retracting propose*predict-no
  20765. -->
  20766. (O2080 ^name predict-no +)
  20767. (S1 ^operator O2080 +)
  20768. Retracting propose*predict-yes
  20769. -->
  20770. (O2079 ^name predict-yes +)
  20771. (S1 ^operator O2079 +)
  20772. Retracting elaborate*reward*based*on*reward
  20773. -->
  20774. (R1043 ^value 1 +)
  20775. (R1 ^reward R1043 +)
  20776. Retracting elaborate*copy-dir-to-output-link
  20777. -->
  20778. (I3 ^dir U +)
  20779. Retracting rl*prefer*rvt*predict-no*H0*6
  20780. -->
  20781. (S1 ^operator O2080 = 0.9999999999999999)
  20782. Retracting rl*prefer*rvt*predict-yes*H0*5
  20783. -->
  20784. (S1 ^operator O2079 = 0.)
  20785. =>WM: (14592: S1 ^operator O2082 +)
  20786. =>WM: (14591: S1 ^operator O2081 +)
  20787. =>WM: (14590: I3 ^dir L)
  20788. =>WM: (14589: O2082 ^name predict-no)
  20789. =>WM: (14588: O2081 ^name predict-yes)
  20790. =>WM: (14587: R1044 ^value 1)
  20791. =>WM: (14586: R1 ^reward R1044)
  20792. <=WM: (14577: S1 ^operator O2079 +)
  20793. <=WM: (14578: S1 ^operator O2080 +)
  20794. <=WM: (14579: S1 ^operator O2080)
  20795. <=WM: (14576: I3 ^dir U)
  20796. <=WM: (14572: R1 ^reward R1043)
  20797. <=WM: (14575: O2080 ^name predict-no)
  20798. <=WM: (14574: O2079 ^name predict-yes)
  20799. <=WM: (14573: R1043 ^value 1)
  20800. --- Inner Elaboration Phase, active level 1 (S1) ---
  20801. Firing prefer*rvt*predict-yes*H0
  20802. -->
  20803. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  20804. -->
  20805. (S1 ^operator O2081 = 0.6104608339574975)
  20806. Firing rl*prefer*rvt*predict-yes*H0*1
  20807. -->
  20808. (S1 ^operator O2081 = 0.389539602690798)
  20809. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  20810. -->
  20811. Firing prefer*rvt*predict-no*H0
  20812. -->
  20813. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  20814. -->
  20815. (S1 ^operator O2082 = 0.1063475139796038)
  20816. Firing rl*prefer*rvt*predict-no*H0*2
  20817. -->
  20818. (S1 ^operator O2082 = 0.3873357815900519)
  20819. Firing prefer*rvt*predict-no*H0*2*v1*H1
  20820. -->
  20821. inner elaboration loop at bottom goal.
  20822. Retracting rl*prefer*rvt*predict-no*H0*2
  20823. -->
  20824. (S1 ^operator O2080 = 0.3873357815900519)
  20825. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  20826. -->
  20827. (S1 ^operator O2080 = 0.1063475139796038)
  20828. Retracting rl*prefer*rvt*predict-yes*H0*1
  20829. -->
  20830. (S1 ^operator O2079 = 0.389539602690798)
  20831. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  20832. -->
  20833. (S1 ^operator O2079 = 0.6104608339574975)
  20834. --- END Proposal Phase ---
  20835. --- Decision Phase ---
  20836. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20837. =>WM: (14593: S1 ^operator O2081)
  20838. 1041: O: O2081 (predict-yes)
  20839. --- END Decision Phase ---
  20840. --- Application Phase ---
  20841. --- Firing Productions (PE) For State At Depth 1 ---
  20842. --- Inner Elaboration Phase, active level 1 (S1) ---
  20843. Firing apply*operator
  20844. -->
  20845. (I3 ^predict-yes N1041 + :O )
  20846. Firing apply*operator*complete
  20847. -->
  20848. (I3 ^predict-no N1040 - :O )
  20849. inner elaboration loop at bottom goal.
  20850. --- Change Working Memory (PE) ---
  20851. =>WM: (14594: I3 ^predict-yes N1041)
  20852. <=WM: (14581: N1040 ^status complete)
  20853. <=WM: (14580: I3 ^predict-no N1040)
  20854. --- Firing Productions (IE) For State At Depth 1 ---
  20855. --- Inner Elaboration Phase, active level 1 (S1) ---
  20856. Firing monitor*world
  20857. -->
  20858. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20859. --- Change Working Memory (IE) ---
  20860. --- END Application Phase ---
  20861. --- Output Phase ---
  20862. ENV: Agent did: predict-yes for direction L in state State-B
  20863. In State-B moving L
  20864. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  20865. predict error 0
  20866. dir: dir isU
  20867. --- END Output Phase ---
  20868. ---- Input Phase ---
  20869. =>WM: (14598: I2 ^dir U)
  20870. =>WM: (14597: I2 ^reward 1)
  20871. =>WM: (14596: I2 ^see 1)
  20872. =>WM: (14595: N1041 ^status complete)
  20873. <=WM: (14584: I2 ^dir L)
  20874. <=WM: (14583: I2 ^reward 1)
  20875. <=WM: (14582: I2 ^see 0)
  20876. =>WM: (14599: I2 ^level-1 L1-root)
  20877. <=WM: (14585: I2 ^level-1 R0-root)
  20878. --- END Input Phase ---
  20879. --- Proposal Phase ---
  20880. --- Inner Elaboration Phase, active level 1 (S1) ---
  20881. Firing elaborate*copy-see-to-output-link
  20882. -->
  20883. (I3 ^see 1 +)
  20884. Firing elaborate*reward*based*on*reward
  20885. -->
  20886. (R1045 ^value 1 +)
  20887. (R1 ^reward R1045 +)
  20888. Firing propose*predict-yes
  20889. -->
  20890. (O2083 ^name predict-yes +)
  20891. (S1 ^operator O2083 +)
  20892. Firing propose*predict-no
  20893. -->
  20894. (O2084 ^name predict-no +)
  20895. (S1 ^operator O2084 +)
  20896. Firing rl*prefer*rvt*predict-no*H0*6
  20897. -->
  20898. (S1 ^operator O2082 = 0.9999999999999999)
  20899. Firing rl*prefer*rvt*predict-yes*H0*5
  20900. -->
  20901. (S1 ^operator O2081 = 0.)
  20902. Firing prefer*rvt*predict-yes*H0
  20903. -->
  20904. Firing prefer*rvt*predict-no*H0
  20905. -->
  20906. Firing elaborate*copy-dir-to-output-link
  20907. -->
  20908. (I3 ^dir U +)
  20909. inner elaboration loop at bottom goal.
  20910. Retracting elaborate*copy-see-to-output-link
  20911. -->
  20912. (I3 ^see 0 +)
  20913. Retracting propose*predict-no
  20914. -->
  20915. (O2082 ^name predict-no +)
  20916. (S1 ^operator O2082 +)
  20917. Retracting propose*predict-yes
  20918. -->
  20919. (O2081 ^name predict-yes +)
  20920. (S1 ^operator O2081 +)
  20921. Retracting elaborate*reward*based*on*reward
  20922. -->
  20923. (R1044 ^value 1 +)
  20924. (R1 ^reward R1044 +)
  20925. Retracting elaborate*copy-dir-to-output-link
  20926. -->
  20927. (I3 ^dir L +)
  20928. Retracting rl*prefer*rvt*predict-no*H0*2
  20929. -->
  20930. (S1 ^operator O2082 = 0.3873357815900519)
  20931. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  20932. -->
  20933. (S1 ^operator O2082 = 0.1063475139796038)
  20934. Retracting rl*prefer*rvt*predict-yes*H0*1
  20935. -->
  20936. (S1 ^operator O2081 = 0.389539602690798)
  20937. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  20938. -->
  20939. (S1 ^operator O2081 = 0.6104608339574975)
  20940. =>WM: (14607: S1 ^operator O2084 +)
  20941. =>WM: (14606: S1 ^operator O2083 +)
  20942. =>WM: (14605: I3 ^dir U)
  20943. =>WM: (14604: O2084 ^name predict-no)
  20944. =>WM: (14603: O2083 ^name predict-yes)
  20945. =>WM: (14602: R1045 ^value 1)
  20946. =>WM: (14601: R1 ^reward R1045)
  20947. =>WM: (14600: I3 ^see 1)
  20948. <=WM: (14591: S1 ^operator O2081 +)
  20949. <=WM: (14593: S1 ^operator O2081)
  20950. <=WM: (14592: S1 ^operator O2082 +)
  20951. <=WM: (14590: I3 ^dir L)
  20952. <=WM: (14586: R1 ^reward R1044)
  20953. <=WM: (14571: I3 ^see 0)
  20954. <=WM: (14589: O2082 ^name predict-no)
  20955. <=WM: (14588: O2081 ^name predict-yes)
  20956. <=WM: (14587: R1044 ^value 1)
  20957. --- Inner Elaboration Phase, active level 1 (S1) ---
  20958. Firing prefer*rvt*predict-yes*H0
  20959. -->
  20960. Firing rl*prefer*rvt*predict-yes*H0*5
  20961. -->
  20962. (S1 ^operator O2083 = 0.)
  20963. Firing prefer*rvt*predict-no*H0
  20964. -->
  20965. Firing rl*prefer*rvt*predict-no*H0*6
  20966. -->
  20967. (S1 ^operator O2084 = 0.9999999999999999)
  20968. inner elaboration loop at bottom goal.
  20969. Retracting rl*prefer*rvt*predict-no*H0*6
  20970. -->
  20971. (S1 ^operator O2082 = 0.9999999999999999)
  20972. Retracting rl*prefer*rvt*predict-yes*H0*5
  20973. -->
  20974. (S1 ^operator O2081 = 0.)
  20975. --- END Proposal Phase ---
  20976. --- Decision Phase ---
  20977. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.896552,0.0932828)
  20978. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  20979. =>WM: (14608: S1 ^operator O2084)
  20980. 1042: O: O2084 (predict-no)
  20981. --- END Decision Phase ---
  20982. --- Application Phase ---
  20983. --- Firing Productions (PE) For State At Depth 1 ---
  20984. --- Inner Elaboration Phase, active level 1 (S1) ---
  20985. Firing apply*operator
  20986. -->
  20987. (I3 ^predict-no N1042 + :O )
  20988. Firing apply*operator*complete
  20989. -->
  20990. (I3 ^predict-yes N1041 - :O )
  20991. inner elaboration loop at bottom goal.
  20992. --- Change Working Memory (PE) ---
  20993. =>WM: (14609: I3 ^predict-no N1042)
  20994. <=WM: (14595: N1041 ^status complete)
  20995. <=WM: (14594: I3 ^predict-yes N1041)
  20996. --- Firing Productions (IE) For State At Depth 1 ---
  20997. --- Inner Elaboration Phase, active level 1 (S1) ---
  20998. Firing monitor*world
  20999. -->
  21000. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21001. --- Change Working Memory (IE) ---
  21002. --- END Application Phase ---
  21003. --- Output Phase ---
  21004. ENV: Agent did: predict-no for direction U in state State-A
  21005. In State-A moving U
  21006. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21007. predict error 0
  21008. dir: dir isU
  21009. --- END Output Phase ---
  21010. /|\---- Input Phase ---
  21011. =>WM: (14613: I2 ^dir U)
  21012. =>WM: (14612: I2 ^reward 1)
  21013. =>WM: (14611: I2 ^see 0)
  21014. =>WM: (14610: N1042 ^status complete)
  21015. <=WM: (14598: I2 ^dir U)
  21016. <=WM: (14597: I2 ^reward 1)
  21017. <=WM: (14596: I2 ^see 1)
  21018. =>WM: (14614: I2 ^level-1 L1-root)
  21019. <=WM: (14599: I2 ^level-1 L1-root)
  21020. --- END Input Phase ---
  21021. --- Proposal Phase ---
  21022. --- Inner Elaboration Phase, active level 1 (S1) ---
  21023. Firing elaborate*copy-see-to-output-link
  21024. -->
  21025. (I3 ^see 0 +)
  21026. Firing elaborate*reward*based*on*reward
  21027. -->
  21028. (R1046 ^value 1 +)
  21029. (R1 ^reward R1046 +)
  21030. Firing propose*predict-yes
  21031. -->
  21032. (O2085 ^name predict-yes +)
  21033. (S1 ^operator O2085 +)
  21034. Firing propose*predict-no
  21035. -->
  21036. (O2086 ^name predict-no +)
  21037. (S1 ^operator O2086 +)
  21038. Firing rl*prefer*rvt*predict-no*H0*6
  21039. -->
  21040. (S1 ^operator O2084 = 0.9999999999999999)
  21041. Firing rl*prefer*rvt*predict-yes*H0*5
  21042. -->
  21043. (S1 ^operator O2083 = 0.)
  21044. Firing prefer*rvt*predict-yes*H0
  21045. -->
  21046. Firing prefer*rvt*predict-no*H0
  21047. -->
  21048. Firing elaborate*copy-dir-to-output-link
  21049. -->
  21050. (I3 ^dir U +)
  21051. inner elaboration loop at bottom goal.
  21052. Retracting elaborate*copy-see-to-output-link
  21053. -->
  21054. (I3 ^see 1 +)
  21055. Retracting propose*predict-no
  21056. -->
  21057. (O2084 ^name predict-no +)
  21058. (S1 ^operator O2084 +)
  21059. Retracting propose*predict-yes
  21060. -->
  21061. (O2083 ^name predict-yes +)
  21062. (S1 ^operator O2083 +)
  21063. Retracting elaborate*reward*based*on*reward
  21064. -->
  21065. (R1045 ^value 1 +)
  21066. (R1 ^reward R1045 +)
  21067. Retracting elaborate*copy-dir-to-output-link
  21068. -->
  21069. (I3 ^dir U +)
  21070. Retracting rl*prefer*rvt*predict-no*H0*6
  21071. -->
  21072. (S1 ^operator O2084 = 0.9999999999999999)
  21073. Retracting rl*prefer*rvt*predict-yes*H0*5
  21074. -->
  21075. (S1 ^operator O2083 = 0.)
  21076. =>WM: (14621: S1 ^operator O2086 +)
  21077. =>WM: (14620: S1 ^operator O2085 +)
  21078. =>WM: (14619: O2086 ^name predict-no)
  21079. =>WM: (14618: O2085 ^name predict-yes)
  21080. =>WM: (14617: R1046 ^value 1)
  21081. =>WM: (14616: R1 ^reward R1046)
  21082. =>WM: (14615: I3 ^see 0)
  21083. <=WM: (14606: S1 ^operator O2083 +)
  21084. <=WM: (14607: S1 ^operator O2084 +)
  21085. <=WM: (14608: S1 ^operator O2084)
  21086. <=WM: (14601: R1 ^reward R1045)
  21087. <=WM: (14600: I3 ^see 1)
  21088. <=WM: (14604: O2084 ^name predict-no)
  21089. <=WM: (14603: O2083 ^name predict-yes)
  21090. <=WM: (14602: R1045 ^value 1)
  21091. --- Inner Elaboration Phase, active level 1 (S1) ---
  21092. Firing prefer*rvt*predict-yes*H0
  21093. -->
  21094. Firing rl*prefer*rvt*predict-yes*H0*5
  21095. -->
  21096. (S1 ^operator O2085 = 0.)
  21097. Firing prefer*rvt*predict-no*H0
  21098. -->
  21099. Firing rl*prefer*rvt*predict-no*H0*6
  21100. -->
  21101. (S1 ^operator O2086 = 0.9999999999999999)
  21102. inner elaboration loop at bottom goal.
  21103. Retracting rl*prefer*rvt*predict-no*H0*6
  21104. -->
  21105. (S1 ^operator O2084 = 0.9999999999999999)
  21106. Retracting rl*prefer*rvt*predict-yes*H0*5
  21107. -->
  21108. (S1 ^operator O2083 = 0.)
  21109. --- END Proposal Phase ---
  21110. --- Decision Phase ---
  21111. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21112. =>WM: (14622: S1 ^operator O2086)
  21113. 1043: O: O2086 (predict-no)
  21114. --- END Decision Phase ---
  21115. --- Application Phase ---
  21116. --- Firing Productions (PE) For State At Depth 1 ---
  21117. --- Inner Elaboration Phase, active level 1 (S1) ---
  21118. Firing apply*operator
  21119. -->
  21120. (I3 ^predict-no N1043 + :O )
  21121. Firing apply*operator*complete
  21122. -->
  21123. (I3 ^predict-no N1042 - :O )
  21124. inner elaboration loop at bottom goal.
  21125. --- Change Working Memory (PE) ---
  21126. =>WM: (14623: I3 ^predict-no N1043)
  21127. <=WM: (14610: N1042 ^status complete)
  21128. <=WM: (14609: I3 ^predict-no N1042)
  21129. --- Firing Productions (IE) For State At Depth 1 ---
  21130. --- Inner Elaboration Phase, active level 1 (S1) ---
  21131. Firing monitor*world
  21132. -->
  21133. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21134. --- Change Working Memory (IE) ---
  21135. --- END Application Phase ---
  21136. --- Output Phase ---
  21137. ENV: Agent did: predict-no for direction U in state State-A
  21138. In State-A moving U
  21139. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21140. predict error 0
  21141. dir: dir isL
  21142. --- END Output Phase ---
  21143. /|\--- Input Phase ---
  21144. =>WM: (14627: I2 ^dir L)
  21145. =>WM: (14626: I2 ^reward 1)
  21146. =>WM: (14625: I2 ^see 0)
  21147. =>WM: (14624: N1043 ^status complete)
  21148. <=WM: (14613: I2 ^dir U)
  21149. <=WM: (14612: I2 ^reward 1)
  21150. <=WM: (14611: I2 ^see 0)
  21151. =>WM: (14628: I2 ^level-1 L1-root)
  21152. <=WM: (14614: I2 ^level-1 L1-root)
  21153. --- END Input Phase ---
  21154. --- Proposal Phase ---
  21155. --- Inner Elaboration Phase, active level 1 (S1) ---
  21156. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  21157. -->
  21158. (S1 ^operator O2086 = 0.6126632570275004)
  21159. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  21160. -->
  21161. (S1 ^operator O2085 = -0.02274740735326741)
  21162. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21163. -->
  21164. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21165. -->
  21166. Firing elaborate*copy-see-to-output-link
  21167. -->
  21168. (I3 ^see 0 +)
  21169. Firing elaborate*reward*based*on*reward
  21170. -->
  21171. (R1047 ^value 1 +)
  21172. (R1 ^reward R1047 +)
  21173. Firing propose*predict-yes
  21174. -->
  21175. (O2087 ^name predict-yes +)
  21176. (S1 ^operator O2087 +)
  21177. Firing propose*predict-no
  21178. -->
  21179. (O2088 ^name predict-no +)
  21180. (S1 ^operator O2088 +)
  21181. Firing rl*prefer*rvt*predict-no*H0*2
  21182. -->
  21183. (S1 ^operator O2086 = 0.3873357815900519)
  21184. Firing rl*prefer*rvt*predict-yes*H0*1
  21185. -->
  21186. (S1 ^operator O2085 = 0.3895395371935536)
  21187. Firing prefer*rvt*predict-yes*H0
  21188. -->
  21189. Firing prefer*rvt*predict-no*H0
  21190. -->
  21191. Firing elaborate*copy-dir-to-output-link
  21192. -->
  21193. (I3 ^dir L +)
  21194. inner elaboration loop at bottom goal.
  21195. Retracting elaborate*copy-see-to-output-link
  21196. -->
  21197. (I3 ^see 0 +)
  21198. Retracting propose*predict-no
  21199. -->
  21200. (O2086 ^name predict-no +)
  21201. (S1 ^operator O2086 +)
  21202. Retracting propose*predict-yes
  21203. -->
  21204. (O2085 ^name predict-yes +)
  21205. (S1 ^operator O2085 +)
  21206. Retracting elaborate*reward*based*on*reward
  21207. -->
  21208. (R1046 ^value 1 +)
  21209. (R1 ^reward R1046 +)
  21210. Retracting elaborate*copy-dir-to-output-link
  21211. -->
  21212. (I3 ^dir U +)
  21213. Retracting rl*prefer*rvt*predict-no*H0*6
  21214. -->
  21215. (S1 ^operator O2086 = 0.9999999999999999)
  21216. Retracting rl*prefer*rvt*predict-yes*H0*5
  21217. -->
  21218. (S1 ^operator O2085 = 0.)
  21219. =>WM: (14635: S1 ^operator O2088 +)
  21220. =>WM: (14634: S1 ^operator O2087 +)
  21221. =>WM: (14633: I3 ^dir L)
  21222. =>WM: (14632: O2088 ^name predict-no)
  21223. =>WM: (14631: O2087 ^name predict-yes)
  21224. =>WM: (14630: R1047 ^value 1)
  21225. =>WM: (14629: R1 ^reward R1047)
  21226. <=WM: (14620: S1 ^operator O2085 +)
  21227. <=WM: (14621: S1 ^operator O2086 +)
  21228. <=WM: (14622: S1 ^operator O2086)
  21229. <=WM: (14605: I3 ^dir U)
  21230. <=WM: (14616: R1 ^reward R1046)
  21231. <=WM: (14619: O2086 ^name predict-no)
  21232. <=WM: (14618: O2085 ^name predict-yes)
  21233. <=WM: (14617: R1046 ^value 1)
  21234. --- Inner Elaboration Phase, active level 1 (S1) ---
  21235. Firing prefer*rvt*predict-yes*H0
  21236. -->
  21237. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  21238. -->
  21239. (S1 ^operator O2087 = -0.02274740735326741)
  21240. Firing rl*prefer*rvt*predict-yes*H0*1
  21241. -->
  21242. (S1 ^operator O2087 = 0.3895395371935536)
  21243. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21244. -->
  21245. Firing prefer*rvt*predict-no*H0
  21246. -->
  21247. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  21248. -->
  21249. (S1 ^operator O2088 = 0.6126632570275004)
  21250. Firing rl*prefer*rvt*predict-no*H0*2
  21251. -->
  21252. (S1 ^operator O2088 = 0.3873357815900519)
  21253. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21254. -->
  21255. inner elaboration loop at bottom goal.
  21256. Retracting rl*prefer*rvt*predict-no*H0*2
  21257. -->
  21258. (S1 ^operator O2086 = 0.3873357815900519)
  21259. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  21260. -->
  21261. (S1 ^operator O2086 = 0.6126632570275004)
  21262. Retracting rl*prefer*rvt*predict-yes*H0*1
  21263. -->
  21264. (S1 ^operator O2085 = 0.3895395371935536)
  21265. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  21266. -->
  21267. (S1 ^operator O2085 = -0.02274740735326741)
  21268. --- END Proposal Phase ---
  21269. --- Decision Phase ---
  21270. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21271. =>WM: (14636: S1 ^operator O2088)
  21272. 1044: O: O2088 (predict-no)
  21273. --- END Decision Phase ---
  21274. --- Application Phase ---
  21275. --- Firing Productions (PE) For State At Depth 1 ---
  21276. --- Inner Elaboration Phase, active level 1 (S1) ---
  21277. Firing apply*operator
  21278. -->
  21279. (I3 ^predict-no N1044 + :O )
  21280. Firing apply*operator*complete
  21281. -->
  21282. (I3 ^predict-no N1043 - :O )
  21283. inner elaboration loop at bottom goal.
  21284. --- Change Working Memory (PE) ---
  21285. =>WM: (14637: I3 ^predict-no N1044)
  21286. <=WM: (14624: N1043 ^status complete)
  21287. <=WM: (14623: I3 ^predict-no N1043)
  21288. --- Firing Productions (IE) For State At Depth 1 ---
  21289. --- Inner Elaboration Phase, active level 1 (S1) ---
  21290. Firing monitor*world
  21291. -->
  21292. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21293. --- Change Working Memory (IE) ---
  21294. --- END Application Phase ---
  21295. --- Output Phase ---
  21296. ENV: Agent did: predict-no for direction L in state State-A
  21297. In State-A moving L
  21298. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21299. predict error 0
  21300. dir: dir isU
  21301. --- END Output Phase ---
  21302. -/|--- Input Phase ---
  21303. =>WM: (14641: I2 ^dir U)
  21304. =>WM: (14640: I2 ^reward 1)
  21305. =>WM: (14639: I2 ^see 0)
  21306. =>WM: (14638: N1044 ^status complete)
  21307. <=WM: (14627: I2 ^dir L)
  21308. <=WM: (14626: I2 ^reward 1)
  21309. <=WM: (14625: I2 ^see 0)
  21310. =>WM: (14642: I2 ^level-1 L0-root)
  21311. <=WM: (14628: I2 ^level-1 L1-root)
  21312. --- END Input Phase ---
  21313. --- Proposal Phase ---
  21314. --- Inner Elaboration Phase, active level 1 (S1) ---
  21315. Firing elaborate*copy-see-to-output-link
  21316. -->
  21317. (I3 ^see 0 +)
  21318. Firing elaborate*reward*based*on*reward
  21319. -->
  21320. (R1048 ^value 1 +)
  21321. (R1 ^reward R1048 +)
  21322. Firing propose*predict-yes
  21323. -->
  21324. (O2089 ^name predict-yes +)
  21325. (S1 ^operator O2089 +)
  21326. Firing propose*predict-no
  21327. -->
  21328. (O2090 ^name predict-no +)
  21329. (S1 ^operator O2090 +)
  21330. Firing rl*prefer*rvt*predict-no*H0*6
  21331. -->
  21332. (S1 ^operator O2088 = 0.9999999999999999)
  21333. Firing rl*prefer*rvt*predict-yes*H0*5
  21334. -->
  21335. (S1 ^operator O2087 = 0.)
  21336. Firing prefer*rvt*predict-yes*H0
  21337. -->
  21338. Firing prefer*rvt*predict-no*H0
  21339. -->
  21340. Firing elaborate*copy-dir-to-output-link
  21341. -->
  21342. (I3 ^dir U +)
  21343. inner elaboration loop at bottom goal.
  21344. Retracting elaborate*copy-see-to-output-link
  21345. -->
  21346. (I3 ^see 0 +)
  21347. Retracting propose*predict-no
  21348. -->
  21349. (O2088 ^name predict-no +)
  21350. (S1 ^operator O2088 +)
  21351. Retracting propose*predict-yes
  21352. -->
  21353. (O2087 ^name predict-yes +)
  21354. (S1 ^operator O2087 +)
  21355. Retracting elaborate*reward*based*on*reward
  21356. -->
  21357. (R1047 ^value 1 +)
  21358. (R1 ^reward R1047 +)
  21359. Retracting elaborate*copy-dir-to-output-link
  21360. -->
  21361. (I3 ^dir L +)
  21362. Retracting rl*prefer*rvt*predict-no*H0*2
  21363. -->
  21364. (S1 ^operator O2088 = 0.3873357815900519)
  21365. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  21366. -->
  21367. (S1 ^operator O2088 = 0.6126632570275004)
  21368. Retracting rl*prefer*rvt*predict-yes*H0*1
  21369. -->
  21370. (S1 ^operator O2087 = 0.3895395371935536)
  21371. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  21372. -->
  21373. (S1 ^operator O2087 = -0.02274740735326741)
  21374. =>WM: (14649: S1 ^operator O2090 +)
  21375. =>WM: (14648: S1 ^operator O2089 +)
  21376. =>WM: (14647: I3 ^dir U)
  21377. =>WM: (14646: O2090 ^name predict-no)
  21378. =>WM: (14645: O2089 ^name predict-yes)
  21379. =>WM: (14644: R1048 ^value 1)
  21380. =>WM: (14643: R1 ^reward R1048)
  21381. <=WM: (14634: S1 ^operator O2087 +)
  21382. <=WM: (14635: S1 ^operator O2088 +)
  21383. <=WM: (14636: S1 ^operator O2088)
  21384. <=WM: (14633: I3 ^dir L)
  21385. <=WM: (14629: R1 ^reward R1047)
  21386. <=WM: (14632: O2088 ^name predict-no)
  21387. <=WM: (14631: O2087 ^name predict-yes)
  21388. <=WM: (14630: R1047 ^value 1)
  21389. --- Inner Elaboration Phase, active level 1 (S1) ---
  21390. Firing prefer*rvt*predict-yes*H0
  21391. -->
  21392. Firing rl*prefer*rvt*predict-yes*H0*5
  21393. -->
  21394. (S1 ^operator O2089 = 0.)
  21395. Firing prefer*rvt*predict-no*H0
  21396. -->
  21397. Firing rl*prefer*rvt*predict-no*H0*6
  21398. -->
  21399. (S1 ^operator O2090 = 0.9999999999999999)
  21400. inner elaboration loop at bottom goal.
  21401. Retracting rl*prefer*rvt*predict-no*H0*6
  21402. -->
  21403. (S1 ^operator O2088 = 0.9999999999999999)
  21404. Retracting rl*prefer*rvt*predict-yes*H0*5
  21405. -->
  21406. (S1 ^operator O2087 = 0.)
  21407. --- END Proposal Phase ---
  21408. --- Decision Phase ---
  21409. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331744 0.387336(R,m,v=1,0.934783,0.0612972)
  21410. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.280919 0.331744 0.612663 -> 0.28092 0.331744 0.612663(R,m,v=1,1,0)
  21411. =>WM: (14650: S1 ^operator O2090)
  21412. 1045: O: O2090 (predict-no)
  21413. --- END Decision Phase ---
  21414. --- Application Phase ---
  21415. --- Firing Productions (PE) For State At Depth 1 ---
  21416. --- Inner Elaboration Phase, active level 1 (S1) ---
  21417. Firing apply*operator
  21418. -->
  21419. (I3 ^predict-no N1045 + :O )
  21420. Firing apply*operator*complete
  21421. -->
  21422. (I3 ^predict-no N1044 - :O )
  21423. inner elaboration loop at bottom goal.
  21424. --- Change Working Memory (PE) ---
  21425. =>WM: (14651: I3 ^predict-no N1045)
  21426. <=WM: (14638: N1044 ^status complete)
  21427. <=WM: (14637: I3 ^predict-no N1044)
  21428. --- Firing Productions (IE) For State At Depth 1 ---
  21429. --- Inner Elaboration Phase, active level 1 (S1) ---
  21430. Firing monitor*world
  21431. -->
  21432. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21433. --- Change Working Memory (IE) ---
  21434. --- END Application Phase ---
  21435. --- Output Phase ---
  21436. ENV: Agent did: predict-no for direction U in state State-A
  21437. In State-A moving U
  21438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21439. predict error 0
  21440. dir: dir isL
  21441. --- END Output Phase ---
  21442. \-/--- Input Phase ---
  21443. =>WM: (14655: I2 ^dir L)
  21444. =>WM: (14654: I2 ^reward 1)
  21445. =>WM: (14653: I2 ^see 0)
  21446. =>WM: (14652: N1045 ^status complete)
  21447. <=WM: (14641: I2 ^dir U)
  21448. <=WM: (14640: I2 ^reward 1)
  21449. <=WM: (14639: I2 ^see 0)
  21450. =>WM: (14656: I2 ^level-1 L0-root)
  21451. <=WM: (14642: I2 ^level-1 L0-root)
  21452. --- END Input Phase ---
  21453. --- Proposal Phase ---
  21454. --- Inner Elaboration Phase, active level 1 (S1) ---
  21455. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21456. -->
  21457. (S1 ^operator O2089 = 0.1599599085218832)
  21458. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21459. -->
  21460. (S1 ^operator O2090 = 0.6126663026263569)
  21461. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21462. -->
  21463. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21464. -->
  21465. Firing elaborate*copy-see-to-output-link
  21466. -->
  21467. (I3 ^see 0 +)
  21468. Firing elaborate*reward*based*on*reward
  21469. -->
  21470. (R1049 ^value 1 +)
  21471. (R1 ^reward R1049 +)
  21472. Firing propose*predict-yes
  21473. -->
  21474. (O2091 ^name predict-yes +)
  21475. (S1 ^operator O2091 +)
  21476. Firing propose*predict-no
  21477. -->
  21478. (O2092 ^name predict-no +)
  21479. (S1 ^operator O2092 +)
  21480. Firing rl*prefer*rvt*predict-no*H0*2
  21481. -->
  21482. (S1 ^operator O2090 = 0.3873359257974192)
  21483. Firing rl*prefer*rvt*predict-yes*H0*1
  21484. -->
  21485. (S1 ^operator O2089 = 0.3895395371935536)
  21486. Firing prefer*rvt*predict-yes*H0
  21487. -->
  21488. Firing prefer*rvt*predict-no*H0
  21489. -->
  21490. Firing elaborate*copy-dir-to-output-link
  21491. -->
  21492. (I3 ^dir L +)
  21493. inner elaboration loop at bottom goal.
  21494. Retracting elaborate*copy-see-to-output-link
  21495. -->
  21496. (I3 ^see 0 +)
  21497. Retracting propose*predict-no
  21498. -->
  21499. (O2090 ^name predict-no +)
  21500. (S1 ^operator O2090 +)
  21501. Retracting propose*predict-yes
  21502. -->
  21503. (O2089 ^name predict-yes +)
  21504. (S1 ^operator O2089 +)
  21505. Retracting elaborate*reward*based*on*reward
  21506. -->
  21507. (R1048 ^value 1 +)
  21508. (R1 ^reward R1048 +)
  21509. Retracting elaborate*copy-dir-to-output-link
  21510. -->
  21511. (I3 ^dir U +)
  21512. Retracting rl*prefer*rvt*predict-no*H0*6
  21513. -->
  21514. (S1 ^operator O2090 = 0.9999999999999999)
  21515. Retracting rl*prefer*rvt*predict-yes*H0*5
  21516. -->
  21517. (S1 ^operator O2089 = 0.)
  21518. =>WM: (14663: S1 ^operator O2092 +)
  21519. =>WM: (14662: S1 ^operator O2091 +)
  21520. =>WM: (14661: I3 ^dir L)
  21521. =>WM: (14660: O2092 ^name predict-no)
  21522. =>WM: (14659: O2091 ^name predict-yes)
  21523. =>WM: (14658: R1049 ^value 1)
  21524. =>WM: (14657: R1 ^reward R1049)
  21525. <=WM: (14648: S1 ^operator O2089 +)
  21526. <=WM: (14649: S1 ^operator O2090 +)
  21527. <=WM: (14650: S1 ^operator O2090)
  21528. <=WM: (14647: I3 ^dir U)
  21529. <=WM: (14643: R1 ^reward R1048)
  21530. <=WM: (14646: O2090 ^name predict-no)
  21531. <=WM: (14645: O2089 ^name predict-yes)
  21532. <=WM: (14644: R1048 ^value 1)
  21533. --- Inner Elaboration Phase, active level 1 (S1) ---
  21534. Firing prefer*rvt*predict-yes*H0
  21535. -->
  21536. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21537. -->
  21538. (S1 ^operator O2091 = 0.1599599085218832)
  21539. Firing rl*prefer*rvt*predict-yes*H0*1
  21540. -->
  21541. (S1 ^operator O2091 = 0.3895395371935536)
  21542. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21543. -->
  21544. Firing prefer*rvt*predict-no*H0
  21545. -->
  21546. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21547. -->
  21548. (S1 ^operator O2092 = 0.6126663026263569)
  21549. Firing rl*prefer*rvt*predict-no*H0*2
  21550. -->
  21551. (S1 ^operator O2092 = 0.3873359257974192)
  21552. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21553. -->
  21554. inner elaboration loop at bottom goal.
  21555. Retracting rl*prefer*rvt*predict-no*H0*2
  21556. -->
  21557. (S1 ^operator O2090 = 0.3873359257974192)
  21558. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21559. -->
  21560. (S1 ^operator O2090 = 0.6126663026263569)
  21561. Retracting rl*prefer*rvt*predict-yes*H0*1
  21562. -->
  21563. (S1 ^operator O2089 = 0.3895395371935536)
  21564. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21565. -->
  21566. (S1 ^operator O2089 = 0.1599599085218832)
  21567. --- END Proposal Phase ---
  21568. --- Decision Phase ---
  21569. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21570. =>WM: (14664: S1 ^operator O2092)
  21571. 1046: O: O2092 (predict-no)
  21572. --- END Decision Phase ---
  21573. --- Application Phase ---
  21574. --- Firing Productions (PE) For State At Depth 1 ---
  21575. --- Inner Elaboration Phase, active level 1 (S1) ---
  21576. Firing apply*operator
  21577. -->
  21578. (I3 ^predict-no N1046 + :O )
  21579. Firing apply*operator*complete
  21580. -->
  21581. (I3 ^predict-no N1045 - :O )
  21582. inner elaboration loop at bottom goal.
  21583. --- Change Working Memory (PE) ---
  21584. =>WM: (14665: I3 ^predict-no N1046)
  21585. <=WM: (14652: N1045 ^status complete)
  21586. <=WM: (14651: I3 ^predict-no N1045)
  21587. --- Firing Productions (IE) For State At Depth 1 ---
  21588. --- Inner Elaboration Phase, active level 1 (S1) ---
  21589. Firing monitor*world
  21590. -->
  21591. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21592. --- Change Working Memory (IE) ---
  21593. --- END Application Phase ---
  21594. --- Output Phase ---
  21595. ENV: Agent did: predict-no for direction L in state State-A
  21596. In State-A moving L
  21597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21598. predict error 0
  21599. dir: dir isU
  21600. --- END Output Phase ---
  21601. |\---- Input Phase ---
  21602. =>WM: (14669: I2 ^dir U)
  21603. =>WM: (14668: I2 ^reward 1)
  21604. =>WM: (14667: I2 ^see 0)
  21605. =>WM: (14666: N1046 ^status complete)
  21606. <=WM: (14655: I2 ^dir L)
  21607. <=WM: (14654: I2 ^reward 1)
  21608. <=WM: (14653: I2 ^see 0)
  21609. =>WM: (14670: I2 ^level-1 L0-root)
  21610. <=WM: (14656: I2 ^level-1 L0-root)
  21611. --- END Input Phase ---
  21612. --- Proposal Phase ---
  21613. --- Inner Elaboration Phase, active level 1 (S1) ---
  21614. Firing elaborate*copy-see-to-output-link
  21615. -->
  21616. (I3 ^see 0 +)
  21617. Firing elaborate*reward*based*on*reward
  21618. -->
  21619. (R1050 ^value 1 +)
  21620. (R1 ^reward R1050 +)
  21621. Firing propose*predict-yes
  21622. -->
  21623. (O2093 ^name predict-yes +)
  21624. (S1 ^operator O2093 +)
  21625. Firing propose*predict-no
  21626. -->
  21627. (O2094 ^name predict-no +)
  21628. (S1 ^operator O2094 +)
  21629. Firing rl*prefer*rvt*predict-no*H0*6
  21630. -->
  21631. (S1 ^operator O2092 = 0.9999999999999999)
  21632. Firing rl*prefer*rvt*predict-yes*H0*5
  21633. -->
  21634. (S1 ^operator O2091 = 0.)
  21635. Firing prefer*rvt*predict-yes*H0
  21636. -->
  21637. Firing prefer*rvt*predict-no*H0
  21638. -->
  21639. Firing elaborate*copy-dir-to-output-link
  21640. -->
  21641. (I3 ^dir U +)
  21642. inner elaboration loop at bottom goal.
  21643. Retracting elaborate*copy-see-to-output-link
  21644. -->
  21645. (I3 ^see 0 +)
  21646. Retracting propose*predict-no
  21647. -->
  21648. (O2092 ^name predict-no +)
  21649. (S1 ^operator O2092 +)
  21650. Retracting propose*predict-yes
  21651. -->
  21652. (O2091 ^name predict-yes +)
  21653. (S1 ^operator O2091 +)
  21654. Retracting elaborate*reward*based*on*reward
  21655. -->
  21656. (R1049 ^value 1 +)
  21657. (R1 ^reward R1049 +)
  21658. Retracting elaborate*copy-dir-to-output-link
  21659. -->
  21660. (I3 ^dir L +)
  21661. Retracting rl*prefer*rvt*predict-no*H0*2
  21662. -->
  21663. (S1 ^operator O2092 = 0.3873359257974192)
  21664. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21665. -->
  21666. (S1 ^operator O2092 = 0.6126663026263569)
  21667. Retracting rl*prefer*rvt*predict-yes*H0*1
  21668. -->
  21669. (S1 ^operator O2091 = 0.3895395371935536)
  21670. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21671. -->
  21672. (S1 ^operator O2091 = 0.1599599085218832)
  21673. =>WM: (14677: S1 ^operator O2094 +)
  21674. =>WM: (14676: S1 ^operator O2093 +)
  21675. =>WM: (14675: I3 ^dir U)
  21676. =>WM: (14674: O2094 ^name predict-no)
  21677. =>WM: (14673: O2093 ^name predict-yes)
  21678. =>WM: (14672: R1050 ^value 1)
  21679. =>WM: (14671: R1 ^reward R1050)
  21680. <=WM: (14662: S1 ^operator O2091 +)
  21681. <=WM: (14663: S1 ^operator O2092 +)
  21682. <=WM: (14664: S1 ^operator O2092)
  21683. <=WM: (14661: I3 ^dir L)
  21684. <=WM: (14657: R1 ^reward R1049)
  21685. <=WM: (14660: O2092 ^name predict-no)
  21686. <=WM: (14659: O2091 ^name predict-yes)
  21687. <=WM: (14658: R1049 ^value 1)
  21688. --- Inner Elaboration Phase, active level 1 (S1) ---
  21689. Firing prefer*rvt*predict-yes*H0
  21690. -->
  21691. Firing rl*prefer*rvt*predict-yes*H0*5
  21692. -->
  21693. (S1 ^operator O2093 = 0.)
  21694. Firing prefer*rvt*predict-no*H0
  21695. -->
  21696. Firing rl*prefer*rvt*predict-no*H0*6
  21697. -->
  21698. (S1 ^operator O2094 = 0.9999999999999999)
  21699. inner elaboration loop at bottom goal.
  21700. Retracting rl*prefer*rvt*predict-no*H0*6
  21701. -->
  21702. (S1 ^operator O2092 = 0.9999999999999999)
  21703. Retracting rl*prefer*rvt*predict-yes*H0*5
  21704. -->
  21705. (S1 ^operator O2091 = 0.)
  21706. --- END Proposal Phase ---
  21707. --- Decision Phase ---
  21708. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331744 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.935135,0.0609871)
  21709. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280924 0.331742 0.612666 -> 0.280923 0.331743 0.612666(R,m,v=1,1,0)
  21710. =>WM: (14678: S1 ^operator O2094)
  21711. 1047: O: O2094 (predict-no)
  21712. --- END Decision Phase ---
  21713. --- Application Phase ---
  21714. --- Firing Productions (PE) For State At Depth 1 ---
  21715. --- Inner Elaboration Phase, active level 1 (S1) ---
  21716. Firing apply*operator
  21717. -->
  21718. (I3 ^predict-no N1047 + :O )
  21719. Firing apply*operator*complete
  21720. -->
  21721. (I3 ^predict-no N1046 - :O )
  21722. inner elaboration loop at bottom goal.
  21723. --- Change Working Memory (PE) ---
  21724. =>WM: (14679: I3 ^predict-no N1047)
  21725. <=WM: (14666: N1046 ^status complete)
  21726. <=WM: (14665: I3 ^predict-no N1046)
  21727. --- Firing Productions (IE) For State At Depth 1 ---
  21728. --- Inner Elaboration Phase, active level 1 (S1) ---
  21729. Firing monitor*world
  21730. -->
  21731. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21732. --- Change Working Memory (IE) ---
  21733. --- END Application Phase ---
  21734. --- Output Phase ---
  21735. ENV: Agent did: predict-no for direction U in state State-A
  21736. In State-A moving U
  21737. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21738. predict error 0
  21739. dir: dir isU
  21740. --- END Output Phase ---
  21741. /|\---- Input Phase ---
  21742. =>WM: (14683: I2 ^dir U)
  21743. =>WM: (14682: I2 ^reward 1)
  21744. =>WM: (14681: I2 ^see 0)
  21745. =>WM: (14680: N1047 ^status complete)
  21746. <=WM: (14669: I2 ^dir U)
  21747. <=WM: (14668: I2 ^reward 1)
  21748. <=WM: (14667: I2 ^see 0)
  21749. =>WM: (14684: I2 ^level-1 L0-root)
  21750. <=WM: (14670: I2 ^level-1 L0-root)
  21751. --- END Input Phase ---
  21752. --- Proposal Phase ---
  21753. --- Inner Elaboration Phase, active level 1 (S1) ---
  21754. Firing elaborate*copy-see-to-output-link
  21755. -->
  21756. (I3 ^see 0 +)
  21757. Firing elaborate*reward*based*on*reward
  21758. -->
  21759. (R1051 ^value 1 +)
  21760. (R1 ^reward R1051 +)
  21761. Firing propose*predict-yes
  21762. -->
  21763. (O2095 ^name predict-yes +)
  21764. (S1 ^operator O2095 +)
  21765. Firing propose*predict-no
  21766. -->
  21767. (O2096 ^name predict-no +)
  21768. (S1 ^operator O2096 +)
  21769. Firing rl*prefer*rvt*predict-no*H0*6
  21770. -->
  21771. (S1 ^operator O2094 = 0.9999999999999999)
  21772. Firing rl*prefer*rvt*predict-yes*H0*5
  21773. -->
  21774. (S1 ^operator O2093 = 0.)
  21775. Firing prefer*rvt*predict-yes*H0
  21776. -->
  21777. Firing prefer*rvt*predict-no*H0
  21778. -->
  21779. Firing elaborate*copy-dir-to-output-link
  21780. -->
  21781. (I3 ^dir U +)
  21782. inner elaboration loop at bottom goal.
  21783. Retracting elaborate*copy-see-to-output-link
  21784. -->
  21785. (I3 ^see 0 +)
  21786. Retracting propose*predict-no
  21787. -->
  21788. (O2094 ^name predict-no +)
  21789. (S1 ^operator O2094 +)
  21790. Retracting propose*predict-yes
  21791. -->
  21792. (O2093 ^name predict-yes +)
  21793. (S1 ^operator O2093 +)
  21794. Retracting elaborate*reward*based*on*reward
  21795. -->
  21796. (R1050 ^value 1 +)
  21797. (R1 ^reward R1050 +)
  21798. Retracting elaborate*copy-dir-to-output-link
  21799. -->
  21800. (I3 ^dir U +)
  21801. Retracting rl*prefer*rvt*predict-no*H0*6
  21802. -->
  21803. (S1 ^operator O2094 = 0.9999999999999999)
  21804. Retracting rl*prefer*rvt*predict-yes*H0*5
  21805. -->
  21806. (S1 ^operator O2093 = 0.)
  21807. =>WM: (14690: S1 ^operator O2096 +)
  21808. =>WM: (14689: S1 ^operator O2095 +)
  21809. =>WM: (14688: O2096 ^name predict-no)
  21810. =>WM: (14687: O2095 ^name predict-yes)
  21811. =>WM: (14686: R1051 ^value 1)
  21812. =>WM: (14685: R1 ^reward R1051)
  21813. <=WM: (14676: S1 ^operator O2093 +)
  21814. <=WM: (14677: S1 ^operator O2094 +)
  21815. <=WM: (14678: S1 ^operator O2094)
  21816. <=WM: (14671: R1 ^reward R1050)
  21817. <=WM: (14674: O2094 ^name predict-no)
  21818. <=WM: (14673: O2093 ^name predict-yes)
  21819. <=WM: (14672: R1050 ^value 1)
  21820. --- Inner Elaboration Phase, active level 1 (S1) ---
  21821. Firing prefer*rvt*predict-yes*H0
  21822. -->
  21823. Firing rl*prefer*rvt*predict-yes*H0*5
  21824. -->
  21825. (S1 ^operator O2095 = 0.)
  21826. Firing prefer*rvt*predict-no*H0
  21827. -->
  21828. Firing rl*prefer*rvt*predict-no*H0*6
  21829. -->
  21830. (S1 ^operator O2096 = 0.9999999999999999)
  21831. inner elaboration loop at bottom goal.
  21832. Retracting rl*prefer*rvt*predict-no*H0*6
  21833. -->
  21834. (S1 ^operator O2094 = 0.9999999999999999)
  21835. Retracting rl*prefer*rvt*predict-yes*H0*5
  21836. -->
  21837. (S1 ^operator O2093 = 0.)
  21838. --- END Proposal Phase ---
  21839. --- Decision Phase ---
  21840. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21841. =>WM: (14691: S1 ^operator O2096)
  21842. 1048: O: O2096 (predict-no)
  21843. --- END Decision Phase ---
  21844. --- Application Phase ---
  21845. --- Firing Productions (PE) For State At Depth 1 ---
  21846. --- Inner Elaboration Phase, active level 1 (S1) ---
  21847. Firing apply*operator
  21848. -->
  21849. (I3 ^predict-no N1048 + :O )
  21850. Firing apply*operator*complete
  21851. -->
  21852. (I3 ^predict-no N1047 - :O )
  21853. inner elaboration loop at bottom goal.
  21854. --- Change Working Memory (PE) ---
  21855. =>WM: (14692: I3 ^predict-no N1048)
  21856. <=WM: (14680: N1047 ^status complete)
  21857. <=WM: (14679: I3 ^predict-no N1047)
  21858. --- Firing Productions (IE) For State At Depth 1 ---
  21859. --- Inner Elaboration Phase, active level 1 (S1) ---
  21860. Firing monitor*world
  21861. -->
  21862. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21863. --- Change Working Memory (IE) ---
  21864. --- END Application Phase ---
  21865. --- Output Phase ---
  21866. ENV: Agent did: predict-no for direction U in state State-A
  21867. In State-A moving U
  21868. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21869. predict error 0
  21870. dir: dir isL
  21871. --- END Output Phase ---
  21872. /|\---- Input Phase ---
  21873. =>WM: (14696: I2 ^dir L)
  21874. =>WM: (14695: I2 ^reward 1)
  21875. =>WM: (14694: I2 ^see 0)
  21876. =>WM: (14693: N1048 ^status complete)
  21877. <=WM: (14683: I2 ^dir U)
  21878. <=WM: (14682: I2 ^reward 1)
  21879. <=WM: (14681: I2 ^see 0)
  21880. =>WM: (14697: I2 ^level-1 L0-root)
  21881. <=WM: (14684: I2 ^level-1 L0-root)
  21882. --- END Input Phase ---
  21883. --- Proposal Phase ---
  21884. --- Inner Elaboration Phase, active level 1 (S1) ---
  21885. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21886. -->
  21887. (S1 ^operator O2095 = 0.1599599085218832)
  21888. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21889. -->
  21890. (S1 ^operator O2096 = 0.6126659683627904)
  21891. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21892. -->
  21893. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21894. -->
  21895. Firing elaborate*copy-see-to-output-link
  21896. -->
  21897. (I3 ^see 0 +)
  21898. Firing elaborate*reward*based*on*reward
  21899. -->
  21900. (R1052 ^value 1 +)
  21901. (R1 ^reward R1052 +)
  21902. Firing propose*predict-yes
  21903. -->
  21904. (O2097 ^name predict-yes +)
  21905. (S1 ^operator O2097 +)
  21906. Firing propose*predict-no
  21907. -->
  21908. (O2098 ^name predict-no +)
  21909. (S1 ^operator O2098 +)
  21910. Firing rl*prefer*rvt*predict-no*H0*2
  21911. -->
  21912. (S1 ^operator O2096 = 0.3873355915338527)
  21913. Firing rl*prefer*rvt*predict-yes*H0*1
  21914. -->
  21915. (S1 ^operator O2095 = 0.3895395371935536)
  21916. Firing prefer*rvt*predict-yes*H0
  21917. -->
  21918. Firing prefer*rvt*predict-no*H0
  21919. -->
  21920. Firing elaborate*copy-dir-to-output-link
  21921. -->
  21922. (I3 ^dir L +)
  21923. inner elaboration loop at bottom goal.
  21924. Retracting elaborate*copy-see-to-output-link
  21925. -->
  21926. (I3 ^see 0 +)
  21927. Retracting propose*predict-no
  21928. -->
  21929. (O2096 ^name predict-no +)
  21930. (S1 ^operator O2096 +)
  21931. Retracting propose*predict-yes
  21932. -->
  21933. (O2095 ^name predict-yes +)
  21934. (S1 ^operator O2095 +)
  21935. Retracting elaborate*reward*based*on*reward
  21936. -->
  21937. (R1051 ^value 1 +)
  21938. (R1 ^reward R1051 +)
  21939. Retracting elaborate*copy-dir-to-output-link
  21940. -->
  21941. (I3 ^dir U +)
  21942. Retracting rl*prefer*rvt*predict-no*H0*6
  21943. -->
  21944. (S1 ^operator O2096 = 0.9999999999999999)
  21945. Retracting rl*prefer*rvt*predict-yes*H0*5
  21946. -->
  21947. (S1 ^operator O2095 = 0.)
  21948. =>WM: (14704: S1 ^operator O2098 +)
  21949. =>WM: (14703: S1 ^operator O2097 +)
  21950. =>WM: (14702: I3 ^dir L)
  21951. =>WM: (14701: O2098 ^name predict-no)
  21952. =>WM: (14700: O2097 ^name predict-yes)
  21953. =>WM: (14699: R1052 ^value 1)
  21954. =>WM: (14698: R1 ^reward R1052)
  21955. <=WM: (14689: S1 ^operator O2095 +)
  21956. <=WM: (14690: S1 ^operator O2096 +)
  21957. <=WM: (14691: S1 ^operator O2096)
  21958. <=WM: (14675: I3 ^dir U)
  21959. <=WM: (14685: R1 ^reward R1051)
  21960. <=WM: (14688: O2096 ^name predict-no)
  21961. <=WM: (14687: O2095 ^name predict-yes)
  21962. <=WM: (14686: R1051 ^value 1)
  21963. --- Inner Elaboration Phase, active level 1 (S1) ---
  21964. Firing prefer*rvt*predict-yes*H0
  21965. -->
  21966. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21967. -->
  21968. (S1 ^operator O2097 = 0.1599599085218832)
  21969. Firing rl*prefer*rvt*predict-yes*H0*1
  21970. -->
  21971. (S1 ^operator O2097 = 0.3895395371935536)
  21972. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  21973. -->
  21974. Firing prefer*rvt*predict-no*H0
  21975. -->
  21976. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21977. -->
  21978. (S1 ^operator O2098 = 0.6126659683627904)
  21979. Firing rl*prefer*rvt*predict-no*H0*2
  21980. -->
  21981. (S1 ^operator O2098 = 0.3873355915338527)
  21982. Firing prefer*rvt*predict-no*H0*2*v1*H1
  21983. -->
  21984. inner elaboration loop at bottom goal.
  21985. Retracting rl*prefer*rvt*predict-no*H0*2
  21986. -->
  21987. (S1 ^operator O2096 = 0.3873355915338527)
  21988. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  21989. -->
  21990. (S1 ^operator O2096 = 0.6126659683627904)
  21991. Retracting rl*prefer*rvt*predict-yes*H0*1
  21992. -->
  21993. (S1 ^operator O2095 = 0.3895395371935536)
  21994. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  21995. -->
  21996. (S1 ^operator O2095 = 0.1599599085218832)
  21997. --- END Proposal Phase ---
  21998. --- Decision Phase ---
  21999. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22000. =>WM: (14705: S1 ^operator O2098)
  22001. 1049: O: O2098 (predict-no)
  22002. --- END Decision Phase ---
  22003. --- Application Phase ---
  22004. --- Firing Productions (PE) For State At Depth 1 ---
  22005. --- Inner Elaboration Phase, active level 1 (S1) ---
  22006. Firing apply*operator
  22007. -->
  22008. (I3 ^predict-no N1049 + :O )
  22009. Firing apply*operator*complete
  22010. -->
  22011. (I3 ^predict-no N1048 - :O )
  22012. inner elaboration loop at bottom goal.
  22013. --- Change Working Memory (PE) ---
  22014. =>WM: (14706: I3 ^predict-no N1049)
  22015. <=WM: (14693: N1048 ^status complete)
  22016. <=WM: (14692: I3 ^predict-no N1048)
  22017. --- Firing Productions (IE) For State At Depth 1 ---
  22018. --- Inner Elaboration Phase, active level 1 (S1) ---
  22019. Firing monitor*world
  22020. -->
  22021. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22022. --- Change Working Memory (IE) ---
  22023. --- END Application Phase ---
  22024. --- Output Phase ---
  22025. ENV: Agent did: predict-no for direction L in state State-A
  22026. In State-A moving L
  22027. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22028. predict error 0
  22029. dir: dir isR
  22030. --- END Output Phase ---
  22031. /|\--- Input Phase ---
  22032. =>WM: (14710: I2 ^dir R)
  22033. =>WM: (14709: I2 ^reward 1)
  22034. =>WM: (14708: I2 ^see 0)
  22035. =>WM: (14707: N1049 ^status complete)
  22036. <=WM: (14696: I2 ^dir L)
  22037. <=WM: (14695: I2 ^reward 1)
  22038. <=WM: (14694: I2 ^see 0)
  22039. =>WM: (14711: I2 ^level-1 L0-root)
  22040. <=WM: (14697: I2 ^level-1 L0-root)
  22041. --- END Input Phase ---
  22042. --- Proposal Phase ---
  22043. --- Inner Elaboration Phase, active level 1 (S1) ---
  22044. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22045. -->
  22046. (S1 ^operator O2097 = 0.8155909586299994)
  22047. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22048. -->
  22049. (S1 ^operator O2098 = -0.00558448899823713)
  22050. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22051. -->
  22052. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22053. -->
  22054. Firing elaborate*copy-see-to-output-link
  22055. -->
  22056. (I3 ^see 0 +)
  22057. Firing elaborate*reward*based*on*reward
  22058. -->
  22059. (R1053 ^value 1 +)
  22060. (R1 ^reward R1053 +)
  22061. Firing propose*predict-yes
  22062. -->
  22063. (O2099 ^name predict-yes +)
  22064. (S1 ^operator O2099 +)
  22065. Firing propose*predict-no
  22066. -->
  22067. (O2100 ^name predict-no +)
  22068. (S1 ^operator O2100 +)
  22069. Firing rl*prefer*rvt*predict-no*H0*4
  22070. -->
  22071. (S1 ^operator O2098 = 0.4476194484475643)
  22072. Firing rl*prefer*rvt*predict-yes*H0*3
  22073. -->
  22074. (S1 ^operator O2097 = 0.1844112102474305)
  22075. Firing prefer*rvt*predict-yes*H0
  22076. -->
  22077. Firing prefer*rvt*predict-no*H0
  22078. -->
  22079. Firing elaborate*copy-dir-to-output-link
  22080. -->
  22081. (I3 ^dir R +)
  22082. inner elaboration loop at bottom goal.
  22083. Retracting elaborate*copy-see-to-output-link
  22084. -->
  22085. (I3 ^see 0 +)
  22086. Retracting propose*predict-no
  22087. -->
  22088. (O2098 ^name predict-no +)
  22089. (S1 ^operator O2098 +)
  22090. Retracting propose*predict-yes
  22091. -->
  22092. (O2097 ^name predict-yes +)
  22093. (S1 ^operator O2097 +)
  22094. Retracting elaborate*reward*based*on*reward
  22095. -->
  22096. (R1052 ^value 1 +)
  22097. (R1 ^reward R1052 +)
  22098. Retracting elaborate*copy-dir-to-output-link
  22099. -->
  22100. (I3 ^dir L +)
  22101. Retracting rl*prefer*rvt*predict-no*H0*2
  22102. -->
  22103. (S1 ^operator O2098 = 0.3873355915338527)
  22104. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  22105. -->
  22106. (S1 ^operator O2098 = 0.6126659683627904)
  22107. Retracting rl*prefer*rvt*predict-yes*H0*1
  22108. -->
  22109. (S1 ^operator O2097 = 0.3895395371935536)
  22110. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  22111. -->
  22112. (S1 ^operator O2097 = 0.1599599085218832)
  22113. =>WM: (14718: S1 ^operator O2100 +)
  22114. =>WM: (14717: S1 ^operator O2099 +)
  22115. =>WM: (14716: I3 ^dir R)
  22116. =>WM: (14715: O2100 ^name predict-no)
  22117. =>WM: (14714: O2099 ^name predict-yes)
  22118. =>WM: (14713: R1053 ^value 1)
  22119. =>WM: (14712: R1 ^reward R1053)
  22120. <=WM: (14703: S1 ^operator O2097 +)
  22121. <=WM: (14704: S1 ^operator O2098 +)
  22122. <=WM: (14705: S1 ^operator O2098)
  22123. <=WM: (14702: I3 ^dir L)
  22124. <=WM: (14698: R1 ^reward R1052)
  22125. <=WM: (14701: O2098 ^name predict-no)
  22126. <=WM: (14700: O2097 ^name predict-yes)
  22127. <=WM: (14699: R1052 ^value 1)
  22128. --- Inner Elaboration Phase, active level 1 (S1) ---
  22129. Firing prefer*rvt*predict-yes*H0
  22130. -->
  22131. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22132. -->
  22133. (S1 ^operator O2099 = 0.8155909586299994)
  22134. Firing rl*prefer*rvt*predict-yes*H0*3
  22135. -->
  22136. (S1 ^operator O2099 = 0.1844112102474305)
  22137. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22138. -->
  22139. Firing prefer*rvt*predict-no*H0
  22140. -->
  22141. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22142. -->
  22143. (S1 ^operator O2100 = -0.00558448899823713)
  22144. Firing rl*prefer*rvt*predict-no*H0*4
  22145. -->
  22146. (S1 ^operator O2100 = 0.4476194484475643)
  22147. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22148. -->
  22149. inner elaboration loop at bottom goal.
  22150. Retracting rl*prefer*rvt*predict-no*H0*4
  22151. -->
  22152. (S1 ^operator O2098 = 0.4476194484475643)
  22153. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22154. -->
  22155. (S1 ^operator O2098 = -0.00558448899823713)
  22156. Retracting rl*prefer*rvt*predict-yes*H0*3
  22157. -->
  22158. (S1 ^operator O2097 = 0.1844112102474305)
  22159. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22160. -->
  22161. (S1 ^operator O2097 = 0.8155909586299994)
  22162. --- END Proposal Phase ---
  22163. --- Decision Phase ---
  22164. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387335(R,m,v=1,0.935484,0.06068)
  22165. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280923 0.331743 0.612666 -> 0.280923 0.331743 0.612666(R,m,v=1,1,0)
  22166. =>WM: (14719: S1 ^operator O2099)
  22167. 1050: O: O2099 (predict-yes)
  22168. --- END Decision Phase ---
  22169. --- Application Phase ---
  22170. --- Firing Productions (PE) For State At Depth 1 ---
  22171. --- Inner Elaboration Phase, active level 1 (S1) ---
  22172. Firing apply*operator
  22173. -->
  22174. (I3 ^predict-yes N1050 + :O )
  22175. Firing apply*operator*complete
  22176. -->
  22177. (I3 ^predict-no N1049 - :O )
  22178. inner elaboration loop at bottom goal.
  22179. --- Change Working Memory (PE) ---
  22180. =>WM: (14720: I3 ^predict-yes N1050)
  22181. <=WM: (14707: N1049 ^status complete)
  22182. <=WM: (14706: I3 ^predict-no N1049)
  22183. --- Firing Productions (IE) For State At Depth 1 ---
  22184. --- Inner Elaboration Phase, active level 1 (S1) ---
  22185. Firing monitor*world
  22186. -->
  22187. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22188. --- Change Working Memory (IE) ---
  22189. --- END Application Phase ---
  22190. --- Output Phase ---
  22191. ENV: Agent did: predict-yes for direction R in state State-A
  22192. In State-A moving R
  22193. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22194. predict error 0
  22195. dir: dir isR
  22196. --- END Output Phase ---
  22197. -/--- Input Phase ---
  22198. =>WM: (14724: I2 ^dir R)
  22199. =>WM: (14723: I2 ^reward 1)
  22200. =>WM: (14722: I2 ^see 1)
  22201. =>WM: (14721: N1050 ^status complete)
  22202. <=WM: (14710: I2 ^dir R)
  22203. <=WM: (14709: I2 ^reward 1)
  22204. <=WM: (14708: I2 ^see 0)
  22205. =>WM: (14725: I2 ^level-1 R1-root)
  22206. <=WM: (14711: I2 ^level-1 L0-root)
  22207. --- END Input Phase ---
  22208. --- Proposal Phase ---
  22209. --- Inner Elaboration Phase, active level 1 (S1) ---
  22210. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22211. -->
  22212. (S1 ^operator O2099 = 0.1398795999120246)
  22213. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22214. -->
  22215. (S1 ^operator O2100 = 0.5523812848757654)
  22216. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22217. -->
  22218. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22219. -->
  22220. Firing elaborate*copy-see-to-output-link
  22221. -->
  22222. (I3 ^see 1 +)
  22223. Firing elaborate*reward*based*on*reward
  22224. -->
  22225. (R1054 ^value 1 +)
  22226. (R1 ^reward R1054 +)
  22227. Firing propose*predict-yes
  22228. -->
  22229. (O2101 ^name predict-yes +)
  22230. (S1 ^operator O2101 +)
  22231. Firing propose*predict-no
  22232. -->
  22233. (O2102 ^name predict-no +)
  22234. (S1 ^operator O2102 +)
  22235. Firing rl*prefer*rvt*predict-no*H0*4
  22236. -->
  22237. (S1 ^operator O2100 = 0.4476194484475643)
  22238. Firing rl*prefer*rvt*predict-yes*H0*3
  22239. -->
  22240. (S1 ^operator O2099 = 0.1844112102474305)
  22241. Firing prefer*rvt*predict-yes*H0
  22242. -->
  22243. Firing prefer*rvt*predict-no*H0
  22244. -->
  22245. Firing elaborate*copy-dir-to-output-link
  22246. -->
  22247. (I3 ^dir R +)
  22248. inner elaboration loop at bottom goal.
  22249. Retracting elaborate*copy-see-to-output-link
  22250. -->
  22251. (I3 ^see 0 +)
  22252. Retracting propose*predict-no
  22253. -->
  22254. (O2100 ^name predict-no +)
  22255. (S1 ^operator O2100 +)
  22256. Retracting propose*predict-yes
  22257. -->
  22258. (O2099 ^name predict-yes +)
  22259. (S1 ^operator O2099 +)
  22260. Retracting elaborate*reward*based*on*reward
  22261. -->
  22262. (R1053 ^value 1 +)
  22263. (R1 ^reward R1053 +)
  22264. Retracting elaborate*copy-dir-to-output-link
  22265. -->
  22266. (I3 ^dir R +)
  22267. Retracting rl*prefer*rvt*predict-no*H0*4
  22268. -->
  22269. (S1 ^operator O2100 = 0.4476194484475643)
  22270. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22271. -->
  22272. (S1 ^operator O2100 = -0.00558448899823713)
  22273. Retracting rl*prefer*rvt*predict-yes*H0*3
  22274. -->
  22275. (S1 ^operator O2099 = 0.1844112102474305)
  22276. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22277. -->
  22278. (S1 ^operator O2099 = 0.8155909586299994)
  22279. =>WM: (14732: S1 ^operator O2102 +)
  22280. =>WM: (14731: S1 ^operator O2101 +)
  22281. =>WM: (14730: O2102 ^name predict-no)
  22282. =>WM: (14729: O2101 ^name predict-yes)
  22283. =>WM: (14728: R1054 ^value 1)
  22284. =>WM: (14727: R1 ^reward R1054)
  22285. =>WM: (14726: I3 ^see 1)
  22286. <=WM: (14717: S1 ^operator O2099 +)
  22287. <=WM: (14719: S1 ^operator O2099)
  22288. <=WM: (14718: S1 ^operator O2100 +)
  22289. <=WM: (14712: R1 ^reward R1053)
  22290. <=WM: (14615: I3 ^see 0)
  22291. <=WM: (14715: O2100 ^name predict-no)
  22292. <=WM: (14714: O2099 ^name predict-yes)
  22293. <=WM: (14713: R1053 ^value 1)
  22294. --- Inner Elaboration Phase, active level 1 (S1) ---
  22295. Firing prefer*rvt*predict-yes*H0
  22296. -->
  22297. Firing rl*prefer*rvt*predict-yes*H0*3
  22298. -->
  22299. (S1 ^operator O2101 = 0.1844112102474305)
  22300. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22301. -->
  22302. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22303. -->
  22304. (S1 ^operator O2101 = 0.1398795999120246)
  22305. Firing prefer*rvt*predict-no*H0
  22306. -->
  22307. Firing rl*prefer*rvt*predict-no*H0*4
  22308. -->
  22309. (S1 ^operator O2102 = 0.4476194484475643)
  22310. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22311. -->
  22312. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22313. -->
  22314. (S1 ^operator O2102 = 0.5523812848757654)
  22315. inner elaboration loop at bottom goal.
  22316. Retracting rl*prefer*rvt*predict-no*H0*4
  22317. -->
  22318. (S1 ^operator O2100 = 0.4476194484475643)
  22319. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22320. -->
  22321. (S1 ^operator O2100 = 0.5523812848757654)
  22322. Retracting rl*prefer*rvt*predict-yes*H0*3
  22323. -->
  22324. (S1 ^operator O2099 = 0.1844112102474305)
  22325. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22326. -->
  22327. (S1 ^operator O2099 = 0.1398795999120246)
  22328. --- END Proposal Phase ---
  22329. --- Decision Phase ---
  22330. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675413 -0.491003 0.184411(R,m,v=1,0.903955,0.0873138)
  22331. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324589 0.491002 0.815591 -> 0.324589 0.491002 0.815591(R,m,v=1,1,0)
  22332. =>WM: (14733: S1 ^operator O2102)
  22333. 1051: O: O2102 (predict-no)
  22334. --- END Decision Phase ---
  22335. --- Application Phase ---
  22336. --- Firing Productions (PE) For State At Depth 1 ---
  22337. --- Inner Elaboration Phase, active level 1 (S1) ---
  22338. Firing apply*operator
  22339. -->
  22340. (I3 ^predict-no N1051 + :O )
  22341. Firing apply*operator*complete
  22342. -->
  22343. (I3 ^predict-yes N1050 - :O )
  22344. inner elaboration loop at bottom goal.
  22345. --- Change Working Memory (PE) ---
  22346. =>WM: (14734: I3 ^predict-no N1051)
  22347. <=WM: (14721: N1050 ^status complete)
  22348. <=WM: (14720: I3 ^predict-yes N1050)
  22349. --- Firing Productions (IE) For State At Depth 1 ---
  22350. --- Inner Elaboration Phase, active level 1 (S1) ---
  22351. Firing monitor*world
  22352. -->
  22353. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22354. --- Change Working Memory (IE) ---
  22355. --- END Application Phase ---
  22356. --- Output Phase ---
  22357. ENV: Agent did: predict-no for direction R in state State-B
  22358. In State-B moving R
  22359. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22360. predict error 0
  22361. dir: dir isU
  22362. --- END Output Phase ---
  22363. |--- Input Phase ---
  22364. =>WM: (14738: I2 ^dir U)
  22365. =>WM: (14737: I2 ^reward 1)
  22366. =>WM: (14736: I2 ^see 0)
  22367. =>WM: (14735: N1051 ^status complete)
  22368. <=WM: (14724: I2 ^dir R)
  22369. <=WM: (14723: I2 ^reward 1)
  22370. <=WM: (14722: I2 ^see 1)
  22371. =>WM: (14739: I2 ^level-1 R0-root)
  22372. <=WM: (14725: I2 ^level-1 R1-root)
  22373. --- END Input Phase ---
  22374. --- Proposal Phase ---
  22375. --- Inner Elaboration Phase, active level 1 (S1) ---
  22376. Firing elaborate*copy-see-to-output-link
  22377. -->
  22378. (I3 ^see 0 +)
  22379. Firing elaborate*reward*based*on*reward
  22380. -->
  22381. (R1055 ^value 1 +)
  22382. (R1 ^reward R1055 +)
  22383. Firing propose*predict-yes
  22384. -->
  22385. (O2103 ^name predict-yes +)
  22386. (S1 ^operator O2103 +)
  22387. Firing propose*predict-no
  22388. -->
  22389. (O2104 ^name predict-no +)
  22390. (S1 ^operator O2104 +)
  22391. Firing rl*prefer*rvt*predict-no*H0*6
  22392. -->
  22393. (S1 ^operator O2102 = 0.9999999999999999)
  22394. Firing rl*prefer*rvt*predict-yes*H0*5
  22395. -->
  22396. (S1 ^operator O2101 = 0.)
  22397. Firing prefer*rvt*predict-yes*H0
  22398. -->
  22399. Firing prefer*rvt*predict-no*H0
  22400. -->
  22401. Firing elaborate*copy-dir-to-output-link
  22402. -->
  22403. (I3 ^dir U +)
  22404. inner elaboration loop at bottom goal.
  22405. Retracting elaborate*copy-see-to-output-link
  22406. -->
  22407. (I3 ^see 1 +)
  22408. Retracting propose*predict-no
  22409. -->
  22410. (O2102 ^name predict-no +)
  22411. (S1 ^operator O2102 +)
  22412. Retracting propose*predict-yes
  22413. -->
  22414. (O2101 ^name predict-yes +)
  22415. (S1 ^operator O2101 +)
  22416. Retracting elaborate*reward*based*on*reward
  22417. -->
  22418. (R1054 ^value 1 +)
  22419. (R1 ^reward R1054 +)
  22420. Retracting elaborate*copy-dir-to-output-link
  22421. -->
  22422. (I3 ^dir R +)
  22423. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22424. -->
  22425. (S1 ^operator O2102 = 0.5523812848757654)
  22426. Retracting rl*prefer*rvt*predict-no*H0*4
  22427. -->
  22428. (S1 ^operator O2102 = 0.4476194484475643)
  22429. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22430. -->
  22431. (S1 ^operator O2101 = 0.1398795999120246)
  22432. Retracting rl*prefer*rvt*predict-yes*H0*3
  22433. -->
  22434. (S1 ^operator O2101 = 0.1844108849158159)
  22435. =>WM: (14747: S1 ^operator O2104 +)
  22436. =>WM: (14746: S1 ^operator O2103 +)
  22437. =>WM: (14745: I3 ^dir U)
  22438. =>WM: (14744: O2104 ^name predict-no)
  22439. =>WM: (14743: O2103 ^name predict-yes)
  22440. =>WM: (14742: R1055 ^value 1)
  22441. =>WM: (14741: R1 ^reward R1055)
  22442. =>WM: (14740: I3 ^see 0)
  22443. <=WM: (14731: S1 ^operator O2101 +)
  22444. <=WM: (14732: S1 ^operator O2102 +)
  22445. <=WM: (14733: S1 ^operator O2102)
  22446. <=WM: (14716: I3 ^dir R)
  22447. <=WM: (14727: R1 ^reward R1054)
  22448. <=WM: (14726: I3 ^see 1)
  22449. <=WM: (14730: O2102 ^name predict-no)
  22450. <=WM: (14729: O2101 ^name predict-yes)
  22451. <=WM: (14728: R1054 ^value 1)
  22452. --- Inner Elaboration Phase, active level 1 (S1) ---
  22453. Firing prefer*rvt*predict-yes*H0
  22454. -->
  22455. Firing rl*prefer*rvt*predict-yes*H0*5
  22456. -->
  22457. (S1 ^operator O2103 = 0.)
  22458. Firing prefer*rvt*predict-no*H0
  22459. -->
  22460. Firing rl*prefer*rvt*predict-no*H0*6
  22461. -->
  22462. (S1 ^operator O2104 = 0.9999999999999999)
  22463. inner elaboration loop at bottom goal.
  22464. Retracting rl*prefer*rvt*predict-no*H0*6
  22465. -->
  22466. (S1 ^operator O2102 = 0.9999999999999999)
  22467. Retracting rl*prefer*rvt*predict-yes*H0*5
  22468. -->
  22469. (S1 ^operator O2101 = 0.)
  22470. --- END Proposal Phase ---
  22471. --- Decision Phase ---
  22472. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.934783,0.0614091)
  22473. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552381 -> 0.377468 0.174914 0.552381(R,m,v=1,1,0)
  22474. =>WM: (14748: S1 ^operator O2104)
  22475. 1052: O: O2104 (predict-no)
  22476. --- END Decision Phase ---
  22477. --- Application Phase ---
  22478. --- Firing Productions (PE) For State At Depth 1 ---
  22479. --- Inner Elaboration Phase, active level 1 (S1) ---
  22480. Firing apply*operator
  22481. -->
  22482. (I3 ^predict-no N1052 + :O )
  22483. Firing apply*operator*complete
  22484. -->
  22485. (I3 ^predict-no N1051 - :O )
  22486. inner elaboration loop at bottom goal.
  22487. --- Change Working Memory (PE) ---
  22488. =>WM: (14749: I3 ^predict-no N1052)
  22489. <=WM: (14735: N1051 ^status complete)
  22490. <=WM: (14734: I3 ^predict-no N1051)
  22491. --- Firing Productions (IE) For State At Depth 1 ---
  22492. --- Inner Elaboration Phase, active level 1 (S1) ---
  22493. Firing monitor*world
  22494. -->
  22495. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22496. --- Change Working Memory (IE) ---
  22497. --- END Application Phase ---
  22498. --- Output Phase ---
  22499. ENV: Agent did: predict-no for direction U in state State-B
  22500. In State-B moving U
  22501. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22502. predict error 0
  22503. dir: dir isU
  22504. --- END Output Phase ---
  22505. \-/--- Input Phase ---
  22506. =>WM: (14753: I2 ^dir U)
  22507. =>WM: (14752: I2 ^reward 1)
  22508. =>WM: (14751: I2 ^see 0)
  22509. =>WM: (14750: N1052 ^status complete)
  22510. <=WM: (14738: I2 ^dir U)
  22511. <=WM: (14737: I2 ^reward 1)
  22512. <=WM: (14736: I2 ^see 0)
  22513. =>WM: (14754: I2 ^level-1 R0-root)
  22514. <=WM: (14739: I2 ^level-1 R0-root)
  22515. --- END Input Phase ---
  22516. --- Proposal Phase ---
  22517. --- Inner Elaboration Phase, active level 1 (S1) ---
  22518. Firing elaborate*copy-see-to-output-link
  22519. -->
  22520. (I3 ^see 0 +)
  22521. Firing elaborate*reward*based*on*reward
  22522. -->
  22523. (R1056 ^value 1 +)
  22524. (R1 ^reward R1056 +)
  22525. Firing propose*predict-yes
  22526. -->
  22527. (O2105 ^name predict-yes +)
  22528. (S1 ^operator O2105 +)
  22529. Firing propose*predict-no
  22530. -->
  22531. (O2106 ^name predict-no +)
  22532. (S1 ^operator O2106 +)
  22533. Firing rl*prefer*rvt*predict-no*H0*6
  22534. -->
  22535. (S1 ^operator O2104 = 0.9999999999999999)
  22536. Firing rl*prefer*rvt*predict-yes*H0*5
  22537. -->
  22538. (S1 ^operator O2103 = 0.)
  22539. Firing prefer*rvt*predict-yes*H0
  22540. -->
  22541. Firing prefer*rvt*predict-no*H0
  22542. -->
  22543. Firing elaborate*copy-dir-to-output-link
  22544. -->
  22545. (I3 ^dir U +)
  22546. inner elaboration loop at bottom goal.
  22547. Retracting elaborate*copy-see-to-output-link
  22548. -->
  22549. (I3 ^see 0 +)
  22550. Retracting propose*predict-no
  22551. -->
  22552. (O2104 ^name predict-no +)
  22553. (S1 ^operator O2104 +)
  22554. Retracting propose*predict-yes
  22555. -->
  22556. (O2103 ^name predict-yes +)
  22557. (S1 ^operator O2103 +)
  22558. Retracting elaborate*reward*based*on*reward
  22559. -->
  22560. (R1055 ^value 1 +)
  22561. (R1 ^reward R1055 +)
  22562. Retracting elaborate*copy-dir-to-output-link
  22563. -->
  22564. (I3 ^dir U +)
  22565. Retracting rl*prefer*rvt*predict-no*H0*6
  22566. -->
  22567. (S1 ^operator O2104 = 0.9999999999999999)
  22568. Retracting rl*prefer*rvt*predict-yes*H0*5
  22569. -->
  22570. (S1 ^operator O2103 = 0.)
  22571. =>WM: (14760: S1 ^operator O2106 +)
  22572. =>WM: (14759: S1 ^operator O2105 +)
  22573. =>WM: (14758: O2106 ^name predict-no)
  22574. =>WM: (14757: O2105 ^name predict-yes)
  22575. =>WM: (14756: R1056 ^value 1)
  22576. =>WM: (14755: R1 ^reward R1056)
  22577. <=WM: (14746: S1 ^operator O2103 +)
  22578. <=WM: (14747: S1 ^operator O2104 +)
  22579. <=WM: (14748: S1 ^operator O2104)
  22580. <=WM: (14741: R1 ^reward R1055)
  22581. <=WM: (14744: O2104 ^name predict-no)
  22582. <=WM: (14743: O2103 ^name predict-yes)
  22583. <=WM: (14742: R1055 ^value 1)
  22584. --- Inner Elaboration Phase, active level 1 (S1) ---
  22585. Firing prefer*rvt*predict-yes*H0
  22586. -->
  22587. Firing rl*prefer*rvt*predict-yes*H0*5
  22588. -->
  22589. (S1 ^operator O2105 = 0.)
  22590. Firing prefer*rvt*predict-no*H0
  22591. -->
  22592. Firing rl*prefer*rvt*predict-no*H0*6
  22593. -->
  22594. (S1 ^operator O2106 = 0.9999999999999999)
  22595. inner elaboration loop at bottom goal.
  22596. Retracting rl*prefer*rvt*predict-no*H0*6
  22597. -->
  22598. (S1 ^operator O2104 = 0.9999999999999999)
  22599. Retracting rl*prefer*rvt*predict-yes*H0*5
  22600. -->
  22601. (S1 ^operator O2103 = 0.)
  22602. --- END Proposal Phase ---
  22603. --- Decision Phase ---
  22604. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22605. =>WM: (14761: S1 ^operator O2106)
  22606. 1053: O: O2106 (predict-no)
  22607. --- END Decision Phase ---
  22608. --- Application Phase ---
  22609. --- Firing Productions (PE) For State At Depth 1 ---
  22610. --- Inner Elaboration Phase, active level 1 (S1) ---
  22611. Firing apply*operator
  22612. -->
  22613. (I3 ^predict-no N1053 + :O )
  22614. Firing apply*operator*complete
  22615. -->
  22616. (I3 ^predict-no N1052 - :O )
  22617. inner elaboration loop at bottom goal.
  22618. --- Change Working Memory (PE) ---
  22619. =>WM: (14762: I3 ^predict-no N1053)
  22620. <=WM: (14750: N1052 ^status complete)
  22621. <=WM: (14749: I3 ^predict-no N1052)
  22622. --- Firing Productions (IE) For State At Depth 1 ---
  22623. --- Inner Elaboration Phase, active level 1 (S1) ---
  22624. Firing monitor*world
  22625. -->
  22626. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22627. --- Change Working Memory (IE) ---
  22628. --- END Application Phase ---
  22629. --- Output Phase ---
  22630. ENV: Agent did: predict-no for direction U in state State-B
  22631. In State-B moving U
  22632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22633. predict error 0
  22634. dir: dir isL
  22635. --- END Output Phase ---
  22636. |\---- Input Phase ---
  22637. =>WM: (14766: I2 ^dir L)
  22638. =>WM: (14765: I2 ^reward 1)
  22639. =>WM: (14764: I2 ^see 0)
  22640. =>WM: (14763: N1053 ^status complete)
  22641. <=WM: (14753: I2 ^dir U)
  22642. <=WM: (14752: I2 ^reward 1)
  22643. <=WM: (14751: I2 ^see 0)
  22644. =>WM: (14767: I2 ^level-1 R0-root)
  22645. <=WM: (14754: I2 ^level-1 R0-root)
  22646. --- END Input Phase ---
  22647. --- Proposal Phase ---
  22648. --- Inner Elaboration Phase, active level 1 (S1) ---
  22649. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  22650. -->
  22651. (S1 ^operator O2105 = 0.6104607684602532)
  22652. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  22653. -->
  22654. (S1 ^operator O2106 = 0.1063475139796038)
  22655. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22656. -->
  22657. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22658. -->
  22659. Firing elaborate*copy-see-to-output-link
  22660. -->
  22661. (I3 ^see 0 +)
  22662. Firing elaborate*reward*based*on*reward
  22663. -->
  22664. (R1057 ^value 1 +)
  22665. (R1 ^reward R1057 +)
  22666. Firing propose*predict-yes
  22667. -->
  22668. (O2107 ^name predict-yes +)
  22669. (S1 ^operator O2107 +)
  22670. Firing propose*predict-no
  22671. -->
  22672. (O2108 ^name predict-no +)
  22673. (S1 ^operator O2108 +)
  22674. Firing rl*prefer*rvt*predict-no*H0*2
  22675. -->
  22676. (S1 ^operator O2106 = 0.3873353575493562)
  22677. Firing rl*prefer*rvt*predict-yes*H0*1
  22678. -->
  22679. (S1 ^operator O2105 = 0.3895395371935536)
  22680. Firing prefer*rvt*predict-yes*H0
  22681. -->
  22682. Firing prefer*rvt*predict-no*H0
  22683. -->
  22684. Firing elaborate*copy-dir-to-output-link
  22685. -->
  22686. (I3 ^dir L +)
  22687. inner elaboration loop at bottom goal.
  22688. Retracting elaborate*copy-see-to-output-link
  22689. -->
  22690. (I3 ^see 0 +)
  22691. Retracting propose*predict-no
  22692. -->
  22693. (O2106 ^name predict-no +)
  22694. (S1 ^operator O2106 +)
  22695. Retracting propose*predict-yes
  22696. -->
  22697. (O2105 ^name predict-yes +)
  22698. (S1 ^operator O2105 +)
  22699. Retracting elaborate*reward*based*on*reward
  22700. -->
  22701. (R1056 ^value 1 +)
  22702. (R1 ^reward R1056 +)
  22703. Retracting elaborate*copy-dir-to-output-link
  22704. -->
  22705. (I3 ^dir U +)
  22706. Retracting rl*prefer*rvt*predict-no*H0*6
  22707. -->
  22708. (S1 ^operator O2106 = 0.9999999999999999)
  22709. Retracting rl*prefer*rvt*predict-yes*H0*5
  22710. -->
  22711. (S1 ^operator O2105 = 0.)
  22712. =>WM: (14774: S1 ^operator O2108 +)
  22713. =>WM: (14773: S1 ^operator O2107 +)
  22714. =>WM: (14772: I3 ^dir L)
  22715. =>WM: (14771: O2108 ^name predict-no)
  22716. =>WM: (14770: O2107 ^name predict-yes)
  22717. =>WM: (14769: R1057 ^value 1)
  22718. =>WM: (14768: R1 ^reward R1057)
  22719. <=WM: (14759: S1 ^operator O2105 +)
  22720. <=WM: (14760: S1 ^operator O2106 +)
  22721. <=WM: (14761: S1 ^operator O2106)
  22722. <=WM: (14745: I3 ^dir U)
  22723. <=WM: (14755: R1 ^reward R1056)
  22724. <=WM: (14758: O2106 ^name predict-no)
  22725. <=WM: (14757: O2105 ^name predict-yes)
  22726. <=WM: (14756: R1056 ^value 1)
  22727. --- Inner Elaboration Phase, active level 1 (S1) ---
  22728. Firing prefer*rvt*predict-yes*H0
  22729. -->
  22730. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  22731. -->
  22732. (S1 ^operator O2107 = 0.6104607684602532)
  22733. Firing rl*prefer*rvt*predict-yes*H0*1
  22734. -->
  22735. (S1 ^operator O2107 = 0.3895395371935536)
  22736. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22737. -->
  22738. Firing prefer*rvt*predict-no*H0
  22739. -->
  22740. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  22741. -->
  22742. (S1 ^operator O2108 = 0.1063475139796038)
  22743. Firing rl*prefer*rvt*predict-no*H0*2
  22744. -->
  22745. (S1 ^operator O2108 = 0.3873353575493562)
  22746. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22747. -->
  22748. inner elaboration loop at bottom goal.
  22749. Retracting rl*prefer*rvt*predict-no*H0*2
  22750. -->
  22751. (S1 ^operator O2106 = 0.3873353575493562)
  22752. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  22753. -->
  22754. (S1 ^operator O2106 = 0.1063475139796038)
  22755. Retracting rl*prefer*rvt*predict-yes*H0*1
  22756. -->
  22757. (S1 ^operator O2105 = 0.3895395371935536)
  22758. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  22759. -->
  22760. (S1 ^operator O2105 = 0.6104607684602532)
  22761. --- END Proposal Phase ---
  22762. --- Decision Phase ---
  22763. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22764. =>WM: (14775: S1 ^operator O2107)
  22765. 1054: O: O2107 (predict-yes)
  22766. --- END Decision Phase ---
  22767. --- Application Phase ---
  22768. --- Firing Productions (PE) For State At Depth 1 ---
  22769. --- Inner Elaboration Phase, active level 1 (S1) ---
  22770. Firing apply*operator
  22771. -->
  22772. (I3 ^predict-yes N1054 + :O )
  22773. Firing apply*operator*complete
  22774. -->
  22775. (I3 ^predict-no N1053 - :O )
  22776. inner elaboration loop at bottom goal.
  22777. --- Change Working Memory (PE) ---
  22778. =>WM: (14776: I3 ^predict-yes N1054)
  22779. <=WM: (14763: N1053 ^status complete)
  22780. <=WM: (14762: I3 ^predict-no N1053)
  22781. --- Firing Productions (IE) For State At Depth 1 ---
  22782. --- Inner Elaboration Phase, active level 1 (S1) ---
  22783. Firing monitor*world
  22784. -->
  22785. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22786. --- Change Working Memory (IE) ---
  22787. --- END Application Phase ---
  22788. --- Output Phase ---
  22789. ENV: Agent did: predict-yes for direction L in state State-B
  22790. In State-B moving L
  22791. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  22792. predict error 0
  22793. dir: dir isL
  22794. --- END Output Phase ---
  22795. /|\--- Input Phase ---
  22796. =>WM: (14780: I2 ^dir L)
  22797. =>WM: (14779: I2 ^reward 1)
  22798. =>WM: (14778: I2 ^see 1)
  22799. =>WM: (14777: N1054 ^status complete)
  22800. <=WM: (14766: I2 ^dir L)
  22801. <=WM: (14765: I2 ^reward 1)
  22802. <=WM: (14764: I2 ^see 0)
  22803. =>WM: (14781: I2 ^level-1 L1-root)
  22804. <=WM: (14767: I2 ^level-1 R0-root)
  22805. --- END Input Phase ---
  22806. --- Proposal Phase ---
  22807. --- Inner Elaboration Phase, active level 1 (S1) ---
  22808. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  22809. -->
  22810. (S1 ^operator O2108 = 0.6126634012348675)
  22811. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  22812. -->
  22813. (S1 ^operator O2107 = -0.02274740735326741)
  22814. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22815. -->
  22816. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22817. -->
  22818. Firing elaborate*copy-see-to-output-link
  22819. -->
  22820. (I3 ^see 1 +)
  22821. Firing elaborate*reward*based*on*reward
  22822. -->
  22823. (R1058 ^value 1 +)
  22824. (R1 ^reward R1058 +)
  22825. Firing propose*predict-yes
  22826. -->
  22827. (O2109 ^name predict-yes +)
  22828. (S1 ^operator O2109 +)
  22829. Firing propose*predict-no
  22830. -->
  22831. (O2110 ^name predict-no +)
  22832. (S1 ^operator O2110 +)
  22833. Firing rl*prefer*rvt*predict-no*H0*2
  22834. -->
  22835. (S1 ^operator O2108 = 0.3873353575493562)
  22836. Firing rl*prefer*rvt*predict-yes*H0*1
  22837. -->
  22838. (S1 ^operator O2107 = 0.3895395371935536)
  22839. Firing prefer*rvt*predict-yes*H0
  22840. -->
  22841. Firing prefer*rvt*predict-no*H0
  22842. -->
  22843. Firing elaborate*copy-dir-to-output-link
  22844. -->
  22845. (I3 ^dir L +)
  22846. inner elaboration loop at bottom goal.
  22847. Retracting elaborate*copy-see-to-output-link
  22848. -->
  22849. (I3 ^see 0 +)
  22850. Retracting propose*predict-no
  22851. -->
  22852. (O2108 ^name predict-no +)
  22853. (S1 ^operator O2108 +)
  22854. Retracting propose*predict-yes
  22855. -->
  22856. (O2107 ^name predict-yes +)
  22857. (S1 ^operator O2107 +)
  22858. Retracting elaborate*reward*based*on*reward
  22859. -->
  22860. (R1057 ^value 1 +)
  22861. (R1 ^reward R1057 +)
  22862. Retracting elaborate*copy-dir-to-output-link
  22863. -->
  22864. (I3 ^dir L +)
  22865. Retracting rl*prefer*rvt*predict-no*H0*2
  22866. -->
  22867. (S1 ^operator O2108 = 0.3873353575493562)
  22868. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  22869. -->
  22870. (S1 ^operator O2108 = 0.1063475139796038)
  22871. Retracting rl*prefer*rvt*predict-yes*H0*1
  22872. -->
  22873. (S1 ^operator O2107 = 0.3895395371935536)
  22874. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  22875. -->
  22876. (S1 ^operator O2107 = 0.6104607684602532)
  22877. =>WM: (14788: S1 ^operator O2110 +)
  22878. =>WM: (14787: S1 ^operator O2109 +)
  22879. =>WM: (14786: O2110 ^name predict-no)
  22880. =>WM: (14785: O2109 ^name predict-yes)
  22881. =>WM: (14784: R1058 ^value 1)
  22882. =>WM: (14783: R1 ^reward R1058)
  22883. =>WM: (14782: I3 ^see 1)
  22884. <=WM: (14773: S1 ^operator O2107 +)
  22885. <=WM: (14775: S1 ^operator O2107)
  22886. <=WM: (14774: S1 ^operator O2108 +)
  22887. <=WM: (14768: R1 ^reward R1057)
  22888. <=WM: (14740: I3 ^see 0)
  22889. <=WM: (14771: O2108 ^name predict-no)
  22890. <=WM: (14770: O2107 ^name predict-yes)
  22891. <=WM: (14769: R1057 ^value 1)
  22892. --- Inner Elaboration Phase, active level 1 (S1) ---
  22893. Firing prefer*rvt*predict-yes*H0
  22894. -->
  22895. Firing rl*prefer*rvt*predict-yes*H0*1
  22896. -->
  22897. (S1 ^operator O2109 = 0.3895395371935536)
  22898. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  22899. -->
  22900. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  22901. -->
  22902. (S1 ^operator O2109 = -0.02274740735326741)
  22903. Firing prefer*rvt*predict-no*H0
  22904. -->
  22905. Firing rl*prefer*rvt*predict-no*H0*2
  22906. -->
  22907. (S1 ^operator O2110 = 0.3873353575493562)
  22908. Firing prefer*rvt*predict-no*H0*2*v1*H1
  22909. -->
  22910. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  22911. -->
  22912. (S1 ^operator O2110 = 0.6126634012348675)
  22913. inner elaboration loop at bottom goal.
  22914. Retracting rl*prefer*rvt*predict-no*H0*2
  22915. -->
  22916. (S1 ^operator O2108 = 0.3873353575493562)
  22917. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  22918. -->
  22919. (S1 ^operator O2108 = 0.6126634012348675)
  22920. Retracting rl*prefer*rvt*predict-yes*H0*1
  22921. -->
  22922. (S1 ^operator O2107 = 0.3895395371935536)
  22923. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  22924. -->
  22925. (S1 ^operator O2107 = -0.02274740735326741)
  22926. --- END Proposal Phase ---
  22927. --- Decision Phase ---
  22928. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.897143,0.0928079)
  22929. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  22930. =>WM: (14789: S1 ^operator O2110)
  22931. 1055: O: O2110 (predict-no)
  22932. --- END Decision Phase ---
  22933. --- Application Phase ---
  22934. --- Firing Productions (PE) For State At Depth 1 ---
  22935. --- Inner Elaboration Phase, active level 1 (S1) ---
  22936. Firing apply*operator
  22937. -->
  22938. (I3 ^predict-no N1055 + :O )
  22939. Firing apply*operator*complete
  22940. -->
  22941. (I3 ^predict-yes N1054 - :O )
  22942. inner elaboration loop at bottom goal.
  22943. --- Change Working Memory (PE) ---
  22944. =>WM: (14790: I3 ^predict-no N1055)
  22945. <=WM: (14777: N1054 ^status complete)
  22946. <=WM: (14776: I3 ^predict-yes N1054)
  22947. --- Firing Productions (IE) For State At Depth 1 ---
  22948. --- Inner Elaboration Phase, active level 1 (S1) ---
  22949. Firing monitor*world
  22950. -->
  22951. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22952. --- Change Working Memory (IE) ---
  22953. --- END Application Phase ---
  22954. --- Output Phase ---
  22955. ENV: Agent did: predict-no for direction L in state State-A
  22956. In State-A moving L
  22957. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22958. predict error 0
  22959. dir: dir isR
  22960. --- END Output Phase ---
  22961. -/--- Input Phase ---
  22962. =>WM: (14794: I2 ^dir R)
  22963. =>WM: (14793: I2 ^reward 1)
  22964. =>WM: (14792: I2 ^see 0)
  22965. =>WM: (14791: N1055 ^status complete)
  22966. <=WM: (14780: I2 ^dir L)
  22967. <=WM: (14779: I2 ^reward 1)
  22968. <=WM: (14778: I2 ^see 1)
  22969. =>WM: (14795: I2 ^level-1 L0-root)
  22970. <=WM: (14781: I2 ^level-1 L1-root)
  22971. --- END Input Phase ---
  22972. --- Proposal Phase ---
  22973. --- Inner Elaboration Phase, active level 1 (S1) ---
  22974. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22975. -->
  22976. (S1 ^operator O2109 = 0.8155906332983849)
  22977. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22978. -->
  22979. (S1 ^operator O2110 = -0.00558448899823713)
  22980. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22981. -->
  22982. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22983. -->
  22984. Firing elaborate*copy-see-to-output-link
  22985. -->
  22986. (I3 ^see 0 +)
  22987. Firing elaborate*reward*based*on*reward
  22988. -->
  22989. (R1059 ^value 1 +)
  22990. (R1 ^reward R1059 +)
  22991. Firing propose*predict-yes
  22992. -->
  22993. (O2111 ^name predict-yes +)
  22994. (S1 ^operator O2111 +)
  22995. Firing propose*predict-no
  22996. -->
  22997. (O2112 ^name predict-no +)
  22998. (S1 ^operator O2112 +)
  22999. Firing rl*prefer*rvt*predict-no*H0*4
  23000. -->
  23001. (S1 ^operator O2110 = 0.4476193384490649)
  23002. Firing rl*prefer*rvt*predict-yes*H0*3
  23003. -->
  23004. (S1 ^operator O2109 = 0.1844108849158159)
  23005. Firing prefer*rvt*predict-yes*H0
  23006. -->
  23007. Firing prefer*rvt*predict-no*H0
  23008. -->
  23009. Firing elaborate*copy-dir-to-output-link
  23010. -->
  23011. (I3 ^dir R +)
  23012. inner elaboration loop at bottom goal.
  23013. Retracting elaborate*copy-see-to-output-link
  23014. -->
  23015. (I3 ^see 1 +)
  23016. Retracting propose*predict-no
  23017. -->
  23018. (O2110 ^name predict-no +)
  23019. (S1 ^operator O2110 +)
  23020. Retracting propose*predict-yes
  23021. -->
  23022. (O2109 ^name predict-yes +)
  23023. (S1 ^operator O2109 +)
  23024. Retracting elaborate*reward*based*on*reward
  23025. -->
  23026. (R1058 ^value 1 +)
  23027. (R1 ^reward R1058 +)
  23028. Retracting elaborate*copy-dir-to-output-link
  23029. -->
  23030. (I3 ^dir L +)
  23031. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  23032. -->
  23033. (S1 ^operator O2110 = 0.6126634012348675)
  23034. Retracting rl*prefer*rvt*predict-no*H0*2
  23035. -->
  23036. (S1 ^operator O2110 = 0.3873353575493562)
  23037. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  23038. -->
  23039. (S1 ^operator O2109 = -0.02274740735326741)
  23040. Retracting rl*prefer*rvt*predict-yes*H0*1
  23041. -->
  23042. (S1 ^operator O2109 = 0.3895394913454826)
  23043. =>WM: (14803: S1 ^operator O2112 +)
  23044. =>WM: (14802: S1 ^operator O2111 +)
  23045. =>WM: (14801: I3 ^dir R)
  23046. =>WM: (14800: O2112 ^name predict-no)
  23047. =>WM: (14799: O2111 ^name predict-yes)
  23048. =>WM: (14798: R1059 ^value 1)
  23049. =>WM: (14797: R1 ^reward R1059)
  23050. =>WM: (14796: I3 ^see 0)
  23051. <=WM: (14787: S1 ^operator O2109 +)
  23052. <=WM: (14788: S1 ^operator O2110 +)
  23053. <=WM: (14789: S1 ^operator O2110)
  23054. <=WM: (14772: I3 ^dir L)
  23055. <=WM: (14783: R1 ^reward R1058)
  23056. <=WM: (14782: I3 ^see 1)
  23057. <=WM: (14786: O2110 ^name predict-no)
  23058. <=WM: (14785: O2109 ^name predict-yes)
  23059. <=WM: (14784: R1058 ^value 1)
  23060. --- Inner Elaboration Phase, active level 1 (S1) ---
  23061. Firing prefer*rvt*predict-yes*H0
  23062. -->
  23063. Firing rl*prefer*rvt*predict-yes*H0*3
  23064. -->
  23065. (S1 ^operator O2111 = 0.1844108849158159)
  23066. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23067. -->
  23068. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  23069. -->
  23070. (S1 ^operator O2111 = 0.8155906332983849)
  23071. Firing prefer*rvt*predict-no*H0
  23072. -->
  23073. Firing rl*prefer*rvt*predict-no*H0*4
  23074. -->
  23075. (S1 ^operator O2112 = 0.4476193384490649)
  23076. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23077. -->
  23078. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  23079. -->
  23080. (S1 ^operator O2112 = -0.00558448899823713)
  23081. inner elaboration loop at bottom goal.
  23082. Retracting rl*prefer*rvt*predict-no*H0*4
  23083. -->
  23084. (S1 ^operator O2110 = 0.4476193384490649)
  23085. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  23086. -->
  23087. (S1 ^operator O2110 = -0.00558448899823713)
  23088. Retracting rl*prefer*rvt*predict-yes*H0*3
  23089. -->
  23090. (S1 ^operator O2109 = 0.1844108849158159)
  23091. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  23092. -->
  23093. (S1 ^operator O2109 = 0.8155906332983849)
  23094. --- END Proposal Phase ---
  23095. --- Decision Phase ---
  23096. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387335 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.935829,0.0603761)
  23097. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.28092 0.331744 0.612663 -> 0.28092 0.331744 0.612664(R,m,v=1,1,0)
  23098. =>WM: (14804: S1 ^operator O2111)
  23099. 1056: O: O2111 (predict-yes)
  23100. --- END Decision Phase ---
  23101. --- Application Phase ---
  23102. --- Firing Productions (PE) For State At Depth 1 ---
  23103. --- Inner Elaboration Phase, active level 1 (S1) ---
  23104. Firing apply*operator
  23105. -->
  23106. (I3 ^predict-yes N1056 + :O )
  23107. Firing apply*operator*complete
  23108. -->
  23109. (I3 ^predict-no N1055 - :O )
  23110. inner elaboration loop at bottom goal.
  23111. --- Change Working Memory (PE) ---
  23112. =>WM: (14805: I3 ^predict-yes N1056)
  23113. <=WM: (14791: N1055 ^status complete)
  23114. <=WM: (14790: I3 ^predict-no N1055)
  23115. --- Firing Productions (IE) For State At Depth 1 ---
  23116. --- Inner Elaboration Phase, active level 1 (S1) ---
  23117. Firing monitor*world
  23118. -->
  23119. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23120. --- Change Working Memory (IE) ---
  23121. --- END Application Phase ---
  23122. --- Output Phase ---
  23123. ENV: Agent did: predict-yes for direction R in state State-A
  23124. In State-A moving R
  23125. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  23126. predict error 0
  23127. dir: dir isU
  23128. --- END Output Phase ---
  23129. |\---- Input Phase ---
  23130. =>WM: (14809: I2 ^dir U)
  23131. =>WM: (14808: I2 ^reward 1)
  23132. =>WM: (14807: I2 ^see 1)
  23133. =>WM: (14806: N1056 ^status complete)
  23134. <=WM: (14794: I2 ^dir R)
  23135. <=WM: (14793: I2 ^reward 1)
  23136. <=WM: (14792: I2 ^see 0)
  23137. =>WM: (14810: I2 ^level-1 R1-root)
  23138. <=WM: (14795: I2 ^level-1 L0-root)
  23139. --- END Input Phase ---
  23140. --- Proposal Phase ---
  23141. --- Inner Elaboration Phase, active level 1 (S1) ---
  23142. Firing elaborate*copy-see-to-output-link
  23143. -->
  23144. (I3 ^see 1 +)
  23145. Firing elaborate*reward*based*on*reward
  23146. -->
  23147. (R1060 ^value 1 +)
  23148. (R1 ^reward R1060 +)
  23149. Firing propose*predict-yes
  23150. -->
  23151. (O2113 ^name predict-yes +)
  23152. (S1 ^operator O2113 +)
  23153. Firing propose*predict-no
  23154. -->
  23155. (O2114 ^name predict-no +)
  23156. (S1 ^operator O2114 +)
  23157. Firing rl*prefer*rvt*predict-no*H0*6
  23158. -->
  23159. (S1 ^operator O2112 = 0.9999999999999999)
  23160. Firing rl*prefer*rvt*predict-yes*H0*5
  23161. -->
  23162. (S1 ^operator O2111 = 0.)
  23163. Firing prefer*rvt*predict-yes*H0
  23164. -->
  23165. Firing prefer*rvt*predict-no*H0
  23166. -->
  23167. Firing elaborate*copy-dir-to-output-link
  23168. -->
  23169. (I3 ^dir U +)
  23170. inner elaboration loop at bottom goal.
  23171. Retracting elaborate*copy-see-to-output-link
  23172. -->
  23173. (I3 ^see 0 +)
  23174. Retracting propose*predict-no
  23175. -->
  23176. (O2112 ^name predict-no +)
  23177. (S1 ^operator O2112 +)
  23178. Retracting propose*predict-yes
  23179. -->
  23180. (O2111 ^name predict-yes +)
  23181. (S1 ^operator O2111 +)
  23182. Retracting elaborate*reward*based*on*reward
  23183. -->
  23184. (R1059 ^value 1 +)
  23185. (R1 ^reward R1059 +)
  23186. Retracting elaborate*copy-dir-to-output-link
  23187. -->
  23188. (I3 ^dir R +)
  23189. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  23190. -->
  23191. (S1 ^operator O2112 = -0.00558448899823713)
  23192. Retracting rl*prefer*rvt*predict-no*H0*4
  23193. -->
  23194. (S1 ^operator O2112 = 0.4476193384490649)
  23195. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  23196. -->
  23197. (S1 ^operator O2111 = 0.8155906332983849)
  23198. Retracting rl*prefer*rvt*predict-yes*H0*3
  23199. -->
  23200. (S1 ^operator O2111 = 0.1844108849158159)
  23201. =>WM: (14818: S1 ^operator O2114 +)
  23202. =>WM: (14817: S1 ^operator O2113 +)
  23203. =>WM: (14816: I3 ^dir U)
  23204. =>WM: (14815: O2114 ^name predict-no)
  23205. =>WM: (14814: O2113 ^name predict-yes)
  23206. =>WM: (14813: R1060 ^value 1)
  23207. =>WM: (14812: R1 ^reward R1060)
  23208. =>WM: (14811: I3 ^see 1)
  23209. <=WM: (14802: S1 ^operator O2111 +)
  23210. <=WM: (14804: S1 ^operator O2111)
  23211. <=WM: (14803: S1 ^operator O2112 +)
  23212. <=WM: (14801: I3 ^dir R)
  23213. <=WM: (14797: R1 ^reward R1059)
  23214. <=WM: (14796: I3 ^see 0)
  23215. <=WM: (14800: O2112 ^name predict-no)
  23216. <=WM: (14799: O2111 ^name predict-yes)
  23217. <=WM: (14798: R1059 ^value 1)
  23218. --- Inner Elaboration Phase, active level 1 (S1) ---
  23219. Firing prefer*rvt*predict-yes*H0
  23220. -->
  23221. Firing rl*prefer*rvt*predict-yes*H0*5
  23222. -->
  23223. (S1 ^operator O2113 = 0.)
  23224. Firing prefer*rvt*predict-no*H0
  23225. -->
  23226. Firing rl*prefer*rvt*predict-no*H0*6
  23227. -->
  23228. (S1 ^operator O2114 = 0.9999999999999999)
  23229. inner elaboration loop at bottom goal.
  23230. Retracting rl*prefer*rvt*predict-no*H0*6
  23231. -->
  23232. (S1 ^operator O2112 = 0.9999999999999999)
  23233. Retracting rl*prefer*rvt*predict-yes*H0*5
  23234. -->
  23235. (S1 ^operator O2111 = 0.)
  23236. --- END Proposal Phase ---
  23237. --- Decision Phase ---
  23238. RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491003 0.184411 -> 0.675413 -0.491002 0.184411(R,m,v=1,0.904494,0.0868723)
  23239. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324589 0.491002 0.815591 -> 0.324588 0.491002 0.81559(R,m,v=1,1,0)
  23240. =>WM: (14819: S1 ^operator O2114)
  23241. 1057: O: O2114 (predict-no)
  23242. --- END Decision Phase ---
  23243. --- Application Phase ---
  23244. --- Firing Productions (PE) For State At Depth 1 ---
  23245. --- Inner Elaboration Phase, active level 1 (S1) ---
  23246. Firing apply*operator
  23247. -->
  23248. (I3 ^predict-no N1057 + :O )
  23249. Firing apply*operator*complete
  23250. -->
  23251. (I3 ^predict-yes N1056 - :O )
  23252. inner elaboration loop at bottom goal.
  23253. --- Change Working Memory (PE) ---
  23254. =>WM: (14820: I3 ^predict-no N1057)
  23255. <=WM: (14806: N1056 ^status complete)
  23256. <=WM: (14805: I3 ^predict-yes N1056)
  23257. --- Firing Productions (IE) For State At Depth 1 ---
  23258. --- Inner Elaboration Phase, active level 1 (S1) ---
  23259. Firing monitor*world
  23260. -->
  23261. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23262. --- Change Working Memory (IE) ---
  23263. --- END Application Phase ---
  23264. --- Output Phase ---
  23265. ENV: Agent did: predict-no for direction U in state State-B
  23266. In State-B moving U
  23267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23268. predict error 0
  23269. dir: dir isU
  23270. --- END Output Phase ---
  23271. /|\--- Input Phase ---
  23272. =>WM: (14824: I2 ^dir U)
  23273. =>WM: (14823: I2 ^reward 1)
  23274. =>WM: (14822: I2 ^see 0)
  23275. =>WM: (14821: N1057 ^status complete)
  23276. <=WM: (14809: I2 ^dir U)
  23277. <=WM: (14808: I2 ^reward 1)
  23278. <=WM: (14807: I2 ^see 1)
  23279. =>WM: (14825: I2 ^level-1 R1-root)
  23280. <=WM: (14810: I2 ^level-1 R1-root)
  23281. --- END Input Phase ---
  23282. --- Proposal Phase ---
  23283. --- Inner Elaboration Phase, active level 1 (S1) ---
  23284. Firing elaborate*copy-see-to-output-link
  23285. -->
  23286. (I3 ^see 0 +)
  23287. Firing elaborate*reward*based*on*reward
  23288. -->
  23289. (R1061 ^value 1 +)
  23290. (R1 ^reward R1061 +)
  23291. Firing propose*predict-yes
  23292. -->
  23293. (O2115 ^name predict-yes +)
  23294. (S1 ^operator O2115 +)
  23295. Firing propose*predict-no
  23296. -->
  23297. (O2116 ^name predict-no +)
  23298. (S1 ^operator O2116 +)
  23299. Firing rl*prefer*rvt*predict-no*H0*6
  23300. -->
  23301. (S1 ^operator O2114 = 0.9999999999999999)
  23302. Firing rl*prefer*rvt*predict-yes*H0*5
  23303. -->
  23304. (S1 ^operator O2113 = 0.)
  23305. Firing prefer*rvt*predict-yes*H0
  23306. -->
  23307. Firing prefer*rvt*predict-no*H0
  23308. -->
  23309. Firing elaborate*copy-dir-to-output-link
  23310. -->
  23311. (I3 ^dir U +)
  23312. inner elaboration loop at bottom goal.
  23313. Retracting elaborate*copy-see-to-output-link
  23314. -->
  23315. (I3 ^see 1 +)
  23316. Retracting propose*predict-no
  23317. -->
  23318. (O2114 ^name predict-no +)
  23319. (S1 ^operator O2114 +)
  23320. Retracting propose*predict-yes
  23321. -->
  23322. (O2113 ^name predict-yes +)
  23323. (S1 ^operator O2113 +)
  23324. Retracting elaborate*reward*based*on*reward
  23325. -->
  23326. (R1060 ^value 1 +)
  23327. (R1 ^reward R1060 +)
  23328. Retracting elaborate*copy-dir-to-output-link
  23329. -->
  23330. (I3 ^dir U +)
  23331. Retracting rl*prefer*rvt*predict-no*H0*6
  23332. -->
  23333. (S1 ^operator O2114 = 0.9999999999999999)
  23334. Retracting rl*prefer*rvt*predict-yes*H0*5
  23335. -->
  23336. (S1 ^operator O2113 = 0.)
  23337. =>WM: (14832: S1 ^operator O2116 +)
  23338. =>WM: (14831: S1 ^operator O2115 +)
  23339. =>WM: (14830: O2116 ^name predict-no)
  23340. =>WM: (14829: O2115 ^name predict-yes)
  23341. =>WM: (14828: R1061 ^value 1)
  23342. =>WM: (14827: R1 ^reward R1061)
  23343. =>WM: (14826: I3 ^see 0)
  23344. <=WM: (14817: S1 ^operator O2113 +)
  23345. <=WM: (14818: S1 ^operator O2114 +)
  23346. <=WM: (14819: S1 ^operator O2114)
  23347. <=WM: (14812: R1 ^reward R1060)
  23348. <=WM: (14811: I3 ^see 1)
  23349. <=WM: (14815: O2114 ^name predict-no)
  23350. <=WM: (14814: O2113 ^name predict-yes)
  23351. <=WM: (14813: R1060 ^value 1)
  23352. --- Inner Elaboration Phase, active level 1 (S1) ---
  23353. Firing prefer*rvt*predict-yes*H0
  23354. -->
  23355. Firing rl*prefer*rvt*predict-yes*H0*5
  23356. -->
  23357. (S1 ^operator O2115 = 0.)
  23358. Firing prefer*rvt*predict-no*H0
  23359. -->
  23360. Firing rl*prefer*rvt*predict-no*H0*6
  23361. -->
  23362. (S1 ^operator O2116 = 0.9999999999999999)
  23363. inner elaboration loop at bottom goal.
  23364. Retracting rl*prefer*rvt*predict-no*H0*6
  23365. -->
  23366. (S1 ^operator O2114 = 0.9999999999999999)
  23367. Retracting rl*prefer*rvt*predict-yes*H0*5
  23368. -->
  23369. (S1 ^operator O2113 = 0.)
  23370. --- END Proposal Phase ---
  23371. --- Decision Phase ---
  23372. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23373. =>WM: (14833: S1 ^operator O2116)
  23374. 1058: O: O2116 (predict-no)
  23375. --- END Decision Phase ---
  23376. --- Application Phase ---
  23377. --- Firing Productions (PE) For State At Depth 1 ---
  23378. --- Inner Elaboration Phase, active level 1 (S1) ---
  23379. Firing apply*operator
  23380. -->
  23381. (I3 ^predict-no N1058 + :O )
  23382. Firing apply*operator*complete
  23383. -->
  23384. (I3 ^predict-no N1057 - :O )
  23385. inner elaboration loop at bottom goal.
  23386. --- Change Working Memory (PE) ---
  23387. =>WM: (14834: I3 ^predict-no N1058)
  23388. <=WM: (14821: N1057 ^status complete)
  23389. <=WM: (14820: I3 ^predict-no N1057)
  23390. --- Firing Productions (IE) For State At Depth 1 ---
  23391. --- Inner Elaboration Phase, active level 1 (S1) ---
  23392. Firing monitor*world
  23393. -->
  23394. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23395. --- Change Working Memory (IE) ---
  23396. --- END Application Phase ---
  23397. --- Output Phase ---
  23398. ENV: Agent did: predict-no for direction U in state State-B
  23399. In State-B moving U
  23400. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23401. predict error 0
  23402. dir: dir isR
  23403. --- END Output Phase ---
  23404. -/|\--- Input Phase ---
  23405. =>WM: (14838: I2 ^dir R)
  23406. =>WM: (14837: I2 ^reward 1)
  23407. =>WM: (14836: I2 ^see 0)
  23408. =>WM: (14835: N1058 ^status complete)
  23409. <=WM: (14824: I2 ^dir U)
  23410. <=WM: (14823: I2 ^reward 1)
  23411. <=WM: (14822: I2 ^see 0)
  23412. =>WM: (14839: I2 ^level-1 R1-root)
  23413. <=WM: (14825: I2 ^level-1 R1-root)
  23414. --- END Input Phase ---
  23415. --- Proposal Phase ---
  23416. --- Inner Elaboration Phase, active level 1 (S1) ---
  23417. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  23418. -->
  23419. (S1 ^operator O2115 = 0.1398795999120246)
  23420. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  23421. -->
  23422. (S1 ^operator O2116 = 0.552381174877266)
  23423. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23424. -->
  23425. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23426. -->
  23427. Firing elaborate*copy-see-to-output-link
  23428. -->
  23429. (I3 ^see 0 +)
  23430. Firing elaborate*reward*based*on*reward
  23431. -->
  23432. (R1062 ^value 1 +)
  23433. (R1 ^reward R1062 +)
  23434. Firing propose*predict-yes
  23435. -->
  23436. (O2117 ^name predict-yes +)
  23437. (S1 ^operator O2117 +)
  23438. Firing propose*predict-no
  23439. -->
  23440. (O2118 ^name predict-no +)
  23441. (S1 ^operator O2118 +)
  23442. Firing rl*prefer*rvt*predict-no*H0*4
  23443. -->
  23444. (S1 ^operator O2116 = 0.4476193384490649)
  23445. Firing rl*prefer*rvt*predict-yes*H0*3
  23446. -->
  23447. (S1 ^operator O2115 = 0.1844106571836858)
  23448. Firing prefer*rvt*predict-yes*H0
  23449. -->
  23450. Firing prefer*rvt*predict-no*H0
  23451. -->
  23452. Firing elaborate*copy-dir-to-output-link
  23453. -->
  23454. (I3 ^dir R +)
  23455. inner elaboration loop at bottom goal.
  23456. Retracting elaborate*copy-see-to-output-link
  23457. -->
  23458. (I3 ^see 0 +)
  23459. Retracting propose*predict-no
  23460. -->
  23461. (O2116 ^name predict-no +)
  23462. (S1 ^operator O2116 +)
  23463. Retracting propose*predict-yes
  23464. -->
  23465. (O2115 ^name predict-yes +)
  23466. (S1 ^operator O2115 +)
  23467. Retracting elaborate*reward*based*on*reward
  23468. -->
  23469. (R1061 ^value 1 +)
  23470. (R1 ^reward R1061 +)
  23471. Retracting elaborate*copy-dir-to-output-link
  23472. -->
  23473. (I3 ^dir U +)
  23474. Retracting rl*prefer*rvt*predict-no*H0*6
  23475. -->
  23476. (S1 ^operator O2116 = 0.9999999999999999)
  23477. Retracting rl*prefer*rvt*predict-yes*H0*5
  23478. -->
  23479. (S1 ^operator O2115 = 0.)
  23480. =>WM: (14846: S1 ^operator O2118 +)
  23481. =>WM: (14845: S1 ^operator O2117 +)
  23482. =>WM: (14844: I3 ^dir R)
  23483. =>WM: (14843: O2118 ^name predict-no)
  23484. =>WM: (14842: O2117 ^name predict-yes)
  23485. =>WM: (14841: R1062 ^value 1)
  23486. =>WM: (14840: R1 ^reward R1062)
  23487. <=WM: (14831: S1 ^operator O2115 +)
  23488. <=WM: (14832: S1 ^operator O2116 +)
  23489. <=WM: (14833: S1 ^operator O2116)
  23490. <=WM: (14816: I3 ^dir U)
  23491. <=WM: (14827: R1 ^reward R1061)
  23492. <=WM: (14830: O2116 ^name predict-no)
  23493. <=WM: (14829: O2115 ^name predict-yes)
  23494. <=WM: (14828: R1061 ^value 1)
  23495. --- Inner Elaboration Phase, active level 1 (S1) ---
  23496. Firing prefer*rvt*predict-yes*H0
  23497. -->
  23498. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  23499. -->
  23500. (S1 ^operator O2117 = 0.1398795999120246)
  23501. Firing rl*prefer*rvt*predict-yes*H0*3
  23502. -->
  23503. (S1 ^operator O2117 = 0.1844106571836858)
  23504. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23505. -->
  23506. Firing prefer*rvt*predict-no*H0
  23507. -->
  23508. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  23509. -->
  23510. (S1 ^operator O2118 = 0.552381174877266)
  23511. Firing rl*prefer*rvt*predict-no*H0*4
  23512. -->
  23513. (S1 ^operator O2118 = 0.4476193384490649)
  23514. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23515. -->
  23516. inner elaboration loop at bottom goal.
  23517. Retracting rl*prefer*rvt*predict-no*H0*4
  23518. -->
  23519. (S1 ^operator O2116 = 0.4476193384490649)
  23520. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  23521. -->
  23522. (S1 ^operator O2116 = 0.552381174877266)
  23523. Retracting rl*prefer*rvt*predict-yes*H0*3
  23524. -->
  23525. (S1 ^operator O2115 = 0.1844106571836858)
  23526. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  23527. -->
  23528. (S1 ^operator O2115 = 0.1398795999120246)
  23529. --- END Proposal Phase ---
  23530. --- Decision Phase ---
  23531. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23532. =>WM: (14847: S1 ^operator O2118)
  23533. 1059: O: O2118 (predict-no)
  23534. --- END Decision Phase ---
  23535. --- Application Phase ---
  23536. --- Firing Productions (PE) For State At Depth 1 ---
  23537. --- Inner Elaboration Phase, active level 1 (S1) ---
  23538. Firing apply*operator
  23539. -->
  23540. (I3 ^predict-no N1059 + :O )
  23541. Firing apply*operator*complete
  23542. -->
  23543. (I3 ^predict-no N1058 - :O )
  23544. inner elaboration loop at bottom goal.
  23545. --- Change Working Memory (PE) ---
  23546. =>WM: (14848: I3 ^predict-no N1059)
  23547. <=WM: (14835: N1058 ^status complete)
  23548. <=WM: (14834: I3 ^predict-no N1058)
  23549. --- Firing Productions (IE) For State At Depth 1 ---
  23550. --- Inner Elaboration Phase, active level 1 (S1) ---
  23551. Firing monitor*world
  23552. -->
  23553. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23554. --- Change Working Memory (IE) ---
  23555. --- END Application Phase ---
  23556. --- Output Phase ---
  23557. ENV: Agent did: predict-no for direction R in state State-B
  23558. In State-B moving R
  23559. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23560. predict error 0
  23561. dir: dir isU
  23562. --- END Output Phase ---
  23563. -/--- Input Phase ---
  23564. =>WM: (14852: I2 ^dir U)
  23565. =>WM: (14851: I2 ^reward 1)
  23566. =>WM: (14850: I2 ^see 0)
  23567. =>WM: (14849: N1059 ^status complete)
  23568. <=WM: (14838: I2 ^dir R)
  23569. <=WM: (14837: I2 ^reward 1)
  23570. <=WM: (14836: I2 ^see 0)
  23571. =>WM: (14853: I2 ^level-1 R0-root)
  23572. <=WM: (14839: I2 ^level-1 R1-root)
  23573. --- END Input Phase ---
  23574. --- Proposal Phase ---
  23575. --- Inner Elaboration Phase, active level 1 (S1) ---
  23576. Firing elaborate*copy-see-to-output-link
  23577. -->
  23578. (I3 ^see 0 +)
  23579. Firing elaborate*reward*based*on*reward
  23580. -->
  23581. (R1063 ^value 1 +)
  23582. (R1 ^reward R1063 +)
  23583. Firing propose*predict-yes
  23584. -->
  23585. (O2119 ^name predict-yes +)
  23586. (S1 ^operator O2119 +)
  23587. Firing propose*predict-no
  23588. -->
  23589. (O2120 ^name predict-no +)
  23590. (S1 ^operator O2120 +)
  23591. Firing rl*prefer*rvt*predict-no*H0*6
  23592. -->
  23593. (S1 ^operator O2118 = 0.9999999999999999)
  23594. Firing rl*prefer*rvt*predict-yes*H0*5
  23595. -->
  23596. (S1 ^operator O2117 = 0.)
  23597. Firing prefer*rvt*predict-yes*H0
  23598. -->
  23599. Firing prefer*rvt*predict-no*H0
  23600. -->
  23601. Firing elaborate*copy-dir-to-output-link
  23602. -->
  23603. (I3 ^dir U +)
  23604. inner elaboration loop at bottom goal.
  23605. Retracting elaborate*copy-see-to-output-link
  23606. -->
  23607. (I3 ^see 0 +)
  23608. Retracting propose*predict-no
  23609. -->
  23610. (O2118 ^name predict-no +)
  23611. (S1 ^operator O2118 +)
  23612. Retracting propose*predict-yes
  23613. -->
  23614. (O2117 ^name predict-yes +)
  23615. (S1 ^operator O2117 +)
  23616. Retracting elaborate*reward*based*on*reward
  23617. -->
  23618. (R1062 ^value 1 +)
  23619. (R1 ^reward R1062 +)
  23620. Retracting elaborate*copy-dir-to-output-link
  23621. -->
  23622. (I3 ^dir R +)
  23623. Retracting rl*prefer*rvt*predict-no*H0*4
  23624. -->
  23625. (S1 ^operator O2118 = 0.4476193384490649)
  23626. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  23627. -->
  23628. (S1 ^operator O2118 = 0.552381174877266)
  23629. Retracting rl*prefer*rvt*predict-yes*H0*3
  23630. -->
  23631. (S1 ^operator O2117 = 0.1844106571836858)
  23632. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  23633. -->
  23634. (S1 ^operator O2117 = 0.1398795999120246)
  23635. =>WM: (14860: S1 ^operator O2120 +)
  23636. =>WM: (14859: S1 ^operator O2119 +)
  23637. =>WM: (14858: I3 ^dir U)
  23638. =>WM: (14857: O2120 ^name predict-no)
  23639. =>WM: (14856: O2119 ^name predict-yes)
  23640. =>WM: (14855: R1063 ^value 1)
  23641. =>WM: (14854: R1 ^reward R1063)
  23642. <=WM: (14845: S1 ^operator O2117 +)
  23643. <=WM: (14846: S1 ^operator O2118 +)
  23644. <=WM: (14847: S1 ^operator O2118)
  23645. <=WM: (14844: I3 ^dir R)
  23646. <=WM: (14840: R1 ^reward R1062)
  23647. <=WM: (14843: O2118 ^name predict-no)
  23648. <=WM: (14842: O2117 ^name predict-yes)
  23649. <=WM: (14841: R1062 ^value 1)
  23650. --- Inner Elaboration Phase, active level 1 (S1) ---
  23651. Firing prefer*rvt*predict-yes*H0
  23652. -->
  23653. Firing rl*prefer*rvt*predict-yes*H0*5
  23654. -->
  23655. (S1 ^operator O2119 = 0.)
  23656. Firing prefer*rvt*predict-no*H0
  23657. -->
  23658. Firing rl*prefer*rvt*predict-no*H0*6
  23659. -->
  23660. (S1 ^operator O2120 = 0.9999999999999999)
  23661. inner elaboration loop at bottom goal.
  23662. Retracting rl*prefer*rvt*predict-no*H0*6
  23663. -->
  23664. (S1 ^operator O2118 = 0.9999999999999999)
  23665. Retracting rl*prefer*rvt*predict-yes*H0*5
  23666. -->
  23667. (S1 ^operator O2117 = 0.)
  23668. --- END Proposal Phase ---
  23669. --- Decision Phase ---
  23670. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.935252,0.0609947)
  23671. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377468 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  23672. =>WM: (14861: S1 ^operator O2120)
  23673. 1060: O: O2120 (predict-no)
  23674. --- END Decision Phase ---
  23675. --- Application Phase ---
  23676. --- Firing Productions (PE) For State At Depth 1 ---
  23677. --- Inner Elaboration Phase, active level 1 (S1) ---
  23678. Firing apply*operator
  23679. -->
  23680. (I3 ^predict-no N1060 + :O )
  23681. Firing apply*operator*complete
  23682. -->
  23683. (I3 ^predict-no N1059 - :O )
  23684. inner elaboration loop at bottom goal.
  23685. --- Change Working Memory (PE) ---
  23686. =>WM: (14862: I3 ^predict-no N1060)
  23687. <=WM: (14849: N1059 ^status complete)
  23688. <=WM: (14848: I3 ^predict-no N1059)
  23689. --- Firing Productions (IE) For State At Depth 1 ---
  23690. --- Inner Elaboration Phase, active level 1 (S1) ---
  23691. Firing monitor*world
  23692. -->
  23693. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23694. --- Change Working Memory (IE) ---
  23695. --- END Application Phase ---
  23696. --- Output Phase ---
  23697. ENV: Agent did: predict-no for direction U in state State-B
  23698. In State-B moving U
  23699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23700. predict error 0
  23701. dir: dir isU
  23702. --- END Output Phase ---
  23703. |\---- Input Phase ---
  23704. =>WM: (14866: I2 ^dir U)
  23705. =>WM: (14865: I2 ^reward 1)
  23706. =>WM: (14864: I2 ^see 0)
  23707. =>WM: (14863: N1060 ^status complete)
  23708. <=WM: (14852: I2 ^dir U)
  23709. <=WM: (14851: I2 ^reward 1)
  23710. <=WM: (14850: I2 ^see 0)
  23711. =>WM: (14867: I2 ^level-1 R0-root)
  23712. <=WM: (14853: I2 ^level-1 R0-root)
  23713. --- END Input Phase ---
  23714. --- Proposal Phase ---
  23715. --- Inner Elaboration Phase, active level 1 (S1) ---
  23716. Firing elaborate*copy-see-to-output-link
  23717. -->
  23718. (I3 ^see 0 +)
  23719. Firing elaborate*reward*based*on*reward
  23720. -->
  23721. (R1064 ^value 1 +)
  23722. (R1 ^reward R1064 +)
  23723. Firing propose*predict-yes
  23724. -->
  23725. (O2121 ^name predict-yes +)
  23726. (S1 ^operator O2121 +)
  23727. Firing propose*predict-no
  23728. -->
  23729. (O2122 ^name predict-no +)
  23730. (S1 ^operator O2122 +)
  23731. Firing rl*prefer*rvt*predict-no*H0*6
  23732. -->
  23733. (S1 ^operator O2120 = 0.9999999999999999)
  23734. Firing rl*prefer*rvt*predict-yes*H0*5
  23735. -->
  23736. (S1 ^operator O2119 = 0.)
  23737. Firing prefer*rvt*predict-yes*H0
  23738. -->
  23739. Firing prefer*rvt*predict-no*H0
  23740. -->
  23741. Firing elaborate*copy-dir-to-output-link
  23742. -->
  23743. (I3 ^dir U +)
  23744. inner elaboration loop at bottom goal.
  23745. Retracting elaborate*copy-see-to-output-link
  23746. -->
  23747. (I3 ^see 0 +)
  23748. Retracting propose*predict-no
  23749. -->
  23750. (O2120 ^name predict-no +)
  23751. (S1 ^operator O2120 +)
  23752. Retracting propose*predict-yes
  23753. -->
  23754. (O2119 ^name predict-yes +)
  23755. (S1 ^operator O2119 +)
  23756. Retracting elaborate*reward*based*on*reward
  23757. -->
  23758. (R1063 ^value 1 +)
  23759. (R1 ^reward R1063 +)
  23760. Retracting elaborate*copy-dir-to-output-link
  23761. -->
  23762. (I3 ^dir U +)
  23763. Retracting rl*prefer*rvt*predict-no*H0*6
  23764. -->
  23765. (S1 ^operator O2120 = 0.9999999999999999)
  23766. Retracting rl*prefer*rvt*predict-yes*H0*5
  23767. -->
  23768. (S1 ^operator O2119 = 0.)
  23769. =>WM: (14873: S1 ^operator O2122 +)
  23770. =>WM: (14872: S1 ^operator O2121 +)
  23771. =>WM: (14871: O2122 ^name predict-no)
  23772. =>WM: (14870: O2121 ^name predict-yes)
  23773. =>WM: (14869: R1064 ^value 1)
  23774. =>WM: (14868: R1 ^reward R1064)
  23775. <=WM: (14859: S1 ^operator O2119 +)
  23776. <=WM: (14860: S1 ^operator O2120 +)
  23777. <=WM: (14861: S1 ^operator O2120)
  23778. <=WM: (14854: R1 ^reward R1063)
  23779. <=WM: (14857: O2120 ^name predict-no)
  23780. <=WM: (14856: O2119 ^name predict-yes)
  23781. <=WM: (14855: R1063 ^value 1)
  23782. --- Inner Elaboration Phase, active level 1 (S1) ---
  23783. Firing prefer*rvt*predict-yes*H0
  23784. -->
  23785. Firing rl*prefer*rvt*predict-yes*H0*5
  23786. -->
  23787. (S1 ^operator O2121 = 0.)
  23788. Firing prefer*rvt*predict-no*H0
  23789. -->
  23790. Firing rl*prefer*rvt*predict-no*H0*6
  23791. -->
  23792. (S1 ^operator O2122 = 0.9999999999999999)
  23793. inner elaboration loop at bottom goal.
  23794. Retracting rl*prefer*rvt*predict-no*H0*6
  23795. -->
  23796. (S1 ^operator O2120 = 0.9999999999999999)
  23797. Retracting rl*prefer*rvt*predict-yes*H0*5
  23798. -->
  23799. (S1 ^operator O2119 = 0.)
  23800. --- END Proposal Phase ---
  23801. --- Decision Phase ---
  23802. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23803. =>WM: (14874: S1 ^operator O2122)
  23804. 1061: O: O2122 (predict-no)
  23805. --- END Decision Phase ---
  23806. --- Application Phase ---
  23807. --- Firing Productions (PE) For State At Depth 1 ---
  23808. --- Inner Elaboration Phase, active level 1 (S1) ---
  23809. Firing apply*operator
  23810. -->
  23811. (I3 ^predict-no N1061 + :O )
  23812. Firing apply*operator*complete
  23813. -->
  23814. (I3 ^predict-no N1060 - :O )
  23815. inner elaboration loop at bottom goal.
  23816. --- Change Working Memory (PE) ---
  23817. =>WM: (14875: I3 ^predict-no N1061)
  23818. <=WM: (14863: N1060 ^status complete)
  23819. <=WM: (14862: I3 ^predict-no N1060)
  23820. --- Firing Productions (IE) For State At Depth 1 ---
  23821. --- Inner Elaboration Phase, active level 1 (S1) ---
  23822. Firing monitor*world
  23823. -->
  23824. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23825. --- Change Working Memory (IE) ---
  23826. --- END Application Phase ---
  23827. --- Output Phase ---
  23828. ENV: Agent did: predict-no for direction U in state State-B
  23829. In State-B moving U
  23830. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23831. predict error 0
  23832. dir: dir isU
  23833. --- END Output Phase ---
  23834. /--- Input Phase ---
  23835. =>WM: (14879: I2 ^dir U)
  23836. =>WM: (14878: I2 ^reward 1)
  23837. =>WM: (14877: I2 ^see 0)
  23838. =>WM: (14876: N1061 ^status complete)
  23839. <=WM: (14866: I2 ^dir U)
  23840. <=WM: (14865: I2 ^reward 1)
  23841. <=WM: (14864: I2 ^see 0)
  23842. =>WM: (14880: I2 ^level-1 R0-root)
  23843. <=WM: (14867: I2 ^level-1 R0-root)
  23844. --- END Input Phase ---
  23845. --- Proposal Phase ---
  23846. --- Inner Elaboration Phase, active level 1 (S1) ---
  23847. Firing elaborate*copy-see-to-output-link
  23848. -->
  23849. (I3 ^see 0 +)
  23850. Firing elaborate*reward*based*on*reward
  23851. -->
  23852. (R1065 ^value 1 +)
  23853. (R1 ^reward R1065 +)
  23854. Firing propose*predict-yes
  23855. -->
  23856. (O2123 ^name predict-yes +)
  23857. (S1 ^operator O2123 +)
  23858. Firing propose*predict-no
  23859. -->
  23860. (O2124 ^name predict-no +)
  23861. (S1 ^operator O2124 +)
  23862. Firing rl*prefer*rvt*predict-no*H0*6
  23863. -->
  23864. (S1 ^operator O2122 = 0.9999999999999999)
  23865. Firing rl*prefer*rvt*predict-yes*H0*5
  23866. -->
  23867. (S1 ^operator O2121 = 0.)
  23868. Firing prefer*rvt*predict-yes*H0
  23869. -->
  23870. Firing prefer*rvt*predict-no*H0
  23871. -->
  23872. Firing elaborate*copy-dir-to-output-link
  23873. -->
  23874. (I3 ^dir U +)
  23875. inner elaboration loop at bottom goal.
  23876. Retracting elaborate*copy-see-to-output-link
  23877. -->
  23878. (I3 ^see 0 +)
  23879. Retracting propose*predict-no
  23880. -->
  23881. (O2122 ^name predict-no +)
  23882. (S1 ^operator O2122 +)
  23883. Retracting propose*predict-yes
  23884. -->
  23885. (O2121 ^name predict-yes +)
  23886. (S1 ^operator O2121 +)
  23887. Retracting elaborate*reward*based*on*reward
  23888. -->
  23889. (R1064 ^value 1 +)
  23890. (R1 ^reward R1064 +)
  23891. Retracting elaborate*copy-dir-to-output-link
  23892. -->
  23893. (I3 ^dir U +)
  23894. Retracting rl*prefer*rvt*predict-no*H0*6
  23895. -->
  23896. (S1 ^operator O2122 = 0.9999999999999999)
  23897. Retracting rl*prefer*rvt*predict-yes*H0*5
  23898. -->
  23899. (S1 ^operator O2121 = 0.)
  23900. =>WM: (14886: S1 ^operator O2124 +)
  23901. =>WM: (14885: S1 ^operator O2123 +)
  23902. =>WM: (14884: O2124 ^name predict-no)
  23903. =>WM: (14883: O2123 ^name predict-yes)
  23904. =>WM: (14882: R1065 ^value 1)
  23905. =>WM: (14881: R1 ^reward R1065)
  23906. <=WM: (14872: S1 ^operator O2121 +)
  23907. <=WM: (14873: S1 ^operator O2122 +)
  23908. <=WM: (14874: S1 ^operator O2122)
  23909. <=WM: (14868: R1 ^reward R1064)
  23910. <=WM: (14871: O2122 ^name predict-no)
  23911. <=WM: (14870: O2121 ^name predict-yes)
  23912. <=WM: (14869: R1064 ^value 1)
  23913. --- Inner Elaboration Phase, active level 1 (S1) ---
  23914. Firing prefer*rvt*predict-yes*H0
  23915. -->
  23916. Firing rl*prefer*rvt*predict-yes*H0*5
  23917. -->
  23918. (S1 ^operator O2123 = 0.)
  23919. Firing prefer*rvt*predict-no*H0
  23920. -->
  23921. Firing rl*prefer*rvt*predict-no*H0*6
  23922. -->
  23923. (S1 ^operator O2124 = 0.9999999999999999)
  23924. inner elaboration loop at bottom goal.
  23925. Retracting rl*prefer*rvt*predict-no*H0*6
  23926. -->
  23927. (S1 ^operator O2122 = 0.9999999999999999)
  23928. Retracting rl*prefer*rvt*predict-yes*H0*5
  23929. -->
  23930. (S1 ^operator O2121 = 0.)
  23931. --- END Proposal Phase ---
  23932. --- Decision Phase ---
  23933. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23934. =>WM: (14887: S1 ^operator O2124)
  23935. 1062: O: O2124 (predict-no)
  23936. --- END Decision Phase ---
  23937. --- Application Phase ---
  23938. --- Firing Productions (PE) For State At Depth 1 ---
  23939. --- Inner Elaboration Phase, active level 1 (S1) ---
  23940. Firing apply*operator
  23941. -->
  23942. (I3 ^predict-no N1062 + :O )
  23943. Firing apply*operator*complete
  23944. -->
  23945. (I3 ^predict-no N1061 - :O )
  23946. inner elaboration loop at bottom goal.
  23947. --- Change Working Memory (PE) ---
  23948. =>WM: (14888: I3 ^predict-no N1062)
  23949. <=WM: (14876: N1061 ^status complete)
  23950. <=WM: (14875: I3 ^predict-no N1061)
  23951. --- Firing Productions (IE) For State At Depth 1 ---
  23952. --- Inner Elaboration Phase, active level 1 (S1) ---
  23953. Firing monitor*world
  23954. -->
  23955. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23956. --- Change Working Memory (IE) ---
  23957. --- END Application Phase ---
  23958. --- Output Phase ---
  23959. ENV: Agent did: predict-no for direction U in state State-B
  23960. In State-B moving U
  23961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23962. predict error 0
  23963. dir: dir isR
  23964. --- END Output Phase ---
  23965. |\---- Input Phase ---
  23966. =>WM: (14892: I2 ^dir R)
  23967. =>WM: (14891: I2 ^reward 1)
  23968. =>WM: (14890: I2 ^see 0)
  23969. =>WM: (14889: N1062 ^status complete)
  23970. <=WM: (14879: I2 ^dir U)
  23971. <=WM: (14878: I2 ^reward 1)
  23972. <=WM: (14877: I2 ^see 0)
  23973. =>WM: (14893: I2 ^level-1 R0-root)
  23974. <=WM: (14880: I2 ^level-1 R0-root)
  23975. --- END Input Phase ---
  23976. --- Proposal Phase ---
  23977. --- Inner Elaboration Phase, active level 1 (S1) ---
  23978. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  23979. -->
  23980. (S1 ^operator O2123 = 0.1664311307472832)
  23981. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  23982. -->
  23983. (S1 ^operator O2124 = 0.5523799932145873)
  23984. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23985. -->
  23986. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23987. -->
  23988. Firing elaborate*copy-see-to-output-link
  23989. -->
  23990. (I3 ^see 0 +)
  23991. Firing elaborate*reward*based*on*reward
  23992. -->
  23993. (R1066 ^value 1 +)
  23994. (R1 ^reward R1066 +)
  23995. Firing propose*predict-yes
  23996. -->
  23997. (O2125 ^name predict-yes +)
  23998. (S1 ^operator O2125 +)
  23999. Firing propose*predict-no
  24000. -->
  24001. (O2126 ^name predict-no +)
  24002. (S1 ^operator O2126 +)
  24003. Firing rl*prefer*rvt*predict-no*H0*4
  24004. -->
  24005. (S1 ^operator O2124 = 0.4476192614501152)
  24006. Firing rl*prefer*rvt*predict-yes*H0*3
  24007. -->
  24008. (S1 ^operator O2123 = 0.1844106571836858)
  24009. Firing prefer*rvt*predict-yes*H0
  24010. -->
  24011. Firing prefer*rvt*predict-no*H0
  24012. -->
  24013. Firing elaborate*copy-dir-to-output-link
  24014. -->
  24015. (I3 ^dir R +)
  24016. inner elaboration loop at bottom goal.
  24017. Retracting elaborate*copy-see-to-output-link
  24018. -->
  24019. (I3 ^see 0 +)
  24020. Retracting propose*predict-no
  24021. -->
  24022. (O2124 ^name predict-no +)
  24023. (S1 ^operator O2124 +)
  24024. Retracting propose*predict-yes
  24025. -->
  24026. (O2123 ^name predict-yes +)
  24027. (S1 ^operator O2123 +)
  24028. Retracting elaborate*reward*based*on*reward
  24029. -->
  24030. (R1065 ^value 1 +)
  24031. (R1 ^reward R1065 +)
  24032. Retracting elaborate*copy-dir-to-output-link
  24033. -->
  24034. (I3 ^dir U +)
  24035. Retracting rl*prefer*rvt*predict-no*H0*6
  24036. -->
  24037. (S1 ^operator O2124 = 0.9999999999999999)
  24038. Retracting rl*prefer*rvt*predict-yes*H0*5
  24039. -->
  24040. (S1 ^operator O2123 = 0.)
  24041. =>WM: (14900: S1 ^operator O2126 +)
  24042. =>WM: (14899: S1 ^operator O2125 +)
  24043. =>WM: (14898: I3 ^dir R)
  24044. =>WM: (14897: O2126 ^name predict-no)
  24045. =>WM: (14896: O2125 ^name predict-yes)
  24046. =>WM: (14895: R1066 ^value 1)
  24047. =>WM: (14894: R1 ^reward R1066)
  24048. <=WM: (14885: S1 ^operator O2123 +)
  24049. <=WM: (14886: S1 ^operator O2124 +)
  24050. <=WM: (14887: S1 ^operator O2124)
  24051. <=WM: (14858: I3 ^dir U)
  24052. <=WM: (14881: R1 ^reward R1065)
  24053. <=WM: (14884: O2124 ^name predict-no)
  24054. <=WM: (14883: O2123 ^name predict-yes)
  24055. <=WM: (14882: R1065 ^value 1)
  24056. --- Inner Elaboration Phase, active level 1 (S1) ---
  24057. Firing prefer*rvt*predict-yes*H0
  24058. -->
  24059. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  24060. -->
  24061. (S1 ^operator O2125 = 0.1664311307472832)
  24062. Firing rl*prefer*rvt*predict-yes*H0*3
  24063. -->
  24064. (S1 ^operator O2125 = 0.1844106571836858)
  24065. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  24066. -->
  24067. Firing prefer*rvt*predict-no*H0
  24068. -->
  24069. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  24070. -->
  24071. (S1 ^operator O2126 = 0.5523799932145873)
  24072. Firing rl*prefer*rvt*predict-no*H0*4
  24073. -->
  24074. (S1 ^operator O2126 = 0.4476192614501152)
  24075. Firing prefer*rvt*predict-no*H0*4*v1*H1
  24076. -->
  24077. inner elaboration loop at bottom goal.
  24078. Retracting rl*prefer*rvt*predict-no*H0*4
  24079. -->
  24080. (S1 ^operator O2124 = 0.4476192614501152)
  24081. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  24082. -->
  24083. (S1 ^operator O2124 = 0.5523799932145873)
  24084. Retracting rl*prefer*rvt*predict-yes*H0*3
  24085. -->
  24086. (S1 ^operator O2123 = 0.1844106571836858)
  24087. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  24088. -->
  24089. (S1 ^operator O2123 = 0.1664311307472832)
  24090. --- END Proposal Phase ---
  24091. --- Decision Phase ---
  24092. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24093. =>WM: (14901: S1 ^operator O2126)
  24094. 1063: O: O2126 (predict-no)
  24095. --- END Decision Phase ---
  24096. --- Application Phase ---
  24097. --- Firing Productions (PE) For State At Depth 1 ---
  24098. --- Inner Elaboration Phase, active level 1 (S1) ---
  24099. Firing apply*operator
  24100. -->
  24101. (I3 ^predict-no N1063 + :O )
  24102. Firing apply*operator*complete
  24103. -->
  24104. (I3 ^predict-no N1062 - :O )
  24105. inner elaboration loop at bottom goal.
  24106. --- Change Working Memory (PE) ---
  24107. =>WM: (14902: I3 ^predict-no N1063)
  24108. <=WM: (14889: N1062 ^status complete)
  24109. <=WM: (14888: I3 ^predict-no N1062)
  24110. --- Firing Productions (IE) For State At Depth 1 ---
  24111. --- Inner Elaboration Phase, active level 1 (S1) ---
  24112. Firing monitor*world
  24113. -->
  24114. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24115. --- Change Working Memory (IE) ---
  24116. --- END Application Phase ---
  24117. --- Output Phase ---
  24118. ENV: Agent did: predict-no for direction R in state State-B
  24119. In State-B moving R
  24120. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24121. predict error 0
  24122. dir: dir isL
  24123. --- END Output Phase ---
  24124. /|\--- Input Phase ---
  24125. =>WM: (14906: I2 ^dir L)
  24126. =>WM: (14905: I2 ^reward 1)
  24127. =>WM: (14904: I2 ^see 0)
  24128. =>WM: (14903: N1063 ^status complete)
  24129. <=WM: (14892: I2 ^dir R)
  24130. <=WM: (14891: I2 ^reward 1)
  24131. <=WM: (14890: I2 ^see 0)
  24132. =>WM: (14907: I2 ^level-1 R0-root)
  24133. <=WM: (14893: I2 ^level-1 R0-root)
  24134. --- END Input Phase ---
  24135. --- Proposal Phase ---
  24136. --- Inner Elaboration Phase, active level 1 (S1) ---
  24137. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  24138. -->
  24139. (S1 ^operator O2125 = 0.6104607226121822)
  24140. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  24141. -->
  24142. (S1 ^operator O2126 = 0.1063475139796038)
  24143. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24144. -->
  24145. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24146. -->
  24147. Firing elaborate*copy-see-to-output-link
  24148. -->
  24149. (I3 ^see 0 +)
  24150. Firing elaborate*reward*based*on*reward
  24151. -->
  24152. (R1067 ^value 1 +)
  24153. (R1 ^reward R1067 +)
  24154. Firing propose*predict-yes
  24155. -->
  24156. (O2127 ^name predict-yes +)
  24157. (S1 ^operator O2127 +)
  24158. Firing propose*predict-no
  24159. -->
  24160. (O2128 ^name predict-no +)
  24161. (S1 ^operator O2128 +)
  24162. Firing rl*prefer*rvt*predict-no*H0*2
  24163. -->
  24164. (S1 ^operator O2126 = 0.3873355437317227)
  24165. Firing rl*prefer*rvt*predict-yes*H0*1
  24166. -->
  24167. (S1 ^operator O2125 = 0.3895394913454826)
  24168. Firing prefer*rvt*predict-yes*H0
  24169. -->
  24170. Firing prefer*rvt*predict-no*H0
  24171. -->
  24172. Firing elaborate*copy-dir-to-output-link
  24173. -->
  24174. (I3 ^dir L +)
  24175. inner elaboration loop at bottom goal.
  24176. Retracting elaborate*copy-see-to-output-link
  24177. -->
  24178. (I3 ^see 0 +)
  24179. Retracting propose*predict-no
  24180. -->
  24181. (O2126 ^name predict-no +)
  24182. (S1 ^operator O2126 +)
  24183. Retracting propose*predict-yes
  24184. -->
  24185. (O2125 ^name predict-yes +)
  24186. (S1 ^operator O2125 +)
  24187. Retracting elaborate*reward*based*on*reward
  24188. -->
  24189. (R1066 ^value 1 +)
  24190. (R1 ^reward R1066 +)
  24191. Retracting elaborate*copy-dir-to-output-link
  24192. -->
  24193. (I3 ^dir R +)
  24194. Retracting rl*prefer*rvt*predict-no*H0*4
  24195. -->
  24196. (S1 ^operator O2126 = 0.4476192614501152)
  24197. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  24198. -->
  24199. (S1 ^operator O2126 = 0.5523799932145873)
  24200. Retracting rl*prefer*rvt*predict-yes*H0*3
  24201. -->
  24202. (S1 ^operator O2125 = 0.1844106571836858)
  24203. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  24204. -->
  24205. (S1 ^operator O2125 = 0.1664311307472832)
  24206. =>WM: (14914: S1 ^operator O2128 +)
  24207. =>WM: (14913: S1 ^operator O2127 +)
  24208. =>WM: (14912: I3 ^dir L)
  24209. =>WM: (14911: O2128 ^name predict-no)
  24210. =>WM: (14910: O2127 ^name predict-yes)
  24211. =>WM: (14909: R1067 ^value 1)
  24212. =>WM: (14908: R1 ^reward R1067)
  24213. <=WM: (14899: S1 ^operator O2125 +)
  24214. <=WM: (14900: S1 ^operator O2126 +)
  24215. <=WM: (14901: S1 ^operator O2126)
  24216. <=WM: (14898: I3 ^dir R)
  24217. <=WM: (14894: R1 ^reward R1066)
  24218. <=WM: (14897: O2126 ^name predict-no)
  24219. <=WM: (14896: O2125 ^name predict-yes)
  24220. <=WM: (14895: R1066 ^value 1)
  24221. --- Inner Elaboration Phase, active level 1 (S1) ---
  24222. Firing prefer*rvt*predict-yes*H0
  24223. -->
  24224. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  24225. -->
  24226. (S1 ^operator O2127 = 0.6104607226121822)
  24227. Firing rl*prefer*rvt*predict-yes*H0*1
  24228. -->
  24229. (S1 ^operator O2127 = 0.3895394913454826)
  24230. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24231. -->
  24232. Firing prefer*rvt*predict-no*H0
  24233. -->
  24234. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  24235. -->
  24236. (S1 ^operator O2128 = 0.1063475139796038)
  24237. Firing rl*prefer*rvt*predict-no*H0*2
  24238. -->
  24239. (S1 ^operator O2128 = 0.3873355437317227)
  24240. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24241. -->
  24242. inner elaboration loop at bottom goal.
  24243. Retracting rl*prefer*rvt*predict-no*H0*2
  24244. -->
  24245. (S1 ^operator O2126 = 0.3873355437317227)
  24246. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  24247. -->
  24248. (S1 ^operator O2126 = 0.1063475139796038)
  24249. Retracting rl*prefer*rvt*predict-yes*H0*1
  24250. -->
  24251. (S1 ^operator O2125 = 0.3895394913454826)
  24252. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  24253. -->
  24254. (S1 ^operator O2125 = 0.6104607226121822)
  24255. --- END Proposal Phase ---
  24256. --- Decision Phase ---
  24257. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.935714,0.0605858)
  24258. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  24259. =>WM: (14915: S1 ^operator O2127)
  24260. 1064: O: O2127 (predict-yes)
  24261. --- END Decision Phase ---
  24262. --- Application Phase ---
  24263. --- Firing Productions (PE) For State At Depth 1 ---
  24264. --- Inner Elaboration Phase, active level 1 (S1) ---
  24265. Firing apply*operator
  24266. -->
  24267. (I3 ^predict-yes N1064 + :O )
  24268. Firing apply*operator*complete
  24269. -->
  24270. (I3 ^predict-no N1063 - :O )
  24271. inner elaboration loop at bottom goal.
  24272. --- Change Working Memory (PE) ---
  24273. =>WM: (14916: I3 ^predict-yes N1064)
  24274. <=WM: (14903: N1063 ^status complete)
  24275. <=WM: (14902: I3 ^predict-no N1063)
  24276. --- Firing Productions (IE) For State At Depth 1 ---
  24277. --- Inner Elaboration Phase, active level 1 (S1) ---
  24278. Firing monitor*world
  24279. -->
  24280. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24281. --- Change Working Memory (IE) ---
  24282. --- END Application Phase ---
  24283. --- Output Phase ---
  24284. ENV: Agent did: predict-yes for direction L in state State-B
  24285. In State-B moving L
  24286. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  24287. predict error 0
  24288. dir: dir isU
  24289. --- END Output Phase ---
  24290. -/--- Input Phase ---
  24291. =>WM: (14920: I2 ^dir U)
  24292. =>WM: (14919: I2 ^reward 1)
  24293. =>WM: (14918: I2 ^see 1)
  24294. =>WM: (14917: N1064 ^status complete)
  24295. <=WM: (14906: I2 ^dir L)
  24296. <=WM: (14905: I2 ^reward 1)
  24297. <=WM: (14904: I2 ^see 0)
  24298. =>WM: (14921: I2 ^level-1 L1-root)
  24299. <=WM: (14907: I2 ^level-1 R0-root)
  24300. --- END Input Phase ---
  24301. --- Proposal Phase ---
  24302. --- Inner Elaboration Phase, active level 1 (S1) ---
  24303. Firing elaborate*copy-see-to-output-link
  24304. -->
  24305. (I3 ^see 1 +)
  24306. Firing elaborate*reward*based*on*reward
  24307. -->
  24308. (R1068 ^value 1 +)
  24309. (R1 ^reward R1068 +)
  24310. Firing propose*predict-yes
  24311. -->
  24312. (O2129 ^name predict-yes +)
  24313. (S1 ^operator O2129 +)
  24314. Firing propose*predict-no
  24315. -->
  24316. (O2130 ^name predict-no +)
  24317. (S1 ^operator O2130 +)
  24318. Firing rl*prefer*rvt*predict-no*H0*6
  24319. -->
  24320. (S1 ^operator O2128 = 0.9999999999999999)
  24321. Firing rl*prefer*rvt*predict-yes*H0*5
  24322. -->
  24323. (S1 ^operator O2127 = 0.)
  24324. Firing prefer*rvt*predict-yes*H0
  24325. -->
  24326. Firing prefer*rvt*predict-no*H0
  24327. -->
  24328. Firing elaborate*copy-dir-to-output-link
  24329. -->
  24330. (I3 ^dir U +)
  24331. inner elaboration loop at bottom goal.
  24332. Retracting elaborate*copy-see-to-output-link
  24333. -->
  24334. (I3 ^see 0 +)
  24335. Retracting propose*predict-no
  24336. -->
  24337. (O2128 ^name predict-no +)
  24338. (S1 ^operator O2128 +)
  24339. Retracting propose*predict-yes
  24340. -->
  24341. (O2127 ^name predict-yes +)
  24342. (S1 ^operator O2127 +)
  24343. Retracting elaborate*reward*based*on*reward
  24344. -->
  24345. (R1067 ^value 1 +)
  24346. (R1 ^reward R1067 +)
  24347. Retracting elaborate*copy-dir-to-output-link
  24348. -->
  24349. (I3 ^dir L +)
  24350. Retracting rl*prefer*rvt*predict-no*H0*2
  24351. -->
  24352. (S1 ^operator O2128 = 0.3873355437317227)
  24353. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  24354. -->
  24355. (S1 ^operator O2128 = 0.1063475139796038)
  24356. Retracting rl*prefer*rvt*predict-yes*H0*1
  24357. -->
  24358. (S1 ^operator O2127 = 0.3895394913454826)
  24359. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  24360. -->
  24361. (S1 ^operator O2127 = 0.6104607226121822)
  24362. =>WM: (14929: S1 ^operator O2130 +)
  24363. =>WM: (14928: S1 ^operator O2129 +)
  24364. =>WM: (14927: I3 ^dir U)
  24365. =>WM: (14926: O2130 ^name predict-no)
  24366. =>WM: (14925: O2129 ^name predict-yes)
  24367. =>WM: (14924: R1068 ^value 1)
  24368. =>WM: (14923: R1 ^reward R1068)
  24369. =>WM: (14922: I3 ^see 1)
  24370. <=WM: (14913: S1 ^operator O2127 +)
  24371. <=WM: (14915: S1 ^operator O2127)
  24372. <=WM: (14914: S1 ^operator O2128 +)
  24373. <=WM: (14912: I3 ^dir L)
  24374. <=WM: (14908: R1 ^reward R1067)
  24375. <=WM: (14826: I3 ^see 0)
  24376. <=WM: (14911: O2128 ^name predict-no)
  24377. <=WM: (14910: O2127 ^name predict-yes)
  24378. <=WM: (14909: R1067 ^value 1)
  24379. --- Inner Elaboration Phase, active level 1 (S1) ---
  24380. Firing prefer*rvt*predict-yes*H0
  24381. -->
  24382. Firing rl*prefer*rvt*predict-yes*H0*5
  24383. -->
  24384. (S1 ^operator O2129 = 0.)
  24385. Firing prefer*rvt*predict-no*H0
  24386. -->
  24387. Firing rl*prefer*rvt*predict-no*H0*6
  24388. -->
  24389. (S1 ^operator O2130 = 0.9999999999999999)
  24390. inner elaboration loop at bottom goal.
  24391. Retracting rl*prefer*rvt*predict-no*H0*6
  24392. -->
  24393. (S1 ^operator O2128 = 0.9999999999999999)
  24394. Retracting rl*prefer*rvt*predict-yes*H0*5
  24395. -->
  24396. (S1 ^operator O2127 = 0.)
  24397. --- END Proposal Phase ---
  24398. --- Decision Phase ---
  24399. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.389539(R,m,v=1,0.897727,0.0923377)
  24400. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  24401. =>WM: (14930: S1 ^operator O2130)
  24402. 1065: O: O2130 (predict-no)
  24403. --- END Decision Phase ---
  24404. --- Application Phase ---
  24405. --- Firing Productions (PE) For State At Depth 1 ---
  24406. --- Inner Elaboration Phase, active level 1 (S1) ---
  24407. Firing apply*operator
  24408. -->
  24409. (I3 ^predict-no N1065 + :O )
  24410. Firing apply*operator*complete
  24411. -->
  24412. (I3 ^predict-yes N1064 - :O )
  24413. inner elaboration loop at bottom goal.
  24414. --- Change Working Memory (PE) ---
  24415. =>WM: (14931: I3 ^predict-no N1065)
  24416. <=WM: (14917: N1064 ^status complete)
  24417. <=WM: (14916: I3 ^predict-yes N1064)
  24418. --- Firing Productions (IE) For State At Depth 1 ---
  24419. --- Inner Elaboration Phase, active level 1 (S1) ---
  24420. Firing monitor*world
  24421. -->
  24422. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24423. --- Change Working Memory (IE) ---
  24424. --- END Application Phase ---
  24425. --- Output Phase ---
  24426. ENV: Agent did: predict-no for direction U in state State-A
  24427. In State-A moving U
  24428. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24429. predict error 0
  24430. dir: dir isU
  24431. --- END Output Phase ---
  24432. |\--- Input Phase ---
  24433. =>WM: (14935: I2 ^dir U)
  24434. =>WM: (14934: I2 ^reward 1)
  24435. =>WM: (14933: I2 ^see 0)
  24436. =>WM: (14932: N1065 ^status complete)
  24437. <=WM: (14920: I2 ^dir U)
  24438. <=WM: (14919: I2 ^reward 1)
  24439. <=WM: (14918: I2 ^see 1)
  24440. =>WM: (14936: I2 ^level-1 L1-root)
  24441. <=WM: (14921: I2 ^level-1 L1-root)
  24442. --- END Input Phase ---
  24443. --- Proposal Phase ---
  24444. --- Inner Elaboration Phase, active level 1 (S1) ---
  24445. Firing elaborate*copy-see-to-output-link
  24446. -->
  24447. (I3 ^see 0 +)
  24448. Firing elaborate*reward*based*on*reward
  24449. -->
  24450. (R1069 ^value 1 +)
  24451. (R1 ^reward R1069 +)
  24452. Firing propose*predict-yes
  24453. -->
  24454. (O2131 ^name predict-yes +)
  24455. (S1 ^operator O2131 +)
  24456. Firing propose*predict-no
  24457. -->
  24458. (O2132 ^name predict-no +)
  24459. (S1 ^operator O2132 +)
  24460. Firing rl*prefer*rvt*predict-no*H0*6
  24461. -->
  24462. (S1 ^operator O2130 = 0.9999999999999999)
  24463. Firing rl*prefer*rvt*predict-yes*H0*5
  24464. -->
  24465. (S1 ^operator O2129 = 0.)
  24466. Firing prefer*rvt*predict-yes*H0
  24467. -->
  24468. Firing prefer*rvt*predict-no*H0
  24469. -->
  24470. Firing elaborate*copy-dir-to-output-link
  24471. -->
  24472. (I3 ^dir U +)
  24473. inner elaboration loop at bottom goal.
  24474. Retracting elaborate*copy-see-to-output-link
  24475. -->
  24476. (I3 ^see 1 +)
  24477. Retracting propose*predict-no
  24478. -->
  24479. (O2130 ^name predict-no +)
  24480. (S1 ^operator O2130 +)
  24481. Retracting propose*predict-yes
  24482. -->
  24483. (O2129 ^name predict-yes +)
  24484. (S1 ^operator O2129 +)
  24485. Retracting elaborate*reward*based*on*reward
  24486. -->
  24487. (R1068 ^value 1 +)
  24488. (R1 ^reward R1068 +)
  24489. Retracting elaborate*copy-dir-to-output-link
  24490. -->
  24491. (I3 ^dir U +)
  24492. Retracting rl*prefer*rvt*predict-no*H0*6
  24493. -->
  24494. (S1 ^operator O2130 = 0.9999999999999999)
  24495. Retracting rl*prefer*rvt*predict-yes*H0*5
  24496. -->
  24497. (S1 ^operator O2129 = 0.)
  24498. =>WM: (14943: S1 ^operator O2132 +)
  24499. =>WM: (14942: S1 ^operator O2131 +)
  24500. =>WM: (14941: O2132 ^name predict-no)
  24501. =>WM: (14940: O2131 ^name predict-yes)
  24502. =>WM: (14939: R1069 ^value 1)
  24503. =>WM: (14938: R1 ^reward R1069)
  24504. =>WM: (14937: I3 ^see 0)
  24505. <=WM: (14928: S1 ^operator O2129 +)
  24506. <=WM: (14929: S1 ^operator O2130 +)
  24507. <=WM: (14930: S1 ^operator O2130)
  24508. <=WM: (14923: R1 ^reward R1068)
  24509. <=WM: (14922: I3 ^see 1)
  24510. <=WM: (14926: O2130 ^name predict-no)
  24511. <=WM: (14925: O2129 ^name predict-yes)
  24512. <=WM: (14924: R1068 ^value 1)
  24513. --- Inner Elaboration Phase, active level 1 (S1) ---
  24514. Firing prefer*rvt*predict-yes*H0
  24515. -->
  24516. Firing rl*prefer*rvt*predict-yes*H0*5
  24517. -->
  24518. (S1 ^operator O2131 = 0.)
  24519. Firing prefer*rvt*predict-no*H0
  24520. -->
  24521. Firing rl*prefer*rvt*predict-no*H0*6
  24522. -->
  24523. (S1 ^operator O2132 = 0.9999999999999999)
  24524. inner elaboration loop at bottom goal.
  24525. Retracting rl*prefer*rvt*predict-no*H0*6
  24526. -->
  24527. (S1 ^operator O2130 = 0.9999999999999999)
  24528. Retracting rl*prefer*rvt*predict-yes*H0*5
  24529. -->
  24530. (S1 ^operator O2129 = 0.)
  24531. --- END Proposal Phase ---
  24532. --- Decision Phase ---
  24533. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24534. =>WM: (14944: S1 ^operator O2132)
  24535. 1066: O: O2132 (predict-no)
  24536. --- END Decision Phase ---
  24537. --- Application Phase ---
  24538. --- Firing Productions (PE) For State At Depth 1 ---
  24539. --- Inner Elaboration Phase, active level 1 (S1) ---
  24540. Firing apply*operator
  24541. -->
  24542. (I3 ^predict-no N1066 + :O )
  24543. Firing apply*operator*complete
  24544. -->
  24545. (I3 ^predict-no N1065 - :O )
  24546. inner elaboration loop at bottom goal.
  24547. --- Change Working Memory (PE) ---
  24548. =>WM: (14945: I3 ^predict-no N1066)
  24549. <=WM: (14932: N1065 ^status complete)
  24550. <=WM: (14931: I3 ^predict-no N1065)
  24551. --- Firing Productions (IE) For State At Depth 1 ---
  24552. --- Inner Elaboration Phase, active level 1 (S1) ---
  24553. Firing monitor*world
  24554. -->
  24555. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24556. --- Change Working Memory (IE) ---
  24557. --- END Application Phase ---
  24558. --- Output Phase ---
  24559. ENV: Agent did: predict-no for direction U in state State-A
  24560. In State-A moving U
  24561. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24562. predict error 0
  24563. dir: dir isU
  24564. --- END Output Phase ---
  24565. -/--- Input Phase ---
  24566. =>WM: (14949: I2 ^dir U)
  24567. =>WM: (14948: I2 ^reward 1)
  24568. =>WM: (14947: I2 ^see 0)
  24569. =>WM: (14946: N1066 ^status complete)
  24570. <=WM: (14935: I2 ^dir U)
  24571. <=WM: (14934: I2 ^reward 1)
  24572. <=WM: (14933: I2 ^see 0)
  24573. =>WM: (14950: I2 ^level-1 L1-root)
  24574. <=WM: (14936: I2 ^level-1 L1-root)
  24575. --- END Input Phase ---
  24576. --- Proposal Phase ---
  24577. --- Inner Elaboration Phase, active level 1 (S1) ---
  24578. Firing elaborate*copy-see-to-output-link
  24579. -->
  24580. (I3 ^see 0 +)
  24581. Firing elaborate*reward*based*on*reward
  24582. -->
  24583. (R1070 ^value 1 +)
  24584. (R1 ^reward R1070 +)
  24585. Firing propose*predict-yes
  24586. -->
  24587. (O2133 ^name predict-yes +)
  24588. (S1 ^operator O2133 +)
  24589. Firing propose*predict-no
  24590. -->
  24591. (O2134 ^name predict-no +)
  24592. (S1 ^operator O2134 +)
  24593. Firing rl*prefer*rvt*predict-no*H0*6
  24594. -->
  24595. (S1 ^operator O2132 = 0.9999999999999999)
  24596. Firing rl*prefer*rvt*predict-yes*H0*5
  24597. -->
  24598. (S1 ^operator O2131 = 0.)
  24599. Firing prefer*rvt*predict-yes*H0
  24600. -->
  24601. Firing prefer*rvt*predict-no*H0
  24602. -->
  24603. Firing elaborate*copy-dir-to-output-link
  24604. -->
  24605. (I3 ^dir U +)
  24606. inner elaboration loop at bottom goal.
  24607. Retracting elaborate*copy-see-to-output-link
  24608. -->
  24609. (I3 ^see 0 +)
  24610. Retracting propose*predict-no
  24611. -->
  24612. (O2132 ^name predict-no +)
  24613. (S1 ^operator O2132 +)
  24614. Retracting propose*predict-yes
  24615. -->
  24616. (O2131 ^name predict-yes +)
  24617. (S1 ^operator O2131 +)
  24618. Retracting elaborate*reward*based*on*reward
  24619. -->
  24620. (R1069 ^value 1 +)
  24621. (R1 ^reward R1069 +)
  24622. Retracting elaborate*copy-dir-to-output-link
  24623. -->
  24624. (I3 ^dir U +)
  24625. Retracting rl*prefer*rvt*predict-no*H0*6
  24626. -->
  24627. (S1 ^operator O2132 = 0.9999999999999999)
  24628. Retracting rl*prefer*rvt*predict-yes*H0*5
  24629. -->
  24630. (S1 ^operator O2131 = 0.)
  24631. =>WM: (14956: S1 ^operator O2134 +)
  24632. =>WM: (14955: S1 ^operator O2133 +)
  24633. =>WM: (14954: O2134 ^name predict-no)
  24634. =>WM: (14953: O2133 ^name predict-yes)
  24635. =>WM: (14952: R1070 ^value 1)
  24636. =>WM: (14951: R1 ^reward R1070)
  24637. <=WM: (14942: S1 ^operator O2131 +)
  24638. <=WM: (14943: S1 ^operator O2132 +)
  24639. <=WM: (14944: S1 ^operator O2132)
  24640. <=WM: (14938: R1 ^reward R1069)
  24641. <=WM: (14941: O2132 ^name predict-no)
  24642. <=WM: (14940: O2131 ^name predict-yes)
  24643. <=WM: (14939: R1069 ^value 1)
  24644. --- Inner Elaboration Phase, active level 1 (S1) ---
  24645. Firing prefer*rvt*predict-yes*H0
  24646. -->
  24647. Firing rl*prefer*rvt*predict-yes*H0*5
  24648. -->
  24649. (S1 ^operator O2133 = 0.)
  24650. Firing prefer*rvt*predict-no*H0
  24651. -->
  24652. Firing rl*prefer*rvt*predict-no*H0*6
  24653. -->
  24654. (S1 ^operator O2134 = 0.9999999999999999)
  24655. inner elaboration loop at bottom goal.
  24656. Retracting rl*prefer*rvt*predict-no*H0*6
  24657. -->
  24658. (S1 ^operator O2132 = 0.9999999999999999)
  24659. Retracting rl*prefer*rvt*predict-yes*H0*5
  24660. -->
  24661. (S1 ^operator O2131 = 0.)
  24662. --- END Proposal Phase ---
  24663. --- Decision Phase ---
  24664. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24665. =>WM: (14957: S1 ^operator O2134)
  24666. 1067: O: O2134 (predict-no)
  24667. --- END Decision Phase ---
  24668. --- Application Phase ---
  24669. --- Firing Productions (PE) For State At Depth 1 ---
  24670. --- Inner Elaboration Phase, active level 1 (S1) ---
  24671. Firing apply*operator
  24672. -->
  24673. (I3 ^predict-no N1067 + :O )
  24674. Firing apply*operator*complete
  24675. -->
  24676. (I3 ^predict-no N1066 - :O )
  24677. inner elaboration loop at bottom goal.
  24678. --- Change Working Memory (PE) ---
  24679. =>WM: (14958: I3 ^predict-no N1067)
  24680. <=WM: (14946: N1066 ^status complete)
  24681. <=WM: (14945: I3 ^predict-no N1066)
  24682. --- Firing Productions (IE) For State At Depth 1 ---
  24683. --- Inner Elaboration Phase, active level 1 (S1) ---
  24684. Firing monitor*world
  24685. -->
  24686. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24687. --- Change Working Memory (IE) ---
  24688. --- END Application Phase ---
  24689. --- Output Phase ---
  24690. ENV: Agent did: predict-no for direction U in state State-A
  24691. In State-A moving U
  24692. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24693. predict error 0
  24694. dir: dir isL
  24695. --- END Output Phase ---
  24696. |\---- Input Phase ---
  24697. =>WM: (14962: I2 ^dir L)
  24698. =>WM: (14961: I2 ^reward 1)
  24699. =>WM: (14960: I2 ^see 0)
  24700. =>WM: (14959: N1067 ^status complete)
  24701. <=WM: (14949: I2 ^dir U)
  24702. <=WM: (14948: I2 ^reward 1)
  24703. <=WM: (14947: I2 ^see 0)
  24704. =>WM: (14963: I2 ^level-1 L1-root)
  24705. <=WM: (14950: I2 ^level-1 L1-root)
  24706. --- END Input Phase ---
  24707. --- Proposal Phase ---
  24708. --- Inner Elaboration Phase, active level 1 (S1) ---
  24709. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  24710. -->
  24711. (S1 ^operator O2134 = 0.6126635874172339)
  24712. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  24713. -->
  24714. (S1 ^operator O2133 = -0.02274740735326741)
  24715. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24716. -->
  24717. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24718. -->
  24719. Firing elaborate*copy-see-to-output-link
  24720. -->
  24721. (I3 ^see 0 +)
  24722. Firing elaborate*reward*based*on*reward
  24723. -->
  24724. (R1071 ^value 1 +)
  24725. (R1 ^reward R1071 +)
  24726. Firing propose*predict-yes
  24727. -->
  24728. (O2135 ^name predict-yes +)
  24729. (S1 ^operator O2135 +)
  24730. Firing propose*predict-no
  24731. -->
  24732. (O2136 ^name predict-no +)
  24733. (S1 ^operator O2136 +)
  24734. Firing rl*prefer*rvt*predict-no*H0*2
  24735. -->
  24736. (S1 ^operator O2134 = 0.3873355437317227)
  24737. Firing rl*prefer*rvt*predict-yes*H0*1
  24738. -->
  24739. (S1 ^operator O2133 = 0.3895394592518329)
  24740. Firing prefer*rvt*predict-yes*H0
  24741. -->
  24742. Firing prefer*rvt*predict-no*H0
  24743. -->
  24744. Firing elaborate*copy-dir-to-output-link
  24745. -->
  24746. (I3 ^dir L +)
  24747. inner elaboration loop at bottom goal.
  24748. Retracting elaborate*copy-see-to-output-link
  24749. -->
  24750. (I3 ^see 0 +)
  24751. Retracting propose*predict-no
  24752. -->
  24753. (O2134 ^name predict-no +)
  24754. (S1 ^operator O2134 +)
  24755. Retracting propose*predict-yes
  24756. -->
  24757. (O2133 ^name predict-yes +)
  24758. (S1 ^operator O2133 +)
  24759. Retracting elaborate*reward*based*on*reward
  24760. -->
  24761. (R1070 ^value 1 +)
  24762. (R1 ^reward R1070 +)
  24763. Retracting elaborate*copy-dir-to-output-link
  24764. -->
  24765. (I3 ^dir U +)
  24766. Retracting rl*prefer*rvt*predict-no*H0*6
  24767. -->
  24768. (S1 ^operator O2134 = 0.9999999999999999)
  24769. Retracting rl*prefer*rvt*predict-yes*H0*5
  24770. -->
  24771. (S1 ^operator O2133 = 0.)
  24772. =>WM: (14970: S1 ^operator O2136 +)
  24773. =>WM: (14969: S1 ^operator O2135 +)
  24774. =>WM: (14968: I3 ^dir L)
  24775. =>WM: (14967: O2136 ^name predict-no)
  24776. =>WM: (14966: O2135 ^name predict-yes)
  24777. =>WM: (14965: R1071 ^value 1)
  24778. =>WM: (14964: R1 ^reward R1071)
  24779. <=WM: (14955: S1 ^operator O2133 +)
  24780. <=WM: (14956: S1 ^operator O2134 +)
  24781. <=WM: (14957: S1 ^operator O2134)
  24782. <=WM: (14927: I3 ^dir U)
  24783. <=WM: (14951: R1 ^reward R1070)
  24784. <=WM: (14954: O2134 ^name predict-no)
  24785. <=WM: (14953: O2133 ^name predict-yes)
  24786. <=WM: (14952: R1070 ^value 1)
  24787. --- Inner Elaboration Phase, active level 1 (S1) ---
  24788. Firing prefer*rvt*predict-yes*H0
  24789. -->
  24790. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  24791. -->
  24792. (S1 ^operator O2135 = -0.02274740735326741)
  24793. Firing rl*prefer*rvt*predict-yes*H0*1
  24794. -->
  24795. (S1 ^operator O2135 = 0.3895394592518329)
  24796. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24797. -->
  24798. Firing prefer*rvt*predict-no*H0
  24799. -->
  24800. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  24801. -->
  24802. (S1 ^operator O2136 = 0.6126635874172339)
  24803. Firing rl*prefer*rvt*predict-no*H0*2
  24804. -->
  24805. (S1 ^operator O2136 = 0.3873355437317227)
  24806. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24807. -->
  24808. inner elaboration loop at bottom goal.
  24809. Retracting rl*prefer*rvt*predict-no*H0*2
  24810. -->
  24811. (S1 ^operator O2134 = 0.3873355437317227)
  24812. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  24813. -->
  24814. (S1 ^operator O2134 = 0.6126635874172339)
  24815. Retracting rl*prefer*rvt*predict-yes*H0*1
  24816. -->
  24817. (S1 ^operator O2133 = 0.3895394592518329)
  24818. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  24819. -->
  24820. (S1 ^operator O2133 = -0.02274740735326741)
  24821. --- END Proposal Phase ---
  24822. --- Decision Phase ---
  24823. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24824. =>WM: (14971: S1 ^operator O2136)
  24825. 1068: O: O2136 (predict-no)
  24826. --- END Decision Phase ---
  24827. --- Application Phase ---
  24828. --- Firing Productions (PE) For State At Depth 1 ---
  24829. --- Inner Elaboration Phase, active level 1 (S1) ---
  24830. Firing apply*operator
  24831. -->
  24832. (I3 ^predict-no N1068 + :O )
  24833. Firing apply*operator*complete
  24834. -->
  24835. (I3 ^predict-no N1067 - :O )
  24836. inner elaboration loop at bottom goal.
  24837. --- Change Working Memory (PE) ---
  24838. =>WM: (14972: I3 ^predict-no N1068)
  24839. <=WM: (14959: N1067 ^status complete)
  24840. <=WM: (14958: I3 ^predict-no N1067)
  24841. --- Firing Productions (IE) For State At Depth 1 ---
  24842. --- Inner Elaboration Phase, active level 1 (S1) ---
  24843. Firing monitor*world
  24844. -->
  24845. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24846. --- Change Working Memory (IE) ---
  24847. --- END Application Phase ---
  24848. --- Output Phase ---
  24849. ENV: Agent did: predict-no for direction L in state State-A
  24850. In State-A moving L
  24851. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24852. predict error 0
  24853. dir: dir isL
  24854. --- END Output Phase ---
  24855. --- Input Phase ---
  24856. =>WM: (14976: I2 ^dir L)
  24857. =>WM: (14975: I2 ^reward 1)
  24858. =>WM: (14974: I2 ^see 0)
  24859. =>WM: (14973: N1068 ^status complete)
  24860. <=WM: (14962: I2 ^dir L)
  24861. <=WM: (14961: I2 ^reward 1)
  24862. <=WM: (14960: I2 ^see 0)
  24863. =>WM: (14977: I2 ^level-1 L0-root)
  24864. <=WM: (14963: I2 ^level-1 L1-root)
  24865. --- END Input Phase ---
  24866. --- Proposal Phase ---
  24867. --- Inner Elaboration Phase, active level 1 (S1) ---
  24868. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  24869. -->
  24870. (S1 ^operator O2135 = 0.1599599085218832)
  24871. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  24872. -->
  24873. (S1 ^operator O2136 = 0.612665734378294)
  24874. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24875. -->
  24876. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24877. -->
  24878. Firing elaborate*copy-see-to-output-link
  24879. -->
  24880. (I3 ^see 0 +)
  24881. Firing elaborate*reward*based*on*reward
  24882. -->
  24883. (R1072 ^value 1 +)
  24884. (R1 ^reward R1072 +)
  24885. Firing propose*predict-yes
  24886. -->
  24887. (O2137 ^name predict-yes +)
  24888. (S1 ^operator O2137 +)
  24889. Firing propose*predict-no
  24890. -->
  24891. (O2138 ^name predict-no +)
  24892. (S1 ^operator O2138 +)
  24893. Firing rl*prefer*rvt*predict-no*H0*2
  24894. -->
  24895. (S1 ^operator O2136 = 0.3873355437317227)
  24896. Firing rl*prefer*rvt*predict-yes*H0*1
  24897. -->
  24898. (S1 ^operator O2135 = 0.3895394592518329)
  24899. Firing prefer*rvt*predict-yes*H0
  24900. -->
  24901. Firing prefer*rvt*predict-no*H0
  24902. -->
  24903. Firing elaborate*copy-dir-to-output-link
  24904. -->
  24905. (I3 ^dir L +)
  24906. inner elaboration loop at bottom goal.
  24907. Retracting elaborate*copy-see-to-output-link
  24908. -->
  24909. (I3 ^see 0 +)
  24910. Retracting propose*predict-no
  24911. -->
  24912. (O2136 ^name predict-no +)
  24913. (S1 ^operator O2136 +)
  24914. Retracting propose*predict-yes
  24915. -->
  24916. (O2135 ^name predict-yes +)
  24917. (S1 ^operator O2135 +)
  24918. Retracting elaborate*reward*based*on*reward
  24919. -->
  24920. (R1071 ^value 1 +)
  24921. (R1 ^reward R1071 +)
  24922. Retracting elaborate*copy-dir-to-output-link
  24923. -->
  24924. (I3 ^dir L +)
  24925. Retracting rl*prefer*rvt*predict-no*H0*2
  24926. -->
  24927. (S1 ^operator O2136 = 0.3873355437317227)
  24928. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  24929. -->
  24930. (S1 ^operator O2136 = 0.6126635874172339)
  24931. Retracting rl*prefer*rvt*predict-yes*H0*1
  24932. -->
  24933. (S1 ^operator O2135 = 0.3895394592518329)
  24934. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  24935. -->
  24936. (S1 ^operator O2135 = -0.02274740735326741)
  24937. =>WM: (14983: S1 ^operator O2138 +)
  24938. =>WM: (14982: S1 ^operator O2137 +)
  24939. =>WM: (14981: O2138 ^name predict-no)
  24940. =>WM: (14980: O2137 ^name predict-yes)
  24941. =>WM: (14979: R1072 ^value 1)
  24942. =>WM: (14978: R1 ^reward R1072)
  24943. <=WM: (14969: S1 ^operator O2135 +)
  24944. <=WM: (14970: S1 ^operator O2136 +)
  24945. <=WM: (14971: S1 ^operator O2136)
  24946. <=WM: (14964: R1 ^reward R1071)
  24947. <=WM: (14967: O2136 ^name predict-no)
  24948. <=WM: (14966: O2135 ^name predict-yes)
  24949. <=WM: (14965: R1071 ^value 1)
  24950. --- Inner Elaboration Phase, active level 1 (S1) ---
  24951. Firing prefer*rvt*predict-yes*H0
  24952. -->
  24953. Firing rl*prefer*rvt*predict-yes*H0*1
  24954. -->
  24955. (S1 ^operator O2137 = 0.3895394592518329)
  24956. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  24957. -->
  24958. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  24959. -->
  24960. (S1 ^operator O2137 = 0.1599599085218832)
  24961. Firing prefer*rvt*predict-no*H0
  24962. -->
  24963. Firing rl*prefer*rvt*predict-no*H0*2
  24964. -->
  24965. (S1 ^operator O2138 = 0.3873355437317227)
  24966. Firing prefer*rvt*predict-no*H0*2*v1*H1
  24967. -->
  24968. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  24969. -->
  24970. (S1 ^operator O2138 = 0.612665734378294)
  24971. inner elaboration loop at bottom goal.
  24972. Retracting rl*prefer*rvt*predict-no*H0*2
  24973. -->
  24974. (S1 ^operator O2136 = 0.3873355437317227)
  24975. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  24976. -->
  24977. (S1 ^operator O2136 = 0.612665734378294)
  24978. Retracting rl*prefer*rvt*predict-yes*H0*1
  24979. -->
  24980. (S1 ^operator O2135 = 0.3895394592518329)
  24981. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  24982. -->
  24983. (S1 ^operator O2135 = 0.1599599085218832)
  24984. --- END Proposal Phase ---
  24985. --- Decision Phase ---
  24986. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.93617,0.0600751)
  24987. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.28092 0.331744 0.612664 -> 0.28092 0.331744 0.612664(R,m,v=1,1,0)
  24988. =>WM: (14984: S1 ^operator O2138)
  24989. 1069: O: O2138 (predict-no)
  24990. --- END Decision Phase ---
  24991. --- Application Phase ---
  24992. --- Firing Productions (PE) For State At Depth 1 ---
  24993. --- Inner Elaboration Phase, active level 1 (S1) ---
  24994. Firing apply*operator
  24995. -->
  24996. (I3 ^predict-no N1069 + :O )
  24997. Firing apply*operator*complete
  24998. -->
  24999. (I3 ^predict-no N1068 - :O )
  25000. inner elaboration loop at bottom goal.
  25001. --- Change Working Memory (PE) ---
  25002. =>WM: (14985: I3 ^predict-no N1069)
  25003. <=WM: (14973: N1068 ^status complete)
  25004. <=WM: (14972: I3 ^predict-no N1068)
  25005. --- Firing Productions (IE) For State At Depth 1 ---
  25006. --- Inner Elaboration Phase, active level 1 (S1) ---
  25007. Firing monitor*world
  25008. -->
  25009. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25010. --- Change Working Memory (IE) ---
  25011. --- END Application Phase ---
  25012. --- Output Phase ---
  25013. ENV: Agent did: predict-no for direction L in state State-A
  25014. In State-A moving L
  25015. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25016. predict error 0
  25017. dir: dir isR
  25018. --- END Output Phase ---
  25019. --- Input Phase ---
  25020. =>WM: (14989: I2 ^dir R)
  25021. =>WM: (14988: I2 ^reward 1)
  25022. =>WM: (14987: I2 ^see 0)
  25023. =>WM: (14986: N1069 ^status complete)
  25024. <=WM: (14976: I2 ^dir L)
  25025. <=WM: (14975: I2 ^reward 1)
  25026. <=WM: (14974: I2 ^see 0)
  25027. =>WM: (14990: I2 ^level-1 L0-root)
  25028. <=WM: (14977: I2 ^level-1 L0-root)
  25029. --- END Input Phase ---
  25030. --- Proposal Phase ---
  25031. --- Inner Elaboration Phase, active level 1 (S1) ---
  25032. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25033. -->
  25034. (S1 ^operator O2137 = 0.8155904055662546)
  25035. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25036. -->
  25037. (S1 ^operator O2138 = -0.00558448899823713)
  25038. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25039. -->
  25040. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25041. -->
  25042. Firing elaborate*copy-see-to-output-link
  25043. -->
  25044. (I3 ^see 0 +)
  25045. Firing elaborate*reward*based*on*reward
  25046. -->
  25047. (R1073 ^value 1 +)
  25048. (R1 ^reward R1073 +)
  25049. Firing propose*predict-yes
  25050. -->
  25051. (O2139 ^name predict-yes +)
  25052. (S1 ^operator O2139 +)
  25053. Firing propose*predict-no
  25054. -->
  25055. (O2140 ^name predict-no +)
  25056. (S1 ^operator O2140 +)
  25057. Firing rl*prefer*rvt*predict-no*H0*4
  25058. -->
  25059. (S1 ^operator O2138 = 0.4476193732504098)
  25060. Firing rl*prefer*rvt*predict-yes*H0*3
  25061. -->
  25062. (S1 ^operator O2137 = 0.1844106571836858)
  25063. Firing prefer*rvt*predict-yes*H0
  25064. -->
  25065. Firing prefer*rvt*predict-no*H0
  25066. -->
  25067. Firing elaborate*copy-dir-to-output-link
  25068. -->
  25069. (I3 ^dir R +)
  25070. inner elaboration loop at bottom goal.
  25071. Retracting elaborate*copy-see-to-output-link
  25072. -->
  25073. (I3 ^see 0 +)
  25074. Retracting propose*predict-no
  25075. -->
  25076. (O2138 ^name predict-no +)
  25077. (S1 ^operator O2138 +)
  25078. Retracting propose*predict-yes
  25079. -->
  25080. (O2137 ^name predict-yes +)
  25081. (S1 ^operator O2137 +)
  25082. Retracting elaborate*reward*based*on*reward
  25083. -->
  25084. (R1072 ^value 1 +)
  25085. (R1 ^reward R1072 +)
  25086. Retracting elaborate*copy-dir-to-output-link
  25087. -->
  25088. (I3 ^dir L +)
  25089. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  25090. -->
  25091. (S1 ^operator O2138 = 0.612665734378294)
  25092. Retracting rl*prefer*rvt*predict-no*H0*2
  25093. -->
  25094. (S1 ^operator O2138 = 0.3873356740593792)
  25095. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  25096. -->
  25097. (S1 ^operator O2137 = 0.1599599085218832)
  25098. Retracting rl*prefer*rvt*predict-yes*H0*1
  25099. -->
  25100. (S1 ^operator O2137 = 0.3895394592518329)
  25101. =>WM: (14997: S1 ^operator O2140 +)
  25102. =>WM: (14996: S1 ^operator O2139 +)
  25103. =>WM: (14995: I3 ^dir R)
  25104. =>WM: (14994: O2140 ^name predict-no)
  25105. =>WM: (14993: O2139 ^name predict-yes)
  25106. =>WM: (14992: R1073 ^value 1)
  25107. =>WM: (14991: R1 ^reward R1073)
  25108. <=WM: (14982: S1 ^operator O2137 +)
  25109. <=WM: (14983: S1 ^operator O2138 +)
  25110. <=WM: (14984: S1 ^operator O2138)
  25111. <=WM: (14968: I3 ^dir L)
  25112. <=WM: (14978: R1 ^reward R1072)
  25113. <=WM: (14981: O2138 ^name predict-no)
  25114. <=WM: (14980: O2137 ^name predict-yes)
  25115. <=WM: (14979: R1072 ^value 1)
  25116. --- Inner Elaboration Phase, active level 1 (S1) ---
  25117. Firing prefer*rvt*predict-yes*H0
  25118. -->
  25119. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25120. -->
  25121. (S1 ^operator O2139 = 0.8155904055662546)
  25122. Firing rl*prefer*rvt*predict-yes*H0*3
  25123. -->
  25124. (S1 ^operator O2139 = 0.1844106571836858)
  25125. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25126. -->
  25127. Firing prefer*rvt*predict-no*H0
  25128. -->
  25129. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25130. -->
  25131. (S1 ^operator O2140 = -0.00558448899823713)
  25132. Firing rl*prefer*rvt*predict-no*H0*4
  25133. -->
  25134. (S1 ^operator O2140 = 0.4476193732504098)
  25135. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25136. -->
  25137. inner elaboration loop at bottom goal.
  25138. Retracting rl*prefer*rvt*predict-no*H0*4
  25139. -->
  25140. (S1 ^operator O2138 = 0.4476193732504098)
  25141. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25142. -->
  25143. (S1 ^operator O2138 = -0.00558448899823713)
  25144. Retracting rl*prefer*rvt*predict-yes*H0*3
  25145. -->
  25146. (S1 ^operator O2137 = 0.1844106571836858)
  25147. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25148. -->
  25149. (S1 ^operator O2137 = 0.8155904055662546)
  25150. --- END Proposal Phase ---
  25151. --- Decision Phase ---
  25152. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387335(R,m,v=1,0.936508,0.0597771)
  25153. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280923 0.331743 0.612666 -> 0.280923 0.331743 0.612666(R,m,v=1,1,0)
  25154. =>WM: (14998: S1 ^operator O2139)
  25155. 1070: O: O2139 (predict-yes)
  25156. --- END Decision Phase ---
  25157. --- Application Phase ---
  25158. --- Firing Productions (PE) For State At Depth 1 ---
  25159. --- Inner Elaboration Phase, active level 1 (S1) ---
  25160. Firing apply*operator
  25161. -->
  25162. (I3 ^predict-yes N1070 + :O )
  25163. Firing apply*operator*complete
  25164. -->
  25165. (I3 ^predict-no N1069 - :O )
  25166. inner elaboration loop at bottom goal.
  25167. --- Change Working Memory (PE) ---
  25168. =>WM: (14999: I3 ^predict-yes N1070)
  25169. <=WM: (14986: N1069 ^status complete)
  25170. <=WM: (14985: I3 ^predict-no N1069)
  25171. --- Firing Productions (IE) For State At Depth 1 ---
  25172. --- Inner Elaboration Phase, active level 1 (S1) ---
  25173. Firing monitor*world
  25174. -->
  25175. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25176. --- Change Working Memory (IE) ---
  25177. --- END Application Phase ---
  25178. --- Output Phase ---
  25179. ENV: Agent did: predict-yes for direction R in state State-A
  25180. In State-A moving R
  25181. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25182. predict error 0
  25183. dir: dir isL
  25184. --- END Output Phase ---
  25185. --- Input Phase ---
  25186. =>WM: (15003: I2 ^dir L)
  25187. =>WM: (15002: I2 ^reward 1)
  25188. =>WM: (15001: I2 ^see 1)
  25189. =>WM: (15000: N1070 ^status complete)
  25190. <=WM: (14989: I2 ^dir R)
  25191. <=WM: (14988: I2 ^reward 1)
  25192. <=WM: (14987: I2 ^see 0)
  25193. =>WM: (15004: I2 ^level-1 R1-root)
  25194. <=WM: (14990: I2 ^level-1 L0-root)
  25195. --- END Input Phase ---
  25196. --- Proposal Phase ---
  25197. --- Inner Elaboration Phase, active level 1 (S1) ---
  25198. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25199. -->
  25200. (S1 ^operator O2139 = 0.6104598832926351)
  25201. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25202. -->
  25203. (S1 ^operator O2140 = 0.2714993082286609)
  25204. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25205. -->
  25206. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25207. -->
  25208. Firing elaborate*copy-see-to-output-link
  25209. -->
  25210. (I3 ^see 1 +)
  25211. Firing elaborate*reward*based*on*reward
  25212. -->
  25213. (R1074 ^value 1 +)
  25214. (R1 ^reward R1074 +)
  25215. Firing propose*predict-yes
  25216. -->
  25217. (O2141 ^name predict-yes +)
  25218. (S1 ^operator O2141 +)
  25219. Firing propose*predict-no
  25220. -->
  25221. (O2142 ^name predict-no +)
  25222. (S1 ^operator O2142 +)
  25223. Firing rl*prefer*rvt*predict-no*H0*2
  25224. -->
  25225. (S1 ^operator O2140 = 0.3873354627937282)
  25226. Firing rl*prefer*rvt*predict-yes*H0*1
  25227. -->
  25228. (S1 ^operator O2139 = 0.3895394592518329)
  25229. Firing prefer*rvt*predict-yes*H0
  25230. -->
  25231. Firing prefer*rvt*predict-no*H0
  25232. -->
  25233. Firing elaborate*copy-dir-to-output-link
  25234. -->
  25235. (I3 ^dir L +)
  25236. inner elaboration loop at bottom goal.
  25237. Retracting elaborate*copy-see-to-output-link
  25238. -->
  25239. (I3 ^see 0 +)
  25240. Retracting propose*predict-no
  25241. -->
  25242. (O2140 ^name predict-no +)
  25243. (S1 ^operator O2140 +)
  25244. Retracting propose*predict-yes
  25245. -->
  25246. (O2139 ^name predict-yes +)
  25247. (S1 ^operator O2139 +)
  25248. Retracting elaborate*reward*based*on*reward
  25249. -->
  25250. (R1073 ^value 1 +)
  25251. (R1 ^reward R1073 +)
  25252. Retracting elaborate*copy-dir-to-output-link
  25253. -->
  25254. (I3 ^dir R +)
  25255. Retracting rl*prefer*rvt*predict-no*H0*4
  25256. -->
  25257. (S1 ^operator O2140 = 0.4476193732504098)
  25258. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25259. -->
  25260. (S1 ^operator O2140 = -0.00558448899823713)
  25261. Retracting rl*prefer*rvt*predict-yes*H0*3
  25262. -->
  25263. (S1 ^operator O2139 = 0.1844106571836858)
  25264. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25265. -->
  25266. (S1 ^operator O2139 = 0.8155904055662546)
  25267. =>WM: (15012: S1 ^operator O2142 +)
  25268. =>WM: (15011: S1 ^operator O2141 +)
  25269. =>WM: (15010: I3 ^dir L)
  25270. =>WM: (15009: O2142 ^name predict-no)
  25271. =>WM: (15008: O2141 ^name predict-yes)
  25272. =>WM: (15007: R1074 ^value 1)
  25273. =>WM: (15006: R1 ^reward R1074)
  25274. =>WM: (15005: I3 ^see 1)
  25275. <=WM: (14996: S1 ^operator O2139 +)
  25276. <=WM: (14998: S1 ^operator O2139)
  25277. <=WM: (14997: S1 ^operator O2140 +)
  25278. <=WM: (14995: I3 ^dir R)
  25279. <=WM: (14991: R1 ^reward R1073)
  25280. <=WM: (14937: I3 ^see 0)
  25281. <=WM: (14994: O2140 ^name predict-no)
  25282. <=WM: (14993: O2139 ^name predict-yes)
  25283. <=WM: (14992: R1073 ^value 1)
  25284. --- Inner Elaboration Phase, active level 1 (S1) ---
  25285. Firing prefer*rvt*predict-yes*H0
  25286. -->
  25287. Firing rl*prefer*rvt*predict-yes*H0*1
  25288. -->
  25289. (S1 ^operator O2141 = 0.3895394592518329)
  25290. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25291. -->
  25292. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25293. -->
  25294. (S1 ^operator O2141 = 0.6104598832926351)
  25295. Firing prefer*rvt*predict-no*H0
  25296. -->
  25297. Firing rl*prefer*rvt*predict-no*H0*2
  25298. -->
  25299. (S1 ^operator O2142 = 0.3873354627937282)
  25300. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25301. -->
  25302. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25303. -->
  25304. (S1 ^operator O2142 = 0.2714993082286609)
  25305. inner elaboration loop at bottom goal.
  25306. Retracting rl*prefer*rvt*predict-no*H0*2
  25307. -->
  25308. (S1 ^operator O2140 = 0.3873354627937282)
  25309. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25310. -->
  25311. (S1 ^operator O2140 = 0.2714993082286609)
  25312. Retracting rl*prefer*rvt*predict-yes*H0*1
  25313. -->
  25314. (S1 ^operator O2139 = 0.3895394592518329)
  25315. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25316. -->
  25317. (S1 ^operator O2139 = 0.6104598832926351)
  25318. --- END Proposal Phase ---
  25319. --- Decision Phase ---
  25320. RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491002 0.184411 -> 0.675413 -0.491002 0.18441(R,m,v=1,0.905028,0.0864353)
  25321. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324588 0.491002 0.81559 -> 0.324588 0.491002 0.81559(R,m,v=1,1,0)
  25322. =>WM: (15013: S1 ^operator O2141)
  25323. 1071: O: O2141 (predict-yes)
  25324. --- END Decision Phase ---
  25325. --- Application Phase ---
  25326. --- Firing Productions (PE) For State At Depth 1 ---
  25327. --- Inner Elaboration Phase, active level 1 (S1) ---
  25328. Firing apply*operator
  25329. -->
  25330. (I3 ^predict-yes N1071 + :O )
  25331. Firing apply*operator*complete
  25332. -->
  25333. (I3 ^predict-yes N1070 - :O )
  25334. inner elaboration loop at bottom goal.
  25335. --- Change Working Memory (PE) ---
  25336. =>WM: (15014: I3 ^predict-yes N1071)
  25337. <=WM: (15000: N1070 ^status complete)
  25338. <=WM: (14999: I3 ^predict-yes N1070)
  25339. --- Firing Productions (IE) For State At Depth 1 ---
  25340. --- Inner Elaboration Phase, active level 1 (S1) ---
  25341. Firing monitor*world
  25342. -->
  25343. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25344. --- Change Working Memory (IE) ---
  25345. --- END Application Phase ---
  25346. --- Output Phase ---
  25347. ENV: Agent did: predict-yes for direction L in state State-B
  25348. In State-B moving L
  25349. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25350. predict error 0
  25351. dir: dir isR
  25352. --- END Output Phase ---
  25353. --- Input Phase ---
  25354. =>WM: (15018: I2 ^dir R)
  25355. =>WM: (15017: I2 ^reward 1)
  25356. =>WM: (15016: I2 ^see 1)
  25357. =>WM: (15015: N1071 ^status complete)
  25358. <=WM: (15003: I2 ^dir L)
  25359. <=WM: (15002: I2 ^reward 1)
  25360. <=WM: (15001: I2 ^see 1)
  25361. =>WM: (15019: I2 ^level-1 L1-root)
  25362. <=WM: (15004: I2 ^level-1 R1-root)
  25363. --- END Input Phase ---
  25364. --- Proposal Phase ---
  25365. --- Inner Elaboration Phase, active level 1 (S1) ---
  25366. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25367. -->
  25368. (S1 ^operator O2142 = -0.02155734064455064)
  25369. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25370. -->
  25371. (S1 ^operator O2141 = 0.8155841587866866)
  25372. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25373. -->
  25374. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25375. -->
  25376. Firing elaborate*copy-see-to-output-link
  25377. -->
  25378. (I3 ^see 1 +)
  25379. Firing elaborate*reward*based*on*reward
  25380. -->
  25381. (R1075 ^value 1 +)
  25382. (R1 ^reward R1075 +)
  25383. Firing propose*predict-yes
  25384. -->
  25385. (O2143 ^name predict-yes +)
  25386. (S1 ^operator O2143 +)
  25387. Firing propose*predict-no
  25388. -->
  25389. (O2144 ^name predict-no +)
  25390. (S1 ^operator O2144 +)
  25391. Firing rl*prefer*rvt*predict-no*H0*4
  25392. -->
  25393. (S1 ^operator O2142 = 0.4476193732504098)
  25394. Firing rl*prefer*rvt*predict-yes*H0*3
  25395. -->
  25396. (S1 ^operator O2141 = 0.1844104977711947)
  25397. Firing prefer*rvt*predict-yes*H0
  25398. -->
  25399. Firing prefer*rvt*predict-no*H0
  25400. -->
  25401. Firing elaborate*copy-dir-to-output-link
  25402. -->
  25403. (I3 ^dir R +)
  25404. inner elaboration loop at bottom goal.
  25405. Retracting elaborate*copy-see-to-output-link
  25406. -->
  25407. (I3 ^see 1 +)
  25408. Retracting propose*predict-no
  25409. -->
  25410. (O2142 ^name predict-no +)
  25411. (S1 ^operator O2142 +)
  25412. Retracting propose*predict-yes
  25413. -->
  25414. (O2141 ^name predict-yes +)
  25415. (S1 ^operator O2141 +)
  25416. Retracting elaborate*reward*based*on*reward
  25417. -->
  25418. (R1074 ^value 1 +)
  25419. (R1 ^reward R1074 +)
  25420. Retracting elaborate*copy-dir-to-output-link
  25421. -->
  25422. (I3 ^dir L +)
  25423. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25424. -->
  25425. (S1 ^operator O2142 = 0.2714993082286609)
  25426. Retracting rl*prefer*rvt*predict-no*H0*2
  25427. -->
  25428. (S1 ^operator O2142 = 0.3873354627937282)
  25429. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25430. -->
  25431. (S1 ^operator O2141 = 0.6104598832926351)
  25432. Retracting rl*prefer*rvt*predict-yes*H0*1
  25433. -->
  25434. (S1 ^operator O2141 = 0.3895394592518329)
  25435. =>WM: (15026: S1 ^operator O2144 +)
  25436. =>WM: (15025: S1 ^operator O2143 +)
  25437. =>WM: (15024: I3 ^dir R)
  25438. =>WM: (15023: O2144 ^name predict-no)
  25439. =>WM: (15022: O2143 ^name predict-yes)
  25440. =>WM: (15021: R1075 ^value 1)
  25441. =>WM: (15020: R1 ^reward R1075)
  25442. <=WM: (15011: S1 ^operator O2141 +)
  25443. <=WM: (15013: S1 ^operator O2141)
  25444. <=WM: (15012: S1 ^operator O2142 +)
  25445. <=WM: (15010: I3 ^dir L)
  25446. <=WM: (15006: R1 ^reward R1074)
  25447. <=WM: (15009: O2142 ^name predict-no)
  25448. <=WM: (15008: O2141 ^name predict-yes)
  25449. <=WM: (15007: R1074 ^value 1)
  25450. --- Inner Elaboration Phase, active level 1 (S1) ---
  25451. Firing prefer*rvt*predict-yes*H0
  25452. -->
  25453. Firing rl*prefer*rvt*predict-yes*H0*3
  25454. -->
  25455. (S1 ^operator O2143 = 0.1844104977711947)
  25456. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25457. -->
  25458. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25459. -->
  25460. (S1 ^operator O2143 = 0.8155841587866866)
  25461. Firing prefer*rvt*predict-no*H0
  25462. -->
  25463. Firing rl*prefer*rvt*predict-no*H0*4
  25464. -->
  25465. (S1 ^operator O2144 = 0.4476193732504098)
  25466. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25467. -->
  25468. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25469. -->
  25470. (S1 ^operator O2144 = -0.02155734064455064)
  25471. inner elaboration loop at bottom goal.
  25472. Retracting rl*prefer*rvt*predict-no*H0*4
  25473. -->
  25474. (S1 ^operator O2142 = 0.4476193732504098)
  25475. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25476. -->
  25477. (S1 ^operator O2142 = -0.02155734064455064)
  25478. Retracting rl*prefer*rvt*predict-yes*H0*3
  25479. -->
  25480. (S1 ^operator O2141 = 0.1844104977711947)
  25481. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25482. -->
  25483. (S1 ^operator O2141 = 0.8155841587866866)
  25484. --- END Proposal Phase ---
  25485. --- Decision Phase ---
  25486. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.389539 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.898305,0.0918721)
  25487. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  25488. =>WM: (15027: S1 ^operator O2143)
  25489. 1072: O: O2143 (predict-yes)
  25490. --- END Decision Phase ---
  25491. --- Application Phase ---
  25492. --- Firing Productions (PE) For State At Depth 1 ---
  25493. --- Inner Elaboration Phase, active level 1 (S1) ---
  25494. Firing apply*operator
  25495. -->
  25496. (I3 ^predict-yes N1072 + :O )
  25497. Firing apply*operator*complete
  25498. -->
  25499. (I3 ^predict-yes N1071 - :O )
  25500. inner elaboration loop at bottom goal.
  25501. --- Change Working Memory (PE) ---
  25502. =>WM: (15028: I3 ^predict-yes N1072)
  25503. <=WM: (15015: N1071 ^status complete)
  25504. <=WM: (15014: I3 ^predict-yes N1071)
  25505. --- Firing Productions (IE) For State At Depth 1 ---
  25506. --- Inner Elaboration Phase, active level 1 (S1) ---
  25507. Firing monitor*world
  25508. -->
  25509. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25510. --- Change Working Memory (IE) ---
  25511. --- END Application Phase ---
  25512. --- Output Phase ---
  25513. ENV: Agent did: predict-yes for direction R in state State-A
  25514. In State-A moving R
  25515. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25516. predict error 0
  25517. dir: dir isL
  25518. --- END Output Phase ---
  25519. --- Input Phase ---
  25520. =>WM: (15032: I2 ^dir L)
  25521. =>WM: (15031: I2 ^reward 1)
  25522. =>WM: (15030: I2 ^see 1)
  25523. =>WM: (15029: N1072 ^status complete)
  25524. <=WM: (15018: I2 ^dir R)
  25525. <=WM: (15017: I2 ^reward 1)
  25526. <=WM: (15016: I2 ^see 1)
  25527. =>WM: (15033: I2 ^level-1 R1-root)
  25528. <=WM: (15019: I2 ^level-1 L1-root)
  25529. --- END Input Phase ---
  25530. --- Proposal Phase ---
  25531. --- Inner Elaboration Phase, active level 1 (S1) ---
  25532. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25533. -->
  25534. (S1 ^operator O2143 = 0.6104599819109648)
  25535. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25536. -->
  25537. (S1 ^operator O2144 = 0.2714993082286609)
  25538. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25539. -->
  25540. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25541. -->
  25542. Firing elaborate*copy-see-to-output-link
  25543. -->
  25544. (I3 ^see 1 +)
  25545. Firing elaborate*reward*based*on*reward
  25546. -->
  25547. (R1076 ^value 1 +)
  25548. (R1 ^reward R1076 +)
  25549. Firing propose*predict-yes
  25550. -->
  25551. (O2145 ^name predict-yes +)
  25552. (S1 ^operator O2145 +)
  25553. Firing propose*predict-no
  25554. -->
  25555. (O2146 ^name predict-no +)
  25556. (S1 ^operator O2146 +)
  25557. Firing rl*prefer*rvt*predict-no*H0*2
  25558. -->
  25559. (S1 ^operator O2144 = 0.3873354627937282)
  25560. Firing rl*prefer*rvt*predict-yes*H0*1
  25561. -->
  25562. (S1 ^operator O2143 = 0.3895395578701628)
  25563. Firing prefer*rvt*predict-yes*H0
  25564. -->
  25565. Firing prefer*rvt*predict-no*H0
  25566. -->
  25567. Firing elaborate*copy-dir-to-output-link
  25568. -->
  25569. (I3 ^dir L +)
  25570. inner elaboration loop at bottom goal.
  25571. Retracting elaborate*copy-see-to-output-link
  25572. -->
  25573. (I3 ^see 1 +)
  25574. Retracting propose*predict-no
  25575. -->
  25576. (O2144 ^name predict-no +)
  25577. (S1 ^operator O2144 +)
  25578. Retracting propose*predict-yes
  25579. -->
  25580. (O2143 ^name predict-yes +)
  25581. (S1 ^operator O2143 +)
  25582. Retracting elaborate*reward*based*on*reward
  25583. -->
  25584. (R1075 ^value 1 +)
  25585. (R1 ^reward R1075 +)
  25586. Retracting elaborate*copy-dir-to-output-link
  25587. -->
  25588. (I3 ^dir R +)
  25589. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25590. -->
  25591. (S1 ^operator O2144 = -0.02155734064455064)
  25592. Retracting rl*prefer*rvt*predict-no*H0*4
  25593. -->
  25594. (S1 ^operator O2144 = 0.4476193732504098)
  25595. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25596. -->
  25597. (S1 ^operator O2143 = 0.8155841587866866)
  25598. Retracting rl*prefer*rvt*predict-yes*H0*3
  25599. -->
  25600. (S1 ^operator O2143 = 0.1844104977711947)
  25601. =>WM: (15040: S1 ^operator O2146 +)
  25602. =>WM: (15039: S1 ^operator O2145 +)
  25603. =>WM: (15038: I3 ^dir L)
  25604. =>WM: (15037: O2146 ^name predict-no)
  25605. =>WM: (15036: O2145 ^name predict-yes)
  25606. =>WM: (15035: R1076 ^value 1)
  25607. =>WM: (15034: R1 ^reward R1076)
  25608. <=WM: (15025: S1 ^operator O2143 +)
  25609. <=WM: (15027: S1 ^operator O2143)
  25610. <=WM: (15026: S1 ^operator O2144 +)
  25611. <=WM: (15024: I3 ^dir R)
  25612. <=WM: (15020: R1 ^reward R1075)
  25613. <=WM: (15023: O2144 ^name predict-no)
  25614. <=WM: (15022: O2143 ^name predict-yes)
  25615. <=WM: (15021: R1075 ^value 1)
  25616. --- Inner Elaboration Phase, active level 1 (S1) ---
  25617. Firing prefer*rvt*predict-yes*H0
  25618. -->
  25619. Firing rl*prefer*rvt*predict-yes*H0*1
  25620. -->
  25621. (S1 ^operator O2145 = 0.3895395578701628)
  25622. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  25623. -->
  25624. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25625. -->
  25626. (S1 ^operator O2145 = 0.6104599819109648)
  25627. Firing prefer*rvt*predict-no*H0
  25628. -->
  25629. Firing rl*prefer*rvt*predict-no*H0*2
  25630. -->
  25631. (S1 ^operator O2146 = 0.3873354627937282)
  25632. Firing prefer*rvt*predict-no*H0*2*v1*H1
  25633. -->
  25634. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25635. -->
  25636. (S1 ^operator O2146 = 0.2714993082286609)
  25637. inner elaboration loop at bottom goal.
  25638. Retracting rl*prefer*rvt*predict-no*H0*2
  25639. -->
  25640. (S1 ^operator O2144 = 0.3873354627937282)
  25641. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25642. -->
  25643. (S1 ^operator O2144 = 0.2714993082286609)
  25644. Retracting rl*prefer*rvt*predict-yes*H0*1
  25645. -->
  25646. (S1 ^operator O2143 = 0.3895395578701628)
  25647. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25648. -->
  25649. (S1 ^operator O2143 = 0.6104599819109648)
  25650. --- END Proposal Phase ---
  25651. --- Decision Phase ---
  25652. RL update rl*prefer*rvt*predict-yes*H0*3 0.675413 -0.491002 0.18441 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.905556,0.0860025)
  25653. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.32458 0.491004 0.815584 -> 0.324581 0.491004 0.815585(R,m,v=1,1,0)
  25654. =>WM: (15041: S1 ^operator O2145)
  25655. 1073: O: O2145 (predict-yes)
  25656. --- END Decision Phase ---
  25657. --- Application Phase ---
  25658. --- Firing Productions (PE) For State At Depth 1 ---
  25659. --- Inner Elaboration Phase, active level 1 (S1) ---
  25660. Firing apply*operator
  25661. -->
  25662. (I3 ^predict-yes N1073 + :O )
  25663. Firing apply*operator*complete
  25664. -->
  25665. (I3 ^predict-yes N1072 - :O )
  25666. inner elaboration loop at bottom goal.
  25667. --- Change Working Memory (PE) ---
  25668. =>WM: (15042: I3 ^predict-yes N1073)
  25669. <=WM: (15029: N1072 ^status complete)
  25670. <=WM: (15028: I3 ^predict-yes N1072)
  25671. --- Firing Productions (IE) For State At Depth 1 ---
  25672. --- Inner Elaboration Phase, active level 1 (S1) ---
  25673. Firing monitor*world
  25674. -->
  25675. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25676. --- Change Working Memory (IE) ---
  25677. --- END Application Phase ---
  25678. --- Output Phase ---
  25679. ENV: Agent did: predict-yes for direction L in state State-B
  25680. In State-B moving L
  25681. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25682. predict error 0
  25683. dir: dir isR
  25684. --- END Output Phase ---
  25685. \-/--- Input Phase ---
  25686. =>WM: (15046: I2 ^dir R)
  25687. =>WM: (15045: I2 ^reward 1)
  25688. =>WM: (15044: I2 ^see 1)
  25689. =>WM: (15043: N1073 ^status complete)
  25690. <=WM: (15032: I2 ^dir L)
  25691. <=WM: (15031: I2 ^reward 1)
  25692. <=WM: (15030: I2 ^see 1)
  25693. =>WM: (15047: I2 ^level-1 L1-root)
  25694. <=WM: (15033: I2 ^level-1 R1-root)
  25695. --- END Input Phase ---
  25696. --- Proposal Phase ---
  25697. --- Inner Elaboration Phase, active level 1 (S1) ---
  25698. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25699. -->
  25700. (S1 ^operator O2146 = -0.02155734064455064)
  25701. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25702. -->
  25703. (S1 ^operator O2145 = 0.8155849603030043)
  25704. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25705. -->
  25706. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25707. -->
  25708. Firing elaborate*copy-see-to-output-link
  25709. -->
  25710. (I3 ^see 1 +)
  25711. Firing elaborate*reward*based*on*reward
  25712. -->
  25713. (R1077 ^value 1 +)
  25714. (R1 ^reward R1077 +)
  25715. Firing propose*predict-yes
  25716. -->
  25717. (O2147 ^name predict-yes +)
  25718. (S1 ^operator O2147 +)
  25719. Firing propose*predict-no
  25720. -->
  25721. (O2148 ^name predict-no +)
  25722. (S1 ^operator O2148 +)
  25723. Firing rl*prefer*rvt*predict-no*H0*4
  25724. -->
  25725. (S1 ^operator O2146 = 0.4476193732504098)
  25726. Firing rl*prefer*rvt*predict-yes*H0*3
  25727. -->
  25728. (S1 ^operator O2145 = 0.1844112992875125)
  25729. Firing prefer*rvt*predict-yes*H0
  25730. -->
  25731. Firing prefer*rvt*predict-no*H0
  25732. -->
  25733. Firing elaborate*copy-dir-to-output-link
  25734. -->
  25735. (I3 ^dir R +)
  25736. inner elaboration loop at bottom goal.
  25737. Retracting elaborate*copy-see-to-output-link
  25738. -->
  25739. (I3 ^see 1 +)
  25740. Retracting propose*predict-no
  25741. -->
  25742. (O2146 ^name predict-no +)
  25743. (S1 ^operator O2146 +)
  25744. Retracting propose*predict-yes
  25745. -->
  25746. (O2145 ^name predict-yes +)
  25747. (S1 ^operator O2145 +)
  25748. Retracting elaborate*reward*based*on*reward
  25749. -->
  25750. (R1076 ^value 1 +)
  25751. (R1 ^reward R1076 +)
  25752. Retracting elaborate*copy-dir-to-output-link
  25753. -->
  25754. (I3 ^dir L +)
  25755. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  25756. -->
  25757. (S1 ^operator O2146 = 0.2714993082286609)
  25758. Retracting rl*prefer*rvt*predict-no*H0*2
  25759. -->
  25760. (S1 ^operator O2146 = 0.3873354627937282)
  25761. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  25762. -->
  25763. (S1 ^operator O2145 = 0.6104599819109648)
  25764. Retracting rl*prefer*rvt*predict-yes*H0*1
  25765. -->
  25766. (S1 ^operator O2145 = 0.3895395578701628)
  25767. =>WM: (15054: S1 ^operator O2148 +)
  25768. =>WM: (15053: S1 ^operator O2147 +)
  25769. =>WM: (15052: I3 ^dir R)
  25770. =>WM: (15051: O2148 ^name predict-no)
  25771. =>WM: (15050: O2147 ^name predict-yes)
  25772. =>WM: (15049: R1077 ^value 1)
  25773. =>WM: (15048: R1 ^reward R1077)
  25774. <=WM: (15039: S1 ^operator O2145 +)
  25775. <=WM: (15041: S1 ^operator O2145)
  25776. <=WM: (15040: S1 ^operator O2146 +)
  25777. <=WM: (15038: I3 ^dir L)
  25778. <=WM: (15034: R1 ^reward R1076)
  25779. <=WM: (15037: O2146 ^name predict-no)
  25780. <=WM: (15036: O2145 ^name predict-yes)
  25781. <=WM: (15035: R1076 ^value 1)
  25782. --- Inner Elaboration Phase, active level 1 (S1) ---
  25783. Firing prefer*rvt*predict-yes*H0
  25784. -->
  25785. Firing rl*prefer*rvt*predict-yes*H0*3
  25786. -->
  25787. (S1 ^operator O2147 = 0.1844112992875125)
  25788. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25789. -->
  25790. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25791. -->
  25792. (S1 ^operator O2147 = 0.8155849603030043)
  25793. Firing prefer*rvt*predict-no*H0
  25794. -->
  25795. Firing rl*prefer*rvt*predict-no*H0*4
  25796. -->
  25797. (S1 ^operator O2148 = 0.4476193732504098)
  25798. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25799. -->
  25800. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25801. -->
  25802. (S1 ^operator O2148 = -0.02155734064455064)
  25803. inner elaboration loop at bottom goal.
  25804. Retracting rl*prefer*rvt*predict-no*H0*4
  25805. -->
  25806. (S1 ^operator O2146 = 0.4476193732504098)
  25807. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25808. -->
  25809. (S1 ^operator O2146 = -0.02155734064455064)
  25810. Retracting rl*prefer*rvt*predict-yes*H0*3
  25811. -->
  25812. (S1 ^operator O2145 = 0.1844112992875125)
  25813. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25814. -->
  25815. (S1 ^operator O2145 = 0.8155849603030043)
  25816. --- END Proposal Phase ---
  25817. --- Decision Phase ---
  25818. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.898876,0.0914112)
  25819. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  25820. =>WM: (15055: S1 ^operator O2147)
  25821. 1074: O: O2147 (predict-yes)
  25822. --- END Decision Phase ---
  25823. --- Application Phase ---
  25824. --- Firing Productions (PE) For State At Depth 1 ---
  25825. --- Inner Elaboration Phase, active level 1 (S1) ---
  25826. Firing apply*operator
  25827. -->
  25828. (I3 ^predict-yes N1074 + :O )
  25829. Firing apply*operator*complete
  25830. -->
  25831. (I3 ^predict-yes N1073 - :O )
  25832. inner elaboration loop at bottom goal.
  25833. --- Change Working Memory (PE) ---
  25834. =>WM: (15056: I3 ^predict-yes N1074)
  25835. <=WM: (15043: N1073 ^status complete)
  25836. <=WM: (15042: I3 ^predict-yes N1073)
  25837. --- Firing Productions (IE) For State At Depth 1 ---
  25838. --- Inner Elaboration Phase, active level 1 (S1) ---
  25839. Firing monitor*world
  25840. -->
  25841. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25842. --- Change Working Memory (IE) ---
  25843. --- END Application Phase ---
  25844. --- Output Phase ---
  25845. ENV: Agent did: predict-yes for direction R in state State-A
  25846. In State-A moving R
  25847. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25848. predict error 0
  25849. dir: dir isR
  25850. --- END Output Phase ---
  25851. |\---- Input Phase ---
  25852. =>WM: (15060: I2 ^dir R)
  25853. =>WM: (15059: I2 ^reward 1)
  25854. =>WM: (15058: I2 ^see 1)
  25855. =>WM: (15057: N1074 ^status complete)
  25856. <=WM: (15046: I2 ^dir R)
  25857. <=WM: (15045: I2 ^reward 1)
  25858. <=WM: (15044: I2 ^see 1)
  25859. =>WM: (15061: I2 ^level-1 R1-root)
  25860. <=WM: (15047: I2 ^level-1 L1-root)
  25861. --- END Input Phase ---
  25862. --- Proposal Phase ---
  25863. --- Inner Elaboration Phase, active level 1 (S1) ---
  25864. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  25865. -->
  25866. (S1 ^operator O2147 = 0.1398795999120246)
  25867. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  25868. -->
  25869. (S1 ^operator O2148 = 0.5523810978783164)
  25870. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25871. -->
  25872. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25873. -->
  25874. Firing elaborate*copy-see-to-output-link
  25875. -->
  25876. (I3 ^see 1 +)
  25877. Firing elaborate*reward*based*on*reward
  25878. -->
  25879. (R1078 ^value 1 +)
  25880. (R1 ^reward R1078 +)
  25881. Firing propose*predict-yes
  25882. -->
  25883. (O2149 ^name predict-yes +)
  25884. (S1 ^operator O2149 +)
  25885. Firing propose*predict-no
  25886. -->
  25887. (O2150 ^name predict-no +)
  25888. (S1 ^operator O2150 +)
  25889. Firing rl*prefer*rvt*predict-no*H0*4
  25890. -->
  25891. (S1 ^operator O2148 = 0.4476193732504098)
  25892. Firing rl*prefer*rvt*predict-yes*H0*3
  25893. -->
  25894. (S1 ^operator O2147 = 0.1844112992875125)
  25895. Firing prefer*rvt*predict-yes*H0
  25896. -->
  25897. Firing prefer*rvt*predict-no*H0
  25898. -->
  25899. Firing elaborate*copy-dir-to-output-link
  25900. -->
  25901. (I3 ^dir R +)
  25902. inner elaboration loop at bottom goal.
  25903. Retracting elaborate*copy-see-to-output-link
  25904. -->
  25905. (I3 ^see 1 +)
  25906. Retracting propose*predict-no
  25907. -->
  25908. (O2148 ^name predict-no +)
  25909. (S1 ^operator O2148 +)
  25910. Retracting propose*predict-yes
  25911. -->
  25912. (O2147 ^name predict-yes +)
  25913. (S1 ^operator O2147 +)
  25914. Retracting elaborate*reward*based*on*reward
  25915. -->
  25916. (R1077 ^value 1 +)
  25917. (R1 ^reward R1077 +)
  25918. Retracting elaborate*copy-dir-to-output-link
  25919. -->
  25920. (I3 ^dir R +)
  25921. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  25922. -->
  25923. (S1 ^operator O2148 = -0.02155734064455064)
  25924. Retracting rl*prefer*rvt*predict-no*H0*4
  25925. -->
  25926. (S1 ^operator O2148 = 0.4476193732504098)
  25927. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  25928. -->
  25929. (S1 ^operator O2147 = 0.8155849603030043)
  25930. Retracting rl*prefer*rvt*predict-yes*H0*3
  25931. -->
  25932. (S1 ^operator O2147 = 0.1844112992875125)
  25933. =>WM: (15067: S1 ^operator O2150 +)
  25934. =>WM: (15066: S1 ^operator O2149 +)
  25935. =>WM: (15065: O2150 ^name predict-no)
  25936. =>WM: (15064: O2149 ^name predict-yes)
  25937. =>WM: (15063: R1078 ^value 1)
  25938. =>WM: (15062: R1 ^reward R1078)
  25939. <=WM: (15053: S1 ^operator O2147 +)
  25940. <=WM: (15055: S1 ^operator O2147)
  25941. <=WM: (15054: S1 ^operator O2148 +)
  25942. <=WM: (15048: R1 ^reward R1077)
  25943. <=WM: (15051: O2148 ^name predict-no)
  25944. <=WM: (15050: O2147 ^name predict-yes)
  25945. <=WM: (15049: R1077 ^value 1)
  25946. --- Inner Elaboration Phase, active level 1 (S1) ---
  25947. Firing prefer*rvt*predict-yes*H0
  25948. -->
  25949. Firing rl*prefer*rvt*predict-yes*H0*3
  25950. -->
  25951. (S1 ^operator O2149 = 0.1844112992875125)
  25952. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25953. -->
  25954. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  25955. -->
  25956. (S1 ^operator O2149 = 0.1398795999120246)
  25957. Firing prefer*rvt*predict-no*H0
  25958. -->
  25959. Firing rl*prefer*rvt*predict-no*H0*4
  25960. -->
  25961. (S1 ^operator O2150 = 0.4476193732504098)
  25962. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25963. -->
  25964. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  25965. -->
  25966. (S1 ^operator O2150 = 0.5523810978783164)
  25967. inner elaboration loop at bottom goal.
  25968. Retracting rl*prefer*rvt*predict-no*H0*4
  25969. -->
  25970. (S1 ^operator O2148 = 0.4476193732504098)
  25971. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  25972. -->
  25973. (S1 ^operator O2148 = 0.5523810978783164)
  25974. Retracting rl*prefer*rvt*predict-yes*H0*3
  25975. -->
  25976. (S1 ^operator O2147 = 0.1844112992875125)
  25977. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  25978. -->
  25979. (S1 ^operator O2147 = 0.1398795999120246)
  25980. --- END Proposal Phase ---
  25981. --- Decision Phase ---
  25982. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.906077,0.085574)
  25983. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324581 0.491004 0.815585 -> 0.324582 0.491004 0.815586(R,m,v=1,1,0)
  25984. =>WM: (15068: S1 ^operator O2150)
  25985. 1075: O: O2150 (predict-no)
  25986. --- END Decision Phase ---
  25987. --- Application Phase ---
  25988. --- Firing Productions (PE) For State At Depth 1 ---
  25989. --- Inner Elaboration Phase, active level 1 (S1) ---
  25990. Firing apply*operator
  25991. -->
  25992. (I3 ^predict-no N1075 + :O )
  25993. Firing apply*operator*complete
  25994. -->
  25995. (I3 ^predict-yes N1074 - :O )
  25996. inner elaboration loop at bottom goal.
  25997. --- Change Working Memory (PE) ---
  25998. =>WM: (15069: I3 ^predict-no N1075)
  25999. <=WM: (15057: N1074 ^status complete)
  26000. <=WM: (15056: I3 ^predict-yes N1074)
  26001. --- Firing Productions (IE) For State At Depth 1 ---
  26002. --- Inner Elaboration Phase, active level 1 (S1) ---
  26003. Firing monitor*world
  26004. -->
  26005. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26006. --- Change Working Memory (IE) ---
  26007. --- END Application Phase ---
  26008. --- Output Phase ---
  26009. ENV: Agent did: predict-no for direction R in state State-B
  26010. In State-B moving R
  26011. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26012. predict error 0
  26013. dir: dir isL
  26014. --- END Output Phase ---
  26015. /|\--- Input Phase ---
  26016. =>WM: (15073: I2 ^dir L)
  26017. =>WM: (15072: I2 ^reward 1)
  26018. =>WM: (15071: I2 ^see 0)
  26019. =>WM: (15070: N1075 ^status complete)
  26020. <=WM: (15060: I2 ^dir R)
  26021. <=WM: (15059: I2 ^reward 1)
  26022. <=WM: (15058: I2 ^see 1)
  26023. =>WM: (15074: I2 ^level-1 R0-root)
  26024. <=WM: (15061: I2 ^level-1 R1-root)
  26025. --- END Input Phase ---
  26026. --- Proposal Phase ---
  26027. --- Inner Elaboration Phase, active level 1 (S1) ---
  26028. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  26029. -->
  26030. (S1 ^operator O2149 = 0.6104606905185325)
  26031. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  26032. -->
  26033. (S1 ^operator O2150 = 0.1063475139796038)
  26034. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26035. -->
  26036. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26037. -->
  26038. Firing elaborate*copy-see-to-output-link
  26039. -->
  26040. (I3 ^see 0 +)
  26041. Firing elaborate*reward*based*on*reward
  26042. -->
  26043. (R1079 ^value 1 +)
  26044. (R1 ^reward R1079 +)
  26045. Firing propose*predict-yes
  26046. -->
  26047. (O2151 ^name predict-yes +)
  26048. (S1 ^operator O2151 +)
  26049. Firing propose*predict-no
  26050. -->
  26051. (O2152 ^name predict-no +)
  26052. (S1 ^operator O2152 +)
  26053. Firing rl*prefer*rvt*predict-no*H0*2
  26054. -->
  26055. (S1 ^operator O2150 = 0.3873354627937282)
  26056. Firing rl*prefer*rvt*predict-yes*H0*1
  26057. -->
  26058. (S1 ^operator O2149 = 0.3895396269029936)
  26059. Firing prefer*rvt*predict-yes*H0
  26060. -->
  26061. Firing prefer*rvt*predict-no*H0
  26062. -->
  26063. Firing elaborate*copy-dir-to-output-link
  26064. -->
  26065. (I3 ^dir L +)
  26066. inner elaboration loop at bottom goal.
  26067. Retracting elaborate*copy-see-to-output-link
  26068. -->
  26069. (I3 ^see 1 +)
  26070. Retracting propose*predict-no
  26071. -->
  26072. (O2150 ^name predict-no +)
  26073. (S1 ^operator O2150 +)
  26074. Retracting propose*predict-yes
  26075. -->
  26076. (O2149 ^name predict-yes +)
  26077. (S1 ^operator O2149 +)
  26078. Retracting elaborate*reward*based*on*reward
  26079. -->
  26080. (R1078 ^value 1 +)
  26081. (R1 ^reward R1078 +)
  26082. Retracting elaborate*copy-dir-to-output-link
  26083. -->
  26084. (I3 ^dir R +)
  26085. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  26086. -->
  26087. (S1 ^operator O2150 = 0.5523810978783164)
  26088. Retracting rl*prefer*rvt*predict-no*H0*4
  26089. -->
  26090. (S1 ^operator O2150 = 0.4476193732504098)
  26091. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  26092. -->
  26093. (S1 ^operator O2149 = 0.1398795999120246)
  26094. Retracting rl*prefer*rvt*predict-yes*H0*3
  26095. -->
  26096. (S1 ^operator O2149 = 0.1844118603489351)
  26097. =>WM: (15082: S1 ^operator O2152 +)
  26098. =>WM: (15081: S1 ^operator O2151 +)
  26099. =>WM: (15080: I3 ^dir L)
  26100. =>WM: (15079: O2152 ^name predict-no)
  26101. =>WM: (15078: O2151 ^name predict-yes)
  26102. =>WM: (15077: R1079 ^value 1)
  26103. =>WM: (15076: R1 ^reward R1079)
  26104. =>WM: (15075: I3 ^see 0)
  26105. <=WM: (15066: S1 ^operator O2149 +)
  26106. <=WM: (15067: S1 ^operator O2150 +)
  26107. <=WM: (15068: S1 ^operator O2150)
  26108. <=WM: (15052: I3 ^dir R)
  26109. <=WM: (15062: R1 ^reward R1078)
  26110. <=WM: (15005: I3 ^see 1)
  26111. <=WM: (15065: O2150 ^name predict-no)
  26112. <=WM: (15064: O2149 ^name predict-yes)
  26113. <=WM: (15063: R1078 ^value 1)
  26114. --- Inner Elaboration Phase, active level 1 (S1) ---
  26115. Firing prefer*rvt*predict-yes*H0
  26116. -->
  26117. Firing rl*prefer*rvt*predict-yes*H0*1
  26118. -->
  26119. (S1 ^operator O2151 = 0.3895396269029936)
  26120. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26121. -->
  26122. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  26123. -->
  26124. (S1 ^operator O2151 = 0.6104606905185325)
  26125. Firing prefer*rvt*predict-no*H0
  26126. -->
  26127. Firing rl*prefer*rvt*predict-no*H0*2
  26128. -->
  26129. (S1 ^operator O2152 = 0.3873354627937282)
  26130. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26131. -->
  26132. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  26133. -->
  26134. (S1 ^operator O2152 = 0.1063475139796038)
  26135. inner elaboration loop at bottom goal.
  26136. Retracting rl*prefer*rvt*predict-no*H0*2
  26137. -->
  26138. (S1 ^operator O2150 = 0.3873354627937282)
  26139. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  26140. -->
  26141. (S1 ^operator O2150 = 0.1063475139796038)
  26142. Retracting rl*prefer*rvt*predict-yes*H0*1
  26143. -->
  26144. (S1 ^operator O2149 = 0.3895396269029936)
  26145. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  26146. -->
  26147. (S1 ^operator O2149 = 0.6104606905185325)
  26148. --- END Proposal Phase ---
  26149. --- Decision Phase ---
  26150. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.93617,0.0601824)
  26151. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  26152. =>WM: (15083: S1 ^operator O2151)
  26153. 1076: O: O2151 (predict-yes)
  26154. --- END Decision Phase ---
  26155. --- Application Phase ---
  26156. --- Firing Productions (PE) For State At Depth 1 ---
  26157. --- Inner Elaboration Phase, active level 1 (S1) ---
  26158. Firing apply*operator
  26159. -->
  26160. (I3 ^predict-yes N1076 + :O )
  26161. Firing apply*operator*complete
  26162. -->
  26163. (I3 ^predict-no N1075 - :O )
  26164. inner elaboration loop at bottom goal.
  26165. --- Change Working Memory (PE) ---
  26166. =>WM: (15084: I3 ^predict-yes N1076)
  26167. <=WM: (15070: N1075 ^status complete)
  26168. <=WM: (15069: I3 ^predict-no N1075)
  26169. --- Firing Productions (IE) For State At Depth 1 ---
  26170. --- Inner Elaboration Phase, active level 1 (S1) ---
  26171. Firing monitor*world
  26172. -->
  26173. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26174. --- Change Working Memory (IE) ---
  26175. --- END Application Phase ---
  26176. --- Output Phase ---
  26177. ENV: Agent did: predict-yes for direction L in state State-B
  26178. In State-B moving L
  26179. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26180. predict error 0
  26181. dir: dir isL
  26182. --- END Output Phase ---
  26183. -/|--- Input Phase ---
  26184. =>WM: (15088: I2 ^dir L)
  26185. =>WM: (15087: I2 ^reward 1)
  26186. =>WM: (15086: I2 ^see 1)
  26187. =>WM: (15085: N1076 ^status complete)
  26188. <=WM: (15073: I2 ^dir L)
  26189. <=WM: (15072: I2 ^reward 1)
  26190. <=WM: (15071: I2 ^see 0)
  26191. =>WM: (15089: I2 ^level-1 L1-root)
  26192. <=WM: (15074: I2 ^level-1 R0-root)
  26193. --- END Input Phase ---
  26194. --- Proposal Phase ---
  26195. --- Inner Elaboration Phase, active level 1 (S1) ---
  26196. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26197. -->
  26198. (S1 ^operator O2152 = 0.6126637177448905)
  26199. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26200. -->
  26201. (S1 ^operator O2151 = -0.02274740735326741)
  26202. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26203. -->
  26204. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26205. -->
  26206. Firing elaborate*copy-see-to-output-link
  26207. -->
  26208. (I3 ^see 1 +)
  26209. Firing elaborate*reward*based*on*reward
  26210. -->
  26211. (R1080 ^value 1 +)
  26212. (R1 ^reward R1080 +)
  26213. Firing propose*predict-yes
  26214. -->
  26215. (O2153 ^name predict-yes +)
  26216. (S1 ^operator O2153 +)
  26217. Firing propose*predict-no
  26218. -->
  26219. (O2154 ^name predict-no +)
  26220. (S1 ^operator O2154 +)
  26221. Firing rl*prefer*rvt*predict-no*H0*2
  26222. -->
  26223. (S1 ^operator O2152 = 0.3873354627937282)
  26224. Firing rl*prefer*rvt*predict-yes*H0*1
  26225. -->
  26226. (S1 ^operator O2151 = 0.3895396269029936)
  26227. Firing prefer*rvt*predict-yes*H0
  26228. -->
  26229. Firing prefer*rvt*predict-no*H0
  26230. -->
  26231. Firing elaborate*copy-dir-to-output-link
  26232. -->
  26233. (I3 ^dir L +)
  26234. inner elaboration loop at bottom goal.
  26235. Retracting elaborate*copy-see-to-output-link
  26236. -->
  26237. (I3 ^see 0 +)
  26238. Retracting propose*predict-no
  26239. -->
  26240. (O2152 ^name predict-no +)
  26241. (S1 ^operator O2152 +)
  26242. Retracting propose*predict-yes
  26243. -->
  26244. (O2151 ^name predict-yes +)
  26245. (S1 ^operator O2151 +)
  26246. Retracting elaborate*reward*based*on*reward
  26247. -->
  26248. (R1079 ^value 1 +)
  26249. (R1 ^reward R1079 +)
  26250. Retracting elaborate*copy-dir-to-output-link
  26251. -->
  26252. (I3 ^dir L +)
  26253. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  26254. -->
  26255. (S1 ^operator O2152 = 0.1063475139796038)
  26256. Retracting rl*prefer*rvt*predict-no*H0*2
  26257. -->
  26258. (S1 ^operator O2152 = 0.3873354627937282)
  26259. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  26260. -->
  26261. (S1 ^operator O2151 = 0.6104606905185325)
  26262. Retracting rl*prefer*rvt*predict-yes*H0*1
  26263. -->
  26264. (S1 ^operator O2151 = 0.3895396269029936)
  26265. =>WM: (15096: S1 ^operator O2154 +)
  26266. =>WM: (15095: S1 ^operator O2153 +)
  26267. =>WM: (15094: O2154 ^name predict-no)
  26268. =>WM: (15093: O2153 ^name predict-yes)
  26269. =>WM: (15092: R1080 ^value 1)
  26270. =>WM: (15091: R1 ^reward R1080)
  26271. =>WM: (15090: I3 ^see 1)
  26272. <=WM: (15081: S1 ^operator O2151 +)
  26273. <=WM: (15083: S1 ^operator O2151)
  26274. <=WM: (15082: S1 ^operator O2152 +)
  26275. <=WM: (15076: R1 ^reward R1079)
  26276. <=WM: (15075: I3 ^see 0)
  26277. <=WM: (15079: O2152 ^name predict-no)
  26278. <=WM: (15078: O2151 ^name predict-yes)
  26279. <=WM: (15077: R1079 ^value 1)
  26280. --- Inner Elaboration Phase, active level 1 (S1) ---
  26281. Firing prefer*rvt*predict-yes*H0
  26282. -->
  26283. Firing rl*prefer*rvt*predict-yes*H0*1
  26284. -->
  26285. (S1 ^operator O2153 = 0.3895396269029936)
  26286. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26287. -->
  26288. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26289. -->
  26290. (S1 ^operator O2153 = -0.02274740735326741)
  26291. Firing prefer*rvt*predict-no*H0
  26292. -->
  26293. Firing rl*prefer*rvt*predict-no*H0*2
  26294. -->
  26295. (S1 ^operator O2154 = 0.3873354627937282)
  26296. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26297. -->
  26298. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26299. -->
  26300. (S1 ^operator O2154 = 0.6126637177448905)
  26301. inner elaboration loop at bottom goal.
  26302. Retracting rl*prefer*rvt*predict-no*H0*2
  26303. -->
  26304. (S1 ^operator O2152 = 0.3873354627937282)
  26305. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26306. -->
  26307. (S1 ^operator O2152 = 0.6126637177448905)
  26308. Retracting rl*prefer*rvt*predict-yes*H0*1
  26309. -->
  26310. (S1 ^operator O2151 = 0.3895396269029936)
  26311. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26312. -->
  26313. (S1 ^operator O2151 = -0.02274740735326741)
  26314. --- END Proposal Phase ---
  26315. --- Decision Phase ---
  26316. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.899441,0.0909547)
  26317. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  26318. =>WM: (15097: S1 ^operator O2154)
  26319. 1077: O: O2154 (predict-no)
  26320. --- END Decision Phase ---
  26321. --- Application Phase ---
  26322. --- Firing Productions (PE) For State At Depth 1 ---
  26323. --- Inner Elaboration Phase, active level 1 (S1) ---
  26324. Firing apply*operator
  26325. -->
  26326. (I3 ^predict-no N1077 + :O )
  26327. Firing apply*operator*complete
  26328. -->
  26329. (I3 ^predict-yes N1076 - :O )
  26330. inner elaboration loop at bottom goal.
  26331. --- Change Working Memory (PE) ---
  26332. =>WM: (15098: I3 ^predict-no N1077)
  26333. <=WM: (15085: N1076 ^status complete)
  26334. <=WM: (15084: I3 ^predict-yes N1076)
  26335. --- Firing Productions (IE) For State At Depth 1 ---
  26336. --- Inner Elaboration Phase, active level 1 (S1) ---
  26337. Firing monitor*world
  26338. -->
  26339. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26340. --- Change Working Memory (IE) ---
  26341. --- END Application Phase ---
  26342. --- Output Phase ---
  26343. ENV: Agent did: predict-no for direction L in state State-A
  26344. In State-A moving L
  26345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26346. predict error 0
  26347. dir: dir isR
  26348. --- END Output Phase ---
  26349. \---- Input Phase ---
  26350. =>WM: (15102: I2 ^dir R)
  26351. =>WM: (15101: I2 ^reward 1)
  26352. =>WM: (15100: I2 ^see 0)
  26353. =>WM: (15099: N1077 ^status complete)
  26354. <=WM: (15088: I2 ^dir L)
  26355. <=WM: (15087: I2 ^reward 1)
  26356. <=WM: (15086: I2 ^see 1)
  26357. =>WM: (15103: I2 ^level-1 L0-root)
  26358. <=WM: (15089: I2 ^level-1 L1-root)
  26359. --- END Input Phase ---
  26360. --- Proposal Phase ---
  26361. --- Inner Elaboration Phase, active level 1 (S1) ---
  26362. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  26363. -->
  26364. (S1 ^operator O2153 = 0.8155902461537636)
  26365. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  26366. -->
  26367. (S1 ^operator O2154 = -0.00558448899823713)
  26368. Firing prefer*rvt*predict-no*H0*4*v1*H1
  26369. -->
  26370. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  26371. -->
  26372. Firing elaborate*copy-see-to-output-link
  26373. -->
  26374. (I3 ^see 0 +)
  26375. Firing elaborate*reward*based*on*reward
  26376. -->
  26377. (R1081 ^value 1 +)
  26378. (R1 ^reward R1081 +)
  26379. Firing propose*predict-yes
  26380. -->
  26381. (O2155 ^name predict-yes +)
  26382. (S1 ^operator O2155 +)
  26383. Firing propose*predict-no
  26384. -->
  26385. (O2156 ^name predict-no +)
  26386. (S1 ^operator O2156 +)
  26387. Firing rl*prefer*rvt*predict-no*H0*4
  26388. -->
  26389. (S1 ^operator O2154 = 0.4476193025811009)
  26390. Firing rl*prefer*rvt*predict-yes*H0*3
  26391. -->
  26392. (S1 ^operator O2153 = 0.1844118603489351)
  26393. Firing prefer*rvt*predict-yes*H0
  26394. -->
  26395. Firing prefer*rvt*predict-no*H0
  26396. -->
  26397. Firing elaborate*copy-dir-to-output-link
  26398. -->
  26399. (I3 ^dir R +)
  26400. inner elaboration loop at bottom goal.
  26401. Retracting elaborate*copy-see-to-output-link
  26402. -->
  26403. (I3 ^see 1 +)
  26404. Retracting propose*predict-no
  26405. -->
  26406. (O2154 ^name predict-no +)
  26407. (S1 ^operator O2154 +)
  26408. Retracting propose*predict-yes
  26409. -->
  26410. (O2153 ^name predict-yes +)
  26411. (S1 ^operator O2153 +)
  26412. Retracting elaborate*reward*based*on*reward
  26413. -->
  26414. (R1080 ^value 1 +)
  26415. (R1 ^reward R1080 +)
  26416. Retracting elaborate*copy-dir-to-output-link
  26417. -->
  26418. (I3 ^dir L +)
  26419. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26420. -->
  26421. (S1 ^operator O2154 = 0.6126637177448905)
  26422. Retracting rl*prefer*rvt*predict-no*H0*2
  26423. -->
  26424. (S1 ^operator O2154 = 0.3873354627937282)
  26425. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26426. -->
  26427. (S1 ^operator O2153 = -0.02274740735326741)
  26428. Retracting rl*prefer*rvt*predict-yes*H0*1
  26429. -->
  26430. (S1 ^operator O2153 = 0.3895395792897647)
  26431. =>WM: (15111: S1 ^operator O2156 +)
  26432. =>WM: (15110: S1 ^operator O2155 +)
  26433. =>WM: (15109: I3 ^dir R)
  26434. =>WM: (15108: O2156 ^name predict-no)
  26435. =>WM: (15107: O2155 ^name predict-yes)
  26436. =>WM: (15106: R1081 ^value 1)
  26437. =>WM: (15105: R1 ^reward R1081)
  26438. =>WM: (15104: I3 ^see 0)
  26439. <=WM: (15095: S1 ^operator O2153 +)
  26440. <=WM: (15096: S1 ^operator O2154 +)
  26441. <=WM: (15097: S1 ^operator O2154)
  26442. <=WM: (15080: I3 ^dir L)
  26443. <=WM: (15091: R1 ^reward R1080)
  26444. <=WM: (15090: I3 ^see 1)
  26445. <=WM: (15094: O2154 ^name predict-no)
  26446. <=WM: (15093: O2153 ^name predict-yes)
  26447. <=WM: (15092: R1080 ^value 1)
  26448. --- Inner Elaboration Phase, active level 1 (S1) ---
  26449. Firing prefer*rvt*predict-yes*H0
  26450. -->
  26451. Firing rl*prefer*rvt*predict-yes*H0*3
  26452. -->
  26453. (S1 ^operator O2155 = 0.1844118603489351)
  26454. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  26455. -->
  26456. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  26457. -->
  26458. (S1 ^operator O2155 = 0.8155902461537636)
  26459. Firing prefer*rvt*predict-no*H0
  26460. -->
  26461. Firing rl*prefer*rvt*predict-no*H0*4
  26462. -->
  26463. (S1 ^operator O2156 = 0.4476193025811009)
  26464. Firing prefer*rvt*predict-no*H0*4*v1*H1
  26465. -->
  26466. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  26467. -->
  26468. (S1 ^operator O2156 = -0.00558448899823713)
  26469. inner elaboration loop at bottom goal.
  26470. Retracting rl*prefer*rvt*predict-no*H0*4
  26471. -->
  26472. (S1 ^operator O2154 = 0.4476193025811009)
  26473. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  26474. -->
  26475. (S1 ^operator O2154 = -0.00558448899823713)
  26476. Retracting rl*prefer*rvt*predict-yes*H0*3
  26477. -->
  26478. (S1 ^operator O2153 = 0.1844118603489351)
  26479. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  26480. -->
  26481. (S1 ^operator O2153 = 0.8155902461537636)
  26482. --- END Proposal Phase ---
  26483. --- Decision Phase ---
  26484. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387335 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.936842,0.059482)
  26485. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.28092 0.331744 0.612664 -> 0.28092 0.331744 0.612664(R,m,v=1,1,0)
  26486. =>WM: (15112: S1 ^operator O2155)
  26487. 1078: O: O2155 (predict-yes)
  26488. --- END Decision Phase ---
  26489. --- Application Phase ---
  26490. --- Firing Productions (PE) For State At Depth 1 ---
  26491. --- Inner Elaboration Phase, active level 1 (S1) ---
  26492. Firing apply*operator
  26493. -->
  26494. (I3 ^predict-yes N1078 + :O )
  26495. Firing apply*operator*complete
  26496. -->
  26497. (I3 ^predict-no N1077 - :O )
  26498. inner elaboration loop at bottom goal.
  26499. --- Change Working Memory (PE) ---
  26500. =>WM: (15113: I3 ^predict-yes N1078)
  26501. <=WM: (15099: N1077 ^status complete)
  26502. <=WM: (15098: I3 ^predict-no N1077)
  26503. --- Firing Productions (IE) For State At Depth 1 ---
  26504. --- Inner Elaboration Phase, active level 1 (S1) ---
  26505. Firing monitor*world
  26506. -->
  26507. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26508. --- Change Working Memory (IE) ---
  26509. --- END Application Phase ---
  26510. --- Output Phase ---
  26511. ENV: Agent did: predict-yes for direction R in state State-A
  26512. In State-A moving R
  26513. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  26514. predict error 0
  26515. dir: dir isL
  26516. --- END Output Phase ---
  26517. --- Input Phase ---
  26518. =>WM: (15117: I2 ^dir L)
  26519. =>WM: (15116: I2 ^reward 1)
  26520. =>WM: (15115: I2 ^see 1)
  26521. =>WM: (15114: N1078 ^status complete)
  26522. <=WM: (15102: I2 ^dir R)
  26523. <=WM: (15101: I2 ^reward 1)
  26524. <=WM: (15100: I2 ^see 0)
  26525. =>WM: (15118: I2 ^level-1 R1-root)
  26526. <=WM: (15103: I2 ^level-1 L0-root)
  26527. --- END Input Phase ---
  26528. --- Proposal Phase ---
  26529. --- Inner Elaboration Phase, active level 1 (S1) ---
  26530. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  26531. -->
  26532. (S1 ^operator O2155 = 0.6104600509437957)
  26533. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  26534. -->
  26535. (S1 ^operator O2156 = 0.2714993082286609)
  26536. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26537. -->
  26538. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26539. -->
  26540. Firing elaborate*copy-see-to-output-link
  26541. -->
  26542. (I3 ^see 1 +)
  26543. Firing elaborate*reward*based*on*reward
  26544. -->
  26545. (R1082 ^value 1 +)
  26546. (R1 ^reward R1082 +)
  26547. Firing propose*predict-yes
  26548. -->
  26549. (O2157 ^name predict-yes +)
  26550. (S1 ^operator O2157 +)
  26551. Firing propose*predict-no
  26552. -->
  26553. (O2158 ^name predict-no +)
  26554. (S1 ^operator O2158 +)
  26555. Firing rl*prefer*rvt*predict-no*H0*2
  26556. -->
  26557. (S1 ^operator O2156 = 0.3873355857129354)
  26558. Firing rl*prefer*rvt*predict-yes*H0*1
  26559. -->
  26560. (S1 ^operator O2155 = 0.3895395792897647)
  26561. Firing prefer*rvt*predict-yes*H0
  26562. -->
  26563. Firing prefer*rvt*predict-no*H0
  26564. -->
  26565. Firing elaborate*copy-dir-to-output-link
  26566. -->
  26567. (I3 ^dir L +)
  26568. inner elaboration loop at bottom goal.
  26569. Retracting elaborate*copy-see-to-output-link
  26570. -->
  26571. (I3 ^see 0 +)
  26572. Retracting propose*predict-no
  26573. -->
  26574. (O2156 ^name predict-no +)
  26575. (S1 ^operator O2156 +)
  26576. Retracting propose*predict-yes
  26577. -->
  26578. (O2155 ^name predict-yes +)
  26579. (S1 ^operator O2155 +)
  26580. Retracting elaborate*reward*based*on*reward
  26581. -->
  26582. (R1081 ^value 1 +)
  26583. (R1 ^reward R1081 +)
  26584. Retracting elaborate*copy-dir-to-output-link
  26585. -->
  26586. (I3 ^dir R +)
  26587. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  26588. -->
  26589. (S1 ^operator O2156 = -0.00558448899823713)
  26590. Retracting rl*prefer*rvt*predict-no*H0*4
  26591. -->
  26592. (S1 ^operator O2156 = 0.4476193025811009)
  26593. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  26594. -->
  26595. (S1 ^operator O2155 = 0.8155902461537636)
  26596. Retracting rl*prefer*rvt*predict-yes*H0*3
  26597. -->
  26598. (S1 ^operator O2155 = 0.1844118603489351)
  26599. =>WM: (15126: S1 ^operator O2158 +)
  26600. =>WM: (15125: S1 ^operator O2157 +)
  26601. =>WM: (15124: I3 ^dir L)
  26602. =>WM: (15123: O2158 ^name predict-no)
  26603. =>WM: (15122: O2157 ^name predict-yes)
  26604. =>WM: (15121: R1082 ^value 1)
  26605. =>WM: (15120: R1 ^reward R1082)
  26606. =>WM: (15119: I3 ^see 1)
  26607. <=WM: (15110: S1 ^operator O2155 +)
  26608. <=WM: (15112: S1 ^operator O2155)
  26609. <=WM: (15111: S1 ^operator O2156 +)
  26610. <=WM: (15109: I3 ^dir R)
  26611. <=WM: (15105: R1 ^reward R1081)
  26612. <=WM: (15104: I3 ^see 0)
  26613. <=WM: (15108: O2156 ^name predict-no)
  26614. <=WM: (15107: O2155 ^name predict-yes)
  26615. <=WM: (15106: R1081 ^value 1)
  26616. --- Inner Elaboration Phase, active level 1 (S1) ---
  26617. Firing prefer*rvt*predict-yes*H0
  26618. -->
  26619. Firing rl*prefer*rvt*predict-yes*H0*1
  26620. -->
  26621. (S1 ^operator O2157 = 0.3895395792897647)
  26622. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26623. -->
  26624. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  26625. -->
  26626. (S1 ^operator O2157 = 0.6104600509437957)
  26627. Firing prefer*rvt*predict-no*H0
  26628. -->
  26629. Firing rl*prefer*rvt*predict-no*H0*2
  26630. -->
  26631. (S1 ^operator O2158 = 0.3873355857129354)
  26632. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26633. -->
  26634. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  26635. -->
  26636. (S1 ^operator O2158 = 0.2714993082286609)
  26637. inner elaboration loop at bottom goal.
  26638. Retracting rl*prefer*rvt*predict-no*H0*2
  26639. -->
  26640. (S1 ^operator O2156 = 0.3873355857129354)
  26641. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  26642. -->
  26643. (S1 ^operator O2156 = 0.2714993082286609)
  26644. Retracting rl*prefer*rvt*predict-yes*H0*1
  26645. -->
  26646. (S1 ^operator O2155 = 0.3895395792897647)
  26647. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  26648. -->
  26649. (S1 ^operator O2155 = 0.6104600509437957)
  26650. --- END Proposal Phase ---
  26651. --- Decision Phase ---
  26652. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675414 -0.491003 0.184412(R,m,v=1,0.906593,0.0851497)
  26653. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324588 0.491002 0.81559 -> 0.324588 0.491002 0.81559(R,m,v=1,1,0)
  26654. =>WM: (15127: S1 ^operator O2157)
  26655. 1079: O: O2157 (predict-yes)
  26656. --- END Decision Phase ---
  26657. --- Application Phase ---
  26658. --- Firing Productions (PE) For State At Depth 1 ---
  26659. --- Inner Elaboration Phase, active level 1 (S1) ---
  26660. Firing apply*operator
  26661. -->
  26662. (I3 ^predict-yes N1079 + :O )
  26663. Firing apply*operator*complete
  26664. -->
  26665. (I3 ^predict-yes N1078 - :O )
  26666. inner elaboration loop at bottom goal.
  26667. --- Change Working Memory (PE) ---
  26668. =>WM: (15128: I3 ^predict-yes N1079)
  26669. <=WM: (15114: N1078 ^status complete)
  26670. <=WM: (15113: I3 ^predict-yes N1078)
  26671. --- Firing Productions (IE) For State At Depth 1 ---
  26672. --- Inner Elaboration Phase, active level 1 (S1) ---
  26673. Firing monitor*world
  26674. -->
  26675. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26676. --- Change Working Memory (IE) ---
  26677. --- END Application Phase ---
  26678. --- Output Phase ---
  26679. ENV: Agent did: predict-yes for direction L in state State-B
  26680. In State-B moving L
  26681. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26682. predict error 0
  26683. dir: dir isU
  26684. --- END Output Phase ---
  26685. --- Input Phase ---
  26686. =>WM: (15132: I2 ^dir U)
  26687. =>WM: (15131: I2 ^reward 1)
  26688. =>WM: (15130: I2 ^see 1)
  26689. =>WM: (15129: N1079 ^status complete)
  26690. <=WM: (15117: I2 ^dir L)
  26691. <=WM: (15116: I2 ^reward 1)
  26692. <=WM: (15115: I2 ^see 1)
  26693. =>WM: (15133: I2 ^level-1 L1-root)
  26694. <=WM: (15118: I2 ^level-1 R1-root)
  26695. --- END Input Phase ---
  26696. --- Proposal Phase ---
  26697. --- Inner Elaboration Phase, active level 1 (S1) ---
  26698. Firing elaborate*copy-see-to-output-link
  26699. -->
  26700. (I3 ^see 1 +)
  26701. Firing elaborate*reward*based*on*reward
  26702. -->
  26703. (R1083 ^value 1 +)
  26704. (R1 ^reward R1083 +)
  26705. Firing propose*predict-yes
  26706. -->
  26707. (O2159 ^name predict-yes +)
  26708. (S1 ^operator O2159 +)
  26709. Firing propose*predict-no
  26710. -->
  26711. (O2160 ^name predict-no +)
  26712. (S1 ^operator O2160 +)
  26713. Firing rl*prefer*rvt*predict-no*H0*6
  26714. -->
  26715. (S1 ^operator O2158 = 0.9999999999999999)
  26716. Firing rl*prefer*rvt*predict-yes*H0*5
  26717. -->
  26718. (S1 ^operator O2157 = 0.)
  26719. Firing prefer*rvt*predict-yes*H0
  26720. -->
  26721. Firing prefer*rvt*predict-no*H0
  26722. -->
  26723. Firing elaborate*copy-dir-to-output-link
  26724. -->
  26725. (I3 ^dir U +)
  26726. inner elaboration loop at bottom goal.
  26727. Retracting elaborate*copy-see-to-output-link
  26728. -->
  26729. (I3 ^see 1 +)
  26730. Retracting propose*predict-no
  26731. -->
  26732. (O2158 ^name predict-no +)
  26733. (S1 ^operator O2158 +)
  26734. Retracting propose*predict-yes
  26735. -->
  26736. (O2157 ^name predict-yes +)
  26737. (S1 ^operator O2157 +)
  26738. Retracting elaborate*reward*based*on*reward
  26739. -->
  26740. (R1082 ^value 1 +)
  26741. (R1 ^reward R1082 +)
  26742. Retracting elaborate*copy-dir-to-output-link
  26743. -->
  26744. (I3 ^dir L +)
  26745. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  26746. -->
  26747. (S1 ^operator O2158 = 0.2714993082286609)
  26748. Retracting rl*prefer*rvt*predict-no*H0*2
  26749. -->
  26750. (S1 ^operator O2158 = 0.3873355857129354)
  26751. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  26752. -->
  26753. (S1 ^operator O2157 = 0.6104600509437957)
  26754. Retracting rl*prefer*rvt*predict-yes*H0*1
  26755. -->
  26756. (S1 ^operator O2157 = 0.3895395792897647)
  26757. =>WM: (15140: S1 ^operator O2160 +)
  26758. =>WM: (15139: S1 ^operator O2159 +)
  26759. =>WM: (15138: I3 ^dir U)
  26760. =>WM: (15137: O2160 ^name predict-no)
  26761. =>WM: (15136: O2159 ^name predict-yes)
  26762. =>WM: (15135: R1083 ^value 1)
  26763. =>WM: (15134: R1 ^reward R1083)
  26764. <=WM: (15125: S1 ^operator O2157 +)
  26765. <=WM: (15127: S1 ^operator O2157)
  26766. <=WM: (15126: S1 ^operator O2158 +)
  26767. <=WM: (15124: I3 ^dir L)
  26768. <=WM: (15120: R1 ^reward R1082)
  26769. <=WM: (15123: O2158 ^name predict-no)
  26770. <=WM: (15122: O2157 ^name predict-yes)
  26771. <=WM: (15121: R1082 ^value 1)
  26772. --- Inner Elaboration Phase, active level 1 (S1) ---
  26773. Firing prefer*rvt*predict-yes*H0
  26774. -->
  26775. Firing rl*prefer*rvt*predict-yes*H0*5
  26776. -->
  26777. (S1 ^operator O2159 = 0.)
  26778. Firing prefer*rvt*predict-no*H0
  26779. -->
  26780. Firing rl*prefer*rvt*predict-no*H0*6
  26781. -->
  26782. (S1 ^operator O2160 = 0.9999999999999999)
  26783. inner elaboration loop at bottom goal.
  26784. Retracting rl*prefer*rvt*predict-no*H0*6
  26785. -->
  26786. (S1 ^operator O2158 = 0.9999999999999999)
  26787. Retracting rl*prefer*rvt*predict-yes*H0*5
  26788. -->
  26789. (S1 ^operator O2157 = 0.)
  26790. --- END Proposal Phase ---
  26791. --- Decision Phase ---
  26792. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.9,0.0905028)
  26793. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  26794. =>WM: (15141: S1 ^operator O2160)
  26795. 1080: O: O2160 (predict-no)
  26796. --- END Decision Phase ---
  26797. --- Application Phase ---
  26798. --- Firing Productions (PE) For State At Depth 1 ---
  26799. --- Inner Elaboration Phase, active level 1 (S1) ---
  26800. Firing apply*operator
  26801. -->
  26802. (I3 ^predict-no N1080 + :O )
  26803. Firing apply*operator*complete
  26804. -->
  26805. (I3 ^predict-yes N1079 - :O )
  26806. inner elaboration loop at bottom goal.
  26807. --- Change Working Memory (PE) ---
  26808. =>WM: (15142: I3 ^predict-no N1080)
  26809. <=WM: (15129: N1079 ^status complete)
  26810. <=WM: (15128: I3 ^predict-yes N1079)
  26811. --- Firing Productions (IE) For State At Depth 1 ---
  26812. --- Inner Elaboration Phase, active level 1 (S1) ---
  26813. Firing monitor*world
  26814. -->
  26815. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26816. --- Change Working Memory (IE) ---
  26817. --- END Application Phase ---
  26818. --- Output Phase ---
  26819. ENV: Agent did: predict-no for direction U in state State-A
  26820. In State-A moving U
  26821. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26822. predict error 0
  26823. dir: dir isL
  26824. --- END Output Phase ---
  26825. --- Input Phase ---
  26826. =>WM: (15146: I2 ^dir L)
  26827. =>WM: (15145: I2 ^reward 1)
  26828. =>WM: (15144: I2 ^see 0)
  26829. =>WM: (15143: N1080 ^status complete)
  26830. <=WM: (15132: I2 ^dir U)
  26831. <=WM: (15131: I2 ^reward 1)
  26832. <=WM: (15130: I2 ^see 1)
  26833. =>WM: (15147: I2 ^level-1 L1-root)
  26834. <=WM: (15133: I2 ^level-1 L1-root)
  26835. --- END Input Phase ---
  26836. --- Proposal Phase ---
  26837. --- Inner Elaboration Phase, active level 1 (S1) ---
  26838. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26839. -->
  26840. (S1 ^operator O2160 = 0.6126638406640976)
  26841. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26842. -->
  26843. (S1 ^operator O2159 = -0.02274740735326741)
  26844. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26845. -->
  26846. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26847. -->
  26848. Firing elaborate*copy-see-to-output-link
  26849. -->
  26850. (I3 ^see 0 +)
  26851. Firing elaborate*reward*based*on*reward
  26852. -->
  26853. (R1084 ^value 1 +)
  26854. (R1 ^reward R1084 +)
  26855. Firing propose*predict-yes
  26856. -->
  26857. (O2161 ^name predict-yes +)
  26858. (S1 ^operator O2161 +)
  26859. Firing propose*predict-no
  26860. -->
  26861. (O2162 ^name predict-no +)
  26862. (S1 ^operator O2162 +)
  26863. Firing rl*prefer*rvt*predict-no*H0*2
  26864. -->
  26865. (S1 ^operator O2160 = 0.3873355857129354)
  26866. Firing rl*prefer*rvt*predict-yes*H0*1
  26867. -->
  26868. (S1 ^operator O2159 = 0.3895396347547306)
  26869. Firing prefer*rvt*predict-yes*H0
  26870. -->
  26871. Firing prefer*rvt*predict-no*H0
  26872. -->
  26873. Firing elaborate*copy-dir-to-output-link
  26874. -->
  26875. (I3 ^dir L +)
  26876. inner elaboration loop at bottom goal.
  26877. Retracting elaborate*copy-see-to-output-link
  26878. -->
  26879. (I3 ^see 1 +)
  26880. Retracting propose*predict-no
  26881. -->
  26882. (O2160 ^name predict-no +)
  26883. (S1 ^operator O2160 +)
  26884. Retracting propose*predict-yes
  26885. -->
  26886. (O2159 ^name predict-yes +)
  26887. (S1 ^operator O2159 +)
  26888. Retracting elaborate*reward*based*on*reward
  26889. -->
  26890. (R1083 ^value 1 +)
  26891. (R1 ^reward R1083 +)
  26892. Retracting elaborate*copy-dir-to-output-link
  26893. -->
  26894. (I3 ^dir U +)
  26895. Retracting rl*prefer*rvt*predict-no*H0*6
  26896. -->
  26897. (S1 ^operator O2160 = 0.9999999999999999)
  26898. Retracting rl*prefer*rvt*predict-yes*H0*5
  26899. -->
  26900. (S1 ^operator O2159 = 0.)
  26901. =>WM: (15155: S1 ^operator O2162 +)
  26902. =>WM: (15154: S1 ^operator O2161 +)
  26903. =>WM: (15153: I3 ^dir L)
  26904. =>WM: (15152: O2162 ^name predict-no)
  26905. =>WM: (15151: O2161 ^name predict-yes)
  26906. =>WM: (15150: R1084 ^value 1)
  26907. =>WM: (15149: R1 ^reward R1084)
  26908. =>WM: (15148: I3 ^see 0)
  26909. <=WM: (15139: S1 ^operator O2159 +)
  26910. <=WM: (15140: S1 ^operator O2160 +)
  26911. <=WM: (15141: S1 ^operator O2160)
  26912. <=WM: (15138: I3 ^dir U)
  26913. <=WM: (15134: R1 ^reward R1083)
  26914. <=WM: (15119: I3 ^see 1)
  26915. <=WM: (15137: O2160 ^name predict-no)
  26916. <=WM: (15136: O2159 ^name predict-yes)
  26917. <=WM: (15135: R1083 ^value 1)
  26918. --- Inner Elaboration Phase, active level 1 (S1) ---
  26919. Firing prefer*rvt*predict-yes*H0
  26920. -->
  26921. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26922. -->
  26923. (S1 ^operator O2161 = -0.02274740735326741)
  26924. Firing rl*prefer*rvt*predict-yes*H0*1
  26925. -->
  26926. (S1 ^operator O2161 = 0.3895396347547306)
  26927. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  26928. -->
  26929. Firing prefer*rvt*predict-no*H0
  26930. -->
  26931. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26932. -->
  26933. (S1 ^operator O2162 = 0.6126638406640976)
  26934. Firing rl*prefer*rvt*predict-no*H0*2
  26935. -->
  26936. (S1 ^operator O2162 = 0.3873355857129354)
  26937. Firing prefer*rvt*predict-no*H0*2*v1*H1
  26938. -->
  26939. inner elaboration loop at bottom goal.
  26940. Retracting rl*prefer*rvt*predict-no*H0*2
  26941. -->
  26942. (S1 ^operator O2160 = 0.3873355857129354)
  26943. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  26944. -->
  26945. (S1 ^operator O2160 = 0.6126638406640976)
  26946. Retracting rl*prefer*rvt*predict-yes*H0*1
  26947. -->
  26948. (S1 ^operator O2159 = 0.3895396347547306)
  26949. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  26950. -->
  26951. (S1 ^operator O2159 = -0.02274740735326741)
  26952. --- END Proposal Phase ---
  26953. --- Decision Phase ---
  26954. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  26955. =>WM: (15156: S1 ^operator O2162)
  26956. 1081: O: O2162 (predict-no)
  26957. --- END Decision Phase ---
  26958. --- Application Phase ---
  26959. --- Firing Productions (PE) For State At Depth 1 ---
  26960. --- Inner Elaboration Phase, active level 1 (S1) ---
  26961. Firing apply*operator
  26962. -->
  26963. (I3 ^predict-no N1081 + :O )
  26964. Firing apply*operator*complete
  26965. -->
  26966. (I3 ^predict-no N1080 - :O )
  26967. inner elaboration loop at bottom goal.
  26968. --- Change Working Memory (PE) ---
  26969. =>WM: (15157: I3 ^predict-no N1081)
  26970. <=WM: (15143: N1080 ^status complete)
  26971. <=WM: (15142: I3 ^predict-no N1080)
  26972. --- Firing Productions (IE) For State At Depth 1 ---
  26973. --- Inner Elaboration Phase, active level 1 (S1) ---
  26974. Firing monitor*world
  26975. -->
  26976. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26977. --- Change Working Memory (IE) ---
  26978. --- END Application Phase ---
  26979. --- Output Phase ---
  26980. ENV: Agent did: predict-no for direction L in state State-A
  26981. In State-A moving L
  26982. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26983. predict error 0
  26984. dir: dir isU
  26985. --- END Output Phase ---
  26986. --- Input Phase ---
  26987. =>WM: (15161: I2 ^dir U)
  26988. =>WM: (15160: I2 ^reward 1)
  26989. =>WM: (15159: I2 ^see 0)
  26990. =>WM: (15158: N1081 ^status complete)
  26991. <=WM: (15146: I2 ^dir L)
  26992. <=WM: (15145: I2 ^reward 1)
  26993. <=WM: (15144: I2 ^see 0)
  26994. =>WM: (15162: I2 ^level-1 L0-root)
  26995. <=WM: (15147: I2 ^level-1 L1-root)
  26996. --- END Input Phase ---
  26997. --- Proposal Phase ---
  26998. --- Inner Elaboration Phase, active level 1 (S1) ---
  26999. Firing elaborate*copy-see-to-output-link
  27000. -->
  27001. (I3 ^see 0 +)
  27002. Firing elaborate*reward*based*on*reward
  27003. -->
  27004. (R1085 ^value 1 +)
  27005. (R1 ^reward R1085 +)
  27006. Firing propose*predict-yes
  27007. -->
  27008. (O2163 ^name predict-yes +)
  27009. (S1 ^operator O2163 +)
  27010. Firing propose*predict-no
  27011. -->
  27012. (O2164 ^name predict-no +)
  27013. (S1 ^operator O2164 +)
  27014. Firing rl*prefer*rvt*predict-no*H0*6
  27015. -->
  27016. (S1 ^operator O2162 = 0.9999999999999999)
  27017. Firing rl*prefer*rvt*predict-yes*H0*5
  27018. -->
  27019. (S1 ^operator O2161 = 0.)
  27020. Firing prefer*rvt*predict-yes*H0
  27021. -->
  27022. Firing prefer*rvt*predict-no*H0
  27023. -->
  27024. Firing elaborate*copy-dir-to-output-link
  27025. -->
  27026. (I3 ^dir U +)
  27027. inner elaboration loop at bottom goal.
  27028. Retracting elaborate*copy-see-to-output-link
  27029. -->
  27030. (I3 ^see 0 +)
  27031. Retracting propose*predict-no
  27032. -->
  27033. (O2162 ^name predict-no +)
  27034. (S1 ^operator O2162 +)
  27035. Retracting propose*predict-yes
  27036. -->
  27037. (O2161 ^name predict-yes +)
  27038. (S1 ^operator O2161 +)
  27039. Retracting elaborate*reward*based*on*reward
  27040. -->
  27041. (R1084 ^value 1 +)
  27042. (R1 ^reward R1084 +)
  27043. Retracting elaborate*copy-dir-to-output-link
  27044. -->
  27045. (I3 ^dir L +)
  27046. Retracting rl*prefer*rvt*predict-no*H0*2
  27047. -->
  27048. (S1 ^operator O2162 = 0.3873355857129354)
  27049. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*32
  27050. -->
  27051. (S1 ^operator O2162 = 0.6126638406640976)
  27052. Retracting rl*prefer*rvt*predict-yes*H0*1
  27053. -->
  27054. (S1 ^operator O2161 = 0.3895396347547306)
  27055. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*31
  27056. -->
  27057. (S1 ^operator O2161 = -0.02274740735326741)
  27058. =>WM: (15169: S1 ^operator O2164 +)
  27059. =>WM: (15168: S1 ^operator O2163 +)
  27060. =>WM: (15167: I3 ^dir U)
  27061. =>WM: (15166: O2164 ^name predict-no)
  27062. =>WM: (15165: O2163 ^name predict-yes)
  27063. =>WM: (15164: R1085 ^value 1)
  27064. =>WM: (15163: R1 ^reward R1085)
  27065. <=WM: (15154: S1 ^operator O2161 +)
  27066. <=WM: (15155: S1 ^operator O2162 +)
  27067. <=WM: (15156: S1 ^operator O2162)
  27068. <=WM: (15153: I3 ^dir L)
  27069. <=WM: (15149: R1 ^reward R1084)
  27070. <=WM: (15152: O2162 ^name predict-no)
  27071. <=WM: (15151: O2161 ^name predict-yes)
  27072. <=WM: (15150: R1084 ^value 1)
  27073. --- Inner Elaboration Phase, active level 1 (S1) ---
  27074. Firing prefer*rvt*predict-yes*H0
  27075. -->
  27076. Firing rl*prefer*rvt*predict-yes*H0*5
  27077. -->
  27078. (S1 ^operator O2163 = 0.)
  27079. Firing prefer*rvt*predict-no*H0
  27080. -->
  27081. Firing rl*prefer*rvt*predict-no*H0*6
  27082. -->
  27083. (S1 ^operator O2164 = 0.9999999999999999)
  27084. inner elaboration loop at bottom goal.
  27085. Retracting rl*prefer*rvt*predict-no*H0*6
  27086. -->
  27087. (S1 ^operator O2162 = 0.9999999999999999)
  27088. Retracting rl*prefer*rvt*predict-yes*H0*5
  27089. -->
  27090. (S1 ^operator O2161 = 0.)
  27091. --- END Proposal Phase ---
  27092. --- Decision Phase ---
  27093. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387336(R,m,v=1,0.937173,0.0591899)
  27094. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*32 0.28092 0.331744 0.612664 -> 0.28092 0.331744 0.612664(R,m,v=1,1,0)
  27095. =>WM: (15170: S1 ^operator O2164)
  27096. 1082: O: O2164 (predict-no)
  27097. --- END Decision Phase ---
  27098. --- Application Phase ---
  27099. --- Firing Productions (PE) For State At Depth 1 ---
  27100. --- Inner Elaboration Phase, active level 1 (S1) ---
  27101. Firing apply*operator
  27102. -->
  27103. (I3 ^predict-no N1082 + :O )
  27104. Firing apply*operator*complete
  27105. -->
  27106. (I3 ^predict-no N1081 - :O )
  27107. inner elaboration loop at bottom goal.
  27108. --- Change Working Memory (PE) ---
  27109. =>WM: (15171: I3 ^predict-no N1082)
  27110. <=WM: (15158: N1081 ^status complete)
  27111. <=WM: (15157: I3 ^predict-no N1081)
  27112. --- Firing Productions (IE) For State At Depth 1 ---
  27113. --- Inner Elaboration Phase, active level 1 (S1) ---
  27114. Firing monitor*world
  27115. -->
  27116. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27117. --- Change Working Memory (IE) ---
  27118. --- END Application Phase ---
  27119. --- Output Phase ---
  27120. ENV: Agent did: predict-no for direction U in state State-A
  27121. In State-A moving U
  27122. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27123. predict error 0
  27124. dir: dir isL
  27125. --- END Output Phase ---
  27126. --- Input Phase ---
  27127. =>WM: (15175: I2 ^dir L)
  27128. =>WM: (15174: I2 ^reward 1)
  27129. =>WM: (15173: I2 ^see 0)
  27130. =>WM: (15172: N1082 ^status complete)
  27131. <=WM: (15161: I2 ^dir U)
  27132. <=WM: (15160: I2 ^reward 1)
  27133. <=WM: (15159: I2 ^see 0)
  27134. =>WM: (15176: I2 ^level-1 L0-root)
  27135. <=WM: (15162: I2 ^level-1 L0-root)
  27136. --- END Input Phase ---
  27137. --- Proposal Phase ---
  27138. --- Inner Elaboration Phase, active level 1 (S1) ---
  27139. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  27140. -->
  27141. (S1 ^operator O2163 = 0.1599599085218832)
  27142. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  27143. -->
  27144. (S1 ^operator O2164 = 0.612665523112643)
  27145. Firing prefer*rvt*predict-no*H0*2*v1*H1
  27146. -->
  27147. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  27148. -->
  27149. Firing elaborate*copy-see-to-output-link
  27150. -->
  27151. (I3 ^see 0 +)
  27152. Firing elaborate*reward*based*on*reward
  27153. -->
  27154. (R1086 ^value 1 +)
  27155. (R1 ^reward R1086 +)
  27156. Firing propose*predict-yes
  27157. -->
  27158. (O2165 ^name predict-yes +)
  27159. (S1 ^operator O2165 +)
  27160. Firing propose*predict-no
  27161. -->
  27162. (O2166 ^name predict-no +)
  27163. (S1 ^operator O2166 +)
  27164. Firing rl*prefer*rvt*predict-no*H0*2
  27165. -->
  27166. (S1 ^operator O2164 = 0.3873356717563805)
  27167. Firing rl*prefer*rvt*predict-yes*H0*1
  27168. -->
  27169. (S1 ^operator O2163 = 0.3895396347547306)
  27170. Firing prefer*rvt*predict-yes*H0
  27171. -->
  27172. Firing prefer*rvt*predict-no*H0
  27173. -->
  27174. Firing elaborate*copy-dir-to-output-link
  27175. -->
  27176. (I3 ^dir L +)
  27177. inner elaboration loop at bottom goal.
  27178. Retracting elaborate*copy-see-to-output-link
  27179. -->
  27180. (I3 ^see 0 +)
  27181. Retracting propose*predict-no
  27182. -->
  27183. (O2164 ^name predict-no +)
  27184. (S1 ^operator O2164 +)
  27185. Retracting propose*predict-yes
  27186. -->
  27187. (O2163 ^name predict-yes +)
  27188. (S1 ^operator O2163 +)
  27189. Retracting elaborate*reward*based*on*reward
  27190. -->
  27191. (R1085 ^value 1 +)
  27192. (R1 ^reward R1085 +)
  27193. Retracting elaborate*copy-dir-to-output-link
  27194. -->
  27195. (I3 ^dir U +)
  27196. Retracting rl*prefer*rvt*predict-no*H0*6
  27197. -->
  27198. (S1 ^operator O2164 = 0.9999999999999999)
  27199. Retracting rl*prefer*rvt*predict-yes*H0*5
  27200. -->
  27201. (S1 ^operator O2163 = 0.)
  27202. =>WM: (15183: S1 ^operator O2166 +)
  27203. =>WM: (15182: S1 ^operator O2165 +)
  27204. =>WM: (15181: I3 ^dir L)
  27205. =>WM: (15180: O2166 ^name predict-no)
  27206. =>WM: (15179: O2165 ^name predict-yes)
  27207. =>WM: (15178: R1086 ^value 1)
  27208. =>WM: (15177: R1 ^reward R1086)
  27209. <=WM: (15168: S1 ^operator O2163 +)
  27210. <=WM: (15169: S1 ^operator O2164 +)
  27211. <=WM: (15170: S1 ^operator O2164)
  27212. <=WM: (15167: I3 ^dir U)
  27213. <=WM: (15163: R1 ^reward R1085)
  27214. <=WM: (15166: O2164 ^name predict-no)
  27215. <=WM: (15165: O2163 ^name predict-yes)
  27216. <=WM: (15164: R1085 ^value 1)
  27217. --- Inner Elaboration Phase, active level 1 (S1) ---
  27218. Firing prefer*rvt*predict-yes*H0
  27219. -->
  27220. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  27221. -->
  27222. (S1 ^operator O2165 = 0.1599599085218832)
  27223. Firing rl*prefer*rvt*predict-yes*H0*1
  27224. -->
  27225. (S1 ^operator O2165 = 0.3895396347547306)
  27226. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  27227. -->
  27228. Firing prefer*rvt*predict-no*H0
  27229. -->
  27230. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  27231. -->
  27232. (S1 ^operator O2166 = 0.612665523112643)
  27233. Firing rl*prefer*rvt*predict-no*H0*2
  27234. -->
  27235. (S1 ^operator O2166 = 0.3873356717563805)
  27236. Firing prefer*rvt*predict-no*H0*2*v1*H1
  27237. -->
  27238. inner elaboration loop at bottom goal.
  27239. Retracting rl*prefer*rvt*predict-no*H0*2
  27240. -->
  27241. (S1 ^operator O2164 = 0.3873356717563805)
  27242. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  27243. -->
  27244. (S1 ^operator O2164 = 0.612665523112643)
  27245. Retracting rl*prefer*rvt*predict-yes*H0*1
  27246. -->
  27247. (S1 ^operator O2163 = 0.3895396347547306)
  27248. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  27249. -->
  27250. (S1 ^operator O2163 = 0.1599599085218832)
  27251. --- END Proposal Phase ---
  27252. --- Decision Phase ---
  27253. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27254. =>WM: (15184: S1 ^operator O2166)
  27255. 1083: O: O2166 (predict-no)
  27256. --- END Decision Phase ---
  27257. --- Application Phase ---
  27258. --- Firing Productions (PE) For State At Depth 1 ---
  27259. --- Inner Elaboration Phase, active level 1 (S1) ---
  27260. Firing apply*operator
  27261. -->
  27262. (I3 ^predict-no N1083 + :O )
  27263. Firing apply*operator*complete
  27264. -->
  27265. (I3 ^predict-no N1082 - :O )
  27266. inner elaboration loop at bottom goal.
  27267. --- Change Working Memory (PE) ---
  27268. =>WM: (15185: I3 ^predict-no N1083)
  27269. <=WM: (15172: N1082 ^status complete)
  27270. <=WM: (15171: I3 ^predict-no N1082)
  27271. --- Firing Productions (IE) For State At Depth 1 ---
  27272. --- Inner Elaboration Phase, active level 1 (S1) ---
  27273. Firing monitor*world
  27274. -->
  27275. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27276. --- Change Working Memory (IE) ---
  27277. --- END Application Phase ---
  27278. --- Output Phase ---
  27279. ENV: Agent did: predict-no for direction L in state State-A
  27280. In State-A moving L
  27281. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27282. predict error 0
  27283. dir: dir isR
  27284. --- END Output Phase ---
  27285. --- Input Phase ---
  27286. =>WM: (15189: I2 ^dir R)
  27287. =>WM: (15188: I2 ^reward 1)
  27288. =>WM: (15187: I2 ^see 0)
  27289. =>WM: (15186: N1083 ^status complete)
  27290. <=WM: (15175: I2 ^dir L)
  27291. <=WM: (15174: I2 ^reward 1)
  27292. <=WM: (15173: I2 ^see 0)
  27293. =>WM: (15190: I2 ^level-1 L0-root)
  27294. <=WM: (15176: I2 ^level-1 L0-root)
  27295. --- END Input Phase ---
  27296. --- Proposal Phase ---
  27297. --- Inner Elaboration Phase, active level 1 (S1) ---
  27298. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27299. -->
  27300. (S1 ^operator O2165 = 0.8155899301783588)
  27301. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27302. -->
  27303. (S1 ^operator O2166 = -0.00558448899823713)
  27304. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27305. -->
  27306. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27307. -->
  27308. Firing elaborate*copy-see-to-output-link
  27309. -->
  27310. (I3 ^see 0 +)
  27311. Firing elaborate*reward*based*on*reward
  27312. -->
  27313. (R1087 ^value 1 +)
  27314. (R1 ^reward R1087 +)
  27315. Firing propose*predict-yes
  27316. -->
  27317. (O2167 ^name predict-yes +)
  27318. (S1 ^operator O2167 +)
  27319. Firing propose*predict-no
  27320. -->
  27321. (O2168 ^name predict-no +)
  27322. (S1 ^operator O2168 +)
  27323. Firing rl*prefer*rvt*predict-no*H0*4
  27324. -->
  27325. (S1 ^operator O2166 = 0.4476193025811009)
  27326. Firing rl*prefer*rvt*predict-yes*H0*3
  27327. -->
  27328. (S1 ^operator O2165 = 0.1844115443735304)
  27329. Firing prefer*rvt*predict-yes*H0
  27330. -->
  27331. Firing prefer*rvt*predict-no*H0
  27332. -->
  27333. Firing elaborate*copy-dir-to-output-link
  27334. -->
  27335. (I3 ^dir R +)
  27336. inner elaboration loop at bottom goal.
  27337. Retracting elaborate*copy-see-to-output-link
  27338. -->
  27339. (I3 ^see 0 +)
  27340. Retracting propose*predict-no
  27341. -->
  27342. (O2166 ^name predict-no +)
  27343. (S1 ^operator O2166 +)
  27344. Retracting propose*predict-yes
  27345. -->
  27346. (O2165 ^name predict-yes +)
  27347. (S1 ^operator O2165 +)
  27348. Retracting elaborate*reward*based*on*reward
  27349. -->
  27350. (R1086 ^value 1 +)
  27351. (R1 ^reward R1086 +)
  27352. Retracting elaborate*copy-dir-to-output-link
  27353. -->
  27354. (I3 ^dir L +)
  27355. Retracting rl*prefer*rvt*predict-no*H0*2
  27356. -->
  27357. (S1 ^operator O2166 = 0.3873356717563805)
  27358. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*37
  27359. -->
  27360. (S1 ^operator O2166 = 0.612665523112643)
  27361. Retracting rl*prefer*rvt*predict-yes*H0*1
  27362. -->
  27363. (S1 ^operator O2165 = 0.3895396347547306)
  27364. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*38
  27365. -->
  27366. (S1 ^operator O2165 = 0.1599599085218832)
  27367. =>WM: (15197: S1 ^operator O2168 +)
  27368. =>WM: (15196: S1 ^operator O2167 +)
  27369. =>WM: (15195: I3 ^dir R)
  27370. =>WM: (15194: O2168 ^name predict-no)
  27371. =>WM: (15193: O2167 ^name predict-yes)
  27372. =>WM: (15192: R1087 ^value 1)
  27373. =>WM: (15191: R1 ^reward R1087)
  27374. <=WM: (15182: S1 ^operator O2165 +)
  27375. <=WM: (15183: S1 ^operator O2166 +)
  27376. <=WM: (15184: S1 ^operator O2166)
  27377. <=WM: (15181: I3 ^dir L)
  27378. <=WM: (15177: R1 ^reward R1086)
  27379. <=WM: (15180: O2166 ^name predict-no)
  27380. <=WM: (15179: O2165 ^name predict-yes)
  27381. <=WM: (15178: R1086 ^value 1)
  27382. --- Inner Elaboration Phase, active level 1 (S1) ---
  27383. Firing prefer*rvt*predict-yes*H0
  27384. -->
  27385. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27386. -->
  27387. (S1 ^operator O2167 = 0.8155899301783588)
  27388. Firing rl*prefer*rvt*predict-yes*H0*3
  27389. -->
  27390. (S1 ^operator O2167 = 0.1844115443735304)
  27391. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27392. -->
  27393. Firing prefer*rvt*predict-no*H0
  27394. -->
  27395. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27396. -->
  27397. (S1 ^operator O2168 = -0.00558448899823713)
  27398. Firing rl*prefer*rvt*predict-no*H0*4
  27399. -->
  27400. (S1 ^operator O2168 = 0.4476193025811009)
  27401. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27402. -->
  27403. inner elaboration loop at bottom goal.
  27404. Retracting rl*prefer*rvt*predict-no*H0*4
  27405. -->
  27406. (S1 ^operator O2166 = 0.4476193025811009)
  27407. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27408. -->
  27409. (S1 ^operator O2166 = -0.00558448899823713)
  27410. Retracting rl*prefer*rvt*predict-yes*H0*3
  27411. -->
  27412. (S1 ^operator O2165 = 0.1844115443735304)
  27413. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27414. -->
  27415. (S1 ^operator O2165 = 0.8155899301783588)
  27416. --- END Proposal Phase ---
  27417. --- Decision Phase ---
  27418. RL update rl*prefer*rvt*predict-no*H0*2 0.719079 -0.331743 0.387336 -> 0.719079 -0.331743 0.387335(R,m,v=1,0.9375,0.0589005)
  27419. RL update rl*prefer*rvt*predict-no*H0*2*v1*H1*37 0.280923 0.331743 0.612666 -> 0.280922 0.331743 0.612665(R,m,v=1,1,0)
  27420. =>WM: (15198: S1 ^operator O2167)
  27421. 1084: O: O2167 (predict-yes)
  27422. --- END Decision Phase ---
  27423. --- Application Phase ---
  27424. --- Firing Productions (PE) For State At Depth 1 ---
  27425. --- Inner Elaboration Phase, active level 1 (S1) ---
  27426. Firing apply*operator
  27427. -->
  27428. (I3 ^predict-yes N1084 + :O )
  27429. Firing apply*operator*complete
  27430. -->
  27431. (I3 ^predict-no N1083 - :O )
  27432. inner elaboration loop at bottom goal.
  27433. --- Change Working Memory (PE) ---
  27434. =>WM: (15199: I3 ^predict-yes N1084)
  27435. <=WM: (15186: N1083 ^status complete)
  27436. <=WM: (15185: I3 ^predict-no N1083)
  27437. --- Firing Productions (IE) For State At Depth 1 ---
  27438. --- Inner Elaboration Phase, active level 1 (S1) ---
  27439. Firing monitor*world
  27440. -->
  27441. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  27442. --- Change Working Memory (IE) ---
  27443. --- END Application Phase ---
  27444. --- Output Phase ---
  27445. ENV: Agent did: predict-yes for direction R in state State-A
  27446. In State-A moving R
  27447. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  27448. predict error 0
  27449. dir: dir isU
  27450. --- END Output Phase ---
  27451. --- Input Phase ---
  27452. =>WM: (15203: I2 ^dir U)
  27453. =>WM: (15202: I2 ^reward 1)
  27454. =>WM: (15201: I2 ^see 1)
  27455. =>WM: (15200: N1084 ^status complete)
  27456. <=WM: (15189: I2 ^dir R)
  27457. <=WM: (15188: I2 ^reward 1)
  27458. <=WM: (15187: I2 ^see 0)
  27459. =>WM: (15204: I2 ^level-1 R1-root)
  27460. <=WM: (15190: I2 ^level-1 L0-root)
  27461. --- END Input Phase ---
  27462. --- Proposal Phase ---
  27463. --- Inner Elaboration Phase, active level 1 (S1) ---
  27464. Firing elaborate*copy-see-to-output-link
  27465. -->
  27466. (I3 ^see 1 +)
  27467. Firing elaborate*reward*based*on*reward
  27468. -->
  27469. (R1088 ^value 1 +)
  27470. (R1 ^reward R1088 +)
  27471. Firing propose*predict-yes
  27472. -->
  27473. (O2169 ^name predict-yes +)
  27474. (S1 ^operator O2169 +)
  27475. Firing propose*predict-no
  27476. -->
  27477. (O2170 ^name predict-no +)
  27478. (S1 ^operator O2170 +)
  27479. Firing rl*prefer*rvt*predict-no*H0*6
  27480. -->
  27481. (S1 ^operator O2168 = 0.9999999999999999)
  27482. Firing rl*prefer*rvt*predict-yes*H0*5
  27483. -->
  27484. (S1 ^operator O2167 = 0.)
  27485. Firing prefer*rvt*predict-yes*H0
  27486. -->
  27487. Firing prefer*rvt*predict-no*H0
  27488. -->
  27489. Firing elaborate*copy-dir-to-output-link
  27490. -->
  27491. (I3 ^dir U +)
  27492. inner elaboration loop at bottom goal.
  27493. Retracting elaborate*copy-see-to-output-link
  27494. -->
  27495. (I3 ^see 0 +)
  27496. Retracting propose*predict-no
  27497. -->
  27498. (O2168 ^name predict-no +)
  27499. (S1 ^operator O2168 +)
  27500. Retracting propose*predict-yes
  27501. -->
  27502. (O2167 ^name predict-yes +)
  27503. (S1 ^operator O2167 +)
  27504. Retracting elaborate*reward*based*on*reward
  27505. -->
  27506. (R1087 ^value 1 +)
  27507. (R1 ^reward R1087 +)
  27508. Retracting elaborate*copy-dir-to-output-link
  27509. -->
  27510. (I3 ^dir R +)
  27511. Retracting rl*prefer*rvt*predict-no*H0*4
  27512. -->
  27513. (S1 ^operator O2168 = 0.4476193025811009)
  27514. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27515. -->
  27516. (S1 ^operator O2168 = -0.00558448899823713)
  27517. Retracting rl*prefer*rvt*predict-yes*H0*3
  27518. -->
  27519. (S1 ^operator O2167 = 0.1844115443735304)
  27520. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27521. -->
  27522. (S1 ^operator O2167 = 0.8155899301783588)
  27523. =>WM: (15212: S1 ^operator O2170 +)
  27524. =>WM: (15211: S1 ^operator O2169 +)
  27525. =>WM: (15210: I3 ^dir U)
  27526. =>WM: (15209: O2170 ^name predict-no)
  27527. =>WM: (15208: O2169 ^name predict-yes)
  27528. =>WM: (15207: R1088 ^value 1)
  27529. =>WM: (15206: R1 ^reward R1088)
  27530. =>WM: (15205: I3 ^see 1)
  27531. <=WM: (15196: S1 ^operator O2167 +)
  27532. <=WM: (15198: S1 ^operator O2167)
  27533. <=WM: (15197: S1 ^operator O2168 +)
  27534. <=WM: (15195: I3 ^dir R)
  27535. <=WM: (15191: R1 ^reward R1087)
  27536. <=WM: (15148: I3 ^see 0)
  27537. <=WM: (15194: O2168 ^name predict-no)
  27538. <=WM: (15193: O2167 ^name predict-yes)
  27539. <=WM: (15192: R1087 ^value 1)
  27540. --- Inner Elaboration Phase, active level 1 (S1) ---
  27541. Firing prefer*rvt*predict-yes*H0
  27542. -->
  27543. Firing rl*prefer*rvt*predict-yes*H0*5
  27544. -->
  27545. (S1 ^operator O2169 = 0.)
  27546. Firing prefer*rvt*predict-no*H0
  27547. -->
  27548. Firing rl*prefer*rvt*predict-no*H0*6
  27549. -->
  27550. (S1 ^operator O2170 = 0.9999999999999999)
  27551. inner elaboration loop at bottom goal.
  27552. Retracting rl*prefer*rvt*predict-no*H0*6
  27553. -->
  27554. (S1 ^operator O2168 = 0.9999999999999999)
  27555. Retracting rl*prefer*rvt*predict-yes*H0*5
  27556. -->
  27557. (S1 ^operator O2167 = 0.)
  27558. --- END Proposal Phase ---
  27559. --- Decision Phase ---
  27560. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184412 -> 0.675414 -0.491003 0.184411(R,m,v=1,0.907104,0.0847295)
  27561. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.324588 0.491002 0.81559 -> 0.324587 0.491002 0.81559(R,m,v=1,1,0)
  27562. =>WM: (15213: S1 ^operator O2170)
  27563. 1085: O: O2170 (predict-no)
  27564. --- END Decision Phase ---
  27565. --- Application Phase ---
  27566. --- Firing Productions (PE) For State At Depth 1 ---
  27567. --- Inner Elaboration Phase, active level 1 (S1) ---
  27568. Firing apply*operator
  27569. -->
  27570. (I3 ^predict-no N1085 + :O )
  27571. Firing apply*operator*complete
  27572. -->
  27573. (I3 ^predict-yes N1084 - :O )
  27574. inner elaboration loop at bottom goal.
  27575. --- Change Working Memory (PE) ---
  27576. =>WM: (15214: I3 ^predict-no N1085)
  27577. <=WM: (15200: N1084 ^status complete)
  27578. <=WM: (15199: I3 ^predict-yes N1084)
  27579. --- Firing Productions (IE) For State At Depth 1 ---
  27580. --- Inner Elaboration Phase, active level 1 (S1) ---
  27581. Firing monitor*world
  27582. -->
  27583. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27584. --- Change Working Memory (IE) ---
  27585. --- END Application Phase ---
  27586. --- Output Phase ---
  27587. ENV: Agent did: predict-no for direction U in state State-B
  27588. In State-B moving U
  27589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27590. predict error 0
  27591. dir: dir isR
  27592. --- END Output Phase ---
  27593. --- Input Phase ---
  27594. =>WM: (15218: I2 ^dir R)
  27595. =>WM: (15217: I2 ^reward 1)
  27596. =>WM: (15216: I2 ^see 0)
  27597. =>WM: (15215: N1085 ^status complete)
  27598. <=WM: (15203: I2 ^dir U)
  27599. <=WM: (15202: I2 ^reward 1)
  27600. <=WM: (15201: I2 ^see 1)
  27601. =>WM: (15219: I2 ^level-1 R1-root)
  27602. <=WM: (15204: I2 ^level-1 R1-root)
  27603. --- END Input Phase ---
  27604. --- Proposal Phase ---
  27605. --- Inner Elaboration Phase, active level 1 (S1) ---
  27606. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  27607. -->
  27608. (S1 ^operator O2169 = 0.1398795999120246)
  27609. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  27610. -->
  27611. (S1 ^operator O2170 = 0.5523810272090074)
  27612. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27613. -->
  27614. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27615. -->
  27616. Firing elaborate*copy-see-to-output-link
  27617. -->
  27618. (I3 ^see 0 +)
  27619. Firing elaborate*reward*based*on*reward
  27620. -->
  27621. (R1089 ^value 1 +)
  27622. (R1 ^reward R1089 +)
  27623. Firing propose*predict-yes
  27624. -->
  27625. (O2171 ^name predict-yes +)
  27626. (S1 ^operator O2171 +)
  27627. Firing propose*predict-no
  27628. -->
  27629. (O2172 ^name predict-no +)
  27630. (S1 ^operator O2172 +)
  27631. Firing rl*prefer*rvt*predict-no*H0*4
  27632. -->
  27633. (S1 ^operator O2170 = 0.4476193025811009)
  27634. Firing rl*prefer*rvt*predict-yes*H0*3
  27635. -->
  27636. (S1 ^operator O2169 = 0.1844113231907469)
  27637. Firing prefer*rvt*predict-yes*H0
  27638. -->
  27639. Firing prefer*rvt*predict-no*H0
  27640. -->
  27641. Firing elaborate*copy-dir-to-output-link
  27642. -->
  27643. (I3 ^dir R +)
  27644. inner elaboration loop at bottom goal.
  27645. Retracting elaborate*copy-see-to-output-link
  27646. -->
  27647. (I3 ^see 1 +)
  27648. Retracting propose*predict-no
  27649. -->
  27650. (O2170 ^name predict-no +)
  27651. (S1 ^operator O2170 +)
  27652. Retracting propose*predict-yes
  27653. -->
  27654. (O2169 ^name predict-yes +)
  27655. (S1 ^operator O2169 +)
  27656. Retracting elaborate*reward*based*on*reward
  27657. -->
  27658. (R1088 ^value 1 +)
  27659. (R1 ^reward R1088 +)
  27660. Retracting elaborate*copy-dir-to-output-link
  27661. -->
  27662. (I3 ^dir U +)
  27663. Retracting rl*prefer*rvt*predict-no*H0*6
  27664. -->
  27665. (S1 ^operator O2170 = 0.9999999999999999)
  27666. Retracting rl*prefer*rvt*predict-yes*H0*5
  27667. -->
  27668. (S1 ^operator O2169 = 0.)
  27669. =>WM: (15227: S1 ^operator O2172 +)
  27670. =>WM: (15226: S1 ^operator O2171 +)
  27671. =>WM: (15225: I3 ^dir R)
  27672. =>WM: (15224: O2172 ^name predict-no)
  27673. =>WM: (15223: O2171 ^name predict-yes)
  27674. =>WM: (15222: R1089 ^value 1)
  27675. =>WM: (15221: R1 ^reward R1089)
  27676. =>WM: (15220: I3 ^see 0)
  27677. <=WM: (15211: S1 ^operator O2169 +)
  27678. <=WM: (15212: S1 ^operator O2170 +)
  27679. <=WM: (15213: S1 ^operator O2170)
  27680. <=WM: (15210: I3 ^dir U)
  27681. <=WM: (15206: R1 ^reward R1088)
  27682. <=WM: (15205: I3 ^see 1)
  27683. <=WM: (15209: O2170 ^name predict-no)
  27684. <=WM: (15208: O2169 ^name predict-yes)
  27685. <=WM: (15207: R1088 ^value 1)
  27686. --- Inner Elaboration Phase, active level 1 (S1) ---
  27687. Firing prefer*rvt*predict-yes*H0
  27688. -->
  27689. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  27690. -->
  27691. (S1 ^operator O2171 = 0.1398795999120246)
  27692. Firing rl*prefer*rvt*predict-yes*H0*3
  27693. -->
  27694. (S1 ^operator O2171 = 0.1844113231907469)
  27695. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27696. -->
  27697. Firing prefer*rvt*predict-no*H0
  27698. -->
  27699. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  27700. -->
  27701. (S1 ^operator O2172 = 0.5523810272090074)
  27702. Firing rl*prefer*rvt*predict-no*H0*4
  27703. -->
  27704. (S1 ^operator O2172 = 0.4476193025811009)
  27705. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27706. -->
  27707. inner elaboration loop at bottom goal.
  27708. Retracting rl*prefer*rvt*predict-no*H0*4
  27709. -->
  27710. (S1 ^operator O2170 = 0.4476193025811009)
  27711. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  27712. -->
  27713. (S1 ^operator O2170 = 0.5523810272090074)
  27714. Retracting rl*prefer*rvt*predict-yes*H0*3
  27715. -->
  27716. (S1 ^operator O2169 = 0.1844113231907469)
  27717. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  27718. -->
  27719. (S1 ^operator O2169 = 0.1398795999120246)
  27720. --- END Proposal Phase ---
  27721. --- Decision Phase ---
  27722. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27723. =>WM: (15228: S1 ^operator O2172)
  27724. 1086: O: O2172 (predict-no)
  27725. --- END Decision Phase ---
  27726. --- Application Phase ---
  27727. --- Firing Productions (PE) For State At Depth 1 ---
  27728. --- Inner Elaboration Phase, active level 1 (S1) ---
  27729. Firing apply*operator
  27730. -->
  27731. (I3 ^predict-no N1086 + :O )
  27732. Firing apply*operator*complete
  27733. -->
  27734. (I3 ^predict-no N1085 - :O )
  27735. inner elaboration loop at bottom goal.
  27736. --- Change Working Memory (PE) ---
  27737. =>WM: (15229: I3 ^predict-no N1086)
  27738. <=WM: (15215: N1085 ^status complete)
  27739. <=WM: (15214: I3 ^predict-no N1085)
  27740. --- Firing Productions (IE) For State At Depth 1 ---
  27741. --- Inner Elaboration Phase, active level 1 (S1) ---
  27742. Firing monitor*world
  27743. -->
  27744. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27745. --- Change Working Memory (IE) ---
  27746. --- END Application Phase ---
  27747. --- Output Phase ---
  27748. ENV: Agent did: predict-no for direction R in state State-B
  27749. In State-B moving R
  27750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27751. predict error 0
  27752. dir: dir isR
  27753. --- END Output Phase ---
  27754. --- Input Phase ---
  27755. =>WM: (15233: I2 ^dir R)
  27756. =>WM: (15232: I2 ^reward 1)
  27757. =>WM: (15231: I2 ^see 0)
  27758. =>WM: (15230: N1086 ^status complete)
  27759. <=WM: (15218: I2 ^dir R)
  27760. <=WM: (15217: I2 ^reward 1)
  27761. <=WM: (15216: I2 ^see 0)
  27762. =>WM: (15234: I2 ^level-1 R0-root)
  27763. <=WM: (15219: I2 ^level-1 R1-root)
  27764. --- END Input Phase ---
  27765. --- Proposal Phase ---
  27766. --- Inner Elaboration Phase, active level 1 (S1) ---
  27767. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  27768. -->
  27769. (S1 ^operator O2171 = 0.1664311307472832)
  27770. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  27771. -->
  27772. (S1 ^operator O2172 = 0.552380105014882)
  27773. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27774. -->
  27775. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27776. -->
  27777. Firing elaborate*copy-see-to-output-link
  27778. -->
  27779. (I3 ^see 0 +)
  27780. Firing elaborate*reward*based*on*reward
  27781. -->
  27782. (R1090 ^value 1 +)
  27783. (R1 ^reward R1090 +)
  27784. Firing propose*predict-yes
  27785. -->
  27786. (O2173 ^name predict-yes +)
  27787. (S1 ^operator O2173 +)
  27788. Firing propose*predict-no
  27789. -->
  27790. (O2174 ^name predict-no +)
  27791. (S1 ^operator O2174 +)
  27792. Firing rl*prefer*rvt*predict-no*H0*4
  27793. -->
  27794. (S1 ^operator O2172 = 0.4476193025811009)
  27795. Firing rl*prefer*rvt*predict-yes*H0*3
  27796. -->
  27797. (S1 ^operator O2171 = 0.1844113231907469)
  27798. Firing prefer*rvt*predict-yes*H0
  27799. -->
  27800. Firing prefer*rvt*predict-no*H0
  27801. -->
  27802. Firing elaborate*copy-dir-to-output-link
  27803. -->
  27804. (I3 ^dir R +)
  27805. inner elaboration loop at bottom goal.
  27806. Retracting elaborate*copy-see-to-output-link
  27807. -->
  27808. (I3 ^see 0 +)
  27809. Retracting propose*predict-no
  27810. -->
  27811. (O2172 ^name predict-no +)
  27812. (S1 ^operator O2172 +)
  27813. Retracting propose*predict-yes
  27814. -->
  27815. (O2171 ^name predict-yes +)
  27816. (S1 ^operator O2171 +)
  27817. Retracting elaborate*reward*based*on*reward
  27818. -->
  27819. (R1089 ^value 1 +)
  27820. (R1 ^reward R1089 +)
  27821. Retracting elaborate*copy-dir-to-output-link
  27822. -->
  27823. (I3 ^dir R +)
  27824. Retracting rl*prefer*rvt*predict-no*H0*4
  27825. -->
  27826. (S1 ^operator O2172 = 0.4476193025811009)
  27827. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  27828. -->
  27829. (S1 ^operator O2172 = 0.5523810272090074)
  27830. Retracting rl*prefer*rvt*predict-yes*H0*3
  27831. -->
  27832. (S1 ^operator O2171 = 0.1844113231907469)
  27833. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  27834. -->
  27835. (S1 ^operator O2171 = 0.1398795999120246)
  27836. =>WM: (15240: S1 ^operator O2174 +)
  27837. =>WM: (15239: S1 ^operator O2173 +)
  27838. =>WM: (15238: O2174 ^name predict-no)
  27839. =>WM: (15237: O2173 ^name predict-yes)
  27840. =>WM: (15236: R1090 ^value 1)
  27841. =>WM: (15235: R1 ^reward R1090)
  27842. <=WM: (15226: S1 ^operator O2171 +)
  27843. <=WM: (15227: S1 ^operator O2172 +)
  27844. <=WM: (15228: S1 ^operator O2172)
  27845. <=WM: (15221: R1 ^reward R1089)
  27846. <=WM: (15224: O2172 ^name predict-no)
  27847. <=WM: (15223: O2171 ^name predict-yes)
  27848. <=WM: (15222: R1089 ^value 1)
  27849. --- Inner Elaboration Phase, active level 1 (S1) ---
  27850. Firing prefer*rvt*predict-yes*H0
  27851. -->
  27852. Firing rl*prefer*rvt*predict-yes*H0*3
  27853. -->
  27854. (S1 ^operator O2173 = 0.1844113231907469)
  27855. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27856. -->
  27857. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  27858. -->
  27859. (S1 ^operator O2173 = 0.1664311307472832)
  27860. Firing prefer*rvt*predict-no*H0
  27861. -->
  27862. Firing rl*prefer*rvt*predict-no*H0*4
  27863. -->
  27864. (S1 ^operator O2174 = 0.4476193025811009)
  27865. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27866. -->
  27867. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  27868. -->
  27869. (S1 ^operator O2174 = 0.552380105014882)
  27870. inner elaboration loop at bottom goal.
  27871. Retracting rl*prefer*rvt*predict-no*H0*4
  27872. -->
  27873. (S1 ^operator O2172 = 0.4476193025811009)
  27874. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  27875. -->
  27876. (S1 ^operator O2172 = 0.552380105014882)
  27877. Retracting rl*prefer*rvt*predict-yes*H0*3
  27878. -->
  27879. (S1 ^operator O2171 = 0.1844113231907469)
  27880. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  27881. -->
  27882. (S1 ^operator O2171 = 0.1664311307472832)
  27883. --- END Proposal Phase ---
  27884. --- Decision Phase ---
  27885. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.93662,0.0597842)
  27886. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  27887. =>WM: (15241: S1 ^operator O2174)
  27888. 1087: O: O2174 (predict-no)
  27889. --- END Decision Phase ---
  27890. --- Application Phase ---
  27891. --- Firing Productions (PE) For State At Depth 1 ---
  27892. --- Inner Elaboration Phase, active level 1 (S1) ---
  27893. Firing apply*operator
  27894. -->
  27895. (I3 ^predict-no N1087 + :O )
  27896. Firing apply*operator*complete
  27897. -->
  27898. (I3 ^predict-no N1086 - :O )
  27899. inner elaboration loop at bottom goal.
  27900. --- Change Working Memory (PE) ---
  27901. =>WM: (15242: I3 ^predict-no N1087)
  27902. <=WM: (15230: N1086 ^status complete)
  27903. <=WM: (15229: I3 ^predict-no N1086)
  27904. --- Firing Productions (IE) For State At Depth 1 ---
  27905. --- Inner Elaboration Phase, active level 1 (S1) ---
  27906. Firing monitor*world
  27907. -->
  27908. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27909. --- Change Working Memory (IE) ---
  27910. --- END Application Phase ---
  27911. --- Output Phase ---
  27912. ENV: Agent did: predict-no for direction R in state State-B
  27913. In State-B moving R
  27914. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27915. predict error 0
  27916. dir: dir isU
  27917. --- END Output Phase ---
  27918. --- Input Phase ---
  27919. =>WM: (15246: I2 ^dir U)
  27920. =>WM: (15245: I2 ^reward 1)
  27921. =>WM: (15244: I2 ^see 0)
  27922. =>WM: (15243: N1087 ^status complete)
  27923. <=WM: (15233: I2 ^dir R)
  27924. <=WM: (15232: I2 ^reward 1)
  27925. <=WM: (15231: I2 ^see 0)
  27926. =>WM: (15247: I2 ^level-1 R0-root)
  27927. <=WM: (15234: I2 ^level-1 R0-root)
  27928. --- END Input Phase ---
  27929. --- Proposal Phase ---
  27930. --- Inner Elaboration Phase, active level 1 (S1) ---
  27931. Firing elaborate*copy-see-to-output-link
  27932. -->
  27933. (I3 ^see 0 +)
  27934. Firing elaborate*reward*based*on*reward
  27935. -->
  27936. (R1091 ^value 1 +)
  27937. (R1 ^reward R1091 +)
  27938. Firing propose*predict-yes
  27939. -->
  27940. (O2175 ^name predict-yes +)
  27941. (S1 ^operator O2175 +)
  27942. Firing propose*predict-no
  27943. -->
  27944. (O2176 ^name predict-no +)
  27945. (S1 ^operator O2176 +)
  27946. Firing rl*prefer*rvt*predict-no*H0*6
  27947. -->
  27948. (S1 ^operator O2174 = 0.9999999999999999)
  27949. Firing rl*prefer*rvt*predict-yes*H0*5
  27950. -->
  27951. (S1 ^operator O2173 = 0.)
  27952. Firing prefer*rvt*predict-yes*H0
  27953. -->
  27954. Firing prefer*rvt*predict-no*H0
  27955. -->
  27956. Firing elaborate*copy-dir-to-output-link
  27957. -->
  27958. (I3 ^dir U +)
  27959. inner elaboration loop at bottom goal.
  27960. Retracting elaborate*copy-see-to-output-link
  27961. -->
  27962. (I3 ^see 0 +)
  27963. Retracting propose*predict-no
  27964. -->
  27965. (O2174 ^name predict-no +)
  27966. (S1 ^operator O2174 +)
  27967. Retracting propose*predict-yes
  27968. -->
  27969. (O2173 ^name predict-yes +)
  27970. (S1 ^operator O2173 +)
  27971. Retracting elaborate*reward*based*on*reward
  27972. -->
  27973. (R1090 ^value 1 +)
  27974. (R1 ^reward R1090 +)
  27975. Retracting elaborate*copy-dir-to-output-link
  27976. -->
  27977. (I3 ^dir R +)
  27978. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  27979. -->
  27980. (S1 ^operator O2174 = 0.552380105014882)
  27981. Retracting rl*prefer*rvt*predict-no*H0*4
  27982. -->
  27983. (S1 ^operator O2174 = 0.4476192531125847)
  27984. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  27985. -->
  27986. (S1 ^operator O2173 = 0.1664311307472832)
  27987. Retracting rl*prefer*rvt*predict-yes*H0*3
  27988. -->
  27989. (S1 ^operator O2173 = 0.1844113231907469)
  27990. =>WM: (15254: S1 ^operator O2176 +)
  27991. =>WM: (15253: S1 ^operator O2175 +)
  27992. =>WM: (15252: I3 ^dir U)
  27993. =>WM: (15251: O2176 ^name predict-no)
  27994. =>WM: (15250: O2175 ^name predict-yes)
  27995. =>WM: (15249: R1091 ^value 1)
  27996. =>WM: (15248: R1 ^reward R1091)
  27997. <=WM: (15239: S1 ^operator O2173 +)
  27998. <=WM: (15240: S1 ^operator O2174 +)
  27999. <=WM: (15241: S1 ^operator O2174)
  28000. <=WM: (15225: I3 ^dir R)
  28001. <=WM: (15235: R1 ^reward R1090)
  28002. <=WM: (15238: O2174 ^name predict-no)
  28003. <=WM: (15237: O2173 ^name predict-yes)
  28004. <=WM: (15236: R1090 ^value 1)
  28005. --- Inner Elaboration Phase, active level 1 (S1) ---
  28006. Firing prefer*rvt*predict-yes*H0
  28007. -->
  28008. Firing rl*prefer*rvt*predict-yes*H0*5
  28009. -->
  28010. (S1 ^operator O2175 = 0.)
  28011. Firing prefer*rvt*predict-no*H0
  28012. -->
  28013. Firing rl*prefer*rvt*predict-no*H0*6
  28014. -->
  28015. (S1 ^operator O2176 = 0.9999999999999999)
  28016. inner elaboration loop at bottom goal.
  28017. Retracting rl*prefer*rvt*predict-no*H0*6
  28018. -->
  28019. (S1 ^operator O2174 = 0.9999999999999999)
  28020. Retracting rl*prefer*rvt*predict-yes*H0*5
  28021. -->
  28022. (S1 ^operator O2173 = 0.)
  28023. --- END Proposal Phase ---
  28024. --- Decision Phase ---
  28025. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.937063,0.0593913)
  28026. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  28027. =>WM: (15255: S1 ^operator O2176)
  28028. 1088: O: O2176 (predict-no)
  28029. --- END Decision Phase ---
  28030. --- Application Phase ---
  28031. --- Firing Productions (PE) For State At Depth 1 ---
  28032. --- Inner Elaboration Phase, active level 1 (S1) ---
  28033. Firing apply*operator
  28034. -->
  28035. (I3 ^predict-no N1088 + :O )
  28036. Firing apply*operator*complete
  28037. -->
  28038. (I3 ^predict-no N1087 - :O )
  28039. inner elaboration loop at bottom goal.
  28040. --- Change Working Memory (PE) ---
  28041. =>WM: (15256: I3 ^predict-no N1088)
  28042. <=WM: (15243: N1087 ^status complete)
  28043. <=WM: (15242: I3 ^predict-no N1087)
  28044. --- Firing Productions (IE) For State At Depth 1 ---
  28045. --- Inner Elaboration Phase, active level 1 (S1) ---
  28046. Firing monitor*world
  28047. -->
  28048. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28049. --- Change Working Memory (IE) ---
  28050. --- END Application Phase ---
  28051. --- Output Phase ---
  28052. ENV: Agent did: predict-no for direction U in state State-B
  28053. In State-B moving U
  28054. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28055. predict error 0
  28056. dir: dir isL
  28057. --- END Output Phase ---
  28058. --- Input Phase ---
  28059. =>WM: (15260: I2 ^dir L)
  28060. =>WM: (15259: I2 ^reward 1)
  28061. =>WM: (15258: I2 ^see 0)
  28062. =>WM: (15257: N1088 ^status complete)
  28063. <=WM: (15246: I2 ^dir U)
  28064. <=WM: (15245: I2 ^reward 1)
  28065. <=WM: (15244: I2 ^see 0)
  28066. =>WM: (15261: I2 ^level-1 R0-root)
  28067. <=WM: (15247: I2 ^level-1 R0-root)
  28068. --- END Input Phase ---
  28069. --- Proposal Phase ---
  28070. --- Inner Elaboration Phase, active level 1 (S1) ---
  28071. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  28072. -->
  28073. (S1 ^operator O2175 = 0.6104606429053037)
  28074. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  28075. -->
  28076. (S1 ^operator O2176 = 0.1063475139796038)
  28077. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28078. -->
  28079. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28080. -->
  28081. Firing elaborate*copy-see-to-output-link
  28082. -->
  28083. (I3 ^see 0 +)
  28084. Firing elaborate*reward*based*on*reward
  28085. -->
  28086. (R1092 ^value 1 +)
  28087. (R1 ^reward R1092 +)
  28088. Firing propose*predict-yes
  28089. -->
  28090. (O2177 ^name predict-yes +)
  28091. (S1 ^operator O2177 +)
  28092. Firing propose*predict-no
  28093. -->
  28094. (O2178 ^name predict-no +)
  28095. (S1 ^operator O2178 +)
  28096. Firing rl*prefer*rvt*predict-no*H0*2
  28097. -->
  28098. (S1 ^operator O2176 = 0.3873354925260269)
  28099. Firing rl*prefer*rvt*predict-yes*H0*1
  28100. -->
  28101. (S1 ^operator O2175 = 0.3895396347547306)
  28102. Firing prefer*rvt*predict-yes*H0
  28103. -->
  28104. Firing prefer*rvt*predict-no*H0
  28105. -->
  28106. Firing elaborate*copy-dir-to-output-link
  28107. -->
  28108. (I3 ^dir L +)
  28109. inner elaboration loop at bottom goal.
  28110. Retracting elaborate*copy-see-to-output-link
  28111. -->
  28112. (I3 ^see 0 +)
  28113. Retracting propose*predict-no
  28114. -->
  28115. (O2176 ^name predict-no +)
  28116. (S1 ^operator O2176 +)
  28117. Retracting propose*predict-yes
  28118. -->
  28119. (O2175 ^name predict-yes +)
  28120. (S1 ^operator O2175 +)
  28121. Retracting elaborate*reward*based*on*reward
  28122. -->
  28123. (R1091 ^value 1 +)
  28124. (R1 ^reward R1091 +)
  28125. Retracting elaborate*copy-dir-to-output-link
  28126. -->
  28127. (I3 ^dir U +)
  28128. Retracting rl*prefer*rvt*predict-no*H0*6
  28129. -->
  28130. (S1 ^operator O2176 = 0.9999999999999999)
  28131. Retracting rl*prefer*rvt*predict-yes*H0*5
  28132. -->
  28133. (S1 ^operator O2175 = 0.)
  28134. =>WM: (15268: S1 ^operator O2178 +)
  28135. =>WM: (15267: S1 ^operator O2177 +)
  28136. =>WM: (15266: I3 ^dir L)
  28137. =>WM: (15265: O2178 ^name predict-no)
  28138. =>WM: (15264: O2177 ^name predict-yes)
  28139. =>WM: (15263: R1092 ^value 1)
  28140. =>WM: (15262: R1 ^reward R1092)
  28141. <=WM: (15253: S1 ^operator O2175 +)
  28142. <=WM: (15254: S1 ^operator O2176 +)
  28143. <=WM: (15255: S1 ^operator O2176)
  28144. <=WM: (15252: I3 ^dir U)
  28145. <=WM: (15248: R1 ^reward R1091)
  28146. <=WM: (15251: O2176 ^name predict-no)
  28147. <=WM: (15250: O2175 ^name predict-yes)
  28148. <=WM: (15249: R1091 ^value 1)
  28149. --- Inner Elaboration Phase, active level 1 (S1) ---
  28150. Firing prefer*rvt*predict-yes*H0
  28151. -->
  28152. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  28153. -->
  28154. (S1 ^operator O2177 = 0.6104606429053037)
  28155. Firing rl*prefer*rvt*predict-yes*H0*1
  28156. -->
  28157. (S1 ^operator O2177 = 0.3895396347547306)
  28158. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28159. -->
  28160. Firing prefer*rvt*predict-no*H0
  28161. -->
  28162. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  28163. -->
  28164. (S1 ^operator O2178 = 0.1063475139796038)
  28165. Firing rl*prefer*rvt*predict-no*H0*2
  28166. -->
  28167. (S1 ^operator O2178 = 0.3873354925260269)
  28168. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28169. -->
  28170. inner elaboration loop at bottom goal.
  28171. Retracting rl*prefer*rvt*predict-no*H0*2
  28172. -->
  28173. (S1 ^operator O2176 = 0.3873354925260269)
  28174. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  28175. -->
  28176. (S1 ^operator O2176 = 0.1063475139796038)
  28177. Retracting rl*prefer*rvt*predict-yes*H0*1
  28178. -->
  28179. (S1 ^operator O2175 = 0.3895396347547306)
  28180. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  28181. -->
  28182. (S1 ^operator O2175 = 0.6104606429053037)
  28183. --- END Proposal Phase ---
  28184. --- Decision Phase ---
  28185. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28186. =>WM: (15269: S1 ^operator O2177)
  28187. 1089: O: O2177 (predict-yes)
  28188. --- END Decision Phase ---
  28189. --- Application Phase ---
  28190. --- Firing Productions (PE) For State At Depth 1 ---
  28191. --- Inner Elaboration Phase, active level 1 (S1) ---
  28192. Firing apply*operator
  28193. -->
  28194. (I3 ^predict-yes N1089 + :O )
  28195. Firing apply*operator*complete
  28196. -->
  28197. (I3 ^predict-no N1088 - :O )
  28198. inner elaboration loop at bottom goal.
  28199. --- Change Working Memory (PE) ---
  28200. =>WM: (15270: I3 ^predict-yes N1089)
  28201. <=WM: (15257: N1088 ^status complete)
  28202. <=WM: (15256: I3 ^predict-no N1088)
  28203. --- Firing Productions (IE) For State At Depth 1 ---
  28204. --- Inner Elaboration Phase, active level 1 (S1) ---
  28205. Firing monitor*world
  28206. -->
  28207. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28208. --- Change Working Memory (IE) ---
  28209. --- END Application Phase ---
  28210. --- Output Phase ---
  28211. ENV: Agent did: predict-yes for direction L in state State-B
  28212. In State-B moving L
  28213. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28214. predict error 0
  28215. dir: dir isU
  28216. --- END Output Phase ---
  28217. --- Input Phase ---
  28218. =>WM: (15274: I2 ^dir U)
  28219. =>WM: (15273: I2 ^reward 1)
  28220. =>WM: (15272: I2 ^see 1)
  28221. =>WM: (15271: N1089 ^status complete)
  28222. <=WM: (15260: I2 ^dir L)
  28223. <=WM: (15259: I2 ^reward 1)
  28224. <=WM: (15258: I2 ^see 0)
  28225. =>WM: (15275: I2 ^level-1 L1-root)
  28226. <=WM: (15261: I2 ^level-1 R0-root)
  28227. --- END Input Phase ---
  28228. --- Proposal Phase ---
  28229. --- Inner Elaboration Phase, active level 1 (S1) ---
  28230. Firing elaborate*copy-see-to-output-link
  28231. -->
  28232. (I3 ^see 1 +)
  28233. Firing elaborate*reward*based*on*reward
  28234. -->
  28235. (R1093 ^value 1 +)
  28236. (R1 ^reward R1093 +)
  28237. Firing propose*predict-yes
  28238. -->
  28239. (O2179 ^name predict-yes +)
  28240. (S1 ^operator O2179 +)
  28241. Firing propose*predict-no
  28242. -->
  28243. (O2180 ^name predict-no +)
  28244. (S1 ^operator O2180 +)
  28245. Firing rl*prefer*rvt*predict-no*H0*6
  28246. -->
  28247. (S1 ^operator O2178 = 0.9999999999999999)
  28248. Firing rl*prefer*rvt*predict-yes*H0*5
  28249. -->
  28250. (S1 ^operator O2177 = 0.)
  28251. Firing prefer*rvt*predict-yes*H0
  28252. -->
  28253. Firing prefer*rvt*predict-no*H0
  28254. -->
  28255. Firing elaborate*copy-dir-to-output-link
  28256. -->
  28257. (I3 ^dir U +)
  28258. inner elaboration loop at bottom goal.
  28259. Retracting elaborate*copy-see-to-output-link
  28260. -->
  28261. (I3 ^see 0 +)
  28262. Retracting propose*predict-no
  28263. -->
  28264. (O2178 ^name predict-no +)
  28265. (S1 ^operator O2178 +)
  28266. Retracting propose*predict-yes
  28267. -->
  28268. (O2177 ^name predict-yes +)
  28269. (S1 ^operator O2177 +)
  28270. Retracting elaborate*reward*based*on*reward
  28271. -->
  28272. (R1092 ^value 1 +)
  28273. (R1 ^reward R1092 +)
  28274. Retracting elaborate*copy-dir-to-output-link
  28275. -->
  28276. (I3 ^dir L +)
  28277. Retracting rl*prefer*rvt*predict-no*H0*2
  28278. -->
  28279. (S1 ^operator O2178 = 0.3873354925260269)
  28280. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  28281. -->
  28282. (S1 ^operator O2178 = 0.1063475139796038)
  28283. Retracting rl*prefer*rvt*predict-yes*H0*1
  28284. -->
  28285. (S1 ^operator O2177 = 0.3895396347547306)
  28286. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  28287. -->
  28288. (S1 ^operator O2177 = 0.6104606429053037)
  28289. =>WM: (15283: S1 ^operator O2180 +)
  28290. =>WM: (15282: S1 ^operator O2179 +)
  28291. =>WM: (15281: I3 ^dir U)
  28292. =>WM: (15280: O2180 ^name predict-no)
  28293. =>WM: (15279: O2179 ^name predict-yes)
  28294. =>WM: (15278: R1093 ^value 1)
  28295. =>WM: (15277: R1 ^reward R1093)
  28296. =>WM: (15276: I3 ^see 1)
  28297. <=WM: (15267: S1 ^operator O2177 +)
  28298. <=WM: (15269: S1 ^operator O2177)
  28299. <=WM: (15268: S1 ^operator O2178 +)
  28300. <=WM: (15266: I3 ^dir L)
  28301. <=WM: (15262: R1 ^reward R1092)
  28302. <=WM: (15220: I3 ^see 0)
  28303. <=WM: (15265: O2178 ^name predict-no)
  28304. <=WM: (15264: O2177 ^name predict-yes)
  28305. <=WM: (15263: R1092 ^value 1)
  28306. --- Inner Elaboration Phase, active level 1 (S1) ---
  28307. Firing prefer*rvt*predict-yes*H0
  28308. -->
  28309. Firing rl*prefer*rvt*predict-yes*H0*5
  28310. -->
  28311. (S1 ^operator O2179 = 0.)
  28312. Firing prefer*rvt*predict-no*H0
  28313. -->
  28314. Firing rl*prefer*rvt*predict-no*H0*6
  28315. -->
  28316. (S1 ^operator O2180 = 0.9999999999999999)
  28317. inner elaboration loop at bottom goal.
  28318. Retracting rl*prefer*rvt*predict-no*H0*6
  28319. -->
  28320. (S1 ^operator O2178 = 0.9999999999999999)
  28321. Retracting rl*prefer*rvt*predict-yes*H0*5
  28322. -->
  28323. (S1 ^operator O2177 = 0.)
  28324. --- END Proposal Phase ---
  28325. --- Decision Phase ---
  28326. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.900552,0.0900552)
  28327. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  28328. =>WM: (15284: S1 ^operator O2180)
  28329. 1090: O: O2180 (predict-no)
  28330. --- END Decision Phase ---
  28331. --- Application Phase ---
  28332. --- Firing Productions (PE) For State At Depth 1 ---
  28333. --- Inner Elaboration Phase, active level 1 (S1) ---
  28334. Firing apply*operator
  28335. -->
  28336. (I3 ^predict-no N1090 + :O )
  28337. Firing apply*operator*complete
  28338. -->
  28339. (I3 ^predict-yes N1089 - :O )
  28340. inner elaboration loop at bottom goal.
  28341. --- Change Working Memory (PE) ---
  28342. =>WM: (15285: I3 ^predict-no N1090)
  28343. <=WM: (15271: N1089 ^status complete)
  28344. <=WM: (15270: I3 ^predict-yes N1089)
  28345. --- Firing Productions (IE) For State At Depth 1 ---
  28346. --- Inner Elaboration Phase, active level 1 (S1) ---
  28347. Firing monitor*world
  28348. -->
  28349. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28350. --- Change Working Memory (IE) ---
  28351. --- END Application Phase ---
  28352. --- Output Phase ---
  28353. ENV: Agent did: predict-no for direction U in state State-A
  28354. In State-A moving U
  28355. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28356. predict error 0
  28357. dir: dir isR
  28358. --- END Output Phase ---
  28359. --- Input Phase ---
  28360. =>WM: (15289: I2 ^dir R)
  28361. =>WM: (15288: I2 ^reward 1)
  28362. =>WM: (15287: I2 ^see 0)
  28363. =>WM: (15286: N1090 ^status complete)
  28364. <=WM: (15274: I2 ^dir U)
  28365. <=WM: (15273: I2 ^reward 1)
  28366. <=WM: (15272: I2 ^see 1)
  28367. =>WM: (15290: I2 ^level-1 L1-root)
  28368. <=WM: (15275: I2 ^level-1 L1-root)
  28369. --- END Input Phase ---
  28370. --- Proposal Phase ---
  28371. --- Inner Elaboration Phase, active level 1 (S1) ---
  28372. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28373. -->
  28374. (S1 ^operator O2180 = -0.02155734064455064)
  28375. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28376. -->
  28377. (S1 ^operator O2179 = 0.8155855213644267)
  28378. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28379. -->
  28380. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28381. -->
  28382. Firing elaborate*copy-see-to-output-link
  28383. -->
  28384. (I3 ^see 0 +)
  28385. Firing elaborate*reward*based*on*reward
  28386. -->
  28387. (R1094 ^value 1 +)
  28388. (R1 ^reward R1094 +)
  28389. Firing propose*predict-yes
  28390. -->
  28391. (O2181 ^name predict-yes +)
  28392. (S1 ^operator O2181 +)
  28393. Firing propose*predict-no
  28394. -->
  28395. (O2182 ^name predict-no +)
  28396. (S1 ^operator O2182 +)
  28397. Firing rl*prefer*rvt*predict-no*H0*4
  28398. -->
  28399. (S1 ^operator O2180 = 0.4476193493934647)
  28400. Firing rl*prefer*rvt*predict-yes*H0*3
  28401. -->
  28402. (S1 ^operator O2179 = 0.1844113231907469)
  28403. Firing prefer*rvt*predict-yes*H0
  28404. -->
  28405. Firing prefer*rvt*predict-no*H0
  28406. -->
  28407. Firing elaborate*copy-dir-to-output-link
  28408. -->
  28409. (I3 ^dir R +)
  28410. inner elaboration loop at bottom goal.
  28411. Retracting elaborate*copy-see-to-output-link
  28412. -->
  28413. (I3 ^see 1 +)
  28414. Retracting propose*predict-no
  28415. -->
  28416. (O2180 ^name predict-no +)
  28417. (S1 ^operator O2180 +)
  28418. Retracting propose*predict-yes
  28419. -->
  28420. (O2179 ^name predict-yes +)
  28421. (S1 ^operator O2179 +)
  28422. Retracting elaborate*reward*based*on*reward
  28423. -->
  28424. (R1093 ^value 1 +)
  28425. (R1 ^reward R1093 +)
  28426. Retracting elaborate*copy-dir-to-output-link
  28427. -->
  28428. (I3 ^dir U +)
  28429. Retracting rl*prefer*rvt*predict-no*H0*6
  28430. -->
  28431. (S1 ^operator O2180 = 0.9999999999999999)
  28432. Retracting rl*prefer*rvt*predict-yes*H0*5
  28433. -->
  28434. (S1 ^operator O2179 = 0.)
  28435. =>WM: (15298: S1 ^operator O2182 +)
  28436. =>WM: (15297: S1 ^operator O2181 +)
  28437. =>WM: (15296: I3 ^dir R)
  28438. =>WM: (15295: O2182 ^name predict-no)
  28439. =>WM: (15294: O2181 ^name predict-yes)
  28440. =>WM: (15293: R1094 ^value 1)
  28441. =>WM: (15292: R1 ^reward R1094)
  28442. =>WM: (15291: I3 ^see 0)
  28443. <=WM: (15282: S1 ^operator O2179 +)
  28444. <=WM: (15283: S1 ^operator O2180 +)
  28445. <=WM: (15284: S1 ^operator O2180)
  28446. <=WM: (15281: I3 ^dir U)
  28447. <=WM: (15277: R1 ^reward R1093)
  28448. <=WM: (15276: I3 ^see 1)
  28449. <=WM: (15280: O2180 ^name predict-no)
  28450. <=WM: (15279: O2179 ^name predict-yes)
  28451. <=WM: (15278: R1093 ^value 1)
  28452. --- Inner Elaboration Phase, active level 1 (S1) ---
  28453. Firing prefer*rvt*predict-yes*H0
  28454. -->
  28455. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28456. -->
  28457. (S1 ^operator O2181 = 0.8155855213644267)
  28458. Firing rl*prefer*rvt*predict-yes*H0*3
  28459. -->
  28460. (S1 ^operator O2181 = 0.1844113231907469)
  28461. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28462. -->
  28463. Firing prefer*rvt*predict-no*H0
  28464. -->
  28465. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28466. -->
  28467. (S1 ^operator O2182 = -0.02155734064455064)
  28468. Firing rl*prefer*rvt*predict-no*H0*4
  28469. -->
  28470. (S1 ^operator O2182 = 0.4476193493934647)
  28471. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28472. -->
  28473. inner elaboration loop at bottom goal.
  28474. Retracting rl*prefer*rvt*predict-no*H0*4
  28475. -->
  28476. (S1 ^operator O2180 = 0.4476193493934647)
  28477. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28478. -->
  28479. (S1 ^operator O2180 = -0.02155734064455064)
  28480. Retracting rl*prefer*rvt*predict-yes*H0*3
  28481. -->
  28482. (S1 ^operator O2179 = 0.1844113231907469)
  28483. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28484. -->
  28485. (S1 ^operator O2179 = 0.8155855213644267)
  28486. --- END Proposal Phase ---
  28487. --- Decision Phase ---
  28488. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28489. =>WM: (15299: S1 ^operator O2181)
  28490. 1091: O: O2181 (predict-yes)
  28491. --- END Decision Phase ---
  28492. --- Application Phase ---
  28493. --- Firing Productions (PE) For State At Depth 1 ---
  28494. --- Inner Elaboration Phase, active level 1 (S1) ---
  28495. Firing apply*operator
  28496. -->
  28497. (I3 ^predict-yes N1091 + :O )
  28498. Firing apply*operator*complete
  28499. -->
  28500. (I3 ^predict-no N1090 - :O )
  28501. inner elaboration loop at bottom goal.
  28502. --- Change Working Memory (PE) ---
  28503. =>WM: (15300: I3 ^predict-yes N1091)
  28504. <=WM: (15286: N1090 ^status complete)
  28505. <=WM: (15285: I3 ^predict-no N1090)
  28506. --- Firing Productions (IE) For State At Depth 1 ---
  28507. --- Inner Elaboration Phase, active level 1 (S1) ---
  28508. Firing monitor*world
  28509. -->
  28510. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28511. --- Change Working Memory (IE) ---
  28512. --- END Application Phase ---
  28513. --- Output Phase ---
  28514. ENV: Agent did: predict-yes for direction R in state State-A
  28515. In State-A moving R
  28516. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28517. predict error 0
  28518. dir: dir isL
  28519. --- END Output Phase ---
  28520. --- Input Phase ---
  28521. =>WM: (15304: I2 ^dir L)
  28522. =>WM: (15303: I2 ^reward 1)
  28523. =>WM: (15302: I2 ^see 1)
  28524. =>WM: (15301: N1091 ^status complete)
  28525. <=WM: (15289: I2 ^dir R)
  28526. <=WM: (15288: I2 ^reward 1)
  28527. <=WM: (15287: I2 ^see 0)
  28528. =>WM: (15305: I2 ^level-1 R1-root)
  28529. <=WM: (15290: I2 ^level-1 L1-root)
  28530. --- END Input Phase ---
  28531. --- Proposal Phase ---
  28532. --- Inner Elaboration Phase, active level 1 (S1) ---
  28533. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28534. -->
  28535. (S1 ^operator O2181 = 0.6104601064087616)
  28536. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28537. -->
  28538. (S1 ^operator O2182 = 0.2714993082286609)
  28539. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28540. -->
  28541. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28542. -->
  28543. Firing elaborate*copy-see-to-output-link
  28544. -->
  28545. (I3 ^see 1 +)
  28546. Firing elaborate*reward*based*on*reward
  28547. -->
  28548. (R1095 ^value 1 +)
  28549. (R1 ^reward R1095 +)
  28550. Firing propose*predict-yes
  28551. -->
  28552. (O2183 ^name predict-yes +)
  28553. (S1 ^operator O2183 +)
  28554. Firing propose*predict-no
  28555. -->
  28556. (O2184 ^name predict-no +)
  28557. (S1 ^operator O2184 +)
  28558. Firing rl*prefer*rvt*predict-no*H0*2
  28559. -->
  28560. (S1 ^operator O2182 = 0.3873354925260269)
  28561. Firing rl*prefer*rvt*predict-yes*H0*1
  28562. -->
  28563. (S1 ^operator O2181 = 0.3895395931057254)
  28564. Firing prefer*rvt*predict-yes*H0
  28565. -->
  28566. Firing prefer*rvt*predict-no*H0
  28567. -->
  28568. Firing elaborate*copy-dir-to-output-link
  28569. -->
  28570. (I3 ^dir L +)
  28571. inner elaboration loop at bottom goal.
  28572. Retracting elaborate*copy-see-to-output-link
  28573. -->
  28574. (I3 ^see 0 +)
  28575. Retracting propose*predict-no
  28576. -->
  28577. (O2182 ^name predict-no +)
  28578. (S1 ^operator O2182 +)
  28579. Retracting propose*predict-yes
  28580. -->
  28581. (O2181 ^name predict-yes +)
  28582. (S1 ^operator O2181 +)
  28583. Retracting elaborate*reward*based*on*reward
  28584. -->
  28585. (R1094 ^value 1 +)
  28586. (R1 ^reward R1094 +)
  28587. Retracting elaborate*copy-dir-to-output-link
  28588. -->
  28589. (I3 ^dir R +)
  28590. Retracting rl*prefer*rvt*predict-no*H0*4
  28591. -->
  28592. (S1 ^operator O2182 = 0.4476193493934647)
  28593. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28594. -->
  28595. (S1 ^operator O2182 = -0.02155734064455064)
  28596. Retracting rl*prefer*rvt*predict-yes*H0*3
  28597. -->
  28598. (S1 ^operator O2181 = 0.1844113231907469)
  28599. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28600. -->
  28601. (S1 ^operator O2181 = 0.8155855213644267)
  28602. =>WM: (15313: S1 ^operator O2184 +)
  28603. =>WM: (15312: S1 ^operator O2183 +)
  28604. =>WM: (15311: I3 ^dir L)
  28605. =>WM: (15310: O2184 ^name predict-no)
  28606. =>WM: (15309: O2183 ^name predict-yes)
  28607. =>WM: (15308: R1095 ^value 1)
  28608. =>WM: (15307: R1 ^reward R1095)
  28609. =>WM: (15306: I3 ^see 1)
  28610. <=WM: (15297: S1 ^operator O2181 +)
  28611. <=WM: (15299: S1 ^operator O2181)
  28612. <=WM: (15298: S1 ^operator O2182 +)
  28613. <=WM: (15296: I3 ^dir R)
  28614. <=WM: (15292: R1 ^reward R1094)
  28615. <=WM: (15291: I3 ^see 0)
  28616. <=WM: (15295: O2182 ^name predict-no)
  28617. <=WM: (15294: O2181 ^name predict-yes)
  28618. <=WM: (15293: R1094 ^value 1)
  28619. --- Inner Elaboration Phase, active level 1 (S1) ---
  28620. Firing prefer*rvt*predict-yes*H0
  28621. -->
  28622. Firing rl*prefer*rvt*predict-yes*H0*1
  28623. -->
  28624. (S1 ^operator O2183 = 0.3895395931057254)
  28625. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28626. -->
  28627. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28628. -->
  28629. (S1 ^operator O2183 = 0.6104601064087616)
  28630. Firing prefer*rvt*predict-no*H0
  28631. -->
  28632. Firing rl*prefer*rvt*predict-no*H0*2
  28633. -->
  28634. (S1 ^operator O2184 = 0.3873354925260269)
  28635. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28636. -->
  28637. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28638. -->
  28639. (S1 ^operator O2184 = 0.2714993082286609)
  28640. inner elaboration loop at bottom goal.
  28641. Retracting rl*prefer*rvt*predict-no*H0*2
  28642. -->
  28643. (S1 ^operator O2182 = 0.3873354925260269)
  28644. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28645. -->
  28646. (S1 ^operator O2182 = 0.2714993082286609)
  28647. Retracting rl*prefer*rvt*predict-yes*H0*1
  28648. -->
  28649. (S1 ^operator O2181 = 0.3895395931057254)
  28650. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28651. -->
  28652. (S1 ^operator O2181 = 0.6104601064087616)
  28653. --- END Proposal Phase ---
  28654. --- Decision Phase ---
  28655. RL update rl*prefer*rvt*predict-yes*H0*3 0.675414 -0.491003 0.184411 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.907609,0.0843134)
  28656. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324582 0.491004 0.815586 -> 0.324583 0.491003 0.815586(R,m,v=1,1,0)
  28657. =>WM: (15314: S1 ^operator O2183)
  28658. 1092: O: O2183 (predict-yes)
  28659. --- END Decision Phase ---
  28660. --- Application Phase ---
  28661. --- Firing Productions (PE) For State At Depth 1 ---
  28662. --- Inner Elaboration Phase, active level 1 (S1) ---
  28663. Firing apply*operator
  28664. -->
  28665. (I3 ^predict-yes N1092 + :O )
  28666. Firing apply*operator*complete
  28667. -->
  28668. (I3 ^predict-yes N1091 - :O )
  28669. inner elaboration loop at bottom goal.
  28670. --- Change Working Memory (PE) ---
  28671. =>WM: (15315: I3 ^predict-yes N1092)
  28672. <=WM: (15301: N1091 ^status complete)
  28673. <=WM: (15300: I3 ^predict-yes N1091)
  28674. --- Firing Productions (IE) For State At Depth 1 ---
  28675. --- Inner Elaboration Phase, active level 1 (S1) ---
  28676. Firing monitor*world
  28677. -->
  28678. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28679. --- Change Working Memory (IE) ---
  28680. --- END Application Phase ---
  28681. --- Output Phase ---
  28682. ENV: Agent did: predict-yes for direction L in state State-B
  28683. In State-B moving L
  28684. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28685. predict error 0
  28686. dir: dir isR
  28687. --- END Output Phase ---
  28688. --- Input Phase ---
  28689. =>WM: (15319: I2 ^dir R)
  28690. =>WM: (15318: I2 ^reward 1)
  28691. =>WM: (15317: I2 ^see 1)
  28692. =>WM: (15316: N1092 ^status complete)
  28693. <=WM: (15304: I2 ^dir L)
  28694. <=WM: (15303: I2 ^reward 1)
  28695. <=WM: (15302: I2 ^see 1)
  28696. =>WM: (15320: I2 ^level-1 L1-root)
  28697. <=WM: (15305: I2 ^level-1 R1-root)
  28698. --- END Input Phase ---
  28699. --- Proposal Phase ---
  28700. --- Inner Elaboration Phase, active level 1 (S1) ---
  28701. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28702. -->
  28703. (S1 ^operator O2184 = -0.02155734064455064)
  28704. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28705. -->
  28706. (S1 ^operator O2183 = 0.8155859946811508)
  28707. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28708. -->
  28709. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28710. -->
  28711. Firing elaborate*copy-see-to-output-link
  28712. -->
  28713. (I3 ^see 1 +)
  28714. Firing elaborate*reward*based*on*reward
  28715. -->
  28716. (R1096 ^value 1 +)
  28717. (R1 ^reward R1096 +)
  28718. Firing propose*predict-yes
  28719. -->
  28720. (O2185 ^name predict-yes +)
  28721. (S1 ^operator O2185 +)
  28722. Firing propose*predict-no
  28723. -->
  28724. (O2186 ^name predict-no +)
  28725. (S1 ^operator O2186 +)
  28726. Firing rl*prefer*rvt*predict-no*H0*4
  28727. -->
  28728. (S1 ^operator O2184 = 0.4476193493934647)
  28729. Firing rl*prefer*rvt*predict-yes*H0*3
  28730. -->
  28731. (S1 ^operator O2183 = 0.1844117965074709)
  28732. Firing prefer*rvt*predict-yes*H0
  28733. -->
  28734. Firing prefer*rvt*predict-no*H0
  28735. -->
  28736. Firing elaborate*copy-dir-to-output-link
  28737. -->
  28738. (I3 ^dir R +)
  28739. inner elaboration loop at bottom goal.
  28740. Retracting elaborate*copy-see-to-output-link
  28741. -->
  28742. (I3 ^see 1 +)
  28743. Retracting propose*predict-no
  28744. -->
  28745. (O2184 ^name predict-no +)
  28746. (S1 ^operator O2184 +)
  28747. Retracting propose*predict-yes
  28748. -->
  28749. (O2183 ^name predict-yes +)
  28750. (S1 ^operator O2183 +)
  28751. Retracting elaborate*reward*based*on*reward
  28752. -->
  28753. (R1095 ^value 1 +)
  28754. (R1 ^reward R1095 +)
  28755. Retracting elaborate*copy-dir-to-output-link
  28756. -->
  28757. (I3 ^dir L +)
  28758. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28759. -->
  28760. (S1 ^operator O2184 = 0.2714993082286609)
  28761. Retracting rl*prefer*rvt*predict-no*H0*2
  28762. -->
  28763. (S1 ^operator O2184 = 0.3873354925260269)
  28764. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28765. -->
  28766. (S1 ^operator O2183 = 0.6104601064087616)
  28767. Retracting rl*prefer*rvt*predict-yes*H0*1
  28768. -->
  28769. (S1 ^operator O2183 = 0.3895395931057254)
  28770. =>WM: (15327: S1 ^operator O2186 +)
  28771. =>WM: (15326: S1 ^operator O2185 +)
  28772. =>WM: (15325: I3 ^dir R)
  28773. =>WM: (15324: O2186 ^name predict-no)
  28774. =>WM: (15323: O2185 ^name predict-yes)
  28775. =>WM: (15322: R1096 ^value 1)
  28776. =>WM: (15321: R1 ^reward R1096)
  28777. <=WM: (15312: S1 ^operator O2183 +)
  28778. <=WM: (15314: S1 ^operator O2183)
  28779. <=WM: (15313: S1 ^operator O2184 +)
  28780. <=WM: (15311: I3 ^dir L)
  28781. <=WM: (15307: R1 ^reward R1095)
  28782. <=WM: (15310: O2184 ^name predict-no)
  28783. <=WM: (15309: O2183 ^name predict-yes)
  28784. <=WM: (15308: R1095 ^value 1)
  28785. --- Inner Elaboration Phase, active level 1 (S1) ---
  28786. Firing prefer*rvt*predict-yes*H0
  28787. -->
  28788. Firing rl*prefer*rvt*predict-yes*H0*3
  28789. -->
  28790. (S1 ^operator O2185 = 0.1844117965074709)
  28791. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28792. -->
  28793. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28794. -->
  28795. (S1 ^operator O2185 = 0.8155859946811508)
  28796. Firing prefer*rvt*predict-no*H0
  28797. -->
  28798. Firing rl*prefer*rvt*predict-no*H0*4
  28799. -->
  28800. (S1 ^operator O2186 = 0.4476193493934647)
  28801. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28802. -->
  28803. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28804. -->
  28805. (S1 ^operator O2186 = -0.02155734064455064)
  28806. inner elaboration loop at bottom goal.
  28807. Retracting rl*prefer*rvt*predict-no*H0*4
  28808. -->
  28809. (S1 ^operator O2184 = 0.4476193493934647)
  28810. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28811. -->
  28812. (S1 ^operator O2184 = -0.02155734064455064)
  28813. Retracting rl*prefer*rvt*predict-yes*H0*3
  28814. -->
  28815. (S1 ^operator O2183 = 0.1844117965074709)
  28816. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28817. -->
  28818. (S1 ^operator O2183 = 0.8155859946811508)
  28819. --- END Proposal Phase ---
  28820. --- Decision Phase ---
  28821. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.901099,0.089612)
  28822. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  28823. =>WM: (15328: S1 ^operator O2185)
  28824. 1093: O: O2185 (predict-yes)
  28825. --- END Decision Phase ---
  28826. --- Application Phase ---
  28827. --- Firing Productions (PE) For State At Depth 1 ---
  28828. --- Inner Elaboration Phase, active level 1 (S1) ---
  28829. Firing apply*operator
  28830. -->
  28831. (I3 ^predict-yes N1093 + :O )
  28832. Firing apply*operator*complete
  28833. -->
  28834. (I3 ^predict-yes N1092 - :O )
  28835. inner elaboration loop at bottom goal.
  28836. --- Change Working Memory (PE) ---
  28837. =>WM: (15329: I3 ^predict-yes N1093)
  28838. <=WM: (15316: N1092 ^status complete)
  28839. <=WM: (15315: I3 ^predict-yes N1092)
  28840. --- Firing Productions (IE) For State At Depth 1 ---
  28841. --- Inner Elaboration Phase, active level 1 (S1) ---
  28842. Firing monitor*world
  28843. -->
  28844. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28845. --- Change Working Memory (IE) ---
  28846. --- END Application Phase ---
  28847. --- Output Phase ---
  28848. ENV: Agent did: predict-yes for direction R in state State-A
  28849. In State-A moving R
  28850. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28851. predict error 0
  28852. dir: dir isL
  28853. --- END Output Phase ---
  28854. --- Input Phase ---
  28855. =>WM: (15333: I2 ^dir L)
  28856. =>WM: (15332: I2 ^reward 1)
  28857. =>WM: (15331: I2 ^see 1)
  28858. =>WM: (15330: N1093 ^status complete)
  28859. <=WM: (15319: I2 ^dir R)
  28860. <=WM: (15318: I2 ^reward 1)
  28861. <=WM: (15317: I2 ^see 1)
  28862. =>WM: (15334: I2 ^level-1 R1-root)
  28863. <=WM: (15320: I2 ^level-1 L1-root)
  28864. --- END Input Phase ---
  28865. --- Proposal Phase ---
  28866. --- Inner Elaboration Phase, active level 1 (S1) ---
  28867. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28868. -->
  28869. (S1 ^operator O2185 = 0.6104601514815886)
  28870. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28871. -->
  28872. (S1 ^operator O2186 = 0.2714993082286609)
  28873. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28874. -->
  28875. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28876. -->
  28877. Firing elaborate*copy-see-to-output-link
  28878. -->
  28879. (I3 ^see 1 +)
  28880. Firing elaborate*reward*based*on*reward
  28881. -->
  28882. (R1097 ^value 1 +)
  28883. (R1 ^reward R1097 +)
  28884. Firing propose*predict-yes
  28885. -->
  28886. (O2187 ^name predict-yes +)
  28887. (S1 ^operator O2187 +)
  28888. Firing propose*predict-no
  28889. -->
  28890. (O2188 ^name predict-no +)
  28891. (S1 ^operator O2188 +)
  28892. Firing rl*prefer*rvt*predict-no*H0*2
  28893. -->
  28894. (S1 ^operator O2186 = 0.3873354925260269)
  28895. Firing rl*prefer*rvt*predict-yes*H0*1
  28896. -->
  28897. (S1 ^operator O2185 = 0.3895396381785524)
  28898. Firing prefer*rvt*predict-yes*H0
  28899. -->
  28900. Firing prefer*rvt*predict-no*H0
  28901. -->
  28902. Firing elaborate*copy-dir-to-output-link
  28903. -->
  28904. (I3 ^dir L +)
  28905. inner elaboration loop at bottom goal.
  28906. Retracting elaborate*copy-see-to-output-link
  28907. -->
  28908. (I3 ^see 1 +)
  28909. Retracting propose*predict-no
  28910. -->
  28911. (O2186 ^name predict-no +)
  28912. (S1 ^operator O2186 +)
  28913. Retracting propose*predict-yes
  28914. -->
  28915. (O2185 ^name predict-yes +)
  28916. (S1 ^operator O2185 +)
  28917. Retracting elaborate*reward*based*on*reward
  28918. -->
  28919. (R1096 ^value 1 +)
  28920. (R1 ^reward R1096 +)
  28921. Retracting elaborate*copy-dir-to-output-link
  28922. -->
  28923. (I3 ^dir R +)
  28924. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  28925. -->
  28926. (S1 ^operator O2186 = -0.02155734064455064)
  28927. Retracting rl*prefer*rvt*predict-no*H0*4
  28928. -->
  28929. (S1 ^operator O2186 = 0.4476193493934647)
  28930. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  28931. -->
  28932. (S1 ^operator O2185 = 0.8155859946811508)
  28933. Retracting rl*prefer*rvt*predict-yes*H0*3
  28934. -->
  28935. (S1 ^operator O2185 = 0.1844117965074709)
  28936. =>WM: (15341: S1 ^operator O2188 +)
  28937. =>WM: (15340: S1 ^operator O2187 +)
  28938. =>WM: (15339: I3 ^dir L)
  28939. =>WM: (15338: O2188 ^name predict-no)
  28940. =>WM: (15337: O2187 ^name predict-yes)
  28941. =>WM: (15336: R1097 ^value 1)
  28942. =>WM: (15335: R1 ^reward R1097)
  28943. <=WM: (15326: S1 ^operator O2185 +)
  28944. <=WM: (15328: S1 ^operator O2185)
  28945. <=WM: (15327: S1 ^operator O2186 +)
  28946. <=WM: (15325: I3 ^dir R)
  28947. <=WM: (15321: R1 ^reward R1096)
  28948. <=WM: (15324: O2186 ^name predict-no)
  28949. <=WM: (15323: O2185 ^name predict-yes)
  28950. <=WM: (15322: R1096 ^value 1)
  28951. --- Inner Elaboration Phase, active level 1 (S1) ---
  28952. Firing prefer*rvt*predict-yes*H0
  28953. -->
  28954. Firing rl*prefer*rvt*predict-yes*H0*1
  28955. -->
  28956. (S1 ^operator O2187 = 0.3895396381785524)
  28957. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  28958. -->
  28959. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28960. -->
  28961. (S1 ^operator O2187 = 0.6104601514815886)
  28962. Firing prefer*rvt*predict-no*H0
  28963. -->
  28964. Firing rl*prefer*rvt*predict-no*H0*2
  28965. -->
  28966. (S1 ^operator O2188 = 0.3873354925260269)
  28967. Firing prefer*rvt*predict-no*H0*2*v1*H1
  28968. -->
  28969. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28970. -->
  28971. (S1 ^operator O2188 = 0.2714993082286609)
  28972. inner elaboration loop at bottom goal.
  28973. Retracting rl*prefer*rvt*predict-no*H0*2
  28974. -->
  28975. (S1 ^operator O2186 = 0.3873354925260269)
  28976. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  28977. -->
  28978. (S1 ^operator O2186 = 0.2714993082286609)
  28979. Retracting rl*prefer*rvt*predict-yes*H0*1
  28980. -->
  28981. (S1 ^operator O2185 = 0.3895396381785524)
  28982. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  28983. -->
  28984. (S1 ^operator O2185 = 0.6104601514815886)
  28985. --- END Proposal Phase ---
  28986. --- Decision Phase ---
  28987. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.908108,0.0839013)
  28988. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324583 0.491003 0.815586 -> 0.324583 0.491003 0.815586(R,m,v=1,1,0)
  28989. =>WM: (15342: S1 ^operator O2187)
  28990. 1094: O: O2187 (predict-yes)
  28991. --- END Decision Phase ---
  28992. --- Application Phase ---
  28993. --- Firing Productions (PE) For State At Depth 1 ---
  28994. --- Inner Elaboration Phase, active level 1 (S1) ---
  28995. Firing apply*operator
  28996. -->
  28997. (I3 ^predict-yes N1094 + :O )
  28998. Firing apply*operator*complete
  28999. -->
  29000. (I3 ^predict-yes N1093 - :O )
  29001. inner elaboration loop at bottom goal.
  29002. --- Change Working Memory (PE) ---
  29003. =>WM: (15343: I3 ^predict-yes N1094)
  29004. <=WM: (15330: N1093 ^status complete)
  29005. <=WM: (15329: I3 ^predict-yes N1093)
  29006. --- Firing Productions (IE) For State At Depth 1 ---
  29007. --- Inner Elaboration Phase, active level 1 (S1) ---
  29008. Firing monitor*world
  29009. -->
  29010. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29011. --- Change Working Memory (IE) ---
  29012. --- END Application Phase ---
  29013. --- Output Phase ---
  29014. ENV: Agent did: predict-yes for direction L in state State-B
  29015. In State-B moving L
  29016. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29017. predict error 0
  29018. dir: dir isR
  29019. --- END Output Phase ---
  29020. -/|--- Input Phase ---
  29021. =>WM: (15347: I2 ^dir R)
  29022. =>WM: (15346: I2 ^reward 1)
  29023. =>WM: (15345: I2 ^see 1)
  29024. =>WM: (15344: N1094 ^status complete)
  29025. <=WM: (15333: I2 ^dir L)
  29026. <=WM: (15332: I2 ^reward 1)
  29027. <=WM: (15331: I2 ^see 1)
  29028. =>WM: (15348: I2 ^level-1 L1-root)
  29029. <=WM: (15334: I2 ^level-1 R1-root)
  29030. --- END Input Phase ---
  29031. --- Proposal Phase ---
  29032. --- Inner Elaboration Phase, active level 1 (S1) ---
  29033. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  29034. -->
  29035. (S1 ^operator O2188 = -0.02155734064455064)
  29036. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  29037. -->
  29038. (S1 ^operator O2187 = 0.8155863260028575)
  29039. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29040. -->
  29041. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29042. -->
  29043. Firing elaborate*copy-see-to-output-link
  29044. -->
  29045. (I3 ^see 1 +)
  29046. Firing elaborate*reward*based*on*reward
  29047. -->
  29048. (R1098 ^value 1 +)
  29049. (R1 ^reward R1098 +)
  29050. Firing propose*predict-yes
  29051. -->
  29052. (O2189 ^name predict-yes +)
  29053. (S1 ^operator O2189 +)
  29054. Firing propose*predict-no
  29055. -->
  29056. (O2190 ^name predict-no +)
  29057. (S1 ^operator O2190 +)
  29058. Firing rl*prefer*rvt*predict-no*H0*4
  29059. -->
  29060. (S1 ^operator O2188 = 0.4476193493934647)
  29061. Firing rl*prefer*rvt*predict-yes*H0*3
  29062. -->
  29063. (S1 ^operator O2187 = 0.1844121278291776)
  29064. Firing prefer*rvt*predict-yes*H0
  29065. -->
  29066. Firing prefer*rvt*predict-no*H0
  29067. -->
  29068. Firing elaborate*copy-dir-to-output-link
  29069. -->
  29070. (I3 ^dir R +)
  29071. inner elaboration loop at bottom goal.
  29072. Retracting elaborate*copy-see-to-output-link
  29073. -->
  29074. (I3 ^see 1 +)
  29075. Retracting propose*predict-no
  29076. -->
  29077. (O2188 ^name predict-no +)
  29078. (S1 ^operator O2188 +)
  29079. Retracting propose*predict-yes
  29080. -->
  29081. (O2187 ^name predict-yes +)
  29082. (S1 ^operator O2187 +)
  29083. Retracting elaborate*reward*based*on*reward
  29084. -->
  29085. (R1097 ^value 1 +)
  29086. (R1 ^reward R1097 +)
  29087. Retracting elaborate*copy-dir-to-output-link
  29088. -->
  29089. (I3 ^dir L +)
  29090. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*35
  29091. -->
  29092. (S1 ^operator O2188 = 0.2714993082286609)
  29093. Retracting rl*prefer*rvt*predict-no*H0*2
  29094. -->
  29095. (S1 ^operator O2188 = 0.3873354925260269)
  29096. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*36
  29097. -->
  29098. (S1 ^operator O2187 = 0.6104601514815886)
  29099. Retracting rl*prefer*rvt*predict-yes*H0*1
  29100. -->
  29101. (S1 ^operator O2187 = 0.3895396381785524)
  29102. =>WM: (15355: S1 ^operator O2190 +)
  29103. =>WM: (15354: S1 ^operator O2189 +)
  29104. =>WM: (15353: I3 ^dir R)
  29105. =>WM: (15352: O2190 ^name predict-no)
  29106. =>WM: (15351: O2189 ^name predict-yes)
  29107. =>WM: (15350: R1098 ^value 1)
  29108. =>WM: (15349: R1 ^reward R1098)
  29109. <=WM: (15340: S1 ^operator O2187 +)
  29110. <=WM: (15342: S1 ^operator O2187)
  29111. <=WM: (15341: S1 ^operator O2188 +)
  29112. <=WM: (15339: I3 ^dir L)
  29113. <=WM: (15335: R1 ^reward R1097)
  29114. <=WM: (15338: O2188 ^name predict-no)
  29115. <=WM: (15337: O2187 ^name predict-yes)
  29116. <=WM: (15336: R1097 ^value 1)
  29117. --- Inner Elaboration Phase, active level 1 (S1) ---
  29118. Firing prefer*rvt*predict-yes*H0
  29119. -->
  29120. Firing rl*prefer*rvt*predict-yes*H0*3
  29121. -->
  29122. (S1 ^operator O2189 = 0.1844121278291776)
  29123. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29124. -->
  29125. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  29126. -->
  29127. (S1 ^operator O2189 = 0.8155863260028575)
  29128. Firing prefer*rvt*predict-no*H0
  29129. -->
  29130. Firing rl*prefer*rvt*predict-no*H0*4
  29131. -->
  29132. (S1 ^operator O2190 = 0.4476193493934647)
  29133. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29134. -->
  29135. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  29136. -->
  29137. (S1 ^operator O2190 = -0.02155734064455064)
  29138. inner elaboration loop at bottom goal.
  29139. Retracting rl*prefer*rvt*predict-no*H0*4
  29140. -->
  29141. (S1 ^operator O2188 = 0.4476193493934647)
  29142. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  29143. -->
  29144. (S1 ^operator O2188 = -0.02155734064455064)
  29145. Retracting rl*prefer*rvt*predict-yes*H0*3
  29146. -->
  29147. (S1 ^operator O2187 = 0.1844121278291776)
  29148. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  29149. -->
  29150. (S1 ^operator O2187 = 0.8155863260028575)
  29151. --- END Proposal Phase ---
  29152. --- Decision Phase ---
  29153. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.901639,0.0891731)
  29154. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*36 0.288049 0.322411 0.61046 -> 0.288049 0.322411 0.61046(R,m,v=1,1,0)
  29155. =>WM: (15356: S1 ^operator O2189)
  29156. 1095: O: O2189 (predict-yes)
  29157. --- END Decision Phase ---
  29158. --- Application Phase ---
  29159. --- Firing Productions (PE) For State At Depth 1 ---
  29160. --- Inner Elaboration Phase, active level 1 (S1) ---
  29161. Firing apply*operator
  29162. -->
  29163. (I3 ^predict-yes N1095 + :O )
  29164. Firing apply*operator*complete
  29165. -->
  29166. (I3 ^predict-yes N1094 - :O )
  29167. inner elaboration loop at bottom goal.
  29168. --- Change Working Memory (PE) ---
  29169. =>WM: (15357: I3 ^predict-yes N1095)
  29170. <=WM: (15344: N1094 ^status complete)
  29171. <=WM: (15343: I3 ^predict-yes N1094)
  29172. --- Firing Productions (IE) For State At Depth 1 ---
  29173. --- Inner Elaboration Phase, active level 1 (S1) ---
  29174. Firing monitor*world
  29175. -->
  29176. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29177. --- Change Working Memory (IE) ---
  29178. --- END Application Phase ---
  29179. --- Output Phase ---
  29180. ENV: Agent did: predict-yes for direction R in state State-A
  29181. In State-A moving R
  29182. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  29183. predict error 0
  29184. dir: dir isR
  29185. --- END Output Phase ---
  29186. \-/--- Input Phase ---
  29187. =>WM: (15361: I2 ^dir R)
  29188. =>WM: (15360: I2 ^reward 1)
  29189. =>WM: (15359: I2 ^see 1)
  29190. =>WM: (15358: N1095 ^status complete)
  29191. <=WM: (15347: I2 ^dir R)
  29192. <=WM: (15346: I2 ^reward 1)
  29193. <=WM: (15345: I2 ^see 1)
  29194. =>WM: (15362: I2 ^level-1 R1-root)
  29195. <=WM: (15348: I2 ^level-1 L1-root)
  29196. --- END Input Phase ---
  29197. --- Proposal Phase ---
  29198. --- Inner Elaboration Phase, active level 1 (S1) ---
  29199. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  29200. -->
  29201. (S1 ^operator O2189 = 0.1398795999120246)
  29202. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  29203. -->
  29204. (S1 ^operator O2190 = 0.5523809777404911)
  29205. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29206. -->
  29207. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29208. -->
  29209. Firing elaborate*copy-see-to-output-link
  29210. -->
  29211. (I3 ^see 1 +)
  29212. Firing elaborate*reward*based*on*reward
  29213. -->
  29214. (R1099 ^value 1 +)
  29215. (R1 ^reward R1099 +)
  29216. Firing propose*predict-yes
  29217. -->
  29218. (O2191 ^name predict-yes +)
  29219. (S1 ^operator O2191 +)
  29220. Firing propose*predict-no
  29221. -->
  29222. (O2192 ^name predict-no +)
  29223. (S1 ^operator O2192 +)
  29224. Firing rl*prefer*rvt*predict-no*H0*4
  29225. -->
  29226. (S1 ^operator O2190 = 0.4476193493934647)
  29227. Firing rl*prefer*rvt*predict-yes*H0*3
  29228. -->
  29229. (S1 ^operator O2189 = 0.1844121278291776)
  29230. Firing prefer*rvt*predict-yes*H0
  29231. -->
  29232. Firing prefer*rvt*predict-no*H0
  29233. -->
  29234. Firing elaborate*copy-dir-to-output-link
  29235. -->
  29236. (I3 ^dir R +)
  29237. inner elaboration loop at bottom goal.
  29238. Retracting elaborate*copy-see-to-output-link
  29239. -->
  29240. (I3 ^see 1 +)
  29241. Retracting propose*predict-no
  29242. -->
  29243. (O2190 ^name predict-no +)
  29244. (S1 ^operator O2190 +)
  29245. Retracting propose*predict-yes
  29246. -->
  29247. (O2189 ^name predict-yes +)
  29248. (S1 ^operator O2189 +)
  29249. Retracting elaborate*reward*based*on*reward
  29250. -->
  29251. (R1098 ^value 1 +)
  29252. (R1 ^reward R1098 +)
  29253. Retracting elaborate*copy-dir-to-output-link
  29254. -->
  29255. (I3 ^dir R +)
  29256. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  29257. -->
  29258. (S1 ^operator O2190 = -0.02155734064455064)
  29259. Retracting rl*prefer*rvt*predict-no*H0*4
  29260. -->
  29261. (S1 ^operator O2190 = 0.4476193493934647)
  29262. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  29263. -->
  29264. (S1 ^operator O2189 = 0.8155863260028575)
  29265. Retracting rl*prefer*rvt*predict-yes*H0*3
  29266. -->
  29267. (S1 ^operator O2189 = 0.1844121278291776)
  29268. =>WM: (15368: S1 ^operator O2192 +)
  29269. =>WM: (15367: S1 ^operator O2191 +)
  29270. =>WM: (15366: O2192 ^name predict-no)
  29271. =>WM: (15365: O2191 ^name predict-yes)
  29272. =>WM: (15364: R1099 ^value 1)
  29273. =>WM: (15363: R1 ^reward R1099)
  29274. <=WM: (15354: S1 ^operator O2189 +)
  29275. <=WM: (15356: S1 ^operator O2189)
  29276. <=WM: (15355: S1 ^operator O2190 +)
  29277. <=WM: (15349: R1 ^reward R1098)
  29278. <=WM: (15352: O2190 ^name predict-no)
  29279. <=WM: (15351: O2189 ^name predict-yes)
  29280. <=WM: (15350: R1098 ^value 1)
  29281. --- Inner Elaboration Phase, active level 1 (S1) ---
  29282. Firing prefer*rvt*predict-yes*H0
  29283. -->
  29284. Firing rl*prefer*rvt*predict-yes*H0*3
  29285. -->
  29286. (S1 ^operator O2191 = 0.1844121278291776)
  29287. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29288. -->
  29289. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  29290. -->
  29291. (S1 ^operator O2191 = 0.1398795999120246)
  29292. Firing prefer*rvt*predict-no*H0
  29293. -->
  29294. Firing rl*prefer*rvt*predict-no*H0*4
  29295. -->
  29296. (S1 ^operator O2192 = 0.4476193493934647)
  29297. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29298. -->
  29299. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  29300. -->
  29301. (S1 ^operator O2192 = 0.5523809777404911)
  29302. inner elaboration loop at bottom goal.
  29303. Retracting rl*prefer*rvt*predict-no*H0*4
  29304. -->
  29305. (S1 ^operator O2190 = 0.4476193493934647)
  29306. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  29307. -->
  29308. (S1 ^operator O2190 = 0.5523809777404911)
  29309. Retracting rl*prefer*rvt*predict-yes*H0*3
  29310. -->
  29311. (S1 ^operator O2189 = 0.1844121278291776)
  29312. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  29313. -->
  29314. (S1 ^operator O2189 = 0.1398795999120246)
  29315. --- END Proposal Phase ---
  29316. --- Decision Phase ---
  29317. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675415 -0.491003 0.184412(R,m,v=1,0.908602,0.0834932)
  29318. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324583 0.491003 0.815586 -> 0.324583 0.491003 0.815587(R,m,v=1,1,0)
  29319. =>WM: (15369: S1 ^operator O2192)
  29320. 1096: O: O2192 (predict-no)
  29321. --- END Decision Phase ---
  29322. --- Application Phase ---
  29323. --- Firing Productions (PE) For State At Depth 1 ---
  29324. --- Inner Elaboration Phase, active level 1 (S1) ---
  29325. Firing apply*operator
  29326. -->
  29327. (I3 ^predict-no N1096 + :O )
  29328. Firing apply*operator*complete
  29329. -->
  29330. (I3 ^predict-yes N1095 - :O )
  29331. inner elaboration loop at bottom goal.
  29332. --- Change Working Memory (PE) ---
  29333. =>WM: (15370: I3 ^predict-no N1096)
  29334. <=WM: (15358: N1095 ^status complete)
  29335. <=WM: (15357: I3 ^predict-yes N1095)
  29336. --- Firing Productions (IE) For State At Depth 1 ---
  29337. --- Inner Elaboration Phase, active level 1 (S1) ---
  29338. Firing monitor*world
  29339. -->
  29340. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29341. --- Change Working Memory (IE) ---
  29342. --- END Application Phase ---
  29343. --- Output Phase ---
  29344. ENV: Agent did: predict-no for direction R in state State-B
  29345. In State-B moving R
  29346. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29347. predict error 0
  29348. dir: dir isR
  29349. --- END Output Phase ---
  29350. |\-/sleeping...
  29351. |--- Input Phase ---
  29352. =>WM: (15374: I2 ^dir R)
  29353. =>WM: (15373: I2 ^reward 1)
  29354. =>WM: (15372: I2 ^see 0)
  29355. =>WM: (15371: N1096 ^status complete)
  29356. <=WM: (15361: I2 ^dir R)
  29357. <=WM: (15360: I2 ^reward 1)
  29358. <=WM: (15359: I2 ^see 1)
  29359. =>WM: (15375: I2 ^level-1 R0-root)
  29360. <=WM: (15362: I2 ^level-1 R1-root)
  29361. --- END Input Phase ---
  29362. --- Proposal Phase ---
  29363. --- Inner Elaboration Phase, active level 1 (S1) ---
  29364. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29365. -->
  29366. (S1 ^operator O2191 = 0.1664311307472832)
  29367. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29368. -->
  29369. (S1 ^operator O2192 = 0.5523802012957619)
  29370. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29371. -->
  29372. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29373. -->
  29374. Firing elaborate*copy-see-to-output-link
  29375. -->
  29376. (I3 ^see 0 +)
  29377. Firing elaborate*reward*based*on*reward
  29378. -->
  29379. (R1100 ^value 1 +)
  29380. (R1 ^reward R1100 +)
  29381. Firing propose*predict-yes
  29382. -->
  29383. (O2193 ^name predict-yes +)
  29384. (S1 ^operator O2193 +)
  29385. Firing propose*predict-no
  29386. -->
  29387. (O2194 ^name predict-no +)
  29388. (S1 ^operator O2194 +)
  29389. Firing rl*prefer*rvt*predict-no*H0*4
  29390. -->
  29391. (S1 ^operator O2192 = 0.4476193493934647)
  29392. Firing rl*prefer*rvt*predict-yes*H0*3
  29393. -->
  29394. (S1 ^operator O2191 = 0.1844123597543724)
  29395. Firing prefer*rvt*predict-yes*H0
  29396. -->
  29397. Firing prefer*rvt*predict-no*H0
  29398. -->
  29399. Firing elaborate*copy-dir-to-output-link
  29400. -->
  29401. (I3 ^dir R +)
  29402. inner elaboration loop at bottom goal.
  29403. Retracting elaborate*copy-see-to-output-link
  29404. -->
  29405. (I3 ^see 1 +)
  29406. Retracting propose*predict-no
  29407. -->
  29408. (O2192 ^name predict-no +)
  29409. (S1 ^operator O2192 +)
  29410. Retracting propose*predict-yes
  29411. -->
  29412. (O2191 ^name predict-yes +)
  29413. (S1 ^operator O2191 +)
  29414. Retracting elaborate*reward*based*on*reward
  29415. -->
  29416. (R1099 ^value 1 +)
  29417. (R1 ^reward R1099 +)
  29418. Retracting elaborate*copy-dir-to-output-link
  29419. -->
  29420. (I3 ^dir R +)
  29421. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  29422. -->
  29423. (S1 ^operator O2192 = 0.5523809777404911)
  29424. Retracting rl*prefer*rvt*predict-no*H0*4
  29425. -->
  29426. (S1 ^operator O2192 = 0.4476193493934647)
  29427. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  29428. -->
  29429. (S1 ^operator O2191 = 0.1398795999120246)
  29430. Retracting rl*prefer*rvt*predict-yes*H0*3
  29431. -->
  29432. (S1 ^operator O2191 = 0.1844123597543724)
  29433. =>WM: (15382: S1 ^operator O2194 +)
  29434. =>WM: (15381: S1 ^operator O2193 +)
  29435. =>WM: (15380: O2194 ^name predict-no)
  29436. =>WM: (15379: O2193 ^name predict-yes)
  29437. =>WM: (15378: R1100 ^value 1)
  29438. =>WM: (15377: R1 ^reward R1100)
  29439. =>WM: (15376: I3 ^see 0)
  29440. <=WM: (15367: S1 ^operator O2191 +)
  29441. <=WM: (15368: S1 ^operator O2192 +)
  29442. <=WM: (15369: S1 ^operator O2192)
  29443. <=WM: (15363: R1 ^reward R1099)
  29444. <=WM: (15306: I3 ^see 1)
  29445. <=WM: (15366: O2192 ^name predict-no)
  29446. <=WM: (15365: O2191 ^name predict-yes)
  29447. <=WM: (15364: R1099 ^value 1)
  29448. --- Inner Elaboration Phase, active level 1 (S1) ---
  29449. Firing prefer*rvt*predict-yes*H0
  29450. -->
  29451. Firing rl*prefer*rvt*predict-yes*H0*3
  29452. -->
  29453. (S1 ^operator O2193 = 0.1844123597543724)
  29454. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29455. -->
  29456. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29457. -->
  29458. (S1 ^operator O2193 = 0.1664311307472832)
  29459. Firing prefer*rvt*predict-no*H0
  29460. -->
  29461. Firing rl*prefer*rvt*predict-no*H0*4
  29462. -->
  29463. (S1 ^operator O2194 = 0.4476193493934647)
  29464. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29465. -->
  29466. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29467. -->
  29468. (S1 ^operator O2194 = 0.5523802012957619)
  29469. inner elaboration loop at bottom goal.
  29470. Retracting rl*prefer*rvt*predict-no*H0*4
  29471. -->
  29472. (S1 ^operator O2192 = 0.4476193493934647)
  29473. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29474. -->
  29475. (S1 ^operator O2192 = 0.5523802012957619)
  29476. Retracting rl*prefer*rvt*predict-yes*H0*3
  29477. -->
  29478. (S1 ^operator O2191 = 0.1844123597543724)
  29479. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29480. -->
  29481. (S1 ^operator O2191 = 0.1664311307472832)
  29482. --- END Proposal Phase ---
  29483. --- Decision Phase ---
  29484. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.9375,0.0590035)
  29485. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  29486. =>WM: (15383: S1 ^operator O2194)
  29487. 1097: O: O2194 (predict-no)
  29488. --- END Decision Phase ---
  29489. --- Application Phase ---
  29490. --- Firing Productions (PE) For State At Depth 1 ---
  29491. --- Inner Elaboration Phase, active level 1 (S1) ---
  29492. Firing apply*operator
  29493. -->
  29494. (I3 ^predict-no N1097 + :O )
  29495. Firing apply*operator*complete
  29496. -->
  29497. (I3 ^predict-no N1096 - :O )
  29498. inner elaboration loop at bottom goal.
  29499. --- Change Working Memory (PE) ---
  29500. =>WM: (15384: I3 ^predict-no N1097)
  29501. <=WM: (15371: N1096 ^status complete)
  29502. <=WM: (15370: I3 ^predict-no N1096)
  29503. --- Firing Productions (IE) For State At Depth 1 ---
  29504. --- Inner Elaboration Phase, active level 1 (S1) ---
  29505. Firing monitor*world
  29506. -->
  29507. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29508. --- Change Working Memory (IE) ---
  29509. --- END Application Phase ---
  29510. --- Output Phase ---
  29511. ENV: Agent did: predict-no for direction R in state State-B
  29512. In State-B moving R
  29513. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29514. predict error 0
  29515. dir: dir isR
  29516. --- END Output Phase ---
  29517. \-/--- Input Phase ---
  29518. =>WM: (15388: I2 ^dir R)
  29519. =>WM: (15387: I2 ^reward 1)
  29520. =>WM: (15386: I2 ^see 0)
  29521. =>WM: (15385: N1097 ^status complete)
  29522. <=WM: (15374: I2 ^dir R)
  29523. <=WM: (15373: I2 ^reward 1)
  29524. <=WM: (15372: I2 ^see 0)
  29525. =>WM: (15389: I2 ^level-1 R0-root)
  29526. <=WM: (15375: I2 ^level-1 R0-root)
  29527. --- END Input Phase ---
  29528. --- Proposal Phase ---
  29529. --- Inner Elaboration Phase, active level 1 (S1) ---
  29530. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29531. -->
  29532. (S1 ^operator O2193 = 0.1664311307472832)
  29533. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29534. -->
  29535. (S1 ^operator O2194 = 0.5523802012957619)
  29536. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29537. -->
  29538. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29539. -->
  29540. Firing elaborate*copy-see-to-output-link
  29541. -->
  29542. (I3 ^see 0 +)
  29543. Firing elaborate*reward*based*on*reward
  29544. -->
  29545. (R1101 ^value 1 +)
  29546. (R1 ^reward R1101 +)
  29547. Firing propose*predict-yes
  29548. -->
  29549. (O2195 ^name predict-yes +)
  29550. (S1 ^operator O2195 +)
  29551. Firing propose*predict-no
  29552. -->
  29553. (O2196 ^name predict-no +)
  29554. (S1 ^operator O2196 +)
  29555. Firing rl*prefer*rvt*predict-no*H0*4
  29556. -->
  29557. (S1 ^operator O2194 = 0.4476193003233713)
  29558. Firing rl*prefer*rvt*predict-yes*H0*3
  29559. -->
  29560. (S1 ^operator O2193 = 0.1844123597543724)
  29561. Firing prefer*rvt*predict-yes*H0
  29562. -->
  29563. Firing prefer*rvt*predict-no*H0
  29564. -->
  29565. Firing elaborate*copy-dir-to-output-link
  29566. -->
  29567. (I3 ^dir R +)
  29568. inner elaboration loop at bottom goal.
  29569. Retracting elaborate*copy-see-to-output-link
  29570. -->
  29571. (I3 ^see 0 +)
  29572. Retracting propose*predict-no
  29573. -->
  29574. (O2194 ^name predict-no +)
  29575. (S1 ^operator O2194 +)
  29576. Retracting propose*predict-yes
  29577. -->
  29578. (O2193 ^name predict-yes +)
  29579. (S1 ^operator O2193 +)
  29580. Retracting elaborate*reward*based*on*reward
  29581. -->
  29582. (R1100 ^value 1 +)
  29583. (R1 ^reward R1100 +)
  29584. Retracting elaborate*copy-dir-to-output-link
  29585. -->
  29586. (I3 ^dir R +)
  29587. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29588. -->
  29589. (S1 ^operator O2194 = 0.5523802012957619)
  29590. Retracting rl*prefer*rvt*predict-no*H0*4
  29591. -->
  29592. (S1 ^operator O2194 = 0.4476193003233713)
  29593. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29594. -->
  29595. (S1 ^operator O2193 = 0.1664311307472832)
  29596. Retracting rl*prefer*rvt*predict-yes*H0*3
  29597. -->
  29598. (S1 ^operator O2193 = 0.1844123597543724)
  29599. =>WM: (15395: S1 ^operator O2196 +)
  29600. =>WM: (15394: S1 ^operator O2195 +)
  29601. =>WM: (15393: O2196 ^name predict-no)
  29602. =>WM: (15392: O2195 ^name predict-yes)
  29603. =>WM: (15391: R1101 ^value 1)
  29604. =>WM: (15390: R1 ^reward R1101)
  29605. <=WM: (15381: S1 ^operator O2193 +)
  29606. <=WM: (15382: S1 ^operator O2194 +)
  29607. <=WM: (15383: S1 ^operator O2194)
  29608. <=WM: (15377: R1 ^reward R1100)
  29609. <=WM: (15380: O2194 ^name predict-no)
  29610. <=WM: (15379: O2193 ^name predict-yes)
  29611. <=WM: (15378: R1100 ^value 1)
  29612. --- Inner Elaboration Phase, active level 1 (S1) ---
  29613. Firing prefer*rvt*predict-yes*H0
  29614. -->
  29615. Firing rl*prefer*rvt*predict-yes*H0*3
  29616. -->
  29617. (S1 ^operator O2195 = 0.1844123597543724)
  29618. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  29619. -->
  29620. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29621. -->
  29622. (S1 ^operator O2195 = 0.1664311307472832)
  29623. Firing prefer*rvt*predict-no*H0
  29624. -->
  29625. Firing rl*prefer*rvt*predict-no*H0*4
  29626. -->
  29627. (S1 ^operator O2196 = 0.4476193003233713)
  29628. Firing prefer*rvt*predict-no*H0*4*v1*H1
  29629. -->
  29630. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29631. -->
  29632. (S1 ^operator O2196 = 0.5523802012957619)
  29633. inner elaboration loop at bottom goal.
  29634. Retracting rl*prefer*rvt*predict-no*H0*4
  29635. -->
  29636. (S1 ^operator O2194 = 0.4476193003233713)
  29637. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29638. -->
  29639. (S1 ^operator O2194 = 0.5523802012957619)
  29640. Retracting rl*prefer*rvt*predict-yes*H0*3
  29641. -->
  29642. (S1 ^operator O2193 = 0.1844123597543724)
  29643. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29644. -->
  29645. (S1 ^operator O2193 = 0.1664311307472832)
  29646. --- END Proposal Phase ---
  29647. --- Decision Phase ---
  29648. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.937931,0.0586207)
  29649. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  29650. =>WM: (15396: S1 ^operator O2196)
  29651. 1098: O: O2196 (predict-no)
  29652. --- END Decision Phase ---
  29653. --- Application Phase ---
  29654. --- Firing Productions (PE) For State At Depth 1 ---
  29655. --- Inner Elaboration Phase, active level 1 (S1) ---
  29656. Firing apply*operator
  29657. -->
  29658. (I3 ^predict-no N1098 + :O )
  29659. Firing apply*operator*complete
  29660. -->
  29661. (I3 ^predict-no N1097 - :O )
  29662. inner elaboration loop at bottom goal.
  29663. --- Change Working Memory (PE) ---
  29664. =>WM: (15397: I3 ^predict-no N1098)
  29665. <=WM: (15385: N1097 ^status complete)
  29666. <=WM: (15384: I3 ^predict-no N1097)
  29667. --- Firing Productions (IE) For State At Depth 1 ---
  29668. --- Inner Elaboration Phase, active level 1 (S1) ---
  29669. Firing monitor*world
  29670. -->
  29671. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29672. --- Change Working Memory (IE) ---
  29673. --- END Application Phase ---
  29674. --- Output Phase ---
  29675. ENV: Agent did: predict-no for direction R in state State-B
  29676. In State-B moving R
  29677. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29678. predict error 0
  29679. dir: dir isU
  29680. --- END Output Phase ---
  29681. |\--- Input Phase ---
  29682. =>WM: (15401: I2 ^dir U)
  29683. =>WM: (15400: I2 ^reward 1)
  29684. =>WM: (15399: I2 ^see 0)
  29685. =>WM: (15398: N1098 ^status complete)
  29686. <=WM: (15388: I2 ^dir R)
  29687. <=WM: (15387: I2 ^reward 1)
  29688. <=WM: (15386: I2 ^see 0)
  29689. =>WM: (15402: I2 ^level-1 R0-root)
  29690. <=WM: (15389: I2 ^level-1 R0-root)
  29691. --- END Input Phase ---
  29692. --- Proposal Phase ---
  29693. --- Inner Elaboration Phase, active level 1 (S1) ---
  29694. Firing elaborate*copy-see-to-output-link
  29695. -->
  29696. (I3 ^see 0 +)
  29697. Firing elaborate*reward*based*on*reward
  29698. -->
  29699. (R1102 ^value 1 +)
  29700. (R1 ^reward R1102 +)
  29701. Firing propose*predict-yes
  29702. -->
  29703. (O2197 ^name predict-yes +)
  29704. (S1 ^operator O2197 +)
  29705. Firing propose*predict-no
  29706. -->
  29707. (O2198 ^name predict-no +)
  29708. (S1 ^operator O2198 +)
  29709. Firing rl*prefer*rvt*predict-no*H0*6
  29710. -->
  29711. (S1 ^operator O2196 = 0.9999999999999999)
  29712. Firing rl*prefer*rvt*predict-yes*H0*5
  29713. -->
  29714. (S1 ^operator O2195 = 0.)
  29715. Firing prefer*rvt*predict-yes*H0
  29716. -->
  29717. Firing prefer*rvt*predict-no*H0
  29718. -->
  29719. Firing elaborate*copy-dir-to-output-link
  29720. -->
  29721. (I3 ^dir U +)
  29722. inner elaboration loop at bottom goal.
  29723. Retracting elaborate*copy-see-to-output-link
  29724. -->
  29725. (I3 ^see 0 +)
  29726. Retracting propose*predict-no
  29727. -->
  29728. (O2196 ^name predict-no +)
  29729. (S1 ^operator O2196 +)
  29730. Retracting propose*predict-yes
  29731. -->
  29732. (O2195 ^name predict-yes +)
  29733. (S1 ^operator O2195 +)
  29734. Retracting elaborate*reward*based*on*reward
  29735. -->
  29736. (R1101 ^value 1 +)
  29737. (R1 ^reward R1101 +)
  29738. Retracting elaborate*copy-dir-to-output-link
  29739. -->
  29740. (I3 ^dir R +)
  29741. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  29742. -->
  29743. (S1 ^operator O2196 = 0.552380276052892)
  29744. Retracting rl*prefer*rvt*predict-no*H0*4
  29745. -->
  29746. (S1 ^operator O2196 = 0.4476193750805013)
  29747. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  29748. -->
  29749. (S1 ^operator O2195 = 0.1664311307472832)
  29750. Retracting rl*prefer*rvt*predict-yes*H0*3
  29751. -->
  29752. (S1 ^operator O2195 = 0.1844123597543724)
  29753. =>WM: (15409: S1 ^operator O2198 +)
  29754. =>WM: (15408: S1 ^operator O2197 +)
  29755. =>WM: (15407: I3 ^dir U)
  29756. =>WM: (15406: O2198 ^name predict-no)
  29757. =>WM: (15405: O2197 ^name predict-yes)
  29758. =>WM: (15404: R1102 ^value 1)
  29759. =>WM: (15403: R1 ^reward R1102)
  29760. <=WM: (15394: S1 ^operator O2195 +)
  29761. <=WM: (15395: S1 ^operator O2196 +)
  29762. <=WM: (15396: S1 ^operator O2196)
  29763. <=WM: (15353: I3 ^dir R)
  29764. <=WM: (15390: R1 ^reward R1101)
  29765. <=WM: (15393: O2196 ^name predict-no)
  29766. <=WM: (15392: O2195 ^name predict-yes)
  29767. <=WM: (15391: R1101 ^value 1)
  29768. --- Inner Elaboration Phase, active level 1 (S1) ---
  29769. Firing prefer*rvt*predict-yes*H0
  29770. -->
  29771. Firing rl*prefer*rvt*predict-yes*H0*5
  29772. -->
  29773. (S1 ^operator O2197 = 0.)
  29774. Firing prefer*rvt*predict-no*H0
  29775. -->
  29776. Firing rl*prefer*rvt*predict-no*H0*6
  29777. -->
  29778. (S1 ^operator O2198 = 0.9999999999999999)
  29779. inner elaboration loop at bottom goal.
  29780. Retracting rl*prefer*rvt*predict-no*H0*6
  29781. -->
  29782. (S1 ^operator O2196 = 0.9999999999999999)
  29783. Retracting rl*prefer*rvt*predict-yes*H0*5
  29784. -->
  29785. (S1 ^operator O2195 = 0.)
  29786. --- END Proposal Phase ---
  29787. --- Decision Phase ---
  29788. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.938356,0.0582428)
  29789. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  29790. =>WM: (15410: S1 ^operator O2198)
  29791. 1099: O: O2198 (predict-no)
  29792. --- END Decision Phase ---
  29793. --- Application Phase ---
  29794. --- Firing Productions (PE) For State At Depth 1 ---
  29795. --- Inner Elaboration Phase, active level 1 (S1) ---
  29796. Firing apply*operator
  29797. -->
  29798. (I3 ^predict-no N1099 + :O )
  29799. Firing apply*operator*complete
  29800. -->
  29801. (I3 ^predict-no N1098 - :O )
  29802. inner elaboration loop at bottom goal.
  29803. --- Change Working Memory (PE) ---
  29804. =>WM: (15411: I3 ^predict-no N1099)
  29805. <=WM: (15398: N1098 ^status complete)
  29806. <=WM: (15397: I3 ^predict-no N1098)
  29807. --- Firing Productions (IE) For State At Depth 1 ---
  29808. --- Inner Elaboration Phase, active level 1 (S1) ---
  29809. Firing monitor*world
  29810. -->
  29811. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29812. --- Change Working Memory (IE) ---
  29813. --- END Application Phase ---
  29814. --- Output Phase ---
  29815. ENV: Agent did: predict-no for direction U in state State-B
  29816. In State-B moving U
  29817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29818. predict error 0
  29819. dir: dir isU
  29820. --- END Output Phase ---
  29821. -/|\--- Input Phase ---
  29822. =>WM: (15415: I2 ^dir U)
  29823. =>WM: (15414: I2 ^reward 1)
  29824. =>WM: (15413: I2 ^see 0)
  29825. =>WM: (15412: N1099 ^status complete)
  29826. <=WM: (15401: I2 ^dir U)
  29827. <=WM: (15400: I2 ^reward 1)
  29828. <=WM: (15399: I2 ^see 0)
  29829. =>WM: (15416: I2 ^level-1 R0-root)
  29830. <=WM: (15402: I2 ^level-1 R0-root)
  29831. --- END Input Phase ---
  29832. --- Proposal Phase ---
  29833. --- Inner Elaboration Phase, active level 1 (S1) ---
  29834. Firing elaborate*copy-see-to-output-link
  29835. -->
  29836. (I3 ^see 0 +)
  29837. Firing elaborate*reward*based*on*reward
  29838. -->
  29839. (R1103 ^value 1 +)
  29840. (R1 ^reward R1103 +)
  29841. Firing propose*predict-yes
  29842. -->
  29843. (O2199 ^name predict-yes +)
  29844. (S1 ^operator O2199 +)
  29845. Firing propose*predict-no
  29846. -->
  29847. (O2200 ^name predict-no +)
  29848. (S1 ^operator O2200 +)
  29849. Firing rl*prefer*rvt*predict-no*H0*6
  29850. -->
  29851. (S1 ^operator O2198 = 0.9999999999999999)
  29852. Firing rl*prefer*rvt*predict-yes*H0*5
  29853. -->
  29854. (S1 ^operator O2197 = 0.)
  29855. Firing prefer*rvt*predict-yes*H0
  29856. -->
  29857. Firing prefer*rvt*predict-no*H0
  29858. -->
  29859. Firing elaborate*copy-dir-to-output-link
  29860. -->
  29861. (I3 ^dir U +)
  29862. inner elaboration loop at bottom goal.
  29863. Retracting elaborate*copy-see-to-output-link
  29864. -->
  29865. (I3 ^see 0 +)
  29866. Retracting propose*predict-no
  29867. -->
  29868. (O2198 ^name predict-no +)
  29869. (S1 ^operator O2198 +)
  29870. Retracting propose*predict-yes
  29871. -->
  29872. (O2197 ^name predict-yes +)
  29873. (S1 ^operator O2197 +)
  29874. Retracting elaborate*reward*based*on*reward
  29875. -->
  29876. (R1102 ^value 1 +)
  29877. (R1 ^reward R1102 +)
  29878. Retracting elaborate*copy-dir-to-output-link
  29879. -->
  29880. (I3 ^dir U +)
  29881. Retracting rl*prefer*rvt*predict-no*H0*6
  29882. -->
  29883. (S1 ^operator O2198 = 0.9999999999999999)
  29884. Retracting rl*prefer*rvt*predict-yes*H0*5
  29885. -->
  29886. (S1 ^operator O2197 = 0.)
  29887. =>WM: (15422: S1 ^operator O2200 +)
  29888. =>WM: (15421: S1 ^operator O2199 +)
  29889. =>WM: (15420: O2200 ^name predict-no)
  29890. =>WM: (15419: O2199 ^name predict-yes)
  29891. =>WM: (15418: R1103 ^value 1)
  29892. =>WM: (15417: R1 ^reward R1103)
  29893. <=WM: (15408: S1 ^operator O2197 +)
  29894. <=WM: (15409: S1 ^operator O2198 +)
  29895. <=WM: (15410: S1 ^operator O2198)
  29896. <=WM: (15403: R1 ^reward R1102)
  29897. <=WM: (15406: O2198 ^name predict-no)
  29898. <=WM: (15405: O2197 ^name predict-yes)
  29899. <=WM: (15404: R1102 ^value 1)
  29900. --- Inner Elaboration Phase, active level 1 (S1) ---
  29901. Firing prefer*rvt*predict-yes*H0
  29902. -->
  29903. Firing rl*prefer*rvt*predict-yes*H0*5
  29904. -->
  29905. (S1 ^operator O2199 = 0.)
  29906. Firing prefer*rvt*predict-no*H0
  29907. -->
  29908. Firing rl*prefer*rvt*predict-no*H0*6
  29909. -->
  29910. (S1 ^operator O2200 = 0.9999999999999999)
  29911. inner elaboration loop at bottom goal.
  29912. Retracting rl*prefer*rvt*predict-no*H0*6
  29913. -->
  29914. (S1 ^operator O2198 = 0.9999999999999999)
  29915. Retracting rl*prefer*rvt*predict-yes*H0*5
  29916. -->
  29917. (S1 ^operator O2197 = 0.)
  29918. --- END Proposal Phase ---
  29919. --- Decision Phase ---
  29920. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29921. =>WM: (15423: S1 ^operator O2200)
  29922. 1100: O: O2200 (predict-no)
  29923. --- END Decision Phase ---
  29924. --- Application Phase ---
  29925. --- Firing Productions (PE) For State At Depth 1 ---
  29926. --- Inner Elaboration Phase, active level 1 (S1) ---
  29927. Firing apply*operator
  29928. -->
  29929. (I3 ^predict-no N1100 + :O )
  29930. Firing apply*operator*complete
  29931. -->
  29932. (I3 ^predict-no N1099 - :O )
  29933. inner elaboration loop at bottom goal.
  29934. --- Change Working Memory (PE) ---
  29935. =>WM: (15424: I3 ^predict-no N1100)
  29936. <=WM: (15412: N1099 ^status complete)
  29937. <=WM: (15411: I3 ^predict-no N1099)
  29938. --- Firing Productions (IE) For State At Depth 1 ---
  29939. --- Inner Elaboration Phase, active level 1 (S1) ---
  29940. Firing monitor*world
  29941. -->
  29942. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29943. --- Change Working Memory (IE) ---
  29944. --- END Application Phase ---
  29945. --- Output Phase ---
  29946. ENV: Agent did: predict-no for direction U in state State-B
  29947. In State-B moving U
  29948. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  29949. predict error 0
  29950. dir: dir isL
  29951. --- END Output Phase ---
  29952. -/|--- Input Phase ---
  29953. =>WM: (15428: I2 ^dir L)
  29954. =>WM: (15427: I2 ^reward 1)
  29955. =>WM: (15426: I2 ^see 0)
  29956. =>WM: (15425: N1100 ^status complete)
  29957. <=WM: (15415: I2 ^dir U)
  29958. <=WM: (15414: I2 ^reward 1)
  29959. <=WM: (15413: I2 ^see 0)
  29960. =>WM: (15429: I2 ^level-1 R0-root)
  29961. <=WM: (15416: I2 ^level-1 R0-root)
  29962. --- END Input Phase ---
  29963. --- Proposal Phase ---
  29964. --- Inner Elaboration Phase, active level 1 (S1) ---
  29965. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  29966. -->
  29967. (S1 ^operator O2199 = 0.6104606012562985)
  29968. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  29969. -->
  29970. (S1 ^operator O2200 = 0.1063475139796038)
  29971. Firing prefer*rvt*predict-no*H0*2*v1*H1
  29972. -->
  29973. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  29974. -->
  29975. Firing elaborate*copy-see-to-output-link
  29976. -->
  29977. (I3 ^see 0 +)
  29978. Firing elaborate*reward*based*on*reward
  29979. -->
  29980. (R1104 ^value 1 +)
  29981. (R1 ^reward R1104 +)
  29982. Firing propose*predict-yes
  29983. -->
  29984. (O2201 ^name predict-yes +)
  29985. (S1 ^operator O2201 +)
  29986. Firing propose*predict-no
  29987. -->
  29988. (O2202 ^name predict-no +)
  29989. (S1 ^operator O2202 +)
  29990. Firing rl*prefer*rvt*predict-no*H0*2
  29991. -->
  29992. (S1 ^operator O2200 = 0.3873354925260269)
  29993. Firing rl*prefer*rvt*predict-yes*H0*1
  29994. -->
  29995. (S1 ^operator O2199 = 0.3895396697295312)
  29996. Firing prefer*rvt*predict-yes*H0
  29997. -->
  29998. Firing prefer*rvt*predict-no*H0
  29999. -->
  30000. Firing elaborate*copy-dir-to-output-link
  30001. -->
  30002. (I3 ^dir L +)
  30003. inner elaboration loop at bottom goal.
  30004. Retracting elaborate*copy-see-to-output-link
  30005. -->
  30006. (I3 ^see 0 +)
  30007. Retracting propose*predict-no
  30008. -->
  30009. (O2200 ^name predict-no +)
  30010. (S1 ^operator O2200 +)
  30011. Retracting propose*predict-yes
  30012. -->
  30013. (O2199 ^name predict-yes +)
  30014. (S1 ^operator O2199 +)
  30015. Retracting elaborate*reward*based*on*reward
  30016. -->
  30017. (R1103 ^value 1 +)
  30018. (R1 ^reward R1103 +)
  30019. Retracting elaborate*copy-dir-to-output-link
  30020. -->
  30021. (I3 ^dir U +)
  30022. Retracting rl*prefer*rvt*predict-no*H0*6
  30023. -->
  30024. (S1 ^operator O2200 = 0.9999999999999999)
  30025. Retracting rl*prefer*rvt*predict-yes*H0*5
  30026. -->
  30027. (S1 ^operator O2199 = 0.)
  30028. =>WM: (15436: S1 ^operator O2202 +)
  30029. =>WM: (15435: S1 ^operator O2201 +)
  30030. =>WM: (15434: I3 ^dir L)
  30031. =>WM: (15433: O2202 ^name predict-no)
  30032. =>WM: (15432: O2201 ^name predict-yes)
  30033. =>WM: (15431: R1104 ^value 1)
  30034. =>WM: (15430: R1 ^reward R1104)
  30035. <=WM: (15421: S1 ^operator O2199 +)
  30036. <=WM: (15422: S1 ^operator O2200 +)
  30037. <=WM: (15423: S1 ^operator O2200)
  30038. <=WM: (15407: I3 ^dir U)
  30039. <=WM: (15417: R1 ^reward R1103)
  30040. <=WM: (15420: O2200 ^name predict-no)
  30041. <=WM: (15419: O2199 ^name predict-yes)
  30042. <=WM: (15418: R1103 ^value 1)
  30043. --- Inner Elaboration Phase, active level 1 (S1) ---
  30044. Firing prefer*rvt*predict-yes*H0
  30045. -->
  30046. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  30047. -->
  30048. (S1 ^operator O2201 = 0.6104606012562985)
  30049. Firing rl*prefer*rvt*predict-yes*H0*1
  30050. -->
  30051. (S1 ^operator O2201 = 0.3895396697295312)
  30052. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  30053. -->
  30054. Firing prefer*rvt*predict-no*H0
  30055. -->
  30056. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  30057. -->
  30058. (S1 ^operator O2202 = 0.1063475139796038)
  30059. Firing rl*prefer*rvt*predict-no*H0*2
  30060. -->
  30061. (S1 ^operator O2202 = 0.3873354925260269)
  30062. Firing prefer*rvt*predict-no*H0*2*v1*H1
  30063. -->
  30064. inner elaboration loop at bottom goal.
  30065. Retracting rl*prefer*rvt*predict-no*H0*2
  30066. -->
  30067. (S1 ^operator O2200 = 0.3873354925260269)
  30068. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  30069. -->
  30070. (S1 ^operator O2200 = 0.1063475139796038)
  30071. Retracting rl*prefer*rvt*predict-yes*H0*1
  30072. -->
  30073. (S1 ^operator O2199 = 0.3895396697295312)
  30074. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  30075. -->
  30076. (S1 ^operator O2199 = 0.6104606012562985)
  30077. --- END Proposal Phase ---
  30078. --- Decision Phase ---
  30079. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30080. =>WM: (15437: S1 ^operator O2201)
  30081. 1101: O: O2201 (predict-yes)
  30082. --- END Decision Phase ---
  30083. --- Application Phase ---
  30084. --- Firing Productions (PE) For State At Depth 1 ---
  30085. --- Inner Elaboration Phase, active level 1 (S1) ---
  30086. Firing apply*operator
  30087. -->
  30088. (I3 ^predict-yes N1101 + :O )
  30089. Firing apply*operator*complete
  30090. -->
  30091. (I3 ^predict-no N1100 - :O )
  30092. inner elaboration loop at bottom goal.
  30093. --- Change Working Memory (PE) ---
  30094. =>WM: (15438: I3 ^predict-yes N1101)
  30095. <=WM: (15425: N1100 ^status complete)
  30096. <=WM: (15424: I3 ^predict-no N1100)
  30097. --- Firing Productions (IE) For State At Depth 1 ---
  30098. --- Inner Elaboration Phase, active level 1 (S1) ---
  30099. Firing monitor*world
  30100. -->
  30101. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30102. --- Change Working Memory (IE) ---
  30103. --- END Application Phase ---
  30104. --- Output Phase ---
  30105. ENV: Agent did: predict-yes for direction L in state State-B
  30106. In State-B moving L
  30107. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  30108. predict error 0
  30109. dir: dir isR
  30110. --- END Output Phase ---
  30111. \--- Input Phase ---
  30112. =>WM: (15442: I2 ^dir R)
  30113. =>WM: (15441: I2 ^reward 1)
  30114. =>WM: (15440: I2 ^see 1)
  30115. =>WM: (15439: N1101 ^status complete)
  30116. <=WM: (15428: I2 ^dir L)
  30117. <=WM: (15427: I2 ^reward 1)
  30118. <=WM: (15426: I2 ^see 0)
  30119. =>WM: (15443: I2 ^level-1 L1-root)
  30120. <=WM: (15429: I2 ^level-1 R0-root)
  30121. --- END Input Phase ---
  30122. --- Proposal Phase ---
  30123. --- Inner Elaboration Phase, active level 1 (S1) ---
  30124. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  30125. -->
  30126. (S1 ^operator O2202 = -0.02155734064455064)
  30127. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  30128. -->
  30129. (S1 ^operator O2201 = 0.8155865579280523)
  30130. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30131. -->
  30132. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30133. -->
  30134. Firing elaborate*copy-see-to-output-link
  30135. -->
  30136. (I3 ^see 1 +)
  30137. Firing elaborate*reward*based*on*reward
  30138. -->
  30139. (R1105 ^value 1 +)
  30140. (R1 ^reward R1105 +)
  30141. Firing propose*predict-yes
  30142. -->
  30143. (O2203 ^name predict-yes +)
  30144. (S1 ^operator O2203 +)
  30145. Firing propose*predict-no
  30146. -->
  30147. (O2204 ^name predict-no +)
  30148. (S1 ^operator O2204 +)
  30149. Firing rl*prefer*rvt*predict-no*H0*4
  30150. -->
  30151. (S1 ^operator O2202 = 0.4476194274104922)
  30152. Firing rl*prefer*rvt*predict-yes*H0*3
  30153. -->
  30154. (S1 ^operator O2201 = 0.1844123597543724)
  30155. Firing prefer*rvt*predict-yes*H0
  30156. -->
  30157. Firing prefer*rvt*predict-no*H0
  30158. -->
  30159. Firing elaborate*copy-dir-to-output-link
  30160. -->
  30161. (I3 ^dir R +)
  30162. inner elaboration loop at bottom goal.
  30163. Retracting elaborate*copy-see-to-output-link
  30164. -->
  30165. (I3 ^see 0 +)
  30166. Retracting propose*predict-no
  30167. -->
  30168. (O2202 ^name predict-no +)
  30169. (S1 ^operator O2202 +)
  30170. Retracting propose*predict-yes
  30171. -->
  30172. (O2201 ^name predict-yes +)
  30173. (S1 ^operator O2201 +)
  30174. Retracting elaborate*reward*based*on*reward
  30175. -->
  30176. (R1104 ^value 1 +)
  30177. (R1 ^reward R1104 +)
  30178. Retracting elaborate*copy-dir-to-output-link
  30179. -->
  30180. (I3 ^dir L +)
  30181. Retracting rl*prefer*rvt*predict-no*H0*2
  30182. -->
  30183. (S1 ^operator O2202 = 0.3873354925260269)
  30184. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  30185. -->
  30186. (S1 ^operator O2202 = 0.1063475139796038)
  30187. Retracting rl*prefer*rvt*predict-yes*H0*1
  30188. -->
  30189. (S1 ^operator O2201 = 0.3895396697295312)
  30190. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  30191. -->
  30192. (S1 ^operator O2201 = 0.6104606012562985)
  30193. =>WM: (15451: S1 ^operator O2204 +)
  30194. =>WM: (15450: S1 ^operator O2203 +)
  30195. =>WM: (15449: I3 ^dir R)
  30196. =>WM: (15448: O2204 ^name predict-no)
  30197. =>WM: (15447: O2203 ^name predict-yes)
  30198. =>WM: (15446: R1105 ^value 1)
  30199. =>WM: (15445: R1 ^reward R1105)
  30200. =>WM: (15444: I3 ^see 1)
  30201. <=WM: (15435: S1 ^operator O2201 +)
  30202. <=WM: (15437: S1 ^operator O2201)
  30203. <=WM: (15436: S1 ^operator O2202 +)
  30204. <=WM: (15434: I3 ^dir L)
  30205. <=WM: (15430: R1 ^reward R1104)
  30206. <=WM: (15376: I3 ^see 0)
  30207. <=WM: (15433: O2202 ^name predict-no)
  30208. <=WM: (15432: O2201 ^name predict-yes)
  30209. <=WM: (15431: R1104 ^value 1)
  30210. --- Inner Elaboration Phase, active level 1 (S1) ---
  30211. Firing prefer*rvt*predict-yes*H0
  30212. -->
  30213. Firing rl*prefer*rvt*predict-yes*H0*3
  30214. -->
  30215. (S1 ^operator O2203 = 0.1844123597543724)
  30216. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30217. -->
  30218. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  30219. -->
  30220. (S1 ^operator O2203 = 0.8155865579280523)
  30221. Firing prefer*rvt*predict-no*H0
  30222. -->
  30223. Firing rl*prefer*rvt*predict-no*H0*4
  30224. -->
  30225. (S1 ^operator O2204 = 0.4476194274104922)
  30226. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30227. -->
  30228. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  30229. -->
  30230. (S1 ^operator O2204 = -0.02155734064455064)
  30231. inner elaboration loop at bottom goal.
  30232. Retracting rl*prefer*rvt*predict-no*H0*4
  30233. -->
  30234. (S1 ^operator O2202 = 0.4476194274104922)
  30235. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  30236. -->
  30237. (S1 ^operator O2202 = -0.02155734064455064)
  30238. Retracting rl*prefer*rvt*predict-yes*H0*3
  30239. -->
  30240. (S1 ^operator O2201 = 0.1844123597543724)
  30241. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  30242. -->
  30243. (S1 ^operator O2201 = 0.8155865579280523)
  30244. --- END Proposal Phase ---
  30245. --- Decision Phase ---
  30246. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.902174,0.0887384)
  30247. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  30248. =>WM: (15452: S1 ^operator O2203)
  30249. 1102: O: O2203 (predict-yes)
  30250. --- END Decision Phase ---
  30251. --- Application Phase ---
  30252. --- Firing Productions (PE) For State At Depth 1 ---
  30253. --- Inner Elaboration Phase, active level 1 (S1) ---
  30254. Firing apply*operator
  30255. -->
  30256. (I3 ^predict-yes N1102 + :O )
  30257. Firing apply*operator*complete
  30258. -->
  30259. (I3 ^predict-yes N1101 - :O )
  30260. inner elaboration loop at bottom goal.
  30261. --- Change Working Memory (PE) ---
  30262. =>WM: (15453: I3 ^predict-yes N1102)
  30263. <=WM: (15439: N1101 ^status complete)
  30264. <=WM: (15438: I3 ^predict-yes N1101)
  30265. --- Firing Productions (IE) For State At Depth 1 ---
  30266. --- Inner Elaboration Phase, active level 1 (S1) ---
  30267. Firing monitor*world
  30268. -->
  30269. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30270. --- Change Working Memory (IE) ---
  30271. --- END Application Phase ---
  30272. --- Output Phase ---
  30273. ENV: Agent did: predict-yes for direction R in state State-A
  30274. In State-A moving R
  30275. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30276. predict error 0
  30277. dir: dir isU
  30278. --- END Output Phase ---
  30279. -/|--- Input Phase ---
  30280. =>WM: (15457: I2 ^dir U)
  30281. =>WM: (15456: I2 ^reward 1)
  30282. =>WM: (15455: I2 ^see 1)
  30283. =>WM: (15454: N1102 ^status complete)
  30284. <=WM: (15442: I2 ^dir R)
  30285. <=WM: (15441: I2 ^reward 1)
  30286. <=WM: (15440: I2 ^see 1)
  30287. =>WM: (15458: I2 ^level-1 R1-root)
  30288. <=WM: (15443: I2 ^level-1 L1-root)
  30289. --- END Input Phase ---
  30290. --- Proposal Phase ---
  30291. --- Inner Elaboration Phase, active level 1 (S1) ---
  30292. Firing elaborate*copy-see-to-output-link
  30293. -->
  30294. (I3 ^see 1 +)
  30295. Firing elaborate*reward*based*on*reward
  30296. -->
  30297. (R1106 ^value 1 +)
  30298. (R1 ^reward R1106 +)
  30299. Firing propose*predict-yes
  30300. -->
  30301. (O2205 ^name predict-yes +)
  30302. (S1 ^operator O2205 +)
  30303. Firing propose*predict-no
  30304. -->
  30305. (O2206 ^name predict-no +)
  30306. (S1 ^operator O2206 +)
  30307. Firing rl*prefer*rvt*predict-no*H0*6
  30308. -->
  30309. (S1 ^operator O2204 = 0.9999999999999999)
  30310. Firing rl*prefer*rvt*predict-yes*H0*5
  30311. -->
  30312. (S1 ^operator O2203 = 0.)
  30313. Firing prefer*rvt*predict-yes*H0
  30314. -->
  30315. Firing prefer*rvt*predict-no*H0
  30316. -->
  30317. Firing elaborate*copy-dir-to-output-link
  30318. -->
  30319. (I3 ^dir U +)
  30320. inner elaboration loop at bottom goal.
  30321. Retracting elaborate*copy-see-to-output-link
  30322. -->
  30323. (I3 ^see 1 +)
  30324. Retracting propose*predict-no
  30325. -->
  30326. (O2204 ^name predict-no +)
  30327. (S1 ^operator O2204 +)
  30328. Retracting propose*predict-yes
  30329. -->
  30330. (O2203 ^name predict-yes +)
  30331. (S1 ^operator O2203 +)
  30332. Retracting elaborate*reward*based*on*reward
  30333. -->
  30334. (R1105 ^value 1 +)
  30335. (R1 ^reward R1105 +)
  30336. Retracting elaborate*copy-dir-to-output-link
  30337. -->
  30338. (I3 ^dir R +)
  30339. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  30340. -->
  30341. (S1 ^operator O2204 = -0.02155734064455064)
  30342. Retracting rl*prefer*rvt*predict-no*H0*4
  30343. -->
  30344. (S1 ^operator O2204 = 0.4476194274104922)
  30345. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  30346. -->
  30347. (S1 ^operator O2203 = 0.8155865579280523)
  30348. Retracting rl*prefer*rvt*predict-yes*H0*3
  30349. -->
  30350. (S1 ^operator O2203 = 0.1844123597543724)
  30351. =>WM: (15465: S1 ^operator O2206 +)
  30352. =>WM: (15464: S1 ^operator O2205 +)
  30353. =>WM: (15463: I3 ^dir U)
  30354. =>WM: (15462: O2206 ^name predict-no)
  30355. =>WM: (15461: O2205 ^name predict-yes)
  30356. =>WM: (15460: R1106 ^value 1)
  30357. =>WM: (15459: R1 ^reward R1106)
  30358. <=WM: (15450: S1 ^operator O2203 +)
  30359. <=WM: (15452: S1 ^operator O2203)
  30360. <=WM: (15451: S1 ^operator O2204 +)
  30361. <=WM: (15449: I3 ^dir R)
  30362. <=WM: (15445: R1 ^reward R1105)
  30363. <=WM: (15448: O2204 ^name predict-no)
  30364. <=WM: (15447: O2203 ^name predict-yes)
  30365. <=WM: (15446: R1105 ^value 1)
  30366. --- Inner Elaboration Phase, active level 1 (S1) ---
  30367. Firing prefer*rvt*predict-yes*H0
  30368. -->
  30369. Firing rl*prefer*rvt*predict-yes*H0*5
  30370. -->
  30371. (S1 ^operator O2205 = 0.)
  30372. Firing prefer*rvt*predict-no*H0
  30373. -->
  30374. Firing rl*prefer*rvt*predict-no*H0*6
  30375. -->
  30376. (S1 ^operator O2206 = 0.9999999999999999)
  30377. inner elaboration loop at bottom goal.
  30378. Retracting rl*prefer*rvt*predict-no*H0*6
  30379. -->
  30380. (S1 ^operator O2204 = 0.9999999999999999)
  30381. Retracting rl*prefer*rvt*predict-yes*H0*5
  30382. -->
  30383. (S1 ^operator O2203 = 0.)
  30384. --- END Proposal Phase ---
  30385. --- Decision Phase ---
  30386. RL update rl*prefer*rvt*predict-yes*H0*3 0.675415 -0.491003 0.184412 -> 0.675416 -0.491003 0.184413(R,m,v=1,0.909091,0.083089)
  30387. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324583 0.491003 0.815587 -> 0.324583 0.491003 0.815587(R,m,v=1,1,0)
  30388. =>WM: (15466: S1 ^operator O2206)
  30389. 1103: O: O2206 (predict-no)
  30390. --- END Decision Phase ---
  30391. --- Application Phase ---
  30392. --- Firing Productions (PE) For State At Depth 1 ---
  30393. --- Inner Elaboration Phase, active level 1 (S1) ---
  30394. Firing apply*operator
  30395. -->
  30396. (I3 ^predict-no N1103 + :O )
  30397. Firing apply*operator*complete
  30398. -->
  30399. (I3 ^predict-yes N1102 - :O )
  30400. inner elaboration loop at bottom goal.
  30401. --- Change Working Memory (PE) ---
  30402. =>WM: (15467: I3 ^predict-no N1103)
  30403. <=WM: (15454: N1102 ^status complete)
  30404. <=WM: (15453: I3 ^predict-yes N1102)
  30405. --- Firing Productions (IE) For State At Depth 1 ---
  30406. --- Inner Elaboration Phase, active level 1 (S1) ---
  30407. Firing monitor*world
  30408. -->
  30409. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30410. --- Change Working Memory (IE) ---
  30411. --- END Application Phase ---
  30412. --- Output Phase ---
  30413. ENV: Agent did: predict-no for direction U in state State-B
  30414. In State-B moving U
  30415. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30416. predict error 0
  30417. dir: dir isU
  30418. --- END Output Phase ---
  30419. \-/--- Input Phase ---
  30420. =>WM: (15471: I2 ^dir U)
  30421. =>WM: (15470: I2 ^reward 1)
  30422. =>WM: (15469: I2 ^see 0)
  30423. =>WM: (15468: N1103 ^status complete)
  30424. <=WM: (15457: I2 ^dir U)
  30425. <=WM: (15456: I2 ^reward 1)
  30426. <=WM: (15455: I2 ^see 1)
  30427. =>WM: (15472: I2 ^level-1 R1-root)
  30428. <=WM: (15458: I2 ^level-1 R1-root)
  30429. --- END Input Phase ---
  30430. --- Proposal Phase ---
  30431. --- Inner Elaboration Phase, active level 1 (S1) ---
  30432. Firing elaborate*copy-see-to-output-link
  30433. -->
  30434. (I3 ^see 0 +)
  30435. Firing elaborate*reward*based*on*reward
  30436. -->
  30437. (R1107 ^value 1 +)
  30438. (R1 ^reward R1107 +)
  30439. Firing propose*predict-yes
  30440. -->
  30441. (O2207 ^name predict-yes +)
  30442. (S1 ^operator O2207 +)
  30443. Firing propose*predict-no
  30444. -->
  30445. (O2208 ^name predict-no +)
  30446. (S1 ^operator O2208 +)
  30447. Firing rl*prefer*rvt*predict-no*H0*6
  30448. -->
  30449. (S1 ^operator O2206 = 0.9999999999999999)
  30450. Firing rl*prefer*rvt*predict-yes*H0*5
  30451. -->
  30452. (S1 ^operator O2205 = 0.)
  30453. Firing prefer*rvt*predict-yes*H0
  30454. -->
  30455. Firing prefer*rvt*predict-no*H0
  30456. -->
  30457. Firing elaborate*copy-dir-to-output-link
  30458. -->
  30459. (I3 ^dir U +)
  30460. inner elaboration loop at bottom goal.
  30461. Retracting elaborate*copy-see-to-output-link
  30462. -->
  30463. (I3 ^see 1 +)
  30464. Retracting propose*predict-no
  30465. -->
  30466. (O2206 ^name predict-no +)
  30467. (S1 ^operator O2206 +)
  30468. Retracting propose*predict-yes
  30469. -->
  30470. (O2205 ^name predict-yes +)
  30471. (S1 ^operator O2205 +)
  30472. Retracting elaborate*reward*based*on*reward
  30473. -->
  30474. (R1106 ^value 1 +)
  30475. (R1 ^reward R1106 +)
  30476. Retracting elaborate*copy-dir-to-output-link
  30477. -->
  30478. (I3 ^dir U +)
  30479. Retracting rl*prefer*rvt*predict-no*H0*6
  30480. -->
  30481. (S1 ^operator O2206 = 0.9999999999999999)
  30482. Retracting rl*prefer*rvt*predict-yes*H0*5
  30483. -->
  30484. (S1 ^operator O2205 = 0.)
  30485. =>WM: (15479: S1 ^operator O2208 +)
  30486. =>WM: (15478: S1 ^operator O2207 +)
  30487. =>WM: (15477: O2208 ^name predict-no)
  30488. =>WM: (15476: O2207 ^name predict-yes)
  30489. =>WM: (15475: R1107 ^value 1)
  30490. =>WM: (15474: R1 ^reward R1107)
  30491. =>WM: (15473: I3 ^see 0)
  30492. <=WM: (15464: S1 ^operator O2205 +)
  30493. <=WM: (15465: S1 ^operator O2206 +)
  30494. <=WM: (15466: S1 ^operator O2206)
  30495. <=WM: (15459: R1 ^reward R1106)
  30496. <=WM: (15444: I3 ^see 1)
  30497. <=WM: (15462: O2206 ^name predict-no)
  30498. <=WM: (15461: O2205 ^name predict-yes)
  30499. <=WM: (15460: R1106 ^value 1)
  30500. --- Inner Elaboration Phase, active level 1 (S1) ---
  30501. Firing prefer*rvt*predict-yes*H0
  30502. -->
  30503. Firing rl*prefer*rvt*predict-yes*H0*5
  30504. -->
  30505. (S1 ^operator O2207 = 0.)
  30506. Firing prefer*rvt*predict-no*H0
  30507. -->
  30508. Firing rl*prefer*rvt*predict-no*H0*6
  30509. -->
  30510. (S1 ^operator O2208 = 0.9999999999999999)
  30511. inner elaboration loop at bottom goal.
  30512. Retracting rl*prefer*rvt*predict-no*H0*6
  30513. -->
  30514. (S1 ^operator O2206 = 0.9999999999999999)
  30515. Retracting rl*prefer*rvt*predict-yes*H0*5
  30516. -->
  30517. (S1 ^operator O2205 = 0.)
  30518. --- END Proposal Phase ---
  30519. --- Decision Phase ---
  30520. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30521. =>WM: (15480: S1 ^operator O2208)
  30522. 1104: O: O2208 (predict-no)
  30523. --- END Decision Phase ---
  30524. --- Application Phase ---
  30525. --- Firing Productions (PE) For State At Depth 1 ---
  30526. --- Inner Elaboration Phase, active level 1 (S1) ---
  30527. Firing apply*operator
  30528. -->
  30529. (I3 ^predict-no N1104 + :O )
  30530. Firing apply*operator*complete
  30531. -->
  30532. (I3 ^predict-no N1103 - :O )
  30533. inner elaboration loop at bottom goal.
  30534. --- Change Working Memory (PE) ---
  30535. =>WM: (15481: I3 ^predict-no N1104)
  30536. <=WM: (15468: N1103 ^status complete)
  30537. <=WM: (15467: I3 ^predict-no N1103)
  30538. --- Firing Productions (IE) For State At Depth 1 ---
  30539. --- Inner Elaboration Phase, active level 1 (S1) ---
  30540. Firing monitor*world
  30541. -->
  30542. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30543. --- Change Working Memory (IE) ---
  30544. --- END Application Phase ---
  30545. --- Output Phase ---
  30546. ENV: Agent did: predict-no for direction U in state State-B
  30547. In State-B moving U
  30548. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30549. predict error 0
  30550. dir: dir isR
  30551. --- END Output Phase ---
  30552. |\---- Input Phase ---
  30553. =>WM: (15485: I2 ^dir R)
  30554. =>WM: (15484: I2 ^reward 1)
  30555. =>WM: (15483: I2 ^see 0)
  30556. =>WM: (15482: N1104 ^status complete)
  30557. <=WM: (15471: I2 ^dir U)
  30558. <=WM: (15470: I2 ^reward 1)
  30559. <=WM: (15469: I2 ^see 0)
  30560. =>WM: (15486: I2 ^level-1 R1-root)
  30561. <=WM: (15472: I2 ^level-1 R1-root)
  30562. --- END Input Phase ---
  30563. --- Proposal Phase ---
  30564. --- Inner Elaboration Phase, active level 1 (S1) ---
  30565. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  30566. -->
  30567. (S1 ^operator O2207 = 0.1398795999120246)
  30568. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  30569. -->
  30570. (S1 ^operator O2208 = 0.5523809286703978)
  30571. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30572. -->
  30573. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30574. -->
  30575. Firing elaborate*copy-see-to-output-link
  30576. -->
  30577. (I3 ^see 0 +)
  30578. Firing elaborate*reward*based*on*reward
  30579. -->
  30580. (R1108 ^value 1 +)
  30581. (R1 ^reward R1108 +)
  30582. Firing propose*predict-yes
  30583. -->
  30584. (O2209 ^name predict-yes +)
  30585. (S1 ^operator O2209 +)
  30586. Firing propose*predict-no
  30587. -->
  30588. (O2210 ^name predict-no +)
  30589. (S1 ^operator O2210 +)
  30590. Firing rl*prefer*rvt*predict-no*H0*4
  30591. -->
  30592. (S1 ^operator O2208 = 0.4476194274104922)
  30593. Firing rl*prefer*rvt*predict-yes*H0*3
  30594. -->
  30595. (S1 ^operator O2207 = 0.1844125221020087)
  30596. Firing prefer*rvt*predict-yes*H0
  30597. -->
  30598. Firing prefer*rvt*predict-no*H0
  30599. -->
  30600. Firing elaborate*copy-dir-to-output-link
  30601. -->
  30602. (I3 ^dir R +)
  30603. inner elaboration loop at bottom goal.
  30604. Retracting elaborate*copy-see-to-output-link
  30605. -->
  30606. (I3 ^see 0 +)
  30607. Retracting propose*predict-no
  30608. -->
  30609. (O2208 ^name predict-no +)
  30610. (S1 ^operator O2208 +)
  30611. Retracting propose*predict-yes
  30612. -->
  30613. (O2207 ^name predict-yes +)
  30614. (S1 ^operator O2207 +)
  30615. Retracting elaborate*reward*based*on*reward
  30616. -->
  30617. (R1107 ^value 1 +)
  30618. (R1 ^reward R1107 +)
  30619. Retracting elaborate*copy-dir-to-output-link
  30620. -->
  30621. (I3 ^dir U +)
  30622. Retracting rl*prefer*rvt*predict-no*H0*6
  30623. -->
  30624. (S1 ^operator O2208 = 0.9999999999999999)
  30625. Retracting rl*prefer*rvt*predict-yes*H0*5
  30626. -->
  30627. (S1 ^operator O2207 = 0.)
  30628. =>WM: (15493: S1 ^operator O2210 +)
  30629. =>WM: (15492: S1 ^operator O2209 +)
  30630. =>WM: (15491: I3 ^dir R)
  30631. =>WM: (15490: O2210 ^name predict-no)
  30632. =>WM: (15489: O2209 ^name predict-yes)
  30633. =>WM: (15488: R1108 ^value 1)
  30634. =>WM: (15487: R1 ^reward R1108)
  30635. <=WM: (15478: S1 ^operator O2207 +)
  30636. <=WM: (15479: S1 ^operator O2208 +)
  30637. <=WM: (15480: S1 ^operator O2208)
  30638. <=WM: (15463: I3 ^dir U)
  30639. <=WM: (15474: R1 ^reward R1107)
  30640. <=WM: (15477: O2208 ^name predict-no)
  30641. <=WM: (15476: O2207 ^name predict-yes)
  30642. <=WM: (15475: R1107 ^value 1)
  30643. --- Inner Elaboration Phase, active level 1 (S1) ---
  30644. Firing prefer*rvt*predict-yes*H0
  30645. -->
  30646. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  30647. -->
  30648. (S1 ^operator O2209 = 0.1398795999120246)
  30649. Firing rl*prefer*rvt*predict-yes*H0*3
  30650. -->
  30651. (S1 ^operator O2209 = 0.1844125221020087)
  30652. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30653. -->
  30654. Firing prefer*rvt*predict-no*H0
  30655. -->
  30656. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  30657. -->
  30658. (S1 ^operator O2210 = 0.5523809286703978)
  30659. Firing rl*prefer*rvt*predict-no*H0*4
  30660. -->
  30661. (S1 ^operator O2210 = 0.4476194274104922)
  30662. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30663. -->
  30664. inner elaboration loop at bottom goal.
  30665. Retracting rl*prefer*rvt*predict-no*H0*4
  30666. -->
  30667. (S1 ^operator O2208 = 0.4476194274104922)
  30668. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  30669. -->
  30670. (S1 ^operator O2208 = 0.5523809286703978)
  30671. Retracting rl*prefer*rvt*predict-yes*H0*3
  30672. -->
  30673. (S1 ^operator O2207 = 0.1844125221020087)
  30674. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  30675. -->
  30676. (S1 ^operator O2207 = 0.1398795999120246)
  30677. --- END Proposal Phase ---
  30678. --- Decision Phase ---
  30679. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30680. =>WM: (15494: S1 ^operator O2210)
  30681. 1105: O: O2210 (predict-no)
  30682. --- END Decision Phase ---
  30683. --- Application Phase ---
  30684. --- Firing Productions (PE) For State At Depth 1 ---
  30685. --- Inner Elaboration Phase, active level 1 (S1) ---
  30686. Firing apply*operator
  30687. -->
  30688. (I3 ^predict-no N1105 + :O )
  30689. Firing apply*operator*complete
  30690. -->
  30691. (I3 ^predict-no N1104 - :O )
  30692. inner elaboration loop at bottom goal.
  30693. --- Change Working Memory (PE) ---
  30694. =>WM: (15495: I3 ^predict-no N1105)
  30695. <=WM: (15482: N1104 ^status complete)
  30696. <=WM: (15481: I3 ^predict-no N1104)
  30697. --- Firing Productions (IE) For State At Depth 1 ---
  30698. --- Inner Elaboration Phase, active level 1 (S1) ---
  30699. Firing monitor*world
  30700. -->
  30701. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30702. --- Change Working Memory (IE) ---
  30703. --- END Application Phase ---
  30704. --- Output Phase ---
  30705. ENV: Agent did: predict-no for direction R in state State-B
  30706. In State-B moving R
  30707. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30708. predict error 0
  30709. dir: dir isR
  30710. --- END Output Phase ---
  30711. /|\--- Input Phase ---
  30712. =>WM: (15499: I2 ^dir R)
  30713. =>WM: (15498: I2 ^reward 1)
  30714. =>WM: (15497: I2 ^see 0)
  30715. =>WM: (15496: N1105 ^status complete)
  30716. <=WM: (15485: I2 ^dir R)
  30717. <=WM: (15484: I2 ^reward 1)
  30718. <=WM: (15483: I2 ^see 0)
  30719. =>WM: (15500: I2 ^level-1 R0-root)
  30720. <=WM: (15486: I2 ^level-1 R1-root)
  30721. --- END Input Phase ---
  30722. --- Proposal Phase ---
  30723. --- Inner Elaboration Phase, active level 1 (S1) ---
  30724. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  30725. -->
  30726. (S1 ^operator O2209 = 0.1664311307472832)
  30727. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  30728. -->
  30729. (S1 ^operator O2210 = 0.552380328382883)
  30730. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30731. -->
  30732. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30733. -->
  30734. Firing elaborate*copy-see-to-output-link
  30735. -->
  30736. (I3 ^see 0 +)
  30737. Firing elaborate*reward*based*on*reward
  30738. -->
  30739. (R1109 ^value 1 +)
  30740. (R1 ^reward R1109 +)
  30741. Firing propose*predict-yes
  30742. -->
  30743. (O2211 ^name predict-yes +)
  30744. (S1 ^operator O2211 +)
  30745. Firing propose*predict-no
  30746. -->
  30747. (O2212 ^name predict-no +)
  30748. (S1 ^operator O2212 +)
  30749. Firing rl*prefer*rvt*predict-no*H0*4
  30750. -->
  30751. (S1 ^operator O2210 = 0.4476194274104922)
  30752. Firing rl*prefer*rvt*predict-yes*H0*3
  30753. -->
  30754. (S1 ^operator O2209 = 0.1844125221020087)
  30755. Firing prefer*rvt*predict-yes*H0
  30756. -->
  30757. Firing prefer*rvt*predict-no*H0
  30758. -->
  30759. Firing elaborate*copy-dir-to-output-link
  30760. -->
  30761. (I3 ^dir R +)
  30762. inner elaboration loop at bottom goal.
  30763. Retracting elaborate*copy-see-to-output-link
  30764. -->
  30765. (I3 ^see 0 +)
  30766. Retracting propose*predict-no
  30767. -->
  30768. (O2210 ^name predict-no +)
  30769. (S1 ^operator O2210 +)
  30770. Retracting propose*predict-yes
  30771. -->
  30772. (O2209 ^name predict-yes +)
  30773. (S1 ^operator O2209 +)
  30774. Retracting elaborate*reward*based*on*reward
  30775. -->
  30776. (R1108 ^value 1 +)
  30777. (R1 ^reward R1108 +)
  30778. Retracting elaborate*copy-dir-to-output-link
  30779. -->
  30780. (I3 ^dir R +)
  30781. Retracting rl*prefer*rvt*predict-no*H0*4
  30782. -->
  30783. (S1 ^operator O2210 = 0.4476194274104922)
  30784. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  30785. -->
  30786. (S1 ^operator O2210 = 0.5523809286703978)
  30787. Retracting rl*prefer*rvt*predict-yes*H0*3
  30788. -->
  30789. (S1 ^operator O2209 = 0.1844125221020087)
  30790. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  30791. -->
  30792. (S1 ^operator O2209 = 0.1398795999120246)
  30793. =>WM: (15506: S1 ^operator O2212 +)
  30794. =>WM: (15505: S1 ^operator O2211 +)
  30795. =>WM: (15504: O2212 ^name predict-no)
  30796. =>WM: (15503: O2211 ^name predict-yes)
  30797. =>WM: (15502: R1109 ^value 1)
  30798. =>WM: (15501: R1 ^reward R1109)
  30799. <=WM: (15492: S1 ^operator O2209 +)
  30800. <=WM: (15493: S1 ^operator O2210 +)
  30801. <=WM: (15494: S1 ^operator O2210)
  30802. <=WM: (15487: R1 ^reward R1108)
  30803. <=WM: (15490: O2210 ^name predict-no)
  30804. <=WM: (15489: O2209 ^name predict-yes)
  30805. <=WM: (15488: R1108 ^value 1)
  30806. --- Inner Elaboration Phase, active level 1 (S1) ---
  30807. Firing prefer*rvt*predict-yes*H0
  30808. -->
  30809. Firing rl*prefer*rvt*predict-yes*H0*3
  30810. -->
  30811. (S1 ^operator O2211 = 0.1844125221020087)
  30812. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30813. -->
  30814. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  30815. -->
  30816. (S1 ^operator O2211 = 0.1664311307472832)
  30817. Firing prefer*rvt*predict-no*H0
  30818. -->
  30819. Firing rl*prefer*rvt*predict-no*H0*4
  30820. -->
  30821. (S1 ^operator O2212 = 0.4476194274104922)
  30822. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30823. -->
  30824. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  30825. -->
  30826. (S1 ^operator O2212 = 0.552380328382883)
  30827. inner elaboration loop at bottom goal.
  30828. Retracting rl*prefer*rvt*predict-no*H0*4
  30829. -->
  30830. (S1 ^operator O2210 = 0.4476194274104922)
  30831. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  30832. -->
  30833. (S1 ^operator O2210 = 0.552380328382883)
  30834. Retracting rl*prefer*rvt*predict-yes*H0*3
  30835. -->
  30836. (S1 ^operator O2209 = 0.1844125221020087)
  30837. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  30838. -->
  30839. (S1 ^operator O2209 = 0.1664311307472832)
  30840. --- END Proposal Phase ---
  30841. --- Decision Phase ---
  30842. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.938776,0.0578697)
  30843. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  30844. =>WM: (15507: S1 ^operator O2212)
  30845. 1106: O: O2212 (predict-no)
  30846. --- END Decision Phase ---
  30847. --- Application Phase ---
  30848. --- Firing Productions (PE) For State At Depth 1 ---
  30849. --- Inner Elaboration Phase, active level 1 (S1) ---
  30850. Firing apply*operator
  30851. -->
  30852. (I3 ^predict-no N1106 + :O )
  30853. Firing apply*operator*complete
  30854. -->
  30855. (I3 ^predict-no N1105 - :O )
  30856. inner elaboration loop at bottom goal.
  30857. --- Change Working Memory (PE) ---
  30858. =>WM: (15508: I3 ^predict-no N1106)
  30859. <=WM: (15496: N1105 ^status complete)
  30860. <=WM: (15495: I3 ^predict-no N1105)
  30861. --- Firing Productions (IE) For State At Depth 1 ---
  30862. --- Inner Elaboration Phase, active level 1 (S1) ---
  30863. Firing monitor*world
  30864. -->
  30865. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30866. --- Change Working Memory (IE) ---
  30867. --- END Application Phase ---
  30868. --- Output Phase ---
  30869. ENV: Agent did: predict-no for direction R in state State-B
  30870. In State-B moving R
  30871. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30872. predict error 0
  30873. dir: dir isU
  30874. --- END Output Phase ---
  30875. -/|--- Input Phase ---
  30876. =>WM: (15512: I2 ^dir U)
  30877. =>WM: (15511: I2 ^reward 1)
  30878. =>WM: (15510: I2 ^see 0)
  30879. =>WM: (15509: N1106 ^status complete)
  30880. <=WM: (15499: I2 ^dir R)
  30881. <=WM: (15498: I2 ^reward 1)
  30882. <=WM: (15497: I2 ^see 0)
  30883. =>WM: (15513: I2 ^level-1 R0-root)
  30884. <=WM: (15500: I2 ^level-1 R0-root)
  30885. --- END Input Phase ---
  30886. --- Proposal Phase ---
  30887. --- Inner Elaboration Phase, active level 1 (S1) ---
  30888. Firing elaborate*copy-see-to-output-link
  30889. -->
  30890. (I3 ^see 0 +)
  30891. Firing elaborate*reward*based*on*reward
  30892. -->
  30893. (R1110 ^value 1 +)
  30894. (R1 ^reward R1110 +)
  30895. Firing propose*predict-yes
  30896. -->
  30897. (O2213 ^name predict-yes +)
  30898. (S1 ^operator O2213 +)
  30899. Firing propose*predict-no
  30900. -->
  30901. (O2214 ^name predict-no +)
  30902. (S1 ^operator O2214 +)
  30903. Firing rl*prefer*rvt*predict-no*H0*6
  30904. -->
  30905. (S1 ^operator O2212 = 0.9999999999999999)
  30906. Firing rl*prefer*rvt*predict-yes*H0*5
  30907. -->
  30908. (S1 ^operator O2211 = 0.)
  30909. Firing prefer*rvt*predict-yes*H0
  30910. -->
  30911. Firing prefer*rvt*predict-no*H0
  30912. -->
  30913. Firing elaborate*copy-dir-to-output-link
  30914. -->
  30915. (I3 ^dir U +)
  30916. inner elaboration loop at bottom goal.
  30917. Retracting elaborate*copy-see-to-output-link
  30918. -->
  30919. (I3 ^see 0 +)
  30920. Retracting propose*predict-no
  30921. -->
  30922. (O2212 ^name predict-no +)
  30923. (S1 ^operator O2212 +)
  30924. Retracting propose*predict-yes
  30925. -->
  30926. (O2211 ^name predict-yes +)
  30927. (S1 ^operator O2211 +)
  30928. Retracting elaborate*reward*based*on*reward
  30929. -->
  30930. (R1109 ^value 1 +)
  30931. (R1 ^reward R1109 +)
  30932. Retracting elaborate*copy-dir-to-output-link
  30933. -->
  30934. (I3 ^dir R +)
  30935. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  30936. -->
  30937. (S1 ^operator O2212 = 0.552380328382883)
  30938. Retracting rl*prefer*rvt*predict-no*H0*4
  30939. -->
  30940. (S1 ^operator O2212 = 0.4476193739983587)
  30941. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  30942. -->
  30943. (S1 ^operator O2211 = 0.1664311307472832)
  30944. Retracting rl*prefer*rvt*predict-yes*H0*3
  30945. -->
  30946. (S1 ^operator O2211 = 0.1844125221020087)
  30947. =>WM: (15520: S1 ^operator O2214 +)
  30948. =>WM: (15519: S1 ^operator O2213 +)
  30949. =>WM: (15518: I3 ^dir U)
  30950. =>WM: (15517: O2214 ^name predict-no)
  30951. =>WM: (15516: O2213 ^name predict-yes)
  30952. =>WM: (15515: R1110 ^value 1)
  30953. =>WM: (15514: R1 ^reward R1110)
  30954. <=WM: (15505: S1 ^operator O2211 +)
  30955. <=WM: (15506: S1 ^operator O2212 +)
  30956. <=WM: (15507: S1 ^operator O2212)
  30957. <=WM: (15491: I3 ^dir R)
  30958. <=WM: (15501: R1 ^reward R1109)
  30959. <=WM: (15504: O2212 ^name predict-no)
  30960. <=WM: (15503: O2211 ^name predict-yes)
  30961. <=WM: (15502: R1109 ^value 1)
  30962. --- Inner Elaboration Phase, active level 1 (S1) ---
  30963. Firing prefer*rvt*predict-yes*H0
  30964. -->
  30965. Firing rl*prefer*rvt*predict-yes*H0*5
  30966. -->
  30967. (S1 ^operator O2213 = 0.)
  30968. Firing prefer*rvt*predict-no*H0
  30969. -->
  30970. Firing rl*prefer*rvt*predict-no*H0*6
  30971. -->
  30972. (S1 ^operator O2214 = 0.9999999999999999)
  30973. inner elaboration loop at bottom goal.
  30974. Retracting rl*prefer*rvt*predict-no*H0*6
  30975. -->
  30976. (S1 ^operator O2212 = 0.9999999999999999)
  30977. Retracting rl*prefer*rvt*predict-yes*H0*5
  30978. -->
  30979. (S1 ^operator O2211 = 0.)
  30980. --- END Proposal Phase ---
  30981. --- Decision Phase ---
  30982. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.939189,0.0575014)
  30983. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  30984. =>WM: (15521: S1 ^operator O2214)
  30985. 1107: O: O2214 (predict-no)
  30986. --- END Decision Phase ---
  30987. --- Application Phase ---
  30988. --- Firing Productions (PE) For State At Depth 1 ---
  30989. --- Inner Elaboration Phase, active level 1 (S1) ---
  30990. Firing apply*operator
  30991. -->
  30992. (I3 ^predict-no N1107 + :O )
  30993. Firing apply*operator*complete
  30994. -->
  30995. (I3 ^predict-no N1106 - :O )
  30996. inner elaboration loop at bottom goal.
  30997. --- Change Working Memory (PE) ---
  30998. =>WM: (15522: I3 ^predict-no N1107)
  30999. <=WM: (15509: N1106 ^status complete)
  31000. <=WM: (15508: I3 ^predict-no N1106)
  31001. --- Firing Productions (IE) For State At Depth 1 ---
  31002. --- Inner Elaboration Phase, active level 1 (S1) ---
  31003. Firing monitor*world
  31004. -->
  31005. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31006. --- Change Working Memory (IE) ---
  31007. --- END Application Phase ---
  31008. --- Output Phase ---
  31009. ENV: Agent did: predict-no for direction U in state State-B
  31010. In State-B moving U
  31011. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31012. predict error 0
  31013. dir: dir isR
  31014. --- END Output Phase ---
  31015. \-/--- Input Phase ---
  31016. =>WM: (15526: I2 ^dir R)
  31017. =>WM: (15525: I2 ^reward 1)
  31018. =>WM: (15524: I2 ^see 0)
  31019. =>WM: (15523: N1107 ^status complete)
  31020. <=WM: (15512: I2 ^dir U)
  31021. <=WM: (15511: I2 ^reward 1)
  31022. <=WM: (15510: I2 ^see 0)
  31023. =>WM: (15527: I2 ^level-1 R0-root)
  31024. <=WM: (15513: I2 ^level-1 R0-root)
  31025. --- END Input Phase ---
  31026. --- Proposal Phase ---
  31027. --- Inner Elaboration Phase, active level 1 (S1) ---
  31028. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31029. -->
  31030. (S1 ^operator O2213 = 0.1664311307472832)
  31031. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31032. -->
  31033. (S1 ^operator O2214 = 0.5523803730256968)
  31034. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31035. -->
  31036. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31037. -->
  31038. Firing elaborate*copy-see-to-output-link
  31039. -->
  31040. (I3 ^see 0 +)
  31041. Firing elaborate*reward*based*on*reward
  31042. -->
  31043. (R1111 ^value 1 +)
  31044. (R1 ^reward R1111 +)
  31045. Firing propose*predict-yes
  31046. -->
  31047. (O2215 ^name predict-yes +)
  31048. (S1 ^operator O2215 +)
  31049. Firing propose*predict-no
  31050. -->
  31051. (O2216 ^name predict-no +)
  31052. (S1 ^operator O2216 +)
  31053. Firing rl*prefer*rvt*predict-no*H0*4
  31054. -->
  31055. (S1 ^operator O2214 = 0.4476194186411724)
  31056. Firing rl*prefer*rvt*predict-yes*H0*3
  31057. -->
  31058. (S1 ^operator O2213 = 0.1844125221020087)
  31059. Firing prefer*rvt*predict-yes*H0
  31060. -->
  31061. Firing prefer*rvt*predict-no*H0
  31062. -->
  31063. Firing elaborate*copy-dir-to-output-link
  31064. -->
  31065. (I3 ^dir R +)
  31066. inner elaboration loop at bottom goal.
  31067. Retracting elaborate*copy-see-to-output-link
  31068. -->
  31069. (I3 ^see 0 +)
  31070. Retracting propose*predict-no
  31071. -->
  31072. (O2214 ^name predict-no +)
  31073. (S1 ^operator O2214 +)
  31074. Retracting propose*predict-yes
  31075. -->
  31076. (O2213 ^name predict-yes +)
  31077. (S1 ^operator O2213 +)
  31078. Retracting elaborate*reward*based*on*reward
  31079. -->
  31080. (R1110 ^value 1 +)
  31081. (R1 ^reward R1110 +)
  31082. Retracting elaborate*copy-dir-to-output-link
  31083. -->
  31084. (I3 ^dir U +)
  31085. Retracting rl*prefer*rvt*predict-no*H0*6
  31086. -->
  31087. (S1 ^operator O2214 = 0.9999999999999999)
  31088. Retracting rl*prefer*rvt*predict-yes*H0*5
  31089. -->
  31090. (S1 ^operator O2213 = 0.)
  31091. =>WM: (15534: S1 ^operator O2216 +)
  31092. =>WM: (15533: S1 ^operator O2215 +)
  31093. =>WM: (15532: I3 ^dir R)
  31094. =>WM: (15531: O2216 ^name predict-no)
  31095. =>WM: (15530: O2215 ^name predict-yes)
  31096. =>WM: (15529: R1111 ^value 1)
  31097. =>WM: (15528: R1 ^reward R1111)
  31098. <=WM: (15519: S1 ^operator O2213 +)
  31099. <=WM: (15520: S1 ^operator O2214 +)
  31100. <=WM: (15521: S1 ^operator O2214)
  31101. <=WM: (15518: I3 ^dir U)
  31102. <=WM: (15514: R1 ^reward R1110)
  31103. <=WM: (15517: O2214 ^name predict-no)
  31104. <=WM: (15516: O2213 ^name predict-yes)
  31105. <=WM: (15515: R1110 ^value 1)
  31106. --- Inner Elaboration Phase, active level 1 (S1) ---
  31107. Firing prefer*rvt*predict-yes*H0
  31108. -->
  31109. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31110. -->
  31111. (S1 ^operator O2215 = 0.1664311307472832)
  31112. Firing rl*prefer*rvt*predict-yes*H0*3
  31113. -->
  31114. (S1 ^operator O2215 = 0.1844125221020087)
  31115. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31116. -->
  31117. Firing prefer*rvt*predict-no*H0
  31118. -->
  31119. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31120. -->
  31121. (S1 ^operator O2216 = 0.5523803730256968)
  31122. Firing rl*prefer*rvt*predict-no*H0*4
  31123. -->
  31124. (S1 ^operator O2216 = 0.4476194186411724)
  31125. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31126. -->
  31127. inner elaboration loop at bottom goal.
  31128. Retracting rl*prefer*rvt*predict-no*H0*4
  31129. -->
  31130. (S1 ^operator O2214 = 0.4476194186411724)
  31131. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31132. -->
  31133. (S1 ^operator O2214 = 0.5523803730256968)
  31134. Retracting rl*prefer*rvt*predict-yes*H0*3
  31135. -->
  31136. (S1 ^operator O2213 = 0.1844125221020087)
  31137. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31138. -->
  31139. (S1 ^operator O2213 = 0.1664311307472832)
  31140. --- END Proposal Phase ---
  31141. --- Decision Phase ---
  31142. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31143. =>WM: (15535: S1 ^operator O2216)
  31144. 1108: O: O2216 (predict-no)
  31145. --- END Decision Phase ---
  31146. --- Application Phase ---
  31147. --- Firing Productions (PE) For State At Depth 1 ---
  31148. --- Inner Elaboration Phase, active level 1 (S1) ---
  31149. Firing apply*operator
  31150. -->
  31151. (I3 ^predict-no N1108 + :O )
  31152. Firing apply*operator*complete
  31153. -->
  31154. (I3 ^predict-no N1107 - :O )
  31155. inner elaboration loop at bottom goal.
  31156. --- Change Working Memory (PE) ---
  31157. =>WM: (15536: I3 ^predict-no N1108)
  31158. <=WM: (15523: N1107 ^status complete)
  31159. <=WM: (15522: I3 ^predict-no N1107)
  31160. --- Firing Productions (IE) For State At Depth 1 ---
  31161. --- Inner Elaboration Phase, active level 1 (S1) ---
  31162. Firing monitor*world
  31163. -->
  31164. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31165. --- Change Working Memory (IE) ---
  31166. --- END Application Phase ---
  31167. --- Output Phase ---
  31168. ENV: Agent did: predict-no for direction R in state State-B
  31169. In State-B moving R
  31170. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31171. predict error 0
  31172. dir: dir isL
  31173. --- END Output Phase ---
  31174. |\---- Input Phase ---
  31175. =>WM: (15540: I2 ^dir L)
  31176. =>WM: (15539: I2 ^reward 1)
  31177. =>WM: (15538: I2 ^see 0)
  31178. =>WM: (15537: N1108 ^status complete)
  31179. <=WM: (15526: I2 ^dir R)
  31180. <=WM: (15525: I2 ^reward 1)
  31181. <=WM: (15524: I2 ^see 0)
  31182. =>WM: (15541: I2 ^level-1 R0-root)
  31183. <=WM: (15527: I2 ^level-1 R0-root)
  31184. --- END Input Phase ---
  31185. --- Proposal Phase ---
  31186. --- Inner Elaboration Phase, active level 1 (S1) ---
  31187. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  31188. -->
  31189. (S1 ^operator O2215 = 0.610460560608424)
  31190. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  31191. -->
  31192. (S1 ^operator O2216 = 0.1063475139796038)
  31193. Firing prefer*rvt*predict-no*H0*2*v1*H1
  31194. -->
  31195. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  31196. -->
  31197. Firing elaborate*copy-see-to-output-link
  31198. -->
  31199. (I3 ^see 0 +)
  31200. Firing elaborate*reward*based*on*reward
  31201. -->
  31202. (R1112 ^value 1 +)
  31203. (R1 ^reward R1112 +)
  31204. Firing propose*predict-yes
  31205. -->
  31206. (O2217 ^name predict-yes +)
  31207. (S1 ^operator O2217 +)
  31208. Firing propose*predict-no
  31209. -->
  31210. (O2218 ^name predict-no +)
  31211. (S1 ^operator O2218 +)
  31212. Firing rl*prefer*rvt*predict-no*H0*2
  31213. -->
  31214. (S1 ^operator O2216 = 0.3873354925260269)
  31215. Firing rl*prefer*rvt*predict-yes*H0*1
  31216. -->
  31217. (S1 ^operator O2215 = 0.3895396290816568)
  31218. Firing prefer*rvt*predict-yes*H0
  31219. -->
  31220. Firing prefer*rvt*predict-no*H0
  31221. -->
  31222. Firing elaborate*copy-dir-to-output-link
  31223. -->
  31224. (I3 ^dir L +)
  31225. inner elaboration loop at bottom goal.
  31226. Retracting elaborate*copy-see-to-output-link
  31227. -->
  31228. (I3 ^see 0 +)
  31229. Retracting propose*predict-no
  31230. -->
  31231. (O2216 ^name predict-no +)
  31232. (S1 ^operator O2216 +)
  31233. Retracting propose*predict-yes
  31234. -->
  31235. (O2215 ^name predict-yes +)
  31236. (S1 ^operator O2215 +)
  31237. Retracting elaborate*reward*based*on*reward
  31238. -->
  31239. (R1111 ^value 1 +)
  31240. (R1 ^reward R1111 +)
  31241. Retracting elaborate*copy-dir-to-output-link
  31242. -->
  31243. (I3 ^dir R +)
  31244. Retracting rl*prefer*rvt*predict-no*H0*4
  31245. -->
  31246. (S1 ^operator O2216 = 0.4476194186411724)
  31247. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31248. -->
  31249. (S1 ^operator O2216 = 0.5523803730256968)
  31250. Retracting rl*prefer*rvt*predict-yes*H0*3
  31251. -->
  31252. (S1 ^operator O2215 = 0.1844125221020087)
  31253. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31254. -->
  31255. (S1 ^operator O2215 = 0.1664311307472832)
  31256. =>WM: (15548: S1 ^operator O2218 +)
  31257. =>WM: (15547: S1 ^operator O2217 +)
  31258. =>WM: (15546: I3 ^dir L)
  31259. =>WM: (15545: O2218 ^name predict-no)
  31260. =>WM: (15544: O2217 ^name predict-yes)
  31261. =>WM: (15543: R1112 ^value 1)
  31262. =>WM: (15542: R1 ^reward R1112)
  31263. <=WM: (15533: S1 ^operator O2215 +)
  31264. <=WM: (15534: S1 ^operator O2216 +)
  31265. <=WM: (15535: S1 ^operator O2216)
  31266. <=WM: (15532: I3 ^dir R)
  31267. <=WM: (15528: R1 ^reward R1111)
  31268. <=WM: (15531: O2216 ^name predict-no)
  31269. <=WM: (15530: O2215 ^name predict-yes)
  31270. <=WM: (15529: R1111 ^value 1)
  31271. --- Inner Elaboration Phase, active level 1 (S1) ---
  31272. Firing prefer*rvt*predict-yes*H0
  31273. -->
  31274. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  31275. -->
  31276. (S1 ^operator O2217 = 0.610460560608424)
  31277. Firing rl*prefer*rvt*predict-yes*H0*1
  31278. -->
  31279. (S1 ^operator O2217 = 0.3895396290816568)
  31280. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  31281. -->
  31282. Firing prefer*rvt*predict-no*H0
  31283. -->
  31284. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  31285. -->
  31286. (S1 ^operator O2218 = 0.1063475139796038)
  31287. Firing rl*prefer*rvt*predict-no*H0*2
  31288. -->
  31289. (S1 ^operator O2218 = 0.3873354925260269)
  31290. Firing prefer*rvt*predict-no*H0*2*v1*H1
  31291. -->
  31292. inner elaboration loop at bottom goal.
  31293. Retracting rl*prefer*rvt*predict-no*H0*2
  31294. -->
  31295. (S1 ^operator O2216 = 0.3873354925260269)
  31296. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  31297. -->
  31298. (S1 ^operator O2216 = 0.1063475139796038)
  31299. Retracting rl*prefer*rvt*predict-yes*H0*1
  31300. -->
  31301. (S1 ^operator O2215 = 0.3895396290816568)
  31302. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  31303. -->
  31304. (S1 ^operator O2215 = 0.610460560608424)
  31305. --- END Proposal Phase ---
  31306. --- Decision Phase ---
  31307. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.939597,0.0571377)
  31308. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  31309. =>WM: (15549: S1 ^operator O2217)
  31310. 1109: O: O2217 (predict-yes)
  31311. --- END Decision Phase ---
  31312. --- Application Phase ---
  31313. --- Firing Productions (PE) For State At Depth 1 ---
  31314. --- Inner Elaboration Phase, active level 1 (S1) ---
  31315. Firing apply*operator
  31316. -->
  31317. (I3 ^predict-yes N1109 + :O )
  31318. Firing apply*operator*complete
  31319. -->
  31320. (I3 ^predict-no N1108 - :O )
  31321. inner elaboration loop at bottom goal.
  31322. --- Change Working Memory (PE) ---
  31323. =>WM: (15550: I3 ^predict-yes N1109)
  31324. <=WM: (15537: N1108 ^status complete)
  31325. <=WM: (15536: I3 ^predict-no N1108)
  31326. --- Firing Productions (IE) For State At Depth 1 ---
  31327. --- Inner Elaboration Phase, active level 1 (S1) ---
  31328. Firing monitor*world
  31329. -->
  31330. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31331. --- Change Working Memory (IE) ---
  31332. --- END Application Phase ---
  31333. --- Output Phase ---
  31334. ENV: Agent did: predict-yes for direction L in state State-B
  31335. In State-B moving L
  31336. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  31337. predict error 0
  31338. dir: dir isR
  31339. --- END Output Phase ---
  31340. /|--- Input Phase ---
  31341. =>WM: (15554: I2 ^dir R)
  31342. =>WM: (15553: I2 ^reward 1)
  31343. =>WM: (15552: I2 ^see 1)
  31344. =>WM: (15551: N1109 ^status complete)
  31345. <=WM: (15540: I2 ^dir L)
  31346. <=WM: (15539: I2 ^reward 1)
  31347. <=WM: (15538: I2 ^see 0)
  31348. =>WM: (15555: I2 ^level-1 L1-root)
  31349. <=WM: (15541: I2 ^level-1 R0-root)
  31350. --- END Input Phase ---
  31351. --- Proposal Phase ---
  31352. --- Inner Elaboration Phase, active level 1 (S1) ---
  31353. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  31354. -->
  31355. (S1 ^operator O2218 = -0.02155734064455064)
  31356. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  31357. -->
  31358. (S1 ^operator O2217 = 0.8155867202756886)
  31359. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31360. -->
  31361. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31362. -->
  31363. Firing elaborate*copy-see-to-output-link
  31364. -->
  31365. (I3 ^see 1 +)
  31366. Firing elaborate*reward*based*on*reward
  31367. -->
  31368. (R1113 ^value 1 +)
  31369. (R1 ^reward R1113 +)
  31370. Firing propose*predict-yes
  31371. -->
  31372. (O2219 ^name predict-yes +)
  31373. (S1 ^operator O2219 +)
  31374. Firing propose*predict-no
  31375. -->
  31376. (O2220 ^name predict-no +)
  31377. (S1 ^operator O2220 +)
  31378. Firing rl*prefer*rvt*predict-no*H0*4
  31379. -->
  31380. (S1 ^operator O2218 = 0.447619449891142)
  31381. Firing rl*prefer*rvt*predict-yes*H0*3
  31382. -->
  31383. (S1 ^operator O2217 = 0.1844125221020087)
  31384. Firing prefer*rvt*predict-yes*H0
  31385. -->
  31386. Firing prefer*rvt*predict-no*H0
  31387. -->
  31388. Firing elaborate*copy-dir-to-output-link
  31389. -->
  31390. (I3 ^dir R +)
  31391. inner elaboration loop at bottom goal.
  31392. Retracting elaborate*copy-see-to-output-link
  31393. -->
  31394. (I3 ^see 0 +)
  31395. Retracting propose*predict-no
  31396. -->
  31397. (O2218 ^name predict-no +)
  31398. (S1 ^operator O2218 +)
  31399. Retracting propose*predict-yes
  31400. -->
  31401. (O2217 ^name predict-yes +)
  31402. (S1 ^operator O2217 +)
  31403. Retracting elaborate*reward*based*on*reward
  31404. -->
  31405. (R1112 ^value 1 +)
  31406. (R1 ^reward R1112 +)
  31407. Retracting elaborate*copy-dir-to-output-link
  31408. -->
  31409. (I3 ^dir L +)
  31410. Retracting rl*prefer*rvt*predict-no*H0*2
  31411. -->
  31412. (S1 ^operator O2218 = 0.3873354925260269)
  31413. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  31414. -->
  31415. (S1 ^operator O2218 = 0.1063475139796038)
  31416. Retracting rl*prefer*rvt*predict-yes*H0*1
  31417. -->
  31418. (S1 ^operator O2217 = 0.3895396290816568)
  31419. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  31420. -->
  31421. (S1 ^operator O2217 = 0.610460560608424)
  31422. =>WM: (15563: S1 ^operator O2220 +)
  31423. =>WM: (15562: S1 ^operator O2219 +)
  31424. =>WM: (15561: I3 ^dir R)
  31425. =>WM: (15560: O2220 ^name predict-no)
  31426. =>WM: (15559: O2219 ^name predict-yes)
  31427. =>WM: (15558: R1113 ^value 1)
  31428. =>WM: (15557: R1 ^reward R1113)
  31429. =>WM: (15556: I3 ^see 1)
  31430. <=WM: (15547: S1 ^operator O2217 +)
  31431. <=WM: (15549: S1 ^operator O2217)
  31432. <=WM: (15548: S1 ^operator O2218 +)
  31433. <=WM: (15546: I3 ^dir L)
  31434. <=WM: (15542: R1 ^reward R1112)
  31435. <=WM: (15473: I3 ^see 0)
  31436. <=WM: (15545: O2218 ^name predict-no)
  31437. <=WM: (15544: O2217 ^name predict-yes)
  31438. <=WM: (15543: R1112 ^value 1)
  31439. --- Inner Elaboration Phase, active level 1 (S1) ---
  31440. Firing prefer*rvt*predict-yes*H0
  31441. -->
  31442. Firing rl*prefer*rvt*predict-yes*H0*3
  31443. -->
  31444. (S1 ^operator O2219 = 0.1844125221020087)
  31445. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31446. -->
  31447. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  31448. -->
  31449. (S1 ^operator O2219 = 0.8155867202756886)
  31450. Firing prefer*rvt*predict-no*H0
  31451. -->
  31452. Firing rl*prefer*rvt*predict-no*H0*4
  31453. -->
  31454. (S1 ^operator O2220 = 0.447619449891142)
  31455. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31456. -->
  31457. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  31458. -->
  31459. (S1 ^operator O2220 = -0.02155734064455064)
  31460. inner elaboration loop at bottom goal.
  31461. Retracting rl*prefer*rvt*predict-no*H0*4
  31462. -->
  31463. (S1 ^operator O2218 = 0.447619449891142)
  31464. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  31465. -->
  31466. (S1 ^operator O2218 = -0.02155734064455064)
  31467. Retracting rl*prefer*rvt*predict-yes*H0*3
  31468. -->
  31469. (S1 ^operator O2217 = 0.1844125221020087)
  31470. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  31471. -->
  31472. (S1 ^operator O2217 = 0.8155867202756886)
  31473. --- END Proposal Phase ---
  31474. --- Decision Phase ---
  31475. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.902703,0.0883079)
  31476. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  31477. =>WM: (15564: S1 ^operator O2219)
  31478. 1110: O: O2219 (predict-yes)
  31479. --- END Decision Phase ---
  31480. --- Application Phase ---
  31481. --- Firing Productions (PE) For State At Depth 1 ---
  31482. --- Inner Elaboration Phase, active level 1 (S1) ---
  31483. Firing apply*operator
  31484. -->
  31485. (I3 ^predict-yes N1110 + :O )
  31486. Firing apply*operator*complete
  31487. -->
  31488. (I3 ^predict-yes N1109 - :O )
  31489. inner elaboration loop at bottom goal.
  31490. --- Change Working Memory (PE) ---
  31491. =>WM: (15565: I3 ^predict-yes N1110)
  31492. <=WM: (15551: N1109 ^status complete)
  31493. <=WM: (15550: I3 ^predict-yes N1109)
  31494. --- Firing Productions (IE) For State At Depth 1 ---
  31495. --- Inner Elaboration Phase, active level 1 (S1) ---
  31496. Firing monitor*world
  31497. -->
  31498. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31499. --- Change Working Memory (IE) ---
  31500. --- END Application Phase ---
  31501. --- Output Phase ---
  31502. ENV: Agent did: predict-yes for direction R in state State-A
  31503. In State-A moving R
  31504. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  31505. predict error 0
  31506. dir: dir isR
  31507. --- END Output Phase ---
  31508. \-/--- Input Phase ---
  31509. =>WM: (15569: I2 ^dir R)
  31510. =>WM: (15568: I2 ^reward 1)
  31511. =>WM: (15567: I2 ^see 1)
  31512. =>WM: (15566: N1110 ^status complete)
  31513. <=WM: (15554: I2 ^dir R)
  31514. <=WM: (15553: I2 ^reward 1)
  31515. <=WM: (15552: I2 ^see 1)
  31516. =>WM: (15570: I2 ^level-1 R1-root)
  31517. <=WM: (15555: I2 ^level-1 L1-root)
  31518. --- END Input Phase ---
  31519. --- Proposal Phase ---
  31520. --- Inner Elaboration Phase, active level 1 (S1) ---
  31521. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  31522. -->
  31523. (S1 ^operator O2219 = 0.1398795999120246)
  31524. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  31525. -->
  31526. (S1 ^operator O2220 = 0.5523808752582643)
  31527. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31528. -->
  31529. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31530. -->
  31531. Firing elaborate*copy-see-to-output-link
  31532. -->
  31533. (I3 ^see 1 +)
  31534. Firing elaborate*reward*based*on*reward
  31535. -->
  31536. (R1114 ^value 1 +)
  31537. (R1 ^reward R1114 +)
  31538. Firing propose*predict-yes
  31539. -->
  31540. (O2221 ^name predict-yes +)
  31541. (S1 ^operator O2221 +)
  31542. Firing propose*predict-no
  31543. -->
  31544. (O2222 ^name predict-no +)
  31545. (S1 ^operator O2222 +)
  31546. Firing rl*prefer*rvt*predict-no*H0*4
  31547. -->
  31548. (S1 ^operator O2220 = 0.447619449891142)
  31549. Firing rl*prefer*rvt*predict-yes*H0*3
  31550. -->
  31551. (S1 ^operator O2219 = 0.1844125221020087)
  31552. Firing prefer*rvt*predict-yes*H0
  31553. -->
  31554. Firing prefer*rvt*predict-no*H0
  31555. -->
  31556. Firing elaborate*copy-dir-to-output-link
  31557. -->
  31558. (I3 ^dir R +)
  31559. inner elaboration loop at bottom goal.
  31560. Retracting elaborate*copy-see-to-output-link
  31561. -->
  31562. (I3 ^see 1 +)
  31563. Retracting propose*predict-no
  31564. -->
  31565. (O2220 ^name predict-no +)
  31566. (S1 ^operator O2220 +)
  31567. Retracting propose*predict-yes
  31568. -->
  31569. (O2219 ^name predict-yes +)
  31570. (S1 ^operator O2219 +)
  31571. Retracting elaborate*reward*based*on*reward
  31572. -->
  31573. (R1113 ^value 1 +)
  31574. (R1 ^reward R1113 +)
  31575. Retracting elaborate*copy-dir-to-output-link
  31576. -->
  31577. (I3 ^dir R +)
  31578. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  31579. -->
  31580. (S1 ^operator O2220 = -0.02155734064455064)
  31581. Retracting rl*prefer*rvt*predict-no*H0*4
  31582. -->
  31583. (S1 ^operator O2220 = 0.447619449891142)
  31584. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  31585. -->
  31586. (S1 ^operator O2219 = 0.8155867202756886)
  31587. Retracting rl*prefer*rvt*predict-yes*H0*3
  31588. -->
  31589. (S1 ^operator O2219 = 0.1844125221020087)
  31590. =>WM: (15576: S1 ^operator O2222 +)
  31591. =>WM: (15575: S1 ^operator O2221 +)
  31592. =>WM: (15574: O2222 ^name predict-no)
  31593. =>WM: (15573: O2221 ^name predict-yes)
  31594. =>WM: (15572: R1114 ^value 1)
  31595. =>WM: (15571: R1 ^reward R1114)
  31596. <=WM: (15562: S1 ^operator O2219 +)
  31597. <=WM: (15564: S1 ^operator O2219)
  31598. <=WM: (15563: S1 ^operator O2220 +)
  31599. <=WM: (15557: R1 ^reward R1113)
  31600. <=WM: (15560: O2220 ^name predict-no)
  31601. <=WM: (15559: O2219 ^name predict-yes)
  31602. <=WM: (15558: R1113 ^value 1)
  31603. --- Inner Elaboration Phase, active level 1 (S1) ---
  31604. Firing prefer*rvt*predict-yes*H0
  31605. -->
  31606. Firing rl*prefer*rvt*predict-yes*H0*3
  31607. -->
  31608. (S1 ^operator O2221 = 0.1844125221020087)
  31609. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31610. -->
  31611. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  31612. -->
  31613. (S1 ^operator O2221 = 0.1398795999120246)
  31614. Firing prefer*rvt*predict-no*H0
  31615. -->
  31616. Firing rl*prefer*rvt*predict-no*H0*4
  31617. -->
  31618. (S1 ^operator O2222 = 0.447619449891142)
  31619. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31620. -->
  31621. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  31622. -->
  31623. (S1 ^operator O2222 = 0.5523808752582643)
  31624. inner elaboration loop at bottom goal.
  31625. Retracting rl*prefer*rvt*predict-no*H0*4
  31626. -->
  31627. (S1 ^operator O2220 = 0.447619449891142)
  31628. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  31629. -->
  31630. (S1 ^operator O2220 = 0.5523808752582643)
  31631. Retracting rl*prefer*rvt*predict-yes*H0*3
  31632. -->
  31633. (S1 ^operator O2219 = 0.1844125221020087)
  31634. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  31635. -->
  31636. (S1 ^operator O2219 = 0.1398795999120246)
  31637. --- END Proposal Phase ---
  31638. --- Decision Phase ---
  31639. RL update rl*prefer*rvt*predict-yes*H0*3 0.675416 -0.491003 0.184413 -> 0.675416 -0.491003 0.184413(R,m,v=1,0.909574,0.0826886)
  31640. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324583 0.491003 0.815587 -> 0.324584 0.491003 0.815587(R,m,v=1,1,0)
  31641. =>WM: (15577: S1 ^operator O2222)
  31642. 1111: O: O2222 (predict-no)
  31643. --- END Decision Phase ---
  31644. --- Application Phase ---
  31645. --- Firing Productions (PE) For State At Depth 1 ---
  31646. --- Inner Elaboration Phase, active level 1 (S1) ---
  31647. Firing apply*operator
  31648. -->
  31649. (I3 ^predict-no N1111 + :O )
  31650. Firing apply*operator*complete
  31651. -->
  31652. (I3 ^predict-yes N1110 - :O )
  31653. inner elaboration loop at bottom goal.
  31654. --- Change Working Memory (PE) ---
  31655. =>WM: (15578: I3 ^predict-no N1111)
  31656. <=WM: (15566: N1110 ^status complete)
  31657. <=WM: (15565: I3 ^predict-yes N1110)
  31658. --- Firing Productions (IE) For State At Depth 1 ---
  31659. --- Inner Elaboration Phase, active level 1 (S1) ---
  31660. Firing monitor*world
  31661. -->
  31662. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31663. --- Change Working Memory (IE) ---
  31664. --- END Application Phase ---
  31665. --- Output Phase ---
  31666. ENV: Agent did: predict-no for direction R in state State-B
  31667. In State-B moving R
  31668. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31669. predict error 0
  31670. dir: dir isU
  31671. --- END Output Phase ---
  31672. |--- Input Phase ---
  31673. =>WM: (15582: I2 ^dir U)
  31674. =>WM: (15581: I2 ^reward 1)
  31675. =>WM: (15580: I2 ^see 0)
  31676. =>WM: (15579: N1111 ^status complete)
  31677. <=WM: (15569: I2 ^dir R)
  31678. <=WM: (15568: I2 ^reward 1)
  31679. <=WM: (15567: I2 ^see 1)
  31680. =>WM: (15583: I2 ^level-1 R0-root)
  31681. <=WM: (15570: I2 ^level-1 R1-root)
  31682. --- END Input Phase ---
  31683. --- Proposal Phase ---
  31684. --- Inner Elaboration Phase, active level 1 (S1) ---
  31685. Firing elaborate*copy-see-to-output-link
  31686. -->
  31687. (I3 ^see 0 +)
  31688. Firing elaborate*reward*based*on*reward
  31689. -->
  31690. (R1115 ^value 1 +)
  31691. (R1 ^reward R1115 +)
  31692. Firing propose*predict-yes
  31693. -->
  31694. (O2223 ^name predict-yes +)
  31695. (S1 ^operator O2223 +)
  31696. Firing propose*predict-no
  31697. -->
  31698. (O2224 ^name predict-no +)
  31699. (S1 ^operator O2224 +)
  31700. Firing rl*prefer*rvt*predict-no*H0*6
  31701. -->
  31702. (S1 ^operator O2222 = 0.9999999999999999)
  31703. Firing rl*prefer*rvt*predict-yes*H0*5
  31704. -->
  31705. (S1 ^operator O2221 = 0.)
  31706. Firing prefer*rvt*predict-yes*H0
  31707. -->
  31708. Firing prefer*rvt*predict-no*H0
  31709. -->
  31710. Firing elaborate*copy-dir-to-output-link
  31711. -->
  31712. (I3 ^dir U +)
  31713. inner elaboration loop at bottom goal.
  31714. Retracting elaborate*copy-see-to-output-link
  31715. -->
  31716. (I3 ^see 1 +)
  31717. Retracting propose*predict-no
  31718. -->
  31719. (O2222 ^name predict-no +)
  31720. (S1 ^operator O2222 +)
  31721. Retracting propose*predict-yes
  31722. -->
  31723. (O2221 ^name predict-yes +)
  31724. (S1 ^operator O2221 +)
  31725. Retracting elaborate*reward*based*on*reward
  31726. -->
  31727. (R1114 ^value 1 +)
  31728. (R1 ^reward R1114 +)
  31729. Retracting elaborate*copy-dir-to-output-link
  31730. -->
  31731. (I3 ^dir R +)
  31732. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  31733. -->
  31734. (S1 ^operator O2222 = 0.5523808752582643)
  31735. Retracting rl*prefer*rvt*predict-no*H0*4
  31736. -->
  31737. (S1 ^operator O2222 = 0.447619449891142)
  31738. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  31739. -->
  31740. (S1 ^operator O2221 = 0.1398795999120246)
  31741. Retracting rl*prefer*rvt*predict-yes*H0*3
  31742. -->
  31743. (S1 ^operator O2221 = 0.1844126357453541)
  31744. =>WM: (15591: S1 ^operator O2224 +)
  31745. =>WM: (15590: S1 ^operator O2223 +)
  31746. =>WM: (15589: I3 ^dir U)
  31747. =>WM: (15588: O2224 ^name predict-no)
  31748. =>WM: (15587: O2223 ^name predict-yes)
  31749. =>WM: (15586: R1115 ^value 1)
  31750. =>WM: (15585: R1 ^reward R1115)
  31751. =>WM: (15584: I3 ^see 0)
  31752. <=WM: (15575: S1 ^operator O2221 +)
  31753. <=WM: (15576: S1 ^operator O2222 +)
  31754. <=WM: (15577: S1 ^operator O2222)
  31755. <=WM: (15561: I3 ^dir R)
  31756. <=WM: (15571: R1 ^reward R1114)
  31757. <=WM: (15556: I3 ^see 1)
  31758. <=WM: (15574: O2222 ^name predict-no)
  31759. <=WM: (15573: O2221 ^name predict-yes)
  31760. <=WM: (15572: R1114 ^value 1)
  31761. --- Inner Elaboration Phase, active level 1 (S1) ---
  31762. Firing prefer*rvt*predict-yes*H0
  31763. -->
  31764. Firing rl*prefer*rvt*predict-yes*H0*5
  31765. -->
  31766. (S1 ^operator O2223 = 0.)
  31767. Firing prefer*rvt*predict-no*H0
  31768. -->
  31769. Firing rl*prefer*rvt*predict-no*H0*6
  31770. -->
  31771. (S1 ^operator O2224 = 0.9999999999999999)
  31772. inner elaboration loop at bottom goal.
  31773. Retracting rl*prefer*rvt*predict-no*H0*6
  31774. -->
  31775. (S1 ^operator O2222 = 0.9999999999999999)
  31776. Retracting rl*prefer*rvt*predict-yes*H0*5
  31777. -->
  31778. (S1 ^operator O2221 = 0.)
  31779. --- END Proposal Phase ---
  31780. --- Decision Phase ---
  31781. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.94,0.0567785)
  31782. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  31783. =>WM: (15592: S1 ^operator O2224)
  31784. 1112: O: O2224 (predict-no)
  31785. --- END Decision Phase ---
  31786. --- Application Phase ---
  31787. --- Firing Productions (PE) For State At Depth 1 ---
  31788. --- Inner Elaboration Phase, active level 1 (S1) ---
  31789. Firing apply*operator
  31790. -->
  31791. (I3 ^predict-no N1112 + :O )
  31792. Firing apply*operator*complete
  31793. -->
  31794. (I3 ^predict-no N1111 - :O )
  31795. inner elaboration loop at bottom goal.
  31796. --- Change Working Memory (PE) ---
  31797. =>WM: (15593: I3 ^predict-no N1112)
  31798. <=WM: (15579: N1111 ^status complete)
  31799. <=WM: (15578: I3 ^predict-no N1111)
  31800. --- Firing Productions (IE) For State At Depth 1 ---
  31801. --- Inner Elaboration Phase, active level 1 (S1) ---
  31802. Firing monitor*world
  31803. -->
  31804. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31805. --- Change Working Memory (IE) ---
  31806. --- END Application Phase ---
  31807. --- Output Phase ---
  31808. ENV: Agent did: predict-no for direction U in state State-B
  31809. In State-B moving U
  31810. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31811. predict error 0
  31812. dir: dir isR
  31813. --- END Output Phase ---
  31814. \-/--- Input Phase ---
  31815. =>WM: (15597: I2 ^dir R)
  31816. =>WM: (15596: I2 ^reward 1)
  31817. =>WM: (15595: I2 ^see 0)
  31818. =>WM: (15594: N1112 ^status complete)
  31819. <=WM: (15582: I2 ^dir U)
  31820. <=WM: (15581: I2 ^reward 1)
  31821. <=WM: (15580: I2 ^see 0)
  31822. =>WM: (15598: I2 ^level-1 R0-root)
  31823. <=WM: (15583: I2 ^level-1 R0-root)
  31824. --- END Input Phase ---
  31825. --- Proposal Phase ---
  31826. --- Inner Elaboration Phase, active level 1 (S1) ---
  31827. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31828. -->
  31829. (S1 ^operator O2223 = 0.1664311307472832)
  31830. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31831. -->
  31832. (S1 ^operator O2224 = 0.5523804042756664)
  31833. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31834. -->
  31835. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31836. -->
  31837. Firing elaborate*copy-see-to-output-link
  31838. -->
  31839. (I3 ^see 0 +)
  31840. Firing elaborate*reward*based*on*reward
  31841. -->
  31842. (R1116 ^value 1 +)
  31843. (R1 ^reward R1116 +)
  31844. Firing propose*predict-yes
  31845. -->
  31846. (O2225 ^name predict-yes +)
  31847. (S1 ^operator O2225 +)
  31848. Firing propose*predict-no
  31849. -->
  31850. (O2226 ^name predict-no +)
  31851. (S1 ^operator O2226 +)
  31852. Firing rl*prefer*rvt*predict-no*H0*4
  31853. -->
  31854. (S1 ^operator O2224 = 0.447619401118731)
  31855. Firing rl*prefer*rvt*predict-yes*H0*3
  31856. -->
  31857. (S1 ^operator O2223 = 0.1844126357453541)
  31858. Firing prefer*rvt*predict-yes*H0
  31859. -->
  31860. Firing prefer*rvt*predict-no*H0
  31861. -->
  31862. Firing elaborate*copy-dir-to-output-link
  31863. -->
  31864. (I3 ^dir R +)
  31865. inner elaboration loop at bottom goal.
  31866. Retracting elaborate*copy-see-to-output-link
  31867. -->
  31868. (I3 ^see 0 +)
  31869. Retracting propose*predict-no
  31870. -->
  31871. (O2224 ^name predict-no +)
  31872. (S1 ^operator O2224 +)
  31873. Retracting propose*predict-yes
  31874. -->
  31875. (O2223 ^name predict-yes +)
  31876. (S1 ^operator O2223 +)
  31877. Retracting elaborate*reward*based*on*reward
  31878. -->
  31879. (R1115 ^value 1 +)
  31880. (R1 ^reward R1115 +)
  31881. Retracting elaborate*copy-dir-to-output-link
  31882. -->
  31883. (I3 ^dir U +)
  31884. Retracting rl*prefer*rvt*predict-no*H0*6
  31885. -->
  31886. (S1 ^operator O2224 = 0.9999999999999999)
  31887. Retracting rl*prefer*rvt*predict-yes*H0*5
  31888. -->
  31889. (S1 ^operator O2223 = 0.)
  31890. =>WM: (15605: S1 ^operator O2226 +)
  31891. =>WM: (15604: S1 ^operator O2225 +)
  31892. =>WM: (15603: I3 ^dir R)
  31893. =>WM: (15602: O2226 ^name predict-no)
  31894. =>WM: (15601: O2225 ^name predict-yes)
  31895. =>WM: (15600: R1116 ^value 1)
  31896. =>WM: (15599: R1 ^reward R1116)
  31897. <=WM: (15590: S1 ^operator O2223 +)
  31898. <=WM: (15591: S1 ^operator O2224 +)
  31899. <=WM: (15592: S1 ^operator O2224)
  31900. <=WM: (15589: I3 ^dir U)
  31901. <=WM: (15585: R1 ^reward R1115)
  31902. <=WM: (15588: O2224 ^name predict-no)
  31903. <=WM: (15587: O2223 ^name predict-yes)
  31904. <=WM: (15586: R1115 ^value 1)
  31905. --- Inner Elaboration Phase, active level 1 (S1) ---
  31906. Firing prefer*rvt*predict-yes*H0
  31907. -->
  31908. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31909. -->
  31910. (S1 ^operator O2225 = 0.1664311307472832)
  31911. Firing rl*prefer*rvt*predict-yes*H0*3
  31912. -->
  31913. (S1 ^operator O2225 = 0.1844126357453541)
  31914. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31915. -->
  31916. Firing prefer*rvt*predict-no*H0
  31917. -->
  31918. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31919. -->
  31920. (S1 ^operator O2226 = 0.5523804042756664)
  31921. Firing rl*prefer*rvt*predict-no*H0*4
  31922. -->
  31923. (S1 ^operator O2226 = 0.447619401118731)
  31924. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31925. -->
  31926. inner elaboration loop at bottom goal.
  31927. Retracting rl*prefer*rvt*predict-no*H0*4
  31928. -->
  31929. (S1 ^operator O2224 = 0.447619401118731)
  31930. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  31931. -->
  31932. (S1 ^operator O2224 = 0.5523804042756664)
  31933. Retracting rl*prefer*rvt*predict-yes*H0*3
  31934. -->
  31935. (S1 ^operator O2223 = 0.1844126357453541)
  31936. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  31937. -->
  31938. (S1 ^operator O2223 = 0.1664311307472832)
  31939. --- END Proposal Phase ---
  31940. --- Decision Phase ---
  31941. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31942. =>WM: (15606: S1 ^operator O2226)
  31943. 1113: O: O2226 (predict-no)
  31944. --- END Decision Phase ---
  31945. --- Application Phase ---
  31946. --- Firing Productions (PE) For State At Depth 1 ---
  31947. --- Inner Elaboration Phase, active level 1 (S1) ---
  31948. Firing apply*operator
  31949. -->
  31950. (I3 ^predict-no N1113 + :O )
  31951. Firing apply*operator*complete
  31952. -->
  31953. (I3 ^predict-no N1112 - :O )
  31954. inner elaboration loop at bottom goal.
  31955. --- Change Working Memory (PE) ---
  31956. =>WM: (15607: I3 ^predict-no N1113)
  31957. <=WM: (15594: N1112 ^status complete)
  31958. <=WM: (15593: I3 ^predict-no N1112)
  31959. --- Firing Productions (IE) For State At Depth 1 ---
  31960. --- Inner Elaboration Phase, active level 1 (S1) ---
  31961. Firing monitor*world
  31962. -->
  31963. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31964. --- Change Working Memory (IE) ---
  31965. --- END Application Phase ---
  31966. --- Output Phase ---
  31967. ENV: Agent did: predict-no for direction R in state State-B
  31968. In State-B moving R
  31969. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31970. predict error 0
  31971. dir: dir isL
  31972. --- END Output Phase ---
  31973. |\---- Input Phase ---
  31974. =>WM: (15611: I2 ^dir L)
  31975. =>WM: (15610: I2 ^reward 1)
  31976. =>WM: (15609: I2 ^see 0)
  31977. =>WM: (15608: N1113 ^status complete)
  31978. <=WM: (15597: I2 ^dir R)
  31979. <=WM: (15596: I2 ^reward 1)
  31980. <=WM: (15595: I2 ^see 0)
  31981. =>WM: (15612: I2 ^level-1 R0-root)
  31982. <=WM: (15598: I2 ^level-1 R0-root)
  31983. --- END Input Phase ---
  31984. --- Proposal Phase ---
  31985. --- Inner Elaboration Phase, active level 1 (S1) ---
  31986. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  31987. -->
  31988. (S1 ^operator O2225 = 0.6104605321549119)
  31989. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  31990. -->
  31991. (S1 ^operator O2226 = 0.1063475139796038)
  31992. Firing prefer*rvt*predict-no*H0*2*v1*H1
  31993. -->
  31994. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  31995. -->
  31996. Firing elaborate*copy-see-to-output-link
  31997. -->
  31998. (I3 ^see 0 +)
  31999. Firing elaborate*reward*based*on*reward
  32000. -->
  32001. (R1117 ^value 1 +)
  32002. (R1 ^reward R1117 +)
  32003. Firing propose*predict-yes
  32004. -->
  32005. (O2227 ^name predict-yes +)
  32006. (S1 ^operator O2227 +)
  32007. Firing propose*predict-no
  32008. -->
  32009. (O2228 ^name predict-no +)
  32010. (S1 ^operator O2228 +)
  32011. Firing rl*prefer*rvt*predict-no*H0*2
  32012. -->
  32013. (S1 ^operator O2226 = 0.3873354925260269)
  32014. Firing rl*prefer*rvt*predict-yes*H0*1
  32015. -->
  32016. (S1 ^operator O2225 = 0.3895396006281447)
  32017. Firing prefer*rvt*predict-yes*H0
  32018. -->
  32019. Firing prefer*rvt*predict-no*H0
  32020. -->
  32021. Firing elaborate*copy-dir-to-output-link
  32022. -->
  32023. (I3 ^dir L +)
  32024. inner elaboration loop at bottom goal.
  32025. Retracting elaborate*copy-see-to-output-link
  32026. -->
  32027. (I3 ^see 0 +)
  32028. Retracting propose*predict-no
  32029. -->
  32030. (O2226 ^name predict-no +)
  32031. (S1 ^operator O2226 +)
  32032. Retracting propose*predict-yes
  32033. -->
  32034. (O2225 ^name predict-yes +)
  32035. (S1 ^operator O2225 +)
  32036. Retracting elaborate*reward*based*on*reward
  32037. -->
  32038. (R1116 ^value 1 +)
  32039. (R1 ^reward R1116 +)
  32040. Retracting elaborate*copy-dir-to-output-link
  32041. -->
  32042. (I3 ^dir R +)
  32043. Retracting rl*prefer*rvt*predict-no*H0*4
  32044. -->
  32045. (S1 ^operator O2226 = 0.447619401118731)
  32046. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*41
  32047. -->
  32048. (S1 ^operator O2226 = 0.5523804042756664)
  32049. Retracting rl*prefer*rvt*predict-yes*H0*3
  32050. -->
  32051. (S1 ^operator O2225 = 0.1844126357453541)
  32052. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*42
  32053. -->
  32054. (S1 ^operator O2225 = 0.1664311307472832)
  32055. =>WM: (15619: S1 ^operator O2228 +)
  32056. =>WM: (15618: S1 ^operator O2227 +)
  32057. =>WM: (15617: I3 ^dir L)
  32058. =>WM: (15616: O2228 ^name predict-no)
  32059. =>WM: (15615: O2227 ^name predict-yes)
  32060. =>WM: (15614: R1117 ^value 1)
  32061. =>WM: (15613: R1 ^reward R1117)
  32062. <=WM: (15604: S1 ^operator O2225 +)
  32063. <=WM: (15605: S1 ^operator O2226 +)
  32064. <=WM: (15606: S1 ^operator O2226)
  32065. <=WM: (15603: I3 ^dir R)
  32066. <=WM: (15599: R1 ^reward R1116)
  32067. <=WM: (15602: O2226 ^name predict-no)
  32068. <=WM: (15601: O2225 ^name predict-yes)
  32069. <=WM: (15600: R1116 ^value 1)
  32070. --- Inner Elaboration Phase, active level 1 (S1) ---
  32071. Firing prefer*rvt*predict-yes*H0
  32072. -->
  32073. Firing rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  32074. -->
  32075. (S1 ^operator O2227 = 0.6104605321549119)
  32076. Firing rl*prefer*rvt*predict-yes*H0*1
  32077. -->
  32078. (S1 ^operator O2227 = 0.3895396006281447)
  32079. Firing prefer*rvt*predict-yes*H0*1*v1*H1
  32080. -->
  32081. Firing prefer*rvt*predict-no*H0
  32082. -->
  32083. Firing rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  32084. -->
  32085. (S1 ^operator O2228 = 0.1063475139796038)
  32086. Firing rl*prefer*rvt*predict-no*H0*2
  32087. -->
  32088. (S1 ^operator O2228 = 0.3873354925260269)
  32089. Firing prefer*rvt*predict-no*H0*2*v1*H1
  32090. -->
  32091. inner elaboration loop at bottom goal.
  32092. Retracting rl*prefer*rvt*predict-no*H0*2
  32093. -->
  32094. (S1 ^operator O2226 = 0.3873354925260269)
  32095. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  32096. -->
  32097. (S1 ^operator O2226 = 0.1063475139796038)
  32098. Retracting rl*prefer*rvt*predict-yes*H0*1
  32099. -->
  32100. (S1 ^operator O2225 = 0.3895396006281447)
  32101. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  32102. -->
  32103. (S1 ^operator O2225 = 0.6104605321549119)
  32104. --- END Proposal Phase ---
  32105. --- Decision Phase ---
  32106. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.940397,0.0564238)
  32107. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*41 0.377467 0.174913 0.55238 -> 0.377467 0.174913 0.55238(R,m,v=1,1,0)
  32108. =>WM: (15620: S1 ^operator O2227)
  32109. 1114: O: O2227 (predict-yes)
  32110. --- END Decision Phase ---
  32111. --- Application Phase ---
  32112. --- Firing Productions (PE) For State At Depth 1 ---
  32113. --- Inner Elaboration Phase, active level 1 (S1) ---
  32114. Firing apply*operator
  32115. -->
  32116. (I3 ^predict-yes N1114 + :O )
  32117. Firing apply*operator*complete
  32118. -->
  32119. (I3 ^predict-no N1113 - :O )
  32120. inner elaboration loop at bottom goal.
  32121. --- Change Working Memory (PE) ---
  32122. =>WM: (15621: I3 ^predict-yes N1114)
  32123. <=WM: (15608: N1113 ^status complete)
  32124. <=WM: (15607: I3 ^predict-no N1113)
  32125. --- Firing Productions (IE) For State At Depth 1 ---
  32126. --- Inner Elaboration Phase, active level 1 (S1) ---
  32127. Firing monitor*world
  32128. -->
  32129. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32130. --- Change Working Memory (IE) ---
  32131. --- END Application Phase ---
  32132. --- Output Phase ---
  32133. ENV: Agent did: predict-yes for direction L in state State-B
  32134. In State-B moving L
  32135. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  32136. predict error 0
  32137. dir: dir isR
  32138. --- END Output Phase ---
  32139. /|\--- Input Phase ---
  32140. =>WM: (15625: I2 ^dir R)
  32141. =>WM: (15624: I2 ^reward 1)
  32142. =>WM: (15623: I2 ^see 1)
  32143. =>WM: (15622: N1114 ^status complete)
  32144. <=WM: (15611: I2 ^dir L)
  32145. <=WM: (15610: I2 ^reward 1)
  32146. <=WM: (15609: I2 ^see 0)
  32147. =>WM: (15626: I2 ^level-1 L1-root)
  32148. <=WM: (15612: I2 ^level-1 R0-root)
  32149. --- END Input Phase ---
  32150. --- Proposal Phase ---
  32151. --- Inner Elaboration Phase, active level 1 (S1) ---
  32152. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  32153. -->
  32154. (S1 ^operator O2228 = -0.02155734064455064)
  32155. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  32156. -->
  32157. (S1 ^operator O2227 = 0.815586833919034)
  32158. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32159. -->
  32160. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32161. -->
  32162. Firing elaborate*copy-see-to-output-link
  32163. -->
  32164. (I3 ^see 1 +)
  32165. Firing elaborate*reward*based*on*reward
  32166. -->
  32167. (R1118 ^value 1 +)
  32168. (R1 ^reward R1118 +)
  32169. Firing propose*predict-yes
  32170. -->
  32171. (O2229 ^name predict-yes +)
  32172. (S1 ^operator O2229 +)
  32173. Firing propose*predict-no
  32174. -->
  32175. (O2230 ^name predict-no +)
  32176. (S1 ^operator O2230 +)
  32177. Firing rl*prefer*rvt*predict-no*H0*4
  32178. -->
  32179. (S1 ^operator O2228 = 0.4476194303095714)
  32180. Firing rl*prefer*rvt*predict-yes*H0*3
  32181. -->
  32182. (S1 ^operator O2227 = 0.1844126357453541)
  32183. Firing prefer*rvt*predict-yes*H0
  32184. -->
  32185. Firing prefer*rvt*predict-no*H0
  32186. -->
  32187. Firing elaborate*copy-dir-to-output-link
  32188. -->
  32189. (I3 ^dir R +)
  32190. inner elaboration loop at bottom goal.
  32191. Retracting elaborate*copy-see-to-output-link
  32192. -->
  32193. (I3 ^see 0 +)
  32194. Retracting propose*predict-no
  32195. -->
  32196. (O2228 ^name predict-no +)
  32197. (S1 ^operator O2228 +)
  32198. Retracting propose*predict-yes
  32199. -->
  32200. (O2227 ^name predict-yes +)
  32201. (S1 ^operator O2227 +)
  32202. Retracting elaborate*reward*based*on*reward
  32203. -->
  32204. (R1117 ^value 1 +)
  32205. (R1 ^reward R1117 +)
  32206. Retracting elaborate*copy-dir-to-output-link
  32207. -->
  32208. (I3 ^dir L +)
  32209. Retracting rl*prefer*rvt*predict-no*H0*2
  32210. -->
  32211. (S1 ^operator O2228 = 0.3873354925260269)
  32212. Retracting rl*prefer*rvt*predict-no*H0*2*v1*H1*43
  32213. -->
  32214. (S1 ^operator O2228 = 0.1063475139796038)
  32215. Retracting rl*prefer*rvt*predict-yes*H0*1
  32216. -->
  32217. (S1 ^operator O2227 = 0.3895396006281447)
  32218. Retracting rl*prefer*rvt*predict-yes*H0*1*v1*H1*44
  32219. -->
  32220. (S1 ^operator O2227 = 0.6104605321549119)
  32221. =>WM: (15634: S1 ^operator O2230 +)
  32222. =>WM: (15633: S1 ^operator O2229 +)
  32223. =>WM: (15632: I3 ^dir R)
  32224. =>WM: (15631: O2230 ^name predict-no)
  32225. =>WM: (15630: O2229 ^name predict-yes)
  32226. =>WM: (15629: R1118 ^value 1)
  32227. =>WM: (15628: R1 ^reward R1118)
  32228. =>WM: (15627: I3 ^see 1)
  32229. <=WM: (15618: S1 ^operator O2227 +)
  32230. <=WM: (15620: S1 ^operator O2227)
  32231. <=WM: (15619: S1 ^operator O2228 +)
  32232. <=WM: (15617: I3 ^dir L)
  32233. <=WM: (15613: R1 ^reward R1117)
  32234. <=WM: (15584: I3 ^see 0)
  32235. <=WM: (15616: O2228 ^name predict-no)
  32236. <=WM: (15615: O2227 ^name predict-yes)
  32237. <=WM: (15614: R1117 ^value 1)
  32238. --- Inner Elaboration Phase, active level 1 (S1) ---
  32239. Firing prefer*rvt*predict-yes*H0
  32240. -->
  32241. Firing rl*prefer*rvt*predict-yes*H0*3
  32242. -->
  32243. (S1 ^operator O2229 = 0.1844126357453541)
  32244. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32245. -->
  32246. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  32247. -->
  32248. (S1 ^operator O2229 = 0.815586833919034)
  32249. Firing prefer*rvt*predict-no*H0
  32250. -->
  32251. Firing rl*prefer*rvt*predict-no*H0*4
  32252. -->
  32253. (S1 ^operator O2230 = 0.4476194303095714)
  32254. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32255. -->
  32256. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  32257. -->
  32258. (S1 ^operator O2230 = -0.02155734064455064)
  32259. inner elaboration loop at bottom goal.
  32260. Retracting rl*prefer*rvt*predict-no*H0*4
  32261. -->
  32262. (S1 ^operator O2228 = 0.4476194303095714)
  32263. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  32264. -->
  32265. (S1 ^operator O2228 = -0.02155734064455064)
  32266. Retracting rl*prefer*rvt*predict-yes*H0*3
  32267. -->
  32268. (S1 ^operator O2227 = 0.1844126357453541)
  32269. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  32270. -->
  32271. (S1 ^operator O2227 = 0.815586833919034)
  32272. --- END Proposal Phase ---
  32273. --- Decision Phase ---
  32274. RL update rl*prefer*rvt*predict-yes*H0*1 0.711951 -0.322412 0.38954 -> 0.711951 -0.322412 0.38954(R,m,v=1,0.903226,0.0878814)
  32275. RL update rl*prefer*rvt*predict-yes*H0*1*v1*H1*44 0.288049 0.322412 0.610461 -> 0.288049 0.322412 0.610461(R,m,v=1,1,0)
  32276. =>WM: (15635: S1 ^operator O2229)
  32277. 1115: O: O2229 (predict-yes)
  32278. --- END Decision Phase ---
  32279. --- Application Phase ---
  32280. --- Firing Productions (PE) For State At Depth 1 ---
  32281. --- Inner Elaboration Phase, active level 1 (S1) ---
  32282. Firing apply*operator
  32283. -->
  32284. (I3 ^predict-yes N1115 + :O )
  32285. Firing apply*operator*complete
  32286. -->
  32287. (I3 ^predict-yes N1114 - :O )
  32288. inner elaboration loop at bottom goal.
  32289. --- Change Working Memory (PE) ---
  32290. =>WM: (15636: I3 ^predict-yes N1115)
  32291. <=WM: (15622: N1114 ^status complete)
  32292. <=WM: (15621: I3 ^predict-yes N1114)
  32293. --- Firing Productions (IE) For State At Depth 1 ---
  32294. --- Inner Elaboration Phase, active level 1 (S1) ---
  32295. Firing monitor*world
  32296. -->
  32297. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32298. --- Change Working Memory (IE) ---
  32299. --- END Application Phase ---
  32300. --- Output Phase ---
  32301. ENV: Agent did: predict-yes for direction R in state State-A
  32302. In State-A moving R
  32303. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  32304. predict error 0
  32305. dir: dir isR
  32306. --- END Output Phase ---
  32307. -/|--- Input Phase ---
  32308. =>WM: (15640: I2 ^dir R)
  32309. =>WM: (15639: I2 ^reward 1)
  32310. =>WM: (15638: I2 ^see 1)
  32311. =>WM: (15637: N1115 ^status complete)
  32312. <=WM: (15625: I2 ^dir R)
  32313. <=WM: (15624: I2 ^reward 1)
  32314. <=WM: (15623: I2 ^see 1)
  32315. =>WM: (15641: I2 ^level-1 R1-root)
  32316. <=WM: (15626: I2 ^level-1 L1-root)
  32317. --- END Input Phase ---
  32318. --- Proposal Phase ---
  32319. --- Inner Elaboration Phase, active level 1 (S1) ---
  32320. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32321. -->
  32322. (S1 ^operator O2229 = 0.1398795999120246)
  32323. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32324. -->
  32325. (S1 ^operator O2230 = 0.5523808264858534)
  32326. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32327. -->
  32328. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32329. -->
  32330. Firing elaborate*copy-see-to-output-link
  32331. -->
  32332. (I3 ^see 1 +)
  32333. Firing elaborate*reward*based*on*reward
  32334. -->
  32335. (R1119 ^value 1 +)
  32336. (R1 ^reward R1119 +)
  32337. Firing propose*predict-yes
  32338. -->
  32339. (O2231 ^name predict-yes +)
  32340. (S1 ^operator O2231 +)
  32341. Firing propose*predict-no
  32342. -->
  32343. (O2232 ^name predict-no +)
  32344. (S1 ^operator O2232 +)
  32345. Firing rl*prefer*rvt*predict-no*H0*4
  32346. -->
  32347. (S1 ^operator O2230 = 0.4476194303095714)
  32348. Firing rl*prefer*rvt*predict-yes*H0*3
  32349. -->
  32350. (S1 ^operator O2229 = 0.1844126357453541)
  32351. Firing prefer*rvt*predict-yes*H0
  32352. -->
  32353. Firing prefer*rvt*predict-no*H0
  32354. -->
  32355. Firing elaborate*copy-dir-to-output-link
  32356. -->
  32357. (I3 ^dir R +)
  32358. inner elaboration loop at bottom goal.
  32359. Retracting elaborate*copy-see-to-output-link
  32360. -->
  32361. (I3 ^see 1 +)
  32362. Retracting propose*predict-no
  32363. -->
  32364. (O2230 ^name predict-no +)
  32365. (S1 ^operator O2230 +)
  32366. Retracting propose*predict-yes
  32367. -->
  32368. (O2229 ^name predict-yes +)
  32369. (S1 ^operator O2229 +)
  32370. Retracting elaborate*reward*based*on*reward
  32371. -->
  32372. (R1118 ^value 1 +)
  32373. (R1 ^reward R1118 +)
  32374. Retracting elaborate*copy-dir-to-output-link
  32375. -->
  32376. (I3 ^dir R +)
  32377. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*45
  32378. -->
  32379. (S1 ^operator O2230 = -0.02155734064455064)
  32380. Retracting rl*prefer*rvt*predict-no*H0*4
  32381. -->
  32382. (S1 ^operator O2230 = 0.4476194303095714)
  32383. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*46
  32384. -->
  32385. (S1 ^operator O2229 = 0.815586833919034)
  32386. Retracting rl*prefer*rvt*predict-yes*H0*3
  32387. -->
  32388. (S1 ^operator O2229 = 0.1844126357453541)
  32389. =>WM: (15647: S1 ^operator O2232 +)
  32390. =>WM: (15646: S1 ^operator O2231 +)
  32391. =>WM: (15645: O2232 ^name predict-no)
  32392. =>WM: (15644: O2231 ^name predict-yes)
  32393. =>WM: (15643: R1119 ^value 1)
  32394. =>WM: (15642: R1 ^reward R1119)
  32395. <=WM: (15633: S1 ^operator O2229 +)
  32396. <=WM: (15635: S1 ^operator O2229)
  32397. <=WM: (15634: S1 ^operator O2230 +)
  32398. <=WM: (15628: R1 ^reward R1118)
  32399. <=WM: (15631: O2230 ^name predict-no)
  32400. <=WM: (15630: O2229 ^name predict-yes)
  32401. <=WM: (15629: R1118 ^value 1)
  32402. --- Inner Elaboration Phase, active level 1 (S1) ---
  32403. Firing prefer*rvt*predict-yes*H0
  32404. -->
  32405. Firing rl*prefer*rvt*predict-yes*H0*3
  32406. -->
  32407. (S1 ^operator O2231 = 0.1844126357453541)
  32408. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32409. -->
  32410. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32411. -->
  32412. (S1 ^operator O2231 = 0.1398795999120246)
  32413. Firing prefer*rvt*predict-no*H0
  32414. -->
  32415. Firing rl*prefer*rvt*predict-no*H0*4
  32416. -->
  32417. (S1 ^operator O2232 = 0.4476194303095714)
  32418. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32419. -->
  32420. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32421. -->
  32422. (S1 ^operator O2232 = 0.5523808264858534)
  32423. inner elaboration loop at bottom goal.
  32424. Retracting rl*prefer*rvt*predict-no*H0*4
  32425. -->
  32426. (S1 ^operator O2230 = 0.4476194303095714)
  32427. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32428. -->
  32429. (S1 ^operator O2230 = 0.5523808264858534)
  32430. Retracting rl*prefer*rvt*predict-yes*H0*3
  32431. -->
  32432. (S1 ^operator O2229 = 0.1844126357453541)
  32433. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32434. -->
  32435. (S1 ^operator O2229 = 0.1398795999120246)
  32436. --- END Proposal Phase ---
  32437. --- Decision Phase ---
  32438. RL update rl*prefer*rvt*predict-yes*H0*3 0.675416 -0.491003 0.184413 -> 0.675416 -0.491003 0.184413(R,m,v=1,0.910053,0.082292)
  32439. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*46 0.324584 0.491003 0.815587 -> 0.324584 0.491003 0.815587(R,m,v=1,1,0)
  32440. =>WM: (15648: S1 ^operator O2232)
  32441. 1116: O: O2232 (predict-no)
  32442. --- END Decision Phase ---
  32443. --- Application Phase ---
  32444. --- Firing Productions (PE) For State At Depth 1 ---
  32445. --- Inner Elaboration Phase, active level 1 (S1) ---
  32446. Firing apply*operator
  32447. -->
  32448. (I3 ^predict-no N1116 + :O )
  32449. Firing apply*operator*complete
  32450. -->
  32451. (I3 ^predict-yes N1115 - :O )
  32452. inner elaboration loop at bottom goal.
  32453. --- Change Working Memory (PE) ---
  32454. =>WM: (15649: I3 ^predict-no N1116)
  32455. <=WM: (15637: N1115 ^status complete)
  32456. <=WM: (15636: I3 ^predict-yes N1115)
  32457. --- Firing Productions (IE) For State At Depth 1 ---
  32458. --- Inner Elaboration Phase, active level 1 (S1) ---
  32459. Firing monitor*world
  32460. -->
  32461. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32462. --- Change Working Memory (IE) ---
  32463. --- END Application Phase ---
  32464. --- Output Phase ---
  32465. ENV: Agent did: predict-no for direction R in state State-B
  32466. In State-B moving R
  32467. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32468. predict error 0
  32469. dir: dir isU
  32470. --- END Output Phase ---
  32471. \-/--- Input Phase ---
  32472. =>WM: (15653: I2 ^dir U)
  32473. =>WM: (15652: I2 ^reward 1)
  32474. =>WM: (15651: I2 ^see 0)
  32475. =>WM: (15650: N1116 ^status complete)
  32476. <=WM: (15640: I2 ^dir R)
  32477. <=WM: (15639: I2 ^reward 1)
  32478. <=WM: (15638: I2 ^see 1)
  32479. =>WM: (15654: I2 ^level-1 R0-root)
  32480. <=WM: (15641: I2 ^level-1 R1-root)
  32481. --- END Input Phase ---
  32482. --- Proposal Phase ---
  32483. --- Inner Elaboration Phase, active level 1 (S1) ---
  32484. Firing elaborate*copy-see-to-output-link
  32485. -->
  32486. (I3 ^see 0 +)
  32487. Firing elaborate*reward*based*on*reward
  32488. -->
  32489. (R1120 ^value 1 +)
  32490. (R1 ^reward R1120 +)
  32491. Firing propose*predict-yes
  32492. -->
  32493. (O2233 ^name predict-yes +)
  32494. (S1 ^operator O2233 +)
  32495. Firing propose*predict-no
  32496. -->
  32497. (O2234 ^name predict-no +)
  32498. (S1 ^operator O2234 +)
  32499. Firing rl*prefer*rvt*predict-no*H0*6
  32500. -->
  32501. (S1 ^operator O2232 = 0.9999999999999999)
  32502. Firing rl*prefer*rvt*predict-yes*H0*5
  32503. -->
  32504. (S1 ^operator O2231 = 0.)
  32505. Firing prefer*rvt*predict-yes*H0
  32506. -->
  32507. Firing prefer*rvt*predict-no*H0
  32508. -->
  32509. Firing elaborate*copy-dir-to-output-link
  32510. -->
  32511. (I3 ^dir U +)
  32512. inner elaboration loop at bottom goal.
  32513. Retracting elaborate*copy-see-to-output-link
  32514. -->
  32515. (I3 ^see 1 +)
  32516. Retracting propose*predict-no
  32517. -->
  32518. (O2232 ^name predict-no +)
  32519. (S1 ^operator O2232 +)
  32520. Retracting propose*predict-yes
  32521. -->
  32522. (O2231 ^name predict-yes +)
  32523. (S1 ^operator O2231 +)
  32524. Retracting elaborate*reward*based*on*reward
  32525. -->
  32526. (R1119 ^value 1 +)
  32527. (R1 ^reward R1119 +)
  32528. Retracting elaborate*copy-dir-to-output-link
  32529. -->
  32530. (I3 ^dir R +)
  32531. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32532. -->
  32533. (S1 ^operator O2232 = 0.5523808264858534)
  32534. Retracting rl*prefer*rvt*predict-no*H0*4
  32535. -->
  32536. (S1 ^operator O2232 = 0.4476194303095714)
  32537. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32538. -->
  32539. (S1 ^operator O2231 = 0.1398795999120246)
  32540. Retracting rl*prefer*rvt*predict-yes*H0*3
  32541. -->
  32542. (S1 ^operator O2231 = 0.1844127152956959)
  32543. =>WM: (15662: S1 ^operator O2234 +)
  32544. =>WM: (15661: S1 ^operator O2233 +)
  32545. =>WM: (15660: I3 ^dir U)
  32546. =>WM: (15659: O2234 ^name predict-no)
  32547. =>WM: (15658: O2233 ^name predict-yes)
  32548. =>WM: (15657: R1120 ^value 1)
  32549. =>WM: (15656: R1 ^reward R1120)
  32550. =>WM: (15655: I3 ^see 0)
  32551. <=WM: (15646: S1 ^operator O2231 +)
  32552. <=WM: (15647: S1 ^operator O2232 +)
  32553. <=WM: (15648: S1 ^operator O2232)
  32554. <=WM: (15632: I3 ^dir R)
  32555. <=WM: (15642: R1 ^reward R1119)
  32556. <=WM: (15627: I3 ^see 1)
  32557. <=WM: (15645: O2232 ^name predict-no)
  32558. <=WM: (15644: O2231 ^name predict-yes)
  32559. <=WM: (15643: R1119 ^value 1)
  32560. --- Inner Elaboration Phase, active level 1 (S1) ---
  32561. Firing prefer*rvt*predict-yes*H0
  32562. -->
  32563. Firing rl*prefer*rvt*predict-yes*H0*5
  32564. -->
  32565. (S1 ^operator O2233 = 0.)
  32566. Firing prefer*rvt*predict-no*H0
  32567. -->
  32568. Firing rl*prefer*rvt*predict-no*H0*6
  32569. -->
  32570. (S1 ^operator O2234 = 0.9999999999999999)
  32571. inner elaboration loop at bottom goal.
  32572. Retracting rl*prefer*rvt*predict-no*H0*6
  32573. -->
  32574. (S1 ^operator O2232 = 0.9999999999999999)
  32575. Retracting rl*prefer*rvt*predict-yes*H0*5
  32576. -->
  32577. (S1 ^operator O2231 = 0.)
  32578. --- END Proposal Phase ---
  32579. --- Decision Phase ---
  32580. RL update rl*prefer*rvt*predict-no*H0*4 0.622533 -0.174914 0.447619 -> 0.622533 -0.174914 0.447619(R,m,v=1,0.940789,0.0560735)
  32581. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.377467 0.174914 0.552381 -> 0.377467 0.174914 0.552381(R,m,v=1,1,0)
  32582. =>WM: (15663: S1 ^operator O2234)
  32583. 1117: O: O2234 (predict-no)
  32584. --- END Decision Phase ---
  32585. --- Application Phase ---
  32586. --- Firing Productions (PE) For State At Depth 1 ---
  32587. --- Inner Elaboration Phase, active level 1 (S1) ---
  32588. Firing apply*operator
  32589. -->
  32590. (I3 ^predict-no N1117 + :O )
  32591. Firing apply*operator*complete
  32592. -->
  32593. (I3 ^predict-no N1116 - :O )
  32594. inner elaboration loop at bottom goal.
  32595. --- Change Working Memory (PE) ---
  32596. =>WM: (15664: I3 ^predict-no N1117)
  32597. <=WM: (15650: N1116 ^status complete)
  32598. <=WM: (15649: I3 ^predict-no N1116)
  32599. --- Firing Productions (IE) For State At Depth 1 ---
  32600. --- Inner Elaboration Phase, active level 1 (S1) ---
  32601. Firing monitor*world
  32602. -->
  32603. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32604. --- Change Working Memory (IE) ---
  32605. --- END Application Phase ---
  32606. --- Output Phase ---
  32607. ENV: Agent did: predict-no for direction U in state State-B
  32608. In State-B moving U
  32609. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32610. predict er