/flipv2/20121113-091142-2.5K-ReLST-asneeded_epochs_50_5runs_noalphadecay/stdout-flip-2.5K_3.txt

https://bitbucket.org/evan13579b/soar-ziggurat

  1. Seeding... 3
  2. dir: dir isL
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 3 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_3.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-sleeping...
  20. /|\-/|\sleeping...
  21. -1: O: O2 (predict-no)
  22. I see 0 and I'm going to do: predict-no
  23. ENV: Agent did: predict-no for direction L in state State-A
  24. In State-A moving L
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26. predict error 0
  27. dir: dir isR
  28. rule alias: '*'
  29. rule alias: '*'
  30. /|\-/|\2: O: O3 (predict-yes)
  31. I see 1 and I'm going to do: predict-yes
  32. ENV: Agent did: predict-yes for direction R in state State-A
  33. In State-A moving R
  34. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  35. predict error 0
  36. dir: dir isR
  37. -/|3: O: O6 (predict-no)
  38. I see 1 and I'm going to do: predict-no
  39. ENV: Agent did: predict-no for direction R in state State-B
  40. In State-B moving R
  41. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  42. predict error 0
  43. dir: dir isR
  44. \-/4: O: O7 (predict-yes)
  45. I see 1 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction R in state State-B
  47. In State-B moving R
  48. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  49. predict error 1
  50. dir: dir isR
  51. |\5: O: O9 (predict-yes)
  52. I see 0 and I'm going to do: predict-yes
  53. ENV: Agent did: predict-yes for direction R in state State-B
  54. In State-B moving R
  55. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  56. predict error 1
  57. dir: dir isL
  58. -/|6: O: O12 (predict-no)
  59. I see 0 and I'm going to do: predict-no
  60. ENV: Agent did: predict-no for direction L in state State-B
  61. In State-B moving L
  62. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  63. predict error 1
  64. dir: dir isL
  65. \-/|7: O: O14 (predict-no)
  66. I see 0 and I'm going to do: predict-no
  67. ENV: Agent did: predict-no for direction L in state State-A
  68. In State-A moving L
  69. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  70. predict error 0
  71. dir: dir isU
  72. \-8: O: O15 (predict-yes)
  73. I see 1 and I'm going to do: predict-yes
  74. ENV: Agent did: predict-yes for direction U in state State-A
  75. In State-A moving U
  76. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  77. predict error 1
  78. dir: dir isL
  79. /|9: O: O18 (predict-no)
  80. I see 0 and I'm going to do: predict-no
  81. ENV: Agent did: predict-no for direction L in state State-A
  82. In State-A moving L
  83. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  84. predict error 0
  85. dir: dir isL
  86. \-/10: O: O19 (predict-yes)
  87. I see 1 and I'm going to do: predict-yes
  88. ENV: Agent did: predict-yes for direction L in state State-A
  89. In State-A moving L
  90. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  91. predict error 1
  92. dir: dir isU
  93. |11: O: O22 (predict-no)
  94. I see 0 and I'm going to do: predict-no
  95. ENV: Agent did: predict-no for direction U in state State-A
  96. In State-A moving U
  97. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  98. predict error 0
  99. dir: dir isR
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. \12: O: O24 (predict-no)
  105. I see 1 and I'm going to do: predict-no
  106. ENV: Agent did: predict-no for direction R in state State-A
  107. In State-A moving R
  108. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  109. predict error 1
  110. dir: dir isU
  111. -/|13: O: O25 (predict-yes)
  112. I see 0 and I'm going to do: predict-yes
  113. ENV: Agent did: predict-yes for direction U in state State-B
  114. In State-B moving U
  115. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  116. predict error 1
  117. dir: dir isR
  118. \14: O: O28 (predict-no)
  119. I see 0 and I'm going to do: predict-no
  120. ENV: Agent did: predict-no for direction R in state State-B
  121. In State-B moving R
  122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  123. predict error 0
  124. dir: dir isR
  125. -/15: O: O30 (predict-no)
  126. I see 1 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction R in state State-B
  128. In State-B moving R
  129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  130. predict error 0
  131. dir: dir isL
  132. |\-16: O: O32 (predict-no)
  133. I see 1 and I'm going to do: predict-no
  134. ENV: Agent did: predict-no for direction L in state State-B
  135. In State-B moving L
  136. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  137. predict error 1
  138. dir: dir isR
  139. /|17: O: O33 (predict-yes)
  140. I see 0 and I'm going to do: predict-yes
  141. ENV: Agent did: predict-yes for direction R in state State-A
  142. In State-A moving R
  143. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  144. predict error 0
  145. dir: dir isU
  146. \-/18: O: O36 (predict-no)
  147. I see 1 and I'm going to do: predict-no
  148. ENV: Agent did: predict-no for direction U in state State-B
  149. In State-B moving U
  150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  151. predict error 0
  152. dir: dir isR
  153. |\-19: O: O38 (predict-no)
  154. I see 1 and I'm going to do: predict-no
  155. ENV: Agent did: predict-no for direction R in state State-B
  156. In State-B moving R
  157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  158. predict error 0
  159. dir: dir isU
  160. /|\20: O: O40 (predict-no)
  161. I see 1 and I'm going to do: predict-no
  162. ENV: Agent did: predict-no for direction U in state State-B
  163. In State-B moving U
  164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  165. predict error 0
  166. dir: dir isU
  167. -/21: O: O42 (predict-no)
  168. I see 1 and I'm going to do: predict-no
  169. ENV: Agent did: predict-no for direction U in state State-B
  170. In State-B moving U
  171. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  172. predict error 0
  173. dir: dir isL
  174. |22: O: O44 (predict-no)
  175. I see 1 and I'm going to do: predict-no
  176. ENV: Agent did: predict-no for direction L in state State-B
  177. In State-B moving L
  178. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  179. predict error 1
  180. dir: dir isU
  181. \-23: O: O45 (predict-yes)
  182. I see 0 and I'm going to do: predict-yes
  183. ENV: Agent did: predict-yes for direction U in state State-A
  184. In State-A moving U
  185. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  186. predict error 1
  187. dir: dir isR
  188. /|\-24: O: O48 (predict-no)
  189. I see 0 and I'm going to do: predict-no
  190. ENV: Agent did: predict-no for direction R in state State-A
  191. In State-A moving R
  192. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  193. predict error 1
  194. dir: dir isL
  195. /|\25: O: O50 (predict-no)
  196. I see 0 and I'm going to do: predict-no
  197. ENV: Agent did: predict-no for direction L in state State-B
  198. In State-B moving L
  199. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  200. predict error 1
  201. dir: dir isL
  202. -/|26: O: O52 (predict-no)
  203. I see 0 and I'm going to do: predict-no
  204. ENV: Agent did: predict-no for direction L in state State-A
  205. In State-A moving L
  206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  207. predict error 0
  208. dir: dir isU
  209. \-/27: O: O54 (predict-no)
  210. I see 1 and I'm going to do: predict-no
  211. ENV: Agent did: predict-no for direction U in state State-A
  212. In State-A moving U
  213. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  214. predict error 0
  215. dir: dir isR
  216. |\28: O: O56 (predict-no)
  217. I see 1 and I'm going to do: predict-no
  218. ENV: Agent did: predict-no for direction R in state State-A
  219. In State-A moving R
  220. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  221. predict error 1
  222. dir: dir isU
  223. -/|29: O: O58 (predict-no)
  224. I see 0 and I'm going to do: predict-no
  225. ENV: Agent did: predict-no for direction U in state State-B
  226. In State-B moving U
  227. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  228. predict error 0
  229. dir: dir isU
  230. \-30: O: O60 (predict-no)
  231. I see 1 and I'm going to do: predict-no
  232. ENV: Agent did: predict-no for direction U in state State-B
  233. In State-B moving U
  234. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  235. predict error 0
  236. dir: dir isU
  237. /31: O: O62 (predict-no)
  238. I see 1 and I'm going to do: predict-no
  239. ENV: Agent did: predict-no for direction U in state State-B
  240. In State-B moving U
  241. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  242. predict error 0
  243. dir: dir isU
  244. |32: O: O64 (predict-no)
  245. I see 1 and I'm going to do: predict-no
  246. ENV: Agent did: predict-no for direction U in state State-B
  247. In State-B moving U
  248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  249. predict error 0
  250. dir: dir isR
  251. \-/33: O: O66 (predict-no)
  252. I see 1 and I'm going to do: predict-no
  253. ENV: Agent did: predict-no for direction R in state State-B
  254. In State-B moving R
  255. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  256. predict error 0
  257. dir: dir isU
  258. |\-34: O: O68 (predict-no)
  259. I see 1 and I'm going to do: predict-no
  260. ENV: Agent did: predict-no for direction U in state State-B
  261. In State-B moving U
  262. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  263. predict error 0
  264. dir: dir isR
  265. /|\35: O: O70 (predict-no)
  266. I see 1 and I'm going to do: predict-no
  267. ENV: Agent did: predict-no for direction R in state State-B
  268. In State-B moving R
  269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  270. predict error 0
  271. dir: dir isU
  272. -/|36: O: O72 (predict-no)
  273. I see 1 and I'm going to do: predict-no
  274. ENV: Agent did: predict-no for direction U in state State-B
  275. In State-B moving U
  276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  277. predict error 0
  278. dir: dir isU
  279. \37: O: O74 (predict-no)
  280. I see 1 and I'm going to do: predict-no
  281. ENV: Agent did: predict-no for direction U in state State-B
  282. In State-B moving U
  283. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  284. predict error 0
  285. dir: dir isL
  286. -/38: O: O76 (predict-no)
  287. I see 1 and I'm going to do: predict-no
  288. ENV: Agent did: predict-no for direction L in state State-B
  289. In State-B moving L
  290. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  291. predict error 1
  292. dir: dir isL
  293. |\-39: O: O78 (predict-no)
  294. I see 0 and I'm going to do: predict-no
  295. ENV: Agent did: predict-no for direction L in state State-A
  296. In State-A moving L
  297. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  298. predict error 0
  299. dir: dir isL
  300. /|40: O: O80 (predict-no)
  301. I see 1 and I'm going to do: predict-no
  302. ENV: Agent did: predict-no for direction L in state State-A
  303. In State-A moving L
  304. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  305. predict error 0
  306. dir: dir isU
  307. \-41: O: O82 (predict-no)
  308. I see 1 and I'm going to do: predict-no
  309. ENV: Agent did: predict-no for direction U in state State-A
  310. In State-A moving U
  311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  312. predict error 0
  313. dir: dir isR
  314. /42: O: O84 (predict-no)
  315. I see 1 and I'm going to do: predict-no
  316. ENV: Agent did: predict-no for direction R in state State-A
  317. In State-A moving R
  318. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  319. predict error 1
  320. dir: dir isR
  321. |\-43: O: O86 (predict-no)
  322. I see 0 and I'm going to do: predict-no
  323. ENV: Agent did: predict-no for direction R in state State-B
  324. In State-B moving R
  325. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  326. predict error 0
  327. dir: dir isL
  328. /|\44: O: O88 (predict-no)
  329. I see 1 and I'm going to do: predict-no
  330. ENV: Agent did: predict-no for direction L in state State-B
  331. In State-B moving L
  332. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  333. predict error 1
  334. dir: dir isR
  335. -/45: O: O90 (predict-no)
  336. I see 0 and I'm going to do: predict-no
  337. ENV: Agent did: predict-no for direction R in state State-A
  338. In State-A moving R
  339. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  340. predict error 1
  341. dir: dir isR
  342. |\-46: O: O92 (predict-no)
  343. I see 0 and I'm going to do: predict-no
  344. ENV: Agent did: predict-no for direction R in state State-B
  345. In State-B moving R
  346. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  347. predict error 0
  348. dir: dir isR
  349. /|\47: O: O94 (predict-no)
  350. I see 1 and I'm going to do: predict-no
  351. ENV: Agent did: predict-no for direction R in state State-B
  352. In State-B moving R
  353. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  354. predict error 0
  355. dir: dir isR
  356. -/48: O: O96 (predict-no)
  357. I see 1 and I'm going to do: predict-no
  358. ENV: Agent did: predict-no for direction R in state State-B
  359. In State-B moving R
  360. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  361. predict error 0
  362. dir: dir isR
  363. |\-49: O: O98 (predict-no)
  364. I see 1 and I'm going to do: predict-no
  365. ENV: Agent did: predict-no for direction R in state State-B
  366. In State-B moving R
  367. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  368. predict error 0
  369. dir: dir isU
  370. /50: O: O100 (predict-no)
  371. I see 1 and I'm going to do: predict-no
  372. ENV: Agent did: predict-no for direction U in state State-B
  373. In State-B moving U
  374. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  375. predict error 0
  376. dir: dir isU
  377. |\-/|\sleeping...
  378. -51: O: O102 (predict-no)
  379. I see 1 and I'm going to do: predict-no
  380. ENV: Agent did: predict-no for direction U in state State-B
  381. In State-B moving U
  382. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  383. predict error 0
  384. dir: dir isU
  385. /52: O: O104 (predict-no)
  386. I see 1 and I'm going to do: predict-no
  387. ENV: Agent did: predict-no for direction U in state State-B
  388. In State-B moving U
  389. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  390. predict error 0
  391. dir: dir isU
  392. |\53: O: O106 (predict-no)
  393. I see 1 and I'm going to do: predict-no
  394. ENV: Agent did: predict-no for direction U in state State-B
  395. In State-B moving U
  396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  397. predict error 0
  398. dir: dir isU
  399. -/54: O: O108 (predict-no)
  400. I see 1 and I'm going to do: predict-no
  401. ENV: Agent did: predict-no for direction U in state State-B
  402. In State-B moving U
  403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  404. predict error 0
  405. dir: dir isU
  406. |\55: O: O110 (predict-no)
  407. I see 1 and I'm going to do: predict-no
  408. ENV: Agent did: predict-no for direction U in state State-B
  409. In State-B moving U
  410. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  411. predict error 0
  412. dir: dir isL
  413. -/|56: O: O112 (predict-no)
  414. I see 1 and I'm going to do: predict-no
  415. ENV: Agent did: predict-no for direction L in state State-B
  416. In State-B moving L
  417. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  418. predict error 1
  419. dir: dir isU
  420. \-57: O: O114 (predict-no)
  421. I see 0 and I'm going to do: predict-no
  422. ENV: Agent did: predict-no for direction U in state State-A
  423. In State-A moving U
  424. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  425. predict error 0
  426. dir: dir isU
  427. /|\58: O: O116 (predict-no)
  428. I see 1 and I'm going to do: predict-no
  429. ENV: Agent did: predict-no for direction U in state State-A
  430. In State-A moving U
  431. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  432. predict error 0
  433. dir: dir isU
  434. -/59: O: O118 (predict-no)
  435. I see 1 and I'm going to do: predict-no
  436. ENV: Agent did: predict-no for direction U in state State-A
  437. In State-A moving U
  438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  439. predict error 0
  440. dir: dir isR
  441. |\60: O: O119 (predict-yes)
  442. I see 1 and I'm going to do: predict-yes
  443. ENV: Agent did: predict-yes for direction R in state State-A
  444. In State-A moving R
  445. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  446. predict error 0
  447. dir: dir isU
  448. -/61: O: O122 (predict-no)
  449. I see 1 and I'm going to do: predict-no
  450. ENV: Agent did: predict-no for direction U in state State-B
  451. In State-B moving U
  452. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  453. predict error 0
  454. dir: dir isL
  455. rule alias: '*'
  456. rule alias: '*'
  457. rule alias: '*'
  458. |62: O: O124 (predict-no)
  459. I see 1 and I'm going to do: predict-no
  460. ENV: Agent did: predict-no for direction L in state State-B
  461. In State-B moving L
  462. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  463. predict error 1
  464. dir: dir isU
  465. \-63: O: O126 (predict-no)
  466. I see 0 and I'm going to do: predict-no
  467. ENV: Agent did: predict-no for direction U in state State-A
  468. In State-A moving U
  469. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  470. predict error 0
  471. dir: dir isR
  472. /|\64: O: O128 (predict-no)
  473. I see 1 and I'm going to do: predict-no
  474. ENV: Agent did: predict-no for direction R in state State-A
  475. In State-A moving R
  476. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  477. predict error 1
  478. dir: dir isL
  479. -/|65: O: O130 (predict-no)
  480. I see 0 and I'm going to do: predict-no
  481. ENV: Agent did: predict-no for direction L in state State-B
  482. In State-B moving L
  483. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  484. predict error 1
  485. dir: dir isL
  486. \-/66: O: O132 (predict-no)
  487. I see 0 and I'm going to do: predict-no
  488. ENV: Agent did: predict-no for direction L in state State-A
  489. In State-A moving L
  490. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  491. predict error 0
  492. dir: dir isU
  493. |\-67: O: O134 (predict-no)
  494. I see 1 and I'm going to do: predict-no
  495. ENV: Agent did: predict-no for direction U in state State-A
  496. In State-A moving U
  497. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  498. predict error 0
  499. dir: dir isU
  500. /|\68: O: O136 (predict-no)
  501. I see 1 and I'm going to do: predict-no
  502. ENV: Agent did: predict-no for direction U in state State-A
  503. In State-A moving U
  504. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  505. predict error 0
  506. dir: dir isL
  507. -69: O: O138 (predict-no)
  508. I see 1 and I'm going to do: predict-no
  509. ENV: Agent did: predict-no for direction L in state State-A
  510. In State-A moving L
  511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  512. predict error 0
  513. dir: dir isU
  514. /|\70: O: O140 (predict-no)
  515. I see 1 and I'm going to do: predict-no
  516. ENV: Agent did: predict-no for direction U in state State-A
  517. In State-A moving U
  518. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  519. predict error 0
  520. dir: dir isR
  521. -/71: O: O142 (predict-no)
  522. I see 1 and I'm going to do: predict-no
  523. ENV: Agent did: predict-no for direction R in state State-A
  524. In State-A moving R
  525. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  526. predict error 1
  527. dir: dir isL
  528. rule alias: '*'
  529. rule alias: '*'
  530. rule alias: '*'
  531. rule alias: '*'
  532. rule alias: '*'
  533. |72: O: O144 (predict-no)
  534. I see 0 and I'm going to do: predict-no
  535. ENV: Agent did: predict-no for direction L in state State-B
  536. In State-B moving L
  537. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  538. predict error 1
  539. dir: dir isL
  540. \-/|73: O: O146 (predict-no)
  541. I see 0 and I'm going to do: predict-no
  542. ENV: Agent did: predict-no for direction L in state State-A
  543. In State-A moving L
  544. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  545. predict error 0
  546. dir: dir isU
  547. \-/74: O: O148 (predict-no)
  548. I see 1 and I'm going to do: predict-no
  549. ENV: Agent did: predict-no for direction U in state State-A
  550. In State-A moving U
  551. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  552. predict error 0
  553. dir: dir isU
  554. |\75: O: O150 (predict-no)
  555. I see 1 and I'm going to do: predict-no
  556. ENV: Agent did: predict-no for direction U in state State-A
  557. In State-A moving U
  558. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  559. predict error 0
  560. dir: dir isL
  561. -/76: O: O152 (predict-no)
  562. I see 1 and I'm going to do: predict-no
  563. ENV: Agent did: predict-no for direction L in state State-A
  564. In State-A moving L
  565. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  566. predict error 0
  567. dir: dir isR
  568. |\-77: O: O154 (predict-no)
  569. I see 1 and I'm going to do: predict-no
  570. ENV: Agent did: predict-no for direction R in state State-A
  571. In State-A moving R
  572. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  573. predict error 1
  574. dir: dir isL
  575. /|\78: O: O156 (predict-no)
  576. I see 0 and I'm going to do: predict-no
  577. ENV: Agent did: predict-no for direction L in state State-B
  578. In State-B moving L
  579. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  580. predict error 1
  581. dir: dir isU
  582. -/|79: O: O158 (predict-no)
  583. I see 0 and I'm going to do: predict-no
  584. ENV: Agent did: predict-no for direction U in state State-A
  585. In State-A moving U
  586. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  587. predict error 0
  588. dir: dir isL
  589. \-/80: O: O159 (predict-yes)
  590. I see 1 and I'm going to do: predict-yes
  591. ENV: Agent did: predict-yes for direction L in state State-A
  592. In State-A moving L
  593. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  594. predict error 1
  595. dir: dir isU
  596. |\-81: O: O162 (predict-no)
  597. I see 0 and I'm going to do: predict-no
  598. ENV: Agent did: predict-no for direction U in state State-A
  599. In State-A moving U
  600. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  601. predict error 0
  602. dir: dir isU
  603. /82: O: O163 (predict-yes)
  604. I see 1 and I'm going to do: predict-yes
  605. ENV: Agent did: predict-yes for direction U in state State-A
  606. In State-A moving U
  607. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  608. predict error 1
  609. dir: dir isR
  610. |\-83: O: O165 (predict-yes)
  611. I see 0 and I'm going to do: predict-yes
  612. ENV: Agent did: predict-yes for direction R in state State-A
  613. In State-A moving R
  614. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  615. predict error 0
  616. dir: dir isU
  617. /|\84: O: O168 (predict-no)
  618. I see 1 and I'm going to do: predict-no
  619. ENV: Agent did: predict-no for direction U in state State-B
  620. In State-B moving U
  621. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  622. predict error 0
  623. dir: dir isL
  624. -/85: O: O170 (predict-no)
  625. I see 1 and I'm going to do: predict-no
  626. ENV: Agent did: predict-no for direction L in state State-B
  627. In State-B moving L
  628. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  629. predict error 1
  630. dir: dir isL
  631. |\-86: O: O172 (predict-no)
  632. I see 0 and I'm going to do: predict-no
  633. ENV: Agent did: predict-no for direction L in state State-A
  634. In State-A moving L
  635. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  636. predict error 0
  637. dir: dir isR
  638. /|\87: O: O173 (predict-yes)
  639. I see 1 and I'm going to do: predict-yes
  640. ENV: Agent did: predict-yes for direction R in state State-A
  641. In State-A moving R
  642. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  643. predict error 0
  644. dir: dir isL
  645. -/|88: O: O176 (predict-no)
  646. I see 1 and I'm going to do: predict-no
  647. ENV: Agent did: predict-no for direction L in state State-B
  648. In State-B moving L
  649. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  650. predict error 1
  651. dir: dir isL
  652. \-/89: O: O178 (predict-no)
  653. I see 0 and I'm going to do: predict-no
  654. ENV: Agent did: predict-no for direction L in state State-A
  655. In State-A moving L
  656. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  657. predict error 0
  658. dir: dir isR
  659. |\-90: O: O179 (predict-yes)
  660. I see 1 and I'm going to do: predict-yes
  661. ENV: Agent did: predict-yes for direction R in state State-A
  662. In State-A moving R
  663. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  664. predict error 0
  665. dir: dir isR
  666. /|91: O: O182 (predict-no)
  667. I see 1 and I'm going to do: predict-no
  668. ENV: Agent did: predict-no for direction R in state State-B
  669. In State-B moving R
  670. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  671. predict error 0
  672. dir: dir isL
  673. rule alias: '*'
  674. rule alias: '*'
  675. \92: O: O184 (predict-no)
  676. I see 1 and I'm going to do: predict-no
  677. ENV: Agent did: predict-no for direction L in state State-B
  678. In State-B moving L
  679. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  680. predict error 1
  681. dir: dir isL
  682. -/|93: O: O186 (predict-no)
  683. I see 0 and I'm going to do: predict-no
  684. ENV: Agent did: predict-no for direction L in state State-A
  685. In State-A moving L
  686. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  687. predict error 0
  688. dir: dir isU
  689. \-94: O: O188 (predict-no)
  690. I see 1 and I'm going to do: predict-no
  691. ENV: Agent did: predict-no for direction U in state State-A
  692. In State-A moving U
  693. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  694. predict error 0
  695. dir: dir isL
  696. /|95: O: O190 (predict-no)
  697. I see 1 and I'm going to do: predict-no
  698. ENV: Agent did: predict-no for direction L in state State-A
  699. In State-A moving L
  700. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  701. predict error 0
  702. dir: dir isU
  703. \-96: O: O192 (predict-no)
  704. I see 1 and I'm going to do: predict-no
  705. ENV: Agent did: predict-no for direction U in state State-A
  706. In State-A moving U
  707. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  708. predict error 0
  709. dir: dir isU
  710. /|97: O: O194 (predict-no)
  711. I see 1 and I'm going to do: predict-no
  712. ENV: Agent did: predict-no for direction U in state State-A
  713. In State-A moving U
  714. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  715. predict error 0
  716. dir: dir isR
  717. \-98: O: O195 (predict-yes)
  718. I see 1 and I'm going to do: predict-yes
  719. ENV: Agent did: predict-yes for direction R in state State-A
  720. In State-A moving R
  721. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  722. predict error 0
  723. dir: dir isR
  724. /|\99: O: O198 (predict-no)
  725. I see 1 and I'm going to do: predict-no
  726. ENV: Agent did: predict-no for direction R in state State-B
  727. In State-B moving R
  728. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  729. predict error 0
  730. dir: dir isR
  731. -/100: O: O200 (predict-no)
  732. I see 1 and I'm going to do: predict-no
  733. ENV: Agent did: predict-no for direction R in state State-B
  734. In State-B moving R
  735. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  736. predict error 0
  737. dir: dir isR
  738. |\101: O: O202 (predict-no)
  739. I see 1 and I'm going to do: predict-no
  740. ENV: Agent did: predict-no for direction R in state State-B
  741. In State-B moving R
  742. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  743. predict error 0
  744. dir: dir isR
  745. rule alias: '*'
  746. rule alias: '*'
  747. -/102: O: O204 (predict-no)
  748. I see 1 and I'm going to do: predict-no
  749. ENV: Agent did: predict-no for direction R in state State-B
  750. In State-B moving R
  751. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  752. predict error 0
  753. dir: dir isR
  754. |\-103: O: O206 (predict-no)
  755. I see 1 and I'm going to do: predict-no
  756. ENV: Agent did: predict-no for direction R in state State-B
  757. In State-B moving R
  758. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  759. predict error 0
  760. dir: dir isR
  761. /|\104: O: O208 (predict-no)
  762. I see 1 and I'm going to do: predict-no
  763. ENV: Agent did: predict-no for direction R in state State-B
  764. In State-B moving R
  765. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  766. predict error 0
  767. dir: dir isU
  768. -/105: O: O210 (predict-no)
  769. I see 1 and I'm going to do: predict-no
  770. ENV: Agent did: predict-no for direction U in state State-B
  771. In State-B moving U
  772. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  773. predict error 0
  774. dir: dir isR
  775. |\106: O: O212 (predict-no)
  776. I see 1 and I'm going to do: predict-no
  777. ENV: Agent did: predict-no for direction R in state State-B
  778. In State-B moving R
  779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  780. predict error 0
  781. dir: dir isR
  782. -/107: O: O214 (predict-no)
  783. I see 1 and I'm going to do: predict-no
  784. ENV: Agent did: predict-no for direction R in state State-B
  785. In State-B moving R
  786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  787. predict error 0
  788. dir: dir isU
  789. |\108: O: O216 (predict-no)
  790. I see 1 and I'm going to do: predict-no
  791. ENV: Agent did: predict-no for direction U in state State-B
  792. In State-B moving U
  793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  794. predict error 0
  795. dir: dir isL
  796. -/|109: O: O217 (predict-yes)
  797. I see 1 and I'm going to do: predict-yes
  798. ENV: Agent did: predict-yes for direction L in state State-B
  799. In State-B moving L
  800. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  801. predict error 0
  802. dir: dir isL
  803. \-/110: O: O220 (predict-no)
  804. I see 1 and I'm going to do: predict-no
  805. ENV: Agent did: predict-no for direction L in state State-A
  806. In State-A moving L
  807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  808. predict error 0
  809. dir: dir isU
  810. |\-111: O: O222 (predict-no)
  811. I see 1 and I'm going to do: predict-no
  812. ENV: Agent did: predict-no for direction U in state State-A
  813. In State-A moving U
  814. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  815. predict error 0
  816. dir: dir isR
  817. /112: O: O223 (predict-yes)
  818. I see 1 and I'm going to do: predict-yes
  819. ENV: Agent did: predict-yes for direction R in state State-A
  820. In State-A moving R
  821. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  822. predict error 0
  823. dir: dir isR
  824. |\-113: O: O226 (predict-no)
  825. I see 1 and I'm going to do: predict-no
  826. ENV: Agent did: predict-no for direction R in state State-B
  827. In State-B moving R
  828. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  829. predict error 0
  830. dir: dir isL
  831. /|\114: O: O227 (predict-yes)
  832. I see 1 and I'm going to do: predict-yes
  833. ENV: Agent did: predict-yes for direction L in state State-B
  834. In State-B moving L
  835. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  836. predict error 0
  837. dir: dir isR
  838. -/|115: O: O230 (predict-no)
  839. I see 1 and I'm going to do: predict-no
  840. ENV: Agent did: predict-no for direction R in state State-A
  841. In State-A moving R
  842. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  843. predict error 1
  844. dir: dir isR
  845. \-/116: O: O232 (predict-no)
  846. I see 0 and I'm going to do: predict-no
  847. ENV: Agent did: predict-no for direction R in state State-B
  848. In State-B moving R
  849. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  850. predict error 0
  851. dir: dir isL
  852. |\117: O: O233 (predict-yes)
  853. I see 1 and I'm going to do: predict-yes
  854. ENV: Agent did: predict-yes for direction L in state State-B
  855. In State-B moving L
  856. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  857. predict error 0
  858. dir: dir isR
  859. -/|118: O: O235 (predict-yes)
  860. I see 1 and I'm going to do: predict-yes
  861. ENV: Agent did: predict-yes for direction R in state State-A
  862. In State-A moving R
  863. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  864. predict error 0
  865. dir: dir isR
  866. \-119: O: O238 (predict-no)
  867. I see 1 and I'm going to do: predict-no
  868. ENV: Agent did: predict-no for direction R in state State-B
  869. In State-B moving R
  870. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  871. predict error 0
  872. dir: dir isL
  873. /|\-120: O: O239 (predict-yes)
  874. I see 1 and I'm going to do: predict-yes
  875. ENV: Agent did: predict-yes for direction L in state State-B
  876. In State-B moving L
  877. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  878. predict error 0
  879. dir: dir isR
  880. /|121: O: O241 (predict-yes)
  881. I see 1 and I'm going to do: predict-yes
  882. ENV: Agent did: predict-yes for direction R in state State-A
  883. In State-A moving R
  884. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  885. predict error 0
  886. dir: dir isR
  887. rule alias: '*'
  888. \122: O: O244 (predict-no)
  889. I see 1 and I'm going to do: predict-no
  890. ENV: Agent did: predict-no for direction R in state State-B
  891. In State-B moving R
  892. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  893. predict error 0
  894. dir: dir isU
  895. -/|123: O: O246 (predict-no)
  896. I see 1 and I'm going to do: predict-no
  897. ENV: Agent did: predict-no for direction U in state State-B
  898. In State-B moving U
  899. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  900. predict error 0
  901. dir: dir isR
  902. \-124: O: O248 (predict-no)
  903. I see 1 and I'm going to do: predict-no
  904. ENV: Agent did: predict-no for direction R in state State-B
  905. In State-B moving R
  906. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  907. predict error 0
  908. dir: dir isU
  909. /|\125: O: O250 (predict-no)
  910. I see 1 and I'm going to do: predict-no
  911. ENV: Agent did: predict-no for direction U in state State-B
  912. In State-B moving U
  913. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  914. predict error 0
  915. dir: dir isU
  916. -/126: O: O252 (predict-no)
  917. I see 1 and I'm going to do: predict-no
  918. ENV: Agent did: predict-no for direction U in state State-B
  919. In State-B moving U
  920. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  921. predict error 0
  922. dir: dir isL
  923. |\-127: O: O253 (predict-yes)
  924. I see 1 and I'm going to do: predict-yes
  925. ENV: Agent did: predict-yes for direction L in state State-B
  926. In State-B moving L
  927. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  928. predict error 0
  929. dir: dir isL
  930. /|128: O: O256 (predict-no)
  931. I see 1 and I'm going to do: predict-no
  932. ENV: Agent did: predict-no for direction L in state State-A
  933. In State-A moving L
  934. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  935. predict error 0
  936. dir: dir isU
  937. \-129: O: O257 (predict-yes)
  938. I see 1 and I'm going to do: predict-yes
  939. ENV: Agent did: predict-yes for direction U in state State-A
  940. In State-A moving U
  941. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  942. predict error 1
  943. dir: dir isU
  944. /|130: O: O259 (predict-yes)
  945. I see 0 and I'm going to do: predict-yes
  946. ENV: Agent did: predict-yes for direction U in state State-A
  947. In State-A moving U
  948. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  949. predict error 1
  950. dir: dir isL
  951. \-131: O: O262 (predict-no)
  952. I see 0 and I'm going to do: predict-no
  953. ENV: Agent did: predict-no for direction L in state State-A
  954. In State-A moving L
  955. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  956. predict error 0
  957. dir: dir isR
  958. rule alias: '*'
  959. rule alias: '*'
  960. /132: O: O263 (predict-yes)
  961. I see 1 and I'm going to do: predict-yes
  962. ENV: Agent did: predict-yes for direction R in state State-A
  963. In State-A moving R
  964. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  965. predict error 0
  966. dir: dir isR
  967. |\-133: O: O266 (predict-no)
  968. I see 1 and I'm going to do: predict-no
  969. ENV: Agent did: predict-no for direction R in state State-B
  970. In State-B moving R
  971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  972. predict error 0
  973. dir: dir isL
  974. /|\-sleeping...
  975. /134: O: O267 (predict-yes)
  976. I see 1 and I'm going to do: predict-yes
  977. ENV: Agent did: predict-yes for direction L in state State-B
  978. In State-B moving L
  979. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  980. predict error 0
  981. dir: dir isR
  982. |\-135: O: O269 (predict-yes)
  983. I see 1 and I'm going to do: predict-yes
  984. ENV: Agent did: predict-yes for direction R in state State-A
  985. In State-A moving R
  986. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  987. predict error 0
  988. dir: dir isL
  989. /|\136: O: O271 (predict-yes)
  990. I see 1 and I'm going to do: predict-yes
  991. ENV: Agent did: predict-yes for direction L in state State-B
  992. In State-B moving L
  993. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  994. predict error 0
  995. dir: dir isR
  996. -/137: O: O273 (predict-yes)
  997. I see 1 and I'm going to do: predict-yes
  998. ENV: Agent did: predict-yes for direction R in state State-A
  999. In State-A moving R
  1000. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1001. predict error 0
  1002. dir: dir isR
  1003. |\138: O: O276 (predict-no)
  1004. I see 1 and I'm going to do: predict-no
  1005. ENV: Agent did: predict-no for direction R in state State-B
  1006. In State-B moving R
  1007. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1008. predict error 0
  1009. dir: dir isL
  1010. -/|139: O: O277 (predict-yes)
  1011. I see 1 and I'm going to do: predict-yes
  1012. ENV: Agent did: predict-yes for direction L in state State-B
  1013. In State-B moving L
  1014. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1015. predict error 0
  1016. dir: dir isR
  1017. \-/140: O: O279 (predict-yes)
  1018. I see 1 and I'm going to do: predict-yes
  1019. ENV: Agent did: predict-yes for direction R in state State-A
  1020. In State-A moving R
  1021. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1022. predict error 0
  1023. dir: dir isU
  1024. |\141: O: O282 (predict-no)
  1025. I see 1 and I'm going to do: predict-no
  1026. ENV: Agent did: predict-no for direction U in state State-B
  1027. In State-B moving U
  1028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1029. predict error 0
  1030. dir: dir isL
  1031. rule alias: '*'
  1032. -142: O: O283 (predict-yes)
  1033. I see 1 and I'm going to do: predict-yes
  1034. ENV: Agent did: predict-yes for direction L in state State-B
  1035. In State-B moving L
  1036. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1037. predict error 0
  1038. dir: dir isR
  1039. /|\143: O: O285 (predict-yes)
  1040. I see 1 and I'm going to do: predict-yes
  1041. ENV: Agent did: predict-yes for direction R in state State-A
  1042. In State-A moving R
  1043. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1044. predict error 0
  1045. dir: dir isU
  1046. -/144: O: O288 (predict-no)
  1047. I see 1 and I'm going to do: predict-no
  1048. ENV: Agent did: predict-no for direction U in state State-B
  1049. In State-B moving U
  1050. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1051. predict error 0
  1052. dir: dir isL
  1053. |\-145: O: O289 (predict-yes)
  1054. I see 1 and I'm going to do: predict-yes
  1055. ENV: Agent did: predict-yes for direction L in state State-B
  1056. In State-B moving L
  1057. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1058. predict error 0
  1059. dir: dir isL
  1060. /146: O: O292 (predict-no)
  1061. I see 1 and I'm going to do: predict-no
  1062. ENV: Agent did: predict-no for direction L in state State-A
  1063. In State-A moving L
  1064. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1065. predict error 0
  1066. dir: dir isU
  1067. |147: O: O294 (predict-no)
  1068. I see 1 and I'm going to do: predict-no
  1069. ENV: Agent did: predict-no for direction U in state State-A
  1070. In State-A moving U
  1071. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1072. predict error 0
  1073. dir: dir isL
  1074. \-/148: O: O296 (predict-no)
  1075. I see 1 and I'm going to do: predict-no
  1076. ENV: Agent did: predict-no for direction L in state State-A
  1077. In State-A moving L
  1078. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1079. predict error 0
  1080. dir: dir isL
  1081. |\149: O: O298 (predict-no)
  1082. I see 1 and I'm going to do: predict-no
  1083. ENV: Agent did: predict-no for direction L in state State-A
  1084. In State-A moving L
  1085. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1086. predict error 0
  1087. dir: dir isR
  1088. -/150: O: O299 (predict-yes)
  1089. I see 1 and I'm going to do: predict-yes
  1090. ENV: Agent did: predict-yes for direction R in state State-A
  1091. In State-A moving R
  1092. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1093. predict error 0
  1094. dir: dir isU
  1095. |\-151: O: O302 (predict-no)
  1096. I see 1 and I'm going to do: predict-no
  1097. ENV: Agent did: predict-no for direction U in state State-B
  1098. In State-B moving U
  1099. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1100. predict error 0
  1101. dir: dir isL
  1102. /152: O: O303 (predict-yes)
  1103. I see 1 and I'm going to do: predict-yes
  1104. ENV: Agent did: predict-yes for direction L in state State-B
  1105. In State-B moving L
  1106. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1107. predict error 0
  1108. dir: dir isL
  1109. |\153: O: O306 (predict-no)
  1110. I see 1 and I'm going to do: predict-no
  1111. ENV: Agent did: predict-no for direction L in state State-A
  1112. In State-A moving L
  1113. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1114. predict error 0
  1115. dir: dir isR
  1116. -/|154: O: O307 (predict-yes)
  1117. I see 1 and I'm going to do: predict-yes
  1118. ENV: Agent did: predict-yes for direction R in state State-A
  1119. In State-A moving R
  1120. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1121. predict error 0
  1122. dir: dir isU
  1123. \-/155: O: O310 (predict-no)
  1124. I see 1 and I'm going to do: predict-no
  1125. ENV: Agent did: predict-no for direction U in state State-B
  1126. In State-B moving U
  1127. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1128. predict error 0
  1129. dir: dir isR
  1130. |\156: O: O312 (predict-no)
  1131. I see 1 and I'm going to do: predict-no
  1132. ENV: Agent did: predict-no for direction R in state State-B
  1133. In State-B moving R
  1134. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1135. predict error 0
  1136. dir: dir isU
  1137. -/|157: O: O314 (predict-no)
  1138. I see 1 and I'm going to do: predict-no
  1139. ENV: Agent did: predict-no for direction U in state State-B
  1140. In State-B moving U
  1141. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1142. predict error 0
  1143. dir: dir isL
  1144. \-/158: O: O315 (predict-yes)
  1145. I see 1 and I'm going to do: predict-yes
  1146. ENV: Agent did: predict-yes for direction L in state State-B
  1147. In State-B moving L
  1148. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1149. predict error 0
  1150. dir: dir isR
  1151. |\-159: O: O317 (predict-yes)
  1152. I see 1 and I'm going to do: predict-yes
  1153. ENV: Agent did: predict-yes for direction R in state State-A
  1154. In State-A moving R
  1155. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1156. predict error 0
  1157. dir: dir isR
  1158. /|\160: O: O320 (predict-no)
  1159. I see 1 and I'm going to do: predict-no
  1160. ENV: Agent did: predict-no for direction R in state State-B
  1161. In State-B moving R
  1162. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1163. predict error 0
  1164. dir: dir isU
  1165. -/161: O: O322 (predict-no)
  1166. I see 1 and I'm going to do: predict-no
  1167. ENV: Agent did: predict-no for direction U in state State-B
  1168. In State-B moving U
  1169. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1170. predict error 0
  1171. dir: dir isR
  1172. |162: O: O324 (predict-no)
  1173. I see 1 and I'm going to do: predict-no
  1174. ENV: Agent did: predict-no for direction R in state State-B
  1175. In State-B moving R
  1176. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1177. predict error 0
  1178. dir: dir isL
  1179. \-163: O: O325 (predict-yes)
  1180. I see 1 and I'm going to do: predict-yes
  1181. ENV: Agent did: predict-yes for direction L in state State-B
  1182. In State-B moving L
  1183. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1184. predict error 0
  1185. dir: dir isL
  1186. /|164: O: O328 (predict-no)
  1187. I see 1 and I'm going to do: predict-no
  1188. ENV: Agent did: predict-no for direction L in state State-A
  1189. In State-A moving L
  1190. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1191. predict error 0
  1192. dir: dir isR
  1193. \-/165: O: O329 (predict-yes)
  1194. I see 1 and I'm going to do: predict-yes
  1195. ENV: Agent did: predict-yes for direction R in state State-A
  1196. In State-A moving R
  1197. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1198. predict error 0
  1199. dir: dir isL
  1200. |\-/166: O: O331 (predict-yes)
  1201. I see 1 and I'm going to do: predict-yes
  1202. ENV: Agent did: predict-yes for direction L in state State-B
  1203. In State-B moving L
  1204. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1205. predict error 0
  1206. dir: dir isU
  1207. |\-167: O: O334 (predict-no)
  1208. I see 1 and I'm going to do: predict-no
  1209. ENV: Agent did: predict-no for direction U in state State-A
  1210. In State-A moving U
  1211. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1212. predict error 0
  1213. dir: dir isU
  1214. /|\168: O: O336 (predict-no)
  1215. I see 1 and I'm going to do: predict-no
  1216. ENV: Agent did: predict-no for direction U in state State-A
  1217. In State-A moving U
  1218. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1219. predict error 0
  1220. dir: dir isL
  1221. -/|169: O: O338 (predict-no)
  1222. I see 1 and I'm going to do: predict-no
  1223. ENV: Agent did: predict-no for direction L in state State-A
  1224. In State-A moving L
  1225. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1226. predict error 0
  1227. dir: dir isL
  1228. \-/170: O: O340 (predict-no)
  1229. I see 1 and I'm going to do: predict-no
  1230. ENV: Agent did: predict-no for direction L in state State-A
  1231. In State-A moving L
  1232. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1233. predict error 0
  1234. dir: dir isU
  1235. |\-171: O: O342 (predict-no)
  1236. I see 1 and I'm going to do: predict-no
  1237. ENV: Agent did: predict-no for direction U in state State-A
  1238. In State-A moving U
  1239. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1240. predict error 0
  1241. dir: dir isR
  1242. /172: O: O343 (predict-yes)
  1243. I see 1 and I'm going to do: predict-yes
  1244. ENV: Agent did: predict-yes for direction R in state State-A
  1245. In State-A moving R
  1246. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1247. predict error 0
  1248. dir: dir isU
  1249. |\-173: O: O346 (predict-no)
  1250. I see 1 and I'm going to do: predict-no
  1251. ENV: Agent did: predict-no for direction U in state State-B
  1252. In State-B moving U
  1253. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1254. predict error 0
  1255. dir: dir isR
  1256. /|\174: O: O347 (predict-yes)
  1257. I see 1 and I'm going to do: predict-yes
  1258. ENV: Agent did: predict-yes for direction R in state State-B
  1259. In State-B moving R
  1260. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1261. predict error 1
  1262. dir: dir isL
  1263. -/|175: O: O349 (predict-yes)
  1264. I see 0 and I'm going to do: predict-yes
  1265. ENV: Agent did: predict-yes for direction L in state State-B
  1266. In State-B moving L
  1267. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1268. predict error 0
  1269. dir: dir isL
  1270. \-/176: O: O352 (predict-no)
  1271. I see 1 and I'm going to do: predict-no
  1272. ENV: Agent did: predict-no for direction L in state State-A
  1273. In State-A moving L
  1274. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1275. predict error 0
  1276. dir: dir isU
  1277. |\177: O: O354 (predict-no)
  1278. I see 1 and I'm going to do: predict-no
  1279. ENV: Agent did: predict-no for direction U in state State-A
  1280. In State-A moving U
  1281. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1282. predict error 0
  1283. dir: dir isL
  1284. -/|\178: O: O356 (predict-no)
  1285. I see 1 and I'm going to do: predict-no
  1286. ENV: Agent did: predict-no for direction L in state State-A
  1287. In State-A moving L
  1288. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1289. predict error 0
  1290. dir: dir isR
  1291. -/179: O: O357 (predict-yes)
  1292. I see 1 and I'm going to do: predict-yes
  1293. ENV: Agent did: predict-yes for direction R in state State-A
  1294. In State-A moving R
  1295. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1296. predict error 0
  1297. dir: dir isR
  1298. |\-180: O: O360 (predict-no)
  1299. I see 1 and I'm going to do: predict-no
  1300. ENV: Agent did: predict-no for direction R in state State-B
  1301. In State-B moving R
  1302. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1303. predict error 0
  1304. dir: dir isR
  1305. /181: O: O362 (predict-no)
  1306. I see 1 and I'm going to do: predict-no
  1307. ENV: Agent did: predict-no for direction R in state State-B
  1308. In State-B moving R
  1309. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1310. predict error 0
  1311. dir: dir isR
  1312. |182: O: O364 (predict-no)
  1313. I see 1 and I'm going to do: predict-no
  1314. ENV: Agent did: predict-no for direction R in state State-B
  1315. In State-B moving R
  1316. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1317. predict error 0
  1318. dir: dir isU
  1319. \-/183: O: O366 (predict-no)
  1320. I see 1 and I'm going to do: predict-no
  1321. ENV: Agent did: predict-no for direction U in state State-B
  1322. In State-B moving U
  1323. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1324. predict error 0
  1325. dir: dir isL
  1326. |\-184: O: O367 (predict-yes)
  1327. I see 1 and I'm going to do: predict-yes
  1328. ENV: Agent did: predict-yes for direction L in state State-B
  1329. In State-B moving L
  1330. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1331. predict error 0
  1332. dir: dir isL
  1333. /|185: O: O370 (predict-no)
  1334. I see 1 and I'm going to do: predict-no
  1335. ENV: Agent did: predict-no for direction L in state State-A
  1336. In State-A moving L
  1337. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1338. predict error 0
  1339. dir: dir isU
  1340. \-/186: O: O372 (predict-no)
  1341. I see 1 and I'm going to do: predict-no
  1342. ENV: Agent did: predict-no for direction U in state State-A
  1343. In State-A moving U
  1344. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1345. predict error 0
  1346. dir: dir isU
  1347. |\187: O: O374 (predict-no)
  1348. I see 1 and I'm going to do: predict-no
  1349. ENV: Agent did: predict-no for direction U in state State-A
  1350. In State-A moving U
  1351. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1352. predict error 0
  1353. dir: dir isU
  1354. -/|188: O: O375 (predict-yes)
  1355. I see 1 and I'm going to do: predict-yes
  1356. ENV: Agent did: predict-yes for direction U in state State-A
  1357. In State-A moving U
  1358. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1359. predict error 1
  1360. dir: dir isR
  1361. \-189: O: O377 (predict-yes)
  1362. I see 0 and I'm going to do: predict-yes
  1363. ENV: Agent did: predict-yes for direction R in state State-A
  1364. In State-A moving R
  1365. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1366. predict error 0
  1367. dir: dir isU
  1368. /|190: O: O380 (predict-no)
  1369. I see 1 and I'm going to do: predict-no
  1370. ENV: Agent did: predict-no for direction U in state State-B
  1371. In State-B moving U
  1372. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1373. predict error 0
  1374. dir: dir isU
  1375. \-/191: O: O382 (predict-no)
  1376. I see 1 and I'm going to do: predict-no
  1377. ENV: Agent did: predict-no for direction U in state State-B
  1378. In State-B moving U
  1379. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1380. predict error 0
  1381. dir: dir isL
  1382. |192: O: O383 (predict-yes)
  1383. I see 1 and I'm going to do: predict-yes
  1384. ENV: Agent did: predict-yes for direction L in state State-B
  1385. In State-B moving L
  1386. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1387. predict error 0
  1388. dir: dir isU
  1389. \-/193: O: O386 (predict-no)
  1390. I see 1 and I'm going to do: predict-no
  1391. ENV: Agent did: predict-no for direction U in state State-A
  1392. In State-A moving U
  1393. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1394. predict error 0
  1395. dir: dir isU
  1396. |\194: O: O388 (predict-no)
  1397. I see 1 and I'm going to do: predict-no
  1398. ENV: Agent did: predict-no for direction U in state State-A
  1399. In State-A moving U
  1400. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1401. predict error 0
  1402. dir: dir isR
  1403. -/195: O: O389 (predict-yes)
  1404. I see 1 and I'm going to do: predict-yes
  1405. ENV: Agent did: predict-yes for direction R in state State-A
  1406. In State-A moving R
  1407. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1408. predict error 0
  1409. dir: dir isR
  1410. |\-196: O: O392 (predict-no)
  1411. I see 1 and I'm going to do: predict-no
  1412. ENV: Agent did: predict-no for direction R in state State-B
  1413. In State-B moving R
  1414. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1415. predict error 0
  1416. dir: dir isL
  1417. /|\197: O: O393 (predict-yes)
  1418. I see 1 and I'm going to do: predict-yes
  1419. ENV: Agent did: predict-yes for direction L in state State-B
  1420. In State-B moving L
  1421. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1422. predict error 0
  1423. dir: dir isR
  1424. -/|198: O: O395 (predict-yes)
  1425. I see 1 and I'm going to do: predict-yes
  1426. ENV: Agent did: predict-yes for direction R in state State-A
  1427. In State-A moving R
  1428. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1429. predict error 0
  1430. dir: dir isL
  1431. \-199: O: O397 (predict-yes)
  1432. I see 1 and I'm going to do: predict-yes
  1433. ENV: Agent did: predict-yes for direction L in state State-B
  1434. In State-B moving L
  1435. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1436. predict error 0
  1437. dir: dir isL
  1438. /|\200: O: O400 (predict-no)
  1439. I see 1 and I'm going to do: predict-no
  1440. ENV: Agent did: predict-no for direction L in state State-A
  1441. In State-A moving L
  1442. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1443. predict error 0
  1444. dir: dir isR
  1445. -/|201: O: O401 (predict-yes)
  1446. I see 1 and I'm going to do: predict-yes
  1447. ENV: Agent did: predict-yes for direction R in state State-A
  1448. In State-A moving R
  1449. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1450. predict error 0
  1451. dir: dir isL
  1452. \202: O: O403 (predict-yes)
  1453. I see 1 and I'm going to do: predict-yes
  1454. ENV: Agent did: predict-yes for direction L in state State-B
  1455. In State-B moving L
  1456. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1457. predict error 0
  1458. dir: dir isU
  1459. -/|203: O: O406 (predict-no)
  1460. I see 1 and I'm going to do: predict-no
  1461. ENV: Agent did: predict-no for direction U in state State-A
  1462. In State-A moving U
  1463. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1464. predict error 0
  1465. dir: dir isL
  1466. \-/204: O: O408 (predict-no)
  1467. I see 1 and I'm going to do: predict-no
  1468. ENV: Agent did: predict-no for direction L in state State-A
  1469. In State-A moving L
  1470. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1471. predict error 0
  1472. dir: dir isU
  1473. |\-205: O: O410 (predict-no)
  1474. I see 1 and I'm going to do: predict-no
  1475. ENV: Agent did: predict-no for direction U in state State-A
  1476. In State-A moving U
  1477. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1478. predict error 0
  1479. dir: dir isU
  1480. /|206: O: O411 (predict-yes)
  1481. I see 1 and I'm going to do: predict-yes
  1482. ENV: Agent did: predict-yes for direction U in state State-A
  1483. In State-A moving U
  1484. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1485. predict error 1
  1486. dir: dir isU
  1487. \-/207: O: O414 (predict-no)
  1488. I see 0 and I'm going to do: predict-no
  1489. ENV: Agent did: predict-no for direction U in state State-A
  1490. In State-A moving U
  1491. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1492. predict error 0
  1493. dir: dir isU
  1494. |\-208: O: O416 (predict-no)
  1495. I see 1 and I'm going to do: predict-no
  1496. ENV: Agent did: predict-no for direction U in state State-A
  1497. In State-A moving U
  1498. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1499. predict error 0
  1500. dir: dir isU
  1501. /|209: O: O418 (predict-no)
  1502. I see 1 and I'm going to do: predict-no
  1503. ENV: Agent did: predict-no for direction U in state State-A
  1504. In State-A moving U
  1505. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1506. predict error 0
  1507. dir: dir isR
  1508. \-/210: O: O419 (predict-yes)
  1509. I see 1 and I'm going to do: predict-yes
  1510. ENV: Agent did: predict-yes for direction R in state State-A
  1511. In State-A moving R
  1512. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1513. predict error 0
  1514. dir: dir isL
  1515. |\-211: O: O421 (predict-yes)
  1516. I see 1 and I'm going to do: predict-yes
  1517. ENV: Agent did: predict-yes for direction L in state State-B
  1518. In State-B moving L
  1519. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1520. predict error 0
  1521. dir: dir isU
  1522. /212: O: O423 (predict-yes)
  1523. I see 1 and I'm going to do: predict-yes
  1524. ENV: Agent did: predict-yes for direction U in state State-A
  1525. In State-A moving U
  1526. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1527. predict error 1
  1528. dir: dir isL
  1529. |\213: O: O425 (predict-yes)
  1530. I see 0 and I'm going to do: predict-yes
  1531. ENV: Agent did: predict-yes for direction L in state State-A
  1532. In State-A moving L
  1533. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1534. predict error 1
  1535. dir: dir isL
  1536. -/|214: O: O428 (predict-no)
  1537. I see 0 and I'm going to do: predict-no
  1538. ENV: Agent did: predict-no for direction L in state State-A
  1539. In State-A moving L
  1540. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1541. predict error 0
  1542. dir: dir isR
  1543. \-/215: O: O429 (predict-yes)
  1544. I see 1 and I'm going to do: predict-yes
  1545. ENV: Agent did: predict-yes for direction R in state State-A
  1546. In State-A moving R
  1547. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1548. predict error 0
  1549. dir: dir isR
  1550. |\-/216: O: O432 (predict-no)
  1551. I see 1 and I'm going to do: predict-no
  1552. ENV: Agent did: predict-no for direction R in state State-B
  1553. In State-B moving R
  1554. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1555. predict error 0
  1556. dir: dir isR
  1557. |\217: O: O434 (predict-no)
  1558. I see 1 and I'm going to do: predict-no
  1559. ENV: Agent did: predict-no for direction R in state State-B
  1560. In State-B moving R
  1561. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1562. predict error 0
  1563. dir: dir isU
  1564. -/218: O: O436 (predict-no)
  1565. I see 1 and I'm going to do: predict-no
  1566. ENV: Agent did: predict-no for direction U in state State-B
  1567. In State-B moving U
  1568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1569. predict error 0
  1570. dir: dir isR
  1571. |\-219: O: O438 (predict-no)
  1572. I see 1 and I'm going to do: predict-no
  1573. ENV: Agent did: predict-no for direction R in state State-B
  1574. In State-B moving R
  1575. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1576. predict error 0
  1577. dir: dir isR
  1578. /|\220: O: O440 (predict-no)
  1579. I see 1 and I'm going to do: predict-no
  1580. ENV: Agent did: predict-no for direction R in state State-B
  1581. In State-B moving R
  1582. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1583. predict error 0
  1584. dir: dir isL
  1585. -/|\221: O: O441 (predict-yes)
  1586. I see 1 and I'm going to do: predict-yes
  1587. ENV: Agent did: predict-yes for direction L in state State-B
  1588. In State-B moving L
  1589. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1590. predict error 0
  1591. dir: dir isU
  1592. -222: O: O444 (predict-no)
  1593. I see 1 and I'm going to do: predict-no
  1594. ENV: Agent did: predict-no for direction U in state State-A
  1595. In State-A moving U
  1596. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1597. predict error 0
  1598. dir: dir isR
  1599. /|\223: O: O445 (predict-yes)
  1600. I see 1 and I'm going to do: predict-yes
  1601. ENV: Agent did: predict-yes for direction R in state State-A
  1602. In State-A moving R
  1603. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1604. predict error 0
  1605. dir: dir isU
  1606. -/|224: O: O447 (predict-yes)
  1607. I see 1 and I'm going to do: predict-yes
  1608. ENV: Agent did: predict-yes for direction U in state State-B
  1609. In State-B moving U
  1610. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1611. predict error 1
  1612. dir: dir isR
  1613. \-/225: O: O450 (predict-no)
  1614. I see 0 and I'm going to do: predict-no
  1615. ENV: Agent did: predict-no for direction R in state State-B
  1616. In State-B moving R
  1617. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1618. predict error 0
  1619. dir: dir isL
  1620. |\-/226: O: O451 (predict-yes)
  1621. I see 1 and I'm going to do: predict-yes
  1622. ENV: Agent did: predict-yes for direction L in state State-B
  1623. In State-B moving L
  1624. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1625. predict error 0
  1626. dir: dir isR
  1627. |\-/sleeping...
  1628. |227: O: O453 (predict-yes)
  1629. I see 1 and I'm going to do: predict-yes
  1630. ENV: Agent did: predict-yes for direction R in state State-A
  1631. In State-A moving R
  1632. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1633. predict error 0
  1634. dir: dir isU
  1635. \-/228: O: O456 (predict-no)
  1636. I see 1 and I'm going to do: predict-no
  1637. ENV: Agent did: predict-no for direction U in state State-B
  1638. In State-B moving U
  1639. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1640. predict error 0
  1641. dir: dir isL
  1642. |\-229: O: O457 (predict-yes)
  1643. I see 1 and I'm going to do: predict-yes
  1644. ENV: Agent did: predict-yes for direction L in state State-B
  1645. In State-B moving L
  1646. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1647. predict error 0
  1648. dir: dir isU
  1649. /|230: O: O459 (predict-yes)
  1650. I see 1 and I'm going to do: predict-yes
  1651. ENV: Agent did: predict-yes for direction U in state State-A
  1652. In State-A moving U
  1653. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1654. predict error 1
  1655. dir: dir isU
  1656. \231: O: O462 (predict-no)
  1657. I see 0 and I'm going to do: predict-no
  1658. ENV: Agent did: predict-no for direction U in state State-A
  1659. In State-A moving U
  1660. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1661. predict error 0
  1662. dir: dir isU
  1663. -232: O: O464 (predict-no)
  1664. I see 1 and I'm going to do: predict-no
  1665. ENV: Agent did: predict-no for direction U in state State-A
  1666. In State-A moving U
  1667. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1668. predict error 0
  1669. dir: dir isU
  1670. /|233: O: O466 (predict-no)
  1671. I see 1 and I'm going to do: predict-no
  1672. ENV: Agent did: predict-no for direction U in state State-A
  1673. In State-A moving U
  1674. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1675. predict error 0
  1676. dir: dir isL
  1677. \-/|234: O: O468 (predict-no)
  1678. I see 1 and I'm going to do: predict-no
  1679. ENV: Agent did: predict-no for direction L in state State-A
  1680. In State-A moving L
  1681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1682. predict error 0
  1683. dir: dir isR
  1684. \-/235: O: O469 (predict-yes)
  1685. I see 1 and I'm going to do: predict-yes
  1686. ENV: Agent did: predict-yes for direction R in state State-A
  1687. In State-A moving R
  1688. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1689. predict error 0
  1690. dir: dir isU
  1691. |\-236: O: O472 (predict-no)
  1692. I see 1 and I'm going to do: predict-no
  1693. ENV: Agent did: predict-no for direction U in state State-B
  1694. In State-B moving U
  1695. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1696. predict error 0
  1697. dir: dir isL
  1698. /|237: O: O474 (predict-no)
  1699. I see 1 and I'm going to do: predict-no
  1700. ENV: Agent did: predict-no for direction L in state State-B
  1701. In State-B moving L
  1702. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1703. predict error 1
  1704. dir: dir isL
  1705. \-/238: O: O476 (predict-no)
  1706. I see 0 and I'm going to do: predict-no
  1707. ENV: Agent did: predict-no for direction L in state State-A
  1708. In State-A moving L
  1709. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1710. predict error 0
  1711. dir: dir isL
  1712. |\-239: O: O478 (predict-no)
  1713. I see 1 and I'm going to do: predict-no
  1714. ENV: Agent did: predict-no for direction L in state State-A
  1715. In State-A moving L
  1716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1717. predict error 0
  1718. dir: dir isR
  1719. /|\240: O: O479 (predict-yes)
  1720. I see 1 and I'm going to do: predict-yes
  1721. ENV: Agent did: predict-yes for direction R in state State-A
  1722. In State-A moving R
  1723. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1724. predict error 0
  1725. dir: dir isR
  1726. -/|241: O: O482 (predict-no)
  1727. I see 1 and I'm going to do: predict-no
  1728. ENV: Agent did: predict-no for direction R in state State-B
  1729. In State-B moving R
  1730. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1731. predict error 0
  1732. dir: dir isR
  1733. \242: O: O484 (predict-no)
  1734. I see 1 and I'm going to do: predict-no
  1735. ENV: Agent did: predict-no for direction R in state State-B
  1736. In State-B moving R
  1737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1738. predict error 0
  1739. dir: dir isU
  1740. -/|243: O: O486 (predict-no)
  1741. I see 1 and I'm going to do: predict-no
  1742. ENV: Agent did: predict-no for direction U in state State-B
  1743. In State-B moving U
  1744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1745. predict error 0
  1746. dir: dir isL
  1747. \-244: O: O487 (predict-yes)
  1748. I see 1 and I'm going to do: predict-yes
  1749. ENV: Agent did: predict-yes for direction L in state State-B
  1750. In State-B moving L
  1751. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1752. predict error 0
  1753. dir: dir isL
  1754. /|245: O: O490 (predict-no)
  1755. I see 1 and I'm going to do: predict-no
  1756. ENV: Agent did: predict-no for direction L in state State-A
  1757. In State-A moving L
  1758. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1759. predict error 0
  1760. dir: dir isL
  1761. \-246: O: O492 (predict-no)
  1762. I see 1 and I'm going to do: predict-no
  1763. ENV: Agent did: predict-no for direction L in state State-A
  1764. In State-A moving L
  1765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1766. predict error 0
  1767. dir: dir isL
  1768. /|\-247: O: O494 (predict-no)
  1769. I see 1 and I'm going to do: predict-no
  1770. ENV: Agent did: predict-no for direction L in state State-A
  1771. In State-A moving L
  1772. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1773. predict error 0
  1774. dir: dir isL
  1775. /|\248: O: O496 (predict-no)
  1776. I see 1 and I'm going to do: predict-no
  1777. ENV: Agent did: predict-no for direction L in state State-A
  1778. In State-A moving L
  1779. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1780. predict error 0
  1781. dir: dir isU
  1782. -/|249: O: O498 (predict-no)
  1783. I see 1 and I'm going to do: predict-no
  1784. ENV: Agent did: predict-no for direction U in state State-A
  1785. In State-A moving U
  1786. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1787. predict error 0
  1788. dir: dir isU
  1789. \-/250: O: O500 (predict-no)
  1790. I see 1 and I'm going to do: predict-no
  1791. ENV: Agent did: predict-no for direction U in state State-A
  1792. In State-A moving U
  1793. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1794. predict error 0
  1795. dir: dir isL
  1796. |\-251: O: O502 (predict-no)
  1797. I see 1 and I'm going to do: predict-no
  1798. ENV: Agent did: predict-no for direction L in state State-A
  1799. In State-A moving L
  1800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1801. predict error 0
  1802. dir: dir isR
  1803. /252: O: O503 (predict-yes)
  1804. I see 1 and I'm going to do: predict-yes
  1805. ENV: Agent did: predict-yes for direction R in state State-A
  1806. In State-A moving R
  1807. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1808. predict error 0
  1809. dir: dir isL
  1810. |\-253: O: O505 (predict-yes)
  1811. I see 1 and I'm going to do: predict-yes
  1812. ENV: Agent did: predict-yes for direction L in state State-B
  1813. In State-B moving L
  1814. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1815. predict error 0
  1816. dir: dir isL
  1817. /|254: O: O508 (predict-no)
  1818. I see 1 and I'm going to do: predict-no
  1819. ENV: Agent did: predict-no for direction L in state State-A
  1820. In State-A moving L
  1821. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1822. predict error 0
  1823. dir: dir isR
  1824. \-/|255: O: O509 (predict-yes)
  1825. I see 1 and I'm going to do: predict-yes
  1826. ENV: Agent did: predict-yes for direction R in state State-A
  1827. In State-A moving R
  1828. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1829. predict error 0
  1830. dir: dir isR
  1831. \-256: O: O512 (predict-no)
  1832. I see 1 and I'm going to do: predict-no
  1833. ENV: Agent did: predict-no for direction R in state State-B
  1834. In State-B moving R
  1835. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1836. predict error 0
  1837. dir: dir isR
  1838. /|257: O: O514 (predict-no)
  1839. I see 1 and I'm going to do: predict-no
  1840. ENV: Agent did: predict-no for direction R in state State-B
  1841. In State-B moving R
  1842. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1843. predict error 0
  1844. dir: dir isR
  1845. \-/258: O: O516 (predict-no)
  1846. I see 1 and I'm going to do: predict-no
  1847. ENV: Agent did: predict-no for direction R in state State-B
  1848. In State-B moving R
  1849. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1850. predict error 0
  1851. dir: dir isL
  1852. |\-259: O: O517 (predict-yes)
  1853. I see 1 and I'm going to do: predict-yes
  1854. ENV: Agent did: predict-yes for direction L in state State-B
  1855. In State-B moving L
  1856. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1857. predict error 0
  1858. dir: dir isL
  1859. /|260: O: O520 (predict-no)
  1860. I see 1 and I'm going to do: predict-no
  1861. ENV: Agent did: predict-no for direction L in state State-A
  1862. In State-A moving L
  1863. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1864. predict error 0
  1865. dir: dir isL
  1866. \-/261: O: O522 (predict-no)
  1867. I see 1 and I'm going to do: predict-no
  1868. ENV: Agent did: predict-no for direction L in state State-A
  1869. In State-A moving L
  1870. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1871. predict error 0
  1872. dir: dir isR
  1873. |262: O: O523 (predict-yes)
  1874. I see 1 and I'm going to do: predict-yes
  1875. ENV: Agent did: predict-yes for direction R in state State-A
  1876. In State-A moving R
  1877. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1878. predict error 0
  1879. dir: dir isU
  1880. \-/263: O: O526 (predict-no)
  1881. I see 1 and I'm going to do: predict-no
  1882. ENV: Agent did: predict-no for direction U in state State-B
  1883. In State-B moving U
  1884. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1885. predict error 0
  1886. dir: dir isR
  1887. |\-264: O: O528 (predict-no)
  1888. I see 1 and I'm going to do: predict-no
  1889. ENV: Agent did: predict-no for direction R in state State-B
  1890. In State-B moving R
  1891. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1892. predict error 0
  1893. dir: dir isL
  1894. /|\265: O: O529 (predict-yes)
  1895. I see 1 and I'm going to do: predict-yes
  1896. ENV: Agent did: predict-yes for direction L in state State-B
  1897. In State-B moving L
  1898. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1899. predict error 0
  1900. dir: dir isL
  1901. -/266: O: O532 (predict-no)
  1902. I see 1 and I'm going to do: predict-no
  1903. ENV: Agent did: predict-no for direction L in state State-A
  1904. In State-A moving L
  1905. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1906. predict error 0
  1907. dir: dir isR
  1908. |\267: O: O533 (predict-yes)
  1909. I see 1 and I'm going to do: predict-yes
  1910. ENV: Agent did: predict-yes for direction R in state State-A
  1911. In State-A moving R
  1912. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1913. predict error 0
  1914. dir: dir isR
  1915. -/268: O: O536 (predict-no)
  1916. I see 1 and I'm going to do: predict-no
  1917. ENV: Agent did: predict-no for direction R in state State-B
  1918. In State-B moving R
  1919. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1920. predict error 0
  1921. dir: dir isU
  1922. |\269: O: O538 (predict-no)
  1923. I see 1 and I'm going to do: predict-no
  1924. ENV: Agent did: predict-no for direction U in state State-B
  1925. In State-B moving U
  1926. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1927. predict error 0
  1928. dir: dir isR
  1929. -/|270: O: O540 (predict-no)
  1930. I see 1 and I'm going to do: predict-no
  1931. ENV: Agent did: predict-no for direction R in state State-B
  1932. In State-B moving R
  1933. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1934. predict error 0
  1935. dir: dir isU
  1936. \-/271: O: O542 (predict-no)
  1937. I see 1 and I'm going to do: predict-no
  1938. ENV: Agent did: predict-no for direction U in state State-B
  1939. In State-B moving U
  1940. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1941. predict error 0
  1942. dir: dir isR
  1943. |272: O: O544 (predict-no)
  1944. I see 1 and I'm going to do: predict-no
  1945. ENV: Agent did: predict-no for direction R in state State-B
  1946. In State-B moving R
  1947. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1948. predict error 0
  1949. dir: dir isR
  1950. \-/273: O: O546 (predict-no)
  1951. I see 1 and I'm going to do: predict-no
  1952. ENV: Agent did: predict-no for direction R in state State-B
  1953. In State-B moving R
  1954. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1955. predict error 0
  1956. dir: dir isR
  1957. |\-274: O: O548 (predict-no)
  1958. I see 1 and I'm going to do: predict-no
  1959. ENV: Agent did: predict-no for direction R in state State-B
  1960. In State-B moving R
  1961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1962. predict error 0
  1963. dir: dir isR
  1964. /|275: O: O550 (predict-no)
  1965. I see 1 and I'm going to do: predict-no
  1966. ENV: Agent did: predict-no for direction R in state State-B
  1967. In State-B moving R
  1968. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1969. predict error 0
  1970. dir: dir isU
  1971. \-/276: O: O552 (predict-no)
  1972. I see 1 and I'm going to do: predict-no
  1973. ENV: Agent did: predict-no for direction U in state State-B
  1974. In State-B moving U
  1975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1976. predict error 0
  1977. dir: dir isR
  1978. |\-/277: O: O554 (predict-no)
  1979. I see 1 and I'm going to do: predict-no
  1980. ENV: Agent did: predict-no for direction R in state State-B
  1981. In State-B moving R
  1982. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1983. predict error 0
  1984. dir: dir isU
  1985. |\-278: O: O556 (predict-no)
  1986. I see 1 and I'm going to do: predict-no
  1987. ENV: Agent did: predict-no for direction U in state State-B
  1988. In State-B moving U
  1989. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1990. predict error 0
  1991. dir: dir isR
  1992. /|\279: O: O558 (predict-no)
  1993. I see 1 and I'm going to do: predict-no
  1994. ENV: Agent did: predict-no for direction R in state State-B
  1995. In State-B moving R
  1996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1997. predict error 0
  1998. dir: dir isL
  1999. -/280: O: O560 (predict-no)
  2000. I see 1 and I'm going to do: predict-no
  2001. ENV: Agent did: predict-no for direction L in state State-B
  2002. In State-B moving L
  2003. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2004. predict error 1
  2005. dir: dir isR
  2006. |\281: O: O561 (predict-yes)
  2007. I see 0 and I'm going to do: predict-yes
  2008. ENV: Agent did: predict-yes for direction R in state State-A
  2009. In State-A moving R
  2010. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2011. predict error 0
  2012. dir: dir isU
  2013. -282: O: O564 (predict-no)
  2014. I see 1 and I'm going to do: predict-no
  2015. ENV: Agent did: predict-no for direction U in state State-B
  2016. In State-B moving U
  2017. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2018. predict error 0
  2019. dir: dir isL
  2020. /|\283: O: O565 (predict-yes)
  2021. I see 1 and I'm going to do: predict-yes
  2022. ENV: Agent did: predict-yes for direction L in state State-B
  2023. In State-B moving L
  2024. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2025. predict error 0
  2026. dir: dir isR
  2027. -/284: O: O567 (predict-yes)
  2028. I see 1 and I'm going to do: predict-yes
  2029. ENV: Agent did: predict-yes for direction R in state State-A
  2030. In State-A moving R
  2031. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2032. predict error 0
  2033. dir: dir isR
  2034. |\285: O: O570 (predict-no)
  2035. I see 1 and I'm going to do: predict-no
  2036. ENV: Agent did: predict-no for direction R in state State-B
  2037. In State-B moving R
  2038. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2039. predict error 0
  2040. dir: dir isU
  2041. -/|286: O: O572 (predict-no)
  2042. I see 1 and I'm going to do: predict-no
  2043. ENV: Agent did: predict-no for direction U in state State-B
  2044. In State-B moving U
  2045. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2046. predict error 0
  2047. dir: dir isU
  2048. \-/287: O: O574 (predict-no)
  2049. I see 1 and I'm going to do: predict-no
  2050. ENV: Agent did: predict-no for direction U in state State-B
  2051. In State-B moving U
  2052. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2053. predict error 0
  2054. dir: dir isR
  2055. |\288: O: O576 (predict-no)
  2056. I see 1 and I'm going to do: predict-no
  2057. ENV: Agent did: predict-no for direction R in state State-B
  2058. In State-B moving R
  2059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2060. predict error 0
  2061. dir: dir isU
  2062. -/289: O: O578 (predict-no)
  2063. I see 1 and I'm going to do: predict-no
  2064. ENV: Agent did: predict-no for direction U in state State-B
  2065. In State-B moving U
  2066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2067. predict error 0
  2068. dir: dir isU
  2069. |\290: O: O580 (predict-no)
  2070. I see 1 and I'm going to do: predict-no
  2071. ENV: Agent did: predict-no for direction U in state State-B
  2072. In State-B moving U
  2073. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2074. predict error 0
  2075. dir: dir isL
  2076. -/|291: O: O581 (predict-yes)
  2077. I see 1 and I'm going to do: predict-yes
  2078. ENV: Agent did: predict-yes for direction L in state State-B
  2079. In State-B moving L
  2080. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2081. predict error 0
  2082. dir: dir isR
  2083. \292: O: O583 (predict-yes)
  2084. I see 1 and I'm going to do: predict-yes
  2085. ENV: Agent did: predict-yes for direction R in state State-A
  2086. In State-A moving R
  2087. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2088. predict error 0
  2089. dir: dir isL
  2090. -/|293: O: O585 (predict-yes)
  2091. I see 1 and I'm going to do: predict-yes
  2092. ENV: Agent did: predict-yes for direction L in state State-B
  2093. In State-B moving L
  2094. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2095. predict error 0
  2096. dir: dir isU
  2097. \-/|294: O: O588 (predict-no)
  2098. I see 1 and I'm going to do: predict-no
  2099. ENV: Agent did: predict-no for direction U in state State-A
  2100. In State-A moving U
  2101. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2102. predict error 0
  2103. dir: dir isR
  2104. \-/295: O: O589 (predict-yes)
  2105. I see 1 and I'm going to do: predict-yes
  2106. ENV: Agent did: predict-yes for direction R in state State-A
  2107. In State-A moving R
  2108. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2109. predict error 0
  2110. dir: dir isU
  2111. |296: O: O591 (predict-yes)
  2112. I see 1 and I'm going to do: predict-yes
  2113. ENV: Agent did: predict-yes for direction U in state State-B
  2114. In State-B moving U
  2115. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2116. predict error 1
  2117. dir: dir isU
  2118. \297: O: O594 (predict-no)
  2119. I see 0 and I'm going to do: predict-no
  2120. ENV: Agent did: predict-no for direction U in state State-B
  2121. In State-B moving U
  2122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2123. predict error 0
  2124. dir: dir isU
  2125. -/|298: O: O596 (predict-no)
  2126. I see 1 and I'm going to do: predict-no
  2127. ENV: Agent did: predict-no for direction U in state State-B
  2128. In State-B moving U
  2129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2130. predict error 0
  2131. dir: dir isU
  2132. \-/299: O: O598 (predict-no)
  2133. I see 1 and I'm going to do: predict-no
  2134. ENV: Agent did: predict-no for direction U in state State-B
  2135. In State-B moving U
  2136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2137. predict error 0
  2138. dir: dir isR
  2139. |\-300: O: O600 (predict-no)
  2140. I see 1 and I'm going to do: predict-no
  2141. ENV: Agent did: predict-no for direction R in state State-B
  2142. In State-B moving R
  2143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2144. predict error 0
  2145. dir: dir isL
  2146. /|\-/301: O: O601 (predict-yes)
  2147. I see 1 and I'm going to do: predict-yes
  2148. ENV: Agent did: predict-yes for direction L in state State-B
  2149. In State-B moving L
  2150. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2151. predict error 0
  2152. dir: dir isR
  2153. |302: O: O603 (predict-yes)
  2154. I see 1 and I'm going to do: predict-yes
  2155. ENV: Agent did: predict-yes for direction R in state State-A
  2156. In State-A moving R
  2157. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2158. predict error 0
  2159. dir: dir isL
  2160. \-/303: O: O605 (predict-yes)
  2161. I see 1 and I'm going to do: predict-yes
  2162. ENV: Agent did: predict-yes for direction L in state State-B
  2163. In State-B moving L
  2164. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2165. predict error 0
  2166. dir: dir isL
  2167. |\304: O: O608 (predict-no)
  2168. I see 1 and I'm going to do: predict-no
  2169. ENV: Agent did: predict-no for direction L in state State-A
  2170. In State-A moving L
  2171. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2172. predict error 0
  2173. dir: dir isU
  2174. -/|305: O: O610 (predict-no)
  2175. I see 1 and I'm going to do: predict-no
  2176. ENV: Agent did: predict-no for direction U in state State-A
  2177. In State-A moving U
  2178. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2179. predict error 0
  2180. dir: dir isL
  2181. \-/306: O: O612 (predict-no)
  2182. I see 1 and I'm going to do: predict-no
  2183. ENV: Agent did: predict-no for direction L in state State-A
  2184. In State-A moving L
  2185. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2186. predict error 0
  2187. dir: dir isL
  2188. |\307: O: O614 (predict-no)
  2189. I see 1 and I'm going to do: predict-no
  2190. ENV: Agent did: predict-no for direction L in state State-A
  2191. In State-A moving L
  2192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2193. predict error 0
  2194. dir: dir isL
  2195. -/|308: O: O616 (predict-no)
  2196. I see 1 and I'm going to do: predict-no
  2197. ENV: Agent did: predict-no for direction L in state State-A
  2198. In State-A moving L
  2199. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2200. predict error 0
  2201. dir: dir isU
  2202. \-/309: O: O618 (predict-no)
  2203. I see 1 and I'm going to do: predict-no
  2204. ENV: Agent did: predict-no for direction U in state State-A
  2205. In State-A moving U
  2206. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2207. predict error 0
  2208. dir: dir isL
  2209. |\310: O: O620 (predict-no)
  2210. I see 1 and I'm going to do: predict-no
  2211. ENV: Agent did: predict-no for direction L in state State-A
  2212. In State-A moving L
  2213. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2214. predict error 0
  2215. dir: dir isL
  2216. -/|311: O: O622 (predict-no)
  2217. I see 1 and I'm going to do: predict-no
  2218. ENV: Agent did: predict-no for direction L in state State-A
  2219. In State-A moving L
  2220. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2221. predict error 0
  2222. dir: dir isU
  2223. \312: O: O624 (predict-no)
  2224. I see 1 and I'm going to do: predict-no
  2225. ENV: Agent did: predict-no for direction U in state State-A
  2226. In State-A moving U
  2227. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2228. predict error 0
  2229. dir: dir isL
  2230. -/|313: O: O626 (predict-no)
  2231. I see 1 and I'm going to do: predict-no
  2232. ENV: Agent did: predict-no for direction L in state State-A
  2233. In State-A moving L
  2234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2235. predict error 0
  2236. dir: dir isU
  2237. \-/314: O: O628 (predict-no)
  2238. I see 1 and I'm going to do: predict-no
  2239. ENV: Agent did: predict-no for direction U in state State-A
  2240. In State-A moving U
  2241. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2242. predict error 0
  2243. dir: dir isU
  2244. |\-315: O: O630 (predict-no)
  2245. I see 1 and I'm going to do: predict-no
  2246. ENV: Agent did: predict-no for direction U in state State-A
  2247. In State-A moving U
  2248. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2249. predict error 0
  2250. dir: dir isU
  2251. /|\316: O: O632 (predict-no)
  2252. I see 1 and I'm going to do: predict-no
  2253. ENV: Agent did: predict-no for direction U in state State-A
  2254. In State-A moving U
  2255. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2256. predict error 0
  2257. dir: dir isU
  2258. -/|317: O: O634 (predict-no)
  2259. I see 1 and I'm going to do: predict-no
  2260. ENV: Agent did: predict-no for direction U in state State-A
  2261. In State-A moving U
  2262. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2263. predict error 0
  2264. dir: dir isR
  2265. \-/318: O: O635 (predict-yes)
  2266. I see 1 and I'm going to do: predict-yes
  2267. ENV: Agent did: predict-yes for direction R in state State-A
  2268. In State-A moving R
  2269. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2270. predict error 0
  2271. dir: dir isU
  2272. |\-319: O: O638 (predict-no)
  2273. I see 1 and I'm going to do: predict-no
  2274. ENV: Agent did: predict-no for direction U in state State-B
  2275. In State-B moving U
  2276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2277. predict error 0
  2278. dir: dir isL
  2279. /|\320: O: O639 (predict-yes)
  2280. I see 1 and I'm going to do: predict-yes
  2281. ENV: Agent did: predict-yes for direction L in state State-B
  2282. In State-B moving L
  2283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2284. predict error 0
  2285. dir: dir isU
  2286. -321: O: O642 (predict-no)
  2287. I see 1 and I'm going to do: predict-no
  2288. ENV: Agent did: predict-no for direction U in state State-A
  2289. In State-A moving U
  2290. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2291. predict error 0
  2292. dir: dir isR
  2293. /322: O: O643 (predict-yes)
  2294. I see 1 and I'm going to do: predict-yes
  2295. ENV: Agent did: predict-yes for direction R in state State-A
  2296. In State-A moving R
  2297. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2298. predict error 0
  2299. dir: dir isU
  2300. |\-323: O: O646 (predict-no)
  2301. I see 1 and I'm going to do: predict-no
  2302. ENV: Agent did: predict-no for direction U in state State-B
  2303. In State-B moving U
  2304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2305. predict error 0
  2306. dir: dir isL
  2307. /|\324: O: O647 (predict-yes)
  2308. I see 1 and I'm going to do: predict-yes
  2309. ENV: Agent did: predict-yes for direction L in state State-B
  2310. In State-B moving L
  2311. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2312. predict error 0
  2313. dir: dir isU
  2314. -/325: O: O650 (predict-no)
  2315. I see 1 and I'm going to do: predict-no
  2316. ENV: Agent did: predict-no for direction U in state State-A
  2317. In State-A moving U
  2318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2319. predict error 0
  2320. dir: dir isU
  2321. |\326: O: O652 (predict-no)
  2322. I see 1 and I'm going to do: predict-no
  2323. ENV: Agent did: predict-no for direction U in state State-A
  2324. In State-A moving U
  2325. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2326. predict error 0
  2327. dir: dir isL
  2328. -/327: O: O654 (predict-no)
  2329. I see 1 and I'm going to do: predict-no
  2330. ENV: Agent did: predict-no for direction L in state State-A
  2331. In State-A moving L
  2332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2333. predict error 0
  2334. dir: dir isL
  2335. |\-/sleeping...
  2336. |328: O: O656 (predict-no)
  2337. I see 1 and I'm going to do: predict-no
  2338. ENV: Agent did: predict-no for direction L in state State-A
  2339. In State-A moving L
  2340. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2341. predict error 0
  2342. dir: dir isR
  2343. \-329: O: O657 (predict-yes)
  2344. I see 1 and I'm going to do: predict-yes
  2345. ENV: Agent did: predict-yes for direction R in state State-A
  2346. In State-A moving R
  2347. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2348. predict error 0
  2349. dir: dir isU
  2350. /|\330: O: O660 (predict-no)
  2351. I see 1 and I'm going to do: predict-no
  2352. ENV: Agent did: predict-no for direction U in state State-B
  2353. In State-B moving U
  2354. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2355. predict error 0
  2356. dir: dir isL
  2357. -/331: O: O661 (predict-yes)
  2358. I see 1 and I'm going to do: predict-yes
  2359. ENV: Agent did: predict-yes for direction L in state State-B
  2360. In State-B moving L
  2361. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2362. predict error 0
  2363. dir: dir isL
  2364. |332: O: O664 (predict-no)
  2365. I see 1 and I'm going to do: predict-no
  2366. ENV: Agent did: predict-no for direction L in state State-A
  2367. In State-A moving L
  2368. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2369. predict error 0
  2370. dir: dir isL
  2371. \-/333: O: O666 (predict-no)
  2372. I see 1 and I'm going to do: predict-no
  2373. ENV: Agent did: predict-no for direction L in state State-A
  2374. In State-A moving L
  2375. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2376. predict error 0
  2377. dir: dir isR
  2378. |\-334: O: O667 (predict-yes)
  2379. I see 1 and I'm going to do: predict-yes
  2380. ENV: Agent did: predict-yes for direction R in state State-A
  2381. In State-A moving R
  2382. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2383. predict error 0
  2384. dir: dir isU
  2385. /|335: O: O669 (predict-yes)
  2386. I see 1 and I'm going to do: predict-yes
  2387. ENV: Agent did: predict-yes for direction U in state State-B
  2388. In State-B moving U
  2389. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2390. predict error 1
  2391. dir: dir isR
  2392. \-/336: O: O672 (predict-no)
  2393. I see 0 and I'm going to do: predict-no
  2394. ENV: Agent did: predict-no for direction R in state State-B
  2395. In State-B moving R
  2396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2397. predict error 0
  2398. dir: dir isR
  2399. |\-337: O: O674 (predict-no)
  2400. I see 1 and I'm going to do: predict-no
  2401. ENV: Agent did: predict-no for direction R in state State-B
  2402. In State-B moving R
  2403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2404. predict error 0
  2405. dir: dir isR
  2406. /|\338: O: O676 (predict-no)
  2407. I see 1 and I'm going to do: predict-no
  2408. ENV: Agent did: predict-no for direction R in state State-B
  2409. In State-B moving R
  2410. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2411. predict error 0
  2412. dir: dir isR
  2413. -/339: O: O678 (predict-no)
  2414. I see 1 and I'm going to do: predict-no
  2415. ENV: Agent did: predict-no for direction R in state State-B
  2416. In State-B moving R
  2417. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2418. predict error 0
  2419. dir: dir isR
  2420. |\-340: O: O679 (predict-yes)
  2421. I see 1 and I'm going to do: predict-yes
  2422. ENV: Agent did: predict-yes for direction R in state State-B
  2423. In State-B moving R
  2424. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2425. predict error 1
  2426. dir: dir isL
  2427. /|341: O: O681 (predict-yes)
  2428. I see 0 and I'm going to do: predict-yes
  2429. ENV: Agent did: predict-yes for direction L in state State-B
  2430. In State-B moving L
  2431. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2432. predict error 0
  2433. dir: dir isR
  2434. \342: O: O683 (predict-yes)
  2435. I see 1 and I'm going to do: predict-yes
  2436. ENV: Agent did: predict-yes for direction R in state State-A
  2437. In State-A moving R
  2438. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2439. predict error 0
  2440. dir: dir isU
  2441. -/|343: O: O686 (predict-no)
  2442. I see 1 and I'm going to do: predict-no
  2443. ENV: Agent did: predict-no for direction U in state State-B
  2444. In State-B moving U
  2445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2446. predict error 0
  2447. dir: dir isR
  2448. \-/344: O: O687 (predict-yes)
  2449. I see 1 and I'm going to do: predict-yes
  2450. ENV: Agent did: predict-yes for direction R in state State-B
  2451. In State-B moving R
  2452. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2453. predict error 1
  2454. dir: dir isU
  2455. |\-345: O: O690 (predict-no)
  2456. I see 0 and I'm going to do: predict-no
  2457. ENV: Agent did: predict-no for direction U in state State-B
  2458. In State-B moving U
  2459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2460. predict error 0
  2461. dir: dir isL
  2462. /|346: O: O691 (predict-yes)
  2463. I see 1 and I'm going to do: predict-yes
  2464. ENV: Agent did: predict-yes for direction L in state State-B
  2465. In State-B moving L
  2466. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2467. predict error 0
  2468. dir: dir isU
  2469. \-/347: O: O694 (predict-no)
  2470. I see 1 and I'm going to do: predict-no
  2471. ENV: Agent did: predict-no for direction U in state State-A
  2472. In State-A moving U
  2473. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2474. predict error 0
  2475. dir: dir isU
  2476. |\-348: O: O696 (predict-no)
  2477. I see 1 and I'm going to do: predict-no
  2478. ENV: Agent did: predict-no for direction U in state State-A
  2479. In State-A moving U
  2480. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2481. predict error 0
  2482. dir: dir isU
  2483. /|\349: O: O698 (predict-no)
  2484. I see 1 and I'm going to do: predict-no
  2485. ENV: Agent did: predict-no for direction U in state State-A
  2486. In State-A moving U
  2487. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2488. predict error 0
  2489. dir: dir isR
  2490. -/|350: O: O699 (predict-yes)
  2491. I see 1 and I'm going to do: predict-yes
  2492. ENV: Agent did: predict-yes for direction R in state State-A
  2493. In State-A moving R
  2494. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2495. predict error 0
  2496. dir: dir isR
  2497. \-/351: O: O702 (predict-no)
  2498. I see 1 and I'm going to do: predict-no
  2499. ENV: Agent did: predict-no for direction R in state State-B
  2500. In State-B moving R
  2501. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2502. predict error 0
  2503. dir: dir isR
  2504. |352: O: O704 (predict-no)
  2505. I see 1 and I'm going to do: predict-no
  2506. ENV: Agent did: predict-no for direction R in state State-B
  2507. In State-B moving R
  2508. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2509. predict error 0
  2510. dir: dir isU
  2511. \-353: O: O706 (predict-no)
  2512. I see 1 and I'm going to do: predict-no
  2513. ENV: Agent did: predict-no for direction U in state State-B
  2514. In State-B moving U
  2515. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2516. predict error 0
  2517. dir: dir isL
  2518. /|\354: O: O707 (predict-yes)
  2519. I see 1 and I'm going to do: predict-yes
  2520. ENV: Agent did: predict-yes for direction L in state State-B
  2521. In State-B moving L
  2522. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2523. predict error 0
  2524. dir: dir isL
  2525. -/|355: O: O710 (predict-no)
  2526. I see 1 and I'm going to do: predict-no
  2527. ENV: Agent did: predict-no for direction L in state State-A
  2528. In State-A moving L
  2529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2530. predict error 0
  2531. dir: dir isU
  2532. \-/356: O: O712 (predict-no)
  2533. I see 1 and I'm going to do: predict-no
  2534. ENV: Agent did: predict-no for direction U in state State-A
  2535. In State-A moving U
  2536. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2537. predict error 0
  2538. dir: dir isU
  2539. |\-357: O: O714 (predict-no)
  2540. I see 1 and I'm going to do: predict-no
  2541. ENV: Agent did: predict-no for direction U in state State-A
  2542. In State-A moving U
  2543. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2544. predict error 0
  2545. dir: dir isR
  2546. /|\358: O: O715 (predict-yes)
  2547. I see 1 and I'm going to do: predict-yes
  2548. ENV: Agent did: predict-yes for direction R in state State-A
  2549. In State-A moving R
  2550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2551. predict error 0
  2552. dir: dir isR
  2553. -/|359: O: O718 (predict-no)
  2554. I see 1 and I'm going to do: predict-no
  2555. ENV: Agent did: predict-no for direction R in state State-B
  2556. In State-B moving R
  2557. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2558. predict error 0
  2559. dir: dir isU
  2560. \-/360: O: O720 (predict-no)
  2561. I see 1 and I'm going to do: predict-no
  2562. ENV: Agent did: predict-no for direction U in state State-B
  2563. In State-B moving U
  2564. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2565. predict error 0
  2566. dir: dir isL
  2567. |\361: O: O721 (predict-yes)
  2568. I see 1 and I'm going to do: predict-yes
  2569. ENV: Agent did: predict-yes for direction L in state State-B
  2570. In State-B moving L
  2571. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2572. predict error 0
  2573. dir: dir isL
  2574. -362: O: O724 (predict-no)
  2575. I see 1 and I'm going to do: predict-no
  2576. ENV: Agent did: predict-no for direction L in state State-A
  2577. In State-A moving L
  2578. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2579. predict error 0
  2580. dir: dir isL
  2581. /|\363: O: O726 (predict-no)
  2582. I see 1 and I'm going to do: predict-no
  2583. ENV: Agent did: predict-no for direction L in state State-A
  2584. In State-A moving L
  2585. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2586. predict error 0
  2587. dir: dir isR
  2588. -/|364: O: O727 (predict-yes)
  2589. I see 1 and I'm going to do: predict-yes
  2590. ENV: Agent did: predict-yes for direction R in state State-A
  2591. In State-A moving R
  2592. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2593. predict error 0
  2594. dir: dir isL
  2595. \-/365: O: O729 (predict-yes)
  2596. I see 1 and I'm going to do: predict-yes
  2597. ENV: Agent did: predict-yes for direction L in state State-B
  2598. In State-B moving L
  2599. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2600. predict error 0
  2601. dir: dir isL
  2602. |366: O: O732 (predict-no)
  2603. I see 1 and I'm going to do: predict-no
  2604. ENV: Agent did: predict-no for direction L in state State-A
  2605. In State-A moving L
  2606. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2607. predict error 0
  2608. dir: dir isU
  2609. \-/367: O: O734 (predict-no)
  2610. I see 1 and I'm going to do: predict-no
  2611. ENV: Agent did: predict-no for direction U in state State-A
  2612. In State-A moving U
  2613. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2614. predict error 0
  2615. dir: dir isU
  2616. |\-368: O: O735 (predict-yes)
  2617. I see 1 and I'm going to do: predict-yes
  2618. ENV: Agent did: predict-yes for direction U in state State-A
  2619. In State-A moving U
  2620. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2621. predict error 1
  2622. dir: dir isL
  2623. /|\369: O: O738 (predict-no)
  2624. I see 0 and I'm going to do: predict-no
  2625. ENV: Agent did: predict-no for direction L in state State-A
  2626. In State-A moving L
  2627. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2628. predict error 0
  2629. dir: dir isU
  2630. -/|370: O: O740 (predict-no)
  2631. I see 1 and I'm going to do: predict-no
  2632. ENV: Agent did: predict-no for direction U in state State-A
  2633. In State-A moving U
  2634. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2635. predict error 0
  2636. dir: dir isR
  2637. \-371: O: O741 (predict-yes)
  2638. I see 1 and I'm going to do: predict-yes
  2639. ENV: Agent did: predict-yes for direction R in state State-A
  2640. In State-A moving R
  2641. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2642. predict error 0
  2643. dir: dir isU
  2644. /372: O: O744 (predict-no)
  2645. I see 1 and I'm going to do: predict-no
  2646. ENV: Agent did: predict-no for direction U in state State-B
  2647. In State-B moving U
  2648. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2649. predict error 0
  2650. dir: dir isR
  2651. |\-373: O: O745 (predict-yes)
  2652. I see 1 and I'm going to do: predict-yes
  2653. ENV: Agent did: predict-yes for direction R in state State-B
  2654. In State-B moving R
  2655. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2656. predict error 1
  2657. dir: dir isL
  2658. /|374: O: O747 (predict-yes)
  2659. I see 0 and I'm going to do: predict-yes
  2660. ENV: Agent did: predict-yes for direction L in state State-B
  2661. In State-B moving L
  2662. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2663. predict error 0
  2664. dir: dir isL
  2665. \-/375: O: O750 (predict-no)
  2666. I see 1 and I'm going to do: predict-no
  2667. ENV: Agent did: predict-no for direction L in state State-A
  2668. In State-A moving L
  2669. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2670. predict error 0
  2671. dir: dir isL
  2672. |\-376: O: O752 (predict-no)
  2673. I see 1 and I'm going to do: predict-no
  2674. ENV: Agent did: predict-no for direction L in state State-A
  2675. In State-A moving L
  2676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2677. predict error 0
  2678. dir: dir isR
  2679. /|\377: O: O753 (predict-yes)
  2680. I see 1 and I'm going to do: predict-yes
  2681. ENV: Agent did: predict-yes for direction R in state State-A
  2682. In State-A moving R
  2683. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2684. predict error 0
  2685. dir: dir isR
  2686. -/|378: O: O756 (predict-no)
  2687. I see 1 and I'm going to do: predict-no
  2688. ENV: Agent did: predict-no for direction R in state State-B
  2689. In State-B moving R
  2690. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2691. predict error 0
  2692. dir: dir isL
  2693. \-/379: O: O757 (predict-yes)
  2694. I see 1 and I'm going to do: predict-yes
  2695. ENV: Agent did: predict-yes for direction L in state State-B
  2696. In State-B moving L
  2697. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2698. predict error 0
  2699. dir: dir isR
  2700. |\380: O: O759 (predict-yes)
  2701. I see 1 and I'm going to do: predict-yes
  2702. ENV: Agent did: predict-yes for direction R in state State-A
  2703. In State-A moving R
  2704. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2705. predict error 0
  2706. dir: dir isL
  2707. -/381: O: O761 (predict-yes)
  2708. I see 1 and I'm going to do: predict-yes
  2709. ENV: Agent did: predict-yes for direction L in state State-B
  2710. In State-B moving L
  2711. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2712. predict error 0
  2713. dir: dir isU
  2714. |382: O: O764 (predict-no)
  2715. I see 1 and I'm going to do: predict-no
  2716. ENV: Agent did: predict-no for direction U in state State-A
  2717. In State-A moving U
  2718. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2719. predict error 0
  2720. dir: dir isL
  2721. \-383: O: O766 (predict-no)
  2722. I see 1 and I'm going to do: predict-no
  2723. ENV: Agent did: predict-no for direction L in state State-A
  2724. In State-A moving L
  2725. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2726. predict error 0
  2727. dir: dir isU
  2728. /|384: O: O768 (predict-no)
  2729. I see 1 and I'm going to do: predict-no
  2730. ENV: Agent did: predict-no for direction U in state State-A
  2731. In State-A moving U
  2732. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2733. predict error 0
  2734. dir: dir isR
  2735. \-385: O: O769 (predict-yes)
  2736. I see 1 and I'm going to do: predict-yes
  2737. ENV: Agent did: predict-yes for direction R in state State-A
  2738. In State-A moving R
  2739. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2740. predict error 0
  2741. dir: dir isR
  2742. /|386: O: O772 (predict-no)
  2743. I see 1 and I'm going to do: predict-no
  2744. ENV: Agent did: predict-no for direction R in state State-B
  2745. In State-B moving R
  2746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2747. predict error 0
  2748. dir: dir isR
  2749. \-387: O: O774 (predict-no)
  2750. I see 1 and I'm going to do: predict-no
  2751. ENV: Agent did: predict-no for direction R in state State-B
  2752. In State-B moving R
  2753. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2754. predict error 0
  2755. dir: dir isU
  2756. /|\-388: O: O776 (predict-no)
  2757. I see 1 and I'm going to do: predict-no
  2758. ENV: Agent did: predict-no for direction U in state State-B
  2759. In State-B moving U
  2760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2761. predict error 0
  2762. dir: dir isU
  2763. /|\389: O: O778 (predict-no)
  2764. I see 1 and I'm going to do: predict-no
  2765. ENV: Agent did: predict-no for direction U in state State-B
  2766. In State-B moving U
  2767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2768. predict error 0
  2769. dir: dir isR
  2770. -/|390: O: O780 (predict-no)
  2771. I see 1 and I'm going to do: predict-no
  2772. ENV: Agent did: predict-no for direction R in state State-B
  2773. In State-B moving R
  2774. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2775. predict error 0
  2776. dir: dir isR
  2777. \-391: O: O782 (predict-no)
  2778. I see 1 and I'm going to do: predict-no
  2779. ENV: Agent did: predict-no for direction R in state State-B
  2780. In State-B moving R
  2781. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2782. predict error 0
  2783. dir: dir isU
  2784. /392: O: O784 (predict-no)
  2785. I see 1 and I'm going to do: predict-no
  2786. ENV: Agent did: predict-no for direction U in state State-B
  2787. In State-B moving U
  2788. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2789. predict error 0
  2790. dir: dir isU
  2791. |\393: O: O786 (predict-no)
  2792. I see 1 and I'm going to do: predict-no
  2793. ENV: Agent did: predict-no for direction U in state State-B
  2794. In State-B moving U
  2795. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2796. predict error 0
  2797. dir: dir isR
  2798. -/394: O: O788 (predict-no)
  2799. I see 1 and I'm going to do: predict-no
  2800. ENV: Agent did: predict-no for direction R in state State-B
  2801. In State-B moving R
  2802. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2803. predict error 0
  2804. dir: dir isU
  2805. |\-395: O: O790 (predict-no)
  2806. I see 1 and I'm going to do: predict-no
  2807. ENV: Agent did: predict-no for direction U in state State-B
  2808. In State-B moving U
  2809. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2810. predict error 0
  2811. dir: dir isU
  2812. /|396: O: O792 (predict-no)
  2813. I see 1 and I'm going to do: predict-no
  2814. ENV: Agent did: predict-no for direction U in state State-B
  2815. In State-B moving U
  2816. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2817. predict error 0
  2818. dir: dir isR
  2819. \-/397: O: O794 (predict-no)
  2820. I see 1 and I'm going to do: predict-no
  2821. ENV: Agent did: predict-no for direction R in state State-B
  2822. In State-B moving R
  2823. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2824. predict error 0
  2825. dir: dir isL
  2826. |\-398: O: O795 (predict-yes)
  2827. I see 1 and I'm going to do: predict-yes
  2828. ENV: Agent did: predict-yes for direction L in state State-B
  2829. In State-B moving L
  2830. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2831. predict error 0
  2832. dir: dir isL
  2833. /|399: O: O798 (predict-no)
  2834. I see 1 and I'm going to do: predict-no
  2835. ENV: Agent did: predict-no for direction L in state State-A
  2836. In State-A moving L
  2837. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2838. predict error 0
  2839. dir: dir isU
  2840. \-/400: O: O800 (predict-no)
  2841. I see 1 and I'm going to do: predict-no
  2842. ENV: Agent did: predict-no for direction U in state State-A
  2843. In State-A moving U
  2844. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2845. predict error 0
  2846. dir: dir isU
  2847. |\-401: O: O802 (predict-no)
  2848. I see 1 and I'm going to do: predict-no
  2849. ENV: Agent did: predict-no for direction U in state State-A
  2850. In State-A moving U
  2851. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2852. predict error 0
  2853. dir: dir isU
  2854. /402: O: O804 (predict-no)
  2855. I see 1 and I'm going to do: predict-no
  2856. ENV: Agent did: predict-no for direction U in state State-A
  2857. In State-A moving U
  2858. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2859. predict error 0
  2860. dir: dir isU
  2861. |\-403: O: O806 (predict-no)
  2862. I see 1 and I'm going to do: predict-no
  2863. ENV: Agent did: predict-no for direction U in state State-A
  2864. In State-A moving U
  2865. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2866. predict error 0
  2867. dir: dir isL
  2868. /|\404: O: O808 (predict-no)
  2869. I see 1 and I'm going to do: predict-no
  2870. ENV: Agent did: predict-no for direction L in state State-A
  2871. In State-A moving L
  2872. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2873. predict error 0
  2874. dir: dir isL
  2875. -/|405: O: O810 (predict-no)
  2876. I see 1 and I'm going to do: predict-no
  2877. ENV: Agent did: predict-no for direction L in state State-A
  2878. In State-A moving L
  2879. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2880. predict error 0
  2881. dir: dir isL
  2882. \-/406: O: O812 (predict-no)
  2883. I see 1 and I'm going to do: predict-no
  2884. ENV: Agent did: predict-no for direction L in state State-A
  2885. In State-A moving L
  2886. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2887. predict error 0
  2888. dir: dir isL
  2889. |\-407: O: O814 (predict-no)
  2890. I see 1 and I'm going to do: predict-no
  2891. ENV: Agent did: predict-no for direction L in state State-A
  2892. In State-A moving L
  2893. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2894. predict error 0
  2895. dir: dir isU
  2896. /|\408: O: O816 (predict-no)
  2897. I see 1 and I'm going to do: predict-no
  2898. ENV: Agent did: predict-no for direction U in state State-A
  2899. In State-A moving U
  2900. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2901. predict error 0
  2902. dir: dir isU
  2903. -/|409: O: O818 (predict-no)
  2904. I see 1 and I'm going to do: predict-no
  2905. ENV: Agent did: predict-no for direction U in state State-A
  2906. In State-A moving U
  2907. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2908. predict error 0
  2909. dir: dir isU
  2910. \-/410: O: O820 (predict-no)
  2911. I see 1 and I'm going to do: predict-no
  2912. ENV: Agent did: predict-no for direction U in state State-A
  2913. In State-A moving U
  2914. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2915. predict error 0
  2916. dir: dir isL
  2917. |\411: O: O822 (predict-no)
  2918. I see 1 and I'm going to do: predict-no
  2919. ENV: Agent did: predict-no for direction L in state State-A
  2920. In State-A moving L
  2921. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2922. predict error 0
  2923. dir: dir isU
  2924. -412: O: O824 (predict-no)
  2925. I see 1 and I'm going to do: predict-no
  2926. ENV: Agent did: predict-no for direction U in state State-A
  2927. In State-A moving U
  2928. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2929. predict error 0
  2930. dir: dir isL
  2931. /|413: O: O826 (predict-no)
  2932. I see 1 and I'm going to do: predict-no
  2933. ENV: Agent did: predict-no for direction L in state State-A
  2934. In State-A moving L
  2935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2936. predict error 0
  2937. dir: dir isR
  2938. \-/414: O: O827 (predict-yes)
  2939. I see 1 and I'm going to do: predict-yes
  2940. ENV: Agent did: predict-yes for direction R in state State-A
  2941. In State-A moving R
  2942. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2943. predict error 0
  2944. dir: dir isU
  2945. |\-415: O: O829 (predict-yes)
  2946. I see 1 and I'm going to do: predict-yes
  2947. ENV: Agent did: predict-yes for direction U in state State-B
  2948. In State-B moving U
  2949. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2950. predict error 1
  2951. dir: dir isL
  2952. /|416: O: O831 (predict-yes)
  2953. I see 0 and I'm going to do: predict-yes
  2954. ENV: Agent did: predict-yes for direction L in state State-B
  2955. In State-B moving L
  2956. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2957. predict error 0
  2958. dir: dir isL
  2959. \-417: O: O834 (predict-no)
  2960. I see 1 and I'm going to do: predict-no
  2961. ENV: Agent did: predict-no for direction L in state State-A
  2962. In State-A moving L
  2963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2964. predict error 0
  2965. dir: dir isU
  2966. /|\418: O: O835 (predict-yes)
  2967. I see 1 and I'm going to do: predict-yes
  2968. ENV: Agent did: predict-yes for direction U in state State-A
  2969. In State-A moving U
  2970. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2971. predict error 1
  2972. dir: dir isL
  2973. -/419: O: O838 (predict-no)
  2974. I see 0 and I'm going to do: predict-no
  2975. ENV: Agent did: predict-no for direction L in state State-A
  2976. In State-A moving L
  2977. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2978. predict error 0
  2979. dir: dir isR
  2980. |\-420: O: O839 (predict-yes)
  2981. I see 1 and I'm going to do: predict-yes
  2982. ENV: Agent did: predict-yes for direction R in state State-A
  2983. In State-A moving R
  2984. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2985. predict error 0
  2986. dir: dir isR
  2987. /421: O: O842 (predict-no)
  2988. I see 1 and I'm going to do: predict-no
  2989. ENV: Agent did: predict-no for direction R in state State-B
  2990. In State-B moving R
  2991. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2992. predict error 0
  2993. dir: dir isU
  2994. |422: O: O844 (predict-no)
  2995. I see 1 and I'm going to do: predict-no
  2996. ENV: Agent did: predict-no for direction U in state State-B
  2997. In State-B moving U
  2998. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2999. predict error 0
  3000. dir: dir isL
  3001. \-423: O: O845 (predict-yes)
  3002. I see 1 and I'm going to do: predict-yes
  3003. ENV: Agent did: predict-yes for direction L in state State-B
  3004. In State-B moving L
  3005. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3006. predict error 0
  3007. dir: dir isR
  3008. /424: O: O847 (predict-yes)
  3009. I see 1 and I'm going to do: predict-yes
  3010. ENV: Agent did: predict-yes for direction R in state State-A
  3011. In State-A moving R
  3012. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3013. predict error 0
  3014. dir: dir isR
  3015. |\-425: O: O850 (predict-no)
  3016. I see 1 and I'm going to do: predict-no
  3017. ENV: Agent did: predict-no for direction R in state State-B
  3018. In State-B moving R
  3019. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3020. predict error 0
  3021. dir: dir isR
  3022. /|426: O: O852 (predict-no)
  3023. I see 1 and I'm going to do: predict-no
  3024. ENV: Agent did: predict-no for direction R in state State-B
  3025. In State-B moving R
  3026. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3027. predict error 0
  3028. dir: dir isL
  3029. \-/427: O: O853 (predict-yes)
  3030. I see 1 and I'm going to do: predict-yes
  3031. ENV: Agent did: predict-yes for direction L in state State-B
  3032. In State-B moving L
  3033. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3034. predict error 0
  3035. dir: dir isR
  3036. |\-428: O: O855 (predict-yes)
  3037. I see 1 and I'm going to do: predict-yes
  3038. ENV: Agent did: predict-yes for direction R in state State-A
  3039. In State-A moving R
  3040. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3041. predict error 0
  3042. dir: dir isU
  3043. /|\429: O: O858 (predict-no)
  3044. I see 1 and I'm going to do: predict-no
  3045. ENV: Agent did: predict-no for direction U in state State-B
  3046. In State-B moving U
  3047. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3048. predict error 0
  3049. dir: dir isR
  3050. -/430: O: O860 (predict-no)
  3051. I see 1 and I'm going to do: predict-no
  3052. ENV: Agent did: predict-no for direction R in state State-B
  3053. In State-B moving R
  3054. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3055. predict error 0
  3056. dir: dir isR
  3057. |\431: O: O862 (predict-no)
  3058. I see 1 and I'm going to do: predict-no
  3059. ENV: Agent did: predict-no for direction R in state State-B
  3060. In State-B moving R
  3061. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3062. predict error 0
  3063. dir: dir isL
  3064. -432: O: O863 (predict-yes)
  3065. I see 1 and I'm going to do: predict-yes
  3066. ENV: Agent did: predict-yes for direction L in state State-B
  3067. In State-B moving L
  3068. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3069. predict error 0
  3070. dir: dir isL
  3071. /|\433: O: O866 (predict-no)
  3072. I see 1 and I'm going to do: predict-no
  3073. ENV: Agent did: predict-no for direction L in state State-A
  3074. In State-A moving L
  3075. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3076. predict error 0
  3077. dir: dir isR
  3078. -/434: O: O867 (predict-yes)
  3079. I see 1 and I'm going to do: predict-yes
  3080. ENV: Agent did: predict-yes for direction R in state State-A
  3081. In State-A moving R
  3082. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3083. predict error 0
  3084. dir: dir isL
  3085. |\-435: O: O869 (predict-yes)
  3086. I see 1 and I'm going to do: predict-yes
  3087. ENV: Agent did: predict-yes for direction L in state State-B
  3088. In State-B moving L
  3089. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3090. predict error 0
  3091. dir: dir isU
  3092. /|\436: O: O872 (predict-no)
  3093. I see 1 and I'm going to do: predict-no
  3094. ENV: Agent did: predict-no for direction U in state State-A
  3095. In State-A moving U
  3096. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3097. predict error 0
  3098. dir: dir isU
  3099. -/|437: O: O874 (predict-no)
  3100. I see 1 and I'm going to do: predict-no
  3101. ENV: Agent did: predict-no for direction U in state State-A
  3102. In State-A moving U
  3103. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3104. predict error 0
  3105. dir: dir isL
  3106. \-/438: O: O876 (predict-no)
  3107. I see 1 and I'm going to do: predict-no
  3108. ENV: Agent did: predict-no for direction L in state State-A
  3109. In State-A moving L
  3110. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3111. predict error 0
  3112. dir: dir isU
  3113. |\-439: O: O878 (predict-no)
  3114. I see 1 and I'm going to do: predict-no
  3115. ENV: Agent did: predict-no for direction U in state State-A
  3116. In State-A moving U
  3117. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3118. predict error 0
  3119. dir: dir isR
  3120. /|\440: O: O879 (predict-yes)
  3121. I see 1 and I'm going to do: predict-yes
  3122. ENV: Agent did: predict-yes for direction R in state State-A
  3123. In State-A moving R
  3124. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3125. predict error 0
  3126. dir: dir isU
  3127. -/|441: O: O882 (predict-no)
  3128. I see 1 and I'm going to do: predict-no
  3129. ENV: Agent did: predict-no for direction U in state State-B
  3130. In State-B moving U
  3131. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3132. predict error 0
  3133. dir: dir isU
  3134. \442: O: O884 (predict-no)
  3135. I see 1 and I'm going to do: predict-no
  3136. ENV: Agent did: predict-no for direction U in state State-B
  3137. In State-B moving U
  3138. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3139. predict error 0
  3140. dir: dir isL
  3141. -/|443: O: O885 (predict-yes)
  3142. I see 1 and I'm going to do: predict-yes
  3143. ENV: Agent did: predict-yes for direction L in state State-B
  3144. In State-B moving L
  3145. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3146. predict error 0
  3147. dir: dir isU
  3148. \-444: O: O888 (predict-no)
  3149. I see 1 and I'm going to do: predict-no
  3150. ENV: Agent did: predict-no for direction U in state State-A
  3151. In State-A moving U
  3152. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3153. predict error 0
  3154. dir: dir isR
  3155. /445: O: O889 (predict-yes)
  3156. I see 1 and I'm going to do: predict-yes
  3157. ENV: Agent did: predict-yes for direction R in state State-A
  3158. In State-A moving R
  3159. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3160. predict error 0
  3161. dir: dir isU
  3162. |\-446: O: O892 (predict-no)
  3163. I see 1 and I'm going to do: predict-no
  3164. ENV: Agent did: predict-no for direction U in state State-B
  3165. In State-B moving U
  3166. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3167. predict error 0
  3168. dir: dir isL
  3169. /|447: O: O893 (predict-yes)
  3170. I see 1 and I'm going to do: predict-yes
  3171. ENV: Agent did: predict-yes for direction L in state State-B
  3172. In State-B moving L
  3173. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3174. predict error 0
  3175. dir: dir isU
  3176. \-/448: O: O896 (predict-no)
  3177. I see 1 and I'm going to do: predict-no
  3178. ENV: Agent did: predict-no for direction U in state State-A
  3179. In State-A moving U
  3180. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3181. predict error 0
  3182. dir: dir isU
  3183. |\-449: O: O898 (predict-no)
  3184. I see 1 and I'm going to do: predict-no
  3185. ENV: Agent did: predict-no for direction U in state State-A
  3186. In State-A moving U
  3187. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3188. predict error 0
  3189. dir: dir isU
  3190. /|\450: O: O900 (predict-no)
  3191. I see 1 and I'm going to do: predict-no
  3192. ENV: Agent did: predict-no for direction U in state State-A
  3193. In State-A moving U
  3194. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3195. predict error 0
  3196. dir: dir isR
  3197. -451: O: O901 (predict-yes)
  3198. I see 1 and I'm going to do: predict-yes
  3199. ENV: Agent did: predict-yes for direction R in state State-A
  3200. In State-A moving R
  3201. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3202. predict error 0
  3203. dir: dir isL
  3204. /452: O: O903 (predict-yes)
  3205. I see 1 and I'm going to do: predict-yes
  3206. ENV: Agent did: predict-yes for direction L in state State-B
  3207. In State-B moving L
  3208. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3209. predict error 0
  3210. dir: dir isR
  3211. |\-/sleeping...
  3212. |453: O: O905 (predict-yes)
  3213. I see 1 and I'm going to do: predict-yes
  3214. ENV: Agent did: predict-yes for direction R in state State-A
  3215. In State-A moving R
  3216. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3217. predict error 0
  3218. dir: dir isR
  3219. \-/454: O: O908 (predict-no)
  3220. I see 1 and I'm going to do: predict-no
  3221. ENV: Agent did: predict-no for direction R in state State-B
  3222. In State-B moving R
  3223. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3224. predict error 0
  3225. dir: dir isU
  3226. |\-455: O: O910 (predict-no)
  3227. I see 1 and I'm going to do: predict-no
  3228. ENV: Agent did: predict-no for direction U in state State-B
  3229. In State-B moving U
  3230. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3231. predict error 0
  3232. dir: dir isU
  3233. /|\456: O: O912 (predict-no)
  3234. I see 1 and I'm going to do: predict-no
  3235. ENV: Agent did: predict-no for direction U in state State-B
  3236. In State-B moving U
  3237. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3238. predict error 0
  3239. dir: dir isR
  3240. -/|457: O: O914 (predict-no)
  3241. I see 1 and I'm going to do: predict-no
  3242. ENV: Agent did: predict-no for direction R in state State-B
  3243. In State-B moving R
  3244. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3245. predict error 0
  3246. dir: dir isL
  3247. \458: O: O915 (predict-yes)
  3248. I see 1 and I'm going to do: predict-yes
  3249. ENV: Agent did: predict-yes for direction L in state State-B
  3250. In State-B moving L
  3251. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3252. predict error 0
  3253. dir: dir isR
  3254. -/459: O: O917 (predict-yes)
  3255. I see 1 and I'm going to do: predict-yes
  3256. ENV: Agent did: predict-yes for direction R in state State-A
  3257. In State-A moving R
  3258. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3259. predict error 0
  3260. dir: dir isR
  3261. |\-460: O: O920 (predict-no)
  3262. I see 1 and I'm going to do: predict-no
  3263. ENV: Agent did: predict-no for direction R in state State-B
  3264. In State-B moving R
  3265. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3266. predict error 0
  3267. dir: dir isL
  3268. /|\461: O: O921 (predict-yes)
  3269. I see 1 and I'm going to do: predict-yes
  3270. ENV: Agent did: predict-yes for direction L in state State-B
  3271. In State-B moving L
  3272. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3273. predict error 0
  3274. dir: dir isU
  3275. -462: O: O924 (predict-no)
  3276. I see 1 and I'm going to do: predict-no
  3277. ENV: Agent did: predict-no for direction U in state State-A
  3278. In State-A moving U
  3279. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3280. predict error 0
  3281. dir: dir isR
  3282. /|463: O: O925 (predict-yes)
  3283. I see 1 and I'm going to do: predict-yes
  3284. ENV: Agent did: predict-yes for direction R in state State-A
  3285. In State-A moving R
  3286. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3287. predict error 0
  3288. dir: dir isL
  3289. \464: O: O927 (predict-yes)
  3290. I see 1 and I'm going to do: predict-yes
  3291. ENV: Agent did: predict-yes for direction L in state State-B
  3292. In State-B moving L
  3293. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3294. predict error 0
  3295. dir: dir isU
  3296. -/465: O: O930 (predict-no)
  3297. I see 1 and I'm going to do: predict-no
  3298. ENV: Agent did: predict-no for direction U in state State-A
  3299. In State-A moving U
  3300. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3301. predict error 0
  3302. dir: dir isL
  3303. |\-466: O: O932 (predict-no)
  3304. I see 1 and I'm going to do: predict-no
  3305. ENV: Agent did: predict-no for direction L in state State-A
  3306. In State-A moving L
  3307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3308. predict error 0
  3309. dir: dir isL
  3310. /|\-467: O: O934 (predict-no)
  3311. I see 1 and I'm going to do: predict-no
  3312. ENV: Agent did: predict-no for direction L in state State-A
  3313. In State-A moving L
  3314. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3315. predict error 0
  3316. dir: dir isU
  3317. /|\468: O: O936 (predict-no)
  3318. I see 1 and I'm going to do: predict-no
  3319. ENV: Agent did: predict-no for direction U in state State-A
  3320. In State-A moving U
  3321. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3322. predict error 0
  3323. dir: dir isR
  3324. -/469: O: O937 (predict-yes)
  3325. I see 1 and I'm going to do: predict-yes
  3326. ENV: Agent did: predict-yes for direction R in state State-A
  3327. In State-A moving R
  3328. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3329. predict error 0
  3330. dir: dir isR
  3331. |\470: O: O940 (predict-no)
  3332. I see 1 and I'm going to do: predict-no
  3333. ENV: Agent did: predict-no for direction R in state State-B
  3334. In State-B moving R
  3335. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3336. predict error 0
  3337. dir: dir isR
  3338. -/|\471: O: O942 (predict-no)
  3339. I see 1 and I'm going to do: predict-no
  3340. ENV: Agent did: predict-no for direction R in state State-B
  3341. In State-B moving R
  3342. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3343. predict error 0
  3344. dir: dir isR
  3345. -472: O: O944 (predict-no)
  3346. I see 1 and I'm going to do: predict-no
  3347. ENV: Agent did: predict-no for direction R in state State-B
  3348. In State-B moving R
  3349. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3350. predict error 0
  3351. dir: dir isL
  3352. /|473: O: O945 (predict-yes)
  3353. I see 1 and I'm going to do: predict-yes
  3354. ENV: Agent did: predict-yes for direction L in state State-B
  3355. In State-B moving L
  3356. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3357. predict error 0
  3358. dir: dir isL
  3359. \-/474: O: O948 (predict-no)
  3360. I see 1 and I'm going to do: predict-no
  3361. ENV: Agent did: predict-no for direction L in state State-A
  3362. In State-A moving L
  3363. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3364. predict error 0
  3365. dir: dir isL
  3366. |\-475: O: O950 (predict-no)
  3367. I see 1 and I'm going to do: predict-no
  3368. ENV: Agent did: predict-no for direction L in state State-A
  3369. In State-A moving L
  3370. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3371. predict error 0
  3372. dir: dir isU
  3373. /|\476: O: O952 (predict-no)
  3374. I see 1 and I'm going to do: predict-no
  3375. ENV: Agent did: predict-no for direction U in state State-A
  3376. In State-A moving U
  3377. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3378. predict error 0
  3379. dir: dir isU
  3380. -/|477: O: O954 (predict-no)
  3381. I see 1 and I'm going to do: predict-no
  3382. ENV: Agent did: predict-no for direction U in state State-A
  3383. In State-A moving U
  3384. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3385. predict error 0
  3386. dir: dir isR
  3387. \-478: O: O955 (predict-yes)
  3388. I see 1 and I'm going to do: predict-yes
  3389. ENV: Agent did: predict-yes for direction R in state State-A
  3390. In State-A moving R
  3391. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3392. predict error 0
  3393. dir: dir isU
  3394. /|479: O: O958 (predict-no)
  3395. I see 1 and I'm going to do: predict-no
  3396. ENV: Agent did: predict-no for direction U in state State-B
  3397. In State-B moving U
  3398. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3399. predict error 0
  3400. dir: dir isR
  3401. \-/480: O: O960 (predict-no)
  3402. I see 1 and I'm going to do: predict-no
  3403. ENV: Agent did: predict-no for direction R in state State-B
  3404. In State-B moving R
  3405. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3406. predict error 0
  3407. dir: dir isL
  3408. |\-481: O: O961 (predict-yes)
  3409. I see 1 and I'm going to do: predict-yes
  3410. ENV: Agent did: predict-yes for direction L in state State-B
  3411. In State-B moving L
  3412. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3413. predict error 0
  3414. dir: dir isR
  3415. /482: O: O963 (predict-yes)
  3416. I see 1 and I'm going to do: predict-yes
  3417. ENV: Agent did: predict-yes for direction R in state State-A
  3418. In State-A moving R
  3419. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3420. predict error 0
  3421. dir: dir isU
  3422. |\-483: O: O966 (predict-no)
  3423. I see 1 and I'm going to do: predict-no
  3424. ENV: Agent did: predict-no for direction U in state State-B
  3425. In State-B moving U
  3426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3427. predict error 0
  3428. dir: dir isL
  3429. /|\484: O: O967 (predict-yes)
  3430. I see 1 and I'm going to do: predict-yes
  3431. ENV: Agent did: predict-yes for direction L in state State-B
  3432. In State-B moving L
  3433. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3434. predict error 0
  3435. dir: dir isR
  3436. -/485: O: O969 (predict-yes)
  3437. I see 1 and I'm going to do: predict-yes
  3438. ENV: Agent did: predict-yes for direction R in state State-A
  3439. In State-A moving R
  3440. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3441. predict error 0
  3442. dir: dir isL
  3443. |\-486: O: O971 (predict-yes)
  3444. I see 1 and I'm going to do: predict-yes
  3445. ENV: Agent did: predict-yes for direction L in state State-B
  3446. In State-B moving L
  3447. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3448. predict error 0
  3449. dir: dir isU
  3450. /|\487: O: O974 (predict-no)
  3451. I see 1 and I'm going to do: predict-no
  3452. ENV: Agent did: predict-no for direction U in state State-A
  3453. In State-A moving U
  3454. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3455. predict error 0
  3456. dir: dir isR
  3457. -/488: O: O975 (predict-yes)
  3458. I see 1 and I'm going to do: predict-yes
  3459. ENV: Agent did: predict-yes for direction R in state State-A
  3460. In State-A moving R
  3461. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3462. predict error 0
  3463. dir: dir isL
  3464. |489: O: O977 (predict-yes)
  3465. I see 1 and I'm going to do: predict-yes
  3466. ENV: Agent did: predict-yes for direction L in state State-B
  3467. In State-B moving L
  3468. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3469. predict error 0
  3470. dir: dir isR
  3471. \-/490: O: O979 (predict-yes)
  3472. I see 1 and I'm going to do: predict-yes
  3473. ENV: Agent did: predict-yes for direction R in state State-A
  3474. In State-A moving R
  3475. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3476. predict error 0
  3477. dir: dir isR
  3478. |491: O: O982 (predict-no)
  3479. I see 1 and I'm going to do: predict-no
  3480. ENV: Agent did: predict-no for direction R in state State-B
  3481. In State-B moving R
  3482. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3483. predict error 0
  3484. dir: dir isU
  3485. \492: O: O984 (predict-no)
  3486. I see 1 and I'm going to do: predict-no
  3487. ENV: Agent did: predict-no for direction U in state State-B
  3488. In State-B moving U
  3489. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3490. predict error 0
  3491. dir: dir isR
  3492. -/|493: O: O986 (predict-no)
  3493. I see 1 and I'm going to do: predict-no
  3494. ENV: Agent did: predict-no for direction R in state State-B
  3495. In State-B moving R
  3496. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3497. predict error 0
  3498. dir: dir isU
  3499. \-/494: O: O988 (predict-no)
  3500. I see 1 and I'm going to do: predict-no
  3501. ENV: Agent did: predict-no for direction U in state State-B
  3502. In State-B moving U
  3503. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3504. predict error 0
  3505. dir: dir isL
  3506. |\-495: O: O989 (predict-yes)
  3507. I see 1 and I'm going to do: predict-yes
  3508. ENV: Agent did: predict-yes for direction L in state State-B
  3509. In State-B moving L
  3510. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3511. predict error 0
  3512. dir: dir isL
  3513. /|\496: O: O992 (predict-no)
  3514. I see 1 and I'm going to do: predict-no
  3515. ENV: Agent did: predict-no for direction L in state State-A
  3516. In State-A moving L
  3517. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3518. predict error 0
  3519. dir: dir isL
  3520. -/|497: O: O994 (predict-no)
  3521. I see 1 and I'm going to do: predict-no
  3522. ENV: Agent did: predict-no for direction L in state State-A
  3523. In State-A moving L
  3524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3525. predict error 0
  3526. dir: dir isL
  3527. \-/498: O: O996 (predict-no)
  3528. I see 1 and I'm going to do: predict-no
  3529. ENV: Agent did: predict-no for direction L in state State-A
  3530. In State-A moving L
  3531. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3532. predict error 0
  3533. dir: dir isU
  3534. |\-/499: O: O998 (predict-no)
  3535. I see 1 and I'm going to do: predict-no
  3536. ENV: Agent did: predict-no for direction U in state State-A
  3537. In State-A moving U
  3538. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3539. predict error 0
  3540. dir: dir isU
  3541. |\500: O: O1000 (predict-no)
  3542. I see 1 and I'm going to do: predict-no
  3543. ENV: Agent did: predict-no for direction U in state State-A
  3544. In State-A moving U
  3545. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3546. predict error 0
  3547. dir: dir isL
  3548. -/|\501: O: O1002 (predict-no)
  3549. I see 1 and I'm going to do: predict-no
  3550. ENV: Agent did: predict-no for direction L in state State-A
  3551. In State-A moving L
  3552. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3553. predict error 0
  3554. dir: dir isL
  3555. -502: O: O1004 (predict-no)
  3556. I see 1 and I'm going to do: predict-no
  3557. ENV: Agent did: predict-no for direction L in state State-A
  3558. In State-A moving L
  3559. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3560. predict error 0
  3561. dir: dir isR
  3562. /|\-503: O: O1005 (predict-yes)
  3563. I see 1 and I'm going to do: predict-yes
  3564. ENV: Agent did: predict-yes for direction R in state State-A
  3565. In State-A moving R
  3566. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3567. predict error 0
  3568. dir: dir isU
  3569. /|504: O: O1008 (predict-no)
  3570. I see 1 and I'm going to do: predict-no
  3571. ENV: Agent did: predict-no for direction U in state State-B
  3572. In State-B moving U
  3573. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3574. predict error 0
  3575. dir: dir isU
  3576. \-505: O: O1010 (predict-no)
  3577. I see 1 and I'm going to do: predict-no
  3578. ENV: Agent did: predict-no for direction U in state State-B
  3579. In State-B moving U
  3580. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3581. predict error 0
  3582. dir: dir isU
  3583. /|\506: O: O1012 (predict-no)
  3584. I see 1 and I'm going to do: predict-no
  3585. ENV: Agent did: predict-no for direction U in state State-B
  3586. In State-B moving U
  3587. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3588. predict error 0
  3589. dir: dir isR
  3590. -/|507: O: O1014 (predict-no)
  3591. I see 1 and I'm going to do: predict-no
  3592. ENV: Agent did: predict-no for direction R in state State-B
  3593. In State-B moving R
  3594. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3595. predict error 0
  3596. dir: dir isL
  3597. \-/508: O: O1015 (predict-yes)
  3598. I see 1 and I'm going to do: predict-yes
  3599. ENV: Agent did: predict-yes for direction L in state State-B
  3600. In State-B moving L
  3601. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3602. predict error 0
  3603. dir: dir isR
  3604. |\-509: O: O1017 (predict-yes)
  3605. I see 1 and I'm going to do: predict-yes
  3606. ENV: Agent did: predict-yes for direction R in state State-A
  3607. In State-A moving R
  3608. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3609. predict error 0
  3610. dir: dir isL
  3611. /|\510: O: O1019 (predict-yes)
  3612. I see 1 and I'm going to do: predict-yes
  3613. ENV: Agent did: predict-yes for direction L in state State-B
  3614. In State-B moving L
  3615. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3616. predict error 0
  3617. dir: dir isR
  3618. -/|511: O: O1021 (predict-yes)
  3619. I see 1 and I'm going to do: predict-yes
  3620. ENV: Agent did: predict-yes for direction R in state State-A
  3621. In State-A moving R
  3622. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3623. predict error 0
  3624. dir: dir isR
  3625. \512: O: O1024 (predict-no)
  3626. I see 1 and I'm going to do: predict-no
  3627. ENV: Agent did: predict-no for direction R in state State-B
  3628. In State-B moving R
  3629. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3630. predict error 0
  3631. dir: dir isL
  3632. -/513: O: O1025 (predict-yes)
  3633. I see 1 and I'm going to do: predict-yes
  3634. ENV: Agent did: predict-yes for direction L in state State-B
  3635. In State-B moving L
  3636. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3637. predict error 0
  3638. dir: dir isL
  3639. |\-514: O: O1028 (predict-no)
  3640. I see 1 and I'm going to do: predict-no
  3641. ENV: Agent did: predict-no for direction L in state State-A
  3642. In State-A moving L
  3643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3644. predict error 0
  3645. dir: dir isU
  3646. /|\515: O: O1030 (predict-no)
  3647. I see 1 and I'm going to do: predict-no
  3648. ENV: Agent did: predict-no for direction U in state State-A
  3649. In State-A moving U
  3650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3651. predict error 0
  3652. dir: dir isL
  3653. -/|516: O: O1032 (predict-no)
  3654. I see 1 and I'm going to do: predict-no
  3655. ENV: Agent did: predict-no for direction L in state State-A
  3656. In State-A moving L
  3657. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3658. predict error 0
  3659. dir: dir isU
  3660. \-517: O: O1034 (predict-no)
  3661. I see 1 and I'm going to do: predict-no
  3662. ENV: Agent did: predict-no for direction U in state State-A
  3663. In State-A moving U
  3664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3665. predict error 0
  3666. dir: dir isU
  3667. /|\518: O: O1036 (predict-no)
  3668. I see 1 and I'm going to do: predict-no
  3669. ENV: Agent did: predict-no for direction U in state State-A
  3670. In State-A moving U
  3671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3672. predict error 0
  3673. dir: dir isU
  3674. -/519: O: O1038 (predict-no)
  3675. I see 1 and I'm going to do: predict-no
  3676. ENV: Agent did: predict-no for direction U in state State-A
  3677. In State-A moving U
  3678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3679. predict error 0
  3680. dir: dir isR
  3681. |\520: O: O1039 (predict-yes)
  3682. I see 1 and I'm going to do: predict-yes
  3683. ENV: Agent did: predict-yes for direction R in state State-A
  3684. In State-A moving R
  3685. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3686. predict error 0
  3687. dir: dir isR
  3688. -/|521: O: O1042 (predict-no)
  3689. I see 1 and I'm going to do: predict-no
  3690. ENV: Agent did: predict-no for direction R in state State-B
  3691. In State-B moving R
  3692. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3693. predict error 0
  3694. dir: dir isR
  3695. \522: O: O1044 (predict-no)
  3696. I see 1 and I'm going to do: predict-no
  3697. ENV: Agent did: predict-no for direction R in state State-B
  3698. In State-B moving R
  3699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3700. predict error 0
  3701. dir: dir isR
  3702. -/523: O: O1046 (predict-no)
  3703. I see 1 and I'm going to do: predict-no
  3704. ENV: Agent did: predict-no for direction R in state State-B
  3705. In State-B moving R
  3706. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3707. predict error 0
  3708. dir: dir isU
  3709. |\-524: O: O1048 (predict-no)
  3710. I see 1 and I'm going to do: predict-no
  3711. ENV: Agent did: predict-no for direction U in state State-B
  3712. In State-B moving U
  3713. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3714. predict error 0
  3715. dir: dir isU
  3716. /|\525: O: O1050 (predict-no)
  3717. I see 1 and I'm going to do: predict-no
  3718. ENV: Agent did: predict-no for direction U in state State-B
  3719. In State-B moving U
  3720. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3721. predict error 0
  3722. dir: dir isL
  3723. -/|526: O: O1051 (predict-yes)
  3724. I see 1 and I'm going to do: predict-yes
  3725. ENV: Agent did: predict-yes for direction L in state State-B
  3726. In State-B moving L
  3727. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3728. predict error 0
  3729. dir: dir isU
  3730. \-/527: O: O1054 (predict-no)
  3731. I see 1 and I'm going to do: predict-no
  3732. ENV: Agent did: predict-no for direction U in state State-A
  3733. In State-A moving U
  3734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3735. predict error 0
  3736. dir: dir isR
  3737. |\528: O: O1055 (predict-yes)
  3738. I see 1 and I'm going to do: predict-yes
  3739. ENV: Agent did: predict-yes for direction R in state State-A
  3740. In State-A moving R
  3741. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3742. predict error 0
  3743. dir: dir isR
  3744. -529: O: O1058 (predict-no)
  3745. I see 1 and I'm going to do: predict-no
  3746. ENV: Agent did: predict-no for direction R in state State-B
  3747. In State-B moving R
  3748. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3749. predict error 0
  3750. dir: dir isU
  3751. /|530: O: O1060 (predict-no)
  3752. I see 1 and I'm going to do: predict-no
  3753. ENV: Agent did: predict-no for direction U in state State-B
  3754. In State-B moving U
  3755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3756. predict error 0
  3757. dir: dir isL
  3758. \-/531: O: O1061 (predict-yes)
  3759. I see 1 and I'm going to do: predict-yes
  3760. ENV: Agent did: predict-yes for direction L in state State-B
  3761. In State-B moving L
  3762. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3763. predict error 0
  3764. dir: dir isU
  3765. |532: O: O1064 (predict-no)
  3766. I see 1 and I'm going to do: predict-no
  3767. ENV: Agent did: predict-no for direction U in state State-A
  3768. In State-A moving U
  3769. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3770. predict error 0
  3771. dir: dir isR
  3772. \-/533: O: O1065 (predict-yes)
  3773. I see 1 and I'm going to do: predict-yes
  3774. ENV: Agent did: predict-yes for direction R in state State-A
  3775. In State-A moving R
  3776. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3777. predict error 0
  3778. dir: dir isR
  3779. |\-534: O: O1068 (predict-no)
  3780. I see 1 and I'm going to do: predict-no
  3781. ENV: Agent did: predict-no for direction R in state State-B
  3782. In State-B moving R
  3783. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3784. predict error 0
  3785. dir: dir isR
  3786. /|\535: O: O1070 (predict-no)
  3787. I see 1 and I'm going to do: predict-no
  3788. ENV: Agent did: predict-no for direction R in state State-B
  3789. In State-B moving R
  3790. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3791. predict error 0
  3792. dir: dir isR
  3793. -/|536: O: O1072 (predict-no)
  3794. I see 1 and I'm going to do: predict-no
  3795. ENV: Agent did: predict-no for direction R in state State-B
  3796. In State-B moving R
  3797. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3798. predict error 0
  3799. dir: dir isL
  3800. \-537: O: O1073 (predict-yes)
  3801. I see 1 and I'm going to do: predict-yes
  3802. ENV: Agent did: predict-yes for direction L in state State-B
  3803. In State-B moving L
  3804. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3805. predict error 0
  3806. dir: dir isR
  3807. /|\538: O: O1075 (predict-yes)
  3808. I see 1 and I'm going to do: predict-yes
  3809. ENV: Agent did: predict-yes for direction R in state State-A
  3810. In State-A moving R
  3811. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3812. predict error 0
  3813. dir: dir isL
  3814. -539: O: O1077 (predict-yes)
  3815. I see 1 and I'm going to do: predict-yes
  3816. ENV: Agent did: predict-yes for direction L in state State-B
  3817. In State-B moving L
  3818. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3819. predict error 0
  3820. dir: dir isR
  3821. /|\540: O: O1079 (predict-yes)
  3822. I see 1 and I'm going to do: predict-yes
  3823. ENV: Agent did: predict-yes for direction R in state State-A
  3824. In State-A moving R
  3825. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3826. predict error 0
  3827. dir: dir isR
  3828. -/541: O: O1082 (predict-no)
  3829. I see 1 and I'm going to do: predict-no
  3830. ENV: Agent did: predict-no for direction R in state State-B
  3831. In State-B moving R
  3832. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3833. predict error 0
  3834. dir: dir isR
  3835. |542: O: O1084 (predict-no)
  3836. I see 1 and I'm going to do: predict-no
  3837. ENV: Agent did: predict-no for direction R in state State-B
  3838. In State-B moving R
  3839. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3840. predict error 0
  3841. dir: dir isR
  3842. \-543: O: O1086 (predict-no)
  3843. I see 1 and I'm going to do: predict-no
  3844. ENV: Agent did: predict-no for direction R in state State-B
  3845. In State-B moving R
  3846. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3847. predict error 0
  3848. dir: dir isU
  3849. /544: O: O1088 (predict-no)
  3850. I see 1 and I'm going to do: predict-no
  3851. ENV: Agent did: predict-no for direction U in state State-B
  3852. In State-B moving U
  3853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3854. predict error 0
  3855. dir: dir isU
  3856. |\545: O: O1090 (predict-no)
  3857. I see 1 and I'm going to do: predict-no
  3858. ENV: Agent did: predict-no for direction U in state State-B
  3859. In State-B moving U
  3860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3861. predict error 0
  3862. dir: dir isL
  3863. -/|546: O: O1091 (predict-yes)
  3864. I see 1 and I'm going to do: predict-yes
  3865. ENV: Agent did: predict-yes for direction L in state State-B
  3866. In State-B moving L
  3867. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3868. predict error 0
  3869. dir: dir isU
  3870. \-/547: O: O1094 (predict-no)
  3871. I see 1 and I'm going to do: predict-no
  3872. ENV: Agent did: predict-no for direction U in state State-A
  3873. In State-A moving U
  3874. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3875. predict error 0
  3876. dir: dir isR
  3877. |\-548: O: O1095 (predict-yes)
  3878. I see 1 and I'm going to do: predict-yes
  3879. ENV: Agent did: predict-yes for direction R in state State-A
  3880. In State-A moving R
  3881. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3882. predict error 0
  3883. dir: dir isL
  3884. /|\549: O: O1097 (predict-yes)
  3885. I see 1 and I'm going to do: predict-yes
  3886. ENV: Agent did: predict-yes for direction L in state State-B
  3887. In State-B moving L
  3888. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3889. predict error 0
  3890. dir: dir isR
  3891. -/|550: O: O1099 (predict-yes)
  3892. I see 1 and I'm going to do: predict-yes
  3893. ENV: Agent did: predict-yes for direction R in state State-A
  3894. In State-A moving R
  3895. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3896. predict error 0
  3897. dir: dir isL
  3898. \-/551: O: O1101 (predict-yes)
  3899. I see 1 and I'm going to do: predict-yes
  3900. ENV: Agent did: predict-yes for direction L in state State-B
  3901. In State-B moving L
  3902. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3903. predict error 0
  3904. dir: dir isL
  3905. |552: O: O1104 (predict-no)
  3906. I see 1 and I'm going to do: predict-no
  3907. ENV: Agent did: predict-no for direction L in state State-A
  3908. In State-A moving L
  3909. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3910. predict error 0
  3911. dir: dir isR
  3912. \-553: O: O1105 (predict-yes)
  3913. I see 1 and I'm going to do: predict-yes
  3914. ENV: Agent did: predict-yes for direction R in state State-A
  3915. In State-A moving R
  3916. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3917. predict error 0
  3918. dir: dir isU
  3919. /|\554: O: O1108 (predict-no)
  3920. I see 1 and I'm going to do: predict-no
  3921. ENV: Agent did: predict-no for direction U in state State-B
  3922. In State-B moving U
  3923. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3924. predict error 0
  3925. dir: dir isL
  3926. -555: O: O1109 (predict-yes)
  3927. I see 1 and I'm going to do: predict-yes
  3928. ENV: Agent did: predict-yes for direction L in state State-B
  3929. In State-B moving L
  3930. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3931. predict error 0
  3932. dir: dir isU
  3933. /|\556: O: O1112 (predict-no)
  3934. I see 1 and I'm going to do: predict-no
  3935. ENV: Agent did: predict-no for direction U in state State-A
  3936. In State-A moving U
  3937. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3938. predict error 0
  3939. dir: dir isU
  3940. -/|557: O: O1114 (predict-no)
  3941. I see 1 and I'm going to do: predict-no
  3942. ENV: Agent did: predict-no for direction U in state State-A
  3943. In State-A moving U
  3944. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3945. predict error 0
  3946. dir: dir isL
  3947. \-/558: O: O1116 (predict-no)
  3948. I see 1 and I'm going to do: predict-no
  3949. ENV: Agent did: predict-no for direction L in state State-A
  3950. In State-A moving L
  3951. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3952. predict error 0
  3953. dir: dir isU
  3954. |\-559: O: O1118 (predict-no)
  3955. I see 1 and I'm going to do: predict-no
  3956. ENV: Agent did: predict-no for direction U in state State-A
  3957. In State-A moving U
  3958. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3959. predict error 0
  3960. dir: dir isR
  3961. /|\560: O: O1119 (predict-yes)
  3962. I see 1 and I'm going to do: predict-yes
  3963. ENV: Agent did: predict-yes for direction R in state State-A
  3964. In State-A moving R
  3965. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3966. predict error 0
  3967. dir: dir isU
  3968. -/|\561: O: O1122 (predict-no)
  3969. I see 1 and I'm going to do: predict-no
  3970. ENV: Agent did: predict-no for direction U in state State-B
  3971. In State-B moving U
  3972. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3973. predict error 0
  3974. dir: dir isU
  3975. -562: O: O1124 (predict-no)
  3976. I see 1 and I'm going to do: predict-no
  3977. ENV: Agent did: predict-no for direction U in state State-B
  3978. In State-B moving U
  3979. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3980. predict error 0
  3981. dir: dir isU
  3982. /|563: O: O1126 (predict-no)
  3983. I see 1 and I'm going to do: predict-no
  3984. ENV: Agent did: predict-no for direction U in state State-B
  3985. In State-B moving U
  3986. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3987. predict error 0
  3988. dir: dir isR
  3989. \-/564: O: O1128 (predict-no)
  3990. I see 1 and I'm going to do: predict-no
  3991. ENV: Agent did: predict-no for direction R in state State-B
  3992. In State-B moving R
  3993. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3994. predict error 0
  3995. dir: dir isU
  3996. |\565: O: O1130 (predict-no)
  3997. I see 1 and I'm going to do: predict-no
  3998. ENV: Agent did: predict-no for direction U in state State-B
  3999. In State-B moving U
  4000. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4001. predict error 0
  4002. dir: dir isU
  4003. -/|566: O: O1132 (predict-no)
  4004. I see 1 and I'm going to do: predict-no
  4005. ENV: Agent did: predict-no for direction U in state State-B
  4006. In State-B moving U
  4007. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4008. predict error 0
  4009. dir: dir isU
  4010. \-/567: O: O1134 (predict-no)
  4011. I see 1 and I'm going to do: predict-no
  4012. ENV: Agent did: predict-no for direction U in state State-B
  4013. In State-B moving U
  4014. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4015. predict error 0
  4016. dir: dir isR
  4017. |\-568: O: O1136 (predict-no)
  4018. I see 1 and I'm going to do: predict-no
  4019. ENV: Agent did: predict-no for direction R in state State-B
  4020. In State-B moving R
  4021. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4022. predict error 0
  4023. dir: dir isU
  4024. /|\569: O: O1138 (predict-no)
  4025. I see 1 and I'm going to do: predict-no
  4026. ENV: Agent did: predict-no for direction U in state State-B
  4027. In State-B moving U
  4028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4029. predict error 0
  4030. dir: dir isR
  4031. -/|570: O: O1140 (predict-no)
  4032. I see 1 and I'm going to do: predict-no
  4033. ENV: Agent did: predict-no for direction R in state State-B
  4034. In State-B moving R
  4035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4036. predict error 0
  4037. dir: dir isL
  4038. \571: O: O1141 (predict-yes)
  4039. I see 1 and I'm going to do: predict-yes
  4040. ENV: Agent did: predict-yes for direction L in state State-B
  4041. In State-B moving L
  4042. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4043. predict error 0
  4044. dir: dir isL
  4045. -572: O: O1144 (predict-no)
  4046. I see 1 and I'm going to do: predict-no
  4047. ENV: Agent did: predict-no for direction L in state State-A
  4048. In State-A moving L
  4049. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4050. predict error 0
  4051. dir: dir isL
  4052. /|\573: O: O1146 (predict-no)
  4053. I see 1 and I'm going to do: predict-no
  4054. ENV: Agent did: predict-no for direction L in state State-A
  4055. In State-A moving L
  4056. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4057. predict error 0
  4058. dir: dir isL
  4059. -/|574: O: O1148 (predict-no)
  4060. I see 1 and I'm going to do: predict-no
  4061. ENV: Agent did: predict-no for direction L in state State-A
  4062. In State-A moving L
  4063. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4064. predict error 0
  4065. dir: dir isL
  4066. \-/575: O: O1150 (predict-no)
  4067. I see 1 and I'm going to do: predict-no
  4068. ENV: Agent did: predict-no for direction L in state State-A
  4069. In State-A moving L
  4070. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4071. predict error 0
  4072. dir: dir isR
  4073. |\-576: O: O1151 (predict-yes)
  4074. I see 1 and I'm going to do: predict-yes
  4075. ENV: Agent did: predict-yes for direction R in state State-A
  4076. In State-A moving R
  4077. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4078. predict error 0
  4079. dir: dir isU
  4080. /|\577: O: O1154 (predict-no)
  4081. I see 1 and I'm going to do: predict-no
  4082. ENV: Agent did: predict-no for direction U in state State-B
  4083. In State-B moving U
  4084. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4085. predict error 0
  4086. dir: dir isR
  4087. -/578: O: O1156 (predict-no)
  4088. I see 1 and I'm going to do: predict-no
  4089. ENV: Agent did: predict-no for direction R in state State-B
  4090. In State-B moving R
  4091. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4092. predict error 0
  4093. dir: dir isR
  4094. |\-579: O: O1158 (predict-no)
  4095. I see 1 and I'm going to do: predict-no
  4096. ENV: Agent did: predict-no for direction R in state State-B
  4097. In State-B moving R
  4098. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4099. predict error 0
  4100. dir: dir isR
  4101. /|\580: O: O1160 (predict-no)
  4102. I see 1 and I'm going to do: predict-no
  4103. ENV: Agent did: predict-no for direction R in state State-B
  4104. In State-B moving R
  4105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4106. predict error 0
  4107. dir: dir isL
  4108. -/581: O: O1161 (predict-yes)
  4109. I see 1 and I'm going to do: predict-yes
  4110. ENV: Agent did: predict-yes for direction L in state State-B
  4111. In State-B moving L
  4112. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4113. predict error 0
  4114. dir: dir isR
  4115. |582: O: O1163 (predict-yes)
  4116. I see 1 and I'm going to do: predict-yes
  4117. ENV: Agent did: predict-yes for direction R in state State-A
  4118. In State-A moving R
  4119. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4120. predict error 0
  4121. dir: dir isU
  4122. \-583: O: O1166 (predict-no)
  4123. I see 1 and I'm going to do: predict-no
  4124. ENV: Agent did: predict-no for direction U in state State-B
  4125. In State-B moving U
  4126. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4127. predict error 0
  4128. dir: dir isR
  4129. /|\584: O: O1168 (predict-no)
  4130. I see 1 and I'm going to do: predict-no
  4131. ENV: Agent did: predict-no for direction R in state State-B
  4132. In State-B moving R
  4133. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4134. predict error 0
  4135. dir: dir isL
  4136. -/|\585: O: O1169 (predict-yes)
  4137. I see 1 and I'm going to do: predict-yes
  4138. ENV: Agent did: predict-yes for direction L in state State-B
  4139. In State-B moving L
  4140. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4141. predict error 0
  4142. dir: dir isU
  4143. -/|586: O: O1172 (predict-no)
  4144. I see 1 and I'm going to do: predict-no
  4145. ENV: Agent did: predict-no for direction U in state State-A
  4146. In State-A moving U
  4147. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4148. predict error 0
  4149. dir: dir isU
  4150. \-/587: O: O1174 (predict-no)
  4151. I see 1 and I'm going to do: predict-no
  4152. ENV: Agent did: predict-no for direction U in state State-A
  4153. In State-A moving U
  4154. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4155. predict error 0
  4156. dir: dir isR
  4157. |\-588: O: O1175 (predict-yes)
  4158. I see 1 and I'm going to do: predict-yes
  4159. ENV: Agent did: predict-yes for direction R in state State-A
  4160. In State-A moving R
  4161. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4162. predict error 0
  4163. dir: dir isR
  4164. /|\589: O: O1178 (predict-no)
  4165. I see 1 and I'm going to do: predict-no
  4166. ENV: Agent did: predict-no for direction R in state State-B
  4167. In State-B moving R
  4168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4169. predict error 0
  4170. dir: dir isR
  4171. -/|590: O: O1180 (predict-no)
  4172. I see 1 and I'm going to do: predict-no
  4173. ENV: Agent did: predict-no for direction R in state State-B
  4174. In State-B moving R
  4175. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4176. predict error 0
  4177. dir: dir isR
  4178. \-591: O: O1182 (predict-no)
  4179. I see 1 and I'm going to do: predict-no
  4180. ENV: Agent did: predict-no for direction R in state State-B
  4181. In State-B moving R
  4182. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4183. predict error 0
  4184. dir: dir isL
  4185. /592: O: O1183 (predict-yes)
  4186. I see 1 and I'm going to do: predict-yes
  4187. ENV: Agent did: predict-yes for direction L in state State-B
  4188. In State-B moving L
  4189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4190. predict error 0
  4191. dir: dir isL
  4192. |\-593: O: O1186 (predict-no)
  4193. I see 1 and I'm going to do: predict-no
  4194. ENV: Agent did: predict-no for direction L in state State-A
  4195. In State-A moving L
  4196. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4197. predict error 0
  4198. dir: dir isL
  4199. /|\-594: O: O1188 (predict-no)
  4200. I see 1 and I'm going to do: predict-no
  4201. ENV: Agent did: predict-no for direction L in state State-A
  4202. In State-A moving L
  4203. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4204. predict error 0
  4205. dir: dir isU
  4206. /|595: O: O1190 (predict-no)
  4207. I see 1 and I'm going to do: predict-no
  4208. ENV: Agent did: predict-no for direction U in state State-A
  4209. In State-A moving U
  4210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4211. predict error 0
  4212. dir: dir isL
  4213. \-/596: O: O1192 (predict-no)
  4214. I see 1 and I'm going to do: predict-no
  4215. ENV: Agent did: predict-no for direction L in state State-A
  4216. In State-A moving L
  4217. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4218. predict error 0
  4219. dir: dir isR
  4220. |\-597: O: O1193 (predict-yes)
  4221. I see 1 and I'm going to do: predict-yes
  4222. ENV: Agent did: predict-yes for direction R in state State-A
  4223. In State-A moving R
  4224. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4225. predict error 0
  4226. dir: dir isL
  4227. /|\598: O: O1195 (predict-yes)
  4228. I see 1 and I'm going to do: predict-yes
  4229. ENV: Agent did: predict-yes for direction L in state State-B
  4230. In State-B moving L
  4231. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4232. predict error 0
  4233. dir: dir isL
  4234. -/|599: O: O1198 (predict-no)
  4235. I see 1 and I'm going to do: predict-no
  4236. ENV: Agent did: predict-no for direction L in state State-A
  4237. In State-A moving L
  4238. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4239. predict error 0
  4240. dir: dir isL
  4241. \-600: O: O1200 (predict-no)
  4242. I see 1 and I'm going to do: predict-no
  4243. ENV: Agent did: predict-no for direction L in state State-A
  4244. In State-A moving L
  4245. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4246. predict error 0
  4247. dir: dir isR
  4248. /|\601: O: O1201 (predict-yes)
  4249. I see 1 and I'm going to do: predict-yes
  4250. ENV: Agent did: predict-yes for direction R in state State-A
  4251. In State-A moving R
  4252. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4253. predict error 0
  4254. dir: dir isL
  4255. -602: O: O1203 (predict-yes)
  4256. I see 1 and I'm going to do: predict-yes
  4257. ENV: Agent did: predict-yes for direction L in state State-B
  4258. In State-B moving L
  4259. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4260. predict error 0
  4261. dir: dir isL
  4262. /|\-603: O: O1206 (predict-no)
  4263. I see 1 and I'm going to do: predict-no
  4264. ENV: Agent did: predict-no for direction L in state State-A
  4265. In State-A moving L
  4266. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4267. predict error 0
  4268. dir: dir isL
  4269. /|604: O: O1208 (predict-no)
  4270. I see 1 and I'm going to do: predict-no
  4271. ENV: Agent did: predict-no for direction L in state State-A
  4272. In State-A moving L
  4273. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4274. predict error 0
  4275. dir: dir isL
  4276. \-/605: O: O1210 (predict-no)
  4277. I see 1 and I'm going to do: predict-no
  4278. ENV: Agent did: predict-no for direction L in state State-A
  4279. In State-A moving L
  4280. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4281. predict error 0
  4282. dir: dir isR
  4283. |\-/606: O: O1211 (predict-yes)
  4284. I see 1 and I'm going to do: predict-yes
  4285. ENV: Agent did: predict-yes for direction R in state State-A
  4286. In State-A moving R
  4287. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4288. predict error 0
  4289. dir: dir isL
  4290. |\607: O: O1213 (predict-yes)
  4291. I see 1 and I'm going to do: predict-yes
  4292. ENV: Agent did: predict-yes for direction L in state State-B
  4293. In State-B moving L
  4294. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4295. predict error 0
  4296. dir: dir isL
  4297. -/|608: O: O1216 (predict-no)
  4298. I see 1 and I'm going to do: predict-no
  4299. ENV: Agent did: predict-no for direction L in state State-A
  4300. In State-A moving L
  4301. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4302. predict error 0
  4303. dir: dir isR
  4304. \-/609: O: O1217 (predict-yes)
  4305. I see 1 and I'm going to do: predict-yes
  4306. ENV: Agent did: predict-yes for direction R in state State-A
  4307. In State-A moving R
  4308. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4309. predict error 0
  4310. dir: dir isR
  4311. |\610: O: O1220 (predict-no)
  4312. I see 1 and I'm going to do: predict-no
  4313. ENV: Agent did: predict-no for direction R in state State-B
  4314. In State-B moving R
  4315. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4316. predict error 0
  4317. dir: dir isU
  4318. -/|611: O: O1222 (predict-no)
  4319. I see 1 and I'm going to do: predict-no
  4320. ENV: Agent did: predict-no for direction U in state State-B
  4321. In State-B moving U
  4322. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4323. predict error 0
  4324. dir: dir isL
  4325. \612: O: O1223 (predict-yes)
  4326. I see 1 and I'm going to do: predict-yes
  4327. ENV: Agent did: predict-yes for direction L in state State-B
  4328. In State-B moving L
  4329. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4330. predict error 0
  4331. dir: dir isR
  4332. -/|613: O: O1225 (predict-yes)
  4333. I see 1 and I'm going to do: predict-yes
  4334. ENV: Agent did: predict-yes for direction R in state State-A
  4335. In State-A moving R
  4336. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4337. predict error 0
  4338. dir: dir isR
  4339. \-/614: O: O1228 (predict-no)
  4340. I see 1 and I'm going to do: predict-no
  4341. ENV: Agent did: predict-no for direction R in state State-B
  4342. In State-B moving R
  4343. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4344. predict error 0
  4345. dir: dir isL
  4346. |\-615: O: O1229 (predict-yes)
  4347. I see 1 and I'm going to do: predict-yes
  4348. ENV: Agent did: predict-yes for direction L in state State-B
  4349. In State-B moving L
  4350. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4351. predict error 0
  4352. dir: dir isU
  4353. /|\616: O: O1232 (predict-no)
  4354. I see 1 and I'm going to do: predict-no
  4355. ENV: Agent did: predict-no for direction U in state State-A
  4356. In State-A moving U
  4357. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4358. predict error 0
  4359. dir: dir isL
  4360. -/|617: O: O1234 (predict-no)
  4361. I see 1 and I'm going to do: predict-no
  4362. ENV: Agent did: predict-no for direction L in state State-A
  4363. In State-A moving L
  4364. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4365. predict error 0
  4366. dir: dir isL
  4367. \-618: O: O1236 (predict-no)
  4368. I see 1 and I'm going to do: predict-no
  4369. ENV: Agent did: predict-no for direction L in state State-A
  4370. In State-A moving L
  4371. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4372. predict error 0
  4373. dir: dir isR
  4374. /|619: O: O1237 (predict-yes)
  4375. I see 1 and I'm going to do: predict-yes
  4376. ENV: Agent did: predict-yes for direction R in state State-A
  4377. In State-A moving R
  4378. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4379. predict error 0
  4380. dir: dir isL
  4381. \-/620: O: O1239 (predict-yes)
  4382. I see 1 and I'm going to do: predict-yes
  4383. ENV: Agent did: predict-yes for direction L in state State-B
  4384. In State-B moving L
  4385. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4386. predict error 0
  4387. dir: dir isR
  4388. |\-621: O: O1241 (predict-yes)
  4389. I see 1 and I'm going to do: predict-yes
  4390. ENV: Agent did: predict-yes for direction R in state State-A
  4391. In State-A moving R
  4392. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4393. predict error 0
  4394. dir: dir isR
  4395. /622: O: O1244 (predict-no)
  4396. I see 1 and I'm going to do: predict-no
  4397. ENV: Agent did: predict-no for direction R in state State-B
  4398. In State-B moving R
  4399. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4400. predict error 0
  4401. dir: dir isL
  4402. |\623: O: O1245 (predict-yes)
  4403. I see 1 and I'm going to do: predict-yes
  4404. ENV: Agent did: predict-yes for direction L in state State-B
  4405. In State-B moving L
  4406. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4407. predict error 0
  4408. dir: dir isL
  4409. -/624: O: O1248 (predict-no)
  4410. I see 1 and I'm going to do: predict-no
  4411. ENV: Agent did: predict-no for direction L in state State-A
  4412. In State-A moving L
  4413. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4414. predict error 0
  4415. dir: dir isU
  4416. |\625: O: O1250 (predict-no)
  4417. I see 1 and I'm going to do: predict-no
  4418. ENV: Agent did: predict-no for direction U in state State-A
  4419. In State-A moving U
  4420. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4421. predict error 0
  4422. dir: dir isL
  4423. -/|626: O: O1252 (predict-no)
  4424. I see 1 and I'm going to do: predict-no
  4425. ENV: Agent did: predict-no for direction L in state State-A
  4426. In State-A moving L
  4427. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4428. predict error 0
  4429. dir: dir isU
  4430. \-627: O: O1254 (predict-no)
  4431. I see 1 and I'm going to do: predict-no
  4432. ENV: Agent did: predict-no for direction U in state State-A
  4433. In State-A moving U
  4434. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4435. predict error 0
  4436. dir: dir isR
  4437. /|\628: O: O1255 (predict-yes)
  4438. I see 1 and I'm going to do: predict-yes
  4439. ENV: Agent did: predict-yes for direction R in state State-A
  4440. In State-A moving R
  4441. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4442. predict error 0
  4443. dir: dir isU
  4444. -/|\629: O: O1258 (predict-no)
  4445. I see 1 and I'm going to do: predict-no
  4446. ENV: Agent did: predict-no for direction U in state State-B
  4447. In State-B moving U
  4448. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4449. predict error 0
  4450. dir: dir isL
  4451. -/630: O: O1259 (predict-yes)
  4452. I see 1 and I'm going to do: predict-yes
  4453. ENV: Agent did: predict-yes for direction L in state State-B
  4454. In State-B moving L
  4455. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4456. predict error 0
  4457. dir: dir isL
  4458. |\631: O: O1262 (predict-no)
  4459. I see 1 and I'm going to do: predict-no
  4460. ENV: Agent did: predict-no for direction L in state State-A
  4461. In State-A moving L
  4462. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4463. predict error 0
  4464. dir: dir isL
  4465. -632: O: O1264 (predict-no)
  4466. I see 1 and I'm going to do: predict-no
  4467. ENV: Agent did: predict-no for direction L in state State-A
  4468. In State-A moving L
  4469. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4470. predict error 0
  4471. dir: dir isU
  4472. /|\633: O: O1266 (predict-no)
  4473. I see 1 and I'm going to do: predict-no
  4474. ENV: Agent did: predict-no for direction U in state State-A
  4475. In State-A moving U
  4476. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4477. predict error 0
  4478. dir: dir isR
  4479. -/|\634: O: O1267 (predict-yes)
  4480. I see 1 and I'm going to do: predict-yes
  4481. ENV: Agent did: predict-yes for direction R in state State-A
  4482. In State-A moving R
  4483. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4484. predict error 0
  4485. dir: dir isR
  4486. -/|\635: O: O1270 (predict-no)
  4487. I see 1 and I'm going to do: predict-no
  4488. ENV: Agent did: predict-no for direction R in state State-B
  4489. In State-B moving R
  4490. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4491. predict error 0
  4492. dir: dir isL
  4493. -/636: O: O1271 (predict-yes)
  4494. I see 1 and I'm going to do: predict-yes
  4495. ENV: Agent did: predict-yes for direction L in state State-B
  4496. In State-B moving L
  4497. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4498. predict error 0
  4499. dir: dir isU
  4500. |\-637: O: O1274 (predict-no)
  4501. I see 1 and I'm going to do: predict-no
  4502. ENV: Agent did: predict-no for direction U in state State-A
  4503. In State-A moving U
  4504. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4505. predict error 0
  4506. dir: dir isU
  4507. /|638: O: O1276 (predict-no)
  4508. I see 1 and I'm going to do: predict-no
  4509. ENV: Agent did: predict-no for direction U in state State-A
  4510. In State-A moving U
  4511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4512. predict error 0
  4513. dir: dir isR
  4514. \-/639: O: O1277 (predict-yes)
  4515. I see 1 and I'm going to do: predict-yes
  4516. ENV: Agent did: predict-yes for direction R in state State-A
  4517. In State-A moving R
  4518. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4519. predict error 0
  4520. dir: dir isL
  4521. |\-640: O: O1279 (predict-yes)
  4522. I see 1 and I'm going to do: predict-yes
  4523. ENV: Agent did: predict-yes for direction L in state State-B
  4524. In State-B moving L
  4525. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4526. predict error 0
  4527. dir: dir isL
  4528. /|\641: O: O1282 (predict-no)
  4529. I see 1 and I'm going to do: predict-no
  4530. ENV: Agent did: predict-no for direction L in state State-A
  4531. In State-A moving L
  4532. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4533. predict error 0
  4534. dir: dir isL
  4535. -642: O: O1284 (predict-no)
  4536. I see 1 and I'm going to do: predict-no
  4537. ENV: Agent did: predict-no for direction L in state State-A
  4538. In State-A moving L
  4539. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4540. predict error 0
  4541. dir: dir isL
  4542. /|\643: O: O1286 (predict-no)
  4543. I see 1 and I'm going to do: predict-no
  4544. ENV: Agent did: predict-no for direction L in state State-A
  4545. In State-A moving L
  4546. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4547. predict error 0
  4548. dir: dir isL
  4549. -/644: O: O1288 (predict-no)
  4550. I see 1 and I'm going to do: predict-no
  4551. ENV: Agent did: predict-no for direction L in state State-A
  4552. In State-A moving L
  4553. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4554. predict error 0
  4555. dir: dir isL
  4556. |\-645: O: O1290 (predict-no)
  4557. I see 1 and I'm going to do: predict-no
  4558. ENV: Agent did: predict-no for direction L in state State-A
  4559. In State-A moving L
  4560. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4561. predict error 0
  4562. dir: dir isR
  4563. /|\646: O: O1291 (predict-yes)
  4564. I see 1 and I'm going to do: predict-yes
  4565. ENV: Agent did: predict-yes for direction R in state State-A
  4566. In State-A moving R
  4567. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4568. predict error 0
  4569. dir: dir isU
  4570. -/|647: O: O1294 (predict-no)
  4571. I see 1 and I'm going to do: predict-no
  4572. ENV: Agent did: predict-no for direction U in state State-B
  4573. In State-B moving U
  4574. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4575. predict error 0
  4576. dir: dir isL
  4577. \-/648: O: O1295 (predict-yes)
  4578. I see 1 and I'm going to do: predict-yes
  4579. ENV: Agent did: predict-yes for direction L in state State-B
  4580. In State-B moving L
  4581. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4582. predict error 0
  4583. dir: dir isU
  4584. |\-649: O: O1298 (predict-no)
  4585. I see 1 and I'm going to do: predict-no
  4586. ENV: Agent did: predict-no for direction U in state State-A
  4587. In State-A moving U
  4588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4589. predict error 0
  4590. dir: dir isR
  4591. /|\650: O: O1299 (predict-yes)
  4592. I see 1 and I'm going to do: predict-yes
  4593. ENV: Agent did: predict-yes for direction R in state State-A
  4594. In State-A moving R
  4595. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4596. predict error 0
  4597. dir: dir isU
  4598. -/|651: O: O1302 (predict-no)
  4599. I see 1 and I'm going to do: predict-no
  4600. ENV: Agent did: predict-no for direction U in state State-B
  4601. In State-B moving U
  4602. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4603. predict error 0
  4604. dir: dir isU
  4605. \652: O: O1304 (predict-no)
  4606. I see 1 and I'm going to do: predict-no
  4607. ENV: Agent did: predict-no for direction U in state State-B
  4608. In State-B moving U
  4609. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4610. predict error 0
  4611. dir: dir isU
  4612. -/653: O: O1306 (predict-no)
  4613. I see 1 and I'm going to do: predict-no
  4614. ENV: Agent did: predict-no for direction U in state State-B
  4615. In State-B moving U
  4616. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4617. predict error 0
  4618. dir: dir isL
  4619. |\654: O: O1307 (predict-yes)
  4620. I see 1 and I'm going to do: predict-yes
  4621. ENV: Agent did: predict-yes for direction L in state State-B
  4622. In State-B moving L
  4623. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4624. predict error 0
  4625. dir: dir isL
  4626. -/655: O: O1310 (predict-no)
  4627. I see 1 and I'm going to do: predict-no
  4628. ENV: Agent did: predict-no for direction L in state State-A
  4629. In State-A moving L
  4630. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4631. predict error 0
  4632. dir: dir isR
  4633. |656: O: O1311 (predict-yes)
  4634. I see 1 and I'm going to do: predict-yes
  4635. ENV: Agent did: predict-yes for direction R in state State-A
  4636. In State-A moving R
  4637. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4638. predict error 0
  4639. dir: dir isU
  4640. \-/657: O: O1314 (predict-no)
  4641. I see 1 and I'm going to do: predict-no
  4642. ENV: Agent did: predict-no for direction U in state State-B
  4643. In State-B moving U
  4644. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4645. predict error 0
  4646. dir: dir isL
  4647. |\-658: O: O1315 (predict-yes)
  4648. I see 1 and I'm going to do: predict-yes
  4649. ENV: Agent did: predict-yes for direction L in state State-B
  4650. In State-B moving L
  4651. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4652. predict error 0
  4653. dir: dir isL
  4654. /|\659: O: O1318 (predict-no)
  4655. I see 1 and I'm going to do: predict-no
  4656. ENV: Agent did: predict-no for direction L in state State-A
  4657. In State-A moving L
  4658. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4659. predict error 0
  4660. dir: dir isU
  4661. -/|660: O: O1320 (predict-no)
  4662. I see 1 and I'm going to do: predict-no
  4663. ENV: Agent did: predict-no for direction U in state State-A
  4664. In State-A moving U
  4665. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4666. predict error 0
  4667. dir: dir isL
  4668. \-/661: O: O1322 (predict-no)
  4669. I see 1 and I'm going to do: predict-no
  4670. ENV: Agent did: predict-no for direction L in state State-A
  4671. In State-A moving L
  4672. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4673. predict error 0
  4674. dir: dir isR
  4675. |662: O: O1323 (predict-yes)
  4676. I see 1 and I'm going to do: predict-yes
  4677. ENV: Agent did: predict-yes for direction R in state State-A
  4678. In State-A moving R
  4679. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4680. predict error 0
  4681. dir: dir isR
  4682. \-/663: O: O1326 (predict-no)
  4683. I see 1 and I'm going to do: predict-no
  4684. ENV: Agent did: predict-no for direction R in state State-B
  4685. In State-B moving R
  4686. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4687. predict error 0
  4688. dir: dir isU
  4689. |664: O: O1328 (predict-no)
  4690. I see 1 and I'm going to do: predict-no
  4691. ENV: Agent did: predict-no for direction U in state State-B
  4692. In State-B moving U
  4693. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4694. predict error 0
  4695. dir: dir isU
  4696. \-/665: O: O1330 (predict-no)
  4697. I see 1 and I'm going to do: predict-no
  4698. ENV: Agent did: predict-no for direction U in state State-B
  4699. In State-B moving U
  4700. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4701. predict error 0
  4702. dir: dir isL
  4703. |666: O: O1331 (predict-yes)
  4704. I see 1 and I'm going to do: predict-yes
  4705. ENV: Agent did: predict-yes for direction L in state State-B
  4706. In State-B moving L
  4707. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4708. predict error 0
  4709. dir: dir isU
  4710. \-/667: O: O1334 (predict-no)
  4711. I see 1 and I'm going to do: predict-no
  4712. ENV: Agent did: predict-no for direction U in state State-A
  4713. In State-A moving U
  4714. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4715. predict error 0
  4716. dir: dir isU
  4717. |\668: O: O1336 (predict-no)
  4718. I see 1 and I'm going to do: predict-no
  4719. ENV: Agent did: predict-no for direction U in state State-A
  4720. In State-A moving U
  4721. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4722. predict error 0
  4723. dir: dir isU
  4724. -669: O: O1338 (predict-no)
  4725. I see 1 and I'm going to do: predict-no
  4726. ENV: Agent did: predict-no for direction U in state State-A
  4727. In State-A moving U
  4728. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4729. predict error 0
  4730. dir: dir isR
  4731. /|\670: O: O1339 (predict-yes)
  4732. I see 1 and I'm going to do: predict-yes
  4733. ENV: Agent did: predict-yes for direction R in state State-A
  4734. In State-A moving R
  4735. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4736. predict error 0
  4737. dir: dir isU
  4738. -/|671: O: O1342 (predict-no)
  4739. I see 1 and I'm going to do: predict-no
  4740. ENV: Agent did: predict-no for direction U in state State-B
  4741. In State-B moving U
  4742. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4743. predict error 0
  4744. dir: dir isR
  4745. \672: O: O1344 (predict-no)
  4746. I see 1 and I'm going to do: predict-no
  4747. ENV: Agent did: predict-no for direction R in state State-B
  4748. In State-B moving R
  4749. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4750. predict error 0
  4751. dir: dir isR
  4752. -/|673: O: O1346 (predict-no)
  4753. I see 1 and I'm going to do: predict-no
  4754. ENV: Agent did: predict-no for direction R in state State-B
  4755. In State-B moving R
  4756. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4757. predict error 0
  4758. dir: dir isL
  4759. \-/674: O: O1347 (predict-yes)
  4760. I see 1 and I'm going to do: predict-yes
  4761. ENV: Agent did: predict-yes for direction L in state State-B
  4762. In State-B moving L
  4763. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4764. predict error 0
  4765. dir: dir isU
  4766. |\-675: O: O1350 (predict-no)
  4767. I see 1 and I'm going to do: predict-no
  4768. ENV: Agent did: predict-no for direction U in state State-A
  4769. In State-A moving U
  4770. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4771. predict error 0
  4772. dir: dir isR
  4773. /|\676: O: O1351 (predict-yes)
  4774. I see 1 and I'm going to do: predict-yes
  4775. ENV: Agent did: predict-yes for direction R in state State-A
  4776. In State-A moving R
  4777. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4778. predict error 0
  4779. dir: dir isL
  4780. -/|\677: O: O1353 (predict-yes)
  4781. I see 1 and I'm going to do: predict-yes
  4782. ENV: Agent did: predict-yes for direction L in state State-B
  4783. In State-B moving L
  4784. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4785. predict error 0
  4786. dir: dir isL
  4787. -/678: O: O1356 (predict-no)
  4788. I see 1 and I'm going to do: predict-no
  4789. ENV: Agent did: predict-no for direction L in state State-A
  4790. In State-A moving L
  4791. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4792. predict error 0
  4793. dir: dir isU
  4794. |\-679: O: O1358 (predict-no)
  4795. I see 1 and I'm going to do: predict-no
  4796. ENV: Agent did: predict-no for direction U in state State-A
  4797. In State-A moving U
  4798. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4799. predict error 0
  4800. dir: dir isR
  4801. /|\680: O: O1359 (predict-yes)
  4802. I see 1 and I'm going to do: predict-yes
  4803. ENV: Agent did: predict-yes for direction R in state State-A
  4804. In State-A moving R
  4805. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4806. predict error 0
  4807. dir: dir isU
  4808. -/|681: O: O1362 (predict-no)
  4809. I see 1 and I'm going to do: predict-no
  4810. ENV: Agent did: predict-no for direction U in state State-B
  4811. In State-B moving U
  4812. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4813. predict error 0
  4814. dir: dir isR
  4815. \682: O: O1364 (predict-no)
  4816. I see 1 and I'm going to do: predict-no
  4817. ENV: Agent did: predict-no for direction R in state State-B
  4818. In State-B moving R
  4819. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4820. predict error 0
  4821. dir: dir isR
  4822. -/|683: O: O1366 (predict-no)
  4823. I see 1 and I'm going to do: predict-no
  4824. ENV: Agent did: predict-no for direction R in state State-B
  4825. In State-B moving R
  4826. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4827. predict error 0
  4828. dir: dir isL
  4829. \-/684: O: O1367 (predict-yes)
  4830. I see 1 and I'm going to do: predict-yes
  4831. ENV: Agent did: predict-yes for direction L in state State-B
  4832. In State-B moving L
  4833. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4834. predict error 0
  4835. dir: dir isU
  4836. |\-685: O: O1370 (predict-no)
  4837. I see 1 and I'm going to do: predict-no
  4838. ENV: Agent did: predict-no for direction U in state State-A
  4839. In State-A moving U
  4840. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4841. predict error 0
  4842. dir: dir isL
  4843. /|\686: O: O1372 (predict-no)
  4844. I see 1 and I'm going to do: predict-no
  4845. ENV: Agent did: predict-no for direction L in state State-A
  4846. In State-A moving L
  4847. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4848. predict error 0
  4849. dir: dir isR
  4850. -/|687: O: O1373 (predict-yes)
  4851. I see 1 and I'm going to do: predict-yes
  4852. ENV: Agent did: predict-yes for direction R in state State-A
  4853. In State-A moving R
  4854. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4855. predict error 0
  4856. dir: dir isU
  4857. \-688: O: O1376 (predict-no)
  4858. I see 1 and I'm going to do: predict-no
  4859. ENV: Agent did: predict-no for direction U in state State-B
  4860. In State-B moving U
  4861. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4862. predict error 0
  4863. dir: dir isU
  4864. /|689: O: O1378 (predict-no)
  4865. I see 1 and I'm going to do: predict-no
  4866. ENV: Agent did: predict-no for direction U in state State-B
  4867. In State-B moving U
  4868. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4869. predict error 0
  4870. dir: dir isL
  4871. \-/690: O: O1379 (predict-yes)
  4872. I see 1 and I'm going to do: predict-yes
  4873. ENV: Agent did: predict-yes for direction L in state State-B
  4874. In State-B moving L
  4875. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4876. predict error 0
  4877. dir: dir isL
  4878. |\-/691: O: O1382 (predict-no)
  4879. I see 1 and I'm going to do: predict-no
  4880. ENV: Agent did: predict-no for direction L in state State-A
  4881. In State-A moving L
  4882. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4883. predict error 0
  4884. dir: dir isU
  4885. |692: O: O1384 (predict-no)
  4886. I see 1 and I'm going to do: predict-no
  4887. ENV: Agent did: predict-no for direction U in state State-A
  4888. In State-A moving U
  4889. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4890. predict error 0
  4891. dir: dir isR
  4892. \-693: O: O1385 (predict-yes)
  4893. I see 1 and I'm going to do: predict-yes
  4894. ENV: Agent did: predict-yes for direction R in state State-A
  4895. In State-A moving R
  4896. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4897. predict error 0
  4898. dir: dir isU
  4899. /|694: O: O1388 (predict-no)
  4900. I see 1 and I'm going to do: predict-no
  4901. ENV: Agent did: predict-no for direction U in state State-B
  4902. In State-B moving U
  4903. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4904. predict error 0
  4905. dir: dir isU
  4906. \-695: O: O1390 (predict-no)
  4907. I see 1 and I'm going to do: predict-no
  4908. ENV: Agent did: predict-no for direction U in state State-B
  4909. In State-B moving U
  4910. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4911. predict error 0
  4912. dir: dir isR
  4913. /|\696: O: O1392 (predict-no)
  4914. I see 1 and I'm going to do: predict-no
  4915. ENV: Agent did: predict-no for direction R in state State-B
  4916. In State-B moving R
  4917. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4918. predict error 0
  4919. dir: dir isU
  4920. -/|697: O: O1394 (predict-no)
  4921. I see 1 and I'm going to do: predict-no
  4922. ENV: Agent did: predict-no for direction U in state State-B
  4923. In State-B moving U
  4924. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4925. predict error 0
  4926. dir: dir isU
  4927. \-/698: O: O1396 (predict-no)
  4928. I see 1 and I'm going to do: predict-no
  4929. ENV: Agent did: predict-no for direction U in state State-B
  4930. In State-B moving U
  4931. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4932. predict error 0
  4933. dir: dir isR
  4934. |\-/699: O: O1398 (predict-no)
  4935. I see 1 and I'm going to do: predict-no
  4936. ENV: Agent did: predict-no for direction R in state State-B
  4937. In State-B moving R
  4938. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4939. predict error 0
  4940. dir: dir isR
  4941. |\-700: O: O1400 (predict-no)
  4942. I see 1 and I'm going to do: predict-no
  4943. ENV: Agent did: predict-no for direction R in state State-B
  4944. In State-B moving R
  4945. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4946. predict error 0
  4947. dir: dir isL
  4948. /|701: O: O1401 (predict-yes)
  4949. I see 1 and I'm going to do: predict-yes
  4950. ENV: Agent did: predict-yes for direction L in state State-B
  4951. In State-B moving L
  4952. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4953. predict error 0
  4954. dir: dir isR
  4955. \702: O: O1403 (predict-yes)
  4956. I see 1 and I'm going to do: predict-yes
  4957. ENV: Agent did: predict-yes for direction R in state State-A
  4958. In State-A moving R
  4959. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4960. predict error 0
  4961. dir: dir isU
  4962. -/|703: O: O1406 (predict-no)
  4963. I see 1 and I'm going to do: predict-no
  4964. ENV: Agent did: predict-no for direction U in state State-B
  4965. In State-B moving U
  4966. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4967. predict error 0
  4968. dir: dir isL
  4969. \-/704: O: O1407 (predict-yes)
  4970. I see 1 and I'm going to do: predict-yes
  4971. ENV: Agent did: predict-yes for direction L in state State-B
  4972. In State-B moving L
  4973. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4974. predict error 0
  4975. dir: dir isR
  4976. |\705: O: O1409 (predict-yes)
  4977. I see 1 and I'm going to do: predict-yes
  4978. ENV: Agent did: predict-yes for direction R in state State-A
  4979. In State-A moving R
  4980. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4981. predict error 0
  4982. dir: dir isL
  4983. -/706: O: O1411 (predict-yes)
  4984. I see 1 and I'm going to do: predict-yes
  4985. ENV: Agent did: predict-yes for direction L in state State-B
  4986. In State-B moving L
  4987. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4988. predict error 0
  4989. dir: dir isL
  4990. |\-707: O: O1414 (predict-no)
  4991. I see 1 and I'm going to do: predict-no
  4992. ENV: Agent did: predict-no for direction L in state State-A
  4993. In State-A moving L
  4994. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4995. predict error 0
  4996. dir: dir isR
  4997. /|708: O: O1415 (predict-yes)
  4998. I see 1 and I'm going to do: predict-yes
  4999. ENV: Agent did: predict-yes for direction R in state State-A
  5000. In State-A moving R
  5001. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5002. predict error 0
  5003. dir: dir isL
  5004. \-/709: O: O1417 (predict-yes)
  5005. I see 1 and I'm going to do: predict-yes
  5006. ENV: Agent did: predict-yes for direction L in state State-B
  5007. In State-B moving L
  5008. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5009. predict error 0
  5010. dir: dir isL
  5011. |\-710: O: O1420 (predict-no)
  5012. I see 1 and I'm going to do: predict-no
  5013. ENV: Agent did: predict-no for direction L in state State-A
  5014. In State-A moving L
  5015. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5016. predict error 0
  5017. dir: dir isL
  5018. /|\711: O: O1422 (predict-no)
  5019. I see 1 and I'm going to do: predict-no
  5020. ENV: Agent did: predict-no for direction L in state State-A
  5021. In State-A moving L
  5022. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5023. predict error 0
  5024. dir: dir isR
  5025. -712: O: O1423 (predict-yes)
  5026. I see 1 and I'm going to do: predict-yes
  5027. ENV: Agent did: predict-yes for direction R in state State-A
  5028. In State-A moving R
  5029. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5030. predict error 0
  5031. dir: dir isR
  5032. /|713: O: O1426 (predict-no)
  5033. I see 1 and I'm going to do: predict-no
  5034. ENV: Agent did: predict-no for direction R in state State-B
  5035. In State-B moving R
  5036. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5037. predict error 0
  5038. dir: dir isL
  5039. \-/714: O: O1427 (predict-yes)
  5040. I see 1 and I'm going to do: predict-yes
  5041. ENV: Agent did: predict-yes for direction L in state State-B
  5042. In State-B moving L
  5043. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5044. predict error 0
  5045. dir: dir isL
  5046. |\-715: O: O1430 (predict-no)
  5047. I see 1 and I'm going to do: predict-no
  5048. ENV: Agent did: predict-no for direction L in state State-A
  5049. In State-A moving L
  5050. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5051. predict error 0
  5052. dir: dir isU
  5053. /|716: O: O1432 (predict-no)
  5054. I see 1 and I'm going to do: predict-no
  5055. ENV: Agent did: predict-no for direction U in state State-A
  5056. In State-A moving U
  5057. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5058. predict error 0
  5059. dir: dir isL
  5060. \-717: O: O1434 (predict-no)
  5061. I see 1 and I'm going to do: predict-no
  5062. ENV: Agent did: predict-no for direction L in state State-A
  5063. In State-A moving L
  5064. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5065. predict error 0
  5066. dir: dir isR
  5067. /|\-718: O: O1435 (predict-yes)
  5068. I see 1 and I'm going to do: predict-yes
  5069. ENV: Agent did: predict-yes for direction R in state State-A
  5070. In State-A moving R
  5071. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5072. predict error 0
  5073. dir: dir isR
  5074. /|\719: O: O1438 (predict-no)
  5075. I see 1 and I'm going to do: predict-no
  5076. ENV: Agent did: predict-no for direction R in state State-B
  5077. In State-B moving R
  5078. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5079. predict error 0
  5080. dir: dir isU
  5081. -/720: O: O1440 (predict-no)
  5082. I see 1 and I'm going to do: predict-no
  5083. ENV: Agent did: predict-no for direction U in state State-B
  5084. In State-B moving U
  5085. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5086. predict error 0
  5087. dir: dir isR
  5088. |\-721: O: O1442 (predict-no)
  5089. I see 1 and I'm going to do: predict-no
  5090. ENV: Agent did: predict-no for direction R in state State-B
  5091. In State-B moving R
  5092. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5093. predict error 0
  5094. dir: dir isU
  5095. /722: O: O1444 (predict-no)
  5096. I see 1 and I'm going to do: predict-no
  5097. ENV: Agent did: predict-no for direction U in state State-B
  5098. In State-B moving U
  5099. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5100. predict error 0
  5101. dir: dir isR
  5102. |\723: O: O1446 (predict-no)
  5103. I see 1 and I'm going to do: predict-no
  5104. ENV: Agent did: predict-no for direction R in state State-B
  5105. In State-B moving R
  5106. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5107. predict error 0
  5108. dir: dir isL
  5109. -/724: O: O1447 (predict-yes)
  5110. I see 1 and I'm going to do: predict-yes
  5111. ENV: Agent did: predict-yes for direction L in state State-B
  5112. In State-B moving L
  5113. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5114. predict error 0
  5115. dir: dir isL
  5116. |\-725: O: O1450 (predict-no)
  5117. I see 1 and I'm going to do: predict-no
  5118. ENV: Agent did: predict-no for direction L in state State-A
  5119. In State-A moving L
  5120. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5121. predict error 0
  5122. dir: dir isL
  5123. /|726: O: O1452 (predict-no)
  5124. I see 1 and I'm going to do: predict-no
  5125. ENV: Agent did: predict-no for direction L in state State-A
  5126. In State-A moving L
  5127. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5128. predict error 0
  5129. dir: dir isU
  5130. \-/727: O: O1454 (predict-no)
  5131. I see 1 and I'm going to do: predict-no
  5132. ENV: Agent did: predict-no for direction U in state State-A
  5133. In State-A moving U
  5134. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5135. predict error 0
  5136. dir: dir isL
  5137. |\-/728: O: O1456 (predict-no)
  5138. I see 1 and I'm going to do: predict-no
  5139. ENV: Agent did: predict-no for direction L in state State-A
  5140. In State-A moving L
  5141. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5142. predict error 0
  5143. dir: dir isL
  5144. |\729: O: O1458 (predict-no)
  5145. I see 1 and I'm going to do: predict-no
  5146. ENV: Agent did: predict-no for direction L in state State-A
  5147. In State-A moving L
  5148. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5149. predict error 0
  5150. dir: dir isU
  5151. -/|730: O: O1460 (predict-no)
  5152. I see 1 and I'm going to do: predict-no
  5153. ENV: Agent did: predict-no for direction U in state State-A
  5154. In State-A moving U
  5155. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5156. predict error 0
  5157. dir: dir isL
  5158. \-731: O: O1462 (predict-no)
  5159. I see 1 and I'm going to do: predict-no
  5160. ENV: Agent did: predict-no for direction L in state State-A
  5161. In State-A moving L
  5162. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5163. predict error 0
  5164. dir: dir isU
  5165. /732: O: O1464 (predict-no)
  5166. I see 1 and I'm going to do: predict-no
  5167. ENV: Agent did: predict-no for direction U in state State-A
  5168. In State-A moving U
  5169. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5170. predict error 0
  5171. dir: dir isL
  5172. |\733: O: O1466 (predict-no)
  5173. I see 1 and I'm going to do: predict-no
  5174. ENV: Agent did: predict-no for direction L in state State-A
  5175. In State-A moving L
  5176. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5177. predict error 0
  5178. dir: dir isR
  5179. -/734: O: O1467 (predict-yes)
  5180. I see 1 and I'm going to do: predict-yes
  5181. ENV: Agent did: predict-yes for direction R in state State-A
  5182. In State-A moving R
  5183. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5184. predict error 0
  5185. dir: dir isL
  5186. |\-735: O: O1469 (predict-yes)
  5187. I see 1 and I'm going to do: predict-yes
  5188. ENV: Agent did: predict-yes for direction L in state State-B
  5189. In State-B moving L
  5190. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5191. predict error 0
  5192. dir: dir isU
  5193. /|\736: O: O1472 (predict-no)
  5194. I see 1 and I'm going to do: predict-no
  5195. ENV: Agent did: predict-no for direction U in state State-A
  5196. In State-A moving U
  5197. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5198. predict error 0
  5199. dir: dir isR
  5200. -737: O: O1473 (predict-yes)
  5201. I see 1 and I'm going to do: predict-yes
  5202. ENV: Agent did: predict-yes for direction R in state State-A
  5203. In State-A moving R
  5204. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5205. predict error 0
  5206. dir: dir isR
  5207. /|\738: O: O1476 (predict-no)
  5208. I see 1 and I'm going to do: predict-no
  5209. ENV: Agent did: predict-no for direction R in state State-B
  5210. In State-B moving R
  5211. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5212. predict error 0
  5213. dir: dir isU
  5214. -/|739: O: O1478 (predict-no)
  5215. I see 1 and I'm going to do: predict-no
  5216. ENV: Agent did: predict-no for direction U in state State-B
  5217. In State-B moving U
  5218. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5219. predict error 0
  5220. dir: dir isU
  5221. \-740: O: O1480 (predict-no)
  5222. I see 1 and I'm going to do: predict-no
  5223. ENV: Agent did: predict-no for direction U in state State-B
  5224. In State-B moving U
  5225. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5226. predict error 0
  5227. dir: dir isR
  5228. /|741: O: O1482 (predict-no)
  5229. I see 1 and I'm going to do: predict-no
  5230. ENV: Agent did: predict-no for direction R in state State-B
  5231. In State-B moving R
  5232. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5233. predict error 0
  5234. dir: dir isR
  5235. \742: O: O1484 (predict-no)
  5236. I see 1 and I'm going to do: predict-no
  5237. ENV: Agent did: predict-no for direction R in state State-B
  5238. In State-B moving R
  5239. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5240. predict error 0
  5241. dir: dir isR
  5242. -/743: O: O1486 (predict-no)
  5243. I see 1 and I'm going to do: predict-no
  5244. ENV: Agent did: predict-no for direction R in state State-B
  5245. In State-B moving R
  5246. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5247. predict error 0
  5248. dir: dir isL
  5249. |\744: O: O1487 (predict-yes)
  5250. I see 1 and I'm going to do: predict-yes
  5251. ENV: Agent did: predict-yes for direction L in state State-B
  5252. In State-B moving L
  5253. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5254. predict error 0
  5255. dir: dir isU
  5256. -/|745: O: O1490 (predict-no)
  5257. I see 1 and I'm going to do: predict-no
  5258. ENV: Agent did: predict-no for direction U in state State-A
  5259. In State-A moving U
  5260. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5261. predict error 0
  5262. dir: dir isR
  5263. \-746: O: O1491 (predict-yes)
  5264. I see 1 and I'm going to do: predict-yes
  5265. ENV: Agent did: predict-yes for direction R in state State-A
  5266. In State-A moving R
  5267. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5268. predict error 0
  5269. dir: dir isU
  5270. /|\747: O: O1494 (predict-no)
  5271. I see 1 and I'm going to do: predict-no
  5272. ENV: Agent did: predict-no for direction U in state State-B
  5273. In State-B moving U
  5274. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5275. predict error 0
  5276. dir: dir isL
  5277. -/|748: O: O1495 (predict-yes)
  5278. I see 1 and I'm going to do: predict-yes
  5279. ENV: Agent did: predict-yes for direction L in state State-B
  5280. In State-B moving L
  5281. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5282. predict error 0
  5283. dir: dir isR
  5284. \-/749: O: O1497 (predict-yes)
  5285. I see 1 and I'm going to do: predict-yes
  5286. ENV: Agent did: predict-yes for direction R in state State-A
  5287. In State-A moving R
  5288. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5289. predict error 0
  5290. dir: dir isR
  5291. |\-750: O: O1500 (predict-no)
  5292. I see 1 and I'm going to do: predict-no
  5293. ENV: Agent did: predict-no for direction R in state State-B
  5294. In State-B moving R
  5295. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5296. predict error 0
  5297. dir: dir isU
  5298. /|\751: O: O1502 (predict-no)
  5299. I see 1 and I'm going to do: predict-no
  5300. ENV: Agent did: predict-no for direction U in state State-B
  5301. In State-B moving U
  5302. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5303. predict error 0
  5304. dir: dir isL
  5305. -752: O: O1503 (predict-yes)
  5306. I see 1 and I'm going to do: predict-yes
  5307. ENV: Agent did: predict-yes for direction L in state State-B
  5308. In State-B moving L
  5309. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5310. predict error 0
  5311. dir: dir isR
  5312. /|\753: O: O1505 (predict-yes)
  5313. I see 1 and I'm going to do: predict-yes
  5314. ENV: Agent did: predict-yes for direction R in state State-A
  5315. In State-A moving R
  5316. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5317. predict error 0
  5318. dir: dir isR
  5319. -/|754: O: O1508 (predict-no)
  5320. I see 1 and I'm going to do: predict-no
  5321. ENV: Agent did: predict-no for direction R in state State-B
  5322. In State-B moving R
  5323. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5324. predict error 0
  5325. dir: dir isL
  5326. \-755: O: O1509 (predict-yes)
  5327. I see 1 and I'm going to do: predict-yes
  5328. ENV: Agent did: predict-yes for direction L in state State-B
  5329. In State-B moving L
  5330. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5331. predict error 0
  5332. dir: dir isR
  5333. /|\756: O: O1511 (predict-yes)
  5334. I see 1 and I'm going to do: predict-yes
  5335. ENV: Agent did: predict-yes for direction R in state State-A
  5336. In State-A moving R
  5337. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5338. predict error 0
  5339. dir: dir isU
  5340. -/|757: O: O1514 (predict-no)
  5341. I see 1 and I'm going to do: predict-no
  5342. ENV: Agent did: predict-no for direction U in state State-B
  5343. In State-B moving U
  5344. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5345. predict error 0
  5346. dir: dir isR
  5347. \-/|758: O: O1516 (predict-no)
  5348. I see 1 and I'm going to do: predict-no
  5349. ENV: Agent did: predict-no for direction R in state State-B
  5350. In State-B moving R
  5351. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5352. predict error 0
  5353. dir: dir isR
  5354. \-/759: O: O1518 (predict-no)
  5355. I see 1 and I'm going to do: predict-no
  5356. ENV: Agent did: predict-no for direction R in state State-B
  5357. In State-B moving R
  5358. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5359. predict error 0
  5360. dir: dir isR
  5361. |\760: O: O1520 (predict-no)
  5362. I see 1 and I'm going to do: predict-no
  5363. ENV: Agent did: predict-no for direction R in state State-B
  5364. In State-B moving R
  5365. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5366. predict error 0
  5367. dir: dir isL
  5368. -/|761: O: O1521 (predict-yes)
  5369. I see 1 and I'm going to do: predict-yes
  5370. ENV: Agent did: predict-yes for direction L in state State-B
  5371. In State-B moving L
  5372. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5373. predict error 0
  5374. dir: dir isR
  5375. \762: O: O1523 (predict-yes)
  5376. I see 1 and I'm going to do: predict-yes
  5377. ENV: Agent did: predict-yes for direction R in state State-A
  5378. In State-A moving R
  5379. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5380. predict error 0
  5381. dir: dir isU
  5382. -/|\763: O: O1526 (predict-no)
  5383. I see 1 and I'm going to do: predict-no
  5384. ENV: Agent did: predict-no for direction U in state State-B
  5385. In State-B moving U
  5386. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5387. predict error 0
  5388. dir: dir isR
  5389. -/|764: O: O1528 (predict-no)
  5390. I see 1 and I'm going to do: predict-no
  5391. ENV: Agent did: predict-no for direction R in state State-B
  5392. In State-B moving R
  5393. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5394. predict error 0
  5395. dir: dir isR
  5396. \-/765: O: O1530 (predict-no)
  5397. I see 1 and I'm going to do: predict-no
  5398. ENV: Agent did: predict-no for direction R in state State-B
  5399. In State-B moving R
  5400. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5401. predict error 0
  5402. dir: dir isL
  5403. |\-766: O: O1531 (predict-yes)
  5404. I see 1 and I'm going to do: predict-yes
  5405. ENV: Agent did: predict-yes for direction L in state State-B
  5406. In State-B moving L
  5407. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5408. predict error 0
  5409. dir: dir isL
  5410. /|767: O: O1534 (predict-no)
  5411. I see 1 and I'm going to do: predict-no
  5412. ENV: Agent did: predict-no for direction L in state State-A
  5413. In State-A moving L
  5414. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5415. predict error 0
  5416. dir: dir isL
  5417. \-/768: O: O1536 (predict-no)
  5418. I see 1 and I'm going to do: predict-no
  5419. ENV: Agent did: predict-no for direction L in state State-A
  5420. In State-A moving L
  5421. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5422. predict error 0
  5423. dir: dir isU
  5424. |\769: O: O1538 (predict-no)
  5425. I see 1 and I'm going to do: predict-no
  5426. ENV: Agent did: predict-no for direction U in state State-A
  5427. In State-A moving U
  5428. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5429. predict error 0
  5430. dir: dir isU
  5431. -/770: O: O1540 (predict-no)
  5432. I see 1 and I'm going to do: predict-no
  5433. ENV: Agent did: predict-no for direction U in state State-A
  5434. In State-A moving U
  5435. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5436. predict error 0
  5437. dir: dir isU
  5438. |\-771: O: O1542 (predict-no)
  5439. I see 1 and I'm going to do: predict-no
  5440. ENV: Agent did: predict-no for direction U in state State-A
  5441. In State-A moving U
  5442. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5443. predict error 0
  5444. dir: dir isU
  5445. /772: O: O1544 (predict-no)
  5446. I see 1 and I'm going to do: predict-no
  5447. ENV: Agent did: predict-no for direction U in state State-A
  5448. In State-A moving U
  5449. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5450. predict error 0
  5451. dir: dir isR
  5452. |\-773: O: O1545 (predict-yes)
  5453. I see 1 and I'm going to do: predict-yes
  5454. ENV: Agent did: predict-yes for direction R in state State-A
  5455. In State-A moving R
  5456. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5457. predict error 0
  5458. dir: dir isU
  5459. /|\774: O: O1548 (predict-no)
  5460. I see 1 and I'm going to do: predict-no
  5461. ENV: Agent did: predict-no for direction U in state State-B
  5462. In State-B moving U
  5463. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5464. predict error 0
  5465. dir: dir isR
  5466. -/|775: O: O1550 (predict-no)
  5467. I see 1 and I'm going to do: predict-no
  5468. ENV: Agent did: predict-no for direction R in state State-B
  5469. In State-B moving R
  5470. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5471. predict error 0
  5472. dir: dir isR
  5473. \-/776: O: O1552 (predict-no)
  5474. I see 1 and I'm going to do: predict-no
  5475. ENV: Agent did: predict-no for direction R in state State-B
  5476. In State-B moving R
  5477. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5478. predict error 0
  5479. dir: dir isU
  5480. |\-777: O: O1554 (predict-no)
  5481. I see 1 and I'm going to do: predict-no
  5482. ENV: Agent did: predict-no for direction U in state State-B
  5483. In State-B moving U
  5484. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5485. predict error 0
  5486. dir: dir isU
  5487. /|\778: O: O1556 (predict-no)
  5488. I see 1 and I'm going to do: predict-no
  5489. ENV: Agent did: predict-no for direction U in state State-B
  5490. In State-B moving U
  5491. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5492. predict error 0
  5493. dir: dir isU
  5494. -/779: O: O1558 (predict-no)
  5495. I see 1 and I'm going to do: predict-no
  5496. ENV: Agent did: predict-no for direction U in state State-B
  5497. In State-B moving U
  5498. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5499. predict error 0
  5500. dir: dir isR
  5501. |\780: O: O1560 (predict-no)
  5502. I see 1 and I'm going to do: predict-no
  5503. ENV: Agent did: predict-no for direction R in state State-B
  5504. In State-B moving R
  5505. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5506. predict error 0
  5507. dir: dir isU
  5508. -/|\781: O: O1562 (predict-no)
  5509. I see 1 and I'm going to do: predict-no
  5510. ENV: Agent did: predict-no for direction U in state State-B
  5511. In State-B moving U
  5512. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5513. predict error 0
  5514. dir: dir isR
  5515. -782: O: O1564 (predict-no)
  5516. I see 1 and I'm going to do: predict-no
  5517. ENV: Agent did: predict-no for direction R in state State-B
  5518. In State-B moving R
  5519. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5520. predict error 0
  5521. dir: dir isU
  5522. /783: O: O1566 (predict-no)
  5523. I see 1 and I'm going to do: predict-no
  5524. ENV: Agent did: predict-no for direction U in state State-B
  5525. In State-B moving U
  5526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5527. predict error 0
  5528. dir: dir isU
  5529. |\-784: O: O1568 (predict-no)
  5530. I see 1 and I'm going to do: predict-no
  5531. ENV: Agent did: predict-no for direction U in state State-B
  5532. In State-B moving U
  5533. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5534. predict error 0
  5535. dir: dir isL
  5536. /|\785: O: O1569 (predict-yes)
  5537. I see 1 and I'm going to do: predict-yes
  5538. ENV: Agent did: predict-yes for direction L in state State-B
  5539. In State-B moving L
  5540. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5541. predict error 0
  5542. dir: dir isU
  5543. -/|\786: O: O1572 (predict-no)
  5544. I see 1 and I'm going to do: predict-no
  5545. ENV: Agent did: predict-no for direction U in state State-A
  5546. In State-A moving U
  5547. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5548. predict error 0
  5549. dir: dir isL
  5550. -/|787: O: O1574 (predict-no)
  5551. I see 1 and I'm going to do: predict-no
  5552. ENV: Agent did: predict-no for direction L in state State-A
  5553. In State-A moving L
  5554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5555. predict error 0
  5556. dir: dir isL
  5557. \-788: O: O1576 (predict-no)
  5558. I see 1 and I'm going to do: predict-no
  5559. ENV: Agent did: predict-no for direction L in state State-A
  5560. In State-A moving L
  5561. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5562. predict error 0
  5563. dir: dir isU
  5564. /|\789: O: O1578 (predict-no)
  5565. I see 1 and I'm going to do: predict-no
  5566. ENV: Agent did: predict-no for direction U in state State-A
  5567. In State-A moving U
  5568. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5569. predict error 0
  5570. dir: dir isL
  5571. -/|790: O: O1580 (predict-no)
  5572. I see 1 and I'm going to do: predict-no
  5573. ENV: Agent did: predict-no for direction L in state State-A
  5574. In State-A moving L
  5575. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5576. predict error 0
  5577. dir: dir isR
  5578. \-791: O: O1581 (predict-yes)
  5579. I see 1 and I'm going to do: predict-yes
  5580. ENV: Agent did: predict-yes for direction R in state State-A
  5581. In State-A moving R
  5582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5583. predict error 0
  5584. dir: dir isR
  5585. /792: O: O1584 (predict-no)
  5586. I see 1 and I'm going to do: predict-no
  5587. ENV: Agent did: predict-no for direction R in state State-B
  5588. In State-B moving R
  5589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5590. predict error 0
  5591. dir: dir isL
  5592. |\793: O: O1585 (predict-yes)
  5593. I see 1 and I'm going to do: predict-yes
  5594. ENV: Agent did: predict-yes for direction L in state State-B
  5595. In State-B moving L
  5596. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5597. predict error 0
  5598. dir: dir isU
  5599. -/|794: O: O1588 (predict-no)
  5600. I see 1 and I'm going to do: predict-no
  5601. ENV: Agent did: predict-no for direction U in state State-A
  5602. In State-A moving U
  5603. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5604. predict error 0
  5605. dir: dir isL
  5606. \-/795: O: O1590 (predict-no)
  5607. I see 1 and I'm going to do: predict-no
  5608. ENV: Agent did: predict-no for direction L in state State-A
  5609. In State-A moving L
  5610. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5611. predict error 0
  5612. dir: dir isR
  5613. |\796: O: O1591 (predict-yes)
  5614. I see 1 and I'm going to do: predict-yes
  5615. ENV: Agent did: predict-yes for direction R in state State-A
  5616. In State-A moving R
  5617. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5618. predict error 0
  5619. dir: dir isR
  5620. -/|797: O: O1594 (predict-no)
  5621. I see 1 and I'm going to do: predict-no
  5622. ENV: Agent did: predict-no for direction R in state State-B
  5623. In State-B moving R
  5624. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5625. predict error 0
  5626. dir: dir isU
  5627. \-/798: O: O1596 (predict-no)
  5628. I see 1 and I'm going to do: predict-no
  5629. ENV: Agent did: predict-no for direction U in state State-B
  5630. In State-B moving U
  5631. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5632. predict error 0
  5633. dir: dir isU
  5634. |\799: O: O1598 (predict-no)
  5635. I see 1 and I'm going to do: predict-no
  5636. ENV: Agent did: predict-no for direction U in state State-B
  5637. In State-B moving U
  5638. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5639. predict error 0
  5640. dir: dir isL
  5641. -/|800: O: O1599 (predict-yes)
  5642. I see 1 and I'm going to do: predict-yes
  5643. ENV: Agent did: predict-yes for direction L in state State-B
  5644. In State-B moving L
  5645. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5646. predict error 0
  5647. dir: dir isL
  5648. \-/801: O: O1602 (predict-no)
  5649. I see 1 and I'm going to do: predict-no
  5650. ENV: Agent did: predict-no for direction L in state State-A
  5651. In State-A moving L
  5652. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5653. predict error 0
  5654. dir: dir isL
  5655. |802: O: O1604 (predict-no)
  5656. I see 1 and I'm going to do: predict-no
  5657. ENV: Agent did: predict-no for direction L in state State-A
  5658. In State-A moving L
  5659. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5660. predict error 0
  5661. dir: dir isR
  5662. \-/803: O: O1605 (predict-yes)
  5663. I see 1 and I'm going to do: predict-yes
  5664. ENV: Agent did: predict-yes for direction R in state State-A
  5665. In State-A moving R
  5666. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5667. predict error 0
  5668. dir: dir isL
  5669. |\-804: O: O1607 (predict-yes)
  5670. I see 1 and I'm going to do: predict-yes
  5671. ENV: Agent did: predict-yes for direction L in state State-B
  5672. In State-B moving L
  5673. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5674. predict error 0
  5675. dir: dir isL
  5676. /|\805: O: O1610 (predict-no)
  5677. I see 1 and I'm going to do: predict-no
  5678. ENV: Agent did: predict-no for direction L in state State-A
  5679. In State-A moving L
  5680. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5681. predict error 0
  5682. dir: dir isU
  5683. -/806: O: O1612 (predict-no)
  5684. I see 1 and I'm going to do: predict-no
  5685. ENV: Agent did: predict-no for direction U in state State-A
  5686. In State-A moving U
  5687. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5688. predict error 0
  5689. dir: dir isU
  5690. |\-807: O: O1614 (predict-no)
  5691. I see 1 and I'm going to do: predict-no
  5692. ENV: Agent did: predict-no for direction U in state State-A
  5693. In State-A moving U
  5694. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5695. predict error 0
  5696. dir: dir isU
  5697. /|\808: O: O1616 (predict-no)
  5698. I see 1 and I'm going to do: predict-no
  5699. ENV: Agent did: predict-no for direction U in state State-A
  5700. In State-A moving U
  5701. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5702. predict error 0
  5703. dir: dir isU
  5704. -/|809: O: O1618 (predict-no)
  5705. I see 1 and I'm going to do: predict-no
  5706. ENV: Agent did: predict-no for direction U in state State-A
  5707. In State-A moving U
  5708. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5709. predict error 0
  5710. dir: dir isU
  5711. \-/810: O: O1620 (predict-no)
  5712. I see 1 and I'm going to do: predict-no
  5713. ENV: Agent did: predict-no for direction U in state State-A
  5714. In State-A moving U
  5715. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5716. predict error 0
  5717. dir: dir isR
  5718. |\811: O: O1621 (predict-yes)
  5719. I see 1 and I'm going to do: predict-yes
  5720. ENV: Agent did: predict-yes for direction R in state State-A
  5721. In State-A moving R
  5722. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5723. predict error 0
  5724. dir: dir isL
  5725. -812: O: O1623 (predict-yes)
  5726. I see 1 and I'm going to do: predict-yes
  5727. ENV: Agent did: predict-yes for direction L in state State-B
  5728. In State-B moving L
  5729. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5730. predict error 0
  5731. dir: dir isL
  5732. /|\813: O: O1626 (predict-no)
  5733. I see 1 and I'm going to do: predict-no
  5734. ENV: Agent did: predict-no for direction L in state State-A
  5735. In State-A moving L
  5736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5737. predict error 0
  5738. dir: dir isR
  5739. -/|814: O: O1627 (predict-yes)
  5740. I see 1 and I'm going to do: predict-yes
  5741. ENV: Agent did: predict-yes for direction R in state State-A
  5742. In State-A moving R
  5743. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5744. predict error 0
  5745. dir: dir isR
  5746. \-815: O: O1630 (predict-no)
  5747. I see 1 and I'm going to do: predict-no
  5748. ENV: Agent did: predict-no for direction R in state State-B
  5749. In State-B moving R
  5750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5751. predict error 0
  5752. dir: dir isR
  5753. /|\816: O: O1632 (predict-no)
  5754. I see 1 and I'm going to do: predict-no
  5755. ENV: Agent did: predict-no for direction R in state State-B
  5756. In State-B moving R
  5757. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5758. predict error 0
  5759. dir: dir isR
  5760. -/817: O: O1634 (predict-no)
  5761. I see 1 and I'm going to do: predict-no
  5762. ENV: Agent did: predict-no for direction R in state State-B
  5763. In State-B moving R
  5764. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5765. predict error 0
  5766. dir: dir isU
  5767. |\-818: O: O1636 (predict-no)
  5768. I see 1 and I'm going to do: predict-no
  5769. ENV: Agent did: predict-no for direction U in state State-B
  5770. In State-B moving U
  5771. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5772. predict error 0
  5773. dir: dir isL
  5774. /|819: O: O1637 (predict-yes)
  5775. I see 1 and I'm going to do: predict-yes
  5776. ENV: Agent did: predict-yes for direction L in state State-B
  5777. In State-B moving L
  5778. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5779. predict error 0
  5780. dir: dir isR
  5781. \820: O: O1639 (predict-yes)
  5782. I see 1 and I'm going to do: predict-yes
  5783. ENV: Agent did: predict-yes for direction R in state State-A
  5784. In State-A moving R
  5785. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5786. predict error 0
  5787. dir: dir isU
  5788. -/|821: O: O1642 (predict-no)
  5789. I see 1 and I'm going to do: predict-no
  5790. ENV: Agent did: predict-no for direction U in state State-B
  5791. In State-B moving U
  5792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5793. predict error 0
  5794. dir: dir isU
  5795. \822: O: O1644 (predict-no)
  5796. I see 1 and I'm going to do: predict-no
  5797. ENV: Agent did: predict-no for direction U in state State-B
  5798. In State-B moving U
  5799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5800. predict error 0
  5801. dir: dir isL
  5802. -/|823: O: O1645 (predict-yes)
  5803. I see 1 and I'm going to do: predict-yes
  5804. ENV: Agent did: predict-yes for direction L in state State-B
  5805. In State-B moving L
  5806. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5807. predict error 0
  5808. dir: dir isL
  5809. \-824: O: O1648 (predict-no)
  5810. I see 1 and I'm going to do: predict-no
  5811. ENV: Agent did: predict-no for direction L in state State-A
  5812. In State-A moving L
  5813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5814. predict error 0
  5815. dir: dir isU
  5816. /|\-825: O: O1650 (predict-no)
  5817. I see 1 and I'm going to do: predict-no
  5818. ENV: Agent did: predict-no for direction U in state State-A
  5819. In State-A moving U
  5820. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5821. predict error 0
  5822. dir: dir isL
  5823. /|826: O: O1652 (predict-no)
  5824. I see 1 and I'm going to do: predict-no
  5825. ENV: Agent did: predict-no for direction L in state State-A
  5826. In State-A moving L
  5827. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5828. predict error 0
  5829. dir: dir isL
  5830. \-/827: O: O1654 (predict-no)
  5831. I see 1 and I'm going to do: predict-no
  5832. ENV: Agent did: predict-no for direction L in state State-A
  5833. In State-A moving L
  5834. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5835. predict error 0
  5836. dir: dir isR
  5837. |\-828: O: O1655 (predict-yes)
  5838. I see 1 and I'm going to do: predict-yes
  5839. ENV: Agent did: predict-yes for direction R in state State-A
  5840. In State-A moving R
  5841. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5842. predict error 0
  5843. dir: dir isR
  5844. /|829: O: O1658 (predict-no)
  5845. I see 1 and I'm going to do: predict-no
  5846. ENV: Agent did: predict-no for direction R in state State-B
  5847. In State-B moving R
  5848. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5849. predict error 0
  5850. dir: dir isL
  5851. \-/830: O: O1659 (predict-yes)
  5852. I see 1 and I'm going to do: predict-yes
  5853. ENV: Agent did: predict-yes for direction L in state State-B
  5854. In State-B moving L
  5855. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5856. predict error 0
  5857. dir: dir isL
  5858. |\-831: O: O1662 (predict-no)
  5859. I see 1 and I'm going to do: predict-no
  5860. ENV: Agent did: predict-no for direction L in state State-A
  5861. In State-A moving L
  5862. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5863. predict error 0
  5864. dir: dir isL
  5865. /832: O: O1664 (predict-no)
  5866. I see 1 and I'm going to do: predict-no
  5867. ENV: Agent did: predict-no for direction L in state State-A
  5868. In State-A moving L
  5869. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5870. predict error 0
  5871. dir: dir isU
  5872. |833: O: O1666 (predict-no)
  5873. I see 1 and I'm going to do: predict-no
  5874. ENV: Agent did: predict-no for direction U in state State-A
  5875. In State-A moving U
  5876. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5877. predict error 0
  5878. dir: dir isR
  5879. \-/834: O: O1667 (predict-yes)
  5880. I see 1 and I'm going to do: predict-yes
  5881. ENV: Agent did: predict-yes for direction R in state State-A
  5882. In State-A moving R
  5883. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5884. predict error 0
  5885. dir: dir isU
  5886. |\-835: O: O1670 (predict-no)
  5887. I see 1 and I'm going to do: predict-no
  5888. ENV: Agent did: predict-no for direction U in state State-B
  5889. In State-B moving U
  5890. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5891. predict error 0
  5892. dir: dir isU
  5893. /|\836: O: O1672 (predict-no)
  5894. I see 1 and I'm going to do: predict-no
  5895. ENV: Agent did: predict-no for direction U in state State-B
  5896. In State-B moving U
  5897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5898. predict error 0
  5899. dir: dir isU
  5900. -/|837: O: O1674 (predict-no)
  5901. I see 1 and I'm going to do: predict-no
  5902. ENV: Agent did: predict-no for direction U in state State-B
  5903. In State-B moving U
  5904. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5905. predict error 0
  5906. dir: dir isL
  5907. \-838: O: O1675 (predict-yes)
  5908. I see 1 and I'm going to do: predict-yes
  5909. ENV: Agent did: predict-yes for direction L in state State-B
  5910. In State-B moving L
  5911. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5912. predict error 0
  5913. dir: dir isL
  5914. /|\839: O: O1678 (predict-no)
  5915. I see 1 and I'm going to do: predict-no
  5916. ENV: Agent did: predict-no for direction L in state State-A
  5917. In State-A moving L
  5918. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5919. predict error 0
  5920. dir: dir isL
  5921. -/|\840: O: O1680 (predict-no)
  5922. I see 1 and I'm going to do: predict-no
  5923. ENV: Agent did: predict-no for direction L in state State-A
  5924. In State-A moving L
  5925. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5926. predict error 0
  5927. dir: dir isR
  5928. -/|841: O: O1681 (predict-yes)
  5929. I see 1 and I'm going to do: predict-yes
  5930. ENV: Agent did: predict-yes for direction R in state State-A
  5931. In State-A moving R
  5932. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5933. predict error 0
  5934. dir: dir isR
  5935. \842: O: O1684 (predict-no)
  5936. I see 1 and I'm going to do: predict-no
  5937. ENV: Agent did: predict-no for direction R in state State-B
  5938. In State-B moving R
  5939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5940. predict error 0
  5941. dir: dir isL
  5942. -/843: O: O1685 (predict-yes)
  5943. I see 1 and I'm going to do: predict-yes
  5944. ENV: Agent did: predict-yes for direction L in state State-B
  5945. In State-B moving L
  5946. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5947. predict error 0
  5948. dir: dir isR
  5949. |\-844: O: O1687 (predict-yes)
  5950. I see 1 and I'm going to do: predict-yes
  5951. ENV: Agent did: predict-yes for direction R in state State-A
  5952. In State-A moving R
  5953. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5954. predict error 0
  5955. dir: dir isL
  5956. /|\845: O: O1689 (predict-yes)
  5957. I see 1 and I'm going to do: predict-yes
  5958. ENV: Agent did: predict-yes for direction L in state State-B
  5959. In State-B moving L
  5960. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5961. predict error 0
  5962. dir: dir isL
  5963. -/|846: O: O1692 (predict-no)
  5964. I see 1 and I'm going to do: predict-no
  5965. ENV: Agent did: predict-no for direction L in state State-A
  5966. In State-A moving L
  5967. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5968. predict error 0
  5969. dir: dir isL
  5970. \-847: O: O1694 (predict-no)
  5971. I see 1 and I'm going to do: predict-no
  5972. ENV: Agent did: predict-no for direction L in state State-A
  5973. In State-A moving L
  5974. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5975. predict error 0
  5976. dir: dir isU
  5977. /|\848: O: O1696 (predict-no)
  5978. I see 1 and I'm going to do: predict-no
  5979. ENV: Agent did: predict-no for direction U in state State-A
  5980. In State-A moving U
  5981. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5982. predict error 0
  5983. dir: dir isR
  5984. -/849: O: O1697 (predict-yes)
  5985. I see 1 and I'm going to do: predict-yes
  5986. ENV: Agent did: predict-yes for direction R in state State-A
  5987. In State-A moving R
  5988. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5989. predict error 0
  5990. dir: dir isL
  5991. |\-/850: O: O1699 (predict-yes)
  5992. I see 1 and I'm going to do: predict-yes
  5993. ENV: Agent did: predict-yes for direction L in state State-B
  5994. In State-B moving L
  5995. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5996. predict error 0
  5997. dir: dir isL
  5998. |\-851: O: O1702 (predict-no)
  5999. I see 1 and I'm going to do: predict-no
  6000. ENV: Agent did: predict-no for direction L in state State-A
  6001. In State-A moving L
  6002. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6003. predict error 0
  6004. dir: dir isR
  6005. /852: O: O1703 (predict-yes)
  6006. I see 1 and I'm going to do: predict-yes
  6007. ENV: Agent did: predict-yes for direction R in state State-A
  6008. In State-A moving R
  6009. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6010. predict error 0
  6011. dir: dir isR
  6012. |\853: O: O1706 (predict-no)
  6013. I see 1 and I'm going to do: predict-no
  6014. ENV: Agent did: predict-no for direction R in state State-B
  6015. In State-B moving R
  6016. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6017. predict error 0
  6018. dir: dir isU
  6019. -/|854: O: O1708 (predict-no)
  6020. I see 1 and I'm going to do: predict-no
  6021. ENV: Agent did: predict-no for direction U in state State-B
  6022. In State-B moving U
  6023. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6024. predict error 0
  6025. dir: dir isU
  6026. \-/855: O: O1710 (predict-no)
  6027. I see 1 and I'm going to do: predict-no
  6028. ENV: Agent did: predict-no for direction U in state State-B
  6029. In State-B moving U
  6030. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6031. predict error 0
  6032. dir: dir isU
  6033. |\-856: O: O1712 (predict-no)
  6034. I see 1 and I'm going to do: predict-no
  6035. ENV: Agent did: predict-no for direction U in state State-B
  6036. In State-B moving U
  6037. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6038. predict error 0
  6039. dir: dir isL
  6040. /|\857: O: O1713 (predict-yes)
  6041. I see 1 and I'm going to do: predict-yes
  6042. ENV: Agent did: predict-yes for direction L in state State-B
  6043. In State-B moving L
  6044. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6045. predict error 0
  6046. dir: dir isL
  6047. -/858: O: O1716 (predict-no)
  6048. I see 1 and I'm going to do: predict-no
  6049. ENV: Agent did: predict-no for direction L in state State-A
  6050. In State-A moving L
  6051. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6052. predict error 0
  6053. dir: dir isU
  6054. |\-859: O: O1718 (predict-no)
  6055. I see 1 and I'm going to do: predict-no
  6056. ENV: Agent did: predict-no for direction U in state State-A
  6057. In State-A moving U
  6058. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6059. predict error 0
  6060. dir: dir isU
  6061. /|\860: O: O1720 (predict-no)
  6062. I see 1 and I'm going to do: predict-no
  6063. ENV: Agent did: predict-no for direction U in state State-A
  6064. In State-A moving U
  6065. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6066. predict error 0
  6067. dir: dir isR
  6068. -/|861: O: O1721 (predict-yes)
  6069. I see 1 and I'm going to do: predict-yes
  6070. ENV: Agent did: predict-yes for direction R in state State-A
  6071. In State-A moving R
  6072. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6073. predict error 0
  6074. dir: dir isU
  6075. \862: O: O1724 (predict-no)
  6076. I see 1 and I'm going to do: predict-no
  6077. ENV: Agent did: predict-no for direction U in state State-B
  6078. In State-B moving U
  6079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6080. predict error 0
  6081. dir: dir isR
  6082. -/|863: O: O1726 (predict-no)
  6083. I see 1 and I'm going to do: predict-no
  6084. ENV: Agent did: predict-no for direction R in state State-B
  6085. In State-B moving R
  6086. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6087. predict error 0
  6088. dir: dir isL
  6089. \-/864: O: O1727 (predict-yes)
  6090. I see 1 and I'm going to do: predict-yes
  6091. ENV: Agent did: predict-yes for direction L in state State-B
  6092. In State-B moving L
  6093. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6094. predict error 0
  6095. dir: dir isL
  6096. |\-865: O: O1730 (predict-no)
  6097. I see 1 and I'm going to do: predict-no
  6098. ENV: Agent did: predict-no for direction L in state State-A
  6099. In State-A moving L
  6100. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6101. predict error 0
  6102. dir: dir isL
  6103. /|866: O: O1732 (predict-no)
  6104. I see 1 and I'm going to do: predict-no
  6105. ENV: Agent did: predict-no for direction L in state State-A
  6106. In State-A moving L
  6107. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6108. predict error 0
  6109. dir: dir isR
  6110. \-867: O: O1733 (predict-yes)
  6111. I see 1 and I'm going to do: predict-yes
  6112. ENV: Agent did: predict-yes for direction R in state State-A
  6113. In State-A moving R
  6114. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6115. predict error 0
  6116. dir: dir isU
  6117. /|\868: O: O1736 (predict-no)
  6118. I see 1 and I'm going to do: predict-no
  6119. ENV: Agent did: predict-no for direction U in state State-B
  6120. In State-B moving U
  6121. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6122. predict error 0
  6123. dir: dir isU
  6124. -/|869: O: O1738 (predict-no)
  6125. I see 1 and I'm going to do: predict-no
  6126. ENV: Agent did: predict-no for direction U in state State-B
  6127. In State-B moving U
  6128. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6129. predict error 0
  6130. dir: dir isR
  6131. \-870: O: O1740 (predict-no)
  6132. I see 1 and I'm going to do: predict-no
  6133. ENV: Agent did: predict-no for direction R in state State-B
  6134. In State-B moving R
  6135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6136. predict error 0
  6137. dir: dir isR
  6138. /|\871: O: O1742 (predict-no)
  6139. I see 1 and I'm going to do: predict-no
  6140. ENV: Agent did: predict-no for direction R in state State-B
  6141. In State-B moving R
  6142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6143. predict error 0
  6144. dir: dir isU
  6145. -872: O: O1744 (predict-no)
  6146. I see 1 and I'm going to do: predict-no
  6147. ENV: Agent did: predict-no for direction U in state State-B
  6148. In State-B moving U
  6149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6150. predict error 0
  6151. dir: dir isU
  6152. /|873: O: O1746 (predict-no)
  6153. I see 1 and I'm going to do: predict-no
  6154. ENV: Agent did: predict-no for direction U in state State-B
  6155. In State-B moving U
  6156. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6157. predict error 0
  6158. dir: dir isR
  6159. \-/874: O: O1748 (predict-no)
  6160. I see 1 and I'm going to do: predict-no
  6161. ENV: Agent did: predict-no for direction R in state State-B
  6162. In State-B moving R
  6163. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6164. predict error 0
  6165. dir: dir isR
  6166. |\-875: O: O1750 (predict-no)
  6167. I see 1 and I'm going to do: predict-no
  6168. ENV: Agent did: predict-no for direction R in state State-B
  6169. In State-B moving R
  6170. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6171. predict error 0
  6172. dir: dir isR
  6173. /|\876: O: O1752 (predict-no)
  6174. I see 1 and I'm going to do: predict-no
  6175. ENV: Agent did: predict-no for direction R in state State-B
  6176. In State-B moving R
  6177. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6178. predict error 0
  6179. dir: dir isL
  6180. -/|877: O: O1753 (predict-yes)
  6181. I see 1 and I'm going to do: predict-yes
  6182. ENV: Agent did: predict-yes for direction L in state State-B
  6183. In State-B moving L
  6184. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6185. predict error 0
  6186. dir: dir isL
  6187. \-/878: O: O1756 (predict-no)
  6188. I see 1 and I'm going to do: predict-no
  6189. ENV: Agent did: predict-no for direction L in state State-A
  6190. In State-A moving L
  6191. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6192. predict error 0
  6193. dir: dir isU
  6194. |\-879: O: O1758 (predict-no)
  6195. I see 1 and I'm going to do: predict-no
  6196. ENV: Agent did: predict-no for direction U in state State-A
  6197. In State-A moving U
  6198. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6199. predict error 0
  6200. dir: dir isU
  6201. /|880: O: O1760 (predict-no)
  6202. I see 1 and I'm going to do: predict-no
  6203. ENV: Agent did: predict-no for direction U in state State-A
  6204. In State-A moving U
  6205. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6206. predict error 0
  6207. dir: dir isR
  6208. \-881: O: O1761 (predict-yes)
  6209. I see 1 and I'm going to do: predict-yes
  6210. ENV: Agent did: predict-yes for direction R in state State-A
  6211. In State-A moving R
  6212. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6213. predict error 0
  6214. dir: dir isR
  6215. /882: O: O1764 (predict-no)
  6216. I see 1 and I'm going to do: predict-no
  6217. ENV: Agent did: predict-no for direction R in state State-B
  6218. In State-B moving R
  6219. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6220. predict error 0
  6221. dir: dir isR
  6222. |\-883: O: O1766 (predict-no)
  6223. I see 1 and I'm going to do: predict-no
  6224. ENV: Agent did: predict-no for direction R in state State-B
  6225. In State-B moving R
  6226. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6227. predict error 0
  6228. dir: dir isR
  6229. /|\884: O: O1768 (predict-no)
  6230. I see 1 and I'm going to do: predict-no
  6231. ENV: Agent did: predict-no for direction R in state State-B
  6232. In State-B moving R
  6233. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6234. predict error 0
  6235. dir: dir isU
  6236. -/|885: O: O1770 (predict-no)
  6237. I see 1 and I'm going to do: predict-no
  6238. ENV: Agent did: predict-no for direction U in state State-B
  6239. In State-B moving U
  6240. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6241. predict error 0
  6242. dir: dir isR
  6243. \-886: O: O1772 (predict-no)
  6244. I see 1 and I'm going to do: predict-no
  6245. ENV: Agent did: predict-no for direction R in state State-B
  6246. In State-B moving R
  6247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6248. predict error 0
  6249. dir: dir isU
  6250. /|887: O: O1774 (predict-no)
  6251. I see 1 and I'm going to do: predict-no
  6252. ENV: Agent did: predict-no for direction U in state State-B
  6253. In State-B moving U
  6254. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6255. predict error 0
  6256. dir: dir isL
  6257. \-/888: O: O1775 (predict-yes)
  6258. I see 1 and I'm going to do: predict-yes
  6259. ENV: Agent did: predict-yes for direction L in state State-B
  6260. In State-B moving L
  6261. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6262. predict error 0
  6263. dir: dir isL
  6264. |\-889: O: O1778 (predict-no)
  6265. I see 1 and I'm going to do: predict-no
  6266. ENV: Agent did: predict-no for direction L in state State-A
  6267. In State-A moving L
  6268. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6269. predict error 0
  6270. dir: dir isR
  6271. /|\890: O: O1779 (predict-yes)
  6272. I see 1 and I'm going to do: predict-yes
  6273. ENV: Agent did: predict-yes for direction R in state State-A
  6274. In State-A moving R
  6275. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6276. predict error 0
  6277. dir: dir isR
  6278. -/|891: O: O1782 (predict-no)
  6279. I see 1 and I'm going to do: predict-no
  6280. ENV: Agent did: predict-no for direction R in state State-B
  6281. In State-B moving R
  6282. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6283. predict error 0
  6284. dir: dir isU
  6285. \892: O: O1784 (predict-no)
  6286. I see 1 and I'm going to do: predict-no
  6287. ENV: Agent did: predict-no for direction U in state State-B
  6288. In State-B moving U
  6289. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6290. predict error 0
  6291. dir: dir isU
  6292. -/|893: O: O1786 (predict-no)
  6293. I see 1 and I'm going to do: predict-no
  6294. ENV: Agent did: predict-no for direction U in state State-B
  6295. In State-B moving U
  6296. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6297. predict error 0
  6298. dir: dir isU
  6299. \-/894: O: O1788 (predict-no)
  6300. I see 1 and I'm going to do: predict-no
  6301. ENV: Agent did: predict-no for direction U in state State-B
  6302. In State-B moving U
  6303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6304. predict error 0
  6305. dir: dir isR
  6306. |\895: O: O1790 (predict-no)
  6307. I see 1 and I'm going to do: predict-no
  6308. ENV: Agent did: predict-no for direction R in state State-B
  6309. In State-B moving R
  6310. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6311. predict error 0
  6312. dir: dir isL
  6313. -/|896: O: O1791 (predict-yes)
  6314. I see 1 and I'm going to do: predict-yes
  6315. ENV: Agent did: predict-yes for direction L in state State-B
  6316. In State-B moving L
  6317. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6318. predict error 0
  6319. dir: dir isR
  6320. \-897: O: O1793 (predict-yes)
  6321. I see 1 and I'm going to do: predict-yes
  6322. ENV: Agent did: predict-yes for direction R in state State-A
  6323. In State-A moving R
  6324. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6325. predict error 0
  6326. dir: dir isR
  6327. /|\898: O: O1796 (predict-no)
  6328. I see 1 and I'm going to do: predict-no
  6329. ENV: Agent did: predict-no for direction R in state State-B
  6330. In State-B moving R
  6331. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6332. predict error 0
  6333. dir: dir isR
  6334. -/|899: O: O1798 (predict-no)
  6335. I see 1 and I'm going to do: predict-no
  6336. ENV: Agent did: predict-no for direction R in state State-B
  6337. In State-B moving R
  6338. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6339. predict error 0
  6340. dir: dir isL
  6341. \-/900: O: O1799 (predict-yes)
  6342. I see 1 and I'm going to do: predict-yes
  6343. ENV: Agent did: predict-yes for direction L in state State-B
  6344. In State-B moving L
  6345. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6346. predict error 0
  6347. dir: dir isU
  6348. |\-901: O: O1802 (predict-no)
  6349. I see 1 and I'm going to do: predict-no
  6350. ENV: Agent did: predict-no for direction U in state State-A
  6351. In State-A moving U
  6352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6353. predict error 0
  6354. dir: dir isU
  6355. /902: O: O1804 (predict-no)
  6356. I see 1 and I'm going to do: predict-no
  6357. ENV: Agent did: predict-no for direction U in state State-A
  6358. In State-A moving U
  6359. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6360. predict error 0
  6361. dir: dir isL
  6362. |\-903: O: O1806 (predict-no)
  6363. I see 1 and I'm going to do: predict-no
  6364. ENV: Agent did: predict-no for direction L in state State-A
  6365. In State-A moving L
  6366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6367. predict error 0
  6368. dir: dir isU
  6369. /|904: O: O1808 (predict-no)
  6370. I see 1 and I'm going to do: predict-no
  6371. ENV: Agent did: predict-no for direction U in state State-A
  6372. In State-A moving U
  6373. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6374. predict error 0
  6375. dir: dir isR
  6376. \-/905: O: O1809 (predict-yes)
  6377. I see 1 and I'm going to do: predict-yes
  6378. ENV: Agent did: predict-yes for direction R in state State-A
  6379. In State-A moving R
  6380. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6381. predict error 0
  6382. dir: dir isR
  6383. |\906: O: O1812 (predict-no)
  6384. I see 1 and I'm going to do: predict-no
  6385. ENV: Agent did: predict-no for direction R in state State-B
  6386. In State-B moving R
  6387. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6388. predict error 0
  6389. dir: dir isU
  6390. -/|907: O: O1814 (predict-no)
  6391. I see 1 and I'm going to do: predict-no
  6392. ENV: Agent did: predict-no for direction U in state State-B
  6393. In State-B moving U
  6394. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6395. predict error 0
  6396. dir: dir isU
  6397. \-/908: O: O1816 (predict-no)
  6398. I see 1 and I'm going to do: predict-no
  6399. ENV: Agent did: predict-no for direction U in state State-B
  6400. In State-B moving U
  6401. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6402. predict error 0
  6403. dir: dir isR
  6404. |\909: O: O1818 (predict-no)
  6405. I see 1 and I'm going to do: predict-no
  6406. ENV: Agent did: predict-no for direction R in state State-B
  6407. In State-B moving R
  6408. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6409. predict error 0
  6410. dir: dir isL
  6411. -/|910: O: O1819 (predict-yes)
  6412. I see 1 and I'm going to do: predict-yes
  6413. ENV: Agent did: predict-yes for direction L in state State-B
  6414. In State-B moving L
  6415. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6416. predict error 0
  6417. dir: dir isU
  6418. \-/911: O: O1822 (predict-no)
  6419. I see 1 and I'm going to do: predict-no
  6420. ENV: Agent did: predict-no for direction U in state State-A
  6421. In State-A moving U
  6422. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6423. predict error 0
  6424. dir: dir isL
  6425. |912: O: O1824 (predict-no)
  6426. I see 1 and I'm going to do: predict-no
  6427. ENV: Agent did: predict-no for direction L in state State-A
  6428. In State-A moving L
  6429. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6430. predict error 0
  6431. dir: dir isR
  6432. \-913: O: O1825 (predict-yes)
  6433. I see 1 and I'm going to do: predict-yes
  6434. ENV: Agent did: predict-yes for direction R in state State-A
  6435. In State-A moving R
  6436. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6437. predict error 0
  6438. dir: dir isU
  6439. /|914: O: O1828 (predict-no)
  6440. I see 1 and I'm going to do: predict-no
  6441. ENV: Agent did: predict-no for direction U in state State-B
  6442. In State-B moving U
  6443. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6444. predict error 0
  6445. dir: dir isL
  6446. \-/915: O: O1829 (predict-yes)
  6447. I see 1 and I'm going to do: predict-yes
  6448. ENV: Agent did: predict-yes for direction L in state State-B
  6449. In State-B moving L
  6450. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6451. predict error 0
  6452. dir: dir isL
  6453. |\916: O: O1832 (predict-no)
  6454. I see 1 and I'm going to do: predict-no
  6455. ENV: Agent did: predict-no for direction L in state State-A
  6456. In State-A moving L
  6457. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6458. predict error 0
  6459. dir: dir isU
  6460. -/|\917: O: O1834 (predict-no)
  6461. I see 1 and I'm going to do: predict-no
  6462. ENV: Agent did: predict-no for direction U in state State-A
  6463. In State-A moving U
  6464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6465. predict error 0
  6466. dir: dir isU
  6467. -/918: O: O1836 (predict-no)
  6468. I see 1 and I'm going to do: predict-no
  6469. ENV: Agent did: predict-no for direction U in state State-A
  6470. In State-A moving U
  6471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6472. predict error 0
  6473. dir: dir isL
  6474. |919: O: O1838 (predict-no)
  6475. I see 1 and I'm going to do: predict-no
  6476. ENV: Agent did: predict-no for direction L in state State-A
  6477. In State-A moving L
  6478. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6479. predict error 0
  6480. dir: dir isR
  6481. \-/920: O: O1839 (predict-yes)
  6482. I see 1 and I'm going to do: predict-yes
  6483. ENV: Agent did: predict-yes for direction R in state State-A
  6484. In State-A moving R
  6485. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6486. predict error 0
  6487. dir: dir isL
  6488. |\-921: O: O1841 (predict-yes)
  6489. I see 1 and I'm going to do: predict-yes
  6490. ENV: Agent did: predict-yes for direction L in state State-B
  6491. In State-B moving L
  6492. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6493. predict error 0
  6494. dir: dir isL
  6495. /922: O: O1844 (predict-no)
  6496. I see 1 and I'm going to do: predict-no
  6497. ENV: Agent did: predict-no for direction L in state State-A
  6498. In State-A moving L
  6499. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6500. predict error 0
  6501. dir: dir isL
  6502. |\923: O: O1846 (predict-no)
  6503. I see 1 and I'm going to do: predict-no
  6504. ENV: Agent did: predict-no for direction L in state State-A
  6505. In State-A moving L
  6506. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6507. predict error 0
  6508. dir: dir isU
  6509. -/|924: O: O1848 (predict-no)
  6510. I see 1 and I'm going to do: predict-no
  6511. ENV: Agent did: predict-no for direction U in state State-A
  6512. In State-A moving U
  6513. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6514. predict error 0
  6515. dir: dir isL
  6516. \-/925: O: O1850 (predict-no)
  6517. I see 1 and I'm going to do: predict-no
  6518. ENV: Agent did: predict-no for direction L in state State-A
  6519. In State-A moving L
  6520. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6521. predict error 0
  6522. dir: dir isL
  6523. |\926: O: O1852 (predict-no)
  6524. I see 1 and I'm going to do: predict-no
  6525. ENV: Agent did: predict-no for direction L in state State-A
  6526. In State-A moving L
  6527. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6528. predict error 0
  6529. dir: dir isL
  6530. -/|927: O: O1854 (predict-no)
  6531. I see 1 and I'm going to do: predict-no
  6532. ENV: Agent did: predict-no for direction L in state State-A
  6533. In State-A moving L
  6534. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6535. predict error 0
  6536. dir: dir isL
  6537. \-/928: O: O1856 (predict-no)
  6538. I see 1 and I'm going to do: predict-no
  6539. ENV: Agent did: predict-no for direction L in state State-A
  6540. In State-A moving L
  6541. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6542. predict error 0
  6543. dir: dir isR
  6544. |\-929: O: O1857 (predict-yes)
  6545. I see 1 and I'm going to do: predict-yes
  6546. ENV: Agent did: predict-yes for direction R in state State-A
  6547. In State-A moving R
  6548. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6549. predict error 0
  6550. dir: dir isU
  6551. /|\930: O: O1860 (predict-no)
  6552. I see 1 and I'm going to do: predict-no
  6553. ENV: Agent did: predict-no for direction U in state State-B
  6554. In State-B moving U
  6555. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6556. predict error 0
  6557. dir: dir isU
  6558. -/|931: O: O1862 (predict-no)
  6559. I see 1 and I'm going to do: predict-no
  6560. ENV: Agent did: predict-no for direction U in state State-B
  6561. In State-B moving U
  6562. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6563. predict error 0
  6564. dir: dir isR
  6565. \932: O: O1864 (predict-no)
  6566. I see 1 and I'm going to do: predict-no
  6567. ENV: Agent did: predict-no for direction R in state State-B
  6568. In State-B moving R
  6569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6570. predict error 0
  6571. dir: dir isU
  6572. -/933: O: O1866 (predict-no)
  6573. I see 1 and I'm going to do: predict-no
  6574. ENV: Agent did: predict-no for direction U in state State-B
  6575. In State-B moving U
  6576. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6577. predict error 0
  6578. dir: dir isL
  6579. |\934: O: O1867 (predict-yes)
  6580. I see 1 and I'm going to do: predict-yes
  6581. ENV: Agent did: predict-yes for direction L in state State-B
  6582. In State-B moving L
  6583. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6584. predict error 0
  6585. dir: dir isL
  6586. -/|935: O: O1870 (predict-no)
  6587. I see 1 and I'm going to do: predict-no
  6588. ENV: Agent did: predict-no for direction L in state State-A
  6589. In State-A moving L
  6590. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6591. predict error 0
  6592. dir: dir isU
  6593. \-/936: O: O1872 (predict-no)
  6594. I see 1 and I'm going to do: predict-no
  6595. ENV: Agent did: predict-no for direction U in state State-A
  6596. In State-A moving U
  6597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6598. predict error 0
  6599. dir: dir isL
  6600. |\937: O: O1874 (predict-no)
  6601. I see 1 and I'm going to do: predict-no
  6602. ENV: Agent did: predict-no for direction L in state State-A
  6603. In State-A moving L
  6604. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6605. predict error 0
  6606. dir: dir isL
  6607. -/938: O: O1876 (predict-no)
  6608. I see 1 and I'm going to do: predict-no
  6609. ENV: Agent did: predict-no for direction L in state State-A
  6610. In State-A moving L
  6611. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6612. predict error 0
  6613. dir: dir isR
  6614. |\939: O: O1877 (predict-yes)
  6615. I see 1 and I'm going to do: predict-yes
  6616. ENV: Agent did: predict-yes for direction R in state State-A
  6617. In State-A moving R
  6618. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6619. predict error 0
  6620. dir: dir isL
  6621. -/940: O: O1879 (predict-yes)
  6622. I see 1 and I'm going to do: predict-yes
  6623. ENV: Agent did: predict-yes for direction L in state State-B
  6624. In State-B moving L
  6625. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6626. predict error 0
  6627. dir: dir isR
  6628. |\-941: O: O1881 (predict-yes)
  6629. I see 1 and I'm going to do: predict-yes
  6630. ENV: Agent did: predict-yes for direction R in state State-A
  6631. In State-A moving R
  6632. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6633. predict error 0
  6634. dir: dir isL
  6635. /942: O: O1883 (predict-yes)
  6636. I see 1 and I'm going to do: predict-yes
  6637. ENV: Agent did: predict-yes for direction L in state State-B
  6638. In State-B moving L
  6639. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6640. predict error 0
  6641. dir: dir isL
  6642. |\-943: O: O1886 (predict-no)
  6643. I see 1 and I'm going to do: predict-no
  6644. ENV: Agent did: predict-no for direction L in state State-A
  6645. In State-A moving L
  6646. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6647. predict error 0
  6648. dir: dir isU
  6649. /|\944: O: O1888 (predict-no)
  6650. I see 1 and I'm going to do: predict-no
  6651. ENV: Agent did: predict-no for direction U in state State-A
  6652. In State-A moving U
  6653. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6654. predict error 0
  6655. dir: dir isL
  6656. -/945: O: O1890 (predict-no)
  6657. I see 1 and I'm going to do: predict-no
  6658. ENV: Agent did: predict-no for direction L in state State-A
  6659. In State-A moving L
  6660. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6661. predict error 0
  6662. dir: dir isU
  6663. |\-946: O: O1892 (predict-no)
  6664. I see 1 and I'm going to do: predict-no
  6665. ENV: Agent did: predict-no for direction U in state State-A
  6666. In State-A moving U
  6667. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6668. predict error 0
  6669. dir: dir isL
  6670. /|947: O: O1894 (predict-no)
  6671. I see 1 and I'm going to do: predict-no
  6672. ENV: Agent did: predict-no for direction L in state State-A
  6673. In State-A moving L
  6674. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6675. predict error 0
  6676. dir: dir isU
  6677. \948: O: O1896 (predict-no)
  6678. I see 1 and I'm going to do: predict-no
  6679. ENV: Agent did: predict-no for direction U in state State-A
  6680. In State-A moving U
  6681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6682. predict error 0
  6683. dir: dir isU
  6684. -/|949: O: O1898 (predict-no)
  6685. I see 1 and I'm going to do: predict-no
  6686. ENV: Agent did: predict-no for direction U in state State-A
  6687. In State-A moving U
  6688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6689. predict error 0
  6690. dir: dir isR
  6691. \-/950: O: O1899 (predict-yes)
  6692. I see 1 and I'm going to do: predict-yes
  6693. ENV: Agent did: predict-yes for direction R in state State-A
  6694. In State-A moving R
  6695. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6696. predict error 0
  6697. dir: dir isL
  6698. |\-/|\-/|--- Input Phase ---
  6699. =>WM: (13351: I2 ^dir L)
  6700. =>WM: (13350: I2 ^reward 1)
  6701. =>WM: (13349: I2 ^see 1)
  6702. =>WM: (13348: N950 ^status complete)
  6703. <=WM: (13337: I2 ^dir R)
  6704. <=WM: (13336: I2 ^reward 1)
  6705. <=WM: (13335: I2 ^see 0)
  6706. =>WM: (13352: I2 ^level-1 R1-root)
  6707. <=WM: (13338: I2 ^level-1 L0-root)
  6708. --- END Input Phase ---
  6709. --- Proposal Phase ---
  6710. --- Inner Elaboration Phase, active level 1 (S1) ---
  6711. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  6712. -->
  6713. (S1 ^operator O1899 = 0.4768760547163575)
  6714. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  6715. -->
  6716. (S1 ^operator O1900 = -0.01194930198035649)
  6717. Firing prefer*rvt*predict-no*H0*2*H1
  6718. -->
  6719. Firing prefer*rvt*predict-yes*H0*1*H1
  6720. -->
  6721. Firing elaborate*copy-see-to-output-link
  6722. -->
  6723. (I3 ^see 1 +)
  6724. Firing elaborate*reward*based*on*reward
  6725. -->
  6726. (R954 ^value 1 +)
  6727. (R1 ^reward R954 +)
  6728. Firing propose*predict-yes
  6729. -->
  6730. (O1901 ^name predict-yes +)
  6731. (S1 ^operator O1901 +)
  6732. Firing propose*predict-no
  6733. -->
  6734. (O1902 ^name predict-no +)
  6735. (S1 ^operator O1902 +)
  6736. Firing rl*prefer*rvt*predict-no*H0*2
  6737. -->
  6738. (S1 ^operator O1900 = 0.2550132695707557)
  6739. Firing rl*prefer*rvt*predict-yes*H0*1
  6740. -->
  6741. (S1 ^operator O1899 = 0.5231202597544767)
  6742. Firing prefer*rvt*predict-yes*H0
  6743. -->
  6744. Firing prefer*rvt*predict-no*H0
  6745. -->
  6746. Firing elaborate*copy-dir-to-output-link
  6747. -->
  6748. (I3 ^dir L +)
  6749. inner elaboration loop at bottom goal.
  6750. Retracting elaborate*copy-see-to-output-link
  6751. -->
  6752. (I3 ^see 0 +)
  6753. Retracting propose*predict-no
  6754. -->
  6755. (O1900 ^name predict-no +)
  6756. (S1 ^operator O1900 +)
  6757. Retracting propose*predict-yes
  6758. -->
  6759. (O1899 ^name predict-yes +)
  6760. (S1 ^operator O1899 +)
  6761. Retracting elaborate*reward*based*on*reward
  6762. -->
  6763. (R953 ^value 1 +)
  6764. (R1 ^reward R953 +)
  6765. Retracting elaborate*copy-dir-to-output-link
  6766. -->
  6767. (I3 ^dir R +)
  6768. Retracting rl*prefer*rvt*predict-no*H0*4
  6769. -->
  6770. (S1 ^operator O1900 = 0.1269768259493387)
  6771. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  6772. -->
  6773. (S1 ^operator O1900 = 0.4910065094545203)
  6774. Retracting rl*prefer*rvt*predict-yes*H0*3
  6775. -->
  6776. (S1 ^operator O1899 = 0.3829293116822346)
  6777. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  6778. -->
  6779. (S1 ^operator O1899 = 0.6170848495907595)
  6780. =>WM: (13360: S1 ^operator O1902 +)
  6781. =>WM: (13359: S1 ^operator O1901 +)
  6782. =>WM: (13358: I3 ^dir L)
  6783. =>WM: (13357: O1902 ^name predict-no)
  6784. =>WM: (13356: O1901 ^name predict-yes)
  6785. =>WM: (13355: R954 ^value 1)
  6786. =>WM: (13354: R1 ^reward R954)
  6787. =>WM: (13353: I3 ^see 1)
  6788. <=WM: (13344: S1 ^operator O1899 +)
  6789. <=WM: (13346: S1 ^operator O1899)
  6790. <=WM: (13345: S1 ^operator O1900 +)
  6791. <=WM: (13343: I3 ^dir R)
  6792. <=WM: (13339: R1 ^reward R953)
  6793. <=WM: (13255: I3 ^see 0)
  6794. <=WM: (13342: O1900 ^name predict-no)
  6795. <=WM: (13341: O1899 ^name predict-yes)
  6796. <=WM: (13340: R953 ^value 1)
  6797. --- Inner Elaboration Phase, active level 1 (S1) ---
  6798. Firing prefer*rvt*predict-yes*H0
  6799. -->
  6800. Firing rl*prefer*rvt*predict-yes*H0*1
  6801. -->
  6802. (S1 ^operator O1901 = 0.5231202597544767)
  6803. Firing prefer*rvt*predict-yes*H0*1*H1
  6804. -->
  6805. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  6806. -->
  6807. (S1 ^operator O1901 = 0.4768760547163575)
  6808. Firing prefer*rvt*predict-no*H0
  6809. -->
  6810. Firing rl*prefer*rvt*predict-no*H0*2
  6811. -->
  6812. (S1 ^operator O1902 = 0.2550132695707557)
  6813. Firing prefer*rvt*predict-no*H0*2*H1
  6814. -->
  6815. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  6816. -->
  6817. (S1 ^operator O1902 = -0.01194930198035649)
  6818. inner elaboration loop at bottom goal.
  6819. Retracting rl*prefer*rvt*predict-no*H0*2
  6820. -->
  6821. (S1 ^operator O1900 = 0.2550132695707557)
  6822. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  6823. -->
  6824. (S1 ^operator O1900 = -0.01194930198035649)
  6825. Retracting rl*prefer*rvt*predict-yes*H0*1
  6826. -->
  6827. (S1 ^operator O1899 = 0.5231202597544767)
  6828. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  6829. -->
  6830. (S1 ^operator O1899 = 0.4768760547163575)
  6831. --- END Proposal Phase ---
  6832. --- Decision Phase ---
  6833. RL update rl*prefer*rvt*predict-yes*H0*3 0.673123 -0.290194 0.382929 -> 0.673122 -0.290194 0.382927(R,m,v=1,0.958904,0.0396788)
  6834. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326889 0.290195 0.617085 -> 0.326888 0.290195 0.617083(R,m,v=1,1,0)
  6835. =>WM: (13361: S1 ^operator O1901)
  6836. 951: O: O1901 (predict-yes)
  6837. --- END Decision Phase ---
  6838. --- Application Phase ---
  6839. --- Firing Productions (PE) For State At Depth 1 ---
  6840. --- Inner Elaboration Phase, active level 1 (S1) ---
  6841. Firing apply*operator
  6842. -->
  6843. (I3 ^predict-yes N951 + :O )
  6844. Firing apply*operator*complete
  6845. -->
  6846. (I3 ^predict-yes N950 - :O )
  6847. inner elaboration loop at bottom goal.
  6848. --- Change Working Memory (PE) ---
  6849. =>WM: (13362: I3 ^predict-yes N951)
  6850. <=WM: (13348: N950 ^status complete)
  6851. <=WM: (13347: I3 ^predict-yes N950)
  6852. --- Firing Productions (IE) For State At Depth 1 ---
  6853. --- Inner Elaboration Phase, active level 1 (S1) ---
  6854. Firing monitor*world
  6855. -->
  6856. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  6857. --- Change Working Memory (IE) ---
  6858. --- END Application Phase ---
  6859. --- Output Phase ---
  6860. ENV: Agent did: predict-yes for direction L in state State-B
  6861. In State-B moving L
  6862. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6863. predict error 0
  6864. dir: dir isL
  6865. --- END Output Phase ---
  6866. --- Input Phase ---
  6867. =>WM: (13366: I2 ^dir L)
  6868. =>WM: (13365: I2 ^reward 1)
  6869. =>WM: (13364: I2 ^see 1)
  6870. =>WM: (13363: N951 ^status complete)
  6871. <=WM: (13351: I2 ^dir L)
  6872. <=WM: (13350: I2 ^reward 1)
  6873. <=WM: (13349: I2 ^see 1)
  6874. =>WM: (13367: I2 ^level-1 L1-root)
  6875. <=WM: (13352: I2 ^level-1 R1-root)
  6876. --- END Input Phase ---
  6877. --- Proposal Phase ---
  6878. --- Inner Elaboration Phase, active level 1 (S1) ---
  6879. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  6880. -->
  6881. (S1 ^operator O1901 = 0.1693592933936033)
  6882. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  6883. -->
  6884. (S1 ^operator O1902 = 0.7449862034212327)
  6885. Firing prefer*rvt*predict-no*H0*2*H1
  6886. -->
  6887. Firing prefer*rvt*predict-yes*H0*1*H1
  6888. -->
  6889. Firing elaborate*copy-see-to-output-link
  6890. -->
  6891. (I3 ^see 1 +)
  6892. Firing elaborate*reward*based*on*reward
  6893. -->
  6894. (R955 ^value 1 +)
  6895. (R1 ^reward R955 +)
  6896. Firing propose*predict-yes
  6897. -->
  6898. (O1903 ^name predict-yes +)
  6899. (S1 ^operator O1903 +)
  6900. Firing propose*predict-no
  6901. -->
  6902. (O1904 ^name predict-no +)
  6903. (S1 ^operator O1904 +)
  6904. Firing rl*prefer*rvt*predict-no*H0*2
  6905. -->
  6906. (S1 ^operator O1902 = 0.2550132695707557)
  6907. Firing rl*prefer*rvt*predict-yes*H0*1
  6908. -->
  6909. (S1 ^operator O1901 = 0.5231202597544767)
  6910. Firing prefer*rvt*predict-yes*H0
  6911. -->
  6912. Firing prefer*rvt*predict-no*H0
  6913. -->
  6914. Firing elaborate*copy-dir-to-output-link
  6915. -->
  6916. (I3 ^dir L +)
  6917. inner elaboration loop at bottom goal.
  6918. Retracting elaborate*copy-see-to-output-link
  6919. -->
  6920. (I3 ^see 1 +)
  6921. Retracting propose*predict-no
  6922. -->
  6923. (O1902 ^name predict-no +)
  6924. (S1 ^operator O1902 +)
  6925. Retracting propose*predict-yes
  6926. -->
  6927. (O1901 ^name predict-yes +)
  6928. (S1 ^operator O1901 +)
  6929. Retracting elaborate*reward*based*on*reward
  6930. -->
  6931. (R954 ^value 1 +)
  6932. (R1 ^reward R954 +)
  6933. Retracting elaborate*copy-dir-to-output-link
  6934. -->
  6935. (I3 ^dir L +)
  6936. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  6937. -->
  6938. (S1 ^operator O1902 = -0.01194930198035649)
  6939. Retracting rl*prefer*rvt*predict-no*H0*2
  6940. -->
  6941. (S1 ^operator O1902 = 0.2550132695707557)
  6942. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  6943. -->
  6944. (S1 ^operator O1901 = 0.4768760547163575)
  6945. Retracting rl*prefer*rvt*predict-yes*H0*1
  6946. -->
  6947. (S1 ^operator O1901 = 0.5231202597544767)
  6948. =>WM: (13373: S1 ^operator O1904 +)
  6949. =>WM: (13372: S1 ^operator O1903 +)
  6950. =>WM: (13371: O1904 ^name predict-no)
  6951. =>WM: (13370: O1903 ^name predict-yes)
  6952. =>WM: (13369: R955 ^value 1)
  6953. =>WM: (13368: R1 ^reward R955)
  6954. <=WM: (13359: S1 ^operator O1901 +)
  6955. <=WM: (13361: S1 ^operator O1901)
  6956. <=WM: (13360: S1 ^operator O1902 +)
  6957. <=WM: (13354: R1 ^reward R954)
  6958. <=WM: (13357: O1902 ^name predict-no)
  6959. <=WM: (13356: O1901 ^name predict-yes)
  6960. <=WM: (13355: R954 ^value 1)
  6961. --- Inner Elaboration Phase, active level 1 (S1) ---
  6962. Firing prefer*rvt*predict-yes*H0
  6963. -->
  6964. Firing rl*prefer*rvt*predict-yes*H0*1
  6965. -->
  6966. (S1 ^operator O1903 = 0.5231202597544767)
  6967. Firing prefer*rvt*predict-yes*H0*1*H1
  6968. -->
  6969. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  6970. -->
  6971. (S1 ^operator O1903 = 0.1693592933936033)
  6972. Firing prefer*rvt*predict-no*H0
  6973. -->
  6974. Firing rl*prefer*rvt*predict-no*H0*2
  6975. -->
  6976. (S1 ^operator O1904 = 0.2550132695707557)
  6977. Firing prefer*rvt*predict-no*H0*2*H1
  6978. -->
  6979. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  6980. -->
  6981. (S1 ^operator O1904 = 0.7449862034212327)
  6982. inner elaboration loop at bottom goal.
  6983. Retracting rl*prefer*rvt*predict-no*H0*2
  6984. -->
  6985. (S1 ^operator O1902 = 0.2550132695707557)
  6986. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  6987. -->
  6988. (S1 ^operator O1902 = 0.7449862034212327)
  6989. Retracting rl*prefer*rvt*predict-yes*H0*1
  6990. -->
  6991. (S1 ^operator O1901 = 0.5231202597544767)
  6992. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  6993. -->
  6994. (S1 ^operator O1901 = 0.1693592933936033)
  6995. --- END Proposal Phase ---
  6996. --- Decision Phase ---
  6997. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.727961 -0.20484 0.523121(R,m,v=1,0.977941,0.021732)
  6998. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272035 0.204841 0.476876 -> 0.272036 0.204841 0.476877(R,m,v=1,1,0)
  6999. =>WM: (13374: S1 ^operator O1904)
  7000. 952: O: O1904 (predict-no)
  7001. --- END Decision Phase ---
  7002. --- Application Phase ---
  7003. --- Firing Productions (PE) For State At Depth 1 ---
  7004. --- Inner Elaboration Phase, active level 1 (S1) ---
  7005. Firing apply*operator
  7006. -->
  7007. (I3 ^predict-no N952 + :O )
  7008. Firing apply*operator*complete
  7009. -->
  7010. (I3 ^predict-yes N951 - :O )
  7011. inner elaboration loop at bottom goal.
  7012. --- Change Working Memory (PE) ---
  7013. =>WM: (13375: I3 ^predict-no N952)
  7014. <=WM: (13363: N951 ^status complete)
  7015. <=WM: (13362: I3 ^predict-yes N951)
  7016. --- Firing Productions (IE) For State At Depth 1 ---
  7017. --- Inner Elaboration Phase, active level 1 (S1) ---
  7018. Firing monitor*world
  7019. -->
  7020. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7021. --- Change Working Memory (IE) ---
  7022. --- END Application Phase ---
  7023. --- Output Phase ---
  7024. ENV: Agent did: predict-no for direction L in state State-A
  7025. In State-A moving L
  7026. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7027. predict error 0
  7028. dir: dir isU
  7029. --- END Output Phase ---
  7030. --- Input Phase ---
  7031. =>WM: (13379: I2 ^dir U)
  7032. =>WM: (13378: I2 ^reward 1)
  7033. =>WM: (13377: I2 ^see 0)
  7034. =>WM: (13376: N952 ^status complete)
  7035. <=WM: (13366: I2 ^dir L)
  7036. <=WM: (13365: I2 ^reward 1)
  7037. <=WM: (13364: I2 ^see 1)
  7038. =>WM: (13380: I2 ^level-1 L0-root)
  7039. <=WM: (13367: I2 ^level-1 L1-root)
  7040. --- END Input Phase ---
  7041. --- Proposal Phase ---
  7042. --- Inner Elaboration Phase, active level 1 (S1) ---
  7043. Firing elaborate*copy-see-to-output-link
  7044. -->
  7045. (I3 ^see 0 +)
  7046. Firing elaborate*reward*based*on*reward
  7047. -->
  7048. (R956 ^value 1 +)
  7049. (R1 ^reward R956 +)
  7050. Firing propose*predict-yes
  7051. -->
  7052. (O1905 ^name predict-yes +)
  7053. (S1 ^operator O1905 +)
  7054. Firing propose*predict-no
  7055. -->
  7056. (O1906 ^name predict-no +)
  7057. (S1 ^operator O1906 +)
  7058. Firing rl*prefer*rvt*predict-no*H0*6
  7059. -->
  7060. (S1 ^operator O1904 = 0.9999999999999999)
  7061. Firing rl*prefer*rvt*predict-yes*H0*5
  7062. -->
  7063. (S1 ^operator O1903 = 0.)
  7064. Firing prefer*rvt*predict-yes*H0
  7065. -->
  7066. Firing prefer*rvt*predict-no*H0
  7067. -->
  7068. Firing elaborate*copy-dir-to-output-link
  7069. -->
  7070. (I3 ^dir U +)
  7071. inner elaboration loop at bottom goal.
  7072. Retracting elaborate*copy-see-to-output-link
  7073. -->
  7074. (I3 ^see 1 +)
  7075. Retracting propose*predict-no
  7076. -->
  7077. (O1904 ^name predict-no +)
  7078. (S1 ^operator O1904 +)
  7079. Retracting propose*predict-yes
  7080. -->
  7081. (O1903 ^name predict-yes +)
  7082. (S1 ^operator O1903 +)
  7083. Retracting elaborate*reward*based*on*reward
  7084. -->
  7085. (R955 ^value 1 +)
  7086. (R1 ^reward R955 +)
  7087. Retracting elaborate*copy-dir-to-output-link
  7088. -->
  7089. (I3 ^dir L +)
  7090. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  7091. -->
  7092. (S1 ^operator O1904 = 0.7449862034212327)
  7093. Retracting rl*prefer*rvt*predict-no*H0*2
  7094. -->
  7095. (S1 ^operator O1904 = 0.2550132695707557)
  7096. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  7097. -->
  7098. (S1 ^operator O1903 = 0.1693592933936033)
  7099. Retracting rl*prefer*rvt*predict-yes*H0*1
  7100. -->
  7101. (S1 ^operator O1903 = 0.5231208125838516)
  7102. =>WM: (13388: S1 ^operator O1906 +)
  7103. =>WM: (13387: S1 ^operator O1905 +)
  7104. =>WM: (13386: I3 ^dir U)
  7105. =>WM: (13385: O1906 ^name predict-no)
  7106. =>WM: (13384: O1905 ^name predict-yes)
  7107. =>WM: (13383: R956 ^value 1)
  7108. =>WM: (13382: R1 ^reward R956)
  7109. =>WM: (13381: I3 ^see 0)
  7110. <=WM: (13372: S1 ^operator O1903 +)
  7111. <=WM: (13373: S1 ^operator O1904 +)
  7112. <=WM: (13374: S1 ^operator O1904)
  7113. <=WM: (13358: I3 ^dir L)
  7114. <=WM: (13368: R1 ^reward R955)
  7115. <=WM: (13353: I3 ^see 1)
  7116. <=WM: (13371: O1904 ^name predict-no)
  7117. <=WM: (13370: O1903 ^name predict-yes)
  7118. <=WM: (13369: R955 ^value 1)
  7119. --- Inner Elaboration Phase, active level 1 (S1) ---
  7120. Firing prefer*rvt*predict-yes*H0
  7121. -->
  7122. Firing rl*prefer*rvt*predict-yes*H0*5
  7123. -->
  7124. (S1 ^operator O1905 = 0.)
  7125. Firing prefer*rvt*predict-no*H0
  7126. -->
  7127. Firing rl*prefer*rvt*predict-no*H0*6
  7128. -->
  7129. (S1 ^operator O1906 = 0.9999999999999999)
  7130. inner elaboration loop at bottom goal.
  7131. Retracting rl*prefer*rvt*predict-no*H0*6
  7132. -->
  7133. (S1 ^operator O1904 = 0.9999999999999999)
  7134. Retracting rl*prefer*rvt*predict-yes*H0*5
  7135. -->
  7136. (S1 ^operator O1903 = 0.)
  7137. --- END Proposal Phase ---
  7138. --- Decision Phase ---
  7139. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.913043,0.0798289)
  7140. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376481 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  7141. =>WM: (13389: S1 ^operator O1906)
  7142. 953: O: O1906 (predict-no)
  7143. --- END Decision Phase ---
  7144. --- Application Phase ---
  7145. --- Firing Productions (PE) For State At Depth 1 ---
  7146. --- Inner Elaboration Phase, active level 1 (S1) ---
  7147. Firing apply*operator
  7148. -->
  7149. (I3 ^predict-no N953 + :O )
  7150. Firing apply*operator*complete
  7151. -->
  7152. (I3 ^predict-no N952 - :O )
  7153. inner elaboration loop at bottom goal.
  7154. --- Change Working Memory (PE) ---
  7155. =>WM: (13390: I3 ^predict-no N953)
  7156. <=WM: (13376: N952 ^status complete)
  7157. <=WM: (13375: I3 ^predict-no N952)
  7158. --- Firing Productions (IE) For State At Depth 1 ---
  7159. --- Inner Elaboration Phase, active level 1 (S1) ---
  7160. Firing monitor*world
  7161. -->
  7162. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7163. --- Change Working Memory (IE) ---
  7164. --- END Application Phase ---
  7165. --- Output Phase ---
  7166. ENV: Agent did: predict-no for direction U in state State-A
  7167. In State-A moving U
  7168. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7169. predict error 0
  7170. dir: dir isL
  7171. --- END Output Phase ---
  7172. --- Input Phase ---
  7173. =>WM: (13394: I2 ^dir L)
  7174. =>WM: (13393: I2 ^reward 1)
  7175. =>WM: (13392: I2 ^see 0)
  7176. =>WM: (13391: N953 ^status complete)
  7177. <=WM: (13379: I2 ^dir U)
  7178. <=WM: (13378: I2 ^reward 1)
  7179. <=WM: (13377: I2 ^see 0)
  7180. =>WM: (13395: I2 ^level-1 L0-root)
  7181. <=WM: (13380: I2 ^level-1 L0-root)
  7182. --- END Input Phase ---
  7183. --- Proposal Phase ---
  7184. --- Inner Elaboration Phase, active level 1 (S1) ---
  7185. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7186. -->
  7187. (S1 ^operator O1905 = 0.3)
  7188. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7189. -->
  7190. (S1 ^operator O1906 = 0.7449868594607382)
  7191. Firing prefer*rvt*predict-no*H0*2*H1
  7192. -->
  7193. Firing prefer*rvt*predict-yes*H0*1*H1
  7194. -->
  7195. Firing elaborate*copy-see-to-output-link
  7196. -->
  7197. (I3 ^see 0 +)
  7198. Firing elaborate*reward*based*on*reward
  7199. -->
  7200. (R957 ^value 1 +)
  7201. (R1 ^reward R957 +)
  7202. Firing propose*predict-yes
  7203. -->
  7204. (O1907 ^name predict-yes +)
  7205. (S1 ^operator O1907 +)
  7206. Firing propose*predict-no
  7207. -->
  7208. (O1908 ^name predict-no +)
  7209. (S1 ^operator O1908 +)
  7210. Firing rl*prefer*rvt*predict-no*H0*2
  7211. -->
  7212. (S1 ^operator O1906 = 0.2550133486219575)
  7213. Firing rl*prefer*rvt*predict-yes*H0*1
  7214. -->
  7215. (S1 ^operator O1905 = 0.5231208125838516)
  7216. Firing prefer*rvt*predict-yes*H0
  7217. -->
  7218. Firing prefer*rvt*predict-no*H0
  7219. -->
  7220. Firing elaborate*copy-dir-to-output-link
  7221. -->
  7222. (I3 ^dir L +)
  7223. inner elaboration loop at bottom goal.
  7224. Retracting elaborate*copy-see-to-output-link
  7225. -->
  7226. (I3 ^see 0 +)
  7227. Retracting propose*predict-no
  7228. -->
  7229. (O1906 ^name predict-no +)
  7230. (S1 ^operator O1906 +)
  7231. Retracting propose*predict-yes
  7232. -->
  7233. (O1905 ^name predict-yes +)
  7234. (S1 ^operator O1905 +)
  7235. Retracting elaborate*reward*based*on*reward
  7236. -->
  7237. (R956 ^value 1 +)
  7238. (R1 ^reward R956 +)
  7239. Retracting elaborate*copy-dir-to-output-link
  7240. -->
  7241. (I3 ^dir U +)
  7242. Retracting rl*prefer*rvt*predict-no*H0*6
  7243. -->
  7244. (S1 ^operator O1906 = 0.9999999999999999)
  7245. Retracting rl*prefer*rvt*predict-yes*H0*5
  7246. -->
  7247. (S1 ^operator O1905 = 0.)
  7248. =>WM: (13402: S1 ^operator O1908 +)
  7249. =>WM: (13401: S1 ^operator O1907 +)
  7250. =>WM: (13400: I3 ^dir L)
  7251. =>WM: (13399: O1908 ^name predict-no)
  7252. =>WM: (13398: O1907 ^name predict-yes)
  7253. =>WM: (13397: R957 ^value 1)
  7254. =>WM: (13396: R1 ^reward R957)
  7255. <=WM: (13387: S1 ^operator O1905 +)
  7256. <=WM: (13388: S1 ^operator O1906 +)
  7257. <=WM: (13389: S1 ^operator O1906)
  7258. <=WM: (13386: I3 ^dir U)
  7259. <=WM: (13382: R1 ^reward R956)
  7260. <=WM: (13385: O1906 ^name predict-no)
  7261. <=WM: (13384: O1905 ^name predict-yes)
  7262. <=WM: (13383: R956 ^value 1)
  7263. --- Inner Elaboration Phase, active level 1 (S1) ---
  7264. Firing prefer*rvt*predict-yes*H0
  7265. -->
  7266. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7267. -->
  7268. (S1 ^operator O1907 = 0.3)
  7269. Firing rl*prefer*rvt*predict-yes*H0*1
  7270. -->
  7271. (S1 ^operator O1907 = 0.5231208125838516)
  7272. Firing prefer*rvt*predict-yes*H0*1*H1
  7273. -->
  7274. Firing prefer*rvt*predict-no*H0
  7275. -->
  7276. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7277. -->
  7278. (S1 ^operator O1908 = 0.7449868594607382)
  7279. Firing rl*prefer*rvt*predict-no*H0*2
  7280. -->
  7281. (S1 ^operator O1908 = 0.2550133486219575)
  7282. Firing prefer*rvt*predict-no*H0*2*H1
  7283. -->
  7284. inner elaboration loop at bottom goal.
  7285. Retracting rl*prefer*rvt*predict-no*H0*2
  7286. -->
  7287. (S1 ^operator O1906 = 0.2550133486219575)
  7288. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7289. -->
  7290. (S1 ^operator O1906 = 0.7449868594607382)
  7291. Retracting rl*prefer*rvt*predict-yes*H0*1
  7292. -->
  7293. (S1 ^operator O1905 = 0.5231208125838516)
  7294. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7295. -->
  7296. (S1 ^operator O1905 = 0.3)
  7297. --- END Proposal Phase ---
  7298. --- Decision Phase ---
  7299. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7300. =>WM: (13403: S1 ^operator O1908)
  7301. 954: O: O1908 (predict-no)
  7302. --- END Decision Phase ---
  7303. --- Application Phase ---
  7304. --- Firing Productions (PE) For State At Depth 1 ---
  7305. --- Inner Elaboration Phase, active level 1 (S1) ---
  7306. Firing apply*operator
  7307. -->
  7308. (I3 ^predict-no N954 + :O )
  7309. Firing apply*operator*complete
  7310. -->
  7311. (I3 ^predict-no N953 - :O )
  7312. inner elaboration loop at bottom goal.
  7313. --- Change Working Memory (PE) ---
  7314. =>WM: (13404: I3 ^predict-no N954)
  7315. <=WM: (13391: N953 ^status complete)
  7316. <=WM: (13390: I3 ^predict-no N953)
  7317. --- Firing Productions (IE) For State At Depth 1 ---
  7318. --- Inner Elaboration Phase, active level 1 (S1) ---
  7319. Firing monitor*world
  7320. -->
  7321. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7322. --- Change Working Memory (IE) ---
  7323. --- END Application Phase ---
  7324. --- Output Phase ---
  7325. ENV: Agent did: predict-no for direction L in state State-A
  7326. In State-A moving L
  7327. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7328. predict error 0
  7329. dir: dir isL
  7330. --- END Output Phase ---
  7331. --- Input Phase ---
  7332. =>WM: (13408: I2 ^dir L)
  7333. =>WM: (13407: I2 ^reward 1)
  7334. =>WM: (13406: I2 ^see 0)
  7335. =>WM: (13405: N954 ^status complete)
  7336. <=WM: (13394: I2 ^dir L)
  7337. <=WM: (13393: I2 ^reward 1)
  7338. <=WM: (13392: I2 ^see 0)
  7339. =>WM: (13409: I2 ^level-1 L0-root)
  7340. <=WM: (13395: I2 ^level-1 L0-root)
  7341. --- END Input Phase ---
  7342. --- Proposal Phase ---
  7343. --- Inner Elaboration Phase, active level 1 (S1) ---
  7344. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7345. -->
  7346. (S1 ^operator O1907 = 0.3)
  7347. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7348. -->
  7349. (S1 ^operator O1908 = 0.7449868594607382)
  7350. Firing prefer*rvt*predict-no*H0*2*H1
  7351. -->
  7352. Firing prefer*rvt*predict-yes*H0*1*H1
  7353. -->
  7354. Firing elaborate*copy-see-to-output-link
  7355. -->
  7356. (I3 ^see 0 +)
  7357. Firing elaborate*reward*based*on*reward
  7358. -->
  7359. (R958 ^value 1 +)
  7360. (R1 ^reward R958 +)
  7361. Firing propose*predict-yes
  7362. -->
  7363. (O1909 ^name predict-yes +)
  7364. (S1 ^operator O1909 +)
  7365. Firing propose*predict-no
  7366. -->
  7367. (O1910 ^name predict-no +)
  7368. (S1 ^operator O1910 +)
  7369. Firing rl*prefer*rvt*predict-no*H0*2
  7370. -->
  7371. (S1 ^operator O1908 = 0.2550133486219575)
  7372. Firing rl*prefer*rvt*predict-yes*H0*1
  7373. -->
  7374. (S1 ^operator O1907 = 0.5231208125838516)
  7375. Firing prefer*rvt*predict-yes*H0
  7376. -->
  7377. Firing prefer*rvt*predict-no*H0
  7378. -->
  7379. Firing elaborate*copy-dir-to-output-link
  7380. -->
  7381. (I3 ^dir L +)
  7382. inner elaboration loop at bottom goal.
  7383. Retracting elaborate*copy-see-to-output-link
  7384. -->
  7385. (I3 ^see 0 +)
  7386. Retracting propose*predict-no
  7387. -->
  7388. (O1908 ^name predict-no +)
  7389. (S1 ^operator O1908 +)
  7390. Retracting propose*predict-yes
  7391. -->
  7392. (O1907 ^name predict-yes +)
  7393. (S1 ^operator O1907 +)
  7394. Retracting elaborate*reward*based*on*reward
  7395. -->
  7396. (R957 ^value 1 +)
  7397. (R1 ^reward R957 +)
  7398. Retracting elaborate*copy-dir-to-output-link
  7399. -->
  7400. (I3 ^dir L +)
  7401. Retracting rl*prefer*rvt*predict-no*H0*2
  7402. -->
  7403. (S1 ^operator O1908 = 0.2550133486219575)
  7404. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7405. -->
  7406. (S1 ^operator O1908 = 0.7449868594607382)
  7407. Retracting rl*prefer*rvt*predict-yes*H0*1
  7408. -->
  7409. (S1 ^operator O1907 = 0.5231208125838516)
  7410. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7411. -->
  7412. (S1 ^operator O1907 = 0.3)
  7413. =>WM: (13415: S1 ^operator O1910 +)
  7414. =>WM: (13414: S1 ^operator O1909 +)
  7415. =>WM: (13413: O1910 ^name predict-no)
  7416. =>WM: (13412: O1909 ^name predict-yes)
  7417. =>WM: (13411: R958 ^value 1)
  7418. =>WM: (13410: R1 ^reward R958)
  7419. <=WM: (13401: S1 ^operator O1907 +)
  7420. <=WM: (13402: S1 ^operator O1908 +)
  7421. <=WM: (13403: S1 ^operator O1908)
  7422. <=WM: (13396: R1 ^reward R957)
  7423. <=WM: (13399: O1908 ^name predict-no)
  7424. <=WM: (13398: O1907 ^name predict-yes)
  7425. <=WM: (13397: R957 ^value 1)
  7426. --- Inner Elaboration Phase, active level 1 (S1) ---
  7427. Firing prefer*rvt*predict-yes*H0
  7428. -->
  7429. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7430. -->
  7431. (S1 ^operator O1909 = 0.3)
  7432. Firing rl*prefer*rvt*predict-yes*H0*1
  7433. -->
  7434. (S1 ^operator O1909 = 0.5231208125838516)
  7435. Firing prefer*rvt*predict-yes*H0*1*H1
  7436. -->
  7437. Firing prefer*rvt*predict-no*H0
  7438. -->
  7439. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7440. -->
  7441. (S1 ^operator O1910 = 0.7449868594607382)
  7442. Firing rl*prefer*rvt*predict-no*H0*2
  7443. -->
  7444. (S1 ^operator O1910 = 0.2550133486219575)
  7445. Firing prefer*rvt*predict-no*H0*2*H1
  7446. -->
  7447. inner elaboration loop at bottom goal.
  7448. Retracting rl*prefer*rvt*predict-no*H0*2
  7449. -->
  7450. (S1 ^operator O1908 = 0.2550133486219575)
  7451. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7452. -->
  7453. (S1 ^operator O1908 = 0.7449868594607382)
  7454. Retracting rl*prefer*rvt*predict-yes*H0*1
  7455. -->
  7456. (S1 ^operator O1907 = 0.5231208125838516)
  7457. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7458. -->
  7459. (S1 ^operator O1907 = 0.3)
  7460. --- END Proposal Phase ---
  7461. --- Decision Phase ---
  7462. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.913514,0.079436)
  7463. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  7464. =>WM: (13416: S1 ^operator O1910)
  7465. 955: O: O1910 (predict-no)
  7466. --- END Decision Phase ---
  7467. --- Application Phase ---
  7468. --- Firing Productions (PE) For State At Depth 1 ---
  7469. --- Inner Elaboration Phase, active level 1 (S1) ---
  7470. Firing apply*operator
  7471. -->
  7472. (I3 ^predict-no N955 + :O )
  7473. Firing apply*operator*complete
  7474. -->
  7475. (I3 ^predict-no N954 - :O )
  7476. inner elaboration loop at bottom goal.
  7477. --- Change Working Memory (PE) ---
  7478. =>WM: (13417: I3 ^predict-no N955)
  7479. <=WM: (13405: N954 ^status complete)
  7480. <=WM: (13404: I3 ^predict-no N954)
  7481. --- Firing Productions (IE) For State At Depth 1 ---
  7482. --- Inner Elaboration Phase, active level 1 (S1) ---
  7483. Firing monitor*world
  7484. -->
  7485. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7486. --- Change Working Memory (IE) ---
  7487. --- END Application Phase ---
  7488. --- Output Phase ---
  7489. ENV: Agent did: predict-no for direction L in state State-A
  7490. In State-A moving L
  7491. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7492. predict error 0
  7493. dir: dir isU
  7494. --- END Output Phase ---
  7495. --- Input Phase ---
  7496. =>WM: (13421: I2 ^dir U)
  7497. =>WM: (13420: I2 ^reward 1)
  7498. =>WM: (13419: I2 ^see 0)
  7499. =>WM: (13418: N955 ^status complete)
  7500. <=WM: (13408: I2 ^dir L)
  7501. <=WM: (13407: I2 ^reward 1)
  7502. <=WM: (13406: I2 ^see 0)
  7503. =>WM: (13422: I2 ^level-1 L0-root)
  7504. <=WM: (13409: I2 ^level-1 L0-root)
  7505. --- END Input Phase ---
  7506. --- Proposal Phase ---
  7507. --- Inner Elaboration Phase, active level 1 (S1) ---
  7508. Firing elaborate*copy-see-to-output-link
  7509. -->
  7510. (I3 ^see 0 +)
  7511. Firing elaborate*reward*based*on*reward
  7512. -->
  7513. (R959 ^value 1 +)
  7514. (R1 ^reward R959 +)
  7515. Firing propose*predict-yes
  7516. -->
  7517. (O1911 ^name predict-yes +)
  7518. (S1 ^operator O1911 +)
  7519. Firing propose*predict-no
  7520. -->
  7521. (O1912 ^name predict-no +)
  7522. (S1 ^operator O1912 +)
  7523. Firing rl*prefer*rvt*predict-no*H0*6
  7524. -->
  7525. (S1 ^operator O1910 = 0.9999999999999999)
  7526. Firing rl*prefer*rvt*predict-yes*H0*5
  7527. -->
  7528. (S1 ^operator O1909 = 0.)
  7529. Firing prefer*rvt*predict-yes*H0
  7530. -->
  7531. Firing prefer*rvt*predict-no*H0
  7532. -->
  7533. Firing elaborate*copy-dir-to-output-link
  7534. -->
  7535. (I3 ^dir U +)
  7536. inner elaboration loop at bottom goal.
  7537. Retracting elaborate*copy-see-to-output-link
  7538. -->
  7539. (I3 ^see 0 +)
  7540. Retracting propose*predict-no
  7541. -->
  7542. (O1910 ^name predict-no +)
  7543. (S1 ^operator O1910 +)
  7544. Retracting propose*predict-yes
  7545. -->
  7546. (O1909 ^name predict-yes +)
  7547. (S1 ^operator O1909 +)
  7548. Retracting elaborate*reward*based*on*reward
  7549. -->
  7550. (R958 ^value 1 +)
  7551. (R1 ^reward R958 +)
  7552. Retracting elaborate*copy-dir-to-output-link
  7553. -->
  7554. (I3 ^dir L +)
  7555. Retracting rl*prefer*rvt*predict-no*H0*2
  7556. -->
  7557. (S1 ^operator O1910 = 0.2550133174095531)
  7558. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7559. -->
  7560. (S1 ^operator O1910 = 0.7449868282483338)
  7561. Retracting rl*prefer*rvt*predict-yes*H0*1
  7562. -->
  7563. (S1 ^operator O1909 = 0.5231208125838516)
  7564. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7565. -->
  7566. (S1 ^operator O1909 = 0.3)
  7567. =>WM: (13429: S1 ^operator O1912 +)
  7568. =>WM: (13428: S1 ^operator O1911 +)
  7569. =>WM: (13427: I3 ^dir U)
  7570. =>WM: (13426: O1912 ^name predict-no)
  7571. =>WM: (13425: O1911 ^name predict-yes)
  7572. =>WM: (13424: R959 ^value 1)
  7573. =>WM: (13423: R1 ^reward R959)
  7574. <=WM: (13414: S1 ^operator O1909 +)
  7575. <=WM: (13415: S1 ^operator O1910 +)
  7576. <=WM: (13416: S1 ^operator O1910)
  7577. <=WM: (13400: I3 ^dir L)
  7578. <=WM: (13410: R1 ^reward R958)
  7579. <=WM: (13413: O1910 ^name predict-no)
  7580. <=WM: (13412: O1909 ^name predict-yes)
  7581. <=WM: (13411: R958 ^value 1)
  7582. --- Inner Elaboration Phase, active level 1 (S1) ---
  7583. Firing prefer*rvt*predict-yes*H0
  7584. -->
  7585. Firing rl*prefer*rvt*predict-yes*H0*5
  7586. -->
  7587. (S1 ^operator O1911 = 0.)
  7588. Firing prefer*rvt*predict-no*H0
  7589. -->
  7590. Firing rl*prefer*rvt*predict-no*H0*6
  7591. -->
  7592. (S1 ^operator O1912 = 0.9999999999999999)
  7593. inner elaboration loop at bottom goal.
  7594. Retracting rl*prefer*rvt*predict-no*H0*6
  7595. -->
  7596. (S1 ^operator O1910 = 0.9999999999999999)
  7597. Retracting rl*prefer*rvt*predict-yes*H0*5
  7598. -->
  7599. (S1 ^operator O1909 = 0.)
  7600. --- END Proposal Phase ---
  7601. --- Decision Phase ---
  7602. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.913978,0.0790468)
  7603. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  7604. =>WM: (13430: S1 ^operator O1912)
  7605. 956: O: O1912 (predict-no)
  7606. --- END Decision Phase ---
  7607. --- Application Phase ---
  7608. --- Firing Productions (PE) For State At Depth 1 ---
  7609. --- Inner Elaboration Phase, active level 1 (S1) ---
  7610. Firing apply*operator
  7611. -->
  7612. (I3 ^predict-no N956 + :O )
  7613. Firing apply*operator*complete
  7614. -->
  7615. (I3 ^predict-no N955 - :O )
  7616. inner elaboration loop at bottom goal.
  7617. --- Change Working Memory (PE) ---
  7618. =>WM: (13431: I3 ^predict-no N956)
  7619. <=WM: (13418: N955 ^status complete)
  7620. <=WM: (13417: I3 ^predict-no N955)
  7621. --- Firing Productions (IE) For State At Depth 1 ---
  7622. --- Inner Elaboration Phase, active level 1 (S1) ---
  7623. Firing monitor*world
  7624. -->
  7625. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7626. --- Change Working Memory (IE) ---
  7627. --- END Application Phase ---
  7628. --- Output Phase ---
  7629. ENV: Agent did: predict-no for direction U in state State-A
  7630. In State-A moving U
  7631. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7632. predict error 0
  7633. dir: dir isU
  7634. --- END Output Phase ---
  7635. --- Input Phase ---
  7636. =>WM: (13435: I2 ^dir U)
  7637. =>WM: (13434: I2 ^reward 1)
  7638. =>WM: (13433: I2 ^see 0)
  7639. =>WM: (13432: N956 ^status complete)
  7640. <=WM: (13421: I2 ^dir U)
  7641. <=WM: (13420: I2 ^reward 1)
  7642. <=WM: (13419: I2 ^see 0)
  7643. =>WM: (13436: I2 ^level-1 L0-root)
  7644. <=WM: (13422: I2 ^level-1 L0-root)
  7645. --- END Input Phase ---
  7646. --- Proposal Phase ---
  7647. --- Inner Elaboration Phase, active level 1 (S1) ---
  7648. Firing elaborate*copy-see-to-output-link
  7649. -->
  7650. (I3 ^see 0 +)
  7651. Firing elaborate*reward*based*on*reward
  7652. -->
  7653. (R960 ^value 1 +)
  7654. (R1 ^reward R960 +)
  7655. Firing propose*predict-yes
  7656. -->
  7657. (O1913 ^name predict-yes +)
  7658. (S1 ^operator O1913 +)
  7659. Firing propose*predict-no
  7660. -->
  7661. (O1914 ^name predict-no +)
  7662. (S1 ^operator O1914 +)
  7663. Firing rl*prefer*rvt*predict-no*H0*6
  7664. -->
  7665. (S1 ^operator O1912 = 0.9999999999999999)
  7666. Firing rl*prefer*rvt*predict-yes*H0*5
  7667. -->
  7668. (S1 ^operator O1911 = 0.)
  7669. Firing prefer*rvt*predict-yes*H0
  7670. -->
  7671. Firing prefer*rvt*predict-no*H0
  7672. -->
  7673. Firing elaborate*copy-dir-to-output-link
  7674. -->
  7675. (I3 ^dir U +)
  7676. inner elaboration loop at bottom goal.
  7677. Retracting elaborate*copy-see-to-output-link
  7678. -->
  7679. (I3 ^see 0 +)
  7680. Retracting propose*predict-no
  7681. -->
  7682. (O1912 ^name predict-no +)
  7683. (S1 ^operator O1912 +)
  7684. Retracting propose*predict-yes
  7685. -->
  7686. (O1911 ^name predict-yes +)
  7687. (S1 ^operator O1911 +)
  7688. Retracting elaborate*reward*based*on*reward
  7689. -->
  7690. (R959 ^value 1 +)
  7691. (R1 ^reward R959 +)
  7692. Retracting elaborate*copy-dir-to-output-link
  7693. -->
  7694. (I3 ^dir U +)
  7695. Retracting rl*prefer*rvt*predict-no*H0*6
  7696. -->
  7697. (S1 ^operator O1912 = 0.9999999999999999)
  7698. Retracting rl*prefer*rvt*predict-yes*H0*5
  7699. -->
  7700. (S1 ^operator O1911 = 0.)
  7701. =>WM: (13442: S1 ^operator O1914 +)
  7702. =>WM: (13441: S1 ^operator O1913 +)
  7703. =>WM: (13440: O1914 ^name predict-no)
  7704. =>WM: (13439: O1913 ^name predict-yes)
  7705. =>WM: (13438: R960 ^value 1)
  7706. =>WM: (13437: R1 ^reward R960)
  7707. <=WM: (13428: S1 ^operator O1911 +)
  7708. <=WM: (13429: S1 ^operator O1912 +)
  7709. <=WM: (13430: S1 ^operator O1912)
  7710. <=WM: (13423: R1 ^reward R959)
  7711. <=WM: (13426: O1912 ^name predict-no)
  7712. <=WM: (13425: O1911 ^name predict-yes)
  7713. <=WM: (13424: R959 ^value 1)
  7714. --- Inner Elaboration Phase, active level 1 (S1) ---
  7715. Firing prefer*rvt*predict-yes*H0
  7716. -->
  7717. Firing rl*prefer*rvt*predict-yes*H0*5
  7718. -->
  7719. (S1 ^operator O1913 = 0.)
  7720. Firing prefer*rvt*predict-no*H0
  7721. -->
  7722. Firing rl*prefer*rvt*predict-no*H0*6
  7723. -->
  7724. (S1 ^operator O1914 = 0.9999999999999999)
  7725. inner elaboration loop at bottom goal.
  7726. Retracting rl*prefer*rvt*predict-no*H0*6
  7727. -->
  7728. (S1 ^operator O1912 = 0.9999999999999999)
  7729. Retracting rl*prefer*rvt*predict-yes*H0*5
  7730. -->
  7731. (S1 ^operator O1911 = 0.)
  7732. --- END Proposal Phase ---
  7733. --- Decision Phase ---
  7734. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7735. =>WM: (13443: S1 ^operator O1914)
  7736. 957: O: O1914 (predict-no)
  7737. --- END Decision Phase ---
  7738. --- Application Phase ---
  7739. --- Firing Productions (PE) For State At Depth 1 ---
  7740. --- Inner Elaboration Phase, active level 1 (S1) ---
  7741. Firing apply*operator
  7742. -->
  7743. (I3 ^predict-no N957 + :O )
  7744. Firing apply*operator*complete
  7745. -->
  7746. (I3 ^predict-no N956 - :O )
  7747. inner elaboration loop at bottom goal.
  7748. --- Change Working Memory (PE) ---
  7749. =>WM: (13444: I3 ^predict-no N957)
  7750. <=WM: (13432: N956 ^status complete)
  7751. <=WM: (13431: I3 ^predict-no N956)
  7752. --- Firing Productions (IE) For State At Depth 1 ---
  7753. --- Inner Elaboration Phase, active level 1 (S1) ---
  7754. Firing monitor*world
  7755. -->
  7756. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7757. --- Change Working Memory (IE) ---
  7758. --- END Application Phase ---
  7759. --- Output Phase ---
  7760. ENV: Agent did: predict-no for direction U in state State-A
  7761. In State-A moving U
  7762. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7763. predict error 0
  7764. dir: dir isL
  7765. --- END Output Phase ---
  7766. --- Input Phase ---
  7767. =>WM: (13448: I2 ^dir L)
  7768. =>WM: (13447: I2 ^reward 1)
  7769. =>WM: (13446: I2 ^see 0)
  7770. =>WM: (13445: N957 ^status complete)
  7771. <=WM: (13435: I2 ^dir U)
  7772. <=WM: (13434: I2 ^reward 1)
  7773. <=WM: (13433: I2 ^see 0)
  7774. =>WM: (13449: I2 ^level-1 L0-root)
  7775. <=WM: (13436: I2 ^level-1 L0-root)
  7776. --- END Input Phase ---
  7777. --- Proposal Phase ---
  7778. --- Inner Elaboration Phase, active level 1 (S1) ---
  7779. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7780. -->
  7781. (S1 ^operator O1913 = 0.3)
  7782. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7783. -->
  7784. (S1 ^operator O1914 = 0.7449868063996508)
  7785. Firing prefer*rvt*predict-no*H0*2*H1
  7786. -->
  7787. Firing prefer*rvt*predict-yes*H0*1*H1
  7788. -->
  7789. Firing elaborate*copy-see-to-output-link
  7790. -->
  7791. (I3 ^see 0 +)
  7792. Firing elaborate*reward*based*on*reward
  7793. -->
  7794. (R961 ^value 1 +)
  7795. (R1 ^reward R961 +)
  7796. Firing propose*predict-yes
  7797. -->
  7798. (O1915 ^name predict-yes +)
  7799. (S1 ^operator O1915 +)
  7800. Firing propose*predict-no
  7801. -->
  7802. (O1916 ^name predict-no +)
  7803. (S1 ^operator O1916 +)
  7804. Firing rl*prefer*rvt*predict-no*H0*2
  7805. -->
  7806. (S1 ^operator O1914 = 0.2550132955608701)
  7807. Firing rl*prefer*rvt*predict-yes*H0*1
  7808. -->
  7809. (S1 ^operator O1913 = 0.5231208125838516)
  7810. Firing prefer*rvt*predict-yes*H0
  7811. -->
  7812. Firing prefer*rvt*predict-no*H0
  7813. -->
  7814. Firing elaborate*copy-dir-to-output-link
  7815. -->
  7816. (I3 ^dir L +)
  7817. inner elaboration loop at bottom goal.
  7818. Retracting elaborate*copy-see-to-output-link
  7819. -->
  7820. (I3 ^see 0 +)
  7821. Retracting propose*predict-no
  7822. -->
  7823. (O1914 ^name predict-no +)
  7824. (S1 ^operator O1914 +)
  7825. Retracting propose*predict-yes
  7826. -->
  7827. (O1913 ^name predict-yes +)
  7828. (S1 ^operator O1913 +)
  7829. Retracting elaborate*reward*based*on*reward
  7830. -->
  7831. (R960 ^value 1 +)
  7832. (R1 ^reward R960 +)
  7833. Retracting elaborate*copy-dir-to-output-link
  7834. -->
  7835. (I3 ^dir U +)
  7836. Retracting rl*prefer*rvt*predict-no*H0*6
  7837. -->
  7838. (S1 ^operator O1914 = 0.9999999999999999)
  7839. Retracting rl*prefer*rvt*predict-yes*H0*5
  7840. -->
  7841. (S1 ^operator O1913 = 0.)
  7842. =>WM: (13456: S1 ^operator O1916 +)
  7843. =>WM: (13455: S1 ^operator O1915 +)
  7844. =>WM: (13454: I3 ^dir L)
  7845. =>WM: (13453: O1916 ^name predict-no)
  7846. =>WM: (13452: O1915 ^name predict-yes)
  7847. =>WM: (13451: R961 ^value 1)
  7848. =>WM: (13450: R1 ^reward R961)
  7849. <=WM: (13441: S1 ^operator O1913 +)
  7850. <=WM: (13442: S1 ^operator O1914 +)
  7851. <=WM: (13443: S1 ^operator O1914)
  7852. <=WM: (13427: I3 ^dir U)
  7853. <=WM: (13437: R1 ^reward R960)
  7854. <=WM: (13440: O1914 ^name predict-no)
  7855. <=WM: (13439: O1913 ^name predict-yes)
  7856. <=WM: (13438: R960 ^value 1)
  7857. --- Inner Elaboration Phase, active level 1 (S1) ---
  7858. Firing prefer*rvt*predict-yes*H0
  7859. -->
  7860. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  7861. -->
  7862. (S1 ^operator O1915 = 0.3)
  7863. Firing rl*prefer*rvt*predict-yes*H0*1
  7864. -->
  7865. (S1 ^operator O1915 = 0.5231208125838516)
  7866. Firing prefer*rvt*predict-yes*H0*1*H1
  7867. -->
  7868. Firing prefer*rvt*predict-no*H0
  7869. -->
  7870. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  7871. -->
  7872. (S1 ^operator O1916 = 0.7449868063996508)
  7873. Firing rl*prefer*rvt*predict-no*H0*2
  7874. -->
  7875. (S1 ^operator O1916 = 0.2550132955608701)
  7876. Firing prefer*rvt*predict-no*H0*2*H1
  7877. -->
  7878. inner elaboration loop at bottom goal.
  7879. Retracting rl*prefer*rvt*predict-no*H0*2
  7880. -->
  7881. (S1 ^operator O1914 = 0.2550132955608701)
  7882. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7883. -->
  7884. (S1 ^operator O1914 = 0.7449868063996508)
  7885. Retracting rl*prefer*rvt*predict-yes*H0*1
  7886. -->
  7887. (S1 ^operator O1913 = 0.5231208125838516)
  7888. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7889. -->
  7890. (S1 ^operator O1913 = 0.3)
  7891. --- END Proposal Phase ---
  7892. --- Decision Phase ---
  7893. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7894. =>WM: (13457: S1 ^operator O1916)
  7895. 958: O: O1916 (predict-no)
  7896. --- END Decision Phase ---
  7897. --- Application Phase ---
  7898. --- Firing Productions (PE) For State At Depth 1 ---
  7899. --- Inner Elaboration Phase, active level 1 (S1) ---
  7900. Firing apply*operator
  7901. -->
  7902. (I3 ^predict-no N958 + :O )
  7903. Firing apply*operator*complete
  7904. -->
  7905. (I3 ^predict-no N957 - :O )
  7906. inner elaboration loop at bottom goal.
  7907. --- Change Working Memory (PE) ---
  7908. =>WM: (13458: I3 ^predict-no N958)
  7909. <=WM: (13445: N957 ^status complete)
  7910. <=WM: (13444: I3 ^predict-no N957)
  7911. --- Firing Productions (IE) For State At Depth 1 ---
  7912. --- Inner Elaboration Phase, active level 1 (S1) ---
  7913. Firing monitor*world
  7914. -->
  7915. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7916. --- Change Working Memory (IE) ---
  7917. --- END Application Phase ---
  7918. --- Output Phase ---
  7919. ENV: Agent did: predict-no for direction L in state State-A
  7920. In State-A moving L
  7921. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7922. predict error 0
  7923. dir: dir isU
  7924. --- END Output Phase ---
  7925. --- Input Phase ---
  7926. =>WM: (13462: I2 ^dir U)
  7927. =>WM: (13461: I2 ^reward 1)
  7928. =>WM: (13460: I2 ^see 0)
  7929. =>WM: (13459: N958 ^status complete)
  7930. <=WM: (13448: I2 ^dir L)
  7931. <=WM: (13447: I2 ^reward 1)
  7932. <=WM: (13446: I2 ^see 0)
  7933. =>WM: (13463: I2 ^level-1 L0-root)
  7934. <=WM: (13449: I2 ^level-1 L0-root)
  7935. --- END Input Phase ---
  7936. --- Proposal Phase ---
  7937. --- Inner Elaboration Phase, active level 1 (S1) ---
  7938. Firing elaborate*copy-see-to-output-link
  7939. -->
  7940. (I3 ^see 0 +)
  7941. Firing elaborate*reward*based*on*reward
  7942. -->
  7943. (R962 ^value 1 +)
  7944. (R1 ^reward R962 +)
  7945. Firing propose*predict-yes
  7946. -->
  7947. (O1917 ^name predict-yes +)
  7948. (S1 ^operator O1917 +)
  7949. Firing propose*predict-no
  7950. -->
  7951. (O1918 ^name predict-no +)
  7952. (S1 ^operator O1918 +)
  7953. Firing rl*prefer*rvt*predict-no*H0*6
  7954. -->
  7955. (S1 ^operator O1916 = 0.9999999999999999)
  7956. Firing rl*prefer*rvt*predict-yes*H0*5
  7957. -->
  7958. (S1 ^operator O1915 = 0.)
  7959. Firing prefer*rvt*predict-yes*H0
  7960. -->
  7961. Firing prefer*rvt*predict-no*H0
  7962. -->
  7963. Firing elaborate*copy-dir-to-output-link
  7964. -->
  7965. (I3 ^dir U +)
  7966. inner elaboration loop at bottom goal.
  7967. Retracting elaborate*copy-see-to-output-link
  7968. -->
  7969. (I3 ^see 0 +)
  7970. Retracting propose*predict-no
  7971. -->
  7972. (O1916 ^name predict-no +)
  7973. (S1 ^operator O1916 +)
  7974. Retracting propose*predict-yes
  7975. -->
  7976. (O1915 ^name predict-yes +)
  7977. (S1 ^operator O1915 +)
  7978. Retracting elaborate*reward*based*on*reward
  7979. -->
  7980. (R961 ^value 1 +)
  7981. (R1 ^reward R961 +)
  7982. Retracting elaborate*copy-dir-to-output-link
  7983. -->
  7984. (I3 ^dir L +)
  7985. Retracting rl*prefer*rvt*predict-no*H0*2
  7986. -->
  7987. (S1 ^operator O1916 = 0.2550132955608701)
  7988. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  7989. -->
  7990. (S1 ^operator O1916 = 0.7449868063996508)
  7991. Retracting rl*prefer*rvt*predict-yes*H0*1
  7992. -->
  7993. (S1 ^operator O1915 = 0.5231208125838516)
  7994. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  7995. -->
  7996. (S1 ^operator O1915 = 0.3)
  7997. =>WM: (13470: S1 ^operator O1918 +)
  7998. =>WM: (13469: S1 ^operator O1917 +)
  7999. =>WM: (13468: I3 ^dir U)
  8000. =>WM: (13467: O1918 ^name predict-no)
  8001. =>WM: (13466: O1917 ^name predict-yes)
  8002. =>WM: (13465: R962 ^value 1)
  8003. =>WM: (13464: R1 ^reward R962)
  8004. <=WM: (13455: S1 ^operator O1915 +)
  8005. <=WM: (13456: S1 ^operator O1916 +)
  8006. <=WM: (13457: S1 ^operator O1916)
  8007. <=WM: (13454: I3 ^dir L)
  8008. <=WM: (13450: R1 ^reward R961)
  8009. <=WM: (13453: O1916 ^name predict-no)
  8010. <=WM: (13452: O1915 ^name predict-yes)
  8011. <=WM: (13451: R961 ^value 1)
  8012. --- Inner Elaboration Phase, active level 1 (S1) ---
  8013. Firing prefer*rvt*predict-yes*H0
  8014. -->
  8015. Firing rl*prefer*rvt*predict-yes*H0*5
  8016. -->
  8017. (S1 ^operator O1917 = 0.)
  8018. Firing prefer*rvt*predict-no*H0
  8019. -->
  8020. Firing rl*prefer*rvt*predict-no*H0*6
  8021. -->
  8022. (S1 ^operator O1918 = 0.9999999999999999)
  8023. inner elaboration loop at bottom goal.
  8024. Retracting rl*prefer*rvt*predict-no*H0*6
  8025. -->
  8026. (S1 ^operator O1916 = 0.9999999999999999)
  8027. Retracting rl*prefer*rvt*predict-yes*H0*5
  8028. -->
  8029. (S1 ^operator O1915 = 0.)
  8030. --- END Proposal Phase ---
  8031. --- Decision Phase ---
  8032. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.914439,0.0786614)
  8033. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  8034. =>WM: (13471: S1 ^operator O1918)
  8035. 959: O: O1918 (predict-no)
  8036. --- END Decision Phase ---
  8037. --- Application Phase ---
  8038. --- Firing Productions (PE) For State At Depth 1 ---
  8039. --- Inner Elaboration Phase, active level 1 (S1) ---
  8040. Firing apply*operator
  8041. -->
  8042. (I3 ^predict-no N959 + :O )
  8043. Firing apply*operator*complete
  8044. -->
  8045. (I3 ^predict-no N958 - :O )
  8046. inner elaboration loop at bottom goal.
  8047. --- Change Working Memory (PE) ---
  8048. =>WM: (13472: I3 ^predict-no N959)
  8049. <=WM: (13459: N958 ^status complete)
  8050. <=WM: (13458: I3 ^predict-no N958)
  8051. --- Firing Productions (IE) For State At Depth 1 ---
  8052. --- Inner Elaboration Phase, active level 1 (S1) ---
  8053. Firing monitor*world
  8054. -->
  8055. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8056. --- Change Working Memory (IE) ---
  8057. --- END Application Phase ---
  8058. --- Output Phase ---
  8059. ENV: Agent did: predict-no for direction U in state State-A
  8060. In State-A moving U
  8061. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8062. predict error 0
  8063. dir: dir isR
  8064. --- END Output Phase ---
  8065. --- Input Phase ---
  8066. =>WM: (13476: I2 ^dir R)
  8067. =>WM: (13475: I2 ^reward 1)
  8068. =>WM: (13474: I2 ^see 0)
  8069. =>WM: (13473: N959 ^status complete)
  8070. <=WM: (13462: I2 ^dir U)
  8071. <=WM: (13461: I2 ^reward 1)
  8072. <=WM: (13460: I2 ^see 0)
  8073. =>WM: (13477: I2 ^level-1 L0-root)
  8074. <=WM: (13463: I2 ^level-1 L0-root)
  8075. --- END Input Phase ---
  8076. --- Proposal Phase ---
  8077. --- Inner Elaboration Phase, active level 1 (S1) ---
  8078. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  8079. -->
  8080. (S1 ^operator O1917 = 0.6170827253998104)
  8081. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  8082. -->
  8083. (S1 ^operator O1918 = 0.4910065094545203)
  8084. Firing prefer*rvt*predict-no*H0*4*H1
  8085. -->
  8086. Firing prefer*rvt*predict-yes*H0*3*H1
  8087. -->
  8088. Firing elaborate*copy-see-to-output-link
  8089. -->
  8090. (I3 ^see 0 +)
  8091. Firing elaborate*reward*based*on*reward
  8092. -->
  8093. (R963 ^value 1 +)
  8094. (R1 ^reward R963 +)
  8095. Firing propose*predict-yes
  8096. -->
  8097. (O1919 ^name predict-yes +)
  8098. (S1 ^operator O1919 +)
  8099. Firing propose*predict-no
  8100. -->
  8101. (O1920 ^name predict-no +)
  8102. (S1 ^operator O1920 +)
  8103. Firing rl*prefer*rvt*predict-no*H0*4
  8104. -->
  8105. (S1 ^operator O1918 = 0.1269768259493387)
  8106. Firing rl*prefer*rvt*predict-yes*H0*3
  8107. -->
  8108. (S1 ^operator O1917 = 0.3829271874912855)
  8109. Firing prefer*rvt*predict-yes*H0
  8110. -->
  8111. Firing prefer*rvt*predict-no*H0
  8112. -->
  8113. Firing elaborate*copy-dir-to-output-link
  8114. -->
  8115. (I3 ^dir R +)
  8116. inner elaboration loop at bottom goal.
  8117. Retracting elaborate*copy-see-to-output-link
  8118. -->
  8119. (I3 ^see 0 +)
  8120. Retracting propose*predict-no
  8121. -->
  8122. (O1918 ^name predict-no +)
  8123. (S1 ^operator O1918 +)
  8124. Retracting propose*predict-yes
  8125. -->
  8126. (O1917 ^name predict-yes +)
  8127. (S1 ^operator O1917 +)
  8128. Retracting elaborate*reward*based*on*reward
  8129. -->
  8130. (R962 ^value 1 +)
  8131. (R1 ^reward R962 +)
  8132. Retracting elaborate*copy-dir-to-output-link
  8133. -->
  8134. (I3 ^dir U +)
  8135. Retracting rl*prefer*rvt*predict-no*H0*6
  8136. -->
  8137. (S1 ^operator O1918 = 0.9999999999999999)
  8138. Retracting rl*prefer*rvt*predict-yes*H0*5
  8139. -->
  8140. (S1 ^operator O1917 = 0.)
  8141. =>WM: (13484: S1 ^operator O1920 +)
  8142. =>WM: (13483: S1 ^operator O1919 +)
  8143. =>WM: (13482: I3 ^dir R)
  8144. =>WM: (13481: O1920 ^name predict-no)
  8145. =>WM: (13480: O1919 ^name predict-yes)
  8146. =>WM: (13479: R963 ^value 1)
  8147. =>WM: (13478: R1 ^reward R963)
  8148. <=WM: (13469: S1 ^operator O1917 +)
  8149. <=WM: (13470: S1 ^operator O1918 +)
  8150. <=WM: (13471: S1 ^operator O1918)
  8151. <=WM: (13468: I3 ^dir U)
  8152. <=WM: (13464: R1 ^reward R962)
  8153. <=WM: (13467: O1918 ^name predict-no)
  8154. <=WM: (13466: O1917 ^name predict-yes)
  8155. <=WM: (13465: R962 ^value 1)
  8156. --- Inner Elaboration Phase, active level 1 (S1) ---
  8157. Firing prefer*rvt*predict-yes*H0
  8158. -->
  8159. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  8160. -->
  8161. (S1 ^operator O1919 = 0.6170827253998104)
  8162. Firing rl*prefer*rvt*predict-yes*H0*3
  8163. -->
  8164. (S1 ^operator O1919 = 0.3829271874912855)
  8165. Firing prefer*rvt*predict-yes*H0*3*H1
  8166. -->
  8167. Firing prefer*rvt*predict-no*H0
  8168. -->
  8169. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  8170. -->
  8171. (S1 ^operator O1920 = 0.4910065094545203)
  8172. Firing rl*prefer*rvt*predict-no*H0*4
  8173. -->
  8174. (S1 ^operator O1920 = 0.1269768259493387)
  8175. Firing prefer*rvt*predict-no*H0*4*H1
  8176. -->
  8177. inner elaboration loop at bottom goal.
  8178. Retracting rl*prefer*rvt*predict-no*H0*4
  8179. -->
  8180. (S1 ^operator O1918 = 0.1269768259493387)
  8181. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  8182. -->
  8183. (S1 ^operator O1918 = 0.4910065094545203)
  8184. Retracting rl*prefer*rvt*predict-yes*H0*3
  8185. -->
  8186. (S1 ^operator O1917 = 0.3829271874912855)
  8187. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  8188. -->
  8189. (S1 ^operator O1917 = 0.6170827253998104)
  8190. --- END Proposal Phase ---
  8191. --- Decision Phase ---
  8192. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8193. =>WM: (13485: S1 ^operator O1919)
  8194. 960: O: O1919 (predict-yes)
  8195. --- END Decision Phase ---
  8196. --- Application Phase ---
  8197. --- Firing Productions (PE) For State At Depth 1 ---
  8198. --- Inner Elaboration Phase, active level 1 (S1) ---
  8199. Firing apply*operator
  8200. -->
  8201. (I3 ^predict-yes N960 + :O )
  8202. Firing apply*operator*complete
  8203. -->
  8204. (I3 ^predict-no N959 - :O )
  8205. inner elaboration loop at bottom goal.
  8206. --- Change Working Memory (PE) ---
  8207. =>WM: (13486: I3 ^predict-yes N960)
  8208. <=WM: (13473: N959 ^status complete)
  8209. <=WM: (13472: I3 ^predict-no N959)
  8210. --- Firing Productions (IE) For State At Depth 1 ---
  8211. --- Inner Elaboration Phase, active level 1 (S1) ---
  8212. Firing monitor*world
  8213. -->
  8214. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8215. --- Change Working Memory (IE) ---
  8216. --- END Application Phase ---
  8217. --- Output Phase ---
  8218. ENV: Agent did: predict-yes for direction R in state State-A
  8219. In State-A moving R
  8220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8221. predict error 0
  8222. dir: dir isR
  8223. --- END Output Phase ---
  8224. --- Input Phase ---
  8225. =>WM: (13490: I2 ^dir R)
  8226. =>WM: (13489: I2 ^reward 1)
  8227. =>WM: (13488: I2 ^see 1)
  8228. =>WM: (13487: N960 ^status complete)
  8229. <=WM: (13476: I2 ^dir R)
  8230. <=WM: (13475: I2 ^reward 1)
  8231. <=WM: (13474: I2 ^see 0)
  8232. =>WM: (13491: I2 ^level-1 R1-root)
  8233. <=WM: (13477: I2 ^level-1 L0-root)
  8234. --- END Input Phase ---
  8235. --- Proposal Phase ---
  8236. --- Inner Elaboration Phase, active level 1 (S1) ---
  8237. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  8238. -->
  8239. (S1 ^operator O1919 = 0.08783148430849691)
  8240. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  8241. -->
  8242. (S1 ^operator O1920 = 0.873023493232603)
  8243. Firing prefer*rvt*predict-no*H0*4*H1
  8244. -->
  8245. Firing prefer*rvt*predict-yes*H0*3*H1
  8246. -->
  8247. Firing elaborate*copy-see-to-output-link
  8248. -->
  8249. (I3 ^see 1 +)
  8250. Firing elaborate*reward*based*on*reward
  8251. -->
  8252. (R964 ^value 1 +)
  8253. (R1 ^reward R964 +)
  8254. Firing propose*predict-yes
  8255. -->
  8256. (O1921 ^name predict-yes +)
  8257. (S1 ^operator O1921 +)
  8258. Firing propose*predict-no
  8259. -->
  8260. (O1922 ^name predict-no +)
  8261. (S1 ^operator O1922 +)
  8262. Firing rl*prefer*rvt*predict-no*H0*4
  8263. -->
  8264. (S1 ^operator O1920 = 0.1269768259493387)
  8265. Firing rl*prefer*rvt*predict-yes*H0*3
  8266. -->
  8267. (S1 ^operator O1919 = 0.3829271874912855)
  8268. Firing prefer*rvt*predict-yes*H0
  8269. -->
  8270. Firing prefer*rvt*predict-no*H0
  8271. -->
  8272. Firing elaborate*copy-dir-to-output-link
  8273. -->
  8274. (I3 ^dir R +)
  8275. inner elaboration loop at bottom goal.
  8276. Retracting elaborate*copy-see-to-output-link
  8277. -->
  8278. (I3 ^see 0 +)
  8279. Retracting propose*predict-no
  8280. -->
  8281. (O1920 ^name predict-no +)
  8282. (S1 ^operator O1920 +)
  8283. Retracting propose*predict-yes
  8284. -->
  8285. (O1919 ^name predict-yes +)
  8286. (S1 ^operator O1919 +)
  8287. Retracting elaborate*reward*based*on*reward
  8288. -->
  8289. (R963 ^value 1 +)
  8290. (R1 ^reward R963 +)
  8291. Retracting elaborate*copy-dir-to-output-link
  8292. -->
  8293. (I3 ^dir R +)
  8294. Retracting rl*prefer*rvt*predict-no*H0*4
  8295. -->
  8296. (S1 ^operator O1920 = 0.1269768259493387)
  8297. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  8298. -->
  8299. (S1 ^operator O1920 = 0.4910065094545203)
  8300. Retracting rl*prefer*rvt*predict-yes*H0*3
  8301. -->
  8302. (S1 ^operator O1919 = 0.3829271874912855)
  8303. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  8304. -->
  8305. (S1 ^operator O1919 = 0.6170827253998104)
  8306. =>WM: (13498: S1 ^operator O1922 +)
  8307. =>WM: (13497: S1 ^operator O1921 +)
  8308. =>WM: (13496: O1922 ^name predict-no)
  8309. =>WM: (13495: O1921 ^name predict-yes)
  8310. =>WM: (13494: R964 ^value 1)
  8311. =>WM: (13493: R1 ^reward R964)
  8312. =>WM: (13492: I3 ^see 1)
  8313. <=WM: (13483: S1 ^operator O1919 +)
  8314. <=WM: (13485: S1 ^operator O1919)
  8315. <=WM: (13484: S1 ^operator O1920 +)
  8316. <=WM: (13478: R1 ^reward R963)
  8317. <=WM: (13381: I3 ^see 0)
  8318. <=WM: (13481: O1920 ^name predict-no)
  8319. <=WM: (13480: O1919 ^name predict-yes)
  8320. <=WM: (13479: R963 ^value 1)
  8321. --- Inner Elaboration Phase, active level 1 (S1) ---
  8322. Firing prefer*rvt*predict-yes*H0
  8323. -->
  8324. Firing rl*prefer*rvt*predict-yes*H0*3
  8325. -->
  8326. (S1 ^operator O1921 = 0.3829271874912855)
  8327. Firing prefer*rvt*predict-yes*H0*3*H1
  8328. -->
  8329. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  8330. -->
  8331. (S1 ^operator O1921 = 0.08783148430849691)
  8332. Firing prefer*rvt*predict-no*H0
  8333. -->
  8334. Firing rl*prefer*rvt*predict-no*H0*4
  8335. -->
  8336. (S1 ^operator O1922 = 0.1269768259493387)
  8337. Firing prefer*rvt*predict-no*H0*4*H1
  8338. -->
  8339. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  8340. -->
  8341. (S1 ^operator O1922 = 0.873023493232603)
  8342. inner elaboration loop at bottom goal.
  8343. Retracting rl*prefer*rvt*predict-no*H0*4
  8344. -->
  8345. (S1 ^operator O1920 = 0.1269768259493387)
  8346. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  8347. -->
  8348. (S1 ^operator O1920 = 0.873023493232603)
  8349. Retracting rl*prefer*rvt*predict-yes*H0*3
  8350. -->
  8351. (S1 ^operator O1919 = 0.3829271874912855)
  8352. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  8353. -->
  8354. (S1 ^operator O1919 = 0.08783148430849691)
  8355. --- END Proposal Phase ---
  8356. --- Decision Phase ---
  8357. RL update rl*prefer*rvt*predict-yes*H0*3 0.673122 -0.290194 0.382927 -> 0.67312 -0.290194 0.382926(R,m,v=1,0.959184,0.0394185)
  8358. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326888 0.290195 0.617083 -> 0.326886 0.290195 0.617081(R,m,v=1,1,0)
  8359. =>WM: (13499: S1 ^operator O1922)
  8360. 961: O: O1922 (predict-no)
  8361. --- END Decision Phase ---
  8362. --- Application Phase ---
  8363. --- Firing Productions (PE) For State At Depth 1 ---
  8364. --- Inner Elaboration Phase, active level 1 (S1) ---
  8365. Firing apply*operator
  8366. -->
  8367. (I3 ^predict-no N961 + :O )
  8368. Firing apply*operator*complete
  8369. -->
  8370. (I3 ^predict-yes N960 - :O )
  8371. inner elaboration loop at bottom goal.
  8372. --- Change Working Memory (PE) ---
  8373. =>WM: (13500: I3 ^predict-no N961)
  8374. <=WM: (13487: N960 ^status complete)
  8375. <=WM: (13486: I3 ^predict-yes N960)
  8376. --- Firing Productions (IE) For State At Depth 1 ---
  8377. --- Inner Elaboration Phase, active level 1 (S1) ---
  8378. Firing monitor*world
  8379. -->
  8380. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8381. --- Change Working Memory (IE) ---
  8382. --- END Application Phase ---
  8383. --- Output Phase ---
  8384. ENV: Agent did: predict-no for direction R in state State-B
  8385. In State-B moving R
  8386. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8387. predict error 0
  8388. dir: dir isL
  8389. --- END Output Phase ---
  8390. --- Input Phase ---
  8391. =>WM: (13504: I2 ^dir L)
  8392. =>WM: (13503: I2 ^reward 1)
  8393. =>WM: (13502: I2 ^see 0)
  8394. =>WM: (13501: N961 ^status complete)
  8395. <=WM: (13490: I2 ^dir R)
  8396. <=WM: (13489: I2 ^reward 1)
  8397. <=WM: (13488: I2 ^see 1)
  8398. =>WM: (13505: I2 ^level-1 R0-root)
  8399. <=WM: (13491: I2 ^level-1 R1-root)
  8400. --- END Input Phase ---
  8401. --- Proposal Phase ---
  8402. --- Inner Elaboration Phase, active level 1 (S1) ---
  8403. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  8404. -->
  8405. (S1 ^operator O1921 = 0.4768849116445159)
  8406. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  8407. -->
  8408. (S1 ^operator O1922 = 0.1700769046561409)
  8409. Firing prefer*rvt*predict-no*H0*2*H1
  8410. -->
  8411. Firing prefer*rvt*predict-yes*H0*1*H1
  8412. -->
  8413. Firing elaborate*copy-see-to-output-link
  8414. -->
  8415. (I3 ^see 0 +)
  8416. Firing elaborate*reward*based*on*reward
  8417. -->
  8418. (R965 ^value 1 +)
  8419. (R1 ^reward R965 +)
  8420. Firing propose*predict-yes
  8421. -->
  8422. (O1923 ^name predict-yes +)
  8423. (S1 ^operator O1923 +)
  8424. Firing propose*predict-no
  8425. -->
  8426. (O1924 ^name predict-no +)
  8427. (S1 ^operator O1924 +)
  8428. Firing rl*prefer*rvt*predict-no*H0*2
  8429. -->
  8430. (S1 ^operator O1922 = 0.255013280266792)
  8431. Firing rl*prefer*rvt*predict-yes*H0*1
  8432. -->
  8433. (S1 ^operator O1921 = 0.5231208125838516)
  8434. Firing prefer*rvt*predict-yes*H0
  8435. -->
  8436. Firing prefer*rvt*predict-no*H0
  8437. -->
  8438. Firing elaborate*copy-dir-to-output-link
  8439. -->
  8440. (I3 ^dir L +)
  8441. inner elaboration loop at bottom goal.
  8442. Retracting elaborate*copy-see-to-output-link
  8443. -->
  8444. (I3 ^see 1 +)
  8445. Retracting propose*predict-no
  8446. -->
  8447. (O1922 ^name predict-no +)
  8448. (S1 ^operator O1922 +)
  8449. Retracting propose*predict-yes
  8450. -->
  8451. (O1921 ^name predict-yes +)
  8452. (S1 ^operator O1921 +)
  8453. Retracting elaborate*reward*based*on*reward
  8454. -->
  8455. (R964 ^value 1 +)
  8456. (R1 ^reward R964 +)
  8457. Retracting elaborate*copy-dir-to-output-link
  8458. -->
  8459. (I3 ^dir R +)
  8460. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  8461. -->
  8462. (S1 ^operator O1922 = 0.873023493232603)
  8463. Retracting rl*prefer*rvt*predict-no*H0*4
  8464. -->
  8465. (S1 ^operator O1922 = 0.1269768259493387)
  8466. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  8467. -->
  8468. (S1 ^operator O1921 = 0.08783148430849691)
  8469. Retracting rl*prefer*rvt*predict-yes*H0*3
  8470. -->
  8471. (S1 ^operator O1921 = 0.3829257005576211)
  8472. =>WM: (13513: S1 ^operator O1924 +)
  8473. =>WM: (13512: S1 ^operator O1923 +)
  8474. =>WM: (13511: I3 ^dir L)
  8475. =>WM: (13510: O1924 ^name predict-no)
  8476. =>WM: (13509: O1923 ^name predict-yes)
  8477. =>WM: (13508: R965 ^value 1)
  8478. =>WM: (13507: R1 ^reward R965)
  8479. =>WM: (13506: I3 ^see 0)
  8480. <=WM: (13497: S1 ^operator O1921 +)
  8481. <=WM: (13498: S1 ^operator O1922 +)
  8482. <=WM: (13499: S1 ^operator O1922)
  8483. <=WM: (13482: I3 ^dir R)
  8484. <=WM: (13493: R1 ^reward R964)
  8485. <=WM: (13492: I3 ^see 1)
  8486. <=WM: (13496: O1922 ^name predict-no)
  8487. <=WM: (13495: O1921 ^name predict-yes)
  8488. <=WM: (13494: R964 ^value 1)
  8489. --- Inner Elaboration Phase, active level 1 (S1) ---
  8490. Firing prefer*rvt*predict-yes*H0
  8491. -->
  8492. Firing rl*prefer*rvt*predict-yes*H0*1
  8493. -->
  8494. (S1 ^operator O1923 = 0.5231208125838516)
  8495. Firing prefer*rvt*predict-yes*H0*1*H1
  8496. -->
  8497. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  8498. -->
  8499. (S1 ^operator O1923 = 0.4768849116445159)
  8500. Firing prefer*rvt*predict-no*H0
  8501. -->
  8502. Firing rl*prefer*rvt*predict-no*H0*2
  8503. -->
  8504. (S1 ^operator O1924 = 0.255013280266792)
  8505. Firing prefer*rvt*predict-no*H0*2*H1
  8506. -->
  8507. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  8508. -->
  8509. (S1 ^operator O1924 = 0.1700769046561409)
  8510. inner elaboration loop at bottom goal.
  8511. Retracting rl*prefer*rvt*predict-no*H0*2
  8512. -->
  8513. (S1 ^operator O1922 = 0.255013280266792)
  8514. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  8515. -->
  8516. (S1 ^operator O1922 = 0.1700769046561409)
  8517. Retracting rl*prefer*rvt*predict-yes*H0*1
  8518. -->
  8519. (S1 ^operator O1921 = 0.5231208125838516)
  8520. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  8521. -->
  8522. (S1 ^operator O1921 = 0.4768849116445159)
  8523. --- END Proposal Phase ---
  8524. --- Decision Phase ---
  8525. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.947674,0.0498776)
  8526. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  8527. =>WM: (13514: S1 ^operator O1923)
  8528. 962: O: O1923 (predict-yes)
  8529. --- END Decision Phase ---
  8530. --- Application Phase ---
  8531. --- Firing Productions (PE) For State At Depth 1 ---
  8532. --- Inner Elaboration Phase, active level 1 (S1) ---
  8533. Firing apply*operator
  8534. -->
  8535. (I3 ^predict-yes N962 + :O )
  8536. Firing apply*operator*complete
  8537. -->
  8538. (I3 ^predict-no N961 - :O )
  8539. inner elaboration loop at bottom goal.
  8540. --- Change Working Memory (PE) ---
  8541. =>WM: (13515: I3 ^predict-yes N962)
  8542. <=WM: (13501: N961 ^status complete)
  8543. <=WM: (13500: I3 ^predict-no N961)
  8544. --- Firing Productions (IE) For State At Depth 1 ---
  8545. --- Inner Elaboration Phase, active level 1 (S1) ---
  8546. Firing monitor*world
  8547. -->
  8548. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8549. --- Change Working Memory (IE) ---
  8550. --- END Application Phase ---
  8551. --- Output Phase ---
  8552. ENV: Agent did: predict-yes for direction L in state State-B
  8553. In State-B moving L
  8554. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8555. predict error 0
  8556. dir: dir isR
  8557. --- END Output Phase ---
  8558. --- Input Phase ---
  8559. =>WM: (13519: I2 ^dir R)
  8560. =>WM: (13518: I2 ^reward 1)
  8561. =>WM: (13517: I2 ^see 1)
  8562. =>WM: (13516: N962 ^status complete)
  8563. <=WM: (13504: I2 ^dir L)
  8564. <=WM: (13503: I2 ^reward 1)
  8565. <=WM: (13502: I2 ^see 0)
  8566. =>WM: (13520: I2 ^level-1 L1-root)
  8567. <=WM: (13505: I2 ^level-1 R0-root)
  8568. --- END Input Phase ---
  8569. --- Proposal Phase ---
  8570. --- Inner Elaboration Phase, active level 1 (S1) ---
  8571. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8572. -->
  8573. (S1 ^operator O1923 = 0.6170188666021243)
  8574. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8575. -->
  8576. (S1 ^operator O1924 = 0.4901349546100854)
  8577. Firing prefer*rvt*predict-no*H0*4*H1
  8578. -->
  8579. Firing prefer*rvt*predict-yes*H0*3*H1
  8580. -->
  8581. Firing elaborate*copy-see-to-output-link
  8582. -->
  8583. (I3 ^see 1 +)
  8584. Firing elaborate*reward*based*on*reward
  8585. -->
  8586. (R966 ^value 1 +)
  8587. (R1 ^reward R966 +)
  8588. Firing propose*predict-yes
  8589. -->
  8590. (O1925 ^name predict-yes +)
  8591. (S1 ^operator O1925 +)
  8592. Firing propose*predict-no
  8593. -->
  8594. (O1926 ^name predict-no +)
  8595. (S1 ^operator O1926 +)
  8596. Firing rl*prefer*rvt*predict-no*H0*4
  8597. -->
  8598. (S1 ^operator O1924 = 0.1269767780720474)
  8599. Firing rl*prefer*rvt*predict-yes*H0*3
  8600. -->
  8601. (S1 ^operator O1923 = 0.3829257005576211)
  8602. Firing prefer*rvt*predict-yes*H0
  8603. -->
  8604. Firing prefer*rvt*predict-no*H0
  8605. -->
  8606. Firing elaborate*copy-dir-to-output-link
  8607. -->
  8608. (I3 ^dir R +)
  8609. inner elaboration loop at bottom goal.
  8610. Retracting elaborate*copy-see-to-output-link
  8611. -->
  8612. (I3 ^see 0 +)
  8613. Retracting propose*predict-no
  8614. -->
  8615. (O1924 ^name predict-no +)
  8616. (S1 ^operator O1924 +)
  8617. Retracting propose*predict-yes
  8618. -->
  8619. (O1923 ^name predict-yes +)
  8620. (S1 ^operator O1923 +)
  8621. Retracting elaborate*reward*based*on*reward
  8622. -->
  8623. (R965 ^value 1 +)
  8624. (R1 ^reward R965 +)
  8625. Retracting elaborate*copy-dir-to-output-link
  8626. -->
  8627. (I3 ^dir L +)
  8628. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  8629. -->
  8630. (S1 ^operator O1924 = 0.1700769046561409)
  8631. Retracting rl*prefer*rvt*predict-no*H0*2
  8632. -->
  8633. (S1 ^operator O1924 = 0.255013280266792)
  8634. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  8635. -->
  8636. (S1 ^operator O1923 = 0.4768849116445159)
  8637. Retracting rl*prefer*rvt*predict-yes*H0*1
  8638. -->
  8639. (S1 ^operator O1923 = 0.5231208125838516)
  8640. =>WM: (13528: S1 ^operator O1926 +)
  8641. =>WM: (13527: S1 ^operator O1925 +)
  8642. =>WM: (13526: I3 ^dir R)
  8643. =>WM: (13525: O1926 ^name predict-no)
  8644. =>WM: (13524: O1925 ^name predict-yes)
  8645. =>WM: (13523: R966 ^value 1)
  8646. =>WM: (13522: R1 ^reward R966)
  8647. =>WM: (13521: I3 ^see 1)
  8648. <=WM: (13512: S1 ^operator O1923 +)
  8649. <=WM: (13514: S1 ^operator O1923)
  8650. <=WM: (13513: S1 ^operator O1924 +)
  8651. <=WM: (13511: I3 ^dir L)
  8652. <=WM: (13507: R1 ^reward R965)
  8653. <=WM: (13506: I3 ^see 0)
  8654. <=WM: (13510: O1924 ^name predict-no)
  8655. <=WM: (13509: O1923 ^name predict-yes)
  8656. <=WM: (13508: R965 ^value 1)
  8657. --- Inner Elaboration Phase, active level 1 (S1) ---
  8658. Firing prefer*rvt*predict-yes*H0
  8659. -->
  8660. Firing rl*prefer*rvt*predict-yes*H0*3
  8661. -->
  8662. (S1 ^operator O1925 = 0.3829257005576211)
  8663. Firing prefer*rvt*predict-yes*H0*3*H1
  8664. -->
  8665. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8666. -->
  8667. (S1 ^operator O1925 = 0.6170188666021243)
  8668. Firing prefer*rvt*predict-no*H0
  8669. -->
  8670. Firing rl*prefer*rvt*predict-no*H0*4
  8671. -->
  8672. (S1 ^operator O1926 = 0.1269767780720474)
  8673. Firing prefer*rvt*predict-no*H0*4*H1
  8674. -->
  8675. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8676. -->
  8677. (S1 ^operator O1926 = 0.4901349546100854)
  8678. inner elaboration loop at bottom goal.
  8679. Retracting rl*prefer*rvt*predict-no*H0*4
  8680. -->
  8681. (S1 ^operator O1924 = 0.1269767780720474)
  8682. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8683. -->
  8684. (S1 ^operator O1924 = 0.4901349546100854)
  8685. Retracting rl*prefer*rvt*predict-yes*H0*3
  8686. -->
  8687. (S1 ^operator O1923 = 0.3829257005576211)
  8688. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8689. -->
  8690. (S1 ^operator O1923 = 0.6170188666021243)
  8691. --- END Proposal Phase ---
  8692. --- Decision Phase ---
  8693. RL update rl*prefer*rvt*predict-yes*H0*1 0.727961 -0.20484 0.523121 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.978102,0.0215758)
  8694. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272047 0.204838 0.476885 -> 0.272045 0.204839 0.476884(R,m,v=1,1,0)
  8695. =>WM: (13529: S1 ^operator O1925)
  8696. 963: O: O1925 (predict-yes)
  8697. --- END Decision Phase ---
  8698. --- Application Phase ---
  8699. --- Firing Productions (PE) For State At Depth 1 ---
  8700. --- Inner Elaboration Phase, active level 1 (S1) ---
  8701. Firing apply*operator
  8702. -->
  8703. (I3 ^predict-yes N963 + :O )
  8704. Firing apply*operator*complete
  8705. -->
  8706. (I3 ^predict-yes N962 - :O )
  8707. inner elaboration loop at bottom goal.
  8708. --- Change Working Memory (PE) ---
  8709. =>WM: (13530: I3 ^predict-yes N963)
  8710. <=WM: (13516: N962 ^status complete)
  8711. <=WM: (13515: I3 ^predict-yes N962)
  8712. --- Firing Productions (IE) For State At Depth 1 ---
  8713. --- Inner Elaboration Phase, active level 1 (S1) ---
  8714. Firing monitor*world
  8715. -->
  8716. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8717. --- Change Working Memory (IE) ---
  8718. --- END Application Phase ---
  8719. --- Output Phase ---
  8720. ENV: Agent did: predict-yes for direction R in state State-A
  8721. In State-A moving R
  8722. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8723. predict error 0
  8724. dir: dir isU
  8725. --- END Output Phase ---
  8726. --- Input Phase ---
  8727. =>WM: (13534: I2 ^dir U)
  8728. =>WM: (13533: I2 ^reward 1)
  8729. =>WM: (13532: I2 ^see 1)
  8730. =>WM: (13531: N963 ^status complete)
  8731. <=WM: (13519: I2 ^dir R)
  8732. <=WM: (13518: I2 ^reward 1)
  8733. <=WM: (13517: I2 ^see 1)
  8734. =>WM: (13535: I2 ^level-1 R1-root)
  8735. <=WM: (13520: I2 ^level-1 L1-root)
  8736. --- END Input Phase ---
  8737. --- Proposal Phase ---
  8738. --- Inner Elaboration Phase, active level 1 (S1) ---
  8739. Firing elaborate*copy-see-to-output-link
  8740. -->
  8741. (I3 ^see 1 +)
  8742. Firing elaborate*reward*based*on*reward
  8743. -->
  8744. (R967 ^value 1 +)
  8745. (R1 ^reward R967 +)
  8746. Firing propose*predict-yes
  8747. -->
  8748. (O1927 ^name predict-yes +)
  8749. (S1 ^operator O1927 +)
  8750. Firing propose*predict-no
  8751. -->
  8752. (O1928 ^name predict-no +)
  8753. (S1 ^operator O1928 +)
  8754. Firing rl*prefer*rvt*predict-no*H0*6
  8755. -->
  8756. (S1 ^operator O1926 = 0.9999999999999999)
  8757. Firing rl*prefer*rvt*predict-yes*H0*5
  8758. -->
  8759. (S1 ^operator O1925 = 0.)
  8760. Firing prefer*rvt*predict-yes*H0
  8761. -->
  8762. Firing prefer*rvt*predict-no*H0
  8763. -->
  8764. Firing elaborate*copy-dir-to-output-link
  8765. -->
  8766. (I3 ^dir U +)
  8767. inner elaboration loop at bottom goal.
  8768. Retracting elaborate*copy-see-to-output-link
  8769. -->
  8770. (I3 ^see 1 +)
  8771. Retracting propose*predict-no
  8772. -->
  8773. (O1926 ^name predict-no +)
  8774. (S1 ^operator O1926 +)
  8775. Retracting propose*predict-yes
  8776. -->
  8777. (O1925 ^name predict-yes +)
  8778. (S1 ^operator O1925 +)
  8779. Retracting elaborate*reward*based*on*reward
  8780. -->
  8781. (R966 ^value 1 +)
  8782. (R1 ^reward R966 +)
  8783. Retracting elaborate*copy-dir-to-output-link
  8784. -->
  8785. (I3 ^dir R +)
  8786. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8787. -->
  8788. (S1 ^operator O1926 = 0.4901349546100854)
  8789. Retracting rl*prefer*rvt*predict-no*H0*4
  8790. -->
  8791. (S1 ^operator O1926 = 0.1269767780720474)
  8792. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8793. -->
  8794. (S1 ^operator O1925 = 0.6170188666021243)
  8795. Retracting rl*prefer*rvt*predict-yes*H0*3
  8796. -->
  8797. (S1 ^operator O1925 = 0.3829257005576211)
  8798. =>WM: (13542: S1 ^operator O1928 +)
  8799. =>WM: (13541: S1 ^operator O1927 +)
  8800. =>WM: (13540: I3 ^dir U)
  8801. =>WM: (13539: O1928 ^name predict-no)
  8802. =>WM: (13538: O1927 ^name predict-yes)
  8803. =>WM: (13537: R967 ^value 1)
  8804. =>WM: (13536: R1 ^reward R967)
  8805. <=WM: (13527: S1 ^operator O1925 +)
  8806. <=WM: (13529: S1 ^operator O1925)
  8807. <=WM: (13528: S1 ^operator O1926 +)
  8808. <=WM: (13526: I3 ^dir R)
  8809. <=WM: (13522: R1 ^reward R966)
  8810. <=WM: (13525: O1926 ^name predict-no)
  8811. <=WM: (13524: O1925 ^name predict-yes)
  8812. <=WM: (13523: R966 ^value 1)
  8813. --- Inner Elaboration Phase, active level 1 (S1) ---
  8814. Firing prefer*rvt*predict-yes*H0
  8815. -->
  8816. Firing rl*prefer*rvt*predict-yes*H0*5
  8817. -->
  8818. (S1 ^operator O1927 = 0.)
  8819. Firing prefer*rvt*predict-no*H0
  8820. -->
  8821. Firing rl*prefer*rvt*predict-no*H0*6
  8822. -->
  8823. (S1 ^operator O1928 = 0.9999999999999999)
  8824. inner elaboration loop at bottom goal.
  8825. Retracting rl*prefer*rvt*predict-no*H0*6
  8826. -->
  8827. (S1 ^operator O1926 = 0.9999999999999999)
  8828. Retracting rl*prefer*rvt*predict-yes*H0*5
  8829. -->
  8830. (S1 ^operator O1925 = 0.)
  8831. --- END Proposal Phase ---
  8832. --- Decision Phase ---
  8833. RL update rl*prefer*rvt*predict-yes*H0*3 0.67312 -0.290194 0.382926 -> 0.673128 -0.290194 0.382934(R,m,v=1,0.959459,0.0391616)
  8834. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326829 0.29019 0.617019 -> 0.326837 0.29019 0.617027(R,m,v=1,1,0)
  8835. =>WM: (13543: S1 ^operator O1928)
  8836. 964: O: O1928 (predict-no)
  8837. --- END Decision Phase ---
  8838. --- Application Phase ---
  8839. --- Firing Productions (PE) For State At Depth 1 ---
  8840. --- Inner Elaboration Phase, active level 1 (S1) ---
  8841. Firing apply*operator
  8842. -->
  8843. (I3 ^predict-no N964 + :O )
  8844. Firing apply*operator*complete
  8845. -->
  8846. (I3 ^predict-yes N963 - :O )
  8847. inner elaboration loop at bottom goal.
  8848. --- Change Working Memory (PE) ---
  8849. =>WM: (13544: I3 ^predict-no N964)
  8850. <=WM: (13531: N963 ^status complete)
  8851. <=WM: (13530: I3 ^predict-yes N963)
  8852. --- Firing Productions (IE) For State At Depth 1 ---
  8853. --- Inner Elaboration Phase, active level 1 (S1) ---
  8854. Firing monitor*world
  8855. -->
  8856. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8857. --- Change Working Memory (IE) ---
  8858. --- END Application Phase ---
  8859. --- Output Phase ---
  8860. ENV: Agent did: predict-no for direction U in state State-B
  8861. In State-B moving U
  8862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8863. predict error 0
  8864. dir: dir isL
  8865. --- END Output Phase ---
  8866. --- Input Phase ---
  8867. =>WM: (13548: I2 ^dir L)
  8868. =>WM: (13547: I2 ^reward 1)
  8869. =>WM: (13546: I2 ^see 0)
  8870. =>WM: (13545: N964 ^status complete)
  8871. <=WM: (13534: I2 ^dir U)
  8872. <=WM: (13533: I2 ^reward 1)
  8873. <=WM: (13532: I2 ^see 1)
  8874. =>WM: (13549: I2 ^level-1 R1-root)
  8875. <=WM: (13535: I2 ^level-1 R1-root)
  8876. --- END Input Phase ---
  8877. --- Proposal Phase ---
  8878. --- Inner Elaboration Phase, active level 1 (S1) ---
  8879. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  8880. -->
  8881. (S1 ^operator O1927 = 0.4768766075457324)
  8882. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  8883. -->
  8884. (S1 ^operator O1928 = -0.01194930198035649)
  8885. Firing prefer*rvt*predict-no*H0*2*H1
  8886. -->
  8887. Firing prefer*rvt*predict-yes*H0*1*H1
  8888. -->
  8889. Firing elaborate*copy-see-to-output-link
  8890. -->
  8891. (I3 ^see 0 +)
  8892. Firing elaborate*reward*based*on*reward
  8893. -->
  8894. (R968 ^value 1 +)
  8895. (R1 ^reward R968 +)
  8896. Firing propose*predict-yes
  8897. -->
  8898. (O1929 ^name predict-yes +)
  8899. (S1 ^operator O1929 +)
  8900. Firing propose*predict-no
  8901. -->
  8902. (O1930 ^name predict-no +)
  8903. (S1 ^operator O1930 +)
  8904. Firing rl*prefer*rvt*predict-no*H0*2
  8905. -->
  8906. (S1 ^operator O1928 = 0.255013280266792)
  8907. Firing rl*prefer*rvt*predict-yes*H0*1
  8908. -->
  8909. (S1 ^operator O1927 = 0.5231199539495964)
  8910. Firing prefer*rvt*predict-yes*H0
  8911. -->
  8912. Firing prefer*rvt*predict-no*H0
  8913. -->
  8914. Firing elaborate*copy-dir-to-output-link
  8915. -->
  8916. (I3 ^dir L +)
  8917. inner elaboration loop at bottom goal.
  8918. Retracting elaborate*copy-see-to-output-link
  8919. -->
  8920. (I3 ^see 1 +)
  8921. Retracting propose*predict-no
  8922. -->
  8923. (O1928 ^name predict-no +)
  8924. (S1 ^operator O1928 +)
  8925. Retracting propose*predict-yes
  8926. -->
  8927. (O1927 ^name predict-yes +)
  8928. (S1 ^operator O1927 +)
  8929. Retracting elaborate*reward*based*on*reward
  8930. -->
  8931. (R967 ^value 1 +)
  8932. (R1 ^reward R967 +)
  8933. Retracting elaborate*copy-dir-to-output-link
  8934. -->
  8935. (I3 ^dir U +)
  8936. Retracting rl*prefer*rvt*predict-no*H0*6
  8937. -->
  8938. (S1 ^operator O1928 = 0.9999999999999999)
  8939. Retracting rl*prefer*rvt*predict-yes*H0*5
  8940. -->
  8941. (S1 ^operator O1927 = 0.)
  8942. =>WM: (13557: S1 ^operator O1930 +)
  8943. =>WM: (13556: S1 ^operator O1929 +)
  8944. =>WM: (13555: I3 ^dir L)
  8945. =>WM: (13554: O1930 ^name predict-no)
  8946. =>WM: (13553: O1929 ^name predict-yes)
  8947. =>WM: (13552: R968 ^value 1)
  8948. =>WM: (13551: R1 ^reward R968)
  8949. =>WM: (13550: I3 ^see 0)
  8950. <=WM: (13541: S1 ^operator O1927 +)
  8951. <=WM: (13542: S1 ^operator O1928 +)
  8952. <=WM: (13543: S1 ^operator O1928)
  8953. <=WM: (13540: I3 ^dir U)
  8954. <=WM: (13536: R1 ^reward R967)
  8955. <=WM: (13521: I3 ^see 1)
  8956. <=WM: (13539: O1928 ^name predict-no)
  8957. <=WM: (13538: O1927 ^name predict-yes)
  8958. <=WM: (13537: R967 ^value 1)
  8959. --- Inner Elaboration Phase, active level 1 (S1) ---
  8960. Firing prefer*rvt*predict-yes*H0
  8961. -->
  8962. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  8963. -->
  8964. (S1 ^operator O1929 = 0.4768766075457324)
  8965. Firing rl*prefer*rvt*predict-yes*H0*1
  8966. -->
  8967. (S1 ^operator O1929 = 0.5231199539495964)
  8968. Firing prefer*rvt*predict-yes*H0*1*H1
  8969. -->
  8970. Firing prefer*rvt*predict-no*H0
  8971. -->
  8972. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  8973. -->
  8974. (S1 ^operator O1930 = -0.01194930198035649)
  8975. Firing rl*prefer*rvt*predict-no*H0*2
  8976. -->
  8977. (S1 ^operator O1930 = 0.255013280266792)
  8978. Firing prefer*rvt*predict-no*H0*2*H1
  8979. -->
  8980. inner elaboration loop at bottom goal.
  8981. Retracting rl*prefer*rvt*predict-no*H0*2
  8982. -->
  8983. (S1 ^operator O1928 = 0.255013280266792)
  8984. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  8985. -->
  8986. (S1 ^operator O1928 = -0.01194930198035649)
  8987. Retracting rl*prefer*rvt*predict-yes*H0*1
  8988. -->
  8989. (S1 ^operator O1927 = 0.5231199539495964)
  8990. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  8991. -->
  8992. (S1 ^operator O1927 = 0.4768766075457324)
  8993. --- END Proposal Phase ---
  8994. --- Decision Phase ---
  8995. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8996. =>WM: (13558: S1 ^operator O1929)
  8997. 965: O: O1929 (predict-yes)
  8998. --- END Decision Phase ---
  8999. --- Application Phase ---
  9000. --- Firing Productions (PE) For State At Depth 1 ---
  9001. --- Inner Elaboration Phase, active level 1 (S1) ---
  9002. Firing apply*operator
  9003. -->
  9004. (I3 ^predict-yes N965 + :O )
  9005. Firing apply*operator*complete
  9006. -->
  9007. (I3 ^predict-no N964 - :O )
  9008. inner elaboration loop at bottom goal.
  9009. --- Change Working Memory (PE) ---
  9010. =>WM: (13559: I3 ^predict-yes N965)
  9011. <=WM: (13545: N964 ^status complete)
  9012. <=WM: (13544: I3 ^predict-no N964)
  9013. --- Firing Productions (IE) For State At Depth 1 ---
  9014. --- Inner Elaboration Phase, active level 1 (S1) ---
  9015. Firing monitor*world
  9016. -->
  9017. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9018. --- Change Working Memory (IE) ---
  9019. --- END Application Phase ---
  9020. --- Output Phase ---
  9021. ENV: Agent did: predict-yes for direction L in state State-B
  9022. In State-B moving L
  9023. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9024. predict error 0
  9025. dir: dir isL
  9026. --- END Output Phase ---
  9027. --- Input Phase ---
  9028. =>WM: (13563: I2 ^dir L)
  9029. =>WM: (13562: I2 ^reward 1)
  9030. =>WM: (13561: I2 ^see 1)
  9031. =>WM: (13560: N965 ^status complete)
  9032. <=WM: (13548: I2 ^dir L)
  9033. <=WM: (13547: I2 ^reward 1)
  9034. <=WM: (13546: I2 ^see 0)
  9035. =>WM: (13564: I2 ^level-1 L1-root)
  9036. <=WM: (13549: I2 ^level-1 R1-root)
  9037. --- END Input Phase ---
  9038. --- Proposal Phase ---
  9039. --- Inner Elaboration Phase, active level 1 (S1) ---
  9040. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  9041. -->
  9042. (S1 ^operator O1929 = 0.1693592933936033)
  9043. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  9044. -->
  9045. (S1 ^operator O1930 = 0.7449862824724345)
  9046. Firing prefer*rvt*predict-no*H0*2*H1
  9047. -->
  9048. Firing prefer*rvt*predict-yes*H0*1*H1
  9049. -->
  9050. Firing elaborate*copy-see-to-output-link
  9051. -->
  9052. (I3 ^see 1 +)
  9053. Firing elaborate*reward*based*on*reward
  9054. -->
  9055. (R969 ^value 1 +)
  9056. (R1 ^reward R969 +)
  9057. Firing propose*predict-yes
  9058. -->
  9059. (O1931 ^name predict-yes +)
  9060. (S1 ^operator O1931 +)
  9061. Firing propose*predict-no
  9062. -->
  9063. (O1932 ^name predict-no +)
  9064. (S1 ^operator O1932 +)
  9065. Firing rl*prefer*rvt*predict-no*H0*2
  9066. -->
  9067. (S1 ^operator O1930 = 0.255013280266792)
  9068. Firing rl*prefer*rvt*predict-yes*H0*1
  9069. -->
  9070. (S1 ^operator O1929 = 0.5231199539495964)
  9071. Firing prefer*rvt*predict-yes*H0
  9072. -->
  9073. Firing prefer*rvt*predict-no*H0
  9074. -->
  9075. Firing elaborate*copy-dir-to-output-link
  9076. -->
  9077. (I3 ^dir L +)
  9078. inner elaboration loop at bottom goal.
  9079. Retracting elaborate*copy-see-to-output-link
  9080. -->
  9081. (I3 ^see 0 +)
  9082. Retracting propose*predict-no
  9083. -->
  9084. (O1930 ^name predict-no +)
  9085. (S1 ^operator O1930 +)
  9086. Retracting propose*predict-yes
  9087. -->
  9088. (O1929 ^name predict-yes +)
  9089. (S1 ^operator O1929 +)
  9090. Retracting elaborate*reward*based*on*reward
  9091. -->
  9092. (R968 ^value 1 +)
  9093. (R1 ^reward R968 +)
  9094. Retracting elaborate*copy-dir-to-output-link
  9095. -->
  9096. (I3 ^dir L +)
  9097. Retracting rl*prefer*rvt*predict-no*H0*2
  9098. -->
  9099. (S1 ^operator O1930 = 0.255013280266792)
  9100. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  9101. -->
  9102. (S1 ^operator O1930 = -0.01194930198035649)
  9103. Retracting rl*prefer*rvt*predict-yes*H0*1
  9104. -->
  9105. (S1 ^operator O1929 = 0.5231199539495964)
  9106. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  9107. -->
  9108. (S1 ^operator O1929 = 0.4768766075457324)
  9109. =>WM: (13571: S1 ^operator O1932 +)
  9110. =>WM: (13570: S1 ^operator O1931 +)
  9111. =>WM: (13569: O1932 ^name predict-no)
  9112. =>WM: (13568: O1931 ^name predict-yes)
  9113. =>WM: (13567: R969 ^value 1)
  9114. =>WM: (13566: R1 ^reward R969)
  9115. =>WM: (13565: I3 ^see 1)
  9116. <=WM: (13556: S1 ^operator O1929 +)
  9117. <=WM: (13558: S1 ^operator O1929)
  9118. <=WM: (13557: S1 ^operator O1930 +)
  9119. <=WM: (13551: R1 ^reward R968)
  9120. <=WM: (13550: I3 ^see 0)
  9121. <=WM: (13554: O1930 ^name predict-no)
  9122. <=WM: (13553: O1929 ^name predict-yes)
  9123. <=WM: (13552: R968 ^value 1)
  9124. --- Inner Elaboration Phase, active level 1 (S1) ---
  9125. Firing prefer*rvt*predict-yes*H0
  9126. -->
  9127. Firing rl*prefer*rvt*predict-yes*H0*1
  9128. -->
  9129. (S1 ^operator O1931 = 0.5231199539495964)
  9130. Firing prefer*rvt*predict-yes*H0*1*H1
  9131. -->
  9132. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  9133. -->
  9134. (S1 ^operator O1931 = 0.1693592933936033)
  9135. Firing prefer*rvt*predict-no*H0
  9136. -->
  9137. Firing rl*prefer*rvt*predict-no*H0*2
  9138. -->
  9139. (S1 ^operator O1932 = 0.255013280266792)
  9140. Firing prefer*rvt*predict-no*H0*2*H1
  9141. -->
  9142. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  9143. -->
  9144. (S1 ^operator O1932 = 0.7449862824724345)
  9145. inner elaboration loop at bottom goal.
  9146. Retracting rl*prefer*rvt*predict-no*H0*2
  9147. -->
  9148. (S1 ^operator O1930 = 0.255013280266792)
  9149. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  9150. -->
  9151. (S1 ^operator O1930 = 0.7449862824724345)
  9152. Retracting rl*prefer*rvt*predict-yes*H0*1
  9153. -->
  9154. (S1 ^operator O1929 = 0.5231199539495964)
  9155. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  9156. -->
  9157. (S1 ^operator O1929 = 0.1693592933936033)
  9158. --- END Proposal Phase ---
  9159. --- Decision Phase ---
  9160. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.727961 -0.20484 0.52312(R,m,v=1,0.978261,0.0214218)
  9161. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272036 0.204841 0.476877 -> 0.272036 0.204841 0.476877(R,m,v=1,1,0)
  9162. =>WM: (13572: S1 ^operator O1932)
  9163. 966: O: O1932 (predict-no)
  9164. --- END Decision Phase ---
  9165. --- Application Phase ---
  9166. --- Firing Productions (PE) For State At Depth 1 ---
  9167. --- Inner Elaboration Phase, active level 1 (S1) ---
  9168. Firing apply*operator
  9169. -->
  9170. (I3 ^predict-no N966 + :O )
  9171. Firing apply*operator*complete
  9172. -->
  9173. (I3 ^predict-yes N965 - :O )
  9174. inner elaboration loop at bottom goal.
  9175. --- Change Working Memory (PE) ---
  9176. =>WM: (13573: I3 ^predict-no N966)
  9177. <=WM: (13560: N965 ^status complete)
  9178. <=WM: (13559: I3 ^predict-yes N965)
  9179. --- Firing Productions (IE) For State At Depth 1 ---
  9180. --- Inner Elaboration Phase, active level 1 (S1) ---
  9181. Firing monitor*world
  9182. -->
  9183. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9184. --- Change Working Memory (IE) ---
  9185. --- END Application Phase ---
  9186. --- Output Phase ---
  9187. ENV: Agent did: predict-no for direction L in state State-A
  9188. In State-A moving L
  9189. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9190. predict error 0
  9191. dir: dir isL
  9192. --- END Output Phase ---
  9193. --- Input Phase ---
  9194. =>WM: (13577: I2 ^dir L)
  9195. =>WM: (13576: I2 ^reward 1)
  9196. =>WM: (13575: I2 ^see 0)
  9197. =>WM: (13574: N966 ^status complete)
  9198. <=WM: (13563: I2 ^dir L)
  9199. <=WM: (13562: I2 ^reward 1)
  9200. <=WM: (13561: I2 ^see 1)
  9201. =>WM: (13578: I2 ^level-1 L0-root)
  9202. <=WM: (13564: I2 ^level-1 L1-root)
  9203. --- END Input Phase ---
  9204. --- Proposal Phase ---
  9205. --- Inner Elaboration Phase, active level 1 (S1) ---
  9206. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  9207. -->
  9208. (S1 ^operator O1931 = 0.3)
  9209. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  9210. -->
  9211. (S1 ^operator O1932 = 0.7449867911055725)
  9212. Firing prefer*rvt*predict-no*H0*2*H1
  9213. -->
  9214. Firing prefer*rvt*predict-yes*H0*1*H1
  9215. -->
  9216. Firing elaborate*copy-see-to-output-link
  9217. -->
  9218. (I3 ^see 0 +)
  9219. Firing elaborate*reward*based*on*reward
  9220. -->
  9221. (R970 ^value 1 +)
  9222. (R1 ^reward R970 +)
  9223. Firing propose*predict-yes
  9224. -->
  9225. (O1933 ^name predict-yes +)
  9226. (S1 ^operator O1933 +)
  9227. Firing propose*predict-no
  9228. -->
  9229. (O1934 ^name predict-no +)
  9230. (S1 ^operator O1934 +)
  9231. Firing rl*prefer*rvt*predict-no*H0*2
  9232. -->
  9233. (S1 ^operator O1932 = 0.255013280266792)
  9234. Firing rl*prefer*rvt*predict-yes*H0*1
  9235. -->
  9236. (S1 ^operator O1931 = 0.5231204697252971)
  9237. Firing prefer*rvt*predict-yes*H0
  9238. -->
  9239. Firing prefer*rvt*predict-no*H0
  9240. -->
  9241. Firing elaborate*copy-dir-to-output-link
  9242. -->
  9243. (I3 ^dir L +)
  9244. inner elaboration loop at bottom goal.
  9245. Retracting elaborate*copy-see-to-output-link
  9246. -->
  9247. (I3 ^see 1 +)
  9248. Retracting propose*predict-no
  9249. -->
  9250. (O1932 ^name predict-no +)
  9251. (S1 ^operator O1932 +)
  9252. Retracting propose*predict-yes
  9253. -->
  9254. (O1931 ^name predict-yes +)
  9255. (S1 ^operator O1931 +)
  9256. Retracting elaborate*reward*based*on*reward
  9257. -->
  9258. (R969 ^value 1 +)
  9259. (R1 ^reward R969 +)
  9260. Retracting elaborate*copy-dir-to-output-link
  9261. -->
  9262. (I3 ^dir L +)
  9263. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  9264. -->
  9265. (S1 ^operator O1932 = 0.7449862824724345)
  9266. Retracting rl*prefer*rvt*predict-no*H0*2
  9267. -->
  9268. (S1 ^operator O1932 = 0.255013280266792)
  9269. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  9270. -->
  9271. (S1 ^operator O1931 = 0.1693592933936033)
  9272. Retracting rl*prefer*rvt*predict-yes*H0*1
  9273. -->
  9274. (S1 ^operator O1931 = 0.5231204697252971)
  9275. =>WM: (13585: S1 ^operator O1934 +)
  9276. =>WM: (13584: S1 ^operator O1933 +)
  9277. =>WM: (13583: O1934 ^name predict-no)
  9278. =>WM: (13582: O1933 ^name predict-yes)
  9279. =>WM: (13581: R970 ^value 1)
  9280. =>WM: (13580: R1 ^reward R970)
  9281. =>WM: (13579: I3 ^see 0)
  9282. <=WM: (13570: S1 ^operator O1931 +)
  9283. <=WM: (13571: S1 ^operator O1932 +)
  9284. <=WM: (13572: S1 ^operator O1932)
  9285. <=WM: (13566: R1 ^reward R969)
  9286. <=WM: (13565: I3 ^see 1)
  9287. <=WM: (13569: O1932 ^name predict-no)
  9288. <=WM: (13568: O1931 ^name predict-yes)
  9289. <=WM: (13567: R969 ^value 1)
  9290. --- Inner Elaboration Phase, active level 1 (S1) ---
  9291. Firing prefer*rvt*predict-yes*H0
  9292. -->
  9293. Firing rl*prefer*rvt*predict-yes*H0*1
  9294. -->
  9295. (S1 ^operator O1933 = 0.5231204697252971)
  9296. Firing prefer*rvt*predict-yes*H0*1*H1
  9297. -->
  9298. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  9299. -->
  9300. (S1 ^operator O1933 = 0.3)
  9301. Firing prefer*rvt*predict-no*H0
  9302. -->
  9303. Firing rl*prefer*rvt*predict-no*H0*2
  9304. -->
  9305. (S1 ^operator O1934 = 0.255013280266792)
  9306. Firing prefer*rvt*predict-no*H0*2*H1
  9307. -->
  9308. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  9309. -->
  9310. (S1 ^operator O1934 = 0.7449867911055725)
  9311. inner elaboration loop at bottom goal.
  9312. Retracting rl*prefer*rvt*predict-no*H0*2
  9313. -->
  9314. (S1 ^operator O1932 = 0.255013280266792)
  9315. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  9316. -->
  9317. (S1 ^operator O1932 = 0.7449867911055725)
  9318. Retracting rl*prefer*rvt*predict-yes*H0*1
  9319. -->
  9320. (S1 ^operator O1931 = 0.5231204697252971)
  9321. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  9322. -->
  9323. (S1 ^operator O1931 = 0.3)
  9324. --- END Proposal Phase ---
  9325. --- Decision Phase ---
  9326. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.914894,0.0782797)
  9327. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  9328. =>WM: (13586: S1 ^operator O1934)
  9329. 967: O: O1934 (predict-no)
  9330. --- END Decision Phase ---
  9331. --- Application Phase ---
  9332. --- Firing Productions (PE) For State At Depth 1 ---
  9333. --- Inner Elaboration Phase, active level 1 (S1) ---
  9334. Firing apply*operator
  9335. -->
  9336. (I3 ^predict-no N967 + :O )
  9337. Firing apply*operator*complete
  9338. -->
  9339. (I3 ^predict-no N966 - :O )
  9340. inner elaboration loop at bottom goal.
  9341. --- Change Working Memory (PE) ---
  9342. =>WM: (13587: I3 ^predict-no N967)
  9343. <=WM: (13574: N966 ^status complete)
  9344. <=WM: (13573: I3 ^predict-no N966)
  9345. --- Firing Productions (IE) For State At Depth 1 ---
  9346. --- Inner Elaboration Phase, active level 1 (S1) ---
  9347. Firing monitor*world
  9348. -->
  9349. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9350. --- Change Working Memory (IE) ---
  9351. --- END Application Phase ---
  9352. --- Output Phase ---
  9353. ENV: Agent did: predict-no for direction L in state State-A
  9354. In State-A moving L
  9355. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9356. predict error 0
  9357. dir: dir isL
  9358. --- END Output Phase ---
  9359. --- Input Phase ---
  9360. =>WM: (13591: I2 ^dir L)
  9361. =>WM: (13590: I2 ^reward 1)
  9362. =>WM: (13589: I2 ^see 0)
  9363. =>WM: (13588: N967 ^status complete)
  9364. <=WM: (13577: I2 ^dir L)
  9365. <=WM: (13576: I2 ^reward 1)
  9366. <=WM: (13575: I2 ^see 0)
  9367. =>WM: (13592: I2 ^level-1 L0-root)
  9368. <=WM: (13578: I2 ^level-1 L0-root)
  9369. --- END Input Phase ---
  9370. --- Proposal Phase ---
  9371. --- Inner Elaboration Phase, active level 1 (S1) ---
  9372. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  9373. -->
  9374. (S1 ^operator O1933 = 0.3)
  9375. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  9376. -->
  9377. (S1 ^operator O1934 = 0.7449867911055725)
  9378. Firing prefer*rvt*predict-no*H0*2*H1
  9379. -->
  9380. Firing prefer*rvt*predict-yes*H0*1*H1
  9381. -->
  9382. Firing elaborate*copy-see-to-output-link
  9383. -->
  9384. (I3 ^see 0 +)
  9385. Firing elaborate*reward*based*on*reward
  9386. -->
  9387. (R971 ^value 1 +)
  9388. (R1 ^reward R971 +)
  9389. Firing propose*predict-yes
  9390. -->
  9391. (O1935 ^name predict-yes +)
  9392. (S1 ^operator O1935 +)
  9393. Firing propose*predict-no
  9394. -->
  9395. (O1936 ^name predict-no +)
  9396. (S1 ^operator O1936 +)
  9397. Firing rl*prefer*rvt*predict-no*H0*2
  9398. -->
  9399. (S1 ^operator O1934 = 0.255013345855908)
  9400. Firing rl*prefer*rvt*predict-yes*H0*1
  9401. -->
  9402. (S1 ^operator O1933 = 0.5231204697252971)
  9403. Firing prefer*rvt*predict-yes*H0
  9404. -->
  9405. Firing prefer*rvt*predict-no*H0
  9406. -->
  9407. Firing elaborate*copy-dir-to-output-link
  9408. -->
  9409. (I3 ^dir L +)
  9410. inner elaboration loop at bottom goal.
  9411. Retracting elaborate*copy-see-to-output-link
  9412. -->
  9413. (I3 ^see 0 +)
  9414. Retracting propose*predict-no
  9415. -->
  9416. (O1934 ^name predict-no +)
  9417. (S1 ^operator O1934 +)
  9418. Retracting propose*predict-yes
  9419. -->
  9420. (O1933 ^name predict-yes +)
  9421. (S1 ^operator O1933 +)
  9422. Retracting elaborate*reward*based*on*reward
  9423. -->
  9424. (R970 ^value 1 +)
  9425. (R1 ^reward R970 +)
  9426. Retracting elaborate*copy-dir-to-output-link
  9427. -->
  9428. (I3 ^dir L +)
  9429. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  9430. -->
  9431. (S1 ^operator O1934 = 0.7449867911055725)
  9432. Retracting rl*prefer*rvt*predict-no*H0*2
  9433. -->
  9434. (S1 ^operator O1934 = 0.255013345855908)
  9435. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  9436. -->
  9437. (S1 ^operator O1933 = 0.3)
  9438. Retracting rl*prefer*rvt*predict-yes*H0*1
  9439. -->
  9440. (S1 ^operator O1933 = 0.5231204697252971)
  9441. =>WM: (13598: S1 ^operator O1936 +)
  9442. =>WM: (13597: S1 ^operator O1935 +)
  9443. =>WM: (13596: O1936 ^name predict-no)
  9444. =>WM: (13595: O1935 ^name predict-yes)
  9445. =>WM: (13594: R971 ^value 1)
  9446. =>WM: (13593: R1 ^reward R971)
  9447. <=WM: (13584: S1 ^operator O1933 +)
  9448. <=WM: (13585: S1 ^operator O1934 +)
  9449. <=WM: (13586: S1 ^operator O1934)
  9450. <=WM: (13580: R1 ^reward R970)
  9451. <=WM: (13583: O1934 ^name predict-no)
  9452. <=WM: (13582: O1933 ^name predict-yes)
  9453. <=WM: (13581: R970 ^value 1)
  9454. --- Inner Elaboration Phase, active level 1 (S1) ---
  9455. Firing prefer*rvt*predict-yes*H0
  9456. -->
  9457. Firing rl*prefer*rvt*predict-yes*H0*1
  9458. -->
  9459. (S1 ^operator O1935 = 0.5231204697252971)
  9460. Firing prefer*rvt*predict-yes*H0*1*H1
  9461. -->
  9462. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  9463. -->
  9464. (S1 ^operator O1935 = 0.3)
  9465. Firing prefer*rvt*predict-no*H0
  9466. -->
  9467. Firing rl*prefer*rvt*predict-no*H0*2
  9468. -->
  9469. (S1 ^operator O1936 = 0.255013345855908)
  9470. Firing prefer*rvt*predict-no*H0*2*H1
  9471. -->
  9472. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  9473. -->
  9474. (S1 ^operator O1936 = 0.7449867911055725)
  9475. inner elaboration loop at bottom goal.
  9476. Retracting rl*prefer*rvt*predict-no*H0*2
  9477. -->
  9478. (S1 ^operator O1934 = 0.255013345855908)
  9479. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  9480. -->
  9481. (S1 ^operator O1934 = 0.7449867911055725)
  9482. Retracting rl*prefer*rvt*predict-yes*H0*1
  9483. -->
  9484. (S1 ^operator O1933 = 0.5231204697252971)
  9485. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  9486. -->
  9487. (S1 ^operator O1933 = 0.3)
  9488. --- END Proposal Phase ---
  9489. --- Decision Phase ---
  9490. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.915344,0.0779016)
  9491. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  9492. =>WM: (13599: S1 ^operator O1936)
  9493. 968: O: O1936 (predict-no)
  9494. --- END Decision Phase ---
  9495. --- Application Phase ---
  9496. --- Firing Productions (PE) For State At Depth 1 ---
  9497. --- Inner Elaboration Phase, active level 1 (S1) ---
  9498. Firing apply*operator
  9499. -->
  9500. (I3 ^predict-no N968 + :O )
  9501. Firing apply*operator*complete
  9502. -->
  9503. (I3 ^predict-no N967 - :O )
  9504. inner elaboration loop at bottom goal.
  9505. --- Change Working Memory (PE) ---
  9506. =>WM: (13600: I3 ^predict-no N968)
  9507. <=WM: (13588: N967 ^status complete)
  9508. <=WM: (13587: I3 ^predict-no N967)
  9509. --- Firing Productions (IE) For State At Depth 1 ---
  9510. --- Inner Elaboration Phase, active level 1 (S1) ---
  9511. Firing monitor*world
  9512. -->
  9513. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9514. --- Change Working Memory (IE) ---
  9515. --- END Application Phase ---
  9516. --- Output Phase ---
  9517. ENV: Agent did: predict-no for direction L in state State-A
  9518. In State-A moving L
  9519. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9520. predict error 0
  9521. dir: dir isR
  9522. --- END Output Phase ---
  9523. --- Input Phase ---
  9524. =>WM: (13604: I2 ^dir R)
  9525. =>WM: (13603: I2 ^reward 1)
  9526. =>WM: (13602: I2 ^see 0)
  9527. =>WM: (13601: N968 ^status complete)
  9528. <=WM: (13591: I2 ^dir L)
  9529. <=WM: (13590: I2 ^reward 1)
  9530. <=WM: (13589: I2 ^see 0)
  9531. =>WM: (13605: I2 ^level-1 L0-root)
  9532. <=WM: (13592: I2 ^level-1 L0-root)
  9533. --- END Input Phase ---
  9534. --- Proposal Phase ---
  9535. --- Inner Elaboration Phase, active level 1 (S1) ---
  9536. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  9537. -->
  9538. (S1 ^operator O1935 = 0.6170812384661459)
  9539. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  9540. -->
  9541. (S1 ^operator O1936 = 0.4910065094545203)
  9542. Firing prefer*rvt*predict-no*H0*4*H1
  9543. -->
  9544. Firing prefer*rvt*predict-yes*H0*3*H1
  9545. -->
  9546. Firing elaborate*copy-see-to-output-link
  9547. -->
  9548. (I3 ^see 0 +)
  9549. Firing elaborate*reward*based*on*reward
  9550. -->
  9551. (R972 ^value 1 +)
  9552. (R1 ^reward R972 +)
  9553. Firing propose*predict-yes
  9554. -->
  9555. (O1937 ^name predict-yes +)
  9556. (S1 ^operator O1937 +)
  9557. Firing propose*predict-no
  9558. -->
  9559. (O1938 ^name predict-no +)
  9560. (S1 ^operator O1938 +)
  9561. Firing rl*prefer*rvt*predict-no*H0*4
  9562. -->
  9563. (S1 ^operator O1936 = 0.1269767780720474)
  9564. Firing rl*prefer*rvt*predict-yes*H0*3
  9565. -->
  9566. (S1 ^operator O1935 = 0.3829340154836592)
  9567. Firing prefer*rvt*predict-yes*H0
  9568. -->
  9569. Firing prefer*rvt*predict-no*H0
  9570. -->
  9571. Firing elaborate*copy-dir-to-output-link
  9572. -->
  9573. (I3 ^dir R +)
  9574. inner elaboration loop at bottom goal.
  9575. Retracting elaborate*copy-see-to-output-link
  9576. -->
  9577. (I3 ^see 0 +)
  9578. Retracting propose*predict-no
  9579. -->
  9580. (O1936 ^name predict-no +)
  9581. (S1 ^operator O1936 +)
  9582. Retracting propose*predict-yes
  9583. -->
  9584. (O1935 ^name predict-yes +)
  9585. (S1 ^operator O1935 +)
  9586. Retracting elaborate*reward*based*on*reward
  9587. -->
  9588. (R971 ^value 1 +)
  9589. (R1 ^reward R971 +)
  9590. Retracting elaborate*copy-dir-to-output-link
  9591. -->
  9592. (I3 ^dir L +)
  9593. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  9594. -->
  9595. (S1 ^operator O1936 = 0.7449867705613504)
  9596. Retracting rl*prefer*rvt*predict-no*H0*2
  9597. -->
  9598. (S1 ^operator O1936 = 0.255013325311686)
  9599. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  9600. -->
  9601. (S1 ^operator O1935 = 0.3)
  9602. Retracting rl*prefer*rvt*predict-yes*H0*1
  9603. -->
  9604. (S1 ^operator O1935 = 0.5231204697252971)
  9605. =>WM: (13612: S1 ^operator O1938 +)
  9606. =>WM: (13611: S1 ^operator O1937 +)
  9607. =>WM: (13610: I3 ^dir R)
  9608. =>WM: (13609: O1938 ^name predict-no)
  9609. =>WM: (13608: O1937 ^name predict-yes)
  9610. =>WM: (13607: R972 ^value 1)
  9611. =>WM: (13606: R1 ^reward R972)
  9612. <=WM: (13597: S1 ^operator O1935 +)
  9613. <=WM: (13598: S1 ^operator O1936 +)
  9614. <=WM: (13599: S1 ^operator O1936)
  9615. <=WM: (13555: I3 ^dir L)
  9616. <=WM: (13593: R1 ^reward R971)
  9617. <=WM: (13596: O1936 ^name predict-no)
  9618. <=WM: (13595: O1935 ^name predict-yes)
  9619. <=WM: (13594: R971 ^value 1)
  9620. --- Inner Elaboration Phase, active level 1 (S1) ---
  9621. Firing prefer*rvt*predict-yes*H0
  9622. -->
  9623. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  9624. -->
  9625. (S1 ^operator O1937 = 0.6170812384661459)
  9626. Firing rl*prefer*rvt*predict-yes*H0*3
  9627. -->
  9628. (S1 ^operator O1937 = 0.3829340154836592)
  9629. Firing prefer*rvt*predict-yes*H0*3*H1
  9630. -->
  9631. Firing prefer*rvt*predict-no*H0
  9632. -->
  9633. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  9634. -->
  9635. (S1 ^operator O1938 = 0.4910065094545203)
  9636. Firing rl*prefer*rvt*predict-no*H0*4
  9637. -->
  9638. (S1 ^operator O1938 = 0.1269767780720474)
  9639. Firing prefer*rvt*predict-no*H0*4*H1
  9640. -->
  9641. inner elaboration loop at bottom goal.
  9642. Retracting rl*prefer*rvt*predict-no*H0*4
  9643. -->
  9644. (S1 ^operator O1936 = 0.1269767780720474)
  9645. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  9646. -->
  9647. (S1 ^operator O1936 = 0.4910065094545203)
  9648. Retracting rl*prefer*rvt*predict-yes*H0*3
  9649. -->
  9650. (S1 ^operator O1935 = 0.3829340154836592)
  9651. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  9652. -->
  9653. (S1 ^operator O1935 = 0.6170812384661459)
  9654. --- END Proposal Phase ---
  9655. --- Decision Phase ---
  9656. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.915789,0.0775272)
  9657. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  9658. =>WM: (13613: S1 ^operator O1937)
  9659. 969: O: O1937 (predict-yes)
  9660. --- END Decision Phase ---
  9661. --- Application Phase ---
  9662. --- Firing Productions (PE) For State At Depth 1 ---
  9663. --- Inner Elaboration Phase, active level 1 (S1) ---
  9664. Firing apply*operator
  9665. -->
  9666. (I3 ^predict-yes N969 + :O )
  9667. Firing apply*operator*complete
  9668. -->
  9669. (I3 ^predict-no N968 - :O )
  9670. inner elaboration loop at bottom goal.
  9671. --- Change Working Memory (PE) ---
  9672. =>WM: (13614: I3 ^predict-yes N969)
  9673. <=WM: (13601: N968 ^status complete)
  9674. <=WM: (13600: I3 ^predict-no N968)
  9675. --- Firing Productions (IE) For State At Depth 1 ---
  9676. --- Inner Elaboration Phase, active level 1 (S1) ---
  9677. Firing monitor*world
  9678. -->
  9679. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9680. --- Change Working Memory (IE) ---
  9681. --- END Application Phase ---
  9682. --- Output Phase ---
  9683. ENV: Agent did: predict-yes for direction R in state State-A
  9684. In State-A moving R
  9685. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9686. predict error 0
  9687. dir: dir isU
  9688. --- END Output Phase ---
  9689. --- Input Phase ---
  9690. =>WM: (13618: I2 ^dir U)
  9691. =>WM: (13617: I2 ^reward 1)
  9692. =>WM: (13616: I2 ^see 1)
  9693. =>WM: (13615: N969 ^status complete)
  9694. <=WM: (13604: I2 ^dir R)
  9695. <=WM: (13603: I2 ^reward 1)
  9696. <=WM: (13602: I2 ^see 0)
  9697. =>WM: (13619: I2 ^level-1 R1-root)
  9698. <=WM: (13605: I2 ^level-1 L0-root)
  9699. --- END Input Phase ---
  9700. --- Proposal Phase ---
  9701. --- Inner Elaboration Phase, active level 1 (S1) ---
  9702. Firing elaborate*copy-see-to-output-link
  9703. -->
  9704. (I3 ^see 1 +)
  9705. Firing elaborate*reward*based*on*reward
  9706. -->
  9707. (R973 ^value 1 +)
  9708. (R1 ^reward R973 +)
  9709. Firing propose*predict-yes
  9710. -->
  9711. (O1939 ^name predict-yes +)
  9712. (S1 ^operator O1939 +)
  9713. Firing propose*predict-no
  9714. -->
  9715. (O1940 ^name predict-no +)
  9716. (S1 ^operator O1940 +)
  9717. Firing rl*prefer*rvt*predict-no*H0*6
  9718. -->
  9719. (S1 ^operator O1938 = 0.9999999999999999)
  9720. Firing rl*prefer*rvt*predict-yes*H0*5
  9721. -->
  9722. (S1 ^operator O1937 = 0.)
  9723. Firing prefer*rvt*predict-yes*H0
  9724. -->
  9725. Firing prefer*rvt*predict-no*H0
  9726. -->
  9727. Firing elaborate*copy-dir-to-output-link
  9728. -->
  9729. (I3 ^dir U +)
  9730. inner elaboration loop at bottom goal.
  9731. Retracting elaborate*copy-see-to-output-link
  9732. -->
  9733. (I3 ^see 0 +)
  9734. Retracting propose*predict-no
  9735. -->
  9736. (O1938 ^name predict-no +)
  9737. (S1 ^operator O1938 +)
  9738. Retracting propose*predict-yes
  9739. -->
  9740. (O1937 ^name predict-yes +)
  9741. (S1 ^operator O1937 +)
  9742. Retracting elaborate*reward*based*on*reward
  9743. -->
  9744. (R972 ^value 1 +)
  9745. (R1 ^reward R972 +)
  9746. Retracting elaborate*copy-dir-to-output-link
  9747. -->
  9748. (I3 ^dir R +)
  9749. Retracting rl*prefer*rvt*predict-no*H0*4
  9750. -->
  9751. (S1 ^operator O1938 = 0.1269767780720474)
  9752. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  9753. -->
  9754. (S1 ^operator O1938 = 0.4910065094545203)
  9755. Retracting rl*prefer*rvt*predict-yes*H0*3
  9756. -->
  9757. (S1 ^operator O1937 = 0.3829340154836592)
  9758. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  9759. -->
  9760. (S1 ^operator O1937 = 0.6170812384661459)
  9761. =>WM: (13627: S1 ^operator O1940 +)
  9762. =>WM: (13626: S1 ^operator O1939 +)
  9763. =>WM: (13625: I3 ^dir U)
  9764. =>WM: (13624: O1940 ^name predict-no)
  9765. =>WM: (13623: O1939 ^name predict-yes)
  9766. =>WM: (13622: R973 ^value 1)
  9767. =>WM: (13621: R1 ^reward R973)
  9768. =>WM: (13620: I3 ^see 1)
  9769. <=WM: (13611: S1 ^operator O1937 +)
  9770. <=WM: (13613: S1 ^operator O1937)
  9771. <=WM: (13612: S1 ^operator O1938 +)
  9772. <=WM: (13610: I3 ^dir R)
  9773. <=WM: (13606: R1 ^reward R972)
  9774. <=WM: (13579: I3 ^see 0)
  9775. <=WM: (13609: O1938 ^name predict-no)
  9776. <=WM: (13608: O1937 ^name predict-yes)
  9777. <=WM: (13607: R972 ^value 1)
  9778. --- Inner Elaboration Phase, active level 1 (S1) ---
  9779. Firing prefer*rvt*predict-yes*H0
  9780. -->
  9781. Firing rl*prefer*rvt*predict-yes*H0*5
  9782. -->
  9783. (S1 ^operator O1939 = 0.)
  9784. Firing prefer*rvt*predict-no*H0
  9785. -->
  9786. Firing rl*prefer*rvt*predict-no*H0*6
  9787. -->
  9788. (S1 ^operator O1940 = 0.9999999999999999)
  9789. inner elaboration loop at bottom goal.
  9790. Retracting rl*prefer*rvt*predict-no*H0*6
  9791. -->
  9792. (S1 ^operator O1938 = 0.9999999999999999)
  9793. Retracting rl*prefer*rvt*predict-yes*H0*5
  9794. -->
  9795. (S1 ^operator O1937 = 0.)
  9796. --- END Proposal Phase ---
  9797. --- Decision Phase ---
  9798. RL update rl*prefer*rvt*predict-yes*H0*3 0.673128 -0.290194 0.382934 -> 0.673126 -0.290194 0.382932(R,m,v=1,0.959732,0.038908)
  9799. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326886 0.290195 0.617081 -> 0.326884 0.290195 0.617079(R,m,v=1,1,0)
  9800. =>WM: (13628: S1 ^operator O1940)
  9801. 970: O: O1940 (predict-no)
  9802. --- END Decision Phase ---
  9803. --- Application Phase ---
  9804. --- Firing Productions (PE) For State At Depth 1 ---
  9805. --- Inner Elaboration Phase, active level 1 (S1) ---
  9806. Firing apply*operator
  9807. -->
  9808. (I3 ^predict-no N970 + :O )
  9809. Firing apply*operator*complete
  9810. -->
  9811. (I3 ^predict-yes N969 - :O )
  9812. inner elaboration loop at bottom goal.
  9813. --- Change Working Memory (PE) ---
  9814. =>WM: (13629: I3 ^predict-no N970)
  9815. <=WM: (13615: N969 ^status complete)
  9816. <=WM: (13614: I3 ^predict-yes N969)
  9817. --- Firing Productions (IE) For State At Depth 1 ---
  9818. --- Inner Elaboration Phase, active level 1 (S1) ---
  9819. Firing monitor*world
  9820. -->
  9821. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9822. --- Change Working Memory (IE) ---
  9823. --- END Application Phase ---
  9824. --- Output Phase ---
  9825. ENV: Agent did: predict-no for direction U in state State-B
  9826. In State-B moving U
  9827. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9828. predict error 0
  9829. dir: dir isL
  9830. --- END Output Phase ---
  9831. --- Input Phase ---
  9832. =>WM: (13633: I2 ^dir L)
  9833. =>WM: (13632: I2 ^reward 1)
  9834. =>WM: (13631: I2 ^see 0)
  9835. =>WM: (13630: N970 ^status complete)
  9836. <=WM: (13618: I2 ^dir U)
  9837. <=WM: (13617: I2 ^reward 1)
  9838. <=WM: (13616: I2 ^see 1)
  9839. =>WM: (13634: I2 ^level-1 R1-root)
  9840. <=WM: (13619: I2 ^level-1 R1-root)
  9841. --- END Input Phase ---
  9842. --- Proposal Phase ---
  9843. --- Inner Elaboration Phase, active level 1 (S1) ---
  9844. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  9845. -->
  9846. (S1 ^operator O1939 = 0.4768771233214331)
  9847. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  9848. -->
  9849. (S1 ^operator O1940 = -0.01194930198035649)
  9850. Firing prefer*rvt*predict-no*H0*2*H1
  9851. -->
  9852. Firing prefer*rvt*predict-yes*H0*1*H1
  9853. -->
  9854. Firing elaborate*copy-see-to-output-link
  9855. -->
  9856. (I3 ^see 0 +)
  9857. Firing elaborate*reward*based*on*reward
  9858. -->
  9859. (R974 ^value 1 +)
  9860. (R1 ^reward R974 +)
  9861. Firing propose*predict-yes
  9862. -->
  9863. (O1941 ^name predict-yes +)
  9864. (S1 ^operator O1941 +)
  9865. Firing propose*predict-no
  9866. -->
  9867. (O1942 ^name predict-no +)
  9868. (S1 ^operator O1942 +)
  9869. Firing rl*prefer*rvt*predict-no*H0*2
  9870. -->
  9871. (S1 ^operator O1940 = 0.2550133109307305)
  9872. Firing rl*prefer*rvt*predict-yes*H0*1
  9873. -->
  9874. (S1 ^operator O1939 = 0.5231204697252971)
  9875. Firing prefer*rvt*predict-yes*H0
  9876. -->
  9877. Firing prefer*rvt*predict-no*H0
  9878. -->
  9879. Firing elaborate*copy-dir-to-output-link
  9880. -->
  9881. (I3 ^dir L +)
  9882. inner elaboration loop at bottom goal.
  9883. Retracting elaborate*copy-see-to-output-link
  9884. -->
  9885. (I3 ^see 1 +)
  9886. Retracting propose*predict-no
  9887. -->
  9888. (O1940 ^name predict-no +)
  9889. (S1 ^operator O1940 +)
  9890. Retracting propose*predict-yes
  9891. -->
  9892. (O1939 ^name predict-yes +)
  9893. (S1 ^operator O1939 +)
  9894. Retracting elaborate*reward*based*on*reward
  9895. -->
  9896. (R973 ^value 1 +)
  9897. (R1 ^reward R973 +)
  9898. Retracting elaborate*copy-dir-to-output-link
  9899. -->
  9900. (I3 ^dir U +)
  9901. Retracting rl*prefer*rvt*predict-no*H0*6
  9902. -->
  9903. (S1 ^operator O1940 = 0.9999999999999999)
  9904. Retracting rl*prefer*rvt*predict-yes*H0*5
  9905. -->
  9906. (S1 ^operator O1939 = 0.)
  9907. =>WM: (13642: S1 ^operator O1942 +)
  9908. =>WM: (13641: S1 ^operator O1941 +)
  9909. =>WM: (13640: I3 ^dir L)
  9910. =>WM: (13639: O1942 ^name predict-no)
  9911. =>WM: (13638: O1941 ^name predict-yes)
  9912. =>WM: (13637: R974 ^value 1)
  9913. =>WM: (13636: R1 ^reward R974)
  9914. =>WM: (13635: I3 ^see 0)
  9915. <=WM: (13626: S1 ^operator O1939 +)
  9916. <=WM: (13627: S1 ^operator O1940 +)
  9917. <=WM: (13628: S1 ^operator O1940)
  9918. <=WM: (13625: I3 ^dir U)
  9919. <=WM: (13621: R1 ^reward R973)
  9920. <=WM: (13620: I3 ^see 1)
  9921. <=WM: (13624: O1940 ^name predict-no)
  9922. <=WM: (13623: O1939 ^name predict-yes)
  9923. <=WM: (13622: R973 ^value 1)
  9924. --- Inner Elaboration Phase, active level 1 (S1) ---
  9925. Firing prefer*rvt*predict-yes*H0
  9926. -->
  9927. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  9928. -->
  9929. (S1 ^operator O1941 = 0.4768771233214331)
  9930. Firing rl*prefer*rvt*predict-yes*H0*1
  9931. -->
  9932. (S1 ^operator O1941 = 0.5231204697252971)
  9933. Firing prefer*rvt*predict-yes*H0*1*H1
  9934. -->
  9935. Firing prefer*rvt*predict-no*H0
  9936. -->
  9937. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  9938. -->
  9939. (S1 ^operator O1942 = -0.01194930198035649)
  9940. Firing rl*prefer*rvt*predict-no*H0*2
  9941. -->
  9942. (S1 ^operator O1942 = 0.2550133109307305)
  9943. Firing prefer*rvt*predict-no*H0*2*H1
  9944. -->
  9945. inner elaboration loop at bottom goal.
  9946. Retracting rl*prefer*rvt*predict-no*H0*2
  9947. -->
  9948. (S1 ^operator O1940 = 0.2550133109307305)
  9949. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  9950. -->
  9951. (S1 ^operator O1940 = -0.01194930198035649)
  9952. Retracting rl*prefer*rvt*predict-yes*H0*1
  9953. -->
  9954. (S1 ^operator O1939 = 0.5231204697252971)
  9955. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  9956. -->
  9957. (S1 ^operator O1939 = 0.4768771233214331)
  9958. --- END Proposal Phase ---
  9959. --- Decision Phase ---
  9960. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9961. =>WM: (13643: S1 ^operator O1941)
  9962. 971: O: O1941 (predict-yes)
  9963. --- END Decision Phase ---
  9964. --- Application Phase ---
  9965. --- Firing Productions (PE) For State At Depth 1 ---
  9966. --- Inner Elaboration Phase, active level 1 (S1) ---
  9967. Firing apply*operator
  9968. -->
  9969. (I3 ^predict-yes N971 + :O )
  9970. Firing apply*operator*complete
  9971. -->
  9972. (I3 ^predict-no N970 - :O )
  9973. inner elaboration loop at bottom goal.
  9974. --- Change Working Memory (PE) ---
  9975. =>WM: (13644: I3 ^predict-yes N971)
  9976. <=WM: (13630: N970 ^status complete)
  9977. <=WM: (13629: I3 ^predict-no N970)
  9978. --- Firing Productions (IE) For State At Depth 1 ---
  9979. --- Inner Elaboration Phase, active level 1 (S1) ---
  9980. Firing monitor*world
  9981. -->
  9982. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9983. --- Change Working Memory (IE) ---
  9984. --- END Application Phase ---
  9985. --- Output Phase ---
  9986. ENV: Agent did: predict-yes for direction L in state State-B
  9987. In State-B moving L
  9988. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9989. predict error 0
  9990. dir: dir isL
  9991. --- END Output Phase ---
  9992. --- Input Phase ---
  9993. =>WM: (13648: I2 ^dir L)
  9994. =>WM: (13647: I2 ^reward 1)
  9995. =>WM: (13646: I2 ^see 1)
  9996. =>WM: (13645: N971 ^status complete)
  9997. <=WM: (13633: I2 ^dir L)
  9998. <=WM: (13632: I2 ^reward 1)
  9999. <=WM: (13631: I2 ^see 0)
  10000. =>WM: (13649: I2 ^level-1 L1-root)
  10001. <=WM: (13634: I2 ^level-1 R1-root)
  10002. --- END Input Phase ---
  10003. --- Proposal Phase ---
  10004. --- Inner Elaboration Phase, active level 1 (S1) ---
  10005. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  10006. -->
  10007. (S1 ^operator O1941 = 0.1693592933936033)
  10008. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  10009. -->
  10010. (S1 ^operator O1942 = 0.7449863480615504)
  10011. Firing prefer*rvt*predict-no*H0*2*H1
  10012. -->
  10013. Firing prefer*rvt*predict-yes*H0*1*H1
  10014. -->
  10015. Firing elaborate*copy-see-to-output-link
  10016. -->
  10017. (I3 ^see 1 +)
  10018. Firing elaborate*reward*based*on*reward
  10019. -->
  10020. (R975 ^value 1 +)
  10021. (R1 ^reward R975 +)
  10022. Firing propose*predict-yes
  10023. -->
  10024. (O1943 ^name predict-yes +)
  10025. (S1 ^operator O1943 +)
  10026. Firing propose*predict-no
  10027. -->
  10028. (O1944 ^name predict-no +)
  10029. (S1 ^operator O1944 +)
  10030. Firing rl*prefer*rvt*predict-no*H0*2
  10031. -->
  10032. (S1 ^operator O1942 = 0.2550133109307305)
  10033. Firing rl*prefer*rvt*predict-yes*H0*1
  10034. -->
  10035. (S1 ^operator O1941 = 0.5231204697252971)
  10036. Firing prefer*rvt*predict-yes*H0
  10037. -->
  10038. Firing prefer*rvt*predict-no*H0
  10039. -->
  10040. Firing elaborate*copy-dir-to-output-link
  10041. -->
  10042. (I3 ^dir L +)
  10043. inner elaboration loop at bottom goal.
  10044. Retracting elaborate*copy-see-to-output-link
  10045. -->
  10046. (I3 ^see 0 +)
  10047. Retracting propose*predict-no
  10048. -->
  10049. (O1942 ^name predict-no +)
  10050. (S1 ^operator O1942 +)
  10051. Retracting propose*predict-yes
  10052. -->
  10053. (O1941 ^name predict-yes +)
  10054. (S1 ^operator O1941 +)
  10055. Retracting elaborate*reward*based*on*reward
  10056. -->
  10057. (R974 ^value 1 +)
  10058. (R1 ^reward R974 +)
  10059. Retracting elaborate*copy-dir-to-output-link
  10060. -->
  10061. (I3 ^dir L +)
  10062. Retracting rl*prefer*rvt*predict-no*H0*2
  10063. -->
  10064. (S1 ^operator O1942 = 0.2550133109307305)
  10065. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  10066. -->
  10067. (S1 ^operator O1942 = -0.01194930198035649)
  10068. Retracting rl*prefer*rvt*predict-yes*H0*1
  10069. -->
  10070. (S1 ^operator O1941 = 0.5231204697252971)
  10071. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  10072. -->
  10073. (S1 ^operator O1941 = 0.4768771233214331)
  10074. =>WM: (13656: S1 ^operator O1944 +)
  10075. =>WM: (13655: S1 ^operator O1943 +)
  10076. =>WM: (13654: O1944 ^name predict-no)
  10077. =>WM: (13653: O1943 ^name predict-yes)
  10078. =>WM: (13652: R975 ^value 1)
  10079. =>WM: (13651: R1 ^reward R975)
  10080. =>WM: (13650: I3 ^see 1)
  10081. <=WM: (13641: S1 ^operator O1941 +)
  10082. <=WM: (13643: S1 ^operator O1941)
  10083. <=WM: (13642: S1 ^operator O1942 +)
  10084. <=WM: (13636: R1 ^reward R974)
  10085. <=WM: (13635: I3 ^see 0)
  10086. <=WM: (13639: O1942 ^name predict-no)
  10087. <=WM: (13638: O1941 ^name predict-yes)
  10088. <=WM: (13637: R974 ^value 1)
  10089. --- Inner Elaboration Phase, active level 1 (S1) ---
  10090. Firing prefer*rvt*predict-yes*H0
  10091. -->
  10092. Firing rl*prefer*rvt*predict-yes*H0*1
  10093. -->
  10094. (S1 ^operator O1943 = 0.5231204697252971)
  10095. Firing prefer*rvt*predict-yes*H0*1*H1
  10096. -->
  10097. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  10098. -->
  10099. (S1 ^operator O1943 = 0.1693592933936033)
  10100. Firing prefer*rvt*predict-no*H0
  10101. -->
  10102. Firing rl*prefer*rvt*predict-no*H0*2
  10103. -->
  10104. (S1 ^operator O1944 = 0.2550133109307305)
  10105. Firing prefer*rvt*predict-no*H0*2*H1
  10106. -->
  10107. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  10108. -->
  10109. (S1 ^operator O1944 = 0.7449863480615504)
  10110. inner elaboration loop at bottom goal.
  10111. Retracting rl*prefer*rvt*predict-no*H0*2
  10112. -->
  10113. (S1 ^operator O1942 = 0.2550133109307305)
  10114. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  10115. -->
  10116. (S1 ^operator O1942 = 0.7449863480615504)
  10117. Retracting rl*prefer*rvt*predict-yes*H0*1
  10118. -->
  10119. (S1 ^operator O1941 = 0.5231204697252971)
  10120. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  10121. -->
  10122. (S1 ^operator O1941 = 0.1693592933936033)
  10123. --- END Proposal Phase ---
  10124. --- Decision Phase ---
  10125. RL update rl*prefer*rvt*predict-yes*H0*1 0.727961 -0.20484 0.52312 -> 0.727961 -0.20484 0.523121(R,m,v=1,0.978417,0.0212699)
  10126. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272036 0.204841 0.476877 -> 0.272037 0.204841 0.476877(R,m,v=1,1,0)
  10127. =>WM: (13657: S1 ^operator O1944)
  10128. 972: O: O1944 (predict-no)
  10129. --- END Decision Phase ---
  10130. --- Application Phase ---
  10131. --- Firing Productions (PE) For State At Depth 1 ---
  10132. --- Inner Elaboration Phase, active level 1 (S1) ---
  10133. Firing apply*operator
  10134. -->
  10135. (I3 ^predict-no N972 + :O )
  10136. Firing apply*operator*complete
  10137. -->
  10138. (I3 ^predict-yes N971 - :O )
  10139. inner elaboration loop at bottom goal.
  10140. --- Change Working Memory (PE) ---
  10141. =>WM: (13658: I3 ^predict-no N972)
  10142. <=WM: (13645: N971 ^status complete)
  10143. <=WM: (13644: I3 ^predict-yes N971)
  10144. --- Firing Productions (IE) For State At Depth 1 ---
  10145. --- Inner Elaboration Phase, active level 1 (S1) ---
  10146. Firing monitor*world
  10147. -->
  10148. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10149. --- Change Working Memory (IE) ---
  10150. --- END Application Phase ---
  10151. --- Output Phase ---
  10152. ENV: Agent did: predict-no for direction L in state State-A
  10153. In State-A moving L
  10154. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10155. predict error 0
  10156. dir: dir isU
  10157. --- END Output Phase ---
  10158. --- Input Phase ---
  10159. =>WM: (13662: I2 ^dir U)
  10160. =>WM: (13661: I2 ^reward 1)
  10161. =>WM: (13660: I2 ^see 0)
  10162. =>WM: (13659: N972 ^status complete)
  10163. <=WM: (13648: I2 ^dir L)
  10164. <=WM: (13647: I2 ^reward 1)
  10165. <=WM: (13646: I2 ^see 1)
  10166. =>WM: (13663: I2 ^level-1 L0-root)
  10167. <=WM: (13649: I2 ^level-1 L1-root)
  10168. --- END Input Phase ---
  10169. --- Proposal Phase ---
  10170. --- Inner Elaboration Phase, active level 1 (S1) ---
  10171. Firing elaborate*copy-see-to-output-link
  10172. -->
  10173. (I3 ^see 0 +)
  10174. Firing elaborate*reward*based*on*reward
  10175. -->
  10176. (R976 ^value 1 +)
  10177. (R1 ^reward R976 +)
  10178. Firing propose*predict-yes
  10179. -->
  10180. (O1945 ^name predict-yes +)
  10181. (S1 ^operator O1945 +)
  10182. Firing propose*predict-no
  10183. -->
  10184. (O1946 ^name predict-no +)
  10185. (S1 ^operator O1946 +)
  10186. Firing rl*prefer*rvt*predict-no*H0*6
  10187. -->
  10188. (S1 ^operator O1944 = 0.9999999999999999)
  10189. Firing rl*prefer*rvt*predict-yes*H0*5
  10190. -->
  10191. (S1 ^operator O1943 = 0.)
  10192. Firing prefer*rvt*predict-yes*H0
  10193. -->
  10194. Firing prefer*rvt*predict-no*H0
  10195. -->
  10196. Firing elaborate*copy-dir-to-output-link
  10197. -->
  10198. (I3 ^dir U +)
  10199. inner elaboration loop at bottom goal.
  10200. Retracting elaborate*copy-see-to-output-link
  10201. -->
  10202. (I3 ^see 1 +)
  10203. Retracting propose*predict-no
  10204. -->
  10205. (O1944 ^name predict-no +)
  10206. (S1 ^operator O1944 +)
  10207. Retracting propose*predict-yes
  10208. -->
  10209. (O1943 ^name predict-yes +)
  10210. (S1 ^operator O1943 +)
  10211. Retracting elaborate*reward*based*on*reward
  10212. -->
  10213. (R975 ^value 1 +)
  10214. (R1 ^reward R975 +)
  10215. Retracting elaborate*copy-dir-to-output-link
  10216. -->
  10217. (I3 ^dir L +)
  10218. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  10219. -->
  10220. (S1 ^operator O1944 = 0.7449863480615504)
  10221. Retracting rl*prefer*rvt*predict-no*H0*2
  10222. -->
  10223. (S1 ^operator O1944 = 0.2550133109307305)
  10224. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  10225. -->
  10226. (S1 ^operator O1943 = 0.1693592933936033)
  10227. Retracting rl*prefer*rvt*predict-yes*H0*1
  10228. -->
  10229. (S1 ^operator O1943 = 0.5231208307682875)
  10230. =>WM: (13671: S1 ^operator O1946 +)
  10231. =>WM: (13670: S1 ^operator O1945 +)
  10232. =>WM: (13669: I3 ^dir U)
  10233. =>WM: (13668: O1946 ^name predict-no)
  10234. =>WM: (13667: O1945 ^name predict-yes)
  10235. =>WM: (13666: R976 ^value 1)
  10236. =>WM: (13665: R1 ^reward R976)
  10237. =>WM: (13664: I3 ^see 0)
  10238. <=WM: (13655: S1 ^operator O1943 +)
  10239. <=WM: (13656: S1 ^operator O1944 +)
  10240. <=WM: (13657: S1 ^operator O1944)
  10241. <=WM: (13640: I3 ^dir L)
  10242. <=WM: (13651: R1 ^reward R975)
  10243. <=WM: (13650: I3 ^see 1)
  10244. <=WM: (13654: O1944 ^name predict-no)
  10245. <=WM: (13653: O1943 ^name predict-yes)
  10246. <=WM: (13652: R975 ^value 1)
  10247. --- Inner Elaboration Phase, active level 1 (S1) ---
  10248. Firing prefer*rvt*predict-yes*H0
  10249. -->
  10250. Firing rl*prefer*rvt*predict-yes*H0*5
  10251. -->
  10252. (S1 ^operator O1945 = 0.)
  10253. Firing prefer*rvt*predict-no*H0
  10254. -->
  10255. Firing rl*prefer*rvt*predict-no*H0*6
  10256. -->
  10257. (S1 ^operator O1946 = 0.9999999999999999)
  10258. inner elaboration loop at bottom goal.
  10259. Retracting rl*prefer*rvt*predict-no*H0*6
  10260. -->
  10261. (S1 ^operator O1944 = 0.9999999999999999)
  10262. Retracting rl*prefer*rvt*predict-yes*H0*5
  10263. -->
  10264. (S1 ^operator O1943 = 0.)
  10265. --- END Proposal Phase ---
  10266. --- Decision Phase ---
  10267. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.91623,0.0771562)
  10268. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  10269. =>WM: (13672: S1 ^operator O1946)
  10270. 973: O: O1946 (predict-no)
  10271. --- END Decision Phase ---
  10272. --- Application Phase ---
  10273. --- Firing Productions (PE) For State At Depth 1 ---
  10274. --- Inner Elaboration Phase, active level 1 (S1) ---
  10275. Firing apply*operator
  10276. -->
  10277. (I3 ^predict-no N973 + :O )
  10278. Firing apply*operator*complete
  10279. -->
  10280. (I3 ^predict-no N972 - :O )
  10281. inner elaboration loop at bottom goal.
  10282. --- Change Working Memory (PE) ---
  10283. =>WM: (13673: I3 ^predict-no N973)
  10284. <=WM: (13659: N972 ^status complete)
  10285. <=WM: (13658: I3 ^predict-no N972)
  10286. --- Firing Productions (IE) For State At Depth 1 ---
  10287. --- Inner Elaboration Phase, active level 1 (S1) ---
  10288. Firing monitor*world
  10289. -->
  10290. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10291. --- Change Working Memory (IE) ---
  10292. --- END Application Phase ---
  10293. --- Output Phase ---
  10294. ENV: Agent did: predict-no for direction U in state State-A
  10295. In State-A moving U
  10296. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10297. predict error 0
  10298. dir: dir isU
  10299. --- END Output Phase ---
  10300. |\-/--- Input Phase ---
  10301. =>WM: (13677: I2 ^dir U)
  10302. =>WM: (13676: I2 ^reward 1)
  10303. =>WM: (13675: I2 ^see 0)
  10304. =>WM: (13674: N973 ^status complete)
  10305. <=WM: (13662: I2 ^dir U)
  10306. <=WM: (13661: I2 ^reward 1)
  10307. <=WM: (13660: I2 ^see 0)
  10308. =>WM: (13678: I2 ^level-1 L0-root)
  10309. <=WM: (13663: I2 ^level-1 L0-root)
  10310. --- END Input Phase ---
  10311. --- Proposal Phase ---
  10312. --- Inner Elaboration Phase, active level 1 (S1) ---
  10313. Firing elaborate*copy-see-to-output-link
  10314. -->
  10315. (I3 ^see 0 +)
  10316. Firing elaborate*reward*based*on*reward
  10317. -->
  10318. (R977 ^value 1 +)
  10319. (R1 ^reward R977 +)
  10320. Firing propose*predict-yes
  10321. -->
  10322. (O1947 ^name predict-yes +)
  10323. (S1 ^operator O1947 +)
  10324. Firing propose*predict-no
  10325. -->
  10326. (O1948 ^name predict-no +)
  10327. (S1 ^operator O1948 +)
  10328. Firing rl*prefer*rvt*predict-no*H0*6
  10329. -->
  10330. (S1 ^operator O1946 = 0.9999999999999999)
  10331. Firing rl*prefer*rvt*predict-yes*H0*5
  10332. -->
  10333. (S1 ^operator O1945 = 0.)
  10334. Firing prefer*rvt*predict-yes*H0
  10335. -->
  10336. Firing prefer*rvt*predict-no*H0
  10337. -->
  10338. Firing elaborate*copy-dir-to-output-link
  10339. -->
  10340. (I3 ^dir U +)
  10341. inner elaboration loop at bottom goal.
  10342. Retracting elaborate*copy-see-to-output-link
  10343. -->
  10344. (I3 ^see 0 +)
  10345. Retracting propose*predict-no
  10346. -->
  10347. (O1946 ^name predict-no +)
  10348. (S1 ^operator O1946 +)
  10349. Retracting propose*predict-yes
  10350. -->
  10351. (O1945 ^name predict-yes +)
  10352. (S1 ^operator O1945 +)
  10353. Retracting elaborate*reward*based*on*reward
  10354. -->
  10355. (R976 ^value 1 +)
  10356. (R1 ^reward R976 +)
  10357. Retracting elaborate*copy-dir-to-output-link
  10358. -->
  10359. (I3 ^dir U +)
  10360. Retracting rl*prefer*rvt*predict-no*H0*6
  10361. -->
  10362. (S1 ^operator O1946 = 0.9999999999999999)
  10363. Retracting rl*prefer*rvt*predict-yes*H0*5
  10364. -->
  10365. (S1 ^operator O1945 = 0.)
  10366. =>WM: (13684: S1 ^operator O1948 +)
  10367. =>WM: (13683: S1 ^operator O1947 +)
  10368. =>WM: (13682: O1948 ^name predict-no)
  10369. =>WM: (13681: O1947 ^name predict-yes)
  10370. =>WM: (13680: R977 ^value 1)
  10371. =>WM: (13679: R1 ^reward R977)
  10372. <=WM: (13670: S1 ^operator O1945 +)
  10373. <=WM: (13671: S1 ^operator O1946 +)
  10374. <=WM: (13672: S1 ^operator O1946)
  10375. <=WM: (13665: R1 ^reward R976)
  10376. <=WM: (13668: O1946 ^name predict-no)
  10377. <=WM: (13667: O1945 ^name predict-yes)
  10378. <=WM: (13666: R976 ^value 1)
  10379. --- Inner Elaboration Phase, active level 1 (S1) ---
  10380. Firing prefer*rvt*predict-yes*H0
  10381. -->
  10382. Firing rl*prefer*rvt*predict-yes*H0*5
  10383. -->
  10384. (S1 ^operator O1947 = 0.)
  10385. Firing prefer*rvt*predict-no*H0
  10386. -->
  10387. Firing rl*prefer*rvt*predict-no*H0*6
  10388. -->
  10389. (S1 ^operator O1948 = 0.9999999999999999)
  10390. inner elaboration loop at bottom goal.
  10391. Retracting rl*prefer*rvt*predict-no*H0*6
  10392. -->
  10393. (S1 ^operator O1946 = 0.9999999999999999)
  10394. Retracting rl*prefer*rvt*predict-yes*H0*5
  10395. -->
  10396. (S1 ^operator O1945 = 0.)
  10397. --- END Proposal Phase ---
  10398. --- Decision Phase ---
  10399. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10400. =>WM: (13685: S1 ^operator O1948)
  10401. 974: O: O1948 (predict-no)
  10402. --- END Decision Phase ---
  10403. --- Application Phase ---
  10404. --- Firing Productions (PE) For State At Depth 1 ---
  10405. --- Inner Elaboration Phase, active level 1 (S1) ---
  10406. Firing apply*operator
  10407. -->
  10408. (I3 ^predict-no N974 + :O )
  10409. Firing apply*operator*complete
  10410. -->
  10411. (I3 ^predict-no N973 - :O )
  10412. inner elaboration loop at bottom goal.
  10413. --- Change Working Memory (PE) ---
  10414. =>WM: (13686: I3 ^predict-no N974)
  10415. <=WM: (13674: N973 ^status complete)
  10416. <=WM: (13673: I3 ^predict-no N973)
  10417. --- Firing Productions (IE) For State At Depth 1 ---
  10418. --- Inner Elaboration Phase, active level 1 (S1) ---
  10419. Firing monitor*world
  10420. -->
  10421. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10422. --- Change Working Memory (IE) ---
  10423. --- END Application Phase ---
  10424. --- Output Phase ---
  10425. ENV: Agent did: predict-no for direction U in state State-A
  10426. In State-A moving U
  10427. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10428. predict error 0
  10429. dir: dir isU
  10430. --- END Output Phase ---
  10431. |\--- Input Phase ---
  10432. =>WM: (13690: I2 ^dir U)
  10433. =>WM: (13689: I2 ^reward 1)
  10434. =>WM: (13688: I2 ^see 0)
  10435. =>WM: (13687: N974 ^status complete)
  10436. <=WM: (13677: I2 ^dir U)
  10437. <=WM: (13676: I2 ^reward 1)
  10438. <=WM: (13675: I2 ^see 0)
  10439. =>WM: (13691: I2 ^level-1 L0-root)
  10440. <=WM: (13678: I2 ^level-1 L0-root)
  10441. --- END Input Phase ---
  10442. --- Proposal Phase ---
  10443. --- Inner Elaboration Phase, active level 1 (S1) ---
  10444. Firing elaborate*copy-see-to-output-link
  10445. -->
  10446. (I3 ^see 0 +)
  10447. Firing elaborate*reward*based*on*reward
  10448. -->
  10449. (R978 ^value 1 +)
  10450. (R1 ^reward R978 +)
  10451. Firing propose*predict-yes
  10452. -->
  10453. (O1949 ^name predict-yes +)
  10454. (S1 ^operator O1949 +)
  10455. Firing propose*predict-no
  10456. -->
  10457. (O1950 ^name predict-no +)
  10458. (S1 ^operator O1950 +)
  10459. Firing rl*prefer*rvt*predict-no*H0*6
  10460. -->
  10461. (S1 ^operator O1948 = 0.9999999999999999)
  10462. Firing rl*prefer*rvt*predict-yes*H0*5
  10463. -->
  10464. (S1 ^operator O1947 = 0.)
  10465. Firing prefer*rvt*predict-yes*H0
  10466. -->
  10467. Firing prefer*rvt*predict-no*H0
  10468. -->
  10469. Firing elaborate*copy-dir-to-output-link
  10470. -->
  10471. (I3 ^dir U +)
  10472. inner elaboration loop at bottom goal.
  10473. Retracting elaborate*copy-see-to-output-link
  10474. -->
  10475. (I3 ^see 0 +)
  10476. Retracting propose*predict-no
  10477. -->
  10478. (O1948 ^name predict-no +)
  10479. (S1 ^operator O1948 +)
  10480. Retracting propose*predict-yes
  10481. -->
  10482. (O1947 ^name predict-yes +)
  10483. (S1 ^operator O1947 +)
  10484. Retracting elaborate*reward*based*on*reward
  10485. -->
  10486. (R977 ^value 1 +)
  10487. (R1 ^reward R977 +)
  10488. Retracting elaborate*copy-dir-to-output-link
  10489. -->
  10490. (I3 ^dir U +)
  10491. Retracting rl*prefer*rvt*predict-no*H0*6
  10492. -->
  10493. (S1 ^operator O1948 = 0.9999999999999999)
  10494. Retracting rl*prefer*rvt*predict-yes*H0*5
  10495. -->
  10496. (S1 ^operator O1947 = 0.)
  10497. =>WM: (13697: S1 ^operator O1950 +)
  10498. =>WM: (13696: S1 ^operator O1949 +)
  10499. =>WM: (13695: O1950 ^name predict-no)
  10500. =>WM: (13694: O1949 ^name predict-yes)
  10501. =>WM: (13693: R978 ^value 1)
  10502. =>WM: (13692: R1 ^reward R978)
  10503. <=WM: (13683: S1 ^operator O1947 +)
  10504. <=WM: (13684: S1 ^operator O1948 +)
  10505. <=WM: (13685: S1 ^operator O1948)
  10506. <=WM: (13679: R1 ^reward R977)
  10507. <=WM: (13682: O1948 ^name predict-no)
  10508. <=WM: (13681: O1947 ^name predict-yes)
  10509. <=WM: (13680: R977 ^value 1)
  10510. --- Inner Elaboration Phase, active level 1 (S1) ---
  10511. Firing prefer*rvt*predict-yes*H0
  10512. -->
  10513. Firing rl*prefer*rvt*predict-yes*H0*5
  10514. -->
  10515. (S1 ^operator O1949 = 0.)
  10516. Firing prefer*rvt*predict-no*H0
  10517. -->
  10518. Firing rl*prefer*rvt*predict-no*H0*6
  10519. -->
  10520. (S1 ^operator O1950 = 0.9999999999999999)
  10521. inner elaboration loop at bottom goal.
  10522. Retracting rl*prefer*rvt*predict-no*H0*6
  10523. -->
  10524. (S1 ^operator O1948 = 0.9999999999999999)
  10525. Retracting rl*prefer*rvt*predict-yes*H0*5
  10526. -->
  10527. (S1 ^operator O1947 = 0.)
  10528. --- END Proposal Phase ---
  10529. --- Decision Phase ---
  10530. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10531. =>WM: (13698: S1 ^operator O1950)
  10532. 975: O: O1950 (predict-no)
  10533. --- END Decision Phase ---
  10534. --- Application Phase ---
  10535. --- Firing Productions (PE) For State At Depth 1 ---
  10536. --- Inner Elaboration Phase, active level 1 (S1) ---
  10537. Firing apply*operator
  10538. -->
  10539. (I3 ^predict-no N975 + :O )
  10540. Firing apply*operator*complete
  10541. -->
  10542. (I3 ^predict-no N974 - :O )
  10543. inner elaboration loop at bottom goal.
  10544. --- Change Working Memory (PE) ---
  10545. =>WM: (13699: I3 ^predict-no N975)
  10546. <=WM: (13687: N974 ^status complete)
  10547. <=WM: (13686: I3 ^predict-no N974)
  10548. --- Firing Productions (IE) For State At Depth 1 ---
  10549. --- Inner Elaboration Phase, active level 1 (S1) ---
  10550. Firing monitor*world
  10551. -->
  10552. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10553. --- Change Working Memory (IE) ---
  10554. --- END Application Phase ---
  10555. --- Output Phase ---
  10556. ENV: Agent did: predict-no for direction U in state State-A
  10557. In State-A moving U
  10558. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10559. predict error 0
  10560. dir: dir isL
  10561. --- END Output Phase ---
  10562. -/--- Input Phase ---
  10563. =>WM: (13703: I2 ^dir L)
  10564. =>WM: (13702: I2 ^reward 1)
  10565. =>WM: (13701: I2 ^see 0)
  10566. =>WM: (13700: N975 ^status complete)
  10567. <=WM: (13690: I2 ^dir U)
  10568. <=WM: (13689: I2 ^reward 1)
  10569. <=WM: (13688: I2 ^see 0)
  10570. =>WM: (13704: I2 ^level-1 L0-root)
  10571. <=WM: (13691: I2 ^level-1 L0-root)
  10572. --- END Input Phase ---
  10573. --- Proposal Phase ---
  10574. --- Inner Elaboration Phase, active level 1 (S1) ---
  10575. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  10576. -->
  10577. (S1 ^operator O1949 = 0.3)
  10578. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  10579. -->
  10580. (S1 ^operator O1950 = 0.744986756180395)
  10581. Firing prefer*rvt*predict-no*H0*2*H1
  10582. -->
  10583. Firing prefer*rvt*predict-yes*H0*1*H1
  10584. -->
  10585. Firing elaborate*copy-see-to-output-link
  10586. -->
  10587. (I3 ^see 0 +)
  10588. Firing elaborate*reward*based*on*reward
  10589. -->
  10590. (R979 ^value 1 +)
  10591. (R1 ^reward R979 +)
  10592. Firing propose*predict-yes
  10593. -->
  10594. (O1951 ^name predict-yes +)
  10595. (S1 ^operator O1951 +)
  10596. Firing propose*predict-no
  10597. -->
  10598. (O1952 ^name predict-no +)
  10599. (S1 ^operator O1952 +)
  10600. Firing rl*prefer*rvt*predict-no*H0*2
  10601. -->
  10602. (S1 ^operator O1950 = 0.2550133620818883)
  10603. Firing rl*prefer*rvt*predict-yes*H0*1
  10604. -->
  10605. (S1 ^operator O1949 = 0.5231208307682875)
  10606. Firing prefer*rvt*predict-yes*H0
  10607. -->
  10608. Firing prefer*rvt*predict-no*H0
  10609. -->
  10610. Firing elaborate*copy-dir-to-output-link
  10611. -->
  10612. (I3 ^dir L +)
  10613. inner elaboration loop at bottom goal.
  10614. Retracting elaborate*copy-see-to-output-link
  10615. -->
  10616. (I3 ^see 0 +)
  10617. Retracting propose*predict-no
  10618. -->
  10619. (O1950 ^name predict-no +)
  10620. (S1 ^operator O1950 +)
  10621. Retracting propose*predict-yes
  10622. -->
  10623. (O1949 ^name predict-yes +)
  10624. (S1 ^operator O1949 +)
  10625. Retracting elaborate*reward*based*on*reward
  10626. -->
  10627. (R978 ^value 1 +)
  10628. (R1 ^reward R978 +)
  10629. Retracting elaborate*copy-dir-to-output-link
  10630. -->
  10631. (I3 ^dir U +)
  10632. Retracting rl*prefer*rvt*predict-no*H0*6
  10633. -->
  10634. (S1 ^operator O1950 = 0.9999999999999999)
  10635. Retracting rl*prefer*rvt*predict-yes*H0*5
  10636. -->
  10637. (S1 ^operator O1949 = 0.)
  10638. =>WM: (13711: S1 ^operator O1952 +)
  10639. =>WM: (13710: S1 ^operator O1951 +)
  10640. =>WM: (13709: I3 ^dir L)
  10641. =>WM: (13708: O1952 ^name predict-no)
  10642. =>WM: (13707: O1951 ^name predict-yes)
  10643. =>WM: (13706: R979 ^value 1)
  10644. =>WM: (13705: R1 ^reward R979)
  10645. <=WM: (13696: S1 ^operator O1949 +)
  10646. <=WM: (13697: S1 ^operator O1950 +)
  10647. <=WM: (13698: S1 ^operator O1950)
  10648. <=WM: (13669: I3 ^dir U)
  10649. <=WM: (13692: R1 ^reward R978)
  10650. <=WM: (13695: O1950 ^name predict-no)
  10651. <=WM: (13694: O1949 ^name predict-yes)
  10652. <=WM: (13693: R978 ^value 1)
  10653. --- Inner Elaboration Phase, active level 1 (S1) ---
  10654. Firing prefer*rvt*predict-yes*H0
  10655. -->
  10656. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  10657. -->
  10658. (S1 ^operator O1951 = 0.3)
  10659. Firing rl*prefer*rvt*predict-yes*H0*1
  10660. -->
  10661. (S1 ^operator O1951 = 0.5231208307682875)
  10662. Firing prefer*rvt*predict-yes*H0*1*H1
  10663. -->
  10664. Firing prefer*rvt*predict-no*H0
  10665. -->
  10666. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  10667. -->
  10668. (S1 ^operator O1952 = 0.744986756180395)
  10669. Firing rl*prefer*rvt*predict-no*H0*2
  10670. -->
  10671. (S1 ^operator O1952 = 0.2550133620818883)
  10672. Firing prefer*rvt*predict-no*H0*2*H1
  10673. -->
  10674. inner elaboration loop at bottom goal.
  10675. Retracting rl*prefer*rvt*predict-no*H0*2
  10676. -->
  10677. (S1 ^operator O1950 = 0.2550133620818883)
  10678. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  10679. -->
  10680. (S1 ^operator O1950 = 0.744986756180395)
  10681. Retracting rl*prefer*rvt*predict-yes*H0*1
  10682. -->
  10683. (S1 ^operator O1949 = 0.5231208307682875)
  10684. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  10685. -->
  10686. (S1 ^operator O1949 = 0.3)
  10687. --- END Proposal Phase ---
  10688. --- Decision Phase ---
  10689. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10690. =>WM: (13712: S1 ^operator O1952)
  10691. 976: O: O1952 (predict-no)
  10692. --- END Decision Phase ---
  10693. --- Application Phase ---
  10694. --- Firing Productions (PE) For State At Depth 1 ---
  10695. --- Inner Elaboration Phase, active level 1 (S1) ---
  10696. Firing apply*operator
  10697. -->
  10698. (I3 ^predict-no N976 + :O )
  10699. Firing apply*operator*complete
  10700. -->
  10701. (I3 ^predict-no N975 - :O )
  10702. inner elaboration loop at bottom goal.
  10703. --- Change Working Memory (PE) ---
  10704. =>WM: (13713: I3 ^predict-no N976)
  10705. <=WM: (13700: N975 ^status complete)
  10706. <=WM: (13699: I3 ^predict-no N975)
  10707. --- Firing Productions (IE) For State At Depth 1 ---
  10708. --- Inner Elaboration Phase, active level 1 (S1) ---
  10709. Firing monitor*world
  10710. -->
  10711. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10712. --- Change Working Memory (IE) ---
  10713. --- END Application Phase ---
  10714. --- Output Phase ---
  10715. ENV: Agent did: predict-no for direction L in state State-A
  10716. In State-A moving L
  10717. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10718. predict error 0
  10719. dir: dir isR
  10720. --- END Output Phase ---
  10721. |\---- Input Phase ---
  10722. =>WM: (13717: I2 ^dir R)
  10723. =>WM: (13716: I2 ^reward 1)
  10724. =>WM: (13715: I2 ^see 0)
  10725. =>WM: (13714: N976 ^status complete)
  10726. <=WM: (13703: I2 ^dir L)
  10727. <=WM: (13702: I2 ^reward 1)
  10728. <=WM: (13701: I2 ^see 0)
  10729. =>WM: (13718: I2 ^level-1 L0-root)
  10730. <=WM: (13704: I2 ^level-1 L0-root)
  10731. --- END Input Phase ---
  10732. --- Proposal Phase ---
  10733. --- Inner Elaboration Phase, active level 1 (S1) ---
  10734. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  10735. -->
  10736. (S1 ^operator O1951 = 0.6170789503736752)
  10737. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  10738. -->
  10739. (S1 ^operator O1952 = 0.4910065094545203)
  10740. Firing prefer*rvt*predict-no*H0*4*H1
  10741. -->
  10742. Firing prefer*rvt*predict-yes*H0*3*H1
  10743. -->
  10744. Firing elaborate*copy-see-to-output-link
  10745. -->
  10746. (I3 ^see 0 +)
  10747. Firing elaborate*reward*based*on*reward
  10748. -->
  10749. (R980 ^value 1 +)
  10750. (R1 ^reward R980 +)
  10751. Firing propose*predict-yes
  10752. -->
  10753. (O1953 ^name predict-yes +)
  10754. (S1 ^operator O1953 +)
  10755. Firing propose*predict-no
  10756. -->
  10757. (O1954 ^name predict-no +)
  10758. (S1 ^operator O1954 +)
  10759. Firing rl*prefer*rvt*predict-no*H0*4
  10760. -->
  10761. (S1 ^operator O1952 = 0.1269767780720474)
  10762. Firing rl*prefer*rvt*predict-yes*H0*3
  10763. -->
  10764. (S1 ^operator O1951 = 0.3829317273911885)
  10765. Firing prefer*rvt*predict-yes*H0
  10766. -->
  10767. Firing prefer*rvt*predict-no*H0
  10768. -->
  10769. Firing elaborate*copy-dir-to-output-link
  10770. -->
  10771. (I3 ^dir R +)
  10772. inner elaboration loop at bottom goal.
  10773. Retracting elaborate*copy-see-to-output-link
  10774. -->
  10775. (I3 ^see 0 +)
  10776. Retracting propose*predict-no
  10777. -->
  10778. (O1952 ^name predict-no +)
  10779. (S1 ^operator O1952 +)
  10780. Retracting propose*predict-yes
  10781. -->
  10782. (O1951 ^name predict-yes +)
  10783. (S1 ^operator O1951 +)
  10784. Retracting elaborate*reward*based*on*reward
  10785. -->
  10786. (R979 ^value 1 +)
  10787. (R1 ^reward R979 +)
  10788. Retracting elaborate*copy-dir-to-output-link
  10789. -->
  10790. (I3 ^dir L +)
  10791. Retracting rl*prefer*rvt*predict-no*H0*2
  10792. -->
  10793. (S1 ^operator O1952 = 0.2550133620818883)
  10794. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  10795. -->
  10796. (S1 ^operator O1952 = 0.744986756180395)
  10797. Retracting rl*prefer*rvt*predict-yes*H0*1
  10798. -->
  10799. (S1 ^operator O1951 = 0.5231208307682875)
  10800. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  10801. -->
  10802. (S1 ^operator O1951 = 0.3)
  10803. =>WM: (13725: S1 ^operator O1954 +)
  10804. =>WM: (13724: S1 ^operator O1953 +)
  10805. =>WM: (13723: I3 ^dir R)
  10806. =>WM: (13722: O1954 ^name predict-no)
  10807. =>WM: (13721: O1953 ^name predict-yes)
  10808. =>WM: (13720: R980 ^value 1)
  10809. =>WM: (13719: R1 ^reward R980)
  10810. <=WM: (13710: S1 ^operator O1951 +)
  10811. <=WM: (13711: S1 ^operator O1952 +)
  10812. <=WM: (13712: S1 ^operator O1952)
  10813. <=WM: (13709: I3 ^dir L)
  10814. <=WM: (13705: R1 ^reward R979)
  10815. <=WM: (13708: O1952 ^name predict-no)
  10816. <=WM: (13707: O1951 ^name predict-yes)
  10817. <=WM: (13706: R979 ^value 1)
  10818. --- Inner Elaboration Phase, active level 1 (S1) ---
  10819. Firing prefer*rvt*predict-yes*H0
  10820. -->
  10821. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  10822. -->
  10823. (S1 ^operator O1953 = 0.6170789503736752)
  10824. Firing rl*prefer*rvt*predict-yes*H0*3
  10825. -->
  10826. (S1 ^operator O1953 = 0.3829317273911885)
  10827. Firing prefer*rvt*predict-yes*H0*3*H1
  10828. -->
  10829. Firing prefer*rvt*predict-no*H0
  10830. -->
  10831. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  10832. -->
  10833. (S1 ^operator O1954 = 0.4910065094545203)
  10834. Firing rl*prefer*rvt*predict-no*H0*4
  10835. -->
  10836. (S1 ^operator O1954 = 0.1269767780720474)
  10837. Firing prefer*rvt*predict-no*H0*4*H1
  10838. -->
  10839. inner elaboration loop at bottom goal.
  10840. Retracting rl*prefer*rvt*predict-no*H0*4
  10841. -->
  10842. (S1 ^operator O1952 = 0.1269767780720474)
  10843. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  10844. -->
  10845. (S1 ^operator O1952 = 0.4910065094545203)
  10846. Retracting rl*prefer*rvt*predict-yes*H0*3
  10847. -->
  10848. (S1 ^operator O1951 = 0.3829317273911885)
  10849. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  10850. -->
  10851. (S1 ^operator O1951 = 0.6170789503736752)
  10852. --- END Proposal Phase ---
  10853. --- Decision Phase ---
  10854. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.916667,0.0767888)
  10855. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  10856. =>WM: (13726: S1 ^operator O1953)
  10857. 977: O: O1953 (predict-yes)
  10858. --- END Decision Phase ---
  10859. --- Application Phase ---
  10860. --- Firing Productions (PE) For State At Depth 1 ---
  10861. --- Inner Elaboration Phase, active level 1 (S1) ---
  10862. Firing apply*operator
  10863. -->
  10864. (I3 ^predict-yes N977 + :O )
  10865. Firing apply*operator*complete
  10866. -->
  10867. (I3 ^predict-no N976 - :O )
  10868. inner elaboration loop at bottom goal.
  10869. --- Change Working Memory (PE) ---
  10870. =>WM: (13727: I3 ^predict-yes N977)
  10871. <=WM: (13714: N976 ^status complete)
  10872. <=WM: (13713: I3 ^predict-no N976)
  10873. --- Firing Productions (IE) For State At Depth 1 ---
  10874. --- Inner Elaboration Phase, active level 1 (S1) ---
  10875. Firing monitor*world
  10876. -->
  10877. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10878. --- Change Working Memory (IE) ---
  10879. --- END Application Phase ---
  10880. --- Output Phase ---
  10881. ENV: Agent did: predict-yes for direction R in state State-A
  10882. In State-A moving R
  10883. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10884. predict error 0
  10885. dir: dir isU
  10886. --- END Output Phase ---
  10887. /|\--- Input Phase ---
  10888. =>WM: (13731: I2 ^dir U)
  10889. =>WM: (13730: I2 ^reward 1)
  10890. =>WM: (13729: I2 ^see 1)
  10891. =>WM: (13728: N977 ^status complete)
  10892. <=WM: (13717: I2 ^dir R)
  10893. <=WM: (13716: I2 ^reward 1)
  10894. <=WM: (13715: I2 ^see 0)
  10895. =>WM: (13732: I2 ^level-1 R1-root)
  10896. <=WM: (13718: I2 ^level-1 L0-root)
  10897. --- END Input Phase ---
  10898. --- Proposal Phase ---
  10899. --- Inner Elaboration Phase, active level 1 (S1) ---
  10900. Firing elaborate*copy-see-to-output-link
  10901. -->
  10902. (I3 ^see 1 +)
  10903. Firing elaborate*reward*based*on*reward
  10904. -->
  10905. (R981 ^value 1 +)
  10906. (R1 ^reward R981 +)
  10907. Firing propose*predict-yes
  10908. -->
  10909. (O1955 ^name predict-yes +)
  10910. (S1 ^operator O1955 +)
  10911. Firing propose*predict-no
  10912. -->
  10913. (O1956 ^name predict-no +)
  10914. (S1 ^operator O1956 +)
  10915. Firing rl*prefer*rvt*predict-no*H0*6
  10916. -->
  10917. (S1 ^operator O1954 = 0.9999999999999999)
  10918. Firing rl*prefer*rvt*predict-yes*H0*5
  10919. -->
  10920. (S1 ^operator O1953 = 0.)
  10921. Firing prefer*rvt*predict-yes*H0
  10922. -->
  10923. Firing prefer*rvt*predict-no*H0
  10924. -->
  10925. Firing elaborate*copy-dir-to-output-link
  10926. -->
  10927. (I3 ^dir U +)
  10928. inner elaboration loop at bottom goal.
  10929. Retracting elaborate*copy-see-to-output-link
  10930. -->
  10931. (I3 ^see 0 +)
  10932. Retracting propose*predict-no
  10933. -->
  10934. (O1954 ^name predict-no +)
  10935. (S1 ^operator O1954 +)
  10936. Retracting propose*predict-yes
  10937. -->
  10938. (O1953 ^name predict-yes +)
  10939. (S1 ^operator O1953 +)
  10940. Retracting elaborate*reward*based*on*reward
  10941. -->
  10942. (R980 ^value 1 +)
  10943. (R1 ^reward R980 +)
  10944. Retracting elaborate*copy-dir-to-output-link
  10945. -->
  10946. (I3 ^dir R +)
  10947. Retracting rl*prefer*rvt*predict-no*H0*4
  10948. -->
  10949. (S1 ^operator O1954 = 0.1269767780720474)
  10950. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  10951. -->
  10952. (S1 ^operator O1954 = 0.4910065094545203)
  10953. Retracting rl*prefer*rvt*predict-yes*H0*3
  10954. -->
  10955. (S1 ^operator O1953 = 0.3829317273911885)
  10956. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  10957. -->
  10958. (S1 ^operator O1953 = 0.6170789503736752)
  10959. =>WM: (13740: S1 ^operator O1956 +)
  10960. =>WM: (13739: S1 ^operator O1955 +)
  10961. =>WM: (13738: I3 ^dir U)
  10962. =>WM: (13737: O1956 ^name predict-no)
  10963. =>WM: (13736: O1955 ^name predict-yes)
  10964. =>WM: (13735: R981 ^value 1)
  10965. =>WM: (13734: R1 ^reward R981)
  10966. =>WM: (13733: I3 ^see 1)
  10967. <=WM: (13724: S1 ^operator O1953 +)
  10968. <=WM: (13726: S1 ^operator O1953)
  10969. <=WM: (13725: S1 ^operator O1954 +)
  10970. <=WM: (13723: I3 ^dir R)
  10971. <=WM: (13719: R1 ^reward R980)
  10972. <=WM: (13664: I3 ^see 0)
  10973. <=WM: (13722: O1954 ^name predict-no)
  10974. <=WM: (13721: O1953 ^name predict-yes)
  10975. <=WM: (13720: R980 ^value 1)
  10976. --- Inner Elaboration Phase, active level 1 (S1) ---
  10977. Firing prefer*rvt*predict-yes*H0
  10978. -->
  10979. Firing rl*prefer*rvt*predict-yes*H0*5
  10980. -->
  10981. (S1 ^operator O1955 = 0.)
  10982. Firing prefer*rvt*predict-no*H0
  10983. -->
  10984. Firing rl*prefer*rvt*predict-no*H0*6
  10985. -->
  10986. (S1 ^operator O1956 = 0.9999999999999999)
  10987. inner elaboration loop at bottom goal.
  10988. Retracting rl*prefer*rvt*predict-no*H0*6
  10989. -->
  10990. (S1 ^operator O1954 = 0.9999999999999999)
  10991. Retracting rl*prefer*rvt*predict-yes*H0*5
  10992. -->
  10993. (S1 ^operator O1953 = 0.)
  10994. --- END Proposal Phase ---
  10995. --- Decision Phase ---
  10996. RL update rl*prefer*rvt*predict-yes*H0*3 0.673126 -0.290194 0.382932 -> 0.673124 -0.290194 0.38293(R,m,v=1,0.96,0.0386577)
  10997. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326884 0.290195 0.617079 -> 0.326883 0.290195 0.617077(R,m,v=1,1,0)
  10998. =>WM: (13741: S1 ^operator O1956)
  10999. 978: O: O1956 (predict-no)
  11000. --- END Decision Phase ---
  11001. --- Application Phase ---
  11002. --- Firing Productions (PE) For State At Depth 1 ---
  11003. --- Inner Elaboration Phase, active level 1 (S1) ---
  11004. Firing apply*operator
  11005. -->
  11006. (I3 ^predict-no N978 + :O )
  11007. Firing apply*operator*complete
  11008. -->
  11009. (I3 ^predict-yes N977 - :O )
  11010. inner elaboration loop at bottom goal.
  11011. --- Change Working Memory (PE) ---
  11012. =>WM: (13742: I3 ^predict-no N978)
  11013. <=WM: (13728: N977 ^status complete)
  11014. <=WM: (13727: I3 ^predict-yes N977)
  11015. --- Firing Productions (IE) For State At Depth 1 ---
  11016. --- Inner Elaboration Phase, active level 1 (S1) ---
  11017. Firing monitor*world
  11018. -->
  11019. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11020. --- Change Working Memory (IE) ---
  11021. --- END Application Phase ---
  11022. --- Output Phase ---
  11023. ENV: Agent did: predict-no for direction U in state State-B
  11024. In State-B moving U
  11025. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11026. predict error 0
  11027. dir: dir isR
  11028. --- END Output Phase ---
  11029. -/|--- Input Phase ---
  11030. =>WM: (13746: I2 ^dir R)
  11031. =>WM: (13745: I2 ^reward 1)
  11032. =>WM: (13744: I2 ^see 0)
  11033. =>WM: (13743: N978 ^status complete)
  11034. <=WM: (13731: I2 ^dir U)
  11035. <=WM: (13730: I2 ^reward 1)
  11036. <=WM: (13729: I2 ^see 1)
  11037. =>WM: (13747: I2 ^level-1 R1-root)
  11038. <=WM: (13732: I2 ^level-1 R1-root)
  11039. --- END Input Phase ---
  11040. --- Proposal Phase ---
  11041. --- Inner Elaboration Phase, active level 1 (S1) ---
  11042. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  11043. -->
  11044. (S1 ^operator O1955 = 0.08783148430849691)
  11045. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  11046. -->
  11047. (S1 ^operator O1956 = 0.8730234453553117)
  11048. Firing prefer*rvt*predict-no*H0*4*H1
  11049. -->
  11050. Firing prefer*rvt*predict-yes*H0*3*H1
  11051. -->
  11052. Firing elaborate*copy-see-to-output-link
  11053. -->
  11054. (I3 ^see 0 +)
  11055. Firing elaborate*reward*based*on*reward
  11056. -->
  11057. (R982 ^value 1 +)
  11058. (R1 ^reward R982 +)
  11059. Firing propose*predict-yes
  11060. -->
  11061. (O1957 ^name predict-yes +)
  11062. (S1 ^operator O1957 +)
  11063. Firing propose*predict-no
  11064. -->
  11065. (O1958 ^name predict-no +)
  11066. (S1 ^operator O1958 +)
  11067. Firing rl*prefer*rvt*predict-no*H0*4
  11068. -->
  11069. (S1 ^operator O1956 = 0.1269767780720474)
  11070. Firing rl*prefer*rvt*predict-yes*H0*3
  11071. -->
  11072. (S1 ^operator O1955 = 0.3829301257264589)
  11073. Firing prefer*rvt*predict-yes*H0
  11074. -->
  11075. Firing prefer*rvt*predict-no*H0
  11076. -->
  11077. Firing elaborate*copy-dir-to-output-link
  11078. -->
  11079. (I3 ^dir R +)
  11080. inner elaboration loop at bottom goal.
  11081. Retracting elaborate*copy-see-to-output-link
  11082. -->
  11083. (I3 ^see 1 +)
  11084. Retracting propose*predict-no
  11085. -->
  11086. (O1956 ^name predict-no +)
  11087. (S1 ^operator O1956 +)
  11088. Retracting propose*predict-yes
  11089. -->
  11090. (O1955 ^name predict-yes +)
  11091. (S1 ^operator O1955 +)
  11092. Retracting elaborate*reward*based*on*reward
  11093. -->
  11094. (R981 ^value 1 +)
  11095. (R1 ^reward R981 +)
  11096. Retracting elaborate*copy-dir-to-output-link
  11097. -->
  11098. (I3 ^dir U +)
  11099. Retracting rl*prefer*rvt*predict-no*H0*6
  11100. -->
  11101. (S1 ^operator O1956 = 0.9999999999999999)
  11102. Retracting rl*prefer*rvt*predict-yes*H0*5
  11103. -->
  11104. (S1 ^operator O1955 = 0.)
  11105. =>WM: (13755: S1 ^operator O1958 +)
  11106. =>WM: (13754: S1 ^operator O1957 +)
  11107. =>WM: (13753: I3 ^dir R)
  11108. =>WM: (13752: O1958 ^name predict-no)
  11109. =>WM: (13751: O1957 ^name predict-yes)
  11110. =>WM: (13750: R982 ^value 1)
  11111. =>WM: (13749: R1 ^reward R982)
  11112. =>WM: (13748: I3 ^see 0)
  11113. <=WM: (13739: S1 ^operator O1955 +)
  11114. <=WM: (13740: S1 ^operator O1956 +)
  11115. <=WM: (13741: S1 ^operator O1956)
  11116. <=WM: (13738: I3 ^dir U)
  11117. <=WM: (13734: R1 ^reward R981)
  11118. <=WM: (13733: I3 ^see 1)
  11119. <=WM: (13737: O1956 ^name predict-no)
  11120. <=WM: (13736: O1955 ^name predict-yes)
  11121. <=WM: (13735: R981 ^value 1)
  11122. --- Inner Elaboration Phase, active level 1 (S1) ---
  11123. Firing prefer*rvt*predict-yes*H0
  11124. -->
  11125. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  11126. -->
  11127. (S1 ^operator O1957 = 0.08783148430849691)
  11128. Firing rl*prefer*rvt*predict-yes*H0*3
  11129. -->
  11130. (S1 ^operator O1957 = 0.3829301257264589)
  11131. Firing prefer*rvt*predict-yes*H0*3*H1
  11132. -->
  11133. Firing prefer*rvt*predict-no*H0
  11134. -->
  11135. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  11136. -->
  11137. (S1 ^operator O1958 = 0.8730234453553117)
  11138. Firing rl*prefer*rvt*predict-no*H0*4
  11139. -->
  11140. (S1 ^operator O1958 = 0.1269767780720474)
  11141. Firing prefer*rvt*predict-no*H0*4*H1
  11142. -->
  11143. inner elaboration loop at bottom goal.
  11144. Retracting rl*prefer*rvt*predict-no*H0*4
  11145. -->
  11146. (S1 ^operator O1956 = 0.1269767780720474)
  11147. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  11148. -->
  11149. (S1 ^operator O1956 = 0.8730234453553117)
  11150. Retracting rl*prefer*rvt*predict-yes*H0*3
  11151. -->
  11152. (S1 ^operator O1955 = 0.3829301257264589)
  11153. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  11154. -->
  11155. (S1 ^operator O1955 = 0.08783148430849691)
  11156. --- END Proposal Phase ---
  11157. --- Decision Phase ---
  11158. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11159. =>WM: (13756: S1 ^operator O1958)
  11160. 979: O: O1958 (predict-no)
  11161. --- END Decision Phase ---
  11162. --- Application Phase ---
  11163. --- Firing Productions (PE) For State At Depth 1 ---
  11164. --- Inner Elaboration Phase, active level 1 (S1) ---
  11165. Firing apply*operator
  11166. -->
  11167. (I3 ^predict-no N979 + :O )
  11168. Firing apply*operator*complete
  11169. -->
  11170. (I3 ^predict-no N978 - :O )
  11171. inner elaboration loop at bottom goal.
  11172. --- Change Working Memory (PE) ---
  11173. =>WM: (13757: I3 ^predict-no N979)
  11174. <=WM: (13743: N978 ^status complete)
  11175. <=WM: (13742: I3 ^predict-no N978)
  11176. --- Firing Productions (IE) For State At Depth 1 ---
  11177. --- Inner Elaboration Phase, active level 1 (S1) ---
  11178. Firing monitor*world
  11179. -->
  11180. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11181. --- Change Working Memory (IE) ---
  11182. --- END Application Phase ---
  11183. --- Output Phase ---
  11184. ENV: Agent did: predict-no for direction R in state State-B
  11185. In State-B moving R
  11186. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11187. predict error 0
  11188. dir: dir isU
  11189. --- END Output Phase ---
  11190. \-/--- Input Phase ---
  11191. =>WM: (13761: I2 ^dir U)
  11192. =>WM: (13760: I2 ^reward 1)
  11193. =>WM: (13759: I2 ^see 0)
  11194. =>WM: (13758: N979 ^status complete)
  11195. <=WM: (13746: I2 ^dir R)
  11196. <=WM: (13745: I2 ^reward 1)
  11197. <=WM: (13744: I2 ^see 0)
  11198. =>WM: (13762: I2 ^level-1 R0-root)
  11199. <=WM: (13747: I2 ^level-1 R1-root)
  11200. --- END Input Phase ---
  11201. --- Proposal Phase ---
  11202. --- Inner Elaboration Phase, active level 1 (S1) ---
  11203. Firing elaborate*copy-see-to-output-link
  11204. -->
  11205. (I3 ^see 0 +)
  11206. Firing elaborate*reward*based*on*reward
  11207. -->
  11208. (R983 ^value 1 +)
  11209. (R1 ^reward R983 +)
  11210. Firing propose*predict-yes
  11211. -->
  11212. (O1959 ^name predict-yes +)
  11213. (S1 ^operator O1959 +)
  11214. Firing propose*predict-no
  11215. -->
  11216. (O1960 ^name predict-no +)
  11217. (S1 ^operator O1960 +)
  11218. Firing rl*prefer*rvt*predict-no*H0*6
  11219. -->
  11220. (S1 ^operator O1958 = 0.9999999999999999)
  11221. Firing rl*prefer*rvt*predict-yes*H0*5
  11222. -->
  11223. (S1 ^operator O1957 = 0.)
  11224. Firing prefer*rvt*predict-yes*H0
  11225. -->
  11226. Firing prefer*rvt*predict-no*H0
  11227. -->
  11228. Firing elaborate*copy-dir-to-output-link
  11229. -->
  11230. (I3 ^dir U +)
  11231. inner elaboration loop at bottom goal.
  11232. Retracting elaborate*copy-see-to-output-link
  11233. -->
  11234. (I3 ^see 0 +)
  11235. Retracting propose*predict-no
  11236. -->
  11237. (O1958 ^name predict-no +)
  11238. (S1 ^operator O1958 +)
  11239. Retracting propose*predict-yes
  11240. -->
  11241. (O1957 ^name predict-yes +)
  11242. (S1 ^operator O1957 +)
  11243. Retracting elaborate*reward*based*on*reward
  11244. -->
  11245. (R982 ^value 1 +)
  11246. (R1 ^reward R982 +)
  11247. Retracting elaborate*copy-dir-to-output-link
  11248. -->
  11249. (I3 ^dir R +)
  11250. Retracting rl*prefer*rvt*predict-no*H0*4
  11251. -->
  11252. (S1 ^operator O1958 = 0.1269767780720474)
  11253. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  11254. -->
  11255. (S1 ^operator O1958 = 0.8730234453553117)
  11256. Retracting rl*prefer*rvt*predict-yes*H0*3
  11257. -->
  11258. (S1 ^operator O1957 = 0.3829301257264589)
  11259. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  11260. -->
  11261. (S1 ^operator O1957 = 0.08783148430849691)
  11262. =>WM: (13769: S1 ^operator O1960 +)
  11263. =>WM: (13768: S1 ^operator O1959 +)
  11264. =>WM: (13767: I3 ^dir U)
  11265. =>WM: (13766: O1960 ^name predict-no)
  11266. =>WM: (13765: O1959 ^name predict-yes)
  11267. =>WM: (13764: R983 ^value 1)
  11268. =>WM: (13763: R1 ^reward R983)
  11269. <=WM: (13754: S1 ^operator O1957 +)
  11270. <=WM: (13755: S1 ^operator O1958 +)
  11271. <=WM: (13756: S1 ^operator O1958)
  11272. <=WM: (13753: I3 ^dir R)
  11273. <=WM: (13749: R1 ^reward R982)
  11274. <=WM: (13752: O1958 ^name predict-no)
  11275. <=WM: (13751: O1957 ^name predict-yes)
  11276. <=WM: (13750: R982 ^value 1)
  11277. --- Inner Elaboration Phase, active level 1 (S1) ---
  11278. Firing prefer*rvt*predict-yes*H0
  11279. -->
  11280. Firing rl*prefer*rvt*predict-yes*H0*5
  11281. -->
  11282. (S1 ^operator O1959 = 0.)
  11283. Firing prefer*rvt*predict-no*H0
  11284. -->
  11285. Firing rl*prefer*rvt*predict-no*H0*6
  11286. -->
  11287. (S1 ^operator O1960 = 0.9999999999999999)
  11288. inner elaboration loop at bottom goal.
  11289. Retracting rl*prefer*rvt*predict-no*H0*6
  11290. -->
  11291. (S1 ^operator O1958 = 0.9999999999999999)
  11292. Retracting rl*prefer*rvt*predict-yes*H0*5
  11293. -->
  11294. (S1 ^operator O1957 = 0.)
  11295. --- END Proposal Phase ---
  11296. --- Decision Phase ---
  11297. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.947977,0.0496034)
  11298. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  11299. =>WM: (13770: S1 ^operator O1960)
  11300. 980: O: O1960 (predict-no)
  11301. --- END Decision Phase ---
  11302. --- Application Phase ---
  11303. --- Firing Productions (PE) For State At Depth 1 ---
  11304. --- Inner Elaboration Phase, active level 1 (S1) ---
  11305. Firing apply*operator
  11306. -->
  11307. (I3 ^predict-no N980 + :O )
  11308. Firing apply*operator*complete
  11309. -->
  11310. (I3 ^predict-no N979 - :O )
  11311. inner elaboration loop at bottom goal.
  11312. --- Change Working Memory (PE) ---
  11313. =>WM: (13771: I3 ^predict-no N980)
  11314. <=WM: (13758: N979 ^status complete)
  11315. <=WM: (13757: I3 ^predict-no N979)
  11316. --- Firing Productions (IE) For State At Depth 1 ---
  11317. --- Inner Elaboration Phase, active level 1 (S1) ---
  11318. Firing monitor*world
  11319. -->
  11320. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11321. --- Change Working Memory (IE) ---
  11322. --- END Application Phase ---
  11323. --- Output Phase ---
  11324. ENV: Agent did: predict-no for direction U in state State-B
  11325. In State-B moving U
  11326. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11327. predict error 0
  11328. dir: dir isL
  11329. --- END Output Phase ---
  11330. |\--- Input Phase ---
  11331. =>WM: (13775: I2 ^dir L)
  11332. =>WM: (13774: I2 ^reward 1)
  11333. =>WM: (13773: I2 ^see 0)
  11334. =>WM: (13772: N980 ^status complete)
  11335. <=WM: (13761: I2 ^dir U)
  11336. <=WM: (13760: I2 ^reward 1)
  11337. <=WM: (13759: I2 ^see 0)
  11338. =>WM: (13776: I2 ^level-1 R0-root)
  11339. <=WM: (13762: I2 ^level-1 R0-root)
  11340. --- END Input Phase ---
  11341. --- Proposal Phase ---
  11342. --- Inner Elaboration Phase, active level 1 (S1) ---
  11343. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  11344. -->
  11345. (S1 ^operator O1959 = 0.4768840530102607)
  11346. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  11347. -->
  11348. (S1 ^operator O1960 = 0.1700769046561409)
  11349. Firing prefer*rvt*predict-no*H0*2*H1
  11350. -->
  11351. Firing prefer*rvt*predict-yes*H0*1*H1
  11352. -->
  11353. Firing elaborate*copy-see-to-output-link
  11354. -->
  11355. (I3 ^see 0 +)
  11356. Firing elaborate*reward*based*on*reward
  11357. -->
  11358. (R984 ^value 1 +)
  11359. (R1 ^reward R984 +)
  11360. Firing propose*predict-yes
  11361. -->
  11362. (O1961 ^name predict-yes +)
  11363. (S1 ^operator O1961 +)
  11364. Firing propose*predict-no
  11365. -->
  11366. (O1962 ^name predict-no +)
  11367. (S1 ^operator O1962 +)
  11368. Firing rl*prefer*rvt*predict-no*H0*2
  11369. -->
  11370. (S1 ^operator O1960 = 0.2550133443425458)
  11371. Firing rl*prefer*rvt*predict-yes*H0*1
  11372. -->
  11373. (S1 ^operator O1959 = 0.5231208307682875)
  11374. Firing prefer*rvt*predict-yes*H0
  11375. -->
  11376. Firing prefer*rvt*predict-no*H0
  11377. -->
  11378. Firing elaborate*copy-dir-to-output-link
  11379. -->
  11380. (I3 ^dir L +)
  11381. inner elaboration loop at bottom goal.
  11382. Retracting elaborate*copy-see-to-output-link
  11383. -->
  11384. (I3 ^see 0 +)
  11385. Retracting propose*predict-no
  11386. -->
  11387. (O1960 ^name predict-no +)
  11388. (S1 ^operator O1960 +)
  11389. Retracting propose*predict-yes
  11390. -->
  11391. (O1959 ^name predict-yes +)
  11392. (S1 ^operator O1959 +)
  11393. Retracting elaborate*reward*based*on*reward
  11394. -->
  11395. (R983 ^value 1 +)
  11396. (R1 ^reward R983 +)
  11397. Retracting elaborate*copy-dir-to-output-link
  11398. -->
  11399. (I3 ^dir U +)
  11400. Retracting rl*prefer*rvt*predict-no*H0*6
  11401. -->
  11402. (S1 ^operator O1960 = 0.9999999999999999)
  11403. Retracting rl*prefer*rvt*predict-yes*H0*5
  11404. -->
  11405. (S1 ^operator O1959 = 0.)
  11406. =>WM: (13783: S1 ^operator O1962 +)
  11407. =>WM: (13782: S1 ^operator O1961 +)
  11408. =>WM: (13781: I3 ^dir L)
  11409. =>WM: (13780: O1962 ^name predict-no)
  11410. =>WM: (13779: O1961 ^name predict-yes)
  11411. =>WM: (13778: R984 ^value 1)
  11412. =>WM: (13777: R1 ^reward R984)
  11413. <=WM: (13768: S1 ^operator O1959 +)
  11414. <=WM: (13769: S1 ^operator O1960 +)
  11415. <=WM: (13770: S1 ^operator O1960)
  11416. <=WM: (13767: I3 ^dir U)
  11417. <=WM: (13763: R1 ^reward R983)
  11418. <=WM: (13766: O1960 ^name predict-no)
  11419. <=WM: (13765: O1959 ^name predict-yes)
  11420. <=WM: (13764: R983 ^value 1)
  11421. --- Inner Elaboration Phase, active level 1 (S1) ---
  11422. Firing prefer*rvt*predict-yes*H0
  11423. -->
  11424. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  11425. -->
  11426. (S1 ^operator O1961 = 0.4768840530102607)
  11427. Firing rl*prefer*rvt*predict-yes*H0*1
  11428. -->
  11429. (S1 ^operator O1961 = 0.5231208307682875)
  11430. Firing prefer*rvt*predict-yes*H0*1*H1
  11431. -->
  11432. Firing prefer*rvt*predict-no*H0
  11433. -->
  11434. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  11435. -->
  11436. (S1 ^operator O1962 = 0.1700769046561409)
  11437. Firing rl*prefer*rvt*predict-no*H0*2
  11438. -->
  11439. (S1 ^operator O1962 = 0.2550133443425458)
  11440. Firing prefer*rvt*predict-no*H0*2*H1
  11441. -->
  11442. inner elaboration loop at bottom goal.
  11443. Retracting rl*prefer*rvt*predict-no*H0*2
  11444. -->
  11445. (S1 ^operator O1960 = 0.2550133443425458)
  11446. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  11447. -->
  11448. (S1 ^operator O1960 = 0.1700769046561409)
  11449. Retracting rl*prefer*rvt*predict-yes*H0*1
  11450. -->
  11451. (S1 ^operator O1959 = 0.5231208307682875)
  11452. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  11453. -->
  11454. (S1 ^operator O1959 = 0.4768840530102607)
  11455. --- END Proposal Phase ---
  11456. --- Decision Phase ---
  11457. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11458. =>WM: (13784: S1 ^operator O1961)
  11459. 981: O: O1961 (predict-yes)
  11460. --- END Decision Phase ---
  11461. --- Application Phase ---
  11462. --- Firing Productions (PE) For State At Depth 1 ---
  11463. --- Inner Elaboration Phase, active level 1 (S1) ---
  11464. Firing apply*operator
  11465. -->
  11466. (I3 ^predict-yes N981 + :O )
  11467. Firing apply*operator*complete
  11468. -->
  11469. (I3 ^predict-no N980 - :O )
  11470. inner elaboration loop at bottom goal.
  11471. --- Change Working Memory (PE) ---
  11472. =>WM: (13785: I3 ^predict-yes N981)
  11473. <=WM: (13772: N980 ^status complete)
  11474. <=WM: (13771: I3 ^predict-no N980)
  11475. --- Firing Productions (IE) For State At Depth 1 ---
  11476. --- Inner Elaboration Phase, active level 1 (S1) ---
  11477. Firing monitor*world
  11478. -->
  11479. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11480. --- Change Working Memory (IE) ---
  11481. --- END Application Phase ---
  11482. --- Output Phase ---
  11483. ENV: Agent did: predict-yes for direction L in state State-B
  11484. In State-B moving L
  11485. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11486. predict error 0
  11487. dir: dir isL
  11488. --- END Output Phase ---
  11489. ---- Input Phase ---
  11490. =>WM: (13789: I2 ^dir L)
  11491. =>WM: (13788: I2 ^reward 1)
  11492. =>WM: (13787: I2 ^see 1)
  11493. =>WM: (13786: N981 ^status complete)
  11494. <=WM: (13775: I2 ^dir L)
  11495. <=WM: (13774: I2 ^reward 1)
  11496. <=WM: (13773: I2 ^see 0)
  11497. =>WM: (13790: I2 ^level-1 L1-root)
  11498. <=WM: (13776: I2 ^level-1 R0-root)
  11499. --- END Input Phase ---
  11500. --- Proposal Phase ---
  11501. --- Inner Elaboration Phase, active level 1 (S1) ---
  11502. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  11503. -->
  11504. (S1 ^operator O1961 = 0.1693592933936033)
  11505. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  11506. -->
  11507. (S1 ^operator O1962 = 0.7449863992127084)
  11508. Firing prefer*rvt*predict-no*H0*2*H1
  11509. -->
  11510. Firing prefer*rvt*predict-yes*H0*1*H1
  11511. -->
  11512. Firing elaborate*copy-see-to-output-link
  11513. -->
  11514. (I3 ^see 1 +)
  11515. Firing elaborate*reward*based*on*reward
  11516. -->
  11517. (R985 ^value 1 +)
  11518. (R1 ^reward R985 +)
  11519. Firing propose*predict-yes
  11520. -->
  11521. (O1963 ^name predict-yes +)
  11522. (S1 ^operator O1963 +)
  11523. Firing propose*predict-no
  11524. -->
  11525. (O1964 ^name predict-no +)
  11526. (S1 ^operator O1964 +)
  11527. Firing rl*prefer*rvt*predict-no*H0*2
  11528. -->
  11529. (S1 ^operator O1962 = 0.2550133443425458)
  11530. Firing rl*prefer*rvt*predict-yes*H0*1
  11531. -->
  11532. (S1 ^operator O1961 = 0.5231208307682875)
  11533. Firing prefer*rvt*predict-yes*H0
  11534. -->
  11535. Firing prefer*rvt*predict-no*H0
  11536. -->
  11537. Firing elaborate*copy-dir-to-output-link
  11538. -->
  11539. (I3 ^dir L +)
  11540. inner elaboration loop at bottom goal.
  11541. Retracting elaborate*copy-see-to-output-link
  11542. -->
  11543. (I3 ^see 0 +)
  11544. Retracting propose*predict-no
  11545. -->
  11546. (O1962 ^name predict-no +)
  11547. (S1 ^operator O1962 +)
  11548. Retracting propose*predict-yes
  11549. -->
  11550. (O1961 ^name predict-yes +)
  11551. (S1 ^operator O1961 +)
  11552. Retracting elaborate*reward*based*on*reward
  11553. -->
  11554. (R984 ^value 1 +)
  11555. (R1 ^reward R984 +)
  11556. Retracting elaborate*copy-dir-to-output-link
  11557. -->
  11558. (I3 ^dir L +)
  11559. Retracting rl*prefer*rvt*predict-no*H0*2
  11560. -->
  11561. (S1 ^operator O1962 = 0.2550133443425458)
  11562. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  11563. -->
  11564. (S1 ^operator O1962 = 0.1700769046561409)
  11565. Retracting rl*prefer*rvt*predict-yes*H0*1
  11566. -->
  11567. (S1 ^operator O1961 = 0.5231208307682875)
  11568. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  11569. -->
  11570. (S1 ^operator O1961 = 0.4768840530102607)
  11571. =>WM: (13797: S1 ^operator O1964 +)
  11572. =>WM: (13796: S1 ^operator O1963 +)
  11573. =>WM: (13795: O1964 ^name predict-no)
  11574. =>WM: (13794: O1963 ^name predict-yes)
  11575. =>WM: (13793: R985 ^value 1)
  11576. =>WM: (13792: R1 ^reward R985)
  11577. =>WM: (13791: I3 ^see 1)
  11578. <=WM: (13782: S1 ^operator O1961 +)
  11579. <=WM: (13784: S1 ^operator O1961)
  11580. <=WM: (13783: S1 ^operator O1962 +)
  11581. <=WM: (13777: R1 ^reward R984)
  11582. <=WM: (13748: I3 ^see 0)
  11583. <=WM: (13780: O1962 ^name predict-no)
  11584. <=WM: (13779: O1961 ^name predict-yes)
  11585. <=WM: (13778: R984 ^value 1)
  11586. --- Inner Elaboration Phase, active level 1 (S1) ---
  11587. Firing prefer*rvt*predict-yes*H0
  11588. -->
  11589. Firing rl*prefer*rvt*predict-yes*H0*1
  11590. -->
  11591. (S1 ^operator O1963 = 0.5231208307682875)
  11592. Firing prefer*rvt*predict-yes*H0*1*H1
  11593. -->
  11594. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  11595. -->
  11596. (S1 ^operator O1963 = 0.1693592933936033)
  11597. Firing prefer*rvt*predict-no*H0
  11598. -->
  11599. Firing rl*prefer*rvt*predict-no*H0*2
  11600. -->
  11601. (S1 ^operator O1964 = 0.2550133443425458)
  11602. Firing prefer*rvt*predict-no*H0*2*H1
  11603. -->
  11604. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  11605. -->
  11606. (S1 ^operator O1964 = 0.7449863992127084)
  11607. inner elaboration loop at bottom goal.
  11608. Retracting rl*prefer*rvt*predict-no*H0*2
  11609. -->
  11610. (S1 ^operator O1962 = 0.2550133443425458)
  11611. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  11612. -->
  11613. (S1 ^operator O1962 = 0.7449863992127084)
  11614. Retracting rl*prefer*rvt*predict-yes*H0*1
  11615. -->
  11616. (S1 ^operator O1961 = 0.5231208307682875)
  11617. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  11618. -->
  11619. (S1 ^operator O1961 = 0.1693592933936033)
  11620. --- END Proposal Phase ---
  11621. --- Decision Phase ---
  11622. RL update rl*prefer*rvt*predict-yes*H0*1 0.727961 -0.20484 0.523121 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.978571,0.0211202)
  11623. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272045 0.204839 0.476884 -> 0.272045 0.204839 0.476883(R,m,v=1,1,0)
  11624. =>WM: (13798: S1 ^operator O1964)
  11625. 982: O: O1964 (predict-no)
  11626. --- END Decision Phase ---
  11627. --- Application Phase ---
  11628. --- Firing Productions (PE) For State At Depth 1 ---
  11629. --- Inner Elaboration Phase, active level 1 (S1) ---
  11630. Firing apply*operator
  11631. -->
  11632. (I3 ^predict-no N982 + :O )
  11633. Firing apply*operator*complete
  11634. -->
  11635. (I3 ^predict-yes N981 - :O )
  11636. inner elaboration loop at bottom goal.
  11637. --- Change Working Memory (PE) ---
  11638. =>WM: (13799: I3 ^predict-no N982)
  11639. <=WM: (13786: N981 ^status complete)
  11640. <=WM: (13785: I3 ^predict-yes N981)
  11641. --- Firing Productions (IE) For State At Depth 1 ---
  11642. --- Inner Elaboration Phase, active level 1 (S1) ---
  11643. Firing monitor*world
  11644. -->
  11645. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11646. --- Change Working Memory (IE) ---
  11647. --- END Application Phase ---
  11648. --- Output Phase ---
  11649. ENV: Agent did: predict-no for direction L in state State-A
  11650. In State-A moving L
  11651. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11652. predict error 0
  11653. dir: dir isR
  11654. --- END Output Phase ---
  11655. /|\--- Input Phase ---
  11656. =>WM: (13803: I2 ^dir R)
  11657. =>WM: (13802: I2 ^reward 1)
  11658. =>WM: (13801: I2 ^see 0)
  11659. =>WM: (13800: N982 ^status complete)
  11660. <=WM: (13789: I2 ^dir L)
  11661. <=WM: (13788: I2 ^reward 1)
  11662. <=WM: (13787: I2 ^see 1)
  11663. =>WM: (13804: I2 ^level-1 L0-root)
  11664. <=WM: (13790: I2 ^level-1 L1-root)
  11665. --- END Input Phase ---
  11666. --- Proposal Phase ---
  11667. --- Inner Elaboration Phase, active level 1 (S1) ---
  11668. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  11669. -->
  11670. (S1 ^operator O1963 = 0.6170773487089456)
  11671. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  11672. -->
  11673. (S1 ^operator O1964 = 0.4910065094545203)
  11674. Firing prefer*rvt*predict-no*H0*4*H1
  11675. -->
  11676. Firing prefer*rvt*predict-yes*H0*3*H1
  11677. -->
  11678. Firing elaborate*copy-see-to-output-link
  11679. -->
  11680. (I3 ^see 0 +)
  11681. Firing elaborate*reward*based*on*reward
  11682. -->
  11683. (R986 ^value 1 +)
  11684. (R1 ^reward R986 +)
  11685. Firing propose*predict-yes
  11686. -->
  11687. (O1965 ^name predict-yes +)
  11688. (S1 ^operator O1965 +)
  11689. Firing propose*predict-no
  11690. -->
  11691. (O1966 ^name predict-no +)
  11692. (S1 ^operator O1966 +)
  11693. Firing rl*prefer*rvt*predict-no*H0*4
  11694. -->
  11695. (S1 ^operator O1964 = 0.1269767445579436)
  11696. Firing rl*prefer*rvt*predict-yes*H0*3
  11697. -->
  11698. (S1 ^operator O1963 = 0.3829301257264589)
  11699. Firing prefer*rvt*predict-yes*H0
  11700. -->
  11701. Firing prefer*rvt*predict-no*H0
  11702. -->
  11703. Firing elaborate*copy-dir-to-output-link
  11704. -->
  11705. (I3 ^dir R +)
  11706. inner elaboration loop at bottom goal.
  11707. Retracting elaborate*copy-see-to-output-link
  11708. -->
  11709. (I3 ^see 1 +)
  11710. Retracting propose*predict-no
  11711. -->
  11712. (O1964 ^name predict-no +)
  11713. (S1 ^operator O1964 +)
  11714. Retracting propose*predict-yes
  11715. -->
  11716. (O1963 ^name predict-yes +)
  11717. (S1 ^operator O1963 +)
  11718. Retracting elaborate*reward*based*on*reward
  11719. -->
  11720. (R985 ^value 1 +)
  11721. (R1 ^reward R985 +)
  11722. Retracting elaborate*copy-dir-to-output-link
  11723. -->
  11724. (I3 ^dir L +)
  11725. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  11726. -->
  11727. (S1 ^operator O1964 = 0.7449863992127084)
  11728. Retracting rl*prefer*rvt*predict-no*H0*2
  11729. -->
  11730. (S1 ^operator O1964 = 0.2550133443425458)
  11731. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  11732. -->
  11733. (S1 ^operator O1963 = 0.1693592933936033)
  11734. Retracting rl*prefer*rvt*predict-yes*H0*1
  11735. -->
  11736. (S1 ^operator O1963 = 0.5231200982015054)
  11737. =>WM: (13812: S1 ^operator O1966 +)
  11738. =>WM: (13811: S1 ^operator O1965 +)
  11739. =>WM: (13810: I3 ^dir R)
  11740. =>WM: (13809: O1966 ^name predict-no)
  11741. =>WM: (13808: O1965 ^name predict-yes)
  11742. =>WM: (13807: R986 ^value 1)
  11743. =>WM: (13806: R1 ^reward R986)
  11744. =>WM: (13805: I3 ^see 0)
  11745. <=WM: (13796: S1 ^operator O1963 +)
  11746. <=WM: (13797: S1 ^operator O1964 +)
  11747. <=WM: (13798: S1 ^operator O1964)
  11748. <=WM: (13781: I3 ^dir L)
  11749. <=WM: (13792: R1 ^reward R985)
  11750. <=WM: (13791: I3 ^see 1)
  11751. <=WM: (13795: O1964 ^name predict-no)
  11752. <=WM: (13794: O1963 ^name predict-yes)
  11753. <=WM: (13793: R985 ^value 1)
  11754. --- Inner Elaboration Phase, active level 1 (S1) ---
  11755. Firing prefer*rvt*predict-yes*H0
  11756. -->
  11757. Firing rl*prefer*rvt*predict-yes*H0*3
  11758. -->
  11759. (S1 ^operator O1965 = 0.3829301257264589)
  11760. Firing prefer*rvt*predict-yes*H0*3*H1
  11761. -->
  11762. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  11763. -->
  11764. (S1 ^operator O1965 = 0.6170773487089456)
  11765. Firing prefer*rvt*predict-no*H0
  11766. -->
  11767. Firing rl*prefer*rvt*predict-no*H0*4
  11768. -->
  11769. (S1 ^operator O1966 = 0.1269767445579436)
  11770. Firing prefer*rvt*predict-no*H0*4*H1
  11771. -->
  11772. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  11773. -->
  11774. (S1 ^operator O1966 = 0.4910065094545203)
  11775. inner elaboration loop at bottom goal.
  11776. Retracting rl*prefer*rvt*predict-no*H0*4
  11777. -->
  11778. (S1 ^operator O1964 = 0.1269767445579436)
  11779. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  11780. -->
  11781. (S1 ^operator O1964 = 0.4910065094545203)
  11782. Retracting rl*prefer*rvt*predict-yes*H0*3
  11783. -->
  11784. (S1 ^operator O1963 = 0.3829301257264589)
  11785. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  11786. -->
  11787. (S1 ^operator O1963 = 0.6170773487089456)
  11788. --- END Proposal Phase ---
  11789. --- Decision Phase ---
  11790. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.917098,0.0764249)
  11791. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  11792. =>WM: (13813: S1 ^operator O1965)
  11793. 983: O: O1965 (predict-yes)
  11794. --- END Decision Phase ---
  11795. --- Application Phase ---
  11796. --- Firing Productions (PE) For State At Depth 1 ---
  11797. --- Inner Elaboration Phase, active level 1 (S1) ---
  11798. Firing apply*operator
  11799. -->
  11800. (I3 ^predict-yes N983 + :O )
  11801. Firing apply*operator*complete
  11802. -->
  11803. (I3 ^predict-no N982 - :O )
  11804. inner elaboration loop at bottom goal.
  11805. --- Change Working Memory (PE) ---
  11806. =>WM: (13814: I3 ^predict-yes N983)
  11807. <=WM: (13800: N982 ^status complete)
  11808. <=WM: (13799: I3 ^predict-no N982)
  11809. --- Firing Productions (IE) For State At Depth 1 ---
  11810. --- Inner Elaboration Phase, active level 1 (S1) ---
  11811. Firing monitor*world
  11812. -->
  11813. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11814. --- Change Working Memory (IE) ---
  11815. --- END Application Phase ---
  11816. --- Output Phase ---
  11817. ENV: Agent did: predict-yes for direction R in state State-A
  11818. In State-A moving R
  11819. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11820. predict error 0
  11821. dir: dir isU
  11822. --- END Output Phase ---
  11823. -/|--- Input Phase ---
  11824. =>WM: (13818: I2 ^dir U)
  11825. =>WM: (13817: I2 ^reward 1)
  11826. =>WM: (13816: I2 ^see 1)
  11827. =>WM: (13815: N983 ^status complete)
  11828. <=WM: (13803: I2 ^dir R)
  11829. <=WM: (13802: I2 ^reward 1)
  11830. <=WM: (13801: I2 ^see 0)
  11831. =>WM: (13819: I2 ^level-1 R1-root)
  11832. <=WM: (13804: I2 ^level-1 L0-root)
  11833. --- END Input Phase ---
  11834. --- Proposal Phase ---
  11835. --- Inner Elaboration Phase, active level 1 (S1) ---
  11836. Firing elaborate*copy-see-to-output-link
  11837. -->
  11838. (I3 ^see 1 +)
  11839. Firing elaborate*reward*based*on*reward
  11840. -->
  11841. (R987 ^value 1 +)
  11842. (R1 ^reward R987 +)
  11843. Firing propose*predict-yes
  11844. -->
  11845. (O1967 ^name predict-yes +)
  11846. (S1 ^operator O1967 +)
  11847. Firing propose*predict-no
  11848. -->
  11849. (O1968 ^name predict-no +)
  11850. (S1 ^operator O1968 +)
  11851. Firing rl*prefer*rvt*predict-no*H0*6
  11852. -->
  11853. (S1 ^operator O1966 = 0.9999999999999999)
  11854. Firing rl*prefer*rvt*predict-yes*H0*5
  11855. -->
  11856. (S1 ^operator O1965 = 0.)
  11857. Firing prefer*rvt*predict-yes*H0
  11858. -->
  11859. Firing prefer*rvt*predict-no*H0
  11860. -->
  11861. Firing elaborate*copy-dir-to-output-link
  11862. -->
  11863. (I3 ^dir U +)
  11864. inner elaboration loop at bottom goal.
  11865. Retracting elaborate*copy-see-to-output-link
  11866. -->
  11867. (I3 ^see 0 +)
  11868. Retracting propose*predict-no
  11869. -->
  11870. (O1966 ^name predict-no +)
  11871. (S1 ^operator O1966 +)
  11872. Retracting propose*predict-yes
  11873. -->
  11874. (O1965 ^name predict-yes +)
  11875. (S1 ^operator O1965 +)
  11876. Retracting elaborate*reward*based*on*reward
  11877. -->
  11878. (R986 ^value 1 +)
  11879. (R1 ^reward R986 +)
  11880. Retracting elaborate*copy-dir-to-output-link
  11881. -->
  11882. (I3 ^dir R +)
  11883. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  11884. -->
  11885. (S1 ^operator O1966 = 0.4910065094545203)
  11886. Retracting rl*prefer*rvt*predict-no*H0*4
  11887. -->
  11888. (S1 ^operator O1966 = 0.1269767445579436)
  11889. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  11890. -->
  11891. (S1 ^operator O1965 = 0.6170773487089456)
  11892. Retracting rl*prefer*rvt*predict-yes*H0*3
  11893. -->
  11894. (S1 ^operator O1965 = 0.3829301257264589)
  11895. =>WM: (13827: S1 ^operator O1968 +)
  11896. =>WM: (13826: S1 ^operator O1967 +)
  11897. =>WM: (13825: I3 ^dir U)
  11898. =>WM: (13824: O1968 ^name predict-no)
  11899. =>WM: (13823: O1967 ^name predict-yes)
  11900. =>WM: (13822: R987 ^value 1)
  11901. =>WM: (13821: R1 ^reward R987)
  11902. =>WM: (13820: I3 ^see 1)
  11903. <=WM: (13811: S1 ^operator O1965 +)
  11904. <=WM: (13813: S1 ^operator O1965)
  11905. <=WM: (13812: S1 ^operator O1966 +)
  11906. <=WM: (13810: I3 ^dir R)
  11907. <=WM: (13806: R1 ^reward R986)
  11908. <=WM: (13805: I3 ^see 0)
  11909. <=WM: (13809: O1966 ^name predict-no)
  11910. <=WM: (13808: O1965 ^name predict-yes)
  11911. <=WM: (13807: R986 ^value 1)
  11912. --- Inner Elaboration Phase, active level 1 (S1) ---
  11913. Firing prefer*rvt*predict-yes*H0
  11914. -->
  11915. Firing rl*prefer*rvt*predict-yes*H0*5
  11916. -->
  11917. (S1 ^operator O1967 = 0.)
  11918. Firing prefer*rvt*predict-no*H0
  11919. -->
  11920. Firing rl*prefer*rvt*predict-no*H0*6
  11921. -->
  11922. (S1 ^operator O1968 = 0.9999999999999999)
  11923. inner elaboration loop at bottom goal.
  11924. Retracting rl*prefer*rvt*predict-no*H0*6
  11925. -->
  11926. (S1 ^operator O1966 = 0.9999999999999999)
  11927. Retracting rl*prefer*rvt*predict-yes*H0*5
  11928. -->
  11929. (S1 ^operator O1965 = 0.)
  11930. --- END Proposal Phase ---
  11931. --- Decision Phase ---
  11932. RL update rl*prefer*rvt*predict-yes*H0*3 0.673124 -0.290194 0.38293 -> 0.673123 -0.290194 0.382929(R,m,v=1,0.960265,0.0384106)
  11933. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326883 0.290195 0.617077 -> 0.326882 0.290195 0.617076(R,m,v=1,1,0)
  11934. =>WM: (13828: S1 ^operator O1968)
  11935. 984: O: O1968 (predict-no)
  11936. --- END Decision Phase ---
  11937. --- Application Phase ---
  11938. --- Firing Productions (PE) For State At Depth 1 ---
  11939. --- Inner Elaboration Phase, active level 1 (S1) ---
  11940. Firing apply*operator
  11941. -->
  11942. (I3 ^predict-no N984 + :O )
  11943. Firing apply*operator*complete
  11944. -->
  11945. (I3 ^predict-yes N983 - :O )
  11946. inner elaboration loop at bottom goal.
  11947. --- Change Working Memory (PE) ---
  11948. =>WM: (13829: I3 ^predict-no N984)
  11949. <=WM: (13815: N983 ^status complete)
  11950. <=WM: (13814: I3 ^predict-yes N983)
  11951. --- Firing Productions (IE) For State At Depth 1 ---
  11952. --- Inner Elaboration Phase, active level 1 (S1) ---
  11953. Firing monitor*world
  11954. -->
  11955. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11956. --- Change Working Memory (IE) ---
  11957. --- END Application Phase ---
  11958. --- Output Phase ---
  11959. ENV: Agent did: predict-no for direction U in state State-B
  11960. In State-B moving U
  11961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11962. predict error 0
  11963. dir: dir isU
  11964. --- END Output Phase ---
  11965. \-/--- Input Phase ---
  11966. =>WM: (13833: I2 ^dir U)
  11967. =>WM: (13832: I2 ^reward 1)
  11968. =>WM: (13831: I2 ^see 0)
  11969. =>WM: (13830: N984 ^status complete)
  11970. <=WM: (13818: I2 ^dir U)
  11971. <=WM: (13817: I2 ^reward 1)
  11972. <=WM: (13816: I2 ^see 1)
  11973. =>WM: (13834: I2 ^level-1 R1-root)
  11974. <=WM: (13819: I2 ^level-1 R1-root)
  11975. --- END Input Phase ---
  11976. --- Proposal Phase ---
  11977. --- Inner Elaboration Phase, active level 1 (S1) ---
  11978. Firing elaborate*copy-see-to-output-link
  11979. -->
  11980. (I3 ^see 0 +)
  11981. Firing elaborate*reward*based*on*reward
  11982. -->
  11983. (R988 ^value 1 +)
  11984. (R1 ^reward R988 +)
  11985. Firing propose*predict-yes
  11986. -->
  11987. (O1969 ^name predict-yes +)
  11988. (S1 ^operator O1969 +)
  11989. Firing propose*predict-no
  11990. -->
  11991. (O1970 ^name predict-no +)
  11992. (S1 ^operator O1970 +)
  11993. Firing rl*prefer*rvt*predict-no*H0*6
  11994. -->
  11995. (S1 ^operator O1968 = 0.9999999999999999)
  11996. Firing rl*prefer*rvt*predict-yes*H0*5
  11997. -->
  11998. (S1 ^operator O1967 = 0.)
  11999. Firing prefer*rvt*predict-yes*H0
  12000. -->
  12001. Firing prefer*rvt*predict-no*H0
  12002. -->
  12003. Firing elaborate*copy-dir-to-output-link
  12004. -->
  12005. (I3 ^dir U +)
  12006. inner elaboration loop at bottom goal.
  12007. Retracting elaborate*copy-see-to-output-link
  12008. -->
  12009. (I3 ^see 1 +)
  12010. Retracting propose*predict-no
  12011. -->
  12012. (O1968 ^name predict-no +)
  12013. (S1 ^operator O1968 +)
  12014. Retracting propose*predict-yes
  12015. -->
  12016. (O1967 ^name predict-yes +)
  12017. (S1 ^operator O1967 +)
  12018. Retracting elaborate*reward*based*on*reward
  12019. -->
  12020. (R987 ^value 1 +)
  12021. (R1 ^reward R987 +)
  12022. Retracting elaborate*copy-dir-to-output-link
  12023. -->
  12024. (I3 ^dir U +)
  12025. Retracting rl*prefer*rvt*predict-no*H0*6
  12026. -->
  12027. (S1 ^operator O1968 = 0.9999999999999999)
  12028. Retracting rl*prefer*rvt*predict-yes*H0*5
  12029. -->
  12030. (S1 ^operator O1967 = 0.)
  12031. =>WM: (13841: S1 ^operator O1970 +)
  12032. =>WM: (13840: S1 ^operator O1969 +)
  12033. =>WM: (13839: O1970 ^name predict-no)
  12034. =>WM: (13838: O1969 ^name predict-yes)
  12035. =>WM: (13837: R988 ^value 1)
  12036. =>WM: (13836: R1 ^reward R988)
  12037. =>WM: (13835: I3 ^see 0)
  12038. <=WM: (13826: S1 ^operator O1967 +)
  12039. <=WM: (13827: S1 ^operator O1968 +)
  12040. <=WM: (13828: S1 ^operator O1968)
  12041. <=WM: (13821: R1 ^reward R987)
  12042. <=WM: (13820: I3 ^see 1)
  12043. <=WM: (13824: O1968 ^name predict-no)
  12044. <=WM: (13823: O1967 ^name predict-yes)
  12045. <=WM: (13822: R987 ^value 1)
  12046. --- Inner Elaboration Phase, active level 1 (S1) ---
  12047. Firing prefer*rvt*predict-yes*H0
  12048. -->
  12049. Firing rl*prefer*rvt*predict-yes*H0*5
  12050. -->
  12051. (S1 ^operator O1969 = 0.)
  12052. Firing prefer*rvt*predict-no*H0
  12053. -->
  12054. Firing rl*prefer*rvt*predict-no*H0*6
  12055. -->
  12056. (S1 ^operator O1970 = 0.9999999999999999)
  12057. inner elaboration loop at bottom goal.
  12058. Retracting rl*prefer*rvt*predict-no*H0*6
  12059. -->
  12060. (S1 ^operator O1968 = 0.9999999999999999)
  12061. Retracting rl*prefer*rvt*predict-yes*H0*5
  12062. -->
  12063. (S1 ^operator O1967 = 0.)
  12064. --- END Proposal Phase ---
  12065. --- Decision Phase ---
  12066. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12067. =>WM: (13842: S1 ^operator O1970)
  12068. 985: O: O1970 (predict-no)
  12069. --- END Decision Phase ---
  12070. --- Application Phase ---
  12071. --- Firing Productions (PE) For State At Depth 1 ---
  12072. --- Inner Elaboration Phase, active level 1 (S1) ---
  12073. Firing apply*operator
  12074. -->
  12075. (I3 ^predict-no N985 + :O )
  12076. Firing apply*operator*complete
  12077. -->
  12078. (I3 ^predict-no N984 - :O )
  12079. inner elaboration loop at bottom goal.
  12080. --- Change Working Memory (PE) ---
  12081. =>WM: (13843: I3 ^predict-no N985)
  12082. <=WM: (13830: N984 ^status complete)
  12083. <=WM: (13829: I3 ^predict-no N984)
  12084. --- Firing Productions (IE) For State At Depth 1 ---
  12085. --- Inner Elaboration Phase, active level 1 (S1) ---
  12086. Firing monitor*world
  12087. -->
  12088. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12089. --- Change Working Memory (IE) ---
  12090. --- END Application Phase ---
  12091. --- Output Phase ---
  12092. ENV: Agent did: predict-no for direction U in state State-B
  12093. In State-B moving U
  12094. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12095. predict error 0
  12096. dir: dir isR
  12097. --- END Output Phase ---
  12098. --- Input Phase ---
  12099. =>WM: (13847: I2 ^dir R)
  12100. =>WM: (13846: I2 ^reward 1)
  12101. =>WM: (13845: I2 ^see 0)
  12102. =>WM: (13844: N985 ^status complete)
  12103. <=WM: (13833: I2 ^dir U)
  12104. <=WM: (13832: I2 ^reward 1)
  12105. <=WM: (13831: I2 ^see 0)
  12106. =>WM: (13848: I2 ^level-1 R1-root)
  12107. <=WM: (13834: I2 ^level-1 R1-root)
  12108. --- END Input Phase ---
  12109. --- Proposal Phase ---
  12110. --- Inner Elaboration Phase, active level 1 (S1) ---
  12111. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  12112. -->
  12113. (S1 ^operator O1969 = 0.08783148430849691)
  12114. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  12115. -->
  12116. (S1 ^operator O1970 = 0.8730234118412079)
  12117. Firing prefer*rvt*predict-no*H0*4*H1
  12118. -->
  12119. Firing prefer*rvt*predict-yes*H0*3*H1
  12120. -->
  12121. Firing elaborate*copy-see-to-output-link
  12122. -->
  12123. (I3 ^see 0 +)
  12124. Firing elaborate*reward*based*on*reward
  12125. -->
  12126. (R989 ^value 1 +)
  12127. (R1 ^reward R989 +)
  12128. Firing propose*predict-yes
  12129. -->
  12130. (O1971 ^name predict-yes +)
  12131. (S1 ^operator O1971 +)
  12132. Firing propose*predict-no
  12133. -->
  12134. (O1972 ^name predict-no +)
  12135. (S1 ^operator O1972 +)
  12136. Firing rl*prefer*rvt*predict-no*H0*4
  12137. -->
  12138. (S1 ^operator O1970 = 0.1269767445579436)
  12139. Firing rl*prefer*rvt*predict-yes*H0*3
  12140. -->
  12141. (S1 ^operator O1969 = 0.3829290045611482)
  12142. Firing prefer*rvt*predict-yes*H0
  12143. -->
  12144. Firing prefer*rvt*predict-no*H0
  12145. -->
  12146. Firing elaborate*copy-dir-to-output-link
  12147. -->
  12148. (I3 ^dir R +)
  12149. inner elaboration loop at bottom goal.
  12150. Retracting elaborate*copy-see-to-output-link
  12151. -->
  12152. (I3 ^see 0 +)
  12153. Retracting propose*predict-no
  12154. -->
  12155. (O1970 ^name predict-no +)
  12156. (S1 ^operator O1970 +)
  12157. Retracting propose*predict-yes
  12158. -->
  12159. (O1969 ^name predict-yes +)
  12160. (S1 ^operator O1969 +)
  12161. Retracting elaborate*reward*based*on*reward
  12162. -->
  12163. (R988 ^value 1 +)
  12164. (R1 ^reward R988 +)
  12165. Retracting elaborate*copy-dir-to-output-link
  12166. -->
  12167. (I3 ^dir U +)
  12168. Retracting rl*prefer*rvt*predict-no*H0*6
  12169. -->
  12170. (S1 ^operator O1970 = 0.9999999999999999)
  12171. Retracting rl*prefer*rvt*predict-yes*H0*5
  12172. -->
  12173. (S1 ^operator O1969 = 0.)
  12174. =>WM: (13855: S1 ^operator O1972 +)
  12175. =>WM: (13854: S1 ^operator O1971 +)
  12176. =>WM: (13853: I3 ^dir R)
  12177. =>WM: (13852: O1972 ^name predict-no)
  12178. =>WM: (13851: O1971 ^name predict-yes)
  12179. =>WM: (13850: R989 ^value 1)
  12180. =>WM: (13849: R1 ^reward R989)
  12181. <=WM: (13840: S1 ^operator O1969 +)
  12182. <=WM: (13841: S1 ^operator O1970 +)
  12183. <=WM: (13842: S1 ^operator O1970)
  12184. <=WM: (13825: I3 ^dir U)
  12185. <=WM: (13836: R1 ^reward R988)
  12186. <=WM: (13839: O1970 ^name predict-no)
  12187. <=WM: (13838: O1969 ^name predict-yes)
  12188. <=WM: (13837: R988 ^value 1)
  12189. --- Inner Elaboration Phase, active level 1 (S1) ---
  12190. Firing prefer*rvt*predict-yes*H0
  12191. -->
  12192. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  12193. -->
  12194. (S1 ^operator O1971 = 0.08783148430849691)
  12195. Firing rl*prefer*rvt*predict-yes*H0*3
  12196. -->
  12197. (S1 ^operator O1971 = 0.3829290045611482)
  12198. Firing prefer*rvt*predict-yes*H0*3*H1
  12199. -->
  12200. Firing prefer*rvt*predict-no*H0
  12201. -->
  12202. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  12203. -->
  12204. (S1 ^operator O1972 = 0.8730234118412079)
  12205. Firing rl*prefer*rvt*predict-no*H0*4
  12206. -->
  12207. (S1 ^operator O1972 = 0.1269767445579436)
  12208. Firing prefer*rvt*predict-no*H0*4*H1
  12209. -->
  12210. inner elaboration loop at bottom goal.
  12211. Retracting rl*prefer*rvt*predict-no*H0*4
  12212. -->
  12213. (S1 ^operator O1970 = 0.1269767445579436)
  12214. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  12215. -->
  12216. (S1 ^operator O1970 = 0.8730234118412079)
  12217. Retracting rl*prefer*rvt*predict-yes*H0*3
  12218. -->
  12219. (S1 ^operator O1969 = 0.3829290045611482)
  12220. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  12221. -->
  12222. (S1 ^operator O1969 = 0.08783148430849691)
  12223. --- END Proposal Phase ---
  12224. --- Decision Phase ---
  12225. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12226. =>WM: (13856: S1 ^operator O1972)
  12227. 986: O: O1972 (predict-no)
  12228. --- END Decision Phase ---
  12229. --- Application Phase ---
  12230. --- Firing Productions (PE) For State At Depth 1 ---
  12231. --- Inner Elaboration Phase, active level 1 (S1) ---
  12232. Firing apply*operator
  12233. -->
  12234. (I3 ^predict-no N986 + :O )
  12235. Firing apply*operator*complete
  12236. -->
  12237. (I3 ^predict-no N985 - :O )
  12238. inner elaboration loop at bottom goal.
  12239. --- Change Working Memory (PE) ---
  12240. =>WM: (13857: I3 ^predict-no N986)
  12241. <=WM: (13844: N985 ^status complete)
  12242. <=WM: (13843: I3 ^predict-no N985)
  12243. --- Firing Productions (IE) For State At Depth 1 ---
  12244. --- Inner Elaboration Phase, active level 1 (S1) ---
  12245. Firing monitor*world
  12246. -->
  12247. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12248. --- Change Working Memory (IE) ---
  12249. --- END Application Phase ---
  12250. --- Output Phase ---
  12251. ENV: Agent did: predict-no for direction R in state State-B
  12252. In State-B moving R
  12253. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12254. predict error 0
  12255. dir: dir isR
  12256. --- END Output Phase ---
  12257. --- Input Phase ---
  12258. =>WM: (13861: I2 ^dir R)
  12259. =>WM: (13860: I2 ^reward 1)
  12260. =>WM: (13859: I2 ^see 0)
  12261. =>WM: (13858: N986 ^status complete)
  12262. <=WM: (13847: I2 ^dir R)
  12263. <=WM: (13846: I2 ^reward 1)
  12264. <=WM: (13845: I2 ^see 0)
  12265. =>WM: (13862: I2 ^level-1 R0-root)
  12266. <=WM: (13848: I2 ^level-1 R1-root)
  12267. --- END Input Phase ---
  12268. --- Proposal Phase ---
  12269. --- Inner Elaboration Phase, active level 1 (S1) ---
  12270. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12271. -->
  12272. (S1 ^operator O1971 = 0.2696941111808541)
  12273. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12274. -->
  12275. (S1 ^operator O1972 = 0.8730228631156078)
  12276. Firing prefer*rvt*predict-no*H0*4*H1
  12277. -->
  12278. Firing prefer*rvt*predict-yes*H0*3*H1
  12279. -->
  12280. Firing elaborate*copy-see-to-output-link
  12281. -->
  12282. (I3 ^see 0 +)
  12283. Firing elaborate*reward*based*on*reward
  12284. -->
  12285. (R990 ^value 1 +)
  12286. (R1 ^reward R990 +)
  12287. Firing propose*predict-yes
  12288. -->
  12289. (O1973 ^name predict-yes +)
  12290. (S1 ^operator O1973 +)
  12291. Firing propose*predict-no
  12292. -->
  12293. (O1974 ^name predict-no +)
  12294. (S1 ^operator O1974 +)
  12295. Firing rl*prefer*rvt*predict-no*H0*4
  12296. -->
  12297. (S1 ^operator O1972 = 0.1269767445579436)
  12298. Firing rl*prefer*rvt*predict-yes*H0*3
  12299. -->
  12300. (S1 ^operator O1971 = 0.3829290045611482)
  12301. Firing prefer*rvt*predict-yes*H0
  12302. -->
  12303. Firing prefer*rvt*predict-no*H0
  12304. -->
  12305. Firing elaborate*copy-dir-to-output-link
  12306. -->
  12307. (I3 ^dir R +)
  12308. inner elaboration loop at bottom goal.
  12309. Retracting elaborate*copy-see-to-output-link
  12310. -->
  12311. (I3 ^see 0 +)
  12312. Retracting propose*predict-no
  12313. -->
  12314. (O1972 ^name predict-no +)
  12315. (S1 ^operator O1972 +)
  12316. Retracting propose*predict-yes
  12317. -->
  12318. (O1971 ^name predict-yes +)
  12319. (S1 ^operator O1971 +)
  12320. Retracting elaborate*reward*based*on*reward
  12321. -->
  12322. (R989 ^value 1 +)
  12323. (R1 ^reward R989 +)
  12324. Retracting elaborate*copy-dir-to-output-link
  12325. -->
  12326. (I3 ^dir R +)
  12327. Retracting rl*prefer*rvt*predict-no*H0*4
  12328. -->
  12329. (S1 ^operator O1972 = 0.1269767445579436)
  12330. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  12331. -->
  12332. (S1 ^operator O1972 = 0.8730234118412079)
  12333. Retracting rl*prefer*rvt*predict-yes*H0*3
  12334. -->
  12335. (S1 ^operator O1971 = 0.3829290045611482)
  12336. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  12337. -->
  12338. (S1 ^operator O1971 = 0.08783148430849691)
  12339. =>WM: (13868: S1 ^operator O1974 +)
  12340. =>WM: (13867: S1 ^operator O1973 +)
  12341. =>WM: (13866: O1974 ^name predict-no)
  12342. =>WM: (13865: O1973 ^name predict-yes)
  12343. =>WM: (13864: R990 ^value 1)
  12344. =>WM: (13863: R1 ^reward R990)
  12345. <=WM: (13854: S1 ^operator O1971 +)
  12346. <=WM: (13855: S1 ^operator O1972 +)
  12347. <=WM: (13856: S1 ^operator O1972)
  12348. <=WM: (13849: R1 ^reward R989)
  12349. <=WM: (13852: O1972 ^name predict-no)
  12350. <=WM: (13851: O1971 ^name predict-yes)
  12351. <=WM: (13850: R989 ^value 1)
  12352. --- Inner Elaboration Phase, active level 1 (S1) ---
  12353. Firing prefer*rvt*predict-yes*H0
  12354. -->
  12355. Firing rl*prefer*rvt*predict-yes*H0*3
  12356. -->
  12357. (S1 ^operator O1973 = 0.3829290045611482)
  12358. Firing prefer*rvt*predict-yes*H0*3*H1
  12359. -->
  12360. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12361. -->
  12362. (S1 ^operator O1973 = 0.2696941111808541)
  12363. Firing prefer*rvt*predict-no*H0
  12364. -->
  12365. Firing rl*prefer*rvt*predict-no*H0*4
  12366. -->
  12367. (S1 ^operator O1974 = 0.1269767445579436)
  12368. Firing prefer*rvt*predict-no*H0*4*H1
  12369. -->
  12370. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12371. -->
  12372. (S1 ^operator O1974 = 0.8730228631156078)
  12373. inner elaboration loop at bottom goal.
  12374. Retracting rl*prefer*rvt*predict-no*H0*4
  12375. -->
  12376. (S1 ^operator O1972 = 0.1269767445579436)
  12377. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12378. -->
  12379. (S1 ^operator O1972 = 0.8730228631156078)
  12380. Retracting rl*prefer*rvt*predict-yes*H0*3
  12381. -->
  12382. (S1 ^operator O1971 = 0.3829290045611482)
  12383. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12384. -->
  12385. (S1 ^operator O1971 = 0.2696941111808541)
  12386. --- END Proposal Phase ---
  12387. --- Decision Phase ---
  12388. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.948276,0.0493323)
  12389. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  12390. =>WM: (13869: S1 ^operator O1974)
  12391. 987: O: O1974 (predict-no)
  12392. --- END Decision Phase ---
  12393. --- Application Phase ---
  12394. --- Firing Productions (PE) For State At Depth 1 ---
  12395. --- Inner Elaboration Phase, active level 1 (S1) ---
  12396. Firing apply*operator
  12397. -->
  12398. (I3 ^predict-no N987 + :O )
  12399. Firing apply*operator*complete
  12400. -->
  12401. (I3 ^predict-no N986 - :O )
  12402. inner elaboration loop at bottom goal.
  12403. --- Change Working Memory (PE) ---
  12404. =>WM: (13870: I3 ^predict-no N987)
  12405. <=WM: (13858: N986 ^status complete)
  12406. <=WM: (13857: I3 ^predict-no N986)
  12407. --- Firing Productions (IE) For State At Depth 1 ---
  12408. --- Inner Elaboration Phase, active level 1 (S1) ---
  12409. Firing monitor*world
  12410. -->
  12411. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12412. --- Change Working Memory (IE) ---
  12413. --- END Application Phase ---
  12414. --- Output Phase ---
  12415. ENV: Agent did: predict-no for direction R in state State-B
  12416. In State-B moving R
  12417. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12418. predict error 0
  12419. dir: dir isR
  12420. --- END Output Phase ---
  12421. --- Input Phase ---
  12422. =>WM: (13874: I2 ^dir R)
  12423. =>WM: (13873: I2 ^reward 1)
  12424. =>WM: (13872: I2 ^see 0)
  12425. =>WM: (13871: N987 ^status complete)
  12426. <=WM: (13861: I2 ^dir R)
  12427. <=WM: (13860: I2 ^reward 1)
  12428. <=WM: (13859: I2 ^see 0)
  12429. =>WM: (13875: I2 ^level-1 R0-root)
  12430. <=WM: (13862: I2 ^level-1 R0-root)
  12431. --- END Input Phase ---
  12432. --- Proposal Phase ---
  12433. --- Inner Elaboration Phase, active level 1 (S1) ---
  12434. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12435. -->
  12436. (S1 ^operator O1973 = 0.2696941111808541)
  12437. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12438. -->
  12439. (S1 ^operator O1974 = 0.8730228631156078)
  12440. Firing prefer*rvt*predict-no*H0*4*H1
  12441. -->
  12442. Firing prefer*rvt*predict-yes*H0*3*H1
  12443. -->
  12444. Firing elaborate*copy-see-to-output-link
  12445. -->
  12446. (I3 ^see 0 +)
  12447. Firing elaborate*reward*based*on*reward
  12448. -->
  12449. (R991 ^value 1 +)
  12450. (R1 ^reward R991 +)
  12451. Firing propose*predict-yes
  12452. -->
  12453. (O1975 ^name predict-yes +)
  12454. (S1 ^operator O1975 +)
  12455. Firing propose*predict-no
  12456. -->
  12457. (O1976 ^name predict-no +)
  12458. (S1 ^operator O1976 +)
  12459. Firing rl*prefer*rvt*predict-no*H0*4
  12460. -->
  12461. (S1 ^operator O1974 = 0.1269767210980709)
  12462. Firing rl*prefer*rvt*predict-yes*H0*3
  12463. -->
  12464. (S1 ^operator O1973 = 0.3829290045611482)
  12465. Firing prefer*rvt*predict-yes*H0
  12466. -->
  12467. Firing prefer*rvt*predict-no*H0
  12468. -->
  12469. Firing elaborate*copy-dir-to-output-link
  12470. -->
  12471. (I3 ^dir R +)
  12472. inner elaboration loop at bottom goal.
  12473. Retracting elaborate*copy-see-to-output-link
  12474. -->
  12475. (I3 ^see 0 +)
  12476. Retracting propose*predict-no
  12477. -->
  12478. (O1974 ^name predict-no +)
  12479. (S1 ^operator O1974 +)
  12480. Retracting propose*predict-yes
  12481. -->
  12482. (O1973 ^name predict-yes +)
  12483. (S1 ^operator O1973 +)
  12484. Retracting elaborate*reward*based*on*reward
  12485. -->
  12486. (R990 ^value 1 +)
  12487. (R1 ^reward R990 +)
  12488. Retracting elaborate*copy-dir-to-output-link
  12489. -->
  12490. (I3 ^dir R +)
  12491. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12492. -->
  12493. (S1 ^operator O1974 = 0.8730228631156078)
  12494. Retracting rl*prefer*rvt*predict-no*H0*4
  12495. -->
  12496. (S1 ^operator O1974 = 0.1269767210980709)
  12497. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12498. -->
  12499. (S1 ^operator O1973 = 0.2696941111808541)
  12500. Retracting rl*prefer*rvt*predict-yes*H0*3
  12501. -->
  12502. (S1 ^operator O1973 = 0.3829290045611482)
  12503. =>WM: (13881: S1 ^operator O1976 +)
  12504. =>WM: (13880: S1 ^operator O1975 +)
  12505. =>WM: (13879: O1976 ^name predict-no)
  12506. =>WM: (13878: O1975 ^name predict-yes)
  12507. =>WM: (13877: R991 ^value 1)
  12508. =>WM: (13876: R1 ^reward R991)
  12509. <=WM: (13867: S1 ^operator O1973 +)
  12510. <=WM: (13868: S1 ^operator O1974 +)
  12511. <=WM: (13869: S1 ^operator O1974)
  12512. <=WM: (13863: R1 ^reward R990)
  12513. <=WM: (13866: O1974 ^name predict-no)
  12514. <=WM: (13865: O1973 ^name predict-yes)
  12515. <=WM: (13864: R990 ^value 1)
  12516. --- Inner Elaboration Phase, active level 1 (S1) ---
  12517. Firing prefer*rvt*predict-yes*H0
  12518. -->
  12519. Firing rl*prefer*rvt*predict-yes*H0*3
  12520. -->
  12521. (S1 ^operator O1975 = 0.3829290045611482)
  12522. Firing prefer*rvt*predict-yes*H0*3*H1
  12523. -->
  12524. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12525. -->
  12526. (S1 ^operator O1975 = 0.2696941111808541)
  12527. Firing prefer*rvt*predict-no*H0
  12528. -->
  12529. Firing rl*prefer*rvt*predict-no*H0*4
  12530. -->
  12531. (S1 ^operator O1976 = 0.1269767210980709)
  12532. Firing prefer*rvt*predict-no*H0*4*H1
  12533. -->
  12534. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12535. -->
  12536. (S1 ^operator O1976 = 0.8730228631156078)
  12537. inner elaboration loop at bottom goal.
  12538. Retracting rl*prefer*rvt*predict-no*H0*4
  12539. -->
  12540. (S1 ^operator O1974 = 0.1269767210980709)
  12541. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12542. -->
  12543. (S1 ^operator O1974 = 0.8730228631156078)
  12544. Retracting rl*prefer*rvt*predict-yes*H0*3
  12545. -->
  12546. (S1 ^operator O1973 = 0.3829290045611482)
  12547. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12548. -->
  12549. (S1 ^operator O1973 = 0.2696941111808541)
  12550. --- END Proposal Phase ---
  12551. --- Decision Phase ---
  12552. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.948571,0.049064)
  12553. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  12554. =>WM: (13882: S1 ^operator O1976)
  12555. 988: O: O1976 (predict-no)
  12556. --- END Decision Phase ---
  12557. --- Application Phase ---
  12558. --- Firing Productions (PE) For State At Depth 1 ---
  12559. --- Inner Elaboration Phase, active level 1 (S1) ---
  12560. Firing apply*operator
  12561. -->
  12562. (I3 ^predict-no N988 + :O )
  12563. Firing apply*operator*complete
  12564. -->
  12565. (I3 ^predict-no N987 - :O )
  12566. inner elaboration loop at bottom goal.
  12567. --- Change Working Memory (PE) ---
  12568. =>WM: (13883: I3 ^predict-no N988)
  12569. <=WM: (13871: N987 ^status complete)
  12570. <=WM: (13870: I3 ^predict-no N987)
  12571. --- Firing Productions (IE) For State At Depth 1 ---
  12572. --- Inner Elaboration Phase, active level 1 (S1) ---
  12573. Firing monitor*world
  12574. -->
  12575. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12576. --- Change Working Memory (IE) ---
  12577. --- END Application Phase ---
  12578. --- Output Phase ---
  12579. ENV: Agent did: predict-no for direction R in state State-B
  12580. In State-B moving R
  12581. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12582. predict error 0
  12583. dir: dir isR
  12584. --- END Output Phase ---
  12585. --- Input Phase ---
  12586. =>WM: (13887: I2 ^dir R)
  12587. =>WM: (13886: I2 ^reward 1)
  12588. =>WM: (13885: I2 ^see 0)
  12589. =>WM: (13884: N988 ^status complete)
  12590. <=WM: (13874: I2 ^dir R)
  12591. <=WM: (13873: I2 ^reward 1)
  12592. <=WM: (13872: I2 ^see 0)
  12593. =>WM: (13888: I2 ^level-1 R0-root)
  12594. <=WM: (13875: I2 ^level-1 R0-root)
  12595. --- END Input Phase ---
  12596. --- Proposal Phase ---
  12597. --- Inner Elaboration Phase, active level 1 (S1) ---
  12598. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12599. -->
  12600. (S1 ^operator O1975 = 0.2696941111808541)
  12601. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12602. -->
  12603. (S1 ^operator O1976 = 0.8730229254835561)
  12604. Firing prefer*rvt*predict-no*H0*4*H1
  12605. -->
  12606. Firing prefer*rvt*predict-yes*H0*3*H1
  12607. -->
  12608. Firing elaborate*copy-see-to-output-link
  12609. -->
  12610. (I3 ^see 0 +)
  12611. Firing elaborate*reward*based*on*reward
  12612. -->
  12613. (R992 ^value 1 +)
  12614. (R1 ^reward R992 +)
  12615. Firing propose*predict-yes
  12616. -->
  12617. (O1977 ^name predict-yes +)
  12618. (S1 ^operator O1977 +)
  12619. Firing propose*predict-no
  12620. -->
  12621. (O1978 ^name predict-no +)
  12622. (S1 ^operator O1978 +)
  12623. Firing rl*prefer*rvt*predict-no*H0*4
  12624. -->
  12625. (S1 ^operator O1976 = 0.126976783466019)
  12626. Firing rl*prefer*rvt*predict-yes*H0*3
  12627. -->
  12628. (S1 ^operator O1975 = 0.3829290045611482)
  12629. Firing prefer*rvt*predict-yes*H0
  12630. -->
  12631. Firing prefer*rvt*predict-no*H0
  12632. -->
  12633. Firing elaborate*copy-dir-to-output-link
  12634. -->
  12635. (I3 ^dir R +)
  12636. inner elaboration loop at bottom goal.
  12637. Retracting elaborate*copy-see-to-output-link
  12638. -->
  12639. (I3 ^see 0 +)
  12640. Retracting propose*predict-no
  12641. -->
  12642. (O1976 ^name predict-no +)
  12643. (S1 ^operator O1976 +)
  12644. Retracting propose*predict-yes
  12645. -->
  12646. (O1975 ^name predict-yes +)
  12647. (S1 ^operator O1975 +)
  12648. Retracting elaborate*reward*based*on*reward
  12649. -->
  12650. (R991 ^value 1 +)
  12651. (R1 ^reward R991 +)
  12652. Retracting elaborate*copy-dir-to-output-link
  12653. -->
  12654. (I3 ^dir R +)
  12655. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12656. -->
  12657. (S1 ^operator O1976 = 0.8730229254835561)
  12658. Retracting rl*prefer*rvt*predict-no*H0*4
  12659. -->
  12660. (S1 ^operator O1976 = 0.126976783466019)
  12661. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12662. -->
  12663. (S1 ^operator O1975 = 0.2696941111808541)
  12664. Retracting rl*prefer*rvt*predict-yes*H0*3
  12665. -->
  12666. (S1 ^operator O1975 = 0.3829290045611482)
  12667. =>WM: (13894: S1 ^operator O1978 +)
  12668. =>WM: (13893: S1 ^operator O1977 +)
  12669. =>WM: (13892: O1978 ^name predict-no)
  12670. =>WM: (13891: O1977 ^name predict-yes)
  12671. =>WM: (13890: R992 ^value 1)
  12672. =>WM: (13889: R1 ^reward R992)
  12673. <=WM: (13880: S1 ^operator O1975 +)
  12674. <=WM: (13881: S1 ^operator O1976 +)
  12675. <=WM: (13882: S1 ^operator O1976)
  12676. <=WM: (13876: R1 ^reward R991)
  12677. <=WM: (13879: O1976 ^name predict-no)
  12678. <=WM: (13878: O1975 ^name predict-yes)
  12679. <=WM: (13877: R991 ^value 1)
  12680. --- Inner Elaboration Phase, active level 1 (S1) ---
  12681. Firing prefer*rvt*predict-yes*H0
  12682. -->
  12683. Firing rl*prefer*rvt*predict-yes*H0*3
  12684. -->
  12685. (S1 ^operator O1977 = 0.3829290045611482)
  12686. Firing prefer*rvt*predict-yes*H0*3*H1
  12687. -->
  12688. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12689. -->
  12690. (S1 ^operator O1977 = 0.2696941111808541)
  12691. Firing prefer*rvt*predict-no*H0
  12692. -->
  12693. Firing rl*prefer*rvt*predict-no*H0*4
  12694. -->
  12695. (S1 ^operator O1978 = 0.126976783466019)
  12696. Firing prefer*rvt*predict-no*H0*4*H1
  12697. -->
  12698. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12699. -->
  12700. (S1 ^operator O1978 = 0.8730229254835561)
  12701. inner elaboration loop at bottom goal.
  12702. Retracting rl*prefer*rvt*predict-no*H0*4
  12703. -->
  12704. (S1 ^operator O1976 = 0.126976783466019)
  12705. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12706. -->
  12707. (S1 ^operator O1976 = 0.8730229254835561)
  12708. Retracting rl*prefer*rvt*predict-yes*H0*3
  12709. -->
  12710. (S1 ^operator O1975 = 0.3829290045611482)
  12711. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12712. -->
  12713. (S1 ^operator O1975 = 0.2696941111808541)
  12714. --- END Proposal Phase ---
  12715. --- Decision Phase ---
  12716. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.948864,0.0487987)
  12717. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  12718. =>WM: (13895: S1 ^operator O1978)
  12719. 989: O: O1978 (predict-no)
  12720. --- END Decision Phase ---
  12721. --- Application Phase ---
  12722. --- Firing Productions (PE) For State At Depth 1 ---
  12723. --- Inner Elaboration Phase, active level 1 (S1) ---
  12724. Firing apply*operator
  12725. -->
  12726. (I3 ^predict-no N989 + :O )
  12727. Firing apply*operator*complete
  12728. -->
  12729. (I3 ^predict-no N988 - :O )
  12730. inner elaboration loop at bottom goal.
  12731. --- Change Working Memory (PE) ---
  12732. =>WM: (13896: I3 ^predict-no N989)
  12733. <=WM: (13884: N988 ^status complete)
  12734. <=WM: (13883: I3 ^predict-no N988)
  12735. --- Firing Productions (IE) For State At Depth 1 ---
  12736. --- Inner Elaboration Phase, active level 1 (S1) ---
  12737. Firing monitor*world
  12738. -->
  12739. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12740. --- Change Working Memory (IE) ---
  12741. --- END Application Phase ---
  12742. --- Output Phase ---
  12743. ENV: Agent did: predict-no for direction R in state State-B
  12744. In State-B moving R
  12745. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12746. predict error 0
  12747. dir: dir isR
  12748. --- END Output Phase ---
  12749. --- Input Phase ---
  12750. =>WM: (13900: I2 ^dir R)
  12751. =>WM: (13899: I2 ^reward 1)
  12752. =>WM: (13898: I2 ^see 0)
  12753. =>WM: (13897: N989 ^status complete)
  12754. <=WM: (13887: I2 ^dir R)
  12755. <=WM: (13886: I2 ^reward 1)
  12756. <=WM: (13885: I2 ^see 0)
  12757. =>WM: (13901: I2 ^level-1 R0-root)
  12758. <=WM: (13888: I2 ^level-1 R0-root)
  12759. --- END Input Phase ---
  12760. --- Proposal Phase ---
  12761. --- Inner Elaboration Phase, active level 1 (S1) ---
  12762. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12763. -->
  12764. (S1 ^operator O1977 = 0.2696941111808541)
  12765. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12766. -->
  12767. (S1 ^operator O1978 = 0.8730229691411198)
  12768. Firing prefer*rvt*predict-no*H0*4*H1
  12769. -->
  12770. Firing prefer*rvt*predict-yes*H0*3*H1
  12771. -->
  12772. Firing elaborate*copy-see-to-output-link
  12773. -->
  12774. (I3 ^see 0 +)
  12775. Firing elaborate*reward*based*on*reward
  12776. -->
  12777. (R993 ^value 1 +)
  12778. (R1 ^reward R993 +)
  12779. Firing propose*predict-yes
  12780. -->
  12781. (O1979 ^name predict-yes +)
  12782. (S1 ^operator O1979 +)
  12783. Firing propose*predict-no
  12784. -->
  12785. (O1980 ^name predict-no +)
  12786. (S1 ^operator O1980 +)
  12787. Firing rl*prefer*rvt*predict-no*H0*4
  12788. -->
  12789. (S1 ^operator O1978 = 0.1269768271235827)
  12790. Firing rl*prefer*rvt*predict-yes*H0*3
  12791. -->
  12792. (S1 ^operator O1977 = 0.3829290045611482)
  12793. Firing prefer*rvt*predict-yes*H0
  12794. -->
  12795. Firing prefer*rvt*predict-no*H0
  12796. -->
  12797. Firing elaborate*copy-dir-to-output-link
  12798. -->
  12799. (I3 ^dir R +)
  12800. inner elaboration loop at bottom goal.
  12801. Retracting elaborate*copy-see-to-output-link
  12802. -->
  12803. (I3 ^see 0 +)
  12804. Retracting propose*predict-no
  12805. -->
  12806. (O1978 ^name predict-no +)
  12807. (S1 ^operator O1978 +)
  12808. Retracting propose*predict-yes
  12809. -->
  12810. (O1977 ^name predict-yes +)
  12811. (S1 ^operator O1977 +)
  12812. Retracting elaborate*reward*based*on*reward
  12813. -->
  12814. (R992 ^value 1 +)
  12815. (R1 ^reward R992 +)
  12816. Retracting elaborate*copy-dir-to-output-link
  12817. -->
  12818. (I3 ^dir R +)
  12819. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12820. -->
  12821. (S1 ^operator O1978 = 0.8730229691411198)
  12822. Retracting rl*prefer*rvt*predict-no*H0*4
  12823. -->
  12824. (S1 ^operator O1978 = 0.1269768271235827)
  12825. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12826. -->
  12827. (S1 ^operator O1977 = 0.2696941111808541)
  12828. Retracting rl*prefer*rvt*predict-yes*H0*3
  12829. -->
  12830. (S1 ^operator O1977 = 0.3829290045611482)
  12831. =>WM: (13907: S1 ^operator O1980 +)
  12832. =>WM: (13906: S1 ^operator O1979 +)
  12833. =>WM: (13905: O1980 ^name predict-no)
  12834. =>WM: (13904: O1979 ^name predict-yes)
  12835. =>WM: (13903: R993 ^value 1)
  12836. =>WM: (13902: R1 ^reward R993)
  12837. <=WM: (13893: S1 ^operator O1977 +)
  12838. <=WM: (13894: S1 ^operator O1978 +)
  12839. <=WM: (13895: S1 ^operator O1978)
  12840. <=WM: (13889: R1 ^reward R992)
  12841. <=WM: (13892: O1978 ^name predict-no)
  12842. <=WM: (13891: O1977 ^name predict-yes)
  12843. <=WM: (13890: R992 ^value 1)
  12844. --- Inner Elaboration Phase, active level 1 (S1) ---
  12845. Firing prefer*rvt*predict-yes*H0
  12846. -->
  12847. Firing rl*prefer*rvt*predict-yes*H0*3
  12848. -->
  12849. (S1 ^operator O1979 = 0.3829290045611482)
  12850. Firing prefer*rvt*predict-yes*H0*3*H1
  12851. -->
  12852. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  12853. -->
  12854. (S1 ^operator O1979 = 0.2696941111808541)
  12855. Firing prefer*rvt*predict-no*H0
  12856. -->
  12857. Firing rl*prefer*rvt*predict-no*H0*4
  12858. -->
  12859. (S1 ^operator O1980 = 0.1269768271235827)
  12860. Firing prefer*rvt*predict-no*H0*4*H1
  12861. -->
  12862. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  12863. -->
  12864. (S1 ^operator O1980 = 0.8730229691411198)
  12865. inner elaboration loop at bottom goal.
  12866. Retracting rl*prefer*rvt*predict-no*H0*4
  12867. -->
  12868. (S1 ^operator O1978 = 0.1269768271235827)
  12869. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12870. -->
  12871. (S1 ^operator O1978 = 0.8730229691411198)
  12872. Retracting rl*prefer*rvt*predict-yes*H0*3
  12873. -->
  12874. (S1 ^operator O1977 = 0.3829290045611482)
  12875. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12876. -->
  12877. (S1 ^operator O1977 = 0.2696941111808541)
  12878. --- END Proposal Phase ---
  12879. --- Decision Phase ---
  12880. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.949153,0.0485362)
  12881. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  12882. =>WM: (13908: S1 ^operator O1980)
  12883. 990: O: O1980 (predict-no)
  12884. --- END Decision Phase ---
  12885. --- Application Phase ---
  12886. --- Firing Productions (PE) For State At Depth 1 ---
  12887. --- Inner Elaboration Phase, active level 1 (S1) ---
  12888. Firing apply*operator
  12889. -->
  12890. (I3 ^predict-no N990 + :O )
  12891. Firing apply*operator*complete
  12892. -->
  12893. (I3 ^predict-no N989 - :O )
  12894. inner elaboration loop at bottom goal.
  12895. --- Change Working Memory (PE) ---
  12896. =>WM: (13909: I3 ^predict-no N990)
  12897. <=WM: (13897: N989 ^status complete)
  12898. <=WM: (13896: I3 ^predict-no N989)
  12899. --- Firing Productions (IE) For State At Depth 1 ---
  12900. --- Inner Elaboration Phase, active level 1 (S1) ---
  12901. Firing monitor*world
  12902. -->
  12903. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12904. --- Change Working Memory (IE) ---
  12905. --- END Application Phase ---
  12906. --- Output Phase ---
  12907. ENV: Agent did: predict-no for direction R in state State-B
  12908. In State-B moving R
  12909. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12910. predict error 0
  12911. dir: dir isU
  12912. --- END Output Phase ---
  12913. --- Input Phase ---
  12914. =>WM: (13913: I2 ^dir U)
  12915. =>WM: (13912: I2 ^reward 1)
  12916. =>WM: (13911: I2 ^see 0)
  12917. =>WM: (13910: N990 ^status complete)
  12918. <=WM: (13900: I2 ^dir R)
  12919. <=WM: (13899: I2 ^reward 1)
  12920. <=WM: (13898: I2 ^see 0)
  12921. =>WM: (13914: I2 ^level-1 R0-root)
  12922. <=WM: (13901: I2 ^level-1 R0-root)
  12923. --- END Input Phase ---
  12924. --- Proposal Phase ---
  12925. --- Inner Elaboration Phase, active level 1 (S1) ---
  12926. Firing elaborate*copy-see-to-output-link
  12927. -->
  12928. (I3 ^see 0 +)
  12929. Firing elaborate*reward*based*on*reward
  12930. -->
  12931. (R994 ^value 1 +)
  12932. (R1 ^reward R994 +)
  12933. Firing propose*predict-yes
  12934. -->
  12935. (O1981 ^name predict-yes +)
  12936. (S1 ^operator O1981 +)
  12937. Firing propose*predict-no
  12938. -->
  12939. (O1982 ^name predict-no +)
  12940. (S1 ^operator O1982 +)
  12941. Firing rl*prefer*rvt*predict-no*H0*6
  12942. -->
  12943. (S1 ^operator O1980 = 0.9999999999999999)
  12944. Firing rl*prefer*rvt*predict-yes*H0*5
  12945. -->
  12946. (S1 ^operator O1979 = 0.)
  12947. Firing prefer*rvt*predict-yes*H0
  12948. -->
  12949. Firing prefer*rvt*predict-no*H0
  12950. -->
  12951. Firing elaborate*copy-dir-to-output-link
  12952. -->
  12953. (I3 ^dir U +)
  12954. inner elaboration loop at bottom goal.
  12955. Retracting elaborate*copy-see-to-output-link
  12956. -->
  12957. (I3 ^see 0 +)
  12958. Retracting propose*predict-no
  12959. -->
  12960. (O1980 ^name predict-no +)
  12961. (S1 ^operator O1980 +)
  12962. Retracting propose*predict-yes
  12963. -->
  12964. (O1979 ^name predict-yes +)
  12965. (S1 ^operator O1979 +)
  12966. Retracting elaborate*reward*based*on*reward
  12967. -->
  12968. (R993 ^value 1 +)
  12969. (R1 ^reward R993 +)
  12970. Retracting elaborate*copy-dir-to-output-link
  12971. -->
  12972. (I3 ^dir R +)
  12973. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  12974. -->
  12975. (S1 ^operator O1980 = 0.8730229997014144)
  12976. Retracting rl*prefer*rvt*predict-no*H0*4
  12977. -->
  12978. (S1 ^operator O1980 = 0.1269768576838773)
  12979. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  12980. -->
  12981. (S1 ^operator O1979 = 0.2696941111808541)
  12982. Retracting rl*prefer*rvt*predict-yes*H0*3
  12983. -->
  12984. (S1 ^operator O1979 = 0.3829290045611482)
  12985. =>WM: (13921: S1 ^operator O1982 +)
  12986. =>WM: (13920: S1 ^operator O1981 +)
  12987. =>WM: (13919: I3 ^dir U)
  12988. =>WM: (13918: O1982 ^name predict-no)
  12989. =>WM: (13917: O1981 ^name predict-yes)
  12990. =>WM: (13916: R994 ^value 1)
  12991. =>WM: (13915: R1 ^reward R994)
  12992. <=WM: (13906: S1 ^operator O1979 +)
  12993. <=WM: (13907: S1 ^operator O1980 +)
  12994. <=WM: (13908: S1 ^operator O1980)
  12995. <=WM: (13853: I3 ^dir R)
  12996. <=WM: (13902: R1 ^reward R993)
  12997. <=WM: (13905: O1980 ^name predict-no)
  12998. <=WM: (13904: O1979 ^name predict-yes)
  12999. <=WM: (13903: R993 ^value 1)
  13000. --- Inner Elaboration Phase, active level 1 (S1) ---
  13001. Firing prefer*rvt*predict-yes*H0
  13002. -->
  13003. Firing rl*prefer*rvt*predict-yes*H0*5
  13004. -->
  13005. (S1 ^operator O1981 = 0.)
  13006. Firing prefer*rvt*predict-no*H0
  13007. -->
  13008. Firing rl*prefer*rvt*predict-no*H0*6
  13009. -->
  13010. (S1 ^operator O1982 = 0.9999999999999999)
  13011. inner elaboration loop at bottom goal.
  13012. Retracting rl*prefer*rvt*predict-no*H0*6
  13013. -->
  13014. (S1 ^operator O1980 = 0.9999999999999999)
  13015. Retracting rl*prefer*rvt*predict-yes*H0*5
  13016. -->
  13017. (S1 ^operator O1979 = 0.)
  13018. --- END Proposal Phase ---
  13019. --- Decision Phase ---
  13020. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.949438,0.0482765)
  13021. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  13022. =>WM: (13922: S1 ^operator O1982)
  13023. 991: O: O1982 (predict-no)
  13024. --- END Decision Phase ---
  13025. --- Application Phase ---
  13026. --- Firing Productions (PE) For State At Depth 1 ---
  13027. --- Inner Elaboration Phase, active level 1 (S1) ---
  13028. Firing apply*operator
  13029. -->
  13030. (I3 ^predict-no N991 + :O )
  13031. Firing apply*operator*complete
  13032. -->
  13033. (I3 ^predict-no N990 - :O )
  13034. inner elaboration loop at bottom goal.
  13035. --- Change Working Memory (PE) ---
  13036. =>WM: (13923: I3 ^predict-no N991)
  13037. <=WM: (13910: N990 ^status complete)
  13038. <=WM: (13909: I3 ^predict-no N990)
  13039. --- Firing Productions (IE) For State At Depth 1 ---
  13040. --- Inner Elaboration Phase, active level 1 (S1) ---
  13041. Firing monitor*world
  13042. -->
  13043. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13044. --- Change Working Memory (IE) ---
  13045. --- END Application Phase ---
  13046. --- Output Phase ---
  13047. ENV: Agent did: predict-no for direction U in state State-B
  13048. In State-B moving U
  13049. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13050. predict error 0
  13051. dir: dir isL
  13052. --- END Output Phase ---
  13053. --- Input Phase ---
  13054. =>WM: (13927: I2 ^dir L)
  13055. =>WM: (13926: I2 ^reward 1)
  13056. =>WM: (13925: I2 ^see 0)
  13057. =>WM: (13924: N991 ^status complete)
  13058. <=WM: (13913: I2 ^dir U)
  13059. <=WM: (13912: I2 ^reward 1)
  13060. <=WM: (13911: I2 ^see 0)
  13061. =>WM: (13928: I2 ^level-1 R0-root)
  13062. <=WM: (13914: I2 ^level-1 R0-root)
  13063. --- END Input Phase ---
  13064. --- Proposal Phase ---
  13065. --- Inner Elaboration Phase, active level 1 (S1) ---
  13066. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  13067. -->
  13068. (S1 ^operator O1981 = 0.4768833204434785)
  13069. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  13070. -->
  13071. (S1 ^operator O1982 = 0.1700769046561409)
  13072. Firing prefer*rvt*predict-no*H0*2*H1
  13073. -->
  13074. Firing prefer*rvt*predict-yes*H0*1*H1
  13075. -->
  13076. Firing elaborate*copy-see-to-output-link
  13077. -->
  13078. (I3 ^see 0 +)
  13079. Firing elaborate*reward*based*on*reward
  13080. -->
  13081. (R995 ^value 1 +)
  13082. (R1 ^reward R995 +)
  13083. Firing propose*predict-yes
  13084. -->
  13085. (O1983 ^name predict-yes +)
  13086. (S1 ^operator O1983 +)
  13087. Firing propose*predict-no
  13088. -->
  13089. (O1984 ^name predict-no +)
  13090. (S1 ^operator O1984 +)
  13091. Firing rl*prefer*rvt*predict-no*H0*2
  13092. -->
  13093. (S1 ^operator O1982 = 0.2550133828092577)
  13094. Firing rl*prefer*rvt*predict-yes*H0*1
  13095. -->
  13096. (S1 ^operator O1981 = 0.5231200982015054)
  13097. Firing prefer*rvt*predict-yes*H0
  13098. -->
  13099. Firing prefer*rvt*predict-no*H0
  13100. -->
  13101. Firing elaborate*copy-dir-to-output-link
  13102. -->
  13103. (I3 ^dir L +)
  13104. inner elaboration loop at bottom goal.
  13105. Retracting elaborate*copy-see-to-output-link
  13106. -->
  13107. (I3 ^see 0 +)
  13108. Retracting propose*predict-no
  13109. -->
  13110. (O1982 ^name predict-no +)
  13111. (S1 ^operator O1982 +)
  13112. Retracting propose*predict-yes
  13113. -->
  13114. (O1981 ^name predict-yes +)
  13115. (S1 ^operator O1981 +)
  13116. Retracting elaborate*reward*based*on*reward
  13117. -->
  13118. (R994 ^value 1 +)
  13119. (R1 ^reward R994 +)
  13120. Retracting elaborate*copy-dir-to-output-link
  13121. -->
  13122. (I3 ^dir U +)
  13123. Retracting rl*prefer*rvt*predict-no*H0*6
  13124. -->
  13125. (S1 ^operator O1982 = 0.9999999999999999)
  13126. Retracting rl*prefer*rvt*predict-yes*H0*5
  13127. -->
  13128. (S1 ^operator O1981 = 0.)
  13129. =>WM: (13935: S1 ^operator O1984 +)
  13130. =>WM: (13934: S1 ^operator O1983 +)
  13131. =>WM: (13933: I3 ^dir L)
  13132. =>WM: (13932: O1984 ^name predict-no)
  13133. =>WM: (13931: O1983 ^name predict-yes)
  13134. =>WM: (13930: R995 ^value 1)
  13135. =>WM: (13929: R1 ^reward R995)
  13136. <=WM: (13920: S1 ^operator O1981 +)
  13137. <=WM: (13921: S1 ^operator O1982 +)
  13138. <=WM: (13922: S1 ^operator O1982)
  13139. <=WM: (13919: I3 ^dir U)
  13140. <=WM: (13915: R1 ^reward R994)
  13141. <=WM: (13918: O1982 ^name predict-no)
  13142. <=WM: (13917: O1981 ^name predict-yes)
  13143. <=WM: (13916: R994 ^value 1)
  13144. --- Inner Elaboration Phase, active level 1 (S1) ---
  13145. Firing prefer*rvt*predict-yes*H0
  13146. -->
  13147. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  13148. -->
  13149. (S1 ^operator O1983 = 0.4768833204434785)
  13150. Firing rl*prefer*rvt*predict-yes*H0*1
  13151. -->
  13152. (S1 ^operator O1983 = 0.5231200982015054)
  13153. Firing prefer*rvt*predict-yes*H0*1*H1
  13154. -->
  13155. Firing prefer*rvt*predict-no*H0
  13156. -->
  13157. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  13158. -->
  13159. (S1 ^operator O1984 = 0.1700769046561409)
  13160. Firing rl*prefer*rvt*predict-no*H0*2
  13161. -->
  13162. (S1 ^operator O1984 = 0.2550133828092577)
  13163. Firing prefer*rvt*predict-no*H0*2*H1
  13164. -->
  13165. inner elaboration loop at bottom goal.
  13166. Retracting rl*prefer*rvt*predict-no*H0*2
  13167. -->
  13168. (S1 ^operator O1982 = 0.2550133828092577)
  13169. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  13170. -->
  13171. (S1 ^operator O1982 = 0.1700769046561409)
  13172. Retracting rl*prefer*rvt*predict-yes*H0*1
  13173. -->
  13174. (S1 ^operator O1981 = 0.5231200982015054)
  13175. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  13176. -->
  13177. (S1 ^operator O1981 = 0.4768833204434785)
  13178. --- END Proposal Phase ---
  13179. --- Decision Phase ---
  13180. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13181. =>WM: (13936: S1 ^operator O1983)
  13182. 992: O: O1983 (predict-yes)
  13183. --- END Decision Phase ---
  13184. --- Application Phase ---
  13185. --- Firing Productions (PE) For State At Depth 1 ---
  13186. --- Inner Elaboration Phase, active level 1 (S1) ---
  13187. Firing apply*operator
  13188. -->
  13189. (I3 ^predict-yes N992 + :O )
  13190. Firing apply*operator*complete
  13191. -->
  13192. (I3 ^predict-no N991 - :O )
  13193. inner elaboration loop at bottom goal.
  13194. --- Change Working Memory (PE) ---
  13195. =>WM: (13937: I3 ^predict-yes N992)
  13196. <=WM: (13924: N991 ^status complete)
  13197. <=WM: (13923: I3 ^predict-no N991)
  13198. --- Firing Productions (IE) For State At Depth 1 ---
  13199. --- Inner Elaboration Phase, active level 1 (S1) ---
  13200. Firing monitor*world
  13201. -->
  13202. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13203. --- Change Working Memory (IE) ---
  13204. --- END Application Phase ---
  13205. --- Output Phase ---
  13206. ENV: Agent did: predict-yes for direction L in state State-B
  13207. In State-B moving L
  13208. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13209. predict error 0
  13210. dir: dir isR
  13211. --- END Output Phase ---
  13212. --- Input Phase ---
  13213. =>WM: (13941: I2 ^dir R)
  13214. =>WM: (13940: I2 ^reward 1)
  13215. =>WM: (13939: I2 ^see 1)
  13216. =>WM: (13938: N992 ^status complete)
  13217. <=WM: (13927: I2 ^dir L)
  13218. <=WM: (13926: I2 ^reward 1)
  13219. <=WM: (13925: I2 ^see 0)
  13220. =>WM: (13942: I2 ^level-1 L1-root)
  13221. <=WM: (13928: I2 ^level-1 R0-root)
  13222. --- END Input Phase ---
  13223. --- Proposal Phase ---
  13224. --- Inner Elaboration Phase, active level 1 (S1) ---
  13225. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13226. -->
  13227. (S1 ^operator O1983 = 0.6170271815281626)
  13228. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13229. -->
  13230. (S1 ^operator O1984 = 0.4901349546100854)
  13231. Firing prefer*rvt*predict-no*H0*4*H1
  13232. -->
  13233. Firing prefer*rvt*predict-yes*H0*3*H1
  13234. -->
  13235. Firing elaborate*copy-see-to-output-link
  13236. -->
  13237. (I3 ^see 1 +)
  13238. Firing elaborate*reward*based*on*reward
  13239. -->
  13240. (R996 ^value 1 +)
  13241. (R1 ^reward R996 +)
  13242. Firing propose*predict-yes
  13243. -->
  13244. (O1985 ^name predict-yes +)
  13245. (S1 ^operator O1985 +)
  13246. Firing propose*predict-no
  13247. -->
  13248. (O1986 ^name predict-no +)
  13249. (S1 ^operator O1986 +)
  13250. Firing rl*prefer*rvt*predict-no*H0*4
  13251. -->
  13252. (S1 ^operator O1984 = 0.1269768790760836)
  13253. Firing rl*prefer*rvt*predict-yes*H0*3
  13254. -->
  13255. (S1 ^operator O1983 = 0.3829290045611482)
  13256. Firing prefer*rvt*predict-yes*H0
  13257. -->
  13258. Firing prefer*rvt*predict-no*H0
  13259. -->
  13260. Firing elaborate*copy-dir-to-output-link
  13261. -->
  13262. (I3 ^dir R +)
  13263. inner elaboration loop at bottom goal.
  13264. Retracting elaborate*copy-see-to-output-link
  13265. -->
  13266. (I3 ^see 0 +)
  13267. Retracting propose*predict-no
  13268. -->
  13269. (O1984 ^name predict-no +)
  13270. (S1 ^operator O1984 +)
  13271. Retracting propose*predict-yes
  13272. -->
  13273. (O1983 ^name predict-yes +)
  13274. (S1 ^operator O1983 +)
  13275. Retracting elaborate*reward*based*on*reward
  13276. -->
  13277. (R995 ^value 1 +)
  13278. (R1 ^reward R995 +)
  13279. Retracting elaborate*copy-dir-to-output-link
  13280. -->
  13281. (I3 ^dir L +)
  13282. Retracting rl*prefer*rvt*predict-no*H0*2
  13283. -->
  13284. (S1 ^operator O1984 = 0.2550133828092577)
  13285. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  13286. -->
  13287. (S1 ^operator O1984 = 0.1700769046561409)
  13288. Retracting rl*prefer*rvt*predict-yes*H0*1
  13289. -->
  13290. (S1 ^operator O1983 = 0.5231200982015054)
  13291. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  13292. -->
  13293. (S1 ^operator O1983 = 0.4768833204434785)
  13294. =>WM: (13950: S1 ^operator O1986 +)
  13295. =>WM: (13949: S1 ^operator O1985 +)
  13296. =>WM: (13948: I3 ^dir R)
  13297. =>WM: (13947: O1986 ^name predict-no)
  13298. =>WM: (13946: O1985 ^name predict-yes)
  13299. =>WM: (13945: R996 ^value 1)
  13300. =>WM: (13944: R1 ^reward R996)
  13301. =>WM: (13943: I3 ^see 1)
  13302. <=WM: (13934: S1 ^operator O1983 +)
  13303. <=WM: (13936: S1 ^operator O1983)
  13304. <=WM: (13935: S1 ^operator O1984 +)
  13305. <=WM: (13933: I3 ^dir L)
  13306. <=WM: (13929: R1 ^reward R995)
  13307. <=WM: (13835: I3 ^see 0)
  13308. <=WM: (13932: O1984 ^name predict-no)
  13309. <=WM: (13931: O1983 ^name predict-yes)
  13310. <=WM: (13930: R995 ^value 1)
  13311. --- Inner Elaboration Phase, active level 1 (S1) ---
  13312. Firing prefer*rvt*predict-yes*H0
  13313. -->
  13314. Firing rl*prefer*rvt*predict-yes*H0*3
  13315. -->
  13316. (S1 ^operator O1985 = 0.3829290045611482)
  13317. Firing prefer*rvt*predict-yes*H0*3*H1
  13318. -->
  13319. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13320. -->
  13321. (S1 ^operator O1985 = 0.6170271815281626)
  13322. Firing prefer*rvt*predict-no*H0
  13323. -->
  13324. Firing rl*prefer*rvt*predict-no*H0*4
  13325. -->
  13326. (S1 ^operator O1986 = 0.1269768790760836)
  13327. Firing prefer*rvt*predict-no*H0*4*H1
  13328. -->
  13329. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13330. -->
  13331. (S1 ^operator O1986 = 0.4901349546100854)
  13332. inner elaboration loop at bottom goal.
  13333. Retracting rl*prefer*rvt*predict-no*H0*4
  13334. -->
  13335. (S1 ^operator O1984 = 0.1269768790760836)
  13336. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13337. -->
  13338. (S1 ^operator O1984 = 0.4901349546100854)
  13339. Retracting rl*prefer*rvt*predict-yes*H0*3
  13340. -->
  13341. (S1 ^operator O1983 = 0.3829290045611482)
  13342. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13343. -->
  13344. (S1 ^operator O1983 = 0.6170271815281626)
  13345. --- END Proposal Phase ---
  13346. --- Decision Phase ---
  13347. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.727959 -0.20484 0.52312(R,m,v=1,0.978723,0.0209726)
  13348. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272045 0.204839 0.476883 -> 0.272044 0.204839 0.476883(R,m,v=1,1,0)
  13349. =>WM: (13951: S1 ^operator O1985)
  13350. 993: O: O1985 (predict-yes)
  13351. --- END Decision Phase ---
  13352. --- Application Phase ---
  13353. --- Firing Productions (PE) For State At Depth 1 ---
  13354. --- Inner Elaboration Phase, active level 1 (S1) ---
  13355. Firing apply*operator
  13356. -->
  13357. (I3 ^predict-yes N993 + :O )
  13358. Firing apply*operator*complete
  13359. -->
  13360. (I3 ^predict-yes N992 - :O )
  13361. inner elaboration loop at bottom goal.
  13362. --- Change Working Memory (PE) ---
  13363. =>WM: (13952: I3 ^predict-yes N993)
  13364. <=WM: (13938: N992 ^status complete)
  13365. <=WM: (13937: I3 ^predict-yes N992)
  13366. --- Firing Productions (IE) For State At Depth 1 ---
  13367. --- Inner Elaboration Phase, active level 1 (S1) ---
  13368. Firing monitor*world
  13369. -->
  13370. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13371. --- Change Working Memory (IE) ---
  13372. --- END Application Phase ---
  13373. --- Output Phase ---
  13374. ENV: Agent did: predict-yes for direction R in state State-A
  13375. In State-A moving R
  13376. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  13377. predict error 0
  13378. dir: dir isU
  13379. --- END Output Phase ---
  13380. --- Input Phase ---
  13381. =>WM: (13956: I2 ^dir U)
  13382. =>WM: (13955: I2 ^reward 1)
  13383. =>WM: (13954: I2 ^see 1)
  13384. =>WM: (13953: N993 ^status complete)
  13385. <=WM: (13941: I2 ^dir R)
  13386. <=WM: (13940: I2 ^reward 1)
  13387. <=WM: (13939: I2 ^see 1)
  13388. =>WM: (13957: I2 ^level-1 R1-root)
  13389. <=WM: (13942: I2 ^level-1 L1-root)
  13390. --- END Input Phase ---
  13391. --- Proposal Phase ---
  13392. --- Inner Elaboration Phase, active level 1 (S1) ---
  13393. Firing elaborate*copy-see-to-output-link
  13394. -->
  13395. (I3 ^see 1 +)
  13396. Firing elaborate*reward*based*on*reward
  13397. -->
  13398. (R997 ^value 1 +)
  13399. (R1 ^reward R997 +)
  13400. Firing propose*predict-yes
  13401. -->
  13402. (O1987 ^name predict-yes +)
  13403. (S1 ^operator O1987 +)
  13404. Firing propose*predict-no
  13405. -->
  13406. (O1988 ^name predict-no +)
  13407. (S1 ^operator O1988 +)
  13408. Firing rl*prefer*rvt*predict-no*H0*6
  13409. -->
  13410. (S1 ^operator O1986 = 0.9999999999999999)
  13411. Firing rl*prefer*rvt*predict-yes*H0*5
  13412. -->
  13413. (S1 ^operator O1985 = 0.)
  13414. Firing prefer*rvt*predict-yes*H0
  13415. -->
  13416. Firing prefer*rvt*predict-no*H0
  13417. -->
  13418. Firing elaborate*copy-dir-to-output-link
  13419. -->
  13420. (I3 ^dir U +)
  13421. inner elaboration loop at bottom goal.
  13422. Retracting elaborate*copy-see-to-output-link
  13423. -->
  13424. (I3 ^see 1 +)
  13425. Retracting propose*predict-no
  13426. -->
  13427. (O1986 ^name predict-no +)
  13428. (S1 ^operator O1986 +)
  13429. Retracting propose*predict-yes
  13430. -->
  13431. (O1985 ^name predict-yes +)
  13432. (S1 ^operator O1985 +)
  13433. Retracting elaborate*reward*based*on*reward
  13434. -->
  13435. (R996 ^value 1 +)
  13436. (R1 ^reward R996 +)
  13437. Retracting elaborate*copy-dir-to-output-link
  13438. -->
  13439. (I3 ^dir R +)
  13440. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13441. -->
  13442. (S1 ^operator O1986 = 0.4901349546100854)
  13443. Retracting rl*prefer*rvt*predict-no*H0*4
  13444. -->
  13445. (S1 ^operator O1986 = 0.1269768790760836)
  13446. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13447. -->
  13448. (S1 ^operator O1985 = 0.6170271815281626)
  13449. Retracting rl*prefer*rvt*predict-yes*H0*3
  13450. -->
  13451. (S1 ^operator O1985 = 0.3829290045611482)
  13452. =>WM: (13964: S1 ^operator O1988 +)
  13453. =>WM: (13963: S1 ^operator O1987 +)
  13454. =>WM: (13962: I3 ^dir U)
  13455. =>WM: (13961: O1988 ^name predict-no)
  13456. =>WM: (13960: O1987 ^name predict-yes)
  13457. =>WM: (13959: R997 ^value 1)
  13458. =>WM: (13958: R1 ^reward R997)
  13459. <=WM: (13949: S1 ^operator O1985 +)
  13460. <=WM: (13951: S1 ^operator O1985)
  13461. <=WM: (13950: S1 ^operator O1986 +)
  13462. <=WM: (13948: I3 ^dir R)
  13463. <=WM: (13944: R1 ^reward R996)
  13464. <=WM: (13947: O1986 ^name predict-no)
  13465. <=WM: (13946: O1985 ^name predict-yes)
  13466. <=WM: (13945: R996 ^value 1)
  13467. --- Inner Elaboration Phase, active level 1 (S1) ---
  13468. Firing prefer*rvt*predict-yes*H0
  13469. -->
  13470. Firing rl*prefer*rvt*predict-yes*H0*5
  13471. -->
  13472. (S1 ^operator O1987 = 0.)
  13473. Firing prefer*rvt*predict-no*H0
  13474. -->
  13475. Firing rl*prefer*rvt*predict-no*H0*6
  13476. -->
  13477. (S1 ^operator O1988 = 0.9999999999999999)
  13478. inner elaboration loop at bottom goal.
  13479. Retracting rl*prefer*rvt*predict-no*H0*6
  13480. -->
  13481. (S1 ^operator O1986 = 0.9999999999999999)
  13482. Retracting rl*prefer*rvt*predict-yes*H0*5
  13483. -->
  13484. (S1 ^operator O1985 = 0.)
  13485. --- END Proposal Phase ---
  13486. --- Decision Phase ---
  13487. RL update rl*prefer*rvt*predict-yes*H0*3 0.673123 -0.290194 0.382929 -> 0.673129 -0.290194 0.382936(R,m,v=1,0.960526,0.0381666)
  13488. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326837 0.29019 0.617027 -> 0.326843 0.290191 0.617034(R,m,v=1,1,0)
  13489. =>WM: (13965: S1 ^operator O1988)
  13490. 994: O: O1988 (predict-no)
  13491. --- END Decision Phase ---
  13492. --- Application Phase ---
  13493. --- Firing Productions (PE) For State At Depth 1 ---
  13494. --- Inner Elaboration Phase, active level 1 (S1) ---
  13495. Firing apply*operator
  13496. -->
  13497. (I3 ^predict-no N994 + :O )
  13498. Firing apply*operator*complete
  13499. -->
  13500. (I3 ^predict-yes N993 - :O )
  13501. inner elaboration loop at bottom goal.
  13502. --- Change Working Memory (PE) ---
  13503. =>WM: (13966: I3 ^predict-no N994)
  13504. <=WM: (13953: N993 ^status complete)
  13505. <=WM: (13952: I3 ^predict-yes N993)
  13506. --- Firing Productions (IE) For State At Depth 1 ---
  13507. --- Inner Elaboration Phase, active level 1 (S1) ---
  13508. Firing monitor*world
  13509. -->
  13510. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13511. --- Change Working Memory (IE) ---
  13512. --- END Application Phase ---
  13513. --- Output Phase ---
  13514. ENV: Agent did: predict-no for direction U in state State-B
  13515. In State-B moving U
  13516. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13517. predict error 0
  13518. dir: dir isU
  13519. --- END Output Phase ---
  13520. --- Input Phase ---
  13521. =>WM: (13970: I2 ^dir U)
  13522. =>WM: (13969: I2 ^reward 1)
  13523. =>WM: (13968: I2 ^see 0)
  13524. =>WM: (13967: N994 ^status complete)
  13525. <=WM: (13956: I2 ^dir U)
  13526. <=WM: (13955: I2 ^reward 1)
  13527. <=WM: (13954: I2 ^see 1)
  13528. =>WM: (13971: I2 ^level-1 R1-root)
  13529. <=WM: (13957: I2 ^level-1 R1-root)
  13530. --- END Input Phase ---
  13531. --- Proposal Phase ---
  13532. --- Inner Elaboration Phase, active level 1 (S1) ---
  13533. Firing elaborate*copy-see-to-output-link
  13534. -->
  13535. (I3 ^see 0 +)
  13536. Firing elaborate*reward*based*on*reward
  13537. -->
  13538. (R998 ^value 1 +)
  13539. (R1 ^reward R998 +)
  13540. Firing propose*predict-yes
  13541. -->
  13542. (O1989 ^name predict-yes +)
  13543. (S1 ^operator O1989 +)
  13544. Firing propose*predict-no
  13545. -->
  13546. (O1990 ^name predict-no +)
  13547. (S1 ^operator O1990 +)
  13548. Firing rl*prefer*rvt*predict-no*H0*6
  13549. -->
  13550. (S1 ^operator O1988 = 0.9999999999999999)
  13551. Firing rl*prefer*rvt*predict-yes*H0*5
  13552. -->
  13553. (S1 ^operator O1987 = 0.)
  13554. Firing prefer*rvt*predict-yes*H0
  13555. -->
  13556. Firing prefer*rvt*predict-no*H0
  13557. -->
  13558. Firing elaborate*copy-dir-to-output-link
  13559. -->
  13560. (I3 ^dir U +)
  13561. inner elaboration loop at bottom goal.
  13562. Retracting elaborate*copy-see-to-output-link
  13563. -->
  13564. (I3 ^see 1 +)
  13565. Retracting propose*predict-no
  13566. -->
  13567. (O1988 ^name predict-no +)
  13568. (S1 ^operator O1988 +)
  13569. Retracting propose*predict-yes
  13570. -->
  13571. (O1987 ^name predict-yes +)
  13572. (S1 ^operator O1987 +)
  13573. Retracting elaborate*reward*based*on*reward
  13574. -->
  13575. (R997 ^value 1 +)
  13576. (R1 ^reward R997 +)
  13577. Retracting elaborate*copy-dir-to-output-link
  13578. -->
  13579. (I3 ^dir U +)
  13580. Retracting rl*prefer*rvt*predict-no*H0*6
  13581. -->
  13582. (S1 ^operator O1988 = 0.9999999999999999)
  13583. Retracting rl*prefer*rvt*predict-yes*H0*5
  13584. -->
  13585. (S1 ^operator O1987 = 0.)
  13586. =>WM: (13978: S1 ^operator O1990 +)
  13587. =>WM: (13977: S1 ^operator O1989 +)
  13588. =>WM: (13976: O1990 ^name predict-no)
  13589. =>WM: (13975: O1989 ^name predict-yes)
  13590. =>WM: (13974: R998 ^value 1)
  13591. =>WM: (13973: R1 ^reward R998)
  13592. =>WM: (13972: I3 ^see 0)
  13593. <=WM: (13963: S1 ^operator O1987 +)
  13594. <=WM: (13964: S1 ^operator O1988 +)
  13595. <=WM: (13965: S1 ^operator O1988)
  13596. <=WM: (13958: R1 ^reward R997)
  13597. <=WM: (13943: I3 ^see 1)
  13598. <=WM: (13961: O1988 ^name predict-no)
  13599. <=WM: (13960: O1987 ^name predict-yes)
  13600. <=WM: (13959: R997 ^value 1)
  13601. --- Inner Elaboration Phase, active level 1 (S1) ---
  13602. Firing prefer*rvt*predict-yes*H0
  13603. -->
  13604. Firing rl*prefer*rvt*predict-yes*H0*5
  13605. -->
  13606. (S1 ^operator O1989 = 0.)
  13607. Firing prefer*rvt*predict-no*H0
  13608. -->
  13609. Firing rl*prefer*rvt*predict-no*H0*6
  13610. -->
  13611. (S1 ^operator O1990 = 0.9999999999999999)
  13612. inner elaboration loop at bottom goal.
  13613. Retracting rl*prefer*rvt*predict-no*H0*6
  13614. -->
  13615. (S1 ^operator O1988 = 0.9999999999999999)
  13616. Retracting rl*prefer*rvt*predict-yes*H0*5
  13617. -->
  13618. (S1 ^operator O1987 = 0.)
  13619. --- END Proposal Phase ---
  13620. --- Decision Phase ---
  13621. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13622. =>WM: (13979: S1 ^operator O1990)
  13623. 995: O: O1990 (predict-no)
  13624. --- END Decision Phase ---
  13625. --- Application Phase ---
  13626. --- Firing Productions (PE) For State At Depth 1 ---
  13627. --- Inner Elaboration Phase, active level 1 (S1) ---
  13628. Firing apply*operator
  13629. -->
  13630. (I3 ^predict-no N995 + :O )
  13631. Firing apply*operator*complete
  13632. -->
  13633. (I3 ^predict-no N994 - :O )
  13634. inner elaboration loop at bottom goal.
  13635. --- Change Working Memory (PE) ---
  13636. =>WM: (13980: I3 ^predict-no N995)
  13637. <=WM: (13967: N994 ^status complete)
  13638. <=WM: (13966: I3 ^predict-no N994)
  13639. --- Firing Productions (IE) For State At Depth 1 ---
  13640. --- Inner Elaboration Phase, active level 1 (S1) ---
  13641. Firing monitor*world
  13642. -->
  13643. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13644. --- Change Working Memory (IE) ---
  13645. --- END Application Phase ---
  13646. --- Output Phase ---
  13647. ENV: Agent did: predict-no for direction U in state State-B
  13648. In State-B moving U
  13649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13650. predict error 0
  13651. dir: dir isL
  13652. --- END Output Phase ---
  13653. --- Input Phase ---
  13654. =>WM: (13984: I2 ^dir L)
  13655. =>WM: (13983: I2 ^reward 1)
  13656. =>WM: (13982: I2 ^see 0)
  13657. =>WM: (13981: N995 ^status complete)
  13658. <=WM: (13970: I2 ^dir U)
  13659. <=WM: (13969: I2 ^reward 1)
  13660. <=WM: (13968: I2 ^see 0)
  13661. =>WM: (13985: I2 ^level-1 R1-root)
  13662. <=WM: (13971: I2 ^level-1 R1-root)
  13663. --- END Input Phase ---
  13664. --- Proposal Phase ---
  13665. --- Inner Elaboration Phase, active level 1 (S1) ---
  13666. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  13667. -->
  13668. (S1 ^operator O1989 = 0.4768774843644236)
  13669. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  13670. -->
  13671. (S1 ^operator O1990 = -0.01194930198035649)
  13672. Firing prefer*rvt*predict-no*H0*2*H1
  13673. -->
  13674. Firing prefer*rvt*predict-yes*H0*1*H1
  13675. -->
  13676. Firing elaborate*copy-see-to-output-link
  13677. -->
  13678. (I3 ^see 0 +)
  13679. Firing elaborate*reward*based*on*reward
  13680. -->
  13681. (R999 ^value 1 +)
  13682. (R1 ^reward R999 +)
  13683. Firing propose*predict-yes
  13684. -->
  13685. (O1991 ^name predict-yes +)
  13686. (S1 ^operator O1991 +)
  13687. Firing propose*predict-no
  13688. -->
  13689. (O1992 ^name predict-no +)
  13690. (S1 ^operator O1992 +)
  13691. Firing rl*prefer*rvt*predict-no*H0*2
  13692. -->
  13693. (S1 ^operator O1990 = 0.2550133828092577)
  13694. Firing rl*prefer*rvt*predict-yes*H0*1
  13695. -->
  13696. (S1 ^operator O1989 = 0.5231195854047579)
  13697. Firing prefer*rvt*predict-yes*H0
  13698. -->
  13699. Firing prefer*rvt*predict-no*H0
  13700. -->
  13701. Firing elaborate*copy-dir-to-output-link
  13702. -->
  13703. (I3 ^dir L +)
  13704. inner elaboration loop at bottom goal.
  13705. Retracting elaborate*copy-see-to-output-link
  13706. -->
  13707. (I3 ^see 0 +)
  13708. Retracting propose*predict-no
  13709. -->
  13710. (O1990 ^name predict-no +)
  13711. (S1 ^operator O1990 +)
  13712. Retracting propose*predict-yes
  13713. -->
  13714. (O1989 ^name predict-yes +)
  13715. (S1 ^operator O1989 +)
  13716. Retracting elaborate*reward*based*on*reward
  13717. -->
  13718. (R998 ^value 1 +)
  13719. (R1 ^reward R998 +)
  13720. Retracting elaborate*copy-dir-to-output-link
  13721. -->
  13722. (I3 ^dir U +)
  13723. Retracting rl*prefer*rvt*predict-no*H0*6
  13724. -->
  13725. (S1 ^operator O1990 = 0.9999999999999999)
  13726. Retracting rl*prefer*rvt*predict-yes*H0*5
  13727. -->
  13728. (S1 ^operator O1989 = 0.)
  13729. =>WM: (13992: S1 ^operator O1992 +)
  13730. =>WM: (13991: S1 ^operator O1991 +)
  13731. =>WM: (13990: I3 ^dir L)
  13732. =>WM: (13989: O1992 ^name predict-no)
  13733. =>WM: (13988: O1991 ^name predict-yes)
  13734. =>WM: (13987: R999 ^value 1)
  13735. =>WM: (13986: R1 ^reward R999)
  13736. <=WM: (13977: S1 ^operator O1989 +)
  13737. <=WM: (13978: S1 ^operator O1990 +)
  13738. <=WM: (13979: S1 ^operator O1990)
  13739. <=WM: (13962: I3 ^dir U)
  13740. <=WM: (13973: R1 ^reward R998)
  13741. <=WM: (13976: O1990 ^name predict-no)
  13742. <=WM: (13975: O1989 ^name predict-yes)
  13743. <=WM: (13974: R998 ^value 1)
  13744. --- Inner Elaboration Phase, active level 1 (S1) ---
  13745. Firing prefer*rvt*predict-yes*H0
  13746. -->
  13747. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  13748. -->
  13749. (S1 ^operator O1991 = 0.4768774843644236)
  13750. Firing rl*prefer*rvt*predict-yes*H0*1
  13751. -->
  13752. (S1 ^operator O1991 = 0.5231195854047579)
  13753. Firing prefer*rvt*predict-yes*H0*1*H1
  13754. -->
  13755. Firing prefer*rvt*predict-no*H0
  13756. -->
  13757. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  13758. -->
  13759. (S1 ^operator O1992 = -0.01194930198035649)
  13760. Firing rl*prefer*rvt*predict-no*H0*2
  13761. -->
  13762. (S1 ^operator O1992 = 0.2550133828092577)
  13763. Firing prefer*rvt*predict-no*H0*2*H1
  13764. -->
  13765. inner elaboration loop at bottom goal.
  13766. Retracting rl*prefer*rvt*predict-no*H0*2
  13767. -->
  13768. (S1 ^operator O1990 = 0.2550133828092577)
  13769. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  13770. -->
  13771. (S1 ^operator O1990 = -0.01194930198035649)
  13772. Retracting rl*prefer*rvt*predict-yes*H0*1
  13773. -->
  13774. (S1 ^operator O1989 = 0.5231195854047579)
  13775. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  13776. -->
  13777. (S1 ^operator O1989 = 0.4768774843644236)
  13778. --- END Proposal Phase ---
  13779. --- Decision Phase ---
  13780. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13781. =>WM: (13993: S1 ^operator O1991)
  13782. 996: O: O1991 (predict-yes)
  13783. --- END Decision Phase ---
  13784. --- Application Phase ---
  13785. --- Firing Productions (PE) For State At Depth 1 ---
  13786. --- Inner Elaboration Phase, active level 1 (S1) ---
  13787. Firing apply*operator
  13788. -->
  13789. (I3 ^predict-yes N996 + :O )
  13790. Firing apply*operator*complete
  13791. -->
  13792. (I3 ^predict-no N995 - :O )
  13793. inner elaboration loop at bottom goal.
  13794. --- Change Working Memory (PE) ---
  13795. =>WM: (13994: I3 ^predict-yes N996)
  13796. <=WM: (13981: N995 ^status complete)
  13797. <=WM: (13980: I3 ^predict-no N995)
  13798. --- Firing Productions (IE) For State At Depth 1 ---
  13799. --- Inner Elaboration Phase, active level 1 (S1) ---
  13800. Firing monitor*world
  13801. -->
  13802. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13803. --- Change Working Memory (IE) ---
  13804. --- END Application Phase ---
  13805. --- Output Phase ---
  13806. ENV: Agent did: predict-yes for direction L in state State-B
  13807. In State-B moving L
  13808. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13809. predict error 0
  13810. dir: dir isU
  13811. --- END Output Phase ---
  13812. --- Input Phase ---
  13813. =>WM: (13998: I2 ^dir U)
  13814. =>WM: (13997: I2 ^reward 1)
  13815. =>WM: (13996: I2 ^see 1)
  13816. =>WM: (13995: N996 ^status complete)
  13817. <=WM: (13984: I2 ^dir L)
  13818. <=WM: (13983: I2 ^reward 1)
  13819. <=WM: (13982: I2 ^see 0)
  13820. =>WM: (13999: I2 ^level-1 L1-root)
  13821. <=WM: (13985: I2 ^level-1 R1-root)
  13822. --- END Input Phase ---
  13823. --- Proposal Phase ---
  13824. --- Inner Elaboration Phase, active level 1 (S1) ---
  13825. Firing elaborate*copy-see-to-output-link
  13826. -->
  13827. (I3 ^see 1 +)
  13828. Firing elaborate*reward*based*on*reward
  13829. -->
  13830. (R1000 ^value 1 +)
  13831. (R1 ^reward R1000 +)
  13832. Firing propose*predict-yes
  13833. -->
  13834. (O1993 ^name predict-yes +)
  13835. (S1 ^operator O1993 +)
  13836. Firing propose*predict-no
  13837. -->
  13838. (O1994 ^name predict-no +)
  13839. (S1 ^operator O1994 +)
  13840. Firing rl*prefer*rvt*predict-no*H0*6
  13841. -->
  13842. (S1 ^operator O1992 = 0.9999999999999999)
  13843. Firing rl*prefer*rvt*predict-yes*H0*5
  13844. -->
  13845. (S1 ^operator O1991 = 0.)
  13846. Firing prefer*rvt*predict-yes*H0
  13847. -->
  13848. Firing prefer*rvt*predict-no*H0
  13849. -->
  13850. Firing elaborate*copy-dir-to-output-link
  13851. -->
  13852. (I3 ^dir U +)
  13853. inner elaboration loop at bottom goal.
  13854. Retracting elaborate*copy-see-to-output-link
  13855. -->
  13856. (I3 ^see 0 +)
  13857. Retracting propose*predict-no
  13858. -->
  13859. (O1992 ^name predict-no +)
  13860. (S1 ^operator O1992 +)
  13861. Retracting propose*predict-yes
  13862. -->
  13863. (O1991 ^name predict-yes +)
  13864. (S1 ^operator O1991 +)
  13865. Retracting elaborate*reward*based*on*reward
  13866. -->
  13867. (R999 ^value 1 +)
  13868. (R1 ^reward R999 +)
  13869. Retracting elaborate*copy-dir-to-output-link
  13870. -->
  13871. (I3 ^dir L +)
  13872. Retracting rl*prefer*rvt*predict-no*H0*2
  13873. -->
  13874. (S1 ^operator O1992 = 0.2550133828092577)
  13875. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  13876. -->
  13877. (S1 ^operator O1992 = -0.01194930198035649)
  13878. Retracting rl*prefer*rvt*predict-yes*H0*1
  13879. -->
  13880. (S1 ^operator O1991 = 0.5231195854047579)
  13881. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  13882. -->
  13883. (S1 ^operator O1991 = 0.4768774843644236)
  13884. =>WM: (14007: S1 ^operator O1994 +)
  13885. =>WM: (14006: S1 ^operator O1993 +)
  13886. =>WM: (14005: I3 ^dir U)
  13887. =>WM: (14004: O1994 ^name predict-no)
  13888. =>WM: (14003: O1993 ^name predict-yes)
  13889. =>WM: (14002: R1000 ^value 1)
  13890. =>WM: (14001: R1 ^reward R1000)
  13891. =>WM: (14000: I3 ^see 1)
  13892. <=WM: (13991: S1 ^operator O1991 +)
  13893. <=WM: (13993: S1 ^operator O1991)
  13894. <=WM: (13992: S1 ^operator O1992 +)
  13895. <=WM: (13990: I3 ^dir L)
  13896. <=WM: (13986: R1 ^reward R999)
  13897. <=WM: (13972: I3 ^see 0)
  13898. <=WM: (13989: O1992 ^name predict-no)
  13899. <=WM: (13988: O1991 ^name predict-yes)
  13900. <=WM: (13987: R999 ^value 1)
  13901. --- Inner Elaboration Phase, active level 1 (S1) ---
  13902. Firing prefer*rvt*predict-yes*H0
  13903. -->
  13904. Firing rl*prefer*rvt*predict-yes*H0*5
  13905. -->
  13906. (S1 ^operator O1993 = 0.)
  13907. Firing prefer*rvt*predict-no*H0
  13908. -->
  13909. Firing rl*prefer*rvt*predict-no*H0*6
  13910. -->
  13911. (S1 ^operator O1994 = 0.9999999999999999)
  13912. inner elaboration loop at bottom goal.
  13913. Retracting rl*prefer*rvt*predict-no*H0*6
  13914. -->
  13915. (S1 ^operator O1992 = 0.9999999999999999)
  13916. Retracting rl*prefer*rvt*predict-yes*H0*5
  13917. -->
  13918. (S1 ^operator O1991 = 0.)
  13919. --- END Proposal Phase ---
  13920. --- Decision Phase ---
  13921. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.52312 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.978873,0.0208271)
  13922. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272037 0.204841 0.476877 -> 0.272037 0.204841 0.476878(R,m,v=1,1,0)
  13923. =>WM: (14008: S1 ^operator O1994)
  13924. 997: O: O1994 (predict-no)
  13925. --- END Decision Phase ---
  13926. --- Application Phase ---
  13927. --- Firing Productions (PE) For State At Depth 1 ---
  13928. --- Inner Elaboration Phase, active level 1 (S1) ---
  13929. Firing apply*operator
  13930. -->
  13931. (I3 ^predict-no N997 + :O )
  13932. Firing apply*operator*complete
  13933. -->
  13934. (I3 ^predict-yes N996 - :O )
  13935. inner elaboration loop at bottom goal.
  13936. --- Change Working Memory (PE) ---
  13937. =>WM: (14009: I3 ^predict-no N997)
  13938. <=WM: (13995: N996 ^status complete)
  13939. <=WM: (13994: I3 ^predict-yes N996)
  13940. --- Firing Productions (IE) For State At Depth 1 ---
  13941. --- Inner Elaboration Phase, active level 1 (S1) ---
  13942. Firing monitor*world
  13943. -->
  13944. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13945. --- Change Working Memory (IE) ---
  13946. --- END Application Phase ---
  13947. --- Output Phase ---
  13948. ENV: Agent did: predict-no for direction U in state State-A
  13949. In State-A moving U
  13950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13951. predict error 0
  13952. dir: dir isU
  13953. --- END Output Phase ---
  13954. --- Input Phase ---
  13955. =>WM: (14013: I2 ^dir U)
  13956. =>WM: (14012: I2 ^reward 1)
  13957. =>WM: (14011: I2 ^see 0)
  13958. =>WM: (14010: N997 ^status complete)
  13959. <=WM: (13998: I2 ^dir U)
  13960. <=WM: (13997: I2 ^reward 1)
  13961. <=WM: (13996: I2 ^see 1)
  13962. =>WM: (14014: I2 ^level-1 L1-root)
  13963. <=WM: (13999: I2 ^level-1 L1-root)
  13964. --- END Input Phase ---
  13965. --- Proposal Phase ---
  13966. --- Inner Elaboration Phase, active level 1 (S1) ---
  13967. Firing elaborate*copy-see-to-output-link
  13968. -->
  13969. (I3 ^see 0 +)
  13970. Firing elaborate*reward*based*on*reward
  13971. -->
  13972. (R1001 ^value 1 +)
  13973. (R1 ^reward R1001 +)
  13974. Firing propose*predict-yes
  13975. -->
  13976. (O1995 ^name predict-yes +)
  13977. (S1 ^operator O1995 +)
  13978. Firing propose*predict-no
  13979. -->
  13980. (O1996 ^name predict-no +)
  13981. (S1 ^operator O1996 +)
  13982. Firing rl*prefer*rvt*predict-no*H0*6
  13983. -->
  13984. (S1 ^operator O1994 = 0.9999999999999999)
  13985. Firing rl*prefer*rvt*predict-yes*H0*5
  13986. -->
  13987. (S1 ^operator O1993 = 0.)
  13988. Firing prefer*rvt*predict-yes*H0
  13989. -->
  13990. Firing prefer*rvt*predict-no*H0
  13991. -->
  13992. Firing elaborate*copy-dir-to-output-link
  13993. -->
  13994. (I3 ^dir U +)
  13995. inner elaboration loop at bottom goal.
  13996. Retracting elaborate*copy-see-to-output-link
  13997. -->
  13998. (I3 ^see 1 +)
  13999. Retracting propose*predict-no
  14000. -->
  14001. (O1994 ^name predict-no +)
  14002. (S1 ^operator O1994 +)
  14003. Retracting propose*predict-yes
  14004. -->
  14005. (O1993 ^name predict-yes +)
  14006. (S1 ^operator O1993 +)
  14007. Retracting elaborate*reward*based*on*reward
  14008. -->
  14009. (R1000 ^value 1 +)
  14010. (R1 ^reward R1000 +)
  14011. Retracting elaborate*copy-dir-to-output-link
  14012. -->
  14013. (I3 ^dir U +)
  14014. Retracting rl*prefer*rvt*predict-no*H0*6
  14015. -->
  14016. (S1 ^operator O1994 = 0.9999999999999999)
  14017. Retracting rl*prefer*rvt*predict-yes*H0*5
  14018. -->
  14019. (S1 ^operator O1993 = 0.)
  14020. =>WM: (14021: S1 ^operator O1996 +)
  14021. =>WM: (14020: S1 ^operator O1995 +)
  14022. =>WM: (14019: O1996 ^name predict-no)
  14023. =>WM: (14018: O1995 ^name predict-yes)
  14024. =>WM: (14017: R1001 ^value 1)
  14025. =>WM: (14016: R1 ^reward R1001)
  14026. =>WM: (14015: I3 ^see 0)
  14027. <=WM: (14006: S1 ^operator O1993 +)
  14028. <=WM: (14007: S1 ^operator O1994 +)
  14029. <=WM: (14008: S1 ^operator O1994)
  14030. <=WM: (14001: R1 ^reward R1000)
  14031. <=WM: (14000: I3 ^see 1)
  14032. <=WM: (14004: O1994 ^name predict-no)
  14033. <=WM: (14003: O1993 ^name predict-yes)
  14034. <=WM: (14002: R1000 ^value 1)
  14035. --- Inner Elaboration Phase, active level 1 (S1) ---
  14036. Firing prefer*rvt*predict-yes*H0
  14037. -->
  14038. Firing rl*prefer*rvt*predict-yes*H0*5
  14039. -->
  14040. (S1 ^operator O1995 = 0.)
  14041. Firing prefer*rvt*predict-no*H0
  14042. -->
  14043. Firing rl*prefer*rvt*predict-no*H0*6
  14044. -->
  14045. (S1 ^operator O1996 = 0.9999999999999999)
  14046. inner elaboration loop at bottom goal.
  14047. Retracting rl*prefer*rvt*predict-no*H0*6
  14048. -->
  14049. (S1 ^operator O1994 = 0.9999999999999999)
  14050. Retracting rl*prefer*rvt*predict-yes*H0*5
  14051. -->
  14052. (S1 ^operator O1993 = 0.)
  14053. --- END Proposal Phase ---
  14054. --- Decision Phase ---
  14055. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14056. =>WM: (14022: S1 ^operator O1996)
  14057. 998: O: O1996 (predict-no)
  14058. --- END Decision Phase ---
  14059. --- Application Phase ---
  14060. --- Firing Productions (PE) For State At Depth 1 ---
  14061. --- Inner Elaboration Phase, active level 1 (S1) ---
  14062. Firing apply*operator
  14063. -->
  14064. (I3 ^predict-no N998 + :O )
  14065. Firing apply*operator*complete
  14066. -->
  14067. (I3 ^predict-no N997 - :O )
  14068. inner elaboration loop at bottom goal.
  14069. --- Change Working Memory (PE) ---
  14070. =>WM: (14023: I3 ^predict-no N998)
  14071. <=WM: (14010: N997 ^status complete)
  14072. <=WM: (14009: I3 ^predict-no N997)
  14073. --- Firing Productions (IE) For State At Depth 1 ---
  14074. --- Inner Elaboration Phase, active level 1 (S1) ---
  14075. Firing monitor*world
  14076. -->
  14077. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14078. --- Change Working Memory (IE) ---
  14079. --- END Application Phase ---
  14080. --- Output Phase ---
  14081. ENV: Agent did: predict-no for direction U in state State-A
  14082. In State-A moving U
  14083. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14084. predict error 0
  14085. dir: dir isL
  14086. --- END Output Phase ---
  14087. --- Input Phase ---
  14088. =>WM: (14027: I2 ^dir L)
  14089. =>WM: (14026: I2 ^reward 1)
  14090. =>WM: (14025: I2 ^see 0)
  14091. =>WM: (14024: N998 ^status complete)
  14092. <=WM: (14013: I2 ^dir U)
  14093. <=WM: (14012: I2 ^reward 1)
  14094. <=WM: (14011: I2 ^see 0)
  14095. =>WM: (14028: I2 ^level-1 L1-root)
  14096. <=WM: (14014: I2 ^level-1 L1-root)
  14097. --- END Input Phase ---
  14098. --- Proposal Phase ---
  14099. --- Inner Elaboration Phase, active level 1 (S1) ---
  14100. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  14101. -->
  14102. (S1 ^operator O1995 = 0.1693592933936033)
  14103. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  14104. -->
  14105. (S1 ^operator O1996 = 0.7449864376794202)
  14106. Firing prefer*rvt*predict-no*H0*2*H1
  14107. -->
  14108. Firing prefer*rvt*predict-yes*H0*1*H1
  14109. -->
  14110. Firing elaborate*copy-see-to-output-link
  14111. -->
  14112. (I3 ^see 0 +)
  14113. Firing elaborate*reward*based*on*reward
  14114. -->
  14115. (R1002 ^value 1 +)
  14116. (R1 ^reward R1002 +)
  14117. Firing propose*predict-yes
  14118. -->
  14119. (O1997 ^name predict-yes +)
  14120. (S1 ^operator O1997 +)
  14121. Firing propose*predict-no
  14122. -->
  14123. (O1998 ^name predict-no +)
  14124. (S1 ^operator O1998 +)
  14125. Firing rl*prefer*rvt*predict-no*H0*2
  14126. -->
  14127. (S1 ^operator O1996 = 0.2550133828092577)
  14128. Firing rl*prefer*rvt*predict-yes*H0*1
  14129. -->
  14130. (S1 ^operator O1995 = 0.5231200249393807)
  14131. Firing prefer*rvt*predict-yes*H0
  14132. -->
  14133. Firing prefer*rvt*predict-no*H0
  14134. -->
  14135. Firing elaborate*copy-dir-to-output-link
  14136. -->
  14137. (I3 ^dir L +)
  14138. inner elaboration loop at bottom goal.
  14139. Retracting elaborate*copy-see-to-output-link
  14140. -->
  14141. (I3 ^see 0 +)
  14142. Retracting propose*predict-no
  14143. -->
  14144. (O1996 ^name predict-no +)
  14145. (S1 ^operator O1996 +)
  14146. Retracting propose*predict-yes
  14147. -->
  14148. (O1995 ^name predict-yes +)
  14149. (S1 ^operator O1995 +)
  14150. Retracting elaborate*reward*based*on*reward
  14151. -->
  14152. (R1001 ^value 1 +)
  14153. (R1 ^reward R1001 +)
  14154. Retracting elaborate*copy-dir-to-output-link
  14155. -->
  14156. (I3 ^dir U +)
  14157. Retracting rl*prefer*rvt*predict-no*H0*6
  14158. -->
  14159. (S1 ^operator O1996 = 0.9999999999999999)
  14160. Retracting rl*prefer*rvt*predict-yes*H0*5
  14161. -->
  14162. (S1 ^operator O1995 = 0.)
  14163. =>WM: (14035: S1 ^operator O1998 +)
  14164. =>WM: (14034: S1 ^operator O1997 +)
  14165. =>WM: (14033: I3 ^dir L)
  14166. =>WM: (14032: O1998 ^name predict-no)
  14167. =>WM: (14031: O1997 ^name predict-yes)
  14168. =>WM: (14030: R1002 ^value 1)
  14169. =>WM: (14029: R1 ^reward R1002)
  14170. <=WM: (14020: S1 ^operator O1995 +)
  14171. <=WM: (14021: S1 ^operator O1996 +)
  14172. <=WM: (14022: S1 ^operator O1996)
  14173. <=WM: (14005: I3 ^dir U)
  14174. <=WM: (14016: R1 ^reward R1001)
  14175. <=WM: (14019: O1996 ^name predict-no)
  14176. <=WM: (14018: O1995 ^name predict-yes)
  14177. <=WM: (14017: R1001 ^value 1)
  14178. --- Inner Elaboration Phase, active level 1 (S1) ---
  14179. Firing prefer*rvt*predict-yes*H0
  14180. -->
  14181. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  14182. -->
  14183. (S1 ^operator O1997 = 0.1693592933936033)
  14184. Firing rl*prefer*rvt*predict-yes*H0*1
  14185. -->
  14186. (S1 ^operator O1997 = 0.5231200249393807)
  14187. Firing prefer*rvt*predict-yes*H0*1*H1
  14188. -->
  14189. Firing prefer*rvt*predict-no*H0
  14190. -->
  14191. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  14192. -->
  14193. (S1 ^operator O1998 = 0.7449864376794202)
  14194. Firing rl*prefer*rvt*predict-no*H0*2
  14195. -->
  14196. (S1 ^operator O1998 = 0.2550133828092577)
  14197. Firing prefer*rvt*predict-no*H0*2*H1
  14198. -->
  14199. inner elaboration loop at bottom goal.
  14200. Retracting rl*prefer*rvt*predict-no*H0*2
  14201. -->
  14202. (S1 ^operator O1996 = 0.2550133828092577)
  14203. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  14204. -->
  14205. (S1 ^operator O1996 = 0.7449864376794202)
  14206. Retracting rl*prefer*rvt*predict-yes*H0*1
  14207. -->
  14208. (S1 ^operator O1995 = 0.5231200249393807)
  14209. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  14210. -->
  14211. (S1 ^operator O1995 = 0.1693592933936033)
  14212. --- END Proposal Phase ---
  14213. --- Decision Phase ---
  14214. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14215. =>WM: (14036: S1 ^operator O1998)
  14216. 999: O: O1998 (predict-no)
  14217. --- END Decision Phase ---
  14218. --- Application Phase ---
  14219. --- Firing Productions (PE) For State At Depth 1 ---
  14220. --- Inner Elaboration Phase, active level 1 (S1) ---
  14221. Firing apply*operator
  14222. -->
  14223. (I3 ^predict-no N999 + :O )
  14224. Firing apply*operator*complete
  14225. -->
  14226. (I3 ^predict-no N998 - :O )
  14227. inner elaboration loop at bottom goal.
  14228. --- Change Working Memory (PE) ---
  14229. =>WM: (14037: I3 ^predict-no N999)
  14230. <=WM: (14024: N998 ^status complete)
  14231. <=WM: (14023: I3 ^predict-no N998)
  14232. --- Firing Productions (IE) For State At Depth 1 ---
  14233. --- Inner Elaboration Phase, active level 1 (S1) ---
  14234. Firing monitor*world
  14235. -->
  14236. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14237. --- Change Working Memory (IE) ---
  14238. --- END Application Phase ---
  14239. --- Output Phase ---
  14240. ENV: Agent did: predict-no for direction L in state State-A
  14241. In State-A moving L
  14242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14243. predict error 0
  14244. dir: dir isL
  14245. --- END Output Phase ---
  14246. --- Input Phase ---
  14247. =>WM: (14041: I2 ^dir L)
  14248. =>WM: (14040: I2 ^reward 1)
  14249. =>WM: (14039: I2 ^see 0)
  14250. =>WM: (14038: N999 ^status complete)
  14251. <=WM: (14027: I2 ^dir L)
  14252. <=WM: (14026: I2 ^reward 1)
  14253. <=WM: (14025: I2 ^see 0)
  14254. =>WM: (14042: I2 ^level-1 L0-root)
  14255. <=WM: (14028: I2 ^level-1 L1-root)
  14256. --- END Input Phase ---
  14257. --- Proposal Phase ---
  14258. --- Inner Elaboration Phase, active level 1 (S1) ---
  14259. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14260. -->
  14261. (S1 ^operator O1997 = 0.3)
  14262. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14263. -->
  14264. (S1 ^operator O1998 = 0.7449867384410525)
  14265. Firing prefer*rvt*predict-no*H0*2*H1
  14266. -->
  14267. Firing prefer*rvt*predict-yes*H0*1*H1
  14268. -->
  14269. Firing elaborate*copy-see-to-output-link
  14270. -->
  14271. (I3 ^see 0 +)
  14272. Firing elaborate*reward*based*on*reward
  14273. -->
  14274. (R1003 ^value 1 +)
  14275. (R1 ^reward R1003 +)
  14276. Firing propose*predict-yes
  14277. -->
  14278. (O1999 ^name predict-yes +)
  14279. (S1 ^operator O1999 +)
  14280. Firing propose*predict-no
  14281. -->
  14282. (O2000 ^name predict-no +)
  14283. (S1 ^operator O2000 +)
  14284. Firing rl*prefer*rvt*predict-no*H0*2
  14285. -->
  14286. (S1 ^operator O1998 = 0.2550133828092577)
  14287. Firing rl*prefer*rvt*predict-yes*H0*1
  14288. -->
  14289. (S1 ^operator O1997 = 0.5231200249393807)
  14290. Firing prefer*rvt*predict-yes*H0
  14291. -->
  14292. Firing prefer*rvt*predict-no*H0
  14293. -->
  14294. Firing elaborate*copy-dir-to-output-link
  14295. -->
  14296. (I3 ^dir L +)
  14297. inner elaboration loop at bottom goal.
  14298. Retracting elaborate*copy-see-to-output-link
  14299. -->
  14300. (I3 ^see 0 +)
  14301. Retracting propose*predict-no
  14302. -->
  14303. (O1998 ^name predict-no +)
  14304. (S1 ^operator O1998 +)
  14305. Retracting propose*predict-yes
  14306. -->
  14307. (O1997 ^name predict-yes +)
  14308. (S1 ^operator O1997 +)
  14309. Retracting elaborate*reward*based*on*reward
  14310. -->
  14311. (R1002 ^value 1 +)
  14312. (R1 ^reward R1002 +)
  14313. Retracting elaborate*copy-dir-to-output-link
  14314. -->
  14315. (I3 ^dir L +)
  14316. Retracting rl*prefer*rvt*predict-no*H0*2
  14317. -->
  14318. (S1 ^operator O1998 = 0.2550133828092577)
  14319. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  14320. -->
  14321. (S1 ^operator O1998 = 0.7449864376794202)
  14322. Retracting rl*prefer*rvt*predict-yes*H0*1
  14323. -->
  14324. (S1 ^operator O1997 = 0.5231200249393807)
  14325. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  14326. -->
  14327. (S1 ^operator O1997 = 0.1693592933936033)
  14328. =>WM: (14048: S1 ^operator O2000 +)
  14329. =>WM: (14047: S1 ^operator O1999 +)
  14330. =>WM: (14046: O2000 ^name predict-no)
  14331. =>WM: (14045: O1999 ^name predict-yes)
  14332. =>WM: (14044: R1003 ^value 1)
  14333. =>WM: (14043: R1 ^reward R1003)
  14334. <=WM: (14034: S1 ^operator O1997 +)
  14335. <=WM: (14035: S1 ^operator O1998 +)
  14336. <=WM: (14036: S1 ^operator O1998)
  14337. <=WM: (14029: R1 ^reward R1002)
  14338. <=WM: (14032: O1998 ^name predict-no)
  14339. <=WM: (14031: O1997 ^name predict-yes)
  14340. <=WM: (14030: R1002 ^value 1)
  14341. --- Inner Elaboration Phase, active level 1 (S1) ---
  14342. Firing prefer*rvt*predict-yes*H0
  14343. -->
  14344. Firing rl*prefer*rvt*predict-yes*H0*1
  14345. -->
  14346. (S1 ^operator O1999 = 0.5231200249393807)
  14347. Firing prefer*rvt*predict-yes*H0*1*H1
  14348. -->
  14349. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14350. -->
  14351. (S1 ^operator O1999 = 0.3)
  14352. Firing prefer*rvt*predict-no*H0
  14353. -->
  14354. Firing rl*prefer*rvt*predict-no*H0*2
  14355. -->
  14356. (S1 ^operator O2000 = 0.2550133828092577)
  14357. Firing prefer*rvt*predict-no*H0*2*H1
  14358. -->
  14359. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14360. -->
  14361. (S1 ^operator O2000 = 0.7449867384410525)
  14362. inner elaboration loop at bottom goal.
  14363. Retracting rl*prefer*rvt*predict-no*H0*2
  14364. -->
  14365. (S1 ^operator O1998 = 0.2550133828092577)
  14366. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14367. -->
  14368. (S1 ^operator O1998 = 0.7449867384410525)
  14369. Retracting rl*prefer*rvt*predict-yes*H0*1
  14370. -->
  14371. (S1 ^operator O1997 = 0.5231200249393807)
  14372. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14373. -->
  14374. (S1 ^operator O1997 = 0.3)
  14375. --- END Proposal Phase ---
  14376. --- Decision Phase ---
  14377. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.917526,0.0760643)
  14378. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  14379. =>WM: (14049: S1 ^operator O2000)
  14380. 1000: O: O2000 (predict-no)
  14381. --- END Decision Phase ---
  14382. --- Application Phase ---
  14383. --- Firing Productions (PE) For State At Depth 1 ---
  14384. --- Inner Elaboration Phase, active level 1 (S1) ---
  14385. Firing apply*operator
  14386. -->
  14387. (I3 ^predict-no N1000 + :O )
  14388. Firing apply*operator*complete
  14389. -->
  14390. (I3 ^predict-no N999 - :O )
  14391. inner elaboration loop at bottom goal.
  14392. --- Change Working Memory (PE) ---
  14393. =>WM: (14050: I3 ^predict-no N1000)
  14394. <=WM: (14038: N999 ^status complete)
  14395. <=WM: (14037: I3 ^predict-no N999)
  14396. --- Firing Productions (IE) For State At Depth 1 ---
  14397. --- Inner Elaboration Phase, active level 1 (S1) ---
  14398. Firing monitor*world
  14399. -->
  14400. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14401. --- Change Working Memory (IE) ---
  14402. --- END Application Phase ---
  14403. --- Output Phase ---
  14404. ENV: Agent did: predict-no for direction L in state State-A
  14405. In State-A moving L
  14406. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14407. predict error 0
  14408. dir: dir isL
  14409. --- END Output Phase ---
  14410. --- Input Phase ---
  14411. =>WM: (14054: I2 ^dir L)
  14412. =>WM: (14053: I2 ^reward 1)
  14413. =>WM: (14052: I2 ^see 0)
  14414. =>WM: (14051: N1000 ^status complete)
  14415. <=WM: (14041: I2 ^dir L)
  14416. <=WM: (14040: I2 ^reward 1)
  14417. <=WM: (14039: I2 ^see 0)
  14418. =>WM: (14055: I2 ^level-1 L0-root)
  14419. <=WM: (14042: I2 ^level-1 L0-root)
  14420. --- END Input Phase ---
  14421. --- Proposal Phase ---
  14422. --- Inner Elaboration Phase, active level 1 (S1) ---
  14423. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14424. -->
  14425. (S1 ^operator O1999 = 0.3)
  14426. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14427. -->
  14428. (S1 ^operator O2000 = 0.7449867384410525)
  14429. Firing prefer*rvt*predict-no*H0*2*H1
  14430. -->
  14431. Firing prefer*rvt*predict-yes*H0*1*H1
  14432. -->
  14433. Firing elaborate*copy-see-to-output-link
  14434. -->
  14435. (I3 ^see 0 +)
  14436. Firing elaborate*reward*based*on*reward
  14437. -->
  14438. (R1004 ^value 1 +)
  14439. (R1 ^reward R1004 +)
  14440. Firing propose*predict-yes
  14441. -->
  14442. (O2001 ^name predict-yes +)
  14443. (S1 ^operator O2001 +)
  14444. Firing propose*predict-no
  14445. -->
  14446. (O2002 ^name predict-no +)
  14447. (S1 ^operator O2002 +)
  14448. Firing rl*prefer*rvt*predict-no*H0*2
  14449. -->
  14450. (S1 ^operator O2000 = 0.255013409735956)
  14451. Firing rl*prefer*rvt*predict-yes*H0*1
  14452. -->
  14453. (S1 ^operator O1999 = 0.5231200249393807)
  14454. Firing prefer*rvt*predict-yes*H0
  14455. -->
  14456. Firing prefer*rvt*predict-no*H0
  14457. -->
  14458. Firing elaborate*copy-dir-to-output-link
  14459. -->
  14460. (I3 ^dir L +)
  14461. inner elaboration loop at bottom goal.
  14462. Retracting elaborate*copy-see-to-output-link
  14463. -->
  14464. (I3 ^see 0 +)
  14465. Retracting propose*predict-no
  14466. -->
  14467. (O2000 ^name predict-no +)
  14468. (S1 ^operator O2000 +)
  14469. Retracting propose*predict-yes
  14470. -->
  14471. (O1999 ^name predict-yes +)
  14472. (S1 ^operator O1999 +)
  14473. Retracting elaborate*reward*based*on*reward
  14474. -->
  14475. (R1003 ^value 1 +)
  14476. (R1 ^reward R1003 +)
  14477. Retracting elaborate*copy-dir-to-output-link
  14478. -->
  14479. (I3 ^dir L +)
  14480. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14481. -->
  14482. (S1 ^operator O2000 = 0.7449867384410525)
  14483. Retracting rl*prefer*rvt*predict-no*H0*2
  14484. -->
  14485. (S1 ^operator O2000 = 0.255013409735956)
  14486. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14487. -->
  14488. (S1 ^operator O1999 = 0.3)
  14489. Retracting rl*prefer*rvt*predict-yes*H0*1
  14490. -->
  14491. (S1 ^operator O1999 = 0.5231200249393807)
  14492. =>WM: (14061: S1 ^operator O2002 +)
  14493. =>WM: (14060: S1 ^operator O2001 +)
  14494. =>WM: (14059: O2002 ^name predict-no)
  14495. =>WM: (14058: O2001 ^name predict-yes)
  14496. =>WM: (14057: R1004 ^value 1)
  14497. =>WM: (14056: R1 ^reward R1004)
  14498. <=WM: (14047: S1 ^operator O1999 +)
  14499. <=WM: (14048: S1 ^operator O2000 +)
  14500. <=WM: (14049: S1 ^operator O2000)
  14501. <=WM: (14043: R1 ^reward R1003)
  14502. <=WM: (14046: O2000 ^name predict-no)
  14503. <=WM: (14045: O1999 ^name predict-yes)
  14504. <=WM: (14044: R1003 ^value 1)
  14505. --- Inner Elaboration Phase, active level 1 (S1) ---
  14506. Firing prefer*rvt*predict-yes*H0
  14507. -->
  14508. Firing rl*prefer*rvt*predict-yes*H0*1
  14509. -->
  14510. (S1 ^operator O2001 = 0.5231200249393807)
  14511. Firing prefer*rvt*predict-yes*H0*1*H1
  14512. -->
  14513. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14514. -->
  14515. (S1 ^operator O2001 = 0.3)
  14516. Firing prefer*rvt*predict-no*H0
  14517. -->
  14518. Firing rl*prefer*rvt*predict-no*H0*2
  14519. -->
  14520. (S1 ^operator O2002 = 0.255013409735956)
  14521. Firing prefer*rvt*predict-no*H0*2*H1
  14522. -->
  14523. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14524. -->
  14525. (S1 ^operator O2002 = 0.7449867384410525)
  14526. inner elaboration loop at bottom goal.
  14527. Retracting rl*prefer*rvt*predict-no*H0*2
  14528. -->
  14529. (S1 ^operator O2000 = 0.255013409735956)
  14530. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14531. -->
  14532. (S1 ^operator O2000 = 0.7449867384410525)
  14533. Retracting rl*prefer*rvt*predict-yes*H0*1
  14534. -->
  14535. (S1 ^operator O1999 = 0.5231200249393807)
  14536. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14537. -->
  14538. (S1 ^operator O1999 = 0.3)
  14539. --- END Proposal Phase ---
  14540. --- Decision Phase ---
  14541. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.917949,0.0757071)
  14542. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  14543. =>WM: (14062: S1 ^operator O2002)
  14544. 1001: O: O2002 (predict-no)
  14545. --- END Decision Phase ---
  14546. --- Application Phase ---
  14547. --- Firing Productions (PE) For State At Depth 1 ---
  14548. --- Inner Elaboration Phase, active level 1 (S1) ---
  14549. Firing apply*operator
  14550. -->
  14551. (I3 ^predict-no N1001 + :O )
  14552. Firing apply*operator*complete
  14553. -->
  14554. (I3 ^predict-no N1000 - :O )
  14555. inner elaboration loop at bottom goal.
  14556. --- Change Working Memory (PE) ---
  14557. =>WM: (14063: I3 ^predict-no N1001)
  14558. <=WM: (14051: N1000 ^status complete)
  14559. <=WM: (14050: I3 ^predict-no N1000)
  14560. --- Firing Productions (IE) For State At Depth 1 ---
  14561. --- Inner Elaboration Phase, active level 1 (S1) ---
  14562. Firing monitor*world
  14563. -->
  14564. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14565. --- Change Working Memory (IE) ---
  14566. --- END Application Phase ---
  14567. --- Output Phase ---
  14568. ENV: Agent did: predict-no for direction L in state State-A
  14569. In State-A moving L
  14570. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14571. predict error 0
  14572. dir: dir isL
  14573. --- END Output Phase ---
  14574. --- Input Phase ---
  14575. =>WM: (14067: I2 ^dir L)
  14576. =>WM: (14066: I2 ^reward 1)
  14577. =>WM: (14065: I2 ^see 0)
  14578. =>WM: (14064: N1001 ^status complete)
  14579. <=WM: (14054: I2 ^dir L)
  14580. <=WM: (14053: I2 ^reward 1)
  14581. <=WM: (14052: I2 ^see 0)
  14582. =>WM: (14068: I2 ^level-1 L0-root)
  14583. <=WM: (14055: I2 ^level-1 L0-root)
  14584. --- END Input Phase ---
  14585. --- Proposal Phase ---
  14586. --- Inner Elaboration Phase, active level 1 (S1) ---
  14587. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14588. -->
  14589. (S1 ^operator O2001 = 0.3)
  14590. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14591. -->
  14592. (S1 ^operator O2002 = 0.7449867162145012)
  14593. Firing prefer*rvt*predict-no*H0*2*H1
  14594. -->
  14595. Firing prefer*rvt*predict-yes*H0*1*H1
  14596. -->
  14597. Firing elaborate*copy-see-to-output-link
  14598. -->
  14599. (I3 ^see 0 +)
  14600. Firing elaborate*reward*based*on*reward
  14601. -->
  14602. (R1005 ^value 1 +)
  14603. (R1 ^reward R1005 +)
  14604. Firing propose*predict-yes
  14605. -->
  14606. (O2003 ^name predict-yes +)
  14607. (S1 ^operator O2003 +)
  14608. Firing propose*predict-no
  14609. -->
  14610. (O2004 ^name predict-no +)
  14611. (S1 ^operator O2004 +)
  14612. Firing rl*prefer*rvt*predict-no*H0*2
  14613. -->
  14614. (S1 ^operator O2002 = 0.2550133875094047)
  14615. Firing rl*prefer*rvt*predict-yes*H0*1
  14616. -->
  14617. (S1 ^operator O2001 = 0.5231200249393807)
  14618. Firing prefer*rvt*predict-yes*H0
  14619. -->
  14620. Firing prefer*rvt*predict-no*H0
  14621. -->
  14622. Firing elaborate*copy-dir-to-output-link
  14623. -->
  14624. (I3 ^dir L +)
  14625. inner elaboration loop at bottom goal.
  14626. Retracting elaborate*copy-see-to-output-link
  14627. -->
  14628. (I3 ^see 0 +)
  14629. Retracting propose*predict-no
  14630. -->
  14631. (O2002 ^name predict-no +)
  14632. (S1 ^operator O2002 +)
  14633. Retracting propose*predict-yes
  14634. -->
  14635. (O2001 ^name predict-yes +)
  14636. (S1 ^operator O2001 +)
  14637. Retracting elaborate*reward*based*on*reward
  14638. -->
  14639. (R1004 ^value 1 +)
  14640. (R1 ^reward R1004 +)
  14641. Retracting elaborate*copy-dir-to-output-link
  14642. -->
  14643. (I3 ^dir L +)
  14644. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14645. -->
  14646. (S1 ^operator O2002 = 0.7449867162145012)
  14647. Retracting rl*prefer*rvt*predict-no*H0*2
  14648. -->
  14649. (S1 ^operator O2002 = 0.2550133875094047)
  14650. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14651. -->
  14652. (S1 ^operator O2001 = 0.3)
  14653. Retracting rl*prefer*rvt*predict-yes*H0*1
  14654. -->
  14655. (S1 ^operator O2001 = 0.5231200249393807)
  14656. =>WM: (14074: S1 ^operator O2004 +)
  14657. =>WM: (14073: S1 ^operator O2003 +)
  14658. =>WM: (14072: O2004 ^name predict-no)
  14659. =>WM: (14071: O2003 ^name predict-yes)
  14660. =>WM: (14070: R1005 ^value 1)
  14661. =>WM: (14069: R1 ^reward R1005)
  14662. <=WM: (14060: S1 ^operator O2001 +)
  14663. <=WM: (14061: S1 ^operator O2002 +)
  14664. <=WM: (14062: S1 ^operator O2002)
  14665. <=WM: (14056: R1 ^reward R1004)
  14666. <=WM: (14059: O2002 ^name predict-no)
  14667. <=WM: (14058: O2001 ^name predict-yes)
  14668. <=WM: (14057: R1004 ^value 1)
  14669. --- Inner Elaboration Phase, active level 1 (S1) ---
  14670. Firing prefer*rvt*predict-yes*H0
  14671. -->
  14672. Firing rl*prefer*rvt*predict-yes*H0*1
  14673. -->
  14674. (S1 ^operator O2003 = 0.5231200249393807)
  14675. Firing prefer*rvt*predict-yes*H0*1*H1
  14676. -->
  14677. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  14678. -->
  14679. (S1 ^operator O2003 = 0.3)
  14680. Firing prefer*rvt*predict-no*H0
  14681. -->
  14682. Firing rl*prefer*rvt*predict-no*H0*2
  14683. -->
  14684. (S1 ^operator O2004 = 0.2550133875094047)
  14685. Firing prefer*rvt*predict-no*H0*2*H1
  14686. -->
  14687. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  14688. -->
  14689. (S1 ^operator O2004 = 0.7449867162145012)
  14690. inner elaboration loop at bottom goal.
  14691. Retracting rl*prefer*rvt*predict-no*H0*2
  14692. -->
  14693. (S1 ^operator O2002 = 0.2550133875094047)
  14694. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14695. -->
  14696. (S1 ^operator O2002 = 0.7449867162145012)
  14697. Retracting rl*prefer*rvt*predict-yes*H0*1
  14698. -->
  14699. (S1 ^operator O2001 = 0.5231200249393807)
  14700. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14701. -->
  14702. (S1 ^operator O2001 = 0.3)
  14703. --- END Proposal Phase ---
  14704. --- Decision Phase ---
  14705. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.918367,0.0753532)
  14706. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  14707. =>WM: (14075: S1 ^operator O2004)
  14708. 1002: O: O2004 (predict-no)
  14709. --- END Decision Phase ---
  14710. --- Application Phase ---
  14711. --- Firing Productions (PE) For State At Depth 1 ---
  14712. --- Inner Elaboration Phase, active level 1 (S1) ---
  14713. Firing apply*operator
  14714. -->
  14715. (I3 ^predict-no N1002 + :O )
  14716. Firing apply*operator*complete
  14717. -->
  14718. (I3 ^predict-no N1001 - :O )
  14719. inner elaboration loop at bottom goal.
  14720. --- Change Working Memory (PE) ---
  14721. =>WM: (14076: I3 ^predict-no N1002)
  14722. <=WM: (14064: N1001 ^status complete)
  14723. <=WM: (14063: I3 ^predict-no N1001)
  14724. --- Firing Productions (IE) For State At Depth 1 ---
  14725. --- Inner Elaboration Phase, active level 1 (S1) ---
  14726. Firing monitor*world
  14727. -->
  14728. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14729. --- Change Working Memory (IE) ---
  14730. --- END Application Phase ---
  14731. --- Output Phase ---
  14732. ENV: Agent did: predict-no for direction L in state State-A
  14733. In State-A moving L
  14734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14735. predict error 0
  14736. dir: dir isU
  14737. --- END Output Phase ---
  14738. --- Input Phase ---
  14739. =>WM: (14080: I2 ^dir U)
  14740. =>WM: (14079: I2 ^reward 1)
  14741. =>WM: (14078: I2 ^see 0)
  14742. =>WM: (14077: N1002 ^status complete)
  14743. <=WM: (14067: I2 ^dir L)
  14744. <=WM: (14066: I2 ^reward 1)
  14745. <=WM: (14065: I2 ^see 0)
  14746. =>WM: (14081: I2 ^level-1 L0-root)
  14747. <=WM: (14068: I2 ^level-1 L0-root)
  14748. --- END Input Phase ---
  14749. --- Proposal Phase ---
  14750. --- Inner Elaboration Phase, active level 1 (S1) ---
  14751. Firing elaborate*copy-see-to-output-link
  14752. -->
  14753. (I3 ^see 0 +)
  14754. Firing elaborate*reward*based*on*reward
  14755. -->
  14756. (R1006 ^value 1 +)
  14757. (R1 ^reward R1006 +)
  14758. Firing propose*predict-yes
  14759. -->
  14760. (O2005 ^name predict-yes +)
  14761. (S1 ^operator O2005 +)
  14762. Firing propose*predict-no
  14763. -->
  14764. (O2006 ^name predict-no +)
  14765. (S1 ^operator O2006 +)
  14766. Firing rl*prefer*rvt*predict-no*H0*6
  14767. -->
  14768. (S1 ^operator O2004 = 0.9999999999999999)
  14769. Firing rl*prefer*rvt*predict-yes*H0*5
  14770. -->
  14771. (S1 ^operator O2003 = 0.)
  14772. Firing prefer*rvt*predict-yes*H0
  14773. -->
  14774. Firing prefer*rvt*predict-no*H0
  14775. -->
  14776. Firing elaborate*copy-dir-to-output-link
  14777. -->
  14778. (I3 ^dir U +)
  14779. inner elaboration loop at bottom goal.
  14780. Retracting elaborate*copy-see-to-output-link
  14781. -->
  14782. (I3 ^see 0 +)
  14783. Retracting propose*predict-no
  14784. -->
  14785. (O2004 ^name predict-no +)
  14786. (S1 ^operator O2004 +)
  14787. Retracting propose*predict-yes
  14788. -->
  14789. (O2003 ^name predict-yes +)
  14790. (S1 ^operator O2003 +)
  14791. Retracting elaborate*reward*based*on*reward
  14792. -->
  14793. (R1005 ^value 1 +)
  14794. (R1 ^reward R1005 +)
  14795. Retracting elaborate*copy-dir-to-output-link
  14796. -->
  14797. (I3 ^dir L +)
  14798. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  14799. -->
  14800. (S1 ^operator O2004 = 0.7449867006559153)
  14801. Retracting rl*prefer*rvt*predict-no*H0*2
  14802. -->
  14803. (S1 ^operator O2004 = 0.2550133719508188)
  14804. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  14805. -->
  14806. (S1 ^operator O2003 = 0.3)
  14807. Retracting rl*prefer*rvt*predict-yes*H0*1
  14808. -->
  14809. (S1 ^operator O2003 = 0.5231200249393807)
  14810. =>WM: (14088: S1 ^operator O2006 +)
  14811. =>WM: (14087: S1 ^operator O2005 +)
  14812. =>WM: (14086: I3 ^dir U)
  14813. =>WM: (14085: O2006 ^name predict-no)
  14814. =>WM: (14084: O2005 ^name predict-yes)
  14815. =>WM: (14083: R1006 ^value 1)
  14816. =>WM: (14082: R1 ^reward R1006)
  14817. <=WM: (14073: S1 ^operator O2003 +)
  14818. <=WM: (14074: S1 ^operator O2004 +)
  14819. <=WM: (14075: S1 ^operator O2004)
  14820. <=WM: (14033: I3 ^dir L)
  14821. <=WM: (14069: R1 ^reward R1005)
  14822. <=WM: (14072: O2004 ^name predict-no)
  14823. <=WM: (14071: O2003 ^name predict-yes)
  14824. <=WM: (14070: R1005 ^value 1)
  14825. --- Inner Elaboration Phase, active level 1 (S1) ---
  14826. Firing prefer*rvt*predict-yes*H0
  14827. -->
  14828. Firing rl*prefer*rvt*predict-yes*H0*5
  14829. -->
  14830. (S1 ^operator O2005 = 0.)
  14831. Firing prefer*rvt*predict-no*H0
  14832. -->
  14833. Firing rl*prefer*rvt*predict-no*H0*6
  14834. -->
  14835. (S1 ^operator O2006 = 0.9999999999999999)
  14836. inner elaboration loop at bottom goal.
  14837. Retracting rl*prefer*rvt*predict-no*H0*6
  14838. -->
  14839. (S1 ^operator O2004 = 0.9999999999999999)
  14840. Retracting rl*prefer*rvt*predict-yes*H0*5
  14841. -->
  14842. (S1 ^operator O2003 = 0.)
  14843. --- END Proposal Phase ---
  14844. --- Decision Phase ---
  14845. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.918782,0.0750026)
  14846. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  14847. =>WM: (14089: S1 ^operator O2006)
  14848. 1003: O: O2006 (predict-no)
  14849. --- END Decision Phase ---
  14850. --- Application Phase ---
  14851. --- Firing Productions (PE) For State At Depth 1 ---
  14852. --- Inner Elaboration Phase, active level 1 (S1) ---
  14853. Firing apply*operator
  14854. -->
  14855. (I3 ^predict-no N1003 + :O )
  14856. Firing apply*operator*complete
  14857. -->
  14858. (I3 ^predict-no N1002 - :O )
  14859. inner elaboration loop at bottom goal.
  14860. --- Change Working Memory (PE) ---
  14861. =>WM: (14090: I3 ^predict-no N1003)
  14862. <=WM: (14077: N1002 ^status complete)
  14863. <=WM: (14076: I3 ^predict-no N1002)
  14864. --- Firing Productions (IE) For State At Depth 1 ---
  14865. --- Inner Elaboration Phase, active level 1 (S1) ---
  14866. Firing monitor*world
  14867. -->
  14868. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14869. --- Change Working Memory (IE) ---
  14870. --- END Application Phase ---
  14871. --- Output Phase ---
  14872. ENV: Agent did: predict-no for direction U in state State-A
  14873. In State-A moving U
  14874. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14875. predict error 0
  14876. dir: dir isR
  14877. --- END Output Phase ---
  14878. --- Input Phase ---
  14879. =>WM: (14094: I2 ^dir R)
  14880. =>WM: (14093: I2 ^reward 1)
  14881. =>WM: (14092: I2 ^see 0)
  14882. =>WM: (14091: N1003 ^status complete)
  14883. <=WM: (14080: I2 ^dir U)
  14884. <=WM: (14079: I2 ^reward 1)
  14885. <=WM: (14078: I2 ^see 0)
  14886. =>WM: (14095: I2 ^level-1 L0-root)
  14887. <=WM: (14081: I2 ^level-1 L0-root)
  14888. --- END Input Phase ---
  14889. --- Proposal Phase ---
  14890. --- Inner Elaboration Phase, active level 1 (S1) ---
  14891. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  14892. -->
  14893. (S1 ^operator O2005 = 0.617076227543635)
  14894. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  14895. -->
  14896. (S1 ^operator O2006 = 0.4910065094545203)
  14897. Firing prefer*rvt*predict-no*H0*4*H1
  14898. -->
  14899. Firing prefer*rvt*predict-yes*H0*3*H1
  14900. -->
  14901. Firing elaborate*copy-see-to-output-link
  14902. -->
  14903. (I3 ^see 0 +)
  14904. Firing elaborate*reward*based*on*reward
  14905. -->
  14906. (R1007 ^value 1 +)
  14907. (R1 ^reward R1007 +)
  14908. Firing propose*predict-yes
  14909. -->
  14910. (O2007 ^name predict-yes +)
  14911. (S1 ^operator O2007 +)
  14912. Firing propose*predict-no
  14913. -->
  14914. (O2008 ^name predict-no +)
  14915. (S1 ^operator O2008 +)
  14916. Firing rl*prefer*rvt*predict-no*H0*4
  14917. -->
  14918. (S1 ^operator O2006 = 0.1269768790760836)
  14919. Firing rl*prefer*rvt*predict-yes*H0*3
  14920. -->
  14921. (S1 ^operator O2005 = 0.3829355766477516)
  14922. Firing prefer*rvt*predict-yes*H0
  14923. -->
  14924. Firing prefer*rvt*predict-no*H0
  14925. -->
  14926. Firing elaborate*copy-dir-to-output-link
  14927. -->
  14928. (I3 ^dir R +)
  14929. inner elaboration loop at bottom goal.
  14930. Retracting elaborate*copy-see-to-output-link
  14931. -->
  14932. (I3 ^see 0 +)
  14933. Retracting propose*predict-no
  14934. -->
  14935. (O2006 ^name predict-no +)
  14936. (S1 ^operator O2006 +)
  14937. Retracting propose*predict-yes
  14938. -->
  14939. (O2005 ^name predict-yes +)
  14940. (S1 ^operator O2005 +)
  14941. Retracting elaborate*reward*based*on*reward
  14942. -->
  14943. (R1006 ^value 1 +)
  14944. (R1 ^reward R1006 +)
  14945. Retracting elaborate*copy-dir-to-output-link
  14946. -->
  14947. (I3 ^dir U +)
  14948. Retracting rl*prefer*rvt*predict-no*H0*6
  14949. -->
  14950. (S1 ^operator O2006 = 0.9999999999999999)
  14951. Retracting rl*prefer*rvt*predict-yes*H0*5
  14952. -->
  14953. (S1 ^operator O2005 = 0.)
  14954. =>WM: (14102: S1 ^operator O2008 +)
  14955. =>WM: (14101: S1 ^operator O2007 +)
  14956. =>WM: (14100: I3 ^dir R)
  14957. =>WM: (14099: O2008 ^name predict-no)
  14958. =>WM: (14098: O2007 ^name predict-yes)
  14959. =>WM: (14097: R1007 ^value 1)
  14960. =>WM: (14096: R1 ^reward R1007)
  14961. <=WM: (14087: S1 ^operator O2005 +)
  14962. <=WM: (14088: S1 ^operator O2006 +)
  14963. <=WM: (14089: S1 ^operator O2006)
  14964. <=WM: (14086: I3 ^dir U)
  14965. <=WM: (14082: R1 ^reward R1006)
  14966. <=WM: (14085: O2006 ^name predict-no)
  14967. <=WM: (14084: O2005 ^name predict-yes)
  14968. <=WM: (14083: R1006 ^value 1)
  14969. --- Inner Elaboration Phase, active level 1 (S1) ---
  14970. Firing prefer*rvt*predict-yes*H0
  14971. -->
  14972. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  14973. -->
  14974. (S1 ^operator O2007 = 0.617076227543635)
  14975. Firing rl*prefer*rvt*predict-yes*H0*3
  14976. -->
  14977. (S1 ^operator O2007 = 0.3829355766477516)
  14978. Firing prefer*rvt*predict-yes*H0*3*H1
  14979. -->
  14980. Firing prefer*rvt*predict-no*H0
  14981. -->
  14982. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  14983. -->
  14984. (S1 ^operator O2008 = 0.4910065094545203)
  14985. Firing rl*prefer*rvt*predict-no*H0*4
  14986. -->
  14987. (S1 ^operator O2008 = 0.1269768790760836)
  14988. Firing prefer*rvt*predict-no*H0*4*H1
  14989. -->
  14990. inner elaboration loop at bottom goal.
  14991. Retracting rl*prefer*rvt*predict-no*H0*4
  14992. -->
  14993. (S1 ^operator O2006 = 0.1269768790760836)
  14994. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  14995. -->
  14996. (S1 ^operator O2006 = 0.4910065094545203)
  14997. Retracting rl*prefer*rvt*predict-yes*H0*3
  14998. -->
  14999. (S1 ^operator O2005 = 0.3829355766477516)
  15000. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  15001. -->
  15002. (S1 ^operator O2005 = 0.617076227543635)
  15003. --- END Proposal Phase ---
  15004. --- Decision Phase ---
  15005. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15006. =>WM: (14103: S1 ^operator O2007)
  15007. 1004: O: O2007 (predict-yes)
  15008. --- END Decision Phase ---
  15009. --- Application Phase ---
  15010. --- Firing Productions (PE) For State At Depth 1 ---
  15011. --- Inner Elaboration Phase, active level 1 (S1) ---
  15012. Firing apply*operator
  15013. -->
  15014. (I3 ^predict-yes N1004 + :O )
  15015. Firing apply*operator*complete
  15016. -->
  15017. (I3 ^predict-no N1003 - :O )
  15018. inner elaboration loop at bottom goal.
  15019. --- Change Working Memory (PE) ---
  15020. =>WM: (14104: I3 ^predict-yes N1004)
  15021. <=WM: (14091: N1003 ^status complete)
  15022. <=WM: (14090: I3 ^predict-no N1003)
  15023. --- Firing Productions (IE) For State At Depth 1 ---
  15024. --- Inner Elaboration Phase, active level 1 (S1) ---
  15025. Firing monitor*world
  15026. -->
  15027. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15028. --- Change Working Memory (IE) ---
  15029. --- END Application Phase ---
  15030. --- Output Phase ---
  15031. ENV: Agent did: predict-yes for direction R in state State-A
  15032. In State-A moving R
  15033. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15034. predict error 0
  15035. dir: dir isU
  15036. --- END Output Phase ---
  15037. --- Input Phase ---
  15038. =>WM: (14108: I2 ^dir U)
  15039. =>WM: (14107: I2 ^reward 1)
  15040. =>WM: (14106: I2 ^see 1)
  15041. =>WM: (14105: N1004 ^status complete)
  15042. <=WM: (14094: I2 ^dir R)
  15043. <=WM: (14093: I2 ^reward 1)
  15044. <=WM: (14092: I2 ^see 0)
  15045. =>WM: (14109: I2 ^level-1 R1-root)
  15046. <=WM: (14095: I2 ^level-1 L0-root)
  15047. --- END Input Phase ---
  15048. --- Proposal Phase ---
  15049. --- Inner Elaboration Phase, active level 1 (S1) ---
  15050. Firing elaborate*copy-see-to-output-link
  15051. -->
  15052. (I3 ^see 1 +)
  15053. Firing elaborate*reward*based*on*reward
  15054. -->
  15055. (R1008 ^value 1 +)
  15056. (R1 ^reward R1008 +)
  15057. Firing propose*predict-yes
  15058. -->
  15059. (O2009 ^name predict-yes +)
  15060. (S1 ^operator O2009 +)
  15061. Firing propose*predict-no
  15062. -->
  15063. (O2010 ^name predict-no +)
  15064. (S1 ^operator O2010 +)
  15065. Firing rl*prefer*rvt*predict-no*H0*6
  15066. -->
  15067. (S1 ^operator O2008 = 0.9999999999999999)
  15068. Firing rl*prefer*rvt*predict-yes*H0*5
  15069. -->
  15070. (S1 ^operator O2007 = 0.)
  15071. Firing prefer*rvt*predict-yes*H0
  15072. -->
  15073. Firing prefer*rvt*predict-no*H0
  15074. -->
  15075. Firing elaborate*copy-dir-to-output-link
  15076. -->
  15077. (I3 ^dir U +)
  15078. inner elaboration loop at bottom goal.
  15079. Retracting elaborate*copy-see-to-output-link
  15080. -->
  15081. (I3 ^see 0 +)
  15082. Retracting propose*predict-no
  15083. -->
  15084. (O2008 ^name predict-no +)
  15085. (S1 ^operator O2008 +)
  15086. Retracting propose*predict-yes
  15087. -->
  15088. (O2007 ^name predict-yes +)
  15089. (S1 ^operator O2007 +)
  15090. Retracting elaborate*reward*based*on*reward
  15091. -->
  15092. (R1007 ^value 1 +)
  15093. (R1 ^reward R1007 +)
  15094. Retracting elaborate*copy-dir-to-output-link
  15095. -->
  15096. (I3 ^dir R +)
  15097. Retracting rl*prefer*rvt*predict-no*H0*4
  15098. -->
  15099. (S1 ^operator O2008 = 0.1269768790760836)
  15100. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  15101. -->
  15102. (S1 ^operator O2008 = 0.4910065094545203)
  15103. Retracting rl*prefer*rvt*predict-yes*H0*3
  15104. -->
  15105. (S1 ^operator O2007 = 0.3829355766477516)
  15106. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  15107. -->
  15108. (S1 ^operator O2007 = 0.617076227543635)
  15109. =>WM: (14117: S1 ^operator O2010 +)
  15110. =>WM: (14116: S1 ^operator O2009 +)
  15111. =>WM: (14115: I3 ^dir U)
  15112. =>WM: (14114: O2010 ^name predict-no)
  15113. =>WM: (14113: O2009 ^name predict-yes)
  15114. =>WM: (14112: R1008 ^value 1)
  15115. =>WM: (14111: R1 ^reward R1008)
  15116. =>WM: (14110: I3 ^see 1)
  15117. <=WM: (14101: S1 ^operator O2007 +)
  15118. <=WM: (14103: S1 ^operator O2007)
  15119. <=WM: (14102: S1 ^operator O2008 +)
  15120. <=WM: (14100: I3 ^dir R)
  15121. <=WM: (14096: R1 ^reward R1007)
  15122. <=WM: (14015: I3 ^see 0)
  15123. <=WM: (14099: O2008 ^name predict-no)
  15124. <=WM: (14098: O2007 ^name predict-yes)
  15125. <=WM: (14097: R1007 ^value 1)
  15126. --- Inner Elaboration Phase, active level 1 (S1) ---
  15127. Firing prefer*rvt*predict-yes*H0
  15128. -->
  15129. Firing rl*prefer*rvt*predict-yes*H0*5
  15130. -->
  15131. (S1 ^operator O2009 = 0.)
  15132. Firing prefer*rvt*predict-no*H0
  15133. -->
  15134. Firing rl*prefer*rvt*predict-no*H0*6
  15135. -->
  15136. (S1 ^operator O2010 = 0.9999999999999999)
  15137. inner elaboration loop at bottom goal.
  15138. Retracting rl*prefer*rvt*predict-no*H0*6
  15139. -->
  15140. (S1 ^operator O2008 = 0.9999999999999999)
  15141. Retracting rl*prefer*rvt*predict-yes*H0*5
  15142. -->
  15143. (S1 ^operator O2007 = 0.)
  15144. --- END Proposal Phase ---
  15145. --- Decision Phase ---
  15146. RL update rl*prefer*rvt*predict-yes*H0*3 0.673129 -0.290194 0.382936 -> 0.673128 -0.290194 0.382934(R,m,v=1,0.960784,0.0379257)
  15147. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326882 0.290195 0.617076 -> 0.32688 0.290194 0.617074(R,m,v=1,1,0)
  15148. =>WM: (14118: S1 ^operator O2010)
  15149. 1005: O: O2010 (predict-no)
  15150. --- END Decision Phase ---
  15151. --- Application Phase ---
  15152. --- Firing Productions (PE) For State At Depth 1 ---
  15153. --- Inner Elaboration Phase, active level 1 (S1) ---
  15154. Firing apply*operator
  15155. -->
  15156. (I3 ^predict-no N1005 + :O )
  15157. Firing apply*operator*complete
  15158. -->
  15159. (I3 ^predict-yes N1004 - :O )
  15160. inner elaboration loop at bottom goal.
  15161. --- Change Working Memory (PE) ---
  15162. =>WM: (14119: I3 ^predict-no N1005)
  15163. <=WM: (14105: N1004 ^status complete)
  15164. <=WM: (14104: I3 ^predict-yes N1004)
  15165. --- Firing Productions (IE) For State At Depth 1 ---
  15166. --- Inner Elaboration Phase, active level 1 (S1) ---
  15167. Firing monitor*world
  15168. -->
  15169. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15170. --- Change Working Memory (IE) ---
  15171. --- END Application Phase ---
  15172. --- Output Phase ---
  15173. ENV: Agent did: predict-no for direction U in state State-B
  15174. In State-B moving U
  15175. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15176. predict error 0
  15177. dir: dir isL
  15178. --- END Output Phase ---
  15179. --- Input Phase ---
  15180. =>WM: (14123: I2 ^dir L)
  15181. =>WM: (14122: I2 ^reward 1)
  15182. =>WM: (14121: I2 ^see 0)
  15183. =>WM: (14120: N1005 ^status complete)
  15184. <=WM: (14108: I2 ^dir U)
  15185. <=WM: (14107: I2 ^reward 1)
  15186. <=WM: (14106: I2 ^see 1)
  15187. =>WM: (14124: I2 ^level-1 R1-root)
  15188. <=WM: (14109: I2 ^level-1 R1-root)
  15189. --- END Input Phase ---
  15190. --- Proposal Phase ---
  15191. --- Inner Elaboration Phase, active level 1 (S1) ---
  15192. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  15193. -->
  15194. (S1 ^operator O2009 = 0.4768779238990463)
  15195. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  15196. -->
  15197. (S1 ^operator O2010 = -0.01194930198035649)
  15198. Firing prefer*rvt*predict-no*H0*2*H1
  15199. -->
  15200. Firing prefer*rvt*predict-yes*H0*1*H1
  15201. -->
  15202. Firing elaborate*copy-see-to-output-link
  15203. -->
  15204. (I3 ^see 0 +)
  15205. Firing elaborate*reward*based*on*reward
  15206. -->
  15207. (R1009 ^value 1 +)
  15208. (R1 ^reward R1009 +)
  15209. Firing propose*predict-yes
  15210. -->
  15211. (O2011 ^name predict-yes +)
  15212. (S1 ^operator O2011 +)
  15213. Firing propose*predict-no
  15214. -->
  15215. (O2012 ^name predict-no +)
  15216. (S1 ^operator O2012 +)
  15217. Firing rl*prefer*rvt*predict-no*H0*2
  15218. -->
  15219. (S1 ^operator O2010 = 0.2550133610598087)
  15220. Firing rl*prefer*rvt*predict-yes*H0*1
  15221. -->
  15222. (S1 ^operator O2009 = 0.5231200249393807)
  15223. Firing prefer*rvt*predict-yes*H0
  15224. -->
  15225. Firing prefer*rvt*predict-no*H0
  15226. -->
  15227. Firing elaborate*copy-dir-to-output-link
  15228. -->
  15229. (I3 ^dir L +)
  15230. inner elaboration loop at bottom goal.
  15231. Retracting elaborate*copy-see-to-output-link
  15232. -->
  15233. (I3 ^see 1 +)
  15234. Retracting propose*predict-no
  15235. -->
  15236. (O2010 ^name predict-no +)
  15237. (S1 ^operator O2010 +)
  15238. Retracting propose*predict-yes
  15239. -->
  15240. (O2009 ^name predict-yes +)
  15241. (S1 ^operator O2009 +)
  15242. Retracting elaborate*reward*based*on*reward
  15243. -->
  15244. (R1008 ^value 1 +)
  15245. (R1 ^reward R1008 +)
  15246. Retracting elaborate*copy-dir-to-output-link
  15247. -->
  15248. (I3 ^dir U +)
  15249. Retracting rl*prefer*rvt*predict-no*H0*6
  15250. -->
  15251. (S1 ^operator O2010 = 0.9999999999999999)
  15252. Retracting rl*prefer*rvt*predict-yes*H0*5
  15253. -->
  15254. (S1 ^operator O2009 = 0.)
  15255. =>WM: (14132: S1 ^operator O2012 +)
  15256. =>WM: (14131: S1 ^operator O2011 +)
  15257. =>WM: (14130: I3 ^dir L)
  15258. =>WM: (14129: O2012 ^name predict-no)
  15259. =>WM: (14128: O2011 ^name predict-yes)
  15260. =>WM: (14127: R1009 ^value 1)
  15261. =>WM: (14126: R1 ^reward R1009)
  15262. =>WM: (14125: I3 ^see 0)
  15263. <=WM: (14116: S1 ^operator O2009 +)
  15264. <=WM: (14117: S1 ^operator O2010 +)
  15265. <=WM: (14118: S1 ^operator O2010)
  15266. <=WM: (14115: I3 ^dir U)
  15267. <=WM: (14111: R1 ^reward R1008)
  15268. <=WM: (14110: I3 ^see 1)
  15269. <=WM: (14114: O2010 ^name predict-no)
  15270. <=WM: (14113: O2009 ^name predict-yes)
  15271. <=WM: (14112: R1008 ^value 1)
  15272. --- Inner Elaboration Phase, active level 1 (S1) ---
  15273. Firing prefer*rvt*predict-yes*H0
  15274. -->
  15275. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  15276. -->
  15277. (S1 ^operator O2011 = 0.4768779238990463)
  15278. Firing rl*prefer*rvt*predict-yes*H0*1
  15279. -->
  15280. (S1 ^operator O2011 = 0.5231200249393807)
  15281. Firing prefer*rvt*predict-yes*H0*1*H1
  15282. -->
  15283. Firing prefer*rvt*predict-no*H0
  15284. -->
  15285. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  15286. -->
  15287. (S1 ^operator O2012 = -0.01194930198035649)
  15288. Firing rl*prefer*rvt*predict-no*H0*2
  15289. -->
  15290. (S1 ^operator O2012 = 0.2550133610598087)
  15291. Firing prefer*rvt*predict-no*H0*2*H1
  15292. -->
  15293. inner elaboration loop at bottom goal.
  15294. Retracting rl*prefer*rvt*predict-no*H0*2
  15295. -->
  15296. (S1 ^operator O2010 = 0.2550133610598087)
  15297. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  15298. -->
  15299. (S1 ^operator O2010 = -0.01194930198035649)
  15300. Retracting rl*prefer*rvt*predict-yes*H0*1
  15301. -->
  15302. (S1 ^operator O2009 = 0.5231200249393807)
  15303. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  15304. -->
  15305. (S1 ^operator O2009 = 0.4768779238990463)
  15306. --- END Proposal Phase ---
  15307. --- Decision Phase ---
  15308. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15309. =>WM: (14133: S1 ^operator O2011)
  15310. 1006: O: O2011 (predict-yes)
  15311. --- END Decision Phase ---
  15312. --- Application Phase ---
  15313. --- Firing Productions (PE) For State At Depth 1 ---
  15314. --- Inner Elaboration Phase, active level 1 (S1) ---
  15315. Firing apply*operator
  15316. -->
  15317. (I3 ^predict-yes N1006 + :O )
  15318. Firing apply*operator*complete
  15319. -->
  15320. (I3 ^predict-no N1005 - :O )
  15321. inner elaboration loop at bottom goal.
  15322. --- Change Working Memory (PE) ---
  15323. =>WM: (14134: I3 ^predict-yes N1006)
  15324. <=WM: (14120: N1005 ^status complete)
  15325. <=WM: (14119: I3 ^predict-no N1005)
  15326. --- Firing Productions (IE) For State At Depth 1 ---
  15327. --- Inner Elaboration Phase, active level 1 (S1) ---
  15328. Firing monitor*world
  15329. -->
  15330. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15331. --- Change Working Memory (IE) ---
  15332. --- END Application Phase ---
  15333. --- Output Phase ---
  15334. ENV: Agent did: predict-yes for direction L in state State-B
  15335. In State-B moving L
  15336. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15337. predict error 0
  15338. dir: dir isL
  15339. --- END Output Phase ---
  15340. --- Input Phase ---
  15341. =>WM: (14138: I2 ^dir L)
  15342. =>WM: (14137: I2 ^reward 1)
  15343. =>WM: (14136: I2 ^see 1)
  15344. =>WM: (14135: N1006 ^status complete)
  15345. <=WM: (14123: I2 ^dir L)
  15346. <=WM: (14122: I2 ^reward 1)
  15347. <=WM: (14121: I2 ^see 0)
  15348. =>WM: (14139: I2 ^level-1 L1-root)
  15349. <=WM: (14124: I2 ^level-1 R1-root)
  15350. --- END Input Phase ---
  15351. --- Proposal Phase ---
  15352. --- Inner Elaboration Phase, active level 1 (S1) ---
  15353. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  15354. -->
  15355. (S1 ^operator O2011 = 0.1693592933936033)
  15356. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  15357. -->
  15358. (S1 ^operator O2012 = 0.7449864646061185)
  15359. Firing prefer*rvt*predict-no*H0*2*H1
  15360. -->
  15361. Firing prefer*rvt*predict-yes*H0*1*H1
  15362. -->
  15363. Firing elaborate*copy-see-to-output-link
  15364. -->
  15365. (I3 ^see 1 +)
  15366. Firing elaborate*reward*based*on*reward
  15367. -->
  15368. (R1010 ^value 1 +)
  15369. (R1 ^reward R1010 +)
  15370. Firing propose*predict-yes
  15371. -->
  15372. (O2013 ^name predict-yes +)
  15373. (S1 ^operator O2013 +)
  15374. Firing propose*predict-no
  15375. -->
  15376. (O2014 ^name predict-no +)
  15377. (S1 ^operator O2014 +)
  15378. Firing rl*prefer*rvt*predict-no*H0*2
  15379. -->
  15380. (S1 ^operator O2012 = 0.2550133610598087)
  15381. Firing rl*prefer*rvt*predict-yes*H0*1
  15382. -->
  15383. (S1 ^operator O2011 = 0.5231200249393807)
  15384. Firing prefer*rvt*predict-yes*H0
  15385. -->
  15386. Firing prefer*rvt*predict-no*H0
  15387. -->
  15388. Firing elaborate*copy-dir-to-output-link
  15389. -->
  15390. (I3 ^dir L +)
  15391. inner elaboration loop at bottom goal.
  15392. Retracting elaborate*copy-see-to-output-link
  15393. -->
  15394. (I3 ^see 0 +)
  15395. Retracting propose*predict-no
  15396. -->
  15397. (O2012 ^name predict-no +)
  15398. (S1 ^operator O2012 +)
  15399. Retracting propose*predict-yes
  15400. -->
  15401. (O2011 ^name predict-yes +)
  15402. (S1 ^operator O2011 +)
  15403. Retracting elaborate*reward*based*on*reward
  15404. -->
  15405. (R1009 ^value 1 +)
  15406. (R1 ^reward R1009 +)
  15407. Retracting elaborate*copy-dir-to-output-link
  15408. -->
  15409. (I3 ^dir L +)
  15410. Retracting rl*prefer*rvt*predict-no*H0*2
  15411. -->
  15412. (S1 ^operator O2012 = 0.2550133610598087)
  15413. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  15414. -->
  15415. (S1 ^operator O2012 = -0.01194930198035649)
  15416. Retracting rl*prefer*rvt*predict-yes*H0*1
  15417. -->
  15418. (S1 ^operator O2011 = 0.5231200249393807)
  15419. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  15420. -->
  15421. (S1 ^operator O2011 = 0.4768779238990463)
  15422. =>WM: (14146: S1 ^operator O2014 +)
  15423. =>WM: (14145: S1 ^operator O2013 +)
  15424. =>WM: (14144: O2014 ^name predict-no)
  15425. =>WM: (14143: O2013 ^name predict-yes)
  15426. =>WM: (14142: R1010 ^value 1)
  15427. =>WM: (14141: R1 ^reward R1010)
  15428. =>WM: (14140: I3 ^see 1)
  15429. <=WM: (14131: S1 ^operator O2011 +)
  15430. <=WM: (14133: S1 ^operator O2011)
  15431. <=WM: (14132: S1 ^operator O2012 +)
  15432. <=WM: (14126: R1 ^reward R1009)
  15433. <=WM: (14125: I3 ^see 0)
  15434. <=WM: (14129: O2012 ^name predict-no)
  15435. <=WM: (14128: O2011 ^name predict-yes)
  15436. <=WM: (14127: R1009 ^value 1)
  15437. --- Inner Elaboration Phase, active level 1 (S1) ---
  15438. Firing prefer*rvt*predict-yes*H0
  15439. -->
  15440. Firing rl*prefer*rvt*predict-yes*H0*1
  15441. -->
  15442. (S1 ^operator O2013 = 0.5231200249393807)
  15443. Firing prefer*rvt*predict-yes*H0*1*H1
  15444. -->
  15445. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  15446. -->
  15447. (S1 ^operator O2013 = 0.1693592933936033)
  15448. Firing prefer*rvt*predict-no*H0
  15449. -->
  15450. Firing rl*prefer*rvt*predict-no*H0*2
  15451. -->
  15452. (S1 ^operator O2014 = 0.2550133610598087)
  15453. Firing prefer*rvt*predict-no*H0*2*H1
  15454. -->
  15455. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  15456. -->
  15457. (S1 ^operator O2014 = 0.7449864646061185)
  15458. inner elaboration loop at bottom goal.
  15459. Retracting rl*prefer*rvt*predict-no*H0*2
  15460. -->
  15461. (S1 ^operator O2012 = 0.2550133610598087)
  15462. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  15463. -->
  15464. (S1 ^operator O2012 = 0.7449864646061185)
  15465. Retracting rl*prefer*rvt*predict-yes*H0*1
  15466. -->
  15467. (S1 ^operator O2011 = 0.5231200249393807)
  15468. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  15469. -->
  15470. (S1 ^operator O2011 = 0.1693592933936033)
  15471. --- END Proposal Phase ---
  15472. --- Decision Phase ---
  15473. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.979021,0.0206835)
  15474. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272037 0.204841 0.476878 -> 0.272038 0.20484 0.476878(R,m,v=1,1,0)
  15475. =>WM: (14147: S1 ^operator O2014)
  15476. 1007: O: O2014 (predict-no)
  15477. --- END Decision Phase ---
  15478. --- Application Phase ---
  15479. --- Firing Productions (PE) For State At Depth 1 ---
  15480. --- Inner Elaboration Phase, active level 1 (S1) ---
  15481. Firing apply*operator
  15482. -->
  15483. (I3 ^predict-no N1007 + :O )
  15484. Firing apply*operator*complete
  15485. -->
  15486. (I3 ^predict-yes N1006 - :O )
  15487. inner elaboration loop at bottom goal.
  15488. --- Change Working Memory (PE) ---
  15489. =>WM: (14148: I3 ^predict-no N1007)
  15490. <=WM: (14135: N1006 ^status complete)
  15491. <=WM: (14134: I3 ^predict-yes N1006)
  15492. --- Firing Productions (IE) For State At Depth 1 ---
  15493. --- Inner Elaboration Phase, active level 1 (S1) ---
  15494. Firing monitor*world
  15495. -->
  15496. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15497. --- Change Working Memory (IE) ---
  15498. --- END Application Phase ---
  15499. --- Output Phase ---
  15500. ENV: Agent did: predict-no for direction L in state State-A
  15501. In State-A moving L
  15502. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15503. predict error 0
  15504. dir: dir isU
  15505. --- END Output Phase ---
  15506. /|\--- Input Phase ---
  15507. =>WM: (14152: I2 ^dir U)
  15508. =>WM: (14151: I2 ^reward 1)
  15509. =>WM: (14150: I2 ^see 0)
  15510. =>WM: (14149: N1007 ^status complete)
  15511. <=WM: (14138: I2 ^dir L)
  15512. <=WM: (14137: I2 ^reward 1)
  15513. <=WM: (14136: I2 ^see 1)
  15514. =>WM: (14153: I2 ^level-1 L0-root)
  15515. <=WM: (14139: I2 ^level-1 L1-root)
  15516. --- END Input Phase ---
  15517. --- Proposal Phase ---
  15518. --- Inner Elaboration Phase, active level 1 (S1) ---
  15519. Firing elaborate*copy-see-to-output-link
  15520. -->
  15521. (I3 ^see 0 +)
  15522. Firing elaborate*reward*based*on*reward
  15523. -->
  15524. (R1011 ^value 1 +)
  15525. (R1 ^reward R1011 +)
  15526. Firing propose*predict-yes
  15527. -->
  15528. (O2015 ^name predict-yes +)
  15529. (S1 ^operator O2015 +)
  15530. Firing propose*predict-no
  15531. -->
  15532. (O2016 ^name predict-no +)
  15533. (S1 ^operator O2016 +)
  15534. Firing rl*prefer*rvt*predict-no*H0*6
  15535. -->
  15536. (S1 ^operator O2014 = 0.9999999999999999)
  15537. Firing rl*prefer*rvt*predict-yes*H0*5
  15538. -->
  15539. (S1 ^operator O2013 = 0.)
  15540. Firing prefer*rvt*predict-yes*H0
  15541. -->
  15542. Firing prefer*rvt*predict-no*H0
  15543. -->
  15544. Firing elaborate*copy-dir-to-output-link
  15545. -->
  15546. (I3 ^dir U +)
  15547. inner elaboration loop at bottom goal.
  15548. Retracting elaborate*copy-see-to-output-link
  15549. -->
  15550. (I3 ^see 1 +)
  15551. Retracting propose*predict-no
  15552. -->
  15553. (O2014 ^name predict-no +)
  15554. (S1 ^operator O2014 +)
  15555. Retracting propose*predict-yes
  15556. -->
  15557. (O2013 ^name predict-yes +)
  15558. (S1 ^operator O2013 +)
  15559. Retracting elaborate*reward*based*on*reward
  15560. -->
  15561. (R1010 ^value 1 +)
  15562. (R1 ^reward R1010 +)
  15563. Retracting elaborate*copy-dir-to-output-link
  15564. -->
  15565. (I3 ^dir L +)
  15566. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  15567. -->
  15568. (S1 ^operator O2014 = 0.7449864646061185)
  15569. Retracting rl*prefer*rvt*predict-no*H0*2
  15570. -->
  15571. (S1 ^operator O2014 = 0.2550133610598087)
  15572. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  15573. -->
  15574. (S1 ^operator O2013 = 0.1693592933936033)
  15575. Retracting rl*prefer*rvt*predict-yes*H0*1
  15576. -->
  15577. (S1 ^operator O2013 = 0.5231203326136166)
  15578. =>WM: (14161: S1 ^operator O2016 +)
  15579. =>WM: (14160: S1 ^operator O2015 +)
  15580. =>WM: (14159: I3 ^dir U)
  15581. =>WM: (14158: O2016 ^name predict-no)
  15582. =>WM: (14157: O2015 ^name predict-yes)
  15583. =>WM: (14156: R1011 ^value 1)
  15584. =>WM: (14155: R1 ^reward R1011)
  15585. =>WM: (14154: I3 ^see 0)
  15586. <=WM: (14145: S1 ^operator O2013 +)
  15587. <=WM: (14146: S1 ^operator O2014 +)
  15588. <=WM: (14147: S1 ^operator O2014)
  15589. <=WM: (14130: I3 ^dir L)
  15590. <=WM: (14141: R1 ^reward R1010)
  15591. <=WM: (14140: I3 ^see 1)
  15592. <=WM: (14144: O2014 ^name predict-no)
  15593. <=WM: (14143: O2013 ^name predict-yes)
  15594. <=WM: (14142: R1010 ^value 1)
  15595. --- Inner Elaboration Phase, active level 1 (S1) ---
  15596. Firing prefer*rvt*predict-yes*H0
  15597. -->
  15598. Firing rl*prefer*rvt*predict-yes*H0*5
  15599. -->
  15600. (S1 ^operator O2015 = 0.)
  15601. Firing prefer*rvt*predict-no*H0
  15602. -->
  15603. Firing rl*prefer*rvt*predict-no*H0*6
  15604. -->
  15605. (S1 ^operator O2016 = 0.9999999999999999)
  15606. inner elaboration loop at bottom goal.
  15607. Retracting rl*prefer*rvt*predict-no*H0*6
  15608. -->
  15609. (S1 ^operator O2014 = 0.9999999999999999)
  15610. Retracting rl*prefer*rvt*predict-yes*H0*5
  15611. -->
  15612. (S1 ^operator O2013 = 0.)
  15613. --- END Proposal Phase ---
  15614. --- Decision Phase ---
  15615. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.919192,0.0746552)
  15616. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744986(R,m,v=1,1,0)
  15617. =>WM: (14162: S1 ^operator O2016)
  15618. 1008: O: O2016 (predict-no)
  15619. --- END Decision Phase ---
  15620. --- Application Phase ---
  15621. --- Firing Productions (PE) For State At Depth 1 ---
  15622. --- Inner Elaboration Phase, active level 1 (S1) ---
  15623. Firing apply*operator
  15624. -->
  15625. (I3 ^predict-no N1008 + :O )
  15626. Firing apply*operator*complete
  15627. -->
  15628. (I3 ^predict-no N1007 - :O )
  15629. inner elaboration loop at bottom goal.
  15630. --- Change Working Memory (PE) ---
  15631. =>WM: (14163: I3 ^predict-no N1008)
  15632. <=WM: (14149: N1007 ^status complete)
  15633. <=WM: (14148: I3 ^predict-no N1007)
  15634. --- Firing Productions (IE) For State At Depth 1 ---
  15635. --- Inner Elaboration Phase, active level 1 (S1) ---
  15636. Firing monitor*world
  15637. -->
  15638. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15639. --- Change Working Memory (IE) ---
  15640. --- END Application Phase ---
  15641. --- Output Phase ---
  15642. ENV: Agent did: predict-no for direction U in state State-A
  15643. In State-A moving U
  15644. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15645. predict error 0
  15646. dir: dir isU
  15647. --- END Output Phase ---
  15648. -/|--- Input Phase ---
  15649. =>WM: (14167: I2 ^dir U)
  15650. =>WM: (14166: I2 ^reward 1)
  15651. =>WM: (14165: I2 ^see 0)
  15652. =>WM: (14164: N1008 ^status complete)
  15653. <=WM: (14152: I2 ^dir U)
  15654. <=WM: (14151: I2 ^reward 1)
  15655. <=WM: (14150: I2 ^see 0)
  15656. =>WM: (14168: I2 ^level-1 L0-root)
  15657. <=WM: (14153: I2 ^level-1 L0-root)
  15658. --- END Input Phase ---
  15659. --- Proposal Phase ---
  15660. --- Inner Elaboration Phase, active level 1 (S1) ---
  15661. Firing elaborate*copy-see-to-output-link
  15662. -->
  15663. (I3 ^see 0 +)
  15664. Firing elaborate*reward*based*on*reward
  15665. -->
  15666. (R1012 ^value 1 +)
  15667. (R1 ^reward R1012 +)
  15668. Firing propose*predict-yes
  15669. -->
  15670. (O2017 ^name predict-yes +)
  15671. (S1 ^operator O2017 +)
  15672. Firing propose*predict-no
  15673. -->
  15674. (O2018 ^name predict-no +)
  15675. (S1 ^operator O2018 +)
  15676. Firing rl*prefer*rvt*predict-no*H0*6
  15677. -->
  15678. (S1 ^operator O2016 = 0.9999999999999999)
  15679. Firing rl*prefer*rvt*predict-yes*H0*5
  15680. -->
  15681. (S1 ^operator O2015 = 0.)
  15682. Firing prefer*rvt*predict-yes*H0
  15683. -->
  15684. Firing prefer*rvt*predict-no*H0
  15685. -->
  15686. Firing elaborate*copy-dir-to-output-link
  15687. -->
  15688. (I3 ^dir U +)
  15689. inner elaboration loop at bottom goal.
  15690. Retracting elaborate*copy-see-to-output-link
  15691. -->
  15692. (I3 ^see 0 +)
  15693. Retracting propose*predict-no
  15694. -->
  15695. (O2016 ^name predict-no +)
  15696. (S1 ^operator O2016 +)
  15697. Retracting propose*predict-yes
  15698. -->
  15699. (O2015 ^name predict-yes +)
  15700. (S1 ^operator O2015 +)
  15701. Retracting elaborate*reward*based*on*reward
  15702. -->
  15703. (R1011 ^value 1 +)
  15704. (R1 ^reward R1011 +)
  15705. Retracting elaborate*copy-dir-to-output-link
  15706. -->
  15707. (I3 ^dir U +)
  15708. Retracting rl*prefer*rvt*predict-no*H0*6
  15709. -->
  15710. (S1 ^operator O2016 = 0.9999999999999999)
  15711. Retracting rl*prefer*rvt*predict-yes*H0*5
  15712. -->
  15713. (S1 ^operator O2015 = 0.)
  15714. =>WM: (14174: S1 ^operator O2018 +)
  15715. =>WM: (14173: S1 ^operator O2017 +)
  15716. =>WM: (14172: O2018 ^name predict-no)
  15717. =>WM: (14171: O2017 ^name predict-yes)
  15718. =>WM: (14170: R1012 ^value 1)
  15719. =>WM: (14169: R1 ^reward R1012)
  15720. <=WM: (14160: S1 ^operator O2015 +)
  15721. <=WM: (14161: S1 ^operator O2016 +)
  15722. <=WM: (14162: S1 ^operator O2016)
  15723. <=WM: (14155: R1 ^reward R1011)
  15724. <=WM: (14158: O2016 ^name predict-no)
  15725. <=WM: (14157: O2015 ^name predict-yes)
  15726. <=WM: (14156: R1011 ^value 1)
  15727. --- Inner Elaboration Phase, active level 1 (S1) ---
  15728. Firing prefer*rvt*predict-yes*H0
  15729. -->
  15730. Firing rl*prefer*rvt*predict-yes*H0*5
  15731. -->
  15732. (S1 ^operator O2017 = 0.)
  15733. Firing prefer*rvt*predict-no*H0
  15734. -->
  15735. Firing rl*prefer*rvt*predict-no*H0*6
  15736. -->
  15737. (S1 ^operator O2018 = 0.9999999999999999)
  15738. inner elaboration loop at bottom goal.
  15739. Retracting rl*prefer*rvt*predict-no*H0*6
  15740. -->
  15741. (S1 ^operator O2016 = 0.9999999999999999)
  15742. Retracting rl*prefer*rvt*predict-yes*H0*5
  15743. -->
  15744. (S1 ^operator O2015 = 0.)
  15745. --- END Proposal Phase ---
  15746. --- Decision Phase ---
  15747. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15748. =>WM: (14175: S1 ^operator O2018)
  15749. 1009: O: O2018 (predict-no)
  15750. --- END Decision Phase ---
  15751. --- Application Phase ---
  15752. --- Firing Productions (PE) For State At Depth 1 ---
  15753. --- Inner Elaboration Phase, active level 1 (S1) ---
  15754. Firing apply*operator
  15755. -->
  15756. (I3 ^predict-no N1009 + :O )
  15757. Firing apply*operator*complete
  15758. -->
  15759. (I3 ^predict-no N1008 - :O )
  15760. inner elaboration loop at bottom goal.
  15761. --- Change Working Memory (PE) ---
  15762. =>WM: (14176: I3 ^predict-no N1009)
  15763. <=WM: (14164: N1008 ^status complete)
  15764. <=WM: (14163: I3 ^predict-no N1008)
  15765. --- Firing Productions (IE) For State At Depth 1 ---
  15766. --- Inner Elaboration Phase, active level 1 (S1) ---
  15767. Firing monitor*world
  15768. -->
  15769. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15770. --- Change Working Memory (IE) ---
  15771. --- END Application Phase ---
  15772. --- Output Phase ---
  15773. ENV: Agent did: predict-no for direction U in state State-A
  15774. In State-A moving U
  15775. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15776. predict error 0
  15777. dir: dir isU
  15778. --- END Output Phase ---
  15779. \-/--- Input Phase ---
  15780. =>WM: (14180: I2 ^dir U)
  15781. =>WM: (14179: I2 ^reward 1)
  15782. =>WM: (14178: I2 ^see 0)
  15783. =>WM: (14177: N1009 ^status complete)
  15784. <=WM: (14167: I2 ^dir U)
  15785. <=WM: (14166: I2 ^reward 1)
  15786. <=WM: (14165: I2 ^see 0)
  15787. =>WM: (14181: I2 ^level-1 L0-root)
  15788. <=WM: (14168: I2 ^level-1 L0-root)
  15789. --- END Input Phase ---
  15790. --- Proposal Phase ---
  15791. --- Inner Elaboration Phase, active level 1 (S1) ---
  15792. Firing elaborate*copy-see-to-output-link
  15793. -->
  15794. (I3 ^see 0 +)
  15795. Firing elaborate*reward*based*on*reward
  15796. -->
  15797. (R1013 ^value 1 +)
  15798. (R1 ^reward R1013 +)
  15799. Firing propose*predict-yes
  15800. -->
  15801. (O2019 ^name predict-yes +)
  15802. (S1 ^operator O2019 +)
  15803. Firing propose*predict-no
  15804. -->
  15805. (O2020 ^name predict-no +)
  15806. (S1 ^operator O2020 +)
  15807. Firing rl*prefer*rvt*predict-no*H0*6
  15808. -->
  15809. (S1 ^operator O2018 = 0.9999999999999999)
  15810. Firing rl*prefer*rvt*predict-yes*H0*5
  15811. -->
  15812. (S1 ^operator O2017 = 0.)
  15813. Firing prefer*rvt*predict-yes*H0
  15814. -->
  15815. Firing prefer*rvt*predict-no*H0
  15816. -->
  15817. Firing elaborate*copy-dir-to-output-link
  15818. -->
  15819. (I3 ^dir U +)
  15820. inner elaboration loop at bottom goal.
  15821. Retracting elaborate*copy-see-to-output-link
  15822. -->
  15823. (I3 ^see 0 +)
  15824. Retracting propose*predict-no
  15825. -->
  15826. (O2018 ^name predict-no +)
  15827. (S1 ^operator O2018 +)
  15828. Retracting propose*predict-yes
  15829. -->
  15830. (O2017 ^name predict-yes +)
  15831. (S1 ^operator O2017 +)
  15832. Retracting elaborate*reward*based*on*reward
  15833. -->
  15834. (R1012 ^value 1 +)
  15835. (R1 ^reward R1012 +)
  15836. Retracting elaborate*copy-dir-to-output-link
  15837. -->
  15838. (I3 ^dir U +)
  15839. Retracting rl*prefer*rvt*predict-no*H0*6
  15840. -->
  15841. (S1 ^operator O2018 = 0.9999999999999999)
  15842. Retracting rl*prefer*rvt*predict-yes*H0*5
  15843. -->
  15844. (S1 ^operator O2017 = 0.)
  15845. =>WM: (14187: S1 ^operator O2020 +)
  15846. =>WM: (14186: S1 ^operator O2019 +)
  15847. =>WM: (14185: O2020 ^name predict-no)
  15848. =>WM: (14184: O2019 ^name predict-yes)
  15849. =>WM: (14183: R1013 ^value 1)
  15850. =>WM: (14182: R1 ^reward R1013)
  15851. <=WM: (14173: S1 ^operator O2017 +)
  15852. <=WM: (14174: S1 ^operator O2018 +)
  15853. <=WM: (14175: S1 ^operator O2018)
  15854. <=WM: (14169: R1 ^reward R1012)
  15855. <=WM: (14172: O2018 ^name predict-no)
  15856. <=WM: (14171: O2017 ^name predict-yes)
  15857. <=WM: (14170: R1012 ^value 1)
  15858. --- Inner Elaboration Phase, active level 1 (S1) ---
  15859. Firing prefer*rvt*predict-yes*H0
  15860. -->
  15861. Firing rl*prefer*rvt*predict-yes*H0*5
  15862. -->
  15863. (S1 ^operator O2019 = 0.)
  15864. Firing prefer*rvt*predict-no*H0
  15865. -->
  15866. Firing rl*prefer*rvt*predict-no*H0*6
  15867. -->
  15868. (S1 ^operator O2020 = 0.9999999999999999)
  15869. inner elaboration loop at bottom goal.
  15870. Retracting rl*prefer*rvt*predict-no*H0*6
  15871. -->
  15872. (S1 ^operator O2018 = 0.9999999999999999)
  15873. Retracting rl*prefer*rvt*predict-yes*H0*5
  15874. -->
  15875. (S1 ^operator O2017 = 0.)
  15876. --- END Proposal Phase ---
  15877. --- Decision Phase ---
  15878. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15879. =>WM: (14188: S1 ^operator O2020)
  15880. 1010: O: O2020 (predict-no)
  15881. --- END Decision Phase ---
  15882. --- Application Phase ---
  15883. --- Firing Productions (PE) For State At Depth 1 ---
  15884. --- Inner Elaboration Phase, active level 1 (S1) ---
  15885. Firing apply*operator
  15886. -->
  15887. (I3 ^predict-no N1010 + :O )
  15888. Firing apply*operator*complete
  15889. -->
  15890. (I3 ^predict-no N1009 - :O )
  15891. inner elaboration loop at bottom goal.
  15892. --- Change Working Memory (PE) ---
  15893. =>WM: (14189: I3 ^predict-no N1010)
  15894. <=WM: (14177: N1009 ^status complete)
  15895. <=WM: (14176: I3 ^predict-no N1009)
  15896. --- Firing Productions (IE) For State At Depth 1 ---
  15897. --- Inner Elaboration Phase, active level 1 (S1) ---
  15898. Firing monitor*world
  15899. -->
  15900. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15901. --- Change Working Memory (IE) ---
  15902. --- END Application Phase ---
  15903. --- Output Phase ---
  15904. ENV: Agent did: predict-no for direction U in state State-A
  15905. In State-A moving U
  15906. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15907. predict error 0
  15908. dir: dir isU
  15909. --- END Output Phase ---
  15910. |\---- Input Phase ---
  15911. =>WM: (14193: I2 ^dir U)
  15912. =>WM: (14192: I2 ^reward 1)
  15913. =>WM: (14191: I2 ^see 0)
  15914. =>WM: (14190: N1010 ^status complete)
  15915. <=WM: (14180: I2 ^dir U)
  15916. <=WM: (14179: I2 ^reward 1)
  15917. <=WM: (14178: I2 ^see 0)
  15918. =>WM: (14194: I2 ^level-1 L0-root)
  15919. <=WM: (14181: I2 ^level-1 L0-root)
  15920. --- END Input Phase ---
  15921. --- Proposal Phase ---
  15922. --- Inner Elaboration Phase, active level 1 (S1) ---
  15923. Firing elaborate*copy-see-to-output-link
  15924. -->
  15925. (I3 ^see 0 +)
  15926. Firing elaborate*reward*based*on*reward
  15927. -->
  15928. (R1014 ^value 1 +)
  15929. (R1 ^reward R1014 +)
  15930. Firing propose*predict-yes
  15931. -->
  15932. (O2021 ^name predict-yes +)
  15933. (S1 ^operator O2021 +)
  15934. Firing propose*predict-no
  15935. -->
  15936. (O2022 ^name predict-no +)
  15937. (S1 ^operator O2022 +)
  15938. Firing rl*prefer*rvt*predict-no*H0*6
  15939. -->
  15940. (S1 ^operator O2020 = 0.9999999999999999)
  15941. Firing rl*prefer*rvt*predict-yes*H0*5
  15942. -->
  15943. (S1 ^operator O2019 = 0.)
  15944. Firing prefer*rvt*predict-yes*H0
  15945. -->
  15946. Firing prefer*rvt*predict-no*H0
  15947. -->
  15948. Firing elaborate*copy-dir-to-output-link
  15949. -->
  15950. (I3 ^dir U +)
  15951. inner elaboration loop at bottom goal.
  15952. Retracting elaborate*copy-see-to-output-link
  15953. -->
  15954. (I3 ^see 0 +)
  15955. Retracting propose*predict-no
  15956. -->
  15957. (O2020 ^name predict-no +)
  15958. (S1 ^operator O2020 +)
  15959. Retracting propose*predict-yes
  15960. -->
  15961. (O2019 ^name predict-yes +)
  15962. (S1 ^operator O2019 +)
  15963. Retracting elaborate*reward*based*on*reward
  15964. -->
  15965. (R1013 ^value 1 +)
  15966. (R1 ^reward R1013 +)
  15967. Retracting elaborate*copy-dir-to-output-link
  15968. -->
  15969. (I3 ^dir U +)
  15970. Retracting rl*prefer*rvt*predict-no*H0*6
  15971. -->
  15972. (S1 ^operator O2020 = 0.9999999999999999)
  15973. Retracting rl*prefer*rvt*predict-yes*H0*5
  15974. -->
  15975. (S1 ^operator O2019 = 0.)
  15976. =>WM: (14200: S1 ^operator O2022 +)
  15977. =>WM: (14199: S1 ^operator O2021 +)
  15978. =>WM: (14198: O2022 ^name predict-no)
  15979. =>WM: (14197: O2021 ^name predict-yes)
  15980. =>WM: (14196: R1014 ^value 1)
  15981. =>WM: (14195: R1 ^reward R1014)
  15982. <=WM: (14186: S1 ^operator O2019 +)
  15983. <=WM: (14187: S1 ^operator O2020 +)
  15984. <=WM: (14188: S1 ^operator O2020)
  15985. <=WM: (14182: R1 ^reward R1013)
  15986. <=WM: (14185: O2020 ^name predict-no)
  15987. <=WM: (14184: O2019 ^name predict-yes)
  15988. <=WM: (14183: R1013 ^value 1)
  15989. --- Inner Elaboration Phase, active level 1 (S1) ---
  15990. Firing prefer*rvt*predict-yes*H0
  15991. -->
  15992. Firing rl*prefer*rvt*predict-yes*H0*5
  15993. -->
  15994. (S1 ^operator O2021 = 0.)
  15995. Firing prefer*rvt*predict-no*H0
  15996. -->
  15997. Firing rl*prefer*rvt*predict-no*H0*6
  15998. -->
  15999. (S1 ^operator O2022 = 0.9999999999999999)
  16000. inner elaboration loop at bottom goal.
  16001. Retracting rl*prefer*rvt*predict-no*H0*6
  16002. -->
  16003. (S1 ^operator O2020 = 0.9999999999999999)
  16004. Retracting rl*prefer*rvt*predict-yes*H0*5
  16005. -->
  16006. (S1 ^operator O2019 = 0.)
  16007. --- END Proposal Phase ---
  16008. --- Decision Phase ---
  16009. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16010. =>WM: (14201: S1 ^operator O2022)
  16011. 1011: O: O2022 (predict-no)
  16012. --- END Decision Phase ---
  16013. --- Application Phase ---
  16014. --- Firing Productions (PE) For State At Depth 1 ---
  16015. --- Inner Elaboration Phase, active level 1 (S1) ---
  16016. Firing apply*operator
  16017. -->
  16018. (I3 ^predict-no N1011 + :O )
  16019. Firing apply*operator*complete
  16020. -->
  16021. (I3 ^predict-no N1010 - :O )
  16022. inner elaboration loop at bottom goal.
  16023. --- Change Working Memory (PE) ---
  16024. =>WM: (14202: I3 ^predict-no N1011)
  16025. <=WM: (14190: N1010 ^status complete)
  16026. <=WM: (14189: I3 ^predict-no N1010)
  16027. --- Firing Productions (IE) For State At Depth 1 ---
  16028. --- Inner Elaboration Phase, active level 1 (S1) ---
  16029. Firing monitor*world
  16030. -->
  16031. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16032. --- Change Working Memory (IE) ---
  16033. --- END Application Phase ---
  16034. --- Output Phase ---
  16035. ENV: Agent did: predict-no for direction U in state State-A
  16036. In State-A moving U
  16037. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16038. predict error 0
  16039. dir: dir isR
  16040. --- END Output Phase ---
  16041. /--- Input Phase ---
  16042. =>WM: (14206: I2 ^dir R)
  16043. =>WM: (14205: I2 ^reward 1)
  16044. =>WM: (14204: I2 ^see 0)
  16045. =>WM: (14203: N1011 ^status complete)
  16046. <=WM: (14193: I2 ^dir U)
  16047. <=WM: (14192: I2 ^reward 1)
  16048. <=WM: (14191: I2 ^see 0)
  16049. =>WM: (14207: I2 ^level-1 L0-root)
  16050. <=WM: (14194: I2 ^level-1 L0-root)
  16051. --- END Input Phase ---
  16052. --- Proposal Phase ---
  16053. --- Inner Elaboration Phase, active level 1 (S1) ---
  16054. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  16055. -->
  16056. (S1 ^operator O2021 = 0.6170744569149269)
  16057. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  16058. -->
  16059. (S1 ^operator O2022 = 0.4910065094545203)
  16060. Firing prefer*rvt*predict-no*H0*4*H1
  16061. -->
  16062. Firing prefer*rvt*predict-yes*H0*3*H1
  16063. -->
  16064. Firing elaborate*copy-see-to-output-link
  16065. -->
  16066. (I3 ^see 0 +)
  16067. Firing elaborate*reward*based*on*reward
  16068. -->
  16069. (R1015 ^value 1 +)
  16070. (R1 ^reward R1015 +)
  16071. Firing propose*predict-yes
  16072. -->
  16073. (O2023 ^name predict-yes +)
  16074. (S1 ^operator O2023 +)
  16075. Firing propose*predict-no
  16076. -->
  16077. (O2024 ^name predict-no +)
  16078. (S1 ^operator O2024 +)
  16079. Firing rl*prefer*rvt*predict-no*H0*4
  16080. -->
  16081. (S1 ^operator O2022 = 0.1269768790760836)
  16082. Firing rl*prefer*rvt*predict-yes*H0*3
  16083. -->
  16084. (S1 ^operator O2021 = 0.3829338060190436)
  16085. Firing prefer*rvt*predict-yes*H0
  16086. -->
  16087. Firing prefer*rvt*predict-no*H0
  16088. -->
  16089. Firing elaborate*copy-dir-to-output-link
  16090. -->
  16091. (I3 ^dir R +)
  16092. inner elaboration loop at bottom goal.
  16093. Retracting elaborate*copy-see-to-output-link
  16094. -->
  16095. (I3 ^see 0 +)
  16096. Retracting propose*predict-no
  16097. -->
  16098. (O2022 ^name predict-no +)
  16099. (S1 ^operator O2022 +)
  16100. Retracting propose*predict-yes
  16101. -->
  16102. (O2021 ^name predict-yes +)
  16103. (S1 ^operator O2021 +)
  16104. Retracting elaborate*reward*based*on*reward
  16105. -->
  16106. (R1014 ^value 1 +)
  16107. (R1 ^reward R1014 +)
  16108. Retracting elaborate*copy-dir-to-output-link
  16109. -->
  16110. (I3 ^dir U +)
  16111. Retracting rl*prefer*rvt*predict-no*H0*6
  16112. -->
  16113. (S1 ^operator O2022 = 0.9999999999999999)
  16114. Retracting rl*prefer*rvt*predict-yes*H0*5
  16115. -->
  16116. (S1 ^operator O2021 = 0.)
  16117. =>WM: (14214: S1 ^operator O2024 +)
  16118. =>WM: (14213: S1 ^operator O2023 +)
  16119. =>WM: (14212: I3 ^dir R)
  16120. =>WM: (14211: O2024 ^name predict-no)
  16121. =>WM: (14210: O2023 ^name predict-yes)
  16122. =>WM: (14209: R1015 ^value 1)
  16123. =>WM: (14208: R1 ^reward R1015)
  16124. <=WM: (14199: S1 ^operator O2021 +)
  16125. <=WM: (14200: S1 ^operator O2022 +)
  16126. <=WM: (14201: S1 ^operator O2022)
  16127. <=WM: (14159: I3 ^dir U)
  16128. <=WM: (14195: R1 ^reward R1014)
  16129. <=WM: (14198: O2022 ^name predict-no)
  16130. <=WM: (14197: O2021 ^name predict-yes)
  16131. <=WM: (14196: R1014 ^value 1)
  16132. --- Inner Elaboration Phase, active level 1 (S1) ---
  16133. Firing prefer*rvt*predict-yes*H0
  16134. -->
  16135. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  16136. -->
  16137. (S1 ^operator O2023 = 0.6170744569149269)
  16138. Firing rl*prefer*rvt*predict-yes*H0*3
  16139. -->
  16140. (S1 ^operator O2023 = 0.3829338060190436)
  16141. Firing prefer*rvt*predict-yes*H0*3*H1
  16142. -->
  16143. Firing prefer*rvt*predict-no*H0
  16144. -->
  16145. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  16146. -->
  16147. (S1 ^operator O2024 = 0.4910065094545203)
  16148. Firing rl*prefer*rvt*predict-no*H0*4
  16149. -->
  16150. (S1 ^operator O2024 = 0.1269768790760836)
  16151. Firing prefer*rvt*predict-no*H0*4*H1
  16152. -->
  16153. inner elaboration loop at bottom goal.
  16154. Retracting rl*prefer*rvt*predict-no*H0*4
  16155. -->
  16156. (S1 ^operator O2022 = 0.1269768790760836)
  16157. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  16158. -->
  16159. (S1 ^operator O2022 = 0.4910065094545203)
  16160. Retracting rl*prefer*rvt*predict-yes*H0*3
  16161. -->
  16162. (S1 ^operator O2021 = 0.3829338060190436)
  16163. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  16164. -->
  16165. (S1 ^operator O2021 = 0.6170744569149269)
  16166. --- END Proposal Phase ---
  16167. --- Decision Phase ---
  16168. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16169. =>WM: (14215: S1 ^operator O2023)
  16170. 1012: O: O2023 (predict-yes)
  16171. --- END Decision Phase ---
  16172. --- Application Phase ---
  16173. --- Firing Productions (PE) For State At Depth 1 ---
  16174. --- Inner Elaboration Phase, active level 1 (S1) ---
  16175. Firing apply*operator
  16176. -->
  16177. (I3 ^predict-yes N1012 + :O )
  16178. Firing apply*operator*complete
  16179. -->
  16180. (I3 ^predict-no N1011 - :O )
  16181. inner elaboration loop at bottom goal.
  16182. --- Change Working Memory (PE) ---
  16183. =>WM: (14216: I3 ^predict-yes N1012)
  16184. <=WM: (14203: N1011 ^status complete)
  16185. <=WM: (14202: I3 ^predict-no N1011)
  16186. --- Firing Productions (IE) For State At Depth 1 ---
  16187. --- Inner Elaboration Phase, active level 1 (S1) ---
  16188. Firing monitor*world
  16189. -->
  16190. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16191. --- Change Working Memory (IE) ---
  16192. --- END Application Phase ---
  16193. --- Output Phase ---
  16194. ENV: Agent did: predict-yes for direction R in state State-A
  16195. In State-A moving R
  16196. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16197. predict error 0
  16198. dir: dir isU
  16199. --- END Output Phase ---
  16200. |\---- Input Phase ---
  16201. =>WM: (14220: I2 ^dir U)
  16202. =>WM: (14219: I2 ^reward 1)
  16203. =>WM: (14218: I2 ^see 1)
  16204. =>WM: (14217: N1012 ^status complete)
  16205. <=WM: (14206: I2 ^dir R)
  16206. <=WM: (14205: I2 ^reward 1)
  16207. <=WM: (14204: I2 ^see 0)
  16208. =>WM: (14221: I2 ^level-1 R1-root)
  16209. <=WM: (14207: I2 ^level-1 L0-root)
  16210. --- END Input Phase ---
  16211. --- Proposal Phase ---
  16212. --- Inner Elaboration Phase, active level 1 (S1) ---
  16213. Firing elaborate*copy-see-to-output-link
  16214. -->
  16215. (I3 ^see 1 +)
  16216. Firing elaborate*reward*based*on*reward
  16217. -->
  16218. (R1016 ^value 1 +)
  16219. (R1 ^reward R1016 +)
  16220. Firing propose*predict-yes
  16221. -->
  16222. (O2025 ^name predict-yes +)
  16223. (S1 ^operator O2025 +)
  16224. Firing propose*predict-no
  16225. -->
  16226. (O2026 ^name predict-no +)
  16227. (S1 ^operator O2026 +)
  16228. Firing rl*prefer*rvt*predict-no*H0*6
  16229. -->
  16230. (S1 ^operator O2024 = 0.9999999999999999)
  16231. Firing rl*prefer*rvt*predict-yes*H0*5
  16232. -->
  16233. (S1 ^operator O2023 = 0.)
  16234. Firing prefer*rvt*predict-yes*H0
  16235. -->
  16236. Firing prefer*rvt*predict-no*H0
  16237. -->
  16238. Firing elaborate*copy-dir-to-output-link
  16239. -->
  16240. (I3 ^dir U +)
  16241. inner elaboration loop at bottom goal.
  16242. Retracting elaborate*copy-see-to-output-link
  16243. -->
  16244. (I3 ^see 0 +)
  16245. Retracting propose*predict-no
  16246. -->
  16247. (O2024 ^name predict-no +)
  16248. (S1 ^operator O2024 +)
  16249. Retracting propose*predict-yes
  16250. -->
  16251. (O2023 ^name predict-yes +)
  16252. (S1 ^operator O2023 +)
  16253. Retracting elaborate*reward*based*on*reward
  16254. -->
  16255. (R1015 ^value 1 +)
  16256. (R1 ^reward R1015 +)
  16257. Retracting elaborate*copy-dir-to-output-link
  16258. -->
  16259. (I3 ^dir R +)
  16260. Retracting rl*prefer*rvt*predict-no*H0*4
  16261. -->
  16262. (S1 ^operator O2024 = 0.1269768790760836)
  16263. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  16264. -->
  16265. (S1 ^operator O2024 = 0.4910065094545203)
  16266. Retracting rl*prefer*rvt*predict-yes*H0*3
  16267. -->
  16268. (S1 ^operator O2023 = 0.3829338060190436)
  16269. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  16270. -->
  16271. (S1 ^operator O2023 = 0.6170744569149269)
  16272. =>WM: (14229: S1 ^operator O2026 +)
  16273. =>WM: (14228: S1 ^operator O2025 +)
  16274. =>WM: (14227: I3 ^dir U)
  16275. =>WM: (14226: O2026 ^name predict-no)
  16276. =>WM: (14225: O2025 ^name predict-yes)
  16277. =>WM: (14224: R1016 ^value 1)
  16278. =>WM: (14223: R1 ^reward R1016)
  16279. =>WM: (14222: I3 ^see 1)
  16280. <=WM: (14213: S1 ^operator O2023 +)
  16281. <=WM: (14215: S1 ^operator O2023)
  16282. <=WM: (14214: S1 ^operator O2024 +)
  16283. <=WM: (14212: I3 ^dir R)
  16284. <=WM: (14208: R1 ^reward R1015)
  16285. <=WM: (14154: I3 ^see 0)
  16286. <=WM: (14211: O2024 ^name predict-no)
  16287. <=WM: (14210: O2023 ^name predict-yes)
  16288. <=WM: (14209: R1015 ^value 1)
  16289. --- Inner Elaboration Phase, active level 1 (S1) ---
  16290. Firing prefer*rvt*predict-yes*H0
  16291. -->
  16292. Firing rl*prefer*rvt*predict-yes*H0*5
  16293. -->
  16294. (S1 ^operator O2025 = 0.)
  16295. Firing prefer*rvt*predict-no*H0
  16296. -->
  16297. Firing rl*prefer*rvt*predict-no*H0*6
  16298. -->
  16299. (S1 ^operator O2026 = 0.9999999999999999)
  16300. inner elaboration loop at bottom goal.
  16301. Retracting rl*prefer*rvt*predict-no*H0*6
  16302. -->
  16303. (S1 ^operator O2024 = 0.9999999999999999)
  16304. Retracting rl*prefer*rvt*predict-yes*H0*5
  16305. -->
  16306. (S1 ^operator O2023 = 0.)
  16307. --- END Proposal Phase ---
  16308. --- Decision Phase ---
  16309. RL update rl*prefer*rvt*predict-yes*H0*3 0.673128 -0.290194 0.382934 -> 0.673126 -0.290194 0.382933(R,m,v=1,0.961039,0.0376878)
  16310. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.32688 0.290194 0.617074 -> 0.326879 0.290194 0.617073(R,m,v=1,1,0)
  16311. =>WM: (14230: S1 ^operator O2026)
  16312. 1013: O: O2026 (predict-no)
  16313. --- END Decision Phase ---
  16314. --- Application Phase ---
  16315. --- Firing Productions (PE) For State At Depth 1 ---
  16316. --- Inner Elaboration Phase, active level 1 (S1) ---
  16317. Firing apply*operator
  16318. -->
  16319. (I3 ^predict-no N1013 + :O )
  16320. Firing apply*operator*complete
  16321. -->
  16322. (I3 ^predict-yes N1012 - :O )
  16323. inner elaboration loop at bottom goal.
  16324. --- Change Working Memory (PE) ---
  16325. =>WM: (14231: I3 ^predict-no N1013)
  16326. <=WM: (14217: N1012 ^status complete)
  16327. <=WM: (14216: I3 ^predict-yes N1012)
  16328. --- Firing Productions (IE) For State At Depth 1 ---
  16329. --- Inner Elaboration Phase, active level 1 (S1) ---
  16330. Firing monitor*world
  16331. -->
  16332. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16333. --- Change Working Memory (IE) ---
  16334. --- END Application Phase ---
  16335. --- Output Phase ---
  16336. ENV: Agent did: predict-no for direction U in state State-B
  16337. In State-B moving U
  16338. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16339. predict error 0
  16340. dir: dir isL
  16341. --- END Output Phase ---
  16342. --- Input Phase ---
  16343. =>WM: (14235: I2 ^dir L)
  16344. =>WM: (14234: I2 ^reward 1)
  16345. =>WM: (14233: I2 ^see 0)
  16346. =>WM: (14232: N1013 ^status complete)
  16347. <=WM: (14220: I2 ^dir U)
  16348. <=WM: (14219: I2 ^reward 1)
  16349. <=WM: (14218: I2 ^see 1)
  16350. =>WM: (14236: I2 ^level-1 R1-root)
  16351. <=WM: (14221: I2 ^level-1 R1-root)
  16352. --- END Input Phase ---
  16353. --- Proposal Phase ---
  16354. --- Inner Elaboration Phase, active level 1 (S1) ---
  16355. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  16356. -->
  16357. (S1 ^operator O2025 = 0.4768782315732822)
  16358. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  16359. -->
  16360. (S1 ^operator O2026 = -0.01194930198035649)
  16361. Firing prefer*rvt*predict-no*H0*2*H1
  16362. -->
  16363. Firing prefer*rvt*predict-yes*H0*1*H1
  16364. -->
  16365. Firing elaborate*copy-see-to-output-link
  16366. -->
  16367. (I3 ^see 0 +)
  16368. Firing elaborate*reward*based*on*reward
  16369. -->
  16370. (R1017 ^value 1 +)
  16371. (R1 ^reward R1017 +)
  16372. Firing propose*predict-yes
  16373. -->
  16374. (O2027 ^name predict-yes +)
  16375. (S1 ^operator O2027 +)
  16376. Firing propose*predict-no
  16377. -->
  16378. (O2028 ^name predict-no +)
  16379. (S1 ^operator O2028 +)
  16380. Firing rl*prefer*rvt*predict-no*H0*2
  16381. -->
  16382. (S1 ^operator O2026 = 0.2550133872099196)
  16383. Firing rl*prefer*rvt*predict-yes*H0*1
  16384. -->
  16385. (S1 ^operator O2025 = 0.5231203326136166)
  16386. Firing prefer*rvt*predict-yes*H0
  16387. -->
  16388. Firing prefer*rvt*predict-no*H0
  16389. -->
  16390. Firing elaborate*copy-dir-to-output-link
  16391. -->
  16392. (I3 ^dir L +)
  16393. inner elaboration loop at bottom goal.
  16394. Retracting elaborate*copy-see-to-output-link
  16395. -->
  16396. (I3 ^see 1 +)
  16397. Retracting propose*predict-no
  16398. -->
  16399. (O2026 ^name predict-no +)
  16400. (S1 ^operator O2026 +)
  16401. Retracting propose*predict-yes
  16402. -->
  16403. (O2025 ^name predict-yes +)
  16404. (S1 ^operator O2025 +)
  16405. Retracting elaborate*reward*based*on*reward
  16406. -->
  16407. (R1016 ^value 1 +)
  16408. (R1 ^reward R1016 +)
  16409. Retracting elaborate*copy-dir-to-output-link
  16410. -->
  16411. (I3 ^dir U +)
  16412. Retracting rl*prefer*rvt*predict-no*H0*6
  16413. -->
  16414. (S1 ^operator O2026 = 0.9999999999999999)
  16415. Retracting rl*prefer*rvt*predict-yes*H0*5
  16416. -->
  16417. (S1 ^operator O2025 = 0.)
  16418. =>WM: (14244: S1 ^operator O2028 +)
  16419. =>WM: (14243: S1 ^operator O2027 +)
  16420. =>WM: (14242: I3 ^dir L)
  16421. =>WM: (14241: O2028 ^name predict-no)
  16422. =>WM: (14240: O2027 ^name predict-yes)
  16423. =>WM: (14239: R1017 ^value 1)
  16424. =>WM: (14238: R1 ^reward R1017)
  16425. =>WM: (14237: I3 ^see 0)
  16426. <=WM: (14228: S1 ^operator O2025 +)
  16427. <=WM: (14229: S1 ^operator O2026 +)
  16428. <=WM: (14230: S1 ^operator O2026)
  16429. <=WM: (14227: I3 ^dir U)
  16430. <=WM: (14223: R1 ^reward R1016)
  16431. <=WM: (14222: I3 ^see 1)
  16432. <=WM: (14226: O2026 ^name predict-no)
  16433. <=WM: (14225: O2025 ^name predict-yes)
  16434. <=WM: (14224: R1016 ^value 1)
  16435. --- Inner Elaboration Phase, active level 1 (S1) ---
  16436. Firing prefer*rvt*predict-yes*H0
  16437. -->
  16438. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  16439. -->
  16440. (S1 ^operator O2027 = 0.4768782315732822)
  16441. Firing rl*prefer*rvt*predict-yes*H0*1
  16442. -->
  16443. (S1 ^operator O2027 = 0.5231203326136166)
  16444. Firing prefer*rvt*predict-yes*H0*1*H1
  16445. -->
  16446. Firing prefer*rvt*predict-no*H0
  16447. -->
  16448. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  16449. -->
  16450. (S1 ^operator O2028 = -0.01194930198035649)
  16451. Firing rl*prefer*rvt*predict-no*H0*2
  16452. -->
  16453. (S1 ^operator O2028 = 0.2550133872099196)
  16454. Firing prefer*rvt*predict-no*H0*2*H1
  16455. -->
  16456. inner elaboration loop at bottom goal.
  16457. Retracting rl*prefer*rvt*predict-no*H0*2
  16458. -->
  16459. (S1 ^operator O2026 = 0.2550133872099196)
  16460. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  16461. -->
  16462. (S1 ^operator O2026 = -0.01194930198035649)
  16463. Retracting rl*prefer*rvt*predict-yes*H0*1
  16464. -->
  16465. (S1 ^operator O2025 = 0.5231203326136166)
  16466. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  16467. -->
  16468. (S1 ^operator O2025 = 0.4768782315732822)
  16469. --- END Proposal Phase ---
  16470. --- Decision Phase ---
  16471. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16472. =>WM: (14245: S1 ^operator O2027)
  16473. 1014: O: O2027 (predict-yes)
  16474. --- END Decision Phase ---
  16475. --- Application Phase ---
  16476. --- Firing Productions (PE) For State At Depth 1 ---
  16477. --- Inner Elaboration Phase, active level 1 (S1) ---
  16478. Firing apply*operator
  16479. -->
  16480. (I3 ^predict-yes N1014 + :O )
  16481. Firing apply*operator*complete
  16482. -->
  16483. (I3 ^predict-no N1013 - :O )
  16484. inner elaboration loop at bottom goal.
  16485. --- Change Working Memory (PE) ---
  16486. =>WM: (14246: I3 ^predict-yes N1014)
  16487. <=WM: (14232: N1013 ^status complete)
  16488. <=WM: (14231: I3 ^predict-no N1013)
  16489. --- Firing Productions (IE) For State At Depth 1 ---
  16490. --- Inner Elaboration Phase, active level 1 (S1) ---
  16491. Firing monitor*world
  16492. -->
  16493. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16494. --- Change Working Memory (IE) ---
  16495. --- END Application Phase ---
  16496. --- Output Phase ---
  16497. ENV: Agent did: predict-yes for direction L in state State-B
  16498. In State-B moving L
  16499. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  16500. predict error 0
  16501. dir: dir isR
  16502. --- END Output Phase ---
  16503. --- Input Phase ---
  16504. =>WM: (14250: I2 ^dir R)
  16505. =>WM: (14249: I2 ^reward 1)
  16506. =>WM: (14248: I2 ^see 1)
  16507. =>WM: (14247: N1014 ^status complete)
  16508. <=WM: (14235: I2 ^dir L)
  16509. <=WM: (14234: I2 ^reward 1)
  16510. <=WM: (14233: I2 ^see 0)
  16511. =>WM: (14251: I2 ^level-1 L1-root)
  16512. <=WM: (14236: I2 ^level-1 R1-root)
  16513. --- END Input Phase ---
  16514. --- Proposal Phase ---
  16515. --- Inner Elaboration Phase, active level 1 (S1) ---
  16516. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  16517. -->
  16518. (S1 ^operator O2027 = 0.617033753614766)
  16519. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  16520. -->
  16521. (S1 ^operator O2028 = 0.4901349546100854)
  16522. Firing prefer*rvt*predict-no*H0*4*H1
  16523. -->
  16524. Firing prefer*rvt*predict-yes*H0*3*H1
  16525. -->
  16526. Firing elaborate*copy-see-to-output-link
  16527. -->
  16528. (I3 ^see 1 +)
  16529. Firing elaborate*reward*based*on*reward
  16530. -->
  16531. (R1018 ^value 1 +)
  16532. (R1 ^reward R1018 +)
  16533. Firing propose*predict-yes
  16534. -->
  16535. (O2029 ^name predict-yes +)
  16536. (S1 ^operator O2029 +)
  16537. Firing propose*predict-no
  16538. -->
  16539. (O2030 ^name predict-no +)
  16540. (S1 ^operator O2030 +)
  16541. Firing rl*prefer*rvt*predict-no*H0*4
  16542. -->
  16543. (S1 ^operator O2028 = 0.1269768790760836)
  16544. Firing rl*prefer*rvt*predict-yes*H0*3
  16545. -->
  16546. (S1 ^operator O2027 = 0.382932566578948)
  16547. Firing prefer*rvt*predict-yes*H0
  16548. -->
  16549. Firing prefer*rvt*predict-no*H0
  16550. -->
  16551. Firing elaborate*copy-dir-to-output-link
  16552. -->
  16553. (I3 ^dir R +)
  16554. inner elaboration loop at bottom goal.
  16555. Retracting elaborate*copy-see-to-output-link
  16556. -->
  16557. (I3 ^see 0 +)
  16558. Retracting propose*predict-no
  16559. -->
  16560. (O2028 ^name predict-no +)
  16561. (S1 ^operator O2028 +)
  16562. Retracting propose*predict-yes
  16563. -->
  16564. (O2027 ^name predict-yes +)
  16565. (S1 ^operator O2027 +)
  16566. Retracting elaborate*reward*based*on*reward
  16567. -->
  16568. (R1017 ^value 1 +)
  16569. (R1 ^reward R1017 +)
  16570. Retracting elaborate*copy-dir-to-output-link
  16571. -->
  16572. (I3 ^dir L +)
  16573. Retracting rl*prefer*rvt*predict-no*H0*2
  16574. -->
  16575. (S1 ^operator O2028 = 0.2550133872099196)
  16576. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  16577. -->
  16578. (S1 ^operator O2028 = -0.01194930198035649)
  16579. Retracting rl*prefer*rvt*predict-yes*H0*1
  16580. -->
  16581. (S1 ^operator O2027 = 0.5231203326136166)
  16582. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  16583. -->
  16584. (S1 ^operator O2027 = 0.4768782315732822)
  16585. =>WM: (14259: S1 ^operator O2030 +)
  16586. =>WM: (14258: S1 ^operator O2029 +)
  16587. =>WM: (14257: I3 ^dir R)
  16588. =>WM: (14256: O2030 ^name predict-no)
  16589. =>WM: (14255: O2029 ^name predict-yes)
  16590. =>WM: (14254: R1018 ^value 1)
  16591. =>WM: (14253: R1 ^reward R1018)
  16592. =>WM: (14252: I3 ^see 1)
  16593. <=WM: (14243: S1 ^operator O2027 +)
  16594. <=WM: (14245: S1 ^operator O2027)
  16595. <=WM: (14244: S1 ^operator O2028 +)
  16596. <=WM: (14242: I3 ^dir L)
  16597. <=WM: (14238: R1 ^reward R1017)
  16598. <=WM: (14237: I3 ^see 0)
  16599. <=WM: (14241: O2028 ^name predict-no)
  16600. <=WM: (14240: O2027 ^name predict-yes)
  16601. <=WM: (14239: R1017 ^value 1)
  16602. --- Inner Elaboration Phase, active level 1 (S1) ---
  16603. Firing prefer*rvt*predict-yes*H0
  16604. -->
  16605. Firing rl*prefer*rvt*predict-yes*H0*3
  16606. -->
  16607. (S1 ^operator O2029 = 0.382932566578948)
  16608. Firing prefer*rvt*predict-yes*H0*3*H1
  16609. -->
  16610. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  16611. -->
  16612. (S1 ^operator O2029 = 0.617033753614766)
  16613. Firing prefer*rvt*predict-no*H0
  16614. -->
  16615. Firing rl*prefer*rvt*predict-no*H0*4
  16616. -->
  16617. (S1 ^operator O2030 = 0.1269768790760836)
  16618. Firing prefer*rvt*predict-no*H0*4*H1
  16619. -->
  16620. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  16621. -->
  16622. (S1 ^operator O2030 = 0.4901349546100854)
  16623. inner elaboration loop at bottom goal.
  16624. Retracting rl*prefer*rvt*predict-no*H0*4
  16625. -->
  16626. (S1 ^operator O2028 = 0.1269768790760836)
  16627. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  16628. -->
  16629. (S1 ^operator O2028 = 0.4901349546100854)
  16630. Retracting rl*prefer*rvt*predict-yes*H0*3
  16631. -->
  16632. (S1 ^operator O2027 = 0.382932566578948)
  16633. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  16634. -->
  16635. (S1 ^operator O2027 = 0.617033753614766)
  16636. --- END Proposal Phase ---
  16637. --- Decision Phase ---
  16638. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.727961 -0.20484 0.523121(R,m,v=1,0.979167,0.020542)
  16639. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272038 0.20484 0.476878 -> 0.272038 0.20484 0.476878(R,m,v=1,1,0)
  16640. =>WM: (14260: S1 ^operator O2029)
  16641. 1015: O: O2029 (predict-yes)
  16642. --- END Decision Phase ---
  16643. --- Application Phase ---
  16644. --- Firing Productions (PE) For State At Depth 1 ---
  16645. --- Inner Elaboration Phase, active level 1 (S1) ---
  16646. Firing apply*operator
  16647. -->
  16648. (I3 ^predict-yes N1015 + :O )
  16649. Firing apply*operator*complete
  16650. -->
  16651. (I3 ^predict-yes N1014 - :O )
  16652. inner elaboration loop at bottom goal.
  16653. --- Change Working Memory (PE) ---
  16654. =>WM: (14261: I3 ^predict-yes N1015)
  16655. <=WM: (14247: N1014 ^status complete)
  16656. <=WM: (14246: I3 ^predict-yes N1014)
  16657. --- Firing Productions (IE) For State At Depth 1 ---
  16658. --- Inner Elaboration Phase, active level 1 (S1) ---
  16659. Firing monitor*world
  16660. -->
  16661. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16662. --- Change Working Memory (IE) ---
  16663. --- END Application Phase ---
  16664. --- Output Phase ---
  16665. ENV: Agent did: predict-yes for direction R in state State-A
  16666. In State-A moving R
  16667. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16668. predict error 0
  16669. dir: dir isR
  16670. --- END Output Phase ---
  16671. --- Input Phase ---
  16672. =>WM: (14265: I2 ^dir R)
  16673. =>WM: (14264: I2 ^reward 1)
  16674. =>WM: (14263: I2 ^see 1)
  16675. =>WM: (14262: N1015 ^status complete)
  16676. <=WM: (14250: I2 ^dir R)
  16677. <=WM: (14249: I2 ^reward 1)
  16678. <=WM: (14248: I2 ^see 1)
  16679. =>WM: (14266: I2 ^level-1 R1-root)
  16680. <=WM: (14251: I2 ^level-1 L1-root)
  16681. --- END Input Phase ---
  16682. --- Proposal Phase ---
  16683. --- Inner Elaboration Phase, active level 1 (S1) ---
  16684. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  16685. -->
  16686. (S1 ^operator O2029 = 0.08783148430849691)
  16687. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  16688. -->
  16689. (S1 ^operator O2030 = 0.8730233883813352)
  16690. Firing prefer*rvt*predict-no*H0*4*H1
  16691. -->
  16692. Firing prefer*rvt*predict-yes*H0*3*H1
  16693. -->
  16694. Firing elaborate*copy-see-to-output-link
  16695. -->
  16696. (I3 ^see 1 +)
  16697. Firing elaborate*reward*based*on*reward
  16698. -->
  16699. (R1019 ^value 1 +)
  16700. (R1 ^reward R1019 +)
  16701. Firing propose*predict-yes
  16702. -->
  16703. (O2031 ^name predict-yes +)
  16704. (S1 ^operator O2031 +)
  16705. Firing propose*predict-no
  16706. -->
  16707. (O2032 ^name predict-no +)
  16708. (S1 ^operator O2032 +)
  16709. Firing rl*prefer*rvt*predict-no*H0*4
  16710. -->
  16711. (S1 ^operator O2030 = 0.1269768790760836)
  16712. Firing rl*prefer*rvt*predict-yes*H0*3
  16713. -->
  16714. (S1 ^operator O2029 = 0.382932566578948)
  16715. Firing prefer*rvt*predict-yes*H0
  16716. -->
  16717. Firing prefer*rvt*predict-no*H0
  16718. -->
  16719. Firing elaborate*copy-dir-to-output-link
  16720. -->
  16721. (I3 ^dir R +)
  16722. inner elaboration loop at bottom goal.
  16723. Retracting elaborate*copy-see-to-output-link
  16724. -->
  16725. (I3 ^see 1 +)
  16726. Retracting propose*predict-no
  16727. -->
  16728. (O2030 ^name predict-no +)
  16729. (S1 ^operator O2030 +)
  16730. Retracting propose*predict-yes
  16731. -->
  16732. (O2029 ^name predict-yes +)
  16733. (S1 ^operator O2029 +)
  16734. Retracting elaborate*reward*based*on*reward
  16735. -->
  16736. (R1018 ^value 1 +)
  16737. (R1 ^reward R1018 +)
  16738. Retracting elaborate*copy-dir-to-output-link
  16739. -->
  16740. (I3 ^dir R +)
  16741. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  16742. -->
  16743. (S1 ^operator O2030 = 0.4901349546100854)
  16744. Retracting rl*prefer*rvt*predict-no*H0*4
  16745. -->
  16746. (S1 ^operator O2030 = 0.1269768790760836)
  16747. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  16748. -->
  16749. (S1 ^operator O2029 = 0.617033753614766)
  16750. Retracting rl*prefer*rvt*predict-yes*H0*3
  16751. -->
  16752. (S1 ^operator O2029 = 0.382932566578948)
  16753. =>WM: (14272: S1 ^operator O2032 +)
  16754. =>WM: (14271: S1 ^operator O2031 +)
  16755. =>WM: (14270: O2032 ^name predict-no)
  16756. =>WM: (14269: O2031 ^name predict-yes)
  16757. =>WM: (14268: R1019 ^value 1)
  16758. =>WM: (14267: R1 ^reward R1019)
  16759. <=WM: (14258: S1 ^operator O2029 +)
  16760. <=WM: (14260: S1 ^operator O2029)
  16761. <=WM: (14259: S1 ^operator O2030 +)
  16762. <=WM: (14253: R1 ^reward R1018)
  16763. <=WM: (14256: O2030 ^name predict-no)
  16764. <=WM: (14255: O2029 ^name predict-yes)
  16765. <=WM: (14254: R1018 ^value 1)
  16766. --- Inner Elaboration Phase, active level 1 (S1) ---
  16767. Firing prefer*rvt*predict-yes*H0
  16768. -->
  16769. Firing rl*prefer*rvt*predict-yes*H0*3
  16770. -->
  16771. (S1 ^operator O2031 = 0.382932566578948)
  16772. Firing prefer*rvt*predict-yes*H0*3*H1
  16773. -->
  16774. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  16775. -->
  16776. (S1 ^operator O2031 = 0.08783148430849691)
  16777. Firing prefer*rvt*predict-no*H0
  16778. -->
  16779. Firing rl*prefer*rvt*predict-no*H0*4
  16780. -->
  16781. (S1 ^operator O2032 = 0.1269768790760836)
  16782. Firing prefer*rvt*predict-no*H0*4*H1
  16783. -->
  16784. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  16785. -->
  16786. (S1 ^operator O2032 = 0.8730233883813352)
  16787. inner elaboration loop at bottom goal.
  16788. Retracting rl*prefer*rvt*predict-no*H0*4
  16789. -->
  16790. (S1 ^operator O2030 = 0.1269768790760836)
  16791. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  16792. -->
  16793. (S1 ^operator O2030 = 0.8730233883813352)
  16794. Retracting rl*prefer*rvt*predict-yes*H0*3
  16795. -->
  16796. (S1 ^operator O2029 = 0.382932566578948)
  16797. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  16798. -->
  16799. (S1 ^operator O2029 = 0.08783148430849691)
  16800. --- END Proposal Phase ---
  16801. --- Decision Phase ---
  16802. RL update rl*prefer*rvt*predict-yes*H0*3 0.673126 -0.290194 0.382933 -> 0.673131 -0.290193 0.382938(R,m,v=1,0.96129,0.0374529)
  16803. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326843 0.290191 0.617034 -> 0.326848 0.290191 0.617039(R,m,v=1,1,0)
  16804. =>WM: (14273: S1 ^operator O2032)
  16805. 1016: O: O2032 (predict-no)
  16806. --- END Decision Phase ---
  16807. --- Application Phase ---
  16808. --- Firing Productions (PE) For State At Depth 1 ---
  16809. --- Inner Elaboration Phase, active level 1 (S1) ---
  16810. Firing apply*operator
  16811. -->
  16812. (I3 ^predict-no N1016 + :O )
  16813. Firing apply*operator*complete
  16814. -->
  16815. (I3 ^predict-yes N1015 - :O )
  16816. inner elaboration loop at bottom goal.
  16817. --- Change Working Memory (PE) ---
  16818. =>WM: (14274: I3 ^predict-no N1016)
  16819. <=WM: (14262: N1015 ^status complete)
  16820. <=WM: (14261: I3 ^predict-yes N1015)
  16821. --- Firing Productions (IE) For State At Depth 1 ---
  16822. --- Inner Elaboration Phase, active level 1 (S1) ---
  16823. Firing monitor*world
  16824. -->
  16825. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16826. --- Change Working Memory (IE) ---
  16827. --- END Application Phase ---
  16828. --- Output Phase ---
  16829. ENV: Agent did: predict-no for direction R in state State-B
  16830. In State-B moving R
  16831. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16832. predict error 0
  16833. dir: dir isL
  16834. --- END Output Phase ---
  16835. --- Input Phase ---
  16836. =>WM: (14278: I2 ^dir L)
  16837. =>WM: (14277: I2 ^reward 1)
  16838. =>WM: (14276: I2 ^see 0)
  16839. =>WM: (14275: N1016 ^status complete)
  16840. <=WM: (14265: I2 ^dir R)
  16841. <=WM: (14264: I2 ^reward 1)
  16842. <=WM: (14263: I2 ^see 1)
  16843. =>WM: (14279: I2 ^level-1 R0-root)
  16844. <=WM: (14266: I2 ^level-1 R1-root)
  16845. --- END Input Phase ---
  16846. --- Proposal Phase ---
  16847. --- Inner Elaboration Phase, active level 1 (S1) ---
  16848. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  16849. -->
  16850. (S1 ^operator O2031 = 0.476882807646731)
  16851. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  16852. -->
  16853. (S1 ^operator O2032 = 0.1700769046561409)
  16854. Firing prefer*rvt*predict-no*H0*2*H1
  16855. -->
  16856. Firing prefer*rvt*predict-yes*H0*1*H1
  16857. -->
  16858. Firing elaborate*copy-see-to-output-link
  16859. -->
  16860. (I3 ^see 0 +)
  16861. Firing elaborate*reward*based*on*reward
  16862. -->
  16863. (R1020 ^value 1 +)
  16864. (R1 ^reward R1020 +)
  16865. Firing propose*predict-yes
  16866. -->
  16867. (O2033 ^name predict-yes +)
  16868. (S1 ^operator O2033 +)
  16869. Firing propose*predict-no
  16870. -->
  16871. (O2034 ^name predict-no +)
  16872. (S1 ^operator O2034 +)
  16873. Firing rl*prefer*rvt*predict-no*H0*2
  16874. -->
  16875. (S1 ^operator O2032 = 0.2550133872099196)
  16876. Firing rl*prefer*rvt*predict-yes*H0*1
  16877. -->
  16878. (S1 ^operator O2031 = 0.5231205479855817)
  16879. Firing prefer*rvt*predict-yes*H0
  16880. -->
  16881. Firing prefer*rvt*predict-no*H0
  16882. -->
  16883. Firing elaborate*copy-dir-to-output-link
  16884. -->
  16885. (I3 ^dir L +)
  16886. inner elaboration loop at bottom goal.
  16887. Retracting elaborate*copy-see-to-output-link
  16888. -->
  16889. (I3 ^see 1 +)
  16890. Retracting propose*predict-no
  16891. -->
  16892. (O2032 ^name predict-no +)
  16893. (S1 ^operator O2032 +)
  16894. Retracting propose*predict-yes
  16895. -->
  16896. (O2031 ^name predict-yes +)
  16897. (S1 ^operator O2031 +)
  16898. Retracting elaborate*reward*based*on*reward
  16899. -->
  16900. (R1019 ^value 1 +)
  16901. (R1 ^reward R1019 +)
  16902. Retracting elaborate*copy-dir-to-output-link
  16903. -->
  16904. (I3 ^dir R +)
  16905. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  16906. -->
  16907. (S1 ^operator O2032 = 0.8730233883813352)
  16908. Retracting rl*prefer*rvt*predict-no*H0*4
  16909. -->
  16910. (S1 ^operator O2032 = 0.1269768790760836)
  16911. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  16912. -->
  16913. (S1 ^operator O2031 = 0.08783148430849691)
  16914. Retracting rl*prefer*rvt*predict-yes*H0*3
  16915. -->
  16916. (S1 ^operator O2031 = 0.3829376185498909)
  16917. =>WM: (14287: S1 ^operator O2034 +)
  16918. =>WM: (14286: S1 ^operator O2033 +)
  16919. =>WM: (14285: I3 ^dir L)
  16920. =>WM: (14284: O2034 ^name predict-no)
  16921. =>WM: (14283: O2033 ^name predict-yes)
  16922. =>WM: (14282: R1020 ^value 1)
  16923. =>WM: (14281: R1 ^reward R1020)
  16924. =>WM: (14280: I3 ^see 0)
  16925. <=WM: (14271: S1 ^operator O2031 +)
  16926. <=WM: (14272: S1 ^operator O2032 +)
  16927. <=WM: (14273: S1 ^operator O2032)
  16928. <=WM: (14257: I3 ^dir R)
  16929. <=WM: (14267: R1 ^reward R1019)
  16930. <=WM: (14252: I3 ^see 1)
  16931. <=WM: (14270: O2032 ^name predict-no)
  16932. <=WM: (14269: O2031 ^name predict-yes)
  16933. <=WM: (14268: R1019 ^value 1)
  16934. --- Inner Elaboration Phase, active level 1 (S1) ---
  16935. Firing prefer*rvt*predict-yes*H0
  16936. -->
  16937. Firing rl*prefer*rvt*predict-yes*H0*1
  16938. -->
  16939. (S1 ^operator O2033 = 0.5231205479855817)
  16940. Firing prefer*rvt*predict-yes*H0*1*H1
  16941. -->
  16942. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  16943. -->
  16944. (S1 ^operator O2033 = 0.476882807646731)
  16945. Firing prefer*rvt*predict-no*H0
  16946. -->
  16947. Firing rl*prefer*rvt*predict-no*H0*2
  16948. -->
  16949. (S1 ^operator O2034 = 0.2550133872099196)
  16950. Firing prefer*rvt*predict-no*H0*2*H1
  16951. -->
  16952. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  16953. -->
  16954. (S1 ^operator O2034 = 0.1700769046561409)
  16955. inner elaboration loop at bottom goal.
  16956. Retracting rl*prefer*rvt*predict-no*H0*2
  16957. -->
  16958. (S1 ^operator O2032 = 0.2550133872099196)
  16959. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  16960. -->
  16961. (S1 ^operator O2032 = 0.1700769046561409)
  16962. Retracting rl*prefer*rvt*predict-yes*H0*1
  16963. -->
  16964. (S1 ^operator O2031 = 0.5231205479855817)
  16965. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  16966. -->
  16967. (S1 ^operator O2031 = 0.476882807646731)
  16968. --- END Proposal Phase ---
  16969. --- Decision Phase ---
  16970. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.949721,0.0480196)
  16971. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  16972. =>WM: (14288: S1 ^operator O2033)
  16973. 1017: O: O2033 (predict-yes)
  16974. --- END Decision Phase ---
  16975. --- Application Phase ---
  16976. --- Firing Productions (PE) For State At Depth 1 ---
  16977. --- Inner Elaboration Phase, active level 1 (S1) ---
  16978. Firing apply*operator
  16979. -->
  16980. (I3 ^predict-yes N1017 + :O )
  16981. Firing apply*operator*complete
  16982. -->
  16983. (I3 ^predict-no N1016 - :O )
  16984. inner elaboration loop at bottom goal.
  16985. --- Change Working Memory (PE) ---
  16986. =>WM: (14289: I3 ^predict-yes N1017)
  16987. <=WM: (14275: N1016 ^status complete)
  16988. <=WM: (14274: I3 ^predict-no N1016)
  16989. --- Firing Productions (IE) For State At Depth 1 ---
  16990. --- Inner Elaboration Phase, active level 1 (S1) ---
  16991. Firing monitor*world
  16992. -->
  16993. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16994. --- Change Working Memory (IE) ---
  16995. --- END Application Phase ---
  16996. --- Output Phase ---
  16997. ENV: Agent did: predict-yes for direction L in state State-B
  16998. In State-B moving L
  16999. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17000. predict error 0
  17001. dir: dir isR
  17002. --- END Output Phase ---
  17003. --- Input Phase ---
  17004. =>WM: (14293: I2 ^dir R)
  17005. =>WM: (14292: I2 ^reward 1)
  17006. =>WM: (14291: I2 ^see 1)
  17007. =>WM: (14290: N1017 ^status complete)
  17008. <=WM: (14278: I2 ^dir L)
  17009. <=WM: (14277: I2 ^reward 1)
  17010. <=WM: (14276: I2 ^see 0)
  17011. =>WM: (14294: I2 ^level-1 L1-root)
  17012. <=WM: (14279: I2 ^level-1 R0-root)
  17013. --- END Input Phase ---
  17014. --- Proposal Phase ---
  17015. --- Inner Elaboration Phase, active level 1 (S1) ---
  17016. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17017. -->
  17018. (S1 ^operator O2033 = 0.6170388055857089)
  17019. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17020. -->
  17021. (S1 ^operator O2034 = 0.4901349546100854)
  17022. Firing prefer*rvt*predict-no*H0*4*H1
  17023. -->
  17024. Firing prefer*rvt*predict-yes*H0*3*H1
  17025. -->
  17026. Firing elaborate*copy-see-to-output-link
  17027. -->
  17028. (I3 ^see 1 +)
  17029. Firing elaborate*reward*based*on*reward
  17030. -->
  17031. (R1021 ^value 1 +)
  17032. (R1 ^reward R1021 +)
  17033. Firing propose*predict-yes
  17034. -->
  17035. (O2035 ^name predict-yes +)
  17036. (S1 ^operator O2035 +)
  17037. Firing propose*predict-no
  17038. -->
  17039. (O2036 ^name predict-no +)
  17040. (S1 ^operator O2036 +)
  17041. Firing rl*prefer*rvt*predict-no*H0*4
  17042. -->
  17043. (S1 ^operator O2034 = 0.1269768389574707)
  17044. Firing rl*prefer*rvt*predict-yes*H0*3
  17045. -->
  17046. (S1 ^operator O2033 = 0.3829376185498909)
  17047. Firing prefer*rvt*predict-yes*H0
  17048. -->
  17049. Firing prefer*rvt*predict-no*H0
  17050. -->
  17051. Firing elaborate*copy-dir-to-output-link
  17052. -->
  17053. (I3 ^dir R +)
  17054. inner elaboration loop at bottom goal.
  17055. Retracting elaborate*copy-see-to-output-link
  17056. -->
  17057. (I3 ^see 0 +)
  17058. Retracting propose*predict-no
  17059. -->
  17060. (O2034 ^name predict-no +)
  17061. (S1 ^operator O2034 +)
  17062. Retracting propose*predict-yes
  17063. -->
  17064. (O2033 ^name predict-yes +)
  17065. (S1 ^operator O2033 +)
  17066. Retracting elaborate*reward*based*on*reward
  17067. -->
  17068. (R1020 ^value 1 +)
  17069. (R1 ^reward R1020 +)
  17070. Retracting elaborate*copy-dir-to-output-link
  17071. -->
  17072. (I3 ^dir L +)
  17073. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  17074. -->
  17075. (S1 ^operator O2034 = 0.1700769046561409)
  17076. Retracting rl*prefer*rvt*predict-no*H0*2
  17077. -->
  17078. (S1 ^operator O2034 = 0.2550133872099196)
  17079. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  17080. -->
  17081. (S1 ^operator O2033 = 0.476882807646731)
  17082. Retracting rl*prefer*rvt*predict-yes*H0*1
  17083. -->
  17084. (S1 ^operator O2033 = 0.5231205479855817)
  17085. =>WM: (14302: S1 ^operator O2036 +)
  17086. =>WM: (14301: S1 ^operator O2035 +)
  17087. =>WM: (14300: I3 ^dir R)
  17088. =>WM: (14299: O2036 ^name predict-no)
  17089. =>WM: (14298: O2035 ^name predict-yes)
  17090. =>WM: (14297: R1021 ^value 1)
  17091. =>WM: (14296: R1 ^reward R1021)
  17092. =>WM: (14295: I3 ^see 1)
  17093. <=WM: (14286: S1 ^operator O2033 +)
  17094. <=WM: (14288: S1 ^operator O2033)
  17095. <=WM: (14287: S1 ^operator O2034 +)
  17096. <=WM: (14285: I3 ^dir L)
  17097. <=WM: (14281: R1 ^reward R1020)
  17098. <=WM: (14280: I3 ^see 0)
  17099. <=WM: (14284: O2034 ^name predict-no)
  17100. <=WM: (14283: O2033 ^name predict-yes)
  17101. <=WM: (14282: R1020 ^value 1)
  17102. --- Inner Elaboration Phase, active level 1 (S1) ---
  17103. Firing prefer*rvt*predict-yes*H0
  17104. -->
  17105. Firing rl*prefer*rvt*predict-yes*H0*3
  17106. -->
  17107. (S1 ^operator O2035 = 0.3829376185498909)
  17108. Firing prefer*rvt*predict-yes*H0*3*H1
  17109. -->
  17110. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17111. -->
  17112. (S1 ^operator O2035 = 0.6170388055857089)
  17113. Firing prefer*rvt*predict-no*H0
  17114. -->
  17115. Firing rl*prefer*rvt*predict-no*H0*4
  17116. -->
  17117. (S1 ^operator O2036 = 0.1269768389574707)
  17118. Firing prefer*rvt*predict-no*H0*4*H1
  17119. -->
  17120. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17121. -->
  17122. (S1 ^operator O2036 = 0.4901349546100854)
  17123. inner elaboration loop at bottom goal.
  17124. Retracting rl*prefer*rvt*predict-no*H0*4
  17125. -->
  17126. (S1 ^operator O2034 = 0.1269768389574707)
  17127. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  17128. -->
  17129. (S1 ^operator O2034 = 0.4901349546100854)
  17130. Retracting rl*prefer*rvt*predict-yes*H0*3
  17131. -->
  17132. (S1 ^operator O2033 = 0.3829376185498909)
  17133. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  17134. -->
  17135. (S1 ^operator O2033 = 0.6170388055857089)
  17136. --- END Proposal Phase ---
  17137. --- Decision Phase ---
  17138. RL update rl*prefer*rvt*predict-yes*H0*1 0.727961 -0.20484 0.523121 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.97931,0.0204023)
  17139. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272044 0.204839 0.476883 -> 0.272043 0.204839 0.476882(R,m,v=1,1,0)
  17140. =>WM: (14303: S1 ^operator O2035)
  17141. 1018: O: O2035 (predict-yes)
  17142. --- END Decision Phase ---
  17143. --- Application Phase ---
  17144. --- Firing Productions (PE) For State At Depth 1 ---
  17145. --- Inner Elaboration Phase, active level 1 (S1) ---
  17146. Firing apply*operator
  17147. -->
  17148. (I3 ^predict-yes N1018 + :O )
  17149. Firing apply*operator*complete
  17150. -->
  17151. (I3 ^predict-yes N1017 - :O )
  17152. inner elaboration loop at bottom goal.
  17153. --- Change Working Memory (PE) ---
  17154. =>WM: (14304: I3 ^predict-yes N1018)
  17155. <=WM: (14290: N1017 ^status complete)
  17156. <=WM: (14289: I3 ^predict-yes N1017)
  17157. --- Firing Productions (IE) For State At Depth 1 ---
  17158. --- Inner Elaboration Phase, active level 1 (S1) ---
  17159. Firing monitor*world
  17160. -->
  17161. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17162. --- Change Working Memory (IE) ---
  17163. --- END Application Phase ---
  17164. --- Output Phase ---
  17165. ENV: Agent did: predict-yes for direction R in state State-A
  17166. In State-A moving R
  17167. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17168. predict error 0
  17169. dir: dir isL
  17170. --- END Output Phase ---
  17171. --- Input Phase ---
  17172. =>WM: (14308: I2 ^dir L)
  17173. =>WM: (14307: I2 ^reward 1)
  17174. =>WM: (14306: I2 ^see 1)
  17175. =>WM: (14305: N1018 ^status complete)
  17176. <=WM: (14293: I2 ^dir R)
  17177. <=WM: (14292: I2 ^reward 1)
  17178. <=WM: (14291: I2 ^see 1)
  17179. =>WM: (14309: I2 ^level-1 R1-root)
  17180. <=WM: (14294: I2 ^level-1 L1-root)
  17181. --- END Input Phase ---
  17182. --- Proposal Phase ---
  17183. --- Inner Elaboration Phase, active level 1 (S1) ---
  17184. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  17185. -->
  17186. (S1 ^operator O2035 = 0.4768784469452474)
  17187. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  17188. -->
  17189. (S1 ^operator O2036 = -0.01194930198035649)
  17190. Firing prefer*rvt*predict-no*H0*2*H1
  17191. -->
  17192. Firing prefer*rvt*predict-yes*H0*1*H1
  17193. -->
  17194. Firing elaborate*copy-see-to-output-link
  17195. -->
  17196. (I3 ^see 1 +)
  17197. Firing elaborate*reward*based*on*reward
  17198. -->
  17199. (R1022 ^value 1 +)
  17200. (R1 ^reward R1022 +)
  17201. Firing propose*predict-yes
  17202. -->
  17203. (O2037 ^name predict-yes +)
  17204. (S1 ^operator O2037 +)
  17205. Firing propose*predict-no
  17206. -->
  17207. (O2038 ^name predict-no +)
  17208. (S1 ^operator O2038 +)
  17209. Firing rl*prefer*rvt*predict-no*H0*2
  17210. -->
  17211. (S1 ^operator O2036 = 0.2550133872099196)
  17212. Firing rl*prefer*rvt*predict-yes*H0*1
  17213. -->
  17214. (S1 ^operator O2035 = 0.5231200446407348)
  17215. Firing prefer*rvt*predict-yes*H0
  17216. -->
  17217. Firing prefer*rvt*predict-no*H0
  17218. -->
  17219. Firing elaborate*copy-dir-to-output-link
  17220. -->
  17221. (I3 ^dir L +)
  17222. inner elaboration loop at bottom goal.
  17223. Retracting elaborate*copy-see-to-output-link
  17224. -->
  17225. (I3 ^see 1 +)
  17226. Retracting propose*predict-no
  17227. -->
  17228. (O2036 ^name predict-no +)
  17229. (S1 ^operator O2036 +)
  17230. Retracting propose*predict-yes
  17231. -->
  17232. (O2035 ^name predict-yes +)
  17233. (S1 ^operator O2035 +)
  17234. Retracting elaborate*reward*based*on*reward
  17235. -->
  17236. (R1021 ^value 1 +)
  17237. (R1 ^reward R1021 +)
  17238. Retracting elaborate*copy-dir-to-output-link
  17239. -->
  17240. (I3 ^dir R +)
  17241. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  17242. -->
  17243. (S1 ^operator O2036 = 0.4901349546100854)
  17244. Retracting rl*prefer*rvt*predict-no*H0*4
  17245. -->
  17246. (S1 ^operator O2036 = 0.1269768389574707)
  17247. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  17248. -->
  17249. (S1 ^operator O2035 = 0.6170388055857089)
  17250. Retracting rl*prefer*rvt*predict-yes*H0*3
  17251. -->
  17252. (S1 ^operator O2035 = 0.3829376185498909)
  17253. =>WM: (14316: S1 ^operator O2038 +)
  17254. =>WM: (14315: S1 ^operator O2037 +)
  17255. =>WM: (14314: I3 ^dir L)
  17256. =>WM: (14313: O2038 ^name predict-no)
  17257. =>WM: (14312: O2037 ^name predict-yes)
  17258. =>WM: (14311: R1022 ^value 1)
  17259. =>WM: (14310: R1 ^reward R1022)
  17260. <=WM: (14301: S1 ^operator O2035 +)
  17261. <=WM: (14303: S1 ^operator O2035)
  17262. <=WM: (14302: S1 ^operator O2036 +)
  17263. <=WM: (14300: I3 ^dir R)
  17264. <=WM: (14296: R1 ^reward R1021)
  17265. <=WM: (14299: O2036 ^name predict-no)
  17266. <=WM: (14298: O2035 ^name predict-yes)
  17267. <=WM: (14297: R1021 ^value 1)
  17268. --- Inner Elaboration Phase, active level 1 (S1) ---
  17269. Firing prefer*rvt*predict-yes*H0
  17270. -->
  17271. Firing rl*prefer*rvt*predict-yes*H0*1
  17272. -->
  17273. (S1 ^operator O2037 = 0.5231200446407348)
  17274. Firing prefer*rvt*predict-yes*H0*1*H1
  17275. -->
  17276. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  17277. -->
  17278. (S1 ^operator O2037 = 0.4768784469452474)
  17279. Firing prefer*rvt*predict-no*H0
  17280. -->
  17281. Firing rl*prefer*rvt*predict-no*H0*2
  17282. -->
  17283. (S1 ^operator O2038 = 0.2550133872099196)
  17284. Firing prefer*rvt*predict-no*H0*2*H1
  17285. -->
  17286. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  17287. -->
  17288. (S1 ^operator O2038 = -0.01194930198035649)
  17289. inner elaboration loop at bottom goal.
  17290. Retracting rl*prefer*rvt*predict-no*H0*2
  17291. -->
  17292. (S1 ^operator O2036 = 0.2550133872099196)
  17293. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  17294. -->
  17295. (S1 ^operator O2036 = -0.01194930198035649)
  17296. Retracting rl*prefer*rvt*predict-yes*H0*1
  17297. -->
  17298. (S1 ^operator O2035 = 0.5231200446407348)
  17299. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  17300. -->
  17301. (S1 ^operator O2035 = 0.4768784469452474)
  17302. --- END Proposal Phase ---
  17303. --- Decision Phase ---
  17304. RL update rl*prefer*rvt*predict-yes*H0*3 0.673131 -0.290193 0.382938 -> 0.673134 -0.290193 0.382941(R,m,v=1,0.961538,0.0372208)
  17305. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326848 0.290191 0.617039 -> 0.326851 0.290192 0.617042(R,m,v=1,1,0)
  17306. =>WM: (14317: S1 ^operator O2037)
  17307. 1019: O: O2037 (predict-yes)
  17308. --- END Decision Phase ---
  17309. --- Application Phase ---
  17310. --- Firing Productions (PE) For State At Depth 1 ---
  17311. --- Inner Elaboration Phase, active level 1 (S1) ---
  17312. Firing apply*operator
  17313. -->
  17314. (I3 ^predict-yes N1019 + :O )
  17315. Firing apply*operator*complete
  17316. -->
  17317. (I3 ^predict-yes N1018 - :O )
  17318. inner elaboration loop at bottom goal.
  17319. --- Change Working Memory (PE) ---
  17320. =>WM: (14318: I3 ^predict-yes N1019)
  17321. <=WM: (14305: N1018 ^status complete)
  17322. <=WM: (14304: I3 ^predict-yes N1018)
  17323. --- Firing Productions (IE) For State At Depth 1 ---
  17324. --- Inner Elaboration Phase, active level 1 (S1) ---
  17325. Firing monitor*world
  17326. -->
  17327. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17328. --- Change Working Memory (IE) ---
  17329. --- END Application Phase ---
  17330. --- Output Phase ---
  17331. ENV: Agent did: predict-yes for direction L in state State-B
  17332. In State-B moving L
  17333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17334. predict error 0
  17335. dir: dir isR
  17336. --- END Output Phase ---
  17337. --- Input Phase ---
  17338. =>WM: (14322: I2 ^dir R)
  17339. =>WM: (14321: I2 ^reward 1)
  17340. =>WM: (14320: I2 ^see 1)
  17341. =>WM: (14319: N1019 ^status complete)
  17342. <=WM: (14308: I2 ^dir L)
  17343. <=WM: (14307: I2 ^reward 1)
  17344. <=WM: (14306: I2 ^see 1)
  17345. =>WM: (14323: I2 ^level-1 L1-root)
  17346. <=WM: (14309: I2 ^level-1 R1-root)
  17347. --- END Input Phase ---
  17348. --- Proposal Phase ---
  17349. --- Inner Elaboration Phase, active level 1 (S1) ---
  17350. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17351. -->
  17352. (S1 ^operator O2037 = 0.617042341965369)
  17353. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17354. -->
  17355. (S1 ^operator O2038 = 0.4901349546100854)
  17356. Firing prefer*rvt*predict-no*H0*4*H1
  17357. -->
  17358. Firing prefer*rvt*predict-yes*H0*3*H1
  17359. -->
  17360. Firing elaborate*copy-see-to-output-link
  17361. -->
  17362. (I3 ^see 1 +)
  17363. Firing elaborate*reward*based*on*reward
  17364. -->
  17365. (R1023 ^value 1 +)
  17366. (R1 ^reward R1023 +)
  17367. Firing propose*predict-yes
  17368. -->
  17369. (O2039 ^name predict-yes +)
  17370. (S1 ^operator O2039 +)
  17371. Firing propose*predict-no
  17372. -->
  17373. (O2040 ^name predict-no +)
  17374. (S1 ^operator O2040 +)
  17375. Firing rl*prefer*rvt*predict-no*H0*4
  17376. -->
  17377. (S1 ^operator O2038 = 0.1269768389574707)
  17378. Firing rl*prefer*rvt*predict-yes*H0*3
  17379. -->
  17380. (S1 ^operator O2037 = 0.3829411549295509)
  17381. Firing prefer*rvt*predict-yes*H0
  17382. -->
  17383. Firing prefer*rvt*predict-no*H0
  17384. -->
  17385. Firing elaborate*copy-dir-to-output-link
  17386. -->
  17387. (I3 ^dir R +)
  17388. inner elaboration loop at bottom goal.
  17389. Retracting elaborate*copy-see-to-output-link
  17390. -->
  17391. (I3 ^see 1 +)
  17392. Retracting propose*predict-no
  17393. -->
  17394. (O2038 ^name predict-no +)
  17395. (S1 ^operator O2038 +)
  17396. Retracting propose*predict-yes
  17397. -->
  17398. (O2037 ^name predict-yes +)
  17399. (S1 ^operator O2037 +)
  17400. Retracting elaborate*reward*based*on*reward
  17401. -->
  17402. (R1022 ^value 1 +)
  17403. (R1 ^reward R1022 +)
  17404. Retracting elaborate*copy-dir-to-output-link
  17405. -->
  17406. (I3 ^dir L +)
  17407. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  17408. -->
  17409. (S1 ^operator O2038 = -0.01194930198035649)
  17410. Retracting rl*prefer*rvt*predict-no*H0*2
  17411. -->
  17412. (S1 ^operator O2038 = 0.2550133872099196)
  17413. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  17414. -->
  17415. (S1 ^operator O2037 = 0.4768784469452474)
  17416. Retracting rl*prefer*rvt*predict-yes*H0*1
  17417. -->
  17418. (S1 ^operator O2037 = 0.5231200446407348)
  17419. =>WM: (14330: S1 ^operator O2040 +)
  17420. =>WM: (14329: S1 ^operator O2039 +)
  17421. =>WM: (14328: I3 ^dir R)
  17422. =>WM: (14327: O2040 ^name predict-no)
  17423. =>WM: (14326: O2039 ^name predict-yes)
  17424. =>WM: (14325: R1023 ^value 1)
  17425. =>WM: (14324: R1 ^reward R1023)
  17426. <=WM: (14315: S1 ^operator O2037 +)
  17427. <=WM: (14317: S1 ^operator O2037)
  17428. <=WM: (14316: S1 ^operator O2038 +)
  17429. <=WM: (14314: I3 ^dir L)
  17430. <=WM: (14310: R1 ^reward R1022)
  17431. <=WM: (14313: O2038 ^name predict-no)
  17432. <=WM: (14312: O2037 ^name predict-yes)
  17433. <=WM: (14311: R1022 ^value 1)
  17434. --- Inner Elaboration Phase, active level 1 (S1) ---
  17435. Firing prefer*rvt*predict-yes*H0
  17436. -->
  17437. Firing rl*prefer*rvt*predict-yes*H0*3
  17438. -->
  17439. (S1 ^operator O2039 = 0.3829411549295509)
  17440. Firing prefer*rvt*predict-yes*H0*3*H1
  17441. -->
  17442. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17443. -->
  17444. (S1 ^operator O2039 = 0.617042341965369)
  17445. Firing prefer*rvt*predict-no*H0
  17446. -->
  17447. Firing rl*prefer*rvt*predict-no*H0*4
  17448. -->
  17449. (S1 ^operator O2040 = 0.1269768389574707)
  17450. Firing prefer*rvt*predict-no*H0*4*H1
  17451. -->
  17452. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17453. -->
  17454. (S1 ^operator O2040 = 0.4901349546100854)
  17455. inner elaboration loop at bottom goal.
  17456. Retracting rl*prefer*rvt*predict-no*H0*4
  17457. -->
  17458. (S1 ^operator O2038 = 0.1269768389574707)
  17459. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  17460. -->
  17461. (S1 ^operator O2038 = 0.4901349546100854)
  17462. Retracting rl*prefer*rvt*predict-yes*H0*3
  17463. -->
  17464. (S1 ^operator O2037 = 0.3829411549295509)
  17465. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  17466. -->
  17467. (S1 ^operator O2037 = 0.617042341965369)
  17468. --- END Proposal Phase ---
  17469. --- Decision Phase ---
  17470. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.979452,0.0202645)
  17471. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272038 0.20484 0.476878 -> 0.272038 0.20484 0.476879(R,m,v=1,1,0)
  17472. =>WM: (14331: S1 ^operator O2039)
  17473. 1020: O: O2039 (predict-yes)
  17474. --- END Decision Phase ---
  17475. --- Application Phase ---
  17476. --- Firing Productions (PE) For State At Depth 1 ---
  17477. --- Inner Elaboration Phase, active level 1 (S1) ---
  17478. Firing apply*operator
  17479. -->
  17480. (I3 ^predict-yes N1020 + :O )
  17481. Firing apply*operator*complete
  17482. -->
  17483. (I3 ^predict-yes N1019 - :O )
  17484. inner elaboration loop at bottom goal.
  17485. --- Change Working Memory (PE) ---
  17486. =>WM: (14332: I3 ^predict-yes N1020)
  17487. <=WM: (14319: N1019 ^status complete)
  17488. <=WM: (14318: I3 ^predict-yes N1019)
  17489. --- Firing Productions (IE) For State At Depth 1 ---
  17490. --- Inner Elaboration Phase, active level 1 (S1) ---
  17491. Firing monitor*world
  17492. -->
  17493. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17494. --- Change Working Memory (IE) ---
  17495. --- END Application Phase ---
  17496. --- Output Phase ---
  17497. ENV: Agent did: predict-yes for direction R in state State-A
  17498. In State-A moving R
  17499. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17500. predict error 0
  17501. dir: dir isR
  17502. --- END Output Phase ---
  17503. --- Input Phase ---
  17504. =>WM: (14336: I2 ^dir R)
  17505. =>WM: (14335: I2 ^reward 1)
  17506. =>WM: (14334: I2 ^see 1)
  17507. =>WM: (14333: N1020 ^status complete)
  17508. <=WM: (14322: I2 ^dir R)
  17509. <=WM: (14321: I2 ^reward 1)
  17510. <=WM: (14320: I2 ^see 1)
  17511. =>WM: (14337: I2 ^level-1 R1-root)
  17512. <=WM: (14323: I2 ^level-1 L1-root)
  17513. --- END Input Phase ---
  17514. --- Proposal Phase ---
  17515. --- Inner Elaboration Phase, active level 1 (S1) ---
  17516. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  17517. -->
  17518. (S1 ^operator O2039 = 0.08783148430849691)
  17519. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  17520. -->
  17521. (S1 ^operator O2040 = 0.8730233482627223)
  17522. Firing prefer*rvt*predict-no*H0*4*H1
  17523. -->
  17524. Firing prefer*rvt*predict-yes*H0*3*H1
  17525. -->
  17526. Firing elaborate*copy-see-to-output-link
  17527. -->
  17528. (I3 ^see 1 +)
  17529. Firing elaborate*reward*based*on*reward
  17530. -->
  17531. (R1024 ^value 1 +)
  17532. (R1 ^reward R1024 +)
  17533. Firing propose*predict-yes
  17534. -->
  17535. (O2041 ^name predict-yes +)
  17536. (S1 ^operator O2041 +)
  17537. Firing propose*predict-no
  17538. -->
  17539. (O2042 ^name predict-no +)
  17540. (S1 ^operator O2042 +)
  17541. Firing rl*prefer*rvt*predict-no*H0*4
  17542. -->
  17543. (S1 ^operator O2040 = 0.1269768389574707)
  17544. Firing rl*prefer*rvt*predict-yes*H0*3
  17545. -->
  17546. (S1 ^operator O2039 = 0.3829411549295509)
  17547. Firing prefer*rvt*predict-yes*H0
  17548. -->
  17549. Firing prefer*rvt*predict-no*H0
  17550. -->
  17551. Firing elaborate*copy-dir-to-output-link
  17552. -->
  17553. (I3 ^dir R +)
  17554. inner elaboration loop at bottom goal.
  17555. Retracting elaborate*copy-see-to-output-link
  17556. -->
  17557. (I3 ^see 1 +)
  17558. Retracting propose*predict-no
  17559. -->
  17560. (O2040 ^name predict-no +)
  17561. (S1 ^operator O2040 +)
  17562. Retracting propose*predict-yes
  17563. -->
  17564. (O2039 ^name predict-yes +)
  17565. (S1 ^operator O2039 +)
  17566. Retracting elaborate*reward*based*on*reward
  17567. -->
  17568. (R1023 ^value 1 +)
  17569. (R1 ^reward R1023 +)
  17570. Retracting elaborate*copy-dir-to-output-link
  17571. -->
  17572. (I3 ^dir R +)
  17573. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  17574. -->
  17575. (S1 ^operator O2040 = 0.4901349546100854)
  17576. Retracting rl*prefer*rvt*predict-no*H0*4
  17577. -->
  17578. (S1 ^operator O2040 = 0.1269768389574707)
  17579. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  17580. -->
  17581. (S1 ^operator O2039 = 0.617042341965369)
  17582. Retracting rl*prefer*rvt*predict-yes*H0*3
  17583. -->
  17584. (S1 ^operator O2039 = 0.3829411549295509)
  17585. =>WM: (14343: S1 ^operator O2042 +)
  17586. =>WM: (14342: S1 ^operator O2041 +)
  17587. =>WM: (14341: O2042 ^name predict-no)
  17588. =>WM: (14340: O2041 ^name predict-yes)
  17589. =>WM: (14339: R1024 ^value 1)
  17590. =>WM: (14338: R1 ^reward R1024)
  17591. <=WM: (14329: S1 ^operator O2039 +)
  17592. <=WM: (14331: S1 ^operator O2039)
  17593. <=WM: (14330: S1 ^operator O2040 +)
  17594. <=WM: (14324: R1 ^reward R1023)
  17595. <=WM: (14327: O2040 ^name predict-no)
  17596. <=WM: (14326: O2039 ^name predict-yes)
  17597. <=WM: (14325: R1023 ^value 1)
  17598. --- Inner Elaboration Phase, active level 1 (S1) ---
  17599. Firing prefer*rvt*predict-yes*H0
  17600. -->
  17601. Firing rl*prefer*rvt*predict-yes*H0*3
  17602. -->
  17603. (S1 ^operator O2041 = 0.3829411549295509)
  17604. Firing prefer*rvt*predict-yes*H0*3*H1
  17605. -->
  17606. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  17607. -->
  17608. (S1 ^operator O2041 = 0.08783148430849691)
  17609. Firing prefer*rvt*predict-no*H0
  17610. -->
  17611. Firing rl*prefer*rvt*predict-no*H0*4
  17612. -->
  17613. (S1 ^operator O2042 = 0.1269768389574707)
  17614. Firing prefer*rvt*predict-no*H0*4*H1
  17615. -->
  17616. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  17617. -->
  17618. (S1 ^operator O2042 = 0.8730233482627223)
  17619. inner elaboration loop at bottom goal.
  17620. Retracting rl*prefer*rvt*predict-no*H0*4
  17621. -->
  17622. (S1 ^operator O2040 = 0.1269768389574707)
  17623. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  17624. -->
  17625. (S1 ^operator O2040 = 0.8730233482627223)
  17626. Retracting rl*prefer*rvt*predict-yes*H0*3
  17627. -->
  17628. (S1 ^operator O2039 = 0.3829411549295509)
  17629. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  17630. -->
  17631. (S1 ^operator O2039 = 0.08783148430849691)
  17632. --- END Proposal Phase ---
  17633. --- Decision Phase ---
  17634. RL update rl*prefer*rvt*predict-yes*H0*3 0.673134 -0.290193 0.382941 -> 0.673136 -0.290193 0.382944(R,m,v=1,0.961783,0.0369917)
  17635. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326851 0.290192 0.617042 -> 0.326853 0.290192 0.617045(R,m,v=1,1,0)
  17636. =>WM: (14344: S1 ^operator O2042)
  17637. 1021: O: O2042 (predict-no)
  17638. --- END Decision Phase ---
  17639. --- Application Phase ---
  17640. --- Firing Productions (PE) For State At Depth 1 ---
  17641. --- Inner Elaboration Phase, active level 1 (S1) ---
  17642. Firing apply*operator
  17643. -->
  17644. (I3 ^predict-no N1021 + :O )
  17645. Firing apply*operator*complete
  17646. -->
  17647. (I3 ^predict-yes N1020 - :O )
  17648. inner elaboration loop at bottom goal.
  17649. --- Change Working Memory (PE) ---
  17650. =>WM: (14345: I3 ^predict-no N1021)
  17651. <=WM: (14333: N1020 ^status complete)
  17652. <=WM: (14332: I3 ^predict-yes N1020)
  17653. --- Firing Productions (IE) For State At Depth 1 ---
  17654. --- Inner Elaboration Phase, active level 1 (S1) ---
  17655. Firing monitor*world
  17656. -->
  17657. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17658. --- Change Working Memory (IE) ---
  17659. --- END Application Phase ---
  17660. --- Output Phase ---
  17661. ENV: Agent did: predict-no for direction R in state State-B
  17662. In State-B moving R
  17663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17664. predict error 0
  17665. dir: dir isL
  17666. --- END Output Phase ---
  17667. --- Input Phase ---
  17668. =>WM: (14349: I2 ^dir L)
  17669. =>WM: (14348: I2 ^reward 1)
  17670. =>WM: (14347: I2 ^see 0)
  17671. =>WM: (14346: N1021 ^status complete)
  17672. <=WM: (14336: I2 ^dir R)
  17673. <=WM: (14335: I2 ^reward 1)
  17674. <=WM: (14334: I2 ^see 1)
  17675. =>WM: (14350: I2 ^level-1 R0-root)
  17676. <=WM: (14337: I2 ^level-1 R1-root)
  17677. --- END Input Phase ---
  17678. --- Proposal Phase ---
  17679. --- Inner Elaboration Phase, active level 1 (S1) ---
  17680. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  17681. -->
  17682. (S1 ^operator O2041 = 0.4768823043018841)
  17683. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  17684. -->
  17685. (S1 ^operator O2042 = 0.1700769046561409)
  17686. Firing prefer*rvt*predict-no*H0*2*H1
  17687. -->
  17688. Firing prefer*rvt*predict-yes*H0*1*H1
  17689. -->
  17690. Firing elaborate*copy-see-to-output-link
  17691. -->
  17692. (I3 ^see 0 +)
  17693. Firing elaborate*reward*based*on*reward
  17694. -->
  17695. (R1025 ^value 1 +)
  17696. (R1 ^reward R1025 +)
  17697. Firing propose*predict-yes
  17698. -->
  17699. (O2043 ^name predict-yes +)
  17700. (S1 ^operator O2043 +)
  17701. Firing propose*predict-no
  17702. -->
  17703. (O2044 ^name predict-no +)
  17704. (S1 ^operator O2044 +)
  17705. Firing rl*prefer*rvt*predict-no*H0*2
  17706. -->
  17707. (S1 ^operator O2042 = 0.2550133872099196)
  17708. Firing rl*prefer*rvt*predict-yes*H0*1
  17709. -->
  17710. (S1 ^operator O2041 = 0.5231202709028374)
  17711. Firing prefer*rvt*predict-yes*H0
  17712. -->
  17713. Firing prefer*rvt*predict-no*H0
  17714. -->
  17715. Firing elaborate*copy-dir-to-output-link
  17716. -->
  17717. (I3 ^dir L +)
  17718. inner elaboration loop at bottom goal.
  17719. Retracting elaborate*copy-see-to-output-link
  17720. -->
  17721. (I3 ^see 1 +)
  17722. Retracting propose*predict-no
  17723. -->
  17724. (O2042 ^name predict-no +)
  17725. (S1 ^operator O2042 +)
  17726. Retracting propose*predict-yes
  17727. -->
  17728. (O2041 ^name predict-yes +)
  17729. (S1 ^operator O2041 +)
  17730. Retracting elaborate*reward*based*on*reward
  17731. -->
  17732. (R1024 ^value 1 +)
  17733. (R1 ^reward R1024 +)
  17734. Retracting elaborate*copy-dir-to-output-link
  17735. -->
  17736. (I3 ^dir R +)
  17737. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  17738. -->
  17739. (S1 ^operator O2042 = 0.8730233482627223)
  17740. Retracting rl*prefer*rvt*predict-no*H0*4
  17741. -->
  17742. (S1 ^operator O2042 = 0.1269768389574707)
  17743. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  17744. -->
  17745. (S1 ^operator O2041 = 0.08783148430849691)
  17746. Retracting rl*prefer*rvt*predict-yes*H0*3
  17747. -->
  17748. (S1 ^operator O2041 = 0.3829436303953129)
  17749. =>WM: (14358: S1 ^operator O2044 +)
  17750. =>WM: (14357: S1 ^operator O2043 +)
  17751. =>WM: (14356: I3 ^dir L)
  17752. =>WM: (14355: O2044 ^name predict-no)
  17753. =>WM: (14354: O2043 ^name predict-yes)
  17754. =>WM: (14353: R1025 ^value 1)
  17755. =>WM: (14352: R1 ^reward R1025)
  17756. =>WM: (14351: I3 ^see 0)
  17757. <=WM: (14342: S1 ^operator O2041 +)
  17758. <=WM: (14343: S1 ^operator O2042 +)
  17759. <=WM: (14344: S1 ^operator O2042)
  17760. <=WM: (14328: I3 ^dir R)
  17761. <=WM: (14338: R1 ^reward R1024)
  17762. <=WM: (14295: I3 ^see 1)
  17763. <=WM: (14341: O2042 ^name predict-no)
  17764. <=WM: (14340: O2041 ^name predict-yes)
  17765. <=WM: (14339: R1024 ^value 1)
  17766. --- Inner Elaboration Phase, active level 1 (S1) ---
  17767. Firing prefer*rvt*predict-yes*H0
  17768. -->
  17769. Firing rl*prefer*rvt*predict-yes*H0*1
  17770. -->
  17771. (S1 ^operator O2043 = 0.5231202709028374)
  17772. Firing prefer*rvt*predict-yes*H0*1*H1
  17773. -->
  17774. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  17775. -->
  17776. (S1 ^operator O2043 = 0.4768823043018841)
  17777. Firing prefer*rvt*predict-no*H0
  17778. -->
  17779. Firing rl*prefer*rvt*predict-no*H0*2
  17780. -->
  17781. (S1 ^operator O2044 = 0.2550133872099196)
  17782. Firing prefer*rvt*predict-no*H0*2*H1
  17783. -->
  17784. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  17785. -->
  17786. (S1 ^operator O2044 = 0.1700769046561409)
  17787. inner elaboration loop at bottom goal.
  17788. Retracting rl*prefer*rvt*predict-no*H0*2
  17789. -->
  17790. (S1 ^operator O2042 = 0.2550133872099196)
  17791. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  17792. -->
  17793. (S1 ^operator O2042 = 0.1700769046561409)
  17794. Retracting rl*prefer*rvt*predict-yes*H0*1
  17795. -->
  17796. (S1 ^operator O2041 = 0.5231202709028374)
  17797. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  17798. -->
  17799. (S1 ^operator O2041 = 0.4768823043018841)
  17800. --- END Proposal Phase ---
  17801. --- Decision Phase ---
  17802. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.95,0.0477654)
  17803. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  17804. =>WM: (14359: S1 ^operator O2043)
  17805. 1022: O: O2043 (predict-yes)
  17806. --- END Decision Phase ---
  17807. --- Application Phase ---
  17808. --- Firing Productions (PE) For State At Depth 1 ---
  17809. --- Inner Elaboration Phase, active level 1 (S1) ---
  17810. Firing apply*operator
  17811. -->
  17812. (I3 ^predict-yes N1022 + :O )
  17813. Firing apply*operator*complete
  17814. -->
  17815. (I3 ^predict-no N1021 - :O )
  17816. inner elaboration loop at bottom goal.
  17817. --- Change Working Memory (PE) ---
  17818. =>WM: (14360: I3 ^predict-yes N1022)
  17819. <=WM: (14346: N1021 ^status complete)
  17820. <=WM: (14345: I3 ^predict-no N1021)
  17821. --- Firing Productions (IE) For State At Depth 1 ---
  17822. --- Inner Elaboration Phase, active level 1 (S1) ---
  17823. Firing monitor*world
  17824. -->
  17825. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17826. --- Change Working Memory (IE) ---
  17827. --- END Application Phase ---
  17828. --- Output Phase ---
  17829. ENV: Agent did: predict-yes for direction L in state State-B
  17830. In State-B moving L
  17831. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17832. predict error 0
  17833. dir: dir isR
  17834. --- END Output Phase ---
  17835. --- Input Phase ---
  17836. =>WM: (14364: I2 ^dir R)
  17837. =>WM: (14363: I2 ^reward 1)
  17838. =>WM: (14362: I2 ^see 1)
  17839. =>WM: (14361: N1022 ^status complete)
  17840. <=WM: (14349: I2 ^dir L)
  17841. <=WM: (14348: I2 ^reward 1)
  17842. <=WM: (14347: I2 ^see 0)
  17843. =>WM: (14365: I2 ^level-1 L1-root)
  17844. <=WM: (14350: I2 ^level-1 R0-root)
  17845. --- END Input Phase ---
  17846. --- Proposal Phase ---
  17847. --- Inner Elaboration Phase, active level 1 (S1) ---
  17848. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17849. -->
  17850. (S1 ^operator O2043 = 0.6170448174311309)
  17851. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17852. -->
  17853. (S1 ^operator O2044 = 0.4901349546100854)
  17854. Firing prefer*rvt*predict-no*H0*4*H1
  17855. -->
  17856. Firing prefer*rvt*predict-yes*H0*3*H1
  17857. -->
  17858. Firing elaborate*copy-see-to-output-link
  17859. -->
  17860. (I3 ^see 1 +)
  17861. Firing elaborate*reward*based*on*reward
  17862. -->
  17863. (R1026 ^value 1 +)
  17864. (R1 ^reward R1026 +)
  17865. Firing propose*predict-yes
  17866. -->
  17867. (O2045 ^name predict-yes +)
  17868. (S1 ^operator O2045 +)
  17869. Firing propose*predict-no
  17870. -->
  17871. (O2046 ^name predict-no +)
  17872. (S1 ^operator O2046 +)
  17873. Firing rl*prefer*rvt*predict-no*H0*4
  17874. -->
  17875. (S1 ^operator O2044 = 0.1269768108744418)
  17876. Firing rl*prefer*rvt*predict-yes*H0*3
  17877. -->
  17878. (S1 ^operator O2043 = 0.3829436303953129)
  17879. Firing prefer*rvt*predict-yes*H0
  17880. -->
  17881. Firing prefer*rvt*predict-no*H0
  17882. -->
  17883. Firing elaborate*copy-dir-to-output-link
  17884. -->
  17885. (I3 ^dir R +)
  17886. inner elaboration loop at bottom goal.
  17887. Retracting elaborate*copy-see-to-output-link
  17888. -->
  17889. (I3 ^see 0 +)
  17890. Retracting propose*predict-no
  17891. -->
  17892. (O2044 ^name predict-no +)
  17893. (S1 ^operator O2044 +)
  17894. Retracting propose*predict-yes
  17895. -->
  17896. (O2043 ^name predict-yes +)
  17897. (S1 ^operator O2043 +)
  17898. Retracting elaborate*reward*based*on*reward
  17899. -->
  17900. (R1025 ^value 1 +)
  17901. (R1 ^reward R1025 +)
  17902. Retracting elaborate*copy-dir-to-output-link
  17903. -->
  17904. (I3 ^dir L +)
  17905. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  17906. -->
  17907. (S1 ^operator O2044 = 0.1700769046561409)
  17908. Retracting rl*prefer*rvt*predict-no*H0*2
  17909. -->
  17910. (S1 ^operator O2044 = 0.2550133872099196)
  17911. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  17912. -->
  17913. (S1 ^operator O2043 = 0.4768823043018841)
  17914. Retracting rl*prefer*rvt*predict-yes*H0*1
  17915. -->
  17916. (S1 ^operator O2043 = 0.5231202709028374)
  17917. =>WM: (14373: S1 ^operator O2046 +)
  17918. =>WM: (14372: S1 ^operator O2045 +)
  17919. =>WM: (14371: I3 ^dir R)
  17920. =>WM: (14370: O2046 ^name predict-no)
  17921. =>WM: (14369: O2045 ^name predict-yes)
  17922. =>WM: (14368: R1026 ^value 1)
  17923. =>WM: (14367: R1 ^reward R1026)
  17924. =>WM: (14366: I3 ^see 1)
  17925. <=WM: (14357: S1 ^operator O2043 +)
  17926. <=WM: (14359: S1 ^operator O2043)
  17927. <=WM: (14358: S1 ^operator O2044 +)
  17928. <=WM: (14356: I3 ^dir L)
  17929. <=WM: (14352: R1 ^reward R1025)
  17930. <=WM: (14351: I3 ^see 0)
  17931. <=WM: (14355: O2044 ^name predict-no)
  17932. <=WM: (14354: O2043 ^name predict-yes)
  17933. <=WM: (14353: R1025 ^value 1)
  17934. --- Inner Elaboration Phase, active level 1 (S1) ---
  17935. Firing prefer*rvt*predict-yes*H0
  17936. -->
  17937. Firing rl*prefer*rvt*predict-yes*H0*3
  17938. -->
  17939. (S1 ^operator O2045 = 0.3829436303953129)
  17940. Firing prefer*rvt*predict-yes*H0*3*H1
  17941. -->
  17942. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  17943. -->
  17944. (S1 ^operator O2045 = 0.6170448174311309)
  17945. Firing prefer*rvt*predict-no*H0
  17946. -->
  17947. Firing rl*prefer*rvt*predict-no*H0*4
  17948. -->
  17949. (S1 ^operator O2046 = 0.1269768108744418)
  17950. Firing prefer*rvt*predict-no*H0*4*H1
  17951. -->
  17952. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  17953. -->
  17954. (S1 ^operator O2046 = 0.4901349546100854)
  17955. inner elaboration loop at bottom goal.
  17956. Retracting rl*prefer*rvt*predict-no*H0*4
  17957. -->
  17958. (S1 ^operator O2044 = 0.1269768108744418)
  17959. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  17960. -->
  17961. (S1 ^operator O2044 = 0.4901349546100854)
  17962. Retracting rl*prefer*rvt*predict-yes*H0*3
  17963. -->
  17964. (S1 ^operator O2043 = 0.3829436303953129)
  17965. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  17966. -->
  17967. (S1 ^operator O2043 = 0.6170448174311309)
  17968. --- END Proposal Phase ---
  17969. --- Decision Phase ---
  17970. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.979592,0.0201286)
  17971. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272043 0.204839 0.476882 -> 0.272043 0.204839 0.476882(R,m,v=1,1,0)
  17972. =>WM: (14374: S1 ^operator O2045)
  17973. 1023: O: O2045 (predict-yes)
  17974. --- END Decision Phase ---
  17975. --- Application Phase ---
  17976. --- Firing Productions (PE) For State At Depth 1 ---
  17977. --- Inner Elaboration Phase, active level 1 (S1) ---
  17978. Firing apply*operator
  17979. -->
  17980. (I3 ^predict-yes N1023 + :O )
  17981. Firing apply*operator*complete
  17982. -->
  17983. (I3 ^predict-yes N1022 - :O )
  17984. inner elaboration loop at bottom goal.
  17985. --- Change Working Memory (PE) ---
  17986. =>WM: (14375: I3 ^predict-yes N1023)
  17987. <=WM: (14361: N1022 ^status complete)
  17988. <=WM: (14360: I3 ^predict-yes N1022)
  17989. --- Firing Productions (IE) For State At Depth 1 ---
  17990. --- Inner Elaboration Phase, active level 1 (S1) ---
  17991. Firing monitor*world
  17992. -->
  17993. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17994. --- Change Working Memory (IE) ---
  17995. --- END Application Phase ---
  17996. --- Output Phase ---
  17997. ENV: Agent did: predict-yes for direction R in state State-A
  17998. In State-A moving R
  17999. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18000. predict error 0
  18001. dir: dir isR
  18002. --- END Output Phase ---
  18003. -/|--- Input Phase ---
  18004. =>WM: (14379: I2 ^dir R)
  18005. =>WM: (14378: I2 ^reward 1)
  18006. =>WM: (14377: I2 ^see 1)
  18007. =>WM: (14376: N1023 ^status complete)
  18008. <=WM: (14364: I2 ^dir R)
  18009. <=WM: (14363: I2 ^reward 1)
  18010. <=WM: (14362: I2 ^see 1)
  18011. =>WM: (14380: I2 ^level-1 R1-root)
  18012. <=WM: (14365: I2 ^level-1 L1-root)
  18013. --- END Input Phase ---
  18014. --- Proposal Phase ---
  18015. --- Inner Elaboration Phase, active level 1 (S1) ---
  18016. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  18017. -->
  18018. (S1 ^operator O2045 = 0.08783148430849691)
  18019. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  18020. -->
  18021. (S1 ^operator O2046 = 0.8730233201796934)
  18022. Firing prefer*rvt*predict-no*H0*4*H1
  18023. -->
  18024. Firing prefer*rvt*predict-yes*H0*3*H1
  18025. -->
  18026. Firing elaborate*copy-see-to-output-link
  18027. -->
  18028. (I3 ^see 1 +)
  18029. Firing elaborate*reward*based*on*reward
  18030. -->
  18031. (R1027 ^value 1 +)
  18032. (R1 ^reward R1027 +)
  18033. Firing propose*predict-yes
  18034. -->
  18035. (O2047 ^name predict-yes +)
  18036. (S1 ^operator O2047 +)
  18037. Firing propose*predict-no
  18038. -->
  18039. (O2048 ^name predict-no +)
  18040. (S1 ^operator O2048 +)
  18041. Firing rl*prefer*rvt*predict-no*H0*4
  18042. -->
  18043. (S1 ^operator O2046 = 0.1269768108744418)
  18044. Firing rl*prefer*rvt*predict-yes*H0*3
  18045. -->
  18046. (S1 ^operator O2045 = 0.3829436303953129)
  18047. Firing prefer*rvt*predict-yes*H0
  18048. -->
  18049. Firing prefer*rvt*predict-no*H0
  18050. -->
  18051. Firing elaborate*copy-dir-to-output-link
  18052. -->
  18053. (I3 ^dir R +)
  18054. inner elaboration loop at bottom goal.
  18055. Retracting elaborate*copy-see-to-output-link
  18056. -->
  18057. (I3 ^see 1 +)
  18058. Retracting propose*predict-no
  18059. -->
  18060. (O2046 ^name predict-no +)
  18061. (S1 ^operator O2046 +)
  18062. Retracting propose*predict-yes
  18063. -->
  18064. (O2045 ^name predict-yes +)
  18065. (S1 ^operator O2045 +)
  18066. Retracting elaborate*reward*based*on*reward
  18067. -->
  18068. (R1026 ^value 1 +)
  18069. (R1 ^reward R1026 +)
  18070. Retracting elaborate*copy-dir-to-output-link
  18071. -->
  18072. (I3 ^dir R +)
  18073. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  18074. -->
  18075. (S1 ^operator O2046 = 0.4901349546100854)
  18076. Retracting rl*prefer*rvt*predict-no*H0*4
  18077. -->
  18078. (S1 ^operator O2046 = 0.1269768108744418)
  18079. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  18080. -->
  18081. (S1 ^operator O2045 = 0.6170448174311309)
  18082. Retracting rl*prefer*rvt*predict-yes*H0*3
  18083. -->
  18084. (S1 ^operator O2045 = 0.3829436303953129)
  18085. =>WM: (14386: S1 ^operator O2048 +)
  18086. =>WM: (14385: S1 ^operator O2047 +)
  18087. =>WM: (14384: O2048 ^name predict-no)
  18088. =>WM: (14383: O2047 ^name predict-yes)
  18089. =>WM: (14382: R1027 ^value 1)
  18090. =>WM: (14381: R1 ^reward R1027)
  18091. <=WM: (14372: S1 ^operator O2045 +)
  18092. <=WM: (14374: S1 ^operator O2045)
  18093. <=WM: (14373: S1 ^operator O2046 +)
  18094. <=WM: (14367: R1 ^reward R1026)
  18095. <=WM: (14370: O2046 ^name predict-no)
  18096. <=WM: (14369: O2045 ^name predict-yes)
  18097. <=WM: (14368: R1026 ^value 1)
  18098. --- Inner Elaboration Phase, active level 1 (S1) ---
  18099. Firing prefer*rvt*predict-yes*H0
  18100. -->
  18101. Firing rl*prefer*rvt*predict-yes*H0*3
  18102. -->
  18103. (S1 ^operator O2047 = 0.3829436303953129)
  18104. Firing prefer*rvt*predict-yes*H0*3*H1
  18105. -->
  18106. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  18107. -->
  18108. (S1 ^operator O2047 = 0.08783148430849691)
  18109. Firing prefer*rvt*predict-no*H0
  18110. -->
  18111. Firing rl*prefer*rvt*predict-no*H0*4
  18112. -->
  18113. (S1 ^operator O2048 = 0.1269768108744418)
  18114. Firing prefer*rvt*predict-no*H0*4*H1
  18115. -->
  18116. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  18117. -->
  18118. (S1 ^operator O2048 = 0.8730233201796934)
  18119. inner elaboration loop at bottom goal.
  18120. Retracting rl*prefer*rvt*predict-no*H0*4
  18121. -->
  18122. (S1 ^operator O2046 = 0.1269768108744418)
  18123. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  18124. -->
  18125. (S1 ^operator O2046 = 0.8730233201796934)
  18126. Retracting rl*prefer*rvt*predict-yes*H0*3
  18127. -->
  18128. (S1 ^operator O2045 = 0.3829436303953129)
  18129. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  18130. -->
  18131. (S1 ^operator O2045 = 0.08783148430849691)
  18132. --- END Proposal Phase ---
  18133. --- Decision Phase ---
  18134. RL update rl*prefer*rvt*predict-yes*H0*3 0.673136 -0.290193 0.382944 -> 0.673138 -0.290193 0.382945(R,m,v=1,0.962025,0.0367653)
  18135. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326853 0.290192 0.617045 -> 0.326855 0.290192 0.617047(R,m,v=1,1,0)
  18136. =>WM: (14387: S1 ^operator O2048)
  18137. 1024: O: O2048 (predict-no)
  18138. --- END Decision Phase ---
  18139. --- Application Phase ---
  18140. --- Firing Productions (PE) For State At Depth 1 ---
  18141. --- Inner Elaboration Phase, active level 1 (S1) ---
  18142. Firing apply*operator
  18143. -->
  18144. (I3 ^predict-no N1024 + :O )
  18145. Firing apply*operator*complete
  18146. -->
  18147. (I3 ^predict-yes N1023 - :O )
  18148. inner elaboration loop at bottom goal.
  18149. --- Change Working Memory (PE) ---
  18150. =>WM: (14388: I3 ^predict-no N1024)
  18151. <=WM: (14376: N1023 ^status complete)
  18152. <=WM: (14375: I3 ^predict-yes N1023)
  18153. --- Firing Productions (IE) For State At Depth 1 ---
  18154. --- Inner Elaboration Phase, active level 1 (S1) ---
  18155. Firing monitor*world
  18156. -->
  18157. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18158. --- Change Working Memory (IE) ---
  18159. --- END Application Phase ---
  18160. --- Output Phase ---
  18161. ENV: Agent did: predict-no for direction R in state State-B
  18162. In State-B moving R
  18163. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18164. predict error 0
  18165. dir: dir isU
  18166. --- END Output Phase ---
  18167. \-/--- Input Phase ---
  18168. =>WM: (14392: I2 ^dir U)
  18169. =>WM: (14391: I2 ^reward 1)
  18170. =>WM: (14390: I2 ^see 0)
  18171. =>WM: (14389: N1024 ^status complete)
  18172. <=WM: (14379: I2 ^dir R)
  18173. <=WM: (14378: I2 ^reward 1)
  18174. <=WM: (14377: I2 ^see 1)
  18175. =>WM: (14393: I2 ^level-1 R0-root)
  18176. <=WM: (14380: I2 ^level-1 R1-root)
  18177. --- END Input Phase ---
  18178. --- Proposal Phase ---
  18179. --- Inner Elaboration Phase, active level 1 (S1) ---
  18180. Firing elaborate*copy-see-to-output-link
  18181. -->
  18182. (I3 ^see 0 +)
  18183. Firing elaborate*reward*based*on*reward
  18184. -->
  18185. (R1028 ^value 1 +)
  18186. (R1 ^reward R1028 +)
  18187. Firing propose*predict-yes
  18188. -->
  18189. (O2049 ^name predict-yes +)
  18190. (S1 ^operator O2049 +)
  18191. Firing propose*predict-no
  18192. -->
  18193. (O2050 ^name predict-no +)
  18194. (S1 ^operator O2050 +)
  18195. Firing rl*prefer*rvt*predict-no*H0*6
  18196. -->
  18197. (S1 ^operator O2048 = 0.9999999999999999)
  18198. Firing rl*prefer*rvt*predict-yes*H0*5
  18199. -->
  18200. (S1 ^operator O2047 = 0.)
  18201. Firing prefer*rvt*predict-yes*H0
  18202. -->
  18203. Firing prefer*rvt*predict-no*H0
  18204. -->
  18205. Firing elaborate*copy-dir-to-output-link
  18206. -->
  18207. (I3 ^dir U +)
  18208. inner elaboration loop at bottom goal.
  18209. Retracting elaborate*copy-see-to-output-link
  18210. -->
  18211. (I3 ^see 1 +)
  18212. Retracting propose*predict-no
  18213. -->
  18214. (O2048 ^name predict-no +)
  18215. (S1 ^operator O2048 +)
  18216. Retracting propose*predict-yes
  18217. -->
  18218. (O2047 ^name predict-yes +)
  18219. (S1 ^operator O2047 +)
  18220. Retracting elaborate*reward*based*on*reward
  18221. -->
  18222. (R1027 ^value 1 +)
  18223. (R1 ^reward R1027 +)
  18224. Retracting elaborate*copy-dir-to-output-link
  18225. -->
  18226. (I3 ^dir R +)
  18227. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  18228. -->
  18229. (S1 ^operator O2048 = 0.8730233201796934)
  18230. Retracting rl*prefer*rvt*predict-no*H0*4
  18231. -->
  18232. (S1 ^operator O2048 = 0.1269768108744418)
  18233. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  18234. -->
  18235. (S1 ^operator O2047 = 0.08783148430849691)
  18236. Retracting rl*prefer*rvt*predict-yes*H0*3
  18237. -->
  18238. (S1 ^operator O2047 = 0.3829453632213463)
  18239. =>WM: (14401: S1 ^operator O2050 +)
  18240. =>WM: (14400: S1 ^operator O2049 +)
  18241. =>WM: (14399: I3 ^dir U)
  18242. =>WM: (14398: O2050 ^name predict-no)
  18243. =>WM: (14397: O2049 ^name predict-yes)
  18244. =>WM: (14396: R1028 ^value 1)
  18245. =>WM: (14395: R1 ^reward R1028)
  18246. =>WM: (14394: I3 ^see 0)
  18247. <=WM: (14385: S1 ^operator O2047 +)
  18248. <=WM: (14386: S1 ^operator O2048 +)
  18249. <=WM: (14387: S1 ^operator O2048)
  18250. <=WM: (14371: I3 ^dir R)
  18251. <=WM: (14381: R1 ^reward R1027)
  18252. <=WM: (14366: I3 ^see 1)
  18253. <=WM: (14384: O2048 ^name predict-no)
  18254. <=WM: (14383: O2047 ^name predict-yes)
  18255. <=WM: (14382: R1027 ^value 1)
  18256. --- Inner Elaboration Phase, active level 1 (S1) ---
  18257. Firing prefer*rvt*predict-yes*H0
  18258. -->
  18259. Firing rl*prefer*rvt*predict-yes*H0*5
  18260. -->
  18261. (S1 ^operator O2049 = 0.)
  18262. Firing prefer*rvt*predict-no*H0
  18263. -->
  18264. Firing rl*prefer*rvt*predict-no*H0*6
  18265. -->
  18266. (S1 ^operator O2050 = 0.9999999999999999)
  18267. inner elaboration loop at bottom goal.
  18268. Retracting rl*prefer*rvt*predict-no*H0*6
  18269. -->
  18270. (S1 ^operator O2048 = 0.9999999999999999)
  18271. Retracting rl*prefer*rvt*predict-yes*H0*5
  18272. -->
  18273. (S1 ^operator O2047 = 0.)
  18274. --- END Proposal Phase ---
  18275. --- Decision Phase ---
  18276. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.950276,0.0475138)
  18277. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  18278. =>WM: (14402: S1 ^operator O2050)
  18279. 1025: O: O2050 (predict-no)
  18280. --- END Decision Phase ---
  18281. --- Application Phase ---
  18282. --- Firing Productions (PE) For State At Depth 1 ---
  18283. --- Inner Elaboration Phase, active level 1 (S1) ---
  18284. Firing apply*operator
  18285. -->
  18286. (I3 ^predict-no N1025 + :O )
  18287. Firing apply*operator*complete
  18288. -->
  18289. (I3 ^predict-no N1024 - :O )
  18290. inner elaboration loop at bottom goal.
  18291. --- Change Working Memory (PE) ---
  18292. =>WM: (14403: I3 ^predict-no N1025)
  18293. <=WM: (14389: N1024 ^status complete)
  18294. <=WM: (14388: I3 ^predict-no N1024)
  18295. --- Firing Productions (IE) For State At Depth 1 ---
  18296. --- Inner Elaboration Phase, active level 1 (S1) ---
  18297. Firing monitor*world
  18298. -->
  18299. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18300. --- Change Working Memory (IE) ---
  18301. --- END Application Phase ---
  18302. --- Output Phase ---
  18303. ENV: Agent did: predict-no for direction U in state State-B
  18304. In State-B moving U
  18305. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18306. predict error 0
  18307. dir: dir isU
  18308. --- END Output Phase ---
  18309. |\--- Input Phase ---
  18310. =>WM: (14407: I2 ^dir U)
  18311. =>WM: (14406: I2 ^reward 1)
  18312. =>WM: (14405: I2 ^see 0)
  18313. =>WM: (14404: N1025 ^status complete)
  18314. <=WM: (14392: I2 ^dir U)
  18315. <=WM: (14391: I2 ^reward 1)
  18316. <=WM: (14390: I2 ^see 0)
  18317. =>WM: (14408: I2 ^level-1 R0-root)
  18318. <=WM: (14393: I2 ^level-1 R0-root)
  18319. --- END Input Phase ---
  18320. --- Proposal Phase ---
  18321. --- Inner Elaboration Phase, active level 1 (S1) ---
  18322. Firing elaborate*copy-see-to-output-link
  18323. -->
  18324. (I3 ^see 0 +)
  18325. Firing elaborate*reward*based*on*reward
  18326. -->
  18327. (R1029 ^value 1 +)
  18328. (R1 ^reward R1029 +)
  18329. Firing propose*predict-yes
  18330. -->
  18331. (O2051 ^name predict-yes +)
  18332. (S1 ^operator O2051 +)
  18333. Firing propose*predict-no
  18334. -->
  18335. (O2052 ^name predict-no +)
  18336. (S1 ^operator O2052 +)
  18337. Firing rl*prefer*rvt*predict-no*H0*6
  18338. -->
  18339. (S1 ^operator O2050 = 0.9999999999999999)
  18340. Firing rl*prefer*rvt*predict-yes*H0*5
  18341. -->
  18342. (S1 ^operator O2049 = 0.)
  18343. Firing prefer*rvt*predict-yes*H0
  18344. -->
  18345. Firing prefer*rvt*predict-no*H0
  18346. -->
  18347. Firing elaborate*copy-dir-to-output-link
  18348. -->
  18349. (I3 ^dir U +)
  18350. inner elaboration loop at bottom goal.
  18351. Retracting elaborate*copy-see-to-output-link
  18352. -->
  18353. (I3 ^see 0 +)
  18354. Retracting propose*predict-no
  18355. -->
  18356. (O2050 ^name predict-no +)
  18357. (S1 ^operator O2050 +)
  18358. Retracting propose*predict-yes
  18359. -->
  18360. (O2049 ^name predict-yes +)
  18361. (S1 ^operator O2049 +)
  18362. Retracting elaborate*reward*based*on*reward
  18363. -->
  18364. (R1028 ^value 1 +)
  18365. (R1 ^reward R1028 +)
  18366. Retracting elaborate*copy-dir-to-output-link
  18367. -->
  18368. (I3 ^dir U +)
  18369. Retracting rl*prefer*rvt*predict-no*H0*6
  18370. -->
  18371. (S1 ^operator O2050 = 0.9999999999999999)
  18372. Retracting rl*prefer*rvt*predict-yes*H0*5
  18373. -->
  18374. (S1 ^operator O2049 = 0.)
  18375. =>WM: (14414: S1 ^operator O2052 +)
  18376. =>WM: (14413: S1 ^operator O2051 +)
  18377. =>WM: (14412: O2052 ^name predict-no)
  18378. =>WM: (14411: O2051 ^name predict-yes)
  18379. =>WM: (14410: R1029 ^value 1)
  18380. =>WM: (14409: R1 ^reward R1029)
  18381. <=WM: (14400: S1 ^operator O2049 +)
  18382. <=WM: (14401: S1 ^operator O2050 +)
  18383. <=WM: (14402: S1 ^operator O2050)
  18384. <=WM: (14395: R1 ^reward R1028)
  18385. <=WM: (14398: O2050 ^name predict-no)
  18386. <=WM: (14397: O2049 ^name predict-yes)
  18387. <=WM: (14396: R1028 ^value 1)
  18388. --- Inner Elaboration Phase, active level 1 (S1) ---
  18389. Firing prefer*rvt*predict-yes*H0
  18390. -->
  18391. Firing rl*prefer*rvt*predict-yes*H0*5
  18392. -->
  18393. (S1 ^operator O2051 = 0.)
  18394. Firing prefer*rvt*predict-no*H0
  18395. -->
  18396. Firing rl*prefer*rvt*predict-no*H0*6
  18397. -->
  18398. (S1 ^operator O2052 = 0.9999999999999999)
  18399. inner elaboration loop at bottom goal.
  18400. Retracting rl*prefer*rvt*predict-no*H0*6
  18401. -->
  18402. (S1 ^operator O2050 = 0.9999999999999999)
  18403. Retracting rl*prefer*rvt*predict-yes*H0*5
  18404. -->
  18405. (S1 ^operator O2049 = 0.)
  18406. --- END Proposal Phase ---
  18407. --- Decision Phase ---
  18408. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18409. =>WM: (14415: S1 ^operator O2052)
  18410. 1026: O: O2052 (predict-no)
  18411. --- END Decision Phase ---
  18412. --- Application Phase ---
  18413. --- Firing Productions (PE) For State At Depth 1 ---
  18414. --- Inner Elaboration Phase, active level 1 (S1) ---
  18415. Firing apply*operator
  18416. -->
  18417. (I3 ^predict-no N1026 + :O )
  18418. Firing apply*operator*complete
  18419. -->
  18420. (I3 ^predict-no N1025 - :O )
  18421. inner elaboration loop at bottom goal.
  18422. --- Change Working Memory (PE) ---
  18423. =>WM: (14416: I3 ^predict-no N1026)
  18424. <=WM: (14404: N1025 ^status complete)
  18425. <=WM: (14403: I3 ^predict-no N1025)
  18426. --- Firing Productions (IE) For State At Depth 1 ---
  18427. --- Inner Elaboration Phase, active level 1 (S1) ---
  18428. Firing monitor*world
  18429. -->
  18430. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18431. --- Change Working Memory (IE) ---
  18432. --- END Application Phase ---
  18433. --- Output Phase ---
  18434. ENV: Agent did: predict-no for direction U in state State-B
  18435. In State-B moving U
  18436. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18437. predict error 0
  18438. dir: dir isU
  18439. --- END Output Phase ---
  18440. ---- Input Phase ---
  18441. =>WM: (14420: I2 ^dir U)
  18442. =>WM: (14419: I2 ^reward 1)
  18443. =>WM: (14418: I2 ^see 0)
  18444. =>WM: (14417: N1026 ^status complete)
  18445. <=WM: (14407: I2 ^dir U)
  18446. <=WM: (14406: I2 ^reward 1)
  18447. <=WM: (14405: I2 ^see 0)
  18448. =>WM: (14421: I2 ^level-1 R0-root)
  18449. <=WM: (14408: I2 ^level-1 R0-root)
  18450. --- END Input Phase ---
  18451. --- Proposal Phase ---
  18452. --- Inner Elaboration Phase, active level 1 (S1) ---
  18453. Firing elaborate*copy-see-to-output-link
  18454. -->
  18455. (I3 ^see 0 +)
  18456. Firing elaborate*reward*based*on*reward
  18457. -->
  18458. (R1030 ^value 1 +)
  18459. (R1 ^reward R1030 +)
  18460. Firing propose*predict-yes
  18461. -->
  18462. (O2053 ^name predict-yes +)
  18463. (S1 ^operator O2053 +)
  18464. Firing propose*predict-no
  18465. -->
  18466. (O2054 ^name predict-no +)
  18467. (S1 ^operator O2054 +)
  18468. Firing rl*prefer*rvt*predict-no*H0*6
  18469. -->
  18470. (S1 ^operator O2052 = 0.9999999999999999)
  18471. Firing rl*prefer*rvt*predict-yes*H0*5
  18472. -->
  18473. (S1 ^operator O2051 = 0.)
  18474. Firing prefer*rvt*predict-yes*H0
  18475. -->
  18476. Firing prefer*rvt*predict-no*H0
  18477. -->
  18478. Firing elaborate*copy-dir-to-output-link
  18479. -->
  18480. (I3 ^dir U +)
  18481. inner elaboration loop at bottom goal.
  18482. Retracting elaborate*copy-see-to-output-link
  18483. -->
  18484. (I3 ^see 0 +)
  18485. Retracting propose*predict-no
  18486. -->
  18487. (O2052 ^name predict-no +)
  18488. (S1 ^operator O2052 +)
  18489. Retracting propose*predict-yes
  18490. -->
  18491. (O2051 ^name predict-yes +)
  18492. (S1 ^operator O2051 +)
  18493. Retracting elaborate*reward*based*on*reward
  18494. -->
  18495. (R1029 ^value 1 +)
  18496. (R1 ^reward R1029 +)
  18497. Retracting elaborate*copy-dir-to-output-link
  18498. -->
  18499. (I3 ^dir U +)
  18500. Retracting rl*prefer*rvt*predict-no*H0*6
  18501. -->
  18502. (S1 ^operator O2052 = 0.9999999999999999)
  18503. Retracting rl*prefer*rvt*predict-yes*H0*5
  18504. -->
  18505. (S1 ^operator O2051 = 0.)
  18506. =>WM: (14427: S1 ^operator O2054 +)
  18507. =>WM: (14426: S1 ^operator O2053 +)
  18508. =>WM: (14425: O2054 ^name predict-no)
  18509. =>WM: (14424: O2053 ^name predict-yes)
  18510. =>WM: (14423: R1030 ^value 1)
  18511. =>WM: (14422: R1 ^reward R1030)
  18512. <=WM: (14413: S1 ^operator O2051 +)
  18513. <=WM: (14414: S1 ^operator O2052 +)
  18514. <=WM: (14415: S1 ^operator O2052)
  18515. <=WM: (14409: R1 ^reward R1029)
  18516. <=WM: (14412: O2052 ^name predict-no)
  18517. <=WM: (14411: O2051 ^name predict-yes)
  18518. <=WM: (14410: R1029 ^value 1)
  18519. --- Inner Elaboration Phase, active level 1 (S1) ---
  18520. Firing prefer*rvt*predict-yes*H0
  18521. -->
  18522. Firing rl*prefer*rvt*predict-yes*H0*5
  18523. -->
  18524. (S1 ^operator O2053 = 0.)
  18525. Firing prefer*rvt*predict-no*H0
  18526. -->
  18527. Firing rl*prefer*rvt*predict-no*H0*6
  18528. -->
  18529. (S1 ^operator O2054 = 0.9999999999999999)
  18530. inner elaboration loop at bottom goal.
  18531. Retracting rl*prefer*rvt*predict-no*H0*6
  18532. -->
  18533. (S1 ^operator O2052 = 0.9999999999999999)
  18534. Retracting rl*prefer*rvt*predict-yes*H0*5
  18535. -->
  18536. (S1 ^operator O2051 = 0.)
  18537. --- END Proposal Phase ---
  18538. --- Decision Phase ---
  18539. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18540. =>WM: (14428: S1 ^operator O2054)
  18541. 1027: O: O2054 (predict-no)
  18542. --- END Decision Phase ---
  18543. --- Application Phase ---
  18544. --- Firing Productions (PE) For State At Depth 1 ---
  18545. --- Inner Elaboration Phase, active level 1 (S1) ---
  18546. Firing apply*operator
  18547. -->
  18548. (I3 ^predict-no N1027 + :O )
  18549. Firing apply*operator*complete
  18550. -->
  18551. (I3 ^predict-no N1026 - :O )
  18552. inner elaboration loop at bottom goal.
  18553. --- Change Working Memory (PE) ---
  18554. =>WM: (14429: I3 ^predict-no N1027)
  18555. <=WM: (14417: N1026 ^status complete)
  18556. <=WM: (14416: I3 ^predict-no N1026)
  18557. --- Firing Productions (IE) For State At Depth 1 ---
  18558. --- Inner Elaboration Phase, active level 1 (S1) ---
  18559. Firing monitor*world
  18560. -->
  18561. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18562. --- Change Working Memory (IE) ---
  18563. --- END Application Phase ---
  18564. --- Output Phase ---
  18565. ENV: Agent did: predict-no for direction U in state State-B
  18566. In State-B moving U
  18567. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18568. predict error 0
  18569. dir: dir isU
  18570. --- END Output Phase ---
  18571. /|--- Input Phase ---
  18572. =>WM: (14433: I2 ^dir U)
  18573. =>WM: (14432: I2 ^reward 1)
  18574. =>WM: (14431: I2 ^see 0)
  18575. =>WM: (14430: N1027 ^status complete)
  18576. <=WM: (14420: I2 ^dir U)
  18577. <=WM: (14419: I2 ^reward 1)
  18578. <=WM: (14418: I2 ^see 0)
  18579. =>WM: (14434: I2 ^level-1 R0-root)
  18580. <=WM: (14421: I2 ^level-1 R0-root)
  18581. --- END Input Phase ---
  18582. --- Proposal Phase ---
  18583. --- Inner Elaboration Phase, active level 1 (S1) ---
  18584. Firing elaborate*copy-see-to-output-link
  18585. -->
  18586. (I3 ^see 0 +)
  18587. Firing elaborate*reward*based*on*reward
  18588. -->
  18589. (R1031 ^value 1 +)
  18590. (R1 ^reward R1031 +)
  18591. Firing propose*predict-yes
  18592. -->
  18593. (O2055 ^name predict-yes +)
  18594. (S1 ^operator O2055 +)
  18595. Firing propose*predict-no
  18596. -->
  18597. (O2056 ^name predict-no +)
  18598. (S1 ^operator O2056 +)
  18599. Firing rl*prefer*rvt*predict-no*H0*6
  18600. -->
  18601. (S1 ^operator O2054 = 0.9999999999999999)
  18602. Firing rl*prefer*rvt*predict-yes*H0*5
  18603. -->
  18604. (S1 ^operator O2053 = 0.)
  18605. Firing prefer*rvt*predict-yes*H0
  18606. -->
  18607. Firing prefer*rvt*predict-no*H0
  18608. -->
  18609. Firing elaborate*copy-dir-to-output-link
  18610. -->
  18611. (I3 ^dir U +)
  18612. inner elaboration loop at bottom goal.
  18613. Retracting elaborate*copy-see-to-output-link
  18614. -->
  18615. (I3 ^see 0 +)
  18616. Retracting propose*predict-no
  18617. -->
  18618. (O2054 ^name predict-no +)
  18619. (S1 ^operator O2054 +)
  18620. Retracting propose*predict-yes
  18621. -->
  18622. (O2053 ^name predict-yes +)
  18623. (S1 ^operator O2053 +)
  18624. Retracting elaborate*reward*based*on*reward
  18625. -->
  18626. (R1030 ^value 1 +)
  18627. (R1 ^reward R1030 +)
  18628. Retracting elaborate*copy-dir-to-output-link
  18629. -->
  18630. (I3 ^dir U +)
  18631. Retracting rl*prefer*rvt*predict-no*H0*6
  18632. -->
  18633. (S1 ^operator O2054 = 0.9999999999999999)
  18634. Retracting rl*prefer*rvt*predict-yes*H0*5
  18635. -->
  18636. (S1 ^operator O2053 = 0.)
  18637. =>WM: (14440: S1 ^operator O2056 +)
  18638. =>WM: (14439: S1 ^operator O2055 +)
  18639. =>WM: (14438: O2056 ^name predict-no)
  18640. =>WM: (14437: O2055 ^name predict-yes)
  18641. =>WM: (14436: R1031 ^value 1)
  18642. =>WM: (14435: R1 ^reward R1031)
  18643. <=WM: (14426: S1 ^operator O2053 +)
  18644. <=WM: (14427: S1 ^operator O2054 +)
  18645. <=WM: (14428: S1 ^operator O2054)
  18646. <=WM: (14422: R1 ^reward R1030)
  18647. <=WM: (14425: O2054 ^name predict-no)
  18648. <=WM: (14424: O2053 ^name predict-yes)
  18649. <=WM: (14423: R1030 ^value 1)
  18650. --- Inner Elaboration Phase, active level 1 (S1) ---
  18651. Firing prefer*rvt*predict-yes*H0
  18652. -->
  18653. Firing rl*prefer*rvt*predict-yes*H0*5
  18654. -->
  18655. (S1 ^operator O2055 = 0.)
  18656. Firing prefer*rvt*predict-no*H0
  18657. -->
  18658. Firing rl*prefer*rvt*predict-no*H0*6
  18659. -->
  18660. (S1 ^operator O2056 = 0.9999999999999999)
  18661. inner elaboration loop at bottom goal.
  18662. Retracting rl*prefer*rvt*predict-no*H0*6
  18663. -->
  18664. (S1 ^operator O2054 = 0.9999999999999999)
  18665. Retracting rl*prefer*rvt*predict-yes*H0*5
  18666. -->
  18667. (S1 ^operator O2053 = 0.)
  18668. --- END Proposal Phase ---
  18669. --- Decision Phase ---
  18670. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18671. =>WM: (14441: S1 ^operator O2056)
  18672. 1028: O: O2056 (predict-no)
  18673. --- END Decision Phase ---
  18674. --- Application Phase ---
  18675. --- Firing Productions (PE) For State At Depth 1 ---
  18676. --- Inner Elaboration Phase, active level 1 (S1) ---
  18677. Firing apply*operator
  18678. -->
  18679. (I3 ^predict-no N1028 + :O )
  18680. Firing apply*operator*complete
  18681. -->
  18682. (I3 ^predict-no N1027 - :O )
  18683. inner elaboration loop at bottom goal.
  18684. --- Change Working Memory (PE) ---
  18685. =>WM: (14442: I3 ^predict-no N1028)
  18686. <=WM: (14430: N1027 ^status complete)
  18687. <=WM: (14429: I3 ^predict-no N1027)
  18688. --- Firing Productions (IE) For State At Depth 1 ---
  18689. --- Inner Elaboration Phase, active level 1 (S1) ---
  18690. Firing monitor*world
  18691. -->
  18692. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18693. --- Change Working Memory (IE) ---
  18694. --- END Application Phase ---
  18695. --- Output Phase ---
  18696. ENV: Agent did: predict-no for direction U in state State-B
  18697. In State-B moving U
  18698. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18699. predict error 0
  18700. dir: dir isL
  18701. --- END Output Phase ---
  18702. \--- Input Phase ---
  18703. =>WM: (14446: I2 ^dir L)
  18704. =>WM: (14445: I2 ^reward 1)
  18705. =>WM: (14444: I2 ^see 0)
  18706. =>WM: (14443: N1028 ^status complete)
  18707. <=WM: (14433: I2 ^dir U)
  18708. <=WM: (14432: I2 ^reward 1)
  18709. <=WM: (14431: I2 ^see 0)
  18710. =>WM: (14447: I2 ^level-1 R0-root)
  18711. <=WM: (14434: I2 ^level-1 R0-root)
  18712. --- END Input Phase ---
  18713. --- Proposal Phase ---
  18714. --- Inner Elaboration Phase, active level 1 (S1) ---
  18715. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  18716. -->
  18717. (S1 ^operator O2055 = 0.4768819180211759)
  18718. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  18719. -->
  18720. (S1 ^operator O2056 = 0.1700769046561409)
  18721. Firing prefer*rvt*predict-no*H0*2*H1
  18722. -->
  18723. Firing prefer*rvt*predict-yes*H0*1*H1
  18724. -->
  18725. Firing elaborate*copy-see-to-output-link
  18726. -->
  18727. (I3 ^see 0 +)
  18728. Firing elaborate*reward*based*on*reward
  18729. -->
  18730. (R1032 ^value 1 +)
  18731. (R1 ^reward R1032 +)
  18732. Firing propose*predict-yes
  18733. -->
  18734. (O2057 ^name predict-yes +)
  18735. (S1 ^operator O2057 +)
  18736. Firing propose*predict-no
  18737. -->
  18738. (O2058 ^name predict-no +)
  18739. (S1 ^operator O2058 +)
  18740. Firing rl*prefer*rvt*predict-no*H0*2
  18741. -->
  18742. (S1 ^operator O2056 = 0.2550133872099196)
  18743. Firing rl*prefer*rvt*predict-yes*H0*1
  18744. -->
  18745. (S1 ^operator O2055 = 0.5231198846221292)
  18746. Firing prefer*rvt*predict-yes*H0
  18747. -->
  18748. Firing prefer*rvt*predict-no*H0
  18749. -->
  18750. Firing elaborate*copy-dir-to-output-link
  18751. -->
  18752. (I3 ^dir L +)
  18753. inner elaboration loop at bottom goal.
  18754. Retracting elaborate*copy-see-to-output-link
  18755. -->
  18756. (I3 ^see 0 +)
  18757. Retracting propose*predict-no
  18758. -->
  18759. (O2056 ^name predict-no +)
  18760. (S1 ^operator O2056 +)
  18761. Retracting propose*predict-yes
  18762. -->
  18763. (O2055 ^name predict-yes +)
  18764. (S1 ^operator O2055 +)
  18765. Retracting elaborate*reward*based*on*reward
  18766. -->
  18767. (R1031 ^value 1 +)
  18768. (R1 ^reward R1031 +)
  18769. Retracting elaborate*copy-dir-to-output-link
  18770. -->
  18771. (I3 ^dir U +)
  18772. Retracting rl*prefer*rvt*predict-no*H0*6
  18773. -->
  18774. (S1 ^operator O2056 = 0.9999999999999999)
  18775. Retracting rl*prefer*rvt*predict-yes*H0*5
  18776. -->
  18777. (S1 ^operator O2055 = 0.)
  18778. =>WM: (14454: S1 ^operator O2058 +)
  18779. =>WM: (14453: S1 ^operator O2057 +)
  18780. =>WM: (14452: I3 ^dir L)
  18781. =>WM: (14451: O2058 ^name predict-no)
  18782. =>WM: (14450: O2057 ^name predict-yes)
  18783. =>WM: (14449: R1032 ^value 1)
  18784. =>WM: (14448: R1 ^reward R1032)
  18785. <=WM: (14439: S1 ^operator O2055 +)
  18786. <=WM: (14440: S1 ^operator O2056 +)
  18787. <=WM: (14441: S1 ^operator O2056)
  18788. <=WM: (14399: I3 ^dir U)
  18789. <=WM: (14435: R1 ^reward R1031)
  18790. <=WM: (14438: O2056 ^name predict-no)
  18791. <=WM: (14437: O2055 ^name predict-yes)
  18792. <=WM: (14436: R1031 ^value 1)
  18793. --- Inner Elaboration Phase, active level 1 (S1) ---
  18794. Firing prefer*rvt*predict-yes*H0
  18795. -->
  18796. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  18797. -->
  18798. (S1 ^operator O2057 = 0.4768819180211759)
  18799. Firing rl*prefer*rvt*predict-yes*H0*1
  18800. -->
  18801. (S1 ^operator O2057 = 0.5231198846221292)
  18802. Firing prefer*rvt*predict-yes*H0*1*H1
  18803. -->
  18804. Firing prefer*rvt*predict-no*H0
  18805. -->
  18806. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  18807. -->
  18808. (S1 ^operator O2058 = 0.1700769046561409)
  18809. Firing rl*prefer*rvt*predict-no*H0*2
  18810. -->
  18811. (S1 ^operator O2058 = 0.2550133872099196)
  18812. Firing prefer*rvt*predict-no*H0*2*H1
  18813. -->
  18814. inner elaboration loop at bottom goal.
  18815. Retracting rl*prefer*rvt*predict-no*H0*2
  18816. -->
  18817. (S1 ^operator O2056 = 0.2550133872099196)
  18818. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  18819. -->
  18820. (S1 ^operator O2056 = 0.1700769046561409)
  18821. Retracting rl*prefer*rvt*predict-yes*H0*1
  18822. -->
  18823. (S1 ^operator O2055 = 0.5231198846221292)
  18824. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  18825. -->
  18826. (S1 ^operator O2055 = 0.4768819180211759)
  18827. --- END Proposal Phase ---
  18828. --- Decision Phase ---
  18829. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18830. =>WM: (14455: S1 ^operator O2057)
  18831. 1029: O: O2057 (predict-yes)
  18832. --- END Decision Phase ---
  18833. --- Application Phase ---
  18834. --- Firing Productions (PE) For State At Depth 1 ---
  18835. --- Inner Elaboration Phase, active level 1 (S1) ---
  18836. Firing apply*operator
  18837. -->
  18838. (I3 ^predict-yes N1029 + :O )
  18839. Firing apply*operator*complete
  18840. -->
  18841. (I3 ^predict-no N1028 - :O )
  18842. inner elaboration loop at bottom goal.
  18843. --- Change Working Memory (PE) ---
  18844. =>WM: (14456: I3 ^predict-yes N1029)
  18845. <=WM: (14443: N1028 ^status complete)
  18846. <=WM: (14442: I3 ^predict-no N1028)
  18847. --- Firing Productions (IE) For State At Depth 1 ---
  18848. --- Inner Elaboration Phase, active level 1 (S1) ---
  18849. Firing monitor*world
  18850. -->
  18851. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18852. --- Change Working Memory (IE) ---
  18853. --- END Application Phase ---
  18854. --- Output Phase ---
  18855. ENV: Agent did: predict-yes for direction L in state State-B
  18856. In State-B moving L
  18857. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  18858. predict error 0
  18859. dir: dir isU
  18860. --- END Output Phase ---
  18861. --- Input Phase ---
  18862. =>WM: (14460: I2 ^dir U)
  18863. =>WM: (14459: I2 ^reward 1)
  18864. =>WM: (14458: I2 ^see 1)
  18865. =>WM: (14457: N1029 ^status complete)
  18866. <=WM: (14446: I2 ^dir L)
  18867. <=WM: (14445: I2 ^reward 1)
  18868. <=WM: (14444: I2 ^see 0)
  18869. =>WM: (14461: I2 ^level-1 L1-root)
  18870. <=WM: (14447: I2 ^level-1 R0-root)
  18871. --- END Input Phase ---
  18872. --- Proposal Phase ---
  18873. --- Inner Elaboration Phase, active level 1 (S1) ---
  18874. Firing elaborate*copy-see-to-output-link
  18875. -->
  18876. (I3 ^see 1 +)
  18877. Firing elaborate*reward*based*on*reward
  18878. -->
  18879. (R1033 ^value 1 +)
  18880. (R1 ^reward R1033 +)
  18881. Firing propose*predict-yes
  18882. -->
  18883. (O2059 ^name predict-yes +)
  18884. (S1 ^operator O2059 +)
  18885. Firing propose*predict-no
  18886. -->
  18887. (O2060 ^name predict-no +)
  18888. (S1 ^operator O2060 +)
  18889. Firing rl*prefer*rvt*predict-no*H0*6
  18890. -->
  18891. (S1 ^operator O2058 = 0.9999999999999999)
  18892. Firing rl*prefer*rvt*predict-yes*H0*5
  18893. -->
  18894. (S1 ^operator O2057 = 0.)
  18895. Firing prefer*rvt*predict-yes*H0
  18896. -->
  18897. Firing prefer*rvt*predict-no*H0
  18898. -->
  18899. Firing elaborate*copy-dir-to-output-link
  18900. -->
  18901. (I3 ^dir U +)
  18902. inner elaboration loop at bottom goal.
  18903. Retracting elaborate*copy-see-to-output-link
  18904. -->
  18905. (I3 ^see 0 +)
  18906. Retracting propose*predict-no
  18907. -->
  18908. (O2058 ^name predict-no +)
  18909. (S1 ^operator O2058 +)
  18910. Retracting propose*predict-yes
  18911. -->
  18912. (O2057 ^name predict-yes +)
  18913. (S1 ^operator O2057 +)
  18914. Retracting elaborate*reward*based*on*reward
  18915. -->
  18916. (R1032 ^value 1 +)
  18917. (R1 ^reward R1032 +)
  18918. Retracting elaborate*copy-dir-to-output-link
  18919. -->
  18920. (I3 ^dir L +)
  18921. Retracting rl*prefer*rvt*predict-no*H0*2
  18922. -->
  18923. (S1 ^operator O2058 = 0.2550133872099196)
  18924. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  18925. -->
  18926. (S1 ^operator O2058 = 0.1700769046561409)
  18927. Retracting rl*prefer*rvt*predict-yes*H0*1
  18928. -->
  18929. (S1 ^operator O2057 = 0.5231198846221292)
  18930. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  18931. -->
  18932. (S1 ^operator O2057 = 0.4768819180211759)
  18933. =>WM: (14469: S1 ^operator O2060 +)
  18934. =>WM: (14468: S1 ^operator O2059 +)
  18935. =>WM: (14467: I3 ^dir U)
  18936. =>WM: (14466: O2060 ^name predict-no)
  18937. =>WM: (14465: O2059 ^name predict-yes)
  18938. =>WM: (14464: R1033 ^value 1)
  18939. =>WM: (14463: R1 ^reward R1033)
  18940. =>WM: (14462: I3 ^see 1)
  18941. <=WM: (14453: S1 ^operator O2057 +)
  18942. <=WM: (14455: S1 ^operator O2057)
  18943. <=WM: (14454: S1 ^operator O2058 +)
  18944. <=WM: (14452: I3 ^dir L)
  18945. <=WM: (14448: R1 ^reward R1032)
  18946. <=WM: (14394: I3 ^see 0)
  18947. <=WM: (14451: O2058 ^name predict-no)
  18948. <=WM: (14450: O2057 ^name predict-yes)
  18949. <=WM: (14449: R1032 ^value 1)
  18950. --- Inner Elaboration Phase, active level 1 (S1) ---
  18951. Firing prefer*rvt*predict-yes*H0
  18952. -->
  18953. Firing rl*prefer*rvt*predict-yes*H0*5
  18954. -->
  18955. (S1 ^operator O2059 = 0.)
  18956. Firing prefer*rvt*predict-no*H0
  18957. -->
  18958. Firing rl*prefer*rvt*predict-no*H0*6
  18959. -->
  18960. (S1 ^operator O2060 = 0.9999999999999999)
  18961. inner elaboration loop at bottom goal.
  18962. Retracting rl*prefer*rvt*predict-no*H0*6
  18963. -->
  18964. (S1 ^operator O2058 = 0.9999999999999999)
  18965. Retracting rl*prefer*rvt*predict-yes*H0*5
  18966. -->
  18967. (S1 ^operator O2057 = 0.)
  18968. --- END Proposal Phase ---
  18969. --- Decision Phase ---
  18970. RL update rl*prefer*rvt*predict-yes*H0*1 0.72796 -0.20484 0.52312 -> 0.727959 -0.20484 0.52312(R,m,v=1,0.97973,0.0199945)
  18971. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272043 0.204839 0.476882 -> 0.272042 0.204839 0.476882(R,m,v=1,1,0)
  18972. =>WM: (14470: S1 ^operator O2060)
  18973. 1030: O: O2060 (predict-no)
  18974. --- END Decision Phase ---
  18975. --- Application Phase ---
  18976. --- Firing Productions (PE) For State At Depth 1 ---
  18977. --- Inner Elaboration Phase, active level 1 (S1) ---
  18978. Firing apply*operator
  18979. -->
  18980. (I3 ^predict-no N1030 + :O )
  18981. Firing apply*operator*complete
  18982. -->
  18983. (I3 ^predict-yes N1029 - :O )
  18984. inner elaboration loop at bottom goal.
  18985. --- Change Working Memory (PE) ---
  18986. =>WM: (14471: I3 ^predict-no N1030)
  18987. <=WM: (14457: N1029 ^status complete)
  18988. <=WM: (14456: I3 ^predict-yes N1029)
  18989. --- Firing Productions (IE) For State At Depth 1 ---
  18990. --- Inner Elaboration Phase, active level 1 (S1) ---
  18991. Firing monitor*world
  18992. -->
  18993. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18994. --- Change Working Memory (IE) ---
  18995. --- END Application Phase ---
  18996. --- Output Phase ---
  18997. ENV: Agent did: predict-no for direction U in state State-A
  18998. In State-A moving U
  18999. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19000. predict error 0
  19001. dir: dir isL
  19002. --- END Output Phase ---
  19003. --- Input Phase ---
  19004. =>WM: (14475: I2 ^dir L)
  19005. =>WM: (14474: I2 ^reward 1)
  19006. =>WM: (14473: I2 ^see 0)
  19007. =>WM: (14472: N1030 ^status complete)
  19008. <=WM: (14460: I2 ^dir U)
  19009. <=WM: (14459: I2 ^reward 1)
  19010. <=WM: (14458: I2 ^see 1)
  19011. =>WM: (14476: I2 ^level-1 L1-root)
  19012. <=WM: (14461: I2 ^level-1 L1-root)
  19013. --- END Input Phase ---
  19014. --- Proposal Phase ---
  19015. --- Inner Elaboration Phase, active level 1 (S1) ---
  19016. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  19017. -->
  19018. (S1 ^operator O2059 = 0.1693592933936033)
  19019. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  19020. -->
  19021. (S1 ^operator O2060 = 0.7449864907562294)
  19022. Firing prefer*rvt*predict-no*H0*2*H1
  19023. -->
  19024. Firing prefer*rvt*predict-yes*H0*1*H1
  19025. -->
  19026. Firing elaborate*copy-see-to-output-link
  19027. -->
  19028. (I3 ^see 0 +)
  19029. Firing elaborate*reward*based*on*reward
  19030. -->
  19031. (R1034 ^value 1 +)
  19032. (R1 ^reward R1034 +)
  19033. Firing propose*predict-yes
  19034. -->
  19035. (O2061 ^name predict-yes +)
  19036. (S1 ^operator O2061 +)
  19037. Firing propose*predict-no
  19038. -->
  19039. (O2062 ^name predict-no +)
  19040. (S1 ^operator O2062 +)
  19041. Firing rl*prefer*rvt*predict-no*H0*2
  19042. -->
  19043. (S1 ^operator O2060 = 0.2550133872099196)
  19044. Firing rl*prefer*rvt*predict-yes*H0*1
  19045. -->
  19046. (S1 ^operator O2059 = 0.5231196142256334)
  19047. Firing prefer*rvt*predict-yes*H0
  19048. -->
  19049. Firing prefer*rvt*predict-no*H0
  19050. -->
  19051. Firing elaborate*copy-dir-to-output-link
  19052. -->
  19053. (I3 ^dir L +)
  19054. inner elaboration loop at bottom goal.
  19055. Retracting elaborate*copy-see-to-output-link
  19056. -->
  19057. (I3 ^see 1 +)
  19058. Retracting propose*predict-no
  19059. -->
  19060. (O2060 ^name predict-no +)
  19061. (S1 ^operator O2060 +)
  19062. Retracting propose*predict-yes
  19063. -->
  19064. (O2059 ^name predict-yes +)
  19065. (S1 ^operator O2059 +)
  19066. Retracting elaborate*reward*based*on*reward
  19067. -->
  19068. (R1033 ^value 1 +)
  19069. (R1 ^reward R1033 +)
  19070. Retracting elaborate*copy-dir-to-output-link
  19071. -->
  19072. (I3 ^dir U +)
  19073. Retracting rl*prefer*rvt*predict-no*H0*6
  19074. -->
  19075. (S1 ^operator O2060 = 0.9999999999999999)
  19076. Retracting rl*prefer*rvt*predict-yes*H0*5
  19077. -->
  19078. (S1 ^operator O2059 = 0.)
  19079. =>WM: (14484: S1 ^operator O2062 +)
  19080. =>WM: (14483: S1 ^operator O2061 +)
  19081. =>WM: (14482: I3 ^dir L)
  19082. =>WM: (14481: O2062 ^name predict-no)
  19083. =>WM: (14480: O2061 ^name predict-yes)
  19084. =>WM: (14479: R1034 ^value 1)
  19085. =>WM: (14478: R1 ^reward R1034)
  19086. =>WM: (14477: I3 ^see 0)
  19087. <=WM: (14468: S1 ^operator O2059 +)
  19088. <=WM: (14469: S1 ^operator O2060 +)
  19089. <=WM: (14470: S1 ^operator O2060)
  19090. <=WM: (14467: I3 ^dir U)
  19091. <=WM: (14463: R1 ^reward R1033)
  19092. <=WM: (14462: I3 ^see 1)
  19093. <=WM: (14466: O2060 ^name predict-no)
  19094. <=WM: (14465: O2059 ^name predict-yes)
  19095. <=WM: (14464: R1033 ^value 1)
  19096. --- Inner Elaboration Phase, active level 1 (S1) ---
  19097. Firing prefer*rvt*predict-yes*H0
  19098. -->
  19099. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  19100. -->
  19101. (S1 ^operator O2061 = 0.1693592933936033)
  19102. Firing rl*prefer*rvt*predict-yes*H0*1
  19103. -->
  19104. (S1 ^operator O2061 = 0.5231196142256334)
  19105. Firing prefer*rvt*predict-yes*H0*1*H1
  19106. -->
  19107. Firing prefer*rvt*predict-no*H0
  19108. -->
  19109. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  19110. -->
  19111. (S1 ^operator O2062 = 0.7449864907562294)
  19112. Firing rl*prefer*rvt*predict-no*H0*2
  19113. -->
  19114. (S1 ^operator O2062 = 0.2550133872099196)
  19115. Firing prefer*rvt*predict-no*H0*2*H1
  19116. -->
  19117. inner elaboration loop at bottom goal.
  19118. Retracting rl*prefer*rvt*predict-no*H0*2
  19119. -->
  19120. (S1 ^operator O2060 = 0.2550133872099196)
  19121. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  19122. -->
  19123. (S1 ^operator O2060 = 0.7449864907562294)
  19124. Retracting rl*prefer*rvt*predict-yes*H0*1
  19125. -->
  19126. (S1 ^operator O2059 = 0.5231196142256334)
  19127. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  19128. -->
  19129. (S1 ^operator O2059 = 0.1693592933936033)
  19130. --- END Proposal Phase ---
  19131. --- Decision Phase ---
  19132. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19133. =>WM: (14485: S1 ^operator O2062)
  19134. 1031: O: O2062 (predict-no)
  19135. --- END Decision Phase ---
  19136. --- Application Phase ---
  19137. --- Firing Productions (PE) For State At Depth 1 ---
  19138. --- Inner Elaboration Phase, active level 1 (S1) ---
  19139. Firing apply*operator
  19140. -->
  19141. (I3 ^predict-no N1031 + :O )
  19142. Firing apply*operator*complete
  19143. -->
  19144. (I3 ^predict-no N1030 - :O )
  19145. inner elaboration loop at bottom goal.
  19146. --- Change Working Memory (PE) ---
  19147. =>WM: (14486: I3 ^predict-no N1031)
  19148. <=WM: (14472: N1030 ^status complete)
  19149. <=WM: (14471: I3 ^predict-no N1030)
  19150. --- Firing Productions (IE) For State At Depth 1 ---
  19151. --- Inner Elaboration Phase, active level 1 (S1) ---
  19152. Firing monitor*world
  19153. -->
  19154. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19155. --- Change Working Memory (IE) ---
  19156. --- END Application Phase ---
  19157. --- Output Phase ---
  19158. ENV: Agent did: predict-no for direction L in state State-A
  19159. In State-A moving L
  19160. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19161. predict error 0
  19162. dir: dir isL
  19163. --- END Output Phase ---
  19164. --- Input Phase ---
  19165. =>WM: (14490: I2 ^dir L)
  19166. =>WM: (14489: I2 ^reward 1)
  19167. =>WM: (14488: I2 ^see 0)
  19168. =>WM: (14487: N1031 ^status complete)
  19169. <=WM: (14475: I2 ^dir L)
  19170. <=WM: (14474: I2 ^reward 1)
  19171. <=WM: (14473: I2 ^see 0)
  19172. =>WM: (14491: I2 ^level-1 L0-root)
  19173. <=WM: (14476: I2 ^level-1 L1-root)
  19174. --- END Input Phase ---
  19175. --- Proposal Phase ---
  19176. --- Inner Elaboration Phase, active level 1 (S1) ---
  19177. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  19178. -->
  19179. (S1 ^operator O2061 = 0.3)
  19180. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  19181. -->
  19182. (S1 ^operator O2062 = 0.7449866897649052)
  19183. Firing prefer*rvt*predict-no*H0*2*H1
  19184. -->
  19185. Firing prefer*rvt*predict-yes*H0*1*H1
  19186. -->
  19187. Firing elaborate*copy-see-to-output-link
  19188. -->
  19189. (I3 ^see 0 +)
  19190. Firing elaborate*reward*based*on*reward
  19191. -->
  19192. (R1035 ^value 1 +)
  19193. (R1 ^reward R1035 +)
  19194. Firing propose*predict-yes
  19195. -->
  19196. (O2063 ^name predict-yes +)
  19197. (S1 ^operator O2063 +)
  19198. Firing propose*predict-no
  19199. -->
  19200. (O2064 ^name predict-no +)
  19201. (S1 ^operator O2064 +)
  19202. Firing rl*prefer*rvt*predict-no*H0*2
  19203. -->
  19204. (S1 ^operator O2062 = 0.2550133872099196)
  19205. Firing rl*prefer*rvt*predict-yes*H0*1
  19206. -->
  19207. (S1 ^operator O2061 = 0.5231196142256334)
  19208. Firing prefer*rvt*predict-yes*H0
  19209. -->
  19210. Firing prefer*rvt*predict-no*H0
  19211. -->
  19212. Firing elaborate*copy-dir-to-output-link
  19213. -->
  19214. (I3 ^dir L +)
  19215. inner elaboration loop at bottom goal.
  19216. Retracting elaborate*copy-see-to-output-link
  19217. -->
  19218. (I3 ^see 0 +)
  19219. Retracting propose*predict-no
  19220. -->
  19221. (O2062 ^name predict-no +)
  19222. (S1 ^operator O2062 +)
  19223. Retracting propose*predict-yes
  19224. -->
  19225. (O2061 ^name predict-yes +)
  19226. (S1 ^operator O2061 +)
  19227. Retracting elaborate*reward*based*on*reward
  19228. -->
  19229. (R1034 ^value 1 +)
  19230. (R1 ^reward R1034 +)
  19231. Retracting elaborate*copy-dir-to-output-link
  19232. -->
  19233. (I3 ^dir L +)
  19234. Retracting rl*prefer*rvt*predict-no*H0*2
  19235. -->
  19236. (S1 ^operator O2062 = 0.2550133872099196)
  19237. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  19238. -->
  19239. (S1 ^operator O2062 = 0.7449864907562294)
  19240. Retracting rl*prefer*rvt*predict-yes*H0*1
  19241. -->
  19242. (S1 ^operator O2061 = 0.5231196142256334)
  19243. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  19244. -->
  19245. (S1 ^operator O2061 = 0.1693592933936033)
  19246. =>WM: (14497: S1 ^operator O2064 +)
  19247. =>WM: (14496: S1 ^operator O2063 +)
  19248. =>WM: (14495: O2064 ^name predict-no)
  19249. =>WM: (14494: O2063 ^name predict-yes)
  19250. =>WM: (14493: R1035 ^value 1)
  19251. =>WM: (14492: R1 ^reward R1035)
  19252. <=WM: (14483: S1 ^operator O2061 +)
  19253. <=WM: (14484: S1 ^operator O2062 +)
  19254. <=WM: (14485: S1 ^operator O2062)
  19255. <=WM: (14478: R1 ^reward R1034)
  19256. <=WM: (14481: O2062 ^name predict-no)
  19257. <=WM: (14480: O2061 ^name predict-yes)
  19258. <=WM: (14479: R1034 ^value 1)
  19259. --- Inner Elaboration Phase, active level 1 (S1) ---
  19260. Firing prefer*rvt*predict-yes*H0
  19261. -->
  19262. Firing rl*prefer*rvt*predict-yes*H0*1
  19263. -->
  19264. (S1 ^operator O2063 = 0.5231196142256334)
  19265. Firing prefer*rvt*predict-yes*H0*1*H1
  19266. -->
  19267. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  19268. -->
  19269. (S1 ^operator O2063 = 0.3)
  19270. Firing prefer*rvt*predict-no*H0
  19271. -->
  19272. Firing rl*prefer*rvt*predict-no*H0*2
  19273. -->
  19274. (S1 ^operator O2064 = 0.2550133872099196)
  19275. Firing prefer*rvt*predict-no*H0*2*H1
  19276. -->
  19277. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  19278. -->
  19279. (S1 ^operator O2064 = 0.7449866897649052)
  19280. inner elaboration loop at bottom goal.
  19281. Retracting rl*prefer*rvt*predict-no*H0*2
  19282. -->
  19283. (S1 ^operator O2062 = 0.2550133872099196)
  19284. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  19285. -->
  19286. (S1 ^operator O2062 = 0.7449866897649052)
  19287. Retracting rl*prefer*rvt*predict-yes*H0*1
  19288. -->
  19289. (S1 ^operator O2061 = 0.5231196142256334)
  19290. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  19291. -->
  19292. (S1 ^operator O2061 = 0.3)
  19293. --- END Proposal Phase ---
  19294. --- Decision Phase ---
  19295. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.919598,0.0743109)
  19296. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744986 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  19297. =>WM: (14498: S1 ^operator O2064)
  19298. 1032: O: O2064 (predict-no)
  19299. --- END Decision Phase ---
  19300. --- Application Phase ---
  19301. --- Firing Productions (PE) For State At Depth 1 ---
  19302. --- Inner Elaboration Phase, active level 1 (S1) ---
  19303. Firing apply*operator
  19304. -->
  19305. (I3 ^predict-no N1032 + :O )
  19306. Firing apply*operator*complete
  19307. -->
  19308. (I3 ^predict-no N1031 - :O )
  19309. inner elaboration loop at bottom goal.
  19310. --- Change Working Memory (PE) ---
  19311. =>WM: (14499: I3 ^predict-no N1032)
  19312. <=WM: (14487: N1031 ^status complete)
  19313. <=WM: (14486: I3 ^predict-no N1031)
  19314. --- Firing Productions (IE) For State At Depth 1 ---
  19315. --- Inner Elaboration Phase, active level 1 (S1) ---
  19316. Firing monitor*world
  19317. -->
  19318. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19319. --- Change Working Memory (IE) ---
  19320. --- END Application Phase ---
  19321. --- Output Phase ---
  19322. ENV: Agent did: predict-no for direction L in state State-A
  19323. In State-A moving L
  19324. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19325. predict error 0
  19326. dir: dir isR
  19327. --- END Output Phase ---
  19328. --- Input Phase ---
  19329. =>WM: (14503: I2 ^dir R)
  19330. =>WM: (14502: I2 ^reward 1)
  19331. =>WM: (14501: I2 ^see 0)
  19332. =>WM: (14500: N1032 ^status complete)
  19333. <=WM: (14490: I2 ^dir L)
  19334. <=WM: (14489: I2 ^reward 1)
  19335. <=WM: (14488: I2 ^see 0)
  19336. =>WM: (14504: I2 ^level-1 L0-root)
  19337. <=WM: (14491: I2 ^level-1 L0-root)
  19338. --- END Input Phase ---
  19339. --- Proposal Phase ---
  19340. --- Inner Elaboration Phase, active level 1 (S1) ---
  19341. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  19342. -->
  19343. (S1 ^operator O2063 = 0.6170732174748315)
  19344. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  19345. -->
  19346. (S1 ^operator O2064 = 0.4910065094545203)
  19347. Firing prefer*rvt*predict-no*H0*4*H1
  19348. -->
  19349. Firing prefer*rvt*predict-yes*H0*3*H1
  19350. -->
  19351. Firing elaborate*copy-see-to-output-link
  19352. -->
  19353. (I3 ^see 0 +)
  19354. Firing elaborate*reward*based*on*reward
  19355. -->
  19356. (R1036 ^value 1 +)
  19357. (R1 ^reward R1036 +)
  19358. Firing propose*predict-yes
  19359. -->
  19360. (O2065 ^name predict-yes +)
  19361. (S1 ^operator O2065 +)
  19362. Firing propose*predict-no
  19363. -->
  19364. (O2066 ^name predict-no +)
  19365. (S1 ^operator O2066 +)
  19366. Firing rl*prefer*rvt*predict-no*H0*4
  19367. -->
  19368. (S1 ^operator O2064 = 0.1269767912163215)
  19369. Firing rl*prefer*rvt*predict-yes*H0*3
  19370. -->
  19371. (S1 ^operator O2063 = 0.3829453632213463)
  19372. Firing prefer*rvt*predict-yes*H0
  19373. -->
  19374. Firing prefer*rvt*predict-no*H0
  19375. -->
  19376. Firing elaborate*copy-dir-to-output-link
  19377. -->
  19378. (I3 ^dir R +)
  19379. inner elaboration loop at bottom goal.
  19380. Retracting elaborate*copy-see-to-output-link
  19381. -->
  19382. (I3 ^see 0 +)
  19383. Retracting propose*predict-no
  19384. -->
  19385. (O2064 ^name predict-no +)
  19386. (S1 ^operator O2064 +)
  19387. Retracting propose*predict-yes
  19388. -->
  19389. (O2063 ^name predict-yes +)
  19390. (S1 ^operator O2063 +)
  19391. Retracting elaborate*reward*based*on*reward
  19392. -->
  19393. (R1035 ^value 1 +)
  19394. (R1 ^reward R1035 +)
  19395. Retracting elaborate*copy-dir-to-output-link
  19396. -->
  19397. (I3 ^dir L +)
  19398. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  19399. -->
  19400. (S1 ^operator O2064 = 0.7449866897649052)
  19401. Retracting rl*prefer*rvt*predict-no*H0*2
  19402. -->
  19403. (S1 ^operator O2064 = 0.2550134055149972)
  19404. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  19405. -->
  19406. (S1 ^operator O2063 = 0.3)
  19407. Retracting rl*prefer*rvt*predict-yes*H0*1
  19408. -->
  19409. (S1 ^operator O2063 = 0.5231196142256334)
  19410. =>WM: (14511: S1 ^operator O2066 +)
  19411. =>WM: (14510: S1 ^operator O2065 +)
  19412. =>WM: (14509: I3 ^dir R)
  19413. =>WM: (14508: O2066 ^name predict-no)
  19414. =>WM: (14507: O2065 ^name predict-yes)
  19415. =>WM: (14506: R1036 ^value 1)
  19416. =>WM: (14505: R1 ^reward R1036)
  19417. <=WM: (14496: S1 ^operator O2063 +)
  19418. <=WM: (14497: S1 ^operator O2064 +)
  19419. <=WM: (14498: S1 ^operator O2064)
  19420. <=WM: (14482: I3 ^dir L)
  19421. <=WM: (14492: R1 ^reward R1035)
  19422. <=WM: (14495: O2064 ^name predict-no)
  19423. <=WM: (14494: O2063 ^name predict-yes)
  19424. <=WM: (14493: R1035 ^value 1)
  19425. --- Inner Elaboration Phase, active level 1 (S1) ---
  19426. Firing prefer*rvt*predict-yes*H0
  19427. -->
  19428. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  19429. -->
  19430. (S1 ^operator O2065 = 0.6170732174748315)
  19431. Firing rl*prefer*rvt*predict-yes*H0*3
  19432. -->
  19433. (S1 ^operator O2065 = 0.3829453632213463)
  19434. Firing prefer*rvt*predict-yes*H0*3*H1
  19435. -->
  19436. Firing prefer*rvt*predict-no*H0
  19437. -->
  19438. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  19439. -->
  19440. (S1 ^operator O2066 = 0.4910065094545203)
  19441. Firing rl*prefer*rvt*predict-no*H0*4
  19442. -->
  19443. (S1 ^operator O2066 = 0.1269767912163215)
  19444. Firing prefer*rvt*predict-no*H0*4*H1
  19445. -->
  19446. inner elaboration loop at bottom goal.
  19447. Retracting rl*prefer*rvt*predict-no*H0*4
  19448. -->
  19449. (S1 ^operator O2064 = 0.1269767912163215)
  19450. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  19451. -->
  19452. (S1 ^operator O2064 = 0.4910065094545203)
  19453. Retracting rl*prefer*rvt*predict-yes*H0*3
  19454. -->
  19455. (S1 ^operator O2063 = 0.3829453632213463)
  19456. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  19457. -->
  19458. (S1 ^operator O2063 = 0.6170732174748315)
  19459. --- END Proposal Phase ---
  19460. --- Decision Phase ---
  19461. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.92,0.0739698)
  19462. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  19463. =>WM: (14512: S1 ^operator O2065)
  19464. 1033: O: O2065 (predict-yes)
  19465. --- END Decision Phase ---
  19466. --- Application Phase ---
  19467. --- Firing Productions (PE) For State At Depth 1 ---
  19468. --- Inner Elaboration Phase, active level 1 (S1) ---
  19469. Firing apply*operator
  19470. -->
  19471. (I3 ^predict-yes N1033 + :O )
  19472. Firing apply*operator*complete
  19473. -->
  19474. (I3 ^predict-no N1032 - :O )
  19475. inner elaboration loop at bottom goal.
  19476. --- Change Working Memory (PE) ---
  19477. =>WM: (14513: I3 ^predict-yes N1033)
  19478. <=WM: (14500: N1032 ^status complete)
  19479. <=WM: (14499: I3 ^predict-no N1032)
  19480. --- Firing Productions (IE) For State At Depth 1 ---
  19481. --- Inner Elaboration Phase, active level 1 (S1) ---
  19482. Firing monitor*world
  19483. -->
  19484. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19485. --- Change Working Memory (IE) ---
  19486. --- END Application Phase ---
  19487. --- Output Phase ---
  19488. ENV: Agent did: predict-yes for direction R in state State-A
  19489. In State-A moving R
  19490. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  19491. predict error 0
  19492. dir: dir isU
  19493. --- END Output Phase ---
  19494. --- Input Phase ---
  19495. =>WM: (14517: I2 ^dir U)
  19496. =>WM: (14516: I2 ^reward 1)
  19497. =>WM: (14515: I2 ^see 1)
  19498. =>WM: (14514: N1033 ^status complete)
  19499. <=WM: (14503: I2 ^dir R)
  19500. <=WM: (14502: I2 ^reward 1)
  19501. <=WM: (14501: I2 ^see 0)
  19502. =>WM: (14518: I2 ^level-1 R1-root)
  19503. <=WM: (14504: I2 ^level-1 L0-root)
  19504. --- END Input Phase ---
  19505. --- Proposal Phase ---
  19506. --- Inner Elaboration Phase, active level 1 (S1) ---
  19507. Firing elaborate*copy-see-to-output-link
  19508. -->
  19509. (I3 ^see 1 +)
  19510. Firing elaborate*reward*based*on*reward
  19511. -->
  19512. (R1037 ^value 1 +)
  19513. (R1 ^reward R1037 +)
  19514. Firing propose*predict-yes
  19515. -->
  19516. (O2067 ^name predict-yes +)
  19517. (S1 ^operator O2067 +)
  19518. Firing propose*predict-no
  19519. -->
  19520. (O2068 ^name predict-no +)
  19521. (S1 ^operator O2068 +)
  19522. Firing rl*prefer*rvt*predict-no*H0*6
  19523. -->
  19524. (S1 ^operator O2066 = 0.9999999999999999)
  19525. Firing rl*prefer*rvt*predict-yes*H0*5
  19526. -->
  19527. (S1 ^operator O2065 = 0.)
  19528. Firing prefer*rvt*predict-yes*H0
  19529. -->
  19530. Firing prefer*rvt*predict-no*H0
  19531. -->
  19532. Firing elaborate*copy-dir-to-output-link
  19533. -->
  19534. (I3 ^dir U +)
  19535. inner elaboration loop at bottom goal.
  19536. Retracting elaborate*copy-see-to-output-link
  19537. -->
  19538. (I3 ^see 0 +)
  19539. Retracting propose*predict-no
  19540. -->
  19541. (O2066 ^name predict-no +)
  19542. (S1 ^operator O2066 +)
  19543. Retracting propose*predict-yes
  19544. -->
  19545. (O2065 ^name predict-yes +)
  19546. (S1 ^operator O2065 +)
  19547. Retracting elaborate*reward*based*on*reward
  19548. -->
  19549. (R1036 ^value 1 +)
  19550. (R1 ^reward R1036 +)
  19551. Retracting elaborate*copy-dir-to-output-link
  19552. -->
  19553. (I3 ^dir R +)
  19554. Retracting rl*prefer*rvt*predict-no*H0*4
  19555. -->
  19556. (S1 ^operator O2066 = 0.1269767912163215)
  19557. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  19558. -->
  19559. (S1 ^operator O2066 = 0.4910065094545203)
  19560. Retracting rl*prefer*rvt*predict-yes*H0*3
  19561. -->
  19562. (S1 ^operator O2065 = 0.3829453632213463)
  19563. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  19564. -->
  19565. (S1 ^operator O2065 = 0.6170732174748315)
  19566. =>WM: (14526: S1 ^operator O2068 +)
  19567. =>WM: (14525: S1 ^operator O2067 +)
  19568. =>WM: (14524: I3 ^dir U)
  19569. =>WM: (14523: O2068 ^name predict-no)
  19570. =>WM: (14522: O2067 ^name predict-yes)
  19571. =>WM: (14521: R1037 ^value 1)
  19572. =>WM: (14520: R1 ^reward R1037)
  19573. =>WM: (14519: I3 ^see 1)
  19574. <=WM: (14510: S1 ^operator O2065 +)
  19575. <=WM: (14512: S1 ^operator O2065)
  19576. <=WM: (14511: S1 ^operator O2066 +)
  19577. <=WM: (14509: I3 ^dir R)
  19578. <=WM: (14505: R1 ^reward R1036)
  19579. <=WM: (14477: I3 ^see 0)
  19580. <=WM: (14508: O2066 ^name predict-no)
  19581. <=WM: (14507: O2065 ^name predict-yes)
  19582. <=WM: (14506: R1036 ^value 1)
  19583. --- Inner Elaboration Phase, active level 1 (S1) ---
  19584. Firing prefer*rvt*predict-yes*H0
  19585. -->
  19586. Firing rl*prefer*rvt*predict-yes*H0*5
  19587. -->
  19588. (S1 ^operator O2067 = 0.)
  19589. Firing prefer*rvt*predict-no*H0
  19590. -->
  19591. Firing rl*prefer*rvt*predict-no*H0*6
  19592. -->
  19593. (S1 ^operator O2068 = 0.9999999999999999)
  19594. inner elaboration loop at bottom goal.
  19595. Retracting rl*prefer*rvt*predict-no*H0*6
  19596. -->
  19597. (S1 ^operator O2066 = 0.9999999999999999)
  19598. Retracting rl*prefer*rvt*predict-yes*H0*5
  19599. -->
  19600. (S1 ^operator O2065 = 0.)
  19601. --- END Proposal Phase ---
  19602. --- Decision Phase ---
  19603. RL update rl*prefer*rvt*predict-yes*H0*3 0.673138 -0.290193 0.382945 -> 0.673136 -0.290193 0.382943(R,m,v=1,0.962264,0.0365417)
  19604. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326879 0.290194 0.617073 -> 0.326876 0.290194 0.61707(R,m,v=1,1,0)
  19605. =>WM: (14527: S1 ^operator O2068)
  19606. 1034: O: O2068 (predict-no)
  19607. --- END Decision Phase ---
  19608. --- Application Phase ---
  19609. --- Firing Productions (PE) For State At Depth 1 ---
  19610. --- Inner Elaboration Phase, active level 1 (S1) ---
  19611. Firing apply*operator
  19612. -->
  19613. (I3 ^predict-no N1034 + :O )
  19614. Firing apply*operator*complete
  19615. -->
  19616. (I3 ^predict-yes N1033 - :O )
  19617. inner elaboration loop at bottom goal.
  19618. --- Change Working Memory (PE) ---
  19619. =>WM: (14528: I3 ^predict-no N1034)
  19620. <=WM: (14514: N1033 ^status complete)
  19621. <=WM: (14513: I3 ^predict-yes N1033)
  19622. --- Firing Productions (IE) For State At Depth 1 ---
  19623. --- Inner Elaboration Phase, active level 1 (S1) ---
  19624. Firing monitor*world
  19625. -->
  19626. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19627. --- Change Working Memory (IE) ---
  19628. --- END Application Phase ---
  19629. --- Output Phase ---
  19630. ENV: Agent did: predict-no for direction U in state State-B
  19631. In State-B moving U
  19632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19633. predict error 0
  19634. dir: dir isR
  19635. --- END Output Phase ---
  19636. --- Input Phase ---
  19637. =>WM: (14532: I2 ^dir R)
  19638. =>WM: (14531: I2 ^reward 1)
  19639. =>WM: (14530: I2 ^see 0)
  19640. =>WM: (14529: N1034 ^status complete)
  19641. <=WM: (14517: I2 ^dir U)
  19642. <=WM: (14516: I2 ^reward 1)
  19643. <=WM: (14515: I2 ^see 1)
  19644. =>WM: (14533: I2 ^level-1 R1-root)
  19645. <=WM: (14518: I2 ^level-1 R1-root)
  19646. --- END Input Phase ---
  19647. --- Proposal Phase ---
  19648. --- Inner Elaboration Phase, active level 1 (S1) ---
  19649. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  19650. -->
  19651. (S1 ^operator O2067 = 0.08783148430849691)
  19652. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  19653. -->
  19654. (S1 ^operator O2068 = 0.8730233005215732)
  19655. Firing prefer*rvt*predict-no*H0*4*H1
  19656. -->
  19657. Firing prefer*rvt*predict-yes*H0*3*H1
  19658. -->
  19659. Firing elaborate*copy-see-to-output-link
  19660. -->
  19661. (I3 ^see 0 +)
  19662. Firing elaborate*reward*based*on*reward
  19663. -->
  19664. (R1038 ^value 1 +)
  19665. (R1 ^reward R1038 +)
  19666. Firing propose*predict-yes
  19667. -->
  19668. (O2069 ^name predict-yes +)
  19669. (S1 ^operator O2069 +)
  19670. Firing propose*predict-no
  19671. -->
  19672. (O2070 ^name predict-no +)
  19673. (S1 ^operator O2070 +)
  19674. Firing rl*prefer*rvt*predict-no*H0*4
  19675. -->
  19676. (S1 ^operator O2068 = 0.1269767912163215)
  19677. Firing rl*prefer*rvt*predict-yes*H0*3
  19678. -->
  19679. (S1 ^operator O2067 = 0.3829425761169197)
  19680. Firing prefer*rvt*predict-yes*H0
  19681. -->
  19682. Firing prefer*rvt*predict-no*H0
  19683. -->
  19684. Firing elaborate*copy-dir-to-output-link
  19685. -->
  19686. (I3 ^dir R +)
  19687. inner elaboration loop at bottom goal.
  19688. Retracting elaborate*copy-see-to-output-link
  19689. -->
  19690. (I3 ^see 1 +)
  19691. Retracting propose*predict-no
  19692. -->
  19693. (O2068 ^name predict-no +)
  19694. (S1 ^operator O2068 +)
  19695. Retracting propose*predict-yes
  19696. -->
  19697. (O2067 ^name predict-yes +)
  19698. (S1 ^operator O2067 +)
  19699. Retracting elaborate*reward*based*on*reward
  19700. -->
  19701. (R1037 ^value 1 +)
  19702. (R1 ^reward R1037 +)
  19703. Retracting elaborate*copy-dir-to-output-link
  19704. -->
  19705. (I3 ^dir U +)
  19706. Retracting rl*prefer*rvt*predict-no*H0*6
  19707. -->
  19708. (S1 ^operator O2068 = 0.9999999999999999)
  19709. Retracting rl*prefer*rvt*predict-yes*H0*5
  19710. -->
  19711. (S1 ^operator O2067 = 0.)
  19712. =>WM: (14541: S1 ^operator O2070 +)
  19713. =>WM: (14540: S1 ^operator O2069 +)
  19714. =>WM: (14539: I3 ^dir R)
  19715. =>WM: (14538: O2070 ^name predict-no)
  19716. =>WM: (14537: O2069 ^name predict-yes)
  19717. =>WM: (14536: R1038 ^value 1)
  19718. =>WM: (14535: R1 ^reward R1038)
  19719. =>WM: (14534: I3 ^see 0)
  19720. <=WM: (14525: S1 ^operator O2067 +)
  19721. <=WM: (14526: S1 ^operator O2068 +)
  19722. <=WM: (14527: S1 ^operator O2068)
  19723. <=WM: (14524: I3 ^dir U)
  19724. <=WM: (14520: R1 ^reward R1037)
  19725. <=WM: (14519: I3 ^see 1)
  19726. <=WM: (14523: O2068 ^name predict-no)
  19727. <=WM: (14522: O2067 ^name predict-yes)
  19728. <=WM: (14521: R1037 ^value 1)
  19729. --- Inner Elaboration Phase, active level 1 (S1) ---
  19730. Firing prefer*rvt*predict-yes*H0
  19731. -->
  19732. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  19733. -->
  19734. (S1 ^operator O2069 = 0.08783148430849691)
  19735. Firing rl*prefer*rvt*predict-yes*H0*3
  19736. -->
  19737. (S1 ^operator O2069 = 0.3829425761169197)
  19738. Firing prefer*rvt*predict-yes*H0*3*H1
  19739. -->
  19740. Firing prefer*rvt*predict-no*H0
  19741. -->
  19742. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  19743. -->
  19744. (S1 ^operator O2070 = 0.8730233005215732)
  19745. Firing rl*prefer*rvt*predict-no*H0*4
  19746. -->
  19747. (S1 ^operator O2070 = 0.1269767912163215)
  19748. Firing prefer*rvt*predict-no*H0*4*H1
  19749. -->
  19750. inner elaboration loop at bottom goal.
  19751. Retracting rl*prefer*rvt*predict-no*H0*4
  19752. -->
  19753. (S1 ^operator O2068 = 0.1269767912163215)
  19754. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  19755. -->
  19756. (S1 ^operator O2068 = 0.8730233005215732)
  19757. Retracting rl*prefer*rvt*predict-yes*H0*3
  19758. -->
  19759. (S1 ^operator O2067 = 0.3829425761169197)
  19760. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  19761. -->
  19762. (S1 ^operator O2067 = 0.08783148430849691)
  19763. --- END Proposal Phase ---
  19764. --- Decision Phase ---
  19765. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19766. =>WM: (14542: S1 ^operator O2070)
  19767. 1035: O: O2070 (predict-no)
  19768. --- END Decision Phase ---
  19769. --- Application Phase ---
  19770. --- Firing Productions (PE) For State At Depth 1 ---
  19771. --- Inner Elaboration Phase, active level 1 (S1) ---
  19772. Firing apply*operator
  19773. -->
  19774. (I3 ^predict-no N1035 + :O )
  19775. Firing apply*operator*complete
  19776. -->
  19777. (I3 ^predict-no N1034 - :O )
  19778. inner elaboration loop at bottom goal.
  19779. --- Change Working Memory (PE) ---
  19780. =>WM: (14543: I3 ^predict-no N1035)
  19781. <=WM: (14529: N1034 ^status complete)
  19782. <=WM: (14528: I3 ^predict-no N1034)
  19783. --- Firing Productions (IE) For State At Depth 1 ---
  19784. --- Inner Elaboration Phase, active level 1 (S1) ---
  19785. Firing monitor*world
  19786. -->
  19787. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19788. --- Change Working Memory (IE) ---
  19789. --- END Application Phase ---
  19790. --- Output Phase ---
  19791. ENV: Agent did: predict-no for direction R in state State-B
  19792. In State-B moving R
  19793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19794. predict error 0
  19795. dir: dir isR
  19796. --- END Output Phase ---
  19797. |\---- Input Phase ---
  19798. =>WM: (14547: I2 ^dir R)
  19799. =>WM: (14546: I2 ^reward 1)
  19800. =>WM: (14545: I2 ^see 0)
  19801. =>WM: (14544: N1035 ^status complete)
  19802. <=WM: (14532: I2 ^dir R)
  19803. <=WM: (14531: I2 ^reward 1)
  19804. <=WM: (14530: I2 ^see 0)
  19805. =>WM: (14548: I2 ^level-1 R0-root)
  19806. <=WM: (14533: I2 ^level-1 R1-root)
  19807. --- END Input Phase ---
  19808. --- Proposal Phase ---
  19809. --- Inner Elaboration Phase, active level 1 (S1) ---
  19810. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  19811. -->
  19812. (S1 ^operator O2069 = 0.2696941111808541)
  19813. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  19814. -->
  19815. (S1 ^operator O2070 = 0.8730230210936206)
  19816. Firing prefer*rvt*predict-no*H0*4*H1
  19817. -->
  19818. Firing prefer*rvt*predict-yes*H0*3*H1
  19819. -->
  19820. Firing elaborate*copy-see-to-output-link
  19821. -->
  19822. (I3 ^see 0 +)
  19823. Firing elaborate*reward*based*on*reward
  19824. -->
  19825. (R1039 ^value 1 +)
  19826. (R1 ^reward R1039 +)
  19827. Firing propose*predict-yes
  19828. -->
  19829. (O2071 ^name predict-yes +)
  19830. (S1 ^operator O2071 +)
  19831. Firing propose*predict-no
  19832. -->
  19833. (O2072 ^name predict-no +)
  19834. (S1 ^operator O2072 +)
  19835. Firing rl*prefer*rvt*predict-no*H0*4
  19836. -->
  19837. (S1 ^operator O2070 = 0.1269767912163215)
  19838. Firing rl*prefer*rvt*predict-yes*H0*3
  19839. -->
  19840. (S1 ^operator O2069 = 0.3829425761169197)
  19841. Firing prefer*rvt*predict-yes*H0
  19842. -->
  19843. Firing prefer*rvt*predict-no*H0
  19844. -->
  19845. Firing elaborate*copy-dir-to-output-link
  19846. -->
  19847. (I3 ^dir R +)
  19848. inner elaboration loop at bottom goal.
  19849. Retracting elaborate*copy-see-to-output-link
  19850. -->
  19851. (I3 ^see 0 +)
  19852. Retracting propose*predict-no
  19853. -->
  19854. (O2070 ^name predict-no +)
  19855. (S1 ^operator O2070 +)
  19856. Retracting propose*predict-yes
  19857. -->
  19858. (O2069 ^name predict-yes +)
  19859. (S1 ^operator O2069 +)
  19860. Retracting elaborate*reward*based*on*reward
  19861. -->
  19862. (R1038 ^value 1 +)
  19863. (R1 ^reward R1038 +)
  19864. Retracting elaborate*copy-dir-to-output-link
  19865. -->
  19866. (I3 ^dir R +)
  19867. Retracting rl*prefer*rvt*predict-no*H0*4
  19868. -->
  19869. (S1 ^operator O2070 = 0.1269767912163215)
  19870. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  19871. -->
  19872. (S1 ^operator O2070 = 0.8730233005215732)
  19873. Retracting rl*prefer*rvt*predict-yes*H0*3
  19874. -->
  19875. (S1 ^operator O2069 = 0.3829425761169197)
  19876. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  19877. -->
  19878. (S1 ^operator O2069 = 0.08783148430849691)
  19879. =>WM: (14554: S1 ^operator O2072 +)
  19880. =>WM: (14553: S1 ^operator O2071 +)
  19881. =>WM: (14552: O2072 ^name predict-no)
  19882. =>WM: (14551: O2071 ^name predict-yes)
  19883. =>WM: (14550: R1039 ^value 1)
  19884. =>WM: (14549: R1 ^reward R1039)
  19885. <=WM: (14540: S1 ^operator O2069 +)
  19886. <=WM: (14541: S1 ^operator O2070 +)
  19887. <=WM: (14542: S1 ^operator O2070)
  19888. <=WM: (14535: R1 ^reward R1038)
  19889. <=WM: (14538: O2070 ^name predict-no)
  19890. <=WM: (14537: O2069 ^name predict-yes)
  19891. <=WM: (14536: R1038 ^value 1)
  19892. --- Inner Elaboration Phase, active level 1 (S1) ---
  19893. Firing prefer*rvt*predict-yes*H0
  19894. -->
  19895. Firing rl*prefer*rvt*predict-yes*H0*3
  19896. -->
  19897. (S1 ^operator O2071 = 0.3829425761169197)
  19898. Firing prefer*rvt*predict-yes*H0*3*H1
  19899. -->
  19900. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  19901. -->
  19902. (S1 ^operator O2071 = 0.2696941111808541)
  19903. Firing prefer*rvt*predict-no*H0
  19904. -->
  19905. Firing rl*prefer*rvt*predict-no*H0*4
  19906. -->
  19907. (S1 ^operator O2072 = 0.1269767912163215)
  19908. Firing prefer*rvt*predict-no*H0*4*H1
  19909. -->
  19910. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  19911. -->
  19912. (S1 ^operator O2072 = 0.8730230210936206)
  19913. inner elaboration loop at bottom goal.
  19914. Retracting rl*prefer*rvt*predict-no*H0*4
  19915. -->
  19916. (S1 ^operator O2070 = 0.1269767912163215)
  19917. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  19918. -->
  19919. (S1 ^operator O2070 = 0.8730230210936206)
  19920. Retracting rl*prefer*rvt*predict-yes*H0*3
  19921. -->
  19922. (S1 ^operator O2069 = 0.3829425761169197)
  19923. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  19924. -->
  19925. (S1 ^operator O2069 = 0.2696941111808541)
  19926. --- END Proposal Phase ---
  19927. --- Decision Phase ---
  19928. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.950549,0.0472649)
  19929. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  19930. =>WM: (14555: S1 ^operator O2072)
  19931. 1036: O: O2072 (predict-no)
  19932. --- END Decision Phase ---
  19933. --- Application Phase ---
  19934. --- Firing Productions (PE) For State At Depth 1 ---
  19935. --- Inner Elaboration Phase, active level 1 (S1) ---
  19936. Firing apply*operator
  19937. -->
  19938. (I3 ^predict-no N1036 + :O )
  19939. Firing apply*operator*complete
  19940. -->
  19941. (I3 ^predict-no N1035 - :O )
  19942. inner elaboration loop at bottom goal.
  19943. --- Change Working Memory (PE) ---
  19944. =>WM: (14556: I3 ^predict-no N1036)
  19945. <=WM: (14544: N1035 ^status complete)
  19946. <=WM: (14543: I3 ^predict-no N1035)
  19947. --- Firing Productions (IE) For State At Depth 1 ---
  19948. --- Inner Elaboration Phase, active level 1 (S1) ---
  19949. Firing monitor*world
  19950. -->
  19951. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19952. --- Change Working Memory (IE) ---
  19953. --- END Application Phase ---
  19954. --- Output Phase ---
  19955. ENV: Agent did: predict-no for direction R in state State-B
  19956. In State-B moving R
  19957. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19958. predict error 0
  19959. dir: dir isL
  19960. --- END Output Phase ---
  19961. /|\--- Input Phase ---
  19962. =>WM: (14560: I2 ^dir L)
  19963. =>WM: (14559: I2 ^reward 1)
  19964. =>WM: (14558: I2 ^see 0)
  19965. =>WM: (14557: N1036 ^status complete)
  19966. <=WM: (14547: I2 ^dir R)
  19967. <=WM: (14546: I2 ^reward 1)
  19968. <=WM: (14545: I2 ^see 0)
  19969. =>WM: (14561: I2 ^level-1 R0-root)
  19970. <=WM: (14548: I2 ^level-1 R0-root)
  19971. --- END Input Phase ---
  19972. --- Proposal Phase ---
  19973. --- Inner Elaboration Phase, active level 1 (S1) ---
  19974. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  19975. -->
  19976. (S1 ^operator O2071 = 0.4768816476246801)
  19977. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  19978. -->
  19979. (S1 ^operator O2072 = 0.1700769046561409)
  19980. Firing prefer*rvt*predict-no*H0*2*H1
  19981. -->
  19982. Firing prefer*rvt*predict-yes*H0*1*H1
  19983. -->
  19984. Firing elaborate*copy-see-to-output-link
  19985. -->
  19986. (I3 ^see 0 +)
  19987. Firing elaborate*reward*based*on*reward
  19988. -->
  19989. (R1040 ^value 1 +)
  19990. (R1 ^reward R1040 +)
  19991. Firing propose*predict-yes
  19992. -->
  19993. (O2073 ^name predict-yes +)
  19994. (S1 ^operator O2073 +)
  19995. Firing propose*predict-no
  19996. -->
  19997. (O2074 ^name predict-no +)
  19998. (S1 ^operator O2074 +)
  19999. Firing rl*prefer*rvt*predict-no*H0*2
  20000. -->
  20001. (S1 ^operator O2072 = 0.2550133912230119)
  20002. Firing rl*prefer*rvt*predict-yes*H0*1
  20003. -->
  20004. (S1 ^operator O2071 = 0.5231196142256334)
  20005. Firing prefer*rvt*predict-yes*H0
  20006. -->
  20007. Firing prefer*rvt*predict-no*H0
  20008. -->
  20009. Firing elaborate*copy-dir-to-output-link
  20010. -->
  20011. (I3 ^dir L +)
  20012. inner elaboration loop at bottom goal.
  20013. Retracting elaborate*copy-see-to-output-link
  20014. -->
  20015. (I3 ^see 0 +)
  20016. Retracting propose*predict-no
  20017. -->
  20018. (O2072 ^name predict-no +)
  20019. (S1 ^operator O2072 +)
  20020. Retracting propose*predict-yes
  20021. -->
  20022. (O2071 ^name predict-yes +)
  20023. (S1 ^operator O2071 +)
  20024. Retracting elaborate*reward*based*on*reward
  20025. -->
  20026. (R1039 ^value 1 +)
  20027. (R1 ^reward R1039 +)
  20028. Retracting elaborate*copy-dir-to-output-link
  20029. -->
  20030. (I3 ^dir R +)
  20031. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  20032. -->
  20033. (S1 ^operator O2072 = 0.8730230210936206)
  20034. Retracting rl*prefer*rvt*predict-no*H0*4
  20035. -->
  20036. (S1 ^operator O2072 = 0.1269767774556373)
  20037. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  20038. -->
  20039. (S1 ^operator O2071 = 0.2696941111808541)
  20040. Retracting rl*prefer*rvt*predict-yes*H0*3
  20041. -->
  20042. (S1 ^operator O2071 = 0.3829425761169197)
  20043. =>WM: (14568: S1 ^operator O2074 +)
  20044. =>WM: (14567: S1 ^operator O2073 +)
  20045. =>WM: (14566: I3 ^dir L)
  20046. =>WM: (14565: O2074 ^name predict-no)
  20047. =>WM: (14564: O2073 ^name predict-yes)
  20048. =>WM: (14563: R1040 ^value 1)
  20049. =>WM: (14562: R1 ^reward R1040)
  20050. <=WM: (14553: S1 ^operator O2071 +)
  20051. <=WM: (14554: S1 ^operator O2072 +)
  20052. <=WM: (14555: S1 ^operator O2072)
  20053. <=WM: (14539: I3 ^dir R)
  20054. <=WM: (14549: R1 ^reward R1039)
  20055. <=WM: (14552: O2072 ^name predict-no)
  20056. <=WM: (14551: O2071 ^name predict-yes)
  20057. <=WM: (14550: R1039 ^value 1)
  20058. --- Inner Elaboration Phase, active level 1 (S1) ---
  20059. Firing prefer*rvt*predict-yes*H0
  20060. -->
  20061. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  20062. -->
  20063. (S1 ^operator O2073 = 0.4768816476246801)
  20064. Firing rl*prefer*rvt*predict-yes*H0*1
  20065. -->
  20066. (S1 ^operator O2073 = 0.5231196142256334)
  20067. Firing prefer*rvt*predict-yes*H0*1*H1
  20068. -->
  20069. Firing prefer*rvt*predict-no*H0
  20070. -->
  20071. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  20072. -->
  20073. (S1 ^operator O2074 = 0.1700769046561409)
  20074. Firing rl*prefer*rvt*predict-no*H0*2
  20075. -->
  20076. (S1 ^operator O2074 = 0.2550133912230119)
  20077. Firing prefer*rvt*predict-no*H0*2*H1
  20078. -->
  20079. inner elaboration loop at bottom goal.
  20080. Retracting rl*prefer*rvt*predict-no*H0*2
  20081. -->
  20082. (S1 ^operator O2072 = 0.2550133912230119)
  20083. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  20084. -->
  20085. (S1 ^operator O2072 = 0.1700769046561409)
  20086. Retracting rl*prefer*rvt*predict-yes*H0*1
  20087. -->
  20088. (S1 ^operator O2071 = 0.5231196142256334)
  20089. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  20090. -->
  20091. (S1 ^operator O2071 = 0.4768816476246801)
  20092. --- END Proposal Phase ---
  20093. --- Decision Phase ---
  20094. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.95082,0.0470186)
  20095. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  20096. =>WM: (14569: S1 ^operator O2073)
  20097. 1037: O: O2073 (predict-yes)
  20098. --- END Decision Phase ---
  20099. --- Application Phase ---
  20100. --- Firing Productions (PE) For State At Depth 1 ---
  20101. --- Inner Elaboration Phase, active level 1 (S1) ---
  20102. Firing apply*operator
  20103. -->
  20104. (I3 ^predict-yes N1037 + :O )
  20105. Firing apply*operator*complete
  20106. -->
  20107. (I3 ^predict-no N1036 - :O )
  20108. inner elaboration loop at bottom goal.
  20109. --- Change Working Memory (PE) ---
  20110. =>WM: (14570: I3 ^predict-yes N1037)
  20111. <=WM: (14557: N1036 ^status complete)
  20112. <=WM: (14556: I3 ^predict-no N1036)
  20113. --- Firing Productions (IE) For State At Depth 1 ---
  20114. --- Inner Elaboration Phase, active level 1 (S1) ---
  20115. Firing monitor*world
  20116. -->
  20117. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20118. --- Change Working Memory (IE) ---
  20119. --- END Application Phase ---
  20120. --- Output Phase ---
  20121. ENV: Agent did: predict-yes for direction L in state State-B
  20122. In State-B moving L
  20123. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  20124. predict error 0
  20125. dir: dir isU
  20126. --- END Output Phase ---
  20127. -/--- Input Phase ---
  20128. =>WM: (14574: I2 ^dir U)
  20129. =>WM: (14573: I2 ^reward 1)
  20130. =>WM: (14572: I2 ^see 1)
  20131. =>WM: (14571: N1037 ^status complete)
  20132. <=WM: (14560: I2 ^dir L)
  20133. <=WM: (14559: I2 ^reward 1)
  20134. <=WM: (14558: I2 ^see 0)
  20135. =>WM: (14575: I2 ^level-1 L1-root)
  20136. <=WM: (14561: I2 ^level-1 R0-root)
  20137. --- END Input Phase ---
  20138. --- Proposal Phase ---
  20139. --- Inner Elaboration Phase, active level 1 (S1) ---
  20140. Firing elaborate*copy-see-to-output-link
  20141. -->
  20142. (I3 ^see 1 +)
  20143. Firing elaborate*reward*based*on*reward
  20144. -->
  20145. (R1041 ^value 1 +)
  20146. (R1 ^reward R1041 +)
  20147. Firing propose*predict-yes
  20148. -->
  20149. (O2075 ^name predict-yes +)
  20150. (S1 ^operator O2075 +)
  20151. Firing propose*predict-no
  20152. -->
  20153. (O2076 ^name predict-no +)
  20154. (S1 ^operator O2076 +)
  20155. Firing rl*prefer*rvt*predict-no*H0*6
  20156. -->
  20157. (S1 ^operator O2074 = 0.9999999999999999)
  20158. Firing rl*prefer*rvt*predict-yes*H0*5
  20159. -->
  20160. (S1 ^operator O2073 = 0.)
  20161. Firing prefer*rvt*predict-yes*H0
  20162. -->
  20163. Firing prefer*rvt*predict-no*H0
  20164. -->
  20165. Firing elaborate*copy-dir-to-output-link
  20166. -->
  20167. (I3 ^dir U +)
  20168. inner elaboration loop at bottom goal.
  20169. Retracting elaborate*copy-see-to-output-link
  20170. -->
  20171. (I3 ^see 0 +)
  20172. Retracting propose*predict-no
  20173. -->
  20174. (O2074 ^name predict-no +)
  20175. (S1 ^operator O2074 +)
  20176. Retracting propose*predict-yes
  20177. -->
  20178. (O2073 ^name predict-yes +)
  20179. (S1 ^operator O2073 +)
  20180. Retracting elaborate*reward*based*on*reward
  20181. -->
  20182. (R1040 ^value 1 +)
  20183. (R1 ^reward R1040 +)
  20184. Retracting elaborate*copy-dir-to-output-link
  20185. -->
  20186. (I3 ^dir L +)
  20187. Retracting rl*prefer*rvt*predict-no*H0*2
  20188. -->
  20189. (S1 ^operator O2074 = 0.2550133912230119)
  20190. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  20191. -->
  20192. (S1 ^operator O2074 = 0.1700769046561409)
  20193. Retracting rl*prefer*rvt*predict-yes*H0*1
  20194. -->
  20195. (S1 ^operator O2073 = 0.5231196142256334)
  20196. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  20197. -->
  20198. (S1 ^operator O2073 = 0.4768816476246801)
  20199. =>WM: (14583: S1 ^operator O2076 +)
  20200. =>WM: (14582: S1 ^operator O2075 +)
  20201. =>WM: (14581: I3 ^dir U)
  20202. =>WM: (14580: O2076 ^name predict-no)
  20203. =>WM: (14579: O2075 ^name predict-yes)
  20204. =>WM: (14578: R1041 ^value 1)
  20205. =>WM: (14577: R1 ^reward R1041)
  20206. =>WM: (14576: I3 ^see 1)
  20207. <=WM: (14567: S1 ^operator O2073 +)
  20208. <=WM: (14569: S1 ^operator O2073)
  20209. <=WM: (14568: S1 ^operator O2074 +)
  20210. <=WM: (14566: I3 ^dir L)
  20211. <=WM: (14562: R1 ^reward R1040)
  20212. <=WM: (14534: I3 ^see 0)
  20213. <=WM: (14565: O2074 ^name predict-no)
  20214. <=WM: (14564: O2073 ^name predict-yes)
  20215. <=WM: (14563: R1040 ^value 1)
  20216. --- Inner Elaboration Phase, active level 1 (S1) ---
  20217. Firing prefer*rvt*predict-yes*H0
  20218. -->
  20219. Firing rl*prefer*rvt*predict-yes*H0*5
  20220. -->
  20221. (S1 ^operator O2075 = 0.)
  20222. Firing prefer*rvt*predict-no*H0
  20223. -->
  20224. Firing rl*prefer*rvt*predict-no*H0*6
  20225. -->
  20226. (S1 ^operator O2076 = 0.9999999999999999)
  20227. inner elaboration loop at bottom goal.
  20228. Retracting rl*prefer*rvt*predict-no*H0*6
  20229. -->
  20230. (S1 ^operator O2074 = 0.9999999999999999)
  20231. Retracting rl*prefer*rvt*predict-yes*H0*5
  20232. -->
  20233. (S1 ^operator O2073 = 0.)
  20234. --- END Proposal Phase ---
  20235. --- Decision Phase ---
  20236. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.52312 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.979866,0.0198621)
  20237. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272042 0.204839 0.476882 -> 0.272042 0.204839 0.476881(R,m,v=1,1,0)
  20238. =>WM: (14584: S1 ^operator O2076)
  20239. 1038: O: O2076 (predict-no)
  20240. --- END Decision Phase ---
  20241. --- Application Phase ---
  20242. --- Firing Productions (PE) For State At Depth 1 ---
  20243. --- Inner Elaboration Phase, active level 1 (S1) ---
  20244. Firing apply*operator
  20245. -->
  20246. (I3 ^predict-no N1038 + :O )
  20247. Firing apply*operator*complete
  20248. -->
  20249. (I3 ^predict-yes N1037 - :O )
  20250. inner elaboration loop at bottom goal.
  20251. --- Change Working Memory (PE) ---
  20252. =>WM: (14585: I3 ^predict-no N1038)
  20253. <=WM: (14571: N1037 ^status complete)
  20254. <=WM: (14570: I3 ^predict-yes N1037)
  20255. --- Firing Productions (IE) For State At Depth 1 ---
  20256. --- Inner Elaboration Phase, active level 1 (S1) ---
  20257. Firing monitor*world
  20258. -->
  20259. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20260. --- Change Working Memory (IE) ---
  20261. --- END Application Phase ---
  20262. --- Output Phase ---
  20263. ENV: Agent did: predict-no for direction U in state State-A
  20264. In State-A moving U
  20265. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20266. predict error 0
  20267. dir: dir isR
  20268. --- END Output Phase ---
  20269. |\---- Input Phase ---
  20270. =>WM: (14589: I2 ^dir R)
  20271. =>WM: (14588: I2 ^reward 1)
  20272. =>WM: (14587: I2 ^see 0)
  20273. =>WM: (14586: N1038 ^status complete)
  20274. <=WM: (14574: I2 ^dir U)
  20275. <=WM: (14573: I2 ^reward 1)
  20276. <=WM: (14572: I2 ^see 1)
  20277. =>WM: (14590: I2 ^level-1 L1-root)
  20278. <=WM: (14575: I2 ^level-1 L1-root)
  20279. --- END Input Phase ---
  20280. --- Proposal Phase ---
  20281. --- Inner Elaboration Phase, active level 1 (S1) ---
  20282. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  20283. -->
  20284. (S1 ^operator O2075 = 0.6170465502571644)
  20285. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  20286. -->
  20287. (S1 ^operator O2076 = 0.4901349546100854)
  20288. Firing prefer*rvt*predict-no*H0*4*H1
  20289. -->
  20290. Firing prefer*rvt*predict-yes*H0*3*H1
  20291. -->
  20292. Firing elaborate*copy-see-to-output-link
  20293. -->
  20294. (I3 ^see 0 +)
  20295. Firing elaborate*reward*based*on*reward
  20296. -->
  20297. (R1042 ^value 1 +)
  20298. (R1 ^reward R1042 +)
  20299. Firing propose*predict-yes
  20300. -->
  20301. (O2077 ^name predict-yes +)
  20302. (S1 ^operator O2077 +)
  20303. Firing propose*predict-no
  20304. -->
  20305. (O2078 ^name predict-no +)
  20306. (S1 ^operator O2078 +)
  20307. Firing rl*prefer*rvt*predict-no*H0*4
  20308. -->
  20309. (S1 ^operator O2076 = 0.1269768076732486)
  20310. Firing rl*prefer*rvt*predict-yes*H0*3
  20311. -->
  20312. (S1 ^operator O2075 = 0.3829425761169197)
  20313. Firing prefer*rvt*predict-yes*H0
  20314. -->
  20315. Firing prefer*rvt*predict-no*H0
  20316. -->
  20317. Firing elaborate*copy-dir-to-output-link
  20318. -->
  20319. (I3 ^dir R +)
  20320. inner elaboration loop at bottom goal.
  20321. Retracting elaborate*copy-see-to-output-link
  20322. -->
  20323. (I3 ^see 1 +)
  20324. Retracting propose*predict-no
  20325. -->
  20326. (O2076 ^name predict-no +)
  20327. (S1 ^operator O2076 +)
  20328. Retracting propose*predict-yes
  20329. -->
  20330. (O2075 ^name predict-yes +)
  20331. (S1 ^operator O2075 +)
  20332. Retracting elaborate*reward*based*on*reward
  20333. -->
  20334. (R1041 ^value 1 +)
  20335. (R1 ^reward R1041 +)
  20336. Retracting elaborate*copy-dir-to-output-link
  20337. -->
  20338. (I3 ^dir U +)
  20339. Retracting rl*prefer*rvt*predict-no*H0*6
  20340. -->
  20341. (S1 ^operator O2076 = 0.9999999999999999)
  20342. Retracting rl*prefer*rvt*predict-yes*H0*5
  20343. -->
  20344. (S1 ^operator O2075 = 0.)
  20345. =>WM: (14598: S1 ^operator O2078 +)
  20346. =>WM: (14597: S1 ^operator O2077 +)
  20347. =>WM: (14596: I3 ^dir R)
  20348. =>WM: (14595: O2078 ^name predict-no)
  20349. =>WM: (14594: O2077 ^name predict-yes)
  20350. =>WM: (14593: R1042 ^value 1)
  20351. =>WM: (14592: R1 ^reward R1042)
  20352. =>WM: (14591: I3 ^see 0)
  20353. <=WM: (14582: S1 ^operator O2075 +)
  20354. <=WM: (14583: S1 ^operator O2076 +)
  20355. <=WM: (14584: S1 ^operator O2076)
  20356. <=WM: (14581: I3 ^dir U)
  20357. <=WM: (14577: R1 ^reward R1041)
  20358. <=WM: (14576: I3 ^see 1)
  20359. <=WM: (14580: O2076 ^name predict-no)
  20360. <=WM: (14579: O2075 ^name predict-yes)
  20361. <=WM: (14578: R1041 ^value 1)
  20362. --- Inner Elaboration Phase, active level 1 (S1) ---
  20363. Firing prefer*rvt*predict-yes*H0
  20364. -->
  20365. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  20366. -->
  20367. (S1 ^operator O2077 = 0.6170465502571644)
  20368. Firing rl*prefer*rvt*predict-yes*H0*3
  20369. -->
  20370. (S1 ^operator O2077 = 0.3829425761169197)
  20371. Firing prefer*rvt*predict-yes*H0*3*H1
  20372. -->
  20373. Firing prefer*rvt*predict-no*H0
  20374. -->
  20375. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  20376. -->
  20377. (S1 ^operator O2078 = 0.4901349546100854)
  20378. Firing rl*prefer*rvt*predict-no*H0*4
  20379. -->
  20380. (S1 ^operator O2078 = 0.1269768076732486)
  20381. Firing prefer*rvt*predict-no*H0*4*H1
  20382. -->
  20383. inner elaboration loop at bottom goal.
  20384. Retracting rl*prefer*rvt*predict-no*H0*4
  20385. -->
  20386. (S1 ^operator O2076 = 0.1269768076732486)
  20387. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  20388. -->
  20389. (S1 ^operator O2076 = 0.4901349546100854)
  20390. Retracting rl*prefer*rvt*predict-yes*H0*3
  20391. -->
  20392. (S1 ^operator O2075 = 0.3829425761169197)
  20393. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  20394. -->
  20395. (S1 ^operator O2075 = 0.6170465502571644)
  20396. --- END Proposal Phase ---
  20397. --- Decision Phase ---
  20398. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20399. =>WM: (14599: S1 ^operator O2077)
  20400. 1039: O: O2077 (predict-yes)
  20401. --- END Decision Phase ---
  20402. --- Application Phase ---
  20403. --- Firing Productions (PE) For State At Depth 1 ---
  20404. --- Inner Elaboration Phase, active level 1 (S1) ---
  20405. Firing apply*operator
  20406. -->
  20407. (I3 ^predict-yes N1039 + :O )
  20408. Firing apply*operator*complete
  20409. -->
  20410. (I3 ^predict-no N1038 - :O )
  20411. inner elaboration loop at bottom goal.
  20412. --- Change Working Memory (PE) ---
  20413. =>WM: (14600: I3 ^predict-yes N1039)
  20414. <=WM: (14586: N1038 ^status complete)
  20415. <=WM: (14585: I3 ^predict-no N1038)
  20416. --- Firing Productions (IE) For State At Depth 1 ---
  20417. --- Inner Elaboration Phase, active level 1 (S1) ---
  20418. Firing monitor*world
  20419. -->
  20420. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20421. --- Change Working Memory (IE) ---
  20422. --- END Application Phase ---
  20423. --- Output Phase ---
  20424. ENV: Agent did: predict-yes for direction R in state State-A
  20425. In State-A moving R
  20426. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20427. predict error 0
  20428. dir: dir isR
  20429. --- END Output Phase ---
  20430. /|\--- Input Phase ---
  20431. =>WM: (14604: I2 ^dir R)
  20432. =>WM: (14603: I2 ^reward 1)
  20433. =>WM: (14602: I2 ^see 1)
  20434. =>WM: (14601: N1039 ^status complete)
  20435. <=WM: (14589: I2 ^dir R)
  20436. <=WM: (14588: I2 ^reward 1)
  20437. <=WM: (14587: I2 ^see 0)
  20438. =>WM: (14605: I2 ^level-1 R1-root)
  20439. <=WM: (14590: I2 ^level-1 L1-root)
  20440. --- END Input Phase ---
  20441. --- Proposal Phase ---
  20442. --- Inner Elaboration Phase, active level 1 (S1) ---
  20443. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  20444. -->
  20445. (S1 ^operator O2077 = 0.08783148430849691)
  20446. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  20447. -->
  20448. (S1 ^operator O2078 = 0.8730232867608889)
  20449. Firing prefer*rvt*predict-no*H0*4*H1
  20450. -->
  20451. Firing prefer*rvt*predict-yes*H0*3*H1
  20452. -->
  20453. Firing elaborate*copy-see-to-output-link
  20454. -->
  20455. (I3 ^see 1 +)
  20456. Firing elaborate*reward*based*on*reward
  20457. -->
  20458. (R1043 ^value 1 +)
  20459. (R1 ^reward R1043 +)
  20460. Firing propose*predict-yes
  20461. -->
  20462. (O2079 ^name predict-yes +)
  20463. (S1 ^operator O2079 +)
  20464. Firing propose*predict-no
  20465. -->
  20466. (O2080 ^name predict-no +)
  20467. (S1 ^operator O2080 +)
  20468. Firing rl*prefer*rvt*predict-no*H0*4
  20469. -->
  20470. (S1 ^operator O2078 = 0.1269768076732486)
  20471. Firing rl*prefer*rvt*predict-yes*H0*3
  20472. -->
  20473. (S1 ^operator O2077 = 0.3829425761169197)
  20474. Firing prefer*rvt*predict-yes*H0
  20475. -->
  20476. Firing prefer*rvt*predict-no*H0
  20477. -->
  20478. Firing elaborate*copy-dir-to-output-link
  20479. -->
  20480. (I3 ^dir R +)
  20481. inner elaboration loop at bottom goal.
  20482. Retracting elaborate*copy-see-to-output-link
  20483. -->
  20484. (I3 ^see 0 +)
  20485. Retracting propose*predict-no
  20486. -->
  20487. (O2078 ^name predict-no +)
  20488. (S1 ^operator O2078 +)
  20489. Retracting propose*predict-yes
  20490. -->
  20491. (O2077 ^name predict-yes +)
  20492. (S1 ^operator O2077 +)
  20493. Retracting elaborate*reward*based*on*reward
  20494. -->
  20495. (R1042 ^value 1 +)
  20496. (R1 ^reward R1042 +)
  20497. Retracting elaborate*copy-dir-to-output-link
  20498. -->
  20499. (I3 ^dir R +)
  20500. Retracting rl*prefer*rvt*predict-no*H0*4
  20501. -->
  20502. (S1 ^operator O2078 = 0.1269768076732486)
  20503. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  20504. -->
  20505. (S1 ^operator O2078 = 0.4901349546100854)
  20506. Retracting rl*prefer*rvt*predict-yes*H0*3
  20507. -->
  20508. (S1 ^operator O2077 = 0.3829425761169197)
  20509. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  20510. -->
  20511. (S1 ^operator O2077 = 0.6170465502571644)
  20512. =>WM: (14612: S1 ^operator O2080 +)
  20513. =>WM: (14611: S1 ^operator O2079 +)
  20514. =>WM: (14610: O2080 ^name predict-no)
  20515. =>WM: (14609: O2079 ^name predict-yes)
  20516. =>WM: (14608: R1043 ^value 1)
  20517. =>WM: (14607: R1 ^reward R1043)
  20518. =>WM: (14606: I3 ^see 1)
  20519. <=WM: (14597: S1 ^operator O2077 +)
  20520. <=WM: (14599: S1 ^operator O2077)
  20521. <=WM: (14598: S1 ^operator O2078 +)
  20522. <=WM: (14592: R1 ^reward R1042)
  20523. <=WM: (14591: I3 ^see 0)
  20524. <=WM: (14595: O2078 ^name predict-no)
  20525. <=WM: (14594: O2077 ^name predict-yes)
  20526. <=WM: (14593: R1042 ^value 1)
  20527. --- Inner Elaboration Phase, active level 1 (S1) ---
  20528. Firing prefer*rvt*predict-yes*H0
  20529. -->
  20530. Firing rl*prefer*rvt*predict-yes*H0*3
  20531. -->
  20532. (S1 ^operator O2079 = 0.3829425761169197)
  20533. Firing prefer*rvt*predict-yes*H0*3*H1
  20534. -->
  20535. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  20536. -->
  20537. (S1 ^operator O2079 = 0.08783148430849691)
  20538. Firing prefer*rvt*predict-no*H0
  20539. -->
  20540. Firing rl*prefer*rvt*predict-no*H0*4
  20541. -->
  20542. (S1 ^operator O2080 = 0.1269768076732486)
  20543. Firing prefer*rvt*predict-no*H0*4*H1
  20544. -->
  20545. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  20546. -->
  20547. (S1 ^operator O2080 = 0.8730232867608889)
  20548. inner elaboration loop at bottom goal.
  20549. Retracting rl*prefer*rvt*predict-no*H0*4
  20550. -->
  20551. (S1 ^operator O2078 = 0.1269768076732486)
  20552. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  20553. -->
  20554. (S1 ^operator O2078 = 0.8730232867608889)
  20555. Retracting rl*prefer*rvt*predict-yes*H0*3
  20556. -->
  20557. (S1 ^operator O2077 = 0.3829425761169197)
  20558. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  20559. -->
  20560. (S1 ^operator O2077 = 0.08783148430849691)
  20561. --- END Proposal Phase ---
  20562. --- Decision Phase ---
  20563. RL update rl*prefer*rvt*predict-yes*H0*3 0.673136 -0.290193 0.382943 -> 0.673137 -0.290193 0.382944(R,m,v=1,0.9625,0.0363208)
  20564. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326855 0.290192 0.617047 -> 0.326856 0.290192 0.617048(R,m,v=1,1,0)
  20565. =>WM: (14613: S1 ^operator O2080)
  20566. 1040: O: O2080 (predict-no)
  20567. --- END Decision Phase ---
  20568. --- Application Phase ---
  20569. --- Firing Productions (PE) For State At Depth 1 ---
  20570. --- Inner Elaboration Phase, active level 1 (S1) ---
  20571. Firing apply*operator
  20572. -->
  20573. (I3 ^predict-no N1040 + :O )
  20574. Firing apply*operator*complete
  20575. -->
  20576. (I3 ^predict-yes N1039 - :O )
  20577. inner elaboration loop at bottom goal.
  20578. --- Change Working Memory (PE) ---
  20579. =>WM: (14614: I3 ^predict-no N1040)
  20580. <=WM: (14601: N1039 ^status complete)
  20581. <=WM: (14600: I3 ^predict-yes N1039)
  20582. --- Firing Productions (IE) For State At Depth 1 ---
  20583. --- Inner Elaboration Phase, active level 1 (S1) ---
  20584. Firing monitor*world
  20585. -->
  20586. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20587. --- Change Working Memory (IE) ---
  20588. --- END Application Phase ---
  20589. --- Output Phase ---
  20590. ENV: Agent did: predict-no for direction R in state State-B
  20591. In State-B moving R
  20592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20593. predict error 0
  20594. dir: dir isR
  20595. --- END Output Phase ---
  20596. -/--- Input Phase ---
  20597. =>WM: (14618: I2 ^dir R)
  20598. =>WM: (14617: I2 ^reward 1)
  20599. =>WM: (14616: I2 ^see 0)
  20600. =>WM: (14615: N1040 ^status complete)
  20601. <=WM: (14604: I2 ^dir R)
  20602. <=WM: (14603: I2 ^reward 1)
  20603. <=WM: (14602: I2 ^see 1)
  20604. =>WM: (14619: I2 ^level-1 R0-root)
  20605. <=WM: (14605: I2 ^level-1 R1-root)
  20606. --- END Input Phase ---
  20607. --- Proposal Phase ---
  20608. --- Inner Elaboration Phase, active level 1 (S1) ---
  20609. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  20610. -->
  20611. (S1 ^operator O2079 = 0.2696941111808541)
  20612. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  20613. -->
  20614. (S1 ^operator O2080 = 0.8730230513112319)
  20615. Firing prefer*rvt*predict-no*H0*4*H1
  20616. -->
  20617. Firing prefer*rvt*predict-yes*H0*3*H1
  20618. -->
  20619. Firing elaborate*copy-see-to-output-link
  20620. -->
  20621. (I3 ^see 0 +)
  20622. Firing elaborate*reward*based*on*reward
  20623. -->
  20624. (R1044 ^value 1 +)
  20625. (R1 ^reward R1044 +)
  20626. Firing propose*predict-yes
  20627. -->
  20628. (O2081 ^name predict-yes +)
  20629. (S1 ^operator O2081 +)
  20630. Firing propose*predict-no
  20631. -->
  20632. (O2082 ^name predict-no +)
  20633. (S1 ^operator O2082 +)
  20634. Firing rl*prefer*rvt*predict-no*H0*4
  20635. -->
  20636. (S1 ^operator O2080 = 0.1269768076732486)
  20637. Firing rl*prefer*rvt*predict-yes*H0*3
  20638. -->
  20639. (S1 ^operator O2079 = 0.3829442071608071)
  20640. Firing prefer*rvt*predict-yes*H0
  20641. -->
  20642. Firing prefer*rvt*predict-no*H0
  20643. -->
  20644. Firing elaborate*copy-dir-to-output-link
  20645. -->
  20646. (I3 ^dir R +)
  20647. inner elaboration loop at bottom goal.
  20648. Retracting elaborate*copy-see-to-output-link
  20649. -->
  20650. (I3 ^see 1 +)
  20651. Retracting propose*predict-no
  20652. -->
  20653. (O2080 ^name predict-no +)
  20654. (S1 ^operator O2080 +)
  20655. Retracting propose*predict-yes
  20656. -->
  20657. (O2079 ^name predict-yes +)
  20658. (S1 ^operator O2079 +)
  20659. Retracting elaborate*reward*based*on*reward
  20660. -->
  20661. (R1043 ^value 1 +)
  20662. (R1 ^reward R1043 +)
  20663. Retracting elaborate*copy-dir-to-output-link
  20664. -->
  20665. (I3 ^dir R +)
  20666. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  20667. -->
  20668. (S1 ^operator O2080 = 0.8730232867608889)
  20669. Retracting rl*prefer*rvt*predict-no*H0*4
  20670. -->
  20671. (S1 ^operator O2080 = 0.1269768076732486)
  20672. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  20673. -->
  20674. (S1 ^operator O2079 = 0.08783148430849691)
  20675. Retracting rl*prefer*rvt*predict-yes*H0*3
  20676. -->
  20677. (S1 ^operator O2079 = 0.3829442071608071)
  20678. =>WM: (14626: S1 ^operator O2082 +)
  20679. =>WM: (14625: S1 ^operator O2081 +)
  20680. =>WM: (14624: O2082 ^name predict-no)
  20681. =>WM: (14623: O2081 ^name predict-yes)
  20682. =>WM: (14622: R1044 ^value 1)
  20683. =>WM: (14621: R1 ^reward R1044)
  20684. =>WM: (14620: I3 ^see 0)
  20685. <=WM: (14611: S1 ^operator O2079 +)
  20686. <=WM: (14612: S1 ^operator O2080 +)
  20687. <=WM: (14613: S1 ^operator O2080)
  20688. <=WM: (14607: R1 ^reward R1043)
  20689. <=WM: (14606: I3 ^see 1)
  20690. <=WM: (14610: O2080 ^name predict-no)
  20691. <=WM: (14609: O2079 ^name predict-yes)
  20692. <=WM: (14608: R1043 ^value 1)
  20693. --- Inner Elaboration Phase, active level 1 (S1) ---
  20694. Firing prefer*rvt*predict-yes*H0
  20695. -->
  20696. Firing rl*prefer*rvt*predict-yes*H0*3
  20697. -->
  20698. (S1 ^operator O2081 = 0.3829442071608071)
  20699. Firing prefer*rvt*predict-yes*H0*3*H1
  20700. -->
  20701. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  20702. -->
  20703. (S1 ^operator O2081 = 0.2696941111808541)
  20704. Firing prefer*rvt*predict-no*H0
  20705. -->
  20706. Firing rl*prefer*rvt*predict-no*H0*4
  20707. -->
  20708. (S1 ^operator O2082 = 0.1269768076732486)
  20709. Firing prefer*rvt*predict-no*H0*4*H1
  20710. -->
  20711. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  20712. -->
  20713. (S1 ^operator O2082 = 0.8730230513112319)
  20714. inner elaboration loop at bottom goal.
  20715. Retracting rl*prefer*rvt*predict-no*H0*4
  20716. -->
  20717. (S1 ^operator O2080 = 0.1269768076732486)
  20718. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  20719. -->
  20720. (S1 ^operator O2080 = 0.8730230513112319)
  20721. Retracting rl*prefer*rvt*predict-yes*H0*3
  20722. -->
  20723. (S1 ^operator O2079 = 0.3829442071608071)
  20724. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  20725. -->
  20726. (S1 ^operator O2079 = 0.2696941111808541)
  20727. --- END Proposal Phase ---
  20728. --- Decision Phase ---
  20729. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.951087,0.0467748)
  20730. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  20731. =>WM: (14627: S1 ^operator O2082)
  20732. 1041: O: O2082 (predict-no)
  20733. --- END Decision Phase ---
  20734. --- Application Phase ---
  20735. --- Firing Productions (PE) For State At Depth 1 ---
  20736. --- Inner Elaboration Phase, active level 1 (S1) ---
  20737. Firing apply*operator
  20738. -->
  20739. (I3 ^predict-no N1041 + :O )
  20740. Firing apply*operator*complete
  20741. -->
  20742. (I3 ^predict-no N1040 - :O )
  20743. inner elaboration loop at bottom goal.
  20744. --- Change Working Memory (PE) ---
  20745. =>WM: (14628: I3 ^predict-no N1041)
  20746. <=WM: (14615: N1040 ^status complete)
  20747. <=WM: (14614: I3 ^predict-no N1040)
  20748. --- Firing Productions (IE) For State At Depth 1 ---
  20749. --- Inner Elaboration Phase, active level 1 (S1) ---
  20750. Firing monitor*world
  20751. -->
  20752. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20753. --- Change Working Memory (IE) ---
  20754. --- END Application Phase ---
  20755. --- Output Phase ---
  20756. ENV: Agent did: predict-no for direction R in state State-B
  20757. In State-B moving R
  20758. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20759. predict error 0
  20760. dir: dir isU
  20761. --- END Output Phase ---
  20762. |--- Input Phase ---
  20763. =>WM: (14632: I2 ^dir U)
  20764. =>WM: (14631: I2 ^reward 1)
  20765. =>WM: (14630: I2 ^see 0)
  20766. =>WM: (14629: N1041 ^status complete)
  20767. <=WM: (14618: I2 ^dir R)
  20768. <=WM: (14617: I2 ^reward 1)
  20769. <=WM: (14616: I2 ^see 0)
  20770. =>WM: (14633: I2 ^level-1 R0-root)
  20771. <=WM: (14619: I2 ^level-1 R0-root)
  20772. --- END Input Phase ---
  20773. --- Proposal Phase ---
  20774. --- Inner Elaboration Phase, active level 1 (S1) ---
  20775. Firing elaborate*copy-see-to-output-link
  20776. -->
  20777. (I3 ^see 0 +)
  20778. Firing elaborate*reward*based*on*reward
  20779. -->
  20780. (R1045 ^value 1 +)
  20781. (R1 ^reward R1045 +)
  20782. Firing propose*predict-yes
  20783. -->
  20784. (O2083 ^name predict-yes +)
  20785. (S1 ^operator O2083 +)
  20786. Firing propose*predict-no
  20787. -->
  20788. (O2084 ^name predict-no +)
  20789. (S1 ^operator O2084 +)
  20790. Firing rl*prefer*rvt*predict-no*H0*6
  20791. -->
  20792. (S1 ^operator O2082 = 0.9999999999999999)
  20793. Firing rl*prefer*rvt*predict-yes*H0*5
  20794. -->
  20795. (S1 ^operator O2081 = 0.)
  20796. Firing prefer*rvt*predict-yes*H0
  20797. -->
  20798. Firing prefer*rvt*predict-no*H0
  20799. -->
  20800. Firing elaborate*copy-dir-to-output-link
  20801. -->
  20802. (I3 ^dir U +)
  20803. inner elaboration loop at bottom goal.
  20804. Retracting elaborate*copy-see-to-output-link
  20805. -->
  20806. (I3 ^see 0 +)
  20807. Retracting propose*predict-no
  20808. -->
  20809. (O2082 ^name predict-no +)
  20810. (S1 ^operator O2082 +)
  20811. Retracting propose*predict-yes
  20812. -->
  20813. (O2081 ^name predict-yes +)
  20814. (S1 ^operator O2081 +)
  20815. Retracting elaborate*reward*based*on*reward
  20816. -->
  20817. (R1044 ^value 1 +)
  20818. (R1 ^reward R1044 +)
  20819. Retracting elaborate*copy-dir-to-output-link
  20820. -->
  20821. (I3 ^dir R +)
  20822. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  20823. -->
  20824. (S1 ^operator O2082 = 0.8730230513112319)
  20825. Retracting rl*prefer*rvt*predict-no*H0*4
  20826. -->
  20827. (S1 ^operator O2082 = 0.126976793508128)
  20828. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  20829. -->
  20830. (S1 ^operator O2081 = 0.2696941111808541)
  20831. Retracting rl*prefer*rvt*predict-yes*H0*3
  20832. -->
  20833. (S1 ^operator O2081 = 0.3829442071608071)
  20834. =>WM: (14640: S1 ^operator O2084 +)
  20835. =>WM: (14639: S1 ^operator O2083 +)
  20836. =>WM: (14638: I3 ^dir U)
  20837. =>WM: (14637: O2084 ^name predict-no)
  20838. =>WM: (14636: O2083 ^name predict-yes)
  20839. =>WM: (14635: R1045 ^value 1)
  20840. =>WM: (14634: R1 ^reward R1045)
  20841. <=WM: (14625: S1 ^operator O2081 +)
  20842. <=WM: (14626: S1 ^operator O2082 +)
  20843. <=WM: (14627: S1 ^operator O2082)
  20844. <=WM: (14596: I3 ^dir R)
  20845. <=WM: (14621: R1 ^reward R1044)
  20846. <=WM: (14624: O2082 ^name predict-no)
  20847. <=WM: (14623: O2081 ^name predict-yes)
  20848. <=WM: (14622: R1044 ^value 1)
  20849. --- Inner Elaboration Phase, active level 1 (S1) ---
  20850. Firing prefer*rvt*predict-yes*H0
  20851. -->
  20852. Firing rl*prefer*rvt*predict-yes*H0*5
  20853. -->
  20854. (S1 ^operator O2083 = 0.)
  20855. Firing prefer*rvt*predict-no*H0
  20856. -->
  20857. Firing rl*prefer*rvt*predict-no*H0*6
  20858. -->
  20859. (S1 ^operator O2084 = 0.9999999999999999)
  20860. inner elaboration loop at bottom goal.
  20861. Retracting rl*prefer*rvt*predict-no*H0*6
  20862. -->
  20863. (S1 ^operator O2082 = 0.9999999999999999)
  20864. Retracting rl*prefer*rvt*predict-yes*H0*5
  20865. -->
  20866. (S1 ^operator O2081 = 0.)
  20867. --- END Proposal Phase ---
  20868. --- Decision Phase ---
  20869. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.951351,0.0465335)
  20870. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  20871. =>WM: (14641: S1 ^operator O2084)
  20872. 1042: O: O2084 (predict-no)
  20873. --- END Decision Phase ---
  20874. --- Application Phase ---
  20875. --- Firing Productions (PE) For State At Depth 1 ---
  20876. --- Inner Elaboration Phase, active level 1 (S1) ---
  20877. Firing apply*operator
  20878. -->
  20879. (I3 ^predict-no N1042 + :O )
  20880. Firing apply*operator*complete
  20881. -->
  20882. (I3 ^predict-no N1041 - :O )
  20883. inner elaboration loop at bottom goal.
  20884. --- Change Working Memory (PE) ---
  20885. =>WM: (14642: I3 ^predict-no N1042)
  20886. <=WM: (14629: N1041 ^status complete)
  20887. <=WM: (14628: I3 ^predict-no N1041)
  20888. --- Firing Productions (IE) For State At Depth 1 ---
  20889. --- Inner Elaboration Phase, active level 1 (S1) ---
  20890. Firing monitor*world
  20891. -->
  20892. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20893. --- Change Working Memory (IE) ---
  20894. --- END Application Phase ---
  20895. --- Output Phase ---
  20896. ENV: Agent did: predict-no for direction U in state State-B
  20897. In State-B moving U
  20898. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20899. predict error 0
  20900. dir: dir isU
  20901. --- END Output Phase ---
  20902. \---- Input Phase ---
  20903. =>WM: (14646: I2 ^dir U)
  20904. =>WM: (14645: I2 ^reward 1)
  20905. =>WM: (14644: I2 ^see 0)
  20906. =>WM: (14643: N1042 ^status complete)
  20907. <=WM: (14632: I2 ^dir U)
  20908. <=WM: (14631: I2 ^reward 1)
  20909. <=WM: (14630: I2 ^see 0)
  20910. =>WM: (14647: I2 ^level-1 R0-root)
  20911. <=WM: (14633: I2 ^level-1 R0-root)
  20912. --- END Input Phase ---
  20913. --- Proposal Phase ---
  20914. --- Inner Elaboration Phase, active level 1 (S1) ---
  20915. Firing elaborate*copy-see-to-output-link
  20916. -->
  20917. (I3 ^see 0 +)
  20918. Firing elaborate*reward*based*on*reward
  20919. -->
  20920. (R1046 ^value 1 +)
  20921. (R1 ^reward R1046 +)
  20922. Firing propose*predict-yes
  20923. -->
  20924. (O2085 ^name predict-yes +)
  20925. (S1 ^operator O2085 +)
  20926. Firing propose*predict-no
  20927. -->
  20928. (O2086 ^name predict-no +)
  20929. (S1 ^operator O2086 +)
  20930. Firing rl*prefer*rvt*predict-no*H0*6
  20931. -->
  20932. (S1 ^operator O2084 = 0.9999999999999999)
  20933. Firing rl*prefer*rvt*predict-yes*H0*5
  20934. -->
  20935. (S1 ^operator O2083 = 0.)
  20936. Firing prefer*rvt*predict-yes*H0
  20937. -->
  20938. Firing prefer*rvt*predict-no*H0
  20939. -->
  20940. Firing elaborate*copy-dir-to-output-link
  20941. -->
  20942. (I3 ^dir U +)
  20943. inner elaboration loop at bottom goal.
  20944. Retracting elaborate*copy-see-to-output-link
  20945. -->
  20946. (I3 ^see 0 +)
  20947. Retracting propose*predict-no
  20948. -->
  20949. (O2084 ^name predict-no +)
  20950. (S1 ^operator O2084 +)
  20951. Retracting propose*predict-yes
  20952. -->
  20953. (O2083 ^name predict-yes +)
  20954. (S1 ^operator O2083 +)
  20955. Retracting elaborate*reward*based*on*reward
  20956. -->
  20957. (R1045 ^value 1 +)
  20958. (R1 ^reward R1045 +)
  20959. Retracting elaborate*copy-dir-to-output-link
  20960. -->
  20961. (I3 ^dir U +)
  20962. Retracting rl*prefer*rvt*predict-no*H0*6
  20963. -->
  20964. (S1 ^operator O2084 = 0.9999999999999999)
  20965. Retracting rl*prefer*rvt*predict-yes*H0*5
  20966. -->
  20967. (S1 ^operator O2083 = 0.)
  20968. =>WM: (14653: S1 ^operator O2086 +)
  20969. =>WM: (14652: S1 ^operator O2085 +)
  20970. =>WM: (14651: O2086 ^name predict-no)
  20971. =>WM: (14650: O2085 ^name predict-yes)
  20972. =>WM: (14649: R1046 ^value 1)
  20973. =>WM: (14648: R1 ^reward R1046)
  20974. <=WM: (14639: S1 ^operator O2083 +)
  20975. <=WM: (14640: S1 ^operator O2084 +)
  20976. <=WM: (14641: S1 ^operator O2084)
  20977. <=WM: (14634: R1 ^reward R1045)
  20978. <=WM: (14637: O2084 ^name predict-no)
  20979. <=WM: (14636: O2083 ^name predict-yes)
  20980. <=WM: (14635: R1045 ^value 1)
  20981. --- Inner Elaboration Phase, active level 1 (S1) ---
  20982. Firing prefer*rvt*predict-yes*H0
  20983. -->
  20984. Firing rl*prefer*rvt*predict-yes*H0*5
  20985. -->
  20986. (S1 ^operator O2085 = 0.)
  20987. Firing prefer*rvt*predict-no*H0
  20988. -->
  20989. Firing rl*prefer*rvt*predict-no*H0*6
  20990. -->
  20991. (S1 ^operator O2086 = 0.9999999999999999)
  20992. inner elaboration loop at bottom goal.
  20993. Retracting rl*prefer*rvt*predict-no*H0*6
  20994. -->
  20995. (S1 ^operator O2084 = 0.9999999999999999)
  20996. Retracting rl*prefer*rvt*predict-yes*H0*5
  20997. -->
  20998. (S1 ^operator O2083 = 0.)
  20999. --- END Proposal Phase ---
  21000. --- Decision Phase ---
  21001. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21002. =>WM: (14654: S1 ^operator O2086)
  21003. 1043: O: O2086 (predict-no)
  21004. --- END Decision Phase ---
  21005. --- Application Phase ---
  21006. --- Firing Productions (PE) For State At Depth 1 ---
  21007. --- Inner Elaboration Phase, active level 1 (S1) ---
  21008. Firing apply*operator
  21009. -->
  21010. (I3 ^predict-no N1043 + :O )
  21011. Firing apply*operator*complete
  21012. -->
  21013. (I3 ^predict-no N1042 - :O )
  21014. inner elaboration loop at bottom goal.
  21015. --- Change Working Memory (PE) ---
  21016. =>WM: (14655: I3 ^predict-no N1043)
  21017. <=WM: (14643: N1042 ^status complete)
  21018. <=WM: (14642: I3 ^predict-no N1042)
  21019. --- Firing Productions (IE) For State At Depth 1 ---
  21020. --- Inner Elaboration Phase, active level 1 (S1) ---
  21021. Firing monitor*world
  21022. -->
  21023. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21024. --- Change Working Memory (IE) ---
  21025. --- END Application Phase ---
  21026. --- Output Phase ---
  21027. ENV: Agent did: predict-no for direction U in state State-B
  21028. In State-B moving U
  21029. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21030. predict error 0
  21031. dir: dir isR
  21032. --- END Output Phase ---
  21033. /|--- Input Phase ---
  21034. =>WM: (14659: I2 ^dir R)
  21035. =>WM: (14658: I2 ^reward 1)
  21036. =>WM: (14657: I2 ^see 0)
  21037. =>WM: (14656: N1043 ^status complete)
  21038. <=WM: (14646: I2 ^dir U)
  21039. <=WM: (14645: I2 ^reward 1)
  21040. <=WM: (14644: I2 ^see 0)
  21041. =>WM: (14660: I2 ^level-1 R0-root)
  21042. <=WM: (14647: I2 ^level-1 R0-root)
  21043. --- END Input Phase ---
  21044. --- Proposal Phase ---
  21045. --- Inner Elaboration Phase, active level 1 (S1) ---
  21046. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21047. -->
  21048. (S1 ^operator O2085 = 0.2696941111808541)
  21049. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21050. -->
  21051. (S1 ^operator O2086 = 0.873023074588328)
  21052. Firing prefer*rvt*predict-no*H0*4*H1
  21053. -->
  21054. Firing prefer*rvt*predict-yes*H0*3*H1
  21055. -->
  21056. Firing elaborate*copy-see-to-output-link
  21057. -->
  21058. (I3 ^see 0 +)
  21059. Firing elaborate*reward*based*on*reward
  21060. -->
  21061. (R1047 ^value 1 +)
  21062. (R1 ^reward R1047 +)
  21063. Firing propose*predict-yes
  21064. -->
  21065. (O2087 ^name predict-yes +)
  21066. (S1 ^operator O2087 +)
  21067. Firing propose*predict-no
  21068. -->
  21069. (O2088 ^name predict-no +)
  21070. (S1 ^operator O2088 +)
  21071. Firing rl*prefer*rvt*predict-no*H0*4
  21072. -->
  21073. (S1 ^operator O2086 = 0.126976816785224)
  21074. Firing rl*prefer*rvt*predict-yes*H0*3
  21075. -->
  21076. (S1 ^operator O2085 = 0.3829442071608071)
  21077. Firing prefer*rvt*predict-yes*H0
  21078. -->
  21079. Firing prefer*rvt*predict-no*H0
  21080. -->
  21081. Firing elaborate*copy-dir-to-output-link
  21082. -->
  21083. (I3 ^dir R +)
  21084. inner elaboration loop at bottom goal.
  21085. Retracting elaborate*copy-see-to-output-link
  21086. -->
  21087. (I3 ^see 0 +)
  21088. Retracting propose*predict-no
  21089. -->
  21090. (O2086 ^name predict-no +)
  21091. (S1 ^operator O2086 +)
  21092. Retracting propose*predict-yes
  21093. -->
  21094. (O2085 ^name predict-yes +)
  21095. (S1 ^operator O2085 +)
  21096. Retracting elaborate*reward*based*on*reward
  21097. -->
  21098. (R1046 ^value 1 +)
  21099. (R1 ^reward R1046 +)
  21100. Retracting elaborate*copy-dir-to-output-link
  21101. -->
  21102. (I3 ^dir U +)
  21103. Retracting rl*prefer*rvt*predict-no*H0*6
  21104. -->
  21105. (S1 ^operator O2086 = 0.9999999999999999)
  21106. Retracting rl*prefer*rvt*predict-yes*H0*5
  21107. -->
  21108. (S1 ^operator O2085 = 0.)
  21109. =>WM: (14667: S1 ^operator O2088 +)
  21110. =>WM: (14666: S1 ^operator O2087 +)
  21111. =>WM: (14665: I3 ^dir R)
  21112. =>WM: (14664: O2088 ^name predict-no)
  21113. =>WM: (14663: O2087 ^name predict-yes)
  21114. =>WM: (14662: R1047 ^value 1)
  21115. =>WM: (14661: R1 ^reward R1047)
  21116. <=WM: (14652: S1 ^operator O2085 +)
  21117. <=WM: (14653: S1 ^operator O2086 +)
  21118. <=WM: (14654: S1 ^operator O2086)
  21119. <=WM: (14638: I3 ^dir U)
  21120. <=WM: (14648: R1 ^reward R1046)
  21121. <=WM: (14651: O2086 ^name predict-no)
  21122. <=WM: (14650: O2085 ^name predict-yes)
  21123. <=WM: (14649: R1046 ^value 1)
  21124. --- Inner Elaboration Phase, active level 1 (S1) ---
  21125. Firing prefer*rvt*predict-yes*H0
  21126. -->
  21127. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21128. -->
  21129. (S1 ^operator O2087 = 0.2696941111808541)
  21130. Firing rl*prefer*rvt*predict-yes*H0*3
  21131. -->
  21132. (S1 ^operator O2087 = 0.3829442071608071)
  21133. Firing prefer*rvt*predict-yes*H0*3*H1
  21134. -->
  21135. Firing prefer*rvt*predict-no*H0
  21136. -->
  21137. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21138. -->
  21139. (S1 ^operator O2088 = 0.873023074588328)
  21140. Firing rl*prefer*rvt*predict-no*H0*4
  21141. -->
  21142. (S1 ^operator O2088 = 0.126976816785224)
  21143. Firing prefer*rvt*predict-no*H0*4*H1
  21144. -->
  21145. inner elaboration loop at bottom goal.
  21146. Retracting rl*prefer*rvt*predict-no*H0*4
  21147. -->
  21148. (S1 ^operator O2086 = 0.126976816785224)
  21149. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21150. -->
  21151. (S1 ^operator O2086 = 0.873023074588328)
  21152. Retracting rl*prefer*rvt*predict-yes*H0*3
  21153. -->
  21154. (S1 ^operator O2085 = 0.3829442071608071)
  21155. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21156. -->
  21157. (S1 ^operator O2085 = 0.2696941111808541)
  21158. --- END Proposal Phase ---
  21159. --- Decision Phase ---
  21160. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21161. =>WM: (14668: S1 ^operator O2088)
  21162. 1044: O: O2088 (predict-no)
  21163. --- END Decision Phase ---
  21164. --- Application Phase ---
  21165. --- Firing Productions (PE) For State At Depth 1 ---
  21166. --- Inner Elaboration Phase, active level 1 (S1) ---
  21167. Firing apply*operator
  21168. -->
  21169. (I3 ^predict-no N1044 + :O )
  21170. Firing apply*operator*complete
  21171. -->
  21172. (I3 ^predict-no N1043 - :O )
  21173. inner elaboration loop at bottom goal.
  21174. --- Change Working Memory (PE) ---
  21175. =>WM: (14669: I3 ^predict-no N1044)
  21176. <=WM: (14656: N1043 ^status complete)
  21177. <=WM: (14655: I3 ^predict-no N1043)
  21178. --- Firing Productions (IE) For State At Depth 1 ---
  21179. --- Inner Elaboration Phase, active level 1 (S1) ---
  21180. Firing monitor*world
  21181. -->
  21182. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21183. --- Change Working Memory (IE) ---
  21184. --- END Application Phase ---
  21185. --- Output Phase ---
  21186. ENV: Agent did: predict-no for direction R in state State-B
  21187. In State-B moving R
  21188. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21189. predict error 0
  21190. dir: dir isR
  21191. --- END Output Phase ---
  21192. \-/--- Input Phase ---
  21193. =>WM: (14673: I2 ^dir R)
  21194. =>WM: (14672: I2 ^reward 1)
  21195. =>WM: (14671: I2 ^see 0)
  21196. =>WM: (14670: N1044 ^status complete)
  21197. <=WM: (14659: I2 ^dir R)
  21198. <=WM: (14658: I2 ^reward 1)
  21199. <=WM: (14657: I2 ^see 0)
  21200. =>WM: (14674: I2 ^level-1 R0-root)
  21201. <=WM: (14660: I2 ^level-1 R0-root)
  21202. --- END Input Phase ---
  21203. --- Proposal Phase ---
  21204. --- Inner Elaboration Phase, active level 1 (S1) ---
  21205. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21206. -->
  21207. (S1 ^operator O2087 = 0.2696941111808541)
  21208. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21209. -->
  21210. (S1 ^operator O2088 = 0.873023074588328)
  21211. Firing prefer*rvt*predict-no*H0*4*H1
  21212. -->
  21213. Firing prefer*rvt*predict-yes*H0*3*H1
  21214. -->
  21215. Firing elaborate*copy-see-to-output-link
  21216. -->
  21217. (I3 ^see 0 +)
  21218. Firing elaborate*reward*based*on*reward
  21219. -->
  21220. (R1048 ^value 1 +)
  21221. (R1 ^reward R1048 +)
  21222. Firing propose*predict-yes
  21223. -->
  21224. (O2089 ^name predict-yes +)
  21225. (S1 ^operator O2089 +)
  21226. Firing propose*predict-no
  21227. -->
  21228. (O2090 ^name predict-no +)
  21229. (S1 ^operator O2090 +)
  21230. Firing rl*prefer*rvt*predict-no*H0*4
  21231. -->
  21232. (S1 ^operator O2088 = 0.126976816785224)
  21233. Firing rl*prefer*rvt*predict-yes*H0*3
  21234. -->
  21235. (S1 ^operator O2087 = 0.3829442071608071)
  21236. Firing prefer*rvt*predict-yes*H0
  21237. -->
  21238. Firing prefer*rvt*predict-no*H0
  21239. -->
  21240. Firing elaborate*copy-dir-to-output-link
  21241. -->
  21242. (I3 ^dir R +)
  21243. inner elaboration loop at bottom goal.
  21244. Retracting elaborate*copy-see-to-output-link
  21245. -->
  21246. (I3 ^see 0 +)
  21247. Retracting propose*predict-no
  21248. -->
  21249. (O2088 ^name predict-no +)
  21250. (S1 ^operator O2088 +)
  21251. Retracting propose*predict-yes
  21252. -->
  21253. (O2087 ^name predict-yes +)
  21254. (S1 ^operator O2087 +)
  21255. Retracting elaborate*reward*based*on*reward
  21256. -->
  21257. (R1047 ^value 1 +)
  21258. (R1 ^reward R1047 +)
  21259. Retracting elaborate*copy-dir-to-output-link
  21260. -->
  21261. (I3 ^dir R +)
  21262. Retracting rl*prefer*rvt*predict-no*H0*4
  21263. -->
  21264. (S1 ^operator O2088 = 0.126976816785224)
  21265. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21266. -->
  21267. (S1 ^operator O2088 = 0.873023074588328)
  21268. Retracting rl*prefer*rvt*predict-yes*H0*3
  21269. -->
  21270. (S1 ^operator O2087 = 0.3829442071608071)
  21271. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21272. -->
  21273. (S1 ^operator O2087 = 0.2696941111808541)
  21274. =>WM: (14680: S1 ^operator O2090 +)
  21275. =>WM: (14679: S1 ^operator O2089 +)
  21276. =>WM: (14678: O2090 ^name predict-no)
  21277. =>WM: (14677: O2089 ^name predict-yes)
  21278. =>WM: (14676: R1048 ^value 1)
  21279. =>WM: (14675: R1 ^reward R1048)
  21280. <=WM: (14666: S1 ^operator O2087 +)
  21281. <=WM: (14667: S1 ^operator O2088 +)
  21282. <=WM: (14668: S1 ^operator O2088)
  21283. <=WM: (14661: R1 ^reward R1047)
  21284. <=WM: (14664: O2088 ^name predict-no)
  21285. <=WM: (14663: O2087 ^name predict-yes)
  21286. <=WM: (14662: R1047 ^value 1)
  21287. --- Inner Elaboration Phase, active level 1 (S1) ---
  21288. Firing prefer*rvt*predict-yes*H0
  21289. -->
  21290. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21291. -->
  21292. (S1 ^operator O2089 = 0.2696941111808541)
  21293. Firing rl*prefer*rvt*predict-yes*H0*3
  21294. -->
  21295. (S1 ^operator O2089 = 0.3829442071608071)
  21296. Firing prefer*rvt*predict-yes*H0*3*H1
  21297. -->
  21298. Firing prefer*rvt*predict-no*H0
  21299. -->
  21300. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21301. -->
  21302. (S1 ^operator O2090 = 0.873023074588328)
  21303. Firing rl*prefer*rvt*predict-no*H0*4
  21304. -->
  21305. (S1 ^operator O2090 = 0.126976816785224)
  21306. Firing prefer*rvt*predict-no*H0*4*H1
  21307. -->
  21308. inner elaboration loop at bottom goal.
  21309. Retracting rl*prefer*rvt*predict-no*H0*4
  21310. -->
  21311. (S1 ^operator O2088 = 0.126976816785224)
  21312. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21313. -->
  21314. (S1 ^operator O2088 = 0.873023074588328)
  21315. Retracting rl*prefer*rvt*predict-yes*H0*3
  21316. -->
  21317. (S1 ^operator O2087 = 0.3829442071608071)
  21318. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21319. -->
  21320. (S1 ^operator O2087 = 0.2696941111808541)
  21321. --- END Proposal Phase ---
  21322. --- Decision Phase ---
  21323. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.951613,0.0462947)
  21324. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  21325. =>WM: (14681: S1 ^operator O2090)
  21326. 1045: O: O2090 (predict-no)
  21327. --- END Decision Phase ---
  21328. --- Application Phase ---
  21329. --- Firing Productions (PE) For State At Depth 1 ---
  21330. --- Inner Elaboration Phase, active level 1 (S1) ---
  21331. Firing apply*operator
  21332. -->
  21333. (I3 ^predict-no N1045 + :O )
  21334. Firing apply*operator*complete
  21335. -->
  21336. (I3 ^predict-no N1044 - :O )
  21337. inner elaboration loop at bottom goal.
  21338. --- Change Working Memory (PE) ---
  21339. =>WM: (14682: I3 ^predict-no N1045)
  21340. <=WM: (14670: N1044 ^status complete)
  21341. <=WM: (14669: I3 ^predict-no N1044)
  21342. --- Firing Productions (IE) For State At Depth 1 ---
  21343. --- Inner Elaboration Phase, active level 1 (S1) ---
  21344. Firing monitor*world
  21345. -->
  21346. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21347. --- Change Working Memory (IE) ---
  21348. --- END Application Phase ---
  21349. --- Output Phase ---
  21350. ENV: Agent did: predict-no for direction R in state State-B
  21351. In State-B moving R
  21352. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21353. predict error 0
  21354. dir: dir isR
  21355. --- END Output Phase ---
  21356. |\---- Input Phase ---
  21357. =>WM: (14686: I2 ^dir R)
  21358. =>WM: (14685: I2 ^reward 1)
  21359. =>WM: (14684: I2 ^see 0)
  21360. =>WM: (14683: N1045 ^status complete)
  21361. <=WM: (14673: I2 ^dir R)
  21362. <=WM: (14672: I2 ^reward 1)
  21363. <=WM: (14671: I2 ^see 0)
  21364. =>WM: (14687: I2 ^level-1 R0-root)
  21365. <=WM: (14674: I2 ^level-1 R0-root)
  21366. --- END Input Phase ---
  21367. --- Proposal Phase ---
  21368. --- Inner Elaboration Phase, active level 1 (S1) ---
  21369. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21370. -->
  21371. (S1 ^operator O2089 = 0.2696941111808541)
  21372. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21373. -->
  21374. (S1 ^operator O2090 = 0.8730230908822952)
  21375. Firing prefer*rvt*predict-no*H0*4*H1
  21376. -->
  21377. Firing prefer*rvt*predict-yes*H0*3*H1
  21378. -->
  21379. Firing elaborate*copy-see-to-output-link
  21380. -->
  21381. (I3 ^see 0 +)
  21382. Firing elaborate*reward*based*on*reward
  21383. -->
  21384. (R1049 ^value 1 +)
  21385. (R1 ^reward R1049 +)
  21386. Firing propose*predict-yes
  21387. -->
  21388. (O2091 ^name predict-yes +)
  21389. (S1 ^operator O2091 +)
  21390. Firing propose*predict-no
  21391. -->
  21392. (O2092 ^name predict-no +)
  21393. (S1 ^operator O2092 +)
  21394. Firing rl*prefer*rvt*predict-no*H0*4
  21395. -->
  21396. (S1 ^operator O2090 = 0.1269768330791913)
  21397. Firing rl*prefer*rvt*predict-yes*H0*3
  21398. -->
  21399. (S1 ^operator O2089 = 0.3829442071608071)
  21400. Firing prefer*rvt*predict-yes*H0
  21401. -->
  21402. Firing prefer*rvt*predict-no*H0
  21403. -->
  21404. Firing elaborate*copy-dir-to-output-link
  21405. -->
  21406. (I3 ^dir R +)
  21407. inner elaboration loop at bottom goal.
  21408. Retracting elaborate*copy-see-to-output-link
  21409. -->
  21410. (I3 ^see 0 +)
  21411. Retracting propose*predict-no
  21412. -->
  21413. (O2090 ^name predict-no +)
  21414. (S1 ^operator O2090 +)
  21415. Retracting propose*predict-yes
  21416. -->
  21417. (O2089 ^name predict-yes +)
  21418. (S1 ^operator O2089 +)
  21419. Retracting elaborate*reward*based*on*reward
  21420. -->
  21421. (R1048 ^value 1 +)
  21422. (R1 ^reward R1048 +)
  21423. Retracting elaborate*copy-dir-to-output-link
  21424. -->
  21425. (I3 ^dir R +)
  21426. Retracting rl*prefer*rvt*predict-no*H0*4
  21427. -->
  21428. (S1 ^operator O2090 = 0.1269768330791913)
  21429. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21430. -->
  21431. (S1 ^operator O2090 = 0.8730230908822952)
  21432. Retracting rl*prefer*rvt*predict-yes*H0*3
  21433. -->
  21434. (S1 ^operator O2089 = 0.3829442071608071)
  21435. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21436. -->
  21437. (S1 ^operator O2089 = 0.2696941111808541)
  21438. =>WM: (14693: S1 ^operator O2092 +)
  21439. =>WM: (14692: S1 ^operator O2091 +)
  21440. =>WM: (14691: O2092 ^name predict-no)
  21441. =>WM: (14690: O2091 ^name predict-yes)
  21442. =>WM: (14689: R1049 ^value 1)
  21443. =>WM: (14688: R1 ^reward R1049)
  21444. <=WM: (14679: S1 ^operator O2089 +)
  21445. <=WM: (14680: S1 ^operator O2090 +)
  21446. <=WM: (14681: S1 ^operator O2090)
  21447. <=WM: (14675: R1 ^reward R1048)
  21448. <=WM: (14678: O2090 ^name predict-no)
  21449. <=WM: (14677: O2089 ^name predict-yes)
  21450. <=WM: (14676: R1048 ^value 1)
  21451. --- Inner Elaboration Phase, active level 1 (S1) ---
  21452. Firing prefer*rvt*predict-yes*H0
  21453. -->
  21454. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  21455. -->
  21456. (S1 ^operator O2091 = 0.2696941111808541)
  21457. Firing rl*prefer*rvt*predict-yes*H0*3
  21458. -->
  21459. (S1 ^operator O2091 = 0.3829442071608071)
  21460. Firing prefer*rvt*predict-yes*H0*3*H1
  21461. -->
  21462. Firing prefer*rvt*predict-no*H0
  21463. -->
  21464. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  21465. -->
  21466. (S1 ^operator O2092 = 0.8730230908822952)
  21467. Firing rl*prefer*rvt*predict-no*H0*4
  21468. -->
  21469. (S1 ^operator O2092 = 0.1269768330791913)
  21470. Firing prefer*rvt*predict-no*H0*4*H1
  21471. -->
  21472. inner elaboration loop at bottom goal.
  21473. Retracting rl*prefer*rvt*predict-no*H0*4
  21474. -->
  21475. (S1 ^operator O2090 = 0.1269768330791913)
  21476. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21477. -->
  21478. (S1 ^operator O2090 = 0.8730230908822952)
  21479. Retracting rl*prefer*rvt*predict-yes*H0*3
  21480. -->
  21481. (S1 ^operator O2089 = 0.3829442071608071)
  21482. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21483. -->
  21484. (S1 ^operator O2089 = 0.2696941111808541)
  21485. --- END Proposal Phase ---
  21486. --- Decision Phase ---
  21487. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.951872,0.0460583)
  21488. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  21489. =>WM: (14694: S1 ^operator O2092)
  21490. 1046: O: O2092 (predict-no)
  21491. --- END Decision Phase ---
  21492. --- Application Phase ---
  21493. --- Firing Productions (PE) For State At Depth 1 ---
  21494. --- Inner Elaboration Phase, active level 1 (S1) ---
  21495. Firing apply*operator
  21496. -->
  21497. (I3 ^predict-no N1046 + :O )
  21498. Firing apply*operator*complete
  21499. -->
  21500. (I3 ^predict-no N1045 - :O )
  21501. inner elaboration loop at bottom goal.
  21502. --- Change Working Memory (PE) ---
  21503. =>WM: (14695: I3 ^predict-no N1046)
  21504. <=WM: (14683: N1045 ^status complete)
  21505. <=WM: (14682: I3 ^predict-no N1045)
  21506. --- Firing Productions (IE) For State At Depth 1 ---
  21507. --- Inner Elaboration Phase, active level 1 (S1) ---
  21508. Firing monitor*world
  21509. -->
  21510. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21511. --- Change Working Memory (IE) ---
  21512. --- END Application Phase ---
  21513. --- Output Phase ---
  21514. ENV: Agent did: predict-no for direction R in state State-B
  21515. In State-B moving R
  21516. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21517. predict error 0
  21518. dir: dir isL
  21519. --- END Output Phase ---
  21520. /|\--- Input Phase ---
  21521. =>WM: (14699: I2 ^dir L)
  21522. =>WM: (14698: I2 ^reward 1)
  21523. =>WM: (14697: I2 ^see 0)
  21524. =>WM: (14696: N1046 ^status complete)
  21525. <=WM: (14686: I2 ^dir R)
  21526. <=WM: (14685: I2 ^reward 1)
  21527. <=WM: (14684: I2 ^see 0)
  21528. =>WM: (14700: I2 ^level-1 R0-root)
  21529. <=WM: (14687: I2 ^level-1 R0-root)
  21530. --- END Input Phase ---
  21531. --- Proposal Phase ---
  21532. --- Inner Elaboration Phase, active level 1 (S1) ---
  21533. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  21534. -->
  21535. (S1 ^operator O2091 = 0.4768814583471331)
  21536. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  21537. -->
  21538. (S1 ^operator O2092 = 0.1700769046561409)
  21539. Firing prefer*rvt*predict-no*H0*2*H1
  21540. -->
  21541. Firing prefer*rvt*predict-yes*H0*1*H1
  21542. -->
  21543. Firing elaborate*copy-see-to-output-link
  21544. -->
  21545. (I3 ^see 0 +)
  21546. Firing elaborate*reward*based*on*reward
  21547. -->
  21548. (R1050 ^value 1 +)
  21549. (R1 ^reward R1050 +)
  21550. Firing propose*predict-yes
  21551. -->
  21552. (O2093 ^name predict-yes +)
  21553. (S1 ^operator O2093 +)
  21554. Firing propose*predict-no
  21555. -->
  21556. (O2094 ^name predict-no +)
  21557. (S1 ^operator O2094 +)
  21558. Firing rl*prefer*rvt*predict-no*H0*2
  21559. -->
  21560. (S1 ^operator O2092 = 0.2550133912230119)
  21561. Firing rl*prefer*rvt*predict-yes*H0*1
  21562. -->
  21563. (S1 ^operator O2091 = 0.5231194249480864)
  21564. Firing prefer*rvt*predict-yes*H0
  21565. -->
  21566. Firing prefer*rvt*predict-no*H0
  21567. -->
  21568. Firing elaborate*copy-dir-to-output-link
  21569. -->
  21570. (I3 ^dir L +)
  21571. inner elaboration loop at bottom goal.
  21572. Retracting elaborate*copy-see-to-output-link
  21573. -->
  21574. (I3 ^see 0 +)
  21575. Retracting propose*predict-no
  21576. -->
  21577. (O2092 ^name predict-no +)
  21578. (S1 ^operator O2092 +)
  21579. Retracting propose*predict-yes
  21580. -->
  21581. (O2091 ^name predict-yes +)
  21582. (S1 ^operator O2091 +)
  21583. Retracting elaborate*reward*based*on*reward
  21584. -->
  21585. (R1049 ^value 1 +)
  21586. (R1 ^reward R1049 +)
  21587. Retracting elaborate*copy-dir-to-output-link
  21588. -->
  21589. (I3 ^dir R +)
  21590. Retracting rl*prefer*rvt*predict-no*H0*4
  21591. -->
  21592. (S1 ^operator O2092 = 0.1269768444849683)
  21593. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  21594. -->
  21595. (S1 ^operator O2092 = 0.8730231022880722)
  21596. Retracting rl*prefer*rvt*predict-yes*H0*3
  21597. -->
  21598. (S1 ^operator O2091 = 0.3829442071608071)
  21599. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  21600. -->
  21601. (S1 ^operator O2091 = 0.2696941111808541)
  21602. =>WM: (14707: S1 ^operator O2094 +)
  21603. =>WM: (14706: S1 ^operator O2093 +)
  21604. =>WM: (14705: I3 ^dir L)
  21605. =>WM: (14704: O2094 ^name predict-no)
  21606. =>WM: (14703: O2093 ^name predict-yes)
  21607. =>WM: (14702: R1050 ^value 1)
  21608. =>WM: (14701: R1 ^reward R1050)
  21609. <=WM: (14692: S1 ^operator O2091 +)
  21610. <=WM: (14693: S1 ^operator O2092 +)
  21611. <=WM: (14694: S1 ^operator O2092)
  21612. <=WM: (14665: I3 ^dir R)
  21613. <=WM: (14688: R1 ^reward R1049)
  21614. <=WM: (14691: O2092 ^name predict-no)
  21615. <=WM: (14690: O2091 ^name predict-yes)
  21616. <=WM: (14689: R1049 ^value 1)
  21617. --- Inner Elaboration Phase, active level 1 (S1) ---
  21618. Firing prefer*rvt*predict-yes*H0
  21619. -->
  21620. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  21621. -->
  21622. (S1 ^operator O2093 = 0.4768814583471331)
  21623. Firing rl*prefer*rvt*predict-yes*H0*1
  21624. -->
  21625. (S1 ^operator O2093 = 0.5231194249480864)
  21626. Firing prefer*rvt*predict-yes*H0*1*H1
  21627. -->
  21628. Firing prefer*rvt*predict-no*H0
  21629. -->
  21630. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  21631. -->
  21632. (S1 ^operator O2094 = 0.1700769046561409)
  21633. Firing rl*prefer*rvt*predict-no*H0*2
  21634. -->
  21635. (S1 ^operator O2094 = 0.2550133912230119)
  21636. Firing prefer*rvt*predict-no*H0*2*H1
  21637. -->
  21638. inner elaboration loop at bottom goal.
  21639. Retracting rl*prefer*rvt*predict-no*H0*2
  21640. -->
  21641. (S1 ^operator O2092 = 0.2550133912230119)
  21642. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  21643. -->
  21644. (S1 ^operator O2092 = 0.1700769046561409)
  21645. Retracting rl*prefer*rvt*predict-yes*H0*1
  21646. -->
  21647. (S1 ^operator O2091 = 0.5231194249480864)
  21648. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  21649. -->
  21650. (S1 ^operator O2091 = 0.4768814583471331)
  21651. --- END Proposal Phase ---
  21652. --- Decision Phase ---
  21653. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.952128,0.0458243)
  21654. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  21655. =>WM: (14708: S1 ^operator O2093)
  21656. 1047: O: O2093 (predict-yes)
  21657. --- END Decision Phase ---
  21658. --- Application Phase ---
  21659. --- Firing Productions (PE) For State At Depth 1 ---
  21660. --- Inner Elaboration Phase, active level 1 (S1) ---
  21661. Firing apply*operator
  21662. -->
  21663. (I3 ^predict-yes N1047 + :O )
  21664. Firing apply*operator*complete
  21665. -->
  21666. (I3 ^predict-no N1046 - :O )
  21667. inner elaboration loop at bottom goal.
  21668. --- Change Working Memory (PE) ---
  21669. =>WM: (14709: I3 ^predict-yes N1047)
  21670. <=WM: (14696: N1046 ^status complete)
  21671. <=WM: (14695: I3 ^predict-no N1046)
  21672. --- Firing Productions (IE) For State At Depth 1 ---
  21673. --- Inner Elaboration Phase, active level 1 (S1) ---
  21674. Firing monitor*world
  21675. -->
  21676. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21677. --- Change Working Memory (IE) ---
  21678. --- END Application Phase ---
  21679. --- Output Phase ---
  21680. ENV: Agent did: predict-yes for direction L in state State-B
  21681. In State-B moving L
  21682. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  21683. predict error 0
  21684. dir: dir isR
  21685. --- END Output Phase ---
  21686. -/--- Input Phase ---
  21687. =>WM: (14713: I2 ^dir R)
  21688. =>WM: (14712: I2 ^reward 1)
  21689. =>WM: (14711: I2 ^see 1)
  21690. =>WM: (14710: N1047 ^status complete)
  21691. <=WM: (14699: I2 ^dir L)
  21692. <=WM: (14698: I2 ^reward 1)
  21693. <=WM: (14697: I2 ^see 0)
  21694. =>WM: (14714: I2 ^level-1 L1-root)
  21695. <=WM: (14700: I2 ^level-1 R0-root)
  21696. --- END Input Phase ---
  21697. --- Proposal Phase ---
  21698. --- Inner Elaboration Phase, active level 1 (S1) ---
  21699. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  21700. -->
  21701. (S1 ^operator O2093 = 0.6170481813010518)
  21702. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  21703. -->
  21704. (S1 ^operator O2094 = 0.4901349546100854)
  21705. Firing prefer*rvt*predict-no*H0*4*H1
  21706. -->
  21707. Firing prefer*rvt*predict-yes*H0*3*H1
  21708. -->
  21709. Firing elaborate*copy-see-to-output-link
  21710. -->
  21711. (I3 ^see 1 +)
  21712. Firing elaborate*reward*based*on*reward
  21713. -->
  21714. (R1051 ^value 1 +)
  21715. (R1 ^reward R1051 +)
  21716. Firing propose*predict-yes
  21717. -->
  21718. (O2095 ^name predict-yes +)
  21719. (S1 ^operator O2095 +)
  21720. Firing propose*predict-no
  21721. -->
  21722. (O2096 ^name predict-no +)
  21723. (S1 ^operator O2096 +)
  21724. Firing rl*prefer*rvt*predict-no*H0*4
  21725. -->
  21726. (S1 ^operator O2094 = 0.1269768524690122)
  21727. Firing rl*prefer*rvt*predict-yes*H0*3
  21728. -->
  21729. (S1 ^operator O2093 = 0.3829442071608071)
  21730. Firing prefer*rvt*predict-yes*H0
  21731. -->
  21732. Firing prefer*rvt*predict-no*H0
  21733. -->
  21734. Firing elaborate*copy-dir-to-output-link
  21735. -->
  21736. (I3 ^dir R +)
  21737. inner elaboration loop at bottom goal.
  21738. Retracting elaborate*copy-see-to-output-link
  21739. -->
  21740. (I3 ^see 0 +)
  21741. Retracting propose*predict-no
  21742. -->
  21743. (O2094 ^name predict-no +)
  21744. (S1 ^operator O2094 +)
  21745. Retracting propose*predict-yes
  21746. -->
  21747. (O2093 ^name predict-yes +)
  21748. (S1 ^operator O2093 +)
  21749. Retracting elaborate*reward*based*on*reward
  21750. -->
  21751. (R1050 ^value 1 +)
  21752. (R1 ^reward R1050 +)
  21753. Retracting elaborate*copy-dir-to-output-link
  21754. -->
  21755. (I3 ^dir L +)
  21756. Retracting rl*prefer*rvt*predict-no*H0*2
  21757. -->
  21758. (S1 ^operator O2094 = 0.2550133912230119)
  21759. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  21760. -->
  21761. (S1 ^operator O2094 = 0.1700769046561409)
  21762. Retracting rl*prefer*rvt*predict-yes*H0*1
  21763. -->
  21764. (S1 ^operator O2093 = 0.5231194249480864)
  21765. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  21766. -->
  21767. (S1 ^operator O2093 = 0.4768814583471331)
  21768. =>WM: (14722: S1 ^operator O2096 +)
  21769. =>WM: (14721: S1 ^operator O2095 +)
  21770. =>WM: (14720: I3 ^dir R)
  21771. =>WM: (14719: O2096 ^name predict-no)
  21772. =>WM: (14718: O2095 ^name predict-yes)
  21773. =>WM: (14717: R1051 ^value 1)
  21774. =>WM: (14716: R1 ^reward R1051)
  21775. =>WM: (14715: I3 ^see 1)
  21776. <=WM: (14706: S1 ^operator O2093 +)
  21777. <=WM: (14708: S1 ^operator O2093)
  21778. <=WM: (14707: S1 ^operator O2094 +)
  21779. <=WM: (14705: I3 ^dir L)
  21780. <=WM: (14701: R1 ^reward R1050)
  21781. <=WM: (14620: I3 ^see 0)
  21782. <=WM: (14704: O2094 ^name predict-no)
  21783. <=WM: (14703: O2093 ^name predict-yes)
  21784. <=WM: (14702: R1050 ^value 1)
  21785. --- Inner Elaboration Phase, active level 1 (S1) ---
  21786. Firing prefer*rvt*predict-yes*H0
  21787. -->
  21788. Firing rl*prefer*rvt*predict-yes*H0*3
  21789. -->
  21790. (S1 ^operator O2095 = 0.3829442071608071)
  21791. Firing prefer*rvt*predict-yes*H0*3*H1
  21792. -->
  21793. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  21794. -->
  21795. (S1 ^operator O2095 = 0.6170481813010518)
  21796. Firing prefer*rvt*predict-no*H0
  21797. -->
  21798. Firing rl*prefer*rvt*predict-no*H0*4
  21799. -->
  21800. (S1 ^operator O2096 = 0.1269768524690122)
  21801. Firing prefer*rvt*predict-no*H0*4*H1
  21802. -->
  21803. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  21804. -->
  21805. (S1 ^operator O2096 = 0.4901349546100854)
  21806. inner elaboration loop at bottom goal.
  21807. Retracting rl*prefer*rvt*predict-no*H0*4
  21808. -->
  21809. (S1 ^operator O2094 = 0.1269768524690122)
  21810. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  21811. -->
  21812. (S1 ^operator O2094 = 0.4901349546100854)
  21813. Retracting rl*prefer*rvt*predict-yes*H0*3
  21814. -->
  21815. (S1 ^operator O2093 = 0.3829442071608071)
  21816. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  21817. -->
  21818. (S1 ^operator O2093 = 0.6170481813010518)
  21819. --- END Proposal Phase ---
  21820. --- Decision Phase ---
  21821. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.98,0.0197315)
  21822. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272042 0.204839 0.476881 -> 0.272042 0.204839 0.476881(R,m,v=1,1,0)
  21823. =>WM: (14723: S1 ^operator O2095)
  21824. 1048: O: O2095 (predict-yes)
  21825. --- END Decision Phase ---
  21826. --- Application Phase ---
  21827. --- Firing Productions (PE) For State At Depth 1 ---
  21828. --- Inner Elaboration Phase, active level 1 (S1) ---
  21829. Firing apply*operator
  21830. -->
  21831. (I3 ^predict-yes N1048 + :O )
  21832. Firing apply*operator*complete
  21833. -->
  21834. (I3 ^predict-yes N1047 - :O )
  21835. inner elaboration loop at bottom goal.
  21836. --- Change Working Memory (PE) ---
  21837. =>WM: (14724: I3 ^predict-yes N1048)
  21838. <=WM: (14710: N1047 ^status complete)
  21839. <=WM: (14709: I3 ^predict-yes N1047)
  21840. --- Firing Productions (IE) For State At Depth 1 ---
  21841. --- Inner Elaboration Phase, active level 1 (S1) ---
  21842. Firing monitor*world
  21843. -->
  21844. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21845. --- Change Working Memory (IE) ---
  21846. --- END Application Phase ---
  21847. --- Output Phase ---
  21848. ENV: Agent did: predict-yes for direction R in state State-A
  21849. In State-A moving R
  21850. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  21851. predict error 0
  21852. dir: dir isU
  21853. --- END Output Phase ---
  21854. |\---- Input Phase ---
  21855. =>WM: (14728: I2 ^dir U)
  21856. =>WM: (14727: I2 ^reward 1)
  21857. =>WM: (14726: I2 ^see 1)
  21858. =>WM: (14725: N1048 ^status complete)
  21859. <=WM: (14713: I2 ^dir R)
  21860. <=WM: (14712: I2 ^reward 1)
  21861. <=WM: (14711: I2 ^see 1)
  21862. =>WM: (14729: I2 ^level-1 R1-root)
  21863. <=WM: (14714: I2 ^level-1 L1-root)
  21864. --- END Input Phase ---
  21865. --- Proposal Phase ---
  21866. --- Inner Elaboration Phase, active level 1 (S1) ---
  21867. Firing elaborate*copy-see-to-output-link
  21868. -->
  21869. (I3 ^see 1 +)
  21870. Firing elaborate*reward*based*on*reward
  21871. -->
  21872. (R1052 ^value 1 +)
  21873. (R1 ^reward R1052 +)
  21874. Firing propose*predict-yes
  21875. -->
  21876. (O2097 ^name predict-yes +)
  21877. (S1 ^operator O2097 +)
  21878. Firing propose*predict-no
  21879. -->
  21880. (O2098 ^name predict-no +)
  21881. (S1 ^operator O2098 +)
  21882. Firing rl*prefer*rvt*predict-no*H0*6
  21883. -->
  21884. (S1 ^operator O2096 = 0.9999999999999999)
  21885. Firing rl*prefer*rvt*predict-yes*H0*5
  21886. -->
  21887. (S1 ^operator O2095 = 0.)
  21888. Firing prefer*rvt*predict-yes*H0
  21889. -->
  21890. Firing prefer*rvt*predict-no*H0
  21891. -->
  21892. Firing elaborate*copy-dir-to-output-link
  21893. -->
  21894. (I3 ^dir U +)
  21895. inner elaboration loop at bottom goal.
  21896. Retracting elaborate*copy-see-to-output-link
  21897. -->
  21898. (I3 ^see 1 +)
  21899. Retracting propose*predict-no
  21900. -->
  21901. (O2096 ^name predict-no +)
  21902. (S1 ^operator O2096 +)
  21903. Retracting propose*predict-yes
  21904. -->
  21905. (O2095 ^name predict-yes +)
  21906. (S1 ^operator O2095 +)
  21907. Retracting elaborate*reward*based*on*reward
  21908. -->
  21909. (R1051 ^value 1 +)
  21910. (R1 ^reward R1051 +)
  21911. Retracting elaborate*copy-dir-to-output-link
  21912. -->
  21913. (I3 ^dir R +)
  21914. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  21915. -->
  21916. (S1 ^operator O2096 = 0.4901349546100854)
  21917. Retracting rl*prefer*rvt*predict-no*H0*4
  21918. -->
  21919. (S1 ^operator O2096 = 0.1269768524690122)
  21920. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  21921. -->
  21922. (S1 ^operator O2095 = 0.6170481813010518)
  21923. Retracting rl*prefer*rvt*predict-yes*H0*3
  21924. -->
  21925. (S1 ^operator O2095 = 0.3829442071608071)
  21926. =>WM: (14736: S1 ^operator O2098 +)
  21927. =>WM: (14735: S1 ^operator O2097 +)
  21928. =>WM: (14734: I3 ^dir U)
  21929. =>WM: (14733: O2098 ^name predict-no)
  21930. =>WM: (14732: O2097 ^name predict-yes)
  21931. =>WM: (14731: R1052 ^value 1)
  21932. =>WM: (14730: R1 ^reward R1052)
  21933. <=WM: (14721: S1 ^operator O2095 +)
  21934. <=WM: (14723: S1 ^operator O2095)
  21935. <=WM: (14722: S1 ^operator O2096 +)
  21936. <=WM: (14720: I3 ^dir R)
  21937. <=WM: (14716: R1 ^reward R1051)
  21938. <=WM: (14719: O2096 ^name predict-no)
  21939. <=WM: (14718: O2095 ^name predict-yes)
  21940. <=WM: (14717: R1051 ^value 1)
  21941. --- Inner Elaboration Phase, active level 1 (S1) ---
  21942. Firing prefer*rvt*predict-yes*H0
  21943. -->
  21944. Firing rl*prefer*rvt*predict-yes*H0*5
  21945. -->
  21946. (S1 ^operator O2097 = 0.)
  21947. Firing prefer*rvt*predict-no*H0
  21948. -->
  21949. Firing rl*prefer*rvt*predict-no*H0*6
  21950. -->
  21951. (S1 ^operator O2098 = 0.9999999999999999)
  21952. inner elaboration loop at bottom goal.
  21953. Retracting rl*prefer*rvt*predict-no*H0*6
  21954. -->
  21955. (S1 ^operator O2096 = 0.9999999999999999)
  21956. Retracting rl*prefer*rvt*predict-yes*H0*5
  21957. -->
  21958. (S1 ^operator O2095 = 0.)
  21959. --- END Proposal Phase ---
  21960. --- Decision Phase ---
  21961. RL update rl*prefer*rvt*predict-yes*H0*3 0.673137 -0.290193 0.382944 -> 0.673138 -0.290193 0.382945(R,m,v=1,0.962733,0.0361025)
  21962. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326856 0.290192 0.617048 -> 0.326857 0.290192 0.617049(R,m,v=1,1,0)
  21963. =>WM: (14737: S1 ^operator O2098)
  21964. 1049: O: O2098 (predict-no)
  21965. --- END Decision Phase ---
  21966. --- Application Phase ---
  21967. --- Firing Productions (PE) For State At Depth 1 ---
  21968. --- Inner Elaboration Phase, active level 1 (S1) ---
  21969. Firing apply*operator
  21970. -->
  21971. (I3 ^predict-no N1049 + :O )
  21972. Firing apply*operator*complete
  21973. -->
  21974. (I3 ^predict-yes N1048 - :O )
  21975. inner elaboration loop at bottom goal.
  21976. --- Change Working Memory (PE) ---
  21977. =>WM: (14738: I3 ^predict-no N1049)
  21978. <=WM: (14725: N1048 ^status complete)
  21979. <=WM: (14724: I3 ^predict-yes N1048)
  21980. --- Firing Productions (IE) For State At Depth 1 ---
  21981. --- Inner Elaboration Phase, active level 1 (S1) ---
  21982. Firing monitor*world
  21983. -->
  21984. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21985. --- Change Working Memory (IE) ---
  21986. --- END Application Phase ---
  21987. --- Output Phase ---
  21988. ENV: Agent did: predict-no for direction U in state State-B
  21989. In State-B moving U
  21990. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21991. predict error 0
  21992. dir: dir isU
  21993. --- END Output Phase ---
  21994. /|\--- Input Phase ---
  21995. =>WM: (14742: I2 ^dir U)
  21996. =>WM: (14741: I2 ^reward 1)
  21997. =>WM: (14740: I2 ^see 0)
  21998. =>WM: (14739: N1049 ^status complete)
  21999. <=WM: (14728: I2 ^dir U)
  22000. <=WM: (14727: I2 ^reward 1)
  22001. <=WM: (14726: I2 ^see 1)
  22002. =>WM: (14743: I2 ^level-1 R1-root)
  22003. <=WM: (14729: I2 ^level-1 R1-root)
  22004. --- END Input Phase ---
  22005. --- Proposal Phase ---
  22006. --- Inner Elaboration Phase, active level 1 (S1) ---
  22007. Firing elaborate*copy-see-to-output-link
  22008. -->
  22009. (I3 ^see 0 +)
  22010. Firing elaborate*reward*based*on*reward
  22011. -->
  22012. (R1053 ^value 1 +)
  22013. (R1 ^reward R1053 +)
  22014. Firing propose*predict-yes
  22015. -->
  22016. (O2099 ^name predict-yes +)
  22017. (S1 ^operator O2099 +)
  22018. Firing propose*predict-no
  22019. -->
  22020. (O2100 ^name predict-no +)
  22021. (S1 ^operator O2100 +)
  22022. Firing rl*prefer*rvt*predict-no*H0*6
  22023. -->
  22024. (S1 ^operator O2098 = 0.9999999999999999)
  22025. Firing rl*prefer*rvt*predict-yes*H0*5
  22026. -->
  22027. (S1 ^operator O2097 = 0.)
  22028. Firing prefer*rvt*predict-yes*H0
  22029. -->
  22030. Firing prefer*rvt*predict-no*H0
  22031. -->
  22032. Firing elaborate*copy-dir-to-output-link
  22033. -->
  22034. (I3 ^dir U +)
  22035. inner elaboration loop at bottom goal.
  22036. Retracting elaborate*copy-see-to-output-link
  22037. -->
  22038. (I3 ^see 1 +)
  22039. Retracting propose*predict-no
  22040. -->
  22041. (O2098 ^name predict-no +)
  22042. (S1 ^operator O2098 +)
  22043. Retracting propose*predict-yes
  22044. -->
  22045. (O2097 ^name predict-yes +)
  22046. (S1 ^operator O2097 +)
  22047. Retracting elaborate*reward*based*on*reward
  22048. -->
  22049. (R1052 ^value 1 +)
  22050. (R1 ^reward R1052 +)
  22051. Retracting elaborate*copy-dir-to-output-link
  22052. -->
  22053. (I3 ^dir U +)
  22054. Retracting rl*prefer*rvt*predict-no*H0*6
  22055. -->
  22056. (S1 ^operator O2098 = 0.9999999999999999)
  22057. Retracting rl*prefer*rvt*predict-yes*H0*5
  22058. -->
  22059. (S1 ^operator O2097 = 0.)
  22060. =>WM: (14750: S1 ^operator O2100 +)
  22061. =>WM: (14749: S1 ^operator O2099 +)
  22062. =>WM: (14748: O2100 ^name predict-no)
  22063. =>WM: (14747: O2099 ^name predict-yes)
  22064. =>WM: (14746: R1053 ^value 1)
  22065. =>WM: (14745: R1 ^reward R1053)
  22066. =>WM: (14744: I3 ^see 0)
  22067. <=WM: (14735: S1 ^operator O2097 +)
  22068. <=WM: (14736: S1 ^operator O2098 +)
  22069. <=WM: (14737: S1 ^operator O2098)
  22070. <=WM: (14730: R1 ^reward R1052)
  22071. <=WM: (14715: I3 ^see 1)
  22072. <=WM: (14733: O2098 ^name predict-no)
  22073. <=WM: (14732: O2097 ^name predict-yes)
  22074. <=WM: (14731: R1052 ^value 1)
  22075. --- Inner Elaboration Phase, active level 1 (S1) ---
  22076. Firing prefer*rvt*predict-yes*H0
  22077. -->
  22078. Firing rl*prefer*rvt*predict-yes*H0*5
  22079. -->
  22080. (S1 ^operator O2099 = 0.)
  22081. Firing prefer*rvt*predict-no*H0
  22082. -->
  22083. Firing rl*prefer*rvt*predict-no*H0*6
  22084. -->
  22085. (S1 ^operator O2100 = 0.9999999999999999)
  22086. inner elaboration loop at bottom goal.
  22087. Retracting rl*prefer*rvt*predict-no*H0*6
  22088. -->
  22089. (S1 ^operator O2098 = 0.9999999999999999)
  22090. Retracting rl*prefer*rvt*predict-yes*H0*5
  22091. -->
  22092. (S1 ^operator O2097 = 0.)
  22093. --- END Proposal Phase ---
  22094. --- Decision Phase ---
  22095. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22096. =>WM: (14751: S1 ^operator O2100)
  22097. 1050: O: O2100 (predict-no)
  22098. --- END Decision Phase ---
  22099. --- Application Phase ---
  22100. --- Firing Productions (PE) For State At Depth 1 ---
  22101. --- Inner Elaboration Phase, active level 1 (S1) ---
  22102. Firing apply*operator
  22103. -->
  22104. (I3 ^predict-no N1050 + :O )
  22105. Firing apply*operator*complete
  22106. -->
  22107. (I3 ^predict-no N1049 - :O )
  22108. inner elaboration loop at bottom goal.
  22109. --- Change Working Memory (PE) ---
  22110. =>WM: (14752: I3 ^predict-no N1050)
  22111. <=WM: (14739: N1049 ^status complete)
  22112. <=WM: (14738: I3 ^predict-no N1049)
  22113. --- Firing Productions (IE) For State At Depth 1 ---
  22114. --- Inner Elaboration Phase, active level 1 (S1) ---
  22115. Firing monitor*world
  22116. -->
  22117. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22118. --- Change Working Memory (IE) ---
  22119. --- END Application Phase ---
  22120. --- Output Phase ---
  22121. ENV: Agent did: predict-no for direction U in state State-B
  22122. In State-B moving U
  22123. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22124. predict error 0
  22125. dir: dir isL
  22126. --- END Output Phase ---
  22127. -/|--- Input Phase ---
  22128. =>WM: (14756: I2 ^dir L)
  22129. =>WM: (14755: I2 ^reward 1)
  22130. =>WM: (14754: I2 ^see 0)
  22131. =>WM: (14753: N1050 ^status complete)
  22132. <=WM: (14742: I2 ^dir U)
  22133. <=WM: (14741: I2 ^reward 1)
  22134. <=WM: (14740: I2 ^see 0)
  22135. =>WM: (14757: I2 ^level-1 R1-root)
  22136. <=WM: (14743: I2 ^level-1 R1-root)
  22137. --- END Input Phase ---
  22138. --- Proposal Phase ---
  22139. --- Inner Elaboration Phase, active level 1 (S1) ---
  22140. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  22141. -->
  22142. (S1 ^operator O2099 = 0.4768786732073501)
  22143. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  22144. -->
  22145. (S1 ^operator O2100 = -0.01194930198035649)
  22146. Firing prefer*rvt*predict-no*H0*2*H1
  22147. -->
  22148. Firing prefer*rvt*predict-yes*H0*1*H1
  22149. -->
  22150. Firing elaborate*copy-see-to-output-link
  22151. -->
  22152. (I3 ^see 0 +)
  22153. Firing elaborate*reward*based*on*reward
  22154. -->
  22155. (R1054 ^value 1 +)
  22156. (R1 ^reward R1054 +)
  22157. Firing propose*predict-yes
  22158. -->
  22159. (O2101 ^name predict-yes +)
  22160. (S1 ^operator O2101 +)
  22161. Firing propose*predict-no
  22162. -->
  22163. (O2102 ^name predict-no +)
  22164. (S1 ^operator O2102 +)
  22165. Firing rl*prefer*rvt*predict-no*H0*2
  22166. -->
  22167. (S1 ^operator O2100 = 0.2550133912230119)
  22168. Firing rl*prefer*rvt*predict-yes*H0*1
  22169. -->
  22170. (S1 ^operator O2099 = 0.5231192924538035)
  22171. Firing prefer*rvt*predict-yes*H0
  22172. -->
  22173. Firing prefer*rvt*predict-no*H0
  22174. -->
  22175. Firing elaborate*copy-dir-to-output-link
  22176. -->
  22177. (I3 ^dir L +)
  22178. inner elaboration loop at bottom goal.
  22179. Retracting elaborate*copy-see-to-output-link
  22180. -->
  22181. (I3 ^see 0 +)
  22182. Retracting propose*predict-no
  22183. -->
  22184. (O2100 ^name predict-no +)
  22185. (S1 ^operator O2100 +)
  22186. Retracting propose*predict-yes
  22187. -->
  22188. (O2099 ^name predict-yes +)
  22189. (S1 ^operator O2099 +)
  22190. Retracting elaborate*reward*based*on*reward
  22191. -->
  22192. (R1053 ^value 1 +)
  22193. (R1 ^reward R1053 +)
  22194. Retracting elaborate*copy-dir-to-output-link
  22195. -->
  22196. (I3 ^dir U +)
  22197. Retracting rl*prefer*rvt*predict-no*H0*6
  22198. -->
  22199. (S1 ^operator O2100 = 0.9999999999999999)
  22200. Retracting rl*prefer*rvt*predict-yes*H0*5
  22201. -->
  22202. (S1 ^operator O2099 = 0.)
  22203. =>WM: (14764: S1 ^operator O2102 +)
  22204. =>WM: (14763: S1 ^operator O2101 +)
  22205. =>WM: (14762: I3 ^dir L)
  22206. =>WM: (14761: O2102 ^name predict-no)
  22207. =>WM: (14760: O2101 ^name predict-yes)
  22208. =>WM: (14759: R1054 ^value 1)
  22209. =>WM: (14758: R1 ^reward R1054)
  22210. <=WM: (14749: S1 ^operator O2099 +)
  22211. <=WM: (14750: S1 ^operator O2100 +)
  22212. <=WM: (14751: S1 ^operator O2100)
  22213. <=WM: (14734: I3 ^dir U)
  22214. <=WM: (14745: R1 ^reward R1053)
  22215. <=WM: (14748: O2100 ^name predict-no)
  22216. <=WM: (14747: O2099 ^name predict-yes)
  22217. <=WM: (14746: R1053 ^value 1)
  22218. --- Inner Elaboration Phase, active level 1 (S1) ---
  22219. Firing prefer*rvt*predict-yes*H0
  22220. -->
  22221. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  22222. -->
  22223. (S1 ^operator O2101 = 0.4768786732073501)
  22224. Firing rl*prefer*rvt*predict-yes*H0*1
  22225. -->
  22226. (S1 ^operator O2101 = 0.5231192924538035)
  22227. Firing prefer*rvt*predict-yes*H0*1*H1
  22228. -->
  22229. Firing prefer*rvt*predict-no*H0
  22230. -->
  22231. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  22232. -->
  22233. (S1 ^operator O2102 = -0.01194930198035649)
  22234. Firing rl*prefer*rvt*predict-no*H0*2
  22235. -->
  22236. (S1 ^operator O2102 = 0.2550133912230119)
  22237. Firing prefer*rvt*predict-no*H0*2*H1
  22238. -->
  22239. inner elaboration loop at bottom goal.
  22240. Retracting rl*prefer*rvt*predict-no*H0*2
  22241. -->
  22242. (S1 ^operator O2100 = 0.2550133912230119)
  22243. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  22244. -->
  22245. (S1 ^operator O2100 = -0.01194930198035649)
  22246. Retracting rl*prefer*rvt*predict-yes*H0*1
  22247. -->
  22248. (S1 ^operator O2099 = 0.5231192924538035)
  22249. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  22250. -->
  22251. (S1 ^operator O2099 = 0.4768786732073501)
  22252. --- END Proposal Phase ---
  22253. --- Decision Phase ---
  22254. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22255. =>WM: (14765: S1 ^operator O2101)
  22256. 1051: O: O2101 (predict-yes)
  22257. --- END Decision Phase ---
  22258. --- Application Phase ---
  22259. --- Firing Productions (PE) For State At Depth 1 ---
  22260. --- Inner Elaboration Phase, active level 1 (S1) ---
  22261. Firing apply*operator
  22262. -->
  22263. (I3 ^predict-yes N1051 + :O )
  22264. Firing apply*operator*complete
  22265. -->
  22266. (I3 ^predict-no N1050 - :O )
  22267. inner elaboration loop at bottom goal.
  22268. --- Change Working Memory (PE) ---
  22269. =>WM: (14766: I3 ^predict-yes N1051)
  22270. <=WM: (14753: N1050 ^status complete)
  22271. <=WM: (14752: I3 ^predict-no N1050)
  22272. --- Firing Productions (IE) For State At Depth 1 ---
  22273. --- Inner Elaboration Phase, active level 1 (S1) ---
  22274. Firing monitor*world
  22275. -->
  22276. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22277. --- Change Working Memory (IE) ---
  22278. --- END Application Phase ---
  22279. --- Output Phase ---
  22280. ENV: Agent did: predict-yes for direction L in state State-B
  22281. In State-B moving L
  22282. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  22283. predict error 0
  22284. dir: dir isU
  22285. --- END Output Phase ---
  22286. \--- Input Phase ---
  22287. =>WM: (14770: I2 ^dir U)
  22288. =>WM: (14769: I2 ^reward 1)
  22289. =>WM: (14768: I2 ^see 1)
  22290. =>WM: (14767: N1051 ^status complete)
  22291. <=WM: (14756: I2 ^dir L)
  22292. <=WM: (14755: I2 ^reward 1)
  22293. <=WM: (14754: I2 ^see 0)
  22294. =>WM: (14771: I2 ^level-1 L1-root)
  22295. <=WM: (14757: I2 ^level-1 R1-root)
  22296. --- END Input Phase ---
  22297. --- Proposal Phase ---
  22298. --- Inner Elaboration Phase, active level 1 (S1) ---
  22299. Firing elaborate*copy-see-to-output-link
  22300. -->
  22301. (I3 ^see 1 +)
  22302. Firing elaborate*reward*based*on*reward
  22303. -->
  22304. (R1055 ^value 1 +)
  22305. (R1 ^reward R1055 +)
  22306. Firing propose*predict-yes
  22307. -->
  22308. (O2103 ^name predict-yes +)
  22309. (S1 ^operator O2103 +)
  22310. Firing propose*predict-no
  22311. -->
  22312. (O2104 ^name predict-no +)
  22313. (S1 ^operator O2104 +)
  22314. Firing rl*prefer*rvt*predict-no*H0*6
  22315. -->
  22316. (S1 ^operator O2102 = 0.9999999999999999)
  22317. Firing rl*prefer*rvt*predict-yes*H0*5
  22318. -->
  22319. (S1 ^operator O2101 = 0.)
  22320. Firing prefer*rvt*predict-yes*H0
  22321. -->
  22322. Firing prefer*rvt*predict-no*H0
  22323. -->
  22324. Firing elaborate*copy-dir-to-output-link
  22325. -->
  22326. (I3 ^dir U +)
  22327. inner elaboration loop at bottom goal.
  22328. Retracting elaborate*copy-see-to-output-link
  22329. -->
  22330. (I3 ^see 0 +)
  22331. Retracting propose*predict-no
  22332. -->
  22333. (O2102 ^name predict-no +)
  22334. (S1 ^operator O2102 +)
  22335. Retracting propose*predict-yes
  22336. -->
  22337. (O2101 ^name predict-yes +)
  22338. (S1 ^operator O2101 +)
  22339. Retracting elaborate*reward*based*on*reward
  22340. -->
  22341. (R1054 ^value 1 +)
  22342. (R1 ^reward R1054 +)
  22343. Retracting elaborate*copy-dir-to-output-link
  22344. -->
  22345. (I3 ^dir L +)
  22346. Retracting rl*prefer*rvt*predict-no*H0*2
  22347. -->
  22348. (S1 ^operator O2102 = 0.2550133912230119)
  22349. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  22350. -->
  22351. (S1 ^operator O2102 = -0.01194930198035649)
  22352. Retracting rl*prefer*rvt*predict-yes*H0*1
  22353. -->
  22354. (S1 ^operator O2101 = 0.5231192924538035)
  22355. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  22356. -->
  22357. (S1 ^operator O2101 = 0.4768786732073501)
  22358. =>WM: (14779: S1 ^operator O2104 +)
  22359. =>WM: (14778: S1 ^operator O2103 +)
  22360. =>WM: (14777: I3 ^dir U)
  22361. =>WM: (14776: O2104 ^name predict-no)
  22362. =>WM: (14775: O2103 ^name predict-yes)
  22363. =>WM: (14774: R1055 ^value 1)
  22364. =>WM: (14773: R1 ^reward R1055)
  22365. =>WM: (14772: I3 ^see 1)
  22366. <=WM: (14763: S1 ^operator O2101 +)
  22367. <=WM: (14765: S1 ^operator O2101)
  22368. <=WM: (14764: S1 ^operator O2102 +)
  22369. <=WM: (14762: I3 ^dir L)
  22370. <=WM: (14758: R1 ^reward R1054)
  22371. <=WM: (14744: I3 ^see 0)
  22372. <=WM: (14761: O2102 ^name predict-no)
  22373. <=WM: (14760: O2101 ^name predict-yes)
  22374. <=WM: (14759: R1054 ^value 1)
  22375. --- Inner Elaboration Phase, active level 1 (S1) ---
  22376. Firing prefer*rvt*predict-yes*H0
  22377. -->
  22378. Firing rl*prefer*rvt*predict-yes*H0*5
  22379. -->
  22380. (S1 ^operator O2103 = 0.)
  22381. Firing prefer*rvt*predict-no*H0
  22382. -->
  22383. Firing rl*prefer*rvt*predict-no*H0*6
  22384. -->
  22385. (S1 ^operator O2104 = 0.9999999999999999)
  22386. inner elaboration loop at bottom goal.
  22387. Retracting rl*prefer*rvt*predict-no*H0*6
  22388. -->
  22389. (S1 ^operator O2102 = 0.9999999999999999)
  22390. Retracting rl*prefer*rvt*predict-yes*H0*5
  22391. -->
  22392. (S1 ^operator O2101 = 0.)
  22393. --- END Proposal Phase ---
  22394. --- Decision Phase ---
  22395. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.52312(R,m,v=1,0.980132,0.0196026)
  22396. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272038 0.20484 0.476879 -> 0.272039 0.20484 0.476879(R,m,v=1,1,0)
  22397. =>WM: (14780: S1 ^operator O2104)
  22398. 1052: O: O2104 (predict-no)
  22399. --- END Decision Phase ---
  22400. --- Application Phase ---
  22401. --- Firing Productions (PE) For State At Depth 1 ---
  22402. --- Inner Elaboration Phase, active level 1 (S1) ---
  22403. Firing apply*operator
  22404. -->
  22405. (I3 ^predict-no N1052 + :O )
  22406. Firing apply*operator*complete
  22407. -->
  22408. (I3 ^predict-yes N1051 - :O )
  22409. inner elaboration loop at bottom goal.
  22410. --- Change Working Memory (PE) ---
  22411. =>WM: (14781: I3 ^predict-no N1052)
  22412. <=WM: (14767: N1051 ^status complete)
  22413. <=WM: (14766: I3 ^predict-yes N1051)
  22414. --- Firing Productions (IE) For State At Depth 1 ---
  22415. --- Inner Elaboration Phase, active level 1 (S1) ---
  22416. Firing monitor*world
  22417. -->
  22418. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22419. --- Change Working Memory (IE) ---
  22420. --- END Application Phase ---
  22421. --- Output Phase ---
  22422. ENV: Agent did: predict-no for direction U in state State-A
  22423. In State-A moving U
  22424. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22425. predict error 0
  22426. dir: dir isR
  22427. --- END Output Phase ---
  22428. -/|--- Input Phase ---
  22429. =>WM: (14785: I2 ^dir R)
  22430. =>WM: (14784: I2 ^reward 1)
  22431. =>WM: (14783: I2 ^see 0)
  22432. =>WM: (14782: N1052 ^status complete)
  22433. <=WM: (14770: I2 ^dir U)
  22434. <=WM: (14769: I2 ^reward 1)
  22435. <=WM: (14768: I2 ^see 1)
  22436. =>WM: (14786: I2 ^level-1 L1-root)
  22437. <=WM: (14771: I2 ^level-1 L1-root)
  22438. --- END Input Phase ---
  22439. --- Proposal Phase ---
  22440. --- Inner Elaboration Phase, active level 1 (S1) ---
  22441. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  22442. -->
  22443. (S1 ^operator O2103 = 0.6170493230317728)
  22444. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  22445. -->
  22446. (S1 ^operator O2104 = 0.4901349546100854)
  22447. Firing prefer*rvt*predict-no*H0*4*H1
  22448. -->
  22449. Firing prefer*rvt*predict-yes*H0*3*H1
  22450. -->
  22451. Firing elaborate*copy-see-to-output-link
  22452. -->
  22453. (I3 ^see 0 +)
  22454. Firing elaborate*reward*based*on*reward
  22455. -->
  22456. (R1056 ^value 1 +)
  22457. (R1 ^reward R1056 +)
  22458. Firing propose*predict-yes
  22459. -->
  22460. (O2105 ^name predict-yes +)
  22461. (S1 ^operator O2105 +)
  22462. Firing propose*predict-no
  22463. -->
  22464. (O2106 ^name predict-no +)
  22465. (S1 ^operator O2106 +)
  22466. Firing rl*prefer*rvt*predict-no*H0*4
  22467. -->
  22468. (S1 ^operator O2104 = 0.1269768524690122)
  22469. Firing rl*prefer*rvt*predict-yes*H0*3
  22470. -->
  22471. (S1 ^operator O2103 = 0.3829453488915282)
  22472. Firing prefer*rvt*predict-yes*H0
  22473. -->
  22474. Firing prefer*rvt*predict-no*H0
  22475. -->
  22476. Firing elaborate*copy-dir-to-output-link
  22477. -->
  22478. (I3 ^dir R +)
  22479. inner elaboration loop at bottom goal.
  22480. Retracting elaborate*copy-see-to-output-link
  22481. -->
  22482. (I3 ^see 1 +)
  22483. Retracting propose*predict-no
  22484. -->
  22485. (O2104 ^name predict-no +)
  22486. (S1 ^operator O2104 +)
  22487. Retracting propose*predict-yes
  22488. -->
  22489. (O2103 ^name predict-yes +)
  22490. (S1 ^operator O2103 +)
  22491. Retracting elaborate*reward*based*on*reward
  22492. -->
  22493. (R1055 ^value 1 +)
  22494. (R1 ^reward R1055 +)
  22495. Retracting elaborate*copy-dir-to-output-link
  22496. -->
  22497. (I3 ^dir U +)
  22498. Retracting rl*prefer*rvt*predict-no*H0*6
  22499. -->
  22500. (S1 ^operator O2104 = 0.9999999999999999)
  22501. Retracting rl*prefer*rvt*predict-yes*H0*5
  22502. -->
  22503. (S1 ^operator O2103 = 0.)
  22504. =>WM: (14794: S1 ^operator O2106 +)
  22505. =>WM: (14793: S1 ^operator O2105 +)
  22506. =>WM: (14792: I3 ^dir R)
  22507. =>WM: (14791: O2106 ^name predict-no)
  22508. =>WM: (14790: O2105 ^name predict-yes)
  22509. =>WM: (14789: R1056 ^value 1)
  22510. =>WM: (14788: R1 ^reward R1056)
  22511. =>WM: (14787: I3 ^see 0)
  22512. <=WM: (14778: S1 ^operator O2103 +)
  22513. <=WM: (14779: S1 ^operator O2104 +)
  22514. <=WM: (14780: S1 ^operator O2104)
  22515. <=WM: (14777: I3 ^dir U)
  22516. <=WM: (14773: R1 ^reward R1055)
  22517. <=WM: (14772: I3 ^see 1)
  22518. <=WM: (14776: O2104 ^name predict-no)
  22519. <=WM: (14775: O2103 ^name predict-yes)
  22520. <=WM: (14774: R1055 ^value 1)
  22521. --- Inner Elaboration Phase, active level 1 (S1) ---
  22522. Firing prefer*rvt*predict-yes*H0
  22523. -->
  22524. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  22525. -->
  22526. (S1 ^operator O2105 = 0.6170493230317728)
  22527. Firing rl*prefer*rvt*predict-yes*H0*3
  22528. -->
  22529. (S1 ^operator O2105 = 0.3829453488915282)
  22530. Firing prefer*rvt*predict-yes*H0*3*H1
  22531. -->
  22532. Firing prefer*rvt*predict-no*H0
  22533. -->
  22534. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  22535. -->
  22536. (S1 ^operator O2106 = 0.4901349546100854)
  22537. Firing rl*prefer*rvt*predict-no*H0*4
  22538. -->
  22539. (S1 ^operator O2106 = 0.1269768524690122)
  22540. Firing prefer*rvt*predict-no*H0*4*H1
  22541. -->
  22542. inner elaboration loop at bottom goal.
  22543. Retracting rl*prefer*rvt*predict-no*H0*4
  22544. -->
  22545. (S1 ^operator O2104 = 0.1269768524690122)
  22546. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  22547. -->
  22548. (S1 ^operator O2104 = 0.4901349546100854)
  22549. Retracting rl*prefer*rvt*predict-yes*H0*3
  22550. -->
  22551. (S1 ^operator O2103 = 0.3829453488915282)
  22552. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  22553. -->
  22554. (S1 ^operator O2103 = 0.6170493230317728)
  22555. --- END Proposal Phase ---
  22556. --- Decision Phase ---
  22557. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22558. =>WM: (14795: S1 ^operator O2105)
  22559. 1053: O: O2105 (predict-yes)
  22560. --- END Decision Phase ---
  22561. --- Application Phase ---
  22562. --- Firing Productions (PE) For State At Depth 1 ---
  22563. --- Inner Elaboration Phase, active level 1 (S1) ---
  22564. Firing apply*operator
  22565. -->
  22566. (I3 ^predict-yes N1053 + :O )
  22567. Firing apply*operator*complete
  22568. -->
  22569. (I3 ^predict-no N1052 - :O )
  22570. inner elaboration loop at bottom goal.
  22571. --- Change Working Memory (PE) ---
  22572. =>WM: (14796: I3 ^predict-yes N1053)
  22573. <=WM: (14782: N1052 ^status complete)
  22574. <=WM: (14781: I3 ^predict-no N1052)
  22575. --- Firing Productions (IE) For State At Depth 1 ---
  22576. --- Inner Elaboration Phase, active level 1 (S1) ---
  22577. Firing monitor*world
  22578. -->
  22579. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22580. --- Change Working Memory (IE) ---
  22581. --- END Application Phase ---
  22582. --- Output Phase ---
  22583. ENV: Agent did: predict-yes for direction R in state State-A
  22584. In State-A moving R
  22585. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22586. predict error 0
  22587. dir: dir isU
  22588. --- END Output Phase ---
  22589. \-/|--- Input Phase ---
  22590. =>WM: (14800: I2 ^dir U)
  22591. =>WM: (14799: I2 ^reward 1)
  22592. =>WM: (14798: I2 ^see 1)
  22593. =>WM: (14797: N1053 ^status complete)
  22594. <=WM: (14785: I2 ^dir R)
  22595. <=WM: (14784: I2 ^reward 1)
  22596. <=WM: (14783: I2 ^see 0)
  22597. =>WM: (14801: I2 ^level-1 R1-root)
  22598. <=WM: (14786: I2 ^level-1 L1-root)
  22599. --- END Input Phase ---
  22600. --- Proposal Phase ---
  22601. --- Inner Elaboration Phase, active level 1 (S1) ---
  22602. Firing elaborate*copy-see-to-output-link
  22603. -->
  22604. (I3 ^see 1 +)
  22605. Firing elaborate*reward*based*on*reward
  22606. -->
  22607. (R1057 ^value 1 +)
  22608. (R1 ^reward R1057 +)
  22609. Firing propose*predict-yes
  22610. -->
  22611. (O2107 ^name predict-yes +)
  22612. (S1 ^operator O2107 +)
  22613. Firing propose*predict-no
  22614. -->
  22615. (O2108 ^name predict-no +)
  22616. (S1 ^operator O2108 +)
  22617. Firing rl*prefer*rvt*predict-no*H0*6
  22618. -->
  22619. (S1 ^operator O2106 = 0.9999999999999999)
  22620. Firing rl*prefer*rvt*predict-yes*H0*5
  22621. -->
  22622. (S1 ^operator O2105 = 0.)
  22623. Firing prefer*rvt*predict-yes*H0
  22624. -->
  22625. Firing prefer*rvt*predict-no*H0
  22626. -->
  22627. Firing elaborate*copy-dir-to-output-link
  22628. -->
  22629. (I3 ^dir U +)
  22630. inner elaboration loop at bottom goal.
  22631. Retracting elaborate*copy-see-to-output-link
  22632. -->
  22633. (I3 ^see 0 +)
  22634. Retracting propose*predict-no
  22635. -->
  22636. (O2106 ^name predict-no +)
  22637. (S1 ^operator O2106 +)
  22638. Retracting propose*predict-yes
  22639. -->
  22640. (O2105 ^name predict-yes +)
  22641. (S1 ^operator O2105 +)
  22642. Retracting elaborate*reward*based*on*reward
  22643. -->
  22644. (R1056 ^value 1 +)
  22645. (R1 ^reward R1056 +)
  22646. Retracting elaborate*copy-dir-to-output-link
  22647. -->
  22648. (I3 ^dir R +)
  22649. Retracting rl*prefer*rvt*predict-no*H0*4
  22650. -->
  22651. (S1 ^operator O2106 = 0.1269768524690122)
  22652. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  22653. -->
  22654. (S1 ^operator O2106 = 0.4901349546100854)
  22655. Retracting rl*prefer*rvt*predict-yes*H0*3
  22656. -->
  22657. (S1 ^operator O2105 = 0.3829453488915282)
  22658. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  22659. -->
  22660. (S1 ^operator O2105 = 0.6170493230317728)
  22661. =>WM: (14809: S1 ^operator O2108 +)
  22662. =>WM: (14808: S1 ^operator O2107 +)
  22663. =>WM: (14807: I3 ^dir U)
  22664. =>WM: (14806: O2108 ^name predict-no)
  22665. =>WM: (14805: O2107 ^name predict-yes)
  22666. =>WM: (14804: R1057 ^value 1)
  22667. =>WM: (14803: R1 ^reward R1057)
  22668. =>WM: (14802: I3 ^see 1)
  22669. <=WM: (14793: S1 ^operator O2105 +)
  22670. <=WM: (14795: S1 ^operator O2105)
  22671. <=WM: (14794: S1 ^operator O2106 +)
  22672. <=WM: (14792: I3 ^dir R)
  22673. <=WM: (14788: R1 ^reward R1056)
  22674. <=WM: (14787: I3 ^see 0)
  22675. <=WM: (14791: O2106 ^name predict-no)
  22676. <=WM: (14790: O2105 ^name predict-yes)
  22677. <=WM: (14789: R1056 ^value 1)
  22678. --- Inner Elaboration Phase, active level 1 (S1) ---
  22679. Firing prefer*rvt*predict-yes*H0
  22680. -->
  22681. Firing rl*prefer*rvt*predict-yes*H0*5
  22682. -->
  22683. (S1 ^operator O2107 = 0.)
  22684. Firing prefer*rvt*predict-no*H0
  22685. -->
  22686. Firing rl*prefer*rvt*predict-no*H0*6
  22687. -->
  22688. (S1 ^operator O2108 = 0.9999999999999999)
  22689. inner elaboration loop at bottom goal.
  22690. Retracting rl*prefer*rvt*predict-no*H0*6
  22691. -->
  22692. (S1 ^operator O2106 = 0.9999999999999999)
  22693. Retracting rl*prefer*rvt*predict-yes*H0*5
  22694. -->
  22695. (S1 ^operator O2105 = 0.)
  22696. --- END Proposal Phase ---
  22697. --- Decision Phase ---
  22698. RL update rl*prefer*rvt*predict-yes*H0*3 0.673138 -0.290193 0.382945 -> 0.673139 -0.290193 0.382946(R,m,v=1,0.962963,0.0358868)
  22699. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326857 0.290192 0.617049 -> 0.326858 0.290192 0.61705(R,m,v=1,1,0)
  22700. =>WM: (14810: S1 ^operator O2108)
  22701. 1054: O: O2108 (predict-no)
  22702. --- END Decision Phase ---
  22703. --- Application Phase ---
  22704. --- Firing Productions (PE) For State At Depth 1 ---
  22705. --- Inner Elaboration Phase, active level 1 (S1) ---
  22706. Firing apply*operator
  22707. -->
  22708. (I3 ^predict-no N1054 + :O )
  22709. Firing apply*operator*complete
  22710. -->
  22711. (I3 ^predict-yes N1053 - :O )
  22712. inner elaboration loop at bottom goal.
  22713. --- Change Working Memory (PE) ---
  22714. =>WM: (14811: I3 ^predict-no N1054)
  22715. <=WM: (14797: N1053 ^status complete)
  22716. <=WM: (14796: I3 ^predict-yes N1053)
  22717. --- Firing Productions (IE) For State At Depth 1 ---
  22718. --- Inner Elaboration Phase, active level 1 (S1) ---
  22719. Firing monitor*world
  22720. -->
  22721. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22722. --- Change Working Memory (IE) ---
  22723. --- END Application Phase ---
  22724. --- Output Phase ---
  22725. ENV: Agent did: predict-no for direction U in state State-B
  22726. In State-B moving U
  22727. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22728. predict error 0
  22729. dir: dir isR
  22730. --- END Output Phase ---
  22731. \---- Input Phase ---
  22732. =>WM: (14815: I2 ^dir R)
  22733. =>WM: (14814: I2 ^reward 1)
  22734. =>WM: (14813: I2 ^see 0)
  22735. =>WM: (14812: N1054 ^status complete)
  22736. <=WM: (14800: I2 ^dir U)
  22737. <=WM: (14799: I2 ^reward 1)
  22738. <=WM: (14798: I2 ^see 1)
  22739. =>WM: (14816: I2 ^level-1 R1-root)
  22740. <=WM: (14801: I2 ^level-1 R1-root)
  22741. --- END Input Phase ---
  22742. --- Proposal Phase ---
  22743. --- Inner Elaboration Phase, active level 1 (S1) ---
  22744. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  22745. -->
  22746. (S1 ^operator O2107 = 0.08783148430849691)
  22747. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  22748. -->
  22749. (S1 ^operator O2108 = 0.8730232725957683)
  22750. Firing prefer*rvt*predict-no*H0*4*H1
  22751. -->
  22752. Firing prefer*rvt*predict-yes*H0*3*H1
  22753. -->
  22754. Firing elaborate*copy-see-to-output-link
  22755. -->
  22756. (I3 ^see 0 +)
  22757. Firing elaborate*reward*based*on*reward
  22758. -->
  22759. (R1058 ^value 1 +)
  22760. (R1 ^reward R1058 +)
  22761. Firing propose*predict-yes
  22762. -->
  22763. (O2109 ^name predict-yes +)
  22764. (S1 ^operator O2109 +)
  22765. Firing propose*predict-no
  22766. -->
  22767. (O2110 ^name predict-no +)
  22768. (S1 ^operator O2110 +)
  22769. Firing rl*prefer*rvt*predict-no*H0*4
  22770. -->
  22771. (S1 ^operator O2108 = 0.1269768524690122)
  22772. Firing rl*prefer*rvt*predict-yes*H0*3
  22773. -->
  22774. (S1 ^operator O2107 = 0.382946148103033)
  22775. Firing prefer*rvt*predict-yes*H0
  22776. -->
  22777. Firing prefer*rvt*predict-no*H0
  22778. -->
  22779. Firing elaborate*copy-dir-to-output-link
  22780. -->
  22781. (I3 ^dir R +)
  22782. inner elaboration loop at bottom goal.
  22783. Retracting elaborate*copy-see-to-output-link
  22784. -->
  22785. (I3 ^see 1 +)
  22786. Retracting propose*predict-no
  22787. -->
  22788. (O2108 ^name predict-no +)
  22789. (S1 ^operator O2108 +)
  22790. Retracting propose*predict-yes
  22791. -->
  22792. (O2107 ^name predict-yes +)
  22793. (S1 ^operator O2107 +)
  22794. Retracting elaborate*reward*based*on*reward
  22795. -->
  22796. (R1057 ^value 1 +)
  22797. (R1 ^reward R1057 +)
  22798. Retracting elaborate*copy-dir-to-output-link
  22799. -->
  22800. (I3 ^dir U +)
  22801. Retracting rl*prefer*rvt*predict-no*H0*6
  22802. -->
  22803. (S1 ^operator O2108 = 0.9999999999999999)
  22804. Retracting rl*prefer*rvt*predict-yes*H0*5
  22805. -->
  22806. (S1 ^operator O2107 = 0.)
  22807. =>WM: (14824: S1 ^operator O2110 +)
  22808. =>WM: (14823: S1 ^operator O2109 +)
  22809. =>WM: (14822: I3 ^dir R)
  22810. =>WM: (14821: O2110 ^name predict-no)
  22811. =>WM: (14820: O2109 ^name predict-yes)
  22812. =>WM: (14819: R1058 ^value 1)
  22813. =>WM: (14818: R1 ^reward R1058)
  22814. =>WM: (14817: I3 ^see 0)
  22815. <=WM: (14808: S1 ^operator O2107 +)
  22816. <=WM: (14809: S1 ^operator O2108 +)
  22817. <=WM: (14810: S1 ^operator O2108)
  22818. <=WM: (14807: I3 ^dir U)
  22819. <=WM: (14803: R1 ^reward R1057)
  22820. <=WM: (14802: I3 ^see 1)
  22821. <=WM: (14806: O2108 ^name predict-no)
  22822. <=WM: (14805: O2107 ^name predict-yes)
  22823. <=WM: (14804: R1057 ^value 1)
  22824. --- Inner Elaboration Phase, active level 1 (S1) ---
  22825. Firing prefer*rvt*predict-yes*H0
  22826. -->
  22827. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  22828. -->
  22829. (S1 ^operator O2109 = 0.08783148430849691)
  22830. Firing rl*prefer*rvt*predict-yes*H0*3
  22831. -->
  22832. (S1 ^operator O2109 = 0.382946148103033)
  22833. Firing prefer*rvt*predict-yes*H0*3*H1
  22834. -->
  22835. Firing prefer*rvt*predict-no*H0
  22836. -->
  22837. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  22838. -->
  22839. (S1 ^operator O2110 = 0.8730232725957683)
  22840. Firing rl*prefer*rvt*predict-no*H0*4
  22841. -->
  22842. (S1 ^operator O2110 = 0.1269768524690122)
  22843. Firing prefer*rvt*predict-no*H0*4*H1
  22844. -->
  22845. inner elaboration loop at bottom goal.
  22846. Retracting rl*prefer*rvt*predict-no*H0*4
  22847. -->
  22848. (S1 ^operator O2108 = 0.1269768524690122)
  22849. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  22850. -->
  22851. (S1 ^operator O2108 = 0.8730232725957683)
  22852. Retracting rl*prefer*rvt*predict-yes*H0*3
  22853. -->
  22854. (S1 ^operator O2107 = 0.382946148103033)
  22855. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  22856. -->
  22857. (S1 ^operator O2107 = 0.08783148430849691)
  22858. --- END Proposal Phase ---
  22859. --- Decision Phase ---
  22860. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22861. =>WM: (14825: S1 ^operator O2110)
  22862. 1055: O: O2110 (predict-no)
  22863. --- END Decision Phase ---
  22864. --- Application Phase ---
  22865. --- Firing Productions (PE) For State At Depth 1 ---
  22866. --- Inner Elaboration Phase, active level 1 (S1) ---
  22867. Firing apply*operator
  22868. -->
  22869. (I3 ^predict-no N1055 + :O )
  22870. Firing apply*operator*complete
  22871. -->
  22872. (I3 ^predict-no N1054 - :O )
  22873. inner elaboration loop at bottom goal.
  22874. --- Change Working Memory (PE) ---
  22875. =>WM: (14826: I3 ^predict-no N1055)
  22876. <=WM: (14812: N1054 ^status complete)
  22877. <=WM: (14811: I3 ^predict-no N1054)
  22878. --- Firing Productions (IE) For State At Depth 1 ---
  22879. --- Inner Elaboration Phase, active level 1 (S1) ---
  22880. Firing monitor*world
  22881. -->
  22882. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22883. --- Change Working Memory (IE) ---
  22884. --- END Application Phase ---
  22885. --- Output Phase ---
  22886. ENV: Agent did: predict-no for direction R in state State-B
  22887. In State-B moving R
  22888. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22889. predict error 0
  22890. dir: dir isL
  22891. --- END Output Phase ---
  22892. /|\--- Input Phase ---
  22893. =>WM: (14830: I2 ^dir L)
  22894. =>WM: (14829: I2 ^reward 1)
  22895. =>WM: (14828: I2 ^see 0)
  22896. =>WM: (14827: N1055 ^status complete)
  22897. <=WM: (14815: I2 ^dir R)
  22898. <=WM: (14814: I2 ^reward 1)
  22899. <=WM: (14813: I2 ^see 0)
  22900. =>WM: (14831: I2 ^level-1 R0-root)
  22901. <=WM: (14816: I2 ^level-1 R1-root)
  22902. --- END Input Phase ---
  22903. --- Proposal Phase ---
  22904. --- Inner Elaboration Phase, active level 1 (S1) ---
  22905. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  22906. -->
  22907. (S1 ^operator O2109 = 0.4768813258528501)
  22908. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  22909. -->
  22910. (S1 ^operator O2110 = 0.1700769046561409)
  22911. Firing prefer*rvt*predict-no*H0*2*H1
  22912. -->
  22913. Firing prefer*rvt*predict-yes*H0*1*H1
  22914. -->
  22915. Firing elaborate*copy-see-to-output-link
  22916. -->
  22917. (I3 ^see 0 +)
  22918. Firing elaborate*reward*based*on*reward
  22919. -->
  22920. (R1059 ^value 1 +)
  22921. (R1 ^reward R1059 +)
  22922. Firing propose*predict-yes
  22923. -->
  22924. (O2111 ^name predict-yes +)
  22925. (S1 ^operator O2111 +)
  22926. Firing propose*predict-no
  22927. -->
  22928. (O2112 ^name predict-no +)
  22929. (S1 ^operator O2112 +)
  22930. Firing rl*prefer*rvt*predict-no*H0*2
  22931. -->
  22932. (S1 ^operator O2110 = 0.2550133912230119)
  22933. Firing rl*prefer*rvt*predict-yes*H0*1
  22934. -->
  22935. (S1 ^operator O2109 = 0.5231195976046303)
  22936. Firing prefer*rvt*predict-yes*H0
  22937. -->
  22938. Firing prefer*rvt*predict-no*H0
  22939. -->
  22940. Firing elaborate*copy-dir-to-output-link
  22941. -->
  22942. (I3 ^dir L +)
  22943. inner elaboration loop at bottom goal.
  22944. Retracting elaborate*copy-see-to-output-link
  22945. -->
  22946. (I3 ^see 0 +)
  22947. Retracting propose*predict-no
  22948. -->
  22949. (O2110 ^name predict-no +)
  22950. (S1 ^operator O2110 +)
  22951. Retracting propose*predict-yes
  22952. -->
  22953. (O2109 ^name predict-yes +)
  22954. (S1 ^operator O2109 +)
  22955. Retracting elaborate*reward*based*on*reward
  22956. -->
  22957. (R1058 ^value 1 +)
  22958. (R1 ^reward R1058 +)
  22959. Retracting elaborate*copy-dir-to-output-link
  22960. -->
  22961. (I3 ^dir R +)
  22962. Retracting rl*prefer*rvt*predict-no*H0*4
  22963. -->
  22964. (S1 ^operator O2110 = 0.1269768524690122)
  22965. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  22966. -->
  22967. (S1 ^operator O2110 = 0.8730232725957683)
  22968. Retracting rl*prefer*rvt*predict-yes*H0*3
  22969. -->
  22970. (S1 ^operator O2109 = 0.382946148103033)
  22971. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  22972. -->
  22973. (S1 ^operator O2109 = 0.08783148430849691)
  22974. =>WM: (14838: S1 ^operator O2112 +)
  22975. =>WM: (14837: S1 ^operator O2111 +)
  22976. =>WM: (14836: I3 ^dir L)
  22977. =>WM: (14835: O2112 ^name predict-no)
  22978. =>WM: (14834: O2111 ^name predict-yes)
  22979. =>WM: (14833: R1059 ^value 1)
  22980. =>WM: (14832: R1 ^reward R1059)
  22981. <=WM: (14823: S1 ^operator O2109 +)
  22982. <=WM: (14824: S1 ^operator O2110 +)
  22983. <=WM: (14825: S1 ^operator O2110)
  22984. <=WM: (14822: I3 ^dir R)
  22985. <=WM: (14818: R1 ^reward R1058)
  22986. <=WM: (14821: O2110 ^name predict-no)
  22987. <=WM: (14820: O2109 ^name predict-yes)
  22988. <=WM: (14819: R1058 ^value 1)
  22989. --- Inner Elaboration Phase, active level 1 (S1) ---
  22990. Firing prefer*rvt*predict-yes*H0
  22991. -->
  22992. Firing rl*prefer*rvt*predict-yes*H0*1
  22993. -->
  22994. (S1 ^operator O2111 = 0.5231195976046303)
  22995. Firing prefer*rvt*predict-yes*H0*1*H1
  22996. -->
  22997. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  22998. -->
  22999. (S1 ^operator O2111 = 0.4768813258528501)
  23000. Firing prefer*rvt*predict-no*H0
  23001. -->
  23002. Firing rl*prefer*rvt*predict-no*H0*2
  23003. -->
  23004. (S1 ^operator O2112 = 0.2550133912230119)
  23005. Firing prefer*rvt*predict-no*H0*2*H1
  23006. -->
  23007. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  23008. -->
  23009. (S1 ^operator O2112 = 0.1700769046561409)
  23010. inner elaboration loop at bottom goal.
  23011. Retracting rl*prefer*rvt*predict-no*H0*2
  23012. -->
  23013. (S1 ^operator O2110 = 0.2550133912230119)
  23014. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  23015. -->
  23016. (S1 ^operator O2110 = 0.1700769046561409)
  23017. Retracting rl*prefer*rvt*predict-yes*H0*1
  23018. -->
  23019. (S1 ^operator O2109 = 0.5231195976046303)
  23020. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  23021. -->
  23022. (S1 ^operator O2109 = 0.4768813258528501)
  23023. --- END Proposal Phase ---
  23024. --- Decision Phase ---
  23025. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.952381,0.0455927)
  23026. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  23027. =>WM: (14839: S1 ^operator O2111)
  23028. 1056: O: O2111 (predict-yes)
  23029. --- END Decision Phase ---
  23030. --- Application Phase ---
  23031. --- Firing Productions (PE) For State At Depth 1 ---
  23032. --- Inner Elaboration Phase, active level 1 (S1) ---
  23033. Firing apply*operator
  23034. -->
  23035. (I3 ^predict-yes N1056 + :O )
  23036. Firing apply*operator*complete
  23037. -->
  23038. (I3 ^predict-no N1055 - :O )
  23039. inner elaboration loop at bottom goal.
  23040. --- Change Working Memory (PE) ---
  23041. =>WM: (14840: I3 ^predict-yes N1056)
  23042. <=WM: (14827: N1055 ^status complete)
  23043. <=WM: (14826: I3 ^predict-no N1055)
  23044. --- Firing Productions (IE) For State At Depth 1 ---
  23045. --- Inner Elaboration Phase, active level 1 (S1) ---
  23046. Firing monitor*world
  23047. -->
  23048. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23049. --- Change Working Memory (IE) ---
  23050. --- END Application Phase ---
  23051. --- Output Phase ---
  23052. ENV: Agent did: predict-yes for direction L in state State-B
  23053. In State-B moving L
  23054. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  23055. predict error 0
  23056. dir: dir isR
  23057. --- END Output Phase ---
  23058. -/|--- Input Phase ---
  23059. =>WM: (14844: I2 ^dir R)
  23060. =>WM: (14843: I2 ^reward 1)
  23061. =>WM: (14842: I2 ^see 1)
  23062. =>WM: (14841: N1056 ^status complete)
  23063. <=WM: (14830: I2 ^dir L)
  23064. <=WM: (14829: I2 ^reward 1)
  23065. <=WM: (14828: I2 ^see 0)
  23066. =>WM: (14845: I2 ^level-1 L1-root)
  23067. <=WM: (14831: I2 ^level-1 R0-root)
  23068. --- END Input Phase ---
  23069. --- Proposal Phase ---
  23070. --- Inner Elaboration Phase, active level 1 (S1) ---
  23071. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  23072. -->
  23073. (S1 ^operator O2111 = 0.6170501222432777)
  23074. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  23075. -->
  23076. (S1 ^operator O2112 = 0.4901349546100854)
  23077. Firing prefer*rvt*predict-no*H0*4*H1
  23078. -->
  23079. Firing prefer*rvt*predict-yes*H0*3*H1
  23080. -->
  23081. Firing elaborate*copy-see-to-output-link
  23082. -->
  23083. (I3 ^see 1 +)
  23084. Firing elaborate*reward*based*on*reward
  23085. -->
  23086. (R1060 ^value 1 +)
  23087. (R1 ^reward R1060 +)
  23088. Firing propose*predict-yes
  23089. -->
  23090. (O2113 ^name predict-yes +)
  23091. (S1 ^operator O2113 +)
  23092. Firing propose*predict-no
  23093. -->
  23094. (O2114 ^name predict-no +)
  23095. (S1 ^operator O2114 +)
  23096. Firing rl*prefer*rvt*predict-no*H0*4
  23097. -->
  23098. (S1 ^operator O2112 = 0.1269768337092951)
  23099. Firing rl*prefer*rvt*predict-yes*H0*3
  23100. -->
  23101. (S1 ^operator O2111 = 0.382946148103033)
  23102. Firing prefer*rvt*predict-yes*H0
  23103. -->
  23104. Firing prefer*rvt*predict-no*H0
  23105. -->
  23106. Firing elaborate*copy-dir-to-output-link
  23107. -->
  23108. (I3 ^dir R +)
  23109. inner elaboration loop at bottom goal.
  23110. Retracting elaborate*copy-see-to-output-link
  23111. -->
  23112. (I3 ^see 0 +)
  23113. Retracting propose*predict-no
  23114. -->
  23115. (O2112 ^name predict-no +)
  23116. (S1 ^operator O2112 +)
  23117. Retracting propose*predict-yes
  23118. -->
  23119. (O2111 ^name predict-yes +)
  23120. (S1 ^operator O2111 +)
  23121. Retracting elaborate*reward*based*on*reward
  23122. -->
  23123. (R1059 ^value 1 +)
  23124. (R1 ^reward R1059 +)
  23125. Retracting elaborate*copy-dir-to-output-link
  23126. -->
  23127. (I3 ^dir L +)
  23128. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  23129. -->
  23130. (S1 ^operator O2112 = 0.1700769046561409)
  23131. Retracting rl*prefer*rvt*predict-no*H0*2
  23132. -->
  23133. (S1 ^operator O2112 = 0.2550133912230119)
  23134. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  23135. -->
  23136. (S1 ^operator O2111 = 0.4768813258528501)
  23137. Retracting rl*prefer*rvt*predict-yes*H0*1
  23138. -->
  23139. (S1 ^operator O2111 = 0.5231195976046303)
  23140. =>WM: (14853: S1 ^operator O2114 +)
  23141. =>WM: (14852: S1 ^operator O2113 +)
  23142. =>WM: (14851: I3 ^dir R)
  23143. =>WM: (14850: O2114 ^name predict-no)
  23144. =>WM: (14849: O2113 ^name predict-yes)
  23145. =>WM: (14848: R1060 ^value 1)
  23146. =>WM: (14847: R1 ^reward R1060)
  23147. =>WM: (14846: I3 ^see 1)
  23148. <=WM: (14837: S1 ^operator O2111 +)
  23149. <=WM: (14839: S1 ^operator O2111)
  23150. <=WM: (14838: S1 ^operator O2112 +)
  23151. <=WM: (14836: I3 ^dir L)
  23152. <=WM: (14832: R1 ^reward R1059)
  23153. <=WM: (14817: I3 ^see 0)
  23154. <=WM: (14835: O2112 ^name predict-no)
  23155. <=WM: (14834: O2111 ^name predict-yes)
  23156. <=WM: (14833: R1059 ^value 1)
  23157. --- Inner Elaboration Phase, active level 1 (S1) ---
  23158. Firing prefer*rvt*predict-yes*H0
  23159. -->
  23160. Firing rl*prefer*rvt*predict-yes*H0*3
  23161. -->
  23162. (S1 ^operator O2113 = 0.382946148103033)
  23163. Firing prefer*rvt*predict-yes*H0*3*H1
  23164. -->
  23165. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  23166. -->
  23167. (S1 ^operator O2113 = 0.6170501222432777)
  23168. Firing prefer*rvt*predict-no*H0
  23169. -->
  23170. Firing rl*prefer*rvt*predict-no*H0*4
  23171. -->
  23172. (S1 ^operator O2114 = 0.1269768337092951)
  23173. Firing prefer*rvt*predict-no*H0*4*H1
  23174. -->
  23175. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  23176. -->
  23177. (S1 ^operator O2114 = 0.4901349546100854)
  23178. inner elaboration loop at bottom goal.
  23179. Retracting rl*prefer*rvt*predict-no*H0*4
  23180. -->
  23181. (S1 ^operator O2112 = 0.1269768337092951)
  23182. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  23183. -->
  23184. (S1 ^operator O2112 = 0.4901349546100854)
  23185. Retracting rl*prefer*rvt*predict-yes*H0*3
  23186. -->
  23187. (S1 ^operator O2111 = 0.382946148103033)
  23188. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  23189. -->
  23190. (S1 ^operator O2111 = 0.6170501222432777)
  23191. --- END Proposal Phase ---
  23192. --- Decision Phase ---
  23193. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.52312 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980263,0.0194754)
  23194. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272042 0.204839 0.476881 -> 0.272042 0.20484 0.476881(R,m,v=1,1,0)
  23195. =>WM: (14854: S1 ^operator O2113)
  23196. 1057: O: O2113 (predict-yes)
  23197. --- END Decision Phase ---
  23198. --- Application Phase ---
  23199. --- Firing Productions (PE) For State At Depth 1 ---
  23200. --- Inner Elaboration Phase, active level 1 (S1) ---
  23201. Firing apply*operator
  23202. -->
  23203. (I3 ^predict-yes N1057 + :O )
  23204. Firing apply*operator*complete
  23205. -->
  23206. (I3 ^predict-yes N1056 - :O )
  23207. inner elaboration loop at bottom goal.
  23208. --- Change Working Memory (PE) ---
  23209. =>WM: (14855: I3 ^predict-yes N1057)
  23210. <=WM: (14841: N1056 ^status complete)
  23211. <=WM: (14840: I3 ^predict-yes N1056)
  23212. --- Firing Productions (IE) For State At Depth 1 ---
  23213. --- Inner Elaboration Phase, active level 1 (S1) ---
  23214. Firing monitor*world
  23215. -->
  23216. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23217. --- Change Working Memory (IE) ---
  23218. --- END Application Phase ---
  23219. --- Output Phase ---
  23220. ENV: Agent did: predict-yes for direction R in state State-A
  23221. In State-A moving R
  23222. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  23223. predict error 0
  23224. dir: dir isU
  23225. --- END Output Phase ---
  23226. \---- Input Phase ---
  23227. =>WM: (14859: I2 ^dir U)
  23228. =>WM: (14858: I2 ^reward 1)
  23229. =>WM: (14857: I2 ^see 1)
  23230. =>WM: (14856: N1057 ^status complete)
  23231. <=WM: (14844: I2 ^dir R)
  23232. <=WM: (14843: I2 ^reward 1)
  23233. <=WM: (14842: I2 ^see 1)
  23234. =>WM: (14860: I2 ^level-1 R1-root)
  23235. <=WM: (14845: I2 ^level-1 L1-root)
  23236. --- END Input Phase ---
  23237. --- Proposal Phase ---
  23238. --- Inner Elaboration Phase, active level 1 (S1) ---
  23239. Firing elaborate*copy-see-to-output-link
  23240. -->
  23241. (I3 ^see 1 +)
  23242. Firing elaborate*reward*based*on*reward
  23243. -->
  23244. (R1061 ^value 1 +)
  23245. (R1 ^reward R1061 +)
  23246. Firing propose*predict-yes
  23247. -->
  23248. (O2115 ^name predict-yes +)
  23249. (S1 ^operator O2115 +)
  23250. Firing propose*predict-no
  23251. -->
  23252. (O2116 ^name predict-no +)
  23253. (S1 ^operator O2116 +)
  23254. Firing rl*prefer*rvt*predict-no*H0*6
  23255. -->
  23256. (S1 ^operator O2114 = 0.9999999999999999)
  23257. Firing rl*prefer*rvt*predict-yes*H0*5
  23258. -->
  23259. (S1 ^operator O2113 = 0.)
  23260. Firing prefer*rvt*predict-yes*H0
  23261. -->
  23262. Firing prefer*rvt*predict-no*H0
  23263. -->
  23264. Firing elaborate*copy-dir-to-output-link
  23265. -->
  23266. (I3 ^dir U +)
  23267. inner elaboration loop at bottom goal.
  23268. Retracting elaborate*copy-see-to-output-link
  23269. -->
  23270. (I3 ^see 1 +)
  23271. Retracting propose*predict-no
  23272. -->
  23273. (O2114 ^name predict-no +)
  23274. (S1 ^operator O2114 +)
  23275. Retracting propose*predict-yes
  23276. -->
  23277. (O2113 ^name predict-yes +)
  23278. (S1 ^operator O2113 +)
  23279. Retracting elaborate*reward*based*on*reward
  23280. -->
  23281. (R1060 ^value 1 +)
  23282. (R1 ^reward R1060 +)
  23283. Retracting elaborate*copy-dir-to-output-link
  23284. -->
  23285. (I3 ^dir R +)
  23286. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  23287. -->
  23288. (S1 ^operator O2114 = 0.4901349546100854)
  23289. Retracting rl*prefer*rvt*predict-no*H0*4
  23290. -->
  23291. (S1 ^operator O2114 = 0.1269768337092951)
  23292. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  23293. -->
  23294. (S1 ^operator O2113 = 0.6170501222432777)
  23295. Retracting rl*prefer*rvt*predict-yes*H0*3
  23296. -->
  23297. (S1 ^operator O2113 = 0.382946148103033)
  23298. =>WM: (14867: S1 ^operator O2116 +)
  23299. =>WM: (14866: S1 ^operator O2115 +)
  23300. =>WM: (14865: I3 ^dir U)
  23301. =>WM: (14864: O2116 ^name predict-no)
  23302. =>WM: (14863: O2115 ^name predict-yes)
  23303. =>WM: (14862: R1061 ^value 1)
  23304. =>WM: (14861: R1 ^reward R1061)
  23305. <=WM: (14852: S1 ^operator O2113 +)
  23306. <=WM: (14854: S1 ^operator O2113)
  23307. <=WM: (14853: S1 ^operator O2114 +)
  23308. <=WM: (14851: I3 ^dir R)
  23309. <=WM: (14847: R1 ^reward R1060)
  23310. <=WM: (14850: O2114 ^name predict-no)
  23311. <=WM: (14849: O2113 ^name predict-yes)
  23312. <=WM: (14848: R1060 ^value 1)
  23313. --- Inner Elaboration Phase, active level 1 (S1) ---
  23314. Firing prefer*rvt*predict-yes*H0
  23315. -->
  23316. Firing rl*prefer*rvt*predict-yes*H0*5
  23317. -->
  23318. (S1 ^operator O2115 = 0.)
  23319. Firing prefer*rvt*predict-no*H0
  23320. -->
  23321. Firing rl*prefer*rvt*predict-no*H0*6
  23322. -->
  23323. (S1 ^operator O2116 = 0.9999999999999999)
  23324. inner elaboration loop at bottom goal.
  23325. Retracting rl*prefer*rvt*predict-no*H0*6
  23326. -->
  23327. (S1 ^operator O2114 = 0.9999999999999999)
  23328. Retracting rl*prefer*rvt*predict-yes*H0*5
  23329. -->
  23330. (S1 ^operator O2113 = 0.)
  23331. --- END Proposal Phase ---
  23332. --- Decision Phase ---
  23333. RL update rl*prefer*rvt*predict-yes*H0*3 0.673139 -0.290193 0.382946 -> 0.673139 -0.290193 0.382947(R,m,v=1,0.96319,0.0356737)
  23334. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326858 0.290192 0.61705 -> 0.326858 0.290192 0.617051(R,m,v=1,1,0)
  23335. =>WM: (14868: S1 ^operator O2116)
  23336. 1058: O: O2116 (predict-no)
  23337. --- END Decision Phase ---
  23338. --- Application Phase ---
  23339. --- Firing Productions (PE) For State At Depth 1 ---
  23340. --- Inner Elaboration Phase, active level 1 (S1) ---
  23341. Firing apply*operator
  23342. -->
  23343. (I3 ^predict-no N1058 + :O )
  23344. Firing apply*operator*complete
  23345. -->
  23346. (I3 ^predict-yes N1057 - :O )
  23347. inner elaboration loop at bottom goal.
  23348. --- Change Working Memory (PE) ---
  23349. =>WM: (14869: I3 ^predict-no N1058)
  23350. <=WM: (14856: N1057 ^status complete)
  23351. <=WM: (14855: I3 ^predict-yes N1057)
  23352. --- Firing Productions (IE) For State At Depth 1 ---
  23353. --- Inner Elaboration Phase, active level 1 (S1) ---
  23354. Firing monitor*world
  23355. -->
  23356. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23357. --- Change Working Memory (IE) ---
  23358. --- END Application Phase ---
  23359. --- Output Phase ---
  23360. ENV: Agent did: predict-no for direction U in state State-B
  23361. In State-B moving U
  23362. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23363. predict error 0
  23364. dir: dir isR
  23365. --- END Output Phase ---
  23366. /|\--- Input Phase ---
  23367. =>WM: (14873: I2 ^dir R)
  23368. =>WM: (14872: I2 ^reward 1)
  23369. =>WM: (14871: I2 ^see 0)
  23370. =>WM: (14870: N1058 ^status complete)
  23371. <=WM: (14859: I2 ^dir U)
  23372. <=WM: (14858: I2 ^reward 1)
  23373. <=WM: (14857: I2 ^see 1)
  23374. =>WM: (14874: I2 ^level-1 R1-root)
  23375. <=WM: (14860: I2 ^level-1 R1-root)
  23376. --- END Input Phase ---
  23377. --- Proposal Phase ---
  23378. --- Inner Elaboration Phase, active level 1 (S1) ---
  23379. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  23380. -->
  23381. (S1 ^operator O2115 = 0.08783148430849691)
  23382. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  23383. -->
  23384. (S1 ^operator O2116 = 0.8730232538360513)
  23385. Firing prefer*rvt*predict-no*H0*4*H1
  23386. -->
  23387. Firing prefer*rvt*predict-yes*H0*3*H1
  23388. -->
  23389. Firing elaborate*copy-see-to-output-link
  23390. -->
  23391. (I3 ^see 0 +)
  23392. Firing elaborate*reward*based*on*reward
  23393. -->
  23394. (R1062 ^value 1 +)
  23395. (R1 ^reward R1062 +)
  23396. Firing propose*predict-yes
  23397. -->
  23398. (O2117 ^name predict-yes +)
  23399. (S1 ^operator O2117 +)
  23400. Firing propose*predict-no
  23401. -->
  23402. (O2118 ^name predict-no +)
  23403. (S1 ^operator O2118 +)
  23404. Firing rl*prefer*rvt*predict-no*H0*4
  23405. -->
  23406. (S1 ^operator O2116 = 0.1269768337092951)
  23407. Firing rl*prefer*rvt*predict-yes*H0*3
  23408. -->
  23409. (S1 ^operator O2115 = 0.3829467075510865)
  23410. Firing prefer*rvt*predict-yes*H0
  23411. -->
  23412. Firing prefer*rvt*predict-no*H0
  23413. -->
  23414. Firing elaborate*copy-dir-to-output-link
  23415. -->
  23416. (I3 ^dir R +)
  23417. inner elaboration loop at bottom goal.
  23418. Retracting elaborate*copy-see-to-output-link
  23419. -->
  23420. (I3 ^see 1 +)
  23421. Retracting propose*predict-no
  23422. -->
  23423. (O2116 ^name predict-no +)
  23424. (S1 ^operator O2116 +)
  23425. Retracting propose*predict-yes
  23426. -->
  23427. (O2115 ^name predict-yes +)
  23428. (S1 ^operator O2115 +)
  23429. Retracting elaborate*reward*based*on*reward
  23430. -->
  23431. (R1061 ^value 1 +)
  23432. (R1 ^reward R1061 +)
  23433. Retracting elaborate*copy-dir-to-output-link
  23434. -->
  23435. (I3 ^dir U +)
  23436. Retracting rl*prefer*rvt*predict-no*H0*6
  23437. -->
  23438. (S1 ^operator O2116 = 0.9999999999999999)
  23439. Retracting rl*prefer*rvt*predict-yes*H0*5
  23440. -->
  23441. (S1 ^operator O2115 = 0.)
  23442. =>WM: (14882: S1 ^operator O2118 +)
  23443. =>WM: (14881: S1 ^operator O2117 +)
  23444. =>WM: (14880: I3 ^dir R)
  23445. =>WM: (14879: O2118 ^name predict-no)
  23446. =>WM: (14878: O2117 ^name predict-yes)
  23447. =>WM: (14877: R1062 ^value 1)
  23448. =>WM: (14876: R1 ^reward R1062)
  23449. =>WM: (14875: I3 ^see 0)
  23450. <=WM: (14866: S1 ^operator O2115 +)
  23451. <=WM: (14867: S1 ^operator O2116 +)
  23452. <=WM: (14868: S1 ^operator O2116)
  23453. <=WM: (14865: I3 ^dir U)
  23454. <=WM: (14861: R1 ^reward R1061)
  23455. <=WM: (14846: I3 ^see 1)
  23456. <=WM: (14864: O2116 ^name predict-no)
  23457. <=WM: (14863: O2115 ^name predict-yes)
  23458. <=WM: (14862: R1061 ^value 1)
  23459. --- Inner Elaboration Phase, active level 1 (S1) ---
  23460. Firing prefer*rvt*predict-yes*H0
  23461. -->
  23462. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  23463. -->
  23464. (S1 ^operator O2117 = 0.08783148430849691)
  23465. Firing rl*prefer*rvt*predict-yes*H0*3
  23466. -->
  23467. (S1 ^operator O2117 = 0.3829467075510865)
  23468. Firing prefer*rvt*predict-yes*H0*3*H1
  23469. -->
  23470. Firing prefer*rvt*predict-no*H0
  23471. -->
  23472. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  23473. -->
  23474. (S1 ^operator O2118 = 0.8730232538360513)
  23475. Firing rl*prefer*rvt*predict-no*H0*4
  23476. -->
  23477. (S1 ^operator O2118 = 0.1269768337092951)
  23478. Firing prefer*rvt*predict-no*H0*4*H1
  23479. -->
  23480. inner elaboration loop at bottom goal.
  23481. Retracting rl*prefer*rvt*predict-no*H0*4
  23482. -->
  23483. (S1 ^operator O2116 = 0.1269768337092951)
  23484. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  23485. -->
  23486. (S1 ^operator O2116 = 0.8730232538360513)
  23487. Retracting rl*prefer*rvt*predict-yes*H0*3
  23488. -->
  23489. (S1 ^operator O2115 = 0.3829467075510865)
  23490. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  23491. -->
  23492. (S1 ^operator O2115 = 0.08783148430849691)
  23493. --- END Proposal Phase ---
  23494. --- Decision Phase ---
  23495. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23496. =>WM: (14883: S1 ^operator O2118)
  23497. 1059: O: O2118 (predict-no)
  23498. --- END Decision Phase ---
  23499. --- Application Phase ---
  23500. --- Firing Productions (PE) For State At Depth 1 ---
  23501. --- Inner Elaboration Phase, active level 1 (S1) ---
  23502. Firing apply*operator
  23503. -->
  23504. (I3 ^predict-no N1059 + :O )
  23505. Firing apply*operator*complete
  23506. -->
  23507. (I3 ^predict-no N1058 - :O )
  23508. inner elaboration loop at bottom goal.
  23509. --- Change Working Memory (PE) ---
  23510. =>WM: (14884: I3 ^predict-no N1059)
  23511. <=WM: (14870: N1058 ^status complete)
  23512. <=WM: (14869: I3 ^predict-no N1058)
  23513. --- Firing Productions (IE) For State At Depth 1 ---
  23514. --- Inner Elaboration Phase, active level 1 (S1) ---
  23515. Firing monitor*world
  23516. -->
  23517. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23518. --- Change Working Memory (IE) ---
  23519. --- END Application Phase ---
  23520. --- Output Phase ---
  23521. ENV: Agent did: predict-no for direction R in state State-B
  23522. In State-B moving R
  23523. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23524. predict error 0
  23525. dir: dir isU
  23526. --- END Output Phase ---
  23527. -/--- Input Phase ---
  23528. =>WM: (14888: I2 ^dir U)
  23529. =>WM: (14887: I2 ^reward 1)
  23530. =>WM: (14886: I2 ^see 0)
  23531. =>WM: (14885: N1059 ^status complete)
  23532. <=WM: (14873: I2 ^dir R)
  23533. <=WM: (14872: I2 ^reward 1)
  23534. <=WM: (14871: I2 ^see 0)
  23535. =>WM: (14889: I2 ^level-1 R0-root)
  23536. <=WM: (14874: I2 ^level-1 R1-root)
  23537. --- END Input Phase ---
  23538. --- Proposal Phase ---
  23539. --- Inner Elaboration Phase, active level 1 (S1) ---
  23540. Firing elaborate*copy-see-to-output-link
  23541. -->
  23542. (I3 ^see 0 +)
  23543. Firing elaborate*reward*based*on*reward
  23544. -->
  23545. (R1063 ^value 1 +)
  23546. (R1 ^reward R1063 +)
  23547. Firing propose*predict-yes
  23548. -->
  23549. (O2119 ^name predict-yes +)
  23550. (S1 ^operator O2119 +)
  23551. Firing propose*predict-no
  23552. -->
  23553. (O2120 ^name predict-no +)
  23554. (S1 ^operator O2120 +)
  23555. Firing rl*prefer*rvt*predict-no*H0*6
  23556. -->
  23557. (S1 ^operator O2118 = 0.9999999999999999)
  23558. Firing rl*prefer*rvt*predict-yes*H0*5
  23559. -->
  23560. (S1 ^operator O2117 = 0.)
  23561. Firing prefer*rvt*predict-yes*H0
  23562. -->
  23563. Firing prefer*rvt*predict-no*H0
  23564. -->
  23565. Firing elaborate*copy-dir-to-output-link
  23566. -->
  23567. (I3 ^dir U +)
  23568. inner elaboration loop at bottom goal.
  23569. Retracting elaborate*copy-see-to-output-link
  23570. -->
  23571. (I3 ^see 0 +)
  23572. Retracting propose*predict-no
  23573. -->
  23574. (O2118 ^name predict-no +)
  23575. (S1 ^operator O2118 +)
  23576. Retracting propose*predict-yes
  23577. -->
  23578. (O2117 ^name predict-yes +)
  23579. (S1 ^operator O2117 +)
  23580. Retracting elaborate*reward*based*on*reward
  23581. -->
  23582. (R1062 ^value 1 +)
  23583. (R1 ^reward R1062 +)
  23584. Retracting elaborate*copy-dir-to-output-link
  23585. -->
  23586. (I3 ^dir R +)
  23587. Retracting rl*prefer*rvt*predict-no*H0*4
  23588. -->
  23589. (S1 ^operator O2118 = 0.1269768337092951)
  23590. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  23591. -->
  23592. (S1 ^operator O2118 = 0.8730232538360513)
  23593. Retracting rl*prefer*rvt*predict-yes*H0*3
  23594. -->
  23595. (S1 ^operator O2117 = 0.3829467075510865)
  23596. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  23597. -->
  23598. (S1 ^operator O2117 = 0.08783148430849691)
  23599. =>WM: (14896: S1 ^operator O2120 +)
  23600. =>WM: (14895: S1 ^operator O2119 +)
  23601. =>WM: (14894: I3 ^dir U)
  23602. =>WM: (14893: O2120 ^name predict-no)
  23603. =>WM: (14892: O2119 ^name predict-yes)
  23604. =>WM: (14891: R1063 ^value 1)
  23605. =>WM: (14890: R1 ^reward R1063)
  23606. <=WM: (14881: S1 ^operator O2117 +)
  23607. <=WM: (14882: S1 ^operator O2118 +)
  23608. <=WM: (14883: S1 ^operator O2118)
  23609. <=WM: (14880: I3 ^dir R)
  23610. <=WM: (14876: R1 ^reward R1062)
  23611. <=WM: (14879: O2118 ^name predict-no)
  23612. <=WM: (14878: O2117 ^name predict-yes)
  23613. <=WM: (14877: R1062 ^value 1)
  23614. --- Inner Elaboration Phase, active level 1 (S1) ---
  23615. Firing prefer*rvt*predict-yes*H0
  23616. -->
  23617. Firing rl*prefer*rvt*predict-yes*H0*5
  23618. -->
  23619. (S1 ^operator O2119 = 0.)
  23620. Firing prefer*rvt*predict-no*H0
  23621. -->
  23622. Firing rl*prefer*rvt*predict-no*H0*6
  23623. -->
  23624. (S1 ^operator O2120 = 0.9999999999999999)
  23625. inner elaboration loop at bottom goal.
  23626. Retracting rl*prefer*rvt*predict-no*H0*6
  23627. -->
  23628. (S1 ^operator O2118 = 0.9999999999999999)
  23629. Retracting rl*prefer*rvt*predict-yes*H0*5
  23630. -->
  23631. (S1 ^operator O2117 = 0.)
  23632. --- END Proposal Phase ---
  23633. --- Decision Phase ---
  23634. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.952632,0.0453634)
  23635. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  23636. =>WM: (14897: S1 ^operator O2120)
  23637. 1060: O: O2120 (predict-no)
  23638. --- END Decision Phase ---
  23639. --- Application Phase ---
  23640. --- Firing Productions (PE) For State At Depth 1 ---
  23641. --- Inner Elaboration Phase, active level 1 (S1) ---
  23642. Firing apply*operator
  23643. -->
  23644. (I3 ^predict-no N1060 + :O )
  23645. Firing apply*operator*complete
  23646. -->
  23647. (I3 ^predict-no N1059 - :O )
  23648. inner elaboration loop at bottom goal.
  23649. --- Change Working Memory (PE) ---
  23650. =>WM: (14898: I3 ^predict-no N1060)
  23651. <=WM: (14885: N1059 ^status complete)
  23652. <=WM: (14884: I3 ^predict-no N1059)
  23653. --- Firing Productions (IE) For State At Depth 1 ---
  23654. --- Inner Elaboration Phase, active level 1 (S1) ---
  23655. Firing monitor*world
  23656. -->
  23657. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23658. --- Change Working Memory (IE) ---
  23659. --- END Application Phase ---
  23660. --- Output Phase ---
  23661. ENV: Agent did: predict-no for direction U in state State-B
  23662. In State-B moving U
  23663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23664. predict error 0
  23665. dir: dir isU
  23666. --- END Output Phase ---
  23667. |\---- Input Phase ---
  23668. =>WM: (14902: I2 ^dir U)
  23669. =>WM: (14901: I2 ^reward 1)
  23670. =>WM: (14900: I2 ^see 0)
  23671. =>WM: (14899: N1060 ^status complete)
  23672. <=WM: (14888: I2 ^dir U)
  23673. <=WM: (14887: I2 ^reward 1)
  23674. <=WM: (14886: I2 ^see 0)
  23675. =>WM: (14903: I2 ^level-1 R0-root)
  23676. <=WM: (14889: I2 ^level-1 R0-root)
  23677. --- END Input Phase ---
  23678. --- Proposal Phase ---
  23679. --- Inner Elaboration Phase, active level 1 (S1) ---
  23680. Firing elaborate*copy-see-to-output-link
  23681. -->
  23682. (I3 ^see 0 +)
  23683. Firing elaborate*reward*based*on*reward
  23684. -->
  23685. (R1064 ^value 1 +)
  23686. (R1 ^reward R1064 +)
  23687. Firing propose*predict-yes
  23688. -->
  23689. (O2121 ^name predict-yes +)
  23690. (S1 ^operator O2121 +)
  23691. Firing propose*predict-no
  23692. -->
  23693. (O2122 ^name predict-no +)
  23694. (S1 ^operator O2122 +)
  23695. Firing rl*prefer*rvt*predict-no*H0*6
  23696. -->
  23697. (S1 ^operator O2120 = 0.9999999999999999)
  23698. Firing rl*prefer*rvt*predict-yes*H0*5
  23699. -->
  23700. (S1 ^operator O2119 = 0.)
  23701. Firing prefer*rvt*predict-yes*H0
  23702. -->
  23703. Firing prefer*rvt*predict-no*H0
  23704. -->
  23705. Firing elaborate*copy-dir-to-output-link
  23706. -->
  23707. (I3 ^dir U +)
  23708. inner elaboration loop at bottom goal.
  23709. Retracting elaborate*copy-see-to-output-link
  23710. -->
  23711. (I3 ^see 0 +)
  23712. Retracting propose*predict-no
  23713. -->
  23714. (O2120 ^name predict-no +)
  23715. (S1 ^operator O2120 +)
  23716. Retracting propose*predict-yes
  23717. -->
  23718. (O2119 ^name predict-yes +)
  23719. (S1 ^operator O2119 +)
  23720. Retracting elaborate*reward*based*on*reward
  23721. -->
  23722. (R1063 ^value 1 +)
  23723. (R1 ^reward R1063 +)
  23724. Retracting elaborate*copy-dir-to-output-link
  23725. -->
  23726. (I3 ^dir U +)
  23727. Retracting rl*prefer*rvt*predict-no*H0*6
  23728. -->
  23729. (S1 ^operator O2120 = 0.9999999999999999)
  23730. Retracting rl*prefer*rvt*predict-yes*H0*5
  23731. -->
  23732. (S1 ^operator O2119 = 0.)
  23733. =>WM: (14909: S1 ^operator O2122 +)
  23734. =>WM: (14908: S1 ^operator O2121 +)
  23735. =>WM: (14907: O2122 ^name predict-no)
  23736. =>WM: (14906: O2121 ^name predict-yes)
  23737. =>WM: (14905: R1064 ^value 1)
  23738. =>WM: (14904: R1 ^reward R1064)
  23739. <=WM: (14895: S1 ^operator O2119 +)
  23740. <=WM: (14896: S1 ^operator O2120 +)
  23741. <=WM: (14897: S1 ^operator O2120)
  23742. <=WM: (14890: R1 ^reward R1063)
  23743. <=WM: (14893: O2120 ^name predict-no)
  23744. <=WM: (14892: O2119 ^name predict-yes)
  23745. <=WM: (14891: R1063 ^value 1)
  23746. --- Inner Elaboration Phase, active level 1 (S1) ---
  23747. Firing prefer*rvt*predict-yes*H0
  23748. -->
  23749. Firing rl*prefer*rvt*predict-yes*H0*5
  23750. -->
  23751. (S1 ^operator O2121 = 0.)
  23752. Firing prefer*rvt*predict-no*H0
  23753. -->
  23754. Firing rl*prefer*rvt*predict-no*H0*6
  23755. -->
  23756. (S1 ^operator O2122 = 0.9999999999999999)
  23757. inner elaboration loop at bottom goal.
  23758. Retracting rl*prefer*rvt*predict-no*H0*6
  23759. -->
  23760. (S1 ^operator O2120 = 0.9999999999999999)
  23761. Retracting rl*prefer*rvt*predict-yes*H0*5
  23762. -->
  23763. (S1 ^operator O2119 = 0.)
  23764. --- END Proposal Phase ---
  23765. --- Decision Phase ---
  23766. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23767. =>WM: (14910: S1 ^operator O2122)
  23768. 1061: O: O2122 (predict-no)
  23769. --- END Decision Phase ---
  23770. --- Application Phase ---
  23771. --- Firing Productions (PE) For State At Depth 1 ---
  23772. --- Inner Elaboration Phase, active level 1 (S1) ---
  23773. Firing apply*operator
  23774. -->
  23775. (I3 ^predict-no N1061 + :O )
  23776. Firing apply*operator*complete
  23777. -->
  23778. (I3 ^predict-no N1060 - :O )
  23779. inner elaboration loop at bottom goal.
  23780. --- Change Working Memory (PE) ---
  23781. =>WM: (14911: I3 ^predict-no N1061)
  23782. <=WM: (14899: N1060 ^status complete)
  23783. <=WM: (14898: I3 ^predict-no N1060)
  23784. --- Firing Productions (IE) For State At Depth 1 ---
  23785. --- Inner Elaboration Phase, active level 1 (S1) ---
  23786. Firing monitor*world
  23787. -->
  23788. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23789. --- Change Working Memory (IE) ---
  23790. --- END Application Phase ---
  23791. --- Output Phase ---
  23792. ENV: Agent did: predict-no for direction U in state State-B
  23793. In State-B moving U
  23794. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23795. predict error 0
  23796. dir: dir isR
  23797. --- END Output Phase ---
  23798. /--- Input Phase ---
  23799. =>WM: (14915: I2 ^dir R)
  23800. =>WM: (14914: I2 ^reward 1)
  23801. =>WM: (14913: I2 ^see 0)
  23802. =>WM: (14912: N1061 ^status complete)
  23803. <=WM: (14902: I2 ^dir U)
  23804. <=WM: (14901: I2 ^reward 1)
  23805. <=WM: (14900: I2 ^see 0)
  23806. =>WM: (14916: I2 ^level-1 R0-root)
  23807. <=WM: (14903: I2 ^level-1 R0-root)
  23808. --- END Input Phase ---
  23809. --- Proposal Phase ---
  23810. --- Inner Elaboration Phase, active level 1 (S1) ---
  23811. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  23812. -->
  23813. (S1 ^operator O2121 = 0.2696941111808541)
  23814. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  23815. -->
  23816. (S1 ^operator O2122 = 0.8730231102721162)
  23817. Firing prefer*rvt*predict-no*H0*4*H1
  23818. -->
  23819. Firing prefer*rvt*predict-yes*H0*3*H1
  23820. -->
  23821. Firing elaborate*copy-see-to-output-link
  23822. -->
  23823. (I3 ^see 0 +)
  23824. Firing elaborate*reward*based*on*reward
  23825. -->
  23826. (R1065 ^value 1 +)
  23827. (R1 ^reward R1065 +)
  23828. Firing propose*predict-yes
  23829. -->
  23830. (O2123 ^name predict-yes +)
  23831. (S1 ^operator O2123 +)
  23832. Firing propose*predict-no
  23833. -->
  23834. (O2124 ^name predict-no +)
  23835. (S1 ^operator O2124 +)
  23836. Firing rl*prefer*rvt*predict-no*H0*4
  23837. -->
  23838. (S1 ^operator O2122 = 0.1269768205774933)
  23839. Firing rl*prefer*rvt*predict-yes*H0*3
  23840. -->
  23841. (S1 ^operator O2121 = 0.3829467075510865)
  23842. Firing prefer*rvt*predict-yes*H0
  23843. -->
  23844. Firing prefer*rvt*predict-no*H0
  23845. -->
  23846. Firing elaborate*copy-dir-to-output-link
  23847. -->
  23848. (I3 ^dir R +)
  23849. inner elaboration loop at bottom goal.
  23850. Retracting elaborate*copy-see-to-output-link
  23851. -->
  23852. (I3 ^see 0 +)
  23853. Retracting propose*predict-no
  23854. -->
  23855. (O2122 ^name predict-no +)
  23856. (S1 ^operator O2122 +)
  23857. Retracting propose*predict-yes
  23858. -->
  23859. (O2121 ^name predict-yes +)
  23860. (S1 ^operator O2121 +)
  23861. Retracting elaborate*reward*based*on*reward
  23862. -->
  23863. (R1064 ^value 1 +)
  23864. (R1 ^reward R1064 +)
  23865. Retracting elaborate*copy-dir-to-output-link
  23866. -->
  23867. (I3 ^dir U +)
  23868. Retracting rl*prefer*rvt*predict-no*H0*6
  23869. -->
  23870. (S1 ^operator O2122 = 0.9999999999999999)
  23871. Retracting rl*prefer*rvt*predict-yes*H0*5
  23872. -->
  23873. (S1 ^operator O2121 = 0.)
  23874. =>WM: (14923: S1 ^operator O2124 +)
  23875. =>WM: (14922: S1 ^operator O2123 +)
  23876. =>WM: (14921: I3 ^dir R)
  23877. =>WM: (14920: O2124 ^name predict-no)
  23878. =>WM: (14919: O2123 ^name predict-yes)
  23879. =>WM: (14918: R1065 ^value 1)
  23880. =>WM: (14917: R1 ^reward R1065)
  23881. <=WM: (14908: S1 ^operator O2121 +)
  23882. <=WM: (14909: S1 ^operator O2122 +)
  23883. <=WM: (14910: S1 ^operator O2122)
  23884. <=WM: (14894: I3 ^dir U)
  23885. <=WM: (14904: R1 ^reward R1064)
  23886. <=WM: (14907: O2122 ^name predict-no)
  23887. <=WM: (14906: O2121 ^name predict-yes)
  23888. <=WM: (14905: R1064 ^value 1)
  23889. --- Inner Elaboration Phase, active level 1 (S1) ---
  23890. Firing prefer*rvt*predict-yes*H0
  23891. -->
  23892. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  23893. -->
  23894. (S1 ^operator O2123 = 0.2696941111808541)
  23895. Firing rl*prefer*rvt*predict-yes*H0*3
  23896. -->
  23897. (S1 ^operator O2123 = 0.3829467075510865)
  23898. Firing prefer*rvt*predict-yes*H0*3*H1
  23899. -->
  23900. Firing prefer*rvt*predict-no*H0
  23901. -->
  23902. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  23903. -->
  23904. (S1 ^operator O2124 = 0.8730231102721162)
  23905. Firing rl*prefer*rvt*predict-no*H0*4
  23906. -->
  23907. (S1 ^operator O2124 = 0.1269768205774933)
  23908. Firing prefer*rvt*predict-no*H0*4*H1
  23909. -->
  23910. inner elaboration loop at bottom goal.
  23911. Retracting rl*prefer*rvt*predict-no*H0*4
  23912. -->
  23913. (S1 ^operator O2122 = 0.1269768205774933)
  23914. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  23915. -->
  23916. (S1 ^operator O2122 = 0.8730231102721162)
  23917. Retracting rl*prefer*rvt*predict-yes*H0*3
  23918. -->
  23919. (S1 ^operator O2121 = 0.3829467075510865)
  23920. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  23921. -->
  23922. (S1 ^operator O2121 = 0.2696941111808541)
  23923. --- END Proposal Phase ---
  23924. --- Decision Phase ---
  23925. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23926. =>WM: (14924: S1 ^operator O2124)
  23927. 1062: O: O2124 (predict-no)
  23928. --- END Decision Phase ---
  23929. --- Application Phase ---
  23930. --- Firing Productions (PE) For State At Depth 1 ---
  23931. --- Inner Elaboration Phase, active level 1 (S1) ---
  23932. Firing apply*operator
  23933. -->
  23934. (I3 ^predict-no N1062 + :O )
  23935. Firing apply*operator*complete
  23936. -->
  23937. (I3 ^predict-no N1061 - :O )
  23938. inner elaboration loop at bottom goal.
  23939. --- Change Working Memory (PE) ---
  23940. =>WM: (14925: I3 ^predict-no N1062)
  23941. <=WM: (14912: N1061 ^status complete)
  23942. <=WM: (14911: I3 ^predict-no N1061)
  23943. --- Firing Productions (IE) For State At Depth 1 ---
  23944. --- Inner Elaboration Phase, active level 1 (S1) ---
  23945. Firing monitor*world
  23946. -->
  23947. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23948. --- Change Working Memory (IE) ---
  23949. --- END Application Phase ---
  23950. --- Output Phase ---
  23951. ENV: Agent did: predict-no for direction R in state State-B
  23952. In State-B moving R
  23953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23954. predict error 0
  23955. dir: dir isL
  23956. --- END Output Phase ---
  23957. |\-/--- Input Phase ---
  23958. =>WM: (14929: I2 ^dir L)
  23959. =>WM: (14928: I2 ^reward 1)
  23960. =>WM: (14927: I2 ^see 0)
  23961. =>WM: (14926: N1062 ^status complete)
  23962. <=WM: (14915: I2 ^dir R)
  23963. <=WM: (14914: I2 ^reward 1)
  23964. <=WM: (14913: I2 ^see 0)
  23965. =>WM: (14930: I2 ^level-1 R0-root)
  23966. <=WM: (14916: I2 ^level-1 R0-root)
  23967. --- END Input Phase ---
  23968. --- Proposal Phase ---
  23969. --- Inner Elaboration Phase, active level 1 (S1) ---
  23970. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  23971. -->
  23972. (S1 ^operator O2123 = 0.4768811873342281)
  23973. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  23974. -->
  23975. (S1 ^operator O2124 = 0.1700769046561409)
  23976. Firing prefer*rvt*predict-no*H0*2*H1
  23977. -->
  23978. Firing prefer*rvt*predict-yes*H0*1*H1
  23979. -->
  23980. Firing elaborate*copy-see-to-output-link
  23981. -->
  23982. (I3 ^see 0 +)
  23983. Firing elaborate*reward*based*on*reward
  23984. -->
  23985. (R1066 ^value 1 +)
  23986. (R1 ^reward R1066 +)
  23987. Firing propose*predict-yes
  23988. -->
  23989. (O2125 ^name predict-yes +)
  23990. (S1 ^operator O2125 +)
  23991. Firing propose*predict-no
  23992. -->
  23993. (O2126 ^name predict-no +)
  23994. (S1 ^operator O2126 +)
  23995. Firing rl*prefer*rvt*predict-no*H0*2
  23996. -->
  23997. (S1 ^operator O2124 = 0.2550133912230119)
  23998. Firing rl*prefer*rvt*predict-yes*H0*1
  23999. -->
  24000. (S1 ^operator O2123 = 0.5231194590860083)
  24001. Firing prefer*rvt*predict-yes*H0
  24002. -->
  24003. Firing prefer*rvt*predict-no*H0
  24004. -->
  24005. Firing elaborate*copy-dir-to-output-link
  24006. -->
  24007. (I3 ^dir L +)
  24008. inner elaboration loop at bottom goal.
  24009. Retracting elaborate*copy-see-to-output-link
  24010. -->
  24011. (I3 ^see 0 +)
  24012. Retracting propose*predict-no
  24013. -->
  24014. (O2124 ^name predict-no +)
  24015. (S1 ^operator O2124 +)
  24016. Retracting propose*predict-yes
  24017. -->
  24018. (O2123 ^name predict-yes +)
  24019. (S1 ^operator O2123 +)
  24020. Retracting elaborate*reward*based*on*reward
  24021. -->
  24022. (R1065 ^value 1 +)
  24023. (R1 ^reward R1065 +)
  24024. Retracting elaborate*copy-dir-to-output-link
  24025. -->
  24026. (I3 ^dir R +)
  24027. Retracting rl*prefer*rvt*predict-no*H0*4
  24028. -->
  24029. (S1 ^operator O2124 = 0.1269768205774933)
  24030. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  24031. -->
  24032. (S1 ^operator O2124 = 0.8730231102721162)
  24033. Retracting rl*prefer*rvt*predict-yes*H0*3
  24034. -->
  24035. (S1 ^operator O2123 = 0.3829467075510865)
  24036. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  24037. -->
  24038. (S1 ^operator O2123 = 0.2696941111808541)
  24039. =>WM: (14937: S1 ^operator O2126 +)
  24040. =>WM: (14936: S1 ^operator O2125 +)
  24041. =>WM: (14935: I3 ^dir L)
  24042. =>WM: (14934: O2126 ^name predict-no)
  24043. =>WM: (14933: O2125 ^name predict-yes)
  24044. =>WM: (14932: R1066 ^value 1)
  24045. =>WM: (14931: R1 ^reward R1066)
  24046. <=WM: (14922: S1 ^operator O2123 +)
  24047. <=WM: (14923: S1 ^operator O2124 +)
  24048. <=WM: (14924: S1 ^operator O2124)
  24049. <=WM: (14921: I3 ^dir R)
  24050. <=WM: (14917: R1 ^reward R1065)
  24051. <=WM: (14920: O2124 ^name predict-no)
  24052. <=WM: (14919: O2123 ^name predict-yes)
  24053. <=WM: (14918: R1065 ^value 1)
  24054. --- Inner Elaboration Phase, active level 1 (S1) ---
  24055. Firing prefer*rvt*predict-yes*H0
  24056. -->
  24057. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  24058. -->
  24059. (S1 ^operator O2125 = 0.4768811873342281)
  24060. Firing rl*prefer*rvt*predict-yes*H0*1
  24061. -->
  24062. (S1 ^operator O2125 = 0.5231194590860083)
  24063. Firing prefer*rvt*predict-yes*H0*1*H1
  24064. -->
  24065. Firing prefer*rvt*predict-no*H0
  24066. -->
  24067. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  24068. -->
  24069. (S1 ^operator O2126 = 0.1700769046561409)
  24070. Firing rl*prefer*rvt*predict-no*H0*2
  24071. -->
  24072. (S1 ^operator O2126 = 0.2550133912230119)
  24073. Firing prefer*rvt*predict-no*H0*2*H1
  24074. -->
  24075. inner elaboration loop at bottom goal.
  24076. Retracting rl*prefer*rvt*predict-no*H0*2
  24077. -->
  24078. (S1 ^operator O2124 = 0.2550133912230119)
  24079. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  24080. -->
  24081. (S1 ^operator O2124 = 0.1700769046561409)
  24082. Retracting rl*prefer*rvt*predict-yes*H0*1
  24083. -->
  24084. (S1 ^operator O2123 = 0.5231194590860083)
  24085. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  24086. -->
  24087. (S1 ^operator O2123 = 0.4768811873342281)
  24088. --- END Proposal Phase ---
  24089. --- Decision Phase ---
  24090. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.95288,0.0451364)
  24091. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  24092. =>WM: (14938: S1 ^operator O2125)
  24093. 1063: O: O2125 (predict-yes)
  24094. --- END Decision Phase ---
  24095. --- Application Phase ---
  24096. --- Firing Productions (PE) For State At Depth 1 ---
  24097. --- Inner Elaboration Phase, active level 1 (S1) ---
  24098. Firing apply*operator
  24099. -->
  24100. (I3 ^predict-yes N1063 + :O )
  24101. Firing apply*operator*complete
  24102. -->
  24103. (I3 ^predict-no N1062 - :O )
  24104. inner elaboration loop at bottom goal.
  24105. --- Change Working Memory (PE) ---
  24106. =>WM: (14939: I3 ^predict-yes N1063)
  24107. <=WM: (14926: N1062 ^status complete)
  24108. <=WM: (14925: I3 ^predict-no N1062)
  24109. --- Firing Productions (IE) For State At Depth 1 ---
  24110. --- Inner Elaboration Phase, active level 1 (S1) ---
  24111. Firing monitor*world
  24112. -->
  24113. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24114. --- Change Working Memory (IE) ---
  24115. --- END Application Phase ---
  24116. --- Output Phase ---
  24117. ENV: Agent did: predict-yes for direction L in state State-B
  24118. In State-B moving L
  24119. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  24120. predict error 0
  24121. dir: dir isR
  24122. --- END Output Phase ---
  24123. |\---- Input Phase ---
  24124. =>WM: (14943: I2 ^dir R)
  24125. =>WM: (14942: I2 ^reward 1)
  24126. =>WM: (14941: I2 ^see 1)
  24127. =>WM: (14940: N1063 ^status complete)
  24128. <=WM: (14929: I2 ^dir L)
  24129. <=WM: (14928: I2 ^reward 1)
  24130. <=WM: (14927: I2 ^see 0)
  24131. =>WM: (14944: I2 ^level-1 L1-root)
  24132. <=WM: (14930: I2 ^level-1 R0-root)
  24133. --- END Input Phase ---
  24134. --- Proposal Phase ---
  24135. --- Inner Elaboration Phase, active level 1 (S1) ---
  24136. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  24137. -->
  24138. (S1 ^operator O2125 = 0.6170506816913311)
  24139. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  24140. -->
  24141. (S1 ^operator O2126 = 0.4901349546100854)
  24142. Firing prefer*rvt*predict-no*H0*4*H1
  24143. -->
  24144. Firing prefer*rvt*predict-yes*H0*3*H1
  24145. -->
  24146. Firing elaborate*copy-see-to-output-link
  24147. -->
  24148. (I3 ^see 1 +)
  24149. Firing elaborate*reward*based*on*reward
  24150. -->
  24151. (R1067 ^value 1 +)
  24152. (R1 ^reward R1067 +)
  24153. Firing propose*predict-yes
  24154. -->
  24155. (O2127 ^name predict-yes +)
  24156. (S1 ^operator O2127 +)
  24157. Firing propose*predict-no
  24158. -->
  24159. (O2128 ^name predict-no +)
  24160. (S1 ^operator O2128 +)
  24161. Firing rl*prefer*rvt*predict-no*H0*4
  24162. -->
  24163. (S1 ^operator O2126 = 0.1269768309500519)
  24164. Firing rl*prefer*rvt*predict-yes*H0*3
  24165. -->
  24166. (S1 ^operator O2125 = 0.3829467075510865)
  24167. Firing prefer*rvt*predict-yes*H0
  24168. -->
  24169. Firing prefer*rvt*predict-no*H0
  24170. -->
  24171. Firing elaborate*copy-dir-to-output-link
  24172. -->
  24173. (I3 ^dir R +)
  24174. inner elaboration loop at bottom goal.
  24175. Retracting elaborate*copy-see-to-output-link
  24176. -->
  24177. (I3 ^see 0 +)
  24178. Retracting propose*predict-no
  24179. -->
  24180. (O2126 ^name predict-no +)
  24181. (S1 ^operator O2126 +)
  24182. Retracting propose*predict-yes
  24183. -->
  24184. (O2125 ^name predict-yes +)
  24185. (S1 ^operator O2125 +)
  24186. Retracting elaborate*reward*based*on*reward
  24187. -->
  24188. (R1066 ^value 1 +)
  24189. (R1 ^reward R1066 +)
  24190. Retracting elaborate*copy-dir-to-output-link
  24191. -->
  24192. (I3 ^dir L +)
  24193. Retracting rl*prefer*rvt*predict-no*H0*2
  24194. -->
  24195. (S1 ^operator O2126 = 0.2550133912230119)
  24196. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  24197. -->
  24198. (S1 ^operator O2126 = 0.1700769046561409)
  24199. Retracting rl*prefer*rvt*predict-yes*H0*1
  24200. -->
  24201. (S1 ^operator O2125 = 0.5231194590860083)
  24202. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  24203. -->
  24204. (S1 ^operator O2125 = 0.4768811873342281)
  24205. =>WM: (14952: S1 ^operator O2128 +)
  24206. =>WM: (14951: S1 ^operator O2127 +)
  24207. =>WM: (14950: I3 ^dir R)
  24208. =>WM: (14949: O2128 ^name predict-no)
  24209. =>WM: (14948: O2127 ^name predict-yes)
  24210. =>WM: (14947: R1067 ^value 1)
  24211. =>WM: (14946: R1 ^reward R1067)
  24212. =>WM: (14945: I3 ^see 1)
  24213. <=WM: (14936: S1 ^operator O2125 +)
  24214. <=WM: (14938: S1 ^operator O2125)
  24215. <=WM: (14937: S1 ^operator O2126 +)
  24216. <=WM: (14935: I3 ^dir L)
  24217. <=WM: (14931: R1 ^reward R1066)
  24218. <=WM: (14875: I3 ^see 0)
  24219. <=WM: (14934: O2126 ^name predict-no)
  24220. <=WM: (14933: O2125 ^name predict-yes)
  24221. <=WM: (14932: R1066 ^value 1)
  24222. --- Inner Elaboration Phase, active level 1 (S1) ---
  24223. Firing prefer*rvt*predict-yes*H0
  24224. -->
  24225. Firing rl*prefer*rvt*predict-yes*H0*3
  24226. -->
  24227. (S1 ^operator O2127 = 0.3829467075510865)
  24228. Firing prefer*rvt*predict-yes*H0*3*H1
  24229. -->
  24230. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  24231. -->
  24232. (S1 ^operator O2127 = 0.6170506816913311)
  24233. Firing prefer*rvt*predict-no*H0
  24234. -->
  24235. Firing rl*prefer*rvt*predict-no*H0*4
  24236. -->
  24237. (S1 ^operator O2128 = 0.1269768309500519)
  24238. Firing prefer*rvt*predict-no*H0*4*H1
  24239. -->
  24240. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  24241. -->
  24242. (S1 ^operator O2128 = 0.4901349546100854)
  24243. inner elaboration loop at bottom goal.
  24244. Retracting rl*prefer*rvt*predict-no*H0*4
  24245. -->
  24246. (S1 ^operator O2126 = 0.1269768309500519)
  24247. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  24248. -->
  24249. (S1 ^operator O2126 = 0.4901349546100854)
  24250. Retracting rl*prefer*rvt*predict-yes*H0*3
  24251. -->
  24252. (S1 ^operator O2125 = 0.3829467075510865)
  24253. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  24254. -->
  24255. (S1 ^operator O2125 = 0.6170506816913311)
  24256. --- END Proposal Phase ---
  24257. --- Decision Phase ---
  24258. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980392,0.0193498)
  24259. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272042 0.20484 0.476881 -> 0.272042 0.20484 0.476881(R,m,v=1,1,0)
  24260. =>WM: (14953: S1 ^operator O2127)
  24261. 1064: O: O2127 (predict-yes)
  24262. --- END Decision Phase ---
  24263. --- Application Phase ---
  24264. --- Firing Productions (PE) For State At Depth 1 ---
  24265. --- Inner Elaboration Phase, active level 1 (S1) ---
  24266. Firing apply*operator
  24267. -->
  24268. (I3 ^predict-yes N1064 + :O )
  24269. Firing apply*operator*complete
  24270. -->
  24271. (I3 ^predict-yes N1063 - :O )
  24272. inner elaboration loop at bottom goal.
  24273. --- Change Working Memory (PE) ---
  24274. =>WM: (14954: I3 ^predict-yes N1064)
  24275. <=WM: (14940: N1063 ^status complete)
  24276. <=WM: (14939: I3 ^predict-yes N1063)
  24277. --- Firing Productions (IE) For State At Depth 1 ---
  24278. --- Inner Elaboration Phase, active level 1 (S1) ---
  24279. Firing monitor*world
  24280. -->
  24281. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24282. --- Change Working Memory (IE) ---
  24283. --- END Application Phase ---
  24284. --- Output Phase ---
  24285. ENV: Agent did: predict-yes for direction R in state State-A
  24286. In State-A moving R
  24287. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  24288. predict error 0
  24289. dir: dir isR
  24290. --- END Output Phase ---
  24291. /|\---- Input Phase ---
  24292. =>WM: (14958: I2 ^dir R)
  24293. =>WM: (14957: I2 ^reward 1)
  24294. =>WM: (14956: I2 ^see 1)
  24295. =>WM: (14955: N1064 ^status complete)
  24296. <=WM: (14943: I2 ^dir R)
  24297. <=WM: (14942: I2 ^reward 1)
  24298. <=WM: (14941: I2 ^see 1)
  24299. =>WM: (14959: I2 ^level-1 R1-root)
  24300. <=WM: (14944: I2 ^level-1 L1-root)
  24301. --- END Input Phase ---
  24302. --- Proposal Phase ---
  24303. --- Inner Elaboration Phase, active level 1 (S1) ---
  24304. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  24305. -->
  24306. (S1 ^operator O2127 = 0.08783148430849691)
  24307. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  24308. -->
  24309. (S1 ^operator O2128 = 0.8730232407042493)
  24310. Firing prefer*rvt*predict-no*H0*4*H1
  24311. -->
  24312. Firing prefer*rvt*predict-yes*H0*3*H1
  24313. -->
  24314. Firing elaborate*copy-see-to-output-link
  24315. -->
  24316. (I3 ^see 1 +)
  24317. Firing elaborate*reward*based*on*reward
  24318. -->
  24319. (R1068 ^value 1 +)
  24320. (R1 ^reward R1068 +)
  24321. Firing propose*predict-yes
  24322. -->
  24323. (O2129 ^name predict-yes +)
  24324. (S1 ^operator O2129 +)
  24325. Firing propose*predict-no
  24326. -->
  24327. (O2130 ^name predict-no +)
  24328. (S1 ^operator O2130 +)
  24329. Firing rl*prefer*rvt*predict-no*H0*4
  24330. -->
  24331. (S1 ^operator O2128 = 0.1269768309500519)
  24332. Firing rl*prefer*rvt*predict-yes*H0*3
  24333. -->
  24334. (S1 ^operator O2127 = 0.3829467075510865)
  24335. Firing prefer*rvt*predict-yes*H0
  24336. -->
  24337. Firing prefer*rvt*predict-no*H0
  24338. -->
  24339. Firing elaborate*copy-dir-to-output-link
  24340. -->
  24341. (I3 ^dir R +)
  24342. inner elaboration loop at bottom goal.
  24343. Retracting elaborate*copy-see-to-output-link
  24344. -->
  24345. (I3 ^see 1 +)
  24346. Retracting propose*predict-no
  24347. -->
  24348. (O2128 ^name predict-no +)
  24349. (S1 ^operator O2128 +)
  24350. Retracting propose*predict-yes
  24351. -->
  24352. (O2127 ^name predict-yes +)
  24353. (S1 ^operator O2127 +)
  24354. Retracting elaborate*reward*based*on*reward
  24355. -->
  24356. (R1067 ^value 1 +)
  24357. (R1 ^reward R1067 +)
  24358. Retracting elaborate*copy-dir-to-output-link
  24359. -->
  24360. (I3 ^dir R +)
  24361. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  24362. -->
  24363. (S1 ^operator O2128 = 0.4901349546100854)
  24364. Retracting rl*prefer*rvt*predict-no*H0*4
  24365. -->
  24366. (S1 ^operator O2128 = 0.1269768309500519)
  24367. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  24368. -->
  24369. (S1 ^operator O2127 = 0.6170506816913311)
  24370. Retracting rl*prefer*rvt*predict-yes*H0*3
  24371. -->
  24372. (S1 ^operator O2127 = 0.3829467075510865)
  24373. =>WM: (14965: S1 ^operator O2130 +)
  24374. =>WM: (14964: S1 ^operator O2129 +)
  24375. =>WM: (14963: O2130 ^name predict-no)
  24376. =>WM: (14962: O2129 ^name predict-yes)
  24377. =>WM: (14961: R1068 ^value 1)
  24378. =>WM: (14960: R1 ^reward R1068)
  24379. <=WM: (14951: S1 ^operator O2127 +)
  24380. <=WM: (14953: S1 ^operator O2127)
  24381. <=WM: (14952: S1 ^operator O2128 +)
  24382. <=WM: (14946: R1 ^reward R1067)
  24383. <=WM: (14949: O2128 ^name predict-no)
  24384. <=WM: (14948: O2127 ^name predict-yes)
  24385. <=WM: (14947: R1067 ^value 1)
  24386. --- Inner Elaboration Phase, active level 1 (S1) ---
  24387. Firing prefer*rvt*predict-yes*H0
  24388. -->
  24389. Firing rl*prefer*rvt*predict-yes*H0*3
  24390. -->
  24391. (S1 ^operator O2129 = 0.3829467075510865)
  24392. Firing prefer*rvt*predict-yes*H0*3*H1
  24393. -->
  24394. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  24395. -->
  24396. (S1 ^operator O2129 = 0.08783148430849691)
  24397. Firing prefer*rvt*predict-no*H0
  24398. -->
  24399. Firing rl*prefer*rvt*predict-no*H0*4
  24400. -->
  24401. (S1 ^operator O2130 = 0.1269768309500519)
  24402. Firing prefer*rvt*predict-no*H0*4*H1
  24403. -->
  24404. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  24405. -->
  24406. (S1 ^operator O2130 = 0.8730232407042493)
  24407. inner elaboration loop at bottom goal.
  24408. Retracting rl*prefer*rvt*predict-no*H0*4
  24409. -->
  24410. (S1 ^operator O2128 = 0.1269768309500519)
  24411. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  24412. -->
  24413. (S1 ^operator O2128 = 0.8730232407042493)
  24414. Retracting rl*prefer*rvt*predict-yes*H0*3
  24415. -->
  24416. (S1 ^operator O2127 = 0.3829467075510865)
  24417. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  24418. -->
  24419. (S1 ^operator O2127 = 0.08783148430849691)
  24420. --- END Proposal Phase ---
  24421. --- Decision Phase ---
  24422. RL update rl*prefer*rvt*predict-yes*H0*3 0.673139 -0.290193 0.382947 -> 0.67314 -0.290193 0.382947(R,m,v=1,0.963415,0.0354631)
  24423. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326858 0.290192 0.617051 -> 0.326859 0.290192 0.617051(R,m,v=1,1,0)
  24424. =>WM: (14966: S1 ^operator O2130)
  24425. 1065: O: O2130 (predict-no)
  24426. --- END Decision Phase ---
  24427. --- Application Phase ---
  24428. --- Firing Productions (PE) For State At Depth 1 ---
  24429. --- Inner Elaboration Phase, active level 1 (S1) ---
  24430. Firing apply*operator
  24431. -->
  24432. (I3 ^predict-no N1065 + :O )
  24433. Firing apply*operator*complete
  24434. -->
  24435. (I3 ^predict-yes N1064 - :O )
  24436. inner elaboration loop at bottom goal.
  24437. --- Change Working Memory (PE) ---
  24438. =>WM: (14967: I3 ^predict-no N1065)
  24439. <=WM: (14955: N1064 ^status complete)
  24440. <=WM: (14954: I3 ^predict-yes N1064)
  24441. --- Firing Productions (IE) For State At Depth 1 ---
  24442. --- Inner Elaboration Phase, active level 1 (S1) ---
  24443. Firing monitor*world
  24444. -->
  24445. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24446. --- Change Working Memory (IE) ---
  24447. --- END Application Phase ---
  24448. --- Output Phase ---
  24449. ENV: Agent did: predict-no for direction R in state State-B
  24450. In State-B moving R
  24451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24452. predict error 0
  24453. dir: dir isR
  24454. --- END Output Phase ---
  24455. /|\---- Input Phase ---
  24456. =>WM: (14971: I2 ^dir R)
  24457. =>WM: (14970: I2 ^reward 1)
  24458. =>WM: (14969: I2 ^see 0)
  24459. =>WM: (14968: N1065 ^status complete)
  24460. <=WM: (14958: I2 ^dir R)
  24461. <=WM: (14957: I2 ^reward 1)
  24462. <=WM: (14956: I2 ^see 1)
  24463. =>WM: (14972: I2 ^level-1 R0-root)
  24464. <=WM: (14959: I2 ^level-1 R1-root)
  24465. --- END Input Phase ---
  24466. --- Proposal Phase ---
  24467. --- Inner Elaboration Phase, active level 1 (S1) ---
  24468. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  24469. -->
  24470. (S1 ^operator O2129 = 0.2696941111808541)
  24471. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  24472. -->
  24473. (S1 ^operator O2130 = 0.8730231206446747)
  24474. Firing prefer*rvt*predict-no*H0*4*H1
  24475. -->
  24476. Firing prefer*rvt*predict-yes*H0*3*H1
  24477. -->
  24478. Firing elaborate*copy-see-to-output-link
  24479. -->
  24480. (I3 ^see 0 +)
  24481. Firing elaborate*reward*based*on*reward
  24482. -->
  24483. (R1069 ^value 1 +)
  24484. (R1 ^reward R1069 +)
  24485. Firing propose*predict-yes
  24486. -->
  24487. (O2131 ^name predict-yes +)
  24488. (S1 ^operator O2131 +)
  24489. Firing propose*predict-no
  24490. -->
  24491. (O2132 ^name predict-no +)
  24492. (S1 ^operator O2132 +)
  24493. Firing rl*prefer*rvt*predict-no*H0*4
  24494. -->
  24495. (S1 ^operator O2130 = 0.1269768309500519)
  24496. Firing rl*prefer*rvt*predict-yes*H0*3
  24497. -->
  24498. (S1 ^operator O2129 = 0.3829470991647237)
  24499. Firing prefer*rvt*predict-yes*H0
  24500. -->
  24501. Firing prefer*rvt*predict-no*H0
  24502. -->
  24503. Firing elaborate*copy-dir-to-output-link
  24504. -->
  24505. (I3 ^dir R +)
  24506. inner elaboration loop at bottom goal.
  24507. Retracting elaborate*copy-see-to-output-link
  24508. -->
  24509. (I3 ^see 1 +)
  24510. Retracting propose*predict-no
  24511. -->
  24512. (O2130 ^name predict-no +)
  24513. (S1 ^operator O2130 +)
  24514. Retracting propose*predict-yes
  24515. -->
  24516. (O2129 ^name predict-yes +)
  24517. (S1 ^operator O2129 +)
  24518. Retracting elaborate*reward*based*on*reward
  24519. -->
  24520. (R1068 ^value 1 +)
  24521. (R1 ^reward R1068 +)
  24522. Retracting elaborate*copy-dir-to-output-link
  24523. -->
  24524. (I3 ^dir R +)
  24525. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  24526. -->
  24527. (S1 ^operator O2130 = 0.8730232407042493)
  24528. Retracting rl*prefer*rvt*predict-no*H0*4
  24529. -->
  24530. (S1 ^operator O2130 = 0.1269768309500519)
  24531. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  24532. -->
  24533. (S1 ^operator O2129 = 0.08783148430849691)
  24534. Retracting rl*prefer*rvt*predict-yes*H0*3
  24535. -->
  24536. (S1 ^operator O2129 = 0.3829470991647237)
  24537. =>WM: (14979: S1 ^operator O2132 +)
  24538. =>WM: (14978: S1 ^operator O2131 +)
  24539. =>WM: (14977: O2132 ^name predict-no)
  24540. =>WM: (14976: O2131 ^name predict-yes)
  24541. =>WM: (14975: R1069 ^value 1)
  24542. =>WM: (14974: R1 ^reward R1069)
  24543. =>WM: (14973: I3 ^see 0)
  24544. <=WM: (14964: S1 ^operator O2129 +)
  24545. <=WM: (14965: S1 ^operator O2130 +)
  24546. <=WM: (14966: S1 ^operator O2130)
  24547. <=WM: (14960: R1 ^reward R1068)
  24548. <=WM: (14945: I3 ^see 1)
  24549. <=WM: (14963: O2130 ^name predict-no)
  24550. <=WM: (14962: O2129 ^name predict-yes)
  24551. <=WM: (14961: R1068 ^value 1)
  24552. --- Inner Elaboration Phase, active level 1 (S1) ---
  24553. Firing prefer*rvt*predict-yes*H0
  24554. -->
  24555. Firing rl*prefer*rvt*predict-yes*H0*3
  24556. -->
  24557. (S1 ^operator O2131 = 0.3829470991647237)
  24558. Firing prefer*rvt*predict-yes*H0*3*H1
  24559. -->
  24560. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  24561. -->
  24562. (S1 ^operator O2131 = 0.2696941111808541)
  24563. Firing prefer*rvt*predict-no*H0
  24564. -->
  24565. Firing rl*prefer*rvt*predict-no*H0*4
  24566. -->
  24567. (S1 ^operator O2132 = 0.1269768309500519)
  24568. Firing prefer*rvt*predict-no*H0*4*H1
  24569. -->
  24570. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  24571. -->
  24572. (S1 ^operator O2132 = 0.8730231206446747)
  24573. inner elaboration loop at bottom goal.
  24574. Retracting rl*prefer*rvt*predict-no*H0*4
  24575. -->
  24576. (S1 ^operator O2130 = 0.1269768309500519)
  24577. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  24578. -->
  24579. (S1 ^operator O2130 = 0.8730231206446747)
  24580. Retracting rl*prefer*rvt*predict-yes*H0*3
  24581. -->
  24582. (S1 ^operator O2129 = 0.3829470991647237)
  24583. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  24584. -->
  24585. (S1 ^operator O2129 = 0.2696941111808541)
  24586. --- END Proposal Phase ---
  24587. --- Decision Phase ---
  24588. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.953125,0.0449116)
  24589. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  24590. =>WM: (14980: S1 ^operator O2132)
  24591. 1066: O: O2132 (predict-no)
  24592. --- END Decision Phase ---
  24593. --- Application Phase ---
  24594. --- Firing Productions (PE) For State At Depth 1 ---
  24595. --- Inner Elaboration Phase, active level 1 (S1) ---
  24596. Firing apply*operator
  24597. -->
  24598. (I3 ^predict-no N1066 + :O )
  24599. Firing apply*operator*complete
  24600. -->
  24601. (I3 ^predict-no N1065 - :O )
  24602. inner elaboration loop at bottom goal.
  24603. --- Change Working Memory (PE) ---
  24604. =>WM: (14981: I3 ^predict-no N1066)
  24605. <=WM: (14968: N1065 ^status complete)
  24606. <=WM: (14967: I3 ^predict-no N1065)
  24607. --- Firing Productions (IE) For State At Depth 1 ---
  24608. --- Inner Elaboration Phase, active level 1 (S1) ---
  24609. Firing monitor*world
  24610. -->
  24611. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24612. --- Change Working Memory (IE) ---
  24613. --- END Application Phase ---
  24614. --- Output Phase ---
  24615. ENV: Agent did: predict-no for direction R in state State-B
  24616. In State-B moving R
  24617. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24618. predict error 0
  24619. dir: dir isR
  24620. --- END Output Phase ---
  24621. /|\--- Input Phase ---
  24622. =>WM: (14985: I2 ^dir R)
  24623. =>WM: (14984: I2 ^reward 1)
  24624. =>WM: (14983: I2 ^see 0)
  24625. =>WM: (14982: N1066 ^status complete)
  24626. <=WM: (14971: I2 ^dir R)
  24627. <=WM: (14970: I2 ^reward 1)
  24628. <=WM: (14969: I2 ^see 0)
  24629. =>WM: (14986: I2 ^level-1 R0-root)
  24630. <=WM: (14972: I2 ^level-1 R0-root)
  24631. --- END Input Phase ---
  24632. --- Proposal Phase ---
  24633. --- Inner Elaboration Phase, active level 1 (S1) ---
  24634. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  24635. -->
  24636. (S1 ^operator O2131 = 0.2696941111808541)
  24637. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  24638. -->
  24639. (S1 ^operator O2132 = 0.8730231206446747)
  24640. Firing prefer*rvt*predict-no*H0*4*H1
  24641. -->
  24642. Firing prefer*rvt*predict-yes*H0*3*H1
  24643. -->
  24644. Firing elaborate*copy-see-to-output-link
  24645. -->
  24646. (I3 ^see 0 +)
  24647. Firing elaborate*reward*based*on*reward
  24648. -->
  24649. (R1070 ^value 1 +)
  24650. (R1 ^reward R1070 +)
  24651. Firing propose*predict-yes
  24652. -->
  24653. (O2133 ^name predict-yes +)
  24654. (S1 ^operator O2133 +)
  24655. Firing propose*predict-no
  24656. -->
  24657. (O2134 ^name predict-no +)
  24658. (S1 ^operator O2134 +)
  24659. Firing rl*prefer*rvt*predict-no*H0*4
  24660. -->
  24661. (S1 ^operator O2132 = 0.1269768202019067)
  24662. Firing rl*prefer*rvt*predict-yes*H0*3
  24663. -->
  24664. (S1 ^operator O2131 = 0.3829470991647237)
  24665. Firing prefer*rvt*predict-yes*H0
  24666. -->
  24667. Firing prefer*rvt*predict-no*H0
  24668. -->
  24669. Firing elaborate*copy-dir-to-output-link
  24670. -->
  24671. (I3 ^dir R +)
  24672. inner elaboration loop at bottom goal.
  24673. Retracting elaborate*copy-see-to-output-link
  24674. -->
  24675. (I3 ^see 0 +)
  24676. Retracting propose*predict-no
  24677. -->
  24678. (O2132 ^name predict-no +)
  24679. (S1 ^operator O2132 +)
  24680. Retracting propose*predict-yes
  24681. -->
  24682. (O2131 ^name predict-yes +)
  24683. (S1 ^operator O2131 +)
  24684. Retracting elaborate*reward*based*on*reward
  24685. -->
  24686. (R1069 ^value 1 +)
  24687. (R1 ^reward R1069 +)
  24688. Retracting elaborate*copy-dir-to-output-link
  24689. -->
  24690. (I3 ^dir R +)
  24691. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  24692. -->
  24693. (S1 ^operator O2132 = 0.8730231206446747)
  24694. Retracting rl*prefer*rvt*predict-no*H0*4
  24695. -->
  24696. (S1 ^operator O2132 = 0.1269768202019067)
  24697. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  24698. -->
  24699. (S1 ^operator O2131 = 0.2696941111808541)
  24700. Retracting rl*prefer*rvt*predict-yes*H0*3
  24701. -->
  24702. (S1 ^operator O2131 = 0.3829470991647237)
  24703. =>WM: (14992: S1 ^operator O2134 +)
  24704. =>WM: (14991: S1 ^operator O2133 +)
  24705. =>WM: (14990: O2134 ^name predict-no)
  24706. =>WM: (14989: O2133 ^name predict-yes)
  24707. =>WM: (14988: R1070 ^value 1)
  24708. =>WM: (14987: R1 ^reward R1070)
  24709. <=WM: (14978: S1 ^operator O2131 +)
  24710. <=WM: (14979: S1 ^operator O2132 +)
  24711. <=WM: (14980: S1 ^operator O2132)
  24712. <=WM: (14974: R1 ^reward R1069)
  24713. <=WM: (14977: O2132 ^name predict-no)
  24714. <=WM: (14976: O2131 ^name predict-yes)
  24715. <=WM: (14975: R1069 ^value 1)
  24716. --- Inner Elaboration Phase, active level 1 (S1) ---
  24717. Firing prefer*rvt*predict-yes*H0
  24718. -->
  24719. Firing rl*prefer*rvt*predict-yes*H0*3
  24720. -->
  24721. (S1 ^operator O2133 = 0.3829470991647237)
  24722. Firing prefer*rvt*predict-yes*H0*3*H1
  24723. -->
  24724. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  24725. -->
  24726. (S1 ^operator O2133 = 0.2696941111808541)
  24727. Firing prefer*rvt*predict-no*H0
  24728. -->
  24729. Firing rl*prefer*rvt*predict-no*H0*4
  24730. -->
  24731. (S1 ^operator O2134 = 0.1269768202019067)
  24732. Firing prefer*rvt*predict-no*H0*4*H1
  24733. -->
  24734. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  24735. -->
  24736. (S1 ^operator O2134 = 0.8730231206446747)
  24737. inner elaboration loop at bottom goal.
  24738. Retracting rl*prefer*rvt*predict-no*H0*4
  24739. -->
  24740. (S1 ^operator O2132 = 0.1269768202019067)
  24741. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  24742. -->
  24743. (S1 ^operator O2132 = 0.8730231206446747)
  24744. Retracting rl*prefer*rvt*predict-yes*H0*3
  24745. -->
  24746. (S1 ^operator O2131 = 0.3829470991647237)
  24747. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  24748. -->
  24749. (S1 ^operator O2131 = 0.2696941111808541)
  24750. --- END Proposal Phase ---
  24751. --- Decision Phase ---
  24752. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.953368,0.0446891)
  24753. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  24754. =>WM: (14993: S1 ^operator O2134)
  24755. 1067: O: O2134 (predict-no)
  24756. --- END Decision Phase ---
  24757. --- Application Phase ---
  24758. --- Firing Productions (PE) For State At Depth 1 ---
  24759. --- Inner Elaboration Phase, active level 1 (S1) ---
  24760. Firing apply*operator
  24761. -->
  24762. (I3 ^predict-no N1067 + :O )
  24763. Firing apply*operator*complete
  24764. -->
  24765. (I3 ^predict-no N1066 - :O )
  24766. inner elaboration loop at bottom goal.
  24767. --- Change Working Memory (PE) ---
  24768. =>WM: (14994: I3 ^predict-no N1067)
  24769. <=WM: (14982: N1066 ^status complete)
  24770. <=WM: (14981: I3 ^predict-no N1066)
  24771. --- Firing Productions (IE) For State At Depth 1 ---
  24772. --- Inner Elaboration Phase, active level 1 (S1) ---
  24773. Firing monitor*world
  24774. -->
  24775. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24776. --- Change Working Memory (IE) ---
  24777. --- END Application Phase ---
  24778. --- Output Phase ---
  24779. ENV: Agent did: predict-no for direction R in state State-B
  24780. In State-B moving R
  24781. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24782. predict error 0
  24783. dir: dir isU
  24784. --- END Output Phase ---
  24785. -/|--- Input Phase ---
  24786. =>WM: (14998: I2 ^dir U)
  24787. =>WM: (14997: I2 ^reward 1)
  24788. =>WM: (14996: I2 ^see 0)
  24789. =>WM: (14995: N1067 ^status complete)
  24790. <=WM: (14985: I2 ^dir R)
  24791. <=WM: (14984: I2 ^reward 1)
  24792. <=WM: (14983: I2 ^see 0)
  24793. =>WM: (14999: I2 ^level-1 R0-root)
  24794. <=WM: (14986: I2 ^level-1 R0-root)
  24795. --- END Input Phase ---
  24796. --- Proposal Phase ---
  24797. --- Inner Elaboration Phase, active level 1 (S1) ---
  24798. Firing elaborate*copy-see-to-output-link
  24799. -->
  24800. (I3 ^see 0 +)
  24801. Firing elaborate*reward*based*on*reward
  24802. -->
  24803. (R1071 ^value 1 +)
  24804. (R1 ^reward R1071 +)
  24805. Firing propose*predict-yes
  24806. -->
  24807. (O2135 ^name predict-yes +)
  24808. (S1 ^operator O2135 +)
  24809. Firing propose*predict-no
  24810. -->
  24811. (O2136 ^name predict-no +)
  24812. (S1 ^operator O2136 +)
  24813. Firing rl*prefer*rvt*predict-no*H0*6
  24814. -->
  24815. (S1 ^operator O2134 = 0.9999999999999999)
  24816. Firing rl*prefer*rvt*predict-yes*H0*5
  24817. -->
  24818. (S1 ^operator O2133 = 0.)
  24819. Firing prefer*rvt*predict-yes*H0
  24820. -->
  24821. Firing prefer*rvt*predict-no*H0
  24822. -->
  24823. Firing elaborate*copy-dir-to-output-link
  24824. -->
  24825. (I3 ^dir U +)
  24826. inner elaboration loop at bottom goal.
  24827. Retracting elaborate*copy-see-to-output-link
  24828. -->
  24829. (I3 ^see 0 +)
  24830. Retracting propose*predict-no
  24831. -->
  24832. (O2134 ^name predict-no +)
  24833. (S1 ^operator O2134 +)
  24834. Retracting propose*predict-yes
  24835. -->
  24836. (O2133 ^name predict-yes +)
  24837. (S1 ^operator O2133 +)
  24838. Retracting elaborate*reward*based*on*reward
  24839. -->
  24840. (R1070 ^value 1 +)
  24841. (R1 ^reward R1070 +)
  24842. Retracting elaborate*copy-dir-to-output-link
  24843. -->
  24844. (I3 ^dir R +)
  24845. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  24846. -->
  24847. (S1 ^operator O2134 = 0.8730231295176875)
  24848. Retracting rl*prefer*rvt*predict-no*H0*4
  24849. -->
  24850. (S1 ^operator O2134 = 0.1269768290749195)
  24851. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  24852. -->
  24853. (S1 ^operator O2133 = 0.2696941111808541)
  24854. Retracting rl*prefer*rvt*predict-yes*H0*3
  24855. -->
  24856. (S1 ^operator O2133 = 0.3829470991647237)
  24857. =>WM: (15006: S1 ^operator O2136 +)
  24858. =>WM: (15005: S1 ^operator O2135 +)
  24859. =>WM: (15004: I3 ^dir U)
  24860. =>WM: (15003: O2136 ^name predict-no)
  24861. =>WM: (15002: O2135 ^name predict-yes)
  24862. =>WM: (15001: R1071 ^value 1)
  24863. =>WM: (15000: R1 ^reward R1071)
  24864. <=WM: (14991: S1 ^operator O2133 +)
  24865. <=WM: (14992: S1 ^operator O2134 +)
  24866. <=WM: (14993: S1 ^operator O2134)
  24867. <=WM: (14950: I3 ^dir R)
  24868. <=WM: (14987: R1 ^reward R1070)
  24869. <=WM: (14990: O2134 ^name predict-no)
  24870. <=WM: (14989: O2133 ^name predict-yes)
  24871. <=WM: (14988: R1070 ^value 1)
  24872. --- Inner Elaboration Phase, active level 1 (S1) ---
  24873. Firing prefer*rvt*predict-yes*H0
  24874. -->
  24875. Firing rl*prefer*rvt*predict-yes*H0*5
  24876. -->
  24877. (S1 ^operator O2135 = 0.)
  24878. Firing prefer*rvt*predict-no*H0
  24879. -->
  24880. Firing rl*prefer*rvt*predict-no*H0*6
  24881. -->
  24882. (S1 ^operator O2136 = 0.9999999999999999)
  24883. inner elaboration loop at bottom goal.
  24884. Retracting rl*prefer*rvt*predict-no*H0*6
  24885. -->
  24886. (S1 ^operator O2134 = 0.9999999999999999)
  24887. Retracting rl*prefer*rvt*predict-yes*H0*5
  24888. -->
  24889. (S1 ^operator O2133 = 0.)
  24890. --- END Proposal Phase ---
  24891. --- Decision Phase ---
  24892. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.953608,0.0444688)
  24893. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  24894. =>WM: (15007: S1 ^operator O2136)
  24895. 1068: O: O2136 (predict-no)
  24896. --- END Decision Phase ---
  24897. --- Application Phase ---
  24898. --- Firing Productions (PE) For State At Depth 1 ---
  24899. --- Inner Elaboration Phase, active level 1 (S1) ---
  24900. Firing apply*operator
  24901. -->
  24902. (I3 ^predict-no N1068 + :O )
  24903. Firing apply*operator*complete
  24904. -->
  24905. (I3 ^predict-no N1067 - :O )
  24906. inner elaboration loop at bottom goal.
  24907. --- Change Working Memory (PE) ---
  24908. =>WM: (15008: I3 ^predict-no N1068)
  24909. <=WM: (14995: N1067 ^status complete)
  24910. <=WM: (14994: I3 ^predict-no N1067)
  24911. --- Firing Productions (IE) For State At Depth 1 ---
  24912. --- Inner Elaboration Phase, active level 1 (S1) ---
  24913. Firing monitor*world
  24914. -->
  24915. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24916. --- Change Working Memory (IE) ---
  24917. --- END Application Phase ---
  24918. --- Output Phase ---
  24919. ENV: Agent did: predict-no for direction U in state State-B
  24920. In State-B moving U
  24921. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24922. predict error 0
  24923. dir: dir isR
  24924. --- END Output Phase ---
  24925. \-/--- Input Phase ---
  24926. =>WM: (15012: I2 ^dir R)
  24927. =>WM: (15011: I2 ^reward 1)
  24928. =>WM: (15010: I2 ^see 0)
  24929. =>WM: (15009: N1068 ^status complete)
  24930. <=WM: (14998: I2 ^dir U)
  24931. <=WM: (14997: I2 ^reward 1)
  24932. <=WM: (14996: I2 ^see 0)
  24933. =>WM: (15013: I2 ^level-1 R0-root)
  24934. <=WM: (14999: I2 ^level-1 R0-root)
  24935. --- END Input Phase ---
  24936. --- Proposal Phase ---
  24937. --- Inner Elaboration Phase, active level 1 (S1) ---
  24938. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  24939. -->
  24940. (S1 ^operator O2135 = 0.2696941111808541)
  24941. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  24942. -->
  24943. (S1 ^operator O2136 = 0.8730231357287965)
  24944. Firing prefer*rvt*predict-no*H0*4*H1
  24945. -->
  24946. Firing prefer*rvt*predict-yes*H0*3*H1
  24947. -->
  24948. Firing elaborate*copy-see-to-output-link
  24949. -->
  24950. (I3 ^see 0 +)
  24951. Firing elaborate*reward*based*on*reward
  24952. -->
  24953. (R1072 ^value 1 +)
  24954. (R1 ^reward R1072 +)
  24955. Firing propose*predict-yes
  24956. -->
  24957. (O2137 ^name predict-yes +)
  24958. (S1 ^operator O2137 +)
  24959. Firing propose*predict-no
  24960. -->
  24961. (O2138 ^name predict-no +)
  24962. (S1 ^operator O2138 +)
  24963. Firing rl*prefer*rvt*predict-no*H0*4
  24964. -->
  24965. (S1 ^operator O2136 = 0.1269768352860284)
  24966. Firing rl*prefer*rvt*predict-yes*H0*3
  24967. -->
  24968. (S1 ^operator O2135 = 0.3829470991647237)
  24969. Firing prefer*rvt*predict-yes*H0
  24970. -->
  24971. Firing prefer*rvt*predict-no*H0
  24972. -->
  24973. Firing elaborate*copy-dir-to-output-link
  24974. -->
  24975. (I3 ^dir R +)
  24976. inner elaboration loop at bottom goal.
  24977. Retracting elaborate*copy-see-to-output-link
  24978. -->
  24979. (I3 ^see 0 +)
  24980. Retracting propose*predict-no
  24981. -->
  24982. (O2136 ^name predict-no +)
  24983. (S1 ^operator O2136 +)
  24984. Retracting propose*predict-yes
  24985. -->
  24986. (O2135 ^name predict-yes +)
  24987. (S1 ^operator O2135 +)
  24988. Retracting elaborate*reward*based*on*reward
  24989. -->
  24990. (R1071 ^value 1 +)
  24991. (R1 ^reward R1071 +)
  24992. Retracting elaborate*copy-dir-to-output-link
  24993. -->
  24994. (I3 ^dir U +)
  24995. Retracting rl*prefer*rvt*predict-no*H0*6
  24996. -->
  24997. (S1 ^operator O2136 = 0.9999999999999999)
  24998. Retracting rl*prefer*rvt*predict-yes*H0*5
  24999. -->
  25000. (S1 ^operator O2135 = 0.)
  25001. =>WM: (15020: S1 ^operator O2138 +)
  25002. =>WM: (15019: S1 ^operator O2137 +)
  25003. =>WM: (15018: I3 ^dir R)
  25004. =>WM: (15017: O2138 ^name predict-no)
  25005. =>WM: (15016: O2137 ^name predict-yes)
  25006. =>WM: (15015: R1072 ^value 1)
  25007. =>WM: (15014: R1 ^reward R1072)
  25008. <=WM: (15005: S1 ^operator O2135 +)
  25009. <=WM: (15006: S1 ^operator O2136 +)
  25010. <=WM: (15007: S1 ^operator O2136)
  25011. <=WM: (15004: I3 ^dir U)
  25012. <=WM: (15000: R1 ^reward R1071)
  25013. <=WM: (15003: O2136 ^name predict-no)
  25014. <=WM: (15002: O2135 ^name predict-yes)
  25015. <=WM: (15001: R1071 ^value 1)
  25016. --- Inner Elaboration Phase, active level 1 (S1) ---
  25017. Firing prefer*rvt*predict-yes*H0
  25018. -->
  25019. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  25020. -->
  25021. (S1 ^operator O2137 = 0.2696941111808541)
  25022. Firing rl*prefer*rvt*predict-yes*H0*3
  25023. -->
  25024. (S1 ^operator O2137 = 0.3829470991647237)
  25025. Firing prefer*rvt*predict-yes*H0*3*H1
  25026. -->
  25027. Firing prefer*rvt*predict-no*H0
  25028. -->
  25029. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  25030. -->
  25031. (S1 ^operator O2138 = 0.8730231357287965)
  25032. Firing rl*prefer*rvt*predict-no*H0*4
  25033. -->
  25034. (S1 ^operator O2138 = 0.1269768352860284)
  25035. Firing prefer*rvt*predict-no*H0*4*H1
  25036. -->
  25037. inner elaboration loop at bottom goal.
  25038. Retracting rl*prefer*rvt*predict-no*H0*4
  25039. -->
  25040. (S1 ^operator O2136 = 0.1269768352860284)
  25041. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  25042. -->
  25043. (S1 ^operator O2136 = 0.8730231357287965)
  25044. Retracting rl*prefer*rvt*predict-yes*H0*3
  25045. -->
  25046. (S1 ^operator O2135 = 0.3829470991647237)
  25047. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  25048. -->
  25049. (S1 ^operator O2135 = 0.2696941111808541)
  25050. --- END Proposal Phase ---
  25051. --- Decision Phase ---
  25052. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  25053. =>WM: (15021: S1 ^operator O2138)
  25054. 1069: O: O2138 (predict-no)
  25055. --- END Decision Phase ---
  25056. --- Application Phase ---
  25057. --- Firing Productions (PE) For State At Depth 1 ---
  25058. --- Inner Elaboration Phase, active level 1 (S1) ---
  25059. Firing apply*operator
  25060. -->
  25061. (I3 ^predict-no N1069 + :O )
  25062. Firing apply*operator*complete
  25063. -->
  25064. (I3 ^predict-no N1068 - :O )
  25065. inner elaboration loop at bottom goal.
  25066. --- Change Working Memory (PE) ---
  25067. =>WM: (15022: I3 ^predict-no N1069)
  25068. <=WM: (15009: N1068 ^status complete)
  25069. <=WM: (15008: I3 ^predict-no N1068)
  25070. --- Firing Productions (IE) For State At Depth 1 ---
  25071. --- Inner Elaboration Phase, active level 1 (S1) ---
  25072. Firing monitor*world
  25073. -->
  25074. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25075. --- Change Working Memory (IE) ---
  25076. --- END Application Phase ---
  25077. --- Output Phase ---
  25078. ENV: Agent did: predict-no for direction R in state State-B
  25079. In State-B moving R
  25080. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25081. predict error 0
  25082. dir: dir isL
  25083. --- END Output Phase ---
  25084. |\---- Input Phase ---
  25085. =>WM: (15026: I2 ^dir L)
  25086. =>WM: (15025: I2 ^reward 1)
  25087. =>WM: (15024: I2 ^see 0)
  25088. =>WM: (15023: N1069 ^status complete)
  25089. <=WM: (15012: I2 ^dir R)
  25090. <=WM: (15011: I2 ^reward 1)
  25091. <=WM: (15010: I2 ^see 0)
  25092. =>WM: (15027: I2 ^level-1 R0-root)
  25093. <=WM: (15013: I2 ^level-1 R0-root)
  25094. --- END Input Phase ---
  25095. --- Proposal Phase ---
  25096. --- Inner Elaboration Phase, active level 1 (S1) ---
  25097. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  25098. -->
  25099. (S1 ^operator O2137 = 0.4768810903711926)
  25100. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  25101. -->
  25102. (S1 ^operator O2138 = 0.1700769046561409)
  25103. Firing prefer*rvt*predict-no*H0*2*H1
  25104. -->
  25105. Firing prefer*rvt*predict-yes*H0*1*H1
  25106. -->
  25107. Firing elaborate*copy-see-to-output-link
  25108. -->
  25109. (I3 ^see 0 +)
  25110. Firing elaborate*reward*based*on*reward
  25111. -->
  25112. (R1073 ^value 1 +)
  25113. (R1 ^reward R1073 +)
  25114. Firing propose*predict-yes
  25115. -->
  25116. (O2139 ^name predict-yes +)
  25117. (S1 ^operator O2139 +)
  25118. Firing propose*predict-no
  25119. -->
  25120. (O2140 ^name predict-no +)
  25121. (S1 ^operator O2140 +)
  25122. Firing rl*prefer*rvt*predict-no*H0*2
  25123. -->
  25124. (S1 ^operator O2138 = 0.2550133912230119)
  25125. Firing rl*prefer*rvt*predict-yes*H0*1
  25126. -->
  25127. (S1 ^operator O2137 = 0.5231193621229728)
  25128. Firing prefer*rvt*predict-yes*H0
  25129. -->
  25130. Firing prefer*rvt*predict-no*H0
  25131. -->
  25132. Firing elaborate*copy-dir-to-output-link
  25133. -->
  25134. (I3 ^dir L +)
  25135. inner elaboration loop at bottom goal.
  25136. Retracting elaborate*copy-see-to-output-link
  25137. -->
  25138. (I3 ^see 0 +)
  25139. Retracting propose*predict-no
  25140. -->
  25141. (O2138 ^name predict-no +)
  25142. (S1 ^operator O2138 +)
  25143. Retracting propose*predict-yes
  25144. -->
  25145. (O2137 ^name predict-yes +)
  25146. (S1 ^operator O2137 +)
  25147. Retracting elaborate*reward*based*on*reward
  25148. -->
  25149. (R1072 ^value 1 +)
  25150. (R1 ^reward R1072 +)
  25151. Retracting elaborate*copy-dir-to-output-link
  25152. -->
  25153. (I3 ^dir R +)
  25154. Retracting rl*prefer*rvt*predict-no*H0*4
  25155. -->
  25156. (S1 ^operator O2138 = 0.1269768352860284)
  25157. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  25158. -->
  25159. (S1 ^operator O2138 = 0.8730231357287965)
  25160. Retracting rl*prefer*rvt*predict-yes*H0*3
  25161. -->
  25162. (S1 ^operator O2137 = 0.3829470991647237)
  25163. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  25164. -->
  25165. (S1 ^operator O2137 = 0.2696941111808541)
  25166. =>WM: (15034: S1 ^operator O2140 +)
  25167. =>WM: (15033: S1 ^operator O2139 +)
  25168. =>WM: (15032: I3 ^dir L)
  25169. =>WM: (15031: O2140 ^name predict-no)
  25170. =>WM: (15030: O2139 ^name predict-yes)
  25171. =>WM: (15029: R1073 ^value 1)
  25172. =>WM: (15028: R1 ^reward R1073)
  25173. <=WM: (15019: S1 ^operator O2137 +)
  25174. <=WM: (15020: S1 ^operator O2138 +)
  25175. <=WM: (15021: S1 ^operator O2138)
  25176. <=WM: (15018: I3 ^dir R)
  25177. <=WM: (15014: R1 ^reward R1072)
  25178. <=WM: (15017: O2138 ^name predict-no)
  25179. <=WM: (15016: O2137 ^name predict-yes)
  25180. <=WM: (15015: R1072 ^value 1)
  25181. --- Inner Elaboration Phase, active level 1 (S1) ---
  25182. Firing prefer*rvt*predict-yes*H0
  25183. -->
  25184. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  25185. -->
  25186. (S1 ^operator O2139 = 0.4768810903711926)
  25187. Firing rl*prefer*rvt*predict-yes*H0*1
  25188. -->
  25189. (S1 ^operator O2139 = 0.5231193621229728)
  25190. Firing prefer*rvt*predict-yes*H0*1*H1
  25191. -->
  25192. Firing prefer*rvt*predict-no*H0
  25193. -->
  25194. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  25195. -->
  25196. (S1 ^operator O2140 = 0.1700769046561409)
  25197. Firing rl*prefer*rvt*predict-no*H0*2
  25198. -->
  25199. (S1 ^operator O2140 = 0.2550133912230119)
  25200. Firing prefer*rvt*predict-no*H0*2*H1
  25201. -->
  25202. inner elaboration loop at bottom goal.
  25203. Retracting rl*prefer*rvt*predict-no*H0*2
  25204. -->
  25205. (S1 ^operator O2138 = 0.2550133912230119)
  25206. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  25207. -->
  25208. (S1 ^operator O2138 = 0.1700769046561409)
  25209. Retracting rl*prefer*rvt*predict-yes*H0*1
  25210. -->
  25211. (S1 ^operator O2137 = 0.5231193621229728)
  25212. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  25213. -->
  25214. (S1 ^operator O2137 = 0.4768810903711926)
  25215. --- END Proposal Phase ---
  25216. --- Decision Phase ---
  25217. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.953846,0.0442506)
  25218. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  25219. =>WM: (15035: S1 ^operator O2139)
  25220. 1070: O: O2139 (predict-yes)
  25221. --- END Decision Phase ---
  25222. --- Application Phase ---
  25223. --- Firing Productions (PE) For State At Depth 1 ---
  25224. --- Inner Elaboration Phase, active level 1 (S1) ---
  25225. Firing apply*operator
  25226. -->
  25227. (I3 ^predict-yes N1070 + :O )
  25228. Firing apply*operator*complete
  25229. -->
  25230. (I3 ^predict-no N1069 - :O )
  25231. inner elaboration loop at bottom goal.
  25232. --- Change Working Memory (PE) ---
  25233. =>WM: (15036: I3 ^predict-yes N1070)
  25234. <=WM: (15023: N1069 ^status complete)
  25235. <=WM: (15022: I3 ^predict-no N1069)
  25236. --- Firing Productions (IE) For State At Depth 1 ---
  25237. --- Inner Elaboration Phase, active level 1 (S1) ---
  25238. Firing monitor*world
  25239. -->
  25240. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25241. --- Change Working Memory (IE) ---
  25242. --- END Application Phase ---
  25243. --- Output Phase ---
  25244. ENV: Agent did: predict-yes for direction L in state State-B
  25245. In State-B moving L
  25246. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25247. predict error 0
  25248. dir: dir isL
  25249. --- END Output Phase ---
  25250. /|--- Input Phase ---
  25251. =>WM: (15040: I2 ^dir L)
  25252. =>WM: (15039: I2 ^reward 1)
  25253. =>WM: (15038: I2 ^see 1)
  25254. =>WM: (15037: N1070 ^status complete)
  25255. <=WM: (15026: I2 ^dir L)
  25256. <=WM: (15025: I2 ^reward 1)
  25257. <=WM: (15024: I2 ^see 0)
  25258. =>WM: (15041: I2 ^level-1 L1-root)
  25259. <=WM: (15027: I2 ^level-1 R0-root)
  25260. --- END Input Phase ---
  25261. --- Proposal Phase ---
  25262. --- Inner Elaboration Phase, active level 1 (S1) ---
  25263. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  25264. -->
  25265. (S1 ^operator O2139 = 0.1693592933936033)
  25266. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  25267. -->
  25268. (S1 ^operator O2140 = 0.744986509061307)
  25269. Firing prefer*rvt*predict-no*H0*2*H1
  25270. -->
  25271. Firing prefer*rvt*predict-yes*H0*1*H1
  25272. -->
  25273. Firing elaborate*copy-see-to-output-link
  25274. -->
  25275. (I3 ^see 1 +)
  25276. Firing elaborate*reward*based*on*reward
  25277. -->
  25278. (R1074 ^value 1 +)
  25279. (R1 ^reward R1074 +)
  25280. Firing propose*predict-yes
  25281. -->
  25282. (O2141 ^name predict-yes +)
  25283. (S1 ^operator O2141 +)
  25284. Firing propose*predict-no
  25285. -->
  25286. (O2142 ^name predict-no +)
  25287. (S1 ^operator O2142 +)
  25288. Firing rl*prefer*rvt*predict-no*H0*2
  25289. -->
  25290. (S1 ^operator O2140 = 0.2550133912230119)
  25291. Firing rl*prefer*rvt*predict-yes*H0*1
  25292. -->
  25293. (S1 ^operator O2139 = 0.5231193621229728)
  25294. Firing prefer*rvt*predict-yes*H0
  25295. -->
  25296. Firing prefer*rvt*predict-no*H0
  25297. -->
  25298. Firing elaborate*copy-dir-to-output-link
  25299. -->
  25300. (I3 ^dir L +)
  25301. inner elaboration loop at bottom goal.
  25302. Retracting elaborate*copy-see-to-output-link
  25303. -->
  25304. (I3 ^see 0 +)
  25305. Retracting propose*predict-no
  25306. -->
  25307. (O2140 ^name predict-no +)
  25308. (S1 ^operator O2140 +)
  25309. Retracting propose*predict-yes
  25310. -->
  25311. (O2139 ^name predict-yes +)
  25312. (S1 ^operator O2139 +)
  25313. Retracting elaborate*reward*based*on*reward
  25314. -->
  25315. (R1073 ^value 1 +)
  25316. (R1 ^reward R1073 +)
  25317. Retracting elaborate*copy-dir-to-output-link
  25318. -->
  25319. (I3 ^dir L +)
  25320. Retracting rl*prefer*rvt*predict-no*H0*2
  25321. -->
  25322. (S1 ^operator O2140 = 0.2550133912230119)
  25323. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  25324. -->
  25325. (S1 ^operator O2140 = 0.1700769046561409)
  25326. Retracting rl*prefer*rvt*predict-yes*H0*1
  25327. -->
  25328. (S1 ^operator O2139 = 0.5231193621229728)
  25329. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  25330. -->
  25331. (S1 ^operator O2139 = 0.4768810903711926)
  25332. =>WM: (15048: S1 ^operator O2142 +)
  25333. =>WM: (15047: S1 ^operator O2141 +)
  25334. =>WM: (15046: O2142 ^name predict-no)
  25335. =>WM: (15045: O2141 ^name predict-yes)
  25336. =>WM: (15044: R1074 ^value 1)
  25337. =>WM: (15043: R1 ^reward R1074)
  25338. =>WM: (15042: I3 ^see 1)
  25339. <=WM: (15033: S1 ^operator O2139 +)
  25340. <=WM: (15035: S1 ^operator O2139)
  25341. <=WM: (15034: S1 ^operator O2140 +)
  25342. <=WM: (15028: R1 ^reward R1073)
  25343. <=WM: (14973: I3 ^see 0)
  25344. <=WM: (15031: O2140 ^name predict-no)
  25345. <=WM: (15030: O2139 ^name predict-yes)
  25346. <=WM: (15029: R1073 ^value 1)
  25347. --- Inner Elaboration Phase, active level 1 (S1) ---
  25348. Firing prefer*rvt*predict-yes*H0
  25349. -->
  25350. Firing rl*prefer*rvt*predict-yes*H0*1
  25351. -->
  25352. (S1 ^operator O2141 = 0.5231193621229728)
  25353. Firing prefer*rvt*predict-yes*H0*1*H1
  25354. -->
  25355. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  25356. -->
  25357. (S1 ^operator O2141 = 0.1693592933936033)
  25358. Firing prefer*rvt*predict-no*H0
  25359. -->
  25360. Firing rl*prefer*rvt*predict-no*H0*2
  25361. -->
  25362. (S1 ^operator O2142 = 0.2550133912230119)
  25363. Firing prefer*rvt*predict-no*H0*2*H1
  25364. -->
  25365. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  25366. -->
  25367. (S1 ^operator O2142 = 0.744986509061307)
  25368. inner elaboration loop at bottom goal.
  25369. Retracting rl*prefer*rvt*predict-no*H0*2
  25370. -->
  25371. (S1 ^operator O2140 = 0.2550133912230119)
  25372. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  25373. -->
  25374. (S1 ^operator O2140 = 0.744986509061307)
  25375. Retracting rl*prefer*rvt*predict-yes*H0*1
  25376. -->
  25377. (S1 ^operator O2139 = 0.5231193621229728)
  25378. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  25379. -->
  25380. (S1 ^operator O2139 = 0.1693592933936033)
  25381. --- END Proposal Phase ---
  25382. --- Decision Phase ---
  25383. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980519,0.0192259)
  25384. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272042 0.20484 0.476881 -> 0.272041 0.20484 0.476881(R,m,v=1,1,0)
  25385. =>WM: (15049: S1 ^operator O2142)
  25386. 1071: O: O2142 (predict-no)
  25387. --- END Decision Phase ---
  25388. --- Application Phase ---
  25389. --- Firing Productions (PE) For State At Depth 1 ---
  25390. --- Inner Elaboration Phase, active level 1 (S1) ---
  25391. Firing apply*operator
  25392. -->
  25393. (I3 ^predict-no N1071 + :O )
  25394. Firing apply*operator*complete
  25395. -->
  25396. (I3 ^predict-yes N1070 - :O )
  25397. inner elaboration loop at bottom goal.
  25398. --- Change Working Memory (PE) ---
  25399. =>WM: (15050: I3 ^predict-no N1071)
  25400. <=WM: (15037: N1070 ^status complete)
  25401. <=WM: (15036: I3 ^predict-yes N1070)
  25402. --- Firing Productions (IE) For State At Depth 1 ---
  25403. --- Inner Elaboration Phase, active level 1 (S1) ---
  25404. Firing monitor*world
  25405. -->
  25406. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25407. --- Change Working Memory (IE) ---
  25408. --- END Application Phase ---
  25409. --- Output Phase ---
  25410. ENV: Agent did: predict-no for direction L in state State-A
  25411. In State-A moving L
  25412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25413. predict error 0
  25414. dir: dir isR
  25415. --- END Output Phase ---
  25416. \--- Input Phase ---
  25417. =>WM: (15054: I2 ^dir R)
  25418. =>WM: (15053: I2 ^reward 1)
  25419. =>WM: (15052: I2 ^see 0)
  25420. =>WM: (15051: N1071 ^status complete)
  25421. <=WM: (15040: I2 ^dir L)
  25422. <=WM: (15039: I2 ^reward 1)
  25423. <=WM: (15038: I2 ^see 1)
  25424. =>WM: (15055: I2 ^level-1 L0-root)
  25425. <=WM: (15041: I2 ^level-1 L1-root)
  25426. --- END Input Phase ---
  25427. --- Proposal Phase ---
  25428. --- Inner Elaboration Phase, active level 1 (S1) ---
  25429. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  25430. -->
  25431. (S1 ^operator O2141 = 0.6170704303704048)
  25432. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  25433. -->
  25434. (S1 ^operator O2142 = 0.4910065094545203)
  25435. Firing prefer*rvt*predict-no*H0*4*H1
  25436. -->
  25437. Firing prefer*rvt*predict-yes*H0*3*H1
  25438. -->
  25439. Firing elaborate*copy-see-to-output-link
  25440. -->
  25441. (I3 ^see 0 +)
  25442. Firing elaborate*reward*based*on*reward
  25443. -->
  25444. (R1075 ^value 1 +)
  25445. (R1 ^reward R1075 +)
  25446. Firing propose*predict-yes
  25447. -->
  25448. (O2143 ^name predict-yes +)
  25449. (S1 ^operator O2143 +)
  25450. Firing propose*predict-no
  25451. -->
  25452. (O2144 ^name predict-no +)
  25453. (S1 ^operator O2144 +)
  25454. Firing rl*prefer*rvt*predict-no*H0*4
  25455. -->
  25456. (S1 ^operator O2142 = 0.1269768396338047)
  25457. Firing rl*prefer*rvt*predict-yes*H0*3
  25458. -->
  25459. (S1 ^operator O2141 = 0.3829470991647237)
  25460. Firing prefer*rvt*predict-yes*H0
  25461. -->
  25462. Firing prefer*rvt*predict-no*H0
  25463. -->
  25464. Firing elaborate*copy-dir-to-output-link
  25465. -->
  25466. (I3 ^dir R +)
  25467. inner elaboration loop at bottom goal.
  25468. Retracting elaborate*copy-see-to-output-link
  25469. -->
  25470. (I3 ^see 1 +)
  25471. Retracting propose*predict-no
  25472. -->
  25473. (O2142 ^name predict-no +)
  25474. (S1 ^operator O2142 +)
  25475. Retracting propose*predict-yes
  25476. -->
  25477. (O2141 ^name predict-yes +)
  25478. (S1 ^operator O2141 +)
  25479. Retracting elaborate*reward*based*on*reward
  25480. -->
  25481. (R1074 ^value 1 +)
  25482. (R1 ^reward R1074 +)
  25483. Retracting elaborate*copy-dir-to-output-link
  25484. -->
  25485. (I3 ^dir L +)
  25486. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  25487. -->
  25488. (S1 ^operator O2142 = 0.744986509061307)
  25489. Retracting rl*prefer*rvt*predict-no*H0*2
  25490. -->
  25491. (S1 ^operator O2142 = 0.2550133912230119)
  25492. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  25493. -->
  25494. (S1 ^operator O2141 = 0.1693592933936033)
  25495. Retracting rl*prefer*rvt*predict-yes*H0*1
  25496. -->
  25497. (S1 ^operator O2141 = 0.523119294248848)
  25498. =>WM: (15063: S1 ^operator O2144 +)
  25499. =>WM: (15062: S1 ^operator O2143 +)
  25500. =>WM: (15061: I3 ^dir R)
  25501. =>WM: (15060: O2144 ^name predict-no)
  25502. =>WM: (15059: O2143 ^name predict-yes)
  25503. =>WM: (15058: R1075 ^value 1)
  25504. =>WM: (15057: R1 ^reward R1075)
  25505. =>WM: (15056: I3 ^see 0)
  25506. <=WM: (15047: S1 ^operator O2141 +)
  25507. <=WM: (15048: S1 ^operator O2142 +)
  25508. <=WM: (15049: S1 ^operator O2142)
  25509. <=WM: (15032: I3 ^dir L)
  25510. <=WM: (15043: R1 ^reward R1074)
  25511. <=WM: (15042: I3 ^see 1)
  25512. <=WM: (15046: O2142 ^name predict-no)
  25513. <=WM: (15045: O2141 ^name predict-yes)
  25514. <=WM: (15044: R1074 ^value 1)
  25515. --- Inner Elaboration Phase, active level 1 (S1) ---
  25516. Firing prefer*rvt*predict-yes*H0
  25517. -->
  25518. Firing rl*prefer*rvt*predict-yes*H0*3
  25519. -->
  25520. (S1 ^operator O2143 = 0.3829470991647237)
  25521. Firing prefer*rvt*predict-yes*H0*3*H1
  25522. -->
  25523. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  25524. -->
  25525. (S1 ^operator O2143 = 0.6170704303704048)
  25526. Firing prefer*rvt*predict-no*H0
  25527. -->
  25528. Firing rl*prefer*rvt*predict-no*H0*4
  25529. -->
  25530. (S1 ^operator O2144 = 0.1269768396338047)
  25531. Firing prefer*rvt*predict-no*H0*4*H1
  25532. -->
  25533. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  25534. -->
  25535. (S1 ^operator O2144 = 0.4910065094545203)
  25536. inner elaboration loop at bottom goal.
  25537. Retracting rl*prefer*rvt*predict-no*H0*4
  25538. -->
  25539. (S1 ^operator O2142 = 0.1269768396338047)
  25540. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  25541. -->
  25542. (S1 ^operator O2142 = 0.4910065094545203)
  25543. Retracting rl*prefer*rvt*predict-yes*H0*3
  25544. -->
  25545. (S1 ^operator O2141 = 0.3829470991647237)
  25546. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  25547. -->
  25548. (S1 ^operator O2141 = 0.6170704303704048)
  25549. --- END Proposal Phase ---
  25550. --- Decision Phase ---
  25551. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.920398,0.0736318)
  25552. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  25553. =>WM: (15064: S1 ^operator O2143)
  25554. 1072: O: O2143 (predict-yes)
  25555. --- END Decision Phase ---
  25556. --- Application Phase ---
  25557. --- Firing Productions (PE) For State At Depth 1 ---
  25558. --- Inner Elaboration Phase, active level 1 (S1) ---
  25559. Firing apply*operator
  25560. -->
  25561. (I3 ^predict-yes N1072 + :O )
  25562. Firing apply*operator*complete
  25563. -->
  25564. (I3 ^predict-no N1071 - :O )
  25565. inner elaboration loop at bottom goal.
  25566. --- Change Working Memory (PE) ---
  25567. =>WM: (15065: I3 ^predict-yes N1072)
  25568. <=WM: (15051: N1071 ^status complete)
  25569. <=WM: (15050: I3 ^predict-no N1071)
  25570. --- Firing Productions (IE) For State At Depth 1 ---
  25571. --- Inner Elaboration Phase, active level 1 (S1) ---
  25572. Firing monitor*world
  25573. -->
  25574. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25575. --- Change Working Memory (IE) ---
  25576. --- END Application Phase ---
  25577. --- Output Phase ---
  25578. ENV: Agent did: predict-yes for direction R in state State-A
  25579. In State-A moving R
  25580. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25581. predict error 0
  25582. dir: dir isR
  25583. --- END Output Phase ---
  25584. -/|--- Input Phase ---
  25585. =>WM: (15069: I2 ^dir R)
  25586. =>WM: (15068: I2 ^reward 1)
  25587. =>WM: (15067: I2 ^see 1)
  25588. =>WM: (15066: N1072 ^status complete)
  25589. <=WM: (15054: I2 ^dir R)
  25590. <=WM: (15053: I2 ^reward 1)
  25591. <=WM: (15052: I2 ^see 0)
  25592. =>WM: (15070: I2 ^level-1 R1-root)
  25593. <=WM: (15055: I2 ^level-1 L0-root)
  25594. --- END Input Phase ---
  25595. --- Proposal Phase ---
  25596. --- Inner Elaboration Phase, active level 1 (S1) ---
  25597. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  25598. -->
  25599. (S1 ^operator O2143 = 0.08783148430849691)
  25600. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  25601. -->
  25602. (S1 ^operator O2144 = 0.8730232299561043)
  25603. Firing prefer*rvt*predict-no*H0*4*H1
  25604. -->
  25605. Firing prefer*rvt*predict-yes*H0*3*H1
  25606. -->
  25607. Firing elaborate*copy-see-to-output-link
  25608. -->
  25609. (I3 ^see 1 +)
  25610. Firing elaborate*reward*based*on*reward
  25611. -->
  25612. (R1076 ^value 1 +)
  25613. (R1 ^reward R1076 +)
  25614. Firing propose*predict-yes
  25615. -->
  25616. (O2145 ^name predict-yes +)
  25617. (S1 ^operator O2145 +)
  25618. Firing propose*predict-no
  25619. -->
  25620. (O2146 ^name predict-no +)
  25621. (S1 ^operator O2146 +)
  25622. Firing rl*prefer*rvt*predict-no*H0*4
  25623. -->
  25624. (S1 ^operator O2144 = 0.1269768396338047)
  25625. Firing rl*prefer*rvt*predict-yes*H0*3
  25626. -->
  25627. (S1 ^operator O2143 = 0.3829470991647237)
  25628. Firing prefer*rvt*predict-yes*H0
  25629. -->
  25630. Firing prefer*rvt*predict-no*H0
  25631. -->
  25632. Firing elaborate*copy-dir-to-output-link
  25633. -->
  25634. (I3 ^dir R +)
  25635. inner elaboration loop at bottom goal.
  25636. Retracting elaborate*copy-see-to-output-link
  25637. -->
  25638. (I3 ^see 0 +)
  25639. Retracting propose*predict-no
  25640. -->
  25641. (O2144 ^name predict-no +)
  25642. (S1 ^operator O2144 +)
  25643. Retracting propose*predict-yes
  25644. -->
  25645. (O2143 ^name predict-yes +)
  25646. (S1 ^operator O2143 +)
  25647. Retracting elaborate*reward*based*on*reward
  25648. -->
  25649. (R1075 ^value 1 +)
  25650. (R1 ^reward R1075 +)
  25651. Retracting elaborate*copy-dir-to-output-link
  25652. -->
  25653. (I3 ^dir R +)
  25654. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  25655. -->
  25656. (S1 ^operator O2144 = 0.4910065094545203)
  25657. Retracting rl*prefer*rvt*predict-no*H0*4
  25658. -->
  25659. (S1 ^operator O2144 = 0.1269768396338047)
  25660. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  25661. -->
  25662. (S1 ^operator O2143 = 0.6170704303704048)
  25663. Retracting rl*prefer*rvt*predict-yes*H0*3
  25664. -->
  25665. (S1 ^operator O2143 = 0.3829470991647237)
  25666. =>WM: (15077: S1 ^operator O2146 +)
  25667. =>WM: (15076: S1 ^operator O2145 +)
  25668. =>WM: (15075: O2146 ^name predict-no)
  25669. =>WM: (15074: O2145 ^name predict-yes)
  25670. =>WM: (15073: R1076 ^value 1)
  25671. =>WM: (15072: R1 ^reward R1076)
  25672. =>WM: (15071: I3 ^see 1)
  25673. <=WM: (15062: S1 ^operator O2143 +)
  25674. <=WM: (15064: S1 ^operator O2143)
  25675. <=WM: (15063: S1 ^operator O2144 +)
  25676. <=WM: (15057: R1 ^reward R1075)
  25677. <=WM: (15056: I3 ^see 0)
  25678. <=WM: (15060: O2144 ^name predict-no)
  25679. <=WM: (15059: O2143 ^name predict-yes)
  25680. <=WM: (15058: R1075 ^value 1)
  25681. --- Inner Elaboration Phase, active level 1 (S1) ---
  25682. Firing prefer*rvt*predict-yes*H0
  25683. -->
  25684. Firing rl*prefer*rvt*predict-yes*H0*3
  25685. -->
  25686. (S1 ^operator O2145 = 0.3829470991647237)
  25687. Firing prefer*rvt*predict-yes*H0*3*H1
  25688. -->
  25689. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  25690. -->
  25691. (S1 ^operator O2145 = 0.08783148430849691)
  25692. Firing prefer*rvt*predict-no*H0
  25693. -->
  25694. Firing rl*prefer*rvt*predict-no*H0*4
  25695. -->
  25696. (S1 ^operator O2146 = 0.1269768396338047)
  25697. Firing prefer*rvt*predict-no*H0*4*H1
  25698. -->
  25699. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  25700. -->
  25701. (S1 ^operator O2146 = 0.8730232299561043)
  25702. inner elaboration loop at bottom goal.
  25703. Retracting rl*prefer*rvt*predict-no*H0*4
  25704. -->
  25705. (S1 ^operator O2144 = 0.1269768396338047)
  25706. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  25707. -->
  25708. (S1 ^operator O2144 = 0.8730232299561043)
  25709. Retracting rl*prefer*rvt*predict-yes*H0*3
  25710. -->
  25711. (S1 ^operator O2143 = 0.3829470991647237)
  25712. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  25713. -->
  25714. (S1 ^operator O2143 = 0.08783148430849691)
  25715. --- END Proposal Phase ---
  25716. --- Decision Phase ---
  25717. RL update rl*prefer*rvt*predict-yes*H0*3 0.67314 -0.290193 0.382947 -> 0.673137 -0.290193 0.382944(R,m,v=1,0.963636,0.035255)
  25718. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326876 0.290194 0.61707 -> 0.326874 0.290194 0.617068(R,m,v=1,1,0)
  25719. =>WM: (15078: S1 ^operator O2146)
  25720. 1073: O: O2146 (predict-no)
  25721. --- END Decision Phase ---
  25722. --- Application Phase ---
  25723. --- Firing Productions (PE) For State At Depth 1 ---
  25724. --- Inner Elaboration Phase, active level 1 (S1) ---
  25725. Firing apply*operator
  25726. -->
  25727. (I3 ^predict-no N1073 + :O )
  25728. Firing apply*operator*complete
  25729. -->
  25730. (I3 ^predict-yes N1072 - :O )
  25731. inner elaboration loop at bottom goal.
  25732. --- Change Working Memory (PE) ---
  25733. =>WM: (15079: I3 ^predict-no N1073)
  25734. <=WM: (15066: N1072 ^status complete)
  25735. <=WM: (15065: I3 ^predict-yes N1072)
  25736. --- Firing Productions (IE) For State At Depth 1 ---
  25737. --- Inner Elaboration Phase, active level 1 (S1) ---
  25738. Firing monitor*world
  25739. -->
  25740. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25741. --- Change Working Memory (IE) ---
  25742. --- END Application Phase ---
  25743. --- Output Phase ---
  25744. ENV: Agent did: predict-no for direction R in state State-B
  25745. In State-B moving R
  25746. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25747. predict error 0
  25748. dir: dir isL
  25749. --- END Output Phase ---
  25750. \---- Input Phase ---
  25751. =>WM: (15083: I2 ^dir L)
  25752. =>WM: (15082: I2 ^reward 1)
  25753. =>WM: (15081: I2 ^see 0)
  25754. =>WM: (15080: N1073 ^status complete)
  25755. <=WM: (15069: I2 ^dir R)
  25756. <=WM: (15068: I2 ^reward 1)
  25757. <=WM: (15067: I2 ^see 1)
  25758. =>WM: (15084: I2 ^level-1 R0-root)
  25759. <=WM: (15070: I2 ^level-1 R1-root)
  25760. --- END Input Phase ---
  25761. --- Proposal Phase ---
  25762. --- Inner Elaboration Phase, active level 1 (S1) ---
  25763. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  25764. -->
  25765. (S1 ^operator O2145 = 0.4768810224970678)
  25766. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  25767. -->
  25768. (S1 ^operator O2146 = 0.1700769046561409)
  25769. Firing prefer*rvt*predict-no*H0*2*H1
  25770. -->
  25771. Firing prefer*rvt*predict-yes*H0*1*H1
  25772. -->
  25773. Firing elaborate*copy-see-to-output-link
  25774. -->
  25775. (I3 ^see 0 +)
  25776. Firing elaborate*reward*based*on*reward
  25777. -->
  25778. (R1077 ^value 1 +)
  25779. (R1 ^reward R1077 +)
  25780. Firing propose*predict-yes
  25781. -->
  25782. (O2147 ^name predict-yes +)
  25783. (S1 ^operator O2147 +)
  25784. Firing propose*predict-no
  25785. -->
  25786. (O2148 ^name predict-no +)
  25787. (S1 ^operator O2148 +)
  25788. Firing rl*prefer*rvt*predict-no*H0*2
  25789. -->
  25790. (S1 ^operator O2146 = 0.255013406180364)
  25791. Firing rl*prefer*rvt*predict-yes*H0*1
  25792. -->
  25793. (S1 ^operator O2145 = 0.523119294248848)
  25794. Firing prefer*rvt*predict-yes*H0
  25795. -->
  25796. Firing prefer*rvt*predict-no*H0
  25797. -->
  25798. Firing elaborate*copy-dir-to-output-link
  25799. -->
  25800. (I3 ^dir L +)
  25801. inner elaboration loop at bottom goal.
  25802. Retracting elaborate*copy-see-to-output-link
  25803. -->
  25804. (I3 ^see 1 +)
  25805. Retracting propose*predict-no
  25806. -->
  25807. (O2146 ^name predict-no +)
  25808. (S1 ^operator O2146 +)
  25809. Retracting propose*predict-yes
  25810. -->
  25811. (O2145 ^name predict-yes +)
  25812. (S1 ^operator O2145 +)
  25813. Retracting elaborate*reward*based*on*reward
  25814. -->
  25815. (R1076 ^value 1 +)
  25816. (R1 ^reward R1076 +)
  25817. Retracting elaborate*copy-dir-to-output-link
  25818. -->
  25819. (I3 ^dir R +)
  25820. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  25821. -->
  25822. (S1 ^operator O2146 = 0.8730232299561043)
  25823. Retracting rl*prefer*rvt*predict-no*H0*4
  25824. -->
  25825. (S1 ^operator O2146 = 0.1269768396338047)
  25826. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  25827. -->
  25828. (S1 ^operator O2145 = 0.08783148430849691)
  25829. Retracting rl*prefer*rvt*predict-yes*H0*3
  25830. -->
  25831. (S1 ^operator O2145 = 0.3829444697344545)
  25832. =>WM: (15092: S1 ^operator O2148 +)
  25833. =>WM: (15091: S1 ^operator O2147 +)
  25834. =>WM: (15090: I3 ^dir L)
  25835. =>WM: (15089: O2148 ^name predict-no)
  25836. =>WM: (15088: O2147 ^name predict-yes)
  25837. =>WM: (15087: R1077 ^value 1)
  25838. =>WM: (15086: R1 ^reward R1077)
  25839. =>WM: (15085: I3 ^see 0)
  25840. <=WM: (15076: S1 ^operator O2145 +)
  25841. <=WM: (15077: S1 ^operator O2146 +)
  25842. <=WM: (15078: S1 ^operator O2146)
  25843. <=WM: (15061: I3 ^dir R)
  25844. <=WM: (15072: R1 ^reward R1076)
  25845. <=WM: (15071: I3 ^see 1)
  25846. <=WM: (15075: O2146 ^name predict-no)
  25847. <=WM: (15074: O2145 ^name predict-yes)
  25848. <=WM: (15073: R1076 ^value 1)
  25849. --- Inner Elaboration Phase, active level 1 (S1) ---
  25850. Firing prefer*rvt*predict-yes*H0
  25851. -->
  25852. Firing rl*prefer*rvt*predict-yes*H0*1
  25853. -->
  25854. (S1 ^operator O2147 = 0.523119294248848)
  25855. Firing prefer*rvt*predict-yes*H0*1*H1
  25856. -->
  25857. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  25858. -->
  25859. (S1 ^operator O2147 = 0.4768810224970678)
  25860. Firing prefer*rvt*predict-no*H0
  25861. -->
  25862. Firing rl*prefer*rvt*predict-no*H0*2
  25863. -->
  25864. (S1 ^operator O2148 = 0.255013406180364)
  25865. Firing prefer*rvt*predict-no*H0*2*H1
  25866. -->
  25867. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  25868. -->
  25869. (S1 ^operator O2148 = 0.1700769046561409)
  25870. inner elaboration loop at bottom goal.
  25871. Retracting rl*prefer*rvt*predict-no*H0*2
  25872. -->
  25873. (S1 ^operator O2146 = 0.255013406180364)
  25874. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  25875. -->
  25876. (S1 ^operator O2146 = 0.1700769046561409)
  25877. Retracting rl*prefer*rvt*predict-yes*H0*1
  25878. -->
  25879. (S1 ^operator O2145 = 0.523119294248848)
  25880. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  25881. -->
  25882. (S1 ^operator O2145 = 0.4768810224970678)
  25883. --- END Proposal Phase ---
  25884. --- Decision Phase ---
  25885. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.954082,0.0440345)
  25886. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  25887. =>WM: (15093: S1 ^operator O2147)
  25888. 1074: O: O2147 (predict-yes)
  25889. --- END Decision Phase ---
  25890. --- Application Phase ---
  25891. --- Firing Productions (PE) For State At Depth 1 ---
  25892. --- Inner Elaboration Phase, active level 1 (S1) ---
  25893. Firing apply*operator
  25894. -->
  25895. (I3 ^predict-yes N1074 + :O )
  25896. Firing apply*operator*complete
  25897. -->
  25898. (I3 ^predict-no N1073 - :O )
  25899. inner elaboration loop at bottom goal.
  25900. --- Change Working Memory (PE) ---
  25901. =>WM: (15094: I3 ^predict-yes N1074)
  25902. <=WM: (15080: N1073 ^status complete)
  25903. <=WM: (15079: I3 ^predict-no N1073)
  25904. --- Firing Productions (IE) For State At Depth 1 ---
  25905. --- Inner Elaboration Phase, active level 1 (S1) ---
  25906. Firing monitor*world
  25907. -->
  25908. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25909. --- Change Working Memory (IE) ---
  25910. --- END Application Phase ---
  25911. --- Output Phase ---
  25912. ENV: Agent did: predict-yes for direction L in state State-B
  25913. In State-B moving L
  25914. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25915. predict error 0
  25916. dir: dir isL
  25917. --- END Output Phase ---
  25918. /|\--- Input Phase ---
  25919. =>WM: (15098: I2 ^dir L)
  25920. =>WM: (15097: I2 ^reward 1)
  25921. =>WM: (15096: I2 ^see 1)
  25922. =>WM: (15095: N1074 ^status complete)
  25923. <=WM: (15083: I2 ^dir L)
  25924. <=WM: (15082: I2 ^reward 1)
  25925. <=WM: (15081: I2 ^see 0)
  25926. =>WM: (15099: I2 ^level-1 L1-root)
  25927. <=WM: (15084: I2 ^level-1 R0-root)
  25928. --- END Input Phase ---
  25929. --- Proposal Phase ---
  25930. --- Inner Elaboration Phase, active level 1 (S1) ---
  25931. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  25932. -->
  25933. (S1 ^operator O2147 = 0.1693592933936033)
  25934. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  25935. -->
  25936. (S1 ^operator O2148 = 0.7449865240186593)
  25937. Firing prefer*rvt*predict-no*H0*2*H1
  25938. -->
  25939. Firing prefer*rvt*predict-yes*H0*1*H1
  25940. -->
  25941. Firing elaborate*copy-see-to-output-link
  25942. -->
  25943. (I3 ^see 1 +)
  25944. Firing elaborate*reward*based*on*reward
  25945. -->
  25946. (R1078 ^value 1 +)
  25947. (R1 ^reward R1078 +)
  25948. Firing propose*predict-yes
  25949. -->
  25950. (O2149 ^name predict-yes +)
  25951. (S1 ^operator O2149 +)
  25952. Firing propose*predict-no
  25953. -->
  25954. (O2150 ^name predict-no +)
  25955. (S1 ^operator O2150 +)
  25956. Firing rl*prefer*rvt*predict-no*H0*2
  25957. -->
  25958. (S1 ^operator O2148 = 0.255013406180364)
  25959. Firing rl*prefer*rvt*predict-yes*H0*1
  25960. -->
  25961. (S1 ^operator O2147 = 0.523119294248848)
  25962. Firing prefer*rvt*predict-yes*H0
  25963. -->
  25964. Firing prefer*rvt*predict-no*H0
  25965. -->
  25966. Firing elaborate*copy-dir-to-output-link
  25967. -->
  25968. (I3 ^dir L +)
  25969. inner elaboration loop at bottom goal.
  25970. Retracting elaborate*copy-see-to-output-link
  25971. -->
  25972. (I3 ^see 0 +)
  25973. Retracting propose*predict-no
  25974. -->
  25975. (O2148 ^name predict-no +)
  25976. (S1 ^operator O2148 +)
  25977. Retracting propose*predict-yes
  25978. -->
  25979. (O2147 ^name predict-yes +)
  25980. (S1 ^operator O2147 +)
  25981. Retracting elaborate*reward*based*on*reward
  25982. -->
  25983. (R1077 ^value 1 +)
  25984. (R1 ^reward R1077 +)
  25985. Retracting elaborate*copy-dir-to-output-link
  25986. -->
  25987. (I3 ^dir L +)
  25988. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  25989. -->
  25990. (S1 ^operator O2148 = 0.1700769046561409)
  25991. Retracting rl*prefer*rvt*predict-no*H0*2
  25992. -->
  25993. (S1 ^operator O2148 = 0.255013406180364)
  25994. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  25995. -->
  25996. (S1 ^operator O2147 = 0.4768810224970678)
  25997. Retracting rl*prefer*rvt*predict-yes*H0*1
  25998. -->
  25999. (S1 ^operator O2147 = 0.523119294248848)
  26000. =>WM: (15106: S1 ^operator O2150 +)
  26001. =>WM: (15105: S1 ^operator O2149 +)
  26002. =>WM: (15104: O2150 ^name predict-no)
  26003. =>WM: (15103: O2149 ^name predict-yes)
  26004. =>WM: (15102: R1078 ^value 1)
  26005. =>WM: (15101: R1 ^reward R1078)
  26006. =>WM: (15100: I3 ^see 1)
  26007. <=WM: (15091: S1 ^operator O2147 +)
  26008. <=WM: (15093: S1 ^operator O2147)
  26009. <=WM: (15092: S1 ^operator O2148 +)
  26010. <=WM: (15086: R1 ^reward R1077)
  26011. <=WM: (15085: I3 ^see 0)
  26012. <=WM: (15089: O2148 ^name predict-no)
  26013. <=WM: (15088: O2147 ^name predict-yes)
  26014. <=WM: (15087: R1077 ^value 1)
  26015. --- Inner Elaboration Phase, active level 1 (S1) ---
  26016. Firing prefer*rvt*predict-yes*H0
  26017. -->
  26018. Firing rl*prefer*rvt*predict-yes*H0*1
  26019. -->
  26020. (S1 ^operator O2149 = 0.523119294248848)
  26021. Firing prefer*rvt*predict-yes*H0*1*H1
  26022. -->
  26023. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  26024. -->
  26025. (S1 ^operator O2149 = 0.1693592933936033)
  26026. Firing prefer*rvt*predict-no*H0
  26027. -->
  26028. Firing rl*prefer*rvt*predict-no*H0*2
  26029. -->
  26030. (S1 ^operator O2150 = 0.255013406180364)
  26031. Firing prefer*rvt*predict-no*H0*2*H1
  26032. -->
  26033. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  26034. -->
  26035. (S1 ^operator O2150 = 0.7449865240186593)
  26036. inner elaboration loop at bottom goal.
  26037. Retracting rl*prefer*rvt*predict-no*H0*2
  26038. -->
  26039. (S1 ^operator O2148 = 0.255013406180364)
  26040. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  26041. -->
  26042. (S1 ^operator O2148 = 0.7449865240186593)
  26043. Retracting rl*prefer*rvt*predict-yes*H0*1
  26044. -->
  26045. (S1 ^operator O2147 = 0.523119294248848)
  26046. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  26047. -->
  26048. (S1 ^operator O2147 = 0.1693592933936033)
  26049. --- END Proposal Phase ---
  26050. --- Decision Phase ---
  26051. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980645,0.0191035)
  26052. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272041 0.20484 0.476881 -> 0.272041 0.20484 0.476881(R,m,v=1,1,0)
  26053. =>WM: (15107: S1 ^operator O2150)
  26054. 1075: O: O2150 (predict-no)
  26055. --- END Decision Phase ---
  26056. --- Application Phase ---
  26057. --- Firing Productions (PE) For State At Depth 1 ---
  26058. --- Inner Elaboration Phase, active level 1 (S1) ---
  26059. Firing apply*operator
  26060. -->
  26061. (I3 ^predict-no N1075 + :O )
  26062. Firing apply*operator*complete
  26063. -->
  26064. (I3 ^predict-yes N1074 - :O )
  26065. inner elaboration loop at bottom goal.
  26066. --- Change Working Memory (PE) ---
  26067. =>WM: (15108: I3 ^predict-no N1075)
  26068. <=WM: (15095: N1074 ^status complete)
  26069. <=WM: (15094: I3 ^predict-yes N1074)
  26070. --- Firing Productions (IE) For State At Depth 1 ---
  26071. --- Inner Elaboration Phase, active level 1 (S1) ---
  26072. Firing monitor*world
  26073. -->
  26074. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26075. --- Change Working Memory (IE) ---
  26076. --- END Application Phase ---
  26077. --- Output Phase ---
  26078. ENV: Agent did: predict-no for direction L in state State-A
  26079. In State-A moving L
  26080. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26081. predict error 0
  26082. dir: dir isL
  26083. --- END Output Phase ---
  26084. -/|--- Input Phase ---
  26085. =>WM: (15112: I2 ^dir L)
  26086. =>WM: (15111: I2 ^reward 1)
  26087. =>WM: (15110: I2 ^see 0)
  26088. =>WM: (15109: N1075 ^status complete)
  26089. <=WM: (15098: I2 ^dir L)
  26090. <=WM: (15097: I2 ^reward 1)
  26091. <=WM: (15096: I2 ^see 1)
  26092. =>WM: (15113: I2 ^level-1 L0-root)
  26093. <=WM: (15099: I2 ^level-1 L1-root)
  26094. --- END Input Phase ---
  26095. --- Proposal Phase ---
  26096. --- Inner Elaboration Phase, active level 1 (S1) ---
  26097. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26098. -->
  26099. (S1 ^operator O2149 = 0.3)
  26100. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26101. -->
  26102. (S1 ^operator O2150 = 0.74498667547292)
  26103. Firing prefer*rvt*predict-no*H0*2*H1
  26104. -->
  26105. Firing prefer*rvt*predict-yes*H0*1*H1
  26106. -->
  26107. Firing elaborate*copy-see-to-output-link
  26108. -->
  26109. (I3 ^see 0 +)
  26110. Firing elaborate*reward*based*on*reward
  26111. -->
  26112. (R1079 ^value 1 +)
  26113. (R1 ^reward R1079 +)
  26114. Firing propose*predict-yes
  26115. -->
  26116. (O2151 ^name predict-yes +)
  26117. (S1 ^operator O2151 +)
  26118. Firing propose*predict-no
  26119. -->
  26120. (O2152 ^name predict-no +)
  26121. (S1 ^operator O2152 +)
  26122. Firing rl*prefer*rvt*predict-no*H0*2
  26123. -->
  26124. (S1 ^operator O2150 = 0.255013406180364)
  26125. Firing rl*prefer*rvt*predict-yes*H0*1
  26126. -->
  26127. (S1 ^operator O2149 = 0.5231192467369606)
  26128. Firing prefer*rvt*predict-yes*H0
  26129. -->
  26130. Firing prefer*rvt*predict-no*H0
  26131. -->
  26132. Firing elaborate*copy-dir-to-output-link
  26133. -->
  26134. (I3 ^dir L +)
  26135. inner elaboration loop at bottom goal.
  26136. Retracting elaborate*copy-see-to-output-link
  26137. -->
  26138. (I3 ^see 1 +)
  26139. Retracting propose*predict-no
  26140. -->
  26141. (O2150 ^name predict-no +)
  26142. (S1 ^operator O2150 +)
  26143. Retracting propose*predict-yes
  26144. -->
  26145. (O2149 ^name predict-yes +)
  26146. (S1 ^operator O2149 +)
  26147. Retracting elaborate*reward*based*on*reward
  26148. -->
  26149. (R1078 ^value 1 +)
  26150. (R1 ^reward R1078 +)
  26151. Retracting elaborate*copy-dir-to-output-link
  26152. -->
  26153. (I3 ^dir L +)
  26154. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  26155. -->
  26156. (S1 ^operator O2150 = 0.7449865240186593)
  26157. Retracting rl*prefer*rvt*predict-no*H0*2
  26158. -->
  26159. (S1 ^operator O2150 = 0.255013406180364)
  26160. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  26161. -->
  26162. (S1 ^operator O2149 = 0.1693592933936033)
  26163. Retracting rl*prefer*rvt*predict-yes*H0*1
  26164. -->
  26165. (S1 ^operator O2149 = 0.5231192467369606)
  26166. =>WM: (15120: S1 ^operator O2152 +)
  26167. =>WM: (15119: S1 ^operator O2151 +)
  26168. =>WM: (15118: O2152 ^name predict-no)
  26169. =>WM: (15117: O2151 ^name predict-yes)
  26170. =>WM: (15116: R1079 ^value 1)
  26171. =>WM: (15115: R1 ^reward R1079)
  26172. =>WM: (15114: I3 ^see 0)
  26173. <=WM: (15105: S1 ^operator O2149 +)
  26174. <=WM: (15106: S1 ^operator O2150 +)
  26175. <=WM: (15107: S1 ^operator O2150)
  26176. <=WM: (15101: R1 ^reward R1078)
  26177. <=WM: (15100: I3 ^see 1)
  26178. <=WM: (15104: O2150 ^name predict-no)
  26179. <=WM: (15103: O2149 ^name predict-yes)
  26180. <=WM: (15102: R1078 ^value 1)
  26181. --- Inner Elaboration Phase, active level 1 (S1) ---
  26182. Firing prefer*rvt*predict-yes*H0
  26183. -->
  26184. Firing rl*prefer*rvt*predict-yes*H0*1
  26185. -->
  26186. (S1 ^operator O2151 = 0.5231192467369606)
  26187. Firing prefer*rvt*predict-yes*H0*1*H1
  26188. -->
  26189. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26190. -->
  26191. (S1 ^operator O2151 = 0.3)
  26192. Firing prefer*rvt*predict-no*H0
  26193. -->
  26194. Firing rl*prefer*rvt*predict-no*H0*2
  26195. -->
  26196. (S1 ^operator O2152 = 0.255013406180364)
  26197. Firing prefer*rvt*predict-no*H0*2*H1
  26198. -->
  26199. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26200. -->
  26201. (S1 ^operator O2152 = 0.74498667547292)
  26202. inner elaboration loop at bottom goal.
  26203. Retracting rl*prefer*rvt*predict-no*H0*2
  26204. -->
  26205. (S1 ^operator O2150 = 0.255013406180364)
  26206. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26207. -->
  26208. (S1 ^operator O2150 = 0.74498667547292)
  26209. Retracting rl*prefer*rvt*predict-yes*H0*1
  26210. -->
  26211. (S1 ^operator O2149 = 0.5231192467369606)
  26212. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26213. -->
  26214. (S1 ^operator O2149 = 0.3)
  26215. --- END Proposal Phase ---
  26216. --- Decision Phase ---
  26217. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.920792,0.0732969)
  26218. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  26219. =>WM: (15121: S1 ^operator O2152)
  26220. 1076: O: O2152 (predict-no)
  26221. --- END Decision Phase ---
  26222. --- Application Phase ---
  26223. --- Firing Productions (PE) For State At Depth 1 ---
  26224. --- Inner Elaboration Phase, active level 1 (S1) ---
  26225. Firing apply*operator
  26226. -->
  26227. (I3 ^predict-no N1076 + :O )
  26228. Firing apply*operator*complete
  26229. -->
  26230. (I3 ^predict-no N1075 - :O )
  26231. inner elaboration loop at bottom goal.
  26232. --- Change Working Memory (PE) ---
  26233. =>WM: (15122: I3 ^predict-no N1076)
  26234. <=WM: (15109: N1075 ^status complete)
  26235. <=WM: (15108: I3 ^predict-no N1075)
  26236. --- Firing Productions (IE) For State At Depth 1 ---
  26237. --- Inner Elaboration Phase, active level 1 (S1) ---
  26238. Firing monitor*world
  26239. -->
  26240. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26241. --- Change Working Memory (IE) ---
  26242. --- END Application Phase ---
  26243. --- Output Phase ---
  26244. ENV: Agent did: predict-no for direction L in state State-A
  26245. In State-A moving L
  26246. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26247. predict error 0
  26248. dir: dir isL
  26249. --- END Output Phase ---
  26250. \---- Input Phase ---
  26251. =>WM: (15126: I2 ^dir L)
  26252. =>WM: (15125: I2 ^reward 1)
  26253. =>WM: (15124: I2 ^see 0)
  26254. =>WM: (15123: N1076 ^status complete)
  26255. <=WM: (15112: I2 ^dir L)
  26256. <=WM: (15111: I2 ^reward 1)
  26257. <=WM: (15110: I2 ^see 0)
  26258. =>WM: (15127: I2 ^level-1 L0-root)
  26259. <=WM: (15113: I2 ^level-1 L0-root)
  26260. --- END Input Phase ---
  26261. --- Proposal Phase ---
  26262. --- Inner Elaboration Phase, active level 1 (S1) ---
  26263. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26264. -->
  26265. (S1 ^operator O2151 = 0.3)
  26266. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26267. -->
  26268. (S1 ^operator O2152 = 0.74498667547292)
  26269. Firing prefer*rvt*predict-no*H0*2*H1
  26270. -->
  26271. Firing prefer*rvt*predict-yes*H0*1*H1
  26272. -->
  26273. Firing elaborate*copy-see-to-output-link
  26274. -->
  26275. (I3 ^see 0 +)
  26276. Firing elaborate*reward*based*on*reward
  26277. -->
  26278. (R1080 ^value 1 +)
  26279. (R1 ^reward R1080 +)
  26280. Firing propose*predict-yes
  26281. -->
  26282. (O2153 ^name predict-yes +)
  26283. (S1 ^operator O2153 +)
  26284. Firing propose*predict-no
  26285. -->
  26286. (O2154 ^name predict-no +)
  26287. (S1 ^operator O2154 +)
  26288. Firing rl*prefer*rvt*predict-no*H0*2
  26289. -->
  26290. (S1 ^operator O2152 = 0.2550134166505105)
  26291. Firing rl*prefer*rvt*predict-yes*H0*1
  26292. -->
  26293. (S1 ^operator O2151 = 0.5231192467369606)
  26294. Firing prefer*rvt*predict-yes*H0
  26295. -->
  26296. Firing prefer*rvt*predict-no*H0
  26297. -->
  26298. Firing elaborate*copy-dir-to-output-link
  26299. -->
  26300. (I3 ^dir L +)
  26301. inner elaboration loop at bottom goal.
  26302. Retracting elaborate*copy-see-to-output-link
  26303. -->
  26304. (I3 ^see 0 +)
  26305. Retracting propose*predict-no
  26306. -->
  26307. (O2152 ^name predict-no +)
  26308. (S1 ^operator O2152 +)
  26309. Retracting propose*predict-yes
  26310. -->
  26311. (O2151 ^name predict-yes +)
  26312. (S1 ^operator O2151 +)
  26313. Retracting elaborate*reward*based*on*reward
  26314. -->
  26315. (R1079 ^value 1 +)
  26316. (R1 ^reward R1079 +)
  26317. Retracting elaborate*copy-dir-to-output-link
  26318. -->
  26319. (I3 ^dir L +)
  26320. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26321. -->
  26322. (S1 ^operator O2152 = 0.74498667547292)
  26323. Retracting rl*prefer*rvt*predict-no*H0*2
  26324. -->
  26325. (S1 ^operator O2152 = 0.2550134166505105)
  26326. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26327. -->
  26328. (S1 ^operator O2151 = 0.3)
  26329. Retracting rl*prefer*rvt*predict-yes*H0*1
  26330. -->
  26331. (S1 ^operator O2151 = 0.5231192467369606)
  26332. =>WM: (15133: S1 ^operator O2154 +)
  26333. =>WM: (15132: S1 ^operator O2153 +)
  26334. =>WM: (15131: O2154 ^name predict-no)
  26335. =>WM: (15130: O2153 ^name predict-yes)
  26336. =>WM: (15129: R1080 ^value 1)
  26337. =>WM: (15128: R1 ^reward R1080)
  26338. <=WM: (15119: S1 ^operator O2151 +)
  26339. <=WM: (15120: S1 ^operator O2152 +)
  26340. <=WM: (15121: S1 ^operator O2152)
  26341. <=WM: (15115: R1 ^reward R1079)
  26342. <=WM: (15118: O2152 ^name predict-no)
  26343. <=WM: (15117: O2151 ^name predict-yes)
  26344. <=WM: (15116: R1079 ^value 1)
  26345. --- Inner Elaboration Phase, active level 1 (S1) ---
  26346. Firing prefer*rvt*predict-yes*H0
  26347. -->
  26348. Firing rl*prefer*rvt*predict-yes*H0*1
  26349. -->
  26350. (S1 ^operator O2153 = 0.5231192467369606)
  26351. Firing prefer*rvt*predict-yes*H0*1*H1
  26352. -->
  26353. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26354. -->
  26355. (S1 ^operator O2153 = 0.3)
  26356. Firing prefer*rvt*predict-no*H0
  26357. -->
  26358. Firing rl*prefer*rvt*predict-no*H0*2
  26359. -->
  26360. (S1 ^operator O2154 = 0.2550134166505105)
  26361. Firing prefer*rvt*predict-no*H0*2*H1
  26362. -->
  26363. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26364. -->
  26365. (S1 ^operator O2154 = 0.74498667547292)
  26366. inner elaboration loop at bottom goal.
  26367. Retracting rl*prefer*rvt*predict-no*H0*2
  26368. -->
  26369. (S1 ^operator O2152 = 0.2550134166505105)
  26370. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26371. -->
  26372. (S1 ^operator O2152 = 0.74498667547292)
  26373. Retracting rl*prefer*rvt*predict-yes*H0*1
  26374. -->
  26375. (S1 ^operator O2151 = 0.5231192467369606)
  26376. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26377. -->
  26378. (S1 ^operator O2151 = 0.3)
  26379. --- END Proposal Phase ---
  26380. --- Decision Phase ---
  26381. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.921182,0.0729649)
  26382. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  26383. =>WM: (15134: S1 ^operator O2154)
  26384. 1077: O: O2154 (predict-no)
  26385. --- END Decision Phase ---
  26386. --- Application Phase ---
  26387. --- Firing Productions (PE) For State At Depth 1 ---
  26388. --- Inner Elaboration Phase, active level 1 (S1) ---
  26389. Firing apply*operator
  26390. -->
  26391. (I3 ^predict-no N1077 + :O )
  26392. Firing apply*operator*complete
  26393. -->
  26394. (I3 ^predict-no N1076 - :O )
  26395. inner elaboration loop at bottom goal.
  26396. --- Change Working Memory (PE) ---
  26397. =>WM: (15135: I3 ^predict-no N1077)
  26398. <=WM: (15123: N1076 ^status complete)
  26399. <=WM: (15122: I3 ^predict-no N1076)
  26400. --- Firing Productions (IE) For State At Depth 1 ---
  26401. --- Inner Elaboration Phase, active level 1 (S1) ---
  26402. Firing monitor*world
  26403. -->
  26404. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26405. --- Change Working Memory (IE) ---
  26406. --- END Application Phase ---
  26407. --- Output Phase ---
  26408. ENV: Agent did: predict-no for direction L in state State-A
  26409. In State-A moving L
  26410. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26411. predict error 0
  26412. dir: dir isL
  26413. --- END Output Phase ---
  26414. /|\--- Input Phase ---
  26415. =>WM: (15139: I2 ^dir L)
  26416. =>WM: (15138: I2 ^reward 1)
  26417. =>WM: (15137: I2 ^see 0)
  26418. =>WM: (15136: N1077 ^status complete)
  26419. <=WM: (15126: I2 ^dir L)
  26420. <=WM: (15125: I2 ^reward 1)
  26421. <=WM: (15124: I2 ^see 0)
  26422. =>WM: (15140: I2 ^level-1 L0-root)
  26423. <=WM: (15127: I2 ^level-1 L0-root)
  26424. --- END Input Phase ---
  26425. --- Proposal Phase ---
  26426. --- Inner Elaboration Phase, active level 1 (S1) ---
  26427. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26428. -->
  26429. (S1 ^operator O2153 = 0.3)
  26430. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26431. -->
  26432. (S1 ^operator O2154 = 0.7449866616544054)
  26433. Firing prefer*rvt*predict-no*H0*2*H1
  26434. -->
  26435. Firing prefer*rvt*predict-yes*H0*1*H1
  26436. -->
  26437. Firing elaborate*copy-see-to-output-link
  26438. -->
  26439. (I3 ^see 0 +)
  26440. Firing elaborate*reward*based*on*reward
  26441. -->
  26442. (R1081 ^value 1 +)
  26443. (R1 ^reward R1081 +)
  26444. Firing propose*predict-yes
  26445. -->
  26446. (O2155 ^name predict-yes +)
  26447. (S1 ^operator O2155 +)
  26448. Firing propose*predict-no
  26449. -->
  26450. (O2156 ^name predict-no +)
  26451. (S1 ^operator O2156 +)
  26452. Firing rl*prefer*rvt*predict-no*H0*2
  26453. -->
  26454. (S1 ^operator O2154 = 0.2550134028319959)
  26455. Firing rl*prefer*rvt*predict-yes*H0*1
  26456. -->
  26457. (S1 ^operator O2153 = 0.5231192467369606)
  26458. Firing prefer*rvt*predict-yes*H0
  26459. -->
  26460. Firing prefer*rvt*predict-no*H0
  26461. -->
  26462. Firing elaborate*copy-dir-to-output-link
  26463. -->
  26464. (I3 ^dir L +)
  26465. inner elaboration loop at bottom goal.
  26466. Retracting elaborate*copy-see-to-output-link
  26467. -->
  26468. (I3 ^see 0 +)
  26469. Retracting propose*predict-no
  26470. -->
  26471. (O2154 ^name predict-no +)
  26472. (S1 ^operator O2154 +)
  26473. Retracting propose*predict-yes
  26474. -->
  26475. (O2153 ^name predict-yes +)
  26476. (S1 ^operator O2153 +)
  26477. Retracting elaborate*reward*based*on*reward
  26478. -->
  26479. (R1080 ^value 1 +)
  26480. (R1 ^reward R1080 +)
  26481. Retracting elaborate*copy-dir-to-output-link
  26482. -->
  26483. (I3 ^dir L +)
  26484. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26485. -->
  26486. (S1 ^operator O2154 = 0.7449866616544054)
  26487. Retracting rl*prefer*rvt*predict-no*H0*2
  26488. -->
  26489. (S1 ^operator O2154 = 0.2550134028319959)
  26490. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26491. -->
  26492. (S1 ^operator O2153 = 0.3)
  26493. Retracting rl*prefer*rvt*predict-yes*H0*1
  26494. -->
  26495. (S1 ^operator O2153 = 0.5231192467369606)
  26496. =>WM: (15146: S1 ^operator O2156 +)
  26497. =>WM: (15145: S1 ^operator O2155 +)
  26498. =>WM: (15144: O2156 ^name predict-no)
  26499. =>WM: (15143: O2155 ^name predict-yes)
  26500. =>WM: (15142: R1081 ^value 1)
  26501. =>WM: (15141: R1 ^reward R1081)
  26502. <=WM: (15132: S1 ^operator O2153 +)
  26503. <=WM: (15133: S1 ^operator O2154 +)
  26504. <=WM: (15134: S1 ^operator O2154)
  26505. <=WM: (15128: R1 ^reward R1080)
  26506. <=WM: (15131: O2154 ^name predict-no)
  26507. <=WM: (15130: O2153 ^name predict-yes)
  26508. <=WM: (15129: R1080 ^value 1)
  26509. --- Inner Elaboration Phase, active level 1 (S1) ---
  26510. Firing prefer*rvt*predict-yes*H0
  26511. -->
  26512. Firing rl*prefer*rvt*predict-yes*H0*1
  26513. -->
  26514. (S1 ^operator O2155 = 0.5231192467369606)
  26515. Firing prefer*rvt*predict-yes*H0*1*H1
  26516. -->
  26517. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  26518. -->
  26519. (S1 ^operator O2155 = 0.3)
  26520. Firing prefer*rvt*predict-no*H0
  26521. -->
  26522. Firing rl*prefer*rvt*predict-no*H0*2
  26523. -->
  26524. (S1 ^operator O2156 = 0.2550134028319959)
  26525. Firing prefer*rvt*predict-no*H0*2*H1
  26526. -->
  26527. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  26528. -->
  26529. (S1 ^operator O2156 = 0.7449866616544054)
  26530. inner elaboration loop at bottom goal.
  26531. Retracting rl*prefer*rvt*predict-no*H0*2
  26532. -->
  26533. (S1 ^operator O2154 = 0.2550134028319959)
  26534. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26535. -->
  26536. (S1 ^operator O2154 = 0.7449866616544054)
  26537. Retracting rl*prefer*rvt*predict-yes*H0*1
  26538. -->
  26539. (S1 ^operator O2153 = 0.5231192467369606)
  26540. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26541. -->
  26542. (S1 ^operator O2153 = 0.3)
  26543. --- END Proposal Phase ---
  26544. --- Decision Phase ---
  26545. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.921569,0.072636)
  26546. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  26547. =>WM: (15147: S1 ^operator O2156)
  26548. 1078: O: O2156 (predict-no)
  26549. --- END Decision Phase ---
  26550. --- Application Phase ---
  26551. --- Firing Productions (PE) For State At Depth 1 ---
  26552. --- Inner Elaboration Phase, active level 1 (S1) ---
  26553. Firing apply*operator
  26554. -->
  26555. (I3 ^predict-no N1078 + :O )
  26556. Firing apply*operator*complete
  26557. -->
  26558. (I3 ^predict-no N1077 - :O )
  26559. inner elaboration loop at bottom goal.
  26560. --- Change Working Memory (PE) ---
  26561. =>WM: (15148: I3 ^predict-no N1078)
  26562. <=WM: (15136: N1077 ^status complete)
  26563. <=WM: (15135: I3 ^predict-no N1077)
  26564. --- Firing Productions (IE) For State At Depth 1 ---
  26565. --- Inner Elaboration Phase, active level 1 (S1) ---
  26566. Firing monitor*world
  26567. -->
  26568. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26569. --- Change Working Memory (IE) ---
  26570. --- END Application Phase ---
  26571. --- Output Phase ---
  26572. ENV: Agent did: predict-no for direction L in state State-A
  26573. In State-A moving L
  26574. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26575. predict error 0
  26576. dir: dir isR
  26577. --- END Output Phase ---
  26578. -/|--- Input Phase ---
  26579. =>WM: (15152: I2 ^dir R)
  26580. =>WM: (15151: I2 ^reward 1)
  26581. =>WM: (15150: I2 ^see 0)
  26582. =>WM: (15149: N1078 ^status complete)
  26583. <=WM: (15139: I2 ^dir L)
  26584. <=WM: (15138: I2 ^reward 1)
  26585. <=WM: (15137: I2 ^see 0)
  26586. =>WM: (15153: I2 ^level-1 L0-root)
  26587. <=WM: (15140: I2 ^level-1 L0-root)
  26588. --- END Input Phase ---
  26589. --- Proposal Phase ---
  26590. --- Inner Elaboration Phase, active level 1 (S1) ---
  26591. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  26592. -->
  26593. (S1 ^operator O2155 = 0.6170678009401356)
  26594. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  26595. -->
  26596. (S1 ^operator O2156 = 0.4910065094545203)
  26597. Firing prefer*rvt*predict-no*H0*4*H1
  26598. -->
  26599. Firing prefer*rvt*predict-yes*H0*3*H1
  26600. -->
  26601. Firing elaborate*copy-see-to-output-link
  26602. -->
  26603. (I3 ^see 0 +)
  26604. Firing elaborate*reward*based*on*reward
  26605. -->
  26606. (R1082 ^value 1 +)
  26607. (R1 ^reward R1082 +)
  26608. Firing propose*predict-yes
  26609. -->
  26610. (O2157 ^name predict-yes +)
  26611. (S1 ^operator O2157 +)
  26612. Firing propose*predict-no
  26613. -->
  26614. (O2158 ^name predict-no +)
  26615. (S1 ^operator O2158 +)
  26616. Firing rl*prefer*rvt*predict-no*H0*4
  26617. -->
  26618. (S1 ^operator O2156 = 0.1269768291953184)
  26619. Firing rl*prefer*rvt*predict-yes*H0*3
  26620. -->
  26621. (S1 ^operator O2155 = 0.3829444697344545)
  26622. Firing prefer*rvt*predict-yes*H0
  26623. -->
  26624. Firing prefer*rvt*predict-no*H0
  26625. -->
  26626. Firing elaborate*copy-dir-to-output-link
  26627. -->
  26628. (I3 ^dir R +)
  26629. inner elaboration loop at bottom goal.
  26630. Retracting elaborate*copy-see-to-output-link
  26631. -->
  26632. (I3 ^see 0 +)
  26633. Retracting propose*predict-no
  26634. -->
  26635. (O2156 ^name predict-no +)
  26636. (S1 ^operator O2156 +)
  26637. Retracting propose*predict-yes
  26638. -->
  26639. (O2155 ^name predict-yes +)
  26640. (S1 ^operator O2155 +)
  26641. Retracting elaborate*reward*based*on*reward
  26642. -->
  26643. (R1081 ^value 1 +)
  26644. (R1 ^reward R1081 +)
  26645. Retracting elaborate*copy-dir-to-output-link
  26646. -->
  26647. (I3 ^dir L +)
  26648. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  26649. -->
  26650. (S1 ^operator O2156 = 0.7449866519814452)
  26651. Retracting rl*prefer*rvt*predict-no*H0*2
  26652. -->
  26653. (S1 ^operator O2156 = 0.2550133931590357)
  26654. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  26655. -->
  26656. (S1 ^operator O2155 = 0.3)
  26657. Retracting rl*prefer*rvt*predict-yes*H0*1
  26658. -->
  26659. (S1 ^operator O2155 = 0.5231192467369606)
  26660. =>WM: (15160: S1 ^operator O2158 +)
  26661. =>WM: (15159: S1 ^operator O2157 +)
  26662. =>WM: (15158: I3 ^dir R)
  26663. =>WM: (15157: O2158 ^name predict-no)
  26664. =>WM: (15156: O2157 ^name predict-yes)
  26665. =>WM: (15155: R1082 ^value 1)
  26666. =>WM: (15154: R1 ^reward R1082)
  26667. <=WM: (15145: S1 ^operator O2155 +)
  26668. <=WM: (15146: S1 ^operator O2156 +)
  26669. <=WM: (15147: S1 ^operator O2156)
  26670. <=WM: (15090: I3 ^dir L)
  26671. <=WM: (15141: R1 ^reward R1081)
  26672. <=WM: (15144: O2156 ^name predict-no)
  26673. <=WM: (15143: O2155 ^name predict-yes)
  26674. <=WM: (15142: R1081 ^value 1)
  26675. --- Inner Elaboration Phase, active level 1 (S1) ---
  26676. Firing prefer*rvt*predict-yes*H0
  26677. -->
  26678. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  26679. -->
  26680. (S1 ^operator O2157 = 0.6170678009401356)
  26681. Firing rl*prefer*rvt*predict-yes*H0*3
  26682. -->
  26683. (S1 ^operator O2157 = 0.3829444697344545)
  26684. Firing prefer*rvt*predict-yes*H0*3*H1
  26685. -->
  26686. Firing prefer*rvt*predict-no*H0
  26687. -->
  26688. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  26689. -->
  26690. (S1 ^operator O2158 = 0.4910065094545203)
  26691. Firing rl*prefer*rvt*predict-no*H0*4
  26692. -->
  26693. (S1 ^operator O2158 = 0.1269768291953184)
  26694. Firing prefer*rvt*predict-no*H0*4*H1
  26695. -->
  26696. inner elaboration loop at bottom goal.
  26697. Retracting rl*prefer*rvt*predict-no*H0*4
  26698. -->
  26699. (S1 ^operator O2156 = 0.1269768291953184)
  26700. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  26701. -->
  26702. (S1 ^operator O2156 = 0.4910065094545203)
  26703. Retracting rl*prefer*rvt*predict-yes*H0*3
  26704. -->
  26705. (S1 ^operator O2155 = 0.3829444697344545)
  26706. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  26707. -->
  26708. (S1 ^operator O2155 = 0.6170678009401356)
  26709. --- END Proposal Phase ---
  26710. --- Decision Phase ---
  26711. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.921951,0.0723099)
  26712. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  26713. =>WM: (15161: S1 ^operator O2157)
  26714. 1079: O: O2157 (predict-yes)
  26715. --- END Decision Phase ---
  26716. --- Application Phase ---
  26717. --- Firing Productions (PE) For State At Depth 1 ---
  26718. --- Inner Elaboration Phase, active level 1 (S1) ---
  26719. Firing apply*operator
  26720. -->
  26721. (I3 ^predict-yes N1079 + :O )
  26722. Firing apply*operator*complete
  26723. -->
  26724. (I3 ^predict-no N1078 - :O )
  26725. inner elaboration loop at bottom goal.
  26726. --- Change Working Memory (PE) ---
  26727. =>WM: (15162: I3 ^predict-yes N1079)
  26728. <=WM: (15149: N1078 ^status complete)
  26729. <=WM: (15148: I3 ^predict-no N1078)
  26730. --- Firing Productions (IE) For State At Depth 1 ---
  26731. --- Inner Elaboration Phase, active level 1 (S1) ---
  26732. Firing monitor*world
  26733. -->
  26734. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26735. --- Change Working Memory (IE) ---
  26736. --- END Application Phase ---
  26737. --- Output Phase ---
  26738. ENV: Agent did: predict-yes for direction R in state State-A
  26739. In State-A moving R
  26740. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  26741. predict error 0
  26742. dir: dir isU
  26743. --- END Output Phase ---
  26744. \-/--- Input Phase ---
  26745. =>WM: (15166: I2 ^dir U)
  26746. =>WM: (15165: I2 ^reward 1)
  26747. =>WM: (15164: I2 ^see 1)
  26748. =>WM: (15163: N1079 ^status complete)
  26749. <=WM: (15152: I2 ^dir R)
  26750. <=WM: (15151: I2 ^reward 1)
  26751. <=WM: (15150: I2 ^see 0)
  26752. =>WM: (15167: I2 ^level-1 R1-root)
  26753. <=WM: (15153: I2 ^level-1 L0-root)
  26754. --- END Input Phase ---
  26755. --- Proposal Phase ---
  26756. --- Inner Elaboration Phase, active level 1 (S1) ---
  26757. Firing elaborate*copy-see-to-output-link
  26758. -->
  26759. (I3 ^see 1 +)
  26760. Firing elaborate*reward*based*on*reward
  26761. -->
  26762. (R1083 ^value 1 +)
  26763. (R1 ^reward R1083 +)
  26764. Firing propose*predict-yes
  26765. -->
  26766. (O2159 ^name predict-yes +)
  26767. (S1 ^operator O2159 +)
  26768. Firing propose*predict-no
  26769. -->
  26770. (O2160 ^name predict-no +)
  26771. (S1 ^operator O2160 +)
  26772. Firing rl*prefer*rvt*predict-no*H0*6
  26773. -->
  26774. (S1 ^operator O2158 = 0.9999999999999999)
  26775. Firing rl*prefer*rvt*predict-yes*H0*5
  26776. -->
  26777. (S1 ^operator O2157 = 0.)
  26778. Firing prefer*rvt*predict-yes*H0
  26779. -->
  26780. Firing prefer*rvt*predict-no*H0
  26781. -->
  26782. Firing elaborate*copy-dir-to-output-link
  26783. -->
  26784. (I3 ^dir U +)
  26785. inner elaboration loop at bottom goal.
  26786. Retracting elaborate*copy-see-to-output-link
  26787. -->
  26788. (I3 ^see 0 +)
  26789. Retracting propose*predict-no
  26790. -->
  26791. (O2158 ^name predict-no +)
  26792. (S1 ^operator O2158 +)
  26793. Retracting propose*predict-yes
  26794. -->
  26795. (O2157 ^name predict-yes +)
  26796. (S1 ^operator O2157 +)
  26797. Retracting elaborate*reward*based*on*reward
  26798. -->
  26799. (R1082 ^value 1 +)
  26800. (R1 ^reward R1082 +)
  26801. Retracting elaborate*copy-dir-to-output-link
  26802. -->
  26803. (I3 ^dir R +)
  26804. Retracting rl*prefer*rvt*predict-no*H0*4
  26805. -->
  26806. (S1 ^operator O2158 = 0.1269768291953184)
  26807. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  26808. -->
  26809. (S1 ^operator O2158 = 0.4910065094545203)
  26810. Retracting rl*prefer*rvt*predict-yes*H0*3
  26811. -->
  26812. (S1 ^operator O2157 = 0.3829444697344545)
  26813. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  26814. -->
  26815. (S1 ^operator O2157 = 0.6170678009401356)
  26816. =>WM: (15175: S1 ^operator O2160 +)
  26817. =>WM: (15174: S1 ^operator O2159 +)
  26818. =>WM: (15173: I3 ^dir U)
  26819. =>WM: (15172: O2160 ^name predict-no)
  26820. =>WM: (15171: O2159 ^name predict-yes)
  26821. =>WM: (15170: R1083 ^value 1)
  26822. =>WM: (15169: R1 ^reward R1083)
  26823. =>WM: (15168: I3 ^see 1)
  26824. <=WM: (15159: S1 ^operator O2157 +)
  26825. <=WM: (15161: S1 ^operator O2157)
  26826. <=WM: (15160: S1 ^operator O2158 +)
  26827. <=WM: (15158: I3 ^dir R)
  26828. <=WM: (15154: R1 ^reward R1082)
  26829. <=WM: (15114: I3 ^see 0)
  26830. <=WM: (15157: O2158 ^name predict-no)
  26831. <=WM: (15156: O2157 ^name predict-yes)
  26832. <=WM: (15155: R1082 ^value 1)
  26833. --- Inner Elaboration Phase, active level 1 (S1) ---
  26834. Firing prefer*rvt*predict-yes*H0
  26835. -->
  26836. Firing rl*prefer*rvt*predict-yes*H0*5
  26837. -->
  26838. (S1 ^operator O2159 = 0.)
  26839. Firing prefer*rvt*predict-no*H0
  26840. -->
  26841. Firing rl*prefer*rvt*predict-no*H0*6
  26842. -->
  26843. (S1 ^operator O2160 = 0.9999999999999999)
  26844. inner elaboration loop at bottom goal.
  26845. Retracting rl*prefer*rvt*predict-no*H0*6
  26846. -->
  26847. (S1 ^operator O2158 = 0.9999999999999999)
  26848. Retracting rl*prefer*rvt*predict-yes*H0*5
  26849. -->
  26850. (S1 ^operator O2157 = 0.)
  26851. --- END Proposal Phase ---
  26852. --- Decision Phase ---
  26853. RL update rl*prefer*rvt*predict-yes*H0*3 0.673137 -0.290193 0.382944 -> 0.673136 -0.290193 0.382943(R,m,v=1,0.963855,0.0350493)
  26854. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326874 0.290194 0.617068 -> 0.326872 0.290194 0.617066(R,m,v=1,1,0)
  26855. =>WM: (15176: S1 ^operator O2160)
  26856. 1080: O: O2160 (predict-no)
  26857. --- END Decision Phase ---
  26858. --- Application Phase ---
  26859. --- Firing Productions (PE) For State At Depth 1 ---
  26860. --- Inner Elaboration Phase, active level 1 (S1) ---
  26861. Firing apply*operator
  26862. -->
  26863. (I3 ^predict-no N1080 + :O )
  26864. Firing apply*operator*complete
  26865. -->
  26866. (I3 ^predict-yes N1079 - :O )
  26867. inner elaboration loop at bottom goal.
  26868. --- Change Working Memory (PE) ---
  26869. =>WM: (15177: I3 ^predict-no N1080)
  26870. <=WM: (15163: N1079 ^status complete)
  26871. <=WM: (15162: I3 ^predict-yes N1079)
  26872. --- Firing Productions (IE) For State At Depth 1 ---
  26873. --- Inner Elaboration Phase, active level 1 (S1) ---
  26874. Firing monitor*world
  26875. -->
  26876. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26877. --- Change Working Memory (IE) ---
  26878. --- END Application Phase ---
  26879. --- Output Phase ---
  26880. ENV: Agent did: predict-no for direction U in state State-B
  26881. In State-B moving U
  26882. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26883. predict error 0
  26884. dir: dir isR
  26885. --- END Output Phase ---
  26886. |\---- Input Phase ---
  26887. =>WM: (15181: I2 ^dir R)
  26888. =>WM: (15180: I2 ^reward 1)
  26889. =>WM: (15179: I2 ^see 0)
  26890. =>WM: (15178: N1080 ^status complete)
  26891. <=WM: (15166: I2 ^dir U)
  26892. <=WM: (15165: I2 ^reward 1)
  26893. <=WM: (15164: I2 ^see 1)
  26894. =>WM: (15182: I2 ^level-1 R1-root)
  26895. <=WM: (15167: I2 ^level-1 R1-root)
  26896. --- END Input Phase ---
  26897. --- Proposal Phase ---
  26898. --- Inner Elaboration Phase, active level 1 (S1) ---
  26899. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  26900. -->
  26901. (S1 ^operator O2159 = 0.08783148430849691)
  26902. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  26903. -->
  26904. (S1 ^operator O2160 = 0.8730232195176179)
  26905. Firing prefer*rvt*predict-no*H0*4*H1
  26906. -->
  26907. Firing prefer*rvt*predict-yes*H0*3*H1
  26908. -->
  26909. Firing elaborate*copy-see-to-output-link
  26910. -->
  26911. (I3 ^see 0 +)
  26912. Firing elaborate*reward*based*on*reward
  26913. -->
  26914. (R1084 ^value 1 +)
  26915. (R1 ^reward R1084 +)
  26916. Firing propose*predict-yes
  26917. -->
  26918. (O2161 ^name predict-yes +)
  26919. (S1 ^operator O2161 +)
  26920. Firing propose*predict-no
  26921. -->
  26922. (O2162 ^name predict-no +)
  26923. (S1 ^operator O2162 +)
  26924. Firing rl*prefer*rvt*predict-no*H0*4
  26925. -->
  26926. (S1 ^operator O2160 = 0.1269768291953184)
  26927. Firing rl*prefer*rvt*predict-yes*H0*3
  26928. -->
  26929. (S1 ^operator O2159 = 0.382942629133266)
  26930. Firing prefer*rvt*predict-yes*H0
  26931. -->
  26932. Firing prefer*rvt*predict-no*H0
  26933. -->
  26934. Firing elaborate*copy-dir-to-output-link
  26935. -->
  26936. (I3 ^dir R +)
  26937. inner elaboration loop at bottom goal.
  26938. Retracting elaborate*copy-see-to-output-link
  26939. -->
  26940. (I3 ^see 1 +)
  26941. Retracting propose*predict-no
  26942. -->
  26943. (O2160 ^name predict-no +)
  26944. (S1 ^operator O2160 +)
  26945. Retracting propose*predict-yes
  26946. -->
  26947. (O2159 ^name predict-yes +)
  26948. (S1 ^operator O2159 +)
  26949. Retracting elaborate*reward*based*on*reward
  26950. -->
  26951. (R1083 ^value 1 +)
  26952. (R1 ^reward R1083 +)
  26953. Retracting elaborate*copy-dir-to-output-link
  26954. -->
  26955. (I3 ^dir U +)
  26956. Retracting rl*prefer*rvt*predict-no*H0*6
  26957. -->
  26958. (S1 ^operator O2160 = 0.9999999999999999)
  26959. Retracting rl*prefer*rvt*predict-yes*H0*5
  26960. -->
  26961. (S1 ^operator O2159 = 0.)
  26962. =>WM: (15190: S1 ^operator O2162 +)
  26963. =>WM: (15189: S1 ^operator O2161 +)
  26964. =>WM: (15188: I3 ^dir R)
  26965. =>WM: (15187: O2162 ^name predict-no)
  26966. =>WM: (15186: O2161 ^name predict-yes)
  26967. =>WM: (15185: R1084 ^value 1)
  26968. =>WM: (15184: R1 ^reward R1084)
  26969. =>WM: (15183: I3 ^see 0)
  26970. <=WM: (15174: S1 ^operator O2159 +)
  26971. <=WM: (15175: S1 ^operator O2160 +)
  26972. <=WM: (15176: S1 ^operator O2160)
  26973. <=WM: (15173: I3 ^dir U)
  26974. <=WM: (15169: R1 ^reward R1083)
  26975. <=WM: (15168: I3 ^see 1)
  26976. <=WM: (15172: O2160 ^name predict-no)
  26977. <=WM: (15171: O2159 ^name predict-yes)
  26978. <=WM: (15170: R1083 ^value 1)
  26979. --- Inner Elaboration Phase, active level 1 (S1) ---
  26980. Firing prefer*rvt*predict-yes*H0
  26981. -->
  26982. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  26983. -->
  26984. (S1 ^operator O2161 = 0.08783148430849691)
  26985. Firing rl*prefer*rvt*predict-yes*H0*3
  26986. -->
  26987. (S1 ^operator O2161 = 0.382942629133266)
  26988. Firing prefer*rvt*predict-yes*H0*3*H1
  26989. -->
  26990. Firing prefer*rvt*predict-no*H0
  26991. -->
  26992. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  26993. -->
  26994. (S1 ^operator O2162 = 0.8730232195176179)
  26995. Firing rl*prefer*rvt*predict-no*H0*4
  26996. -->
  26997. (S1 ^operator O2162 = 0.1269768291953184)
  26998. Firing prefer*rvt*predict-no*H0*4*H1
  26999. -->
  27000. inner elaboration loop at bottom goal.
  27001. Retracting rl*prefer*rvt*predict-no*H0*4
  27002. -->
  27003. (S1 ^operator O2160 = 0.1269768291953184)
  27004. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  27005. -->
  27006. (S1 ^operator O2160 = 0.8730232195176179)
  27007. Retracting rl*prefer*rvt*predict-yes*H0*3
  27008. -->
  27009. (S1 ^operator O2159 = 0.382942629133266)
  27010. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  27011. -->
  27012. (S1 ^operator O2159 = 0.08783148430849691)
  27013. --- END Proposal Phase ---
  27014. --- Decision Phase ---
  27015. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27016. =>WM: (15191: S1 ^operator O2162)
  27017. 1081: O: O2162 (predict-no)
  27018. --- END Decision Phase ---
  27019. --- Application Phase ---
  27020. --- Firing Productions (PE) For State At Depth 1 ---
  27021. --- Inner Elaboration Phase, active level 1 (S1) ---
  27022. Firing apply*operator
  27023. -->
  27024. (I3 ^predict-no N1081 + :O )
  27025. Firing apply*operator*complete
  27026. -->
  27027. (I3 ^predict-no N1080 - :O )
  27028. inner elaboration loop at bottom goal.
  27029. --- Change Working Memory (PE) ---
  27030. =>WM: (15192: I3 ^predict-no N1081)
  27031. <=WM: (15178: N1080 ^status complete)
  27032. <=WM: (15177: I3 ^predict-no N1080)
  27033. --- Firing Productions (IE) For State At Depth 1 ---
  27034. --- Inner Elaboration Phase, active level 1 (S1) ---
  27035. Firing monitor*world
  27036. -->
  27037. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27038. --- Change Working Memory (IE) ---
  27039. --- END Application Phase ---
  27040. --- Output Phase ---
  27041. ENV: Agent did: predict-no for direction R in state State-B
  27042. In State-B moving R
  27043. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27044. predict error 0
  27045. dir: dir isU
  27046. --- END Output Phase ---
  27047. /--- Input Phase ---
  27048. =>WM: (15196: I2 ^dir U)
  27049. =>WM: (15195: I2 ^reward 1)
  27050. =>WM: (15194: I2 ^see 0)
  27051. =>WM: (15193: N1081 ^status complete)
  27052. <=WM: (15181: I2 ^dir R)
  27053. <=WM: (15180: I2 ^reward 1)
  27054. <=WM: (15179: I2 ^see 0)
  27055. =>WM: (15197: I2 ^level-1 R0-root)
  27056. <=WM: (15182: I2 ^level-1 R1-root)
  27057. --- END Input Phase ---
  27058. --- Proposal Phase ---
  27059. --- Inner Elaboration Phase, active level 1 (S1) ---
  27060. Firing elaborate*copy-see-to-output-link
  27061. -->
  27062. (I3 ^see 0 +)
  27063. Firing elaborate*reward*based*on*reward
  27064. -->
  27065. (R1085 ^value 1 +)
  27066. (R1 ^reward R1085 +)
  27067. Firing propose*predict-yes
  27068. -->
  27069. (O2163 ^name predict-yes +)
  27070. (S1 ^operator O2163 +)
  27071. Firing propose*predict-no
  27072. -->
  27073. (O2164 ^name predict-no +)
  27074. (S1 ^operator O2164 +)
  27075. Firing rl*prefer*rvt*predict-no*H0*6
  27076. -->
  27077. (S1 ^operator O2162 = 0.9999999999999999)
  27078. Firing rl*prefer*rvt*predict-yes*H0*5
  27079. -->
  27080. (S1 ^operator O2161 = 0.)
  27081. Firing prefer*rvt*predict-yes*H0
  27082. -->
  27083. Firing prefer*rvt*predict-no*H0
  27084. -->
  27085. Firing elaborate*copy-dir-to-output-link
  27086. -->
  27087. (I3 ^dir U +)
  27088. inner elaboration loop at bottom goal.
  27089. Retracting elaborate*copy-see-to-output-link
  27090. -->
  27091. (I3 ^see 0 +)
  27092. Retracting propose*predict-no
  27093. -->
  27094. (O2162 ^name predict-no +)
  27095. (S1 ^operator O2162 +)
  27096. Retracting propose*predict-yes
  27097. -->
  27098. (O2161 ^name predict-yes +)
  27099. (S1 ^operator O2161 +)
  27100. Retracting elaborate*reward*based*on*reward
  27101. -->
  27102. (R1084 ^value 1 +)
  27103. (R1 ^reward R1084 +)
  27104. Retracting elaborate*copy-dir-to-output-link
  27105. -->
  27106. (I3 ^dir R +)
  27107. Retracting rl*prefer*rvt*predict-no*H0*4
  27108. -->
  27109. (S1 ^operator O2162 = 0.1269768291953184)
  27110. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  27111. -->
  27112. (S1 ^operator O2162 = 0.8730232195176179)
  27113. Retracting rl*prefer*rvt*predict-yes*H0*3
  27114. -->
  27115. (S1 ^operator O2161 = 0.382942629133266)
  27116. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  27117. -->
  27118. (S1 ^operator O2161 = 0.08783148430849691)
  27119. =>WM: (15204: S1 ^operator O2164 +)
  27120. =>WM: (15203: S1 ^operator O2163 +)
  27121. =>WM: (15202: I3 ^dir U)
  27122. =>WM: (15201: O2164 ^name predict-no)
  27123. =>WM: (15200: O2163 ^name predict-yes)
  27124. =>WM: (15199: R1085 ^value 1)
  27125. =>WM: (15198: R1 ^reward R1085)
  27126. <=WM: (15189: S1 ^operator O2161 +)
  27127. <=WM: (15190: S1 ^operator O2162 +)
  27128. <=WM: (15191: S1 ^operator O2162)
  27129. <=WM: (15188: I3 ^dir R)
  27130. <=WM: (15184: R1 ^reward R1084)
  27131. <=WM: (15187: O2162 ^name predict-no)
  27132. <=WM: (15186: O2161 ^name predict-yes)
  27133. <=WM: (15185: R1084 ^value 1)
  27134. --- Inner Elaboration Phase, active level 1 (S1) ---
  27135. Firing prefer*rvt*predict-yes*H0
  27136. -->
  27137. Firing rl*prefer*rvt*predict-yes*H0*5
  27138. -->
  27139. (S1 ^operator O2163 = 0.)
  27140. Firing prefer*rvt*predict-no*H0
  27141. -->
  27142. Firing rl*prefer*rvt*predict-no*H0*6
  27143. -->
  27144. (S1 ^operator O2164 = 0.9999999999999999)
  27145. inner elaboration loop at bottom goal.
  27146. Retracting rl*prefer*rvt*predict-no*H0*6
  27147. -->
  27148. (S1 ^operator O2162 = 0.9999999999999999)
  27149. Retracting rl*prefer*rvt*predict-yes*H0*5
  27150. -->
  27151. (S1 ^operator O2161 = 0.)
  27152. --- END Proposal Phase ---
  27153. --- Decision Phase ---
  27154. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.954315,0.0438206)
  27155. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  27156. =>WM: (15205: S1 ^operator O2164)
  27157. 1082: O: O2164 (predict-no)
  27158. --- END Decision Phase ---
  27159. --- Application Phase ---
  27160. --- Firing Productions (PE) For State At Depth 1 ---
  27161. --- Inner Elaboration Phase, active level 1 (S1) ---
  27162. Firing apply*operator
  27163. -->
  27164. (I3 ^predict-no N1082 + :O )
  27165. Firing apply*operator*complete
  27166. -->
  27167. (I3 ^predict-no N1081 - :O )
  27168. inner elaboration loop at bottom goal.
  27169. --- Change Working Memory (PE) ---
  27170. =>WM: (15206: I3 ^predict-no N1082)
  27171. <=WM: (15193: N1081 ^status complete)
  27172. <=WM: (15192: I3 ^predict-no N1081)
  27173. --- Firing Productions (IE) For State At Depth 1 ---
  27174. --- Inner Elaboration Phase, active level 1 (S1) ---
  27175. Firing monitor*world
  27176. -->
  27177. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27178. --- Change Working Memory (IE) ---
  27179. --- END Application Phase ---
  27180. --- Output Phase ---
  27181. ENV: Agent did: predict-no for direction U in state State-B
  27182. In State-B moving U
  27183. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27184. predict error 0
  27185. dir: dir isU
  27186. --- END Output Phase ---
  27187. |\---- Input Phase ---
  27188. =>WM: (15210: I2 ^dir U)
  27189. =>WM: (15209: I2 ^reward 1)
  27190. =>WM: (15208: I2 ^see 0)
  27191. =>WM: (15207: N1082 ^status complete)
  27192. <=WM: (15196: I2 ^dir U)
  27193. <=WM: (15195: I2 ^reward 1)
  27194. <=WM: (15194: I2 ^see 0)
  27195. =>WM: (15211: I2 ^level-1 R0-root)
  27196. <=WM: (15197: I2 ^level-1 R0-root)
  27197. --- END Input Phase ---
  27198. --- Proposal Phase ---
  27199. --- Inner Elaboration Phase, active level 1 (S1) ---
  27200. Firing elaborate*copy-see-to-output-link
  27201. -->
  27202. (I3 ^see 0 +)
  27203. Firing elaborate*reward*based*on*reward
  27204. -->
  27205. (R1086 ^value 1 +)
  27206. (R1 ^reward R1086 +)
  27207. Firing propose*predict-yes
  27208. -->
  27209. (O2165 ^name predict-yes +)
  27210. (S1 ^operator O2165 +)
  27211. Firing propose*predict-no
  27212. -->
  27213. (O2166 ^name predict-no +)
  27214. (S1 ^operator O2166 +)
  27215. Firing rl*prefer*rvt*predict-no*H0*6
  27216. -->
  27217. (S1 ^operator O2164 = 0.9999999999999999)
  27218. Firing rl*prefer*rvt*predict-yes*H0*5
  27219. -->
  27220. (S1 ^operator O2163 = 0.)
  27221. Firing prefer*rvt*predict-yes*H0
  27222. -->
  27223. Firing prefer*rvt*predict-no*H0
  27224. -->
  27225. Firing elaborate*copy-dir-to-output-link
  27226. -->
  27227. (I3 ^dir U +)
  27228. inner elaboration loop at bottom goal.
  27229. Retracting elaborate*copy-see-to-output-link
  27230. -->
  27231. (I3 ^see 0 +)
  27232. Retracting propose*predict-no
  27233. -->
  27234. (O2164 ^name predict-no +)
  27235. (S1 ^operator O2164 +)
  27236. Retracting propose*predict-yes
  27237. -->
  27238. (O2163 ^name predict-yes +)
  27239. (S1 ^operator O2163 +)
  27240. Retracting elaborate*reward*based*on*reward
  27241. -->
  27242. (R1085 ^value 1 +)
  27243. (R1 ^reward R1085 +)
  27244. Retracting elaborate*copy-dir-to-output-link
  27245. -->
  27246. (I3 ^dir U +)
  27247. Retracting rl*prefer*rvt*predict-no*H0*6
  27248. -->
  27249. (S1 ^operator O2164 = 0.9999999999999999)
  27250. Retracting rl*prefer*rvt*predict-yes*H0*5
  27251. -->
  27252. (S1 ^operator O2163 = 0.)
  27253. =>WM: (15217: S1 ^operator O2166 +)
  27254. =>WM: (15216: S1 ^operator O2165 +)
  27255. =>WM: (15215: O2166 ^name predict-no)
  27256. =>WM: (15214: O2165 ^name predict-yes)
  27257. =>WM: (15213: R1086 ^value 1)
  27258. =>WM: (15212: R1 ^reward R1086)
  27259. <=WM: (15203: S1 ^operator O2163 +)
  27260. <=WM: (15204: S1 ^operator O2164 +)
  27261. <=WM: (15205: S1 ^operator O2164)
  27262. <=WM: (15198: R1 ^reward R1085)
  27263. <=WM: (15201: O2164 ^name predict-no)
  27264. <=WM: (15200: O2163 ^name predict-yes)
  27265. <=WM: (15199: R1085 ^value 1)
  27266. --- Inner Elaboration Phase, active level 1 (S1) ---
  27267. Firing prefer*rvt*predict-yes*H0
  27268. -->
  27269. Firing rl*prefer*rvt*predict-yes*H0*5
  27270. -->
  27271. (S1 ^operator O2165 = 0.)
  27272. Firing prefer*rvt*predict-no*H0
  27273. -->
  27274. Firing rl*prefer*rvt*predict-no*H0*6
  27275. -->
  27276. (S1 ^operator O2166 = 0.9999999999999999)
  27277. inner elaboration loop at bottom goal.
  27278. Retracting rl*prefer*rvt*predict-no*H0*6
  27279. -->
  27280. (S1 ^operator O2164 = 0.9999999999999999)
  27281. Retracting rl*prefer*rvt*predict-yes*H0*5
  27282. -->
  27283. (S1 ^operator O2163 = 0.)
  27284. --- END Proposal Phase ---
  27285. --- Decision Phase ---
  27286. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27287. =>WM: (15218: S1 ^operator O2166)
  27288. 1083: O: O2166 (predict-no)
  27289. --- END Decision Phase ---
  27290. --- Application Phase ---
  27291. --- Firing Productions (PE) For State At Depth 1 ---
  27292. --- Inner Elaboration Phase, active level 1 (S1) ---
  27293. Firing apply*operator
  27294. -->
  27295. (I3 ^predict-no N1083 + :O )
  27296. Firing apply*operator*complete
  27297. -->
  27298. (I3 ^predict-no N1082 - :O )
  27299. inner elaboration loop at bottom goal.
  27300. --- Change Working Memory (PE) ---
  27301. =>WM: (15219: I3 ^predict-no N1083)
  27302. <=WM: (15207: N1082 ^status complete)
  27303. <=WM: (15206: I3 ^predict-no N1082)
  27304. --- Firing Productions (IE) For State At Depth 1 ---
  27305. --- Inner Elaboration Phase, active level 1 (S1) ---
  27306. Firing monitor*world
  27307. -->
  27308. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27309. --- Change Working Memory (IE) ---
  27310. --- END Application Phase ---
  27311. --- Output Phase ---
  27312. ENV: Agent did: predict-no for direction U in state State-B
  27313. In State-B moving U
  27314. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27315. predict error 0
  27316. dir: dir isU
  27317. --- END Output Phase ---
  27318. /|\--- Input Phase ---
  27319. =>WM: (15223: I2 ^dir U)
  27320. =>WM: (15222: I2 ^reward 1)
  27321. =>WM: (15221: I2 ^see 0)
  27322. =>WM: (15220: N1083 ^status complete)
  27323. <=WM: (15210: I2 ^dir U)
  27324. <=WM: (15209: I2 ^reward 1)
  27325. <=WM: (15208: I2 ^see 0)
  27326. =>WM: (15224: I2 ^level-1 R0-root)
  27327. <=WM: (15211: I2 ^level-1 R0-root)
  27328. --- END Input Phase ---
  27329. --- Proposal Phase ---
  27330. --- Inner Elaboration Phase, active level 1 (S1) ---
  27331. Firing elaborate*copy-see-to-output-link
  27332. -->
  27333. (I3 ^see 0 +)
  27334. Firing elaborate*reward*based*on*reward
  27335. -->
  27336. (R1087 ^value 1 +)
  27337. (R1 ^reward R1087 +)
  27338. Firing propose*predict-yes
  27339. -->
  27340. (O2167 ^name predict-yes +)
  27341. (S1 ^operator O2167 +)
  27342. Firing propose*predict-no
  27343. -->
  27344. (O2168 ^name predict-no +)
  27345. (S1 ^operator O2168 +)
  27346. Firing rl*prefer*rvt*predict-no*H0*6
  27347. -->
  27348. (S1 ^operator O2166 = 0.9999999999999999)
  27349. Firing rl*prefer*rvt*predict-yes*H0*5
  27350. -->
  27351. (S1 ^operator O2165 = 0.)
  27352. Firing prefer*rvt*predict-yes*H0
  27353. -->
  27354. Firing prefer*rvt*predict-no*H0
  27355. -->
  27356. Firing elaborate*copy-dir-to-output-link
  27357. -->
  27358. (I3 ^dir U +)
  27359. inner elaboration loop at bottom goal.
  27360. Retracting elaborate*copy-see-to-output-link
  27361. -->
  27362. (I3 ^see 0 +)
  27363. Retracting propose*predict-no
  27364. -->
  27365. (O2166 ^name predict-no +)
  27366. (S1 ^operator O2166 +)
  27367. Retracting propose*predict-yes
  27368. -->
  27369. (O2165 ^name predict-yes +)
  27370. (S1 ^operator O2165 +)
  27371. Retracting elaborate*reward*based*on*reward
  27372. -->
  27373. (R1086 ^value 1 +)
  27374. (R1 ^reward R1086 +)
  27375. Retracting elaborate*copy-dir-to-output-link
  27376. -->
  27377. (I3 ^dir U +)
  27378. Retracting rl*prefer*rvt*predict-no*H0*6
  27379. -->
  27380. (S1 ^operator O2166 = 0.9999999999999999)
  27381. Retracting rl*prefer*rvt*predict-yes*H0*5
  27382. -->
  27383. (S1 ^operator O2165 = 0.)
  27384. =>WM: (15230: S1 ^operator O2168 +)
  27385. =>WM: (15229: S1 ^operator O2167 +)
  27386. =>WM: (15228: O2168 ^name predict-no)
  27387. =>WM: (15227: O2167 ^name predict-yes)
  27388. =>WM: (15226: R1087 ^value 1)
  27389. =>WM: (15225: R1 ^reward R1087)
  27390. <=WM: (15216: S1 ^operator O2165 +)
  27391. <=WM: (15217: S1 ^operator O2166 +)
  27392. <=WM: (15218: S1 ^operator O2166)
  27393. <=WM: (15212: R1 ^reward R1086)
  27394. <=WM: (15215: O2166 ^name predict-no)
  27395. <=WM: (15214: O2165 ^name predict-yes)
  27396. <=WM: (15213: R1086 ^value 1)
  27397. --- Inner Elaboration Phase, active level 1 (S1) ---
  27398. Firing prefer*rvt*predict-yes*H0
  27399. -->
  27400. Firing rl*prefer*rvt*predict-yes*H0*5
  27401. -->
  27402. (S1 ^operator O2167 = 0.)
  27403. Firing prefer*rvt*predict-no*H0
  27404. -->
  27405. Firing rl*prefer*rvt*predict-no*H0*6
  27406. -->
  27407. (S1 ^operator O2168 = 0.9999999999999999)
  27408. inner elaboration loop at bottom goal.
  27409. Retracting rl*prefer*rvt*predict-no*H0*6
  27410. -->
  27411. (S1 ^operator O2166 = 0.9999999999999999)
  27412. Retracting rl*prefer*rvt*predict-yes*H0*5
  27413. -->
  27414. (S1 ^operator O2165 = 0.)
  27415. --- END Proposal Phase ---
  27416. --- Decision Phase ---
  27417. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27418. =>WM: (15231: S1 ^operator O2168)
  27419. 1084: O: O2168 (predict-no)
  27420. --- END Decision Phase ---
  27421. --- Application Phase ---
  27422. --- Firing Productions (PE) For State At Depth 1 ---
  27423. --- Inner Elaboration Phase, active level 1 (S1) ---
  27424. Firing apply*operator
  27425. -->
  27426. (I3 ^predict-no N1084 + :O )
  27427. Firing apply*operator*complete
  27428. -->
  27429. (I3 ^predict-no N1083 - :O )
  27430. inner elaboration loop at bottom goal.
  27431. --- Change Working Memory (PE) ---
  27432. =>WM: (15232: I3 ^predict-no N1084)
  27433. <=WM: (15220: N1083 ^status complete)
  27434. <=WM: (15219: I3 ^predict-no N1083)
  27435. --- Firing Productions (IE) For State At Depth 1 ---
  27436. --- Inner Elaboration Phase, active level 1 (S1) ---
  27437. Firing monitor*world
  27438. -->
  27439. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27440. --- Change Working Memory (IE) ---
  27441. --- END Application Phase ---
  27442. --- Output Phase ---
  27443. ENV: Agent did: predict-no for direction U in state State-B
  27444. In State-B moving U
  27445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27446. predict error 0
  27447. dir: dir isU
  27448. --- END Output Phase ---
  27449. -/|\--- Input Phase ---
  27450. =>WM: (15236: I2 ^dir U)
  27451. =>WM: (15235: I2 ^reward 1)
  27452. =>WM: (15234: I2 ^see 0)
  27453. =>WM: (15233: N1084 ^status complete)
  27454. <=WM: (15223: I2 ^dir U)
  27455. <=WM: (15222: I2 ^reward 1)
  27456. <=WM: (15221: I2 ^see 0)
  27457. =>WM: (15237: I2 ^level-1 R0-root)
  27458. <=WM: (15224: I2 ^level-1 R0-root)
  27459. --- END Input Phase ---
  27460. --- Proposal Phase ---
  27461. --- Inner Elaboration Phase, active level 1 (S1) ---
  27462. Firing elaborate*copy-see-to-output-link
  27463. -->
  27464. (I3 ^see 0 +)
  27465. Firing elaborate*reward*based*on*reward
  27466. -->
  27467. (R1088 ^value 1 +)
  27468. (R1 ^reward R1088 +)
  27469. Firing propose*predict-yes
  27470. -->
  27471. (O2169 ^name predict-yes +)
  27472. (S1 ^operator O2169 +)
  27473. Firing propose*predict-no
  27474. -->
  27475. (O2170 ^name predict-no +)
  27476. (S1 ^operator O2170 +)
  27477. Firing rl*prefer*rvt*predict-no*H0*6
  27478. -->
  27479. (S1 ^operator O2168 = 0.9999999999999999)
  27480. Firing rl*prefer*rvt*predict-yes*H0*5
  27481. -->
  27482. (S1 ^operator O2167 = 0.)
  27483. Firing prefer*rvt*predict-yes*H0
  27484. -->
  27485. Firing prefer*rvt*predict-no*H0
  27486. -->
  27487. Firing elaborate*copy-dir-to-output-link
  27488. -->
  27489. (I3 ^dir U +)
  27490. inner elaboration loop at bottom goal.
  27491. Retracting elaborate*copy-see-to-output-link
  27492. -->
  27493. (I3 ^see 0 +)
  27494. Retracting propose*predict-no
  27495. -->
  27496. (O2168 ^name predict-no +)
  27497. (S1 ^operator O2168 +)
  27498. Retracting propose*predict-yes
  27499. -->
  27500. (O2167 ^name predict-yes +)
  27501. (S1 ^operator O2167 +)
  27502. Retracting elaborate*reward*based*on*reward
  27503. -->
  27504. (R1087 ^value 1 +)
  27505. (R1 ^reward R1087 +)
  27506. Retracting elaborate*copy-dir-to-output-link
  27507. -->
  27508. (I3 ^dir U +)
  27509. Retracting rl*prefer*rvt*predict-no*H0*6
  27510. -->
  27511. (S1 ^operator O2168 = 0.9999999999999999)
  27512. Retracting rl*prefer*rvt*predict-yes*H0*5
  27513. -->
  27514. (S1 ^operator O2167 = 0.)
  27515. =>WM: (15243: S1 ^operator O2170 +)
  27516. =>WM: (15242: S1 ^operator O2169 +)
  27517. =>WM: (15241: O2170 ^name predict-no)
  27518. =>WM: (15240: O2169 ^name predict-yes)
  27519. =>WM: (15239: R1088 ^value 1)
  27520. =>WM: (15238: R1 ^reward R1088)
  27521. <=WM: (15229: S1 ^operator O2167 +)
  27522. <=WM: (15230: S1 ^operator O2168 +)
  27523. <=WM: (15231: S1 ^operator O2168)
  27524. <=WM: (15225: R1 ^reward R1087)
  27525. <=WM: (15228: O2168 ^name predict-no)
  27526. <=WM: (15227: O2167 ^name predict-yes)
  27527. <=WM: (15226: R1087 ^value 1)
  27528. --- Inner Elaboration Phase, active level 1 (S1) ---
  27529. Firing prefer*rvt*predict-yes*H0
  27530. -->
  27531. Firing rl*prefer*rvt*predict-yes*H0*5
  27532. -->
  27533. (S1 ^operator O2169 = 0.)
  27534. Firing prefer*rvt*predict-no*H0
  27535. -->
  27536. Firing rl*prefer*rvt*predict-no*H0*6
  27537. -->
  27538. (S1 ^operator O2170 = 0.9999999999999999)
  27539. inner elaboration loop at bottom goal.
  27540. Retracting rl*prefer*rvt*predict-no*H0*6
  27541. -->
  27542. (S1 ^operator O2168 = 0.9999999999999999)
  27543. Retracting rl*prefer*rvt*predict-yes*H0*5
  27544. -->
  27545. (S1 ^operator O2167 = 0.)
  27546. --- END Proposal Phase ---
  27547. --- Decision Phase ---
  27548. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27549. =>WM: (15244: S1 ^operator O2170)
  27550. 1085: O: O2170 (predict-no)
  27551. --- END Decision Phase ---
  27552. --- Application Phase ---
  27553. --- Firing Productions (PE) For State At Depth 1 ---
  27554. --- Inner Elaboration Phase, active level 1 (S1) ---
  27555. Firing apply*operator
  27556. -->
  27557. (I3 ^predict-no N1085 + :O )
  27558. Firing apply*operator*complete
  27559. -->
  27560. (I3 ^predict-no N1084 - :O )
  27561. inner elaboration loop at bottom goal.
  27562. --- Change Working Memory (PE) ---
  27563. =>WM: (15245: I3 ^predict-no N1085)
  27564. <=WM: (15233: N1084 ^status complete)
  27565. <=WM: (15232: I3 ^predict-no N1084)
  27566. --- Firing Productions (IE) For State At Depth 1 ---
  27567. --- Inner Elaboration Phase, active level 1 (S1) ---
  27568. Firing monitor*world
  27569. -->
  27570. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27571. --- Change Working Memory (IE) ---
  27572. --- END Application Phase ---
  27573. --- Output Phase ---
  27574. ENV: Agent did: predict-no for direction U in state State-B
  27575. In State-B moving U
  27576. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27577. predict error 0
  27578. dir: dir isR
  27579. --- END Output Phase ---
  27580. -/|--- Input Phase ---
  27581. =>WM: (15249: I2 ^dir R)
  27582. =>WM: (15248: I2 ^reward 1)
  27583. =>WM: (15247: I2 ^see 0)
  27584. =>WM: (15246: N1085 ^status complete)
  27585. <=WM: (15236: I2 ^dir U)
  27586. <=WM: (15235: I2 ^reward 1)
  27587. <=WM: (15234: I2 ^see 0)
  27588. =>WM: (15250: I2 ^level-1 R0-root)
  27589. <=WM: (15237: I2 ^level-1 R0-root)
  27590. --- END Input Phase ---
  27591. --- Proposal Phase ---
  27592. --- Inner Elaboration Phase, active level 1 (S1) ---
  27593. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  27594. -->
  27595. (S1 ^operator O2169 = 0.2696941111808541)
  27596. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  27597. -->
  27598. (S1 ^operator O2170 = 0.8730231400765728)
  27599. Firing prefer*rvt*predict-no*H0*4*H1
  27600. -->
  27601. Firing prefer*rvt*predict-yes*H0*3*H1
  27602. -->
  27603. Firing elaborate*copy-see-to-output-link
  27604. -->
  27605. (I3 ^see 0 +)
  27606. Firing elaborate*reward*based*on*reward
  27607. -->
  27608. (R1089 ^value 1 +)
  27609. (R1 ^reward R1089 +)
  27610. Firing propose*predict-yes
  27611. -->
  27612. (O2171 ^name predict-yes +)
  27613. (S1 ^operator O2171 +)
  27614. Firing propose*predict-no
  27615. -->
  27616. (O2172 ^name predict-no +)
  27617. (S1 ^operator O2172 +)
  27618. Firing rl*prefer*rvt*predict-no*H0*4
  27619. -->
  27620. (S1 ^operator O2170 = 0.1269768218883779)
  27621. Firing rl*prefer*rvt*predict-yes*H0*3
  27622. -->
  27623. (S1 ^operator O2169 = 0.382942629133266)
  27624. Firing prefer*rvt*predict-yes*H0
  27625. -->
  27626. Firing prefer*rvt*predict-no*H0
  27627. -->
  27628. Firing elaborate*copy-dir-to-output-link
  27629. -->
  27630. (I3 ^dir R +)
  27631. inner elaboration loop at bottom goal.
  27632. Retracting elaborate*copy-see-to-output-link
  27633. -->
  27634. (I3 ^see 0 +)
  27635. Retracting propose*predict-no
  27636. -->
  27637. (O2170 ^name predict-no +)
  27638. (S1 ^operator O2170 +)
  27639. Retracting propose*predict-yes
  27640. -->
  27641. (O2169 ^name predict-yes +)
  27642. (S1 ^operator O2169 +)
  27643. Retracting elaborate*reward*based*on*reward
  27644. -->
  27645. (R1088 ^value 1 +)
  27646. (R1 ^reward R1088 +)
  27647. Retracting elaborate*copy-dir-to-output-link
  27648. -->
  27649. (I3 ^dir U +)
  27650. Retracting rl*prefer*rvt*predict-no*H0*6
  27651. -->
  27652. (S1 ^operator O2170 = 0.9999999999999999)
  27653. Retracting rl*prefer*rvt*predict-yes*H0*5
  27654. -->
  27655. (S1 ^operator O2169 = 0.)
  27656. =>WM: (15257: S1 ^operator O2172 +)
  27657. =>WM: (15256: S1 ^operator O2171 +)
  27658. =>WM: (15255: I3 ^dir R)
  27659. =>WM: (15254: O2172 ^name predict-no)
  27660. =>WM: (15253: O2171 ^name predict-yes)
  27661. =>WM: (15252: R1089 ^value 1)
  27662. =>WM: (15251: R1 ^reward R1089)
  27663. <=WM: (15242: S1 ^operator O2169 +)
  27664. <=WM: (15243: S1 ^operator O2170 +)
  27665. <=WM: (15244: S1 ^operator O2170)
  27666. <=WM: (15202: I3 ^dir U)
  27667. <=WM: (15238: R1 ^reward R1088)
  27668. <=WM: (15241: O2170 ^name predict-no)
  27669. <=WM: (15240: O2169 ^name predict-yes)
  27670. <=WM: (15239: R1088 ^value 1)
  27671. --- Inner Elaboration Phase, active level 1 (S1) ---
  27672. Firing prefer*rvt*predict-yes*H0
  27673. -->
  27674. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  27675. -->
  27676. (S1 ^operator O2171 = 0.2696941111808541)
  27677. Firing rl*prefer*rvt*predict-yes*H0*3
  27678. -->
  27679. (S1 ^operator O2171 = 0.382942629133266)
  27680. Firing prefer*rvt*predict-yes*H0*3*H1
  27681. -->
  27682. Firing prefer*rvt*predict-no*H0
  27683. -->
  27684. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  27685. -->
  27686. (S1 ^operator O2172 = 0.8730231400765728)
  27687. Firing rl*prefer*rvt*predict-no*H0*4
  27688. -->
  27689. (S1 ^operator O2172 = 0.1269768218883779)
  27690. Firing prefer*rvt*predict-no*H0*4*H1
  27691. -->
  27692. inner elaboration loop at bottom goal.
  27693. Retracting rl*prefer*rvt*predict-no*H0*4
  27694. -->
  27695. (S1 ^operator O2170 = 0.1269768218883779)
  27696. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  27697. -->
  27698. (S1 ^operator O2170 = 0.8730231400765728)
  27699. Retracting rl*prefer*rvt*predict-yes*H0*3
  27700. -->
  27701. (S1 ^operator O2169 = 0.382942629133266)
  27702. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  27703. -->
  27704. (S1 ^operator O2169 = 0.2696941111808541)
  27705. --- END Proposal Phase ---
  27706. --- Decision Phase ---
  27707. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27708. =>WM: (15258: S1 ^operator O2172)
  27709. 1086: O: O2172 (predict-no)
  27710. --- END Decision Phase ---
  27711. --- Application Phase ---
  27712. --- Firing Productions (PE) For State At Depth 1 ---
  27713. --- Inner Elaboration Phase, active level 1 (S1) ---
  27714. Firing apply*operator
  27715. -->
  27716. (I3 ^predict-no N1086 + :O )
  27717. Firing apply*operator*complete
  27718. -->
  27719. (I3 ^predict-no N1085 - :O )
  27720. inner elaboration loop at bottom goal.
  27721. --- Change Working Memory (PE) ---
  27722. =>WM: (15259: I3 ^predict-no N1086)
  27723. <=WM: (15246: N1085 ^status complete)
  27724. <=WM: (15245: I3 ^predict-no N1085)
  27725. --- Firing Productions (IE) For State At Depth 1 ---
  27726. --- Inner Elaboration Phase, active level 1 (S1) ---
  27727. Firing monitor*world
  27728. -->
  27729. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27730. --- Change Working Memory (IE) ---
  27731. --- END Application Phase ---
  27732. --- Output Phase ---
  27733. ENV: Agent did: predict-no for direction R in state State-B
  27734. In State-B moving R
  27735. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27736. predict error 0
  27737. dir: dir isU
  27738. --- END Output Phase ---
  27739. \---- Input Phase ---
  27740. =>WM: (15263: I2 ^dir U)
  27741. =>WM: (15262: I2 ^reward 1)
  27742. =>WM: (15261: I2 ^see 0)
  27743. =>WM: (15260: N1086 ^status complete)
  27744. <=WM: (15249: I2 ^dir R)
  27745. <=WM: (15248: I2 ^reward 1)
  27746. <=WM: (15247: I2 ^see 0)
  27747. =>WM: (15264: I2 ^level-1 R0-root)
  27748. <=WM: (15250: I2 ^level-1 R0-root)
  27749. --- END Input Phase ---
  27750. --- Proposal Phase ---
  27751. --- Inner Elaboration Phase, active level 1 (S1) ---
  27752. Firing elaborate*copy-see-to-output-link
  27753. -->
  27754. (I3 ^see 0 +)
  27755. Firing elaborate*reward*based*on*reward
  27756. -->
  27757. (R1090 ^value 1 +)
  27758. (R1 ^reward R1090 +)
  27759. Firing propose*predict-yes
  27760. -->
  27761. (O2173 ^name predict-yes +)
  27762. (S1 ^operator O2173 +)
  27763. Firing propose*predict-no
  27764. -->
  27765. (O2174 ^name predict-no +)
  27766. (S1 ^operator O2174 +)
  27767. Firing rl*prefer*rvt*predict-no*H0*6
  27768. -->
  27769. (S1 ^operator O2172 = 0.9999999999999999)
  27770. Firing rl*prefer*rvt*predict-yes*H0*5
  27771. -->
  27772. (S1 ^operator O2171 = 0.)
  27773. Firing prefer*rvt*predict-yes*H0
  27774. -->
  27775. Firing prefer*rvt*predict-no*H0
  27776. -->
  27777. Firing elaborate*copy-dir-to-output-link
  27778. -->
  27779. (I3 ^dir U +)
  27780. inner elaboration loop at bottom goal.
  27781. Retracting elaborate*copy-see-to-output-link
  27782. -->
  27783. (I3 ^see 0 +)
  27784. Retracting propose*predict-no
  27785. -->
  27786. (O2172 ^name predict-no +)
  27787. (S1 ^operator O2172 +)
  27788. Retracting propose*predict-yes
  27789. -->
  27790. (O2171 ^name predict-yes +)
  27791. (S1 ^operator O2171 +)
  27792. Retracting elaborate*reward*based*on*reward
  27793. -->
  27794. (R1089 ^value 1 +)
  27795. (R1 ^reward R1089 +)
  27796. Retracting elaborate*copy-dir-to-output-link
  27797. -->
  27798. (I3 ^dir R +)
  27799. Retracting rl*prefer*rvt*predict-no*H0*4
  27800. -->
  27801. (S1 ^operator O2172 = 0.1269768218883779)
  27802. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  27803. -->
  27804. (S1 ^operator O2172 = 0.8730231400765728)
  27805. Retracting rl*prefer*rvt*predict-yes*H0*3
  27806. -->
  27807. (S1 ^operator O2171 = 0.382942629133266)
  27808. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  27809. -->
  27810. (S1 ^operator O2171 = 0.2696941111808541)
  27811. =>WM: (15271: S1 ^operator O2174 +)
  27812. =>WM: (15270: S1 ^operator O2173 +)
  27813. =>WM: (15269: I3 ^dir U)
  27814. =>WM: (15268: O2174 ^name predict-no)
  27815. =>WM: (15267: O2173 ^name predict-yes)
  27816. =>WM: (15266: R1090 ^value 1)
  27817. =>WM: (15265: R1 ^reward R1090)
  27818. <=WM: (15256: S1 ^operator O2171 +)
  27819. <=WM: (15257: S1 ^operator O2172 +)
  27820. <=WM: (15258: S1 ^operator O2172)
  27821. <=WM: (15255: I3 ^dir R)
  27822. <=WM: (15251: R1 ^reward R1089)
  27823. <=WM: (15254: O2172 ^name predict-no)
  27824. <=WM: (15253: O2171 ^name predict-yes)
  27825. <=WM: (15252: R1089 ^value 1)
  27826. --- Inner Elaboration Phase, active level 1 (S1) ---
  27827. Firing prefer*rvt*predict-yes*H0
  27828. -->
  27829. Firing rl*prefer*rvt*predict-yes*H0*5
  27830. -->
  27831. (S1 ^operator O2173 = 0.)
  27832. Firing prefer*rvt*predict-no*H0
  27833. -->
  27834. Firing rl*prefer*rvt*predict-no*H0*6
  27835. -->
  27836. (S1 ^operator O2174 = 0.9999999999999999)
  27837. inner elaboration loop at bottom goal.
  27838. Retracting rl*prefer*rvt*predict-no*H0*6
  27839. -->
  27840. (S1 ^operator O2172 = 0.9999999999999999)
  27841. Retracting rl*prefer*rvt*predict-yes*H0*5
  27842. -->
  27843. (S1 ^operator O2171 = 0.)
  27844. --- END Proposal Phase ---
  27845. --- Decision Phase ---
  27846. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.954545,0.0436087)
  27847. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  27848. =>WM: (15272: S1 ^operator O2174)
  27849. 1087: O: O2174 (predict-no)
  27850. --- END Decision Phase ---
  27851. --- Application Phase ---
  27852. --- Firing Productions (PE) For State At Depth 1 ---
  27853. --- Inner Elaboration Phase, active level 1 (S1) ---
  27854. Firing apply*operator
  27855. -->
  27856. (I3 ^predict-no N1087 + :O )
  27857. Firing apply*operator*complete
  27858. -->
  27859. (I3 ^predict-no N1086 - :O )
  27860. inner elaboration loop at bottom goal.
  27861. --- Change Working Memory (PE) ---
  27862. =>WM: (15273: I3 ^predict-no N1087)
  27863. <=WM: (15260: N1086 ^status complete)
  27864. <=WM: (15259: I3 ^predict-no N1086)
  27865. --- Firing Productions (IE) For State At Depth 1 ---
  27866. --- Inner Elaboration Phase, active level 1 (S1) ---
  27867. Firing monitor*world
  27868. -->
  27869. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27870. --- Change Working Memory (IE) ---
  27871. --- END Application Phase ---
  27872. --- Output Phase ---
  27873. ENV: Agent did: predict-no for direction U in state State-B
  27874. In State-B moving U
  27875. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27876. predict error 0
  27877. dir: dir isL
  27878. --- END Output Phase ---
  27879. /|\--- Input Phase ---
  27880. =>WM: (15277: I2 ^dir L)
  27881. =>WM: (15276: I2 ^reward 1)
  27882. =>WM: (15275: I2 ^see 0)
  27883. =>WM: (15274: N1087 ^status complete)
  27884. <=WM: (15263: I2 ^dir U)
  27885. <=WM: (15262: I2 ^reward 1)
  27886. <=WM: (15261: I2 ^see 0)
  27887. =>WM: (15278: I2 ^level-1 R0-root)
  27888. <=WM: (15264: I2 ^level-1 R0-root)
  27889. --- END Input Phase ---
  27890. --- Proposal Phase ---
  27891. --- Inner Elaboration Phase, active level 1 (S1) ---
  27892. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  27893. -->
  27894. (S1 ^operator O2173 = 0.4768809749851805)
  27895. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  27896. -->
  27897. (S1 ^operator O2174 = 0.1700769046561409)
  27898. Firing prefer*rvt*predict-no*H0*2*H1
  27899. -->
  27900. Firing prefer*rvt*predict-yes*H0*1*H1
  27901. -->
  27902. Firing elaborate*copy-see-to-output-link
  27903. -->
  27904. (I3 ^see 0 +)
  27905. Firing elaborate*reward*based*on*reward
  27906. -->
  27907. (R1091 ^value 1 +)
  27908. (R1 ^reward R1091 +)
  27909. Firing propose*predict-yes
  27910. -->
  27911. (O2175 ^name predict-yes +)
  27912. (S1 ^operator O2175 +)
  27913. Firing propose*predict-no
  27914. -->
  27915. (O2176 ^name predict-no +)
  27916. (S1 ^operator O2176 +)
  27917. Firing rl*prefer*rvt*predict-no*H0*2
  27918. -->
  27919. (S1 ^operator O2174 = 0.2550133863879636)
  27920. Firing rl*prefer*rvt*predict-yes*H0*1
  27921. -->
  27922. (S1 ^operator O2173 = 0.5231192467369606)
  27923. Firing prefer*rvt*predict-yes*H0
  27924. -->
  27925. Firing prefer*rvt*predict-no*H0
  27926. -->
  27927. Firing elaborate*copy-dir-to-output-link
  27928. -->
  27929. (I3 ^dir L +)
  27930. inner elaboration loop at bottom goal.
  27931. Retracting elaborate*copy-see-to-output-link
  27932. -->
  27933. (I3 ^see 0 +)
  27934. Retracting propose*predict-no
  27935. -->
  27936. (O2174 ^name predict-no +)
  27937. (S1 ^operator O2174 +)
  27938. Retracting propose*predict-yes
  27939. -->
  27940. (O2173 ^name predict-yes +)
  27941. (S1 ^operator O2173 +)
  27942. Retracting elaborate*reward*based*on*reward
  27943. -->
  27944. (R1090 ^value 1 +)
  27945. (R1 ^reward R1090 +)
  27946. Retracting elaborate*copy-dir-to-output-link
  27947. -->
  27948. (I3 ^dir U +)
  27949. Retracting rl*prefer*rvt*predict-no*H0*6
  27950. -->
  27951. (S1 ^operator O2174 = 0.9999999999999999)
  27952. Retracting rl*prefer*rvt*predict-yes*H0*5
  27953. -->
  27954. (S1 ^operator O2173 = 0.)
  27955. =>WM: (15285: S1 ^operator O2176 +)
  27956. =>WM: (15284: S1 ^operator O2175 +)
  27957. =>WM: (15283: I3 ^dir L)
  27958. =>WM: (15282: O2176 ^name predict-no)
  27959. =>WM: (15281: O2175 ^name predict-yes)
  27960. =>WM: (15280: R1091 ^value 1)
  27961. =>WM: (15279: R1 ^reward R1091)
  27962. <=WM: (15270: S1 ^operator O2173 +)
  27963. <=WM: (15271: S1 ^operator O2174 +)
  27964. <=WM: (15272: S1 ^operator O2174)
  27965. <=WM: (15269: I3 ^dir U)
  27966. <=WM: (15265: R1 ^reward R1090)
  27967. <=WM: (15268: O2174 ^name predict-no)
  27968. <=WM: (15267: O2173 ^name predict-yes)
  27969. <=WM: (15266: R1090 ^value 1)
  27970. --- Inner Elaboration Phase, active level 1 (S1) ---
  27971. Firing prefer*rvt*predict-yes*H0
  27972. -->
  27973. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  27974. -->
  27975. (S1 ^operator O2175 = 0.4768809749851805)
  27976. Firing rl*prefer*rvt*predict-yes*H0*1
  27977. -->
  27978. (S1 ^operator O2175 = 0.5231192467369606)
  27979. Firing prefer*rvt*predict-yes*H0*1*H1
  27980. -->
  27981. Firing prefer*rvt*predict-no*H0
  27982. -->
  27983. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  27984. -->
  27985. (S1 ^operator O2176 = 0.1700769046561409)
  27986. Firing rl*prefer*rvt*predict-no*H0*2
  27987. -->
  27988. (S1 ^operator O2176 = 0.2550133863879636)
  27989. Firing prefer*rvt*predict-no*H0*2*H1
  27990. -->
  27991. inner elaboration loop at bottom goal.
  27992. Retracting rl*prefer*rvt*predict-no*H0*2
  27993. -->
  27994. (S1 ^operator O2174 = 0.2550133863879636)
  27995. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  27996. -->
  27997. (S1 ^operator O2174 = 0.1700769046561409)
  27998. Retracting rl*prefer*rvt*predict-yes*H0*1
  27999. -->
  28000. (S1 ^operator O2173 = 0.5231192467369606)
  28001. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  28002. -->
  28003. (S1 ^operator O2173 = 0.4768809749851805)
  28004. --- END Proposal Phase ---
  28005. --- Decision Phase ---
  28006. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28007. =>WM: (15286: S1 ^operator O2175)
  28008. 1088: O: O2175 (predict-yes)
  28009. --- END Decision Phase ---
  28010. --- Application Phase ---
  28011. --- Firing Productions (PE) For State At Depth 1 ---
  28012. --- Inner Elaboration Phase, active level 1 (S1) ---
  28013. Firing apply*operator
  28014. -->
  28015. (I3 ^predict-yes N1088 + :O )
  28016. Firing apply*operator*complete
  28017. -->
  28018. (I3 ^predict-no N1087 - :O )
  28019. inner elaboration loop at bottom goal.
  28020. --- Change Working Memory (PE) ---
  28021. =>WM: (15287: I3 ^predict-yes N1088)
  28022. <=WM: (15274: N1087 ^status complete)
  28023. <=WM: (15273: I3 ^predict-no N1087)
  28024. --- Firing Productions (IE) For State At Depth 1 ---
  28025. --- Inner Elaboration Phase, active level 1 (S1) ---
  28026. Firing monitor*world
  28027. -->
  28028. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28029. --- Change Working Memory (IE) ---
  28030. --- END Application Phase ---
  28031. --- Output Phase ---
  28032. ENV: Agent did: predict-yes for direction L in state State-B
  28033. In State-B moving L
  28034. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28035. predict error 0
  28036. dir: dir isU
  28037. --- END Output Phase ---
  28038. -/|--- Input Phase ---
  28039. =>WM: (15291: I2 ^dir U)
  28040. =>WM: (15290: I2 ^reward 1)
  28041. =>WM: (15289: I2 ^see 1)
  28042. =>WM: (15288: N1088 ^status complete)
  28043. <=WM: (15277: I2 ^dir L)
  28044. <=WM: (15276: I2 ^reward 1)
  28045. <=WM: (15275: I2 ^see 0)
  28046. =>WM: (15292: I2 ^level-1 L1-root)
  28047. <=WM: (15278: I2 ^level-1 R0-root)
  28048. --- END Input Phase ---
  28049. --- Proposal Phase ---
  28050. --- Inner Elaboration Phase, active level 1 (S1) ---
  28051. Firing elaborate*copy-see-to-output-link
  28052. -->
  28053. (I3 ^see 1 +)
  28054. Firing elaborate*reward*based*on*reward
  28055. -->
  28056. (R1092 ^value 1 +)
  28057. (R1 ^reward R1092 +)
  28058. Firing propose*predict-yes
  28059. -->
  28060. (O2177 ^name predict-yes +)
  28061. (S1 ^operator O2177 +)
  28062. Firing propose*predict-no
  28063. -->
  28064. (O2178 ^name predict-no +)
  28065. (S1 ^operator O2178 +)
  28066. Firing rl*prefer*rvt*predict-no*H0*6
  28067. -->
  28068. (S1 ^operator O2176 = 0.9999999999999999)
  28069. Firing rl*prefer*rvt*predict-yes*H0*5
  28070. -->
  28071. (S1 ^operator O2175 = 0.)
  28072. Firing prefer*rvt*predict-yes*H0
  28073. -->
  28074. Firing prefer*rvt*predict-no*H0
  28075. -->
  28076. Firing elaborate*copy-dir-to-output-link
  28077. -->
  28078. (I3 ^dir U +)
  28079. inner elaboration loop at bottom goal.
  28080. Retracting elaborate*copy-see-to-output-link
  28081. -->
  28082. (I3 ^see 0 +)
  28083. Retracting propose*predict-no
  28084. -->
  28085. (O2176 ^name predict-no +)
  28086. (S1 ^operator O2176 +)
  28087. Retracting propose*predict-yes
  28088. -->
  28089. (O2175 ^name predict-yes +)
  28090. (S1 ^operator O2175 +)
  28091. Retracting elaborate*reward*based*on*reward
  28092. -->
  28093. (R1091 ^value 1 +)
  28094. (R1 ^reward R1091 +)
  28095. Retracting elaborate*copy-dir-to-output-link
  28096. -->
  28097. (I3 ^dir L +)
  28098. Retracting rl*prefer*rvt*predict-no*H0*2
  28099. -->
  28100. (S1 ^operator O2176 = 0.2550133863879636)
  28101. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  28102. -->
  28103. (S1 ^operator O2176 = 0.1700769046561409)
  28104. Retracting rl*prefer*rvt*predict-yes*H0*1
  28105. -->
  28106. (S1 ^operator O2175 = 0.5231192467369606)
  28107. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  28108. -->
  28109. (S1 ^operator O2175 = 0.4768809749851805)
  28110. =>WM: (15300: S1 ^operator O2178 +)
  28111. =>WM: (15299: S1 ^operator O2177 +)
  28112. =>WM: (15298: I3 ^dir U)
  28113. =>WM: (15297: O2178 ^name predict-no)
  28114. =>WM: (15296: O2177 ^name predict-yes)
  28115. =>WM: (15295: R1092 ^value 1)
  28116. =>WM: (15294: R1 ^reward R1092)
  28117. =>WM: (15293: I3 ^see 1)
  28118. <=WM: (15284: S1 ^operator O2175 +)
  28119. <=WM: (15286: S1 ^operator O2175)
  28120. <=WM: (15285: S1 ^operator O2176 +)
  28121. <=WM: (15283: I3 ^dir L)
  28122. <=WM: (15279: R1 ^reward R1091)
  28123. <=WM: (15183: I3 ^see 0)
  28124. <=WM: (15282: O2176 ^name predict-no)
  28125. <=WM: (15281: O2175 ^name predict-yes)
  28126. <=WM: (15280: R1091 ^value 1)
  28127. --- Inner Elaboration Phase, active level 1 (S1) ---
  28128. Firing prefer*rvt*predict-yes*H0
  28129. -->
  28130. Firing rl*prefer*rvt*predict-yes*H0*5
  28131. -->
  28132. (S1 ^operator O2177 = 0.)
  28133. Firing prefer*rvt*predict-no*H0
  28134. -->
  28135. Firing rl*prefer*rvt*predict-no*H0*6
  28136. -->
  28137. (S1 ^operator O2178 = 0.9999999999999999)
  28138. inner elaboration loop at bottom goal.
  28139. Retracting rl*prefer*rvt*predict-no*H0*6
  28140. -->
  28141. (S1 ^operator O2176 = 0.9999999999999999)
  28142. Retracting rl*prefer*rvt*predict-yes*H0*5
  28143. -->
  28144. (S1 ^operator O2175 = 0.)
  28145. --- END Proposal Phase ---
  28146. --- Decision Phase ---
  28147. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980769,0.0189826)
  28148. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272041 0.20484 0.476881 -> 0.272041 0.20484 0.476881(R,m,v=1,1,0)
  28149. =>WM: (15301: S1 ^operator O2178)
  28150. 1089: O: O2178 (predict-no)
  28151. --- END Decision Phase ---
  28152. --- Application Phase ---
  28153. --- Firing Productions (PE) For State At Depth 1 ---
  28154. --- Inner Elaboration Phase, active level 1 (S1) ---
  28155. Firing apply*operator
  28156. -->
  28157. (I3 ^predict-no N1089 + :O )
  28158. Firing apply*operator*complete
  28159. -->
  28160. (I3 ^predict-yes N1088 - :O )
  28161. inner elaboration loop at bottom goal.
  28162. --- Change Working Memory (PE) ---
  28163. =>WM: (15302: I3 ^predict-no N1089)
  28164. <=WM: (15288: N1088 ^status complete)
  28165. <=WM: (15287: I3 ^predict-yes N1088)
  28166. --- Firing Productions (IE) For State At Depth 1 ---
  28167. --- Inner Elaboration Phase, active level 1 (S1) ---
  28168. Firing monitor*world
  28169. -->
  28170. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28171. --- Change Working Memory (IE) ---
  28172. --- END Application Phase ---
  28173. --- Output Phase ---
  28174. ENV: Agent did: predict-no for direction U in state State-A
  28175. In State-A moving U
  28176. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28177. predict error 0
  28178. dir: dir isR
  28179. --- END Output Phase ---
  28180. \-/--- Input Phase ---
  28181. =>WM: (15306: I2 ^dir R)
  28182. =>WM: (15305: I2 ^reward 1)
  28183. =>WM: (15304: I2 ^see 0)
  28184. =>WM: (15303: N1089 ^status complete)
  28185. <=WM: (15291: I2 ^dir U)
  28186. <=WM: (15290: I2 ^reward 1)
  28187. <=WM: (15289: I2 ^see 1)
  28188. =>WM: (15307: I2 ^level-1 L1-root)
  28189. <=WM: (15292: I2 ^level-1 L1-root)
  28190. --- END Input Phase ---
  28191. --- Proposal Phase ---
  28192. --- Inner Elaboration Phase, active level 1 (S1) ---
  28193. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  28194. -->
  28195. (S1 ^operator O2177 = 0.6170510733049686)
  28196. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  28197. -->
  28198. (S1 ^operator O2178 = 0.4901349546100854)
  28199. Firing prefer*rvt*predict-no*H0*4*H1
  28200. -->
  28201. Firing prefer*rvt*predict-yes*H0*3*H1
  28202. -->
  28203. Firing elaborate*copy-see-to-output-link
  28204. -->
  28205. (I3 ^see 0 +)
  28206. Firing elaborate*reward*based*on*reward
  28207. -->
  28208. (R1093 ^value 1 +)
  28209. (R1 ^reward R1093 +)
  28210. Firing propose*predict-yes
  28211. -->
  28212. (O2179 ^name predict-yes +)
  28213. (S1 ^operator O2179 +)
  28214. Firing propose*predict-no
  28215. -->
  28216. (O2180 ^name predict-no +)
  28217. (S1 ^operator O2180 +)
  28218. Firing rl*prefer*rvt*predict-no*H0*4
  28219. -->
  28220. (S1 ^operator O2178 = 0.1269768275936353)
  28221. Firing rl*prefer*rvt*predict-yes*H0*3
  28222. -->
  28223. (S1 ^operator O2177 = 0.382942629133266)
  28224. Firing prefer*rvt*predict-yes*H0
  28225. -->
  28226. Firing prefer*rvt*predict-no*H0
  28227. -->
  28228. Firing elaborate*copy-dir-to-output-link
  28229. -->
  28230. (I3 ^dir R +)
  28231. inner elaboration loop at bottom goal.
  28232. Retracting elaborate*copy-see-to-output-link
  28233. -->
  28234. (I3 ^see 1 +)
  28235. Retracting propose*predict-no
  28236. -->
  28237. (O2178 ^name predict-no +)
  28238. (S1 ^operator O2178 +)
  28239. Retracting propose*predict-yes
  28240. -->
  28241. (O2177 ^name predict-yes +)
  28242. (S1 ^operator O2177 +)
  28243. Retracting elaborate*reward*based*on*reward
  28244. -->
  28245. (R1092 ^value 1 +)
  28246. (R1 ^reward R1092 +)
  28247. Retracting elaborate*copy-dir-to-output-link
  28248. -->
  28249. (I3 ^dir U +)
  28250. Retracting rl*prefer*rvt*predict-no*H0*6
  28251. -->
  28252. (S1 ^operator O2178 = 0.9999999999999999)
  28253. Retracting rl*prefer*rvt*predict-yes*H0*5
  28254. -->
  28255. (S1 ^operator O2177 = 0.)
  28256. =>WM: (15315: S1 ^operator O2180 +)
  28257. =>WM: (15314: S1 ^operator O2179 +)
  28258. =>WM: (15313: I3 ^dir R)
  28259. =>WM: (15312: O2180 ^name predict-no)
  28260. =>WM: (15311: O2179 ^name predict-yes)
  28261. =>WM: (15310: R1093 ^value 1)
  28262. =>WM: (15309: R1 ^reward R1093)
  28263. =>WM: (15308: I3 ^see 0)
  28264. <=WM: (15299: S1 ^operator O2177 +)
  28265. <=WM: (15300: S1 ^operator O2178 +)
  28266. <=WM: (15301: S1 ^operator O2178)
  28267. <=WM: (15298: I3 ^dir U)
  28268. <=WM: (15294: R1 ^reward R1092)
  28269. <=WM: (15293: I3 ^see 1)
  28270. <=WM: (15297: O2178 ^name predict-no)
  28271. <=WM: (15296: O2177 ^name predict-yes)
  28272. <=WM: (15295: R1092 ^value 1)
  28273. --- Inner Elaboration Phase, active level 1 (S1) ---
  28274. Firing prefer*rvt*predict-yes*H0
  28275. -->
  28276. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  28277. -->
  28278. (S1 ^operator O2179 = 0.6170510733049686)
  28279. Firing rl*prefer*rvt*predict-yes*H0*3
  28280. -->
  28281. (S1 ^operator O2179 = 0.382942629133266)
  28282. Firing prefer*rvt*predict-yes*H0*3*H1
  28283. -->
  28284. Firing prefer*rvt*predict-no*H0
  28285. -->
  28286. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  28287. -->
  28288. (S1 ^operator O2180 = 0.4901349546100854)
  28289. Firing rl*prefer*rvt*predict-no*H0*4
  28290. -->
  28291. (S1 ^operator O2180 = 0.1269768275936353)
  28292. Firing prefer*rvt*predict-no*H0*4*H1
  28293. -->
  28294. inner elaboration loop at bottom goal.
  28295. Retracting rl*prefer*rvt*predict-no*H0*4
  28296. -->
  28297. (S1 ^operator O2178 = 0.1269768275936353)
  28298. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  28299. -->
  28300. (S1 ^operator O2178 = 0.4901349546100854)
  28301. Retracting rl*prefer*rvt*predict-yes*H0*3
  28302. -->
  28303. (S1 ^operator O2177 = 0.382942629133266)
  28304. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  28305. -->
  28306. (S1 ^operator O2177 = 0.6170510733049686)
  28307. --- END Proposal Phase ---
  28308. --- Decision Phase ---
  28309. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28310. =>WM: (15316: S1 ^operator O2179)
  28311. 1090: O: O2179 (predict-yes)
  28312. --- END Decision Phase ---
  28313. --- Application Phase ---
  28314. --- Firing Productions (PE) For State At Depth 1 ---
  28315. --- Inner Elaboration Phase, active level 1 (S1) ---
  28316. Firing apply*operator
  28317. -->
  28318. (I3 ^predict-yes N1090 + :O )
  28319. Firing apply*operator*complete
  28320. -->
  28321. (I3 ^predict-no N1089 - :O )
  28322. inner elaboration loop at bottom goal.
  28323. --- Change Working Memory (PE) ---
  28324. =>WM: (15317: I3 ^predict-yes N1090)
  28325. <=WM: (15303: N1089 ^status complete)
  28326. <=WM: (15302: I3 ^predict-no N1089)
  28327. --- Firing Productions (IE) For State At Depth 1 ---
  28328. --- Inner Elaboration Phase, active level 1 (S1) ---
  28329. Firing monitor*world
  28330. -->
  28331. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28332. --- Change Working Memory (IE) ---
  28333. --- END Application Phase ---
  28334. --- Output Phase ---
  28335. ENV: Agent did: predict-yes for direction R in state State-A
  28336. In State-A moving R
  28337. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28338. predict error 0
  28339. dir: dir isR
  28340. --- END Output Phase ---
  28341. |\--- Input Phase ---
  28342. =>WM: (15321: I2 ^dir R)
  28343. =>WM: (15320: I2 ^reward 1)
  28344. =>WM: (15319: I2 ^see 1)
  28345. =>WM: (15318: N1090 ^status complete)
  28346. <=WM: (15306: I2 ^dir R)
  28347. <=WM: (15305: I2 ^reward 1)
  28348. <=WM: (15304: I2 ^see 0)
  28349. =>WM: (15322: I2 ^level-1 R1-root)
  28350. <=WM: (15307: I2 ^level-1 L1-root)
  28351. --- END Input Phase ---
  28352. --- Proposal Phase ---
  28353. --- Inner Elaboration Phase, active level 1 (S1) ---
  28354. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  28355. -->
  28356. (S1 ^operator O2179 = 0.08783148430849691)
  28357. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  28358. -->
  28359. (S1 ^operator O2180 = 0.8730232122106774)
  28360. Firing prefer*rvt*predict-no*H0*4*H1
  28361. -->
  28362. Firing prefer*rvt*predict-yes*H0*3*H1
  28363. -->
  28364. Firing elaborate*copy-see-to-output-link
  28365. -->
  28366. (I3 ^see 1 +)
  28367. Firing elaborate*reward*based*on*reward
  28368. -->
  28369. (R1094 ^value 1 +)
  28370. (R1 ^reward R1094 +)
  28371. Firing propose*predict-yes
  28372. -->
  28373. (O2181 ^name predict-yes +)
  28374. (S1 ^operator O2181 +)
  28375. Firing propose*predict-no
  28376. -->
  28377. (O2182 ^name predict-no +)
  28378. (S1 ^operator O2182 +)
  28379. Firing rl*prefer*rvt*predict-no*H0*4
  28380. -->
  28381. (S1 ^operator O2180 = 0.1269768275936353)
  28382. Firing rl*prefer*rvt*predict-yes*H0*3
  28383. -->
  28384. (S1 ^operator O2179 = 0.382942629133266)
  28385. Firing prefer*rvt*predict-yes*H0
  28386. -->
  28387. Firing prefer*rvt*predict-no*H0
  28388. -->
  28389. Firing elaborate*copy-dir-to-output-link
  28390. -->
  28391. (I3 ^dir R +)
  28392. inner elaboration loop at bottom goal.
  28393. Retracting elaborate*copy-see-to-output-link
  28394. -->
  28395. (I3 ^see 0 +)
  28396. Retracting propose*predict-no
  28397. -->
  28398. (O2180 ^name predict-no +)
  28399. (S1 ^operator O2180 +)
  28400. Retracting propose*predict-yes
  28401. -->
  28402. (O2179 ^name predict-yes +)
  28403. (S1 ^operator O2179 +)
  28404. Retracting elaborate*reward*based*on*reward
  28405. -->
  28406. (R1093 ^value 1 +)
  28407. (R1 ^reward R1093 +)
  28408. Retracting elaborate*copy-dir-to-output-link
  28409. -->
  28410. (I3 ^dir R +)
  28411. Retracting rl*prefer*rvt*predict-no*H0*4
  28412. -->
  28413. (S1 ^operator O2180 = 0.1269768275936353)
  28414. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  28415. -->
  28416. (S1 ^operator O2180 = 0.4901349546100854)
  28417. Retracting rl*prefer*rvt*predict-yes*H0*3
  28418. -->
  28419. (S1 ^operator O2179 = 0.382942629133266)
  28420. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  28421. -->
  28422. (S1 ^operator O2179 = 0.6170510733049686)
  28423. =>WM: (15329: S1 ^operator O2182 +)
  28424. =>WM: (15328: S1 ^operator O2181 +)
  28425. =>WM: (15327: O2182 ^name predict-no)
  28426. =>WM: (15326: O2181 ^name predict-yes)
  28427. =>WM: (15325: R1094 ^value 1)
  28428. =>WM: (15324: R1 ^reward R1094)
  28429. =>WM: (15323: I3 ^see 1)
  28430. <=WM: (15314: S1 ^operator O2179 +)
  28431. <=WM: (15316: S1 ^operator O2179)
  28432. <=WM: (15315: S1 ^operator O2180 +)
  28433. <=WM: (15309: R1 ^reward R1093)
  28434. <=WM: (15308: I3 ^see 0)
  28435. <=WM: (15312: O2180 ^name predict-no)
  28436. <=WM: (15311: O2179 ^name predict-yes)
  28437. <=WM: (15310: R1093 ^value 1)
  28438. --- Inner Elaboration Phase, active level 1 (S1) ---
  28439. Firing prefer*rvt*predict-yes*H0
  28440. -->
  28441. Firing rl*prefer*rvt*predict-yes*H0*3
  28442. -->
  28443. (S1 ^operator O2181 = 0.382942629133266)
  28444. Firing prefer*rvt*predict-yes*H0*3*H1
  28445. -->
  28446. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  28447. -->
  28448. (S1 ^operator O2181 = 0.08783148430849691)
  28449. Firing prefer*rvt*predict-no*H0
  28450. -->
  28451. Firing rl*prefer*rvt*predict-no*H0*4
  28452. -->
  28453. (S1 ^operator O2182 = 0.1269768275936353)
  28454. Firing prefer*rvt*predict-no*H0*4*H1
  28455. -->
  28456. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  28457. -->
  28458. (S1 ^operator O2182 = 0.8730232122106774)
  28459. inner elaboration loop at bottom goal.
  28460. Retracting rl*prefer*rvt*predict-no*H0*4
  28461. -->
  28462. (S1 ^operator O2180 = 0.1269768275936353)
  28463. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  28464. -->
  28465. (S1 ^operator O2180 = 0.8730232122106774)
  28466. Retracting rl*prefer*rvt*predict-yes*H0*3
  28467. -->
  28468. (S1 ^operator O2179 = 0.382942629133266)
  28469. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  28470. -->
  28471. (S1 ^operator O2179 = 0.08783148430849691)
  28472. --- END Proposal Phase ---
  28473. --- Decision Phase ---
  28474. RL update rl*prefer*rvt*predict-yes*H0*3 0.673136 -0.290193 0.382943 -> 0.673136 -0.290193 0.382944(R,m,v=1,0.964072,0.034846)
  28475. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.326859 0.290192 0.617051 -> 0.32686 0.290192 0.617052(R,m,v=1,1,0)
  28476. =>WM: (15330: S1 ^operator O2182)
  28477. 1091: O: O2182 (predict-no)
  28478. --- END Decision Phase ---
  28479. --- Application Phase ---
  28480. --- Firing Productions (PE) For State At Depth 1 ---
  28481. --- Inner Elaboration Phase, active level 1 (S1) ---
  28482. Firing apply*operator
  28483. -->
  28484. (I3 ^predict-no N1091 + :O )
  28485. Firing apply*operator*complete
  28486. -->
  28487. (I3 ^predict-yes N1090 - :O )
  28488. inner elaboration loop at bottom goal.
  28489. --- Change Working Memory (PE) ---
  28490. =>WM: (15331: I3 ^predict-no N1091)
  28491. <=WM: (15318: N1090 ^status complete)
  28492. <=WM: (15317: I3 ^predict-yes N1090)
  28493. --- Firing Productions (IE) For State At Depth 1 ---
  28494. --- Inner Elaboration Phase, active level 1 (S1) ---
  28495. Firing monitor*world
  28496. -->
  28497. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28498. --- Change Working Memory (IE) ---
  28499. --- END Application Phase ---
  28500. --- Output Phase ---
  28501. ENV: Agent did: predict-no for direction R in state State-B
  28502. In State-B moving R
  28503. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28504. predict error 0
  28505. dir: dir isU
  28506. --- END Output Phase ---
  28507. ---- Input Phase ---
  28508. =>WM: (15335: I2 ^dir U)
  28509. =>WM: (15334: I2 ^reward 1)
  28510. =>WM: (15333: I2 ^see 0)
  28511. =>WM: (15332: N1091 ^status complete)
  28512. <=WM: (15321: I2 ^dir R)
  28513. <=WM: (15320: I2 ^reward 1)
  28514. <=WM: (15319: I2 ^see 1)
  28515. =>WM: (15336: I2 ^level-1 R0-root)
  28516. <=WM: (15322: I2 ^level-1 R1-root)
  28517. --- END Input Phase ---
  28518. --- Proposal Phase ---
  28519. --- Inner Elaboration Phase, active level 1 (S1) ---
  28520. Firing elaborate*copy-see-to-output-link
  28521. -->
  28522. (I3 ^see 0 +)
  28523. Firing elaborate*reward*based*on*reward
  28524. -->
  28525. (R1095 ^value 1 +)
  28526. (R1 ^reward R1095 +)
  28527. Firing propose*predict-yes
  28528. -->
  28529. (O2183 ^name predict-yes +)
  28530. (S1 ^operator O2183 +)
  28531. Firing propose*predict-no
  28532. -->
  28533. (O2184 ^name predict-no +)
  28534. (S1 ^operator O2184 +)
  28535. Firing rl*prefer*rvt*predict-no*H0*6
  28536. -->
  28537. (S1 ^operator O2182 = 0.9999999999999999)
  28538. Firing rl*prefer*rvt*predict-yes*H0*5
  28539. -->
  28540. (S1 ^operator O2181 = 0.)
  28541. Firing prefer*rvt*predict-yes*H0
  28542. -->
  28543. Firing prefer*rvt*predict-no*H0
  28544. -->
  28545. Firing elaborate*copy-dir-to-output-link
  28546. -->
  28547. (I3 ^dir U +)
  28548. inner elaboration loop at bottom goal.
  28549. Retracting elaborate*copy-see-to-output-link
  28550. -->
  28551. (I3 ^see 1 +)
  28552. Retracting propose*predict-no
  28553. -->
  28554. (O2182 ^name predict-no +)
  28555. (S1 ^operator O2182 +)
  28556. Retracting propose*predict-yes
  28557. -->
  28558. (O2181 ^name predict-yes +)
  28559. (S1 ^operator O2181 +)
  28560. Retracting elaborate*reward*based*on*reward
  28561. -->
  28562. (R1094 ^value 1 +)
  28563. (R1 ^reward R1094 +)
  28564. Retracting elaborate*copy-dir-to-output-link
  28565. -->
  28566. (I3 ^dir R +)
  28567. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  28568. -->
  28569. (S1 ^operator O2182 = 0.8730232122106774)
  28570. Retracting rl*prefer*rvt*predict-no*H0*4
  28571. -->
  28572. (S1 ^operator O2182 = 0.1269768275936353)
  28573. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  28574. -->
  28575. (S1 ^operator O2181 = 0.08783148430849691)
  28576. Retracting rl*prefer*rvt*predict-yes*H0*3
  28577. -->
  28578. (S1 ^operator O2181 = 0.3829435737675308)
  28579. =>WM: (15344: S1 ^operator O2184 +)
  28580. =>WM: (15343: S1 ^operator O2183 +)
  28581. =>WM: (15342: I3 ^dir U)
  28582. =>WM: (15341: O2184 ^name predict-no)
  28583. =>WM: (15340: O2183 ^name predict-yes)
  28584. =>WM: (15339: R1095 ^value 1)
  28585. =>WM: (15338: R1 ^reward R1095)
  28586. =>WM: (15337: I3 ^see 0)
  28587. <=WM: (15328: S1 ^operator O2181 +)
  28588. <=WM: (15329: S1 ^operator O2182 +)
  28589. <=WM: (15330: S1 ^operator O2182)
  28590. <=WM: (15313: I3 ^dir R)
  28591. <=WM: (15324: R1 ^reward R1094)
  28592. <=WM: (15323: I3 ^see 1)
  28593. <=WM: (15327: O2182 ^name predict-no)
  28594. <=WM: (15326: O2181 ^name predict-yes)
  28595. <=WM: (15325: R1094 ^value 1)
  28596. --- Inner Elaboration Phase, active level 1 (S1) ---
  28597. Firing prefer*rvt*predict-yes*H0
  28598. -->
  28599. Firing rl*prefer*rvt*predict-yes*H0*5
  28600. -->
  28601. (S1 ^operator O2183 = 0.)
  28602. Firing prefer*rvt*predict-no*H0
  28603. -->
  28604. Firing rl*prefer*rvt*predict-no*H0*6
  28605. -->
  28606. (S1 ^operator O2184 = 0.9999999999999999)
  28607. inner elaboration loop at bottom goal.
  28608. Retracting rl*prefer*rvt*predict-no*H0*6
  28609. -->
  28610. (S1 ^operator O2182 = 0.9999999999999999)
  28611. Retracting rl*prefer*rvt*predict-yes*H0*5
  28612. -->
  28613. (S1 ^operator O2181 = 0.)
  28614. --- END Proposal Phase ---
  28615. --- Decision Phase ---
  28616. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.954774,0.0433988)
  28617. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  28618. =>WM: (15345: S1 ^operator O2184)
  28619. 1092: O: O2184 (predict-no)
  28620. --- END Decision Phase ---
  28621. --- Application Phase ---
  28622. --- Firing Productions (PE) For State At Depth 1 ---
  28623. --- Inner Elaboration Phase, active level 1 (S1) ---
  28624. Firing apply*operator
  28625. -->
  28626. (I3 ^predict-no N1092 + :O )
  28627. Firing apply*operator*complete
  28628. -->
  28629. (I3 ^predict-no N1091 - :O )
  28630. inner elaboration loop at bottom goal.
  28631. --- Change Working Memory (PE) ---
  28632. =>WM: (15346: I3 ^predict-no N1092)
  28633. <=WM: (15332: N1091 ^status complete)
  28634. <=WM: (15331: I3 ^predict-no N1091)
  28635. --- Firing Productions (IE) For State At Depth 1 ---
  28636. --- Inner Elaboration Phase, active level 1 (S1) ---
  28637. Firing monitor*world
  28638. -->
  28639. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28640. --- Change Working Memory (IE) ---
  28641. --- END Application Phase ---
  28642. --- Output Phase ---
  28643. ENV: Agent did: predict-no for direction U in state State-B
  28644. In State-B moving U
  28645. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28646. predict error 0
  28647. dir: dir isL
  28648. --- END Output Phase ---
  28649. /|\--- Input Phase ---
  28650. =>WM: (15350: I2 ^dir L)
  28651. =>WM: (15349: I2 ^reward 1)
  28652. =>WM: (15348: I2 ^see 0)
  28653. =>WM: (15347: N1092 ^status complete)
  28654. <=WM: (15335: I2 ^dir U)
  28655. <=WM: (15334: I2 ^reward 1)
  28656. <=WM: (15333: I2 ^see 0)
  28657. =>WM: (15351: I2 ^level-1 R0-root)
  28658. <=WM: (15336: I2 ^level-1 R0-root)
  28659. --- END Input Phase ---
  28660. --- Proposal Phase ---
  28661. --- Inner Elaboration Phase, active level 1 (S1) ---
  28662. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  28663. -->
  28664. (S1 ^operator O2183 = 0.4768809417268593)
  28665. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  28666. -->
  28667. (S1 ^operator O2184 = 0.1700769046561409)
  28668. Firing prefer*rvt*predict-no*H0*2*H1
  28669. -->
  28670. Firing prefer*rvt*predict-yes*H0*1*H1
  28671. -->
  28672. Firing elaborate*copy-see-to-output-link
  28673. -->
  28674. (I3 ^see 0 +)
  28675. Firing elaborate*reward*based*on*reward
  28676. -->
  28677. (R1096 ^value 1 +)
  28678. (R1 ^reward R1096 +)
  28679. Firing propose*predict-yes
  28680. -->
  28681. (O2185 ^name predict-yes +)
  28682. (S1 ^operator O2185 +)
  28683. Firing propose*predict-no
  28684. -->
  28685. (O2186 ^name predict-no +)
  28686. (S1 ^operator O2186 +)
  28687. Firing rl*prefer*rvt*predict-no*H0*2
  28688. -->
  28689. (S1 ^operator O2184 = 0.2550133863879636)
  28690. Firing rl*prefer*rvt*predict-yes*H0*1
  28691. -->
  28692. (S1 ^operator O2183 = 0.5231192134786394)
  28693. Firing prefer*rvt*predict-yes*H0
  28694. -->
  28695. Firing prefer*rvt*predict-no*H0
  28696. -->
  28697. Firing elaborate*copy-dir-to-output-link
  28698. -->
  28699. (I3 ^dir L +)
  28700. inner elaboration loop at bottom goal.
  28701. Retracting elaborate*copy-see-to-output-link
  28702. -->
  28703. (I3 ^see 0 +)
  28704. Retracting propose*predict-no
  28705. -->
  28706. (O2184 ^name predict-no +)
  28707. (S1 ^operator O2184 +)
  28708. Retracting propose*predict-yes
  28709. -->
  28710. (O2183 ^name predict-yes +)
  28711. (S1 ^operator O2183 +)
  28712. Retracting elaborate*reward*based*on*reward
  28713. -->
  28714. (R1095 ^value 1 +)
  28715. (R1 ^reward R1095 +)
  28716. Retracting elaborate*copy-dir-to-output-link
  28717. -->
  28718. (I3 ^dir U +)
  28719. Retracting rl*prefer*rvt*predict-no*H0*6
  28720. -->
  28721. (S1 ^operator O2184 = 0.9999999999999999)
  28722. Retracting rl*prefer*rvt*predict-yes*H0*5
  28723. -->
  28724. (S1 ^operator O2183 = 0.)
  28725. =>WM: (15358: S1 ^operator O2186 +)
  28726. =>WM: (15357: S1 ^operator O2185 +)
  28727. =>WM: (15356: I3 ^dir L)
  28728. =>WM: (15355: O2186 ^name predict-no)
  28729. =>WM: (15354: O2185 ^name predict-yes)
  28730. =>WM: (15353: R1096 ^value 1)
  28731. =>WM: (15352: R1 ^reward R1096)
  28732. <=WM: (15343: S1 ^operator O2183 +)
  28733. <=WM: (15344: S1 ^operator O2184 +)
  28734. <=WM: (15345: S1 ^operator O2184)
  28735. <=WM: (15342: I3 ^dir U)
  28736. <=WM: (15338: R1 ^reward R1095)
  28737. <=WM: (15341: O2184 ^name predict-no)
  28738. <=WM: (15340: O2183 ^name predict-yes)
  28739. <=WM: (15339: R1095 ^value 1)
  28740. --- Inner Elaboration Phase, active level 1 (S1) ---
  28741. Firing prefer*rvt*predict-yes*H0
  28742. -->
  28743. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  28744. -->
  28745. (S1 ^operator O2185 = 0.4768809417268593)
  28746. Firing rl*prefer*rvt*predict-yes*H0*1
  28747. -->
  28748. (S1 ^operator O2185 = 0.5231192134786394)
  28749. Firing prefer*rvt*predict-yes*H0*1*H1
  28750. -->
  28751. Firing prefer*rvt*predict-no*H0
  28752. -->
  28753. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  28754. -->
  28755. (S1 ^operator O2186 = 0.1700769046561409)
  28756. Firing rl*prefer*rvt*predict-no*H0*2
  28757. -->
  28758. (S1 ^operator O2186 = 0.2550133863879636)
  28759. Firing prefer*rvt*predict-no*H0*2*H1
  28760. -->
  28761. inner elaboration loop at bottom goal.
  28762. Retracting rl*prefer*rvt*predict-no*H0*2
  28763. -->
  28764. (S1 ^operator O2184 = 0.2550133863879636)
  28765. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  28766. -->
  28767. (S1 ^operator O2184 = 0.1700769046561409)
  28768. Retracting rl*prefer*rvt*predict-yes*H0*1
  28769. -->
  28770. (S1 ^operator O2183 = 0.5231192134786394)
  28771. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  28772. -->
  28773. (S1 ^operator O2183 = 0.4768809417268593)
  28774. --- END Proposal Phase ---
  28775. --- Decision Phase ---
  28776. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28777. =>WM: (15359: S1 ^operator O2185)
  28778. 1093: O: O2185 (predict-yes)
  28779. --- END Decision Phase ---
  28780. --- Application Phase ---
  28781. --- Firing Productions (PE) For State At Depth 1 ---
  28782. --- Inner Elaboration Phase, active level 1 (S1) ---
  28783. Firing apply*operator
  28784. -->
  28785. (I3 ^predict-yes N1093 + :O )
  28786. Firing apply*operator*complete
  28787. -->
  28788. (I3 ^predict-no N1092 - :O )
  28789. inner elaboration loop at bottom goal.
  28790. --- Change Working Memory (PE) ---
  28791. =>WM: (15360: I3 ^predict-yes N1093)
  28792. <=WM: (15347: N1092 ^status complete)
  28793. <=WM: (15346: I3 ^predict-no N1092)
  28794. --- Firing Productions (IE) For State At Depth 1 ---
  28795. --- Inner Elaboration Phase, active level 1 (S1) ---
  28796. Firing monitor*world
  28797. -->
  28798. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28799. --- Change Working Memory (IE) ---
  28800. --- END Application Phase ---
  28801. --- Output Phase ---
  28802. ENV: Agent did: predict-yes for direction L in state State-B
  28803. In State-B moving L
  28804. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28805. predict error 0
  28806. dir: dir isL
  28807. --- END Output Phase ---
  28808. -/|--- Input Phase ---
  28809. =>WM: (15364: I2 ^dir L)
  28810. =>WM: (15363: I2 ^reward 1)
  28811. =>WM: (15362: I2 ^see 1)
  28812. =>WM: (15361: N1093 ^status complete)
  28813. <=WM: (15350: I2 ^dir L)
  28814. <=WM: (15349: I2 ^reward 1)
  28815. <=WM: (15348: I2 ^see 0)
  28816. =>WM: (15365: I2 ^level-1 L1-root)
  28817. <=WM: (15351: I2 ^level-1 R0-root)
  28818. --- END Input Phase ---
  28819. --- Proposal Phase ---
  28820. --- Inner Elaboration Phase, active level 1 (S1) ---
  28821. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  28822. -->
  28823. (S1 ^operator O2185 = 0.1693592933936033)
  28824. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  28825. -->
  28826. (S1 ^operator O2186 = 0.7449865344888057)
  28827. Firing prefer*rvt*predict-no*H0*2*H1
  28828. -->
  28829. Firing prefer*rvt*predict-yes*H0*1*H1
  28830. -->
  28831. Firing elaborate*copy-see-to-output-link
  28832. -->
  28833. (I3 ^see 1 +)
  28834. Firing elaborate*reward*based*on*reward
  28835. -->
  28836. (R1097 ^value 1 +)
  28837. (R1 ^reward R1097 +)
  28838. Firing propose*predict-yes
  28839. -->
  28840. (O2187 ^name predict-yes +)
  28841. (S1 ^operator O2187 +)
  28842. Firing propose*predict-no
  28843. -->
  28844. (O2188 ^name predict-no +)
  28845. (S1 ^operator O2188 +)
  28846. Firing rl*prefer*rvt*predict-no*H0*2
  28847. -->
  28848. (S1 ^operator O2186 = 0.2550133863879636)
  28849. Firing rl*prefer*rvt*predict-yes*H0*1
  28850. -->
  28851. (S1 ^operator O2185 = 0.5231192134786394)
  28852. Firing prefer*rvt*predict-yes*H0
  28853. -->
  28854. Firing prefer*rvt*predict-no*H0
  28855. -->
  28856. Firing elaborate*copy-dir-to-output-link
  28857. -->
  28858. (I3 ^dir L +)
  28859. inner elaboration loop at bottom goal.
  28860. Retracting elaborate*copy-see-to-output-link
  28861. -->
  28862. (I3 ^see 0 +)
  28863. Retracting propose*predict-no
  28864. -->
  28865. (O2186 ^name predict-no +)
  28866. (S1 ^operator O2186 +)
  28867. Retracting propose*predict-yes
  28868. -->
  28869. (O2185 ^name predict-yes +)
  28870. (S1 ^operator O2185 +)
  28871. Retracting elaborate*reward*based*on*reward
  28872. -->
  28873. (R1096 ^value 1 +)
  28874. (R1 ^reward R1096 +)
  28875. Retracting elaborate*copy-dir-to-output-link
  28876. -->
  28877. (I3 ^dir L +)
  28878. Retracting rl*prefer*rvt*predict-no*H0*2
  28879. -->
  28880. (S1 ^operator O2186 = 0.2550133863879636)
  28881. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  28882. -->
  28883. (S1 ^operator O2186 = 0.1700769046561409)
  28884. Retracting rl*prefer*rvt*predict-yes*H0*1
  28885. -->
  28886. (S1 ^operator O2185 = 0.5231192134786394)
  28887. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  28888. -->
  28889. (S1 ^operator O2185 = 0.4768809417268593)
  28890. =>WM: (15372: S1 ^operator O2188 +)
  28891. =>WM: (15371: S1 ^operator O2187 +)
  28892. =>WM: (15370: O2188 ^name predict-no)
  28893. =>WM: (15369: O2187 ^name predict-yes)
  28894. =>WM: (15368: R1097 ^value 1)
  28895. =>WM: (15367: R1 ^reward R1097)
  28896. =>WM: (15366: I3 ^see 1)
  28897. <=WM: (15357: S1 ^operator O2185 +)
  28898. <=WM: (15359: S1 ^operator O2185)
  28899. <=WM: (15358: S1 ^operator O2186 +)
  28900. <=WM: (15352: R1 ^reward R1096)
  28901. <=WM: (15337: I3 ^see 0)
  28902. <=WM: (15355: O2186 ^name predict-no)
  28903. <=WM: (15354: O2185 ^name predict-yes)
  28904. <=WM: (15353: R1096 ^value 1)
  28905. --- Inner Elaboration Phase, active level 1 (S1) ---
  28906. Firing prefer*rvt*predict-yes*H0
  28907. -->
  28908. Firing rl*prefer*rvt*predict-yes*H0*1
  28909. -->
  28910. (S1 ^operator O2187 = 0.5231192134786394)
  28911. Firing prefer*rvt*predict-yes*H0*1*H1
  28912. -->
  28913. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  28914. -->
  28915. (S1 ^operator O2187 = 0.1693592933936033)
  28916. Firing prefer*rvt*predict-no*H0
  28917. -->
  28918. Firing rl*prefer*rvt*predict-no*H0*2
  28919. -->
  28920. (S1 ^operator O2188 = 0.2550133863879636)
  28921. Firing prefer*rvt*predict-no*H0*2*H1
  28922. -->
  28923. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  28924. -->
  28925. (S1 ^operator O2188 = 0.7449865344888057)
  28926. inner elaboration loop at bottom goal.
  28927. Retracting rl*prefer*rvt*predict-no*H0*2
  28928. -->
  28929. (S1 ^operator O2186 = 0.2550133863879636)
  28930. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  28931. -->
  28932. (S1 ^operator O2186 = 0.7449865344888057)
  28933. Retracting rl*prefer*rvt*predict-yes*H0*1
  28934. -->
  28935. (S1 ^operator O2185 = 0.5231192134786394)
  28936. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  28937. -->
  28938. (S1 ^operator O2185 = 0.1693592933936033)
  28939. --- END Proposal Phase ---
  28940. --- Decision Phase ---
  28941. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.980892,0.0188633)
  28942. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272041 0.20484 0.476881 -> 0.272041 0.20484 0.476881(R,m,v=1,1,0)
  28943. =>WM: (15373: S1 ^operator O2188)
  28944. 1094: O: O2188 (predict-no)
  28945. --- END Decision Phase ---
  28946. --- Application Phase ---
  28947. --- Firing Productions (PE) For State At Depth 1 ---
  28948. --- Inner Elaboration Phase, active level 1 (S1) ---
  28949. Firing apply*operator
  28950. -->
  28951. (I3 ^predict-no N1094 + :O )
  28952. Firing apply*operator*complete
  28953. -->
  28954. (I3 ^predict-yes N1093 - :O )
  28955. inner elaboration loop at bottom goal.
  28956. --- Change Working Memory (PE) ---
  28957. =>WM: (15374: I3 ^predict-no N1094)
  28958. <=WM: (15361: N1093 ^status complete)
  28959. <=WM: (15360: I3 ^predict-yes N1093)
  28960. --- Firing Productions (IE) For State At Depth 1 ---
  28961. --- Inner Elaboration Phase, active level 1 (S1) ---
  28962. Firing monitor*world
  28963. -->
  28964. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28965. --- Change Working Memory (IE) ---
  28966. --- END Application Phase ---
  28967. --- Output Phase ---
  28968. ENV: Agent did: predict-no for direction L in state State-A
  28969. In State-A moving L
  28970. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28971. predict error 0
  28972. dir: dir isU
  28973. --- END Output Phase ---
  28974. \-/--- Input Phase ---
  28975. =>WM: (15378: I2 ^dir U)
  28976. =>WM: (15377: I2 ^reward 1)
  28977. =>WM: (15376: I2 ^see 0)
  28978. =>WM: (15375: N1094 ^status complete)
  28979. <=WM: (15364: I2 ^dir L)
  28980. <=WM: (15363: I2 ^reward 1)
  28981. <=WM: (15362: I2 ^see 1)
  28982. =>WM: (15379: I2 ^level-1 L0-root)
  28983. <=WM: (15365: I2 ^level-1 L1-root)
  28984. --- END Input Phase ---
  28985. --- Proposal Phase ---
  28986. --- Inner Elaboration Phase, active level 1 (S1) ---
  28987. Firing elaborate*copy-see-to-output-link
  28988. -->
  28989. (I3 ^see 0 +)
  28990. Firing elaborate*reward*based*on*reward
  28991. -->
  28992. (R1098 ^value 1 +)
  28993. (R1 ^reward R1098 +)
  28994. Firing propose*predict-yes
  28995. -->
  28996. (O2189 ^name predict-yes +)
  28997. (S1 ^operator O2189 +)
  28998. Firing propose*predict-no
  28999. -->
  29000. (O2190 ^name predict-no +)
  29001. (S1 ^operator O2190 +)
  29002. Firing rl*prefer*rvt*predict-no*H0*6
  29003. -->
  29004. (S1 ^operator O2188 = 0.9999999999999999)
  29005. Firing rl*prefer*rvt*predict-yes*H0*5
  29006. -->
  29007. (S1 ^operator O2187 = 0.)
  29008. Firing prefer*rvt*predict-yes*H0
  29009. -->
  29010. Firing prefer*rvt*predict-no*H0
  29011. -->
  29012. Firing elaborate*copy-dir-to-output-link
  29013. -->
  29014. (I3 ^dir U +)
  29015. inner elaboration loop at bottom goal.
  29016. Retracting elaborate*copy-see-to-output-link
  29017. -->
  29018. (I3 ^see 1 +)
  29019. Retracting propose*predict-no
  29020. -->
  29021. (O2188 ^name predict-no +)
  29022. (S1 ^operator O2188 +)
  29023. Retracting propose*predict-yes
  29024. -->
  29025. (O2187 ^name predict-yes +)
  29026. (S1 ^operator O2187 +)
  29027. Retracting elaborate*reward*based*on*reward
  29028. -->
  29029. (R1097 ^value 1 +)
  29030. (R1 ^reward R1097 +)
  29031. Retracting elaborate*copy-dir-to-output-link
  29032. -->
  29033. (I3 ^dir L +)
  29034. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  29035. -->
  29036. (S1 ^operator O2188 = 0.7449865344888057)
  29037. Retracting rl*prefer*rvt*predict-no*H0*2
  29038. -->
  29039. (S1 ^operator O2188 = 0.2550133863879636)
  29040. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  29041. -->
  29042. (S1 ^operator O2187 = 0.1693592933936033)
  29043. Retracting rl*prefer*rvt*predict-yes*H0*1
  29044. -->
  29045. (S1 ^operator O2187 = 0.5231191901978147)
  29046. =>WM: (15387: S1 ^operator O2190 +)
  29047. =>WM: (15386: S1 ^operator O2189 +)
  29048. =>WM: (15385: I3 ^dir U)
  29049. =>WM: (15384: O2190 ^name predict-no)
  29050. =>WM: (15383: O2189 ^name predict-yes)
  29051. =>WM: (15382: R1098 ^value 1)
  29052. =>WM: (15381: R1 ^reward R1098)
  29053. =>WM: (15380: I3 ^see 0)
  29054. <=WM: (15371: S1 ^operator O2187 +)
  29055. <=WM: (15372: S1 ^operator O2188 +)
  29056. <=WM: (15373: S1 ^operator O2188)
  29057. <=WM: (15356: I3 ^dir L)
  29058. <=WM: (15367: R1 ^reward R1097)
  29059. <=WM: (15366: I3 ^see 1)
  29060. <=WM: (15370: O2188 ^name predict-no)
  29061. <=WM: (15369: O2187 ^name predict-yes)
  29062. <=WM: (15368: R1097 ^value 1)
  29063. --- Inner Elaboration Phase, active level 1 (S1) ---
  29064. Firing prefer*rvt*predict-yes*H0
  29065. -->
  29066. Firing rl*prefer*rvt*predict-yes*H0*5
  29067. -->
  29068. (S1 ^operator O2189 = 0.)
  29069. Firing prefer*rvt*predict-no*H0
  29070. -->
  29071. Firing rl*prefer*rvt*predict-no*H0*6
  29072. -->
  29073. (S1 ^operator O2190 = 0.9999999999999999)
  29074. inner elaboration loop at bottom goal.
  29075. Retracting rl*prefer*rvt*predict-no*H0*6
  29076. -->
  29077. (S1 ^operator O2188 = 0.9999999999999999)
  29078. Retracting rl*prefer*rvt*predict-yes*H0*5
  29079. -->
  29080. (S1 ^operator O2187 = 0.)
  29081. --- END Proposal Phase ---
  29082. --- Decision Phase ---
  29083. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.92233,0.0719867)
  29084. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  29085. =>WM: (15388: S1 ^operator O2190)
  29086. 1095: O: O2190 (predict-no)
  29087. --- END Decision Phase ---
  29088. --- Application Phase ---
  29089. --- Firing Productions (PE) For State At Depth 1 ---
  29090. --- Inner Elaboration Phase, active level 1 (S1) ---
  29091. Firing apply*operator
  29092. -->
  29093. (I3 ^predict-no N1095 + :O )
  29094. Firing apply*operator*complete
  29095. -->
  29096. (I3 ^predict-no N1094 - :O )
  29097. inner elaboration loop at bottom goal.
  29098. --- Change Working Memory (PE) ---
  29099. =>WM: (15389: I3 ^predict-no N1095)
  29100. <=WM: (15375: N1094 ^status complete)
  29101. <=WM: (15374: I3 ^predict-no N1094)
  29102. --- Firing Productions (IE) For State At Depth 1 ---
  29103. --- Inner Elaboration Phase, active level 1 (S1) ---
  29104. Firing monitor*world
  29105. -->
  29106. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29107. --- Change Working Memory (IE) ---
  29108. --- END Application Phase ---
  29109. --- Output Phase ---
  29110. ENV: Agent did: predict-no for direction U in state State-A
  29111. In State-A moving U
  29112. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29113. predict error 0
  29114. dir: dir isR
  29115. --- END Output Phase ---
  29116. --- Input Phase ---
  29117. =>WM: (15393: I2 ^dir R)
  29118. =>WM: (15392: I2 ^reward 1)
  29119. =>WM: (15391: I2 ^see 0)
  29120. =>WM: (15390: N1095 ^status complete)
  29121. <=WM: (15378: I2 ^dir U)
  29122. <=WM: (15377: I2 ^reward 1)
  29123. <=WM: (15376: I2 ^see 0)
  29124. =>WM: (15394: I2 ^level-1 L0-root)
  29125. <=WM: (15379: I2 ^level-1 L0-root)
  29126. --- END Input Phase ---
  29127. --- Proposal Phase ---
  29128. --- Inner Elaboration Phase, active level 1 (S1) ---
  29129. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  29130. -->
  29131. (S1 ^operator O2189 = 0.617065960338947)
  29132. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  29133. -->
  29134. (S1 ^operator O2190 = 0.4910065094545203)
  29135. Firing prefer*rvt*predict-no*H0*4*H1
  29136. -->
  29137. Firing prefer*rvt*predict-yes*H0*3*H1
  29138. -->
  29139. Firing elaborate*copy-see-to-output-link
  29140. -->
  29141. (I3 ^see 0 +)
  29142. Firing elaborate*reward*based*on*reward
  29143. -->
  29144. (R1099 ^value 1 +)
  29145. (R1 ^reward R1099 +)
  29146. Firing propose*predict-yes
  29147. -->
  29148. (O2191 ^name predict-yes +)
  29149. (S1 ^operator O2191 +)
  29150. Firing propose*predict-no
  29151. -->
  29152. (O2192 ^name predict-no +)
  29153. (S1 ^operator O2192 +)
  29154. Firing rl*prefer*rvt*predict-no*H0*4
  29155. -->
  29156. (S1 ^operator O2190 = 0.1269768216229884)
  29157. Firing rl*prefer*rvt*predict-yes*H0*3
  29158. -->
  29159. (S1 ^operator O2189 = 0.3829435737675308)
  29160. Firing prefer*rvt*predict-yes*H0
  29161. -->
  29162. Firing prefer*rvt*predict-no*H0
  29163. -->
  29164. Firing elaborate*copy-dir-to-output-link
  29165. -->
  29166. (I3 ^dir R +)
  29167. inner elaboration loop at bottom goal.
  29168. Retracting elaborate*copy-see-to-output-link
  29169. -->
  29170. (I3 ^see 0 +)
  29171. Retracting propose*predict-no
  29172. -->
  29173. (O2190 ^name predict-no +)
  29174. (S1 ^operator O2190 +)
  29175. Retracting propose*predict-yes
  29176. -->
  29177. (O2189 ^name predict-yes +)
  29178. (S1 ^operator O2189 +)
  29179. Retracting elaborate*reward*based*on*reward
  29180. -->
  29181. (R1098 ^value 1 +)
  29182. (R1 ^reward R1098 +)
  29183. Retracting elaborate*copy-dir-to-output-link
  29184. -->
  29185. (I3 ^dir U +)
  29186. Retracting rl*prefer*rvt*predict-no*H0*6
  29187. -->
  29188. (S1 ^operator O2190 = 0.9999999999999999)
  29189. Retracting rl*prefer*rvt*predict-yes*H0*5
  29190. -->
  29191. (S1 ^operator O2189 = 0.)
  29192. =>WM: (15401: S1 ^operator O2192 +)
  29193. =>WM: (15400: S1 ^operator O2191 +)
  29194. =>WM: (15399: I3 ^dir R)
  29195. =>WM: (15398: O2192 ^name predict-no)
  29196. =>WM: (15397: O2191 ^name predict-yes)
  29197. =>WM: (15396: R1099 ^value 1)
  29198. =>WM: (15395: R1 ^reward R1099)
  29199. <=WM: (15386: S1 ^operator O2189 +)
  29200. <=WM: (15387: S1 ^operator O2190 +)
  29201. <=WM: (15388: S1 ^operator O2190)
  29202. <=WM: (15385: I3 ^dir U)
  29203. <=WM: (15381: R1 ^reward R1098)
  29204. <=WM: (15384: O2190 ^name predict-no)
  29205. <=WM: (15383: O2189 ^name predict-yes)
  29206. <=WM: (15382: R1098 ^value 1)
  29207. --- Inner Elaboration Phase, active level 1 (S1) ---
  29208. Firing prefer*rvt*predict-yes*H0
  29209. -->
  29210. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  29211. -->
  29212. (S1 ^operator O2191 = 0.617065960338947)
  29213. Firing rl*prefer*rvt*predict-yes*H0*3
  29214. -->
  29215. (S1 ^operator O2191 = 0.3829435737675308)
  29216. Firing prefer*rvt*predict-yes*H0*3*H1
  29217. -->
  29218. Firing prefer*rvt*predict-no*H0
  29219. -->
  29220. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  29221. -->
  29222. (S1 ^operator O2192 = 0.4910065094545203)
  29223. Firing rl*prefer*rvt*predict-no*H0*4
  29224. -->
  29225. (S1 ^operator O2192 = 0.1269768216229884)
  29226. Firing prefer*rvt*predict-no*H0*4*H1
  29227. -->
  29228. inner elaboration loop at bottom goal.
  29229. Retracting rl*prefer*rvt*predict-no*H0*4
  29230. -->
  29231. (S1 ^operator O2190 = 0.1269768216229884)
  29232. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  29233. -->
  29234. (S1 ^operator O2190 = 0.4910065094545203)
  29235. Retracting rl*prefer*rvt*predict-yes*H0*3
  29236. -->
  29237. (S1 ^operator O2189 = 0.3829435737675308)
  29238. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  29239. -->
  29240. (S1 ^operator O2189 = 0.617065960338947)
  29241. --- END Proposal Phase ---
  29242. --- Decision Phase ---
  29243. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29244. =>WM: (15402: S1 ^operator O2191)
  29245. 1096: O: O2191 (predict-yes)
  29246. --- END Decision Phase ---
  29247. --- Application Phase ---
  29248. --- Firing Productions (PE) For State At Depth 1 ---
  29249. --- Inner Elaboration Phase, active level 1 (S1) ---
  29250. Firing apply*operator
  29251. -->
  29252. (I3 ^predict-yes N1096 + :O )
  29253. Firing apply*operator*complete
  29254. -->
  29255. (I3 ^predict-no N1095 - :O )
  29256. inner elaboration loop at bottom goal.
  29257. --- Change Working Memory (PE) ---
  29258. =>WM: (15403: I3 ^predict-yes N1096)
  29259. <=WM: (15390: N1095 ^status complete)
  29260. <=WM: (15389: I3 ^predict-no N1095)
  29261. --- Firing Productions (IE) For State At Depth 1 ---
  29262. --- Inner Elaboration Phase, active level 1 (S1) ---
  29263. Firing monitor*world
  29264. -->
  29265. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29266. --- Change Working Memory (IE) ---
  29267. --- END Application Phase ---
  29268. --- Output Phase ---
  29269. ENV: Agent did: predict-yes for direction R in state State-A
  29270. In State-A moving R
  29271. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  29272. predict error 0
  29273. dir: dir isL
  29274. --- END Output Phase ---
  29275. --- Input Phase ---
  29276. =>WM: (15407: I2 ^dir L)
  29277. =>WM: (15406: I2 ^reward 1)
  29278. =>WM: (15405: I2 ^see 1)
  29279. =>WM: (15404: N1096 ^status complete)
  29280. <=WM: (15393: I2 ^dir R)
  29281. <=WM: (15392: I2 ^reward 1)
  29282. <=WM: (15391: I2 ^see 0)
  29283. =>WM: (15408: I2 ^level-1 R1-root)
  29284. <=WM: (15394: I2 ^level-1 L0-root)
  29285. --- END Input Phase ---
  29286. --- Proposal Phase ---
  29287. --- Inner Elaboration Phase, active level 1 (S1) ---
  29288. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  29289. -->
  29290. (S1 ^operator O2191 = 0.4768789783581771)
  29291. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  29292. -->
  29293. (S1 ^operator O2192 = -0.01194930198035649)
  29294. Firing prefer*rvt*predict-no*H0*2*H1
  29295. -->
  29296. Firing prefer*rvt*predict-yes*H0*1*H1
  29297. -->
  29298. Firing elaborate*copy-see-to-output-link
  29299. -->
  29300. (I3 ^see 1 +)
  29301. Firing elaborate*reward*based*on*reward
  29302. -->
  29303. (R1100 ^value 1 +)
  29304. (R1 ^reward R1100 +)
  29305. Firing propose*predict-yes
  29306. -->
  29307. (O2193 ^name predict-yes +)
  29308. (S1 ^operator O2193 +)
  29309. Firing propose*predict-no
  29310. -->
  29311. (O2194 ^name predict-no +)
  29312. (S1 ^operator O2194 +)
  29313. Firing rl*prefer*rvt*predict-no*H0*2
  29314. -->
  29315. (S1 ^operator O2192 = 0.2550133982564481)
  29316. Firing rl*prefer*rvt*predict-yes*H0*1
  29317. -->
  29318. (S1 ^operator O2191 = 0.5231191901978147)
  29319. Firing prefer*rvt*predict-yes*H0
  29320. -->
  29321. Firing prefer*rvt*predict-no*H0
  29322. -->
  29323. Firing elaborate*copy-dir-to-output-link
  29324. -->
  29325. (I3 ^dir L +)
  29326. inner elaboration loop at bottom goal.
  29327. Retracting elaborate*copy-see-to-output-link
  29328. -->
  29329. (I3 ^see 0 +)
  29330. Retracting propose*predict-no
  29331. -->
  29332. (O2192 ^name predict-no +)
  29333. (S1 ^operator O2192 +)
  29334. Retracting propose*predict-yes
  29335. -->
  29336. (O2191 ^name predict-yes +)
  29337. (S1 ^operator O2191 +)
  29338. Retracting elaborate*reward*based*on*reward
  29339. -->
  29340. (R1099 ^value 1 +)
  29341. (R1 ^reward R1099 +)
  29342. Retracting elaborate*copy-dir-to-output-link
  29343. -->
  29344. (I3 ^dir R +)
  29345. Retracting rl*prefer*rvt*predict-no*H0*4
  29346. -->
  29347. (S1 ^operator O2192 = 0.1269768216229884)
  29348. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  29349. -->
  29350. (S1 ^operator O2192 = 0.4910065094545203)
  29351. Retracting rl*prefer*rvt*predict-yes*H0*3
  29352. -->
  29353. (S1 ^operator O2191 = 0.3829435737675308)
  29354. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  29355. -->
  29356. (S1 ^operator O2191 = 0.617065960338947)
  29357. =>WM: (15416: S1 ^operator O2194 +)
  29358. =>WM: (15415: S1 ^operator O2193 +)
  29359. =>WM: (15414: I3 ^dir L)
  29360. =>WM: (15413: O2194 ^name predict-no)
  29361. =>WM: (15412: O2193 ^name predict-yes)
  29362. =>WM: (15411: R1100 ^value 1)
  29363. =>WM: (15410: R1 ^reward R1100)
  29364. =>WM: (15409: I3 ^see 1)
  29365. <=WM: (15400: S1 ^operator O2191 +)
  29366. <=WM: (15402: S1 ^operator O2191)
  29367. <=WM: (15401: S1 ^operator O2192 +)
  29368. <=WM: (15399: I3 ^dir R)
  29369. <=WM: (15395: R1 ^reward R1099)
  29370. <=WM: (15380: I3 ^see 0)
  29371. <=WM: (15398: O2192 ^name predict-no)
  29372. <=WM: (15397: O2191 ^name predict-yes)
  29373. <=WM: (15396: R1099 ^value 1)
  29374. --- Inner Elaboration Phase, active level 1 (S1) ---
  29375. Firing prefer*rvt*predict-yes*H0
  29376. -->
  29377. Firing rl*prefer*rvt*predict-yes*H0*1
  29378. -->
  29379. (S1 ^operator O2193 = 0.5231191901978147)
  29380. Firing prefer*rvt*predict-yes*H0*1*H1
  29381. -->
  29382. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  29383. -->
  29384. (S1 ^operator O2193 = 0.4768789783581771)
  29385. Firing prefer*rvt*predict-no*H0
  29386. -->
  29387. Firing rl*prefer*rvt*predict-no*H0*2
  29388. -->
  29389. (S1 ^operator O2194 = 0.2550133982564481)
  29390. Firing prefer*rvt*predict-no*H0*2*H1
  29391. -->
  29392. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  29393. -->
  29394. (S1 ^operator O2194 = -0.01194930198035649)
  29395. inner elaboration loop at bottom goal.
  29396. Retracting rl*prefer*rvt*predict-no*H0*2
  29397. -->
  29398. (S1 ^operator O2192 = 0.2550133982564481)
  29399. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  29400. -->
  29401. (S1 ^operator O2192 = -0.01194930198035649)
  29402. Retracting rl*prefer*rvt*predict-yes*H0*1
  29403. -->
  29404. (S1 ^operator O2191 = 0.5231191901978147)
  29405. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  29406. -->
  29407. (S1 ^operator O2191 = 0.4768789783581771)
  29408. --- END Proposal Phase ---
  29409. --- Decision Phase ---
  29410. RL update rl*prefer*rvt*predict-yes*H0*3 0.673136 -0.290193 0.382944 -> 0.673135 -0.290193 0.382942(R,m,v=1,0.964286,0.034645)
  29411. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326872 0.290194 0.617066 -> 0.326871 0.290194 0.617065(R,m,v=1,1,0)
  29412. =>WM: (15417: S1 ^operator O2193)
  29413. 1097: O: O2193 (predict-yes)
  29414. --- END Decision Phase ---
  29415. --- Application Phase ---
  29416. --- Firing Productions (PE) For State At Depth 1 ---
  29417. --- Inner Elaboration Phase, active level 1 (S1) ---
  29418. Firing apply*operator
  29419. -->
  29420. (I3 ^predict-yes N1097 + :O )
  29421. Firing apply*operator*complete
  29422. -->
  29423. (I3 ^predict-yes N1096 - :O )
  29424. inner elaboration loop at bottom goal.
  29425. --- Change Working Memory (PE) ---
  29426. =>WM: (15418: I3 ^predict-yes N1097)
  29427. <=WM: (15404: N1096 ^status complete)
  29428. <=WM: (15403: I3 ^predict-yes N1096)
  29429. --- Firing Productions (IE) For State At Depth 1 ---
  29430. --- Inner Elaboration Phase, active level 1 (S1) ---
  29431. Firing monitor*world
  29432. -->
  29433. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29434. --- Change Working Memory (IE) ---
  29435. --- END Application Phase ---
  29436. --- Output Phase ---
  29437. ENV: Agent did: predict-yes for direction L in state State-B
  29438. In State-B moving L
  29439. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29440. predict error 0
  29441. dir: dir isL
  29442. --- END Output Phase ---
  29443. --- Input Phase ---
  29444. =>WM: (15422: I2 ^dir L)
  29445. =>WM: (15421: I2 ^reward 1)
  29446. =>WM: (15420: I2 ^see 1)
  29447. =>WM: (15419: N1097 ^status complete)
  29448. <=WM: (15407: I2 ^dir L)
  29449. <=WM: (15406: I2 ^reward 1)
  29450. <=WM: (15405: I2 ^see 1)
  29451. =>WM: (15423: I2 ^level-1 L1-root)
  29452. <=WM: (15408: I2 ^level-1 R1-root)
  29453. --- END Input Phase ---
  29454. --- Proposal Phase ---
  29455. --- Inner Elaboration Phase, active level 1 (S1) ---
  29456. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  29457. -->
  29458. (S1 ^operator O2193 = 0.1693592933936033)
  29459. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  29460. -->
  29461. (S1 ^operator O2194 = 0.7449865463572902)
  29462. Firing prefer*rvt*predict-no*H0*2*H1
  29463. -->
  29464. Firing prefer*rvt*predict-yes*H0*1*H1
  29465. -->
  29466. Firing elaborate*copy-see-to-output-link
  29467. -->
  29468. (I3 ^see 1 +)
  29469. Firing elaborate*reward*based*on*reward
  29470. -->
  29471. (R1101 ^value 1 +)
  29472. (R1 ^reward R1101 +)
  29473. Firing propose*predict-yes
  29474. -->
  29475. (O2195 ^name predict-yes +)
  29476. (S1 ^operator O2195 +)
  29477. Firing propose*predict-no
  29478. -->
  29479. (O2196 ^name predict-no +)
  29480. (S1 ^operator O2196 +)
  29481. Firing rl*prefer*rvt*predict-no*H0*2
  29482. -->
  29483. (S1 ^operator O2194 = 0.2550133982564481)
  29484. Firing rl*prefer*rvt*predict-yes*H0*1
  29485. -->
  29486. (S1 ^operator O2193 = 0.5231191901978147)
  29487. Firing prefer*rvt*predict-yes*H0
  29488. -->
  29489. Firing prefer*rvt*predict-no*H0
  29490. -->
  29491. Firing elaborate*copy-dir-to-output-link
  29492. -->
  29493. (I3 ^dir L +)
  29494. inner elaboration loop at bottom goal.
  29495. Retracting elaborate*copy-see-to-output-link
  29496. -->
  29497. (I3 ^see 1 +)
  29498. Retracting propose*predict-no
  29499. -->
  29500. (O2194 ^name predict-no +)
  29501. (S1 ^operator O2194 +)
  29502. Retracting propose*predict-yes
  29503. -->
  29504. (O2193 ^name predict-yes +)
  29505. (S1 ^operator O2193 +)
  29506. Retracting elaborate*reward*based*on*reward
  29507. -->
  29508. (R1100 ^value 1 +)
  29509. (R1 ^reward R1100 +)
  29510. Retracting elaborate*copy-dir-to-output-link
  29511. -->
  29512. (I3 ^dir L +)
  29513. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  29514. -->
  29515. (S1 ^operator O2194 = -0.01194930198035649)
  29516. Retracting rl*prefer*rvt*predict-no*H0*2
  29517. -->
  29518. (S1 ^operator O2194 = 0.2550133982564481)
  29519. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  29520. -->
  29521. (S1 ^operator O2193 = 0.4768789783581771)
  29522. Retracting rl*prefer*rvt*predict-yes*H0*1
  29523. -->
  29524. (S1 ^operator O2193 = 0.5231191901978147)
  29525. =>WM: (15429: S1 ^operator O2196 +)
  29526. =>WM: (15428: S1 ^operator O2195 +)
  29527. =>WM: (15427: O2196 ^name predict-no)
  29528. =>WM: (15426: O2195 ^name predict-yes)
  29529. =>WM: (15425: R1101 ^value 1)
  29530. =>WM: (15424: R1 ^reward R1101)
  29531. <=WM: (15415: S1 ^operator O2193 +)
  29532. <=WM: (15417: S1 ^operator O2193)
  29533. <=WM: (15416: S1 ^operator O2194 +)
  29534. <=WM: (15410: R1 ^reward R1100)
  29535. <=WM: (15413: O2194 ^name predict-no)
  29536. <=WM: (15412: O2193 ^name predict-yes)
  29537. <=WM: (15411: R1100 ^value 1)
  29538. --- Inner Elaboration Phase, active level 1 (S1) ---
  29539. Firing prefer*rvt*predict-yes*H0
  29540. -->
  29541. Firing rl*prefer*rvt*predict-yes*H0*1
  29542. -->
  29543. (S1 ^operator O2195 = 0.5231191901978147)
  29544. Firing prefer*rvt*predict-yes*H0*1*H1
  29545. -->
  29546. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  29547. -->
  29548. (S1 ^operator O2195 = 0.1693592933936033)
  29549. Firing prefer*rvt*predict-no*H0
  29550. -->
  29551. Firing rl*prefer*rvt*predict-no*H0*2
  29552. -->
  29553. (S1 ^operator O2196 = 0.2550133982564481)
  29554. Firing prefer*rvt*predict-no*H0*2*H1
  29555. -->
  29556. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  29557. -->
  29558. (S1 ^operator O2196 = 0.7449865463572902)
  29559. inner elaboration loop at bottom goal.
  29560. Retracting rl*prefer*rvt*predict-no*H0*2
  29561. -->
  29562. (S1 ^operator O2194 = 0.2550133982564481)
  29563. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  29564. -->
  29565. (S1 ^operator O2194 = 0.7449865463572902)
  29566. Retracting rl*prefer*rvt*predict-yes*H0*1
  29567. -->
  29568. (S1 ^operator O2193 = 0.5231191901978147)
  29569. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  29570. -->
  29571. (S1 ^operator O2193 = 0.1693592933936033)
  29572. --- END Proposal Phase ---
  29573. --- Decision Phase ---
  29574. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.523119(R,m,v=1,0.981013,0.0187455)
  29575. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272039 0.20484 0.476879 -> 0.272039 0.20484 0.476879(R,m,v=1,1,0)
  29576. =>WM: (15430: S1 ^operator O2196)
  29577. 1098: O: O2196 (predict-no)
  29578. --- END Decision Phase ---
  29579. --- Application Phase ---
  29580. --- Firing Productions (PE) For State At Depth 1 ---
  29581. --- Inner Elaboration Phase, active level 1 (S1) ---
  29582. Firing apply*operator
  29583. -->
  29584. (I3 ^predict-no N1098 + :O )
  29585. Firing apply*operator*complete
  29586. -->
  29587. (I3 ^predict-yes N1097 - :O )
  29588. inner elaboration loop at bottom goal.
  29589. --- Change Working Memory (PE) ---
  29590. =>WM: (15431: I3 ^predict-no N1098)
  29591. <=WM: (15419: N1097 ^status complete)
  29592. <=WM: (15418: I3 ^predict-yes N1097)
  29593. --- Firing Productions (IE) For State At Depth 1 ---
  29594. --- Inner Elaboration Phase, active level 1 (S1) ---
  29595. Firing monitor*world
  29596. -->
  29597. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29598. --- Change Working Memory (IE) ---
  29599. --- END Application Phase ---
  29600. --- Output Phase ---
  29601. ENV: Agent did: predict-no for direction L in state State-A
  29602. In State-A moving L
  29603. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29604. predict error 0
  29605. dir: dir isR
  29606. --- END Output Phase ---
  29607. --- Input Phase ---
  29608. =>WM: (15435: I2 ^dir R)
  29609. =>WM: (15434: I2 ^reward 1)
  29610. =>WM: (15433: I2 ^see 0)
  29611. =>WM: (15432: N1098 ^status complete)
  29612. <=WM: (15422: I2 ^dir L)
  29613. <=WM: (15421: I2 ^reward 1)
  29614. <=WM: (15420: I2 ^see 1)
  29615. =>WM: (15436: I2 ^level-1 L0-root)
  29616. <=WM: (15423: I2 ^level-1 L1-root)
  29617. --- END Input Phase ---
  29618. --- Proposal Phase ---
  29619. --- Inner Elaboration Phase, active level 1 (S1) ---
  29620. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  29621. -->
  29622. (S1 ^operator O2195 = 0.6170645302229754)
  29623. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  29624. -->
  29625. (S1 ^operator O2196 = 0.4910065094545203)
  29626. Firing prefer*rvt*predict-no*H0*4*H1
  29627. -->
  29628. Firing prefer*rvt*predict-yes*H0*3*H1
  29629. -->
  29630. Firing elaborate*copy-see-to-output-link
  29631. -->
  29632. (I3 ^see 0 +)
  29633. Firing elaborate*reward*based*on*reward
  29634. -->
  29635. (R1102 ^value 1 +)
  29636. (R1 ^reward R1102 +)
  29637. Firing propose*predict-yes
  29638. -->
  29639. (O2197 ^name predict-yes +)
  29640. (S1 ^operator O2197 +)
  29641. Firing propose*predict-no
  29642. -->
  29643. (O2198 ^name predict-no +)
  29644. (S1 ^operator O2198 +)
  29645. Firing rl*prefer*rvt*predict-no*H0*4
  29646. -->
  29647. (S1 ^operator O2196 = 0.1269768216229884)
  29648. Firing rl*prefer*rvt*predict-yes*H0*3
  29649. -->
  29650. (S1 ^operator O2195 = 0.3829421436515592)
  29651. Firing prefer*rvt*predict-yes*H0
  29652. -->
  29653. Firing prefer*rvt*predict-no*H0
  29654. -->
  29655. Firing elaborate*copy-dir-to-output-link
  29656. -->
  29657. (I3 ^dir R +)
  29658. inner elaboration loop at bottom goal.
  29659. Retracting elaborate*copy-see-to-output-link
  29660. -->
  29661. (I3 ^see 1 +)
  29662. Retracting propose*predict-no
  29663. -->
  29664. (O2196 ^name predict-no +)
  29665. (S1 ^operator O2196 +)
  29666. Retracting propose*predict-yes
  29667. -->
  29668. (O2195 ^name predict-yes +)
  29669. (S1 ^operator O2195 +)
  29670. Retracting elaborate*reward*based*on*reward
  29671. -->
  29672. (R1101 ^value 1 +)
  29673. (R1 ^reward R1101 +)
  29674. Retracting elaborate*copy-dir-to-output-link
  29675. -->
  29676. (I3 ^dir L +)
  29677. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  29678. -->
  29679. (S1 ^operator O2196 = 0.7449865463572902)
  29680. Retracting rl*prefer*rvt*predict-no*H0*2
  29681. -->
  29682. (S1 ^operator O2196 = 0.2550133982564481)
  29683. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  29684. -->
  29685. (S1 ^operator O2195 = 0.1693592933936033)
  29686. Retracting rl*prefer*rvt*predict-yes*H0*1
  29687. -->
  29688. (S1 ^operator O2195 = 0.5231194649144159)
  29689. =>WM: (15444: S1 ^operator O2198 +)
  29690. =>WM: (15443: S1 ^operator O2197 +)
  29691. =>WM: (15442: I3 ^dir R)
  29692. =>WM: (15441: O2198 ^name predict-no)
  29693. =>WM: (15440: O2197 ^name predict-yes)
  29694. =>WM: (15439: R1102 ^value 1)
  29695. =>WM: (15438: R1 ^reward R1102)
  29696. =>WM: (15437: I3 ^see 0)
  29697. <=WM: (15428: S1 ^operator O2195 +)
  29698. <=WM: (15429: S1 ^operator O2196 +)
  29699. <=WM: (15430: S1 ^operator O2196)
  29700. <=WM: (15414: I3 ^dir L)
  29701. <=WM: (15424: R1 ^reward R1101)
  29702. <=WM: (15409: I3 ^see 1)
  29703. <=WM: (15427: O2196 ^name predict-no)
  29704. <=WM: (15426: O2195 ^name predict-yes)
  29705. <=WM: (15425: R1101 ^value 1)
  29706. --- Inner Elaboration Phase, active level 1 (S1) ---
  29707. Firing prefer*rvt*predict-yes*H0
  29708. -->
  29709. Firing rl*prefer*rvt*predict-yes*H0*3
  29710. -->
  29711. (S1 ^operator O2197 = 0.3829421436515592)
  29712. Firing prefer*rvt*predict-yes*H0*3*H1
  29713. -->
  29714. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  29715. -->
  29716. (S1 ^operator O2197 = 0.6170645302229754)
  29717. Firing prefer*rvt*predict-no*H0
  29718. -->
  29719. Firing rl*prefer*rvt*predict-no*H0*4
  29720. -->
  29721. (S1 ^operator O2198 = 0.1269768216229884)
  29722. Firing prefer*rvt*predict-no*H0*4*H1
  29723. -->
  29724. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  29725. -->
  29726. (S1 ^operator O2198 = 0.4910065094545203)
  29727. inner elaboration loop at bottom goal.
  29728. Retracting rl*prefer*rvt*predict-no*H0*4
  29729. -->
  29730. (S1 ^operator O2196 = 0.1269768216229884)
  29731. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  29732. -->
  29733. (S1 ^operator O2196 = 0.4910065094545203)
  29734. Retracting rl*prefer*rvt*predict-yes*H0*3
  29735. -->
  29736. (S1 ^operator O2195 = 0.3829421436515592)
  29737. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  29738. -->
  29739. (S1 ^operator O2195 = 0.6170645302229754)
  29740. --- END Proposal Phase ---
  29741. --- Decision Phase ---
  29742. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.922705,0.0716664)
  29743. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  29744. =>WM: (15445: S1 ^operator O2197)
  29745. 1099: O: O2197 (predict-yes)
  29746. --- END Decision Phase ---
  29747. --- Application Phase ---
  29748. --- Firing Productions (PE) For State At Depth 1 ---
  29749. --- Inner Elaboration Phase, active level 1 (S1) ---
  29750. Firing apply*operator
  29751. -->
  29752. (I3 ^predict-yes N1099 + :O )
  29753. Firing apply*operator*complete
  29754. -->
  29755. (I3 ^predict-no N1098 - :O )
  29756. inner elaboration loop at bottom goal.
  29757. --- Change Working Memory (PE) ---
  29758. =>WM: (15446: I3 ^predict-yes N1099)
  29759. <=WM: (15432: N1098 ^status complete)
  29760. <=WM: (15431: I3 ^predict-no N1098)
  29761. --- Firing Productions (IE) For State At Depth 1 ---
  29762. --- Inner Elaboration Phase, active level 1 (S1) ---
  29763. Firing monitor*world
  29764. -->
  29765. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29766. --- Change Working Memory (IE) ---
  29767. --- END Application Phase ---
  29768. --- Output Phase ---
  29769. ENV: Agent did: predict-yes for direction R in state State-A
  29770. In State-A moving R
  29771. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  29772. predict error 0
  29773. dir: dir isL
  29774. --- END Output Phase ---
  29775. --- Input Phase ---
  29776. =>WM: (15450: I2 ^dir L)
  29777. =>WM: (15449: I2 ^reward 1)
  29778. =>WM: (15448: I2 ^see 1)
  29779. =>WM: (15447: N1099 ^status complete)
  29780. <=WM: (15435: I2 ^dir R)
  29781. <=WM: (15434: I2 ^reward 1)
  29782. <=WM: (15433: I2 ^see 0)
  29783. =>WM: (15451: I2 ^level-1 R1-root)
  29784. <=WM: (15436: I2 ^level-1 L0-root)
  29785. --- END Input Phase ---
  29786. --- Proposal Phase ---
  29787. --- Inner Elaboration Phase, active level 1 (S1) ---
  29788. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  29789. -->
  29790. (S1 ^operator O2197 = 0.4768792530747783)
  29791. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  29792. -->
  29793. (S1 ^operator O2198 = -0.01194930198035649)
  29794. Firing prefer*rvt*predict-no*H0*2*H1
  29795. -->
  29796. Firing prefer*rvt*predict-yes*H0*1*H1
  29797. -->
  29798. Firing elaborate*copy-see-to-output-link
  29799. -->
  29800. (I3 ^see 1 +)
  29801. Firing elaborate*reward*based*on*reward
  29802. -->
  29803. (R1103 ^value 1 +)
  29804. (R1 ^reward R1103 +)
  29805. Firing propose*predict-yes
  29806. -->
  29807. (O2199 ^name predict-yes +)
  29808. (S1 ^operator O2199 +)
  29809. Firing propose*predict-no
  29810. -->
  29811. (O2200 ^name predict-no +)
  29812. (S1 ^operator O2200 +)
  29813. Firing rl*prefer*rvt*predict-no*H0*2
  29814. -->
  29815. (S1 ^operator O2198 = 0.2550134065643873)
  29816. Firing rl*prefer*rvt*predict-yes*H0*1
  29817. -->
  29818. (S1 ^operator O2197 = 0.5231194649144159)
  29819. Firing prefer*rvt*predict-yes*H0
  29820. -->
  29821. Firing prefer*rvt*predict-no*H0
  29822. -->
  29823. Firing elaborate*copy-dir-to-output-link
  29824. -->
  29825. (I3 ^dir L +)
  29826. inner elaboration loop at bottom goal.
  29827. Retracting elaborate*copy-see-to-output-link
  29828. -->
  29829. (I3 ^see 0 +)
  29830. Retracting propose*predict-no
  29831. -->
  29832. (O2198 ^name predict-no +)
  29833. (S1 ^operator O2198 +)
  29834. Retracting propose*predict-yes
  29835. -->
  29836. (O2197 ^name predict-yes +)
  29837. (S1 ^operator O2197 +)
  29838. Retracting elaborate*reward*based*on*reward
  29839. -->
  29840. (R1102 ^value 1 +)
  29841. (R1 ^reward R1102 +)
  29842. Retracting elaborate*copy-dir-to-output-link
  29843. -->
  29844. (I3 ^dir R +)
  29845. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  29846. -->
  29847. (S1 ^operator O2198 = 0.4910065094545203)
  29848. Retracting rl*prefer*rvt*predict-no*H0*4
  29849. -->
  29850. (S1 ^operator O2198 = 0.1269768216229884)
  29851. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  29852. -->
  29853. (S1 ^operator O2197 = 0.6170645302229754)
  29854. Retracting rl*prefer*rvt*predict-yes*H0*3
  29855. -->
  29856. (S1 ^operator O2197 = 0.3829421436515592)
  29857. =>WM: (15459: S1 ^operator O2200 +)
  29858. =>WM: (15458: S1 ^operator O2199 +)
  29859. =>WM: (15457: I3 ^dir L)
  29860. =>WM: (15456: O2200 ^name predict-no)
  29861. =>WM: (15455: O2199 ^name predict-yes)
  29862. =>WM: (15454: R1103 ^value 1)
  29863. =>WM: (15453: R1 ^reward R1103)
  29864. =>WM: (15452: I3 ^see 1)
  29865. <=WM: (15443: S1 ^operator O2197 +)
  29866. <=WM: (15445: S1 ^operator O2197)
  29867. <=WM: (15444: S1 ^operator O2198 +)
  29868. <=WM: (15442: I3 ^dir R)
  29869. <=WM: (15438: R1 ^reward R1102)
  29870. <=WM: (15437: I3 ^see 0)
  29871. <=WM: (15441: O2198 ^name predict-no)
  29872. <=WM: (15440: O2197 ^name predict-yes)
  29873. <=WM: (15439: R1102 ^value 1)
  29874. --- Inner Elaboration Phase, active level 1 (S1) ---
  29875. Firing prefer*rvt*predict-yes*H0
  29876. -->
  29877. Firing rl*prefer*rvt*predict-yes*H0*1
  29878. -->
  29879. (S1 ^operator O2199 = 0.5231194649144159)
  29880. Firing prefer*rvt*predict-yes*H0*1*H1
  29881. -->
  29882. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  29883. -->
  29884. (S1 ^operator O2199 = 0.4768792530747783)
  29885. Firing prefer*rvt*predict-no*H0
  29886. -->
  29887. Firing rl*prefer*rvt*predict-no*H0*2
  29888. -->
  29889. (S1 ^operator O2200 = 0.2550134065643873)
  29890. Firing prefer*rvt*predict-no*H0*2*H1
  29891. -->
  29892. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  29893. -->
  29894. (S1 ^operator O2200 = -0.01194930198035649)
  29895. inner elaboration loop at bottom goal.
  29896. Retracting rl*prefer*rvt*predict-no*H0*2
  29897. -->
  29898. (S1 ^operator O2198 = 0.2550134065643873)
  29899. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  29900. -->
  29901. (S1 ^operator O2198 = -0.01194930198035649)
  29902. Retracting rl*prefer*rvt*predict-yes*H0*1
  29903. -->
  29904. (S1 ^operator O2197 = 0.5231194649144159)
  29905. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  29906. -->
  29907. (S1 ^operator O2197 = 0.4768792530747783)
  29908. --- END Proposal Phase ---
  29909. --- Decision Phase ---
  29910. RL update rl*prefer*rvt*predict-yes*H0*3 0.673135 -0.290193 0.382942 -> 0.673134 -0.290193 0.382941(R,m,v=1,0.964497,0.0344463)
  29911. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326871 0.290194 0.617065 -> 0.32687 0.290193 0.617064(R,m,v=1,1,0)
  29912. =>WM: (15460: S1 ^operator O2199)
  29913. 1100: O: O2199 (predict-yes)
  29914. --- END Decision Phase ---
  29915. --- Application Phase ---
  29916. --- Firing Productions (PE) For State At Depth 1 ---
  29917. --- Inner Elaboration Phase, active level 1 (S1) ---
  29918. Firing apply*operator
  29919. -->
  29920. (I3 ^predict-yes N1100 + :O )
  29921. Firing apply*operator*complete
  29922. -->
  29923. (I3 ^predict-yes N1099 - :O )
  29924. inner elaboration loop at bottom goal.
  29925. --- Change Working Memory (PE) ---
  29926. =>WM: (15461: I3 ^predict-yes N1100)
  29927. <=WM: (15447: N1099 ^status complete)
  29928. <=WM: (15446: I3 ^predict-yes N1099)
  29929. --- Firing Productions (IE) For State At Depth 1 ---
  29930. --- Inner Elaboration Phase, active level 1 (S1) ---
  29931. Firing monitor*world
  29932. -->
  29933. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29934. --- Change Working Memory (IE) ---
  29935. --- END Application Phase ---
  29936. --- Output Phase ---
  29937. ENV: Agent did: predict-yes for direction L in state State-B
  29938. In State-B moving L
  29939. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29940. predict error 0
  29941. dir: dir isU
  29942. --- END Output Phase ---
  29943. --- Input Phase ---
  29944. =>WM: (15465: I2 ^dir U)
  29945. =>WM: (15464: I2 ^reward 1)
  29946. =>WM: (15463: I2 ^see 1)
  29947. =>WM: (15462: N1100 ^status complete)
  29948. <=WM: (15450: I2 ^dir L)
  29949. <=WM: (15449: I2 ^reward 1)
  29950. <=WM: (15448: I2 ^see 1)
  29951. =>WM: (15466: I2 ^level-1 L1-root)
  29952. <=WM: (15451: I2 ^level-1 R1-root)
  29953. --- END Input Phase ---
  29954. --- Proposal Phase ---
  29955. --- Inner Elaboration Phase, active level 1 (S1) ---
  29956. Firing elaborate*copy-see-to-output-link
  29957. -->
  29958. (I3 ^see 1 +)
  29959. Firing elaborate*reward*based*on*reward
  29960. -->
  29961. (R1104 ^value 1 +)
  29962. (R1 ^reward R1104 +)
  29963. Firing propose*predict-yes
  29964. -->
  29965. (O2201 ^name predict-yes +)
  29966. (S1 ^operator O2201 +)
  29967. Firing propose*predict-no
  29968. -->
  29969. (O2202 ^name predict-no +)
  29970. (S1 ^operator O2202 +)
  29971. Firing rl*prefer*rvt*predict-no*H0*6
  29972. -->
  29973. (S1 ^operator O2200 = 0.9999999999999999)
  29974. Firing rl*prefer*rvt*predict-yes*H0*5
  29975. -->
  29976. (S1 ^operator O2199 = 0.)
  29977. Firing prefer*rvt*predict-yes*H0
  29978. -->
  29979. Firing prefer*rvt*predict-no*H0
  29980. -->
  29981. Firing elaborate*copy-dir-to-output-link
  29982. -->
  29983. (I3 ^dir U +)
  29984. inner elaboration loop at bottom goal.
  29985. Retracting elaborate*copy-see-to-output-link
  29986. -->
  29987. (I3 ^see 1 +)
  29988. Retracting propose*predict-no
  29989. -->
  29990. (O2200 ^name predict-no +)
  29991. (S1 ^operator O2200 +)
  29992. Retracting propose*predict-yes
  29993. -->
  29994. (O2199 ^name predict-yes +)
  29995. (S1 ^operator O2199 +)
  29996. Retracting elaborate*reward*based*on*reward
  29997. -->
  29998. (R1103 ^value 1 +)
  29999. (R1 ^reward R1103 +)
  30000. Retracting elaborate*copy-dir-to-output-link
  30001. -->
  30002. (I3 ^dir L +)
  30003. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  30004. -->
  30005. (S1 ^operator O2200 = -0.01194930198035649)
  30006. Retracting rl*prefer*rvt*predict-no*H0*2
  30007. -->
  30008. (S1 ^operator O2200 = 0.2550134065643873)
  30009. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  30010. -->
  30011. (S1 ^operator O2199 = 0.4768792530747783)
  30012. Retracting rl*prefer*rvt*predict-yes*H0*1
  30013. -->
  30014. (S1 ^operator O2199 = 0.5231194649144159)
  30015. =>WM: (15473: S1 ^operator O2202 +)
  30016. =>WM: (15472: S1 ^operator O2201 +)
  30017. =>WM: (15471: I3 ^dir U)
  30018. =>WM: (15470: O2202 ^name predict-no)
  30019. =>WM: (15469: O2201 ^name predict-yes)
  30020. =>WM: (15468: R1104 ^value 1)
  30021. =>WM: (15467: R1 ^reward R1104)
  30022. <=WM: (15458: S1 ^operator O2199 +)
  30023. <=WM: (15460: S1 ^operator O2199)
  30024. <=WM: (15459: S1 ^operator O2200 +)
  30025. <=WM: (15457: I3 ^dir L)
  30026. <=WM: (15453: R1 ^reward R1103)
  30027. <=WM: (15456: O2200 ^name predict-no)
  30028. <=WM: (15455: O2199 ^name predict-yes)
  30029. <=WM: (15454: R1103 ^value 1)
  30030. --- Inner Elaboration Phase, active level 1 (S1) ---
  30031. Firing prefer*rvt*predict-yes*H0
  30032. -->
  30033. Firing rl*prefer*rvt*predict-yes*H0*5
  30034. -->
  30035. (S1 ^operator O2201 = 0.)
  30036. Firing prefer*rvt*predict-no*H0
  30037. -->
  30038. Firing rl*prefer*rvt*predict-no*H0*6
  30039. -->
  30040. (S1 ^operator O2202 = 0.9999999999999999)
  30041. inner elaboration loop at bottom goal.
  30042. Retracting rl*prefer*rvt*predict-no*H0*6
  30043. -->
  30044. (S1 ^operator O2200 = 0.9999999999999999)
  30045. Retracting rl*prefer*rvt*predict-yes*H0*5
  30046. -->
  30047. (S1 ^operator O2199 = 0.)
  30048. --- END Proposal Phase ---
  30049. --- Decision Phase ---
  30050. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.523119 -> 0.727959 -0.20484 0.52312(R,m,v=1,0.981132,0.0186291)
  30051. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272039 0.20484 0.476879 -> 0.272039 0.20484 0.476879(R,m,v=1,1,0)
  30052. =>WM: (15474: S1 ^operator O2202)
  30053. 1101: O: O2202 (predict-no)
  30054. --- END Decision Phase ---
  30055. --- Application Phase ---
  30056. --- Firing Productions (PE) For State At Depth 1 ---
  30057. --- Inner Elaboration Phase, active level 1 (S1) ---
  30058. Firing apply*operator
  30059. -->
  30060. (I3 ^predict-no N1101 + :O )
  30061. Firing apply*operator*complete
  30062. -->
  30063. (I3 ^predict-yes N1100 - :O )
  30064. inner elaboration loop at bottom goal.
  30065. --- Change Working Memory (PE) ---
  30066. =>WM: (15475: I3 ^predict-no N1101)
  30067. <=WM: (15462: N1100 ^status complete)
  30068. <=WM: (15461: I3 ^predict-yes N1100)
  30069. --- Firing Productions (IE) For State At Depth 1 ---
  30070. --- Inner Elaboration Phase, active level 1 (S1) ---
  30071. Firing monitor*world
  30072. -->
  30073. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30074. --- Change Working Memory (IE) ---
  30075. --- END Application Phase ---
  30076. --- Output Phase ---
  30077. ENV: Agent did: predict-no for direction U in state State-A
  30078. In State-A moving U
  30079. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30080. predict error 0
  30081. dir: dir isL
  30082. --- END Output Phase ---
  30083. --- Input Phase ---
  30084. =>WM: (15479: I2 ^dir L)
  30085. =>WM: (15478: I2 ^reward 1)
  30086. =>WM: (15477: I2 ^see 0)
  30087. =>WM: (15476: N1101 ^status complete)
  30088. <=WM: (15465: I2 ^dir U)
  30089. <=WM: (15464: I2 ^reward 1)
  30090. <=WM: (15463: I2 ^see 1)
  30091. =>WM: (15480: I2 ^level-1 L1-root)
  30092. <=WM: (15466: I2 ^level-1 L1-root)
  30093. --- END Input Phase ---
  30094. --- Proposal Phase ---
  30095. --- Inner Elaboration Phase, active level 1 (S1) ---
  30096. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  30097. -->
  30098. (S1 ^operator O2201 = 0.1693592933936033)
  30099. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  30100. -->
  30101. (S1 ^operator O2202 = 0.7449865546652295)
  30102. Firing prefer*rvt*predict-no*H0*2*H1
  30103. -->
  30104. Firing prefer*rvt*predict-yes*H0*1*H1
  30105. -->
  30106. Firing elaborate*copy-see-to-output-link
  30107. -->
  30108. (I3 ^see 0 +)
  30109. Firing elaborate*reward*based*on*reward
  30110. -->
  30111. (R1105 ^value 1 +)
  30112. (R1 ^reward R1105 +)
  30113. Firing propose*predict-yes
  30114. -->
  30115. (O2203 ^name predict-yes +)
  30116. (S1 ^operator O2203 +)
  30117. Firing propose*predict-no
  30118. -->
  30119. (O2204 ^name predict-no +)
  30120. (S1 ^operator O2204 +)
  30121. Firing rl*prefer*rvt*predict-no*H0*2
  30122. -->
  30123. (S1 ^operator O2202 = 0.2550134065643873)
  30124. Firing rl*prefer*rvt*predict-yes*H0*1
  30125. -->
  30126. (S1 ^operator O2201 = 0.5231196572160367)
  30127. Firing prefer*rvt*predict-yes*H0
  30128. -->
  30129. Firing prefer*rvt*predict-no*H0
  30130. -->
  30131. Firing elaborate*copy-dir-to-output-link
  30132. -->
  30133. (I3 ^dir L +)
  30134. inner elaboration loop at bottom goal.
  30135. Retracting elaborate*copy-see-to-output-link
  30136. -->
  30137. (I3 ^see 1 +)
  30138. Retracting propose*predict-no
  30139. -->
  30140. (O2202 ^name predict-no +)
  30141. (S1 ^operator O2202 +)
  30142. Retracting propose*predict-yes
  30143. -->
  30144. (O2201 ^name predict-yes +)
  30145. (S1 ^operator O2201 +)
  30146. Retracting elaborate*reward*based*on*reward
  30147. -->
  30148. (R1104 ^value 1 +)
  30149. (R1 ^reward R1104 +)
  30150. Retracting elaborate*copy-dir-to-output-link
  30151. -->
  30152. (I3 ^dir U +)
  30153. Retracting rl*prefer*rvt*predict-no*H0*6
  30154. -->
  30155. (S1 ^operator O2202 = 0.9999999999999999)
  30156. Retracting rl*prefer*rvt*predict-yes*H0*5
  30157. -->
  30158. (S1 ^operator O2201 = 0.)
  30159. =>WM: (15488: S1 ^operator O2204 +)
  30160. =>WM: (15487: S1 ^operator O2203 +)
  30161. =>WM: (15486: I3 ^dir L)
  30162. =>WM: (15485: O2204 ^name predict-no)
  30163. =>WM: (15484: O2203 ^name predict-yes)
  30164. =>WM: (15483: R1105 ^value 1)
  30165. =>WM: (15482: R1 ^reward R1105)
  30166. =>WM: (15481: I3 ^see 0)
  30167. <=WM: (15472: S1 ^operator O2201 +)
  30168. <=WM: (15473: S1 ^operator O2202 +)
  30169. <=WM: (15474: S1 ^operator O2202)
  30170. <=WM: (15471: I3 ^dir U)
  30171. <=WM: (15467: R1 ^reward R1104)
  30172. <=WM: (15452: I3 ^see 1)
  30173. <=WM: (15470: O2202 ^name predict-no)
  30174. <=WM: (15469: O2201 ^name predict-yes)
  30175. <=WM: (15468: R1104 ^value 1)
  30176. --- Inner Elaboration Phase, active level 1 (S1) ---
  30177. Firing prefer*rvt*predict-yes*H0
  30178. -->
  30179. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  30180. -->
  30181. (S1 ^operator O2203 = 0.1693592933936033)
  30182. Firing rl*prefer*rvt*predict-yes*H0*1
  30183. -->
  30184. (S1 ^operator O2203 = 0.5231196572160367)
  30185. Firing prefer*rvt*predict-yes*H0*1*H1
  30186. -->
  30187. Firing prefer*rvt*predict-no*H0
  30188. -->
  30189. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  30190. -->
  30191. (S1 ^operator O2204 = 0.7449865546652295)
  30192. Firing rl*prefer*rvt*predict-no*H0*2
  30193. -->
  30194. (S1 ^operator O2204 = 0.2550134065643873)
  30195. Firing prefer*rvt*predict-no*H0*2*H1
  30196. -->
  30197. inner elaboration loop at bottom goal.
  30198. Retracting rl*prefer*rvt*predict-no*H0*2
  30199. -->
  30200. (S1 ^operator O2202 = 0.2550134065643873)
  30201. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  30202. -->
  30203. (S1 ^operator O2202 = 0.7449865546652295)
  30204. Retracting rl*prefer*rvt*predict-yes*H0*1
  30205. -->
  30206. (S1 ^operator O2201 = 0.5231196572160367)
  30207. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  30208. -->
  30209. (S1 ^operator O2201 = 0.1693592933936033)
  30210. --- END Proposal Phase ---
  30211. --- Decision Phase ---
  30212. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30213. =>WM: (15489: S1 ^operator O2204)
  30214. 1102: O: O2204 (predict-no)
  30215. --- END Decision Phase ---
  30216. --- Application Phase ---
  30217. --- Firing Productions (PE) For State At Depth 1 ---
  30218. --- Inner Elaboration Phase, active level 1 (S1) ---
  30219. Firing apply*operator
  30220. -->
  30221. (I3 ^predict-no N1102 + :O )
  30222. Firing apply*operator*complete
  30223. -->
  30224. (I3 ^predict-no N1101 - :O )
  30225. inner elaboration loop at bottom goal.
  30226. --- Change Working Memory (PE) ---
  30227. =>WM: (15490: I3 ^predict-no N1102)
  30228. <=WM: (15476: N1101 ^status complete)
  30229. <=WM: (15475: I3 ^predict-no N1101)
  30230. --- Firing Productions (IE) For State At Depth 1 ---
  30231. --- Inner Elaboration Phase, active level 1 (S1) ---
  30232. Firing monitor*world
  30233. -->
  30234. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30235. --- Change Working Memory (IE) ---
  30236. --- END Application Phase ---
  30237. --- Output Phase ---
  30238. ENV: Agent did: predict-no for direction L in state State-A
  30239. In State-A moving L
  30240. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30241. predict error 0
  30242. dir: dir isL
  30243. --- END Output Phase ---
  30244. --- Input Phase ---
  30245. =>WM: (15494: I2 ^dir L)
  30246. =>WM: (15493: I2 ^reward 1)
  30247. =>WM: (15492: I2 ^see 0)
  30248. =>WM: (15491: N1102 ^status complete)
  30249. <=WM: (15479: I2 ^dir L)
  30250. <=WM: (15478: I2 ^reward 1)
  30251. <=WM: (15477: I2 ^see 0)
  30252. =>WM: (15495: I2 ^level-1 L0-root)
  30253. <=WM: (15480: I2 ^level-1 L1-root)
  30254. --- END Input Phase ---
  30255. --- Proposal Phase ---
  30256. --- Inner Elaboration Phase, active level 1 (S1) ---
  30257. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  30258. -->
  30259. (S1 ^operator O2203 = 0.3)
  30260. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  30261. -->
  30262. (S1 ^operator O2204 = 0.7449866452103731)
  30263. Firing prefer*rvt*predict-no*H0*2*H1
  30264. -->
  30265. Firing prefer*rvt*predict-yes*H0*1*H1
  30266. -->
  30267. Firing elaborate*copy-see-to-output-link
  30268. -->
  30269. (I3 ^see 0 +)
  30270. Firing elaborate*reward*based*on*reward
  30271. -->
  30272. (R1106 ^value 1 +)
  30273. (R1 ^reward R1106 +)
  30274. Firing propose*predict-yes
  30275. -->
  30276. (O2205 ^name predict-yes +)
  30277. (S1 ^operator O2205 +)
  30278. Firing propose*predict-no
  30279. -->
  30280. (O2206 ^name predict-no +)
  30281. (S1 ^operator O2206 +)
  30282. Firing rl*prefer*rvt*predict-no*H0*2
  30283. -->
  30284. (S1 ^operator O2204 = 0.2550134065643873)
  30285. Firing rl*prefer*rvt*predict-yes*H0*1
  30286. -->
  30287. (S1 ^operator O2203 = 0.5231196572160367)
  30288. Firing prefer*rvt*predict-yes*H0
  30289. -->
  30290. Firing prefer*rvt*predict-no*H0
  30291. -->
  30292. Firing elaborate*copy-dir-to-output-link
  30293. -->
  30294. (I3 ^dir L +)
  30295. inner elaboration loop at bottom goal.
  30296. Retracting elaborate*copy-see-to-output-link
  30297. -->
  30298. (I3 ^see 0 +)
  30299. Retracting propose*predict-no
  30300. -->
  30301. (O2204 ^name predict-no +)
  30302. (S1 ^operator O2204 +)
  30303. Retracting propose*predict-yes
  30304. -->
  30305. (O2203 ^name predict-yes +)
  30306. (S1 ^operator O2203 +)
  30307. Retracting elaborate*reward*based*on*reward
  30308. -->
  30309. (R1105 ^value 1 +)
  30310. (R1 ^reward R1105 +)
  30311. Retracting elaborate*copy-dir-to-output-link
  30312. -->
  30313. (I3 ^dir L +)
  30314. Retracting rl*prefer*rvt*predict-no*H0*2
  30315. -->
  30316. (S1 ^operator O2204 = 0.2550134065643873)
  30317. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  30318. -->
  30319. (S1 ^operator O2204 = 0.7449865546652295)
  30320. Retracting rl*prefer*rvt*predict-yes*H0*1
  30321. -->
  30322. (S1 ^operator O2203 = 0.5231196572160367)
  30323. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  30324. -->
  30325. (S1 ^operator O2203 = 0.1693592933936033)
  30326. =>WM: (15501: S1 ^operator O2206 +)
  30327. =>WM: (15500: S1 ^operator O2205 +)
  30328. =>WM: (15499: O2206 ^name predict-no)
  30329. =>WM: (15498: O2205 ^name predict-yes)
  30330. =>WM: (15497: R1106 ^value 1)
  30331. =>WM: (15496: R1 ^reward R1106)
  30332. <=WM: (15487: S1 ^operator O2203 +)
  30333. <=WM: (15488: S1 ^operator O2204 +)
  30334. <=WM: (15489: S1 ^operator O2204)
  30335. <=WM: (15482: R1 ^reward R1105)
  30336. <=WM: (15485: O2204 ^name predict-no)
  30337. <=WM: (15484: O2203 ^name predict-yes)
  30338. <=WM: (15483: R1105 ^value 1)
  30339. --- Inner Elaboration Phase, active level 1 (S1) ---
  30340. Firing prefer*rvt*predict-yes*H0
  30341. -->
  30342. Firing rl*prefer*rvt*predict-yes*H0*1
  30343. -->
  30344. (S1 ^operator O2205 = 0.5231196572160367)
  30345. Firing prefer*rvt*predict-yes*H0*1*H1
  30346. -->
  30347. Firing rl*prefer*rvt*predict-yes*H0*1*H1*21
  30348. -->
  30349. (S1 ^operator O2205 = 0.3)
  30350. Firing prefer*rvt*predict-no*H0
  30351. -->
  30352. Firing rl*prefer*rvt*predict-no*H0*2
  30353. -->
  30354. (S1 ^operator O2206 = 0.2550134065643873)
  30355. Firing prefer*rvt*predict-no*H0*2*H1
  30356. -->
  30357. Firing rl*prefer*rvt*predict-no*H0*2*H1*12
  30358. -->
  30359. (S1 ^operator O2206 = 0.7449866452103731)
  30360. inner elaboration loop at bottom goal.
  30361. Retracting rl*prefer*rvt*predict-no*H0*2
  30362. -->
  30363. (S1 ^operator O2204 = 0.2550134065643873)
  30364. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  30365. -->
  30366. (S1 ^operator O2204 = 0.7449866452103731)
  30367. Retracting rl*prefer*rvt*predict-yes*H0*1
  30368. -->
  30369. (S1 ^operator O2203 = 0.5231196572160367)
  30370. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  30371. -->
  30372. (S1 ^operator O2203 = 0.3)
  30373. --- END Proposal Phase ---
  30374. --- Decision Phase ---
  30375. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.923077,0.0713489)
  30376. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  30377. =>WM: (15502: S1 ^operator O2206)
  30378. 1103: O: O2206 (predict-no)
  30379. --- END Decision Phase ---
  30380. --- Application Phase ---
  30381. --- Firing Productions (PE) For State At Depth 1 ---
  30382. --- Inner Elaboration Phase, active level 1 (S1) ---
  30383. Firing apply*operator
  30384. -->
  30385. (I3 ^predict-no N1103 + :O )
  30386. Firing apply*operator*complete
  30387. -->
  30388. (I3 ^predict-no N1102 - :O )
  30389. inner elaboration loop at bottom goal.
  30390. --- Change Working Memory (PE) ---
  30391. =>WM: (15503: I3 ^predict-no N1103)
  30392. <=WM: (15491: N1102 ^status complete)
  30393. <=WM: (15490: I3 ^predict-no N1102)
  30394. --- Firing Productions (IE) For State At Depth 1 ---
  30395. --- Inner Elaboration Phase, active level 1 (S1) ---
  30396. Firing monitor*world
  30397. -->
  30398. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30399. --- Change Working Memory (IE) ---
  30400. --- END Application Phase ---
  30401. --- Output Phase ---
  30402. ENV: Agent did: predict-no for direction L in state State-A
  30403. In State-A moving L
  30404. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30405. predict error 0
  30406. dir: dir isR
  30407. --- END Output Phase ---
  30408. --- Input Phase ---
  30409. =>WM: (15507: I2 ^dir R)
  30410. =>WM: (15506: I2 ^reward 1)
  30411. =>WM: (15505: I2 ^see 0)
  30412. =>WM: (15504: N1103 ^status complete)
  30413. <=WM: (15494: I2 ^dir L)
  30414. <=WM: (15493: I2 ^reward 1)
  30415. <=WM: (15492: I2 ^see 0)
  30416. =>WM: (15508: I2 ^level-1 L0-root)
  30417. <=WM: (15495: I2 ^level-1 L0-root)
  30418. --- END Input Phase ---
  30419. --- Proposal Phase ---
  30420. --- Inner Elaboration Phase, active level 1 (S1) ---
  30421. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  30422. -->
  30423. (S1 ^operator O2205 = 0.6170635291417952)
  30424. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  30425. -->
  30426. (S1 ^operator O2206 = 0.4910065094545203)
  30427. Firing prefer*rvt*predict-no*H0*4*H1
  30428. -->
  30429. Firing prefer*rvt*predict-yes*H0*3*H1
  30430. -->
  30431. Firing elaborate*copy-see-to-output-link
  30432. -->
  30433. (I3 ^see 0 +)
  30434. Firing elaborate*reward*based*on*reward
  30435. -->
  30436. (R1107 ^value 1 +)
  30437. (R1 ^reward R1107 +)
  30438. Firing propose*predict-yes
  30439. -->
  30440. (O2207 ^name predict-yes +)
  30441. (S1 ^operator O2207 +)
  30442. Firing propose*predict-no
  30443. -->
  30444. (O2208 ^name predict-no +)
  30445. (S1 ^operator O2208 +)
  30446. Firing rl*prefer*rvt*predict-no*H0*4
  30447. -->
  30448. (S1 ^operator O2206 = 0.1269768216229884)
  30449. Firing rl*prefer*rvt*predict-yes*H0*3
  30450. -->
  30451. (S1 ^operator O2205 = 0.382941142570379)
  30452. Firing prefer*rvt*predict-yes*H0
  30453. -->
  30454. Firing prefer*rvt*predict-no*H0
  30455. -->
  30456. Firing elaborate*copy-dir-to-output-link
  30457. -->
  30458. (I3 ^dir R +)
  30459. inner elaboration loop at bottom goal.
  30460. Retracting elaborate*copy-see-to-output-link
  30461. -->
  30462. (I3 ^see 0 +)
  30463. Retracting propose*predict-no
  30464. -->
  30465. (O2206 ^name predict-no +)
  30466. (S1 ^operator O2206 +)
  30467. Retracting propose*predict-yes
  30468. -->
  30469. (O2205 ^name predict-yes +)
  30470. (S1 ^operator O2205 +)
  30471. Retracting elaborate*reward*based*on*reward
  30472. -->
  30473. (R1106 ^value 1 +)
  30474. (R1 ^reward R1106 +)
  30475. Retracting elaborate*copy-dir-to-output-link
  30476. -->
  30477. (I3 ^dir L +)
  30478. Retracting rl*prefer*rvt*predict-no*H0*2*H1*12
  30479. -->
  30480. (S1 ^operator O2206 = 0.7449866452103731)
  30481. Retracting rl*prefer*rvt*predict-no*H0*2
  30482. -->
  30483. (S1 ^operator O2206 = 0.2550134123799448)
  30484. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*21
  30485. -->
  30486. (S1 ^operator O2205 = 0.3)
  30487. Retracting rl*prefer*rvt*predict-yes*H0*1
  30488. -->
  30489. (S1 ^operator O2205 = 0.5231196572160367)
  30490. =>WM: (15515: S1 ^operator O2208 +)
  30491. =>WM: (15514: S1 ^operator O2207 +)
  30492. =>WM: (15513: I3 ^dir R)
  30493. =>WM: (15512: O2208 ^name predict-no)
  30494. =>WM: (15511: O2207 ^name predict-yes)
  30495. =>WM: (15510: R1107 ^value 1)
  30496. =>WM: (15509: R1 ^reward R1107)
  30497. <=WM: (15500: S1 ^operator O2205 +)
  30498. <=WM: (15501: S1 ^operator O2206 +)
  30499. <=WM: (15502: S1 ^operator O2206)
  30500. <=WM: (15486: I3 ^dir L)
  30501. <=WM: (15496: R1 ^reward R1106)
  30502. <=WM: (15499: O2206 ^name predict-no)
  30503. <=WM: (15498: O2205 ^name predict-yes)
  30504. <=WM: (15497: R1106 ^value 1)
  30505. --- Inner Elaboration Phase, active level 1 (S1) ---
  30506. Firing prefer*rvt*predict-yes*H0
  30507. -->
  30508. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  30509. -->
  30510. (S1 ^operator O2207 = 0.6170635291417952)
  30511. Firing rl*prefer*rvt*predict-yes*H0*3
  30512. -->
  30513. (S1 ^operator O2207 = 0.382941142570379)
  30514. Firing prefer*rvt*predict-yes*H0*3*H1
  30515. -->
  30516. Firing prefer*rvt*predict-no*H0
  30517. -->
  30518. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  30519. -->
  30520. (S1 ^operator O2208 = 0.4910065094545203)
  30521. Firing rl*prefer*rvt*predict-no*H0*4
  30522. -->
  30523. (S1 ^operator O2208 = 0.1269768216229884)
  30524. Firing prefer*rvt*predict-no*H0*4*H1
  30525. -->
  30526. inner elaboration loop at bottom goal.
  30527. Retracting rl*prefer*rvt*predict-no*H0*4
  30528. -->
  30529. (S1 ^operator O2206 = 0.1269768216229884)
  30530. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  30531. -->
  30532. (S1 ^operator O2206 = 0.4910065094545203)
  30533. Retracting rl*prefer*rvt*predict-yes*H0*3
  30534. -->
  30535. (S1 ^operator O2205 = 0.382941142570379)
  30536. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  30537. -->
  30538. (S1 ^operator O2205 = 0.6170635291417952)
  30539. --- END Proposal Phase ---
  30540. --- Decision Phase ---
  30541. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.923445,0.0710342)
  30542. RL update rl*prefer*rvt*predict-no*H0*2*H1*12 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  30543. =>WM: (15516: S1 ^operator O2207)
  30544. 1104: O: O2207 (predict-yes)
  30545. --- END Decision Phase ---
  30546. --- Application Phase ---
  30547. --- Firing Productions (PE) For State At Depth 1 ---
  30548. --- Inner Elaboration Phase, active level 1 (S1) ---
  30549. Firing apply*operator
  30550. -->
  30551. (I3 ^predict-yes N1104 + :O )
  30552. Firing apply*operator*complete
  30553. -->
  30554. (I3 ^predict-no N1103 - :O )
  30555. inner elaboration loop at bottom goal.
  30556. --- Change Working Memory (PE) ---
  30557. =>WM: (15517: I3 ^predict-yes N1104)
  30558. <=WM: (15504: N1103 ^status complete)
  30559. <=WM: (15503: I3 ^predict-no N1103)
  30560. --- Firing Productions (IE) For State At Depth 1 ---
  30561. --- Inner Elaboration Phase, active level 1 (S1) ---
  30562. Firing monitor*world
  30563. -->
  30564. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30565. --- Change Working Memory (IE) ---
  30566. --- END Application Phase ---
  30567. --- Output Phase ---
  30568. ENV: Agent did: predict-yes for direction R in state State-A
  30569. In State-A moving R
  30570. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30571. predict error 0
  30572. dir: dir isR
  30573. --- END Output Phase ---
  30574. --- Input Phase ---
  30575. =>WM: (15521: I2 ^dir R)
  30576. =>WM: (15520: I2 ^reward 1)
  30577. =>WM: (15519: I2 ^see 1)
  30578. =>WM: (15518: N1104 ^status complete)
  30579. <=WM: (15507: I2 ^dir R)
  30580. <=WM: (15506: I2 ^reward 1)
  30581. <=WM: (15505: I2 ^see 0)
  30582. =>WM: (15522: I2 ^level-1 R1-root)
  30583. <=WM: (15508: I2 ^level-1 L0-root)
  30584. --- END Input Phase ---
  30585. --- Proposal Phase ---
  30586. --- Inner Elaboration Phase, active level 1 (S1) ---
  30587. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  30588. -->
  30589. (S1 ^operator O2207 = 0.08783148430849691)
  30590. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  30591. -->
  30592. (S1 ^operator O2208 = 0.8730232062400305)
  30593. Firing prefer*rvt*predict-no*H0*4*H1
  30594. -->
  30595. Firing prefer*rvt*predict-yes*H0*3*H1
  30596. -->
  30597. Firing elaborate*copy-see-to-output-link
  30598. -->
  30599. (I3 ^see 1 +)
  30600. Firing elaborate*reward*based*on*reward
  30601. -->
  30602. (R1108 ^value 1 +)
  30603. (R1 ^reward R1108 +)
  30604. Firing propose*predict-yes
  30605. -->
  30606. (O2209 ^name predict-yes +)
  30607. (S1 ^operator O2209 +)
  30608. Firing propose*predict-no
  30609. -->
  30610. (O2210 ^name predict-no +)
  30611. (S1 ^operator O2210 +)
  30612. Firing rl*prefer*rvt*predict-no*H0*4
  30613. -->
  30614. (S1 ^operator O2208 = 0.1269768216229884)
  30615. Firing rl*prefer*rvt*predict-yes*H0*3
  30616. -->
  30617. (S1 ^operator O2207 = 0.382941142570379)
  30618. Firing prefer*rvt*predict-yes*H0
  30619. -->
  30620. Firing prefer*rvt*predict-no*H0
  30621. -->
  30622. Firing elaborate*copy-dir-to-output-link
  30623. -->
  30624. (I3 ^dir R +)
  30625. inner elaboration loop at bottom goal.
  30626. Retracting elaborate*copy-see-to-output-link
  30627. -->
  30628. (I3 ^see 0 +)
  30629. Retracting propose*predict-no
  30630. -->
  30631. (O2208 ^name predict-no +)
  30632. (S1 ^operator O2208 +)
  30633. Retracting propose*predict-yes
  30634. -->
  30635. (O2207 ^name predict-yes +)
  30636. (S1 ^operator O2207 +)
  30637. Retracting elaborate*reward*based*on*reward
  30638. -->
  30639. (R1107 ^value 1 +)
  30640. (R1 ^reward R1107 +)
  30641. Retracting elaborate*copy-dir-to-output-link
  30642. -->
  30643. (I3 ^dir R +)
  30644. Retracting rl*prefer*rvt*predict-no*H0*4
  30645. -->
  30646. (S1 ^operator O2208 = 0.1269768216229884)
  30647. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  30648. -->
  30649. (S1 ^operator O2208 = 0.4910065094545203)
  30650. Retracting rl*prefer*rvt*predict-yes*H0*3
  30651. -->
  30652. (S1 ^operator O2207 = 0.382941142570379)
  30653. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  30654. -->
  30655. (S1 ^operator O2207 = 0.6170635291417952)
  30656. =>WM: (15529: S1 ^operator O2210 +)
  30657. =>WM: (15528: S1 ^operator O2209 +)
  30658. =>WM: (15527: O2210 ^name predict-no)
  30659. =>WM: (15526: O2209 ^name predict-yes)
  30660. =>WM: (15525: R1108 ^value 1)
  30661. =>WM: (15524: R1 ^reward R1108)
  30662. =>WM: (15523: I3 ^see 1)
  30663. <=WM: (15514: S1 ^operator O2207 +)
  30664. <=WM: (15516: S1 ^operator O2207)
  30665. <=WM: (15515: S1 ^operator O2208 +)
  30666. <=WM: (15509: R1 ^reward R1107)
  30667. <=WM: (15481: I3 ^see 0)
  30668. <=WM: (15512: O2208 ^name predict-no)
  30669. <=WM: (15511: O2207 ^name predict-yes)
  30670. <=WM: (15510: R1107 ^value 1)
  30671. --- Inner Elaboration Phase, active level 1 (S1) ---
  30672. Firing prefer*rvt*predict-yes*H0
  30673. -->
  30674. Firing rl*prefer*rvt*predict-yes*H0*3
  30675. -->
  30676. (S1 ^operator O2209 = 0.382941142570379)
  30677. Firing prefer*rvt*predict-yes*H0*3*H1
  30678. -->
  30679. Firing rl*prefer*rvt*predict-yes*H0*3*H1*16
  30680. -->
  30681. (S1 ^operator O2209 = 0.08783148430849691)
  30682. Firing prefer*rvt*predict-no*H0
  30683. -->
  30684. Firing rl*prefer*rvt*predict-no*H0*4
  30685. -->
  30686. (S1 ^operator O2210 = 0.1269768216229884)
  30687. Firing prefer*rvt*predict-no*H0*4*H1
  30688. -->
  30689. Firing rl*prefer*rvt*predict-no*H0*4*H1*15
  30690. -->
  30691. (S1 ^operator O2210 = 0.8730232062400305)
  30692. inner elaboration loop at bottom goal.
  30693. Retracting rl*prefer*rvt*predict-no*H0*4
  30694. -->
  30695. (S1 ^operator O2208 = 0.1269768216229884)
  30696. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  30697. -->
  30698. (S1 ^operator O2208 = 0.8730232062400305)
  30699. Retracting rl*prefer*rvt*predict-yes*H0*3
  30700. -->
  30701. (S1 ^operator O2207 = 0.382941142570379)
  30702. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  30703. -->
  30704. (S1 ^operator O2207 = 0.08783148430849691)
  30705. --- END Proposal Phase ---
  30706. --- Decision Phase ---
  30707. RL update rl*prefer*rvt*predict-yes*H0*3 0.673134 -0.290193 0.382941 -> 0.673134 -0.290193 0.38294(R,m,v=1,0.964706,0.0342499)
  30708. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.32687 0.290193 0.617064 -> 0.326869 0.290193 0.617063(R,m,v=1,1,0)
  30709. =>WM: (15530: S1 ^operator O2210)
  30710. 1105: O: O2210 (predict-no)
  30711. --- END Decision Phase ---
  30712. --- Application Phase ---
  30713. --- Firing Productions (PE) For State At Depth 1 ---
  30714. --- Inner Elaboration Phase, active level 1 (S1) ---
  30715. Firing apply*operator
  30716. -->
  30717. (I3 ^predict-no N1105 + :O )
  30718. Firing apply*operator*complete
  30719. -->
  30720. (I3 ^predict-yes N1104 - :O )
  30721. inner elaboration loop at bottom goal.
  30722. --- Change Working Memory (PE) ---
  30723. =>WM: (15531: I3 ^predict-no N1105)
  30724. <=WM: (15518: N1104 ^status complete)
  30725. <=WM: (15517: I3 ^predict-yes N1104)
  30726. --- Firing Productions (IE) For State At Depth 1 ---
  30727. --- Inner Elaboration Phase, active level 1 (S1) ---
  30728. Firing monitor*world
  30729. -->
  30730. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30731. --- Change Working Memory (IE) ---
  30732. --- END Application Phase ---
  30733. --- Output Phase ---
  30734. ENV: Agent did: predict-no for direction R in state State-B
  30735. In State-B moving R
  30736. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30737. predict error 0
  30738. dir: dir isR
  30739. --- END Output Phase ---
  30740. --- Input Phase ---
  30741. =>WM: (15535: I2 ^dir R)
  30742. =>WM: (15534: I2 ^reward 1)
  30743. =>WM: (15533: I2 ^see 0)
  30744. =>WM: (15532: N1105 ^status complete)
  30745. <=WM: (15521: I2 ^dir R)
  30746. <=WM: (15520: I2 ^reward 1)
  30747. <=WM: (15519: I2 ^see 1)
  30748. =>WM: (15536: I2 ^level-1 R0-root)
  30749. <=WM: (15522: I2 ^level-1 R1-root)
  30750. --- END Input Phase ---
  30751. --- Proposal Phase ---
  30752. --- Inner Elaboration Phase, active level 1 (S1) ---
  30753. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  30754. -->
  30755. (S1 ^operator O2209 = 0.2696941111808541)
  30756. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  30757. -->
  30758. (S1 ^operator O2210 = 0.8730231457818302)
  30759. Firing prefer*rvt*predict-no*H0*4*H1
  30760. -->
  30761. Firing prefer*rvt*predict-yes*H0*3*H1
  30762. -->
  30763. Firing elaborate*copy-see-to-output-link
  30764. -->
  30765. (I3 ^see 0 +)
  30766. Firing elaborate*reward*based*on*reward
  30767. -->
  30768. (R1109 ^value 1 +)
  30769. (R1 ^reward R1109 +)
  30770. Firing propose*predict-yes
  30771. -->
  30772. (O2211 ^name predict-yes +)
  30773. (S1 ^operator O2211 +)
  30774. Firing propose*predict-no
  30775. -->
  30776. (O2212 ^name predict-no +)
  30777. (S1 ^operator O2212 +)
  30778. Firing rl*prefer*rvt*predict-no*H0*4
  30779. -->
  30780. (S1 ^operator O2210 = 0.1269768216229884)
  30781. Firing rl*prefer*rvt*predict-yes*H0*3
  30782. -->
  30783. (S1 ^operator O2209 = 0.3829404418135529)
  30784. Firing prefer*rvt*predict-yes*H0
  30785. -->
  30786. Firing prefer*rvt*predict-no*H0
  30787. -->
  30788. Firing elaborate*copy-dir-to-output-link
  30789. -->
  30790. (I3 ^dir R +)
  30791. inner elaboration loop at bottom goal.
  30792. Retracting elaborate*copy-see-to-output-link
  30793. -->
  30794. (I3 ^see 1 +)
  30795. Retracting propose*predict-no
  30796. -->
  30797. (O2210 ^name predict-no +)
  30798. (S1 ^operator O2210 +)
  30799. Retracting propose*predict-yes
  30800. -->
  30801. (O2209 ^name predict-yes +)
  30802. (S1 ^operator O2209 +)
  30803. Retracting elaborate*reward*based*on*reward
  30804. -->
  30805. (R1108 ^value 1 +)
  30806. (R1 ^reward R1108 +)
  30807. Retracting elaborate*copy-dir-to-output-link
  30808. -->
  30809. (I3 ^dir R +)
  30810. Retracting rl*prefer*rvt*predict-no*H0*4*H1*15
  30811. -->
  30812. (S1 ^operator O2210 = 0.8730232062400305)
  30813. Retracting rl*prefer*rvt*predict-no*H0*4
  30814. -->
  30815. (S1 ^operator O2210 = 0.1269768216229884)
  30816. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*16
  30817. -->
  30818. (S1 ^operator O2209 = 0.08783148430849691)
  30819. Retracting rl*prefer*rvt*predict-yes*H0*3
  30820. -->
  30821. (S1 ^operator O2209 = 0.3829404418135529)
  30822. =>WM: (15543: S1 ^operator O2212 +)
  30823. =>WM: (15542: S1 ^operator O2211 +)
  30824. =>WM: (15541: O2212 ^name predict-no)
  30825. =>WM: (15540: O2211 ^name predict-yes)
  30826. =>WM: (15539: R1109 ^value 1)
  30827. =>WM: (15538: R1 ^reward R1109)
  30828. =>WM: (15537: I3 ^see 0)
  30829. <=WM: (15528: S1 ^operator O2209 +)
  30830. <=WM: (15529: S1 ^operator O2210 +)
  30831. <=WM: (15530: S1 ^operator O2210)
  30832. <=WM: (15524: R1 ^reward R1108)
  30833. <=WM: (15523: I3 ^see 1)
  30834. <=WM: (15527: O2210 ^name predict-no)
  30835. <=WM: (15526: O2209 ^name predict-yes)
  30836. <=WM: (15525: R1108 ^value 1)
  30837. --- Inner Elaboration Phase, active level 1 (S1) ---
  30838. Firing prefer*rvt*predict-yes*H0
  30839. -->
  30840. Firing rl*prefer*rvt*predict-yes*H0*3
  30841. -->
  30842. (S1 ^operator O2211 = 0.3829404418135529)
  30843. Firing prefer*rvt*predict-yes*H0*3*H1
  30844. -->
  30845. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  30846. -->
  30847. (S1 ^operator O2211 = 0.2696941111808541)
  30848. Firing prefer*rvt*predict-no*H0
  30849. -->
  30850. Firing rl*prefer*rvt*predict-no*H0*4
  30851. -->
  30852. (S1 ^operator O2212 = 0.1269768216229884)
  30853. Firing prefer*rvt*predict-no*H0*4*H1
  30854. -->
  30855. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  30856. -->
  30857. (S1 ^operator O2212 = 0.8730231457818302)
  30858. inner elaboration loop at bottom goal.
  30859. Retracting rl*prefer*rvt*predict-no*H0*4
  30860. -->
  30861. (S1 ^operator O2210 = 0.1269768216229884)
  30862. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  30863. -->
  30864. (S1 ^operator O2210 = 0.8730231457818302)
  30865. Retracting rl*prefer*rvt*predict-yes*H0*3
  30866. -->
  30867. (S1 ^operator O2209 = 0.3829404418135529)
  30868. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  30869. -->
  30870. (S1 ^operator O2209 = 0.2696941111808541)
  30871. --- END Proposal Phase ---
  30872. --- Decision Phase ---
  30873. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.955,0.043191)
  30874. RL update rl*prefer*rvt*predict-no*H0*4*H1*15 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  30875. =>WM: (15544: S1 ^operator O2212)
  30876. 1106: O: O2212 (predict-no)
  30877. --- END Decision Phase ---
  30878. --- Application Phase ---
  30879. --- Firing Productions (PE) For State At Depth 1 ---
  30880. --- Inner Elaboration Phase, active level 1 (S1) ---
  30881. Firing apply*operator
  30882. -->
  30883. (I3 ^predict-no N1106 + :O )
  30884. Firing apply*operator*complete
  30885. -->
  30886. (I3 ^predict-no N1105 - :O )
  30887. inner elaboration loop at bottom goal.
  30888. --- Change Working Memory (PE) ---
  30889. =>WM: (15545: I3 ^predict-no N1106)
  30890. <=WM: (15532: N1105 ^status complete)
  30891. <=WM: (15531: I3 ^predict-no N1105)
  30892. --- Firing Productions (IE) For State At Depth 1 ---
  30893. --- Inner Elaboration Phase, active level 1 (S1) ---
  30894. Firing monitor*world
  30895. -->
  30896. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30897. --- Change Working Memory (IE) ---
  30898. --- END Application Phase ---
  30899. --- Output Phase ---
  30900. ENV: Agent did: predict-no for direction R in state State-B
  30901. In State-B moving R
  30902. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30903. predict error 0
  30904. dir: dir isR
  30905. --- END Output Phase ---
  30906. --- Input Phase ---
  30907. =>WM: (15549: I2 ^dir R)
  30908. =>WM: (15548: I2 ^reward 1)
  30909. =>WM: (15547: I2 ^see 0)
  30910. =>WM: (15546: N1106 ^status complete)
  30911. <=WM: (15535: I2 ^dir R)
  30912. <=WM: (15534: I2 ^reward 1)
  30913. <=WM: (15533: I2 ^see 0)
  30914. =>WM: (15550: I2 ^level-1 R0-root)
  30915. <=WM: (15536: I2 ^level-1 R0-root)
  30916. --- END Input Phase ---
  30917. --- Proposal Phase ---
  30918. --- Inner Elaboration Phase, active level 1 (S1) ---
  30919. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  30920. -->
  30921. (S1 ^operator O2211 = 0.2696941111808541)
  30922. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  30923. -->
  30924. (S1 ^operator O2212 = 0.8730231457818302)
  30925. Firing prefer*rvt*predict-no*H0*4*H1
  30926. -->
  30927. Firing prefer*rvt*predict-yes*H0*3*H1
  30928. -->
  30929. Firing elaborate*copy-see-to-output-link
  30930. -->
  30931. (I3 ^see 0 +)
  30932. Firing elaborate*reward*based*on*reward
  30933. -->
  30934. (R1110 ^value 1 +)
  30935. (R1 ^reward R1110 +)
  30936. Firing propose*predict-yes
  30937. -->
  30938. (O2213 ^name predict-yes +)
  30939. (S1 ^operator O2213 +)
  30940. Firing propose*predict-no
  30941. -->
  30942. (O2214 ^name predict-no +)
  30943. (S1 ^operator O2214 +)
  30944. Firing rl*prefer*rvt*predict-no*H0*4
  30945. -->
  30946. (S1 ^operator O2212 = 0.1269768174435356)
  30947. Firing rl*prefer*rvt*predict-yes*H0*3
  30948. -->
  30949. (S1 ^operator O2211 = 0.3829404418135529)
  30950. Firing prefer*rvt*predict-yes*H0
  30951. -->
  30952. Firing prefer*rvt*predict-no*H0
  30953. -->
  30954. Firing elaborate*copy-dir-to-output-link
  30955. -->
  30956. (I3 ^dir R +)
  30957. inner elaboration loop at bottom goal.
  30958. Retracting elaborate*copy-see-to-output-link
  30959. -->
  30960. (I3 ^see 0 +)
  30961. Retracting propose*predict-no
  30962. -->
  30963. (O2212 ^name predict-no +)
  30964. (S1 ^operator O2212 +)
  30965. Retracting propose*predict-yes
  30966. -->
  30967. (O2211 ^name predict-yes +)
  30968. (S1 ^operator O2211 +)
  30969. Retracting elaborate*reward*based*on*reward
  30970. -->
  30971. (R1109 ^value 1 +)
  30972. (R1 ^reward R1109 +)
  30973. Retracting elaborate*copy-dir-to-output-link
  30974. -->
  30975. (I3 ^dir R +)
  30976. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  30977. -->
  30978. (S1 ^operator O2212 = 0.8730231457818302)
  30979. Retracting rl*prefer*rvt*predict-no*H0*4
  30980. -->
  30981. (S1 ^operator O2212 = 0.1269768174435356)
  30982. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  30983. -->
  30984. (S1 ^operator O2211 = 0.2696941111808541)
  30985. Retracting rl*prefer*rvt*predict-yes*H0*3
  30986. -->
  30987. (S1 ^operator O2211 = 0.3829404418135529)
  30988. =>WM: (15556: S1 ^operator O2214 +)
  30989. =>WM: (15555: S1 ^operator O2213 +)
  30990. =>WM: (15554: O2214 ^name predict-no)
  30991. =>WM: (15553: O2213 ^name predict-yes)
  30992. =>WM: (15552: R1110 ^value 1)
  30993. =>WM: (15551: R1 ^reward R1110)
  30994. <=WM: (15542: S1 ^operator O2211 +)
  30995. <=WM: (15543: S1 ^operator O2212 +)
  30996. <=WM: (15544: S1 ^operator O2212)
  30997. <=WM: (15538: R1 ^reward R1109)
  30998. <=WM: (15541: O2212 ^name predict-no)
  30999. <=WM: (15540: O2211 ^name predict-yes)
  31000. <=WM: (15539: R1109 ^value 1)
  31001. --- Inner Elaboration Phase, active level 1 (S1) ---
  31002. Firing prefer*rvt*predict-yes*H0
  31003. -->
  31004. Firing rl*prefer*rvt*predict-yes*H0*3
  31005. -->
  31006. (S1 ^operator O2213 = 0.3829404418135529)
  31007. Firing prefer*rvt*predict-yes*H0*3*H1
  31008. -->
  31009. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  31010. -->
  31011. (S1 ^operator O2213 = 0.2696941111808541)
  31012. Firing prefer*rvt*predict-no*H0
  31013. -->
  31014. Firing rl*prefer*rvt*predict-no*H0*4
  31015. -->
  31016. (S1 ^operator O2214 = 0.1269768174435356)
  31017. Firing prefer*rvt*predict-no*H0*4*H1
  31018. -->
  31019. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  31020. -->
  31021. (S1 ^operator O2214 = 0.8730231457818302)
  31022. inner elaboration loop at bottom goal.
  31023. Retracting rl*prefer*rvt*predict-no*H0*4
  31024. -->
  31025. (S1 ^operator O2212 = 0.1269768174435356)
  31026. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31027. -->
  31028. (S1 ^operator O2212 = 0.8730231457818302)
  31029. Retracting rl*prefer*rvt*predict-yes*H0*3
  31030. -->
  31031. (S1 ^operator O2211 = 0.3829404418135529)
  31032. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31033. -->
  31034. (S1 ^operator O2211 = 0.2696941111808541)
  31035. --- END Proposal Phase ---
  31036. --- Decision Phase ---
  31037. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.955224,0.0429851)
  31038. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  31039. =>WM: (15557: S1 ^operator O2214)
  31040. 1107: O: O2214 (predict-no)
  31041. --- END Decision Phase ---
  31042. --- Application Phase ---
  31043. --- Firing Productions (PE) For State At Depth 1 ---
  31044. --- Inner Elaboration Phase, active level 1 (S1) ---
  31045. Firing apply*operator
  31046. -->
  31047. (I3 ^predict-no N1107 + :O )
  31048. Firing apply*operator*complete
  31049. -->
  31050. (I3 ^predict-no N1106 - :O )
  31051. inner elaboration loop at bottom goal.
  31052. --- Change Working Memory (PE) ---
  31053. =>WM: (15558: I3 ^predict-no N1107)
  31054. <=WM: (15546: N1106 ^status complete)
  31055. <=WM: (15545: I3 ^predict-no N1106)
  31056. --- Firing Productions (IE) For State At Depth 1 ---
  31057. --- Inner Elaboration Phase, active level 1 (S1) ---
  31058. Firing monitor*world
  31059. -->
  31060. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31061. --- Change Working Memory (IE) ---
  31062. --- END Application Phase ---
  31063. --- Output Phase ---
  31064. ENV: Agent did: predict-no for direction R in state State-B
  31065. In State-B moving R
  31066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31067. predict error 0
  31068. dir: dir isR
  31069. --- END Output Phase ---
  31070. --- Input Phase ---
  31071. =>WM: (15562: I2 ^dir R)
  31072. =>WM: (15561: I2 ^reward 1)
  31073. =>WM: (15560: I2 ^see 0)
  31074. =>WM: (15559: N1107 ^status complete)
  31075. <=WM: (15549: I2 ^dir R)
  31076. <=WM: (15548: I2 ^reward 1)
  31077. <=WM: (15547: I2 ^see 0)
  31078. =>WM: (15563: I2 ^level-1 R0-root)
  31079. <=WM: (15550: I2 ^level-1 R0-root)
  31080. --- END Input Phase ---
  31081. --- Proposal Phase ---
  31082. --- Inner Elaboration Phase, active level 1 (S1) ---
  31083. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  31084. -->
  31085. (S1 ^operator O2213 = 0.2696941111808541)
  31086. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  31087. -->
  31088. (S1 ^operator O2214 = 0.8730231512980253)
  31089. Firing prefer*rvt*predict-no*H0*4*H1
  31090. -->
  31091. Firing prefer*rvt*predict-yes*H0*3*H1
  31092. -->
  31093. Firing elaborate*copy-see-to-output-link
  31094. -->
  31095. (I3 ^see 0 +)
  31096. Firing elaborate*reward*based*on*reward
  31097. -->
  31098. (R1111 ^value 1 +)
  31099. (R1 ^reward R1111 +)
  31100. Firing propose*predict-yes
  31101. -->
  31102. (O2215 ^name predict-yes +)
  31103. (S1 ^operator O2215 +)
  31104. Firing propose*predict-no
  31105. -->
  31106. (O2216 ^name predict-no +)
  31107. (S1 ^operator O2216 +)
  31108. Firing rl*prefer*rvt*predict-no*H0*4
  31109. -->
  31110. (S1 ^operator O2214 = 0.1269768229597308)
  31111. Firing rl*prefer*rvt*predict-yes*H0*3
  31112. -->
  31113. (S1 ^operator O2213 = 0.3829404418135529)
  31114. Firing prefer*rvt*predict-yes*H0
  31115. -->
  31116. Firing prefer*rvt*predict-no*H0
  31117. -->
  31118. Firing elaborate*copy-dir-to-output-link
  31119. -->
  31120. (I3 ^dir R +)
  31121. inner elaboration loop at bottom goal.
  31122. Retracting elaborate*copy-see-to-output-link
  31123. -->
  31124. (I3 ^see 0 +)
  31125. Retracting propose*predict-no
  31126. -->
  31127. (O2214 ^name predict-no +)
  31128. (S1 ^operator O2214 +)
  31129. Retracting propose*predict-yes
  31130. -->
  31131. (O2213 ^name predict-yes +)
  31132. (S1 ^operator O2213 +)
  31133. Retracting elaborate*reward*based*on*reward
  31134. -->
  31135. (R1110 ^value 1 +)
  31136. (R1 ^reward R1110 +)
  31137. Retracting elaborate*copy-dir-to-output-link
  31138. -->
  31139. (I3 ^dir R +)
  31140. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31141. -->
  31142. (S1 ^operator O2214 = 0.8730231512980253)
  31143. Retracting rl*prefer*rvt*predict-no*H0*4
  31144. -->
  31145. (S1 ^operator O2214 = 0.1269768229597308)
  31146. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31147. -->
  31148. (S1 ^operator O2213 = 0.2696941111808541)
  31149. Retracting rl*prefer*rvt*predict-yes*H0*3
  31150. -->
  31151. (S1 ^operator O2213 = 0.3829404418135529)
  31152. =>WM: (15569: S1 ^operator O2216 +)
  31153. =>WM: (15568: S1 ^operator O2215 +)
  31154. =>WM: (15567: O2216 ^name predict-no)
  31155. =>WM: (15566: O2215 ^name predict-yes)
  31156. =>WM: (15565: R1111 ^value 1)
  31157. =>WM: (15564: R1 ^reward R1111)
  31158. <=WM: (15555: S1 ^operator O2213 +)
  31159. <=WM: (15556: S1 ^operator O2214 +)
  31160. <=WM: (15557: S1 ^operator O2214)
  31161. <=WM: (15551: R1 ^reward R1110)
  31162. <=WM: (15554: O2214 ^name predict-no)
  31163. <=WM: (15553: O2213 ^name predict-yes)
  31164. <=WM: (15552: R1110 ^value 1)
  31165. --- Inner Elaboration Phase, active level 1 (S1) ---
  31166. Firing prefer*rvt*predict-yes*H0
  31167. -->
  31168. Firing rl*prefer*rvt*predict-yes*H0*3
  31169. -->
  31170. (S1 ^operator O2215 = 0.3829404418135529)
  31171. Firing prefer*rvt*predict-yes*H0*3*H1
  31172. -->
  31173. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  31174. -->
  31175. (S1 ^operator O2215 = 0.2696941111808541)
  31176. Firing prefer*rvt*predict-no*H0
  31177. -->
  31178. Firing rl*prefer*rvt*predict-no*H0*4
  31179. -->
  31180. (S1 ^operator O2216 = 0.1269768229597308)
  31181. Firing prefer*rvt*predict-no*H0*4*H1
  31182. -->
  31183. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  31184. -->
  31185. (S1 ^operator O2216 = 0.8730231512980253)
  31186. inner elaboration loop at bottom goal.
  31187. Retracting rl*prefer*rvt*predict-no*H0*4
  31188. -->
  31189. (S1 ^operator O2214 = 0.1269768229597308)
  31190. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31191. -->
  31192. (S1 ^operator O2214 = 0.8730231512980253)
  31193. Retracting rl*prefer*rvt*predict-yes*H0*3
  31194. -->
  31195. (S1 ^operator O2213 = 0.3829404418135529)
  31196. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31197. -->
  31198. (S1 ^operator O2213 = 0.2696941111808541)
  31199. --- END Proposal Phase ---
  31200. --- Decision Phase ---
  31201. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.955446,0.0427811)
  31202. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  31203. =>WM: (15570: S1 ^operator O2216)
  31204. 1108: O: O2216 (predict-no)
  31205. --- END Decision Phase ---
  31206. --- Application Phase ---
  31207. --- Firing Productions (PE) For State At Depth 1 ---
  31208. --- Inner Elaboration Phase, active level 1 (S1) ---
  31209. Firing apply*operator
  31210. -->
  31211. (I3 ^predict-no N1108 + :O )
  31212. Firing apply*operator*complete
  31213. -->
  31214. (I3 ^predict-no N1107 - :O )
  31215. inner elaboration loop at bottom goal.
  31216. --- Change Working Memory (PE) ---
  31217. =>WM: (15571: I3 ^predict-no N1108)
  31218. <=WM: (15559: N1107 ^status complete)
  31219. <=WM: (15558: I3 ^predict-no N1107)
  31220. --- Firing Productions (IE) For State At Depth 1 ---
  31221. --- Inner Elaboration Phase, active level 1 (S1) ---
  31222. Firing monitor*world
  31223. -->
  31224. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31225. --- Change Working Memory (IE) ---
  31226. --- END Application Phase ---
  31227. --- Output Phase ---
  31228. ENV: Agent did: predict-no for direction R in state State-B
  31229. In State-B moving R
  31230. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31231. predict error 0
  31232. dir: dir isR
  31233. --- END Output Phase ---
  31234. --- Input Phase ---
  31235. =>WM: (15575: I2 ^dir R)
  31236. =>WM: (15574: I2 ^reward 1)
  31237. =>WM: (15573: I2 ^see 0)
  31238. =>WM: (15572: N1108 ^status complete)
  31239. <=WM: (15562: I2 ^dir R)
  31240. <=WM: (15561: I2 ^reward 1)
  31241. <=WM: (15560: I2 ^see 0)
  31242. =>WM: (15576: I2 ^level-1 R0-root)
  31243. <=WM: (15563: I2 ^level-1 R0-root)
  31244. --- END Input Phase ---
  31245. --- Proposal Phase ---
  31246. --- Inner Elaboration Phase, active level 1 (S1) ---
  31247. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  31248. -->
  31249. (S1 ^operator O2215 = 0.2696941111808541)
  31250. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  31251. -->
  31252. (S1 ^operator O2216 = 0.8730231551593619)
  31253. Firing prefer*rvt*predict-no*H0*4*H1
  31254. -->
  31255. Firing prefer*rvt*predict-yes*H0*3*H1
  31256. -->
  31257. Firing elaborate*copy-see-to-output-link
  31258. -->
  31259. (I3 ^see 0 +)
  31260. Firing elaborate*reward*based*on*reward
  31261. -->
  31262. (R1112 ^value 1 +)
  31263. (R1 ^reward R1112 +)
  31264. Firing propose*predict-yes
  31265. -->
  31266. (O2217 ^name predict-yes +)
  31267. (S1 ^operator O2217 +)
  31268. Firing propose*predict-no
  31269. -->
  31270. (O2218 ^name predict-no +)
  31271. (S1 ^operator O2218 +)
  31272. Firing rl*prefer*rvt*predict-no*H0*4
  31273. -->
  31274. (S1 ^operator O2216 = 0.1269768268210674)
  31275. Firing rl*prefer*rvt*predict-yes*H0*3
  31276. -->
  31277. (S1 ^operator O2215 = 0.3829404418135529)
  31278. Firing prefer*rvt*predict-yes*H0
  31279. -->
  31280. Firing prefer*rvt*predict-no*H0
  31281. -->
  31282. Firing elaborate*copy-dir-to-output-link
  31283. -->
  31284. (I3 ^dir R +)
  31285. inner elaboration loop at bottom goal.
  31286. Retracting elaborate*copy-see-to-output-link
  31287. -->
  31288. (I3 ^see 0 +)
  31289. Retracting propose*predict-no
  31290. -->
  31291. (O2216 ^name predict-no +)
  31292. (S1 ^operator O2216 +)
  31293. Retracting propose*predict-yes
  31294. -->
  31295. (O2215 ^name predict-yes +)
  31296. (S1 ^operator O2215 +)
  31297. Retracting elaborate*reward*based*on*reward
  31298. -->
  31299. (R1111 ^value 1 +)
  31300. (R1 ^reward R1111 +)
  31301. Retracting elaborate*copy-dir-to-output-link
  31302. -->
  31303. (I3 ^dir R +)
  31304. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31305. -->
  31306. (S1 ^operator O2216 = 0.8730231551593619)
  31307. Retracting rl*prefer*rvt*predict-no*H0*4
  31308. -->
  31309. (S1 ^operator O2216 = 0.1269768268210674)
  31310. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31311. -->
  31312. (S1 ^operator O2215 = 0.2696941111808541)
  31313. Retracting rl*prefer*rvt*predict-yes*H0*3
  31314. -->
  31315. (S1 ^operator O2215 = 0.3829404418135529)
  31316. =>WM: (15582: S1 ^operator O2218 +)
  31317. =>WM: (15581: S1 ^operator O2217 +)
  31318. =>WM: (15580: O2218 ^name predict-no)
  31319. =>WM: (15579: O2217 ^name predict-yes)
  31320. =>WM: (15578: R1112 ^value 1)
  31321. =>WM: (15577: R1 ^reward R1112)
  31322. <=WM: (15568: S1 ^operator O2215 +)
  31323. <=WM: (15569: S1 ^operator O2216 +)
  31324. <=WM: (15570: S1 ^operator O2216)
  31325. <=WM: (15564: R1 ^reward R1111)
  31326. <=WM: (15567: O2216 ^name predict-no)
  31327. <=WM: (15566: O2215 ^name predict-yes)
  31328. <=WM: (15565: R1111 ^value 1)
  31329. --- Inner Elaboration Phase, active level 1 (S1) ---
  31330. Firing prefer*rvt*predict-yes*H0
  31331. -->
  31332. Firing rl*prefer*rvt*predict-yes*H0*3
  31333. -->
  31334. (S1 ^operator O2217 = 0.3829404418135529)
  31335. Firing prefer*rvt*predict-yes*H0*3*H1
  31336. -->
  31337. Firing rl*prefer*rvt*predict-yes*H0*3*H1*18
  31338. -->
  31339. (S1 ^operator O2217 = 0.2696941111808541)
  31340. Firing prefer*rvt*predict-no*H0
  31341. -->
  31342. Firing rl*prefer*rvt*predict-no*H0*4
  31343. -->
  31344. (S1 ^operator O2218 = 0.1269768268210674)
  31345. Firing prefer*rvt*predict-no*H0*4*H1
  31346. -->
  31347. Firing rl*prefer*rvt*predict-no*H0*4*H1*17
  31348. -->
  31349. (S1 ^operator O2218 = 0.8730231551593619)
  31350. inner elaboration loop at bottom goal.
  31351. Retracting rl*prefer*rvt*predict-no*H0*4
  31352. -->
  31353. (S1 ^operator O2216 = 0.1269768268210674)
  31354. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31355. -->
  31356. (S1 ^operator O2216 = 0.8730231551593619)
  31357. Retracting rl*prefer*rvt*predict-yes*H0*3
  31358. -->
  31359. (S1 ^operator O2215 = 0.3829404418135529)
  31360. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31361. -->
  31362. (S1 ^operator O2215 = 0.2696941111808541)
  31363. --- END Proposal Phase ---
  31364. --- Decision Phase ---
  31365. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.955665,0.0425791)
  31366. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  31367. =>WM: (15583: S1 ^operator O2218)
  31368. 1109: O: O2218 (predict-no)
  31369. --- END Decision Phase ---
  31370. --- Application Phase ---
  31371. --- Firing Productions (PE) For State At Depth 1 ---
  31372. --- Inner Elaboration Phase, active level 1 (S1) ---
  31373. Firing apply*operator
  31374. -->
  31375. (I3 ^predict-no N1109 + :O )
  31376. Firing apply*operator*complete
  31377. -->
  31378. (I3 ^predict-no N1108 - :O )
  31379. inner elaboration loop at bottom goal.
  31380. --- Change Working Memory (PE) ---
  31381. =>WM: (15584: I3 ^predict-no N1109)
  31382. <=WM: (15572: N1108 ^status complete)
  31383. <=WM: (15571: I3 ^predict-no N1108)
  31384. --- Firing Productions (IE) For State At Depth 1 ---
  31385. --- Inner Elaboration Phase, active level 1 (S1) ---
  31386. Firing monitor*world
  31387. -->
  31388. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31389. --- Change Working Memory (IE) ---
  31390. --- END Application Phase ---
  31391. --- Output Phase ---
  31392. ENV: Agent did: predict-no for direction R in state State-B
  31393. In State-B moving R
  31394. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31395. predict error 0
  31396. dir: dir isL
  31397. --- END Output Phase ---
  31398. --- Input Phase ---
  31399. =>WM: (15588: I2 ^dir L)
  31400. =>WM: (15587: I2 ^reward 1)
  31401. =>WM: (15586: I2 ^see 0)
  31402. =>WM: (15585: N1109 ^status complete)
  31403. <=WM: (15575: I2 ^dir R)
  31404. <=WM: (15574: I2 ^reward 1)
  31405. <=WM: (15573: I2 ^see 0)
  31406. =>WM: (15589: I2 ^level-1 R0-root)
  31407. <=WM: (15576: I2 ^level-1 R0-root)
  31408. --- END Input Phase ---
  31409. --- Proposal Phase ---
  31410. --- Inner Elaboration Phase, active level 1 (S1) ---
  31411. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  31412. -->
  31413. (S1 ^operator O2217 = 0.4768809184460345)
  31414. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  31415. -->
  31416. (S1 ^operator O2218 = 0.1700769046561409)
  31417. Firing prefer*rvt*predict-no*H0*2*H1
  31418. -->
  31419. Firing prefer*rvt*predict-yes*H0*1*H1
  31420. -->
  31421. Firing elaborate*copy-see-to-output-link
  31422. -->
  31423. (I3 ^see 0 +)
  31424. Firing elaborate*reward*based*on*reward
  31425. -->
  31426. (R1113 ^value 1 +)
  31427. (R1 ^reward R1113 +)
  31428. Firing propose*predict-yes
  31429. -->
  31430. (O2219 ^name predict-yes +)
  31431. (S1 ^operator O2219 +)
  31432. Firing propose*predict-no
  31433. -->
  31434. (O2220 ^name predict-no +)
  31435. (S1 ^operator O2220 +)
  31436. Firing rl*prefer*rvt*predict-no*H0*2
  31437. -->
  31438. (S1 ^operator O2218 = 0.2550134037413971)
  31439. Firing rl*prefer*rvt*predict-yes*H0*1
  31440. -->
  31441. (S1 ^operator O2217 = 0.5231196572160367)
  31442. Firing prefer*rvt*predict-yes*H0
  31443. -->
  31444. Firing prefer*rvt*predict-no*H0
  31445. -->
  31446. Firing elaborate*copy-dir-to-output-link
  31447. -->
  31448. (I3 ^dir L +)
  31449. inner elaboration loop at bottom goal.
  31450. Retracting elaborate*copy-see-to-output-link
  31451. -->
  31452. (I3 ^see 0 +)
  31453. Retracting propose*predict-no
  31454. -->
  31455. (O2218 ^name predict-no +)
  31456. (S1 ^operator O2218 +)
  31457. Retracting propose*predict-yes
  31458. -->
  31459. (O2217 ^name predict-yes +)
  31460. (S1 ^operator O2217 +)
  31461. Retracting elaborate*reward*based*on*reward
  31462. -->
  31463. (R1112 ^value 1 +)
  31464. (R1 ^reward R1112 +)
  31465. Retracting elaborate*copy-dir-to-output-link
  31466. -->
  31467. (I3 ^dir R +)
  31468. Retracting rl*prefer*rvt*predict-no*H0*4*H1*17
  31469. -->
  31470. (S1 ^operator O2218 = 0.8730231578622976)
  31471. Retracting rl*prefer*rvt*predict-no*H0*4
  31472. -->
  31473. (S1 ^operator O2218 = 0.1269768295240029)
  31474. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*18
  31475. -->
  31476. (S1 ^operator O2217 = 0.2696941111808541)
  31477. Retracting rl*prefer*rvt*predict-yes*H0*3
  31478. -->
  31479. (S1 ^operator O2217 = 0.3829404418135529)
  31480. =>WM: (15596: S1 ^operator O2220 +)
  31481. =>WM: (15595: S1 ^operator O2219 +)
  31482. =>WM: (15594: I3 ^dir L)
  31483. =>WM: (15593: O2220 ^name predict-no)
  31484. =>WM: (15592: O2219 ^name predict-yes)
  31485. =>WM: (15591: R1113 ^value 1)
  31486. =>WM: (15590: R1 ^reward R1113)
  31487. <=WM: (15581: S1 ^operator O2217 +)
  31488. <=WM: (15582: S1 ^operator O2218 +)
  31489. <=WM: (15583: S1 ^operator O2218)
  31490. <=WM: (15513: I3 ^dir R)
  31491. <=WM: (15577: R1 ^reward R1112)
  31492. <=WM: (15580: O2218 ^name predict-no)
  31493. <=WM: (15579: O2217 ^name predict-yes)
  31494. <=WM: (15578: R1112 ^value 1)
  31495. --- Inner Elaboration Phase, active level 1 (S1) ---
  31496. Firing prefer*rvt*predict-yes*H0
  31497. -->
  31498. Firing rl*prefer*rvt*predict-yes*H0*1*H1*19
  31499. -->
  31500. (S1 ^operator O2219 = 0.4768809184460345)
  31501. Firing rl*prefer*rvt*predict-yes*H0*1
  31502. -->
  31503. (S1 ^operator O2219 = 0.5231196572160367)
  31504. Firing prefer*rvt*predict-yes*H0*1*H1
  31505. -->
  31506. Firing prefer*rvt*predict-no*H0
  31507. -->
  31508. Firing rl*prefer*rvt*predict-no*H0*2*H1*7
  31509. -->
  31510. (S1 ^operator O2220 = 0.1700769046561409)
  31511. Firing rl*prefer*rvt*predict-no*H0*2
  31512. -->
  31513. (S1 ^operator O2220 = 0.2550134037413971)
  31514. Firing prefer*rvt*predict-no*H0*2*H1
  31515. -->
  31516. inner elaboration loop at bottom goal.
  31517. Retracting rl*prefer*rvt*predict-no*H0*2
  31518. -->
  31519. (S1 ^operator O2218 = 0.2550134037413971)
  31520. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  31521. -->
  31522. (S1 ^operator O2218 = 0.1700769046561409)
  31523. Retracting rl*prefer*rvt*predict-yes*H0*1
  31524. -->
  31525. (S1 ^operator O2217 = 0.5231196572160367)
  31526. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  31527. -->
  31528. (S1 ^operator O2217 = 0.4768809184460345)
  31529. --- END Proposal Phase ---
  31530. --- Decision Phase ---
  31531. RL update rl*prefer*rvt*predict-no*H0*4 0.814714 -0.687737 0.126977 -> 0.814714 -0.687737 0.126977(R,m,v=1,0.955882,0.042379)
  31532. RL update rl*prefer*rvt*predict-no*H0*4*H1*17 0.185286 0.687737 0.873023 -> 0.185286 0.687737 0.873023(R,m,v=1,1,0)
  31533. =>WM: (15597: S1 ^operator O2219)
  31534. 1110: O: O2219 (predict-yes)
  31535. --- END Decision Phase ---
  31536. --- Application Phase ---
  31537. --- Firing Productions (PE) For State At Depth 1 ---
  31538. --- Inner Elaboration Phase, active level 1 (S1) ---
  31539. Firing apply*operator
  31540. -->
  31541. (I3 ^predict-yes N1110 + :O )
  31542. Firing apply*operator*complete
  31543. -->
  31544. (I3 ^predict-no N1109 - :O )
  31545. inner elaboration loop at bottom goal.
  31546. --- Change Working Memory (PE) ---
  31547. =>WM: (15598: I3 ^predict-yes N1110)
  31548. <=WM: (15585: N1109 ^status complete)
  31549. <=WM: (15584: I3 ^predict-no N1109)
  31550. --- Firing Productions (IE) For State At Depth 1 ---
  31551. --- Inner Elaboration Phase, active level 1 (S1) ---
  31552. Firing monitor*world
  31553. -->
  31554. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31555. --- Change Working Memory (IE) ---
  31556. --- END Application Phase ---
  31557. --- Output Phase ---
  31558. ENV: Agent did: predict-yes for direction L in state State-B
  31559. In State-B moving L
  31560. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  31561. predict error 0
  31562. dir: dir isU
  31563. --- END Output Phase ---
  31564. --- Input Phase ---
  31565. =>WM: (15602: I2 ^dir U)
  31566. =>WM: (15601: I2 ^reward 1)
  31567. =>WM: (15600: I2 ^see 1)
  31568. =>WM: (15599: N1110 ^status complete)
  31569. <=WM: (15588: I2 ^dir L)
  31570. <=WM: (15587: I2 ^reward 1)
  31571. <=WM: (15586: I2 ^see 0)
  31572. =>WM: (15603: I2 ^level-1 L1-root)
  31573. <=WM: (15589: I2 ^level-1 R0-root)
  31574. --- END Input Phase ---
  31575. --- Proposal Phase ---
  31576. --- Inner Elaboration Phase, active level 1 (S1) ---
  31577. Firing elaborate*copy-see-to-output-link
  31578. -->
  31579. (I3 ^see 1 +)
  31580. Firing elaborate*reward*based*on*reward
  31581. -->
  31582. (R1114 ^value 1 +)
  31583. (R1 ^reward R1114 +)
  31584. Firing propose*predict-yes
  31585. -->
  31586. (O2221 ^name predict-yes +)
  31587. (S1 ^operator O2221 +)
  31588. Firing propose*predict-no
  31589. -->
  31590. (O2222 ^name predict-no +)
  31591. (S1 ^operator O2222 +)
  31592. Firing rl*prefer*rvt*predict-no*H0*6
  31593. -->
  31594. (S1 ^operator O2220 = 0.9999999999999999)
  31595. Firing rl*prefer*rvt*predict-yes*H0*5
  31596. -->
  31597. (S1 ^operator O2219 = 0.)
  31598. Firing prefer*rvt*predict-yes*H0
  31599. -->
  31600. Firing prefer*rvt*predict-no*H0
  31601. -->
  31602. Firing elaborate*copy-dir-to-output-link
  31603. -->
  31604. (I3 ^dir U +)
  31605. inner elaboration loop at bottom goal.
  31606. Retracting elaborate*copy-see-to-output-link
  31607. -->
  31608. (I3 ^see 0 +)
  31609. Retracting propose*predict-no
  31610. -->
  31611. (O2220 ^name predict-no +)
  31612. (S1 ^operator O2220 +)
  31613. Retracting propose*predict-yes
  31614. -->
  31615. (O2219 ^name predict-yes +)
  31616. (S1 ^operator O2219 +)
  31617. Retracting elaborate*reward*based*on*reward
  31618. -->
  31619. (R1113 ^value 1 +)
  31620. (R1 ^reward R1113 +)
  31621. Retracting elaborate*copy-dir-to-output-link
  31622. -->
  31623. (I3 ^dir L +)
  31624. Retracting rl*prefer*rvt*predict-no*H0*2
  31625. -->
  31626. (S1 ^operator O2220 = 0.2550134037413971)
  31627. Retracting rl*prefer*rvt*predict-no*H0*2*H1*7
  31628. -->
  31629. (S1 ^operator O2220 = 0.1700769046561409)
  31630. Retracting rl*prefer*rvt*predict-yes*H0*1
  31631. -->
  31632. (S1 ^operator O2219 = 0.5231196572160367)
  31633. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*19
  31634. -->
  31635. (S1 ^operator O2219 = 0.4768809184460345)
  31636. =>WM: (15611: S1 ^operator O2222 +)
  31637. =>WM: (15610: S1 ^operator O2221 +)
  31638. =>WM: (15609: I3 ^dir U)
  31639. =>WM: (15608: O2222 ^name predict-no)
  31640. =>WM: (15607: O2221 ^name predict-yes)
  31641. =>WM: (15606: R1114 ^value 1)
  31642. =>WM: (15605: R1 ^reward R1114)
  31643. =>WM: (15604: I3 ^see 1)
  31644. <=WM: (15595: S1 ^operator O2219 +)
  31645. <=WM: (15597: S1 ^operator O2219)
  31646. <=WM: (15596: S1 ^operator O2220 +)
  31647. <=WM: (15594: I3 ^dir L)
  31648. <=WM: (15590: R1 ^reward R1113)
  31649. <=WM: (15537: I3 ^see 0)
  31650. <=WM: (15593: O2220 ^name predict-no)
  31651. <=WM: (15592: O2219 ^name predict-yes)
  31652. <=WM: (15591: R1113 ^value 1)
  31653. --- Inner Elaboration Phase, active level 1 (S1) ---
  31654. Firing prefer*rvt*predict-yes*H0
  31655. -->
  31656. Firing rl*prefer*rvt*predict-yes*H0*5
  31657. -->
  31658. (S1 ^operator O2221 = 0.)
  31659. Firing prefer*rvt*predict-no*H0
  31660. -->
  31661. Firing rl*prefer*rvt*predict-no*H0*6
  31662. -->
  31663. (S1 ^operator O2222 = 0.9999999999999999)
  31664. inner elaboration loop at bottom goal.
  31665. Retracting rl*prefer*rvt*predict-no*H0*6
  31666. -->
  31667. (S1 ^operator O2220 = 0.9999999999999999)
  31668. Retracting rl*prefer*rvt*predict-yes*H0*5
  31669. -->
  31670. (S1 ^operator O2219 = 0.)
  31671. --- END Proposal Phase ---
  31672. --- Decision Phase ---
  31673. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.52312 -> 0.727959 -0.20484 0.52312(R,m,v=1,0.98125,0.0185142)
  31674. RL update rl*prefer*rvt*predict-yes*H0*1*H1*19 0.272041 0.20484 0.476881 -> 0.272041 0.20484 0.476881(R,m,v=1,1,0)
  31675. =>WM: (15612: S1 ^operator O2222)
  31676. 1111: O: O2222 (predict-no)
  31677. --- END Decision Phase ---
  31678. --- Application Phase ---
  31679. --- Firing Productions (PE) For State At Depth 1 ---
  31680. --- Inner Elaboration Phase, active level 1 (S1) ---
  31681. Firing apply*operator
  31682. -->
  31683. (I3 ^predict-no N1111 + :O )
  31684. Firing apply*operator*complete
  31685. -->
  31686. (I3 ^predict-yes N1110 - :O )
  31687. inner elaboration loop at bottom goal.
  31688. --- Change Working Memory (PE) ---
  31689. =>WM: (15613: I3 ^predict-no N1111)
  31690. <=WM: (15599: N1110 ^status complete)
  31691. <=WM: (15598: I3 ^predict-yes N1110)
  31692. --- Firing Productions (IE) For State At Depth 1 ---
  31693. --- Inner Elaboration Phase, active level 1 (S1) ---
  31694. Firing monitor*world
  31695. -->
  31696. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31697. --- Change Working Memory (IE) ---
  31698. --- END Application Phase ---
  31699. --- Output Phase ---
  31700. ENV: Agent did: predict-no for direction U in state State-A
  31701. In State-A moving U
  31702. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31703. predict error 0
  31704. dir: dir isU
  31705. --- END Output Phase ---
  31706. /--- Input Phase ---
  31707. =>WM: (15617: I2 ^dir U)
  31708. =>WM: (15616: I2 ^reward 1)
  31709. =>WM: (15615: I2 ^see 0)
  31710. =>WM: (15614: N1111 ^status complete)
  31711. <=WM: (15602: I2 ^dir U)
  31712. <=WM: (15601: I2 ^reward 1)
  31713. <=WM: (15600: I2 ^see 1)
  31714. =>WM: (15618: I2 ^level-1 L1-root)
  31715. <=WM: (15603: I2 ^level-1 L1-root)
  31716. --- END Input Phase ---
  31717. --- Proposal Phase ---
  31718. --- Inner Elaboration Phase, active level 1 (S1) ---
  31719. Firing elaborate*copy-see-to-output-link
  31720. -->
  31721. (I3 ^see 0 +)
  31722. Firing elaborate*reward*based*on*reward
  31723. -->
  31724. (R1115 ^value 1 +)
  31725. (R1 ^reward R1115 +)
  31726. Firing propose*predict-yes
  31727. -->
  31728. (O2223 ^name predict-yes +)
  31729. (S1 ^operator O2223 +)
  31730. Firing propose*predict-no
  31731. -->
  31732. (O2224 ^name predict-no +)
  31733. (S1 ^operator O2224 +)
  31734. Firing rl*prefer*rvt*predict-no*H0*6
  31735. -->
  31736. (S1 ^operator O2222 = 0.9999999999999999)
  31737. Firing rl*prefer*rvt*predict-yes*H0*5
  31738. -->
  31739. (S1 ^operator O2221 = 0.)
  31740. Firing prefer*rvt*predict-yes*H0
  31741. -->
  31742. Firing prefer*rvt*predict-no*H0
  31743. -->
  31744. Firing elaborate*copy-dir-to-output-link
  31745. -->
  31746. (I3 ^dir U +)
  31747. inner elaboration loop at bottom goal.
  31748. Retracting elaborate*copy-see-to-output-link
  31749. -->
  31750. (I3 ^see 1 +)
  31751. Retracting propose*predict-no
  31752. -->
  31753. (O2222 ^name predict-no +)
  31754. (S1 ^operator O2222 +)
  31755. Retracting propose*predict-yes
  31756. -->
  31757. (O2221 ^name predict-yes +)
  31758. (S1 ^operator O2221 +)
  31759. Retracting elaborate*reward*based*on*reward
  31760. -->
  31761. (R1114 ^value 1 +)
  31762. (R1 ^reward R1114 +)
  31763. Retracting elaborate*copy-dir-to-output-link
  31764. -->
  31765. (I3 ^dir U +)
  31766. Retracting rl*prefer*rvt*predict-no*H0*6
  31767. -->
  31768. (S1 ^operator O2222 = 0.9999999999999999)
  31769. Retracting rl*prefer*rvt*predict-yes*H0*5
  31770. -->
  31771. (S1 ^operator O2221 = 0.)
  31772. =>WM: (15625: S1 ^operator O2224 +)
  31773. =>WM: (15624: S1 ^operator O2223 +)
  31774. =>WM: (15623: O2224 ^name predict-no)
  31775. =>WM: (15622: O2223 ^name predict-yes)
  31776. =>WM: (15621: R1115 ^value 1)
  31777. =>WM: (15620: R1 ^reward R1115)
  31778. =>WM: (15619: I3 ^see 0)
  31779. <=WM: (15610: S1 ^operator O2221 +)
  31780. <=WM: (15611: S1 ^operator O2222 +)
  31781. <=WM: (15612: S1 ^operator O2222)
  31782. <=WM: (15605: R1 ^reward R1114)
  31783. <=WM: (15604: I3 ^see 1)
  31784. <=WM: (15608: O2222 ^name predict-no)
  31785. <=WM: (15607: O2221 ^name predict-yes)
  31786. <=WM: (15606: R1114 ^value 1)
  31787. --- Inner Elaboration Phase, active level 1 (S1) ---
  31788. Firing prefer*rvt*predict-yes*H0
  31789. -->
  31790. Firing rl*prefer*rvt*predict-yes*H0*5
  31791. -->
  31792. (S1 ^operator O2223 = 0.)
  31793. Firing prefer*rvt*predict-no*H0
  31794. -->
  31795. Firing rl*prefer*rvt*predict-no*H0*6
  31796. -->
  31797. (S1 ^operator O2224 = 0.9999999999999999)
  31798. inner elaboration loop at bottom goal.
  31799. Retracting rl*prefer*rvt*predict-no*H0*6
  31800. -->
  31801. (S1 ^operator O2222 = 0.9999999999999999)
  31802. Retracting rl*prefer*rvt*predict-yes*H0*5
  31803. -->
  31804. (S1 ^operator O2221 = 0.)
  31805. --- END Proposal Phase ---
  31806. --- Decision Phase ---
  31807. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31808. =>WM: (15626: S1 ^operator O2224)
  31809. 1112: O: O2224 (predict-no)
  31810. --- END Decision Phase ---
  31811. --- Application Phase ---
  31812. --- Firing Productions (PE) For State At Depth 1 ---
  31813. --- Inner Elaboration Phase, active level 1 (S1) ---
  31814. Firing apply*operator
  31815. -->
  31816. (I3 ^predict-no N1112 + :O )
  31817. Firing apply*operator*complete
  31818. -->
  31819. (I3 ^predict-no N1111 - :O )
  31820. inner elaboration loop at bottom goal.
  31821. --- Change Working Memory (PE) ---
  31822. =>WM: (15627: I3 ^predict-no N1112)
  31823. <=WM: (15614: N1111 ^status complete)
  31824. <=WM: (15613: I3 ^predict-no N1111)
  31825. --- Firing Productions (IE) For State At Depth 1 ---
  31826. --- Inner Elaboration Phase, active level 1 (S1) ---
  31827. Firing monitor*world
  31828. -->
  31829. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31830. --- Change Working Memory (IE) ---
  31831. --- END Application Phase ---
  31832. --- Output Phase ---
  31833. ENV: Agent did: predict-no for direction U in state State-A
  31834. In State-A moving U
  31835. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31836. predict error 0
  31837. dir: dir isL
  31838. --- END Output Phase ---
  31839. |\--- Input Phase ---
  31840. =>WM: (15631: I2 ^dir L)
  31841. =>WM: (15630: I2 ^reward 1)
  31842. =>WM: (15629: I2 ^see 0)
  31843. =>WM: (15628: N1112 ^status complete)
  31844. <=WM: (15617: I2 ^dir U)
  31845. <=WM: (15616: I2 ^reward 1)
  31846. <=WM: (15615: I2 ^see 0)
  31847. =>WM: (15632: I2 ^level-1 L1-root)
  31848. <=WM: (15618: I2 ^level-1 L1-root)
  31849. --- END Input Phase ---
  31850. --- Proposal Phase ---
  31851. --- Inner Elaboration Phase, active level 1 (S1) ---
  31852. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  31853. -->
  31854. (S1 ^operator O2223 = 0.1693592933936033)
  31855. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  31856. -->
  31857. (S1 ^operator O2224 = 0.744986560480787)
  31858. Firing prefer*rvt*predict-no*H0*2*H1
  31859. -->
  31860. Firing prefer*rvt*predict-yes*H0*1*H1
  31861. -->
  31862. Firing elaborate*copy-see-to-output-link
  31863. -->
  31864. (I3 ^see 0 +)
  31865. Firing elaborate*reward*based*on*reward
  31866. -->
  31867. (R1116 ^value 1 +)
  31868. (R1 ^reward R1116 +)
  31869. Firing propose*predict-yes
  31870. -->
  31871. (O2225 ^name predict-yes +)
  31872. (S1 ^operator O2225 +)
  31873. Firing propose*predict-no
  31874. -->
  31875. (O2226 ^name predict-no +)
  31876. (S1 ^operator O2226 +)
  31877. Firing rl*prefer*rvt*predict-no*H0*2
  31878. -->
  31879. (S1 ^operator O2224 = 0.2550134037413971)
  31880. Firing rl*prefer*rvt*predict-yes*H0*1
  31881. -->
  31882. (S1 ^operator O2223 = 0.5231195708667261)
  31883. Firing prefer*rvt*predict-yes*H0
  31884. -->
  31885. Firing prefer*rvt*predict-no*H0
  31886. -->
  31887. Firing elaborate*copy-dir-to-output-link
  31888. -->
  31889. (I3 ^dir L +)
  31890. inner elaboration loop at bottom goal.
  31891. Retracting elaborate*copy-see-to-output-link
  31892. -->
  31893. (I3 ^see 0 +)
  31894. Retracting propose*predict-no
  31895. -->
  31896. (O2224 ^name predict-no +)
  31897. (S1 ^operator O2224 +)
  31898. Retracting propose*predict-yes
  31899. -->
  31900. (O2223 ^name predict-yes +)
  31901. (S1 ^operator O2223 +)
  31902. Retracting elaborate*reward*based*on*reward
  31903. -->
  31904. (R1115 ^value 1 +)
  31905. (R1 ^reward R1115 +)
  31906. Retracting elaborate*copy-dir-to-output-link
  31907. -->
  31908. (I3 ^dir U +)
  31909. Retracting rl*prefer*rvt*predict-no*H0*6
  31910. -->
  31911. (S1 ^operator O2224 = 0.9999999999999999)
  31912. Retracting rl*prefer*rvt*predict-yes*H0*5
  31913. -->
  31914. (S1 ^operator O2223 = 0.)
  31915. =>WM: (15639: S1 ^operator O2226 +)
  31916. =>WM: (15638: S1 ^operator O2225 +)
  31917. =>WM: (15637: I3 ^dir L)
  31918. =>WM: (15636: O2226 ^name predict-no)
  31919. =>WM: (15635: O2225 ^name predict-yes)
  31920. =>WM: (15634: R1116 ^value 1)
  31921. =>WM: (15633: R1 ^reward R1116)
  31922. <=WM: (15624: S1 ^operator O2223 +)
  31923. <=WM: (15625: S1 ^operator O2224 +)
  31924. <=WM: (15626: S1 ^operator O2224)
  31925. <=WM: (15609: I3 ^dir U)
  31926. <=WM: (15620: R1 ^reward R1115)
  31927. <=WM: (15623: O2224 ^name predict-no)
  31928. <=WM: (15622: O2223 ^name predict-yes)
  31929. <=WM: (15621: R1115 ^value 1)
  31930. --- Inner Elaboration Phase, active level 1 (S1) ---
  31931. Firing prefer*rvt*predict-yes*H0
  31932. -->
  31933. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  31934. -->
  31935. (S1 ^operator O2225 = 0.1693592933936033)
  31936. Firing rl*prefer*rvt*predict-yes*H0*1
  31937. -->
  31938. (S1 ^operator O2225 = 0.5231195708667261)
  31939. Firing prefer*rvt*predict-yes*H0*1*H1
  31940. -->
  31941. Firing prefer*rvt*predict-no*H0
  31942. -->
  31943. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  31944. -->
  31945. (S1 ^operator O2226 = 0.744986560480787)
  31946. Firing rl*prefer*rvt*predict-no*H0*2
  31947. -->
  31948. (S1 ^operator O2226 = 0.2550134037413971)
  31949. Firing prefer*rvt*predict-no*H0*2*H1
  31950. -->
  31951. inner elaboration loop at bottom goal.
  31952. Retracting rl*prefer*rvt*predict-no*H0*2
  31953. -->
  31954. (S1 ^operator O2224 = 0.2550134037413971)
  31955. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  31956. -->
  31957. (S1 ^operator O2224 = 0.744986560480787)
  31958. Retracting rl*prefer*rvt*predict-yes*H0*1
  31959. -->
  31960. (S1 ^operator O2223 = 0.5231195708667261)
  31961. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  31962. -->
  31963. (S1 ^operator O2223 = 0.1693592933936033)
  31964. --- END Proposal Phase ---
  31965. --- Decision Phase ---
  31966. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31967. =>WM: (15640: S1 ^operator O2226)
  31968. 1113: O: O2226 (predict-no)
  31969. --- END Decision Phase ---
  31970. --- Application Phase ---
  31971. --- Firing Productions (PE) For State At Depth 1 ---
  31972. --- Inner Elaboration Phase, active level 1 (S1) ---
  31973. Firing apply*operator
  31974. -->
  31975. (I3 ^predict-no N1113 + :O )
  31976. Firing apply*operator*complete
  31977. -->
  31978. (I3 ^predict-no N1112 - :O )
  31979. inner elaboration loop at bottom goal.
  31980. --- Change Working Memory (PE) ---
  31981. =>WM: (15641: I3 ^predict-no N1113)
  31982. <=WM: (15628: N1112 ^status complete)
  31983. <=WM: (15627: I3 ^predict-no N1112)
  31984. --- Firing Productions (IE) For State At Depth 1 ---
  31985. --- Inner Elaboration Phase, active level 1 (S1) ---
  31986. Firing monitor*world
  31987. -->
  31988. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31989. --- Change Working Memory (IE) ---
  31990. --- END Application Phase ---
  31991. --- Output Phase ---
  31992. ENV: Agent did: predict-no for direction L in state State-A
  31993. In State-A moving L
  31994. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31995. predict error 0
  31996. dir: dir isR
  31997. --- END Output Phase ---
  31998. -/--- Input Phase ---
  31999. =>WM: (15645: I2 ^dir R)
  32000. =>WM: (15644: I2 ^reward 1)
  32001. =>WM: (15643: I2 ^see 0)
  32002. =>WM: (15642: N1113 ^status complete)
  32003. <=WM: (15631: I2 ^dir L)
  32004. <=WM: (15630: I2 ^reward 1)
  32005. <=WM: (15629: I2 ^see 0)
  32006. =>WM: (15646: I2 ^level-1 L0-root)
  32007. <=WM: (15632: I2 ^level-1 L1-root)
  32008. --- END Input Phase ---
  32009. --- Proposal Phase ---
  32010. --- Inner Elaboration Phase, active level 1 (S1) ---
  32011. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  32012. -->
  32013. (S1 ^operator O2225 = 0.6170628283849691)
  32014. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  32015. -->
  32016. (S1 ^operator O2226 = 0.4910065094545203)
  32017. Firing prefer*rvt*predict-no*H0*4*H1
  32018. -->
  32019. Firing prefer*rvt*predict-yes*H0*3*H1
  32020. -->
  32021. Firing elaborate*copy-see-to-output-link
  32022. -->
  32023. (I3 ^see 0 +)
  32024. Firing elaborate*reward*based*on*reward
  32025. -->
  32026. (R1117 ^value 1 +)
  32027. (R1 ^reward R1117 +)
  32028. Firing propose*predict-yes
  32029. -->
  32030. (O2227 ^name predict-yes +)
  32031. (S1 ^operator O2227 +)
  32032. Firing propose*predict-no
  32033. -->
  32034. (O2228 ^name predict-no +)
  32035. (S1 ^operator O2228 +)
  32036. Firing rl*prefer*rvt*predict-no*H0*4
  32037. -->
  32038. (S1 ^operator O2226 = 0.1269768314160579)
  32039. Firing rl*prefer*rvt*predict-yes*H0*3
  32040. -->
  32041. (S1 ^operator O2225 = 0.3829404418135529)
  32042. Firing prefer*rvt*predict-yes*H0
  32043. -->
  32044. Firing prefer*rvt*predict-no*H0
  32045. -->
  32046. Firing elaborate*copy-dir-to-output-link
  32047. -->
  32048. (I3 ^dir R +)
  32049. inner elaboration loop at bottom goal.
  32050. Retracting elaborate*copy-see-to-output-link
  32051. -->
  32052. (I3 ^see 0 +)
  32053. Retracting propose*predict-no
  32054. -->
  32055. (O2226 ^name predict-no +)
  32056. (S1 ^operator O2226 +)
  32057. Retracting propose*predict-yes
  32058. -->
  32059. (O2225 ^name predict-yes +)
  32060. (S1 ^operator O2225 +)
  32061. Retracting elaborate*reward*based*on*reward
  32062. -->
  32063. (R1116 ^value 1 +)
  32064. (R1 ^reward R1116 +)
  32065. Retracting elaborate*copy-dir-to-output-link
  32066. -->
  32067. (I3 ^dir L +)
  32068. Retracting rl*prefer*rvt*predict-no*H0*2
  32069. -->
  32070. (S1 ^operator O2226 = 0.2550134037413971)
  32071. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  32072. -->
  32073. (S1 ^operator O2226 = 0.744986560480787)
  32074. Retracting rl*prefer*rvt*predict-yes*H0*1
  32075. -->
  32076. (S1 ^operator O2225 = 0.5231195708667261)
  32077. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  32078. -->
  32079. (S1 ^operator O2225 = 0.1693592933936033)
  32080. =>WM: (15653: S1 ^operator O2228 +)
  32081. =>WM: (15652: S1 ^operator O2227 +)
  32082. =>WM: (15651: I3 ^dir R)
  32083. =>WM: (15650: O2228 ^name predict-no)
  32084. =>WM: (15649: O2227 ^name predict-yes)
  32085. =>WM: (15648: R1117 ^value 1)
  32086. =>WM: (15647: R1 ^reward R1117)
  32087. <=WM: (15638: S1 ^operator O2225 +)
  32088. <=WM: (15639: S1 ^operator O2226 +)
  32089. <=WM: (15640: S1 ^operator O2226)
  32090. <=WM: (15637: I3 ^dir L)
  32091. <=WM: (15633: R1 ^reward R1116)
  32092. <=WM: (15636: O2226 ^name predict-no)
  32093. <=WM: (15635: O2225 ^name predict-yes)
  32094. <=WM: (15634: R1116 ^value 1)
  32095. --- Inner Elaboration Phase, active level 1 (S1) ---
  32096. Firing prefer*rvt*predict-yes*H0
  32097. -->
  32098. Firing rl*prefer*rvt*predict-yes*H0*3
  32099. -->
  32100. (S1 ^operator O2227 = 0.3829404418135529)
  32101. Firing prefer*rvt*predict-yes*H0*3*H1
  32102. -->
  32103. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  32104. -->
  32105. (S1 ^operator O2227 = 0.6170628283849691)
  32106. Firing prefer*rvt*predict-no*H0
  32107. -->
  32108. Firing rl*prefer*rvt*predict-no*H0*4
  32109. -->
  32110. (S1 ^operator O2228 = 0.1269768314160579)
  32111. Firing prefer*rvt*predict-no*H0*4*H1
  32112. -->
  32113. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  32114. -->
  32115. (S1 ^operator O2228 = 0.4910065094545203)
  32116. inner elaboration loop at bottom goal.
  32117. Retracting rl*prefer*rvt*predict-no*H0*4
  32118. -->
  32119. (S1 ^operator O2226 = 0.1269768314160579)
  32120. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  32121. -->
  32122. (S1 ^operator O2226 = 0.4910065094545203)
  32123. Retracting rl*prefer*rvt*predict-yes*H0*3
  32124. -->
  32125. (S1 ^operator O2225 = 0.3829404418135529)
  32126. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  32127. -->
  32128. (S1 ^operator O2225 = 0.6170628283849691)
  32129. --- END Proposal Phase ---
  32130. --- Decision Phase ---
  32131. RL update rl*prefer*rvt*predict-no*H0*2 0.631495 -0.376482 0.255013 -> 0.631495 -0.376482 0.255013(R,m,v=1,0.92381,0.0707223)
  32132. RL update rl*prefer*rvt*predict-no*H0*2*H1*11 0.368505 0.376482 0.744987 -> 0.368505 0.376482 0.744987(R,m,v=1,1,0)
  32133. =>WM: (15654: S1 ^operator O2227)
  32134. 1114: O: O2227 (predict-yes)
  32135. --- END Decision Phase ---
  32136. --- Application Phase ---
  32137. --- Firing Productions (PE) For State At Depth 1 ---
  32138. --- Inner Elaboration Phase, active level 1 (S1) ---
  32139. Firing apply*operator
  32140. -->
  32141. (I3 ^predict-yes N1114 + :O )
  32142. Firing apply*operator*complete
  32143. -->
  32144. (I3 ^predict-no N1113 - :O )
  32145. inner elaboration loop at bottom goal.
  32146. --- Change Working Memory (PE) ---
  32147. =>WM: (15655: I3 ^predict-yes N1114)
  32148. <=WM: (15642: N1113 ^status complete)
  32149. <=WM: (15641: I3 ^predict-no N1113)
  32150. --- Firing Productions (IE) For State At Depth 1 ---
  32151. --- Inner Elaboration Phase, active level 1 (S1) ---
  32152. Firing monitor*world
  32153. -->
  32154. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32155. --- Change Working Memory (IE) ---
  32156. --- END Application Phase ---
  32157. --- Output Phase ---
  32158. ENV: Agent did: predict-yes for direction R in state State-A
  32159. In State-A moving R
  32160. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  32161. predict error 0
  32162. dir: dir isU
  32163. --- END Output Phase ---
  32164. |\--- Input Phase ---
  32165. =>WM: (15659: I2 ^dir U)
  32166. =>WM: (15658: I2 ^reward 1)
  32167. =>WM: (15657: I2 ^see 1)
  32168. =>WM: (15656: N1114 ^status complete)
  32169. <=WM: (15645: I2 ^dir R)
  32170. <=WM: (15644: I2 ^reward 1)
  32171. <=WM: (15643: I2 ^see 0)
  32172. =>WM: (15660: I2 ^level-1 R1-root)
  32173. <=WM: (15646: I2 ^level-1 L0-root)
  32174. --- END Input Phase ---
  32175. --- Proposal Phase ---
  32176. --- Inner Elaboration Phase, active level 1 (S1) ---
  32177. Firing elaborate*copy-see-to-output-link
  32178. -->
  32179. (I3 ^see 1 +)
  32180. Firing elaborate*reward*based*on*reward
  32181. -->
  32182. (R1118 ^value 1 +)
  32183. (R1 ^reward R1118 +)
  32184. Firing propose*predict-yes
  32185. -->
  32186. (O2229 ^name predict-yes +)
  32187. (S1 ^operator O2229 +)
  32188. Firing propose*predict-no
  32189. -->
  32190. (O2230 ^name predict-no +)
  32191. (S1 ^operator O2230 +)
  32192. Firing rl*prefer*rvt*predict-no*H0*6
  32193. -->
  32194. (S1 ^operator O2228 = 0.9999999999999999)
  32195. Firing rl*prefer*rvt*predict-yes*H0*5
  32196. -->
  32197. (S1 ^operator O2227 = 0.)
  32198. Firing prefer*rvt*predict-yes*H0
  32199. -->
  32200. Firing prefer*rvt*predict-no*H0
  32201. -->
  32202. Firing elaborate*copy-dir-to-output-link
  32203. -->
  32204. (I3 ^dir U +)
  32205. inner elaboration loop at bottom goal.
  32206. Retracting elaborate*copy-see-to-output-link
  32207. -->
  32208. (I3 ^see 0 +)
  32209. Retracting propose*predict-no
  32210. -->
  32211. (O2228 ^name predict-no +)
  32212. (S1 ^operator O2228 +)
  32213. Retracting propose*predict-yes
  32214. -->
  32215. (O2227 ^name predict-yes +)
  32216. (S1 ^operator O2227 +)
  32217. Retracting elaborate*reward*based*on*reward
  32218. -->
  32219. (R1117 ^value 1 +)
  32220. (R1 ^reward R1117 +)
  32221. Retracting elaborate*copy-dir-to-output-link
  32222. -->
  32223. (I3 ^dir R +)
  32224. Retracting rl*prefer*rvt*predict-no*H0*4*H1*13
  32225. -->
  32226. (S1 ^operator O2228 = 0.4910065094545203)
  32227. Retracting rl*prefer*rvt*predict-no*H0*4
  32228. -->
  32229. (S1 ^operator O2228 = 0.1269768314160579)
  32230. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*14
  32231. -->
  32232. (S1 ^operator O2227 = 0.6170628283849691)
  32233. Retracting rl*prefer*rvt*predict-yes*H0*3
  32234. -->
  32235. (S1 ^operator O2227 = 0.3829404418135529)
  32236. =>WM: (15668: S1 ^operator O2230 +)
  32237. =>WM: (15667: S1 ^operator O2229 +)
  32238. =>WM: (15666: I3 ^dir U)
  32239. =>WM: (15665: O2230 ^name predict-no)
  32240. =>WM: (15664: O2229 ^name predict-yes)
  32241. =>WM: (15663: R1118 ^value 1)
  32242. =>WM: (15662: R1 ^reward R1118)
  32243. =>WM: (15661: I3 ^see 1)
  32244. <=WM: (15652: S1 ^operator O2227 +)
  32245. <=WM: (15654: S1 ^operator O2227)
  32246. <=WM: (15653: S1 ^operator O2228 +)
  32247. <=WM: (15651: I3 ^dir R)
  32248. <=WM: (15647: R1 ^reward R1117)
  32249. <=WM: (15619: I3 ^see 0)
  32250. <=WM: (15650: O2228 ^name predict-no)
  32251. <=WM: (15649: O2227 ^name predict-yes)
  32252. <=WM: (15648: R1117 ^value 1)
  32253. --- Inner Elaboration Phase, active level 1 (S1) ---
  32254. Firing prefer*rvt*predict-yes*H0
  32255. -->
  32256. Firing rl*prefer*rvt*predict-yes*H0*5
  32257. -->
  32258. (S1 ^operator O2229 = 0.)
  32259. Firing prefer*rvt*predict-no*H0
  32260. -->
  32261. Firing rl*prefer*rvt*predict-no*H0*6
  32262. -->
  32263. (S1 ^operator O2230 = 0.9999999999999999)
  32264. inner elaboration loop at bottom goal.
  32265. Retracting rl*prefer*rvt*predict-no*H0*6
  32266. -->
  32267. (S1 ^operator O2228 = 0.9999999999999999)
  32268. Retracting rl*prefer*rvt*predict-yes*H0*5
  32269. -->
  32270. (S1 ^operator O2227 = 0.)
  32271. --- END Proposal Phase ---
  32272. --- Decision Phase ---
  32273. RL update rl*prefer*rvt*predict-yes*H0*3 0.673134 -0.290193 0.38294 -> 0.673133 -0.290193 0.38294(R,m,v=1,0.964912,0.0340557)
  32274. RL update rl*prefer*rvt*predict-yes*H0*3*H1*14 0.326869 0.290193 0.617063 -> 0.326869 0.290193 0.617062(R,m,v=1,1,0)
  32275. =>WM: (15669: S1 ^operator O2230)
  32276. 1115: O: O2230 (predict-no)
  32277. --- END Decision Phase ---
  32278. --- Application Phase ---
  32279. --- Firing Productions (PE) For State At Depth 1 ---
  32280. --- Inner Elaboration Phase, active level 1 (S1) ---
  32281. Firing apply*operator
  32282. -->
  32283. (I3 ^predict-no N1115 + :O )
  32284. Firing apply*operator*complete
  32285. -->
  32286. (I3 ^predict-yes N1114 - :O )
  32287. inner elaboration loop at bottom goal.
  32288. --- Change Working Memory (PE) ---
  32289. =>WM: (15670: I3 ^predict-no N1115)
  32290. <=WM: (15656: N1114 ^status complete)
  32291. <=WM: (15655: I3 ^predict-yes N1114)
  32292. --- Firing Productions (IE) For State At Depth 1 ---
  32293. --- Inner Elaboration Phase, active level 1 (S1) ---
  32294. Firing monitor*world
  32295. -->
  32296. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32297. --- Change Working Memory (IE) ---
  32298. --- END Application Phase ---
  32299. --- Output Phase ---
  32300. ENV: Agent did: predict-no for direction U in state State-B
  32301. In State-B moving U
  32302. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32303. predict error 0
  32304. dir: dir isU
  32305. --- END Output Phase ---
  32306. -/|\sleeping...
  32307. ---- Input Phase ---
  32308. =>WM: (15674: I2 ^dir U)
  32309. =>WM: (15673: I2 ^reward 1)
  32310. =>WM: (15672: I2 ^see 0)
  32311. =>WM: (15671: N1115 ^status complete)
  32312. <=WM: (15659: I2 ^dir U)
  32313. <=WM: (15658: I2 ^reward 1)
  32314. <=WM: (15657: I2 ^see 1)
  32315. =>WM: (15675: I2 ^level-1 R1-root)
  32316. <=WM: (15660: I2 ^level-1 R1-root)
  32317. --- END Input Phase ---
  32318. --- Proposal Phase ---
  32319. --- Inner Elaboration Phase, active level 1 (S1) ---
  32320. Firing elaborate*copy-see-to-output-link
  32321. -->
  32322. (I3 ^see 0 +)
  32323. Firing elaborate*reward*based*on*reward
  32324. -->
  32325. (R1119 ^value 1 +)
  32326. (R1 ^reward R1119 +)
  32327. Firing propose*predict-yes
  32328. -->
  32329. (O2231 ^name predict-yes +)
  32330. (S1 ^operator O2231 +)
  32331. Firing propose*predict-no
  32332. -->
  32333. (O2232 ^name predict-no +)
  32334. (S1 ^operator O2232 +)
  32335. Firing rl*prefer*rvt*predict-no*H0*6
  32336. -->
  32337. (S1 ^operator O2230 = 0.9999999999999999)
  32338. Firing rl*prefer*rvt*predict-yes*H0*5
  32339. -->
  32340. (S1 ^operator O2229 = 0.)
  32341. Firing prefer*rvt*predict-yes*H0
  32342. -->
  32343. Firing prefer*rvt*predict-no*H0
  32344. -->
  32345. Firing elaborate*copy-dir-to-output-link
  32346. -->
  32347. (I3 ^dir U +)
  32348. inner elaboration loop at bottom goal.
  32349. Retracting elaborate*copy-see-to-output-link
  32350. -->
  32351. (I3 ^see 1 +)
  32352. Retracting propose*predict-no
  32353. -->
  32354. (O2230 ^name predict-no +)
  32355. (S1 ^operator O2230 +)
  32356. Retracting propose*predict-yes
  32357. -->
  32358. (O2229 ^name predict-yes +)
  32359. (S1 ^operator O2229 +)
  32360. Retracting elaborate*reward*based*on*reward
  32361. -->
  32362. (R1118 ^value 1 +)
  32363. (R1 ^reward R1118 +)
  32364. Retracting elaborate*copy-dir-to-output-link
  32365. -->
  32366. (I3 ^dir U +)
  32367. Retracting rl*prefer*rvt*predict-no*H0*6
  32368. -->
  32369. (S1 ^operator O2230 = 0.9999999999999999)
  32370. Retracting rl*prefer*rvt*predict-yes*H0*5
  32371. -->
  32372. (S1 ^operator O2229 = 0.)
  32373. =>WM: (15682: S1 ^operator O2232 +)
  32374. =>WM: (15681: S1 ^operator O2231 +)
  32375. =>WM: (15680: O2232 ^name predict-no)
  32376. =>WM: (15679: O2231 ^name predict-yes)
  32377. =>WM: (15678: R1119 ^value 1)
  32378. =>WM: (15677: R1 ^reward R1119)
  32379. =>WM: (15676: I3 ^see 0)
  32380. <=WM: (15667: S1 ^operator O2229 +)
  32381. <=WM: (15668: S1 ^operator O2230 +)
  32382. <=WM: (15669: S1 ^operator O2230)
  32383. <=WM: (15662: R1 ^reward R1118)
  32384. <=WM: (15661: I3 ^see 1)
  32385. <=WM: (15665: O2230 ^name predict-no)
  32386. <=WM: (15664: O2229 ^name predict-yes)
  32387. <=WM: (15663: R1118 ^value 1)
  32388. --- Inner Elaboration Phase, active level 1 (S1) ---
  32389. Firing prefer*rvt*predict-yes*H0
  32390. -->
  32391. Firing rl*prefer*rvt*predict-yes*H0*5
  32392. -->
  32393. (S1 ^operator O2231 = 0.)
  32394. Firing prefer*rvt*predict-no*H0
  32395. -->
  32396. Firing rl*prefer*rvt*predict-no*H0*6
  32397. -->
  32398. (S1 ^operator O2232 = 0.9999999999999999)
  32399. inner elaboration loop at bottom goal.
  32400. Retracting rl*prefer*rvt*predict-no*H0*6
  32401. -->
  32402. (S1 ^operator O2230 = 0.9999999999999999)
  32403. Retracting rl*prefer*rvt*predict-yes*H0*5
  32404. -->
  32405. (S1 ^operator O2229 = 0.)
  32406. --- END Proposal Phase ---
  32407. --- Decision Phase ---
  32408. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32409. =>WM: (15683: S1 ^operator O2232)
  32410. 1116: O: O2232 (predict-no)
  32411. --- END Decision Phase ---
  32412. --- Application Phase ---
  32413. --- Firing Productions (PE) For State At Depth 1 ---
  32414. --- Inner Elaboration Phase, active level 1 (S1) ---
  32415. Firing apply*operator
  32416. -->
  32417. (I3 ^predict-no N1116 + :O )
  32418. Firing apply*operator*complete
  32419. -->
  32420. (I3 ^predict-no N1115 - :O )
  32421. inner elaboration loop at bottom goal.
  32422. --- Change Working Memory (PE) ---
  32423. =>WM: (15684: I3 ^predict-no N1116)
  32424. <=WM: (15671: N1115 ^status complete)
  32425. <=WM: (15670: I3 ^predict-no N1115)
  32426. --- Firing Productions (IE) For State At Depth 1 ---
  32427. --- Inner Elaboration Phase, active level 1 (S1) ---
  32428. Firing monitor*world
  32429. -->
  32430. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32431. --- Change Working Memory (IE) ---
  32432. --- END Application Phase ---
  32433. --- Output Phase ---
  32434. ENV: Agent did: predict-no for direction U in state State-B
  32435. In State-B moving U
  32436. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32437. predict error 0
  32438. dir: dir isL
  32439. --- END Output Phase ---
  32440. /|--- Input Phase ---
  32441. =>WM: (15688: I2 ^dir L)
  32442. =>WM: (15687: I2 ^reward 1)
  32443. =>WM: (15686: I2 ^see 0)
  32444. =>WM: (15685: N1116 ^status complete)
  32445. <=WM: (15674: I2 ^dir U)
  32446. <=WM: (15673: I2 ^reward 1)
  32447. <=WM: (15672: I2 ^see 0)
  32448. =>WM: (15689: I2 ^level-1 R1-root)
  32449. <=WM: (15675: I2 ^level-1 R1-root)
  32450. --- END Input Phase ---
  32451. --- Proposal Phase ---
  32452. --- Inner Elaboration Phase, active level 1 (S1) ---
  32453. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  32454. -->
  32455. (S1 ^operator O2231 = 0.4768794453763991)
  32456. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  32457. -->
  32458. (S1 ^operator O2232 = -0.01194930198035649)
  32459. Firing prefer*rvt*predict-no*H0*2*H1
  32460. -->
  32461. Firing prefer*rvt*predict-yes*H0*1*H1
  32462. -->
  32463. Firing elaborate*copy-see-to-output-link
  32464. -->
  32465. (I3 ^see 0 +)
  32466. Firing elaborate*reward*based*on*reward
  32467. -->
  32468. (R1120 ^value 1 +)
  32469. (R1 ^reward R1120 +)
  32470. Firing propose*predict-yes
  32471. -->
  32472. (O2233 ^name predict-yes +)
  32473. (S1 ^operator O2233 +)
  32474. Firing propose*predict-no
  32475. -->
  32476. (O2234 ^name predict-no +)
  32477. (S1 ^operator O2234 +)
  32478. Firing rl*prefer*rvt*predict-no*H0*2
  32479. -->
  32480. (S1 ^operator O2232 = 0.2550134091080695)
  32481. Firing rl*prefer*rvt*predict-yes*H0*1
  32482. -->
  32483. (S1 ^operator O2231 = 0.5231195708667261)
  32484. Firing prefer*rvt*predict-yes*H0
  32485. -->
  32486. Firing prefer*rvt*predict-no*H0
  32487. -->
  32488. Firing elaborate*copy-dir-to-output-link
  32489. -->
  32490. (I3 ^dir L +)
  32491. inner elaboration loop at bottom goal.
  32492. Retracting elaborate*copy-see-to-output-link
  32493. -->
  32494. (I3 ^see 0 +)
  32495. Retracting propose*predict-no
  32496. -->
  32497. (O2232 ^name predict-no +)
  32498. (S1 ^operator O2232 +)
  32499. Retracting propose*predict-yes
  32500. -->
  32501. (O2231 ^name predict-yes +)
  32502. (S1 ^operator O2231 +)
  32503. Retracting elaborate*reward*based*on*reward
  32504. -->
  32505. (R1119 ^value 1 +)
  32506. (R1 ^reward R1119 +)
  32507. Retracting elaborate*copy-dir-to-output-link
  32508. -->
  32509. (I3 ^dir U +)
  32510. Retracting rl*prefer*rvt*predict-no*H0*6
  32511. -->
  32512. (S1 ^operator O2232 = 0.9999999999999999)
  32513. Retracting rl*prefer*rvt*predict-yes*H0*5
  32514. -->
  32515. (S1 ^operator O2231 = 0.)
  32516. =>WM: (15696: S1 ^operator O2234 +)
  32517. =>WM: (15695: S1 ^operator O2233 +)
  32518. =>WM: (15694: I3 ^dir L)
  32519. =>WM: (15693: O2234 ^name predict-no)
  32520. =>WM: (15692: O2233 ^name predict-yes)
  32521. =>WM: (15691: R1120 ^value 1)
  32522. =>WM: (15690: R1 ^reward R1120)
  32523. <=WM: (15681: S1 ^operator O2231 +)
  32524. <=WM: (15682: S1 ^operator O2232 +)
  32525. <=WM: (15683: S1 ^operator O2232)
  32526. <=WM: (15666: I3 ^dir U)
  32527. <=WM: (15677: R1 ^reward R1119)
  32528. <=WM: (15680: O2232 ^name predict-no)
  32529. <=WM: (15679: O2231 ^name predict-yes)
  32530. <=WM: (15678: R1119 ^value 1)
  32531. --- Inner Elaboration Phase, active level 1 (S1) ---
  32532. Firing prefer*rvt*predict-yes*H0
  32533. -->
  32534. Firing rl*prefer*rvt*predict-yes*H0*1*H1*22
  32535. -->
  32536. (S1 ^operator O2233 = 0.4768794453763991)
  32537. Firing rl*prefer*rvt*predict-yes*H0*1
  32538. -->
  32539. (S1 ^operator O2233 = 0.5231195708667261)
  32540. Firing prefer*rvt*predict-yes*H0*1*H1
  32541. -->
  32542. Firing prefer*rvt*predict-no*H0
  32543. -->
  32544. Firing rl*prefer*rvt*predict-no*H0*2*H1*10
  32545. -->
  32546. (S1 ^operator O2234 = -0.01194930198035649)
  32547. Firing rl*prefer*rvt*predict-no*H0*2
  32548. -->
  32549. (S1 ^operator O2234 = 0.2550134091080695)
  32550. Firing prefer*rvt*predict-no*H0*2*H1
  32551. -->
  32552. inner elaboration loop at bottom goal.
  32553. Retracting rl*prefer*rvt*predict-no*H0*2
  32554. -->
  32555. (S1 ^operator O2232 = 0.2550134091080695)
  32556. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  32557. -->
  32558. (S1 ^operator O2232 = -0.01194930198035649)
  32559. Retracting rl*prefer*rvt*predict-yes*H0*1
  32560. -->
  32561. (S1 ^operator O2231 = 0.5231195708667261)
  32562. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  32563. -->
  32564. (S1 ^operator O2231 = 0.4768794453763991)
  32565. --- END Proposal Phase ---
  32566. --- Decision Phase ---
  32567. RL update rl*prefer*rvt*predict-no*H0*6 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32568. =>WM: (15697: S1 ^operator O2233)
  32569. 1117: O: O2233 (predict-yes)
  32570. --- END Decision Phase ---
  32571. --- Application Phase ---
  32572. --- Firing Productions (PE) For State At Depth 1 ---
  32573. --- Inner Elaboration Phase, active level 1 (S1) ---
  32574. Firing apply*operator
  32575. -->
  32576. (I3 ^predict-yes N1117 + :O )
  32577. Firing apply*operator*complete
  32578. -->
  32579. (I3 ^predict-no N1116 - :O )
  32580. inner elaboration loop at bottom goal.
  32581. --- Change Working Memory (PE) ---
  32582. =>WM: (15698: I3 ^predict-yes N1117)
  32583. <=WM: (15685: N1116 ^status complete)
  32584. <=WM: (15684: I3 ^predict-no N1116)
  32585. --- Firing Productions (IE) For State At Depth 1 ---
  32586. --- Inner Elaboration Phase, active level 1 (S1) ---
  32587. Firing monitor*world
  32588. -->
  32589. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32590. --- Change Working Memory (IE) ---
  32591. --- END Application Phase ---
  32592. --- Output Phase ---
  32593. ENV: Agent did: predict-yes for direction L in state State-B
  32594. In State-B moving L
  32595. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  32596. predict error 0
  32597. dir: dir isL
  32598. --- END Output Phase ---
  32599. --- Input Phase ---
  32600. =>WM: (15702: I2 ^dir L)
  32601. =>WM: (15701: I2 ^reward 1)
  32602. =>WM: (15700: I2 ^see 1)
  32603. =>WM: (15699: N1117 ^status complete)
  32604. <=WM: (15688: I2 ^dir L)
  32605. <=WM: (15687: I2 ^reward 1)
  32606. <=WM: (15686: I2 ^see 0)
  32607. =>WM: (15703: I2 ^level-1 L1-root)
  32608. <=WM: (15689: I2 ^level-1 R1-root)
  32609. --- END Input Phase ---
  32610. --- Proposal Phase ---
  32611. --- Inner Elaboration Phase, active level 1 (S1) ---
  32612. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  32613. -->
  32614. (S1 ^operator O2233 = 0.1693592933936033)
  32615. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  32616. -->
  32617. (S1 ^operator O2234 = 0.7449865658474594)
  32618. Firing prefer*rvt*predict-no*H0*2*H1
  32619. -->
  32620. Firing prefer*rvt*predict-yes*H0*1*H1
  32621. -->
  32622. Firing elaborate*copy-see-to-output-link
  32623. -->
  32624. (I3 ^see 1 +)
  32625. Firing elaborate*reward*based*on*reward
  32626. -->
  32627. (R1121 ^value 1 +)
  32628. (R1 ^reward R1121 +)
  32629. Firing propose*predict-yes
  32630. -->
  32631. (O2235 ^name predict-yes +)
  32632. (S1 ^operator O2235 +)
  32633. Firing propose*predict-no
  32634. -->
  32635. (O2236 ^name predict-no +)
  32636. (S1 ^operator O2236 +)
  32637. Firing rl*prefer*rvt*predict-no*H0*2
  32638. -->
  32639. (S1 ^operator O2234 = 0.2550134091080695)
  32640. Firing rl*prefer*rvt*predict-yes*H0*1
  32641. -->
  32642. (S1 ^operator O2233 = 0.5231195708667261)
  32643. Firing prefer*rvt*predict-yes*H0
  32644. -->
  32645. Firing prefer*rvt*predict-no*H0
  32646. -->
  32647. Firing elaborate*copy-dir-to-output-link
  32648. -->
  32649. (I3 ^dir L +)
  32650. inner elaboration loop at bottom goal.
  32651. Retracting elaborate*copy-see-to-output-link
  32652. -->
  32653. (I3 ^see 0 +)
  32654. Retracting propose*predict-no
  32655. -->
  32656. (O2234 ^name predict-no +)
  32657. (S1 ^operator O2234 +)
  32658. Retracting propose*predict-yes
  32659. -->
  32660. (O2233 ^name predict-yes +)
  32661. (S1 ^operator O2233 +)
  32662. Retracting elaborate*reward*based*on*reward
  32663. -->
  32664. (R1120 ^value 1 +)
  32665. (R1 ^reward R1120 +)
  32666. Retracting elaborate*copy-dir-to-output-link
  32667. -->
  32668. (I3 ^dir L +)
  32669. Retracting rl*prefer*rvt*predict-no*H0*2
  32670. -->
  32671. (S1 ^operator O2234 = 0.2550134091080695)
  32672. Retracting rl*prefer*rvt*predict-no*H0*2*H1*10
  32673. -->
  32674. (S1 ^operator O2234 = -0.01194930198035649)
  32675. Retracting rl*prefer*rvt*predict-yes*H0*1
  32676. -->
  32677. (S1 ^operator O2233 = 0.5231195708667261)
  32678. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*22
  32679. -->
  32680. (S1 ^operator O2233 = 0.4768794453763991)
  32681. =>WM: (15710: S1 ^operator O2236 +)
  32682. =>WM: (15709: S1 ^operator O2235 +)
  32683. =>WM: (15708: O2236 ^name predict-no)
  32684. =>WM: (15707: O2235 ^name predict-yes)
  32685. =>WM: (15706: R1121 ^value 1)
  32686. =>WM: (15705: R1 ^reward R1121)
  32687. =>WM: (15704: I3 ^see 1)
  32688. <=WM: (15695: S1 ^operator O2233 +)
  32689. <=WM: (15697: S1 ^operator O2233)
  32690. <=WM: (15696: S1 ^operator O2234 +)
  32691. <=WM: (15690: R1 ^reward R1120)
  32692. <=WM: (15676: I3 ^see 0)
  32693. <=WM: (15693: O2234 ^name predict-no)
  32694. <=WM: (15692: O2233 ^name predict-yes)
  32695. <=WM: (15691: R1120 ^value 1)
  32696. --- Inner Elaboration Phase, active level 1 (S1) ---
  32697. Firing prefer*rvt*predict-yes*H0
  32698. -->
  32699. Firing rl*prefer*rvt*predict-yes*H0*1
  32700. -->
  32701. (S1 ^operator O2235 = 0.5231195708667261)
  32702. Firing prefer*rvt*predict-yes*H0*1*H1
  32703. -->
  32704. Firing rl*prefer*rvt*predict-yes*H0*1*H1*20
  32705. -->
  32706. (S1 ^operator O2235 = 0.1693592933936033)
  32707. Firing prefer*rvt*predict-no*H0
  32708. -->
  32709. Firing rl*prefer*rvt*predict-no*H0*2
  32710. -->
  32711. (S1 ^operator O2236 = 0.2550134091080695)
  32712. Firing prefer*rvt*predict-no*H0*2*H1
  32713. -->
  32714. Firing rl*prefer*rvt*predict-no*H0*2*H1*11
  32715. -->
  32716. (S1 ^operator O2236 = 0.7449865658474594)
  32717. inner elaboration loop at bottom goal.
  32718. Retracting rl*prefer*rvt*predict-no*H0*2
  32719. -->
  32720. (S1 ^operator O2234 = 0.2550134091080695)
  32721. Retracting rl*prefer*rvt*predict-no*H0*2*H1*11
  32722. -->
  32723. (S1 ^operator O2234 = 0.7449865658474594)
  32724. Retracting rl*prefer*rvt*predict-yes*H0*1
  32725. -->
  32726. (S1 ^operator O2233 = 0.5231195708667261)
  32727. Retracting rl*prefer*rvt*predict-yes*H0*1*H1*20
  32728. -->
  32729. (S1 ^operator O2233 = 0.1693592933936033)
  32730. --- END Proposal Phase ---
  32731. --- Decision Phase ---
  32732. RL update rl*prefer*rvt*predict-yes*H0*1 0.727959 -0.20484 0.52312 -> 0.72796 -0.20484 0.52312(R,m,v=1,0.981366,0.0184006)
  32733. RL update rl*prefer*rvt*predict-yes*H0*1*H1*22 0.272039 0.20484 0.476879 -> 0.27204 0.20484 0.47688(R,m,v=1,1,0)
  32734. =>WM: (15711: S1 ^operator O2236)
  32735. 1118: O: O2236 (predict-no)
  32736. --- END Decision Phase ---
  32737. --- Application Phase ---
  32738. --- Firing Productions (PE) For State At Depth 1 ---
  32739. --- Inner Elaboration Phase, active level 1 (S1) ---
  32740. Firing apply*operator
  32741. -->
  32742. (I3 ^predict-no N1118 + :O )
  32743. Firing apply*operator*complete
  32744. -->
  32745. (I3 ^predict-yes N1117 - :O )
  32746. inner elaboration loop at bottom goal.
  32747. --- Change Working Memory (PE) ---
  32748. =>WM: (15712: I3 ^predict-no N1118)
  32749. <=WM: (15699: N1117 ^status complete)
  32750. <=WM: (15698: I3 ^predict-yes N1117)
  32751. --- Firing Productions (IE) For State At Depth 1 ---
  32752. --- Inner Elaboration Phase, active level 1 (S1) ---
  32753. Firing monitor*world
  32754. -->
  32755. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32756. --- Change Working Memory (IE) ---
  32757. --- END Application Phase ---
  32758. --- Output Phase ---
  32759. ENV: Agent did: predict-no for direction L in state State-A
  32760. In State-A moving L
  32761. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  32762. predict error 0
  32763. dir: dir isR
  32764. --- END Output Phase ---
  32765. --- Input Phase ---
  32766. =>WM: (15716: I2 ^dir R)
  32767. =>WM: (15715: I2 ^reward 1)
  32768. =>WM: (15714: I2 ^see 0)
  32769. =>WM: (15713: N1118 ^status complete)
  32770. <=WM: (15702: I2 ^dir L)
  32771. <=WM: (15701: I2 ^reward 1)
  32772. <=WM: (15700: I2 ^see 1)
  32773. =>WM: (15717: I2 ^level-1 L0-root)
  32774. <=WM: (15703: I2 ^level-1 L1-root)
  32775. --- END Input Phase ---
  32776. --- Proposal Phase ---
  32777. --- Inner Elaboration Phase, active level 1 (S1) ---
  32778. Firing rl*prefer*rvt*predict-yes*H0*3*H1*14
  32779. -->
  32780. (S1 ^operator O2235 = 0.6170623378551907)
  32781. Firing rl*prefer*rvt*predict-no*H0*4*H1*13
  32782. -->
  32783. (S1 ^operator O2236 = 0.4910065094545203)
  32784. Firing prefer*rvt*predict-no*H0*4*H1
  32785. -->
  32786. Firing prefer*rvt*predict-yes*H0*3*H1
  32787. -->
  32788. Firing elaborate*copy-see-to-output-link
  32789. -->
  32790. (I3 ^see 0 +)
  32791. Firing elaborate*reward*based*on*reward
  32792. -->
  32793. (R1122 ^value 1 +)
  32794. (R1 ^reward R1122 +)
  32795. Firing propose*predict-yes
  32796. -->
  32797. (O2237 ^name predict-yes +)
  32798. (S1 ^operator O2237 +)
  32799. Firing propose*predict-no
  32800. -->
  32801. (O2238 ^name predict-no +)
  32802. (S1 ^operator O2238 +)
  32803. Firing rl*prefer*rvt*predict-no*H0*4
  32804. -->
  32805. (S1 ^operator O2236 = 0.1269768314160579)
  32806. Firing rl*prefer*rvt*predict-yes*H0*3
  32807. -->
  32808. (S1 ^operator O2235 =