/flipv2/20121112-101138-2.5K-ReLST-Evan/stdout-flip-2.5K_0.txt

https://bitbucket.org/evan13579b/soar-ziggurat · Plain Text · 35130 lines · 32966 code · 2164 blank · 0 comment · 0 complexity · ced40955f45f159289b4215de1fd8824 MD5 · raw file

  1. Seeding... 0
  2. dir: dir isU
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 0 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_0.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\sleeping...
  20. -/|\-/|sleeping...
  21. \1: O: O1 (predict-yes)
  22. I see 0 and I'm going to do: predict-yes
  23. ENV: Agent did: predict-yes for direction U in state State-A
  24. In State-A moving U
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  26. predict error 1
  27. dir: dir isU
  28. rule alias: '*'
  29. rule alias: '*'
  30. -/|\-/|\2: O: O4 (predict-no)
  31. I see 0 and I'm going to do: predict-no
  32. ENV: Agent did: predict-no for direction U in state State-A
  33. In State-A moving U
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  35. predict error 0
  36. dir: dir isR
  37. -/|3: O: O5 (predict-yes)
  38. I see 1 and I'm going to do: predict-yes
  39. ENV: Agent did: predict-yes for direction R in state State-A
  40. In State-A moving R
  41. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  42. predict error 0
  43. dir: dir isL
  44. \-/4: O: O7 (predict-yes)
  45. I see 1 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction L in state State-B
  47. In State-B moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  49. predict error 0
  50. dir: dir isR
  51. |\-5: O: O9 (predict-yes)
  52. I see 1 and I'm going to do: predict-yes
  53. ENV: Agent did: predict-yes for direction R in state State-A
  54. In State-A moving R
  55. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  56. predict error 0
  57. dir: dir isR
  58. /|\6: O: O11 (predict-yes)
  59. I see 1 and I'm going to do: predict-yes
  60. ENV: Agent did: predict-yes for direction R in state State-B
  61. In State-B moving R
  62. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  63. predict error 1
  64. dir: dir isU
  65. -/|7: O: O14 (predict-no)
  66. I see 0 and I'm going to do: predict-no
  67. ENV: Agent did: predict-no for direction U in state State-B
  68. In State-B moving U
  69. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  70. predict error 0
  71. dir: dir isL
  72. \-/8: O: O15 (predict-yes)
  73. I see 1 and I'm going to do: predict-yes
  74. ENV: Agent did: predict-yes for direction L in state State-B
  75. In State-B moving L
  76. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  77. predict error 0
  78. dir: dir isR
  79. |\9: O: O17 (predict-yes)
  80. I see 1 and I'm going to do: predict-yes
  81. ENV: Agent did: predict-yes for direction R in state State-A
  82. In State-A moving R
  83. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  84. predict error 0
  85. dir: dir isR
  86. -10: O: O19 (predict-yes)
  87. I see 1 and I'm going to do: predict-yes
  88. ENV: Agent did: predict-yes for direction R in state State-B
  89. In State-B moving R
  90. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  91. predict error 1
  92. dir: dir isU
  93. /|\11: O: O22 (predict-no)
  94. I see 0 and I'm going to do: predict-no
  95. ENV: Agent did: predict-no for direction U in state State-B
  96. In State-B moving U
  97. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  98. predict error 0
  99. dir: dir isR
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. -12: O: O24 (predict-no)
  105. I see 1 and I'm going to do: predict-no
  106. ENV: Agent did: predict-no for direction R in state State-B
  107. In State-B moving R
  108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  109. predict error 0
  110. dir: dir isL
  111. /|\13: O: O26 (predict-no)
  112. I see 1 and I'm going to do: predict-no
  113. ENV: Agent did: predict-no for direction L in state State-B
  114. In State-B moving L
  115. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  116. predict error 1
  117. dir: dir isU
  118. -/|14: O: O28 (predict-no)
  119. I see 0 and I'm going to do: predict-no
  120. ENV: Agent did: predict-no for direction U in state State-A
  121. In State-A moving U
  122. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  123. predict error 0
  124. dir: dir isR
  125. \-/15: O: O30 (predict-no)
  126. I see 1 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction R in state State-A
  128. In State-A moving R
  129. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  130. predict error 1
  131. dir: dir isL
  132. |\-16: O: O31 (predict-yes)
  133. I see 0 and I'm going to do: predict-yes
  134. ENV: Agent did: predict-yes for direction L in state State-B
  135. In State-B moving L
  136. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  137. predict error 0
  138. dir: dir isU
  139. /|\17: O: O34 (predict-no)
  140. I see 1 and I'm going to do: predict-no
  141. ENV: Agent did: predict-no for direction U in state State-A
  142. In State-A moving U
  143. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  144. predict error 0
  145. dir: dir isU
  146. -/|18: O: O36 (predict-no)
  147. I see 1 and I'm going to do: predict-no
  148. ENV: Agent did: predict-no for direction U in state State-A
  149. In State-A moving U
  150. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  151. predict error 0
  152. dir: dir isU
  153. \-/19: O: O38 (predict-no)
  154. I see 1 and I'm going to do: predict-no
  155. ENV: Agent did: predict-no for direction U in state State-A
  156. In State-A moving U
  157. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  158. predict error 0
  159. dir: dir isU
  160. |\-20: O: O40 (predict-no)
  161. I see 1 and I'm going to do: predict-no
  162. ENV: Agent did: predict-no for direction U in state State-A
  163. In State-A moving U
  164. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  165. predict error 0
  166. dir: dir isL
  167. /|\21: O: O41 (predict-yes)
  168. I see 1 and I'm going to do: predict-yes
  169. ENV: Agent did: predict-yes for direction L in state State-A
  170. In State-A moving L
  171. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  172. predict error 1
  173. dir: dir isU
  174. -22: O: O44 (predict-no)
  175. I see 0 and I'm going to do: predict-no
  176. ENV: Agent did: predict-no for direction U in state State-A
  177. In State-A moving U
  178. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  179. predict error 0
  180. dir: dir isU
  181. /|\23: O: O46 (predict-no)
  182. I see 1 and I'm going to do: predict-no
  183. ENV: Agent did: predict-no for direction U in state State-A
  184. In State-A moving U
  185. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  186. predict error 0
  187. dir: dir isU
  188. -/|24: O: O48 (predict-no)
  189. I see 1 and I'm going to do: predict-no
  190. ENV: Agent did: predict-no for direction U in state State-A
  191. In State-A moving U
  192. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  193. predict error 0
  194. dir: dir isR
  195. \-25: O: O50 (predict-no)
  196. I see 1 and I'm going to do: predict-no
  197. ENV: Agent did: predict-no for direction R in state State-A
  198. In State-A moving R
  199. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  200. predict error 1
  201. dir: dir isL
  202. /|\26: O: O51 (predict-yes)
  203. I see 0 and I'm going to do: predict-yes
  204. ENV: Agent did: predict-yes for direction L in state State-B
  205. In State-B moving L
  206. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  207. predict error 0
  208. dir: dir isR
  209. -/|27: O: O53 (predict-yes)
  210. I see 1 and I'm going to do: predict-yes
  211. ENV: Agent did: predict-yes for direction R in state State-A
  212. In State-A moving R
  213. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  214. predict error 0
  215. dir: dir isR
  216. \-28: O: O55 (predict-yes)
  217. I see 1 and I'm going to do: predict-yes
  218. ENV: Agent did: predict-yes for direction R in state State-B
  219. In State-B moving R
  220. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  221. predict error 1
  222. dir: dir isU
  223. /|\29: O: O57 (predict-yes)
  224. I see 0 and I'm going to do: predict-yes
  225. ENV: Agent did: predict-yes for direction U in state State-B
  226. In State-B moving U
  227. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  228. predict error 1
  229. dir: dir isU
  230. -/|30: O: O60 (predict-no)
  231. I see 0 and I'm going to do: predict-no
  232. ENV: Agent did: predict-no for direction U in state State-B
  233. In State-B moving U
  234. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  235. predict error 0
  236. dir: dir isR
  237. \-/31: O: O61 (predict-yes)
  238. I see 1 and I'm going to do: predict-yes
  239. ENV: Agent did: predict-yes for direction R in state State-B
  240. In State-B moving R
  241. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  242. predict error 1
  243. dir: dir isU
  244. |32: O: O64 (predict-no)
  245. I see 0 and I'm going to do: predict-no
  246. ENV: Agent did: predict-no for direction U in state State-B
  247. In State-B moving U
  248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  249. predict error 0
  250. dir: dir isL
  251. \-/33: O: O65 (predict-yes)
  252. I see 1 and I'm going to do: predict-yes
  253. ENV: Agent did: predict-yes for direction L in state State-B
  254. In State-B moving L
  255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  256. predict error 0
  257. dir: dir isU
  258. |\-34: O: O68 (predict-no)
  259. I see 1 and I'm going to do: predict-no
  260. ENV: Agent did: predict-no for direction U in state State-A
  261. In State-A moving U
  262. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  263. predict error 0
  264. dir: dir isR
  265. /|\35: O: O69 (predict-yes)
  266. I see 1 and I'm going to do: predict-yes
  267. ENV: Agent did: predict-yes for direction R in state State-A
  268. In State-A moving R
  269. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  270. predict error 0
  271. dir: dir isL
  272. -/|36: O: O71 (predict-yes)
  273. I see 1 and I'm going to do: predict-yes
  274. ENV: Agent did: predict-yes for direction L in state State-B
  275. In State-B moving L
  276. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  277. predict error 0
  278. dir: dir isU
  279. \-/37: O: O74 (predict-no)
  280. I see 1 and I'm going to do: predict-no
  281. ENV: Agent did: predict-no for direction U in state State-A
  282. In State-A moving U
  283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  284. predict error 0
  285. dir: dir isR
  286. |\38: O: O75 (predict-yes)
  287. I see 1 and I'm going to do: predict-yes
  288. ENV: Agent did: predict-yes for direction R in state State-A
  289. In State-A moving R
  290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  291. predict error 0
  292. dir: dir isU
  293. -/|39: O: O77 (predict-yes)
  294. I see 1 and I'm going to do: predict-yes
  295. ENV: Agent did: predict-yes for direction U in state State-B
  296. In State-B moving U
  297. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  298. predict error 1
  299. dir: dir isU
  300. \-/40: O: O80 (predict-no)
  301. I see 0 and I'm going to do: predict-no
  302. ENV: Agent did: predict-no for direction U in state State-B
  303. In State-B moving U
  304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  305. predict error 0
  306. dir: dir isL
  307. |\-41: O: O81 (predict-yes)
  308. I see 1 and I'm going to do: predict-yes
  309. ENV: Agent did: predict-yes for direction L in state State-B
  310. In State-B moving L
  311. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  312. predict error 0
  313. dir: dir isR
  314. /42: O: O83 (predict-yes)
  315. I see 1 and I'm going to do: predict-yes
  316. ENV: Agent did: predict-yes for direction R in state State-A
  317. In State-A moving R
  318. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  319. predict error 0
  320. dir: dir isU
  321. |\43: O: O86 (predict-no)
  322. I see 1 and I'm going to do: predict-no
  323. ENV: Agent did: predict-no for direction U in state State-B
  324. In State-B moving U
  325. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  326. predict error 0
  327. dir: dir isL
  328. -/44: O: O87 (predict-yes)
  329. I see 1 and I'm going to do: predict-yes
  330. ENV: Agent did: predict-yes for direction L in state State-B
  331. In State-B moving L
  332. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  333. predict error 0
  334. dir: dir isL
  335. |\-45: O: O89 (predict-yes)
  336. I see 1 and I'm going to do: predict-yes
  337. ENV: Agent did: predict-yes for direction L in state State-A
  338. In State-A moving L
  339. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  340. predict error 1
  341. dir: dir isU
  342. /|\46: O: O92 (predict-no)
  343. I see 0 and I'm going to do: predict-no
  344. ENV: Agent did: predict-no for direction U in state State-A
  345. In State-A moving U
  346. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  347. predict error 0
  348. dir: dir isL
  349. -/|47: O: O93 (predict-yes)
  350. I see 1 and I'm going to do: predict-yes
  351. ENV: Agent did: predict-yes for direction L in state State-A
  352. In State-A moving L
  353. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  354. predict error 1
  355. dir: dir isR
  356. \-/48: O: O96 (predict-no)
  357. I see 0 and I'm going to do: predict-no
  358. ENV: Agent did: predict-no for direction R in state State-A
  359. In State-A moving R
  360. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  361. predict error 1
  362. dir: dir isL
  363. |\-49: O: O97 (predict-yes)
  364. I see 0 and I'm going to do: predict-yes
  365. ENV: Agent did: predict-yes for direction L in state State-B
  366. In State-B moving L
  367. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  368. predict error 0
  369. dir: dir isU
  370. /|\50: O: O100 (predict-no)
  371. I see 1 and I'm going to do: predict-no
  372. ENV: Agent did: predict-no for direction U in state State-A
  373. In State-A moving U
  374. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  375. predict error 0
  376. dir: dir isU
  377. -/|\-/|sleeping...
  378. \sleeping...
  379. -51: O: O102 (predict-no)
  380. I see 1 and I'm going to do: predict-no
  381. ENV: Agent did: predict-no for direction U in state State-A
  382. In State-A moving U
  383. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  384. predict error 0
  385. dir: dir isR
  386. /52: O: O104 (predict-no)
  387. I see 1 and I'm going to do: predict-no
  388. ENV: Agent did: predict-no for direction R in state State-A
  389. In State-A moving R
  390. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  391. predict error 1
  392. dir: dir isL
  393. |\-53: O: O106 (predict-no)
  394. I see 0 and I'm going to do: predict-no
  395. ENV: Agent did: predict-no for direction L in state State-B
  396. In State-B moving L
  397. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  398. predict error 1
  399. dir: dir isL
  400. /|\54: O: O107 (predict-yes)
  401. I see 0 and I'm going to do: predict-yes
  402. ENV: Agent did: predict-yes for direction L in state State-A
  403. In State-A moving L
  404. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  405. predict error 1
  406. dir: dir isR
  407. -/55: O: O109 (predict-yes)
  408. I see 0 and I'm going to do: predict-yes
  409. ENV: Agent did: predict-yes for direction R in state State-A
  410. In State-A moving R
  411. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  412. predict error 0
  413. dir: dir isU
  414. |\-56: O: O112 (predict-no)
  415. I see 1 and I'm going to do: predict-no
  416. ENV: Agent did: predict-no for direction U in state State-B
  417. In State-B moving U
  418. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  419. predict error 0
  420. dir: dir isL
  421. /|\57: O: O114 (predict-no)
  422. I see 1 and I'm going to do: predict-no
  423. ENV: Agent did: predict-no for direction L in state State-B
  424. In State-B moving L
  425. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  426. predict error 1
  427. dir: dir isR
  428. -/|\58: O: O115 (predict-yes)
  429. I see 0 and I'm going to do: predict-yes
  430. ENV: Agent did: predict-yes for direction R in state State-A
  431. In State-A moving R
  432. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  433. predict error 0
  434. dir: dir isU
  435. -59: O: O118 (predict-no)
  436. I see 1 and I'm going to do: predict-no
  437. ENV: Agent did: predict-no for direction U in state State-B
  438. In State-B moving U
  439. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  440. predict error 0
  441. dir: dir isR
  442. /|60: O: O119 (predict-yes)
  443. I see 1 and I'm going to do: predict-yes
  444. ENV: Agent did: predict-yes for direction R in state State-B
  445. In State-B moving R
  446. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  447. predict error 1
  448. dir: dir isU
  449. \-61: O: O122 (predict-no)
  450. I see 0 and I'm going to do: predict-no
  451. ENV: Agent did: predict-no for direction U in state State-B
  452. In State-B moving U
  453. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  454. predict error 0
  455. dir: dir isR
  456. rule alias: '*'
  457. rule alias: '*'
  458. rule alias: '*'
  459. rule alias: '*'
  460. rule alias: '*'
  461. rule alias: '*'
  462. rule alias: '*'
  463. rule alias: '*'
  464. rule alias: '*'
  465. rule alias: '*'
  466. rule alias: '*'
  467. /62: O: O123 (predict-yes)
  468. I see 1 and I'm going to do: predict-yes
  469. ENV: Agent did: predict-yes for direction R in state State-B
  470. In State-B moving R
  471. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  472. predict error 1
  473. dir: dir isU
  474. |\-63: O: O126 (predict-no)
  475. I see 0 and I'm going to do: predict-no
  476. ENV: Agent did: predict-no for direction U in state State-B
  477. In State-B moving U
  478. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  479. predict error 0
  480. dir: dir isR
  481. /|64: O: O127 (predict-yes)
  482. I see 1 and I'm going to do: predict-yes
  483. ENV: Agent did: predict-yes for direction R in state State-B
  484. In State-B moving R
  485. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  486. predict error 1
  487. dir: dir isR
  488. \-65: O: O129 (predict-yes)
  489. I see 0 and I'm going to do: predict-yes
  490. ENV: Agent did: predict-yes for direction R in state State-B
  491. In State-B moving R
  492. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  493. predict error 1
  494. dir: dir isR
  495. /|66: O: O131 (predict-yes)
  496. I see 0 and I'm going to do: predict-yes
  497. ENV: Agent did: predict-yes for direction R in state State-B
  498. In State-B moving R
  499. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  500. predict error 1
  501. dir: dir isR
  502. \-/67: O: O133 (predict-yes)
  503. I see 0 and I'm going to do: predict-yes
  504. ENV: Agent did: predict-yes for direction R in state State-B
  505. In State-B moving R
  506. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  507. predict error 1
  508. dir: dir isR
  509. |\68: O: O135 (predict-yes)
  510. I see 0 and I'm going to do: predict-yes
  511. ENV: Agent did: predict-yes for direction R in state State-B
  512. In State-B moving R
  513. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  514. predict error 1
  515. dir: dir isR
  516. -/|69: O: O138 (predict-no)
  517. I see 0 and I'm going to do: predict-no
  518. ENV: Agent did: predict-no for direction R in state State-B
  519. In State-B moving R
  520. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  521. predict error 0
  522. dir: dir isL
  523. \-/70: O: O139 (predict-yes)
  524. I see 1 and I'm going to do: predict-yes
  525. ENV: Agent did: predict-yes for direction L in state State-B
  526. In State-B moving L
  527. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  528. predict error 0
  529. dir: dir isL
  530. |\71: O: O141 (predict-yes)
  531. I see 1 and I'm going to do: predict-yes
  532. ENV: Agent did: predict-yes for direction L in state State-A
  533. In State-A moving L
  534. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  535. predict error 1
  536. dir: dir isL
  537. rule alias: '*'
  538. rule alias: '*'
  539. rule alias: '*'
  540. rule alias: '*'
  541. rule alias: '*'
  542. -72: O: O143 (predict-yes)
  543. I see 0 and I'm going to do: predict-yes
  544. ENV: Agent did: predict-yes for direction L in state State-A
  545. In State-A moving L
  546. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  547. predict error 1
  548. dir: dir isR
  549. /|\73: O: O146 (predict-no)
  550. I see 0 and I'm going to do: predict-no
  551. ENV: Agent did: predict-no for direction R in state State-A
  552. In State-A moving R
  553. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  554. predict error 1
  555. dir: dir isR
  556. -/74: O: O147 (predict-yes)
  557. I see 0 and I'm going to do: predict-yes
  558. ENV: Agent did: predict-yes for direction R in state State-B
  559. In State-B moving R
  560. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  561. predict error 1
  562. dir: dir isR
  563. |\75: O: O150 (predict-no)
  564. I see 0 and I'm going to do: predict-no
  565. ENV: Agent did: predict-no for direction R in state State-B
  566. In State-B moving R
  567. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  568. predict error 0
  569. dir: dir isL
  570. -/|76: O: O151 (predict-yes)
  571. I see 1 and I'm going to do: predict-yes
  572. ENV: Agent did: predict-yes for direction L in state State-B
  573. In State-B moving L
  574. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  575. predict error 0
  576. dir: dir isU
  577. \-/77: O: O154 (predict-no)
  578. I see 1 and I'm going to do: predict-no
  579. ENV: Agent did: predict-no for direction U in state State-A
  580. In State-A moving U
  581. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  582. predict error 0
  583. dir: dir isU
  584. |\-78: O: O156 (predict-no)
  585. I see 1 and I'm going to do: predict-no
  586. ENV: Agent did: predict-no for direction U in state State-A
  587. In State-A moving U
  588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  589. predict error 0
  590. dir: dir isU
  591. /|\79: O: O158 (predict-no)
  592. I see 1 and I'm going to do: predict-no
  593. ENV: Agent did: predict-no for direction U in state State-A
  594. In State-A moving U
  595. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  596. predict error 0
  597. dir: dir isU
  598. -80: O: O160 (predict-no)
  599. I see 1 and I'm going to do: predict-no
  600. ENV: Agent did: predict-no for direction U in state State-A
  601. In State-A moving U
  602. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  603. predict error 0
  604. dir: dir isU
  605. /|\81: O: O162 (predict-no)
  606. I see 1 and I'm going to do: predict-no
  607. ENV: Agent did: predict-no for direction U in state State-A
  608. In State-A moving U
  609. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  610. predict error 0
  611. dir: dir isU
  612. rule alias: '*'
  613. rule alias: '*'
  614. rule alias: '*'
  615. -82: O: O164 (predict-no)
  616. I see 1 and I'm going to do: predict-no
  617. ENV: Agent did: predict-no for direction U in state State-A
  618. In State-A moving U
  619. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  620. predict error 0
  621. dir: dir isR
  622. /|\83: O: O165 (predict-yes)
  623. I see 1 and I'm going to do: predict-yes
  624. ENV: Agent did: predict-yes for direction R in state State-A
  625. In State-A moving R
  626. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  627. predict error 0
  628. dir: dir isR
  629. -/84: O: O168 (predict-no)
  630. I see 1 and I'm going to do: predict-no
  631. ENV: Agent did: predict-no for direction R in state State-B
  632. In State-B moving R
  633. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  634. predict error 0
  635. dir: dir isU
  636. |\-85: O: O169 (predict-yes)
  637. I see 1 and I'm going to do: predict-yes
  638. ENV: Agent did: predict-yes for direction U in state State-B
  639. In State-B moving U
  640. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  641. predict error 1
  642. dir: dir isL
  643. /|\86: O: O172 (predict-no)
  644. I see 0 and I'm going to do: predict-no
  645. ENV: Agent did: predict-no for direction L in state State-B
  646. In State-B moving L
  647. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  648. predict error 1
  649. dir: dir isU
  650. -/|87: O: O174 (predict-no)
  651. I see 0 and I'm going to do: predict-no
  652. ENV: Agent did: predict-no for direction U in state State-A
  653. In State-A moving U
  654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  655. predict error 0
  656. dir: dir isU
  657. \-88: O: O176 (predict-no)
  658. I see 1 and I'm going to do: predict-no
  659. ENV: Agent did: predict-no for direction U in state State-A
  660. In State-A moving U
  661. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  662. predict error 0
  663. dir: dir isU
  664. /|\89: O: O178 (predict-no)
  665. I see 1 and I'm going to do: predict-no
  666. ENV: Agent did: predict-no for direction U in state State-A
  667. In State-A moving U
  668. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  669. predict error 0
  670. dir: dir isR
  671. -/90: O: O180 (predict-no)
  672. I see 1 and I'm going to do: predict-no
  673. ENV: Agent did: predict-no for direction R in state State-A
  674. In State-A moving R
  675. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  676. predict error 1
  677. dir: dir isU
  678. |\-91: O: O182 (predict-no)
  679. I see 0 and I'm going to do: predict-no
  680. ENV: Agent did: predict-no for direction U in state State-B
  681. In State-B moving U
  682. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  683. predict error 0
  684. dir: dir isR
  685. rule alias: '*'
  686. rule alias: '*'
  687. rule alias: '*'
  688. /92: O: O184 (predict-no)
  689. I see 1 and I'm going to do: predict-no
  690. ENV: Agent did: predict-no for direction R in state State-B
  691. In State-B moving R
  692. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  693. predict error 0
  694. dir: dir isR
  695. |\-93: O: O186 (predict-no)
  696. I see 1 and I'm going to do: predict-no
  697. ENV: Agent did: predict-no for direction R in state State-B
  698. In State-B moving R
  699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  700. predict error 0
  701. dir: dir isR
  702. /|\94: O: O187 (predict-yes)
  703. I see 1 and I'm going to do: predict-yes
  704. ENV: Agent did: predict-yes for direction R in state State-B
  705. In State-B moving R
  706. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  707. predict error 1
  708. dir: dir isU
  709. -/|95: O: O189 (predict-yes)
  710. I see 0 and I'm going to do: predict-yes
  711. ENV: Agent did: predict-yes for direction U in state State-B
  712. In State-B moving U
  713. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  714. predict error 1
  715. dir: dir isU
  716. \96: O: O192 (predict-no)
  717. I see 0 and I'm going to do: predict-no
  718. ENV: Agent did: predict-no for direction U in state State-B
  719. In State-B moving U
  720. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  721. predict error 0
  722. dir: dir isU
  723. -/|97: O: O194 (predict-no)
  724. I see 1 and I'm going to do: predict-no
  725. ENV: Agent did: predict-no for direction U in state State-B
  726. In State-B moving U
  727. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  728. predict error 0
  729. dir: dir isL
  730. \-98: O: O195 (predict-yes)
  731. I see 1 and I'm going to do: predict-yes
  732. ENV: Agent did: predict-yes for direction L in state State-B
  733. In State-B moving L
  734. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  735. predict error 0
  736. dir: dir isR
  737. /|\99: O: O197 (predict-yes)
  738. I see 1 and I'm going to do: predict-yes
  739. ENV: Agent did: predict-yes for direction R in state State-A
  740. In State-A moving R
  741. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  742. predict error 0
  743. dir: dir isR
  744. -/|100: O: O200 (predict-no)
  745. I see 1 and I'm going to do: predict-no
  746. ENV: Agent did: predict-no for direction R in state State-B
  747. In State-B moving R
  748. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  749. predict error 0
  750. dir: dir isR
  751. \-/101: O: O202 (predict-no)
  752. I see 1 and I'm going to do: predict-no
  753. ENV: Agent did: predict-no for direction R in state State-B
  754. In State-B moving R
  755. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  756. predict error 0
  757. dir: dir isU
  758. rule alias: '*'
  759. |\-/|\-/|\-/|\-/|\-/|\-/|\-/|\-/|sleeping...
  760. \102: O: O204 (predict-no)
  761. I see 1 and I'm going to do: predict-no
  762. ENV: Agent did: predict-no for direction U in state State-B
  763. In State-B moving U
  764. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  765. predict error 0
  766. dir: dir isL
  767. -/|103: O: O206 (predict-no)
  768. I see 1 and I'm going to do: predict-no
  769. ENV: Agent did: predict-no for direction L in state State-B
  770. In State-B moving L
  771. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  772. predict error 1
  773. dir: dir isU
  774. \-/104: O: O208 (predict-no)
  775. I see 0 and I'm going to do: predict-no
  776. ENV: Agent did: predict-no for direction U in state State-A
  777. In State-A moving U
  778. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  779. predict error 0
  780. dir: dir isL
  781. |\-105: O: O209 (predict-yes)
  782. I see 1 and I'm going to do: predict-yes
  783. ENV: Agent did: predict-yes for direction L in state State-A
  784. In State-A moving L
  785. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  786. predict error 1
  787. dir: dir isL
  788. /|\106: O: O211 (predict-yes)
  789. I see 0 and I'm going to do: predict-yes
  790. ENV: Agent did: predict-yes for direction L in state State-A
  791. In State-A moving L
  792. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  793. predict error 1
  794. dir: dir isU
  795. -/|107: O: O214 (predict-no)
  796. I see 0 and I'm going to do: predict-no
  797. ENV: Agent did: predict-no for direction U in state State-A
  798. In State-A moving U
  799. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  800. predict error 0
  801. dir: dir isL
  802. \-108: O: O215 (predict-yes)
  803. I see 1 and I'm going to do: predict-yes
  804. ENV: Agent did: predict-yes for direction L in state State-A
  805. In State-A moving L
  806. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  807. predict error 1
  808. dir: dir isU
  809. /|109: O: O218 (predict-no)
  810. I see 0 and I'm going to do: predict-no
  811. ENV: Agent did: predict-no for direction U in state State-A
  812. In State-A moving U
  813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  814. predict error 0
  815. dir: dir isL
  816. \-/110: O: O219 (predict-yes)
  817. I see 1 and I'm going to do: predict-yes
  818. ENV: Agent did: predict-yes for direction L in state State-A
  819. In State-A moving L
  820. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  821. predict error 1
  822. dir: dir isL
  823. |\111: O: O221 (predict-yes)
  824. I see 0 and I'm going to do: predict-yes
  825. ENV: Agent did: predict-yes for direction L in state State-A
  826. In State-A moving L
  827. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  828. predict error 1
  829. dir: dir isU
  830. rule alias: '*'
  831. rule alias: '*'
  832. rule alias: '*'
  833. -112: O: O224 (predict-no)
  834. I see 0 and I'm going to do: predict-no
  835. ENV: Agent did: predict-no for direction U in state State-A
  836. In State-A moving U
  837. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  838. predict error 0
  839. dir: dir isL
  840. /|\113: O: O225 (predict-yes)
  841. I see 1 and I'm going to do: predict-yes
  842. ENV: Agent did: predict-yes for direction L in state State-A
  843. In State-A moving L
  844. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  845. predict error 1
  846. dir: dir isR
  847. -114: O: O228 (predict-no)
  848. I see 0 and I'm going to do: predict-no
  849. ENV: Agent did: predict-no for direction R in state State-A
  850. In State-A moving R
  851. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  852. predict error 1
  853. dir: dir isU
  854. /|\115: O: O230 (predict-no)
  855. I see 0 and I'm going to do: predict-no
  856. ENV: Agent did: predict-no for direction U in state State-B
  857. In State-B moving U
  858. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  859. predict error 0
  860. dir: dir isR
  861. -/116: O: O232 (predict-no)
  862. I see 1 and I'm going to do: predict-no
  863. ENV: Agent did: predict-no for direction R in state State-B
  864. In State-B moving R
  865. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  866. predict error 0
  867. dir: dir isU
  868. |117: O: O234 (predict-no)
  869. I see 1 and I'm going to do: predict-no
  870. ENV: Agent did: predict-no for direction U in state State-B
  871. In State-B moving U
  872. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  873. predict error 0
  874. dir: dir isL
  875. \-/118: O: O235 (predict-yes)
  876. I see 1 and I'm going to do: predict-yes
  877. ENV: Agent did: predict-yes for direction L in state State-B
  878. In State-B moving L
  879. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  880. predict error 0
  881. dir: dir isR
  882. |\119: O: O238 (predict-no)
  883. I see 1 and I'm going to do: predict-no
  884. ENV: Agent did: predict-no for direction R in state State-A
  885. In State-A moving R
  886. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  887. predict error 1
  888. dir: dir isR
  889. -/|120: O: O239 (predict-yes)
  890. I see 0 and I'm going to do: predict-yes
  891. ENV: Agent did: predict-yes for direction R in state State-B
  892. In State-B moving R
  893. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  894. predict error 1
  895. dir: dir isR
  896. \-121: O: O242 (predict-no)
  897. I see 0 and I'm going to do: predict-no
  898. ENV: Agent did: predict-no for direction R in state State-B
  899. In State-B moving R
  900. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  901. predict error 0
  902. dir: dir isR
  903. rule alias: '*'
  904. rule alias: '*'
  905. rule alias: '*'
  906. rule alias: '*'
  907. rule alias: '*'
  908. rule alias: '*'
  909. rule alias: '*'
  910. rule alias: '*'
  911. /122: O: O244 (predict-no)
  912. I see 1 and I'm going to do: predict-no
  913. ENV: Agent did: predict-no for direction R in state State-B
  914. In State-B moving R
  915. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  916. predict error 0
  917. dir: dir isR
  918. |\-/123: O: O245 (predict-yes)
  919. I see 1 and I'm going to do: predict-yes
  920. ENV: Agent did: predict-yes for direction R in state State-B
  921. In State-B moving R
  922. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  923. predict error 1
  924. dir: dir isU
  925. |\-124: O: O248 (predict-no)
  926. I see 0 and I'm going to do: predict-no
  927. ENV: Agent did: predict-no for direction U in state State-B
  928. In State-B moving U
  929. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  930. predict error 0
  931. dir: dir isL
  932. /|\125: O: O249 (predict-yes)
  933. I see 1 and I'm going to do: predict-yes
  934. ENV: Agent did: predict-yes for direction L in state State-B
  935. In State-B moving L
  936. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  937. predict error 0
  938. dir: dir isL
  939. -/|126: O: O251 (predict-yes)
  940. I see 1 and I'm going to do: predict-yes
  941. ENV: Agent did: predict-yes for direction L in state State-A
  942. In State-A moving L
  943. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  944. predict error 1
  945. dir: dir isU
  946. \-/127: O: O254 (predict-no)
  947. I see 0 and I'm going to do: predict-no
  948. ENV: Agent did: predict-no for direction U in state State-A
  949. In State-A moving U
  950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  951. predict error 0
  952. dir: dir isL
  953. |\-128: O: O255 (predict-yes)
  954. I see 1 and I'm going to do: predict-yes
  955. ENV: Agent did: predict-yes for direction L in state State-A
  956. In State-A moving L
  957. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  958. predict error 1
  959. dir: dir isL
  960. /|129: O: O257 (predict-yes)
  961. I see 0 and I'm going to do: predict-yes
  962. ENV: Agent did: predict-yes for direction L in state State-A
  963. In State-A moving L
  964. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  965. predict error 1
  966. dir: dir isL
  967. \-/130: O: O259 (predict-yes)
  968. I see 0 and I'm going to do: predict-yes
  969. ENV: Agent did: predict-yes for direction L in state State-A
  970. In State-A moving L
  971. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  972. predict error 1
  973. dir: dir isU
  974. |\131: O: O262 (predict-no)
  975. I see 0 and I'm going to do: predict-no
  976. ENV: Agent did: predict-no for direction U in state State-A
  977. In State-A moving U
  978. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  979. predict error 0
  980. dir: dir isU
  981. -132: O: O264 (predict-no)
  982. I see 1 and I'm going to do: predict-no
  983. ENV: Agent did: predict-no for direction U in state State-A
  984. In State-A moving U
  985. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  986. predict error 0
  987. dir: dir isL
  988. /|\-133: O: O265 (predict-yes)
  989. I see 1 and I'm going to do: predict-yes
  990. ENV: Agent did: predict-yes for direction L in state State-A
  991. In State-A moving L
  992. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  993. predict error 1
  994. dir: dir isR
  995. /|\134: O: O268 (predict-no)
  996. I see 0 and I'm going to do: predict-no
  997. ENV: Agent did: predict-no for direction R in state State-A
  998. In State-A moving R
  999. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1000. predict error 1
  1001. dir: dir isL
  1002. -/135: O: O269 (predict-yes)
  1003. I see 0 and I'm going to do: predict-yes
  1004. ENV: Agent did: predict-yes for direction L in state State-B
  1005. In State-B moving L
  1006. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1007. predict error 0
  1008. dir: dir isL
  1009. |\-136: O: O271 (predict-yes)
  1010. I see 1 and I'm going to do: predict-yes
  1011. ENV: Agent did: predict-yes for direction L in state State-A
  1012. In State-A moving L
  1013. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1014. predict error 1
  1015. dir: dir isL
  1016. /|\137: O: O274 (predict-no)
  1017. I see 0 and I'm going to do: predict-no
  1018. ENV: Agent did: predict-no for direction L in state State-A
  1019. In State-A moving L
  1020. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1021. predict error 0
  1022. dir: dir isR
  1023. -/|138: O: O276 (predict-no)
  1024. I see 1 and I'm going to do: predict-no
  1025. ENV: Agent did: predict-no for direction R in state State-A
  1026. In State-A moving R
  1027. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1028. predict error 1
  1029. dir: dir isR
  1030. \-/139: O: O278 (predict-no)
  1031. I see 0 and I'm going to do: predict-no
  1032. ENV: Agent did: predict-no for direction R in state State-B
  1033. In State-B moving R
  1034. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1035. predict error 0
  1036. dir: dir isL
  1037. |\-140: O: O279 (predict-yes)
  1038. I see 1 and I'm going to do: predict-yes
  1039. ENV: Agent did: predict-yes for direction L in state State-B
  1040. In State-B moving L
  1041. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1042. predict error 0
  1043. dir: dir isR
  1044. /|\141: O: O282 (predict-no)
  1045. I see 1 and I'm going to do: predict-no
  1046. ENV: Agent did: predict-no for direction R in state State-A
  1047. In State-A moving R
  1048. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1049. predict error 1
  1050. dir: dir isL
  1051. rule alias: '*'
  1052. -142: O: O283 (predict-yes)
  1053. I see 0 and I'm going to do: predict-yes
  1054. ENV: Agent did: predict-yes for direction L in state State-B
  1055. In State-B moving L
  1056. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1057. predict error 0
  1058. dir: dir isL
  1059. /|\-143: O: O286 (predict-no)
  1060. I see 1 and I'm going to do: predict-no
  1061. ENV: Agent did: predict-no for direction L in state State-A
  1062. In State-A moving L
  1063. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1064. predict error 0
  1065. dir: dir isU
  1066. /|\144: O: O288 (predict-no)
  1067. I see 1 and I'm going to do: predict-no
  1068. ENV: Agent did: predict-no for direction U in state State-A
  1069. In State-A moving U
  1070. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1071. predict error 0
  1072. dir: dir isL
  1073. -/|145: O: O290 (predict-no)
  1074. I see 1 and I'm going to do: predict-no
  1075. ENV: Agent did: predict-no for direction L in state State-A
  1076. In State-A moving L
  1077. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1078. predict error 0
  1079. dir: dir isR
  1080. \-/146: O: O292 (predict-no)
  1081. I see 1 and I'm going to do: predict-no
  1082. ENV: Agent did: predict-no for direction R in state State-A
  1083. In State-A moving R
  1084. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1085. predict error 1
  1086. dir: dir isU
  1087. |\-147: O: O294 (predict-no)
  1088. I see 0 and I'm going to do: predict-no
  1089. ENV: Agent did: predict-no for direction U in state State-B
  1090. In State-B moving U
  1091. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1092. predict error 0
  1093. dir: dir isU
  1094. /|\148: O: O296 (predict-no)
  1095. I see 1 and I'm going to do: predict-no
  1096. ENV: Agent did: predict-no for direction U in state State-B
  1097. In State-B moving U
  1098. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1099. predict error 0
  1100. dir: dir isU
  1101. -/|149: O: O298 (predict-no)
  1102. I see 1 and I'm going to do: predict-no
  1103. ENV: Agent did: predict-no for direction U in state State-B
  1104. In State-B moving U
  1105. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1106. predict error 0
  1107. dir: dir isL
  1108. \-/150: O: O299 (predict-yes)
  1109. I see 1 and I'm going to do: predict-yes
  1110. ENV: Agent did: predict-yes for direction L in state State-B
  1111. In State-B moving L
  1112. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1113. predict error 0
  1114. dir: dir isU
  1115. |\-151: O: O302 (predict-no)
  1116. I see 1 and I'm going to do: predict-no
  1117. ENV: Agent did: predict-no for direction U in state State-A
  1118. In State-A moving U
  1119. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1120. predict error 0
  1121. dir: dir isU
  1122. /152: O: O304 (predict-no)
  1123. I see 1 and I'm going to do: predict-no
  1124. ENV: Agent did: predict-no for direction U in state State-A
  1125. In State-A moving U
  1126. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1127. predict error 0
  1128. dir: dir isL
  1129. |\-153: O: O306 (predict-no)
  1130. I see 1 and I'm going to do: predict-no
  1131. ENV: Agent did: predict-no for direction L in state State-A
  1132. In State-A moving L
  1133. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1134. predict error 0
  1135. dir: dir isU
  1136. /|154: O: O308 (predict-no)
  1137. I see 1 and I'm going to do: predict-no
  1138. ENV: Agent did: predict-no for direction U in state State-A
  1139. In State-A moving U
  1140. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1141. predict error 0
  1142. dir: dir isU
  1143. \-/155: O: O310 (predict-no)
  1144. I see 1 and I'm going to do: predict-no
  1145. ENV: Agent did: predict-no for direction U in state State-A
  1146. In State-A moving U
  1147. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1148. predict error 0
  1149. dir: dir isR
  1150. |\-/sleeping...
  1151. |156: O: O312 (predict-no)
  1152. I see 1 and I'm going to do: predict-no
  1153. ENV: Agent did: predict-no for direction R in state State-A
  1154. In State-A moving R
  1155. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1156. predict error 1
  1157. dir: dir isL
  1158. \-/157: O: O313 (predict-yes)
  1159. I see 0 and I'm going to do: predict-yes
  1160. ENV: Agent did: predict-yes for direction L in state State-B
  1161. In State-B moving L
  1162. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1163. predict error 0
  1164. dir: dir isR
  1165. |\-158: O: O316 (predict-no)
  1166. I see 1 and I'm going to do: predict-no
  1167. ENV: Agent did: predict-no for direction R in state State-A
  1168. In State-A moving R
  1169. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1170. predict error 1
  1171. dir: dir isR
  1172. /|\159: O: O318 (predict-no)
  1173. I see 0 and I'm going to do: predict-no
  1174. ENV: Agent did: predict-no for direction R in state State-B
  1175. In State-B moving R
  1176. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1177. predict error 0
  1178. dir: dir isL
  1179. -160: O: O319 (predict-yes)
  1180. I see 1 and I'm going to do: predict-yes
  1181. ENV: Agent did: predict-yes for direction L in state State-B
  1182. In State-B moving L
  1183. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1184. predict error 0
  1185. dir: dir isR
  1186. /|\161: O: O322 (predict-no)
  1187. I see 1 and I'm going to do: predict-no
  1188. ENV: Agent did: predict-no for direction R in state State-A
  1189. In State-A moving R
  1190. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1191. predict error 1
  1192. dir: dir isR
  1193. -162: O: O324 (predict-no)
  1194. I see 0 and I'm going to do: predict-no
  1195. ENV: Agent did: predict-no for direction R in state State-B
  1196. In State-B moving R
  1197. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1198. predict error 0
  1199. dir: dir isR
  1200. /|163: O: O326 (predict-no)
  1201. I see 1 and I'm going to do: predict-no
  1202. ENV: Agent did: predict-no for direction R in state State-B
  1203. In State-B moving R
  1204. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1205. predict error 0
  1206. dir: dir isR
  1207. \164: O: O328 (predict-no)
  1208. I see 1 and I'm going to do: predict-no
  1209. ENV: Agent did: predict-no for direction R in state State-B
  1210. In State-B moving R
  1211. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1212. predict error 0
  1213. dir: dir isL
  1214. -/|165: O: O329 (predict-yes)
  1215. I see 1 and I'm going to do: predict-yes
  1216. ENV: Agent did: predict-yes for direction L in state State-B
  1217. In State-B moving L
  1218. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1219. predict error 0
  1220. dir: dir isR
  1221. \-/166: O: O332 (predict-no)
  1222. I see 1 and I'm going to do: predict-no
  1223. ENV: Agent did: predict-no for direction R in state State-A
  1224. In State-A moving R
  1225. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1226. predict error 1
  1227. dir: dir isU
  1228. |\-167: O: O334 (predict-no)
  1229. I see 0 and I'm going to do: predict-no
  1230. ENV: Agent did: predict-no for direction U in state State-B
  1231. In State-B moving U
  1232. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1233. predict error 0
  1234. dir: dir isL
  1235. /|\168: O: O335 (predict-yes)
  1236. I see 1 and I'm going to do: predict-yes
  1237. ENV: Agent did: predict-yes for direction L in state State-B
  1238. In State-B moving L
  1239. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1240. predict error 0
  1241. dir: dir isR
  1242. -/|169: O: O338 (predict-no)
  1243. I see 1 and I'm going to do: predict-no
  1244. ENV: Agent did: predict-no for direction R in state State-A
  1245. In State-A moving R
  1246. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1247. predict error 1
  1248. dir: dir isL
  1249. \-170: O: O339 (predict-yes)
  1250. I see 0 and I'm going to do: predict-yes
  1251. ENV: Agent did: predict-yes for direction L in state State-B
  1252. In State-B moving L
  1253. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1254. predict error 0
  1255. dir: dir isU
  1256. /|171: O: O342 (predict-no)
  1257. I see 1 and I'm going to do: predict-no
  1258. ENV: Agent did: predict-no for direction U in state State-A
  1259. In State-A moving U
  1260. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1261. predict error 0
  1262. dir: dir isR
  1263. \172: O: O343 (predict-yes)
  1264. I see 1 and I'm going to do: predict-yes
  1265. ENV: Agent did: predict-yes for direction R in state State-A
  1266. In State-A moving R
  1267. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1268. predict error 0
  1269. dir: dir isL
  1270. -/173: O: O345 (predict-yes)
  1271. I see 1 and I'm going to do: predict-yes
  1272. ENV: Agent did: predict-yes for direction L in state State-B
  1273. In State-B moving L
  1274. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1275. predict error 0
  1276. dir: dir isL
  1277. |\-174: O: O348 (predict-no)
  1278. I see 1 and I'm going to do: predict-no
  1279. ENV: Agent did: predict-no for direction L in state State-A
  1280. In State-A moving L
  1281. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1282. predict error 0
  1283. dir: dir isL
  1284. /|\175: O: O350 (predict-no)
  1285. I see 1 and I'm going to do: predict-no
  1286. ENV: Agent did: predict-no for direction L in state State-A
  1287. In State-A moving L
  1288. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1289. predict error 0
  1290. dir: dir isU
  1291. -/|176: O: O352 (predict-no)
  1292. I see 1 and I'm going to do: predict-no
  1293. ENV: Agent did: predict-no for direction U in state State-A
  1294. In State-A moving U
  1295. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1296. predict error 0
  1297. dir: dir isR
  1298. \-177: O: O353 (predict-yes)
  1299. I see 1 and I'm going to do: predict-yes
  1300. ENV: Agent did: predict-yes for direction R in state State-A
  1301. In State-A moving R
  1302. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1303. predict error 0
  1304. dir: dir isL
  1305. /|\178: O: O355 (predict-yes)
  1306. I see 1 and I'm going to do: predict-yes
  1307. ENV: Agent did: predict-yes for direction L in state State-B
  1308. In State-B moving L
  1309. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1310. predict error 0
  1311. dir: dir isR
  1312. -/|179: O: O357 (predict-yes)
  1313. I see 1 and I'm going to do: predict-yes
  1314. ENV: Agent did: predict-yes for direction R in state State-A
  1315. In State-A moving R
  1316. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1317. predict error 0
  1318. dir: dir isU
  1319. \-/180: O: O360 (predict-no)
  1320. I see 1 and I'm going to do: predict-no
  1321. ENV: Agent did: predict-no for direction U in state State-B
  1322. In State-B moving U
  1323. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1324. predict error 0
  1325. dir: dir isR
  1326. |\-181: O: O362 (predict-no)
  1327. I see 1 and I'm going to do: predict-no
  1328. ENV: Agent did: predict-no for direction R in state State-B
  1329. In State-B moving R
  1330. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1331. predict error 0
  1332. dir: dir isR
  1333. /182: O: O364 (predict-no)
  1334. I see 1 and I'm going to do: predict-no
  1335. ENV: Agent did: predict-no for direction R in state State-B
  1336. In State-B moving R
  1337. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1338. predict error 0
  1339. dir: dir isU
  1340. |\-183: O: O366 (predict-no)
  1341. I see 1 and I'm going to do: predict-no
  1342. ENV: Agent did: predict-no for direction U in state State-B
  1343. In State-B moving U
  1344. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1345. predict error 0
  1346. dir: dir isR
  1347. /|\184: O: O368 (predict-no)
  1348. I see 1 and I'm going to do: predict-no
  1349. ENV: Agent did: predict-no for direction R in state State-B
  1350. In State-B moving R
  1351. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1352. predict error 0
  1353. dir: dir isR
  1354. -/|185: O: O370 (predict-no)
  1355. I see 1 and I'm going to do: predict-no
  1356. ENV: Agent did: predict-no for direction R in state State-B
  1357. In State-B moving R
  1358. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1359. predict error 0
  1360. dir: dir isR
  1361. \-/186: O: O372 (predict-no)
  1362. I see 1 and I'm going to do: predict-no
  1363. ENV: Agent did: predict-no for direction R in state State-B
  1364. In State-B moving R
  1365. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1366. predict error 0
  1367. dir: dir isL
  1368. |\-187: O: O373 (predict-yes)
  1369. I see 1 and I'm going to do: predict-yes
  1370. ENV: Agent did: predict-yes for direction L in state State-B
  1371. In State-B moving L
  1372. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1373. predict error 0
  1374. dir: dir isL
  1375. /|\188: O: O376 (predict-no)
  1376. I see 1 and I'm going to do: predict-no
  1377. ENV: Agent did: predict-no for direction L in state State-A
  1378. In State-A moving L
  1379. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1380. predict error 0
  1381. dir: dir isR
  1382. -/|189: O: O377 (predict-yes)
  1383. I see 1 and I'm going to do: predict-yes
  1384. ENV: Agent did: predict-yes for direction R in state State-A
  1385. In State-A moving R
  1386. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1387. predict error 0
  1388. dir: dir isL
  1389. \190: O: O379 (predict-yes)
  1390. I see 1 and I'm going to do: predict-yes
  1391. ENV: Agent did: predict-yes for direction L in state State-B
  1392. In State-B moving L
  1393. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1394. predict error 0
  1395. dir: dir isR
  1396. -/|191: O: O381 (predict-yes)
  1397. I see 1 and I'm going to do: predict-yes
  1398. ENV: Agent did: predict-yes for direction R in state State-A
  1399. In State-A moving R
  1400. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1401. predict error 0
  1402. dir: dir isR
  1403. \192: O: O384 (predict-no)
  1404. I see 1 and I'm going to do: predict-no
  1405. ENV: Agent did: predict-no for direction R in state State-B
  1406. In State-B moving R
  1407. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1408. predict error 0
  1409. dir: dir isU
  1410. -/|193: O: O386 (predict-no)
  1411. I see 1 and I'm going to do: predict-no
  1412. ENV: Agent did: predict-no for direction U in state State-B
  1413. In State-B moving U
  1414. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1415. predict error 0
  1416. dir: dir isR
  1417. \-194: O: O388 (predict-no)
  1418. I see 1 and I'm going to do: predict-no
  1419. ENV: Agent did: predict-no for direction R in state State-B
  1420. In State-B moving R
  1421. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1422. predict error 0
  1423. dir: dir isR
  1424. /|\195: O: O390 (predict-no)
  1425. I see 1 and I'm going to do: predict-no
  1426. ENV: Agent did: predict-no for direction R in state State-B
  1427. In State-B moving R
  1428. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1429. predict error 0
  1430. dir: dir isR
  1431. -/|196: O: O392 (predict-no)
  1432. I see 1 and I'm going to do: predict-no
  1433. ENV: Agent did: predict-no for direction R in state State-B
  1434. In State-B moving R
  1435. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1436. predict error 0
  1437. dir: dir isU
  1438. \-/197: O: O394 (predict-no)
  1439. I see 1 and I'm going to do: predict-no
  1440. ENV: Agent did: predict-no for direction U in state State-B
  1441. In State-B moving U
  1442. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1443. predict error 0
  1444. dir: dir isR
  1445. |198: O: O396 (predict-no)
  1446. I see 1 and I'm going to do: predict-no
  1447. ENV: Agent did: predict-no for direction R in state State-B
  1448. In State-B moving R
  1449. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1450. predict error 0
  1451. dir: dir isR
  1452. \-/199: O: O398 (predict-no)
  1453. I see 1 and I'm going to do: predict-no
  1454. ENV: Agent did: predict-no for direction R in state State-B
  1455. In State-B moving R
  1456. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1457. predict error 0
  1458. dir: dir isL
  1459. |\200: O: O399 (predict-yes)
  1460. I see 1 and I'm going to do: predict-yes
  1461. ENV: Agent did: predict-yes for direction L in state State-B
  1462. In State-B moving L
  1463. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1464. predict error 0
  1465. dir: dir isR
  1466. -/|201: O: O401 (predict-yes)
  1467. I see 1 and I'm going to do: predict-yes
  1468. ENV: Agent did: predict-yes for direction R in state State-A
  1469. In State-A moving R
  1470. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1471. predict error 0
  1472. dir: dir isL
  1473. \-202: O: O403 (predict-yes)
  1474. I see 1 and I'm going to do: predict-yes
  1475. ENV: Agent did: predict-yes for direction L in state State-B
  1476. In State-B moving L
  1477. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1478. predict error 0
  1479. dir: dir isL
  1480. /|\203: O: O406 (predict-no)
  1481. I see 1 and I'm going to do: predict-no
  1482. ENV: Agent did: predict-no for direction L in state State-A
  1483. In State-A moving L
  1484. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1485. predict error 0
  1486. dir: dir isR
  1487. -/|204: O: O407 (predict-yes)
  1488. I see 1 and I'm going to do: predict-yes
  1489. ENV: Agent did: predict-yes for direction R in state State-A
  1490. In State-A moving R
  1491. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1492. predict error 0
  1493. dir: dir isR
  1494. \-/205: O: O410 (predict-no)
  1495. I see 1 and I'm going to do: predict-no
  1496. ENV: Agent did: predict-no for direction R in state State-B
  1497. In State-B moving R
  1498. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1499. predict error 0
  1500. dir: dir isR
  1501. |\-206: O: O412 (predict-no)
  1502. I see 1 and I'm going to do: predict-no
  1503. ENV: Agent did: predict-no for direction R in state State-B
  1504. In State-B moving R
  1505. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1506. predict error 0
  1507. dir: dir isU
  1508. /|\-207: O: O414 (predict-no)
  1509. I see 1 and I'm going to do: predict-no
  1510. ENV: Agent did: predict-no for direction U in state State-B
  1511. In State-B moving U
  1512. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1513. predict error 0
  1514. dir: dir isU
  1515. /|\208: O: O416 (predict-no)
  1516. I see 1 and I'm going to do: predict-no
  1517. ENV: Agent did: predict-no for direction U in state State-B
  1518. In State-B moving U
  1519. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1520. predict error 0
  1521. dir: dir isR
  1522. -/209: O: O418 (predict-no)
  1523. I see 1 and I'm going to do: predict-no
  1524. ENV: Agent did: predict-no for direction R in state State-B
  1525. In State-B moving R
  1526. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1527. predict error 0
  1528. dir: dir isL
  1529. |\-/210: O: O419 (predict-yes)
  1530. I see 1 and I'm going to do: predict-yes
  1531. ENV: Agent did: predict-yes for direction L in state State-B
  1532. In State-B moving L
  1533. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1534. predict error 0
  1535. dir: dir isR
  1536. |\-211: O: O421 (predict-yes)
  1537. I see 1 and I'm going to do: predict-yes
  1538. ENV: Agent did: predict-yes for direction R in state State-A
  1539. In State-A moving R
  1540. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1541. predict error 0
  1542. dir: dir isU
  1543. /212: O: O424 (predict-no)
  1544. I see 1 and I'm going to do: predict-no
  1545. ENV: Agent did: predict-no for direction U in state State-B
  1546. In State-B moving U
  1547. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1548. predict error 0
  1549. dir: dir isU
  1550. |\-213: O: O426 (predict-no)
  1551. I see 1 and I'm going to do: predict-no
  1552. ENV: Agent did: predict-no for direction U in state State-B
  1553. In State-B moving U
  1554. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1555. predict error 0
  1556. dir: dir isU
  1557. /|\214: O: O428 (predict-no)
  1558. I see 1 and I'm going to do: predict-no
  1559. ENV: Agent did: predict-no for direction U in state State-B
  1560. In State-B moving U
  1561. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1562. predict error 0
  1563. dir: dir isL
  1564. -/|215: O: O429 (predict-yes)
  1565. I see 1 and I'm going to do: predict-yes
  1566. ENV: Agent did: predict-yes for direction L in state State-B
  1567. In State-B moving L
  1568. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1569. predict error 0
  1570. dir: dir isU
  1571. \-/216: O: O432 (predict-no)
  1572. I see 1 and I'm going to do: predict-no
  1573. ENV: Agent did: predict-no for direction U in state State-A
  1574. In State-A moving U
  1575. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1576. predict error 0
  1577. dir: dir isR
  1578. |\-217: O: O433 (predict-yes)
  1579. I see 1 and I'm going to do: predict-yes
  1580. ENV: Agent did: predict-yes for direction R in state State-A
  1581. In State-A moving R
  1582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1583. predict error 0
  1584. dir: dir isL
  1585. /|\218: O: O435 (predict-yes)
  1586. I see 1 and I'm going to do: predict-yes
  1587. ENV: Agent did: predict-yes for direction L in state State-B
  1588. In State-B moving L
  1589. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1590. predict error 0
  1591. dir: dir isU
  1592. -/219: O: O437 (predict-yes)
  1593. I see 1 and I'm going to do: predict-yes
  1594. ENV: Agent did: predict-yes for direction U in state State-A
  1595. In State-A moving U
  1596. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1597. predict error 1
  1598. dir: dir isU
  1599. |\-220: O: O440 (predict-no)
  1600. I see 0 and I'm going to do: predict-no
  1601. ENV: Agent did: predict-no for direction U in state State-A
  1602. In State-A moving U
  1603. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1604. predict error 0
  1605. dir: dir isR
  1606. /|\221: O: O441 (predict-yes)
  1607. I see 1 and I'm going to do: predict-yes
  1608. ENV: Agent did: predict-yes for direction R in state State-A
  1609. In State-A moving R
  1610. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1611. predict error 0
  1612. dir: dir isU
  1613. -222: O: O444 (predict-no)
  1614. I see 1 and I'm going to do: predict-no
  1615. ENV: Agent did: predict-no for direction U in state State-B
  1616. In State-B moving U
  1617. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1618. predict error 0
  1619. dir: dir isL
  1620. /|223: O: O445 (predict-yes)
  1621. I see 1 and I'm going to do: predict-yes
  1622. ENV: Agent did: predict-yes for direction L in state State-B
  1623. In State-B moving L
  1624. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1625. predict error 0
  1626. dir: dir isL
  1627. \-224: O: O448 (predict-no)
  1628. I see 1 and I'm going to do: predict-no
  1629. ENV: Agent did: predict-no for direction L in state State-A
  1630. In State-A moving L
  1631. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1632. predict error 0
  1633. dir: dir isU
  1634. /|\225: O: O450 (predict-no)
  1635. I see 1 and I'm going to do: predict-no
  1636. ENV: Agent did: predict-no for direction U in state State-A
  1637. In State-A moving U
  1638. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1639. predict error 0
  1640. dir: dir isL
  1641. -226: O: O452 (predict-no)
  1642. I see 1 and I'm going to do: predict-no
  1643. ENV: Agent did: predict-no for direction L in state State-A
  1644. In State-A moving L
  1645. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1646. predict error 0
  1647. dir: dir isU
  1648. /|\227: O: O454 (predict-no)
  1649. I see 1 and I'm going to do: predict-no
  1650. ENV: Agent did: predict-no for direction U in state State-A
  1651. In State-A moving U
  1652. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1653. predict error 0
  1654. dir: dir isR
  1655. -/|228: O: O455 (predict-yes)
  1656. I see 1 and I'm going to do: predict-yes
  1657. ENV: Agent did: predict-yes for direction R in state State-A
  1658. In State-A moving R
  1659. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1660. predict error 0
  1661. dir: dir isL
  1662. \-229: O: O457 (predict-yes)
  1663. I see 1 and I'm going to do: predict-yes
  1664. ENV: Agent did: predict-yes for direction L in state State-B
  1665. In State-B moving L
  1666. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1667. predict error 0
  1668. dir: dir isL
  1669. /|\230: O: O460 (predict-no)
  1670. I see 1 and I'm going to do: predict-no
  1671. ENV: Agent did: predict-no for direction L in state State-A
  1672. In State-A moving L
  1673. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1674. predict error 0
  1675. dir: dir isR
  1676. -/|231: O: O462 (predict-no)
  1677. I see 1 and I'm going to do: predict-no
  1678. ENV: Agent did: predict-no for direction R in state State-A
  1679. In State-A moving R
  1680. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1681. predict error 1
  1682. dir: dir isU
  1683. \232: O: O464 (predict-no)
  1684. I see 0 and I'm going to do: predict-no
  1685. ENV: Agent did: predict-no for direction U in state State-B
  1686. In State-B moving U
  1687. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1688. predict error 0
  1689. dir: dir isL
  1690. -/|233: O: O465 (predict-yes)
  1691. I see 1 and I'm going to do: predict-yes
  1692. ENV: Agent did: predict-yes for direction L in state State-B
  1693. In State-B moving L
  1694. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1695. predict error 0
  1696. dir: dir isU
  1697. \-234: O: O468 (predict-no)
  1698. I see 1 and I'm going to do: predict-no
  1699. ENV: Agent did: predict-no for direction U in state State-A
  1700. In State-A moving U
  1701. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1702. predict error 0
  1703. dir: dir isL
  1704. /|\235: O: O470 (predict-no)
  1705. I see 1 and I'm going to do: predict-no
  1706. ENV: Agent did: predict-no for direction L in state State-A
  1707. In State-A moving L
  1708. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1709. predict error 0
  1710. dir: dir isU
  1711. -/|236: O: O472 (predict-no)
  1712. I see 1 and I'm going to do: predict-no
  1713. ENV: Agent did: predict-no for direction U in state State-A
  1714. In State-A moving U
  1715. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1716. predict error 0
  1717. dir: dir isR
  1718. \-/|237: O: O473 (predict-yes)
  1719. I see 1 and I'm going to do: predict-yes
  1720. ENV: Agent did: predict-yes for direction R in state State-A
  1721. In State-A moving R
  1722. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1723. predict error 0
  1724. dir: dir isL
  1725. \-/238: O: O475 (predict-yes)
  1726. I see 1 and I'm going to do: predict-yes
  1727. ENV: Agent did: predict-yes for direction L in state State-B
  1728. In State-B moving L
  1729. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1730. predict error 0
  1731. dir: dir isL
  1732. |\-239: O: O478 (predict-no)
  1733. I see 1 and I'm going to do: predict-no
  1734. ENV: Agent did: predict-no for direction L in state State-A
  1735. In State-A moving L
  1736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1737. predict error 0
  1738. dir: dir isR
  1739. /|240: O: O479 (predict-yes)
  1740. I see 1 and I'm going to do: predict-yes
  1741. ENV: Agent did: predict-yes for direction R in state State-A
  1742. In State-A moving R
  1743. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1744. predict error 0
  1745. dir: dir isU
  1746. \-/241: O: O482 (predict-no)
  1747. I see 1 and I'm going to do: predict-no
  1748. ENV: Agent did: predict-no for direction U in state State-B
  1749. In State-B moving U
  1750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1751. predict error 0
  1752. dir: dir isU
  1753. |242: O: O484 (predict-no)
  1754. I see 1 and I'm going to do: predict-no
  1755. ENV: Agent did: predict-no for direction U in state State-B
  1756. In State-B moving U
  1757. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1758. predict error 0
  1759. dir: dir isL
  1760. \-/243: O: O485 (predict-yes)
  1761. I see 1 and I'm going to do: predict-yes
  1762. ENV: Agent did: predict-yes for direction L in state State-B
  1763. In State-B moving L
  1764. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1765. predict error 0
  1766. dir: dir isR
  1767. |\244: O: O487 (predict-yes)
  1768. I see 1 and I'm going to do: predict-yes
  1769. ENV: Agent did: predict-yes for direction R in state State-A
  1770. In State-A moving R
  1771. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1772. predict error 0
  1773. dir: dir isR
  1774. -/|245: O: O490 (predict-no)
  1775. I see 1 and I'm going to do: predict-no
  1776. ENV: Agent did: predict-no for direction R in state State-B
  1777. In State-B moving R
  1778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1779. predict error 0
  1780. dir: dir isR
  1781. \-/246: O: O492 (predict-no)
  1782. I see 1 and I'm going to do: predict-no
  1783. ENV: Agent did: predict-no for direction R in state State-B
  1784. In State-B moving R
  1785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1786. predict error 0
  1787. dir: dir isU
  1788. |\-247: O: O494 (predict-no)
  1789. I see 1 and I'm going to do: predict-no
  1790. ENV: Agent did: predict-no for direction U in state State-B
  1791. In State-B moving U
  1792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1793. predict error 0
  1794. dir: dir isL
  1795. /|\248: O: O495 (predict-yes)
  1796. I see 1 and I'm going to do: predict-yes
  1797. ENV: Agent did: predict-yes for direction L in state State-B
  1798. In State-B moving L
  1799. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1800. predict error 0
  1801. dir: dir isL
  1802. -/|249: O: O498 (predict-no)
  1803. I see 1 and I'm going to do: predict-no
  1804. ENV: Agent did: predict-no for direction L in state State-A
  1805. In State-A moving L
  1806. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1807. predict error 0
  1808. dir: dir isL
  1809. \-250: O: O500 (predict-no)
  1810. I see 1 and I'm going to do: predict-no
  1811. ENV: Agent did: predict-no for direction L in state State-A
  1812. In State-A moving L
  1813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1814. predict error 0
  1815. dir: dir isU
  1816. /|251: O: O502 (predict-no)
  1817. I see 1 and I'm going to do: predict-no
  1818. ENV: Agent did: predict-no for direction U in state State-A
  1819. In State-A moving U
  1820. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1821. predict error 0
  1822. dir: dir isR
  1823. \252: O: O503 (predict-yes)
  1824. I see 1 and I'm going to do: predict-yes
  1825. ENV: Agent did: predict-yes for direction R in state State-A
  1826. In State-A moving R
  1827. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1828. predict error 0
  1829. dir: dir isU
  1830. -/|253: O: O506 (predict-no)
  1831. I see 1 and I'm going to do: predict-no
  1832. ENV: Agent did: predict-no for direction U in state State-B
  1833. In State-B moving U
  1834. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1835. predict error 0
  1836. dir: dir isU
  1837. \-/254: O: O508 (predict-no)
  1838. I see 1 and I'm going to do: predict-no
  1839. ENV: Agent did: predict-no for direction U in state State-B
  1840. In State-B moving U
  1841. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1842. predict error 0
  1843. dir: dir isU
  1844. |\-255: O: O510 (predict-no)
  1845. I see 1 and I'm going to do: predict-no
  1846. ENV: Agent did: predict-no for direction U in state State-B
  1847. In State-B moving U
  1848. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1849. predict error 0
  1850. dir: dir isL
  1851. /|\256: O: O511 (predict-yes)
  1852. I see 1 and I'm going to do: predict-yes
  1853. ENV: Agent did: predict-yes for direction L in state State-B
  1854. In State-B moving L
  1855. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1856. predict error 0
  1857. dir: dir isU
  1858. -/|257: O: O514 (predict-no)
  1859. I see 1 and I'm going to do: predict-no
  1860. ENV: Agent did: predict-no for direction U in state State-A
  1861. In State-A moving U
  1862. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1863. predict error 0
  1864. dir: dir isU
  1865. \-/258: O: O516 (predict-no)
  1866. I see 1 and I'm going to do: predict-no
  1867. ENV: Agent did: predict-no for direction U in state State-A
  1868. In State-A moving U
  1869. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1870. predict error 0
  1871. dir: dir isR
  1872. |\259: O: O517 (predict-yes)
  1873. I see 1 and I'm going to do: predict-yes
  1874. ENV: Agent did: predict-yes for direction R in state State-A
  1875. In State-A moving R
  1876. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1877. predict error 0
  1878. dir: dir isU
  1879. -/|260: O: O519 (predict-yes)
  1880. I see 1 and I'm going to do: predict-yes
  1881. ENV: Agent did: predict-yes for direction U in state State-B
  1882. In State-B moving U
  1883. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1884. predict error 1
  1885. dir: dir isU
  1886. \-/261: O: O522 (predict-no)
  1887. I see 0 and I'm going to do: predict-no
  1888. ENV: Agent did: predict-no for direction U in state State-B
  1889. In State-B moving U
  1890. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1891. predict error 0
  1892. dir: dir isR
  1893. |262: O: O524 (predict-no)
  1894. I see 1 and I'm going to do: predict-no
  1895. ENV: Agent did: predict-no for direction R in state State-B
  1896. In State-B moving R
  1897. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1898. predict error 0
  1899. dir: dir isR
  1900. \-263: O: O526 (predict-no)
  1901. I see 1 and I'm going to do: predict-no
  1902. ENV: Agent did: predict-no for direction R in state State-B
  1903. In State-B moving R
  1904. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1905. predict error 0
  1906. dir: dir isR
  1907. /|\264: O: O528 (predict-no)
  1908. I see 1 and I'm going to do: predict-no
  1909. ENV: Agent did: predict-no for direction R in state State-B
  1910. In State-B moving R
  1911. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1912. predict error 0
  1913. dir: dir isL
  1914. -/|265: O: O529 (predict-yes)
  1915. I see 1 and I'm going to do: predict-yes
  1916. ENV: Agent did: predict-yes for direction L in state State-B
  1917. In State-B moving L
  1918. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1919. predict error 0
  1920. dir: dir isR
  1921. \-/266: O: O531 (predict-yes)
  1922. I see 1 and I'm going to do: predict-yes
  1923. ENV: Agent did: predict-yes for direction R in state State-A
  1924. In State-A moving R
  1925. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1926. predict error 0
  1927. dir: dir isL
  1928. |\267: O: O534 (predict-no)
  1929. I see 1 and I'm going to do: predict-no
  1930. ENV: Agent did: predict-no for direction L in state State-B
  1931. In State-B moving L
  1932. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1933. predict error 1
  1934. dir: dir isL
  1935. -/268: O: O536 (predict-no)
  1936. I see 0 and I'm going to do: predict-no
  1937. ENV: Agent did: predict-no for direction L in state State-A
  1938. In State-A moving L
  1939. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1940. predict error 0
  1941. dir: dir isR
  1942. |269: O: O537 (predict-yes)
  1943. I see 1 and I'm going to do: predict-yes
  1944. ENV: Agent did: predict-yes for direction R in state State-A
  1945. In State-A moving R
  1946. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1947. predict error 0
  1948. dir: dir isU
  1949. \-/270: O: O540 (predict-no)
  1950. I see 1 and I'm going to do: predict-no
  1951. ENV: Agent did: predict-no for direction U in state State-B
  1952. In State-B moving U
  1953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1954. predict error 0
  1955. dir: dir isU
  1956. |\-271: O: O542 (predict-no)
  1957. I see 1 and I'm going to do: predict-no
  1958. ENV: Agent did: predict-no for direction U in state State-B
  1959. In State-B moving U
  1960. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1961. predict error 0
  1962. dir: dir isR
  1963. /272: O: O543 (predict-yes)
  1964. I see 1 and I'm going to do: predict-yes
  1965. ENV: Agent did: predict-yes for direction R in state State-B
  1966. In State-B moving R
  1967. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1968. predict error 1
  1969. dir: dir isR
  1970. |\-273: O: O546 (predict-no)
  1971. I see 0 and I'm going to do: predict-no
  1972. ENV: Agent did: predict-no for direction R in state State-B
  1973. In State-B moving R
  1974. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1975. predict error 0
  1976. dir: dir isL
  1977. /|274: O: O547 (predict-yes)
  1978. I see 1 and I'm going to do: predict-yes
  1979. ENV: Agent did: predict-yes for direction L in state State-B
  1980. In State-B moving L
  1981. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1982. predict error 0
  1983. dir: dir isL
  1984. \-/275: O: O550 (predict-no)
  1985. I see 1 and I'm going to do: predict-no
  1986. ENV: Agent did: predict-no for direction L in state State-A
  1987. In State-A moving L
  1988. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1989. predict error 0
  1990. dir: dir isU
  1991. |\-276: O: O552 (predict-no)
  1992. I see 1 and I'm going to do: predict-no
  1993. ENV: Agent did: predict-no for direction U in state State-A
  1994. In State-A moving U
  1995. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1996. predict error 0
  1997. dir: dir isL
  1998. /|\277: O: O554 (predict-no)
  1999. I see 1 and I'm going to do: predict-no
  2000. ENV: Agent did: predict-no for direction L in state State-A
  2001. In State-A moving L
  2002. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2003. predict error 0
  2004. dir: dir isR
  2005. -/|278: O: O555 (predict-yes)
  2006. I see 1 and I'm going to do: predict-yes
  2007. ENV: Agent did: predict-yes for direction R in state State-A
  2008. In State-A moving R
  2009. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2010. predict error 0
  2011. dir: dir isR
  2012. \-/279: O: O558 (predict-no)
  2013. I see 1 and I'm going to do: predict-no
  2014. ENV: Agent did: predict-no for direction R in state State-B
  2015. In State-B moving R
  2016. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2017. predict error 0
  2018. dir: dir isL
  2019. |\-280: O: O559 (predict-yes)
  2020. I see 1 and I'm going to do: predict-yes
  2021. ENV: Agent did: predict-yes for direction L in state State-B
  2022. In State-B moving L
  2023. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2024. predict error 0
  2025. dir: dir isR
  2026. /281: O: O561 (predict-yes)
  2027. I see 1 and I'm going to do: predict-yes
  2028. ENV: Agent did: predict-yes for direction R in state State-A
  2029. In State-A moving R
  2030. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2031. predict error 0
  2032. dir: dir isL
  2033. |282: O: O564 (predict-no)
  2034. I see 1 and I'm going to do: predict-no
  2035. ENV: Agent did: predict-no for direction L in state State-B
  2036. In State-B moving L
  2037. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2038. predict error 1
  2039. dir: dir isL
  2040. \-/283: O: O565 (predict-yes)
  2041. I see 0 and I'm going to do: predict-yes
  2042. ENV: Agent did: predict-yes for direction L in state State-A
  2043. In State-A moving L
  2044. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2045. predict error 1
  2046. dir: dir isL
  2047. |\284: O: O568 (predict-no)
  2048. I see 0 and I'm going to do: predict-no
  2049. ENV: Agent did: predict-no for direction L in state State-A
  2050. In State-A moving L
  2051. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2052. predict error 0
  2053. dir: dir isR
  2054. -/|285: O: O569 (predict-yes)
  2055. I see 1 and I'm going to do: predict-yes
  2056. ENV: Agent did: predict-yes for direction R in state State-A
  2057. In State-A moving R
  2058. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2059. predict error 0
  2060. dir: dir isL
  2061. \-/286: O: O571 (predict-yes)
  2062. I see 1 and I'm going to do: predict-yes
  2063. ENV: Agent did: predict-yes for direction L in state State-B
  2064. In State-B moving L
  2065. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2066. predict error 0
  2067. dir: dir isR
  2068. |287: O: O573 (predict-yes)
  2069. I see 1 and I'm going to do: predict-yes
  2070. ENV: Agent did: predict-yes for direction R in state State-A
  2071. In State-A moving R
  2072. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2073. predict error 0
  2074. dir: dir isL
  2075. \-/288: O: O575 (predict-yes)
  2076. I see 1 and I'm going to do: predict-yes
  2077. ENV: Agent did: predict-yes for direction L in state State-B
  2078. In State-B moving L
  2079. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2080. predict error 0
  2081. dir: dir isR
  2082. |\289: O: O577 (predict-yes)
  2083. I see 1 and I'm going to do: predict-yes
  2084. ENV: Agent did: predict-yes for direction R in state State-A
  2085. In State-A moving R
  2086. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2087. predict error 0
  2088. dir: dir isL
  2089. -/|290: O: O579 (predict-yes)
  2090. I see 1 and I'm going to do: predict-yes
  2091. ENV: Agent did: predict-yes for direction L in state State-B
  2092. In State-B moving L
  2093. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2094. predict error 0
  2095. dir: dir isU
  2096. \-291: O: O582 (predict-no)
  2097. I see 1 and I'm going to do: predict-no
  2098. ENV: Agent did: predict-no for direction U in state State-A
  2099. In State-A moving U
  2100. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2101. predict error 0
  2102. dir: dir isR
  2103. /292: O: O583 (predict-yes)
  2104. I see 1 and I'm going to do: predict-yes
  2105. ENV: Agent did: predict-yes for direction R in state State-A
  2106. In State-A moving R
  2107. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2108. predict error 0
  2109. dir: dir isU
  2110. |\-293: O: O585 (predict-yes)
  2111. I see 1 and I'm going to do: predict-yes
  2112. ENV: Agent did: predict-yes for direction U in state State-B
  2113. In State-B moving U
  2114. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2115. predict error 1
  2116. dir: dir isU
  2117. /|\294: O: O587 (predict-yes)
  2118. I see 0 and I'm going to do: predict-yes
  2119. ENV: Agent did: predict-yes for direction U in state State-B
  2120. In State-B moving U
  2121. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2122. predict error 1
  2123. dir: dir isR
  2124. -/|295: O: O590 (predict-no)
  2125. I see 0 and I'm going to do: predict-no
  2126. ENV: Agent did: predict-no for direction R in state State-B
  2127. In State-B moving R
  2128. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2129. predict error 0
  2130. dir: dir isR
  2131. \-/296: O: O592 (predict-no)
  2132. I see 1 and I'm going to do: predict-no
  2133. ENV: Agent did: predict-no for direction R in state State-B
  2134. In State-B moving R
  2135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2136. predict error 0
  2137. dir: dir isU
  2138. |\-297: O: O594 (predict-no)
  2139. I see 1 and I'm going to do: predict-no
  2140. ENV: Agent did: predict-no for direction U in state State-B
  2141. In State-B moving U
  2142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2143. predict error 0
  2144. dir: dir isR
  2145. /|\298: O: O596 (predict-no)
  2146. I see 1 and I'm going to do: predict-no
  2147. ENV: Agent did: predict-no for direction R in state State-B
  2148. In State-B moving R
  2149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2150. predict error 0
  2151. dir: dir isL
  2152. -/|299: O: O597 (predict-yes)
  2153. I see 1 and I'm going to do: predict-yes
  2154. ENV: Agent did: predict-yes for direction L in state State-B
  2155. In State-B moving L
  2156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2157. predict error 0
  2158. dir: dir isU
  2159. \-300: O: O600 (predict-no)
  2160. I see 1 and I'm going to do: predict-no
  2161. ENV: Agent did: predict-no for direction U in state State-A
  2162. In State-A moving U
  2163. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2164. predict error 0
  2165. dir: dir isU
  2166. /|\-/|301: O: O602 (predict-no)
  2167. I see 1 and I'm going to do: predict-no
  2168. ENV: Agent did: predict-no for direction U in state State-A
  2169. In State-A moving U
  2170. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2171. predict error 0
  2172. dir: dir isU
  2173. \302: O: O604 (predict-no)
  2174. I see 1 and I'm going to do: predict-no
  2175. ENV: Agent did: predict-no for direction U in state State-A
  2176. In State-A moving U
  2177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2178. predict error 0
  2179. dir: dir isR
  2180. -/|303: O: O605 (predict-yes)
  2181. I see 1 and I'm going to do: predict-yes
  2182. ENV: Agent did: predict-yes for direction R in state State-A
  2183. In State-A moving R
  2184. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2185. predict error 0
  2186. dir: dir isR
  2187. \-/|304: O: O608 (predict-no)
  2188. I see 1 and I'm going to do: predict-no
  2189. ENV: Agent did: predict-no for direction R in state State-B
  2190. In State-B moving R
  2191. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2192. predict error 0
  2193. dir: dir isU
  2194. \305: O: O610 (predict-no)
  2195. I see 1 and I'm going to do: predict-no
  2196. ENV: Agent did: predict-no for direction U in state State-B
  2197. In State-B moving U
  2198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2199. predict error 0
  2200. dir: dir isR
  2201. -/306: O: O612 (predict-no)
  2202. I see 1 and I'm going to do: predict-no
  2203. ENV: Agent did: predict-no for direction R in state State-B
  2204. In State-B moving R
  2205. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2206. predict error 0
  2207. dir: dir isL
  2208. |\307: O: O613 (predict-yes)
  2209. I see 1 and I'm going to do: predict-yes
  2210. ENV: Agent did: predict-yes for direction L in state State-B
  2211. In State-B moving L
  2212. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2213. predict error 0
  2214. dir: dir isL
  2215. -/|308: O: O616 (predict-no)
  2216. I see 1 and I'm going to do: predict-no
  2217. ENV: Agent did: predict-no for direction L in state State-A
  2218. In State-A moving L
  2219. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2220. predict error 0
  2221. dir: dir isU
  2222. \-/309: O: O618 (predict-no)
  2223. I see 1 and I'm going to do: predict-no
  2224. ENV: Agent did: predict-no for direction U in state State-A
  2225. In State-A moving U
  2226. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2227. predict error 0
  2228. dir: dir isL
  2229. |\-310: O: O620 (predict-no)
  2230. I see 1 and I'm going to do: predict-no
  2231. ENV: Agent did: predict-no for direction L in state State-A
  2232. In State-A moving L
  2233. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2234. predict error 0
  2235. dir: dir isL
  2236. /|\311: O: O622 (predict-no)
  2237. I see 1 and I'm going to do: predict-no
  2238. ENV: Agent did: predict-no for direction L in state State-A
  2239. In State-A moving L
  2240. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2241. predict error 0
  2242. dir: dir isR
  2243. -312: O: O623 (predict-yes)
  2244. I see 1 and I'm going to do: predict-yes
  2245. ENV: Agent did: predict-yes for direction R in state State-A
  2246. In State-A moving R
  2247. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2248. predict error 0
  2249. dir: dir isR
  2250. /|\313: O: O626 (predict-no)
  2251. I see 1 and I'm going to do: predict-no
  2252. ENV: Agent did: predict-no for direction R in state State-B
  2253. In State-B moving R
  2254. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2255. predict error 0
  2256. dir: dir isR
  2257. -/|314: O: O628 (predict-no)
  2258. I see 1 and I'm going to do: predict-no
  2259. ENV: Agent did: predict-no for direction R in state State-B
  2260. In State-B moving R
  2261. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2262. predict error 0
  2263. dir: dir isR
  2264. \-/|315: O: O630 (predict-no)
  2265. I see 1 and I'm going to do: predict-no
  2266. ENV: Agent did: predict-no for direction R in state State-B
  2267. In State-B moving R
  2268. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2269. predict error 0
  2270. dir: dir isR
  2271. \-/316: O: O632 (predict-no)
  2272. I see 1 and I'm going to do: predict-no
  2273. ENV: Agent did: predict-no for direction R in state State-B
  2274. In State-B moving R
  2275. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2276. predict error 0
  2277. dir: dir isU
  2278. |\-317: O: O634 (predict-no)
  2279. I see 1 and I'm going to do: predict-no
  2280. ENV: Agent did: predict-no for direction U in state State-B
  2281. In State-B moving U
  2282. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2283. predict error 0
  2284. dir: dir isR
  2285. /|\318: O: O636 (predict-no)
  2286. I see 1 and I'm going to do: predict-no
  2287. ENV: Agent did: predict-no for direction R in state State-B
  2288. In State-B moving R
  2289. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2290. predict error 0
  2291. dir: dir isR
  2292. -/319: O: O638 (predict-no)
  2293. I see 1 and I'm going to do: predict-no
  2294. ENV: Agent did: predict-no for direction R in state State-B
  2295. In State-B moving R
  2296. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2297. predict error 0
  2298. dir: dir isU
  2299. |\-320: O: O640 (predict-no)
  2300. I see 1 and I'm going to do: predict-no
  2301. ENV: Agent did: predict-no for direction U in state State-B
  2302. In State-B moving U
  2303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2304. predict error 0
  2305. dir: dir isL
  2306. /|\321: O: O641 (predict-yes)
  2307. I see 1 and I'm going to do: predict-yes
  2308. ENV: Agent did: predict-yes for direction L in state State-B
  2309. In State-B moving L
  2310. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2311. predict error 0
  2312. dir: dir isU
  2313. -322: O: O644 (predict-no)
  2314. I see 1 and I'm going to do: predict-no
  2315. ENV: Agent did: predict-no for direction U in state State-A
  2316. In State-A moving U
  2317. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2318. predict error 0
  2319. dir: dir isR
  2320. /|\323: O: O645 (predict-yes)
  2321. I see 1 and I'm going to do: predict-yes
  2322. ENV: Agent did: predict-yes for direction R in state State-A
  2323. In State-A moving R
  2324. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2325. predict error 0
  2326. dir: dir isR
  2327. -/|324: O: O648 (predict-no)
  2328. I see 1 and I'm going to do: predict-no
  2329. ENV: Agent did: predict-no for direction R in state State-B
  2330. In State-B moving R
  2331. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2332. predict error 0
  2333. dir: dir isL
  2334. \-/325: O: O649 (predict-yes)
  2335. I see 1 and I'm going to do: predict-yes
  2336. ENV: Agent did: predict-yes for direction L in state State-B
  2337. In State-B moving L
  2338. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2339. predict error 0
  2340. dir: dir isU
  2341. |\-326: O: O652 (predict-no)
  2342. I see 1 and I'm going to do: predict-no
  2343. ENV: Agent did: predict-no for direction U in state State-A
  2344. In State-A moving U
  2345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2346. predict error 0
  2347. dir: dir isU
  2348. /|\327: O: O653 (predict-yes)
  2349. I see 1 and I'm going to do: predict-yes
  2350. ENV: Agent did: predict-yes for direction U in state State-A
  2351. In State-A moving U
  2352. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2353. predict error 1
  2354. dir: dir isU
  2355. -/|328: O: O656 (predict-no)
  2356. I see 0 and I'm going to do: predict-no
  2357. ENV: Agent did: predict-no for direction U in state State-A
  2358. In State-A moving U
  2359. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2360. predict error 0
  2361. dir: dir isR
  2362. \-/329: O: O657 (predict-yes)
  2363. I see 1 and I'm going to do: predict-yes
  2364. ENV: Agent did: predict-yes for direction R in state State-A
  2365. In State-A moving R
  2366. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2367. predict error 0
  2368. dir: dir isU
  2369. |\-330: O: O660 (predict-no)
  2370. I see 1 and I'm going to do: predict-no
  2371. ENV: Agent did: predict-no for direction U in state State-B
  2372. In State-B moving U
  2373. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2374. predict error 0
  2375. dir: dir isL
  2376. /|\331: O: O661 (predict-yes)
  2377. I see 1 and I'm going to do: predict-yes
  2378. ENV: Agent did: predict-yes for direction L in state State-B
  2379. In State-B moving L
  2380. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2381. predict error 0
  2382. dir: dir isR
  2383. -332: O: O663 (predict-yes)
  2384. I see 1 and I'm going to do: predict-yes
  2385. ENV: Agent did: predict-yes for direction R in state State-A
  2386. In State-A moving R
  2387. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2388. predict error 0
  2389. dir: dir isL
  2390. /333: O: O665 (predict-yes)
  2391. I see 1 and I'm going to do: predict-yes
  2392. ENV: Agent did: predict-yes for direction L in state State-B
  2393. In State-B moving L
  2394. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2395. predict error 0
  2396. dir: dir isL
  2397. |\-334: O: O668 (predict-no)
  2398. I see 1 and I'm going to do: predict-no
  2399. ENV: Agent did: predict-no for direction L in state State-A
  2400. In State-A moving L
  2401. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2402. predict error 0
  2403. dir: dir isU
  2404. /|\335: O: O670 (predict-no)
  2405. I see 1 and I'm going to do: predict-no
  2406. ENV: Agent did: predict-no for direction U in state State-A
  2407. In State-A moving U
  2408. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2409. predict error 0
  2410. dir: dir isL
  2411. -/|336: O: O672 (predict-no)
  2412. I see 1 and I'm going to do: predict-no
  2413. ENV: Agent did: predict-no for direction L in state State-A
  2414. In State-A moving L
  2415. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2416. predict error 0
  2417. dir: dir isL
  2418. \-/337: O: O674 (predict-no)
  2419. I see 1 and I'm going to do: predict-no
  2420. ENV: Agent did: predict-no for direction L in state State-A
  2421. In State-A moving L
  2422. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2423. predict error 0
  2424. dir: dir isL
  2425. |\-338: O: O676 (predict-no)
  2426. I see 1 and I'm going to do: predict-no
  2427. ENV: Agent did: predict-no for direction L in state State-A
  2428. In State-A moving L
  2429. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2430. predict error 0
  2431. dir: dir isR
  2432. /|\339: O: O677 (predict-yes)
  2433. I see 1 and I'm going to do: predict-yes
  2434. ENV: Agent did: predict-yes for direction R in state State-A
  2435. In State-A moving R
  2436. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2437. predict error 0
  2438. dir: dir isR
  2439. -/|340: O: O680 (predict-no)
  2440. I see 1 and I'm going to do: predict-no
  2441. ENV: Agent did: predict-no for direction R in state State-B
  2442. In State-B moving R
  2443. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2444. predict error 0
  2445. dir: dir isL
  2446. \-/341: O: O681 (predict-yes)
  2447. I see 1 and I'm going to do: predict-yes
  2448. ENV: Agent did: predict-yes for direction L in state State-B
  2449. In State-B moving L
  2450. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2451. predict error 0
  2452. dir: dir isU
  2453. |342: O: O684 (predict-no)
  2454. I see 1 and I'm going to do: predict-no
  2455. ENV: Agent did: predict-no for direction U in state State-A
  2456. In State-A moving U
  2457. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2458. predict error 0
  2459. dir: dir isU
  2460. \-/343: O: O686 (predict-no)
  2461. I see 1 and I'm going to do: predict-no
  2462. ENV: Agent did: predict-no for direction U in state State-A
  2463. In State-A moving U
  2464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2465. predict error 0
  2466. dir: dir isL
  2467. |\-/344: O: O688 (predict-no)
  2468. I see 1 and I'm going to do: predict-no
  2469. ENV: Agent did: predict-no for direction L in state State-A
  2470. In State-A moving L
  2471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2472. predict error 0
  2473. dir: dir isR
  2474. |\-345: O: O689 (predict-yes)
  2475. I see 1 and I'm going to do: predict-yes
  2476. ENV: Agent did: predict-yes for direction R in state State-A
  2477. In State-A moving R
  2478. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2479. predict error 0
  2480. dir: dir isU
  2481. /|\346: O: O692 (predict-no)
  2482. I see 1 and I'm going to do: predict-no
  2483. ENV: Agent did: predict-no for direction U in state State-B
  2484. In State-B moving U
  2485. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2486. predict error 0
  2487. dir: dir isU
  2488. -347: O: O694 (predict-no)
  2489. I see 1 and I'm going to do: predict-no
  2490. ENV: Agent did: predict-no for direction U in state State-B
  2491. In State-B moving U
  2492. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2493. predict error 0
  2494. dir: dir isR
  2495. /|348: O: O696 (predict-no)
  2496. I see 1 and I'm going to do: predict-no
  2497. ENV: Agent did: predict-no for direction R in state State-B
  2498. In State-B moving R
  2499. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2500. predict error 0
  2501. dir: dir isU
  2502. \-/349: O: O698 (predict-no)
  2503. I see 1 and I'm going to do: predict-no
  2504. ENV: Agent did: predict-no for direction U in state State-B
  2505. In State-B moving U
  2506. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2507. predict error 0
  2508. dir: dir isL
  2509. |\-350: O: O699 (predict-yes)
  2510. I see 1 and I'm going to do: predict-yes
  2511. ENV: Agent did: predict-yes for direction L in state State-B
  2512. In State-B moving L
  2513. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2514. predict error 0
  2515. dir: dir isR
  2516. /|351: O: O701 (predict-yes)
  2517. I see 1 and I'm going to do: predict-yes
  2518. ENV: Agent did: predict-yes for direction R in state State-A
  2519. In State-A moving R
  2520. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2521. predict error 0
  2522. dir: dir isR
  2523. \352: O: O704 (predict-no)
  2524. I see 1 and I'm going to do: predict-no
  2525. ENV: Agent did: predict-no for direction R in state State-B
  2526. In State-B moving R
  2527. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2528. predict error 0
  2529. dir: dir isU
  2530. -353: O: O706 (predict-no)
  2531. I see 1 and I'm going to do: predict-no
  2532. ENV: Agent did: predict-no for direction U in state State-B
  2533. In State-B moving U
  2534. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2535. predict error 0
  2536. dir: dir isL
  2537. /|\354: O: O707 (predict-yes)
  2538. I see 1 and I'm going to do: predict-yes
  2539. ENV: Agent did: predict-yes for direction L in state State-B
  2540. In State-B moving L
  2541. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2542. predict error 0
  2543. dir: dir isR
  2544. -/|355: O: O710 (predict-no)
  2545. I see 1 and I'm going to do: predict-no
  2546. ENV: Agent did: predict-no for direction R in state State-A
  2547. In State-A moving R
  2548. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2549. predict error 1
  2550. dir: dir isL
  2551. \-/356: O: O711 (predict-yes)
  2552. I see 0 and I'm going to do: predict-yes
  2553. ENV: Agent did: predict-yes for direction L in state State-B
  2554. In State-B moving L
  2555. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2556. predict error 0
  2557. dir: dir isR
  2558. |\-357: O: O713 (predict-yes)
  2559. I see 1 and I'm going to do: predict-yes
  2560. ENV: Agent did: predict-yes for direction R in state State-A
  2561. In State-A moving R
  2562. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2563. predict error 0
  2564. dir: dir isU
  2565. /|\358: O: O716 (predict-no)
  2566. I see 1 and I'm going to do: predict-no
  2567. ENV: Agent did: predict-no for direction U in state State-B
  2568. In State-B moving U
  2569. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2570. predict error 0
  2571. dir: dir isU
  2572. -/|359: O: O718 (predict-no)
  2573. I see 1 and I'm going to do: predict-no
  2574. ENV: Agent did: predict-no for direction U in state State-B
  2575. In State-B moving U
  2576. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2577. predict error 0
  2578. dir: dir isU
  2579. \-/|360: O: O720 (predict-no)
  2580. I see 1 and I'm going to do: predict-no
  2581. ENV: Agent did: predict-no for direction U in state State-B
  2582. In State-B moving U
  2583. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2584. predict error 0
  2585. dir: dir isL
  2586. \-/361: O: O721 (predict-yes)
  2587. I see 1 and I'm going to do: predict-yes
  2588. ENV: Agent did: predict-yes for direction L in state State-B
  2589. In State-B moving L
  2590. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2591. predict error 0
  2592. dir: dir isL
  2593. |362: O: O724 (predict-no)
  2594. I see 1 and I'm going to do: predict-no
  2595. ENV: Agent did: predict-no for direction L in state State-A
  2596. In State-A moving L
  2597. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2598. predict error 0
  2599. dir: dir isL
  2600. \-/363: O: O726 (predict-no)
  2601. I see 1 and I'm going to do: predict-no
  2602. ENV: Agent did: predict-no for direction L in state State-A
  2603. In State-A moving L
  2604. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2605. predict error 0
  2606. dir: dir isU
  2607. |\-364: O: O728 (predict-no)
  2608. I see 1 and I'm going to do: predict-no
  2609. ENV: Agent did: predict-no for direction U in state State-A
  2610. In State-A moving U
  2611. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2612. predict error 0
  2613. dir: dir isU
  2614. /|\365: O: O730 (predict-no)
  2615. I see 1 and I'm going to do: predict-no
  2616. ENV: Agent did: predict-no for direction U in state State-A
  2617. In State-A moving U
  2618. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2619. predict error 0
  2620. dir: dir isR
  2621. -/|366: O: O731 (predict-yes)
  2622. I see 1 and I'm going to do: predict-yes
  2623. ENV: Agent did: predict-yes for direction R in state State-A
  2624. In State-A moving R
  2625. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2626. predict error 0
  2627. dir: dir isU
  2628. \-/367: O: O734 (predict-no)
  2629. I see 1 and I'm going to do: predict-no
  2630. ENV: Agent did: predict-no for direction U in state State-B
  2631. In State-B moving U
  2632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2633. predict error 0
  2634. dir: dir isU
  2635. |\-368: O: O735 (predict-yes)
  2636. I see 1 and I'm going to do: predict-yes
  2637. ENV: Agent did: predict-yes for direction U in state State-B
  2638. In State-B moving U
  2639. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2640. predict error 1
  2641. dir: dir isL
  2642. /|\369: O: O737 (predict-yes)
  2643. I see 0 and I'm going to do: predict-yes
  2644. ENV: Agent did: predict-yes for direction L in state State-B
  2645. In State-B moving L
  2646. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2647. predict error 0
  2648. dir: dir isL
  2649. -/|370: O: O740 (predict-no)
  2650. I see 1 and I'm going to do: predict-no
  2651. ENV: Agent did: predict-no for direction L in state State-A
  2652. In State-A moving L
  2653. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2654. predict error 0
  2655. dir: dir isU
  2656. \-/371: O: O742 (predict-no)
  2657. I see 1 and I'm going to do: predict-no
  2658. ENV: Agent did: predict-no for direction U in state State-A
  2659. In State-A moving U
  2660. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2661. predict error 0
  2662. dir: dir isL
  2663. |372: O: O744 (predict-no)
  2664. I see 1 and I'm going to do: predict-no
  2665. ENV: Agent did: predict-no for direction L in state State-A
  2666. In State-A moving L
  2667. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2668. predict error 0
  2669. dir: dir isL
  2670. \-/373: O: O746 (predict-no)
  2671. I see 1 and I'm going to do: predict-no
  2672. ENV: Agent did: predict-no for direction L in state State-A
  2673. In State-A moving L
  2674. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2675. predict error 0
  2676. dir: dir isL
  2677. |\-374: O: O748 (predict-no)
  2678. I see 1 and I'm going to do: predict-no
  2679. ENV: Agent did: predict-no for direction L in state State-A
  2680. In State-A moving L
  2681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2682. predict error 0
  2683. dir: dir isL
  2684. /375: O: O750 (predict-no)
  2685. I see 1 and I'm going to do: predict-no
  2686. ENV: Agent did: predict-no for direction L in state State-A
  2687. In State-A moving L
  2688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2689. predict error 0
  2690. dir: dir isU
  2691. |\-376: O: O752 (predict-no)
  2692. I see 1 and I'm going to do: predict-no
  2693. ENV: Agent did: predict-no for direction U in state State-A
  2694. In State-A moving U
  2695. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2696. predict error 0
  2697. dir: dir isL
  2698. /|377: O: O754 (predict-no)
  2699. I see 1 and I'm going to do: predict-no
  2700. ENV: Agent did: predict-no for direction L in state State-A
  2701. In State-A moving L
  2702. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2703. predict error 0
  2704. dir: dir isL
  2705. \-/378: O: O756 (predict-no)
  2706. I see 1 and I'm going to do: predict-no
  2707. ENV: Agent did: predict-no for direction L in state State-A
  2708. In State-A moving L
  2709. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2710. predict error 0
  2711. dir: dir isL
  2712. |\-379: O: O758 (predict-no)
  2713. I see 1 and I'm going to do: predict-no
  2714. ENV: Agent did: predict-no for direction L in state State-A
  2715. In State-A moving L
  2716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2717. predict error 0
  2718. dir: dir isR
  2719. /|\380: O: O759 (predict-yes)
  2720. I see 1 and I'm going to do: predict-yes
  2721. ENV: Agent did: predict-yes for direction R in state State-A
  2722. In State-A moving R
  2723. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2724. predict error 0
  2725. dir: dir isU
  2726. -/|381: O: O762 (predict-no)
  2727. I see 1 and I'm going to do: predict-no
  2728. ENV: Agent did: predict-no for direction U in state State-B
  2729. In State-B moving U
  2730. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2731. predict error 0
  2732. dir: dir isR
  2733. \382: O: O764 (predict-no)
  2734. I see 1 and I'm going to do: predict-no
  2735. ENV: Agent did: predict-no for direction R in state State-B
  2736. In State-B moving R
  2737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2738. predict error 0
  2739. dir: dir isU
  2740. -/|383: O: O766 (predict-no)
  2741. I see 1 and I'm going to do: predict-no
  2742. ENV: Agent did: predict-no for direction U in state State-B
  2743. In State-B moving U
  2744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2745. predict error 0
  2746. dir: dir isR
  2747. \384: O: O768 (predict-no)
  2748. I see 1 and I'm going to do: predict-no
  2749. ENV: Agent did: predict-no for direction R in state State-B
  2750. In State-B moving R
  2751. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2752. predict error 0
  2753. dir: dir isR
  2754. -/385: O: O770 (predict-no)
  2755. I see 1 and I'm going to do: predict-no
  2756. ENV: Agent did: predict-no for direction R in state State-B
  2757. In State-B moving R
  2758. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2759. predict error 0
  2760. dir: dir isU
  2761. |\-386: O: O772 (predict-no)
  2762. I see 1 and I'm going to do: predict-no
  2763. ENV: Agent did: predict-no for direction U in state State-B
  2764. In State-B moving U
  2765. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2766. predict error 0
  2767. dir: dir isU
  2768. /|\387: O: O774 (predict-no)
  2769. I see 1 and I'm going to do: predict-no
  2770. ENV: Agent did: predict-no for direction U in state State-B
  2771. In State-B moving U
  2772. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2773. predict error 0
  2774. dir: dir isU
  2775. -/|388: O: O776 (predict-no)
  2776. I see 1 and I'm going to do: predict-no
  2777. ENV: Agent did: predict-no for direction U in state State-B
  2778. In State-B moving U
  2779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2780. predict error 0
  2781. dir: dir isU
  2782. \-389: O: O778 (predict-no)
  2783. I see 1 and I'm going to do: predict-no
  2784. ENV: Agent did: predict-no for direction U in state State-B
  2785. In State-B moving U
  2786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2787. predict error 0
  2788. dir: dir isU
  2789. /390: O: O780 (predict-no)
  2790. I see 1 and I'm going to do: predict-no
  2791. ENV: Agent did: predict-no for direction U in state State-B
  2792. In State-B moving U
  2793. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2794. predict error 0
  2795. dir: dir isU
  2796. |\-391: O: O782 (predict-no)
  2797. I see 1 and I'm going to do: predict-no
  2798. ENV: Agent did: predict-no for direction U in state State-B
  2799. In State-B moving U
  2800. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2801. predict error 0
  2802. dir: dir isL
  2803. /392: O: O784 (predict-no)
  2804. I see 1 and I'm going to do: predict-no
  2805. ENV: Agent did: predict-no for direction L in state State-B
  2806. In State-B moving L
  2807. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2808. predict error 1
  2809. dir: dir isR
  2810. |\-393: O: O785 (predict-yes)
  2811. I see 0 and I'm going to do: predict-yes
  2812. ENV: Agent did: predict-yes for direction R in state State-A
  2813. In State-A moving R
  2814. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2815. predict error 0
  2816. dir: dir isR
  2817. /|\394: O: O788 (predict-no)
  2818. I see 1 and I'm going to do: predict-no
  2819. ENV: Agent did: predict-no for direction R in state State-B
  2820. In State-B moving R
  2821. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2822. predict error 0
  2823. dir: dir isU
  2824. -/|395: O: O790 (predict-no)
  2825. I see 1 and I'm going to do: predict-no
  2826. ENV: Agent did: predict-no for direction U in state State-B
  2827. In State-B moving U
  2828. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2829. predict error 0
  2830. dir: dir isR
  2831. \-396: O: O792 (predict-no)
  2832. I see 1 and I'm going to do: predict-no
  2833. ENV: Agent did: predict-no for direction R in state State-B
  2834. In State-B moving R
  2835. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2836. predict error 0
  2837. dir: dir isU
  2838. /|\397: O: O794 (predict-no)
  2839. I see 1 and I'm going to do: predict-no
  2840. ENV: Agent did: predict-no for direction U in state State-B
  2841. In State-B moving U
  2842. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2843. predict error 0
  2844. dir: dir isR
  2845. -/|398: O: O796 (predict-no)
  2846. I see 1 and I'm going to do: predict-no
  2847. ENV: Agent did: predict-no for direction R in state State-B
  2848. In State-B moving R
  2849. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2850. predict error 0
  2851. dir: dir isR
  2852. \-/399: O: O798 (predict-no)
  2853. I see 1 and I'm going to do: predict-no
  2854. ENV: Agent did: predict-no for direction R in state State-B
  2855. In State-B moving R
  2856. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2857. predict error 0
  2858. dir: dir isU
  2859. |\-400: O: O800 (predict-no)
  2860. I see 1 and I'm going to do: predict-no
  2861. ENV: Agent did: predict-no for direction U in state State-B
  2862. In State-B moving U
  2863. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2864. predict error 0
  2865. dir: dir isU
  2866. /|\401: O: O802 (predict-no)
  2867. I see 1 and I'm going to do: predict-no
  2868. ENV: Agent did: predict-no for direction U in state State-B
  2869. In State-B moving U
  2870. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2871. predict error 0
  2872. dir: dir isR
  2873. -402: O: O804 (predict-no)
  2874. I see 1 and I'm going to do: predict-no
  2875. ENV: Agent did: predict-no for direction R in state State-B
  2876. In State-B moving R
  2877. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2878. predict error 0
  2879. dir: dir isL
  2880. /|\403: O: O805 (predict-yes)
  2881. I see 1 and I'm going to do: predict-yes
  2882. ENV: Agent did: predict-yes for direction L in state State-B
  2883. In State-B moving L
  2884. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2885. predict error 0
  2886. dir: dir isL
  2887. -/404: O: O808 (predict-no)
  2888. I see 1 and I'm going to do: predict-no
  2889. ENV: Agent did: predict-no for direction L in state State-A
  2890. In State-A moving L
  2891. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2892. predict error 0
  2893. dir: dir isR
  2894. |\405: O: O809 (predict-yes)
  2895. I see 1 and I'm going to do: predict-yes
  2896. ENV: Agent did: predict-yes for direction R in state State-A
  2897. In State-A moving R
  2898. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2899. predict error 0
  2900. dir: dir isL
  2901. -/|406: O: O811 (predict-yes)
  2902. I see 1 and I'm going to do: predict-yes
  2903. ENV: Agent did: predict-yes for direction L in state State-B
  2904. In State-B moving L
  2905. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2906. predict error 0
  2907. dir: dir isL
  2908. \-407: O: O813 (predict-yes)
  2909. I see 1 and I'm going to do: predict-yes
  2910. ENV: Agent did: predict-yes for direction L in state State-A
  2911. In State-A moving L
  2912. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2913. predict error 1
  2914. dir: dir isU
  2915. /|408: O: O816 (predict-no)
  2916. I see 0 and I'm going to do: predict-no
  2917. ENV: Agent did: predict-no for direction U in state State-A
  2918. In State-A moving U
  2919. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2920. predict error 0
  2921. dir: dir isU
  2922. \-/409: O: O818 (predict-no)
  2923. I see 1 and I'm going to do: predict-no
  2924. ENV: Agent did: predict-no for direction U in state State-A
  2925. In State-A moving U
  2926. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2927. predict error 0
  2928. dir: dir isL
  2929. |\-410: O: O820 (predict-no)
  2930. I see 1 and I'm going to do: predict-no
  2931. ENV: Agent did: predict-no for direction L in state State-A
  2932. In State-A moving L
  2933. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2934. predict error 0
  2935. dir: dir isR
  2936. /|411: O: O821 (predict-yes)
  2937. I see 1 and I'm going to do: predict-yes
  2938. ENV: Agent did: predict-yes for direction R in state State-A
  2939. In State-A moving R
  2940. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2941. predict error 0
  2942. dir: dir isU
  2943. \412: O: O824 (predict-no)
  2944. I see 1 and I'm going to do: predict-no
  2945. ENV: Agent did: predict-no for direction U in state State-B
  2946. In State-B moving U
  2947. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2948. predict error 0
  2949. dir: dir isL
  2950. -/|413: O: O825 (predict-yes)
  2951. I see 1 and I'm going to do: predict-yes
  2952. ENV: Agent did: predict-yes for direction L in state State-B
  2953. In State-B moving L
  2954. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2955. predict error 0
  2956. dir: dir isR
  2957. \-/414: O: O827 (predict-yes)
  2958. I see 1 and I'm going to do: predict-yes
  2959. ENV: Agent did: predict-yes for direction R in state State-A
  2960. In State-A moving R
  2961. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2962. predict error 0
  2963. dir: dir isL
  2964. |\-415: O: O829 (predict-yes)
  2965. I see 1 and I'm going to do: predict-yes
  2966. ENV: Agent did: predict-yes for direction L in state State-B
  2967. In State-B moving L
  2968. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2969. predict error 0
  2970. dir: dir isL
  2971. /|\416: O: O832 (predict-no)
  2972. I see 1 and I'm going to do: predict-no
  2973. ENV: Agent did: predict-no for direction L in state State-A
  2974. In State-A moving L
  2975. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2976. predict error 0
  2977. dir: dir isU
  2978. -/|417: O: O834 (predict-no)
  2979. I see 1 and I'm going to do: predict-no
  2980. ENV: Agent did: predict-no for direction U in state State-A
  2981. In State-A moving U
  2982. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2983. predict error 0
  2984. dir: dir isL
  2985. \-/418: O: O836 (predict-no)
  2986. I see 1 and I'm going to do: predict-no
  2987. ENV: Agent did: predict-no for direction L in state State-A
  2988. In State-A moving L
  2989. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2990. predict error 0
  2991. dir: dir isL
  2992. |\-419: O: O838 (predict-no)
  2993. I see 1 and I'm going to do: predict-no
  2994. ENV: Agent did: predict-no for direction L in state State-A
  2995. In State-A moving L
  2996. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2997. predict error 0
  2998. dir: dir isR
  2999. /|\420: O: O839 (predict-yes)
  3000. I see 1 and I'm going to do: predict-yes
  3001. ENV: Agent did: predict-yes for direction R in state State-A
  3002. In State-A moving R
  3003. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3004. predict error 0
  3005. dir: dir isR
  3006. -/421: O: O842 (predict-no)
  3007. I see 1 and I'm going to do: predict-no
  3008. ENV: Agent did: predict-no for direction R in state State-B
  3009. In State-B moving R
  3010. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3011. predict error 0
  3012. dir: dir isU
  3013. |422: O: O843 (predict-yes)
  3014. I see 1 and I'm going to do: predict-yes
  3015. ENV: Agent did: predict-yes for direction U in state State-B
  3016. In State-B moving U
  3017. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3018. predict error 1
  3019. dir: dir isU
  3020. \-/423: O: O846 (predict-no)
  3021. I see 0 and I'm going to do: predict-no
  3022. ENV: Agent did: predict-no for direction U in state State-B
  3023. In State-B moving U
  3024. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3025. predict error 0
  3026. dir: dir isU
  3027. |\-424: O: O848 (predict-no)
  3028. I see 1 and I'm going to do: predict-no
  3029. ENV: Agent did: predict-no for direction U in state State-B
  3030. In State-B moving U
  3031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3032. predict error 0
  3033. dir: dir isL
  3034. /|\425: O: O850 (predict-no)
  3035. I see 1 and I'm going to do: predict-no
  3036. ENV: Agent did: predict-no for direction L in state State-B
  3037. In State-B moving L
  3038. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3039. predict error 1
  3040. dir: dir isU
  3041. -/|426: O: O852 (predict-no)
  3042. I see 0 and I'm going to do: predict-no
  3043. ENV: Agent did: predict-no for direction U in state State-A
  3044. In State-A moving U
  3045. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3046. predict error 0
  3047. dir: dir isR
  3048. \-427: O: O853 (predict-yes)
  3049. I see 1 and I'm going to do: predict-yes
  3050. ENV: Agent did: predict-yes for direction R in state State-A
  3051. In State-A moving R
  3052. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3053. predict error 0
  3054. dir: dir isR
  3055. /|428: O: O856 (predict-no)
  3056. I see 1 and I'm going to do: predict-no
  3057. ENV: Agent did: predict-no for direction R in state State-B
  3058. In State-B moving R
  3059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3060. predict error 0
  3061. dir: dir isR
  3062. \-429: O: O858 (predict-no)
  3063. I see 1 and I'm going to do: predict-no
  3064. ENV: Agent did: predict-no for direction R in state State-B
  3065. In State-B moving R
  3066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3067. predict error 0
  3068. dir: dir isL
  3069. /|\430: O: O860 (predict-no)
  3070. I see 1 and I'm going to do: predict-no
  3071. ENV: Agent did: predict-no for direction L in state State-B
  3072. In State-B moving L
  3073. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3074. predict error 1
  3075. dir: dir isR
  3076. -/|431: O: O861 (predict-yes)
  3077. I see 0 and I'm going to do: predict-yes
  3078. ENV: Agent did: predict-yes for direction R in state State-A
  3079. In State-A moving R
  3080. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3081. predict error 0
  3082. dir: dir isL
  3083. \432: O: O863 (predict-yes)
  3084. I see 1 and I'm going to do: predict-yes
  3085. ENV: Agent did: predict-yes for direction L in state State-B
  3086. In State-B moving L
  3087. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3088. predict error 0
  3089. dir: dir isL
  3090. -/|433: O: O866 (predict-no)
  3091. I see 1 and I'm going to do: predict-no
  3092. ENV: Agent did: predict-no for direction L in state State-A
  3093. In State-A moving L
  3094. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3095. predict error 0
  3096. dir: dir isR
  3097. \-434: O: O868 (predict-no)
  3098. I see 1 and I'm going to do: predict-no
  3099. ENV: Agent did: predict-no for direction R in state State-A
  3100. In State-A moving R
  3101. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3102. predict error 1
  3103. dir: dir isR
  3104. /|435: O: O870 (predict-no)
  3105. I see 0 and I'm going to do: predict-no
  3106. ENV: Agent did: predict-no for direction R in state State-B
  3107. In State-B moving R
  3108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3109. predict error 0
  3110. dir: dir isL
  3111. \-/436: O: O871 (predict-yes)
  3112. I see 1 and I'm going to do: predict-yes
  3113. ENV: Agent did: predict-yes for direction L in state State-B
  3114. In State-B moving L
  3115. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3116. predict error 0
  3117. dir: dir isR
  3118. |\-437: O: O873 (predict-yes)
  3119. I see 1 and I'm going to do: predict-yes
  3120. ENV: Agent did: predict-yes for direction R in state State-A
  3121. In State-A moving R
  3122. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3123. predict error 0
  3124. dir: dir isR
  3125. /|438: O: O876 (predict-no)
  3126. I see 1 and I'm going to do: predict-no
  3127. ENV: Agent did: predict-no for direction R in state State-B
  3128. In State-B moving R
  3129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3130. predict error 0
  3131. dir: dir isR
  3132. \-/439: O: O878 (predict-no)
  3133. I see 1 and I'm going to do: predict-no
  3134. ENV: Agent did: predict-no for direction R in state State-B
  3135. In State-B moving R
  3136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3137. predict error 0
  3138. dir: dir isU
  3139. |\-440: O: O879 (predict-yes)
  3140. I see 1 and I'm going to do: predict-yes
  3141. ENV: Agent did: predict-yes for direction U in state State-B
  3142. In State-B moving U
  3143. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3144. predict error 1
  3145. dir: dir isR
  3146. /|\441: O: O882 (predict-no)
  3147. I see 0 and I'm going to do: predict-no
  3148. ENV: Agent did: predict-no for direction R in state State-B
  3149. In State-B moving R
  3150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3151. predict error 0
  3152. dir: dir isU
  3153. -442: O: O884 (predict-no)
  3154. I see 1 and I'm going to do: predict-no
  3155. ENV: Agent did: predict-no for direction U in state State-B
  3156. In State-B moving U
  3157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3158. predict error 0
  3159. dir: dir isR
  3160. /|\443: O: O886 (predict-no)
  3161. I see 1 and I'm going to do: predict-no
  3162. ENV: Agent did: predict-no for direction R in state State-B
  3163. In State-B moving R
  3164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3165. predict error 0
  3166. dir: dir isR
  3167. -/|444: O: O888 (predict-no)
  3168. I see 1 and I'm going to do: predict-no
  3169. ENV: Agent did: predict-no for direction R in state State-B
  3170. In State-B moving R
  3171. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3172. predict error 0
  3173. dir: dir isR
  3174. \-445: O: O890 (predict-no)
  3175. I see 1 and I'm going to do: predict-no
  3176. ENV: Agent did: predict-no for direction R in state State-B
  3177. In State-B moving R
  3178. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3179. predict error 0
  3180. dir: dir isR
  3181. /|\446: O: O892 (predict-no)
  3182. I see 1 and I'm going to do: predict-no
  3183. ENV: Agent did: predict-no for direction R in state State-B
  3184. In State-B moving R
  3185. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3186. predict error 0
  3187. dir: dir isL
  3188. -/|447: O: O893 (predict-yes)
  3189. I see 1 and I'm going to do: predict-yes
  3190. ENV: Agent did: predict-yes for direction L in state State-B
  3191. In State-B moving L
  3192. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3193. predict error 0
  3194. dir: dir isU
  3195. \-/448: O: O896 (predict-no)
  3196. I see 1 and I'm going to do: predict-no
  3197. ENV: Agent did: predict-no for direction U in state State-A
  3198. In State-A moving U
  3199. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3200. predict error 0
  3201. dir: dir isR
  3202. |\-/449: O: O897 (predict-yes)
  3203. I see 1 and I'm going to do: predict-yes
  3204. ENV: Agent did: predict-yes for direction R in state State-A
  3205. In State-A moving R
  3206. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3207. predict error 0
  3208. dir: dir isU
  3209. |\-450: O: O900 (predict-no)
  3210. I see 1 and I'm going to do: predict-no
  3211. ENV: Agent did: predict-no for direction U in state State-B
  3212. In State-B moving U
  3213. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3214. predict error 0
  3215. dir: dir isL
  3216. /|\451: O: O901 (predict-yes)
  3217. I see 1 and I'm going to do: predict-yes
  3218. ENV: Agent did: predict-yes for direction L in state State-B
  3219. In State-B moving L
  3220. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3221. predict error 0
  3222. dir: dir isU
  3223. -452: O: O904 (predict-no)
  3224. I see 1 and I'm going to do: predict-no
  3225. ENV: Agent did: predict-no for direction U in state State-A
  3226. In State-A moving U
  3227. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3228. predict error 0
  3229. dir: dir isU
  3230. /|\453: O: O906 (predict-no)
  3231. I see 1 and I'm going to do: predict-no
  3232. ENV: Agent did: predict-no for direction U in state State-A
  3233. In State-A moving U
  3234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3235. predict error 0
  3236. dir: dir isU
  3237. -/454: O: O908 (predict-no)
  3238. I see 1 and I'm going to do: predict-no
  3239. ENV: Agent did: predict-no for direction U in state State-A
  3240. In State-A moving U
  3241. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3242. predict error 0
  3243. dir: dir isU
  3244. |\-455: O: O910 (predict-no)
  3245. I see 1 and I'm going to do: predict-no
  3246. ENV: Agent did: predict-no for direction U in state State-A
  3247. In State-A moving U
  3248. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3249. predict error 0
  3250. dir: dir isU
  3251. /|\456: O: O912 (predict-no)
  3252. I see 1 and I'm going to do: predict-no
  3253. ENV: Agent did: predict-no for direction U in state State-A
  3254. In State-A moving U
  3255. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3256. predict error 0
  3257. dir: dir isU
  3258. -/457: O: O914 (predict-no)
  3259. I see 1 and I'm going to do: predict-no
  3260. ENV: Agent did: predict-no for direction U in state State-A
  3261. In State-A moving U
  3262. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3263. predict error 0
  3264. dir: dir isR
  3265. |458: O: O915 (predict-yes)
  3266. I see 1 and I'm going to do: predict-yes
  3267. ENV: Agent did: predict-yes for direction R in state State-A
  3268. In State-A moving R
  3269. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3270. predict error 0
  3271. dir: dir isU
  3272. \-/459: O: O918 (predict-no)
  3273. I see 1 and I'm going to do: predict-no
  3274. ENV: Agent did: predict-no for direction U in state State-B
  3275. In State-B moving U
  3276. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3277. predict error 0
  3278. dir: dir isL
  3279. |\-460: O: O919 (predict-yes)
  3280. I see 1 and I'm going to do: predict-yes
  3281. ENV: Agent did: predict-yes for direction L in state State-B
  3282. In State-B moving L
  3283. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3284. predict error 0
  3285. dir: dir isU
  3286. /|\461: O: O922 (predict-no)
  3287. I see 1 and I'm going to do: predict-no
  3288. ENV: Agent did: predict-no for direction U in state State-A
  3289. In State-A moving U
  3290. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3291. predict error 0
  3292. dir: dir isR
  3293. -462: O: O923 (predict-yes)
  3294. I see 1 and I'm going to do: predict-yes
  3295. ENV: Agent did: predict-yes for direction R in state State-A
  3296. In State-A moving R
  3297. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3298. predict error 0
  3299. dir: dir isU
  3300. /|463: O: O926 (predict-no)
  3301. I see 1 and I'm going to do: predict-no
  3302. ENV: Agent did: predict-no for direction U in state State-B
  3303. In State-B moving U
  3304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3305. predict error 0
  3306. dir: dir isR
  3307. \-/464: O: O928 (predict-no)
  3308. I see 1 and I'm going to do: predict-no
  3309. ENV: Agent did: predict-no for direction R in state State-B
  3310. In State-B moving R
  3311. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3312. predict error 0
  3313. dir: dir isU
  3314. |\465: O: O930 (predict-no)
  3315. I see 1 and I'm going to do: predict-no
  3316. ENV: Agent did: predict-no for direction U in state State-B
  3317. In State-B moving U
  3318. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3319. predict error 0
  3320. dir: dir isL
  3321. -/|466: O: O931 (predict-yes)
  3322. I see 1 and I'm going to do: predict-yes
  3323. ENV: Agent did: predict-yes for direction L in state State-B
  3324. In State-B moving L
  3325. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3326. predict error 0
  3327. dir: dir isL
  3328. \-/467: O: O934 (predict-no)
  3329. I see 1 and I'm going to do: predict-no
  3330. ENV: Agent did: predict-no for direction L in state State-A
  3331. In State-A moving L
  3332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3333. predict error 0
  3334. dir: dir isU
  3335. |\-468: O: O936 (predict-no)
  3336. I see 1 and I'm going to do: predict-no
  3337. ENV: Agent did: predict-no for direction U in state State-A
  3338. In State-A moving U
  3339. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3340. predict error 0
  3341. dir: dir isR
  3342. /|\469: O: O937 (predict-yes)
  3343. I see 1 and I'm going to do: predict-yes
  3344. ENV: Agent did: predict-yes for direction R in state State-A
  3345. In State-A moving R
  3346. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3347. predict error 0
  3348. dir: dir isU
  3349. -/470: O: O940 (predict-no)
  3350. I see 1 and I'm going to do: predict-no
  3351. ENV: Agent did: predict-no for direction U in state State-B
  3352. In State-B moving U
  3353. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3354. predict error 0
  3355. dir: dir isU
  3356. |\471: O: O942 (predict-no)
  3357. I see 1 and I'm going to do: predict-no
  3358. ENV: Agent did: predict-no for direction U in state State-B
  3359. In State-B moving U
  3360. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3361. predict error 0
  3362. dir: dir isR
  3363. -472: O: O944 (predict-no)
  3364. I see 1 and I'm going to do: predict-no
  3365. ENV: Agent did: predict-no for direction R in state State-B
  3366. In State-B moving R
  3367. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3368. predict error 0
  3369. dir: dir isR
  3370. /|\473: O: O946 (predict-no)
  3371. I see 1 and I'm going to do: predict-no
  3372. ENV: Agent did: predict-no for direction R in state State-B
  3373. In State-B moving R
  3374. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3375. predict error 0
  3376. dir: dir isL
  3377. -/|474: O: O947 (predict-yes)
  3378. I see 1 and I'm going to do: predict-yes
  3379. ENV: Agent did: predict-yes for direction L in state State-B
  3380. In State-B moving L
  3381. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3382. predict error 0
  3383. dir: dir isL
  3384. \-/475: O: O950 (predict-no)
  3385. I see 1 and I'm going to do: predict-no
  3386. ENV: Agent did: predict-no for direction L in state State-A
  3387. In State-A moving L
  3388. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3389. predict error 0
  3390. dir: dir isU
  3391. |\476: O: O952 (predict-no)
  3392. I see 1 and I'm going to do: predict-no
  3393. ENV: Agent did: predict-no for direction U in state State-A
  3394. In State-A moving U
  3395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3396. predict error 0
  3397. dir: dir isU
  3398. -/|477: O: O954 (predict-no)
  3399. I see 1 and I'm going to do: predict-no
  3400. ENV: Agent did: predict-no for direction U in state State-A
  3401. In State-A moving U
  3402. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3403. predict error 0
  3404. dir: dir isU
  3405. \-/478: O: O956 (predict-no)
  3406. I see 1 and I'm going to do: predict-no
  3407. ENV: Agent did: predict-no for direction U in state State-A
  3408. In State-A moving U
  3409. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3410. predict error 0
  3411. dir: dir isU
  3412. |\479: O: O958 (predict-no)
  3413. I see 1 and I'm going to do: predict-no
  3414. ENV: Agent did: predict-no for direction U in state State-A
  3415. In State-A moving U
  3416. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3417. predict error 0
  3418. dir: dir isR
  3419. -/480: O: O959 (predict-yes)
  3420. I see 1 and I'm going to do: predict-yes
  3421. ENV: Agent did: predict-yes for direction R in state State-A
  3422. In State-A moving R
  3423. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3424. predict error 0
  3425. dir: dir isL
  3426. |481: O: O961 (predict-yes)
  3427. I see 1 and I'm going to do: predict-yes
  3428. ENV: Agent did: predict-yes for direction L in state State-B
  3429. In State-B moving L
  3430. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3431. predict error 0
  3432. dir: dir isL
  3433. \482: O: O964 (predict-no)
  3434. I see 1 and I'm going to do: predict-no
  3435. ENV: Agent did: predict-no for direction L in state State-A
  3436. In State-A moving L
  3437. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3438. predict error 0
  3439. dir: dir isR
  3440. -/|483: O: O965 (predict-yes)
  3441. I see 1 and I'm going to do: predict-yes
  3442. ENV: Agent did: predict-yes for direction R in state State-A
  3443. In State-A moving R
  3444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3445. predict error 0
  3446. dir: dir isR
  3447. \-/484: O: O968 (predict-no)
  3448. I see 1 and I'm going to do: predict-no
  3449. ENV: Agent did: predict-no for direction R in state State-B
  3450. In State-B moving R
  3451. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3452. predict error 0
  3453. dir: dir isU
  3454. |\-485: O: O970 (predict-no)
  3455. I see 1 and I'm going to do: predict-no
  3456. ENV: Agent did: predict-no for direction U in state State-B
  3457. In State-B moving U
  3458. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3459. predict error 0
  3460. dir: dir isU
  3461. /|\-486: O: O972 (predict-no)
  3462. I see 1 and I'm going to do: predict-no
  3463. ENV: Agent did: predict-no for direction U in state State-B
  3464. In State-B moving U
  3465. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3466. predict error 0
  3467. dir: dir isR
  3468. /|487: O: O974 (predict-no)
  3469. I see 1 and I'm going to do: predict-no
  3470. ENV: Agent did: predict-no for direction R in state State-B
  3471. In State-B moving R
  3472. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3473. predict error 0
  3474. dir: dir isL
  3475. \-488: O: O975 (predict-yes)
  3476. I see 1 and I'm going to do: predict-yes
  3477. ENV: Agent did: predict-yes for direction L in state State-B
  3478. In State-B moving L
  3479. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3480. predict error 0
  3481. dir: dir isU
  3482. /|\489: O: O978 (predict-no)
  3483. I see 1 and I'm going to do: predict-no
  3484. ENV: Agent did: predict-no for direction U in state State-A
  3485. In State-A moving U
  3486. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3487. predict error 0
  3488. dir: dir isU
  3489. -/|490: O: O979 (predict-yes)
  3490. I see 1 and I'm going to do: predict-yes
  3491. ENV: Agent did: predict-yes for direction U in state State-A
  3492. In State-A moving U
  3493. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3494. predict error 1
  3495. dir: dir isL
  3496. \-/491: O: O982 (predict-no)
  3497. I see 0 and I'm going to do: predict-no
  3498. ENV: Agent did: predict-no for direction L in state State-A
  3499. In State-A moving L
  3500. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3501. predict error 0
  3502. dir: dir isR
  3503. |492: O: O983 (predict-yes)
  3504. I see 1 and I'm going to do: predict-yes
  3505. ENV: Agent did: predict-yes for direction R in state State-A
  3506. In State-A moving R
  3507. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3508. predict error 0
  3509. dir: dir isU
  3510. \-/493: O: O986 (predict-no)
  3511. I see 1 and I'm going to do: predict-no
  3512. ENV: Agent did: predict-no for direction U in state State-B
  3513. In State-B moving U
  3514. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3515. predict error 0
  3516. dir: dir isL
  3517. |494: O: O987 (predict-yes)
  3518. I see 1 and I'm going to do: predict-yes
  3519. ENV: Agent did: predict-yes for direction L in state State-B
  3520. In State-B moving L
  3521. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3522. predict error 0
  3523. dir: dir isU
  3524. \-/495: O: O990 (predict-no)
  3525. I see 1 and I'm going to do: predict-no
  3526. ENV: Agent did: predict-no for direction U in state State-A
  3527. In State-A moving U
  3528. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3529. predict error 0
  3530. dir: dir isU
  3531. |\-496: O: O992 (predict-no)
  3532. I see 1 and I'm going to do: predict-no
  3533. ENV: Agent did: predict-no for direction U in state State-A
  3534. In State-A moving U
  3535. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3536. predict error 0
  3537. dir: dir isU
  3538. /|\497: O: O994 (predict-no)
  3539. I see 1 and I'm going to do: predict-no
  3540. ENV: Agent did: predict-no for direction U in state State-A
  3541. In State-A moving U
  3542. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3543. predict error 0
  3544. dir: dir isL
  3545. -/|498: O: O996 (predict-no)
  3546. I see 1 and I'm going to do: predict-no
  3547. ENV: Agent did: predict-no for direction L in state State-A
  3548. In State-A moving L
  3549. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3550. predict error 0
  3551. dir: dir isL
  3552. \-/499: O: O998 (predict-no)
  3553. I see 1 and I'm going to do: predict-no
  3554. ENV: Agent did: predict-no for direction L in state State-A
  3555. In State-A moving L
  3556. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3557. predict error 0
  3558. dir: dir isR
  3559. |\-500: O: O999 (predict-yes)
  3560. I see 1 and I'm going to do: predict-yes
  3561. ENV: Agent did: predict-yes for direction R in state State-A
  3562. In State-A moving R
  3563. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3564. predict error 0
  3565. dir: dir isL
  3566. /|\-/|\501: O: O1001 (predict-yes)
  3567. I see 1 and I'm going to do: predict-yes
  3568. ENV: Agent did: predict-yes for direction L in state State-B
  3569. In State-B moving L
  3570. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3571. predict error 0
  3572. dir: dir isR
  3573. -502: O: O1003 (predict-yes)
  3574. I see 1 and I'm going to do: predict-yes
  3575. ENV: Agent did: predict-yes for direction R in state State-A
  3576. In State-A moving R
  3577. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3578. predict error 0
  3579. dir: dir isL
  3580. /|\503: O: O1005 (predict-yes)
  3581. I see 1 and I'm going to do: predict-yes
  3582. ENV: Agent did: predict-yes for direction L in state State-B
  3583. In State-B moving L
  3584. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3585. predict error 0
  3586. dir: dir isU
  3587. -504: O: O1008 (predict-no)
  3588. I see 1 and I'm going to do: predict-no
  3589. ENV: Agent did: predict-no for direction U in state State-A
  3590. In State-A moving U
  3591. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3592. predict error 0
  3593. dir: dir isU
  3594. /|505: O: O1010 (predict-no)
  3595. I see 1 and I'm going to do: predict-no
  3596. ENV: Agent did: predict-no for direction U in state State-A
  3597. In State-A moving U
  3598. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3599. predict error 0
  3600. dir: dir isL
  3601. \-506: O: O1012 (predict-no)
  3602. I see 1 and I'm going to do: predict-no
  3603. ENV: Agent did: predict-no for direction L in state State-A
  3604. In State-A moving L
  3605. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3606. predict error 0
  3607. dir: dir isU
  3608. /507: O: O1014 (predict-no)
  3609. I see 1 and I'm going to do: predict-no
  3610. ENV: Agent did: predict-no for direction U in state State-A
  3611. In State-A moving U
  3612. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3613. predict error 0
  3614. dir: dir isL
  3615. |\-508: O: O1016 (predict-no)
  3616. I see 1 and I'm going to do: predict-no
  3617. ENV: Agent did: predict-no for direction L in state State-A
  3618. In State-A moving L
  3619. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3620. predict error 0
  3621. dir: dir isL
  3622. /|\509: O: O1018 (predict-no)
  3623. I see 1 and I'm going to do: predict-no
  3624. ENV: Agent did: predict-no for direction L in state State-A
  3625. In State-A moving L
  3626. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3627. predict error 0
  3628. dir: dir isU
  3629. -/|510: O: O1020 (predict-no)
  3630. I see 1 and I'm going to do: predict-no
  3631. ENV: Agent did: predict-no for direction U in state State-A
  3632. In State-A moving U
  3633. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3634. predict error 0
  3635. dir: dir isU
  3636. \-511: O: O1022 (predict-no)
  3637. I see 1 and I'm going to do: predict-no
  3638. ENV: Agent did: predict-no for direction U in state State-A
  3639. In State-A moving U
  3640. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3641. predict error 0
  3642. dir: dir isL
  3643. /512: O: O1024 (predict-no)
  3644. I see 1 and I'm going to do: predict-no
  3645. ENV: Agent did: predict-no for direction L in state State-A
  3646. In State-A moving L
  3647. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3648. predict error 0
  3649. dir: dir isL
  3650. |\-513: O: O1026 (predict-no)
  3651. I see 1 and I'm going to do: predict-no
  3652. ENV: Agent did: predict-no for direction L in state State-A
  3653. In State-A moving L
  3654. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3655. predict error 0
  3656. dir: dir isR
  3657. /|\514: O: O1027 (predict-yes)
  3658. I see 1 and I'm going to do: predict-yes
  3659. ENV: Agent did: predict-yes for direction R in state State-A
  3660. In State-A moving R
  3661. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3662. predict error 0
  3663. dir: dir isL
  3664. -/|515: O: O1029 (predict-yes)
  3665. I see 1 and I'm going to do: predict-yes
  3666. ENV: Agent did: predict-yes for direction L in state State-B
  3667. In State-B moving L
  3668. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3669. predict error 0
  3670. dir: dir isR
  3671. \-/516: O: O1031 (predict-yes)
  3672. I see 1 and I'm going to do: predict-yes
  3673. ENV: Agent did: predict-yes for direction R in state State-A
  3674. In State-A moving R
  3675. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3676. predict error 0
  3677. dir: dir isU
  3678. |\-517: O: O1034 (predict-no)
  3679. I see 1 and I'm going to do: predict-no
  3680. ENV: Agent did: predict-no for direction U in state State-B
  3681. In State-B moving U
  3682. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3683. predict error 0
  3684. dir: dir isL
  3685. /|\518: O: O1035 (predict-yes)
  3686. I see 1 and I'm going to do: predict-yes
  3687. ENV: Agent did: predict-yes for direction L in state State-B
  3688. In State-B moving L
  3689. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3690. predict error 0
  3691. dir: dir isL
  3692. -/|519: O: O1038 (predict-no)
  3693. I see 1 and I'm going to do: predict-no
  3694. ENV: Agent did: predict-no for direction L in state State-A
  3695. In State-A moving L
  3696. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3697. predict error 0
  3698. dir: dir isR
  3699. \-520: O: O1039 (predict-yes)
  3700. I see 1 and I'm going to do: predict-yes
  3701. ENV: Agent did: predict-yes for direction R in state State-A
  3702. In State-A moving R
  3703. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3704. predict error 0
  3705. dir: dir isU
  3706. /|\521: O: O1042 (predict-no)
  3707. I see 1 and I'm going to do: predict-no
  3708. ENV: Agent did: predict-no for direction U in state State-B
  3709. In State-B moving U
  3710. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3711. predict error 0
  3712. dir: dir isL
  3713. -522: O: O1043 (predict-yes)
  3714. I see 1 and I'm going to do: predict-yes
  3715. ENV: Agent did: predict-yes for direction L in state State-B
  3716. In State-B moving L
  3717. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3718. predict error 0
  3719. dir: dir isU
  3720. /|\523: O: O1046 (predict-no)
  3721. I see 1 and I'm going to do: predict-no
  3722. ENV: Agent did: predict-no for direction U in state State-A
  3723. In State-A moving U
  3724. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3725. predict error 0
  3726. dir: dir isR
  3727. -/524: O: O1048 (predict-no)
  3728. I see 1 and I'm going to do: predict-no
  3729. ENV: Agent did: predict-no for direction R in state State-A
  3730. In State-A moving R
  3731. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3732. predict error 1
  3733. dir: dir isR
  3734. |\-525: O: O1050 (predict-no)
  3735. I see 0 and I'm going to do: predict-no
  3736. ENV: Agent did: predict-no for direction R in state State-B
  3737. In State-B moving R
  3738. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3739. predict error 0
  3740. dir: dir isL
  3741. /|526: O: O1052 (predict-no)
  3742. I see 1 and I'm going to do: predict-no
  3743. ENV: Agent did: predict-no for direction L in state State-B
  3744. In State-B moving L
  3745. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3746. predict error 1
  3747. dir: dir isU
  3748. \-/527: O: O1054 (predict-no)
  3749. I see 0 and I'm going to do: predict-no
  3750. ENV: Agent did: predict-no for direction U in state State-A
  3751. In State-A moving U
  3752. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3753. predict error 0
  3754. dir: dir isU
  3755. |\528: O: O1056 (predict-no)
  3756. I see 1 and I'm going to do: predict-no
  3757. ENV: Agent did: predict-no for direction U in state State-A
  3758. In State-A moving U
  3759. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3760. predict error 0
  3761. dir: dir isR
  3762. -/|529: O: O1057 (predict-yes)
  3763. I see 1 and I'm going to do: predict-yes
  3764. ENV: Agent did: predict-yes for direction R in state State-A
  3765. In State-A moving R
  3766. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3767. predict error 0
  3768. dir: dir isL
  3769. \-/530: O: O1059 (predict-yes)
  3770. I see 1 and I'm going to do: predict-yes
  3771. ENV: Agent did: predict-yes for direction L in state State-B
  3772. In State-B moving L
  3773. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3774. predict error 0
  3775. dir: dir isU
  3776. |\-531: O: O1062 (predict-no)
  3777. I see 1 and I'm going to do: predict-no
  3778. ENV: Agent did: predict-no for direction U in state State-A
  3779. In State-A moving U
  3780. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3781. predict error 0
  3782. dir: dir isL
  3783. /532: O: O1063 (predict-yes)
  3784. I see 1 and I'm going to do: predict-yes
  3785. ENV: Agent did: predict-yes for direction L in state State-A
  3786. In State-A moving L
  3787. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3788. predict error 1
  3789. dir: dir isR
  3790. |\533: O: O1065 (predict-yes)
  3791. I see 0 and I'm going to do: predict-yes
  3792. ENV: Agent did: predict-yes for direction R in state State-A
  3793. In State-A moving R
  3794. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3795. predict error 0
  3796. dir: dir isL
  3797. -/|534: O: O1067 (predict-yes)
  3798. I see 1 and I'm going to do: predict-yes
  3799. ENV: Agent did: predict-yes for direction L in state State-B
  3800. In State-B moving L
  3801. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3802. predict error 0
  3803. dir: dir isU
  3804. \-/535: O: O1070 (predict-no)
  3805. I see 1 and I'm going to do: predict-no
  3806. ENV: Agent did: predict-no for direction U in state State-A
  3807. In State-A moving U
  3808. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3809. predict error 0
  3810. dir: dir isU
  3811. |\-536: O: O1072 (predict-no)
  3812. I see 1 and I'm going to do: predict-no
  3813. ENV: Agent did: predict-no for direction U in state State-A
  3814. In State-A moving U
  3815. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3816. predict error 0
  3817. dir: dir isU
  3818. /|537: O: O1074 (predict-no)
  3819. I see 1 and I'm going to do: predict-no
  3820. ENV: Agent did: predict-no for direction U in state State-A
  3821. In State-A moving U
  3822. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3823. predict error 0
  3824. dir: dir isL
  3825. \-538: O: O1076 (predict-no)
  3826. I see 1 and I'm going to do: predict-no
  3827. ENV: Agent did: predict-no for direction L in state State-A
  3828. In State-A moving L
  3829. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3830. predict error 0
  3831. dir: dir isL
  3832. /|\539: O: O1078 (predict-no)
  3833. I see 1 and I'm going to do: predict-no
  3834. ENV: Agent did: predict-no for direction L in state State-A
  3835. In State-A moving L
  3836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3837. predict error 0
  3838. dir: dir isU
  3839. -/|540: O: O1080 (predict-no)
  3840. I see 1 and I'm going to do: predict-no
  3841. ENV: Agent did: predict-no for direction U in state State-A
  3842. In State-A moving U
  3843. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3844. predict error 0
  3845. dir: dir isL
  3846. \-541: O: O1082 (predict-no)
  3847. I see 1 and I'm going to do: predict-no
  3848. ENV: Agent did: predict-no for direction L in state State-A
  3849. In State-A moving L
  3850. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3851. predict error 0
  3852. dir: dir isR
  3853. /542: O: O1083 (predict-yes)
  3854. I see 1 and I'm going to do: predict-yes
  3855. ENV: Agent did: predict-yes for direction R in state State-A
  3856. In State-A moving R
  3857. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3858. predict error 0
  3859. dir: dir isL
  3860. |\-543: O: O1085 (predict-yes)
  3861. I see 1 and I'm going to do: predict-yes
  3862. ENV: Agent did: predict-yes for direction L in state State-B
  3863. In State-B moving L
  3864. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3865. predict error 0
  3866. dir: dir isL
  3867. /|\544: O: O1088 (predict-no)
  3868. I see 1 and I'm going to do: predict-no
  3869. ENV: Agent did: predict-no for direction L in state State-A
  3870. In State-A moving L
  3871. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3872. predict error 0
  3873. dir: dir isL
  3874. -/|545: O: O1090 (predict-no)
  3875. I see 1 and I'm going to do: predict-no
  3876. ENV: Agent did: predict-no for direction L in state State-A
  3877. In State-A moving L
  3878. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3879. predict error 0
  3880. dir: dir isL
  3881. \-/546: O: O1092 (predict-no)
  3882. I see 1 and I'm going to do: predict-no
  3883. ENV: Agent did: predict-no for direction L in state State-A
  3884. In State-A moving L
  3885. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3886. predict error 0
  3887. dir: dir isL
  3888. |\547: O: O1094 (predict-no)
  3889. I see 1 and I'm going to do: predict-no
  3890. ENV: Agent did: predict-no for direction L in state State-A
  3891. In State-A moving L
  3892. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3893. predict error 0
  3894. dir: dir isR
  3895. -/548: O: O1095 (predict-yes)
  3896. I see 1 and I'm going to do: predict-yes
  3897. ENV: Agent did: predict-yes for direction R in state State-A
  3898. In State-A moving R
  3899. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3900. predict error 0
  3901. dir: dir isR
  3902. |\-/549: O: O1098 (predict-no)
  3903. I see 1 and I'm going to do: predict-no
  3904. ENV: Agent did: predict-no for direction R in state State-B
  3905. In State-B moving R
  3906. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3907. predict error 0
  3908. dir: dir isU
  3909. |\-550: O: O1100 (predict-no)
  3910. I see 1 and I'm going to do: predict-no
  3911. ENV: Agent did: predict-no for direction U in state State-B
  3912. In State-B moving U
  3913. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3914. predict error 0
  3915. dir: dir isL
  3916. /|\551: O: O1102 (predict-no)
  3917. I see 1 and I'm going to do: predict-no
  3918. ENV: Agent did: predict-no for direction L in state State-B
  3919. In State-B moving L
  3920. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3921. predict error 1
  3922. dir: dir isR
  3923. -552: O: O1103 (predict-yes)
  3924. I see 0 and I'm going to do: predict-yes
  3925. ENV: Agent did: predict-yes for direction R in state State-A
  3926. In State-A moving R
  3927. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3928. predict error 0
  3929. dir: dir isR
  3930. /|\553: O: O1106 (predict-no)
  3931. I see 1 and I'm going to do: predict-no
  3932. ENV: Agent did: predict-no for direction R in state State-B
  3933. In State-B moving R
  3934. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3935. predict error 0
  3936. dir: dir isL
  3937. -/554: O: O1107 (predict-yes)
  3938. I see 1 and I'm going to do: predict-yes
  3939. ENV: Agent did: predict-yes for direction L in state State-B
  3940. In State-B moving L
  3941. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3942. predict error 0
  3943. dir: dir isR
  3944. |\-555: O: O1109 (predict-yes)
  3945. I see 1 and I'm going to do: predict-yes
  3946. ENV: Agent did: predict-yes for direction R in state State-A
  3947. In State-A moving R
  3948. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3949. predict error 0
  3950. dir: dir isR
  3951. /|\556: O: O1112 (predict-no)
  3952. I see 1 and I'm going to do: predict-no
  3953. ENV: Agent did: predict-no for direction R in state State-B
  3954. In State-B moving R
  3955. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3956. predict error 0
  3957. dir: dir isU
  3958. -/|557: O: O1114 (predict-no)
  3959. I see 1 and I'm going to do: predict-no
  3960. ENV: Agent did: predict-no for direction U in state State-B
  3961. In State-B moving U
  3962. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3963. predict error 0
  3964. dir: dir isL
  3965. \-/558: O: O1115 (predict-yes)
  3966. I see 1 and I'm going to do: predict-yes
  3967. ENV: Agent did: predict-yes for direction L in state State-B
  3968. In State-B moving L
  3969. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3970. predict error 0
  3971. dir: dir isR
  3972. |\559: O: O1117 (predict-yes)
  3973. I see 1 and I'm going to do: predict-yes
  3974. ENV: Agent did: predict-yes for direction R in state State-A
  3975. In State-A moving R
  3976. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3977. predict error 0
  3978. dir: dir isR
  3979. -/|560: O: O1120 (predict-no)
  3980. I see 1 and I'm going to do: predict-no
  3981. ENV: Agent did: predict-no for direction R in state State-B
  3982. In State-B moving R
  3983. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3984. predict error 0
  3985. dir: dir isU
  3986. \-561: O: O1122 (predict-no)
  3987. I see 1 and I'm going to do: predict-no
  3988. ENV: Agent did: predict-no for direction U in state State-B
  3989. In State-B moving U
  3990. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3991. predict error 0
  3992. dir: dir isL
  3993. /562: O: O1123 (predict-yes)
  3994. I see 1 and I'm going to do: predict-yes
  3995. ENV: Agent did: predict-yes for direction L in state State-B
  3996. In State-B moving L
  3997. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3998. predict error 0
  3999. dir: dir isL
  4000. |\-563: O: O1126 (predict-no)
  4001. I see 1 and I'm going to do: predict-no
  4002. ENV: Agent did: predict-no for direction L in state State-A
  4003. In State-A moving L
  4004. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4005. predict error 0
  4006. dir: dir isL
  4007. /|\564: O: O1128 (predict-no)
  4008. I see 1 and I'm going to do: predict-no
  4009. ENV: Agent did: predict-no for direction L in state State-A
  4010. In State-A moving L
  4011. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4012. predict error 0
  4013. dir: dir isR
  4014. -/|565: O: O1129 (predict-yes)
  4015. I see 1 and I'm going to do: predict-yes
  4016. ENV: Agent did: predict-yes for direction R in state State-A
  4017. In State-A moving R
  4018. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4019. predict error 0
  4020. dir: dir isU
  4021. \-/566: O: O1132 (predict-no)
  4022. I see 1 and I'm going to do: predict-no
  4023. ENV: Agent did: predict-no for direction U in state State-B
  4024. In State-B moving U
  4025. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4026. predict error 0
  4027. dir: dir isU
  4028. |\-567: O: O1134 (predict-no)
  4029. I see 1 and I'm going to do: predict-no
  4030. ENV: Agent did: predict-no for direction U in state State-B
  4031. In State-B moving U
  4032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4033. predict error 0
  4034. dir: dir isL
  4035. /|568: O: O1135 (predict-yes)
  4036. I see 1 and I'm going to do: predict-yes
  4037. ENV: Agent did: predict-yes for direction L in state State-B
  4038. In State-B moving L
  4039. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4040. predict error 0
  4041. dir: dir isR
  4042. \-/569: O: O1137 (predict-yes)
  4043. I see 1 and I'm going to do: predict-yes
  4044. ENV: Agent did: predict-yes for direction R in state State-A
  4045. In State-A moving R
  4046. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4047. predict error 0
  4048. dir: dir isU
  4049. |\-570: O: O1140 (predict-no)
  4050. I see 1 and I'm going to do: predict-no
  4051. ENV: Agent did: predict-no for direction U in state State-B
  4052. In State-B moving U
  4053. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4054. predict error 0
  4055. dir: dir isU
  4056. /|571: O: O1142 (predict-no)
  4057. I see 1 and I'm going to do: predict-no
  4058. ENV: Agent did: predict-no for direction U in state State-B
  4059. In State-B moving U
  4060. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4061. predict error 0
  4062. dir: dir isR
  4063. \572: O: O1144 (predict-no)
  4064. I see 1 and I'm going to do: predict-no
  4065. ENV: Agent did: predict-no for direction R in state State-B
  4066. In State-B moving R
  4067. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4068. predict error 0
  4069. dir: dir isR
  4070. -/|573: O: O1146 (predict-no)
  4071. I see 1 and I'm going to do: predict-no
  4072. ENV: Agent did: predict-no for direction R in state State-B
  4073. In State-B moving R
  4074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4075. predict error 0
  4076. dir: dir isU
  4077. \-/574: O: O1148 (predict-no)
  4078. I see 1 and I'm going to do: predict-no
  4079. ENV: Agent did: predict-no for direction U in state State-B
  4080. In State-B moving U
  4081. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4082. predict error 0
  4083. dir: dir isR
  4084. |\-575: O: O1150 (predict-no)
  4085. I see 1 and I'm going to do: predict-no
  4086. ENV: Agent did: predict-no for direction R in state State-B
  4087. In State-B moving R
  4088. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4089. predict error 0
  4090. dir: dir isL
  4091. /|\576: O: O1151 (predict-yes)
  4092. I see 1 and I'm going to do: predict-yes
  4093. ENV: Agent did: predict-yes for direction L in state State-B
  4094. In State-B moving L
  4095. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4096. predict error 0
  4097. dir: dir isR
  4098. -/|577: O: O1153 (predict-yes)
  4099. I see 1 and I'm going to do: predict-yes
  4100. ENV: Agent did: predict-yes for direction R in state State-A
  4101. In State-A moving R
  4102. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4103. predict error 0
  4104. dir: dir isU
  4105. \-/|sleeping...
  4106. \578: O: O1156 (predict-no)
  4107. I see 1 and I'm going to do: predict-no
  4108. ENV: Agent did: predict-no for direction U in state State-B
  4109. In State-B moving U
  4110. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4111. predict error 0
  4112. dir: dir isL
  4113. -/|579: O: O1157 (predict-yes)
  4114. I see 1 and I'm going to do: predict-yes
  4115. ENV: Agent did: predict-yes for direction L in state State-B
  4116. In State-B moving L
  4117. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4118. predict error 0
  4119. dir: dir isR
  4120. \-/580: O: O1159 (predict-yes)
  4121. I see 1 and I'm going to do: predict-yes
  4122. ENV: Agent did: predict-yes for direction R in state State-A
  4123. In State-A moving R
  4124. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4125. predict error 0
  4126. dir: dir isR
  4127. |\-581: O: O1162 (predict-no)
  4128. I see 1 and I'm going to do: predict-no
  4129. ENV: Agent did: predict-no for direction R in state State-B
  4130. In State-B moving R
  4131. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4132. predict error 0
  4133. dir: dir isR
  4134. /582: O: O1163 (predict-yes)
  4135. I see 1 and I'm going to do: predict-yes
  4136. ENV: Agent did: predict-yes for direction R in state State-B
  4137. In State-B moving R
  4138. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  4139. predict error 1
  4140. dir: dir isL
  4141. |\-583: O: O1165 (predict-yes)
  4142. I see 0 and I'm going to do: predict-yes
  4143. ENV: Agent did: predict-yes for direction L in state State-B
  4144. In State-B moving L
  4145. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4146. predict error 0
  4147. dir: dir isL
  4148. /|\584: O: O1168 (predict-no)
  4149. I see 1 and I'm going to do: predict-no
  4150. ENV: Agent did: predict-no for direction L in state State-A
  4151. In State-A moving L
  4152. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4153. predict error 0
  4154. dir: dir isU
  4155. -/585: O: O1170 (predict-no)
  4156. I see 1 and I'm going to do: predict-no
  4157. ENV: Agent did: predict-no for direction U in state State-A
  4158. In State-A moving U
  4159. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4160. predict error 0
  4161. dir: dir isR
  4162. |\-586: O: O1171 (predict-yes)
  4163. I see 1 and I'm going to do: predict-yes
  4164. ENV: Agent did: predict-yes for direction R in state State-A
  4165. In State-A moving R
  4166. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4167. predict error 0
  4168. dir: dir isL
  4169. /|\587: O: O1173 (predict-yes)
  4170. I see 1 and I'm going to do: predict-yes
  4171. ENV: Agent did: predict-yes for direction L in state State-B
  4172. In State-B moving L
  4173. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4174. predict error 0
  4175. dir: dir isL
  4176. -/|588: O: O1176 (predict-no)
  4177. I see 1 and I'm going to do: predict-no
  4178. ENV: Agent did: predict-no for direction L in state State-A
  4179. In State-A moving L
  4180. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4181. predict error 0
  4182. dir: dir isU
  4183. \-589: O: O1178 (predict-no)
  4184. I see 1 and I'm going to do: predict-no
  4185. ENV: Agent did: predict-no for direction U in state State-A
  4186. In State-A moving U
  4187. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4188. predict error 0
  4189. dir: dir isR
  4190. /590: O: O1179 (predict-yes)
  4191. I see 1 and I'm going to do: predict-yes
  4192. ENV: Agent did: predict-yes for direction R in state State-A
  4193. In State-A moving R
  4194. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4195. predict error 0
  4196. dir: dir isR
  4197. |\-591: O: O1182 (predict-no)
  4198. I see 1 and I'm going to do: predict-no
  4199. ENV: Agent did: predict-no for direction R in state State-B
  4200. In State-B moving R
  4201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4202. predict error 0
  4203. dir: dir isL
  4204. /592: O: O1183 (predict-yes)
  4205. I see 1 and I'm going to do: predict-yes
  4206. ENV: Agent did: predict-yes for direction L in state State-B
  4207. In State-B moving L
  4208. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4209. predict error 0
  4210. dir: dir isU
  4211. |\-593: O: O1186 (predict-no)
  4212. I see 1 and I'm going to do: predict-no
  4213. ENV: Agent did: predict-no for direction U in state State-A
  4214. In State-A moving U
  4215. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4216. predict error 0
  4217. dir: dir isR
  4218. /|\594: O: O1187 (predict-yes)
  4219. I see 1 and I'm going to do: predict-yes
  4220. ENV: Agent did: predict-yes for direction R in state State-A
  4221. In State-A moving R
  4222. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4223. predict error 0
  4224. dir: dir isU
  4225. -/|595: O: O1190 (predict-no)
  4226. I see 1 and I'm going to do: predict-no
  4227. ENV: Agent did: predict-no for direction U in state State-B
  4228. In State-B moving U
  4229. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4230. predict error 0
  4231. dir: dir isU
  4232. \596: O: O1192 (predict-no)
  4233. I see 1 and I'm going to do: predict-no
  4234. ENV: Agent did: predict-no for direction U in state State-B
  4235. In State-B moving U
  4236. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4237. predict error 0
  4238. dir: dir isL
  4239. -/|597: O: O1193 (predict-yes)
  4240. I see 1 and I'm going to do: predict-yes
  4241. ENV: Agent did: predict-yes for direction L in state State-B
  4242. In State-B moving L
  4243. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4244. predict error 0
  4245. dir: dir isU
  4246. \-/598: O: O1196 (predict-no)
  4247. I see 1 and I'm going to do: predict-no
  4248. ENV: Agent did: predict-no for direction U in state State-A
  4249. In State-A moving U
  4250. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4251. predict error 0
  4252. dir: dir isL
  4253. |\-599: O: O1198 (predict-no)
  4254. I see 1 and I'm going to do: predict-no
  4255. ENV: Agent did: predict-no for direction L in state State-A
  4256. In State-A moving L
  4257. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4258. predict error 0
  4259. dir: dir isU
  4260. /600: O: O1200 (predict-no)
  4261. I see 1 and I'm going to do: predict-no
  4262. ENV: Agent did: predict-no for direction U in state State-A
  4263. In State-A moving U
  4264. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4265. predict error 0
  4266. dir: dir isL
  4267. |601: O: O1202 (predict-no)
  4268. I see 1 and I'm going to do: predict-no
  4269. ENV: Agent did: predict-no for direction L in state State-A
  4270. In State-A moving L
  4271. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4272. predict error 0
  4273. dir: dir isU
  4274. \602: O: O1204 (predict-no)
  4275. I see 1 and I'm going to do: predict-no
  4276. ENV: Agent did: predict-no for direction U in state State-A
  4277. In State-A moving U
  4278. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4279. predict error 0
  4280. dir: dir isL
  4281. -/|\603: O: O1206 (predict-no)
  4282. I see 1 and I'm going to do: predict-no
  4283. ENV: Agent did: predict-no for direction L in state State-A
  4284. In State-A moving L
  4285. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4286. predict error 0
  4287. dir: dir isL
  4288. -/|604: O: O1208 (predict-no)
  4289. I see 1 and I'm going to do: predict-no
  4290. ENV: Agent did: predict-no for direction L in state State-A
  4291. In State-A moving L
  4292. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4293. predict error 0
  4294. dir: dir isR
  4295. \-/605: O: O1209 (predict-yes)
  4296. I see 1 and I'm going to do: predict-yes
  4297. ENV: Agent did: predict-yes for direction R in state State-A
  4298. In State-A moving R
  4299. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4300. predict error 0
  4301. dir: dir isR
  4302. |\-606: O: O1212 (predict-no)
  4303. I see 1 and I'm going to do: predict-no
  4304. ENV: Agent did: predict-no for direction R in state State-B
  4305. In State-B moving R
  4306. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4307. predict error 0
  4308. dir: dir isR
  4309. /|607: O: O1214 (predict-no)
  4310. I see 1 and I'm going to do: predict-no
  4311. ENV: Agent did: predict-no for direction R in state State-B
  4312. In State-B moving R
  4313. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4314. predict error 0
  4315. dir: dir isL
  4316. \-/608: O: O1215 (predict-yes)
  4317. I see 1 and I'm going to do: predict-yes
  4318. ENV: Agent did: predict-yes for direction L in state State-B
  4319. In State-B moving L
  4320. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4321. predict error 0
  4322. dir: dir isL
  4323. |\-609: O: O1218 (predict-no)
  4324. I see 1 and I'm going to do: predict-no
  4325. ENV: Agent did: predict-no for direction L in state State-A
  4326. In State-A moving L
  4327. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4328. predict error 0
  4329. dir: dir isL
  4330. /|\610: O: O1220 (predict-no)
  4331. I see 1 and I'm going to do: predict-no
  4332. ENV: Agent did: predict-no for direction L in state State-A
  4333. In State-A moving L
  4334. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4335. predict error 0
  4336. dir: dir isU
  4337. -/611: O: O1222 (predict-no)
  4338. I see 1 and I'm going to do: predict-no
  4339. ENV: Agent did: predict-no for direction U in state State-A
  4340. In State-A moving U
  4341. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4342. predict error 0
  4343. dir: dir isU
  4344. |612: O: O1224 (predict-no)
  4345. I see 1 and I'm going to do: predict-no
  4346. ENV: Agent did: predict-no for direction U in state State-A
  4347. In State-A moving U
  4348. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4349. predict error 0
  4350. dir: dir isR
  4351. \-/613: O: O1225 (predict-yes)
  4352. I see 1 and I'm going to do: predict-yes
  4353. ENV: Agent did: predict-yes for direction R in state State-A
  4354. In State-A moving R
  4355. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4356. predict error 0
  4357. dir: dir isL
  4358. |\-/614: O: O1227 (predict-yes)
  4359. I see 1 and I'm going to do: predict-yes
  4360. ENV: Agent did: predict-yes for direction L in state State-B
  4361. In State-B moving L
  4362. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4363. predict error 0
  4364. dir: dir isU
  4365. |\-615: O: O1230 (predict-no)
  4366. I see 1 and I'm going to do: predict-no
  4367. ENV: Agent did: predict-no for direction U in state State-A
  4368. In State-A moving U
  4369. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4370. predict error 0
  4371. dir: dir isL
  4372. /|\616: O: O1232 (predict-no)
  4373. I see 1 and I'm going to do: predict-no
  4374. ENV: Agent did: predict-no for direction L in state State-A
  4375. In State-A moving L
  4376. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4377. predict error 0
  4378. dir: dir isR
  4379. -/|617: O: O1233 (predict-yes)
  4380. I see 1 and I'm going to do: predict-yes
  4381. ENV: Agent did: predict-yes for direction R in state State-A
  4382. In State-A moving R
  4383. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4384. predict error 0
  4385. dir: dir isR
  4386. \618: O: O1236 (predict-no)
  4387. I see 1 and I'm going to do: predict-no
  4388. ENV: Agent did: predict-no for direction R in state State-B
  4389. In State-B moving R
  4390. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4391. predict error 0
  4392. dir: dir isL
  4393. -/|619: O: O1237 (predict-yes)
  4394. I see 1 and I'm going to do: predict-yes
  4395. ENV: Agent did: predict-yes for direction L in state State-B
  4396. In State-B moving L
  4397. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4398. predict error 0
  4399. dir: dir isU
  4400. \-/620: O: O1240 (predict-no)
  4401. I see 1 and I'm going to do: predict-no
  4402. ENV: Agent did: predict-no for direction U in state State-A
  4403. In State-A moving U
  4404. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4405. predict error 0
  4406. dir: dir isL
  4407. |\-621: O: O1242 (predict-no)
  4408. I see 1 and I'm going to do: predict-no
  4409. ENV: Agent did: predict-no for direction L in state State-A
  4410. In State-A moving L
  4411. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4412. predict error 0
  4413. dir: dir isR
  4414. /622: O: O1243 (predict-yes)
  4415. I see 1 and I'm going to do: predict-yes
  4416. ENV: Agent did: predict-yes for direction R in state State-A
  4417. In State-A moving R
  4418. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4419. predict error 0
  4420. dir: dir isL
  4421. |\-623: O: O1245 (predict-yes)
  4422. I see 1 and I'm going to do: predict-yes
  4423. ENV: Agent did: predict-yes for direction L in state State-B
  4424. In State-B moving L
  4425. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4426. predict error 0
  4427. dir: dir isR
  4428. /|\624: O: O1247 (predict-yes)
  4429. I see 1 and I'm going to do: predict-yes
  4430. ENV: Agent did: predict-yes for direction R in state State-A
  4431. In State-A moving R
  4432. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4433. predict error 0
  4434. dir: dir isR
  4435. -/|625: O: O1250 (predict-no)
  4436. I see 1 and I'm going to do: predict-no
  4437. ENV: Agent did: predict-no for direction R in state State-B
  4438. In State-B moving R
  4439. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4440. predict error 0
  4441. dir: dir isR
  4442. \-/626: O: O1252 (predict-no)
  4443. I see 1 and I'm going to do: predict-no
  4444. ENV: Agent did: predict-no for direction R in state State-B
  4445. In State-B moving R
  4446. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4447. predict error 0
  4448. dir: dir isR
  4449. |\-627: O: O1254 (predict-no)
  4450. I see 1 and I'm going to do: predict-no
  4451. ENV: Agent did: predict-no for direction R in state State-B
  4452. In State-B moving R
  4453. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4454. predict error 0
  4455. dir: dir isR
  4456. /|\628: O: O1256 (predict-no)
  4457. I see 1 and I'm going to do: predict-no
  4458. ENV: Agent did: predict-no for direction R in state State-B
  4459. In State-B moving R
  4460. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4461. predict error 0
  4462. dir: dir isR
  4463. -/|629: O: O1258 (predict-no)
  4464. I see 1 and I'm going to do: predict-no
  4465. ENV: Agent did: predict-no for direction R in state State-B
  4466. In State-B moving R
  4467. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4468. predict error 0
  4469. dir: dir isR
  4470. \-/630: O: O1260 (predict-no)
  4471. I see 1 and I'm going to do: predict-no
  4472. ENV: Agent did: predict-no for direction R in state State-B
  4473. In State-B moving R
  4474. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4475. predict error 0
  4476. dir: dir isU
  4477. |\-/631: O: O1262 (predict-no)
  4478. I see 1 and I'm going to do: predict-no
  4479. ENV: Agent did: predict-no for direction U in state State-B
  4480. In State-B moving U
  4481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4482. predict error 0
  4483. dir: dir isU
  4484. |632: O: O1264 (predict-no)
  4485. I see 1 and I'm going to do: predict-no
  4486. ENV: Agent did: predict-no for direction U in state State-B
  4487. In State-B moving U
  4488. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4489. predict error 0
  4490. dir: dir isL
  4491. \-/633: O: O1265 (predict-yes)
  4492. I see 1 and I'm going to do: predict-yes
  4493. ENV: Agent did: predict-yes for direction L in state State-B
  4494. In State-B moving L
  4495. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4496. predict error 0
  4497. dir: dir isR
  4498. |\634: O: O1267 (predict-yes)
  4499. I see 1 and I'm going to do: predict-yes
  4500. ENV: Agent did: predict-yes for direction R in state State-A
  4501. In State-A moving R
  4502. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4503. predict error 0
  4504. dir: dir isR
  4505. -/635: O: O1270 (predict-no)
  4506. I see 1 and I'm going to do: predict-no
  4507. ENV: Agent did: predict-no for direction R in state State-B
  4508. In State-B moving R
  4509. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4510. predict error 0
  4511. dir: dir isL
  4512. |\-636: O: O1271 (predict-yes)
  4513. I see 1 and I'm going to do: predict-yes
  4514. ENV: Agent did: predict-yes for direction L in state State-B
  4515. In State-B moving L
  4516. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4517. predict error 0
  4518. dir: dir isU
  4519. /|\637: O: O1274 (predict-no)
  4520. I see 1 and I'm going to do: predict-no
  4521. ENV: Agent did: predict-no for direction U in state State-A
  4522. In State-A moving U
  4523. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4524. predict error 0
  4525. dir: dir isR
  4526. -/|638: O: O1275 (predict-yes)
  4527. I see 1 and I'm going to do: predict-yes
  4528. ENV: Agent did: predict-yes for direction R in state State-A
  4529. In State-A moving R
  4530. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4531. predict error 0
  4532. dir: dir isR
  4533. \-/639: O: O1278 (predict-no)
  4534. I see 1 and I'm going to do: predict-no
  4535. ENV: Agent did: predict-no for direction R in state State-B
  4536. In State-B moving R
  4537. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4538. predict error 0
  4539. dir: dir isL
  4540. |\-640: O: O1279 (predict-yes)
  4541. I see 1 and I'm going to do: predict-yes
  4542. ENV: Agent did: predict-yes for direction L in state State-B
  4543. In State-B moving L
  4544. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4545. predict error 0
  4546. dir: dir isU
  4547. /|\641: O: O1282 (predict-no)
  4548. I see 1 and I'm going to do: predict-no
  4549. ENV: Agent did: predict-no for direction U in state State-A
  4550. In State-A moving U
  4551. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4552. predict error 0
  4553. dir: dir isR
  4554. -642: O: O1283 (predict-yes)
  4555. I see 1 and I'm going to do: predict-yes
  4556. ENV: Agent did: predict-yes for direction R in state State-A
  4557. In State-A moving R
  4558. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4559. predict error 0
  4560. dir: dir isR
  4561. /|\643: O: O1286 (predict-no)
  4562. I see 1 and I'm going to do: predict-no
  4563. ENV: Agent did: predict-no for direction R in state State-B
  4564. In State-B moving R
  4565. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4566. predict error 0
  4567. dir: dir isR
  4568. -/|644: O: O1288 (predict-no)
  4569. I see 1 and I'm going to do: predict-no
  4570. ENV: Agent did: predict-no for direction R in state State-B
  4571. In State-B moving R
  4572. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4573. predict error 0
  4574. dir: dir isR
  4575. \-/645: O: O1290 (predict-no)
  4576. I see 1 and I'm going to do: predict-no
  4577. ENV: Agent did: predict-no for direction R in state State-B
  4578. In State-B moving R
  4579. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4580. predict error 0
  4581. dir: dir isU
  4582. |\-646: O: O1292 (predict-no)
  4583. I see 1 and I'm going to do: predict-no
  4584. ENV: Agent did: predict-no for direction U in state State-B
  4585. In State-B moving U
  4586. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4587. predict error 0
  4588. dir: dir isL
  4589. /|\647: O: O1294 (predict-no)
  4590. I see 1 and I'm going to do: predict-no
  4591. ENV: Agent did: predict-no for direction L in state State-B
  4592. In State-B moving L
  4593. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  4594. predict error 1
  4595. dir: dir isR
  4596. -/|648: O: O1295 (predict-yes)
  4597. I see 0 and I'm going to do: predict-yes
  4598. ENV: Agent did: predict-yes for direction R in state State-A
  4599. In State-A moving R
  4600. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4601. predict error 0
  4602. dir: dir isL
  4603. \-/|649: O: O1297 (predict-yes)
  4604. I see 1 and I'm going to do: predict-yes
  4605. ENV: Agent did: predict-yes for direction L in state State-B
  4606. In State-B moving L
  4607. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4608. predict error 0
  4609. dir: dir isL
  4610. \-/650: O: O1300 (predict-no)
  4611. I see 1 and I'm going to do: predict-no
  4612. ENV: Agent did: predict-no for direction L in state State-A
  4613. In State-A moving L
  4614. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4615. predict error 0
  4616. dir: dir isU
  4617. |\-651: O: O1302 (predict-no)
  4618. I see 1 and I'm going to do: predict-no
  4619. ENV: Agent did: predict-no for direction U in state State-A
  4620. In State-A moving U
  4621. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4622. predict error 0
  4623. dir: dir isL
  4624. /652: O: O1303 (predict-yes)
  4625. I see 1 and I'm going to do: predict-yes
  4626. ENV: Agent did: predict-yes for direction L in state State-A
  4627. In State-A moving L
  4628. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  4629. predict error 1
  4630. dir: dir isR
  4631. |\-653: O: O1305 (predict-yes)
  4632. I see 0 and I'm going to do: predict-yes
  4633. ENV: Agent did: predict-yes for direction R in state State-A
  4634. In State-A moving R
  4635. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4636. predict error 0
  4637. dir: dir isL
  4638. /|\654: O: O1307 (predict-yes)
  4639. I see 1 and I'm going to do: predict-yes
  4640. ENV: Agent did: predict-yes for direction L in state State-B
  4641. In State-B moving L
  4642. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4643. predict error 0
  4644. dir: dir isR
  4645. -/|655: O: O1309 (predict-yes)
  4646. I see 1 and I'm going to do: predict-yes
  4647. ENV: Agent did: predict-yes for direction R in state State-A
  4648. In State-A moving R
  4649. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4650. predict error 0
  4651. dir: dir isU
  4652. \-/656: O: O1312 (predict-no)
  4653. I see 1 and I'm going to do: predict-no
  4654. ENV: Agent did: predict-no for direction U in state State-B
  4655. In State-B moving U
  4656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4657. predict error 0
  4658. dir: dir isL
  4659. |\-657: O: O1313 (predict-yes)
  4660. I see 1 and I'm going to do: predict-yes
  4661. ENV: Agent did: predict-yes for direction L in state State-B
  4662. In State-B moving L
  4663. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4664. predict error 0
  4665. dir: dir isR
  4666. /|\658: O: O1315 (predict-yes)
  4667. I see 1 and I'm going to do: predict-yes
  4668. ENV: Agent did: predict-yes for direction R in state State-A
  4669. In State-A moving R
  4670. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4671. predict error 0
  4672. dir: dir isL
  4673. -/|659: O: O1317 (predict-yes)
  4674. I see 1 and I'm going to do: predict-yes
  4675. ENV: Agent did: predict-yes for direction L in state State-B
  4676. In State-B moving L
  4677. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4678. predict error 0
  4679. dir: dir isU
  4680. \-660: O: O1320 (predict-no)
  4681. I see 1 and I'm going to do: predict-no
  4682. ENV: Agent did: predict-no for direction U in state State-A
  4683. In State-A moving U
  4684. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4685. predict error 0
  4686. dir: dir isU
  4687. /|\-661: O: O1322 (predict-no)
  4688. I see 1 and I'm going to do: predict-no
  4689. ENV: Agent did: predict-no for direction U in state State-A
  4690. In State-A moving U
  4691. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4692. predict error 0
  4693. dir: dir isU
  4694. /662: O: O1324 (predict-no)
  4695. I see 1 and I'm going to do: predict-no
  4696. ENV: Agent did: predict-no for direction U in state State-A
  4697. In State-A moving U
  4698. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4699. predict error 0
  4700. dir: dir isU
  4701. |\-663: O: O1326 (predict-no)
  4702. I see 1 and I'm going to do: predict-no
  4703. ENV: Agent did: predict-no for direction U in state State-A
  4704. In State-A moving U
  4705. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4706. predict error 0
  4707. dir: dir isL
  4708. /|\664: O: O1328 (predict-no)
  4709. I see 1 and I'm going to do: predict-no
  4710. ENV: Agent did: predict-no for direction L in state State-A
  4711. In State-A moving L
  4712. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4713. predict error 0
  4714. dir: dir isL
  4715. -/|665: O: O1330 (predict-no)
  4716. I see 1 and I'm going to do: predict-no
  4717. ENV: Agent did: predict-no for direction L in state State-A
  4718. In State-A moving L
  4719. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4720. predict error 0
  4721. dir: dir isR
  4722. \-/666: O: O1331 (predict-yes)
  4723. I see 1 and I'm going to do: predict-yes
  4724. ENV: Agent did: predict-yes for direction R in state State-A
  4725. In State-A moving R
  4726. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4727. predict error 0
  4728. dir: dir isU
  4729. |\-667: O: O1334 (predict-no)
  4730. I see 1 and I'm going to do: predict-no
  4731. ENV: Agent did: predict-no for direction U in state State-B
  4732. In State-B moving U
  4733. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4734. predict error 0
  4735. dir: dir isU
  4736. /|668: O: O1336 (predict-no)
  4737. I see 1 and I'm going to do: predict-no
  4738. ENV: Agent did: predict-no for direction U in state State-B
  4739. In State-B moving U
  4740. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4741. predict error 0
  4742. dir: dir isU
  4743. \-/669: O: O1338 (predict-no)
  4744. I see 1 and I'm going to do: predict-no
  4745. ENV: Agent did: predict-no for direction U in state State-B
  4746. In State-B moving U
  4747. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4748. predict error 0
  4749. dir: dir isU
  4750. |\-670: O: O1340 (predict-no)
  4751. I see 1 and I'm going to do: predict-no
  4752. ENV: Agent did: predict-no for direction U in state State-B
  4753. In State-B moving U
  4754. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4755. predict error 0
  4756. dir: dir isL
  4757. /|\671: O: O1341 (predict-yes)
  4758. I see 1 and I'm going to do: predict-yes
  4759. ENV: Agent did: predict-yes for direction L in state State-B
  4760. In State-B moving L
  4761. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4762. predict error 0
  4763. dir: dir isU
  4764. -672: O: O1344 (predict-no)
  4765. I see 1 and I'm going to do: predict-no
  4766. ENV: Agent did: predict-no for direction U in state State-A
  4767. In State-A moving U
  4768. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4769. predict error 0
  4770. dir: dir isL
  4771. /|673: O: O1346 (predict-no)
  4772. I see 1 and I'm going to do: predict-no
  4773. ENV: Agent did: predict-no for direction L in state State-A
  4774. In State-A moving L
  4775. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4776. predict error 0
  4777. dir: dir isL
  4778. \-674: O: O1348 (predict-no)
  4779. I see 1 and I'm going to do: predict-no
  4780. ENV: Agent did: predict-no for direction L in state State-A
  4781. In State-A moving L
  4782. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4783. predict error 0
  4784. dir: dir isR
  4785. /|\675: O: O1349 (predict-yes)
  4786. I see 1 and I'm going to do: predict-yes
  4787. ENV: Agent did: predict-yes for direction R in state State-A
  4788. In State-A moving R
  4789. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4790. predict error 0
  4791. dir: dir isL
  4792. -/|676: O: O1351 (predict-yes)
  4793. I see 1 and I'm going to do: predict-yes
  4794. ENV: Agent did: predict-yes for direction L in state State-B
  4795. In State-B moving L
  4796. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4797. predict error 0
  4798. dir: dir isL
  4799. \-/677: O: O1354 (predict-no)
  4800. I see 1 and I'm going to do: predict-no
  4801. ENV: Agent did: predict-no for direction L in state State-A
  4802. In State-A moving L
  4803. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4804. predict error 0
  4805. dir: dir isR
  4806. |\-678: O: O1355 (predict-yes)
  4807. I see 1 and I'm going to do: predict-yes
  4808. ENV: Agent did: predict-yes for direction R in state State-A
  4809. In State-A moving R
  4810. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4811. predict error 0
  4812. dir: dir isU
  4813. /|\679: O: O1358 (predict-no)
  4814. I see 1 and I'm going to do: predict-no
  4815. ENV: Agent did: predict-no for direction U in state State-B
  4816. In State-B moving U
  4817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4818. predict error 0
  4819. dir: dir isR
  4820. -/|680: O: O1360 (predict-no)
  4821. I see 1 and I'm going to do: predict-no
  4822. ENV: Agent did: predict-no for direction R in state State-B
  4823. In State-B moving R
  4824. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4825. predict error 0
  4826. dir: dir isR
  4827. \-/681: O: O1362 (predict-no)
  4828. I see 1 and I'm going to do: predict-no
  4829. ENV: Agent did: predict-no for direction R in state State-B
  4830. In State-B moving R
  4831. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4832. predict error 0
  4833. dir: dir isU
  4834. |682: O: O1364 (predict-no)
  4835. I see 1 and I'm going to do: predict-no
  4836. ENV: Agent did: predict-no for direction U in state State-B
  4837. In State-B moving U
  4838. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4839. predict error 0
  4840. dir: dir isR
  4841. \-/683: O: O1366 (predict-no)
  4842. I see 1 and I'm going to do: predict-no
  4843. ENV: Agent did: predict-no for direction R in state State-B
  4844. In State-B moving R
  4845. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4846. predict error 0
  4847. dir: dir isL
  4848. |\-684: O: O1367 (predict-yes)
  4849. I see 1 and I'm going to do: predict-yes
  4850. ENV: Agent did: predict-yes for direction L in state State-B
  4851. In State-B moving L
  4852. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4853. predict error 0
  4854. dir: dir isU
  4855. /|\685: O: O1370 (predict-no)
  4856. I see 1 and I'm going to do: predict-no
  4857. ENV: Agent did: predict-no for direction U in state State-A
  4858. In State-A moving U
  4859. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4860. predict error 0
  4861. dir: dir isR
  4862. -/|686: O: O1371 (predict-yes)
  4863. I see 1 and I'm going to do: predict-yes
  4864. ENV: Agent did: predict-yes for direction R in state State-A
  4865. In State-A moving R
  4866. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4867. predict error 0
  4868. dir: dir isU
  4869. \-/687: O: O1374 (predict-no)
  4870. I see 1 and I'm going to do: predict-no
  4871. ENV: Agent did: predict-no for direction U in state State-B
  4872. In State-B moving U
  4873. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4874. predict error 0
  4875. dir: dir isR
  4876. |\-688: O: O1376 (predict-no)
  4877. I see 1 and I'm going to do: predict-no
  4878. ENV: Agent did: predict-no for direction R in state State-B
  4879. In State-B moving R
  4880. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4881. predict error 0
  4882. dir: dir isU
  4883. /|\689: O: O1378 (predict-no)
  4884. I see 1 and I'm going to do: predict-no
  4885. ENV: Agent did: predict-no for direction U in state State-B
  4886. In State-B moving U
  4887. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4888. predict error 0
  4889. dir: dir isR
  4890. -/|690: O: O1380 (predict-no)
  4891. I see 1 and I'm going to do: predict-no
  4892. ENV: Agent did: predict-no for direction R in state State-B
  4893. In State-B moving R
  4894. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4895. predict error 0
  4896. dir: dir isL
  4897. \-/691: O: O1381 (predict-yes)
  4898. I see 1 and I'm going to do: predict-yes
  4899. ENV: Agent did: predict-yes for direction L in state State-B
  4900. In State-B moving L
  4901. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4902. predict error 0
  4903. dir: dir isL
  4904. |692: O: O1384 (predict-no)
  4905. I see 1 and I'm going to do: predict-no
  4906. ENV: Agent did: predict-no for direction L in state State-A
  4907. In State-A moving L
  4908. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4909. predict error 0
  4910. dir: dir isU
  4911. \-693: O: O1386 (predict-no)
  4912. I see 1 and I'm going to do: predict-no
  4913. ENV: Agent did: predict-no for direction U in state State-A
  4914. In State-A moving U
  4915. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4916. predict error 0
  4917. dir: dir isR
  4918. /|\694: O: O1387 (predict-yes)
  4919. I see 1 and I'm going to do: predict-yes
  4920. ENV: Agent did: predict-yes for direction R in state State-A
  4921. In State-A moving R
  4922. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4923. predict error 0
  4924. dir: dir isL
  4925. -/|695: O: O1389 (predict-yes)
  4926. I see 1 and I'm going to do: predict-yes
  4927. ENV: Agent did: predict-yes for direction L in state State-B
  4928. In State-B moving L
  4929. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4930. predict error 0
  4931. dir: dir isR
  4932. \-/696: O: O1391 (predict-yes)
  4933. I see 1 and I'm going to do: predict-yes
  4934. ENV: Agent did: predict-yes for direction R in state State-A
  4935. In State-A moving R
  4936. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4937. predict error 0
  4938. dir: dir isL
  4939. |\-697: O: O1393 (predict-yes)
  4940. I see 1 and I'm going to do: predict-yes
  4941. ENV: Agent did: predict-yes for direction L in state State-B
  4942. In State-B moving L
  4943. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4944. predict error 0
  4945. dir: dir isL
  4946. /|698: O: O1396 (predict-no)
  4947. I see 1 and I'm going to do: predict-no
  4948. ENV: Agent did: predict-no for direction L in state State-A
  4949. In State-A moving L
  4950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4951. predict error 0
  4952. dir: dir isL
  4953. \-699: O: O1398 (predict-no)
  4954. I see 1 and I'm going to do: predict-no
  4955. ENV: Agent did: predict-no for direction L in state State-A
  4956. In State-A moving L
  4957. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4958. predict error 0
  4959. dir: dir isL
  4960. /700: O: O1400 (predict-no)
  4961. I see 1 and I'm going to do: predict-no
  4962. ENV: Agent did: predict-no for direction L in state State-A
  4963. In State-A moving L
  4964. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4965. predict error 0
  4966. dir: dir isR
  4967. |\-701: O: O1401 (predict-yes)
  4968. I see 1 and I'm going to do: predict-yes
  4969. ENV: Agent did: predict-yes for direction R in state State-A
  4970. In State-A moving R
  4971. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4972. predict error 0
  4973. dir: dir isL
  4974. /702: O: O1403 (predict-yes)
  4975. I see 1 and I'm going to do: predict-yes
  4976. ENV: Agent did: predict-yes for direction L in state State-B
  4977. In State-B moving L
  4978. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4979. predict error 0
  4980. dir: dir isR
  4981. |\-703: O: O1405 (predict-yes)
  4982. I see 1 and I'm going to do: predict-yes
  4983. ENV: Agent did: predict-yes for direction R in state State-A
  4984. In State-A moving R
  4985. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4986. predict error 0
  4987. dir: dir isR
  4988. /|704: O: O1408 (predict-no)
  4989. I see 1 and I'm going to do: predict-no
  4990. ENV: Agent did: predict-no for direction R in state State-B
  4991. In State-B moving R
  4992. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4993. predict error 0
  4994. dir: dir isU
  4995. \-/705: O: O1410 (predict-no)
  4996. I see 1 and I'm going to do: predict-no
  4997. ENV: Agent did: predict-no for direction U in state State-B
  4998. In State-B moving U
  4999. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5000. predict error 0
  5001. dir: dir isR
  5002. |\-706: O: O1412 (predict-no)
  5003. I see 1 and I'm going to do: predict-no
  5004. ENV: Agent did: predict-no for direction R in state State-B
  5005. In State-B moving R
  5006. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5007. predict error 0
  5008. dir: dir isL
  5009. /|\707: O: O1413 (predict-yes)
  5010. I see 1 and I'm going to do: predict-yes
  5011. ENV: Agent did: predict-yes for direction L in state State-B
  5012. In State-B moving L
  5013. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5014. predict error 0
  5015. dir: dir isU
  5016. -/|708: O: O1416 (predict-no)
  5017. I see 1 and I'm going to do: predict-no
  5018. ENV: Agent did: predict-no for direction U in state State-A
  5019. In State-A moving U
  5020. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5021. predict error 0
  5022. dir: dir isR
  5023. \-/709: O: O1417 (predict-yes)
  5024. I see 1 and I'm going to do: predict-yes
  5025. ENV: Agent did: predict-yes for direction R in state State-A
  5026. In State-A moving R
  5027. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5028. predict error 0
  5029. dir: dir isR
  5030. |\-710: O: O1420 (predict-no)
  5031. I see 1 and I'm going to do: predict-no
  5032. ENV: Agent did: predict-no for direction R in state State-B
  5033. In State-B moving R
  5034. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5035. predict error 0
  5036. dir: dir isR
  5037. /|\711: O: O1422 (predict-no)
  5038. I see 1 and I'm going to do: predict-no
  5039. ENV: Agent did: predict-no for direction R in state State-B
  5040. In State-B moving R
  5041. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5042. predict error 0
  5043. dir: dir isR
  5044. -712: O: O1424 (predict-no)
  5045. I see 1 and I'm going to do: predict-no
  5046. ENV: Agent did: predict-no for direction R in state State-B
  5047. In State-B moving R
  5048. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5049. predict error 0
  5050. dir: dir isU
  5051. /|\713: O: O1426 (predict-no)
  5052. I see 1 and I'm going to do: predict-no
  5053. ENV: Agent did: predict-no for direction U in state State-B
  5054. In State-B moving U
  5055. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5056. predict error 0
  5057. dir: dir isU
  5058. -/|714: O: O1428 (predict-no)
  5059. I see 1 and I'm going to do: predict-no
  5060. ENV: Agent did: predict-no for direction U in state State-B
  5061. In State-B moving U
  5062. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5063. predict error 0
  5064. dir: dir isU
  5065. \-715: O: O1430 (predict-no)
  5066. I see 1 and I'm going to do: predict-no
  5067. ENV: Agent did: predict-no for direction U in state State-B
  5068. In State-B moving U
  5069. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5070. predict error 0
  5071. dir: dir isU
  5072. /|\716: O: O1432 (predict-no)
  5073. I see 1 and I'm going to do: predict-no
  5074. ENV: Agent did: predict-no for direction U in state State-B
  5075. In State-B moving U
  5076. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5077. predict error 0
  5078. dir: dir isR
  5079. -/|717: O: O1434 (predict-no)
  5080. I see 1 and I'm going to do: predict-no
  5081. ENV: Agent did: predict-no for direction R in state State-B
  5082. In State-B moving R
  5083. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5084. predict error 0
  5085. dir: dir isR
  5086. \-718: O: O1436 (predict-no)
  5087. I see 1 and I'm going to do: predict-no
  5088. ENV: Agent did: predict-no for direction R in state State-B
  5089. In State-B moving R
  5090. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5091. predict error 0
  5092. dir: dir isU
  5093. /|\719: O: O1438 (predict-no)
  5094. I see 1 and I'm going to do: predict-no
  5095. ENV: Agent did: predict-no for direction U in state State-B
  5096. In State-B moving U
  5097. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5098. predict error 0
  5099. dir: dir isL
  5100. -720: O: O1439 (predict-yes)
  5101. I see 1 and I'm going to do: predict-yes
  5102. ENV: Agent did: predict-yes for direction L in state State-B
  5103. In State-B moving L
  5104. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5105. predict error 0
  5106. dir: dir isL
  5107. /|\721: O: O1442 (predict-no)
  5108. I see 1 and I'm going to do: predict-no
  5109. ENV: Agent did: predict-no for direction L in state State-A
  5110. In State-A moving L
  5111. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5112. predict error 0
  5113. dir: dir isL
  5114. -722: O: O1444 (predict-no)
  5115. I see 1 and I'm going to do: predict-no
  5116. ENV: Agent did: predict-no for direction L in state State-A
  5117. In State-A moving L
  5118. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5119. predict error 0
  5120. dir: dir isL
  5121. /|\723: O: O1446 (predict-no)
  5122. I see 1 and I'm going to do: predict-no
  5123. ENV: Agent did: predict-no for direction L in state State-A
  5124. In State-A moving L
  5125. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5126. predict error 0
  5127. dir: dir isL
  5128. -/|724: O: O1448 (predict-no)
  5129. I see 1 and I'm going to do: predict-no
  5130. ENV: Agent did: predict-no for direction L in state State-A
  5131. In State-A moving L
  5132. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5133. predict error 0
  5134. dir: dir isR
  5135. \-/725: O: O1449 (predict-yes)
  5136. I see 1 and I'm going to do: predict-yes
  5137. ENV: Agent did: predict-yes for direction R in state State-A
  5138. In State-A moving R
  5139. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5140. predict error 0
  5141. dir: dir isL
  5142. |\-726: O: O1451 (predict-yes)
  5143. I see 1 and I'm going to do: predict-yes
  5144. ENV: Agent did: predict-yes for direction L in state State-B
  5145. In State-B moving L
  5146. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5147. predict error 0
  5148. dir: dir isU
  5149. /|\727: O: O1454 (predict-no)
  5150. I see 1 and I'm going to do: predict-no
  5151. ENV: Agent did: predict-no for direction U in state State-A
  5152. In State-A moving U
  5153. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5154. predict error 0
  5155. dir: dir isU
  5156. -/|728: O: O1456 (predict-no)
  5157. I see 1 and I'm going to do: predict-no
  5158. ENV: Agent did: predict-no for direction U in state State-A
  5159. In State-A moving U
  5160. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5161. predict error 0
  5162. dir: dir isU
  5163. \-729: O: O1458 (predict-no)
  5164. I see 1 and I'm going to do: predict-no
  5165. ENV: Agent did: predict-no for direction U in state State-A
  5166. In State-A moving U
  5167. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5168. predict error 0
  5169. dir: dir isR
  5170. /730: O: O1459 (predict-yes)
  5171. I see 1 and I'm going to do: predict-yes
  5172. ENV: Agent did: predict-yes for direction R in state State-A
  5173. In State-A moving R
  5174. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5175. predict error 0
  5176. dir: dir isU
  5177. |\-731: O: O1462 (predict-no)
  5178. I see 1 and I'm going to do: predict-no
  5179. ENV: Agent did: predict-no for direction U in state State-B
  5180. In State-B moving U
  5181. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5182. predict error 0
  5183. dir: dir isR
  5184. /732: O: O1464 (predict-no)
  5185. I see 1 and I'm going to do: predict-no
  5186. ENV: Agent did: predict-no for direction R in state State-B
  5187. In State-B moving R
  5188. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5189. predict error 0
  5190. dir: dir isR
  5191. |\-733: O: O1466 (predict-no)
  5192. I see 1 and I'm going to do: predict-no
  5193. ENV: Agent did: predict-no for direction R in state State-B
  5194. In State-B moving R
  5195. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5196. predict error 0
  5197. dir: dir isL
  5198. /|734: O: O1467 (predict-yes)
  5199. I see 1 and I'm going to do: predict-yes
  5200. ENV: Agent did: predict-yes for direction L in state State-B
  5201. In State-B moving L
  5202. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5203. predict error 0
  5204. dir: dir isU
  5205. \-735: O: O1470 (predict-no)
  5206. I see 1 and I'm going to do: predict-no
  5207. ENV: Agent did: predict-no for direction U in state State-A
  5208. In State-A moving U
  5209. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5210. predict error 0
  5211. dir: dir isU
  5212. /|\736: O: O1472 (predict-no)
  5213. I see 1 and I'm going to do: predict-no
  5214. ENV: Agent did: predict-no for direction U in state State-A
  5215. In State-A moving U
  5216. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5217. predict error 0
  5218. dir: dir isL
  5219. -/|737: O: O1474 (predict-no)
  5220. I see 1 and I'm going to do: predict-no
  5221. ENV: Agent did: predict-no for direction L in state State-A
  5222. In State-A moving L
  5223. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5224. predict error 0
  5225. dir: dir isR
  5226. \-738: O: O1475 (predict-yes)
  5227. I see 1 and I'm going to do: predict-yes
  5228. ENV: Agent did: predict-yes for direction R in state State-A
  5229. In State-A moving R
  5230. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5231. predict error 0
  5232. dir: dir isL
  5233. /|739: O: O1477 (predict-yes)
  5234. I see 1 and I'm going to do: predict-yes
  5235. ENV: Agent did: predict-yes for direction L in state State-B
  5236. In State-B moving L
  5237. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5238. predict error 0
  5239. dir: dir isL
  5240. \-/740: O: O1480 (predict-no)
  5241. I see 1 and I'm going to do: predict-no
  5242. ENV: Agent did: predict-no for direction L in state State-A
  5243. In State-A moving L
  5244. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5245. predict error 0
  5246. dir: dir isL
  5247. |\741: O: O1482 (predict-no)
  5248. I see 1 and I'm going to do: predict-no
  5249. ENV: Agent did: predict-no for direction L in state State-A
  5250. In State-A moving L
  5251. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5252. predict error 0
  5253. dir: dir isR
  5254. -742: O: O1483 (predict-yes)
  5255. I see 1 and I'm going to do: predict-yes
  5256. ENV: Agent did: predict-yes for direction R in state State-A
  5257. In State-A moving R
  5258. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5259. predict error 0
  5260. dir: dir isL
  5261. /|\743: O: O1485 (predict-yes)
  5262. I see 1 and I'm going to do: predict-yes
  5263. ENV: Agent did: predict-yes for direction L in state State-B
  5264. In State-B moving L
  5265. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5266. predict error 0
  5267. dir: dir isR
  5268. -/|744: O: O1487 (predict-yes)
  5269. I see 1 and I'm going to do: predict-yes
  5270. ENV: Agent did: predict-yes for direction R in state State-A
  5271. In State-A moving R
  5272. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5273. predict error 0
  5274. dir: dir isL
  5275. \-/745: O: O1489 (predict-yes)
  5276. I see 1 and I'm going to do: predict-yes
  5277. ENV: Agent did: predict-yes for direction L in state State-B
  5278. In State-B moving L
  5279. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5280. predict error 0
  5281. dir: dir isL
  5282. |\-746: O: O1492 (predict-no)
  5283. I see 1 and I'm going to do: predict-no
  5284. ENV: Agent did: predict-no for direction L in state State-A
  5285. In State-A moving L
  5286. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5287. predict error 0
  5288. dir: dir isU
  5289. /|\747: O: O1494 (predict-no)
  5290. I see 1 and I'm going to do: predict-no
  5291. ENV: Agent did: predict-no for direction U in state State-A
  5292. In State-A moving U
  5293. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5294. predict error 0
  5295. dir: dir isU
  5296. -748: O: O1496 (predict-no)
  5297. I see 1 and I'm going to do: predict-no
  5298. ENV: Agent did: predict-no for direction U in state State-A
  5299. In State-A moving U
  5300. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5301. predict error 0
  5302. dir: dir isL
  5303. /|\749: O: O1498 (predict-no)
  5304. I see 1 and I'm going to do: predict-no
  5305. ENV: Agent did: predict-no for direction L in state State-A
  5306. In State-A moving L
  5307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5308. predict error 0
  5309. dir: dir isU
  5310. -/|750: O: O1500 (predict-no)
  5311. I see 1 and I'm going to do: predict-no
  5312. ENV: Agent did: predict-no for direction U in state State-A
  5313. In State-A moving U
  5314. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5315. predict error 0
  5316. dir: dir isL
  5317. \-751: O: O1502 (predict-no)
  5318. I see 1 and I'm going to do: predict-no
  5319. ENV: Agent did: predict-no for direction L in state State-A
  5320. In State-A moving L
  5321. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5322. predict error 0
  5323. dir: dir isR
  5324. /752: O: O1503 (predict-yes)
  5325. I see 1 and I'm going to do: predict-yes
  5326. ENV: Agent did: predict-yes for direction R in state State-A
  5327. In State-A moving R
  5328. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5329. predict error 0
  5330. dir: dir isU
  5331. |\-753: O: O1506 (predict-no)
  5332. I see 1 and I'm going to do: predict-no
  5333. ENV: Agent did: predict-no for direction U in state State-B
  5334. In State-B moving U
  5335. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5336. predict error 0
  5337. dir: dir isL
  5338. /|754: O: O1507 (predict-yes)
  5339. I see 1 and I'm going to do: predict-yes
  5340. ENV: Agent did: predict-yes for direction L in state State-B
  5341. In State-B moving L
  5342. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5343. predict error 0
  5344. dir: dir isU
  5345. \-/755: O: O1510 (predict-no)
  5346. I see 1 and I'm going to do: predict-no
  5347. ENV: Agent did: predict-no for direction U in state State-A
  5348. In State-A moving U
  5349. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5350. predict error 0
  5351. dir: dir isL
  5352. |\-756: O: O1512 (predict-no)
  5353. I see 1 and I'm going to do: predict-no
  5354. ENV: Agent did: predict-no for direction L in state State-A
  5355. In State-A moving L
  5356. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5357. predict error 0
  5358. dir: dir isR
  5359. /|\757: O: O1513 (predict-yes)
  5360. I see 1 and I'm going to do: predict-yes
  5361. ENV: Agent did: predict-yes for direction R in state State-A
  5362. In State-A moving R
  5363. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5364. predict error 0
  5365. dir: dir isU
  5366. -/758: O: O1516 (predict-no)
  5367. I see 1 and I'm going to do: predict-no
  5368. ENV: Agent did: predict-no for direction U in state State-B
  5369. In State-B moving U
  5370. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5371. predict error 0
  5372. dir: dir isL
  5373. |\-759: O: O1517 (predict-yes)
  5374. I see 1 and I'm going to do: predict-yes
  5375. ENV: Agent did: predict-yes for direction L in state State-B
  5376. In State-B moving L
  5377. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5378. predict error 0
  5379. dir: dir isU
  5380. /|\760: O: O1520 (predict-no)
  5381. I see 1 and I'm going to do: predict-no
  5382. ENV: Agent did: predict-no for direction U in state State-A
  5383. In State-A moving U
  5384. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5385. predict error 0
  5386. dir: dir isU
  5387. -/|761: O: O1522 (predict-no)
  5388. I see 1 and I'm going to do: predict-no
  5389. ENV: Agent did: predict-no for direction U in state State-A
  5390. In State-A moving U
  5391. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5392. predict error 0
  5393. dir: dir isR
  5394. \762: O: O1523 (predict-yes)
  5395. I see 1 and I'm going to do: predict-yes
  5396. ENV: Agent did: predict-yes for direction R in state State-A
  5397. In State-A moving R
  5398. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5399. predict error 0
  5400. dir: dir isL
  5401. -/|763: O: O1525 (predict-yes)
  5402. I see 1 and I'm going to do: predict-yes
  5403. ENV: Agent did: predict-yes for direction L in state State-B
  5404. In State-B moving L
  5405. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5406. predict error 0
  5407. dir: dir isL
  5408. \-/764: O: O1528 (predict-no)
  5409. I see 1 and I'm going to do: predict-no
  5410. ENV: Agent did: predict-no for direction L in state State-A
  5411. In State-A moving L
  5412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5413. predict error 0
  5414. dir: dir isL
  5415. |\-765: O: O1530 (predict-no)
  5416. I see 1 and I'm going to do: predict-no
  5417. ENV: Agent did: predict-no for direction L in state State-A
  5418. In State-A moving L
  5419. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5420. predict error 0
  5421. dir: dir isU
  5422. /|\766: O: O1532 (predict-no)
  5423. I see 1 and I'm going to do: predict-no
  5424. ENV: Agent did: predict-no for direction U in state State-A
  5425. In State-A moving U
  5426. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5427. predict error 0
  5428. dir: dir isR
  5429. -/|767: O: O1533 (predict-yes)
  5430. I see 1 and I'm going to do: predict-yes
  5431. ENV: Agent did: predict-yes for direction R in state State-A
  5432. In State-A moving R
  5433. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5434. predict error 0
  5435. dir: dir isU
  5436. \-768: O: O1536 (predict-no)
  5437. I see 1 and I'm going to do: predict-no
  5438. ENV: Agent did: predict-no for direction U in state State-B
  5439. In State-B moving U
  5440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5441. predict error 0
  5442. dir: dir isR
  5443. /|\769: O: O1538 (predict-no)
  5444. I see 1 and I'm going to do: predict-no
  5445. ENV: Agent did: predict-no for direction R in state State-B
  5446. In State-B moving R
  5447. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5448. predict error 0
  5449. dir: dir isL
  5450. -/|\770: O: O1540 (predict-no)
  5451. I see 1 and I'm going to do: predict-no
  5452. ENV: Agent did: predict-no for direction L in state State-B
  5453. In State-B moving L
  5454. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  5455. predict error 1
  5456. dir: dir isR
  5457. -/|771: O: O1541 (predict-yes)
  5458. I see 0 and I'm going to do: predict-yes
  5459. ENV: Agent did: predict-yes for direction R in state State-A
  5460. In State-A moving R
  5461. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5462. predict error 0
  5463. dir: dir isU
  5464. \772: O: O1544 (predict-no)
  5465. I see 1 and I'm going to do: predict-no
  5466. ENV: Agent did: predict-no for direction U in state State-B
  5467. In State-B moving U
  5468. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5469. predict error 0
  5470. dir: dir isU
  5471. -/|773: O: O1546 (predict-no)
  5472. I see 1 and I'm going to do: predict-no
  5473. ENV: Agent did: predict-no for direction U in state State-B
  5474. In State-B moving U
  5475. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5476. predict error 0
  5477. dir: dir isL
  5478. \-/774: O: O1547 (predict-yes)
  5479. I see 1 and I'm going to do: predict-yes
  5480. ENV: Agent did: predict-yes for direction L in state State-B
  5481. In State-B moving L
  5482. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5483. predict error 0
  5484. dir: dir isL
  5485. |\-/775: O: O1550 (predict-no)
  5486. I see 1 and I'm going to do: predict-no
  5487. ENV: Agent did: predict-no for direction L in state State-A
  5488. In State-A moving L
  5489. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5490. predict error 0
  5491. dir: dir isR
  5492. |\-776: O: O1551 (predict-yes)
  5493. I see 1 and I'm going to do: predict-yes
  5494. ENV: Agent did: predict-yes for direction R in state State-A
  5495. In State-A moving R
  5496. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5497. predict error 0
  5498. dir: dir isL
  5499. /|\777: O: O1553 (predict-yes)
  5500. I see 1 and I'm going to do: predict-yes
  5501. ENV: Agent did: predict-yes for direction L in state State-B
  5502. In State-B moving L
  5503. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5504. predict error 0
  5505. dir: dir isU
  5506. -/|778: O: O1556 (predict-no)
  5507. I see 1 and I'm going to do: predict-no
  5508. ENV: Agent did: predict-no for direction U in state State-A
  5509. In State-A moving U
  5510. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5511. predict error 0
  5512. dir: dir isU
  5513. \-/779: O: O1558 (predict-no)
  5514. I see 1 and I'm going to do: predict-no
  5515. ENV: Agent did: predict-no for direction U in state State-A
  5516. In State-A moving U
  5517. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5518. predict error 0
  5519. dir: dir isL
  5520. |\780: O: O1560 (predict-no)
  5521. I see 1 and I'm going to do: predict-no
  5522. ENV: Agent did: predict-no for direction L in state State-A
  5523. In State-A moving L
  5524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5525. predict error 0
  5526. dir: dir isR
  5527. -/781: O: O1561 (predict-yes)
  5528. I see 1 and I'm going to do: predict-yes
  5529. ENV: Agent did: predict-yes for direction R in state State-A
  5530. In State-A moving R
  5531. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5532. predict error 0
  5533. dir: dir isR
  5534. |782: O: O1564 (predict-no)
  5535. I see 1 and I'm going to do: predict-no
  5536. ENV: Agent did: predict-no for direction R in state State-B
  5537. In State-B moving R
  5538. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5539. predict error 0
  5540. dir: dir isL
  5541. \-/783: O: O1565 (predict-yes)
  5542. I see 1 and I'm going to do: predict-yes
  5543. ENV: Agent did: predict-yes for direction L in state State-B
  5544. In State-B moving L
  5545. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5546. predict error 0
  5547. dir: dir isR
  5548. |\-784: O: O1567 (predict-yes)
  5549. I see 1 and I'm going to do: predict-yes
  5550. ENV: Agent did: predict-yes for direction R in state State-A
  5551. In State-A moving R
  5552. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5553. predict error 0
  5554. dir: dir isL
  5555. /|\785: O: O1569 (predict-yes)
  5556. I see 1 and I'm going to do: predict-yes
  5557. ENV: Agent did: predict-yes for direction L in state State-B
  5558. In State-B moving L
  5559. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5560. predict error 0
  5561. dir: dir isL
  5562. -/|786: O: O1572 (predict-no)
  5563. I see 1 and I'm going to do: predict-no
  5564. ENV: Agent did: predict-no for direction L in state State-A
  5565. In State-A moving L
  5566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5567. predict error 0
  5568. dir: dir isR
  5569. \-/787: O: O1573 (predict-yes)
  5570. I see 1 and I'm going to do: predict-yes
  5571. ENV: Agent did: predict-yes for direction R in state State-A
  5572. In State-A moving R
  5573. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5574. predict error 0
  5575. dir: dir isR
  5576. |\-788: O: O1576 (predict-no)
  5577. I see 1 and I'm going to do: predict-no
  5578. ENV: Agent did: predict-no for direction R in state State-B
  5579. In State-B moving R
  5580. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5581. predict error 0
  5582. dir: dir isR
  5583. /|789: O: O1578 (predict-no)
  5584. I see 1 and I'm going to do: predict-no
  5585. ENV: Agent did: predict-no for direction R in state State-B
  5586. In State-B moving R
  5587. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5588. predict error 0
  5589. dir: dir isL
  5590. \-/790: O: O1579 (predict-yes)
  5591. I see 1 and I'm going to do: predict-yes
  5592. ENV: Agent did: predict-yes for direction L in state State-B
  5593. In State-B moving L
  5594. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5595. predict error 0
  5596. dir: dir isL
  5597. |\-791: O: O1582 (predict-no)
  5598. I see 1 and I'm going to do: predict-no
  5599. ENV: Agent did: predict-no for direction L in state State-A
  5600. In State-A moving L
  5601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5602. predict error 0
  5603. dir: dir isL
  5604. /792: O: O1584 (predict-no)
  5605. I see 1 and I'm going to do: predict-no
  5606. ENV: Agent did: predict-no for direction L in state State-A
  5607. In State-A moving L
  5608. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5609. predict error 0
  5610. dir: dir isU
  5611. |\-793: O: O1586 (predict-no)
  5612. I see 1 and I'm going to do: predict-no
  5613. ENV: Agent did: predict-no for direction U in state State-A
  5614. In State-A moving U
  5615. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5616. predict error 0
  5617. dir: dir isL
  5618. /|\794: O: O1588 (predict-no)
  5619. I see 1 and I'm going to do: predict-no
  5620. ENV: Agent did: predict-no for direction L in state State-A
  5621. In State-A moving L
  5622. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5623. predict error 0
  5624. dir: dir isU
  5625. -/|795: O: O1590 (predict-no)
  5626. I see 1 and I'm going to do: predict-no
  5627. ENV: Agent did: predict-no for direction U in state State-A
  5628. In State-A moving U
  5629. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5630. predict error 0
  5631. dir: dir isL
  5632. \-/796: O: O1592 (predict-no)
  5633. I see 1 and I'm going to do: predict-no
  5634. ENV: Agent did: predict-no for direction L in state State-A
  5635. In State-A moving L
  5636. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5637. predict error 0
  5638. dir: dir isL
  5639. |\-797: O: O1594 (predict-no)
  5640. I see 1 and I'm going to do: predict-no
  5641. ENV: Agent did: predict-no for direction L in state State-A
  5642. In State-A moving L
  5643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5644. predict error 0
  5645. dir: dir isU
  5646. /|\798: O: O1596 (predict-no)
  5647. I see 1 and I'm going to do: predict-no
  5648. ENV: Agent did: predict-no for direction U in state State-A
  5649. In State-A moving U
  5650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5651. predict error 0
  5652. dir: dir isR
  5653. -/|799: O: O1597 (predict-yes)
  5654. I see 1 and I'm going to do: predict-yes
  5655. ENV: Agent did: predict-yes for direction R in state State-A
  5656. In State-A moving R
  5657. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5658. predict error 0
  5659. dir: dir isU
  5660. \800: O: O1600 (predict-no)
  5661. I see 1 and I'm going to do: predict-no
  5662. ENV: Agent did: predict-no for direction U in state State-B
  5663. In State-B moving U
  5664. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5665. predict error 0
  5666. dir: dir isR
  5667. -/|801: O: O1602 (predict-no)
  5668. I see 1 and I'm going to do: predict-no
  5669. ENV: Agent did: predict-no for direction R in state State-B
  5670. In State-B moving R
  5671. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5672. predict error 0
  5673. dir: dir isU
  5674. \802: O: O1604 (predict-no)
  5675. I see 1 and I'm going to do: predict-no
  5676. ENV: Agent did: predict-no for direction U in state State-B
  5677. In State-B moving U
  5678. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5679. predict error 0
  5680. dir: dir isL
  5681. -/|803: O: O1605 (predict-yes)
  5682. I see 1 and I'm going to do: predict-yes
  5683. ENV: Agent did: predict-yes for direction L in state State-B
  5684. In State-B moving L
  5685. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5686. predict error 0
  5687. dir: dir isR
  5688. \-/804: O: O1607 (predict-yes)
  5689. I see 1 and I'm going to do: predict-yes
  5690. ENV: Agent did: predict-yes for direction R in state State-A
  5691. In State-A moving R
  5692. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5693. predict error 0
  5694. dir: dir isL
  5695. |\-805: O: O1609 (predict-yes)
  5696. I see 1 and I'm going to do: predict-yes
  5697. ENV: Agent did: predict-yes for direction L in state State-B
  5698. In State-B moving L
  5699. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5700. predict error 0
  5701. dir: dir isU
  5702. /|\806: O: O1612 (predict-no)
  5703. I see 1 and I'm going to do: predict-no
  5704. ENV: Agent did: predict-no for direction U in state State-A
  5705. In State-A moving U
  5706. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5707. predict error 0
  5708. dir: dir isR
  5709. -/|807: O: O1613 (predict-yes)
  5710. I see 1 and I'm going to do: predict-yes
  5711. ENV: Agent did: predict-yes for direction R in state State-A
  5712. In State-A moving R
  5713. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5714. predict error 0
  5715. dir: dir isU
  5716. \-/808: O: O1616 (predict-no)
  5717. I see 1 and I'm going to do: predict-no
  5718. ENV: Agent did: predict-no for direction U in state State-B
  5719. In State-B moving U
  5720. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5721. predict error 0
  5722. dir: dir isU
  5723. |\809: O: O1618 (predict-no)
  5724. I see 1 and I'm going to do: predict-no
  5725. ENV: Agent did: predict-no for direction U in state State-B
  5726. In State-B moving U
  5727. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5728. predict error 0
  5729. dir: dir isR
  5730. -/|810: O: O1620 (predict-no)
  5731. I see 1 and I'm going to do: predict-no
  5732. ENV: Agent did: predict-no for direction R in state State-B
  5733. In State-B moving R
  5734. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5735. predict error 0
  5736. dir: dir isL
  5737. \-/811: O: O1621 (predict-yes)
  5738. I see 1 and I'm going to do: predict-yes
  5739. ENV: Agent did: predict-yes for direction L in state State-B
  5740. In State-B moving L
  5741. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5742. predict error 0
  5743. dir: dir isU
  5744. |812: O: O1624 (predict-no)
  5745. I see 1 and I'm going to do: predict-no
  5746. ENV: Agent did: predict-no for direction U in state State-A
  5747. In State-A moving U
  5748. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5749. predict error 0
  5750. dir: dir isL
  5751. \-813: O: O1626 (predict-no)
  5752. I see 1 and I'm going to do: predict-no
  5753. ENV: Agent did: predict-no for direction L in state State-A
  5754. In State-A moving L
  5755. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5756. predict error 0
  5757. dir: dir isR
  5758. /|814: O: O1627 (predict-yes)
  5759. I see 1 and I'm going to do: predict-yes
  5760. ENV: Agent did: predict-yes for direction R in state State-A
  5761. In State-A moving R
  5762. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5763. predict error 0
  5764. dir: dir isU
  5765. \-/815: O: O1630 (predict-no)
  5766. I see 1 and I'm going to do: predict-no
  5767. ENV: Agent did: predict-no for direction U in state State-B
  5768. In State-B moving U
  5769. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5770. predict error 0
  5771. dir: dir isL
  5772. |\-816: O: O1631 (predict-yes)
  5773. I see 1 and I'm going to do: predict-yes
  5774. ENV: Agent did: predict-yes for direction L in state State-B
  5775. In State-B moving L
  5776. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5777. predict error 0
  5778. dir: dir isR
  5779. /|\817: O: O1633 (predict-yes)
  5780. I see 1 and I'm going to do: predict-yes
  5781. ENV: Agent did: predict-yes for direction R in state State-A
  5782. In State-A moving R
  5783. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5784. predict error 0
  5785. dir: dir isL
  5786. -/|\818: O: O1635 (predict-yes)
  5787. I see 1 and I'm going to do: predict-yes
  5788. ENV: Agent did: predict-yes for direction L in state State-B
  5789. In State-B moving L
  5790. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5791. predict error 0
  5792. dir: dir isL
  5793. -/|819: O: O1638 (predict-no)
  5794. I see 1 and I'm going to do: predict-no
  5795. ENV: Agent did: predict-no for direction L in state State-A
  5796. In State-A moving L
  5797. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5798. predict error 0
  5799. dir: dir isU
  5800. \-/820: O: O1640 (predict-no)
  5801. I see 1 and I'm going to do: predict-no
  5802. ENV: Agent did: predict-no for direction U in state State-A
  5803. In State-A moving U
  5804. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5805. predict error 0
  5806. dir: dir isR
  5807. |\-821: O: O1641 (predict-yes)
  5808. I see 1 and I'm going to do: predict-yes
  5809. ENV: Agent did: predict-yes for direction R in state State-A
  5810. In State-A moving R
  5811. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5812. predict error 0
  5813. dir: dir isL
  5814. /822: O: O1643 (predict-yes)
  5815. I see 1 and I'm going to do: predict-yes
  5816. ENV: Agent did: predict-yes for direction L in state State-B
  5817. In State-B moving L
  5818. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5819. predict error 0
  5820. dir: dir isR
  5821. |\-823: O: O1645 (predict-yes)
  5822. I see 1 and I'm going to do: predict-yes
  5823. ENV: Agent did: predict-yes for direction R in state State-A
  5824. In State-A moving R
  5825. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5826. predict error 0
  5827. dir: dir isL
  5828. /|\-824: O: O1647 (predict-yes)
  5829. I see 1 and I'm going to do: predict-yes
  5830. ENV: Agent did: predict-yes for direction L in state State-B
  5831. In State-B moving L
  5832. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5833. predict error 0
  5834. dir: dir isL
  5835. /|\825: O: O1650 (predict-no)
  5836. I see 1 and I'm going to do: predict-no
  5837. ENV: Agent did: predict-no for direction L in state State-A
  5838. In State-A moving L
  5839. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5840. predict error 0
  5841. dir: dir isR
  5842. -/|826: O: O1651 (predict-yes)
  5843. I see 1 and I'm going to do: predict-yes
  5844. ENV: Agent did: predict-yes for direction R in state State-A
  5845. In State-A moving R
  5846. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5847. predict error 0
  5848. dir: dir isU
  5849. \-827: O: O1654 (predict-no)
  5850. I see 1 and I'm going to do: predict-no
  5851. ENV: Agent did: predict-no for direction U in state State-B
  5852. In State-B moving U
  5853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5854. predict error 0
  5855. dir: dir isR
  5856. /|\828: O: O1656 (predict-no)
  5857. I see 1 and I'm going to do: predict-no
  5858. ENV: Agent did: predict-no for direction R in state State-B
  5859. In State-B moving R
  5860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5861. predict error 0
  5862. dir: dir isL
  5863. -/829: O: O1657 (predict-yes)
  5864. I see 1 and I'm going to do: predict-yes
  5865. ENV: Agent did: predict-yes for direction L in state State-B
  5866. In State-B moving L
  5867. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5868. predict error 0
  5869. dir: dir isU
  5870. |\-830: O: O1660 (predict-no)
  5871. I see 1 and I'm going to do: predict-no
  5872. ENV: Agent did: predict-no for direction U in state State-A
  5873. In State-A moving U
  5874. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5875. predict error 0
  5876. dir: dir isU
  5877. /|\831: O: O1662 (predict-no)
  5878. I see 1 and I'm going to do: predict-no
  5879. ENV: Agent did: predict-no for direction U in state State-A
  5880. In State-A moving U
  5881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5882. predict error 0
  5883. dir: dir isU
  5884. -832: O: O1664 (predict-no)
  5885. I see 1 and I'm going to do: predict-no
  5886. ENV: Agent did: predict-no for direction U in state State-A
  5887. In State-A moving U
  5888. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5889. predict error 0
  5890. dir: dir isR
  5891. /|\833: O: O1665 (predict-yes)
  5892. I see 1 and I'm going to do: predict-yes
  5893. ENV: Agent did: predict-yes for direction R in state State-A
  5894. In State-A moving R
  5895. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5896. predict error 0
  5897. dir: dir isU
  5898. -/|834: O: O1668 (predict-no)
  5899. I see 1 and I'm going to do: predict-no
  5900. ENV: Agent did: predict-no for direction U in state State-B
  5901. In State-B moving U
  5902. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5903. predict error 0
  5904. dir: dir isL
  5905. \-/835: O: O1669 (predict-yes)
  5906. I see 1 and I'm going to do: predict-yes
  5907. ENV: Agent did: predict-yes for direction L in state State-B
  5908. In State-B moving L
  5909. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5910. predict error 0
  5911. dir: dir isU
  5912. |\-836: O: O1672 (predict-no)
  5913. I see 1 and I'm going to do: predict-no
  5914. ENV: Agent did: predict-no for direction U in state State-A
  5915. In State-A moving U
  5916. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5917. predict error 0
  5918. dir: dir isU
  5919. /|\837: O: O1674 (predict-no)
  5920. I see 1 and I'm going to do: predict-no
  5921. ENV: Agent did: predict-no for direction U in state State-A
  5922. In State-A moving U
  5923. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5924. predict error 0
  5925. dir: dir isU
  5926. -/838: O: O1676 (predict-no)
  5927. I see 1 and I'm going to do: predict-no
  5928. ENV: Agent did: predict-no for direction U in state State-A
  5929. In State-A moving U
  5930. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5931. predict error 0
  5932. dir: dir isR
  5933. |\-839: O: O1677 (predict-yes)
  5934. I see 1 and I'm going to do: predict-yes
  5935. ENV: Agent did: predict-yes for direction R in state State-A
  5936. In State-A moving R
  5937. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5938. predict error 0
  5939. dir: dir isR
  5940. /|\840: O: O1680 (predict-no)
  5941. I see 1 and I'm going to do: predict-no
  5942. ENV: Agent did: predict-no for direction R in state State-B
  5943. In State-B moving R
  5944. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5945. predict error 0
  5946. dir: dir isR
  5947. -/|841: O: O1682 (predict-no)
  5948. I see 1 and I'm going to do: predict-no
  5949. ENV: Agent did: predict-no for direction R in state State-B
  5950. In State-B moving R
  5951. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5952. predict error 0
  5953. dir: dir isU
  5954. \842: O: O1684 (predict-no)
  5955. I see 1 and I'm going to do: predict-no
  5956. ENV: Agent did: predict-no for direction U in state State-B
  5957. In State-B moving U
  5958. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5959. predict error 0
  5960. dir: dir isL
  5961. -/|843: O: O1685 (predict-yes)
  5962. I see 1 and I'm going to do: predict-yes
  5963. ENV: Agent did: predict-yes for direction L in state State-B
  5964. In State-B moving L
  5965. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5966. predict error 0
  5967. dir: dir isU
  5968. \-/844: O: O1688 (predict-no)
  5969. I see 1 and I'm going to do: predict-no
  5970. ENV: Agent did: predict-no for direction U in state State-A
  5971. In State-A moving U
  5972. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5973. predict error 0
  5974. dir: dir isR
  5975. |\-845: O: O1689 (predict-yes)
  5976. I see 1 and I'm going to do: predict-yes
  5977. ENV: Agent did: predict-yes for direction R in state State-A
  5978. In State-A moving R
  5979. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5980. predict error 0
  5981. dir: dir isR
  5982. /846: O: O1692 (predict-no)
  5983. I see 1 and I'm going to do: predict-no
  5984. ENV: Agent did: predict-no for direction R in state State-B
  5985. In State-B moving R
  5986. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5987. predict error 0
  5988. dir: dir isR
  5989. |\-847: O: O1694 (predict-no)
  5990. I see 1 and I'm going to do: predict-no
  5991. ENV: Agent did: predict-no for direction R in state State-B
  5992. In State-B moving R
  5993. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5994. predict error 0
  5995. dir: dir isL
  5996. /|\848: O: O1695 (predict-yes)
  5997. I see 1 and I'm going to do: predict-yes
  5998. ENV: Agent did: predict-yes for direction L in state State-B
  5999. In State-B moving L
  6000. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6001. predict error 0
  6002. dir: dir isL
  6003. -/|849: O: O1698 (predict-no)
  6004. I see 1 and I'm going to do: predict-no
  6005. ENV: Agent did: predict-no for direction L in state State-A
  6006. In State-A moving L
  6007. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6008. predict error 0
  6009. dir: dir isR
  6010. \-/850: O: O1699 (predict-yes)
  6011. I see 1 and I'm going to do: predict-yes
  6012. ENV: Agent did: predict-yes for direction R in state State-A
  6013. In State-A moving R
  6014. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6015. predict error 0
  6016. dir: dir isR
  6017. |\-851: O: O1702 (predict-no)
  6018. I see 1 and I'm going to do: predict-no
  6019. ENV: Agent did: predict-no for direction R in state State-B
  6020. In State-B moving R
  6021. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6022. predict error 0
  6023. dir: dir isR
  6024. /852: O: O1704 (predict-no)
  6025. I see 1 and I'm going to do: predict-no
  6026. ENV: Agent did: predict-no for direction R in state State-B
  6027. In State-B moving R
  6028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6029. predict error 0
  6030. dir: dir isU
  6031. |\853: O: O1706 (predict-no)
  6032. I see 1 and I'm going to do: predict-no
  6033. ENV: Agent did: predict-no for direction U in state State-B
  6034. In State-B moving U
  6035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6036. predict error 0
  6037. dir: dir isR
  6038. -/854: O: O1708 (predict-no)
  6039. I see 1 and I'm going to do: predict-no
  6040. ENV: Agent did: predict-no for direction R in state State-B
  6041. In State-B moving R
  6042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6043. predict error 0
  6044. dir: dir isL
  6045. |\-855: O: O1709 (predict-yes)
  6046. I see 1 and I'm going to do: predict-yes
  6047. ENV: Agent did: predict-yes for direction L in state State-B
  6048. In State-B moving L
  6049. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6050. predict error 0
  6051. dir: dir isU
  6052. /|\856: O: O1712 (predict-no)
  6053. I see 1 and I'm going to do: predict-no
  6054. ENV: Agent did: predict-no for direction U in state State-A
  6055. In State-A moving U
  6056. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6057. predict error 0
  6058. dir: dir isL
  6059. -857: O: O1714 (predict-no)
  6060. I see 1 and I'm going to do: predict-no
  6061. ENV: Agent did: predict-no for direction L in state State-A
  6062. In State-A moving L
  6063. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6064. predict error 0
  6065. dir: dir isR
  6066. /|\858: O: O1715 (predict-yes)
  6067. I see 1 and I'm going to do: predict-yes
  6068. ENV: Agent did: predict-yes for direction R in state State-A
  6069. In State-A moving R
  6070. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6071. predict error 0
  6072. dir: dir isU
  6073. -/859: O: O1718 (predict-no)
  6074. I see 1 and I'm going to do: predict-no
  6075. ENV: Agent did: predict-no for direction U in state State-B
  6076. In State-B moving U
  6077. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6078. predict error 0
  6079. dir: dir isU
  6080. |\-860: O: O1720 (predict-no)
  6081. I see 1 and I'm going to do: predict-no
  6082. ENV: Agent did: predict-no for direction U in state State-B
  6083. In State-B moving U
  6084. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6085. predict error 0
  6086. dir: dir isU
  6087. /|\861: O: O1722 (predict-no)
  6088. I see 1 and I'm going to do: predict-no
  6089. ENV: Agent did: predict-no for direction U in state State-B
  6090. In State-B moving U
  6091. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6092. predict error 0
  6093. dir: dir isR
  6094. -862: O: O1724 (predict-no)
  6095. I see 1 and I'm going to do: predict-no
  6096. ENV: Agent did: predict-no for direction R in state State-B
  6097. In State-B moving R
  6098. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6099. predict error 0
  6100. dir: dir isL
  6101. /|863: O: O1725 (predict-yes)
  6102. I see 1 and I'm going to do: predict-yes
  6103. ENV: Agent did: predict-yes for direction L in state State-B
  6104. In State-B moving L
  6105. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6106. predict error 0
  6107. dir: dir isL
  6108. \864: O: O1728 (predict-no)
  6109. I see 1 and I'm going to do: predict-no
  6110. ENV: Agent did: predict-no for direction L in state State-A
  6111. In State-A moving L
  6112. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6113. predict error 0
  6114. dir: dir isU
  6115. -/|865: O: O1730 (predict-no)
  6116. I see 1 and I'm going to do: predict-no
  6117. ENV: Agent did: predict-no for direction U in state State-A
  6118. In State-A moving U
  6119. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6120. predict error 0
  6121. dir: dir isU
  6122. \-/866: O: O1732 (predict-no)
  6123. I see 1 and I'm going to do: predict-no
  6124. ENV: Agent did: predict-no for direction U in state State-A
  6125. In State-A moving U
  6126. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6127. predict error 0
  6128. dir: dir isR
  6129. |\-867: O: O1733 (predict-yes)
  6130. I see 1 and I'm going to do: predict-yes
  6131. ENV: Agent did: predict-yes for direction R in state State-A
  6132. In State-A moving R
  6133. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6134. predict error 0
  6135. dir: dir isR
  6136. /|\868: O: O1736 (predict-no)
  6137. I see 1 and I'm going to do: predict-no
  6138. ENV: Agent did: predict-no for direction R in state State-B
  6139. In State-B moving R
  6140. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6141. predict error 0
  6142. dir: dir isU
  6143. -/|869: O: O1738 (predict-no)
  6144. I see 1 and I'm going to do: predict-no
  6145. ENV: Agent did: predict-no for direction U in state State-B
  6146. In State-B moving U
  6147. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6148. predict error 0
  6149. dir: dir isR
  6150. \-/870: O: O1740 (predict-no)
  6151. I see 1 and I'm going to do: predict-no
  6152. ENV: Agent did: predict-no for direction R in state State-B
  6153. In State-B moving R
  6154. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6155. predict error 0
  6156. dir: dir isL
  6157. |\-871: O: O1741 (predict-yes)
  6158. I see 1 and I'm going to do: predict-yes
  6159. ENV: Agent did: predict-yes for direction L in state State-B
  6160. In State-B moving L
  6161. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6162. predict error 0
  6163. dir: dir isU
  6164. /872: O: O1744 (predict-no)
  6165. I see 1 and I'm going to do: predict-no
  6166. ENV: Agent did: predict-no for direction U in state State-A
  6167. In State-A moving U
  6168. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6169. predict error 0
  6170. dir: dir isL
  6171. |\-873: O: O1746 (predict-no)
  6172. I see 1 and I'm going to do: predict-no
  6173. ENV: Agent did: predict-no for direction L in state State-A
  6174. In State-A moving L
  6175. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6176. predict error 0
  6177. dir: dir isR
  6178. /|\-874: O: O1747 (predict-yes)
  6179. I see 1 and I'm going to do: predict-yes
  6180. ENV: Agent did: predict-yes for direction R in state State-A
  6181. In State-A moving R
  6182. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6183. predict error 0
  6184. dir: dir isR
  6185. /|\-875: O: O1750 (predict-no)
  6186. I see 1 and I'm going to do: predict-no
  6187. ENV: Agent did: predict-no for direction R in state State-B
  6188. In State-B moving R
  6189. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6190. predict error 0
  6191. dir: dir isU
  6192. /|876: O: O1752 (predict-no)
  6193. I see 1 and I'm going to do: predict-no
  6194. ENV: Agent did: predict-no for direction U in state State-B
  6195. In State-B moving U
  6196. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6197. predict error 0
  6198. dir: dir isL
  6199. \-/877: O: O1753 (predict-yes)
  6200. I see 1 and I'm going to do: predict-yes
  6201. ENV: Agent did: predict-yes for direction L in state State-B
  6202. In State-B moving L
  6203. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6204. predict error 0
  6205. dir: dir isL
  6206. |\878: O: O1756 (predict-no)
  6207. I see 1 and I'm going to do: predict-no
  6208. ENV: Agent did: predict-no for direction L in state State-A
  6209. In State-A moving L
  6210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6211. predict error 0
  6212. dir: dir isR
  6213. -/|879: O: O1757 (predict-yes)
  6214. I see 1 and I'm going to do: predict-yes
  6215. ENV: Agent did: predict-yes for direction R in state State-A
  6216. In State-A moving R
  6217. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6218. predict error 0
  6219. dir: dir isL
  6220. \-/|880: O: O1759 (predict-yes)
  6221. I see 1 and I'm going to do: predict-yes
  6222. ENV: Agent did: predict-yes for direction L in state State-B
  6223. In State-B moving L
  6224. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6225. predict error 0
  6226. dir: dir isL
  6227. \-/881: O: O1762 (predict-no)
  6228. I see 1 and I'm going to do: predict-no
  6229. ENV: Agent did: predict-no for direction L in state State-A
  6230. In State-A moving L
  6231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6232. predict error 0
  6233. dir: dir isL
  6234. |882: O: O1764 (predict-no)
  6235. I see 1 and I'm going to do: predict-no
  6236. ENV: Agent did: predict-no for direction L in state State-A
  6237. In State-A moving L
  6238. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6239. predict error 0
  6240. dir: dir isR
  6241. \-/883: O: O1765 (predict-yes)
  6242. I see 1 and I'm going to do: predict-yes
  6243. ENV: Agent did: predict-yes for direction R in state State-A
  6244. In State-A moving R
  6245. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6246. predict error 0
  6247. dir: dir isL
  6248. |\884: O: O1767 (predict-yes)
  6249. I see 1 and I'm going to do: predict-yes
  6250. ENV: Agent did: predict-yes for direction L in state State-B
  6251. In State-B moving L
  6252. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6253. predict error 0
  6254. dir: dir isL
  6255. -/|885: O: O1770 (predict-no)
  6256. I see 1 and I'm going to do: predict-no
  6257. ENV: Agent did: predict-no for direction L in state State-A
  6258. In State-A moving L
  6259. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6260. predict error 0
  6261. dir: dir isU
  6262. \-/886: O: O1772 (predict-no)
  6263. I see 1 and I'm going to do: predict-no
  6264. ENV: Agent did: predict-no for direction U in state State-A
  6265. In State-A moving U
  6266. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6267. predict error 0
  6268. dir: dir isR
  6269. |\-887: O: O1773 (predict-yes)
  6270. I see 1 and I'm going to do: predict-yes
  6271. ENV: Agent did: predict-yes for direction R in state State-A
  6272. In State-A moving R
  6273. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6274. predict error 0
  6275. dir: dir isU
  6276. /|\888: O: O1776 (predict-no)
  6277. I see 1 and I'm going to do: predict-no
  6278. ENV: Agent did: predict-no for direction U in state State-B
  6279. In State-B moving U
  6280. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6281. predict error 0
  6282. dir: dir isL
  6283. -/889: O: O1777 (predict-yes)
  6284. I see 1 and I'm going to do: predict-yes
  6285. ENV: Agent did: predict-yes for direction L in state State-B
  6286. In State-B moving L
  6287. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6288. predict error 0
  6289. dir: dir isR
  6290. |\-890: O: O1779 (predict-yes)
  6291. I see 1 and I'm going to do: predict-yes
  6292. ENV: Agent did: predict-yes for direction R in state State-A
  6293. In State-A moving R
  6294. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6295. predict error 0
  6296. dir: dir isR
  6297. /|\891: O: O1782 (predict-no)
  6298. I see 1 and I'm going to do: predict-no
  6299. ENV: Agent did: predict-no for direction R in state State-B
  6300. In State-B moving R
  6301. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6302. predict error 0
  6303. dir: dir isL
  6304. -892: O: O1783 (predict-yes)
  6305. I see 1 and I'm going to do: predict-yes
  6306. ENV: Agent did: predict-yes for direction L in state State-B
  6307. In State-B moving L
  6308. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6309. predict error 0
  6310. dir: dir isL
  6311. /|893: O: O1786 (predict-no)
  6312. I see 1 and I'm going to do: predict-no
  6313. ENV: Agent did: predict-no for direction L in state State-A
  6314. In State-A moving L
  6315. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6316. predict error 0
  6317. dir: dir isU
  6318. \-/894: O: O1788 (predict-no)
  6319. I see 1 and I'm going to do: predict-no
  6320. ENV: Agent did: predict-no for direction U in state State-A
  6321. In State-A moving U
  6322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6323. predict error 0
  6324. dir: dir isU
  6325. |\895: O: O1790 (predict-no)
  6326. I see 1 and I'm going to do: predict-no
  6327. ENV: Agent did: predict-no for direction U in state State-A
  6328. In State-A moving U
  6329. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6330. predict error 0
  6331. dir: dir isR
  6332. -/|896: O: O1791 (predict-yes)
  6333. I see 1 and I'm going to do: predict-yes
  6334. ENV: Agent did: predict-yes for direction R in state State-A
  6335. In State-A moving R
  6336. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6337. predict error 0
  6338. dir: dir isR
  6339. \-/|897: O: O1794 (predict-no)
  6340. I see 1 and I'm going to do: predict-no
  6341. ENV: Agent did: predict-no for direction R in state State-B
  6342. In State-B moving R
  6343. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6344. predict error 0
  6345. dir: dir isL
  6346. \-/898: O: O1795 (predict-yes)
  6347. I see 1 and I'm going to do: predict-yes
  6348. ENV: Agent did: predict-yes for direction L in state State-B
  6349. In State-B moving L
  6350. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6351. predict error 0
  6352. dir: dir isR
  6353. |\-899: O: O1797 (predict-yes)
  6354. I see 1 and I'm going to do: predict-yes
  6355. ENV: Agent did: predict-yes for direction R in state State-A
  6356. In State-A moving R
  6357. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6358. predict error 0
  6359. dir: dir isU
  6360. /|\900: O: O1800 (predict-no)
  6361. I see 1 and I'm going to do: predict-no
  6362. ENV: Agent did: predict-no for direction U in state State-B
  6363. In State-B moving U
  6364. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6365. predict error 0
  6366. dir: dir isU
  6367. -/|901: O: O1802 (predict-no)
  6368. I see 1 and I'm going to do: predict-no
  6369. ENV: Agent did: predict-no for direction U in state State-B
  6370. In State-B moving U
  6371. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6372. predict error 0
  6373. dir: dir isR
  6374. \902: O: O1804 (predict-no)
  6375. I see 1 and I'm going to do: predict-no
  6376. ENV: Agent did: predict-no for direction R in state State-B
  6377. In State-B moving R
  6378. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6379. predict error 0
  6380. dir: dir isL
  6381. -/|903: O: O1805 (predict-yes)
  6382. I see 1 and I'm going to do: predict-yes
  6383. ENV: Agent did: predict-yes for direction L in state State-B
  6384. In State-B moving L
  6385. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6386. predict error 0
  6387. dir: dir isU
  6388. \-/904: O: O1808 (predict-no)
  6389. I see 1 and I'm going to do: predict-no
  6390. ENV: Agent did: predict-no for direction U in state State-A
  6391. In State-A moving U
  6392. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6393. predict error 0
  6394. dir: dir isR
  6395. |\-905: O: O1809 (predict-yes)
  6396. I see 1 and I'm going to do: predict-yes
  6397. ENV: Agent did: predict-yes for direction R in state State-A
  6398. In State-A moving R
  6399. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6400. predict error 0
  6401. dir: dir isR
  6402. /|\906: O: O1812 (predict-no)
  6403. I see 1 and I'm going to do: predict-no
  6404. ENV: Agent did: predict-no for direction R in state State-B
  6405. In State-B moving R
  6406. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6407. predict error 0
  6408. dir: dir isL
  6409. -/|907: O: O1813 (predict-yes)
  6410. I see 1 and I'm going to do: predict-yes
  6411. ENV: Agent did: predict-yes for direction L in state State-B
  6412. In State-B moving L
  6413. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6414. predict error 0
  6415. dir: dir isL
  6416. \-/908: O: O1816 (predict-no)
  6417. I see 1 and I'm going to do: predict-no
  6418. ENV: Agent did: predict-no for direction L in state State-A
  6419. In State-A moving L
  6420. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6421. predict error 0
  6422. dir: dir isU
  6423. |\-909: O: O1818 (predict-no)
  6424. I see 1 and I'm going to do: predict-no
  6425. ENV: Agent did: predict-no for direction U in state State-A
  6426. In State-A moving U
  6427. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6428. predict error 0
  6429. dir: dir isR
  6430. /|\910: O: O1819 (predict-yes)
  6431. I see 1 and I'm going to do: predict-yes
  6432. ENV: Agent did: predict-yes for direction R in state State-A
  6433. In State-A moving R
  6434. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6435. predict error 0
  6436. dir: dir isU
  6437. -911: O: O1822 (predict-no)
  6438. I see 1 and I'm going to do: predict-no
  6439. ENV: Agent did: predict-no for direction U in state State-B
  6440. In State-B moving U
  6441. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6442. predict error 0
  6443. dir: dir isL
  6444. /912: O: O1823 (predict-yes)
  6445. I see 1 and I'm going to do: predict-yes
  6446. ENV: Agent did: predict-yes for direction L in state State-B
  6447. In State-B moving L
  6448. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6449. predict error 0
  6450. dir: dir isL
  6451. |\-913: O: O1826 (predict-no)
  6452. I see 1 and I'm going to do: predict-no
  6453. ENV: Agent did: predict-no for direction L in state State-A
  6454. In State-A moving L
  6455. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6456. predict error 0
  6457. dir: dir isU
  6458. /|914: O: O1828 (predict-no)
  6459. I see 1 and I'm going to do: predict-no
  6460. ENV: Agent did: predict-no for direction U in state State-A
  6461. In State-A moving U
  6462. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6463. predict error 0
  6464. dir: dir isU
  6465. \-915: O: O1830 (predict-no)
  6466. I see 1 and I'm going to do: predict-no
  6467. ENV: Agent did: predict-no for direction U in state State-A
  6468. In State-A moving U
  6469. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6470. predict error 0
  6471. dir: dir isL
  6472. /|916: O: O1832 (predict-no)
  6473. I see 1 and I'm going to do: predict-no
  6474. ENV: Agent did: predict-no for direction L in state State-A
  6475. In State-A moving L
  6476. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6477. predict error 0
  6478. dir: dir isL
  6479. \-/917: O: O1834 (predict-no)
  6480. I see 1 and I'm going to do: predict-no
  6481. ENV: Agent did: predict-no for direction L in state State-A
  6482. In State-A moving L
  6483. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6484. predict error 0
  6485. dir: dir isU
  6486. |\-918: O: O1836 (predict-no)
  6487. I see 1 and I'm going to do: predict-no
  6488. ENV: Agent did: predict-no for direction U in state State-A
  6489. In State-A moving U
  6490. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6491. predict error 0
  6492. dir: dir isL
  6493. /|\919: O: O1838 (predict-no)
  6494. I see 1 and I'm going to do: predict-no
  6495. ENV: Agent did: predict-no for direction L in state State-A
  6496. In State-A moving L
  6497. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6498. predict error 0
  6499. dir: dir isU
  6500. -/|920: O: O1840 (predict-no)
  6501. I see 1 and I'm going to do: predict-no
  6502. ENV: Agent did: predict-no for direction U in state State-A
  6503. In State-A moving U
  6504. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6505. predict error 0
  6506. dir: dir isU
  6507. \-/921: O: O1842 (predict-no)
  6508. I see 1 and I'm going to do: predict-no
  6509. ENV: Agent did: predict-no for direction U in state State-A
  6510. In State-A moving U
  6511. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6512. predict error 0
  6513. dir: dir isL
  6514. |922: O: O1844 (predict-no)
  6515. I see 1 and I'm going to do: predict-no
  6516. ENV: Agent did: predict-no for direction L in state State-A
  6517. In State-A moving L
  6518. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6519. predict error 0
  6520. dir: dir isL
  6521. \-/923: O: O1846 (predict-no)
  6522. I see 1 and I'm going to do: predict-no
  6523. ENV: Agent did: predict-no for direction L in state State-A
  6524. In State-A moving L
  6525. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6526. predict error 0
  6527. dir: dir isU
  6528. |\-924: O: O1848 (predict-no)
  6529. I see 1 and I'm going to do: predict-no
  6530. ENV: Agent did: predict-no for direction U in state State-A
  6531. In State-A moving U
  6532. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6533. predict error 0
  6534. dir: dir isR
  6535. /|\-925: O: O1849 (predict-yes)
  6536. I see 1 and I'm going to do: predict-yes
  6537. ENV: Agent did: predict-yes for direction R in state State-A
  6538. In State-A moving R
  6539. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6540. predict error 0
  6541. dir: dir isR
  6542. /|\926: O: O1852 (predict-no)
  6543. I see 1 and I'm going to do: predict-no
  6544. ENV: Agent did: predict-no for direction R in state State-B
  6545. In State-B moving R
  6546. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6547. predict error 0
  6548. dir: dir isR
  6549. -927: O: O1854 (predict-no)
  6550. I see 1 and I'm going to do: predict-no
  6551. ENV: Agent did: predict-no for direction R in state State-B
  6552. In State-B moving R
  6553. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6554. predict error 0
  6555. dir: dir isL
  6556. /|928: O: O1855 (predict-yes)
  6557. I see 1 and I'm going to do: predict-yes
  6558. ENV: Agent did: predict-yes for direction L in state State-B
  6559. In State-B moving L
  6560. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6561. predict error 0
  6562. dir: dir isR
  6563. \-/|929: O: O1857 (predict-yes)
  6564. I see 1 and I'm going to do: predict-yes
  6565. ENV: Agent did: predict-yes for direction R in state State-A
  6566. In State-A moving R
  6567. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6568. predict error 0
  6569. dir: dir isL
  6570. \-/930: O: O1859 (predict-yes)
  6571. I see 1 and I'm going to do: predict-yes
  6572. ENV: Agent did: predict-yes for direction L in state State-B
  6573. In State-B moving L
  6574. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6575. predict error 0
  6576. dir: dir isU
  6577. |\-931: O: O1862 (predict-no)
  6578. I see 1 and I'm going to do: predict-no
  6579. ENV: Agent did: predict-no for direction U in state State-A
  6580. In State-A moving U
  6581. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6582. predict error 0
  6583. dir: dir isU
  6584. /932: O: O1864 (predict-no)
  6585. I see 1 and I'm going to do: predict-no
  6586. ENV: Agent did: predict-no for direction U in state State-A
  6587. In State-A moving U
  6588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6589. predict error 0
  6590. dir: dir isL
  6591. |\933: O: O1866 (predict-no)
  6592. I see 1 and I'm going to do: predict-no
  6593. ENV: Agent did: predict-no for direction L in state State-A
  6594. In State-A moving L
  6595. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6596. predict error 0
  6597. dir: dir isU
  6598. -/|934: O: O1868 (predict-no)
  6599. I see 1 and I'm going to do: predict-no
  6600. ENV: Agent did: predict-no for direction U in state State-A
  6601. In State-A moving U
  6602. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6603. predict error 0
  6604. dir: dir isL
  6605. \-/935: O: O1870 (predict-no)
  6606. I see 1 and I'm going to do: predict-no
  6607. ENV: Agent did: predict-no for direction L in state State-A
  6608. In State-A moving L
  6609. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6610. predict error 0
  6611. dir: dir isL
  6612. |\936: O: O1872 (predict-no)
  6613. I see 1 and I'm going to do: predict-no
  6614. ENV: Agent did: predict-no for direction L in state State-A
  6615. In State-A moving L
  6616. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6617. predict error 0
  6618. dir: dir isL
  6619. -/|\937: O: O1874 (predict-no)
  6620. I see 1 and I'm going to do: predict-no
  6621. ENV: Agent did: predict-no for direction L in state State-A
  6622. In State-A moving L
  6623. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6624. predict error 0
  6625. dir: dir isL
  6626. -/938: O: O1876 (predict-no)
  6627. I see 1 and I'm going to do: predict-no
  6628. ENV: Agent did: predict-no for direction L in state State-A
  6629. In State-A moving L
  6630. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6631. predict error 0
  6632. dir: dir isL
  6633. |\-/939: O: O1878 (predict-no)
  6634. I see 1 and I'm going to do: predict-no
  6635. ENV: Agent did: predict-no for direction L in state State-A
  6636. In State-A moving L
  6637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6638. predict error 0
  6639. dir: dir isR
  6640. |\-940: O: O1879 (predict-yes)
  6641. I see 1 and I'm going to do: predict-yes
  6642. ENV: Agent did: predict-yes for direction R in state State-A
  6643. In State-A moving R
  6644. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6645. predict error 0
  6646. dir: dir isU
  6647. /|\941: O: O1882 (predict-no)
  6648. I see 1 and I'm going to do: predict-no
  6649. ENV: Agent did: predict-no for direction U in state State-B
  6650. In State-B moving U
  6651. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6652. predict error 0
  6653. dir: dir isR
  6654. -942: O: O1884 (predict-no)
  6655. I see 1 and I'm going to do: predict-no
  6656. ENV: Agent did: predict-no for direction R in state State-B
  6657. In State-B moving R
  6658. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6659. predict error 0
  6660. dir: dir isL
  6661. /|\943: O: O1885 (predict-yes)
  6662. I see 1 and I'm going to do: predict-yes
  6663. ENV: Agent did: predict-yes for direction L in state State-B
  6664. In State-B moving L
  6665. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6666. predict error 0
  6667. dir: dir isR
  6668. -944: O: O1887 (predict-yes)
  6669. I see 1 and I'm going to do: predict-yes
  6670. ENV: Agent did: predict-yes for direction R in state State-A
  6671. In State-A moving R
  6672. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6673. predict error 0
  6674. dir: dir isR
  6675. /|\945: O: O1890 (predict-no)
  6676. I see 1 and I'm going to do: predict-no
  6677. ENV: Agent did: predict-no for direction R in state State-B
  6678. In State-B moving R
  6679. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6680. predict error 0
  6681. dir: dir isL
  6682. -/|946: O: O1891 (predict-yes)
  6683. I see 1 and I'm going to do: predict-yes
  6684. ENV: Agent did: predict-yes for direction L in state State-B
  6685. In State-B moving L
  6686. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6687. predict error 0
  6688. dir: dir isU
  6689. \-/947: O: O1894 (predict-no)
  6690. I see 1 and I'm going to do: predict-no
  6691. ENV: Agent did: predict-no for direction U in state State-A
  6692. In State-A moving U
  6693. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6694. predict error 0
  6695. dir: dir isR
  6696. |\-948: O: O1895 (predict-yes)
  6697. I see 1 and I'm going to do: predict-yes
  6698. ENV: Agent did: predict-yes for direction R in state State-A
  6699. In State-A moving R
  6700. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6701. predict error 0
  6702. dir: dir isU
  6703. /|\949: O: O1898 (predict-no)
  6704. I see 1 and I'm going to do: predict-no
  6705. ENV: Agent did: predict-no for direction U in state State-B
  6706. In State-B moving U
  6707. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6708. predict error 0
  6709. dir: dir isU
  6710. -/|950: O: O1900 (predict-no)
  6711. I see 1 and I'm going to do: predict-no
  6712. ENV: Agent did: predict-no for direction U in state State-B
  6713. In State-B moving U
  6714. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6715. predict error 0
  6716. dir: dir isR
  6717. \-/|\-/|\-/--- Input Phase ---
  6718. =>WM: (13382: I2 ^dir R)
  6719. =>WM: (13381: I2 ^reward 1)
  6720. =>WM: (13380: I2 ^see 0)
  6721. =>WM: (13379: N950 ^status complete)
  6722. <=WM: (13368: I2 ^dir U)
  6723. <=WM: (13367: I2 ^reward 1)
  6724. <=WM: (13366: I2 ^see 0)
  6725. =>WM: (13383: I2 ^level-1 R1-root)
  6726. <=WM: (13369: I2 ^level-1 R1-root)
  6727. --- END Input Phase ---
  6728. --- Proposal Phase ---
  6729. --- Inner Elaboration Phase, active level 1 (S1) ---
  6730. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6731. -->
  6732. (S1 ^operator O1899 = -0.1070236389116304)
  6733. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6734. -->
  6735. (S1 ^operator O1900 = 0.66025212945601)
  6736. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6737. -->
  6738. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6739. -->
  6740. Firing elaborate*copy-see-to-output-link
  6741. -->
  6742. (I3 ^see 0 +)
  6743. Firing elaborate*reward*based*on*reward
  6744. -->
  6745. (R954 ^value 1 +)
  6746. (R1 ^reward R954 +)
  6747. Firing propose*predict-yes
  6748. -->
  6749. (O1901 ^name predict-yes +)
  6750. (S1 ^operator O1901 +)
  6751. Firing propose*predict-no
  6752. -->
  6753. (O1902 ^name predict-no +)
  6754. (S1 ^operator O1902 +)
  6755. Firing rl*prefer*rvt*predict-no*H0*4
  6756. -->
  6757. (S1 ^operator O1900 = 0.3397665963572414)
  6758. Firing rl*prefer*rvt*predict-yes*H0*3
  6759. -->
  6760. (S1 ^operator O1899 = 0.3377110766337923)
  6761. Firing prefer*rvt*predict-yes*H0
  6762. -->
  6763. Firing prefer*rvt*predict-no*H0
  6764. -->
  6765. Firing elaborate*copy-dir-to-output-link
  6766. -->
  6767. (I3 ^dir R +)
  6768. inner elaboration loop at bottom goal.
  6769. Retracting elaborate*copy-see-to-output-link
  6770. -->
  6771. (I3 ^see 0 +)
  6772. Retracting propose*predict-no
  6773. -->
  6774. (O1900 ^name predict-no +)
  6775. (S1 ^operator O1900 +)
  6776. Retracting propose*predict-yes
  6777. -->
  6778. (O1899 ^name predict-yes +)
  6779. (S1 ^operator O1899 +)
  6780. Retracting elaborate*reward*based*on*reward
  6781. -->
  6782. (R953 ^value 1 +)
  6783. (R1 ^reward R953 +)
  6784. Retracting elaborate*copy-dir-to-output-link
  6785. -->
  6786. (I3 ^dir U +)
  6787. Retracting rl*prefer*rvt*predict-no*H0*2
  6788. -->
  6789. (S1 ^operator O1900 = 1.)
  6790. Retracting rl*prefer*rvt*predict-yes*H0*1
  6791. -->
  6792. (S1 ^operator O1899 = 0.)
  6793. =>WM: (13390: S1 ^operator O1902 +)
  6794. =>WM: (13389: S1 ^operator O1901 +)
  6795. =>WM: (13388: I3 ^dir R)
  6796. =>WM: (13387: O1902 ^name predict-no)
  6797. =>WM: (13386: O1901 ^name predict-yes)
  6798. =>WM: (13385: R954 ^value 1)
  6799. =>WM: (13384: R1 ^reward R954)
  6800. <=WM: (13375: S1 ^operator O1899 +)
  6801. <=WM: (13376: S1 ^operator O1900 +)
  6802. <=WM: (13377: S1 ^operator O1900)
  6803. <=WM: (13360: I3 ^dir U)
  6804. <=WM: (13371: R1 ^reward R953)
  6805. <=WM: (13374: O1900 ^name predict-no)
  6806. <=WM: (13373: O1899 ^name predict-yes)
  6807. <=WM: (13372: R953 ^value 1)
  6808. --- Inner Elaboration Phase, active level 1 (S1) ---
  6809. Firing prefer*rvt*predict-yes*H0
  6810. -->
  6811. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6812. -->
  6813. (S1 ^operator O1901 = -0.1070236389116304)
  6814. Firing rl*prefer*rvt*predict-yes*H0*3
  6815. -->
  6816. (S1 ^operator O1901 = 0.3377110766337923)
  6817. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  6818. -->
  6819. Firing prefer*rvt*predict-no*H0
  6820. -->
  6821. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6822. -->
  6823. (S1 ^operator O1902 = 0.66025212945601)
  6824. Firing rl*prefer*rvt*predict-no*H0*4
  6825. -->
  6826. (S1 ^operator O1902 = 0.3397665963572414)
  6827. Firing prefer*rvt*predict-no*H0*4*v1*H1
  6828. -->
  6829. inner elaboration loop at bottom goal.
  6830. Retracting rl*prefer*rvt*predict-no*H0*4
  6831. -->
  6832. (S1 ^operator O1900 = 0.3397665963572414)
  6833. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6834. -->
  6835. (S1 ^operator O1900 = 0.66025212945601)
  6836. Retracting rl*prefer*rvt*predict-yes*H0*3
  6837. -->
  6838. (S1 ^operator O1899 = 0.3377110766337923)
  6839. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6840. -->
  6841. (S1 ^operator O1899 = -0.1070236389116304)
  6842. --- END Proposal Phase ---
  6843. --- Decision Phase ---
  6844. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6845. =>WM: (13391: S1 ^operator O1902)
  6846. 951: O: O1902 (predict-no)
  6847. --- END Decision Phase ---
  6848. --- Application Phase ---
  6849. --- Firing Productions (PE) For State At Depth 1 ---
  6850. --- Inner Elaboration Phase, active level 1 (S1) ---
  6851. Firing apply*operator
  6852. -->
  6853. (I3 ^predict-no N951 + :O )
  6854. Firing apply*operator*complete
  6855. -->
  6856. (I3 ^predict-no N950 - :O )
  6857. inner elaboration loop at bottom goal.
  6858. --- Change Working Memory (PE) ---
  6859. =>WM: (13392: I3 ^predict-no N951)
  6860. <=WM: (13379: N950 ^status complete)
  6861. <=WM: (13378: I3 ^predict-no N950)
  6862. --- Firing Productions (IE) For State At Depth 1 ---
  6863. --- Inner Elaboration Phase, active level 1 (S1) ---
  6864. Firing monitor*world
  6865. -->
  6866. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6867. --- Change Working Memory (IE) ---
  6868. --- END Application Phase ---
  6869. --- Output Phase ---
  6870. ENV: Agent did: predict-no for direction R in state State-B
  6871. In State-B moving R
  6872. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6873. predict error 0
  6874. dir: dir isL
  6875. --- END Output Phase ---
  6876. |--- Input Phase ---
  6877. =>WM: (13396: I2 ^dir L)
  6878. =>WM: (13395: I2 ^reward 1)
  6879. =>WM: (13394: I2 ^see 0)
  6880. =>WM: (13393: N951 ^status complete)
  6881. <=WM: (13382: I2 ^dir R)
  6882. <=WM: (13381: I2 ^reward 1)
  6883. <=WM: (13380: I2 ^see 0)
  6884. =>WM: (13397: I2 ^level-1 R0-root)
  6885. <=WM: (13383: I2 ^level-1 R1-root)
  6886. --- END Input Phase ---
  6887. --- Proposal Phase ---
  6888. --- Inner Elaboration Phase, active level 1 (S1) ---
  6889. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  6890. -->
  6891. (S1 ^operator O1901 = 0.735786774178754)
  6892. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  6893. -->
  6894. Firing elaborate*copy-see-to-output-link
  6895. -->
  6896. (I3 ^see 0 +)
  6897. Firing elaborate*reward*based*on*reward
  6898. -->
  6899. (R955 ^value 1 +)
  6900. (R1 ^reward R955 +)
  6901. Firing propose*predict-yes
  6902. -->
  6903. (O1903 ^name predict-yes +)
  6904. (S1 ^operator O1903 +)
  6905. Firing propose*predict-no
  6906. -->
  6907. (O1904 ^name predict-no +)
  6908. (S1 ^operator O1904 +)
  6909. Firing rl*prefer*rvt*predict-no*H0*6
  6910. -->
  6911. (S1 ^operator O1902 = 0.9996367744406318)
  6912. Firing rl*prefer*rvt*predict-yes*H0*5
  6913. -->
  6914. (S1 ^operator O1901 = 0.2640533371018167)
  6915. Firing prefer*rvt*predict-yes*H0
  6916. -->
  6917. Firing prefer*rvt*predict-no*H0
  6918. -->
  6919. Firing elaborate*copy-dir-to-output-link
  6920. -->
  6921. (I3 ^dir L +)
  6922. inner elaboration loop at bottom goal.
  6923. Retracting elaborate*copy-see-to-output-link
  6924. -->
  6925. (I3 ^see 0 +)
  6926. Retracting propose*predict-no
  6927. -->
  6928. (O1902 ^name predict-no +)
  6929. (S1 ^operator O1902 +)
  6930. Retracting propose*predict-yes
  6931. -->
  6932. (O1901 ^name predict-yes +)
  6933. (S1 ^operator O1901 +)
  6934. Retracting elaborate*reward*based*on*reward
  6935. -->
  6936. (R954 ^value 1 +)
  6937. (R1 ^reward R954 +)
  6938. Retracting elaborate*copy-dir-to-output-link
  6939. -->
  6940. (I3 ^dir R +)
  6941. Retracting rl*prefer*rvt*predict-no*H0*4
  6942. -->
  6943. (S1 ^operator O1902 = 0.3397665963572414)
  6944. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  6945. -->
  6946. (S1 ^operator O1902 = 0.66025212945601)
  6947. Retracting rl*prefer*rvt*predict-yes*H0*3
  6948. -->
  6949. (S1 ^operator O1901 = 0.3377110766337923)
  6950. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  6951. -->
  6952. (S1 ^operator O1901 = -0.1070236389116304)
  6953. =>WM: (13404: S1 ^operator O1904 +)
  6954. =>WM: (13403: S1 ^operator O1903 +)
  6955. =>WM: (13402: I3 ^dir L)
  6956. =>WM: (13401: O1904 ^name predict-no)
  6957. =>WM: (13400: O1903 ^name predict-yes)
  6958. =>WM: (13399: R955 ^value 1)
  6959. =>WM: (13398: R1 ^reward R955)
  6960. <=WM: (13389: S1 ^operator O1901 +)
  6961. <=WM: (13390: S1 ^operator O1902 +)
  6962. <=WM: (13391: S1 ^operator O1902)
  6963. <=WM: (13388: I3 ^dir R)
  6964. <=WM: (13384: R1 ^reward R954)
  6965. <=WM: (13387: O1902 ^name predict-no)
  6966. <=WM: (13386: O1901 ^name predict-yes)
  6967. <=WM: (13385: R954 ^value 1)
  6968. --- Inner Elaboration Phase, active level 1 (S1) ---
  6969. Firing prefer*rvt*predict-yes*H0
  6970. -->
  6971. Firing rl*prefer*rvt*predict-yes*H0*5
  6972. -->
  6973. (S1 ^operator O1903 = 0.2640533371018167)
  6974. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  6975. -->
  6976. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  6977. -->
  6978. (S1 ^operator O1903 = 0.735786774178754)
  6979. Firing prefer*rvt*predict-no*H0
  6980. -->
  6981. Firing rl*prefer*rvt*predict-no*H0*6
  6982. -->
  6983. (S1 ^operator O1904 = 0.9996367744406318)
  6984. inner elaboration loop at bottom goal.
  6985. Retracting rl*prefer*rvt*predict-no*H0*6
  6986. -->
  6987. (S1 ^operator O1902 = 0.9996367744406318)
  6988. Retracting rl*prefer*rvt*predict-yes*H0*5
  6989. -->
  6990. (S1 ^operator O1901 = 0.2640533371018167)
  6991. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  6992. -->
  6993. (S1 ^operator O1901 = 0.735786774178754)
  6994. --- END Proposal Phase ---
  6995. --- Decision Phase ---
  6996. RL update rl*prefer*rvt*predict-no*H0*4 0.57025 -0.230483 0.339767 -> 0.570248 -0.230483 0.339765(R,m,v=1,0.87037,0.113527)
  6997. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.42977 0.230482 0.660252 -> 0.429768 0.230482 0.66025(R,m,v=1,1,0)
  6998. =>WM: (13405: S1 ^operator O1903)
  6999. 952: O: O1903 (predict-yes)
  7000. --- END Decision Phase ---
  7001. --- Application Phase ---
  7002. --- Firing Productions (PE) For State At Depth 1 ---
  7003. --- Inner Elaboration Phase, active level 1 (S1) ---
  7004. Firing apply*operator
  7005. -->
  7006. (I3 ^predict-yes N952 + :O )
  7007. Firing apply*operator*complete
  7008. -->
  7009. (I3 ^predict-no N951 - :O )
  7010. inner elaboration loop at bottom goal.
  7011. --- Change Working Memory (PE) ---
  7012. =>WM: (13406: I3 ^predict-yes N952)
  7013. <=WM: (13393: N951 ^status complete)
  7014. <=WM: (13392: I3 ^predict-no N951)
  7015. --- Firing Productions (IE) For State At Depth 1 ---
  7016. --- Inner Elaboration Phase, active level 1 (S1) ---
  7017. Firing monitor*world
  7018. -->
  7019. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7020. --- Change Working Memory (IE) ---
  7021. --- END Application Phase ---
  7022. --- Output Phase ---
  7023. ENV: Agent did: predict-yes for direction L in state State-B
  7024. In State-B moving L
  7025. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7026. predict error 0
  7027. dir: dir isU
  7028. --- END Output Phase ---
  7029. \-/--- Input Phase ---
  7030. =>WM: (13410: I2 ^dir U)
  7031. =>WM: (13409: I2 ^reward 1)
  7032. =>WM: (13408: I2 ^see 1)
  7033. =>WM: (13407: N952 ^status complete)
  7034. <=WM: (13396: I2 ^dir L)
  7035. <=WM: (13395: I2 ^reward 1)
  7036. <=WM: (13394: I2 ^see 0)
  7037. =>WM: (13411: I2 ^level-1 L1-root)
  7038. <=WM: (13397: I2 ^level-1 R0-root)
  7039. --- END Input Phase ---
  7040. --- Proposal Phase ---
  7041. --- Inner Elaboration Phase, active level 1 (S1) ---
  7042. Firing elaborate*copy-see-to-output-link
  7043. -->
  7044. (I3 ^see 1 +)
  7045. Firing elaborate*reward*based*on*reward
  7046. -->
  7047. (R956 ^value 1 +)
  7048. (R1 ^reward R956 +)
  7049. Firing propose*predict-yes
  7050. -->
  7051. (O1905 ^name predict-yes +)
  7052. (S1 ^operator O1905 +)
  7053. Firing propose*predict-no
  7054. -->
  7055. (O1906 ^name predict-no +)
  7056. (S1 ^operator O1906 +)
  7057. Firing rl*prefer*rvt*predict-no*H0*2
  7058. -->
  7059. (S1 ^operator O1904 = 1.)
  7060. Firing rl*prefer*rvt*predict-yes*H0*1
  7061. -->
  7062. (S1 ^operator O1903 = 0.)
  7063. Firing prefer*rvt*predict-yes*H0
  7064. -->
  7065. Firing prefer*rvt*predict-no*H0
  7066. -->
  7067. Firing elaborate*copy-dir-to-output-link
  7068. -->
  7069. (I3 ^dir U +)
  7070. inner elaboration loop at bottom goal.
  7071. Retracting elaborate*copy-see-to-output-link
  7072. -->
  7073. (I3 ^see 0 +)
  7074. Retracting propose*predict-no
  7075. -->
  7076. (O1904 ^name predict-no +)
  7077. (S1 ^operator O1904 +)
  7078. Retracting propose*predict-yes
  7079. -->
  7080. (O1903 ^name predict-yes +)
  7081. (S1 ^operator O1903 +)
  7082. Retracting elaborate*reward*based*on*reward
  7083. -->
  7084. (R955 ^value 1 +)
  7085. (R1 ^reward R955 +)
  7086. Retracting elaborate*copy-dir-to-output-link
  7087. -->
  7088. (I3 ^dir L +)
  7089. Retracting rl*prefer*rvt*predict-no*H0*6
  7090. -->
  7091. (S1 ^operator O1904 = 0.9996367744406318)
  7092. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7093. -->
  7094. (S1 ^operator O1903 = 0.735786774178754)
  7095. Retracting rl*prefer*rvt*predict-yes*H0*5
  7096. -->
  7097. (S1 ^operator O1903 = 0.2640533371018167)
  7098. =>WM: (13419: S1 ^operator O1906 +)
  7099. =>WM: (13418: S1 ^operator O1905 +)
  7100. =>WM: (13417: I3 ^dir U)
  7101. =>WM: (13416: O1906 ^name predict-no)
  7102. =>WM: (13415: O1905 ^name predict-yes)
  7103. =>WM: (13414: R956 ^value 1)
  7104. =>WM: (13413: R1 ^reward R956)
  7105. =>WM: (13412: I3 ^see 1)
  7106. <=WM: (13403: S1 ^operator O1903 +)
  7107. <=WM: (13405: S1 ^operator O1903)
  7108. <=WM: (13404: S1 ^operator O1904 +)
  7109. <=WM: (13402: I3 ^dir L)
  7110. <=WM: (13398: R1 ^reward R955)
  7111. <=WM: (13370: I3 ^see 0)
  7112. <=WM: (13401: O1904 ^name predict-no)
  7113. <=WM: (13400: O1903 ^name predict-yes)
  7114. <=WM: (13399: R955 ^value 1)
  7115. --- Inner Elaboration Phase, active level 1 (S1) ---
  7116. Firing prefer*rvt*predict-yes*H0
  7117. -->
  7118. Firing rl*prefer*rvt*predict-yes*H0*1
  7119. -->
  7120. (S1 ^operator O1905 = 0.)
  7121. Firing prefer*rvt*predict-no*H0
  7122. -->
  7123. Firing rl*prefer*rvt*predict-no*H0*2
  7124. -->
  7125. (S1 ^operator O1906 = 1.)
  7126. inner elaboration loop at bottom goal.
  7127. Retracting rl*prefer*rvt*predict-no*H0*2
  7128. -->
  7129. (S1 ^operator O1904 = 1.)
  7130. Retracting rl*prefer*rvt*predict-yes*H0*1
  7131. -->
  7132. (S1 ^operator O1903 = 0.)
  7133. --- END Proposal Phase ---
  7134. --- Decision Phase ---
  7135. RL update rl*prefer*rvt*predict-yes*H0*5 0.554438 -0.290385 0.264053 -> 0.554451 -0.290385 0.264066(R,m,v=1,0.872093,0.112199)
  7136. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445404 0.290382 0.735787 -> 0.44542 0.290383 0.735802(R,m,v=1,1,0)
  7137. =>WM: (13420: S1 ^operator O1906)
  7138. 953: O: O1906 (predict-no)
  7139. --- END Decision Phase ---
  7140. --- Application Phase ---
  7141. --- Firing Productions (PE) For State At Depth 1 ---
  7142. --- Inner Elaboration Phase, active level 1 (S1) ---
  7143. Firing apply*operator
  7144. -->
  7145. (I3 ^predict-no N953 + :O )
  7146. Firing apply*operator*complete
  7147. -->
  7148. (I3 ^predict-yes N952 - :O )
  7149. inner elaboration loop at bottom goal.
  7150. --- Change Working Memory (PE) ---
  7151. =>WM: (13421: I3 ^predict-no N953)
  7152. <=WM: (13407: N952 ^status complete)
  7153. <=WM: (13406: I3 ^predict-yes N952)
  7154. --- Firing Productions (IE) For State At Depth 1 ---
  7155. --- Inner Elaboration Phase, active level 1 (S1) ---
  7156. Firing monitor*world
  7157. -->
  7158. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7159. --- Change Working Memory (IE) ---
  7160. --- END Application Phase ---
  7161. --- Output Phase ---
  7162. ENV: Agent did: predict-no for direction U in state State-A
  7163. In State-A moving U
  7164. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7165. predict error 0
  7166. dir: dir isR
  7167. --- END Output Phase ---
  7168. |\---- Input Phase ---
  7169. =>WM: (13425: I2 ^dir R)
  7170. =>WM: (13424: I2 ^reward 1)
  7171. =>WM: (13423: I2 ^see 0)
  7172. =>WM: (13422: N953 ^status complete)
  7173. <=WM: (13410: I2 ^dir U)
  7174. <=WM: (13409: I2 ^reward 1)
  7175. <=WM: (13408: I2 ^see 1)
  7176. =>WM: (13426: I2 ^level-1 L1-root)
  7177. <=WM: (13411: I2 ^level-1 L1-root)
  7178. --- END Input Phase ---
  7179. --- Proposal Phase ---
  7180. --- Inner Elaboration Phase, active level 1 (S1) ---
  7181. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7182. -->
  7183. (S1 ^operator O1906 = -0.2714224023553999)
  7184. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7185. -->
  7186. (S1 ^operator O1905 = 0.6621942993402632)
  7187. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7188. -->
  7189. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7190. -->
  7191. Firing elaborate*copy-see-to-output-link
  7192. -->
  7193. (I3 ^see 0 +)
  7194. Firing elaborate*reward*based*on*reward
  7195. -->
  7196. (R957 ^value 1 +)
  7197. (R1 ^reward R957 +)
  7198. Firing propose*predict-yes
  7199. -->
  7200. (O1907 ^name predict-yes +)
  7201. (S1 ^operator O1907 +)
  7202. Firing propose*predict-no
  7203. -->
  7204. (O1908 ^name predict-no +)
  7205. (S1 ^operator O1908 +)
  7206. Firing rl*prefer*rvt*predict-no*H0*4
  7207. -->
  7208. (S1 ^operator O1906 = 0.3397650583271044)
  7209. Firing rl*prefer*rvt*predict-yes*H0*3
  7210. -->
  7211. (S1 ^operator O1905 = 0.3377110766337923)
  7212. Firing prefer*rvt*predict-yes*H0
  7213. -->
  7214. Firing prefer*rvt*predict-no*H0
  7215. -->
  7216. Firing elaborate*copy-dir-to-output-link
  7217. -->
  7218. (I3 ^dir R +)
  7219. inner elaboration loop at bottom goal.
  7220. Retracting elaborate*copy-see-to-output-link
  7221. -->
  7222. (I3 ^see 1 +)
  7223. Retracting propose*predict-no
  7224. -->
  7225. (O1906 ^name predict-no +)
  7226. (S1 ^operator O1906 +)
  7227. Retracting propose*predict-yes
  7228. -->
  7229. (O1905 ^name predict-yes +)
  7230. (S1 ^operator O1905 +)
  7231. Retracting elaborate*reward*based*on*reward
  7232. -->
  7233. (R956 ^value 1 +)
  7234. (R1 ^reward R956 +)
  7235. Retracting elaborate*copy-dir-to-output-link
  7236. -->
  7237. (I3 ^dir U +)
  7238. Retracting rl*prefer*rvt*predict-no*H0*2
  7239. -->
  7240. (S1 ^operator O1906 = 1.)
  7241. Retracting rl*prefer*rvt*predict-yes*H0*1
  7242. -->
  7243. (S1 ^operator O1905 = 0.)
  7244. =>WM: (13434: S1 ^operator O1908 +)
  7245. =>WM: (13433: S1 ^operator O1907 +)
  7246. =>WM: (13432: I3 ^dir R)
  7247. =>WM: (13431: O1908 ^name predict-no)
  7248. =>WM: (13430: O1907 ^name predict-yes)
  7249. =>WM: (13429: R957 ^value 1)
  7250. =>WM: (13428: R1 ^reward R957)
  7251. =>WM: (13427: I3 ^see 0)
  7252. <=WM: (13418: S1 ^operator O1905 +)
  7253. <=WM: (13419: S1 ^operator O1906 +)
  7254. <=WM: (13420: S1 ^operator O1906)
  7255. <=WM: (13417: I3 ^dir U)
  7256. <=WM: (13413: R1 ^reward R956)
  7257. <=WM: (13412: I3 ^see 1)
  7258. <=WM: (13416: O1906 ^name predict-no)
  7259. <=WM: (13415: O1905 ^name predict-yes)
  7260. <=WM: (13414: R956 ^value 1)
  7261. --- Inner Elaboration Phase, active level 1 (S1) ---
  7262. Firing prefer*rvt*predict-yes*H0
  7263. -->
  7264. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7265. -->
  7266. (S1 ^operator O1907 = 0.6621942993402632)
  7267. Firing rl*prefer*rvt*predict-yes*H0*3
  7268. -->
  7269. (S1 ^operator O1907 = 0.3377110766337923)
  7270. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7271. -->
  7272. Firing prefer*rvt*predict-no*H0
  7273. -->
  7274. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7275. -->
  7276. (S1 ^operator O1908 = -0.2714224023553999)
  7277. Firing rl*prefer*rvt*predict-no*H0*4
  7278. -->
  7279. (S1 ^operator O1908 = 0.3397650583271044)
  7280. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7281. -->
  7282. inner elaboration loop at bottom goal.
  7283. Retracting rl*prefer*rvt*predict-no*H0*4
  7284. -->
  7285. (S1 ^operator O1906 = 0.3397650583271044)
  7286. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7287. -->
  7288. (S1 ^operator O1906 = -0.2714224023553999)
  7289. Retracting rl*prefer*rvt*predict-yes*H0*3
  7290. -->
  7291. (S1 ^operator O1905 = 0.3377110766337923)
  7292. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7293. -->
  7294. (S1 ^operator O1905 = 0.6621942993402632)
  7295. --- END Proposal Phase ---
  7296. --- Decision Phase ---
  7297. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7298. =>WM: (13435: S1 ^operator O1907)
  7299. 954: O: O1907 (predict-yes)
  7300. --- END Decision Phase ---
  7301. --- Application Phase ---
  7302. --- Firing Productions (PE) For State At Depth 1 ---
  7303. --- Inner Elaboration Phase, active level 1 (S1) ---
  7304. Firing apply*operator
  7305. -->
  7306. (I3 ^predict-yes N954 + :O )
  7307. Firing apply*operator*complete
  7308. -->
  7309. (I3 ^predict-no N953 - :O )
  7310. inner elaboration loop at bottom goal.
  7311. --- Change Working Memory (PE) ---
  7312. =>WM: (13436: I3 ^predict-yes N954)
  7313. <=WM: (13422: N953 ^status complete)
  7314. <=WM: (13421: I3 ^predict-no N953)
  7315. --- Firing Productions (IE) For State At Depth 1 ---
  7316. --- Inner Elaboration Phase, active level 1 (S1) ---
  7317. Firing monitor*world
  7318. -->
  7319. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7320. --- Change Working Memory (IE) ---
  7321. --- END Application Phase ---
  7322. --- Output Phase ---
  7323. ENV: Agent did: predict-yes for direction R in state State-A
  7324. In State-A moving R
  7325. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7326. predict error 0
  7327. dir: dir isU
  7328. --- END Output Phase ---
  7329. /|\---- Input Phase ---
  7330. =>WM: (13440: I2 ^dir U)
  7331. =>WM: (13439: I2 ^reward 1)
  7332. =>WM: (13438: I2 ^see 1)
  7333. =>WM: (13437: N954 ^status complete)
  7334. <=WM: (13425: I2 ^dir R)
  7335. <=WM: (13424: I2 ^reward 1)
  7336. <=WM: (13423: I2 ^see 0)
  7337. =>WM: (13441: I2 ^level-1 R1-root)
  7338. <=WM: (13426: I2 ^level-1 L1-root)
  7339. --- END Input Phase ---
  7340. --- Proposal Phase ---
  7341. --- Inner Elaboration Phase, active level 1 (S1) ---
  7342. Firing elaborate*copy-see-to-output-link
  7343. -->
  7344. (I3 ^see 1 +)
  7345. Firing elaborate*reward*based*on*reward
  7346. -->
  7347. (R958 ^value 1 +)
  7348. (R1 ^reward R958 +)
  7349. Firing propose*predict-yes
  7350. -->
  7351. (O1909 ^name predict-yes +)
  7352. (S1 ^operator O1909 +)
  7353. Firing propose*predict-no
  7354. -->
  7355. (O1910 ^name predict-no +)
  7356. (S1 ^operator O1910 +)
  7357. Firing rl*prefer*rvt*predict-no*H0*2
  7358. -->
  7359. (S1 ^operator O1908 = 1.)
  7360. Firing rl*prefer*rvt*predict-yes*H0*1
  7361. -->
  7362. (S1 ^operator O1907 = 0.)
  7363. Firing prefer*rvt*predict-yes*H0
  7364. -->
  7365. Firing prefer*rvt*predict-no*H0
  7366. -->
  7367. Firing elaborate*copy-dir-to-output-link
  7368. -->
  7369. (I3 ^dir U +)
  7370. inner elaboration loop at bottom goal.
  7371. Retracting elaborate*copy-see-to-output-link
  7372. -->
  7373. (I3 ^see 0 +)
  7374. Retracting propose*predict-no
  7375. -->
  7376. (O1908 ^name predict-no +)
  7377. (S1 ^operator O1908 +)
  7378. Retracting propose*predict-yes
  7379. -->
  7380. (O1907 ^name predict-yes +)
  7381. (S1 ^operator O1907 +)
  7382. Retracting elaborate*reward*based*on*reward
  7383. -->
  7384. (R957 ^value 1 +)
  7385. (R1 ^reward R957 +)
  7386. Retracting elaborate*copy-dir-to-output-link
  7387. -->
  7388. (I3 ^dir R +)
  7389. Retracting rl*prefer*rvt*predict-no*H0*4
  7390. -->
  7391. (S1 ^operator O1908 = 0.3397650583271044)
  7392. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  7393. -->
  7394. (S1 ^operator O1908 = -0.2714224023553999)
  7395. Retracting rl*prefer*rvt*predict-yes*H0*3
  7396. -->
  7397. (S1 ^operator O1907 = 0.3377110766337923)
  7398. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  7399. -->
  7400. (S1 ^operator O1907 = 0.6621942993402632)
  7401. =>WM: (13449: S1 ^operator O1910 +)
  7402. =>WM: (13448: S1 ^operator O1909 +)
  7403. =>WM: (13447: I3 ^dir U)
  7404. =>WM: (13446: O1910 ^name predict-no)
  7405. =>WM: (13445: O1909 ^name predict-yes)
  7406. =>WM: (13444: R958 ^value 1)
  7407. =>WM: (13443: R1 ^reward R958)
  7408. =>WM: (13442: I3 ^see 1)
  7409. <=WM: (13433: S1 ^operator O1907 +)
  7410. <=WM: (13435: S1 ^operator O1907)
  7411. <=WM: (13434: S1 ^operator O1908 +)
  7412. <=WM: (13432: I3 ^dir R)
  7413. <=WM: (13428: R1 ^reward R957)
  7414. <=WM: (13427: I3 ^see 0)
  7415. <=WM: (13431: O1908 ^name predict-no)
  7416. <=WM: (13430: O1907 ^name predict-yes)
  7417. <=WM: (13429: R957 ^value 1)
  7418. --- Inner Elaboration Phase, active level 1 (S1) ---
  7419. Firing prefer*rvt*predict-yes*H0
  7420. -->
  7421. Firing rl*prefer*rvt*predict-yes*H0*1
  7422. -->
  7423. (S1 ^operator O1909 = 0.)
  7424. Firing prefer*rvt*predict-no*H0
  7425. -->
  7426. Firing rl*prefer*rvt*predict-no*H0*2
  7427. -->
  7428. (S1 ^operator O1910 = 1.)
  7429. inner elaboration loop at bottom goal.
  7430. Retracting rl*prefer*rvt*predict-no*H0*2
  7431. -->
  7432. (S1 ^operator O1908 = 1.)
  7433. Retracting rl*prefer*rvt*predict-yes*H0*1
  7434. -->
  7435. (S1 ^operator O1907 = 0.)
  7436. --- END Proposal Phase ---
  7437. --- Decision Phase ---
  7438. RL update rl*prefer*rvt*predict-yes*H0*3 0.590111 -0.2524 0.337711 -> 0.59012 -0.252401 0.337719(R,m,v=1,0.89441,0.0950311)
  7439. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.40978 0.252415 0.662194 -> 0.40979 0.252413 0.662203(R,m,v=1,1,0)
  7440. =>WM: (13450: S1 ^operator O1910)
  7441. 955: O: O1910 (predict-no)
  7442. --- END Decision Phase ---
  7443. --- Application Phase ---
  7444. --- Firing Productions (PE) For State At Depth 1 ---
  7445. --- Inner Elaboration Phase, active level 1 (S1) ---
  7446. Firing apply*operator
  7447. -->
  7448. (I3 ^predict-no N955 + :O )
  7449. Firing apply*operator*complete
  7450. -->
  7451. (I3 ^predict-yes N954 - :O )
  7452. inner elaboration loop at bottom goal.
  7453. --- Change Working Memory (PE) ---
  7454. =>WM: (13451: I3 ^predict-no N955)
  7455. <=WM: (13437: N954 ^status complete)
  7456. <=WM: (13436: I3 ^predict-yes N954)
  7457. --- Firing Productions (IE) For State At Depth 1 ---
  7458. --- Inner Elaboration Phase, active level 1 (S1) ---
  7459. Firing monitor*world
  7460. -->
  7461. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7462. --- Change Working Memory (IE) ---
  7463. --- END Application Phase ---
  7464. --- Output Phase ---
  7465. ENV: Agent did: predict-no for direction U in state State-B
  7466. In State-B moving U
  7467. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7468. predict error 0
  7469. dir: dir isR
  7470. --- END Output Phase ---
  7471. /|\--- Input Phase ---
  7472. =>WM: (13455: I2 ^dir R)
  7473. =>WM: (13454: I2 ^reward 1)
  7474. =>WM: (13453: I2 ^see 0)
  7475. =>WM: (13452: N955 ^status complete)
  7476. <=WM: (13440: I2 ^dir U)
  7477. <=WM: (13439: I2 ^reward 1)
  7478. <=WM: (13438: I2 ^see 1)
  7479. =>WM: (13456: I2 ^level-1 R1-root)
  7480. <=WM: (13441: I2 ^level-1 R1-root)
  7481. --- END Input Phase ---
  7482. --- Proposal Phase ---
  7483. --- Inner Elaboration Phase, active level 1 (S1) ---
  7484. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7485. -->
  7486. (S1 ^operator O1909 = -0.1070236389116304)
  7487. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7488. -->
  7489. (S1 ^operator O1910 = 0.6602503199844459)
  7490. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7491. -->
  7492. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7493. -->
  7494. Firing elaborate*copy-see-to-output-link
  7495. -->
  7496. (I3 ^see 0 +)
  7497. Firing elaborate*reward*based*on*reward
  7498. -->
  7499. (R959 ^value 1 +)
  7500. (R1 ^reward R959 +)
  7501. Firing propose*predict-yes
  7502. -->
  7503. (O1911 ^name predict-yes +)
  7504. (S1 ^operator O1911 +)
  7505. Firing propose*predict-no
  7506. -->
  7507. (O1912 ^name predict-no +)
  7508. (S1 ^operator O1912 +)
  7509. Firing rl*prefer*rvt*predict-no*H0*4
  7510. -->
  7511. (S1 ^operator O1910 = 0.3397650583271044)
  7512. Firing rl*prefer*rvt*predict-yes*H0*3
  7513. -->
  7514. (S1 ^operator O1909 = 0.3377188564178903)
  7515. Firing prefer*rvt*predict-yes*H0
  7516. -->
  7517. Firing prefer*rvt*predict-no*H0
  7518. -->
  7519. Firing elaborate*copy-dir-to-output-link
  7520. -->
  7521. (I3 ^dir R +)
  7522. inner elaboration loop at bottom goal.
  7523. Retracting elaborate*copy-see-to-output-link
  7524. -->
  7525. (I3 ^see 1 +)
  7526. Retracting propose*predict-no
  7527. -->
  7528. (O1910 ^name predict-no +)
  7529. (S1 ^operator O1910 +)
  7530. Retracting propose*predict-yes
  7531. -->
  7532. (O1909 ^name predict-yes +)
  7533. (S1 ^operator O1909 +)
  7534. Retracting elaborate*reward*based*on*reward
  7535. -->
  7536. (R958 ^value 1 +)
  7537. (R1 ^reward R958 +)
  7538. Retracting elaborate*copy-dir-to-output-link
  7539. -->
  7540. (I3 ^dir U +)
  7541. Retracting rl*prefer*rvt*predict-no*H0*2
  7542. -->
  7543. (S1 ^operator O1910 = 1.)
  7544. Retracting rl*prefer*rvt*predict-yes*H0*1
  7545. -->
  7546. (S1 ^operator O1909 = 0.)
  7547. =>WM: (13464: S1 ^operator O1912 +)
  7548. =>WM: (13463: S1 ^operator O1911 +)
  7549. =>WM: (13462: I3 ^dir R)
  7550. =>WM: (13461: O1912 ^name predict-no)
  7551. =>WM: (13460: O1911 ^name predict-yes)
  7552. =>WM: (13459: R959 ^value 1)
  7553. =>WM: (13458: R1 ^reward R959)
  7554. =>WM: (13457: I3 ^see 0)
  7555. <=WM: (13448: S1 ^operator O1909 +)
  7556. <=WM: (13449: S1 ^operator O1910 +)
  7557. <=WM: (13450: S1 ^operator O1910)
  7558. <=WM: (13447: I3 ^dir U)
  7559. <=WM: (13443: R1 ^reward R958)
  7560. <=WM: (13442: I3 ^see 1)
  7561. <=WM: (13446: O1910 ^name predict-no)
  7562. <=WM: (13445: O1909 ^name predict-yes)
  7563. <=WM: (13444: R958 ^value 1)
  7564. --- Inner Elaboration Phase, active level 1 (S1) ---
  7565. Firing prefer*rvt*predict-yes*H0
  7566. -->
  7567. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7568. -->
  7569. (S1 ^operator O1911 = -0.1070236389116304)
  7570. Firing rl*prefer*rvt*predict-yes*H0*3
  7571. -->
  7572. (S1 ^operator O1911 = 0.3377188564178903)
  7573. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7574. -->
  7575. Firing prefer*rvt*predict-no*H0
  7576. -->
  7577. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7578. -->
  7579. (S1 ^operator O1912 = 0.6602503199844459)
  7580. Firing rl*prefer*rvt*predict-no*H0*4
  7581. -->
  7582. (S1 ^operator O1912 = 0.3397650583271044)
  7583. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7584. -->
  7585. inner elaboration loop at bottom goal.
  7586. Retracting rl*prefer*rvt*predict-no*H0*4
  7587. -->
  7588. (S1 ^operator O1910 = 0.3397650583271044)
  7589. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7590. -->
  7591. (S1 ^operator O1910 = 0.6602503199844459)
  7592. Retracting rl*prefer*rvt*predict-yes*H0*3
  7593. -->
  7594. (S1 ^operator O1909 = 0.3377188564178903)
  7595. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7596. -->
  7597. (S1 ^operator O1909 = -0.1070236389116304)
  7598. --- END Proposal Phase ---
  7599. --- Decision Phase ---
  7600. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7601. =>WM: (13465: S1 ^operator O1912)
  7602. 956: O: O1912 (predict-no)
  7603. --- END Decision Phase ---
  7604. --- Application Phase ---
  7605. --- Firing Productions (PE) For State At Depth 1 ---
  7606. --- Inner Elaboration Phase, active level 1 (S1) ---
  7607. Firing apply*operator
  7608. -->
  7609. (I3 ^predict-no N956 + :O )
  7610. Firing apply*operator*complete
  7611. -->
  7612. (I3 ^predict-no N955 - :O )
  7613. inner elaboration loop at bottom goal.
  7614. --- Change Working Memory (PE) ---
  7615. =>WM: (13466: I3 ^predict-no N956)
  7616. <=WM: (13452: N955 ^status complete)
  7617. <=WM: (13451: I3 ^predict-no N955)
  7618. --- Firing Productions (IE) For State At Depth 1 ---
  7619. --- Inner Elaboration Phase, active level 1 (S1) ---
  7620. Firing monitor*world
  7621. -->
  7622. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7623. --- Change Working Memory (IE) ---
  7624. --- END Application Phase ---
  7625. --- Output Phase ---
  7626. ENV: Agent did: predict-no for direction R in state State-B
  7627. In State-B moving R
  7628. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7629. predict error 0
  7630. dir: dir isR
  7631. --- END Output Phase ---
  7632. -/--- Input Phase ---
  7633. =>WM: (13470: I2 ^dir R)
  7634. =>WM: (13469: I2 ^reward 1)
  7635. =>WM: (13468: I2 ^see 0)
  7636. =>WM: (13467: N956 ^status complete)
  7637. <=WM: (13455: I2 ^dir R)
  7638. <=WM: (13454: I2 ^reward 1)
  7639. <=WM: (13453: I2 ^see 0)
  7640. =>WM: (13471: I2 ^level-1 R0-root)
  7641. <=WM: (13456: I2 ^level-1 R1-root)
  7642. --- END Input Phase ---
  7643. --- Proposal Phase ---
  7644. --- Inner Elaboration Phase, active level 1 (S1) ---
  7645. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7646. -->
  7647. (S1 ^operator O1912 = 0.6601435952544124)
  7648. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7649. -->
  7650. (S1 ^operator O1911 = -0.1028953566115423)
  7651. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7652. -->
  7653. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7654. -->
  7655. Firing elaborate*copy-see-to-output-link
  7656. -->
  7657. (I3 ^see 0 +)
  7658. Firing elaborate*reward*based*on*reward
  7659. -->
  7660. (R960 ^value 1 +)
  7661. (R1 ^reward R960 +)
  7662. Firing propose*predict-yes
  7663. -->
  7664. (O1913 ^name predict-yes +)
  7665. (S1 ^operator O1913 +)
  7666. Firing propose*predict-no
  7667. -->
  7668. (O1914 ^name predict-no +)
  7669. (S1 ^operator O1914 +)
  7670. Firing rl*prefer*rvt*predict-no*H0*4
  7671. -->
  7672. (S1 ^operator O1912 = 0.3397650583271044)
  7673. Firing rl*prefer*rvt*predict-yes*H0*3
  7674. -->
  7675. (S1 ^operator O1911 = 0.3377188564178903)
  7676. Firing prefer*rvt*predict-yes*H0
  7677. -->
  7678. Firing prefer*rvt*predict-no*H0
  7679. -->
  7680. Firing elaborate*copy-dir-to-output-link
  7681. -->
  7682. (I3 ^dir R +)
  7683. inner elaboration loop at bottom goal.
  7684. Retracting elaborate*copy-see-to-output-link
  7685. -->
  7686. (I3 ^see 0 +)
  7687. Retracting propose*predict-no
  7688. -->
  7689. (O1912 ^name predict-no +)
  7690. (S1 ^operator O1912 +)
  7691. Retracting propose*predict-yes
  7692. -->
  7693. (O1911 ^name predict-yes +)
  7694. (S1 ^operator O1911 +)
  7695. Retracting elaborate*reward*based*on*reward
  7696. -->
  7697. (R959 ^value 1 +)
  7698. (R1 ^reward R959 +)
  7699. Retracting elaborate*copy-dir-to-output-link
  7700. -->
  7701. (I3 ^dir R +)
  7702. Retracting rl*prefer*rvt*predict-no*H0*4
  7703. -->
  7704. (S1 ^operator O1912 = 0.3397650583271044)
  7705. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  7706. -->
  7707. (S1 ^operator O1912 = 0.6602503199844459)
  7708. Retracting rl*prefer*rvt*predict-yes*H0*3
  7709. -->
  7710. (S1 ^operator O1911 = 0.3377188564178903)
  7711. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  7712. -->
  7713. (S1 ^operator O1911 = -0.1070236389116304)
  7714. =>WM: (13477: S1 ^operator O1914 +)
  7715. =>WM: (13476: S1 ^operator O1913 +)
  7716. =>WM: (13475: O1914 ^name predict-no)
  7717. =>WM: (13474: O1913 ^name predict-yes)
  7718. =>WM: (13473: R960 ^value 1)
  7719. =>WM: (13472: R1 ^reward R960)
  7720. <=WM: (13463: S1 ^operator O1911 +)
  7721. <=WM: (13464: S1 ^operator O1912 +)
  7722. <=WM: (13465: S1 ^operator O1912)
  7723. <=WM: (13458: R1 ^reward R959)
  7724. <=WM: (13461: O1912 ^name predict-no)
  7725. <=WM: (13460: O1911 ^name predict-yes)
  7726. <=WM: (13459: R959 ^value 1)
  7727. --- Inner Elaboration Phase, active level 1 (S1) ---
  7728. Firing prefer*rvt*predict-yes*H0
  7729. -->
  7730. Firing rl*prefer*rvt*predict-yes*H0*3
  7731. -->
  7732. (S1 ^operator O1913 = 0.3377188564178903)
  7733. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  7734. -->
  7735. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7736. -->
  7737. (S1 ^operator O1913 = -0.1028953566115423)
  7738. Firing prefer*rvt*predict-no*H0
  7739. -->
  7740. Firing rl*prefer*rvt*predict-no*H0*4
  7741. -->
  7742. (S1 ^operator O1914 = 0.3397650583271044)
  7743. Firing prefer*rvt*predict-no*H0*4*v1*H1
  7744. -->
  7745. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7746. -->
  7747. (S1 ^operator O1914 = 0.6601435952544124)
  7748. inner elaboration loop at bottom goal.
  7749. Retracting rl*prefer*rvt*predict-no*H0*4
  7750. -->
  7751. (S1 ^operator O1912 = 0.3397650583271044)
  7752. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7753. -->
  7754. (S1 ^operator O1912 = 0.6601435952544124)
  7755. Retracting rl*prefer*rvt*predict-yes*H0*3
  7756. -->
  7757. (S1 ^operator O1911 = 0.3377188564178903)
  7758. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7759. -->
  7760. (S1 ^operator O1911 = -0.1028953566115423)
  7761. --- END Proposal Phase ---
  7762. --- Decision Phase ---
  7763. RL update rl*prefer*rvt*predict-no*H0*4 0.570248 -0.230483 0.339765 -> 0.570247 -0.230483 0.339764(R,m,v=1,0.871166,0.112929)
  7764. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429768 0.230482 0.66025 -> 0.429766 0.230483 0.660249(R,m,v=1,1,0)
  7765. =>WM: (13478: S1 ^operator O1914)
  7766. 957: O: O1914 (predict-no)
  7767. --- END Decision Phase ---
  7768. --- Application Phase ---
  7769. --- Firing Productions (PE) For State At Depth 1 ---
  7770. --- Inner Elaboration Phase, active level 1 (S1) ---
  7771. Firing apply*operator
  7772. -->
  7773. (I3 ^predict-no N957 + :O )
  7774. Firing apply*operator*complete
  7775. -->
  7776. (I3 ^predict-no N956 - :O )
  7777. inner elaboration loop at bottom goal.
  7778. --- Change Working Memory (PE) ---
  7779. =>WM: (13479: I3 ^predict-no N957)
  7780. <=WM: (13467: N956 ^status complete)
  7781. <=WM: (13466: I3 ^predict-no N956)
  7782. --- Firing Productions (IE) For State At Depth 1 ---
  7783. --- Inner Elaboration Phase, active level 1 (S1) ---
  7784. Firing monitor*world
  7785. -->
  7786. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7787. --- Change Working Memory (IE) ---
  7788. --- END Application Phase ---
  7789. --- Output Phase ---
  7790. ENV: Agent did: predict-no for direction R in state State-B
  7791. In State-B moving R
  7792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7793. predict error 0
  7794. dir: dir isL
  7795. --- END Output Phase ---
  7796. |\---- Input Phase ---
  7797. =>WM: (13483: I2 ^dir L)
  7798. =>WM: (13482: I2 ^reward 1)
  7799. =>WM: (13481: I2 ^see 0)
  7800. =>WM: (13480: N957 ^status complete)
  7801. <=WM: (13470: I2 ^dir R)
  7802. <=WM: (13469: I2 ^reward 1)
  7803. <=WM: (13468: I2 ^see 0)
  7804. =>WM: (13484: I2 ^level-1 R0-root)
  7805. <=WM: (13471: I2 ^level-1 R0-root)
  7806. --- END Input Phase ---
  7807. --- Proposal Phase ---
  7808. --- Inner Elaboration Phase, active level 1 (S1) ---
  7809. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7810. -->
  7811. (S1 ^operator O1913 = 0.7358024669452599)
  7812. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7813. -->
  7814. Firing elaborate*copy-see-to-output-link
  7815. -->
  7816. (I3 ^see 0 +)
  7817. Firing elaborate*reward*based*on*reward
  7818. -->
  7819. (R961 ^value 1 +)
  7820. (R1 ^reward R961 +)
  7821. Firing propose*predict-yes
  7822. -->
  7823. (O1915 ^name predict-yes +)
  7824. (S1 ^operator O1915 +)
  7825. Firing propose*predict-no
  7826. -->
  7827. (O1916 ^name predict-no +)
  7828. (S1 ^operator O1916 +)
  7829. Firing rl*prefer*rvt*predict-no*H0*6
  7830. -->
  7831. (S1 ^operator O1914 = 0.9996367744406318)
  7832. Firing rl*prefer*rvt*predict-yes*H0*5
  7833. -->
  7834. (S1 ^operator O1913 = 0.2640663414827097)
  7835. Firing prefer*rvt*predict-yes*H0
  7836. -->
  7837. Firing prefer*rvt*predict-no*H0
  7838. -->
  7839. Firing elaborate*copy-dir-to-output-link
  7840. -->
  7841. (I3 ^dir L +)
  7842. inner elaboration loop at bottom goal.
  7843. Retracting elaborate*copy-see-to-output-link
  7844. -->
  7845. (I3 ^see 0 +)
  7846. Retracting propose*predict-no
  7847. -->
  7848. (O1914 ^name predict-no +)
  7849. (S1 ^operator O1914 +)
  7850. Retracting propose*predict-yes
  7851. -->
  7852. (O1913 ^name predict-yes +)
  7853. (S1 ^operator O1913 +)
  7854. Retracting elaborate*reward*based*on*reward
  7855. -->
  7856. (R960 ^value 1 +)
  7857. (R1 ^reward R960 +)
  7858. Retracting elaborate*copy-dir-to-output-link
  7859. -->
  7860. (I3 ^dir R +)
  7861. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  7862. -->
  7863. (S1 ^operator O1914 = 0.6601435952544124)
  7864. Retracting rl*prefer*rvt*predict-no*H0*4
  7865. -->
  7866. (S1 ^operator O1914 = 0.3397637965169674)
  7867. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  7868. -->
  7869. (S1 ^operator O1913 = -0.1028953566115423)
  7870. Retracting rl*prefer*rvt*predict-yes*H0*3
  7871. -->
  7872. (S1 ^operator O1913 = 0.3377188564178903)
  7873. =>WM: (13491: S1 ^operator O1916 +)
  7874. =>WM: (13490: S1 ^operator O1915 +)
  7875. =>WM: (13489: I3 ^dir L)
  7876. =>WM: (13488: O1916 ^name predict-no)
  7877. =>WM: (13487: O1915 ^name predict-yes)
  7878. =>WM: (13486: R961 ^value 1)
  7879. =>WM: (13485: R1 ^reward R961)
  7880. <=WM: (13476: S1 ^operator O1913 +)
  7881. <=WM: (13477: S1 ^operator O1914 +)
  7882. <=WM: (13478: S1 ^operator O1914)
  7883. <=WM: (13462: I3 ^dir R)
  7884. <=WM: (13472: R1 ^reward R960)
  7885. <=WM: (13475: O1914 ^name predict-no)
  7886. <=WM: (13474: O1913 ^name predict-yes)
  7887. <=WM: (13473: R960 ^value 1)
  7888. --- Inner Elaboration Phase, active level 1 (S1) ---
  7889. Firing prefer*rvt*predict-yes*H0
  7890. -->
  7891. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7892. -->
  7893. (S1 ^operator O1915 = 0.7358024669452599)
  7894. Firing rl*prefer*rvt*predict-yes*H0*5
  7895. -->
  7896. (S1 ^operator O1915 = 0.2640663414827097)
  7897. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  7898. -->
  7899. Firing prefer*rvt*predict-no*H0
  7900. -->
  7901. Firing rl*prefer*rvt*predict-no*H0*6
  7902. -->
  7903. (S1 ^operator O1916 = 0.9996367744406318)
  7904. inner elaboration loop at bottom goal.
  7905. Retracting rl*prefer*rvt*predict-no*H0*6
  7906. -->
  7907. (S1 ^operator O1914 = 0.9996367744406318)
  7908. Retracting rl*prefer*rvt*predict-yes*H0*5
  7909. -->
  7910. (S1 ^operator O1913 = 0.2640663414827097)
  7911. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  7912. -->
  7913. (S1 ^operator O1913 = 0.7358024669452599)
  7914. --- END Proposal Phase ---
  7915. --- Decision Phase ---
  7916. RL update rl*prefer*rvt*predict-no*H0*4 0.570247 -0.230483 0.339764 -> 0.570255 -0.230484 0.339771(R,m,v=1,0.871951,0.112337)
  7917. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429656 0.230488 0.660144 -> 0.429665 0.230487 0.660152(R,m,v=1,1,0)
  7918. =>WM: (13492: S1 ^operator O1915)
  7919. 958: O: O1915 (predict-yes)
  7920. --- END Decision Phase ---
  7921. --- Application Phase ---
  7922. --- Firing Productions (PE) For State At Depth 1 ---
  7923. --- Inner Elaboration Phase, active level 1 (S1) ---
  7924. Firing apply*operator
  7925. -->
  7926. (I3 ^predict-yes N958 + :O )
  7927. Firing apply*operator*complete
  7928. -->
  7929. (I3 ^predict-no N957 - :O )
  7930. inner elaboration loop at bottom goal.
  7931. --- Change Working Memory (PE) ---
  7932. =>WM: (13493: I3 ^predict-yes N958)
  7933. <=WM: (13480: N957 ^status complete)
  7934. <=WM: (13479: I3 ^predict-no N957)
  7935. --- Firing Productions (IE) For State At Depth 1 ---
  7936. --- Inner Elaboration Phase, active level 1 (S1) ---
  7937. Firing monitor*world
  7938. -->
  7939. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7940. --- Change Working Memory (IE) ---
  7941. --- END Application Phase ---
  7942. --- Output Phase ---
  7943. ENV: Agent did: predict-yes for direction L in state State-B
  7944. In State-B moving L
  7945. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7946. predict error 0
  7947. dir: dir isU
  7948. --- END Output Phase ---
  7949. /|\--- Input Phase ---
  7950. =>WM: (13497: I2 ^dir U)
  7951. =>WM: (13496: I2 ^reward 1)
  7952. =>WM: (13495: I2 ^see 1)
  7953. =>WM: (13494: N958 ^status complete)
  7954. <=WM: (13483: I2 ^dir L)
  7955. <=WM: (13482: I2 ^reward 1)
  7956. <=WM: (13481: I2 ^see 0)
  7957. =>WM: (13498: I2 ^level-1 L1-root)
  7958. <=WM: (13484: I2 ^level-1 R0-root)
  7959. --- END Input Phase ---
  7960. --- Proposal Phase ---
  7961. --- Inner Elaboration Phase, active level 1 (S1) ---
  7962. Firing elaborate*copy-see-to-output-link
  7963. -->
  7964. (I3 ^see 1 +)
  7965. Firing elaborate*reward*based*on*reward
  7966. -->
  7967. (R962 ^value 1 +)
  7968. (R1 ^reward R962 +)
  7969. Firing propose*predict-yes
  7970. -->
  7971. (O1917 ^name predict-yes +)
  7972. (S1 ^operator O1917 +)
  7973. Firing propose*predict-no
  7974. -->
  7975. (O1918 ^name predict-no +)
  7976. (S1 ^operator O1918 +)
  7977. Firing rl*prefer*rvt*predict-no*H0*2
  7978. -->
  7979. (S1 ^operator O1916 = 1.)
  7980. Firing rl*prefer*rvt*predict-yes*H0*1
  7981. -->
  7982. (S1 ^operator O1915 = 0.)
  7983. Firing prefer*rvt*predict-yes*H0
  7984. -->
  7985. Firing prefer*rvt*predict-no*H0
  7986. -->
  7987. Firing elaborate*copy-dir-to-output-link
  7988. -->
  7989. (I3 ^dir U +)
  7990. inner elaboration loop at bottom goal.
  7991. Retracting elaborate*copy-see-to-output-link
  7992. -->
  7993. (I3 ^see 0 +)
  7994. Retracting propose*predict-no
  7995. -->
  7996. (O1916 ^name predict-no +)
  7997. (S1 ^operator O1916 +)
  7998. Retracting propose*predict-yes
  7999. -->
  8000. (O1915 ^name predict-yes +)
  8001. (S1 ^operator O1915 +)
  8002. Retracting elaborate*reward*based*on*reward
  8003. -->
  8004. (R961 ^value 1 +)
  8005. (R1 ^reward R961 +)
  8006. Retracting elaborate*copy-dir-to-output-link
  8007. -->
  8008. (I3 ^dir L +)
  8009. Retracting rl*prefer*rvt*predict-no*H0*6
  8010. -->
  8011. (S1 ^operator O1916 = 0.9996367744406318)
  8012. Retracting rl*prefer*rvt*predict-yes*H0*5
  8013. -->
  8014. (S1 ^operator O1915 = 0.2640663414827097)
  8015. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  8016. -->
  8017. (S1 ^operator O1915 = 0.7358024669452599)
  8018. =>WM: (13506: S1 ^operator O1918 +)
  8019. =>WM: (13505: S1 ^operator O1917 +)
  8020. =>WM: (13504: I3 ^dir U)
  8021. =>WM: (13503: O1918 ^name predict-no)
  8022. =>WM: (13502: O1917 ^name predict-yes)
  8023. =>WM: (13501: R962 ^value 1)
  8024. =>WM: (13500: R1 ^reward R962)
  8025. =>WM: (13499: I3 ^see 1)
  8026. <=WM: (13490: S1 ^operator O1915 +)
  8027. <=WM: (13492: S1 ^operator O1915)
  8028. <=WM: (13491: S1 ^operator O1916 +)
  8029. <=WM: (13489: I3 ^dir L)
  8030. <=WM: (13485: R1 ^reward R961)
  8031. <=WM: (13457: I3 ^see 0)
  8032. <=WM: (13488: O1916 ^name predict-no)
  8033. <=WM: (13487: O1915 ^name predict-yes)
  8034. <=WM: (13486: R961 ^value 1)
  8035. --- Inner Elaboration Phase, active level 1 (S1) ---
  8036. Firing prefer*rvt*predict-yes*H0
  8037. -->
  8038. Firing rl*prefer*rvt*predict-yes*H0*1
  8039. -->
  8040. (S1 ^operator O1917 = 0.)
  8041. Firing prefer*rvt*predict-no*H0
  8042. -->
  8043. Firing rl*prefer*rvt*predict-no*H0*2
  8044. -->
  8045. (S1 ^operator O1918 = 1.)
  8046. inner elaboration loop at bottom goal.
  8047. Retracting rl*prefer*rvt*predict-no*H0*2
  8048. -->
  8049. (S1 ^operator O1916 = 1.)
  8050. Retracting rl*prefer*rvt*predict-yes*H0*1
  8051. -->
  8052. (S1 ^operator O1915 = 0.)
  8053. --- END Proposal Phase ---
  8054. --- Decision Phase ---
  8055. RL update rl*prefer*rvt*predict-yes*H0*5 0.554451 -0.290385 0.264066 -> 0.554462 -0.290385 0.264077(R,m,v=1,0.872832,0.111641)
  8056. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.44542 0.290383 0.735802 -> 0.445432 0.290383 0.735815(R,m,v=1,1,0)
  8057. =>WM: (13507: S1 ^operator O1918)
  8058. 959: O: O1918 (predict-no)
  8059. --- END Decision Phase ---
  8060. --- Application Phase ---
  8061. --- Firing Productions (PE) For State At Depth 1 ---
  8062. --- Inner Elaboration Phase, active level 1 (S1) ---
  8063. Firing apply*operator
  8064. -->
  8065. (I3 ^predict-no N959 + :O )
  8066. Firing apply*operator*complete
  8067. -->
  8068. (I3 ^predict-yes N958 - :O )
  8069. inner elaboration loop at bottom goal.
  8070. --- Change Working Memory (PE) ---
  8071. =>WM: (13508: I3 ^predict-no N959)
  8072. <=WM: (13494: N958 ^status complete)
  8073. <=WM: (13493: I3 ^predict-yes N958)
  8074. --- Firing Productions (IE) For State At Depth 1 ---
  8075. --- Inner Elaboration Phase, active level 1 (S1) ---
  8076. Firing monitor*world
  8077. -->
  8078. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8079. --- Change Working Memory (IE) ---
  8080. --- END Application Phase ---
  8081. --- Output Phase ---
  8082. ENV: Agent did: predict-no for direction U in state State-A
  8083. In State-A moving U
  8084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8085. predict error 0
  8086. dir: dir isL
  8087. --- END Output Phase ---
  8088. -/--- Input Phase ---
  8089. =>WM: (13512: I2 ^dir L)
  8090. =>WM: (13511: I2 ^reward 1)
  8091. =>WM: (13510: I2 ^see 0)
  8092. =>WM: (13509: N959 ^status complete)
  8093. <=WM: (13497: I2 ^dir U)
  8094. <=WM: (13496: I2 ^reward 1)
  8095. <=WM: (13495: I2 ^see 1)
  8096. =>WM: (13513: I2 ^level-1 L1-root)
  8097. <=WM: (13498: I2 ^level-1 L1-root)
  8098. --- END Input Phase ---
  8099. --- Proposal Phase ---
  8100. --- Inner Elaboration Phase, active level 1 (S1) ---
  8101. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8102. -->
  8103. (S1 ^operator O1917 = -0.181727099742844)
  8104. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8105. -->
  8106. Firing elaborate*copy-see-to-output-link
  8107. -->
  8108. (I3 ^see 0 +)
  8109. Firing elaborate*reward*based*on*reward
  8110. -->
  8111. (R963 ^value 1 +)
  8112. (R1 ^reward R963 +)
  8113. Firing propose*predict-yes
  8114. -->
  8115. (O1919 ^name predict-yes +)
  8116. (S1 ^operator O1919 +)
  8117. Firing propose*predict-no
  8118. -->
  8119. (O1920 ^name predict-no +)
  8120. (S1 ^operator O1920 +)
  8121. Firing rl*prefer*rvt*predict-no*H0*6
  8122. -->
  8123. (S1 ^operator O1918 = 0.9996367744406318)
  8124. Firing rl*prefer*rvt*predict-yes*H0*5
  8125. -->
  8126. (S1 ^operator O1917 = 0.2640770017585976)
  8127. Firing prefer*rvt*predict-yes*H0
  8128. -->
  8129. Firing prefer*rvt*predict-no*H0
  8130. -->
  8131. Firing elaborate*copy-dir-to-output-link
  8132. -->
  8133. (I3 ^dir L +)
  8134. inner elaboration loop at bottom goal.
  8135. Retracting elaborate*copy-see-to-output-link
  8136. -->
  8137. (I3 ^see 1 +)
  8138. Retracting propose*predict-no
  8139. -->
  8140. (O1918 ^name predict-no +)
  8141. (S1 ^operator O1918 +)
  8142. Retracting propose*predict-yes
  8143. -->
  8144. (O1917 ^name predict-yes +)
  8145. (S1 ^operator O1917 +)
  8146. Retracting elaborate*reward*based*on*reward
  8147. -->
  8148. (R962 ^value 1 +)
  8149. (R1 ^reward R962 +)
  8150. Retracting elaborate*copy-dir-to-output-link
  8151. -->
  8152. (I3 ^dir U +)
  8153. Retracting rl*prefer*rvt*predict-no*H0*2
  8154. -->
  8155. (S1 ^operator O1918 = 1.)
  8156. Retracting rl*prefer*rvt*predict-yes*H0*1
  8157. -->
  8158. (S1 ^operator O1917 = 0.)
  8159. =>WM: (13521: S1 ^operator O1920 +)
  8160. =>WM: (13520: S1 ^operator O1919 +)
  8161. =>WM: (13519: I3 ^dir L)
  8162. =>WM: (13518: O1920 ^name predict-no)
  8163. =>WM: (13517: O1919 ^name predict-yes)
  8164. =>WM: (13516: R963 ^value 1)
  8165. =>WM: (13515: R1 ^reward R963)
  8166. =>WM: (13514: I3 ^see 0)
  8167. <=WM: (13505: S1 ^operator O1917 +)
  8168. <=WM: (13506: S1 ^operator O1918 +)
  8169. <=WM: (13507: S1 ^operator O1918)
  8170. <=WM: (13504: I3 ^dir U)
  8171. <=WM: (13500: R1 ^reward R962)
  8172. <=WM: (13499: I3 ^see 1)
  8173. <=WM: (13503: O1918 ^name predict-no)
  8174. <=WM: (13502: O1917 ^name predict-yes)
  8175. <=WM: (13501: R962 ^value 1)
  8176. --- Inner Elaboration Phase, active level 1 (S1) ---
  8177. Firing prefer*rvt*predict-yes*H0
  8178. -->
  8179. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8180. -->
  8181. (S1 ^operator O1919 = -0.181727099742844)
  8182. Firing rl*prefer*rvt*predict-yes*H0*5
  8183. -->
  8184. (S1 ^operator O1919 = 0.2640770017585976)
  8185. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8186. -->
  8187. Firing prefer*rvt*predict-no*H0
  8188. -->
  8189. Firing rl*prefer*rvt*predict-no*H0*6
  8190. -->
  8191. (S1 ^operator O1920 = 0.9996367744406318)
  8192. inner elaboration loop at bottom goal.
  8193. Retracting rl*prefer*rvt*predict-no*H0*6
  8194. -->
  8195. (S1 ^operator O1918 = 0.9996367744406318)
  8196. Retracting rl*prefer*rvt*predict-yes*H0*5
  8197. -->
  8198. (S1 ^operator O1917 = 0.2640770017585976)
  8199. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8200. -->
  8201. (S1 ^operator O1917 = -0.181727099742844)
  8202. --- END Proposal Phase ---
  8203. --- Decision Phase ---
  8204. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8205. =>WM: (13522: S1 ^operator O1920)
  8206. 960: O: O1920 (predict-no)
  8207. --- END Decision Phase ---
  8208. --- Application Phase ---
  8209. --- Firing Productions (PE) For State At Depth 1 ---
  8210. --- Inner Elaboration Phase, active level 1 (S1) ---
  8211. Firing apply*operator
  8212. -->
  8213. (I3 ^predict-no N960 + :O )
  8214. Firing apply*operator*complete
  8215. -->
  8216. (I3 ^predict-no N959 - :O )
  8217. inner elaboration loop at bottom goal.
  8218. --- Change Working Memory (PE) ---
  8219. =>WM: (13523: I3 ^predict-no N960)
  8220. <=WM: (13509: N959 ^status complete)
  8221. <=WM: (13508: I3 ^predict-no N959)
  8222. --- Firing Productions (IE) For State At Depth 1 ---
  8223. --- Inner Elaboration Phase, active level 1 (S1) ---
  8224. Firing monitor*world
  8225. -->
  8226. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8227. --- Change Working Memory (IE) ---
  8228. --- END Application Phase ---
  8229. --- Output Phase ---
  8230. ENV: Agent did: predict-no for direction L in state State-A
  8231. In State-A moving L
  8232. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8233. predict error 0
  8234. dir: dir isU
  8235. --- END Output Phase ---
  8236. |\---- Input Phase ---
  8237. =>WM: (13527: I2 ^dir U)
  8238. =>WM: (13526: I2 ^reward 1)
  8239. =>WM: (13525: I2 ^see 0)
  8240. =>WM: (13524: N960 ^status complete)
  8241. <=WM: (13512: I2 ^dir L)
  8242. <=WM: (13511: I2 ^reward 1)
  8243. <=WM: (13510: I2 ^see 0)
  8244. =>WM: (13528: I2 ^level-1 L0-root)
  8245. <=WM: (13513: I2 ^level-1 L1-root)
  8246. --- END Input Phase ---
  8247. --- Proposal Phase ---
  8248. --- Inner Elaboration Phase, active level 1 (S1) ---
  8249. Firing elaborate*copy-see-to-output-link
  8250. -->
  8251. (I3 ^see 0 +)
  8252. Firing elaborate*reward*based*on*reward
  8253. -->
  8254. (R964 ^value 1 +)
  8255. (R1 ^reward R964 +)
  8256. Firing propose*predict-yes
  8257. -->
  8258. (O1921 ^name predict-yes +)
  8259. (S1 ^operator O1921 +)
  8260. Firing propose*predict-no
  8261. -->
  8262. (O1922 ^name predict-no +)
  8263. (S1 ^operator O1922 +)
  8264. Firing rl*prefer*rvt*predict-no*H0*2
  8265. -->
  8266. (S1 ^operator O1920 = 1.)
  8267. Firing rl*prefer*rvt*predict-yes*H0*1
  8268. -->
  8269. (S1 ^operator O1919 = 0.)
  8270. Firing prefer*rvt*predict-yes*H0
  8271. -->
  8272. Firing prefer*rvt*predict-no*H0
  8273. -->
  8274. Firing elaborate*copy-dir-to-output-link
  8275. -->
  8276. (I3 ^dir U +)
  8277. inner elaboration loop at bottom goal.
  8278. Retracting elaborate*copy-see-to-output-link
  8279. -->
  8280. (I3 ^see 0 +)
  8281. Retracting propose*predict-no
  8282. -->
  8283. (O1920 ^name predict-no +)
  8284. (S1 ^operator O1920 +)
  8285. Retracting propose*predict-yes
  8286. -->
  8287. (O1919 ^name predict-yes +)
  8288. (S1 ^operator O1919 +)
  8289. Retracting elaborate*reward*based*on*reward
  8290. -->
  8291. (R963 ^value 1 +)
  8292. (R1 ^reward R963 +)
  8293. Retracting elaborate*copy-dir-to-output-link
  8294. -->
  8295. (I3 ^dir L +)
  8296. Retracting rl*prefer*rvt*predict-no*H0*6
  8297. -->
  8298. (S1 ^operator O1920 = 0.9996367744406318)
  8299. Retracting rl*prefer*rvt*predict-yes*H0*5
  8300. -->
  8301. (S1 ^operator O1919 = 0.2640770017585976)
  8302. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8303. -->
  8304. (S1 ^operator O1919 = -0.181727099742844)
  8305. =>WM: (13535: S1 ^operator O1922 +)
  8306. =>WM: (13534: S1 ^operator O1921 +)
  8307. =>WM: (13533: I3 ^dir U)
  8308. =>WM: (13532: O1922 ^name predict-no)
  8309. =>WM: (13531: O1921 ^name predict-yes)
  8310. =>WM: (13530: R964 ^value 1)
  8311. =>WM: (13529: R1 ^reward R964)
  8312. <=WM: (13520: S1 ^operator O1919 +)
  8313. <=WM: (13521: S1 ^operator O1920 +)
  8314. <=WM: (13522: S1 ^operator O1920)
  8315. <=WM: (13519: I3 ^dir L)
  8316. <=WM: (13515: R1 ^reward R963)
  8317. <=WM: (13518: O1920 ^name predict-no)
  8318. <=WM: (13517: O1919 ^name predict-yes)
  8319. <=WM: (13516: R963 ^value 1)
  8320. --- Inner Elaboration Phase, active level 1 (S1) ---
  8321. Firing prefer*rvt*predict-yes*H0
  8322. -->
  8323. Firing rl*prefer*rvt*predict-yes*H0*1
  8324. -->
  8325. (S1 ^operator O1921 = 0.)
  8326. Firing prefer*rvt*predict-no*H0
  8327. -->
  8328. Firing rl*prefer*rvt*predict-no*H0*2
  8329. -->
  8330. (S1 ^operator O1922 = 1.)
  8331. inner elaboration loop at bottom goal.
  8332. Retracting rl*prefer*rvt*predict-no*H0*2
  8333. -->
  8334. (S1 ^operator O1920 = 1.)
  8335. Retracting rl*prefer*rvt*predict-yes*H0*1
  8336. -->
  8337. (S1 ^operator O1919 = 0.)
  8338. --- END Proposal Phase ---
  8339. --- Decision Phase ---
  8340. RL update rl*prefer*rvt*predict-no*H0*6 0.999637 0 0.999637 -> 0.999698 0 0.999698(R,m,v=1,0.903448,0.0878352)
  8341. =>WM: (13536: S1 ^operator O1922)
  8342. 961: O: O1922 (predict-no)
  8343. --- END Decision Phase ---
  8344. --- Application Phase ---
  8345. --- Firing Productions (PE) For State At Depth 1 ---
  8346. --- Inner Elaboration Phase, active level 1 (S1) ---
  8347. Firing apply*operator
  8348. -->
  8349. (I3 ^predict-no N961 + :O )
  8350. Firing apply*operator*complete
  8351. -->
  8352. (I3 ^predict-no N960 - :O )
  8353. inner elaboration loop at bottom goal.
  8354. --- Change Working Memory (PE) ---
  8355. =>WM: (13537: I3 ^predict-no N961)
  8356. <=WM: (13524: N960 ^status complete)
  8357. <=WM: (13523: I3 ^predict-no N960)
  8358. --- Firing Productions (IE) For State At Depth 1 ---
  8359. --- Inner Elaboration Phase, active level 1 (S1) ---
  8360. Firing monitor*world
  8361. -->
  8362. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8363. --- Change Working Memory (IE) ---
  8364. --- END Application Phase ---
  8365. --- Output Phase ---
  8366. ENV: Agent did: predict-no for direction U in state State-A
  8367. In State-A moving U
  8368. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8369. predict error 0
  8370. dir: dir isR
  8371. --- END Output Phase ---
  8372. /--- Input Phase ---
  8373. =>WM: (13541: I2 ^dir R)
  8374. =>WM: (13540: I2 ^reward 1)
  8375. =>WM: (13539: I2 ^see 0)
  8376. =>WM: (13538: N961 ^status complete)
  8377. <=WM: (13527: I2 ^dir U)
  8378. <=WM: (13526: I2 ^reward 1)
  8379. <=WM: (13525: I2 ^see 0)
  8380. =>WM: (13542: I2 ^level-1 L0-root)
  8381. <=WM: (13528: I2 ^level-1 L0-root)
  8382. --- END Input Phase ---
  8383. --- Proposal Phase ---
  8384. --- Inner Elaboration Phase, active level 1 (S1) ---
  8385. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8386. -->
  8387. (S1 ^operator O1922 = -0.2817060109291377)
  8388. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8389. -->
  8390. (S1 ^operator O1921 = 0.6623767743575877)
  8391. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8392. -->
  8393. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8394. -->
  8395. Firing elaborate*copy-see-to-output-link
  8396. -->
  8397. (I3 ^see 0 +)
  8398. Firing elaborate*reward*based*on*reward
  8399. -->
  8400. (R965 ^value 1 +)
  8401. (R1 ^reward R965 +)
  8402. Firing propose*predict-yes
  8403. -->
  8404. (O1923 ^name predict-yes +)
  8405. (S1 ^operator O1923 +)
  8406. Firing propose*predict-no
  8407. -->
  8408. (O1924 ^name predict-no +)
  8409. (S1 ^operator O1924 +)
  8410. Firing rl*prefer*rvt*predict-no*H0*4
  8411. -->
  8412. (S1 ^operator O1922 = 0.3397713875215998)
  8413. Firing rl*prefer*rvt*predict-yes*H0*3
  8414. -->
  8415. (S1 ^operator O1921 = 0.3377188564178903)
  8416. Firing prefer*rvt*predict-yes*H0
  8417. -->
  8418. Firing prefer*rvt*predict-no*H0
  8419. -->
  8420. Firing elaborate*copy-dir-to-output-link
  8421. -->
  8422. (I3 ^dir R +)
  8423. inner elaboration loop at bottom goal.
  8424. Retracting elaborate*copy-see-to-output-link
  8425. -->
  8426. (I3 ^see 0 +)
  8427. Retracting propose*predict-no
  8428. -->
  8429. (O1922 ^name predict-no +)
  8430. (S1 ^operator O1922 +)
  8431. Retracting propose*predict-yes
  8432. -->
  8433. (O1921 ^name predict-yes +)
  8434. (S1 ^operator O1921 +)
  8435. Retracting elaborate*reward*based*on*reward
  8436. -->
  8437. (R964 ^value 1 +)
  8438. (R1 ^reward R964 +)
  8439. Retracting elaborate*copy-dir-to-output-link
  8440. -->
  8441. (I3 ^dir U +)
  8442. Retracting rl*prefer*rvt*predict-no*H0*2
  8443. -->
  8444. (S1 ^operator O1922 = 1.)
  8445. Retracting rl*prefer*rvt*predict-yes*H0*1
  8446. -->
  8447. (S1 ^operator O1921 = 0.)
  8448. =>WM: (13549: S1 ^operator O1924 +)
  8449. =>WM: (13548: S1 ^operator O1923 +)
  8450. =>WM: (13547: I3 ^dir R)
  8451. =>WM: (13546: O1924 ^name predict-no)
  8452. =>WM: (13545: O1923 ^name predict-yes)
  8453. =>WM: (13544: R965 ^value 1)
  8454. =>WM: (13543: R1 ^reward R965)
  8455. <=WM: (13534: S1 ^operator O1921 +)
  8456. <=WM: (13535: S1 ^operator O1922 +)
  8457. <=WM: (13536: S1 ^operator O1922)
  8458. <=WM: (13533: I3 ^dir U)
  8459. <=WM: (13529: R1 ^reward R964)
  8460. <=WM: (13532: O1922 ^name predict-no)
  8461. <=WM: (13531: O1921 ^name predict-yes)
  8462. <=WM: (13530: R964 ^value 1)
  8463. --- Inner Elaboration Phase, active level 1 (S1) ---
  8464. Firing prefer*rvt*predict-yes*H0
  8465. -->
  8466. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8467. -->
  8468. (S1 ^operator O1923 = 0.6623767743575877)
  8469. Firing rl*prefer*rvt*predict-yes*H0*3
  8470. -->
  8471. (S1 ^operator O1923 = 0.3377188564178903)
  8472. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  8473. -->
  8474. Firing prefer*rvt*predict-no*H0
  8475. -->
  8476. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8477. -->
  8478. (S1 ^operator O1924 = -0.2817060109291377)
  8479. Firing rl*prefer*rvt*predict-no*H0*4
  8480. -->
  8481. (S1 ^operator O1924 = 0.3397713875215998)
  8482. Firing prefer*rvt*predict-no*H0*4*v1*H1
  8483. -->
  8484. inner elaboration loop at bottom goal.
  8485. Retracting rl*prefer*rvt*predict-no*H0*4
  8486. -->
  8487. (S1 ^operator O1922 = 0.3397713875215998)
  8488. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8489. -->
  8490. (S1 ^operator O1922 = -0.2817060109291377)
  8491. Retracting rl*prefer*rvt*predict-yes*H0*3
  8492. -->
  8493. (S1 ^operator O1921 = 0.3377188564178903)
  8494. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8495. -->
  8496. (S1 ^operator O1921 = 0.6623767743575877)
  8497. --- END Proposal Phase ---
  8498. --- Decision Phase ---
  8499. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8500. =>WM: (13550: S1 ^operator O1923)
  8501. 962: O: O1923 (predict-yes)
  8502. --- END Decision Phase ---
  8503. --- Application Phase ---
  8504. --- Firing Productions (PE) For State At Depth 1 ---
  8505. --- Inner Elaboration Phase, active level 1 (S1) ---
  8506. Firing apply*operator
  8507. -->
  8508. (I3 ^predict-yes N962 + :O )
  8509. Firing apply*operator*complete
  8510. -->
  8511. (I3 ^predict-no N961 - :O )
  8512. inner elaboration loop at bottom goal.
  8513. --- Change Working Memory (PE) ---
  8514. =>WM: (13551: I3 ^predict-yes N962)
  8515. <=WM: (13538: N961 ^status complete)
  8516. <=WM: (13537: I3 ^predict-no N961)
  8517. --- Firing Productions (IE) For State At Depth 1 ---
  8518. --- Inner Elaboration Phase, active level 1 (S1) ---
  8519. Firing monitor*world
  8520. -->
  8521. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8522. --- Change Working Memory (IE) ---
  8523. --- END Application Phase ---
  8524. --- Output Phase ---
  8525. ENV: Agent did: predict-yes for direction R in state State-A
  8526. In State-A moving R
  8527. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8528. predict error 0
  8529. dir: dir isU
  8530. --- END Output Phase ---
  8531. |\--- Input Phase ---
  8532. =>WM: (13555: I2 ^dir U)
  8533. =>WM: (13554: I2 ^reward 1)
  8534. =>WM: (13553: I2 ^see 1)
  8535. =>WM: (13552: N962 ^status complete)
  8536. <=WM: (13541: I2 ^dir R)
  8537. <=WM: (13540: I2 ^reward 1)
  8538. <=WM: (13539: I2 ^see 0)
  8539. =>WM: (13556: I2 ^level-1 R1-root)
  8540. <=WM: (13542: I2 ^level-1 L0-root)
  8541. --- END Input Phase ---
  8542. --- Proposal Phase ---
  8543. --- Inner Elaboration Phase, active level 1 (S1) ---
  8544. Firing elaborate*copy-see-to-output-link
  8545. -->
  8546. (I3 ^see 1 +)
  8547. Firing elaborate*reward*based*on*reward
  8548. -->
  8549. (R966 ^value 1 +)
  8550. (R1 ^reward R966 +)
  8551. Firing propose*predict-yes
  8552. -->
  8553. (O1925 ^name predict-yes +)
  8554. (S1 ^operator O1925 +)
  8555. Firing propose*predict-no
  8556. -->
  8557. (O1926 ^name predict-no +)
  8558. (S1 ^operator O1926 +)
  8559. Firing rl*prefer*rvt*predict-no*H0*2
  8560. -->
  8561. (S1 ^operator O1924 = 1.)
  8562. Firing rl*prefer*rvt*predict-yes*H0*1
  8563. -->
  8564. (S1 ^operator O1923 = 0.)
  8565. Firing prefer*rvt*predict-yes*H0
  8566. -->
  8567. Firing prefer*rvt*predict-no*H0
  8568. -->
  8569. Firing elaborate*copy-dir-to-output-link
  8570. -->
  8571. (I3 ^dir U +)
  8572. inner elaboration loop at bottom goal.
  8573. Retracting elaborate*copy-see-to-output-link
  8574. -->
  8575. (I3 ^see 0 +)
  8576. Retracting propose*predict-no
  8577. -->
  8578. (O1924 ^name predict-no +)
  8579. (S1 ^operator O1924 +)
  8580. Retracting propose*predict-yes
  8581. -->
  8582. (O1923 ^name predict-yes +)
  8583. (S1 ^operator O1923 +)
  8584. Retracting elaborate*reward*based*on*reward
  8585. -->
  8586. (R965 ^value 1 +)
  8587. (R1 ^reward R965 +)
  8588. Retracting elaborate*copy-dir-to-output-link
  8589. -->
  8590. (I3 ^dir R +)
  8591. Retracting rl*prefer*rvt*predict-no*H0*4
  8592. -->
  8593. (S1 ^operator O1924 = 0.3397713875215998)
  8594. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  8595. -->
  8596. (S1 ^operator O1924 = -0.2817060109291377)
  8597. Retracting rl*prefer*rvt*predict-yes*H0*3
  8598. -->
  8599. (S1 ^operator O1923 = 0.3377188564178903)
  8600. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  8601. -->
  8602. (S1 ^operator O1923 = 0.6623767743575877)
  8603. =>WM: (13564: S1 ^operator O1926 +)
  8604. =>WM: (13563: S1 ^operator O1925 +)
  8605. =>WM: (13562: I3 ^dir U)
  8606. =>WM: (13561: O1926 ^name predict-no)
  8607. =>WM: (13560: O1925 ^name predict-yes)
  8608. =>WM: (13559: R966 ^value 1)
  8609. =>WM: (13558: R1 ^reward R966)
  8610. =>WM: (13557: I3 ^see 1)
  8611. <=WM: (13548: S1 ^operator O1923 +)
  8612. <=WM: (13550: S1 ^operator O1923)
  8613. <=WM: (13549: S1 ^operator O1924 +)
  8614. <=WM: (13547: I3 ^dir R)
  8615. <=WM: (13543: R1 ^reward R965)
  8616. <=WM: (13514: I3 ^see 0)
  8617. <=WM: (13546: O1924 ^name predict-no)
  8618. <=WM: (13545: O1923 ^name predict-yes)
  8619. <=WM: (13544: R965 ^value 1)
  8620. --- Inner Elaboration Phase, active level 1 (S1) ---
  8621. Firing prefer*rvt*predict-yes*H0
  8622. -->
  8623. Firing rl*prefer*rvt*predict-yes*H0*1
  8624. -->
  8625. (S1 ^operator O1925 = 0.)
  8626. Firing prefer*rvt*predict-no*H0
  8627. -->
  8628. Firing rl*prefer*rvt*predict-no*H0*2
  8629. -->
  8630. (S1 ^operator O1926 = 1.)
  8631. inner elaboration loop at bottom goal.
  8632. Retracting rl*prefer*rvt*predict-no*H0*2
  8633. -->
  8634. (S1 ^operator O1924 = 1.)
  8635. Retracting rl*prefer*rvt*predict-yes*H0*1
  8636. -->
  8637. (S1 ^operator O1923 = 0.)
  8638. --- END Proposal Phase ---
  8639. --- Decision Phase ---
  8640. RL update rl*prefer*rvt*predict-yes*H0*3 0.59012 -0.252401 0.337719 -> 0.590111 -0.2524 0.337711(R,m,v=1,0.895062,0.0945096)
  8641. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.40999 0.252387 0.662377 -> 0.409979 0.252388 0.662368(R,m,v=1,1,0)
  8642. =>WM: (13565: S1 ^operator O1926)
  8643. 963: O: O1926 (predict-no)
  8644. --- END Decision Phase ---
  8645. --- Application Phase ---
  8646. --- Firing Productions (PE) For State At Depth 1 ---
  8647. --- Inner Elaboration Phase, active level 1 (S1) ---
  8648. Firing apply*operator
  8649. -->
  8650. (I3 ^predict-no N963 + :O )
  8651. Firing apply*operator*complete
  8652. -->
  8653. (I3 ^predict-yes N962 - :O )
  8654. inner elaboration loop at bottom goal.
  8655. --- Change Working Memory (PE) ---
  8656. =>WM: (13566: I3 ^predict-no N963)
  8657. <=WM: (13552: N962 ^status complete)
  8658. <=WM: (13551: I3 ^predict-yes N962)
  8659. --- Firing Productions (IE) For State At Depth 1 ---
  8660. --- Inner Elaboration Phase, active level 1 (S1) ---
  8661. Firing monitor*world
  8662. -->
  8663. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8664. --- Change Working Memory (IE) ---
  8665. --- END Application Phase ---
  8666. --- Output Phase ---
  8667. ENV: Agent did: predict-no for direction U in state State-B
  8668. In State-B moving U
  8669. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8670. predict error 0
  8671. dir: dir isL
  8672. --- END Output Phase ---
  8673. -/|--- Input Phase ---
  8674. =>WM: (13570: I2 ^dir L)
  8675. =>WM: (13569: I2 ^reward 1)
  8676. =>WM: (13568: I2 ^see 0)
  8677. =>WM: (13567: N963 ^status complete)
  8678. <=WM: (13555: I2 ^dir U)
  8679. <=WM: (13554: I2 ^reward 1)
  8680. <=WM: (13553: I2 ^see 1)
  8681. =>WM: (13571: I2 ^level-1 R1-root)
  8682. <=WM: (13556: I2 ^level-1 R1-root)
  8683. --- END Input Phase ---
  8684. --- Proposal Phase ---
  8685. --- Inner Elaboration Phase, active level 1 (S1) ---
  8686. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8687. -->
  8688. (S1 ^operator O1925 = 0.7363235474336447)
  8689. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8690. -->
  8691. Firing elaborate*copy-see-to-output-link
  8692. -->
  8693. (I3 ^see 0 +)
  8694. Firing elaborate*reward*based*on*reward
  8695. -->
  8696. (R967 ^value 1 +)
  8697. (R1 ^reward R967 +)
  8698. Firing propose*predict-yes
  8699. -->
  8700. (O1927 ^name predict-yes +)
  8701. (S1 ^operator O1927 +)
  8702. Firing propose*predict-no
  8703. -->
  8704. (O1928 ^name predict-no +)
  8705. (S1 ^operator O1928 +)
  8706. Firing rl*prefer*rvt*predict-no*H0*6
  8707. -->
  8708. (S1 ^operator O1926 = 0.9996975476948911)
  8709. Firing rl*prefer*rvt*predict-yes*H0*5
  8710. -->
  8711. (S1 ^operator O1925 = 0.2640770017585976)
  8712. Firing prefer*rvt*predict-yes*H0
  8713. -->
  8714. Firing prefer*rvt*predict-no*H0
  8715. -->
  8716. Firing elaborate*copy-dir-to-output-link
  8717. -->
  8718. (I3 ^dir L +)
  8719. inner elaboration loop at bottom goal.
  8720. Retracting elaborate*copy-see-to-output-link
  8721. -->
  8722. (I3 ^see 1 +)
  8723. Retracting propose*predict-no
  8724. -->
  8725. (O1926 ^name predict-no +)
  8726. (S1 ^operator O1926 +)
  8727. Retracting propose*predict-yes
  8728. -->
  8729. (O1925 ^name predict-yes +)
  8730. (S1 ^operator O1925 +)
  8731. Retracting elaborate*reward*based*on*reward
  8732. -->
  8733. (R966 ^value 1 +)
  8734. (R1 ^reward R966 +)
  8735. Retracting elaborate*copy-dir-to-output-link
  8736. -->
  8737. (I3 ^dir U +)
  8738. Retracting rl*prefer*rvt*predict-no*H0*2
  8739. -->
  8740. (S1 ^operator O1926 = 1.)
  8741. Retracting rl*prefer*rvt*predict-yes*H0*1
  8742. -->
  8743. (S1 ^operator O1925 = 0.)
  8744. =>WM: (13579: S1 ^operator O1928 +)
  8745. =>WM: (13578: S1 ^operator O1927 +)
  8746. =>WM: (13577: I3 ^dir L)
  8747. =>WM: (13576: O1928 ^name predict-no)
  8748. =>WM: (13575: O1927 ^name predict-yes)
  8749. =>WM: (13574: R967 ^value 1)
  8750. =>WM: (13573: R1 ^reward R967)
  8751. =>WM: (13572: I3 ^see 0)
  8752. <=WM: (13563: S1 ^operator O1925 +)
  8753. <=WM: (13564: S1 ^operator O1926 +)
  8754. <=WM: (13565: S1 ^operator O1926)
  8755. <=WM: (13562: I3 ^dir U)
  8756. <=WM: (13558: R1 ^reward R966)
  8757. <=WM: (13557: I3 ^see 1)
  8758. <=WM: (13561: O1926 ^name predict-no)
  8759. <=WM: (13560: O1925 ^name predict-yes)
  8760. <=WM: (13559: R966 ^value 1)
  8761. --- Inner Elaboration Phase, active level 1 (S1) ---
  8762. Firing prefer*rvt*predict-yes*H0
  8763. -->
  8764. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8765. -->
  8766. (S1 ^operator O1927 = 0.7363235474336447)
  8767. Firing rl*prefer*rvt*predict-yes*H0*5
  8768. -->
  8769. (S1 ^operator O1927 = 0.2640770017585976)
  8770. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8771. -->
  8772. Firing prefer*rvt*predict-no*H0
  8773. -->
  8774. Firing rl*prefer*rvt*predict-no*H0*6
  8775. -->
  8776. (S1 ^operator O1928 = 0.9996975476948911)
  8777. inner elaboration loop at bottom goal.
  8778. Retracting rl*prefer*rvt*predict-no*H0*6
  8779. -->
  8780. (S1 ^operator O1926 = 0.9996975476948911)
  8781. Retracting rl*prefer*rvt*predict-yes*H0*5
  8782. -->
  8783. (S1 ^operator O1925 = 0.2640770017585976)
  8784. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8785. -->
  8786. (S1 ^operator O1925 = 0.7363235474336447)
  8787. --- END Proposal Phase ---
  8788. --- Decision Phase ---
  8789. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8790. =>WM: (13580: S1 ^operator O1927)
  8791. 964: O: O1927 (predict-yes)
  8792. --- END Decision Phase ---
  8793. --- Application Phase ---
  8794. --- Firing Productions (PE) For State At Depth 1 ---
  8795. --- Inner Elaboration Phase, active level 1 (S1) ---
  8796. Firing apply*operator
  8797. -->
  8798. (I3 ^predict-yes N964 + :O )
  8799. Firing apply*operator*complete
  8800. -->
  8801. (I3 ^predict-no N963 - :O )
  8802. inner elaboration loop at bottom goal.
  8803. --- Change Working Memory (PE) ---
  8804. =>WM: (13581: I3 ^predict-yes N964)
  8805. <=WM: (13567: N963 ^status complete)
  8806. <=WM: (13566: I3 ^predict-no N963)
  8807. --- Firing Productions (IE) For State At Depth 1 ---
  8808. --- Inner Elaboration Phase, active level 1 (S1) ---
  8809. Firing monitor*world
  8810. -->
  8811. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8812. --- Change Working Memory (IE) ---
  8813. --- END Application Phase ---
  8814. --- Output Phase ---
  8815. ENV: Agent did: predict-yes for direction L in state State-B
  8816. In State-B moving L
  8817. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8818. predict error 0
  8819. dir: dir isU
  8820. --- END Output Phase ---
  8821. \-/--- Input Phase ---
  8822. =>WM: (13585: I2 ^dir U)
  8823. =>WM: (13584: I2 ^reward 1)
  8824. =>WM: (13583: I2 ^see 1)
  8825. =>WM: (13582: N964 ^status complete)
  8826. <=WM: (13570: I2 ^dir L)
  8827. <=WM: (13569: I2 ^reward 1)
  8828. <=WM: (13568: I2 ^see 0)
  8829. =>WM: (13586: I2 ^level-1 L1-root)
  8830. <=WM: (13571: I2 ^level-1 R1-root)
  8831. --- END Input Phase ---
  8832. --- Proposal Phase ---
  8833. --- Inner Elaboration Phase, active level 1 (S1) ---
  8834. Firing elaborate*copy-see-to-output-link
  8835. -->
  8836. (I3 ^see 1 +)
  8837. Firing elaborate*reward*based*on*reward
  8838. -->
  8839. (R968 ^value 1 +)
  8840. (R1 ^reward R968 +)
  8841. Firing propose*predict-yes
  8842. -->
  8843. (O1929 ^name predict-yes +)
  8844. (S1 ^operator O1929 +)
  8845. Firing propose*predict-no
  8846. -->
  8847. (O1930 ^name predict-no +)
  8848. (S1 ^operator O1930 +)
  8849. Firing rl*prefer*rvt*predict-no*H0*2
  8850. -->
  8851. (S1 ^operator O1928 = 1.)
  8852. Firing rl*prefer*rvt*predict-yes*H0*1
  8853. -->
  8854. (S1 ^operator O1927 = 0.)
  8855. Firing prefer*rvt*predict-yes*H0
  8856. -->
  8857. Firing prefer*rvt*predict-no*H0
  8858. -->
  8859. Firing elaborate*copy-dir-to-output-link
  8860. -->
  8861. (I3 ^dir U +)
  8862. inner elaboration loop at bottom goal.
  8863. Retracting elaborate*copy-see-to-output-link
  8864. -->
  8865. (I3 ^see 0 +)
  8866. Retracting propose*predict-no
  8867. -->
  8868. (O1928 ^name predict-no +)
  8869. (S1 ^operator O1928 +)
  8870. Retracting propose*predict-yes
  8871. -->
  8872. (O1927 ^name predict-yes +)
  8873. (S1 ^operator O1927 +)
  8874. Retracting elaborate*reward*based*on*reward
  8875. -->
  8876. (R967 ^value 1 +)
  8877. (R1 ^reward R967 +)
  8878. Retracting elaborate*copy-dir-to-output-link
  8879. -->
  8880. (I3 ^dir L +)
  8881. Retracting rl*prefer*rvt*predict-no*H0*6
  8882. -->
  8883. (S1 ^operator O1928 = 0.9996975476948911)
  8884. Retracting rl*prefer*rvt*predict-yes*H0*5
  8885. -->
  8886. (S1 ^operator O1927 = 0.2640770017585976)
  8887. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  8888. -->
  8889. (S1 ^operator O1927 = 0.7363235474336447)
  8890. =>WM: (13594: S1 ^operator O1930 +)
  8891. =>WM: (13593: S1 ^operator O1929 +)
  8892. =>WM: (13592: I3 ^dir U)
  8893. =>WM: (13591: O1930 ^name predict-no)
  8894. =>WM: (13590: O1929 ^name predict-yes)
  8895. =>WM: (13589: R968 ^value 1)
  8896. =>WM: (13588: R1 ^reward R968)
  8897. =>WM: (13587: I3 ^see 1)
  8898. <=WM: (13578: S1 ^operator O1927 +)
  8899. <=WM: (13580: S1 ^operator O1927)
  8900. <=WM: (13579: S1 ^operator O1928 +)
  8901. <=WM: (13577: I3 ^dir L)
  8902. <=WM: (13573: R1 ^reward R967)
  8903. <=WM: (13572: I3 ^see 0)
  8904. <=WM: (13576: O1928 ^name predict-no)
  8905. <=WM: (13575: O1927 ^name predict-yes)
  8906. <=WM: (13574: R967 ^value 1)
  8907. --- Inner Elaboration Phase, active level 1 (S1) ---
  8908. Firing prefer*rvt*predict-yes*H0
  8909. -->
  8910. Firing rl*prefer*rvt*predict-yes*H0*1
  8911. -->
  8912. (S1 ^operator O1929 = 0.)
  8913. Firing prefer*rvt*predict-no*H0
  8914. -->
  8915. Firing rl*prefer*rvt*predict-no*H0*2
  8916. -->
  8917. (S1 ^operator O1930 = 1.)
  8918. inner elaboration loop at bottom goal.
  8919. Retracting rl*prefer*rvt*predict-no*H0*2
  8920. -->
  8921. (S1 ^operator O1928 = 1.)
  8922. Retracting rl*prefer*rvt*predict-yes*H0*1
  8923. -->
  8924. (S1 ^operator O1927 = 0.)
  8925. --- END Proposal Phase ---
  8926. --- Decision Phase ---
  8927. RL update rl*prefer*rvt*predict-yes*H0*5 0.554462 -0.290385 0.264077 -> 0.55443 -0.290385 0.264044(R,m,v=1,0.873563,0.111089)
  8928. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445932 0.290392 0.736324 -> 0.445895 0.290391 0.736286(R,m,v=1,1,0)
  8929. =>WM: (13595: S1 ^operator O1930)
  8930. 965: O: O1930 (predict-no)
  8931. --- END Decision Phase ---
  8932. --- Application Phase ---
  8933. --- Firing Productions (PE) For State At Depth 1 ---
  8934. --- Inner Elaboration Phase, active level 1 (S1) ---
  8935. Firing apply*operator
  8936. -->
  8937. (I3 ^predict-no N965 + :O )
  8938. Firing apply*operator*complete
  8939. -->
  8940. (I3 ^predict-yes N964 - :O )
  8941. inner elaboration loop at bottom goal.
  8942. --- Change Working Memory (PE) ---
  8943. =>WM: (13596: I3 ^predict-no N965)
  8944. <=WM: (13582: N964 ^status complete)
  8945. <=WM: (13581: I3 ^predict-yes N964)
  8946. --- Firing Productions (IE) For State At Depth 1 ---
  8947. --- Inner Elaboration Phase, active level 1 (S1) ---
  8948. Firing monitor*world
  8949. -->
  8950. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8951. --- Change Working Memory (IE) ---
  8952. --- END Application Phase ---
  8953. --- Output Phase ---
  8954. ENV: Agent did: predict-no for direction U in state State-A
  8955. In State-A moving U
  8956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8957. predict error 0
  8958. dir: dir isL
  8959. --- END Output Phase ---
  8960. |\--- Input Phase ---
  8961. =>WM: (13600: I2 ^dir L)
  8962. =>WM: (13599: I2 ^reward 1)
  8963. =>WM: (13598: I2 ^see 0)
  8964. =>WM: (13597: N965 ^status complete)
  8965. <=WM: (13585: I2 ^dir U)
  8966. <=WM: (13584: I2 ^reward 1)
  8967. <=WM: (13583: I2 ^see 1)
  8968. =>WM: (13601: I2 ^level-1 L1-root)
  8969. <=WM: (13586: I2 ^level-1 L1-root)
  8970. --- END Input Phase ---
  8971. --- Proposal Phase ---
  8972. --- Inner Elaboration Phase, active level 1 (S1) ---
  8973. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  8974. -->
  8975. (S1 ^operator O1929 = -0.181727099742844)
  8976. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  8977. -->
  8978. Firing elaborate*copy-see-to-output-link
  8979. -->
  8980. (I3 ^see 0 +)
  8981. Firing elaborate*reward*based*on*reward
  8982. -->
  8983. (R969 ^value 1 +)
  8984. (R1 ^reward R969 +)
  8985. Firing propose*predict-yes
  8986. -->
  8987. (O1931 ^name predict-yes +)
  8988. (S1 ^operator O1931 +)
  8989. Firing propose*predict-no
  8990. -->
  8991. (O1932 ^name predict-no +)
  8992. (S1 ^operator O1932 +)
  8993. Firing rl*prefer*rvt*predict-no*H0*6
  8994. -->
  8995. (S1 ^operator O1930 = 0.9996975476948911)
  8996. Firing rl*prefer*rvt*predict-yes*H0*5
  8997. -->
  8998. (S1 ^operator O1929 = 0.2640444846619989)
  8999. Firing prefer*rvt*predict-yes*H0
  9000. -->
  9001. Firing prefer*rvt*predict-no*H0
  9002. -->
  9003. Firing elaborate*copy-dir-to-output-link
  9004. -->
  9005. (I3 ^dir L +)
  9006. inner elaboration loop at bottom goal.
  9007. Retracting elaborate*copy-see-to-output-link
  9008. -->
  9009. (I3 ^see 1 +)
  9010. Retracting propose*predict-no
  9011. -->
  9012. (O1930 ^name predict-no +)
  9013. (S1 ^operator O1930 +)
  9014. Retracting propose*predict-yes
  9015. -->
  9016. (O1929 ^name predict-yes +)
  9017. (S1 ^operator O1929 +)
  9018. Retracting elaborate*reward*based*on*reward
  9019. -->
  9020. (R968 ^value 1 +)
  9021. (R1 ^reward R968 +)
  9022. Retracting elaborate*copy-dir-to-output-link
  9023. -->
  9024. (I3 ^dir U +)
  9025. Retracting rl*prefer*rvt*predict-no*H0*2
  9026. -->
  9027. (S1 ^operator O1930 = 1.)
  9028. Retracting rl*prefer*rvt*predict-yes*H0*1
  9029. -->
  9030. (S1 ^operator O1929 = 0.)
  9031. =>WM: (13609: S1 ^operator O1932 +)
  9032. =>WM: (13608: S1 ^operator O1931 +)
  9033. =>WM: (13607: I3 ^dir L)
  9034. =>WM: (13606: O1932 ^name predict-no)
  9035. =>WM: (13605: O1931 ^name predict-yes)
  9036. =>WM: (13604: R969 ^value 1)
  9037. =>WM: (13603: R1 ^reward R969)
  9038. =>WM: (13602: I3 ^see 0)
  9039. <=WM: (13593: S1 ^operator O1929 +)
  9040. <=WM: (13594: S1 ^operator O1930 +)
  9041. <=WM: (13595: S1 ^operator O1930)
  9042. <=WM: (13592: I3 ^dir U)
  9043. <=WM: (13588: R1 ^reward R968)
  9044. <=WM: (13587: I3 ^see 1)
  9045. <=WM: (13591: O1930 ^name predict-no)
  9046. <=WM: (13590: O1929 ^name predict-yes)
  9047. <=WM: (13589: R968 ^value 1)
  9048. --- Inner Elaboration Phase, active level 1 (S1) ---
  9049. Firing prefer*rvt*predict-yes*H0
  9050. -->
  9051. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9052. -->
  9053. (S1 ^operator O1931 = -0.181727099742844)
  9054. Firing rl*prefer*rvt*predict-yes*H0*5
  9055. -->
  9056. (S1 ^operator O1931 = 0.2640444846619989)
  9057. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9058. -->
  9059. Firing prefer*rvt*predict-no*H0
  9060. -->
  9061. Firing rl*prefer*rvt*predict-no*H0*6
  9062. -->
  9063. (S1 ^operator O1932 = 0.9996975476948911)
  9064. inner elaboration loop at bottom goal.
  9065. Retracting rl*prefer*rvt*predict-no*H0*6
  9066. -->
  9067. (S1 ^operator O1930 = 0.9996975476948911)
  9068. Retracting rl*prefer*rvt*predict-yes*H0*5
  9069. -->
  9070. (S1 ^operator O1929 = 0.2640444846619989)
  9071. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9072. -->
  9073. (S1 ^operator O1929 = -0.181727099742844)
  9074. --- END Proposal Phase ---
  9075. --- Decision Phase ---
  9076. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9077. =>WM: (13610: S1 ^operator O1932)
  9078. 966: O: O1932 (predict-no)
  9079. --- END Decision Phase ---
  9080. --- Application Phase ---
  9081. --- Firing Productions (PE) For State At Depth 1 ---
  9082. --- Inner Elaboration Phase, active level 1 (S1) ---
  9083. Firing apply*operator
  9084. -->
  9085. (I3 ^predict-no N966 + :O )
  9086. Firing apply*operator*complete
  9087. -->
  9088. (I3 ^predict-no N965 - :O )
  9089. inner elaboration loop at bottom goal.
  9090. --- Change Working Memory (PE) ---
  9091. =>WM: (13611: I3 ^predict-no N966)
  9092. <=WM: (13597: N965 ^status complete)
  9093. <=WM: (13596: I3 ^predict-no N965)
  9094. --- Firing Productions (IE) For State At Depth 1 ---
  9095. --- Inner Elaboration Phase, active level 1 (S1) ---
  9096. Firing monitor*world
  9097. -->
  9098. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9099. --- Change Working Memory (IE) ---
  9100. --- END Application Phase ---
  9101. --- Output Phase ---
  9102. ENV: Agent did: predict-no for direction L in state State-A
  9103. In State-A moving L
  9104. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9105. predict error 0
  9106. dir: dir isR
  9107. --- END Output Phase ---
  9108. -/|--- Input Phase ---
  9109. =>WM: (13615: I2 ^dir R)
  9110. =>WM: (13614: I2 ^reward 1)
  9111. =>WM: (13613: I2 ^see 0)
  9112. =>WM: (13612: N966 ^status complete)
  9113. <=WM: (13600: I2 ^dir L)
  9114. <=WM: (13599: I2 ^reward 1)
  9115. <=WM: (13598: I2 ^see 0)
  9116. =>WM: (13616: I2 ^level-1 L0-root)
  9117. <=WM: (13601: I2 ^level-1 L1-root)
  9118. --- END Input Phase ---
  9119. --- Proposal Phase ---
  9120. --- Inner Elaboration Phase, active level 1 (S1) ---
  9121. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9122. -->
  9123. (S1 ^operator O1932 = -0.2817060109291377)
  9124. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9125. -->
  9126. (S1 ^operator O1931 = 0.6623675607605151)
  9127. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9128. -->
  9129. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9130. -->
  9131. Firing elaborate*copy-see-to-output-link
  9132. -->
  9133. (I3 ^see 0 +)
  9134. Firing elaborate*reward*based*on*reward
  9135. -->
  9136. (R970 ^value 1 +)
  9137. (R1 ^reward R970 +)
  9138. Firing propose*predict-yes
  9139. -->
  9140. (O1933 ^name predict-yes +)
  9141. (S1 ^operator O1933 +)
  9142. Firing propose*predict-no
  9143. -->
  9144. (O1934 ^name predict-no +)
  9145. (S1 ^operator O1934 +)
  9146. Firing rl*prefer*rvt*predict-no*H0*4
  9147. -->
  9148. (S1 ^operator O1932 = 0.3397713875215998)
  9149. Firing rl*prefer*rvt*predict-yes*H0*3
  9150. -->
  9151. (S1 ^operator O1931 = 0.3377110018583719)
  9152. Firing prefer*rvt*predict-yes*H0
  9153. -->
  9154. Firing prefer*rvt*predict-no*H0
  9155. -->
  9156. Firing elaborate*copy-dir-to-output-link
  9157. -->
  9158. (I3 ^dir R +)
  9159. inner elaboration loop at bottom goal.
  9160. Retracting elaborate*copy-see-to-output-link
  9161. -->
  9162. (I3 ^see 0 +)
  9163. Retracting propose*predict-no
  9164. -->
  9165. (O1932 ^name predict-no +)
  9166. (S1 ^operator O1932 +)
  9167. Retracting propose*predict-yes
  9168. -->
  9169. (O1931 ^name predict-yes +)
  9170. (S1 ^operator O1931 +)
  9171. Retracting elaborate*reward*based*on*reward
  9172. -->
  9173. (R969 ^value 1 +)
  9174. (R1 ^reward R969 +)
  9175. Retracting elaborate*copy-dir-to-output-link
  9176. -->
  9177. (I3 ^dir L +)
  9178. Retracting rl*prefer*rvt*predict-no*H0*6
  9179. -->
  9180. (S1 ^operator O1932 = 0.9996975476948911)
  9181. Retracting rl*prefer*rvt*predict-yes*H0*5
  9182. -->
  9183. (S1 ^operator O1931 = 0.2640444846619989)
  9184. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  9185. -->
  9186. (S1 ^operator O1931 = -0.181727099742844)
  9187. =>WM: (13623: S1 ^operator O1934 +)
  9188. =>WM: (13622: S1 ^operator O1933 +)
  9189. =>WM: (13621: I3 ^dir R)
  9190. =>WM: (13620: O1934 ^name predict-no)
  9191. =>WM: (13619: O1933 ^name predict-yes)
  9192. =>WM: (13618: R970 ^value 1)
  9193. =>WM: (13617: R1 ^reward R970)
  9194. <=WM: (13608: S1 ^operator O1931 +)
  9195. <=WM: (13609: S1 ^operator O1932 +)
  9196. <=WM: (13610: S1 ^operator O1932)
  9197. <=WM: (13607: I3 ^dir L)
  9198. <=WM: (13603: R1 ^reward R969)
  9199. <=WM: (13606: O1932 ^name predict-no)
  9200. <=WM: (13605: O1931 ^name predict-yes)
  9201. <=WM: (13604: R969 ^value 1)
  9202. --- Inner Elaboration Phase, active level 1 (S1) ---
  9203. Firing prefer*rvt*predict-yes*H0
  9204. -->
  9205. Firing rl*prefer*rvt*predict-yes*H0*3
  9206. -->
  9207. (S1 ^operator O1933 = 0.3377110018583719)
  9208. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9209. -->
  9210. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9211. -->
  9212. (S1 ^operator O1933 = 0.6623675607605151)
  9213. Firing prefer*rvt*predict-no*H0
  9214. -->
  9215. Firing rl*prefer*rvt*predict-no*H0*4
  9216. -->
  9217. (S1 ^operator O1934 = 0.3397713875215998)
  9218. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9219. -->
  9220. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9221. -->
  9222. (S1 ^operator O1934 = -0.2817060109291377)
  9223. inner elaboration loop at bottom goal.
  9224. Retracting rl*prefer*rvt*predict-no*H0*4
  9225. -->
  9226. (S1 ^operator O1932 = 0.3397713875215998)
  9227. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9228. -->
  9229. (S1 ^operator O1932 = -0.2817060109291377)
  9230. Retracting rl*prefer*rvt*predict-yes*H0*3
  9231. -->
  9232. (S1 ^operator O1931 = 0.3377110018583719)
  9233. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9234. -->
  9235. (S1 ^operator O1931 = 0.6623675607605151)
  9236. --- END Proposal Phase ---
  9237. --- Decision Phase ---
  9238. RL update rl*prefer*rvt*predict-no*H0*6 0.999698 0 0.999698 -> 0.999748 0 0.999748(R,m,v=1,0.90411,0.0872933)
  9239. =>WM: (13624: S1 ^operator O1933)
  9240. 967: O: O1933 (predict-yes)
  9241. --- END Decision Phase ---
  9242. --- Application Phase ---
  9243. --- Firing Productions (PE) For State At Depth 1 ---
  9244. --- Inner Elaboration Phase, active level 1 (S1) ---
  9245. Firing apply*operator
  9246. -->
  9247. (I3 ^predict-yes N967 + :O )
  9248. Firing apply*operator*complete
  9249. -->
  9250. (I3 ^predict-no N966 - :O )
  9251. inner elaboration loop at bottom goal.
  9252. --- Change Working Memory (PE) ---
  9253. =>WM: (13625: I3 ^predict-yes N967)
  9254. <=WM: (13612: N966 ^status complete)
  9255. <=WM: (13611: I3 ^predict-no N966)
  9256. --- Firing Productions (IE) For State At Depth 1 ---
  9257. --- Inner Elaboration Phase, active level 1 (S1) ---
  9258. Firing monitor*world
  9259. -->
  9260. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9261. --- Change Working Memory (IE) ---
  9262. --- END Application Phase ---
  9263. --- Output Phase ---
  9264. ENV: Agent did: predict-yes for direction R in state State-A
  9265. In State-A moving R
  9266. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9267. predict error 0
  9268. dir: dir isR
  9269. --- END Output Phase ---
  9270. \-/--- Input Phase ---
  9271. =>WM: (13629: I2 ^dir R)
  9272. =>WM: (13628: I2 ^reward 1)
  9273. =>WM: (13627: I2 ^see 1)
  9274. =>WM: (13626: N967 ^status complete)
  9275. <=WM: (13615: I2 ^dir R)
  9276. <=WM: (13614: I2 ^reward 1)
  9277. <=WM: (13613: I2 ^see 0)
  9278. =>WM: (13630: I2 ^level-1 R1-root)
  9279. <=WM: (13616: I2 ^level-1 L0-root)
  9280. --- END Input Phase ---
  9281. --- Proposal Phase ---
  9282. --- Inner Elaboration Phase, active level 1 (S1) ---
  9283. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9284. -->
  9285. (S1 ^operator O1933 = -0.1070236389116304)
  9286. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9287. -->
  9288. (S1 ^operator O1934 = 0.6602488383529777)
  9289. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9290. -->
  9291. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9292. -->
  9293. Firing elaborate*copy-see-to-output-link
  9294. -->
  9295. (I3 ^see 1 +)
  9296. Firing elaborate*reward*based*on*reward
  9297. -->
  9298. (R971 ^value 1 +)
  9299. (R1 ^reward R971 +)
  9300. Firing propose*predict-yes
  9301. -->
  9302. (O1935 ^name predict-yes +)
  9303. (S1 ^operator O1935 +)
  9304. Firing propose*predict-no
  9305. -->
  9306. (O1936 ^name predict-no +)
  9307. (S1 ^operator O1936 +)
  9308. Firing rl*prefer*rvt*predict-no*H0*4
  9309. -->
  9310. (S1 ^operator O1934 = 0.3397713875215998)
  9311. Firing rl*prefer*rvt*predict-yes*H0*3
  9312. -->
  9313. (S1 ^operator O1933 = 0.3377110018583719)
  9314. Firing prefer*rvt*predict-yes*H0
  9315. -->
  9316. Firing prefer*rvt*predict-no*H0
  9317. -->
  9318. Firing elaborate*copy-dir-to-output-link
  9319. -->
  9320. (I3 ^dir R +)
  9321. inner elaboration loop at bottom goal.
  9322. Retracting elaborate*copy-see-to-output-link
  9323. -->
  9324. (I3 ^see 0 +)
  9325. Retracting propose*predict-no
  9326. -->
  9327. (O1934 ^name predict-no +)
  9328. (S1 ^operator O1934 +)
  9329. Retracting propose*predict-yes
  9330. -->
  9331. (O1933 ^name predict-yes +)
  9332. (S1 ^operator O1933 +)
  9333. Retracting elaborate*reward*based*on*reward
  9334. -->
  9335. (R970 ^value 1 +)
  9336. (R1 ^reward R970 +)
  9337. Retracting elaborate*copy-dir-to-output-link
  9338. -->
  9339. (I3 ^dir R +)
  9340. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  9341. -->
  9342. (S1 ^operator O1934 = -0.2817060109291377)
  9343. Retracting rl*prefer*rvt*predict-no*H0*4
  9344. -->
  9345. (S1 ^operator O1934 = 0.3397713875215998)
  9346. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  9347. -->
  9348. (S1 ^operator O1933 = 0.6623675607605151)
  9349. Retracting rl*prefer*rvt*predict-yes*H0*3
  9350. -->
  9351. (S1 ^operator O1933 = 0.3377110018583719)
  9352. =>WM: (13637: S1 ^operator O1936 +)
  9353. =>WM: (13636: S1 ^operator O1935 +)
  9354. =>WM: (13635: O1936 ^name predict-no)
  9355. =>WM: (13634: O1935 ^name predict-yes)
  9356. =>WM: (13633: R971 ^value 1)
  9357. =>WM: (13632: R1 ^reward R971)
  9358. =>WM: (13631: I3 ^see 1)
  9359. <=WM: (13622: S1 ^operator O1933 +)
  9360. <=WM: (13624: S1 ^operator O1933)
  9361. <=WM: (13623: S1 ^operator O1934 +)
  9362. <=WM: (13617: R1 ^reward R970)
  9363. <=WM: (13602: I3 ^see 0)
  9364. <=WM: (13620: O1934 ^name predict-no)
  9365. <=WM: (13619: O1933 ^name predict-yes)
  9366. <=WM: (13618: R970 ^value 1)
  9367. --- Inner Elaboration Phase, active level 1 (S1) ---
  9368. Firing prefer*rvt*predict-yes*H0
  9369. -->
  9370. Firing rl*prefer*rvt*predict-yes*H0*3
  9371. -->
  9372. (S1 ^operator O1935 = 0.3377110018583719)
  9373. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9374. -->
  9375. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9376. -->
  9377. (S1 ^operator O1935 = -0.1070236389116304)
  9378. Firing prefer*rvt*predict-no*H0
  9379. -->
  9380. Firing rl*prefer*rvt*predict-no*H0*4
  9381. -->
  9382. (S1 ^operator O1936 = 0.3397713875215998)
  9383. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9384. -->
  9385. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9386. -->
  9387. (S1 ^operator O1936 = 0.6602488383529777)
  9388. inner elaboration loop at bottom goal.
  9389. Retracting rl*prefer*rvt*predict-no*H0*4
  9390. -->
  9391. (S1 ^operator O1934 = 0.3397713875215998)
  9392. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9393. -->
  9394. (S1 ^operator O1934 = 0.6602488383529777)
  9395. Retracting rl*prefer*rvt*predict-yes*H0*3
  9396. -->
  9397. (S1 ^operator O1933 = 0.3377110018583719)
  9398. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9399. -->
  9400. (S1 ^operator O1933 = -0.1070236389116304)
  9401. --- END Proposal Phase ---
  9402. --- Decision Phase ---
  9403. RL update rl*prefer*rvt*predict-yes*H0*3 0.590111 -0.2524 0.337711 -> 0.590104 -0.252399 0.337705(R,m,v=1,0.895706,0.0939938)
  9404. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409979 0.252388 0.662368 -> 0.409971 0.252389 0.66236(R,m,v=1,1,0)
  9405. =>WM: (13638: S1 ^operator O1936)
  9406. 968: O: O1936 (predict-no)
  9407. --- END Decision Phase ---
  9408. --- Application Phase ---
  9409. --- Firing Productions (PE) For State At Depth 1 ---
  9410. --- Inner Elaboration Phase, active level 1 (S1) ---
  9411. Firing apply*operator
  9412. -->
  9413. (I3 ^predict-no N968 + :O )
  9414. Firing apply*operator*complete
  9415. -->
  9416. (I3 ^predict-yes N967 - :O )
  9417. inner elaboration loop at bottom goal.
  9418. --- Change Working Memory (PE) ---
  9419. =>WM: (13639: I3 ^predict-no N968)
  9420. <=WM: (13626: N967 ^status complete)
  9421. <=WM: (13625: I3 ^predict-yes N967)
  9422. --- Firing Productions (IE) For State At Depth 1 ---
  9423. --- Inner Elaboration Phase, active level 1 (S1) ---
  9424. Firing monitor*world
  9425. -->
  9426. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9427. --- Change Working Memory (IE) ---
  9428. --- END Application Phase ---
  9429. --- Output Phase ---
  9430. ENV: Agent did: predict-no for direction R in state State-B
  9431. In State-B moving R
  9432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9433. predict error 0
  9434. dir: dir isU
  9435. --- END Output Phase ---
  9436. |\--- Input Phase ---
  9437. =>WM: (13643: I2 ^dir U)
  9438. =>WM: (13642: I2 ^reward 1)
  9439. =>WM: (13641: I2 ^see 0)
  9440. =>WM: (13640: N968 ^status complete)
  9441. <=WM: (13629: I2 ^dir R)
  9442. <=WM: (13628: I2 ^reward 1)
  9443. <=WM: (13627: I2 ^see 1)
  9444. =>WM: (13644: I2 ^level-1 R0-root)
  9445. <=WM: (13630: I2 ^level-1 R1-root)
  9446. --- END Input Phase ---
  9447. --- Proposal Phase ---
  9448. --- Inner Elaboration Phase, active level 1 (S1) ---
  9449. Firing elaborate*copy-see-to-output-link
  9450. -->
  9451. (I3 ^see 0 +)
  9452. Firing elaborate*reward*based*on*reward
  9453. -->
  9454. (R972 ^value 1 +)
  9455. (R1 ^reward R972 +)
  9456. Firing propose*predict-yes
  9457. -->
  9458. (O1937 ^name predict-yes +)
  9459. (S1 ^operator O1937 +)
  9460. Firing propose*predict-no
  9461. -->
  9462. (O1938 ^name predict-no +)
  9463. (S1 ^operator O1938 +)
  9464. Firing rl*prefer*rvt*predict-no*H0*2
  9465. -->
  9466. (S1 ^operator O1936 = 1.)
  9467. Firing rl*prefer*rvt*predict-yes*H0*1
  9468. -->
  9469. (S1 ^operator O1935 = 0.)
  9470. Firing prefer*rvt*predict-yes*H0
  9471. -->
  9472. Firing prefer*rvt*predict-no*H0
  9473. -->
  9474. Firing elaborate*copy-dir-to-output-link
  9475. -->
  9476. (I3 ^dir U +)
  9477. inner elaboration loop at bottom goal.
  9478. Retracting elaborate*copy-see-to-output-link
  9479. -->
  9480. (I3 ^see 1 +)
  9481. Retracting propose*predict-no
  9482. -->
  9483. (O1936 ^name predict-no +)
  9484. (S1 ^operator O1936 +)
  9485. Retracting propose*predict-yes
  9486. -->
  9487. (O1935 ^name predict-yes +)
  9488. (S1 ^operator O1935 +)
  9489. Retracting elaborate*reward*based*on*reward
  9490. -->
  9491. (R971 ^value 1 +)
  9492. (R1 ^reward R971 +)
  9493. Retracting elaborate*copy-dir-to-output-link
  9494. -->
  9495. (I3 ^dir R +)
  9496. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  9497. -->
  9498. (S1 ^operator O1936 = 0.6602488383529777)
  9499. Retracting rl*prefer*rvt*predict-no*H0*4
  9500. -->
  9501. (S1 ^operator O1936 = 0.3397713875215998)
  9502. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  9503. -->
  9504. (S1 ^operator O1935 = -0.1070236389116304)
  9505. Retracting rl*prefer*rvt*predict-yes*H0*3
  9506. -->
  9507. (S1 ^operator O1935 = 0.3377045556949833)
  9508. =>WM: (13652: S1 ^operator O1938 +)
  9509. =>WM: (13651: S1 ^operator O1937 +)
  9510. =>WM: (13650: I3 ^dir U)
  9511. =>WM: (13649: O1938 ^name predict-no)
  9512. =>WM: (13648: O1937 ^name predict-yes)
  9513. =>WM: (13647: R972 ^value 1)
  9514. =>WM: (13646: R1 ^reward R972)
  9515. =>WM: (13645: I3 ^see 0)
  9516. <=WM: (13636: S1 ^operator O1935 +)
  9517. <=WM: (13637: S1 ^operator O1936 +)
  9518. <=WM: (13638: S1 ^operator O1936)
  9519. <=WM: (13621: I3 ^dir R)
  9520. <=WM: (13632: R1 ^reward R971)
  9521. <=WM: (13631: I3 ^see 1)
  9522. <=WM: (13635: O1936 ^name predict-no)
  9523. <=WM: (13634: O1935 ^name predict-yes)
  9524. <=WM: (13633: R971 ^value 1)
  9525. --- Inner Elaboration Phase, active level 1 (S1) ---
  9526. Firing prefer*rvt*predict-yes*H0
  9527. -->
  9528. Firing rl*prefer*rvt*predict-yes*H0*1
  9529. -->
  9530. (S1 ^operator O1937 = 0.)
  9531. Firing prefer*rvt*predict-no*H0
  9532. -->
  9533. Firing rl*prefer*rvt*predict-no*H0*2
  9534. -->
  9535. (S1 ^operator O1938 = 1.)
  9536. inner elaboration loop at bottom goal.
  9537. Retracting rl*prefer*rvt*predict-no*H0*2
  9538. -->
  9539. (S1 ^operator O1936 = 1.)
  9540. Retracting rl*prefer*rvt*predict-yes*H0*1
  9541. -->
  9542. (S1 ^operator O1935 = 0.)
  9543. --- END Proposal Phase ---
  9544. --- Decision Phase ---
  9545. RL update rl*prefer*rvt*predict-no*H0*4 0.570255 -0.230484 0.339771 -> 0.570253 -0.230483 0.33977(R,m,v=1,0.872727,0.111752)
  9546. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429766 0.230483 0.660249 -> 0.429764 0.230483 0.660247(R,m,v=1,1,0)
  9547. =>WM: (13653: S1 ^operator O1938)
  9548. 969: O: O1938 (predict-no)
  9549. --- END Decision Phase ---
  9550. --- Application Phase ---
  9551. --- Firing Productions (PE) For State At Depth 1 ---
  9552. --- Inner Elaboration Phase, active level 1 (S1) ---
  9553. Firing apply*operator
  9554. -->
  9555. (I3 ^predict-no N969 + :O )
  9556. Firing apply*operator*complete
  9557. -->
  9558. (I3 ^predict-no N968 - :O )
  9559. inner elaboration loop at bottom goal.
  9560. --- Change Working Memory (PE) ---
  9561. =>WM: (13654: I3 ^predict-no N969)
  9562. <=WM: (13640: N968 ^status complete)
  9563. <=WM: (13639: I3 ^predict-no N968)
  9564. --- Firing Productions (IE) For State At Depth 1 ---
  9565. --- Inner Elaboration Phase, active level 1 (S1) ---
  9566. Firing monitor*world
  9567. -->
  9568. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9569. --- Change Working Memory (IE) ---
  9570. --- END Application Phase ---
  9571. --- Output Phase ---
  9572. ENV: Agent did: predict-no for direction U in state State-B
  9573. In State-B moving U
  9574. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9575. predict error 0
  9576. dir: dir isU
  9577. --- END Output Phase ---
  9578. -/|--- Input Phase ---
  9579. =>WM: (13658: I2 ^dir U)
  9580. =>WM: (13657: I2 ^reward 1)
  9581. =>WM: (13656: I2 ^see 0)
  9582. =>WM: (13655: N969 ^status complete)
  9583. <=WM: (13643: I2 ^dir U)
  9584. <=WM: (13642: I2 ^reward 1)
  9585. <=WM: (13641: I2 ^see 0)
  9586. =>WM: (13659: I2 ^level-1 R0-root)
  9587. <=WM: (13644: I2 ^level-1 R0-root)
  9588. --- END Input Phase ---
  9589. --- Proposal Phase ---
  9590. --- Inner Elaboration Phase, active level 1 (S1) ---
  9591. Firing elaborate*copy-see-to-output-link
  9592. -->
  9593. (I3 ^see 0 +)
  9594. Firing elaborate*reward*based*on*reward
  9595. -->
  9596. (R973 ^value 1 +)
  9597. (R1 ^reward R973 +)
  9598. Firing propose*predict-yes
  9599. -->
  9600. (O1939 ^name predict-yes +)
  9601. (S1 ^operator O1939 +)
  9602. Firing propose*predict-no
  9603. -->
  9604. (O1940 ^name predict-no +)
  9605. (S1 ^operator O1940 +)
  9606. Firing rl*prefer*rvt*predict-no*H0*2
  9607. -->
  9608. (S1 ^operator O1938 = 1.)
  9609. Firing rl*prefer*rvt*predict-yes*H0*1
  9610. -->
  9611. (S1 ^operator O1937 = 0.)
  9612. Firing prefer*rvt*predict-yes*H0
  9613. -->
  9614. Firing prefer*rvt*predict-no*H0
  9615. -->
  9616. Firing elaborate*copy-dir-to-output-link
  9617. -->
  9618. (I3 ^dir U +)
  9619. inner elaboration loop at bottom goal.
  9620. Retracting elaborate*copy-see-to-output-link
  9621. -->
  9622. (I3 ^see 0 +)
  9623. Retracting propose*predict-no
  9624. -->
  9625. (O1938 ^name predict-no +)
  9626. (S1 ^operator O1938 +)
  9627. Retracting propose*predict-yes
  9628. -->
  9629. (O1937 ^name predict-yes +)
  9630. (S1 ^operator O1937 +)
  9631. Retracting elaborate*reward*based*on*reward
  9632. -->
  9633. (R972 ^value 1 +)
  9634. (R1 ^reward R972 +)
  9635. Retracting elaborate*copy-dir-to-output-link
  9636. -->
  9637. (I3 ^dir U +)
  9638. Retracting rl*prefer*rvt*predict-no*H0*2
  9639. -->
  9640. (S1 ^operator O1938 = 1.)
  9641. Retracting rl*prefer*rvt*predict-yes*H0*1
  9642. -->
  9643. (S1 ^operator O1937 = 0.)
  9644. =>WM: (13665: S1 ^operator O1940 +)
  9645. =>WM: (13664: S1 ^operator O1939 +)
  9646. =>WM: (13663: O1940 ^name predict-no)
  9647. =>WM: (13662: O1939 ^name predict-yes)
  9648. =>WM: (13661: R973 ^value 1)
  9649. =>WM: (13660: R1 ^reward R973)
  9650. <=WM: (13651: S1 ^operator O1937 +)
  9651. <=WM: (13652: S1 ^operator O1938 +)
  9652. <=WM: (13653: S1 ^operator O1938)
  9653. <=WM: (13646: R1 ^reward R972)
  9654. <=WM: (13649: O1938 ^name predict-no)
  9655. <=WM: (13648: O1937 ^name predict-yes)
  9656. <=WM: (13647: R972 ^value 1)
  9657. --- Inner Elaboration Phase, active level 1 (S1) ---
  9658. Firing prefer*rvt*predict-yes*H0
  9659. -->
  9660. Firing rl*prefer*rvt*predict-yes*H0*1
  9661. -->
  9662. (S1 ^operator O1939 = 0.)
  9663. Firing prefer*rvt*predict-no*H0
  9664. -->
  9665. Firing rl*prefer*rvt*predict-no*H0*2
  9666. -->
  9667. (S1 ^operator O1940 = 1.)
  9668. inner elaboration loop at bottom goal.
  9669. Retracting rl*prefer*rvt*predict-no*H0*2
  9670. -->
  9671. (S1 ^operator O1938 = 1.)
  9672. Retracting rl*prefer*rvt*predict-yes*H0*1
  9673. -->
  9674. (S1 ^operator O1937 = 0.)
  9675. --- END Proposal Phase ---
  9676. --- Decision Phase ---
  9677. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9678. =>WM: (13666: S1 ^operator O1940)
  9679. 970: O: O1940 (predict-no)
  9680. --- END Decision Phase ---
  9681. --- Application Phase ---
  9682. --- Firing Productions (PE) For State At Depth 1 ---
  9683. --- Inner Elaboration Phase, active level 1 (S1) ---
  9684. Firing apply*operator
  9685. -->
  9686. (I3 ^predict-no N970 + :O )
  9687. Firing apply*operator*complete
  9688. -->
  9689. (I3 ^predict-no N969 - :O )
  9690. inner elaboration loop at bottom goal.
  9691. --- Change Working Memory (PE) ---
  9692. =>WM: (13667: I3 ^predict-no N970)
  9693. <=WM: (13655: N969 ^status complete)
  9694. <=WM: (13654: I3 ^predict-no N969)
  9695. --- Firing Productions (IE) For State At Depth 1 ---
  9696. --- Inner Elaboration Phase, active level 1 (S1) ---
  9697. Firing monitor*world
  9698. -->
  9699. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9700. --- Change Working Memory (IE) ---
  9701. --- END Application Phase ---
  9702. --- Output Phase ---
  9703. ENV: Agent did: predict-no for direction U in state State-B
  9704. In State-B moving U
  9705. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9706. predict error 0
  9707. dir: dir isL
  9708. --- END Output Phase ---
  9709. \---- Input Phase ---
  9710. =>WM: (13671: I2 ^dir L)
  9711. =>WM: (13670: I2 ^reward 1)
  9712. =>WM: (13669: I2 ^see 0)
  9713. =>WM: (13668: N970 ^status complete)
  9714. <=WM: (13658: I2 ^dir U)
  9715. <=WM: (13657: I2 ^reward 1)
  9716. <=WM: (13656: I2 ^see 0)
  9717. =>WM: (13672: I2 ^level-1 R0-root)
  9718. <=WM: (13659: I2 ^level-1 R0-root)
  9719. --- END Input Phase ---
  9720. --- Proposal Phase ---
  9721. --- Inner Elaboration Phase, active level 1 (S1) ---
  9722. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9723. -->
  9724. (S1 ^operator O1939 = 0.735815301499146)
  9725. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9726. -->
  9727. Firing elaborate*copy-see-to-output-link
  9728. -->
  9729. (I3 ^see 0 +)
  9730. Firing elaborate*reward*based*on*reward
  9731. -->
  9732. (R974 ^value 1 +)
  9733. (R1 ^reward R974 +)
  9734. Firing propose*predict-yes
  9735. -->
  9736. (O1941 ^name predict-yes +)
  9737. (S1 ^operator O1941 +)
  9738. Firing propose*predict-no
  9739. -->
  9740. (O1942 ^name predict-no +)
  9741. (S1 ^operator O1942 +)
  9742. Firing rl*prefer*rvt*predict-no*H0*6
  9743. -->
  9744. (S1 ^operator O1940 = 0.9997480945179411)
  9745. Firing rl*prefer*rvt*predict-yes*H0*5
  9746. -->
  9747. (S1 ^operator O1939 = 0.2640444846619989)
  9748. Firing prefer*rvt*predict-yes*H0
  9749. -->
  9750. Firing prefer*rvt*predict-no*H0
  9751. -->
  9752. Firing elaborate*copy-dir-to-output-link
  9753. -->
  9754. (I3 ^dir L +)
  9755. inner elaboration loop at bottom goal.
  9756. Retracting elaborate*copy-see-to-output-link
  9757. -->
  9758. (I3 ^see 0 +)
  9759. Retracting propose*predict-no
  9760. -->
  9761. (O1940 ^name predict-no +)
  9762. (S1 ^operator O1940 +)
  9763. Retracting propose*predict-yes
  9764. -->
  9765. (O1939 ^name predict-yes +)
  9766. (S1 ^operator O1939 +)
  9767. Retracting elaborate*reward*based*on*reward
  9768. -->
  9769. (R973 ^value 1 +)
  9770. (R1 ^reward R973 +)
  9771. Retracting elaborate*copy-dir-to-output-link
  9772. -->
  9773. (I3 ^dir U +)
  9774. Retracting rl*prefer*rvt*predict-no*H0*2
  9775. -->
  9776. (S1 ^operator O1940 = 1.)
  9777. Retracting rl*prefer*rvt*predict-yes*H0*1
  9778. -->
  9779. (S1 ^operator O1939 = 0.)
  9780. =>WM: (13679: S1 ^operator O1942 +)
  9781. =>WM: (13678: S1 ^operator O1941 +)
  9782. =>WM: (13677: I3 ^dir L)
  9783. =>WM: (13676: O1942 ^name predict-no)
  9784. =>WM: (13675: O1941 ^name predict-yes)
  9785. =>WM: (13674: R974 ^value 1)
  9786. =>WM: (13673: R1 ^reward R974)
  9787. <=WM: (13664: S1 ^operator O1939 +)
  9788. <=WM: (13665: S1 ^operator O1940 +)
  9789. <=WM: (13666: S1 ^operator O1940)
  9790. <=WM: (13650: I3 ^dir U)
  9791. <=WM: (13660: R1 ^reward R973)
  9792. <=WM: (13663: O1940 ^name predict-no)
  9793. <=WM: (13662: O1939 ^name predict-yes)
  9794. <=WM: (13661: R973 ^value 1)
  9795. --- Inner Elaboration Phase, active level 1 (S1) ---
  9796. Firing prefer*rvt*predict-yes*H0
  9797. -->
  9798. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9799. -->
  9800. (S1 ^operator O1941 = 0.735815301499146)
  9801. Firing rl*prefer*rvt*predict-yes*H0*5
  9802. -->
  9803. (S1 ^operator O1941 = 0.2640444846619989)
  9804. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  9805. -->
  9806. Firing prefer*rvt*predict-no*H0
  9807. -->
  9808. Firing rl*prefer*rvt*predict-no*H0*6
  9809. -->
  9810. (S1 ^operator O1942 = 0.9997480945179411)
  9811. inner elaboration loop at bottom goal.
  9812. Retracting rl*prefer*rvt*predict-no*H0*6
  9813. -->
  9814. (S1 ^operator O1940 = 0.9997480945179411)
  9815. Retracting rl*prefer*rvt*predict-yes*H0*5
  9816. -->
  9817. (S1 ^operator O1939 = 0.2640444846619989)
  9818. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9819. -->
  9820. (S1 ^operator O1939 = 0.735815301499146)
  9821. --- END Proposal Phase ---
  9822. --- Decision Phase ---
  9823. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9824. =>WM: (13680: S1 ^operator O1941)
  9825. 971: O: O1941 (predict-yes)
  9826. --- END Decision Phase ---
  9827. --- Application Phase ---
  9828. --- Firing Productions (PE) For State At Depth 1 ---
  9829. --- Inner Elaboration Phase, active level 1 (S1) ---
  9830. Firing apply*operator
  9831. -->
  9832. (I3 ^predict-yes N971 + :O )
  9833. Firing apply*operator*complete
  9834. -->
  9835. (I3 ^predict-no N970 - :O )
  9836. inner elaboration loop at bottom goal.
  9837. --- Change Working Memory (PE) ---
  9838. =>WM: (13681: I3 ^predict-yes N971)
  9839. <=WM: (13668: N970 ^status complete)
  9840. <=WM: (13667: I3 ^predict-no N970)
  9841. --- Firing Productions (IE) For State At Depth 1 ---
  9842. --- Inner Elaboration Phase, active level 1 (S1) ---
  9843. Firing monitor*world
  9844. -->
  9845. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9846. --- Change Working Memory (IE) ---
  9847. --- END Application Phase ---
  9848. --- Output Phase ---
  9849. ENV: Agent did: predict-yes for direction L in state State-B
  9850. In State-B moving L
  9851. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9852. predict error 0
  9853. dir: dir isR
  9854. --- END Output Phase ---
  9855. /--- Input Phase ---
  9856. =>WM: (13685: I2 ^dir R)
  9857. =>WM: (13684: I2 ^reward 1)
  9858. =>WM: (13683: I2 ^see 1)
  9859. =>WM: (13682: N971 ^status complete)
  9860. <=WM: (13671: I2 ^dir L)
  9861. <=WM: (13670: I2 ^reward 1)
  9862. <=WM: (13669: I2 ^see 0)
  9863. =>WM: (13686: I2 ^level-1 L1-root)
  9864. <=WM: (13672: I2 ^level-1 R0-root)
  9865. --- END Input Phase ---
  9866. --- Proposal Phase ---
  9867. --- Inner Elaboration Phase, active level 1 (S1) ---
  9868. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  9869. -->
  9870. (S1 ^operator O1942 = -0.2714224023553999)
  9871. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  9872. -->
  9873. (S1 ^operator O1941 = 0.6622033637991441)
  9874. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9875. -->
  9876. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9877. -->
  9878. Firing elaborate*copy-see-to-output-link
  9879. -->
  9880. (I3 ^see 1 +)
  9881. Firing elaborate*reward*based*on*reward
  9882. -->
  9883. (R975 ^value 1 +)
  9884. (R1 ^reward R975 +)
  9885. Firing propose*predict-yes
  9886. -->
  9887. (O1943 ^name predict-yes +)
  9888. (S1 ^operator O1943 +)
  9889. Firing propose*predict-no
  9890. -->
  9891. (O1944 ^name predict-no +)
  9892. (S1 ^operator O1944 +)
  9893. Firing rl*prefer*rvt*predict-no*H0*4
  9894. -->
  9895. (S1 ^operator O1942 = 0.339769731277316)
  9896. Firing rl*prefer*rvt*predict-yes*H0*3
  9897. -->
  9898. (S1 ^operator O1941 = 0.3377045556949833)
  9899. Firing prefer*rvt*predict-yes*H0
  9900. -->
  9901. Firing prefer*rvt*predict-no*H0
  9902. -->
  9903. Firing elaborate*copy-dir-to-output-link
  9904. -->
  9905. (I3 ^dir R +)
  9906. inner elaboration loop at bottom goal.
  9907. Retracting elaborate*copy-see-to-output-link
  9908. -->
  9909. (I3 ^see 0 +)
  9910. Retracting propose*predict-no
  9911. -->
  9912. (O1942 ^name predict-no +)
  9913. (S1 ^operator O1942 +)
  9914. Retracting propose*predict-yes
  9915. -->
  9916. (O1941 ^name predict-yes +)
  9917. (S1 ^operator O1941 +)
  9918. Retracting elaborate*reward*based*on*reward
  9919. -->
  9920. (R974 ^value 1 +)
  9921. (R1 ^reward R974 +)
  9922. Retracting elaborate*copy-dir-to-output-link
  9923. -->
  9924. (I3 ^dir L +)
  9925. Retracting rl*prefer*rvt*predict-no*H0*6
  9926. -->
  9927. (S1 ^operator O1942 = 0.9997480945179411)
  9928. Retracting rl*prefer*rvt*predict-yes*H0*5
  9929. -->
  9930. (S1 ^operator O1941 = 0.2640444846619989)
  9931. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  9932. -->
  9933. (S1 ^operator O1941 = 0.735815301499146)
  9934. =>WM: (13694: S1 ^operator O1944 +)
  9935. =>WM: (13693: S1 ^operator O1943 +)
  9936. =>WM: (13692: I3 ^dir R)
  9937. =>WM: (13691: O1944 ^name predict-no)
  9938. =>WM: (13690: O1943 ^name predict-yes)
  9939. =>WM: (13689: R975 ^value 1)
  9940. =>WM: (13688: R1 ^reward R975)
  9941. =>WM: (13687: I3 ^see 1)
  9942. <=WM: (13678: S1 ^operator O1941 +)
  9943. <=WM: (13680: S1 ^operator O1941)
  9944. <=WM: (13679: S1 ^operator O1942 +)
  9945. <=WM: (13677: I3 ^dir L)
  9946. <=WM: (13673: R1 ^reward R974)
  9947. <=WM: (13645: I3 ^see 0)
  9948. <=WM: (13676: O1942 ^name predict-no)
  9949. <=WM: (13675: O1941 ^name predict-yes)
  9950. <=WM: (13674: R974 ^value 1)
  9951. --- Inner Elaboration Phase, active level 1 (S1) ---
  9952. Firing prefer*rvt*predict-yes*H0
  9953. -->
  9954. Firing rl*prefer*rvt*predict-yes*H0*3
  9955. -->
  9956. (S1 ^operator O1943 = 0.3377045556949833)
  9957. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  9958. -->
  9959. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  9960. -->
  9961. (S1 ^operator O1943 = 0.6622033637991441)
  9962. Firing prefer*rvt*predict-no*H0
  9963. -->
  9964. Firing rl*prefer*rvt*predict-no*H0*4
  9965. -->
  9966. (S1 ^operator O1944 = 0.339769731277316)
  9967. Firing prefer*rvt*predict-no*H0*4*v1*H1
  9968. -->
  9969. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  9970. -->
  9971. (S1 ^operator O1944 = -0.2714224023553999)
  9972. inner elaboration loop at bottom goal.
  9973. Retracting rl*prefer*rvt*predict-no*H0*4
  9974. -->
  9975. (S1 ^operator O1942 = 0.339769731277316)
  9976. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  9977. -->
  9978. (S1 ^operator O1942 = -0.2714224023553999)
  9979. Retracting rl*prefer*rvt*predict-yes*H0*3
  9980. -->
  9981. (S1 ^operator O1941 = 0.3377045556949833)
  9982. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  9983. -->
  9984. (S1 ^operator O1941 = 0.6622033637991441)
  9985. --- END Proposal Phase ---
  9986. --- Decision Phase ---
  9987. RL update rl*prefer*rvt*predict-yes*H0*5 0.55443 -0.290385 0.264044 -> 0.554441 -0.290385 0.264056(R,m,v=1,0.874286,0.110542)
  9988. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445432 0.290383 0.735815 -> 0.445446 0.290383 0.735829(R,m,v=1,1,0)
  9989. =>WM: (13695: S1 ^operator O1943)
  9990. 972: O: O1943 (predict-yes)
  9991. --- END Decision Phase ---
  9992. --- Application Phase ---
  9993. --- Firing Productions (PE) For State At Depth 1 ---
  9994. --- Inner Elaboration Phase, active level 1 (S1) ---
  9995. Firing apply*operator
  9996. -->
  9997. (I3 ^predict-yes N972 + :O )
  9998. Firing apply*operator*complete
  9999. -->
  10000. (I3 ^predict-yes N971 - :O )
  10001. inner elaboration loop at bottom goal.
  10002. --- Change Working Memory (PE) ---
  10003. =>WM: (13696: I3 ^predict-yes N972)
  10004. <=WM: (13682: N971 ^status complete)
  10005. <=WM: (13681: I3 ^predict-yes N971)
  10006. --- Firing Productions (IE) For State At Depth 1 ---
  10007. --- Inner Elaboration Phase, active level 1 (S1) ---
  10008. Firing monitor*world
  10009. -->
  10010. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10011. --- Change Working Memory (IE) ---
  10012. --- END Application Phase ---
  10013. --- Output Phase ---
  10014. ENV: Agent did: predict-yes for direction R in state State-A
  10015. In State-A moving R
  10016. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10017. predict error 0
  10018. dir: dir isL
  10019. --- END Output Phase ---
  10020. |\---- Input Phase ---
  10021. =>WM: (13700: I2 ^dir L)
  10022. =>WM: (13699: I2 ^reward 1)
  10023. =>WM: (13698: I2 ^see 1)
  10024. =>WM: (13697: N972 ^status complete)
  10025. <=WM: (13685: I2 ^dir R)
  10026. <=WM: (13684: I2 ^reward 1)
  10027. <=WM: (13683: I2 ^see 1)
  10028. =>WM: (13701: I2 ^level-1 R1-root)
  10029. <=WM: (13686: I2 ^level-1 L1-root)
  10030. --- END Input Phase ---
  10031. --- Proposal Phase ---
  10032. --- Inner Elaboration Phase, active level 1 (S1) ---
  10033. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10034. -->
  10035. (S1 ^operator O1943 = 0.7362862485154646)
  10036. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10037. -->
  10038. Firing elaborate*copy-see-to-output-link
  10039. -->
  10040. (I3 ^see 1 +)
  10041. Firing elaborate*reward*based*on*reward
  10042. -->
  10043. (R976 ^value 1 +)
  10044. (R1 ^reward R976 +)
  10045. Firing propose*predict-yes
  10046. -->
  10047. (O1945 ^name predict-yes +)
  10048. (S1 ^operator O1945 +)
  10049. Firing propose*predict-no
  10050. -->
  10051. (O1946 ^name predict-no +)
  10052. (S1 ^operator O1946 +)
  10053. Firing rl*prefer*rvt*predict-no*H0*6
  10054. -->
  10055. (S1 ^operator O1944 = 0.9997480945179411)
  10056. Firing rl*prefer*rvt*predict-yes*H0*5
  10057. -->
  10058. (S1 ^operator O1943 = 0.2640558568198847)
  10059. Firing prefer*rvt*predict-yes*H0
  10060. -->
  10061. Firing prefer*rvt*predict-no*H0
  10062. -->
  10063. Firing elaborate*copy-dir-to-output-link
  10064. -->
  10065. (I3 ^dir L +)
  10066. inner elaboration loop at bottom goal.
  10067. Retracting elaborate*copy-see-to-output-link
  10068. -->
  10069. (I3 ^see 1 +)
  10070. Retracting propose*predict-no
  10071. -->
  10072. (O1944 ^name predict-no +)
  10073. (S1 ^operator O1944 +)
  10074. Retracting propose*predict-yes
  10075. -->
  10076. (O1943 ^name predict-yes +)
  10077. (S1 ^operator O1943 +)
  10078. Retracting elaborate*reward*based*on*reward
  10079. -->
  10080. (R975 ^value 1 +)
  10081. (R1 ^reward R975 +)
  10082. Retracting elaborate*copy-dir-to-output-link
  10083. -->
  10084. (I3 ^dir R +)
  10085. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10086. -->
  10087. (S1 ^operator O1944 = -0.2714224023553999)
  10088. Retracting rl*prefer*rvt*predict-no*H0*4
  10089. -->
  10090. (S1 ^operator O1944 = 0.339769731277316)
  10091. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10092. -->
  10093. (S1 ^operator O1943 = 0.6622033637991441)
  10094. Retracting rl*prefer*rvt*predict-yes*H0*3
  10095. -->
  10096. (S1 ^operator O1943 = 0.3377045556949833)
  10097. =>WM: (13708: S1 ^operator O1946 +)
  10098. =>WM: (13707: S1 ^operator O1945 +)
  10099. =>WM: (13706: I3 ^dir L)
  10100. =>WM: (13705: O1946 ^name predict-no)
  10101. =>WM: (13704: O1945 ^name predict-yes)
  10102. =>WM: (13703: R976 ^value 1)
  10103. =>WM: (13702: R1 ^reward R976)
  10104. <=WM: (13693: S1 ^operator O1943 +)
  10105. <=WM: (13695: S1 ^operator O1943)
  10106. <=WM: (13694: S1 ^operator O1944 +)
  10107. <=WM: (13692: I3 ^dir R)
  10108. <=WM: (13688: R1 ^reward R975)
  10109. <=WM: (13691: O1944 ^name predict-no)
  10110. <=WM: (13690: O1943 ^name predict-yes)
  10111. <=WM: (13689: R975 ^value 1)
  10112. --- Inner Elaboration Phase, active level 1 (S1) ---
  10113. Firing prefer*rvt*predict-yes*H0
  10114. -->
  10115. Firing rl*prefer*rvt*predict-yes*H0*5
  10116. -->
  10117. (S1 ^operator O1945 = 0.2640558568198847)
  10118. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  10119. -->
  10120. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10121. -->
  10122. (S1 ^operator O1945 = 0.7362862485154646)
  10123. Firing prefer*rvt*predict-no*H0
  10124. -->
  10125. Firing rl*prefer*rvt*predict-no*H0*6
  10126. -->
  10127. (S1 ^operator O1946 = 0.9997480945179411)
  10128. inner elaboration loop at bottom goal.
  10129. Retracting rl*prefer*rvt*predict-no*H0*6
  10130. -->
  10131. (S1 ^operator O1944 = 0.9997480945179411)
  10132. Retracting rl*prefer*rvt*predict-yes*H0*5
  10133. -->
  10134. (S1 ^operator O1943 = 0.2640558568198847)
  10135. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10136. -->
  10137. (S1 ^operator O1943 = 0.7362862485154646)
  10138. --- END Proposal Phase ---
  10139. --- Decision Phase ---
  10140. RL update rl*prefer*rvt*predict-yes*H0*3 0.590104 -0.252399 0.337705 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.896341,0.0934835)
  10141. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.40979 0.252413 0.662203 -> 0.4098 0.252412 0.662212(R,m,v=1,1,0)
  10142. =>WM: (13709: S1 ^operator O1945)
  10143. 973: O: O1945 (predict-yes)
  10144. --- END Decision Phase ---
  10145. --- Application Phase ---
  10146. --- Firing Productions (PE) For State At Depth 1 ---
  10147. --- Inner Elaboration Phase, active level 1 (S1) ---
  10148. Firing apply*operator
  10149. -->
  10150. (I3 ^predict-yes N973 + :O )
  10151. Firing apply*operator*complete
  10152. -->
  10153. (I3 ^predict-yes N972 - :O )
  10154. inner elaboration loop at bottom goal.
  10155. --- Change Working Memory (PE) ---
  10156. =>WM: (13710: I3 ^predict-yes N973)
  10157. <=WM: (13697: N972 ^status complete)
  10158. <=WM: (13696: I3 ^predict-yes N972)
  10159. --- Firing Productions (IE) For State At Depth 1 ---
  10160. --- Inner Elaboration Phase, active level 1 (S1) ---
  10161. Firing monitor*world
  10162. -->
  10163. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10164. --- Change Working Memory (IE) ---
  10165. --- END Application Phase ---
  10166. --- Output Phase ---
  10167. ENV: Agent did: predict-yes for direction L in state State-B
  10168. In State-B moving L
  10169. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10170. predict error 0
  10171. dir: dir isU
  10172. --- END Output Phase ---
  10173. /|\--- Input Phase ---
  10174. =>WM: (13714: I2 ^dir U)
  10175. =>WM: (13713: I2 ^reward 1)
  10176. =>WM: (13712: I2 ^see 1)
  10177. =>WM: (13711: N973 ^status complete)
  10178. <=WM: (13700: I2 ^dir L)
  10179. <=WM: (13699: I2 ^reward 1)
  10180. <=WM: (13698: I2 ^see 1)
  10181. =>WM: (13715: I2 ^level-1 L1-root)
  10182. <=WM: (13701: I2 ^level-1 R1-root)
  10183. --- END Input Phase ---
  10184. --- Proposal Phase ---
  10185. --- Inner Elaboration Phase, active level 1 (S1) ---
  10186. Firing elaborate*copy-see-to-output-link
  10187. -->
  10188. (I3 ^see 1 +)
  10189. Firing elaborate*reward*based*on*reward
  10190. -->
  10191. (R977 ^value 1 +)
  10192. (R1 ^reward R977 +)
  10193. Firing propose*predict-yes
  10194. -->
  10195. (O1947 ^name predict-yes +)
  10196. (S1 ^operator O1947 +)
  10197. Firing propose*predict-no
  10198. -->
  10199. (O1948 ^name predict-no +)
  10200. (S1 ^operator O1948 +)
  10201. Firing rl*prefer*rvt*predict-no*H0*2
  10202. -->
  10203. (S1 ^operator O1946 = 1.)
  10204. Firing rl*prefer*rvt*predict-yes*H0*1
  10205. -->
  10206. (S1 ^operator O1945 = 0.)
  10207. Firing prefer*rvt*predict-yes*H0
  10208. -->
  10209. Firing prefer*rvt*predict-no*H0
  10210. -->
  10211. Firing elaborate*copy-dir-to-output-link
  10212. -->
  10213. (I3 ^dir U +)
  10214. inner elaboration loop at bottom goal.
  10215. Retracting elaborate*copy-see-to-output-link
  10216. -->
  10217. (I3 ^see 1 +)
  10218. Retracting propose*predict-no
  10219. -->
  10220. (O1946 ^name predict-no +)
  10221. (S1 ^operator O1946 +)
  10222. Retracting propose*predict-yes
  10223. -->
  10224. (O1945 ^name predict-yes +)
  10225. (S1 ^operator O1945 +)
  10226. Retracting elaborate*reward*based*on*reward
  10227. -->
  10228. (R976 ^value 1 +)
  10229. (R1 ^reward R976 +)
  10230. Retracting elaborate*copy-dir-to-output-link
  10231. -->
  10232. (I3 ^dir L +)
  10233. Retracting rl*prefer*rvt*predict-no*H0*6
  10234. -->
  10235. (S1 ^operator O1946 = 0.9997480945179411)
  10236. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  10237. -->
  10238. (S1 ^operator O1945 = 0.7362862485154646)
  10239. Retracting rl*prefer*rvt*predict-yes*H0*5
  10240. -->
  10241. (S1 ^operator O1945 = 0.2640558568198847)
  10242. =>WM: (13722: S1 ^operator O1948 +)
  10243. =>WM: (13721: S1 ^operator O1947 +)
  10244. =>WM: (13720: I3 ^dir U)
  10245. =>WM: (13719: O1948 ^name predict-no)
  10246. =>WM: (13718: O1947 ^name predict-yes)
  10247. =>WM: (13717: R977 ^value 1)
  10248. =>WM: (13716: R1 ^reward R977)
  10249. <=WM: (13707: S1 ^operator O1945 +)
  10250. <=WM: (13709: S1 ^operator O1945)
  10251. <=WM: (13708: S1 ^operator O1946 +)
  10252. <=WM: (13706: I3 ^dir L)
  10253. <=WM: (13702: R1 ^reward R976)
  10254. <=WM: (13705: O1946 ^name predict-no)
  10255. <=WM: (13704: O1945 ^name predict-yes)
  10256. <=WM: (13703: R976 ^value 1)
  10257. --- Inner Elaboration Phase, active level 1 (S1) ---
  10258. Firing prefer*rvt*predict-yes*H0
  10259. -->
  10260. Firing rl*prefer*rvt*predict-yes*H0*1
  10261. -->
  10262. (S1 ^operator O1947 = 0.)
  10263. Firing prefer*rvt*predict-no*H0
  10264. -->
  10265. Firing rl*prefer*rvt*predict-no*H0*2
  10266. -->
  10267. (S1 ^operator O1948 = 1.)
  10268. inner elaboration loop at bottom goal.
  10269. Retracting rl*prefer*rvt*predict-no*H0*2
  10270. -->
  10271. (S1 ^operator O1946 = 1.)
  10272. Retracting rl*prefer*rvt*predict-yes*H0*1
  10273. -->
  10274. (S1 ^operator O1945 = 0.)
  10275. --- END Proposal Phase ---
  10276. --- Decision Phase ---
  10277. RL update rl*prefer*rvt*predict-yes*H0*5 0.554441 -0.290385 0.264056 -> 0.554414 -0.290386 0.264028(R,m,v=1,0.875,0.11)
  10278. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445895 0.290391 0.736286 -> 0.445864 0.29039 0.736254(R,m,v=1,1,0)
  10279. =>WM: (13723: S1 ^operator O1948)
  10280. 974: O: O1948 (predict-no)
  10281. --- END Decision Phase ---
  10282. --- Application Phase ---
  10283. --- Firing Productions (PE) For State At Depth 1 ---
  10284. --- Inner Elaboration Phase, active level 1 (S1) ---
  10285. Firing apply*operator
  10286. -->
  10287. (I3 ^predict-no N974 + :O )
  10288. Firing apply*operator*complete
  10289. -->
  10290. (I3 ^predict-yes N973 - :O )
  10291. inner elaboration loop at bottom goal.
  10292. --- Change Working Memory (PE) ---
  10293. =>WM: (13724: I3 ^predict-no N974)
  10294. <=WM: (13711: N973 ^status complete)
  10295. <=WM: (13710: I3 ^predict-yes N973)
  10296. --- Firing Productions (IE) For State At Depth 1 ---
  10297. --- Inner Elaboration Phase, active level 1 (S1) ---
  10298. Firing monitor*world
  10299. -->
  10300. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10301. --- Change Working Memory (IE) ---
  10302. --- END Application Phase ---
  10303. --- Output Phase ---
  10304. ENV: Agent did: predict-no for direction U in state State-A
  10305. In State-A moving U
  10306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10307. predict error 0
  10308. dir: dir isU
  10309. --- END Output Phase ---
  10310. -/|--- Input Phase ---
  10311. =>WM: (13728: I2 ^dir U)
  10312. =>WM: (13727: I2 ^reward 1)
  10313. =>WM: (13726: I2 ^see 0)
  10314. =>WM: (13725: N974 ^status complete)
  10315. <=WM: (13714: I2 ^dir U)
  10316. <=WM: (13713: I2 ^reward 1)
  10317. <=WM: (13712: I2 ^see 1)
  10318. =>WM: (13729: I2 ^level-1 L1-root)
  10319. <=WM: (13715: I2 ^level-1 L1-root)
  10320. --- END Input Phase ---
  10321. --- Proposal Phase ---
  10322. --- Inner Elaboration Phase, active level 1 (S1) ---
  10323. Firing elaborate*copy-see-to-output-link
  10324. -->
  10325. (I3 ^see 0 +)
  10326. Firing elaborate*reward*based*on*reward
  10327. -->
  10328. (R978 ^value 1 +)
  10329. (R1 ^reward R978 +)
  10330. Firing propose*predict-yes
  10331. -->
  10332. (O1949 ^name predict-yes +)
  10333. (S1 ^operator O1949 +)
  10334. Firing propose*predict-no
  10335. -->
  10336. (O1950 ^name predict-no +)
  10337. (S1 ^operator O1950 +)
  10338. Firing rl*prefer*rvt*predict-no*H0*2
  10339. -->
  10340. (S1 ^operator O1948 = 1.)
  10341. Firing rl*prefer*rvt*predict-yes*H0*1
  10342. -->
  10343. (S1 ^operator O1947 = 0.)
  10344. Firing prefer*rvt*predict-yes*H0
  10345. -->
  10346. Firing prefer*rvt*predict-no*H0
  10347. -->
  10348. Firing elaborate*copy-dir-to-output-link
  10349. -->
  10350. (I3 ^dir U +)
  10351. inner elaboration loop at bottom goal.
  10352. Retracting elaborate*copy-see-to-output-link
  10353. -->
  10354. (I3 ^see 1 +)
  10355. Retracting propose*predict-no
  10356. -->
  10357. (O1948 ^name predict-no +)
  10358. (S1 ^operator O1948 +)
  10359. Retracting propose*predict-yes
  10360. -->
  10361. (O1947 ^name predict-yes +)
  10362. (S1 ^operator O1947 +)
  10363. Retracting elaborate*reward*based*on*reward
  10364. -->
  10365. (R977 ^value 1 +)
  10366. (R1 ^reward R977 +)
  10367. Retracting elaborate*copy-dir-to-output-link
  10368. -->
  10369. (I3 ^dir U +)
  10370. Retracting rl*prefer*rvt*predict-no*H0*2
  10371. -->
  10372. (S1 ^operator O1948 = 1.)
  10373. Retracting rl*prefer*rvt*predict-yes*H0*1
  10374. -->
  10375. (S1 ^operator O1947 = 0.)
  10376. =>WM: (13736: S1 ^operator O1950 +)
  10377. =>WM: (13735: S1 ^operator O1949 +)
  10378. =>WM: (13734: O1950 ^name predict-no)
  10379. =>WM: (13733: O1949 ^name predict-yes)
  10380. =>WM: (13732: R978 ^value 1)
  10381. =>WM: (13731: R1 ^reward R978)
  10382. =>WM: (13730: I3 ^see 0)
  10383. <=WM: (13721: S1 ^operator O1947 +)
  10384. <=WM: (13722: S1 ^operator O1948 +)
  10385. <=WM: (13723: S1 ^operator O1948)
  10386. <=WM: (13716: R1 ^reward R977)
  10387. <=WM: (13687: I3 ^see 1)
  10388. <=WM: (13719: O1948 ^name predict-no)
  10389. <=WM: (13718: O1947 ^name predict-yes)
  10390. <=WM: (13717: R977 ^value 1)
  10391. --- Inner Elaboration Phase, active level 1 (S1) ---
  10392. Firing prefer*rvt*predict-yes*H0
  10393. -->
  10394. Firing rl*prefer*rvt*predict-yes*H0*1
  10395. -->
  10396. (S1 ^operator O1949 = 0.)
  10397. Firing prefer*rvt*predict-no*H0
  10398. -->
  10399. Firing rl*prefer*rvt*predict-no*H0*2
  10400. -->
  10401. (S1 ^operator O1950 = 1.)
  10402. inner elaboration loop at bottom goal.
  10403. Retracting rl*prefer*rvt*predict-no*H0*2
  10404. -->
  10405. (S1 ^operator O1948 = 1.)
  10406. Retracting rl*prefer*rvt*predict-yes*H0*1
  10407. -->
  10408. (S1 ^operator O1947 = 0.)
  10409. --- END Proposal Phase ---
  10410. --- Decision Phase ---
  10411. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10412. =>WM: (13737: S1 ^operator O1950)
  10413. 975: O: O1950 (predict-no)
  10414. --- END Decision Phase ---
  10415. --- Application Phase ---
  10416. --- Firing Productions (PE) For State At Depth 1 ---
  10417. --- Inner Elaboration Phase, active level 1 (S1) ---
  10418. Firing apply*operator
  10419. -->
  10420. (I3 ^predict-no N975 + :O )
  10421. Firing apply*operator*complete
  10422. -->
  10423. (I3 ^predict-no N974 - :O )
  10424. inner elaboration loop at bottom goal.
  10425. --- Change Working Memory (PE) ---
  10426. =>WM: (13738: I3 ^predict-no N975)
  10427. <=WM: (13725: N974 ^status complete)
  10428. <=WM: (13724: I3 ^predict-no N974)
  10429. --- Firing Productions (IE) For State At Depth 1 ---
  10430. --- Inner Elaboration Phase, active level 1 (S1) ---
  10431. Firing monitor*world
  10432. -->
  10433. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10434. --- Change Working Memory (IE) ---
  10435. --- END Application Phase ---
  10436. --- Output Phase ---
  10437. ENV: Agent did: predict-no for direction U in state State-A
  10438. In State-A moving U
  10439. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10440. predict error 0
  10441. dir: dir isR
  10442. --- END Output Phase ---
  10443. \-/--- Input Phase ---
  10444. =>WM: (13742: I2 ^dir R)
  10445. =>WM: (13741: I2 ^reward 1)
  10446. =>WM: (13740: I2 ^see 0)
  10447. =>WM: (13739: N975 ^status complete)
  10448. <=WM: (13728: I2 ^dir U)
  10449. <=WM: (13727: I2 ^reward 1)
  10450. <=WM: (13726: I2 ^see 0)
  10451. =>WM: (13743: I2 ^level-1 L1-root)
  10452. <=WM: (13729: I2 ^level-1 L1-root)
  10453. --- END Input Phase ---
  10454. --- Proposal Phase ---
  10455. --- Inner Elaboration Phase, active level 1 (S1) ---
  10456. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10457. -->
  10458. (S1 ^operator O1950 = -0.2714224023553999)
  10459. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10460. -->
  10461. (S1 ^operator O1949 = 0.6622121600001568)
  10462. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10463. -->
  10464. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10465. -->
  10466. Firing elaborate*copy-see-to-output-link
  10467. -->
  10468. (I3 ^see 0 +)
  10469. Firing elaborate*reward*based*on*reward
  10470. -->
  10471. (R979 ^value 1 +)
  10472. (R1 ^reward R979 +)
  10473. Firing propose*predict-yes
  10474. -->
  10475. (O1951 ^name predict-yes +)
  10476. (S1 ^operator O1951 +)
  10477. Firing propose*predict-no
  10478. -->
  10479. (O1952 ^name predict-no +)
  10480. (S1 ^operator O1952 +)
  10481. Firing rl*prefer*rvt*predict-no*H0*4
  10482. -->
  10483. (S1 ^operator O1950 = 0.339769731277316)
  10484. Firing rl*prefer*rvt*predict-yes*H0*3
  10485. -->
  10486. (S1 ^operator O1949 = 0.3377121034427055)
  10487. Firing prefer*rvt*predict-yes*H0
  10488. -->
  10489. Firing prefer*rvt*predict-no*H0
  10490. -->
  10491. Firing elaborate*copy-dir-to-output-link
  10492. -->
  10493. (I3 ^dir R +)
  10494. inner elaboration loop at bottom goal.
  10495. Retracting elaborate*copy-see-to-output-link
  10496. -->
  10497. (I3 ^see 0 +)
  10498. Retracting propose*predict-no
  10499. -->
  10500. (O1950 ^name predict-no +)
  10501. (S1 ^operator O1950 +)
  10502. Retracting propose*predict-yes
  10503. -->
  10504. (O1949 ^name predict-yes +)
  10505. (S1 ^operator O1949 +)
  10506. Retracting elaborate*reward*based*on*reward
  10507. -->
  10508. (R978 ^value 1 +)
  10509. (R1 ^reward R978 +)
  10510. Retracting elaborate*copy-dir-to-output-link
  10511. -->
  10512. (I3 ^dir U +)
  10513. Retracting rl*prefer*rvt*predict-no*H0*2
  10514. -->
  10515. (S1 ^operator O1950 = 1.)
  10516. Retracting rl*prefer*rvt*predict-yes*H0*1
  10517. -->
  10518. (S1 ^operator O1949 = 0.)
  10519. =>WM: (13750: S1 ^operator O1952 +)
  10520. =>WM: (13749: S1 ^operator O1951 +)
  10521. =>WM: (13748: I3 ^dir R)
  10522. =>WM: (13747: O1952 ^name predict-no)
  10523. =>WM: (13746: O1951 ^name predict-yes)
  10524. =>WM: (13745: R979 ^value 1)
  10525. =>WM: (13744: R1 ^reward R979)
  10526. <=WM: (13735: S1 ^operator O1949 +)
  10527. <=WM: (13736: S1 ^operator O1950 +)
  10528. <=WM: (13737: S1 ^operator O1950)
  10529. <=WM: (13720: I3 ^dir U)
  10530. <=WM: (13731: R1 ^reward R978)
  10531. <=WM: (13734: O1950 ^name predict-no)
  10532. <=WM: (13733: O1949 ^name predict-yes)
  10533. <=WM: (13732: R978 ^value 1)
  10534. --- Inner Elaboration Phase, active level 1 (S1) ---
  10535. Firing prefer*rvt*predict-yes*H0
  10536. -->
  10537. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10538. -->
  10539. (S1 ^operator O1951 = 0.6622121600001568)
  10540. Firing rl*prefer*rvt*predict-yes*H0*3
  10541. -->
  10542. (S1 ^operator O1951 = 0.3377121034427055)
  10543. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10544. -->
  10545. Firing prefer*rvt*predict-no*H0
  10546. -->
  10547. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10548. -->
  10549. (S1 ^operator O1952 = -0.2714224023553999)
  10550. Firing rl*prefer*rvt*predict-no*H0*4
  10551. -->
  10552. (S1 ^operator O1952 = 0.339769731277316)
  10553. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10554. -->
  10555. inner elaboration loop at bottom goal.
  10556. Retracting rl*prefer*rvt*predict-no*H0*4
  10557. -->
  10558. (S1 ^operator O1950 = 0.339769731277316)
  10559. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10560. -->
  10561. (S1 ^operator O1950 = -0.2714224023553999)
  10562. Retracting rl*prefer*rvt*predict-yes*H0*3
  10563. -->
  10564. (S1 ^operator O1949 = 0.3377121034427055)
  10565. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10566. -->
  10567. (S1 ^operator O1949 = 0.6622121600001568)
  10568. --- END Proposal Phase ---
  10569. --- Decision Phase ---
  10570. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10571. =>WM: (13751: S1 ^operator O1951)
  10572. 976: O: O1951 (predict-yes)
  10573. --- END Decision Phase ---
  10574. --- Application Phase ---
  10575. --- Firing Productions (PE) For State At Depth 1 ---
  10576. --- Inner Elaboration Phase, active level 1 (S1) ---
  10577. Firing apply*operator
  10578. -->
  10579. (I3 ^predict-yes N976 + :O )
  10580. Firing apply*operator*complete
  10581. -->
  10582. (I3 ^predict-no N975 - :O )
  10583. inner elaboration loop at bottom goal.
  10584. --- Change Working Memory (PE) ---
  10585. =>WM: (13752: I3 ^predict-yes N976)
  10586. <=WM: (13739: N975 ^status complete)
  10587. <=WM: (13738: I3 ^predict-no N975)
  10588. --- Firing Productions (IE) For State At Depth 1 ---
  10589. --- Inner Elaboration Phase, active level 1 (S1) ---
  10590. Firing monitor*world
  10591. -->
  10592. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10593. --- Change Working Memory (IE) ---
  10594. --- END Application Phase ---
  10595. --- Output Phase ---
  10596. ENV: Agent did: predict-yes for direction R in state State-A
  10597. In State-A moving R
  10598. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10599. predict error 0
  10600. dir: dir isU
  10601. --- END Output Phase ---
  10602. |\---- Input Phase ---
  10603. =>WM: (13756: I2 ^dir U)
  10604. =>WM: (13755: I2 ^reward 1)
  10605. =>WM: (13754: I2 ^see 1)
  10606. =>WM: (13753: N976 ^status complete)
  10607. <=WM: (13742: I2 ^dir R)
  10608. <=WM: (13741: I2 ^reward 1)
  10609. <=WM: (13740: I2 ^see 0)
  10610. =>WM: (13757: I2 ^level-1 R1-root)
  10611. <=WM: (13743: I2 ^level-1 L1-root)
  10612. --- END Input Phase ---
  10613. --- Proposal Phase ---
  10614. --- Inner Elaboration Phase, active level 1 (S1) ---
  10615. Firing elaborate*copy-see-to-output-link
  10616. -->
  10617. (I3 ^see 1 +)
  10618. Firing elaborate*reward*based*on*reward
  10619. -->
  10620. (R980 ^value 1 +)
  10621. (R1 ^reward R980 +)
  10622. Firing propose*predict-yes
  10623. -->
  10624. (O1953 ^name predict-yes +)
  10625. (S1 ^operator O1953 +)
  10626. Firing propose*predict-no
  10627. -->
  10628. (O1954 ^name predict-no +)
  10629. (S1 ^operator O1954 +)
  10630. Firing rl*prefer*rvt*predict-no*H0*2
  10631. -->
  10632. (S1 ^operator O1952 = 1.)
  10633. Firing rl*prefer*rvt*predict-yes*H0*1
  10634. -->
  10635. (S1 ^operator O1951 = 0.)
  10636. Firing prefer*rvt*predict-yes*H0
  10637. -->
  10638. Firing prefer*rvt*predict-no*H0
  10639. -->
  10640. Firing elaborate*copy-dir-to-output-link
  10641. -->
  10642. (I3 ^dir U +)
  10643. inner elaboration loop at bottom goal.
  10644. Retracting elaborate*copy-see-to-output-link
  10645. -->
  10646. (I3 ^see 0 +)
  10647. Retracting propose*predict-no
  10648. -->
  10649. (O1952 ^name predict-no +)
  10650. (S1 ^operator O1952 +)
  10651. Retracting propose*predict-yes
  10652. -->
  10653. (O1951 ^name predict-yes +)
  10654. (S1 ^operator O1951 +)
  10655. Retracting elaborate*reward*based*on*reward
  10656. -->
  10657. (R979 ^value 1 +)
  10658. (R1 ^reward R979 +)
  10659. Retracting elaborate*copy-dir-to-output-link
  10660. -->
  10661. (I3 ^dir R +)
  10662. Retracting rl*prefer*rvt*predict-no*H0*4
  10663. -->
  10664. (S1 ^operator O1952 = 0.339769731277316)
  10665. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  10666. -->
  10667. (S1 ^operator O1952 = -0.2714224023553999)
  10668. Retracting rl*prefer*rvt*predict-yes*H0*3
  10669. -->
  10670. (S1 ^operator O1951 = 0.3377121034427055)
  10671. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  10672. -->
  10673. (S1 ^operator O1951 = 0.6622121600001568)
  10674. =>WM: (13765: S1 ^operator O1954 +)
  10675. =>WM: (13764: S1 ^operator O1953 +)
  10676. =>WM: (13763: I3 ^dir U)
  10677. =>WM: (13762: O1954 ^name predict-no)
  10678. =>WM: (13761: O1953 ^name predict-yes)
  10679. =>WM: (13760: R980 ^value 1)
  10680. =>WM: (13759: R1 ^reward R980)
  10681. =>WM: (13758: I3 ^see 1)
  10682. <=WM: (13749: S1 ^operator O1951 +)
  10683. <=WM: (13751: S1 ^operator O1951)
  10684. <=WM: (13750: S1 ^operator O1952 +)
  10685. <=WM: (13748: I3 ^dir R)
  10686. <=WM: (13744: R1 ^reward R979)
  10687. <=WM: (13730: I3 ^see 0)
  10688. <=WM: (13747: O1952 ^name predict-no)
  10689. <=WM: (13746: O1951 ^name predict-yes)
  10690. <=WM: (13745: R979 ^value 1)
  10691. --- Inner Elaboration Phase, active level 1 (S1) ---
  10692. Firing prefer*rvt*predict-yes*H0
  10693. -->
  10694. Firing rl*prefer*rvt*predict-yes*H0*1
  10695. -->
  10696. (S1 ^operator O1953 = 0.)
  10697. Firing prefer*rvt*predict-no*H0
  10698. -->
  10699. Firing rl*prefer*rvt*predict-no*H0*2
  10700. -->
  10701. (S1 ^operator O1954 = 1.)
  10702. inner elaboration loop at bottom goal.
  10703. Retracting rl*prefer*rvt*predict-no*H0*2
  10704. -->
  10705. (S1 ^operator O1952 = 1.)
  10706. Retracting rl*prefer*rvt*predict-yes*H0*1
  10707. -->
  10708. (S1 ^operator O1951 = 0.)
  10709. --- END Proposal Phase ---
  10710. --- Decision Phase ---
  10711. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.59012 -0.252401 0.337718(R,m,v=1,0.89697,0.0929786)
  10712. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.4098 0.252412 0.662212 -> 0.409809 0.252411 0.662219(R,m,v=1,1,0)
  10713. =>WM: (13766: S1 ^operator O1954)
  10714. 977: O: O1954 (predict-no)
  10715. --- END Decision Phase ---
  10716. --- Application Phase ---
  10717. --- Firing Productions (PE) For State At Depth 1 ---
  10718. --- Inner Elaboration Phase, active level 1 (S1) ---
  10719. Firing apply*operator
  10720. -->
  10721. (I3 ^predict-no N977 + :O )
  10722. Firing apply*operator*complete
  10723. -->
  10724. (I3 ^predict-yes N976 - :O )
  10725. inner elaboration loop at bottom goal.
  10726. --- Change Working Memory (PE) ---
  10727. =>WM: (13767: I3 ^predict-no N977)
  10728. <=WM: (13753: N976 ^status complete)
  10729. <=WM: (13752: I3 ^predict-yes N976)
  10730. --- Firing Productions (IE) For State At Depth 1 ---
  10731. --- Inner Elaboration Phase, active level 1 (S1) ---
  10732. Firing monitor*world
  10733. -->
  10734. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10735. --- Change Working Memory (IE) ---
  10736. --- END Application Phase ---
  10737. --- Output Phase ---
  10738. ENV: Agent did: predict-no for direction U in state State-B
  10739. In State-B moving U
  10740. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10741. predict error 0
  10742. dir: dir isU
  10743. --- END Output Phase ---
  10744. /|\--- Input Phase ---
  10745. =>WM: (13771: I2 ^dir U)
  10746. =>WM: (13770: I2 ^reward 1)
  10747. =>WM: (13769: I2 ^see 0)
  10748. =>WM: (13768: N977 ^status complete)
  10749. <=WM: (13756: I2 ^dir U)
  10750. <=WM: (13755: I2 ^reward 1)
  10751. <=WM: (13754: I2 ^see 1)
  10752. =>WM: (13772: I2 ^level-1 R1-root)
  10753. <=WM: (13757: I2 ^level-1 R1-root)
  10754. --- END Input Phase ---
  10755. --- Proposal Phase ---
  10756. --- Inner Elaboration Phase, active level 1 (S1) ---
  10757. Firing elaborate*copy-see-to-output-link
  10758. -->
  10759. (I3 ^see 0 +)
  10760. Firing elaborate*reward*based*on*reward
  10761. -->
  10762. (R981 ^value 1 +)
  10763. (R1 ^reward R981 +)
  10764. Firing propose*predict-yes
  10765. -->
  10766. (O1955 ^name predict-yes +)
  10767. (S1 ^operator O1955 +)
  10768. Firing propose*predict-no
  10769. -->
  10770. (O1956 ^name predict-no +)
  10771. (S1 ^operator O1956 +)
  10772. Firing rl*prefer*rvt*predict-no*H0*2
  10773. -->
  10774. (S1 ^operator O1954 = 1.)
  10775. Firing rl*prefer*rvt*predict-yes*H0*1
  10776. -->
  10777. (S1 ^operator O1953 = 0.)
  10778. Firing prefer*rvt*predict-yes*H0
  10779. -->
  10780. Firing prefer*rvt*predict-no*H0
  10781. -->
  10782. Firing elaborate*copy-dir-to-output-link
  10783. -->
  10784. (I3 ^dir U +)
  10785. inner elaboration loop at bottom goal.
  10786. Retracting elaborate*copy-see-to-output-link
  10787. -->
  10788. (I3 ^see 1 +)
  10789. Retracting propose*predict-no
  10790. -->
  10791. (O1954 ^name predict-no +)
  10792. (S1 ^operator O1954 +)
  10793. Retracting propose*predict-yes
  10794. -->
  10795. (O1953 ^name predict-yes +)
  10796. (S1 ^operator O1953 +)
  10797. Retracting elaborate*reward*based*on*reward
  10798. -->
  10799. (R980 ^value 1 +)
  10800. (R1 ^reward R980 +)
  10801. Retracting elaborate*copy-dir-to-output-link
  10802. -->
  10803. (I3 ^dir U +)
  10804. Retracting rl*prefer*rvt*predict-no*H0*2
  10805. -->
  10806. (S1 ^operator O1954 = 1.)
  10807. Retracting rl*prefer*rvt*predict-yes*H0*1
  10808. -->
  10809. (S1 ^operator O1953 = 0.)
  10810. =>WM: (13779: S1 ^operator O1956 +)
  10811. =>WM: (13778: S1 ^operator O1955 +)
  10812. =>WM: (13777: O1956 ^name predict-no)
  10813. =>WM: (13776: O1955 ^name predict-yes)
  10814. =>WM: (13775: R981 ^value 1)
  10815. =>WM: (13774: R1 ^reward R981)
  10816. =>WM: (13773: I3 ^see 0)
  10817. <=WM: (13764: S1 ^operator O1953 +)
  10818. <=WM: (13765: S1 ^operator O1954 +)
  10819. <=WM: (13766: S1 ^operator O1954)
  10820. <=WM: (13759: R1 ^reward R980)
  10821. <=WM: (13758: I3 ^see 1)
  10822. <=WM: (13762: O1954 ^name predict-no)
  10823. <=WM: (13761: O1953 ^name predict-yes)
  10824. <=WM: (13760: R980 ^value 1)
  10825. --- Inner Elaboration Phase, active level 1 (S1) ---
  10826. Firing prefer*rvt*predict-yes*H0
  10827. -->
  10828. Firing rl*prefer*rvt*predict-yes*H0*1
  10829. -->
  10830. (S1 ^operator O1955 = 0.)
  10831. Firing prefer*rvt*predict-no*H0
  10832. -->
  10833. Firing rl*prefer*rvt*predict-no*H0*2
  10834. -->
  10835. (S1 ^operator O1956 = 1.)
  10836. inner elaboration loop at bottom goal.
  10837. Retracting rl*prefer*rvt*predict-no*H0*2
  10838. -->
  10839. (S1 ^operator O1954 = 1.)
  10840. Retracting rl*prefer*rvt*predict-yes*H0*1
  10841. -->
  10842. (S1 ^operator O1953 = 0.)
  10843. --- END Proposal Phase ---
  10844. --- Decision Phase ---
  10845. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10846. =>WM: (13780: S1 ^operator O1956)
  10847. 978: O: O1956 (predict-no)
  10848. --- END Decision Phase ---
  10849. --- Application Phase ---
  10850. --- Firing Productions (PE) For State At Depth 1 ---
  10851. --- Inner Elaboration Phase, active level 1 (S1) ---
  10852. Firing apply*operator
  10853. -->
  10854. (I3 ^predict-no N978 + :O )
  10855. Firing apply*operator*complete
  10856. -->
  10857. (I3 ^predict-no N977 - :O )
  10858. inner elaboration loop at bottom goal.
  10859. --- Change Working Memory (PE) ---
  10860. =>WM: (13781: I3 ^predict-no N978)
  10861. <=WM: (13768: N977 ^status complete)
  10862. <=WM: (13767: I3 ^predict-no N977)
  10863. --- Firing Productions (IE) For State At Depth 1 ---
  10864. --- Inner Elaboration Phase, active level 1 (S1) ---
  10865. Firing monitor*world
  10866. -->
  10867. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10868. --- Change Working Memory (IE) ---
  10869. --- END Application Phase ---
  10870. --- Output Phase ---
  10871. ENV: Agent did: predict-no for direction U in state State-B
  10872. In State-B moving U
  10873. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10874. predict error 0
  10875. dir: dir isR
  10876. --- END Output Phase ---
  10877. -/|--- Input Phase ---
  10878. =>WM: (13785: I2 ^dir R)
  10879. =>WM: (13784: I2 ^reward 1)
  10880. =>WM: (13783: I2 ^see 0)
  10881. =>WM: (13782: N978 ^status complete)
  10882. <=WM: (13771: I2 ^dir U)
  10883. <=WM: (13770: I2 ^reward 1)
  10884. <=WM: (13769: I2 ^see 0)
  10885. =>WM: (13786: I2 ^level-1 R1-root)
  10886. <=WM: (13772: I2 ^level-1 R1-root)
  10887. --- END Input Phase ---
  10888. --- Proposal Phase ---
  10889. --- Inner Elaboration Phase, active level 1 (S1) ---
  10890. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  10891. -->
  10892. (S1 ^operator O1955 = -0.1070236389116304)
  10893. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  10894. -->
  10895. (S1 ^operator O1956 = 0.6602468953107985)
  10896. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10897. -->
  10898. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10899. -->
  10900. Firing elaborate*copy-see-to-output-link
  10901. -->
  10902. (I3 ^see 0 +)
  10903. Firing elaborate*reward*based*on*reward
  10904. -->
  10905. (R982 ^value 1 +)
  10906. (R1 ^reward R982 +)
  10907. Firing propose*predict-yes
  10908. -->
  10909. (O1957 ^name predict-yes +)
  10910. (S1 ^operator O1957 +)
  10911. Firing propose*predict-no
  10912. -->
  10913. (O1958 ^name predict-no +)
  10914. (S1 ^operator O1958 +)
  10915. Firing rl*prefer*rvt*predict-no*H0*4
  10916. -->
  10917. (S1 ^operator O1956 = 0.339769731277316)
  10918. Firing rl*prefer*rvt*predict-yes*H0*3
  10919. -->
  10920. (S1 ^operator O1955 = 0.3377183053124619)
  10921. Firing prefer*rvt*predict-yes*H0
  10922. -->
  10923. Firing prefer*rvt*predict-no*H0
  10924. -->
  10925. Firing elaborate*copy-dir-to-output-link
  10926. -->
  10927. (I3 ^dir R +)
  10928. inner elaboration loop at bottom goal.
  10929. Retracting elaborate*copy-see-to-output-link
  10930. -->
  10931. (I3 ^see 0 +)
  10932. Retracting propose*predict-no
  10933. -->
  10934. (O1956 ^name predict-no +)
  10935. (S1 ^operator O1956 +)
  10936. Retracting propose*predict-yes
  10937. -->
  10938. (O1955 ^name predict-yes +)
  10939. (S1 ^operator O1955 +)
  10940. Retracting elaborate*reward*based*on*reward
  10941. -->
  10942. (R981 ^value 1 +)
  10943. (R1 ^reward R981 +)
  10944. Retracting elaborate*copy-dir-to-output-link
  10945. -->
  10946. (I3 ^dir U +)
  10947. Retracting rl*prefer*rvt*predict-no*H0*2
  10948. -->
  10949. (S1 ^operator O1956 = 1.)
  10950. Retracting rl*prefer*rvt*predict-yes*H0*1
  10951. -->
  10952. (S1 ^operator O1955 = 0.)
  10953. =>WM: (13793: S1 ^operator O1958 +)
  10954. =>WM: (13792: S1 ^operator O1957 +)
  10955. =>WM: (13791: I3 ^dir R)
  10956. =>WM: (13790: O1958 ^name predict-no)
  10957. =>WM: (13789: O1957 ^name predict-yes)
  10958. =>WM: (13788: R982 ^value 1)
  10959. =>WM: (13787: R1 ^reward R982)
  10960. <=WM: (13778: S1 ^operator O1955 +)
  10961. <=WM: (13779: S1 ^operator O1956 +)
  10962. <=WM: (13780: S1 ^operator O1956)
  10963. <=WM: (13763: I3 ^dir U)
  10964. <=WM: (13774: R1 ^reward R981)
  10965. <=WM: (13777: O1956 ^name predict-no)
  10966. <=WM: (13776: O1955 ^name predict-yes)
  10967. <=WM: (13775: R981 ^value 1)
  10968. --- Inner Elaboration Phase, active level 1 (S1) ---
  10969. Firing prefer*rvt*predict-yes*H0
  10970. -->
  10971. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  10972. -->
  10973. (S1 ^operator O1957 = -0.1070236389116304)
  10974. Firing rl*prefer*rvt*predict-yes*H0*3
  10975. -->
  10976. (S1 ^operator O1957 = 0.3377183053124619)
  10977. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  10978. -->
  10979. Firing prefer*rvt*predict-no*H0
  10980. -->
  10981. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  10982. -->
  10983. (S1 ^operator O1958 = 0.6602468953107985)
  10984. Firing rl*prefer*rvt*predict-no*H0*4
  10985. -->
  10986. (S1 ^operator O1958 = 0.339769731277316)
  10987. Firing prefer*rvt*predict-no*H0*4*v1*H1
  10988. -->
  10989. inner elaboration loop at bottom goal.
  10990. Retracting rl*prefer*rvt*predict-no*H0*4
  10991. -->
  10992. (S1 ^operator O1956 = 0.339769731277316)
  10993. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  10994. -->
  10995. (S1 ^operator O1956 = 0.6602468953107985)
  10996. Retracting rl*prefer*rvt*predict-yes*H0*3
  10997. -->
  10998. (S1 ^operator O1955 = 0.3377183053124619)
  10999. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  11000. -->
  11001. (S1 ^operator O1955 = -0.1070236389116304)
  11002. --- END Proposal Phase ---
  11003. --- Decision Phase ---
  11004. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11005. =>WM: (13794: S1 ^operator O1958)
  11006. 979: O: O1958 (predict-no)
  11007. --- END Decision Phase ---
  11008. --- Application Phase ---
  11009. --- Firing Productions (PE) For State At Depth 1 ---
  11010. --- Inner Elaboration Phase, active level 1 (S1) ---
  11011. Firing apply*operator
  11012. -->
  11013. (I3 ^predict-no N979 + :O )
  11014. Firing apply*operator*complete
  11015. -->
  11016. (I3 ^predict-no N978 - :O )
  11017. inner elaboration loop at bottom goal.
  11018. --- Change Working Memory (PE) ---
  11019. =>WM: (13795: I3 ^predict-no N979)
  11020. <=WM: (13782: N978 ^status complete)
  11021. <=WM: (13781: I3 ^predict-no N978)
  11022. --- Firing Productions (IE) For State At Depth 1 ---
  11023. --- Inner Elaboration Phase, active level 1 (S1) ---
  11024. Firing monitor*world
  11025. -->
  11026. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11027. --- Change Working Memory (IE) ---
  11028. --- END Application Phase ---
  11029. --- Output Phase ---
  11030. ENV: Agent did: predict-no for direction R in state State-B
  11031. In State-B moving R
  11032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11033. predict error 0
  11034. dir: dir isU
  11035. --- END Output Phase ---
  11036. \-/--- Input Phase ---
  11037. =>WM: (13799: I2 ^dir U)
  11038. =>WM: (13798: I2 ^reward 1)
  11039. =>WM: (13797: I2 ^see 0)
  11040. =>WM: (13796: N979 ^status complete)
  11041. <=WM: (13785: I2 ^dir R)
  11042. <=WM: (13784: I2 ^reward 1)
  11043. <=WM: (13783: I2 ^see 0)
  11044. =>WM: (13800: I2 ^level-1 R0-root)
  11045. <=WM: (13786: I2 ^level-1 R1-root)
  11046. --- END Input Phase ---
  11047. --- Proposal Phase ---
  11048. --- Inner Elaboration Phase, active level 1 (S1) ---
  11049. Firing elaborate*copy-see-to-output-link
  11050. -->
  11051. (I3 ^see 0 +)
  11052. Firing elaborate*reward*based*on*reward
  11053. -->
  11054. (R983 ^value 1 +)
  11055. (R1 ^reward R983 +)
  11056. Firing propose*predict-yes
  11057. -->
  11058. (O1959 ^name predict-yes +)
  11059. (S1 ^operator O1959 +)
  11060. Firing propose*predict-no
  11061. -->
  11062. (O1960 ^name predict-no +)
  11063. (S1 ^operator O1960 +)
  11064. Firing rl*prefer*rvt*predict-no*H0*2
  11065. -->
  11066. (S1 ^operator O1958 = 1.)
  11067. Firing rl*prefer*rvt*predict-yes*H0*1
  11068. -->
  11069. (S1 ^operator O1957 = 0.)
  11070. Firing prefer*rvt*predict-yes*H0
  11071. -->
  11072. Firing prefer*rvt*predict-no*H0
  11073. -->
  11074. Firing elaborate*copy-dir-to-output-link
  11075. -->
  11076. (I3 ^dir U +)
  11077. inner elaboration loop at bottom goal.
  11078. Retracting elaborate*copy-see-to-output-link
  11079. -->
  11080. (I3 ^see 0 +)
  11081. Retracting propose*predict-no
  11082. -->
  11083. (O1958 ^name predict-no +)
  11084. (S1 ^operator O1958 +)
  11085. Retracting propose*predict-yes
  11086. -->
  11087. (O1957 ^name predict-yes +)
  11088. (S1 ^operator O1957 +)
  11089. Retracting elaborate*reward*based*on*reward
  11090. -->
  11091. (R982 ^value 1 +)
  11092. (R1 ^reward R982 +)
  11093. Retracting elaborate*copy-dir-to-output-link
  11094. -->
  11095. (I3 ^dir R +)
  11096. Retracting rl*prefer*rvt*predict-no*H0*4
  11097. -->
  11098. (S1 ^operator O1958 = 0.339769731277316)
  11099. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  11100. -->
  11101. (S1 ^operator O1958 = 0.6602468953107985)
  11102. Retracting rl*prefer*rvt*predict-yes*H0*3
  11103. -->
  11104. (S1 ^operator O1957 = 0.3377183053124619)
  11105. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  11106. -->
  11107. (S1 ^operator O1957 = -0.1070236389116304)
  11108. =>WM: (13807: S1 ^operator O1960 +)
  11109. =>WM: (13806: S1 ^operator O1959 +)
  11110. =>WM: (13805: I3 ^dir U)
  11111. =>WM: (13804: O1960 ^name predict-no)
  11112. =>WM: (13803: O1959 ^name predict-yes)
  11113. =>WM: (13802: R983 ^value 1)
  11114. =>WM: (13801: R1 ^reward R983)
  11115. <=WM: (13792: S1 ^operator O1957 +)
  11116. <=WM: (13793: S1 ^operator O1958 +)
  11117. <=WM: (13794: S1 ^operator O1958)
  11118. <=WM: (13791: I3 ^dir R)
  11119. <=WM: (13787: R1 ^reward R982)
  11120. <=WM: (13790: O1958 ^name predict-no)
  11121. <=WM: (13789: O1957 ^name predict-yes)
  11122. <=WM: (13788: R982 ^value 1)
  11123. --- Inner Elaboration Phase, active level 1 (S1) ---
  11124. Firing prefer*rvt*predict-yes*H0
  11125. -->
  11126. Firing rl*prefer*rvt*predict-yes*H0*1
  11127. -->
  11128. (S1 ^operator O1959 = 0.)
  11129. Firing prefer*rvt*predict-no*H0
  11130. -->
  11131. Firing rl*prefer*rvt*predict-no*H0*2
  11132. -->
  11133. (S1 ^operator O1960 = 1.)
  11134. inner elaboration loop at bottom goal.
  11135. Retracting rl*prefer*rvt*predict-no*H0*2
  11136. -->
  11137. (S1 ^operator O1958 = 1.)
  11138. Retracting rl*prefer*rvt*predict-yes*H0*1
  11139. -->
  11140. (S1 ^operator O1957 = 0.)
  11141. --- END Proposal Phase ---
  11142. --- Decision Phase ---
  11143. RL update rl*prefer*rvt*predict-no*H0*4 0.570253 -0.230483 0.33977 -> 0.570252 -0.230483 0.339768(R,m,v=1,0.873494,0.111172)
  11144. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429764 0.230483 0.660247 -> 0.429763 0.230483 0.660245(R,m,v=1,1,0)
  11145. =>WM: (13808: S1 ^operator O1960)
  11146. 980: O: O1960 (predict-no)
  11147. --- END Decision Phase ---
  11148. --- Application Phase ---
  11149. --- Firing Productions (PE) For State At Depth 1 ---
  11150. --- Inner Elaboration Phase, active level 1 (S1) ---
  11151. Firing apply*operator
  11152. -->
  11153. (I3 ^predict-no N980 + :O )
  11154. Firing apply*operator*complete
  11155. -->
  11156. (I3 ^predict-no N979 - :O )
  11157. inner elaboration loop at bottom goal.
  11158. --- Change Working Memory (PE) ---
  11159. =>WM: (13809: I3 ^predict-no N980)
  11160. <=WM: (13796: N979 ^status complete)
  11161. <=WM: (13795: I3 ^predict-no N979)
  11162. --- Firing Productions (IE) For State At Depth 1 ---
  11163. --- Inner Elaboration Phase, active level 1 (S1) ---
  11164. Firing monitor*world
  11165. -->
  11166. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11167. --- Change Working Memory (IE) ---
  11168. --- END Application Phase ---
  11169. --- Output Phase ---
  11170. ENV: Agent did: predict-no for direction U in state State-B
  11171. In State-B moving U
  11172. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11173. predict error 0
  11174. dir: dir isU
  11175. --- END Output Phase ---
  11176. |\--- Input Phase ---
  11177. =>WM: (13813: I2 ^dir U)
  11178. =>WM: (13812: I2 ^reward 1)
  11179. =>WM: (13811: I2 ^see 0)
  11180. =>WM: (13810: N980 ^status complete)
  11181. <=WM: (13799: I2 ^dir U)
  11182. <=WM: (13798: I2 ^reward 1)
  11183. <=WM: (13797: I2 ^see 0)
  11184. =>WM: (13814: I2 ^level-1 R0-root)
  11185. <=WM: (13800: I2 ^level-1 R0-root)
  11186. --- END Input Phase ---
  11187. --- Proposal Phase ---
  11188. --- Inner Elaboration Phase, active level 1 (S1) ---
  11189. Firing elaborate*copy-see-to-output-link
  11190. -->
  11191. (I3 ^see 0 +)
  11192. Firing elaborate*reward*based*on*reward
  11193. -->
  11194. (R984 ^value 1 +)
  11195. (R1 ^reward R984 +)
  11196. Firing propose*predict-yes
  11197. -->
  11198. (O1961 ^name predict-yes +)
  11199. (S1 ^operator O1961 +)
  11200. Firing propose*predict-no
  11201. -->
  11202. (O1962 ^name predict-no +)
  11203. (S1 ^operator O1962 +)
  11204. Firing rl*prefer*rvt*predict-no*H0*2
  11205. -->
  11206. (S1 ^operator O1960 = 1.)
  11207. Firing rl*prefer*rvt*predict-yes*H0*1
  11208. -->
  11209. (S1 ^operator O1959 = 0.)
  11210. Firing prefer*rvt*predict-yes*H0
  11211. -->
  11212. Firing prefer*rvt*predict-no*H0
  11213. -->
  11214. Firing elaborate*copy-dir-to-output-link
  11215. -->
  11216. (I3 ^dir U +)
  11217. inner elaboration loop at bottom goal.
  11218. Retracting elaborate*copy-see-to-output-link
  11219. -->
  11220. (I3 ^see 0 +)
  11221. Retracting propose*predict-no
  11222. -->
  11223. (O1960 ^name predict-no +)
  11224. (S1 ^operator O1960 +)
  11225. Retracting propose*predict-yes
  11226. -->
  11227. (O1959 ^name predict-yes +)
  11228. (S1 ^operator O1959 +)
  11229. Retracting elaborate*reward*based*on*reward
  11230. -->
  11231. (R983 ^value 1 +)
  11232. (R1 ^reward R983 +)
  11233. Retracting elaborate*copy-dir-to-output-link
  11234. -->
  11235. (I3 ^dir U +)
  11236. Retracting rl*prefer*rvt*predict-no*H0*2
  11237. -->
  11238. (S1 ^operator O1960 = 1.)
  11239. Retracting rl*prefer*rvt*predict-yes*H0*1
  11240. -->
  11241. (S1 ^operator O1959 = 0.)
  11242. =>WM: (13820: S1 ^operator O1962 +)
  11243. =>WM: (13819: S1 ^operator O1961 +)
  11244. =>WM: (13818: O1962 ^name predict-no)
  11245. =>WM: (13817: O1961 ^name predict-yes)
  11246. =>WM: (13816: R984 ^value 1)
  11247. =>WM: (13815: R1 ^reward R984)
  11248. <=WM: (13806: S1 ^operator O1959 +)
  11249. <=WM: (13807: S1 ^operator O1960 +)
  11250. <=WM: (13808: S1 ^operator O1960)
  11251. <=WM: (13801: R1 ^reward R983)
  11252. <=WM: (13804: O1960 ^name predict-no)
  11253. <=WM: (13803: O1959 ^name predict-yes)
  11254. <=WM: (13802: R983 ^value 1)
  11255. --- Inner Elaboration Phase, active level 1 (S1) ---
  11256. Firing prefer*rvt*predict-yes*H0
  11257. -->
  11258. Firing rl*prefer*rvt*predict-yes*H0*1
  11259. -->
  11260. (S1 ^operator O1961 = 0.)
  11261. Firing prefer*rvt*predict-no*H0
  11262. -->
  11263. Firing rl*prefer*rvt*predict-no*H0*2
  11264. -->
  11265. (S1 ^operator O1962 = 1.)
  11266. inner elaboration loop at bottom goal.
  11267. Retracting rl*prefer*rvt*predict-no*H0*2
  11268. -->
  11269. (S1 ^operator O1960 = 1.)
  11270. Retracting rl*prefer*rvt*predict-yes*H0*1
  11271. -->
  11272. (S1 ^operator O1959 = 0.)
  11273. --- END Proposal Phase ---
  11274. --- Decision Phase ---
  11275. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11276. =>WM: (13821: S1 ^operator O1962)
  11277. 981: O: O1962 (predict-no)
  11278. --- END Decision Phase ---
  11279. --- Application Phase ---
  11280. --- Firing Productions (PE) For State At Depth 1 ---
  11281. --- Inner Elaboration Phase, active level 1 (S1) ---
  11282. Firing apply*operator
  11283. -->
  11284. (I3 ^predict-no N981 + :O )
  11285. Firing apply*operator*complete
  11286. -->
  11287. (I3 ^predict-no N980 - :O )
  11288. inner elaboration loop at bottom goal.
  11289. --- Change Working Memory (PE) ---
  11290. =>WM: (13822: I3 ^predict-no N981)
  11291. <=WM: (13810: N980 ^status complete)
  11292. <=WM: (13809: I3 ^predict-no N980)
  11293. --- Firing Productions (IE) For State At Depth 1 ---
  11294. --- Inner Elaboration Phase, active level 1 (S1) ---
  11295. Firing monitor*world
  11296. -->
  11297. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11298. --- Change Working Memory (IE) ---
  11299. --- END Application Phase ---
  11300. --- Output Phase ---
  11301. ENV: Agent did: predict-no for direction U in state State-B
  11302. In State-B moving U
  11303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11304. predict error 0
  11305. dir: dir isL
  11306. --- END Output Phase ---
  11307. ---- Input Phase ---
  11308. =>WM: (13826: I2 ^dir L)
  11309. =>WM: (13825: I2 ^reward 1)
  11310. =>WM: (13824: I2 ^see 0)
  11311. =>WM: (13823: N981 ^status complete)
  11312. <=WM: (13813: I2 ^dir U)
  11313. <=WM: (13812: I2 ^reward 1)
  11314. <=WM: (13811: I2 ^see 0)
  11315. =>WM: (13827: I2 ^level-1 R0-root)
  11316. <=WM: (13814: I2 ^level-1 R0-root)
  11317. --- END Input Phase ---
  11318. --- Proposal Phase ---
  11319. --- Inner Elaboration Phase, active level 1 (S1) ---
  11320. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11321. -->
  11322. (S1 ^operator O1961 = 0.7358289752034343)
  11323. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11324. -->
  11325. Firing elaborate*copy-see-to-output-link
  11326. -->
  11327. (I3 ^see 0 +)
  11328. Firing elaborate*reward*based*on*reward
  11329. -->
  11330. (R985 ^value 1 +)
  11331. (R1 ^reward R985 +)
  11332. Firing propose*predict-yes
  11333. -->
  11334. (O1963 ^name predict-yes +)
  11335. (S1 ^operator O1963 +)
  11336. Firing propose*predict-no
  11337. -->
  11338. (O1964 ^name predict-no +)
  11339. (S1 ^operator O1964 +)
  11340. Firing rl*prefer*rvt*predict-no*H0*6
  11341. -->
  11342. (S1 ^operator O1962 = 0.9997480945179411)
  11343. Firing rl*prefer*rvt*predict-yes*H0*5
  11344. -->
  11345. (S1 ^operator O1961 = 0.2640281357095451)
  11346. Firing prefer*rvt*predict-yes*H0
  11347. -->
  11348. Firing prefer*rvt*predict-no*H0
  11349. -->
  11350. Firing elaborate*copy-dir-to-output-link
  11351. -->
  11352. (I3 ^dir L +)
  11353. inner elaboration loop at bottom goal.
  11354. Retracting elaborate*copy-see-to-output-link
  11355. -->
  11356. (I3 ^see 0 +)
  11357. Retracting propose*predict-no
  11358. -->
  11359. (O1962 ^name predict-no +)
  11360. (S1 ^operator O1962 +)
  11361. Retracting propose*predict-yes
  11362. -->
  11363. (O1961 ^name predict-yes +)
  11364. (S1 ^operator O1961 +)
  11365. Retracting elaborate*reward*based*on*reward
  11366. -->
  11367. (R984 ^value 1 +)
  11368. (R1 ^reward R984 +)
  11369. Retracting elaborate*copy-dir-to-output-link
  11370. -->
  11371. (I3 ^dir U +)
  11372. Retracting rl*prefer*rvt*predict-no*H0*2
  11373. -->
  11374. (S1 ^operator O1962 = 1.)
  11375. Retracting rl*prefer*rvt*predict-yes*H0*1
  11376. -->
  11377. (S1 ^operator O1961 = 0.)
  11378. =>WM: (13834: S1 ^operator O1964 +)
  11379. =>WM: (13833: S1 ^operator O1963 +)
  11380. =>WM: (13832: I3 ^dir L)
  11381. =>WM: (13831: O1964 ^name predict-no)
  11382. =>WM: (13830: O1963 ^name predict-yes)
  11383. =>WM: (13829: R985 ^value 1)
  11384. =>WM: (13828: R1 ^reward R985)
  11385. <=WM: (13819: S1 ^operator O1961 +)
  11386. <=WM: (13820: S1 ^operator O1962 +)
  11387. <=WM: (13821: S1 ^operator O1962)
  11388. <=WM: (13805: I3 ^dir U)
  11389. <=WM: (13815: R1 ^reward R984)
  11390. <=WM: (13818: O1962 ^name predict-no)
  11391. <=WM: (13817: O1961 ^name predict-yes)
  11392. <=WM: (13816: R984 ^value 1)
  11393. --- Inner Elaboration Phase, active level 1 (S1) ---
  11394. Firing prefer*rvt*predict-yes*H0
  11395. -->
  11396. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11397. -->
  11398. (S1 ^operator O1963 = 0.7358289752034343)
  11399. Firing rl*prefer*rvt*predict-yes*H0*5
  11400. -->
  11401. (S1 ^operator O1963 = 0.2640281357095451)
  11402. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11403. -->
  11404. Firing prefer*rvt*predict-no*H0
  11405. -->
  11406. Firing rl*prefer*rvt*predict-no*H0*6
  11407. -->
  11408. (S1 ^operator O1964 = 0.9997480945179411)
  11409. inner elaboration loop at bottom goal.
  11410. Retracting rl*prefer*rvt*predict-no*H0*6
  11411. -->
  11412. (S1 ^operator O1962 = 0.9997480945179411)
  11413. Retracting rl*prefer*rvt*predict-yes*H0*5
  11414. -->
  11415. (S1 ^operator O1961 = 0.2640281357095451)
  11416. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11417. -->
  11418. (S1 ^operator O1961 = 0.7358289752034343)
  11419. --- END Proposal Phase ---
  11420. --- Decision Phase ---
  11421. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11422. =>WM: (13835: S1 ^operator O1963)
  11423. 982: O: O1963 (predict-yes)
  11424. --- END Decision Phase ---
  11425. --- Application Phase ---
  11426. --- Firing Productions (PE) For State At Depth 1 ---
  11427. --- Inner Elaboration Phase, active level 1 (S1) ---
  11428. Firing apply*operator
  11429. -->
  11430. (I3 ^predict-yes N982 + :O )
  11431. Firing apply*operator*complete
  11432. -->
  11433. (I3 ^predict-no N981 - :O )
  11434. inner elaboration loop at bottom goal.
  11435. --- Change Working Memory (PE) ---
  11436. =>WM: (13836: I3 ^predict-yes N982)
  11437. <=WM: (13823: N981 ^status complete)
  11438. <=WM: (13822: I3 ^predict-no N981)
  11439. --- Firing Productions (IE) For State At Depth 1 ---
  11440. --- Inner Elaboration Phase, active level 1 (S1) ---
  11441. Firing monitor*world
  11442. -->
  11443. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11444. --- Change Working Memory (IE) ---
  11445. --- END Application Phase ---
  11446. --- Output Phase ---
  11447. ENV: Agent did: predict-yes for direction L in state State-B
  11448. In State-B moving L
  11449. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11450. predict error 0
  11451. dir: dir isU
  11452. --- END Output Phase ---
  11453. /|\--- Input Phase ---
  11454. =>WM: (13840: I2 ^dir U)
  11455. =>WM: (13839: I2 ^reward 1)
  11456. =>WM: (13838: I2 ^see 1)
  11457. =>WM: (13837: N982 ^status complete)
  11458. <=WM: (13826: I2 ^dir L)
  11459. <=WM: (13825: I2 ^reward 1)
  11460. <=WM: (13824: I2 ^see 0)
  11461. =>WM: (13841: I2 ^level-1 L1-root)
  11462. <=WM: (13827: I2 ^level-1 R0-root)
  11463. --- END Input Phase ---
  11464. --- Proposal Phase ---
  11465. --- Inner Elaboration Phase, active level 1 (S1) ---
  11466. Firing elaborate*copy-see-to-output-link
  11467. -->
  11468. (I3 ^see 1 +)
  11469. Firing elaborate*reward*based*on*reward
  11470. -->
  11471. (R986 ^value 1 +)
  11472. (R1 ^reward R986 +)
  11473. Firing propose*predict-yes
  11474. -->
  11475. (O1965 ^name predict-yes +)
  11476. (S1 ^operator O1965 +)
  11477. Firing propose*predict-no
  11478. -->
  11479. (O1966 ^name predict-no +)
  11480. (S1 ^operator O1966 +)
  11481. Firing rl*prefer*rvt*predict-no*H0*2
  11482. -->
  11483. (S1 ^operator O1964 = 1.)
  11484. Firing rl*prefer*rvt*predict-yes*H0*1
  11485. -->
  11486. (S1 ^operator O1963 = 0.)
  11487. Firing prefer*rvt*predict-yes*H0
  11488. -->
  11489. Firing prefer*rvt*predict-no*H0
  11490. -->
  11491. Firing elaborate*copy-dir-to-output-link
  11492. -->
  11493. (I3 ^dir U +)
  11494. inner elaboration loop at bottom goal.
  11495. Retracting elaborate*copy-see-to-output-link
  11496. -->
  11497. (I3 ^see 0 +)
  11498. Retracting propose*predict-no
  11499. -->
  11500. (O1964 ^name predict-no +)
  11501. (S1 ^operator O1964 +)
  11502. Retracting propose*predict-yes
  11503. -->
  11504. (O1963 ^name predict-yes +)
  11505. (S1 ^operator O1963 +)
  11506. Retracting elaborate*reward*based*on*reward
  11507. -->
  11508. (R985 ^value 1 +)
  11509. (R1 ^reward R985 +)
  11510. Retracting elaborate*copy-dir-to-output-link
  11511. -->
  11512. (I3 ^dir L +)
  11513. Retracting rl*prefer*rvt*predict-no*H0*6
  11514. -->
  11515. (S1 ^operator O1964 = 0.9997480945179411)
  11516. Retracting rl*prefer*rvt*predict-yes*H0*5
  11517. -->
  11518. (S1 ^operator O1963 = 0.2640281357095451)
  11519. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  11520. -->
  11521. (S1 ^operator O1963 = 0.7358289752034343)
  11522. =>WM: (13849: S1 ^operator O1966 +)
  11523. =>WM: (13848: S1 ^operator O1965 +)
  11524. =>WM: (13847: I3 ^dir U)
  11525. =>WM: (13846: O1966 ^name predict-no)
  11526. =>WM: (13845: O1965 ^name predict-yes)
  11527. =>WM: (13844: R986 ^value 1)
  11528. =>WM: (13843: R1 ^reward R986)
  11529. =>WM: (13842: I3 ^see 1)
  11530. <=WM: (13833: S1 ^operator O1963 +)
  11531. <=WM: (13835: S1 ^operator O1963)
  11532. <=WM: (13834: S1 ^operator O1964 +)
  11533. <=WM: (13832: I3 ^dir L)
  11534. <=WM: (13828: R1 ^reward R985)
  11535. <=WM: (13773: I3 ^see 0)
  11536. <=WM: (13831: O1964 ^name predict-no)
  11537. <=WM: (13830: O1963 ^name predict-yes)
  11538. <=WM: (13829: R985 ^value 1)
  11539. --- Inner Elaboration Phase, active level 1 (S1) ---
  11540. Firing prefer*rvt*predict-yes*H0
  11541. -->
  11542. Firing rl*prefer*rvt*predict-yes*H0*1
  11543. -->
  11544. (S1 ^operator O1965 = 0.)
  11545. Firing prefer*rvt*predict-no*H0
  11546. -->
  11547. Firing rl*prefer*rvt*predict-no*H0*2
  11548. -->
  11549. (S1 ^operator O1966 = 1.)
  11550. inner elaboration loop at bottom goal.
  11551. Retracting rl*prefer*rvt*predict-no*H0*2
  11552. -->
  11553. (S1 ^operator O1964 = 1.)
  11554. Retracting rl*prefer*rvt*predict-yes*H0*1
  11555. -->
  11556. (S1 ^operator O1963 = 0.)
  11557. --- END Proposal Phase ---
  11558. --- Decision Phase ---
  11559. RL update rl*prefer*rvt*predict-yes*H0*5 0.554414 -0.290386 0.264028 -> 0.554425 -0.290385 0.26404(R,m,v=1,0.875706,0.109463)
  11560. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445446 0.290383 0.735829 -> 0.44546 0.290383 0.735843(R,m,v=1,1,0)
  11561. =>WM: (13850: S1 ^operator O1966)
  11562. 983: O: O1966 (predict-no)
  11563. --- END Decision Phase ---
  11564. --- Application Phase ---
  11565. --- Firing Productions (PE) For State At Depth 1 ---
  11566. --- Inner Elaboration Phase, active level 1 (S1) ---
  11567. Firing apply*operator
  11568. -->
  11569. (I3 ^predict-no N983 + :O )
  11570. Firing apply*operator*complete
  11571. -->
  11572. (I3 ^predict-yes N982 - :O )
  11573. inner elaboration loop at bottom goal.
  11574. --- Change Working Memory (PE) ---
  11575. =>WM: (13851: I3 ^predict-no N983)
  11576. <=WM: (13837: N982 ^status complete)
  11577. <=WM: (13836: I3 ^predict-yes N982)
  11578. --- Firing Productions (IE) For State At Depth 1 ---
  11579. --- Inner Elaboration Phase, active level 1 (S1) ---
  11580. Firing monitor*world
  11581. -->
  11582. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11583. --- Change Working Memory (IE) ---
  11584. --- END Application Phase ---
  11585. --- Output Phase ---
  11586. ENV: Agent did: predict-no for direction U in state State-A
  11587. In State-A moving U
  11588. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11589. predict error 0
  11590. dir: dir isL
  11591. --- END Output Phase ---
  11592. -/|--- Input Phase ---
  11593. =>WM: (13855: I2 ^dir L)
  11594. =>WM: (13854: I2 ^reward 1)
  11595. =>WM: (13853: I2 ^see 0)
  11596. =>WM: (13852: N983 ^status complete)
  11597. <=WM: (13840: I2 ^dir U)
  11598. <=WM: (13839: I2 ^reward 1)
  11599. <=WM: (13838: I2 ^see 1)
  11600. =>WM: (13856: I2 ^level-1 L1-root)
  11601. <=WM: (13841: I2 ^level-1 L1-root)
  11602. --- END Input Phase ---
  11603. --- Proposal Phase ---
  11604. --- Inner Elaboration Phase, active level 1 (S1) ---
  11605. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11606. -->
  11607. (S1 ^operator O1965 = -0.181727099742844)
  11608. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11609. -->
  11610. Firing elaborate*copy-see-to-output-link
  11611. -->
  11612. (I3 ^see 0 +)
  11613. Firing elaborate*reward*based*on*reward
  11614. -->
  11615. (R987 ^value 1 +)
  11616. (R1 ^reward R987 +)
  11617. Firing propose*predict-yes
  11618. -->
  11619. (O1967 ^name predict-yes +)
  11620. (S1 ^operator O1967 +)
  11621. Firing propose*predict-no
  11622. -->
  11623. (O1968 ^name predict-no +)
  11624. (S1 ^operator O1968 +)
  11625. Firing rl*prefer*rvt*predict-no*H0*6
  11626. -->
  11627. (S1 ^operator O1966 = 0.9997480945179411)
  11628. Firing rl*prefer*rvt*predict-yes*H0*5
  11629. -->
  11630. (S1 ^operator O1965 = 0.264039703522277)
  11631. Firing prefer*rvt*predict-yes*H0
  11632. -->
  11633. Firing prefer*rvt*predict-no*H0
  11634. -->
  11635. Firing elaborate*copy-dir-to-output-link
  11636. -->
  11637. (I3 ^dir L +)
  11638. inner elaboration loop at bottom goal.
  11639. Retracting elaborate*copy-see-to-output-link
  11640. -->
  11641. (I3 ^see 1 +)
  11642. Retracting propose*predict-no
  11643. -->
  11644. (O1966 ^name predict-no +)
  11645. (S1 ^operator O1966 +)
  11646. Retracting propose*predict-yes
  11647. -->
  11648. (O1965 ^name predict-yes +)
  11649. (S1 ^operator O1965 +)
  11650. Retracting elaborate*reward*based*on*reward
  11651. -->
  11652. (R986 ^value 1 +)
  11653. (R1 ^reward R986 +)
  11654. Retracting elaborate*copy-dir-to-output-link
  11655. -->
  11656. (I3 ^dir U +)
  11657. Retracting rl*prefer*rvt*predict-no*H0*2
  11658. -->
  11659. (S1 ^operator O1966 = 1.)
  11660. Retracting rl*prefer*rvt*predict-yes*H0*1
  11661. -->
  11662. (S1 ^operator O1965 = 0.)
  11663. =>WM: (13864: S1 ^operator O1968 +)
  11664. =>WM: (13863: S1 ^operator O1967 +)
  11665. =>WM: (13862: I3 ^dir L)
  11666. =>WM: (13861: O1968 ^name predict-no)
  11667. =>WM: (13860: O1967 ^name predict-yes)
  11668. =>WM: (13859: R987 ^value 1)
  11669. =>WM: (13858: R1 ^reward R987)
  11670. =>WM: (13857: I3 ^see 0)
  11671. <=WM: (13848: S1 ^operator O1965 +)
  11672. <=WM: (13849: S1 ^operator O1966 +)
  11673. <=WM: (13850: S1 ^operator O1966)
  11674. <=WM: (13847: I3 ^dir U)
  11675. <=WM: (13843: R1 ^reward R986)
  11676. <=WM: (13842: I3 ^see 1)
  11677. <=WM: (13846: O1966 ^name predict-no)
  11678. <=WM: (13845: O1965 ^name predict-yes)
  11679. <=WM: (13844: R986 ^value 1)
  11680. --- Inner Elaboration Phase, active level 1 (S1) ---
  11681. Firing prefer*rvt*predict-yes*H0
  11682. -->
  11683. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11684. -->
  11685. (S1 ^operator O1967 = -0.181727099742844)
  11686. Firing rl*prefer*rvt*predict-yes*H0*5
  11687. -->
  11688. (S1 ^operator O1967 = 0.264039703522277)
  11689. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  11690. -->
  11691. Firing prefer*rvt*predict-no*H0
  11692. -->
  11693. Firing rl*prefer*rvt*predict-no*H0*6
  11694. -->
  11695. (S1 ^operator O1968 = 0.9997480945179411)
  11696. inner elaboration loop at bottom goal.
  11697. Retracting rl*prefer*rvt*predict-no*H0*6
  11698. -->
  11699. (S1 ^operator O1966 = 0.9997480945179411)
  11700. Retracting rl*prefer*rvt*predict-yes*H0*5
  11701. -->
  11702. (S1 ^operator O1965 = 0.264039703522277)
  11703. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11704. -->
  11705. (S1 ^operator O1965 = -0.181727099742844)
  11706. --- END Proposal Phase ---
  11707. --- Decision Phase ---
  11708. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  11709. =>WM: (13865: S1 ^operator O1968)
  11710. 984: O: O1968 (predict-no)
  11711. --- END Decision Phase ---
  11712. --- Application Phase ---
  11713. --- Firing Productions (PE) For State At Depth 1 ---
  11714. --- Inner Elaboration Phase, active level 1 (S1) ---
  11715. Firing apply*operator
  11716. -->
  11717. (I3 ^predict-no N984 + :O )
  11718. Firing apply*operator*complete
  11719. -->
  11720. (I3 ^predict-no N983 - :O )
  11721. inner elaboration loop at bottom goal.
  11722. --- Change Working Memory (PE) ---
  11723. =>WM: (13866: I3 ^predict-no N984)
  11724. <=WM: (13852: N983 ^status complete)
  11725. <=WM: (13851: I3 ^predict-no N983)
  11726. --- Firing Productions (IE) For State At Depth 1 ---
  11727. --- Inner Elaboration Phase, active level 1 (S1) ---
  11728. Firing monitor*world
  11729. -->
  11730. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11731. --- Change Working Memory (IE) ---
  11732. --- END Application Phase ---
  11733. --- Output Phase ---
  11734. ENV: Agent did: predict-no for direction L in state State-A
  11735. In State-A moving L
  11736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11737. predict error 0
  11738. dir: dir isU
  11739. --- END Output Phase ---
  11740. \-/--- Input Phase ---
  11741. =>WM: (13870: I2 ^dir U)
  11742. =>WM: (13869: I2 ^reward 1)
  11743. =>WM: (13868: I2 ^see 0)
  11744. =>WM: (13867: N984 ^status complete)
  11745. <=WM: (13855: I2 ^dir L)
  11746. <=WM: (13854: I2 ^reward 1)
  11747. <=WM: (13853: I2 ^see 0)
  11748. =>WM: (13871: I2 ^level-1 L0-root)
  11749. <=WM: (13856: I2 ^level-1 L1-root)
  11750. --- END Input Phase ---
  11751. --- Proposal Phase ---
  11752. --- Inner Elaboration Phase, active level 1 (S1) ---
  11753. Firing elaborate*copy-see-to-output-link
  11754. -->
  11755. (I3 ^see 0 +)
  11756. Firing elaborate*reward*based*on*reward
  11757. -->
  11758. (R988 ^value 1 +)
  11759. (R1 ^reward R988 +)
  11760. Firing propose*predict-yes
  11761. -->
  11762. (O1969 ^name predict-yes +)
  11763. (S1 ^operator O1969 +)
  11764. Firing propose*predict-no
  11765. -->
  11766. (O1970 ^name predict-no +)
  11767. (S1 ^operator O1970 +)
  11768. Firing rl*prefer*rvt*predict-no*H0*2
  11769. -->
  11770. (S1 ^operator O1968 = 1.)
  11771. Firing rl*prefer*rvt*predict-yes*H0*1
  11772. -->
  11773. (S1 ^operator O1967 = 0.)
  11774. Firing prefer*rvt*predict-yes*H0
  11775. -->
  11776. Firing prefer*rvt*predict-no*H0
  11777. -->
  11778. Firing elaborate*copy-dir-to-output-link
  11779. -->
  11780. (I3 ^dir U +)
  11781. inner elaboration loop at bottom goal.
  11782. Retracting elaborate*copy-see-to-output-link
  11783. -->
  11784. (I3 ^see 0 +)
  11785. Retracting propose*predict-no
  11786. -->
  11787. (O1968 ^name predict-no +)
  11788. (S1 ^operator O1968 +)
  11789. Retracting propose*predict-yes
  11790. -->
  11791. (O1967 ^name predict-yes +)
  11792. (S1 ^operator O1967 +)
  11793. Retracting elaborate*reward*based*on*reward
  11794. -->
  11795. (R987 ^value 1 +)
  11796. (R1 ^reward R987 +)
  11797. Retracting elaborate*copy-dir-to-output-link
  11798. -->
  11799. (I3 ^dir L +)
  11800. Retracting rl*prefer*rvt*predict-no*H0*6
  11801. -->
  11802. (S1 ^operator O1968 = 0.9997480945179411)
  11803. Retracting rl*prefer*rvt*predict-yes*H0*5
  11804. -->
  11805. (S1 ^operator O1967 = 0.264039703522277)
  11806. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  11807. -->
  11808. (S1 ^operator O1967 = -0.181727099742844)
  11809. =>WM: (13878: S1 ^operator O1970 +)
  11810. =>WM: (13877: S1 ^operator O1969 +)
  11811. =>WM: (13876: I3 ^dir U)
  11812. =>WM: (13875: O1970 ^name predict-no)
  11813. =>WM: (13874: O1969 ^name predict-yes)
  11814. =>WM: (13873: R988 ^value 1)
  11815. =>WM: (13872: R1 ^reward R988)
  11816. <=WM: (13863: S1 ^operator O1967 +)
  11817. <=WM: (13864: S1 ^operator O1968 +)
  11818. <=WM: (13865: S1 ^operator O1968)
  11819. <=WM: (13862: I3 ^dir L)
  11820. <=WM: (13858: R1 ^reward R987)
  11821. <=WM: (13861: O1968 ^name predict-no)
  11822. <=WM: (13860: O1967 ^name predict-yes)
  11823. <=WM: (13859: R987 ^value 1)
  11824. --- Inner Elaboration Phase, active level 1 (S1) ---
  11825. Firing prefer*rvt*predict-yes*H0
  11826. -->
  11827. Firing rl*prefer*rvt*predict-yes*H0*1
  11828. -->
  11829. (S1 ^operator O1969 = 0.)
  11830. Firing prefer*rvt*predict-no*H0
  11831. -->
  11832. Firing rl*prefer*rvt*predict-no*H0*2
  11833. -->
  11834. (S1 ^operator O1970 = 1.)
  11835. inner elaboration loop at bottom goal.
  11836. Retracting rl*prefer*rvt*predict-no*H0*2
  11837. -->
  11838. (S1 ^operator O1968 = 1.)
  11839. Retracting rl*prefer*rvt*predict-yes*H0*1
  11840. -->
  11841. (S1 ^operator O1967 = 0.)
  11842. --- END Proposal Phase ---
  11843. --- Decision Phase ---
  11844. RL update rl*prefer*rvt*predict-no*H0*6 0.999748 0 0.999748 -> 0.99979 0 0.99979(R,m,v=1,0.904762,0.086758)
  11845. =>WM: (13879: S1 ^operator O1970)
  11846. 985: O: O1970 (predict-no)
  11847. --- END Decision Phase ---
  11848. --- Application Phase ---
  11849. --- Firing Productions (PE) For State At Depth 1 ---
  11850. --- Inner Elaboration Phase, active level 1 (S1) ---
  11851. Firing apply*operator
  11852. -->
  11853. (I3 ^predict-no N985 + :O )
  11854. Firing apply*operator*complete
  11855. -->
  11856. (I3 ^predict-no N984 - :O )
  11857. inner elaboration loop at bottom goal.
  11858. --- Change Working Memory (PE) ---
  11859. =>WM: (13880: I3 ^predict-no N985)
  11860. <=WM: (13867: N984 ^status complete)
  11861. <=WM: (13866: I3 ^predict-no N984)
  11862. --- Firing Productions (IE) For State At Depth 1 ---
  11863. --- Inner Elaboration Phase, active level 1 (S1) ---
  11864. Firing monitor*world
  11865. -->
  11866. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11867. --- Change Working Memory (IE) ---
  11868. --- END Application Phase ---
  11869. --- Output Phase ---
  11870. ENV: Agent did: predict-no for direction U in state State-A
  11871. In State-A moving U
  11872. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11873. predict error 0
  11874. dir: dir isR
  11875. --- END Output Phase ---
  11876. |\---- Input Phase ---
  11877. =>WM: (13884: I2 ^dir R)
  11878. =>WM: (13883: I2 ^reward 1)
  11879. =>WM: (13882: I2 ^see 0)
  11880. =>WM: (13881: N985 ^status complete)
  11881. <=WM: (13870: I2 ^dir U)
  11882. <=WM: (13869: I2 ^reward 1)
  11883. <=WM: (13868: I2 ^see 0)
  11884. =>WM: (13885: I2 ^level-1 L0-root)
  11885. <=WM: (13871: I2 ^level-1 L0-root)
  11886. --- END Input Phase ---
  11887. --- Proposal Phase ---
  11888. --- Inner Elaboration Phase, active level 1 (S1) ---
  11889. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11890. -->
  11891. (S1 ^operator O1970 = -0.2817060109291377)
  11892. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11893. -->
  11894. (S1 ^operator O1969 = 0.6623600134734193)
  11895. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11896. -->
  11897. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11898. -->
  11899. Firing elaborate*copy-see-to-output-link
  11900. -->
  11901. (I3 ^see 0 +)
  11902. Firing elaborate*reward*based*on*reward
  11903. -->
  11904. (R989 ^value 1 +)
  11905. (R1 ^reward R989 +)
  11906. Firing propose*predict-yes
  11907. -->
  11908. (O1971 ^name predict-yes +)
  11909. (S1 ^operator O1971 +)
  11910. Firing propose*predict-no
  11911. -->
  11912. (O1972 ^name predict-no +)
  11913. (S1 ^operator O1972 +)
  11914. Firing rl*prefer*rvt*predict-no*H0*4
  11915. -->
  11916. (S1 ^operator O1970 = 0.3397683711152304)
  11917. Firing rl*prefer*rvt*predict-yes*H0*3
  11918. -->
  11919. (S1 ^operator O1969 = 0.3377183053124619)
  11920. Firing prefer*rvt*predict-yes*H0
  11921. -->
  11922. Firing prefer*rvt*predict-no*H0
  11923. -->
  11924. Firing elaborate*copy-dir-to-output-link
  11925. -->
  11926. (I3 ^dir R +)
  11927. inner elaboration loop at bottom goal.
  11928. Retracting elaborate*copy-see-to-output-link
  11929. -->
  11930. (I3 ^see 0 +)
  11931. Retracting propose*predict-no
  11932. -->
  11933. (O1970 ^name predict-no +)
  11934. (S1 ^operator O1970 +)
  11935. Retracting propose*predict-yes
  11936. -->
  11937. (O1969 ^name predict-yes +)
  11938. (S1 ^operator O1969 +)
  11939. Retracting elaborate*reward*based*on*reward
  11940. -->
  11941. (R988 ^value 1 +)
  11942. (R1 ^reward R988 +)
  11943. Retracting elaborate*copy-dir-to-output-link
  11944. -->
  11945. (I3 ^dir U +)
  11946. Retracting rl*prefer*rvt*predict-no*H0*2
  11947. -->
  11948. (S1 ^operator O1970 = 1.)
  11949. Retracting rl*prefer*rvt*predict-yes*H0*1
  11950. -->
  11951. (S1 ^operator O1969 = 0.)
  11952. =>WM: (13892: S1 ^operator O1972 +)
  11953. =>WM: (13891: S1 ^operator O1971 +)
  11954. =>WM: (13890: I3 ^dir R)
  11955. =>WM: (13889: O1972 ^name predict-no)
  11956. =>WM: (13888: O1971 ^name predict-yes)
  11957. =>WM: (13887: R989 ^value 1)
  11958. =>WM: (13886: R1 ^reward R989)
  11959. <=WM: (13877: S1 ^operator O1969 +)
  11960. <=WM: (13878: S1 ^operator O1970 +)
  11961. <=WM: (13879: S1 ^operator O1970)
  11962. <=WM: (13876: I3 ^dir U)
  11963. <=WM: (13872: R1 ^reward R988)
  11964. <=WM: (13875: O1970 ^name predict-no)
  11965. <=WM: (13874: O1969 ^name predict-yes)
  11966. <=WM: (13873: R988 ^value 1)
  11967. --- Inner Elaboration Phase, active level 1 (S1) ---
  11968. Firing prefer*rvt*predict-yes*H0
  11969. -->
  11970. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11971. -->
  11972. (S1 ^operator O1971 = 0.6623600134734193)
  11973. Firing rl*prefer*rvt*predict-yes*H0*3
  11974. -->
  11975. (S1 ^operator O1971 = 0.3377183053124619)
  11976. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  11977. -->
  11978. Firing prefer*rvt*predict-no*H0
  11979. -->
  11980. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11981. -->
  11982. (S1 ^operator O1972 = -0.2817060109291377)
  11983. Firing rl*prefer*rvt*predict-no*H0*4
  11984. -->
  11985. (S1 ^operator O1972 = 0.3397683711152304)
  11986. Firing prefer*rvt*predict-no*H0*4*v1*H1
  11987. -->
  11988. inner elaboration loop at bottom goal.
  11989. Retracting rl*prefer*rvt*predict-no*H0*4
  11990. -->
  11991. (S1 ^operator O1970 = 0.3397683711152304)
  11992. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  11993. -->
  11994. (S1 ^operator O1970 = -0.2817060109291377)
  11995. Retracting rl*prefer*rvt*predict-yes*H0*3
  11996. -->
  11997. (S1 ^operator O1969 = 0.3377183053124619)
  11998. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  11999. -->
  12000. (S1 ^operator O1969 = 0.6623600134734193)
  12001. --- END Proposal Phase ---
  12002. --- Decision Phase ---
  12003. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12004. =>WM: (13893: S1 ^operator O1971)
  12005. 986: O: O1971 (predict-yes)
  12006. --- END Decision Phase ---
  12007. --- Application Phase ---
  12008. --- Firing Productions (PE) For State At Depth 1 ---
  12009. --- Inner Elaboration Phase, active level 1 (S1) ---
  12010. Firing apply*operator
  12011. -->
  12012. (I3 ^predict-yes N986 + :O )
  12013. Firing apply*operator*complete
  12014. -->
  12015. (I3 ^predict-no N985 - :O )
  12016. inner elaboration loop at bottom goal.
  12017. --- Change Working Memory (PE) ---
  12018. =>WM: (13894: I3 ^predict-yes N986)
  12019. <=WM: (13881: N985 ^status complete)
  12020. <=WM: (13880: I3 ^predict-no N985)
  12021. --- Firing Productions (IE) For State At Depth 1 ---
  12022. --- Inner Elaboration Phase, active level 1 (S1) ---
  12023. Firing monitor*world
  12024. -->
  12025. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12026. --- Change Working Memory (IE) ---
  12027. --- END Application Phase ---
  12028. --- Output Phase ---
  12029. ENV: Agent did: predict-yes for direction R in state State-A
  12030. In State-A moving R
  12031. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12032. predict error 0
  12033. dir: dir isU
  12034. --- END Output Phase ---
  12035. /|\--- Input Phase ---
  12036. =>WM: (13898: I2 ^dir U)
  12037. =>WM: (13897: I2 ^reward 1)
  12038. =>WM: (13896: I2 ^see 1)
  12039. =>WM: (13895: N986 ^status complete)
  12040. <=WM: (13884: I2 ^dir R)
  12041. <=WM: (13883: I2 ^reward 1)
  12042. <=WM: (13882: I2 ^see 0)
  12043. =>WM: (13899: I2 ^level-1 R1-root)
  12044. <=WM: (13885: I2 ^level-1 L0-root)
  12045. --- END Input Phase ---
  12046. --- Proposal Phase ---
  12047. --- Inner Elaboration Phase, active level 1 (S1) ---
  12048. Firing elaborate*copy-see-to-output-link
  12049. -->
  12050. (I3 ^see 1 +)
  12051. Firing elaborate*reward*based*on*reward
  12052. -->
  12053. (R990 ^value 1 +)
  12054. (R1 ^reward R990 +)
  12055. Firing propose*predict-yes
  12056. -->
  12057. (O1973 ^name predict-yes +)
  12058. (S1 ^operator O1973 +)
  12059. Firing propose*predict-no
  12060. -->
  12061. (O1974 ^name predict-no +)
  12062. (S1 ^operator O1974 +)
  12063. Firing rl*prefer*rvt*predict-no*H0*2
  12064. -->
  12065. (S1 ^operator O1972 = 1.)
  12066. Firing rl*prefer*rvt*predict-yes*H0*1
  12067. -->
  12068. (S1 ^operator O1971 = 0.)
  12069. Firing prefer*rvt*predict-yes*H0
  12070. -->
  12071. Firing prefer*rvt*predict-no*H0
  12072. -->
  12073. Firing elaborate*copy-dir-to-output-link
  12074. -->
  12075. (I3 ^dir U +)
  12076. inner elaboration loop at bottom goal.
  12077. Retracting elaborate*copy-see-to-output-link
  12078. -->
  12079. (I3 ^see 0 +)
  12080. Retracting propose*predict-no
  12081. -->
  12082. (O1972 ^name predict-no +)
  12083. (S1 ^operator O1972 +)
  12084. Retracting propose*predict-yes
  12085. -->
  12086. (O1971 ^name predict-yes +)
  12087. (S1 ^operator O1971 +)
  12088. Retracting elaborate*reward*based*on*reward
  12089. -->
  12090. (R989 ^value 1 +)
  12091. (R1 ^reward R989 +)
  12092. Retracting elaborate*copy-dir-to-output-link
  12093. -->
  12094. (I3 ^dir R +)
  12095. Retracting rl*prefer*rvt*predict-no*H0*4
  12096. -->
  12097. (S1 ^operator O1972 = 0.3397683711152304)
  12098. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  12099. -->
  12100. (S1 ^operator O1972 = -0.2817060109291377)
  12101. Retracting rl*prefer*rvt*predict-yes*H0*3
  12102. -->
  12103. (S1 ^operator O1971 = 0.3377183053124619)
  12104. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  12105. -->
  12106. (S1 ^operator O1971 = 0.6623600134734193)
  12107. =>WM: (13907: S1 ^operator O1974 +)
  12108. =>WM: (13906: S1 ^operator O1973 +)
  12109. =>WM: (13905: I3 ^dir U)
  12110. =>WM: (13904: O1974 ^name predict-no)
  12111. =>WM: (13903: O1973 ^name predict-yes)
  12112. =>WM: (13902: R990 ^value 1)
  12113. =>WM: (13901: R1 ^reward R990)
  12114. =>WM: (13900: I3 ^see 1)
  12115. <=WM: (13891: S1 ^operator O1971 +)
  12116. <=WM: (13893: S1 ^operator O1971)
  12117. <=WM: (13892: S1 ^operator O1972 +)
  12118. <=WM: (13890: I3 ^dir R)
  12119. <=WM: (13886: R1 ^reward R989)
  12120. <=WM: (13857: I3 ^see 0)
  12121. <=WM: (13889: O1972 ^name predict-no)
  12122. <=WM: (13888: O1971 ^name predict-yes)
  12123. <=WM: (13887: R989 ^value 1)
  12124. --- Inner Elaboration Phase, active level 1 (S1) ---
  12125. Firing prefer*rvt*predict-yes*H0
  12126. -->
  12127. Firing rl*prefer*rvt*predict-yes*H0*1
  12128. -->
  12129. (S1 ^operator O1973 = 0.)
  12130. Firing prefer*rvt*predict-no*H0
  12131. -->
  12132. Firing rl*prefer*rvt*predict-no*H0*2
  12133. -->
  12134. (S1 ^operator O1974 = 1.)
  12135. inner elaboration loop at bottom goal.
  12136. Retracting rl*prefer*rvt*predict-no*H0*2
  12137. -->
  12138. (S1 ^operator O1972 = 1.)
  12139. Retracting rl*prefer*rvt*predict-yes*H0*1
  12140. -->
  12141. (S1 ^operator O1971 = 0.)
  12142. --- END Proposal Phase ---
  12143. --- Decision Phase ---
  12144. RL update rl*prefer*rvt*predict-yes*H0*3 0.59012 -0.252401 0.337718 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.89759,0.092479)
  12145. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409971 0.252389 0.66236 -> 0.409962 0.25239 0.662353(R,m,v=1,1,0)
  12146. =>WM: (13908: S1 ^operator O1974)
  12147. 987: O: O1974 (predict-no)
  12148. --- END Decision Phase ---
  12149. --- Application Phase ---
  12150. --- Firing Productions (PE) For State At Depth 1 ---
  12151. --- Inner Elaboration Phase, active level 1 (S1) ---
  12152. Firing apply*operator
  12153. -->
  12154. (I3 ^predict-no N987 + :O )
  12155. Firing apply*operator*complete
  12156. -->
  12157. (I3 ^predict-yes N986 - :O )
  12158. inner elaboration loop at bottom goal.
  12159. --- Change Working Memory (PE) ---
  12160. =>WM: (13909: I3 ^predict-no N987)
  12161. <=WM: (13895: N986 ^status complete)
  12162. <=WM: (13894: I3 ^predict-yes N986)
  12163. --- Firing Productions (IE) For State At Depth 1 ---
  12164. --- Inner Elaboration Phase, active level 1 (S1) ---
  12165. Firing monitor*world
  12166. -->
  12167. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12168. --- Change Working Memory (IE) ---
  12169. --- END Application Phase ---
  12170. --- Output Phase ---
  12171. ENV: Agent did: predict-no for direction U in state State-B
  12172. In State-B moving U
  12173. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12174. predict error 0
  12175. dir: dir isR
  12176. --- END Output Phase ---
  12177. -/|--- Input Phase ---
  12178. =>WM: (13913: I2 ^dir R)
  12179. =>WM: (13912: I2 ^reward 1)
  12180. =>WM: (13911: I2 ^see 0)
  12181. =>WM: (13910: N987 ^status complete)
  12182. <=WM: (13898: I2 ^dir U)
  12183. <=WM: (13897: I2 ^reward 1)
  12184. <=WM: (13896: I2 ^see 1)
  12185. =>WM: (13914: I2 ^level-1 R1-root)
  12186. <=WM: (13899: I2 ^level-1 R1-root)
  12187. --- END Input Phase ---
  12188. --- Proposal Phase ---
  12189. --- Inner Elaboration Phase, active level 1 (S1) ---
  12190. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12191. -->
  12192. (S1 ^operator O1973 = -0.1070236389116304)
  12193. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12194. -->
  12195. (S1 ^operator O1974 = 0.6602453025755203)
  12196. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12197. -->
  12198. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12199. -->
  12200. Firing elaborate*copy-see-to-output-link
  12201. -->
  12202. (I3 ^see 0 +)
  12203. Firing elaborate*reward*based*on*reward
  12204. -->
  12205. (R991 ^value 1 +)
  12206. (R1 ^reward R991 +)
  12207. Firing propose*predict-yes
  12208. -->
  12209. (O1975 ^name predict-yes +)
  12210. (S1 ^operator O1975 +)
  12211. Firing propose*predict-no
  12212. -->
  12213. (O1976 ^name predict-no +)
  12214. (S1 ^operator O1976 +)
  12215. Firing rl*prefer*rvt*predict-no*H0*4
  12216. -->
  12217. (S1 ^operator O1974 = 0.3397683711152304)
  12218. Firing rl*prefer*rvt*predict-yes*H0*3
  12219. -->
  12220. (S1 ^operator O1973 = 0.3377118983309207)
  12221. Firing prefer*rvt*predict-yes*H0
  12222. -->
  12223. Firing prefer*rvt*predict-no*H0
  12224. -->
  12225. Firing elaborate*copy-dir-to-output-link
  12226. -->
  12227. (I3 ^dir R +)
  12228. inner elaboration loop at bottom goal.
  12229. Retracting elaborate*copy-see-to-output-link
  12230. -->
  12231. (I3 ^see 1 +)
  12232. Retracting propose*predict-no
  12233. -->
  12234. (O1974 ^name predict-no +)
  12235. (S1 ^operator O1974 +)
  12236. Retracting propose*predict-yes
  12237. -->
  12238. (O1973 ^name predict-yes +)
  12239. (S1 ^operator O1973 +)
  12240. Retracting elaborate*reward*based*on*reward
  12241. -->
  12242. (R990 ^value 1 +)
  12243. (R1 ^reward R990 +)
  12244. Retracting elaborate*copy-dir-to-output-link
  12245. -->
  12246. (I3 ^dir U +)
  12247. Retracting rl*prefer*rvt*predict-no*H0*2
  12248. -->
  12249. (S1 ^operator O1974 = 1.)
  12250. Retracting rl*prefer*rvt*predict-yes*H0*1
  12251. -->
  12252. (S1 ^operator O1973 = 0.)
  12253. =>WM: (13922: S1 ^operator O1976 +)
  12254. =>WM: (13921: S1 ^operator O1975 +)
  12255. =>WM: (13920: I3 ^dir R)
  12256. =>WM: (13919: O1976 ^name predict-no)
  12257. =>WM: (13918: O1975 ^name predict-yes)
  12258. =>WM: (13917: R991 ^value 1)
  12259. =>WM: (13916: R1 ^reward R991)
  12260. =>WM: (13915: I3 ^see 0)
  12261. <=WM: (13906: S1 ^operator O1973 +)
  12262. <=WM: (13907: S1 ^operator O1974 +)
  12263. <=WM: (13908: S1 ^operator O1974)
  12264. <=WM: (13905: I3 ^dir U)
  12265. <=WM: (13901: R1 ^reward R990)
  12266. <=WM: (13900: I3 ^see 1)
  12267. <=WM: (13904: O1974 ^name predict-no)
  12268. <=WM: (13903: O1973 ^name predict-yes)
  12269. <=WM: (13902: R990 ^value 1)
  12270. --- Inner Elaboration Phase, active level 1 (S1) ---
  12271. Firing prefer*rvt*predict-yes*H0
  12272. -->
  12273. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12274. -->
  12275. (S1 ^operator O1975 = -0.1070236389116304)
  12276. Firing rl*prefer*rvt*predict-yes*H0*3
  12277. -->
  12278. (S1 ^operator O1975 = 0.3377118983309207)
  12279. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12280. -->
  12281. Firing prefer*rvt*predict-no*H0
  12282. -->
  12283. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12284. -->
  12285. (S1 ^operator O1976 = 0.6602453025755203)
  12286. Firing rl*prefer*rvt*predict-no*H0*4
  12287. -->
  12288. (S1 ^operator O1976 = 0.3397683711152304)
  12289. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12290. -->
  12291. inner elaboration loop at bottom goal.
  12292. Retracting rl*prefer*rvt*predict-no*H0*4
  12293. -->
  12294. (S1 ^operator O1974 = 0.3397683711152304)
  12295. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12296. -->
  12297. (S1 ^operator O1974 = 0.6602453025755203)
  12298. Retracting rl*prefer*rvt*predict-yes*H0*3
  12299. -->
  12300. (S1 ^operator O1973 = 0.3377118983309207)
  12301. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12302. -->
  12303. (S1 ^operator O1973 = -0.1070236389116304)
  12304. --- END Proposal Phase ---
  12305. --- Decision Phase ---
  12306. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12307. =>WM: (13923: S1 ^operator O1976)
  12308. 988: O: O1976 (predict-no)
  12309. --- END Decision Phase ---
  12310. --- Application Phase ---
  12311. --- Firing Productions (PE) For State At Depth 1 ---
  12312. --- Inner Elaboration Phase, active level 1 (S1) ---
  12313. Firing apply*operator
  12314. -->
  12315. (I3 ^predict-no N988 + :O )
  12316. Firing apply*operator*complete
  12317. -->
  12318. (I3 ^predict-no N987 - :O )
  12319. inner elaboration loop at bottom goal.
  12320. --- Change Working Memory (PE) ---
  12321. =>WM: (13924: I3 ^predict-no N988)
  12322. <=WM: (13910: N987 ^status complete)
  12323. <=WM: (13909: I3 ^predict-no N987)
  12324. --- Firing Productions (IE) For State At Depth 1 ---
  12325. --- Inner Elaboration Phase, active level 1 (S1) ---
  12326. Firing monitor*world
  12327. -->
  12328. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12329. --- Change Working Memory (IE) ---
  12330. --- END Application Phase ---
  12331. --- Output Phase ---
  12332. ENV: Agent did: predict-no for direction R in state State-B
  12333. In State-B moving R
  12334. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12335. predict error 0
  12336. dir: dir isR
  12337. --- END Output Phase ---
  12338. \-/--- Input Phase ---
  12339. =>WM: (13928: I2 ^dir R)
  12340. =>WM: (13927: I2 ^reward 1)
  12341. =>WM: (13926: I2 ^see 0)
  12342. =>WM: (13925: N988 ^status complete)
  12343. <=WM: (13913: I2 ^dir R)
  12344. <=WM: (13912: I2 ^reward 1)
  12345. <=WM: (13911: I2 ^see 0)
  12346. =>WM: (13929: I2 ^level-1 R0-root)
  12347. <=WM: (13914: I2 ^level-1 R1-root)
  12348. --- END Input Phase ---
  12349. --- Proposal Phase ---
  12350. --- Inner Elaboration Phase, active level 1 (S1) ---
  12351. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12352. -->
  12353. (S1 ^operator O1976 = 0.660152441867348)
  12354. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12355. -->
  12356. (S1 ^operator O1975 = -0.1028953566115423)
  12357. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12358. -->
  12359. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12360. -->
  12361. Firing elaborate*copy-see-to-output-link
  12362. -->
  12363. (I3 ^see 0 +)
  12364. Firing elaborate*reward*based*on*reward
  12365. -->
  12366. (R992 ^value 1 +)
  12367. (R1 ^reward R992 +)
  12368. Firing propose*predict-yes
  12369. -->
  12370. (O1977 ^name predict-yes +)
  12371. (S1 ^operator O1977 +)
  12372. Firing propose*predict-no
  12373. -->
  12374. (O1978 ^name predict-no +)
  12375. (S1 ^operator O1978 +)
  12376. Firing rl*prefer*rvt*predict-no*H0*4
  12377. -->
  12378. (S1 ^operator O1976 = 0.3397683711152304)
  12379. Firing rl*prefer*rvt*predict-yes*H0*3
  12380. -->
  12381. (S1 ^operator O1975 = 0.3377118983309207)
  12382. Firing prefer*rvt*predict-yes*H0
  12383. -->
  12384. Firing prefer*rvt*predict-no*H0
  12385. -->
  12386. Firing elaborate*copy-dir-to-output-link
  12387. -->
  12388. (I3 ^dir R +)
  12389. inner elaboration loop at bottom goal.
  12390. Retracting elaborate*copy-see-to-output-link
  12391. -->
  12392. (I3 ^see 0 +)
  12393. Retracting propose*predict-no
  12394. -->
  12395. (O1976 ^name predict-no +)
  12396. (S1 ^operator O1976 +)
  12397. Retracting propose*predict-yes
  12398. -->
  12399. (O1975 ^name predict-yes +)
  12400. (S1 ^operator O1975 +)
  12401. Retracting elaborate*reward*based*on*reward
  12402. -->
  12403. (R991 ^value 1 +)
  12404. (R1 ^reward R991 +)
  12405. Retracting elaborate*copy-dir-to-output-link
  12406. -->
  12407. (I3 ^dir R +)
  12408. Retracting rl*prefer*rvt*predict-no*H0*4
  12409. -->
  12410. (S1 ^operator O1976 = 0.3397683711152304)
  12411. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  12412. -->
  12413. (S1 ^operator O1976 = 0.6602453025755203)
  12414. Retracting rl*prefer*rvt*predict-yes*H0*3
  12415. -->
  12416. (S1 ^operator O1975 = 0.3377118983309207)
  12417. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  12418. -->
  12419. (S1 ^operator O1975 = -0.1070236389116304)
  12420. =>WM: (13935: S1 ^operator O1978 +)
  12421. =>WM: (13934: S1 ^operator O1977 +)
  12422. =>WM: (13933: O1978 ^name predict-no)
  12423. =>WM: (13932: O1977 ^name predict-yes)
  12424. =>WM: (13931: R992 ^value 1)
  12425. =>WM: (13930: R1 ^reward R992)
  12426. <=WM: (13921: S1 ^operator O1975 +)
  12427. <=WM: (13922: S1 ^operator O1976 +)
  12428. <=WM: (13923: S1 ^operator O1976)
  12429. <=WM: (13916: R1 ^reward R991)
  12430. <=WM: (13919: O1976 ^name predict-no)
  12431. <=WM: (13918: O1975 ^name predict-yes)
  12432. <=WM: (13917: R991 ^value 1)
  12433. --- Inner Elaboration Phase, active level 1 (S1) ---
  12434. Firing prefer*rvt*predict-yes*H0
  12435. -->
  12436. Firing rl*prefer*rvt*predict-yes*H0*3
  12437. -->
  12438. (S1 ^operator O1977 = 0.3377118983309207)
  12439. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12440. -->
  12441. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12442. -->
  12443. (S1 ^operator O1977 = -0.1028953566115423)
  12444. Firing prefer*rvt*predict-no*H0
  12445. -->
  12446. Firing rl*prefer*rvt*predict-no*H0*4
  12447. -->
  12448. (S1 ^operator O1978 = 0.3397683711152304)
  12449. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12450. -->
  12451. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12452. -->
  12453. (S1 ^operator O1978 = 0.660152441867348)
  12454. inner elaboration loop at bottom goal.
  12455. Retracting rl*prefer*rvt*predict-no*H0*4
  12456. -->
  12457. (S1 ^operator O1976 = 0.3397683711152304)
  12458. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12459. -->
  12460. (S1 ^operator O1976 = 0.660152441867348)
  12461. Retracting rl*prefer*rvt*predict-yes*H0*3
  12462. -->
  12463. (S1 ^operator O1975 = 0.3377118983309207)
  12464. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12465. -->
  12466. (S1 ^operator O1975 = -0.1028953566115423)
  12467. --- END Proposal Phase ---
  12468. --- Decision Phase ---
  12469. RL update rl*prefer*rvt*predict-no*H0*4 0.570252 -0.230483 0.339768 -> 0.570251 -0.230483 0.339767(R,m,v=1,0.874251,0.110598)
  12470. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429763 0.230483 0.660245 -> 0.429761 0.230483 0.660244(R,m,v=1,1,0)
  12471. =>WM: (13936: S1 ^operator O1978)
  12472. 989: O: O1978 (predict-no)
  12473. --- END Decision Phase ---
  12474. --- Application Phase ---
  12475. --- Firing Productions (PE) For State At Depth 1 ---
  12476. --- Inner Elaboration Phase, active level 1 (S1) ---
  12477. Firing apply*operator
  12478. -->
  12479. (I3 ^predict-no N989 + :O )
  12480. Firing apply*operator*complete
  12481. -->
  12482. (I3 ^predict-no N988 - :O )
  12483. inner elaboration loop at bottom goal.
  12484. --- Change Working Memory (PE) ---
  12485. =>WM: (13937: I3 ^predict-no N989)
  12486. <=WM: (13925: N988 ^status complete)
  12487. <=WM: (13924: I3 ^predict-no N988)
  12488. --- Firing Productions (IE) For State At Depth 1 ---
  12489. --- Inner Elaboration Phase, active level 1 (S1) ---
  12490. Firing monitor*world
  12491. -->
  12492. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12493. --- Change Working Memory (IE) ---
  12494. --- END Application Phase ---
  12495. --- Output Phase ---
  12496. ENV: Agent did: predict-no for direction R in state State-B
  12497. In State-B moving R
  12498. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12499. predict error 0
  12500. dir: dir isL
  12501. --- END Output Phase ---
  12502. |\---- Input Phase ---
  12503. =>WM: (13941: I2 ^dir L)
  12504. =>WM: (13940: I2 ^reward 1)
  12505. =>WM: (13939: I2 ^see 0)
  12506. =>WM: (13938: N989 ^status complete)
  12507. <=WM: (13928: I2 ^dir R)
  12508. <=WM: (13927: I2 ^reward 1)
  12509. <=WM: (13926: I2 ^see 0)
  12510. =>WM: (13942: I2 ^level-1 R0-root)
  12511. <=WM: (13929: I2 ^level-1 R0-root)
  12512. --- END Input Phase ---
  12513. --- Proposal Phase ---
  12514. --- Inner Elaboration Phase, active level 1 (S1) ---
  12515. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12516. -->
  12517. (S1 ^operator O1977 = 0.7358428664482317)
  12518. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12519. -->
  12520. Firing elaborate*copy-see-to-output-link
  12521. -->
  12522. (I3 ^see 0 +)
  12523. Firing elaborate*reward*based*on*reward
  12524. -->
  12525. (R993 ^value 1 +)
  12526. (R1 ^reward R993 +)
  12527. Firing propose*predict-yes
  12528. -->
  12529. (O1979 ^name predict-yes +)
  12530. (S1 ^operator O1979 +)
  12531. Firing propose*predict-no
  12532. -->
  12533. (O1980 ^name predict-no +)
  12534. (S1 ^operator O1980 +)
  12535. Firing rl*prefer*rvt*predict-no*H0*6
  12536. -->
  12537. (S1 ^operator O1978 = 0.999790145818646)
  12538. Firing rl*prefer*rvt*predict-yes*H0*5
  12539. -->
  12540. (S1 ^operator O1977 = 0.264039703522277)
  12541. Firing prefer*rvt*predict-yes*H0
  12542. -->
  12543. Firing prefer*rvt*predict-no*H0
  12544. -->
  12545. Firing elaborate*copy-dir-to-output-link
  12546. -->
  12547. (I3 ^dir L +)
  12548. inner elaboration loop at bottom goal.
  12549. Retracting elaborate*copy-see-to-output-link
  12550. -->
  12551. (I3 ^see 0 +)
  12552. Retracting propose*predict-no
  12553. -->
  12554. (O1978 ^name predict-no +)
  12555. (S1 ^operator O1978 +)
  12556. Retracting propose*predict-yes
  12557. -->
  12558. (O1977 ^name predict-yes +)
  12559. (S1 ^operator O1977 +)
  12560. Retracting elaborate*reward*based*on*reward
  12561. -->
  12562. (R992 ^value 1 +)
  12563. (R1 ^reward R992 +)
  12564. Retracting elaborate*copy-dir-to-output-link
  12565. -->
  12566. (I3 ^dir R +)
  12567. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  12568. -->
  12569. (S1 ^operator O1978 = 0.660152441867348)
  12570. Retracting rl*prefer*rvt*predict-no*H0*4
  12571. -->
  12572. (S1 ^operator O1978 = 0.339767253617308)
  12573. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  12574. -->
  12575. (S1 ^operator O1977 = -0.1028953566115423)
  12576. Retracting rl*prefer*rvt*predict-yes*H0*3
  12577. -->
  12578. (S1 ^operator O1977 = 0.3377118983309207)
  12579. =>WM: (13949: S1 ^operator O1980 +)
  12580. =>WM: (13948: S1 ^operator O1979 +)
  12581. =>WM: (13947: I3 ^dir L)
  12582. =>WM: (13946: O1980 ^name predict-no)
  12583. =>WM: (13945: O1979 ^name predict-yes)
  12584. =>WM: (13944: R993 ^value 1)
  12585. =>WM: (13943: R1 ^reward R993)
  12586. <=WM: (13934: S1 ^operator O1977 +)
  12587. <=WM: (13935: S1 ^operator O1978 +)
  12588. <=WM: (13936: S1 ^operator O1978)
  12589. <=WM: (13920: I3 ^dir R)
  12590. <=WM: (13930: R1 ^reward R992)
  12591. <=WM: (13933: O1978 ^name predict-no)
  12592. <=WM: (13932: O1977 ^name predict-yes)
  12593. <=WM: (13931: R992 ^value 1)
  12594. --- Inner Elaboration Phase, active level 1 (S1) ---
  12595. Firing prefer*rvt*predict-yes*H0
  12596. -->
  12597. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12598. -->
  12599. (S1 ^operator O1979 = 0.7358428664482317)
  12600. Firing rl*prefer*rvt*predict-yes*H0*5
  12601. -->
  12602. (S1 ^operator O1979 = 0.264039703522277)
  12603. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  12604. -->
  12605. Firing prefer*rvt*predict-no*H0
  12606. -->
  12607. Firing rl*prefer*rvt*predict-no*H0*6
  12608. -->
  12609. (S1 ^operator O1980 = 0.999790145818646)
  12610. inner elaboration loop at bottom goal.
  12611. Retracting rl*prefer*rvt*predict-no*H0*6
  12612. -->
  12613. (S1 ^operator O1978 = 0.999790145818646)
  12614. Retracting rl*prefer*rvt*predict-yes*H0*5
  12615. -->
  12616. (S1 ^operator O1977 = 0.264039703522277)
  12617. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12618. -->
  12619. (S1 ^operator O1977 = 0.7358428664482317)
  12620. --- END Proposal Phase ---
  12621. --- Decision Phase ---
  12622. RL update rl*prefer*rvt*predict-no*H0*4 0.570251 -0.230483 0.339767 -> 0.570257 -0.230484 0.339774(R,m,v=1,0.875,0.11003)
  12623. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429665 0.230487 0.660152 -> 0.429673 0.230487 0.66016(R,m,v=1,1,0)
  12624. =>WM: (13950: S1 ^operator O1979)
  12625. 990: O: O1979 (predict-yes)
  12626. --- END Decision Phase ---
  12627. --- Application Phase ---
  12628. --- Firing Productions (PE) For State At Depth 1 ---
  12629. --- Inner Elaboration Phase, active level 1 (S1) ---
  12630. Firing apply*operator
  12631. -->
  12632. (I3 ^predict-yes N990 + :O )
  12633. Firing apply*operator*complete
  12634. -->
  12635. (I3 ^predict-no N989 - :O )
  12636. inner elaboration loop at bottom goal.
  12637. --- Change Working Memory (PE) ---
  12638. =>WM: (13951: I3 ^predict-yes N990)
  12639. <=WM: (13938: N989 ^status complete)
  12640. <=WM: (13937: I3 ^predict-no N989)
  12641. --- Firing Productions (IE) For State At Depth 1 ---
  12642. --- Inner Elaboration Phase, active level 1 (S1) ---
  12643. Firing monitor*world
  12644. -->
  12645. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12646. --- Change Working Memory (IE) ---
  12647. --- END Application Phase ---
  12648. --- Output Phase ---
  12649. ENV: Agent did: predict-yes for direction L in state State-B
  12650. In State-B moving L
  12651. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12652. predict error 0
  12653. dir: dir isU
  12654. --- END Output Phase ---
  12655. /|\--- Input Phase ---
  12656. =>WM: (13955: I2 ^dir U)
  12657. =>WM: (13954: I2 ^reward 1)
  12658. =>WM: (13953: I2 ^see 1)
  12659. =>WM: (13952: N990 ^status complete)
  12660. <=WM: (13941: I2 ^dir L)
  12661. <=WM: (13940: I2 ^reward 1)
  12662. <=WM: (13939: I2 ^see 0)
  12663. =>WM: (13956: I2 ^level-1 L1-root)
  12664. <=WM: (13942: I2 ^level-1 R0-root)
  12665. --- END Input Phase ---
  12666. --- Proposal Phase ---
  12667. --- Inner Elaboration Phase, active level 1 (S1) ---
  12668. Firing elaborate*copy-see-to-output-link
  12669. -->
  12670. (I3 ^see 1 +)
  12671. Firing elaborate*reward*based*on*reward
  12672. -->
  12673. (R994 ^value 1 +)
  12674. (R1 ^reward R994 +)
  12675. Firing propose*predict-yes
  12676. -->
  12677. (O1981 ^name predict-yes +)
  12678. (S1 ^operator O1981 +)
  12679. Firing propose*predict-no
  12680. -->
  12681. (O1982 ^name predict-no +)
  12682. (S1 ^operator O1982 +)
  12683. Firing rl*prefer*rvt*predict-no*H0*2
  12684. -->
  12685. (S1 ^operator O1980 = 1.)
  12686. Firing rl*prefer*rvt*predict-yes*H0*1
  12687. -->
  12688. (S1 ^operator O1979 = 0.)
  12689. Firing prefer*rvt*predict-yes*H0
  12690. -->
  12691. Firing prefer*rvt*predict-no*H0
  12692. -->
  12693. Firing elaborate*copy-dir-to-output-link
  12694. -->
  12695. (I3 ^dir U +)
  12696. inner elaboration loop at bottom goal.
  12697. Retracting elaborate*copy-see-to-output-link
  12698. -->
  12699. (I3 ^see 0 +)
  12700. Retracting propose*predict-no
  12701. -->
  12702. (O1980 ^name predict-no +)
  12703. (S1 ^operator O1980 +)
  12704. Retracting propose*predict-yes
  12705. -->
  12706. (O1979 ^name predict-yes +)
  12707. (S1 ^operator O1979 +)
  12708. Retracting elaborate*reward*based*on*reward
  12709. -->
  12710. (R993 ^value 1 +)
  12711. (R1 ^reward R993 +)
  12712. Retracting elaborate*copy-dir-to-output-link
  12713. -->
  12714. (I3 ^dir L +)
  12715. Retracting rl*prefer*rvt*predict-no*H0*6
  12716. -->
  12717. (S1 ^operator O1980 = 0.999790145818646)
  12718. Retracting rl*prefer*rvt*predict-yes*H0*5
  12719. -->
  12720. (S1 ^operator O1979 = 0.264039703522277)
  12721. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  12722. -->
  12723. (S1 ^operator O1979 = 0.7358428664482317)
  12724. =>WM: (13964: S1 ^operator O1982 +)
  12725. =>WM: (13963: S1 ^operator O1981 +)
  12726. =>WM: (13962: I3 ^dir U)
  12727. =>WM: (13961: O1982 ^name predict-no)
  12728. =>WM: (13960: O1981 ^name predict-yes)
  12729. =>WM: (13959: R994 ^value 1)
  12730. =>WM: (13958: R1 ^reward R994)
  12731. =>WM: (13957: I3 ^see 1)
  12732. <=WM: (13948: S1 ^operator O1979 +)
  12733. <=WM: (13950: S1 ^operator O1979)
  12734. <=WM: (13949: S1 ^operator O1980 +)
  12735. <=WM: (13947: I3 ^dir L)
  12736. <=WM: (13943: R1 ^reward R993)
  12737. <=WM: (13915: I3 ^see 0)
  12738. <=WM: (13946: O1980 ^name predict-no)
  12739. <=WM: (13945: O1979 ^name predict-yes)
  12740. <=WM: (13944: R993 ^value 1)
  12741. --- Inner Elaboration Phase, active level 1 (S1) ---
  12742. Firing prefer*rvt*predict-yes*H0
  12743. -->
  12744. Firing rl*prefer*rvt*predict-yes*H0*1
  12745. -->
  12746. (S1 ^operator O1981 = 0.)
  12747. Firing prefer*rvt*predict-no*H0
  12748. -->
  12749. Firing rl*prefer*rvt*predict-no*H0*2
  12750. -->
  12751. (S1 ^operator O1982 = 1.)
  12752. inner elaboration loop at bottom goal.
  12753. Retracting rl*prefer*rvt*predict-no*H0*2
  12754. -->
  12755. (S1 ^operator O1980 = 1.)
  12756. Retracting rl*prefer*rvt*predict-yes*H0*1
  12757. -->
  12758. (S1 ^operator O1979 = 0.)
  12759. --- END Proposal Phase ---
  12760. --- Decision Phase ---
  12761. RL update rl*prefer*rvt*predict-yes*H0*5 0.554425 -0.290385 0.26404 -> 0.554434 -0.290385 0.264049(R,m,v=1,0.876404,0.108932)
  12762. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.44546 0.290383 0.735843 -> 0.445471 0.290384 0.735854(R,m,v=1,1,0)
  12763. =>WM: (13965: S1 ^operator O1982)
  12764. 991: O: O1982 (predict-no)
  12765. --- END Decision Phase ---
  12766. --- Application Phase ---
  12767. --- Firing Productions (PE) For State At Depth 1 ---
  12768. --- Inner Elaboration Phase, active level 1 (S1) ---
  12769. Firing apply*operator
  12770. -->
  12771. (I3 ^predict-no N991 + :O )
  12772. Firing apply*operator*complete
  12773. -->
  12774. (I3 ^predict-yes N990 - :O )
  12775. inner elaboration loop at bottom goal.
  12776. --- Change Working Memory (PE) ---
  12777. =>WM: (13966: I3 ^predict-no N991)
  12778. <=WM: (13952: N990 ^status complete)
  12779. <=WM: (13951: I3 ^predict-yes N990)
  12780. --- Firing Productions (IE) For State At Depth 1 ---
  12781. --- Inner Elaboration Phase, active level 1 (S1) ---
  12782. Firing monitor*world
  12783. -->
  12784. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12785. --- Change Working Memory (IE) ---
  12786. --- END Application Phase ---
  12787. --- Output Phase ---
  12788. ENV: Agent did: predict-no for direction U in state State-A
  12789. In State-A moving U
  12790. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12791. predict error 0
  12792. dir: dir isR
  12793. --- END Output Phase ---
  12794. ---- Input Phase ---
  12795. =>WM: (13970: I2 ^dir R)
  12796. =>WM: (13969: I2 ^reward 1)
  12797. =>WM: (13968: I2 ^see 0)
  12798. =>WM: (13967: N991 ^status complete)
  12799. <=WM: (13955: I2 ^dir U)
  12800. <=WM: (13954: I2 ^reward 1)
  12801. <=WM: (13953: I2 ^see 1)
  12802. =>WM: (13971: I2 ^level-1 L1-root)
  12803. <=WM: (13956: I2 ^level-1 L1-root)
  12804. --- END Input Phase ---
  12805. --- Proposal Phase ---
  12806. --- Inner Elaboration Phase, active level 1 (S1) ---
  12807. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12808. -->
  12809. (S1 ^operator O1982 = -0.2714224023553999)
  12810. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12811. -->
  12812. (S1 ^operator O1981 = 0.662219375073587)
  12813. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12814. -->
  12815. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12816. -->
  12817. Firing elaborate*copy-see-to-output-link
  12818. -->
  12819. (I3 ^see 0 +)
  12820. Firing elaborate*reward*based*on*reward
  12821. -->
  12822. (R995 ^value 1 +)
  12823. (R1 ^reward R995 +)
  12824. Firing propose*predict-yes
  12825. -->
  12826. (O1983 ^name predict-yes +)
  12827. (S1 ^operator O1983 +)
  12828. Firing propose*predict-no
  12829. -->
  12830. (O1984 ^name predict-no +)
  12831. (S1 ^operator O1984 +)
  12832. Firing rl*prefer*rvt*predict-no*H0*4
  12833. -->
  12834. (S1 ^operator O1982 = 0.339773810196969)
  12835. Firing rl*prefer*rvt*predict-yes*H0*3
  12836. -->
  12837. (S1 ^operator O1981 = 0.3377118983309207)
  12838. Firing prefer*rvt*predict-yes*H0
  12839. -->
  12840. Firing prefer*rvt*predict-no*H0
  12841. -->
  12842. Firing elaborate*copy-dir-to-output-link
  12843. -->
  12844. (I3 ^dir R +)
  12845. inner elaboration loop at bottom goal.
  12846. Retracting elaborate*copy-see-to-output-link
  12847. -->
  12848. (I3 ^see 1 +)
  12849. Retracting propose*predict-no
  12850. -->
  12851. (O1982 ^name predict-no +)
  12852. (S1 ^operator O1982 +)
  12853. Retracting propose*predict-yes
  12854. -->
  12855. (O1981 ^name predict-yes +)
  12856. (S1 ^operator O1981 +)
  12857. Retracting elaborate*reward*based*on*reward
  12858. -->
  12859. (R994 ^value 1 +)
  12860. (R1 ^reward R994 +)
  12861. Retracting elaborate*copy-dir-to-output-link
  12862. -->
  12863. (I3 ^dir U +)
  12864. Retracting rl*prefer*rvt*predict-no*H0*2
  12865. -->
  12866. (S1 ^operator O1982 = 1.)
  12867. Retracting rl*prefer*rvt*predict-yes*H0*1
  12868. -->
  12869. (S1 ^operator O1981 = 0.)
  12870. =>WM: (13979: S1 ^operator O1984 +)
  12871. =>WM: (13978: S1 ^operator O1983 +)
  12872. =>WM: (13977: I3 ^dir R)
  12873. =>WM: (13976: O1984 ^name predict-no)
  12874. =>WM: (13975: O1983 ^name predict-yes)
  12875. =>WM: (13974: R995 ^value 1)
  12876. =>WM: (13973: R1 ^reward R995)
  12877. =>WM: (13972: I3 ^see 0)
  12878. <=WM: (13963: S1 ^operator O1981 +)
  12879. <=WM: (13964: S1 ^operator O1982 +)
  12880. <=WM: (13965: S1 ^operator O1982)
  12881. <=WM: (13962: I3 ^dir U)
  12882. <=WM: (13958: R1 ^reward R994)
  12883. <=WM: (13957: I3 ^see 1)
  12884. <=WM: (13961: O1982 ^name predict-no)
  12885. <=WM: (13960: O1981 ^name predict-yes)
  12886. <=WM: (13959: R994 ^value 1)
  12887. --- Inner Elaboration Phase, active level 1 (S1) ---
  12888. Firing prefer*rvt*predict-yes*H0
  12889. -->
  12890. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12891. -->
  12892. (S1 ^operator O1983 = 0.662219375073587)
  12893. Firing rl*prefer*rvt*predict-yes*H0*3
  12894. -->
  12895. (S1 ^operator O1983 = 0.3377118983309207)
  12896. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  12897. -->
  12898. Firing prefer*rvt*predict-no*H0
  12899. -->
  12900. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12901. -->
  12902. (S1 ^operator O1984 = -0.2714224023553999)
  12903. Firing rl*prefer*rvt*predict-no*H0*4
  12904. -->
  12905. (S1 ^operator O1984 = 0.339773810196969)
  12906. Firing prefer*rvt*predict-no*H0*4*v1*H1
  12907. -->
  12908. inner elaboration loop at bottom goal.
  12909. Retracting rl*prefer*rvt*predict-no*H0*4
  12910. -->
  12911. (S1 ^operator O1982 = 0.339773810196969)
  12912. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  12913. -->
  12914. (S1 ^operator O1982 = -0.2714224023553999)
  12915. Retracting rl*prefer*rvt*predict-yes*H0*3
  12916. -->
  12917. (S1 ^operator O1981 = 0.3377118983309207)
  12918. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  12919. -->
  12920. (S1 ^operator O1981 = 0.662219375073587)
  12921. --- END Proposal Phase ---
  12922. --- Decision Phase ---
  12923. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12924. =>WM: (13980: S1 ^operator O1983)
  12925. 992: O: O1983 (predict-yes)
  12926. --- END Decision Phase ---
  12927. --- Application Phase ---
  12928. --- Firing Productions (PE) For State At Depth 1 ---
  12929. --- Inner Elaboration Phase, active level 1 (S1) ---
  12930. Firing apply*operator
  12931. -->
  12932. (I3 ^predict-yes N992 + :O )
  12933. Firing apply*operator*complete
  12934. -->
  12935. (I3 ^predict-no N991 - :O )
  12936. inner elaboration loop at bottom goal.
  12937. --- Change Working Memory (PE) ---
  12938. =>WM: (13981: I3 ^predict-yes N992)
  12939. <=WM: (13967: N991 ^status complete)
  12940. <=WM: (13966: I3 ^predict-no N991)
  12941. --- Firing Productions (IE) For State At Depth 1 ---
  12942. --- Inner Elaboration Phase, active level 1 (S1) ---
  12943. Firing monitor*world
  12944. -->
  12945. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12946. --- Change Working Memory (IE) ---
  12947. --- END Application Phase ---
  12948. --- Output Phase ---
  12949. ENV: Agent did: predict-yes for direction R in state State-A
  12950. In State-A moving R
  12951. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12952. predict error 0
  12953. dir: dir isU
  12954. --- END Output Phase ---
  12955. /|--- Input Phase ---
  12956. =>WM: (13985: I2 ^dir U)
  12957. =>WM: (13984: I2 ^reward 1)
  12958. =>WM: (13983: I2 ^see 1)
  12959. =>WM: (13982: N992 ^status complete)
  12960. <=WM: (13970: I2 ^dir R)
  12961. <=WM: (13969: I2 ^reward 1)
  12962. <=WM: (13968: I2 ^see 0)
  12963. =>WM: (13986: I2 ^level-1 R1-root)
  12964. <=WM: (13971: I2 ^level-1 L1-root)
  12965. --- END Input Phase ---
  12966. --- Proposal Phase ---
  12967. --- Inner Elaboration Phase, active level 1 (S1) ---
  12968. Firing elaborate*copy-see-to-output-link
  12969. -->
  12970. (I3 ^see 1 +)
  12971. Firing elaborate*reward*based*on*reward
  12972. -->
  12973. (R996 ^value 1 +)
  12974. (R1 ^reward R996 +)
  12975. Firing propose*predict-yes
  12976. -->
  12977. (O1985 ^name predict-yes +)
  12978. (S1 ^operator O1985 +)
  12979. Firing propose*predict-no
  12980. -->
  12981. (O1986 ^name predict-no +)
  12982. (S1 ^operator O1986 +)
  12983. Firing rl*prefer*rvt*predict-no*H0*2
  12984. -->
  12985. (S1 ^operator O1984 = 1.)
  12986. Firing rl*prefer*rvt*predict-yes*H0*1
  12987. -->
  12988. (S1 ^operator O1983 = 0.)
  12989. Firing prefer*rvt*predict-yes*H0
  12990. -->
  12991. Firing prefer*rvt*predict-no*H0
  12992. -->
  12993. Firing elaborate*copy-dir-to-output-link
  12994. -->
  12995. (I3 ^dir U +)
  12996. inner elaboration loop at bottom goal.
  12997. Retracting elaborate*copy-see-to-output-link
  12998. -->
  12999. (I3 ^see 0 +)
  13000. Retracting propose*predict-no
  13001. -->
  13002. (O1984 ^name predict-no +)
  13003. (S1 ^operator O1984 +)
  13004. Retracting propose*predict-yes
  13005. -->
  13006. (O1983 ^name predict-yes +)
  13007. (S1 ^operator O1983 +)
  13008. Retracting elaborate*reward*based*on*reward
  13009. -->
  13010. (R995 ^value 1 +)
  13011. (R1 ^reward R995 +)
  13012. Retracting elaborate*copy-dir-to-output-link
  13013. -->
  13014. (I3 ^dir R +)
  13015. Retracting rl*prefer*rvt*predict-no*H0*4
  13016. -->
  13017. (S1 ^operator O1984 = 0.339773810196969)
  13018. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  13019. -->
  13020. (S1 ^operator O1984 = -0.2714224023553999)
  13021. Retracting rl*prefer*rvt*predict-yes*H0*3
  13022. -->
  13023. (S1 ^operator O1983 = 0.3377118983309207)
  13024. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  13025. -->
  13026. (S1 ^operator O1983 = 0.662219375073587)
  13027. =>WM: (13994: S1 ^operator O1986 +)
  13028. =>WM: (13993: S1 ^operator O1985 +)
  13029. =>WM: (13992: I3 ^dir U)
  13030. =>WM: (13991: O1986 ^name predict-no)
  13031. =>WM: (13990: O1985 ^name predict-yes)
  13032. =>WM: (13989: R996 ^value 1)
  13033. =>WM: (13988: R1 ^reward R996)
  13034. =>WM: (13987: I3 ^see 1)
  13035. <=WM: (13978: S1 ^operator O1983 +)
  13036. <=WM: (13980: S1 ^operator O1983)
  13037. <=WM: (13979: S1 ^operator O1984 +)
  13038. <=WM: (13977: I3 ^dir R)
  13039. <=WM: (13973: R1 ^reward R995)
  13040. <=WM: (13972: I3 ^see 0)
  13041. <=WM: (13976: O1984 ^name predict-no)
  13042. <=WM: (13975: O1983 ^name predict-yes)
  13043. <=WM: (13974: R995 ^value 1)
  13044. --- Inner Elaboration Phase, active level 1 (S1) ---
  13045. Firing prefer*rvt*predict-yes*H0
  13046. -->
  13047. Firing rl*prefer*rvt*predict-yes*H0*1
  13048. -->
  13049. (S1 ^operator O1985 = 0.)
  13050. Firing prefer*rvt*predict-no*H0
  13051. -->
  13052. Firing rl*prefer*rvt*predict-no*H0*2
  13053. -->
  13054. (S1 ^operator O1986 = 1.)
  13055. inner elaboration loop at bottom goal.
  13056. Retracting rl*prefer*rvt*predict-no*H0*2
  13057. -->
  13058. (S1 ^operator O1984 = 1.)
  13059. Retracting rl*prefer*rvt*predict-yes*H0*1
  13060. -->
  13061. (S1 ^operator O1983 = 0.)
  13062. --- END Proposal Phase ---
  13063. --- Decision Phase ---
  13064. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.590119 -0.252401 0.337718(R,m,v=1,0.898204,0.0919847)
  13065. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409809 0.252411 0.662219 -> 0.409816 0.25241 0.662226(R,m,v=1,1,0)
  13066. =>WM: (13995: S1 ^operator O1986)
  13067. 993: O: O1986 (predict-no)
  13068. --- END Decision Phase ---
  13069. --- Application Phase ---
  13070. --- Firing Productions (PE) For State At Depth 1 ---
  13071. --- Inner Elaboration Phase, active level 1 (S1) ---
  13072. Firing apply*operator
  13073. -->
  13074. (I3 ^predict-no N993 + :O )
  13075. Firing apply*operator*complete
  13076. -->
  13077. (I3 ^predict-yes N992 - :O )
  13078. inner elaboration loop at bottom goal.
  13079. --- Change Working Memory (PE) ---
  13080. =>WM: (13996: I3 ^predict-no N993)
  13081. <=WM: (13982: N992 ^status complete)
  13082. <=WM: (13981: I3 ^predict-yes N992)
  13083. --- Firing Productions (IE) For State At Depth 1 ---
  13084. --- Inner Elaboration Phase, active level 1 (S1) ---
  13085. Firing monitor*world
  13086. -->
  13087. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13088. --- Change Working Memory (IE) ---
  13089. --- END Application Phase ---
  13090. --- Output Phase ---
  13091. ENV: Agent did: predict-no for direction U in state State-B
  13092. In State-B moving U
  13093. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13094. predict error 0
  13095. dir: dir isL
  13096. --- END Output Phase ---
  13097. \-/--- Input Phase ---
  13098. =>WM: (14000: I2 ^dir L)
  13099. =>WM: (13999: I2 ^reward 1)
  13100. =>WM: (13998: I2 ^see 0)
  13101. =>WM: (13997: N993 ^status complete)
  13102. <=WM: (13985: I2 ^dir U)
  13103. <=WM: (13984: I2 ^reward 1)
  13104. <=WM: (13983: I2 ^see 1)
  13105. =>WM: (14001: I2 ^level-1 R1-root)
  13106. <=WM: (13986: I2 ^level-1 R1-root)
  13107. --- END Input Phase ---
  13108. --- Proposal Phase ---
  13109. --- Inner Elaboration Phase, active level 1 (S1) ---
  13110. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13111. -->
  13112. (S1 ^operator O1985 = 0.7362544663116062)
  13113. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13114. -->
  13115. Firing elaborate*copy-see-to-output-link
  13116. -->
  13117. (I3 ^see 0 +)
  13118. Firing elaborate*reward*based*on*reward
  13119. -->
  13120. (R997 ^value 1 +)
  13121. (R1 ^reward R997 +)
  13122. Firing propose*predict-yes
  13123. -->
  13124. (O1987 ^name predict-yes +)
  13125. (S1 ^operator O1987 +)
  13126. Firing propose*predict-no
  13127. -->
  13128. (O1988 ^name predict-no +)
  13129. (S1 ^operator O1988 +)
  13130. Firing rl*prefer*rvt*predict-no*H0*6
  13131. -->
  13132. (S1 ^operator O1986 = 0.999790145818646)
  13133. Firing rl*prefer*rvt*predict-yes*H0*5
  13134. -->
  13135. (S1 ^operator O1985 = 0.2640492015925779)
  13136. Firing prefer*rvt*predict-yes*H0
  13137. -->
  13138. Firing prefer*rvt*predict-no*H0
  13139. -->
  13140. Firing elaborate*copy-dir-to-output-link
  13141. -->
  13142. (I3 ^dir L +)
  13143. inner elaboration loop at bottom goal.
  13144. Retracting elaborate*copy-see-to-output-link
  13145. -->
  13146. (I3 ^see 1 +)
  13147. Retracting propose*predict-no
  13148. -->
  13149. (O1986 ^name predict-no +)
  13150. (S1 ^operator O1986 +)
  13151. Retracting propose*predict-yes
  13152. -->
  13153. (O1985 ^name predict-yes +)
  13154. (S1 ^operator O1985 +)
  13155. Retracting elaborate*reward*based*on*reward
  13156. -->
  13157. (R996 ^value 1 +)
  13158. (R1 ^reward R996 +)
  13159. Retracting elaborate*copy-dir-to-output-link
  13160. -->
  13161. (I3 ^dir U +)
  13162. Retracting rl*prefer*rvt*predict-no*H0*2
  13163. -->
  13164. (S1 ^operator O1986 = 1.)
  13165. Retracting rl*prefer*rvt*predict-yes*H0*1
  13166. -->
  13167. (S1 ^operator O1985 = 0.)
  13168. =>WM: (14009: S1 ^operator O1988 +)
  13169. =>WM: (14008: S1 ^operator O1987 +)
  13170. =>WM: (14007: I3 ^dir L)
  13171. =>WM: (14006: O1988 ^name predict-no)
  13172. =>WM: (14005: O1987 ^name predict-yes)
  13173. =>WM: (14004: R997 ^value 1)
  13174. =>WM: (14003: R1 ^reward R997)
  13175. =>WM: (14002: I3 ^see 0)
  13176. <=WM: (13993: S1 ^operator O1985 +)
  13177. <=WM: (13994: S1 ^operator O1986 +)
  13178. <=WM: (13995: S1 ^operator O1986)
  13179. <=WM: (13992: I3 ^dir U)
  13180. <=WM: (13988: R1 ^reward R996)
  13181. <=WM: (13987: I3 ^see 1)
  13182. <=WM: (13991: O1986 ^name predict-no)
  13183. <=WM: (13990: O1985 ^name predict-yes)
  13184. <=WM: (13989: R996 ^value 1)
  13185. --- Inner Elaboration Phase, active level 1 (S1) ---
  13186. Firing prefer*rvt*predict-yes*H0
  13187. -->
  13188. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13189. -->
  13190. (S1 ^operator O1987 = 0.7362544663116062)
  13191. Firing rl*prefer*rvt*predict-yes*H0*5
  13192. -->
  13193. (S1 ^operator O1987 = 0.2640492015925779)
  13194. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13195. -->
  13196. Firing prefer*rvt*predict-no*H0
  13197. -->
  13198. Firing rl*prefer*rvt*predict-no*H0*6
  13199. -->
  13200. (S1 ^operator O1988 = 0.999790145818646)
  13201. inner elaboration loop at bottom goal.
  13202. Retracting rl*prefer*rvt*predict-no*H0*6
  13203. -->
  13204. (S1 ^operator O1986 = 0.999790145818646)
  13205. Retracting rl*prefer*rvt*predict-yes*H0*5
  13206. -->
  13207. (S1 ^operator O1985 = 0.2640492015925779)
  13208. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13209. -->
  13210. (S1 ^operator O1985 = 0.7362544663116062)
  13211. --- END Proposal Phase ---
  13212. --- Decision Phase ---
  13213. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13214. =>WM: (14010: S1 ^operator O1987)
  13215. 994: O: O1987 (predict-yes)
  13216. --- END Decision Phase ---
  13217. --- Application Phase ---
  13218. --- Firing Productions (PE) For State At Depth 1 ---
  13219. --- Inner Elaboration Phase, active level 1 (S1) ---
  13220. Firing apply*operator
  13221. -->
  13222. (I3 ^predict-yes N994 + :O )
  13223. Firing apply*operator*complete
  13224. -->
  13225. (I3 ^predict-no N993 - :O )
  13226. inner elaboration loop at bottom goal.
  13227. --- Change Working Memory (PE) ---
  13228. =>WM: (14011: I3 ^predict-yes N994)
  13229. <=WM: (13997: N993 ^status complete)
  13230. <=WM: (13996: I3 ^predict-no N993)
  13231. --- Firing Productions (IE) For State At Depth 1 ---
  13232. --- Inner Elaboration Phase, active level 1 (S1) ---
  13233. Firing monitor*world
  13234. -->
  13235. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13236. --- Change Working Memory (IE) ---
  13237. --- END Application Phase ---
  13238. --- Output Phase ---
  13239. ENV: Agent did: predict-yes for direction L in state State-B
  13240. In State-B moving L
  13241. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13242. predict error 0
  13243. dir: dir isL
  13244. --- END Output Phase ---
  13245. |\---- Input Phase ---
  13246. =>WM: (14015: I2 ^dir L)
  13247. =>WM: (14014: I2 ^reward 1)
  13248. =>WM: (14013: I2 ^see 1)
  13249. =>WM: (14012: N994 ^status complete)
  13250. <=WM: (14000: I2 ^dir L)
  13251. <=WM: (13999: I2 ^reward 1)
  13252. <=WM: (13998: I2 ^see 0)
  13253. =>WM: (14016: I2 ^level-1 L1-root)
  13254. <=WM: (14001: I2 ^level-1 R1-root)
  13255. --- END Input Phase ---
  13256. --- Proposal Phase ---
  13257. --- Inner Elaboration Phase, active level 1 (S1) ---
  13258. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13259. -->
  13260. (S1 ^operator O1987 = -0.181727099742844)
  13261. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13262. -->
  13263. Firing elaborate*copy-see-to-output-link
  13264. -->
  13265. (I3 ^see 1 +)
  13266. Firing elaborate*reward*based*on*reward
  13267. -->
  13268. (R998 ^value 1 +)
  13269. (R1 ^reward R998 +)
  13270. Firing propose*predict-yes
  13271. -->
  13272. (O1989 ^name predict-yes +)
  13273. (S1 ^operator O1989 +)
  13274. Firing propose*predict-no
  13275. -->
  13276. (O1990 ^name predict-no +)
  13277. (S1 ^operator O1990 +)
  13278. Firing rl*prefer*rvt*predict-no*H0*6
  13279. -->
  13280. (S1 ^operator O1988 = 0.999790145818646)
  13281. Firing rl*prefer*rvt*predict-yes*H0*5
  13282. -->
  13283. (S1 ^operator O1987 = 0.2640492015925779)
  13284. Firing prefer*rvt*predict-yes*H0
  13285. -->
  13286. Firing prefer*rvt*predict-no*H0
  13287. -->
  13288. Firing elaborate*copy-dir-to-output-link
  13289. -->
  13290. (I3 ^dir L +)
  13291. inner elaboration loop at bottom goal.
  13292. Retracting elaborate*copy-see-to-output-link
  13293. -->
  13294. (I3 ^see 0 +)
  13295. Retracting propose*predict-no
  13296. -->
  13297. (O1988 ^name predict-no +)
  13298. (S1 ^operator O1988 +)
  13299. Retracting propose*predict-yes
  13300. -->
  13301. (O1987 ^name predict-yes +)
  13302. (S1 ^operator O1987 +)
  13303. Retracting elaborate*reward*based*on*reward
  13304. -->
  13305. (R997 ^value 1 +)
  13306. (R1 ^reward R997 +)
  13307. Retracting elaborate*copy-dir-to-output-link
  13308. -->
  13309. (I3 ^dir L +)
  13310. Retracting rl*prefer*rvt*predict-no*H0*6
  13311. -->
  13312. (S1 ^operator O1988 = 0.999790145818646)
  13313. Retracting rl*prefer*rvt*predict-yes*H0*5
  13314. -->
  13315. (S1 ^operator O1987 = 0.2640492015925779)
  13316. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  13317. -->
  13318. (S1 ^operator O1987 = 0.7362544663116062)
  13319. =>WM: (14023: S1 ^operator O1990 +)
  13320. =>WM: (14022: S1 ^operator O1989 +)
  13321. =>WM: (14021: O1990 ^name predict-no)
  13322. =>WM: (14020: O1989 ^name predict-yes)
  13323. =>WM: (14019: R998 ^value 1)
  13324. =>WM: (14018: R1 ^reward R998)
  13325. =>WM: (14017: I3 ^see 1)
  13326. <=WM: (14008: S1 ^operator O1987 +)
  13327. <=WM: (14010: S1 ^operator O1987)
  13328. <=WM: (14009: S1 ^operator O1988 +)
  13329. <=WM: (14003: R1 ^reward R997)
  13330. <=WM: (14002: I3 ^see 0)
  13331. <=WM: (14006: O1988 ^name predict-no)
  13332. <=WM: (14005: O1987 ^name predict-yes)
  13333. <=WM: (14004: R997 ^value 1)
  13334. --- Inner Elaboration Phase, active level 1 (S1) ---
  13335. Firing prefer*rvt*predict-yes*H0
  13336. -->
  13337. Firing rl*prefer*rvt*predict-yes*H0*5
  13338. -->
  13339. (S1 ^operator O1989 = 0.2640492015925779)
  13340. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13341. -->
  13342. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13343. -->
  13344. (S1 ^operator O1989 = -0.181727099742844)
  13345. Firing prefer*rvt*predict-no*H0
  13346. -->
  13347. Firing rl*prefer*rvt*predict-no*H0*6
  13348. -->
  13349. (S1 ^operator O1990 = 0.999790145818646)
  13350. inner elaboration loop at bottom goal.
  13351. Retracting rl*prefer*rvt*predict-no*H0*6
  13352. -->
  13353. (S1 ^operator O1988 = 0.999790145818646)
  13354. Retracting rl*prefer*rvt*predict-yes*H0*5
  13355. -->
  13356. (S1 ^operator O1987 = 0.2640492015925779)
  13357. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13358. -->
  13359. (S1 ^operator O1987 = -0.181727099742844)
  13360. --- END Proposal Phase ---
  13361. --- Decision Phase ---
  13362. RL update rl*prefer*rvt*predict-yes*H0*5 0.554434 -0.290385 0.264049 -> 0.55441 -0.290386 0.264025(R,m,v=1,0.877095,0.108405)
  13363. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445864 0.29039 0.736254 -> 0.445836 0.29039 0.736226(R,m,v=1,1,0)
  13364. =>WM: (14024: S1 ^operator O1990)
  13365. 995: O: O1990 (predict-no)
  13366. --- END Decision Phase ---
  13367. --- Application Phase ---
  13368. --- Firing Productions (PE) For State At Depth 1 ---
  13369. --- Inner Elaboration Phase, active level 1 (S1) ---
  13370. Firing apply*operator
  13371. -->
  13372. (I3 ^predict-no N995 + :O )
  13373. Firing apply*operator*complete
  13374. -->
  13375. (I3 ^predict-yes N994 - :O )
  13376. inner elaboration loop at bottom goal.
  13377. --- Change Working Memory (PE) ---
  13378. =>WM: (14025: I3 ^predict-no N995)
  13379. <=WM: (14012: N994 ^status complete)
  13380. <=WM: (14011: I3 ^predict-yes N994)
  13381. --- Firing Productions (IE) For State At Depth 1 ---
  13382. --- Inner Elaboration Phase, active level 1 (S1) ---
  13383. Firing monitor*world
  13384. -->
  13385. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13386. --- Change Working Memory (IE) ---
  13387. --- END Application Phase ---
  13388. --- Output Phase ---
  13389. ENV: Agent did: predict-no for direction L in state State-A
  13390. In State-A moving L
  13391. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13392. predict error 0
  13393. dir: dir isL
  13394. --- END Output Phase ---
  13395. /|\--- Input Phase ---
  13396. =>WM: (14029: I2 ^dir L)
  13397. =>WM: (14028: I2 ^reward 1)
  13398. =>WM: (14027: I2 ^see 0)
  13399. =>WM: (14026: N995 ^status complete)
  13400. <=WM: (14015: I2 ^dir L)
  13401. <=WM: (14014: I2 ^reward 1)
  13402. <=WM: (14013: I2 ^see 1)
  13403. =>WM: (14030: I2 ^level-1 L0-root)
  13404. <=WM: (14016: I2 ^level-1 L1-root)
  13405. --- END Input Phase ---
  13406. --- Proposal Phase ---
  13407. --- Inner Elaboration Phase, active level 1 (S1) ---
  13408. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13409. -->
  13410. (S1 ^operator O1989 = -0.1386470047172653)
  13411. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13412. -->
  13413. Firing elaborate*copy-see-to-output-link
  13414. -->
  13415. (I3 ^see 0 +)
  13416. Firing elaborate*reward*based*on*reward
  13417. -->
  13418. (R999 ^value 1 +)
  13419. (R1 ^reward R999 +)
  13420. Firing propose*predict-yes
  13421. -->
  13422. (O1991 ^name predict-yes +)
  13423. (S1 ^operator O1991 +)
  13424. Firing propose*predict-no
  13425. -->
  13426. (O1992 ^name predict-no +)
  13427. (S1 ^operator O1992 +)
  13428. Firing rl*prefer*rvt*predict-no*H0*6
  13429. -->
  13430. (S1 ^operator O1990 = 0.999790145818646)
  13431. Firing rl*prefer*rvt*predict-yes*H0*5
  13432. -->
  13433. (S1 ^operator O1989 = 0.2640246623191502)
  13434. Firing prefer*rvt*predict-yes*H0
  13435. -->
  13436. Firing prefer*rvt*predict-no*H0
  13437. -->
  13438. Firing elaborate*copy-dir-to-output-link
  13439. -->
  13440. (I3 ^dir L +)
  13441. inner elaboration loop at bottom goal.
  13442. Retracting elaborate*copy-see-to-output-link
  13443. -->
  13444. (I3 ^see 1 +)
  13445. Retracting propose*predict-no
  13446. -->
  13447. (O1990 ^name predict-no +)
  13448. (S1 ^operator O1990 +)
  13449. Retracting propose*predict-yes
  13450. -->
  13451. (O1989 ^name predict-yes +)
  13452. (S1 ^operator O1989 +)
  13453. Retracting elaborate*reward*based*on*reward
  13454. -->
  13455. (R998 ^value 1 +)
  13456. (R1 ^reward R998 +)
  13457. Retracting elaborate*copy-dir-to-output-link
  13458. -->
  13459. (I3 ^dir L +)
  13460. Retracting rl*prefer*rvt*predict-no*H0*6
  13461. -->
  13462. (S1 ^operator O1990 = 0.999790145818646)
  13463. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  13464. -->
  13465. (S1 ^operator O1989 = -0.181727099742844)
  13466. Retracting rl*prefer*rvt*predict-yes*H0*5
  13467. -->
  13468. (S1 ^operator O1989 = 0.2640246623191502)
  13469. =>WM: (14037: S1 ^operator O1992 +)
  13470. =>WM: (14036: S1 ^operator O1991 +)
  13471. =>WM: (14035: O1992 ^name predict-no)
  13472. =>WM: (14034: O1991 ^name predict-yes)
  13473. =>WM: (14033: R999 ^value 1)
  13474. =>WM: (14032: R1 ^reward R999)
  13475. =>WM: (14031: I3 ^see 0)
  13476. <=WM: (14022: S1 ^operator O1989 +)
  13477. <=WM: (14023: S1 ^operator O1990 +)
  13478. <=WM: (14024: S1 ^operator O1990)
  13479. <=WM: (14018: R1 ^reward R998)
  13480. <=WM: (14017: I3 ^see 1)
  13481. <=WM: (14021: O1990 ^name predict-no)
  13482. <=WM: (14020: O1989 ^name predict-yes)
  13483. <=WM: (14019: R998 ^value 1)
  13484. --- Inner Elaboration Phase, active level 1 (S1) ---
  13485. Firing prefer*rvt*predict-yes*H0
  13486. -->
  13487. Firing rl*prefer*rvt*predict-yes*H0*5
  13488. -->
  13489. (S1 ^operator O1991 = 0.2640246623191502)
  13490. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13491. -->
  13492. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13493. -->
  13494. (S1 ^operator O1991 = -0.1386470047172653)
  13495. Firing prefer*rvt*predict-no*H0
  13496. -->
  13497. Firing rl*prefer*rvt*predict-no*H0*6
  13498. -->
  13499. (S1 ^operator O1992 = 0.999790145818646)
  13500. inner elaboration loop at bottom goal.
  13501. Retracting rl*prefer*rvt*predict-no*H0*6
  13502. -->
  13503. (S1 ^operator O1990 = 0.999790145818646)
  13504. Retracting rl*prefer*rvt*predict-yes*H0*5
  13505. -->
  13506. (S1 ^operator O1989 = 0.2640246623191502)
  13507. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13508. -->
  13509. (S1 ^operator O1989 = -0.1386470047172653)
  13510. --- END Proposal Phase ---
  13511. --- Decision Phase ---
  13512. RL update rl*prefer*rvt*predict-no*H0*6 0.99979 0 0.99979 -> 0.999825 0 0.999825(R,m,v=1,0.905405,0.0862291)
  13513. =>WM: (14038: S1 ^operator O1992)
  13514. 996: O: O1992 (predict-no)
  13515. --- END Decision Phase ---
  13516. --- Application Phase ---
  13517. --- Firing Productions (PE) For State At Depth 1 ---
  13518. --- Inner Elaboration Phase, active level 1 (S1) ---
  13519. Firing apply*operator
  13520. -->
  13521. (I3 ^predict-no N996 + :O )
  13522. Firing apply*operator*complete
  13523. -->
  13524. (I3 ^predict-no N995 - :O )
  13525. inner elaboration loop at bottom goal.
  13526. --- Change Working Memory (PE) ---
  13527. =>WM: (14039: I3 ^predict-no N996)
  13528. <=WM: (14026: N995 ^status complete)
  13529. <=WM: (14025: I3 ^predict-no N995)
  13530. --- Firing Productions (IE) For State At Depth 1 ---
  13531. --- Inner Elaboration Phase, active level 1 (S1) ---
  13532. Firing monitor*world
  13533. -->
  13534. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13535. --- Change Working Memory (IE) ---
  13536. --- END Application Phase ---
  13537. --- Output Phase ---
  13538. ENV: Agent did: predict-no for direction L in state State-A
  13539. In State-A moving L
  13540. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13541. predict error 0
  13542. dir: dir isL
  13543. --- END Output Phase ---
  13544. -/|--- Input Phase ---
  13545. =>WM: (14043: I2 ^dir L)
  13546. =>WM: (14042: I2 ^reward 1)
  13547. =>WM: (14041: I2 ^see 0)
  13548. =>WM: (14040: N996 ^status complete)
  13549. <=WM: (14029: I2 ^dir L)
  13550. <=WM: (14028: I2 ^reward 1)
  13551. <=WM: (14027: I2 ^see 0)
  13552. =>WM: (14044: I2 ^level-1 L0-root)
  13553. <=WM: (14030: I2 ^level-1 L0-root)
  13554. --- END Input Phase ---
  13555. --- Proposal Phase ---
  13556. --- Inner Elaboration Phase, active level 1 (S1) ---
  13557. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13558. -->
  13559. (S1 ^operator O1991 = -0.1386470047172653)
  13560. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13561. -->
  13562. Firing elaborate*copy-see-to-output-link
  13563. -->
  13564. (I3 ^see 0 +)
  13565. Firing elaborate*reward*based*on*reward
  13566. -->
  13567. (R1000 ^value 1 +)
  13568. (R1 ^reward R1000 +)
  13569. Firing propose*predict-yes
  13570. -->
  13571. (O1993 ^name predict-yes +)
  13572. (S1 ^operator O1993 +)
  13573. Firing propose*predict-no
  13574. -->
  13575. (O1994 ^name predict-no +)
  13576. (S1 ^operator O1994 +)
  13577. Firing rl*prefer*rvt*predict-no*H0*6
  13578. -->
  13579. (S1 ^operator O1992 = 0.9998251377735368)
  13580. Firing rl*prefer*rvt*predict-yes*H0*5
  13581. -->
  13582. (S1 ^operator O1991 = 0.2640246623191502)
  13583. Firing prefer*rvt*predict-yes*H0
  13584. -->
  13585. Firing prefer*rvt*predict-no*H0
  13586. -->
  13587. Firing elaborate*copy-dir-to-output-link
  13588. -->
  13589. (I3 ^dir L +)
  13590. inner elaboration loop at bottom goal.
  13591. Retracting elaborate*copy-see-to-output-link
  13592. -->
  13593. (I3 ^see 0 +)
  13594. Retracting propose*predict-no
  13595. -->
  13596. (O1992 ^name predict-no +)
  13597. (S1 ^operator O1992 +)
  13598. Retracting propose*predict-yes
  13599. -->
  13600. (O1991 ^name predict-yes +)
  13601. (S1 ^operator O1991 +)
  13602. Retracting elaborate*reward*based*on*reward
  13603. -->
  13604. (R999 ^value 1 +)
  13605. (R1 ^reward R999 +)
  13606. Retracting elaborate*copy-dir-to-output-link
  13607. -->
  13608. (I3 ^dir L +)
  13609. Retracting rl*prefer*rvt*predict-no*H0*6
  13610. -->
  13611. (S1 ^operator O1992 = 0.9998251377735368)
  13612. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13613. -->
  13614. (S1 ^operator O1991 = -0.1386470047172653)
  13615. Retracting rl*prefer*rvt*predict-yes*H0*5
  13616. -->
  13617. (S1 ^operator O1991 = 0.2640246623191502)
  13618. =>WM: (14050: S1 ^operator O1994 +)
  13619. =>WM: (14049: S1 ^operator O1993 +)
  13620. =>WM: (14048: O1994 ^name predict-no)
  13621. =>WM: (14047: O1993 ^name predict-yes)
  13622. =>WM: (14046: R1000 ^value 1)
  13623. =>WM: (14045: R1 ^reward R1000)
  13624. <=WM: (14036: S1 ^operator O1991 +)
  13625. <=WM: (14037: S1 ^operator O1992 +)
  13626. <=WM: (14038: S1 ^operator O1992)
  13627. <=WM: (14032: R1 ^reward R999)
  13628. <=WM: (14035: O1992 ^name predict-no)
  13629. <=WM: (14034: O1991 ^name predict-yes)
  13630. <=WM: (14033: R999 ^value 1)
  13631. --- Inner Elaboration Phase, active level 1 (S1) ---
  13632. Firing prefer*rvt*predict-yes*H0
  13633. -->
  13634. Firing rl*prefer*rvt*predict-yes*H0*5
  13635. -->
  13636. (S1 ^operator O1993 = 0.2640246623191502)
  13637. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  13638. -->
  13639. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13640. -->
  13641. (S1 ^operator O1993 = -0.1386470047172653)
  13642. Firing prefer*rvt*predict-no*H0
  13643. -->
  13644. Firing rl*prefer*rvt*predict-no*H0*6
  13645. -->
  13646. (S1 ^operator O1994 = 0.9998251377735368)
  13647. inner elaboration loop at bottom goal.
  13648. Retracting rl*prefer*rvt*predict-no*H0*6
  13649. -->
  13650. (S1 ^operator O1992 = 0.9998251377735368)
  13651. Retracting rl*prefer*rvt*predict-yes*H0*5
  13652. -->
  13653. (S1 ^operator O1991 = 0.2640246623191502)
  13654. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13655. -->
  13656. (S1 ^operator O1991 = -0.1386470047172653)
  13657. --- END Proposal Phase ---
  13658. --- Decision Phase ---
  13659. RL update rl*prefer*rvt*predict-no*H0*6 0.999825 0 0.999825 -> 0.999854 0 0.999854(R,m,v=1,0.90604,0.0857065)
  13660. =>WM: (14051: S1 ^operator O1994)
  13661. 997: O: O1994 (predict-no)
  13662. --- END Decision Phase ---
  13663. --- Application Phase ---
  13664. --- Firing Productions (PE) For State At Depth 1 ---
  13665. --- Inner Elaboration Phase, active level 1 (S1) ---
  13666. Firing apply*operator
  13667. -->
  13668. (I3 ^predict-no N997 + :O )
  13669. Firing apply*operator*complete
  13670. -->
  13671. (I3 ^predict-no N996 - :O )
  13672. inner elaboration loop at bottom goal.
  13673. --- Change Working Memory (PE) ---
  13674. =>WM: (14052: I3 ^predict-no N997)
  13675. <=WM: (14040: N996 ^status complete)
  13676. <=WM: (14039: I3 ^predict-no N996)
  13677. --- Firing Productions (IE) For State At Depth 1 ---
  13678. --- Inner Elaboration Phase, active level 1 (S1) ---
  13679. Firing monitor*world
  13680. -->
  13681. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13682. --- Change Working Memory (IE) ---
  13683. --- END Application Phase ---
  13684. --- Output Phase ---
  13685. ENV: Agent did: predict-no for direction L in state State-A
  13686. In State-A moving L
  13687. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13688. predict error 0
  13689. dir: dir isU
  13690. --- END Output Phase ---
  13691. \-/--- Input Phase ---
  13692. =>WM: (14056: I2 ^dir U)
  13693. =>WM: (14055: I2 ^reward 1)
  13694. =>WM: (14054: I2 ^see 0)
  13695. =>WM: (14053: N997 ^status complete)
  13696. <=WM: (14043: I2 ^dir L)
  13697. <=WM: (14042: I2 ^reward 1)
  13698. <=WM: (14041: I2 ^see 0)
  13699. =>WM: (14057: I2 ^level-1 L0-root)
  13700. <=WM: (14044: I2 ^level-1 L0-root)
  13701. --- END Input Phase ---
  13702. --- Proposal Phase ---
  13703. --- Inner Elaboration Phase, active level 1 (S1) ---
  13704. Firing elaborate*copy-see-to-output-link
  13705. -->
  13706. (I3 ^see 0 +)
  13707. Firing elaborate*reward*based*on*reward
  13708. -->
  13709. (R1001 ^value 1 +)
  13710. (R1 ^reward R1001 +)
  13711. Firing propose*predict-yes
  13712. -->
  13713. (O1995 ^name predict-yes +)
  13714. (S1 ^operator O1995 +)
  13715. Firing propose*predict-no
  13716. -->
  13717. (O1996 ^name predict-no +)
  13718. (S1 ^operator O1996 +)
  13719. Firing rl*prefer*rvt*predict-no*H0*2
  13720. -->
  13721. (S1 ^operator O1994 = 1.)
  13722. Firing rl*prefer*rvt*predict-yes*H0*1
  13723. -->
  13724. (S1 ^operator O1993 = 0.)
  13725. Firing prefer*rvt*predict-yes*H0
  13726. -->
  13727. Firing prefer*rvt*predict-no*H0
  13728. -->
  13729. Firing elaborate*copy-dir-to-output-link
  13730. -->
  13731. (I3 ^dir U +)
  13732. inner elaboration loop at bottom goal.
  13733. Retracting elaborate*copy-see-to-output-link
  13734. -->
  13735. (I3 ^see 0 +)
  13736. Retracting propose*predict-no
  13737. -->
  13738. (O1994 ^name predict-no +)
  13739. (S1 ^operator O1994 +)
  13740. Retracting propose*predict-yes
  13741. -->
  13742. (O1993 ^name predict-yes +)
  13743. (S1 ^operator O1993 +)
  13744. Retracting elaborate*reward*based*on*reward
  13745. -->
  13746. (R1000 ^value 1 +)
  13747. (R1 ^reward R1000 +)
  13748. Retracting elaborate*copy-dir-to-output-link
  13749. -->
  13750. (I3 ^dir L +)
  13751. Retracting rl*prefer*rvt*predict-no*H0*6
  13752. -->
  13753. (S1 ^operator O1994 = 0.9998542623222174)
  13754. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  13755. -->
  13756. (S1 ^operator O1993 = -0.1386470047172653)
  13757. Retracting rl*prefer*rvt*predict-yes*H0*5
  13758. -->
  13759. (S1 ^operator O1993 = 0.2640246623191502)
  13760. =>WM: (14064: S1 ^operator O1996 +)
  13761. =>WM: (14063: S1 ^operator O1995 +)
  13762. =>WM: (14062: I3 ^dir U)
  13763. =>WM: (14061: O1996 ^name predict-no)
  13764. =>WM: (14060: O1995 ^name predict-yes)
  13765. =>WM: (14059: R1001 ^value 1)
  13766. =>WM: (14058: R1 ^reward R1001)
  13767. <=WM: (14049: S1 ^operator O1993 +)
  13768. <=WM: (14050: S1 ^operator O1994 +)
  13769. <=WM: (14051: S1 ^operator O1994)
  13770. <=WM: (14007: I3 ^dir L)
  13771. <=WM: (14045: R1 ^reward R1000)
  13772. <=WM: (14048: O1994 ^name predict-no)
  13773. <=WM: (14047: O1993 ^name predict-yes)
  13774. <=WM: (14046: R1000 ^value 1)
  13775. --- Inner Elaboration Phase, active level 1 (S1) ---
  13776. Firing prefer*rvt*predict-yes*H0
  13777. -->
  13778. Firing rl*prefer*rvt*predict-yes*H0*1
  13779. -->
  13780. (S1 ^operator O1995 = 0.)
  13781. Firing prefer*rvt*predict-no*H0
  13782. -->
  13783. Firing rl*prefer*rvt*predict-no*H0*2
  13784. -->
  13785. (S1 ^operator O1996 = 1.)
  13786. inner elaboration loop at bottom goal.
  13787. Retracting rl*prefer*rvt*predict-no*H0*2
  13788. -->
  13789. (S1 ^operator O1994 = 1.)
  13790. Retracting rl*prefer*rvt*predict-yes*H0*1
  13791. -->
  13792. (S1 ^operator O1993 = 0.)
  13793. --- END Proposal Phase ---
  13794. --- Decision Phase ---
  13795. RL update rl*prefer*rvt*predict-no*H0*6 0.999854 0 0.999854 -> 0.999879 0 0.999879(R,m,v=1,0.906667,0.0851902)
  13796. =>WM: (14065: S1 ^operator O1996)
  13797. 998: O: O1996 (predict-no)
  13798. --- END Decision Phase ---
  13799. --- Application Phase ---
  13800. --- Firing Productions (PE) For State At Depth 1 ---
  13801. --- Inner Elaboration Phase, active level 1 (S1) ---
  13802. Firing apply*operator
  13803. -->
  13804. (I3 ^predict-no N998 + :O )
  13805. Firing apply*operator*complete
  13806. -->
  13807. (I3 ^predict-no N997 - :O )
  13808. inner elaboration loop at bottom goal.
  13809. --- Change Working Memory (PE) ---
  13810. =>WM: (14066: I3 ^predict-no N998)
  13811. <=WM: (14053: N997 ^status complete)
  13812. <=WM: (14052: I3 ^predict-no N997)
  13813. --- Firing Productions (IE) For State At Depth 1 ---
  13814. --- Inner Elaboration Phase, active level 1 (S1) ---
  13815. Firing monitor*world
  13816. -->
  13817. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13818. --- Change Working Memory (IE) ---
  13819. --- END Application Phase ---
  13820. --- Output Phase ---
  13821. ENV: Agent did: predict-no for direction U in state State-A
  13822. In State-A moving U
  13823. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13824. predict error 0
  13825. dir: dir isU
  13826. --- END Output Phase ---
  13827. |\---- Input Phase ---
  13828. =>WM: (14070: I2 ^dir U)
  13829. =>WM: (14069: I2 ^reward 1)
  13830. =>WM: (14068: I2 ^see 0)
  13831. =>WM: (14067: N998 ^status complete)
  13832. <=WM: (14056: I2 ^dir U)
  13833. <=WM: (14055: I2 ^reward 1)
  13834. <=WM: (14054: I2 ^see 0)
  13835. =>WM: (14071: I2 ^level-1 L0-root)
  13836. <=WM: (14057: I2 ^level-1 L0-root)
  13837. --- END Input Phase ---
  13838. --- Proposal Phase ---
  13839. --- Inner Elaboration Phase, active level 1 (S1) ---
  13840. Firing elaborate*copy-see-to-output-link
  13841. -->
  13842. (I3 ^see 0 +)
  13843. Firing elaborate*reward*based*on*reward
  13844. -->
  13845. (R1002 ^value 1 +)
  13846. (R1 ^reward R1002 +)
  13847. Firing propose*predict-yes
  13848. -->
  13849. (O1997 ^name predict-yes +)
  13850. (S1 ^operator O1997 +)
  13851. Firing propose*predict-no
  13852. -->
  13853. (O1998 ^name predict-no +)
  13854. (S1 ^operator O1998 +)
  13855. Firing rl*prefer*rvt*predict-no*H0*2
  13856. -->
  13857. (S1 ^operator O1996 = 1.)
  13858. Firing rl*prefer*rvt*predict-yes*H0*1
  13859. -->
  13860. (S1 ^operator O1995 = 0.)
  13861. Firing prefer*rvt*predict-yes*H0
  13862. -->
  13863. Firing prefer*rvt*predict-no*H0
  13864. -->
  13865. Firing elaborate*copy-dir-to-output-link
  13866. -->
  13867. (I3 ^dir U +)
  13868. inner elaboration loop at bottom goal.
  13869. Retracting elaborate*copy-see-to-output-link
  13870. -->
  13871. (I3 ^see 0 +)
  13872. Retracting propose*predict-no
  13873. -->
  13874. (O1996 ^name predict-no +)
  13875. (S1 ^operator O1996 +)
  13876. Retracting propose*predict-yes
  13877. -->
  13878. (O1995 ^name predict-yes +)
  13879. (S1 ^operator O1995 +)
  13880. Retracting elaborate*reward*based*on*reward
  13881. -->
  13882. (R1001 ^value 1 +)
  13883. (R1 ^reward R1001 +)
  13884. Retracting elaborate*copy-dir-to-output-link
  13885. -->
  13886. (I3 ^dir U +)
  13887. Retracting rl*prefer*rvt*predict-no*H0*2
  13888. -->
  13889. (S1 ^operator O1996 = 1.)
  13890. Retracting rl*prefer*rvt*predict-yes*H0*1
  13891. -->
  13892. (S1 ^operator O1995 = 0.)
  13893. =>WM: (14077: S1 ^operator O1998 +)
  13894. =>WM: (14076: S1 ^operator O1997 +)
  13895. =>WM: (14075: O1998 ^name predict-no)
  13896. =>WM: (14074: O1997 ^name predict-yes)
  13897. =>WM: (14073: R1002 ^value 1)
  13898. =>WM: (14072: R1 ^reward R1002)
  13899. <=WM: (14063: S1 ^operator O1995 +)
  13900. <=WM: (14064: S1 ^operator O1996 +)
  13901. <=WM: (14065: S1 ^operator O1996)
  13902. <=WM: (14058: R1 ^reward R1001)
  13903. <=WM: (14061: O1996 ^name predict-no)
  13904. <=WM: (14060: O1995 ^name predict-yes)
  13905. <=WM: (14059: R1001 ^value 1)
  13906. --- Inner Elaboration Phase, active level 1 (S1) ---
  13907. Firing prefer*rvt*predict-yes*H0
  13908. -->
  13909. Firing rl*prefer*rvt*predict-yes*H0*1
  13910. -->
  13911. (S1 ^operator O1997 = 0.)
  13912. Firing prefer*rvt*predict-no*H0
  13913. -->
  13914. Firing rl*prefer*rvt*predict-no*H0*2
  13915. -->
  13916. (S1 ^operator O1998 = 1.)
  13917. inner elaboration loop at bottom goal.
  13918. Retracting rl*prefer*rvt*predict-no*H0*2
  13919. -->
  13920. (S1 ^operator O1996 = 1.)
  13921. Retracting rl*prefer*rvt*predict-yes*H0*1
  13922. -->
  13923. (S1 ^operator O1995 = 0.)
  13924. --- END Proposal Phase ---
  13925. --- Decision Phase ---
  13926. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13927. =>WM: (14078: S1 ^operator O1998)
  13928. 999: O: O1998 (predict-no)
  13929. --- END Decision Phase ---
  13930. --- Application Phase ---
  13931. --- Firing Productions (PE) For State At Depth 1 ---
  13932. --- Inner Elaboration Phase, active level 1 (S1) ---
  13933. Firing apply*operator
  13934. -->
  13935. (I3 ^predict-no N999 + :O )
  13936. Firing apply*operator*complete
  13937. -->
  13938. (I3 ^predict-no N998 - :O )
  13939. inner elaboration loop at bottom goal.
  13940. --- Change Working Memory (PE) ---
  13941. =>WM: (14079: I3 ^predict-no N999)
  13942. <=WM: (14067: N998 ^status complete)
  13943. <=WM: (14066: I3 ^predict-no N998)
  13944. --- Firing Productions (IE) For State At Depth 1 ---
  13945. --- Inner Elaboration Phase, active level 1 (S1) ---
  13946. Firing monitor*world
  13947. -->
  13948. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13949. --- Change Working Memory (IE) ---
  13950. --- END Application Phase ---
  13951. --- Output Phase ---
  13952. ENV: Agent did: predict-no for direction U in state State-A
  13953. In State-A moving U
  13954. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13955. predict error 0
  13956. dir: dir isR
  13957. --- END Output Phase ---
  13958. /|\--- Input Phase ---
  13959. =>WM: (14083: I2 ^dir R)
  13960. =>WM: (14082: I2 ^reward 1)
  13961. =>WM: (14081: I2 ^see 0)
  13962. =>WM: (14080: N999 ^status complete)
  13963. <=WM: (14070: I2 ^dir U)
  13964. <=WM: (14069: I2 ^reward 1)
  13965. <=WM: (14068: I2 ^see 0)
  13966. =>WM: (14084: I2 ^level-1 L0-root)
  13967. <=WM: (14071: I2 ^level-1 L0-root)
  13968. --- END Input Phase ---
  13969. --- Proposal Phase ---
  13970. --- Inner Elaboration Phase, active level 1 (S1) ---
  13971. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  13972. -->
  13973. (S1 ^operator O1998 = -0.2817060109291377)
  13974. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  13975. -->
  13976. (S1 ^operator O1997 = 0.6623525109664488)
  13977. Firing prefer*rvt*predict-no*H0*4*v1*H1
  13978. -->
  13979. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  13980. -->
  13981. Firing elaborate*copy-see-to-output-link
  13982. -->
  13983. (I3 ^see 0 +)
  13984. Firing elaborate*reward*based*on*reward
  13985. -->
  13986. (R1003 ^value 1 +)
  13987. (R1 ^reward R1003 +)
  13988. Firing propose*predict-yes
  13989. -->
  13990. (O1999 ^name predict-yes +)
  13991. (S1 ^operator O1999 +)
  13992. Firing propose*predict-no
  13993. -->
  13994. (O2000 ^name predict-no +)
  13995. (S1 ^operator O2000 +)
  13996. Firing rl*prefer*rvt*predict-no*H0*4
  13997. -->
  13998. (S1 ^operator O1998 = 0.339773810196969)
  13999. Firing rl*prefer*rvt*predict-yes*H0*3
  14000. -->
  14001. (S1 ^operator O1997 = 0.337717515090074)
  14002. Firing prefer*rvt*predict-yes*H0
  14003. -->
  14004. Firing prefer*rvt*predict-no*H0
  14005. -->
  14006. Firing elaborate*copy-dir-to-output-link
  14007. -->
  14008. (I3 ^dir R +)
  14009. inner elaboration loop at bottom goal.
  14010. Retracting elaborate*copy-see-to-output-link
  14011. -->
  14012. (I3 ^see 0 +)
  14013. Retracting propose*predict-no
  14014. -->
  14015. (O1998 ^name predict-no +)
  14016. (S1 ^operator O1998 +)
  14017. Retracting propose*predict-yes
  14018. -->
  14019. (O1997 ^name predict-yes +)
  14020. (S1 ^operator O1997 +)
  14021. Retracting elaborate*reward*based*on*reward
  14022. -->
  14023. (R1002 ^value 1 +)
  14024. (R1 ^reward R1002 +)
  14025. Retracting elaborate*copy-dir-to-output-link
  14026. -->
  14027. (I3 ^dir U +)
  14028. Retracting rl*prefer*rvt*predict-no*H0*2
  14029. -->
  14030. (S1 ^operator O1998 = 1.)
  14031. Retracting rl*prefer*rvt*predict-yes*H0*1
  14032. -->
  14033. (S1 ^operator O1997 = 0.)
  14034. =>WM: (14091: S1 ^operator O2000 +)
  14035. =>WM: (14090: S1 ^operator O1999 +)
  14036. =>WM: (14089: I3 ^dir R)
  14037. =>WM: (14088: O2000 ^name predict-no)
  14038. =>WM: (14087: O1999 ^name predict-yes)
  14039. =>WM: (14086: R1003 ^value 1)
  14040. =>WM: (14085: R1 ^reward R1003)
  14041. <=WM: (14076: S1 ^operator O1997 +)
  14042. <=WM: (14077: S1 ^operator O1998 +)
  14043. <=WM: (14078: S1 ^operator O1998)
  14044. <=WM: (14062: I3 ^dir U)
  14045. <=WM: (14072: R1 ^reward R1002)
  14046. <=WM: (14075: O1998 ^name predict-no)
  14047. <=WM: (14074: O1997 ^name predict-yes)
  14048. <=WM: (14073: R1002 ^value 1)
  14049. --- Inner Elaboration Phase, active level 1 (S1) ---
  14050. Firing prefer*rvt*predict-yes*H0
  14051. -->
  14052. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14053. -->
  14054. (S1 ^operator O1999 = 0.6623525109664488)
  14055. Firing rl*prefer*rvt*predict-yes*H0*3
  14056. -->
  14057. (S1 ^operator O1999 = 0.337717515090074)
  14058. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14059. -->
  14060. Firing prefer*rvt*predict-no*H0
  14061. -->
  14062. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14063. -->
  14064. (S1 ^operator O2000 = -0.2817060109291377)
  14065. Firing rl*prefer*rvt*predict-no*H0*4
  14066. -->
  14067. (S1 ^operator O2000 = 0.339773810196969)
  14068. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14069. -->
  14070. inner elaboration loop at bottom goal.
  14071. Retracting rl*prefer*rvt*predict-no*H0*4
  14072. -->
  14073. (S1 ^operator O1998 = 0.339773810196969)
  14074. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14075. -->
  14076. (S1 ^operator O1998 = -0.2817060109291377)
  14077. Retracting rl*prefer*rvt*predict-yes*H0*3
  14078. -->
  14079. (S1 ^operator O1997 = 0.337717515090074)
  14080. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14081. -->
  14082. (S1 ^operator O1997 = 0.6623525109664488)
  14083. --- END Proposal Phase ---
  14084. --- Decision Phase ---
  14085. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14086. =>WM: (14092: S1 ^operator O1999)
  14087. 1000: O: O1999 (predict-yes)
  14088. --- END Decision Phase ---
  14089. --- Application Phase ---
  14090. --- Firing Productions (PE) For State At Depth 1 ---
  14091. --- Inner Elaboration Phase, active level 1 (S1) ---
  14092. Firing apply*operator
  14093. -->
  14094. (I3 ^predict-yes N1000 + :O )
  14095. Firing apply*operator*complete
  14096. -->
  14097. (I3 ^predict-no N999 - :O )
  14098. inner elaboration loop at bottom goal.
  14099. --- Change Working Memory (PE) ---
  14100. =>WM: (14093: I3 ^predict-yes N1000)
  14101. <=WM: (14080: N999 ^status complete)
  14102. <=WM: (14079: I3 ^predict-no N999)
  14103. --- Firing Productions (IE) For State At Depth 1 ---
  14104. --- Inner Elaboration Phase, active level 1 (S1) ---
  14105. Firing monitor*world
  14106. -->
  14107. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14108. --- Change Working Memory (IE) ---
  14109. --- END Application Phase ---
  14110. --- Output Phase ---
  14111. ENV: Agent did: predict-yes for direction R in state State-A
  14112. In State-A moving R
  14113. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14114. predict error 0
  14115. dir: dir isU
  14116. --- END Output Phase ---
  14117. -/|\-/|\-/|--- Input Phase ---
  14118. =>WM: (14097: I2 ^dir U)
  14119. =>WM: (14096: I2 ^reward 1)
  14120. =>WM: (14095: I2 ^see 1)
  14121. =>WM: (14094: N1000 ^status complete)
  14122. <=WM: (14083: I2 ^dir R)
  14123. <=WM: (14082: I2 ^reward 1)
  14124. <=WM: (14081: I2 ^see 0)
  14125. =>WM: (14098: I2 ^level-1 R1-root)
  14126. <=WM: (14084: I2 ^level-1 L0-root)
  14127. --- END Input Phase ---
  14128. --- Proposal Phase ---
  14129. --- Inner Elaboration Phase, active level 1 (S1) ---
  14130. Firing elaborate*copy-see-to-output-link
  14131. -->
  14132. (I3 ^see 1 +)
  14133. Firing elaborate*reward*based*on*reward
  14134. -->
  14135. (R1004 ^value 1 +)
  14136. (R1 ^reward R1004 +)
  14137. Firing propose*predict-yes
  14138. -->
  14139. (O2001 ^name predict-yes +)
  14140. (S1 ^operator O2001 +)
  14141. Firing propose*predict-no
  14142. -->
  14143. (O2002 ^name predict-no +)
  14144. (S1 ^operator O2002 +)
  14145. Firing rl*prefer*rvt*predict-no*H0*2
  14146. -->
  14147. (S1 ^operator O2000 = 1.)
  14148. Firing rl*prefer*rvt*predict-yes*H0*1
  14149. -->
  14150. (S1 ^operator O1999 = 0.)
  14151. Firing prefer*rvt*predict-yes*H0
  14152. -->
  14153. Firing prefer*rvt*predict-no*H0
  14154. -->
  14155. Firing elaborate*copy-dir-to-output-link
  14156. -->
  14157. (I3 ^dir U +)
  14158. inner elaboration loop at bottom goal.
  14159. Retracting elaborate*copy-see-to-output-link
  14160. -->
  14161. (I3 ^see 0 +)
  14162. Retracting propose*predict-no
  14163. -->
  14164. (O2000 ^name predict-no +)
  14165. (S1 ^operator O2000 +)
  14166. Retracting propose*predict-yes
  14167. -->
  14168. (O1999 ^name predict-yes +)
  14169. (S1 ^operator O1999 +)
  14170. Retracting elaborate*reward*based*on*reward
  14171. -->
  14172. (R1003 ^value 1 +)
  14173. (R1 ^reward R1003 +)
  14174. Retracting elaborate*copy-dir-to-output-link
  14175. -->
  14176. (I3 ^dir R +)
  14177. Retracting rl*prefer*rvt*predict-no*H0*4
  14178. -->
  14179. (S1 ^operator O2000 = 0.339773810196969)
  14180. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  14181. -->
  14182. (S1 ^operator O2000 = -0.2817060109291377)
  14183. Retracting rl*prefer*rvt*predict-yes*H0*3
  14184. -->
  14185. (S1 ^operator O1999 = 0.337717515090074)
  14186. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  14187. -->
  14188. (S1 ^operator O1999 = 0.6623525109664488)
  14189. =>WM: (14106: S1 ^operator O2002 +)
  14190. =>WM: (14105: S1 ^operator O2001 +)
  14191. =>WM: (14104: I3 ^dir U)
  14192. =>WM: (14103: O2002 ^name predict-no)
  14193. =>WM: (14102: O2001 ^name predict-yes)
  14194. =>WM: (14101: R1004 ^value 1)
  14195. =>WM: (14100: R1 ^reward R1004)
  14196. =>WM: (14099: I3 ^see 1)
  14197. <=WM: (14090: S1 ^operator O1999 +)
  14198. <=WM: (14092: S1 ^operator O1999)
  14199. <=WM: (14091: S1 ^operator O2000 +)
  14200. <=WM: (14089: I3 ^dir R)
  14201. <=WM: (14085: R1 ^reward R1003)
  14202. <=WM: (14031: I3 ^see 0)
  14203. <=WM: (14088: O2000 ^name predict-no)
  14204. <=WM: (14087: O1999 ^name predict-yes)
  14205. <=WM: (14086: R1003 ^value 1)
  14206. --- Inner Elaboration Phase, active level 1 (S1) ---
  14207. Firing prefer*rvt*predict-yes*H0
  14208. -->
  14209. Firing rl*prefer*rvt*predict-yes*H0*1
  14210. -->
  14211. (S1 ^operator O2001 = 0.)
  14212. Firing prefer*rvt*predict-no*H0
  14213. -->
  14214. Firing rl*prefer*rvt*predict-no*H0*2
  14215. -->
  14216. (S1 ^operator O2002 = 1.)
  14217. inner elaboration loop at bottom goal.
  14218. Retracting rl*prefer*rvt*predict-no*H0*2
  14219. -->
  14220. (S1 ^operator O2000 = 1.)
  14221. Retracting rl*prefer*rvt*predict-yes*H0*1
  14222. -->
  14223. (S1 ^operator O1999 = 0.)
  14224. --- END Proposal Phase ---
  14225. --- Decision Phase ---
  14226. RL update rl*prefer*rvt*predict-yes*H0*3 0.590119 -0.252401 0.337718 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.89881,0.0914956)
  14227. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409962 0.25239 0.662353 -> 0.409954 0.252391 0.662346(R,m,v=1,1,0)
  14228. =>WM: (14107: S1 ^operator O2002)
  14229. 1001: O: O2002 (predict-no)
  14230. --- END Decision Phase ---
  14231. --- Application Phase ---
  14232. --- Firing Productions (PE) For State At Depth 1 ---
  14233. --- Inner Elaboration Phase, active level 1 (S1) ---
  14234. Firing apply*operator
  14235. -->
  14236. (I3 ^predict-no N1001 + :O )
  14237. Firing apply*operator*complete
  14238. -->
  14239. (I3 ^predict-yes N1000 - :O )
  14240. inner elaboration loop at bottom goal.
  14241. --- Change Working Memory (PE) ---
  14242. =>WM: (14108: I3 ^predict-no N1001)
  14243. <=WM: (14094: N1000 ^status complete)
  14244. <=WM: (14093: I3 ^predict-yes N1000)
  14245. --- Firing Productions (IE) For State At Depth 1 ---
  14246. --- Inner Elaboration Phase, active level 1 (S1) ---
  14247. Firing monitor*world
  14248. -->
  14249. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14250. --- Change Working Memory (IE) ---
  14251. --- END Application Phase ---
  14252. --- Output Phase ---
  14253. ENV: Agent did: predict-no for direction U in state State-B
  14254. In State-B moving U
  14255. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14256. predict error 0
  14257. dir: dir isU
  14258. --- END Output Phase ---
  14259. \--- Input Phase ---
  14260. =>WM: (14112: I2 ^dir U)
  14261. =>WM: (14111: I2 ^reward 1)
  14262. =>WM: (14110: I2 ^see 0)
  14263. =>WM: (14109: N1001 ^status complete)
  14264. <=WM: (14097: I2 ^dir U)
  14265. <=WM: (14096: I2 ^reward 1)
  14266. <=WM: (14095: I2 ^see 1)
  14267. =>WM: (14113: I2 ^level-1 R1-root)
  14268. <=WM: (14098: I2 ^level-1 R1-root)
  14269. --- END Input Phase ---
  14270. --- Proposal Phase ---
  14271. --- Inner Elaboration Phase, active level 1 (S1) ---
  14272. Firing elaborate*copy-see-to-output-link
  14273. -->
  14274. (I3 ^see 0 +)
  14275. Firing elaborate*reward*based*on*reward
  14276. -->
  14277. (R1005 ^value 1 +)
  14278. (R1 ^reward R1005 +)
  14279. Firing propose*predict-yes
  14280. -->
  14281. (O2003 ^name predict-yes +)
  14282. (S1 ^operator O2003 +)
  14283. Firing propose*predict-no
  14284. -->
  14285. (O2004 ^name predict-no +)
  14286. (S1 ^operator O2004 +)
  14287. Firing rl*prefer*rvt*predict-no*H0*2
  14288. -->
  14289. (S1 ^operator O2002 = 1.)
  14290. Firing rl*prefer*rvt*predict-yes*H0*1
  14291. -->
  14292. (S1 ^operator O2001 = 0.)
  14293. Firing prefer*rvt*predict-yes*H0
  14294. -->
  14295. Firing prefer*rvt*predict-no*H0
  14296. -->
  14297. Firing elaborate*copy-dir-to-output-link
  14298. -->
  14299. (I3 ^dir U +)
  14300. inner elaboration loop at bottom goal.
  14301. Retracting elaborate*copy-see-to-output-link
  14302. -->
  14303. (I3 ^see 1 +)
  14304. Retracting propose*predict-no
  14305. -->
  14306. (O2002 ^name predict-no +)
  14307. (S1 ^operator O2002 +)
  14308. Retracting propose*predict-yes
  14309. -->
  14310. (O2001 ^name predict-yes +)
  14311. (S1 ^operator O2001 +)
  14312. Retracting elaborate*reward*based*on*reward
  14313. -->
  14314. (R1004 ^value 1 +)
  14315. (R1 ^reward R1004 +)
  14316. Retracting elaborate*copy-dir-to-output-link
  14317. -->
  14318. (I3 ^dir U +)
  14319. Retracting rl*prefer*rvt*predict-no*H0*2
  14320. -->
  14321. (S1 ^operator O2002 = 1.)
  14322. Retracting rl*prefer*rvt*predict-yes*H0*1
  14323. -->
  14324. (S1 ^operator O2001 = 0.)
  14325. =>WM: (14120: S1 ^operator O2004 +)
  14326. =>WM: (14119: S1 ^operator O2003 +)
  14327. =>WM: (14118: O2004 ^name predict-no)
  14328. =>WM: (14117: O2003 ^name predict-yes)
  14329. =>WM: (14116: R1005 ^value 1)
  14330. =>WM: (14115: R1 ^reward R1005)
  14331. =>WM: (14114: I3 ^see 0)
  14332. <=WM: (14105: S1 ^operator O2001 +)
  14333. <=WM: (14106: S1 ^operator O2002 +)
  14334. <=WM: (14107: S1 ^operator O2002)
  14335. <=WM: (14100: R1 ^reward R1004)
  14336. <=WM: (14099: I3 ^see 1)
  14337. <=WM: (14103: O2002 ^name predict-no)
  14338. <=WM: (14102: O2001 ^name predict-yes)
  14339. <=WM: (14101: R1004 ^value 1)
  14340. --- Inner Elaboration Phase, active level 1 (S1) ---
  14341. Firing prefer*rvt*predict-yes*H0
  14342. -->
  14343. Firing rl*prefer*rvt*predict-yes*H0*1
  14344. -->
  14345. (S1 ^operator O2003 = 0.)
  14346. Firing prefer*rvt*predict-no*H0
  14347. -->
  14348. Firing rl*prefer*rvt*predict-no*H0*2
  14349. -->
  14350. (S1 ^operator O2004 = 1.)
  14351. inner elaboration loop at bottom goal.
  14352. Retracting rl*prefer*rvt*predict-no*H0*2
  14353. -->
  14354. (S1 ^operator O2002 = 1.)
  14355. Retracting rl*prefer*rvt*predict-yes*H0*1
  14356. -->
  14357. (S1 ^operator O2001 = 0.)
  14358. --- END Proposal Phase ---
  14359. --- Decision Phase ---
  14360. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14361. =>WM: (14121: S1 ^operator O2004)
  14362. 1002: O: O2004 (predict-no)
  14363. --- END Decision Phase ---
  14364. --- Application Phase ---
  14365. --- Firing Productions (PE) For State At Depth 1 ---
  14366. --- Inner Elaboration Phase, active level 1 (S1) ---
  14367. Firing apply*operator
  14368. -->
  14369. (I3 ^predict-no N1002 + :O )
  14370. Firing apply*operator*complete
  14371. -->
  14372. (I3 ^predict-no N1001 - :O )
  14373. inner elaboration loop at bottom goal.
  14374. --- Change Working Memory (PE) ---
  14375. =>WM: (14122: I3 ^predict-no N1002)
  14376. <=WM: (14109: N1001 ^status complete)
  14377. <=WM: (14108: I3 ^predict-no N1001)
  14378. --- Firing Productions (IE) For State At Depth 1 ---
  14379. --- Inner Elaboration Phase, active level 1 (S1) ---
  14380. Firing monitor*world
  14381. -->
  14382. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14383. --- Change Working Memory (IE) ---
  14384. --- END Application Phase ---
  14385. --- Output Phase ---
  14386. ENV: Agent did: predict-no for direction U in state State-B
  14387. In State-B moving U
  14388. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14389. predict error 0
  14390. dir: dir isU
  14391. --- END Output Phase ---
  14392. ---- Input Phase ---
  14393. =>WM: (14126: I2 ^dir U)
  14394. =>WM: (14125: I2 ^reward 1)
  14395. =>WM: (14124: I2 ^see 0)
  14396. =>WM: (14123: N1002 ^status complete)
  14397. <=WM: (14112: I2 ^dir U)
  14398. <=WM: (14111: I2 ^reward 1)
  14399. <=WM: (14110: I2 ^see 0)
  14400. =>WM: (14127: I2 ^level-1 R1-root)
  14401. <=WM: (14113: I2 ^level-1 R1-root)
  14402. --- END Input Phase ---
  14403. --- Proposal Phase ---
  14404. --- Inner Elaboration Phase, active level 1 (S1) ---
  14405. Firing elaborate*copy-see-to-output-link
  14406. -->
  14407. (I3 ^see 0 +)
  14408. Firing elaborate*reward*based*on*reward
  14409. -->
  14410. (R1006 ^value 1 +)
  14411. (R1 ^reward R1006 +)
  14412. Firing propose*predict-yes
  14413. -->
  14414. (O2005 ^name predict-yes +)
  14415. (S1 ^operator O2005 +)
  14416. Firing propose*predict-no
  14417. -->
  14418. (O2006 ^name predict-no +)
  14419. (S1 ^operator O2006 +)
  14420. Firing rl*prefer*rvt*predict-no*H0*2
  14421. -->
  14422. (S1 ^operator O2004 = 1.)
  14423. Firing rl*prefer*rvt*predict-yes*H0*1
  14424. -->
  14425. (S1 ^operator O2003 = 0.)
  14426. Firing prefer*rvt*predict-yes*H0
  14427. -->
  14428. Firing prefer*rvt*predict-no*H0
  14429. -->
  14430. Firing elaborate*copy-dir-to-output-link
  14431. -->
  14432. (I3 ^dir U +)
  14433. inner elaboration loop at bottom goal.
  14434. Retracting elaborate*copy-see-to-output-link
  14435. -->
  14436. (I3 ^see 0 +)
  14437. Retracting propose*predict-no
  14438. -->
  14439. (O2004 ^name predict-no +)
  14440. (S1 ^operator O2004 +)
  14441. Retracting propose*predict-yes
  14442. -->
  14443. (O2003 ^name predict-yes +)
  14444. (S1 ^operator O2003 +)
  14445. Retracting elaborate*reward*based*on*reward
  14446. -->
  14447. (R1005 ^value 1 +)
  14448. (R1 ^reward R1005 +)
  14449. Retracting elaborate*copy-dir-to-output-link
  14450. -->
  14451. (I3 ^dir U +)
  14452. Retracting rl*prefer*rvt*predict-no*H0*2
  14453. -->
  14454. (S1 ^operator O2004 = 1.)
  14455. Retracting rl*prefer*rvt*predict-yes*H0*1
  14456. -->
  14457. (S1 ^operator O2003 = 0.)
  14458. =>WM: (14133: S1 ^operator O2006 +)
  14459. =>WM: (14132: S1 ^operator O2005 +)
  14460. =>WM: (14131: O2006 ^name predict-no)
  14461. =>WM: (14130: O2005 ^name predict-yes)
  14462. =>WM: (14129: R1006 ^value 1)
  14463. =>WM: (14128: R1 ^reward R1006)
  14464. <=WM: (14119: S1 ^operator O2003 +)
  14465. <=WM: (14120: S1 ^operator O2004 +)
  14466. <=WM: (14121: S1 ^operator O2004)
  14467. <=WM: (14115: R1 ^reward R1005)
  14468. <=WM: (14118: O2004 ^name predict-no)
  14469. <=WM: (14117: O2003 ^name predict-yes)
  14470. <=WM: (14116: R1005 ^value 1)
  14471. --- Inner Elaboration Phase, active level 1 (S1) ---
  14472. Firing prefer*rvt*predict-yes*H0
  14473. -->
  14474. Firing rl*prefer*rvt*predict-yes*H0*1
  14475. -->
  14476. (S1 ^operator O2005 = 0.)
  14477. Firing prefer*rvt*predict-no*H0
  14478. -->
  14479. Firing rl*prefer*rvt*predict-no*H0*2
  14480. -->
  14481. (S1 ^operator O2006 = 1.)
  14482. inner elaboration loop at bottom goal.
  14483. Retracting rl*prefer*rvt*predict-no*H0*2
  14484. -->
  14485. (S1 ^operator O2004 = 1.)
  14486. Retracting rl*prefer*rvt*predict-yes*H0*1
  14487. -->
  14488. (S1 ^operator O2003 = 0.)
  14489. --- END Proposal Phase ---
  14490. --- Decision Phase ---
  14491. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14492. =>WM: (14134: S1 ^operator O2006)
  14493. 1003: O: O2006 (predict-no)
  14494. --- END Decision Phase ---
  14495. --- Application Phase ---
  14496. --- Firing Productions (PE) For State At Depth 1 ---
  14497. --- Inner Elaboration Phase, active level 1 (S1) ---
  14498. Firing apply*operator
  14499. -->
  14500. (I3 ^predict-no N1003 + :O )
  14501. Firing apply*operator*complete
  14502. -->
  14503. (I3 ^predict-no N1002 - :O )
  14504. inner elaboration loop at bottom goal.
  14505. --- Change Working Memory (PE) ---
  14506. =>WM: (14135: I3 ^predict-no N1003)
  14507. <=WM: (14123: N1002 ^status complete)
  14508. <=WM: (14122: I3 ^predict-no N1002)
  14509. --- Firing Productions (IE) For State At Depth 1 ---
  14510. --- Inner Elaboration Phase, active level 1 (S1) ---
  14511. Firing monitor*world
  14512. -->
  14513. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14514. --- Change Working Memory (IE) ---
  14515. --- END Application Phase ---
  14516. --- Output Phase ---
  14517. ENV: Agent did: predict-no for direction U in state State-B
  14518. In State-B moving U
  14519. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14520. predict error 0
  14521. dir: dir isU
  14522. --- END Output Phase ---
  14523. /|--- Input Phase ---
  14524. =>WM: (14139: I2 ^dir U)
  14525. =>WM: (14138: I2 ^reward 1)
  14526. =>WM: (14137: I2 ^see 0)
  14527. =>WM: (14136: N1003 ^status complete)
  14528. <=WM: (14126: I2 ^dir U)
  14529. <=WM: (14125: I2 ^reward 1)
  14530. <=WM: (14124: I2 ^see 0)
  14531. =>WM: (14140: I2 ^level-1 R1-root)
  14532. <=WM: (14127: I2 ^level-1 R1-root)
  14533. --- END Input Phase ---
  14534. --- Proposal Phase ---
  14535. --- Inner Elaboration Phase, active level 1 (S1) ---
  14536. Firing elaborate*copy-see-to-output-link
  14537. -->
  14538. (I3 ^see 0 +)
  14539. Firing elaborate*reward*based*on*reward
  14540. -->
  14541. (R1007 ^value 1 +)
  14542. (R1 ^reward R1007 +)
  14543. Firing propose*predict-yes
  14544. -->
  14545. (O2007 ^name predict-yes +)
  14546. (S1 ^operator O2007 +)
  14547. Firing propose*predict-no
  14548. -->
  14549. (O2008 ^name predict-no +)
  14550. (S1 ^operator O2008 +)
  14551. Firing rl*prefer*rvt*predict-no*H0*2
  14552. -->
  14553. (S1 ^operator O2006 = 1.)
  14554. Firing rl*prefer*rvt*predict-yes*H0*1
  14555. -->
  14556. (S1 ^operator O2005 = 0.)
  14557. Firing prefer*rvt*predict-yes*H0
  14558. -->
  14559. Firing prefer*rvt*predict-no*H0
  14560. -->
  14561. Firing elaborate*copy-dir-to-output-link
  14562. -->
  14563. (I3 ^dir U +)
  14564. inner elaboration loop at bottom goal.
  14565. Retracting elaborate*copy-see-to-output-link
  14566. -->
  14567. (I3 ^see 0 +)
  14568. Retracting propose*predict-no
  14569. -->
  14570. (O2006 ^name predict-no +)
  14571. (S1 ^operator O2006 +)
  14572. Retracting propose*predict-yes
  14573. -->
  14574. (O2005 ^name predict-yes +)
  14575. (S1 ^operator O2005 +)
  14576. Retracting elaborate*reward*based*on*reward
  14577. -->
  14578. (R1006 ^value 1 +)
  14579. (R1 ^reward R1006 +)
  14580. Retracting elaborate*copy-dir-to-output-link
  14581. -->
  14582. (I3 ^dir U +)
  14583. Retracting rl*prefer*rvt*predict-no*H0*2
  14584. -->
  14585. (S1 ^operator O2006 = 1.)
  14586. Retracting rl*prefer*rvt*predict-yes*H0*1
  14587. -->
  14588. (S1 ^operator O2005 = 0.)
  14589. =>WM: (14146: S1 ^operator O2008 +)
  14590. =>WM: (14145: S1 ^operator O2007 +)
  14591. =>WM: (14144: O2008 ^name predict-no)
  14592. =>WM: (14143: O2007 ^name predict-yes)
  14593. =>WM: (14142: R1007 ^value 1)
  14594. =>WM: (14141: R1 ^reward R1007)
  14595. <=WM: (14132: S1 ^operator O2005 +)
  14596. <=WM: (14133: S1 ^operator O2006 +)
  14597. <=WM: (14134: S1 ^operator O2006)
  14598. <=WM: (14128: R1 ^reward R1006)
  14599. <=WM: (14131: O2006 ^name predict-no)
  14600. <=WM: (14130: O2005 ^name predict-yes)
  14601. <=WM: (14129: R1006 ^value 1)
  14602. --- Inner Elaboration Phase, active level 1 (S1) ---
  14603. Firing prefer*rvt*predict-yes*H0
  14604. -->
  14605. Firing rl*prefer*rvt*predict-yes*H0*1
  14606. -->
  14607. (S1 ^operator O2007 = 0.)
  14608. Firing prefer*rvt*predict-no*H0
  14609. -->
  14610. Firing rl*prefer*rvt*predict-no*H0*2
  14611. -->
  14612. (S1 ^operator O2008 = 1.)
  14613. inner elaboration loop at bottom goal.
  14614. Retracting rl*prefer*rvt*predict-no*H0*2
  14615. -->
  14616. (S1 ^operator O2006 = 1.)
  14617. Retracting rl*prefer*rvt*predict-yes*H0*1
  14618. -->
  14619. (S1 ^operator O2005 = 0.)
  14620. --- END Proposal Phase ---
  14621. --- Decision Phase ---
  14622. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14623. =>WM: (14147: S1 ^operator O2008)
  14624. 1004: O: O2008 (predict-no)
  14625. --- END Decision Phase ---
  14626. --- Application Phase ---
  14627. --- Firing Productions (PE) For State At Depth 1 ---
  14628. --- Inner Elaboration Phase, active level 1 (S1) ---
  14629. Firing apply*operator
  14630. -->
  14631. (I3 ^predict-no N1004 + :O )
  14632. Firing apply*operator*complete
  14633. -->
  14634. (I3 ^predict-no N1003 - :O )
  14635. inner elaboration loop at bottom goal.
  14636. --- Change Working Memory (PE) ---
  14637. =>WM: (14148: I3 ^predict-no N1004)
  14638. <=WM: (14136: N1003 ^status complete)
  14639. <=WM: (14135: I3 ^predict-no N1003)
  14640. --- Firing Productions (IE) For State At Depth 1 ---
  14641. --- Inner Elaboration Phase, active level 1 (S1) ---
  14642. Firing monitor*world
  14643. -->
  14644. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14645. --- Change Working Memory (IE) ---
  14646. --- END Application Phase ---
  14647. --- Output Phase ---
  14648. ENV: Agent did: predict-no for direction U in state State-B
  14649. In State-B moving U
  14650. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14651. predict error 0
  14652. dir: dir isL
  14653. --- END Output Phase ---
  14654. \-/--- Input Phase ---
  14655. =>WM: (14152: I2 ^dir L)
  14656. =>WM: (14151: I2 ^reward 1)
  14657. =>WM: (14150: I2 ^see 0)
  14658. =>WM: (14149: N1004 ^status complete)
  14659. <=WM: (14139: I2 ^dir U)
  14660. <=WM: (14138: I2 ^reward 1)
  14661. <=WM: (14137: I2 ^see 0)
  14662. =>WM: (14153: I2 ^level-1 R1-root)
  14663. <=WM: (14140: I2 ^level-1 R1-root)
  14664. --- END Input Phase ---
  14665. --- Proposal Phase ---
  14666. --- Inner Elaboration Phase, active level 1 (S1) ---
  14667. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14668. -->
  14669. (S1 ^operator O2007 = 0.7362263199804909)
  14670. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14671. -->
  14672. Firing elaborate*copy-see-to-output-link
  14673. -->
  14674. (I3 ^see 0 +)
  14675. Firing elaborate*reward*based*on*reward
  14676. -->
  14677. (R1008 ^value 1 +)
  14678. (R1 ^reward R1008 +)
  14679. Firing propose*predict-yes
  14680. -->
  14681. (O2009 ^name predict-yes +)
  14682. (S1 ^operator O2009 +)
  14683. Firing propose*predict-no
  14684. -->
  14685. (O2010 ^name predict-no +)
  14686. (S1 ^operator O2010 +)
  14687. Firing rl*prefer*rvt*predict-no*H0*6
  14688. -->
  14689. (S1 ^operator O2008 = 0.9998785089568328)
  14690. Firing rl*prefer*rvt*predict-yes*H0*5
  14691. -->
  14692. (S1 ^operator O2007 = 0.2640246623191502)
  14693. Firing prefer*rvt*predict-yes*H0
  14694. -->
  14695. Firing prefer*rvt*predict-no*H0
  14696. -->
  14697. Firing elaborate*copy-dir-to-output-link
  14698. -->
  14699. (I3 ^dir L +)
  14700. inner elaboration loop at bottom goal.
  14701. Retracting elaborate*copy-see-to-output-link
  14702. -->
  14703. (I3 ^see 0 +)
  14704. Retracting propose*predict-no
  14705. -->
  14706. (O2008 ^name predict-no +)
  14707. (S1 ^operator O2008 +)
  14708. Retracting propose*predict-yes
  14709. -->
  14710. (O2007 ^name predict-yes +)
  14711. (S1 ^operator O2007 +)
  14712. Retracting elaborate*reward*based*on*reward
  14713. -->
  14714. (R1007 ^value 1 +)
  14715. (R1 ^reward R1007 +)
  14716. Retracting elaborate*copy-dir-to-output-link
  14717. -->
  14718. (I3 ^dir U +)
  14719. Retracting rl*prefer*rvt*predict-no*H0*2
  14720. -->
  14721. (S1 ^operator O2008 = 1.)
  14722. Retracting rl*prefer*rvt*predict-yes*H0*1
  14723. -->
  14724. (S1 ^operator O2007 = 0.)
  14725. =>WM: (14160: S1 ^operator O2010 +)
  14726. =>WM: (14159: S1 ^operator O2009 +)
  14727. =>WM: (14158: I3 ^dir L)
  14728. =>WM: (14157: O2010 ^name predict-no)
  14729. =>WM: (14156: O2009 ^name predict-yes)
  14730. =>WM: (14155: R1008 ^value 1)
  14731. =>WM: (14154: R1 ^reward R1008)
  14732. <=WM: (14145: S1 ^operator O2007 +)
  14733. <=WM: (14146: S1 ^operator O2008 +)
  14734. <=WM: (14147: S1 ^operator O2008)
  14735. <=WM: (14104: I3 ^dir U)
  14736. <=WM: (14141: R1 ^reward R1007)
  14737. <=WM: (14144: O2008 ^name predict-no)
  14738. <=WM: (14143: O2007 ^name predict-yes)
  14739. <=WM: (14142: R1007 ^value 1)
  14740. --- Inner Elaboration Phase, active level 1 (S1) ---
  14741. Firing prefer*rvt*predict-yes*H0
  14742. -->
  14743. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14744. -->
  14745. (S1 ^operator O2009 = 0.7362263199804909)
  14746. Firing rl*prefer*rvt*predict-yes*H0*5
  14747. -->
  14748. (S1 ^operator O2009 = 0.2640246623191502)
  14749. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  14750. -->
  14751. Firing prefer*rvt*predict-no*H0
  14752. -->
  14753. Firing rl*prefer*rvt*predict-no*H0*6
  14754. -->
  14755. (S1 ^operator O2010 = 0.9998785089568328)
  14756. inner elaboration loop at bottom goal.
  14757. Retracting rl*prefer*rvt*predict-no*H0*6
  14758. -->
  14759. (S1 ^operator O2008 = 0.9998785089568328)
  14760. Retracting rl*prefer*rvt*predict-yes*H0*5
  14761. -->
  14762. (S1 ^operator O2007 = 0.2640246623191502)
  14763. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14764. -->
  14765. (S1 ^operator O2007 = 0.7362263199804909)
  14766. --- END Proposal Phase ---
  14767. --- Decision Phase ---
  14768. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14769. =>WM: (14161: S1 ^operator O2009)
  14770. 1005: O: O2009 (predict-yes)
  14771. --- END Decision Phase ---
  14772. --- Application Phase ---
  14773. --- Firing Productions (PE) For State At Depth 1 ---
  14774. --- Inner Elaboration Phase, active level 1 (S1) ---
  14775. Firing apply*operator
  14776. -->
  14777. (I3 ^predict-yes N1005 + :O )
  14778. Firing apply*operator*complete
  14779. -->
  14780. (I3 ^predict-no N1004 - :O )
  14781. inner elaboration loop at bottom goal.
  14782. --- Change Working Memory (PE) ---
  14783. =>WM: (14162: I3 ^predict-yes N1005)
  14784. <=WM: (14149: N1004 ^status complete)
  14785. <=WM: (14148: I3 ^predict-no N1004)
  14786. --- Firing Productions (IE) For State At Depth 1 ---
  14787. --- Inner Elaboration Phase, active level 1 (S1) ---
  14788. Firing monitor*world
  14789. -->
  14790. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14791. --- Change Working Memory (IE) ---
  14792. --- END Application Phase ---
  14793. --- Output Phase ---
  14794. ENV: Agent did: predict-yes for direction L in state State-B
  14795. In State-B moving L
  14796. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  14797. predict error 0
  14798. dir: dir isR
  14799. --- END Output Phase ---
  14800. |\---- Input Phase ---
  14801. =>WM: (14166: I2 ^dir R)
  14802. =>WM: (14165: I2 ^reward 1)
  14803. =>WM: (14164: I2 ^see 1)
  14804. =>WM: (14163: N1005 ^status complete)
  14805. <=WM: (14152: I2 ^dir L)
  14806. <=WM: (14151: I2 ^reward 1)
  14807. <=WM: (14150: I2 ^see 0)
  14808. =>WM: (14167: I2 ^level-1 L1-root)
  14809. <=WM: (14153: I2 ^level-1 R1-root)
  14810. --- END Input Phase ---
  14811. --- Proposal Phase ---
  14812. --- Inner Elaboration Phase, active level 1 (S1) ---
  14813. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14814. -->
  14815. (S1 ^operator O2010 = -0.2714224023553999)
  14816. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  14817. -->
  14818. (S1 ^operator O2009 = 0.6622259046932006)
  14819. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14820. -->
  14821. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14822. -->
  14823. Firing elaborate*copy-see-to-output-link
  14824. -->
  14825. (I3 ^see 1 +)
  14826. Firing elaborate*reward*based*on*reward
  14827. -->
  14828. (R1009 ^value 1 +)
  14829. (R1 ^reward R1009 +)
  14830. Firing propose*predict-yes
  14831. -->
  14832. (O2011 ^name predict-yes +)
  14833. (S1 ^operator O2011 +)
  14834. Firing propose*predict-no
  14835. -->
  14836. (O2012 ^name predict-no +)
  14837. (S1 ^operator O2012 +)
  14838. Firing rl*prefer*rvt*predict-no*H0*4
  14839. -->
  14840. (S1 ^operator O2010 = 0.339773810196969)
  14841. Firing rl*prefer*rvt*predict-yes*H0*3
  14842. -->
  14843. (S1 ^operator O2009 = 0.3377117977102235)
  14844. Firing prefer*rvt*predict-yes*H0
  14845. -->
  14846. Firing prefer*rvt*predict-no*H0
  14847. -->
  14848. Firing elaborate*copy-dir-to-output-link
  14849. -->
  14850. (I3 ^dir R +)
  14851. inner elaboration loop at bottom goal.
  14852. Retracting elaborate*copy-see-to-output-link
  14853. -->
  14854. (I3 ^see 0 +)
  14855. Retracting propose*predict-no
  14856. -->
  14857. (O2010 ^name predict-no +)
  14858. (S1 ^operator O2010 +)
  14859. Retracting propose*predict-yes
  14860. -->
  14861. (O2009 ^name predict-yes +)
  14862. (S1 ^operator O2009 +)
  14863. Retracting elaborate*reward*based*on*reward
  14864. -->
  14865. (R1008 ^value 1 +)
  14866. (R1 ^reward R1008 +)
  14867. Retracting elaborate*copy-dir-to-output-link
  14868. -->
  14869. (I3 ^dir L +)
  14870. Retracting rl*prefer*rvt*predict-no*H0*6
  14871. -->
  14872. (S1 ^operator O2010 = 0.9998785089568328)
  14873. Retracting rl*prefer*rvt*predict-yes*H0*5
  14874. -->
  14875. (S1 ^operator O2009 = 0.2640246623191502)
  14876. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  14877. -->
  14878. (S1 ^operator O2009 = 0.7362263199804909)
  14879. =>WM: (14175: S1 ^operator O2012 +)
  14880. =>WM: (14174: S1 ^operator O2011 +)
  14881. =>WM: (14173: I3 ^dir R)
  14882. =>WM: (14172: O2012 ^name predict-no)
  14883. =>WM: (14171: O2011 ^name predict-yes)
  14884. =>WM: (14170: R1009 ^value 1)
  14885. =>WM: (14169: R1 ^reward R1009)
  14886. =>WM: (14168: I3 ^see 1)
  14887. <=WM: (14159: S1 ^operator O2009 +)
  14888. <=WM: (14161: S1 ^operator O2009)
  14889. <=WM: (14160: S1 ^operator O2010 +)
  14890. <=WM: (14158: I3 ^dir L)
  14891. <=WM: (14154: R1 ^reward R1008)
  14892. <=WM: (14114: I3 ^see 0)
  14893. <=WM: (14157: O2010 ^name predict-no)
  14894. <=WM: (14156: O2009 ^name predict-yes)
  14895. <=WM: (14155: R1008 ^value 1)
  14896. --- Inner Elaboration Phase, active level 1 (S1) ---
  14897. Firing prefer*rvt*predict-yes*H0
  14898. -->
  14899. Firing rl*prefer*rvt*predict-yes*H0*3
  14900. -->
  14901. (S1 ^operator O2011 = 0.3377117977102235)
  14902. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14903. -->
  14904. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  14905. -->
  14906. (S1 ^operator O2011 = 0.6622259046932006)
  14907. Firing prefer*rvt*predict-no*H0
  14908. -->
  14909. Firing rl*prefer*rvt*predict-no*H0*4
  14910. -->
  14911. (S1 ^operator O2012 = 0.339773810196969)
  14912. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14913. -->
  14914. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14915. -->
  14916. (S1 ^operator O2012 = -0.2714224023553999)
  14917. inner elaboration loop at bottom goal.
  14918. Retracting rl*prefer*rvt*predict-no*H0*4
  14919. -->
  14920. (S1 ^operator O2010 = 0.339773810196969)
  14921. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  14922. -->
  14923. (S1 ^operator O2010 = -0.2714224023553999)
  14924. Retracting rl*prefer*rvt*predict-yes*H0*3
  14925. -->
  14926. (S1 ^operator O2009 = 0.3377117977102235)
  14927. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  14928. -->
  14929. (S1 ^operator O2009 = 0.6622259046932006)
  14930. --- END Proposal Phase ---
  14931. --- Decision Phase ---
  14932. RL update rl*prefer*rvt*predict-yes*H0*5 0.55441 -0.290386 0.264025 -> 0.55439 -0.290386 0.264004(R,m,v=1,0.877778,0.107883)
  14933. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445836 0.29039 0.736226 -> 0.445814 0.290389 0.736203(R,m,v=1,1,0)
  14934. =>WM: (14176: S1 ^operator O2011)
  14935. 1006: O: O2011 (predict-yes)
  14936. --- END Decision Phase ---
  14937. --- Application Phase ---
  14938. --- Firing Productions (PE) For State At Depth 1 ---
  14939. --- Inner Elaboration Phase, active level 1 (S1) ---
  14940. Firing apply*operator
  14941. -->
  14942. (I3 ^predict-yes N1006 + :O )
  14943. Firing apply*operator*complete
  14944. -->
  14945. (I3 ^predict-yes N1005 - :O )
  14946. inner elaboration loop at bottom goal.
  14947. --- Change Working Memory (PE) ---
  14948. =>WM: (14177: I3 ^predict-yes N1006)
  14949. <=WM: (14163: N1005 ^status complete)
  14950. <=WM: (14162: I3 ^predict-yes N1005)
  14951. --- Firing Productions (IE) For State At Depth 1 ---
  14952. --- Inner Elaboration Phase, active level 1 (S1) ---
  14953. Firing monitor*world
  14954. -->
  14955. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14956. --- Change Working Memory (IE) ---
  14957. --- END Application Phase ---
  14958. --- Output Phase ---
  14959. ENV: Agent did: predict-yes for direction R in state State-A
  14960. In State-A moving R
  14961. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14962. predict error 0
  14963. dir: dir isR
  14964. --- END Output Phase ---
  14965. /|\--- Input Phase ---
  14966. =>WM: (14181: I2 ^dir R)
  14967. =>WM: (14180: I2 ^reward 1)
  14968. =>WM: (14179: I2 ^see 1)
  14969. =>WM: (14178: N1006 ^status complete)
  14970. <=WM: (14166: I2 ^dir R)
  14971. <=WM: (14165: I2 ^reward 1)
  14972. <=WM: (14164: I2 ^see 1)
  14973. =>WM: (14182: I2 ^level-1 R1-root)
  14974. <=WM: (14167: I2 ^level-1 L1-root)
  14975. --- END Input Phase ---
  14976. --- Proposal Phase ---
  14977. --- Inner Elaboration Phase, active level 1 (S1) ---
  14978. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  14979. -->
  14980. (S1 ^operator O2011 = -0.1070236389116304)
  14981. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  14982. -->
  14983. (S1 ^operator O2012 = 0.6602439963649246)
  14984. Firing prefer*rvt*predict-no*H0*4*v1*H1
  14985. -->
  14986. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  14987. -->
  14988. Firing elaborate*copy-see-to-output-link
  14989. -->
  14990. (I3 ^see 1 +)
  14991. Firing elaborate*reward*based*on*reward
  14992. -->
  14993. (R1010 ^value 1 +)
  14994. (R1 ^reward R1010 +)
  14995. Firing propose*predict-yes
  14996. -->
  14997. (O2013 ^name predict-yes +)
  14998. (S1 ^operator O2013 +)
  14999. Firing propose*predict-no
  15000. -->
  15001. (O2014 ^name predict-no +)
  15002. (S1 ^operator O2014 +)
  15003. Firing rl*prefer*rvt*predict-no*H0*4
  15004. -->
  15005. (S1 ^operator O2012 = 0.339773810196969)
  15006. Firing rl*prefer*rvt*predict-yes*H0*3
  15007. -->
  15008. (S1 ^operator O2011 = 0.3377117977102235)
  15009. Firing prefer*rvt*predict-yes*H0
  15010. -->
  15011. Firing prefer*rvt*predict-no*H0
  15012. -->
  15013. Firing elaborate*copy-dir-to-output-link
  15014. -->
  15015. (I3 ^dir R +)
  15016. inner elaboration loop at bottom goal.
  15017. Retracting elaborate*copy-see-to-output-link
  15018. -->
  15019. (I3 ^see 1 +)
  15020. Retracting propose*predict-no
  15021. -->
  15022. (O2012 ^name predict-no +)
  15023. (S1 ^operator O2012 +)
  15024. Retracting propose*predict-yes
  15025. -->
  15026. (O2011 ^name predict-yes +)
  15027. (S1 ^operator O2011 +)
  15028. Retracting elaborate*reward*based*on*reward
  15029. -->
  15030. (R1009 ^value 1 +)
  15031. (R1 ^reward R1009 +)
  15032. Retracting elaborate*copy-dir-to-output-link
  15033. -->
  15034. (I3 ^dir R +)
  15035. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  15036. -->
  15037. (S1 ^operator O2012 = -0.2714224023553999)
  15038. Retracting rl*prefer*rvt*predict-no*H0*4
  15039. -->
  15040. (S1 ^operator O2012 = 0.339773810196969)
  15041. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  15042. -->
  15043. (S1 ^operator O2011 = 0.6622259046932006)
  15044. Retracting rl*prefer*rvt*predict-yes*H0*3
  15045. -->
  15046. (S1 ^operator O2011 = 0.3377117977102235)
  15047. =>WM: (14188: S1 ^operator O2014 +)
  15048. =>WM: (14187: S1 ^operator O2013 +)
  15049. =>WM: (14186: O2014 ^name predict-no)
  15050. =>WM: (14185: O2013 ^name predict-yes)
  15051. =>WM: (14184: R1010 ^value 1)
  15052. =>WM: (14183: R1 ^reward R1010)
  15053. <=WM: (14174: S1 ^operator O2011 +)
  15054. <=WM: (14176: S1 ^operator O2011)
  15055. <=WM: (14175: S1 ^operator O2012 +)
  15056. <=WM: (14169: R1 ^reward R1009)
  15057. <=WM: (14172: O2012 ^name predict-no)
  15058. <=WM: (14171: O2011 ^name predict-yes)
  15059. <=WM: (14170: R1009 ^value 1)
  15060. --- Inner Elaboration Phase, active level 1 (S1) ---
  15061. Firing prefer*rvt*predict-yes*H0
  15062. -->
  15063. Firing rl*prefer*rvt*predict-yes*H0*3
  15064. -->
  15065. (S1 ^operator O2013 = 0.3377117977102235)
  15066. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15067. -->
  15068. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15069. -->
  15070. (S1 ^operator O2013 = -0.1070236389116304)
  15071. Firing prefer*rvt*predict-no*H0
  15072. -->
  15073. Firing rl*prefer*rvt*predict-no*H0*4
  15074. -->
  15075. (S1 ^operator O2014 = 0.339773810196969)
  15076. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15077. -->
  15078. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15079. -->
  15080. (S1 ^operator O2014 = 0.6602439963649246)
  15081. inner elaboration loop at bottom goal.
  15082. Retracting rl*prefer*rvt*predict-no*H0*4
  15083. -->
  15084. (S1 ^operator O2012 = 0.339773810196969)
  15085. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15086. -->
  15087. (S1 ^operator O2012 = 0.6602439963649246)
  15088. Retracting rl*prefer*rvt*predict-yes*H0*3
  15089. -->
  15090. (S1 ^operator O2011 = 0.3377117977102235)
  15091. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15092. -->
  15093. (S1 ^operator O2011 = -0.1070236389116304)
  15094. --- END Proposal Phase ---
  15095. --- Decision Phase ---
  15096. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.590118 -0.252401 0.337717(R,m,v=1,0.899408,0.0910116)
  15097. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409816 0.25241 0.662226 -> 0.409823 0.252409 0.662232(R,m,v=1,1,0)
  15098. =>WM: (14189: S1 ^operator O2014)
  15099. 1007: O: O2014 (predict-no)
  15100. --- END Decision Phase ---
  15101. --- Application Phase ---
  15102. --- Firing Productions (PE) For State At Depth 1 ---
  15103. --- Inner Elaboration Phase, active level 1 (S1) ---
  15104. Firing apply*operator
  15105. -->
  15106. (I3 ^predict-no N1007 + :O )
  15107. Firing apply*operator*complete
  15108. -->
  15109. (I3 ^predict-yes N1006 - :O )
  15110. inner elaboration loop at bottom goal.
  15111. --- Change Working Memory (PE) ---
  15112. =>WM: (14190: I3 ^predict-no N1007)
  15113. <=WM: (14178: N1006 ^status complete)
  15114. <=WM: (14177: I3 ^predict-yes N1006)
  15115. --- Firing Productions (IE) For State At Depth 1 ---
  15116. --- Inner Elaboration Phase, active level 1 (S1) ---
  15117. Firing monitor*world
  15118. -->
  15119. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15120. --- Change Working Memory (IE) ---
  15121. --- END Application Phase ---
  15122. --- Output Phase ---
  15123. ENV: Agent did: predict-no for direction R in state State-B
  15124. In State-B moving R
  15125. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15126. predict error 0
  15127. dir: dir isU
  15128. --- END Output Phase ---
  15129. -/|--- Input Phase ---
  15130. =>WM: (14194: I2 ^dir U)
  15131. =>WM: (14193: I2 ^reward 1)
  15132. =>WM: (14192: I2 ^see 0)
  15133. =>WM: (14191: N1007 ^status complete)
  15134. <=WM: (14181: I2 ^dir R)
  15135. <=WM: (14180: I2 ^reward 1)
  15136. <=WM: (14179: I2 ^see 1)
  15137. =>WM: (14195: I2 ^level-1 R0-root)
  15138. <=WM: (14182: I2 ^level-1 R1-root)
  15139. --- END Input Phase ---
  15140. --- Proposal Phase ---
  15141. --- Inner Elaboration Phase, active level 1 (S1) ---
  15142. Firing elaborate*copy-see-to-output-link
  15143. -->
  15144. (I3 ^see 0 +)
  15145. Firing elaborate*reward*based*on*reward
  15146. -->
  15147. (R1011 ^value 1 +)
  15148. (R1 ^reward R1011 +)
  15149. Firing propose*predict-yes
  15150. -->
  15151. (O2015 ^name predict-yes +)
  15152. (S1 ^operator O2015 +)
  15153. Firing propose*predict-no
  15154. -->
  15155. (O2016 ^name predict-no +)
  15156. (S1 ^operator O2016 +)
  15157. Firing rl*prefer*rvt*predict-no*H0*2
  15158. -->
  15159. (S1 ^operator O2014 = 1.)
  15160. Firing rl*prefer*rvt*predict-yes*H0*1
  15161. -->
  15162. (S1 ^operator O2013 = 0.)
  15163. Firing prefer*rvt*predict-yes*H0
  15164. -->
  15165. Firing prefer*rvt*predict-no*H0
  15166. -->
  15167. Firing elaborate*copy-dir-to-output-link
  15168. -->
  15169. (I3 ^dir U +)
  15170. inner elaboration loop at bottom goal.
  15171. Retracting elaborate*copy-see-to-output-link
  15172. -->
  15173. (I3 ^see 1 +)
  15174. Retracting propose*predict-no
  15175. -->
  15176. (O2014 ^name predict-no +)
  15177. (S1 ^operator O2014 +)
  15178. Retracting propose*predict-yes
  15179. -->
  15180. (O2013 ^name predict-yes +)
  15181. (S1 ^operator O2013 +)
  15182. Retracting elaborate*reward*based*on*reward
  15183. -->
  15184. (R1010 ^value 1 +)
  15185. (R1 ^reward R1010 +)
  15186. Retracting elaborate*copy-dir-to-output-link
  15187. -->
  15188. (I3 ^dir R +)
  15189. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  15190. -->
  15191. (S1 ^operator O2014 = 0.6602439963649246)
  15192. Retracting rl*prefer*rvt*predict-no*H0*4
  15193. -->
  15194. (S1 ^operator O2014 = 0.339773810196969)
  15195. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  15196. -->
  15197. (S1 ^operator O2013 = -0.1070236389116304)
  15198. Retracting rl*prefer*rvt*predict-yes*H0*3
  15199. -->
  15200. (S1 ^operator O2013 = 0.3377168791642142)
  15201. =>WM: (14203: S1 ^operator O2016 +)
  15202. =>WM: (14202: S1 ^operator O2015 +)
  15203. =>WM: (14201: I3 ^dir U)
  15204. =>WM: (14200: O2016 ^name predict-no)
  15205. =>WM: (14199: O2015 ^name predict-yes)
  15206. =>WM: (14198: R1011 ^value 1)
  15207. =>WM: (14197: R1 ^reward R1011)
  15208. =>WM: (14196: I3 ^see 0)
  15209. <=WM: (14187: S1 ^operator O2013 +)
  15210. <=WM: (14188: S1 ^operator O2014 +)
  15211. <=WM: (14189: S1 ^operator O2014)
  15212. <=WM: (14173: I3 ^dir R)
  15213. <=WM: (14183: R1 ^reward R1010)
  15214. <=WM: (14168: I3 ^see 1)
  15215. <=WM: (14186: O2014 ^name predict-no)
  15216. <=WM: (14185: O2013 ^name predict-yes)
  15217. <=WM: (14184: R1010 ^value 1)
  15218. --- Inner Elaboration Phase, active level 1 (S1) ---
  15219. Firing prefer*rvt*predict-yes*H0
  15220. -->
  15221. Firing rl*prefer*rvt*predict-yes*H0*1
  15222. -->
  15223. (S1 ^operator O2015 = 0.)
  15224. Firing prefer*rvt*predict-no*H0
  15225. -->
  15226. Firing rl*prefer*rvt*predict-no*H0*2
  15227. -->
  15228. (S1 ^operator O2016 = 1.)
  15229. inner elaboration loop at bottom goal.
  15230. Retracting rl*prefer*rvt*predict-no*H0*2
  15231. -->
  15232. (S1 ^operator O2014 = 1.)
  15233. Retracting rl*prefer*rvt*predict-yes*H0*1
  15234. -->
  15235. (S1 ^operator O2013 = 0.)
  15236. --- END Proposal Phase ---
  15237. --- Decision Phase ---
  15238. RL update rl*prefer*rvt*predict-no*H0*4 0.570257 -0.230484 0.339774 -> 0.570256 -0.230484 0.339772(R,m,v=1,0.87574,0.109467)
  15239. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429761 0.230483 0.660244 -> 0.429759 0.230483 0.660242(R,m,v=1,1,0)
  15240. =>WM: (14204: S1 ^operator O2016)
  15241. 1008: O: O2016 (predict-no)
  15242. --- END Decision Phase ---
  15243. --- Application Phase ---
  15244. --- Firing Productions (PE) For State At Depth 1 ---
  15245. --- Inner Elaboration Phase, active level 1 (S1) ---
  15246. Firing apply*operator
  15247. -->
  15248. (I3 ^predict-no N1008 + :O )
  15249. Firing apply*operator*complete
  15250. -->
  15251. (I3 ^predict-no N1007 - :O )
  15252. inner elaboration loop at bottom goal.
  15253. --- Change Working Memory (PE) ---
  15254. =>WM: (14205: I3 ^predict-no N1008)
  15255. <=WM: (14191: N1007 ^status complete)
  15256. <=WM: (14190: I3 ^predict-no N1007)
  15257. --- Firing Productions (IE) For State At Depth 1 ---
  15258. --- Inner Elaboration Phase, active level 1 (S1) ---
  15259. Firing monitor*world
  15260. -->
  15261. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15262. --- Change Working Memory (IE) ---
  15263. --- END Application Phase ---
  15264. --- Output Phase ---
  15265. ENV: Agent did: predict-no for direction U in state State-B
  15266. In State-B moving U
  15267. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15268. predict error 0
  15269. dir: dir isL
  15270. --- END Output Phase ---
  15271. \-/|--- Input Phase ---
  15272. =>WM: (14209: I2 ^dir L)
  15273. =>WM: (14208: I2 ^reward 1)
  15274. =>WM: (14207: I2 ^see 0)
  15275. =>WM: (14206: N1008 ^status complete)
  15276. <=WM: (14194: I2 ^dir U)
  15277. <=WM: (14193: I2 ^reward 1)
  15278. <=WM: (14192: I2 ^see 0)
  15279. =>WM: (14210: I2 ^level-1 R0-root)
  15280. <=WM: (14195: I2 ^level-1 R0-root)
  15281. --- END Input Phase ---
  15282. --- Proposal Phase ---
  15283. --- Inner Elaboration Phase, active level 1 (S1) ---
  15284. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15285. -->
  15286. (S1 ^operator O2015 = 0.7358542477906264)
  15287. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15288. -->
  15289. Firing elaborate*copy-see-to-output-link
  15290. -->
  15291. (I3 ^see 0 +)
  15292. Firing elaborate*reward*based*on*reward
  15293. -->
  15294. (R1012 ^value 1 +)
  15295. (R1 ^reward R1012 +)
  15296. Firing propose*predict-yes
  15297. -->
  15298. (O2017 ^name predict-yes +)
  15299. (S1 ^operator O2017 +)
  15300. Firing propose*predict-no
  15301. -->
  15302. (O2018 ^name predict-no +)
  15303. (S1 ^operator O2018 +)
  15304. Firing rl*prefer*rvt*predict-no*H0*6
  15305. -->
  15306. (S1 ^operator O2016 = 0.9998785089568328)
  15307. Firing rl*prefer*rvt*predict-yes*H0*5
  15308. -->
  15309. (S1 ^operator O2015 = 0.2640043987919141)
  15310. Firing prefer*rvt*predict-yes*H0
  15311. -->
  15312. Firing prefer*rvt*predict-no*H0
  15313. -->
  15314. Firing elaborate*copy-dir-to-output-link
  15315. -->
  15316. (I3 ^dir L +)
  15317. inner elaboration loop at bottom goal.
  15318. Retracting elaborate*copy-see-to-output-link
  15319. -->
  15320. (I3 ^see 0 +)
  15321. Retracting propose*predict-no
  15322. -->
  15323. (O2016 ^name predict-no +)
  15324. (S1 ^operator O2016 +)
  15325. Retracting propose*predict-yes
  15326. -->
  15327. (O2015 ^name predict-yes +)
  15328. (S1 ^operator O2015 +)
  15329. Retracting elaborate*reward*based*on*reward
  15330. -->
  15331. (R1011 ^value 1 +)
  15332. (R1 ^reward R1011 +)
  15333. Retracting elaborate*copy-dir-to-output-link
  15334. -->
  15335. (I3 ^dir U +)
  15336. Retracting rl*prefer*rvt*predict-no*H0*2
  15337. -->
  15338. (S1 ^operator O2016 = 1.)
  15339. Retracting rl*prefer*rvt*predict-yes*H0*1
  15340. -->
  15341. (S1 ^operator O2015 = 0.)
  15342. =>WM: (14217: S1 ^operator O2018 +)
  15343. =>WM: (14216: S1 ^operator O2017 +)
  15344. =>WM: (14215: I3 ^dir L)
  15345. =>WM: (14214: O2018 ^name predict-no)
  15346. =>WM: (14213: O2017 ^name predict-yes)
  15347. =>WM: (14212: R1012 ^value 1)
  15348. =>WM: (14211: R1 ^reward R1012)
  15349. <=WM: (14202: S1 ^operator O2015 +)
  15350. <=WM: (14203: S1 ^operator O2016 +)
  15351. <=WM: (14204: S1 ^operator O2016)
  15352. <=WM: (14201: I3 ^dir U)
  15353. <=WM: (14197: R1 ^reward R1011)
  15354. <=WM: (14200: O2016 ^name predict-no)
  15355. <=WM: (14199: O2015 ^name predict-yes)
  15356. <=WM: (14198: R1011 ^value 1)
  15357. --- Inner Elaboration Phase, active level 1 (S1) ---
  15358. Firing prefer*rvt*predict-yes*H0
  15359. -->
  15360. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15361. -->
  15362. (S1 ^operator O2017 = 0.7358542477906264)
  15363. Firing rl*prefer*rvt*predict-yes*H0*5
  15364. -->
  15365. (S1 ^operator O2017 = 0.2640043987919141)
  15366. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15367. -->
  15368. Firing prefer*rvt*predict-no*H0
  15369. -->
  15370. Firing rl*prefer*rvt*predict-no*H0*6
  15371. -->
  15372. (S1 ^operator O2018 = 0.9998785089568328)
  15373. inner elaboration loop at bottom goal.
  15374. Retracting rl*prefer*rvt*predict-no*H0*6
  15375. -->
  15376. (S1 ^operator O2016 = 0.9998785089568328)
  15377. Retracting rl*prefer*rvt*predict-yes*H0*5
  15378. -->
  15379. (S1 ^operator O2015 = 0.2640043987919141)
  15380. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15381. -->
  15382. (S1 ^operator O2015 = 0.7358542477906264)
  15383. --- END Proposal Phase ---
  15384. --- Decision Phase ---
  15385. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15386. =>WM: (14218: S1 ^operator O2018)
  15387. 1009: O: O2018 (predict-no)
  15388. --- END Decision Phase ---
  15389. --- Application Phase ---
  15390. --- Firing Productions (PE) For State At Depth 1 ---
  15391. --- Inner Elaboration Phase, active level 1 (S1) ---
  15392. Firing apply*operator
  15393. -->
  15394. (I3 ^predict-no N1009 + :O )
  15395. Firing apply*operator*complete
  15396. -->
  15397. (I3 ^predict-no N1008 - :O )
  15398. inner elaboration loop at bottom goal.
  15399. --- Change Working Memory (PE) ---
  15400. =>WM: (14219: I3 ^predict-no N1009)
  15401. <=WM: (14206: N1008 ^status complete)
  15402. <=WM: (14205: I3 ^predict-no N1008)
  15403. --- Firing Productions (IE) For State At Depth 1 ---
  15404. --- Inner Elaboration Phase, active level 1 (S1) ---
  15405. Firing monitor*world
  15406. -->
  15407. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15408. --- Change Working Memory (IE) ---
  15409. --- END Application Phase ---
  15410. --- Output Phase ---
  15411. ENV: Agent did: predict-no for direction L in state State-B
  15412. In State-B moving L
  15413. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  15414. predict error 1
  15415. dir: dir isL
  15416. --- END Output Phase ---
  15417. \-/--- Input Phase ---
  15418. =>WM: (14223: I2 ^dir L)
  15419. =>WM: (14222: I2 ^reward 0)
  15420. =>WM: (14221: I2 ^see 1)
  15421. =>WM: (14220: N1009 ^status complete)
  15422. <=WM: (14209: I2 ^dir L)
  15423. <=WM: (14208: I2 ^reward 1)
  15424. <=WM: (14207: I2 ^see 0)
  15425. =>WM: (14224: I2 ^level-1 L1-root)
  15426. <=WM: (14210: I2 ^level-1 R0-root)
  15427. --- END Input Phase ---
  15428. --- Proposal Phase ---
  15429. --- Inner Elaboration Phase, active level 1 (S1) ---
  15430. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15431. -->
  15432. (S1 ^operator O2017 = -0.181727099742844)
  15433. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15434. -->
  15435. Firing elaborate*copy-see-to-output-link
  15436. -->
  15437. (I3 ^see 1 +)
  15438. Firing elaborate*reward*based*on*reward
  15439. -->
  15440. (R1013 ^value 0 +)
  15441. (R1 ^reward R1013 +)
  15442. Firing propose*predict-yes
  15443. -->
  15444. (O2019 ^name predict-yes +)
  15445. (S1 ^operator O2019 +)
  15446. Firing propose*predict-no
  15447. -->
  15448. (O2020 ^name predict-no +)
  15449. (S1 ^operator O2020 +)
  15450. Firing rl*prefer*rvt*predict-no*H0*6
  15451. -->
  15452. (S1 ^operator O2018 = 0.9998785089568328)
  15453. Firing rl*prefer*rvt*predict-yes*H0*5
  15454. -->
  15455. (S1 ^operator O2017 = 0.2640043987919141)
  15456. Firing prefer*rvt*predict-yes*H0
  15457. -->
  15458. Firing prefer*rvt*predict-no*H0
  15459. -->
  15460. Firing elaborate*copy-dir-to-output-link
  15461. -->
  15462. (I3 ^dir L +)
  15463. inner elaboration loop at bottom goal.
  15464. Retracting elaborate*copy-see-to-output-link
  15465. -->
  15466. (I3 ^see 0 +)
  15467. Retracting propose*predict-no
  15468. -->
  15469. (O2018 ^name predict-no +)
  15470. (S1 ^operator O2018 +)
  15471. Retracting propose*predict-yes
  15472. -->
  15473. (O2017 ^name predict-yes +)
  15474. (S1 ^operator O2017 +)
  15475. Retracting elaborate*reward*based*on*reward
  15476. -->
  15477. (R1012 ^value 1 +)
  15478. (R1 ^reward R1012 +)
  15479. Retracting elaborate*copy-dir-to-output-link
  15480. -->
  15481. (I3 ^dir L +)
  15482. Retracting rl*prefer*rvt*predict-no*H0*6
  15483. -->
  15484. (S1 ^operator O2018 = 0.9998785089568328)
  15485. Retracting rl*prefer*rvt*predict-yes*H0*5
  15486. -->
  15487. (S1 ^operator O2017 = 0.2640043987919141)
  15488. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  15489. -->
  15490. (S1 ^operator O2017 = 0.7358542477906264)
  15491. =>WM: (14231: S1 ^operator O2020 +)
  15492. =>WM: (14230: S1 ^operator O2019 +)
  15493. =>WM: (14229: O2020 ^name predict-no)
  15494. =>WM: (14228: O2019 ^name predict-yes)
  15495. =>WM: (14227: R1013 ^value 0)
  15496. =>WM: (14226: R1 ^reward R1013)
  15497. =>WM: (14225: I3 ^see 1)
  15498. <=WM: (14216: S1 ^operator O2017 +)
  15499. <=WM: (14217: S1 ^operator O2018 +)
  15500. <=WM: (14218: S1 ^operator O2018)
  15501. <=WM: (14211: R1 ^reward R1012)
  15502. <=WM: (14196: I3 ^see 0)
  15503. <=WM: (14214: O2018 ^name predict-no)
  15504. <=WM: (14213: O2017 ^name predict-yes)
  15505. <=WM: (14212: R1012 ^value 1)
  15506. --- Inner Elaboration Phase, active level 1 (S1) ---
  15507. Firing prefer*rvt*predict-yes*H0
  15508. -->
  15509. Firing rl*prefer*rvt*predict-yes*H0*5
  15510. -->
  15511. (S1 ^operator O2019 = 0.2640043987919141)
  15512. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15513. -->
  15514. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15515. -->
  15516. (S1 ^operator O2019 = -0.181727099742844)
  15517. Firing prefer*rvt*predict-no*H0
  15518. -->
  15519. Firing rl*prefer*rvt*predict-no*H0*6
  15520. -->
  15521. (S1 ^operator O2020 = 0.9998785089568328)
  15522. inner elaboration loop at bottom goal.
  15523. Retracting rl*prefer*rvt*predict-no*H0*6
  15524. -->
  15525. (S1 ^operator O2018 = 0.9998785089568328)
  15526. Retracting rl*prefer*rvt*predict-yes*H0*5
  15527. -->
  15528. (S1 ^operator O2017 = 0.2640043987919141)
  15529. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15530. -->
  15531. (S1 ^operator O2017 = -0.181727099742844)
  15532. --- END Proposal Phase ---
  15533. --- Decision Phase ---
  15534. RL update rl*prefer*rvt*predict-no*H0*6 0.999879 0 0.999879 -> 0.833711 0 0.833711(R,m,v=0,0.900662,0.0900662)
  15535. =>WM: (14232: S1 ^operator O2020)
  15536. 1010: O: O2020 (predict-no)
  15537. --- END Decision Phase ---
  15538. --- Application Phase ---
  15539. --- Firing Productions (PE) For State At Depth 1 ---
  15540. --- Inner Elaboration Phase, active level 1 (S1) ---
  15541. Firing apply*operator
  15542. -->
  15543. (I3 ^predict-no N1010 + :O )
  15544. Firing apply*operator*complete
  15545. -->
  15546. (I3 ^predict-no N1009 - :O )
  15547. inner elaboration loop at bottom goal.
  15548. --- Change Working Memory (PE) ---
  15549. =>WM: (14233: I3 ^predict-no N1010)
  15550. <=WM: (14220: N1009 ^status complete)
  15551. <=WM: (14219: I3 ^predict-no N1009)
  15552. --- Firing Productions (IE) For State At Depth 1 ---
  15553. --- Inner Elaboration Phase, active level 1 (S1) ---
  15554. Firing monitor*world
  15555. -->
  15556. I see 0 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15557. --- Change Working Memory (IE) ---
  15558. --- END Application Phase ---
  15559. --- Output Phase ---
  15560. ENV: Agent did: predict-no for direction L in state State-A
  15561. In State-A moving L
  15562. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15563. predict error 0
  15564. dir: dir isR
  15565. --- END Output Phase ---
  15566. |\---- Input Phase ---
  15567. =>WM: (14237: I2 ^dir R)
  15568. =>WM: (14236: I2 ^reward 1)
  15569. =>WM: (14235: I2 ^see 0)
  15570. =>WM: (14234: N1010 ^status complete)
  15571. <=WM: (14223: I2 ^dir L)
  15572. <=WM: (14222: I2 ^reward 0)
  15573. <=WM: (14221: I2 ^see 1)
  15574. =>WM: (14238: I2 ^level-1 L0-root)
  15575. <=WM: (14224: I2 ^level-1 L1-root)
  15576. --- END Input Phase ---
  15577. --- Proposal Phase ---
  15578. --- Inner Elaboration Phase, active level 1 (S1) ---
  15579. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15580. -->
  15581. (S1 ^operator O2020 = -0.2817060109291377)
  15582. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15583. -->
  15584. (S1 ^operator O2019 = 0.6623458215671729)
  15585. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15586. -->
  15587. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15588. -->
  15589. Firing elaborate*copy-see-to-output-link
  15590. -->
  15591. (I3 ^see 0 +)
  15592. Firing elaborate*reward*based*on*reward
  15593. -->
  15594. (R1014 ^value 1 +)
  15595. (R1 ^reward R1014 +)
  15596. Firing propose*predict-yes
  15597. -->
  15598. (O2021 ^name predict-yes +)
  15599. (S1 ^operator O2021 +)
  15600. Firing propose*predict-no
  15601. -->
  15602. (O2022 ^name predict-no +)
  15603. (S1 ^operator O2022 +)
  15604. Firing rl*prefer*rvt*predict-no*H0*4
  15605. -->
  15606. (S1 ^operator O2020 = 0.3397723577617232)
  15607. Firing rl*prefer*rvt*predict-yes*H0*3
  15608. -->
  15609. (S1 ^operator O2019 = 0.3377168791642142)
  15610. Firing prefer*rvt*predict-yes*H0
  15611. -->
  15612. Firing prefer*rvt*predict-no*H0
  15613. -->
  15614. Firing elaborate*copy-dir-to-output-link
  15615. -->
  15616. (I3 ^dir R +)
  15617. inner elaboration loop at bottom goal.
  15618. Retracting elaborate*copy-see-to-output-link
  15619. -->
  15620. (I3 ^see 1 +)
  15621. Retracting propose*predict-no
  15622. -->
  15623. (O2020 ^name predict-no +)
  15624. (S1 ^operator O2020 +)
  15625. Retracting propose*predict-yes
  15626. -->
  15627. (O2019 ^name predict-yes +)
  15628. (S1 ^operator O2019 +)
  15629. Retracting elaborate*reward*based*on*reward
  15630. -->
  15631. (R1013 ^value 0 +)
  15632. (R1 ^reward R1013 +)
  15633. Retracting elaborate*copy-dir-to-output-link
  15634. -->
  15635. (I3 ^dir L +)
  15636. Retracting rl*prefer*rvt*predict-no*H0*6
  15637. -->
  15638. (S1 ^operator O2020 = 0.8337106497126315)
  15639. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15640. -->
  15641. (S1 ^operator O2019 = -0.181727099742844)
  15642. Retracting rl*prefer*rvt*predict-yes*H0*5
  15643. -->
  15644. (S1 ^operator O2019 = 0.2640043987919141)
  15645. =>WM: (14246: S1 ^operator O2022 +)
  15646. =>WM: (14245: S1 ^operator O2021 +)
  15647. =>WM: (14244: I3 ^dir R)
  15648. =>WM: (14243: O2022 ^name predict-no)
  15649. =>WM: (14242: O2021 ^name predict-yes)
  15650. =>WM: (14241: R1014 ^value 1)
  15651. =>WM: (14240: R1 ^reward R1014)
  15652. =>WM: (14239: I3 ^see 0)
  15653. <=WM: (14230: S1 ^operator O2019 +)
  15654. <=WM: (14231: S1 ^operator O2020 +)
  15655. <=WM: (14232: S1 ^operator O2020)
  15656. <=WM: (14215: I3 ^dir L)
  15657. <=WM: (14226: R1 ^reward R1013)
  15658. <=WM: (14225: I3 ^see 1)
  15659. <=WM: (14229: O2020 ^name predict-no)
  15660. <=WM: (14228: O2019 ^name predict-yes)
  15661. <=WM: (14227: R1013 ^value 0)
  15662. --- Inner Elaboration Phase, active level 1 (S1) ---
  15663. Firing prefer*rvt*predict-yes*H0
  15664. -->
  15665. Firing rl*prefer*rvt*predict-yes*H0*3
  15666. -->
  15667. (S1 ^operator O2021 = 0.3377168791642142)
  15668. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  15669. -->
  15670. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15671. -->
  15672. (S1 ^operator O2021 = 0.6623458215671729)
  15673. Firing prefer*rvt*predict-no*H0
  15674. -->
  15675. Firing rl*prefer*rvt*predict-no*H0*4
  15676. -->
  15677. (S1 ^operator O2022 = 0.3397723577617232)
  15678. Firing prefer*rvt*predict-no*H0*4*v1*H1
  15679. -->
  15680. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15681. -->
  15682. (S1 ^operator O2022 = -0.2817060109291377)
  15683. inner elaboration loop at bottom goal.
  15684. Retracting rl*prefer*rvt*predict-no*H0*4
  15685. -->
  15686. (S1 ^operator O2020 = 0.3397723577617232)
  15687. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15688. -->
  15689. (S1 ^operator O2020 = -0.2817060109291377)
  15690. Retracting rl*prefer*rvt*predict-yes*H0*3
  15691. -->
  15692. (S1 ^operator O2019 = 0.3377168791642142)
  15693. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15694. -->
  15695. (S1 ^operator O2019 = 0.6623458215671729)
  15696. --- END Proposal Phase ---
  15697. --- Decision Phase ---
  15698. RL update rl*prefer*rvt*predict-no*H0*6 0.833711 0 0.833711 -> 0.861316 0 0.861316(R,m,v=1,0.901316,0.0895347)
  15699. =>WM: (14247: S1 ^operator O2021)
  15700. 1011: O: O2021 (predict-yes)
  15701. --- END Decision Phase ---
  15702. --- Application Phase ---
  15703. --- Firing Productions (PE) For State At Depth 1 ---
  15704. --- Inner Elaboration Phase, active level 1 (S1) ---
  15705. Firing apply*operator
  15706. -->
  15707. (I3 ^predict-yes N1011 + :O )
  15708. Firing apply*operator*complete
  15709. -->
  15710. (I3 ^predict-no N1010 - :O )
  15711. inner elaboration loop at bottom goal.
  15712. --- Change Working Memory (PE) ---
  15713. =>WM: (14248: I3 ^predict-yes N1011)
  15714. <=WM: (14234: N1010 ^status complete)
  15715. <=WM: (14233: I3 ^predict-no N1010)
  15716. --- Firing Productions (IE) For State At Depth 1 ---
  15717. --- Inner Elaboration Phase, active level 1 (S1) ---
  15718. Firing monitor*world
  15719. -->
  15720. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15721. --- Change Working Memory (IE) ---
  15722. --- END Application Phase ---
  15723. --- Output Phase ---
  15724. ENV: Agent did: predict-yes for direction R in state State-A
  15725. In State-A moving R
  15726. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15727. predict error 0
  15728. dir: dir isL
  15729. --- END Output Phase ---
  15730. /--- Input Phase ---
  15731. =>WM: (14252: I2 ^dir L)
  15732. =>WM: (14251: I2 ^reward 1)
  15733. =>WM: (14250: I2 ^see 1)
  15734. =>WM: (14249: N1011 ^status complete)
  15735. <=WM: (14237: I2 ^dir R)
  15736. <=WM: (14236: I2 ^reward 1)
  15737. <=WM: (14235: I2 ^see 0)
  15738. =>WM: (14253: I2 ^level-1 R1-root)
  15739. <=WM: (14238: I2 ^level-1 L0-root)
  15740. --- END Input Phase ---
  15741. --- Proposal Phase ---
  15742. --- Inner Elaboration Phase, active level 1 (S1) ---
  15743. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15744. -->
  15745. (S1 ^operator O2021 = 0.7362031097592677)
  15746. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15747. -->
  15748. Firing elaborate*copy-see-to-output-link
  15749. -->
  15750. (I3 ^see 1 +)
  15751. Firing elaborate*reward*based*on*reward
  15752. -->
  15753. (R1015 ^value 1 +)
  15754. (R1 ^reward R1015 +)
  15755. Firing propose*predict-yes
  15756. -->
  15757. (O2023 ^name predict-yes +)
  15758. (S1 ^operator O2023 +)
  15759. Firing propose*predict-no
  15760. -->
  15761. (O2024 ^name predict-no +)
  15762. (S1 ^operator O2024 +)
  15763. Firing rl*prefer*rvt*predict-no*H0*6
  15764. -->
  15765. (S1 ^operator O2022 = 0.8613156710459575)
  15766. Firing rl*prefer*rvt*predict-yes*H0*5
  15767. -->
  15768. (S1 ^operator O2021 = 0.2640043987919141)
  15769. Firing prefer*rvt*predict-yes*H0
  15770. -->
  15771. Firing prefer*rvt*predict-no*H0
  15772. -->
  15773. Firing elaborate*copy-dir-to-output-link
  15774. -->
  15775. (I3 ^dir L +)
  15776. inner elaboration loop at bottom goal.
  15777. Retracting elaborate*copy-see-to-output-link
  15778. -->
  15779. (I3 ^see 0 +)
  15780. Retracting propose*predict-no
  15781. -->
  15782. (O2022 ^name predict-no +)
  15783. (S1 ^operator O2022 +)
  15784. Retracting propose*predict-yes
  15785. -->
  15786. (O2021 ^name predict-yes +)
  15787. (S1 ^operator O2021 +)
  15788. Retracting elaborate*reward*based*on*reward
  15789. -->
  15790. (R1014 ^value 1 +)
  15791. (R1 ^reward R1014 +)
  15792. Retracting elaborate*copy-dir-to-output-link
  15793. -->
  15794. (I3 ^dir R +)
  15795. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  15796. -->
  15797. (S1 ^operator O2022 = -0.2817060109291377)
  15798. Retracting rl*prefer*rvt*predict-no*H0*4
  15799. -->
  15800. (S1 ^operator O2022 = 0.3397723577617232)
  15801. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  15802. -->
  15803. (S1 ^operator O2021 = 0.6623458215671729)
  15804. Retracting rl*prefer*rvt*predict-yes*H0*3
  15805. -->
  15806. (S1 ^operator O2021 = 0.3377168791642142)
  15807. =>WM: (14261: S1 ^operator O2024 +)
  15808. =>WM: (14260: S1 ^operator O2023 +)
  15809. =>WM: (14259: I3 ^dir L)
  15810. =>WM: (14258: O2024 ^name predict-no)
  15811. =>WM: (14257: O2023 ^name predict-yes)
  15812. =>WM: (14256: R1015 ^value 1)
  15813. =>WM: (14255: R1 ^reward R1015)
  15814. =>WM: (14254: I3 ^see 1)
  15815. <=WM: (14245: S1 ^operator O2021 +)
  15816. <=WM: (14247: S1 ^operator O2021)
  15817. <=WM: (14246: S1 ^operator O2022 +)
  15818. <=WM: (14244: I3 ^dir R)
  15819. <=WM: (14240: R1 ^reward R1014)
  15820. <=WM: (14239: I3 ^see 0)
  15821. <=WM: (14243: O2022 ^name predict-no)
  15822. <=WM: (14242: O2021 ^name predict-yes)
  15823. <=WM: (14241: R1014 ^value 1)
  15824. --- Inner Elaboration Phase, active level 1 (S1) ---
  15825. Firing prefer*rvt*predict-yes*H0
  15826. -->
  15827. Firing rl*prefer*rvt*predict-yes*H0*5
  15828. -->
  15829. (S1 ^operator O2023 = 0.2640043987919141)
  15830. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15831. -->
  15832. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15833. -->
  15834. (S1 ^operator O2023 = 0.7362031097592677)
  15835. Firing prefer*rvt*predict-no*H0
  15836. -->
  15837. Firing rl*prefer*rvt*predict-no*H0*6
  15838. -->
  15839. (S1 ^operator O2024 = 0.8613156710459575)
  15840. inner elaboration loop at bottom goal.
  15841. Retracting rl*prefer*rvt*predict-no*H0*6
  15842. -->
  15843. (S1 ^operator O2022 = 0.8613156710459575)
  15844. Retracting rl*prefer*rvt*predict-yes*H0*5
  15845. -->
  15846. (S1 ^operator O2021 = 0.2640043987919141)
  15847. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15848. -->
  15849. (S1 ^operator O2021 = 0.7362031097592677)
  15850. --- END Proposal Phase ---
  15851. --- Decision Phase ---
  15852. RL update rl*prefer*rvt*predict-yes*H0*3 0.590118 -0.252401 0.337717 -> 0.590112 -0.2524 0.337712(R,m,v=1,0.9,0.0905325)
  15853. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409954 0.252391 0.662346 -> 0.409947 0.252392 0.66234(R,m,v=1,1,0)
  15854. =>WM: (14262: S1 ^operator O2023)
  15855. 1012: O: O2023 (predict-yes)
  15856. --- END Decision Phase ---
  15857. --- Application Phase ---
  15858. --- Firing Productions (PE) For State At Depth 1 ---
  15859. --- Inner Elaboration Phase, active level 1 (S1) ---
  15860. Firing apply*operator
  15861. -->
  15862. (I3 ^predict-yes N1012 + :O )
  15863. Firing apply*operator*complete
  15864. -->
  15865. (I3 ^predict-yes N1011 - :O )
  15866. inner elaboration loop at bottom goal.
  15867. --- Change Working Memory (PE) ---
  15868. =>WM: (14263: I3 ^predict-yes N1012)
  15869. <=WM: (14249: N1011 ^status complete)
  15870. <=WM: (14248: I3 ^predict-yes N1011)
  15871. --- Firing Productions (IE) For State At Depth 1 ---
  15872. --- Inner Elaboration Phase, active level 1 (S1) ---
  15873. Firing monitor*world
  15874. -->
  15875. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15876. --- Change Working Memory (IE) ---
  15877. --- END Application Phase ---
  15878. --- Output Phase ---
  15879. ENV: Agent did: predict-yes for direction L in state State-B
  15880. In State-B moving L
  15881. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15882. predict error 0
  15883. dir: dir isL
  15884. --- END Output Phase ---
  15885. |\---- Input Phase ---
  15886. =>WM: (14267: I2 ^dir L)
  15887. =>WM: (14266: I2 ^reward 1)
  15888. =>WM: (14265: I2 ^see 1)
  15889. =>WM: (14264: N1012 ^status complete)
  15890. <=WM: (14252: I2 ^dir L)
  15891. <=WM: (14251: I2 ^reward 1)
  15892. <=WM: (14250: I2 ^see 1)
  15893. =>WM: (14268: I2 ^level-1 L1-root)
  15894. <=WM: (14253: I2 ^level-1 R1-root)
  15895. --- END Input Phase ---
  15896. --- Proposal Phase ---
  15897. --- Inner Elaboration Phase, active level 1 (S1) ---
  15898. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15899. -->
  15900. (S1 ^operator O2023 = -0.181727099742844)
  15901. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15902. -->
  15903. Firing elaborate*copy-see-to-output-link
  15904. -->
  15905. (I3 ^see 1 +)
  15906. Firing elaborate*reward*based*on*reward
  15907. -->
  15908. (R1016 ^value 1 +)
  15909. (R1 ^reward R1016 +)
  15910. Firing propose*predict-yes
  15911. -->
  15912. (O2025 ^name predict-yes +)
  15913. (S1 ^operator O2025 +)
  15914. Firing propose*predict-no
  15915. -->
  15916. (O2026 ^name predict-no +)
  15917. (S1 ^operator O2026 +)
  15918. Firing rl*prefer*rvt*predict-no*H0*6
  15919. -->
  15920. (S1 ^operator O2024 = 0.8613156710459575)
  15921. Firing rl*prefer*rvt*predict-yes*H0*5
  15922. -->
  15923. (S1 ^operator O2023 = 0.2640043987919141)
  15924. Firing prefer*rvt*predict-yes*H0
  15925. -->
  15926. Firing prefer*rvt*predict-no*H0
  15927. -->
  15928. Firing elaborate*copy-dir-to-output-link
  15929. -->
  15930. (I3 ^dir L +)
  15931. inner elaboration loop at bottom goal.
  15932. Retracting elaborate*copy-see-to-output-link
  15933. -->
  15934. (I3 ^see 1 +)
  15935. Retracting propose*predict-no
  15936. -->
  15937. (O2024 ^name predict-no +)
  15938. (S1 ^operator O2024 +)
  15939. Retracting propose*predict-yes
  15940. -->
  15941. (O2023 ^name predict-yes +)
  15942. (S1 ^operator O2023 +)
  15943. Retracting elaborate*reward*based*on*reward
  15944. -->
  15945. (R1015 ^value 1 +)
  15946. (R1 ^reward R1015 +)
  15947. Retracting elaborate*copy-dir-to-output-link
  15948. -->
  15949. (I3 ^dir L +)
  15950. Retracting rl*prefer*rvt*predict-no*H0*6
  15951. -->
  15952. (S1 ^operator O2024 = 0.8613156710459575)
  15953. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  15954. -->
  15955. (S1 ^operator O2023 = 0.7362031097592677)
  15956. Retracting rl*prefer*rvt*predict-yes*H0*5
  15957. -->
  15958. (S1 ^operator O2023 = 0.2640043987919141)
  15959. =>WM: (14274: S1 ^operator O2026 +)
  15960. =>WM: (14273: S1 ^operator O2025 +)
  15961. =>WM: (14272: O2026 ^name predict-no)
  15962. =>WM: (14271: O2025 ^name predict-yes)
  15963. =>WM: (14270: R1016 ^value 1)
  15964. =>WM: (14269: R1 ^reward R1016)
  15965. <=WM: (14260: S1 ^operator O2023 +)
  15966. <=WM: (14262: S1 ^operator O2023)
  15967. <=WM: (14261: S1 ^operator O2024 +)
  15968. <=WM: (14255: R1 ^reward R1015)
  15969. <=WM: (14258: O2024 ^name predict-no)
  15970. <=WM: (14257: O2023 ^name predict-yes)
  15971. <=WM: (14256: R1015 ^value 1)
  15972. --- Inner Elaboration Phase, active level 1 (S1) ---
  15973. Firing prefer*rvt*predict-yes*H0
  15974. -->
  15975. Firing rl*prefer*rvt*predict-yes*H0*5
  15976. -->
  15977. (S1 ^operator O2025 = 0.2640043987919141)
  15978. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  15979. -->
  15980. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15981. -->
  15982. (S1 ^operator O2025 = -0.181727099742844)
  15983. Firing prefer*rvt*predict-no*H0
  15984. -->
  15985. Firing rl*prefer*rvt*predict-no*H0*6
  15986. -->
  15987. (S1 ^operator O2026 = 0.8613156710459575)
  15988. inner elaboration loop at bottom goal.
  15989. Retracting rl*prefer*rvt*predict-no*H0*6
  15990. -->
  15991. (S1 ^operator O2024 = 0.8613156710459575)
  15992. Retracting rl*prefer*rvt*predict-yes*H0*5
  15993. -->
  15994. (S1 ^operator O2023 = 0.2640043987919141)
  15995. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  15996. -->
  15997. (S1 ^operator O2023 = -0.181727099742844)
  15998. --- END Proposal Phase ---
  15999. --- Decision Phase ---
  16000. RL update rl*prefer*rvt*predict-yes*H0*5 0.55439 -0.290386 0.264004 -> 0.554374 -0.290386 0.263988(R,m,v=1,0.878453,0.107366)
  16001. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445814 0.290389 0.736203 -> 0.445795 0.290389 0.736184(R,m,v=1,1,0)
  16002. =>WM: (14275: S1 ^operator O2026)
  16003. 1013: O: O2026 (predict-no)
  16004. --- END Decision Phase ---
  16005. --- Application Phase ---
  16006. --- Firing Productions (PE) For State At Depth 1 ---
  16007. --- Inner Elaboration Phase, active level 1 (S1) ---
  16008. Firing apply*operator
  16009. -->
  16010. (I3 ^predict-no N1013 + :O )
  16011. Firing apply*operator*complete
  16012. -->
  16013. (I3 ^predict-yes N1012 - :O )
  16014. inner elaboration loop at bottom goal.
  16015. --- Change Working Memory (PE) ---
  16016. =>WM: (14276: I3 ^predict-no N1013)
  16017. <=WM: (14264: N1012 ^status complete)
  16018. <=WM: (14263: I3 ^predict-yes N1012)
  16019. --- Firing Productions (IE) For State At Depth 1 ---
  16020. --- Inner Elaboration Phase, active level 1 (S1) ---
  16021. Firing monitor*world
  16022. -->
  16023. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16024. --- Change Working Memory (IE) ---
  16025. --- END Application Phase ---
  16026. --- Output Phase ---
  16027. ENV: Agent did: predict-no for direction L in state State-A
  16028. In State-A moving L
  16029. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16030. predict error 0
  16031. dir: dir isR
  16032. --- END Output Phase ---
  16033. /|--- Input Phase ---
  16034. =>WM: (14280: I2 ^dir R)
  16035. =>WM: (14279: I2 ^reward 1)
  16036. =>WM: (14278: I2 ^see 0)
  16037. =>WM: (14277: N1013 ^status complete)
  16038. <=WM: (14267: I2 ^dir L)
  16039. <=WM: (14266: I2 ^reward 1)
  16040. <=WM: (14265: I2 ^see 1)
  16041. =>WM: (14281: I2 ^level-1 L0-root)
  16042. <=WM: (14268: I2 ^level-1 L1-root)
  16043. --- END Input Phase ---
  16044. --- Proposal Phase ---
  16045. --- Inner Elaboration Phase, active level 1 (S1) ---
  16046. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16047. -->
  16048. (S1 ^operator O2026 = -0.2817060109291377)
  16049. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16050. -->
  16051. (S1 ^operator O2025 = 0.6623398483569007)
  16052. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16053. -->
  16054. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16055. -->
  16056. Firing elaborate*copy-see-to-output-link
  16057. -->
  16058. (I3 ^see 0 +)
  16059. Firing elaborate*reward*based*on*reward
  16060. -->
  16061. (R1017 ^value 1 +)
  16062. (R1 ^reward R1017 +)
  16063. Firing propose*predict-yes
  16064. -->
  16065. (O2027 ^name predict-yes +)
  16066. (S1 ^operator O2027 +)
  16067. Firing propose*predict-no
  16068. -->
  16069. (O2028 ^name predict-no +)
  16070. (S1 ^operator O2028 +)
  16071. Firing rl*prefer*rvt*predict-no*H0*4
  16072. -->
  16073. (S1 ^operator O2026 = 0.3397723577617232)
  16074. Firing rl*prefer*rvt*predict-yes*H0*3
  16075. -->
  16076. (S1 ^operator O2025 = 0.3377117697451198)
  16077. Firing prefer*rvt*predict-yes*H0
  16078. -->
  16079. Firing prefer*rvt*predict-no*H0
  16080. -->
  16081. Firing elaborate*copy-dir-to-output-link
  16082. -->
  16083. (I3 ^dir R +)
  16084. inner elaboration loop at bottom goal.
  16085. Retracting elaborate*copy-see-to-output-link
  16086. -->
  16087. (I3 ^see 1 +)
  16088. Retracting propose*predict-no
  16089. -->
  16090. (O2026 ^name predict-no +)
  16091. (S1 ^operator O2026 +)
  16092. Retracting propose*predict-yes
  16093. -->
  16094. (O2025 ^name predict-yes +)
  16095. (S1 ^operator O2025 +)
  16096. Retracting elaborate*reward*based*on*reward
  16097. -->
  16098. (R1016 ^value 1 +)
  16099. (R1 ^reward R1016 +)
  16100. Retracting elaborate*copy-dir-to-output-link
  16101. -->
  16102. (I3 ^dir L +)
  16103. Retracting rl*prefer*rvt*predict-no*H0*6
  16104. -->
  16105. (S1 ^operator O2026 = 0.8613156710459575)
  16106. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16107. -->
  16108. (S1 ^operator O2025 = -0.181727099742844)
  16109. Retracting rl*prefer*rvt*predict-yes*H0*5
  16110. -->
  16111. (S1 ^operator O2025 = 0.2639876601736543)
  16112. =>WM: (14289: S1 ^operator O2028 +)
  16113. =>WM: (14288: S1 ^operator O2027 +)
  16114. =>WM: (14287: I3 ^dir R)
  16115. =>WM: (14286: O2028 ^name predict-no)
  16116. =>WM: (14285: O2027 ^name predict-yes)
  16117. =>WM: (14284: R1017 ^value 1)
  16118. =>WM: (14283: R1 ^reward R1017)
  16119. =>WM: (14282: I3 ^see 0)
  16120. <=WM: (14273: S1 ^operator O2025 +)
  16121. <=WM: (14274: S1 ^operator O2026 +)
  16122. <=WM: (14275: S1 ^operator O2026)
  16123. <=WM: (14259: I3 ^dir L)
  16124. <=WM: (14269: R1 ^reward R1016)
  16125. <=WM: (14254: I3 ^see 1)
  16126. <=WM: (14272: O2026 ^name predict-no)
  16127. <=WM: (14271: O2025 ^name predict-yes)
  16128. <=WM: (14270: R1016 ^value 1)
  16129. --- Inner Elaboration Phase, active level 1 (S1) ---
  16130. Firing prefer*rvt*predict-yes*H0
  16131. -->
  16132. Firing rl*prefer*rvt*predict-yes*H0*3
  16133. -->
  16134. (S1 ^operator O2027 = 0.3377117697451198)
  16135. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16136. -->
  16137. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16138. -->
  16139. (S1 ^operator O2027 = 0.6623398483569007)
  16140. Firing prefer*rvt*predict-no*H0
  16141. -->
  16142. Firing rl*prefer*rvt*predict-no*H0*4
  16143. -->
  16144. (S1 ^operator O2028 = 0.3397723577617232)
  16145. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16146. -->
  16147. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16148. -->
  16149. (S1 ^operator O2028 = -0.2817060109291377)
  16150. inner elaboration loop at bottom goal.
  16151. Retracting rl*prefer*rvt*predict-no*H0*4
  16152. -->
  16153. (S1 ^operator O2026 = 0.3397723577617232)
  16154. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16155. -->
  16156. (S1 ^operator O2026 = -0.2817060109291377)
  16157. Retracting rl*prefer*rvt*predict-yes*H0*3
  16158. -->
  16159. (S1 ^operator O2025 = 0.3377117697451198)
  16160. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16161. -->
  16162. (S1 ^operator O2025 = 0.6623398483569007)
  16163. --- END Proposal Phase ---
  16164. --- Decision Phase ---
  16165. RL update rl*prefer*rvt*predict-no*H0*6 0.861316 0 0.861316 -> 0.884313 0 0.884313(R,m,v=1,0.901961,0.0890093)
  16166. =>WM: (14290: S1 ^operator O2027)
  16167. 1014: O: O2027 (predict-yes)
  16168. --- END Decision Phase ---
  16169. --- Application Phase ---
  16170. --- Firing Productions (PE) For State At Depth 1 ---
  16171. --- Inner Elaboration Phase, active level 1 (S1) ---
  16172. Firing apply*operator
  16173. -->
  16174. (I3 ^predict-yes N1014 + :O )
  16175. Firing apply*operator*complete
  16176. -->
  16177. (I3 ^predict-no N1013 - :O )
  16178. inner elaboration loop at bottom goal.
  16179. --- Change Working Memory (PE) ---
  16180. =>WM: (14291: I3 ^predict-yes N1014)
  16181. <=WM: (14277: N1013 ^status complete)
  16182. <=WM: (14276: I3 ^predict-no N1013)
  16183. --- Firing Productions (IE) For State At Depth 1 ---
  16184. --- Inner Elaboration Phase, active level 1 (S1) ---
  16185. Firing monitor*world
  16186. -->
  16187. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16188. --- Change Working Memory (IE) ---
  16189. --- END Application Phase ---
  16190. --- Output Phase ---
  16191. ENV: Agent did: predict-yes for direction R in state State-A
  16192. In State-A moving R
  16193. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  16194. predict error 0
  16195. dir: dir isU
  16196. --- END Output Phase ---
  16197. \-/--- Input Phase ---
  16198. =>WM: (14295: I2 ^dir U)
  16199. =>WM: (14294: I2 ^reward 1)
  16200. =>WM: (14293: I2 ^see 1)
  16201. =>WM: (14292: N1014 ^status complete)
  16202. <=WM: (14280: I2 ^dir R)
  16203. <=WM: (14279: I2 ^reward 1)
  16204. <=WM: (14278: I2 ^see 0)
  16205. =>WM: (14296: I2 ^level-1 R1-root)
  16206. <=WM: (14281: I2 ^level-1 L0-root)
  16207. --- END Input Phase ---
  16208. --- Proposal Phase ---
  16209. --- Inner Elaboration Phase, active level 1 (S1) ---
  16210. Firing elaborate*copy-see-to-output-link
  16211. -->
  16212. (I3 ^see 1 +)
  16213. Firing elaborate*reward*based*on*reward
  16214. -->
  16215. (R1018 ^value 1 +)
  16216. (R1 ^reward R1018 +)
  16217. Firing propose*predict-yes
  16218. -->
  16219. (O2029 ^name predict-yes +)
  16220. (S1 ^operator O2029 +)
  16221. Firing propose*predict-no
  16222. -->
  16223. (O2030 ^name predict-no +)
  16224. (S1 ^operator O2030 +)
  16225. Firing rl*prefer*rvt*predict-no*H0*2
  16226. -->
  16227. (S1 ^operator O2028 = 1.)
  16228. Firing rl*prefer*rvt*predict-yes*H0*1
  16229. -->
  16230. (S1 ^operator O2027 = 0.)
  16231. Firing prefer*rvt*predict-yes*H0
  16232. -->
  16233. Firing prefer*rvt*predict-no*H0
  16234. -->
  16235. Firing elaborate*copy-dir-to-output-link
  16236. -->
  16237. (I3 ^dir U +)
  16238. inner elaboration loop at bottom goal.
  16239. Retracting elaborate*copy-see-to-output-link
  16240. -->
  16241. (I3 ^see 0 +)
  16242. Retracting propose*predict-no
  16243. -->
  16244. (O2028 ^name predict-no +)
  16245. (S1 ^operator O2028 +)
  16246. Retracting propose*predict-yes
  16247. -->
  16248. (O2027 ^name predict-yes +)
  16249. (S1 ^operator O2027 +)
  16250. Retracting elaborate*reward*based*on*reward
  16251. -->
  16252. (R1017 ^value 1 +)
  16253. (R1 ^reward R1017 +)
  16254. Retracting elaborate*copy-dir-to-output-link
  16255. -->
  16256. (I3 ^dir R +)
  16257. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16258. -->
  16259. (S1 ^operator O2028 = -0.2817060109291377)
  16260. Retracting rl*prefer*rvt*predict-no*H0*4
  16261. -->
  16262. (S1 ^operator O2028 = 0.3397723577617232)
  16263. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16264. -->
  16265. (S1 ^operator O2027 = 0.6623398483569007)
  16266. Retracting rl*prefer*rvt*predict-yes*H0*3
  16267. -->
  16268. (S1 ^operator O2027 = 0.3377117697451198)
  16269. =>WM: (14304: S1 ^operator O2030 +)
  16270. =>WM: (14303: S1 ^operator O2029 +)
  16271. =>WM: (14302: I3 ^dir U)
  16272. =>WM: (14301: O2030 ^name predict-no)
  16273. =>WM: (14300: O2029 ^name predict-yes)
  16274. =>WM: (14299: R1018 ^value 1)
  16275. =>WM: (14298: R1 ^reward R1018)
  16276. =>WM: (14297: I3 ^see 1)
  16277. <=WM: (14288: S1 ^operator O2027 +)
  16278. <=WM: (14290: S1 ^operator O2027)
  16279. <=WM: (14289: S1 ^operator O2028 +)
  16280. <=WM: (14287: I3 ^dir R)
  16281. <=WM: (14283: R1 ^reward R1017)
  16282. <=WM: (14282: I3 ^see 0)
  16283. <=WM: (14286: O2028 ^name predict-no)
  16284. <=WM: (14285: O2027 ^name predict-yes)
  16285. <=WM: (14284: R1017 ^value 1)
  16286. --- Inner Elaboration Phase, active level 1 (S1) ---
  16287. Firing prefer*rvt*predict-yes*H0
  16288. -->
  16289. Firing rl*prefer*rvt*predict-yes*H0*1
  16290. -->
  16291. (S1 ^operator O2029 = 0.)
  16292. Firing prefer*rvt*predict-no*H0
  16293. -->
  16294. Firing rl*prefer*rvt*predict-no*H0*2
  16295. -->
  16296. (S1 ^operator O2030 = 1.)
  16297. inner elaboration loop at bottom goal.
  16298. Retracting rl*prefer*rvt*predict-no*H0*2
  16299. -->
  16300. (S1 ^operator O2028 = 1.)
  16301. Retracting rl*prefer*rvt*predict-yes*H0*1
  16302. -->
  16303. (S1 ^operator O2027 = 0.)
  16304. --- END Proposal Phase ---
  16305. --- Decision Phase ---
  16306. RL update rl*prefer*rvt*predict-yes*H0*3 0.590112 -0.2524 0.337712 -> 0.590107 -0.2524 0.337708(R,m,v=1,0.900585,0.0900585)
  16307. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409947 0.252392 0.66234 -> 0.409942 0.252393 0.662335(R,m,v=1,1,0)
  16308. =>WM: (14305: S1 ^operator O2030)
  16309. 1015: O: O2030 (predict-no)
  16310. --- END Decision Phase ---
  16311. --- Application Phase ---
  16312. --- Firing Productions (PE) For State At Depth 1 ---
  16313. --- Inner Elaboration Phase, active level 1 (S1) ---
  16314. Firing apply*operator
  16315. -->
  16316. (I3 ^predict-no N1015 + :O )
  16317. Firing apply*operator*complete
  16318. -->
  16319. (I3 ^predict-yes N1014 - :O )
  16320. inner elaboration loop at bottom goal.
  16321. --- Change Working Memory (PE) ---
  16322. =>WM: (14306: I3 ^predict-no N1015)
  16323. <=WM: (14292: N1014 ^status complete)
  16324. <=WM: (14291: I3 ^predict-yes N1014)
  16325. --- Firing Productions (IE) For State At Depth 1 ---
  16326. --- Inner Elaboration Phase, active level 1 (S1) ---
  16327. Firing monitor*world
  16328. -->
  16329. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16330. --- Change Working Memory (IE) ---
  16331. --- END Application Phase ---
  16332. --- Output Phase ---
  16333. ENV: Agent did: predict-no for direction U in state State-B
  16334. In State-B moving U
  16335. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16336. predict error 0
  16337. dir: dir isR
  16338. --- END Output Phase ---
  16339. |\---- Input Phase ---
  16340. =>WM: (14310: I2 ^dir R)
  16341. =>WM: (14309: I2 ^reward 1)
  16342. =>WM: (14308: I2 ^see 0)
  16343. =>WM: (14307: N1015 ^status complete)
  16344. <=WM: (14295: I2 ^dir U)
  16345. <=WM: (14294: I2 ^reward 1)
  16346. <=WM: (14293: I2 ^see 1)
  16347. =>WM: (14311: I2 ^level-1 R1-root)
  16348. <=WM: (14296: I2 ^level-1 R1-root)
  16349. --- END Input Phase ---
  16350. --- Proposal Phase ---
  16351. --- Inner Elaboration Phase, active level 1 (S1) ---
  16352. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  16353. -->
  16354. (S1 ^operator O2029 = -0.1070236389116304)
  16355. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  16356. -->
  16357. (S1 ^operator O2030 = 0.6602423000156785)
  16358. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16359. -->
  16360. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16361. -->
  16362. Firing elaborate*copy-see-to-output-link
  16363. -->
  16364. (I3 ^see 0 +)
  16365. Firing elaborate*reward*based*on*reward
  16366. -->
  16367. (R1019 ^value 1 +)
  16368. (R1 ^reward R1019 +)
  16369. Firing propose*predict-yes
  16370. -->
  16371. (O2031 ^name predict-yes +)
  16372. (S1 ^operator O2031 +)
  16373. Firing propose*predict-no
  16374. -->
  16375. (O2032 ^name predict-no +)
  16376. (S1 ^operator O2032 +)
  16377. Firing rl*prefer*rvt*predict-no*H0*4
  16378. -->
  16379. (S1 ^operator O2030 = 0.3397723577617232)
  16380. Firing rl*prefer*rvt*predict-yes*H0*3
  16381. -->
  16382. (S1 ^operator O2029 = 0.3377075674551746)
  16383. Firing prefer*rvt*predict-yes*H0
  16384. -->
  16385. Firing prefer*rvt*predict-no*H0
  16386. -->
  16387. Firing elaborate*copy-dir-to-output-link
  16388. -->
  16389. (I3 ^dir R +)
  16390. inner elaboration loop at bottom goal.
  16391. Retracting elaborate*copy-see-to-output-link
  16392. -->
  16393. (I3 ^see 1 +)
  16394. Retracting propose*predict-no
  16395. -->
  16396. (O2030 ^name predict-no +)
  16397. (S1 ^operator O2030 +)
  16398. Retracting propose*predict-yes
  16399. -->
  16400. (O2029 ^name predict-yes +)
  16401. (S1 ^operator O2029 +)
  16402. Retracting elaborate*reward*based*on*reward
  16403. -->
  16404. (R1018 ^value 1 +)
  16405. (R1 ^reward R1018 +)
  16406. Retracting elaborate*copy-dir-to-output-link
  16407. -->
  16408. (I3 ^dir U +)
  16409. Retracting rl*prefer*rvt*predict-no*H0*2
  16410. -->
  16411. (S1 ^operator O2030 = 1.)
  16412. Retracting rl*prefer*rvt*predict-yes*H0*1
  16413. -->
  16414. (S1 ^operator O2029 = 0.)
  16415. =>WM: (14319: S1 ^operator O2032 +)
  16416. =>WM: (14318: S1 ^operator O2031 +)
  16417. =>WM: (14317: I3 ^dir R)
  16418. =>WM: (14316: O2032 ^name predict-no)
  16419. =>WM: (14315: O2031 ^name predict-yes)
  16420. =>WM: (14314: R1019 ^value 1)
  16421. =>WM: (14313: R1 ^reward R1019)
  16422. =>WM: (14312: I3 ^see 0)
  16423. <=WM: (14303: S1 ^operator O2029 +)
  16424. <=WM: (14304: S1 ^operator O2030 +)
  16425. <=WM: (14305: S1 ^operator O2030)
  16426. <=WM: (14302: I3 ^dir U)
  16427. <=WM: (14298: R1 ^reward R1018)
  16428. <=WM: (14297: I3 ^see 1)
  16429. <=WM: (14301: O2030 ^name predict-no)
  16430. <=WM: (14300: O2029 ^name predict-yes)
  16431. <=WM: (14299: R1018 ^value 1)
  16432. --- Inner Elaboration Phase, active level 1 (S1) ---
  16433. Firing prefer*rvt*predict-yes*H0
  16434. -->
  16435. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  16436. -->
  16437. (S1 ^operator O2031 = -0.1070236389116304)
  16438. Firing rl*prefer*rvt*predict-yes*H0*3
  16439. -->
  16440. (S1 ^operator O2031 = 0.3377075674551746)
  16441. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16442. -->
  16443. Firing prefer*rvt*predict-no*H0
  16444. -->
  16445. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  16446. -->
  16447. (S1 ^operator O2032 = 0.6602423000156785)
  16448. Firing rl*prefer*rvt*predict-no*H0*4
  16449. -->
  16450. (S1 ^operator O2032 = 0.3397723577617232)
  16451. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16452. -->
  16453. inner elaboration loop at bottom goal.
  16454. Retracting rl*prefer*rvt*predict-no*H0*4
  16455. -->
  16456. (S1 ^operator O2030 = 0.3397723577617232)
  16457. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  16458. -->
  16459. (S1 ^operator O2030 = 0.6602423000156785)
  16460. Retracting rl*prefer*rvt*predict-yes*H0*3
  16461. -->
  16462. (S1 ^operator O2029 = 0.3377075674551746)
  16463. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  16464. -->
  16465. (S1 ^operator O2029 = -0.1070236389116304)
  16466. --- END Proposal Phase ---
  16467. --- Decision Phase ---
  16468. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16469. =>WM: (14320: S1 ^operator O2032)
  16470. 1016: O: O2032 (predict-no)
  16471. --- END Decision Phase ---
  16472. --- Application Phase ---
  16473. --- Firing Productions (PE) For State At Depth 1 ---
  16474. --- Inner Elaboration Phase, active level 1 (S1) ---
  16475. Firing apply*operator
  16476. -->
  16477. (I3 ^predict-no N1016 + :O )
  16478. Firing apply*operator*complete
  16479. -->
  16480. (I3 ^predict-no N1015 - :O )
  16481. inner elaboration loop at bottom goal.
  16482. --- Change Working Memory (PE) ---
  16483. =>WM: (14321: I3 ^predict-no N1016)
  16484. <=WM: (14307: N1015 ^status complete)
  16485. <=WM: (14306: I3 ^predict-no N1015)
  16486. --- Firing Productions (IE) For State At Depth 1 ---
  16487. --- Inner Elaboration Phase, active level 1 (S1) ---
  16488. Firing monitor*world
  16489. -->
  16490. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16491. --- Change Working Memory (IE) ---
  16492. --- END Application Phase ---
  16493. --- Output Phase ---
  16494. ENV: Agent did: predict-no for direction R in state State-B
  16495. In State-B moving R
  16496. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16497. predict error 0
  16498. dir: dir isU
  16499. --- END Output Phase ---
  16500. /|--- Input Phase ---
  16501. =>WM: (14325: I2 ^dir U)
  16502. =>WM: (14324: I2 ^reward 1)
  16503. =>WM: (14323: I2 ^see 0)
  16504. =>WM: (14322: N1016 ^status complete)
  16505. <=WM: (14310: I2 ^dir R)
  16506. <=WM: (14309: I2 ^reward 1)
  16507. <=WM: (14308: I2 ^see 0)
  16508. =>WM: (14326: I2 ^level-1 R0-root)
  16509. <=WM: (14311: I2 ^level-1 R1-root)
  16510. --- END Input Phase ---
  16511. --- Proposal Phase ---
  16512. --- Inner Elaboration Phase, active level 1 (S1) ---
  16513. Firing elaborate*copy-see-to-output-link
  16514. -->
  16515. (I3 ^see 0 +)
  16516. Firing elaborate*reward*based*on*reward
  16517. -->
  16518. (R1020 ^value 1 +)
  16519. (R1 ^reward R1020 +)
  16520. Firing propose*predict-yes
  16521. -->
  16522. (O2033 ^name predict-yes +)
  16523. (S1 ^operator O2033 +)
  16524. Firing propose*predict-no
  16525. -->
  16526. (O2034 ^name predict-no +)
  16527. (S1 ^operator O2034 +)
  16528. Firing rl*prefer*rvt*predict-no*H0*2
  16529. -->
  16530. (S1 ^operator O2032 = 1.)
  16531. Firing rl*prefer*rvt*predict-yes*H0*1
  16532. -->
  16533. (S1 ^operator O2031 = 0.)
  16534. Firing prefer*rvt*predict-yes*H0
  16535. -->
  16536. Firing prefer*rvt*predict-no*H0
  16537. -->
  16538. Firing elaborate*copy-dir-to-output-link
  16539. -->
  16540. (I3 ^dir U +)
  16541. inner elaboration loop at bottom goal.
  16542. Retracting elaborate*copy-see-to-output-link
  16543. -->
  16544. (I3 ^see 0 +)
  16545. Retracting propose*predict-no
  16546. -->
  16547. (O2032 ^name predict-no +)
  16548. (S1 ^operator O2032 +)
  16549. Retracting propose*predict-yes
  16550. -->
  16551. (O2031 ^name predict-yes +)
  16552. (S1 ^operator O2031 +)
  16553. Retracting elaborate*reward*based*on*reward
  16554. -->
  16555. (R1019 ^value 1 +)
  16556. (R1 ^reward R1019 +)
  16557. Retracting elaborate*copy-dir-to-output-link
  16558. -->
  16559. (I3 ^dir R +)
  16560. Retracting rl*prefer*rvt*predict-no*H0*4
  16561. -->
  16562. (S1 ^operator O2032 = 0.3397723577617232)
  16563. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  16564. -->
  16565. (S1 ^operator O2032 = 0.6602423000156785)
  16566. Retracting rl*prefer*rvt*predict-yes*H0*3
  16567. -->
  16568. (S1 ^operator O2031 = 0.3377075674551746)
  16569. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  16570. -->
  16571. (S1 ^operator O2031 = -0.1070236389116304)
  16572. =>WM: (14333: S1 ^operator O2034 +)
  16573. =>WM: (14332: S1 ^operator O2033 +)
  16574. =>WM: (14331: I3 ^dir U)
  16575. =>WM: (14330: O2034 ^name predict-no)
  16576. =>WM: (14329: O2033 ^name predict-yes)
  16577. =>WM: (14328: R1020 ^value 1)
  16578. =>WM: (14327: R1 ^reward R1020)
  16579. <=WM: (14318: S1 ^operator O2031 +)
  16580. <=WM: (14319: S1 ^operator O2032 +)
  16581. <=WM: (14320: S1 ^operator O2032)
  16582. <=WM: (14317: I3 ^dir R)
  16583. <=WM: (14313: R1 ^reward R1019)
  16584. <=WM: (14316: O2032 ^name predict-no)
  16585. <=WM: (14315: O2031 ^name predict-yes)
  16586. <=WM: (14314: R1019 ^value 1)
  16587. --- Inner Elaboration Phase, active level 1 (S1) ---
  16588. Firing prefer*rvt*predict-yes*H0
  16589. -->
  16590. Firing rl*prefer*rvt*predict-yes*H0*1
  16591. -->
  16592. (S1 ^operator O2033 = 0.)
  16593. Firing prefer*rvt*predict-no*H0
  16594. -->
  16595. Firing rl*prefer*rvt*predict-no*H0*2
  16596. -->
  16597. (S1 ^operator O2034 = 1.)
  16598. inner elaboration loop at bottom goal.
  16599. Retracting rl*prefer*rvt*predict-no*H0*2
  16600. -->
  16601. (S1 ^operator O2032 = 1.)
  16602. Retracting rl*prefer*rvt*predict-yes*H0*1
  16603. -->
  16604. (S1 ^operator O2031 = 0.)
  16605. --- END Proposal Phase ---
  16606. --- Decision Phase ---
  16607. RL update rl*prefer*rvt*predict-no*H0*4 0.570256 -0.230484 0.339772 -> 0.570255 -0.230484 0.339771(R,m,v=1,0.876471,0.108911)
  16608. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429759 0.230483 0.660242 -> 0.429758 0.230483 0.660241(R,m,v=1,1,0)
  16609. =>WM: (14334: S1 ^operator O2034)
  16610. 1017: O: O2034 (predict-no)
  16611. --- END Decision Phase ---
  16612. --- Application Phase ---
  16613. --- Firing Productions (PE) For State At Depth 1 ---
  16614. --- Inner Elaboration Phase, active level 1 (S1) ---
  16615. Firing apply*operator
  16616. -->
  16617. (I3 ^predict-no N1017 + :O )
  16618. Firing apply*operator*complete
  16619. -->
  16620. (I3 ^predict-no N1016 - :O )
  16621. inner elaboration loop at bottom goal.
  16622. --- Change Working Memory (PE) ---
  16623. =>WM: (14335: I3 ^predict-no N1017)
  16624. <=WM: (14322: N1016 ^status complete)
  16625. <=WM: (14321: I3 ^predict-no N1016)
  16626. --- Firing Productions (IE) For State At Depth 1 ---
  16627. --- Inner Elaboration Phase, active level 1 (S1) ---
  16628. Firing monitor*world
  16629. -->
  16630. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16631. --- Change Working Memory (IE) ---
  16632. --- END Application Phase ---
  16633. --- Output Phase ---
  16634. ENV: Agent did: predict-no for direction U in state State-B
  16635. In State-B moving U
  16636. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16637. predict error 0
  16638. dir: dir isL
  16639. --- END Output Phase ---
  16640. \---- Input Phase ---
  16641. =>WM: (14339: I2 ^dir L)
  16642. =>WM: (14338: I2 ^reward 1)
  16643. =>WM: (14337: I2 ^see 0)
  16644. =>WM: (14336: N1017 ^status complete)
  16645. <=WM: (14325: I2 ^dir U)
  16646. <=WM: (14324: I2 ^reward 1)
  16647. <=WM: (14323: I2 ^see 0)
  16648. =>WM: (14340: I2 ^level-1 R0-root)
  16649. <=WM: (14326: I2 ^level-1 R0-root)
  16650. --- END Input Phase ---
  16651. --- Proposal Phase ---
  16652. --- Inner Elaboration Phase, active level 1 (S1) ---
  16653. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  16654. -->
  16655. (S1 ^operator O2033 = 0.7358542477906264)
  16656. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16657. -->
  16658. Firing elaborate*copy-see-to-output-link
  16659. -->
  16660. (I3 ^see 0 +)
  16661. Firing elaborate*reward*based*on*reward
  16662. -->
  16663. (R1021 ^value 1 +)
  16664. (R1 ^reward R1021 +)
  16665. Firing propose*predict-yes
  16666. -->
  16667. (O2035 ^name predict-yes +)
  16668. (S1 ^operator O2035 +)
  16669. Firing propose*predict-no
  16670. -->
  16671. (O2036 ^name predict-no +)
  16672. (S1 ^operator O2036 +)
  16673. Firing rl*prefer*rvt*predict-no*H0*6
  16674. -->
  16675. (S1 ^operator O2034 = 0.8843130604166486)
  16676. Firing rl*prefer*rvt*predict-yes*H0*5
  16677. -->
  16678. (S1 ^operator O2033 = 0.2639876601736543)
  16679. Firing prefer*rvt*predict-yes*H0
  16680. -->
  16681. Firing prefer*rvt*predict-no*H0
  16682. -->
  16683. Firing elaborate*copy-dir-to-output-link
  16684. -->
  16685. (I3 ^dir L +)
  16686. inner elaboration loop at bottom goal.
  16687. Retracting elaborate*copy-see-to-output-link
  16688. -->
  16689. (I3 ^see 0 +)
  16690. Retracting propose*predict-no
  16691. -->
  16692. (O2034 ^name predict-no +)
  16693. (S1 ^operator O2034 +)
  16694. Retracting propose*predict-yes
  16695. -->
  16696. (O2033 ^name predict-yes +)
  16697. (S1 ^operator O2033 +)
  16698. Retracting elaborate*reward*based*on*reward
  16699. -->
  16700. (R1020 ^value 1 +)
  16701. (R1 ^reward R1020 +)
  16702. Retracting elaborate*copy-dir-to-output-link
  16703. -->
  16704. (I3 ^dir U +)
  16705. Retracting rl*prefer*rvt*predict-no*H0*2
  16706. -->
  16707. (S1 ^operator O2034 = 1.)
  16708. Retracting rl*prefer*rvt*predict-yes*H0*1
  16709. -->
  16710. (S1 ^operator O2033 = 0.)
  16711. =>WM: (14347: S1 ^operator O2036 +)
  16712. =>WM: (14346: S1 ^operator O2035 +)
  16713. =>WM: (14345: I3 ^dir L)
  16714. =>WM: (14344: O2036 ^name predict-no)
  16715. =>WM: (14343: O2035 ^name predict-yes)
  16716. =>WM: (14342: R1021 ^value 1)
  16717. =>WM: (14341: R1 ^reward R1021)
  16718. <=WM: (14332: S1 ^operator O2033 +)
  16719. <=WM: (14333: S1 ^operator O2034 +)
  16720. <=WM: (14334: S1 ^operator O2034)
  16721. <=WM: (14331: I3 ^dir U)
  16722. <=WM: (14327: R1 ^reward R1020)
  16723. <=WM: (14330: O2034 ^name predict-no)
  16724. <=WM: (14329: O2033 ^name predict-yes)
  16725. <=WM: (14328: R1020 ^value 1)
  16726. --- Inner Elaboration Phase, active level 1 (S1) ---
  16727. Firing prefer*rvt*predict-yes*H0
  16728. -->
  16729. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  16730. -->
  16731. (S1 ^operator O2035 = 0.7358542477906264)
  16732. Firing rl*prefer*rvt*predict-yes*H0*5
  16733. -->
  16734. (S1 ^operator O2035 = 0.2639876601736543)
  16735. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16736. -->
  16737. Firing prefer*rvt*predict-no*H0
  16738. -->
  16739. Firing rl*prefer*rvt*predict-no*H0*6
  16740. -->
  16741. (S1 ^operator O2036 = 0.8843130604166486)
  16742. inner elaboration loop at bottom goal.
  16743. Retracting rl*prefer*rvt*predict-no*H0*6
  16744. -->
  16745. (S1 ^operator O2034 = 0.8843130604166486)
  16746. Retracting rl*prefer*rvt*predict-yes*H0*5
  16747. -->
  16748. (S1 ^operator O2033 = 0.2639876601736543)
  16749. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  16750. -->
  16751. (S1 ^operator O2033 = 0.7358542477906264)
  16752. --- END Proposal Phase ---
  16753. --- Decision Phase ---
  16754. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16755. =>WM: (14348: S1 ^operator O2035)
  16756. 1018: O: O2035 (predict-yes)
  16757. --- END Decision Phase ---
  16758. --- Application Phase ---
  16759. --- Firing Productions (PE) For State At Depth 1 ---
  16760. --- Inner Elaboration Phase, active level 1 (S1) ---
  16761. Firing apply*operator
  16762. -->
  16763. (I3 ^predict-yes N1018 + :O )
  16764. Firing apply*operator*complete
  16765. -->
  16766. (I3 ^predict-no N1017 - :O )
  16767. inner elaboration loop at bottom goal.
  16768. --- Change Working Memory (PE) ---
  16769. =>WM: (14349: I3 ^predict-yes N1018)
  16770. <=WM: (14336: N1017 ^status complete)
  16771. <=WM: (14335: I3 ^predict-no N1017)
  16772. --- Firing Productions (IE) For State At Depth 1 ---
  16773. --- Inner Elaboration Phase, active level 1 (S1) ---
  16774. Firing monitor*world
  16775. -->
  16776. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16777. --- Change Working Memory (IE) ---
  16778. --- END Application Phase ---
  16779. --- Output Phase ---
  16780. ENV: Agent did: predict-yes for direction L in state State-B
  16781. In State-B moving L
  16782. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  16783. predict error 0
  16784. dir: dir isL
  16785. --- END Output Phase ---
  16786. /|\--- Input Phase ---
  16787. =>WM: (14353: I2 ^dir L)
  16788. =>WM: (14352: I2 ^reward 1)
  16789. =>WM: (14351: I2 ^see 1)
  16790. =>WM: (14350: N1018 ^status complete)
  16791. <=WM: (14339: I2 ^dir L)
  16792. <=WM: (14338: I2 ^reward 1)
  16793. <=WM: (14337: I2 ^see 0)
  16794. =>WM: (14354: I2 ^level-1 L1-root)
  16795. <=WM: (14340: I2 ^level-1 R0-root)
  16796. --- END Input Phase ---
  16797. --- Proposal Phase ---
  16798. --- Inner Elaboration Phase, active level 1 (S1) ---
  16799. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16800. -->
  16801. (S1 ^operator O2035 = -0.181727099742844)
  16802. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16803. -->
  16804. Firing elaborate*copy-see-to-output-link
  16805. -->
  16806. (I3 ^see 1 +)
  16807. Firing elaborate*reward*based*on*reward
  16808. -->
  16809. (R1022 ^value 1 +)
  16810. (R1 ^reward R1022 +)
  16811. Firing propose*predict-yes
  16812. -->
  16813. (O2037 ^name predict-yes +)
  16814. (S1 ^operator O2037 +)
  16815. Firing propose*predict-no
  16816. -->
  16817. (O2038 ^name predict-no +)
  16818. (S1 ^operator O2038 +)
  16819. Firing rl*prefer*rvt*predict-no*H0*6
  16820. -->
  16821. (S1 ^operator O2036 = 0.8843130604166486)
  16822. Firing rl*prefer*rvt*predict-yes*H0*5
  16823. -->
  16824. (S1 ^operator O2035 = 0.2639876601736543)
  16825. Firing prefer*rvt*predict-yes*H0
  16826. -->
  16827. Firing prefer*rvt*predict-no*H0
  16828. -->
  16829. Firing elaborate*copy-dir-to-output-link
  16830. -->
  16831. (I3 ^dir L +)
  16832. inner elaboration loop at bottom goal.
  16833. Retracting elaborate*copy-see-to-output-link
  16834. -->
  16835. (I3 ^see 0 +)
  16836. Retracting propose*predict-no
  16837. -->
  16838. (O2036 ^name predict-no +)
  16839. (S1 ^operator O2036 +)
  16840. Retracting propose*predict-yes
  16841. -->
  16842. (O2035 ^name predict-yes +)
  16843. (S1 ^operator O2035 +)
  16844. Retracting elaborate*reward*based*on*reward
  16845. -->
  16846. (R1021 ^value 1 +)
  16847. (R1 ^reward R1021 +)
  16848. Retracting elaborate*copy-dir-to-output-link
  16849. -->
  16850. (I3 ^dir L +)
  16851. Retracting rl*prefer*rvt*predict-no*H0*6
  16852. -->
  16853. (S1 ^operator O2036 = 0.8843130604166486)
  16854. Retracting rl*prefer*rvt*predict-yes*H0*5
  16855. -->
  16856. (S1 ^operator O2035 = 0.2639876601736543)
  16857. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  16858. -->
  16859. (S1 ^operator O2035 = 0.7358542477906264)
  16860. =>WM: (14361: S1 ^operator O2038 +)
  16861. =>WM: (14360: S1 ^operator O2037 +)
  16862. =>WM: (14359: O2038 ^name predict-no)
  16863. =>WM: (14358: O2037 ^name predict-yes)
  16864. =>WM: (14357: R1022 ^value 1)
  16865. =>WM: (14356: R1 ^reward R1022)
  16866. =>WM: (14355: I3 ^see 1)
  16867. <=WM: (14346: S1 ^operator O2035 +)
  16868. <=WM: (14348: S1 ^operator O2035)
  16869. <=WM: (14347: S1 ^operator O2036 +)
  16870. <=WM: (14341: R1 ^reward R1021)
  16871. <=WM: (14312: I3 ^see 0)
  16872. <=WM: (14344: O2036 ^name predict-no)
  16873. <=WM: (14343: O2035 ^name predict-yes)
  16874. <=WM: (14342: R1021 ^value 1)
  16875. --- Inner Elaboration Phase, active level 1 (S1) ---
  16876. Firing prefer*rvt*predict-yes*H0
  16877. -->
  16878. Firing rl*prefer*rvt*predict-yes*H0*5
  16879. -->
  16880. (S1 ^operator O2037 = 0.2639876601736543)
  16881. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  16882. -->
  16883. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16884. -->
  16885. (S1 ^operator O2037 = -0.181727099742844)
  16886. Firing prefer*rvt*predict-no*H0
  16887. -->
  16888. Firing rl*prefer*rvt*predict-no*H0*6
  16889. -->
  16890. (S1 ^operator O2038 = 0.8843130604166486)
  16891. inner elaboration loop at bottom goal.
  16892. Retracting rl*prefer*rvt*predict-no*H0*6
  16893. -->
  16894. (S1 ^operator O2036 = 0.8843130604166486)
  16895. Retracting rl*prefer*rvt*predict-yes*H0*5
  16896. -->
  16897. (S1 ^operator O2035 = 0.2639876601736543)
  16898. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  16899. -->
  16900. (S1 ^operator O2035 = -0.181727099742844)
  16901. --- END Proposal Phase ---
  16902. --- Decision Phase ---
  16903. RL update rl*prefer*rvt*predict-yes*H0*5 0.554374 -0.290386 0.263988 -> 0.554386 -0.290386 0.264(R,m,v=1,0.879121,0.106854)
  16904. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445471 0.290384 0.735854 -> 0.445486 0.290384 0.73587(R,m,v=1,1,0)
  16905. =>WM: (14362: S1 ^operator O2038)
  16906. 1019: O: O2038 (predict-no)
  16907. --- END Decision Phase ---
  16908. --- Application Phase ---
  16909. --- Firing Productions (PE) For State At Depth 1 ---
  16910. --- Inner Elaboration Phase, active level 1 (S1) ---
  16911. Firing apply*operator
  16912. -->
  16913. (I3 ^predict-no N1019 + :O )
  16914. Firing apply*operator*complete
  16915. -->
  16916. (I3 ^predict-yes N1018 - :O )
  16917. inner elaboration loop at bottom goal.
  16918. --- Change Working Memory (PE) ---
  16919. =>WM: (14363: I3 ^predict-no N1019)
  16920. <=WM: (14350: N1018 ^status complete)
  16921. <=WM: (14349: I3 ^predict-yes N1018)
  16922. --- Firing Productions (IE) For State At Depth 1 ---
  16923. --- Inner Elaboration Phase, active level 1 (S1) ---
  16924. Firing monitor*world
  16925. -->
  16926. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16927. --- Change Working Memory (IE) ---
  16928. --- END Application Phase ---
  16929. --- Output Phase ---
  16930. ENV: Agent did: predict-no for direction L in state State-A
  16931. In State-A moving L
  16932. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16933. predict error 0
  16934. dir: dir isR
  16935. --- END Output Phase ---
  16936. -/--- Input Phase ---
  16937. =>WM: (14367: I2 ^dir R)
  16938. =>WM: (14366: I2 ^reward 1)
  16939. =>WM: (14365: I2 ^see 0)
  16940. =>WM: (14364: N1019 ^status complete)
  16941. <=WM: (14353: I2 ^dir L)
  16942. <=WM: (14352: I2 ^reward 1)
  16943. <=WM: (14351: I2 ^see 1)
  16944. =>WM: (14368: I2 ^level-1 L0-root)
  16945. <=WM: (14354: I2 ^level-1 L1-root)
  16946. --- END Input Phase ---
  16947. --- Proposal Phase ---
  16948. --- Inner Elaboration Phase, active level 1 (S1) ---
  16949. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  16950. -->
  16951. (S1 ^operator O2038 = -0.2817060109291377)
  16952. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  16953. -->
  16954. (S1 ^operator O2037 = 0.6623349441917961)
  16955. Firing prefer*rvt*predict-no*H0*4*v1*H1
  16956. -->
  16957. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  16958. -->
  16959. Firing elaborate*copy-see-to-output-link
  16960. -->
  16961. (I3 ^see 0 +)
  16962. Firing elaborate*reward*based*on*reward
  16963. -->
  16964. (R1023 ^value 1 +)
  16965. (R1 ^reward R1023 +)
  16966. Firing propose*predict-yes
  16967. -->
  16968. (O2039 ^name predict-yes +)
  16969. (S1 ^operator O2039 +)
  16970. Firing propose*predict-no
  16971. -->
  16972. (O2040 ^name predict-no +)
  16973. (S1 ^operator O2040 +)
  16974. Firing rl*prefer*rvt*predict-no*H0*4
  16975. -->
  16976. (S1 ^operator O2038 = 0.3397711633142888)
  16977. Firing rl*prefer*rvt*predict-yes*H0*3
  16978. -->
  16979. (S1 ^operator O2037 = 0.3377075674551746)
  16980. Firing prefer*rvt*predict-yes*H0
  16981. -->
  16982. Firing prefer*rvt*predict-no*H0
  16983. -->
  16984. Firing elaborate*copy-dir-to-output-link
  16985. -->
  16986. (I3 ^dir R +)
  16987. inner elaboration loop at bottom goal.
  16988. Retracting elaborate*copy-see-to-output-link
  16989. -->
  16990. (I3 ^see 1 +)
  16991. Retracting propose*predict-no
  16992. -->
  16993. (O2038 ^name predict-no +)
  16994. (S1 ^operator O2038 +)
  16995. Retracting propose*predict-yes
  16996. -->
  16997. (O2037 ^name predict-yes +)
  16998. (S1 ^operator O2037 +)
  16999. Retracting elaborate*reward*based*on*reward
  17000. -->
  17001. (R1022 ^value 1 +)
  17002. (R1 ^reward R1022 +)
  17003. Retracting elaborate*copy-dir-to-output-link
  17004. -->
  17005. (I3 ^dir L +)
  17006. Retracting rl*prefer*rvt*predict-no*H0*6
  17007. -->
  17008. (S1 ^operator O2038 = 0.8843130604166486)
  17009. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  17010. -->
  17011. (S1 ^operator O2037 = -0.181727099742844)
  17012. Retracting rl*prefer*rvt*predict-yes*H0*5
  17013. -->
  17014. (S1 ^operator O2037 = 0.2640004012975515)
  17015. =>WM: (14376: S1 ^operator O2040 +)
  17016. =>WM: (14375: S1 ^operator O2039 +)
  17017. =>WM: (14374: I3 ^dir R)
  17018. =>WM: (14373: O2040 ^name predict-no)
  17019. =>WM: (14372: O2039 ^name predict-yes)
  17020. =>WM: (14371: R1023 ^value 1)
  17021. =>WM: (14370: R1 ^reward R1023)
  17022. =>WM: (14369: I3 ^see 0)
  17023. <=WM: (14360: S1 ^operator O2037 +)
  17024. <=WM: (14361: S1 ^operator O2038 +)
  17025. <=WM: (14362: S1 ^operator O2038)
  17026. <=WM: (14345: I3 ^dir L)
  17027. <=WM: (14356: R1 ^reward R1022)
  17028. <=WM: (14355: I3 ^see 1)
  17029. <=WM: (14359: O2038 ^name predict-no)
  17030. <=WM: (14358: O2037 ^name predict-yes)
  17031. <=WM: (14357: R1022 ^value 1)
  17032. --- Inner Elaboration Phase, active level 1 (S1) ---
  17033. Firing prefer*rvt*predict-yes*H0
  17034. -->
  17035. Firing rl*prefer*rvt*predict-yes*H0*3
  17036. -->
  17037. (S1 ^operator O2039 = 0.3377075674551746)
  17038. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17039. -->
  17040. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17041. -->
  17042. (S1 ^operator O2039 = 0.6623349441917961)
  17043. Firing prefer*rvt*predict-no*H0
  17044. -->
  17045. Firing rl*prefer*rvt*predict-no*H0*4
  17046. -->
  17047. (S1 ^operator O2040 = 0.3397711633142888)
  17048. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17049. -->
  17050. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17051. -->
  17052. (S1 ^operator O2040 = -0.2817060109291377)
  17053. inner elaboration loop at bottom goal.
  17054. Retracting rl*prefer*rvt*predict-no*H0*4
  17055. -->
  17056. (S1 ^operator O2038 = 0.3397711633142888)
  17057. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17058. -->
  17059. (S1 ^operator O2038 = -0.2817060109291377)
  17060. Retracting rl*prefer*rvt*predict-yes*H0*3
  17061. -->
  17062. (S1 ^operator O2037 = 0.3377075674551746)
  17063. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17064. -->
  17065. (S1 ^operator O2037 = 0.6623349441917961)
  17066. --- END Proposal Phase ---
  17067. --- Decision Phase ---
  17068. RL update rl*prefer*rvt*predict-no*H0*6 0.884313 0 0.884313 -> 0.903476 0 0.903476(R,m,v=1,0.902597,0.0884899)
  17069. =>WM: (14377: S1 ^operator O2039)
  17070. 1020: O: O2039 (predict-yes)
  17071. --- END Decision Phase ---
  17072. --- Application Phase ---
  17073. --- Firing Productions (PE) For State At Depth 1 ---
  17074. --- Inner Elaboration Phase, active level 1 (S1) ---
  17075. Firing apply*operator
  17076. -->
  17077. (I3 ^predict-yes N1020 + :O )
  17078. Firing apply*operator*complete
  17079. -->
  17080. (I3 ^predict-no N1019 - :O )
  17081. inner elaboration loop at bottom goal.
  17082. --- Change Working Memory (PE) ---
  17083. =>WM: (14378: I3 ^predict-yes N1020)
  17084. <=WM: (14364: N1019 ^status complete)
  17085. <=WM: (14363: I3 ^predict-no N1019)
  17086. --- Firing Productions (IE) For State At Depth 1 ---
  17087. --- Inner Elaboration Phase, active level 1 (S1) ---
  17088. Firing monitor*world
  17089. -->
  17090. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17091. --- Change Working Memory (IE) ---
  17092. --- END Application Phase ---
  17093. --- Output Phase ---
  17094. ENV: Agent did: predict-yes for direction R in state State-A
  17095. In State-A moving R
  17096. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17097. predict error 0
  17098. dir: dir isU
  17099. --- END Output Phase ---
  17100. |\---- Input Phase ---
  17101. =>WM: (14382: I2 ^dir U)
  17102. =>WM: (14381: I2 ^reward 1)
  17103. =>WM: (14380: I2 ^see 1)
  17104. =>WM: (14379: N1020 ^status complete)
  17105. <=WM: (14367: I2 ^dir R)
  17106. <=WM: (14366: I2 ^reward 1)
  17107. <=WM: (14365: I2 ^see 0)
  17108. =>WM: (14383: I2 ^level-1 R1-root)
  17109. <=WM: (14368: I2 ^level-1 L0-root)
  17110. --- END Input Phase ---
  17111. --- Proposal Phase ---
  17112. --- Inner Elaboration Phase, active level 1 (S1) ---
  17113. Firing elaborate*copy-see-to-output-link
  17114. -->
  17115. (I3 ^see 1 +)
  17116. Firing elaborate*reward*based*on*reward
  17117. -->
  17118. (R1024 ^value 1 +)
  17119. (R1 ^reward R1024 +)
  17120. Firing propose*predict-yes
  17121. -->
  17122. (O2041 ^name predict-yes +)
  17123. (S1 ^operator O2041 +)
  17124. Firing propose*predict-no
  17125. -->
  17126. (O2042 ^name predict-no +)
  17127. (S1 ^operator O2042 +)
  17128. Firing rl*prefer*rvt*predict-no*H0*2
  17129. -->
  17130. (S1 ^operator O2040 = 1.)
  17131. Firing rl*prefer*rvt*predict-yes*H0*1
  17132. -->
  17133. (S1 ^operator O2039 = 0.)
  17134. Firing prefer*rvt*predict-yes*H0
  17135. -->
  17136. Firing prefer*rvt*predict-no*H0
  17137. -->
  17138. Firing elaborate*copy-dir-to-output-link
  17139. -->
  17140. (I3 ^dir U +)
  17141. inner elaboration loop at bottom goal.
  17142. Retracting elaborate*copy-see-to-output-link
  17143. -->
  17144. (I3 ^see 0 +)
  17145. Retracting propose*predict-no
  17146. -->
  17147. (O2040 ^name predict-no +)
  17148. (S1 ^operator O2040 +)
  17149. Retracting propose*predict-yes
  17150. -->
  17151. (O2039 ^name predict-yes +)
  17152. (S1 ^operator O2039 +)
  17153. Retracting elaborate*reward*based*on*reward
  17154. -->
  17155. (R1023 ^value 1 +)
  17156. (R1 ^reward R1023 +)
  17157. Retracting elaborate*copy-dir-to-output-link
  17158. -->
  17159. (I3 ^dir R +)
  17160. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  17161. -->
  17162. (S1 ^operator O2040 = -0.2817060109291377)
  17163. Retracting rl*prefer*rvt*predict-no*H0*4
  17164. -->
  17165. (S1 ^operator O2040 = 0.3397711633142888)
  17166. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  17167. -->
  17168. (S1 ^operator O2039 = 0.6623349441917961)
  17169. Retracting rl*prefer*rvt*predict-yes*H0*3
  17170. -->
  17171. (S1 ^operator O2039 = 0.3377075674551746)
  17172. =>WM: (14391: S1 ^operator O2042 +)
  17173. =>WM: (14390: S1 ^operator O2041 +)
  17174. =>WM: (14389: I3 ^dir U)
  17175. =>WM: (14388: O2042 ^name predict-no)
  17176. =>WM: (14387: O2041 ^name predict-yes)
  17177. =>WM: (14386: R1024 ^value 1)
  17178. =>WM: (14385: R1 ^reward R1024)
  17179. =>WM: (14384: I3 ^see 1)
  17180. <=WM: (14375: S1 ^operator O2039 +)
  17181. <=WM: (14377: S1 ^operator O2039)
  17182. <=WM: (14376: S1 ^operator O2040 +)
  17183. <=WM: (14374: I3 ^dir R)
  17184. <=WM: (14370: R1 ^reward R1023)
  17185. <=WM: (14369: I3 ^see 0)
  17186. <=WM: (14373: O2040 ^name predict-no)
  17187. <=WM: (14372: O2039 ^name predict-yes)
  17188. <=WM: (14371: R1023 ^value 1)
  17189. --- Inner Elaboration Phase, active level 1 (S1) ---
  17190. Firing prefer*rvt*predict-yes*H0
  17191. -->
  17192. Firing rl*prefer*rvt*predict-yes*H0*1
  17193. -->
  17194. (S1 ^operator O2041 = 0.)
  17195. Firing prefer*rvt*predict-no*H0
  17196. -->
  17197. Firing rl*prefer*rvt*predict-no*H0*2
  17198. -->
  17199. (S1 ^operator O2042 = 1.)
  17200. inner elaboration loop at bottom goal.
  17201. Retracting rl*prefer*rvt*predict-no*H0*2
  17202. -->
  17203. (S1 ^operator O2040 = 1.)
  17204. Retracting rl*prefer*rvt*predict-yes*H0*1
  17205. -->
  17206. (S1 ^operator O2039 = 0.)
  17207. --- END Proposal Phase ---
  17208. --- Decision Phase ---
  17209. RL update rl*prefer*rvt*predict-yes*H0*3 0.590107 -0.2524 0.337708 -> 0.590103 -0.252399 0.337704(R,m,v=1,0.901163,0.0895893)
  17210. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409942 0.252393 0.662335 -> 0.409937 0.252394 0.662331(R,m,v=1,1,0)
  17211. =>WM: (14392: S1 ^operator O2042)
  17212. 1021: O: O2042 (predict-no)
  17213. --- END Decision Phase ---
  17214. --- Application Phase ---
  17215. --- Firing Productions (PE) For State At Depth 1 ---
  17216. --- Inner Elaboration Phase, active level 1 (S1) ---
  17217. Firing apply*operator
  17218. -->
  17219. (I3 ^predict-no N1021 + :O )
  17220. Firing apply*operator*complete
  17221. -->
  17222. (I3 ^predict-yes N1020 - :O )
  17223. inner elaboration loop at bottom goal.
  17224. --- Change Working Memory (PE) ---
  17225. =>WM: (14393: I3 ^predict-no N1021)
  17226. <=WM: (14379: N1020 ^status complete)
  17227. <=WM: (14378: I3 ^predict-yes N1020)
  17228. --- Firing Productions (IE) For State At Depth 1 ---
  17229. --- Inner Elaboration Phase, active level 1 (S1) ---
  17230. Firing monitor*world
  17231. -->
  17232. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17233. --- Change Working Memory (IE) ---
  17234. --- END Application Phase ---
  17235. --- Output Phase ---
  17236. ENV: Agent did: predict-no for direction U in state State-B
  17237. In State-B moving U
  17238. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17239. predict error 0
  17240. dir: dir isU
  17241. --- END Output Phase ---
  17242. /--- Input Phase ---
  17243. =>WM: (14397: I2 ^dir U)
  17244. =>WM: (14396: I2 ^reward 1)
  17245. =>WM: (14395: I2 ^see 0)
  17246. =>WM: (14394: N1021 ^status complete)
  17247. <=WM: (14382: I2 ^dir U)
  17248. <=WM: (14381: I2 ^reward 1)
  17249. <=WM: (14380: I2 ^see 1)
  17250. =>WM: (14398: I2 ^level-1 R1-root)
  17251. <=WM: (14383: I2 ^level-1 R1-root)
  17252. --- END Input Phase ---
  17253. --- Proposal Phase ---
  17254. --- Inner Elaboration Phase, active level 1 (S1) ---
  17255. Firing elaborate*copy-see-to-output-link
  17256. -->
  17257. (I3 ^see 0 +)
  17258. Firing elaborate*reward*based*on*reward
  17259. -->
  17260. (R1025 ^value 1 +)
  17261. (R1 ^reward R1025 +)
  17262. Firing propose*predict-yes
  17263. -->
  17264. (O2043 ^name predict-yes +)
  17265. (S1 ^operator O2043 +)
  17266. Firing propose*predict-no
  17267. -->
  17268. (O2044 ^name predict-no +)
  17269. (S1 ^operator O2044 +)
  17270. Firing rl*prefer*rvt*predict-no*H0*2
  17271. -->
  17272. (S1 ^operator O2042 = 1.)
  17273. Firing rl*prefer*rvt*predict-yes*H0*1
  17274. -->
  17275. (S1 ^operator O2041 = 0.)
  17276. Firing prefer*rvt*predict-yes*H0
  17277. -->
  17278. Firing prefer*rvt*predict-no*H0
  17279. -->
  17280. Firing elaborate*copy-dir-to-output-link
  17281. -->
  17282. (I3 ^dir U +)
  17283. inner elaboration loop at bottom goal.
  17284. Retracting elaborate*copy-see-to-output-link
  17285. -->
  17286. (I3 ^see 1 +)
  17287. Retracting propose*predict-no
  17288. -->
  17289. (O2042 ^name predict-no +)
  17290. (S1 ^operator O2042 +)
  17291. Retracting propose*predict-yes
  17292. -->
  17293. (O2041 ^name predict-yes +)
  17294. (S1 ^operator O2041 +)
  17295. Retracting elaborate*reward*based*on*reward
  17296. -->
  17297. (R1024 ^value 1 +)
  17298. (R1 ^reward R1024 +)
  17299. Retracting elaborate*copy-dir-to-output-link
  17300. -->
  17301. (I3 ^dir U +)
  17302. Retracting rl*prefer*rvt*predict-no*H0*2
  17303. -->
  17304. (S1 ^operator O2042 = 1.)
  17305. Retracting rl*prefer*rvt*predict-yes*H0*1
  17306. -->
  17307. (S1 ^operator O2041 = 0.)
  17308. =>WM: (14405: S1 ^operator O2044 +)
  17309. =>WM: (14404: S1 ^operator O2043 +)
  17310. =>WM: (14403: O2044 ^name predict-no)
  17311. =>WM: (14402: O2043 ^name predict-yes)
  17312. =>WM: (14401: R1025 ^value 1)
  17313. =>WM: (14400: R1 ^reward R1025)
  17314. =>WM: (14399: I3 ^see 0)
  17315. <=WM: (14390: S1 ^operator O2041 +)
  17316. <=WM: (14391: S1 ^operator O2042 +)
  17317. <=WM: (14392: S1 ^operator O2042)
  17318. <=WM: (14385: R1 ^reward R1024)
  17319. <=WM: (14384: I3 ^see 1)
  17320. <=WM: (14388: O2042 ^name predict-no)
  17321. <=WM: (14387: O2041 ^name predict-yes)
  17322. <=WM: (14386: R1024 ^value 1)
  17323. --- Inner Elaboration Phase, active level 1 (S1) ---
  17324. Firing prefer*rvt*predict-yes*H0
  17325. -->
  17326. Firing rl*prefer*rvt*predict-yes*H0*1
  17327. -->
  17328. (S1 ^operator O2043 = 0.)
  17329. Firing prefer*rvt*predict-no*H0
  17330. -->
  17331. Firing rl*prefer*rvt*predict-no*H0*2
  17332. -->
  17333. (S1 ^operator O2044 = 1.)
  17334. inner elaboration loop at bottom goal.
  17335. Retracting rl*prefer*rvt*predict-no*H0*2
  17336. -->
  17337. (S1 ^operator O2042 = 1.)
  17338. Retracting rl*prefer*rvt*predict-yes*H0*1
  17339. -->
  17340. (S1 ^operator O2041 = 0.)
  17341. --- END Proposal Phase ---
  17342. --- Decision Phase ---
  17343. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17344. =>WM: (14406: S1 ^operator O2044)
  17345. 1022: O: O2044 (predict-no)
  17346. --- END Decision Phase ---
  17347. --- Application Phase ---
  17348. --- Firing Productions (PE) For State At Depth 1 ---
  17349. --- Inner Elaboration Phase, active level 1 (S1) ---
  17350. Firing apply*operator
  17351. -->
  17352. (I3 ^predict-no N1022 + :O )
  17353. Firing apply*operator*complete
  17354. -->
  17355. (I3 ^predict-no N1021 - :O )
  17356. inner elaboration loop at bottom goal.
  17357. --- Change Working Memory (PE) ---
  17358. =>WM: (14407: I3 ^predict-no N1022)
  17359. <=WM: (14394: N1021 ^status complete)
  17360. <=WM: (14393: I3 ^predict-no N1021)
  17361. --- Firing Productions (IE) For State At Depth 1 ---
  17362. --- Inner Elaboration Phase, active level 1 (S1) ---
  17363. Firing monitor*world
  17364. -->
  17365. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17366. --- Change Working Memory (IE) ---
  17367. --- END Application Phase ---
  17368. --- Output Phase ---
  17369. ENV: Agent did: predict-no for direction U in state State-B
  17370. In State-B moving U
  17371. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17372. predict error 0
  17373. dir: dir isU
  17374. --- END Output Phase ---
  17375. |\--- Input Phase ---
  17376. =>WM: (14411: I2 ^dir U)
  17377. =>WM: (14410: I2 ^reward 1)
  17378. =>WM: (14409: I2 ^see 0)
  17379. =>WM: (14408: N1022 ^status complete)
  17380. <=WM: (14397: I2 ^dir U)
  17381. <=WM: (14396: I2 ^reward 1)
  17382. <=WM: (14395: I2 ^see 0)
  17383. =>WM: (14412: I2 ^level-1 R1-root)
  17384. <=WM: (14398: I2 ^level-1 R1-root)
  17385. --- END Input Phase ---
  17386. --- Proposal Phase ---
  17387. --- Inner Elaboration Phase, active level 1 (S1) ---
  17388. Firing elaborate*copy-see-to-output-link
  17389. -->
  17390. (I3 ^see 0 +)
  17391. Firing elaborate*reward*based*on*reward
  17392. -->
  17393. (R1026 ^value 1 +)
  17394. (R1 ^reward R1026 +)
  17395. Firing propose*predict-yes
  17396. -->
  17397. (O2045 ^name predict-yes +)
  17398. (S1 ^operator O2045 +)
  17399. Firing propose*predict-no
  17400. -->
  17401. (O2046 ^name predict-no +)
  17402. (S1 ^operator O2046 +)
  17403. Firing rl*prefer*rvt*predict-no*H0*2
  17404. -->
  17405. (S1 ^operator O2044 = 1.)
  17406. Firing rl*prefer*rvt*predict-yes*H0*1
  17407. -->
  17408. (S1 ^operator O2043 = 0.)
  17409. Firing prefer*rvt*predict-yes*H0
  17410. -->
  17411. Firing prefer*rvt*predict-no*H0
  17412. -->
  17413. Firing elaborate*copy-dir-to-output-link
  17414. -->
  17415. (I3 ^dir U +)
  17416. inner elaboration loop at bottom goal.
  17417. Retracting elaborate*copy-see-to-output-link
  17418. -->
  17419. (I3 ^see 0 +)
  17420. Retracting propose*predict-no
  17421. -->
  17422. (O2044 ^name predict-no +)
  17423. (S1 ^operator O2044 +)
  17424. Retracting propose*predict-yes
  17425. -->
  17426. (O2043 ^name predict-yes +)
  17427. (S1 ^operator O2043 +)
  17428. Retracting elaborate*reward*based*on*reward
  17429. -->
  17430. (R1025 ^value 1 +)
  17431. (R1 ^reward R1025 +)
  17432. Retracting elaborate*copy-dir-to-output-link
  17433. -->
  17434. (I3 ^dir U +)
  17435. Retracting rl*prefer*rvt*predict-no*H0*2
  17436. -->
  17437. (S1 ^operator O2044 = 1.)
  17438. Retracting rl*prefer*rvt*predict-yes*H0*1
  17439. -->
  17440. (S1 ^operator O2043 = 0.)
  17441. =>WM: (14418: S1 ^operator O2046 +)
  17442. =>WM: (14417: S1 ^operator O2045 +)
  17443. =>WM: (14416: O2046 ^name predict-no)
  17444. =>WM: (14415: O2045 ^name predict-yes)
  17445. =>WM: (14414: R1026 ^value 1)
  17446. =>WM: (14413: R1 ^reward R1026)
  17447. <=WM: (14404: S1 ^operator O2043 +)
  17448. <=WM: (14405: S1 ^operator O2044 +)
  17449. <=WM: (14406: S1 ^operator O2044)
  17450. <=WM: (14400: R1 ^reward R1025)
  17451. <=WM: (14403: O2044 ^name predict-no)
  17452. <=WM: (14402: O2043 ^name predict-yes)
  17453. <=WM: (14401: R1025 ^value 1)
  17454. --- Inner Elaboration Phase, active level 1 (S1) ---
  17455. Firing prefer*rvt*predict-yes*H0
  17456. -->
  17457. Firing rl*prefer*rvt*predict-yes*H0*1
  17458. -->
  17459. (S1 ^operator O2045 = 0.)
  17460. Firing prefer*rvt*predict-no*H0
  17461. -->
  17462. Firing rl*prefer*rvt*predict-no*H0*2
  17463. -->
  17464. (S1 ^operator O2046 = 1.)
  17465. inner elaboration loop at bottom goal.
  17466. Retracting rl*prefer*rvt*predict-no*H0*2
  17467. -->
  17468. (S1 ^operator O2044 = 1.)
  17469. Retracting rl*prefer*rvt*predict-yes*H0*1
  17470. -->
  17471. (S1 ^operator O2043 = 0.)
  17472. --- END Proposal Phase ---
  17473. --- Decision Phase ---
  17474. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17475. =>WM: (14419: S1 ^operator O2046)
  17476. 1023: O: O2046 (predict-no)
  17477. --- END Decision Phase ---
  17478. --- Application Phase ---
  17479. --- Firing Productions (PE) For State At Depth 1 ---
  17480. --- Inner Elaboration Phase, active level 1 (S1) ---
  17481. Firing apply*operator
  17482. -->
  17483. (I3 ^predict-no N1023 + :O )
  17484. Firing apply*operator*complete
  17485. -->
  17486. (I3 ^predict-no N1022 - :O )
  17487. inner elaboration loop at bottom goal.
  17488. --- Change Working Memory (PE) ---
  17489. =>WM: (14420: I3 ^predict-no N1023)
  17490. <=WM: (14408: N1022 ^status complete)
  17491. <=WM: (14407: I3 ^predict-no N1022)
  17492. --- Firing Productions (IE) For State At Depth 1 ---
  17493. --- Inner Elaboration Phase, active level 1 (S1) ---
  17494. Firing monitor*world
  17495. -->
  17496. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17497. --- Change Working Memory (IE) ---
  17498. --- END Application Phase ---
  17499. --- Output Phase ---
  17500. ENV: Agent did: predict-no for direction U in state State-B
  17501. In State-B moving U
  17502. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17503. predict error 0
  17504. dir: dir isR
  17505. --- END Output Phase ---
  17506. -/|--- Input Phase ---
  17507. =>WM: (14424: I2 ^dir R)
  17508. =>WM: (14423: I2 ^reward 1)
  17509. =>WM: (14422: I2 ^see 0)
  17510. =>WM: (14421: N1023 ^status complete)
  17511. <=WM: (14411: I2 ^dir U)
  17512. <=WM: (14410: I2 ^reward 1)
  17513. <=WM: (14409: I2 ^see 0)
  17514. =>WM: (14425: I2 ^level-1 R1-root)
  17515. <=WM: (14412: I2 ^level-1 R1-root)
  17516. --- END Input Phase ---
  17517. --- Proposal Phase ---
  17518. --- Inner Elaboration Phase, active level 1 (S1) ---
  17519. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  17520. -->
  17521. (S1 ^operator O2045 = -0.1070236389116304)
  17522. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  17523. -->
  17524. (S1 ^operator O2046 = 0.6602409074003434)
  17525. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17526. -->
  17527. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17528. -->
  17529. Firing elaborate*copy-see-to-output-link
  17530. -->
  17531. (I3 ^see 0 +)
  17532. Firing elaborate*reward*based*on*reward
  17533. -->
  17534. (R1027 ^value 1 +)
  17535. (R1 ^reward R1027 +)
  17536. Firing propose*predict-yes
  17537. -->
  17538. (O2047 ^name predict-yes +)
  17539. (S1 ^operator O2047 +)
  17540. Firing propose*predict-no
  17541. -->
  17542. (O2048 ^name predict-no +)
  17543. (S1 ^operator O2048 +)
  17544. Firing rl*prefer*rvt*predict-no*H0*4
  17545. -->
  17546. (S1 ^operator O2046 = 0.3397711633142888)
  17547. Firing rl*prefer*rvt*predict-yes*H0*3
  17548. -->
  17549. (S1 ^operator O2045 = 0.3377041098150635)
  17550. Firing prefer*rvt*predict-yes*H0
  17551. -->
  17552. Firing prefer*rvt*predict-no*H0
  17553. -->
  17554. Firing elaborate*copy-dir-to-output-link
  17555. -->
  17556. (I3 ^dir R +)
  17557. inner elaboration loop at bottom goal.
  17558. Retracting elaborate*copy-see-to-output-link
  17559. -->
  17560. (I3 ^see 0 +)
  17561. Retracting propose*predict-no
  17562. -->
  17563. (O2046 ^name predict-no +)
  17564. (S1 ^operator O2046 +)
  17565. Retracting propose*predict-yes
  17566. -->
  17567. (O2045 ^name predict-yes +)
  17568. (S1 ^operator O2045 +)
  17569. Retracting elaborate*reward*based*on*reward
  17570. -->
  17571. (R1026 ^value 1 +)
  17572. (R1 ^reward R1026 +)
  17573. Retracting elaborate*copy-dir-to-output-link
  17574. -->
  17575. (I3 ^dir U +)
  17576. Retracting rl*prefer*rvt*predict-no*H0*2
  17577. -->
  17578. (S1 ^operator O2046 = 1.)
  17579. Retracting rl*prefer*rvt*predict-yes*H0*1
  17580. -->
  17581. (S1 ^operator O2045 = 0.)
  17582. =>WM: (14432: S1 ^operator O2048 +)
  17583. =>WM: (14431: S1 ^operator O2047 +)
  17584. =>WM: (14430: I3 ^dir R)
  17585. =>WM: (14429: O2048 ^name predict-no)
  17586. =>WM: (14428: O2047 ^name predict-yes)
  17587. =>WM: (14427: R1027 ^value 1)
  17588. =>WM: (14426: R1 ^reward R1027)
  17589. <=WM: (14417: S1 ^operator O2045 +)
  17590. <=WM: (14418: S1 ^operator O2046 +)
  17591. <=WM: (14419: S1 ^operator O2046)
  17592. <=WM: (14389: I3 ^dir U)
  17593. <=WM: (14413: R1 ^reward R1026)
  17594. <=WM: (14416: O2046 ^name predict-no)
  17595. <=WM: (14415: O2045 ^name predict-yes)
  17596. <=WM: (14414: R1026 ^value 1)
  17597. --- Inner Elaboration Phase, active level 1 (S1) ---
  17598. Firing prefer*rvt*predict-yes*H0
  17599. -->
  17600. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  17601. -->
  17602. (S1 ^operator O2047 = -0.1070236389116304)
  17603. Firing rl*prefer*rvt*predict-yes*H0*3
  17604. -->
  17605. (S1 ^operator O2047 = 0.3377041098150635)
  17606. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17607. -->
  17608. Firing prefer*rvt*predict-no*H0
  17609. -->
  17610. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  17611. -->
  17612. (S1 ^operator O2048 = 0.6602409074003434)
  17613. Firing rl*prefer*rvt*predict-no*H0*4
  17614. -->
  17615. (S1 ^operator O2048 = 0.3397711633142888)
  17616. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17617. -->
  17618. inner elaboration loop at bottom goal.
  17619. Retracting rl*prefer*rvt*predict-no*H0*4
  17620. -->
  17621. (S1 ^operator O2046 = 0.3397711633142888)
  17622. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  17623. -->
  17624. (S1 ^operator O2046 = 0.6602409074003434)
  17625. Retracting rl*prefer*rvt*predict-yes*H0*3
  17626. -->
  17627. (S1 ^operator O2045 = 0.3377041098150635)
  17628. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  17629. -->
  17630. (S1 ^operator O2045 = -0.1070236389116304)
  17631. --- END Proposal Phase ---
  17632. --- Decision Phase ---
  17633. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17634. =>WM: (14433: S1 ^operator O2048)
  17635. 1024: O: O2048 (predict-no)
  17636. --- END Decision Phase ---
  17637. --- Application Phase ---
  17638. --- Firing Productions (PE) For State At Depth 1 ---
  17639. --- Inner Elaboration Phase, active level 1 (S1) ---
  17640. Firing apply*operator
  17641. -->
  17642. (I3 ^predict-no N1024 + :O )
  17643. Firing apply*operator*complete
  17644. -->
  17645. (I3 ^predict-no N1023 - :O )
  17646. inner elaboration loop at bottom goal.
  17647. --- Change Working Memory (PE) ---
  17648. =>WM: (14434: I3 ^predict-no N1024)
  17649. <=WM: (14421: N1023 ^status complete)
  17650. <=WM: (14420: I3 ^predict-no N1023)
  17651. --- Firing Productions (IE) For State At Depth 1 ---
  17652. --- Inner Elaboration Phase, active level 1 (S1) ---
  17653. Firing monitor*world
  17654. -->
  17655. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17656. --- Change Working Memory (IE) ---
  17657. --- END Application Phase ---
  17658. --- Output Phase ---
  17659. ENV: Agent did: predict-no for direction R in state State-B
  17660. In State-B moving R
  17661. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17662. predict error 0
  17663. dir: dir isR
  17664. --- END Output Phase ---
  17665. \-/--- Input Phase ---
  17666. =>WM: (14438: I2 ^dir R)
  17667. =>WM: (14437: I2 ^reward 1)
  17668. =>WM: (14436: I2 ^see 0)
  17669. =>WM: (14435: N1024 ^status complete)
  17670. <=WM: (14424: I2 ^dir R)
  17671. <=WM: (14423: I2 ^reward 1)
  17672. <=WM: (14422: I2 ^see 0)
  17673. =>WM: (14439: I2 ^level-1 R0-root)
  17674. <=WM: (14425: I2 ^level-1 R1-root)
  17675. --- END Input Phase ---
  17676. --- Proposal Phase ---
  17677. --- Inner Elaboration Phase, active level 1 (S1) ---
  17678. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17679. -->
  17680. (S1 ^operator O2048 = 0.6601600921091451)
  17681. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17682. -->
  17683. (S1 ^operator O2047 = -0.1028953566115423)
  17684. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17685. -->
  17686. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17687. -->
  17688. Firing elaborate*copy-see-to-output-link
  17689. -->
  17690. (I3 ^see 0 +)
  17691. Firing elaborate*reward*based*on*reward
  17692. -->
  17693. (R1028 ^value 1 +)
  17694. (R1 ^reward R1028 +)
  17695. Firing propose*predict-yes
  17696. -->
  17697. (O2049 ^name predict-yes +)
  17698. (S1 ^operator O2049 +)
  17699. Firing propose*predict-no
  17700. -->
  17701. (O2050 ^name predict-no +)
  17702. (S1 ^operator O2050 +)
  17703. Firing rl*prefer*rvt*predict-no*H0*4
  17704. -->
  17705. (S1 ^operator O2048 = 0.3397711633142888)
  17706. Firing rl*prefer*rvt*predict-yes*H0*3
  17707. -->
  17708. (S1 ^operator O2047 = 0.3377041098150635)
  17709. Firing prefer*rvt*predict-yes*H0
  17710. -->
  17711. Firing prefer*rvt*predict-no*H0
  17712. -->
  17713. Firing elaborate*copy-dir-to-output-link
  17714. -->
  17715. (I3 ^dir R +)
  17716. inner elaboration loop at bottom goal.
  17717. Retracting elaborate*copy-see-to-output-link
  17718. -->
  17719. (I3 ^see 0 +)
  17720. Retracting propose*predict-no
  17721. -->
  17722. (O2048 ^name predict-no +)
  17723. (S1 ^operator O2048 +)
  17724. Retracting propose*predict-yes
  17725. -->
  17726. (O2047 ^name predict-yes +)
  17727. (S1 ^operator O2047 +)
  17728. Retracting elaborate*reward*based*on*reward
  17729. -->
  17730. (R1027 ^value 1 +)
  17731. (R1 ^reward R1027 +)
  17732. Retracting elaborate*copy-dir-to-output-link
  17733. -->
  17734. (I3 ^dir R +)
  17735. Retracting rl*prefer*rvt*predict-no*H0*4
  17736. -->
  17737. (S1 ^operator O2048 = 0.3397711633142888)
  17738. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  17739. -->
  17740. (S1 ^operator O2048 = 0.6602409074003434)
  17741. Retracting rl*prefer*rvt*predict-yes*H0*3
  17742. -->
  17743. (S1 ^operator O2047 = 0.3377041098150635)
  17744. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  17745. -->
  17746. (S1 ^operator O2047 = -0.1070236389116304)
  17747. =>WM: (14445: S1 ^operator O2050 +)
  17748. =>WM: (14444: S1 ^operator O2049 +)
  17749. =>WM: (14443: O2050 ^name predict-no)
  17750. =>WM: (14442: O2049 ^name predict-yes)
  17751. =>WM: (14441: R1028 ^value 1)
  17752. =>WM: (14440: R1 ^reward R1028)
  17753. <=WM: (14431: S1 ^operator O2047 +)
  17754. <=WM: (14432: S1 ^operator O2048 +)
  17755. <=WM: (14433: S1 ^operator O2048)
  17756. <=WM: (14426: R1 ^reward R1027)
  17757. <=WM: (14429: O2048 ^name predict-no)
  17758. <=WM: (14428: O2047 ^name predict-yes)
  17759. <=WM: (14427: R1027 ^value 1)
  17760. --- Inner Elaboration Phase, active level 1 (S1) ---
  17761. Firing prefer*rvt*predict-yes*H0
  17762. -->
  17763. Firing rl*prefer*rvt*predict-yes*H0*3
  17764. -->
  17765. (S1 ^operator O2049 = 0.3377041098150635)
  17766. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  17767. -->
  17768. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17769. -->
  17770. (S1 ^operator O2049 = -0.1028953566115423)
  17771. Firing prefer*rvt*predict-no*H0
  17772. -->
  17773. Firing rl*prefer*rvt*predict-no*H0*4
  17774. -->
  17775. (S1 ^operator O2050 = 0.3397711633142888)
  17776. Firing prefer*rvt*predict-no*H0*4*v1*H1
  17777. -->
  17778. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17779. -->
  17780. (S1 ^operator O2050 = 0.6601600921091451)
  17781. inner elaboration loop at bottom goal.
  17782. Retracting rl*prefer*rvt*predict-no*H0*4
  17783. -->
  17784. (S1 ^operator O2048 = 0.3397711633142888)
  17785. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17786. -->
  17787. (S1 ^operator O2048 = 0.6601600921091451)
  17788. Retracting rl*prefer*rvt*predict-yes*H0*3
  17789. -->
  17790. (S1 ^operator O2047 = 0.3377041098150635)
  17791. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17792. -->
  17793. (S1 ^operator O2047 = -0.1028953566115423)
  17794. --- END Proposal Phase ---
  17795. --- Decision Phase ---
  17796. RL update rl*prefer*rvt*predict-no*H0*4 0.570255 -0.230484 0.339771 -> 0.570254 -0.230483 0.33977(R,m,v=1,0.877193,0.108359)
  17797. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429758 0.230483 0.660241 -> 0.429757 0.230483 0.66024(R,m,v=1,1,0)
  17798. =>WM: (14446: S1 ^operator O2050)
  17799. 1025: O: O2050 (predict-no)
  17800. --- END Decision Phase ---
  17801. --- Application Phase ---
  17802. --- Firing Productions (PE) For State At Depth 1 ---
  17803. --- Inner Elaboration Phase, active level 1 (S1) ---
  17804. Firing apply*operator
  17805. -->
  17806. (I3 ^predict-no N1025 + :O )
  17807. Firing apply*operator*complete
  17808. -->
  17809. (I3 ^predict-no N1024 - :O )
  17810. inner elaboration loop at bottom goal.
  17811. --- Change Working Memory (PE) ---
  17812. =>WM: (14447: I3 ^predict-no N1025)
  17813. <=WM: (14435: N1024 ^status complete)
  17814. <=WM: (14434: I3 ^predict-no N1024)
  17815. --- Firing Productions (IE) For State At Depth 1 ---
  17816. --- Inner Elaboration Phase, active level 1 (S1) ---
  17817. Firing monitor*world
  17818. -->
  17819. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17820. --- Change Working Memory (IE) ---
  17821. --- END Application Phase ---
  17822. --- Output Phase ---
  17823. ENV: Agent did: predict-no for direction R in state State-B
  17824. In State-B moving R
  17825. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17826. predict error 0
  17827. dir: dir isU
  17828. --- END Output Phase ---
  17829. |\---- Input Phase ---
  17830. =>WM: (14451: I2 ^dir U)
  17831. =>WM: (14450: I2 ^reward 1)
  17832. =>WM: (14449: I2 ^see 0)
  17833. =>WM: (14448: N1025 ^status complete)
  17834. <=WM: (14438: I2 ^dir R)
  17835. <=WM: (14437: I2 ^reward 1)
  17836. <=WM: (14436: I2 ^see 0)
  17837. =>WM: (14452: I2 ^level-1 R0-root)
  17838. <=WM: (14439: I2 ^level-1 R0-root)
  17839. --- END Input Phase ---
  17840. --- Proposal Phase ---
  17841. --- Inner Elaboration Phase, active level 1 (S1) ---
  17842. Firing elaborate*copy-see-to-output-link
  17843. -->
  17844. (I3 ^see 0 +)
  17845. Firing elaborate*reward*based*on*reward
  17846. -->
  17847. (R1029 ^value 1 +)
  17848. (R1 ^reward R1029 +)
  17849. Firing propose*predict-yes
  17850. -->
  17851. (O2051 ^name predict-yes +)
  17852. (S1 ^operator O2051 +)
  17853. Firing propose*predict-no
  17854. -->
  17855. (O2052 ^name predict-no +)
  17856. (S1 ^operator O2052 +)
  17857. Firing rl*prefer*rvt*predict-no*H0*2
  17858. -->
  17859. (S1 ^operator O2050 = 1.)
  17860. Firing rl*prefer*rvt*predict-yes*H0*1
  17861. -->
  17862. (S1 ^operator O2049 = 0.)
  17863. Firing prefer*rvt*predict-yes*H0
  17864. -->
  17865. Firing prefer*rvt*predict-no*H0
  17866. -->
  17867. Firing elaborate*copy-dir-to-output-link
  17868. -->
  17869. (I3 ^dir U +)
  17870. inner elaboration loop at bottom goal.
  17871. Retracting elaborate*copy-see-to-output-link
  17872. -->
  17873. (I3 ^see 0 +)
  17874. Retracting propose*predict-no
  17875. -->
  17876. (O2050 ^name predict-no +)
  17877. (S1 ^operator O2050 +)
  17878. Retracting propose*predict-yes
  17879. -->
  17880. (O2049 ^name predict-yes +)
  17881. (S1 ^operator O2049 +)
  17882. Retracting elaborate*reward*based*on*reward
  17883. -->
  17884. (R1028 ^value 1 +)
  17885. (R1 ^reward R1028 +)
  17886. Retracting elaborate*copy-dir-to-output-link
  17887. -->
  17888. (I3 ^dir R +)
  17889. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  17890. -->
  17891. (S1 ^operator O2050 = 0.6601600921091451)
  17892. Retracting rl*prefer*rvt*predict-no*H0*4
  17893. -->
  17894. (S1 ^operator O2050 = 0.3397701806233191)
  17895. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  17896. -->
  17897. (S1 ^operator O2049 = -0.1028953566115423)
  17898. Retracting rl*prefer*rvt*predict-yes*H0*3
  17899. -->
  17900. (S1 ^operator O2049 = 0.3377041098150635)
  17901. =>WM: (14459: S1 ^operator O2052 +)
  17902. =>WM: (14458: S1 ^operator O2051 +)
  17903. =>WM: (14457: I3 ^dir U)
  17904. =>WM: (14456: O2052 ^name predict-no)
  17905. =>WM: (14455: O2051 ^name predict-yes)
  17906. =>WM: (14454: R1029 ^value 1)
  17907. =>WM: (14453: R1 ^reward R1029)
  17908. <=WM: (14444: S1 ^operator O2049 +)
  17909. <=WM: (14445: S1 ^operator O2050 +)
  17910. <=WM: (14446: S1 ^operator O2050)
  17911. <=WM: (14430: I3 ^dir R)
  17912. <=WM: (14440: R1 ^reward R1028)
  17913. <=WM: (14443: O2050 ^name predict-no)
  17914. <=WM: (14442: O2049 ^name predict-yes)
  17915. <=WM: (14441: R1028 ^value 1)
  17916. --- Inner Elaboration Phase, active level 1 (S1) ---
  17917. Firing prefer*rvt*predict-yes*H0
  17918. -->
  17919. Firing rl*prefer*rvt*predict-yes*H0*1
  17920. -->
  17921. (S1 ^operator O2051 = 0.)
  17922. Firing prefer*rvt*predict-no*H0
  17923. -->
  17924. Firing rl*prefer*rvt*predict-no*H0*2
  17925. -->
  17926. (S1 ^operator O2052 = 1.)
  17927. inner elaboration loop at bottom goal.
  17928. Retracting rl*prefer*rvt*predict-no*H0*2
  17929. -->
  17930. (S1 ^operator O2050 = 1.)
  17931. Retracting rl*prefer*rvt*predict-yes*H0*1
  17932. -->
  17933. (S1 ^operator O2049 = 0.)
  17934. --- END Proposal Phase ---
  17935. --- Decision Phase ---
  17936. RL update rl*prefer*rvt*predict-no*H0*4 0.570254 -0.230483 0.33977 -> 0.57026 -0.230484 0.339776(R,m,v=1,0.877907,0.107813)
  17937. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429673 0.230487 0.66016 -> 0.42968 0.230487 0.660167(R,m,v=1,1,0)
  17938. =>WM: (14460: S1 ^operator O2052)
  17939. 1026: O: O2052 (predict-no)
  17940. --- END Decision Phase ---
  17941. --- Application Phase ---
  17942. --- Firing Productions (PE) For State At Depth 1 ---
  17943. --- Inner Elaboration Phase, active level 1 (S1) ---
  17944. Firing apply*operator
  17945. -->
  17946. (I3 ^predict-no N1026 + :O )
  17947. Firing apply*operator*complete
  17948. -->
  17949. (I3 ^predict-no N1025 - :O )
  17950. inner elaboration loop at bottom goal.
  17951. --- Change Working Memory (PE) ---
  17952. =>WM: (14461: I3 ^predict-no N1026)
  17953. <=WM: (14448: N1025 ^status complete)
  17954. <=WM: (14447: I3 ^predict-no N1025)
  17955. --- Firing Productions (IE) For State At Depth 1 ---
  17956. --- Inner Elaboration Phase, active level 1 (S1) ---
  17957. Firing monitor*world
  17958. -->
  17959. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17960. --- Change Working Memory (IE) ---
  17961. --- END Application Phase ---
  17962. --- Output Phase ---
  17963. ENV: Agent did: predict-no for direction U in state State-B
  17964. In State-B moving U
  17965. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17966. predict error 0
  17967. dir: dir isL
  17968. --- END Output Phase ---
  17969. /|\--- Input Phase ---
  17970. =>WM: (14465: I2 ^dir L)
  17971. =>WM: (14464: I2 ^reward 1)
  17972. =>WM: (14463: I2 ^see 0)
  17973. =>WM: (14462: N1026 ^status complete)
  17974. <=WM: (14451: I2 ^dir U)
  17975. <=WM: (14450: I2 ^reward 1)
  17976. <=WM: (14449: I2 ^see 0)
  17977. =>WM: (14466: I2 ^level-1 R0-root)
  17978. <=WM: (14452: I2 ^level-1 R0-root)
  17979. --- END Input Phase ---
  17980. --- Proposal Phase ---
  17981. --- Inner Elaboration Phase, active level 1 (S1) ---
  17982. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  17983. -->
  17984. (S1 ^operator O2051 = 0.7358695241922053)
  17985. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  17986. -->
  17987. Firing elaborate*copy-see-to-output-link
  17988. -->
  17989. (I3 ^see 0 +)
  17990. Firing elaborate*reward*based*on*reward
  17991. -->
  17992. (R1030 ^value 1 +)
  17993. (R1 ^reward R1030 +)
  17994. Firing propose*predict-yes
  17995. -->
  17996. (O2053 ^name predict-yes +)
  17997. (S1 ^operator O2053 +)
  17998. Firing propose*predict-no
  17999. -->
  18000. (O2054 ^name predict-no +)
  18001. (S1 ^operator O2054 +)
  18002. Firing rl*prefer*rvt*predict-no*H0*6
  18003. -->
  18004. (S1 ^operator O2052 = 0.9034761957271857)
  18005. Firing rl*prefer*rvt*predict-yes*H0*5
  18006. -->
  18007. (S1 ^operator O2051 = 0.2640004012975515)
  18008. Firing prefer*rvt*predict-yes*H0
  18009. -->
  18010. Firing prefer*rvt*predict-no*H0
  18011. -->
  18012. Firing elaborate*copy-dir-to-output-link
  18013. -->
  18014. (I3 ^dir L +)
  18015. inner elaboration loop at bottom goal.
  18016. Retracting elaborate*copy-see-to-output-link
  18017. -->
  18018. (I3 ^see 0 +)
  18019. Retracting propose*predict-no
  18020. -->
  18021. (O2052 ^name predict-no +)
  18022. (S1 ^operator O2052 +)
  18023. Retracting propose*predict-yes
  18024. -->
  18025. (O2051 ^name predict-yes +)
  18026. (S1 ^operator O2051 +)
  18027. Retracting elaborate*reward*based*on*reward
  18028. -->
  18029. (R1029 ^value 1 +)
  18030. (R1 ^reward R1029 +)
  18031. Retracting elaborate*copy-dir-to-output-link
  18032. -->
  18033. (I3 ^dir U +)
  18034. Retracting rl*prefer*rvt*predict-no*H0*2
  18035. -->
  18036. (S1 ^operator O2052 = 1.)
  18037. Retracting rl*prefer*rvt*predict-yes*H0*1
  18038. -->
  18039. (S1 ^operator O2051 = 0.)
  18040. =>WM: (14473: S1 ^operator O2054 +)
  18041. =>WM: (14472: S1 ^operator O2053 +)
  18042. =>WM: (14471: I3 ^dir L)
  18043. =>WM: (14470: O2054 ^name predict-no)
  18044. =>WM: (14469: O2053 ^name predict-yes)
  18045. =>WM: (14468: R1030 ^value 1)
  18046. =>WM: (14467: R1 ^reward R1030)
  18047. <=WM: (14458: S1 ^operator O2051 +)
  18048. <=WM: (14459: S1 ^operator O2052 +)
  18049. <=WM: (14460: S1 ^operator O2052)
  18050. <=WM: (14457: I3 ^dir U)
  18051. <=WM: (14453: R1 ^reward R1029)
  18052. <=WM: (14456: O2052 ^name predict-no)
  18053. <=WM: (14455: O2051 ^name predict-yes)
  18054. <=WM: (14454: R1029 ^value 1)
  18055. --- Inner Elaboration Phase, active level 1 (S1) ---
  18056. Firing prefer*rvt*predict-yes*H0
  18057. -->
  18058. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  18059. -->
  18060. (S1 ^operator O2053 = 0.7358695241922053)
  18061. Firing rl*prefer*rvt*predict-yes*H0*5
  18062. -->
  18063. (S1 ^operator O2053 = 0.2640004012975515)
  18064. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  18065. -->
  18066. Firing prefer*rvt*predict-no*H0
  18067. -->
  18068. Firing rl*prefer*rvt*predict-no*H0*6
  18069. -->
  18070. (S1 ^operator O2054 = 0.9034761957271857)
  18071. inner elaboration loop at bottom goal.
  18072. Retracting rl*prefer*rvt*predict-no*H0*6
  18073. -->
  18074. (S1 ^operator O2052 = 0.9034761957271857)
  18075. Retracting rl*prefer*rvt*predict-yes*H0*5
  18076. -->
  18077. (S1 ^operator O2051 = 0.2640004012975515)
  18078. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  18079. -->
  18080. (S1 ^operator O2051 = 0.7358695241922053)
  18081. --- END Proposal Phase ---
  18082. --- Decision Phase ---
  18083. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18084. =>WM: (14474: S1 ^operator O2053)
  18085. 1027: O: O2053 (predict-yes)
  18086. --- END Decision Phase ---
  18087. --- Application Phase ---
  18088. --- Firing Productions (PE) For State At Depth 1 ---
  18089. --- Inner Elaboration Phase, active level 1 (S1) ---
  18090. Firing apply*operator
  18091. -->
  18092. (I3 ^predict-yes N1027 + :O )
  18093. Firing apply*operator*complete
  18094. -->
  18095. (I3 ^predict-no N1026 - :O )
  18096. inner elaboration loop at bottom goal.
  18097. --- Change Working Memory (PE) ---
  18098. =>WM: (14475: I3 ^predict-yes N1027)
  18099. <=WM: (14462: N1026 ^status complete)
  18100. <=WM: (14461: I3 ^predict-no N1026)
  18101. --- Firing Productions (IE) For State At Depth 1 ---
  18102. --- Inner Elaboration Phase, active level 1 (S1) ---
  18103. Firing monitor*world
  18104. -->
  18105. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18106. --- Change Working Memory (IE) ---
  18107. --- END Application Phase ---
  18108. --- Output Phase ---
  18109. ENV: Agent did: predict-yes for direction L in state State-B
  18110. In State-B moving L
  18111. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  18112. predict error 0
  18113. dir: dir isU
  18114. --- END Output Phase ---
  18115. -/|--- Input Phase ---
  18116. =>WM: (14479: I2 ^dir U)
  18117. =>WM: (14478: I2 ^reward 1)
  18118. =>WM: (14477: I2 ^see 1)
  18119. =>WM: (14476: N1027 ^status complete)
  18120. <=WM: (14465: I2 ^dir L)
  18121. <=WM: (14464: I2 ^reward 1)
  18122. <=WM: (14463: I2 ^see 0)
  18123. =>WM: (14480: I2 ^level-1 L1-root)
  18124. <=WM: (14466: I2 ^level-1 R0-root)
  18125. --- END Input Phase ---
  18126. --- Proposal Phase ---
  18127. --- Inner Elaboration Phase, active level 1 (S1) ---
  18128. Firing elaborate*copy-see-to-output-link
  18129. -->
  18130. (I3 ^see 1 +)
  18131. Firing elaborate*reward*based*on*reward
  18132. -->
  18133. (R1031 ^value 1 +)
  18134. (R1 ^reward R1031 +)
  18135. Firing propose*predict-yes
  18136. -->
  18137. (O2055 ^name predict-yes +)
  18138. (S1 ^operator O2055 +)
  18139. Firing propose*predict-no
  18140. -->
  18141. (O2056 ^name predict-no +)
  18142. (S1 ^operator O2056 +)
  18143. Firing rl*prefer*rvt*predict-no*H0*2
  18144. -->
  18145. (S1 ^operator O2054 = 1.)
  18146. Firing rl*prefer*rvt*predict-yes*H0*1
  18147. -->
  18148. (S1 ^operator O2053 = 0.)
  18149. Firing prefer*rvt*predict-yes*H0
  18150. -->
  18151. Firing prefer*rvt*predict-no*H0
  18152. -->
  18153. Firing elaborate*copy-dir-to-output-link
  18154. -->
  18155. (I3 ^dir U +)
  18156. inner elaboration loop at bottom goal.
  18157. Retracting elaborate*copy-see-to-output-link
  18158. -->
  18159. (I3 ^see 0 +)
  18160. Retracting propose*predict-no
  18161. -->
  18162. (O2054 ^name predict-no +)
  18163. (S1 ^operator O2054 +)
  18164. Retracting propose*predict-yes
  18165. -->
  18166. (O2053 ^name predict-yes +)
  18167. (S1 ^operator O2053 +)
  18168. Retracting elaborate*reward*based*on*reward
  18169. -->
  18170. (R1030 ^value 1 +)
  18171. (R1 ^reward R1030 +)
  18172. Retracting elaborate*copy-dir-to-output-link
  18173. -->
  18174. (I3 ^dir L +)
  18175. Retracting rl*prefer*rvt*predict-no*H0*6
  18176. -->
  18177. (S1 ^operator O2054 = 0.9034761957271857)
  18178. Retracting rl*prefer*rvt*predict-yes*H0*5
  18179. -->
  18180. (S1 ^operator O2053 = 0.2640004012975515)
  18181. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  18182. -->
  18183. (S1 ^operator O2053 = 0.7358695241922053)
  18184. =>WM: (14488: S1 ^operator O2056 +)
  18185. =>WM: (14487: S1 ^operator O2055 +)
  18186. =>WM: (14486: I3 ^dir U)
  18187. =>WM: (14485: O2056 ^name predict-no)
  18188. =>WM: (14484: O2055 ^name predict-yes)
  18189. =>WM: (14483: R1031 ^value 1)
  18190. =>WM: (14482: R1 ^reward R1031)
  18191. =>WM: (14481: I3 ^see 1)
  18192. <=WM: (14472: S1 ^operator O2053 +)
  18193. <=WM: (14474: S1 ^operator O2053)
  18194. <=WM: (14473: S1 ^operator O2054 +)
  18195. <=WM: (14471: I3 ^dir L)
  18196. <=WM: (14467: R1 ^reward R1030)
  18197. <=WM: (14399: I3 ^see 0)
  18198. <=WM: (14470: O2054 ^name predict-no)
  18199. <=WM: (14469: O2053 ^name predict-yes)
  18200. <=WM: (14468: R1030 ^value 1)
  18201. --- Inner Elaboration Phase, active level 1 (S1) ---
  18202. Firing prefer*rvt*predict-yes*H0
  18203. -->
  18204. Firing rl*prefer*rvt*predict-yes*H0*1
  18205. -->
  18206. (S1 ^operator O2055 = 0.)
  18207. Firing prefer*rvt*predict-no*H0
  18208. -->
  18209. Firing rl*prefer*rvt*predict-no*H0*2
  18210. -->
  18211. (S1 ^operator O2056 = 1.)
  18212. inner elaboration loop at bottom goal.
  18213. Retracting rl*prefer*rvt*predict-no*H0*2
  18214. -->
  18215. (S1 ^operator O2054 = 1.)
  18216. Retracting rl*prefer*rvt*predict-yes*H0*1
  18217. -->
  18218. (S1 ^operator O2053 = 0.)
  18219. --- END Proposal Phase ---
  18220. --- Decision Phase ---
  18221. RL update rl*prefer*rvt*predict-yes*H0*5 0.554386 -0.290386 0.264 -> 0.554397 -0.290386 0.264011(R,m,v=1,0.879781,0.106347)
  18222. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445486 0.290384 0.73587 -> 0.445498 0.290384 0.735882(R,m,v=1,1,0)
  18223. =>WM: (14489: S1 ^operator O2056)
  18224. 1028: O: O2056 (predict-no)
  18225. --- END Decision Phase ---
  18226. --- Application Phase ---
  18227. --- Firing Productions (PE) For State At Depth 1 ---
  18228. --- Inner Elaboration Phase, active level 1 (S1) ---
  18229. Firing apply*operator
  18230. -->
  18231. (I3 ^predict-no N1028 + :O )
  18232. Firing apply*operator*complete
  18233. -->
  18234. (I3 ^predict-yes N1027 - :O )
  18235. inner elaboration loop at bottom goal.
  18236. --- Change Working Memory (PE) ---
  18237. =>WM: (14490: I3 ^predict-no N1028)
  18238. <=WM: (14476: N1027 ^status complete)
  18239. <=WM: (14475: I3 ^predict-yes N1027)
  18240. --- Firing Productions (IE) For State At Depth 1 ---
  18241. --- Inner Elaboration Phase, active level 1 (S1) ---
  18242. Firing monitor*world
  18243. -->
  18244. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18245. --- Change Working Memory (IE) ---
  18246. --- END Application Phase ---
  18247. --- Output Phase ---
  18248. ENV: Agent did: predict-no for direction U in state State-A
  18249. In State-A moving U
  18250. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18251. predict error 0
  18252. dir: dir isU
  18253. --- END Output Phase ---
  18254. \-/--- Input Phase ---
  18255. =>WM: (14494: I2 ^dir U)
  18256. =>WM: (14493: I2 ^reward 1)
  18257. =>WM: (14492: I2 ^see 0)
  18258. =>WM: (14491: N1028 ^status complete)
  18259. <=WM: (14479: I2 ^dir U)
  18260. <=WM: (14478: I2 ^reward 1)
  18261. <=WM: (14477: I2 ^see 1)
  18262. =>WM: (14495: I2 ^level-1 L1-root)
  18263. <=WM: (14480: I2 ^level-1 L1-root)
  18264. --- END Input Phase ---
  18265. --- Proposal Phase ---
  18266. --- Inner Elaboration Phase, active level 1 (S1) ---
  18267. Firing elaborate*copy-see-to-output-link
  18268. -->
  18269. (I3 ^see 0 +)
  18270. Firing elaborate*reward*based*on*reward
  18271. -->
  18272. (R1032 ^value 1 +)
  18273. (R1 ^reward R1032 +)
  18274. Firing propose*predict-yes
  18275. -->
  18276. (O2057 ^name predict-yes +)
  18277. (S1 ^operator O2057 +)
  18278. Firing propose*predict-no
  18279. -->
  18280. (O2058 ^name predict-no +)
  18281. (S1 ^operator O2058 +)
  18282. Firing rl*prefer*rvt*predict-no*H0*2
  18283. -->
  18284. (S1 ^operator O2056 = 1.)
  18285. Firing rl*prefer*rvt*predict-yes*H0*1
  18286. -->
  18287. (S1 ^operator O2055 = 0.)
  18288. Firing prefer*rvt*predict-yes*H0
  18289. -->
  18290. Firing prefer*rvt*predict-no*H0
  18291. -->
  18292. Firing elaborate*copy-dir-to-output-link
  18293. -->
  18294. (I3 ^dir U +)
  18295. inner elaboration loop at bottom goal.
  18296. Retracting elaborate*copy-see-to-output-link
  18297. -->
  18298. (I3 ^see 1 +)
  18299. Retracting propose*predict-no
  18300. -->
  18301. (O2056 ^name predict-no +)
  18302. (S1 ^operator O2056 +)
  18303. Retracting propose*predict-yes
  18304. -->
  18305. (O2055 ^name predict-yes +)
  18306. (S1 ^operator O2055 +)
  18307. Retracting elaborate*reward*based*on*reward
  18308. -->
  18309. (R1031 ^value 1 +)
  18310. (R1 ^reward R1031 +)
  18311. Retracting elaborate*copy-dir-to-output-link
  18312. -->
  18313. (I3 ^dir U +)
  18314. Retracting rl*prefer*rvt*predict-no*H0*2
  18315. -->
  18316. (S1 ^operator O2056 = 1.)
  18317. Retracting rl*prefer*rvt*predict-yes*H0*1
  18318. -->
  18319. (S1 ^operator O2055 = 0.)
  18320. =>WM: (14502: S1 ^operator O2058 +)
  18321. =>WM: (14501: S1 ^operator O2057 +)
  18322. =>WM: (14500: O2058 ^name predict-no)
  18323. =>WM: (14499: O2057 ^name predict-yes)
  18324. =>WM: (14498: R1032 ^value 1)
  18325. =>WM: (14497: R1 ^reward R1032)
  18326. =>WM: (14496: I3 ^see 0)
  18327. <=WM: (14487: S1 ^operator O2055 +)
  18328. <=WM: (14488: S1 ^operator O2056 +)
  18329. <=WM: (14489: S1 ^operator O2056)
  18330. <=WM: (14482: R1 ^reward R1031)
  18331. <=WM: (14481: I3 ^see 1)
  18332. <=WM: (14485: O2056 ^name predict-no)
  18333. <=WM: (14484: O2055 ^name predict-yes)
  18334. <=WM: (14483: R1031 ^value 1)
  18335. --- Inner Elaboration Phase, active level 1 (S1) ---
  18336. Firing prefer*rvt*predict-yes*H0
  18337. -->
  18338. Firing rl*prefer*rvt*predict-yes*H0*1
  18339. -->
  18340. (S1 ^operator O2057 = 0.)
  18341. Firing prefer*rvt*predict-no*H0
  18342. -->
  18343. Firing rl*prefer*rvt*predict-no*H0*2
  18344. -->
  18345. (S1 ^operator O2058 = 1.)
  18346. inner elaboration loop at bottom goal.
  18347. Retracting rl*prefer*rvt*predict-no*H0*2
  18348. -->
  18349. (S1 ^operator O2056 = 1.)
  18350. Retracting rl*prefer*rvt*predict-yes*H0*1
  18351. -->
  18352. (S1 ^operator O2055 = 0.)
  18353. --- END Proposal Phase ---
  18354. --- Decision Phase ---
  18355. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18356. =>WM: (14503: S1 ^operator O2058)
  18357. 1029: O: O2058 (predict-no)
  18358. --- END Decision Phase ---
  18359. --- Application Phase ---
  18360. --- Firing Productions (PE) For State At Depth 1 ---
  18361. --- Inner Elaboration Phase, active level 1 (S1) ---
  18362. Firing apply*operator
  18363. -->
  18364. (I3 ^predict-no N1029 + :O )
  18365. Firing apply*operator*complete
  18366. -->
  18367. (I3 ^predict-no N1028 - :O )
  18368. inner elaboration loop at bottom goal.
  18369. --- Change Working Memory (PE) ---
  18370. =>WM: (14504: I3 ^predict-no N1029)
  18371. <=WM: (14491: N1028 ^status complete)
  18372. <=WM: (14490: I3 ^predict-no N1028)
  18373. --- Firing Productions (IE) For State At Depth 1 ---
  18374. --- Inner Elaboration Phase, active level 1 (S1) ---
  18375. Firing monitor*world
  18376. -->
  18377. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18378. --- Change Working Memory (IE) ---
  18379. --- END Application Phase ---
  18380. --- Output Phase ---
  18381. ENV: Agent did: predict-no for direction U in state State-A
  18382. In State-A moving U
  18383. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18384. predict error 0
  18385. dir: dir isU
  18386. --- END Output Phase ---
  18387. |\---- Input Phase ---
  18388. =>WM: (14508: I2 ^dir U)
  18389. =>WM: (14507: I2 ^reward 1)
  18390. =>WM: (14506: I2 ^see 0)
  18391. =>WM: (14505: N1029 ^status complete)
  18392. <=WM: (14494: I2 ^dir U)
  18393. <=WM: (14493: I2 ^reward 1)
  18394. <=WM: (14492: I2 ^see 0)
  18395. =>WM: (14509: I2 ^level-1 L1-root)
  18396. <=WM: (14495: I2 ^level-1 L1-root)
  18397. --- END Input Phase ---
  18398. --- Proposal Phase ---
  18399. --- Inner Elaboration Phase, active level 1 (S1) ---
  18400. Firing elaborate*copy-see-to-output-link
  18401. -->
  18402. (I3 ^see 0 +)
  18403. Firing elaborate*reward*based*on*reward
  18404. -->
  18405. (R1033 ^value 1 +)
  18406. (R1 ^reward R1033 +)
  18407. Firing propose*predict-yes
  18408. -->
  18409. (O2059 ^name predict-yes +)
  18410. (S1 ^operator O2059 +)
  18411. Firing propose*predict-no
  18412. -->
  18413. (O2060 ^name predict-no +)
  18414. (S1 ^operator O2060 +)
  18415. Firing rl*prefer*rvt*predict-no*H0*2
  18416. -->
  18417. (S1 ^operator O2058 = 1.)
  18418. Firing rl*prefer*rvt*predict-yes*H0*1
  18419. -->
  18420. (S1 ^operator O2057 = 0.)
  18421. Firing prefer*rvt*predict-yes*H0
  18422. -->
  18423. Firing prefer*rvt*predict-no*H0
  18424. -->
  18425. Firing elaborate*copy-dir-to-output-link
  18426. -->
  18427. (I3 ^dir U +)
  18428. inner elaboration loop at bottom goal.
  18429. Retracting elaborate*copy-see-to-output-link
  18430. -->
  18431. (I3 ^see 0 +)
  18432. Retracting propose*predict-no
  18433. -->
  18434. (O2058 ^name predict-no +)
  18435. (S1 ^operator O2058 +)
  18436. Retracting propose*predict-yes
  18437. -->
  18438. (O2057 ^name predict-yes +)
  18439. (S1 ^operator O2057 +)
  18440. Retracting elaborate*reward*based*on*reward
  18441. -->
  18442. (R1032 ^value 1 +)
  18443. (R1 ^reward R1032 +)
  18444. Retracting elaborate*copy-dir-to-output-link
  18445. -->
  18446. (I3 ^dir U +)
  18447. Retracting rl*prefer*rvt*predict-no*H0*2
  18448. -->
  18449. (S1 ^operator O2058 = 1.)
  18450. Retracting rl*prefer*rvt*predict-yes*H0*1
  18451. -->
  18452. (S1 ^operator O2057 = 0.)
  18453. =>WM: (14515: S1 ^operator O2060 +)
  18454. =>WM: (14514: S1 ^operator O2059 +)
  18455. =>WM: (14513: O2060 ^name predict-no)
  18456. =>WM: (14512: O2059 ^name predict-yes)
  18457. =>WM: (14511: R1033 ^value 1)
  18458. =>WM: (14510: R1 ^reward R1033)
  18459. <=WM: (14501: S1 ^operator O2057 +)
  18460. <=WM: (14502: S1 ^operator O2058 +)
  18461. <=WM: (14503: S1 ^operator O2058)
  18462. <=WM: (14497: R1 ^reward R1032)
  18463. <=WM: (14500: O2058 ^name predict-no)
  18464. <=WM: (14499: O2057 ^name predict-yes)
  18465. <=WM: (14498: R1032 ^value 1)
  18466. --- Inner Elaboration Phase, active level 1 (S1) ---
  18467. Firing prefer*rvt*predict-yes*H0
  18468. -->
  18469. Firing rl*prefer*rvt*predict-yes*H0*1
  18470. -->
  18471. (S1 ^operator O2059 = 0.)
  18472. Firing prefer*rvt*predict-no*H0
  18473. -->
  18474. Firing rl*prefer*rvt*predict-no*H0*2
  18475. -->
  18476. (S1 ^operator O2060 = 1.)
  18477. inner elaboration loop at bottom goal.
  18478. Retracting rl*prefer*rvt*predict-no*H0*2
  18479. -->
  18480. (S1 ^operator O2058 = 1.)
  18481. Retracting rl*prefer*rvt*predict-yes*H0*1
  18482. -->
  18483. (S1 ^operator O2057 = 0.)
  18484. --- END Proposal Phase ---
  18485. --- Decision Phase ---
  18486. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18487. =>WM: (14516: S1 ^operator O2060)
  18488. 1030: O: O2060 (predict-no)
  18489. --- END Decision Phase ---
  18490. --- Application Phase ---
  18491. --- Firing Productions (PE) For State At Depth 1 ---
  18492. --- Inner Elaboration Phase, active level 1 (S1) ---
  18493. Firing apply*operator
  18494. -->
  18495. (I3 ^predict-no N1030 + :O )
  18496. Firing apply*operator*complete
  18497. -->
  18498. (I3 ^predict-no N1029 - :O )
  18499. inner elaboration loop at bottom goal.
  18500. --- Change Working Memory (PE) ---
  18501. =>WM: (14517: I3 ^predict-no N1030)
  18502. <=WM: (14505: N1029 ^status complete)
  18503. <=WM: (14504: I3 ^predict-no N1029)
  18504. --- Firing Productions (IE) For State At Depth 1 ---
  18505. --- Inner Elaboration Phase, active level 1 (S1) ---
  18506. Firing monitor*world
  18507. -->
  18508. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18509. --- Change Working Memory (IE) ---
  18510. --- END Application Phase ---
  18511. --- Output Phase ---
  18512. ENV: Agent did: predict-no for direction U in state State-A
  18513. In State-A moving U
  18514. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18515. predict error 0
  18516. dir: dir isU
  18517. --- END Output Phase ---
  18518. /|\--- Input Phase ---
  18519. =>WM: (14521: I2 ^dir U)
  18520. =>WM: (14520: I2 ^reward 1)
  18521. =>WM: (14519: I2 ^see 0)
  18522. =>WM: (14518: N1030 ^status complete)
  18523. <=WM: (14508: I2 ^dir U)
  18524. <=WM: (14507: I2 ^reward 1)
  18525. <=WM: (14506: I2 ^see 0)
  18526. =>WM: (14522: I2 ^level-1 L1-root)
  18527. <=WM: (14509: I2 ^level-1 L1-root)
  18528. --- END Input Phase ---
  18529. --- Proposal Phase ---
  18530. --- Inner Elaboration Phase, active level 1 (S1) ---
  18531. Firing elaborate*copy-see-to-output-link
  18532. -->
  18533. (I3 ^see 0 +)
  18534. Firing elaborate*reward*based*on*reward
  18535. -->
  18536. (R1034 ^value 1 +)
  18537. (R1 ^reward R1034 +)
  18538. Firing propose*predict-yes
  18539. -->
  18540. (O2061 ^name predict-yes +)
  18541. (S1 ^operator O2061 +)
  18542. Firing propose*predict-no
  18543. -->
  18544. (O2062 ^name predict-no +)
  18545. (S1 ^operator O2062 +)
  18546. Firing rl*prefer*rvt*predict-no*H0*2
  18547. -->
  18548. (S1 ^operator O2060 = 1.)
  18549. Firing rl*prefer*rvt*predict-yes*H0*1
  18550. -->
  18551. (S1 ^operator O2059 = 0.)
  18552. Firing prefer*rvt*predict-yes*H0
  18553. -->
  18554. Firing prefer*rvt*predict-no*H0
  18555. -->
  18556. Firing elaborate*copy-dir-to-output-link
  18557. -->
  18558. (I3 ^dir U +)
  18559. inner elaboration loop at bottom goal.
  18560. Retracting elaborate*copy-see-to-output-link
  18561. -->
  18562. (I3 ^see 0 +)
  18563. Retracting propose*predict-no
  18564. -->
  18565. (O2060 ^name predict-no +)
  18566. (S1 ^operator O2060 +)
  18567. Retracting propose*predict-yes
  18568. -->
  18569. (O2059 ^name predict-yes +)
  18570. (S1 ^operator O2059 +)
  18571. Retracting elaborate*reward*based*on*reward
  18572. -->
  18573. (R1033 ^value 1 +)
  18574. (R1 ^reward R1033 +)
  18575. Retracting elaborate*copy-dir-to-output-link
  18576. -->
  18577. (I3 ^dir U +)
  18578. Retracting rl*prefer*rvt*predict-no*H0*2
  18579. -->
  18580. (S1 ^operator O2060 = 1.)
  18581. Retracting rl*prefer*rvt*predict-yes*H0*1
  18582. -->
  18583. (S1 ^operator O2059 = 0.)
  18584. =>WM: (14528: S1 ^operator O2062 +)
  18585. =>WM: (14527: S1 ^operator O2061 +)
  18586. =>WM: (14526: O2062 ^name predict-no)
  18587. =>WM: (14525: O2061 ^name predict-yes)
  18588. =>WM: (14524: R1034 ^value 1)
  18589. =>WM: (14523: R1 ^reward R1034)
  18590. <=WM: (14514: S1 ^operator O2059 +)
  18591. <=WM: (14515: S1 ^operator O2060 +)
  18592. <=WM: (14516: S1 ^operator O2060)
  18593. <=WM: (14510: R1 ^reward R1033)
  18594. <=WM: (14513: O2060 ^name predict-no)
  18595. <=WM: (14512: O2059 ^name predict-yes)
  18596. <=WM: (14511: R1033 ^value 1)
  18597. --- Inner Elaboration Phase, active level 1 (S1) ---
  18598. Firing prefer*rvt*predict-yes*H0
  18599. -->
  18600. Firing rl*prefer*rvt*predict-yes*H0*1
  18601. -->
  18602. (S1 ^operator O2061 = 0.)
  18603. Firing prefer*rvt*predict-no*H0
  18604. -->
  18605. Firing rl*prefer*rvt*predict-no*H0*2
  18606. -->
  18607. (S1 ^operator O2062 = 1.)
  18608. inner elaboration loop at bottom goal.
  18609. Retracting rl*prefer*rvt*predict-no*H0*2
  18610. -->
  18611. (S1 ^operator O2060 = 1.)
  18612. Retracting rl*prefer*rvt*predict-yes*H0*1
  18613. -->
  18614. (S1 ^operator O2059 = 0.)
  18615. --- END Proposal Phase ---
  18616. --- Decision Phase ---
  18617. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18618. =>WM: (14529: S1 ^operator O2062)
  18619. 1031: O: O2062 (predict-no)
  18620. --- END Decision Phase ---
  18621. --- Application Phase ---
  18622. --- Firing Productions (PE) For State At Depth 1 ---
  18623. --- Inner Elaboration Phase, active level 1 (S1) ---
  18624. Firing apply*operator
  18625. -->
  18626. (I3 ^predict-no N1031 + :O )
  18627. Firing apply*operator*complete
  18628. -->
  18629. (I3 ^predict-no N1030 - :O )
  18630. inner elaboration loop at bottom goal.
  18631. --- Change Working Memory (PE) ---
  18632. =>WM: (14530: I3 ^predict-no N1031)
  18633. <=WM: (14518: N1030 ^status complete)
  18634. <=WM: (14517: I3 ^predict-no N1030)
  18635. --- Firing Productions (IE) For State At Depth 1 ---
  18636. --- Inner Elaboration Phase, active level 1 (S1) ---
  18637. Firing monitor*world
  18638. -->
  18639. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18640. --- Change Working Memory (IE) ---
  18641. --- END Application Phase ---
  18642. --- Output Phase ---
  18643. ENV: Agent did: predict-no for direction U in state State-A
  18644. In State-A moving U
  18645. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18646. predict error 0
  18647. dir: dir isL
  18648. --- END Output Phase ---
  18649. ---- Input Phase ---
  18650. =>WM: (14534: I2 ^dir L)
  18651. =>WM: (14533: I2 ^reward 1)
  18652. =>WM: (14532: I2 ^see 0)
  18653. =>WM: (14531: N1031 ^status complete)
  18654. <=WM: (14521: I2 ^dir U)
  18655. <=WM: (14520: I2 ^reward 1)
  18656. <=WM: (14519: I2 ^see 0)
  18657. =>WM: (14535: I2 ^level-1 L1-root)
  18658. <=WM: (14522: I2 ^level-1 L1-root)
  18659. --- END Input Phase ---
  18660. --- Proposal Phase ---
  18661. --- Inner Elaboration Phase, active level 1 (S1) ---
  18662. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  18663. -->
  18664. (S1 ^operator O2061 = -0.181727099742844)
  18665. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  18666. -->
  18667. Firing elaborate*copy-see-to-output-link
  18668. -->
  18669. (I3 ^see 0 +)
  18670. Firing elaborate*reward*based*on*reward
  18671. -->
  18672. (R1035 ^value 1 +)
  18673. (R1 ^reward R1035 +)
  18674. Firing propose*predict-yes
  18675. -->
  18676. (O2063 ^name predict-yes +)
  18677. (S1 ^operator O2063 +)
  18678. Firing propose*predict-no
  18679. -->
  18680. (O2064 ^name predict-no +)
  18681. (S1 ^operator O2064 +)
  18682. Firing rl*prefer*rvt*predict-no*H0*6
  18683. -->
  18684. (S1 ^operator O2062 = 0.9034761957271857)
  18685. Firing rl*prefer*rvt*predict-yes*H0*5
  18686. -->
  18687. (S1 ^operator O2061 = 0.2640108751521542)
  18688. Firing prefer*rvt*predict-yes*H0
  18689. -->
  18690. Firing prefer*rvt*predict-no*H0
  18691. -->
  18692. Firing elaborate*copy-dir-to-output-link
  18693. -->
  18694. (I3 ^dir L +)
  18695. inner elaboration loop at bottom goal.
  18696. Retracting elaborate*copy-see-to-output-link
  18697. -->
  18698. (I3 ^see 0 +)
  18699. Retracting propose*predict-no
  18700. -->
  18701. (O2062 ^name predict-no +)
  18702. (S1 ^operator O2062 +)
  18703. Retracting propose*predict-yes
  18704. -->
  18705. (O2061 ^name predict-yes +)
  18706. (S1 ^operator O2061 +)
  18707. Retracting elaborate*reward*based*on*reward
  18708. -->
  18709. (R1034 ^value 1 +)
  18710. (R1 ^reward R1034 +)
  18711. Retracting elaborate*copy-dir-to-output-link
  18712. -->
  18713. (I3 ^dir U +)
  18714. Retracting rl*prefer*rvt*predict-no*H0*2
  18715. -->
  18716. (S1 ^operator O2062 = 1.)
  18717. Retracting rl*prefer*rvt*predict-yes*H0*1
  18718. -->
  18719. (S1 ^operator O2061 = 0.)
  18720. =>WM: (14542: S1 ^operator O2064 +)
  18721. =>WM: (14541: S1 ^operator O2063 +)
  18722. =>WM: (14540: I3 ^dir L)
  18723. =>WM: (14539: O2064 ^name predict-no)
  18724. =>WM: (14538: O2063 ^name predict-yes)
  18725. =>WM: (14537: R1035 ^value 1)
  18726. =>WM: (14536: R1 ^reward R1035)
  18727. <=WM: (14527: S1 ^operator O2061 +)
  18728. <=WM: (14528: S1 ^operator O2062 +)
  18729. <=WM: (14529: S1 ^operator O2062)
  18730. <=WM: (14486: I3 ^dir U)
  18731. <=WM: (14523: R1 ^reward R1034)
  18732. <=WM: (14526: O2062 ^name predict-no)
  18733. <=WM: (14525: O2061 ^name predict-yes)
  18734. <=WM: (14524: R1034 ^value 1)
  18735. --- Inner Elaboration Phase, active level 1 (S1) ---
  18736. Firing prefer*rvt*predict-yes*H0
  18737. -->
  18738. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  18739. -->
  18740. (S1 ^operator O2063 = -0.181727099742844)
  18741. Firing rl*prefer*rvt*predict-yes*H0*5
  18742. -->
  18743. (S1 ^operator O2063 = 0.2640108751521542)
  18744. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  18745. -->
  18746. Firing prefer*rvt*predict-no*H0
  18747. -->
  18748. Firing rl*prefer*rvt*predict-no*H0*6
  18749. -->
  18750. (S1 ^operator O2064 = 0.9034761957271857)
  18751. inner elaboration loop at bottom goal.
  18752. Retracting rl*prefer*rvt*predict-no*H0*6
  18753. -->
  18754. (S1 ^operator O2062 = 0.9034761957271857)
  18755. Retracting rl*prefer*rvt*predict-yes*H0*5
  18756. -->
  18757. (S1 ^operator O2061 = 0.2640108751521542)
  18758. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  18759. -->
  18760. (S1 ^operator O2061 = -0.181727099742844)
  18761. --- END Proposal Phase ---
  18762. --- Decision Phase ---
  18763. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18764. =>WM: (14543: S1 ^operator O2064)
  18765. 1032: O: O2064 (predict-no)
  18766. --- END Decision Phase ---
  18767. --- Application Phase ---
  18768. --- Firing Productions (PE) For State At Depth 1 ---
  18769. --- Inner Elaboration Phase, active level 1 (S1) ---
  18770. Firing apply*operator
  18771. -->
  18772. (I3 ^predict-no N1032 + :O )
  18773. Firing apply*operator*complete
  18774. -->
  18775. (I3 ^predict-no N1031 - :O )
  18776. inner elaboration loop at bottom goal.
  18777. --- Change Working Memory (PE) ---
  18778. =>WM: (14544: I3 ^predict-no N1032)
  18779. <=WM: (14531: N1031 ^status complete)
  18780. <=WM: (14530: I3 ^predict-no N1031)
  18781. --- Firing Productions (IE) For State At Depth 1 ---
  18782. --- Inner Elaboration Phase, active level 1 (S1) ---
  18783. Firing monitor*world
  18784. -->
  18785. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18786. --- Change Working Memory (IE) ---
  18787. --- END Application Phase ---
  18788. --- Output Phase ---
  18789. ENV: Agent did: predict-no for direction L in state State-A
  18790. In State-A moving L
  18791. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18792. predict error 0
  18793. dir: dir isR
  18794. --- END Output Phase ---
  18795. /|\--- Input Phase ---
  18796. =>WM: (14548: I2 ^dir R)
  18797. =>WM: (14547: I2 ^reward 1)
  18798. =>WM: (14546: I2 ^see 0)
  18799. =>WM: (14545: N1032 ^status complete)
  18800. <=WM: (14534: I2 ^dir L)
  18801. <=WM: (14533: I2 ^reward 1)
  18802. <=WM: (14532: I2 ^see 0)
  18803. =>WM: (14549: I2 ^level-1 L0-root)
  18804. <=WM: (14535: I2 ^level-1 L1-root)
  18805. --- END Input Phase ---
  18806. --- Proposal Phase ---
  18807. --- Inner Elaboration Phase, active level 1 (S1) ---
  18808. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  18809. -->
  18810. (S1 ^operator O2064 = -0.2817060109291377)
  18811. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  18812. -->
  18813. (S1 ^operator O2063 = 0.6623309159241351)
  18814. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18815. -->
  18816. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18817. -->
  18818. Firing elaborate*copy-see-to-output-link
  18819. -->
  18820. (I3 ^see 0 +)
  18821. Firing elaborate*reward*based*on*reward
  18822. -->
  18823. (R1036 ^value 1 +)
  18824. (R1 ^reward R1036 +)
  18825. Firing propose*predict-yes
  18826. -->
  18827. (O2065 ^name predict-yes +)
  18828. (S1 ^operator O2065 +)
  18829. Firing propose*predict-no
  18830. -->
  18831. (O2066 ^name predict-no +)
  18832. (S1 ^operator O2066 +)
  18833. Firing rl*prefer*rvt*predict-no*H0*4
  18834. -->
  18835. (S1 ^operator O2064 = 0.3397758518173152)
  18836. Firing rl*prefer*rvt*predict-yes*H0*3
  18837. -->
  18838. (S1 ^operator O2063 = 0.3377041098150635)
  18839. Firing prefer*rvt*predict-yes*H0
  18840. -->
  18841. Firing prefer*rvt*predict-no*H0
  18842. -->
  18843. Firing elaborate*copy-dir-to-output-link
  18844. -->
  18845. (I3 ^dir R +)
  18846. inner elaboration loop at bottom goal.
  18847. Retracting elaborate*copy-see-to-output-link
  18848. -->
  18849. (I3 ^see 0 +)
  18850. Retracting propose*predict-no
  18851. -->
  18852. (O2064 ^name predict-no +)
  18853. (S1 ^operator O2064 +)
  18854. Retracting propose*predict-yes
  18855. -->
  18856. (O2063 ^name predict-yes +)
  18857. (S1 ^operator O2063 +)
  18858. Retracting elaborate*reward*based*on*reward
  18859. -->
  18860. (R1035 ^value 1 +)
  18861. (R1 ^reward R1035 +)
  18862. Retracting elaborate*copy-dir-to-output-link
  18863. -->
  18864. (I3 ^dir L +)
  18865. Retracting rl*prefer*rvt*predict-no*H0*6
  18866. -->
  18867. (S1 ^operator O2064 = 0.9034761957271857)
  18868. Retracting rl*prefer*rvt*predict-yes*H0*5
  18869. -->
  18870. (S1 ^operator O2063 = 0.2640108751521542)
  18871. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  18872. -->
  18873. (S1 ^operator O2063 = -0.181727099742844)
  18874. =>WM: (14556: S1 ^operator O2066 +)
  18875. =>WM: (14555: S1 ^operator O2065 +)
  18876. =>WM: (14554: I3 ^dir R)
  18877. =>WM: (14553: O2066 ^name predict-no)
  18878. =>WM: (14552: O2065 ^name predict-yes)
  18879. =>WM: (14551: R1036 ^value 1)
  18880. =>WM: (14550: R1 ^reward R1036)
  18881. <=WM: (14541: S1 ^operator O2063 +)
  18882. <=WM: (14542: S1 ^operator O2064 +)
  18883. <=WM: (14543: S1 ^operator O2064)
  18884. <=WM: (14540: I3 ^dir L)
  18885. <=WM: (14536: R1 ^reward R1035)
  18886. <=WM: (14539: O2064 ^name predict-no)
  18887. <=WM: (14538: O2063 ^name predict-yes)
  18888. <=WM: (14537: R1035 ^value 1)
  18889. --- Inner Elaboration Phase, active level 1 (S1) ---
  18890. Firing prefer*rvt*predict-yes*H0
  18891. -->
  18892. Firing rl*prefer*rvt*predict-yes*H0*3
  18893. -->
  18894. (S1 ^operator O2065 = 0.3377041098150635)
  18895. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  18896. -->
  18897. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  18898. -->
  18899. (S1 ^operator O2065 = 0.6623309159241351)
  18900. Firing prefer*rvt*predict-no*H0
  18901. -->
  18902. Firing rl*prefer*rvt*predict-no*H0*4
  18903. -->
  18904. (S1 ^operator O2066 = 0.3397758518173152)
  18905. Firing prefer*rvt*predict-no*H0*4*v1*H1
  18906. -->
  18907. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  18908. -->
  18909. (S1 ^operator O2066 = -0.2817060109291377)
  18910. inner elaboration loop at bottom goal.
  18911. Retracting rl*prefer*rvt*predict-no*H0*4
  18912. -->
  18913. (S1 ^operator O2064 = 0.3397758518173152)
  18914. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  18915. -->
  18916. (S1 ^operator O2064 = -0.2817060109291377)
  18917. Retracting rl*prefer*rvt*predict-yes*H0*3
  18918. -->
  18919. (S1 ^operator O2063 = 0.3377041098150635)
  18920. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  18921. -->
  18922. (S1 ^operator O2063 = 0.6623309159241351)
  18923. --- END Proposal Phase ---
  18924. --- Decision Phase ---
  18925. RL update rl*prefer*rvt*predict-no*H0*6 0.903476 0 0.903476 -> 0.919448 0 0.919448(R,m,v=1,0.903226,0.0879765)
  18926. =>WM: (14557: S1 ^operator O2065)
  18927. 1033: O: O2065 (predict-yes)
  18928. --- END Decision Phase ---
  18929. --- Application Phase ---
  18930. --- Firing Productions (PE) For State At Depth 1 ---
  18931. --- Inner Elaboration Phase, active level 1 (S1) ---
  18932. Firing apply*operator
  18933. -->
  18934. (I3 ^predict-yes N1033 + :O )
  18935. Firing apply*operator*complete
  18936. -->
  18937. (I3 ^predict-no N1032 - :O )
  18938. inner elaboration loop at bottom goal.
  18939. --- Change Working Memory (PE) ---
  18940. =>WM: (14558: I3 ^predict-yes N1033)
  18941. <=WM: (14545: N1032 ^status complete)
  18942. <=WM: (14544: I3 ^predict-no N1032)
  18943. --- Firing Productions (IE) For State At Depth 1 ---
  18944. --- Inner Elaboration Phase, active level 1 (S1) ---
  18945. Firing monitor*world
  18946. -->
  18947. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18948. --- Change Working Memory (IE) ---
  18949. --- END Application Phase ---
  18950. --- Output Phase ---
  18951. ENV: Agent did: predict-yes for direction R in state State-A
  18952. In State-A moving R
  18953. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18954. predict error 0
  18955. dir: dir isU
  18956. --- END Output Phase ---
  18957. -/|--- Input Phase ---
  18958. =>WM: (14562: I2 ^dir U)
  18959. =>WM: (14561: I2 ^reward 1)
  18960. =>WM: (14560: I2 ^see 1)
  18961. =>WM: (14559: N1033 ^status complete)
  18962. <=WM: (14548: I2 ^dir R)
  18963. <=WM: (14547: I2 ^reward 1)
  18964. <=WM: (14546: I2 ^see 0)
  18965. =>WM: (14563: I2 ^level-1 R1-root)
  18966. <=WM: (14549: I2 ^level-1 L0-root)
  18967. --- END Input Phase ---
  18968. --- Proposal Phase ---
  18969. --- Inner Elaboration Phase, active level 1 (S1) ---
  18970. Firing elaborate*copy-see-to-output-link
  18971. -->
  18972. (I3 ^see 1 +)
  18973. Firing elaborate*reward*based*on*reward
  18974. -->
  18975. (R1037 ^value 1 +)
  18976. (R1 ^reward R1037 +)
  18977. Firing propose*predict-yes
  18978. -->
  18979. (O2067 ^name predict-yes +)
  18980. (S1 ^operator O2067 +)
  18981. Firing propose*predict-no
  18982. -->
  18983. (O2068 ^name predict-no +)
  18984. (S1 ^operator O2068 +)
  18985. Firing rl*prefer*rvt*predict-no*H0*2
  18986. -->
  18987. (S1 ^operator O2066 = 1.)
  18988. Firing rl*prefer*rvt*predict-yes*H0*1
  18989. -->
  18990. (S1 ^operator O2065 = 0.)
  18991. Firing prefer*rvt*predict-yes*H0
  18992. -->
  18993. Firing prefer*rvt*predict-no*H0
  18994. -->
  18995. Firing elaborate*copy-dir-to-output-link
  18996. -->
  18997. (I3 ^dir U +)
  18998. inner elaboration loop at bottom goal.
  18999. Retracting elaborate*copy-see-to-output-link
  19000. -->
  19001. (I3 ^see 0 +)
  19002. Retracting propose*predict-no
  19003. -->
  19004. (O2066 ^name predict-no +)
  19005. (S1 ^operator O2066 +)
  19006. Retracting propose*predict-yes
  19007. -->
  19008. (O2065 ^name predict-yes +)
  19009. (S1 ^operator O2065 +)
  19010. Retracting elaborate*reward*based*on*reward
  19011. -->
  19012. (R1036 ^value 1 +)
  19013. (R1 ^reward R1036 +)
  19014. Retracting elaborate*copy-dir-to-output-link
  19015. -->
  19016. (I3 ^dir R +)
  19017. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  19018. -->
  19019. (S1 ^operator O2066 = -0.2817060109291377)
  19020. Retracting rl*prefer*rvt*predict-no*H0*4
  19021. -->
  19022. (S1 ^operator O2066 = 0.3397758518173152)
  19023. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  19024. -->
  19025. (S1 ^operator O2065 = 0.6623309159241351)
  19026. Retracting rl*prefer*rvt*predict-yes*H0*3
  19027. -->
  19028. (S1 ^operator O2065 = 0.3377041098150635)
  19029. =>WM: (14571: S1 ^operator O2068 +)
  19030. =>WM: (14570: S1 ^operator O2067 +)
  19031. =>WM: (14569: I3 ^dir U)
  19032. =>WM: (14568: O2068 ^name predict-no)
  19033. =>WM: (14567: O2067 ^name predict-yes)
  19034. =>WM: (14566: R1037 ^value 1)
  19035. =>WM: (14565: R1 ^reward R1037)
  19036. =>WM: (14564: I3 ^see 1)
  19037. <=WM: (14555: S1 ^operator O2065 +)
  19038. <=WM: (14557: S1 ^operator O2065)
  19039. <=WM: (14556: S1 ^operator O2066 +)
  19040. <=WM: (14554: I3 ^dir R)
  19041. <=WM: (14550: R1 ^reward R1036)
  19042. <=WM: (14496: I3 ^see 0)
  19043. <=WM: (14553: O2066 ^name predict-no)
  19044. <=WM: (14552: O2065 ^name predict-yes)
  19045. <=WM: (14551: R1036 ^value 1)
  19046. --- Inner Elaboration Phase, active level 1 (S1) ---
  19047. Firing prefer*rvt*predict-yes*H0
  19048. -->
  19049. Firing rl*prefer*rvt*predict-yes*H0*1
  19050. -->
  19051. (S1 ^operator O2067 = 0.)
  19052. Firing prefer*rvt*predict-no*H0
  19053. -->
  19054. Firing rl*prefer*rvt*predict-no*H0*2
  19055. -->
  19056. (S1 ^operator O2068 = 1.)
  19057. inner elaboration loop at bottom goal.
  19058. Retracting rl*prefer*rvt*predict-no*H0*2
  19059. -->
  19060. (S1 ^operator O2066 = 1.)
  19061. Retracting rl*prefer*rvt*predict-yes*H0*1
  19062. -->
  19063. (S1 ^operator O2065 = 0.)
  19064. --- END Proposal Phase ---
  19065. --- Decision Phase ---
  19066. RL update rl*prefer*rvt*predict-yes*H0*3 0.590103 -0.252399 0.337704 -> 0.5901 -0.252399 0.337701(R,m,v=1,0.901734,0.0891249)
  19067. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409937 0.252394 0.662331 -> 0.409933 0.252394 0.662328(R,m,v=1,1,0)
  19068. =>WM: (14572: S1 ^operator O2068)
  19069. 1034: O: O2068 (predict-no)
  19070. --- END Decision Phase ---
  19071. --- Application Phase ---
  19072. --- Firing Productions (PE) For State At Depth 1 ---
  19073. --- Inner Elaboration Phase, active level 1 (S1) ---
  19074. Firing apply*operator
  19075. -->
  19076. (I3 ^predict-no N1034 + :O )
  19077. Firing apply*operator*complete
  19078. -->
  19079. (I3 ^predict-yes N1033 - :O )
  19080. inner elaboration loop at bottom goal.
  19081. --- Change Working Memory (PE) ---
  19082. =>WM: (14573: I3 ^predict-no N1034)
  19083. <=WM: (14559: N1033 ^status complete)
  19084. <=WM: (14558: I3 ^predict-yes N1033)
  19085. --- Firing Productions (IE) For State At Depth 1 ---
  19086. --- Inner Elaboration Phase, active level 1 (S1) ---
  19087. Firing monitor*world
  19088. -->
  19089. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19090. --- Change Working Memory (IE) ---
  19091. --- END Application Phase ---
  19092. --- Output Phase ---
  19093. ENV: Agent did: predict-no for direction U in state State-B
  19094. In State-B moving U
  19095. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19096. predict error 0
  19097. dir: dir isR
  19098. --- END Output Phase ---
  19099. \-/--- Input Phase ---
  19100. =>WM: (14577: I2 ^dir R)
  19101. =>WM: (14576: I2 ^reward 1)
  19102. =>WM: (14575: I2 ^see 0)
  19103. =>WM: (14574: N1034 ^status complete)
  19104. <=WM: (14562: I2 ^dir U)
  19105. <=WM: (14561: I2 ^reward 1)
  19106. <=WM: (14560: I2 ^see 1)
  19107. =>WM: (14578: I2 ^level-1 R1-root)
  19108. <=WM: (14563: I2 ^level-1 R1-root)
  19109. --- END Input Phase ---
  19110. --- Proposal Phase ---
  19111. --- Inner Elaboration Phase, active level 1 (S1) ---
  19112. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  19113. -->
  19114. (S1 ^operator O2067 = -0.1070236389116304)
  19115. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  19116. -->
  19117. (S1 ^operator O2068 = 0.6602397636180422)
  19118. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19119. -->
  19120. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19121. -->
  19122. Firing elaborate*copy-see-to-output-link
  19123. -->
  19124. (I3 ^see 0 +)
  19125. Firing elaborate*reward*based*on*reward
  19126. -->
  19127. (R1038 ^value 1 +)
  19128. (R1 ^reward R1038 +)
  19129. Firing propose*predict-yes
  19130. -->
  19131. (O2069 ^name predict-yes +)
  19132. (S1 ^operator O2069 +)
  19133. Firing propose*predict-no
  19134. -->
  19135. (O2070 ^name predict-no +)
  19136. (S1 ^operator O2070 +)
  19137. Firing rl*prefer*rvt*predict-no*H0*4
  19138. -->
  19139. (S1 ^operator O2068 = 0.3397758518173152)
  19140. Firing rl*prefer*rvt*predict-yes*H0*3
  19141. -->
  19142. (S1 ^operator O2067 = 0.337701263717275)
  19143. Firing prefer*rvt*predict-yes*H0
  19144. -->
  19145. Firing prefer*rvt*predict-no*H0
  19146. -->
  19147. Firing elaborate*copy-dir-to-output-link
  19148. -->
  19149. (I3 ^dir R +)
  19150. inner elaboration loop at bottom goal.
  19151. Retracting elaborate*copy-see-to-output-link
  19152. -->
  19153. (I3 ^see 1 +)
  19154. Retracting propose*predict-no
  19155. -->
  19156. (O2068 ^name predict-no +)
  19157. (S1 ^operator O2068 +)
  19158. Retracting propose*predict-yes
  19159. -->
  19160. (O2067 ^name predict-yes +)
  19161. (S1 ^operator O2067 +)
  19162. Retracting elaborate*reward*based*on*reward
  19163. -->
  19164. (R1037 ^value 1 +)
  19165. (R1 ^reward R1037 +)
  19166. Retracting elaborate*copy-dir-to-output-link
  19167. -->
  19168. (I3 ^dir U +)
  19169. Retracting rl*prefer*rvt*predict-no*H0*2
  19170. -->
  19171. (S1 ^operator O2068 = 1.)
  19172. Retracting rl*prefer*rvt*predict-yes*H0*1
  19173. -->
  19174. (S1 ^operator O2067 = 0.)
  19175. =>WM: (14586: S1 ^operator O2070 +)
  19176. =>WM: (14585: S1 ^operator O2069 +)
  19177. =>WM: (14584: I3 ^dir R)
  19178. =>WM: (14583: O2070 ^name predict-no)
  19179. =>WM: (14582: O2069 ^name predict-yes)
  19180. =>WM: (14581: R1038 ^value 1)
  19181. =>WM: (14580: R1 ^reward R1038)
  19182. =>WM: (14579: I3 ^see 0)
  19183. <=WM: (14570: S1 ^operator O2067 +)
  19184. <=WM: (14571: S1 ^operator O2068 +)
  19185. <=WM: (14572: S1 ^operator O2068)
  19186. <=WM: (14569: I3 ^dir U)
  19187. <=WM: (14565: R1 ^reward R1037)
  19188. <=WM: (14564: I3 ^see 1)
  19189. <=WM: (14568: O2068 ^name predict-no)
  19190. <=WM: (14567: O2067 ^name predict-yes)
  19191. <=WM: (14566: R1037 ^value 1)
  19192. --- Inner Elaboration Phase, active level 1 (S1) ---
  19193. Firing prefer*rvt*predict-yes*H0
  19194. -->
  19195. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  19196. -->
  19197. (S1 ^operator O2069 = -0.1070236389116304)
  19198. Firing rl*prefer*rvt*predict-yes*H0*3
  19199. -->
  19200. (S1 ^operator O2069 = 0.337701263717275)
  19201. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19202. -->
  19203. Firing prefer*rvt*predict-no*H0
  19204. -->
  19205. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  19206. -->
  19207. (S1 ^operator O2070 = 0.6602397636180422)
  19208. Firing rl*prefer*rvt*predict-no*H0*4
  19209. -->
  19210. (S1 ^operator O2070 = 0.3397758518173152)
  19211. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19212. -->
  19213. inner elaboration loop at bottom goal.
  19214. Retracting rl*prefer*rvt*predict-no*H0*4
  19215. -->
  19216. (S1 ^operator O2068 = 0.3397758518173152)
  19217. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  19218. -->
  19219. (S1 ^operator O2068 = 0.6602397636180422)
  19220. Retracting rl*prefer*rvt*predict-yes*H0*3
  19221. -->
  19222. (S1 ^operator O2067 = 0.337701263717275)
  19223. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  19224. -->
  19225. (S1 ^operator O2067 = -0.1070236389116304)
  19226. --- END Proposal Phase ---
  19227. --- Decision Phase ---
  19228. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19229. =>WM: (14587: S1 ^operator O2070)
  19230. 1035: O: O2070 (predict-no)
  19231. --- END Decision Phase ---
  19232. --- Application Phase ---
  19233. --- Firing Productions (PE) For State At Depth 1 ---
  19234. --- Inner Elaboration Phase, active level 1 (S1) ---
  19235. Firing apply*operator
  19236. -->
  19237. (I3 ^predict-no N1035 + :O )
  19238. Firing apply*operator*complete
  19239. -->
  19240. (I3 ^predict-no N1034 - :O )
  19241. inner elaboration loop at bottom goal.
  19242. --- Change Working Memory (PE) ---
  19243. =>WM: (14588: I3 ^predict-no N1035)
  19244. <=WM: (14574: N1034 ^status complete)
  19245. <=WM: (14573: I3 ^predict-no N1034)
  19246. --- Firing Productions (IE) For State At Depth 1 ---
  19247. --- Inner Elaboration Phase, active level 1 (S1) ---
  19248. Firing monitor*world
  19249. -->
  19250. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19251. --- Change Working Memory (IE) ---
  19252. --- END Application Phase ---
  19253. --- Output Phase ---
  19254. ENV: Agent did: predict-no for direction R in state State-B
  19255. In State-B moving R
  19256. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19257. predict error 0
  19258. dir: dir isR
  19259. --- END Output Phase ---
  19260. |\--- Input Phase ---
  19261. =>WM: (14592: I2 ^dir R)
  19262. =>WM: (14591: I2 ^reward 1)
  19263. =>WM: (14590: I2 ^see 0)
  19264. =>WM: (14589: N1035 ^status complete)
  19265. <=WM: (14577: I2 ^dir R)
  19266. <=WM: (14576: I2 ^reward 1)
  19267. <=WM: (14575: I2 ^see 0)
  19268. =>WM: (14593: I2 ^level-1 R0-root)
  19269. <=WM: (14578: I2 ^level-1 R1-root)
  19270. --- END Input Phase ---
  19271. --- Proposal Phase ---
  19272. --- Inner Elaboration Phase, active level 1 (S1) ---
  19273. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19274. -->
  19275. (S1 ^operator O2070 = 0.6601667168012377)
  19276. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19277. -->
  19278. (S1 ^operator O2069 = -0.1028953566115423)
  19279. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19280. -->
  19281. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19282. -->
  19283. Firing elaborate*copy-see-to-output-link
  19284. -->
  19285. (I3 ^see 0 +)
  19286. Firing elaborate*reward*based*on*reward
  19287. -->
  19288. (R1039 ^value 1 +)
  19289. (R1 ^reward R1039 +)
  19290. Firing propose*predict-yes
  19291. -->
  19292. (O2071 ^name predict-yes +)
  19293. (S1 ^operator O2071 +)
  19294. Firing propose*predict-no
  19295. -->
  19296. (O2072 ^name predict-no +)
  19297. (S1 ^operator O2072 +)
  19298. Firing rl*prefer*rvt*predict-no*H0*4
  19299. -->
  19300. (S1 ^operator O2070 = 0.3397758518173152)
  19301. Firing rl*prefer*rvt*predict-yes*H0*3
  19302. -->
  19303. (S1 ^operator O2069 = 0.337701263717275)
  19304. Firing prefer*rvt*predict-yes*H0
  19305. -->
  19306. Firing prefer*rvt*predict-no*H0
  19307. -->
  19308. Firing elaborate*copy-dir-to-output-link
  19309. -->
  19310. (I3 ^dir R +)
  19311. inner elaboration loop at bottom goal.
  19312. Retracting elaborate*copy-see-to-output-link
  19313. -->
  19314. (I3 ^see 0 +)
  19315. Retracting propose*predict-no
  19316. -->
  19317. (O2070 ^name predict-no +)
  19318. (S1 ^operator O2070 +)
  19319. Retracting propose*predict-yes
  19320. -->
  19321. (O2069 ^name predict-yes +)
  19322. (S1 ^operator O2069 +)
  19323. Retracting elaborate*reward*based*on*reward
  19324. -->
  19325. (R1038 ^value 1 +)
  19326. (R1 ^reward R1038 +)
  19327. Retracting elaborate*copy-dir-to-output-link
  19328. -->
  19329. (I3 ^dir R +)
  19330. Retracting rl*prefer*rvt*predict-no*H0*4
  19331. -->
  19332. (S1 ^operator O2070 = 0.3397758518173152)
  19333. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  19334. -->
  19335. (S1 ^operator O2070 = 0.6602397636180422)
  19336. Retracting rl*prefer*rvt*predict-yes*H0*3
  19337. -->
  19338. (S1 ^operator O2069 = 0.337701263717275)
  19339. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  19340. -->
  19341. (S1 ^operator O2069 = -0.1070236389116304)
  19342. =>WM: (14599: S1 ^operator O2072 +)
  19343. =>WM: (14598: S1 ^operator O2071 +)
  19344. =>WM: (14597: O2072 ^name predict-no)
  19345. =>WM: (14596: O2071 ^name predict-yes)
  19346. =>WM: (14595: R1039 ^value 1)
  19347. =>WM: (14594: R1 ^reward R1039)
  19348. <=WM: (14585: S1 ^operator O2069 +)
  19349. <=WM: (14586: S1 ^operator O2070 +)
  19350. <=WM: (14587: S1 ^operator O2070)
  19351. <=WM: (14580: R1 ^reward R1038)
  19352. <=WM: (14583: O2070 ^name predict-no)
  19353. <=WM: (14582: O2069 ^name predict-yes)
  19354. <=WM: (14581: R1038 ^value 1)
  19355. --- Inner Elaboration Phase, active level 1 (S1) ---
  19356. Firing prefer*rvt*predict-yes*H0
  19357. -->
  19358. Firing rl*prefer*rvt*predict-yes*H0*3
  19359. -->
  19360. (S1 ^operator O2071 = 0.337701263717275)
  19361. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19362. -->
  19363. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19364. -->
  19365. (S1 ^operator O2071 = -0.1028953566115423)
  19366. Firing prefer*rvt*predict-no*H0
  19367. -->
  19368. Firing rl*prefer*rvt*predict-no*H0*4
  19369. -->
  19370. (S1 ^operator O2072 = 0.3397758518173152)
  19371. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19372. -->
  19373. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19374. -->
  19375. (S1 ^operator O2072 = 0.6601667168012377)
  19376. inner elaboration loop at bottom goal.
  19377. Retracting rl*prefer*rvt*predict-no*H0*4
  19378. -->
  19379. (S1 ^operator O2070 = 0.3397758518173152)
  19380. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19381. -->
  19382. (S1 ^operator O2070 = 0.6601667168012377)
  19383. Retracting rl*prefer*rvt*predict-yes*H0*3
  19384. -->
  19385. (S1 ^operator O2069 = 0.337701263717275)
  19386. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19387. -->
  19388. (S1 ^operator O2069 = -0.1028953566115423)
  19389. --- END Proposal Phase ---
  19390. --- Decision Phase ---
  19391. RL update rl*prefer*rvt*predict-no*H0*4 0.57026 -0.230484 0.339776 -> 0.570258 -0.230484 0.339775(R,m,v=1,0.878613,0.107272)
  19392. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429757 0.230483 0.66024 -> 0.429755 0.230483 0.660238(R,m,v=1,1,0)
  19393. =>WM: (14600: S1 ^operator O2072)
  19394. 1036: O: O2072 (predict-no)
  19395. --- END Decision Phase ---
  19396. --- Application Phase ---
  19397. --- Firing Productions (PE) For State At Depth 1 ---
  19398. --- Inner Elaboration Phase, active level 1 (S1) ---
  19399. Firing apply*operator
  19400. -->
  19401. (I3 ^predict-no N1036 + :O )
  19402. Firing apply*operator*complete
  19403. -->
  19404. (I3 ^predict-no N1035 - :O )
  19405. inner elaboration loop at bottom goal.
  19406. --- Change Working Memory (PE) ---
  19407. =>WM: (14601: I3 ^predict-no N1036)
  19408. <=WM: (14589: N1035 ^status complete)
  19409. <=WM: (14588: I3 ^predict-no N1035)
  19410. --- Firing Productions (IE) For State At Depth 1 ---
  19411. --- Inner Elaboration Phase, active level 1 (S1) ---
  19412. Firing monitor*world
  19413. -->
  19414. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19415. --- Change Working Memory (IE) ---
  19416. --- END Application Phase ---
  19417. --- Output Phase ---
  19418. ENV: Agent did: predict-no for direction R in state State-B
  19419. In State-B moving R
  19420. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19421. predict error 0
  19422. dir: dir isU
  19423. --- END Output Phase ---
  19424. -/|--- Input Phase ---
  19425. =>WM: (14605: I2 ^dir U)
  19426. =>WM: (14604: I2 ^reward 1)
  19427. =>WM: (14603: I2 ^see 0)
  19428. =>WM: (14602: N1036 ^status complete)
  19429. <=WM: (14592: I2 ^dir R)
  19430. <=WM: (14591: I2 ^reward 1)
  19431. <=WM: (14590: I2 ^see 0)
  19432. =>WM: (14606: I2 ^level-1 R0-root)
  19433. <=WM: (14593: I2 ^level-1 R0-root)
  19434. --- END Input Phase ---
  19435. --- Proposal Phase ---
  19436. --- Inner Elaboration Phase, active level 1 (S1) ---
  19437. Firing elaborate*copy-see-to-output-link
  19438. -->
  19439. (I3 ^see 0 +)
  19440. Firing elaborate*reward*based*on*reward
  19441. -->
  19442. (R1040 ^value 1 +)
  19443. (R1 ^reward R1040 +)
  19444. Firing propose*predict-yes
  19445. -->
  19446. (O2073 ^name predict-yes +)
  19447. (S1 ^operator O2073 +)
  19448. Firing propose*predict-no
  19449. -->
  19450. (O2074 ^name predict-no +)
  19451. (S1 ^operator O2074 +)
  19452. Firing rl*prefer*rvt*predict-no*H0*2
  19453. -->
  19454. (S1 ^operator O2072 = 1.)
  19455. Firing rl*prefer*rvt*predict-yes*H0*1
  19456. -->
  19457. (S1 ^operator O2071 = 0.)
  19458. Firing prefer*rvt*predict-yes*H0
  19459. -->
  19460. Firing prefer*rvt*predict-no*H0
  19461. -->
  19462. Firing elaborate*copy-dir-to-output-link
  19463. -->
  19464. (I3 ^dir U +)
  19465. inner elaboration loop at bottom goal.
  19466. Retracting elaborate*copy-see-to-output-link
  19467. -->
  19468. (I3 ^see 0 +)
  19469. Retracting propose*predict-no
  19470. -->
  19471. (O2072 ^name predict-no +)
  19472. (S1 ^operator O2072 +)
  19473. Retracting propose*predict-yes
  19474. -->
  19475. (O2071 ^name predict-yes +)
  19476. (S1 ^operator O2071 +)
  19477. Retracting elaborate*reward*based*on*reward
  19478. -->
  19479. (R1039 ^value 1 +)
  19480. (R1 ^reward R1039 +)
  19481. Retracting elaborate*copy-dir-to-output-link
  19482. -->
  19483. (I3 ^dir R +)
  19484. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19485. -->
  19486. (S1 ^operator O2072 = 0.6601667168012377)
  19487. Retracting rl*prefer*rvt*predict-no*H0*4
  19488. -->
  19489. (S1 ^operator O2072 = 0.3397745829488472)
  19490. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19491. -->
  19492. (S1 ^operator O2071 = -0.1028953566115423)
  19493. Retracting rl*prefer*rvt*predict-yes*H0*3
  19494. -->
  19495. (S1 ^operator O2071 = 0.337701263717275)
  19496. =>WM: (14613: S1 ^operator O2074 +)
  19497. =>WM: (14612: S1 ^operator O2073 +)
  19498. =>WM: (14611: I3 ^dir U)
  19499. =>WM: (14610: O2074 ^name predict-no)
  19500. =>WM: (14609: O2073 ^name predict-yes)
  19501. =>WM: (14608: R1040 ^value 1)
  19502. =>WM: (14607: R1 ^reward R1040)
  19503. <=WM: (14598: S1 ^operator O2071 +)
  19504. <=WM: (14599: S1 ^operator O2072 +)
  19505. <=WM: (14600: S1 ^operator O2072)
  19506. <=WM: (14584: I3 ^dir R)
  19507. <=WM: (14594: R1 ^reward R1039)
  19508. <=WM: (14597: O2072 ^name predict-no)
  19509. <=WM: (14596: O2071 ^name predict-yes)
  19510. <=WM: (14595: R1039 ^value 1)
  19511. --- Inner Elaboration Phase, active level 1 (S1) ---
  19512. Firing prefer*rvt*predict-yes*H0
  19513. -->
  19514. Firing rl*prefer*rvt*predict-yes*H0*1
  19515. -->
  19516. (S1 ^operator O2073 = 0.)
  19517. Firing prefer*rvt*predict-no*H0
  19518. -->
  19519. Firing rl*prefer*rvt*predict-no*H0*2
  19520. -->
  19521. (S1 ^operator O2074 = 1.)
  19522. inner elaboration loop at bottom goal.
  19523. Retracting rl*prefer*rvt*predict-no*H0*2
  19524. -->
  19525. (S1 ^operator O2072 = 1.)
  19526. Retracting rl*prefer*rvt*predict-yes*H0*1
  19527. -->
  19528. (S1 ^operator O2071 = 0.)
  19529. --- END Proposal Phase ---
  19530. --- Decision Phase ---
  19531. RL update rl*prefer*rvt*predict-no*H0*4 0.570258 -0.230484 0.339775 -> 0.570263 -0.230484 0.339779(R,m,v=1,0.87931,0.106737)
  19532. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.42968 0.230487 0.660167 -> 0.429686 0.230486 0.660172(R,m,v=1,1,0)
  19533. =>WM: (14614: S1 ^operator O2074)
  19534. 1037: O: O2074 (predict-no)
  19535. --- END Decision Phase ---
  19536. --- Application Phase ---
  19537. --- Firing Productions (PE) For State At Depth 1 ---
  19538. --- Inner Elaboration Phase, active level 1 (S1) ---
  19539. Firing apply*operator
  19540. -->
  19541. (I3 ^predict-no N1037 + :O )
  19542. Firing apply*operator*complete
  19543. -->
  19544. (I3 ^predict-no N1036 - :O )
  19545. inner elaboration loop at bottom goal.
  19546. --- Change Working Memory (PE) ---
  19547. =>WM: (14615: I3 ^predict-no N1037)
  19548. <=WM: (14602: N1036 ^status complete)
  19549. <=WM: (14601: I3 ^predict-no N1036)
  19550. --- Firing Productions (IE) For State At Depth 1 ---
  19551. --- Inner Elaboration Phase, active level 1 (S1) ---
  19552. Firing monitor*world
  19553. -->
  19554. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19555. --- Change Working Memory (IE) ---
  19556. --- END Application Phase ---
  19557. --- Output Phase ---
  19558. ENV: Agent did: predict-no for direction U in state State-B
  19559. In State-B moving U
  19560. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19561. predict error 0
  19562. dir: dir isR
  19563. --- END Output Phase ---
  19564. \-/--- Input Phase ---
  19565. =>WM: (14619: I2 ^dir R)
  19566. =>WM: (14618: I2 ^reward 1)
  19567. =>WM: (14617: I2 ^see 0)
  19568. =>WM: (14616: N1037 ^status complete)
  19569. <=WM: (14605: I2 ^dir U)
  19570. <=WM: (14604: I2 ^reward 1)
  19571. <=WM: (14603: I2 ^see 0)
  19572. =>WM: (14620: I2 ^level-1 R0-root)
  19573. <=WM: (14606: I2 ^level-1 R0-root)
  19574. --- END Input Phase ---
  19575. --- Proposal Phase ---
  19576. --- Inner Elaboration Phase, active level 1 (S1) ---
  19577. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19578. -->
  19579. (S1 ^operator O2074 = 0.6601722790491221)
  19580. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19581. -->
  19582. (S1 ^operator O2073 = -0.1028953566115423)
  19583. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19584. -->
  19585. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19586. -->
  19587. Firing elaborate*copy-see-to-output-link
  19588. -->
  19589. (I3 ^see 0 +)
  19590. Firing elaborate*reward*based*on*reward
  19591. -->
  19592. (R1041 ^value 1 +)
  19593. (R1 ^reward R1041 +)
  19594. Firing propose*predict-yes
  19595. -->
  19596. (O2075 ^name predict-yes +)
  19597. (S1 ^operator O2075 +)
  19598. Firing propose*predict-no
  19599. -->
  19600. (O2076 ^name predict-no +)
  19601. (S1 ^operator O2076 +)
  19602. Firing rl*prefer*rvt*predict-no*H0*4
  19603. -->
  19604. (S1 ^operator O2074 = 0.3397793483103405)
  19605. Firing rl*prefer*rvt*predict-yes*H0*3
  19606. -->
  19607. (S1 ^operator O2073 = 0.337701263717275)
  19608. Firing prefer*rvt*predict-yes*H0
  19609. -->
  19610. Firing prefer*rvt*predict-no*H0
  19611. -->
  19612. Firing elaborate*copy-dir-to-output-link
  19613. -->
  19614. (I3 ^dir R +)
  19615. inner elaboration loop at bottom goal.
  19616. Retracting elaborate*copy-see-to-output-link
  19617. -->
  19618. (I3 ^see 0 +)
  19619. Retracting propose*predict-no
  19620. -->
  19621. (O2074 ^name predict-no +)
  19622. (S1 ^operator O2074 +)
  19623. Retracting propose*predict-yes
  19624. -->
  19625. (O2073 ^name predict-yes +)
  19626. (S1 ^operator O2073 +)
  19627. Retracting elaborate*reward*based*on*reward
  19628. -->
  19629. (R1040 ^value 1 +)
  19630. (R1 ^reward R1040 +)
  19631. Retracting elaborate*copy-dir-to-output-link
  19632. -->
  19633. (I3 ^dir U +)
  19634. Retracting rl*prefer*rvt*predict-no*H0*2
  19635. -->
  19636. (S1 ^operator O2074 = 1.)
  19637. Retracting rl*prefer*rvt*predict-yes*H0*1
  19638. -->
  19639. (S1 ^operator O2073 = 0.)
  19640. =>WM: (14627: S1 ^operator O2076 +)
  19641. =>WM: (14626: S1 ^operator O2075 +)
  19642. =>WM: (14625: I3 ^dir R)
  19643. =>WM: (14624: O2076 ^name predict-no)
  19644. =>WM: (14623: O2075 ^name predict-yes)
  19645. =>WM: (14622: R1041 ^value 1)
  19646. =>WM: (14621: R1 ^reward R1041)
  19647. <=WM: (14612: S1 ^operator O2073 +)
  19648. <=WM: (14613: S1 ^operator O2074 +)
  19649. <=WM: (14614: S1 ^operator O2074)
  19650. <=WM: (14611: I3 ^dir U)
  19651. <=WM: (14607: R1 ^reward R1040)
  19652. <=WM: (14610: O2074 ^name predict-no)
  19653. <=WM: (14609: O2073 ^name predict-yes)
  19654. <=WM: (14608: R1040 ^value 1)
  19655. --- Inner Elaboration Phase, active level 1 (S1) ---
  19656. Firing prefer*rvt*predict-yes*H0
  19657. -->
  19658. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19659. -->
  19660. (S1 ^operator O2075 = -0.1028953566115423)
  19661. Firing rl*prefer*rvt*predict-yes*H0*3
  19662. -->
  19663. (S1 ^operator O2075 = 0.337701263717275)
  19664. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19665. -->
  19666. Firing prefer*rvt*predict-no*H0
  19667. -->
  19668. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19669. -->
  19670. (S1 ^operator O2076 = 0.6601722790491221)
  19671. Firing rl*prefer*rvt*predict-no*H0*4
  19672. -->
  19673. (S1 ^operator O2076 = 0.3397793483103405)
  19674. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19675. -->
  19676. inner elaboration loop at bottom goal.
  19677. Retracting rl*prefer*rvt*predict-no*H0*4
  19678. -->
  19679. (S1 ^operator O2074 = 0.3397793483103405)
  19680. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19681. -->
  19682. (S1 ^operator O2074 = 0.6601722790491221)
  19683. Retracting rl*prefer*rvt*predict-yes*H0*3
  19684. -->
  19685. (S1 ^operator O2073 = 0.337701263717275)
  19686. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19687. -->
  19688. (S1 ^operator O2073 = -0.1028953566115423)
  19689. --- END Proposal Phase ---
  19690. --- Decision Phase ---
  19691. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19692. =>WM: (14628: S1 ^operator O2076)
  19693. 1038: O: O2076 (predict-no)
  19694. --- END Decision Phase ---
  19695. --- Application Phase ---
  19696. --- Firing Productions (PE) For State At Depth 1 ---
  19697. --- Inner Elaboration Phase, active level 1 (S1) ---
  19698. Firing apply*operator
  19699. -->
  19700. (I3 ^predict-no N1038 + :O )
  19701. Firing apply*operator*complete
  19702. -->
  19703. (I3 ^predict-no N1037 - :O )
  19704. inner elaboration loop at bottom goal.
  19705. --- Change Working Memory (PE) ---
  19706. =>WM: (14629: I3 ^predict-no N1038)
  19707. <=WM: (14616: N1037 ^status complete)
  19708. <=WM: (14615: I3 ^predict-no N1037)
  19709. --- Firing Productions (IE) For State At Depth 1 ---
  19710. --- Inner Elaboration Phase, active level 1 (S1) ---
  19711. Firing monitor*world
  19712. -->
  19713. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19714. --- Change Working Memory (IE) ---
  19715. --- END Application Phase ---
  19716. --- Output Phase ---
  19717. ENV: Agent did: predict-no for direction R in state State-B
  19718. In State-B moving R
  19719. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19720. predict error 0
  19721. dir: dir isR
  19722. --- END Output Phase ---
  19723. |\---- Input Phase ---
  19724. =>WM: (14633: I2 ^dir R)
  19725. =>WM: (14632: I2 ^reward 1)
  19726. =>WM: (14631: I2 ^see 0)
  19727. =>WM: (14630: N1038 ^status complete)
  19728. <=WM: (14619: I2 ^dir R)
  19729. <=WM: (14618: I2 ^reward 1)
  19730. <=WM: (14617: I2 ^see 0)
  19731. =>WM: (14634: I2 ^level-1 R0-root)
  19732. <=WM: (14620: I2 ^level-1 R0-root)
  19733. --- END Input Phase ---
  19734. --- Proposal Phase ---
  19735. --- Inner Elaboration Phase, active level 1 (S1) ---
  19736. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19737. -->
  19738. (S1 ^operator O2076 = 0.6601722790491221)
  19739. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19740. -->
  19741. (S1 ^operator O2075 = -0.1028953566115423)
  19742. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19743. -->
  19744. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19745. -->
  19746. Firing elaborate*copy-see-to-output-link
  19747. -->
  19748. (I3 ^see 0 +)
  19749. Firing elaborate*reward*based*on*reward
  19750. -->
  19751. (R1042 ^value 1 +)
  19752. (R1 ^reward R1042 +)
  19753. Firing propose*predict-yes
  19754. -->
  19755. (O2077 ^name predict-yes +)
  19756. (S1 ^operator O2077 +)
  19757. Firing propose*predict-no
  19758. -->
  19759. (O2078 ^name predict-no +)
  19760. (S1 ^operator O2078 +)
  19761. Firing rl*prefer*rvt*predict-no*H0*4
  19762. -->
  19763. (S1 ^operator O2076 = 0.3397793483103405)
  19764. Firing rl*prefer*rvt*predict-yes*H0*3
  19765. -->
  19766. (S1 ^operator O2075 = 0.337701263717275)
  19767. Firing prefer*rvt*predict-yes*H0
  19768. -->
  19769. Firing prefer*rvt*predict-no*H0
  19770. -->
  19771. Firing elaborate*copy-dir-to-output-link
  19772. -->
  19773. (I3 ^dir R +)
  19774. inner elaboration loop at bottom goal.
  19775. Retracting elaborate*copy-see-to-output-link
  19776. -->
  19777. (I3 ^see 0 +)
  19778. Retracting propose*predict-no
  19779. -->
  19780. (O2076 ^name predict-no +)
  19781. (S1 ^operator O2076 +)
  19782. Retracting propose*predict-yes
  19783. -->
  19784. (O2075 ^name predict-yes +)
  19785. (S1 ^operator O2075 +)
  19786. Retracting elaborate*reward*based*on*reward
  19787. -->
  19788. (R1041 ^value 1 +)
  19789. (R1 ^reward R1041 +)
  19790. Retracting elaborate*copy-dir-to-output-link
  19791. -->
  19792. (I3 ^dir R +)
  19793. Retracting rl*prefer*rvt*predict-no*H0*4
  19794. -->
  19795. (S1 ^operator O2076 = 0.3397793483103405)
  19796. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19797. -->
  19798. (S1 ^operator O2076 = 0.6601722790491221)
  19799. Retracting rl*prefer*rvt*predict-yes*H0*3
  19800. -->
  19801. (S1 ^operator O2075 = 0.337701263717275)
  19802. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19803. -->
  19804. (S1 ^operator O2075 = -0.1028953566115423)
  19805. =>WM: (14640: S1 ^operator O2078 +)
  19806. =>WM: (14639: S1 ^operator O2077 +)
  19807. =>WM: (14638: O2078 ^name predict-no)
  19808. =>WM: (14637: O2077 ^name predict-yes)
  19809. =>WM: (14636: R1042 ^value 1)
  19810. =>WM: (14635: R1 ^reward R1042)
  19811. <=WM: (14626: S1 ^operator O2075 +)
  19812. <=WM: (14627: S1 ^operator O2076 +)
  19813. <=WM: (14628: S1 ^operator O2076)
  19814. <=WM: (14621: R1 ^reward R1041)
  19815. <=WM: (14624: O2076 ^name predict-no)
  19816. <=WM: (14623: O2075 ^name predict-yes)
  19817. <=WM: (14622: R1041 ^value 1)
  19818. --- Inner Elaboration Phase, active level 1 (S1) ---
  19819. Firing prefer*rvt*predict-yes*H0
  19820. -->
  19821. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19822. -->
  19823. (S1 ^operator O2077 = -0.1028953566115423)
  19824. Firing rl*prefer*rvt*predict-yes*H0*3
  19825. -->
  19826. (S1 ^operator O2077 = 0.337701263717275)
  19827. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19828. -->
  19829. Firing prefer*rvt*predict-no*H0
  19830. -->
  19831. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19832. -->
  19833. (S1 ^operator O2078 = 0.6601722790491221)
  19834. Firing rl*prefer*rvt*predict-no*H0*4
  19835. -->
  19836. (S1 ^operator O2078 = 0.3397793483103405)
  19837. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19838. -->
  19839. inner elaboration loop at bottom goal.
  19840. Retracting rl*prefer*rvt*predict-no*H0*4
  19841. -->
  19842. (S1 ^operator O2076 = 0.3397793483103405)
  19843. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19844. -->
  19845. (S1 ^operator O2076 = 0.6601722790491221)
  19846. Retracting rl*prefer*rvt*predict-yes*H0*3
  19847. -->
  19848. (S1 ^operator O2075 = 0.337701263717275)
  19849. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19850. -->
  19851. (S1 ^operator O2075 = -0.1028953566115423)
  19852. --- END Proposal Phase ---
  19853. --- Decision Phase ---
  19854. RL update rl*prefer*rvt*predict-no*H0*4 0.570263 -0.230484 0.339779 -> 0.570267 -0.230484 0.339783(R,m,v=1,0.88,0.106207)
  19855. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429686 0.230486 0.660172 -> 0.429691 0.230486 0.660177(R,m,v=1,1,0)
  19856. =>WM: (14641: S1 ^operator O2078)
  19857. 1039: O: O2078 (predict-no)
  19858. --- END Decision Phase ---
  19859. --- Application Phase ---
  19860. --- Firing Productions (PE) For State At Depth 1 ---
  19861. --- Inner Elaboration Phase, active level 1 (S1) ---
  19862. Firing apply*operator
  19863. -->
  19864. (I3 ^predict-no N1039 + :O )
  19865. Firing apply*operator*complete
  19866. -->
  19867. (I3 ^predict-no N1038 - :O )
  19868. inner elaboration loop at bottom goal.
  19869. --- Change Working Memory (PE) ---
  19870. =>WM: (14642: I3 ^predict-no N1039)
  19871. <=WM: (14630: N1038 ^status complete)
  19872. <=WM: (14629: I3 ^predict-no N1038)
  19873. --- Firing Productions (IE) For State At Depth 1 ---
  19874. --- Inner Elaboration Phase, active level 1 (S1) ---
  19875. Firing monitor*world
  19876. -->
  19877. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19878. --- Change Working Memory (IE) ---
  19879. --- END Application Phase ---
  19880. --- Output Phase ---
  19881. ENV: Agent did: predict-no for direction R in state State-B
  19882. In State-B moving R
  19883. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19884. predict error 0
  19885. dir: dir isR
  19886. --- END Output Phase ---
  19887. /|\---- Input Phase ---
  19888. =>WM: (14646: I2 ^dir R)
  19889. =>WM: (14645: I2 ^reward 1)
  19890. =>WM: (14644: I2 ^see 0)
  19891. =>WM: (14643: N1039 ^status complete)
  19892. <=WM: (14633: I2 ^dir R)
  19893. <=WM: (14632: I2 ^reward 1)
  19894. <=WM: (14631: I2 ^see 0)
  19895. =>WM: (14647: I2 ^level-1 R0-root)
  19896. <=WM: (14634: I2 ^level-1 R0-root)
  19897. --- END Input Phase ---
  19898. --- Proposal Phase ---
  19899. --- Inner Elaboration Phase, active level 1 (S1) ---
  19900. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19901. -->
  19902. (S1 ^operator O2078 = 0.6601768507352938)
  19903. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19904. -->
  19905. (S1 ^operator O2077 = -0.1028953566115423)
  19906. Firing prefer*rvt*predict-no*H0*4*v1*H1
  19907. -->
  19908. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19909. -->
  19910. Firing elaborate*copy-see-to-output-link
  19911. -->
  19912. (I3 ^see 0 +)
  19913. Firing elaborate*reward*based*on*reward
  19914. -->
  19915. (R1043 ^value 1 +)
  19916. (R1 ^reward R1043 +)
  19917. Firing propose*predict-yes
  19918. -->
  19919. (O2079 ^name predict-yes +)
  19920. (S1 ^operator O2079 +)
  19921. Firing propose*predict-no
  19922. -->
  19923. (O2080 ^name predict-no +)
  19924. (S1 ^operator O2080 +)
  19925. Firing rl*prefer*rvt*predict-no*H0*4
  19926. -->
  19927. (S1 ^operator O2078 = 0.3397832716128478)
  19928. Firing rl*prefer*rvt*predict-yes*H0*3
  19929. -->
  19930. (S1 ^operator O2077 = 0.337701263717275)
  19931. Firing prefer*rvt*predict-yes*H0
  19932. -->
  19933. Firing prefer*rvt*predict-no*H0
  19934. -->
  19935. Firing elaborate*copy-dir-to-output-link
  19936. -->
  19937. (I3 ^dir R +)
  19938. inner elaboration loop at bottom goal.
  19939. Retracting elaborate*copy-see-to-output-link
  19940. -->
  19941. (I3 ^see 0 +)
  19942. Retracting propose*predict-no
  19943. -->
  19944. (O2078 ^name predict-no +)
  19945. (S1 ^operator O2078 +)
  19946. Retracting propose*predict-yes
  19947. -->
  19948. (O2077 ^name predict-yes +)
  19949. (S1 ^operator O2077 +)
  19950. Retracting elaborate*reward*based*on*reward
  19951. -->
  19952. (R1042 ^value 1 +)
  19953. (R1 ^reward R1042 +)
  19954. Retracting elaborate*copy-dir-to-output-link
  19955. -->
  19956. (I3 ^dir R +)
  19957. Retracting rl*prefer*rvt*predict-no*H0*4
  19958. -->
  19959. (S1 ^operator O2078 = 0.3397832716128478)
  19960. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19961. -->
  19962. (S1 ^operator O2078 = 0.6601768507352938)
  19963. Retracting rl*prefer*rvt*predict-yes*H0*3
  19964. -->
  19965. (S1 ^operator O2077 = 0.337701263717275)
  19966. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19967. -->
  19968. (S1 ^operator O2077 = -0.1028953566115423)
  19969. =>WM: (14653: S1 ^operator O2080 +)
  19970. =>WM: (14652: S1 ^operator O2079 +)
  19971. =>WM: (14651: O2080 ^name predict-no)
  19972. =>WM: (14650: O2079 ^name predict-yes)
  19973. =>WM: (14649: R1043 ^value 1)
  19974. =>WM: (14648: R1 ^reward R1043)
  19975. <=WM: (14639: S1 ^operator O2077 +)
  19976. <=WM: (14640: S1 ^operator O2078 +)
  19977. <=WM: (14641: S1 ^operator O2078)
  19978. <=WM: (14635: R1 ^reward R1042)
  19979. <=WM: (14638: O2078 ^name predict-no)
  19980. <=WM: (14637: O2077 ^name predict-yes)
  19981. <=WM: (14636: R1042 ^value 1)
  19982. --- Inner Elaboration Phase, active level 1 (S1) ---
  19983. Firing prefer*rvt*predict-yes*H0
  19984. -->
  19985. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  19986. -->
  19987. (S1 ^operator O2079 = -0.1028953566115423)
  19988. Firing rl*prefer*rvt*predict-yes*H0*3
  19989. -->
  19990. (S1 ^operator O2079 = 0.337701263717275)
  19991. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  19992. -->
  19993. Firing prefer*rvt*predict-no*H0
  19994. -->
  19995. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  19996. -->
  19997. (S1 ^operator O2080 = 0.6601768507352938)
  19998. Firing rl*prefer*rvt*predict-no*H0*4
  19999. -->
  20000. (S1 ^operator O2080 = 0.3397832716128478)
  20001. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20002. -->
  20003. inner elaboration loop at bottom goal.
  20004. Retracting rl*prefer*rvt*predict-no*H0*4
  20005. -->
  20006. (S1 ^operator O2078 = 0.3397832716128478)
  20007. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20008. -->
  20009. (S1 ^operator O2078 = 0.6601768507352938)
  20010. Retracting rl*prefer*rvt*predict-yes*H0*3
  20011. -->
  20012. (S1 ^operator O2077 = 0.337701263717275)
  20013. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20014. -->
  20015. (S1 ^operator O2077 = -0.1028953566115423)
  20016. --- END Proposal Phase ---
  20017. --- Decision Phase ---
  20018. RL update rl*prefer*rvt*predict-no*H0*4 0.570267 -0.230484 0.339783 -> 0.570271 -0.230484 0.339787(R,m,v=1,0.880682,0.105682)
  20019. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429691 0.230486 0.660177 -> 0.429695 0.230486 0.660181(R,m,v=1,1,0)
  20020. =>WM: (14654: S1 ^operator O2080)
  20021. 1040: O: O2080 (predict-no)
  20022. --- END Decision Phase ---
  20023. --- Application Phase ---
  20024. --- Firing Productions (PE) For State At Depth 1 ---
  20025. --- Inner Elaboration Phase, active level 1 (S1) ---
  20026. Firing apply*operator
  20027. -->
  20028. (I3 ^predict-no N1040 + :O )
  20029. Firing apply*operator*complete
  20030. -->
  20031. (I3 ^predict-no N1039 - :O )
  20032. inner elaboration loop at bottom goal.
  20033. --- Change Working Memory (PE) ---
  20034. =>WM: (14655: I3 ^predict-no N1040)
  20035. <=WM: (14643: N1039 ^status complete)
  20036. <=WM: (14642: I3 ^predict-no N1039)
  20037. --- Firing Productions (IE) For State At Depth 1 ---
  20038. --- Inner Elaboration Phase, active level 1 (S1) ---
  20039. Firing monitor*world
  20040. -->
  20041. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20042. --- Change Working Memory (IE) ---
  20043. --- END Application Phase ---
  20044. --- Output Phase ---
  20045. ENV: Agent did: predict-no for direction R in state State-B
  20046. In State-B moving R
  20047. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20048. predict error 0
  20049. dir: dir isU
  20050. --- END Output Phase ---
  20051. /|\--- Input Phase ---
  20052. =>WM: (14659: I2 ^dir U)
  20053. =>WM: (14658: I2 ^reward 1)
  20054. =>WM: (14657: I2 ^see 0)
  20055. =>WM: (14656: N1040 ^status complete)
  20056. <=WM: (14646: I2 ^dir R)
  20057. <=WM: (14645: I2 ^reward 1)
  20058. <=WM: (14644: I2 ^see 0)
  20059. =>WM: (14660: I2 ^level-1 R0-root)
  20060. <=WM: (14647: I2 ^level-1 R0-root)
  20061. --- END Input Phase ---
  20062. --- Proposal Phase ---
  20063. --- Inner Elaboration Phase, active level 1 (S1) ---
  20064. Firing elaborate*copy-see-to-output-link
  20065. -->
  20066. (I3 ^see 0 +)
  20067. Firing elaborate*reward*based*on*reward
  20068. -->
  20069. (R1044 ^value 1 +)
  20070. (R1 ^reward R1044 +)
  20071. Firing propose*predict-yes
  20072. -->
  20073. (O2081 ^name predict-yes +)
  20074. (S1 ^operator O2081 +)
  20075. Firing propose*predict-no
  20076. -->
  20077. (O2082 ^name predict-no +)
  20078. (S1 ^operator O2082 +)
  20079. Firing rl*prefer*rvt*predict-no*H0*2
  20080. -->
  20081. (S1 ^operator O2080 = 1.)
  20082. Firing rl*prefer*rvt*predict-yes*H0*1
  20083. -->
  20084. (S1 ^operator O2079 = 0.)
  20085. Firing prefer*rvt*predict-yes*H0
  20086. -->
  20087. Firing prefer*rvt*predict-no*H0
  20088. -->
  20089. Firing elaborate*copy-dir-to-output-link
  20090. -->
  20091. (I3 ^dir U +)
  20092. inner elaboration loop at bottom goal.
  20093. Retracting elaborate*copy-see-to-output-link
  20094. -->
  20095. (I3 ^see 0 +)
  20096. Retracting propose*predict-no
  20097. -->
  20098. (O2080 ^name predict-no +)
  20099. (S1 ^operator O2080 +)
  20100. Retracting propose*predict-yes
  20101. -->
  20102. (O2079 ^name predict-yes +)
  20103. (S1 ^operator O2079 +)
  20104. Retracting elaborate*reward*based*on*reward
  20105. -->
  20106. (R1043 ^value 1 +)
  20107. (R1 ^reward R1043 +)
  20108. Retracting elaborate*copy-dir-to-output-link
  20109. -->
  20110. (I3 ^dir R +)
  20111. Retracting rl*prefer*rvt*predict-no*H0*4
  20112. -->
  20113. (S1 ^operator O2080 = 0.3397865029356979)
  20114. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  20115. -->
  20116. (S1 ^operator O2080 = 0.6601806098946515)
  20117. Retracting rl*prefer*rvt*predict-yes*H0*3
  20118. -->
  20119. (S1 ^operator O2079 = 0.337701263717275)
  20120. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  20121. -->
  20122. (S1 ^operator O2079 = -0.1028953566115423)
  20123. =>WM: (14667: S1 ^operator O2082 +)
  20124. =>WM: (14666: S1 ^operator O2081 +)
  20125. =>WM: (14665: I3 ^dir U)
  20126. =>WM: (14664: O2082 ^name predict-no)
  20127. =>WM: (14663: O2081 ^name predict-yes)
  20128. =>WM: (14662: R1044 ^value 1)
  20129. =>WM: (14661: R1 ^reward R1044)
  20130. <=WM: (14652: S1 ^operator O2079 +)
  20131. <=WM: (14653: S1 ^operator O2080 +)
  20132. <=WM: (14654: S1 ^operator O2080)
  20133. <=WM: (14625: I3 ^dir R)
  20134. <=WM: (14648: R1 ^reward R1043)
  20135. <=WM: (14651: O2080 ^name predict-no)
  20136. <=WM: (14650: O2079 ^name predict-yes)
  20137. <=WM: (14649: R1043 ^value 1)
  20138. --- Inner Elaboration Phase, active level 1 (S1) ---
  20139. Firing prefer*rvt*predict-yes*H0
  20140. -->
  20141. Firing rl*prefer*rvt*predict-yes*H0*1
  20142. -->
  20143. (S1 ^operator O2081 = 0.)
  20144. Firing prefer*rvt*predict-no*H0
  20145. -->
  20146. Firing rl*prefer*rvt*predict-no*H0*2
  20147. -->
  20148. (S1 ^operator O2082 = 1.)
  20149. inner elaboration loop at bottom goal.
  20150. Retracting rl*prefer*rvt*predict-no*H0*2
  20151. -->
  20152. (S1 ^operator O2080 = 1.)
  20153. Retracting rl*prefer*rvt*predict-yes*H0*1
  20154. -->
  20155. (S1 ^operator O2079 = 0.)
  20156. --- END Proposal Phase ---
  20157. --- Decision Phase ---
  20158. RL update rl*prefer*rvt*predict-no*H0*4 0.570271 -0.230484 0.339787 -> 0.570274 -0.230484 0.339789(R,m,v=1,0.881356,0.105162)
  20159. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429695 0.230486 0.660181 -> 0.429698 0.230486 0.660184(R,m,v=1,1,0)
  20160. =>WM: (14668: S1 ^operator O2082)
  20161. 1041: O: O2082 (predict-no)
  20162. --- END Decision Phase ---
  20163. --- Application Phase ---
  20164. --- Firing Productions (PE) For State At Depth 1 ---
  20165. --- Inner Elaboration Phase, active level 1 (S1) ---
  20166. Firing apply*operator
  20167. -->
  20168. (I3 ^predict-no N1041 + :O )
  20169. Firing apply*operator*complete
  20170. -->
  20171. (I3 ^predict-no N1040 - :O )
  20172. inner elaboration loop at bottom goal.
  20173. --- Change Working Memory (PE) ---
  20174. =>WM: (14669: I3 ^predict-no N1041)
  20175. <=WM: (14656: N1040 ^status complete)
  20176. <=WM: (14655: I3 ^predict-no N1040)
  20177. --- Firing Productions (IE) For State At Depth 1 ---
  20178. --- Inner Elaboration Phase, active level 1 (S1) ---
  20179. Firing monitor*world
  20180. -->
  20181. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20182. --- Change Working Memory (IE) ---
  20183. --- END Application Phase ---
  20184. --- Output Phase ---
  20185. ENV: Agent did: predict-no for direction U in state State-B
  20186. In State-B moving U
  20187. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20188. predict error 0
  20189. dir: dir isL
  20190. --- END Output Phase ---
  20191. ---- Input Phase ---
  20192. =>WM: (14673: I2 ^dir L)
  20193. =>WM: (14672: I2 ^reward 1)
  20194. =>WM: (14671: I2 ^see 0)
  20195. =>WM: (14670: N1041 ^status complete)
  20196. <=WM: (14659: I2 ^dir U)
  20197. <=WM: (14658: I2 ^reward 1)
  20198. <=WM: (14657: I2 ^see 0)
  20199. =>WM: (14674: I2 ^level-1 R0-root)
  20200. <=WM: (14660: I2 ^level-1 R0-root)
  20201. --- END Input Phase ---
  20202. --- Proposal Phase ---
  20203. --- Inner Elaboration Phase, active level 1 (S1) ---
  20204. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  20205. -->
  20206. (S1 ^operator O2081 = 0.7358820562889159)
  20207. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20208. -->
  20209. Firing elaborate*copy-see-to-output-link
  20210. -->
  20211. (I3 ^see 0 +)
  20212. Firing elaborate*reward*based*on*reward
  20213. -->
  20214. (R1045 ^value 1 +)
  20215. (R1 ^reward R1045 +)
  20216. Firing propose*predict-yes
  20217. -->
  20218. (O2083 ^name predict-yes +)
  20219. (S1 ^operator O2083 +)
  20220. Firing propose*predict-no
  20221. -->
  20222. (O2084 ^name predict-no +)
  20223. (S1 ^operator O2084 +)
  20224. Firing rl*prefer*rvt*predict-no*H0*6
  20225. -->
  20226. (S1 ^operator O2082 = 0.9194479009118144)
  20227. Firing rl*prefer*rvt*predict-yes*H0*5
  20228. -->
  20229. (S1 ^operator O2081 = 0.2640108751521542)
  20230. Firing prefer*rvt*predict-yes*H0
  20231. -->
  20232. Firing prefer*rvt*predict-no*H0
  20233. -->
  20234. Firing elaborate*copy-dir-to-output-link
  20235. -->
  20236. (I3 ^dir L +)
  20237. inner elaboration loop at bottom goal.
  20238. Retracting elaborate*copy-see-to-output-link
  20239. -->
  20240. (I3 ^see 0 +)
  20241. Retracting propose*predict-no
  20242. -->
  20243. (O2082 ^name predict-no +)
  20244. (S1 ^operator O2082 +)
  20245. Retracting propose*predict-yes
  20246. -->
  20247. (O2081 ^name predict-yes +)
  20248. (S1 ^operator O2081 +)
  20249. Retracting elaborate*reward*based*on*reward
  20250. -->
  20251. (R1044 ^value 1 +)
  20252. (R1 ^reward R1044 +)
  20253. Retracting elaborate*copy-dir-to-output-link
  20254. -->
  20255. (I3 ^dir U +)
  20256. Retracting rl*prefer*rvt*predict-no*H0*2
  20257. -->
  20258. (S1 ^operator O2082 = 1.)
  20259. Retracting rl*prefer*rvt*predict-yes*H0*1
  20260. -->
  20261. (S1 ^operator O2081 = 0.)
  20262. =>WM: (14681: S1 ^operator O2084 +)
  20263. =>WM: (14680: S1 ^operator O2083 +)
  20264. =>WM: (14679: I3 ^dir L)
  20265. =>WM: (14678: O2084 ^name predict-no)
  20266. =>WM: (14677: O2083 ^name predict-yes)
  20267. =>WM: (14676: R1045 ^value 1)
  20268. =>WM: (14675: R1 ^reward R1045)
  20269. <=WM: (14666: S1 ^operator O2081 +)
  20270. <=WM: (14667: S1 ^operator O2082 +)
  20271. <=WM: (14668: S1 ^operator O2082)
  20272. <=WM: (14665: I3 ^dir U)
  20273. <=WM: (14661: R1 ^reward R1044)
  20274. <=WM: (14664: O2082 ^name predict-no)
  20275. <=WM: (14663: O2081 ^name predict-yes)
  20276. <=WM: (14662: R1044 ^value 1)
  20277. --- Inner Elaboration Phase, active level 1 (S1) ---
  20278. Firing prefer*rvt*predict-yes*H0
  20279. -->
  20280. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  20281. -->
  20282. (S1 ^operator O2083 = 0.7358820562889159)
  20283. Firing rl*prefer*rvt*predict-yes*H0*5
  20284. -->
  20285. (S1 ^operator O2083 = 0.2640108751521542)
  20286. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20287. -->
  20288. Firing prefer*rvt*predict-no*H0
  20289. -->
  20290. Firing rl*prefer*rvt*predict-no*H0*6
  20291. -->
  20292. (S1 ^operator O2084 = 0.9194479009118144)
  20293. inner elaboration loop at bottom goal.
  20294. Retracting rl*prefer*rvt*predict-no*H0*6
  20295. -->
  20296. (S1 ^operator O2082 = 0.9194479009118144)
  20297. Retracting rl*prefer*rvt*predict-yes*H0*5
  20298. -->
  20299. (S1 ^operator O2081 = 0.2640108751521542)
  20300. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  20301. -->
  20302. (S1 ^operator O2081 = 0.7358820562889159)
  20303. --- END Proposal Phase ---
  20304. --- Decision Phase ---
  20305. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20306. =>WM: (14682: S1 ^operator O2083)
  20307. 1042: O: O2083 (predict-yes)
  20308. --- END Decision Phase ---
  20309. --- Application Phase ---
  20310. --- Firing Productions (PE) For State At Depth 1 ---
  20311. --- Inner Elaboration Phase, active level 1 (S1) ---
  20312. Firing apply*operator
  20313. -->
  20314. (I3 ^predict-yes N1042 + :O )
  20315. Firing apply*operator*complete
  20316. -->
  20317. (I3 ^predict-no N1041 - :O )
  20318. inner elaboration loop at bottom goal.
  20319. --- Change Working Memory (PE) ---
  20320. =>WM: (14683: I3 ^predict-yes N1042)
  20321. <=WM: (14670: N1041 ^status complete)
  20322. <=WM: (14669: I3 ^predict-no N1041)
  20323. --- Firing Productions (IE) For State At Depth 1 ---
  20324. --- Inner Elaboration Phase, active level 1 (S1) ---
  20325. Firing monitor*world
  20326. -->
  20327. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20328. --- Change Working Memory (IE) ---
  20329. --- END Application Phase ---
  20330. --- Output Phase ---
  20331. ENV: Agent did: predict-yes for direction L in state State-B
  20332. In State-B moving L
  20333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  20334. predict error 0
  20335. dir: dir isL
  20336. --- END Output Phase ---
  20337. /|\--- Input Phase ---
  20338. =>WM: (14687: I2 ^dir L)
  20339. =>WM: (14686: I2 ^reward 1)
  20340. =>WM: (14685: I2 ^see 1)
  20341. =>WM: (14684: N1042 ^status complete)
  20342. <=WM: (14673: I2 ^dir L)
  20343. <=WM: (14672: I2 ^reward 1)
  20344. <=WM: (14671: I2 ^see 0)
  20345. =>WM: (14688: I2 ^level-1 L1-root)
  20346. <=WM: (14674: I2 ^level-1 R0-root)
  20347. --- END Input Phase ---
  20348. --- Proposal Phase ---
  20349. --- Inner Elaboration Phase, active level 1 (S1) ---
  20350. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20351. -->
  20352. (S1 ^operator O2083 = -0.181727099742844)
  20353. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20354. -->
  20355. Firing elaborate*copy-see-to-output-link
  20356. -->
  20357. (I3 ^see 1 +)
  20358. Firing elaborate*reward*based*on*reward
  20359. -->
  20360. (R1046 ^value 1 +)
  20361. (R1 ^reward R1046 +)
  20362. Firing propose*predict-yes
  20363. -->
  20364. (O2085 ^name predict-yes +)
  20365. (S1 ^operator O2085 +)
  20366. Firing propose*predict-no
  20367. -->
  20368. (O2086 ^name predict-no +)
  20369. (S1 ^operator O2086 +)
  20370. Firing rl*prefer*rvt*predict-no*H0*6
  20371. -->
  20372. (S1 ^operator O2084 = 0.9194479009118144)
  20373. Firing rl*prefer*rvt*predict-yes*H0*5
  20374. -->
  20375. (S1 ^operator O2083 = 0.2640108751521542)
  20376. Firing prefer*rvt*predict-yes*H0
  20377. -->
  20378. Firing prefer*rvt*predict-no*H0
  20379. -->
  20380. Firing elaborate*copy-dir-to-output-link
  20381. -->
  20382. (I3 ^dir L +)
  20383. inner elaboration loop at bottom goal.
  20384. Retracting elaborate*copy-see-to-output-link
  20385. -->
  20386. (I3 ^see 0 +)
  20387. Retracting propose*predict-no
  20388. -->
  20389. (O2084 ^name predict-no +)
  20390. (S1 ^operator O2084 +)
  20391. Retracting propose*predict-yes
  20392. -->
  20393. (O2083 ^name predict-yes +)
  20394. (S1 ^operator O2083 +)
  20395. Retracting elaborate*reward*based*on*reward
  20396. -->
  20397. (R1045 ^value 1 +)
  20398. (R1 ^reward R1045 +)
  20399. Retracting elaborate*copy-dir-to-output-link
  20400. -->
  20401. (I3 ^dir L +)
  20402. Retracting rl*prefer*rvt*predict-no*H0*6
  20403. -->
  20404. (S1 ^operator O2084 = 0.9194479009118144)
  20405. Retracting rl*prefer*rvt*predict-yes*H0*5
  20406. -->
  20407. (S1 ^operator O2083 = 0.2640108751521542)
  20408. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  20409. -->
  20410. (S1 ^operator O2083 = 0.7358820562889159)
  20411. =>WM: (14695: S1 ^operator O2086 +)
  20412. =>WM: (14694: S1 ^operator O2085 +)
  20413. =>WM: (14693: O2086 ^name predict-no)
  20414. =>WM: (14692: O2085 ^name predict-yes)
  20415. =>WM: (14691: R1046 ^value 1)
  20416. =>WM: (14690: R1 ^reward R1046)
  20417. =>WM: (14689: I3 ^see 1)
  20418. <=WM: (14680: S1 ^operator O2083 +)
  20419. <=WM: (14682: S1 ^operator O2083)
  20420. <=WM: (14681: S1 ^operator O2084 +)
  20421. <=WM: (14675: R1 ^reward R1045)
  20422. <=WM: (14579: I3 ^see 0)
  20423. <=WM: (14678: O2084 ^name predict-no)
  20424. <=WM: (14677: O2083 ^name predict-yes)
  20425. <=WM: (14676: R1045 ^value 1)
  20426. --- Inner Elaboration Phase, active level 1 (S1) ---
  20427. Firing prefer*rvt*predict-yes*H0
  20428. -->
  20429. Firing rl*prefer*rvt*predict-yes*H0*5
  20430. -->
  20431. (S1 ^operator O2085 = 0.2640108751521542)
  20432. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  20433. -->
  20434. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20435. -->
  20436. (S1 ^operator O2085 = -0.181727099742844)
  20437. Firing prefer*rvt*predict-no*H0
  20438. -->
  20439. Firing rl*prefer*rvt*predict-no*H0*6
  20440. -->
  20441. (S1 ^operator O2086 = 0.9194479009118144)
  20442. inner elaboration loop at bottom goal.
  20443. Retracting rl*prefer*rvt*predict-no*H0*6
  20444. -->
  20445. (S1 ^operator O2084 = 0.9194479009118144)
  20446. Retracting rl*prefer*rvt*predict-yes*H0*5
  20447. -->
  20448. (S1 ^operator O2083 = 0.2640108751521542)
  20449. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20450. -->
  20451. (S1 ^operator O2083 = -0.181727099742844)
  20452. --- END Proposal Phase ---
  20453. --- Decision Phase ---
  20454. RL update rl*prefer*rvt*predict-yes*H0*5 0.554397 -0.290386 0.264011 -> 0.554405 -0.290386 0.264019(R,m,v=1,0.880435,0.105845)
  20455. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445498 0.290384 0.735882 -> 0.445508 0.290384 0.735892(R,m,v=1,1,0)
  20456. =>WM: (14696: S1 ^operator O2086)
  20457. 1043: O: O2086 (predict-no)
  20458. --- END Decision Phase ---
  20459. --- Application Phase ---
  20460. --- Firing Productions (PE) For State At Depth 1 ---
  20461. --- Inner Elaboration Phase, active level 1 (S1) ---
  20462. Firing apply*operator
  20463. -->
  20464. (I3 ^predict-no N1043 + :O )
  20465. Firing apply*operator*complete
  20466. -->
  20467. (I3 ^predict-yes N1042 - :O )
  20468. inner elaboration loop at bottom goal.
  20469. --- Change Working Memory (PE) ---
  20470. =>WM: (14697: I3 ^predict-no N1043)
  20471. <=WM: (14684: N1042 ^status complete)
  20472. <=WM: (14683: I3 ^predict-yes N1042)
  20473. --- Firing Productions (IE) For State At Depth 1 ---
  20474. --- Inner Elaboration Phase, active level 1 (S1) ---
  20475. Firing monitor*world
  20476. -->
  20477. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20478. --- Change Working Memory (IE) ---
  20479. --- END Application Phase ---
  20480. --- Output Phase ---
  20481. ENV: Agent did: predict-no for direction L in state State-A
  20482. In State-A moving L
  20483. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20484. predict error 0
  20485. dir: dir isR
  20486. --- END Output Phase ---
  20487. -/|--- Input Phase ---
  20488. =>WM: (14701: I2 ^dir R)
  20489. =>WM: (14700: I2 ^reward 1)
  20490. =>WM: (14699: I2 ^see 0)
  20491. =>WM: (14698: N1043 ^status complete)
  20492. <=WM: (14687: I2 ^dir L)
  20493. <=WM: (14686: I2 ^reward 1)
  20494. <=WM: (14685: I2 ^see 1)
  20495. =>WM: (14702: I2 ^level-1 L0-root)
  20496. <=WM: (14688: I2 ^level-1 L1-root)
  20497. --- END Input Phase ---
  20498. --- Proposal Phase ---
  20499. --- Inner Elaboration Phase, active level 1 (S1) ---
  20500. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20501. -->
  20502. (S1 ^operator O2086 = -0.2817060109291377)
  20503. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20504. -->
  20505. (S1 ^operator O2085 = 0.6623276056502743)
  20506. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20507. -->
  20508. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20509. -->
  20510. Firing elaborate*copy-see-to-output-link
  20511. -->
  20512. (I3 ^see 0 +)
  20513. Firing elaborate*reward*based*on*reward
  20514. -->
  20515. (R1047 ^value 1 +)
  20516. (R1 ^reward R1047 +)
  20517. Firing propose*predict-yes
  20518. -->
  20519. (O2087 ^name predict-yes +)
  20520. (S1 ^operator O2087 +)
  20521. Firing propose*predict-no
  20522. -->
  20523. (O2088 ^name predict-no +)
  20524. (S1 ^operator O2088 +)
  20525. Firing rl*prefer*rvt*predict-no*H0*4
  20526. -->
  20527. (S1 ^operator O2086 = 0.3397891653686922)
  20528. Firing rl*prefer*rvt*predict-yes*H0*3
  20529. -->
  20530. (S1 ^operator O2085 = 0.337701263717275)
  20531. Firing prefer*rvt*predict-yes*H0
  20532. -->
  20533. Firing prefer*rvt*predict-no*H0
  20534. -->
  20535. Firing elaborate*copy-dir-to-output-link
  20536. -->
  20537. (I3 ^dir R +)
  20538. inner elaboration loop at bottom goal.
  20539. Retracting elaborate*copy-see-to-output-link
  20540. -->
  20541. (I3 ^see 1 +)
  20542. Retracting propose*predict-no
  20543. -->
  20544. (O2086 ^name predict-no +)
  20545. (S1 ^operator O2086 +)
  20546. Retracting propose*predict-yes
  20547. -->
  20548. (O2085 ^name predict-yes +)
  20549. (S1 ^operator O2085 +)
  20550. Retracting elaborate*reward*based*on*reward
  20551. -->
  20552. (R1046 ^value 1 +)
  20553. (R1 ^reward R1046 +)
  20554. Retracting elaborate*copy-dir-to-output-link
  20555. -->
  20556. (I3 ^dir L +)
  20557. Retracting rl*prefer*rvt*predict-no*H0*6
  20558. -->
  20559. (S1 ^operator O2086 = 0.9194479009118144)
  20560. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  20561. -->
  20562. (S1 ^operator O2085 = -0.181727099742844)
  20563. Retracting rl*prefer*rvt*predict-yes*H0*5
  20564. -->
  20565. (S1 ^operator O2085 = 0.2640194889627474)
  20566. =>WM: (14710: S1 ^operator O2088 +)
  20567. =>WM: (14709: S1 ^operator O2087 +)
  20568. =>WM: (14708: I3 ^dir R)
  20569. =>WM: (14707: O2088 ^name predict-no)
  20570. =>WM: (14706: O2087 ^name predict-yes)
  20571. =>WM: (14705: R1047 ^value 1)
  20572. =>WM: (14704: R1 ^reward R1047)
  20573. =>WM: (14703: I3 ^see 0)
  20574. <=WM: (14694: S1 ^operator O2085 +)
  20575. <=WM: (14695: S1 ^operator O2086 +)
  20576. <=WM: (14696: S1 ^operator O2086)
  20577. <=WM: (14679: I3 ^dir L)
  20578. <=WM: (14690: R1 ^reward R1046)
  20579. <=WM: (14689: I3 ^see 1)
  20580. <=WM: (14693: O2086 ^name predict-no)
  20581. <=WM: (14692: O2085 ^name predict-yes)
  20582. <=WM: (14691: R1046 ^value 1)
  20583. --- Inner Elaboration Phase, active level 1 (S1) ---
  20584. Firing prefer*rvt*predict-yes*H0
  20585. -->
  20586. Firing rl*prefer*rvt*predict-yes*H0*3
  20587. -->
  20588. (S1 ^operator O2087 = 0.337701263717275)
  20589. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20590. -->
  20591. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20592. -->
  20593. (S1 ^operator O2087 = 0.6623276056502743)
  20594. Firing prefer*rvt*predict-no*H0
  20595. -->
  20596. Firing rl*prefer*rvt*predict-no*H0*4
  20597. -->
  20598. (S1 ^operator O2088 = 0.3397891653686922)
  20599. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20600. -->
  20601. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20602. -->
  20603. (S1 ^operator O2088 = -0.2817060109291377)
  20604. inner elaboration loop at bottom goal.
  20605. Retracting rl*prefer*rvt*predict-no*H0*4
  20606. -->
  20607. (S1 ^operator O2086 = 0.3397891653686922)
  20608. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20609. -->
  20610. (S1 ^operator O2086 = -0.2817060109291377)
  20611. Retracting rl*prefer*rvt*predict-yes*H0*3
  20612. -->
  20613. (S1 ^operator O2085 = 0.337701263717275)
  20614. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20615. -->
  20616. (S1 ^operator O2085 = 0.6623276056502743)
  20617. --- END Proposal Phase ---
  20618. --- Decision Phase ---
  20619. RL update rl*prefer*rvt*predict-no*H0*6 0.919448 0 0.919448 -> 0.932763 0 0.932763(R,m,v=1,0.903846,0.087469)
  20620. =>WM: (14711: S1 ^operator O2087)
  20621. 1044: O: O2087 (predict-yes)
  20622. --- END Decision Phase ---
  20623. --- Application Phase ---
  20624. --- Firing Productions (PE) For State At Depth 1 ---
  20625. --- Inner Elaboration Phase, active level 1 (S1) ---
  20626. Firing apply*operator
  20627. -->
  20628. (I3 ^predict-yes N1044 + :O )
  20629. Firing apply*operator*complete
  20630. -->
  20631. (I3 ^predict-no N1043 - :O )
  20632. inner elaboration loop at bottom goal.
  20633. --- Change Working Memory (PE) ---
  20634. =>WM: (14712: I3 ^predict-yes N1044)
  20635. <=WM: (14698: N1043 ^status complete)
  20636. <=WM: (14697: I3 ^predict-no N1043)
  20637. --- Firing Productions (IE) For State At Depth 1 ---
  20638. --- Inner Elaboration Phase, active level 1 (S1) ---
  20639. Firing monitor*world
  20640. -->
  20641. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20642. --- Change Working Memory (IE) ---
  20643. --- END Application Phase ---
  20644. --- Output Phase ---
  20645. ENV: Agent did: predict-yes for direction R in state State-A
  20646. In State-A moving R
  20647. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20648. predict error 0
  20649. dir: dir isU
  20650. --- END Output Phase ---
  20651. \-/--- Input Phase ---
  20652. =>WM: (14716: I2 ^dir U)
  20653. =>WM: (14715: I2 ^reward 1)
  20654. =>WM: (14714: I2 ^see 1)
  20655. =>WM: (14713: N1044 ^status complete)
  20656. <=WM: (14701: I2 ^dir R)
  20657. <=WM: (14700: I2 ^reward 1)
  20658. <=WM: (14699: I2 ^see 0)
  20659. =>WM: (14717: I2 ^level-1 R1-root)
  20660. <=WM: (14702: I2 ^level-1 L0-root)
  20661. --- END Input Phase ---
  20662. --- Proposal Phase ---
  20663. --- Inner Elaboration Phase, active level 1 (S1) ---
  20664. Firing elaborate*copy-see-to-output-link
  20665. -->
  20666. (I3 ^see 1 +)
  20667. Firing elaborate*reward*based*on*reward
  20668. -->
  20669. (R1048 ^value 1 +)
  20670. (R1 ^reward R1048 +)
  20671. Firing propose*predict-yes
  20672. -->
  20673. (O2089 ^name predict-yes +)
  20674. (S1 ^operator O2089 +)
  20675. Firing propose*predict-no
  20676. -->
  20677. (O2090 ^name predict-no +)
  20678. (S1 ^operator O2090 +)
  20679. Firing rl*prefer*rvt*predict-no*H0*2
  20680. -->
  20681. (S1 ^operator O2088 = 1.)
  20682. Firing rl*prefer*rvt*predict-yes*H0*1
  20683. -->
  20684. (S1 ^operator O2087 = 0.)
  20685. Firing prefer*rvt*predict-yes*H0
  20686. -->
  20687. Firing prefer*rvt*predict-no*H0
  20688. -->
  20689. Firing elaborate*copy-dir-to-output-link
  20690. -->
  20691. (I3 ^dir U +)
  20692. inner elaboration loop at bottom goal.
  20693. Retracting elaborate*copy-see-to-output-link
  20694. -->
  20695. (I3 ^see 0 +)
  20696. Retracting propose*predict-no
  20697. -->
  20698. (O2088 ^name predict-no +)
  20699. (S1 ^operator O2088 +)
  20700. Retracting propose*predict-yes
  20701. -->
  20702. (O2087 ^name predict-yes +)
  20703. (S1 ^operator O2087 +)
  20704. Retracting elaborate*reward*based*on*reward
  20705. -->
  20706. (R1047 ^value 1 +)
  20707. (R1 ^reward R1047 +)
  20708. Retracting elaborate*copy-dir-to-output-link
  20709. -->
  20710. (I3 ^dir R +)
  20711. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  20712. -->
  20713. (S1 ^operator O2088 = -0.2817060109291377)
  20714. Retracting rl*prefer*rvt*predict-no*H0*4
  20715. -->
  20716. (S1 ^operator O2088 = 0.3397891653686922)
  20717. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  20718. -->
  20719. (S1 ^operator O2087 = 0.6623276056502743)
  20720. Retracting rl*prefer*rvt*predict-yes*H0*3
  20721. -->
  20722. (S1 ^operator O2087 = 0.337701263717275)
  20723. =>WM: (14725: S1 ^operator O2090 +)
  20724. =>WM: (14724: S1 ^operator O2089 +)
  20725. =>WM: (14723: I3 ^dir U)
  20726. =>WM: (14722: O2090 ^name predict-no)
  20727. =>WM: (14721: O2089 ^name predict-yes)
  20728. =>WM: (14720: R1048 ^value 1)
  20729. =>WM: (14719: R1 ^reward R1048)
  20730. =>WM: (14718: I3 ^see 1)
  20731. <=WM: (14709: S1 ^operator O2087 +)
  20732. <=WM: (14711: S1 ^operator O2087)
  20733. <=WM: (14710: S1 ^operator O2088 +)
  20734. <=WM: (14708: I3 ^dir R)
  20735. <=WM: (14704: R1 ^reward R1047)
  20736. <=WM: (14703: I3 ^see 0)
  20737. <=WM: (14707: O2088 ^name predict-no)
  20738. <=WM: (14706: O2087 ^name predict-yes)
  20739. <=WM: (14705: R1047 ^value 1)
  20740. --- Inner Elaboration Phase, active level 1 (S1) ---
  20741. Firing prefer*rvt*predict-yes*H0
  20742. -->
  20743. Firing rl*prefer*rvt*predict-yes*H0*1
  20744. -->
  20745. (S1 ^operator O2089 = 0.)
  20746. Firing prefer*rvt*predict-no*H0
  20747. -->
  20748. Firing rl*prefer*rvt*predict-no*H0*2
  20749. -->
  20750. (S1 ^operator O2090 = 1.)
  20751. inner elaboration loop at bottom goal.
  20752. Retracting rl*prefer*rvt*predict-no*H0*2
  20753. -->
  20754. (S1 ^operator O2088 = 1.)
  20755. Retracting rl*prefer*rvt*predict-yes*H0*1
  20756. -->
  20757. (S1 ^operator O2087 = 0.)
  20758. --- END Proposal Phase ---
  20759. --- Decision Phase ---
  20760. RL update rl*prefer*rvt*predict-yes*H0*3 0.5901 -0.252399 0.337701 -> 0.590097 -0.252398 0.337699(R,m,v=1,0.902299,0.0886652)
  20761. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409933 0.252394 0.662328 -> 0.40993 0.252395 0.662325(R,m,v=1,1,0)
  20762. =>WM: (14726: S1 ^operator O2090)
  20763. 1045: O: O2090 (predict-no)
  20764. --- END Decision Phase ---
  20765. --- Application Phase ---
  20766. --- Firing Productions (PE) For State At Depth 1 ---
  20767. --- Inner Elaboration Phase, active level 1 (S1) ---
  20768. Firing apply*operator
  20769. -->
  20770. (I3 ^predict-no N1045 + :O )
  20771. Firing apply*operator*complete
  20772. -->
  20773. (I3 ^predict-yes N1044 - :O )
  20774. inner elaboration loop at bottom goal.
  20775. --- Change Working Memory (PE) ---
  20776. =>WM: (14727: I3 ^predict-no N1045)
  20777. <=WM: (14713: N1044 ^status complete)
  20778. <=WM: (14712: I3 ^predict-yes N1044)
  20779. --- Firing Productions (IE) For State At Depth 1 ---
  20780. --- Inner Elaboration Phase, active level 1 (S1) ---
  20781. Firing monitor*world
  20782. -->
  20783. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20784. --- Change Working Memory (IE) ---
  20785. --- END Application Phase ---
  20786. --- Output Phase ---
  20787. ENV: Agent did: predict-no for direction U in state State-B
  20788. In State-B moving U
  20789. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20790. predict error 0
  20791. dir: dir isU
  20792. --- END Output Phase ---
  20793. |\---- Input Phase ---
  20794. =>WM: (14731: I2 ^dir U)
  20795. =>WM: (14730: I2 ^reward 1)
  20796. =>WM: (14729: I2 ^see 0)
  20797. =>WM: (14728: N1045 ^status complete)
  20798. <=WM: (14716: I2 ^dir U)
  20799. <=WM: (14715: I2 ^reward 1)
  20800. <=WM: (14714: I2 ^see 1)
  20801. =>WM: (14732: I2 ^level-1 R1-root)
  20802. <=WM: (14717: I2 ^level-1 R1-root)
  20803. --- END Input Phase ---
  20804. --- Proposal Phase ---
  20805. --- Inner Elaboration Phase, active level 1 (S1) ---
  20806. Firing elaborate*copy-see-to-output-link
  20807. -->
  20808. (I3 ^see 0 +)
  20809. Firing elaborate*reward*based*on*reward
  20810. -->
  20811. (R1049 ^value 1 +)
  20812. (R1 ^reward R1049 +)
  20813. Firing propose*predict-yes
  20814. -->
  20815. (O2091 ^name predict-yes +)
  20816. (S1 ^operator O2091 +)
  20817. Firing propose*predict-no
  20818. -->
  20819. (O2092 ^name predict-no +)
  20820. (S1 ^operator O2092 +)
  20821. Firing rl*prefer*rvt*predict-no*H0*2
  20822. -->
  20823. (S1 ^operator O2090 = 1.)
  20824. Firing rl*prefer*rvt*predict-yes*H0*1
  20825. -->
  20826. (S1 ^operator O2089 = 0.)
  20827. Firing prefer*rvt*predict-yes*H0
  20828. -->
  20829. Firing prefer*rvt*predict-no*H0
  20830. -->
  20831. Firing elaborate*copy-dir-to-output-link
  20832. -->
  20833. (I3 ^dir U +)
  20834. inner elaboration loop at bottom goal.
  20835. Retracting elaborate*copy-see-to-output-link
  20836. -->
  20837. (I3 ^see 1 +)
  20838. Retracting propose*predict-no
  20839. -->
  20840. (O2090 ^name predict-no +)
  20841. (S1 ^operator O2090 +)
  20842. Retracting propose*predict-yes
  20843. -->
  20844. (O2089 ^name predict-yes +)
  20845. (S1 ^operator O2089 +)
  20846. Retracting elaborate*reward*based*on*reward
  20847. -->
  20848. (R1048 ^value 1 +)
  20849. (R1 ^reward R1048 +)
  20850. Retracting elaborate*copy-dir-to-output-link
  20851. -->
  20852. (I3 ^dir U +)
  20853. Retracting rl*prefer*rvt*predict-no*H0*2
  20854. -->
  20855. (S1 ^operator O2090 = 1.)
  20856. Retracting rl*prefer*rvt*predict-yes*H0*1
  20857. -->
  20858. (S1 ^operator O2089 = 0.)
  20859. =>WM: (14739: S1 ^operator O2092 +)
  20860. =>WM: (14738: S1 ^operator O2091 +)
  20861. =>WM: (14737: O2092 ^name predict-no)
  20862. =>WM: (14736: O2091 ^name predict-yes)
  20863. =>WM: (14735: R1049 ^value 1)
  20864. =>WM: (14734: R1 ^reward R1049)
  20865. =>WM: (14733: I3 ^see 0)
  20866. <=WM: (14724: S1 ^operator O2089 +)
  20867. <=WM: (14725: S1 ^operator O2090 +)
  20868. <=WM: (14726: S1 ^operator O2090)
  20869. <=WM: (14719: R1 ^reward R1048)
  20870. <=WM: (14718: I3 ^see 1)
  20871. <=WM: (14722: O2090 ^name predict-no)
  20872. <=WM: (14721: O2089 ^name predict-yes)
  20873. <=WM: (14720: R1048 ^value 1)
  20874. --- Inner Elaboration Phase, active level 1 (S1) ---
  20875. Firing prefer*rvt*predict-yes*H0
  20876. -->
  20877. Firing rl*prefer*rvt*predict-yes*H0*1
  20878. -->
  20879. (S1 ^operator O2091 = 0.)
  20880. Firing prefer*rvt*predict-no*H0
  20881. -->
  20882. Firing rl*prefer*rvt*predict-no*H0*2
  20883. -->
  20884. (S1 ^operator O2092 = 1.)
  20885. inner elaboration loop at bottom goal.
  20886. Retracting rl*prefer*rvt*predict-no*H0*2
  20887. -->
  20888. (S1 ^operator O2090 = 1.)
  20889. Retracting rl*prefer*rvt*predict-yes*H0*1
  20890. -->
  20891. (S1 ^operator O2089 = 0.)
  20892. --- END Proposal Phase ---
  20893. --- Decision Phase ---
  20894. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  20895. =>WM: (14740: S1 ^operator O2092)
  20896. 1046: O: O2092 (predict-no)
  20897. --- END Decision Phase ---
  20898. --- Application Phase ---
  20899. --- Firing Productions (PE) For State At Depth 1 ---
  20900. --- Inner Elaboration Phase, active level 1 (S1) ---
  20901. Firing apply*operator
  20902. -->
  20903. (I3 ^predict-no N1046 + :O )
  20904. Firing apply*operator*complete
  20905. -->
  20906. (I3 ^predict-no N1045 - :O )
  20907. inner elaboration loop at bottom goal.
  20908. --- Change Working Memory (PE) ---
  20909. =>WM: (14741: I3 ^predict-no N1046)
  20910. <=WM: (14728: N1045 ^status complete)
  20911. <=WM: (14727: I3 ^predict-no N1045)
  20912. --- Firing Productions (IE) For State At Depth 1 ---
  20913. --- Inner Elaboration Phase, active level 1 (S1) ---
  20914. Firing monitor*world
  20915. -->
  20916. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20917. --- Change Working Memory (IE) ---
  20918. --- END Application Phase ---
  20919. --- Output Phase ---
  20920. ENV: Agent did: predict-no for direction U in state State-B
  20921. In State-B moving U
  20922. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20923. predict error 0
  20924. dir: dir isR
  20925. --- END Output Phase ---
  20926. /|\--- Input Phase ---
  20927. =>WM: (14745: I2 ^dir R)
  20928. =>WM: (14744: I2 ^reward 1)
  20929. =>WM: (14743: I2 ^see 0)
  20930. =>WM: (14742: N1046 ^status complete)
  20931. <=WM: (14731: I2 ^dir U)
  20932. <=WM: (14730: I2 ^reward 1)
  20933. <=WM: (14729: I2 ^see 0)
  20934. =>WM: (14746: I2 ^level-1 R1-root)
  20935. <=WM: (14732: I2 ^level-1 R1-root)
  20936. --- END Input Phase ---
  20937. --- Proposal Phase ---
  20938. --- Inner Elaboration Phase, active level 1 (S1) ---
  20939. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  20940. -->
  20941. (S1 ^operator O2091 = -0.1070236389116304)
  20942. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  20943. -->
  20944. (S1 ^operator O2092 = 0.660238287807148)
  20945. Firing prefer*rvt*predict-no*H0*4*v1*H1
  20946. -->
  20947. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  20948. -->
  20949. Firing elaborate*copy-see-to-output-link
  20950. -->
  20951. (I3 ^see 0 +)
  20952. Firing elaborate*reward*based*on*reward
  20953. -->
  20954. (R1050 ^value 1 +)
  20955. (R1 ^reward R1050 +)
  20956. Firing propose*predict-yes
  20957. -->
  20958. (O2093 ^name predict-yes +)
  20959. (S1 ^operator O2093 +)
  20960. Firing propose*predict-no
  20961. -->
  20962. (O2094 ^name predict-no +)
  20963. (S1 ^operator O2094 +)
  20964. Firing rl*prefer*rvt*predict-no*H0*4
  20965. -->
  20966. (S1 ^operator O2092 = 0.3397891653686922)
  20967. Firing rl*prefer*rvt*predict-yes*H0*3
  20968. -->
  20969. (S1 ^operator O2091 = 0.3376989200650307)
  20970. Firing prefer*rvt*predict-yes*H0
  20971. -->
  20972. Firing prefer*rvt*predict-no*H0
  20973. -->
  20974. Firing elaborate*copy-dir-to-output-link
  20975. -->
  20976. (I3 ^dir R +)
  20977. inner elaboration loop at bottom goal.
  20978. Retracting elaborate*copy-see-to-output-link
  20979. -->
  20980. (I3 ^see 0 +)
  20981. Retracting propose*predict-no
  20982. -->
  20983. (O2092 ^name predict-no +)
  20984. (S1 ^operator O2092 +)
  20985. Retracting propose*predict-yes
  20986. -->
  20987. (O2091 ^name predict-yes +)
  20988. (S1 ^operator O2091 +)
  20989. Retracting elaborate*reward*based*on*reward
  20990. -->
  20991. (R1049 ^value 1 +)
  20992. (R1 ^reward R1049 +)
  20993. Retracting elaborate*copy-dir-to-output-link
  20994. -->
  20995. (I3 ^dir U +)
  20996. Retracting rl*prefer*rvt*predict-no*H0*2
  20997. -->
  20998. (S1 ^operator O2092 = 1.)
  20999. Retracting rl*prefer*rvt*predict-yes*H0*1
  21000. -->
  21001. (S1 ^operator O2091 = 0.)
  21002. =>WM: (14753: S1 ^operator O2094 +)
  21003. =>WM: (14752: S1 ^operator O2093 +)
  21004. =>WM: (14751: I3 ^dir R)
  21005. =>WM: (14750: O2094 ^name predict-no)
  21006. =>WM: (14749: O2093 ^name predict-yes)
  21007. =>WM: (14748: R1050 ^value 1)
  21008. =>WM: (14747: R1 ^reward R1050)
  21009. <=WM: (14738: S1 ^operator O2091 +)
  21010. <=WM: (14739: S1 ^operator O2092 +)
  21011. <=WM: (14740: S1 ^operator O2092)
  21012. <=WM: (14723: I3 ^dir U)
  21013. <=WM: (14734: R1 ^reward R1049)
  21014. <=WM: (14737: O2092 ^name predict-no)
  21015. <=WM: (14736: O2091 ^name predict-yes)
  21016. <=WM: (14735: R1049 ^value 1)
  21017. --- Inner Elaboration Phase, active level 1 (S1) ---
  21018. Firing prefer*rvt*predict-yes*H0
  21019. -->
  21020. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  21021. -->
  21022. (S1 ^operator O2093 = -0.1070236389116304)
  21023. Firing rl*prefer*rvt*predict-yes*H0*3
  21024. -->
  21025. (S1 ^operator O2093 = 0.3376989200650307)
  21026. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  21027. -->
  21028. Firing prefer*rvt*predict-no*H0
  21029. -->
  21030. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  21031. -->
  21032. (S1 ^operator O2094 = 0.660238287807148)
  21033. Firing rl*prefer*rvt*predict-no*H0*4
  21034. -->
  21035. (S1 ^operator O2094 = 0.3397891653686922)
  21036. Firing prefer*rvt*predict-no*H0*4*v1*H1
  21037. -->
  21038. inner elaboration loop at bottom goal.
  21039. Retracting rl*prefer*rvt*predict-no*H0*4
  21040. -->
  21041. (S1 ^operator O2092 = 0.3397891653686922)
  21042. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  21043. -->
  21044. (S1 ^operator O2092 = 0.660238287807148)
  21045. Retracting rl*prefer*rvt*predict-yes*H0*3
  21046. -->
  21047. (S1 ^operator O2091 = 0.3376989200650307)
  21048. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  21049. -->
  21050. (S1 ^operator O2091 = -0.1070236389116304)
  21051. --- END Proposal Phase ---
  21052. --- Decision Phase ---
  21053. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21054. =>WM: (14754: S1 ^operator O2094)
  21055. 1047: O: O2094 (predict-no)
  21056. --- END Decision Phase ---
  21057. --- Application Phase ---
  21058. --- Firing Productions (PE) For State At Depth 1 ---
  21059. --- Inner Elaboration Phase, active level 1 (S1) ---
  21060. Firing apply*operator
  21061. -->
  21062. (I3 ^predict-no N1047 + :O )
  21063. Firing apply*operator*complete
  21064. -->
  21065. (I3 ^predict-no N1046 - :O )
  21066. inner elaboration loop at bottom goal.
  21067. --- Change Working Memory (PE) ---
  21068. =>WM: (14755: I3 ^predict-no N1047)
  21069. <=WM: (14742: N1046 ^status complete)
  21070. <=WM: (14741: I3 ^predict-no N1046)
  21071. --- Firing Productions (IE) For State At Depth 1 ---
  21072. --- Inner Elaboration Phase, active level 1 (S1) ---
  21073. Firing monitor*world
  21074. -->
  21075. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21076. --- Change Working Memory (IE) ---
  21077. --- END Application Phase ---
  21078. --- Output Phase ---
  21079. ENV: Agent did: predict-no for direction R in state State-B
  21080. In State-B moving R
  21081. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21082. predict error 0
  21083. dir: dir isR
  21084. --- END Output Phase ---
  21085. -/|--- Input Phase ---
  21086. =>WM: (14759: I2 ^dir R)
  21087. =>WM: (14758: I2 ^reward 1)
  21088. =>WM: (14757: I2 ^see 0)
  21089. =>WM: (14756: N1047 ^status complete)
  21090. <=WM: (14745: I2 ^dir R)
  21091. <=WM: (14744: I2 ^reward 1)
  21092. <=WM: (14743: I2 ^see 0)
  21093. =>WM: (14760: I2 ^level-1 R0-root)
  21094. <=WM: (14746: I2 ^level-1 R1-root)
  21095. --- END Input Phase ---
  21096. --- Proposal Phase ---
  21097. --- Inner Elaboration Phase, active level 1 (S1) ---
  21098. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21099. -->
  21100. (S1 ^operator O2094 = 0.6601837022541405)
  21101. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21102. -->
  21103. (S1 ^operator O2093 = -0.1028953566115423)
  21104. Firing prefer*rvt*predict-no*H0*4*v1*H1
  21105. -->
  21106. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  21107. -->
  21108. Firing elaborate*copy-see-to-output-link
  21109. -->
  21110. (I3 ^see 0 +)
  21111. Firing elaborate*reward*based*on*reward
  21112. -->
  21113. (R1051 ^value 1 +)
  21114. (R1 ^reward R1051 +)
  21115. Firing propose*predict-yes
  21116. -->
  21117. (O2095 ^name predict-yes +)
  21118. (S1 ^operator O2095 +)
  21119. Firing propose*predict-no
  21120. -->
  21121. (O2096 ^name predict-no +)
  21122. (S1 ^operator O2096 +)
  21123. Firing rl*prefer*rvt*predict-no*H0*4
  21124. -->
  21125. (S1 ^operator O2094 = 0.3397891653686922)
  21126. Firing rl*prefer*rvt*predict-yes*H0*3
  21127. -->
  21128. (S1 ^operator O2093 = 0.3376989200650307)
  21129. Firing prefer*rvt*predict-yes*H0
  21130. -->
  21131. Firing prefer*rvt*predict-no*H0
  21132. -->
  21133. Firing elaborate*copy-dir-to-output-link
  21134. -->
  21135. (I3 ^dir R +)
  21136. inner elaboration loop at bottom goal.
  21137. Retracting elaborate*copy-see-to-output-link
  21138. -->
  21139. (I3 ^see 0 +)
  21140. Retracting propose*predict-no
  21141. -->
  21142. (O2094 ^name predict-no +)
  21143. (S1 ^operator O2094 +)
  21144. Retracting propose*predict-yes
  21145. -->
  21146. (O2093 ^name predict-yes +)
  21147. (S1 ^operator O2093 +)
  21148. Retracting elaborate*reward*based*on*reward
  21149. -->
  21150. (R1050 ^value 1 +)
  21151. (R1 ^reward R1050 +)
  21152. Retracting elaborate*copy-dir-to-output-link
  21153. -->
  21154. (I3 ^dir R +)
  21155. Retracting rl*prefer*rvt*predict-no*H0*4
  21156. -->
  21157. (S1 ^operator O2094 = 0.3397891653686922)
  21158. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  21159. -->
  21160. (S1 ^operator O2094 = 0.660238287807148)
  21161. Retracting rl*prefer*rvt*predict-yes*H0*3
  21162. -->
  21163. (S1 ^operator O2093 = 0.3376989200650307)
  21164. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  21165. -->
  21166. (S1 ^operator O2093 = -0.1070236389116304)
  21167. =>WM: (14766: S1 ^operator O2096 +)
  21168. =>WM: (14765: S1 ^operator O2095 +)
  21169. =>WM: (14764: O2096 ^name predict-no)
  21170. =>WM: (14763: O2095 ^name predict-yes)
  21171. =>WM: (14762: R1051 ^value 1)
  21172. =>WM: (14761: R1 ^reward R1051)
  21173. <=WM: (14752: S1 ^operator O2093 +)
  21174. <=WM: (14753: S1 ^operator O2094 +)
  21175. <=WM: (14754: S1 ^operator O2094)
  21176. <=WM: (14747: R1 ^reward R1050)
  21177. <=WM: (14750: O2094 ^name predict-no)
  21178. <=WM: (14749: O2093 ^name predict-yes)
  21179. <=WM: (14748: R1050 ^value 1)
  21180. --- Inner Elaboration Phase, active level 1 (S1) ---
  21181. Firing prefer*rvt*predict-yes*H0
  21182. -->
  21183. Firing rl*prefer*rvt*predict-yes*H0*3
  21184. -->
  21185. (S1 ^operator O2095 = 0.3376989200650307)
  21186. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  21187. -->
  21188. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21189. -->
  21190. (S1 ^operator O2095 = -0.1028953566115423)
  21191. Firing prefer*rvt*predict-no*H0
  21192. -->
  21193. Firing rl*prefer*rvt*predict-no*H0*4
  21194. -->
  21195. (S1 ^operator O2096 = 0.3397891653686922)
  21196. Firing prefer*rvt*predict-no*H0*4*v1*H1
  21197. -->
  21198. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21199. -->
  21200. (S1 ^operator O2096 = 0.6601837022541405)
  21201. inner elaboration loop at bottom goal.
  21202. Retracting rl*prefer*rvt*predict-no*H0*4
  21203. -->
  21204. (S1 ^operator O2094 = 0.3397891653686922)
  21205. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21206. -->
  21207. (S1 ^operator O2094 = 0.6601837022541405)
  21208. Retracting rl*prefer*rvt*predict-yes*H0*3
  21209. -->
  21210. (S1 ^operator O2093 = 0.3376989200650307)
  21211. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21212. -->
  21213. (S1 ^operator O2093 = -0.1028953566115423)
  21214. --- END Proposal Phase ---
  21215. --- Decision Phase ---
  21216. RL update rl*prefer*rvt*predict-no*H0*4 0.570274 -0.230484 0.339789 -> 0.570271 -0.230484 0.339787(R,m,v=1,0.882022,0.104647)
  21217. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429755 0.230483 0.660238 -> 0.429752 0.230483 0.660236(R,m,v=1,1,0)
  21218. =>WM: (14767: S1 ^operator O2096)
  21219. 1048: O: O2096 (predict-no)
  21220. --- END Decision Phase ---
  21221. --- Application Phase ---
  21222. --- Firing Productions (PE) For State At Depth 1 ---
  21223. --- Inner Elaboration Phase, active level 1 (S1) ---
  21224. Firing apply*operator
  21225. -->
  21226. (I3 ^predict-no N1048 + :O )
  21227. Firing apply*operator*complete
  21228. -->
  21229. (I3 ^predict-no N1047 - :O )
  21230. inner elaboration loop at bottom goal.
  21231. --- Change Working Memory (PE) ---
  21232. =>WM: (14768: I3 ^predict-no N1048)
  21233. <=WM: (14756: N1047 ^status complete)
  21234. <=WM: (14755: I3 ^predict-no N1047)
  21235. --- Firing Productions (IE) For State At Depth 1 ---
  21236. --- Inner Elaboration Phase, active level 1 (S1) ---
  21237. Firing monitor*world
  21238. -->
  21239. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21240. --- Change Working Memory (IE) ---
  21241. --- END Application Phase ---
  21242. --- Output Phase ---
  21243. ENV: Agent did: predict-no for direction R in state State-B
  21244. In State-B moving R
  21245. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21246. predict error 0
  21247. dir: dir isR
  21248. --- END Output Phase ---
  21249. \-/--- Input Phase ---
  21250. =>WM: (14772: I2 ^dir R)
  21251. =>WM: (14771: I2 ^reward 1)
  21252. =>WM: (14770: I2 ^see 0)
  21253. =>WM: (14769: N1048 ^status complete)
  21254. <=WM: (14759: I2 ^dir R)
  21255. <=WM: (14758: I2 ^reward 1)
  21256. <=WM: (14757: I2 ^see 0)
  21257. =>WM: (14773: I2 ^level-1 R0-root)
  21258. <=WM: (14760: I2 ^level-1 R0-root)
  21259. --- END Input Phase ---
  21260. --- Proposal Phase ---
  21261. --- Inner Elaboration Phase, active level 1 (S1) ---
  21262. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21263. -->
  21264. (S1 ^operator O2096 = 0.6601837022541405)
  21265. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21266. -->
  21267. (S1 ^operator O2095 = -0.1028953566115423)
  21268. Firing prefer*rvt*predict-no*H0*4*v1*H1
  21269. -->
  21270. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  21271. -->
  21272. Firing elaborate*copy-see-to-output-link
  21273. -->
  21274. (I3 ^see 0 +)
  21275. Firing elaborate*reward*based*on*reward
  21276. -->
  21277. (R1052 ^value 1 +)
  21278. (R1 ^reward R1052 +)
  21279. Firing propose*predict-yes
  21280. -->
  21281. (O2097 ^name predict-yes +)
  21282. (S1 ^operator O2097 +)
  21283. Firing propose*predict-no
  21284. -->
  21285. (O2098 ^name predict-no +)
  21286. (S1 ^operator O2098 +)
  21287. Firing rl*prefer*rvt*predict-no*H0*4
  21288. -->
  21289. (S1 ^operator O2096 = 0.339786944878795)
  21290. Firing rl*prefer*rvt*predict-yes*H0*3
  21291. -->
  21292. (S1 ^operator O2095 = 0.3376989200650307)
  21293. Firing prefer*rvt*predict-yes*H0
  21294. -->
  21295. Firing prefer*rvt*predict-no*H0
  21296. -->
  21297. Firing elaborate*copy-dir-to-output-link
  21298. -->
  21299. (I3 ^dir R +)
  21300. inner elaboration loop at bottom goal.
  21301. Retracting elaborate*copy-see-to-output-link
  21302. -->
  21303. (I3 ^see 0 +)
  21304. Retracting propose*predict-no
  21305. -->
  21306. (O2096 ^name predict-no +)
  21307. (S1 ^operator O2096 +)
  21308. Retracting propose*predict-yes
  21309. -->
  21310. (O2095 ^name predict-yes +)
  21311. (S1 ^operator O2095 +)
  21312. Retracting elaborate*reward*based*on*reward
  21313. -->
  21314. (R1051 ^value 1 +)
  21315. (R1 ^reward R1051 +)
  21316. Retracting elaborate*copy-dir-to-output-link
  21317. -->
  21318. (I3 ^dir R +)
  21319. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21320. -->
  21321. (S1 ^operator O2096 = 0.6601837022541405)
  21322. Retracting rl*prefer*rvt*predict-no*H0*4
  21323. -->
  21324. (S1 ^operator O2096 = 0.339786944878795)
  21325. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21326. -->
  21327. (S1 ^operator O2095 = -0.1028953566115423)
  21328. Retracting rl*prefer*rvt*predict-yes*H0*3
  21329. -->
  21330. (S1 ^operator O2095 = 0.3376989200650307)
  21331. =>WM: (14779: S1 ^operator O2098 +)
  21332. =>WM: (14778: S1 ^operator O2097 +)
  21333. =>WM: (14777: O2098 ^name predict-no)
  21334. =>WM: (14776: O2097 ^name predict-yes)
  21335. =>WM: (14775: R1052 ^value 1)
  21336. =>WM: (14774: R1 ^reward R1052)
  21337. <=WM: (14765: S1 ^operator O2095 +)
  21338. <=WM: (14766: S1 ^operator O2096 +)
  21339. <=WM: (14767: S1 ^operator O2096)
  21340. <=WM: (14761: R1 ^reward R1051)
  21341. <=WM: (14764: O2096 ^name predict-no)
  21342. <=WM: (14763: O2095 ^name predict-yes)
  21343. <=WM: (14762: R1051 ^value 1)
  21344. --- Inner Elaboration Phase, active level 1 (S1) ---
  21345. Firing prefer*rvt*predict-yes*H0
  21346. -->
  21347. Firing rl*prefer*rvt*predict-yes*H0*3
  21348. -->
  21349. (S1 ^operator O2097 = 0.3376989200650307)
  21350. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  21351. -->
  21352. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21353. -->
  21354. (S1 ^operator O2097 = -0.1028953566115423)
  21355. Firing prefer*rvt*predict-no*H0
  21356. -->
  21357. Firing rl*prefer*rvt*predict-no*H0*4
  21358. -->
  21359. (S1 ^operator O2098 = 0.339786944878795)
  21360. Firing prefer*rvt*predict-no*H0*4*v1*H1
  21361. -->
  21362. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21363. -->
  21364. (S1 ^operator O2098 = 0.6601837022541405)
  21365. inner elaboration loop at bottom goal.
  21366. Retracting rl*prefer*rvt*predict-no*H0*4
  21367. -->
  21368. (S1 ^operator O2096 = 0.339786944878795)
  21369. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21370. -->
  21371. (S1 ^operator O2096 = 0.6601837022541405)
  21372. Retracting rl*prefer*rvt*predict-yes*H0*3
  21373. -->
  21374. (S1 ^operator O2095 = 0.3376989200650307)
  21375. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21376. -->
  21377. (S1 ^operator O2095 = -0.1028953566115423)
  21378. --- END Proposal Phase ---
  21379. --- Decision Phase ---
  21380. RL update rl*prefer*rvt*predict-no*H0*4 0.570271 -0.230484 0.339787 -> 0.570274 -0.230484 0.339789(R,m,v=1,0.882682,0.104137)
  21381. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429698 0.230486 0.660184 -> 0.429701 0.230486 0.660186(R,m,v=1,1,0)
  21382. =>WM: (14780: S1 ^operator O2098)
  21383. 1049: O: O2098 (predict-no)
  21384. --- END Decision Phase ---
  21385. --- Application Phase ---
  21386. --- Firing Productions (PE) For State At Depth 1 ---
  21387. --- Inner Elaboration Phase, active level 1 (S1) ---
  21388. Firing apply*operator
  21389. -->
  21390. (I3 ^predict-no N1049 + :O )
  21391. Firing apply*operator*complete
  21392. -->
  21393. (I3 ^predict-no N1048 - :O )
  21394. inner elaboration loop at bottom goal.
  21395. --- Change Working Memory (PE) ---
  21396. =>WM: (14781: I3 ^predict-no N1049)
  21397. <=WM: (14769: N1048 ^status complete)
  21398. <=WM: (14768: I3 ^predict-no N1048)
  21399. --- Firing Productions (IE) For State At Depth 1 ---
  21400. --- Inner Elaboration Phase, active level 1 (S1) ---
  21401. Firing monitor*world
  21402. -->
  21403. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21404. --- Change Working Memory (IE) ---
  21405. --- END Application Phase ---
  21406. --- Output Phase ---
  21407. ENV: Agent did: predict-no for direction R in state State-B
  21408. In State-B moving R
  21409. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21410. predict error 0
  21411. dir: dir isU
  21412. --- END Output Phase ---
  21413. |\---- Input Phase ---
  21414. =>WM: (14785: I2 ^dir U)
  21415. =>WM: (14784: I2 ^reward 1)
  21416. =>WM: (14783: I2 ^see 0)
  21417. =>WM: (14782: N1049 ^status complete)
  21418. <=WM: (14772: I2 ^dir R)
  21419. <=WM: (14771: I2 ^reward 1)
  21420. <=WM: (14770: I2 ^see 0)
  21421. =>WM: (14786: I2 ^level-1 R0-root)
  21422. <=WM: (14773: I2 ^level-1 R0-root)
  21423. --- END Input Phase ---
  21424. --- Proposal Phase ---
  21425. --- Inner Elaboration Phase, active level 1 (S1) ---
  21426. Firing elaborate*copy-see-to-output-link
  21427. -->
  21428. (I3 ^see 0 +)
  21429. Firing elaborate*reward*based*on*reward
  21430. -->
  21431. (R1053 ^value 1 +)
  21432. (R1 ^reward R1053 +)
  21433. Firing propose*predict-yes
  21434. -->
  21435. (O2099 ^name predict-yes +)
  21436. (S1 ^operator O2099 +)
  21437. Firing propose*predict-no
  21438. -->
  21439. (O2100 ^name predict-no +)
  21440. (S1 ^operator O2100 +)
  21441. Firing rl*prefer*rvt*predict-no*H0*2
  21442. -->
  21443. (S1 ^operator O2098 = 1.)
  21444. Firing rl*prefer*rvt*predict-yes*H0*1
  21445. -->
  21446. (S1 ^operator O2097 = 0.)
  21447. Firing prefer*rvt*predict-yes*H0
  21448. -->
  21449. Firing prefer*rvt*predict-no*H0
  21450. -->
  21451. Firing elaborate*copy-dir-to-output-link
  21452. -->
  21453. (I3 ^dir U +)
  21454. inner elaboration loop at bottom goal.
  21455. Retracting elaborate*copy-see-to-output-link
  21456. -->
  21457. (I3 ^see 0 +)
  21458. Retracting propose*predict-no
  21459. -->
  21460. (O2098 ^name predict-no +)
  21461. (S1 ^operator O2098 +)
  21462. Retracting propose*predict-yes
  21463. -->
  21464. (O2097 ^name predict-yes +)
  21465. (S1 ^operator O2097 +)
  21466. Retracting elaborate*reward*based*on*reward
  21467. -->
  21468. (R1052 ^value 1 +)
  21469. (R1 ^reward R1052 +)
  21470. Retracting elaborate*copy-dir-to-output-link
  21471. -->
  21472. (I3 ^dir R +)
  21473. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  21474. -->
  21475. (S1 ^operator O2098 = 0.6601864554275962)
  21476. Retracting rl*prefer*rvt*predict-no*H0*4
  21477. -->
  21478. (S1 ^operator O2098 = 0.3397893168714253)
  21479. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  21480. -->
  21481. (S1 ^operator O2097 = -0.1028953566115423)
  21482. Retracting rl*prefer*rvt*predict-yes*H0*3
  21483. -->
  21484. (S1 ^operator O2097 = 0.3376989200650307)
  21485. =>WM: (14793: S1 ^operator O2100 +)
  21486. =>WM: (14792: S1 ^operator O2099 +)
  21487. =>WM: (14791: I3 ^dir U)
  21488. =>WM: (14790: O2100 ^name predict-no)
  21489. =>WM: (14789: O2099 ^name predict-yes)
  21490. =>WM: (14788: R1053 ^value 1)
  21491. =>WM: (14787: R1 ^reward R1053)
  21492. <=WM: (14778: S1 ^operator O2097 +)
  21493. <=WM: (14779: S1 ^operator O2098 +)
  21494. <=WM: (14780: S1 ^operator O2098)
  21495. <=WM: (14751: I3 ^dir R)
  21496. <=WM: (14774: R1 ^reward R1052)
  21497. <=WM: (14777: O2098 ^name predict-no)
  21498. <=WM: (14776: O2097 ^name predict-yes)
  21499. <=WM: (14775: R1052 ^value 1)
  21500. --- Inner Elaboration Phase, active level 1 (S1) ---
  21501. Firing prefer*rvt*predict-yes*H0
  21502. -->
  21503. Firing rl*prefer*rvt*predict-yes*H0*1
  21504. -->
  21505. (S1 ^operator O2099 = 0.)
  21506. Firing prefer*rvt*predict-no*H0
  21507. -->
  21508. Firing rl*prefer*rvt*predict-no*H0*2
  21509. -->
  21510. (S1 ^operator O2100 = 1.)
  21511. inner elaboration loop at bottom goal.
  21512. Retracting rl*prefer*rvt*predict-no*H0*2
  21513. -->
  21514. (S1 ^operator O2098 = 1.)
  21515. Retracting rl*prefer*rvt*predict-yes*H0*1
  21516. -->
  21517. (S1 ^operator O2097 = 0.)
  21518. --- END Proposal Phase ---
  21519. --- Decision Phase ---
  21520. RL update rl*prefer*rvt*predict-no*H0*4 0.570274 -0.230484 0.339789 -> 0.570276 -0.230485 0.339791(R,m,v=1,0.883333,0.103631)
  21521. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429701 0.230486 0.660186 -> 0.429703 0.230485 0.660189(R,m,v=1,1,0)
  21522. =>WM: (14794: S1 ^operator O2100)
  21523. 1050: O: O2100 (predict-no)
  21524. --- END Decision Phase ---
  21525. --- Application Phase ---
  21526. --- Firing Productions (PE) For State At Depth 1 ---
  21527. --- Inner Elaboration Phase, active level 1 (S1) ---
  21528. Firing apply*operator
  21529. -->
  21530. (I3 ^predict-no N1050 + :O )
  21531. Firing apply*operator*complete
  21532. -->
  21533. (I3 ^predict-no N1049 - :O )
  21534. inner elaboration loop at bottom goal.
  21535. --- Change Working Memory (PE) ---
  21536. =>WM: (14795: I3 ^predict-no N1050)
  21537. <=WM: (14782: N1049 ^status complete)
  21538. <=WM: (14781: I3 ^predict-no N1049)
  21539. --- Firing Productions (IE) For State At Depth 1 ---
  21540. --- Inner Elaboration Phase, active level 1 (S1) ---
  21541. Firing monitor*world
  21542. -->
  21543. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21544. --- Change Working Memory (IE) ---
  21545. --- END Application Phase ---
  21546. --- Output Phase ---
  21547. ENV: Agent did: predict-no for direction U in state State-B
  21548. In State-B moving U
  21549. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21550. predict error 0
  21551. dir: dir isL
  21552. --- END Output Phase ---
  21553. /|\--- Input Phase ---
  21554. =>WM: (14799: I2 ^dir L)
  21555. =>WM: (14798: I2 ^reward 1)
  21556. =>WM: (14797: I2 ^see 0)
  21557. =>WM: (14796: N1050 ^status complete)
  21558. <=WM: (14785: I2 ^dir U)
  21559. <=WM: (14784: I2 ^reward 1)
  21560. <=WM: (14783: I2 ^see 0)
  21561. =>WM: (14800: I2 ^level-1 R0-root)
  21562. <=WM: (14786: I2 ^level-1 R0-root)
  21563. --- END Input Phase ---
  21564. --- Proposal Phase ---
  21565. --- Inner Elaboration Phase, active level 1 (S1) ---
  21566. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  21567. -->
  21568. (S1 ^operator O2099 = 0.7358923420605031)
  21569. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21570. -->
  21571. Firing elaborate*copy-see-to-output-link
  21572. -->
  21573. (I3 ^see 0 +)
  21574. Firing elaborate*reward*based*on*reward
  21575. -->
  21576. (R1054 ^value 1 +)
  21577. (R1 ^reward R1054 +)
  21578. Firing propose*predict-yes
  21579. -->
  21580. (O2101 ^name predict-yes +)
  21581. (S1 ^operator O2101 +)
  21582. Firing propose*predict-no
  21583. -->
  21584. (O2102 ^name predict-no +)
  21585. (S1 ^operator O2102 +)
  21586. Firing rl*prefer*rvt*predict-no*H0*6
  21587. -->
  21588. (S1 ^operator O2100 = 0.9327626143515492)
  21589. Firing rl*prefer*rvt*predict-yes*H0*5
  21590. -->
  21591. (S1 ^operator O2099 = 0.2640194889627474)
  21592. Firing prefer*rvt*predict-yes*H0
  21593. -->
  21594. Firing prefer*rvt*predict-no*H0
  21595. -->
  21596. Firing elaborate*copy-dir-to-output-link
  21597. -->
  21598. (I3 ^dir L +)
  21599. inner elaboration loop at bottom goal.
  21600. Retracting elaborate*copy-see-to-output-link
  21601. -->
  21602. (I3 ^see 0 +)
  21603. Retracting propose*predict-no
  21604. -->
  21605. (O2100 ^name predict-no +)
  21606. (S1 ^operator O2100 +)
  21607. Retracting propose*predict-yes
  21608. -->
  21609. (O2099 ^name predict-yes +)
  21610. (S1 ^operator O2099 +)
  21611. Retracting elaborate*reward*based*on*reward
  21612. -->
  21613. (R1053 ^value 1 +)
  21614. (R1 ^reward R1053 +)
  21615. Retracting elaborate*copy-dir-to-output-link
  21616. -->
  21617. (I3 ^dir U +)
  21618. Retracting rl*prefer*rvt*predict-no*H0*2
  21619. -->
  21620. (S1 ^operator O2100 = 1.)
  21621. Retracting rl*prefer*rvt*predict-yes*H0*1
  21622. -->
  21623. (S1 ^operator O2099 = 0.)
  21624. =>WM: (14807: S1 ^operator O2102 +)
  21625. =>WM: (14806: S1 ^operator O2101 +)
  21626. =>WM: (14805: I3 ^dir L)
  21627. =>WM: (14804: O2102 ^name predict-no)
  21628. =>WM: (14803: O2101 ^name predict-yes)
  21629. =>WM: (14802: R1054 ^value 1)
  21630. =>WM: (14801: R1 ^reward R1054)
  21631. <=WM: (14792: S1 ^operator O2099 +)
  21632. <=WM: (14793: S1 ^operator O2100 +)
  21633. <=WM: (14794: S1 ^operator O2100)
  21634. <=WM: (14791: I3 ^dir U)
  21635. <=WM: (14787: R1 ^reward R1053)
  21636. <=WM: (14790: O2100 ^name predict-no)
  21637. <=WM: (14789: O2099 ^name predict-yes)
  21638. <=WM: (14788: R1053 ^value 1)
  21639. --- Inner Elaboration Phase, active level 1 (S1) ---
  21640. Firing prefer*rvt*predict-yes*H0
  21641. -->
  21642. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  21643. -->
  21644. (S1 ^operator O2101 = 0.7358923420605031)
  21645. Firing rl*prefer*rvt*predict-yes*H0*5
  21646. -->
  21647. (S1 ^operator O2101 = 0.2640194889627474)
  21648. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21649. -->
  21650. Firing prefer*rvt*predict-no*H0
  21651. -->
  21652. Firing rl*prefer*rvt*predict-no*H0*6
  21653. -->
  21654. (S1 ^operator O2102 = 0.9327626143515492)
  21655. inner elaboration loop at bottom goal.
  21656. Retracting rl*prefer*rvt*predict-no*H0*6
  21657. -->
  21658. (S1 ^operator O2100 = 0.9327626143515492)
  21659. Retracting rl*prefer*rvt*predict-yes*H0*5
  21660. -->
  21661. (S1 ^operator O2099 = 0.2640194889627474)
  21662. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  21663. -->
  21664. (S1 ^operator O2099 = 0.7358923420605031)
  21665. --- END Proposal Phase ---
  21666. --- Decision Phase ---
  21667. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21668. =>WM: (14808: S1 ^operator O2101)
  21669. 1051: O: O2101 (predict-yes)
  21670. --- END Decision Phase ---
  21671. --- Application Phase ---
  21672. --- Firing Productions (PE) For State At Depth 1 ---
  21673. --- Inner Elaboration Phase, active level 1 (S1) ---
  21674. Firing apply*operator
  21675. -->
  21676. (I3 ^predict-yes N1051 + :O )
  21677. Firing apply*operator*complete
  21678. -->
  21679. (I3 ^predict-no N1050 - :O )
  21680. inner elaboration loop at bottom goal.
  21681. --- Change Working Memory (PE) ---
  21682. =>WM: (14809: I3 ^predict-yes N1051)
  21683. <=WM: (14796: N1050 ^status complete)
  21684. <=WM: (14795: I3 ^predict-no N1050)
  21685. --- Firing Productions (IE) For State At Depth 1 ---
  21686. --- Inner Elaboration Phase, active level 1 (S1) ---
  21687. Firing monitor*world
  21688. -->
  21689. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21690. --- Change Working Memory (IE) ---
  21691. --- END Application Phase ---
  21692. --- Output Phase ---
  21693. ENV: Agent did: predict-yes for direction L in state State-B
  21694. In State-B moving L
  21695. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  21696. predict error 0
  21697. dir: dir isU
  21698. --- END Output Phase ---
  21699. ---- Input Phase ---
  21700. =>WM: (14813: I2 ^dir U)
  21701. =>WM: (14812: I2 ^reward 1)
  21702. =>WM: (14811: I2 ^see 1)
  21703. =>WM: (14810: N1051 ^status complete)
  21704. <=WM: (14799: I2 ^dir L)
  21705. <=WM: (14798: I2 ^reward 1)
  21706. <=WM: (14797: I2 ^see 0)
  21707. =>WM: (14814: I2 ^level-1 L1-root)
  21708. <=WM: (14800: I2 ^level-1 R0-root)
  21709. --- END Input Phase ---
  21710. --- Proposal Phase ---
  21711. --- Inner Elaboration Phase, active level 1 (S1) ---
  21712. Firing elaborate*copy-see-to-output-link
  21713. -->
  21714. (I3 ^see 1 +)
  21715. Firing elaborate*reward*based*on*reward
  21716. -->
  21717. (R1055 ^value 1 +)
  21718. (R1 ^reward R1055 +)
  21719. Firing propose*predict-yes
  21720. -->
  21721. (O2103 ^name predict-yes +)
  21722. (S1 ^operator O2103 +)
  21723. Firing propose*predict-no
  21724. -->
  21725. (O2104 ^name predict-no +)
  21726. (S1 ^operator O2104 +)
  21727. Firing rl*prefer*rvt*predict-no*H0*2
  21728. -->
  21729. (S1 ^operator O2102 = 1.)
  21730. Firing rl*prefer*rvt*predict-yes*H0*1
  21731. -->
  21732. (S1 ^operator O2101 = 0.)
  21733. Firing prefer*rvt*predict-yes*H0
  21734. -->
  21735. Firing prefer*rvt*predict-no*H0
  21736. -->
  21737. Firing elaborate*copy-dir-to-output-link
  21738. -->
  21739. (I3 ^dir U +)
  21740. inner elaboration loop at bottom goal.
  21741. Retracting elaborate*copy-see-to-output-link
  21742. -->
  21743. (I3 ^see 0 +)
  21744. Retracting propose*predict-no
  21745. -->
  21746. (O2102 ^name predict-no +)
  21747. (S1 ^operator O2102 +)
  21748. Retracting propose*predict-yes
  21749. -->
  21750. (O2101 ^name predict-yes +)
  21751. (S1 ^operator O2101 +)
  21752. Retracting elaborate*reward*based*on*reward
  21753. -->
  21754. (R1054 ^value 1 +)
  21755. (R1 ^reward R1054 +)
  21756. Retracting elaborate*copy-dir-to-output-link
  21757. -->
  21758. (I3 ^dir L +)
  21759. Retracting rl*prefer*rvt*predict-no*H0*6
  21760. -->
  21761. (S1 ^operator O2102 = 0.9327626143515492)
  21762. Retracting rl*prefer*rvt*predict-yes*H0*5
  21763. -->
  21764. (S1 ^operator O2101 = 0.2640194889627474)
  21765. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  21766. -->
  21767. (S1 ^operator O2101 = 0.7358923420605031)
  21768. =>WM: (14822: S1 ^operator O2104 +)
  21769. =>WM: (14821: S1 ^operator O2103 +)
  21770. =>WM: (14820: I3 ^dir U)
  21771. =>WM: (14819: O2104 ^name predict-no)
  21772. =>WM: (14818: O2103 ^name predict-yes)
  21773. =>WM: (14817: R1055 ^value 1)
  21774. =>WM: (14816: R1 ^reward R1055)
  21775. =>WM: (14815: I3 ^see 1)
  21776. <=WM: (14806: S1 ^operator O2101 +)
  21777. <=WM: (14808: S1 ^operator O2101)
  21778. <=WM: (14807: S1 ^operator O2102 +)
  21779. <=WM: (14805: I3 ^dir L)
  21780. <=WM: (14801: R1 ^reward R1054)
  21781. <=WM: (14733: I3 ^see 0)
  21782. <=WM: (14804: O2102 ^name predict-no)
  21783. <=WM: (14803: O2101 ^name predict-yes)
  21784. <=WM: (14802: R1054 ^value 1)
  21785. --- Inner Elaboration Phase, active level 1 (S1) ---
  21786. Firing prefer*rvt*predict-yes*H0
  21787. -->
  21788. Firing rl*prefer*rvt*predict-yes*H0*1
  21789. -->
  21790. (S1 ^operator O2103 = 0.)
  21791. Firing prefer*rvt*predict-no*H0
  21792. -->
  21793. Firing rl*prefer*rvt*predict-no*H0*2
  21794. -->
  21795. (S1 ^operator O2104 = 1.)
  21796. inner elaboration loop at bottom goal.
  21797. Retracting rl*prefer*rvt*predict-no*H0*2
  21798. -->
  21799. (S1 ^operator O2102 = 1.)
  21800. Retracting rl*prefer*rvt*predict-yes*H0*1
  21801. -->
  21802. (S1 ^operator O2101 = 0.)
  21803. --- END Proposal Phase ---
  21804. --- Decision Phase ---
  21805. RL update rl*prefer*rvt*predict-yes*H0*5 0.554405 -0.290386 0.264019 -> 0.554412 -0.290386 0.264027(R,m,v=1,0.881081,0.105347)
  21806. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445508 0.290384 0.735892 -> 0.445516 0.290384 0.735901(R,m,v=1,1,0)
  21807. =>WM: (14823: S1 ^operator O2104)
  21808. 1052: O: O2104 (predict-no)
  21809. --- END Decision Phase ---
  21810. --- Application Phase ---
  21811. --- Firing Productions (PE) For State At Depth 1 ---
  21812. --- Inner Elaboration Phase, active level 1 (S1) ---
  21813. Firing apply*operator
  21814. -->
  21815. (I3 ^predict-no N1052 + :O )
  21816. Firing apply*operator*complete
  21817. -->
  21818. (I3 ^predict-yes N1051 - :O )
  21819. inner elaboration loop at bottom goal.
  21820. --- Change Working Memory (PE) ---
  21821. =>WM: (14824: I3 ^predict-no N1052)
  21822. <=WM: (14810: N1051 ^status complete)
  21823. <=WM: (14809: I3 ^predict-yes N1051)
  21824. --- Firing Productions (IE) For State At Depth 1 ---
  21825. --- Inner Elaboration Phase, active level 1 (S1) ---
  21826. Firing monitor*world
  21827. -->
  21828. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21829. --- Change Working Memory (IE) ---
  21830. --- END Application Phase ---
  21831. --- Output Phase ---
  21832. ENV: Agent did: predict-no for direction U in state State-A
  21833. In State-A moving U
  21834. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21835. predict error 0
  21836. dir: dir isL
  21837. --- END Output Phase ---
  21838. /|\--- Input Phase ---
  21839. =>WM: (14828: I2 ^dir L)
  21840. =>WM: (14827: I2 ^reward 1)
  21841. =>WM: (14826: I2 ^see 0)
  21842. =>WM: (14825: N1052 ^status complete)
  21843. <=WM: (14813: I2 ^dir U)
  21844. <=WM: (14812: I2 ^reward 1)
  21845. <=WM: (14811: I2 ^see 1)
  21846. =>WM: (14829: I2 ^level-1 L1-root)
  21847. <=WM: (14814: I2 ^level-1 L1-root)
  21848. --- END Input Phase ---
  21849. --- Proposal Phase ---
  21850. --- Inner Elaboration Phase, active level 1 (S1) ---
  21851. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21852. -->
  21853. (S1 ^operator O2103 = -0.181727099742844)
  21854. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21855. -->
  21856. Firing elaborate*copy-see-to-output-link
  21857. -->
  21858. (I3 ^see 0 +)
  21859. Firing elaborate*reward*based*on*reward
  21860. -->
  21861. (R1056 ^value 1 +)
  21862. (R1 ^reward R1056 +)
  21863. Firing propose*predict-yes
  21864. -->
  21865. (O2105 ^name predict-yes +)
  21866. (S1 ^operator O2105 +)
  21867. Firing propose*predict-no
  21868. -->
  21869. (O2106 ^name predict-no +)
  21870. (S1 ^operator O2106 +)
  21871. Firing rl*prefer*rvt*predict-no*H0*6
  21872. -->
  21873. (S1 ^operator O2104 = 0.9327626143515492)
  21874. Firing rl*prefer*rvt*predict-yes*H0*5
  21875. -->
  21876. (S1 ^operator O2103 = 0.2640265760956406)
  21877. Firing prefer*rvt*predict-yes*H0
  21878. -->
  21879. Firing prefer*rvt*predict-no*H0
  21880. -->
  21881. Firing elaborate*copy-dir-to-output-link
  21882. -->
  21883. (I3 ^dir L +)
  21884. inner elaboration loop at bottom goal.
  21885. Retracting elaborate*copy-see-to-output-link
  21886. -->
  21887. (I3 ^see 1 +)
  21888. Retracting propose*predict-no
  21889. -->
  21890. (O2104 ^name predict-no +)
  21891. (S1 ^operator O2104 +)
  21892. Retracting propose*predict-yes
  21893. -->
  21894. (O2103 ^name predict-yes +)
  21895. (S1 ^operator O2103 +)
  21896. Retracting elaborate*reward*based*on*reward
  21897. -->
  21898. (R1055 ^value 1 +)
  21899. (R1 ^reward R1055 +)
  21900. Retracting elaborate*copy-dir-to-output-link
  21901. -->
  21902. (I3 ^dir U +)
  21903. Retracting rl*prefer*rvt*predict-no*H0*2
  21904. -->
  21905. (S1 ^operator O2104 = 1.)
  21906. Retracting rl*prefer*rvt*predict-yes*H0*1
  21907. -->
  21908. (S1 ^operator O2103 = 0.)
  21909. =>WM: (14837: S1 ^operator O2106 +)
  21910. =>WM: (14836: S1 ^operator O2105 +)
  21911. =>WM: (14835: I3 ^dir L)
  21912. =>WM: (14834: O2106 ^name predict-no)
  21913. =>WM: (14833: O2105 ^name predict-yes)
  21914. =>WM: (14832: R1056 ^value 1)
  21915. =>WM: (14831: R1 ^reward R1056)
  21916. =>WM: (14830: I3 ^see 0)
  21917. <=WM: (14821: S1 ^operator O2103 +)
  21918. <=WM: (14822: S1 ^operator O2104 +)
  21919. <=WM: (14823: S1 ^operator O2104)
  21920. <=WM: (14820: I3 ^dir U)
  21921. <=WM: (14816: R1 ^reward R1055)
  21922. <=WM: (14815: I3 ^see 1)
  21923. <=WM: (14819: O2104 ^name predict-no)
  21924. <=WM: (14818: O2103 ^name predict-yes)
  21925. <=WM: (14817: R1055 ^value 1)
  21926. --- Inner Elaboration Phase, active level 1 (S1) ---
  21927. Firing prefer*rvt*predict-yes*H0
  21928. -->
  21929. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21930. -->
  21931. (S1 ^operator O2105 = -0.181727099742844)
  21932. Firing rl*prefer*rvt*predict-yes*H0*5
  21933. -->
  21934. (S1 ^operator O2105 = 0.2640265760956406)
  21935. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  21936. -->
  21937. Firing prefer*rvt*predict-no*H0
  21938. -->
  21939. Firing rl*prefer*rvt*predict-no*H0*6
  21940. -->
  21941. (S1 ^operator O2106 = 0.9327626143515492)
  21942. inner elaboration loop at bottom goal.
  21943. Retracting rl*prefer*rvt*predict-no*H0*6
  21944. -->
  21945. (S1 ^operator O2104 = 0.9327626143515492)
  21946. Retracting rl*prefer*rvt*predict-yes*H0*5
  21947. -->
  21948. (S1 ^operator O2103 = 0.2640265760956406)
  21949. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  21950. -->
  21951. (S1 ^operator O2103 = -0.181727099742844)
  21952. --- END Proposal Phase ---
  21953. --- Decision Phase ---
  21954. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21955. =>WM: (14838: S1 ^operator O2106)
  21956. 1053: O: O2106 (predict-no)
  21957. --- END Decision Phase ---
  21958. --- Application Phase ---
  21959. --- Firing Productions (PE) For State At Depth 1 ---
  21960. --- Inner Elaboration Phase, active level 1 (S1) ---
  21961. Firing apply*operator
  21962. -->
  21963. (I3 ^predict-no N1053 + :O )
  21964. Firing apply*operator*complete
  21965. -->
  21966. (I3 ^predict-no N1052 - :O )
  21967. inner elaboration loop at bottom goal.
  21968. --- Change Working Memory (PE) ---
  21969. =>WM: (14839: I3 ^predict-no N1053)
  21970. <=WM: (14825: N1052 ^status complete)
  21971. <=WM: (14824: I3 ^predict-no N1052)
  21972. --- Firing Productions (IE) For State At Depth 1 ---
  21973. --- Inner Elaboration Phase, active level 1 (S1) ---
  21974. Firing monitor*world
  21975. -->
  21976. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21977. --- Change Working Memory (IE) ---
  21978. --- END Application Phase ---
  21979. --- Output Phase ---
  21980. ENV: Agent did: predict-no for direction L in state State-A
  21981. In State-A moving L
  21982. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21983. predict error 0
  21984. dir: dir isU
  21985. --- END Output Phase ---
  21986. -/|--- Input Phase ---
  21987. =>WM: (14843: I2 ^dir U)
  21988. =>WM: (14842: I2 ^reward 1)
  21989. =>WM: (14841: I2 ^see 0)
  21990. =>WM: (14840: N1053 ^status complete)
  21991. <=WM: (14828: I2 ^dir L)
  21992. <=WM: (14827: I2 ^reward 1)
  21993. <=WM: (14826: I2 ^see 0)
  21994. =>WM: (14844: I2 ^level-1 L0-root)
  21995. <=WM: (14829: I2 ^level-1 L1-root)
  21996. --- END Input Phase ---
  21997. --- Proposal Phase ---
  21998. --- Inner Elaboration Phase, active level 1 (S1) ---
  21999. Firing elaborate*copy-see-to-output-link
  22000. -->
  22001. (I3 ^see 0 +)
  22002. Firing elaborate*reward*based*on*reward
  22003. -->
  22004. (R1057 ^value 1 +)
  22005. (R1 ^reward R1057 +)
  22006. Firing propose*predict-yes
  22007. -->
  22008. (O2107 ^name predict-yes +)
  22009. (S1 ^operator O2107 +)
  22010. Firing propose*predict-no
  22011. -->
  22012. (O2108 ^name predict-no +)
  22013. (S1 ^operator O2108 +)
  22014. Firing rl*prefer*rvt*predict-no*H0*2
  22015. -->
  22016. (S1 ^operator O2106 = 1.)
  22017. Firing rl*prefer*rvt*predict-yes*H0*1
  22018. -->
  22019. (S1 ^operator O2105 = 0.)
  22020. Firing prefer*rvt*predict-yes*H0
  22021. -->
  22022. Firing prefer*rvt*predict-no*H0
  22023. -->
  22024. Firing elaborate*copy-dir-to-output-link
  22025. -->
  22026. (I3 ^dir U +)
  22027. inner elaboration loop at bottom goal.
  22028. Retracting elaborate*copy-see-to-output-link
  22029. -->
  22030. (I3 ^see 0 +)
  22031. Retracting propose*predict-no
  22032. -->
  22033. (O2106 ^name predict-no +)
  22034. (S1 ^operator O2106 +)
  22035. Retracting propose*predict-yes
  22036. -->
  22037. (O2105 ^name predict-yes +)
  22038. (S1 ^operator O2105 +)
  22039. Retracting elaborate*reward*based*on*reward
  22040. -->
  22041. (R1056 ^value 1 +)
  22042. (R1 ^reward R1056 +)
  22043. Retracting elaborate*copy-dir-to-output-link
  22044. -->
  22045. (I3 ^dir L +)
  22046. Retracting rl*prefer*rvt*predict-no*H0*6
  22047. -->
  22048. (S1 ^operator O2106 = 0.9327626143515492)
  22049. Retracting rl*prefer*rvt*predict-yes*H0*5
  22050. -->
  22051. (S1 ^operator O2105 = 0.2640265760956406)
  22052. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  22053. -->
  22054. (S1 ^operator O2105 = -0.181727099742844)
  22055. =>WM: (14851: S1 ^operator O2108 +)
  22056. =>WM: (14850: S1 ^operator O2107 +)
  22057. =>WM: (14849: I3 ^dir U)
  22058. =>WM: (14848: O2108 ^name predict-no)
  22059. =>WM: (14847: O2107 ^name predict-yes)
  22060. =>WM: (14846: R1057 ^value 1)
  22061. =>WM: (14845: R1 ^reward R1057)
  22062. <=WM: (14836: S1 ^operator O2105 +)
  22063. <=WM: (14837: S1 ^operator O2106 +)
  22064. <=WM: (14838: S1 ^operator O2106)
  22065. <=WM: (14835: I3 ^dir L)
  22066. <=WM: (14831: R1 ^reward R1056)
  22067. <=WM: (14834: O2106 ^name predict-no)
  22068. <=WM: (14833: O2105 ^name predict-yes)
  22069. <=WM: (14832: R1056 ^value 1)
  22070. --- Inner Elaboration Phase, active level 1 (S1) ---
  22071. Firing prefer*rvt*predict-yes*H0
  22072. -->
  22073. Firing rl*prefer*rvt*predict-yes*H0*1
  22074. -->
  22075. (S1 ^operator O2107 = 0.)
  22076. Firing prefer*rvt*predict-no*H0
  22077. -->
  22078. Firing rl*prefer*rvt*predict-no*H0*2
  22079. -->
  22080. (S1 ^operator O2108 = 1.)
  22081. inner elaboration loop at bottom goal.
  22082. Retracting rl*prefer*rvt*predict-no*H0*2
  22083. -->
  22084. (S1 ^operator O2106 = 1.)
  22085. Retracting rl*prefer*rvt*predict-yes*H0*1
  22086. -->
  22087. (S1 ^operator O2105 = 0.)
  22088. --- END Proposal Phase ---
  22089. --- Decision Phase ---
  22090. RL update rl*prefer*rvt*predict-no*H0*6 0.932763 0 0.932763 -> 0.943865 0 0.943865(R,m,v=1,0.904459,0.0869672)
  22091. =>WM: (14852: S1 ^operator O2108)
  22092. 1054: O: O2108 (predict-no)
  22093. --- END Decision Phase ---
  22094. --- Application Phase ---
  22095. --- Firing Productions (PE) For State At Depth 1 ---
  22096. --- Inner Elaboration Phase, active level 1 (S1) ---
  22097. Firing apply*operator
  22098. -->
  22099. (I3 ^predict-no N1054 + :O )
  22100. Firing apply*operator*complete
  22101. -->
  22102. (I3 ^predict-no N1053 - :O )
  22103. inner elaboration loop at bottom goal.
  22104. --- Change Working Memory (PE) ---
  22105. =>WM: (14853: I3 ^predict-no N1054)
  22106. <=WM: (14840: N1053 ^status complete)
  22107. <=WM: (14839: I3 ^predict-no N1053)
  22108. --- Firing Productions (IE) For State At Depth 1 ---
  22109. --- Inner Elaboration Phase, active level 1 (S1) ---
  22110. Firing monitor*world
  22111. -->
  22112. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22113. --- Change Working Memory (IE) ---
  22114. --- END Application Phase ---
  22115. --- Output Phase ---
  22116. ENV: Agent did: predict-no for direction U in state State-A
  22117. In State-A moving U
  22118. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22119. predict error 0
  22120. dir: dir isU
  22121. --- END Output Phase ---
  22122. \-/--- Input Phase ---
  22123. =>WM: (14857: I2 ^dir U)
  22124. =>WM: (14856: I2 ^reward 1)
  22125. =>WM: (14855: I2 ^see 0)
  22126. =>WM: (14854: N1054 ^status complete)
  22127. <=WM: (14843: I2 ^dir U)
  22128. <=WM: (14842: I2 ^reward 1)
  22129. <=WM: (14841: I2 ^see 0)
  22130. =>WM: (14858: I2 ^level-1 L0-root)
  22131. <=WM: (14844: I2 ^level-1 L0-root)
  22132. --- END Input Phase ---
  22133. --- Proposal Phase ---
  22134. --- Inner Elaboration Phase, active level 1 (S1) ---
  22135. Firing elaborate*copy-see-to-output-link
  22136. -->
  22137. (I3 ^see 0 +)
  22138. Firing elaborate*reward*based*on*reward
  22139. -->
  22140. (R1058 ^value 1 +)
  22141. (R1 ^reward R1058 +)
  22142. Firing propose*predict-yes
  22143. -->
  22144. (O2109 ^name predict-yes +)
  22145. (S1 ^operator O2109 +)
  22146. Firing propose*predict-no
  22147. -->
  22148. (O2110 ^name predict-no +)
  22149. (S1 ^operator O2110 +)
  22150. Firing rl*prefer*rvt*predict-no*H0*2
  22151. -->
  22152. (S1 ^operator O2108 = 1.)
  22153. Firing rl*prefer*rvt*predict-yes*H0*1
  22154. -->
  22155. (S1 ^operator O2107 = 0.)
  22156. Firing prefer*rvt*predict-yes*H0
  22157. -->
  22158. Firing prefer*rvt*predict-no*H0
  22159. -->
  22160. Firing elaborate*copy-dir-to-output-link
  22161. -->
  22162. (I3 ^dir U +)
  22163. inner elaboration loop at bottom goal.
  22164. Retracting elaborate*copy-see-to-output-link
  22165. -->
  22166. (I3 ^see 0 +)
  22167. Retracting propose*predict-no
  22168. -->
  22169. (O2108 ^name predict-no +)
  22170. (S1 ^operator O2108 +)
  22171. Retracting propose*predict-yes
  22172. -->
  22173. (O2107 ^name predict-yes +)
  22174. (S1 ^operator O2107 +)
  22175. Retracting elaborate*reward*based*on*reward
  22176. -->
  22177. (R1057 ^value 1 +)
  22178. (R1 ^reward R1057 +)
  22179. Retracting elaborate*copy-dir-to-output-link
  22180. -->
  22181. (I3 ^dir U +)
  22182. Retracting rl*prefer*rvt*predict-no*H0*2
  22183. -->
  22184. (S1 ^operator O2108 = 1.)
  22185. Retracting rl*prefer*rvt*predict-yes*H0*1
  22186. -->
  22187. (S1 ^operator O2107 = 0.)
  22188. =>WM: (14864: S1 ^operator O2110 +)
  22189. =>WM: (14863: S1 ^operator O2109 +)
  22190. =>WM: (14862: O2110 ^name predict-no)
  22191. =>WM: (14861: O2109 ^name predict-yes)
  22192. =>WM: (14860: R1058 ^value 1)
  22193. =>WM: (14859: R1 ^reward R1058)
  22194. <=WM: (14850: S1 ^operator O2107 +)
  22195. <=WM: (14851: S1 ^operator O2108 +)
  22196. <=WM: (14852: S1 ^operator O2108)
  22197. <=WM: (14845: R1 ^reward R1057)
  22198. <=WM: (14848: O2108 ^name predict-no)
  22199. <=WM: (14847: O2107 ^name predict-yes)
  22200. <=WM: (14846: R1057 ^value 1)
  22201. --- Inner Elaboration Phase, active level 1 (S1) ---
  22202. Firing prefer*rvt*predict-yes*H0
  22203. -->
  22204. Firing rl*prefer*rvt*predict-yes*H0*1
  22205. -->
  22206. (S1 ^operator O2109 = 0.)
  22207. Firing prefer*rvt*predict-no*H0
  22208. -->
  22209. Firing rl*prefer*rvt*predict-no*H0*2
  22210. -->
  22211. (S1 ^operator O2110 = 1.)
  22212. inner elaboration loop at bottom goal.
  22213. Retracting rl*prefer*rvt*predict-no*H0*2
  22214. -->
  22215. (S1 ^operator O2108 = 1.)
  22216. Retracting rl*prefer*rvt*predict-yes*H0*1
  22217. -->
  22218. (S1 ^operator O2107 = 0.)
  22219. --- END Proposal Phase ---
  22220. --- Decision Phase ---
  22221. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22222. =>WM: (14865: S1 ^operator O2110)
  22223. 1055: O: O2110 (predict-no)
  22224. --- END Decision Phase ---
  22225. --- Application Phase ---
  22226. --- Firing Productions (PE) For State At Depth 1 ---
  22227. --- Inner Elaboration Phase, active level 1 (S1) ---
  22228. Firing apply*operator
  22229. -->
  22230. (I3 ^predict-no N1055 + :O )
  22231. Firing apply*operator*complete
  22232. -->
  22233. (I3 ^predict-no N1054 - :O )
  22234. inner elaboration loop at bottom goal.
  22235. --- Change Working Memory (PE) ---
  22236. =>WM: (14866: I3 ^predict-no N1055)
  22237. <=WM: (14854: N1054 ^status complete)
  22238. <=WM: (14853: I3 ^predict-no N1054)
  22239. --- Firing Productions (IE) For State At Depth 1 ---
  22240. --- Inner Elaboration Phase, active level 1 (S1) ---
  22241. Firing monitor*world
  22242. -->
  22243. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22244. --- Change Working Memory (IE) ---
  22245. --- END Application Phase ---
  22246. --- Output Phase ---
  22247. ENV: Agent did: predict-no for direction U in state State-A
  22248. In State-A moving U
  22249. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22250. predict error 0
  22251. dir: dir isL
  22252. --- END Output Phase ---
  22253. |\---- Input Phase ---
  22254. =>WM: (14870: I2 ^dir L)
  22255. =>WM: (14869: I2 ^reward 1)
  22256. =>WM: (14868: I2 ^see 0)
  22257. =>WM: (14867: N1055 ^status complete)
  22258. <=WM: (14857: I2 ^dir U)
  22259. <=WM: (14856: I2 ^reward 1)
  22260. <=WM: (14855: I2 ^see 0)
  22261. =>WM: (14871: I2 ^level-1 L0-root)
  22262. <=WM: (14858: I2 ^level-1 L0-root)
  22263. --- END Input Phase ---
  22264. --- Proposal Phase ---
  22265. --- Inner Elaboration Phase, active level 1 (S1) ---
  22266. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  22267. -->
  22268. (S1 ^operator O2109 = -0.1386470047172653)
  22269. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22270. -->
  22271. Firing elaborate*copy-see-to-output-link
  22272. -->
  22273. (I3 ^see 0 +)
  22274. Firing elaborate*reward*based*on*reward
  22275. -->
  22276. (R1059 ^value 1 +)
  22277. (R1 ^reward R1059 +)
  22278. Firing propose*predict-yes
  22279. -->
  22280. (O2111 ^name predict-yes +)
  22281. (S1 ^operator O2111 +)
  22282. Firing propose*predict-no
  22283. -->
  22284. (O2112 ^name predict-no +)
  22285. (S1 ^operator O2112 +)
  22286. Firing rl*prefer*rvt*predict-no*H0*6
  22287. -->
  22288. (S1 ^operator O2110 = 0.9438647703421585)
  22289. Firing rl*prefer*rvt*predict-yes*H0*5
  22290. -->
  22291. (S1 ^operator O2109 = 0.2640265760956406)
  22292. Firing prefer*rvt*predict-yes*H0
  22293. -->
  22294. Firing prefer*rvt*predict-no*H0
  22295. -->
  22296. Firing elaborate*copy-dir-to-output-link
  22297. -->
  22298. (I3 ^dir L +)
  22299. inner elaboration loop at bottom goal.
  22300. Retracting elaborate*copy-see-to-output-link
  22301. -->
  22302. (I3 ^see 0 +)
  22303. Retracting propose*predict-no
  22304. -->
  22305. (O2110 ^name predict-no +)
  22306. (S1 ^operator O2110 +)
  22307. Retracting propose*predict-yes
  22308. -->
  22309. (O2109 ^name predict-yes +)
  22310. (S1 ^operator O2109 +)
  22311. Retracting elaborate*reward*based*on*reward
  22312. -->
  22313. (R1058 ^value 1 +)
  22314. (R1 ^reward R1058 +)
  22315. Retracting elaborate*copy-dir-to-output-link
  22316. -->
  22317. (I3 ^dir U +)
  22318. Retracting rl*prefer*rvt*predict-no*H0*2
  22319. -->
  22320. (S1 ^operator O2110 = 1.)
  22321. Retracting rl*prefer*rvt*predict-yes*H0*1
  22322. -->
  22323. (S1 ^operator O2109 = 0.)
  22324. =>WM: (14878: S1 ^operator O2112 +)
  22325. =>WM: (14877: S1 ^operator O2111 +)
  22326. =>WM: (14876: I3 ^dir L)
  22327. =>WM: (14875: O2112 ^name predict-no)
  22328. =>WM: (14874: O2111 ^name predict-yes)
  22329. =>WM: (14873: R1059 ^value 1)
  22330. =>WM: (14872: R1 ^reward R1059)
  22331. <=WM: (14863: S1 ^operator O2109 +)
  22332. <=WM: (14864: S1 ^operator O2110 +)
  22333. <=WM: (14865: S1 ^operator O2110)
  22334. <=WM: (14849: I3 ^dir U)
  22335. <=WM: (14859: R1 ^reward R1058)
  22336. <=WM: (14862: O2110 ^name predict-no)
  22337. <=WM: (14861: O2109 ^name predict-yes)
  22338. <=WM: (14860: R1058 ^value 1)
  22339. --- Inner Elaboration Phase, active level 1 (S1) ---
  22340. Firing prefer*rvt*predict-yes*H0
  22341. -->
  22342. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  22343. -->
  22344. (S1 ^operator O2111 = -0.1386470047172653)
  22345. Firing rl*prefer*rvt*predict-yes*H0*5
  22346. -->
  22347. (S1 ^operator O2111 = 0.2640265760956406)
  22348. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  22349. -->
  22350. Firing prefer*rvt*predict-no*H0
  22351. -->
  22352. Firing rl*prefer*rvt*predict-no*H0*6
  22353. -->
  22354. (S1 ^operator O2112 = 0.9438647703421585)
  22355. inner elaboration loop at bottom goal.
  22356. Retracting rl*prefer*rvt*predict-no*H0*6
  22357. -->
  22358. (S1 ^operator O2110 = 0.9438647703421585)
  22359. Retracting rl*prefer*rvt*predict-yes*H0*5
  22360. -->
  22361. (S1 ^operator O2109 = 0.2640265760956406)
  22362. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  22363. -->
  22364. (S1 ^operator O2109 = -0.1386470047172653)
  22365. --- END Proposal Phase ---
  22366. --- Decision Phase ---
  22367. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22368. =>WM: (14879: S1 ^operator O2112)
  22369. 1056: O: O2112 (predict-no)
  22370. --- END Decision Phase ---
  22371. --- Application Phase ---
  22372. --- Firing Productions (PE) For State At Depth 1 ---
  22373. --- Inner Elaboration Phase, active level 1 (S1) ---
  22374. Firing apply*operator
  22375. -->
  22376. (I3 ^predict-no N1056 + :O )
  22377. Firing apply*operator*complete
  22378. -->
  22379. (I3 ^predict-no N1055 - :O )
  22380. inner elaboration loop at bottom goal.
  22381. --- Change Working Memory (PE) ---
  22382. =>WM: (14880: I3 ^predict-no N1056)
  22383. <=WM: (14867: N1055 ^status complete)
  22384. <=WM: (14866: I3 ^predict-no N1055)
  22385. --- Firing Productions (IE) For State At Depth 1 ---
  22386. --- Inner Elaboration Phase, active level 1 (S1) ---
  22387. Firing monitor*world
  22388. -->
  22389. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22390. --- Change Working Memory (IE) ---
  22391. --- END Application Phase ---
  22392. --- Output Phase ---
  22393. ENV: Agent did: predict-no for direction L in state State-A
  22394. In State-A moving L
  22395. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22396. predict error 0
  22397. dir: dir isR
  22398. --- END Output Phase ---
  22399. /|\--- Input Phase ---
  22400. =>WM: (14884: I2 ^dir R)
  22401. =>WM: (14883: I2 ^reward 1)
  22402. =>WM: (14882: I2 ^see 0)
  22403. =>WM: (14881: N1056 ^status complete)
  22404. <=WM: (14870: I2 ^dir L)
  22405. <=WM: (14869: I2 ^reward 1)
  22406. <=WM: (14868: I2 ^see 0)
  22407. =>WM: (14885: I2 ^level-1 L0-root)
  22408. <=WM: (14871: I2 ^level-1 L0-root)
  22409. --- END Input Phase ---
  22410. --- Proposal Phase ---
  22411. --- Inner Elaboration Phase, active level 1 (S1) ---
  22412. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22413. -->
  22414. (S1 ^operator O2112 = -0.2817060109291377)
  22415. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22416. -->
  22417. (S1 ^operator O2111 = 0.6623248842123732)
  22418. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22419. -->
  22420. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22421. -->
  22422. Firing elaborate*copy-see-to-output-link
  22423. -->
  22424. (I3 ^see 0 +)
  22425. Firing elaborate*reward*based*on*reward
  22426. -->
  22427. (R1060 ^value 1 +)
  22428. (R1 ^reward R1060 +)
  22429. Firing propose*predict-yes
  22430. -->
  22431. (O2113 ^name predict-yes +)
  22432. (S1 ^operator O2113 +)
  22433. Firing propose*predict-no
  22434. -->
  22435. (O2114 ^name predict-no +)
  22436. (S1 ^operator O2114 +)
  22437. Firing rl*prefer*rvt*predict-no*H0*4
  22438. -->
  22439. (S1 ^operator O2112 = 0.3397912729403567)
  22440. Firing rl*prefer*rvt*predict-yes*H0*3
  22441. -->
  22442. (S1 ^operator O2111 = 0.3376989200650307)
  22443. Firing prefer*rvt*predict-yes*H0
  22444. -->
  22445. Firing prefer*rvt*predict-no*H0
  22446. -->
  22447. Firing elaborate*copy-dir-to-output-link
  22448. -->
  22449. (I3 ^dir R +)
  22450. inner elaboration loop at bottom goal.
  22451. Retracting elaborate*copy-see-to-output-link
  22452. -->
  22453. (I3 ^see 0 +)
  22454. Retracting propose*predict-no
  22455. -->
  22456. (O2112 ^name predict-no +)
  22457. (S1 ^operator O2112 +)
  22458. Retracting propose*predict-yes
  22459. -->
  22460. (O2111 ^name predict-yes +)
  22461. (S1 ^operator O2111 +)
  22462. Retracting elaborate*reward*based*on*reward
  22463. -->
  22464. (R1059 ^value 1 +)
  22465. (R1 ^reward R1059 +)
  22466. Retracting elaborate*copy-dir-to-output-link
  22467. -->
  22468. (I3 ^dir L +)
  22469. Retracting rl*prefer*rvt*predict-no*H0*6
  22470. -->
  22471. (S1 ^operator O2112 = 0.9438647703421585)
  22472. Retracting rl*prefer*rvt*predict-yes*H0*5
  22473. -->
  22474. (S1 ^operator O2111 = 0.2640265760956406)
  22475. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  22476. -->
  22477. (S1 ^operator O2111 = -0.1386470047172653)
  22478. =>WM: (14892: S1 ^operator O2114 +)
  22479. =>WM: (14891: S1 ^operator O2113 +)
  22480. =>WM: (14890: I3 ^dir R)
  22481. =>WM: (14889: O2114 ^name predict-no)
  22482. =>WM: (14888: O2113 ^name predict-yes)
  22483. =>WM: (14887: R1060 ^value 1)
  22484. =>WM: (14886: R1 ^reward R1060)
  22485. <=WM: (14877: S1 ^operator O2111 +)
  22486. <=WM: (14878: S1 ^operator O2112 +)
  22487. <=WM: (14879: S1 ^operator O2112)
  22488. <=WM: (14876: I3 ^dir L)
  22489. <=WM: (14872: R1 ^reward R1059)
  22490. <=WM: (14875: O2112 ^name predict-no)
  22491. <=WM: (14874: O2111 ^name predict-yes)
  22492. <=WM: (14873: R1059 ^value 1)
  22493. --- Inner Elaboration Phase, active level 1 (S1) ---
  22494. Firing prefer*rvt*predict-yes*H0
  22495. -->
  22496. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22497. -->
  22498. (S1 ^operator O2113 = 0.6623248842123732)
  22499. Firing rl*prefer*rvt*predict-yes*H0*3
  22500. -->
  22501. (S1 ^operator O2113 = 0.3376989200650307)
  22502. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22503. -->
  22504. Firing prefer*rvt*predict-no*H0
  22505. -->
  22506. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22507. -->
  22508. (S1 ^operator O2114 = -0.2817060109291377)
  22509. Firing rl*prefer*rvt*predict-no*H0*4
  22510. -->
  22511. (S1 ^operator O2114 = 0.3397912729403567)
  22512. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22513. -->
  22514. inner elaboration loop at bottom goal.
  22515. Retracting rl*prefer*rvt*predict-no*H0*4
  22516. -->
  22517. (S1 ^operator O2112 = 0.3397912729403567)
  22518. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22519. -->
  22520. (S1 ^operator O2112 = -0.2817060109291377)
  22521. Retracting rl*prefer*rvt*predict-yes*H0*3
  22522. -->
  22523. (S1 ^operator O2111 = 0.3376989200650307)
  22524. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22525. -->
  22526. (S1 ^operator O2111 = 0.6623248842123732)
  22527. --- END Proposal Phase ---
  22528. --- Decision Phase ---
  22529. RL update rl*prefer*rvt*predict-no*H0*6 0.943865 0 0.943865 -> 0.953124 0 0.953124(R,m,v=1,0.905063,0.086471)
  22530. =>WM: (14893: S1 ^operator O2113)
  22531. 1057: O: O2113 (predict-yes)
  22532. --- END Decision Phase ---
  22533. --- Application Phase ---
  22534. --- Firing Productions (PE) For State At Depth 1 ---
  22535. --- Inner Elaboration Phase, active level 1 (S1) ---
  22536. Firing apply*operator
  22537. -->
  22538. (I3 ^predict-yes N1057 + :O )
  22539. Firing apply*operator*complete
  22540. -->
  22541. (I3 ^predict-no N1056 - :O )
  22542. inner elaboration loop at bottom goal.
  22543. --- Change Working Memory (PE) ---
  22544. =>WM: (14894: I3 ^predict-yes N1057)
  22545. <=WM: (14881: N1056 ^status complete)
  22546. <=WM: (14880: I3 ^predict-no N1056)
  22547. --- Firing Productions (IE) For State At Depth 1 ---
  22548. --- Inner Elaboration Phase, active level 1 (S1) ---
  22549. Firing monitor*world
  22550. -->
  22551. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22552. --- Change Working Memory (IE) ---
  22553. --- END Application Phase ---
  22554. --- Output Phase ---
  22555. ENV: Agent did: predict-yes for direction R in state State-A
  22556. In State-A moving R
  22557. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22558. predict error 0
  22559. dir: dir isU
  22560. --- END Output Phase ---
  22561. -/|--- Input Phase ---
  22562. =>WM: (14898: I2 ^dir U)
  22563. =>WM: (14897: I2 ^reward 1)
  22564. =>WM: (14896: I2 ^see 1)
  22565. =>WM: (14895: N1057 ^status complete)
  22566. <=WM: (14884: I2 ^dir R)
  22567. <=WM: (14883: I2 ^reward 1)
  22568. <=WM: (14882: I2 ^see 0)
  22569. =>WM: (14899: I2 ^level-1 R1-root)
  22570. <=WM: (14885: I2 ^level-1 L0-root)
  22571. --- END Input Phase ---
  22572. --- Proposal Phase ---
  22573. --- Inner Elaboration Phase, active level 1 (S1) ---
  22574. Firing elaborate*copy-see-to-output-link
  22575. -->
  22576. (I3 ^see 1 +)
  22577. Firing elaborate*reward*based*on*reward
  22578. -->
  22579. (R1061 ^value 1 +)
  22580. (R1 ^reward R1061 +)
  22581. Firing propose*predict-yes
  22582. -->
  22583. (O2115 ^name predict-yes +)
  22584. (S1 ^operator O2115 +)
  22585. Firing propose*predict-no
  22586. -->
  22587. (O2116 ^name predict-no +)
  22588. (S1 ^operator O2116 +)
  22589. Firing rl*prefer*rvt*predict-no*H0*2
  22590. -->
  22591. (S1 ^operator O2114 = 1.)
  22592. Firing rl*prefer*rvt*predict-yes*H0*1
  22593. -->
  22594. (S1 ^operator O2113 = 0.)
  22595. Firing prefer*rvt*predict-yes*H0
  22596. -->
  22597. Firing prefer*rvt*predict-no*H0
  22598. -->
  22599. Firing elaborate*copy-dir-to-output-link
  22600. -->
  22601. (I3 ^dir U +)
  22602. inner elaboration loop at bottom goal.
  22603. Retracting elaborate*copy-see-to-output-link
  22604. -->
  22605. (I3 ^see 0 +)
  22606. Retracting propose*predict-no
  22607. -->
  22608. (O2114 ^name predict-no +)
  22609. (S1 ^operator O2114 +)
  22610. Retracting propose*predict-yes
  22611. -->
  22612. (O2113 ^name predict-yes +)
  22613. (S1 ^operator O2113 +)
  22614. Retracting elaborate*reward*based*on*reward
  22615. -->
  22616. (R1060 ^value 1 +)
  22617. (R1 ^reward R1060 +)
  22618. Retracting elaborate*copy-dir-to-output-link
  22619. -->
  22620. (I3 ^dir R +)
  22621. Retracting rl*prefer*rvt*predict-no*H0*4
  22622. -->
  22623. (S1 ^operator O2114 = 0.3397912729403567)
  22624. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  22625. -->
  22626. (S1 ^operator O2114 = -0.2817060109291377)
  22627. Retracting rl*prefer*rvt*predict-yes*H0*3
  22628. -->
  22629. (S1 ^operator O2113 = 0.3376989200650307)
  22630. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  22631. -->
  22632. (S1 ^operator O2113 = 0.6623248842123732)
  22633. =>WM: (14907: S1 ^operator O2116 +)
  22634. =>WM: (14906: S1 ^operator O2115 +)
  22635. =>WM: (14905: I3 ^dir U)
  22636. =>WM: (14904: O2116 ^name predict-no)
  22637. =>WM: (14903: O2115 ^name predict-yes)
  22638. =>WM: (14902: R1061 ^value 1)
  22639. =>WM: (14901: R1 ^reward R1061)
  22640. =>WM: (14900: I3 ^see 1)
  22641. <=WM: (14891: S1 ^operator O2113 +)
  22642. <=WM: (14893: S1 ^operator O2113)
  22643. <=WM: (14892: S1 ^operator O2114 +)
  22644. <=WM: (14890: I3 ^dir R)
  22645. <=WM: (14886: R1 ^reward R1060)
  22646. <=WM: (14830: I3 ^see 0)
  22647. <=WM: (14889: O2114 ^name predict-no)
  22648. <=WM: (14888: O2113 ^name predict-yes)
  22649. <=WM: (14887: R1060 ^value 1)
  22650. --- Inner Elaboration Phase, active level 1 (S1) ---
  22651. Firing prefer*rvt*predict-yes*H0
  22652. -->
  22653. Firing rl*prefer*rvt*predict-yes*H0*1
  22654. -->
  22655. (S1 ^operator O2115 = 0.)
  22656. Firing prefer*rvt*predict-no*H0
  22657. -->
  22658. Firing rl*prefer*rvt*predict-no*H0*2
  22659. -->
  22660. (S1 ^operator O2116 = 1.)
  22661. inner elaboration loop at bottom goal.
  22662. Retracting rl*prefer*rvt*predict-no*H0*2
  22663. -->
  22664. (S1 ^operator O2114 = 1.)
  22665. Retracting rl*prefer*rvt*predict-yes*H0*1
  22666. -->
  22667. (S1 ^operator O2113 = 0.)
  22668. --- END Proposal Phase ---
  22669. --- Decision Phase ---
  22670. RL update rl*prefer*rvt*predict-yes*H0*3 0.590097 -0.252398 0.337699 -> 0.590095 -0.252398 0.337697(R,m,v=1,0.902857,0.0882102)
  22671. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.40993 0.252395 0.662325 -> 0.409928 0.252395 0.662323(R,m,v=1,1,0)
  22672. =>WM: (14908: S1 ^operator O2116)
  22673. 1058: O: O2116 (predict-no)
  22674. --- END Decision Phase ---
  22675. --- Application Phase ---
  22676. --- Firing Productions (PE) For State At Depth 1 ---
  22677. --- Inner Elaboration Phase, active level 1 (S1) ---
  22678. Firing apply*operator
  22679. -->
  22680. (I3 ^predict-no N1058 + :O )
  22681. Firing apply*operator*complete
  22682. -->
  22683. (I3 ^predict-yes N1057 - :O )
  22684. inner elaboration loop at bottom goal.
  22685. --- Change Working Memory (PE) ---
  22686. =>WM: (14909: I3 ^predict-no N1058)
  22687. <=WM: (14895: N1057 ^status complete)
  22688. <=WM: (14894: I3 ^predict-yes N1057)
  22689. --- Firing Productions (IE) For State At Depth 1 ---
  22690. --- Inner Elaboration Phase, active level 1 (S1) ---
  22691. Firing monitor*world
  22692. -->
  22693. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22694. --- Change Working Memory (IE) ---
  22695. --- END Application Phase ---
  22696. --- Output Phase ---
  22697. ENV: Agent did: predict-no for direction U in state State-B
  22698. In State-B moving U
  22699. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22700. predict error 0
  22701. dir: dir isR
  22702. --- END Output Phase ---
  22703. \---- Input Phase ---
  22704. =>WM: (14913: I2 ^dir R)
  22705. =>WM: (14912: I2 ^reward 1)
  22706. =>WM: (14911: I2 ^see 0)
  22707. =>WM: (14910: N1058 ^status complete)
  22708. <=WM: (14898: I2 ^dir U)
  22709. <=WM: (14897: I2 ^reward 1)
  22710. <=WM: (14896: I2 ^see 1)
  22711. =>WM: (14914: I2 ^level-1 R1-root)
  22712. <=WM: (14899: I2 ^level-1 R1-root)
  22713. --- END Input Phase ---
  22714. --- Proposal Phase ---
  22715. --- Inner Elaboration Phase, active level 1 (S1) ---
  22716. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  22717. -->
  22718. (S1 ^operator O2115 = -0.1070236389116304)
  22719. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  22720. -->
  22721. (S1 ^operator O2116 = 0.6602356998698435)
  22722. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22723. -->
  22724. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22725. -->
  22726. Firing elaborate*copy-see-to-output-link
  22727. -->
  22728. (I3 ^see 0 +)
  22729. Firing elaborate*reward*based*on*reward
  22730. -->
  22731. (R1062 ^value 1 +)
  22732. (R1 ^reward R1062 +)
  22733. Firing propose*predict-yes
  22734. -->
  22735. (O2117 ^name predict-yes +)
  22736. (S1 ^operator O2117 +)
  22737. Firing propose*predict-no
  22738. -->
  22739. (O2118 ^name predict-no +)
  22740. (S1 ^operator O2118 +)
  22741. Firing rl*prefer*rvt*predict-no*H0*4
  22742. -->
  22743. (S1 ^operator O2116 = 0.3397912729403567)
  22744. Firing rl*prefer*rvt*predict-yes*H0*3
  22745. -->
  22746. (S1 ^operator O2115 = 0.3376969893996755)
  22747. Firing prefer*rvt*predict-yes*H0
  22748. -->
  22749. Firing prefer*rvt*predict-no*H0
  22750. -->
  22751. Firing elaborate*copy-dir-to-output-link
  22752. -->
  22753. (I3 ^dir R +)
  22754. inner elaboration loop at bottom goal.
  22755. Retracting elaborate*copy-see-to-output-link
  22756. -->
  22757. (I3 ^see 1 +)
  22758. Retracting propose*predict-no
  22759. -->
  22760. (O2116 ^name predict-no +)
  22761. (S1 ^operator O2116 +)
  22762. Retracting propose*predict-yes
  22763. -->
  22764. (O2115 ^name predict-yes +)
  22765. (S1 ^operator O2115 +)
  22766. Retracting elaborate*reward*based*on*reward
  22767. -->
  22768. (R1061 ^value 1 +)
  22769. (R1 ^reward R1061 +)
  22770. Retracting elaborate*copy-dir-to-output-link
  22771. -->
  22772. (I3 ^dir U +)
  22773. Retracting rl*prefer*rvt*predict-no*H0*2
  22774. -->
  22775. (S1 ^operator O2116 = 1.)
  22776. Retracting rl*prefer*rvt*predict-yes*H0*1
  22777. -->
  22778. (S1 ^operator O2115 = 0.)
  22779. =>WM: (14922: S1 ^operator O2118 +)
  22780. =>WM: (14921: S1 ^operator O2117 +)
  22781. =>WM: (14920: I3 ^dir R)
  22782. =>WM: (14919: O2118 ^name predict-no)
  22783. =>WM: (14918: O2117 ^name predict-yes)
  22784. =>WM: (14917: R1062 ^value 1)
  22785. =>WM: (14916: R1 ^reward R1062)
  22786. =>WM: (14915: I3 ^see 0)
  22787. <=WM: (14906: S1 ^operator O2115 +)
  22788. <=WM: (14907: S1 ^operator O2116 +)
  22789. <=WM: (14908: S1 ^operator O2116)
  22790. <=WM: (14905: I3 ^dir U)
  22791. <=WM: (14901: R1 ^reward R1061)
  22792. <=WM: (14900: I3 ^see 1)
  22793. <=WM: (14904: O2116 ^name predict-no)
  22794. <=WM: (14903: O2115 ^name predict-yes)
  22795. <=WM: (14902: R1061 ^value 1)
  22796. --- Inner Elaboration Phase, active level 1 (S1) ---
  22797. Firing prefer*rvt*predict-yes*H0
  22798. -->
  22799. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  22800. -->
  22801. (S1 ^operator O2117 = -0.1070236389116304)
  22802. Firing rl*prefer*rvt*predict-yes*H0*3
  22803. -->
  22804. (S1 ^operator O2117 = 0.3376969893996755)
  22805. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22806. -->
  22807. Firing prefer*rvt*predict-no*H0
  22808. -->
  22809. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  22810. -->
  22811. (S1 ^operator O2118 = 0.6602356998698435)
  22812. Firing rl*prefer*rvt*predict-no*H0*4
  22813. -->
  22814. (S1 ^operator O2118 = 0.3397912729403567)
  22815. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22816. -->
  22817. inner elaboration loop at bottom goal.
  22818. Retracting rl*prefer*rvt*predict-no*H0*4
  22819. -->
  22820. (S1 ^operator O2116 = 0.3397912729403567)
  22821. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  22822. -->
  22823. (S1 ^operator O2116 = 0.6602356998698435)
  22824. Retracting rl*prefer*rvt*predict-yes*H0*3
  22825. -->
  22826. (S1 ^operator O2115 = 0.3376969893996755)
  22827. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  22828. -->
  22829. (S1 ^operator O2115 = -0.1070236389116304)
  22830. --- END Proposal Phase ---
  22831. --- Decision Phase ---
  22832. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22833. =>WM: (14923: S1 ^operator O2118)
  22834. 1059: O: O2118 (predict-no)
  22835. --- END Decision Phase ---
  22836. --- Application Phase ---
  22837. --- Firing Productions (PE) For State At Depth 1 ---
  22838. --- Inner Elaboration Phase, active level 1 (S1) ---
  22839. Firing apply*operator
  22840. -->
  22841. (I3 ^predict-no N1059 + :O )
  22842. Firing apply*operator*complete
  22843. -->
  22844. (I3 ^predict-no N1058 - :O )
  22845. inner elaboration loop at bottom goal.
  22846. --- Change Working Memory (PE) ---
  22847. =>WM: (14924: I3 ^predict-no N1059)
  22848. <=WM: (14910: N1058 ^status complete)
  22849. <=WM: (14909: I3 ^predict-no N1058)
  22850. --- Firing Productions (IE) For State At Depth 1 ---
  22851. --- Inner Elaboration Phase, active level 1 (S1) ---
  22852. Firing monitor*world
  22853. -->
  22854. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22855. --- Change Working Memory (IE) ---
  22856. --- END Application Phase ---
  22857. --- Output Phase ---
  22858. ENV: Agent did: predict-no for direction R in state State-B
  22859. In State-B moving R
  22860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22861. predict error 0
  22862. dir: dir isR
  22863. --- END Output Phase ---
  22864. /|\--- Input Phase ---
  22865. =>WM: (14928: I2 ^dir R)
  22866. =>WM: (14927: I2 ^reward 1)
  22867. =>WM: (14926: I2 ^see 0)
  22868. =>WM: (14925: N1059 ^status complete)
  22869. <=WM: (14913: I2 ^dir R)
  22870. <=WM: (14912: I2 ^reward 1)
  22871. <=WM: (14911: I2 ^see 0)
  22872. =>WM: (14929: I2 ^level-1 R0-root)
  22873. <=WM: (14914: I2 ^level-1 R1-root)
  22874. --- END Input Phase ---
  22875. --- Proposal Phase ---
  22876. --- Inner Elaboration Phase, active level 1 (S1) ---
  22877. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22878. -->
  22879. (S1 ^operator O2118 = 0.6601887223234754)
  22880. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22881. -->
  22882. (S1 ^operator O2117 = -0.1028953566115423)
  22883. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22884. -->
  22885. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22886. -->
  22887. Firing elaborate*copy-see-to-output-link
  22888. -->
  22889. (I3 ^see 0 +)
  22890. Firing elaborate*reward*based*on*reward
  22891. -->
  22892. (R1063 ^value 1 +)
  22893. (R1 ^reward R1063 +)
  22894. Firing propose*predict-yes
  22895. -->
  22896. (O2119 ^name predict-yes +)
  22897. (S1 ^operator O2119 +)
  22898. Firing propose*predict-no
  22899. -->
  22900. (O2120 ^name predict-no +)
  22901. (S1 ^operator O2120 +)
  22902. Firing rl*prefer*rvt*predict-no*H0*4
  22903. -->
  22904. (S1 ^operator O2118 = 0.3397912729403567)
  22905. Firing rl*prefer*rvt*predict-yes*H0*3
  22906. -->
  22907. (S1 ^operator O2117 = 0.3376969893996755)
  22908. Firing prefer*rvt*predict-yes*H0
  22909. -->
  22910. Firing prefer*rvt*predict-no*H0
  22911. -->
  22912. Firing elaborate*copy-dir-to-output-link
  22913. -->
  22914. (I3 ^dir R +)
  22915. inner elaboration loop at bottom goal.
  22916. Retracting elaborate*copy-see-to-output-link
  22917. -->
  22918. (I3 ^see 0 +)
  22919. Retracting propose*predict-no
  22920. -->
  22921. (O2118 ^name predict-no +)
  22922. (S1 ^operator O2118 +)
  22923. Retracting propose*predict-yes
  22924. -->
  22925. (O2117 ^name predict-yes +)
  22926. (S1 ^operator O2117 +)
  22927. Retracting elaborate*reward*based*on*reward
  22928. -->
  22929. (R1062 ^value 1 +)
  22930. (R1 ^reward R1062 +)
  22931. Retracting elaborate*copy-dir-to-output-link
  22932. -->
  22933. (I3 ^dir R +)
  22934. Retracting rl*prefer*rvt*predict-no*H0*4
  22935. -->
  22936. (S1 ^operator O2118 = 0.3397912729403567)
  22937. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  22938. -->
  22939. (S1 ^operator O2118 = 0.6602356998698435)
  22940. Retracting rl*prefer*rvt*predict-yes*H0*3
  22941. -->
  22942. (S1 ^operator O2117 = 0.3376969893996755)
  22943. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  22944. -->
  22945. (S1 ^operator O2117 = -0.1070236389116304)
  22946. =>WM: (14935: S1 ^operator O2120 +)
  22947. =>WM: (14934: S1 ^operator O2119 +)
  22948. =>WM: (14933: O2120 ^name predict-no)
  22949. =>WM: (14932: O2119 ^name predict-yes)
  22950. =>WM: (14931: R1063 ^value 1)
  22951. =>WM: (14930: R1 ^reward R1063)
  22952. <=WM: (14921: S1 ^operator O2117 +)
  22953. <=WM: (14922: S1 ^operator O2118 +)
  22954. <=WM: (14923: S1 ^operator O2118)
  22955. <=WM: (14916: R1 ^reward R1062)
  22956. <=WM: (14919: O2118 ^name predict-no)
  22957. <=WM: (14918: O2117 ^name predict-yes)
  22958. <=WM: (14917: R1062 ^value 1)
  22959. --- Inner Elaboration Phase, active level 1 (S1) ---
  22960. Firing prefer*rvt*predict-yes*H0
  22961. -->
  22962. Firing rl*prefer*rvt*predict-yes*H0*3
  22963. -->
  22964. (S1 ^operator O2119 = 0.3376969893996755)
  22965. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  22966. -->
  22967. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22968. -->
  22969. (S1 ^operator O2119 = -0.1028953566115423)
  22970. Firing prefer*rvt*predict-no*H0
  22971. -->
  22972. Firing rl*prefer*rvt*predict-no*H0*4
  22973. -->
  22974. (S1 ^operator O2120 = 0.3397912729403567)
  22975. Firing prefer*rvt*predict-no*H0*4*v1*H1
  22976. -->
  22977. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22978. -->
  22979. (S1 ^operator O2120 = 0.6601887223234754)
  22980. inner elaboration loop at bottom goal.
  22981. Retracting rl*prefer*rvt*predict-no*H0*4
  22982. -->
  22983. (S1 ^operator O2118 = 0.3397912729403567)
  22984. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  22985. -->
  22986. (S1 ^operator O2118 = 0.6601887223234754)
  22987. Retracting rl*prefer*rvt*predict-yes*H0*3
  22988. -->
  22989. (S1 ^operator O2117 = 0.3376969893996755)
  22990. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  22991. -->
  22992. (S1 ^operator O2117 = -0.1028953566115423)
  22993. --- END Proposal Phase ---
  22994. --- Decision Phase ---
  22995. RL update rl*prefer*rvt*predict-no*H0*4 0.570276 -0.230485 0.339791 -> 0.570274 -0.230484 0.339789(R,m,v=1,0.883978,0.103131)
  22996. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429752 0.230483 0.660236 -> 0.42975 0.230483 0.660233(R,m,v=1,1,0)
  22997. =>WM: (14936: S1 ^operator O2120)
  22998. 1060: O: O2120 (predict-no)
  22999. --- END Decision Phase ---
  23000. --- Application Phase ---
  23001. --- Firing Productions (PE) For State At Depth 1 ---
  23002. --- Inner Elaboration Phase, active level 1 (S1) ---
  23003. Firing apply*operator
  23004. -->
  23005. (I3 ^predict-no N1060 + :O )
  23006. Firing apply*operator*complete
  23007. -->
  23008. (I3 ^predict-no N1059 - :O )
  23009. inner elaboration loop at bottom goal.
  23010. --- Change Working Memory (PE) ---
  23011. =>WM: (14937: I3 ^predict-no N1060)
  23012. <=WM: (14925: N1059 ^status complete)
  23013. <=WM: (14924: I3 ^predict-no N1059)
  23014. --- Firing Productions (IE) For State At Depth 1 ---
  23015. --- Inner Elaboration Phase, active level 1 (S1) ---
  23016. Firing monitor*world
  23017. -->
  23018. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23019. --- Change Working Memory (IE) ---
  23020. --- END Application Phase ---
  23021. --- Output Phase ---
  23022. ENV: Agent did: predict-no for direction R in state State-B
  23023. In State-B moving R
  23024. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23025. predict error 0
  23026. dir: dir isU
  23027. --- END Output Phase ---
  23028. -/|--- Input Phase ---
  23029. =>WM: (14941: I2 ^dir U)
  23030. =>WM: (14940: I2 ^reward 1)
  23031. =>WM: (14939: I2 ^see 0)
  23032. =>WM: (14938: N1060 ^status complete)
  23033. <=WM: (14928: I2 ^dir R)
  23034. <=WM: (14927: I2 ^reward 1)
  23035. <=WM: (14926: I2 ^see 0)
  23036. =>WM: (14942: I2 ^level-1 R0-root)
  23037. <=WM: (14929: I2 ^level-1 R0-root)
  23038. --- END Input Phase ---
  23039. --- Proposal Phase ---
  23040. --- Inner Elaboration Phase, active level 1 (S1) ---
  23041. Firing elaborate*copy-see-to-output-link
  23042. -->
  23043. (I3 ^see 0 +)
  23044. Firing elaborate*reward*based*on*reward
  23045. -->
  23046. (R1064 ^value 1 +)
  23047. (R1 ^reward R1064 +)
  23048. Firing propose*predict-yes
  23049. -->
  23050. (O2121 ^name predict-yes +)
  23051. (S1 ^operator O2121 +)
  23052. Firing propose*predict-no
  23053. -->
  23054. (O2122 ^name predict-no +)
  23055. (S1 ^operator O2122 +)
  23056. Firing rl*prefer*rvt*predict-no*H0*2
  23057. -->
  23058. (S1 ^operator O2120 = 1.)
  23059. Firing rl*prefer*rvt*predict-yes*H0*1
  23060. -->
  23061. (S1 ^operator O2119 = 0.)
  23062. Firing prefer*rvt*predict-yes*H0
  23063. -->
  23064. Firing prefer*rvt*predict-no*H0
  23065. -->
  23066. Firing elaborate*copy-dir-to-output-link
  23067. -->
  23068. (I3 ^dir U +)
  23069. inner elaboration loop at bottom goal.
  23070. Retracting elaborate*copy-see-to-output-link
  23071. -->
  23072. (I3 ^see 0 +)
  23073. Retracting propose*predict-no
  23074. -->
  23075. (O2120 ^name predict-no +)
  23076. (S1 ^operator O2120 +)
  23077. Retracting propose*predict-yes
  23078. -->
  23079. (O2119 ^name predict-yes +)
  23080. (S1 ^operator O2119 +)
  23081. Retracting elaborate*reward*based*on*reward
  23082. -->
  23083. (R1063 ^value 1 +)
  23084. (R1 ^reward R1063 +)
  23085. Retracting elaborate*copy-dir-to-output-link
  23086. -->
  23087. (I3 ^dir R +)
  23088. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  23089. -->
  23090. (S1 ^operator O2120 = 0.6601887223234754)
  23091. Retracting rl*prefer*rvt*predict-no*H0*4
  23092. -->
  23093. (S1 ^operator O2120 = 0.3397890971862937)
  23094. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  23095. -->
  23096. (S1 ^operator O2119 = -0.1028953566115423)
  23097. Retracting rl*prefer*rvt*predict-yes*H0*3
  23098. -->
  23099. (S1 ^operator O2119 = 0.3376969893996755)
  23100. =>WM: (14949: S1 ^operator O2122 +)
  23101. =>WM: (14948: S1 ^operator O2121 +)
  23102. =>WM: (14947: I3 ^dir U)
  23103. =>WM: (14946: O2122 ^name predict-no)
  23104. =>WM: (14945: O2121 ^name predict-yes)
  23105. =>WM: (14944: R1064 ^value 1)
  23106. =>WM: (14943: R1 ^reward R1064)
  23107. <=WM: (14934: S1 ^operator O2119 +)
  23108. <=WM: (14935: S1 ^operator O2120 +)
  23109. <=WM: (14936: S1 ^operator O2120)
  23110. <=WM: (14920: I3 ^dir R)
  23111. <=WM: (14930: R1 ^reward R1063)
  23112. <=WM: (14933: O2120 ^name predict-no)
  23113. <=WM: (14932: O2119 ^name predict-yes)
  23114. <=WM: (14931: R1063 ^value 1)
  23115. --- Inner Elaboration Phase, active level 1 (S1) ---
  23116. Firing prefer*rvt*predict-yes*H0
  23117. -->
  23118. Firing rl*prefer*rvt*predict-yes*H0*1
  23119. -->
  23120. (S1 ^operator O2121 = 0.)
  23121. Firing prefer*rvt*predict-no*H0
  23122. -->
  23123. Firing rl*prefer*rvt*predict-no*H0*2
  23124. -->
  23125. (S1 ^operator O2122 = 1.)
  23126. inner elaboration loop at bottom goal.
  23127. Retracting rl*prefer*rvt*predict-no*H0*2
  23128. -->
  23129. (S1 ^operator O2120 = 1.)
  23130. Retracting rl*prefer*rvt*predict-yes*H0*1
  23131. -->
  23132. (S1 ^operator O2119 = 0.)
  23133. --- END Proposal Phase ---
  23134. --- Decision Phase ---
  23135. RL update rl*prefer*rvt*predict-no*H0*4 0.570274 -0.230484 0.339789 -> 0.570275 -0.230484 0.339791(R,m,v=1,0.884615,0.102635)
  23136. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429703 0.230485 0.660189 -> 0.429705 0.230485 0.660191(R,m,v=1,1,0)
  23137. =>WM: (14950: S1 ^operator O2122)
  23138. 1061: O: O2122 (predict-no)
  23139. --- END Decision Phase ---
  23140. --- Application Phase ---
  23141. --- Firing Productions (PE) For State At Depth 1 ---
  23142. --- Inner Elaboration Phase, active level 1 (S1) ---
  23143. Firing apply*operator
  23144. -->
  23145. (I3 ^predict-no N1061 + :O )
  23146. Firing apply*operator*complete
  23147. -->
  23148. (I3 ^predict-no N1060 - :O )
  23149. inner elaboration loop at bottom goal.
  23150. --- Change Working Memory (PE) ---
  23151. =>WM: (14951: I3 ^predict-no N1061)
  23152. <=WM: (14938: N1060 ^status complete)
  23153. <=WM: (14937: I3 ^predict-no N1060)
  23154. --- Firing Productions (IE) For State At Depth 1 ---
  23155. --- Inner Elaboration Phase, active level 1 (S1) ---
  23156. Firing monitor*world
  23157. -->
  23158. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23159. --- Change Working Memory (IE) ---
  23160. --- END Application Phase ---
  23161. --- Output Phase ---
  23162. ENV: Agent did: predict-no for direction U in state State-B
  23163. In State-B moving U
  23164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23165. predict error 0
  23166. dir: dir isU
  23167. --- END Output Phase ---
  23168. \--- Input Phase ---
  23169. =>WM: (14955: I2 ^dir U)
  23170. =>WM: (14954: I2 ^reward 1)
  23171. =>WM: (14953: I2 ^see 0)
  23172. =>WM: (14952: N1061 ^status complete)
  23173. <=WM: (14941: I2 ^dir U)
  23174. <=WM: (14940: I2 ^reward 1)
  23175. <=WM: (14939: I2 ^see 0)
  23176. =>WM: (14956: I2 ^level-1 R0-root)
  23177. <=WM: (14942: I2 ^level-1 R0-root)
  23178. --- END Input Phase ---
  23179. --- Proposal Phase ---
  23180. --- Inner Elaboration Phase, active level 1 (S1) ---
  23181. Firing elaborate*copy-see-to-output-link
  23182. -->
  23183. (I3 ^see 0 +)
  23184. Firing elaborate*reward*based*on*reward
  23185. -->
  23186. (R1065 ^value 1 +)
  23187. (R1 ^reward R1065 +)
  23188. Firing propose*predict-yes
  23189. -->
  23190. (O2123 ^name predict-yes +)
  23191. (S1 ^operator O2123 +)
  23192. Firing propose*predict-no
  23193. -->
  23194. (O2124 ^name predict-no +)
  23195. (S1 ^operator O2124 +)
  23196. Firing rl*prefer*rvt*predict-no*H0*2
  23197. -->
  23198. (S1 ^operator O2122 = 1.)
  23199. Firing rl*prefer*rvt*predict-yes*H0*1
  23200. -->
  23201. (S1 ^operator O2121 = 0.)
  23202. Firing prefer*rvt*predict-yes*H0
  23203. -->
  23204. Firing prefer*rvt*predict-no*H0
  23205. -->
  23206. Firing elaborate*copy-dir-to-output-link
  23207. -->
  23208. (I3 ^dir U +)
  23209. inner elaboration loop at bottom goal.
  23210. Retracting elaborate*copy-see-to-output-link
  23211. -->
  23212. (I3 ^see 0 +)
  23213. Retracting propose*predict-no
  23214. -->
  23215. (O2122 ^name predict-no +)
  23216. (S1 ^operator O2122 +)
  23217. Retracting propose*predict-yes
  23218. -->
  23219. (O2121 ^name predict-yes +)
  23220. (S1 ^operator O2121 +)
  23221. Retracting elaborate*reward*based*on*reward
  23222. -->
  23223. (R1064 ^value 1 +)
  23224. (R1 ^reward R1064 +)
  23225. Retracting elaborate*copy-dir-to-output-link
  23226. -->
  23227. (I3 ^dir U +)
  23228. Retracting rl*prefer*rvt*predict-no*H0*2
  23229. -->
  23230. (S1 ^operator O2122 = 1.)
  23231. Retracting rl*prefer*rvt*predict-yes*H0*1
  23232. -->
  23233. (S1 ^operator O2121 = 0.)
  23234. =>WM: (14962: S1 ^operator O2124 +)
  23235. =>WM: (14961: S1 ^operator O2123 +)
  23236. =>WM: (14960: O2124 ^name predict-no)
  23237. =>WM: (14959: O2123 ^name predict-yes)
  23238. =>WM: (14958: R1065 ^value 1)
  23239. =>WM: (14957: R1 ^reward R1065)
  23240. <=WM: (14948: S1 ^operator O2121 +)
  23241. <=WM: (14949: S1 ^operator O2122 +)
  23242. <=WM: (14950: S1 ^operator O2122)
  23243. <=WM: (14943: R1 ^reward R1064)
  23244. <=WM: (14946: O2122 ^name predict-no)
  23245. <=WM: (14945: O2121 ^name predict-yes)
  23246. <=WM: (14944: R1064 ^value 1)
  23247. --- Inner Elaboration Phase, active level 1 (S1) ---
  23248. Firing prefer*rvt*predict-yes*H0
  23249. -->
  23250. Firing rl*prefer*rvt*predict-yes*H0*1
  23251. -->
  23252. (S1 ^operator O2123 = 0.)
  23253. Firing prefer*rvt*predict-no*H0
  23254. -->
  23255. Firing rl*prefer*rvt*predict-no*H0*2
  23256. -->
  23257. (S1 ^operator O2124 = 1.)
  23258. inner elaboration loop at bottom goal.
  23259. Retracting rl*prefer*rvt*predict-no*H0*2
  23260. -->
  23261. (S1 ^operator O2122 = 1.)
  23262. Retracting rl*prefer*rvt*predict-yes*H0*1
  23263. -->
  23264. (S1 ^operator O2121 = 0.)
  23265. --- END Proposal Phase ---
  23266. --- Decision Phase ---
  23267. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23268. =>WM: (14963: S1 ^operator O2124)
  23269. 1062: O: O2124 (predict-no)
  23270. --- END Decision Phase ---
  23271. --- Application Phase ---
  23272. --- Firing Productions (PE) For State At Depth 1 ---
  23273. --- Inner Elaboration Phase, active level 1 (S1) ---
  23274. Firing apply*operator
  23275. -->
  23276. (I3 ^predict-no N1062 + :O )
  23277. Firing apply*operator*complete
  23278. -->
  23279. (I3 ^predict-no N1061 - :O )
  23280. inner elaboration loop at bottom goal.
  23281. --- Change Working Memory (PE) ---
  23282. =>WM: (14964: I3 ^predict-no N1062)
  23283. <=WM: (14952: N1061 ^status complete)
  23284. <=WM: (14951: I3 ^predict-no N1061)
  23285. --- Firing Productions (IE) For State At Depth 1 ---
  23286. --- Inner Elaboration Phase, active level 1 (S1) ---
  23287. Firing monitor*world
  23288. -->
  23289. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23290. --- Change Working Memory (IE) ---
  23291. --- END Application Phase ---
  23292. --- Output Phase ---
  23293. ENV: Agent did: predict-no for direction U in state State-B
  23294. In State-B moving U
  23295. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23296. predict error 0
  23297. dir: dir isU
  23298. --- END Output Phase ---
  23299. -/--- Input Phase ---
  23300. =>WM: (14968: I2 ^dir U)
  23301. =>WM: (14967: I2 ^reward 1)
  23302. =>WM: (14966: I2 ^see 0)
  23303. =>WM: (14965: N1062 ^status complete)
  23304. <=WM: (14955: I2 ^dir U)
  23305. <=WM: (14954: I2 ^reward 1)
  23306. <=WM: (14953: I2 ^see 0)
  23307. =>WM: (14969: I2 ^level-1 R0-root)
  23308. <=WM: (14956: I2 ^level-1 R0-root)
  23309. --- END Input Phase ---
  23310. --- Proposal Phase ---
  23311. --- Inner Elaboration Phase, active level 1 (S1) ---
  23312. Firing elaborate*copy-see-to-output-link
  23313. -->
  23314. (I3 ^see 0 +)
  23315. Firing elaborate*reward*based*on*reward
  23316. -->
  23317. (R1066 ^value 1 +)
  23318. (R1 ^reward R1066 +)
  23319. Firing propose*predict-yes
  23320. -->
  23321. (O2125 ^name predict-yes +)
  23322. (S1 ^operator O2125 +)
  23323. Firing propose*predict-no
  23324. -->
  23325. (O2126 ^name predict-no +)
  23326. (S1 ^operator O2126 +)
  23327. Firing rl*prefer*rvt*predict-no*H0*2
  23328. -->
  23329. (S1 ^operator O2124 = 1.)
  23330. Firing rl*prefer*rvt*predict-yes*H0*1
  23331. -->
  23332. (S1 ^operator O2123 = 0.)
  23333. Firing prefer*rvt*predict-yes*H0
  23334. -->
  23335. Firing prefer*rvt*predict-no*H0
  23336. -->
  23337. Firing elaborate*copy-dir-to-output-link
  23338. -->
  23339. (I3 ^dir U +)
  23340. inner elaboration loop at bottom goal.
  23341. Retracting elaborate*copy-see-to-output-link
  23342. -->
  23343. (I3 ^see 0 +)
  23344. Retracting propose*predict-no
  23345. -->
  23346. (O2124 ^name predict-no +)
  23347. (S1 ^operator O2124 +)
  23348. Retracting propose*predict-yes
  23349. -->
  23350. (O2123 ^name predict-yes +)
  23351. (S1 ^operator O2123 +)
  23352. Retracting elaborate*reward*based*on*reward
  23353. -->
  23354. (R1065 ^value 1 +)
  23355. (R1 ^reward R1065 +)
  23356. Retracting elaborate*copy-dir-to-output-link
  23357. -->
  23358. (I3 ^dir U +)
  23359. Retracting rl*prefer*rvt*predict-no*H0*2
  23360. -->
  23361. (S1 ^operator O2124 = 1.)
  23362. Retracting rl*prefer*rvt*predict-yes*H0*1
  23363. -->
  23364. (S1 ^operator O2123 = 0.)
  23365. =>WM: (14975: S1 ^operator O2126 +)
  23366. =>WM: (14974: S1 ^operator O2125 +)
  23367. =>WM: (14973: O2126 ^name predict-no)
  23368. =>WM: (14972: O2125 ^name predict-yes)
  23369. =>WM: (14971: R1066 ^value 1)
  23370. =>WM: (14970: R1 ^reward R1066)
  23371. <=WM: (14961: S1 ^operator O2123 +)
  23372. <=WM: (14962: S1 ^operator O2124 +)
  23373. <=WM: (14963: S1 ^operator O2124)
  23374. <=WM: (14957: R1 ^reward R1065)
  23375. <=WM: (14960: O2124 ^name predict-no)
  23376. <=WM: (14959: O2123 ^name predict-yes)
  23377. <=WM: (14958: R1065 ^value 1)
  23378. --- Inner Elaboration Phase, active level 1 (S1) ---
  23379. Firing prefer*rvt*predict-yes*H0
  23380. -->
  23381. Firing rl*prefer*rvt*predict-yes*H0*1
  23382. -->
  23383. (S1 ^operator O2125 = 0.)
  23384. Firing prefer*rvt*predict-no*H0
  23385. -->
  23386. Firing rl*prefer*rvt*predict-no*H0*2
  23387. -->
  23388. (S1 ^operator O2126 = 1.)
  23389. inner elaboration loop at bottom goal.
  23390. Retracting rl*prefer*rvt*predict-no*H0*2
  23391. -->
  23392. (S1 ^operator O2124 = 1.)
  23393. Retracting rl*prefer*rvt*predict-yes*H0*1
  23394. -->
  23395. (S1 ^operator O2123 = 0.)
  23396. --- END Proposal Phase ---
  23397. --- Decision Phase ---
  23398. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23399. =>WM: (14976: S1 ^operator O2126)
  23400. 1063: O: O2126 (predict-no)
  23401. --- END Decision Phase ---
  23402. --- Application Phase ---
  23403. --- Firing Productions (PE) For State At Depth 1 ---
  23404. --- Inner Elaboration Phase, active level 1 (S1) ---
  23405. Firing apply*operator
  23406. -->
  23407. (I3 ^predict-no N1063 + :O )
  23408. Firing apply*operator*complete
  23409. -->
  23410. (I3 ^predict-no N1062 - :O )
  23411. inner elaboration loop at bottom goal.
  23412. --- Change Working Memory (PE) ---
  23413. =>WM: (14977: I3 ^predict-no N1063)
  23414. <=WM: (14965: N1062 ^status complete)
  23415. <=WM: (14964: I3 ^predict-no N1062)
  23416. --- Firing Productions (IE) For State At Depth 1 ---
  23417. --- Inner Elaboration Phase, active level 1 (S1) ---
  23418. Firing monitor*world
  23419. -->
  23420. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23421. --- Change Working Memory (IE) ---
  23422. --- END Application Phase ---
  23423. --- Output Phase ---
  23424. ENV: Agent did: predict-no for direction U in state State-B
  23425. In State-B moving U
  23426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23427. predict error 0
  23428. dir: dir isL
  23429. --- END Output Phase ---
  23430. |\---- Input Phase ---
  23431. =>WM: (14981: I2 ^dir L)
  23432. =>WM: (14980: I2 ^reward 1)
  23433. =>WM: (14979: I2 ^see 0)
  23434. =>WM: (14978: N1063 ^status complete)
  23435. <=WM: (14968: I2 ^dir U)
  23436. <=WM: (14967: I2 ^reward 1)
  23437. <=WM: (14966: I2 ^see 0)
  23438. =>WM: (14982: I2 ^level-1 R0-root)
  23439. <=WM: (14969: I2 ^level-1 R0-root)
  23440. --- END Input Phase ---
  23441. --- Proposal Phase ---
  23442. --- Inner Elaboration Phase, active level 1 (S1) ---
  23443. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  23444. -->
  23445. (S1 ^operator O2125 = 0.7359007881613358)
  23446. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  23447. -->
  23448. Firing elaborate*copy-see-to-output-link
  23449. -->
  23450. (I3 ^see 0 +)
  23451. Firing elaborate*reward*based*on*reward
  23452. -->
  23453. (R1067 ^value 1 +)
  23454. (R1 ^reward R1067 +)
  23455. Firing propose*predict-yes
  23456. -->
  23457. (O2127 ^name predict-yes +)
  23458. (S1 ^operator O2127 +)
  23459. Firing propose*predict-no
  23460. -->
  23461. (O2128 ^name predict-no +)
  23462. (S1 ^operator O2128 +)
  23463. Firing rl*prefer*rvt*predict-no*H0*6
  23464. -->
  23465. (S1 ^operator O2126 = 0.9531240445229402)
  23466. Firing rl*prefer*rvt*predict-yes*H0*5
  23467. -->
  23468. (S1 ^operator O2125 = 0.2640265760956406)
  23469. Firing prefer*rvt*predict-yes*H0
  23470. -->
  23471. Firing prefer*rvt*predict-no*H0
  23472. -->
  23473. Firing elaborate*copy-dir-to-output-link
  23474. -->
  23475. (I3 ^dir L +)
  23476. inner elaboration loop at bottom goal.
  23477. Retracting elaborate*copy-see-to-output-link
  23478. -->
  23479. (I3 ^see 0 +)
  23480. Retracting propose*predict-no
  23481. -->
  23482. (O2126 ^name predict-no +)
  23483. (S1 ^operator O2126 +)
  23484. Retracting propose*predict-yes
  23485. -->
  23486. (O2125 ^name predict-yes +)
  23487. (S1 ^operator O2125 +)
  23488. Retracting elaborate*reward*based*on*reward
  23489. -->
  23490. (R1066 ^value 1 +)
  23491. (R1 ^reward R1066 +)
  23492. Retracting elaborate*copy-dir-to-output-link
  23493. -->
  23494. (I3 ^dir U +)
  23495. Retracting rl*prefer*rvt*predict-no*H0*2
  23496. -->
  23497. (S1 ^operator O2126 = 1.)
  23498. Retracting rl*prefer*rvt*predict-yes*H0*1
  23499. -->
  23500. (S1 ^operator O2125 = 0.)
  23501. =>WM: (14989: S1 ^operator O2128 +)
  23502. =>WM: (14988: S1 ^operator O2127 +)
  23503. =>WM: (14987: I3 ^dir L)
  23504. =>WM: (14986: O2128 ^name predict-no)
  23505. =>WM: (14985: O2127 ^name predict-yes)
  23506. =>WM: (14984: R1067 ^value 1)
  23507. =>WM: (14983: R1 ^reward R1067)
  23508. <=WM: (14974: S1 ^operator O2125 +)
  23509. <=WM: (14975: S1 ^operator O2126 +)
  23510. <=WM: (14976: S1 ^operator O2126)
  23511. <=WM: (14947: I3 ^dir U)
  23512. <=WM: (14970: R1 ^reward R1066)
  23513. <=WM: (14973: O2126 ^name predict-no)
  23514. <=WM: (14972: O2125 ^name predict-yes)
  23515. <=WM: (14971: R1066 ^value 1)
  23516. --- Inner Elaboration Phase, active level 1 (S1) ---
  23517. Firing prefer*rvt*predict-yes*H0
  23518. -->
  23519. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  23520. -->
  23521. (S1 ^operator O2127 = 0.7359007881613358)
  23522. Firing rl*prefer*rvt*predict-yes*H0*5
  23523. -->
  23524. (S1 ^operator O2127 = 0.2640265760956406)
  23525. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  23526. -->
  23527. Firing prefer*rvt*predict-no*H0
  23528. -->
  23529. Firing rl*prefer*rvt*predict-no*H0*6
  23530. -->
  23531. (S1 ^operator O2128 = 0.9531240445229402)
  23532. inner elaboration loop at bottom goal.
  23533. Retracting rl*prefer*rvt*predict-no*H0*6
  23534. -->
  23535. (S1 ^operator O2126 = 0.9531240445229402)
  23536. Retracting rl*prefer*rvt*predict-yes*H0*5
  23537. -->
  23538. (S1 ^operator O2125 = 0.2640265760956406)
  23539. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  23540. -->
  23541. (S1 ^operator O2125 = 0.7359007881613358)
  23542. --- END Proposal Phase ---
  23543. --- Decision Phase ---
  23544. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23545. =>WM: (14990: S1 ^operator O2127)
  23546. 1064: O: O2127 (predict-yes)
  23547. --- END Decision Phase ---
  23548. --- Application Phase ---
  23549. --- Firing Productions (PE) For State At Depth 1 ---
  23550. --- Inner Elaboration Phase, active level 1 (S1) ---
  23551. Firing apply*operator
  23552. -->
  23553. (I3 ^predict-yes N1064 + :O )
  23554. Firing apply*operator*complete
  23555. -->
  23556. (I3 ^predict-no N1063 - :O )
  23557. inner elaboration loop at bottom goal.
  23558. --- Change Working Memory (PE) ---
  23559. =>WM: (14991: I3 ^predict-yes N1064)
  23560. <=WM: (14978: N1063 ^status complete)
  23561. <=WM: (14977: I3 ^predict-no N1063)
  23562. --- Firing Productions (IE) For State At Depth 1 ---
  23563. --- Inner Elaboration Phase, active level 1 (S1) ---
  23564. Firing monitor*world
  23565. -->
  23566. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23567. --- Change Working Memory (IE) ---
  23568. --- END Application Phase ---
  23569. --- Output Phase ---
  23570. ENV: Agent did: predict-yes for direction L in state State-B
  23571. In State-B moving L
  23572. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  23573. predict error 0
  23574. dir: dir isU
  23575. --- END Output Phase ---
  23576. /|\--- Input Phase ---
  23577. =>WM: (14995: I2 ^dir U)
  23578. =>WM: (14994: I2 ^reward 1)
  23579. =>WM: (14993: I2 ^see 1)
  23580. =>WM: (14992: N1064 ^status complete)
  23581. <=WM: (14981: I2 ^dir L)
  23582. <=WM: (14980: I2 ^reward 1)
  23583. <=WM: (14979: I2 ^see 0)
  23584. =>WM: (14996: I2 ^level-1 L1-root)
  23585. <=WM: (14982: I2 ^level-1 R0-root)
  23586. --- END Input Phase ---
  23587. --- Proposal Phase ---
  23588. --- Inner Elaboration Phase, active level 1 (S1) ---
  23589. Firing elaborate*copy-see-to-output-link
  23590. -->
  23591. (I3 ^see 1 +)
  23592. Firing elaborate*reward*based*on*reward
  23593. -->
  23594. (R1068 ^value 1 +)
  23595. (R1 ^reward R1068 +)
  23596. Firing propose*predict-yes
  23597. -->
  23598. (O2129 ^name predict-yes +)
  23599. (S1 ^operator O2129 +)
  23600. Firing propose*predict-no
  23601. -->
  23602. (O2130 ^name predict-no +)
  23603. (S1 ^operator O2130 +)
  23604. Firing rl*prefer*rvt*predict-no*H0*2
  23605. -->
  23606. (S1 ^operator O2128 = 1.)
  23607. Firing rl*prefer*rvt*predict-yes*H0*1
  23608. -->
  23609. (S1 ^operator O2127 = 0.)
  23610. Firing prefer*rvt*predict-yes*H0
  23611. -->
  23612. Firing prefer*rvt*predict-no*H0
  23613. -->
  23614. Firing elaborate*copy-dir-to-output-link
  23615. -->
  23616. (I3 ^dir U +)
  23617. inner elaboration loop at bottom goal.
  23618. Retracting elaborate*copy-see-to-output-link
  23619. -->
  23620. (I3 ^see 0 +)
  23621. Retracting propose*predict-no
  23622. -->
  23623. (O2128 ^name predict-no +)
  23624. (S1 ^operator O2128 +)
  23625. Retracting propose*predict-yes
  23626. -->
  23627. (O2127 ^name predict-yes +)
  23628. (S1 ^operator O2127 +)
  23629. Retracting elaborate*reward*based*on*reward
  23630. -->
  23631. (R1067 ^value 1 +)
  23632. (R1 ^reward R1067 +)
  23633. Retracting elaborate*copy-dir-to-output-link
  23634. -->
  23635. (I3 ^dir L +)
  23636. Retracting rl*prefer*rvt*predict-no*H0*6
  23637. -->
  23638. (S1 ^operator O2128 = 0.9531240445229402)
  23639. Retracting rl*prefer*rvt*predict-yes*H0*5
  23640. -->
  23641. (S1 ^operator O2127 = 0.2640265760956406)
  23642. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  23643. -->
  23644. (S1 ^operator O2127 = 0.7359007881613358)
  23645. =>WM: (15004: S1 ^operator O2130 +)
  23646. =>WM: (15003: S1 ^operator O2129 +)
  23647. =>WM: (15002: I3 ^dir U)
  23648. =>WM: (15001: O2130 ^name predict-no)
  23649. =>WM: (15000: O2129 ^name predict-yes)
  23650. =>WM: (14999: R1068 ^value 1)
  23651. =>WM: (14998: R1 ^reward R1068)
  23652. =>WM: (14997: I3 ^see 1)
  23653. <=WM: (14988: S1 ^operator O2127 +)
  23654. <=WM: (14990: S1 ^operator O2127)
  23655. <=WM: (14989: S1 ^operator O2128 +)
  23656. <=WM: (14987: I3 ^dir L)
  23657. <=WM: (14983: R1 ^reward R1067)
  23658. <=WM: (14915: I3 ^see 0)
  23659. <=WM: (14986: O2128 ^name predict-no)
  23660. <=WM: (14985: O2127 ^name predict-yes)
  23661. <=WM: (14984: R1067 ^value 1)
  23662. --- Inner Elaboration Phase, active level 1 (S1) ---
  23663. Firing prefer*rvt*predict-yes*H0
  23664. -->
  23665. Firing rl*prefer*rvt*predict-yes*H0*1
  23666. -->
  23667. (S1 ^operator O2129 = 0.)
  23668. Firing prefer*rvt*predict-no*H0
  23669. -->
  23670. Firing rl*prefer*rvt*predict-no*H0*2
  23671. -->
  23672. (S1 ^operator O2130 = 1.)
  23673. inner elaboration loop at bottom goal.
  23674. Retracting rl*prefer*rvt*predict-no*H0*2
  23675. -->
  23676. (S1 ^operator O2128 = 1.)
  23677. Retracting rl*prefer*rvt*predict-yes*H0*1
  23678. -->
  23679. (S1 ^operator O2127 = 0.)
  23680. --- END Proposal Phase ---
  23681. --- Decision Phase ---
  23682. RL update rl*prefer*rvt*predict-yes*H0*5 0.554412 -0.290386 0.264027 -> 0.554418 -0.290385 0.264032(R,m,v=1,0.88172,0.104853)
  23683. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445516 0.290384 0.735901 -> 0.445523 0.290384 0.735908(R,m,v=1,1,0)
  23684. =>WM: (15005: S1 ^operator O2130)
  23685. 1065: O: O2130 (predict-no)
  23686. --- END Decision Phase ---
  23687. --- Application Phase ---
  23688. --- Firing Productions (PE) For State At Depth 1 ---
  23689. --- Inner Elaboration Phase, active level 1 (S1) ---
  23690. Firing apply*operator
  23691. -->
  23692. (I3 ^predict-no N1065 + :O )
  23693. Firing apply*operator*complete
  23694. -->
  23695. (I3 ^predict-yes N1064 - :O )
  23696. inner elaboration loop at bottom goal.
  23697. --- Change Working Memory (PE) ---
  23698. =>WM: (15006: I3 ^predict-no N1065)
  23699. <=WM: (14992: N1064 ^status complete)
  23700. <=WM: (14991: I3 ^predict-yes N1064)
  23701. --- Firing Productions (IE) For State At Depth 1 ---
  23702. --- Inner Elaboration Phase, active level 1 (S1) ---
  23703. Firing monitor*world
  23704. -->
  23705. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23706. --- Change Working Memory (IE) ---
  23707. --- END Application Phase ---
  23708. --- Output Phase ---
  23709. ENV: Agent did: predict-no for direction U in state State-A
  23710. In State-A moving U
  23711. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23712. predict error 0
  23713. dir: dir isU
  23714. --- END Output Phase ---
  23715. -/|--- Input Phase ---
  23716. =>WM: (15010: I2 ^dir U)
  23717. =>WM: (15009: I2 ^reward 1)
  23718. =>WM: (15008: I2 ^see 0)
  23719. =>WM: (15007: N1065 ^status complete)
  23720. <=WM: (14995: I2 ^dir U)
  23721. <=WM: (14994: I2 ^reward 1)
  23722. <=WM: (14993: I2 ^see 1)
  23723. =>WM: (15011: I2 ^level-1 L1-root)
  23724. <=WM: (14996: I2 ^level-1 L1-root)
  23725. --- END Input Phase ---
  23726. --- Proposal Phase ---
  23727. --- Inner Elaboration Phase, active level 1 (S1) ---
  23728. Firing elaborate*copy-see-to-output-link
  23729. -->
  23730. (I3 ^see 0 +)
  23731. Firing elaborate*reward*based*on*reward
  23732. -->
  23733. (R1069 ^value 1 +)
  23734. (R1 ^reward R1069 +)
  23735. Firing propose*predict-yes
  23736. -->
  23737. (O2131 ^name predict-yes +)
  23738. (S1 ^operator O2131 +)
  23739. Firing propose*predict-no
  23740. -->
  23741. (O2132 ^name predict-no +)
  23742. (S1 ^operator O2132 +)
  23743. Firing rl*prefer*rvt*predict-no*H0*2
  23744. -->
  23745. (S1 ^operator O2130 = 1.)
  23746. Firing rl*prefer*rvt*predict-yes*H0*1
  23747. -->
  23748. (S1 ^operator O2129 = 0.)
  23749. Firing prefer*rvt*predict-yes*H0
  23750. -->
  23751. Firing prefer*rvt*predict-no*H0
  23752. -->
  23753. Firing elaborate*copy-dir-to-output-link
  23754. -->
  23755. (I3 ^dir U +)
  23756. inner elaboration loop at bottom goal.
  23757. Retracting elaborate*copy-see-to-output-link
  23758. -->
  23759. (I3 ^see 1 +)
  23760. Retracting propose*predict-no
  23761. -->
  23762. (O2130 ^name predict-no +)
  23763. (S1 ^operator O2130 +)
  23764. Retracting propose*predict-yes
  23765. -->
  23766. (O2129 ^name predict-yes +)
  23767. (S1 ^operator O2129 +)
  23768. Retracting elaborate*reward*based*on*reward
  23769. -->
  23770. (R1068 ^value 1 +)
  23771. (R1 ^reward R1068 +)
  23772. Retracting elaborate*copy-dir-to-output-link
  23773. -->
  23774. (I3 ^dir U +)
  23775. Retracting rl*prefer*rvt*predict-no*H0*2
  23776. -->
  23777. (S1 ^operator O2130 = 1.)
  23778. Retracting rl*prefer*rvt*predict-yes*H0*1
  23779. -->
  23780. (S1 ^operator O2129 = 0.)
  23781. =>WM: (15018: S1 ^operator O2132 +)
  23782. =>WM: (15017: S1 ^operator O2131 +)
  23783. =>WM: (15016: O2132 ^name predict-no)
  23784. =>WM: (15015: O2131 ^name predict-yes)
  23785. =>WM: (15014: R1069 ^value 1)
  23786. =>WM: (15013: R1 ^reward R1069)
  23787. =>WM: (15012: I3 ^see 0)
  23788. <=WM: (15003: S1 ^operator O2129 +)
  23789. <=WM: (15004: S1 ^operator O2130 +)
  23790. <=WM: (15005: S1 ^operator O2130)
  23791. <=WM: (14998: R1 ^reward R1068)
  23792. <=WM: (14997: I3 ^see 1)
  23793. <=WM: (15001: O2130 ^name predict-no)
  23794. <=WM: (15000: O2129 ^name predict-yes)
  23795. <=WM: (14999: R1068 ^value 1)
  23796. --- Inner Elaboration Phase, active level 1 (S1) ---
  23797. Firing prefer*rvt*predict-yes*H0
  23798. -->
  23799. Firing rl*prefer*rvt*predict-yes*H0*1
  23800. -->
  23801. (S1 ^operator O2131 = 0.)
  23802. Firing prefer*rvt*predict-no*H0
  23803. -->
  23804. Firing rl*prefer*rvt*predict-no*H0*2
  23805. -->
  23806. (S1 ^operator O2132 = 1.)
  23807. inner elaboration loop at bottom goal.
  23808. Retracting rl*prefer*rvt*predict-no*H0*2
  23809. -->
  23810. (S1 ^operator O2130 = 1.)
  23811. Retracting rl*prefer*rvt*predict-yes*H0*1
  23812. -->
  23813. (S1 ^operator O2129 = 0.)
  23814. --- END Proposal Phase ---
  23815. --- Decision Phase ---
  23816. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23817. =>WM: (15019: S1 ^operator O2132)
  23818. 1066: O: O2132 (predict-no)
  23819. --- END Decision Phase ---
  23820. --- Application Phase ---
  23821. --- Firing Productions (PE) For State At Depth 1 ---
  23822. --- Inner Elaboration Phase, active level 1 (S1) ---
  23823. Firing apply*operator
  23824. -->
  23825. (I3 ^predict-no N1066 + :O )
  23826. Firing apply*operator*complete
  23827. -->
  23828. (I3 ^predict-no N1065 - :O )
  23829. inner elaboration loop at bottom goal.
  23830. --- Change Working Memory (PE) ---
  23831. =>WM: (15020: I3 ^predict-no N1066)
  23832. <=WM: (15007: N1065 ^status complete)
  23833. <=WM: (15006: I3 ^predict-no N1065)
  23834. --- Firing Productions (IE) For State At Depth 1 ---
  23835. --- Inner Elaboration Phase, active level 1 (S1) ---
  23836. Firing monitor*world
  23837. -->
  23838. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23839. --- Change Working Memory (IE) ---
  23840. --- END Application Phase ---
  23841. --- Output Phase ---
  23842. ENV: Agent did: predict-no for direction U in state State-A
  23843. In State-A moving U
  23844. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23845. predict error 0
  23846. dir: dir isR
  23847. --- END Output Phase ---
  23848. \---- Input Phase ---
  23849. =>WM: (15024: I2 ^dir R)
  23850. =>WM: (15023: I2 ^reward 1)
  23851. =>WM: (15022: I2 ^see 0)
  23852. =>WM: (15021: N1066 ^status complete)
  23853. <=WM: (15010: I2 ^dir U)
  23854. <=WM: (15009: I2 ^reward 1)
  23855. <=WM: (15008: I2 ^see 0)
  23856. =>WM: (15025: I2 ^level-1 L1-root)
  23857. <=WM: (15011: I2 ^level-1 L1-root)
  23858. --- END Input Phase ---
  23859. --- Proposal Phase ---
  23860. --- Inner Elaboration Phase, active level 1 (S1) ---
  23861. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  23862. -->
  23863. (S1 ^operator O2132 = -0.2714224023553999)
  23864. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  23865. -->
  23866. (S1 ^operator O2131 = 0.6622318078141458)
  23867. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23868. -->
  23869. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23870. -->
  23871. Firing elaborate*copy-see-to-output-link
  23872. -->
  23873. (I3 ^see 0 +)
  23874. Firing elaborate*reward*based*on*reward
  23875. -->
  23876. (R1070 ^value 1 +)
  23877. (R1 ^reward R1070 +)
  23878. Firing propose*predict-yes
  23879. -->
  23880. (O2133 ^name predict-yes +)
  23881. (S1 ^operator O2133 +)
  23882. Firing propose*predict-no
  23883. -->
  23884. (O2134 ^name predict-no +)
  23885. (S1 ^operator O2134 +)
  23886. Firing rl*prefer*rvt*predict-no*H0*4
  23887. -->
  23888. (S1 ^operator O2132 = 0.3397908847802913)
  23889. Firing rl*prefer*rvt*predict-yes*H0*3
  23890. -->
  23891. (S1 ^operator O2131 = 0.3376969893996755)
  23892. Firing prefer*rvt*predict-yes*H0
  23893. -->
  23894. Firing prefer*rvt*predict-no*H0
  23895. -->
  23896. Firing elaborate*copy-dir-to-output-link
  23897. -->
  23898. (I3 ^dir R +)
  23899. inner elaboration loop at bottom goal.
  23900. Retracting elaborate*copy-see-to-output-link
  23901. -->
  23902. (I3 ^see 0 +)
  23903. Retracting propose*predict-no
  23904. -->
  23905. (O2132 ^name predict-no +)
  23906. (S1 ^operator O2132 +)
  23907. Retracting propose*predict-yes
  23908. -->
  23909. (O2131 ^name predict-yes +)
  23910. (S1 ^operator O2131 +)
  23911. Retracting elaborate*reward*based*on*reward
  23912. -->
  23913. (R1069 ^value 1 +)
  23914. (R1 ^reward R1069 +)
  23915. Retracting elaborate*copy-dir-to-output-link
  23916. -->
  23917. (I3 ^dir U +)
  23918. Retracting rl*prefer*rvt*predict-no*H0*2
  23919. -->
  23920. (S1 ^operator O2132 = 1.)
  23921. Retracting rl*prefer*rvt*predict-yes*H0*1
  23922. -->
  23923. (S1 ^operator O2131 = 0.)
  23924. =>WM: (15032: S1 ^operator O2134 +)
  23925. =>WM: (15031: S1 ^operator O2133 +)
  23926. =>WM: (15030: I3 ^dir R)
  23927. =>WM: (15029: O2134 ^name predict-no)
  23928. =>WM: (15028: O2133 ^name predict-yes)
  23929. =>WM: (15027: R1070 ^value 1)
  23930. =>WM: (15026: R1 ^reward R1070)
  23931. <=WM: (15017: S1 ^operator O2131 +)
  23932. <=WM: (15018: S1 ^operator O2132 +)
  23933. <=WM: (15019: S1 ^operator O2132)
  23934. <=WM: (15002: I3 ^dir U)
  23935. <=WM: (15013: R1 ^reward R1069)
  23936. <=WM: (15016: O2132 ^name predict-no)
  23937. <=WM: (15015: O2131 ^name predict-yes)
  23938. <=WM: (15014: R1069 ^value 1)
  23939. --- Inner Elaboration Phase, active level 1 (S1) ---
  23940. Firing prefer*rvt*predict-yes*H0
  23941. -->
  23942. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  23943. -->
  23944. (S1 ^operator O2133 = 0.6622318078141458)
  23945. Firing rl*prefer*rvt*predict-yes*H0*3
  23946. -->
  23947. (S1 ^operator O2133 = 0.3376969893996755)
  23948. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  23949. -->
  23950. Firing prefer*rvt*predict-no*H0
  23951. -->
  23952. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  23953. -->
  23954. (S1 ^operator O2134 = -0.2714224023553999)
  23955. Firing rl*prefer*rvt*predict-no*H0*4
  23956. -->
  23957. (S1 ^operator O2134 = 0.3397908847802913)
  23958. Firing prefer*rvt*predict-no*H0*4*v1*H1
  23959. -->
  23960. inner elaboration loop at bottom goal.
  23961. Retracting rl*prefer*rvt*predict-no*H0*4
  23962. -->
  23963. (S1 ^operator O2132 = 0.3397908847802913)
  23964. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  23965. -->
  23966. (S1 ^operator O2132 = -0.2714224023553999)
  23967. Retracting rl*prefer*rvt*predict-yes*H0*3
  23968. -->
  23969. (S1 ^operator O2131 = 0.3376969893996755)
  23970. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  23971. -->
  23972. (S1 ^operator O2131 = 0.6622318078141458)
  23973. --- END Proposal Phase ---
  23974. --- Decision Phase ---
  23975. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23976. =>WM: (15033: S1 ^operator O2133)
  23977. 1067: O: O2133 (predict-yes)
  23978. --- END Decision Phase ---
  23979. --- Application Phase ---
  23980. --- Firing Productions (PE) For State At Depth 1 ---
  23981. --- Inner Elaboration Phase, active level 1 (S1) ---
  23982. Firing apply*operator
  23983. -->
  23984. (I3 ^predict-yes N1067 + :O )
  23985. Firing apply*operator*complete
  23986. -->
  23987. (I3 ^predict-no N1066 - :O )
  23988. inner elaboration loop at bottom goal.
  23989. --- Change Working Memory (PE) ---
  23990. =>WM: (15034: I3 ^predict-yes N1067)
  23991. <=WM: (15021: N1066 ^status complete)
  23992. <=WM: (15020: I3 ^predict-no N1066)
  23993. --- Firing Productions (IE) For State At Depth 1 ---
  23994. --- Inner Elaboration Phase, active level 1 (S1) ---
  23995. Firing monitor*world
  23996. -->
  23997. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23998. --- Change Working Memory (IE) ---
  23999. --- END Application Phase ---
  24000. --- Output Phase ---
  24001. ENV: Agent did: predict-yes for direction R in state State-A
  24002. In State-A moving R
  24003. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  24004. predict error 0
  24005. dir: dir isU
  24006. --- END Output Phase ---
  24007. /|\--- Input Phase ---
  24008. =>WM: (15038: I2 ^dir U)
  24009. =>WM: (15037: I2 ^reward 1)
  24010. =>WM: (15036: I2 ^see 1)
  24011. =>WM: (15035: N1067 ^status complete)
  24012. <=WM: (15024: I2 ^dir R)
  24013. <=WM: (15023: I2 ^reward 1)
  24014. <=WM: (15022: I2 ^see 0)
  24015. =>WM: (15039: I2 ^level-1 R1-root)
  24016. <=WM: (15025: I2 ^level-1 L1-root)
  24017. --- END Input Phase ---
  24018. --- Proposal Phase ---
  24019. --- Inner Elaboration Phase, active level 1 (S1) ---
  24020. Firing elaborate*copy-see-to-output-link
  24021. -->
  24022. (I3 ^see 1 +)
  24023. Firing elaborate*reward*based*on*reward
  24024. -->
  24025. (R1071 ^value 1 +)
  24026. (R1 ^reward R1071 +)
  24027. Firing propose*predict-yes
  24028. -->
  24029. (O2135 ^name predict-yes +)
  24030. (S1 ^operator O2135 +)
  24031. Firing propose*predict-no
  24032. -->
  24033. (O2136 ^name predict-no +)
  24034. (S1 ^operator O2136 +)
  24035. Firing rl*prefer*rvt*predict-no*H0*2
  24036. -->
  24037. (S1 ^operator O2134 = 1.)
  24038. Firing rl*prefer*rvt*predict-yes*H0*1
  24039. -->
  24040. (S1 ^operator O2133 = 0.)
  24041. Firing prefer*rvt*predict-yes*H0
  24042. -->
  24043. Firing prefer*rvt*predict-no*H0
  24044. -->
  24045. Firing elaborate*copy-dir-to-output-link
  24046. -->
  24047. (I3 ^dir U +)
  24048. inner elaboration loop at bottom goal.
  24049. Retracting elaborate*copy-see-to-output-link
  24050. -->
  24051. (I3 ^see 0 +)
  24052. Retracting propose*predict-no
  24053. -->
  24054. (O2134 ^name predict-no +)
  24055. (S1 ^operator O2134 +)
  24056. Retracting propose*predict-yes
  24057. -->
  24058. (O2133 ^name predict-yes +)
  24059. (S1 ^operator O2133 +)
  24060. Retracting elaborate*reward*based*on*reward
  24061. -->
  24062. (R1070 ^value 1 +)
  24063. (R1 ^reward R1070 +)
  24064. Retracting elaborate*copy-dir-to-output-link
  24065. -->
  24066. (I3 ^dir R +)
  24067. Retracting rl*prefer*rvt*predict-no*H0*4
  24068. -->
  24069. (S1 ^operator O2134 = 0.3397908847802913)
  24070. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  24071. -->
  24072. (S1 ^operator O2134 = -0.2714224023553999)
  24073. Retracting rl*prefer*rvt*predict-yes*H0*3
  24074. -->
  24075. (S1 ^operator O2133 = 0.3376969893996755)
  24076. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  24077. -->
  24078. (S1 ^operator O2133 = 0.6622318078141458)
  24079. =>WM: (15047: S1 ^operator O2136 +)
  24080. =>WM: (15046: S1 ^operator O2135 +)
  24081. =>WM: (15045: I3 ^dir U)
  24082. =>WM: (15044: O2136 ^name predict-no)
  24083. =>WM: (15043: O2135 ^name predict-yes)
  24084. =>WM: (15042: R1071 ^value 1)
  24085. =>WM: (15041: R1 ^reward R1071)
  24086. =>WM: (15040: I3 ^see 1)
  24087. <=WM: (15031: S1 ^operator O2133 +)
  24088. <=WM: (15033: S1 ^operator O2133)
  24089. <=WM: (15032: S1 ^operator O2134 +)
  24090. <=WM: (15030: I3 ^dir R)
  24091. <=WM: (15026: R1 ^reward R1070)
  24092. <=WM: (15012: I3 ^see 0)
  24093. <=WM: (15029: O2134 ^name predict-no)
  24094. <=WM: (15028: O2133 ^name predict-yes)
  24095. <=WM: (15027: R1070 ^value 1)
  24096. --- Inner Elaboration Phase, active level 1 (S1) ---
  24097. Firing prefer*rvt*predict-yes*H0
  24098. -->
  24099. Firing rl*prefer*rvt*predict-yes*H0*1
  24100. -->
  24101. (S1 ^operator O2135 = 0.)
  24102. Firing prefer*rvt*predict-no*H0
  24103. -->
  24104. Firing rl*prefer*rvt*predict-no*H0*2
  24105. -->
  24106. (S1 ^operator O2136 = 1.)
  24107. inner elaboration loop at bottom goal.
  24108. Retracting rl*prefer*rvt*predict-no*H0*2
  24109. -->
  24110. (S1 ^operator O2134 = 1.)
  24111. Retracting rl*prefer*rvt*predict-yes*H0*1
  24112. -->
  24113. (S1 ^operator O2133 = 0.)
  24114. --- END Proposal Phase ---
  24115. --- Decision Phase ---
  24116. RL update rl*prefer*rvt*predict-yes*H0*3 0.590095 -0.252398 0.337697 -> 0.590102 -0.252399 0.337703(R,m,v=1,0.903409,0.0877597)
  24117. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409823 0.252409 0.662232 -> 0.409831 0.252408 0.662239(R,m,v=1,1,0)
  24118. =>WM: (15048: S1 ^operator O2136)
  24119. 1068: O: O2136 (predict-no)
  24120. --- END Decision Phase ---
  24121. --- Application Phase ---
  24122. --- Firing Productions (PE) For State At Depth 1 ---
  24123. --- Inner Elaboration Phase, active level 1 (S1) ---
  24124. Firing apply*operator
  24125. -->
  24126. (I3 ^predict-no N1068 + :O )
  24127. Firing apply*operator*complete
  24128. -->
  24129. (I3 ^predict-yes N1067 - :O )
  24130. inner elaboration loop at bottom goal.
  24131. --- Change Working Memory (PE) ---
  24132. =>WM: (15049: I3 ^predict-no N1068)
  24133. <=WM: (15035: N1067 ^status complete)
  24134. <=WM: (15034: I3 ^predict-yes N1067)
  24135. --- Firing Productions (IE) For State At Depth 1 ---
  24136. --- Inner Elaboration Phase, active level 1 (S1) ---
  24137. Firing monitor*world
  24138. -->
  24139. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24140. --- Change Working Memory (IE) ---
  24141. --- END Application Phase ---
  24142. --- Output Phase ---
  24143. ENV: Agent did: predict-no for direction U in state State-B
  24144. In State-B moving U
  24145. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24146. predict error 0
  24147. dir: dir isL
  24148. --- END Output Phase ---
  24149. -/|--- Input Phase ---
  24150. =>WM: (15053: I2 ^dir L)
  24151. =>WM: (15052: I2 ^reward 1)
  24152. =>WM: (15051: I2 ^see 0)
  24153. =>WM: (15050: N1068 ^status complete)
  24154. <=WM: (15038: I2 ^dir U)
  24155. <=WM: (15037: I2 ^reward 1)
  24156. <=WM: (15036: I2 ^see 1)
  24157. =>WM: (15054: I2 ^level-1 R1-root)
  24158. <=WM: (15039: I2 ^level-1 R1-root)
  24159. --- END Input Phase ---
  24160. --- Proposal Phase ---
  24161. --- Inner Elaboration Phase, active level 1 (S1) ---
  24162. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24163. -->
  24164. (S1 ^operator O2135 = 0.7361839628082684)
  24165. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24166. -->
  24167. Firing elaborate*copy-see-to-output-link
  24168. -->
  24169. (I3 ^see 0 +)
  24170. Firing elaborate*reward*based*on*reward
  24171. -->
  24172. (R1072 ^value 1 +)
  24173. (R1 ^reward R1072 +)
  24174. Firing propose*predict-yes
  24175. -->
  24176. (O2137 ^name predict-yes +)
  24177. (S1 ^operator O2137 +)
  24178. Firing propose*predict-no
  24179. -->
  24180. (O2138 ^name predict-no +)
  24181. (S1 ^operator O2138 +)
  24182. Firing rl*prefer*rvt*predict-no*H0*6
  24183. -->
  24184. (S1 ^operator O2136 = 0.9531240445229402)
  24185. Firing rl*prefer*rvt*predict-yes*H0*5
  24186. -->
  24187. (S1 ^operator O2135 = 0.2640324095921535)
  24188. Firing prefer*rvt*predict-yes*H0
  24189. -->
  24190. Firing prefer*rvt*predict-no*H0
  24191. -->
  24192. Firing elaborate*copy-dir-to-output-link
  24193. -->
  24194. (I3 ^dir L +)
  24195. inner elaboration loop at bottom goal.
  24196. Retracting elaborate*copy-see-to-output-link
  24197. -->
  24198. (I3 ^see 1 +)
  24199. Retracting propose*predict-no
  24200. -->
  24201. (O2136 ^name predict-no +)
  24202. (S1 ^operator O2136 +)
  24203. Retracting propose*predict-yes
  24204. -->
  24205. (O2135 ^name predict-yes +)
  24206. (S1 ^operator O2135 +)
  24207. Retracting elaborate*reward*based*on*reward
  24208. -->
  24209. (R1071 ^value 1 +)
  24210. (R1 ^reward R1071 +)
  24211. Retracting elaborate*copy-dir-to-output-link
  24212. -->
  24213. (I3 ^dir U +)
  24214. Retracting rl*prefer*rvt*predict-no*H0*2
  24215. -->
  24216. (S1 ^operator O2136 = 1.)
  24217. Retracting rl*prefer*rvt*predict-yes*H0*1
  24218. -->
  24219. (S1 ^operator O2135 = 0.)
  24220. =>WM: (15062: S1 ^operator O2138 +)
  24221. =>WM: (15061: S1 ^operator O2137 +)
  24222. =>WM: (15060: I3 ^dir L)
  24223. =>WM: (15059: O2138 ^name predict-no)
  24224. =>WM: (15058: O2137 ^name predict-yes)
  24225. =>WM: (15057: R1072 ^value 1)
  24226. =>WM: (15056: R1 ^reward R1072)
  24227. =>WM: (15055: I3 ^see 0)
  24228. <=WM: (15046: S1 ^operator O2135 +)
  24229. <=WM: (15047: S1 ^operator O2136 +)
  24230. <=WM: (15048: S1 ^operator O2136)
  24231. <=WM: (15045: I3 ^dir U)
  24232. <=WM: (15041: R1 ^reward R1071)
  24233. <=WM: (15040: I3 ^see 1)
  24234. <=WM: (15044: O2136 ^name predict-no)
  24235. <=WM: (15043: O2135 ^name predict-yes)
  24236. <=WM: (15042: R1071 ^value 1)
  24237. --- Inner Elaboration Phase, active level 1 (S1) ---
  24238. Firing prefer*rvt*predict-yes*H0
  24239. -->
  24240. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24241. -->
  24242. (S1 ^operator O2137 = 0.7361839628082684)
  24243. Firing rl*prefer*rvt*predict-yes*H0*5
  24244. -->
  24245. (S1 ^operator O2137 = 0.2640324095921535)
  24246. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24247. -->
  24248. Firing prefer*rvt*predict-no*H0
  24249. -->
  24250. Firing rl*prefer*rvt*predict-no*H0*6
  24251. -->
  24252. (S1 ^operator O2138 = 0.9531240445229402)
  24253. inner elaboration loop at bottom goal.
  24254. Retracting rl*prefer*rvt*predict-no*H0*6
  24255. -->
  24256. (S1 ^operator O2136 = 0.9531240445229402)
  24257. Retracting rl*prefer*rvt*predict-yes*H0*5
  24258. -->
  24259. (S1 ^operator O2135 = 0.2640324095921535)
  24260. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24261. -->
  24262. (S1 ^operator O2135 = 0.7361839628082684)
  24263. --- END Proposal Phase ---
  24264. --- Decision Phase ---
  24265. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24266. =>WM: (15063: S1 ^operator O2137)
  24267. 1069: O: O2137 (predict-yes)
  24268. --- END Decision Phase ---
  24269. --- Application Phase ---
  24270. --- Firing Productions (PE) For State At Depth 1 ---
  24271. --- Inner Elaboration Phase, active level 1 (S1) ---
  24272. Firing apply*operator
  24273. -->
  24274. (I3 ^predict-yes N1069 + :O )
  24275. Firing apply*operator*complete
  24276. -->
  24277. (I3 ^predict-no N1068 - :O )
  24278. inner elaboration loop at bottom goal.
  24279. --- Change Working Memory (PE) ---
  24280. =>WM: (15064: I3 ^predict-yes N1069)
  24281. <=WM: (15050: N1068 ^status complete)
  24282. <=WM: (15049: I3 ^predict-no N1068)
  24283. --- Firing Productions (IE) For State At Depth 1 ---
  24284. --- Inner Elaboration Phase, active level 1 (S1) ---
  24285. Firing monitor*world
  24286. -->
  24287. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24288. --- Change Working Memory (IE) ---
  24289. --- END Application Phase ---
  24290. --- Output Phase ---
  24291. ENV: Agent did: predict-yes for direction L in state State-B
  24292. In State-B moving L
  24293. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  24294. predict error 0
  24295. dir: dir isU
  24296. --- END Output Phase ---
  24297. \-/--- Input Phase ---
  24298. =>WM: (15068: I2 ^dir U)
  24299. =>WM: (15067: I2 ^reward 1)
  24300. =>WM: (15066: I2 ^see 1)
  24301. =>WM: (15065: N1069 ^status complete)
  24302. <=WM: (15053: I2 ^dir L)
  24303. <=WM: (15052: I2 ^reward 1)
  24304. <=WM: (15051: I2 ^see 0)
  24305. =>WM: (15069: I2 ^level-1 L1-root)
  24306. <=WM: (15054: I2 ^level-1 R1-root)
  24307. --- END Input Phase ---
  24308. --- Proposal Phase ---
  24309. --- Inner Elaboration Phase, active level 1 (S1) ---
  24310. Firing elaborate*copy-see-to-output-link
  24311. -->
  24312. (I3 ^see 1 +)
  24313. Firing elaborate*reward*based*on*reward
  24314. -->
  24315. (R1073 ^value 1 +)
  24316. (R1 ^reward R1073 +)
  24317. Firing propose*predict-yes
  24318. -->
  24319. (O2139 ^name predict-yes +)
  24320. (S1 ^operator O2139 +)
  24321. Firing propose*predict-no
  24322. -->
  24323. (O2140 ^name predict-no +)
  24324. (S1 ^operator O2140 +)
  24325. Firing rl*prefer*rvt*predict-no*H0*2
  24326. -->
  24327. (S1 ^operator O2138 = 1.)
  24328. Firing rl*prefer*rvt*predict-yes*H0*1
  24329. -->
  24330. (S1 ^operator O2137 = 0.)
  24331. Firing prefer*rvt*predict-yes*H0
  24332. -->
  24333. Firing prefer*rvt*predict-no*H0
  24334. -->
  24335. Firing elaborate*copy-dir-to-output-link
  24336. -->
  24337. (I3 ^dir U +)
  24338. inner elaboration loop at bottom goal.
  24339. Retracting elaborate*copy-see-to-output-link
  24340. -->
  24341. (I3 ^see 0 +)
  24342. Retracting propose*predict-no
  24343. -->
  24344. (O2138 ^name predict-no +)
  24345. (S1 ^operator O2138 +)
  24346. Retracting propose*predict-yes
  24347. -->
  24348. (O2137 ^name predict-yes +)
  24349. (S1 ^operator O2137 +)
  24350. Retracting elaborate*reward*based*on*reward
  24351. -->
  24352. (R1072 ^value 1 +)
  24353. (R1 ^reward R1072 +)
  24354. Retracting elaborate*copy-dir-to-output-link
  24355. -->
  24356. (I3 ^dir L +)
  24357. Retracting rl*prefer*rvt*predict-no*H0*6
  24358. -->
  24359. (S1 ^operator O2138 = 0.9531240445229402)
  24360. Retracting rl*prefer*rvt*predict-yes*H0*5
  24361. -->
  24362. (S1 ^operator O2137 = 0.2640324095921535)
  24363. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24364. -->
  24365. (S1 ^operator O2137 = 0.7361839628082684)
  24366. =>WM: (15077: S1 ^operator O2140 +)
  24367. =>WM: (15076: S1 ^operator O2139 +)
  24368. =>WM: (15075: I3 ^dir U)
  24369. =>WM: (15074: O2140 ^name predict-no)
  24370. =>WM: (15073: O2139 ^name predict-yes)
  24371. =>WM: (15072: R1073 ^value 1)
  24372. =>WM: (15071: R1 ^reward R1073)
  24373. =>WM: (15070: I3 ^see 1)
  24374. <=WM: (15061: S1 ^operator O2137 +)
  24375. <=WM: (15063: S1 ^operator O2137)
  24376. <=WM: (15062: S1 ^operator O2138 +)
  24377. <=WM: (15060: I3 ^dir L)
  24378. <=WM: (15056: R1 ^reward R1072)
  24379. <=WM: (15055: I3 ^see 0)
  24380. <=WM: (15059: O2138 ^name predict-no)
  24381. <=WM: (15058: O2137 ^name predict-yes)
  24382. <=WM: (15057: R1072 ^value 1)
  24383. --- Inner Elaboration Phase, active level 1 (S1) ---
  24384. Firing prefer*rvt*predict-yes*H0
  24385. -->
  24386. Firing rl*prefer*rvt*predict-yes*H0*1
  24387. -->
  24388. (S1 ^operator O2139 = 0.)
  24389. Firing prefer*rvt*predict-no*H0
  24390. -->
  24391. Firing rl*prefer*rvt*predict-no*H0*2
  24392. -->
  24393. (S1 ^operator O2140 = 1.)
  24394. inner elaboration loop at bottom goal.
  24395. Retracting rl*prefer*rvt*predict-no*H0*2
  24396. -->
  24397. (S1 ^operator O2138 = 1.)
  24398. Retracting rl*prefer*rvt*predict-yes*H0*1
  24399. -->
  24400. (S1 ^operator O2137 = 0.)
  24401. --- END Proposal Phase ---
  24402. --- Decision Phase ---
  24403. RL update rl*prefer*rvt*predict-yes*H0*5 0.554418 -0.290385 0.264032 -> 0.554401 -0.290386 0.264015(R,m,v=1,0.882353,0.104364)
  24404. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445795 0.290389 0.736184 -> 0.445775 0.290389 0.736164(R,m,v=1,1,0)
  24405. =>WM: (15078: S1 ^operator O2140)
  24406. 1070: O: O2140 (predict-no)
  24407. --- END Decision Phase ---
  24408. --- Application Phase ---
  24409. --- Firing Productions (PE) For State At Depth 1 ---
  24410. --- Inner Elaboration Phase, active level 1 (S1) ---
  24411. Firing apply*operator
  24412. -->
  24413. (I3 ^predict-no N1070 + :O )
  24414. Firing apply*operator*complete
  24415. -->
  24416. (I3 ^predict-yes N1069 - :O )
  24417. inner elaboration loop at bottom goal.
  24418. --- Change Working Memory (PE) ---
  24419. =>WM: (15079: I3 ^predict-no N1070)
  24420. <=WM: (15065: N1069 ^status complete)
  24421. <=WM: (15064: I3 ^predict-yes N1069)
  24422. --- Firing Productions (IE) For State At Depth 1 ---
  24423. --- Inner Elaboration Phase, active level 1 (S1) ---
  24424. Firing monitor*world
  24425. -->
  24426. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24427. --- Change Working Memory (IE) ---
  24428. --- END Application Phase ---
  24429. --- Output Phase ---
  24430. ENV: Agent did: predict-no for direction U in state State-A
  24431. In State-A moving U
  24432. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24433. predict error 0
  24434. dir: dir isU
  24435. --- END Output Phase ---
  24436. |\--- Input Phase ---
  24437. =>WM: (15083: I2 ^dir U)
  24438. =>WM: (15082: I2 ^reward 1)
  24439. =>WM: (15081: I2 ^see 0)
  24440. =>WM: (15080: N1070 ^status complete)
  24441. <=WM: (15068: I2 ^dir U)
  24442. <=WM: (15067: I2 ^reward 1)
  24443. <=WM: (15066: I2 ^see 1)
  24444. =>WM: (15084: I2 ^level-1 L1-root)
  24445. <=WM: (15069: I2 ^level-1 L1-root)
  24446. --- END Input Phase ---
  24447. --- Proposal Phase ---
  24448. --- Inner Elaboration Phase, active level 1 (S1) ---
  24449. Firing elaborate*copy-see-to-output-link
  24450. -->
  24451. (I3 ^see 0 +)
  24452. Firing elaborate*reward*based*on*reward
  24453. -->
  24454. (R1074 ^value 1 +)
  24455. (R1 ^reward R1074 +)
  24456. Firing propose*predict-yes
  24457. -->
  24458. (O2141 ^name predict-yes +)
  24459. (S1 ^operator O2141 +)
  24460. Firing propose*predict-no
  24461. -->
  24462. (O2142 ^name predict-no +)
  24463. (S1 ^operator O2142 +)
  24464. Firing rl*prefer*rvt*predict-no*H0*2
  24465. -->
  24466. (S1 ^operator O2140 = 1.)
  24467. Firing rl*prefer*rvt*predict-yes*H0*1
  24468. -->
  24469. (S1 ^operator O2139 = 0.)
  24470. Firing prefer*rvt*predict-yes*H0
  24471. -->
  24472. Firing prefer*rvt*predict-no*H0
  24473. -->
  24474. Firing elaborate*copy-dir-to-output-link
  24475. -->
  24476. (I3 ^dir U +)
  24477. inner elaboration loop at bottom goal.
  24478. Retracting elaborate*copy-see-to-output-link
  24479. -->
  24480. (I3 ^see 1 +)
  24481. Retracting propose*predict-no
  24482. -->
  24483. (O2140 ^name predict-no +)
  24484. (S1 ^operator O2140 +)
  24485. Retracting propose*predict-yes
  24486. -->
  24487. (O2139 ^name predict-yes +)
  24488. (S1 ^operator O2139 +)
  24489. Retracting elaborate*reward*based*on*reward
  24490. -->
  24491. (R1073 ^value 1 +)
  24492. (R1 ^reward R1073 +)
  24493. Retracting elaborate*copy-dir-to-output-link
  24494. -->
  24495. (I3 ^dir U +)
  24496. Retracting rl*prefer*rvt*predict-no*H0*2
  24497. -->
  24498. (S1 ^operator O2140 = 1.)
  24499. Retracting rl*prefer*rvt*predict-yes*H0*1
  24500. -->
  24501. (S1 ^operator O2139 = 0.)
  24502. =>WM: (15091: S1 ^operator O2142 +)
  24503. =>WM: (15090: S1 ^operator O2141 +)
  24504. =>WM: (15089: O2142 ^name predict-no)
  24505. =>WM: (15088: O2141 ^name predict-yes)
  24506. =>WM: (15087: R1074 ^value 1)
  24507. =>WM: (15086: R1 ^reward R1074)
  24508. =>WM: (15085: I3 ^see 0)
  24509. <=WM: (15076: S1 ^operator O2139 +)
  24510. <=WM: (15077: S1 ^operator O2140 +)
  24511. <=WM: (15078: S1 ^operator O2140)
  24512. <=WM: (15071: R1 ^reward R1073)
  24513. <=WM: (15070: I3 ^see 1)
  24514. <=WM: (15074: O2140 ^name predict-no)
  24515. <=WM: (15073: O2139 ^name predict-yes)
  24516. <=WM: (15072: R1073 ^value 1)
  24517. --- Inner Elaboration Phase, active level 1 (S1) ---
  24518. Firing prefer*rvt*predict-yes*H0
  24519. -->
  24520. Firing rl*prefer*rvt*predict-yes*H0*1
  24521. -->
  24522. (S1 ^operator O2141 = 0.)
  24523. Firing prefer*rvt*predict-no*H0
  24524. -->
  24525. Firing rl*prefer*rvt*predict-no*H0*2
  24526. -->
  24527. (S1 ^operator O2142 = 1.)
  24528. inner elaboration loop at bottom goal.
  24529. Retracting rl*prefer*rvt*predict-no*H0*2
  24530. -->
  24531. (S1 ^operator O2140 = 1.)
  24532. Retracting rl*prefer*rvt*predict-yes*H0*1
  24533. -->
  24534. (S1 ^operator O2139 = 0.)
  24535. --- END Proposal Phase ---
  24536. --- Decision Phase ---
  24537. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24538. =>WM: (15092: S1 ^operator O2142)
  24539. 1071: O: O2142 (predict-no)
  24540. --- END Decision Phase ---
  24541. --- Application Phase ---
  24542. --- Firing Productions (PE) For State At Depth 1 ---
  24543. --- Inner Elaboration Phase, active level 1 (S1) ---
  24544. Firing apply*operator
  24545. -->
  24546. (I3 ^predict-no N1071 + :O )
  24547. Firing apply*operator*complete
  24548. -->
  24549. (I3 ^predict-no N1070 - :O )
  24550. inner elaboration loop at bottom goal.
  24551. --- Change Working Memory (PE) ---
  24552. =>WM: (15093: I3 ^predict-no N1071)
  24553. <=WM: (15080: N1070 ^status complete)
  24554. <=WM: (15079: I3 ^predict-no N1070)
  24555. --- Firing Productions (IE) For State At Depth 1 ---
  24556. --- Inner Elaboration Phase, active level 1 (S1) ---
  24557. Firing monitor*world
  24558. -->
  24559. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24560. --- Change Working Memory (IE) ---
  24561. --- END Application Phase ---
  24562. --- Output Phase ---
  24563. ENV: Agent did: predict-no for direction U in state State-A
  24564. In State-A moving U
  24565. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  24566. predict error 0
  24567. dir: dir isR
  24568. --- END Output Phase ---
  24569. ---- Input Phase ---
  24570. =>WM: (15097: I2 ^dir R)
  24571. =>WM: (15096: I2 ^reward 1)
  24572. =>WM: (15095: I2 ^see 0)
  24573. =>WM: (15094: N1071 ^status complete)
  24574. <=WM: (15083: I2 ^dir U)
  24575. <=WM: (15082: I2 ^reward 1)
  24576. <=WM: (15081: I2 ^see 0)
  24577. =>WM: (15098: I2 ^level-1 L1-root)
  24578. <=WM: (15084: I2 ^level-1 L1-root)
  24579. --- END Input Phase ---
  24580. --- Proposal Phase ---
  24581. --- Inner Elaboration Phase, active level 1 (S1) ---
  24582. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  24583. -->
  24584. (S1 ^operator O2142 = -0.2714224023553999)
  24585. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  24586. -->
  24587. (S1 ^operator O2141 = 0.6622385371716781)
  24588. Firing prefer*rvt*predict-no*H0*4*v1*H1
  24589. -->
  24590. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  24591. -->
  24592. Firing elaborate*copy-see-to-output-link
  24593. -->
  24594. (I3 ^see 0 +)
  24595. Firing elaborate*reward*based*on*reward
  24596. -->
  24597. (R1075 ^value 1 +)
  24598. (R1 ^reward R1075 +)
  24599. Firing propose*predict-yes
  24600. -->
  24601. (O2143 ^name predict-yes +)
  24602. (S1 ^operator O2143 +)
  24603. Firing propose*predict-no
  24604. -->
  24605. (O2144 ^name predict-no +)
  24606. (S1 ^operator O2144 +)
  24607. Firing rl*prefer*rvt*predict-no*H0*4
  24608. -->
  24609. (S1 ^operator O2142 = 0.3397908847802913)
  24610. Firing rl*prefer*rvt*predict-yes*H0*3
  24611. -->
  24612. (S1 ^operator O2141 = 0.3377027590270043)
  24613. Firing prefer*rvt*predict-yes*H0
  24614. -->
  24615. Firing prefer*rvt*predict-no*H0
  24616. -->
  24617. Firing elaborate*copy-dir-to-output-link
  24618. -->
  24619. (I3 ^dir R +)
  24620. inner elaboration loop at bottom goal.
  24621. Retracting elaborate*copy-see-to-output-link
  24622. -->
  24623. (I3 ^see 0 +)
  24624. Retracting propose*predict-no
  24625. -->
  24626. (O2142 ^name predict-no +)
  24627. (S1 ^operator O2142 +)
  24628. Retracting propose*predict-yes
  24629. -->
  24630. (O2141 ^name predict-yes +)
  24631. (S1 ^operator O2141 +)
  24632. Retracting elaborate*reward*based*on*reward
  24633. -->
  24634. (R1074 ^value 1 +)
  24635. (R1 ^reward R1074 +)
  24636. Retracting elaborate*copy-dir-to-output-link
  24637. -->
  24638. (I3 ^dir U +)
  24639. Retracting rl*prefer*rvt*predict-no*H0*2
  24640. -->
  24641. (S1 ^operator O2142 = 1.)
  24642. Retracting rl*prefer*rvt*predict-yes*H0*1
  24643. -->
  24644. (S1 ^operator O2141 = 0.)
  24645. =>WM: (15105: S1 ^operator O2144 +)
  24646. =>WM: (15104: S1 ^operator O2143 +)
  24647. =>WM: (15103: I3 ^dir R)
  24648. =>WM: (15102: O2144 ^name predict-no)
  24649. =>WM: (15101: O2143 ^name predict-yes)
  24650. =>WM: (15100: R1075 ^value 1)
  24651. =>WM: (15099: R1 ^reward R1075)
  24652. <=WM: (15090: S1 ^operator O2141 +)
  24653. <=WM: (15091: S1 ^operator O2142 +)
  24654. <=WM: (15092: S1 ^operator O2142)
  24655. <=WM: (15075: I3 ^dir U)
  24656. <=WM: (15086: R1 ^reward R1074)
  24657. <=WM: (15089: O2142 ^name predict-no)
  24658. <=WM: (15088: O2141 ^name predict-yes)
  24659. <=WM: (15087: R1074 ^value 1)
  24660. --- Inner Elaboration Phase, active level 1 (S1) ---
  24661. Firing prefer*rvt*predict-yes*H0
  24662. -->
  24663. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  24664. -->
  24665. (S1 ^operator O2143 = 0.6622385371716781)
  24666. Firing rl*prefer*rvt*predict-yes*H0*3
  24667. -->
  24668. (S1 ^operator O2143 = 0.3377027590270043)
  24669. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  24670. -->
  24671. Firing prefer*rvt*predict-no*H0
  24672. -->
  24673. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  24674. -->
  24675. (S1 ^operator O2144 = -0.2714224023553999)
  24676. Firing rl*prefer*rvt*predict-no*H0*4
  24677. -->
  24678. (S1 ^operator O2144 = 0.3397908847802913)
  24679. Firing prefer*rvt*predict-no*H0*4*v1*H1
  24680. -->
  24681. inner elaboration loop at bottom goal.
  24682. Retracting rl*prefer*rvt*predict-no*H0*4
  24683. -->
  24684. (S1 ^operator O2142 = 0.3397908847802913)
  24685. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  24686. -->
  24687. (S1 ^operator O2142 = -0.2714224023553999)
  24688. Retracting rl*prefer*rvt*predict-yes*H0*3
  24689. -->
  24690. (S1 ^operator O2141 = 0.3377027590270043)
  24691. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  24692. -->
  24693. (S1 ^operator O2141 = 0.6622385371716781)
  24694. --- END Proposal Phase ---
  24695. --- Decision Phase ---
  24696. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24697. =>WM: (15106: S1 ^operator O2143)
  24698. 1072: O: O2143 (predict-yes)
  24699. --- END Decision Phase ---
  24700. --- Application Phase ---
  24701. --- Firing Productions (PE) For State At Depth 1 ---
  24702. --- Inner Elaboration Phase, active level 1 (S1) ---
  24703. Firing apply*operator
  24704. -->
  24705. (I3 ^predict-yes N1072 + :O )
  24706. Firing apply*operator*complete
  24707. -->
  24708. (I3 ^predict-no N1071 - :O )
  24709. inner elaboration loop at bottom goal.
  24710. --- Change Working Memory (PE) ---
  24711. =>WM: (15107: I3 ^predict-yes N1072)
  24712. <=WM: (15094: N1071 ^status complete)
  24713. <=WM: (15093: I3 ^predict-no N1071)
  24714. --- Firing Productions (IE) For State At Depth 1 ---
  24715. --- Inner Elaboration Phase, active level 1 (S1) ---
  24716. Firing monitor*world
  24717. -->
  24718. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24719. --- Change Working Memory (IE) ---
  24720. --- END Application Phase ---
  24721. --- Output Phase ---
  24722. ENV: Agent did: predict-yes for direction R in state State-A
  24723. In State-A moving R
  24724. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  24725. predict error 0
  24726. dir: dir isL
  24727. --- END Output Phase ---
  24728. /|\--- Input Phase ---
  24729. =>WM: (15111: I2 ^dir L)
  24730. =>WM: (15110: I2 ^reward 1)
  24731. =>WM: (15109: I2 ^see 1)
  24732. =>WM: (15108: N1072 ^status complete)
  24733. <=WM: (15097: I2 ^dir R)
  24734. <=WM: (15096: I2 ^reward 1)
  24735. <=WM: (15095: I2 ^see 0)
  24736. =>WM: (15112: I2 ^level-1 R1-root)
  24737. <=WM: (15098: I2 ^level-1 L1-root)
  24738. --- END Input Phase ---
  24739. --- Proposal Phase ---
  24740. --- Inner Elaboration Phase, active level 1 (S1) ---
  24741. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24742. -->
  24743. (S1 ^operator O2143 = 0.7361640420110442)
  24744. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24745. -->
  24746. Firing elaborate*copy-see-to-output-link
  24747. -->
  24748. (I3 ^see 1 +)
  24749. Firing elaborate*reward*based*on*reward
  24750. -->
  24751. (R1076 ^value 1 +)
  24752. (R1 ^reward R1076 +)
  24753. Firing propose*predict-yes
  24754. -->
  24755. (O2145 ^name predict-yes +)
  24756. (S1 ^operator O2145 +)
  24757. Firing propose*predict-no
  24758. -->
  24759. (O2146 ^name predict-no +)
  24760. (S1 ^operator O2146 +)
  24761. Firing rl*prefer*rvt*predict-no*H0*6
  24762. -->
  24763. (S1 ^operator O2144 = 0.9531240445229402)
  24764. Firing rl*prefer*rvt*predict-yes*H0*5
  24765. -->
  24766. (S1 ^operator O2143 = 0.2640150473205478)
  24767. Firing prefer*rvt*predict-yes*H0
  24768. -->
  24769. Firing prefer*rvt*predict-no*H0
  24770. -->
  24771. Firing elaborate*copy-dir-to-output-link
  24772. -->
  24773. (I3 ^dir L +)
  24774. inner elaboration loop at bottom goal.
  24775. Retracting elaborate*copy-see-to-output-link
  24776. -->
  24777. (I3 ^see 0 +)
  24778. Retracting propose*predict-no
  24779. -->
  24780. (O2144 ^name predict-no +)
  24781. (S1 ^operator O2144 +)
  24782. Retracting propose*predict-yes
  24783. -->
  24784. (O2143 ^name predict-yes +)
  24785. (S1 ^operator O2143 +)
  24786. Retracting elaborate*reward*based*on*reward
  24787. -->
  24788. (R1075 ^value 1 +)
  24789. (R1 ^reward R1075 +)
  24790. Retracting elaborate*copy-dir-to-output-link
  24791. -->
  24792. (I3 ^dir R +)
  24793. Retracting rl*prefer*rvt*predict-no*H0*4
  24794. -->
  24795. (S1 ^operator O2144 = 0.3397908847802913)
  24796. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  24797. -->
  24798. (S1 ^operator O2144 = -0.2714224023553999)
  24799. Retracting rl*prefer*rvt*predict-yes*H0*3
  24800. -->
  24801. (S1 ^operator O2143 = 0.3377027590270043)
  24802. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  24803. -->
  24804. (S1 ^operator O2143 = 0.6622385371716781)
  24805. =>WM: (15120: S1 ^operator O2146 +)
  24806. =>WM: (15119: S1 ^operator O2145 +)
  24807. =>WM: (15118: I3 ^dir L)
  24808. =>WM: (15117: O2146 ^name predict-no)
  24809. =>WM: (15116: O2145 ^name predict-yes)
  24810. =>WM: (15115: R1076 ^value 1)
  24811. =>WM: (15114: R1 ^reward R1076)
  24812. =>WM: (15113: I3 ^see 1)
  24813. <=WM: (15104: S1 ^operator O2143 +)
  24814. <=WM: (15106: S1 ^operator O2143)
  24815. <=WM: (15105: S1 ^operator O2144 +)
  24816. <=WM: (15103: I3 ^dir R)
  24817. <=WM: (15099: R1 ^reward R1075)
  24818. <=WM: (15085: I3 ^see 0)
  24819. <=WM: (15102: O2144 ^name predict-no)
  24820. <=WM: (15101: O2143 ^name predict-yes)
  24821. <=WM: (15100: R1075 ^value 1)
  24822. --- Inner Elaboration Phase, active level 1 (S1) ---
  24823. Firing prefer*rvt*predict-yes*H0
  24824. -->
  24825. Firing rl*prefer*rvt*predict-yes*H0*5
  24826. -->
  24827. (S1 ^operator O2145 = 0.2640150473205478)
  24828. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24829. -->
  24830. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24831. -->
  24832. (S1 ^operator O2145 = 0.7361640420110442)
  24833. Firing prefer*rvt*predict-no*H0
  24834. -->
  24835. Firing rl*prefer*rvt*predict-no*H0*6
  24836. -->
  24837. (S1 ^operator O2146 = 0.9531240445229402)
  24838. inner elaboration loop at bottom goal.
  24839. Retracting rl*prefer*rvt*predict-no*H0*6
  24840. -->
  24841. (S1 ^operator O2144 = 0.9531240445229402)
  24842. Retracting rl*prefer*rvt*predict-yes*H0*5
  24843. -->
  24844. (S1 ^operator O2143 = 0.2640150473205478)
  24845. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24846. -->
  24847. (S1 ^operator O2143 = 0.7361640420110442)
  24848. --- END Proposal Phase ---
  24849. --- Decision Phase ---
  24850. RL update rl*prefer*rvt*predict-yes*H0*3 0.590102 -0.252399 0.337703 -> 0.590107 -0.2524 0.337708(R,m,v=1,0.903955,0.0873138)
  24851. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409831 0.252408 0.662239 -> 0.409837 0.252407 0.662244(R,m,v=1,1,0)
  24852. =>WM: (15121: S1 ^operator O2145)
  24853. 1073: O: O2145 (predict-yes)
  24854. --- END Decision Phase ---
  24855. --- Application Phase ---
  24856. --- Firing Productions (PE) For State At Depth 1 ---
  24857. --- Inner Elaboration Phase, active level 1 (S1) ---
  24858. Firing apply*operator
  24859. -->
  24860. (I3 ^predict-yes N1073 + :O )
  24861. Firing apply*operator*complete
  24862. -->
  24863. (I3 ^predict-yes N1072 - :O )
  24864. inner elaboration loop at bottom goal.
  24865. --- Change Working Memory (PE) ---
  24866. =>WM: (15122: I3 ^predict-yes N1073)
  24867. <=WM: (15108: N1072 ^status complete)
  24868. <=WM: (15107: I3 ^predict-yes N1072)
  24869. --- Firing Productions (IE) For State At Depth 1 ---
  24870. --- Inner Elaboration Phase, active level 1 (S1) ---
  24871. Firing monitor*world
  24872. -->
  24873. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24874. --- Change Working Memory (IE) ---
  24875. --- END Application Phase ---
  24876. --- Output Phase ---
  24877. ENV: Agent did: predict-yes for direction L in state State-B
  24878. In State-B moving L
  24879. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  24880. predict error 0
  24881. dir: dir isL
  24882. --- END Output Phase ---
  24883. -/|--- Input Phase ---
  24884. =>WM: (15126: I2 ^dir L)
  24885. =>WM: (15125: I2 ^reward 1)
  24886. =>WM: (15124: I2 ^see 1)
  24887. =>WM: (15123: N1073 ^status complete)
  24888. <=WM: (15111: I2 ^dir L)
  24889. <=WM: (15110: I2 ^reward 1)
  24890. <=WM: (15109: I2 ^see 1)
  24891. =>WM: (15127: I2 ^level-1 L1-root)
  24892. <=WM: (15112: I2 ^level-1 R1-root)
  24893. --- END Input Phase ---
  24894. --- Proposal Phase ---
  24895. --- Inner Elaboration Phase, active level 1 (S1) ---
  24896. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  24897. -->
  24898. (S1 ^operator O2145 = -0.181727099742844)
  24899. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24900. -->
  24901. Firing elaborate*copy-see-to-output-link
  24902. -->
  24903. (I3 ^see 1 +)
  24904. Firing elaborate*reward*based*on*reward
  24905. -->
  24906. (R1077 ^value 1 +)
  24907. (R1 ^reward R1077 +)
  24908. Firing propose*predict-yes
  24909. -->
  24910. (O2147 ^name predict-yes +)
  24911. (S1 ^operator O2147 +)
  24912. Firing propose*predict-no
  24913. -->
  24914. (O2148 ^name predict-no +)
  24915. (S1 ^operator O2148 +)
  24916. Firing rl*prefer*rvt*predict-no*H0*6
  24917. -->
  24918. (S1 ^operator O2146 = 0.9531240445229402)
  24919. Firing rl*prefer*rvt*predict-yes*H0*5
  24920. -->
  24921. (S1 ^operator O2145 = 0.2640150473205478)
  24922. Firing prefer*rvt*predict-yes*H0
  24923. -->
  24924. Firing prefer*rvt*predict-no*H0
  24925. -->
  24926. Firing elaborate*copy-dir-to-output-link
  24927. -->
  24928. (I3 ^dir L +)
  24929. inner elaboration loop at bottom goal.
  24930. Retracting elaborate*copy-see-to-output-link
  24931. -->
  24932. (I3 ^see 1 +)
  24933. Retracting propose*predict-no
  24934. -->
  24935. (O2146 ^name predict-no +)
  24936. (S1 ^operator O2146 +)
  24937. Retracting propose*predict-yes
  24938. -->
  24939. (O2145 ^name predict-yes +)
  24940. (S1 ^operator O2145 +)
  24941. Retracting elaborate*reward*based*on*reward
  24942. -->
  24943. (R1076 ^value 1 +)
  24944. (R1 ^reward R1076 +)
  24945. Retracting elaborate*copy-dir-to-output-link
  24946. -->
  24947. (I3 ^dir L +)
  24948. Retracting rl*prefer*rvt*predict-no*H0*6
  24949. -->
  24950. (S1 ^operator O2146 = 0.9531240445229402)
  24951. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  24952. -->
  24953. (S1 ^operator O2145 = 0.7361640420110442)
  24954. Retracting rl*prefer*rvt*predict-yes*H0*5
  24955. -->
  24956. (S1 ^operator O2145 = 0.2640150473205478)
  24957. =>WM: (15133: S1 ^operator O2148 +)
  24958. =>WM: (15132: S1 ^operator O2147 +)
  24959. =>WM: (15131: O2148 ^name predict-no)
  24960. =>WM: (15130: O2147 ^name predict-yes)
  24961. =>WM: (15129: R1077 ^value 1)
  24962. =>WM: (15128: R1 ^reward R1077)
  24963. <=WM: (15119: S1 ^operator O2145 +)
  24964. <=WM: (15121: S1 ^operator O2145)
  24965. <=WM: (15120: S1 ^operator O2146 +)
  24966. <=WM: (15114: R1 ^reward R1076)
  24967. <=WM: (15117: O2146 ^name predict-no)
  24968. <=WM: (15116: O2145 ^name predict-yes)
  24969. <=WM: (15115: R1076 ^value 1)
  24970. --- Inner Elaboration Phase, active level 1 (S1) ---
  24971. Firing prefer*rvt*predict-yes*H0
  24972. -->
  24973. Firing rl*prefer*rvt*predict-yes*H0*5
  24974. -->
  24975. (S1 ^operator O2147 = 0.2640150473205478)
  24976. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  24977. -->
  24978. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  24979. -->
  24980. (S1 ^operator O2147 = -0.181727099742844)
  24981. Firing prefer*rvt*predict-no*H0
  24982. -->
  24983. Firing rl*prefer*rvt*predict-no*H0*6
  24984. -->
  24985. (S1 ^operator O2148 = 0.9531240445229402)
  24986. inner elaboration loop at bottom goal.
  24987. Retracting rl*prefer*rvt*predict-no*H0*6
  24988. -->
  24989. (S1 ^operator O2146 = 0.9531240445229402)
  24990. Retracting rl*prefer*rvt*predict-yes*H0*5
  24991. -->
  24992. (S1 ^operator O2145 = 0.2640150473205478)
  24993. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  24994. -->
  24995. (S1 ^operator O2145 = -0.181727099742844)
  24996. --- END Proposal Phase ---
  24997. --- Decision Phase ---
  24998. RL update rl*prefer*rvt*predict-yes*H0*5 0.554401 -0.290386 0.264015 -> 0.554387 -0.290386 0.264001(R,m,v=1,0.882979,0.10388)
  24999. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445775 0.290389 0.736164 -> 0.445759 0.290389 0.736148(R,m,v=1,1,0)
  25000. =>WM: (15134: S1 ^operator O2148)
  25001. 1074: O: O2148 (predict-no)
  25002. --- END Decision Phase ---
  25003. --- Application Phase ---
  25004. --- Firing Productions (PE) For State At Depth 1 ---
  25005. --- Inner Elaboration Phase, active level 1 (S1) ---
  25006. Firing apply*operator
  25007. -->
  25008. (I3 ^predict-no N1074 + :O )
  25009. Firing apply*operator*complete
  25010. -->
  25011. (I3 ^predict-yes N1073 - :O )
  25012. inner elaboration loop at bottom goal.
  25013. --- Change Working Memory (PE) ---
  25014. =>WM: (15135: I3 ^predict-no N1074)
  25015. <=WM: (15123: N1073 ^status complete)
  25016. <=WM: (15122: I3 ^predict-yes N1073)
  25017. --- Firing Productions (IE) For State At Depth 1 ---
  25018. --- Inner Elaboration Phase, active level 1 (S1) ---
  25019. Firing monitor*world
  25020. -->
  25021. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25022. --- Change Working Memory (IE) ---
  25023. --- END Application Phase ---
  25024. --- Output Phase ---
  25025. ENV: Agent did: predict-no for direction L in state State-A
  25026. In State-A moving L
  25027. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25028. predict error 0
  25029. dir: dir isL
  25030. --- END Output Phase ---
  25031. \-/--- Input Phase ---
  25032. =>WM: (15139: I2 ^dir L)
  25033. =>WM: (15138: I2 ^reward 1)
  25034. =>WM: (15137: I2 ^see 0)
  25035. =>WM: (15136: N1074 ^status complete)
  25036. <=WM: (15126: I2 ^dir L)
  25037. <=WM: (15125: I2 ^reward 1)
  25038. <=WM: (15124: I2 ^see 1)
  25039. =>WM: (15140: I2 ^level-1 L0-root)
  25040. <=WM: (15127: I2 ^level-1 L1-root)
  25041. --- END Input Phase ---
  25042. --- Proposal Phase ---
  25043. --- Inner Elaboration Phase, active level 1 (S1) ---
  25044. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25045. -->
  25046. (S1 ^operator O2147 = -0.1386470047172653)
  25047. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25048. -->
  25049. Firing elaborate*copy-see-to-output-link
  25050. -->
  25051. (I3 ^see 0 +)
  25052. Firing elaborate*reward*based*on*reward
  25053. -->
  25054. (R1078 ^value 1 +)
  25055. (R1 ^reward R1078 +)
  25056. Firing propose*predict-yes
  25057. -->
  25058. (O2149 ^name predict-yes +)
  25059. (S1 ^operator O2149 +)
  25060. Firing propose*predict-no
  25061. -->
  25062. (O2150 ^name predict-no +)
  25063. (S1 ^operator O2150 +)
  25064. Firing rl*prefer*rvt*predict-no*H0*6
  25065. -->
  25066. (S1 ^operator O2148 = 0.9531240445229402)
  25067. Firing rl*prefer*rvt*predict-yes*H0*5
  25068. -->
  25069. (S1 ^operator O2147 = 0.2640006890267754)
  25070. Firing prefer*rvt*predict-yes*H0
  25071. -->
  25072. Firing prefer*rvt*predict-no*H0
  25073. -->
  25074. Firing elaborate*copy-dir-to-output-link
  25075. -->
  25076. (I3 ^dir L +)
  25077. inner elaboration loop at bottom goal.
  25078. Retracting elaborate*copy-see-to-output-link
  25079. -->
  25080. (I3 ^see 1 +)
  25081. Retracting propose*predict-no
  25082. -->
  25083. (O2148 ^name predict-no +)
  25084. (S1 ^operator O2148 +)
  25085. Retracting propose*predict-yes
  25086. -->
  25087. (O2147 ^name predict-yes +)
  25088. (S1 ^operator O2147 +)
  25089. Retracting elaborate*reward*based*on*reward
  25090. -->
  25091. (R1077 ^value 1 +)
  25092. (R1 ^reward R1077 +)
  25093. Retracting elaborate*copy-dir-to-output-link
  25094. -->
  25095. (I3 ^dir L +)
  25096. Retracting rl*prefer*rvt*predict-no*H0*6
  25097. -->
  25098. (S1 ^operator O2148 = 0.9531240445229402)
  25099. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  25100. -->
  25101. (S1 ^operator O2147 = -0.181727099742844)
  25102. Retracting rl*prefer*rvt*predict-yes*H0*5
  25103. -->
  25104. (S1 ^operator O2147 = 0.2640006890267754)
  25105. =>WM: (15147: S1 ^operator O2150 +)
  25106. =>WM: (15146: S1 ^operator O2149 +)
  25107. =>WM: (15145: O2150 ^name predict-no)
  25108. =>WM: (15144: O2149 ^name predict-yes)
  25109. =>WM: (15143: R1078 ^value 1)
  25110. =>WM: (15142: R1 ^reward R1078)
  25111. =>WM: (15141: I3 ^see 0)
  25112. <=WM: (15132: S1 ^operator O2147 +)
  25113. <=WM: (15133: S1 ^operator O2148 +)
  25114. <=WM: (15134: S1 ^operator O2148)
  25115. <=WM: (15128: R1 ^reward R1077)
  25116. <=WM: (15113: I3 ^see 1)
  25117. <=WM: (15131: O2148 ^name predict-no)
  25118. <=WM: (15130: O2147 ^name predict-yes)
  25119. <=WM: (15129: R1077 ^value 1)
  25120. --- Inner Elaboration Phase, active level 1 (S1) ---
  25121. Firing prefer*rvt*predict-yes*H0
  25122. -->
  25123. Firing rl*prefer*rvt*predict-yes*H0*5
  25124. -->
  25125. (S1 ^operator O2149 = 0.2640006890267754)
  25126. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25127. -->
  25128. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25129. -->
  25130. (S1 ^operator O2149 = -0.1386470047172653)
  25131. Firing prefer*rvt*predict-no*H0
  25132. -->
  25133. Firing rl*prefer*rvt*predict-no*H0*6
  25134. -->
  25135. (S1 ^operator O2150 = 0.9531240445229402)
  25136. inner elaboration loop at bottom goal.
  25137. Retracting rl*prefer*rvt*predict-no*H0*6
  25138. -->
  25139. (S1 ^operator O2148 = 0.9531240445229402)
  25140. Retracting rl*prefer*rvt*predict-yes*H0*5
  25141. -->
  25142. (S1 ^operator O2147 = 0.2640006890267754)
  25143. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25144. -->
  25145. (S1 ^operator O2147 = -0.1386470047172653)
  25146. --- END Proposal Phase ---
  25147. --- Decision Phase ---
  25148. RL update rl*prefer*rvt*predict-no*H0*6 0.953124 0 0.953124 -> 0.960848 0 0.960848(R,m,v=1,0.90566,0.0859804)
  25149. =>WM: (15148: S1 ^operator O2150)
  25150. 1075: O: O2150 (predict-no)
  25151. --- END Decision Phase ---
  25152. --- Application Phase ---
  25153. --- Firing Productions (PE) For State At Depth 1 ---
  25154. --- Inner Elaboration Phase, active level 1 (S1) ---
  25155. Firing apply*operator
  25156. -->
  25157. (I3 ^predict-no N1075 + :O )
  25158. Firing apply*operator*complete
  25159. -->
  25160. (I3 ^predict-no N1074 - :O )
  25161. inner elaboration loop at bottom goal.
  25162. --- Change Working Memory (PE) ---
  25163. =>WM: (15149: I3 ^predict-no N1075)
  25164. <=WM: (15136: N1074 ^status complete)
  25165. <=WM: (15135: I3 ^predict-no N1074)
  25166. --- Firing Productions (IE) For State At Depth 1 ---
  25167. --- Inner Elaboration Phase, active level 1 (S1) ---
  25168. Firing monitor*world
  25169. -->
  25170. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25171. --- Change Working Memory (IE) ---
  25172. --- END Application Phase ---
  25173. --- Output Phase ---
  25174. ENV: Agent did: predict-no for direction L in state State-A
  25175. In State-A moving L
  25176. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25177. predict error 0
  25178. dir: dir isL
  25179. --- END Output Phase ---
  25180. |\---- Input Phase ---
  25181. =>WM: (15153: I2 ^dir L)
  25182. =>WM: (15152: I2 ^reward 1)
  25183. =>WM: (15151: I2 ^see 0)
  25184. =>WM: (15150: N1075 ^status complete)
  25185. <=WM: (15139: I2 ^dir L)
  25186. <=WM: (15138: I2 ^reward 1)
  25187. <=WM: (15137: I2 ^see 0)
  25188. =>WM: (15154: I2 ^level-1 L0-root)
  25189. <=WM: (15140: I2 ^level-1 L0-root)
  25190. --- END Input Phase ---
  25191. --- Proposal Phase ---
  25192. --- Inner Elaboration Phase, active level 1 (S1) ---
  25193. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25194. -->
  25195. (S1 ^operator O2149 = -0.1386470047172653)
  25196. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25197. -->
  25198. Firing elaborate*copy-see-to-output-link
  25199. -->
  25200. (I3 ^see 0 +)
  25201. Firing elaborate*reward*based*on*reward
  25202. -->
  25203. (R1079 ^value 1 +)
  25204. (R1 ^reward R1079 +)
  25205. Firing propose*predict-yes
  25206. -->
  25207. (O2151 ^name predict-yes +)
  25208. (S1 ^operator O2151 +)
  25209. Firing propose*predict-no
  25210. -->
  25211. (O2152 ^name predict-no +)
  25212. (S1 ^operator O2152 +)
  25213. Firing rl*prefer*rvt*predict-no*H0*6
  25214. -->
  25215. (S1 ^operator O2150 = 0.9608480015858826)
  25216. Firing rl*prefer*rvt*predict-yes*H0*5
  25217. -->
  25218. (S1 ^operator O2149 = 0.2640006890267754)
  25219. Firing prefer*rvt*predict-yes*H0
  25220. -->
  25221. Firing prefer*rvt*predict-no*H0
  25222. -->
  25223. Firing elaborate*copy-dir-to-output-link
  25224. -->
  25225. (I3 ^dir L +)
  25226. inner elaboration loop at bottom goal.
  25227. Retracting elaborate*copy-see-to-output-link
  25228. -->
  25229. (I3 ^see 0 +)
  25230. Retracting propose*predict-no
  25231. -->
  25232. (O2150 ^name predict-no +)
  25233. (S1 ^operator O2150 +)
  25234. Retracting propose*predict-yes
  25235. -->
  25236. (O2149 ^name predict-yes +)
  25237. (S1 ^operator O2149 +)
  25238. Retracting elaborate*reward*based*on*reward
  25239. -->
  25240. (R1078 ^value 1 +)
  25241. (R1 ^reward R1078 +)
  25242. Retracting elaborate*copy-dir-to-output-link
  25243. -->
  25244. (I3 ^dir L +)
  25245. Retracting rl*prefer*rvt*predict-no*H0*6
  25246. -->
  25247. (S1 ^operator O2150 = 0.9608480015858826)
  25248. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25249. -->
  25250. (S1 ^operator O2149 = -0.1386470047172653)
  25251. Retracting rl*prefer*rvt*predict-yes*H0*5
  25252. -->
  25253. (S1 ^operator O2149 = 0.2640006890267754)
  25254. =>WM: (15160: S1 ^operator O2152 +)
  25255. =>WM: (15159: S1 ^operator O2151 +)
  25256. =>WM: (15158: O2152 ^name predict-no)
  25257. =>WM: (15157: O2151 ^name predict-yes)
  25258. =>WM: (15156: R1079 ^value 1)
  25259. =>WM: (15155: R1 ^reward R1079)
  25260. <=WM: (15146: S1 ^operator O2149 +)
  25261. <=WM: (15147: S1 ^operator O2150 +)
  25262. <=WM: (15148: S1 ^operator O2150)
  25263. <=WM: (15142: R1 ^reward R1078)
  25264. <=WM: (15145: O2150 ^name predict-no)
  25265. <=WM: (15144: O2149 ^name predict-yes)
  25266. <=WM: (15143: R1078 ^value 1)
  25267. --- Inner Elaboration Phase, active level 1 (S1) ---
  25268. Firing prefer*rvt*predict-yes*H0
  25269. -->
  25270. Firing rl*prefer*rvt*predict-yes*H0*5
  25271. -->
  25272. (S1 ^operator O2151 = 0.2640006890267754)
  25273. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  25274. -->
  25275. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25276. -->
  25277. (S1 ^operator O2151 = -0.1386470047172653)
  25278. Firing prefer*rvt*predict-no*H0
  25279. -->
  25280. Firing rl*prefer*rvt*predict-no*H0*6
  25281. -->
  25282. (S1 ^operator O2152 = 0.9608480015858826)
  25283. inner elaboration loop at bottom goal.
  25284. Retracting rl*prefer*rvt*predict-no*H0*6
  25285. -->
  25286. (S1 ^operator O2150 = 0.9608480015858826)
  25287. Retracting rl*prefer*rvt*predict-yes*H0*5
  25288. -->
  25289. (S1 ^operator O2149 = 0.2640006890267754)
  25290. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25291. -->
  25292. (S1 ^operator O2149 = -0.1386470047172653)
  25293. --- END Proposal Phase ---
  25294. --- Decision Phase ---
  25295. RL update rl*prefer*rvt*predict-no*H0*6 0.960848 0 0.960848 -> 0.967293 0 0.967293(R,m,v=1,0.90625,0.0854953)
  25296. =>WM: (15161: S1 ^operator O2152)
  25297. 1076: O: O2152 (predict-no)
  25298. --- END Decision Phase ---
  25299. --- Application Phase ---
  25300. --- Firing Productions (PE) For State At Depth 1 ---
  25301. --- Inner Elaboration Phase, active level 1 (S1) ---
  25302. Firing apply*operator
  25303. -->
  25304. (I3 ^predict-no N1076 + :O )
  25305. Firing apply*operator*complete
  25306. -->
  25307. (I3 ^predict-no N1075 - :O )
  25308. inner elaboration loop at bottom goal.
  25309. --- Change Working Memory (PE) ---
  25310. =>WM: (15162: I3 ^predict-no N1076)
  25311. <=WM: (15150: N1075 ^status complete)
  25312. <=WM: (15149: I3 ^predict-no N1075)
  25313. --- Firing Productions (IE) For State At Depth 1 ---
  25314. --- Inner Elaboration Phase, active level 1 (S1) ---
  25315. Firing monitor*world
  25316. -->
  25317. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25318. --- Change Working Memory (IE) ---
  25319. --- END Application Phase ---
  25320. --- Output Phase ---
  25321. ENV: Agent did: predict-no for direction L in state State-A
  25322. In State-A moving L
  25323. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25324. predict error 0
  25325. dir: dir isU
  25326. --- END Output Phase ---
  25327. /|--- Input Phase ---
  25328. =>WM: (15166: I2 ^dir U)
  25329. =>WM: (15165: I2 ^reward 1)
  25330. =>WM: (15164: I2 ^see 0)
  25331. =>WM: (15163: N1076 ^status complete)
  25332. <=WM: (15153: I2 ^dir L)
  25333. <=WM: (15152: I2 ^reward 1)
  25334. <=WM: (15151: I2 ^see 0)
  25335. =>WM: (15167: I2 ^level-1 L0-root)
  25336. <=WM: (15154: I2 ^level-1 L0-root)
  25337. --- END Input Phase ---
  25338. --- Proposal Phase ---
  25339. --- Inner Elaboration Phase, active level 1 (S1) ---
  25340. Firing elaborate*copy-see-to-output-link
  25341. -->
  25342. (I3 ^see 0 +)
  25343. Firing elaborate*reward*based*on*reward
  25344. -->
  25345. (R1080 ^value 1 +)
  25346. (R1 ^reward R1080 +)
  25347. Firing propose*predict-yes
  25348. -->
  25349. (O2153 ^name predict-yes +)
  25350. (S1 ^operator O2153 +)
  25351. Firing propose*predict-no
  25352. -->
  25353. (O2154 ^name predict-no +)
  25354. (S1 ^operator O2154 +)
  25355. Firing rl*prefer*rvt*predict-no*H0*2
  25356. -->
  25357. (S1 ^operator O2152 = 1.)
  25358. Firing rl*prefer*rvt*predict-yes*H0*1
  25359. -->
  25360. (S1 ^operator O2151 = 0.)
  25361. Firing prefer*rvt*predict-yes*H0
  25362. -->
  25363. Firing prefer*rvt*predict-no*H0
  25364. -->
  25365. Firing elaborate*copy-dir-to-output-link
  25366. -->
  25367. (I3 ^dir U +)
  25368. inner elaboration loop at bottom goal.
  25369. Retracting elaborate*copy-see-to-output-link
  25370. -->
  25371. (I3 ^see 0 +)
  25372. Retracting propose*predict-no
  25373. -->
  25374. (O2152 ^name predict-no +)
  25375. (S1 ^operator O2152 +)
  25376. Retracting propose*predict-yes
  25377. -->
  25378. (O2151 ^name predict-yes +)
  25379. (S1 ^operator O2151 +)
  25380. Retracting elaborate*reward*based*on*reward
  25381. -->
  25382. (R1079 ^value 1 +)
  25383. (R1 ^reward R1079 +)
  25384. Retracting elaborate*copy-dir-to-output-link
  25385. -->
  25386. (I3 ^dir L +)
  25387. Retracting rl*prefer*rvt*predict-no*H0*6
  25388. -->
  25389. (S1 ^operator O2152 = 0.967292590597631)
  25390. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  25391. -->
  25392. (S1 ^operator O2151 = -0.1386470047172653)
  25393. Retracting rl*prefer*rvt*predict-yes*H0*5
  25394. -->
  25395. (S1 ^operator O2151 = 0.2640006890267754)
  25396. =>WM: (15174: S1 ^operator O2154 +)
  25397. =>WM: (15173: S1 ^operator O2153 +)
  25398. =>WM: (15172: I3 ^dir U)
  25399. =>WM: (15171: O2154 ^name predict-no)
  25400. =>WM: (15170: O2153 ^name predict-yes)
  25401. =>WM: (15169: R1080 ^value 1)
  25402. =>WM: (15168: R1 ^reward R1080)
  25403. <=WM: (15159: S1 ^operator O2151 +)
  25404. <=WM: (15160: S1 ^operator O2152 +)
  25405. <=WM: (15161: S1 ^operator O2152)
  25406. <=WM: (15118: I3 ^dir L)
  25407. <=WM: (15155: R1 ^reward R1079)
  25408. <=WM: (15158: O2152 ^name predict-no)
  25409. <=WM: (15157: O2151 ^name predict-yes)
  25410. <=WM: (15156: R1079 ^value 1)
  25411. --- Inner Elaboration Phase, active level 1 (S1) ---
  25412. Firing prefer*rvt*predict-yes*H0
  25413. -->
  25414. Firing rl*prefer*rvt*predict-yes*H0*1
  25415. -->
  25416. (S1 ^operator O2153 = 0.)
  25417. Firing prefer*rvt*predict-no*H0
  25418. -->
  25419. Firing rl*prefer*rvt*predict-no*H0*2
  25420. -->
  25421. (S1 ^operator O2154 = 1.)
  25422. inner elaboration loop at bottom goal.
  25423. Retracting rl*prefer*rvt*predict-no*H0*2
  25424. -->
  25425. (S1 ^operator O2152 = 1.)
  25426. Retracting rl*prefer*rvt*predict-yes*H0*1
  25427. -->
  25428. (S1 ^operator O2151 = 0.)
  25429. --- END Proposal Phase ---
  25430. --- Decision Phase ---
  25431. RL update rl*prefer*rvt*predict-no*H0*6 0.967293 0 0.967293 -> 0.972671 0 0.972671(R,m,v=1,0.906832,0.0850155)
  25432. =>WM: (15175: S1 ^operator O2154)
  25433. 1077: O: O2154 (predict-no)
  25434. --- END Decision Phase ---
  25435. --- Application Phase ---
  25436. --- Firing Productions (PE) For State At Depth 1 ---
  25437. --- Inner Elaboration Phase, active level 1 (S1) ---
  25438. Firing apply*operator
  25439. -->
  25440. (I3 ^predict-no N1077 + :O )
  25441. Firing apply*operator*complete
  25442. -->
  25443. (I3 ^predict-no N1076 - :O )
  25444. inner elaboration loop at bottom goal.
  25445. --- Change Working Memory (PE) ---
  25446. =>WM: (15176: I3 ^predict-no N1077)
  25447. <=WM: (15163: N1076 ^status complete)
  25448. <=WM: (15162: I3 ^predict-no N1076)
  25449. --- Firing Productions (IE) For State At Depth 1 ---
  25450. --- Inner Elaboration Phase, active level 1 (S1) ---
  25451. Firing monitor*world
  25452. -->
  25453. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25454. --- Change Working Memory (IE) ---
  25455. --- END Application Phase ---
  25456. --- Output Phase ---
  25457. ENV: Agent did: predict-no for direction U in state State-A
  25458. In State-A moving U
  25459. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25460. predict error 0
  25461. dir: dir isR
  25462. --- END Output Phase ---
  25463. \---- Input Phase ---
  25464. =>WM: (15180: I2 ^dir R)
  25465. =>WM: (15179: I2 ^reward 1)
  25466. =>WM: (15178: I2 ^see 0)
  25467. =>WM: (15177: N1077 ^status complete)
  25468. <=WM: (15166: I2 ^dir U)
  25469. <=WM: (15165: I2 ^reward 1)
  25470. <=WM: (15164: I2 ^see 0)
  25471. =>WM: (15181: I2 ^level-1 L0-root)
  25472. <=WM: (15167: I2 ^level-1 L0-root)
  25473. --- END Input Phase ---
  25474. --- Proposal Phase ---
  25475. --- Inner Elaboration Phase, active level 1 (S1) ---
  25476. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25477. -->
  25478. (S1 ^operator O2154 = -0.2817060109291377)
  25479. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25480. -->
  25481. (S1 ^operator O2153 = 0.6623226459114221)
  25482. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25483. -->
  25484. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25485. -->
  25486. Firing elaborate*copy-see-to-output-link
  25487. -->
  25488. (I3 ^see 0 +)
  25489. Firing elaborate*reward*based*on*reward
  25490. -->
  25491. (R1081 ^value 1 +)
  25492. (R1 ^reward R1081 +)
  25493. Firing propose*predict-yes
  25494. -->
  25495. (O2155 ^name predict-yes +)
  25496. (S1 ^operator O2155 +)
  25497. Firing propose*predict-no
  25498. -->
  25499. (O2156 ^name predict-no +)
  25500. (S1 ^operator O2156 +)
  25501. Firing rl*prefer*rvt*predict-no*H0*4
  25502. -->
  25503. (S1 ^operator O2154 = 0.3397908847802913)
  25504. Firing rl*prefer*rvt*predict-yes*H0*3
  25505. -->
  25506. (S1 ^operator O2153 = 0.337707511486373)
  25507. Firing prefer*rvt*predict-yes*H0
  25508. -->
  25509. Firing prefer*rvt*predict-no*H0
  25510. -->
  25511. Firing elaborate*copy-dir-to-output-link
  25512. -->
  25513. (I3 ^dir R +)
  25514. inner elaboration loop at bottom goal.
  25515. Retracting elaborate*copy-see-to-output-link
  25516. -->
  25517. (I3 ^see 0 +)
  25518. Retracting propose*predict-no
  25519. -->
  25520. (O2154 ^name predict-no +)
  25521. (S1 ^operator O2154 +)
  25522. Retracting propose*predict-yes
  25523. -->
  25524. (O2153 ^name predict-yes +)
  25525. (S1 ^operator O2153 +)
  25526. Retracting elaborate*reward*based*on*reward
  25527. -->
  25528. (R1080 ^value 1 +)
  25529. (R1 ^reward R1080 +)
  25530. Retracting elaborate*copy-dir-to-output-link
  25531. -->
  25532. (I3 ^dir U +)
  25533. Retracting rl*prefer*rvt*predict-no*H0*2
  25534. -->
  25535. (S1 ^operator O2154 = 1.)
  25536. Retracting rl*prefer*rvt*predict-yes*H0*1
  25537. -->
  25538. (S1 ^operator O2153 = 0.)
  25539. =>WM: (15188: S1 ^operator O2156 +)
  25540. =>WM: (15187: S1 ^operator O2155 +)
  25541. =>WM: (15186: I3 ^dir R)
  25542. =>WM: (15185: O2156 ^name predict-no)
  25543. =>WM: (15184: O2155 ^name predict-yes)
  25544. =>WM: (15183: R1081 ^value 1)
  25545. =>WM: (15182: R1 ^reward R1081)
  25546. <=WM: (15173: S1 ^operator O2153 +)
  25547. <=WM: (15174: S1 ^operator O2154 +)
  25548. <=WM: (15175: S1 ^operator O2154)
  25549. <=WM: (15172: I3 ^dir U)
  25550. <=WM: (15168: R1 ^reward R1080)
  25551. <=WM: (15171: O2154 ^name predict-no)
  25552. <=WM: (15170: O2153 ^name predict-yes)
  25553. <=WM: (15169: R1080 ^value 1)
  25554. --- Inner Elaboration Phase, active level 1 (S1) ---
  25555. Firing prefer*rvt*predict-yes*H0
  25556. -->
  25557. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25558. -->
  25559. (S1 ^operator O2155 = 0.6623226459114221)
  25560. Firing rl*prefer*rvt*predict-yes*H0*3
  25561. -->
  25562. (S1 ^operator O2155 = 0.337707511486373)
  25563. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25564. -->
  25565. Firing prefer*rvt*predict-no*H0
  25566. -->
  25567. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25568. -->
  25569. (S1 ^operator O2156 = -0.2817060109291377)
  25570. Firing rl*prefer*rvt*predict-no*H0*4
  25571. -->
  25572. (S1 ^operator O2156 = 0.3397908847802913)
  25573. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25574. -->
  25575. inner elaboration loop at bottom goal.
  25576. Retracting rl*prefer*rvt*predict-no*H0*4
  25577. -->
  25578. (S1 ^operator O2154 = 0.3397908847802913)
  25579. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25580. -->
  25581. (S1 ^operator O2154 = -0.2817060109291377)
  25582. Retracting rl*prefer*rvt*predict-yes*H0*3
  25583. -->
  25584. (S1 ^operator O2153 = 0.337707511486373)
  25585. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25586. -->
  25587. (S1 ^operator O2153 = 0.6623226459114221)
  25588. --- END Proposal Phase ---
  25589. --- Decision Phase ---
  25590. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  25591. =>WM: (15189: S1 ^operator O2155)
  25592. 1078: O: O2155 (predict-yes)
  25593. --- END Decision Phase ---
  25594. --- Application Phase ---
  25595. --- Firing Productions (PE) For State At Depth 1 ---
  25596. --- Inner Elaboration Phase, active level 1 (S1) ---
  25597. Firing apply*operator
  25598. -->
  25599. (I3 ^predict-yes N1078 + :O )
  25600. Firing apply*operator*complete
  25601. -->
  25602. (I3 ^predict-no N1077 - :O )
  25603. inner elaboration loop at bottom goal.
  25604. --- Change Working Memory (PE) ---
  25605. =>WM: (15190: I3 ^predict-yes N1078)
  25606. <=WM: (15177: N1077 ^status complete)
  25607. <=WM: (15176: I3 ^predict-no N1077)
  25608. --- Firing Productions (IE) For State At Depth 1 ---
  25609. --- Inner Elaboration Phase, active level 1 (S1) ---
  25610. Firing monitor*world
  25611. -->
  25612. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25613. --- Change Working Memory (IE) ---
  25614. --- END Application Phase ---
  25615. --- Output Phase ---
  25616. ENV: Agent did: predict-yes for direction R in state State-A
  25617. In State-A moving R
  25618. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25619. predict error 0
  25620. dir: dir isU
  25621. --- END Output Phase ---
  25622. /|\--- Input Phase ---
  25623. =>WM: (15194: I2 ^dir U)
  25624. =>WM: (15193: I2 ^reward 1)
  25625. =>WM: (15192: I2 ^see 1)
  25626. =>WM: (15191: N1078 ^status complete)
  25627. <=WM: (15180: I2 ^dir R)
  25628. <=WM: (15179: I2 ^reward 1)
  25629. <=WM: (15178: I2 ^see 0)
  25630. =>WM: (15195: I2 ^level-1 R1-root)
  25631. <=WM: (15181: I2 ^level-1 L0-root)
  25632. --- END Input Phase ---
  25633. --- Proposal Phase ---
  25634. --- Inner Elaboration Phase, active level 1 (S1) ---
  25635. Firing elaborate*copy-see-to-output-link
  25636. -->
  25637. (I3 ^see 1 +)
  25638. Firing elaborate*reward*based*on*reward
  25639. -->
  25640. (R1082 ^value 1 +)
  25641. (R1 ^reward R1082 +)
  25642. Firing propose*predict-yes
  25643. -->
  25644. (O2157 ^name predict-yes +)
  25645. (S1 ^operator O2157 +)
  25646. Firing propose*predict-no
  25647. -->
  25648. (O2158 ^name predict-no +)
  25649. (S1 ^operator O2158 +)
  25650. Firing rl*prefer*rvt*predict-no*H0*2
  25651. -->
  25652. (S1 ^operator O2156 = 1.)
  25653. Firing rl*prefer*rvt*predict-yes*H0*1
  25654. -->
  25655. (S1 ^operator O2155 = 0.)
  25656. Firing prefer*rvt*predict-yes*H0
  25657. -->
  25658. Firing prefer*rvt*predict-no*H0
  25659. -->
  25660. Firing elaborate*copy-dir-to-output-link
  25661. -->
  25662. (I3 ^dir U +)
  25663. inner elaboration loop at bottom goal.
  25664. Retracting elaborate*copy-see-to-output-link
  25665. -->
  25666. (I3 ^see 0 +)
  25667. Retracting propose*predict-no
  25668. -->
  25669. (O2156 ^name predict-no +)
  25670. (S1 ^operator O2156 +)
  25671. Retracting propose*predict-yes
  25672. -->
  25673. (O2155 ^name predict-yes +)
  25674. (S1 ^operator O2155 +)
  25675. Retracting elaborate*reward*based*on*reward
  25676. -->
  25677. (R1081 ^value 1 +)
  25678. (R1 ^reward R1081 +)
  25679. Retracting elaborate*copy-dir-to-output-link
  25680. -->
  25681. (I3 ^dir R +)
  25682. Retracting rl*prefer*rvt*predict-no*H0*4
  25683. -->
  25684. (S1 ^operator O2156 = 0.3397908847802913)
  25685. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  25686. -->
  25687. (S1 ^operator O2156 = -0.2817060109291377)
  25688. Retracting rl*prefer*rvt*predict-yes*H0*3
  25689. -->
  25690. (S1 ^operator O2155 = 0.337707511486373)
  25691. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  25692. -->
  25693. (S1 ^operator O2155 = 0.6623226459114221)
  25694. =>WM: (15203: S1 ^operator O2158 +)
  25695. =>WM: (15202: S1 ^operator O2157 +)
  25696. =>WM: (15201: I3 ^dir U)
  25697. =>WM: (15200: O2158 ^name predict-no)
  25698. =>WM: (15199: O2157 ^name predict-yes)
  25699. =>WM: (15198: R1082 ^value 1)
  25700. =>WM: (15197: R1 ^reward R1082)
  25701. =>WM: (15196: I3 ^see 1)
  25702. <=WM: (15187: S1 ^operator O2155 +)
  25703. <=WM: (15189: S1 ^operator O2155)
  25704. <=WM: (15188: S1 ^operator O2156 +)
  25705. <=WM: (15186: I3 ^dir R)
  25706. <=WM: (15182: R1 ^reward R1081)
  25707. <=WM: (15141: I3 ^see 0)
  25708. <=WM: (15185: O2156 ^name predict-no)
  25709. <=WM: (15184: O2155 ^name predict-yes)
  25710. <=WM: (15183: R1081 ^value 1)
  25711. --- Inner Elaboration Phase, active level 1 (S1) ---
  25712. Firing prefer*rvt*predict-yes*H0
  25713. -->
  25714. Firing rl*prefer*rvt*predict-yes*H0*1
  25715. -->
  25716. (S1 ^operator O2157 = 0.)
  25717. Firing prefer*rvt*predict-no*H0
  25718. -->
  25719. Firing rl*prefer*rvt*predict-no*H0*2
  25720. -->
  25721. (S1 ^operator O2158 = 1.)
  25722. inner elaboration loop at bottom goal.
  25723. Retracting rl*prefer*rvt*predict-no*H0*2
  25724. -->
  25725. (S1 ^operator O2156 = 1.)
  25726. Retracting rl*prefer*rvt*predict-yes*H0*1
  25727. -->
  25728. (S1 ^operator O2155 = 0.)
  25729. --- END Proposal Phase ---
  25730. --- Decision Phase ---
  25731. RL update rl*prefer*rvt*predict-yes*H0*3 0.590107 -0.2524 0.337708 -> 0.590104 -0.252399 0.337705(R,m,v=1,0.904494,0.0868723)
  25732. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409928 0.252395 0.662323 -> 0.409924 0.252395 0.66232(R,m,v=1,1,0)
  25733. =>WM: (15204: S1 ^operator O2158)
  25734. 1079: O: O2158 (predict-no)
  25735. --- END Decision Phase ---
  25736. --- Application Phase ---
  25737. --- Firing Productions (PE) For State At Depth 1 ---
  25738. --- Inner Elaboration Phase, active level 1 (S1) ---
  25739. Firing apply*operator
  25740. -->
  25741. (I3 ^predict-no N1079 + :O )
  25742. Firing apply*operator*complete
  25743. -->
  25744. (I3 ^predict-yes N1078 - :O )
  25745. inner elaboration loop at bottom goal.
  25746. --- Change Working Memory (PE) ---
  25747. =>WM: (15205: I3 ^predict-no N1079)
  25748. <=WM: (15191: N1078 ^status complete)
  25749. <=WM: (15190: I3 ^predict-yes N1078)
  25750. --- Firing Productions (IE) For State At Depth 1 ---
  25751. --- Inner Elaboration Phase, active level 1 (S1) ---
  25752. Firing monitor*world
  25753. -->
  25754. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25755. --- Change Working Memory (IE) ---
  25756. --- END Application Phase ---
  25757. --- Output Phase ---
  25758. ENV: Agent did: predict-no for direction U in state State-B
  25759. In State-B moving U
  25760. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25761. predict error 0
  25762. dir: dir isR
  25763. --- END Output Phase ---
  25764. -/|--- Input Phase ---
  25765. =>WM: (15209: I2 ^dir R)
  25766. =>WM: (15208: I2 ^reward 1)
  25767. =>WM: (15207: I2 ^see 0)
  25768. =>WM: (15206: N1079 ^status complete)
  25769. <=WM: (15194: I2 ^dir U)
  25770. <=WM: (15193: I2 ^reward 1)
  25771. <=WM: (15192: I2 ^see 1)
  25772. =>WM: (15210: I2 ^level-1 R1-root)
  25773. <=WM: (15195: I2 ^level-1 R1-root)
  25774. --- END Input Phase ---
  25775. --- Proposal Phase ---
  25776. --- Inner Elaboration Phase, active level 1 (S1) ---
  25777. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  25778. -->
  25779. (S1 ^operator O2157 = -0.1070236389116304)
  25780. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  25781. -->
  25782. (S1 ^operator O2158 = 0.6602331636337839)
  25783. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25784. -->
  25785. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25786. -->
  25787. Firing elaborate*copy-see-to-output-link
  25788. -->
  25789. (I3 ^see 0 +)
  25790. Firing elaborate*reward*based*on*reward
  25791. -->
  25792. (R1083 ^value 1 +)
  25793. (R1 ^reward R1083 +)
  25794. Firing propose*predict-yes
  25795. -->
  25796. (O2159 ^name predict-yes +)
  25797. (S1 ^operator O2159 +)
  25798. Firing propose*predict-no
  25799. -->
  25800. (O2160 ^name predict-no +)
  25801. (S1 ^operator O2160 +)
  25802. Firing rl*prefer*rvt*predict-no*H0*4
  25803. -->
  25804. (S1 ^operator O2158 = 0.3397908847802913)
  25805. Firing rl*prefer*rvt*predict-yes*H0*3
  25806. -->
  25807. (S1 ^operator O2157 = 0.3377050722714159)
  25808. Firing prefer*rvt*predict-yes*H0
  25809. -->
  25810. Firing prefer*rvt*predict-no*H0
  25811. -->
  25812. Firing elaborate*copy-dir-to-output-link
  25813. -->
  25814. (I3 ^dir R +)
  25815. inner elaboration loop at bottom goal.
  25816. Retracting elaborate*copy-see-to-output-link
  25817. -->
  25818. (I3 ^see 1 +)
  25819. Retracting propose*predict-no
  25820. -->
  25821. (O2158 ^name predict-no +)
  25822. (S1 ^operator O2158 +)
  25823. Retracting propose*predict-yes
  25824. -->
  25825. (O2157 ^name predict-yes +)
  25826. (S1 ^operator O2157 +)
  25827. Retracting elaborate*reward*based*on*reward
  25828. -->
  25829. (R1082 ^value 1 +)
  25830. (R1 ^reward R1082 +)
  25831. Retracting elaborate*copy-dir-to-output-link
  25832. -->
  25833. (I3 ^dir U +)
  25834. Retracting rl*prefer*rvt*predict-no*H0*2
  25835. -->
  25836. (S1 ^operator O2158 = 1.)
  25837. Retracting rl*prefer*rvt*predict-yes*H0*1
  25838. -->
  25839. (S1 ^operator O2157 = 0.)
  25840. =>WM: (15218: S1 ^operator O2160 +)
  25841. =>WM: (15217: S1 ^operator O2159 +)
  25842. =>WM: (15216: I3 ^dir R)
  25843. =>WM: (15215: O2160 ^name predict-no)
  25844. =>WM: (15214: O2159 ^name predict-yes)
  25845. =>WM: (15213: R1083 ^value 1)
  25846. =>WM: (15212: R1 ^reward R1083)
  25847. =>WM: (15211: I3 ^see 0)
  25848. <=WM: (15202: S1 ^operator O2157 +)
  25849. <=WM: (15203: S1 ^operator O2158 +)
  25850. <=WM: (15204: S1 ^operator O2158)
  25851. <=WM: (15201: I3 ^dir U)
  25852. <=WM: (15197: R1 ^reward R1082)
  25853. <=WM: (15196: I3 ^see 1)
  25854. <=WM: (15200: O2158 ^name predict-no)
  25855. <=WM: (15199: O2157 ^name predict-yes)
  25856. <=WM: (15198: R1082 ^value 1)
  25857. --- Inner Elaboration Phase, active level 1 (S1) ---
  25858. Firing prefer*rvt*predict-yes*H0
  25859. -->
  25860. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  25861. -->
  25862. (S1 ^operator O2159 = -0.1070236389116304)
  25863. Firing rl*prefer*rvt*predict-yes*H0*3
  25864. -->
  25865. (S1 ^operator O2159 = 0.3377050722714159)
  25866. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25867. -->
  25868. Firing prefer*rvt*predict-no*H0
  25869. -->
  25870. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  25871. -->
  25872. (S1 ^operator O2160 = 0.6602331636337839)
  25873. Firing rl*prefer*rvt*predict-no*H0*4
  25874. -->
  25875. (S1 ^operator O2160 = 0.3397908847802913)
  25876. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25877. -->
  25878. inner elaboration loop at bottom goal.
  25879. Retracting rl*prefer*rvt*predict-no*H0*4
  25880. -->
  25881. (S1 ^operator O2158 = 0.3397908847802913)
  25882. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  25883. -->
  25884. (S1 ^operator O2158 = 0.6602331636337839)
  25885. Retracting rl*prefer*rvt*predict-yes*H0*3
  25886. -->
  25887. (S1 ^operator O2157 = 0.3377050722714159)
  25888. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  25889. -->
  25890. (S1 ^operator O2157 = -0.1070236389116304)
  25891. --- END Proposal Phase ---
  25892. --- Decision Phase ---
  25893. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  25894. =>WM: (15219: S1 ^operator O2160)
  25895. 1080: O: O2160 (predict-no)
  25896. --- END Decision Phase ---
  25897. --- Application Phase ---
  25898. --- Firing Productions (PE) For State At Depth 1 ---
  25899. --- Inner Elaboration Phase, active level 1 (S1) ---
  25900. Firing apply*operator
  25901. -->
  25902. (I3 ^predict-no N1080 + :O )
  25903. Firing apply*operator*complete
  25904. -->
  25905. (I3 ^predict-no N1079 - :O )
  25906. inner elaboration loop at bottom goal.
  25907. --- Change Working Memory (PE) ---
  25908. =>WM: (15220: I3 ^predict-no N1080)
  25909. <=WM: (15206: N1079 ^status complete)
  25910. <=WM: (15205: I3 ^predict-no N1079)
  25911. --- Firing Productions (IE) For State At Depth 1 ---
  25912. --- Inner Elaboration Phase, active level 1 (S1) ---
  25913. Firing monitor*world
  25914. -->
  25915. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25916. --- Change Working Memory (IE) ---
  25917. --- END Application Phase ---
  25918. --- Output Phase ---
  25919. ENV: Agent did: predict-no for direction R in state State-B
  25920. In State-B moving R
  25921. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25922. predict error 0
  25923. dir: dir isR
  25924. --- END Output Phase ---
  25925. \--- Input Phase ---
  25926. =>WM: (15224: I2 ^dir R)
  25927. =>WM: (15223: I2 ^reward 1)
  25928. =>WM: (15222: I2 ^see 0)
  25929. =>WM: (15221: N1080 ^status complete)
  25930. <=WM: (15209: I2 ^dir R)
  25931. <=WM: (15208: I2 ^reward 1)
  25932. <=WM: (15207: I2 ^see 0)
  25933. =>WM: (15225: I2 ^level-1 R0-root)
  25934. <=WM: (15210: I2 ^level-1 R1-root)
  25935. --- END Input Phase ---
  25936. --- Proposal Phase ---
  25937. --- Inner Elaboration Phase, active level 1 (S1) ---
  25938. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  25939. -->
  25940. (S1 ^operator O2160 = 0.660190792670301)
  25941. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  25942. -->
  25943. (S1 ^operator O2159 = -0.1028953566115423)
  25944. Firing prefer*rvt*predict-no*H0*4*v1*H1
  25945. -->
  25946. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  25947. -->
  25948. Firing elaborate*copy-see-to-output-link
  25949. -->
  25950. (I3 ^see 0 +)
  25951. Firing elaborate*reward*based*on*reward
  25952. -->
  25953. (R1084 ^value 1 +)
  25954. (R1 ^reward R1084 +)
  25955. Firing propose*predict-yes
  25956. -->
  25957. (O2161 ^name predict-yes +)
  25958. (S1 ^operator O2161 +)
  25959. Firing propose*predict-no
  25960. -->
  25961. (O2162 ^name predict-no +)
  25962. (S1 ^operator O2162 +)
  25963. Firing rl*prefer*rvt*predict-no*H0*4
  25964. -->
  25965. (S1 ^operator O2160 = 0.3397908847802913)
  25966. Firing rl*prefer*rvt*predict-yes*H0*3
  25967. -->
  25968. (S1 ^operator O2159 = 0.3377050722714159)
  25969. Firing prefer*rvt*predict-yes*H0
  25970. -->
  25971. Firing prefer*rvt*predict-no*H0
  25972. -->
  25973. Firing elaborate*copy-dir-to-output-link
  25974. -->
  25975. (I3 ^dir R +)
  25976. inner elaboration loop at bottom goal.
  25977. Retracting elaborate*copy-see-to-output-link
  25978. -->
  25979. (I3 ^see 0 +)
  25980. Retracting propose*predict-no
  25981. -->
  25982. (O2160 ^name predict-no +)
  25983. (S1 ^operator O2160 +)
  25984. Retracting propose*predict-yes
  25985. -->
  25986. (O2159 ^name predict-yes +)
  25987. (S1 ^operator O2159 +)
  25988. Retracting elaborate*reward*based*on*reward
  25989. -->
  25990. (R1083 ^value 1 +)
  25991. (R1 ^reward R1083 +)
  25992. Retracting elaborate*copy-dir-to-output-link
  25993. -->
  25994. (I3 ^dir R +)
  25995. Retracting rl*prefer*rvt*predict-no*H0*4
  25996. -->
  25997. (S1 ^operator O2160 = 0.3397908847802913)
  25998. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  25999. -->
  26000. (S1 ^operator O2160 = 0.6602331636337839)
  26001. Retracting rl*prefer*rvt*predict-yes*H0*3
  26002. -->
  26003. (S1 ^operator O2159 = 0.3377050722714159)
  26004. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  26005. -->
  26006. (S1 ^operator O2159 = -0.1070236389116304)
  26007. =>WM: (15231: S1 ^operator O2162 +)
  26008. =>WM: (15230: S1 ^operator O2161 +)
  26009. =>WM: (15229: O2162 ^name predict-no)
  26010. =>WM: (15228: O2161 ^name predict-yes)
  26011. =>WM: (15227: R1084 ^value 1)
  26012. =>WM: (15226: R1 ^reward R1084)
  26013. <=WM: (15217: S1 ^operator O2159 +)
  26014. <=WM: (15218: S1 ^operator O2160 +)
  26015. <=WM: (15219: S1 ^operator O2160)
  26016. <=WM: (15212: R1 ^reward R1083)
  26017. <=WM: (15215: O2160 ^name predict-no)
  26018. <=WM: (15214: O2159 ^name predict-yes)
  26019. <=WM: (15213: R1083 ^value 1)
  26020. --- Inner Elaboration Phase, active level 1 (S1) ---
  26021. Firing prefer*rvt*predict-yes*H0
  26022. -->
  26023. Firing rl*prefer*rvt*predict-yes*H0*3
  26024. -->
  26025. (S1 ^operator O2161 = 0.3377050722714159)
  26026. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  26027. -->
  26028. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  26029. -->
  26030. (S1 ^operator O2161 = -0.1028953566115423)
  26031. Firing prefer*rvt*predict-no*H0
  26032. -->
  26033. Firing rl*prefer*rvt*predict-no*H0*4
  26034. -->
  26035. (S1 ^operator O2162 = 0.3397908847802913)
  26036. Firing prefer*rvt*predict-no*H0*4*v1*H1
  26037. -->
  26038. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  26039. -->
  26040. (S1 ^operator O2162 = 0.660190792670301)
  26041. inner elaboration loop at bottom goal.
  26042. Retracting rl*prefer*rvt*predict-no*H0*4
  26043. -->
  26044. (S1 ^operator O2160 = 0.3397908847802913)
  26045. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  26046. -->
  26047. (S1 ^operator O2160 = 0.660190792670301)
  26048. Retracting rl*prefer*rvt*predict-yes*H0*3
  26049. -->
  26050. (S1 ^operator O2159 = 0.3377050722714159)
  26051. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  26052. -->
  26053. (S1 ^operator O2159 = -0.1028953566115423)
  26054. --- END Proposal Phase ---
  26055. --- Decision Phase ---
  26056. RL update rl*prefer*rvt*predict-no*H0*4 0.570275 -0.230484 0.339791 -> 0.570273 -0.230484 0.339789(R,m,v=1,0.885246,0.102144)
  26057. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.42975 0.230483 0.660233 -> 0.429747 0.230483 0.660231(R,m,v=1,1,0)
  26058. =>WM: (15232: S1 ^operator O2162)
  26059. 1081: O: O2162 (predict-no)
  26060. --- END Decision Phase ---
  26061. --- Application Phase ---
  26062. --- Firing Productions (PE) For State At Depth 1 ---
  26063. --- Inner Elaboration Phase, active level 1 (S1) ---
  26064. Firing apply*operator
  26065. -->
  26066. (I3 ^predict-no N1081 + :O )
  26067. Firing apply*operator*complete
  26068. -->
  26069. (I3 ^predict-no N1080 - :O )
  26070. inner elaboration loop at bottom goal.
  26071. --- Change Working Memory (PE) ---
  26072. =>WM: (15233: I3 ^predict-no N1081)
  26073. <=WM: (15221: N1080 ^status complete)
  26074. <=WM: (15220: I3 ^predict-no N1080)
  26075. --- Firing Productions (IE) For State At Depth 1 ---
  26076. --- Inner Elaboration Phase, active level 1 (S1) ---
  26077. Firing monitor*world
  26078. -->
  26079. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26080. --- Change Working Memory (IE) ---
  26081. --- END Application Phase ---
  26082. --- Output Phase ---
  26083. ENV: Agent did: predict-no for direction R in state State-B
  26084. In State-B moving R
  26085. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26086. predict error 0
  26087. dir: dir isU
  26088. --- END Output Phase ---
  26089. ---- Input Phase ---
  26090. =>WM: (15237: I2 ^dir U)
  26091. =>WM: (15236: I2 ^reward 1)
  26092. =>WM: (15235: I2 ^see 0)
  26093. =>WM: (15234: N1081 ^status complete)
  26094. <=WM: (15224: I2 ^dir R)
  26095. <=WM: (15223: I2 ^reward 1)
  26096. <=WM: (15222: I2 ^see 0)
  26097. =>WM: (15238: I2 ^level-1 R0-root)
  26098. <=WM: (15225: I2 ^level-1 R0-root)
  26099. --- END Input Phase ---
  26100. --- Proposal Phase ---
  26101. --- Inner Elaboration Phase, active level 1 (S1) ---
  26102. Firing elaborate*copy-see-to-output-link
  26103. -->
  26104. (I3 ^see 0 +)
  26105. Firing elaborate*reward*based*on*reward
  26106. -->
  26107. (R1085 ^value 1 +)
  26108. (R1 ^reward R1085 +)
  26109. Firing propose*predict-yes
  26110. -->
  26111. (O2163 ^name predict-yes +)
  26112. (S1 ^operator O2163 +)
  26113. Firing propose*predict-no
  26114. -->
  26115. (O2164 ^name predict-no +)
  26116. (S1 ^operator O2164 +)
  26117. Firing rl*prefer*rvt*predict-no*H0*2
  26118. -->
  26119. (S1 ^operator O2162 = 1.)
  26120. Firing rl*prefer*rvt*predict-yes*H0*1
  26121. -->
  26122. (S1 ^operator O2161 = 0.)
  26123. Firing prefer*rvt*predict-yes*H0
  26124. -->
  26125. Firing prefer*rvt*predict-no*H0
  26126. -->
  26127. Firing elaborate*copy-dir-to-output-link
  26128. -->
  26129. (I3 ^dir U +)
  26130. inner elaboration loop at bottom goal.
  26131. Retracting elaborate*copy-see-to-output-link
  26132. -->
  26133. (I3 ^see 0 +)
  26134. Retracting propose*predict-no
  26135. -->
  26136. (O2162 ^name predict-no +)
  26137. (S1 ^operator O2162 +)
  26138. Retracting propose*predict-yes
  26139. -->
  26140. (O2161 ^name predict-yes +)
  26141. (S1 ^operator O2161 +)
  26142. Retracting elaborate*reward*based*on*reward
  26143. -->
  26144. (R1084 ^value 1 +)
  26145. (R1 ^reward R1084 +)
  26146. Retracting elaborate*copy-dir-to-output-link
  26147. -->
  26148. (I3 ^dir R +)
  26149. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  26150. -->
  26151. (S1 ^operator O2162 = 0.660190792670301)
  26152. Retracting rl*prefer*rvt*predict-no*H0*4
  26153. -->
  26154. (S1 ^operator O2162 = 0.3397889483548382)
  26155. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  26156. -->
  26157. (S1 ^operator O2161 = -0.1028953566115423)
  26158. Retracting rl*prefer*rvt*predict-yes*H0*3
  26159. -->
  26160. (S1 ^operator O2161 = 0.3377050722714159)
  26161. =>WM: (15245: S1 ^operator O2164 +)
  26162. =>WM: (15244: S1 ^operator O2163 +)
  26163. =>WM: (15243: I3 ^dir U)
  26164. =>WM: (15242: O2164 ^name predict-no)
  26165. =>WM: (15241: O2163 ^name predict-yes)
  26166. =>WM: (15240: R1085 ^value 1)
  26167. =>WM: (15239: R1 ^reward R1085)
  26168. <=WM: (15230: S1 ^operator O2161 +)
  26169. <=WM: (15231: S1 ^operator O2162 +)
  26170. <=WM: (15232: S1 ^operator O2162)
  26171. <=WM: (15216: I3 ^dir R)
  26172. <=WM: (15226: R1 ^reward R1084)
  26173. <=WM: (15229: O2162 ^name predict-no)
  26174. <=WM: (15228: O2161 ^name predict-yes)
  26175. <=WM: (15227: R1084 ^value 1)
  26176. --- Inner Elaboration Phase, active level 1 (S1) ---
  26177. Firing prefer*rvt*predict-yes*H0
  26178. -->
  26179. Firing rl*prefer*rvt*predict-yes*H0*1
  26180. -->
  26181. (S1 ^operator O2163 = 0.)
  26182. Firing prefer*rvt*predict-no*H0
  26183. -->
  26184. Firing rl*prefer*rvt*predict-no*H0*2
  26185. -->
  26186. (S1 ^operator O2164 = 1.)
  26187. inner elaboration loop at bottom goal.
  26188. Retracting rl*prefer*rvt*predict-no*H0*2
  26189. -->
  26190. (S1 ^operator O2162 = 1.)
  26191. Retracting rl*prefer*rvt*predict-yes*H0*1
  26192. -->
  26193. (S1 ^operator O2161 = 0.)
  26194. --- END Proposal Phase ---
  26195. --- Decision Phase ---
  26196. RL update rl*prefer*rvt*predict-no*H0*4 0.570273 -0.230484 0.339789 -> 0.570275 -0.230484 0.339791(R,m,v=1,0.88587,0.101657)
  26197. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429705 0.230485 0.660191 -> 0.429707 0.230485 0.660193(R,m,v=1,1,0)
  26198. =>WM: (15246: S1 ^operator O2164)
  26199. 1082: O: O2164 (predict-no)
  26200. --- END Decision Phase ---
  26201. --- Application Phase ---
  26202. --- Firing Productions (PE) For State At Depth 1 ---
  26203. --- Inner Elaboration Phase, active level 1 (S1) ---
  26204. Firing apply*operator
  26205. -->
  26206. (I3 ^predict-no N1082 + :O )
  26207. Firing apply*operator*complete
  26208. -->
  26209. (I3 ^predict-no N1081 - :O )
  26210. inner elaboration loop at bottom goal.
  26211. --- Change Working Memory (PE) ---
  26212. =>WM: (15247: I3 ^predict-no N1082)
  26213. <=WM: (15234: N1081 ^status complete)
  26214. <=WM: (15233: I3 ^predict-no N1081)
  26215. --- Firing Productions (IE) For State At Depth 1 ---
  26216. --- Inner Elaboration Phase, active level 1 (S1) ---
  26217. Firing monitor*world
  26218. -->
  26219. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26220. --- Change Working Memory (IE) ---
  26221. --- END Application Phase ---
  26222. --- Output Phase ---
  26223. ENV: Agent did: predict-no for direction U in state State-B
  26224. In State-B moving U
  26225. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26226. predict error 0
  26227. dir: dir isL
  26228. --- END Output Phase ---
  26229. /|\--- Input Phase ---
  26230. =>WM: (15251: I2 ^dir L)
  26231. =>WM: (15250: I2 ^reward 1)
  26232. =>WM: (15249: I2 ^see 0)
  26233. =>WM: (15248: N1082 ^status complete)
  26234. <=WM: (15237: I2 ^dir U)
  26235. <=WM: (15236: I2 ^reward 1)
  26236. <=WM: (15235: I2 ^see 0)
  26237. =>WM: (15252: I2 ^level-1 R0-root)
  26238. <=WM: (15238: I2 ^level-1 R0-root)
  26239. --- END Input Phase ---
  26240. --- Proposal Phase ---
  26241. --- Inner Elaboration Phase, active level 1 (S1) ---
  26242. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  26243. -->
  26244. (S1 ^operator O2163 = 0.7359077268568859)
  26245. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26246. -->
  26247. Firing elaborate*copy-see-to-output-link
  26248. -->
  26249. (I3 ^see 0 +)
  26250. Firing elaborate*reward*based*on*reward
  26251. -->
  26252. (R1086 ^value 1 +)
  26253. (R1 ^reward R1086 +)
  26254. Firing propose*predict-yes
  26255. -->
  26256. (O2165 ^name predict-yes +)
  26257. (S1 ^operator O2165 +)
  26258. Firing propose*predict-no
  26259. -->
  26260. (O2166 ^name predict-no +)
  26261. (S1 ^operator O2166 +)
  26262. Firing rl*prefer*rvt*predict-no*H0*6
  26263. -->
  26264. (S1 ^operator O2164 = 0.9726708564453506)
  26265. Firing rl*prefer*rvt*predict-yes*H0*5
  26266. -->
  26267. (S1 ^operator O2163 = 0.2640006890267754)
  26268. Firing prefer*rvt*predict-yes*H0
  26269. -->
  26270. Firing prefer*rvt*predict-no*H0
  26271. -->
  26272. Firing elaborate*copy-dir-to-output-link
  26273. -->
  26274. (I3 ^dir L +)
  26275. inner elaboration loop at bottom goal.
  26276. Retracting elaborate*copy-see-to-output-link
  26277. -->
  26278. (I3 ^see 0 +)
  26279. Retracting propose*predict-no
  26280. -->
  26281. (O2164 ^name predict-no +)
  26282. (S1 ^operator O2164 +)
  26283. Retracting propose*predict-yes
  26284. -->
  26285. (O2163 ^name predict-yes +)
  26286. (S1 ^operator O2163 +)
  26287. Retracting elaborate*reward*based*on*reward
  26288. -->
  26289. (R1085 ^value 1 +)
  26290. (R1 ^reward R1085 +)
  26291. Retracting elaborate*copy-dir-to-output-link
  26292. -->
  26293. (I3 ^dir U +)
  26294. Retracting rl*prefer*rvt*predict-no*H0*2
  26295. -->
  26296. (S1 ^operator O2164 = 1.)
  26297. Retracting rl*prefer*rvt*predict-yes*H0*1
  26298. -->
  26299. (S1 ^operator O2163 = 0.)
  26300. =>WM: (15259: S1 ^operator O2166 +)
  26301. =>WM: (15258: S1 ^operator O2165 +)
  26302. =>WM: (15257: I3 ^dir L)
  26303. =>WM: (15256: O2166 ^name predict-no)
  26304. =>WM: (15255: O2165 ^name predict-yes)
  26305. =>WM: (15254: R1086 ^value 1)
  26306. =>WM: (15253: R1 ^reward R1086)
  26307. <=WM: (15244: S1 ^operator O2163 +)
  26308. <=WM: (15245: S1 ^operator O2164 +)
  26309. <=WM: (15246: S1 ^operator O2164)
  26310. <=WM: (15243: I3 ^dir U)
  26311. <=WM: (15239: R1 ^reward R1085)
  26312. <=WM: (15242: O2164 ^name predict-no)
  26313. <=WM: (15241: O2163 ^name predict-yes)
  26314. <=WM: (15240: R1085 ^value 1)
  26315. --- Inner Elaboration Phase, active level 1 (S1) ---
  26316. Firing prefer*rvt*predict-yes*H0
  26317. -->
  26318. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  26319. -->
  26320. (S1 ^operator O2165 = 0.7359077268568859)
  26321. Firing rl*prefer*rvt*predict-yes*H0*5
  26322. -->
  26323. (S1 ^operator O2165 = 0.2640006890267754)
  26324. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26325. -->
  26326. Firing prefer*rvt*predict-no*H0
  26327. -->
  26328. Firing rl*prefer*rvt*predict-no*H0*6
  26329. -->
  26330. (S1 ^operator O2166 = 0.9726708564453506)
  26331. inner elaboration loop at bottom goal.
  26332. Retracting rl*prefer*rvt*predict-no*H0*6
  26333. -->
  26334. (S1 ^operator O2164 = 0.9726708564453506)
  26335. Retracting rl*prefer*rvt*predict-yes*H0*5
  26336. -->
  26337. (S1 ^operator O2163 = 0.2640006890267754)
  26338. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  26339. -->
  26340. (S1 ^operator O2163 = 0.7359077268568859)
  26341. --- END Proposal Phase ---
  26342. --- Decision Phase ---
  26343. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  26344. =>WM: (15260: S1 ^operator O2165)
  26345. 1083: O: O2165 (predict-yes)
  26346. --- END Decision Phase ---
  26347. --- Application Phase ---
  26348. --- Firing Productions (PE) For State At Depth 1 ---
  26349. --- Inner Elaboration Phase, active level 1 (S1) ---
  26350. Firing apply*operator
  26351. -->
  26352. (I3 ^predict-yes N1083 + :O )
  26353. Firing apply*operator*complete
  26354. -->
  26355. (I3 ^predict-no N1082 - :O )
  26356. inner elaboration loop at bottom goal.
  26357. --- Change Working Memory (PE) ---
  26358. =>WM: (15261: I3 ^predict-yes N1083)
  26359. <=WM: (15248: N1082 ^status complete)
  26360. <=WM: (15247: I3 ^predict-no N1082)
  26361. --- Firing Productions (IE) For State At Depth 1 ---
  26362. --- Inner Elaboration Phase, active level 1 (S1) ---
  26363. Firing monitor*world
  26364. -->
  26365. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26366. --- Change Working Memory (IE) ---
  26367. --- END Application Phase ---
  26368. --- Output Phase ---
  26369. ENV: Agent did: predict-yes for direction L in state State-B
  26370. In State-B moving L
  26371. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26372. predict error 0
  26373. dir: dir isR
  26374. --- END Output Phase ---
  26375. -/|--- Input Phase ---
  26376. =>WM: (15265: I2 ^dir R)
  26377. =>WM: (15264: I2 ^reward 1)
  26378. =>WM: (15263: I2 ^see 1)
  26379. =>WM: (15262: N1083 ^status complete)
  26380. <=WM: (15251: I2 ^dir L)
  26381. <=WM: (15250: I2 ^reward 1)
  26382. <=WM: (15249: I2 ^see 0)
  26383. =>WM: (15266: I2 ^level-1 L1-root)
  26384. <=WM: (15252: I2 ^level-1 R0-root)
  26385. --- END Input Phase ---
  26386. --- Proposal Phase ---
  26387. --- Inner Elaboration Phase, active level 1 (S1) ---
  26388. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  26389. -->
  26390. (S1 ^operator O2166 = -0.2714224023553999)
  26391. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  26392. -->
  26393. (S1 ^operator O2165 = 0.6622440710216861)
  26394. Firing prefer*rvt*predict-no*H0*4*v1*H1
  26395. -->
  26396. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  26397. -->
  26398. Firing elaborate*copy-see-to-output-link
  26399. -->
  26400. (I3 ^see 1 +)
  26401. Firing elaborate*reward*based*on*reward
  26402. -->
  26403. (R1087 ^value 1 +)
  26404. (R1 ^reward R1087 +)
  26405. Firing propose*predict-yes
  26406. -->
  26407. (O2167 ^name predict-yes +)
  26408. (S1 ^operator O2167 +)
  26409. Firing propose*predict-no
  26410. -->
  26411. (O2168 ^name predict-no +)
  26412. (S1 ^operator O2168 +)
  26413. Firing rl*prefer*rvt*predict-no*H0*4
  26414. -->
  26415. (S1 ^operator O2166 = 0.339790578216807)
  26416. Firing rl*prefer*rvt*predict-yes*H0*3
  26417. -->
  26418. (S1 ^operator O2165 = 0.3377050722714159)
  26419. Firing prefer*rvt*predict-yes*H0
  26420. -->
  26421. Firing prefer*rvt*predict-no*H0
  26422. -->
  26423. Firing elaborate*copy-dir-to-output-link
  26424. -->
  26425. (I3 ^dir R +)
  26426. inner elaboration loop at bottom goal.
  26427. Retracting elaborate*copy-see-to-output-link
  26428. -->
  26429. (I3 ^see 0 +)
  26430. Retracting propose*predict-no
  26431. -->
  26432. (O2166 ^name predict-no +)
  26433. (S1 ^operator O2166 +)
  26434. Retracting propose*predict-yes
  26435. -->
  26436. (O2165 ^name predict-yes +)
  26437. (S1 ^operator O2165 +)
  26438. Retracting elaborate*reward*based*on*reward
  26439. -->
  26440. (R1086 ^value 1 +)
  26441. (R1 ^reward R1086 +)
  26442. Retracting elaborate*copy-dir-to-output-link
  26443. -->
  26444. (I3 ^dir L +)
  26445. Retracting rl*prefer*rvt*predict-no*H0*6
  26446. -->
  26447. (S1 ^operator O2166 = 0.9726708564453506)
  26448. Retracting rl*prefer*rvt*predict-yes*H0*5
  26449. -->
  26450. (S1 ^operator O2165 = 0.2640006890267754)
  26451. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  26452. -->
  26453. (S1 ^operator O2165 = 0.7359077268568859)
  26454. =>WM: (15274: S1 ^operator O2168 +)
  26455. =>WM: (15273: S1 ^operator O2167 +)
  26456. =>WM: (15272: I3 ^dir R)
  26457. =>WM: (15271: O2168 ^name predict-no)
  26458. =>WM: (15270: O2167 ^name predict-yes)
  26459. =>WM: (15269: R1087 ^value 1)
  26460. =>WM: (15268: R1 ^reward R1087)
  26461. =>WM: (15267: I3 ^see 1)
  26462. <=WM: (15258: S1 ^operator O2165 +)
  26463. <=WM: (15260: S1 ^operator O2165)
  26464. <=WM: (15259: S1 ^operator O2166 +)
  26465. <=WM: (15257: I3 ^dir L)
  26466. <=WM: (15253: R1 ^reward R1086)
  26467. <=WM: (15211: I3 ^see 0)
  26468. <=WM: (15256: O2166 ^name predict-no)
  26469. <=WM: (15255: O2165 ^name predict-yes)
  26470. <=WM: (15254: R1086 ^value 1)
  26471. --- Inner Elaboration Phase, active level 1 (S1) ---
  26472. Firing prefer*rvt*predict-yes*H0
  26473. -->
  26474. Firing rl*prefer*rvt*predict-yes*H0*3
  26475. -->
  26476. (S1 ^operator O2167 = 0.3377050722714159)
  26477. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  26478. -->
  26479. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  26480. -->
  26481. (S1 ^operator O2167 = 0.6622440710216861)
  26482. Firing prefer*rvt*predict-no*H0
  26483. -->
  26484. Firing rl*prefer*rvt*predict-no*H0*4
  26485. -->
  26486. (S1 ^operator O2168 = 0.339790578216807)
  26487. Firing prefer*rvt*predict-no*H0*4*v1*H1
  26488. -->
  26489. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  26490. -->
  26491. (S1 ^operator O2168 = -0.2714224023553999)
  26492. inner elaboration loop at bottom goal.
  26493. Retracting rl*prefer*rvt*predict-no*H0*4
  26494. -->
  26495. (S1 ^operator O2166 = 0.339790578216807)
  26496. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  26497. -->
  26498. (S1 ^operator O2166 = -0.2714224023553999)
  26499. Retracting rl*prefer*rvt*predict-yes*H0*3
  26500. -->
  26501. (S1 ^operator O2165 = 0.3377050722714159)
  26502. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  26503. -->
  26504. (S1 ^operator O2165 = 0.6622440710216861)
  26505. --- END Proposal Phase ---
  26506. --- Decision Phase ---
  26507. RL update rl*prefer*rvt*predict-yes*H0*5 0.554387 -0.290386 0.264001 -> 0.554394 -0.290386 0.264008(R,m,v=1,0.883598,0.1034)
  26508. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445523 0.290384 0.735908 -> 0.445532 0.290385 0.735916(R,m,v=1,1,0)
  26509. =>WM: (15275: S1 ^operator O2167)
  26510. 1084: O: O2167 (predict-yes)
  26511. --- END Decision Phase ---
  26512. --- Application Phase ---
  26513. --- Firing Productions (PE) For State At Depth 1 ---
  26514. --- Inner Elaboration Phase, active level 1 (S1) ---
  26515. Firing apply*operator
  26516. -->
  26517. (I3 ^predict-yes N1084 + :O )
  26518. Firing apply*operator*complete
  26519. -->
  26520. (I3 ^predict-yes N1083 - :O )
  26521. inner elaboration loop at bottom goal.
  26522. --- Change Working Memory (PE) ---
  26523. =>WM: (15276: I3 ^predict-yes N1084)
  26524. <=WM: (15262: N1083 ^status complete)
  26525. <=WM: (15261: I3 ^predict-yes N1083)
  26526. --- Firing Productions (IE) For State At Depth 1 ---
  26527. --- Inner Elaboration Phase, active level 1 (S1) ---
  26528. Firing monitor*world
  26529. -->
  26530. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26531. --- Change Working Memory (IE) ---
  26532. --- END Application Phase ---
  26533. --- Output Phase ---
  26534. ENV: Agent did: predict-yes for direction R in state State-A
  26535. In State-A moving R
  26536. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  26537. predict error 0
  26538. dir: dir isU
  26539. --- END Output Phase ---
  26540. \-/--- Input Phase ---
  26541. =>WM: (15280: I2 ^dir U)
  26542. =>WM: (15279: I2 ^reward 1)
  26543. =>WM: (15278: I2 ^see 1)
  26544. =>WM: (15277: N1084 ^status complete)
  26545. <=WM: (15265: I2 ^dir R)
  26546. <=WM: (15264: I2 ^reward 1)
  26547. <=WM: (15263: I2 ^see 1)
  26548. =>WM: (15281: I2 ^level-1 R1-root)
  26549. <=WM: (15266: I2 ^level-1 L1-root)
  26550. --- END Input Phase ---
  26551. --- Proposal Phase ---
  26552. --- Inner Elaboration Phase, active level 1 (S1) ---
  26553. Firing elaborate*copy-see-to-output-link
  26554. -->
  26555. (I3 ^see 1 +)
  26556. Firing elaborate*reward*based*on*reward
  26557. -->
  26558. (R1088 ^value 1 +)
  26559. (R1 ^reward R1088 +)
  26560. Firing propose*predict-yes
  26561. -->
  26562. (O2169 ^name predict-yes +)
  26563. (S1 ^operator O2169 +)
  26564. Firing propose*predict-no
  26565. -->
  26566. (O2170 ^name predict-no +)
  26567. (S1 ^operator O2170 +)
  26568. Firing rl*prefer*rvt*predict-no*H0*2
  26569. -->
  26570. (S1 ^operator O2168 = 1.)
  26571. Firing rl*prefer*rvt*predict-yes*H0*1
  26572. -->
  26573. (S1 ^operator O2167 = 0.)
  26574. Firing prefer*rvt*predict-yes*H0
  26575. -->
  26576. Firing prefer*rvt*predict-no*H0
  26577. -->
  26578. Firing elaborate*copy-dir-to-output-link
  26579. -->
  26580. (I3 ^dir U +)
  26581. inner elaboration loop at bottom goal.
  26582. Retracting elaborate*copy-see-to-output-link
  26583. -->
  26584. (I3 ^see 1 +)
  26585. Retracting propose*predict-no
  26586. -->
  26587. (O2168 ^name predict-no +)
  26588. (S1 ^operator O2168 +)
  26589. Retracting propose*predict-yes
  26590. -->
  26591. (O2167 ^name predict-yes +)
  26592. (S1 ^operator O2167 +)
  26593. Retracting elaborate*reward*based*on*reward
  26594. -->
  26595. (R1087 ^value 1 +)
  26596. (R1 ^reward R1087 +)
  26597. Retracting elaborate*copy-dir-to-output-link
  26598. -->
  26599. (I3 ^dir R +)
  26600. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  26601. -->
  26602. (S1 ^operator O2168 = -0.2714224023553999)
  26603. Retracting rl*prefer*rvt*predict-no*H0*4
  26604. -->
  26605. (S1 ^operator O2168 = 0.339790578216807)
  26606. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  26607. -->
  26608. (S1 ^operator O2167 = 0.6622440710216861)
  26609. Retracting rl*prefer*rvt*predict-yes*H0*3
  26610. -->
  26611. (S1 ^operator O2167 = 0.3377050722714159)
  26612. =>WM: (15288: S1 ^operator O2170 +)
  26613. =>WM: (15287: S1 ^operator O2169 +)
  26614. =>WM: (15286: I3 ^dir U)
  26615. =>WM: (15285: O2170 ^name predict-no)
  26616. =>WM: (15284: O2169 ^name predict-yes)
  26617. =>WM: (15283: R1088 ^value 1)
  26618. =>WM: (15282: R1 ^reward R1088)
  26619. <=WM: (15273: S1 ^operator O2167 +)
  26620. <=WM: (15275: S1 ^operator O2167)
  26621. <=WM: (15274: S1 ^operator O2168 +)
  26622. <=WM: (15272: I3 ^dir R)
  26623. <=WM: (15268: R1 ^reward R1087)
  26624. <=WM: (15271: O2168 ^name predict-no)
  26625. <=WM: (15270: O2167 ^name predict-yes)
  26626. <=WM: (15269: R1087 ^value 1)
  26627. --- Inner Elaboration Phase, active level 1 (S1) ---
  26628. Firing prefer*rvt*predict-yes*H0
  26629. -->
  26630. Firing rl*prefer*rvt*predict-yes*H0*1
  26631. -->
  26632. (S1 ^operator O2169 = 0.)
  26633. Firing prefer*rvt*predict-no*H0
  26634. -->
  26635. Firing rl*prefer*rvt*predict-no*H0*2
  26636. -->
  26637. (S1 ^operator O2170 = 1.)
  26638. inner elaboration loop at bottom goal.
  26639. Retracting rl*prefer*rvt*predict-no*H0*2
  26640. -->
  26641. (S1 ^operator O2168 = 1.)
  26642. Retracting rl*prefer*rvt*predict-yes*H0*1
  26643. -->
  26644. (S1 ^operator O2167 = 0.)
  26645. --- END Proposal Phase ---
  26646. --- Decision Phase ---
  26647. RL update rl*prefer*rvt*predict-yes*H0*3 0.590104 -0.252399 0.337705 -> 0.590109 -0.2524 0.337709(R,m,v=1,0.905028,0.0864353)
  26648. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409837 0.252407 0.662244 -> 0.409843 0.252406 0.662249(R,m,v=1,1,0)
  26649. =>WM: (15289: S1 ^operator O2170)
  26650. 1085: O: O2170 (predict-no)
  26651. --- END Decision Phase ---
  26652. --- Application Phase ---
  26653. --- Firing Productions (PE) For State At Depth 1 ---
  26654. --- Inner Elaboration Phase, active level 1 (S1) ---
  26655. Firing apply*operator
  26656. -->
  26657. (I3 ^predict-no N1085 + :O )
  26658. Firing apply*operator*complete
  26659. -->
  26660. (I3 ^predict-yes N1084 - :O )
  26661. inner elaboration loop at bottom goal.
  26662. --- Change Working Memory (PE) ---
  26663. =>WM: (15290: I3 ^predict-no N1085)
  26664. <=WM: (15277: N1084 ^status complete)
  26665. <=WM: (15276: I3 ^predict-yes N1084)
  26666. --- Firing Productions (IE) For State At Depth 1 ---
  26667. --- Inner Elaboration Phase, active level 1 (S1) ---
  26668. Firing monitor*world
  26669. -->
  26670. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26671. --- Change Working Memory (IE) ---
  26672. --- END Application Phase ---
  26673. --- Output Phase ---
  26674. ENV: Agent did: predict-no for direction U in state State-B
  26675. In State-B moving U
  26676. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26677. predict error 0
  26678. dir: dir isL
  26679. --- END Output Phase ---
  26680. |\---- Input Phase ---
  26681. =>WM: (15294: I2 ^dir L)
  26682. =>WM: (15293: I2 ^reward 1)
  26683. =>WM: (15292: I2 ^see 0)
  26684. =>WM: (15291: N1085 ^status complete)
  26685. <=WM: (15280: I2 ^dir U)
  26686. <=WM: (15279: I2 ^reward 1)
  26687. <=WM: (15278: I2 ^see 1)
  26688. =>WM: (15295: I2 ^level-1 R1-root)
  26689. <=WM: (15281: I2 ^level-1 R1-root)
  26690. --- END Input Phase ---
  26691. --- Proposal Phase ---
  26692. --- Inner Elaboration Phase, active level 1 (S1) ---
  26693. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26694. -->
  26695. (S1 ^operator O2169 = 0.7361475896128331)
  26696. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26697. -->
  26698. Firing elaborate*copy-see-to-output-link
  26699. -->
  26700. (I3 ^see 0 +)
  26701. Firing elaborate*reward*based*on*reward
  26702. -->
  26703. (R1089 ^value 1 +)
  26704. (R1 ^reward R1089 +)
  26705. Firing propose*predict-yes
  26706. -->
  26707. (O2171 ^name predict-yes +)
  26708. (S1 ^operator O2171 +)
  26709. Firing propose*predict-no
  26710. -->
  26711. (O2172 ^name predict-no +)
  26712. (S1 ^operator O2172 +)
  26713. Firing rl*prefer*rvt*predict-no*H0*6
  26714. -->
  26715. (S1 ^operator O2170 = 0.9726708564453506)
  26716. Firing rl*prefer*rvt*predict-yes*H0*5
  26717. -->
  26718. (S1 ^operator O2169 = 0.2640080254436565)
  26719. Firing prefer*rvt*predict-yes*H0
  26720. -->
  26721. Firing prefer*rvt*predict-no*H0
  26722. -->
  26723. Firing elaborate*copy-dir-to-output-link
  26724. -->
  26725. (I3 ^dir L +)
  26726. inner elaboration loop at bottom goal.
  26727. Retracting elaborate*copy-see-to-output-link
  26728. -->
  26729. (I3 ^see 1 +)
  26730. Retracting propose*predict-no
  26731. -->
  26732. (O2170 ^name predict-no +)
  26733. (S1 ^operator O2170 +)
  26734. Retracting propose*predict-yes
  26735. -->
  26736. (O2169 ^name predict-yes +)
  26737. (S1 ^operator O2169 +)
  26738. Retracting elaborate*reward*based*on*reward
  26739. -->
  26740. (R1088 ^value 1 +)
  26741. (R1 ^reward R1088 +)
  26742. Retracting elaborate*copy-dir-to-output-link
  26743. -->
  26744. (I3 ^dir U +)
  26745. Retracting rl*prefer*rvt*predict-no*H0*2
  26746. -->
  26747. (S1 ^operator O2170 = 1.)
  26748. Retracting rl*prefer*rvt*predict-yes*H0*1
  26749. -->
  26750. (S1 ^operator O2169 = 0.)
  26751. =>WM: (15303: S1 ^operator O2172 +)
  26752. =>WM: (15302: S1 ^operator O2171 +)
  26753. =>WM: (15301: I3 ^dir L)
  26754. =>WM: (15300: O2172 ^name predict-no)
  26755. =>WM: (15299: O2171 ^name predict-yes)
  26756. =>WM: (15298: R1089 ^value 1)
  26757. =>WM: (15297: R1 ^reward R1089)
  26758. =>WM: (15296: I3 ^see 0)
  26759. <=WM: (15287: S1 ^operator O2169 +)
  26760. <=WM: (15288: S1 ^operator O2170 +)
  26761. <=WM: (15289: S1 ^operator O2170)
  26762. <=WM: (15286: I3 ^dir U)
  26763. <=WM: (15282: R1 ^reward R1088)
  26764. <=WM: (15267: I3 ^see 1)
  26765. <=WM: (15285: O2170 ^name predict-no)
  26766. <=WM: (15284: O2169 ^name predict-yes)
  26767. <=WM: (15283: R1088 ^value 1)
  26768. --- Inner Elaboration Phase, active level 1 (S1) ---
  26769. Firing prefer*rvt*predict-yes*H0
  26770. -->
  26771. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26772. -->
  26773. (S1 ^operator O2171 = 0.7361475896128331)
  26774. Firing rl*prefer*rvt*predict-yes*H0*5
  26775. -->
  26776. (S1 ^operator O2171 = 0.2640080254436565)
  26777. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26778. -->
  26779. Firing prefer*rvt*predict-no*H0
  26780. -->
  26781. Firing rl*prefer*rvt*predict-no*H0*6
  26782. -->
  26783. (S1 ^operator O2172 = 0.9726708564453506)
  26784. inner elaboration loop at bottom goal.
  26785. Retracting rl*prefer*rvt*predict-no*H0*6
  26786. -->
  26787. (S1 ^operator O2170 = 0.9726708564453506)
  26788. Retracting rl*prefer*rvt*predict-yes*H0*5
  26789. -->
  26790. (S1 ^operator O2169 = 0.2640080254436565)
  26791. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26792. -->
  26793. (S1 ^operator O2169 = 0.7361475896128331)
  26794. --- END Proposal Phase ---
  26795. --- Decision Phase ---
  26796. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  26797. =>WM: (15304: S1 ^operator O2171)
  26798. 1086: O: O2171 (predict-yes)
  26799. --- END Decision Phase ---
  26800. --- Application Phase ---
  26801. --- Firing Productions (PE) For State At Depth 1 ---
  26802. --- Inner Elaboration Phase, active level 1 (S1) ---
  26803. Firing apply*operator
  26804. -->
  26805. (I3 ^predict-yes N1086 + :O )
  26806. Firing apply*operator*complete
  26807. -->
  26808. (I3 ^predict-no N1085 - :O )
  26809. inner elaboration loop at bottom goal.
  26810. --- Change Working Memory (PE) ---
  26811. =>WM: (15305: I3 ^predict-yes N1086)
  26812. <=WM: (15291: N1085 ^status complete)
  26813. <=WM: (15290: I3 ^predict-no N1085)
  26814. --- Firing Productions (IE) For State At Depth 1 ---
  26815. --- Inner Elaboration Phase, active level 1 (S1) ---
  26816. Firing monitor*world
  26817. -->
  26818. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26819. --- Change Working Memory (IE) ---
  26820. --- END Application Phase ---
  26821. --- Output Phase ---
  26822. ENV: Agent did: predict-yes for direction L in state State-B
  26823. In State-B moving L
  26824. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26825. predict error 0
  26826. dir: dir isU
  26827. --- END Output Phase ---
  26828. /|\--- Input Phase ---
  26829. =>WM: (15309: I2 ^dir U)
  26830. =>WM: (15308: I2 ^reward 1)
  26831. =>WM: (15307: I2 ^see 1)
  26832. =>WM: (15306: N1086 ^status complete)
  26833. <=WM: (15294: I2 ^dir L)
  26834. <=WM: (15293: I2 ^reward 1)
  26835. <=WM: (15292: I2 ^see 0)
  26836. =>WM: (15310: I2 ^level-1 L1-root)
  26837. <=WM: (15295: I2 ^level-1 R1-root)
  26838. --- END Input Phase ---
  26839. --- Proposal Phase ---
  26840. --- Inner Elaboration Phase, active level 1 (S1) ---
  26841. Firing elaborate*copy-see-to-output-link
  26842. -->
  26843. (I3 ^see 1 +)
  26844. Firing elaborate*reward*based*on*reward
  26845. -->
  26846. (R1090 ^value 1 +)
  26847. (R1 ^reward R1090 +)
  26848. Firing propose*predict-yes
  26849. -->
  26850. (O2173 ^name predict-yes +)
  26851. (S1 ^operator O2173 +)
  26852. Firing propose*predict-no
  26853. -->
  26854. (O2174 ^name predict-no +)
  26855. (S1 ^operator O2174 +)
  26856. Firing rl*prefer*rvt*predict-no*H0*2
  26857. -->
  26858. (S1 ^operator O2172 = 1.)
  26859. Firing rl*prefer*rvt*predict-yes*H0*1
  26860. -->
  26861. (S1 ^operator O2171 = 0.)
  26862. Firing prefer*rvt*predict-yes*H0
  26863. -->
  26864. Firing prefer*rvt*predict-no*H0
  26865. -->
  26866. Firing elaborate*copy-dir-to-output-link
  26867. -->
  26868. (I3 ^dir U +)
  26869. inner elaboration loop at bottom goal.
  26870. Retracting elaborate*copy-see-to-output-link
  26871. -->
  26872. (I3 ^see 0 +)
  26873. Retracting propose*predict-no
  26874. -->
  26875. (O2172 ^name predict-no +)
  26876. (S1 ^operator O2172 +)
  26877. Retracting propose*predict-yes
  26878. -->
  26879. (O2171 ^name predict-yes +)
  26880. (S1 ^operator O2171 +)
  26881. Retracting elaborate*reward*based*on*reward
  26882. -->
  26883. (R1089 ^value 1 +)
  26884. (R1 ^reward R1089 +)
  26885. Retracting elaborate*copy-dir-to-output-link
  26886. -->
  26887. (I3 ^dir L +)
  26888. Retracting rl*prefer*rvt*predict-no*H0*6
  26889. -->
  26890. (S1 ^operator O2172 = 0.9726708564453506)
  26891. Retracting rl*prefer*rvt*predict-yes*H0*5
  26892. -->
  26893. (S1 ^operator O2171 = 0.2640080254436565)
  26894. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  26895. -->
  26896. (S1 ^operator O2171 = 0.7361475896128331)
  26897. =>WM: (15318: S1 ^operator O2174 +)
  26898. =>WM: (15317: S1 ^operator O2173 +)
  26899. =>WM: (15316: I3 ^dir U)
  26900. =>WM: (15315: O2174 ^name predict-no)
  26901. =>WM: (15314: O2173 ^name predict-yes)
  26902. =>WM: (15313: R1090 ^value 1)
  26903. =>WM: (15312: R1 ^reward R1090)
  26904. =>WM: (15311: I3 ^see 1)
  26905. <=WM: (15302: S1 ^operator O2171 +)
  26906. <=WM: (15304: S1 ^operator O2171)
  26907. <=WM: (15303: S1 ^operator O2172 +)
  26908. <=WM: (15301: I3 ^dir L)
  26909. <=WM: (15297: R1 ^reward R1089)
  26910. <=WM: (15296: I3 ^see 0)
  26911. <=WM: (15300: O2172 ^name predict-no)
  26912. <=WM: (15299: O2171 ^name predict-yes)
  26913. <=WM: (15298: R1089 ^value 1)
  26914. --- Inner Elaboration Phase, active level 1 (S1) ---
  26915. Firing prefer*rvt*predict-yes*H0
  26916. -->
  26917. Firing rl*prefer*rvt*predict-yes*H0*1
  26918. -->
  26919. (S1 ^operator O2173 = 0.)
  26920. Firing prefer*rvt*predict-no*H0
  26921. -->
  26922. Firing rl*prefer*rvt*predict-no*H0*2
  26923. -->
  26924. (S1 ^operator O2174 = 1.)
  26925. inner elaboration loop at bottom goal.
  26926. Retracting rl*prefer*rvt*predict-no*H0*2
  26927. -->
  26928. (S1 ^operator O2172 = 1.)
  26929. Retracting rl*prefer*rvt*predict-yes*H0*1
  26930. -->
  26931. (S1 ^operator O2171 = 0.)
  26932. --- END Proposal Phase ---
  26933. --- Decision Phase ---
  26934. RL update rl*prefer*rvt*predict-yes*H0*5 0.554394 -0.290386 0.264008 -> 0.554382 -0.290386 0.263996(R,m,v=1,0.884211,0.102924)
  26935. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445759 0.290389 0.736148 -> 0.445745 0.290388 0.736133(R,m,v=1,1,0)
  26936. =>WM: (15319: S1 ^operator O2174)
  26937. 1087: O: O2174 (predict-no)
  26938. --- END Decision Phase ---
  26939. --- Application Phase ---
  26940. --- Firing Productions (PE) For State At Depth 1 ---
  26941. --- Inner Elaboration Phase, active level 1 (S1) ---
  26942. Firing apply*operator
  26943. -->
  26944. (I3 ^predict-no N1087 + :O )
  26945. Firing apply*operator*complete
  26946. -->
  26947. (I3 ^predict-yes N1086 - :O )
  26948. inner elaboration loop at bottom goal.
  26949. --- Change Working Memory (PE) ---
  26950. =>WM: (15320: I3 ^predict-no N1087)
  26951. <=WM: (15306: N1086 ^status complete)
  26952. <=WM: (15305: I3 ^predict-yes N1086)
  26953. --- Firing Productions (IE) For State At Depth 1 ---
  26954. --- Inner Elaboration Phase, active level 1 (S1) ---
  26955. Firing monitor*world
  26956. -->
  26957. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26958. --- Change Working Memory (IE) ---
  26959. --- END Application Phase ---
  26960. --- Output Phase ---
  26961. ENV: Agent did: predict-no for direction U in state State-A
  26962. In State-A moving U
  26963. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  26964. predict error 0
  26965. dir: dir isL
  26966. --- END Output Phase ---
  26967. ---- Input Phase ---
  26968. =>WM: (15324: I2 ^dir L)
  26969. =>WM: (15323: I2 ^reward 1)
  26970. =>WM: (15322: I2 ^see 0)
  26971. =>WM: (15321: N1087 ^status complete)
  26972. <=WM: (15309: I2 ^dir U)
  26973. <=WM: (15308: I2 ^reward 1)
  26974. <=WM: (15307: I2 ^see 1)
  26975. =>WM: (15325: I2 ^level-1 L1-root)
  26976. <=WM: (15310: I2 ^level-1 L1-root)
  26977. --- END Input Phase ---
  26978. --- Proposal Phase ---
  26979. --- Inner Elaboration Phase, active level 1 (S1) ---
  26980. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  26981. -->
  26982. (S1 ^operator O2173 = -0.181727099742844)
  26983. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  26984. -->
  26985. Firing elaborate*copy-see-to-output-link
  26986. -->
  26987. (I3 ^see 0 +)
  26988. Firing elaborate*reward*based*on*reward
  26989. -->
  26990. (R1091 ^value 1 +)
  26991. (R1 ^reward R1091 +)
  26992. Firing propose*predict-yes
  26993. -->
  26994. (O2175 ^name predict-yes +)
  26995. (S1 ^operator O2175 +)
  26996. Firing propose*predict-no
  26997. -->
  26998. (O2176 ^name predict-no +)
  26999. (S1 ^operator O2176 +)
  27000. Firing rl*prefer*rvt*predict-no*H0*6
  27001. -->
  27002. (S1 ^operator O2174 = 0.9726708564453506)
  27003. Firing rl*prefer*rvt*predict-yes*H0*5
  27004. -->
  27005. (S1 ^operator O2173 = 0.2639955703086441)
  27006. Firing prefer*rvt*predict-yes*H0
  27007. -->
  27008. Firing prefer*rvt*predict-no*H0
  27009. -->
  27010. Firing elaborate*copy-dir-to-output-link
  27011. -->
  27012. (I3 ^dir L +)
  27013. inner elaboration loop at bottom goal.
  27014. Retracting elaborate*copy-see-to-output-link
  27015. -->
  27016. (I3 ^see 1 +)
  27017. Retracting propose*predict-no
  27018. -->
  27019. (O2174 ^name predict-no +)
  27020. (S1 ^operator O2174 +)
  27021. Retracting propose*predict-yes
  27022. -->
  27023. (O2173 ^name predict-yes +)
  27024. (S1 ^operator O2173 +)
  27025. Retracting elaborate*reward*based*on*reward
  27026. -->
  27027. (R1090 ^value 1 +)
  27028. (R1 ^reward R1090 +)
  27029. Retracting elaborate*copy-dir-to-output-link
  27030. -->
  27031. (I3 ^dir U +)
  27032. Retracting rl*prefer*rvt*predict-no*H0*2
  27033. -->
  27034. (S1 ^operator O2174 = 1.)
  27035. Retracting rl*prefer*rvt*predict-yes*H0*1
  27036. -->
  27037. (S1 ^operator O2173 = 0.)
  27038. =>WM: (15333: S1 ^operator O2176 +)
  27039. =>WM: (15332: S1 ^operator O2175 +)
  27040. =>WM: (15331: I3 ^dir L)
  27041. =>WM: (15330: O2176 ^name predict-no)
  27042. =>WM: (15329: O2175 ^name predict-yes)
  27043. =>WM: (15328: R1091 ^value 1)
  27044. =>WM: (15327: R1 ^reward R1091)
  27045. =>WM: (15326: I3 ^see 0)
  27046. <=WM: (15317: S1 ^operator O2173 +)
  27047. <=WM: (15318: S1 ^operator O2174 +)
  27048. <=WM: (15319: S1 ^operator O2174)
  27049. <=WM: (15316: I3 ^dir U)
  27050. <=WM: (15312: R1 ^reward R1090)
  27051. <=WM: (15311: I3 ^see 1)
  27052. <=WM: (15315: O2174 ^name predict-no)
  27053. <=WM: (15314: O2173 ^name predict-yes)
  27054. <=WM: (15313: R1090 ^value 1)
  27055. --- Inner Elaboration Phase, active level 1 (S1) ---
  27056. Firing prefer*rvt*predict-yes*H0
  27057. -->
  27058. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27059. -->
  27060. (S1 ^operator O2175 = -0.181727099742844)
  27061. Firing rl*prefer*rvt*predict-yes*H0*5
  27062. -->
  27063. (S1 ^operator O2175 = 0.2639955703086441)
  27064. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27065. -->
  27066. Firing prefer*rvt*predict-no*H0
  27067. -->
  27068. Firing rl*prefer*rvt*predict-no*H0*6
  27069. -->
  27070. (S1 ^operator O2176 = 0.9726708564453506)
  27071. inner elaboration loop at bottom goal.
  27072. Retracting rl*prefer*rvt*predict-no*H0*6
  27073. -->
  27074. (S1 ^operator O2174 = 0.9726708564453506)
  27075. Retracting rl*prefer*rvt*predict-yes*H0*5
  27076. -->
  27077. (S1 ^operator O2173 = 0.2639955703086441)
  27078. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27079. -->
  27080. (S1 ^operator O2173 = -0.181727099742844)
  27081. --- END Proposal Phase ---
  27082. --- Decision Phase ---
  27083. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27084. =>WM: (15334: S1 ^operator O2176)
  27085. 1088: O: O2176 (predict-no)
  27086. --- END Decision Phase ---
  27087. --- Application Phase ---
  27088. --- Firing Productions (PE) For State At Depth 1 ---
  27089. --- Inner Elaboration Phase, active level 1 (S1) ---
  27090. Firing apply*operator
  27091. -->
  27092. (I3 ^predict-no N1088 + :O )
  27093. Firing apply*operator*complete
  27094. -->
  27095. (I3 ^predict-no N1087 - :O )
  27096. inner elaboration loop at bottom goal.
  27097. --- Change Working Memory (PE) ---
  27098. =>WM: (15335: I3 ^predict-no N1088)
  27099. <=WM: (15321: N1087 ^status complete)
  27100. <=WM: (15320: I3 ^predict-no N1087)
  27101. --- Firing Productions (IE) For State At Depth 1 ---
  27102. --- Inner Elaboration Phase, active level 1 (S1) ---
  27103. Firing monitor*world
  27104. -->
  27105. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27106. --- Change Working Memory (IE) ---
  27107. --- END Application Phase ---
  27108. --- Output Phase ---
  27109. ENV: Agent did: predict-no for direction L in state State-A
  27110. In State-A moving L
  27111. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27112. predict error 0
  27113. dir: dir isL
  27114. --- END Output Phase ---
  27115. /|\--- Input Phase ---
  27116. =>WM: (15339: I2 ^dir L)
  27117. =>WM: (15338: I2 ^reward 1)
  27118. =>WM: (15337: I2 ^see 0)
  27119. =>WM: (15336: N1088 ^status complete)
  27120. <=WM: (15324: I2 ^dir L)
  27121. <=WM: (15323: I2 ^reward 1)
  27122. <=WM: (15322: I2 ^see 0)
  27123. =>WM: (15340: I2 ^level-1 L0-root)
  27124. <=WM: (15325: I2 ^level-1 L1-root)
  27125. --- END Input Phase ---
  27126. --- Proposal Phase ---
  27127. --- Inner Elaboration Phase, active level 1 (S1) ---
  27128. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  27129. -->
  27130. (S1 ^operator O2175 = -0.1386470047172653)
  27131. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27132. -->
  27133. Firing elaborate*copy-see-to-output-link
  27134. -->
  27135. (I3 ^see 0 +)
  27136. Firing elaborate*reward*based*on*reward
  27137. -->
  27138. (R1092 ^value 1 +)
  27139. (R1 ^reward R1092 +)
  27140. Firing propose*predict-yes
  27141. -->
  27142. (O2177 ^name predict-yes +)
  27143. (S1 ^operator O2177 +)
  27144. Firing propose*predict-no
  27145. -->
  27146. (O2178 ^name predict-no +)
  27147. (S1 ^operator O2178 +)
  27148. Firing rl*prefer*rvt*predict-no*H0*6
  27149. -->
  27150. (S1 ^operator O2176 = 0.9726708564453506)
  27151. Firing rl*prefer*rvt*predict-yes*H0*5
  27152. -->
  27153. (S1 ^operator O2175 = 0.2639955703086441)
  27154. Firing prefer*rvt*predict-yes*H0
  27155. -->
  27156. Firing prefer*rvt*predict-no*H0
  27157. -->
  27158. Firing elaborate*copy-dir-to-output-link
  27159. -->
  27160. (I3 ^dir L +)
  27161. inner elaboration loop at bottom goal.
  27162. Retracting elaborate*copy-see-to-output-link
  27163. -->
  27164. (I3 ^see 0 +)
  27165. Retracting propose*predict-no
  27166. -->
  27167. (O2176 ^name predict-no +)
  27168. (S1 ^operator O2176 +)
  27169. Retracting propose*predict-yes
  27170. -->
  27171. (O2175 ^name predict-yes +)
  27172. (S1 ^operator O2175 +)
  27173. Retracting elaborate*reward*based*on*reward
  27174. -->
  27175. (R1091 ^value 1 +)
  27176. (R1 ^reward R1091 +)
  27177. Retracting elaborate*copy-dir-to-output-link
  27178. -->
  27179. (I3 ^dir L +)
  27180. Retracting rl*prefer*rvt*predict-no*H0*6
  27181. -->
  27182. (S1 ^operator O2176 = 0.9726708564453506)
  27183. Retracting rl*prefer*rvt*predict-yes*H0*5
  27184. -->
  27185. (S1 ^operator O2175 = 0.2639955703086441)
  27186. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  27187. -->
  27188. (S1 ^operator O2175 = -0.181727099742844)
  27189. =>WM: (15346: S1 ^operator O2178 +)
  27190. =>WM: (15345: S1 ^operator O2177 +)
  27191. =>WM: (15344: O2178 ^name predict-no)
  27192. =>WM: (15343: O2177 ^name predict-yes)
  27193. =>WM: (15342: R1092 ^value 1)
  27194. =>WM: (15341: R1 ^reward R1092)
  27195. <=WM: (15332: S1 ^operator O2175 +)
  27196. <=WM: (15333: S1 ^operator O2176 +)
  27197. <=WM: (15334: S1 ^operator O2176)
  27198. <=WM: (15327: R1 ^reward R1091)
  27199. <=WM: (15330: O2176 ^name predict-no)
  27200. <=WM: (15329: O2175 ^name predict-yes)
  27201. <=WM: (15328: R1091 ^value 1)
  27202. --- Inner Elaboration Phase, active level 1 (S1) ---
  27203. Firing prefer*rvt*predict-yes*H0
  27204. -->
  27205. Firing rl*prefer*rvt*predict-yes*H0*5
  27206. -->
  27207. (S1 ^operator O2177 = 0.2639955703086441)
  27208. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27209. -->
  27210. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  27211. -->
  27212. (S1 ^operator O2177 = -0.1386470047172653)
  27213. Firing prefer*rvt*predict-no*H0
  27214. -->
  27215. Firing rl*prefer*rvt*predict-no*H0*6
  27216. -->
  27217. (S1 ^operator O2178 = 0.9726708564453506)
  27218. inner elaboration loop at bottom goal.
  27219. Retracting rl*prefer*rvt*predict-no*H0*6
  27220. -->
  27221. (S1 ^operator O2176 = 0.9726708564453506)
  27222. Retracting rl*prefer*rvt*predict-yes*H0*5
  27223. -->
  27224. (S1 ^operator O2175 = 0.2639955703086441)
  27225. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  27226. -->
  27227. (S1 ^operator O2175 = -0.1386470047172653)
  27228. --- END Proposal Phase ---
  27229. --- Decision Phase ---
  27230. RL update rl*prefer*rvt*predict-no*H0*6 0.972671 0 0.972671 -> 0.97716 0 0.97716(R,m,v=1,0.907407,0.0845411)
  27231. =>WM: (15347: S1 ^operator O2178)
  27232. 1089: O: O2178 (predict-no)
  27233. --- END Decision Phase ---
  27234. --- Application Phase ---
  27235. --- Firing Productions (PE) For State At Depth 1 ---
  27236. --- Inner Elaboration Phase, active level 1 (S1) ---
  27237. Firing apply*operator
  27238. -->
  27239. (I3 ^predict-no N1089 + :O )
  27240. Firing apply*operator*complete
  27241. -->
  27242. (I3 ^predict-no N1088 - :O )
  27243. inner elaboration loop at bottom goal.
  27244. --- Change Working Memory (PE) ---
  27245. =>WM: (15348: I3 ^predict-no N1089)
  27246. <=WM: (15336: N1088 ^status complete)
  27247. <=WM: (15335: I3 ^predict-no N1088)
  27248. --- Firing Productions (IE) For State At Depth 1 ---
  27249. --- Inner Elaboration Phase, active level 1 (S1) ---
  27250. Firing monitor*world
  27251. -->
  27252. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27253. --- Change Working Memory (IE) ---
  27254. --- END Application Phase ---
  27255. --- Output Phase ---
  27256. ENV: Agent did: predict-no for direction L in state State-A
  27257. In State-A moving L
  27258. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27259. predict error 0
  27260. dir: dir isR
  27261. --- END Output Phase ---
  27262. -/|--- Input Phase ---
  27263. =>WM: (15352: I2 ^dir R)
  27264. =>WM: (15351: I2 ^reward 1)
  27265. =>WM: (15350: I2 ^see 0)
  27266. =>WM: (15349: N1089 ^status complete)
  27267. <=WM: (15339: I2 ^dir L)
  27268. <=WM: (15338: I2 ^reward 1)
  27269. <=WM: (15337: I2 ^see 0)
  27270. =>WM: (15353: I2 ^level-1 L0-root)
  27271. <=WM: (15340: I2 ^level-1 L0-root)
  27272. --- END Input Phase ---
  27273. --- Proposal Phase ---
  27274. --- Inner Elaboration Phase, active level 1 (S1) ---
  27275. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27276. -->
  27277. (S1 ^operator O2178 = -0.2817060109291377)
  27278. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27279. -->
  27280. (S1 ^operator O2177 = 0.6623198172764229)
  27281. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27282. -->
  27283. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27284. -->
  27285. Firing elaborate*copy-see-to-output-link
  27286. -->
  27287. (I3 ^see 0 +)
  27288. Firing elaborate*reward*based*on*reward
  27289. -->
  27290. (R1093 ^value 1 +)
  27291. (R1 ^reward R1093 +)
  27292. Firing propose*predict-yes
  27293. -->
  27294. (O2179 ^name predict-yes +)
  27295. (S1 ^operator O2179 +)
  27296. Firing propose*predict-no
  27297. -->
  27298. (O2180 ^name predict-no +)
  27299. (S1 ^operator O2180 +)
  27300. Firing rl*prefer*rvt*predict-no*H0*4
  27301. -->
  27302. (S1 ^operator O2178 = 0.339790578216807)
  27303. Firing rl*prefer*rvt*predict-yes*H0*3
  27304. -->
  27305. (S1 ^operator O2177 = 0.3377091819801437)
  27306. Firing prefer*rvt*predict-yes*H0
  27307. -->
  27308. Firing prefer*rvt*predict-no*H0
  27309. -->
  27310. Firing elaborate*copy-dir-to-output-link
  27311. -->
  27312. (I3 ^dir R +)
  27313. inner elaboration loop at bottom goal.
  27314. Retracting elaborate*copy-see-to-output-link
  27315. -->
  27316. (I3 ^see 0 +)
  27317. Retracting propose*predict-no
  27318. -->
  27319. (O2178 ^name predict-no +)
  27320. (S1 ^operator O2178 +)
  27321. Retracting propose*predict-yes
  27322. -->
  27323. (O2177 ^name predict-yes +)
  27324. (S1 ^operator O2177 +)
  27325. Retracting elaborate*reward*based*on*reward
  27326. -->
  27327. (R1092 ^value 1 +)
  27328. (R1 ^reward R1092 +)
  27329. Retracting elaborate*copy-dir-to-output-link
  27330. -->
  27331. (I3 ^dir L +)
  27332. Retracting rl*prefer*rvt*predict-no*H0*6
  27333. -->
  27334. (S1 ^operator O2178 = 0.9771601724330878)
  27335. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  27336. -->
  27337. (S1 ^operator O2177 = -0.1386470047172653)
  27338. Retracting rl*prefer*rvt*predict-yes*H0*5
  27339. -->
  27340. (S1 ^operator O2177 = 0.2639955703086441)
  27341. =>WM: (15360: S1 ^operator O2180 +)
  27342. =>WM: (15359: S1 ^operator O2179 +)
  27343. =>WM: (15358: I3 ^dir R)
  27344. =>WM: (15357: O2180 ^name predict-no)
  27345. =>WM: (15356: O2179 ^name predict-yes)
  27346. =>WM: (15355: R1093 ^value 1)
  27347. =>WM: (15354: R1 ^reward R1093)
  27348. <=WM: (15345: S1 ^operator O2177 +)
  27349. <=WM: (15346: S1 ^operator O2178 +)
  27350. <=WM: (15347: S1 ^operator O2178)
  27351. <=WM: (15331: I3 ^dir L)
  27352. <=WM: (15341: R1 ^reward R1092)
  27353. <=WM: (15344: O2178 ^name predict-no)
  27354. <=WM: (15343: O2177 ^name predict-yes)
  27355. <=WM: (15342: R1092 ^value 1)
  27356. --- Inner Elaboration Phase, active level 1 (S1) ---
  27357. Firing prefer*rvt*predict-yes*H0
  27358. -->
  27359. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27360. -->
  27361. (S1 ^operator O2179 = 0.6623198172764229)
  27362. Firing rl*prefer*rvt*predict-yes*H0*3
  27363. -->
  27364. (S1 ^operator O2179 = 0.3377091819801437)
  27365. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27366. -->
  27367. Firing prefer*rvt*predict-no*H0
  27368. -->
  27369. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27370. -->
  27371. (S1 ^operator O2180 = -0.2817060109291377)
  27372. Firing rl*prefer*rvt*predict-no*H0*4
  27373. -->
  27374. (S1 ^operator O2180 = 0.339790578216807)
  27375. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27376. -->
  27377. inner elaboration loop at bottom goal.
  27378. Retracting rl*prefer*rvt*predict-no*H0*4
  27379. -->
  27380. (S1 ^operator O2178 = 0.339790578216807)
  27381. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27382. -->
  27383. (S1 ^operator O2178 = -0.2817060109291377)
  27384. Retracting rl*prefer*rvt*predict-yes*H0*3
  27385. -->
  27386. (S1 ^operator O2177 = 0.3377091819801437)
  27387. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27388. -->
  27389. (S1 ^operator O2177 = 0.6623198172764229)
  27390. --- END Proposal Phase ---
  27391. --- Decision Phase ---
  27392. RL update rl*prefer*rvt*predict-no*H0*6 0.97716 0 0.97716 -> 0.980908 0 0.980908(R,m,v=1,0.907975,0.0840718)
  27393. =>WM: (15361: S1 ^operator O2179)
  27394. 1090: O: O2179 (predict-yes)
  27395. --- END Decision Phase ---
  27396. --- Application Phase ---
  27397. --- Firing Productions (PE) For State At Depth 1 ---
  27398. --- Inner Elaboration Phase, active level 1 (S1) ---
  27399. Firing apply*operator
  27400. -->
  27401. (I3 ^predict-yes N1090 + :O )
  27402. Firing apply*operator*complete
  27403. -->
  27404. (I3 ^predict-no N1089 - :O )
  27405. inner elaboration loop at bottom goal.
  27406. --- Change Working Memory (PE) ---
  27407. =>WM: (15362: I3 ^predict-yes N1090)
  27408. <=WM: (15349: N1089 ^status complete)
  27409. <=WM: (15348: I3 ^predict-no N1089)
  27410. --- Firing Productions (IE) For State At Depth 1 ---
  27411. --- Inner Elaboration Phase, active level 1 (S1) ---
  27412. Firing monitor*world
  27413. -->
  27414. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  27415. --- Change Working Memory (IE) ---
  27416. --- END Application Phase ---
  27417. --- Output Phase ---
  27418. ENV: Agent did: predict-yes for direction R in state State-A
  27419. In State-A moving R
  27420. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  27421. predict error 0
  27422. dir: dir isR
  27423. --- END Output Phase ---
  27424. \-/--- Input Phase ---
  27425. =>WM: (15366: I2 ^dir R)
  27426. =>WM: (15365: I2 ^reward 1)
  27427. =>WM: (15364: I2 ^see 1)
  27428. =>WM: (15363: N1090 ^status complete)
  27429. <=WM: (15352: I2 ^dir R)
  27430. <=WM: (15351: I2 ^reward 1)
  27431. <=WM: (15350: I2 ^see 0)
  27432. =>WM: (15367: I2 ^level-1 R1-root)
  27433. <=WM: (15353: I2 ^level-1 L0-root)
  27434. --- END Input Phase ---
  27435. --- Proposal Phase ---
  27436. --- Inner Elaboration Phase, active level 1 (S1) ---
  27437. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  27438. -->
  27439. (S1 ^operator O2179 = -0.1070236389116304)
  27440. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  27441. -->
  27442. (S1 ^operator O2180 = 0.6602309079953435)
  27443. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27444. -->
  27445. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27446. -->
  27447. Firing elaborate*copy-see-to-output-link
  27448. -->
  27449. (I3 ^see 1 +)
  27450. Firing elaborate*reward*based*on*reward
  27451. -->
  27452. (R1094 ^value 1 +)
  27453. (R1 ^reward R1094 +)
  27454. Firing propose*predict-yes
  27455. -->
  27456. (O2181 ^name predict-yes +)
  27457. (S1 ^operator O2181 +)
  27458. Firing propose*predict-no
  27459. -->
  27460. (O2182 ^name predict-no +)
  27461. (S1 ^operator O2182 +)
  27462. Firing rl*prefer*rvt*predict-no*H0*4
  27463. -->
  27464. (S1 ^operator O2180 = 0.339790578216807)
  27465. Firing rl*prefer*rvt*predict-yes*H0*3
  27466. -->
  27467. (S1 ^operator O2179 = 0.3377091819801437)
  27468. Firing prefer*rvt*predict-yes*H0
  27469. -->
  27470. Firing prefer*rvt*predict-no*H0
  27471. -->
  27472. Firing elaborate*copy-dir-to-output-link
  27473. -->
  27474. (I3 ^dir R +)
  27475. inner elaboration loop at bottom goal.
  27476. Retracting elaborate*copy-see-to-output-link
  27477. -->
  27478. (I3 ^see 0 +)
  27479. Retracting propose*predict-no
  27480. -->
  27481. (O2180 ^name predict-no +)
  27482. (S1 ^operator O2180 +)
  27483. Retracting propose*predict-yes
  27484. -->
  27485. (O2179 ^name predict-yes +)
  27486. (S1 ^operator O2179 +)
  27487. Retracting elaborate*reward*based*on*reward
  27488. -->
  27489. (R1093 ^value 1 +)
  27490. (R1 ^reward R1093 +)
  27491. Retracting elaborate*copy-dir-to-output-link
  27492. -->
  27493. (I3 ^dir R +)
  27494. Retracting rl*prefer*rvt*predict-no*H0*4
  27495. -->
  27496. (S1 ^operator O2180 = 0.339790578216807)
  27497. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  27498. -->
  27499. (S1 ^operator O2180 = -0.2817060109291377)
  27500. Retracting rl*prefer*rvt*predict-yes*H0*3
  27501. -->
  27502. (S1 ^operator O2179 = 0.3377091819801437)
  27503. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  27504. -->
  27505. (S1 ^operator O2179 = 0.6623198172764229)
  27506. =>WM: (15374: S1 ^operator O2182 +)
  27507. =>WM: (15373: S1 ^operator O2181 +)
  27508. =>WM: (15372: O2182 ^name predict-no)
  27509. =>WM: (15371: O2181 ^name predict-yes)
  27510. =>WM: (15370: R1094 ^value 1)
  27511. =>WM: (15369: R1 ^reward R1094)
  27512. =>WM: (15368: I3 ^see 1)
  27513. <=WM: (15359: S1 ^operator O2179 +)
  27514. <=WM: (15361: S1 ^operator O2179)
  27515. <=WM: (15360: S1 ^operator O2180 +)
  27516. <=WM: (15354: R1 ^reward R1093)
  27517. <=WM: (15326: I3 ^see 0)
  27518. <=WM: (15357: O2180 ^name predict-no)
  27519. <=WM: (15356: O2179 ^name predict-yes)
  27520. <=WM: (15355: R1093 ^value 1)
  27521. --- Inner Elaboration Phase, active level 1 (S1) ---
  27522. Firing prefer*rvt*predict-yes*H0
  27523. -->
  27524. Firing rl*prefer*rvt*predict-yes*H0*3
  27525. -->
  27526. (S1 ^operator O2181 = 0.3377091819801437)
  27527. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27528. -->
  27529. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  27530. -->
  27531. (S1 ^operator O2181 = -0.1070236389116304)
  27532. Firing prefer*rvt*predict-no*H0
  27533. -->
  27534. Firing rl*prefer*rvt*predict-no*H0*4
  27535. -->
  27536. (S1 ^operator O2182 = 0.339790578216807)
  27537. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27538. -->
  27539. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  27540. -->
  27541. (S1 ^operator O2182 = 0.6602309079953435)
  27542. inner elaboration loop at bottom goal.
  27543. Retracting rl*prefer*rvt*predict-no*H0*4
  27544. -->
  27545. (S1 ^operator O2180 = 0.339790578216807)
  27546. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  27547. -->
  27548. (S1 ^operator O2180 = 0.6602309079953435)
  27549. Retracting rl*prefer*rvt*predict-yes*H0*3
  27550. -->
  27551. (S1 ^operator O2179 = 0.3377091819801437)
  27552. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  27553. -->
  27554. (S1 ^operator O2179 = -0.1070236389116304)
  27555. --- END Proposal Phase ---
  27556. --- Decision Phase ---
  27557. RL update rl*prefer*rvt*predict-yes*H0*3 0.590109 -0.2524 0.337709 -> 0.590106 -0.2524 0.337707(R,m,v=1,0.905556,0.0860025)
  27558. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409924 0.252395 0.66232 -> 0.409921 0.252396 0.662317(R,m,v=1,1,0)
  27559. =>WM: (15375: S1 ^operator O2182)
  27560. 1091: O: O2182 (predict-no)
  27561. --- END Decision Phase ---
  27562. --- Application Phase ---
  27563. --- Firing Productions (PE) For State At Depth 1 ---
  27564. --- Inner Elaboration Phase, active level 1 (S1) ---
  27565. Firing apply*operator
  27566. -->
  27567. (I3 ^predict-no N1091 + :O )
  27568. Firing apply*operator*complete
  27569. -->
  27570. (I3 ^predict-yes N1090 - :O )
  27571. inner elaboration loop at bottom goal.
  27572. --- Change Working Memory (PE) ---
  27573. =>WM: (15376: I3 ^predict-no N1091)
  27574. <=WM: (15363: N1090 ^status complete)
  27575. <=WM: (15362: I3 ^predict-yes N1090)
  27576. --- Firing Productions (IE) For State At Depth 1 ---
  27577. --- Inner Elaboration Phase, active level 1 (S1) ---
  27578. Firing monitor*world
  27579. -->
  27580. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27581. --- Change Working Memory (IE) ---
  27582. --- END Application Phase ---
  27583. --- Output Phase ---
  27584. ENV: Agent did: predict-no for direction R in state State-B
  27585. In State-B moving R
  27586. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  27587. predict error 0
  27588. dir: dir isL
  27589. --- END Output Phase ---
  27590. |--- Input Phase ---
  27591. =>WM: (15380: I2 ^dir L)
  27592. =>WM: (15379: I2 ^reward 1)
  27593. =>WM: (15378: I2 ^see 0)
  27594. =>WM: (15377: N1091 ^status complete)
  27595. <=WM: (15366: I2 ^dir R)
  27596. <=WM: (15365: I2 ^reward 1)
  27597. <=WM: (15364: I2 ^see 1)
  27598. =>WM: (15381: I2 ^level-1 R0-root)
  27599. <=WM: (15367: I2 ^level-1 R1-root)
  27600. --- END Input Phase ---
  27601. --- Proposal Phase ---
  27602. --- Inner Elaboration Phase, active level 1 (S1) ---
  27603. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  27604. -->
  27605. (S1 ^operator O2181 = 0.7359164516543863)
  27606. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27607. -->
  27608. Firing elaborate*copy-see-to-output-link
  27609. -->
  27610. (I3 ^see 0 +)
  27611. Firing elaborate*reward*based*on*reward
  27612. -->
  27613. (R1095 ^value 1 +)
  27614. (R1 ^reward R1095 +)
  27615. Firing propose*predict-yes
  27616. -->
  27617. (O2183 ^name predict-yes +)
  27618. (S1 ^operator O2183 +)
  27619. Firing propose*predict-no
  27620. -->
  27621. (O2184 ^name predict-no +)
  27622. (S1 ^operator O2184 +)
  27623. Firing rl*prefer*rvt*predict-no*H0*6
  27624. -->
  27625. (S1 ^operator O2182 = 0.9809082465769686)
  27626. Firing rl*prefer*rvt*predict-yes*H0*5
  27627. -->
  27628. (S1 ^operator O2181 = 0.2639955703086441)
  27629. Firing prefer*rvt*predict-yes*H0
  27630. -->
  27631. Firing prefer*rvt*predict-no*H0
  27632. -->
  27633. Firing elaborate*copy-dir-to-output-link
  27634. -->
  27635. (I3 ^dir L +)
  27636. inner elaboration loop at bottom goal.
  27637. Retracting elaborate*copy-see-to-output-link
  27638. -->
  27639. (I3 ^see 1 +)
  27640. Retracting propose*predict-no
  27641. -->
  27642. (O2182 ^name predict-no +)
  27643. (S1 ^operator O2182 +)
  27644. Retracting propose*predict-yes
  27645. -->
  27646. (O2181 ^name predict-yes +)
  27647. (S1 ^operator O2181 +)
  27648. Retracting elaborate*reward*based*on*reward
  27649. -->
  27650. (R1094 ^value 1 +)
  27651. (R1 ^reward R1094 +)
  27652. Retracting elaborate*copy-dir-to-output-link
  27653. -->
  27654. (I3 ^dir R +)
  27655. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  27656. -->
  27657. (S1 ^operator O2182 = 0.6602309079953435)
  27658. Retracting rl*prefer*rvt*predict-no*H0*4
  27659. -->
  27660. (S1 ^operator O2182 = 0.339790578216807)
  27661. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  27662. -->
  27663. (S1 ^operator O2181 = -0.1070236389116304)
  27664. Retracting rl*prefer*rvt*predict-yes*H0*3
  27665. -->
  27666. (S1 ^operator O2181 = 0.3377068406707124)
  27667. =>WM: (15389: S1 ^operator O2184 +)
  27668. =>WM: (15388: S1 ^operator O2183 +)
  27669. =>WM: (15387: I3 ^dir L)
  27670. =>WM: (15386: O2184 ^name predict-no)
  27671. =>WM: (15385: O2183 ^name predict-yes)
  27672. =>WM: (15384: R1095 ^value 1)
  27673. =>WM: (15383: R1 ^reward R1095)
  27674. =>WM: (15382: I3 ^see 0)
  27675. <=WM: (15373: S1 ^operator O2181 +)
  27676. <=WM: (15374: S1 ^operator O2182 +)
  27677. <=WM: (15375: S1 ^operator O2182)
  27678. <=WM: (15358: I3 ^dir R)
  27679. <=WM: (15369: R1 ^reward R1094)
  27680. <=WM: (15368: I3 ^see 1)
  27681. <=WM: (15372: O2182 ^name predict-no)
  27682. <=WM: (15371: O2181 ^name predict-yes)
  27683. <=WM: (15370: R1094 ^value 1)
  27684. --- Inner Elaboration Phase, active level 1 (S1) ---
  27685. Firing prefer*rvt*predict-yes*H0
  27686. -->
  27687. Firing rl*prefer*rvt*predict-yes*H0*5
  27688. -->
  27689. (S1 ^operator O2183 = 0.2639955703086441)
  27690. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  27691. -->
  27692. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  27693. -->
  27694. (S1 ^operator O2183 = 0.7359164516543863)
  27695. Firing prefer*rvt*predict-no*H0
  27696. -->
  27697. Firing rl*prefer*rvt*predict-no*H0*6
  27698. -->
  27699. (S1 ^operator O2184 = 0.9809082465769686)
  27700. inner elaboration loop at bottom goal.
  27701. Retracting rl*prefer*rvt*predict-no*H0*6
  27702. -->
  27703. (S1 ^operator O2182 = 0.9809082465769686)
  27704. Retracting rl*prefer*rvt*predict-yes*H0*5
  27705. -->
  27706. (S1 ^operator O2181 = 0.2639955703086441)
  27707. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  27708. -->
  27709. (S1 ^operator O2181 = 0.7359164516543863)
  27710. --- END Proposal Phase ---
  27711. --- Decision Phase ---
  27712. RL update rl*prefer*rvt*predict-no*H0*4 0.570275 -0.230484 0.339791 -> 0.570273 -0.230484 0.339789(R,m,v=1,0.886486,0.101175)
  27713. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429747 0.230483 0.660231 -> 0.429745 0.230484 0.660229(R,m,v=1,1,0)
  27714. =>WM: (15390: S1 ^operator O2183)
  27715. 1092: O: O2183 (predict-yes)
  27716. --- END Decision Phase ---
  27717. --- Application Phase ---
  27718. --- Firing Productions (PE) For State At Depth 1 ---
  27719. --- Inner Elaboration Phase, active level 1 (S1) ---
  27720. Firing apply*operator
  27721. -->
  27722. (I3 ^predict-yes N1092 + :O )
  27723. Firing apply*operator*complete
  27724. -->
  27725. (I3 ^predict-no N1091 - :O )
  27726. inner elaboration loop at bottom goal.
  27727. --- Change Working Memory (PE) ---
  27728. =>WM: (15391: I3 ^predict-yes N1092)
  27729. <=WM: (15377: N1091 ^status complete)
  27730. <=WM: (15376: I3 ^predict-no N1091)
  27731. --- Firing Productions (IE) For State At Depth 1 ---
  27732. --- Inner Elaboration Phase, active level 1 (S1) ---
  27733. Firing monitor*world
  27734. -->
  27735. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  27736. --- Change Working Memory (IE) ---
  27737. --- END Application Phase ---
  27738. --- Output Phase ---
  27739. ENV: Agent did: predict-yes for direction L in state State-B
  27740. In State-B moving L
  27741. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  27742. predict error 0
  27743. dir: dir isR
  27744. --- END Output Phase ---
  27745. \-/--- Input Phase ---
  27746. =>WM: (15395: I2 ^dir R)
  27747. =>WM: (15394: I2 ^reward 1)
  27748. =>WM: (15393: I2 ^see 1)
  27749. =>WM: (15392: N1092 ^status complete)
  27750. <=WM: (15380: I2 ^dir L)
  27751. <=WM: (15379: I2 ^reward 1)
  27752. <=WM: (15378: I2 ^see 0)
  27753. =>WM: (15396: I2 ^level-1 L1-root)
  27754. <=WM: (15381: I2 ^level-1 R0-root)
  27755. --- END Input Phase ---
  27756. --- Proposal Phase ---
  27757. --- Inner Elaboration Phase, active level 1 (S1) ---
  27758. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  27759. -->
  27760. (S1 ^operator O2184 = -0.2714224023553999)
  27761. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  27762. -->
  27763. (S1 ^operator O2183 = 0.6622488530452479)
  27764. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27765. -->
  27766. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27767. -->
  27768. Firing elaborate*copy-see-to-output-link
  27769. -->
  27770. (I3 ^see 1 +)
  27771. Firing elaborate*reward*based*on*reward
  27772. -->
  27773. (R1096 ^value 1 +)
  27774. (R1 ^reward R1096 +)
  27775. Firing propose*predict-yes
  27776. -->
  27777. (O2185 ^name predict-yes +)
  27778. (S1 ^operator O2185 +)
  27779. Firing propose*predict-no
  27780. -->
  27781. (O2186 ^name predict-no +)
  27782. (S1 ^operator O2186 +)
  27783. Firing rl*prefer*rvt*predict-no*H0*4
  27784. -->
  27785. (S1 ^operator O2184 = 0.3397888511281359)
  27786. Firing rl*prefer*rvt*predict-yes*H0*3
  27787. -->
  27788. (S1 ^operator O2183 = 0.3377068406707124)
  27789. Firing prefer*rvt*predict-yes*H0
  27790. -->
  27791. Firing prefer*rvt*predict-no*H0
  27792. -->
  27793. Firing elaborate*copy-dir-to-output-link
  27794. -->
  27795. (I3 ^dir R +)
  27796. inner elaboration loop at bottom goal.
  27797. Retracting elaborate*copy-see-to-output-link
  27798. -->
  27799. (I3 ^see 0 +)
  27800. Retracting propose*predict-no
  27801. -->
  27802. (O2184 ^name predict-no +)
  27803. (S1 ^operator O2184 +)
  27804. Retracting propose*predict-yes
  27805. -->
  27806. (O2183 ^name predict-yes +)
  27807. (S1 ^operator O2183 +)
  27808. Retracting elaborate*reward*based*on*reward
  27809. -->
  27810. (R1095 ^value 1 +)
  27811. (R1 ^reward R1095 +)
  27812. Retracting elaborate*copy-dir-to-output-link
  27813. -->
  27814. (I3 ^dir L +)
  27815. Retracting rl*prefer*rvt*predict-no*H0*6
  27816. -->
  27817. (S1 ^operator O2184 = 0.9809082465769686)
  27818. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  27819. -->
  27820. (S1 ^operator O2183 = 0.7359164516543863)
  27821. Retracting rl*prefer*rvt*predict-yes*H0*5
  27822. -->
  27823. (S1 ^operator O2183 = 0.2639955703086441)
  27824. =>WM: (15404: S1 ^operator O2186 +)
  27825. =>WM: (15403: S1 ^operator O2185 +)
  27826. =>WM: (15402: I3 ^dir R)
  27827. =>WM: (15401: O2186 ^name predict-no)
  27828. =>WM: (15400: O2185 ^name predict-yes)
  27829. =>WM: (15399: R1096 ^value 1)
  27830. =>WM: (15398: R1 ^reward R1096)
  27831. =>WM: (15397: I3 ^see 1)
  27832. <=WM: (15388: S1 ^operator O2183 +)
  27833. <=WM: (15390: S1 ^operator O2183)
  27834. <=WM: (15389: S1 ^operator O2184 +)
  27835. <=WM: (15387: I3 ^dir L)
  27836. <=WM: (15383: R1 ^reward R1095)
  27837. <=WM: (15382: I3 ^see 0)
  27838. <=WM: (15386: O2184 ^name predict-no)
  27839. <=WM: (15385: O2183 ^name predict-yes)
  27840. <=WM: (15384: R1095 ^value 1)
  27841. --- Inner Elaboration Phase, active level 1 (S1) ---
  27842. Firing prefer*rvt*predict-yes*H0
  27843. -->
  27844. Firing rl*prefer*rvt*predict-yes*H0*3
  27845. -->
  27846. (S1 ^operator O2185 = 0.3377068406707124)
  27847. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27848. -->
  27849. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  27850. -->
  27851. (S1 ^operator O2185 = 0.6622488530452479)
  27852. Firing prefer*rvt*predict-no*H0
  27853. -->
  27854. Firing rl*prefer*rvt*predict-no*H0*4
  27855. -->
  27856. (S1 ^operator O2186 = 0.3397888511281359)
  27857. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27858. -->
  27859. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  27860. -->
  27861. (S1 ^operator O2186 = -0.2714224023553999)
  27862. inner elaboration loop at bottom goal.
  27863. Retracting rl*prefer*rvt*predict-no*H0*4
  27864. -->
  27865. (S1 ^operator O2184 = 0.3397888511281359)
  27866. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  27867. -->
  27868. (S1 ^operator O2184 = -0.2714224023553999)
  27869. Retracting rl*prefer*rvt*predict-yes*H0*3
  27870. -->
  27871. (S1 ^operator O2183 = 0.3377068406707124)
  27872. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  27873. -->
  27874. (S1 ^operator O2183 = 0.6622488530452479)
  27875. --- END Proposal Phase ---
  27876. --- Decision Phase ---
  27877. RL update rl*prefer*rvt*predict-yes*H0*5 0.554382 -0.290386 0.263996 -> 0.554389 -0.290386 0.264003(R,m,v=1,0.884817,0.102452)
  27878. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445532 0.290385 0.735916 -> 0.44554 0.290385 0.735925(R,m,v=1,1,0)
  27879. =>WM: (15405: S1 ^operator O2185)
  27880. 1093: O: O2185 (predict-yes)
  27881. --- END Decision Phase ---
  27882. --- Application Phase ---
  27883. --- Firing Productions (PE) For State At Depth 1 ---
  27884. --- Inner Elaboration Phase, active level 1 (S1) ---
  27885. Firing apply*operator
  27886. -->
  27887. (I3 ^predict-yes N1093 + :O )
  27888. Firing apply*operator*complete
  27889. -->
  27890. (I3 ^predict-yes N1092 - :O )
  27891. inner elaboration loop at bottom goal.
  27892. --- Change Working Memory (PE) ---
  27893. =>WM: (15406: I3 ^predict-yes N1093)
  27894. <=WM: (15392: N1092 ^status complete)
  27895. <=WM: (15391: I3 ^predict-yes N1092)
  27896. --- Firing Productions (IE) For State At Depth 1 ---
  27897. --- Inner Elaboration Phase, active level 1 (S1) ---
  27898. Firing monitor*world
  27899. -->
  27900. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  27901. --- Change Working Memory (IE) ---
  27902. --- END Application Phase ---
  27903. --- Output Phase ---
  27904. ENV: Agent did: predict-yes for direction R in state State-A
  27905. In State-A moving R
  27906. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  27907. predict error 0
  27908. dir: dir isR
  27909. --- END Output Phase ---
  27910. |\---- Input Phase ---
  27911. =>WM: (15410: I2 ^dir R)
  27912. =>WM: (15409: I2 ^reward 1)
  27913. =>WM: (15408: I2 ^see 1)
  27914. =>WM: (15407: N1093 ^status complete)
  27915. <=WM: (15395: I2 ^dir R)
  27916. <=WM: (15394: I2 ^reward 1)
  27917. <=WM: (15393: I2 ^see 1)
  27918. =>WM: (15411: I2 ^level-1 R1-root)
  27919. <=WM: (15396: I2 ^level-1 L1-root)
  27920. --- END Input Phase ---
  27921. --- Proposal Phase ---
  27922. --- Inner Elaboration Phase, active level 1 (S1) ---
  27923. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  27924. -->
  27925. (S1 ^operator O2185 = -0.1070236389116304)
  27926. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  27927. -->
  27928. (S1 ^operator O2186 = 0.6602288976103786)
  27929. Firing prefer*rvt*predict-no*H0*4*v1*H1
  27930. -->
  27931. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  27932. -->
  27933. Firing elaborate*copy-see-to-output-link
  27934. -->
  27935. (I3 ^see 1 +)
  27936. Firing elaborate*reward*based*on*reward
  27937. -->
  27938. (R1097 ^value 1 +)
  27939. (R1 ^reward R1097 +)
  27940. Firing propose*predict-yes
  27941. -->
  27942. (O2187 ^name predict-yes +)
  27943. (S1 ^operator O2187 +)
  27944. Firing propose*predict-no
  27945. -->
  27946. (O2188 ^name predict-no +)
  27947. (S1 ^operator O2188 +)
  27948. Firing rl*prefer*rvt*predict-no*H0*4
  27949. -->
  27950. (S1 ^operator O2186 = 0.3397888511281359)
  27951. Firing rl*prefer*rvt*predict-yes*H0*3
  27952. -->
  27953. (S1 ^operator O2185 = 0.3377068406707124)
  27954. Firing prefer*rvt*predict-yes*H0
  27955. -->
  27956. Firing prefer*rvt*predict-no*H0
  27957. -->
  27958. Firing elaborate*copy-dir-to-output-link
  27959. -->
  27960. (I3 ^dir R +)
  27961. inner elaboration loop at bottom goal.
  27962. Retracting elaborate*copy-see-to-output-link
  27963. -->
  27964. (I3 ^see 1 +)
  27965. Retracting propose*predict-no
  27966. -->
  27967. (O2186 ^name predict-no +)
  27968. (S1 ^operator O2186 +)
  27969. Retracting propose*predict-yes
  27970. -->
  27971. (O2185 ^name predict-yes +)
  27972. (S1 ^operator O2185 +)
  27973. Retracting elaborate*reward*based*on*reward
  27974. -->
  27975. (R1096 ^value 1 +)
  27976. (R1 ^reward R1096 +)
  27977. Retracting elaborate*copy-dir-to-output-link
  27978. -->
  27979. (I3 ^dir R +)
  27980. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*37
  27981. -->
  27982. (S1 ^operator O2186 = -0.2714224023553999)
  27983. Retracting rl*prefer*rvt*predict-no*H0*4
  27984. -->
  27985. (S1 ^operator O2186 = 0.3397888511281359)
  27986. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*38
  27987. -->
  27988. (S1 ^operator O2185 = 0.6622488530452479)
  27989. Retracting rl*prefer*rvt*predict-yes*H0*3
  27990. -->
  27991. (S1 ^operator O2185 = 0.3377068406707124)
  27992. =>WM: (15417: S1 ^operator O2188 +)
  27993. =>WM: (15416: S1 ^operator O2187 +)
  27994. =>WM: (15415: O2188 ^name predict-no)
  27995. =>WM: (15414: O2187 ^name predict-yes)
  27996. =>WM: (15413: R1097 ^value 1)
  27997. =>WM: (15412: R1 ^reward R1097)
  27998. <=WM: (15403: S1 ^operator O2185 +)
  27999. <=WM: (15405: S1 ^operator O2185)
  28000. <=WM: (15404: S1 ^operator O2186 +)
  28001. <=WM: (15398: R1 ^reward R1096)
  28002. <=WM: (15401: O2186 ^name predict-no)
  28003. <=WM: (15400: O2185 ^name predict-yes)
  28004. <=WM: (15399: R1096 ^value 1)
  28005. --- Inner Elaboration Phase, active level 1 (S1) ---
  28006. Firing prefer*rvt*predict-yes*H0
  28007. -->
  28008. Firing rl*prefer*rvt*predict-yes*H0*3
  28009. -->
  28010. (S1 ^operator O2187 = 0.3377068406707124)
  28011. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28012. -->
  28013. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  28014. -->
  28015. (S1 ^operator O2187 = -0.1070236389116304)
  28016. Firing prefer*rvt*predict-no*H0
  28017. -->
  28018. Firing rl*prefer*rvt*predict-no*H0*4
  28019. -->
  28020. (S1 ^operator O2188 = 0.3397888511281359)
  28021. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28022. -->
  28023. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  28024. -->
  28025. (S1 ^operator O2188 = 0.6602288976103786)
  28026. inner elaboration loop at bottom goal.
  28027. Retracting rl*prefer*rvt*predict-no*H0*4
  28028. -->
  28029. (S1 ^operator O2186 = 0.3397888511281359)
  28030. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  28031. -->
  28032. (S1 ^operator O2186 = 0.6602288976103786)
  28033. Retracting rl*prefer*rvt*predict-yes*H0*3
  28034. -->
  28035. (S1 ^operator O2185 = 0.3377068406707124)
  28036. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  28037. -->
  28038. (S1 ^operator O2185 = -0.1070236389116304)
  28039. --- END Proposal Phase ---
  28040. --- Decision Phase ---
  28041. RL update rl*prefer*rvt*predict-yes*H0*3 0.590106 -0.2524 0.337707 -> 0.59011 -0.2524 0.33771(R,m,v=1,0.906077,0.085574)
  28042. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*38 0.409843 0.252406 0.662249 -> 0.409847 0.252406 0.662253(R,m,v=1,1,0)
  28043. =>WM: (15418: S1 ^operator O2188)
  28044. 1094: O: O2188 (predict-no)
  28045. --- END Decision Phase ---
  28046. --- Application Phase ---
  28047. --- Firing Productions (PE) For State At Depth 1 ---
  28048. --- Inner Elaboration Phase, active level 1 (S1) ---
  28049. Firing apply*operator
  28050. -->
  28051. (I3 ^predict-no N1094 + :O )
  28052. Firing apply*operator*complete
  28053. -->
  28054. (I3 ^predict-yes N1093 - :O )
  28055. inner elaboration loop at bottom goal.
  28056. --- Change Working Memory (PE) ---
  28057. =>WM: (15419: I3 ^predict-no N1094)
  28058. <=WM: (15407: N1093 ^status complete)
  28059. <=WM: (15406: I3 ^predict-yes N1093)
  28060. --- Firing Productions (IE) For State At Depth 1 ---
  28061. --- Inner Elaboration Phase, active level 1 (S1) ---
  28062. Firing monitor*world
  28063. -->
  28064. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28065. --- Change Working Memory (IE) ---
  28066. --- END Application Phase ---
  28067. --- Output Phase ---
  28068. ENV: Agent did: predict-no for direction R in state State-B
  28069. In State-B moving R
  28070. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28071. predict error 0
  28072. dir: dir isL
  28073. --- END Output Phase ---
  28074. /|\--- Input Phase ---
  28075. =>WM: (15423: I2 ^dir L)
  28076. =>WM: (15422: I2 ^reward 1)
  28077. =>WM: (15421: I2 ^see 0)
  28078. =>WM: (15420: N1094 ^status complete)
  28079. <=WM: (15410: I2 ^dir R)
  28080. <=WM: (15409: I2 ^reward 1)
  28081. <=WM: (15408: I2 ^see 1)
  28082. =>WM: (15424: I2 ^level-1 R0-root)
  28083. <=WM: (15411: I2 ^level-1 R1-root)
  28084. --- END Input Phase ---
  28085. --- Proposal Phase ---
  28086. --- Inner Elaboration Phase, active level 1 (S1) ---
  28087. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  28088. -->
  28089. (S1 ^operator O2187 = 0.7359248103270613)
  28090. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28091. -->
  28092. Firing elaborate*copy-see-to-output-link
  28093. -->
  28094. (I3 ^see 0 +)
  28095. Firing elaborate*reward*based*on*reward
  28096. -->
  28097. (R1098 ^value 1 +)
  28098. (R1 ^reward R1098 +)
  28099. Firing propose*predict-yes
  28100. -->
  28101. (O2189 ^name predict-yes +)
  28102. (S1 ^operator O2189 +)
  28103. Firing propose*predict-no
  28104. -->
  28105. (O2190 ^name predict-no +)
  28106. (S1 ^operator O2190 +)
  28107. Firing rl*prefer*rvt*predict-no*H0*6
  28108. -->
  28109. (S1 ^operator O2188 = 0.9809082465769686)
  28110. Firing rl*prefer*rvt*predict-yes*H0*5
  28111. -->
  28112. (S1 ^operator O2187 = 0.2640026059923823)
  28113. Firing prefer*rvt*predict-yes*H0
  28114. -->
  28115. Firing prefer*rvt*predict-no*H0
  28116. -->
  28117. Firing elaborate*copy-dir-to-output-link
  28118. -->
  28119. (I3 ^dir L +)
  28120. inner elaboration loop at bottom goal.
  28121. Retracting elaborate*copy-see-to-output-link
  28122. -->
  28123. (I3 ^see 1 +)
  28124. Retracting propose*predict-no
  28125. -->
  28126. (O2188 ^name predict-no +)
  28127. (S1 ^operator O2188 +)
  28128. Retracting propose*predict-yes
  28129. -->
  28130. (O2187 ^name predict-yes +)
  28131. (S1 ^operator O2187 +)
  28132. Retracting elaborate*reward*based*on*reward
  28133. -->
  28134. (R1097 ^value 1 +)
  28135. (R1 ^reward R1097 +)
  28136. Retracting elaborate*copy-dir-to-output-link
  28137. -->
  28138. (I3 ^dir R +)
  28139. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  28140. -->
  28141. (S1 ^operator O2188 = 0.6602288976103786)
  28142. Retracting rl*prefer*rvt*predict-no*H0*4
  28143. -->
  28144. (S1 ^operator O2188 = 0.3397888511281359)
  28145. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  28146. -->
  28147. (S1 ^operator O2187 = -0.1070236389116304)
  28148. Retracting rl*prefer*rvt*predict-yes*H0*3
  28149. -->
  28150. (S1 ^operator O2187 = 0.3377104146245125)
  28151. =>WM: (15432: S1 ^operator O2190 +)
  28152. =>WM: (15431: S1 ^operator O2189 +)
  28153. =>WM: (15430: I3 ^dir L)
  28154. =>WM: (15429: O2190 ^name predict-no)
  28155. =>WM: (15428: O2189 ^name predict-yes)
  28156. =>WM: (15427: R1098 ^value 1)
  28157. =>WM: (15426: R1 ^reward R1098)
  28158. =>WM: (15425: I3 ^see 0)
  28159. <=WM: (15416: S1 ^operator O2187 +)
  28160. <=WM: (15417: S1 ^operator O2188 +)
  28161. <=WM: (15418: S1 ^operator O2188)
  28162. <=WM: (15402: I3 ^dir R)
  28163. <=WM: (15412: R1 ^reward R1097)
  28164. <=WM: (15397: I3 ^see 1)
  28165. <=WM: (15415: O2188 ^name predict-no)
  28166. <=WM: (15414: O2187 ^name predict-yes)
  28167. <=WM: (15413: R1097 ^value 1)
  28168. --- Inner Elaboration Phase, active level 1 (S1) ---
  28169. Firing prefer*rvt*predict-yes*H0
  28170. -->
  28171. Firing rl*prefer*rvt*predict-yes*H0*5
  28172. -->
  28173. (S1 ^operator O2189 = 0.2640026059923823)
  28174. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28175. -->
  28176. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  28177. -->
  28178. (S1 ^operator O2189 = 0.7359248103270613)
  28179. Firing prefer*rvt*predict-no*H0
  28180. -->
  28181. Firing rl*prefer*rvt*predict-no*H0*6
  28182. -->
  28183. (S1 ^operator O2190 = 0.9809082465769686)
  28184. inner elaboration loop at bottom goal.
  28185. Retracting rl*prefer*rvt*predict-no*H0*6
  28186. -->
  28187. (S1 ^operator O2188 = 0.9809082465769686)
  28188. Retracting rl*prefer*rvt*predict-yes*H0*5
  28189. -->
  28190. (S1 ^operator O2187 = 0.2640026059923823)
  28191. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  28192. -->
  28193. (S1 ^operator O2187 = 0.7359248103270613)
  28194. --- END Proposal Phase ---
  28195. --- Decision Phase ---
  28196. RL update rl*prefer*rvt*predict-no*H0*4 0.570273 -0.230484 0.339789 -> 0.570272 -0.230484 0.339787(R,m,v=1,0.887097,0.100697)
  28197. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429745 0.230484 0.660229 -> 0.429744 0.230484 0.660227(R,m,v=1,1,0)
  28198. =>WM: (15433: S1 ^operator O2189)
  28199. 1095: O: O2189 (predict-yes)
  28200. --- END Decision Phase ---
  28201. --- Application Phase ---
  28202. --- Firing Productions (PE) For State At Depth 1 ---
  28203. --- Inner Elaboration Phase, active level 1 (S1) ---
  28204. Firing apply*operator
  28205. -->
  28206. (I3 ^predict-yes N1095 + :O )
  28207. Firing apply*operator*complete
  28208. -->
  28209. (I3 ^predict-no N1094 - :O )
  28210. inner elaboration loop at bottom goal.
  28211. --- Change Working Memory (PE) ---
  28212. =>WM: (15434: I3 ^predict-yes N1095)
  28213. <=WM: (15420: N1094 ^status complete)
  28214. <=WM: (15419: I3 ^predict-no N1094)
  28215. --- Firing Productions (IE) For State At Depth 1 ---
  28216. --- Inner Elaboration Phase, active level 1 (S1) ---
  28217. Firing monitor*world
  28218. -->
  28219. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28220. --- Change Working Memory (IE) ---
  28221. --- END Application Phase ---
  28222. --- Output Phase ---
  28223. ENV: Agent did: predict-yes for direction L in state State-B
  28224. In State-B moving L
  28225. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28226. predict error 0
  28227. dir: dir isL
  28228. --- END Output Phase ---
  28229. -/|--- Input Phase ---
  28230. =>WM: (15438: I2 ^dir L)
  28231. =>WM: (15437: I2 ^reward 1)
  28232. =>WM: (15436: I2 ^see 1)
  28233. =>WM: (15435: N1095 ^status complete)
  28234. <=WM: (15423: I2 ^dir L)
  28235. <=WM: (15422: I2 ^reward 1)
  28236. <=WM: (15421: I2 ^see 0)
  28237. =>WM: (15439: I2 ^level-1 L1-root)
  28238. <=WM: (15424: I2 ^level-1 R0-root)
  28239. --- END Input Phase ---
  28240. --- Proposal Phase ---
  28241. --- Inner Elaboration Phase, active level 1 (S1) ---
  28242. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  28243. -->
  28244. (S1 ^operator O2189 = -0.181727099742844)
  28245. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28246. -->
  28247. Firing elaborate*copy-see-to-output-link
  28248. -->
  28249. (I3 ^see 1 +)
  28250. Firing elaborate*reward*based*on*reward
  28251. -->
  28252. (R1099 ^value 1 +)
  28253. (R1 ^reward R1099 +)
  28254. Firing propose*predict-yes
  28255. -->
  28256. (O2191 ^name predict-yes +)
  28257. (S1 ^operator O2191 +)
  28258. Firing propose*predict-no
  28259. -->
  28260. (O2192 ^name predict-no +)
  28261. (S1 ^operator O2192 +)
  28262. Firing rl*prefer*rvt*predict-no*H0*6
  28263. -->
  28264. (S1 ^operator O2190 = 0.9809082465769686)
  28265. Firing rl*prefer*rvt*predict-yes*H0*5
  28266. -->
  28267. (S1 ^operator O2189 = 0.2640026059923823)
  28268. Firing prefer*rvt*predict-yes*H0
  28269. -->
  28270. Firing prefer*rvt*predict-no*H0
  28271. -->
  28272. Firing elaborate*copy-dir-to-output-link
  28273. -->
  28274. (I3 ^dir L +)
  28275. inner elaboration loop at bottom goal.
  28276. Retracting elaborate*copy-see-to-output-link
  28277. -->
  28278. (I3 ^see 0 +)
  28279. Retracting propose*predict-no
  28280. -->
  28281. (O2190 ^name predict-no +)
  28282. (S1 ^operator O2190 +)
  28283. Retracting propose*predict-yes
  28284. -->
  28285. (O2189 ^name predict-yes +)
  28286. (S1 ^operator O2189 +)
  28287. Retracting elaborate*reward*based*on*reward
  28288. -->
  28289. (R1098 ^value 1 +)
  28290. (R1 ^reward R1098 +)
  28291. Retracting elaborate*copy-dir-to-output-link
  28292. -->
  28293. (I3 ^dir L +)
  28294. Retracting rl*prefer*rvt*predict-no*H0*6
  28295. -->
  28296. (S1 ^operator O2190 = 0.9809082465769686)
  28297. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  28298. -->
  28299. (S1 ^operator O2189 = 0.7359248103270613)
  28300. Retracting rl*prefer*rvt*predict-yes*H0*5
  28301. -->
  28302. (S1 ^operator O2189 = 0.2640026059923823)
  28303. =>WM: (15446: S1 ^operator O2192 +)
  28304. =>WM: (15445: S1 ^operator O2191 +)
  28305. =>WM: (15444: O2192 ^name predict-no)
  28306. =>WM: (15443: O2191 ^name predict-yes)
  28307. =>WM: (15442: R1099 ^value 1)
  28308. =>WM: (15441: R1 ^reward R1099)
  28309. =>WM: (15440: I3 ^see 1)
  28310. <=WM: (15431: S1 ^operator O2189 +)
  28311. <=WM: (15433: S1 ^operator O2189)
  28312. <=WM: (15432: S1 ^operator O2190 +)
  28313. <=WM: (15426: R1 ^reward R1098)
  28314. <=WM: (15425: I3 ^see 0)
  28315. <=WM: (15429: O2190 ^name predict-no)
  28316. <=WM: (15428: O2189 ^name predict-yes)
  28317. <=WM: (15427: R1098 ^value 1)
  28318. --- Inner Elaboration Phase, active level 1 (S1) ---
  28319. Firing prefer*rvt*predict-yes*H0
  28320. -->
  28321. Firing rl*prefer*rvt*predict-yes*H0*5
  28322. -->
  28323. (S1 ^operator O2191 = 0.2640026059923823)
  28324. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28325. -->
  28326. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  28327. -->
  28328. (S1 ^operator O2191 = -0.181727099742844)
  28329. Firing prefer*rvt*predict-no*H0
  28330. -->
  28331. Firing rl*prefer*rvt*predict-no*H0*6
  28332. -->
  28333. (S1 ^operator O2192 = 0.9809082465769686)
  28334. inner elaboration loop at bottom goal.
  28335. Retracting rl*prefer*rvt*predict-no*H0*6
  28336. -->
  28337. (S1 ^operator O2190 = 0.9809082465769686)
  28338. Retracting rl*prefer*rvt*predict-yes*H0*5
  28339. -->
  28340. (S1 ^operator O2189 = 0.2640026059923823)
  28341. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  28342. -->
  28343. (S1 ^operator O2189 = -0.181727099742844)
  28344. --- END Proposal Phase ---
  28345. --- Decision Phase ---
  28346. RL update rl*prefer*rvt*predict-yes*H0*5 0.554389 -0.290386 0.264003 -> 0.554394 -0.290386 0.264008(R,m,v=1,0.885417,0.101985)
  28347. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.44554 0.290385 0.735925 -> 0.445547 0.290385 0.735932(R,m,v=1,1,0)
  28348. =>WM: (15447: S1 ^operator O2192)
  28349. 1096: O: O2192 (predict-no)
  28350. --- END Decision Phase ---
  28351. --- Application Phase ---
  28352. --- Firing Productions (PE) For State At Depth 1 ---
  28353. --- Inner Elaboration Phase, active level 1 (S1) ---
  28354. Firing apply*operator
  28355. -->
  28356. (I3 ^predict-no N1096 + :O )
  28357. Firing apply*operator*complete
  28358. -->
  28359. (I3 ^predict-yes N1095 - :O )
  28360. inner elaboration loop at bottom goal.
  28361. --- Change Working Memory (PE) ---
  28362. =>WM: (15448: I3 ^predict-no N1096)
  28363. <=WM: (15435: N1095 ^status complete)
  28364. <=WM: (15434: I3 ^predict-yes N1095)
  28365. --- Firing Productions (IE) For State At Depth 1 ---
  28366. --- Inner Elaboration Phase, active level 1 (S1) ---
  28367. Firing monitor*world
  28368. -->
  28369. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28370. --- Change Working Memory (IE) ---
  28371. --- END Application Phase ---
  28372. --- Output Phase ---
  28373. ENV: Agent did: predict-no for direction L in state State-A
  28374. In State-A moving L
  28375. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28376. predict error 0
  28377. dir: dir isL
  28378. --- END Output Phase ---
  28379. \-/--- Input Phase ---
  28380. =>WM: (15452: I2 ^dir L)
  28381. =>WM: (15451: I2 ^reward 1)
  28382. =>WM: (15450: I2 ^see 0)
  28383. =>WM: (15449: N1096 ^status complete)
  28384. <=WM: (15438: I2 ^dir L)
  28385. <=WM: (15437: I2 ^reward 1)
  28386. <=WM: (15436: I2 ^see 1)
  28387. =>WM: (15453: I2 ^level-1 L0-root)
  28388. <=WM: (15439: I2 ^level-1 L1-root)
  28389. --- END Input Phase ---
  28390. --- Proposal Phase ---
  28391. --- Inner Elaboration Phase, active level 1 (S1) ---
  28392. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  28393. -->
  28394. (S1 ^operator O2191 = -0.1386470047172653)
  28395. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28396. -->
  28397. Firing elaborate*copy-see-to-output-link
  28398. -->
  28399. (I3 ^see 0 +)
  28400. Firing elaborate*reward*based*on*reward
  28401. -->
  28402. (R1100 ^value 1 +)
  28403. (R1 ^reward R1100 +)
  28404. Firing propose*predict-yes
  28405. -->
  28406. (O2193 ^name predict-yes +)
  28407. (S1 ^operator O2193 +)
  28408. Firing propose*predict-no
  28409. -->
  28410. (O2194 ^name predict-no +)
  28411. (S1 ^operator O2194 +)
  28412. Firing rl*prefer*rvt*predict-no*H0*6
  28413. -->
  28414. (S1 ^operator O2192 = 0.9809082465769686)
  28415. Firing rl*prefer*rvt*predict-yes*H0*5
  28416. -->
  28417. (S1 ^operator O2191 = 0.2640084057314346)
  28418. Firing prefer*rvt*predict-yes*H0
  28419. -->
  28420. Firing prefer*rvt*predict-no*H0
  28421. -->
  28422. Firing elaborate*copy-dir-to-output-link
  28423. -->
  28424. (I3 ^dir L +)
  28425. inner elaboration loop at bottom goal.
  28426. Retracting elaborate*copy-see-to-output-link
  28427. -->
  28428. (I3 ^see 1 +)
  28429. Retracting propose*predict-no
  28430. -->
  28431. (O2192 ^name predict-no +)
  28432. (S1 ^operator O2192 +)
  28433. Retracting propose*predict-yes
  28434. -->
  28435. (O2191 ^name predict-yes +)
  28436. (S1 ^operator O2191 +)
  28437. Retracting elaborate*reward*based*on*reward
  28438. -->
  28439. (R1099 ^value 1 +)
  28440. (R1 ^reward R1099 +)
  28441. Retracting elaborate*copy-dir-to-output-link
  28442. -->
  28443. (I3 ^dir L +)
  28444. Retracting rl*prefer*rvt*predict-no*H0*6
  28445. -->
  28446. (S1 ^operator O2192 = 0.9809082465769686)
  28447. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  28448. -->
  28449. (S1 ^operator O2191 = -0.181727099742844)
  28450. Retracting rl*prefer*rvt*predict-yes*H0*5
  28451. -->
  28452. (S1 ^operator O2191 = 0.2640084057314346)
  28453. =>WM: (15460: S1 ^operator O2194 +)
  28454. =>WM: (15459: S1 ^operator O2193 +)
  28455. =>WM: (15458: O2194 ^name predict-no)
  28456. =>WM: (15457: O2193 ^name predict-yes)
  28457. =>WM: (15456: R1100 ^value 1)
  28458. =>WM: (15455: R1 ^reward R1100)
  28459. =>WM: (15454: I3 ^see 0)
  28460. <=WM: (15445: S1 ^operator O2191 +)
  28461. <=WM: (15446: S1 ^operator O2192 +)
  28462. <=WM: (15447: S1 ^operator O2192)
  28463. <=WM: (15441: R1 ^reward R1099)
  28464. <=WM: (15440: I3 ^see 1)
  28465. <=WM: (15444: O2192 ^name predict-no)
  28466. <=WM: (15443: O2191 ^name predict-yes)
  28467. <=WM: (15442: R1099 ^value 1)
  28468. --- Inner Elaboration Phase, active level 1 (S1) ---
  28469. Firing prefer*rvt*predict-yes*H0
  28470. -->
  28471. Firing rl*prefer*rvt*predict-yes*H0*5
  28472. -->
  28473. (S1 ^operator O2193 = 0.2640084057314346)
  28474. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28475. -->
  28476. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  28477. -->
  28478. (S1 ^operator O2193 = -0.1386470047172653)
  28479. Firing prefer*rvt*predict-no*H0
  28480. -->
  28481. Firing rl*prefer*rvt*predict-no*H0*6
  28482. -->
  28483. (S1 ^operator O2194 = 0.9809082465769686)
  28484. inner elaboration loop at bottom goal.
  28485. Retracting rl*prefer*rvt*predict-no*H0*6
  28486. -->
  28487. (S1 ^operator O2192 = 0.9809082465769686)
  28488. Retracting rl*prefer*rvt*predict-yes*H0*5
  28489. -->
  28490. (S1 ^operator O2191 = 0.2640084057314346)
  28491. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  28492. -->
  28493. (S1 ^operator O2191 = -0.1386470047172653)
  28494. --- END Proposal Phase ---
  28495. --- Decision Phase ---
  28496. RL update rl*prefer*rvt*predict-no*H0*6 0.980908 0 0.980908 -> 0.984038 0 0.984038(R,m,v=1,0.908537,0.0836077)
  28497. =>WM: (15461: S1 ^operator O2194)
  28498. 1097: O: O2194 (predict-no)
  28499. --- END Decision Phase ---
  28500. --- Application Phase ---
  28501. --- Firing Productions (PE) For State At Depth 1 ---
  28502. --- Inner Elaboration Phase, active level 1 (S1) ---
  28503. Firing apply*operator
  28504. -->
  28505. (I3 ^predict-no N1097 + :O )
  28506. Firing apply*operator*complete
  28507. -->
  28508. (I3 ^predict-no N1096 - :O )
  28509. inner elaboration loop at bottom goal.
  28510. --- Change Working Memory (PE) ---
  28511. =>WM: (15462: I3 ^predict-no N1097)
  28512. <=WM: (15449: N1096 ^status complete)
  28513. <=WM: (15448: I3 ^predict-no N1096)
  28514. --- Firing Productions (IE) For State At Depth 1 ---
  28515. --- Inner Elaboration Phase, active level 1 (S1) ---
  28516. Firing monitor*world
  28517. -->
  28518. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28519. --- Change Working Memory (IE) ---
  28520. --- END Application Phase ---
  28521. --- Output Phase ---
  28522. ENV: Agent did: predict-no for direction L in state State-A
  28523. In State-A moving L
  28524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28525. predict error 0
  28526. dir: dir isR
  28527. --- END Output Phase ---
  28528. |--- Input Phase ---
  28529. =>WM: (15466: I2 ^dir R)
  28530. =>WM: (15465: I2 ^reward 1)
  28531. =>WM: (15464: I2 ^see 0)
  28532. =>WM: (15463: N1097 ^status complete)
  28533. <=WM: (15452: I2 ^dir L)
  28534. <=WM: (15451: I2 ^reward 1)
  28535. <=WM: (15450: I2 ^see 0)
  28536. =>WM: (15467: I2 ^level-1 L0-root)
  28537. <=WM: (15453: I2 ^level-1 L0-root)
  28538. --- END Input Phase ---
  28539. --- Proposal Phase ---
  28540. --- Inner Elaboration Phase, active level 1 (S1) ---
  28541. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  28542. -->
  28543. (S1 ^operator O2194 = -0.2817060109291377)
  28544. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  28545. -->
  28546. (S1 ^operator O2193 = 0.6623171039238327)
  28547. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28548. -->
  28549. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28550. -->
  28551. Firing elaborate*copy-see-to-output-link
  28552. -->
  28553. (I3 ^see 0 +)
  28554. Firing elaborate*reward*based*on*reward
  28555. -->
  28556. (R1101 ^value 1 +)
  28557. (R1 ^reward R1101 +)
  28558. Firing propose*predict-yes
  28559. -->
  28560. (O2195 ^name predict-yes +)
  28561. (S1 ^operator O2195 +)
  28562. Firing propose*predict-no
  28563. -->
  28564. (O2196 ^name predict-no +)
  28565. (S1 ^operator O2196 +)
  28566. Firing rl*prefer*rvt*predict-no*H0*4
  28567. -->
  28568. (S1 ^operator O2194 = 0.3397874256976259)
  28569. Firing rl*prefer*rvt*predict-yes*H0*3
  28570. -->
  28571. (S1 ^operator O2193 = 0.3377104146245125)
  28572. Firing prefer*rvt*predict-yes*H0
  28573. -->
  28574. Firing prefer*rvt*predict-no*H0
  28575. -->
  28576. Firing elaborate*copy-dir-to-output-link
  28577. -->
  28578. (I3 ^dir R +)
  28579. inner elaboration loop at bottom goal.
  28580. Retracting elaborate*copy-see-to-output-link
  28581. -->
  28582. (I3 ^see 0 +)
  28583. Retracting propose*predict-no
  28584. -->
  28585. (O2194 ^name predict-no +)
  28586. (S1 ^operator O2194 +)
  28587. Retracting propose*predict-yes
  28588. -->
  28589. (O2193 ^name predict-yes +)
  28590. (S1 ^operator O2193 +)
  28591. Retracting elaborate*reward*based*on*reward
  28592. -->
  28593. (R1100 ^value 1 +)
  28594. (R1 ^reward R1100 +)
  28595. Retracting elaborate*copy-dir-to-output-link
  28596. -->
  28597. (I3 ^dir L +)
  28598. Retracting rl*prefer*rvt*predict-no*H0*6
  28599. -->
  28600. (S1 ^operator O2194 = 0.9840381107549686)
  28601. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  28602. -->
  28603. (S1 ^operator O2193 = -0.1386470047172653)
  28604. Retracting rl*prefer*rvt*predict-yes*H0*5
  28605. -->
  28606. (S1 ^operator O2193 = 0.2640084057314346)
  28607. =>WM: (15474: S1 ^operator O2196 +)
  28608. =>WM: (15473: S1 ^operator O2195 +)
  28609. =>WM: (15472: I3 ^dir R)
  28610. =>WM: (15471: O2196 ^name predict-no)
  28611. =>WM: (15470: O2195 ^name predict-yes)
  28612. =>WM: (15469: R1101 ^value 1)
  28613. =>WM: (15468: R1 ^reward R1101)
  28614. <=WM: (15459: S1 ^operator O2193 +)
  28615. <=WM: (15460: S1 ^operator O2194 +)
  28616. <=WM: (15461: S1 ^operator O2194)
  28617. <=WM: (15430: I3 ^dir L)
  28618. <=WM: (15455: R1 ^reward R1100)
  28619. <=WM: (15458: O2194 ^name predict-no)
  28620. <=WM: (15457: O2193 ^name predict-yes)
  28621. <=WM: (15456: R1100 ^value 1)
  28622. --- Inner Elaboration Phase, active level 1 (S1) ---
  28623. Firing prefer*rvt*predict-yes*H0
  28624. -->
  28625. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  28626. -->
  28627. (S1 ^operator O2195 = 0.6623171039238327)
  28628. Firing rl*prefer*rvt*predict-yes*H0*3
  28629. -->
  28630. (S1 ^operator O2195 = 0.3377104146245125)
  28631. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  28632. -->
  28633. Firing prefer*rvt*predict-no*H0
  28634. -->
  28635. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  28636. -->
  28637. (S1 ^operator O2196 = -0.2817060109291377)
  28638. Firing rl*prefer*rvt*predict-no*H0*4
  28639. -->
  28640. (S1 ^operator O2196 = 0.3397874256976259)
  28641. Firing prefer*rvt*predict-no*H0*4*v1*H1
  28642. -->
  28643. inner elaboration loop at bottom goal.
  28644. Retracting rl*prefer*rvt*predict-no*H0*4
  28645. -->
  28646. (S1 ^operator O2194 = 0.3397874256976259)
  28647. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  28648. -->
  28649. (S1 ^operator O2194 = -0.2817060109291377)
  28650. Retracting rl*prefer*rvt*predict-yes*H0*3
  28651. -->
  28652. (S1 ^operator O2193 = 0.3377104146245125)
  28653. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  28654. -->
  28655. (S1 ^operator O2193 = 0.6623171039238327)
  28656. --- END Proposal Phase ---
  28657. --- Decision Phase ---
  28658. RL update rl*prefer*rvt*predict-no*H0*6 0.984038 0 0.984038 -> 0.986652 0 0.986652(R,m,v=1,0.909091,0.0831486)
  28659. =>WM: (15475: S1 ^operator O2195)
  28660. 1098: O: O2195 (predict-yes)
  28661. --- END Decision Phase ---
  28662. --- Application Phase ---
  28663. --- Firing Productions (PE) For State At Depth 1 ---
  28664. --- Inner Elaboration Phase, active level 1 (S1) ---
  28665. Firing apply*operator
  28666. -->
  28667. (I3 ^predict-yes N1098 + :O )
  28668. Firing apply*operator*complete
  28669. -->
  28670. (I3 ^predict-no N1097 - :O )
  28671. inner elaboration loop at bottom goal.
  28672. --- Change Working Memory (PE) ---
  28673. =>WM: (15476: I3 ^predict-yes N1098)
  28674. <=WM: (15463: N1097 ^status complete)
  28675. <=WM: (15462: I3 ^predict-no N1097)
  28676. --- Firing Productions (IE) For State At Depth 1 ---
  28677. --- Inner Elaboration Phase, active level 1 (S1) ---
  28678. Firing monitor*world
  28679. -->
  28680. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28681. --- Change Working Memory (IE) ---
  28682. --- END Application Phase ---
  28683. --- Output Phase ---
  28684. ENV: Agent did: predict-yes for direction R in state State-A
  28685. In State-A moving R
  28686. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28687. predict error 0
  28688. dir: dir isL
  28689. --- END Output Phase ---
  28690. \-/--- Input Phase ---
  28691. =>WM: (15480: I2 ^dir L)
  28692. =>WM: (15479: I2 ^reward 1)
  28693. =>WM: (15478: I2 ^see 1)
  28694. =>WM: (15477: N1098 ^status complete)
  28695. <=WM: (15466: I2 ^dir R)
  28696. <=WM: (15465: I2 ^reward 1)
  28697. <=WM: (15464: I2 ^see 0)
  28698. =>WM: (15481: I2 ^level-1 R1-root)
  28699. <=WM: (15467: I2 ^level-1 L0-root)
  28700. --- END Input Phase ---
  28701. --- Proposal Phase ---
  28702. --- Inner Elaboration Phase, active level 1 (S1) ---
  28703. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  28704. -->
  28705. (S1 ^operator O2195 = 0.7361333243810797)
  28706. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28707. -->
  28708. Firing elaborate*copy-see-to-output-link
  28709. -->
  28710. (I3 ^see 1 +)
  28711. Firing elaborate*reward*based*on*reward
  28712. -->
  28713. (R1102 ^value 1 +)
  28714. (R1 ^reward R1102 +)
  28715. Firing propose*predict-yes
  28716. -->
  28717. (O2197 ^name predict-yes +)
  28718. (S1 ^operator O2197 +)
  28719. Firing propose*predict-no
  28720. -->
  28721. (O2198 ^name predict-no +)
  28722. (S1 ^operator O2198 +)
  28723. Firing rl*prefer*rvt*predict-no*H0*6
  28724. -->
  28725. (S1 ^operator O2196 = 0.9866522659768354)
  28726. Firing rl*prefer*rvt*predict-yes*H0*5
  28727. -->
  28728. (S1 ^operator O2195 = 0.2640084057314346)
  28729. Firing prefer*rvt*predict-yes*H0
  28730. -->
  28731. Firing prefer*rvt*predict-no*H0
  28732. -->
  28733. Firing elaborate*copy-dir-to-output-link
  28734. -->
  28735. (I3 ^dir L +)
  28736. inner elaboration loop at bottom goal.
  28737. Retracting elaborate*copy-see-to-output-link
  28738. -->
  28739. (I3 ^see 0 +)
  28740. Retracting propose*predict-no
  28741. -->
  28742. (O2196 ^name predict-no +)
  28743. (S1 ^operator O2196 +)
  28744. Retracting propose*predict-yes
  28745. -->
  28746. (O2195 ^name predict-yes +)
  28747. (S1 ^operator O2195 +)
  28748. Retracting elaborate*reward*based*on*reward
  28749. -->
  28750. (R1101 ^value 1 +)
  28751. (R1 ^reward R1101 +)
  28752. Retracting elaborate*copy-dir-to-output-link
  28753. -->
  28754. (I3 ^dir R +)
  28755. Retracting rl*prefer*rvt*predict-no*H0*4
  28756. -->
  28757. (S1 ^operator O2196 = 0.3397874256976259)
  28758. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  28759. -->
  28760. (S1 ^operator O2196 = -0.2817060109291377)
  28761. Retracting rl*prefer*rvt*predict-yes*H0*3
  28762. -->
  28763. (S1 ^operator O2195 = 0.3377104146245125)
  28764. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  28765. -->
  28766. (S1 ^operator O2195 = 0.6623171039238327)
  28767. =>WM: (15489: S1 ^operator O2198 +)
  28768. =>WM: (15488: S1 ^operator O2197 +)
  28769. =>WM: (15487: I3 ^dir L)
  28770. =>WM: (15486: O2198 ^name predict-no)
  28771. =>WM: (15485: O2197 ^name predict-yes)
  28772. =>WM: (15484: R1102 ^value 1)
  28773. =>WM: (15483: R1 ^reward R1102)
  28774. =>WM: (15482: I3 ^see 1)
  28775. <=WM: (15473: S1 ^operator O2195 +)
  28776. <=WM: (15475: S1 ^operator O2195)
  28777. <=WM: (15474: S1 ^operator O2196 +)
  28778. <=WM: (15472: I3 ^dir R)
  28779. <=WM: (15468: R1 ^reward R1101)
  28780. <=WM: (15454: I3 ^see 0)
  28781. <=WM: (15471: O2196 ^name predict-no)
  28782. <=WM: (15470: O2195 ^name predict-yes)
  28783. <=WM: (15469: R1101 ^value 1)
  28784. --- Inner Elaboration Phase, active level 1 (S1) ---
  28785. Firing prefer*rvt*predict-yes*H0
  28786. -->
  28787. Firing rl*prefer*rvt*predict-yes*H0*5
  28788. -->
  28789. (S1 ^operator O2197 = 0.2640084057314346)
  28790. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  28791. -->
  28792. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  28793. -->
  28794. (S1 ^operator O2197 = 0.7361333243810797)
  28795. Firing prefer*rvt*predict-no*H0
  28796. -->
  28797. Firing rl*prefer*rvt*predict-no*H0*6
  28798. -->
  28799. (S1 ^operator O2198 = 0.9866522659768354)
  28800. inner elaboration loop at bottom goal.
  28801. Retracting rl*prefer*rvt*predict-no*H0*6
  28802. -->
  28803. (S1 ^operator O2196 = 0.9866522659768354)
  28804. Retracting rl*prefer*rvt*predict-yes*H0*5
  28805. -->
  28806. (S1 ^operator O2195 = 0.2640084057314346)
  28807. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  28808. -->
  28809. (S1 ^operator O2195 = 0.7361333243810797)
  28810. --- END Proposal Phase ---
  28811. --- Decision Phase ---
  28812. RL update rl*prefer*rvt*predict-yes*H0*3 0.59011 -0.2524 0.33771 -> 0.590108 -0.2524 0.337708(R,m,v=1,0.906593,0.0851497)
  28813. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409921 0.252396 0.662317 -> 0.409918 0.252396 0.662315(R,m,v=1,1,0)
  28814. =>WM: (15490: S1 ^operator O2197)
  28815. 1099: O: O2197 (predict-yes)
  28816. --- END Decision Phase ---
  28817. --- Application Phase ---
  28818. --- Firing Productions (PE) For State At Depth 1 ---
  28819. --- Inner Elaboration Phase, active level 1 (S1) ---
  28820. Firing apply*operator
  28821. -->
  28822. (I3 ^predict-yes N1099 + :O )
  28823. Firing apply*operator*complete
  28824. -->
  28825. (I3 ^predict-yes N1098 - :O )
  28826. inner elaboration loop at bottom goal.
  28827. --- Change Working Memory (PE) ---
  28828. =>WM: (15491: I3 ^predict-yes N1099)
  28829. <=WM: (15477: N1098 ^status complete)
  28830. <=WM: (15476: I3 ^predict-yes N1098)
  28831. --- Firing Productions (IE) For State At Depth 1 ---
  28832. --- Inner Elaboration Phase, active level 1 (S1) ---
  28833. Firing monitor*world
  28834. -->
  28835. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28836. --- Change Working Memory (IE) ---
  28837. --- END Application Phase ---
  28838. --- Output Phase ---
  28839. ENV: Agent did: predict-yes for direction L in state State-B
  28840. In State-B moving L
  28841. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28842. predict error 0
  28843. dir: dir isU
  28844. --- END Output Phase ---
  28845. |\--- Input Phase ---
  28846. =>WM: (15495: I2 ^dir U)
  28847. =>WM: (15494: I2 ^reward 1)
  28848. =>WM: (15493: I2 ^see 1)
  28849. =>WM: (15492: N1099 ^status complete)
  28850. <=WM: (15480: I2 ^dir L)
  28851. <=WM: (15479: I2 ^reward 1)
  28852. <=WM: (15478: I2 ^see 1)
  28853. =>WM: (15496: I2 ^level-1 L1-root)
  28854. <=WM: (15481: I2 ^level-1 R1-root)
  28855. --- END Input Phase ---
  28856. --- Proposal Phase ---
  28857. --- Inner Elaboration Phase, active level 1 (S1) ---
  28858. Firing elaborate*copy-see-to-output-link
  28859. -->
  28860. (I3 ^see 1 +)
  28861. Firing elaborate*reward*based*on*reward
  28862. -->
  28863. (R1103 ^value 1 +)
  28864. (R1 ^reward R1103 +)
  28865. Firing propose*predict-yes
  28866. -->
  28867. (O2199 ^name predict-yes +)
  28868. (S1 ^operator O2199 +)
  28869. Firing propose*predict-no
  28870. -->
  28871. (O2200 ^name predict-no +)
  28872. (S1 ^operator O2200 +)
  28873. Firing rl*prefer*rvt*predict-no*H0*2
  28874. -->
  28875. (S1 ^operator O2198 = 1.)
  28876. Firing rl*prefer*rvt*predict-yes*H0*1
  28877. -->
  28878. (S1 ^operator O2197 = 0.)
  28879. Firing prefer*rvt*predict-yes*H0
  28880. -->
  28881. Firing prefer*rvt*predict-no*H0
  28882. -->
  28883. Firing elaborate*copy-dir-to-output-link
  28884. -->
  28885. (I3 ^dir U +)
  28886. inner elaboration loop at bottom goal.
  28887. Retracting elaborate*copy-see-to-output-link
  28888. -->
  28889. (I3 ^see 1 +)
  28890. Retracting propose*predict-no
  28891. -->
  28892. (O2198 ^name predict-no +)
  28893. (S1 ^operator O2198 +)
  28894. Retracting propose*predict-yes
  28895. -->
  28896. (O2197 ^name predict-yes +)
  28897. (S1 ^operator O2197 +)
  28898. Retracting elaborate*reward*based*on*reward
  28899. -->
  28900. (R1102 ^value 1 +)
  28901. (R1 ^reward R1102 +)
  28902. Retracting elaborate*copy-dir-to-output-link
  28903. -->
  28904. (I3 ^dir L +)
  28905. Retracting rl*prefer*rvt*predict-no*H0*6
  28906. -->
  28907. (S1 ^operator O2198 = 0.9866522659768354)
  28908. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*41
  28909. -->
  28910. (S1 ^operator O2197 = 0.7361333243810797)
  28911. Retracting rl*prefer*rvt*predict-yes*H0*5
  28912. -->
  28913. (S1 ^operator O2197 = 0.2640084057314346)
  28914. =>WM: (15503: S1 ^operator O2200 +)
  28915. =>WM: (15502: S1 ^operator O2199 +)
  28916. =>WM: (15501: I3 ^dir U)
  28917. =>WM: (15500: O2200 ^name predict-no)
  28918. =>WM: (15499: O2199 ^name predict-yes)
  28919. =>WM: (15498: R1103 ^value 1)
  28920. =>WM: (15497: R1 ^reward R1103)
  28921. <=WM: (15488: S1 ^operator O2197 +)
  28922. <=WM: (15490: S1 ^operator O2197)
  28923. <=WM: (15489: S1 ^operator O2198 +)
  28924. <=WM: (15487: I3 ^dir L)
  28925. <=WM: (15483: R1 ^reward R1102)
  28926. <=WM: (15486: O2198 ^name predict-no)
  28927. <=WM: (15485: O2197 ^name predict-yes)
  28928. <=WM: (15484: R1102 ^value 1)
  28929. --- Inner Elaboration Phase, active level 1 (S1) ---
  28930. Firing prefer*rvt*predict-yes*H0
  28931. -->
  28932. Firing rl*prefer*rvt*predict-yes*H0*1
  28933. -->
  28934. (S1 ^operator O2199 = 0.)
  28935. Firing prefer*rvt*predict-no*H0
  28936. -->
  28937. Firing rl*prefer*rvt*predict-no*H0*2
  28938. -->
  28939. (S1 ^operator O2200 = 1.)
  28940. inner elaboration loop at bottom goal.
  28941. Retracting rl*prefer*rvt*predict-no*H0*2
  28942. -->
  28943. (S1 ^operator O2198 = 1.)
  28944. Retracting rl*prefer*rvt*predict-yes*H0*1
  28945. -->
  28946. (S1 ^operator O2197 = 0.)
  28947. --- END Proposal Phase ---
  28948. --- Decision Phase ---
  28949. RL update rl*prefer*rvt*predict-yes*H0*5 0.554394 -0.290386 0.264008 -> 0.554383 -0.290386 0.263997(R,m,v=1,0.88601,0.101522)
  28950. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*41 0.445745 0.290388 0.736133 -> 0.445732 0.290388 0.73612(R,m,v=1,1,0)
  28951. =>WM: (15504: S1 ^operator O2200)
  28952. 1100: O: O2200 (predict-no)
  28953. --- END Decision Phase ---
  28954. --- Application Phase ---
  28955. --- Firing Productions (PE) For State At Depth 1 ---
  28956. --- Inner Elaboration Phase, active level 1 (S1) ---
  28957. Firing apply*operator
  28958. -->
  28959. (I3 ^predict-no N1100 + :O )
  28960. Firing apply*operator*complete
  28961. -->
  28962. (I3 ^predict-yes N1099 - :O )
  28963. inner elaboration loop at bottom goal.
  28964. --- Change Working Memory (PE) ---
  28965. =>WM: (15505: I3 ^predict-no N1100)
  28966. <=WM: (15492: N1099 ^status complete)
  28967. <=WM: (15491: I3 ^predict-yes N1099)
  28968. --- Firing Productions (IE) For State At Depth 1 ---
  28969. --- Inner Elaboration Phase, active level 1 (S1) ---
  28970. Firing monitor*world
  28971. -->
  28972. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28973. --- Change Working Memory (IE) ---
  28974. --- END Application Phase ---
  28975. --- Output Phase ---
  28976. ENV: Agent did: predict-no for direction U in state State-A
  28977. In State-A moving U
  28978. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28979. predict error 0
  28980. dir: dir isU
  28981. --- END Output Phase ---
  28982. -/|--- Input Phase ---
  28983. =>WM: (15509: I2 ^dir U)
  28984. =>WM: (15508: I2 ^reward 1)
  28985. =>WM: (15507: I2 ^see 0)
  28986. =>WM: (15506: N1100 ^status complete)
  28987. <=WM: (15495: I2 ^dir U)
  28988. <=WM: (15494: I2 ^reward 1)
  28989. <=WM: (15493: I2 ^see 1)
  28990. =>WM: (15510: I2 ^level-1 L1-root)
  28991. <=WM: (15496: I2 ^level-1 L1-root)
  28992. --- END Input Phase ---
  28993. --- Proposal Phase ---
  28994. --- Inner Elaboration Phase, active level 1 (S1) ---
  28995. Firing elaborate*copy-see-to-output-link
  28996. -->
  28997. (I3 ^see 0 +)
  28998. Firing elaborate*reward*based*on*reward
  28999. -->
  29000. (R1104 ^value 1 +)
  29001. (R1 ^reward R1104 +)
  29002. Firing propose*predict-yes
  29003. -->
  29004. (O2201 ^name predict-yes +)
  29005. (S1 ^operator O2201 +)
  29006. Firing propose*predict-no
  29007. -->
  29008. (O2202 ^name predict-no +)
  29009. (S1 ^operator O2202 +)
  29010. Firing rl*prefer*rvt*predict-no*H0*2
  29011. -->
  29012. (S1 ^operator O2200 = 1.)
  29013. Firing rl*prefer*rvt*predict-yes*H0*1
  29014. -->
  29015. (S1 ^operator O2199 = 0.)
  29016. Firing prefer*rvt*predict-yes*H0
  29017. -->
  29018. Firing prefer*rvt*predict-no*H0
  29019. -->
  29020. Firing elaborate*copy-dir-to-output-link
  29021. -->
  29022. (I3 ^dir U +)
  29023. inner elaboration loop at bottom goal.
  29024. Retracting elaborate*copy-see-to-output-link
  29025. -->
  29026. (I3 ^see 1 +)
  29027. Retracting propose*predict-no
  29028. -->
  29029. (O2200 ^name predict-no +)
  29030. (S1 ^operator O2200 +)
  29031. Retracting propose*predict-yes
  29032. -->
  29033. (O2199 ^name predict-yes +)
  29034. (S1 ^operator O2199 +)
  29035. Retracting elaborate*reward*based*on*reward
  29036. -->
  29037. (R1103 ^value 1 +)
  29038. (R1 ^reward R1103 +)
  29039. Retracting elaborate*copy-dir-to-output-link
  29040. -->
  29041. (I3 ^dir U +)
  29042. Retracting rl*prefer*rvt*predict-no*H0*2
  29043. -->
  29044. (S1 ^operator O2200 = 1.)
  29045. Retracting rl*prefer*rvt*predict-yes*H0*1
  29046. -->
  29047. (S1 ^operator O2199 = 0.)
  29048. =>WM: (15517: S1 ^operator O2202 +)
  29049. =>WM: (15516: S1 ^operator O2201 +)
  29050. =>WM: (15515: O2202 ^name predict-no)
  29051. =>WM: (15514: O2201 ^name predict-yes)
  29052. =>WM: (15513: R1104 ^value 1)
  29053. =>WM: (15512: R1 ^reward R1104)
  29054. =>WM: (15511: I3 ^see 0)
  29055. <=WM: (15502: S1 ^operator O2199 +)
  29056. <=WM: (15503: S1 ^operator O2200 +)
  29057. <=WM: (15504: S1 ^operator O2200)
  29058. <=WM: (15497: R1 ^reward R1103)
  29059. <=WM: (15482: I3 ^see 1)
  29060. <=WM: (15500: O2200 ^name predict-no)
  29061. <=WM: (15499: O2199 ^name predict-yes)
  29062. <=WM: (15498: R1103 ^value 1)
  29063. --- Inner Elaboration Phase, active level 1 (S1) ---
  29064. Firing prefer*rvt*predict-yes*H0
  29065. -->
  29066. Firing rl*prefer*rvt*predict-yes*H0*1
  29067. -->
  29068. (S1 ^operator O2201 = 0.)
  29069. Firing prefer*rvt*predict-no*H0
  29070. -->
  29071. Firing rl*prefer*rvt*predict-no*H0*2
  29072. -->
  29073. (S1 ^operator O2202 = 1.)
  29074. inner elaboration loop at bottom goal.
  29075. Retracting rl*prefer*rvt*predict-no*H0*2
  29076. -->
  29077. (S1 ^operator O2200 = 1.)
  29078. Retracting rl*prefer*rvt*predict-yes*H0*1
  29079. -->
  29080. (S1 ^operator O2199 = 0.)
  29081. --- END Proposal Phase ---
  29082. --- Decision Phase ---
  29083. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29084. =>WM: (15518: S1 ^operator O2202)
  29085. 1101: O: O2202 (predict-no)
  29086. --- END Decision Phase ---
  29087. --- Application Phase ---
  29088. --- Firing Productions (PE) For State At Depth 1 ---
  29089. --- Inner Elaboration Phase, active level 1 (S1) ---
  29090. Firing apply*operator
  29091. -->
  29092. (I3 ^predict-no N1101 + :O )
  29093. Firing apply*operator*complete
  29094. -->
  29095. (I3 ^predict-no N1100 - :O )
  29096. inner elaboration loop at bottom goal.
  29097. --- Change Working Memory (PE) ---
  29098. =>WM: (15519: I3 ^predict-no N1101)
  29099. <=WM: (15506: N1100 ^status complete)
  29100. <=WM: (15505: I3 ^predict-no N1100)
  29101. --- Firing Productions (IE) For State At Depth 1 ---
  29102. --- Inner Elaboration Phase, active level 1 (S1) ---
  29103. Firing monitor*world
  29104. -->
  29105. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29106. --- Change Working Memory (IE) ---
  29107. --- END Application Phase ---
  29108. --- Output Phase ---
  29109. ENV: Agent did: predict-no for direction U in state State-A
  29110. In State-A moving U
  29111. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29112. predict error 0
  29113. dir: dir isU
  29114. --- END Output Phase ---
  29115. \--- Input Phase ---
  29116. =>WM: (15523: I2 ^dir U)
  29117. =>WM: (15522: I2 ^reward 1)
  29118. =>WM: (15521: I2 ^see 0)
  29119. =>WM: (15520: N1101 ^status complete)
  29120. <=WM: (15509: I2 ^dir U)
  29121. <=WM: (15508: I2 ^reward 1)
  29122. <=WM: (15507: I2 ^see 0)
  29123. =>WM: (15524: I2 ^level-1 L1-root)
  29124. <=WM: (15510: I2 ^level-1 L1-root)
  29125. --- END Input Phase ---
  29126. --- Proposal Phase ---
  29127. --- Inner Elaboration Phase, active level 1 (S1) ---
  29128. Firing elaborate*copy-see-to-output-link
  29129. -->
  29130. (I3 ^see 0 +)
  29131. Firing elaborate*reward*based*on*reward
  29132. -->
  29133. (R1105 ^value 1 +)
  29134. (R1 ^reward R1105 +)
  29135. Firing propose*predict-yes
  29136. -->
  29137. (O2203 ^name predict-yes +)
  29138. (S1 ^operator O2203 +)
  29139. Firing propose*predict-no
  29140. -->
  29141. (O2204 ^name predict-no +)
  29142. (S1 ^operator O2204 +)
  29143. Firing rl*prefer*rvt*predict-no*H0*2
  29144. -->
  29145. (S1 ^operator O2202 = 1.)
  29146. Firing rl*prefer*rvt*predict-yes*H0*1
  29147. -->
  29148. (S1 ^operator O2201 = 0.)
  29149. Firing prefer*rvt*predict-yes*H0
  29150. -->
  29151. Firing prefer*rvt*predict-no*H0
  29152. -->
  29153. Firing elaborate*copy-dir-to-output-link
  29154. -->
  29155. (I3 ^dir U +)
  29156. inner elaboration loop at bottom goal.
  29157. Retracting elaborate*copy-see-to-output-link
  29158. -->
  29159. (I3 ^see 0 +)
  29160. Retracting propose*predict-no
  29161. -->
  29162. (O2202 ^name predict-no +)
  29163. (S1 ^operator O2202 +)
  29164. Retracting propose*predict-yes
  29165. -->
  29166. (O2201 ^name predict-yes +)
  29167. (S1 ^operator O2201 +)
  29168. Retracting elaborate*reward*based*on*reward
  29169. -->
  29170. (R1104 ^value 1 +)
  29171. (R1 ^reward R1104 +)
  29172. Retracting elaborate*copy-dir-to-output-link
  29173. -->
  29174. (I3 ^dir U +)
  29175. Retracting rl*prefer*rvt*predict-no*H0*2
  29176. -->
  29177. (S1 ^operator O2202 = 1.)
  29178. Retracting rl*prefer*rvt*predict-yes*H0*1
  29179. -->
  29180. (S1 ^operator O2201 = 0.)
  29181. =>WM: (15530: S1 ^operator O2204 +)
  29182. =>WM: (15529: S1 ^operator O2203 +)
  29183. =>WM: (15528: O2204 ^name predict-no)
  29184. =>WM: (15527: O2203 ^name predict-yes)
  29185. =>WM: (15526: R1105 ^value 1)
  29186. =>WM: (15525: R1 ^reward R1105)
  29187. <=WM: (15516: S1 ^operator O2201 +)
  29188. <=WM: (15517: S1 ^operator O2202 +)
  29189. <=WM: (15518: S1 ^operator O2202)
  29190. <=WM: (15512: R1 ^reward R1104)
  29191. <=WM: (15515: O2202 ^name predict-no)
  29192. <=WM: (15514: O2201 ^name predict-yes)
  29193. <=WM: (15513: R1104 ^value 1)
  29194. --- Inner Elaboration Phase, active level 1 (S1) ---
  29195. Firing prefer*rvt*predict-yes*H0
  29196. -->
  29197. Firing rl*prefer*rvt*predict-yes*H0*1
  29198. -->
  29199. (S1 ^operator O2203 = 0.)
  29200. Firing prefer*rvt*predict-no*H0
  29201. -->
  29202. Firing rl*prefer*rvt*predict-no*H0*2
  29203. -->
  29204. (S1 ^operator O2204 = 1.)
  29205. inner elaboration loop at bottom goal.
  29206. Retracting rl*prefer*rvt*predict-no*H0*2
  29207. -->
  29208. (S1 ^operator O2202 = 1.)
  29209. Retracting rl*prefer*rvt*predict-yes*H0*1
  29210. -->
  29211. (S1 ^operator O2201 = 0.)
  29212. --- END Proposal Phase ---
  29213. --- Decision Phase ---
  29214. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29215. =>WM: (15531: S1 ^operator O2204)
  29216. 1102: O: O2204 (predict-no)
  29217. --- END Decision Phase ---
  29218. --- Application Phase ---
  29219. --- Firing Productions (PE) For State At Depth 1 ---
  29220. --- Inner Elaboration Phase, active level 1 (S1) ---
  29221. Firing apply*operator
  29222. -->
  29223. (I3 ^predict-no N1102 + :O )
  29224. Firing apply*operator*complete
  29225. -->
  29226. (I3 ^predict-no N1101 - :O )
  29227. inner elaboration loop at bottom goal.
  29228. --- Change Working Memory (PE) ---
  29229. =>WM: (15532: I3 ^predict-no N1102)
  29230. <=WM: (15520: N1101 ^status complete)
  29231. <=WM: (15519: I3 ^predict-no N1101)
  29232. --- Firing Productions (IE) For State At Depth 1 ---
  29233. --- Inner Elaboration Phase, active level 1 (S1) ---
  29234. Firing monitor*world
  29235. -->
  29236. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29237. --- Change Working Memory (IE) ---
  29238. --- END Application Phase ---
  29239. --- Output Phase ---
  29240. ENV: Agent did: predict-no for direction U in state State-A
  29241. In State-A moving U
  29242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29243. predict error 0
  29244. dir: dir isL
  29245. --- END Output Phase ---
  29246. -/|--- Input Phase ---
  29247. =>WM: (15536: I2 ^dir L)
  29248. =>WM: (15535: I2 ^reward 1)
  29249. =>WM: (15534: I2 ^see 0)
  29250. =>WM: (15533: N1102 ^status complete)
  29251. <=WM: (15523: I2 ^dir U)
  29252. <=WM: (15522: I2 ^reward 1)
  29253. <=WM: (15521: I2 ^see 0)
  29254. =>WM: (15537: I2 ^level-1 L1-root)
  29255. <=WM: (15524: I2 ^level-1 L1-root)
  29256. --- END Input Phase ---
  29257. --- Proposal Phase ---
  29258. --- Inner Elaboration Phase, active level 1 (S1) ---
  29259. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  29260. -->
  29261. (S1 ^operator O2203 = -0.181727099742844)
  29262. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29263. -->
  29264. Firing elaborate*copy-see-to-output-link
  29265. -->
  29266. (I3 ^see 0 +)
  29267. Firing elaborate*reward*based*on*reward
  29268. -->
  29269. (R1106 ^value 1 +)
  29270. (R1 ^reward R1106 +)
  29271. Firing propose*predict-yes
  29272. -->
  29273. (O2205 ^name predict-yes +)
  29274. (S1 ^operator O2205 +)
  29275. Firing propose*predict-no
  29276. -->
  29277. (O2206 ^name predict-no +)
  29278. (S1 ^operator O2206 +)
  29279. Firing rl*prefer*rvt*predict-no*H0*6
  29280. -->
  29281. (S1 ^operator O2204 = 0.9866522659768354)
  29282. Firing rl*prefer*rvt*predict-yes*H0*5
  29283. -->
  29284. (S1 ^operator O2203 = 0.2639970902976322)
  29285. Firing prefer*rvt*predict-yes*H0
  29286. -->
  29287. Firing prefer*rvt*predict-no*H0
  29288. -->
  29289. Firing elaborate*copy-dir-to-output-link
  29290. -->
  29291. (I3 ^dir L +)
  29292. inner elaboration loop at bottom goal.
  29293. Retracting elaborate*copy-see-to-output-link
  29294. -->
  29295. (I3 ^see 0 +)
  29296. Retracting propose*predict-no
  29297. -->
  29298. (O2204 ^name predict-no +)
  29299. (S1 ^operator O2204 +)
  29300. Retracting propose*predict-yes
  29301. -->
  29302. (O2203 ^name predict-yes +)
  29303. (S1 ^operator O2203 +)
  29304. Retracting elaborate*reward*based*on*reward
  29305. -->
  29306. (R1105 ^value 1 +)
  29307. (R1 ^reward R1105 +)
  29308. Retracting elaborate*copy-dir-to-output-link
  29309. -->
  29310. (I3 ^dir U +)
  29311. Retracting rl*prefer*rvt*predict-no*H0*2
  29312. -->
  29313. (S1 ^operator O2204 = 1.)
  29314. Retracting rl*prefer*rvt*predict-yes*H0*1
  29315. -->
  29316. (S1 ^operator O2203 = 0.)
  29317. =>WM: (15544: S1 ^operator O2206 +)
  29318. =>WM: (15543: S1 ^operator O2205 +)
  29319. =>WM: (15542: I3 ^dir L)
  29320. =>WM: (15541: O2206 ^name predict-no)
  29321. =>WM: (15540: O2205 ^name predict-yes)
  29322. =>WM: (15539: R1106 ^value 1)
  29323. =>WM: (15538: R1 ^reward R1106)
  29324. <=WM: (15529: S1 ^operator O2203 +)
  29325. <=WM: (15530: S1 ^operator O2204 +)
  29326. <=WM: (15531: S1 ^operator O2204)
  29327. <=WM: (15501: I3 ^dir U)
  29328. <=WM: (15525: R1 ^reward R1105)
  29329. <=WM: (15528: O2204 ^name predict-no)
  29330. <=WM: (15527: O2203 ^name predict-yes)
  29331. <=WM: (15526: R1105 ^value 1)
  29332. --- Inner Elaboration Phase, active level 1 (S1) ---
  29333. Firing prefer*rvt*predict-yes*H0
  29334. -->
  29335. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  29336. -->
  29337. (S1 ^operator O2205 = -0.181727099742844)
  29338. Firing rl*prefer*rvt*predict-yes*H0*5
  29339. -->
  29340. (S1 ^operator O2205 = 0.2639970902976322)
  29341. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29342. -->
  29343. Firing prefer*rvt*predict-no*H0
  29344. -->
  29345. Firing rl*prefer*rvt*predict-no*H0*6
  29346. -->
  29347. (S1 ^operator O2206 = 0.9866522659768354)
  29348. inner elaboration loop at bottom goal.
  29349. Retracting rl*prefer*rvt*predict-no*H0*6
  29350. -->
  29351. (S1 ^operator O2204 = 0.9866522659768354)
  29352. Retracting rl*prefer*rvt*predict-yes*H0*5
  29353. -->
  29354. (S1 ^operator O2203 = 0.2639970902976322)
  29355. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  29356. -->
  29357. (S1 ^operator O2203 = -0.181727099742844)
  29358. --- END Proposal Phase ---
  29359. --- Decision Phase ---
  29360. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29361. =>WM: (15545: S1 ^operator O2206)
  29362. 1103: O: O2206 (predict-no)
  29363. --- END Decision Phase ---
  29364. --- Application Phase ---
  29365. --- Firing Productions (PE) For State At Depth 1 ---
  29366. --- Inner Elaboration Phase, active level 1 (S1) ---
  29367. Firing apply*operator
  29368. -->
  29369. (I3 ^predict-no N1103 + :O )
  29370. Firing apply*operator*complete
  29371. -->
  29372. (I3 ^predict-no N1102 - :O )
  29373. inner elaboration loop at bottom goal.
  29374. --- Change Working Memory (PE) ---
  29375. =>WM: (15546: I3 ^predict-no N1103)
  29376. <=WM: (15533: N1102 ^status complete)
  29377. <=WM: (15532: I3 ^predict-no N1102)
  29378. --- Firing Productions (IE) For State At Depth 1 ---
  29379. --- Inner Elaboration Phase, active level 1 (S1) ---
  29380. Firing monitor*world
  29381. -->
  29382. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29383. --- Change Working Memory (IE) ---
  29384. --- END Application Phase ---
  29385. --- Output Phase ---
  29386. ENV: Agent did: predict-no for direction L in state State-A
  29387. In State-A moving L
  29388. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29389. predict error 0
  29390. dir: dir isU
  29391. --- END Output Phase ---
  29392. \-/--- Input Phase ---
  29393. =>WM: (15550: I2 ^dir U)
  29394. =>WM: (15549: I2 ^reward 1)
  29395. =>WM: (15548: I2 ^see 0)
  29396. =>WM: (15547: N1103 ^status complete)
  29397. <=WM: (15536: I2 ^dir L)
  29398. <=WM: (15535: I2 ^reward 1)
  29399. <=WM: (15534: I2 ^see 0)
  29400. =>WM: (15551: I2 ^level-1 L0-root)
  29401. <=WM: (15537: I2 ^level-1 L1-root)
  29402. --- END Input Phase ---
  29403. --- Proposal Phase ---
  29404. --- Inner Elaboration Phase, active level 1 (S1) ---
  29405. Firing elaborate*copy-see-to-output-link
  29406. -->
  29407. (I3 ^see 0 +)
  29408. Firing elaborate*reward*based*on*reward
  29409. -->
  29410. (R1107 ^value 1 +)
  29411. (R1 ^reward R1107 +)
  29412. Firing propose*predict-yes
  29413. -->
  29414. (O2207 ^name predict-yes +)
  29415. (S1 ^operator O2207 +)
  29416. Firing propose*predict-no
  29417. -->
  29418. (O2208 ^name predict-no +)
  29419. (S1 ^operator O2208 +)
  29420. Firing rl*prefer*rvt*predict-no*H0*2
  29421. -->
  29422. (S1 ^operator O2206 = 1.)
  29423. Firing rl*prefer*rvt*predict-yes*H0*1
  29424. -->
  29425. (S1 ^operator O2205 = 0.)
  29426. Firing prefer*rvt*predict-yes*H0
  29427. -->
  29428. Firing prefer*rvt*predict-no*H0
  29429. -->
  29430. Firing elaborate*copy-dir-to-output-link
  29431. -->
  29432. (I3 ^dir U +)
  29433. inner elaboration loop at bottom goal.
  29434. Retracting elaborate*copy-see-to-output-link
  29435. -->
  29436. (I3 ^see 0 +)
  29437. Retracting propose*predict-no
  29438. -->
  29439. (O2206 ^name predict-no +)
  29440. (S1 ^operator O2206 +)
  29441. Retracting propose*predict-yes
  29442. -->
  29443. (O2205 ^name predict-yes +)
  29444. (S1 ^operator O2205 +)
  29445. Retracting elaborate*reward*based*on*reward
  29446. -->
  29447. (R1106 ^value 1 +)
  29448. (R1 ^reward R1106 +)
  29449. Retracting elaborate*copy-dir-to-output-link
  29450. -->
  29451. (I3 ^dir L +)
  29452. Retracting rl*prefer*rvt*predict-no*H0*6
  29453. -->
  29454. (S1 ^operator O2206 = 0.9866522659768354)
  29455. Retracting rl*prefer*rvt*predict-yes*H0*5
  29456. -->
  29457. (S1 ^operator O2205 = 0.2639970902976322)
  29458. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  29459. -->
  29460. (S1 ^operator O2205 = -0.181727099742844)
  29461. =>WM: (15558: S1 ^operator O2208 +)
  29462. =>WM: (15557: S1 ^operator O2207 +)
  29463. =>WM: (15556: I3 ^dir U)
  29464. =>WM: (15555: O2208 ^name predict-no)
  29465. =>WM: (15554: O2207 ^name predict-yes)
  29466. =>WM: (15553: R1107 ^value 1)
  29467. =>WM: (15552: R1 ^reward R1107)
  29468. <=WM: (15543: S1 ^operator O2205 +)
  29469. <=WM: (15544: S1 ^operator O2206 +)
  29470. <=WM: (15545: S1 ^operator O2206)
  29471. <=WM: (15542: I3 ^dir L)
  29472. <=WM: (15538: R1 ^reward R1106)
  29473. <=WM: (15541: O2206 ^name predict-no)
  29474. <=WM: (15540: O2205 ^name predict-yes)
  29475. <=WM: (15539: R1106 ^value 1)
  29476. --- Inner Elaboration Phase, active level 1 (S1) ---
  29477. Firing prefer*rvt*predict-yes*H0
  29478. -->
  29479. Firing rl*prefer*rvt*predict-yes*H0*1
  29480. -->
  29481. (S1 ^operator O2207 = 0.)
  29482. Firing prefer*rvt*predict-no*H0
  29483. -->
  29484. Firing rl*prefer*rvt*predict-no*H0*2
  29485. -->
  29486. (S1 ^operator O2208 = 1.)
  29487. inner elaboration loop at bottom goal.
  29488. Retracting rl*prefer*rvt*predict-no*H0*2
  29489. -->
  29490. (S1 ^operator O2206 = 1.)
  29491. Retracting rl*prefer*rvt*predict-yes*H0*1
  29492. -->
  29493. (S1 ^operator O2205 = 0.)
  29494. --- END Proposal Phase ---
  29495. --- Decision Phase ---
  29496. RL update rl*prefer*rvt*predict-no*H0*6 0.986652 0 0.986652 -> 0.988836 0 0.988836(R,m,v=1,0.909639,0.0826944)
  29497. =>WM: (15559: S1 ^operator O2208)
  29498. 1104: O: O2208 (predict-no)
  29499. --- END Decision Phase ---
  29500. --- Application Phase ---
  29501. --- Firing Productions (PE) For State At Depth 1 ---
  29502. --- Inner Elaboration Phase, active level 1 (S1) ---
  29503. Firing apply*operator
  29504. -->
  29505. (I3 ^predict-no N1104 + :O )
  29506. Firing apply*operator*complete
  29507. -->
  29508. (I3 ^predict-no N1103 - :O )
  29509. inner elaboration loop at bottom goal.
  29510. --- Change Working Memory (PE) ---
  29511. =>WM: (15560: I3 ^predict-no N1104)
  29512. <=WM: (15547: N1103 ^status complete)
  29513. <=WM: (15546: I3 ^predict-no N1103)
  29514. --- Firing Productions (IE) For State At Depth 1 ---
  29515. --- Inner Elaboration Phase, active level 1 (S1) ---
  29516. Firing monitor*world
  29517. -->
  29518. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29519. --- Change Working Memory (IE) ---
  29520. --- END Application Phase ---
  29521. --- Output Phase ---
  29522. ENV: Agent did: predict-no for direction U in state State-A
  29523. In State-A moving U
  29524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29525. predict error 0
  29526. dir: dir isU
  29527. --- END Output Phase ---
  29528. |\---- Input Phase ---
  29529. =>WM: (15564: I2 ^dir U)
  29530. =>WM: (15563: I2 ^reward 1)
  29531. =>WM: (15562: I2 ^see 0)
  29532. =>WM: (15561: N1104 ^status complete)
  29533. <=WM: (15550: I2 ^dir U)
  29534. <=WM: (15549: I2 ^reward 1)
  29535. <=WM: (15548: I2 ^see 0)
  29536. =>WM: (15565: I2 ^level-1 L0-root)
  29537. <=WM: (15551: I2 ^level-1 L0-root)
  29538. --- END Input Phase ---
  29539. --- Proposal Phase ---
  29540. --- Inner Elaboration Phase, active level 1 (S1) ---
  29541. Firing elaborate*copy-see-to-output-link
  29542. -->
  29543. (I3 ^see 0 +)
  29544. Firing elaborate*reward*based*on*reward
  29545. -->
  29546. (R1108 ^value 1 +)
  29547. (R1 ^reward R1108 +)
  29548. Firing propose*predict-yes
  29549. -->
  29550. (O2209 ^name predict-yes +)
  29551. (S1 ^operator O2209 +)
  29552. Firing propose*predict-no
  29553. -->
  29554. (O2210 ^name predict-no +)
  29555. (S1 ^operator O2210 +)
  29556. Firing rl*prefer*rvt*predict-no*H0*2
  29557. -->
  29558. (S1 ^operator O2208 = 1.)
  29559. Firing rl*prefer*rvt*predict-yes*H0*1
  29560. -->
  29561. (S1 ^operator O2207 = 0.)
  29562. Firing prefer*rvt*predict-yes*H0
  29563. -->
  29564. Firing prefer*rvt*predict-no*H0
  29565. -->
  29566. Firing elaborate*copy-dir-to-output-link
  29567. -->
  29568. (I3 ^dir U +)
  29569. inner elaboration loop at bottom goal.
  29570. Retracting elaborate*copy-see-to-output-link
  29571. -->
  29572. (I3 ^see 0 +)
  29573. Retracting propose*predict-no
  29574. -->
  29575. (O2208 ^name predict-no +)
  29576. (S1 ^operator O2208 +)
  29577. Retracting propose*predict-yes
  29578. -->
  29579. (O2207 ^name predict-yes +)
  29580. (S1 ^operator O2207 +)
  29581. Retracting elaborate*reward*based*on*reward
  29582. -->
  29583. (R1107 ^value 1 +)
  29584. (R1 ^reward R1107 +)
  29585. Retracting elaborate*copy-dir-to-output-link
  29586. -->
  29587. (I3 ^dir U +)
  29588. Retracting rl*prefer*rvt*predict-no*H0*2
  29589. -->
  29590. (S1 ^operator O2208 = 1.)
  29591. Retracting rl*prefer*rvt*predict-yes*H0*1
  29592. -->
  29593. (S1 ^operator O2207 = 0.)
  29594. =>WM: (15571: S1 ^operator O2210 +)
  29595. =>WM: (15570: S1 ^operator O2209 +)
  29596. =>WM: (15569: O2210 ^name predict-no)
  29597. =>WM: (15568: O2209 ^name predict-yes)
  29598. =>WM: (15567: R1108 ^value 1)
  29599. =>WM: (15566: R1 ^reward R1108)
  29600. <=WM: (15557: S1 ^operator O2207 +)
  29601. <=WM: (15558: S1 ^operator O2208 +)
  29602. <=WM: (15559: S1 ^operator O2208)
  29603. <=WM: (15552: R1 ^reward R1107)
  29604. <=WM: (15555: O2208 ^name predict-no)
  29605. <=WM: (15554: O2207 ^name predict-yes)
  29606. <=WM: (15553: R1107 ^value 1)
  29607. --- Inner Elaboration Phase, active level 1 (S1) ---
  29608. Firing prefer*rvt*predict-yes*H0
  29609. -->
  29610. Firing rl*prefer*rvt*predict-yes*H0*1
  29611. -->
  29612. (S1 ^operator O2209 = 0.)
  29613. Firing prefer*rvt*predict-no*H0
  29614. -->
  29615. Firing rl*prefer*rvt*predict-no*H0*2
  29616. -->
  29617. (S1 ^operator O2210 = 1.)
  29618. inner elaboration loop at bottom goal.
  29619. Retracting rl*prefer*rvt*predict-no*H0*2
  29620. -->
  29621. (S1 ^operator O2208 = 1.)
  29622. Retracting rl*prefer*rvt*predict-yes*H0*1
  29623. -->
  29624. (S1 ^operator O2207 = 0.)
  29625. --- END Proposal Phase ---
  29626. --- Decision Phase ---
  29627. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29628. =>WM: (15572: S1 ^operator O2210)
  29629. 1105: O: O2210 (predict-no)
  29630. --- END Decision Phase ---
  29631. --- Application Phase ---
  29632. --- Firing Productions (PE) For State At Depth 1 ---
  29633. --- Inner Elaboration Phase, active level 1 (S1) ---
  29634. Firing apply*operator
  29635. -->
  29636. (I3 ^predict-no N1105 + :O )
  29637. Firing apply*operator*complete
  29638. -->
  29639. (I3 ^predict-no N1104 - :O )
  29640. inner elaboration loop at bottom goal.
  29641. --- Change Working Memory (PE) ---
  29642. =>WM: (15573: I3 ^predict-no N1105)
  29643. <=WM: (15561: N1104 ^status complete)
  29644. <=WM: (15560: I3 ^predict-no N1104)
  29645. --- Firing Productions (IE) For State At Depth 1 ---
  29646. --- Inner Elaboration Phase, active level 1 (S1) ---
  29647. Firing monitor*world
  29648. -->
  29649. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29650. --- Change Working Memory (IE) ---
  29651. --- END Application Phase ---
  29652. --- Output Phase ---
  29653. ENV: Agent did: predict-no for direction U in state State-A
  29654. In State-A moving U
  29655. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29656. predict error 0
  29657. dir: dir isU
  29658. --- END Output Phase ---
  29659. /|\--- Input Phase ---
  29660. =>WM: (15577: I2 ^dir U)
  29661. =>WM: (15576: I2 ^reward 1)
  29662. =>WM: (15575: I2 ^see 0)
  29663. =>WM: (15574: N1105 ^status complete)
  29664. <=WM: (15564: I2 ^dir U)
  29665. <=WM: (15563: I2 ^reward 1)
  29666. <=WM: (15562: I2 ^see 0)
  29667. =>WM: (15578: I2 ^level-1 L0-root)
  29668. <=WM: (15565: I2 ^level-1 L0-root)
  29669. --- END Input Phase ---
  29670. --- Proposal Phase ---
  29671. --- Inner Elaboration Phase, active level 1 (S1) ---
  29672. Firing elaborate*copy-see-to-output-link
  29673. -->
  29674. (I3 ^see 0 +)
  29675. Firing elaborate*reward*based*on*reward
  29676. -->
  29677. (R1109 ^value 1 +)
  29678. (R1 ^reward R1109 +)
  29679. Firing propose*predict-yes
  29680. -->
  29681. (O2211 ^name predict-yes +)
  29682. (S1 ^operator O2211 +)
  29683. Firing propose*predict-no
  29684. -->
  29685. (O2212 ^name predict-no +)
  29686. (S1 ^operator O2212 +)
  29687. Firing rl*prefer*rvt*predict-no*H0*2
  29688. -->
  29689. (S1 ^operator O2210 = 1.)
  29690. Firing rl*prefer*rvt*predict-yes*H0*1
  29691. -->
  29692. (S1 ^operator O2209 = 0.)
  29693. Firing prefer*rvt*predict-yes*H0
  29694. -->
  29695. Firing prefer*rvt*predict-no*H0
  29696. -->
  29697. Firing elaborate*copy-dir-to-output-link
  29698. -->
  29699. (I3 ^dir U +)
  29700. inner elaboration loop at bottom goal.
  29701. Retracting elaborate*copy-see-to-output-link
  29702. -->
  29703. (I3 ^see 0 +)
  29704. Retracting propose*predict-no
  29705. -->
  29706. (O2210 ^name predict-no +)
  29707. (S1 ^operator O2210 +)
  29708. Retracting propose*predict-yes
  29709. -->
  29710. (O2209 ^name predict-yes +)
  29711. (S1 ^operator O2209 +)
  29712. Retracting elaborate*reward*based*on*reward
  29713. -->
  29714. (R1108 ^value 1 +)
  29715. (R1 ^reward R1108 +)
  29716. Retracting elaborate*copy-dir-to-output-link
  29717. -->
  29718. (I3 ^dir U +)
  29719. Retracting rl*prefer*rvt*predict-no*H0*2
  29720. -->
  29721. (S1 ^operator O2210 = 1.)
  29722. Retracting rl*prefer*rvt*predict-yes*H0*1
  29723. -->
  29724. (S1 ^operator O2209 = 0.)
  29725. =>WM: (15584: S1 ^operator O2212 +)
  29726. =>WM: (15583: S1 ^operator O2211 +)
  29727. =>WM: (15582: O2212 ^name predict-no)
  29728. =>WM: (15581: O2211 ^name predict-yes)
  29729. =>WM: (15580: R1109 ^value 1)
  29730. =>WM: (15579: R1 ^reward R1109)
  29731. <=WM: (15570: S1 ^operator O2209 +)
  29732. <=WM: (15571: S1 ^operator O2210 +)
  29733. <=WM: (15572: S1 ^operator O2210)
  29734. <=WM: (15566: R1 ^reward R1108)
  29735. <=WM: (15569: O2210 ^name predict-no)
  29736. <=WM: (15568: O2209 ^name predict-yes)
  29737. <=WM: (15567: R1108 ^value 1)
  29738. --- Inner Elaboration Phase, active level 1 (S1) ---
  29739. Firing prefer*rvt*predict-yes*H0
  29740. -->
  29741. Firing rl*prefer*rvt*predict-yes*H0*1
  29742. -->
  29743. (S1 ^operator O2211 = 0.)
  29744. Firing prefer*rvt*predict-no*H0
  29745. -->
  29746. Firing rl*prefer*rvt*predict-no*H0*2
  29747. -->
  29748. (S1 ^operator O2212 = 1.)
  29749. inner elaboration loop at bottom goal.
  29750. Retracting rl*prefer*rvt*predict-no*H0*2
  29751. -->
  29752. (S1 ^operator O2210 = 1.)
  29753. Retracting rl*prefer*rvt*predict-yes*H0*1
  29754. -->
  29755. (S1 ^operator O2209 = 0.)
  29756. --- END Proposal Phase ---
  29757. --- Decision Phase ---
  29758. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29759. =>WM: (15585: S1 ^operator O2212)
  29760. 1106: O: O2212 (predict-no)
  29761. --- END Decision Phase ---
  29762. --- Application Phase ---
  29763. --- Firing Productions (PE) For State At Depth 1 ---
  29764. --- Inner Elaboration Phase, active level 1 (S1) ---
  29765. Firing apply*operator
  29766. -->
  29767. (I3 ^predict-no N1106 + :O )
  29768. Firing apply*operator*complete
  29769. -->
  29770. (I3 ^predict-no N1105 - :O )
  29771. inner elaboration loop at bottom goal.
  29772. --- Change Working Memory (PE) ---
  29773. =>WM: (15586: I3 ^predict-no N1106)
  29774. <=WM: (15574: N1105 ^status complete)
  29775. <=WM: (15573: I3 ^predict-no N1105)
  29776. --- Firing Productions (IE) For State At Depth 1 ---
  29777. --- Inner Elaboration Phase, active level 1 (S1) ---
  29778. Firing monitor*world
  29779. -->
  29780. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29781. --- Change Working Memory (IE) ---
  29782. --- END Application Phase ---
  29783. --- Output Phase ---
  29784. ENV: Agent did: predict-no for direction U in state State-A
  29785. In State-A moving U
  29786. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29787. predict error 0
  29788. dir: dir isL
  29789. --- END Output Phase ---
  29790. -/|--- Input Phase ---
  29791. =>WM: (15590: I2 ^dir L)
  29792. =>WM: (15589: I2 ^reward 1)
  29793. =>WM: (15588: I2 ^see 0)
  29794. =>WM: (15587: N1106 ^status complete)
  29795. <=WM: (15577: I2 ^dir U)
  29796. <=WM: (15576: I2 ^reward 1)
  29797. <=WM: (15575: I2 ^see 0)
  29798. =>WM: (15591: I2 ^level-1 L0-root)
  29799. <=WM: (15578: I2 ^level-1 L0-root)
  29800. --- END Input Phase ---
  29801. --- Proposal Phase ---
  29802. --- Inner Elaboration Phase, active level 1 (S1) ---
  29803. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  29804. -->
  29805. (S1 ^operator O2211 = -0.1386470047172653)
  29806. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29807. -->
  29808. Firing elaborate*copy-see-to-output-link
  29809. -->
  29810. (I3 ^see 0 +)
  29811. Firing elaborate*reward*based*on*reward
  29812. -->
  29813. (R1110 ^value 1 +)
  29814. (R1 ^reward R1110 +)
  29815. Firing propose*predict-yes
  29816. -->
  29817. (O2213 ^name predict-yes +)
  29818. (S1 ^operator O2213 +)
  29819. Firing propose*predict-no
  29820. -->
  29821. (O2214 ^name predict-no +)
  29822. (S1 ^operator O2214 +)
  29823. Firing rl*prefer*rvt*predict-no*H0*6
  29824. -->
  29825. (S1 ^operator O2212 = 0.9888361273465509)
  29826. Firing rl*prefer*rvt*predict-yes*H0*5
  29827. -->
  29828. (S1 ^operator O2211 = 0.2639970902976322)
  29829. Firing prefer*rvt*predict-yes*H0
  29830. -->
  29831. Firing prefer*rvt*predict-no*H0
  29832. -->
  29833. Firing elaborate*copy-dir-to-output-link
  29834. -->
  29835. (I3 ^dir L +)
  29836. inner elaboration loop at bottom goal.
  29837. Retracting elaborate*copy-see-to-output-link
  29838. -->
  29839. (I3 ^see 0 +)
  29840. Retracting propose*predict-no
  29841. -->
  29842. (O2212 ^name predict-no +)
  29843. (S1 ^operator O2212 +)
  29844. Retracting propose*predict-yes
  29845. -->
  29846. (O2211 ^name predict-yes +)
  29847. (S1 ^operator O2211 +)
  29848. Retracting elaborate*reward*based*on*reward
  29849. -->
  29850. (R1109 ^value 1 +)
  29851. (R1 ^reward R1109 +)
  29852. Retracting elaborate*copy-dir-to-output-link
  29853. -->
  29854. (I3 ^dir U +)
  29855. Retracting rl*prefer*rvt*predict-no*H0*2
  29856. -->
  29857. (S1 ^operator O2212 = 1.)
  29858. Retracting rl*prefer*rvt*predict-yes*H0*1
  29859. -->
  29860. (S1 ^operator O2211 = 0.)
  29861. =>WM: (15598: S1 ^operator O2214 +)
  29862. =>WM: (15597: S1 ^operator O2213 +)
  29863. =>WM: (15596: I3 ^dir L)
  29864. =>WM: (15595: O2214 ^name predict-no)
  29865. =>WM: (15594: O2213 ^name predict-yes)
  29866. =>WM: (15593: R1110 ^value 1)
  29867. =>WM: (15592: R1 ^reward R1110)
  29868. <=WM: (15583: S1 ^operator O2211 +)
  29869. <=WM: (15584: S1 ^operator O2212 +)
  29870. <=WM: (15585: S1 ^operator O2212)
  29871. <=WM: (15556: I3 ^dir U)
  29872. <=WM: (15579: R1 ^reward R1109)
  29873. <=WM: (15582: O2212 ^name predict-no)
  29874. <=WM: (15581: O2211 ^name predict-yes)
  29875. <=WM: (15580: R1109 ^value 1)
  29876. --- Inner Elaboration Phase, active level 1 (S1) ---
  29877. Firing prefer*rvt*predict-yes*H0
  29878. -->
  29879. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  29880. -->
  29881. (S1 ^operator O2213 = -0.1386470047172653)
  29882. Firing rl*prefer*rvt*predict-yes*H0*5
  29883. -->
  29884. (S1 ^operator O2213 = 0.2639970902976322)
  29885. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  29886. -->
  29887. Firing prefer*rvt*predict-no*H0
  29888. -->
  29889. Firing rl*prefer*rvt*predict-no*H0*6
  29890. -->
  29891. (S1 ^operator O2214 = 0.9888361273465509)
  29892. inner elaboration loop at bottom goal.
  29893. Retracting rl*prefer*rvt*predict-no*H0*6
  29894. -->
  29895. (S1 ^operator O2212 = 0.9888361273465509)
  29896. Retracting rl*prefer*rvt*predict-yes*H0*5
  29897. -->
  29898. (S1 ^operator O2211 = 0.2639970902976322)
  29899. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  29900. -->
  29901. (S1 ^operator O2211 = -0.1386470047172653)
  29902. --- END Proposal Phase ---
  29903. --- Decision Phase ---
  29904. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29905. =>WM: (15599: S1 ^operator O2214)
  29906. 1107: O: O2214 (predict-no)
  29907. --- END Decision Phase ---
  29908. --- Application Phase ---
  29909. --- Firing Productions (PE) For State At Depth 1 ---
  29910. --- Inner Elaboration Phase, active level 1 (S1) ---
  29911. Firing apply*operator
  29912. -->
  29913. (I3 ^predict-no N1107 + :O )
  29914. Firing apply*operator*complete
  29915. -->
  29916. (I3 ^predict-no N1106 - :O )
  29917. inner elaboration loop at bottom goal.
  29918. --- Change Working Memory (PE) ---
  29919. =>WM: (15600: I3 ^predict-no N1107)
  29920. <=WM: (15587: N1106 ^status complete)
  29921. <=WM: (15586: I3 ^predict-no N1106)
  29922. --- Firing Productions (IE) For State At Depth 1 ---
  29923. --- Inner Elaboration Phase, active level 1 (S1) ---
  29924. Firing monitor*world
  29925. -->
  29926. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29927. --- Change Working Memory (IE) ---
  29928. --- END Application Phase ---
  29929. --- Output Phase ---
  29930. ENV: Agent did: predict-no for direction L in state State-A
  29931. In State-A moving L
  29932. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29933. predict error 0
  29934. dir: dir isU
  29935. --- END Output Phase ---
  29936. \-/--- Input Phase ---
  29937. =>WM: (15604: I2 ^dir U)
  29938. =>WM: (15603: I2 ^reward 1)
  29939. =>WM: (15602: I2 ^see 0)
  29940. =>WM: (15601: N1107 ^status complete)
  29941. <=WM: (15590: I2 ^dir L)
  29942. <=WM: (15589: I2 ^reward 1)
  29943. <=WM: (15588: I2 ^see 0)
  29944. =>WM: (15605: I2 ^level-1 L0-root)
  29945. <=WM: (15591: I2 ^level-1 L0-root)
  29946. --- END Input Phase ---
  29947. --- Proposal Phase ---
  29948. --- Inner Elaboration Phase, active level 1 (S1) ---
  29949. Firing elaborate*copy-see-to-output-link
  29950. -->
  29951. (I3 ^see 0 +)
  29952. Firing elaborate*reward*based*on*reward
  29953. -->
  29954. (R1111 ^value 1 +)
  29955. (R1 ^reward R1111 +)
  29956. Firing propose*predict-yes
  29957. -->
  29958. (O2215 ^name predict-yes +)
  29959. (S1 ^operator O2215 +)
  29960. Firing propose*predict-no
  29961. -->
  29962. (O2216 ^name predict-no +)
  29963. (S1 ^operator O2216 +)
  29964. Firing rl*prefer*rvt*predict-no*H0*2
  29965. -->
  29966. (S1 ^operator O2214 = 1.)
  29967. Firing rl*prefer*rvt*predict-yes*H0*1
  29968. -->
  29969. (S1 ^operator O2213 = 0.)
  29970. Firing prefer*rvt*predict-yes*H0
  29971. -->
  29972. Firing prefer*rvt*predict-no*H0
  29973. -->
  29974. Firing elaborate*copy-dir-to-output-link
  29975. -->
  29976. (I3 ^dir U +)
  29977. inner elaboration loop at bottom goal.
  29978. Retracting elaborate*copy-see-to-output-link
  29979. -->
  29980. (I3 ^see 0 +)
  29981. Retracting propose*predict-no
  29982. -->
  29983. (O2214 ^name predict-no +)
  29984. (S1 ^operator O2214 +)
  29985. Retracting propose*predict-yes
  29986. -->
  29987. (O2213 ^name predict-yes +)
  29988. (S1 ^operator O2213 +)
  29989. Retracting elaborate*reward*based*on*reward
  29990. -->
  29991. (R1110 ^value 1 +)
  29992. (R1 ^reward R1110 +)
  29993. Retracting elaborate*copy-dir-to-output-link
  29994. -->
  29995. (I3 ^dir L +)
  29996. Retracting rl*prefer*rvt*predict-no*H0*6
  29997. -->
  29998. (S1 ^operator O2214 = 0.9888361273465509)
  29999. Retracting rl*prefer*rvt*predict-yes*H0*5
  30000. -->
  30001. (S1 ^operator O2213 = 0.2639970902976322)
  30002. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30003. -->
  30004. (S1 ^operator O2213 = -0.1386470047172653)
  30005. =>WM: (15612: S1 ^operator O2216 +)
  30006. =>WM: (15611: S1 ^operator O2215 +)
  30007. =>WM: (15610: I3 ^dir U)
  30008. =>WM: (15609: O2216 ^name predict-no)
  30009. =>WM: (15608: O2215 ^name predict-yes)
  30010. =>WM: (15607: R1111 ^value 1)
  30011. =>WM: (15606: R1 ^reward R1111)
  30012. <=WM: (15597: S1 ^operator O2213 +)
  30013. <=WM: (15598: S1 ^operator O2214 +)
  30014. <=WM: (15599: S1 ^operator O2214)
  30015. <=WM: (15596: I3 ^dir L)
  30016. <=WM: (15592: R1 ^reward R1110)
  30017. <=WM: (15595: O2214 ^name predict-no)
  30018. <=WM: (15594: O2213 ^name predict-yes)
  30019. <=WM: (15593: R1110 ^value 1)
  30020. --- Inner Elaboration Phase, active level 1 (S1) ---
  30021. Firing prefer*rvt*predict-yes*H0
  30022. -->
  30023. Firing rl*prefer*rvt*predict-yes*H0*1
  30024. -->
  30025. (S1 ^operator O2215 = 0.)
  30026. Firing prefer*rvt*predict-no*H0
  30027. -->
  30028. Firing rl*prefer*rvt*predict-no*H0*2
  30029. -->
  30030. (S1 ^operator O2216 = 1.)
  30031. inner elaboration loop at bottom goal.
  30032. Retracting rl*prefer*rvt*predict-no*H0*2
  30033. -->
  30034. (S1 ^operator O2214 = 1.)
  30035. Retracting rl*prefer*rvt*predict-yes*H0*1
  30036. -->
  30037. (S1 ^operator O2213 = 0.)
  30038. --- END Proposal Phase ---
  30039. --- Decision Phase ---
  30040. RL update rl*prefer*rvt*predict-no*H0*6 0.988836 0 0.988836 -> 0.990661 0 0.990661(R,m,v=1,0.91018,0.0822451)
  30041. =>WM: (15613: S1 ^operator O2216)
  30042. 1108: O: O2216 (predict-no)
  30043. --- END Decision Phase ---
  30044. --- Application Phase ---
  30045. --- Firing Productions (PE) For State At Depth 1 ---
  30046. --- Inner Elaboration Phase, active level 1 (S1) ---
  30047. Firing apply*operator
  30048. -->
  30049. (I3 ^predict-no N1108 + :O )
  30050. Firing apply*operator*complete
  30051. -->
  30052. (I3 ^predict-no N1107 - :O )
  30053. inner elaboration loop at bottom goal.
  30054. --- Change Working Memory (PE) ---
  30055. =>WM: (15614: I3 ^predict-no N1108)
  30056. <=WM: (15601: N1107 ^status complete)
  30057. <=WM: (15600: I3 ^predict-no N1107)
  30058. --- Firing Productions (IE) For State At Depth 1 ---
  30059. --- Inner Elaboration Phase, active level 1 (S1) ---
  30060. Firing monitor*world
  30061. -->
  30062. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30063. --- Change Working Memory (IE) ---
  30064. --- END Application Phase ---
  30065. --- Output Phase ---
  30066. ENV: Agent did: predict-no for direction U in state State-A
  30067. In State-A moving U
  30068. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30069. predict error 0
  30070. dir: dir isR
  30071. --- END Output Phase ---
  30072. |\---- Input Phase ---
  30073. =>WM: (15618: I2 ^dir R)
  30074. =>WM: (15617: I2 ^reward 1)
  30075. =>WM: (15616: I2 ^see 0)
  30076. =>WM: (15615: N1108 ^status complete)
  30077. <=WM: (15604: I2 ^dir U)
  30078. <=WM: (15603: I2 ^reward 1)
  30079. <=WM: (15602: I2 ^see 0)
  30080. =>WM: (15619: I2 ^level-1 L0-root)
  30081. <=WM: (15605: I2 ^level-1 L0-root)
  30082. --- END Input Phase ---
  30083. --- Proposal Phase ---
  30084. --- Inner Elaboration Phase, active level 1 (S1) ---
  30085. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  30086. -->
  30087. (S1 ^operator O2216 = -0.2817060109291377)
  30088. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  30089. -->
  30090. (S1 ^operator O2215 = 0.6623145353178812)
  30091. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30092. -->
  30093. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30094. -->
  30095. Firing elaborate*copy-see-to-output-link
  30096. -->
  30097. (I3 ^see 0 +)
  30098. Firing elaborate*reward*based*on*reward
  30099. -->
  30100. (R1112 ^value 1 +)
  30101. (R1 ^reward R1112 +)
  30102. Firing propose*predict-yes
  30103. -->
  30104. (O2217 ^name predict-yes +)
  30105. (S1 ^operator O2217 +)
  30106. Firing propose*predict-no
  30107. -->
  30108. (O2218 ^name predict-no +)
  30109. (S1 ^operator O2218 +)
  30110. Firing rl*prefer*rvt*predict-no*H0*4
  30111. -->
  30112. (S1 ^operator O2216 = 0.3397874256976259)
  30113. Firing rl*prefer*rvt*predict-yes*H0*3
  30114. -->
  30115. (S1 ^operator O2215 = 0.3377081968199763)
  30116. Firing prefer*rvt*predict-yes*H0
  30117. -->
  30118. Firing prefer*rvt*predict-no*H0
  30119. -->
  30120. Firing elaborate*copy-dir-to-output-link
  30121. -->
  30122. (I3 ^dir R +)
  30123. inner elaboration loop at bottom goal.
  30124. Retracting elaborate*copy-see-to-output-link
  30125. -->
  30126. (I3 ^see 0 +)
  30127. Retracting propose*predict-no
  30128. -->
  30129. (O2216 ^name predict-no +)
  30130. (S1 ^operator O2216 +)
  30131. Retracting propose*predict-yes
  30132. -->
  30133. (O2215 ^name predict-yes +)
  30134. (S1 ^operator O2215 +)
  30135. Retracting elaborate*reward*based*on*reward
  30136. -->
  30137. (R1111 ^value 1 +)
  30138. (R1 ^reward R1111 +)
  30139. Retracting elaborate*copy-dir-to-output-link
  30140. -->
  30141. (I3 ^dir U +)
  30142. Retracting rl*prefer*rvt*predict-no*H0*2
  30143. -->
  30144. (S1 ^operator O2216 = 1.)
  30145. Retracting rl*prefer*rvt*predict-yes*H0*1
  30146. -->
  30147. (S1 ^operator O2215 = 0.)
  30148. =>WM: (15626: S1 ^operator O2218 +)
  30149. =>WM: (15625: S1 ^operator O2217 +)
  30150. =>WM: (15624: I3 ^dir R)
  30151. =>WM: (15623: O2218 ^name predict-no)
  30152. =>WM: (15622: O2217 ^name predict-yes)
  30153. =>WM: (15621: R1112 ^value 1)
  30154. =>WM: (15620: R1 ^reward R1112)
  30155. <=WM: (15611: S1 ^operator O2215 +)
  30156. <=WM: (15612: S1 ^operator O2216 +)
  30157. <=WM: (15613: S1 ^operator O2216)
  30158. <=WM: (15610: I3 ^dir U)
  30159. <=WM: (15606: R1 ^reward R1111)
  30160. <=WM: (15609: O2216 ^name predict-no)
  30161. <=WM: (15608: O2215 ^name predict-yes)
  30162. <=WM: (15607: R1111 ^value 1)
  30163. --- Inner Elaboration Phase, active level 1 (S1) ---
  30164. Firing prefer*rvt*predict-yes*H0
  30165. -->
  30166. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  30167. -->
  30168. (S1 ^operator O2217 = 0.6623145353178812)
  30169. Firing rl*prefer*rvt*predict-yes*H0*3
  30170. -->
  30171. (S1 ^operator O2217 = 0.3377081968199763)
  30172. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30173. -->
  30174. Firing prefer*rvt*predict-no*H0
  30175. -->
  30176. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  30177. -->
  30178. (S1 ^operator O2218 = -0.2817060109291377)
  30179. Firing rl*prefer*rvt*predict-no*H0*4
  30180. -->
  30181. (S1 ^operator O2218 = 0.3397874256976259)
  30182. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30183. -->
  30184. inner elaboration loop at bottom goal.
  30185. Retracting rl*prefer*rvt*predict-no*H0*4
  30186. -->
  30187. (S1 ^operator O2216 = 0.3397874256976259)
  30188. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  30189. -->
  30190. (S1 ^operator O2216 = -0.2817060109291377)
  30191. Retracting rl*prefer*rvt*predict-yes*H0*3
  30192. -->
  30193. (S1 ^operator O2215 = 0.3377081968199763)
  30194. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  30195. -->
  30196. (S1 ^operator O2215 = 0.6623145353178812)
  30197. --- END Proposal Phase ---
  30198. --- Decision Phase ---
  30199. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30200. =>WM: (15627: S1 ^operator O2217)
  30201. 1109: O: O2217 (predict-yes)
  30202. --- END Decision Phase ---
  30203. --- Application Phase ---
  30204. --- Firing Productions (PE) For State At Depth 1 ---
  30205. --- Inner Elaboration Phase, active level 1 (S1) ---
  30206. Firing apply*operator
  30207. -->
  30208. (I3 ^predict-yes N1109 + :O )
  30209. Firing apply*operator*complete
  30210. -->
  30211. (I3 ^predict-no N1108 - :O )
  30212. inner elaboration loop at bottom goal.
  30213. --- Change Working Memory (PE) ---
  30214. =>WM: (15628: I3 ^predict-yes N1109)
  30215. <=WM: (15615: N1108 ^status complete)
  30216. <=WM: (15614: I3 ^predict-no N1108)
  30217. --- Firing Productions (IE) For State At Depth 1 ---
  30218. --- Inner Elaboration Phase, active level 1 (S1) ---
  30219. Firing monitor*world
  30220. -->
  30221. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30222. --- Change Working Memory (IE) ---
  30223. --- END Application Phase ---
  30224. --- Output Phase ---
  30225. ENV: Agent did: predict-yes for direction R in state State-A
  30226. In State-A moving R
  30227. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30228. predict error 0
  30229. dir: dir isR
  30230. --- END Output Phase ---
  30231. /|\--- Input Phase ---
  30232. =>WM: (15632: I2 ^dir R)
  30233. =>WM: (15631: I2 ^reward 1)
  30234. =>WM: (15630: I2 ^see 1)
  30235. =>WM: (15629: N1109 ^status complete)
  30236. <=WM: (15618: I2 ^dir R)
  30237. <=WM: (15617: I2 ^reward 1)
  30238. <=WM: (15616: I2 ^see 0)
  30239. =>WM: (15633: I2 ^level-1 R1-root)
  30240. <=WM: (15619: I2 ^level-1 L0-root)
  30241. --- END Input Phase ---
  30242. --- Proposal Phase ---
  30243. --- Inner Elaboration Phase, active level 1 (S1) ---
  30244. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  30245. -->
  30246. (S1 ^operator O2217 = -0.1070236389116304)
  30247. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  30248. -->
  30249. (S1 ^operator O2218 = 0.6602272409272278)
  30250. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30251. -->
  30252. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30253. -->
  30254. Firing elaborate*copy-see-to-output-link
  30255. -->
  30256. (I3 ^see 1 +)
  30257. Firing elaborate*reward*based*on*reward
  30258. -->
  30259. (R1113 ^value 1 +)
  30260. (R1 ^reward R1113 +)
  30261. Firing propose*predict-yes
  30262. -->
  30263. (O2219 ^name predict-yes +)
  30264. (S1 ^operator O2219 +)
  30265. Firing propose*predict-no
  30266. -->
  30267. (O2220 ^name predict-no +)
  30268. (S1 ^operator O2220 +)
  30269. Firing rl*prefer*rvt*predict-no*H0*4
  30270. -->
  30271. (S1 ^operator O2218 = 0.3397874256976259)
  30272. Firing rl*prefer*rvt*predict-yes*H0*3
  30273. -->
  30274. (S1 ^operator O2217 = 0.3377081968199763)
  30275. Firing prefer*rvt*predict-yes*H0
  30276. -->
  30277. Firing prefer*rvt*predict-no*H0
  30278. -->
  30279. Firing elaborate*copy-dir-to-output-link
  30280. -->
  30281. (I3 ^dir R +)
  30282. inner elaboration loop at bottom goal.
  30283. Retracting elaborate*copy-see-to-output-link
  30284. -->
  30285. (I3 ^see 0 +)
  30286. Retracting propose*predict-no
  30287. -->
  30288. (O2218 ^name predict-no +)
  30289. (S1 ^operator O2218 +)
  30290. Retracting propose*predict-yes
  30291. -->
  30292. (O2217 ^name predict-yes +)
  30293. (S1 ^operator O2217 +)
  30294. Retracting elaborate*reward*based*on*reward
  30295. -->
  30296. (R1112 ^value 1 +)
  30297. (R1 ^reward R1112 +)
  30298. Retracting elaborate*copy-dir-to-output-link
  30299. -->
  30300. (I3 ^dir R +)
  30301. Retracting rl*prefer*rvt*predict-no*H0*4
  30302. -->
  30303. (S1 ^operator O2218 = 0.3397874256976259)
  30304. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  30305. -->
  30306. (S1 ^operator O2218 = -0.2817060109291377)
  30307. Retracting rl*prefer*rvt*predict-yes*H0*3
  30308. -->
  30309. (S1 ^operator O2217 = 0.3377081968199763)
  30310. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  30311. -->
  30312. (S1 ^operator O2217 = 0.6623145353178812)
  30313. =>WM: (15640: S1 ^operator O2220 +)
  30314. =>WM: (15639: S1 ^operator O2219 +)
  30315. =>WM: (15638: O2220 ^name predict-no)
  30316. =>WM: (15637: O2219 ^name predict-yes)
  30317. =>WM: (15636: R1113 ^value 1)
  30318. =>WM: (15635: R1 ^reward R1113)
  30319. =>WM: (15634: I3 ^see 1)
  30320. <=WM: (15625: S1 ^operator O2217 +)
  30321. <=WM: (15627: S1 ^operator O2217)
  30322. <=WM: (15626: S1 ^operator O2218 +)
  30323. <=WM: (15620: R1 ^reward R1112)
  30324. <=WM: (15511: I3 ^see 0)
  30325. <=WM: (15623: O2218 ^name predict-no)
  30326. <=WM: (15622: O2217 ^name predict-yes)
  30327. <=WM: (15621: R1112 ^value 1)
  30328. --- Inner Elaboration Phase, active level 1 (S1) ---
  30329. Firing prefer*rvt*predict-yes*H0
  30330. -->
  30331. Firing rl*prefer*rvt*predict-yes*H0*3
  30332. -->
  30333. (S1 ^operator O2219 = 0.3377081968199763)
  30334. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  30335. -->
  30336. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  30337. -->
  30338. (S1 ^operator O2219 = -0.1070236389116304)
  30339. Firing prefer*rvt*predict-no*H0
  30340. -->
  30341. Firing rl*prefer*rvt*predict-no*H0*4
  30342. -->
  30343. (S1 ^operator O2220 = 0.3397874256976259)
  30344. Firing prefer*rvt*predict-no*H0*4*v1*H1
  30345. -->
  30346. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  30347. -->
  30348. (S1 ^operator O2220 = 0.6602272409272278)
  30349. inner elaboration loop at bottom goal.
  30350. Retracting rl*prefer*rvt*predict-no*H0*4
  30351. -->
  30352. (S1 ^operator O2218 = 0.3397874256976259)
  30353. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  30354. -->
  30355. (S1 ^operator O2218 = 0.6602272409272278)
  30356. Retracting rl*prefer*rvt*predict-yes*H0*3
  30357. -->
  30358. (S1 ^operator O2217 = 0.3377081968199763)
  30359. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  30360. -->
  30361. (S1 ^operator O2217 = -0.1070236389116304)
  30362. --- END Proposal Phase ---
  30363. --- Decision Phase ---
  30364. RL update rl*prefer*rvt*predict-yes*H0*3 0.590108 -0.2524 0.337708 -> 0.590106 -0.252399 0.337706(R,m,v=1,0.907104,0.0847295)
  30365. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409918 0.252396 0.662315 -> 0.409916 0.252397 0.662312(R,m,v=1,1,0)
  30366. =>WM: (15641: S1 ^operator O2220)
  30367. 1110: O: O2220 (predict-no)
  30368. --- END Decision Phase ---
  30369. --- Application Phase ---
  30370. --- Firing Productions (PE) For State At Depth 1 ---
  30371. --- Inner Elaboration Phase, active level 1 (S1) ---
  30372. Firing apply*operator
  30373. -->
  30374. (I3 ^predict-no N1110 + :O )
  30375. Firing apply*operator*complete
  30376. -->
  30377. (I3 ^predict-yes N1109 - :O )
  30378. inner elaboration loop at bottom goal.
  30379. --- Change Working Memory (PE) ---
  30380. =>WM: (15642: I3 ^predict-no N1110)
  30381. <=WM: (15629: N1109 ^status complete)
  30382. <=WM: (15628: I3 ^predict-yes N1109)
  30383. --- Firing Productions (IE) For State At Depth 1 ---
  30384. --- Inner Elaboration Phase, active level 1 (S1) ---
  30385. Firing monitor*world
  30386. -->
  30387. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30388. --- Change Working Memory (IE) ---
  30389. --- END Application Phase ---
  30390. --- Output Phase ---
  30391. ENV: Agent did: predict-no for direction R in state State-B
  30392. In State-B moving R
  30393. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30394. predict error 0
  30395. dir: dir isL
  30396. --- END Output Phase ---
  30397. -/|--- Input Phase ---
  30398. =>WM: (15646: I2 ^dir L)
  30399. =>WM: (15645: I2 ^reward 1)
  30400. =>WM: (15644: I2 ^see 0)
  30401. =>WM: (15643: N1110 ^status complete)
  30402. <=WM: (15632: I2 ^dir R)
  30403. <=WM: (15631: I2 ^reward 1)
  30404. <=WM: (15630: I2 ^see 1)
  30405. =>WM: (15647: I2 ^level-1 R0-root)
  30406. <=WM: (15633: I2 ^level-1 R1-root)
  30407. --- END Input Phase ---
  30408. --- Proposal Phase ---
  30409. --- Inner Elaboration Phase, active level 1 (S1) ---
  30410. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  30411. -->
  30412. (S1 ^operator O2219 = 0.7359316881244164)
  30413. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30414. -->
  30415. Firing elaborate*copy-see-to-output-link
  30416. -->
  30417. (I3 ^see 0 +)
  30418. Firing elaborate*reward*based*on*reward
  30419. -->
  30420. (R1114 ^value 1 +)
  30421. (R1 ^reward R1114 +)
  30422. Firing propose*predict-yes
  30423. -->
  30424. (O2221 ^name predict-yes +)
  30425. (S1 ^operator O2221 +)
  30426. Firing propose*predict-no
  30427. -->
  30428. (O2222 ^name predict-no +)
  30429. (S1 ^operator O2222 +)
  30430. Firing rl*prefer*rvt*predict-no*H0*6
  30431. -->
  30432. (S1 ^operator O2220 = 0.9906608877166565)
  30433. Firing rl*prefer*rvt*predict-yes*H0*5
  30434. -->
  30435. (S1 ^operator O2219 = 0.2639970902976322)
  30436. Firing prefer*rvt*predict-yes*H0
  30437. -->
  30438. Firing prefer*rvt*predict-no*H0
  30439. -->
  30440. Firing elaborate*copy-dir-to-output-link
  30441. -->
  30442. (I3 ^dir L +)
  30443. inner elaboration loop at bottom goal.
  30444. Retracting elaborate*copy-see-to-output-link
  30445. -->
  30446. (I3 ^see 1 +)
  30447. Retracting propose*predict-no
  30448. -->
  30449. (O2220 ^name predict-no +)
  30450. (S1 ^operator O2220 +)
  30451. Retracting propose*predict-yes
  30452. -->
  30453. (O2219 ^name predict-yes +)
  30454. (S1 ^operator O2219 +)
  30455. Retracting elaborate*reward*based*on*reward
  30456. -->
  30457. (R1113 ^value 1 +)
  30458. (R1 ^reward R1113 +)
  30459. Retracting elaborate*copy-dir-to-output-link
  30460. -->
  30461. (I3 ^dir R +)
  30462. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  30463. -->
  30464. (S1 ^operator O2220 = 0.6602272409272278)
  30465. Retracting rl*prefer*rvt*predict-no*H0*4
  30466. -->
  30467. (S1 ^operator O2220 = 0.3397874256976259)
  30468. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  30469. -->
  30470. (S1 ^operator O2219 = -0.1070236389116304)
  30471. Retracting rl*prefer*rvt*predict-yes*H0*3
  30472. -->
  30473. (S1 ^operator O2219 = 0.337706366383665)
  30474. =>WM: (15655: S1 ^operator O2222 +)
  30475. =>WM: (15654: S1 ^operator O2221 +)
  30476. =>WM: (15653: I3 ^dir L)
  30477. =>WM: (15652: O2222 ^name predict-no)
  30478. =>WM: (15651: O2221 ^name predict-yes)
  30479. =>WM: (15650: R1114 ^value 1)
  30480. =>WM: (15649: R1 ^reward R1114)
  30481. =>WM: (15648: I3 ^see 0)
  30482. <=WM: (15639: S1 ^operator O2219 +)
  30483. <=WM: (15640: S1 ^operator O2220 +)
  30484. <=WM: (15641: S1 ^operator O2220)
  30485. <=WM: (15624: I3 ^dir R)
  30486. <=WM: (15635: R1 ^reward R1113)
  30487. <=WM: (15634: I3 ^see 1)
  30488. <=WM: (15638: O2220 ^name predict-no)
  30489. <=WM: (15637: O2219 ^name predict-yes)
  30490. <=WM: (15636: R1113 ^value 1)
  30491. --- Inner Elaboration Phase, active level 1 (S1) ---
  30492. Firing prefer*rvt*predict-yes*H0
  30493. -->
  30494. Firing rl*prefer*rvt*predict-yes*H0*5
  30495. -->
  30496. (S1 ^operator O2221 = 0.2639970902976322)
  30497. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30498. -->
  30499. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  30500. -->
  30501. (S1 ^operator O2221 = 0.7359316881244164)
  30502. Firing prefer*rvt*predict-no*H0
  30503. -->
  30504. Firing rl*prefer*rvt*predict-no*H0*6
  30505. -->
  30506. (S1 ^operator O2222 = 0.9906608877166565)
  30507. inner elaboration loop at bottom goal.
  30508. Retracting rl*prefer*rvt*predict-no*H0*6
  30509. -->
  30510. (S1 ^operator O2220 = 0.9906608877166565)
  30511. Retracting rl*prefer*rvt*predict-yes*H0*5
  30512. -->
  30513. (S1 ^operator O2219 = 0.2639970902976322)
  30514. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  30515. -->
  30516. (S1 ^operator O2219 = 0.7359316881244164)
  30517. --- END Proposal Phase ---
  30518. --- Decision Phase ---
  30519. RL update rl*prefer*rvt*predict-no*H0*4 0.570272 -0.230484 0.339787 -> 0.570271 -0.230484 0.339786(R,m,v=1,0.887701,0.100224)
  30520. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429744 0.230484 0.660227 -> 0.429742 0.230484 0.660226(R,m,v=1,1,0)
  30521. =>WM: (15656: S1 ^operator O2221)
  30522. 1111: O: O2221 (predict-yes)
  30523. --- END Decision Phase ---
  30524. --- Application Phase ---
  30525. --- Firing Productions (PE) For State At Depth 1 ---
  30526. --- Inner Elaboration Phase, active level 1 (S1) ---
  30527. Firing apply*operator
  30528. -->
  30529. (I3 ^predict-yes N1111 + :O )
  30530. Firing apply*operator*complete
  30531. -->
  30532. (I3 ^predict-no N1110 - :O )
  30533. inner elaboration loop at bottom goal.
  30534. --- Change Working Memory (PE) ---
  30535. =>WM: (15657: I3 ^predict-yes N1111)
  30536. <=WM: (15643: N1110 ^status complete)
  30537. <=WM: (15642: I3 ^predict-no N1110)
  30538. --- Firing Productions (IE) For State At Depth 1 ---
  30539. --- Inner Elaboration Phase, active level 1 (S1) ---
  30540. Firing monitor*world
  30541. -->
  30542. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30543. --- Change Working Memory (IE) ---
  30544. --- END Application Phase ---
  30545. --- Output Phase ---
  30546. ENV: Agent did: predict-yes for direction L in state State-B
  30547. In State-B moving L
  30548. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  30549. predict error 0
  30550. dir: dir isL
  30551. --- END Output Phase ---
  30552. \--- Input Phase ---
  30553. =>WM: (15661: I2 ^dir L)
  30554. =>WM: (15660: I2 ^reward 1)
  30555. =>WM: (15659: I2 ^see 1)
  30556. =>WM: (15658: N1111 ^status complete)
  30557. <=WM: (15646: I2 ^dir L)
  30558. <=WM: (15645: I2 ^reward 1)
  30559. <=WM: (15644: I2 ^see 0)
  30560. =>WM: (15662: I2 ^level-1 L1-root)
  30561. <=WM: (15647: I2 ^level-1 R0-root)
  30562. --- END Input Phase ---
  30563. --- Proposal Phase ---
  30564. --- Inner Elaboration Phase, active level 1 (S1) ---
  30565. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30566. -->
  30567. (S1 ^operator O2221 = -0.181727099742844)
  30568. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30569. -->
  30570. Firing elaborate*copy-see-to-output-link
  30571. -->
  30572. (I3 ^see 1 +)
  30573. Firing elaborate*reward*based*on*reward
  30574. -->
  30575. (R1115 ^value 1 +)
  30576. (R1 ^reward R1115 +)
  30577. Firing propose*predict-yes
  30578. -->
  30579. (O2223 ^name predict-yes +)
  30580. (S1 ^operator O2223 +)
  30581. Firing propose*predict-no
  30582. -->
  30583. (O2224 ^name predict-no +)
  30584. (S1 ^operator O2224 +)
  30585. Firing rl*prefer*rvt*predict-no*H0*6
  30586. -->
  30587. (S1 ^operator O2222 = 0.9906608877166565)
  30588. Firing rl*prefer*rvt*predict-yes*H0*5
  30589. -->
  30590. (S1 ^operator O2221 = 0.2639970902976322)
  30591. Firing prefer*rvt*predict-yes*H0
  30592. -->
  30593. Firing prefer*rvt*predict-no*H0
  30594. -->
  30595. Firing elaborate*copy-dir-to-output-link
  30596. -->
  30597. (I3 ^dir L +)
  30598. inner elaboration loop at bottom goal.
  30599. Retracting elaborate*copy-see-to-output-link
  30600. -->
  30601. (I3 ^see 0 +)
  30602. Retracting propose*predict-no
  30603. -->
  30604. (O2222 ^name predict-no +)
  30605. (S1 ^operator O2222 +)
  30606. Retracting propose*predict-yes
  30607. -->
  30608. (O2221 ^name predict-yes +)
  30609. (S1 ^operator O2221 +)
  30610. Retracting elaborate*reward*based*on*reward
  30611. -->
  30612. (R1114 ^value 1 +)
  30613. (R1 ^reward R1114 +)
  30614. Retracting elaborate*copy-dir-to-output-link
  30615. -->
  30616. (I3 ^dir L +)
  30617. Retracting rl*prefer*rvt*predict-no*H0*6
  30618. -->
  30619. (S1 ^operator O2222 = 0.9906608877166565)
  30620. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  30621. -->
  30622. (S1 ^operator O2221 = 0.7359316881244164)
  30623. Retracting rl*prefer*rvt*predict-yes*H0*5
  30624. -->
  30625. (S1 ^operator O2221 = 0.2639970902976322)
  30626. =>WM: (15669: S1 ^operator O2224 +)
  30627. =>WM: (15668: S1 ^operator O2223 +)
  30628. =>WM: (15667: O2224 ^name predict-no)
  30629. =>WM: (15666: O2223 ^name predict-yes)
  30630. =>WM: (15665: R1115 ^value 1)
  30631. =>WM: (15664: R1 ^reward R1115)
  30632. =>WM: (15663: I3 ^see 1)
  30633. <=WM: (15654: S1 ^operator O2221 +)
  30634. <=WM: (15656: S1 ^operator O2221)
  30635. <=WM: (15655: S1 ^operator O2222 +)
  30636. <=WM: (15649: R1 ^reward R1114)
  30637. <=WM: (15648: I3 ^see 0)
  30638. <=WM: (15652: O2222 ^name predict-no)
  30639. <=WM: (15651: O2221 ^name predict-yes)
  30640. <=WM: (15650: R1114 ^value 1)
  30641. --- Inner Elaboration Phase, active level 1 (S1) ---
  30642. Firing prefer*rvt*predict-yes*H0
  30643. -->
  30644. Firing rl*prefer*rvt*predict-yes*H0*5
  30645. -->
  30646. (S1 ^operator O2223 = 0.2639970902976322)
  30647. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30648. -->
  30649. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30650. -->
  30651. (S1 ^operator O2223 = -0.181727099742844)
  30652. Firing prefer*rvt*predict-no*H0
  30653. -->
  30654. Firing rl*prefer*rvt*predict-no*H0*6
  30655. -->
  30656. (S1 ^operator O2224 = 0.9906608877166565)
  30657. inner elaboration loop at bottom goal.
  30658. Retracting rl*prefer*rvt*predict-no*H0*6
  30659. -->
  30660. (S1 ^operator O2222 = 0.9906608877166565)
  30661. Retracting rl*prefer*rvt*predict-yes*H0*5
  30662. -->
  30663. (S1 ^operator O2221 = 0.2639970902976322)
  30664. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30665. -->
  30666. (S1 ^operator O2221 = -0.181727099742844)
  30667. --- END Proposal Phase ---
  30668. --- Decision Phase ---
  30669. RL update rl*prefer*rvt*predict-yes*H0*5 0.554383 -0.290386 0.263997 -> 0.554389 -0.290386 0.264003(R,m,v=1,0.886598,0.101063)
  30670. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445547 0.290385 0.735932 -> 0.445553 0.290385 0.735938(R,m,v=1,1,0)
  30671. =>WM: (15670: S1 ^operator O2224)
  30672. 1112: O: O2224 (predict-no)
  30673. --- END Decision Phase ---
  30674. --- Application Phase ---
  30675. --- Firing Productions (PE) For State At Depth 1 ---
  30676. --- Inner Elaboration Phase, active level 1 (S1) ---
  30677. Firing apply*operator
  30678. -->
  30679. (I3 ^predict-no N1112 + :O )
  30680. Firing apply*operator*complete
  30681. -->
  30682. (I3 ^predict-yes N1111 - :O )
  30683. inner elaboration loop at bottom goal.
  30684. --- Change Working Memory (PE) ---
  30685. =>WM: (15671: I3 ^predict-no N1112)
  30686. <=WM: (15658: N1111 ^status complete)
  30687. <=WM: (15657: I3 ^predict-yes N1111)
  30688. --- Firing Productions (IE) For State At Depth 1 ---
  30689. --- Inner Elaboration Phase, active level 1 (S1) ---
  30690. Firing monitor*world
  30691. -->
  30692. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30693. --- Change Working Memory (IE) ---
  30694. --- END Application Phase ---
  30695. --- Output Phase ---
  30696. ENV: Agent did: predict-no for direction L in state State-A
  30697. In State-A moving L
  30698. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30699. predict error 0
  30700. dir: dir isL
  30701. --- END Output Phase ---
  30702. -/|--- Input Phase ---
  30703. =>WM: (15675: I2 ^dir L)
  30704. =>WM: (15674: I2 ^reward 1)
  30705. =>WM: (15673: I2 ^see 0)
  30706. =>WM: (15672: N1112 ^status complete)
  30707. <=WM: (15661: I2 ^dir L)
  30708. <=WM: (15660: I2 ^reward 1)
  30709. <=WM: (15659: I2 ^see 1)
  30710. =>WM: (15676: I2 ^level-1 L0-root)
  30711. <=WM: (15662: I2 ^level-1 L1-root)
  30712. --- END Input Phase ---
  30713. --- Proposal Phase ---
  30714. --- Inner Elaboration Phase, active level 1 (S1) ---
  30715. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30716. -->
  30717. (S1 ^operator O2223 = -0.1386470047172653)
  30718. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30719. -->
  30720. Firing elaborate*copy-see-to-output-link
  30721. -->
  30722. (I3 ^see 0 +)
  30723. Firing elaborate*reward*based*on*reward
  30724. -->
  30725. (R1116 ^value 1 +)
  30726. (R1 ^reward R1116 +)
  30727. Firing propose*predict-yes
  30728. -->
  30729. (O2225 ^name predict-yes +)
  30730. (S1 ^operator O2225 +)
  30731. Firing propose*predict-no
  30732. -->
  30733. (O2226 ^name predict-no +)
  30734. (S1 ^operator O2226 +)
  30735. Firing rl*prefer*rvt*predict-no*H0*6
  30736. -->
  30737. (S1 ^operator O2224 = 0.9906608877166565)
  30738. Firing rl*prefer*rvt*predict-yes*H0*5
  30739. -->
  30740. (S1 ^operator O2223 = 0.2640027717901089)
  30741. Firing prefer*rvt*predict-yes*H0
  30742. -->
  30743. Firing prefer*rvt*predict-no*H0
  30744. -->
  30745. Firing elaborate*copy-dir-to-output-link
  30746. -->
  30747. (I3 ^dir L +)
  30748. inner elaboration loop at bottom goal.
  30749. Retracting elaborate*copy-see-to-output-link
  30750. -->
  30751. (I3 ^see 1 +)
  30752. Retracting propose*predict-no
  30753. -->
  30754. (O2224 ^name predict-no +)
  30755. (S1 ^operator O2224 +)
  30756. Retracting propose*predict-yes
  30757. -->
  30758. (O2223 ^name predict-yes +)
  30759. (S1 ^operator O2223 +)
  30760. Retracting elaborate*reward*based*on*reward
  30761. -->
  30762. (R1115 ^value 1 +)
  30763. (R1 ^reward R1115 +)
  30764. Retracting elaborate*copy-dir-to-output-link
  30765. -->
  30766. (I3 ^dir L +)
  30767. Retracting rl*prefer*rvt*predict-no*H0*6
  30768. -->
  30769. (S1 ^operator O2224 = 0.9906608877166565)
  30770. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  30771. -->
  30772. (S1 ^operator O2223 = -0.181727099742844)
  30773. Retracting rl*prefer*rvt*predict-yes*H0*5
  30774. -->
  30775. (S1 ^operator O2223 = 0.2640027717901089)
  30776. =>WM: (15683: S1 ^operator O2226 +)
  30777. =>WM: (15682: S1 ^operator O2225 +)
  30778. =>WM: (15681: O2226 ^name predict-no)
  30779. =>WM: (15680: O2225 ^name predict-yes)
  30780. =>WM: (15679: R1116 ^value 1)
  30781. =>WM: (15678: R1 ^reward R1116)
  30782. =>WM: (15677: I3 ^see 0)
  30783. <=WM: (15668: S1 ^operator O2223 +)
  30784. <=WM: (15669: S1 ^operator O2224 +)
  30785. <=WM: (15670: S1 ^operator O2224)
  30786. <=WM: (15664: R1 ^reward R1115)
  30787. <=WM: (15663: I3 ^see 1)
  30788. <=WM: (15667: O2224 ^name predict-no)
  30789. <=WM: (15666: O2223 ^name predict-yes)
  30790. <=WM: (15665: R1115 ^value 1)
  30791. --- Inner Elaboration Phase, active level 1 (S1) ---
  30792. Firing prefer*rvt*predict-yes*H0
  30793. -->
  30794. Firing rl*prefer*rvt*predict-yes*H0*5
  30795. -->
  30796. (S1 ^operator O2225 = 0.2640027717901089)
  30797. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30798. -->
  30799. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30800. -->
  30801. (S1 ^operator O2225 = -0.1386470047172653)
  30802. Firing prefer*rvt*predict-no*H0
  30803. -->
  30804. Firing rl*prefer*rvt*predict-no*H0*6
  30805. -->
  30806. (S1 ^operator O2226 = 0.9906608877166565)
  30807. inner elaboration loop at bottom goal.
  30808. Retracting rl*prefer*rvt*predict-no*H0*6
  30809. -->
  30810. (S1 ^operator O2224 = 0.9906608877166565)
  30811. Retracting rl*prefer*rvt*predict-yes*H0*5
  30812. -->
  30813. (S1 ^operator O2223 = 0.2640027717901089)
  30814. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30815. -->
  30816. (S1 ^operator O2223 = -0.1386470047172653)
  30817. --- END Proposal Phase ---
  30818. --- Decision Phase ---
  30819. RL update rl*prefer*rvt*predict-no*H0*6 0.990661 0 0.990661 -> 0.992186 0 0.992186(R,m,v=1,0.910714,0.0818007)
  30820. =>WM: (15684: S1 ^operator O2226)
  30821. 1113: O: O2226 (predict-no)
  30822. --- END Decision Phase ---
  30823. --- Application Phase ---
  30824. --- Firing Productions (PE) For State At Depth 1 ---
  30825. --- Inner Elaboration Phase, active level 1 (S1) ---
  30826. Firing apply*operator
  30827. -->
  30828. (I3 ^predict-no N1113 + :O )
  30829. Firing apply*operator*complete
  30830. -->
  30831. (I3 ^predict-no N1112 - :O )
  30832. inner elaboration loop at bottom goal.
  30833. --- Change Working Memory (PE) ---
  30834. =>WM: (15685: I3 ^predict-no N1113)
  30835. <=WM: (15672: N1112 ^status complete)
  30836. <=WM: (15671: I3 ^predict-no N1112)
  30837. --- Firing Productions (IE) For State At Depth 1 ---
  30838. --- Inner Elaboration Phase, active level 1 (S1) ---
  30839. Firing monitor*world
  30840. -->
  30841. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30842. --- Change Working Memory (IE) ---
  30843. --- END Application Phase ---
  30844. --- Output Phase ---
  30845. ENV: Agent did: predict-no for direction L in state State-A
  30846. In State-A moving L
  30847. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30848. predict error 0
  30849. dir: dir isL
  30850. --- END Output Phase ---
  30851. \-/--- Input Phase ---
  30852. =>WM: (15689: I2 ^dir L)
  30853. =>WM: (15688: I2 ^reward 1)
  30854. =>WM: (15687: I2 ^see 0)
  30855. =>WM: (15686: N1113 ^status complete)
  30856. <=WM: (15675: I2 ^dir L)
  30857. <=WM: (15674: I2 ^reward 1)
  30858. <=WM: (15673: I2 ^see 0)
  30859. =>WM: (15690: I2 ^level-1 L0-root)
  30860. <=WM: (15676: I2 ^level-1 L0-root)
  30861. --- END Input Phase ---
  30862. --- Proposal Phase ---
  30863. --- Inner Elaboration Phase, active level 1 (S1) ---
  30864. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30865. -->
  30866. (S1 ^operator O2225 = -0.1386470047172653)
  30867. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30868. -->
  30869. Firing elaborate*copy-see-to-output-link
  30870. -->
  30871. (I3 ^see 0 +)
  30872. Firing elaborate*reward*based*on*reward
  30873. -->
  30874. (R1117 ^value 1 +)
  30875. (R1 ^reward R1117 +)
  30876. Firing propose*predict-yes
  30877. -->
  30878. (O2227 ^name predict-yes +)
  30879. (S1 ^operator O2227 +)
  30880. Firing propose*predict-no
  30881. -->
  30882. (O2228 ^name predict-no +)
  30883. (S1 ^operator O2228 +)
  30884. Firing rl*prefer*rvt*predict-no*H0*6
  30885. -->
  30886. (S1 ^operator O2226 = 0.9921858986923503)
  30887. Firing rl*prefer*rvt*predict-yes*H0*5
  30888. -->
  30889. (S1 ^operator O2225 = 0.2640027717901089)
  30890. Firing prefer*rvt*predict-yes*H0
  30891. -->
  30892. Firing prefer*rvt*predict-no*H0
  30893. -->
  30894. Firing elaborate*copy-dir-to-output-link
  30895. -->
  30896. (I3 ^dir L +)
  30897. inner elaboration loop at bottom goal.
  30898. Retracting elaborate*copy-see-to-output-link
  30899. -->
  30900. (I3 ^see 0 +)
  30901. Retracting propose*predict-no
  30902. -->
  30903. (O2226 ^name predict-no +)
  30904. (S1 ^operator O2226 +)
  30905. Retracting propose*predict-yes
  30906. -->
  30907. (O2225 ^name predict-yes +)
  30908. (S1 ^operator O2225 +)
  30909. Retracting elaborate*reward*based*on*reward
  30910. -->
  30911. (R1116 ^value 1 +)
  30912. (R1 ^reward R1116 +)
  30913. Retracting elaborate*copy-dir-to-output-link
  30914. -->
  30915. (I3 ^dir L +)
  30916. Retracting rl*prefer*rvt*predict-no*H0*6
  30917. -->
  30918. (S1 ^operator O2226 = 0.9921858986923503)
  30919. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30920. -->
  30921. (S1 ^operator O2225 = -0.1386470047172653)
  30922. Retracting rl*prefer*rvt*predict-yes*H0*5
  30923. -->
  30924. (S1 ^operator O2225 = 0.2640027717901089)
  30925. =>WM: (15696: S1 ^operator O2228 +)
  30926. =>WM: (15695: S1 ^operator O2227 +)
  30927. =>WM: (15694: O2228 ^name predict-no)
  30928. =>WM: (15693: O2227 ^name predict-yes)
  30929. =>WM: (15692: R1117 ^value 1)
  30930. =>WM: (15691: R1 ^reward R1117)
  30931. <=WM: (15682: S1 ^operator O2225 +)
  30932. <=WM: (15683: S1 ^operator O2226 +)
  30933. <=WM: (15684: S1 ^operator O2226)
  30934. <=WM: (15678: R1 ^reward R1116)
  30935. <=WM: (15681: O2226 ^name predict-no)
  30936. <=WM: (15680: O2225 ^name predict-yes)
  30937. <=WM: (15679: R1116 ^value 1)
  30938. --- Inner Elaboration Phase, active level 1 (S1) ---
  30939. Firing prefer*rvt*predict-yes*H0
  30940. -->
  30941. Firing rl*prefer*rvt*predict-yes*H0*5
  30942. -->
  30943. (S1 ^operator O2227 = 0.2640027717901089)
  30944. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  30945. -->
  30946. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30947. -->
  30948. (S1 ^operator O2227 = -0.1386470047172653)
  30949. Firing prefer*rvt*predict-no*H0
  30950. -->
  30951. Firing rl*prefer*rvt*predict-no*H0*6
  30952. -->
  30953. (S1 ^operator O2228 = 0.9921858986923503)
  30954. inner elaboration loop at bottom goal.
  30955. Retracting rl*prefer*rvt*predict-no*H0*6
  30956. -->
  30957. (S1 ^operator O2226 = 0.9921858986923503)
  30958. Retracting rl*prefer*rvt*predict-yes*H0*5
  30959. -->
  30960. (S1 ^operator O2225 = 0.2640027717901089)
  30961. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  30962. -->
  30963. (S1 ^operator O2225 = -0.1386470047172653)
  30964. --- END Proposal Phase ---
  30965. --- Decision Phase ---
  30966. RL update rl*prefer*rvt*predict-no*H0*6 0.992186 0 0.992186 -> 0.993461 0 0.993461(R,m,v=1,0.911243,0.0813609)
  30967. =>WM: (15697: S1 ^operator O2228)
  30968. 1114: O: O2228 (predict-no)
  30969. --- END Decision Phase ---
  30970. --- Application Phase ---
  30971. --- Firing Productions (PE) For State At Depth 1 ---
  30972. --- Inner Elaboration Phase, active level 1 (S1) ---
  30973. Firing apply*operator
  30974. -->
  30975. (I3 ^predict-no N1114 + :O )
  30976. Firing apply*operator*complete
  30977. -->
  30978. (I3 ^predict-no N1113 - :O )
  30979. inner elaboration loop at bottom goal.
  30980. --- Change Working Memory (PE) ---
  30981. =>WM: (15698: I3 ^predict-no N1114)
  30982. <=WM: (15686: N1113 ^status complete)
  30983. <=WM: (15685: I3 ^predict-no N1113)
  30984. --- Firing Productions (IE) For State At Depth 1 ---
  30985. --- Inner Elaboration Phase, active level 1 (S1) ---
  30986. Firing monitor*world
  30987. -->
  30988. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30989. --- Change Working Memory (IE) ---
  30990. --- END Application Phase ---
  30991. --- Output Phase ---
  30992. ENV: Agent did: predict-no for direction L in state State-A
  30993. In State-A moving L
  30994. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30995. predict error 0
  30996. dir: dir isR
  30997. --- END Output Phase ---
  30998. |\---- Input Phase ---
  30999. =>WM: (15702: I2 ^dir R)
  31000. =>WM: (15701: I2 ^reward 1)
  31001. =>WM: (15700: I2 ^see 0)
  31002. =>WM: (15699: N1114 ^status complete)
  31003. <=WM: (15689: I2 ^dir L)
  31004. <=WM: (15688: I2 ^reward 1)
  31005. <=WM: (15687: I2 ^see 0)
  31006. =>WM: (15703: I2 ^level-1 L0-root)
  31007. <=WM: (15690: I2 ^level-1 L0-root)
  31008. --- END Input Phase ---
  31009. --- Proposal Phase ---
  31010. --- Inner Elaboration Phase, active level 1 (S1) ---
  31011. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  31012. -->
  31013. (S1 ^operator O2228 = -0.2817060109291377)
  31014. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  31015. -->
  31016. (S1 ^operator O2227 = 0.6623124185138372)
  31017. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31018. -->
  31019. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31020. -->
  31021. Firing elaborate*copy-see-to-output-link
  31022. -->
  31023. (I3 ^see 0 +)
  31024. Firing elaborate*reward*based*on*reward
  31025. -->
  31026. (R1118 ^value 1 +)
  31027. (R1 ^reward R1118 +)
  31028. Firing propose*predict-yes
  31029. -->
  31030. (O2229 ^name predict-yes +)
  31031. (S1 ^operator O2229 +)
  31032. Firing propose*predict-no
  31033. -->
  31034. (O2230 ^name predict-no +)
  31035. (S1 ^operator O2230 +)
  31036. Firing rl*prefer*rvt*predict-no*H0*4
  31037. -->
  31038. (S1 ^operator O2228 = 0.339786248810353)
  31039. Firing rl*prefer*rvt*predict-yes*H0*3
  31040. -->
  31041. (S1 ^operator O2227 = 0.337706366383665)
  31042. Firing prefer*rvt*predict-yes*H0
  31043. -->
  31044. Firing prefer*rvt*predict-no*H0
  31045. -->
  31046. Firing elaborate*copy-dir-to-output-link
  31047. -->
  31048. (I3 ^dir R +)
  31049. inner elaboration loop at bottom goal.
  31050. Retracting elaborate*copy-see-to-output-link
  31051. -->
  31052. (I3 ^see 0 +)
  31053. Retracting propose*predict-no
  31054. -->
  31055. (O2228 ^name predict-no +)
  31056. (S1 ^operator O2228 +)
  31057. Retracting propose*predict-yes
  31058. -->
  31059. (O2227 ^name predict-yes +)
  31060. (S1 ^operator O2227 +)
  31061. Retracting elaborate*reward*based*on*reward
  31062. -->
  31063. (R1117 ^value 1 +)
  31064. (R1 ^reward R1117 +)
  31065. Retracting elaborate*copy-dir-to-output-link
  31066. -->
  31067. (I3 ^dir L +)
  31068. Retracting rl*prefer*rvt*predict-no*H0*6
  31069. -->
  31070. (S1 ^operator O2228 = 0.9934606508001831)
  31071. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*32
  31072. -->
  31073. (S1 ^operator O2227 = -0.1386470047172653)
  31074. Retracting rl*prefer*rvt*predict-yes*H0*5
  31075. -->
  31076. (S1 ^operator O2227 = 0.2640027717901089)
  31077. =>WM: (15710: S1 ^operator O2230 +)
  31078. =>WM: (15709: S1 ^operator O2229 +)
  31079. =>WM: (15708: I3 ^dir R)
  31080. =>WM: (15707: O2230 ^name predict-no)
  31081. =>WM: (15706: O2229 ^name predict-yes)
  31082. =>WM: (15705: R1118 ^value 1)
  31083. =>WM: (15704: R1 ^reward R1118)
  31084. <=WM: (15695: S1 ^operator O2227 +)
  31085. <=WM: (15696: S1 ^operator O2228 +)
  31086. <=WM: (15697: S1 ^operator O2228)
  31087. <=WM: (15653: I3 ^dir L)
  31088. <=WM: (15691: R1 ^reward R1117)
  31089. <=WM: (15694: O2228 ^name predict-no)
  31090. <=WM: (15693: O2227 ^name predict-yes)
  31091. <=WM: (15692: R1117 ^value 1)
  31092. --- Inner Elaboration Phase, active level 1 (S1) ---
  31093. Firing prefer*rvt*predict-yes*H0
  31094. -->
  31095. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  31096. -->
  31097. (S1 ^operator O2229 = 0.6623124185138372)
  31098. Firing rl*prefer*rvt*predict-yes*H0*3
  31099. -->
  31100. (S1 ^operator O2229 = 0.337706366383665)
  31101. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31102. -->
  31103. Firing prefer*rvt*predict-no*H0
  31104. -->
  31105. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  31106. -->
  31107. (S1 ^operator O2230 = -0.2817060109291377)
  31108. Firing rl*prefer*rvt*predict-no*H0*4
  31109. -->
  31110. (S1 ^operator O2230 = 0.339786248810353)
  31111. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31112. -->
  31113. inner elaboration loop at bottom goal.
  31114. Retracting rl*prefer*rvt*predict-no*H0*4
  31115. -->
  31116. (S1 ^operator O2228 = 0.339786248810353)
  31117. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  31118. -->
  31119. (S1 ^operator O2228 = -0.2817060109291377)
  31120. Retracting rl*prefer*rvt*predict-yes*H0*3
  31121. -->
  31122. (S1 ^operator O2227 = 0.337706366383665)
  31123. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  31124. -->
  31125. (S1 ^operator O2227 = 0.6623124185138372)
  31126. --- END Proposal Phase ---
  31127. --- Decision Phase ---
  31128. RL update rl*prefer*rvt*predict-no*H0*6 0.993461 0 0.993461 -> 0.994526 0 0.994526(R,m,v=1,0.911765,0.0809259)
  31129. =>WM: (15711: S1 ^operator O2229)
  31130. 1115: O: O2229 (predict-yes)
  31131. --- END Decision Phase ---
  31132. --- Application Phase ---
  31133. --- Firing Productions (PE) For State At Depth 1 ---
  31134. --- Inner Elaboration Phase, active level 1 (S1) ---
  31135. Firing apply*operator
  31136. -->
  31137. (I3 ^predict-yes N1115 + :O )
  31138. Firing apply*operator*complete
  31139. -->
  31140. (I3 ^predict-no N1114 - :O )
  31141. inner elaboration loop at bottom goal.
  31142. --- Change Working Memory (PE) ---
  31143. =>WM: (15712: I3 ^predict-yes N1115)
  31144. <=WM: (15699: N1114 ^status complete)
  31145. <=WM: (15698: I3 ^predict-no N1114)
  31146. --- Firing Productions (IE) For State At Depth 1 ---
  31147. --- Inner Elaboration Phase, active level 1 (S1) ---
  31148. Firing monitor*world
  31149. -->
  31150. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31151. --- Change Working Memory (IE) ---
  31152. --- END Application Phase ---
  31153. --- Output Phase ---
  31154. ENV: Agent did: predict-yes for direction R in state State-A
  31155. In State-A moving R
  31156. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  31157. predict error 0
  31158. dir: dir isR
  31159. --- END Output Phase ---
  31160. /|\--- Input Phase ---
  31161. =>WM: (15716: I2 ^dir R)
  31162. =>WM: (15715: I2 ^reward 1)
  31163. =>WM: (15714: I2 ^see 1)
  31164. =>WM: (15713: N1115 ^status complete)
  31165. <=WM: (15702: I2 ^dir R)
  31166. <=WM: (15701: I2 ^reward 1)
  31167. <=WM: (15700: I2 ^see 0)
  31168. =>WM: (15717: I2 ^level-1 R1-root)
  31169. <=WM: (15703: I2 ^level-1 L0-root)
  31170. --- END Input Phase ---
  31171. --- Proposal Phase ---
  31172. --- Inner Elaboration Phase, active level 1 (S1) ---
  31173. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  31174. -->
  31175. (S1 ^operator O2229 = -0.1070236389116304)
  31176. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  31177. -->
  31178. (S1 ^operator O2230 = 0.6602258751792722)
  31179. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31180. -->
  31181. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31182. -->
  31183. Firing elaborate*copy-see-to-output-link
  31184. -->
  31185. (I3 ^see 1 +)
  31186. Firing elaborate*reward*based*on*reward
  31187. -->
  31188. (R1119 ^value 1 +)
  31189. (R1 ^reward R1119 +)
  31190. Firing propose*predict-yes
  31191. -->
  31192. (O2231 ^name predict-yes +)
  31193. (S1 ^operator O2231 +)
  31194. Firing propose*predict-no
  31195. -->
  31196. (O2232 ^name predict-no +)
  31197. (S1 ^operator O2232 +)
  31198. Firing rl*prefer*rvt*predict-no*H0*4
  31199. -->
  31200. (S1 ^operator O2230 = 0.339786248810353)
  31201. Firing rl*prefer*rvt*predict-yes*H0*3
  31202. -->
  31203. (S1 ^operator O2229 = 0.337706366383665)
  31204. Firing prefer*rvt*predict-yes*H0
  31205. -->
  31206. Firing prefer*rvt*predict-no*H0
  31207. -->
  31208. Firing elaborate*copy-dir-to-output-link
  31209. -->
  31210. (I3 ^dir R +)
  31211. inner elaboration loop at bottom goal.
  31212. Retracting elaborate*copy-see-to-output-link
  31213. -->
  31214. (I3 ^see 0 +)
  31215. Retracting propose*predict-no
  31216. -->
  31217. (O2230 ^name predict-no +)
  31218. (S1 ^operator O2230 +)
  31219. Retracting propose*predict-yes
  31220. -->
  31221. (O2229 ^name predict-yes +)
  31222. (S1 ^operator O2229 +)
  31223. Retracting elaborate*reward*based*on*reward
  31224. -->
  31225. (R1118 ^value 1 +)
  31226. (R1 ^reward R1118 +)
  31227. Retracting elaborate*copy-dir-to-output-link
  31228. -->
  31229. (I3 ^dir R +)
  31230. Retracting rl*prefer*rvt*predict-no*H0*4
  31231. -->
  31232. (S1 ^operator O2230 = 0.339786248810353)
  31233. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  31234. -->
  31235. (S1 ^operator O2230 = -0.2817060109291377)
  31236. Retracting rl*prefer*rvt*predict-yes*H0*3
  31237. -->
  31238. (S1 ^operator O2229 = 0.337706366383665)
  31239. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  31240. -->
  31241. (S1 ^operator O2229 = 0.6623124185138372)
  31242. =>WM: (15724: S1 ^operator O2232 +)
  31243. =>WM: (15723: S1 ^operator O2231 +)
  31244. =>WM: (15722: O2232 ^name predict-no)
  31245. =>WM: (15721: O2231 ^name predict-yes)
  31246. =>WM: (15720: R1119 ^value 1)
  31247. =>WM: (15719: R1 ^reward R1119)
  31248. =>WM: (15718: I3 ^see 1)
  31249. <=WM: (15709: S1 ^operator O2229 +)
  31250. <=WM: (15711: S1 ^operator O2229)
  31251. <=WM: (15710: S1 ^operator O2230 +)
  31252. <=WM: (15704: R1 ^reward R1118)
  31253. <=WM: (15677: I3 ^see 0)
  31254. <=WM: (15707: O2230 ^name predict-no)
  31255. <=WM: (15706: O2229 ^name predict-yes)
  31256. <=WM: (15705: R1118 ^value 1)
  31257. --- Inner Elaboration Phase, active level 1 (S1) ---
  31258. Firing prefer*rvt*predict-yes*H0
  31259. -->
  31260. Firing rl*prefer*rvt*predict-yes*H0*3
  31261. -->
  31262. (S1 ^operator O2231 = 0.337706366383665)
  31263. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31264. -->
  31265. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  31266. -->
  31267. (S1 ^operator O2231 = -0.1070236389116304)
  31268. Firing prefer*rvt*predict-no*H0
  31269. -->
  31270. Firing rl*prefer*rvt*predict-no*H0*4
  31271. -->
  31272. (S1 ^operator O2232 = 0.339786248810353)
  31273. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31274. -->
  31275. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  31276. -->
  31277. (S1 ^operator O2232 = 0.6602258751792722)
  31278. inner elaboration loop at bottom goal.
  31279. Retracting rl*prefer*rvt*predict-no*H0*4
  31280. -->
  31281. (S1 ^operator O2230 = 0.339786248810353)
  31282. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  31283. -->
  31284. (S1 ^operator O2230 = 0.6602258751792722)
  31285. Retracting rl*prefer*rvt*predict-yes*H0*3
  31286. -->
  31287. (S1 ^operator O2229 = 0.337706366383665)
  31288. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  31289. -->
  31290. (S1 ^operator O2229 = -0.1070236389116304)
  31291. --- END Proposal Phase ---
  31292. --- Decision Phase ---
  31293. RL update rl*prefer*rvt*predict-yes*H0*3 0.590106 -0.252399 0.337706 -> 0.590104 -0.252399 0.337705(R,m,v=1,0.907609,0.0843134)
  31294. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409916 0.252397 0.662312 -> 0.409914 0.252397 0.662311(R,m,v=1,1,0)
  31295. =>WM: (15725: S1 ^operator O2232)
  31296. 1116: O: O2232 (predict-no)
  31297. --- END Decision Phase ---
  31298. --- Application Phase ---
  31299. --- Firing Productions (PE) For State At Depth 1 ---
  31300. --- Inner Elaboration Phase, active level 1 (S1) ---
  31301. Firing apply*operator
  31302. -->
  31303. (I3 ^predict-no N1116 + :O )
  31304. Firing apply*operator*complete
  31305. -->
  31306. (I3 ^predict-yes N1115 - :O )
  31307. inner elaboration loop at bottom goal.
  31308. --- Change Working Memory (PE) ---
  31309. =>WM: (15726: I3 ^predict-no N1116)
  31310. <=WM: (15713: N1115 ^status complete)
  31311. <=WM: (15712: I3 ^predict-yes N1115)
  31312. --- Firing Productions (IE) For State At Depth 1 ---
  31313. --- Inner Elaboration Phase, active level 1 (S1) ---
  31314. Firing monitor*world
  31315. -->
  31316. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31317. --- Change Working Memory (IE) ---
  31318. --- END Application Phase ---
  31319. --- Output Phase ---
  31320. ENV: Agent did: predict-no for direction R in state State-B
  31321. In State-B moving R
  31322. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31323. predict error 0
  31324. dir: dir isU
  31325. --- END Output Phase ---
  31326. -/|--- Input Phase ---
  31327. =>WM: (15730: I2 ^dir U)
  31328. =>WM: (15729: I2 ^reward 1)
  31329. =>WM: (15728: I2 ^see 0)
  31330. =>WM: (15727: N1116 ^status complete)
  31331. <=WM: (15716: I2 ^dir R)
  31332. <=WM: (15715: I2 ^reward 1)
  31333. <=WM: (15714: I2 ^see 1)
  31334. =>WM: (15731: I2 ^level-1 R0-root)
  31335. <=WM: (15717: I2 ^level-1 R1-root)
  31336. --- END Input Phase ---
  31337. --- Proposal Phase ---
  31338. --- Inner Elaboration Phase, active level 1 (S1) ---
  31339. Firing elaborate*copy-see-to-output-link
  31340. -->
  31341. (I3 ^see 0 +)
  31342. Firing elaborate*reward*based*on*reward
  31343. -->
  31344. (R1120 ^value 1 +)
  31345. (R1 ^reward R1120 +)
  31346. Firing propose*predict-yes
  31347. -->
  31348. (O2233 ^name predict-yes +)
  31349. (S1 ^operator O2233 +)
  31350. Firing propose*predict-no
  31351. -->
  31352. (O2234 ^name predict-no +)
  31353. (S1 ^operator O2234 +)
  31354. Firing rl*prefer*rvt*predict-no*H0*2
  31355. -->
  31356. (S1 ^operator O2232 = 1.)
  31357. Firing rl*prefer*rvt*predict-yes*H0*1
  31358. -->
  31359. (S1 ^operator O2231 = 0.)
  31360. Firing prefer*rvt*predict-yes*H0
  31361. -->
  31362. Firing prefer*rvt*predict-no*H0
  31363. -->
  31364. Firing elaborate*copy-dir-to-output-link
  31365. -->
  31366. (I3 ^dir U +)
  31367. inner elaboration loop at bottom goal.
  31368. Retracting elaborate*copy-see-to-output-link
  31369. -->
  31370. (I3 ^see 1 +)
  31371. Retracting propose*predict-no
  31372. -->
  31373. (O2232 ^name predict-no +)
  31374. (S1 ^operator O2232 +)
  31375. Retracting propose*predict-yes
  31376. -->
  31377. (O2231 ^name predict-yes +)
  31378. (S1 ^operator O2231 +)
  31379. Retracting elaborate*reward*based*on*reward
  31380. -->
  31381. (R1119 ^value 1 +)
  31382. (R1 ^reward R1119 +)
  31383. Retracting elaborate*copy-dir-to-output-link
  31384. -->
  31385. (I3 ^dir R +)
  31386. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  31387. -->
  31388. (S1 ^operator O2232 = 0.6602258751792722)
  31389. Retracting rl*prefer*rvt*predict-no*H0*4
  31390. -->
  31391. (S1 ^operator O2232 = 0.339786248810353)
  31392. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  31393. -->
  31394. (S1 ^operator O2231 = -0.1070236389116304)
  31395. Retracting rl*prefer*rvt*predict-yes*H0*3
  31396. -->
  31397. (S1 ^operator O2231 = 0.3377048551132163)
  31398. =>WM: (15739: S1 ^operator O2234 +)
  31399. =>WM: (15738: S1 ^operator O2233 +)
  31400. =>WM: (15737: I3 ^dir U)
  31401. =>WM: (15736: O2234 ^name predict-no)
  31402. =>WM: (15735: O2233 ^name predict-yes)
  31403. =>WM: (15734: R1120 ^value 1)
  31404. =>WM: (15733: R1 ^reward R1120)
  31405. =>WM: (15732: I3 ^see 0)
  31406. <=WM: (15723: S1 ^operator O2231 +)
  31407. <=WM: (15724: S1 ^operator O2232 +)
  31408. <=WM: (15725: S1 ^operator O2232)
  31409. <=WM: (15708: I3 ^dir R)
  31410. <=WM: (15719: R1 ^reward R1119)
  31411. <=WM: (15718: I3 ^see 1)
  31412. <=WM: (15722: O2232 ^name predict-no)
  31413. <=WM: (15721: O2231 ^name predict-yes)
  31414. <=WM: (15720: R1119 ^value 1)
  31415. --- Inner Elaboration Phase, active level 1 (S1) ---
  31416. Firing prefer*rvt*predict-yes*H0
  31417. -->
  31418. Firing rl*prefer*rvt*predict-yes*H0*1
  31419. -->
  31420. (S1 ^operator O2233 = 0.)
  31421. Firing prefer*rvt*predict-no*H0
  31422. -->
  31423. Firing rl*prefer*rvt*predict-no*H0*2
  31424. -->
  31425. (S1 ^operator O2234 = 1.)
  31426. inner elaboration loop at bottom goal.
  31427. Retracting rl*prefer*rvt*predict-no*H0*2
  31428. -->
  31429. (S1 ^operator O2232 = 1.)
  31430. Retracting rl*prefer*rvt*predict-yes*H0*1
  31431. -->
  31432. (S1 ^operator O2231 = 0.)
  31433. --- END Proposal Phase ---
  31434. --- Decision Phase ---
  31435. RL update rl*prefer*rvt*predict-no*H0*4 0.570271 -0.230484 0.339786 -> 0.570269 -0.230484 0.339785(R,m,v=1,0.888298,0.0997554)
  31436. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429742 0.230484 0.660226 -> 0.429741 0.230484 0.660225(R,m,v=1,1,0)
  31437. =>WM: (15740: S1 ^operator O2234)
  31438. 1117: O: O2234 (predict-no)
  31439. --- END Decision Phase ---
  31440. --- Application Phase ---
  31441. --- Firing Productions (PE) For State At Depth 1 ---
  31442. --- Inner Elaboration Phase, active level 1 (S1) ---
  31443. Firing apply*operator
  31444. -->
  31445. (I3 ^predict-no N1117 + :O )
  31446. Firing apply*operator*complete
  31447. -->
  31448. (I3 ^predict-no N1116 - :O )
  31449. inner elaboration loop at bottom goal.
  31450. --- Change Working Memory (PE) ---
  31451. =>WM: (15741: I3 ^predict-no N1117)
  31452. <=WM: (15727: N1116 ^status complete)
  31453. <=WM: (15726: I3 ^predict-no N1116)
  31454. --- Firing Productions (IE) For State At Depth 1 ---
  31455. --- Inner Elaboration Phase, active level 1 (S1) ---
  31456. Firing monitor*world
  31457. -->
  31458. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31459. --- Change Working Memory (IE) ---
  31460. --- END Application Phase ---
  31461. --- Output Phase ---
  31462. ENV: Agent did: predict-no for direction U in state State-B
  31463. In State-B moving U
  31464. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31465. predict error 0
  31466. dir: dir isL
  31467. --- END Output Phase ---
  31468. \---- Input Phase ---
  31469. =>WM: (15745: I2 ^dir L)
  31470. =>WM: (15744: I2 ^reward 1)
  31471. =>WM: (15743: I2 ^see 0)
  31472. =>WM: (15742: N1117 ^status complete)
  31473. <=WM: (15730: I2 ^dir U)
  31474. <=WM: (15729: I2 ^reward 1)
  31475. <=WM: (15728: I2 ^see 0)
  31476. =>WM: (15746: I2 ^level-1 R0-root)
  31477. <=WM: (15731: I2 ^level-1 R0-root)
  31478. --- END Input Phase ---
  31479. --- Proposal Phase ---
  31480. --- Inner Elaboration Phase, active level 1 (S1) ---
  31481. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  31482. -->
  31483. (S1 ^operator O2233 = 0.7359384192579546)
  31484. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31485. -->
  31486. Firing elaborate*copy-see-to-output-link
  31487. -->
  31488. (I3 ^see 0 +)
  31489. Firing elaborate*reward*based*on*reward
  31490. -->
  31491. (R1121 ^value 1 +)
  31492. (R1 ^reward R1121 +)
  31493. Firing propose*predict-yes
  31494. -->
  31495. (O2235 ^name predict-yes +)
  31496. (S1 ^operator O2235 +)
  31497. Firing propose*predict-no
  31498. -->
  31499. (O2236 ^name predict-no +)
  31500. (S1 ^operator O2236 +)
  31501. Firing rl*prefer*rvt*predict-no*H0*6
  31502. -->
  31503. (S1 ^operator O2234 = 0.9945264206860271)
  31504. Firing rl*prefer*rvt*predict-yes*H0*5
  31505. -->
  31506. (S1 ^operator O2233 = 0.2640027717901089)
  31507. Firing prefer*rvt*predict-yes*H0
  31508. -->
  31509. Firing prefer*rvt*predict-no*H0
  31510. -->
  31511. Firing elaborate*copy-dir-to-output-link
  31512. -->
  31513. (I3 ^dir L +)
  31514. inner elaboration loop at bottom goal.
  31515. Retracting elaborate*copy-see-to-output-link
  31516. -->
  31517. (I3 ^see 0 +)
  31518. Retracting propose*predict-no
  31519. -->
  31520. (O2234 ^name predict-no +)
  31521. (S1 ^operator O2234 +)
  31522. Retracting propose*predict-yes
  31523. -->
  31524. (O2233 ^name predict-yes +)
  31525. (S1 ^operator O2233 +)
  31526. Retracting elaborate*reward*based*on*reward
  31527. -->
  31528. (R1120 ^value 1 +)
  31529. (R1 ^reward R1120 +)
  31530. Retracting elaborate*copy-dir-to-output-link
  31531. -->
  31532. (I3 ^dir U +)
  31533. Retracting rl*prefer*rvt*predict-no*H0*2
  31534. -->
  31535. (S1 ^operator O2234 = 1.)
  31536. Retracting rl*prefer*rvt*predict-yes*H0*1
  31537. -->
  31538. (S1 ^operator O2233 = 0.)
  31539. =>WM: (15753: S1 ^operator O2236 +)
  31540. =>WM: (15752: S1 ^operator O2235 +)
  31541. =>WM: (15751: I3 ^dir L)
  31542. =>WM: (15750: O2236 ^name predict-no)
  31543. =>WM: (15749: O2235 ^name predict-yes)
  31544. =>WM: (15748: R1121 ^value 1)
  31545. =>WM: (15747: R1 ^reward R1121)
  31546. <=WM: (15738: S1 ^operator O2233 +)
  31547. <=WM: (15739: S1 ^operator O2234 +)
  31548. <=WM: (15740: S1 ^operator O2234)
  31549. <=WM: (15737: I3 ^dir U)
  31550. <=WM: (15733: R1 ^reward R1120)
  31551. <=WM: (15736: O2234 ^name predict-no)
  31552. <=WM: (15735: O2233 ^name predict-yes)
  31553. <=WM: (15734: R1120 ^value 1)
  31554. --- Inner Elaboration Phase, active level 1 (S1) ---
  31555. Firing prefer*rvt*predict-yes*H0
  31556. -->
  31557. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  31558. -->
  31559. (S1 ^operator O2235 = 0.7359384192579546)
  31560. Firing rl*prefer*rvt*predict-yes*H0*5
  31561. -->
  31562. (S1 ^operator O2235 = 0.2640027717901089)
  31563. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31564. -->
  31565. Firing prefer*rvt*predict-no*H0
  31566. -->
  31567. Firing rl*prefer*rvt*predict-no*H0*6
  31568. -->
  31569. (S1 ^operator O2236 = 0.9945264206860271)
  31570. inner elaboration loop at bottom goal.
  31571. Retracting rl*prefer*rvt*predict-no*H0*6
  31572. -->
  31573. (S1 ^operator O2234 = 0.9945264206860271)
  31574. Retracting rl*prefer*rvt*predict-yes*H0*5
  31575. -->
  31576. (S1 ^operator O2233 = 0.2640027717901089)
  31577. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  31578. -->
  31579. (S1 ^operator O2233 = 0.7359384192579546)
  31580. --- END Proposal Phase ---
  31581. --- Decision Phase ---
  31582. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31583. =>WM: (15754: S1 ^operator O2235)
  31584. 1118: O: O2235 (predict-yes)
  31585. --- END Decision Phase ---
  31586. --- Application Phase ---
  31587. --- Firing Productions (PE) For State At Depth 1 ---
  31588. --- Inner Elaboration Phase, active level 1 (S1) ---
  31589. Firing apply*operator
  31590. -->
  31591. (I3 ^predict-yes N1118 + :O )
  31592. Firing apply*operator*complete
  31593. -->
  31594. (I3 ^predict-no N1117 - :O )
  31595. inner elaboration loop at bottom goal.
  31596. --- Change Working Memory (PE) ---
  31597. =>WM: (15755: I3 ^predict-yes N1118)
  31598. <=WM: (15742: N1117 ^status complete)
  31599. <=WM: (15741: I3 ^predict-no N1117)
  31600. --- Firing Productions (IE) For State At Depth 1 ---
  31601. --- Inner Elaboration Phase, active level 1 (S1) ---
  31602. Firing monitor*world
  31603. -->
  31604. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31605. --- Change Working Memory (IE) ---
  31606. --- END Application Phase ---
  31607. --- Output Phase ---
  31608. ENV: Agent did: predict-yes for direction L in state State-B
  31609. In State-B moving L
  31610. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  31611. predict error 0
  31612. dir: dir isU
  31613. --- END Output Phase ---
  31614. /|--- Input Phase ---
  31615. =>WM: (15759: I2 ^dir U)
  31616. =>WM: (15758: I2 ^reward 1)
  31617. =>WM: (15757: I2 ^see 1)
  31618. =>WM: (15756: N1118 ^status complete)
  31619. <=WM: (15745: I2 ^dir L)
  31620. <=WM: (15744: I2 ^reward 1)
  31621. <=WM: (15743: I2 ^see 0)
  31622. =>WM: (15760: I2 ^level-1 L1-root)
  31623. <=WM: (15746: I2 ^level-1 R0-root)
  31624. --- END Input Phase ---
  31625. --- Proposal Phase ---
  31626. --- Inner Elaboration Phase, active level 1 (S1) ---
  31627. Firing elaborate*copy-see-to-output-link
  31628. -->
  31629. (I3 ^see 1 +)
  31630. Firing elaborate*reward*based*on*reward
  31631. -->
  31632. (R1122 ^value 1 +)
  31633. (R1 ^reward R1122 +)
  31634. Firing propose*predict-yes
  31635. -->
  31636. (O2237 ^name predict-yes +)
  31637. (S1 ^operator O2237 +)
  31638. Firing propose*predict-no
  31639. -->
  31640. (O2238 ^name predict-no +)
  31641. (S1 ^operator O2238 +)
  31642. Firing rl*prefer*rvt*predict-no*H0*2
  31643. -->
  31644. (S1 ^operator O2236 = 1.)
  31645. Firing rl*prefer*rvt*predict-yes*H0*1
  31646. -->
  31647. (S1 ^operator O2235 = 0.)
  31648. Firing prefer*rvt*predict-yes*H0
  31649. -->
  31650. Firing prefer*rvt*predict-no*H0
  31651. -->
  31652. Firing elaborate*copy-dir-to-output-link
  31653. -->
  31654. (I3 ^dir U +)
  31655. inner elaboration loop at bottom goal.
  31656. Retracting elaborate*copy-see-to-output-link
  31657. -->
  31658. (I3 ^see 0 +)
  31659. Retracting propose*predict-no
  31660. -->
  31661. (O2236 ^name predict-no +)
  31662. (S1 ^operator O2236 +)
  31663. Retracting propose*predict-yes
  31664. -->
  31665. (O2235 ^name predict-yes +)
  31666. (S1 ^operator O2235 +)
  31667. Retracting elaborate*reward*based*on*reward
  31668. -->
  31669. (R1121 ^value 1 +)
  31670. (R1 ^reward R1121 +)
  31671. Retracting elaborate*copy-dir-to-output-link
  31672. -->
  31673. (I3 ^dir L +)
  31674. Retracting rl*prefer*rvt*predict-no*H0*6
  31675. -->
  31676. (S1 ^operator O2236 = 0.9945264206860271)
  31677. Retracting rl*prefer*rvt*predict-yes*H0*5
  31678. -->
  31679. (S1 ^operator O2235 = 0.2640027717901089)
  31680. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  31681. -->
  31682. (S1 ^operator O2235 = 0.7359384192579546)
  31683. =>WM: (15768: S1 ^operator O2238 +)
  31684. =>WM: (15767: S1 ^operator O2237 +)
  31685. =>WM: (15766: I3 ^dir U)
  31686. =>WM: (15765: O2238 ^name predict-no)
  31687. =>WM: (15764: O2237 ^name predict-yes)
  31688. =>WM: (15763: R1122 ^value 1)
  31689. =>WM: (15762: R1 ^reward R1122)
  31690. =>WM: (15761: I3 ^see 1)
  31691. <=WM: (15752: S1 ^operator O2235 +)
  31692. <=WM: (15754: S1 ^operator O2235)
  31693. <=WM: (15753: S1 ^operator O2236 +)
  31694. <=WM: (15751: I3 ^dir L)
  31695. <=WM: (15747: R1 ^reward R1121)
  31696. <=WM: (15732: I3 ^see 0)
  31697. <=WM: (15750: O2236 ^name predict-no)
  31698. <=WM: (15749: O2235 ^name predict-yes)
  31699. <=WM: (15748: R1121 ^value 1)
  31700. --- Inner Elaboration Phase, active level 1 (S1) ---
  31701. Firing prefer*rvt*predict-yes*H0
  31702. -->
  31703. Firing rl*prefer*rvt*predict-yes*H0*1
  31704. -->
  31705. (S1 ^operator O2237 = 0.)
  31706. Firing prefer*rvt*predict-no*H0
  31707. -->
  31708. Firing rl*prefer*rvt*predict-no*H0*2
  31709. -->
  31710. (S1 ^operator O2238 = 1.)
  31711. inner elaboration loop at bottom goal.
  31712. Retracting rl*prefer*rvt*predict-no*H0*2
  31713. -->
  31714. (S1 ^operator O2236 = 1.)
  31715. Retracting rl*prefer*rvt*predict-yes*H0*1
  31716. -->
  31717. (S1 ^operator O2235 = 0.)
  31718. --- END Proposal Phase ---
  31719. --- Decision Phase ---
  31720. RL update rl*prefer*rvt*predict-yes*H0*5 0.554389 -0.290386 0.264003 -> 0.554393 -0.290386 0.264007(R,m,v=1,0.887179,0.100608)
  31721. RL update rl*prefer*rvt*predict-yes*H0*5*v1*H1*30 0.445553 0.290385 0.735938 -> 0.445559 0.290385 0.735944(R,m,v=1,1,0)
  31722. =>WM: (15769: S1 ^operator O2238)
  31723. 1119: O: O2238 (predict-no)
  31724. --- END Decision Phase ---
  31725. --- Application Phase ---
  31726. --- Firing Productions (PE) For State At Depth 1 ---
  31727. --- Inner Elaboration Phase, active level 1 (S1) ---
  31728. Firing apply*operator
  31729. -->
  31730. (I3 ^predict-no N1119 + :O )
  31731. Firing apply*operator*complete
  31732. -->
  31733. (I3 ^predict-yes N1118 - :O )
  31734. inner elaboration loop at bottom goal.
  31735. --- Change Working Memory (PE) ---
  31736. =>WM: (15770: I3 ^predict-no N1119)
  31737. <=WM: (15756: N1118 ^status complete)
  31738. <=WM: (15755: I3 ^predict-yes N1118)
  31739. --- Firing Productions (IE) For State At Depth 1 ---
  31740. --- Inner Elaboration Phase, active level 1 (S1) ---
  31741. Firing monitor*world
  31742. -->
  31743. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31744. --- Change Working Memory (IE) ---
  31745. --- END Application Phase ---
  31746. --- Output Phase ---
  31747. ENV: Agent did: predict-no for direction U in state State-A
  31748. In State-A moving U
  31749. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31750. predict error 0
  31751. dir: dir isL
  31752. --- END Output Phase ---
  31753. \-/--- Input Phase ---
  31754. =>WM: (15774: I2 ^dir L)
  31755. =>WM: (15773: I2 ^reward 1)
  31756. =>WM: (15772: I2 ^see 0)
  31757. =>WM: (15771: N1119 ^status complete)
  31758. <=WM: (15759: I2 ^dir U)
  31759. <=WM: (15758: I2 ^reward 1)
  31760. <=WM: (15757: I2 ^see 1)
  31761. =>WM: (15775: I2 ^level-1 L1-root)
  31762. <=WM: (15760: I2 ^level-1 L1-root)
  31763. --- END Input Phase ---
  31764. --- Proposal Phase ---
  31765. --- Inner Elaboration Phase, active level 1 (S1) ---
  31766. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31767. -->
  31768. (S1 ^operator O2237 = -0.181727099742844)
  31769. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31770. -->
  31771. Firing elaborate*copy-see-to-output-link
  31772. -->
  31773. (I3 ^see 0 +)
  31774. Firing elaborate*reward*based*on*reward
  31775. -->
  31776. (R1123 ^value 1 +)
  31777. (R1 ^reward R1123 +)
  31778. Firing propose*predict-yes
  31779. -->
  31780. (O2239 ^name predict-yes +)
  31781. (S1 ^operator O2239 +)
  31782. Firing propose*predict-no
  31783. -->
  31784. (O2240 ^name predict-no +)
  31785. (S1 ^operator O2240 +)
  31786. Firing rl*prefer*rvt*predict-no*H0*6
  31787. -->
  31788. (S1 ^operator O2238 = 0.9945264206860271)
  31789. Firing rl*prefer*rvt*predict-yes*H0*5
  31790. -->
  31791. (S1 ^operator O2237 = 0.2640074592567178)
  31792. Firing prefer*rvt*predict-yes*H0
  31793. -->
  31794. Firing prefer*rvt*predict-no*H0
  31795. -->
  31796. Firing elaborate*copy-dir-to-output-link
  31797. -->
  31798. (I3 ^dir L +)
  31799. inner elaboration loop at bottom goal.
  31800. Retracting elaborate*copy-see-to-output-link
  31801. -->
  31802. (I3 ^see 1 +)
  31803. Retracting propose*predict-no
  31804. -->
  31805. (O2238 ^name predict-no +)
  31806. (S1 ^operator O2238 +)
  31807. Retracting propose*predict-yes
  31808. -->
  31809. (O2237 ^name predict-yes +)
  31810. (S1 ^operator O2237 +)
  31811. Retracting elaborate*reward*based*on*reward
  31812. -->
  31813. (R1122 ^value 1 +)
  31814. (R1 ^reward R1122 +)
  31815. Retracting elaborate*copy-dir-to-output-link
  31816. -->
  31817. (I3 ^dir U +)
  31818. Retracting rl*prefer*rvt*predict-no*H0*2
  31819. -->
  31820. (S1 ^operator O2238 = 1.)
  31821. Retracting rl*prefer*rvt*predict-yes*H0*1
  31822. -->
  31823. (S1 ^operator O2237 = 0.)
  31824. =>WM: (15783: S1 ^operator O2240 +)
  31825. =>WM: (15782: S1 ^operator O2239 +)
  31826. =>WM: (15781: I3 ^dir L)
  31827. =>WM: (15780: O2240 ^name predict-no)
  31828. =>WM: (15779: O2239 ^name predict-yes)
  31829. =>WM: (15778: R1123 ^value 1)
  31830. =>WM: (15777: R1 ^reward R1123)
  31831. =>WM: (15776: I3 ^see 0)
  31832. <=WM: (15767: S1 ^operator O2237 +)
  31833. <=WM: (15768: S1 ^operator O2238 +)
  31834. <=WM: (15769: S1 ^operator O2238)
  31835. <=WM: (15766: I3 ^dir U)
  31836. <=WM: (15762: R1 ^reward R1122)
  31837. <=WM: (15761: I3 ^see 1)
  31838. <=WM: (15765: O2238 ^name predict-no)
  31839. <=WM: (15764: O2237 ^name predict-yes)
  31840. <=WM: (15763: R1122 ^value 1)
  31841. --- Inner Elaboration Phase, active level 1 (S1) ---
  31842. Firing prefer*rvt*predict-yes*H0
  31843. -->
  31844. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31845. -->
  31846. (S1 ^operator O2239 = -0.181727099742844)
  31847. Firing rl*prefer*rvt*predict-yes*H0*5
  31848. -->
  31849. (S1 ^operator O2239 = 0.2640074592567178)
  31850. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  31851. -->
  31852. Firing prefer*rvt*predict-no*H0
  31853. -->
  31854. Firing rl*prefer*rvt*predict-no*H0*6
  31855. -->
  31856. (S1 ^operator O2240 = 0.9945264206860271)
  31857. inner elaboration loop at bottom goal.
  31858. Retracting rl*prefer*rvt*predict-no*H0*6
  31859. -->
  31860. (S1 ^operator O2238 = 0.9945264206860271)
  31861. Retracting rl*prefer*rvt*predict-yes*H0*5
  31862. -->
  31863. (S1 ^operator O2237 = 0.2640074592567178)
  31864. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31865. -->
  31866. (S1 ^operator O2237 = -0.181727099742844)
  31867. --- END Proposal Phase ---
  31868. --- Decision Phase ---
  31869. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31870. =>WM: (15784: S1 ^operator O2240)
  31871. 1120: O: O2240 (predict-no)
  31872. --- END Decision Phase ---
  31873. --- Application Phase ---
  31874. --- Firing Productions (PE) For State At Depth 1 ---
  31875. --- Inner Elaboration Phase, active level 1 (S1) ---
  31876. Firing apply*operator
  31877. -->
  31878. (I3 ^predict-no N1120 + :O )
  31879. Firing apply*operator*complete
  31880. -->
  31881. (I3 ^predict-no N1119 - :O )
  31882. inner elaboration loop at bottom goal.
  31883. --- Change Working Memory (PE) ---
  31884. =>WM: (15785: I3 ^predict-no N1120)
  31885. <=WM: (15771: N1119 ^status complete)
  31886. <=WM: (15770: I3 ^predict-no N1119)
  31887. --- Firing Productions (IE) For State At Depth 1 ---
  31888. --- Inner Elaboration Phase, active level 1 (S1) ---
  31889. Firing monitor*world
  31890. -->
  31891. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31892. --- Change Working Memory (IE) ---
  31893. --- END Application Phase ---
  31894. --- Output Phase ---
  31895. ENV: Agent did: predict-no for direction L in state State-A
  31896. In State-A moving L
  31897. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31898. predict error 0
  31899. dir: dir isR
  31900. --- END Output Phase ---
  31901. |\---- Input Phase ---
  31902. =>WM: (15789: I2 ^dir R)
  31903. =>WM: (15788: I2 ^reward 1)
  31904. =>WM: (15787: I2 ^see 0)
  31905. =>WM: (15786: N1120 ^status complete)
  31906. <=WM: (15774: I2 ^dir L)
  31907. <=WM: (15773: I2 ^reward 1)
  31908. <=WM: (15772: I2 ^see 0)
  31909. =>WM: (15790: I2 ^level-1 L0-root)
  31910. <=WM: (15775: I2 ^level-1 L1-root)
  31911. --- END Input Phase ---
  31912. --- Proposal Phase ---
  31913. --- Inner Elaboration Phase, active level 1 (S1) ---
  31914. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  31915. -->
  31916. (S1 ^operator O2240 = -0.2817060109291377)
  31917. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  31918. -->
  31919. (S1 ^operator O2239 = 0.6623106733629137)
  31920. Firing prefer*rvt*predict-no*H0*4*v1*H1
  31921. -->
  31922. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  31923. -->
  31924. Firing elaborate*copy-see-to-output-link
  31925. -->
  31926. (I3 ^see 0 +)
  31927. Firing elaborate*reward*based*on*reward
  31928. -->
  31929. (R1124 ^value 1 +)
  31930. (R1 ^reward R1124 +)
  31931. Firing propose*predict-yes
  31932. -->
  31933. (O2241 ^name predict-yes +)
  31934. (S1 ^operator O2241 +)
  31935. Firing propose*predict-no
  31936. -->
  31937. (O2242 ^name predict-no +)
  31938. (S1 ^operator O2242 +)
  31939. Firing rl*prefer*rvt*predict-no*H0*4
  31940. -->
  31941. (S1 ^operator O2240 = 0.3397852767825768)
  31942. Firing rl*prefer*rvt*predict-yes*H0*3
  31943. -->
  31944. (S1 ^operator O2239 = 0.3377048551132163)
  31945. Firing prefer*rvt*predict-yes*H0
  31946. -->
  31947. Firing prefer*rvt*predict-no*H0
  31948. -->
  31949. Firing elaborate*copy-dir-to-output-link
  31950. -->
  31951. (I3 ^dir R +)
  31952. inner elaboration loop at bottom goal.
  31953. Retracting elaborate*copy-see-to-output-link
  31954. -->
  31955. (I3 ^see 0 +)
  31956. Retracting propose*predict-no
  31957. -->
  31958. (O2240 ^name predict-no +)
  31959. (S1 ^operator O2240 +)
  31960. Retracting propose*predict-yes
  31961. -->
  31962. (O2239 ^name predict-yes +)
  31963. (S1 ^operator O2239 +)
  31964. Retracting elaborate*reward*based*on*reward
  31965. -->
  31966. (R1123 ^value 1 +)
  31967. (R1 ^reward R1123 +)
  31968. Retracting elaborate*copy-dir-to-output-link
  31969. -->
  31970. (I3 ^dir L +)
  31971. Retracting rl*prefer*rvt*predict-no*H0*6
  31972. -->
  31973. (S1 ^operator O2240 = 0.9945264206860271)
  31974. Retracting rl*prefer*rvt*predict-yes*H0*5
  31975. -->
  31976. (S1 ^operator O2239 = 0.2640074592567178)
  31977. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*31
  31978. -->
  31979. (S1 ^operator O2239 = -0.181727099742844)
  31980. =>WM: (15797: S1 ^operator O2242 +)
  31981. =>WM: (15796: S1 ^operator O2241 +)
  31982. =>WM: (15795: I3 ^dir R)
  31983. =>WM: (15794: O2242 ^name predict-no)
  31984. =>WM: (15793: O2241 ^name predict-yes)
  31985. =>WM: (15792: R1124 ^value 1)
  31986. =>WM: (15791: R1 ^reward R1124)
  31987. <=WM: (15782: S1 ^operator O2239 +)
  31988. <=WM: (15783: S1 ^operator O2240 +)
  31989. <=WM: (15784: S1 ^operator O2240)
  31990. <=WM: (15781: I3 ^dir L)
  31991. <=WM: (15777: R1 ^reward R1123)
  31992. <=WM: (15780: O2240 ^name predict-no)
  31993. <=WM: (15779: O2239 ^name predict-yes)
  31994. <=WM: (15778: R1123 ^value 1)
  31995. --- Inner Elaboration Phase, active level 1 (S1) ---
  31996. Firing prefer*rvt*predict-yes*H0
  31997. -->
  31998. Firing rl*prefer*rvt*predict-yes*H0*3
  31999. -->
  32000. (S1 ^operator O2241 = 0.3377048551132163)
  32001. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32002. -->
  32003. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  32004. -->
  32005. (S1 ^operator O2241 = 0.6623106733629137)
  32006. Firing prefer*rvt*predict-no*H0
  32007. -->
  32008. Firing rl*prefer*rvt*predict-no*H0*4
  32009. -->
  32010. (S1 ^operator O2242 = 0.3397852767825768)
  32011. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32012. -->
  32013. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  32014. -->
  32015. (S1 ^operator O2242 = -0.2817060109291377)
  32016. inner elaboration loop at bottom goal.
  32017. Retracting rl*prefer*rvt*predict-no*H0*4
  32018. -->
  32019. (S1 ^operator O2240 = 0.3397852767825768)
  32020. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  32021. -->
  32022. (S1 ^operator O2240 = -0.2817060109291377)
  32023. Retracting rl*prefer*rvt*predict-yes*H0*3
  32024. -->
  32025. (S1 ^operator O2239 = 0.3377048551132163)
  32026. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  32027. -->
  32028. (S1 ^operator O2239 = 0.6623106733629137)
  32029. --- END Proposal Phase ---
  32030. --- Decision Phase ---
  32031. RL update rl*prefer*rvt*predict-no*H0*6 0.994526 0 0.994526 -> 0.995418 0 0.995418(R,m,v=1,0.912281,0.0804954)
  32032. =>WM: (15798: S1 ^operator O2241)
  32033. 1121: O: O2241 (predict-yes)
  32034. --- END Decision Phase ---
  32035. --- Application Phase ---
  32036. --- Firing Productions (PE) For State At Depth 1 ---
  32037. --- Inner Elaboration Phase, active level 1 (S1) ---
  32038. Firing apply*operator
  32039. -->
  32040. (I3 ^predict-yes N1121 + :O )
  32041. Firing apply*operator*complete
  32042. -->
  32043. (I3 ^predict-no N1120 - :O )
  32044. inner elaboration loop at bottom goal.
  32045. --- Change Working Memory (PE) ---
  32046. =>WM: (15799: I3 ^predict-yes N1121)
  32047. <=WM: (15786: N1120 ^status complete)
  32048. <=WM: (15785: I3 ^predict-no N1120)
  32049. --- Firing Productions (IE) For State At Depth 1 ---
  32050. --- Inner Elaboration Phase, active level 1 (S1) ---
  32051. Firing monitor*world
  32052. -->
  32053. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32054. --- Change Working Memory (IE) ---
  32055. --- END Application Phase ---
  32056. --- Output Phase ---
  32057. ENV: Agent did: predict-yes for direction R in state State-A
  32058. In State-A moving R
  32059. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  32060. predict error 0
  32061. dir: dir isU
  32062. --- END Output Phase ---
  32063. /--- Input Phase ---
  32064. =>WM: (15803: I2 ^dir U)
  32065. =>WM: (15802: I2 ^reward 1)
  32066. =>WM: (15801: I2 ^see 1)
  32067. =>WM: (15800: N1121 ^status complete)
  32068. <=WM: (15789: I2 ^dir R)
  32069. <=WM: (15788: I2 ^reward 1)
  32070. <=WM: (15787: I2 ^see 0)
  32071. =>WM: (15804: I2 ^level-1 R1-root)
  32072. <=WM: (15790: I2 ^level-1 L0-root)
  32073. --- END Input Phase ---
  32074. --- Proposal Phase ---
  32075. --- Inner Elaboration Phase, active level 1 (S1) ---
  32076. Firing elaborate*copy-see-to-output-link
  32077. -->
  32078. (I3 ^see 1 +)
  32079. Firing elaborate*reward*based*on*reward
  32080. -->
  32081. (R1125 ^value 1 +)
  32082. (R1 ^reward R1125 +)
  32083. Firing propose*predict-yes
  32084. -->
  32085. (O2243 ^name predict-yes +)
  32086. (S1 ^operator O2243 +)
  32087. Firing propose*predict-no
  32088. -->
  32089. (O2244 ^name predict-no +)
  32090. (S1 ^operator O2244 +)
  32091. Firing rl*prefer*rvt*predict-no*H0*2
  32092. -->
  32093. (S1 ^operator O2242 = 1.)
  32094. Firing rl*prefer*rvt*predict-yes*H0*1
  32095. -->
  32096. (S1 ^operator O2241 = 0.)
  32097. Firing prefer*rvt*predict-yes*H0
  32098. -->
  32099. Firing prefer*rvt*predict-no*H0
  32100. -->
  32101. Firing elaborate*copy-dir-to-output-link
  32102. -->
  32103. (I3 ^dir U +)
  32104. inner elaboration loop at bottom goal.
  32105. Retracting elaborate*copy-see-to-output-link
  32106. -->
  32107. (I3 ^see 0 +)
  32108. Retracting propose*predict-no
  32109. -->
  32110. (O2242 ^name predict-no +)
  32111. (S1 ^operator O2242 +)
  32112. Retracting propose*predict-yes
  32113. -->
  32114. (O2241 ^name predict-yes +)
  32115. (S1 ^operator O2241 +)
  32116. Retracting elaborate*reward*based*on*reward
  32117. -->
  32118. (R1124 ^value 1 +)
  32119. (R1 ^reward R1124 +)
  32120. Retracting elaborate*copy-dir-to-output-link
  32121. -->
  32122. (I3 ^dir R +)
  32123. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*33
  32124. -->
  32125. (S1 ^operator O2242 = -0.2817060109291377)
  32126. Retracting rl*prefer*rvt*predict-no*H0*4
  32127. -->
  32128. (S1 ^operator O2242 = 0.3397852767825768)
  32129. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*34
  32130. -->
  32131. (S1 ^operator O2241 = 0.6623106733629137)
  32132. Retracting rl*prefer*rvt*predict-yes*H0*3
  32133. -->
  32134. (S1 ^operator O2241 = 0.3377048551132163)
  32135. =>WM: (15812: S1 ^operator O2244 +)
  32136. =>WM: (15811: S1 ^operator O2243 +)
  32137. =>WM: (15810: I3 ^dir U)
  32138. =>WM: (15809: O2244 ^name predict-no)
  32139. =>WM: (15808: O2243 ^name predict-yes)
  32140. =>WM: (15807: R1125 ^value 1)
  32141. =>WM: (15806: R1 ^reward R1125)
  32142. =>WM: (15805: I3 ^see 1)
  32143. <=WM: (15796: S1 ^operator O2241 +)
  32144. <=WM: (15798: S1 ^operator O2241)
  32145. <=WM: (15797: S1 ^operator O2242 +)
  32146. <=WM: (15795: I3 ^dir R)
  32147. <=WM: (15791: R1 ^reward R1124)
  32148. <=WM: (15776: I3 ^see 0)
  32149. <=WM: (15794: O2242 ^name predict-no)
  32150. <=WM: (15793: O2241 ^name predict-yes)
  32151. <=WM: (15792: R1124 ^value 1)
  32152. --- Inner Elaboration Phase, active level 1 (S1) ---
  32153. Firing prefer*rvt*predict-yes*H0
  32154. -->
  32155. Firing rl*prefer*rvt*predict-yes*H0*1
  32156. -->
  32157. (S1 ^operator O2243 = 0.)
  32158. Firing prefer*rvt*predict-no*H0
  32159. -->
  32160. Firing rl*prefer*rvt*predict-no*H0*2
  32161. -->
  32162. (S1 ^operator O2244 = 1.)
  32163. inner elaboration loop at bottom goal.
  32164. Retracting rl*prefer*rvt*predict-no*H0*2
  32165. -->
  32166. (S1 ^operator O2242 = 1.)
  32167. Retracting rl*prefer*rvt*predict-yes*H0*1
  32168. -->
  32169. (S1 ^operator O2241 = 0.)
  32170. --- END Proposal Phase ---
  32171. --- Decision Phase ---
  32172. RL update rl*prefer*rvt*predict-yes*H0*3 0.590104 -0.252399 0.337705 -> 0.590103 -0.252399 0.337704(R,m,v=1,0.908108,0.0839013)
  32173. RL update rl*prefer*rvt*predict-yes*H0*3*v1*H1*34 0.409914 0.252397 0.662311 -> 0.409912 0.252397 0.662309(R,m,v=1,1,0)
  32174. =>WM: (15813: S1 ^operator O2244)
  32175. 1122: O: O2244 (predict-no)
  32176. --- END Decision Phase ---
  32177. --- Application Phase ---
  32178. --- Firing Productions (PE) For State At Depth 1 ---
  32179. --- Inner Elaboration Phase, active level 1 (S1) ---
  32180. Firing apply*operator
  32181. -->
  32182. (I3 ^predict-no N1122 + :O )
  32183. Firing apply*operator*complete
  32184. -->
  32185. (I3 ^predict-yes N1121 - :O )
  32186. inner elaboration loop at bottom goal.
  32187. --- Change Working Memory (PE) ---
  32188. =>WM: (15814: I3 ^predict-no N1122)
  32189. <=WM: (15800: N1121 ^status complete)
  32190. <=WM: (15799: I3 ^predict-yes N1121)
  32191. --- Firing Productions (IE) For State At Depth 1 ---
  32192. --- Inner Elaboration Phase, active level 1 (S1) ---
  32193. Firing monitor*world
  32194. -->
  32195. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32196. --- Change Working Memory (IE) ---
  32197. --- END Application Phase ---
  32198. --- Output Phase ---
  32199. ENV: Agent did: predict-no for direction U in state State-B
  32200. In State-B moving U
  32201. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32202. predict error 0
  32203. dir: dir isR
  32204. --- END Output Phase ---
  32205. |\---- Input Phase ---
  32206. =>WM: (15818: I2 ^dir R)
  32207. =>WM: (15817: I2 ^reward 1)
  32208. =>WM: (15816: I2 ^see 0)
  32209. =>WM: (15815: N1122 ^status complete)
  32210. <=WM: (15803: I2 ^dir U)
  32211. <=WM: (15802: I2 ^reward 1)
  32212. <=WM: (15801: I2 ^see 1)
  32213. =>WM: (15819: I2 ^level-1 R1-root)
  32214. <=WM: (15804: I2 ^level-1 R1-root)
  32215. --- END Input Phase ---
  32216. --- Proposal Phase ---
  32217. --- Inner Elaboration Phase, active level 1 (S1) ---
  32218. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  32219. -->
  32220. (S1 ^operator O2243 = -0.1070236389116304)
  32221. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  32222. -->
  32223. (S1 ^operator O2244 = 0.6602247488387273)
  32224. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32225. -->
  32226. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32227. -->
  32228. Firing elaborate*copy-see-to-output-link
  32229. -->
  32230. (I3 ^see 0 +)
  32231. Firing elaborate*reward*based*on*reward
  32232. -->
  32233. (R1126 ^value 1 +)
  32234. (R1 ^reward R1126 +)
  32235. Firing propose*predict-yes
  32236. -->
  32237. (O2245 ^name predict-yes +)
  32238. (S1 ^operator O2245 +)
  32239. Firing propose*predict-no
  32240. -->
  32241. (O2246 ^name predict-no +)
  32242. (S1 ^operator O2246 +)
  32243. Firing rl*prefer*rvt*predict-no*H0*4
  32244. -->
  32245. (S1 ^operator O2244 = 0.3397852767825768)
  32246. Firing rl*prefer*rvt*predict-yes*H0*3
  32247. -->
  32248. (S1 ^operator O2243 = 0.3377036069148361)
  32249. Firing prefer*rvt*predict-yes*H0
  32250. -->
  32251. Firing prefer*rvt*predict-no*H0
  32252. -->
  32253. Firing elaborate*copy-dir-to-output-link
  32254. -->
  32255. (I3 ^dir R +)
  32256. inner elaboration loop at bottom goal.
  32257. Retracting elaborate*copy-see-to-output-link
  32258. -->
  32259. (I3 ^see 1 +)
  32260. Retracting propose*predict-no
  32261. -->
  32262. (O2244 ^name predict-no +)
  32263. (S1 ^operator O2244 +)
  32264. Retracting propose*predict-yes
  32265. -->
  32266. (O2243 ^name predict-yes +)
  32267. (S1 ^operator O2243 +)
  32268. Retracting elaborate*reward*based*on*reward
  32269. -->
  32270. (R1125 ^value 1 +)
  32271. (R1 ^reward R1125 +)
  32272. Retracting elaborate*copy-dir-to-output-link
  32273. -->
  32274. (I3 ^dir U +)
  32275. Retracting rl*prefer*rvt*predict-no*H0*2
  32276. -->
  32277. (S1 ^operator O2244 = 1.)
  32278. Retracting rl*prefer*rvt*predict-yes*H0*1
  32279. -->
  32280. (S1 ^operator O2243 = 0.)
  32281. =>WM: (15827: S1 ^operator O2246 +)
  32282. =>WM: (15826: S1 ^operator O2245 +)
  32283. =>WM: (15825: I3 ^dir R)
  32284. =>WM: (15824: O2246 ^name predict-no)
  32285. =>WM: (15823: O2245 ^name predict-yes)
  32286. =>WM: (15822: R1126 ^value 1)
  32287. =>WM: (15821: R1 ^reward R1126)
  32288. =>WM: (15820: I3 ^see 0)
  32289. <=WM: (15811: S1 ^operator O2243 +)
  32290. <=WM: (15812: S1 ^operator O2244 +)
  32291. <=WM: (15813: S1 ^operator O2244)
  32292. <=WM: (15810: I3 ^dir U)
  32293. <=WM: (15806: R1 ^reward R1125)
  32294. <=WM: (15805: I3 ^see 1)
  32295. <=WM: (15809: O2244 ^name predict-no)
  32296. <=WM: (15808: O2243 ^name predict-yes)
  32297. <=WM: (15807: R1125 ^value 1)
  32298. --- Inner Elaboration Phase, active level 1 (S1) ---
  32299. Firing prefer*rvt*predict-yes*H0
  32300. -->
  32301. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  32302. -->
  32303. (S1 ^operator O2245 = -0.1070236389116304)
  32304. Firing rl*prefer*rvt*predict-yes*H0*3
  32305. -->
  32306. (S1 ^operator O2245 = 0.3377036069148361)
  32307. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32308. -->
  32309. Firing prefer*rvt*predict-no*H0
  32310. -->
  32311. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  32312. -->
  32313. (S1 ^operator O2246 = 0.6602247488387273)
  32314. Firing rl*prefer*rvt*predict-no*H0*4
  32315. -->
  32316. (S1 ^operator O2246 = 0.3397852767825768)
  32317. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32318. -->
  32319. inner elaboration loop at bottom goal.
  32320. Retracting rl*prefer*rvt*predict-no*H0*4
  32321. -->
  32322. (S1 ^operator O2244 = 0.3397852767825768)
  32323. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  32324. -->
  32325. (S1 ^operator O2244 = 0.6602247488387273)
  32326. Retracting rl*prefer*rvt*predict-yes*H0*3
  32327. -->
  32328. (S1 ^operator O2243 = 0.3377036069148361)
  32329. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  32330. -->
  32331. (S1 ^operator O2243 = -0.1070236389116304)
  32332. --- END Proposal Phase ---
  32333. --- Decision Phase ---
  32334. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32335. =>WM: (15828: S1 ^operator O2246)
  32336. 1123: O: O2246 (predict-no)
  32337. --- END Decision Phase ---
  32338. --- Application Phase ---
  32339. --- Firing Productions (PE) For State At Depth 1 ---
  32340. --- Inner Elaboration Phase, active level 1 (S1) ---
  32341. Firing apply*operator
  32342. -->
  32343. (I3 ^predict-no N1123 + :O )
  32344. Firing apply*operator*complete
  32345. -->
  32346. (I3 ^predict-no N1122 - :O )
  32347. inner elaboration loop at bottom goal.
  32348. --- Change Working Memory (PE) ---
  32349. =>WM: (15829: I3 ^predict-no N1123)
  32350. <=WM: (15815: N1122 ^status complete)
  32351. <=WM: (15814: I3 ^predict-no N1122)
  32352. --- Firing Productions (IE) For State At Depth 1 ---
  32353. --- Inner Elaboration Phase, active level 1 (S1) ---
  32354. Firing monitor*world
  32355. -->
  32356. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32357. --- Change Working Memory (IE) ---
  32358. --- END Application Phase ---
  32359. --- Output Phase ---
  32360. ENV: Agent did: predict-no for direction R in state State-B
  32361. In State-B moving R
  32362. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32363. predict error 0
  32364. dir: dir isU
  32365. --- END Output Phase ---
  32366. /|--- Input Phase ---
  32367. =>WM: (15833: I2 ^dir U)
  32368. =>WM: (15832: I2 ^reward 1)
  32369. =>WM: (15831: I2 ^see 0)
  32370. =>WM: (15830: N1123 ^status complete)
  32371. <=WM: (15818: I2 ^dir R)
  32372. <=WM: (15817: I2 ^reward 1)
  32373. <=WM: (15816: I2 ^see 0)
  32374. =>WM: (15834: I2 ^level-1 R0-root)
  32375. <=WM: (15819: I2 ^level-1 R1-root)
  32376. --- END Input Phase ---
  32377. --- Proposal Phase ---
  32378. --- Inner Elaboration Phase, active level 1 (S1) ---
  32379. Firing elaborate*copy-see-to-output-link
  32380. -->
  32381. (I3 ^see 0 +)
  32382. Firing elaborate*reward*based*on*reward
  32383. -->
  32384. (R1127 ^value 1 +)
  32385. (R1 ^reward R1127 +)
  32386. Firing propose*predict-yes
  32387. -->
  32388. (O2247 ^name predict-yes +)
  32389. (S1 ^operator O2247 +)
  32390. Firing propose*predict-no
  32391. -->
  32392. (O2248 ^name predict-no +)
  32393. (S1 ^operator O2248 +)
  32394. Firing rl*prefer*rvt*predict-no*H0*2
  32395. -->
  32396. (S1 ^operator O2246 = 1.)
  32397. Firing rl*prefer*rvt*predict-yes*H0*1
  32398. -->
  32399. (S1 ^operator O2245 = 0.)
  32400. Firing prefer*rvt*predict-yes*H0
  32401. -->
  32402. Firing prefer*rvt*predict-no*H0
  32403. -->
  32404. Firing elaborate*copy-dir-to-output-link
  32405. -->
  32406. (I3 ^dir U +)
  32407. inner elaboration loop at bottom goal.
  32408. Retracting elaborate*copy-see-to-output-link
  32409. -->
  32410. (I3 ^see 0 +)
  32411. Retracting propose*predict-no
  32412. -->
  32413. (O2246 ^name predict-no +)
  32414. (S1 ^operator O2246 +)
  32415. Retracting propose*predict-yes
  32416. -->
  32417. (O2245 ^name predict-yes +)
  32418. (S1 ^operator O2245 +)
  32419. Retracting elaborate*reward*based*on*reward
  32420. -->
  32421. (R1126 ^value 1 +)
  32422. (R1 ^reward R1126 +)
  32423. Retracting elaborate*copy-dir-to-output-link
  32424. -->
  32425. (I3 ^dir R +)
  32426. Retracting rl*prefer*rvt*predict-no*H0*4
  32427. -->
  32428. (S1 ^operator O2246 = 0.3397852767825768)
  32429. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*35
  32430. -->
  32431. (S1 ^operator O2246 = 0.6602247488387273)
  32432. Retracting rl*prefer*rvt*predict-yes*H0*3
  32433. -->
  32434. (S1 ^operator O2245 = 0.3377036069148361)
  32435. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*36
  32436. -->
  32437. (S1 ^operator O2245 = -0.1070236389116304)
  32438. =>WM: (15841: S1 ^operator O2248 +)
  32439. =>WM: (15840: S1 ^operator O2247 +)
  32440. =>WM: (15839: I3 ^dir U)
  32441. =>WM: (15838: O2248 ^name predict-no)
  32442. =>WM: (15837: O2247 ^name predict-yes)
  32443. =>WM: (15836: R1127 ^value 1)
  32444. =>WM: (15835: R1 ^reward R1127)
  32445. <=WM: (15826: S1 ^operator O2245 +)
  32446. <=WM: (15827: S1 ^operator O2246 +)
  32447. <=WM: (15828: S1 ^operator O2246)
  32448. <=WM: (15825: I3 ^dir R)
  32449. <=WM: (15821: R1 ^reward R1126)
  32450. <=WM: (15824: O2246 ^name predict-no)
  32451. <=WM: (15823: O2245 ^name predict-yes)
  32452. <=WM: (15822: R1126 ^value 1)
  32453. --- Inner Elaboration Phase, active level 1 (S1) ---
  32454. Firing prefer*rvt*predict-yes*H0
  32455. -->
  32456. Firing rl*prefer*rvt*predict-yes*H0*1
  32457. -->
  32458. (S1 ^operator O2247 = 0.)
  32459. Firing prefer*rvt*predict-no*H0
  32460. -->
  32461. Firing rl*prefer*rvt*predict-no*H0*2
  32462. -->
  32463. (S1 ^operator O2248 = 1.)
  32464. inner elaboration loop at bottom goal.
  32465. Retracting rl*prefer*rvt*predict-no*H0*2
  32466. -->
  32467. (S1 ^operator O2246 = 1.)
  32468. Retracting rl*prefer*rvt*predict-yes*H0*1
  32469. -->
  32470. (S1 ^operator O2245 = 0.)
  32471. --- END Proposal Phase ---
  32472. --- Decision Phase ---
  32473. RL update rl*prefer*rvt*predict-no*H0*4 0.570269 -0.230484 0.339785 -> 0.570269 -0.230484 0.339784(R,m,v=1,0.888889,0.0992908)
  32474. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*35 0.429741 0.230484 0.660225 -> 0.42974 0.230484 0.660224(R,m,v=1,1,0)
  32475. =>WM: (15842: S1 ^operator O2248)
  32476. 1124: O: O2248 (predict-no)
  32477. --- END Decision Phase ---
  32478. --- Application Phase ---
  32479. --- Firing Productions (PE) For State At Depth 1 ---
  32480. --- Inner Elaboration Phase, active level 1 (S1) ---
  32481. Firing apply*operator
  32482. -->
  32483. (I3 ^predict-no N1124 + :O )
  32484. Firing apply*operator*complete
  32485. -->
  32486. (I3 ^predict-no N1123 - :O )
  32487. inner elaboration loop at bottom goal.
  32488. --- Change Working Memory (PE) ---
  32489. =>WM: (15843: I3 ^predict-no N1124)
  32490. <=WM: (15830: N1123 ^status complete)
  32491. <=WM: (15829: I3 ^predict-no N1123)
  32492. --- Firing Productions (IE) For State At Depth 1 ---
  32493. --- Inner Elaboration Phase, active level 1 (S1) ---
  32494. Firing monitor*world
  32495. -->
  32496. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32497. --- Change Working Memory (IE) ---
  32498. --- END Application Phase ---
  32499. --- Output Phase ---
  32500. ENV: Agent did: predict-no for direction U in state State-B
  32501. In State-B moving U
  32502. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32503. predict error 0
  32504. dir: dir isR
  32505. --- END Output Phase ---
  32506. \-/--- Input Phase ---
  32507. =>WM: (15847: I2 ^dir R)
  32508. =>WM: (15846: I2 ^reward 1)
  32509. =>WM: (15845: I2 ^see 0)
  32510. =>WM: (15844: N1124 ^status complete)
  32511. <=WM: (15833: I2 ^dir U)
  32512. <=WM: (15832: I2 ^reward 1)
  32513. <=WM: (15831: I2 ^see 0)
  32514. =>WM: (15848: I2 ^level-1 R0-root)
  32515. <=WM: (15834: I2 ^level-1 R0-root)
  32516. --- END Input Phase ---
  32517. --- Proposal Phase ---
  32518. --- Inner Elaboration Phase, active level 1 (S1) ---
  32519. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32520. -->
  32521. (S1 ^operator O2248 = 0.6601926791747813)
  32522. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32523. -->
  32524. (S1 ^operator O2247 = -0.1028953566115423)
  32525. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32526. -->
  32527. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32528. -->
  32529. Firing elaborate*copy-see-to-output-link
  32530. -->
  32531. (I3 ^see 0 +)
  32532. Firing elaborate*reward*based*on*reward
  32533. -->
  32534. (R1128 ^value 1 +)
  32535. (R1 ^reward R1128 +)
  32536. Firing propose*predict-yes
  32537. -->
  32538. (O2249 ^name predict-yes +)
  32539. (S1 ^operator O2249 +)
  32540. Firing propose*predict-no
  32541. -->
  32542. (O2250 ^name predict-no +)
  32543. (S1 ^operator O2250 +)
  32544. Firing rl*prefer*rvt*predict-no*H0*4
  32545. -->
  32546. (S1 ^operator O2248 = 0.3397844736723834)
  32547. Firing rl*prefer*rvt*predict-yes*H0*3
  32548. -->
  32549. (S1 ^operator O2247 = 0.3377036069148361)
  32550. Firing prefer*rvt*predict-yes*H0
  32551. -->
  32552. Firing prefer*rvt*predict-no*H0
  32553. -->
  32554. Firing elaborate*copy-dir-to-output-link
  32555. -->
  32556. (I3 ^dir R +)
  32557. inner elaboration loop at bottom goal.
  32558. Retracting elaborate*copy-see-to-output-link
  32559. -->
  32560. (I3 ^see 0 +)
  32561. Retracting propose*predict-no
  32562. -->
  32563. (O2248 ^name predict-no +)
  32564. (S1 ^operator O2248 +)
  32565. Retracting propose*predict-yes
  32566. -->
  32567. (O2247 ^name predict-yes +)
  32568. (S1 ^operator O2247 +)
  32569. Retracting elaborate*reward*based*on*reward
  32570. -->
  32571. (R1127 ^value 1 +)
  32572. (R1 ^reward R1127 +)
  32573. Retracting elaborate*copy-dir-to-output-link
  32574. -->
  32575. (I3 ^dir U +)
  32576. Retracting rl*prefer*rvt*predict-no*H0*2
  32577. -->
  32578. (S1 ^operator O2248 = 1.)
  32579. Retracting rl*prefer*rvt*predict-yes*H0*1
  32580. -->
  32581. (S1 ^operator O2247 = 0.)
  32582. =>WM: (15855: S1 ^operator O2250 +)
  32583. =>WM: (15854: S1 ^operator O2249 +)
  32584. =>WM: (15853: I3 ^dir R)
  32585. =>WM: (15852: O2250 ^name predict-no)
  32586. =>WM: (15851: O2249 ^name predict-yes)
  32587. =>WM: (15850: R1128 ^value 1)
  32588. =>WM: (15849: R1 ^reward R1128)
  32589. <=WM: (15840: S1 ^operator O2247 +)
  32590. <=WM: (15841: S1 ^operator O2248 +)
  32591. <=WM: (15842: S1 ^operator O2248)
  32592. <=WM: (15839: I3 ^dir U)
  32593. <=WM: (15835: R1 ^reward R1127)
  32594. <=WM: (15838: O2248 ^name predict-no)
  32595. <=WM: (15837: O2247 ^name predict-yes)
  32596. <=WM: (15836: R1127 ^value 1)
  32597. --- Inner Elaboration Phase, active level 1 (S1) ---
  32598. Firing prefer*rvt*predict-yes*H0
  32599. -->
  32600. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32601. -->
  32602. (S1 ^operator O2249 = -0.1028953566115423)
  32603. Firing rl*prefer*rvt*predict-yes*H0*3
  32604. -->
  32605. (S1 ^operator O2249 = 0.3377036069148361)
  32606. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32607. -->
  32608. Firing prefer*rvt*predict-no*H0
  32609. -->
  32610. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32611. -->
  32612. (S1 ^operator O2250 = 0.6601926791747813)
  32613. Firing rl*prefer*rvt*predict-no*H0*4
  32614. -->
  32615. (S1 ^operator O2250 = 0.3397844736723834)
  32616. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32617. -->
  32618. inner elaboration loop at bottom goal.
  32619. Retracting rl*prefer*rvt*predict-no*H0*4
  32620. -->
  32621. (S1 ^operator O2248 = 0.3397844736723834)
  32622. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32623. -->
  32624. (S1 ^operator O2248 = 0.6601926791747813)
  32625. Retracting rl*prefer*rvt*predict-yes*H0*3
  32626. -->
  32627. (S1 ^operator O2247 = 0.3377036069148361)
  32628. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32629. -->
  32630. (S1 ^operator O2247 = -0.1028953566115423)
  32631. --- END Proposal Phase ---
  32632. --- Decision Phase ---
  32633. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32634. =>WM: (15856: S1 ^operator O2250)
  32635. 1125: O: O2250 (predict-no)
  32636. --- END Decision Phase ---
  32637. --- Application Phase ---
  32638. --- Firing Productions (PE) For State At Depth 1 ---
  32639. --- Inner Elaboration Phase, active level 1 (S1) ---
  32640. Firing apply*operator
  32641. -->
  32642. (I3 ^predict-no N1125 + :O )
  32643. Firing apply*operator*complete
  32644. -->
  32645. (I3 ^predict-no N1124 - :O )
  32646. inner elaboration loop at bottom goal.
  32647. --- Change Working Memory (PE) ---
  32648. =>WM: (15857: I3 ^predict-no N1125)
  32649. <=WM: (15844: N1124 ^status complete)
  32650. <=WM: (15843: I3 ^predict-no N1124)
  32651. --- Firing Productions (IE) For State At Depth 1 ---
  32652. --- Inner Elaboration Phase, active level 1 (S1) ---
  32653. Firing monitor*world
  32654. -->
  32655. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32656. --- Change Working Memory (IE) ---
  32657. --- END Application Phase ---
  32658. --- Output Phase ---
  32659. ENV: Agent did: predict-no for direction R in state State-B
  32660. In State-B moving R
  32661. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32662. predict error 0
  32663. dir: dir isR
  32664. --- END Output Phase ---
  32665. |\---- Input Phase ---
  32666. =>WM: (15861: I2 ^dir R)
  32667. =>WM: (15860: I2 ^reward 1)
  32668. =>WM: (15859: I2 ^see 0)
  32669. =>WM: (15858: N1125 ^status complete)
  32670. <=WM: (15847: I2 ^dir R)
  32671. <=WM: (15846: I2 ^reward 1)
  32672. <=WM: (15845: I2 ^see 0)
  32673. =>WM: (15862: I2 ^level-1 R0-root)
  32674. <=WM: (15848: I2 ^level-1 R0-root)
  32675. --- END Input Phase ---
  32676. --- Proposal Phase ---
  32677. --- Inner Elaboration Phase, active level 1 (S1) ---
  32678. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32679. -->
  32680. (S1 ^operator O2250 = 0.6601926791747813)
  32681. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32682. -->
  32683. (S1 ^operator O2249 = -0.1028953566115423)
  32684. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32685. -->
  32686. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32687. -->
  32688. Firing elaborate*copy-see-to-output-link
  32689. -->
  32690. (I3 ^see 0 +)
  32691. Firing elaborate*reward*based*on*reward
  32692. -->
  32693. (R1129 ^value 1 +)
  32694. (R1 ^reward R1129 +)
  32695. Firing propose*predict-yes
  32696. -->
  32697. (O2251 ^name predict-yes +)
  32698. (S1 ^operator O2251 +)
  32699. Firing propose*predict-no
  32700. -->
  32701. (O2252 ^name predict-no +)
  32702. (S1 ^operator O2252 +)
  32703. Firing rl*prefer*rvt*predict-no*H0*4
  32704. -->
  32705. (S1 ^operator O2250 = 0.3397844736723834)
  32706. Firing rl*prefer*rvt*predict-yes*H0*3
  32707. -->
  32708. (S1 ^operator O2249 = 0.3377036069148361)
  32709. Firing prefer*rvt*predict-yes*H0
  32710. -->
  32711. Firing prefer*rvt*predict-no*H0
  32712. -->
  32713. Firing elaborate*copy-dir-to-output-link
  32714. -->
  32715. (I3 ^dir R +)
  32716. inner elaboration loop at bottom goal.
  32717. Retracting elaborate*copy-see-to-output-link
  32718. -->
  32719. (I3 ^see 0 +)
  32720. Retracting propose*predict-no
  32721. -->
  32722. (O2250 ^name predict-no +)
  32723. (S1 ^operator O2250 +)
  32724. Retracting propose*predict-yes
  32725. -->
  32726. (O2249 ^name predict-yes +)
  32727. (S1 ^operator O2249 +)
  32728. Retracting elaborate*reward*based*on*reward
  32729. -->
  32730. (R1128 ^value 1 +)
  32731. (R1 ^reward R1128 +)
  32732. Retracting elaborate*copy-dir-to-output-link
  32733. -->
  32734. (I3 ^dir R +)
  32735. Retracting rl*prefer*rvt*predict-no*H0*4
  32736. -->
  32737. (S1 ^operator O2250 = 0.3397844736723834)
  32738. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32739. -->
  32740. (S1 ^operator O2250 = 0.6601926791747813)
  32741. Retracting rl*prefer*rvt*predict-yes*H0*3
  32742. -->
  32743. (S1 ^operator O2249 = 0.3377036069148361)
  32744. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32745. -->
  32746. (S1 ^operator O2249 = -0.1028953566115423)
  32747. =>WM: (15868: S1 ^operator O2252 +)
  32748. =>WM: (15867: S1 ^operator O2251 +)
  32749. =>WM: (15866: O2252 ^name predict-no)
  32750. =>WM: (15865: O2251 ^name predict-yes)
  32751. =>WM: (15864: R1129 ^value 1)
  32752. =>WM: (15863: R1 ^reward R1129)
  32753. <=WM: (15854: S1 ^operator O2249 +)
  32754. <=WM: (15855: S1 ^operator O2250 +)
  32755. <=WM: (15856: S1 ^operator O2250)
  32756. <=WM: (15849: R1 ^reward R1128)
  32757. <=WM: (15852: O2250 ^name predict-no)
  32758. <=WM: (15851: O2249 ^name predict-yes)
  32759. <=WM: (15850: R1128 ^value 1)
  32760. --- Inner Elaboration Phase, active level 1 (S1) ---
  32761. Firing prefer*rvt*predict-yes*H0
  32762. -->
  32763. Firing rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32764. -->
  32765. (S1 ^operator O2251 = -0.1028953566115423)
  32766. Firing rl*prefer*rvt*predict-yes*H0*3
  32767. -->
  32768. (S1 ^operator O2251 = 0.3377036069148361)
  32769. Firing prefer*rvt*predict-yes*H0*3*v1*H1
  32770. -->
  32771. Firing prefer*rvt*predict-no*H0
  32772. -->
  32773. Firing rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32774. -->
  32775. (S1 ^operator O2252 = 0.6601926791747813)
  32776. Firing rl*prefer*rvt*predict-no*H0*4
  32777. -->
  32778. (S1 ^operator O2252 = 0.3397844736723834)
  32779. Firing prefer*rvt*predict-no*H0*4*v1*H1
  32780. -->
  32781. inner elaboration loop at bottom goal.
  32782. Retracting rl*prefer*rvt*predict-no*H0*4
  32783. -->
  32784. (S1 ^operator O2250 = 0.3397844736723834)
  32785. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32786. -->
  32787. (S1 ^operator O2250 = 0.6601926791747813)
  32788. Retracting rl*prefer*rvt*predict-yes*H0*3
  32789. -->
  32790. (S1 ^operator O2249 = 0.3377036069148361)
  32791. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32792. -->
  32793. (S1 ^operator O2249 = -0.1028953566115423)
  32794. --- END Proposal Phase ---
  32795. --- Decision Phase ---
  32796. RL update rl*prefer*rvt*predict-no*H0*4 0.570269 -0.230484 0.339784 -> 0.570271 -0.230484 0.339786(R,m,v=1,0.889474,0.0988304)
  32797. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.429707 0.230485 0.660193 -> 0.42971 0.230485 0.660195(R,m,v=1,1,0)
  32798. =>WM: (15869: S1 ^operator O2252)
  32799. 1126: O: O2252 (predict-no)
  32800. --- END Decision Phase ---
  32801. --- Application Phase ---
  32802. --- Firing Productions (PE) For State At Depth 1 ---
  32803. --- Inner Elaboration Phase, active level 1 (S1) ---
  32804. Firing apply*operator
  32805. -->
  32806. (I3 ^predict-no N1126 + :O )
  32807. Firing apply*operator*complete
  32808. -->
  32809. (I3 ^predict-no N1125 - :O )
  32810. inner elaboration loop at bottom goal.
  32811. --- Change Working Memory (PE) ---
  32812. =>WM: (15870: I3 ^predict-no N1126)
  32813. <=WM: (15858: N1125 ^status complete)
  32814. <=WM: (15857: I3 ^predict-no N1125)
  32815. --- Firing Productions (IE) For State At Depth 1 ---
  32816. --- Inner Elaboration Phase, active level 1 (S1) ---
  32817. Firing monitor*world
  32818. -->
  32819. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32820. --- Change Working Memory (IE) ---
  32821. --- END Application Phase ---
  32822. --- Output Phase ---
  32823. ENV: Agent did: predict-no for direction R in state State-B
  32824. In State-B moving R
  32825. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32826. predict error 0
  32827. dir: dir isL
  32828. --- END Output Phase ---
  32829. /|\--- Input Phase ---
  32830. =>WM: (15874: I2 ^dir L)
  32831. =>WM: (15873: I2 ^reward 1)
  32832. =>WM: (15872: I2 ^see 0)
  32833. =>WM: (15871: N1126 ^status complete)
  32834. <=WM: (15861: I2 ^dir R)
  32835. <=WM: (15860: I2 ^reward 1)
  32836. <=WM: (15859: I2 ^see 0)
  32837. =>WM: (15875: I2 ^level-1 R0-root)
  32838. <=WM: (15862: I2 ^level-1 R0-root)
  32839. --- END Input Phase ---
  32840. --- Proposal Phase ---
  32841. --- Inner Elaboration Phase, active level 1 (S1) ---
  32842. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  32843. -->
  32844. (S1 ^operator O2251 = 0.7359439630202296)
  32845. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  32846. -->
  32847. Firing elaborate*copy-see-to-output-link
  32848. -->
  32849. (I3 ^see 0 +)
  32850. Firing elaborate*reward*based*on*reward
  32851. -->
  32852. (R1130 ^value 1 +)
  32853. (R1 ^reward R1130 +)
  32854. Firing propose*predict-yes
  32855. -->
  32856. (O2253 ^name predict-yes +)
  32857. (S1 ^operator O2253 +)
  32858. Firing propose*predict-no
  32859. -->
  32860. (O2254 ^name predict-no +)
  32861. (S1 ^operator O2254 +)
  32862. Firing rl*prefer*rvt*predict-no*H0*6
  32863. -->
  32864. (S1 ^operator O2252 = 0.9954176416497964)
  32865. Firing rl*prefer*rvt*predict-yes*H0*5
  32866. -->
  32867. (S1 ^operator O2251 = 0.2640074592567178)
  32868. Firing prefer*rvt*predict-yes*H0
  32869. -->
  32870. Firing prefer*rvt*predict-no*H0
  32871. -->
  32872. Firing elaborate*copy-dir-to-output-link
  32873. -->
  32874. (I3 ^dir L +)
  32875. inner elaboration loop at bottom goal.
  32876. Retracting elaborate*copy-see-to-output-link
  32877. -->
  32878. (I3 ^see 0 +)
  32879. Retracting propose*predict-no
  32880. -->
  32881. (O2252 ^name predict-no +)
  32882. (S1 ^operator O2252 +)
  32883. Retracting propose*predict-yes
  32884. -->
  32885. (O2251 ^name predict-yes +)
  32886. (S1 ^operator O2251 +)
  32887. Retracting elaborate*reward*based*on*reward
  32888. -->
  32889. (R1129 ^value 1 +)
  32890. (R1 ^reward R1129 +)
  32891. Retracting elaborate*copy-dir-to-output-link
  32892. -->
  32893. (I3 ^dir R +)
  32894. Retracting rl*prefer*rvt*predict-no*H0*4
  32895. -->
  32896. (S1 ^operator O2252 = 0.3397863023153158)
  32897. Retracting rl*prefer*rvt*predict-no*H0*4*v1*H1*39
  32898. -->
  32899. (S1 ^operator O2252 = 0.6601948017165659)
  32900. Retracting rl*prefer*rvt*predict-yes*H0*3
  32901. -->
  32902. (S1 ^operator O2251 = 0.3377036069148361)
  32903. Retracting rl*prefer*rvt*predict-yes*H0*3*v1*H1*40
  32904. -->
  32905. (S1 ^operator O2251 = -0.1028953566115423)
  32906. =>WM: (15882: S1 ^operator O2254 +)
  32907. =>WM: (15881: S1 ^operator O2253 +)
  32908. =>WM: (15880: I3 ^dir L)
  32909. =>WM: (15879: O2254 ^name predict-no)
  32910. =>WM: (15878: O2253 ^name predict-yes)
  32911. =>WM: (15877: R1130 ^value 1)
  32912. =>WM: (15876: R1 ^reward R1130)
  32913. <=WM: (15867: S1 ^operator O2251 +)
  32914. <=WM: (15868: S1 ^operator O2252 +)
  32915. <=WM: (15869: S1 ^operator O2252)
  32916. <=WM: (15853: I3 ^dir R)
  32917. <=WM: (15863: R1 ^reward R1129)
  32918. <=WM: (15866: O2252 ^name predict-no)
  32919. <=WM: (15865: O2251 ^name predict-yes)
  32920. <=WM: (15864: R1129 ^value 1)
  32921. --- Inner Elaboration Phase, active level 1 (S1) ---
  32922. Firing prefer*rvt*predict-yes*H0
  32923. -->
  32924. Firing rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  32925. -->
  32926. (S1 ^operator O2253 = 0.7359439630202296)
  32927. Firing rl*prefer*rvt*predict-yes*H0*5
  32928. -->
  32929. (S1 ^operator O2253 = 0.2640074592567178)
  32930. Firing prefer*rvt*predict-yes*H0*5*v1*H1
  32931. -->
  32932. Firing prefer*rvt*predict-no*H0
  32933. -->
  32934. Firing rl*prefer*rvt*predict-no*H0*6
  32935. -->
  32936. (S1 ^operator O2254 = 0.9954176416497964)
  32937. inner elaboration loop at bottom goal.
  32938. Retracting rl*prefer*rvt*predict-no*H0*6
  32939. -->
  32940. (S1 ^operator O2252 = 0.9954176416497964)
  32941. Retracting rl*prefer*rvt*predict-yes*H0*5
  32942. -->
  32943. (S1 ^operator O2251 = 0.2640074592567178)
  32944. Retracting rl*prefer*rvt*predict-yes*H0*5*v1*H1*30
  32945. -->
  32946. (S1 ^operator O2251 = 0.7359439630202296)
  32947. --- END Proposal Phase ---
  32948. --- Decision Phase ---
  32949. RL update rl*prefer*rvt*predict-no*H0*4 0.570271 -0.230484 0.339786 -> 0.570272 -0.230484 0.339788(R,m,v=1,0.890052,0.0983742)
  32950. RL update rl*prefer*rvt*predict-no*H0*4*v1*H1*39 0.42971 0.230485 0.660195 -> 0.429711 0.230485 0.660197(R,m,v=1,1,0)
  32951. =>WM: (15883: S1 ^operator O2253)
  32952. 1127: O: O2253 (predict-yes)
  32953. --- END Decision Phase ---
  32954. --- Application Phase ---
  32955. --- Firing Productions (PE) For State At Depth 1 ---
  32956. --- Inner Elaboration Phase, active level 1 (S1) ---
  32957. Firing apply*operator
  32958. -->
  32959. (I3 ^predict-yes N1127 + :O )
  32960. Firing apply*operator*complete
  32961. -->
  32962. (I3 ^predict-no N1126 - :O )
  32963. inner elaboration loop at bottom goal.
  32964. --- Change Working Memory (PE) ---
  32965. =>WM: (15884: I3 ^predict-yes N1127)
  32966. <=WM