/flipv2/20121112-101138-2.5K-ReLST-Evan/stdout-flip-2.5K_2.txt

https://bitbucket.org/evan13579b/soar-ziggurat

  1. Seeding... 2
  2. dir: dir isU
  3. Python-Soar Flip environment.
  4. To accept commands from an external sml process, you'll need to
  5. type 'slave <log file> <n decisons>' at the prompt...
  6. sourcing 'flip_predict.soar'
  7. ***********
  8. Total: 11 productions sourced.
  9. seeding Soar with 2 ...
  10. soar> Entering slave mode:
  11. - log file 'rl-slave-2.5K_2.log'....
  12. - will exit slave mode after 2500 decisions
  13. waiting for commands from an externally connected sml process...
  14. -/|sleeping...
  15. \sleeping...
  16. -sleeping...
  17. /sleeping...
  18. |sleeping...
  19. \-/|\-/|\-sleeping...
  20. /|\-/|\sleeping...
  21. -1: O: O1 (predict-yes)
  22. I see 0 and I'm going to do: predict-yes
  23. ENV: Agent did: predict-yes for direction U in state State-A
  24. In State-A moving U
  25. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  26. predict error 1
  27. dir: dir isU
  28. rule alias: '*'
  29. rule alias: '*'
  30. /|\-/|\-2: O: O4 (predict-no)
  31. I see 0 and I'm going to do: predict-no
  32. ENV: Agent did: predict-no for direction U in state State-A
  33. In State-A moving U
  34. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  35. predict error 0
  36. dir: dir isL
  37. /|\-3: O: O5 (predict-yes)
  38. I see 1 and I'm going to do: predict-yes
  39. ENV: Agent did: predict-yes for direction L in state State-A
  40. In State-A moving L
  41. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  42. predict error 1
  43. dir: dir isL
  44. /|\4: O: O7 (predict-yes)
  45. I see 0 and I'm going to do: predict-yes
  46. ENV: Agent did: predict-yes for direction L in state State-A
  47. In State-A moving L
  48. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  49. predict error 1
  50. dir: dir isU
  51. -5: O: O10 (predict-no)
  52. I see 0 and I'm going to do: predict-no
  53. ENV: Agent did: predict-no for direction U in state State-A
  54. In State-A moving U
  55. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  56. predict error 0
  57. dir: dir isU
  58. /|\6: O: O12 (predict-no)
  59. I see 1 and I'm going to do: predict-no
  60. ENV: Agent did: predict-no for direction U in state State-A
  61. In State-A moving U
  62. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  63. predict error 0
  64. dir: dir isU
  65. -/|7: O: O14 (predict-no)
  66. I see 1 and I'm going to do: predict-no
  67. ENV: Agent did: predict-no for direction U in state State-A
  68. In State-A moving U
  69. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  70. predict error 0
  71. dir: dir isL
  72. \8: O: O15 (predict-yes)
  73. I see 1 and I'm going to do: predict-yes
  74. ENV: Agent did: predict-yes for direction L in state State-A
  75. In State-A moving L
  76. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  77. predict error 1
  78. dir: dir isR
  79. -/|9: O: O17 (predict-yes)
  80. I see 0 and I'm going to do: predict-yes
  81. ENV: Agent did: predict-yes for direction R in state State-A
  82. In State-A moving R
  83. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  84. predict error 0
  85. dir: dir isR
  86. \-/10: O: O19 (predict-yes)
  87. I see 1 and I'm going to do: predict-yes
  88. ENV: Agent did: predict-yes for direction R in state State-B
  89. In State-B moving R
  90. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  91. predict error 1
  92. dir: dir isR
  93. |\-11: O: O21 (predict-yes)
  94. I see 0 and I'm going to do: predict-yes
  95. ENV: Agent did: predict-yes for direction R in state State-B
  96. In State-B moving R
  97. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  98. predict error 1
  99. dir: dir isL
  100. rule alias: '*'
  101. rule alias: '*'
  102. rule alias: '*'
  103. rule alias: '*'
  104. /12: O: O24 (predict-no)
  105. I see 0 and I'm going to do: predict-no
  106. ENV: Agent did: predict-no for direction L in state State-B
  107. In State-B moving L
  108. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  109. predict error 1
  110. dir: dir isR
  111. |\-13: O: O25 (predict-yes)
  112. I see 0 and I'm going to do: predict-yes
  113. ENV: Agent did: predict-yes for direction R in state State-A
  114. In State-A moving R
  115. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  116. predict error 0
  117. dir: dir isR
  118. /|\14: O: O27 (predict-yes)
  119. I see 1 and I'm going to do: predict-yes
  120. ENV: Agent did: predict-yes for direction R in state State-B
  121. In State-B moving R
  122. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  123. predict error 1
  124. dir: dir isU
  125. -/15: O: O30 (predict-no)
  126. I see 0 and I'm going to do: predict-no
  127. ENV: Agent did: predict-no for direction U in state State-B
  128. In State-B moving U
  129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  130. predict error 0
  131. dir: dir isU
  132. |\16: O: O32 (predict-no)
  133. I see 1 and I'm going to do: predict-no
  134. ENV: Agent did: predict-no for direction U in state State-B
  135. In State-B moving U
  136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  137. predict error 0
  138. dir: dir isU
  139. -/|17: O: O34 (predict-no)
  140. I see 1 and I'm going to do: predict-no
  141. ENV: Agent did: predict-no for direction U in state State-B
  142. In State-B moving U
  143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  144. predict error 0
  145. dir: dir isR
  146. \18: O: O35 (predict-yes)
  147. I see 1 and I'm going to do: predict-yes
  148. ENV: Agent did: predict-yes for direction R in state State-B
  149. In State-B moving R
  150. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  151. predict error 1
  152. dir: dir isR
  153. -/|19: O: O37 (predict-yes)
  154. I see 0 and I'm going to do: predict-yes
  155. ENV: Agent did: predict-yes for direction R in state State-B
  156. In State-B moving R
  157. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  158. predict error 1
  159. dir: dir isL
  160. \-/20: O: O39 (predict-yes)
  161. I see 0 and I'm going to do: predict-yes
  162. ENV: Agent did: predict-yes for direction L in state State-B
  163. In State-B moving L
  164. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  165. predict error 0
  166. dir: dir isL
  167. |\-21: O: O42 (predict-no)
  168. I see 1 and I'm going to do: predict-no
  169. ENV: Agent did: predict-no for direction L in state State-A
  170. In State-A moving L
  171. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  172. predict error 0
  173. dir: dir isL
  174. /22: O: O43 (predict-yes)
  175. I see 1 and I'm going to do: predict-yes
  176. ENV: Agent did: predict-yes for direction L in state State-A
  177. In State-A moving L
  178. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  179. predict error 1
  180. dir: dir isR
  181. |\-23: O: O45 (predict-yes)
  182. I see 0 and I'm going to do: predict-yes
  183. ENV: Agent did: predict-yes for direction R in state State-A
  184. In State-A moving R
  185. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  186. predict error 0
  187. dir: dir isL
  188. /|\24: O: O48 (predict-no)
  189. I see 1 and I'm going to do: predict-no
  190. ENV: Agent did: predict-no for direction L in state State-B
  191. In State-B moving L
  192. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  193. predict error 1
  194. dir: dir isR
  195. -/25: O: O49 (predict-yes)
  196. I see 0 and I'm going to do: predict-yes
  197. ENV: Agent did: predict-yes for direction R in state State-A
  198. In State-A moving R
  199. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  200. predict error 0
  201. dir: dir isU
  202. |\-26: O: O52 (predict-no)
  203. I see 1 and I'm going to do: predict-no
  204. ENV: Agent did: predict-no for direction U in state State-B
  205. In State-B moving U
  206. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  207. predict error 0
  208. dir: dir isR
  209. /|27: O: O53 (predict-yes)
  210. I see 1 and I'm going to do: predict-yes
  211. ENV: Agent did: predict-yes for direction R in state State-B
  212. In State-B moving R
  213. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  214. predict error 1
  215. dir: dir isR
  216. \-/28: O: O55 (predict-yes)
  217. I see 0 and I'm going to do: predict-yes
  218. ENV: Agent did: predict-yes for direction R in state State-B
  219. In State-B moving R
  220. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  221. predict error 1
  222. dir: dir isL
  223. |\-29: O: O58 (predict-no)
  224. I see 0 and I'm going to do: predict-no
  225. ENV: Agent did: predict-no for direction L in state State-B
  226. In State-B moving L
  227. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  228. predict error 1
  229. dir: dir isL
  230. /30: O: O60 (predict-no)
  231. I see 0 and I'm going to do: predict-no
  232. ENV: Agent did: predict-no for direction L in state State-A
  233. In State-A moving L
  234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  235. predict error 0
  236. dir: dir isL
  237. |\31: O: O61 (predict-yes)
  238. I see 1 and I'm going to do: predict-yes
  239. ENV: Agent did: predict-yes for direction L in state State-A
  240. In State-A moving L
  241. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  242. predict error 1
  243. dir: dir isL
  244. -32: O: O64 (predict-no)
  245. I see 0 and I'm going to do: predict-no
  246. ENV: Agent did: predict-no for direction L in state State-A
  247. In State-A moving L
  248. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  249. predict error 0
  250. dir: dir isR
  251. /|\33: O: O65 (predict-yes)
  252. I see 1 and I'm going to do: predict-yes
  253. ENV: Agent did: predict-yes for direction R in state State-A
  254. In State-A moving R
  255. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  256. predict error 0
  257. dir: dir isU
  258. -34: O: O68 (predict-no)
  259. I see 1 and I'm going to do: predict-no
  260. ENV: Agent did: predict-no for direction U in state State-B
  261. In State-B moving U
  262. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  263. predict error 0
  264. dir: dir isU
  265. /|\35: O: O70 (predict-no)
  266. I see 1 and I'm going to do: predict-no
  267. ENV: Agent did: predict-no for direction U in state State-B
  268. In State-B moving U
  269. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  270. predict error 0
  271. dir: dir isL
  272. -36: O: O72 (predict-no)
  273. I see 1 and I'm going to do: predict-no
  274. ENV: Agent did: predict-no for direction L in state State-B
  275. In State-B moving L
  276. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  277. predict error 1
  278. dir: dir isU
  279. /|\37: O: O74 (predict-no)
  280. I see 0 and I'm going to do: predict-no
  281. ENV: Agent did: predict-no for direction U in state State-A
  282. In State-A moving U
  283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  284. predict error 0
  285. dir: dir isU
  286. -/|38: O: O76 (predict-no)
  287. I see 1 and I'm going to do: predict-no
  288. ENV: Agent did: predict-no for direction U in state State-A
  289. In State-A moving U
  290. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  291. predict error 0
  292. dir: dir isU
  293. \-/39: O: O77 (predict-yes)
  294. I see 1 and I'm going to do: predict-yes
  295. ENV: Agent did: predict-yes for direction U in state State-A
  296. In State-A moving U
  297. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  298. predict error 1
  299. dir: dir isU
  300. |\-40: O: O79 (predict-yes)
  301. I see 0 and I'm going to do: predict-yes
  302. ENV: Agent did: predict-yes for direction U in state State-A
  303. In State-A moving U
  304. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  305. predict error 1
  306. dir: dir isU
  307. /|\41: O: O82 (predict-no)
  308. I see 0 and I'm going to do: predict-no
  309. ENV: Agent did: predict-no for direction U in state State-A
  310. In State-A moving U
  311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  312. predict error 0
  313. dir: dir isU
  314. -42: O: O84 (predict-no)
  315. I see 1 and I'm going to do: predict-no
  316. ENV: Agent did: predict-no for direction U in state State-A
  317. In State-A moving U
  318. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  319. predict error 0
  320. dir: dir isR
  321. /|\43: O: O85 (predict-yes)
  322. I see 1 and I'm going to do: predict-yes
  323. ENV: Agent did: predict-yes for direction R in state State-A
  324. In State-A moving R
  325. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  326. predict error 0
  327. dir: dir isU
  328. -/|44: O: O87 (predict-yes)
  329. I see 1 and I'm going to do: predict-yes
  330. ENV: Agent did: predict-yes for direction U in state State-B
  331. In State-B moving U
  332. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  333. predict error 1
  334. dir: dir isU
  335. \-/45: O: O90 (predict-no)
  336. I see 0 and I'm going to do: predict-no
  337. ENV: Agent did: predict-no for direction U in state State-B
  338. In State-B moving U
  339. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  340. predict error 0
  341. dir: dir isL
  342. |\46: O: O92 (predict-no)
  343. I see 1 and I'm going to do: predict-no
  344. ENV: Agent did: predict-no for direction L in state State-B
  345. In State-B moving L
  346. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  347. predict error 1
  348. dir: dir isU
  349. -/|47: O: O94 (predict-no)
  350. I see 0 and I'm going to do: predict-no
  351. ENV: Agent did: predict-no for direction U in state State-A
  352. In State-A moving U
  353. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  354. predict error 0
  355. dir: dir isU
  356. \-48: O: O96 (predict-no)
  357. I see 1 and I'm going to do: predict-no
  358. ENV: Agent did: predict-no for direction U in state State-A
  359. In State-A moving U
  360. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  361. predict error 0
  362. dir: dir isR
  363. /|49: O: O97 (predict-yes)
  364. I see 1 and I'm going to do: predict-yes
  365. ENV: Agent did: predict-yes for direction R in state State-A
  366. In State-A moving R
  367. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  368. predict error 0
  369. dir: dir isR
  370. \-/50: O: O99 (predict-yes)
  371. I see 1 and I'm going to do: predict-yes
  372. ENV: Agent did: predict-yes for direction R in state State-B
  373. In State-B moving R
  374. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  375. predict error 1
  376. dir: dir isR
  377. |\-/|\-sleeping...
  378. /51: O: O101 (predict-yes)
  379. I see 0 and I'm going to do: predict-yes
  380. ENV: Agent did: predict-yes for direction R in state State-B
  381. In State-B moving R
  382. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  383. predict error 1
  384. dir: dir isU
  385. rule alias: '*'
  386. |52: O: O104 (predict-no)
  387. I see 0 and I'm going to do: predict-no
  388. ENV: Agent did: predict-no for direction U in state State-B
  389. In State-B moving U
  390. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  391. predict error 0
  392. dir: dir isR
  393. \53: O: O105 (predict-yes)
  394. I see 1 and I'm going to do: predict-yes
  395. ENV: Agent did: predict-yes for direction R in state State-B
  396. In State-B moving R
  397. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  398. predict error 1
  399. dir: dir isU
  400. -54: O: O108 (predict-no)
  401. I see 0 and I'm going to do: predict-no
  402. ENV: Agent did: predict-no for direction U in state State-B
  403. In State-B moving U
  404. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  405. predict error 0
  406. dir: dir isR
  407. /|\55: O: O109 (predict-yes)
  408. I see 1 and I'm going to do: predict-yes
  409. ENV: Agent did: predict-yes for direction R in state State-B
  410. In State-B moving R
  411. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  412. predict error 1
  413. dir: dir isU
  414. -/|\sleeping...
  415. -56: O: O111 (predict-yes)
  416. I see 0 and I'm going to do: predict-yes
  417. ENV: Agent did: predict-yes for direction U in state State-B
  418. In State-B moving U
  419. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  420. predict error 1
  421. dir: dir isU
  422. /|57: O: O114 (predict-no)
  423. I see 0 and I'm going to do: predict-no
  424. ENV: Agent did: predict-no for direction U in state State-B
  425. In State-B moving U
  426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  427. predict error 0
  428. dir: dir isR
  429. \58: O: O115 (predict-yes)
  430. I see 1 and I'm going to do: predict-yes
  431. ENV: Agent did: predict-yes for direction R in state State-B
  432. In State-B moving R
  433. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  434. predict error 1
  435. dir: dir isR
  436. -/|59: O: O117 (predict-yes)
  437. I see 0 and I'm going to do: predict-yes
  438. ENV: Agent did: predict-yes for direction R in state State-B
  439. In State-B moving R
  440. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  441. predict error 1
  442. dir: dir isU
  443. \-/60: O: O120 (predict-no)
  444. I see 0 and I'm going to do: predict-no
  445. ENV: Agent did: predict-no for direction U in state State-B
  446. In State-B moving U
  447. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  448. predict error 0
  449. dir: dir isU
  450. |\-61: O: O122 (predict-no)
  451. I see 1 and I'm going to do: predict-no
  452. ENV: Agent did: predict-no for direction U in state State-B
  453. In State-B moving U
  454. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  455. predict error 0
  456. dir: dir isR
  457. /62: O: O123 (predict-yes)
  458. I see 1 and I'm going to do: predict-yes
  459. ENV: Agent did: predict-yes for direction R in state State-B
  460. In State-B moving R
  461. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  462. predict error 1
  463. dir: dir isL
  464. |\63: O: O126 (predict-no)
  465. I see 0 and I'm going to do: predict-no
  466. ENV: Agent did: predict-no for direction L in state State-B
  467. In State-B moving L
  468. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  469. predict error 1
  470. dir: dir isL
  471. -/64: O: O128 (predict-no)
  472. I see 0 and I'm going to do: predict-no
  473. ENV: Agent did: predict-no for direction L in state State-A
  474. In State-A moving L
  475. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  476. predict error 0
  477. dir: dir isU
  478. |\-65: O: O130 (predict-no)
  479. I see 1 and I'm going to do: predict-no
  480. ENV: Agent did: predict-no for direction U in state State-A
  481. In State-A moving U
  482. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  483. predict error 0
  484. dir: dir isL
  485. /66: O: O132 (predict-no)
  486. I see 1 and I'm going to do: predict-no
  487. ENV: Agent did: predict-no for direction L in state State-A
  488. In State-A moving L
  489. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  490. predict error 0
  491. dir: dir isU
  492. |67: O: O134 (predict-no)
  493. I see 1 and I'm going to do: predict-no
  494. ENV: Agent did: predict-no for direction U in state State-A
  495. In State-A moving U
  496. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  497. predict error 0
  498. dir: dir isL
  499. \68: O: O136 (predict-no)
  500. I see 1 and I'm going to do: predict-no
  501. ENV: Agent did: predict-no for direction L in state State-A
  502. In State-A moving L
  503. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  504. predict error 0
  505. dir: dir isU
  506. -/69: O: O138 (predict-no)
  507. I see 1 and I'm going to do: predict-no
  508. ENV: Agent did: predict-no for direction U in state State-A
  509. In State-A moving U
  510. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  511. predict error 0
  512. dir: dir isL
  513. |70: O: O140 (predict-no)
  514. I see 1 and I'm going to do: predict-no
  515. ENV: Agent did: predict-no for direction L in state State-A
  516. In State-A moving L
  517. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  518. predict error 0
  519. dir: dir isU
  520. \-71: O: O142 (predict-no)
  521. I see 1 and I'm going to do: predict-no
  522. ENV: Agent did: predict-no for direction U in state State-A
  523. In State-A moving U
  524. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  525. predict error 0
  526. dir: dir isU
  527. rule alias: '*'
  528. rule alias: '*'
  529. rule alias: '*'
  530. rule alias: '*'
  531. rule alias: '*'
  532. rule alias: '*'
  533. /72: O: O144 (predict-no)
  534. I see 1 and I'm going to do: predict-no
  535. ENV: Agent did: predict-no for direction U in state State-A
  536. In State-A moving U
  537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  538. predict error 0
  539. dir: dir isR
  540. |\-73: O: O145 (predict-yes)
  541. I see 1 and I'm going to do: predict-yes
  542. ENV: Agent did: predict-yes for direction R in state State-A
  543. In State-A moving R
  544. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  545. predict error 0
  546. dir: dir isR
  547. /|74: O: O147 (predict-yes)
  548. I see 1 and I'm going to do: predict-yes
  549. ENV: Agent did: predict-yes for direction R in state State-B
  550. In State-B moving R
  551. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  552. predict error 1
  553. dir: dir isR
  554. \-75: O: O149 (predict-yes)
  555. I see 0 and I'm going to do: predict-yes
  556. ENV: Agent did: predict-yes for direction R in state State-B
  557. In State-B moving R
  558. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  559. predict error 1
  560. dir: dir isR
  561. /|\76: O: O152 (predict-no)
  562. I see 0 and I'm going to do: predict-no
  563. ENV: Agent did: predict-no for direction R in state State-B
  564. In State-B moving R
  565. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  566. predict error 0
  567. dir: dir isL
  568. -/|77: O: O154 (predict-no)
  569. I see 1 and I'm going to do: predict-no
  570. ENV: Agent did: predict-no for direction L in state State-B
  571. In State-B moving L
  572. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  573. predict error 1
  574. dir: dir isL
  575. \-/78: O: O156 (predict-no)
  576. I see 0 and I'm going to do: predict-no
  577. ENV: Agent did: predict-no for direction L in state State-A
  578. In State-A moving L
  579. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  580. predict error 0
  581. dir: dir isR
  582. |79: O: O158 (predict-no)
  583. I see 1 and I'm going to do: predict-no
  584. ENV: Agent did: predict-no for direction R in state State-A
  585. In State-A moving R
  586. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  587. predict error 1
  588. dir: dir isU
  589. \-80: O: O160 (predict-no)
  590. I see 0 and I'm going to do: predict-no
  591. ENV: Agent did: predict-no for direction U in state State-B
  592. In State-B moving U
  593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  594. predict error 0
  595. dir: dir isR
  596. /|\81: O: O162 (predict-no)
  597. I see 1 and I'm going to do: predict-no
  598. ENV: Agent did: predict-no for direction R in state State-B
  599. In State-B moving R
  600. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  601. predict error 0
  602. dir: dir isL
  603. rule alias: '*'
  604. rule alias: '*'
  605. -82: O: O163 (predict-yes)
  606. I see 1 and I'm going to do: predict-yes
  607. ENV: Agent did: predict-yes for direction L in state State-B
  608. In State-B moving L
  609. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  610. predict error 0
  611. dir: dir isU
  612. /|\83: O: O166 (predict-no)
  613. I see 1 and I'm going to do: predict-no
  614. ENV: Agent did: predict-no for direction U in state State-A
  615. In State-A moving U
  616. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  617. predict error 0
  618. dir: dir isU
  619. -/|84: O: O168 (predict-no)
  620. I see 1 and I'm going to do: predict-no
  621. ENV: Agent did: predict-no for direction U in state State-A
  622. In State-A moving U
  623. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  624. predict error 0
  625. dir: dir isU
  626. \-85: O: O169 (predict-yes)
  627. I see 1 and I'm going to do: predict-yes
  628. ENV: Agent did: predict-yes for direction U in state State-A
  629. In State-A moving U
  630. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  631. predict error 1
  632. dir: dir isL
  633. /|\86: O: O172 (predict-no)
  634. I see 0 and I'm going to do: predict-no
  635. ENV: Agent did: predict-no for direction L in state State-A
  636. In State-A moving L
  637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  638. predict error 0
  639. dir: dir isU
  640. -/|87: O: O173 (predict-yes)
  641. I see 1 and I'm going to do: predict-yes
  642. ENV: Agent did: predict-yes for direction U in state State-A
  643. In State-A moving U
  644. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  645. predict error 1
  646. dir: dir isL
  647. \-/88: O: O176 (predict-no)
  648. I see 0 and I'm going to do: predict-no
  649. ENV: Agent did: predict-no for direction L in state State-A
  650. In State-A moving L
  651. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  652. predict error 0
  653. dir: dir isR
  654. |\89: O: O177 (predict-yes)
  655. I see 1 and I'm going to do: predict-yes
  656. ENV: Agent did: predict-yes for direction R in state State-A
  657. In State-A moving R
  658. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  659. predict error 0
  660. dir: dir isL
  661. -/90: O: O180 (predict-no)
  662. I see 1 and I'm going to do: predict-no
  663. ENV: Agent did: predict-no for direction L in state State-B
  664. In State-B moving L
  665. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  666. predict error 1
  667. dir: dir isL
  668. |\-91: O: O182 (predict-no)
  669. I see 0 and I'm going to do: predict-no
  670. ENV: Agent did: predict-no for direction L in state State-A
  671. In State-A moving L
  672. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  673. predict error 0
  674. dir: dir isU
  675. rule alias: '*'
  676. rule alias: '*'
  677. /92: O: O184 (predict-no)
  678. I see 1 and I'm going to do: predict-no
  679. ENV: Agent did: predict-no for direction U in state State-A
  680. In State-A moving U
  681. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  682. predict error 0
  683. dir: dir isL
  684. |\93: O: O186 (predict-no)
  685. I see 1 and I'm going to do: predict-no
  686. ENV: Agent did: predict-no for direction L in state State-A
  687. In State-A moving L
  688. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  689. predict error 0
  690. dir: dir isR
  691. -/|94: O: O187 (predict-yes)
  692. I see 1 and I'm going to do: predict-yes
  693. ENV: Agent did: predict-yes for direction R in state State-A
  694. In State-A moving R
  695. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  696. predict error 0
  697. dir: dir isU
  698. \-/95: O: O190 (predict-no)
  699. I see 1 and I'm going to do: predict-no
  700. ENV: Agent did: predict-no for direction U in state State-B
  701. In State-B moving U
  702. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  703. predict error 0
  704. dir: dir isL
  705. |\-96: O: O191 (predict-yes)
  706. I see 1 and I'm going to do: predict-yes
  707. ENV: Agent did: predict-yes for direction L in state State-B
  708. In State-B moving L
  709. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  710. predict error 0
  711. dir: dir isL
  712. /97: O: O194 (predict-no)
  713. I see 1 and I'm going to do: predict-no
  714. ENV: Agent did: predict-no for direction L in state State-A
  715. In State-A moving L
  716. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  717. predict error 0
  718. dir: dir isU
  719. |\-98: O: O196 (predict-no)
  720. I see 1 and I'm going to do: predict-no
  721. ENV: Agent did: predict-no for direction U in state State-A
  722. In State-A moving U
  723. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  724. predict error 0
  725. dir: dir isR
  726. /|\99: O: O198 (predict-no)
  727. I see 1 and I'm going to do: predict-no
  728. ENV: Agent did: predict-no for direction R in state State-A
  729. In State-A moving R
  730. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  731. predict error 1
  732. dir: dir isU
  733. -100: O: O200 (predict-no)
  734. I see 0 and I'm going to do: predict-no
  735. ENV: Agent did: predict-no for direction U in state State-B
  736. In State-B moving U
  737. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  738. predict error 0
  739. dir: dir isL
  740. /|\101: O: O201 (predict-yes)
  741. I see 1 and I'm going to do: predict-yes
  742. ENV: Agent did: predict-yes for direction L in state State-B
  743. In State-B moving L
  744. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  745. predict error 0
  746. dir: dir isR
  747. -/102: O: O203 (predict-yes)
  748. I see 1 and I'm going to do: predict-yes
  749. ENV: Agent did: predict-yes for direction R in state State-A
  750. In State-A moving R
  751. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  752. predict error 0
  753. dir: dir isL
  754. |\-103: O: O205 (predict-yes)
  755. I see 1 and I'm going to do: predict-yes
  756. ENV: Agent did: predict-yes for direction L in state State-B
  757. In State-B moving L
  758. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  759. predict error 0
  760. dir: dir isU
  761. /|104: O: O208 (predict-no)
  762. I see 1 and I'm going to do: predict-no
  763. ENV: Agent did: predict-no for direction U in state State-A
  764. In State-A moving U
  765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  766. predict error 0
  767. dir: dir isL
  768. \-/105: O: O210 (predict-no)
  769. I see 1 and I'm going to do: predict-no
  770. ENV: Agent did: predict-no for direction L in state State-A
  771. In State-A moving L
  772. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  773. predict error 0
  774. dir: dir isU
  775. |\106: O: O212 (predict-no)
  776. I see 1 and I'm going to do: predict-no
  777. ENV: Agent did: predict-no for direction U in state State-A
  778. In State-A moving U
  779. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  780. predict error 0
  781. dir: dir isL
  782. -/|107: O: O214 (predict-no)
  783. I see 1 and I'm going to do: predict-no
  784. ENV: Agent did: predict-no for direction L in state State-A
  785. In State-A moving L
  786. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  787. predict error 0
  788. dir: dir isU
  789. \-/108: O: O216 (predict-no)
  790. I see 1 and I'm going to do: predict-no
  791. ENV: Agent did: predict-no for direction U in state State-A
  792. In State-A moving U
  793. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  794. predict error 0
  795. dir: dir isL
  796. |\-109: O: O218 (predict-no)
  797. I see 1 and I'm going to do: predict-no
  798. ENV: Agent did: predict-no for direction L in state State-A
  799. In State-A moving L
  800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  801. predict error 0
  802. dir: dir isL
  803. /|110: O: O220 (predict-no)
  804. I see 1 and I'm going to do: predict-no
  805. ENV: Agent did: predict-no for direction L in state State-A
  806. In State-A moving L
  807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  808. predict error 0
  809. dir: dir isU
  810. \-/111: O: O222 (predict-no)
  811. I see 1 and I'm going to do: predict-no
  812. ENV: Agent did: predict-no for direction U in state State-A
  813. In State-A moving U
  814. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  815. predict error 0
  816. dir: dir isL
  817. rule alias: '*'
  818. |112: O: O224 (predict-no)
  819. I see 1 and I'm going to do: predict-no
  820. ENV: Agent did: predict-no for direction L in state State-A
  821. In State-A moving L
  822. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  823. predict error 0
  824. dir: dir isL
  825. \113: O: O226 (predict-no)
  826. I see 1 and I'm going to do: predict-no
  827. ENV: Agent did: predict-no for direction L in state State-A
  828. In State-A moving L
  829. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  830. predict error 0
  831. dir: dir isU
  832. -/114: O: O228 (predict-no)
  833. I see 1 and I'm going to do: predict-no
  834. ENV: Agent did: predict-no for direction U in state State-A
  835. In State-A moving U
  836. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  837. predict error 0
  838. dir: dir isR
  839. |\-115: O: O229 (predict-yes)
  840. I see 1 and I'm going to do: predict-yes
  841. ENV: Agent did: predict-yes for direction R in state State-A
  842. In State-A moving R
  843. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  844. predict error 0
  845. dir: dir isL
  846. /|\-116: O: O231 (predict-yes)
  847. I see 1 and I'm going to do: predict-yes
  848. ENV: Agent did: predict-yes for direction L in state State-B
  849. In State-B moving L
  850. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  851. predict error 0
  852. dir: dir isU
  853. /|117: O: O234 (predict-no)
  854. I see 1 and I'm going to do: predict-no
  855. ENV: Agent did: predict-no for direction U in state State-A
  856. In State-A moving U
  857. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  858. predict error 0
  859. dir: dir isL
  860. \-/118: O: O236 (predict-no)
  861. I see 1 and I'm going to do: predict-no
  862. ENV: Agent did: predict-no for direction L in state State-A
  863. In State-A moving L
  864. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  865. predict error 0
  866. dir: dir isL
  867. |119: O: O238 (predict-no)
  868. I see 1 and I'm going to do: predict-no
  869. ENV: Agent did: predict-no for direction L in state State-A
  870. In State-A moving L
  871. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  872. predict error 0
  873. dir: dir isR
  874. \-/120: O: O239 (predict-yes)
  875. I see 1 and I'm going to do: predict-yes
  876. ENV: Agent did: predict-yes for direction R in state State-A
  877. In State-A moving R
  878. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  879. predict error 0
  880. dir: dir isR
  881. |\-121: O: O241 (predict-yes)
  882. I see 1 and I'm going to do: predict-yes
  883. ENV: Agent did: predict-yes for direction R in state State-B
  884. In State-B moving R
  885. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  886. predict error 1
  887. dir: dir isU
  888. /122: O: O244 (predict-no)
  889. I see 0 and I'm going to do: predict-no
  890. ENV: Agent did: predict-no for direction U in state State-B
  891. In State-B moving U
  892. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  893. predict error 0
  894. dir: dir isL
  895. |\-123: O: O245 (predict-yes)
  896. I see 1 and I'm going to do: predict-yes
  897. ENV: Agent did: predict-yes for direction L in state State-B
  898. In State-B moving L
  899. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  900. predict error 0
  901. dir: dir isR
  902. /|124: O: O248 (predict-no)
  903. I see 1 and I'm going to do: predict-no
  904. ENV: Agent did: predict-no for direction R in state State-A
  905. In State-A moving R
  906. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  907. predict error 1
  908. dir: dir isL
  909. \-/125: O: O249 (predict-yes)
  910. I see 0 and I'm going to do: predict-yes
  911. ENV: Agent did: predict-yes for direction L in state State-B
  912. In State-B moving L
  913. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  914. predict error 0
  915. dir: dir isR
  916. |126: O: O251 (predict-yes)
  917. I see 1 and I'm going to do: predict-yes
  918. ENV: Agent did: predict-yes for direction R in state State-A
  919. In State-A moving R
  920. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  921. predict error 0
  922. dir: dir isU
  923. \-/127: O: O254 (predict-no)
  924. I see 1 and I'm going to do: predict-no
  925. ENV: Agent did: predict-no for direction U in state State-B
  926. In State-B moving U
  927. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  928. predict error 0
  929. dir: dir isU
  930. |\128: O: O256 (predict-no)
  931. I see 1 and I'm going to do: predict-no
  932. ENV: Agent did: predict-no for direction U in state State-B
  933. In State-B moving U
  934. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  935. predict error 0
  936. dir: dir isU
  937. -/|129: O: O258 (predict-no)
  938. I see 1 and I'm going to do: predict-no
  939. ENV: Agent did: predict-no for direction U in state State-B
  940. In State-B moving U
  941. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  942. predict error 0
  943. dir: dir isU
  944. \130: O: O259 (predict-yes)
  945. I see 1 and I'm going to do: predict-yes
  946. ENV: Agent did: predict-yes for direction U in state State-B
  947. In State-B moving U
  948. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  949. predict error 1
  950. dir: dir isU
  951. -/131: O: O262 (predict-no)
  952. I see 0 and I'm going to do: predict-no
  953. ENV: Agent did: predict-no for direction U in state State-B
  954. In State-B moving U
  955. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  956. predict error 0
  957. dir: dir isU
  958. |132: O: O264 (predict-no)
  959. I see 1 and I'm going to do: predict-no
  960. ENV: Agent did: predict-no for direction U in state State-B
  961. In State-B moving U
  962. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  963. predict error 0
  964. dir: dir isR
  965. \-/133: O: O265 (predict-yes)
  966. I see 1 and I'm going to do: predict-yes
  967. ENV: Agent did: predict-yes for direction R in state State-B
  968. In State-B moving R
  969. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  970. predict error 1
  971. dir: dir isL
  972. |\-134: O: O267 (predict-yes)
  973. I see 0 and I'm going to do: predict-yes
  974. ENV: Agent did: predict-yes for direction L in state State-B
  975. In State-B moving L
  976. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  977. predict error 0
  978. dir: dir isR
  979. /135: O: O269 (predict-yes)
  980. I see 1 and I'm going to do: predict-yes
  981. ENV: Agent did: predict-yes for direction R in state State-A
  982. In State-A moving R
  983. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  984. predict error 0
  985. dir: dir isL
  986. |\136: O: O271 (predict-yes)
  987. I see 1 and I'm going to do: predict-yes
  988. ENV: Agent did: predict-yes for direction L in state State-B
  989. In State-B moving L
  990. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  991. predict error 0
  992. dir: dir isL
  993. -137: O: O274 (predict-no)
  994. I see 1 and I'm going to do: predict-no
  995. ENV: Agent did: predict-no for direction L in state State-A
  996. In State-A moving L
  997. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  998. predict error 0
  999. dir: dir isR
  1000. /|\138: O: O275 (predict-yes)
  1001. I see 1 and I'm going to do: predict-yes
  1002. ENV: Agent did: predict-yes for direction R in state State-A
  1003. In State-A moving R
  1004. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1005. predict error 0
  1006. dir: dir isU
  1007. -/|139: O: O278 (predict-no)
  1008. I see 1 and I'm going to do: predict-no
  1009. ENV: Agent did: predict-no for direction U in state State-B
  1010. In State-B moving U
  1011. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1012. predict error 0
  1013. dir: dir isL
  1014. \-/140: O: O279 (predict-yes)
  1015. I see 1 and I'm going to do: predict-yes
  1016. ENV: Agent did: predict-yes for direction L in state State-B
  1017. In State-B moving L
  1018. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1019. predict error 0
  1020. dir: dir isR
  1021. |\-141: O: O281 (predict-yes)
  1022. I see 1 and I'm going to do: predict-yes
  1023. ENV: Agent did: predict-yes for direction R in state State-A
  1024. In State-A moving R
  1025. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1026. predict error 0
  1027. dir: dir isR
  1028. /142: O: O284 (predict-no)
  1029. I see 1 and I'm going to do: predict-no
  1030. ENV: Agent did: predict-no for direction R in state State-B
  1031. In State-B moving R
  1032. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1033. predict error 0
  1034. dir: dir isR
  1035. |\143: O: O286 (predict-no)
  1036. I see 1 and I'm going to do: predict-no
  1037. ENV: Agent did: predict-no for direction R in state State-B
  1038. In State-B moving R
  1039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1040. predict error 0
  1041. dir: dir isL
  1042. -/144: O: O287 (predict-yes)
  1043. I see 1 and I'm going to do: predict-yes
  1044. ENV: Agent did: predict-yes for direction L in state State-B
  1045. In State-B moving L
  1046. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1047. predict error 0
  1048. dir: dir isU
  1049. |\-145: O: O290 (predict-no)
  1050. I see 1 and I'm going to do: predict-no
  1051. ENV: Agent did: predict-no for direction U in state State-A
  1052. In State-A moving U
  1053. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1054. predict error 0
  1055. dir: dir isL
  1056. /|146: O: O292 (predict-no)
  1057. I see 1 and I'm going to do: predict-no
  1058. ENV: Agent did: predict-no for direction L in state State-A
  1059. In State-A moving L
  1060. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1061. predict error 0
  1062. dir: dir isR
  1063. \-147: O: O293 (predict-yes)
  1064. I see 1 and I'm going to do: predict-yes
  1065. ENV: Agent did: predict-yes for direction R in state State-A
  1066. In State-A moving R
  1067. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1068. predict error 0
  1069. dir: dir isR
  1070. /|\148: O: O296 (predict-no)
  1071. I see 1 and I'm going to do: predict-no
  1072. ENV: Agent did: predict-no for direction R in state State-B
  1073. In State-B moving R
  1074. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1075. predict error 0
  1076. dir: dir isL
  1077. -/149: O: O297 (predict-yes)
  1078. I see 1 and I'm going to do: predict-yes
  1079. ENV: Agent did: predict-yes for direction L in state State-B
  1080. In State-B moving L
  1081. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1082. predict error 0
  1083. dir: dir isR
  1084. |\-150: O: O299 (predict-yes)
  1085. I see 1 and I'm going to do: predict-yes
  1086. ENV: Agent did: predict-yes for direction R in state State-A
  1087. In State-A moving R
  1088. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1089. predict error 0
  1090. dir: dir isL
  1091. /|\151: O: O301 (predict-yes)
  1092. I see 1 and I'm going to do: predict-yes
  1093. ENV: Agent did: predict-yes for direction L in state State-B
  1094. In State-B moving L
  1095. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1096. predict error 0
  1097. dir: dir isL
  1098. -152: O: O304 (predict-no)
  1099. I see 1 and I'm going to do: predict-no
  1100. ENV: Agent did: predict-no for direction L in state State-A
  1101. In State-A moving L
  1102. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1103. predict error 0
  1104. dir: dir isL
  1105. /|\153: O: O306 (predict-no)
  1106. I see 1 and I'm going to do: predict-no
  1107. ENV: Agent did: predict-no for direction L in state State-A
  1108. In State-A moving L
  1109. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1110. predict error 0
  1111. dir: dir isL
  1112. -/|154: O: O308 (predict-no)
  1113. I see 1 and I'm going to do: predict-no
  1114. ENV: Agent did: predict-no for direction L in state State-A
  1115. In State-A moving L
  1116. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1117. predict error 0
  1118. dir: dir isL
  1119. \-/155: O: O310 (predict-no)
  1120. I see 1 and I'm going to do: predict-no
  1121. ENV: Agent did: predict-no for direction L in state State-A
  1122. In State-A moving L
  1123. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1124. predict error 0
  1125. dir: dir isR
  1126. |156: O: O311 (predict-yes)
  1127. I see 1 and I'm going to do: predict-yes
  1128. ENV: Agent did: predict-yes for direction R in state State-A
  1129. In State-A moving R
  1130. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1131. predict error 0
  1132. dir: dir isR
  1133. \-157: O: O314 (predict-no)
  1134. I see 1 and I'm going to do: predict-no
  1135. ENV: Agent did: predict-no for direction R in state State-B
  1136. In State-B moving R
  1137. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1138. predict error 0
  1139. dir: dir isU
  1140. /|158: O: O316 (predict-no)
  1141. I see 1 and I'm going to do: predict-no
  1142. ENV: Agent did: predict-no for direction U in state State-B
  1143. In State-B moving U
  1144. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1145. predict error 0
  1146. dir: dir isU
  1147. \-159: O: O318 (predict-no)
  1148. I see 1 and I'm going to do: predict-no
  1149. ENV: Agent did: predict-no for direction U in state State-B
  1150. In State-B moving U
  1151. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1152. predict error 0
  1153. dir: dir isU
  1154. /|\160: O: O320 (predict-no)
  1155. I see 1 and I'm going to do: predict-no
  1156. ENV: Agent did: predict-no for direction U in state State-B
  1157. In State-B moving U
  1158. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1159. predict error 0
  1160. dir: dir isL
  1161. -/|161: O: O321 (predict-yes)
  1162. I see 1 and I'm going to do: predict-yes
  1163. ENV: Agent did: predict-yes for direction L in state State-B
  1164. In State-B moving L
  1165. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1166. predict error 0
  1167. dir: dir isR
  1168. \162: O: O323 (predict-yes)
  1169. I see 1 and I'm going to do: predict-yes
  1170. ENV: Agent did: predict-yes for direction R in state State-A
  1171. In State-A moving R
  1172. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1173. predict error 0
  1174. dir: dir isL
  1175. -/|163: O: O325 (predict-yes)
  1176. I see 1 and I'm going to do: predict-yes
  1177. ENV: Agent did: predict-yes for direction L in state State-B
  1178. In State-B moving L
  1179. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1180. predict error 0
  1181. dir: dir isR
  1182. \-164: O: O327 (predict-yes)
  1183. I see 1 and I'm going to do: predict-yes
  1184. ENV: Agent did: predict-yes for direction R in state State-A
  1185. In State-A moving R
  1186. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1187. predict error 0
  1188. dir: dir isR
  1189. /|\165: O: O329 (predict-yes)
  1190. I see 1 and I'm going to do: predict-yes
  1191. ENV: Agent did: predict-yes for direction R in state State-B
  1192. In State-B moving R
  1193. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1194. predict error 1
  1195. dir: dir isU
  1196. -/|166: O: O332 (predict-no)
  1197. I see 0 and I'm going to do: predict-no
  1198. ENV: Agent did: predict-no for direction U in state State-B
  1199. In State-B moving U
  1200. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1201. predict error 0
  1202. dir: dir isU
  1203. \-/167: O: O334 (predict-no)
  1204. I see 1 and I'm going to do: predict-no
  1205. ENV: Agent did: predict-no for direction U in state State-B
  1206. In State-B moving U
  1207. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1208. predict error 0
  1209. dir: dir isL
  1210. |\168: O: O335 (predict-yes)
  1211. I see 1 and I'm going to do: predict-yes
  1212. ENV: Agent did: predict-yes for direction L in state State-B
  1213. In State-B moving L
  1214. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1215. predict error 0
  1216. dir: dir isR
  1217. -/169: O: O337 (predict-yes)
  1218. I see 1 and I'm going to do: predict-yes
  1219. ENV: Agent did: predict-yes for direction R in state State-A
  1220. In State-A moving R
  1221. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1222. predict error 0
  1223. dir: dir isR
  1224. |\-170: O: O340 (predict-no)
  1225. I see 1 and I'm going to do: predict-no
  1226. ENV: Agent did: predict-no for direction R in state State-B
  1227. In State-B moving R
  1228. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1229. predict error 0
  1230. dir: dir isL
  1231. /|171: O: O342 (predict-no)
  1232. I see 1 and I'm going to do: predict-no
  1233. ENV: Agent did: predict-no for direction L in state State-B
  1234. In State-B moving L
  1235. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1236. predict error 1
  1237. dir: dir isL
  1238. \172: O: O344 (predict-no)
  1239. I see 0 and I'm going to do: predict-no
  1240. ENV: Agent did: predict-no for direction L in state State-A
  1241. In State-A moving L
  1242. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1243. predict error 0
  1244. dir: dir isR
  1245. -/173: O: O345 (predict-yes)
  1246. I see 1 and I'm going to do: predict-yes
  1247. ENV: Agent did: predict-yes for direction R in state State-A
  1248. In State-A moving R
  1249. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1250. predict error 0
  1251. dir: dir isL
  1252. |\174: O: O347 (predict-yes)
  1253. I see 1 and I'm going to do: predict-yes
  1254. ENV: Agent did: predict-yes for direction L in state State-B
  1255. In State-B moving L
  1256. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1257. predict error 0
  1258. dir: dir isU
  1259. -/|\175: O: O350 (predict-no)
  1260. I see 1 and I'm going to do: predict-no
  1261. ENV: Agent did: predict-no for direction U in state State-A
  1262. In State-A moving U
  1263. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1264. predict error 0
  1265. dir: dir isL
  1266. -/176: O: O352 (predict-no)
  1267. I see 1 and I'm going to do: predict-no
  1268. ENV: Agent did: predict-no for direction L in state State-A
  1269. In State-A moving L
  1270. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1271. predict error 0
  1272. dir: dir isL
  1273. |\-177: O: O354 (predict-no)
  1274. I see 1 and I'm going to do: predict-no
  1275. ENV: Agent did: predict-no for direction L in state State-A
  1276. In State-A moving L
  1277. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1278. predict error 0
  1279. dir: dir isL
  1280. /|178: O: O356 (predict-no)
  1281. I see 1 and I'm going to do: predict-no
  1282. ENV: Agent did: predict-no for direction L in state State-A
  1283. In State-A moving L
  1284. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1285. predict error 0
  1286. dir: dir isL
  1287. \179: O: O358 (predict-no)
  1288. I see 1 and I'm going to do: predict-no
  1289. ENV: Agent did: predict-no for direction L in state State-A
  1290. In State-A moving L
  1291. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1292. predict error 0
  1293. dir: dir isL
  1294. -/|180: O: O360 (predict-no)
  1295. I see 1 and I'm going to do: predict-no
  1296. ENV: Agent did: predict-no for direction L in state State-A
  1297. In State-A moving L
  1298. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1299. predict error 0
  1300. dir: dir isR
  1301. \181: O: O361 (predict-yes)
  1302. I see 1 and I'm going to do: predict-yes
  1303. ENV: Agent did: predict-yes for direction R in state State-A
  1304. In State-A moving R
  1305. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1306. predict error 0
  1307. dir: dir isR
  1308. -182: O: O364 (predict-no)
  1309. I see 1 and I'm going to do: predict-no
  1310. ENV: Agent did: predict-no for direction R in state State-B
  1311. In State-B moving R
  1312. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1313. predict error 0
  1314. dir: dir isL
  1315. /|\183: O: O365 (predict-yes)
  1316. I see 1 and I'm going to do: predict-yes
  1317. ENV: Agent did: predict-yes for direction L in state State-B
  1318. In State-B moving L
  1319. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1320. predict error 0
  1321. dir: dir isR
  1322. -184: O: O367 (predict-yes)
  1323. I see 1 and I'm going to do: predict-yes
  1324. ENV: Agent did: predict-yes for direction R in state State-A
  1325. In State-A moving R
  1326. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1327. predict error 0
  1328. dir: dir isL
  1329. /185: O: O369 (predict-yes)
  1330. I see 1 and I'm going to do: predict-yes
  1331. ENV: Agent did: predict-yes for direction L in state State-B
  1332. In State-B moving L
  1333. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1334. predict error 0
  1335. dir: dir isU
  1336. |\-186: O: O372 (predict-no)
  1337. I see 1 and I'm going to do: predict-no
  1338. ENV: Agent did: predict-no for direction U in state State-A
  1339. In State-A moving U
  1340. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1341. predict error 0
  1342. dir: dir isU
  1343. /|\187: O: O374 (predict-no)
  1344. I see 1 and I'm going to do: predict-no
  1345. ENV: Agent did: predict-no for direction U in state State-A
  1346. In State-A moving U
  1347. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1348. predict error 0
  1349. dir: dir isU
  1350. -/|188: O: O376 (predict-no)
  1351. I see 1 and I'm going to do: predict-no
  1352. ENV: Agent did: predict-no for direction U in state State-A
  1353. In State-A moving U
  1354. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1355. predict error 0
  1356. dir: dir isR
  1357. \-/189: O: O378 (predict-no)
  1358. I see 1 and I'm going to do: predict-no
  1359. ENV: Agent did: predict-no for direction R in state State-A
  1360. In State-A moving R
  1361. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1362. predict error 1
  1363. dir: dir isR
  1364. |\-190: O: O380 (predict-no)
  1365. I see 0 and I'm going to do: predict-no
  1366. ENV: Agent did: predict-no for direction R in state State-B
  1367. In State-B moving R
  1368. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1369. predict error 0
  1370. dir: dir isR
  1371. /|191: O: O382 (predict-no)
  1372. I see 1 and I'm going to do: predict-no
  1373. ENV: Agent did: predict-no for direction R in state State-B
  1374. In State-B moving R
  1375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1376. predict error 0
  1377. dir: dir isL
  1378. \192: O: O383 (predict-yes)
  1379. I see 1 and I'm going to do: predict-yes
  1380. ENV: Agent did: predict-yes for direction L in state State-B
  1381. In State-B moving L
  1382. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1383. predict error 0
  1384. dir: dir isR
  1385. -/|193: O: O385 (predict-yes)
  1386. I see 1 and I'm going to do: predict-yes
  1387. ENV: Agent did: predict-yes for direction R in state State-A
  1388. In State-A moving R
  1389. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1390. predict error 0
  1391. dir: dir isR
  1392. \194: O: O388 (predict-no)
  1393. I see 1 and I'm going to do: predict-no
  1394. ENV: Agent did: predict-no for direction R in state State-B
  1395. In State-B moving R
  1396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1397. predict error 0
  1398. dir: dir isL
  1399. -/|195: O: O389 (predict-yes)
  1400. I see 1 and I'm going to do: predict-yes
  1401. ENV: Agent did: predict-yes for direction L in state State-B
  1402. In State-B moving L
  1403. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1404. predict error 0
  1405. dir: dir isL
  1406. \-/196: O: O392 (predict-no)
  1407. I see 1 and I'm going to do: predict-no
  1408. ENV: Agent did: predict-no for direction L in state State-A
  1409. In State-A moving L
  1410. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1411. predict error 0
  1412. dir: dir isU
  1413. |\-197: O: O394 (predict-no)
  1414. I see 1 and I'm going to do: predict-no
  1415. ENV: Agent did: predict-no for direction U in state State-A
  1416. In State-A moving U
  1417. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1418. predict error 0
  1419. dir: dir isR
  1420. /|\198: O: O395 (predict-yes)
  1421. I see 1 and I'm going to do: predict-yes
  1422. ENV: Agent did: predict-yes for direction R in state State-A
  1423. In State-A moving R
  1424. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1425. predict error 0
  1426. dir: dir isR
  1427. -/199: O: O398 (predict-no)
  1428. I see 1 and I'm going to do: predict-no
  1429. ENV: Agent did: predict-no for direction R in state State-B
  1430. In State-B moving R
  1431. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1432. predict error 0
  1433. dir: dir isU
  1434. |\-200: O: O400 (predict-no)
  1435. I see 1 and I'm going to do: predict-no
  1436. ENV: Agent did: predict-no for direction U in state State-B
  1437. In State-B moving U
  1438. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1439. predict error 0
  1440. dir: dir isR
  1441. /|\-/|201: O: O402 (predict-no)
  1442. I see 1 and I'm going to do: predict-no
  1443. ENV: Agent did: predict-no for direction R in state State-B
  1444. In State-B moving R
  1445. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1446. predict error 0
  1447. dir: dir isL
  1448. \202: O: O403 (predict-yes)
  1449. I see 1 and I'm going to do: predict-yes
  1450. ENV: Agent did: predict-yes for direction L in state State-B
  1451. In State-B moving L
  1452. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1453. predict error 0
  1454. dir: dir isU
  1455. -203: O: O406 (predict-no)
  1456. I see 1 and I'm going to do: predict-no
  1457. ENV: Agent did: predict-no for direction U in state State-A
  1458. In State-A moving U
  1459. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1460. predict error 0
  1461. dir: dir isR
  1462. /|\204: O: O407 (predict-yes)
  1463. I see 1 and I'm going to do: predict-yes
  1464. ENV: Agent did: predict-yes for direction R in state State-A
  1465. In State-A moving R
  1466. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1467. predict error 0
  1468. dir: dir isL
  1469. -/|205: O: O409 (predict-yes)
  1470. I see 1 and I'm going to do: predict-yes
  1471. ENV: Agent did: predict-yes for direction L in state State-B
  1472. In State-B moving L
  1473. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1474. predict error 0
  1475. dir: dir isU
  1476. \206: O: O412 (predict-no)
  1477. I see 1 and I'm going to do: predict-no
  1478. ENV: Agent did: predict-no for direction U in state State-A
  1479. In State-A moving U
  1480. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1481. predict error 0
  1482. dir: dir isL
  1483. -/|207: O: O414 (predict-no)
  1484. I see 1 and I'm going to do: predict-no
  1485. ENV: Agent did: predict-no for direction L in state State-A
  1486. In State-A moving L
  1487. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1488. predict error 0
  1489. dir: dir isL
  1490. \-/208: O: O415 (predict-yes)
  1491. I see 1 and I'm going to do: predict-yes
  1492. ENV: Agent did: predict-yes for direction L in state State-A
  1493. In State-A moving L
  1494. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  1495. predict error 1
  1496. dir: dir isU
  1497. |\209: O: O418 (predict-no)
  1498. I see 0 and I'm going to do: predict-no
  1499. ENV: Agent did: predict-no for direction U in state State-A
  1500. In State-A moving U
  1501. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1502. predict error 0
  1503. dir: dir isU
  1504. -/210: O: O420 (predict-no)
  1505. I see 1 and I'm going to do: predict-no
  1506. ENV: Agent did: predict-no for direction U in state State-A
  1507. In State-A moving U
  1508. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1509. predict error 0
  1510. dir: dir isL
  1511. |\211: O: O422 (predict-no)
  1512. I see 1 and I'm going to do: predict-no
  1513. ENV: Agent did: predict-no for direction L in state State-A
  1514. In State-A moving L
  1515. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1516. predict error 0
  1517. dir: dir isR
  1518. -212: O: O423 (predict-yes)
  1519. I see 1 and I'm going to do: predict-yes
  1520. ENV: Agent did: predict-yes for direction R in state State-A
  1521. In State-A moving R
  1522. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1523. predict error 0
  1524. dir: dir isR
  1525. /|\213: O: O426 (predict-no)
  1526. I see 1 and I'm going to do: predict-no
  1527. ENV: Agent did: predict-no for direction R in state State-B
  1528. In State-B moving R
  1529. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1530. predict error 0
  1531. dir: dir isR
  1532. -214: O: O428 (predict-no)
  1533. I see 1 and I'm going to do: predict-no
  1534. ENV: Agent did: predict-no for direction R in state State-B
  1535. In State-B moving R
  1536. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1537. predict error 0
  1538. dir: dir isL
  1539. /|\215: O: O429 (predict-yes)
  1540. I see 1 and I'm going to do: predict-yes
  1541. ENV: Agent did: predict-yes for direction L in state State-B
  1542. In State-B moving L
  1543. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1544. predict error 0
  1545. dir: dir isR
  1546. -/216: O: O431 (predict-yes)
  1547. I see 1 and I'm going to do: predict-yes
  1548. ENV: Agent did: predict-yes for direction R in state State-A
  1549. In State-A moving R
  1550. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1551. predict error 0
  1552. dir: dir isL
  1553. |\-217: O: O433 (predict-yes)
  1554. I see 1 and I'm going to do: predict-yes
  1555. ENV: Agent did: predict-yes for direction L in state State-B
  1556. In State-B moving L
  1557. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1558. predict error 0
  1559. dir: dir isL
  1560. /|\218: O: O436 (predict-no)
  1561. I see 1 and I'm going to do: predict-no
  1562. ENV: Agent did: predict-no for direction L in state State-A
  1563. In State-A moving L
  1564. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1565. predict error 0
  1566. dir: dir isR
  1567. -/|219: O: O437 (predict-yes)
  1568. I see 1 and I'm going to do: predict-yes
  1569. ENV: Agent did: predict-yes for direction R in state State-A
  1570. In State-A moving R
  1571. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1572. predict error 0
  1573. dir: dir isU
  1574. \-220: O: O440 (predict-no)
  1575. I see 1 and I'm going to do: predict-no
  1576. ENV: Agent did: predict-no for direction U in state State-B
  1577. In State-B moving U
  1578. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1579. predict error 0
  1580. dir: dir isL
  1581. /|\221: O: O441 (predict-yes)
  1582. I see 1 and I'm going to do: predict-yes
  1583. ENV: Agent did: predict-yes for direction L in state State-B
  1584. In State-B moving L
  1585. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1586. predict error 0
  1587. dir: dir isU
  1588. -222: O: O444 (predict-no)
  1589. I see 1 and I'm going to do: predict-no
  1590. ENV: Agent did: predict-no for direction U in state State-A
  1591. In State-A moving U
  1592. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1593. predict error 0
  1594. dir: dir isL
  1595. /|223: O: O446 (predict-no)
  1596. I see 1 and I'm going to do: predict-no
  1597. ENV: Agent did: predict-no for direction L in state State-A
  1598. In State-A moving L
  1599. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1600. predict error 0
  1601. dir: dir isR
  1602. \-224: O: O447 (predict-yes)
  1603. I see 1 and I'm going to do: predict-yes
  1604. ENV: Agent did: predict-yes for direction R in state State-A
  1605. In State-A moving R
  1606. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1607. predict error 0
  1608. dir: dir isR
  1609. /|225: O: O449 (predict-yes)
  1610. I see 1 and I'm going to do: predict-yes
  1611. ENV: Agent did: predict-yes for direction R in state State-B
  1612. In State-B moving R
  1613. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1614. predict error 1
  1615. dir: dir isR
  1616. \-/226: O: O452 (predict-no)
  1617. I see 0 and I'm going to do: predict-no
  1618. ENV: Agent did: predict-no for direction R in state State-B
  1619. In State-B moving R
  1620. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1621. predict error 0
  1622. dir: dir isU
  1623. |\227: O: O454 (predict-no)
  1624. I see 1 and I'm going to do: predict-no
  1625. ENV: Agent did: predict-no for direction U in state State-B
  1626. In State-B moving U
  1627. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1628. predict error 0
  1629. dir: dir isR
  1630. -/|228: O: O456 (predict-no)
  1631. I see 1 and I'm going to do: predict-no
  1632. ENV: Agent did: predict-no for direction R in state State-B
  1633. In State-B moving R
  1634. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1635. predict error 0
  1636. dir: dir isL
  1637. \229: O: O457 (predict-yes)
  1638. I see 1 and I'm going to do: predict-yes
  1639. ENV: Agent did: predict-yes for direction L in state State-B
  1640. In State-B moving L
  1641. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1642. predict error 0
  1643. dir: dir isL
  1644. -/|230: O: O460 (predict-no)
  1645. I see 1 and I'm going to do: predict-no
  1646. ENV: Agent did: predict-no for direction L in state State-A
  1647. In State-A moving L
  1648. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1649. predict error 0
  1650. dir: dir isL
  1651. \-231: O: O462 (predict-no)
  1652. I see 1 and I'm going to do: predict-no
  1653. ENV: Agent did: predict-no for direction L in state State-A
  1654. In State-A moving L
  1655. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1656. predict error 0
  1657. dir: dir isU
  1658. /232: O: O464 (predict-no)
  1659. I see 1 and I'm going to do: predict-no
  1660. ENV: Agent did: predict-no for direction U in state State-A
  1661. In State-A moving U
  1662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1663. predict error 0
  1664. dir: dir isR
  1665. |\233: O: O465 (predict-yes)
  1666. I see 1 and I'm going to do: predict-yes
  1667. ENV: Agent did: predict-yes for direction R in state State-A
  1668. In State-A moving R
  1669. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1670. predict error 0
  1671. dir: dir isU
  1672. -/|234: O: O468 (predict-no)
  1673. I see 1 and I'm going to do: predict-no
  1674. ENV: Agent did: predict-no for direction U in state State-B
  1675. In State-B moving U
  1676. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1677. predict error 0
  1678. dir: dir isU
  1679. \-235: O: O470 (predict-no)
  1680. I see 1 and I'm going to do: predict-no
  1681. ENV: Agent did: predict-no for direction U in state State-B
  1682. In State-B moving U
  1683. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1684. predict error 0
  1685. dir: dir isL
  1686. /|236: O: O471 (predict-yes)
  1687. I see 1 and I'm going to do: predict-yes
  1688. ENV: Agent did: predict-yes for direction L in state State-B
  1689. In State-B moving L
  1690. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1691. predict error 0
  1692. dir: dir isR
  1693. \-/237: O: O474 (predict-no)
  1694. I see 1 and I'm going to do: predict-no
  1695. ENV: Agent did: predict-no for direction R in state State-A
  1696. In State-A moving R
  1697. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1698. predict error 1
  1699. dir: dir isU
  1700. |\238: O: O476 (predict-no)
  1701. I see 0 and I'm going to do: predict-no
  1702. ENV: Agent did: predict-no for direction U in state State-B
  1703. In State-B moving U
  1704. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1705. predict error 0
  1706. dir: dir isU
  1707. -/|\239: O: O478 (predict-no)
  1708. I see 1 and I'm going to do: predict-no
  1709. ENV: Agent did: predict-no for direction U in state State-B
  1710. In State-B moving U
  1711. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1712. predict error 0
  1713. dir: dir isR
  1714. -/240: O: O480 (predict-no)
  1715. I see 1 and I'm going to do: predict-no
  1716. ENV: Agent did: predict-no for direction R in state State-B
  1717. In State-B moving R
  1718. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1719. predict error 0
  1720. dir: dir isR
  1721. |\-241: O: O481 (predict-yes)
  1722. I see 1 and I'm going to do: predict-yes
  1723. ENV: Agent did: predict-yes for direction R in state State-B
  1724. In State-B moving R
  1725. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1726. predict error 1
  1727. dir: dir isR
  1728. /242: O: O484 (predict-no)
  1729. I see 0 and I'm going to do: predict-no
  1730. ENV: Agent did: predict-no for direction R in state State-B
  1731. In State-B moving R
  1732. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1733. predict error 0
  1734. dir: dir isU
  1735. |\-243: O: O486 (predict-no)
  1736. I see 1 and I'm going to do: predict-no
  1737. ENV: Agent did: predict-no for direction U in state State-B
  1738. In State-B moving U
  1739. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1740. predict error 0
  1741. dir: dir isL
  1742. /|\244: O: O487 (predict-yes)
  1743. I see 1 and I'm going to do: predict-yes
  1744. ENV: Agent did: predict-yes for direction L in state State-B
  1745. In State-B moving L
  1746. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1747. predict error 0
  1748. dir: dir isR
  1749. -/245: O: O489 (predict-yes)
  1750. I see 1 and I'm going to do: predict-yes
  1751. ENV: Agent did: predict-yes for direction R in state State-A
  1752. In State-A moving R
  1753. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1754. predict error 0
  1755. dir: dir isR
  1756. |\-246: O: O491 (predict-yes)
  1757. I see 1 and I'm going to do: predict-yes
  1758. ENV: Agent did: predict-yes for direction R in state State-B
  1759. In State-B moving R
  1760. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1761. predict error 1
  1762. dir: dir isU
  1763. /|247: O: O494 (predict-no)
  1764. I see 0 and I'm going to do: predict-no
  1765. ENV: Agent did: predict-no for direction U in state State-B
  1766. In State-B moving U
  1767. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1768. predict error 0
  1769. dir: dir isU
  1770. \-/248: O: O496 (predict-no)
  1771. I see 1 and I'm going to do: predict-no
  1772. ENV: Agent did: predict-no for direction U in state State-B
  1773. In State-B moving U
  1774. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1775. predict error 0
  1776. dir: dir isU
  1777. |\-249: O: O498 (predict-no)
  1778. I see 1 and I'm going to do: predict-no
  1779. ENV: Agent did: predict-no for direction U in state State-B
  1780. In State-B moving U
  1781. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1782. predict error 0
  1783. dir: dir isU
  1784. /|\250: O: O500 (predict-no)
  1785. I see 1 and I'm going to do: predict-no
  1786. ENV: Agent did: predict-no for direction U in state State-B
  1787. In State-B moving U
  1788. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1789. predict error 0
  1790. dir: dir isL
  1791. -/|251: O: O501 (predict-yes)
  1792. I see 1 and I'm going to do: predict-yes
  1793. ENV: Agent did: predict-yes for direction L in state State-B
  1794. In State-B moving L
  1795. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1796. predict error 0
  1797. dir: dir isL
  1798. \252: O: O504 (predict-no)
  1799. I see 1 and I'm going to do: predict-no
  1800. ENV: Agent did: predict-no for direction L in state State-A
  1801. In State-A moving L
  1802. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1803. predict error 0
  1804. dir: dir isR
  1805. -/|253: O: O506 (predict-no)
  1806. I see 1 and I'm going to do: predict-no
  1807. ENV: Agent did: predict-no for direction R in state State-A
  1808. In State-A moving R
  1809. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  1810. predict error 1
  1811. dir: dir isL
  1812. \-/254: O: O508 (predict-no)
  1813. I see 0 and I'm going to do: predict-no
  1814. ENV: Agent did: predict-no for direction L in state State-B
  1815. In State-B moving L
  1816. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  1817. predict error 1
  1818. dir: dir isR
  1819. |\255: O: O509 (predict-yes)
  1820. I see 0 and I'm going to do: predict-yes
  1821. ENV: Agent did: predict-yes for direction R in state State-A
  1822. In State-A moving R
  1823. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1824. predict error 0
  1825. dir: dir isU
  1826. -/256: O: O511 (predict-yes)
  1827. I see 1 and I'm going to do: predict-yes
  1828. ENV: Agent did: predict-yes for direction U in state State-B
  1829. In State-B moving U
  1830. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1831. predict error 1
  1832. dir: dir isU
  1833. |\-257: O: O514 (predict-no)
  1834. I see 0 and I'm going to do: predict-no
  1835. ENV: Agent did: predict-no for direction U in state State-B
  1836. In State-B moving U
  1837. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1838. predict error 0
  1839. dir: dir isR
  1840. /|\258: O: O516 (predict-no)
  1841. I see 1 and I'm going to do: predict-no
  1842. ENV: Agent did: predict-no for direction R in state State-B
  1843. In State-B moving R
  1844. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1845. predict error 0
  1846. dir: dir isU
  1847. -/|259: O: O517 (predict-yes)
  1848. I see 1 and I'm going to do: predict-yes
  1849. ENV: Agent did: predict-yes for direction U in state State-B
  1850. In State-B moving U
  1851. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  1852. predict error 1
  1853. dir: dir isL
  1854. \-/260: O: O519 (predict-yes)
  1855. I see 0 and I'm going to do: predict-yes
  1856. ENV: Agent did: predict-yes for direction L in state State-B
  1857. In State-B moving L
  1858. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1859. predict error 0
  1860. dir: dir isR
  1861. |261: O: O521 (predict-yes)
  1862. I see 1 and I'm going to do: predict-yes
  1863. ENV: Agent did: predict-yes for direction R in state State-A
  1864. In State-A moving R
  1865. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1866. predict error 0
  1867. dir: dir isU
  1868. \262: O: O524 (predict-no)
  1869. I see 1 and I'm going to do: predict-no
  1870. ENV: Agent did: predict-no for direction U in state State-B
  1871. In State-B moving U
  1872. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1873. predict error 0
  1874. dir: dir isL
  1875. -/|263: O: O525 (predict-yes)
  1876. I see 1 and I'm going to do: predict-yes
  1877. ENV: Agent did: predict-yes for direction L in state State-B
  1878. In State-B moving L
  1879. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1880. predict error 0
  1881. dir: dir isR
  1882. \-/264: O: O527 (predict-yes)
  1883. I see 1 and I'm going to do: predict-yes
  1884. ENV: Agent did: predict-yes for direction R in state State-A
  1885. In State-A moving R
  1886. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1887. predict error 0
  1888. dir: dir isL
  1889. |\-265: O: O529 (predict-yes)
  1890. I see 1 and I'm going to do: predict-yes
  1891. ENV: Agent did: predict-yes for direction L in state State-B
  1892. In State-B moving L
  1893. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1894. predict error 0
  1895. dir: dir isL
  1896. /266: O: O532 (predict-no)
  1897. I see 1 and I'm going to do: predict-no
  1898. ENV: Agent did: predict-no for direction L in state State-A
  1899. In State-A moving L
  1900. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1901. predict error 0
  1902. dir: dir isU
  1903. |\-267: O: O534 (predict-no)
  1904. I see 1 and I'm going to do: predict-no
  1905. ENV: Agent did: predict-no for direction U in state State-A
  1906. In State-A moving U
  1907. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1908. predict error 0
  1909. dir: dir isU
  1910. /|\268: O: O536 (predict-no)
  1911. I see 1 and I'm going to do: predict-no
  1912. ENV: Agent did: predict-no for direction U in state State-A
  1913. In State-A moving U
  1914. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1915. predict error 0
  1916. dir: dir isR
  1917. -/|269: O: O537 (predict-yes)
  1918. I see 1 and I'm going to do: predict-yes
  1919. ENV: Agent did: predict-yes for direction R in state State-A
  1920. In State-A moving R
  1921. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1922. predict error 0
  1923. dir: dir isL
  1924. \-270: O: O539 (predict-yes)
  1925. I see 1 and I'm going to do: predict-yes
  1926. ENV: Agent did: predict-yes for direction L in state State-B
  1927. In State-B moving L
  1928. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1929. predict error 0
  1930. dir: dir isL
  1931. /|\271: O: O542 (predict-no)
  1932. I see 1 and I'm going to do: predict-no
  1933. ENV: Agent did: predict-no for direction L in state State-A
  1934. In State-A moving L
  1935. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1936. predict error 0
  1937. dir: dir isL
  1938. -272: O: O544 (predict-no)
  1939. I see 1 and I'm going to do: predict-no
  1940. ENV: Agent did: predict-no for direction L in state State-A
  1941. In State-A moving L
  1942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1943. predict error 0
  1944. dir: dir isU
  1945. /|\273: O: O546 (predict-no)
  1946. I see 1 and I'm going to do: predict-no
  1947. ENV: Agent did: predict-no for direction U in state State-A
  1948. In State-A moving U
  1949. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1950. predict error 0
  1951. dir: dir isU
  1952. -/|274: O: O548 (predict-no)
  1953. I see 1 and I'm going to do: predict-no
  1954. ENV: Agent did: predict-no for direction U in state State-A
  1955. In State-A moving U
  1956. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  1957. predict error 0
  1958. dir: dir isR
  1959. \-/275: O: O549 (predict-yes)
  1960. I see 1 and I'm going to do: predict-yes
  1961. ENV: Agent did: predict-yes for direction R in state State-A
  1962. In State-A moving R
  1963. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1964. predict error 0
  1965. dir: dir isR
  1966. |\-276: O: O552 (predict-no)
  1967. I see 1 and I'm going to do: predict-no
  1968. ENV: Agent did: predict-no for direction R in state State-B
  1969. In State-B moving R
  1970. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1971. predict error 0
  1972. dir: dir isL
  1973. /277: O: O553 (predict-yes)
  1974. I see 1 and I'm going to do: predict-yes
  1975. ENV: Agent did: predict-yes for direction L in state State-B
  1976. In State-B moving L
  1977. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  1978. predict error 0
  1979. dir: dir isR
  1980. |\278: O: O555 (predict-yes)
  1981. I see 1 and I'm going to do: predict-yes
  1982. ENV: Agent did: predict-yes for direction R in state State-A
  1983. In State-A moving R
  1984. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  1985. predict error 0
  1986. dir: dir isU
  1987. -279: O: O558 (predict-no)
  1988. I see 1 and I'm going to do: predict-no
  1989. ENV: Agent did: predict-no for direction U in state State-B
  1990. In State-B moving U
  1991. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1992. predict error 0
  1993. dir: dir isU
  1994. /|\280: O: O560 (predict-no)
  1995. I see 1 and I'm going to do: predict-no
  1996. ENV: Agent did: predict-no for direction U in state State-B
  1997. In State-B moving U
  1998. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  1999. predict error 0
  2000. dir: dir isL
  2001. -/|281: O: O561 (predict-yes)
  2002. I see 1 and I'm going to do: predict-yes
  2003. ENV: Agent did: predict-yes for direction L in state State-B
  2004. In State-B moving L
  2005. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2006. predict error 0
  2007. dir: dir isR
  2008. \282: O: O563 (predict-yes)
  2009. I see 1 and I'm going to do: predict-yes
  2010. ENV: Agent did: predict-yes for direction R in state State-A
  2011. In State-A moving R
  2012. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2013. predict error 0
  2014. dir: dir isU
  2015. -283: O: O565 (predict-yes)
  2016. I see 1 and I'm going to do: predict-yes
  2017. ENV: Agent did: predict-yes for direction U in state State-B
  2018. In State-B moving U
  2019. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2020. predict error 1
  2021. dir: dir isL
  2022. /|\284: O: O567 (predict-yes)
  2023. I see 0 and I'm going to do: predict-yes
  2024. ENV: Agent did: predict-yes for direction L in state State-B
  2025. In State-B moving L
  2026. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2027. predict error 0
  2028. dir: dir isU
  2029. -/|285: O: O569 (predict-yes)
  2030. I see 1 and I'm going to do: predict-yes
  2031. ENV: Agent did: predict-yes for direction U in state State-A
  2032. In State-A moving U
  2033. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2034. predict error 1
  2035. dir: dir isR
  2036. \-/286: O: O572 (predict-no)
  2037. I see 0 and I'm going to do: predict-no
  2038. ENV: Agent did: predict-no for direction R in state State-A
  2039. In State-A moving R
  2040. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2041. predict error 1
  2042. dir: dir isU
  2043. |\-287: O: O574 (predict-no)
  2044. I see 0 and I'm going to do: predict-no
  2045. ENV: Agent did: predict-no for direction U in state State-B
  2046. In State-B moving U
  2047. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2048. predict error 0
  2049. dir: dir isR
  2050. /|\288: O: O576 (predict-no)
  2051. I see 1 and I'm going to do: predict-no
  2052. ENV: Agent did: predict-no for direction R in state State-B
  2053. In State-B moving R
  2054. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2055. predict error 0
  2056. dir: dir isU
  2057. -289: O: O578 (predict-no)
  2058. I see 1 and I'm going to do: predict-no
  2059. ENV: Agent did: predict-no for direction U in state State-B
  2060. In State-B moving U
  2061. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2062. predict error 0
  2063. dir: dir isU
  2064. /|\290: O: O580 (predict-no)
  2065. I see 1 and I'm going to do: predict-no
  2066. ENV: Agent did: predict-no for direction U in state State-B
  2067. In State-B moving U
  2068. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2069. predict error 0
  2070. dir: dir isR
  2071. -/|291: O: O582 (predict-no)
  2072. I see 1 and I'm going to do: predict-no
  2073. ENV: Agent did: predict-no for direction R in state State-B
  2074. In State-B moving R
  2075. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2076. predict error 0
  2077. dir: dir isL
  2078. \292: O: O583 (predict-yes)
  2079. I see 1 and I'm going to do: predict-yes
  2080. ENV: Agent did: predict-yes for direction L in state State-B
  2081. In State-B moving L
  2082. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2083. predict error 0
  2084. dir: dir isR
  2085. -293: O: O585 (predict-yes)
  2086. I see 1 and I'm going to do: predict-yes
  2087. ENV: Agent did: predict-yes for direction R in state State-A
  2088. In State-A moving R
  2089. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2090. predict error 0
  2091. dir: dir isL
  2092. /|\294: O: O587 (predict-yes)
  2093. I see 1 and I'm going to do: predict-yes
  2094. ENV: Agent did: predict-yes for direction L in state State-B
  2095. In State-B moving L
  2096. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2097. predict error 0
  2098. dir: dir isR
  2099. -/295: O: O589 (predict-yes)
  2100. I see 1 and I'm going to do: predict-yes
  2101. ENV: Agent did: predict-yes for direction R in state State-A
  2102. In State-A moving R
  2103. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2104. predict error 0
  2105. dir: dir isU
  2106. |296: O: O592 (predict-no)
  2107. I see 1 and I'm going to do: predict-no
  2108. ENV: Agent did: predict-no for direction U in state State-B
  2109. In State-B moving U
  2110. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2111. predict error 0
  2112. dir: dir isL
  2113. \-/297: O: O593 (predict-yes)
  2114. I see 1 and I'm going to do: predict-yes
  2115. ENV: Agent did: predict-yes for direction L in state State-B
  2116. In State-B moving L
  2117. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2118. predict error 0
  2119. dir: dir isR
  2120. |\-298: O: O595 (predict-yes)
  2121. I see 1 and I'm going to do: predict-yes
  2122. ENV: Agent did: predict-yes for direction R in state State-A
  2123. In State-A moving R
  2124. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2125. predict error 0
  2126. dir: dir isR
  2127. /299: O: O598 (predict-no)
  2128. I see 1 and I'm going to do: predict-no
  2129. ENV: Agent did: predict-no for direction R in state State-B
  2130. In State-B moving R
  2131. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2132. predict error 0
  2133. dir: dir isR
  2134. |\-300: O: O599 (predict-yes)
  2135. I see 1 and I'm going to do: predict-yes
  2136. ENV: Agent did: predict-yes for direction R in state State-B
  2137. In State-B moving R
  2138. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2139. predict error 1
  2140. dir: dir isU
  2141. /|\301: O: O602 (predict-no)
  2142. I see 0 and I'm going to do: predict-no
  2143. ENV: Agent did: predict-no for direction U in state State-B
  2144. In State-B moving U
  2145. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2146. predict error 0
  2147. dir: dir isU
  2148. -302: O: O604 (predict-no)
  2149. I see 1 and I'm going to do: predict-no
  2150. ENV: Agent did: predict-no for direction U in state State-B
  2151. In State-B moving U
  2152. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2153. predict error 0
  2154. dir: dir isU
  2155. /303: O: O606 (predict-no)
  2156. I see 1 and I'm going to do: predict-no
  2157. ENV: Agent did: predict-no for direction U in state State-B
  2158. In State-B moving U
  2159. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2160. predict error 0
  2161. dir: dir isR
  2162. |\-304: O: O608 (predict-no)
  2163. I see 1 and I'm going to do: predict-no
  2164. ENV: Agent did: predict-no for direction R in state State-B
  2165. In State-B moving R
  2166. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2167. predict error 0
  2168. dir: dir isU
  2169. /|\305: O: O610 (predict-no)
  2170. I see 1 and I'm going to do: predict-no
  2171. ENV: Agent did: predict-no for direction U in state State-B
  2172. In State-B moving U
  2173. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2174. predict error 0
  2175. dir: dir isL
  2176. -/306: O: O611 (predict-yes)
  2177. I see 1 and I'm going to do: predict-yes
  2178. ENV: Agent did: predict-yes for direction L in state State-B
  2179. In State-B moving L
  2180. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2181. predict error 0
  2182. dir: dir isU
  2183. |\-307: O: O614 (predict-no)
  2184. I see 1 and I'm going to do: predict-no
  2185. ENV: Agent did: predict-no for direction U in state State-A
  2186. In State-A moving U
  2187. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2188. predict error 0
  2189. dir: dir isR
  2190. /|\308: O: O615 (predict-yes)
  2191. I see 1 and I'm going to do: predict-yes
  2192. ENV: Agent did: predict-yes for direction R in state State-A
  2193. In State-A moving R
  2194. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2195. predict error 0
  2196. dir: dir isU
  2197. -/309: O: O617 (predict-yes)
  2198. I see 1 and I'm going to do: predict-yes
  2199. ENV: Agent did: predict-yes for direction U in state State-B
  2200. In State-B moving U
  2201. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2202. predict error 1
  2203. dir: dir isU
  2204. |\310: O: O620 (predict-no)
  2205. I see 0 and I'm going to do: predict-no
  2206. ENV: Agent did: predict-no for direction U in state State-B
  2207. In State-B moving U
  2208. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2209. predict error 0
  2210. dir: dir isL
  2211. -/|311: O: O621 (predict-yes)
  2212. I see 1 and I'm going to do: predict-yes
  2213. ENV: Agent did: predict-yes for direction L in state State-B
  2214. In State-B moving L
  2215. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2216. predict error 0
  2217. dir: dir isR
  2218. \312: O: O624 (predict-no)
  2219. I see 1 and I'm going to do: predict-no
  2220. ENV: Agent did: predict-no for direction R in state State-A
  2221. In State-A moving R
  2222. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2223. predict error 1
  2224. dir: dir isR
  2225. -/|313: O: O626 (predict-no)
  2226. I see 0 and I'm going to do: predict-no
  2227. ENV: Agent did: predict-no for direction R in state State-B
  2228. In State-B moving R
  2229. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2230. predict error 0
  2231. dir: dir isU
  2232. \-314: O: O628 (predict-no)
  2233. I see 1 and I'm going to do: predict-no
  2234. ENV: Agent did: predict-no for direction U in state State-B
  2235. In State-B moving U
  2236. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2237. predict error 0
  2238. dir: dir isU
  2239. /|\315: O: O630 (predict-no)
  2240. I see 1 and I'm going to do: predict-no
  2241. ENV: Agent did: predict-no for direction U in state State-B
  2242. In State-B moving U
  2243. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2244. predict error 0
  2245. dir: dir isR
  2246. -316: O: O632 (predict-no)
  2247. I see 1 and I'm going to do: predict-no
  2248. ENV: Agent did: predict-no for direction R in state State-B
  2249. In State-B moving R
  2250. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2251. predict error 0
  2252. dir: dir isL
  2253. /|\317: O: O634 (predict-no)
  2254. I see 1 and I'm going to do: predict-no
  2255. ENV: Agent did: predict-no for direction L in state State-B
  2256. In State-B moving L
  2257. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  2258. predict error 1
  2259. dir: dir isR
  2260. -/318: O: O635 (predict-yes)
  2261. I see 0 and I'm going to do: predict-yes
  2262. ENV: Agent did: predict-yes for direction R in state State-A
  2263. In State-A moving R
  2264. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2265. predict error 0
  2266. dir: dir isU
  2267. |\-319: O: O638 (predict-no)
  2268. I see 1 and I'm going to do: predict-no
  2269. ENV: Agent did: predict-no for direction U in state State-B
  2270. In State-B moving U
  2271. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2272. predict error 0
  2273. dir: dir isU
  2274. /|320: O: O640 (predict-no)
  2275. I see 1 and I'm going to do: predict-no
  2276. ENV: Agent did: predict-no for direction U in state State-B
  2277. In State-B moving U
  2278. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2279. predict error 0
  2280. dir: dir isR
  2281. \-/321: O: O642 (predict-no)
  2282. I see 1 and I'm going to do: predict-no
  2283. ENV: Agent did: predict-no for direction R in state State-B
  2284. In State-B moving R
  2285. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2286. predict error 0
  2287. dir: dir isU
  2288. |322: O: O644 (predict-no)
  2289. I see 1 and I'm going to do: predict-no
  2290. ENV: Agent did: predict-no for direction U in state State-B
  2291. In State-B moving U
  2292. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2293. predict error 0
  2294. dir: dir isL
  2295. \-/323: O: O645 (predict-yes)
  2296. I see 1 and I'm going to do: predict-yes
  2297. ENV: Agent did: predict-yes for direction L in state State-B
  2298. In State-B moving L
  2299. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2300. predict error 0
  2301. dir: dir isU
  2302. |324: O: O648 (predict-no)
  2303. I see 1 and I'm going to do: predict-no
  2304. ENV: Agent did: predict-no for direction U in state State-A
  2305. In State-A moving U
  2306. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2307. predict error 0
  2308. dir: dir isU
  2309. \-/325: O: O650 (predict-no)
  2310. I see 1 and I'm going to do: predict-no
  2311. ENV: Agent did: predict-no for direction U in state State-A
  2312. In State-A moving U
  2313. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2314. predict error 0
  2315. dir: dir isR
  2316. |326: O: O651 (predict-yes)
  2317. I see 1 and I'm going to do: predict-yes
  2318. ENV: Agent did: predict-yes for direction R in state State-A
  2319. In State-A moving R
  2320. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2321. predict error 0
  2322. dir: dir isU
  2323. \-/327: O: O654 (predict-no)
  2324. I see 1 and I'm going to do: predict-no
  2325. ENV: Agent did: predict-no for direction U in state State-B
  2326. In State-B moving U
  2327. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2328. predict error 0
  2329. dir: dir isU
  2330. |\328: O: O656 (predict-no)
  2331. I see 1 and I'm going to do: predict-no
  2332. ENV: Agent did: predict-no for direction U in state State-B
  2333. In State-B moving U
  2334. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2335. predict error 0
  2336. dir: dir isL
  2337. -/|329: O: O657 (predict-yes)
  2338. I see 1 and I'm going to do: predict-yes
  2339. ENV: Agent did: predict-yes for direction L in state State-B
  2340. In State-B moving L
  2341. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2342. predict error 0
  2343. dir: dir isU
  2344. \-/330: O: O660 (predict-no)
  2345. I see 1 and I'm going to do: predict-no
  2346. ENV: Agent did: predict-no for direction U in state State-A
  2347. In State-A moving U
  2348. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2349. predict error 0
  2350. dir: dir isU
  2351. |\-331: O: O662 (predict-no)
  2352. I see 1 and I'm going to do: predict-no
  2353. ENV: Agent did: predict-no for direction U in state State-A
  2354. In State-A moving U
  2355. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2356. predict error 0
  2357. dir: dir isL
  2358. /332: O: O664 (predict-no)
  2359. I see 1 and I'm going to do: predict-no
  2360. ENV: Agent did: predict-no for direction L in state State-A
  2361. In State-A moving L
  2362. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2363. predict error 0
  2364. dir: dir isU
  2365. |\-333: O: O665 (predict-yes)
  2366. I see 1 and I'm going to do: predict-yes
  2367. ENV: Agent did: predict-yes for direction U in state State-A
  2368. In State-A moving U
  2369. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2370. predict error 1
  2371. dir: dir isR
  2372. /|\334: O: O667 (predict-yes)
  2373. I see 0 and I'm going to do: predict-yes
  2374. ENV: Agent did: predict-yes for direction R in state State-A
  2375. In State-A moving R
  2376. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2377. predict error 0
  2378. dir: dir isL
  2379. -/|335: O: O669 (predict-yes)
  2380. I see 1 and I'm going to do: predict-yes
  2381. ENV: Agent did: predict-yes for direction L in state State-B
  2382. In State-B moving L
  2383. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2384. predict error 0
  2385. dir: dir isU
  2386. \-/336: O: O672 (predict-no)
  2387. I see 1 and I'm going to do: predict-no
  2388. ENV: Agent did: predict-no for direction U in state State-A
  2389. In State-A moving U
  2390. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2391. predict error 0
  2392. dir: dir isL
  2393. |\-337: O: O674 (predict-no)
  2394. I see 1 and I'm going to do: predict-no
  2395. ENV: Agent did: predict-no for direction L in state State-A
  2396. In State-A moving L
  2397. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2398. predict error 0
  2399. dir: dir isR
  2400. /|338: O: O675 (predict-yes)
  2401. I see 1 and I'm going to do: predict-yes
  2402. ENV: Agent did: predict-yes for direction R in state State-A
  2403. In State-A moving R
  2404. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2405. predict error 0
  2406. dir: dir isR
  2407. \-339: O: O678 (predict-no)
  2408. I see 1 and I'm going to do: predict-no
  2409. ENV: Agent did: predict-no for direction R in state State-B
  2410. In State-B moving R
  2411. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2412. predict error 0
  2413. dir: dir isL
  2414. /|340: O: O679 (predict-yes)
  2415. I see 1 and I'm going to do: predict-yes
  2416. ENV: Agent did: predict-yes for direction L in state State-B
  2417. In State-B moving L
  2418. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2419. predict error 0
  2420. dir: dir isR
  2421. \341: O: O681 (predict-yes)
  2422. I see 1 and I'm going to do: predict-yes
  2423. ENV: Agent did: predict-yes for direction R in state State-A
  2424. In State-A moving R
  2425. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2426. predict error 0
  2427. dir: dir isR
  2428. -342: O: O684 (predict-no)
  2429. I see 1 and I'm going to do: predict-no
  2430. ENV: Agent did: predict-no for direction R in state State-B
  2431. In State-B moving R
  2432. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2433. predict error 0
  2434. dir: dir isL
  2435. /|\343: O: O685 (predict-yes)
  2436. I see 1 and I'm going to do: predict-yes
  2437. ENV: Agent did: predict-yes for direction L in state State-B
  2438. In State-B moving L
  2439. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2440. predict error 0
  2441. dir: dir isU
  2442. -/|344: O: O688 (predict-no)
  2443. I see 1 and I'm going to do: predict-no
  2444. ENV: Agent did: predict-no for direction U in state State-A
  2445. In State-A moving U
  2446. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2447. predict error 0
  2448. dir: dir isL
  2449. \-/345: O: O690 (predict-no)
  2450. I see 1 and I'm going to do: predict-no
  2451. ENV: Agent did: predict-no for direction L in state State-A
  2452. In State-A moving L
  2453. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2454. predict error 0
  2455. dir: dir isL
  2456. |\-346: O: O692 (predict-no)
  2457. I see 1 and I'm going to do: predict-no
  2458. ENV: Agent did: predict-no for direction L in state State-A
  2459. In State-A moving L
  2460. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2461. predict error 0
  2462. dir: dir isR
  2463. /|347: O: O693 (predict-yes)
  2464. I see 1 and I'm going to do: predict-yes
  2465. ENV: Agent did: predict-yes for direction R in state State-A
  2466. In State-A moving R
  2467. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2468. predict error 0
  2469. dir: dir isU
  2470. \-/348: O: O696 (predict-no)
  2471. I see 1 and I'm going to do: predict-no
  2472. ENV: Agent did: predict-no for direction U in state State-B
  2473. In State-B moving U
  2474. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2475. predict error 0
  2476. dir: dir isR
  2477. |\-349: O: O698 (predict-no)
  2478. I see 1 and I'm going to do: predict-no
  2479. ENV: Agent did: predict-no for direction R in state State-B
  2480. In State-B moving R
  2481. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2482. predict error 0
  2483. dir: dir isU
  2484. /|\350: O: O700 (predict-no)
  2485. I see 1 and I'm going to do: predict-no
  2486. ENV: Agent did: predict-no for direction U in state State-B
  2487. In State-B moving U
  2488. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2489. predict error 0
  2490. dir: dir isR
  2491. -351: O: O702 (predict-no)
  2492. I see 1 and I'm going to do: predict-no
  2493. ENV: Agent did: predict-no for direction R in state State-B
  2494. In State-B moving R
  2495. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2496. predict error 0
  2497. dir: dir isU
  2498. /352: O: O703 (predict-yes)
  2499. I see 1 and I'm going to do: predict-yes
  2500. ENV: Agent did: predict-yes for direction U in state State-B
  2501. In State-B moving U
  2502. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2503. predict error 1
  2504. dir: dir isR
  2505. |\-353: O: O706 (predict-no)
  2506. I see 0 and I'm going to do: predict-no
  2507. ENV: Agent did: predict-no for direction R in state State-B
  2508. In State-B moving R
  2509. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2510. predict error 0
  2511. dir: dir isL
  2512. /|\354: O: O707 (predict-yes)
  2513. I see 1 and I'm going to do: predict-yes
  2514. ENV: Agent did: predict-yes for direction L in state State-B
  2515. In State-B moving L
  2516. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2517. predict error 0
  2518. dir: dir isR
  2519. -/|355: O: O709 (predict-yes)
  2520. I see 1 and I'm going to do: predict-yes
  2521. ENV: Agent did: predict-yes for direction R in state State-A
  2522. In State-A moving R
  2523. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2524. predict error 0
  2525. dir: dir isL
  2526. \-356: O: O711 (predict-yes)
  2527. I see 1 and I'm going to do: predict-yes
  2528. ENV: Agent did: predict-yes for direction L in state State-B
  2529. In State-B moving L
  2530. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2531. predict error 0
  2532. dir: dir isL
  2533. /357: O: O714 (predict-no)
  2534. I see 1 and I'm going to do: predict-no
  2535. ENV: Agent did: predict-no for direction L in state State-A
  2536. In State-A moving L
  2537. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2538. predict error 0
  2539. dir: dir isU
  2540. |\-358: O: O716 (predict-no)
  2541. I see 1 and I'm going to do: predict-no
  2542. ENV: Agent did: predict-no for direction U in state State-A
  2543. In State-A moving U
  2544. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2545. predict error 0
  2546. dir: dir isL
  2547. /|359: O: O718 (predict-no)
  2548. I see 1 and I'm going to do: predict-no
  2549. ENV: Agent did: predict-no for direction L in state State-A
  2550. In State-A moving L
  2551. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2552. predict error 0
  2553. dir: dir isU
  2554. \360: O: O720 (predict-no)
  2555. I see 1 and I'm going to do: predict-no
  2556. ENV: Agent did: predict-no for direction U in state State-A
  2557. In State-A moving U
  2558. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2559. predict error 0
  2560. dir: dir isU
  2561. -/|361: O: O721 (predict-yes)
  2562. I see 1 and I'm going to do: predict-yes
  2563. ENV: Agent did: predict-yes for direction U in state State-A
  2564. In State-A moving U
  2565. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2566. predict error 1
  2567. dir: dir isR
  2568. \362: O: O723 (predict-yes)
  2569. I see 0 and I'm going to do: predict-yes
  2570. ENV: Agent did: predict-yes for direction R in state State-A
  2571. In State-A moving R
  2572. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2573. predict error 0
  2574. dir: dir isU
  2575. -/363: O: O726 (predict-no)
  2576. I see 1 and I'm going to do: predict-no
  2577. ENV: Agent did: predict-no for direction U in state State-B
  2578. In State-B moving U
  2579. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2580. predict error 0
  2581. dir: dir isU
  2582. |\364: O: O728 (predict-no)
  2583. I see 1 and I'm going to do: predict-no
  2584. ENV: Agent did: predict-no for direction U in state State-B
  2585. In State-B moving U
  2586. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2587. predict error 0
  2588. dir: dir isU
  2589. -/|365: O: O730 (predict-no)
  2590. I see 1 and I'm going to do: predict-no
  2591. ENV: Agent did: predict-no for direction U in state State-B
  2592. In State-B moving U
  2593. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2594. predict error 0
  2595. dir: dir isL
  2596. \366: O: O731 (predict-yes)
  2597. I see 1 and I'm going to do: predict-yes
  2598. ENV: Agent did: predict-yes for direction L in state State-B
  2599. In State-B moving L
  2600. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2601. predict error 0
  2602. dir: dir isL
  2603. -/|367: O: O734 (predict-no)
  2604. I see 1 and I'm going to do: predict-no
  2605. ENV: Agent did: predict-no for direction L in state State-A
  2606. In State-A moving L
  2607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2608. predict error 0
  2609. dir: dir isR
  2610. \-/368: O: O735 (predict-yes)
  2611. I see 1 and I'm going to do: predict-yes
  2612. ENV: Agent did: predict-yes for direction R in state State-A
  2613. In State-A moving R
  2614. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2615. predict error 0
  2616. dir: dir isR
  2617. |369: O: O737 (predict-yes)
  2618. I see 1 and I'm going to do: predict-yes
  2619. ENV: Agent did: predict-yes for direction R in state State-B
  2620. In State-B moving R
  2621. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  2622. predict error 1
  2623. dir: dir isL
  2624. \370: O: O739 (predict-yes)
  2625. I see 0 and I'm going to do: predict-yes
  2626. ENV: Agent did: predict-yes for direction L in state State-B
  2627. In State-B moving L
  2628. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2629. predict error 0
  2630. dir: dir isL
  2631. -/371: O: O742 (predict-no)
  2632. I see 1 and I'm going to do: predict-no
  2633. ENV: Agent did: predict-no for direction L in state State-A
  2634. In State-A moving L
  2635. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2636. predict error 0
  2637. dir: dir isR
  2638. |372: O: O743 (predict-yes)
  2639. I see 1 and I'm going to do: predict-yes
  2640. ENV: Agent did: predict-yes for direction R in state State-A
  2641. In State-A moving R
  2642. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2643. predict error 0
  2644. dir: dir isU
  2645. \373: O: O746 (predict-no)
  2646. I see 1 and I'm going to do: predict-no
  2647. ENV: Agent did: predict-no for direction U in state State-B
  2648. In State-B moving U
  2649. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2650. predict error 0
  2651. dir: dir isR
  2652. -/374: O: O748 (predict-no)
  2653. I see 1 and I'm going to do: predict-no
  2654. ENV: Agent did: predict-no for direction R in state State-B
  2655. In State-B moving R
  2656. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2657. predict error 0
  2658. dir: dir isR
  2659. |\375: O: O750 (predict-no)
  2660. I see 1 and I'm going to do: predict-no
  2661. ENV: Agent did: predict-no for direction R in state State-B
  2662. In State-B moving R
  2663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2664. predict error 0
  2665. dir: dir isR
  2666. -/376: O: O752 (predict-no)
  2667. I see 1 and I'm going to do: predict-no
  2668. ENV: Agent did: predict-no for direction R in state State-B
  2669. In State-B moving R
  2670. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2671. predict error 0
  2672. dir: dir isR
  2673. |\-377: O: O754 (predict-no)
  2674. I see 1 and I'm going to do: predict-no
  2675. ENV: Agent did: predict-no for direction R in state State-B
  2676. In State-B moving R
  2677. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2678. predict error 0
  2679. dir: dir isU
  2680. /|378: O: O756 (predict-no)
  2681. I see 1 and I'm going to do: predict-no
  2682. ENV: Agent did: predict-no for direction U in state State-B
  2683. In State-B moving U
  2684. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2685. predict error 0
  2686. dir: dir isL
  2687. \-/379: O: O757 (predict-yes)
  2688. I see 1 and I'm going to do: predict-yes
  2689. ENV: Agent did: predict-yes for direction L in state State-B
  2690. In State-B moving L
  2691. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2692. predict error 0
  2693. dir: dir isR
  2694. |\-380: O: O759 (predict-yes)
  2695. I see 1 and I'm going to do: predict-yes
  2696. ENV: Agent did: predict-yes for direction R in state State-A
  2697. In State-A moving R
  2698. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2699. predict error 0
  2700. dir: dir isR
  2701. /|\381: O: O762 (predict-no)
  2702. I see 1 and I'm going to do: predict-no
  2703. ENV: Agent did: predict-no for direction R in state State-B
  2704. In State-B moving R
  2705. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2706. predict error 0
  2707. dir: dir isR
  2708. -382: O: O764 (predict-no)
  2709. I see 1 and I'm going to do: predict-no
  2710. ENV: Agent did: predict-no for direction R in state State-B
  2711. In State-B moving R
  2712. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2713. predict error 0
  2714. dir: dir isU
  2715. /|\383: O: O766 (predict-no)
  2716. I see 1 and I'm going to do: predict-no
  2717. ENV: Agent did: predict-no for direction U in state State-B
  2718. In State-B moving U
  2719. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2720. predict error 0
  2721. dir: dir isL
  2722. -/|384: O: O767 (predict-yes)
  2723. I see 1 and I'm going to do: predict-yes
  2724. ENV: Agent did: predict-yes for direction L in state State-B
  2725. In State-B moving L
  2726. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2727. predict error 0
  2728. dir: dir isU
  2729. \-/385: O: O770 (predict-no)
  2730. I see 1 and I'm going to do: predict-no
  2731. ENV: Agent did: predict-no for direction U in state State-A
  2732. In State-A moving U
  2733. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2734. predict error 0
  2735. dir: dir isL
  2736. |\-386: O: O772 (predict-no)
  2737. I see 1 and I'm going to do: predict-no
  2738. ENV: Agent did: predict-no for direction L in state State-A
  2739. In State-A moving L
  2740. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2741. predict error 0
  2742. dir: dir isR
  2743. /|387: O: O773 (predict-yes)
  2744. I see 1 and I'm going to do: predict-yes
  2745. ENV: Agent did: predict-yes for direction R in state State-A
  2746. In State-A moving R
  2747. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2748. predict error 0
  2749. dir: dir isU
  2750. \-388: O: O776 (predict-no)
  2751. I see 1 and I'm going to do: predict-no
  2752. ENV: Agent did: predict-no for direction U in state State-B
  2753. In State-B moving U
  2754. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2755. predict error 0
  2756. dir: dir isR
  2757. /|389: O: O778 (predict-no)
  2758. I see 1 and I'm going to do: predict-no
  2759. ENV: Agent did: predict-no for direction R in state State-B
  2760. In State-B moving R
  2761. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2762. predict error 0
  2763. dir: dir isR
  2764. \-/390: O: O780 (predict-no)
  2765. I see 1 and I'm going to do: predict-no
  2766. ENV: Agent did: predict-no for direction R in state State-B
  2767. In State-B moving R
  2768. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2769. predict error 0
  2770. dir: dir isU
  2771. |\-391: O: O782 (predict-no)
  2772. I see 1 and I'm going to do: predict-no
  2773. ENV: Agent did: predict-no for direction U in state State-B
  2774. In State-B moving U
  2775. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2776. predict error 0
  2777. dir: dir isL
  2778. /392: O: O783 (predict-yes)
  2779. I see 1 and I'm going to do: predict-yes
  2780. ENV: Agent did: predict-yes for direction L in state State-B
  2781. In State-B moving L
  2782. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2783. predict error 0
  2784. dir: dir isR
  2785. |\393: O: O785 (predict-yes)
  2786. I see 1 and I'm going to do: predict-yes
  2787. ENV: Agent did: predict-yes for direction R in state State-A
  2788. In State-A moving R
  2789. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2790. predict error 0
  2791. dir: dir isR
  2792. -/|394: O: O788 (predict-no)
  2793. I see 1 and I'm going to do: predict-no
  2794. ENV: Agent did: predict-no for direction R in state State-B
  2795. In State-B moving R
  2796. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2797. predict error 0
  2798. dir: dir isR
  2799. \-/395: O: O790 (predict-no)
  2800. I see 1 and I'm going to do: predict-no
  2801. ENV: Agent did: predict-no for direction R in state State-B
  2802. In State-B moving R
  2803. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2804. predict error 0
  2805. dir: dir isU
  2806. |\396: O: O792 (predict-no)
  2807. I see 1 and I'm going to do: predict-no
  2808. ENV: Agent did: predict-no for direction U in state State-B
  2809. In State-B moving U
  2810. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2811. predict error 0
  2812. dir: dir isU
  2813. -/|397: O: O794 (predict-no)
  2814. I see 1 and I'm going to do: predict-no
  2815. ENV: Agent did: predict-no for direction U in state State-B
  2816. In State-B moving U
  2817. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2818. predict error 0
  2819. dir: dir isR
  2820. \-/398: O: O796 (predict-no)
  2821. I see 1 and I'm going to do: predict-no
  2822. ENV: Agent did: predict-no for direction R in state State-B
  2823. In State-B moving R
  2824. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2825. predict error 0
  2826. dir: dir isL
  2827. |\-399: O: O797 (predict-yes)
  2828. I see 1 and I'm going to do: predict-yes
  2829. ENV: Agent did: predict-yes for direction L in state State-B
  2830. In State-B moving L
  2831. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2832. predict error 0
  2833. dir: dir isL
  2834. /|\400: O: O800 (predict-no)
  2835. I see 1 and I'm going to do: predict-no
  2836. ENV: Agent did: predict-no for direction L in state State-A
  2837. In State-A moving L
  2838. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2839. predict error 0
  2840. dir: dir isR
  2841. -/401: O: O802 (predict-no)
  2842. I see 1 and I'm going to do: predict-no
  2843. ENV: Agent did: predict-no for direction R in state State-A
  2844. In State-A moving R
  2845. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  2846. predict error 1
  2847. dir: dir isL
  2848. |402: O: O803 (predict-yes)
  2849. I see 0 and I'm going to do: predict-yes
  2850. ENV: Agent did: predict-yes for direction L in state State-B
  2851. In State-B moving L
  2852. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2853. predict error 0
  2854. dir: dir isL
  2855. \-/403: O: O806 (predict-no)
  2856. I see 1 and I'm going to do: predict-no
  2857. ENV: Agent did: predict-no for direction L in state State-A
  2858. In State-A moving L
  2859. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2860. predict error 0
  2861. dir: dir isU
  2862. |\-404: O: O808 (predict-no)
  2863. I see 1 and I'm going to do: predict-no
  2864. ENV: Agent did: predict-no for direction U in state State-A
  2865. In State-A moving U
  2866. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2867. predict error 0
  2868. dir: dir isL
  2869. /405: O: O810 (predict-no)
  2870. I see 1 and I'm going to do: predict-no
  2871. ENV: Agent did: predict-no for direction L in state State-A
  2872. In State-A moving L
  2873. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2874. predict error 0
  2875. dir: dir isU
  2876. |\406: O: O812 (predict-no)
  2877. I see 1 and I'm going to do: predict-no
  2878. ENV: Agent did: predict-no for direction U in state State-A
  2879. In State-A moving U
  2880. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2881. predict error 0
  2882. dir: dir isR
  2883. -407: O: O813 (predict-yes)
  2884. I see 1 and I'm going to do: predict-yes
  2885. ENV: Agent did: predict-yes for direction R in state State-A
  2886. In State-A moving R
  2887. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2888. predict error 0
  2889. dir: dir isR
  2890. /|408: O: O816 (predict-no)
  2891. I see 1 and I'm going to do: predict-no
  2892. ENV: Agent did: predict-no for direction R in state State-B
  2893. In State-B moving R
  2894. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2895. predict error 0
  2896. dir: dir isL
  2897. \-409: O: O817 (predict-yes)
  2898. I see 1 and I'm going to do: predict-yes
  2899. ENV: Agent did: predict-yes for direction L in state State-B
  2900. In State-B moving L
  2901. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2902. predict error 0
  2903. dir: dir isL
  2904. /|\410: O: O820 (predict-no)
  2905. I see 1 and I'm going to do: predict-no
  2906. ENV: Agent did: predict-no for direction L in state State-A
  2907. In State-A moving L
  2908. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2909. predict error 0
  2910. dir: dir isR
  2911. -/|411: O: O821 (predict-yes)
  2912. I see 1 and I'm going to do: predict-yes
  2913. ENV: Agent did: predict-yes for direction R in state State-A
  2914. In State-A moving R
  2915. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2916. predict error 0
  2917. dir: dir isL
  2918. \412: O: O823 (predict-yes)
  2919. I see 1 and I'm going to do: predict-yes
  2920. ENV: Agent did: predict-yes for direction L in state State-B
  2921. In State-B moving L
  2922. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2923. predict error 0
  2924. dir: dir isR
  2925. -/413: O: O825 (predict-yes)
  2926. I see 1 and I'm going to do: predict-yes
  2927. ENV: Agent did: predict-yes for direction R in state State-A
  2928. In State-A moving R
  2929. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2930. predict error 0
  2931. dir: dir isL
  2932. |\-414: O: O827 (predict-yes)
  2933. I see 1 and I'm going to do: predict-yes
  2934. ENV: Agent did: predict-yes for direction L in state State-B
  2935. In State-B moving L
  2936. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2937. predict error 0
  2938. dir: dir isU
  2939. /|415: O: O829 (predict-yes)
  2940. I see 1 and I'm going to do: predict-yes
  2941. ENV: Agent did: predict-yes for direction U in state State-A
  2942. In State-A moving U
  2943. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  2944. predict error 1
  2945. dir: dir isU
  2946. \-/416: O: O832 (predict-no)
  2947. I see 0 and I'm going to do: predict-no
  2948. ENV: Agent did: predict-no for direction U in state State-A
  2949. In State-A moving U
  2950. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2951. predict error 0
  2952. dir: dir isR
  2953. |417: O: O833 (predict-yes)
  2954. I see 1 and I'm going to do: predict-yes
  2955. ENV: Agent did: predict-yes for direction R in state State-A
  2956. In State-A moving R
  2957. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  2958. predict error 0
  2959. dir: dir isR
  2960. \-/418: O: O836 (predict-no)
  2961. I see 1 and I'm going to do: predict-no
  2962. ENV: Agent did: predict-no for direction R in state State-B
  2963. In State-B moving R
  2964. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2965. predict error 0
  2966. dir: dir isU
  2967. |\-419: O: O838 (predict-no)
  2968. I see 1 and I'm going to do: predict-no
  2969. ENV: Agent did: predict-no for direction U in state State-B
  2970. In State-B moving U
  2971. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  2972. predict error 0
  2973. dir: dir isL
  2974. /420: O: O839 (predict-yes)
  2975. I see 1 and I'm going to do: predict-yes
  2976. ENV: Agent did: predict-yes for direction L in state State-B
  2977. In State-B moving L
  2978. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  2979. predict error 0
  2980. dir: dir isL
  2981. |\-421: O: O842 (predict-no)
  2982. I see 1 and I'm going to do: predict-no
  2983. ENV: Agent did: predict-no for direction L in state State-A
  2984. In State-A moving L
  2985. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2986. predict error 0
  2987. dir: dir isL
  2988. /422: O: O844 (predict-no)
  2989. I see 1 and I'm going to do: predict-no
  2990. ENV: Agent did: predict-no for direction L in state State-A
  2991. In State-A moving L
  2992. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  2993. predict error 0
  2994. dir: dir isL
  2995. |\423: O: O846 (predict-no)
  2996. I see 1 and I'm going to do: predict-no
  2997. ENV: Agent did: predict-no for direction L in state State-A
  2998. In State-A moving L
  2999. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3000. predict error 0
  3001. dir: dir isU
  3002. -/424: O: O848 (predict-no)
  3003. I see 1 and I'm going to do: predict-no
  3004. ENV: Agent did: predict-no for direction U in state State-A
  3005. In State-A moving U
  3006. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3007. predict error 0
  3008. dir: dir isR
  3009. |\425: O: O849 (predict-yes)
  3010. I see 1 and I'm going to do: predict-yes
  3011. ENV: Agent did: predict-yes for direction R in state State-A
  3012. In State-A moving R
  3013. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3014. predict error 0
  3015. dir: dir isR
  3016. -/426: O: O852 (predict-no)
  3017. I see 1 and I'm going to do: predict-no
  3018. ENV: Agent did: predict-no for direction R in state State-B
  3019. In State-B moving R
  3020. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3021. predict error 0
  3022. dir: dir isU
  3023. |\427: O: O854 (predict-no)
  3024. I see 1 and I'm going to do: predict-no
  3025. ENV: Agent did: predict-no for direction U in state State-B
  3026. In State-B moving U
  3027. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3028. predict error 0
  3029. dir: dir isL
  3030. -/|428: O: O855 (predict-yes)
  3031. I see 1 and I'm going to do: predict-yes
  3032. ENV: Agent did: predict-yes for direction L in state State-B
  3033. In State-B moving L
  3034. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3035. predict error 0
  3036. dir: dir isU
  3037. \-429: O: O858 (predict-no)
  3038. I see 1 and I'm going to do: predict-no
  3039. ENV: Agent did: predict-no for direction U in state State-A
  3040. In State-A moving U
  3041. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3042. predict error 0
  3043. dir: dir isR
  3044. /|\430: O: O859 (predict-yes)
  3045. I see 1 and I'm going to do: predict-yes
  3046. ENV: Agent did: predict-yes for direction R in state State-A
  3047. In State-A moving R
  3048. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3049. predict error 0
  3050. dir: dir isR
  3051. -431: O: O862 (predict-no)
  3052. I see 1 and I'm going to do: predict-no
  3053. ENV: Agent did: predict-no for direction R in state State-B
  3054. In State-B moving R
  3055. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3056. predict error 0
  3057. dir: dir isU
  3058. /432: O: O864 (predict-no)
  3059. I see 1 and I'm going to do: predict-no
  3060. ENV: Agent did: predict-no for direction U in state State-B
  3061. In State-B moving U
  3062. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3063. predict error 0
  3064. dir: dir isR
  3065. |\-433: O: O866 (predict-no)
  3066. I see 1 and I'm going to do: predict-no
  3067. ENV: Agent did: predict-no for direction R in state State-B
  3068. In State-B moving R
  3069. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3070. predict error 0
  3071. dir: dir isU
  3072. /|434: O: O867 (predict-yes)
  3073. I see 1 and I'm going to do: predict-yes
  3074. ENV: Agent did: predict-yes for direction U in state State-B
  3075. In State-B moving U
  3076. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3077. predict error 1
  3078. dir: dir isU
  3079. \-/435: O: O870 (predict-no)
  3080. I see 0 and I'm going to do: predict-no
  3081. ENV: Agent did: predict-no for direction U in state State-B
  3082. In State-B moving U
  3083. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3084. predict error 0
  3085. dir: dir isR
  3086. |436: O: O872 (predict-no)
  3087. I see 1 and I'm going to do: predict-no
  3088. ENV: Agent did: predict-no for direction R in state State-B
  3089. In State-B moving R
  3090. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3091. predict error 0
  3092. dir: dir isU
  3093. \-/437: O: O873 (predict-yes)
  3094. I see 1 and I'm going to do: predict-yes
  3095. ENV: Agent did: predict-yes for direction U in state State-B
  3096. In State-B moving U
  3097. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3098. predict error 1
  3099. dir: dir isU
  3100. |\438: O: O876 (predict-no)
  3101. I see 0 and I'm going to do: predict-no
  3102. ENV: Agent did: predict-no for direction U in state State-B
  3103. In State-B moving U
  3104. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3105. predict error 0
  3106. dir: dir isU
  3107. -/|439: O: O878 (predict-no)
  3108. I see 1 and I'm going to do: predict-no
  3109. ENV: Agent did: predict-no for direction U in state State-B
  3110. In State-B moving U
  3111. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3112. predict error 0
  3113. dir: dir isU
  3114. \-440: O: O880 (predict-no)
  3115. I see 1 and I'm going to do: predict-no
  3116. ENV: Agent did: predict-no for direction U in state State-B
  3117. In State-B moving U
  3118. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3119. predict error 0
  3120. dir: dir isU
  3121. /|441: O: O882 (predict-no)
  3122. I see 1 and I'm going to do: predict-no
  3123. ENV: Agent did: predict-no for direction U in state State-B
  3124. In State-B moving U
  3125. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3126. predict error 0
  3127. dir: dir isU
  3128. \442: O: O884 (predict-no)
  3129. I see 1 and I'm going to do: predict-no
  3130. ENV: Agent did: predict-no for direction U in state State-B
  3131. In State-B moving U
  3132. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3133. predict error 0
  3134. dir: dir isU
  3135. -/443: O: O886 (predict-no)
  3136. I see 1 and I'm going to do: predict-no
  3137. ENV: Agent did: predict-no for direction U in state State-B
  3138. In State-B moving U
  3139. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3140. predict error 0
  3141. dir: dir isU
  3142. |\-444: O: O888 (predict-no)
  3143. I see 1 and I'm going to do: predict-no
  3144. ENV: Agent did: predict-no for direction U in state State-B
  3145. In State-B moving U
  3146. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3147. predict error 0
  3148. dir: dir isL
  3149. /|\445: O: O889 (predict-yes)
  3150. I see 1 and I'm going to do: predict-yes
  3151. ENV: Agent did: predict-yes for direction L in state State-B
  3152. In State-B moving L
  3153. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3154. predict error 0
  3155. dir: dir isR
  3156. -446: O: O891 (predict-yes)
  3157. I see 1 and I'm going to do: predict-yes
  3158. ENV: Agent did: predict-yes for direction R in state State-A
  3159. In State-A moving R
  3160. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3161. predict error 0
  3162. dir: dir isU
  3163. /|\-447: O: O894 (predict-no)
  3164. I see 1 and I'm going to do: predict-no
  3165. ENV: Agent did: predict-no for direction U in state State-B
  3166. In State-B moving U
  3167. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3168. predict error 0
  3169. dir: dir isR
  3170. /|\448: O: O896 (predict-no)
  3171. I see 1 and I'm going to do: predict-no
  3172. ENV: Agent did: predict-no for direction R in state State-B
  3173. In State-B moving R
  3174. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3175. predict error 0
  3176. dir: dir isR
  3177. -/|449: O: O898 (predict-no)
  3178. I see 1 and I'm going to do: predict-no
  3179. ENV: Agent did: predict-no for direction R in state State-B
  3180. In State-B moving R
  3181. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3182. predict error 0
  3183. dir: dir isL
  3184. \-/450: O: O899 (predict-yes)
  3185. I see 1 and I'm going to do: predict-yes
  3186. ENV: Agent did: predict-yes for direction L in state State-B
  3187. In State-B moving L
  3188. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3189. predict error 0
  3190. dir: dir isU
  3191. |\-451: O: O902 (predict-no)
  3192. I see 1 and I'm going to do: predict-no
  3193. ENV: Agent did: predict-no for direction U in state State-A
  3194. In State-A moving U
  3195. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3196. predict error 0
  3197. dir: dir isR
  3198. /452: O: O903 (predict-yes)
  3199. I see 1 and I'm going to do: predict-yes
  3200. ENV: Agent did: predict-yes for direction R in state State-A
  3201. In State-A moving R
  3202. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3203. predict error 0
  3204. dir: dir isR
  3205. |\-453: O: O906 (predict-no)
  3206. I see 1 and I'm going to do: predict-no
  3207. ENV: Agent did: predict-no for direction R in state State-B
  3208. In State-B moving R
  3209. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3210. predict error 0
  3211. dir: dir isU
  3212. /|454: O: O908 (predict-no)
  3213. I see 1 and I'm going to do: predict-no
  3214. ENV: Agent did: predict-no for direction U in state State-B
  3215. In State-B moving U
  3216. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3217. predict error 0
  3218. dir: dir isL
  3219. \455: O: O909 (predict-yes)
  3220. I see 1 and I'm going to do: predict-yes
  3221. ENV: Agent did: predict-yes for direction L in state State-B
  3222. In State-B moving L
  3223. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3224. predict error 0
  3225. dir: dir isU
  3226. -/456: O: O912 (predict-no)
  3227. I see 1 and I'm going to do: predict-no
  3228. ENV: Agent did: predict-no for direction U in state State-A
  3229. In State-A moving U
  3230. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3231. predict error 0
  3232. dir: dir isL
  3233. |457: O: O913 (predict-yes)
  3234. I see 1 and I'm going to do: predict-yes
  3235. ENV: Agent did: predict-yes for direction L in state State-A
  3236. In State-A moving L
  3237. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3238. predict error 1
  3239. dir: dir isL
  3240. \458: O: O916 (predict-no)
  3241. I see 0 and I'm going to do: predict-no
  3242. ENV: Agent did: predict-no for direction L in state State-A
  3243. In State-A moving L
  3244. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3245. predict error 0
  3246. dir: dir isR
  3247. -/|459: O: O917 (predict-yes)
  3248. I see 1 and I'm going to do: predict-yes
  3249. ENV: Agent did: predict-yes for direction R in state State-A
  3250. In State-A moving R
  3251. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3252. predict error 0
  3253. dir: dir isR
  3254. \-/460: O: O920 (predict-no)
  3255. I see 1 and I'm going to do: predict-no
  3256. ENV: Agent did: predict-no for direction R in state State-B
  3257. In State-B moving R
  3258. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3259. predict error 0
  3260. dir: dir isU
  3261. |\-461: O: O922 (predict-no)
  3262. I see 1 and I'm going to do: predict-no
  3263. ENV: Agent did: predict-no for direction U in state State-B
  3264. In State-B moving U
  3265. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3266. predict error 0
  3267. dir: dir isU
  3268. /462: O: O924 (predict-no)
  3269. I see 1 and I'm going to do: predict-no
  3270. ENV: Agent did: predict-no for direction U in state State-B
  3271. In State-B moving U
  3272. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3273. predict error 0
  3274. dir: dir isU
  3275. |463: O: O926 (predict-no)
  3276. I see 1 and I'm going to do: predict-no
  3277. ENV: Agent did: predict-no for direction U in state State-B
  3278. In State-B moving U
  3279. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3280. predict error 0
  3281. dir: dir isR
  3282. \-464: O: O928 (predict-no)
  3283. I see 1 and I'm going to do: predict-no
  3284. ENV: Agent did: predict-no for direction R in state State-B
  3285. In State-B moving R
  3286. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3287. predict error 0
  3288. dir: dir isR
  3289. /|\465: O: O930 (predict-no)
  3290. I see 1 and I'm going to do: predict-no
  3291. ENV: Agent did: predict-no for direction R in state State-B
  3292. In State-B moving R
  3293. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3294. predict error 0
  3295. dir: dir isL
  3296. -/466: O: O931 (predict-yes)
  3297. I see 1 and I'm going to do: predict-yes
  3298. ENV: Agent did: predict-yes for direction L in state State-B
  3299. In State-B moving L
  3300. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3301. predict error 0
  3302. dir: dir isU
  3303. |\-467: O: O934 (predict-no)
  3304. I see 1 and I'm going to do: predict-no
  3305. ENV: Agent did: predict-no for direction U in state State-A
  3306. In State-A moving U
  3307. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3308. predict error 0
  3309. dir: dir isR
  3310. /468: O: O935 (predict-yes)
  3311. I see 1 and I'm going to do: predict-yes
  3312. ENV: Agent did: predict-yes for direction R in state State-A
  3313. In State-A moving R
  3314. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3315. predict error 0
  3316. dir: dir isL
  3317. |\-469: O: O937 (predict-yes)
  3318. I see 1 and I'm going to do: predict-yes
  3319. ENV: Agent did: predict-yes for direction L in state State-B
  3320. In State-B moving L
  3321. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3322. predict error 0
  3323. dir: dir isR
  3324. /|\470: O: O939 (predict-yes)
  3325. I see 1 and I'm going to do: predict-yes
  3326. ENV: Agent did: predict-yes for direction R in state State-A
  3327. In State-A moving R
  3328. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3329. predict error 0
  3330. dir: dir isL
  3331. -/|471: O: O941 (predict-yes)
  3332. I see 1 and I'm going to do: predict-yes
  3333. ENV: Agent did: predict-yes for direction L in state State-B
  3334. In State-B moving L
  3335. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3336. predict error 0
  3337. dir: dir isL
  3338. \472: O: O943 (predict-yes)
  3339. I see 1 and I'm going to do: predict-yes
  3340. ENV: Agent did: predict-yes for direction L in state State-A
  3341. In State-A moving L
  3342. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3343. predict error 1
  3344. dir: dir isU
  3345. -/473: O: O946 (predict-no)
  3346. I see 0 and I'm going to do: predict-no
  3347. ENV: Agent did: predict-no for direction U in state State-A
  3348. In State-A moving U
  3349. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3350. predict error 0
  3351. dir: dir isL
  3352. |\474: O: O948 (predict-no)
  3353. I see 1 and I'm going to do: predict-no
  3354. ENV: Agent did: predict-no for direction L in state State-A
  3355. In State-A moving L
  3356. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3357. predict error 0
  3358. dir: dir isL
  3359. -/|475: O: O950 (predict-no)
  3360. I see 1 and I'm going to do: predict-no
  3361. ENV: Agent did: predict-no for direction L in state State-A
  3362. In State-A moving L
  3363. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3364. predict error 0
  3365. dir: dir isL
  3366. \-/476: O: O952 (predict-no)
  3367. I see 1 and I'm going to do: predict-no
  3368. ENV: Agent did: predict-no for direction L in state State-A
  3369. In State-A moving L
  3370. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3371. predict error 0
  3372. dir: dir isL
  3373. |\-477: O: O953 (predict-yes)
  3374. I see 1 and I'm going to do: predict-yes
  3375. ENV: Agent did: predict-yes for direction L in state State-A
  3376. In State-A moving L
  3377. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  3378. predict error 1
  3379. dir: dir isR
  3380. /|\478: O: O955 (predict-yes)
  3381. I see 0 and I'm going to do: predict-yes
  3382. ENV: Agent did: predict-yes for direction R in state State-A
  3383. In State-A moving R
  3384. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3385. predict error 0
  3386. dir: dir isL
  3387. -/|479: O: O957 (predict-yes)
  3388. I see 1 and I'm going to do: predict-yes
  3389. ENV: Agent did: predict-yes for direction L in state State-B
  3390. In State-B moving L
  3391. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3392. predict error 0
  3393. dir: dir isR
  3394. \-/|480: O: O959 (predict-yes)
  3395. I see 1 and I'm going to do: predict-yes
  3396. ENV: Agent did: predict-yes for direction R in state State-A
  3397. In State-A moving R
  3398. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3399. predict error 0
  3400. dir: dir isL
  3401. \-481: O: O962 (predict-no)
  3402. I see 1 and I'm going to do: predict-no
  3403. ENV: Agent did: predict-no for direction L in state State-B
  3404. In State-B moving L
  3405. ENV: (next state, see, prediction correct?) = (State-A, 1, False)
  3406. predict error 1
  3407. dir: dir isL
  3408. /482: O: O964 (predict-no)
  3409. I see 0 and I'm going to do: predict-no
  3410. ENV: Agent did: predict-no for direction L in state State-A
  3411. In State-A moving L
  3412. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3413. predict error 0
  3414. dir: dir isR
  3415. |\483: O: O965 (predict-yes)
  3416. I see 1 and I'm going to do: predict-yes
  3417. ENV: Agent did: predict-yes for direction R in state State-A
  3418. In State-A moving R
  3419. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3420. predict error 0
  3421. dir: dir isR
  3422. -/|484: O: O968 (predict-no)
  3423. I see 1 and I'm going to do: predict-no
  3424. ENV: Agent did: predict-no for direction R in state State-B
  3425. In State-B moving R
  3426. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3427. predict error 0
  3428. dir: dir isR
  3429. \-485: O: O970 (predict-no)
  3430. I see 1 and I'm going to do: predict-no
  3431. ENV: Agent did: predict-no for direction R in state State-B
  3432. In State-B moving R
  3433. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3434. predict error 0
  3435. dir: dir isU
  3436. /|\486: O: O972 (predict-no)
  3437. I see 1 and I'm going to do: predict-no
  3438. ENV: Agent did: predict-no for direction U in state State-B
  3439. In State-B moving U
  3440. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3441. predict error 0
  3442. dir: dir isL
  3443. -/|487: O: O973 (predict-yes)
  3444. I see 1 and I'm going to do: predict-yes
  3445. ENV: Agent did: predict-yes for direction L in state State-B
  3446. In State-B moving L
  3447. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3448. predict error 0
  3449. dir: dir isL
  3450. \-488: O: O976 (predict-no)
  3451. I see 1 and I'm going to do: predict-no
  3452. ENV: Agent did: predict-no for direction L in state State-A
  3453. In State-A moving L
  3454. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3455. predict error 0
  3456. dir: dir isU
  3457. /|\489: O: O978 (predict-no)
  3458. I see 1 and I'm going to do: predict-no
  3459. ENV: Agent did: predict-no for direction U in state State-A
  3460. In State-A moving U
  3461. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3462. predict error 0
  3463. dir: dir isR
  3464. -/490: O: O979 (predict-yes)
  3465. I see 1 and I'm going to do: predict-yes
  3466. ENV: Agent did: predict-yes for direction R in state State-A
  3467. In State-A moving R
  3468. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3469. predict error 0
  3470. dir: dir isU
  3471. |\491: O: O982 (predict-no)
  3472. I see 1 and I'm going to do: predict-no
  3473. ENV: Agent did: predict-no for direction U in state State-B
  3474. In State-B moving U
  3475. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3476. predict error 0
  3477. dir: dir isL
  3478. -492: O: O983 (predict-yes)
  3479. I see 1 and I'm going to do: predict-yes
  3480. ENV: Agent did: predict-yes for direction L in state State-B
  3481. In State-B moving L
  3482. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3483. predict error 0
  3484. dir: dir isR
  3485. /|\493: O: O985 (predict-yes)
  3486. I see 1 and I'm going to do: predict-yes
  3487. ENV: Agent did: predict-yes for direction R in state State-A
  3488. In State-A moving R
  3489. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3490. predict error 0
  3491. dir: dir isU
  3492. -/494: O: O988 (predict-no)
  3493. I see 1 and I'm going to do: predict-no
  3494. ENV: Agent did: predict-no for direction U in state State-B
  3495. In State-B moving U
  3496. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3497. predict error 0
  3498. dir: dir isR
  3499. |\-495: O: O990 (predict-no)
  3500. I see 1 and I'm going to do: predict-no
  3501. ENV: Agent did: predict-no for direction R in state State-B
  3502. In State-B moving R
  3503. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3504. predict error 0
  3505. dir: dir isU
  3506. /|\496: O: O992 (predict-no)
  3507. I see 1 and I'm going to do: predict-no
  3508. ENV: Agent did: predict-no for direction U in state State-B
  3509. In State-B moving U
  3510. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3511. predict error 0
  3512. dir: dir isR
  3513. -/|497: O: O994 (predict-no)
  3514. I see 1 and I'm going to do: predict-no
  3515. ENV: Agent did: predict-no for direction R in state State-B
  3516. In State-B moving R
  3517. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3518. predict error 0
  3519. dir: dir isR
  3520. \-/498: O: O996 (predict-no)
  3521. I see 1 and I'm going to do: predict-no
  3522. ENV: Agent did: predict-no for direction R in state State-B
  3523. In State-B moving R
  3524. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3525. predict error 0
  3526. dir: dir isU
  3527. |\499: O: O998 (predict-no)
  3528. I see 1 and I'm going to do: predict-no
  3529. ENV: Agent did: predict-no for direction U in state State-B
  3530. In State-B moving U
  3531. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3532. predict error 0
  3533. dir: dir isR
  3534. -/500: O: O1000 (predict-no)
  3535. I see 1 and I'm going to do: predict-no
  3536. ENV: Agent did: predict-no for direction R in state State-B
  3537. In State-B moving R
  3538. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3539. predict error 0
  3540. dir: dir isR
  3541. |\-501: O: O1002 (predict-no)
  3542. I see 1 and I'm going to do: predict-no
  3543. ENV: Agent did: predict-no for direction R in state State-B
  3544. In State-B moving R
  3545. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3546. predict error 0
  3547. dir: dir isR
  3548. /502: O: O1004 (predict-no)
  3549. I see 1 and I'm going to do: predict-no
  3550. ENV: Agent did: predict-no for direction R in state State-B
  3551. In State-B moving R
  3552. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3553. predict error 0
  3554. dir: dir isL
  3555. |\-503: O: O1005 (predict-yes)
  3556. I see 1 and I'm going to do: predict-yes
  3557. ENV: Agent did: predict-yes for direction L in state State-B
  3558. In State-B moving L
  3559. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3560. predict error 0
  3561. dir: dir isU
  3562. /504: O: O1008 (predict-no)
  3563. I see 1 and I'm going to do: predict-no
  3564. ENV: Agent did: predict-no for direction U in state State-A
  3565. In State-A moving U
  3566. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3567. predict error 0
  3568. dir: dir isR
  3569. |\-505: O: O1009 (predict-yes)
  3570. I see 1 and I'm going to do: predict-yes
  3571. ENV: Agent did: predict-yes for direction R in state State-A
  3572. In State-A moving R
  3573. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3574. predict error 0
  3575. dir: dir isR
  3576. /|506: O: O1012 (predict-no)
  3577. I see 1 and I'm going to do: predict-no
  3578. ENV: Agent did: predict-no for direction R in state State-B
  3579. In State-B moving R
  3580. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3581. predict error 0
  3582. dir: dir isR
  3583. \-/507: O: O1014 (predict-no)
  3584. I see 1 and I'm going to do: predict-no
  3585. ENV: Agent did: predict-no for direction R in state State-B
  3586. In State-B moving R
  3587. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3588. predict error 0
  3589. dir: dir isU
  3590. |\-/508: O: O1016 (predict-no)
  3591. I see 1 and I'm going to do: predict-no
  3592. ENV: Agent did: predict-no for direction U in state State-B
  3593. In State-B moving U
  3594. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3595. predict error 0
  3596. dir: dir isU
  3597. |\-509: O: O1018 (predict-no)
  3598. I see 1 and I'm going to do: predict-no
  3599. ENV: Agent did: predict-no for direction U in state State-B
  3600. In State-B moving U
  3601. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3602. predict error 0
  3603. dir: dir isU
  3604. /|\510: O: O1020 (predict-no)
  3605. I see 1 and I'm going to do: predict-no
  3606. ENV: Agent did: predict-no for direction U in state State-B
  3607. In State-B moving U
  3608. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3609. predict error 0
  3610. dir: dir isR
  3611. -/|511: O: O1022 (predict-no)
  3612. I see 1 and I'm going to do: predict-no
  3613. ENV: Agent did: predict-no for direction R in state State-B
  3614. In State-B moving R
  3615. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3616. predict error 0
  3617. dir: dir isR
  3618. \512: O: O1024 (predict-no)
  3619. I see 1 and I'm going to do: predict-no
  3620. ENV: Agent did: predict-no for direction R in state State-B
  3621. In State-B moving R
  3622. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3623. predict error 0
  3624. dir: dir isR
  3625. -/|\513: O: O1026 (predict-no)
  3626. I see 1 and I'm going to do: predict-no
  3627. ENV: Agent did: predict-no for direction R in state State-B
  3628. In State-B moving R
  3629. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3630. predict error 0
  3631. dir: dir isL
  3632. -/514: O: O1027 (predict-yes)
  3633. I see 1 and I'm going to do: predict-yes
  3634. ENV: Agent did: predict-yes for direction L in state State-B
  3635. In State-B moving L
  3636. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3637. predict error 0
  3638. dir: dir isL
  3639. |\-515: O: O1030 (predict-no)
  3640. I see 1 and I'm going to do: predict-no
  3641. ENV: Agent did: predict-no for direction L in state State-A
  3642. In State-A moving L
  3643. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3644. predict error 0
  3645. dir: dir isU
  3646. /|\516: O: O1032 (predict-no)
  3647. I see 1 and I'm going to do: predict-no
  3648. ENV: Agent did: predict-no for direction U in state State-A
  3649. In State-A moving U
  3650. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3651. predict error 0
  3652. dir: dir isL
  3653. -/|517: O: O1034 (predict-no)
  3654. I see 1 and I'm going to do: predict-no
  3655. ENV: Agent did: predict-no for direction L in state State-A
  3656. In State-A moving L
  3657. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3658. predict error 0
  3659. dir: dir isU
  3660. \-/518: O: O1036 (predict-no)
  3661. I see 1 and I'm going to do: predict-no
  3662. ENV: Agent did: predict-no for direction U in state State-A
  3663. In State-A moving U
  3664. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3665. predict error 0
  3666. dir: dir isU
  3667. |\-519: O: O1038 (predict-no)
  3668. I see 1 and I'm going to do: predict-no
  3669. ENV: Agent did: predict-no for direction U in state State-A
  3670. In State-A moving U
  3671. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3672. predict error 0
  3673. dir: dir isU
  3674. /520: O: O1040 (predict-no)
  3675. I see 1 and I'm going to do: predict-no
  3676. ENV: Agent did: predict-no for direction U in state State-A
  3677. In State-A moving U
  3678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3679. predict error 0
  3680. dir: dir isL
  3681. |\521: O: O1042 (predict-no)
  3682. I see 1 and I'm going to do: predict-no
  3683. ENV: Agent did: predict-no for direction L in state State-A
  3684. In State-A moving L
  3685. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3686. predict error 0
  3687. dir: dir isL
  3688. -522: O: O1044 (predict-no)
  3689. I see 1 and I'm going to do: predict-no
  3690. ENV: Agent did: predict-no for direction L in state State-A
  3691. In State-A moving L
  3692. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3693. predict error 0
  3694. dir: dir isU
  3695. /|523: O: O1046 (predict-no)
  3696. I see 1 and I'm going to do: predict-no
  3697. ENV: Agent did: predict-no for direction U in state State-A
  3698. In State-A moving U
  3699. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3700. predict error 0
  3701. dir: dir isL
  3702. \-524: O: O1048 (predict-no)
  3703. I see 1 and I'm going to do: predict-no
  3704. ENV: Agent did: predict-no for direction L in state State-A
  3705. In State-A moving L
  3706. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3707. predict error 0
  3708. dir: dir isL
  3709. /|525: O: O1050 (predict-no)
  3710. I see 1 and I'm going to do: predict-no
  3711. ENV: Agent did: predict-no for direction L in state State-A
  3712. In State-A moving L
  3713. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3714. predict error 0
  3715. dir: dir isR
  3716. \-526: O: O1052 (predict-no)
  3717. I see 1 and I'm going to do: predict-no
  3718. ENV: Agent did: predict-no for direction R in state State-A
  3719. In State-A moving R
  3720. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  3721. predict error 1
  3722. dir: dir isL
  3723. /|\527: O: O1053 (predict-yes)
  3724. I see 0 and I'm going to do: predict-yes
  3725. ENV: Agent did: predict-yes for direction L in state State-B
  3726. In State-B moving L
  3727. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3728. predict error 0
  3729. dir: dir isL
  3730. -/528: O: O1056 (predict-no)
  3731. I see 1 and I'm going to do: predict-no
  3732. ENV: Agent did: predict-no for direction L in state State-A
  3733. In State-A moving L
  3734. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3735. predict error 0
  3736. dir: dir isU
  3737. |\529: O: O1058 (predict-no)
  3738. I see 1 and I'm going to do: predict-no
  3739. ENV: Agent did: predict-no for direction U in state State-A
  3740. In State-A moving U
  3741. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3742. predict error 0
  3743. dir: dir isL
  3744. -/|530: O: O1060 (predict-no)
  3745. I see 1 and I'm going to do: predict-no
  3746. ENV: Agent did: predict-no for direction L in state State-A
  3747. In State-A moving L
  3748. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3749. predict error 0
  3750. dir: dir isU
  3751. \-/531: O: O1062 (predict-no)
  3752. I see 1 and I'm going to do: predict-no
  3753. ENV: Agent did: predict-no for direction U in state State-A
  3754. In State-A moving U
  3755. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3756. predict error 0
  3757. dir: dir isR
  3758. |532: O: O1063 (predict-yes)
  3759. I see 1 and I'm going to do: predict-yes
  3760. ENV: Agent did: predict-yes for direction R in state State-A
  3761. In State-A moving R
  3762. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3763. predict error 0
  3764. dir: dir isL
  3765. \-/533: O: O1065 (predict-yes)
  3766. I see 1 and I'm going to do: predict-yes
  3767. ENV: Agent did: predict-yes for direction L in state State-B
  3768. In State-B moving L
  3769. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3770. predict error 0
  3771. dir: dir isU
  3772. |\-534: O: O1068 (predict-no)
  3773. I see 1 and I'm going to do: predict-no
  3774. ENV: Agent did: predict-no for direction U in state State-A
  3775. In State-A moving U
  3776. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3777. predict error 0
  3778. dir: dir isL
  3779. /|\535: O: O1070 (predict-no)
  3780. I see 1 and I'm going to do: predict-no
  3781. ENV: Agent did: predict-no for direction L in state State-A
  3782. In State-A moving L
  3783. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3784. predict error 0
  3785. dir: dir isR
  3786. -/|536: O: O1071 (predict-yes)
  3787. I see 1 and I'm going to do: predict-yes
  3788. ENV: Agent did: predict-yes for direction R in state State-A
  3789. In State-A moving R
  3790. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3791. predict error 0
  3792. dir: dir isR
  3793. \-/537: O: O1074 (predict-no)
  3794. I see 1 and I'm going to do: predict-no
  3795. ENV: Agent did: predict-no for direction R in state State-B
  3796. In State-B moving R
  3797. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3798. predict error 0
  3799. dir: dir isL
  3800. |538: O: O1075 (predict-yes)
  3801. I see 1 and I'm going to do: predict-yes
  3802. ENV: Agent did: predict-yes for direction L in state State-B
  3803. In State-B moving L
  3804. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3805. predict error 0
  3806. dir: dir isR
  3807. \-539: O: O1077 (predict-yes)
  3808. I see 1 and I'm going to do: predict-yes
  3809. ENV: Agent did: predict-yes for direction R in state State-A
  3810. In State-A moving R
  3811. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3812. predict error 0
  3813. dir: dir isL
  3814. /|\540: O: O1079 (predict-yes)
  3815. I see 1 and I'm going to do: predict-yes
  3816. ENV: Agent did: predict-yes for direction L in state State-B
  3817. In State-B moving L
  3818. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3819. predict error 0
  3820. dir: dir isL
  3821. -/|541: O: O1082 (predict-no)
  3822. I see 1 and I'm going to do: predict-no
  3823. ENV: Agent did: predict-no for direction L in state State-A
  3824. In State-A moving L
  3825. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3826. predict error 0
  3827. dir: dir isU
  3828. \542: O: O1084 (predict-no)
  3829. I see 1 and I'm going to do: predict-no
  3830. ENV: Agent did: predict-no for direction U in state State-A
  3831. In State-A moving U
  3832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3833. predict error 0
  3834. dir: dir isU
  3835. -/|543: O: O1086 (predict-no)
  3836. I see 1 and I'm going to do: predict-no
  3837. ENV: Agent did: predict-no for direction U in state State-A
  3838. In State-A moving U
  3839. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3840. predict error 0
  3841. dir: dir isR
  3842. \-/544: O: O1087 (predict-yes)
  3843. I see 1 and I'm going to do: predict-yes
  3844. ENV: Agent did: predict-yes for direction R in state State-A
  3845. In State-A moving R
  3846. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3847. predict error 0
  3848. dir: dir isU
  3849. |\-545: O: O1090 (predict-no)
  3850. I see 1 and I'm going to do: predict-no
  3851. ENV: Agent did: predict-no for direction U in state State-B
  3852. In State-B moving U
  3853. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3854. predict error 0
  3855. dir: dir isU
  3856. /|\546: O: O1092 (predict-no)
  3857. I see 1 and I'm going to do: predict-no
  3858. ENV: Agent did: predict-no for direction U in state State-B
  3859. In State-B moving U
  3860. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3861. predict error 0
  3862. dir: dir isL
  3863. -547: O: O1093 (predict-yes)
  3864. I see 1 and I'm going to do: predict-yes
  3865. ENV: Agent did: predict-yes for direction L in state State-B
  3866. In State-B moving L
  3867. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3868. predict error 0
  3869. dir: dir isR
  3870. /|548: O: O1095 (predict-yes)
  3871. I see 1 and I'm going to do: predict-yes
  3872. ENV: Agent did: predict-yes for direction R in state State-A
  3873. In State-A moving R
  3874. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3875. predict error 0
  3876. dir: dir isL
  3877. \-/549: O: O1097 (predict-yes)
  3878. I see 1 and I'm going to do: predict-yes
  3879. ENV: Agent did: predict-yes for direction L in state State-B
  3880. In State-B moving L
  3881. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3882. predict error 0
  3883. dir: dir isL
  3884. |\-550: O: O1100 (predict-no)
  3885. I see 1 and I'm going to do: predict-no
  3886. ENV: Agent did: predict-no for direction L in state State-A
  3887. In State-A moving L
  3888. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3889. predict error 0
  3890. dir: dir isL
  3891. /|\551: O: O1102 (predict-no)
  3892. I see 1 and I'm going to do: predict-no
  3893. ENV: Agent did: predict-no for direction L in state State-A
  3894. In State-A moving L
  3895. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3896. predict error 0
  3897. dir: dir isL
  3898. -552: O: O1104 (predict-no)
  3899. I see 1 and I'm going to do: predict-no
  3900. ENV: Agent did: predict-no for direction L in state State-A
  3901. In State-A moving L
  3902. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3903. predict error 0
  3904. dir: dir isL
  3905. /|553: O: O1106 (predict-no)
  3906. I see 1 and I'm going to do: predict-no
  3907. ENV: Agent did: predict-no for direction L in state State-A
  3908. In State-A moving L
  3909. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3910. predict error 0
  3911. dir: dir isL
  3912. \-554: O: O1108 (predict-no)
  3913. I see 1 and I'm going to do: predict-no
  3914. ENV: Agent did: predict-no for direction L in state State-A
  3915. In State-A moving L
  3916. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3917. predict error 0
  3918. dir: dir isR
  3919. /|555: O: O1109 (predict-yes)
  3920. I see 1 and I'm going to do: predict-yes
  3921. ENV: Agent did: predict-yes for direction R in state State-A
  3922. In State-A moving R
  3923. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3924. predict error 0
  3925. dir: dir isR
  3926. \556: O: O1112 (predict-no)
  3927. I see 1 and I'm going to do: predict-no
  3928. ENV: Agent did: predict-no for direction R in state State-B
  3929. In State-B moving R
  3930. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3931. predict error 0
  3932. dir: dir isU
  3933. -557: O: O1114 (predict-no)
  3934. I see 1 and I'm going to do: predict-no
  3935. ENV: Agent did: predict-no for direction U in state State-B
  3936. In State-B moving U
  3937. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3938. predict error 0
  3939. dir: dir isU
  3940. /|558: O: O1116 (predict-no)
  3941. I see 1 and I'm going to do: predict-no
  3942. ENV: Agent did: predict-no for direction U in state State-B
  3943. In State-B moving U
  3944. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3945. predict error 0
  3946. dir: dir isU
  3947. \-/559: O: O1117 (predict-yes)
  3948. I see 1 and I'm going to do: predict-yes
  3949. ENV: Agent did: predict-yes for direction U in state State-B
  3950. In State-B moving U
  3951. ENV: (next state, see, prediction correct?) = (State-B, 0, False)
  3952. predict error 1
  3953. dir: dir isR
  3954. |\560: O: O1120 (predict-no)
  3955. I see 0 and I'm going to do: predict-no
  3956. ENV: Agent did: predict-no for direction R in state State-B
  3957. In State-B moving R
  3958. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3959. predict error 0
  3960. dir: dir isU
  3961. -/561: O: O1122 (predict-no)
  3962. I see 1 and I'm going to do: predict-no
  3963. ENV: Agent did: predict-no for direction U in state State-B
  3964. In State-B moving U
  3965. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  3966. predict error 0
  3967. dir: dir isL
  3968. |562: O: O1123 (predict-yes)
  3969. I see 1 and I'm going to do: predict-yes
  3970. ENV: Agent did: predict-yes for direction L in state State-B
  3971. In State-B moving L
  3972. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  3973. predict error 0
  3974. dir: dir isL
  3975. \-/|563: O: O1126 (predict-no)
  3976. I see 1 and I'm going to do: predict-no
  3977. ENV: Agent did: predict-no for direction L in state State-A
  3978. In State-A moving L
  3979. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3980. predict error 0
  3981. dir: dir isU
  3982. \-/564: O: O1128 (predict-no)
  3983. I see 1 and I'm going to do: predict-no
  3984. ENV: Agent did: predict-no for direction U in state State-A
  3985. In State-A moving U
  3986. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  3987. predict error 0
  3988. dir: dir isR
  3989. |565: O: O1129 (predict-yes)
  3990. I see 1 and I'm going to do: predict-yes
  3991. ENV: Agent did: predict-yes for direction R in state State-A
  3992. In State-A moving R
  3993. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  3994. predict error 0
  3995. dir: dir isL
  3996. \-566: O: O1131 (predict-yes)
  3997. I see 1 and I'm going to do: predict-yes
  3998. ENV: Agent did: predict-yes for direction L in state State-B
  3999. In State-B moving L
  4000. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4001. predict error 0
  4002. dir: dir isR
  4003. /|\567: O: O1133 (predict-yes)
  4004. I see 1 and I'm going to do: predict-yes
  4005. ENV: Agent did: predict-yes for direction R in state State-A
  4006. In State-A moving R
  4007. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4008. predict error 0
  4009. dir: dir isL
  4010. -/|568: O: O1135 (predict-yes)
  4011. I see 1 and I'm going to do: predict-yes
  4012. ENV: Agent did: predict-yes for direction L in state State-B
  4013. In State-B moving L
  4014. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4015. predict error 0
  4016. dir: dir isR
  4017. \569: O: O1137 (predict-yes)
  4018. I see 1 and I'm going to do: predict-yes
  4019. ENV: Agent did: predict-yes for direction R in state State-A
  4020. In State-A moving R
  4021. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4022. predict error 0
  4023. dir: dir isR
  4024. -/|570: O: O1140 (predict-no)
  4025. I see 1 and I'm going to do: predict-no
  4026. ENV: Agent did: predict-no for direction R in state State-B
  4027. In State-B moving R
  4028. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4029. predict error 0
  4030. dir: dir isR
  4031. \-/571: O: O1142 (predict-no)
  4032. I see 1 and I'm going to do: predict-no
  4033. ENV: Agent did: predict-no for direction R in state State-B
  4034. In State-B moving R
  4035. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4036. predict error 0
  4037. dir: dir isU
  4038. |572: O: O1144 (predict-no)
  4039. I see 1 and I'm going to do: predict-no
  4040. ENV: Agent did: predict-no for direction U in state State-B
  4041. In State-B moving U
  4042. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4043. predict error 0
  4044. dir: dir isU
  4045. \573: O: O1146 (predict-no)
  4046. I see 1 and I'm going to do: predict-no
  4047. ENV: Agent did: predict-no for direction U in state State-B
  4048. In State-B moving U
  4049. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4050. predict error 0
  4051. dir: dir isR
  4052. -/574: O: O1148 (predict-no)
  4053. I see 1 and I'm going to do: predict-no
  4054. ENV: Agent did: predict-no for direction R in state State-B
  4055. In State-B moving R
  4056. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4057. predict error 0
  4058. dir: dir isR
  4059. |\575: O: O1150 (predict-no)
  4060. I see 1 and I'm going to do: predict-no
  4061. ENV: Agent did: predict-no for direction R in state State-B
  4062. In State-B moving R
  4063. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4064. predict error 0
  4065. dir: dir isL
  4066. -576: O: O1151 (predict-yes)
  4067. I see 1 and I'm going to do: predict-yes
  4068. ENV: Agent did: predict-yes for direction L in state State-B
  4069. In State-B moving L
  4070. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4071. predict error 0
  4072. dir: dir isU
  4073. /|577: O: O1154 (predict-no)
  4074. I see 1 and I'm going to do: predict-no
  4075. ENV: Agent did: predict-no for direction U in state State-A
  4076. In State-A moving U
  4077. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4078. predict error 0
  4079. dir: dir isU
  4080. \-/578: O: O1156 (predict-no)
  4081. I see 1 and I'm going to do: predict-no
  4082. ENV: Agent did: predict-no for direction U in state State-A
  4083. In State-A moving U
  4084. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4085. predict error 0
  4086. dir: dir isL
  4087. |\579: O: O1158 (predict-no)
  4088. I see 1 and I'm going to do: predict-no
  4089. ENV: Agent did: predict-no for direction L in state State-A
  4090. In State-A moving L
  4091. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4092. predict error 0
  4093. dir: dir isU
  4094. -/|580: O: O1160 (predict-no)
  4095. I see 1 and I'm going to do: predict-no
  4096. ENV: Agent did: predict-no for direction U in state State-A
  4097. In State-A moving U
  4098. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4099. predict error 0
  4100. dir: dir isU
  4101. \581: O: O1162 (predict-no)
  4102. I see 1 and I'm going to do: predict-no
  4103. ENV: Agent did: predict-no for direction U in state State-A
  4104. In State-A moving U
  4105. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4106. predict error 0
  4107. dir: dir isR
  4108. -582: O: O1163 (predict-yes)
  4109. I see 1 and I'm going to do: predict-yes
  4110. ENV: Agent did: predict-yes for direction R in state State-A
  4111. In State-A moving R
  4112. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4113. predict error 0
  4114. dir: dir isL
  4115. /|\583: O: O1165 (predict-yes)
  4116. I see 1 and I'm going to do: predict-yes
  4117. ENV: Agent did: predict-yes for direction L in state State-B
  4118. In State-B moving L
  4119. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4120. predict error 0
  4121. dir: dir isR
  4122. -/|584: O: O1167 (predict-yes)
  4123. I see 1 and I'm going to do: predict-yes
  4124. ENV: Agent did: predict-yes for direction R in state State-A
  4125. In State-A moving R
  4126. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4127. predict error 0
  4128. dir: dir isL
  4129. \-585: O: O1169 (predict-yes)
  4130. I see 1 and I'm going to do: predict-yes
  4131. ENV: Agent did: predict-yes for direction L in state State-B
  4132. In State-B moving L
  4133. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4134. predict error 0
  4135. dir: dir isL
  4136. /|586: O: O1172 (predict-no)
  4137. I see 1 and I'm going to do: predict-no
  4138. ENV: Agent did: predict-no for direction L in state State-A
  4139. In State-A moving L
  4140. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4141. predict error 0
  4142. dir: dir isR
  4143. \-/587: O: O1173 (predict-yes)
  4144. I see 1 and I'm going to do: predict-yes
  4145. ENV: Agent did: predict-yes for direction R in state State-A
  4146. In State-A moving R
  4147. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4148. predict error 0
  4149. dir: dir isR
  4150. |\588: O: O1176 (predict-no)
  4151. I see 1 and I'm going to do: predict-no
  4152. ENV: Agent did: predict-no for direction R in state State-B
  4153. In State-B moving R
  4154. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4155. predict error 0
  4156. dir: dir isR
  4157. -/589: O: O1178 (predict-no)
  4158. I see 1 and I'm going to do: predict-no
  4159. ENV: Agent did: predict-no for direction R in state State-B
  4160. In State-B moving R
  4161. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4162. predict error 0
  4163. dir: dir isU
  4164. |\590: O: O1180 (predict-no)
  4165. I see 1 and I'm going to do: predict-no
  4166. ENV: Agent did: predict-no for direction U in state State-B
  4167. In State-B moving U
  4168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4169. predict error 0
  4170. dir: dir isU
  4171. -/|591: O: O1182 (predict-no)
  4172. I see 1 and I'm going to do: predict-no
  4173. ENV: Agent did: predict-no for direction U in state State-B
  4174. In State-B moving U
  4175. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4176. predict error 0
  4177. dir: dir isR
  4178. \592: O: O1184 (predict-no)
  4179. I see 1 and I'm going to do: predict-no
  4180. ENV: Agent did: predict-no for direction R in state State-B
  4181. In State-B moving R
  4182. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4183. predict error 0
  4184. dir: dir isL
  4185. -/|593: O: O1185 (predict-yes)
  4186. I see 1 and I'm going to do: predict-yes
  4187. ENV: Agent did: predict-yes for direction L in state State-B
  4188. In State-B moving L
  4189. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4190. predict error 0
  4191. dir: dir isU
  4192. \-/594: O: O1188 (predict-no)
  4193. I see 1 and I'm going to do: predict-no
  4194. ENV: Agent did: predict-no for direction U in state State-A
  4195. In State-A moving U
  4196. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4197. predict error 0
  4198. dir: dir isU
  4199. |\-595: O: O1190 (predict-no)
  4200. I see 1 and I'm going to do: predict-no
  4201. ENV: Agent did: predict-no for direction U in state State-A
  4202. In State-A moving U
  4203. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4204. predict error 0
  4205. dir: dir isU
  4206. /|\596: O: O1192 (predict-no)
  4207. I see 1 and I'm going to do: predict-no
  4208. ENV: Agent did: predict-no for direction U in state State-A
  4209. In State-A moving U
  4210. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4211. predict error 0
  4212. dir: dir isR
  4213. -/|597: O: O1193 (predict-yes)
  4214. I see 1 and I'm going to do: predict-yes
  4215. ENV: Agent did: predict-yes for direction R in state State-A
  4216. In State-A moving R
  4217. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4218. predict error 0
  4219. dir: dir isL
  4220. \-/598: O: O1195 (predict-yes)
  4221. I see 1 and I'm going to do: predict-yes
  4222. ENV: Agent did: predict-yes for direction L in state State-B
  4223. In State-B moving L
  4224. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4225. predict error 0
  4226. dir: dir isL
  4227. |\599: O: O1198 (predict-no)
  4228. I see 1 and I'm going to do: predict-no
  4229. ENV: Agent did: predict-no for direction L in state State-A
  4230. In State-A moving L
  4231. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4232. predict error 0
  4233. dir: dir isL
  4234. -/600: O: O1200 (predict-no)
  4235. I see 1 and I'm going to do: predict-no
  4236. ENV: Agent did: predict-no for direction L in state State-A
  4237. In State-A moving L
  4238. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4239. predict error 0
  4240. dir: dir isR
  4241. |\-601: O: O1201 (predict-yes)
  4242. I see 1 and I'm going to do: predict-yes
  4243. ENV: Agent did: predict-yes for direction R in state State-A
  4244. In State-A moving R
  4245. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4246. predict error 0
  4247. dir: dir isR
  4248. /602: O: O1204 (predict-no)
  4249. I see 1 and I'm going to do: predict-no
  4250. ENV: Agent did: predict-no for direction R in state State-B
  4251. In State-B moving R
  4252. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4253. predict error 0
  4254. dir: dir isL
  4255. |\-603: O: O1205 (predict-yes)
  4256. I see 1 and I'm going to do: predict-yes
  4257. ENV: Agent did: predict-yes for direction L in state State-B
  4258. In State-B moving L
  4259. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4260. predict error 0
  4261. dir: dir isU
  4262. /604: O: O1208 (predict-no)
  4263. I see 1 and I'm going to do: predict-no
  4264. ENV: Agent did: predict-no for direction U in state State-A
  4265. In State-A moving U
  4266. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4267. predict error 0
  4268. dir: dir isU
  4269. |605: O: O1209 (predict-yes)
  4270. I see 1 and I'm going to do: predict-yes
  4271. ENV: Agent did: predict-yes for direction U in state State-A
  4272. In State-A moving U
  4273. ENV: (next state, see, prediction correct?) = (State-A, 0, False)
  4274. predict error 1
  4275. dir: dir isU
  4276. \-606: O: O1212 (predict-no)
  4277. I see 0 and I'm going to do: predict-no
  4278. ENV: Agent did: predict-no for direction U in state State-A
  4279. In State-A moving U
  4280. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4281. predict error 0
  4282. dir: dir isR
  4283. /|\607: O: O1213 (predict-yes)
  4284. I see 1 and I'm going to do: predict-yes
  4285. ENV: Agent did: predict-yes for direction R in state State-A
  4286. In State-A moving R
  4287. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4288. predict error 0
  4289. dir: dir isL
  4290. -608: O: O1215 (predict-yes)
  4291. I see 1 and I'm going to do: predict-yes
  4292. ENV: Agent did: predict-yes for direction L in state State-B
  4293. In State-B moving L
  4294. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4295. predict error 0
  4296. dir: dir isL
  4297. /|609: O: O1218 (predict-no)
  4298. I see 1 and I'm going to do: predict-no
  4299. ENV: Agent did: predict-no for direction L in state State-A
  4300. In State-A moving L
  4301. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4302. predict error 0
  4303. dir: dir isU
  4304. \-/610: O: O1220 (predict-no)
  4305. I see 1 and I'm going to do: predict-no
  4306. ENV: Agent did: predict-no for direction U in state State-A
  4307. In State-A moving U
  4308. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4309. predict error 0
  4310. dir: dir isU
  4311. |\-611: O: O1222 (predict-no)
  4312. I see 1 and I'm going to do: predict-no
  4313. ENV: Agent did: predict-no for direction U in state State-A
  4314. In State-A moving U
  4315. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4316. predict error 0
  4317. dir: dir isL
  4318. /612: O: O1224 (predict-no)
  4319. I see 1 and I'm going to do: predict-no
  4320. ENV: Agent did: predict-no for direction L in state State-A
  4321. In State-A moving L
  4322. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4323. predict error 0
  4324. dir: dir isU
  4325. |\-613: O: O1226 (predict-no)
  4326. I see 1 and I'm going to do: predict-no
  4327. ENV: Agent did: predict-no for direction U in state State-A
  4328. In State-A moving U
  4329. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4330. predict error 0
  4331. dir: dir isR
  4332. /|614: O: O1227 (predict-yes)
  4333. I see 1 and I'm going to do: predict-yes
  4334. ENV: Agent did: predict-yes for direction R in state State-A
  4335. In State-A moving R
  4336. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4337. predict error 0
  4338. dir: dir isR
  4339. \-615: O: O1230 (predict-no)
  4340. I see 1 and I'm going to do: predict-no
  4341. ENV: Agent did: predict-no for direction R in state State-B
  4342. In State-B moving R
  4343. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4344. predict error 0
  4345. dir: dir isR
  4346. /|\616: O: O1232 (predict-no)
  4347. I see 1 and I'm going to do: predict-no
  4348. ENV: Agent did: predict-no for direction R in state State-B
  4349. In State-B moving R
  4350. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4351. predict error 0
  4352. dir: dir isU
  4353. -/|617: O: O1234 (predict-no)
  4354. I see 1 and I'm going to do: predict-no
  4355. ENV: Agent did: predict-no for direction U in state State-B
  4356. In State-B moving U
  4357. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4358. predict error 0
  4359. dir: dir isR
  4360. \618: O: O1236 (predict-no)
  4361. I see 1 and I'm going to do: predict-no
  4362. ENV: Agent did: predict-no for direction R in state State-B
  4363. In State-B moving R
  4364. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4365. predict error 0
  4366. dir: dir isR
  4367. -/|619: O: O1238 (predict-no)
  4368. I see 1 and I'm going to do: predict-no
  4369. ENV: Agent did: predict-no for direction R in state State-B
  4370. In State-B moving R
  4371. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4372. predict error 0
  4373. dir: dir isL
  4374. \-/620: O: O1239 (predict-yes)
  4375. I see 1 and I'm going to do: predict-yes
  4376. ENV: Agent did: predict-yes for direction L in state State-B
  4377. In State-B moving L
  4378. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4379. predict error 0
  4380. dir: dir isU
  4381. |\-621: O: O1242 (predict-no)
  4382. I see 1 and I'm going to do: predict-no
  4383. ENV: Agent did: predict-no for direction U in state State-A
  4384. In State-A moving U
  4385. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4386. predict error 0
  4387. dir: dir isL
  4388. /622: O: O1244 (predict-no)
  4389. I see 1 and I'm going to do: predict-no
  4390. ENV: Agent did: predict-no for direction L in state State-A
  4391. In State-A moving L
  4392. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4393. predict error 0
  4394. dir: dir isU
  4395. |\-623: O: O1246 (predict-no)
  4396. I see 1 and I'm going to do: predict-no
  4397. ENV: Agent did: predict-no for direction U in state State-A
  4398. In State-A moving U
  4399. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4400. predict error 0
  4401. dir: dir isL
  4402. /|624: O: O1248 (predict-no)
  4403. I see 1 and I'm going to do: predict-no
  4404. ENV: Agent did: predict-no for direction L in state State-A
  4405. In State-A moving L
  4406. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4407. predict error 0
  4408. dir: dir isL
  4409. \-/625: O: O1250 (predict-no)
  4410. I see 1 and I'm going to do: predict-no
  4411. ENV: Agent did: predict-no for direction L in state State-A
  4412. In State-A moving L
  4413. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4414. predict error 0
  4415. dir: dir isR
  4416. |\-/sleeping...
  4417. |626: O: O1251 (predict-yes)
  4418. I see 1 and I'm going to do: predict-yes
  4419. ENV: Agent did: predict-yes for direction R in state State-A
  4420. In State-A moving R
  4421. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4422. predict error 0
  4423. dir: dir isR
  4424. \-627: O: O1254 (predict-no)
  4425. I see 1 and I'm going to do: predict-no
  4426. ENV: Agent did: predict-no for direction R in state State-B
  4427. In State-B moving R
  4428. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4429. predict error 0
  4430. dir: dir isR
  4431. /|\628: O: O1256 (predict-no)
  4432. I see 1 and I'm going to do: predict-no
  4433. ENV: Agent did: predict-no for direction R in state State-B
  4434. In State-B moving R
  4435. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4436. predict error 0
  4437. dir: dir isU
  4438. -/629: O: O1258 (predict-no)
  4439. I see 1 and I'm going to do: predict-no
  4440. ENV: Agent did: predict-no for direction U in state State-B
  4441. In State-B moving U
  4442. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4443. predict error 0
  4444. dir: dir isL
  4445. |630: O: O1259 (predict-yes)
  4446. I see 1 and I'm going to do: predict-yes
  4447. ENV: Agent did: predict-yes for direction L in state State-B
  4448. In State-B moving L
  4449. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4450. predict error 0
  4451. dir: dir isU
  4452. \631: O: O1262 (predict-no)
  4453. I see 1 and I'm going to do: predict-no
  4454. ENV: Agent did: predict-no for direction U in state State-A
  4455. In State-A moving U
  4456. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4457. predict error 0
  4458. dir: dir isU
  4459. -632: O: O1264 (predict-no)
  4460. I see 1 and I'm going to do: predict-no
  4461. ENV: Agent did: predict-no for direction U in state State-A
  4462. In State-A moving U
  4463. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4464. predict error 0
  4465. dir: dir isU
  4466. /|633: O: O1266 (predict-no)
  4467. I see 1 and I'm going to do: predict-no
  4468. ENV: Agent did: predict-no for direction U in state State-A
  4469. In State-A moving U
  4470. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4471. predict error 0
  4472. dir: dir isR
  4473. \-634: O: O1267 (predict-yes)
  4474. I see 1 and I'm going to do: predict-yes
  4475. ENV: Agent did: predict-yes for direction R in state State-A
  4476. In State-A moving R
  4477. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4478. predict error 0
  4479. dir: dir isR
  4480. /|\635: O: O1270 (predict-no)
  4481. I see 1 and I'm going to do: predict-no
  4482. ENV: Agent did: predict-no for direction R in state State-B
  4483. In State-B moving R
  4484. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4485. predict error 0
  4486. dir: dir isR
  4487. -636: O: O1272 (predict-no)
  4488. I see 1 and I'm going to do: predict-no
  4489. ENV: Agent did: predict-no for direction R in state State-B
  4490. In State-B moving R
  4491. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4492. predict error 0
  4493. dir: dir isR
  4494. /637: O: O1274 (predict-no)
  4495. I see 1 and I'm going to do: predict-no
  4496. ENV: Agent did: predict-no for direction R in state State-B
  4497. In State-B moving R
  4498. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4499. predict error 0
  4500. dir: dir isL
  4501. |\-638: O: O1275 (predict-yes)
  4502. I see 1 and I'm going to do: predict-yes
  4503. ENV: Agent did: predict-yes for direction L in state State-B
  4504. In State-B moving L
  4505. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4506. predict error 0
  4507. dir: dir isL
  4508. /|639: O: O1278 (predict-no)
  4509. I see 1 and I'm going to do: predict-no
  4510. ENV: Agent did: predict-no for direction L in state State-A
  4511. In State-A moving L
  4512. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4513. predict error 0
  4514. dir: dir isR
  4515. \-/640: O: O1279 (predict-yes)
  4516. I see 1 and I'm going to do: predict-yes
  4517. ENV: Agent did: predict-yes for direction R in state State-A
  4518. In State-A moving R
  4519. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4520. predict error 0
  4521. dir: dir isL
  4522. |\641: O: O1281 (predict-yes)
  4523. I see 1 and I'm going to do: predict-yes
  4524. ENV: Agent did: predict-yes for direction L in state State-B
  4525. In State-B moving L
  4526. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4527. predict error 0
  4528. dir: dir isR
  4529. -642: O: O1283 (predict-yes)
  4530. I see 1 and I'm going to do: predict-yes
  4531. ENV: Agent did: predict-yes for direction R in state State-A
  4532. In State-A moving R
  4533. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4534. predict error 0
  4535. dir: dir isR
  4536. /643: O: O1286 (predict-no)
  4537. I see 1 and I'm going to do: predict-no
  4538. ENV: Agent did: predict-no for direction R in state State-B
  4539. In State-B moving R
  4540. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4541. predict error 0
  4542. dir: dir isL
  4543. |\644: O: O1287 (predict-yes)
  4544. I see 1 and I'm going to do: predict-yes
  4545. ENV: Agent did: predict-yes for direction L in state State-B
  4546. In State-B moving L
  4547. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4548. predict error 0
  4549. dir: dir isL
  4550. -/|645: O: O1290 (predict-no)
  4551. I see 1 and I'm going to do: predict-no
  4552. ENV: Agent did: predict-no for direction L in state State-A
  4553. In State-A moving L
  4554. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4555. predict error 0
  4556. dir: dir isR
  4557. \646: O: O1291 (predict-yes)
  4558. I see 1 and I'm going to do: predict-yes
  4559. ENV: Agent did: predict-yes for direction R in state State-A
  4560. In State-A moving R
  4561. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4562. predict error 0
  4563. dir: dir isU
  4564. -/|647: O: O1294 (predict-no)
  4565. I see 1 and I'm going to do: predict-no
  4566. ENV: Agent did: predict-no for direction U in state State-B
  4567. In State-B moving U
  4568. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4569. predict error 0
  4570. dir: dir isL
  4571. \-648: O: O1295 (predict-yes)
  4572. I see 1 and I'm going to do: predict-yes
  4573. ENV: Agent did: predict-yes for direction L in state State-B
  4574. In State-B moving L
  4575. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4576. predict error 0
  4577. dir: dir isR
  4578. /|\649: O: O1297 (predict-yes)
  4579. I see 1 and I'm going to do: predict-yes
  4580. ENV: Agent did: predict-yes for direction R in state State-A
  4581. In State-A moving R
  4582. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4583. predict error 0
  4584. dir: dir isR
  4585. -/|650: O: O1300 (predict-no)
  4586. I see 1 and I'm going to do: predict-no
  4587. ENV: Agent did: predict-no for direction R in state State-B
  4588. In State-B moving R
  4589. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4590. predict error 0
  4591. dir: dir isU
  4592. \-/651: O: O1302 (predict-no)
  4593. I see 1 and I'm going to do: predict-no
  4594. ENV: Agent did: predict-no for direction U in state State-B
  4595. In State-B moving U
  4596. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4597. predict error 0
  4598. dir: dir isU
  4599. |652: O: O1304 (predict-no)
  4600. I see 1 and I'm going to do: predict-no
  4601. ENV: Agent did: predict-no for direction U in state State-B
  4602. In State-B moving U
  4603. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4604. predict error 0
  4605. dir: dir isL
  4606. \-/653: O: O1305 (predict-yes)
  4607. I see 1 and I'm going to do: predict-yes
  4608. ENV: Agent did: predict-yes for direction L in state State-B
  4609. In State-B moving L
  4610. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4611. predict error 0
  4612. dir: dir isL
  4613. |\654: O: O1308 (predict-no)
  4614. I see 1 and I'm going to do: predict-no
  4615. ENV: Agent did: predict-no for direction L in state State-A
  4616. In State-A moving L
  4617. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4618. predict error 0
  4619. dir: dir isU
  4620. -/655: O: O1310 (predict-no)
  4621. I see 1 and I'm going to do: predict-no
  4622. ENV: Agent did: predict-no for direction U in state State-A
  4623. In State-A moving U
  4624. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4625. predict error 0
  4626. dir: dir isL
  4627. |\-656: O: O1312 (predict-no)
  4628. I see 1 and I'm going to do: predict-no
  4629. ENV: Agent did: predict-no for direction L in state State-A
  4630. In State-A moving L
  4631. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4632. predict error 0
  4633. dir: dir isL
  4634. /|657: O: O1314 (predict-no)
  4635. I see 1 and I'm going to do: predict-no
  4636. ENV: Agent did: predict-no for direction L in state State-A
  4637. In State-A moving L
  4638. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4639. predict error 0
  4640. dir: dir isU
  4641. \658: O: O1316 (predict-no)
  4642. I see 1 and I'm going to do: predict-no
  4643. ENV: Agent did: predict-no for direction U in state State-A
  4644. In State-A moving U
  4645. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4646. predict error 0
  4647. dir: dir isL
  4648. -659: O: O1318 (predict-no)
  4649. I see 1 and I'm going to do: predict-no
  4650. ENV: Agent did: predict-no for direction L in state State-A
  4651. In State-A moving L
  4652. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4653. predict error 0
  4654. dir: dir isL
  4655. /|\660: O: O1320 (predict-no)
  4656. I see 1 and I'm going to do: predict-no
  4657. ENV: Agent did: predict-no for direction L in state State-A
  4658. In State-A moving L
  4659. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4660. predict error 0
  4661. dir: dir isL
  4662. -/|661: O: O1322 (predict-no)
  4663. I see 1 and I'm going to do: predict-no
  4664. ENV: Agent did: predict-no for direction L in state State-A
  4665. In State-A moving L
  4666. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4667. predict error 0
  4668. dir: dir isL
  4669. \662: O: O1324 (predict-no)
  4670. I see 1 and I'm going to do: predict-no
  4671. ENV: Agent did: predict-no for direction L in state State-A
  4672. In State-A moving L
  4673. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4674. predict error 0
  4675. dir: dir isU
  4676. -/|663: O: O1326 (predict-no)
  4677. I see 1 and I'm going to do: predict-no
  4678. ENV: Agent did: predict-no for direction U in state State-A
  4679. In State-A moving U
  4680. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4681. predict error 0
  4682. dir: dir isR
  4683. \-664: O: O1327 (predict-yes)
  4684. I see 1 and I'm going to do: predict-yes
  4685. ENV: Agent did: predict-yes for direction R in state State-A
  4686. In State-A moving R
  4687. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4688. predict error 0
  4689. dir: dir isR
  4690. /|665: O: O1330 (predict-no)
  4691. I see 1 and I'm going to do: predict-no
  4692. ENV: Agent did: predict-no for direction R in state State-B
  4693. In State-B moving R
  4694. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4695. predict error 0
  4696. dir: dir isR
  4697. \-/666: O: O1332 (predict-no)
  4698. I see 1 and I'm going to do: predict-no
  4699. ENV: Agent did: predict-no for direction R in state State-B
  4700. In State-B moving R
  4701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4702. predict error 0
  4703. dir: dir isU
  4704. |\-667: O: O1334 (predict-no)
  4705. I see 1 and I'm going to do: predict-no
  4706. ENV: Agent did: predict-no for direction U in state State-B
  4707. In State-B moving U
  4708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4709. predict error 0
  4710. dir: dir isL
  4711. /|668: O: O1335 (predict-yes)
  4712. I see 1 and I'm going to do: predict-yes
  4713. ENV: Agent did: predict-yes for direction L in state State-B
  4714. In State-B moving L
  4715. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4716. predict error 0
  4717. dir: dir isR
  4718. \-/669: O: O1337 (predict-yes)
  4719. I see 1 and I'm going to do: predict-yes
  4720. ENV: Agent did: predict-yes for direction R in state State-A
  4721. In State-A moving R
  4722. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4723. predict error 0
  4724. dir: dir isL
  4725. |\670: O: O1339 (predict-yes)
  4726. I see 1 and I'm going to do: predict-yes
  4727. ENV: Agent did: predict-yes for direction L in state State-B
  4728. In State-B moving L
  4729. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4730. predict error 0
  4731. dir: dir isL
  4732. -/|671: O: O1342 (predict-no)
  4733. I see 1 and I'm going to do: predict-no
  4734. ENV: Agent did: predict-no for direction L in state State-A
  4735. In State-A moving L
  4736. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4737. predict error 0
  4738. dir: dir isR
  4739. \672: O: O1343 (predict-yes)
  4740. I see 1 and I'm going to do: predict-yes
  4741. ENV: Agent did: predict-yes for direction R in state State-A
  4742. In State-A moving R
  4743. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4744. predict error 0
  4745. dir: dir isR
  4746. -/|673: O: O1346 (predict-no)
  4747. I see 1 and I'm going to do: predict-no
  4748. ENV: Agent did: predict-no for direction R in state State-B
  4749. In State-B moving R
  4750. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4751. predict error 0
  4752. dir: dir isL
  4753. \-674: O: O1347 (predict-yes)
  4754. I see 1 and I'm going to do: predict-yes
  4755. ENV: Agent did: predict-yes for direction L in state State-B
  4756. In State-B moving L
  4757. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4758. predict error 0
  4759. dir: dir isR
  4760. /|675: O: O1349 (predict-yes)
  4761. I see 1 and I'm going to do: predict-yes
  4762. ENV: Agent did: predict-yes for direction R in state State-A
  4763. In State-A moving R
  4764. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4765. predict error 0
  4766. dir: dir isU
  4767. \-/676: O: O1352 (predict-no)
  4768. I see 1 and I'm going to do: predict-no
  4769. ENV: Agent did: predict-no for direction U in state State-B
  4770. In State-B moving U
  4771. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4772. predict error 0
  4773. dir: dir isR
  4774. |\-677: O: O1354 (predict-no)
  4775. I see 1 and I'm going to do: predict-no
  4776. ENV: Agent did: predict-no for direction R in state State-B
  4777. In State-B moving R
  4778. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4779. predict error 0
  4780. dir: dir isR
  4781. /|678: O: O1356 (predict-no)
  4782. I see 1 and I'm going to do: predict-no
  4783. ENV: Agent did: predict-no for direction R in state State-B
  4784. In State-B moving R
  4785. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4786. predict error 0
  4787. dir: dir isU
  4788. \-/679: O: O1358 (predict-no)
  4789. I see 1 and I'm going to do: predict-no
  4790. ENV: Agent did: predict-no for direction U in state State-B
  4791. In State-B moving U
  4792. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4793. predict error 0
  4794. dir: dir isU
  4795. |\-680: O: O1360 (predict-no)
  4796. I see 1 and I'm going to do: predict-no
  4797. ENV: Agent did: predict-no for direction U in state State-B
  4798. In State-B moving U
  4799. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4800. predict error 0
  4801. dir: dir isL
  4802. /681: O: O1361 (predict-yes)
  4803. I see 1 and I'm going to do: predict-yes
  4804. ENV: Agent did: predict-yes for direction L in state State-B
  4805. In State-B moving L
  4806. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4807. predict error 0
  4808. dir: dir isL
  4809. |682: O: O1364 (predict-no)
  4810. I see 1 and I'm going to do: predict-no
  4811. ENV: Agent did: predict-no for direction L in state State-A
  4812. In State-A moving L
  4813. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4814. predict error 0
  4815. dir: dir isU
  4816. \-683: O: O1366 (predict-no)
  4817. I see 1 and I'm going to do: predict-no
  4818. ENV: Agent did: predict-no for direction U in state State-A
  4819. In State-A moving U
  4820. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4821. predict error 0
  4822. dir: dir isU
  4823. /|684: O: O1368 (predict-no)
  4824. I see 1 and I'm going to do: predict-no
  4825. ENV: Agent did: predict-no for direction U in state State-A
  4826. In State-A moving U
  4827. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4828. predict error 0
  4829. dir: dir isL
  4830. \-/685: O: O1370 (predict-no)
  4831. I see 1 and I'm going to do: predict-no
  4832. ENV: Agent did: predict-no for direction L in state State-A
  4833. In State-A moving L
  4834. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4835. predict error 0
  4836. dir: dir isU
  4837. |\686: O: O1372 (predict-no)
  4838. I see 1 and I'm going to do: predict-no
  4839. ENV: Agent did: predict-no for direction U in state State-A
  4840. In State-A moving U
  4841. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4842. predict error 0
  4843. dir: dir isU
  4844. -/687: O: O1374 (predict-no)
  4845. I see 1 and I'm going to do: predict-no
  4846. ENV: Agent did: predict-no for direction U in state State-A
  4847. In State-A moving U
  4848. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4849. predict error 0
  4850. dir: dir isR
  4851. |\-688: O: O1375 (predict-yes)
  4852. I see 1 and I'm going to do: predict-yes
  4853. ENV: Agent did: predict-yes for direction R in state State-A
  4854. In State-A moving R
  4855. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4856. predict error 0
  4857. dir: dir isU
  4858. /|\689: O: O1378 (predict-no)
  4859. I see 1 and I'm going to do: predict-no
  4860. ENV: Agent did: predict-no for direction U in state State-B
  4861. In State-B moving U
  4862. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4863. predict error 0
  4864. dir: dir isR
  4865. -/|690: O: O1380 (predict-no)
  4866. I see 1 and I'm going to do: predict-no
  4867. ENV: Agent did: predict-no for direction R in state State-B
  4868. In State-B moving R
  4869. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4870. predict error 0
  4871. dir: dir isL
  4872. \-/691: O: O1381 (predict-yes)
  4873. I see 1 and I'm going to do: predict-yes
  4874. ENV: Agent did: predict-yes for direction L in state State-B
  4875. In State-B moving L
  4876. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4877. predict error 0
  4878. dir: dir isL
  4879. |692: O: O1384 (predict-no)
  4880. I see 1 and I'm going to do: predict-no
  4881. ENV: Agent did: predict-no for direction L in state State-A
  4882. In State-A moving L
  4883. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4884. predict error 0
  4885. dir: dir isL
  4886. \-/693: O: O1386 (predict-no)
  4887. I see 1 and I'm going to do: predict-no
  4888. ENV: Agent did: predict-no for direction L in state State-A
  4889. In State-A moving L
  4890. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4891. predict error 0
  4892. dir: dir isR
  4893. |\-694: O: O1387 (predict-yes)
  4894. I see 1 and I'm going to do: predict-yes
  4895. ENV: Agent did: predict-yes for direction R in state State-A
  4896. In State-A moving R
  4897. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4898. predict error 0
  4899. dir: dir isL
  4900. /|695: O: O1389 (predict-yes)
  4901. I see 1 and I'm going to do: predict-yes
  4902. ENV: Agent did: predict-yes for direction L in state State-B
  4903. In State-B moving L
  4904. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4905. predict error 0
  4906. dir: dir isR
  4907. \-696: O: O1391 (predict-yes)
  4908. I see 1 and I'm going to do: predict-yes
  4909. ENV: Agent did: predict-yes for direction R in state State-A
  4910. In State-A moving R
  4911. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4912. predict error 0
  4913. dir: dir isR
  4914. /|\697: O: O1394 (predict-no)
  4915. I see 1 and I'm going to do: predict-no
  4916. ENV: Agent did: predict-no for direction R in state State-B
  4917. In State-B moving R
  4918. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4919. predict error 0
  4920. dir: dir isR
  4921. -/698: O: O1396 (predict-no)
  4922. I see 1 and I'm going to do: predict-no
  4923. ENV: Agent did: predict-no for direction R in state State-B
  4924. In State-B moving R
  4925. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4926. predict error 0
  4927. dir: dir isU
  4928. |\-699: O: O1398 (predict-no)
  4929. I see 1 and I'm going to do: predict-no
  4930. ENV: Agent did: predict-no for direction U in state State-B
  4931. In State-B moving U
  4932. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4933. predict error 0
  4934. dir: dir isR
  4935. /|\700: O: O1400 (predict-no)
  4936. I see 1 and I'm going to do: predict-no
  4937. ENV: Agent did: predict-no for direction R in state State-B
  4938. In State-B moving R
  4939. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4940. predict error 0
  4941. dir: dir isR
  4942. -/|701: O: O1402 (predict-no)
  4943. I see 1 and I'm going to do: predict-no
  4944. ENV: Agent did: predict-no for direction R in state State-B
  4945. In State-B moving R
  4946. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4947. predict error 0
  4948. dir: dir isR
  4949. \702: O: O1404 (predict-no)
  4950. I see 1 and I'm going to do: predict-no
  4951. ENV: Agent did: predict-no for direction R in state State-B
  4952. In State-B moving R
  4953. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4954. predict error 0
  4955. dir: dir isL
  4956. -/703: O: O1405 (predict-yes)
  4957. I see 1 and I'm going to do: predict-yes
  4958. ENV: Agent did: predict-yes for direction L in state State-B
  4959. In State-B moving L
  4960. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4961. predict error 0
  4962. dir: dir isL
  4963. |\704: O: O1408 (predict-no)
  4964. I see 1 and I'm going to do: predict-no
  4965. ENV: Agent did: predict-no for direction L in state State-A
  4966. In State-A moving L
  4967. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4968. predict error 0
  4969. dir: dir isR
  4970. -/705: O: O1409 (predict-yes)
  4971. I see 1 and I'm going to do: predict-yes
  4972. ENV: Agent did: predict-yes for direction R in state State-A
  4973. In State-A moving R
  4974. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  4975. predict error 0
  4976. dir: dir isU
  4977. |\-706: O: O1412 (predict-no)
  4978. I see 1 and I'm going to do: predict-no
  4979. ENV: Agent did: predict-no for direction U in state State-B
  4980. In State-B moving U
  4981. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  4982. predict error 0
  4983. dir: dir isL
  4984. /|\707: O: O1413 (predict-yes)
  4985. I see 1 and I'm going to do: predict-yes
  4986. ENV: Agent did: predict-yes for direction L in state State-B
  4987. In State-B moving L
  4988. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  4989. predict error 0
  4990. dir: dir isU
  4991. -/708: O: O1416 (predict-no)
  4992. I see 1 and I'm going to do: predict-no
  4993. ENV: Agent did: predict-no for direction U in state State-A
  4994. In State-A moving U
  4995. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  4996. predict error 0
  4997. dir: dir isU
  4998. |\-709: O: O1418 (predict-no)
  4999. I see 1 and I'm going to do: predict-no
  5000. ENV: Agent did: predict-no for direction U in state State-A
  5001. In State-A moving U
  5002. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5003. predict error 0
  5004. dir: dir isL
  5005. /|\710: O: O1420 (predict-no)
  5006. I see 1 and I'm going to do: predict-no
  5007. ENV: Agent did: predict-no for direction L in state State-A
  5008. In State-A moving L
  5009. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5010. predict error 0
  5011. dir: dir isU
  5012. -/|711: O: O1422 (predict-no)
  5013. I see 1 and I'm going to do: predict-no
  5014. ENV: Agent did: predict-no for direction U in state State-A
  5015. In State-A moving U
  5016. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5017. predict error 0
  5018. dir: dir isR
  5019. \712: O: O1423 (predict-yes)
  5020. I see 1 and I'm going to do: predict-yes
  5021. ENV: Agent did: predict-yes for direction R in state State-A
  5022. In State-A moving R
  5023. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5024. predict error 0
  5025. dir: dir isR
  5026. -/|713: O: O1426 (predict-no)
  5027. I see 1 and I'm going to do: predict-no
  5028. ENV: Agent did: predict-no for direction R in state State-B
  5029. In State-B moving R
  5030. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5031. predict error 0
  5032. dir: dir isL
  5033. \-/714: O: O1427 (predict-yes)
  5034. I see 1 and I'm going to do: predict-yes
  5035. ENV: Agent did: predict-yes for direction L in state State-B
  5036. In State-B moving L
  5037. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5038. predict error 0
  5039. dir: dir isR
  5040. |\-715: O: O1429 (predict-yes)
  5041. I see 1 and I'm going to do: predict-yes
  5042. ENV: Agent did: predict-yes for direction R in state State-A
  5043. In State-A moving R
  5044. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5045. predict error 0
  5046. dir: dir isR
  5047. /|716: O: O1432 (predict-no)
  5048. I see 1 and I'm going to do: predict-no
  5049. ENV: Agent did: predict-no for direction R in state State-B
  5050. In State-B moving R
  5051. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5052. predict error 0
  5053. dir: dir isU
  5054. \-/717: O: O1434 (predict-no)
  5055. I see 1 and I'm going to do: predict-no
  5056. ENV: Agent did: predict-no for direction U in state State-B
  5057. In State-B moving U
  5058. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5059. predict error 0
  5060. dir: dir isR
  5061. |\-718: O: O1436 (predict-no)
  5062. I see 1 and I'm going to do: predict-no
  5063. ENV: Agent did: predict-no for direction R in state State-B
  5064. In State-B moving R
  5065. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5066. predict error 0
  5067. dir: dir isU
  5068. /719: O: O1438 (predict-no)
  5069. I see 1 and I'm going to do: predict-no
  5070. ENV: Agent did: predict-no for direction U in state State-B
  5071. In State-B moving U
  5072. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5073. predict error 0
  5074. dir: dir isU
  5075. |\720: O: O1440 (predict-no)
  5076. I see 1 and I'm going to do: predict-no
  5077. ENV: Agent did: predict-no for direction U in state State-B
  5078. In State-B moving U
  5079. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5080. predict error 0
  5081. dir: dir isL
  5082. -/721: O: O1441 (predict-yes)
  5083. I see 1 and I'm going to do: predict-yes
  5084. ENV: Agent did: predict-yes for direction L in state State-B
  5085. In State-B moving L
  5086. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5087. predict error 0
  5088. dir: dir isL
  5089. |722: O: O1444 (predict-no)
  5090. I see 1 and I'm going to do: predict-no
  5091. ENV: Agent did: predict-no for direction L in state State-A
  5092. In State-A moving L
  5093. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5094. predict error 0
  5095. dir: dir isR
  5096. \-/723: O: O1445 (predict-yes)
  5097. I see 1 and I'm going to do: predict-yes
  5098. ENV: Agent did: predict-yes for direction R in state State-A
  5099. In State-A moving R
  5100. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5101. predict error 0
  5102. dir: dir isL
  5103. |\724: O: O1447 (predict-yes)
  5104. I see 1 and I'm going to do: predict-yes
  5105. ENV: Agent did: predict-yes for direction L in state State-B
  5106. In State-B moving L
  5107. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5108. predict error 0
  5109. dir: dir isL
  5110. -/|725: O: O1450 (predict-no)
  5111. I see 1 and I'm going to do: predict-no
  5112. ENV: Agent did: predict-no for direction L in state State-A
  5113. In State-A moving L
  5114. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5115. predict error 0
  5116. dir: dir isL
  5117. \726: O: O1452 (predict-no)
  5118. I see 1 and I'm going to do: predict-no
  5119. ENV: Agent did: predict-no for direction L in state State-A
  5120. In State-A moving L
  5121. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5122. predict error 0
  5123. dir: dir isR
  5124. -/|727: O: O1453 (predict-yes)
  5125. I see 1 and I'm going to do: predict-yes
  5126. ENV: Agent did: predict-yes for direction R in state State-A
  5127. In State-A moving R
  5128. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5129. predict error 0
  5130. dir: dir isR
  5131. \-728: O: O1456 (predict-no)
  5132. I see 1 and I'm going to do: predict-no
  5133. ENV: Agent did: predict-no for direction R in state State-B
  5134. In State-B moving R
  5135. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5136. predict error 0
  5137. dir: dir isR
  5138. /|\729: O: O1458 (predict-no)
  5139. I see 1 and I'm going to do: predict-no
  5140. ENV: Agent did: predict-no for direction R in state State-B
  5141. In State-B moving R
  5142. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5143. predict error 0
  5144. dir: dir isU
  5145. -730: O: O1460 (predict-no)
  5146. I see 1 and I'm going to do: predict-no
  5147. ENV: Agent did: predict-no for direction U in state State-B
  5148. In State-B moving U
  5149. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5150. predict error 0
  5151. dir: dir isL
  5152. /|\731: O: O1461 (predict-yes)
  5153. I see 1 and I'm going to do: predict-yes
  5154. ENV: Agent did: predict-yes for direction L in state State-B
  5155. In State-B moving L
  5156. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5157. predict error 0
  5158. dir: dir isR
  5159. -732: O: O1463 (predict-yes)
  5160. I see 1 and I'm going to do: predict-yes
  5161. ENV: Agent did: predict-yes for direction R in state State-A
  5162. In State-A moving R
  5163. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5164. predict error 0
  5165. dir: dir isL
  5166. /|733: O: O1465 (predict-yes)
  5167. I see 1 and I'm going to do: predict-yes
  5168. ENV: Agent did: predict-yes for direction L in state State-B
  5169. In State-B moving L
  5170. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5171. predict error 0
  5172. dir: dir isU
  5173. \-/734: O: O1468 (predict-no)
  5174. I see 1 and I'm going to do: predict-no
  5175. ENV: Agent did: predict-no for direction U in state State-A
  5176. In State-A moving U
  5177. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5178. predict error 0
  5179. dir: dir isR
  5180. |\735: O: O1469 (predict-yes)
  5181. I see 1 and I'm going to do: predict-yes
  5182. ENV: Agent did: predict-yes for direction R in state State-A
  5183. In State-A moving R
  5184. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5185. predict error 0
  5186. dir: dir isL
  5187. -/|736: O: O1471 (predict-yes)
  5188. I see 1 and I'm going to do: predict-yes
  5189. ENV: Agent did: predict-yes for direction L in state State-B
  5190. In State-B moving L
  5191. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5192. predict error 0
  5193. dir: dir isL
  5194. \-/737: O: O1474 (predict-no)
  5195. I see 1 and I'm going to do: predict-no
  5196. ENV: Agent did: predict-no for direction L in state State-A
  5197. In State-A moving L
  5198. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5199. predict error 0
  5200. dir: dir isR
  5201. |\738: O: O1475 (predict-yes)
  5202. I see 1 and I'm going to do: predict-yes
  5203. ENV: Agent did: predict-yes for direction R in state State-A
  5204. In State-A moving R
  5205. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5206. predict error 0
  5207. dir: dir isR
  5208. -/|739: O: O1478 (predict-no)
  5209. I see 1 and I'm going to do: predict-no
  5210. ENV: Agent did: predict-no for direction R in state State-B
  5211. In State-B moving R
  5212. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5213. predict error 0
  5214. dir: dir isU
  5215. \-/740: O: O1480 (predict-no)
  5216. I see 1 and I'm going to do: predict-no
  5217. ENV: Agent did: predict-no for direction U in state State-B
  5218. In State-B moving U
  5219. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5220. predict error 0
  5221. dir: dir isR
  5222. |\-741: O: O1482 (predict-no)
  5223. I see 1 and I'm going to do: predict-no
  5224. ENV: Agent did: predict-no for direction R in state State-B
  5225. In State-B moving R
  5226. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5227. predict error 0
  5228. dir: dir isR
  5229. /742: O: O1484 (predict-no)
  5230. I see 1 and I'm going to do: predict-no
  5231. ENV: Agent did: predict-no for direction R in state State-B
  5232. In State-B moving R
  5233. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5234. predict error 0
  5235. dir: dir isU
  5236. |\-743: O: O1486 (predict-no)
  5237. I see 1 and I'm going to do: predict-no
  5238. ENV: Agent did: predict-no for direction U in state State-B
  5239. In State-B moving U
  5240. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5241. predict error 0
  5242. dir: dir isR
  5243. /|\744: O: O1488 (predict-no)
  5244. I see 1 and I'm going to do: predict-no
  5245. ENV: Agent did: predict-no for direction R in state State-B
  5246. In State-B moving R
  5247. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5248. predict error 0
  5249. dir: dir isL
  5250. -/|745: O: O1489 (predict-yes)
  5251. I see 1 and I'm going to do: predict-yes
  5252. ENV: Agent did: predict-yes for direction L in state State-B
  5253. In State-B moving L
  5254. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5255. predict error 0
  5256. dir: dir isU
  5257. \-/746: O: O1492 (predict-no)
  5258. I see 1 and I'm going to do: predict-no
  5259. ENV: Agent did: predict-no for direction U in state State-A
  5260. In State-A moving U
  5261. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5262. predict error 0
  5263. dir: dir isR
  5264. |\-747: O: O1493 (predict-yes)
  5265. I see 1 and I'm going to do: predict-yes
  5266. ENV: Agent did: predict-yes for direction R in state State-A
  5267. In State-A moving R
  5268. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5269. predict error 0
  5270. dir: dir isU
  5271. /|\748: O: O1496 (predict-no)
  5272. I see 1 and I'm going to do: predict-no
  5273. ENV: Agent did: predict-no for direction U in state State-B
  5274. In State-B moving U
  5275. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5276. predict error 0
  5277. dir: dir isR
  5278. -749: O: O1498 (predict-no)
  5279. I see 1 and I'm going to do: predict-no
  5280. ENV: Agent did: predict-no for direction R in state State-B
  5281. In State-B moving R
  5282. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5283. predict error 0
  5284. dir: dir isU
  5285. /|750: O: O1500 (predict-no)
  5286. I see 1 and I'm going to do: predict-no
  5287. ENV: Agent did: predict-no for direction U in state State-B
  5288. In State-B moving U
  5289. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5290. predict error 0
  5291. dir: dir isU
  5292. \-/751: O: O1502 (predict-no)
  5293. I see 1 and I'm going to do: predict-no
  5294. ENV: Agent did: predict-no for direction U in state State-B
  5295. In State-B moving U
  5296. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5297. predict error 0
  5298. dir: dir isR
  5299. |752: O: O1504 (predict-no)
  5300. I see 1 and I'm going to do: predict-no
  5301. ENV: Agent did: predict-no for direction R in state State-B
  5302. In State-B moving R
  5303. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5304. predict error 0
  5305. dir: dir isU
  5306. \-753: O: O1506 (predict-no)
  5307. I see 1 and I'm going to do: predict-no
  5308. ENV: Agent did: predict-no for direction U in state State-B
  5309. In State-B moving U
  5310. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5311. predict error 0
  5312. dir: dir isL
  5313. /|\754: O: O1507 (predict-yes)
  5314. I see 1 and I'm going to do: predict-yes
  5315. ENV: Agent did: predict-yes for direction L in state State-B
  5316. In State-B moving L
  5317. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5318. predict error 0
  5319. dir: dir isR
  5320. -/|755: O: O1509 (predict-yes)
  5321. I see 1 and I'm going to do: predict-yes
  5322. ENV: Agent did: predict-yes for direction R in state State-A
  5323. In State-A moving R
  5324. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5325. predict error 0
  5326. dir: dir isU
  5327. \-/756: O: O1512 (predict-no)
  5328. I see 1 and I'm going to do: predict-no
  5329. ENV: Agent did: predict-no for direction U in state State-B
  5330. In State-B moving U
  5331. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5332. predict error 0
  5333. dir: dir isL
  5334. |\-757: O: O1513 (predict-yes)
  5335. I see 1 and I'm going to do: predict-yes
  5336. ENV: Agent did: predict-yes for direction L in state State-B
  5337. In State-B moving L
  5338. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5339. predict error 0
  5340. dir: dir isU
  5341. /|758: O: O1516 (predict-no)
  5342. I see 1 and I'm going to do: predict-no
  5343. ENV: Agent did: predict-no for direction U in state State-A
  5344. In State-A moving U
  5345. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5346. predict error 0
  5347. dir: dir isU
  5348. \759: O: O1518 (predict-no)
  5349. I see 1 and I'm going to do: predict-no
  5350. ENV: Agent did: predict-no for direction U in state State-A
  5351. In State-A moving U
  5352. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5353. predict error 0
  5354. dir: dir isU
  5355. -/760: O: O1520 (predict-no)
  5356. I see 1 and I'm going to do: predict-no
  5357. ENV: Agent did: predict-no for direction U in state State-A
  5358. In State-A moving U
  5359. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5360. predict error 0
  5361. dir: dir isU
  5362. |761: O: O1522 (predict-no)
  5363. I see 1 and I'm going to do: predict-no
  5364. ENV: Agent did: predict-no for direction U in state State-A
  5365. In State-A moving U
  5366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5367. predict error 0
  5368. dir: dir isL
  5369. \762: O: O1524 (predict-no)
  5370. I see 1 and I'm going to do: predict-no
  5371. ENV: Agent did: predict-no for direction L in state State-A
  5372. In State-A moving L
  5373. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5374. predict error 0
  5375. dir: dir isR
  5376. -/|763: O: O1526 (predict-no)
  5377. I see 1 and I'm going to do: predict-no
  5378. ENV: Agent did: predict-no for direction R in state State-A
  5379. In State-A moving R
  5380. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  5381. predict error 1
  5382. dir: dir isR
  5383. \-/764: O: O1528 (predict-no)
  5384. I see 0 and I'm going to do: predict-no
  5385. ENV: Agent did: predict-no for direction R in state State-B
  5386. In State-B moving R
  5387. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5388. predict error 0
  5389. dir: dir isR
  5390. |\765: O: O1530 (predict-no)
  5391. I see 1 and I'm going to do: predict-no
  5392. ENV: Agent did: predict-no for direction R in state State-B
  5393. In State-B moving R
  5394. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5395. predict error 0
  5396. dir: dir isR
  5397. -/|766: O: O1532 (predict-no)
  5398. I see 1 and I'm going to do: predict-no
  5399. ENV: Agent did: predict-no for direction R in state State-B
  5400. In State-B moving R
  5401. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5402. predict error 0
  5403. dir: dir isU
  5404. \767: O: O1534 (predict-no)
  5405. I see 1 and I'm going to do: predict-no
  5406. ENV: Agent did: predict-no for direction U in state State-B
  5407. In State-B moving U
  5408. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5409. predict error 0
  5410. dir: dir isL
  5411. -/768: O: O1535 (predict-yes)
  5412. I see 1 and I'm going to do: predict-yes
  5413. ENV: Agent did: predict-yes for direction L in state State-B
  5414. In State-B moving L
  5415. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5416. predict error 0
  5417. dir: dir isL
  5418. |769: O: O1538 (predict-no)
  5419. I see 1 and I'm going to do: predict-no
  5420. ENV: Agent did: predict-no for direction L in state State-A
  5421. In State-A moving L
  5422. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5423. predict error 0
  5424. dir: dir isR
  5425. \-770: O: O1539 (predict-yes)
  5426. I see 1 and I'm going to do: predict-yes
  5427. ENV: Agent did: predict-yes for direction R in state State-A
  5428. In State-A moving R
  5429. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5430. predict error 0
  5431. dir: dir isU
  5432. /|\771: O: O1542 (predict-no)
  5433. I see 1 and I'm going to do: predict-no
  5434. ENV: Agent did: predict-no for direction U in state State-B
  5435. In State-B moving U
  5436. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5437. predict error 0
  5438. dir: dir isU
  5439. -772: O: O1544 (predict-no)
  5440. I see 1 and I'm going to do: predict-no
  5441. ENV: Agent did: predict-no for direction U in state State-B
  5442. In State-B moving U
  5443. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5444. predict error 0
  5445. dir: dir isU
  5446. /|\773: O: O1546 (predict-no)
  5447. I see 1 and I'm going to do: predict-no
  5448. ENV: Agent did: predict-no for direction U in state State-B
  5449. In State-B moving U
  5450. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5451. predict error 0
  5452. dir: dir isL
  5453. -774: O: O1547 (predict-yes)
  5454. I see 1 and I'm going to do: predict-yes
  5455. ENV: Agent did: predict-yes for direction L in state State-B
  5456. In State-B moving L
  5457. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5458. predict error 0
  5459. dir: dir isU
  5460. /|775: O: O1550 (predict-no)
  5461. I see 1 and I'm going to do: predict-no
  5462. ENV: Agent did: predict-no for direction U in state State-A
  5463. In State-A moving U
  5464. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5465. predict error 0
  5466. dir: dir isL
  5467. \-776: O: O1552 (predict-no)
  5468. I see 1 and I'm going to do: predict-no
  5469. ENV: Agent did: predict-no for direction L in state State-A
  5470. In State-A moving L
  5471. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5472. predict error 0
  5473. dir: dir isL
  5474. /|777: O: O1554 (predict-no)
  5475. I see 1 and I'm going to do: predict-no
  5476. ENV: Agent did: predict-no for direction L in state State-A
  5477. In State-A moving L
  5478. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5479. predict error 0
  5480. dir: dir isU
  5481. \-/778: O: O1556 (predict-no)
  5482. I see 1 and I'm going to do: predict-no
  5483. ENV: Agent did: predict-no for direction U in state State-A
  5484. In State-A moving U
  5485. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5486. predict error 0
  5487. dir: dir isU
  5488. |\779: O: O1558 (predict-no)
  5489. I see 1 and I'm going to do: predict-no
  5490. ENV: Agent did: predict-no for direction U in state State-A
  5491. In State-A moving U
  5492. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5493. predict error 0
  5494. dir: dir isR
  5495. -/|780: O: O1559 (predict-yes)
  5496. I see 1 and I'm going to do: predict-yes
  5497. ENV: Agent did: predict-yes for direction R in state State-A
  5498. In State-A moving R
  5499. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5500. predict error 0
  5501. dir: dir isU
  5502. \-/781: O: O1562 (predict-no)
  5503. I see 1 and I'm going to do: predict-no
  5504. ENV: Agent did: predict-no for direction U in state State-B
  5505. In State-B moving U
  5506. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5507. predict error 0
  5508. dir: dir isL
  5509. |782: O: O1563 (predict-yes)
  5510. I see 1 and I'm going to do: predict-yes
  5511. ENV: Agent did: predict-yes for direction L in state State-B
  5512. In State-B moving L
  5513. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5514. predict error 0
  5515. dir: dir isR
  5516. \-/783: O: O1565 (predict-yes)
  5517. I see 1 and I'm going to do: predict-yes
  5518. ENV: Agent did: predict-yes for direction R in state State-A
  5519. In State-A moving R
  5520. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5521. predict error 0
  5522. dir: dir isL
  5523. |\784: O: O1567 (predict-yes)
  5524. I see 1 and I'm going to do: predict-yes
  5525. ENV: Agent did: predict-yes for direction L in state State-B
  5526. In State-B moving L
  5527. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5528. predict error 0
  5529. dir: dir isL
  5530. -/785: O: O1570 (predict-no)
  5531. I see 1 and I'm going to do: predict-no
  5532. ENV: Agent did: predict-no for direction L in state State-A
  5533. In State-A moving L
  5534. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5535. predict error 0
  5536. dir: dir isL
  5537. |\786: O: O1572 (predict-no)
  5538. I see 1 and I'm going to do: predict-no
  5539. ENV: Agent did: predict-no for direction L in state State-A
  5540. In State-A moving L
  5541. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5542. predict error 0
  5543. dir: dir isL
  5544. -787: O: O1574 (predict-no)
  5545. I see 1 and I'm going to do: predict-no
  5546. ENV: Agent did: predict-no for direction L in state State-A
  5547. In State-A moving L
  5548. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5549. predict error 0
  5550. dir: dir isR
  5551. /|\788: O: O1575 (predict-yes)
  5552. I see 1 and I'm going to do: predict-yes
  5553. ENV: Agent did: predict-yes for direction R in state State-A
  5554. In State-A moving R
  5555. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5556. predict error 0
  5557. dir: dir isR
  5558. -/|789: O: O1578 (predict-no)
  5559. I see 1 and I'm going to do: predict-no
  5560. ENV: Agent did: predict-no for direction R in state State-B
  5561. In State-B moving R
  5562. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5563. predict error 0
  5564. dir: dir isL
  5565. \-790: O: O1579 (predict-yes)
  5566. I see 1 and I'm going to do: predict-yes
  5567. ENV: Agent did: predict-yes for direction L in state State-B
  5568. In State-B moving L
  5569. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5570. predict error 0
  5571. dir: dir isR
  5572. /791: O: O1581 (predict-yes)
  5573. I see 1 and I'm going to do: predict-yes
  5574. ENV: Agent did: predict-yes for direction R in state State-A
  5575. In State-A moving R
  5576. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5577. predict error 0
  5578. dir: dir isL
  5579. |792: O: O1583 (predict-yes)
  5580. I see 1 and I'm going to do: predict-yes
  5581. ENV: Agent did: predict-yes for direction L in state State-B
  5582. In State-B moving L
  5583. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5584. predict error 0
  5585. dir: dir isR
  5586. \-793: O: O1585 (predict-yes)
  5587. I see 1 and I'm going to do: predict-yes
  5588. ENV: Agent did: predict-yes for direction R in state State-A
  5589. In State-A moving R
  5590. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5591. predict error 0
  5592. dir: dir isR
  5593. /|\794: O: O1588 (predict-no)
  5594. I see 1 and I'm going to do: predict-no
  5595. ENV: Agent did: predict-no for direction R in state State-B
  5596. In State-B moving R
  5597. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5598. predict error 0
  5599. dir: dir isR
  5600. -/795: O: O1590 (predict-no)
  5601. I see 1 and I'm going to do: predict-no
  5602. ENV: Agent did: predict-no for direction R in state State-B
  5603. In State-B moving R
  5604. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5605. predict error 0
  5606. dir: dir isU
  5607. |\-796: O: O1592 (predict-no)
  5608. I see 1 and I'm going to do: predict-no
  5609. ENV: Agent did: predict-no for direction U in state State-B
  5610. In State-B moving U
  5611. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5612. predict error 0
  5613. dir: dir isL
  5614. /|\797: O: O1593 (predict-yes)
  5615. I see 1 and I'm going to do: predict-yes
  5616. ENV: Agent did: predict-yes for direction L in state State-B
  5617. In State-B moving L
  5618. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5619. predict error 0
  5620. dir: dir isR
  5621. -/798: O: O1595 (predict-yes)
  5622. I see 1 and I'm going to do: predict-yes
  5623. ENV: Agent did: predict-yes for direction R in state State-A
  5624. In State-A moving R
  5625. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5626. predict error 0
  5627. dir: dir isR
  5628. |\-799: O: O1598 (predict-no)
  5629. I see 1 and I'm going to do: predict-no
  5630. ENV: Agent did: predict-no for direction R in state State-B
  5631. In State-B moving R
  5632. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5633. predict error 0
  5634. dir: dir isR
  5635. /|\800: O: O1600 (predict-no)
  5636. I see 1 and I'm going to do: predict-no
  5637. ENV: Agent did: predict-no for direction R in state State-B
  5638. In State-B moving R
  5639. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5640. predict error 0
  5641. dir: dir isU
  5642. -/801: O: O1602 (predict-no)
  5643. I see 1 and I'm going to do: predict-no
  5644. ENV: Agent did: predict-no for direction U in state State-B
  5645. In State-B moving U
  5646. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5647. predict error 0
  5648. dir: dir isR
  5649. |802: O: O1604 (predict-no)
  5650. I see 1 and I'm going to do: predict-no
  5651. ENV: Agent did: predict-no for direction R in state State-B
  5652. In State-B moving R
  5653. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5654. predict error 0
  5655. dir: dir isR
  5656. \-/803: O: O1606 (predict-no)
  5657. I see 1 and I'm going to do: predict-no
  5658. ENV: Agent did: predict-no for direction R in state State-B
  5659. In State-B moving R
  5660. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5661. predict error 0
  5662. dir: dir isR
  5663. |\804: O: O1608 (predict-no)
  5664. I see 1 and I'm going to do: predict-no
  5665. ENV: Agent did: predict-no for direction R in state State-B
  5666. In State-B moving R
  5667. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5668. predict error 0
  5669. dir: dir isR
  5670. -/805: O: O1610 (predict-no)
  5671. I see 1 and I'm going to do: predict-no
  5672. ENV: Agent did: predict-no for direction R in state State-B
  5673. In State-B moving R
  5674. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5675. predict error 0
  5676. dir: dir isU
  5677. |\-806: O: O1612 (predict-no)
  5678. I see 1 and I'm going to do: predict-no
  5679. ENV: Agent did: predict-no for direction U in state State-B
  5680. In State-B moving U
  5681. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5682. predict error 0
  5683. dir: dir isU
  5684. /807: O: O1614 (predict-no)
  5685. I see 1 and I'm going to do: predict-no
  5686. ENV: Agent did: predict-no for direction U in state State-B
  5687. In State-B moving U
  5688. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5689. predict error 0
  5690. dir: dir isR
  5691. |\-808: O: O1616 (predict-no)
  5692. I see 1 and I'm going to do: predict-no
  5693. ENV: Agent did: predict-no for direction R in state State-B
  5694. In State-B moving R
  5695. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5696. predict error 0
  5697. dir: dir isL
  5698. /|\809: O: O1617 (predict-yes)
  5699. I see 1 and I'm going to do: predict-yes
  5700. ENV: Agent did: predict-yes for direction L in state State-B
  5701. In State-B moving L
  5702. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5703. predict error 0
  5704. dir: dir isR
  5705. -/|810: O: O1619 (predict-yes)
  5706. I see 1 and I'm going to do: predict-yes
  5707. ENV: Agent did: predict-yes for direction R in state State-A
  5708. In State-A moving R
  5709. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5710. predict error 0
  5711. dir: dir isL
  5712. \-/811: O: O1621 (predict-yes)
  5713. I see 1 and I'm going to do: predict-yes
  5714. ENV: Agent did: predict-yes for direction L in state State-B
  5715. In State-B moving L
  5716. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5717. predict error 0
  5718. dir: dir isR
  5719. |812: O: O1623 (predict-yes)
  5720. I see 1 and I'm going to do: predict-yes
  5721. ENV: Agent did: predict-yes for direction R in state State-A
  5722. In State-A moving R
  5723. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5724. predict error 0
  5725. dir: dir isL
  5726. \-813: O: O1625 (predict-yes)
  5727. I see 1 and I'm going to do: predict-yes
  5728. ENV: Agent did: predict-yes for direction L in state State-B
  5729. In State-B moving L
  5730. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5731. predict error 0
  5732. dir: dir isU
  5733. /|\814: O: O1628 (predict-no)
  5734. I see 1 and I'm going to do: predict-no
  5735. ENV: Agent did: predict-no for direction U in state State-A
  5736. In State-A moving U
  5737. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5738. predict error 0
  5739. dir: dir isU
  5740. -/|815: O: O1630 (predict-no)
  5741. I see 1 and I'm going to do: predict-no
  5742. ENV: Agent did: predict-no for direction U in state State-A
  5743. In State-A moving U
  5744. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5745. predict error 0
  5746. dir: dir isR
  5747. \-/816: O: O1631 (predict-yes)
  5748. I see 1 and I'm going to do: predict-yes
  5749. ENV: Agent did: predict-yes for direction R in state State-A
  5750. In State-A moving R
  5751. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5752. predict error 0
  5753. dir: dir isR
  5754. |\817: O: O1634 (predict-no)
  5755. I see 1 and I'm going to do: predict-no
  5756. ENV: Agent did: predict-no for direction R in state State-B
  5757. In State-B moving R
  5758. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5759. predict error 0
  5760. dir: dir isL
  5761. -/|818: O: O1635 (predict-yes)
  5762. I see 1 and I'm going to do: predict-yes
  5763. ENV: Agent did: predict-yes for direction L in state State-B
  5764. In State-B moving L
  5765. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5766. predict error 0
  5767. dir: dir isR
  5768. \819: O: O1637 (predict-yes)
  5769. I see 1 and I'm going to do: predict-yes
  5770. ENV: Agent did: predict-yes for direction R in state State-A
  5771. In State-A moving R
  5772. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5773. predict error 0
  5774. dir: dir isR
  5775. -/|820: O: O1640 (predict-no)
  5776. I see 1 and I'm going to do: predict-no
  5777. ENV: Agent did: predict-no for direction R in state State-B
  5778. In State-B moving R
  5779. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5780. predict error 0
  5781. dir: dir isR
  5782. \-/821: O: O1642 (predict-no)
  5783. I see 1 and I'm going to do: predict-no
  5784. ENV: Agent did: predict-no for direction R in state State-B
  5785. In State-B moving R
  5786. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5787. predict error 0
  5788. dir: dir isL
  5789. |822: O: O1643 (predict-yes)
  5790. I see 1 and I'm going to do: predict-yes
  5791. ENV: Agent did: predict-yes for direction L in state State-B
  5792. In State-B moving L
  5793. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5794. predict error 0
  5795. dir: dir isL
  5796. \-/823: O: O1646 (predict-no)
  5797. I see 1 and I'm going to do: predict-no
  5798. ENV: Agent did: predict-no for direction L in state State-A
  5799. In State-A moving L
  5800. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5801. predict error 0
  5802. dir: dir isU
  5803. |\-824: O: O1648 (predict-no)
  5804. I see 1 and I'm going to do: predict-no
  5805. ENV: Agent did: predict-no for direction U in state State-A
  5806. In State-A moving U
  5807. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5808. predict error 0
  5809. dir: dir isU
  5810. /|825: O: O1650 (predict-no)
  5811. I see 1 and I'm going to do: predict-no
  5812. ENV: Agent did: predict-no for direction U in state State-A
  5813. In State-A moving U
  5814. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5815. predict error 0
  5816. dir: dir isU
  5817. \-/826: O: O1652 (predict-no)
  5818. I see 1 and I'm going to do: predict-no
  5819. ENV: Agent did: predict-no for direction U in state State-A
  5820. In State-A moving U
  5821. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5822. predict error 0
  5823. dir: dir isR
  5824. |\-827: O: O1653 (predict-yes)
  5825. I see 1 and I'm going to do: predict-yes
  5826. ENV: Agent did: predict-yes for direction R in state State-A
  5827. In State-A moving R
  5828. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5829. predict error 0
  5830. dir: dir isL
  5831. /|\828: O: O1655 (predict-yes)
  5832. I see 1 and I'm going to do: predict-yes
  5833. ENV: Agent did: predict-yes for direction L in state State-B
  5834. In State-B moving L
  5835. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5836. predict error 0
  5837. dir: dir isL
  5838. -/|829: O: O1658 (predict-no)
  5839. I see 1 and I'm going to do: predict-no
  5840. ENV: Agent did: predict-no for direction L in state State-A
  5841. In State-A moving L
  5842. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5843. predict error 0
  5844. dir: dir isU
  5845. \-/830: O: O1660 (predict-no)
  5846. I see 1 and I'm going to do: predict-no
  5847. ENV: Agent did: predict-no for direction U in state State-A
  5848. In State-A moving U
  5849. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5850. predict error 0
  5851. dir: dir isU
  5852. |\-831: O: O1662 (predict-no)
  5853. I see 1 and I'm going to do: predict-no
  5854. ENV: Agent did: predict-no for direction U in state State-A
  5855. In State-A moving U
  5856. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5857. predict error 0
  5858. dir: dir isR
  5859. /832: O: O1663 (predict-yes)
  5860. I see 1 and I'm going to do: predict-yes
  5861. ENV: Agent did: predict-yes for direction R in state State-A
  5862. In State-A moving R
  5863. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5864. predict error 0
  5865. dir: dir isR
  5866. |\833: O: O1666 (predict-no)
  5867. I see 1 and I'm going to do: predict-no
  5868. ENV: Agent did: predict-no for direction R in state State-B
  5869. In State-B moving R
  5870. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5871. predict error 0
  5872. dir: dir isU
  5873. -/|834: O: O1668 (predict-no)
  5874. I see 1 and I'm going to do: predict-no
  5875. ENV: Agent did: predict-no for direction U in state State-B
  5876. In State-B moving U
  5877. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5878. predict error 0
  5879. dir: dir isL
  5880. \-/835: O: O1669 (predict-yes)
  5881. I see 1 and I'm going to do: predict-yes
  5882. ENV: Agent did: predict-yes for direction L in state State-B
  5883. In State-B moving L
  5884. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  5885. predict error 0
  5886. dir: dir isU
  5887. |836: O: O1672 (predict-no)
  5888. I see 1 and I'm going to do: predict-no
  5889. ENV: Agent did: predict-no for direction U in state State-A
  5890. In State-A moving U
  5891. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  5892. predict error 0
  5893. dir: dir isR
  5894. \-/837: O: O1673 (predict-yes)
  5895. I see 1 and I'm going to do: predict-yes
  5896. ENV: Agent did: predict-yes for direction R in state State-A
  5897. In State-A moving R
  5898. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  5899. predict error 0
  5900. dir: dir isU
  5901. |838: O: O1676 (predict-no)
  5902. I see 1 and I'm going to do: predict-no
  5903. ENV: Agent did: predict-no for direction U in state State-B
  5904. In State-B moving U
  5905. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5906. predict error 0
  5907. dir: dir isR
  5908. \-839: O: O1678 (predict-no)
  5909. I see 1 and I'm going to do: predict-no
  5910. ENV: Agent did: predict-no for direction R in state State-B
  5911. In State-B moving R
  5912. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5913. predict error 0
  5914. dir: dir isR
  5915. /|\840: O: O1680 (predict-no)
  5916. I see 1 and I'm going to do: predict-no
  5917. ENV: Agent did: predict-no for direction R in state State-B
  5918. In State-B moving R
  5919. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5920. predict error 0
  5921. dir: dir isR
  5922. -/|841: O: O1682 (predict-no)
  5923. I see 1 and I'm going to do: predict-no
  5924. ENV: Agent did: predict-no for direction R in state State-B
  5925. In State-B moving R
  5926. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5927. predict error 0
  5928. dir: dir isR
  5929. \842: O: O1684 (predict-no)
  5930. I see 1 and I'm going to do: predict-no
  5931. ENV: Agent did: predict-no for direction R in state State-B
  5932. In State-B moving R
  5933. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5934. predict error 0
  5935. dir: dir isR
  5936. -/843: O: O1686 (predict-no)
  5937. I see 1 and I'm going to do: predict-no
  5938. ENV: Agent did: predict-no for direction R in state State-B
  5939. In State-B moving R
  5940. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5941. predict error 0
  5942. dir: dir isU
  5943. |\-844: O: O1688 (predict-no)
  5944. I see 1 and I'm going to do: predict-no
  5945. ENV: Agent did: predict-no for direction U in state State-B
  5946. In State-B moving U
  5947. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5948. predict error 0
  5949. dir: dir isU
  5950. /|\845: O: O1690 (predict-no)
  5951. I see 1 and I'm going to do: predict-no
  5952. ENV: Agent did: predict-no for direction U in state State-B
  5953. In State-B moving U
  5954. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5955. predict error 0
  5956. dir: dir isR
  5957. -/|846: O: O1692 (predict-no)
  5958. I see 1 and I'm going to do: predict-no
  5959. ENV: Agent did: predict-no for direction R in state State-B
  5960. In State-B moving R
  5961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5962. predict error 0
  5963. dir: dir isR
  5964. \-/847: O: O1694 (predict-no)
  5965. I see 1 and I'm going to do: predict-no
  5966. ENV: Agent did: predict-no for direction R in state State-B
  5967. In State-B moving R
  5968. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5969. predict error 0
  5970. dir: dir isU
  5971. |\848: O: O1696 (predict-no)
  5972. I see 1 and I'm going to do: predict-no
  5973. ENV: Agent did: predict-no for direction U in state State-B
  5974. In State-B moving U
  5975. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5976. predict error 0
  5977. dir: dir isU
  5978. -/|849: O: O1698 (predict-no)
  5979. I see 1 and I'm going to do: predict-no
  5980. ENV: Agent did: predict-no for direction U in state State-B
  5981. In State-B moving U
  5982. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5983. predict error 0
  5984. dir: dir isR
  5985. \-850: O: O1700 (predict-no)
  5986. I see 1 and I'm going to do: predict-no
  5987. ENV: Agent did: predict-no for direction R in state State-B
  5988. In State-B moving R
  5989. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5990. predict error 0
  5991. dir: dir isU
  5992. /|\851: O: O1702 (predict-no)
  5993. I see 1 and I'm going to do: predict-no
  5994. ENV: Agent did: predict-no for direction U in state State-B
  5995. In State-B moving U
  5996. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  5997. predict error 0
  5998. dir: dir isL
  5999. -852: O: O1703 (predict-yes)
  6000. I see 1 and I'm going to do: predict-yes
  6001. ENV: Agent did: predict-yes for direction L in state State-B
  6002. In State-B moving L
  6003. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6004. predict error 0
  6005. dir: dir isR
  6006. /|\853: O: O1705 (predict-yes)
  6007. I see 1 and I'm going to do: predict-yes
  6008. ENV: Agent did: predict-yes for direction R in state State-A
  6009. In State-A moving R
  6010. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6011. predict error 0
  6012. dir: dir isR
  6013. -/854: O: O1708 (predict-no)
  6014. I see 1 and I'm going to do: predict-no
  6015. ENV: Agent did: predict-no for direction R in state State-B
  6016. In State-B moving R
  6017. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6018. predict error 0
  6019. dir: dir isR
  6020. |\855: O: O1710 (predict-no)
  6021. I see 1 and I'm going to do: predict-no
  6022. ENV: Agent did: predict-no for direction R in state State-B
  6023. In State-B moving R
  6024. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6025. predict error 0
  6026. dir: dir isU
  6027. -/|856: O: O1712 (predict-no)
  6028. I see 1 and I'm going to do: predict-no
  6029. ENV: Agent did: predict-no for direction U in state State-B
  6030. In State-B moving U
  6031. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6032. predict error 0
  6033. dir: dir isU
  6034. \-/857: O: O1714 (predict-no)
  6035. I see 1 and I'm going to do: predict-no
  6036. ENV: Agent did: predict-no for direction U in state State-B
  6037. In State-B moving U
  6038. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6039. predict error 0
  6040. dir: dir isR
  6041. |\858: O: O1716 (predict-no)
  6042. I see 1 and I'm going to do: predict-no
  6043. ENV: Agent did: predict-no for direction R in state State-B
  6044. In State-B moving R
  6045. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6046. predict error 0
  6047. dir: dir isR
  6048. -859: O: O1718 (predict-no)
  6049. I see 1 and I'm going to do: predict-no
  6050. ENV: Agent did: predict-no for direction R in state State-B
  6051. In State-B moving R
  6052. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6053. predict error 0
  6054. dir: dir isU
  6055. /|\860: O: O1720 (predict-no)
  6056. I see 1 and I'm going to do: predict-no
  6057. ENV: Agent did: predict-no for direction U in state State-B
  6058. In State-B moving U
  6059. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6060. predict error 0
  6061. dir: dir isU
  6062. -/|861: O: O1722 (predict-no)
  6063. I see 1 and I'm going to do: predict-no
  6064. ENV: Agent did: predict-no for direction U in state State-B
  6065. In State-B moving U
  6066. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6067. predict error 0
  6068. dir: dir isR
  6069. \862: O: O1724 (predict-no)
  6070. I see 1 and I'm going to do: predict-no
  6071. ENV: Agent did: predict-no for direction R in state State-B
  6072. In State-B moving R
  6073. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6074. predict error 0
  6075. dir: dir isU
  6076. -/|863: O: O1726 (predict-no)
  6077. I see 1 and I'm going to do: predict-no
  6078. ENV: Agent did: predict-no for direction U in state State-B
  6079. In State-B moving U
  6080. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6081. predict error 0
  6082. dir: dir isR
  6083. \-/864: O: O1728 (predict-no)
  6084. I see 1 and I'm going to do: predict-no
  6085. ENV: Agent did: predict-no for direction R in state State-B
  6086. In State-B moving R
  6087. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6088. predict error 0
  6089. dir: dir isL
  6090. |\-865: O: O1729 (predict-yes)
  6091. I see 1 and I'm going to do: predict-yes
  6092. ENV: Agent did: predict-yes for direction L in state State-B
  6093. In State-B moving L
  6094. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6095. predict error 0
  6096. dir: dir isR
  6097. /|866: O: O1731 (predict-yes)
  6098. I see 1 and I'm going to do: predict-yes
  6099. ENV: Agent did: predict-yes for direction R in state State-A
  6100. In State-A moving R
  6101. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6102. predict error 0
  6103. dir: dir isR
  6104. \-/867: O: O1734 (predict-no)
  6105. I see 1 and I'm going to do: predict-no
  6106. ENV: Agent did: predict-no for direction R in state State-B
  6107. In State-B moving R
  6108. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6109. predict error 0
  6110. dir: dir isR
  6111. |\-868: O: O1736 (predict-no)
  6112. I see 1 and I'm going to do: predict-no
  6113. ENV: Agent did: predict-no for direction R in state State-B
  6114. In State-B moving R
  6115. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6116. predict error 0
  6117. dir: dir isR
  6118. /869: O: O1738 (predict-no)
  6119. I see 1 and I'm going to do: predict-no
  6120. ENV: Agent did: predict-no for direction R in state State-B
  6121. In State-B moving R
  6122. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6123. predict error 0
  6124. dir: dir isU
  6125. |\-870: O: O1740 (predict-no)
  6126. I see 1 and I'm going to do: predict-no
  6127. ENV: Agent did: predict-no for direction U in state State-B
  6128. In State-B moving U
  6129. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6130. predict error 0
  6131. dir: dir isU
  6132. /|871: O: O1742 (predict-no)
  6133. I see 1 and I'm going to do: predict-no
  6134. ENV: Agent did: predict-no for direction U in state State-B
  6135. In State-B moving U
  6136. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6137. predict error 0
  6138. dir: dir isR
  6139. \872: O: O1744 (predict-no)
  6140. I see 1 and I'm going to do: predict-no
  6141. ENV: Agent did: predict-no for direction R in state State-B
  6142. In State-B moving R
  6143. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6144. predict error 0
  6145. dir: dir isU
  6146. -873: O: O1746 (predict-no)
  6147. I see 1 and I'm going to do: predict-no
  6148. ENV: Agent did: predict-no for direction U in state State-B
  6149. In State-B moving U
  6150. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6151. predict error 0
  6152. dir: dir isR
  6153. /|\874: O: O1748 (predict-no)
  6154. I see 1 and I'm going to do: predict-no
  6155. ENV: Agent did: predict-no for direction R in state State-B
  6156. In State-B moving R
  6157. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6158. predict error 0
  6159. dir: dir isR
  6160. -/|875: O: O1750 (predict-no)
  6161. I see 1 and I'm going to do: predict-no
  6162. ENV: Agent did: predict-no for direction R in state State-B
  6163. In State-B moving R
  6164. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6165. predict error 0
  6166. dir: dir isR
  6167. \-/876: O: O1752 (predict-no)
  6168. I see 1 and I'm going to do: predict-no
  6169. ENV: Agent did: predict-no for direction R in state State-B
  6170. In State-B moving R
  6171. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6172. predict error 0
  6173. dir: dir isU
  6174. |\877: O: O1754 (predict-no)
  6175. I see 1 and I'm going to do: predict-no
  6176. ENV: Agent did: predict-no for direction U in state State-B
  6177. In State-B moving U
  6178. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6179. predict error 0
  6180. dir: dir isU
  6181. -/|878: O: O1756 (predict-no)
  6182. I see 1 and I'm going to do: predict-no
  6183. ENV: Agent did: predict-no for direction U in state State-B
  6184. In State-B moving U
  6185. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6186. predict error 0
  6187. dir: dir isU
  6188. \-/879: O: O1758 (predict-no)
  6189. I see 1 and I'm going to do: predict-no
  6190. ENV: Agent did: predict-no for direction U in state State-B
  6191. In State-B moving U
  6192. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6193. predict error 0
  6194. dir: dir isU
  6195. |\-880: O: O1760 (predict-no)
  6196. I see 1 and I'm going to do: predict-no
  6197. ENV: Agent did: predict-no for direction U in state State-B
  6198. In State-B moving U
  6199. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6200. predict error 0
  6201. dir: dir isR
  6202. /|\881: O: O1762 (predict-no)
  6203. I see 1 and I'm going to do: predict-no
  6204. ENV: Agent did: predict-no for direction R in state State-B
  6205. In State-B moving R
  6206. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6207. predict error 0
  6208. dir: dir isL
  6209. -882: O: O1763 (predict-yes)
  6210. I see 1 and I'm going to do: predict-yes
  6211. ENV: Agent did: predict-yes for direction L in state State-B
  6212. In State-B moving L
  6213. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6214. predict error 0
  6215. dir: dir isR
  6216. /|\883: O: O1765 (predict-yes)
  6217. I see 1 and I'm going to do: predict-yes
  6218. ENV: Agent did: predict-yes for direction R in state State-A
  6219. In State-A moving R
  6220. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6221. predict error 0
  6222. dir: dir isL
  6223. -/|884: O: O1767 (predict-yes)
  6224. I see 1 and I'm going to do: predict-yes
  6225. ENV: Agent did: predict-yes for direction L in state State-B
  6226. In State-B moving L
  6227. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6228. predict error 0
  6229. dir: dir isU
  6230. \-/885: O: O1770 (predict-no)
  6231. I see 1 and I'm going to do: predict-no
  6232. ENV: Agent did: predict-no for direction U in state State-A
  6233. In State-A moving U
  6234. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6235. predict error 0
  6236. dir: dir isR
  6237. |\-886: O: O1771 (predict-yes)
  6238. I see 1 and I'm going to do: predict-yes
  6239. ENV: Agent did: predict-yes for direction R in state State-A
  6240. In State-A moving R
  6241. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6242. predict error 0
  6243. dir: dir isR
  6244. /|\887: O: O1774 (predict-no)
  6245. I see 1 and I'm going to do: predict-no
  6246. ENV: Agent did: predict-no for direction R in state State-B
  6247. In State-B moving R
  6248. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6249. predict error 0
  6250. dir: dir isU
  6251. -/|888: O: O1776 (predict-no)
  6252. I see 1 and I'm going to do: predict-no
  6253. ENV: Agent did: predict-no for direction U in state State-B
  6254. In State-B moving U
  6255. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6256. predict error 0
  6257. dir: dir isL
  6258. \889: O: O1777 (predict-yes)
  6259. I see 1 and I'm going to do: predict-yes
  6260. ENV: Agent did: predict-yes for direction L in state State-B
  6261. In State-B moving L
  6262. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6263. predict error 0
  6264. dir: dir isU
  6265. -890: O: O1780 (predict-no)
  6266. I see 1 and I'm going to do: predict-no
  6267. ENV: Agent did: predict-no for direction U in state State-A
  6268. In State-A moving U
  6269. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6270. predict error 0
  6271. dir: dir isL
  6272. /891: O: O1782 (predict-no)
  6273. I see 1 and I'm going to do: predict-no
  6274. ENV: Agent did: predict-no for direction L in state State-A
  6275. In State-A moving L
  6276. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6277. predict error 0
  6278. dir: dir isL
  6279. |892: O: O1784 (predict-no)
  6280. I see 1 and I'm going to do: predict-no
  6281. ENV: Agent did: predict-no for direction L in state State-A
  6282. In State-A moving L
  6283. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6284. predict error 0
  6285. dir: dir isR
  6286. \-/893: O: O1785 (predict-yes)
  6287. I see 1 and I'm going to do: predict-yes
  6288. ENV: Agent did: predict-yes for direction R in state State-A
  6289. In State-A moving R
  6290. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6291. predict error 0
  6292. dir: dir isR
  6293. |\894: O: O1788 (predict-no)
  6294. I see 1 and I'm going to do: predict-no
  6295. ENV: Agent did: predict-no for direction R in state State-B
  6296. In State-B moving R
  6297. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6298. predict error 0
  6299. dir: dir isU
  6300. -/|895: O: O1790 (predict-no)
  6301. I see 1 and I'm going to do: predict-no
  6302. ENV: Agent did: predict-no for direction U in state State-B
  6303. In State-B moving U
  6304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6305. predict error 0
  6306. dir: dir isR
  6307. \-/896: O: O1792 (predict-no)
  6308. I see 1 and I'm going to do: predict-no
  6309. ENV: Agent did: predict-no for direction R in state State-B
  6310. In State-B moving R
  6311. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6312. predict error 0
  6313. dir: dir isL
  6314. |\-897: O: O1793 (predict-yes)
  6315. I see 1 and I'm going to do: predict-yes
  6316. ENV: Agent did: predict-yes for direction L in state State-B
  6317. In State-B moving L
  6318. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6319. predict error 0
  6320. dir: dir isL
  6321. /|\898: O: O1796 (predict-no)
  6322. I see 1 and I'm going to do: predict-no
  6323. ENV: Agent did: predict-no for direction L in state State-A
  6324. In State-A moving L
  6325. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6326. predict error 0
  6327. dir: dir isL
  6328. -/899: O: O1798 (predict-no)
  6329. I see 1 and I'm going to do: predict-no
  6330. ENV: Agent did: predict-no for direction L in state State-A
  6331. In State-A moving L
  6332. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6333. predict error 0
  6334. dir: dir isR
  6335. |\-/sleeping...
  6336. |900: O: O1799 (predict-yes)
  6337. I see 1 and I'm going to do: predict-yes
  6338. ENV: Agent did: predict-yes for direction R in state State-A
  6339. In State-A moving R
  6340. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6341. predict error 0
  6342. dir: dir isU
  6343. \-901: O: O1802 (predict-no)
  6344. I see 1 and I'm going to do: predict-no
  6345. ENV: Agent did: predict-no for direction U in state State-B
  6346. In State-B moving U
  6347. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6348. predict error 0
  6349. dir: dir isL
  6350. /902: O: O1803 (predict-yes)
  6351. I see 1 and I'm going to do: predict-yes
  6352. ENV: Agent did: predict-yes for direction L in state State-B
  6353. In State-B moving L
  6354. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6355. predict error 0
  6356. dir: dir isL
  6357. |\-903: O: O1806 (predict-no)
  6358. I see 1 and I'm going to do: predict-no
  6359. ENV: Agent did: predict-no for direction L in state State-A
  6360. In State-A moving L
  6361. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6362. predict error 0
  6363. dir: dir isR
  6364. /|\904: O: O1807 (predict-yes)
  6365. I see 1 and I'm going to do: predict-yes
  6366. ENV: Agent did: predict-yes for direction R in state State-A
  6367. In State-A moving R
  6368. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6369. predict error 0
  6370. dir: dir isU
  6371. -/|905: O: O1810 (predict-no)
  6372. I see 1 and I'm going to do: predict-no
  6373. ENV: Agent did: predict-no for direction U in state State-B
  6374. In State-B moving U
  6375. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6376. predict error 0
  6377. dir: dir isU
  6378. \-/906: O: O1812 (predict-no)
  6379. I see 1 and I'm going to do: predict-no
  6380. ENV: Agent did: predict-no for direction U in state State-B
  6381. In State-B moving U
  6382. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6383. predict error 0
  6384. dir: dir isU
  6385. |\-907: O: O1814 (predict-no)
  6386. I see 1 and I'm going to do: predict-no
  6387. ENV: Agent did: predict-no for direction U in state State-B
  6388. In State-B moving U
  6389. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6390. predict error 0
  6391. dir: dir isR
  6392. /|\908: O: O1816 (predict-no)
  6393. I see 1 and I'm going to do: predict-no
  6394. ENV: Agent did: predict-no for direction R in state State-B
  6395. In State-B moving R
  6396. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6397. predict error 0
  6398. dir: dir isR
  6399. -/|909: O: O1818 (predict-no)
  6400. I see 1 and I'm going to do: predict-no
  6401. ENV: Agent did: predict-no for direction R in state State-B
  6402. In State-B moving R
  6403. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6404. predict error 0
  6405. dir: dir isL
  6406. \910: O: O1819 (predict-yes)
  6407. I see 1 and I'm going to do: predict-yes
  6408. ENV: Agent did: predict-yes for direction L in state State-B
  6409. In State-B moving L
  6410. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6411. predict error 0
  6412. dir: dir isR
  6413. -/|911: O: O1821 (predict-yes)
  6414. I see 1 and I'm going to do: predict-yes
  6415. ENV: Agent did: predict-yes for direction R in state State-A
  6416. In State-A moving R
  6417. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6418. predict error 0
  6419. dir: dir isL
  6420. \912: O: O1823 (predict-yes)
  6421. I see 1 and I'm going to do: predict-yes
  6422. ENV: Agent did: predict-yes for direction L in state State-B
  6423. In State-B moving L
  6424. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6425. predict error 0
  6426. dir: dir isL
  6427. -/913: O: O1826 (predict-no)
  6428. I see 1 and I'm going to do: predict-no
  6429. ENV: Agent did: predict-no for direction L in state State-A
  6430. In State-A moving L
  6431. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6432. predict error 0
  6433. dir: dir isU
  6434. |\914: O: O1828 (predict-no)
  6435. I see 1 and I'm going to do: predict-no
  6436. ENV: Agent did: predict-no for direction U in state State-A
  6437. In State-A moving U
  6438. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6439. predict error 0
  6440. dir: dir isR
  6441. -/|915: O: O1829 (predict-yes)
  6442. I see 1 and I'm going to do: predict-yes
  6443. ENV: Agent did: predict-yes for direction R in state State-A
  6444. In State-A moving R
  6445. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6446. predict error 0
  6447. dir: dir isU
  6448. \-/916: O: O1832 (predict-no)
  6449. I see 1 and I'm going to do: predict-no
  6450. ENV: Agent did: predict-no for direction U in state State-B
  6451. In State-B moving U
  6452. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6453. predict error 0
  6454. dir: dir isR
  6455. |\917: O: O1834 (predict-no)
  6456. I see 1 and I'm going to do: predict-no
  6457. ENV: Agent did: predict-no for direction R in state State-B
  6458. In State-B moving R
  6459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6460. predict error 0
  6461. dir: dir isL
  6462. -918: O: O1835 (predict-yes)
  6463. I see 1 and I'm going to do: predict-yes
  6464. ENV: Agent did: predict-yes for direction L in state State-B
  6465. In State-B moving L
  6466. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6467. predict error 0
  6468. dir: dir isR
  6469. /|\919: O: O1837 (predict-yes)
  6470. I see 1 and I'm going to do: predict-yes
  6471. ENV: Agent did: predict-yes for direction R in state State-A
  6472. In State-A moving R
  6473. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6474. predict error 0
  6475. dir: dir isR
  6476. -/|920: O: O1840 (predict-no)
  6477. I see 1 and I'm going to do: predict-no
  6478. ENV: Agent did: predict-no for direction R in state State-B
  6479. In State-B moving R
  6480. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6481. predict error 0
  6482. dir: dir isL
  6483. \-/921: O: O1841 (predict-yes)
  6484. I see 1 and I'm going to do: predict-yes
  6485. ENV: Agent did: predict-yes for direction L in state State-B
  6486. In State-B moving L
  6487. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6488. predict error 0
  6489. dir: dir isU
  6490. |922: O: O1844 (predict-no)
  6491. I see 1 and I'm going to do: predict-no
  6492. ENV: Agent did: predict-no for direction U in state State-A
  6493. In State-A moving U
  6494. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6495. predict error 0
  6496. dir: dir isU
  6497. \-/923: O: O1846 (predict-no)
  6498. I see 1 and I'm going to do: predict-no
  6499. ENV: Agent did: predict-no for direction U in state State-A
  6500. In State-A moving U
  6501. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6502. predict error 0
  6503. dir: dir isL
  6504. |\-924: O: O1848 (predict-no)
  6505. I see 1 and I'm going to do: predict-no
  6506. ENV: Agent did: predict-no for direction L in state State-A
  6507. In State-A moving L
  6508. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6509. predict error 0
  6510. dir: dir isL
  6511. /925: O: O1850 (predict-no)
  6512. I see 1 and I'm going to do: predict-no
  6513. ENV: Agent did: predict-no for direction L in state State-A
  6514. In State-A moving L
  6515. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6516. predict error 0
  6517. dir: dir isU
  6518. |\-926: O: O1852 (predict-no)
  6519. I see 1 and I'm going to do: predict-no
  6520. ENV: Agent did: predict-no for direction U in state State-A
  6521. In State-A moving U
  6522. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6523. predict error 0
  6524. dir: dir isU
  6525. /|\927: O: O1854 (predict-no)
  6526. I see 1 and I'm going to do: predict-no
  6527. ENV: Agent did: predict-no for direction U in state State-A
  6528. In State-A moving U
  6529. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6530. predict error 0
  6531. dir: dir isL
  6532. -/928: O: O1856 (predict-no)
  6533. I see 1 and I'm going to do: predict-no
  6534. ENV: Agent did: predict-no for direction L in state State-A
  6535. In State-A moving L
  6536. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6537. predict error 0
  6538. dir: dir isR
  6539. |\-929: O: O1857 (predict-yes)
  6540. I see 1 and I'm going to do: predict-yes
  6541. ENV: Agent did: predict-yes for direction R in state State-A
  6542. In State-A moving R
  6543. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6544. predict error 0
  6545. dir: dir isL
  6546. /|\930: O: O1859 (predict-yes)
  6547. I see 1 and I'm going to do: predict-yes
  6548. ENV: Agent did: predict-yes for direction L in state State-B
  6549. In State-B moving L
  6550. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6551. predict error 0
  6552. dir: dir isU
  6553. -/931: O: O1862 (predict-no)
  6554. I see 1 and I'm going to do: predict-no
  6555. ENV: Agent did: predict-no for direction U in state State-A
  6556. In State-A moving U
  6557. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6558. predict error 0
  6559. dir: dir isR
  6560. |932: O: O1863 (predict-yes)
  6561. I see 1 and I'm going to do: predict-yes
  6562. ENV: Agent did: predict-yes for direction R in state State-A
  6563. In State-A moving R
  6564. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6565. predict error 0
  6566. dir: dir isR
  6567. \-/933: O: O1866 (predict-no)
  6568. I see 1 and I'm going to do: predict-no
  6569. ENV: Agent did: predict-no for direction R in state State-B
  6570. In State-B moving R
  6571. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6572. predict error 0
  6573. dir: dir isL
  6574. |\-934: O: O1867 (predict-yes)
  6575. I see 1 and I'm going to do: predict-yes
  6576. ENV: Agent did: predict-yes for direction L in state State-B
  6577. In State-B moving L
  6578. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6579. predict error 0
  6580. dir: dir isR
  6581. /|\935: O: O1869 (predict-yes)
  6582. I see 1 and I'm going to do: predict-yes
  6583. ENV: Agent did: predict-yes for direction R in state State-A
  6584. In State-A moving R
  6585. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6586. predict error 0
  6587. dir: dir isR
  6588. -/936: O: O1872 (predict-no)
  6589. I see 1 and I'm going to do: predict-no
  6590. ENV: Agent did: predict-no for direction R in state State-B
  6591. In State-B moving R
  6592. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6593. predict error 0
  6594. dir: dir isR
  6595. |\-937: O: O1874 (predict-no)
  6596. I see 1 and I'm going to do: predict-no
  6597. ENV: Agent did: predict-no for direction R in state State-B
  6598. In State-B moving R
  6599. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6600. predict error 0
  6601. dir: dir isL
  6602. /|\938: O: O1875 (predict-yes)
  6603. I see 1 and I'm going to do: predict-yes
  6604. ENV: Agent did: predict-yes for direction L in state State-B
  6605. In State-B moving L
  6606. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6607. predict error 0
  6608. dir: dir isL
  6609. -/|939: O: O1878 (predict-no)
  6610. I see 1 and I'm going to do: predict-no
  6611. ENV: Agent did: predict-no for direction L in state State-A
  6612. In State-A moving L
  6613. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6614. predict error 0
  6615. dir: dir isU
  6616. \-/940: O: O1880 (predict-no)
  6617. I see 1 and I'm going to do: predict-no
  6618. ENV: Agent did: predict-no for direction U in state State-A
  6619. In State-A moving U
  6620. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6621. predict error 0
  6622. dir: dir isU
  6623. |\-941: O: O1882 (predict-no)
  6624. I see 1 and I'm going to do: predict-no
  6625. ENV: Agent did: predict-no for direction U in state State-A
  6626. In State-A moving U
  6627. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6628. predict error 0
  6629. dir: dir isU
  6630. /942: O: O1884 (predict-no)
  6631. I see 1 and I'm going to do: predict-no
  6632. ENV: Agent did: predict-no for direction U in state State-A
  6633. In State-A moving U
  6634. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6635. predict error 0
  6636. dir: dir isR
  6637. |\943: O: O1885 (predict-yes)
  6638. I see 1 and I'm going to do: predict-yes
  6639. ENV: Agent did: predict-yes for direction R in state State-A
  6640. In State-A moving R
  6641. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  6642. predict error 0
  6643. dir: dir isU
  6644. -/|944: O: O1888 (predict-no)
  6645. I see 1 and I'm going to do: predict-no
  6646. ENV: Agent did: predict-no for direction U in state State-B
  6647. In State-B moving U
  6648. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  6649. predict error 0
  6650. dir: dir isL
  6651. \-/945: O: O1889 (predict-yes)
  6652. I see 1 and I'm going to do: predict-yes
  6653. ENV: Agent did: predict-yes for direction L in state State-B
  6654. In State-B moving L
  6655. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  6656. predict error 0
  6657. dir: dir isL
  6658. |\-946: O: O1892 (predict-no)
  6659. I see 1 and I'm going to do: predict-no
  6660. ENV: Agent did: predict-no for direction L in state State-A
  6661. In State-A moving L
  6662. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6663. predict error 0
  6664. dir: dir isU
  6665. /|947: O: O1894 (predict-no)
  6666. I see 1 and I'm going to do: predict-no
  6667. ENV: Agent did: predict-no for direction U in state State-A
  6668. In State-A moving U
  6669. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6670. predict error 0
  6671. dir: dir isL
  6672. \-948: O: O1896 (predict-no)
  6673. I see 1 and I'm going to do: predict-no
  6674. ENV: Agent did: predict-no for direction L in state State-A
  6675. In State-A moving L
  6676. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6677. predict error 0
  6678. dir: dir isL
  6679. /|\949: O: O1898 (predict-no)
  6680. I see 1 and I'm going to do: predict-no
  6681. ENV: Agent did: predict-no for direction L in state State-A
  6682. In State-A moving L
  6683. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6684. predict error 0
  6685. dir: dir isU
  6686. -/|950: O: O1900 (predict-no)
  6687. I see 1 and I'm going to do: predict-no
  6688. ENV: Agent did: predict-no for direction U in state State-A
  6689. In State-A moving U
  6690. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6691. predict error 0
  6692. dir: dir isL
  6693. \-/|\-/--- Input Phase ---
  6694. =>WM: (13326: I2 ^dir L)
  6695. =>WM: (13325: I2 ^reward 1)
  6696. =>WM: (13324: I2 ^see 0)
  6697. =>WM: (13323: N950 ^status complete)
  6698. <=WM: (13312: I2 ^dir U)
  6699. <=WM: (13311: I2 ^reward 1)
  6700. <=WM: (13310: I2 ^see 0)
  6701. =>WM: (13327: I2 ^level-1 L0-root)
  6702. <=WM: (13313: I2 ^level-1 L0-root)
  6703. --- END Input Phase ---
  6704. --- Proposal Phase ---
  6705. --- Inner Elaboration Phase, active level 1 (S1) ---
  6706. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6707. -->
  6708. (S1 ^operator O1899 = -0.208713043145708)
  6709. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6710. -->
  6711. (S1 ^operator O1900 = 0.6854017956462798)
  6712. Firing prefer*rvt*predict-no*H0*4*H1
  6713. -->
  6714. Firing prefer*rvt*predict-yes*H0*3*H1
  6715. -->
  6716. Firing elaborate*copy-see-to-output-link
  6717. -->
  6718. (I3 ^see 0 +)
  6719. Firing elaborate*reward*based*on*reward
  6720. -->
  6721. (R954 ^value 1 +)
  6722. (R1 ^reward R954 +)
  6723. Firing propose*predict-yes
  6724. -->
  6725. (O1901 ^name predict-yes +)
  6726. (S1 ^operator O1901 +)
  6727. Firing propose*predict-no
  6728. -->
  6729. (O1902 ^name predict-no +)
  6730. (S1 ^operator O1902 +)
  6731. Firing rl*prefer*rvt*predict-no*H0*4
  6732. -->
  6733. (S1 ^operator O1900 = 0.3145080651024651)
  6734. Firing rl*prefer*rvt*predict-yes*H0*3
  6735. -->
  6736. (S1 ^operator O1899 = 0.3908143935841644)
  6737. Firing prefer*rvt*predict-yes*H0
  6738. -->
  6739. Firing prefer*rvt*predict-no*H0
  6740. -->
  6741. Firing elaborate*copy-dir-to-output-link
  6742. -->
  6743. (I3 ^dir L +)
  6744. inner elaboration loop at bottom goal.
  6745. Retracting elaborate*copy-see-to-output-link
  6746. -->
  6747. (I3 ^see 0 +)
  6748. Retracting propose*predict-no
  6749. -->
  6750. (O1900 ^name predict-no +)
  6751. (S1 ^operator O1900 +)
  6752. Retracting propose*predict-yes
  6753. -->
  6754. (O1899 ^name predict-yes +)
  6755. (S1 ^operator O1899 +)
  6756. Retracting elaborate*reward*based*on*reward
  6757. -->
  6758. (R953 ^value 1 +)
  6759. (R1 ^reward R953 +)
  6760. Retracting elaborate*copy-dir-to-output-link
  6761. -->
  6762. (I3 ^dir U +)
  6763. Retracting rl*prefer*rvt*predict-no*H0*2
  6764. -->
  6765. (S1 ^operator O1900 = 1.)
  6766. Retracting rl*prefer*rvt*predict-yes*H0*1
  6767. -->
  6768. (S1 ^operator O1899 = 0.)
  6769. =>WM: (13334: S1 ^operator O1902 +)
  6770. =>WM: (13333: S1 ^operator O1901 +)
  6771. =>WM: (13332: I3 ^dir L)
  6772. =>WM: (13331: O1902 ^name predict-no)
  6773. =>WM: (13330: O1901 ^name predict-yes)
  6774. =>WM: (13329: R954 ^value 1)
  6775. =>WM: (13328: R1 ^reward R954)
  6776. <=WM: (13319: S1 ^operator O1899 +)
  6777. <=WM: (13320: S1 ^operator O1900 +)
  6778. <=WM: (13321: S1 ^operator O1900)
  6779. <=WM: (13318: I3 ^dir U)
  6780. <=WM: (13314: R1 ^reward R953)
  6781. <=WM: (13317: O1900 ^name predict-no)
  6782. <=WM: (13316: O1899 ^name predict-yes)
  6783. <=WM: (13315: R953 ^value 1)
  6784. --- Inner Elaboration Phase, active level 1 (S1) ---
  6785. Firing prefer*rvt*predict-yes*H0
  6786. -->
  6787. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6788. -->
  6789. (S1 ^operator O1901 = -0.208713043145708)
  6790. Firing rl*prefer*rvt*predict-yes*H0*3
  6791. -->
  6792. (S1 ^operator O1901 = 0.3908143935841644)
  6793. Firing prefer*rvt*predict-yes*H0*3*H1
  6794. -->
  6795. Firing prefer*rvt*predict-no*H0
  6796. -->
  6797. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6798. -->
  6799. (S1 ^operator O1902 = 0.6854017956462798)
  6800. Firing rl*prefer*rvt*predict-no*H0*4
  6801. -->
  6802. (S1 ^operator O1902 = 0.3145080651024651)
  6803. Firing prefer*rvt*predict-no*H0*4*H1
  6804. -->
  6805. inner elaboration loop at bottom goal.
  6806. Retracting rl*prefer*rvt*predict-no*H0*4
  6807. -->
  6808. (S1 ^operator O1900 = 0.3145080651024651)
  6809. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6810. -->
  6811. (S1 ^operator O1900 = 0.6854017956462798)
  6812. Retracting rl*prefer*rvt*predict-yes*H0*3
  6813. -->
  6814. (S1 ^operator O1899 = 0.3908143935841644)
  6815. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6816. -->
  6817. (S1 ^operator O1899 = -0.208713043145708)
  6818. --- END Proposal Phase ---
  6819. --- Decision Phase ---
  6820. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  6821. =>WM: (13335: S1 ^operator O1902)
  6822. 951: O: O1902 (predict-no)
  6823. --- END Decision Phase ---
  6824. --- Application Phase ---
  6825. --- Firing Productions (PE) For State At Depth 1 ---
  6826. --- Inner Elaboration Phase, active level 1 (S1) ---
  6827. Firing apply*operator
  6828. -->
  6829. (I3 ^predict-no N951 + :O )
  6830. Firing apply*operator*complete
  6831. -->
  6832. (I3 ^predict-no N950 - :O )
  6833. inner elaboration loop at bottom goal.
  6834. --- Change Working Memory (PE) ---
  6835. =>WM: (13336: I3 ^predict-no N951)
  6836. <=WM: (13323: N950 ^status complete)
  6837. <=WM: (13322: I3 ^predict-no N950)
  6838. --- Firing Productions (IE) For State At Depth 1 ---
  6839. --- Inner Elaboration Phase, active level 1 (S1) ---
  6840. Firing monitor*world
  6841. -->
  6842. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  6843. --- Change Working Memory (IE) ---
  6844. --- END Application Phase ---
  6845. --- Output Phase ---
  6846. ENV: Agent did: predict-no for direction L in state State-A
  6847. In State-A moving L
  6848. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  6849. predict error 0
  6850. dir: dir isL
  6851. --- END Output Phase ---
  6852. |--- Input Phase ---
  6853. =>WM: (13340: I2 ^dir L)
  6854. =>WM: (13339: I2 ^reward 1)
  6855. =>WM: (13338: I2 ^see 0)
  6856. =>WM: (13337: N951 ^status complete)
  6857. <=WM: (13326: I2 ^dir L)
  6858. <=WM: (13325: I2 ^reward 1)
  6859. <=WM: (13324: I2 ^see 0)
  6860. =>WM: (13341: I2 ^level-1 L0-root)
  6861. <=WM: (13327: I2 ^level-1 L0-root)
  6862. --- END Input Phase ---
  6863. --- Proposal Phase ---
  6864. --- Inner Elaboration Phase, active level 1 (S1) ---
  6865. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6866. -->
  6867. (S1 ^operator O1901 = -0.208713043145708)
  6868. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6869. -->
  6870. (S1 ^operator O1902 = 0.6854017956462798)
  6871. Firing prefer*rvt*predict-no*H0*4*H1
  6872. -->
  6873. Firing prefer*rvt*predict-yes*H0*3*H1
  6874. -->
  6875. Firing elaborate*copy-see-to-output-link
  6876. -->
  6877. (I3 ^see 0 +)
  6878. Firing elaborate*reward*based*on*reward
  6879. -->
  6880. (R955 ^value 1 +)
  6881. (R1 ^reward R955 +)
  6882. Firing propose*predict-yes
  6883. -->
  6884. (O1903 ^name predict-yes +)
  6885. (S1 ^operator O1903 +)
  6886. Firing propose*predict-no
  6887. -->
  6888. (O1904 ^name predict-no +)
  6889. (S1 ^operator O1904 +)
  6890. Firing rl*prefer*rvt*predict-no*H0*4
  6891. -->
  6892. (S1 ^operator O1902 = 0.3145080651024651)
  6893. Firing rl*prefer*rvt*predict-yes*H0*3
  6894. -->
  6895. (S1 ^operator O1901 = 0.3908143935841644)
  6896. Firing prefer*rvt*predict-yes*H0
  6897. -->
  6898. Firing prefer*rvt*predict-no*H0
  6899. -->
  6900. Firing elaborate*copy-dir-to-output-link
  6901. -->
  6902. (I3 ^dir L +)
  6903. inner elaboration loop at bottom goal.
  6904. Retracting elaborate*copy-see-to-output-link
  6905. -->
  6906. (I3 ^see 0 +)
  6907. Retracting propose*predict-no
  6908. -->
  6909. (O1902 ^name predict-no +)
  6910. (S1 ^operator O1902 +)
  6911. Retracting propose*predict-yes
  6912. -->
  6913. (O1901 ^name predict-yes +)
  6914. (S1 ^operator O1901 +)
  6915. Retracting elaborate*reward*based*on*reward
  6916. -->
  6917. (R954 ^value 1 +)
  6918. (R1 ^reward R954 +)
  6919. Retracting elaborate*copy-dir-to-output-link
  6920. -->
  6921. (I3 ^dir L +)
  6922. Retracting rl*prefer*rvt*predict-no*H0*4
  6923. -->
  6924. (S1 ^operator O1902 = 0.3145080651024651)
  6925. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6926. -->
  6927. (S1 ^operator O1902 = 0.6854017956462798)
  6928. Retracting rl*prefer*rvt*predict-yes*H0*3
  6929. -->
  6930. (S1 ^operator O1901 = 0.3908143935841644)
  6931. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6932. -->
  6933. (S1 ^operator O1901 = -0.208713043145708)
  6934. =>WM: (13347: S1 ^operator O1904 +)
  6935. =>WM: (13346: S1 ^operator O1903 +)
  6936. =>WM: (13345: O1904 ^name predict-no)
  6937. =>WM: (13344: O1903 ^name predict-yes)
  6938. =>WM: (13343: R955 ^value 1)
  6939. =>WM: (13342: R1 ^reward R955)
  6940. <=WM: (13333: S1 ^operator O1901 +)
  6941. <=WM: (13334: S1 ^operator O1902 +)
  6942. <=WM: (13335: S1 ^operator O1902)
  6943. <=WM: (13328: R1 ^reward R954)
  6944. <=WM: (13331: O1902 ^name predict-no)
  6945. <=WM: (13330: O1901 ^name predict-yes)
  6946. <=WM: (13329: R954 ^value 1)
  6947. --- Inner Elaboration Phase, active level 1 (S1) ---
  6948. Firing prefer*rvt*predict-yes*H0
  6949. -->
  6950. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  6951. -->
  6952. (S1 ^operator O1903 = -0.208713043145708)
  6953. Firing rl*prefer*rvt*predict-yes*H0*3
  6954. -->
  6955. (S1 ^operator O1903 = 0.3908143935841644)
  6956. Firing prefer*rvt*predict-yes*H0*3*H1
  6957. -->
  6958. Firing prefer*rvt*predict-no*H0
  6959. -->
  6960. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  6961. -->
  6962. (S1 ^operator O1904 = 0.6854017956462798)
  6963. Firing rl*prefer*rvt*predict-no*H0*4
  6964. -->
  6965. (S1 ^operator O1904 = 0.3145080651024651)
  6966. Firing prefer*rvt*predict-no*H0*4*H1
  6967. -->
  6968. inner elaboration loop at bottom goal.
  6969. Retracting rl*prefer*rvt*predict-no*H0*4
  6970. -->
  6971. (S1 ^operator O1902 = 0.3145080651024651)
  6972. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  6973. -->
  6974. (S1 ^operator O1902 = 0.6854017956462798)
  6975. Retracting rl*prefer*rvt*predict-yes*H0*3
  6976. -->
  6977. (S1 ^operator O1901 = 0.3908143935841644)
  6978. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  6979. -->
  6980. (S1 ^operator O1901 = -0.208713043145708)
  6981. --- END Proposal Phase ---
  6982. --- Decision Phase ---
  6983. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478563 -0.164047 0.314516(R,m,v=1,0.917808,0.0759565)
  6984. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521362 0.16404 0.685402 -> 0.52137 0.16404 0.685411(R,m,v=1,1,0)
  6985. =>WM: (13348: S1 ^operator O1904)
  6986. 952: O: O1904 (predict-no)
  6987. --- END Decision Phase ---
  6988. --- Application Phase ---
  6989. --- Firing Productions (PE) For State At Depth 1 ---
  6990. --- Inner Elaboration Phase, active level 1 (S1) ---
  6991. Firing apply*operator
  6992. -->
  6993. (I3 ^predict-no N952 + :O )
  6994. Firing apply*operator*complete
  6995. -->
  6996. (I3 ^predict-no N951 - :O )
  6997. inner elaboration loop at bottom goal.
  6998. --- Change Working Memory (PE) ---
  6999. =>WM: (13349: I3 ^predict-no N952)
  7000. <=WM: (13337: N951 ^status complete)
  7001. <=WM: (13336: I3 ^predict-no N951)
  7002. --- Firing Productions (IE) For State At Depth 1 ---
  7003. --- Inner Elaboration Phase, active level 1 (S1) ---
  7004. Firing monitor*world
  7005. -->
  7006. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7007. --- Change Working Memory (IE) ---
  7008. --- END Application Phase ---
  7009. --- Output Phase ---
  7010. ENV: Agent did: predict-no for direction L in state State-A
  7011. In State-A moving L
  7012. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7013. predict error 0
  7014. dir: dir isR
  7015. --- END Output Phase ---
  7016. \-/--- Input Phase ---
  7017. =>WM: (13353: I2 ^dir R)
  7018. =>WM: (13352: I2 ^reward 1)
  7019. =>WM: (13351: I2 ^see 0)
  7020. =>WM: (13350: N952 ^status complete)
  7021. <=WM: (13340: I2 ^dir L)
  7022. <=WM: (13339: I2 ^reward 1)
  7023. <=WM: (13338: I2 ^see 0)
  7024. =>WM: (13354: I2 ^level-1 L0-root)
  7025. <=WM: (13341: I2 ^level-1 L0-root)
  7026. --- END Input Phase ---
  7027. --- Proposal Phase ---
  7028. --- Inner Elaboration Phase, active level 1 (S1) ---
  7029. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7030. -->
  7031. (S1 ^operator O1903 = 0.8783877442642956)
  7032. Firing prefer*rvt*predict-yes*H0*5*H1
  7033. -->
  7034. Firing elaborate*copy-see-to-output-link
  7035. -->
  7036. (I3 ^see 0 +)
  7037. Firing elaborate*reward*based*on*reward
  7038. -->
  7039. (R956 ^value 1 +)
  7040. (R1 ^reward R956 +)
  7041. Firing propose*predict-yes
  7042. -->
  7043. (O1905 ^name predict-yes +)
  7044. (S1 ^operator O1905 +)
  7045. Firing propose*predict-no
  7046. -->
  7047. (O1906 ^name predict-no +)
  7048. (S1 ^operator O1906 +)
  7049. Firing rl*prefer*rvt*predict-no*H0*6
  7050. -->
  7051. (S1 ^operator O1904 = 0.999977424773942)
  7052. Firing rl*prefer*rvt*predict-yes*H0*5
  7053. -->
  7054. (S1 ^operator O1903 = 0.1215951465100475)
  7055. Firing prefer*rvt*predict-yes*H0
  7056. -->
  7057. Firing prefer*rvt*predict-no*H0
  7058. -->
  7059. Firing elaborate*copy-dir-to-output-link
  7060. -->
  7061. (I3 ^dir R +)
  7062. inner elaboration loop at bottom goal.
  7063. Retracting elaborate*copy-see-to-output-link
  7064. -->
  7065. (I3 ^see 0 +)
  7066. Retracting propose*predict-no
  7067. -->
  7068. (O1904 ^name predict-no +)
  7069. (S1 ^operator O1904 +)
  7070. Retracting propose*predict-yes
  7071. -->
  7072. (O1903 ^name predict-yes +)
  7073. (S1 ^operator O1903 +)
  7074. Retracting elaborate*reward*based*on*reward
  7075. -->
  7076. (R955 ^value 1 +)
  7077. (R1 ^reward R955 +)
  7078. Retracting elaborate*copy-dir-to-output-link
  7079. -->
  7080. (I3 ^dir L +)
  7081. Retracting rl*prefer*rvt*predict-no*H0*4
  7082. -->
  7083. (S1 ^operator O1904 = 0.3145155972863931)
  7084. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  7085. -->
  7086. (S1 ^operator O1904 = 0.6854105587116136)
  7087. Retracting rl*prefer*rvt*predict-yes*H0*3
  7088. -->
  7089. (S1 ^operator O1903 = 0.3908143935841644)
  7090. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  7091. -->
  7092. (S1 ^operator O1903 = -0.208713043145708)
  7093. =>WM: (13361: S1 ^operator O1906 +)
  7094. =>WM: (13360: S1 ^operator O1905 +)
  7095. =>WM: (13359: I3 ^dir R)
  7096. =>WM: (13358: O1906 ^name predict-no)
  7097. =>WM: (13357: O1905 ^name predict-yes)
  7098. =>WM: (13356: R956 ^value 1)
  7099. =>WM: (13355: R1 ^reward R956)
  7100. <=WM: (13346: S1 ^operator O1903 +)
  7101. <=WM: (13347: S1 ^operator O1904 +)
  7102. <=WM: (13348: S1 ^operator O1904)
  7103. <=WM: (13332: I3 ^dir L)
  7104. <=WM: (13342: R1 ^reward R955)
  7105. <=WM: (13345: O1904 ^name predict-no)
  7106. <=WM: (13344: O1903 ^name predict-yes)
  7107. <=WM: (13343: R955 ^value 1)
  7108. --- Inner Elaboration Phase, active level 1 (S1) ---
  7109. Firing prefer*rvt*predict-yes*H0
  7110. -->
  7111. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7112. -->
  7113. (S1 ^operator O1905 = 0.8783877442642956)
  7114. Firing rl*prefer*rvt*predict-yes*H0*5
  7115. -->
  7116. (S1 ^operator O1905 = 0.1215951465100475)
  7117. Firing prefer*rvt*predict-yes*H0*5*H1
  7118. -->
  7119. Firing prefer*rvt*predict-no*H0
  7120. -->
  7121. Firing rl*prefer*rvt*predict-no*H0*6
  7122. -->
  7123. (S1 ^operator O1906 = 0.999977424773942)
  7124. inner elaboration loop at bottom goal.
  7125. Retracting rl*prefer*rvt*predict-no*H0*6
  7126. -->
  7127. (S1 ^operator O1904 = 0.999977424773942)
  7128. Retracting rl*prefer*rvt*predict-yes*H0*5
  7129. -->
  7130. (S1 ^operator O1903 = 0.1215951465100475)
  7131. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7132. -->
  7133. (S1 ^operator O1903 = 0.8783877442642956)
  7134. --- END Proposal Phase ---
  7135. --- Decision Phase ---
  7136. RL update rl*prefer*rvt*predict-no*H0*4 0.478563 -0.164047 0.314516 -> 0.478568 -0.164047 0.314522(R,m,v=1,0.918367,0.0754822)
  7137. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.52137 0.16404 0.685411 -> 0.521377 0.164041 0.685418(R,m,v=1,1,0)
  7138. =>WM: (13362: S1 ^operator O1905)
  7139. 953: O: O1905 (predict-yes)
  7140. --- END Decision Phase ---
  7141. --- Application Phase ---
  7142. --- Firing Productions (PE) For State At Depth 1 ---
  7143. --- Inner Elaboration Phase, active level 1 (S1) ---
  7144. Firing apply*operator
  7145. -->
  7146. (I3 ^predict-yes N953 + :O )
  7147. Firing apply*operator*complete
  7148. -->
  7149. (I3 ^predict-no N952 - :O )
  7150. inner elaboration loop at bottom goal.
  7151. --- Change Working Memory (PE) ---
  7152. =>WM: (13363: I3 ^predict-yes N953)
  7153. <=WM: (13350: N952 ^status complete)
  7154. <=WM: (13349: I3 ^predict-no N952)
  7155. --- Firing Productions (IE) For State At Depth 1 ---
  7156. --- Inner Elaboration Phase, active level 1 (S1) ---
  7157. Firing monitor*world
  7158. -->
  7159. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7160. --- Change Working Memory (IE) ---
  7161. --- END Application Phase ---
  7162. --- Output Phase ---
  7163. ENV: Agent did: predict-yes for direction R in state State-A
  7164. In State-A moving R
  7165. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7166. predict error 0
  7167. dir: dir isU
  7168. --- END Output Phase ---
  7169. |\--- Input Phase ---
  7170. =>WM: (13367: I2 ^dir U)
  7171. =>WM: (13366: I2 ^reward 1)
  7172. =>WM: (13365: I2 ^see 1)
  7173. =>WM: (13364: N953 ^status complete)
  7174. <=WM: (13353: I2 ^dir R)
  7175. <=WM: (13352: I2 ^reward 1)
  7176. <=WM: (13351: I2 ^see 0)
  7177. =>WM: (13368: I2 ^level-1 R1-root)
  7178. <=WM: (13354: I2 ^level-1 L0-root)
  7179. --- END Input Phase ---
  7180. --- Proposal Phase ---
  7181. --- Inner Elaboration Phase, active level 1 (S1) ---
  7182. Firing elaborate*copy-see-to-output-link
  7183. -->
  7184. (I3 ^see 1 +)
  7185. Firing elaborate*reward*based*on*reward
  7186. -->
  7187. (R957 ^value 1 +)
  7188. (R1 ^reward R957 +)
  7189. Firing propose*predict-yes
  7190. -->
  7191. (O1907 ^name predict-yes +)
  7192. (S1 ^operator O1907 +)
  7193. Firing propose*predict-no
  7194. -->
  7195. (O1908 ^name predict-no +)
  7196. (S1 ^operator O1908 +)
  7197. Firing rl*prefer*rvt*predict-no*H0*2
  7198. -->
  7199. (S1 ^operator O1906 = 1.)
  7200. Firing rl*prefer*rvt*predict-yes*H0*1
  7201. -->
  7202. (S1 ^operator O1905 = 0.)
  7203. Firing prefer*rvt*predict-yes*H0
  7204. -->
  7205. Firing prefer*rvt*predict-no*H0
  7206. -->
  7207. Firing elaborate*copy-dir-to-output-link
  7208. -->
  7209. (I3 ^dir U +)
  7210. inner elaboration loop at bottom goal.
  7211. Retracting elaborate*copy-see-to-output-link
  7212. -->
  7213. (I3 ^see 0 +)
  7214. Retracting propose*predict-no
  7215. -->
  7216. (O1906 ^name predict-no +)
  7217. (S1 ^operator O1906 +)
  7218. Retracting propose*predict-yes
  7219. -->
  7220. (O1905 ^name predict-yes +)
  7221. (S1 ^operator O1905 +)
  7222. Retracting elaborate*reward*based*on*reward
  7223. -->
  7224. (R956 ^value 1 +)
  7225. (R1 ^reward R956 +)
  7226. Retracting elaborate*copy-dir-to-output-link
  7227. -->
  7228. (I3 ^dir R +)
  7229. Retracting rl*prefer*rvt*predict-no*H0*6
  7230. -->
  7231. (S1 ^operator O1906 = 0.999977424773942)
  7232. Retracting rl*prefer*rvt*predict-yes*H0*5
  7233. -->
  7234. (S1 ^operator O1905 = 0.1215951465100475)
  7235. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7236. -->
  7237. (S1 ^operator O1905 = 0.8783877442642956)
  7238. =>WM: (13376: S1 ^operator O1908 +)
  7239. =>WM: (13375: S1 ^operator O1907 +)
  7240. =>WM: (13374: I3 ^dir U)
  7241. =>WM: (13373: O1908 ^name predict-no)
  7242. =>WM: (13372: O1907 ^name predict-yes)
  7243. =>WM: (13371: R957 ^value 1)
  7244. =>WM: (13370: R1 ^reward R957)
  7245. =>WM: (13369: I3 ^see 1)
  7246. <=WM: (13360: S1 ^operator O1905 +)
  7247. <=WM: (13362: S1 ^operator O1905)
  7248. <=WM: (13361: S1 ^operator O1906 +)
  7249. <=WM: (13359: I3 ^dir R)
  7250. <=WM: (13355: R1 ^reward R956)
  7251. <=WM: (13272: I3 ^see 0)
  7252. <=WM: (13358: O1906 ^name predict-no)
  7253. <=WM: (13357: O1905 ^name predict-yes)
  7254. <=WM: (13356: R956 ^value 1)
  7255. --- Inner Elaboration Phase, active level 1 (S1) ---
  7256. Firing prefer*rvt*predict-yes*H0
  7257. -->
  7258. Firing rl*prefer*rvt*predict-yes*H0*1
  7259. -->
  7260. (S1 ^operator O1907 = 0.)
  7261. Firing prefer*rvt*predict-no*H0
  7262. -->
  7263. Firing rl*prefer*rvt*predict-no*H0*2
  7264. -->
  7265. (S1 ^operator O1908 = 1.)
  7266. inner elaboration loop at bottom goal.
  7267. Retracting rl*prefer*rvt*predict-no*H0*2
  7268. -->
  7269. (S1 ^operator O1906 = 1.)
  7270. Retracting rl*prefer*rvt*predict-yes*H0*1
  7271. -->
  7272. (S1 ^operator O1905 = 0.)
  7273. --- END Proposal Phase ---
  7274. --- Decision Phase ---
  7275. RL update rl*prefer*rvt*predict-yes*H0*5 0.534522 -0.412927 0.121595 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.857143,0.123182)
  7276. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465464 0.412924 0.878388 -> 0.465465 0.412924 0.878389(R,m,v=1,1,0)
  7277. =>WM: (13377: S1 ^operator O1908)
  7278. 954: O: O1908 (predict-no)
  7279. --- END Decision Phase ---
  7280. --- Application Phase ---
  7281. --- Firing Productions (PE) For State At Depth 1 ---
  7282. --- Inner Elaboration Phase, active level 1 (S1) ---
  7283. Firing apply*operator
  7284. -->
  7285. (I3 ^predict-no N954 + :O )
  7286. Firing apply*operator*complete
  7287. -->
  7288. (I3 ^predict-yes N953 - :O )
  7289. inner elaboration loop at bottom goal.
  7290. --- Change Working Memory (PE) ---
  7291. =>WM: (13378: I3 ^predict-no N954)
  7292. <=WM: (13364: N953 ^status complete)
  7293. <=WM: (13363: I3 ^predict-yes N953)
  7294. --- Firing Productions (IE) For State At Depth 1 ---
  7295. --- Inner Elaboration Phase, active level 1 (S1) ---
  7296. Firing monitor*world
  7297. -->
  7298. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7299. --- Change Working Memory (IE) ---
  7300. --- END Application Phase ---
  7301. --- Output Phase ---
  7302. ENV: Agent did: predict-no for direction U in state State-B
  7303. In State-B moving U
  7304. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  7305. predict error 0
  7306. dir: dir isL
  7307. --- END Output Phase ---
  7308. -/--- Input Phase ---
  7309. =>WM: (13382: I2 ^dir L)
  7310. =>WM: (13381: I2 ^reward 1)
  7311. =>WM: (13380: I2 ^see 0)
  7312. =>WM: (13379: N954 ^status complete)
  7313. <=WM: (13367: I2 ^dir U)
  7314. <=WM: (13366: I2 ^reward 1)
  7315. <=WM: (13365: I2 ^see 1)
  7316. =>WM: (13383: I2 ^level-1 R1-root)
  7317. <=WM: (13368: I2 ^level-1 R1-root)
  7318. --- END Input Phase ---
  7319. --- Proposal Phase ---
  7320. --- Inner Elaboration Phase, active level 1 (S1) ---
  7321. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  7322. -->
  7323. (S1 ^operator O1908 = -0.168718511744511)
  7324. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  7325. -->
  7326. (S1 ^operator O1907 = 0.6093893278107597)
  7327. Firing prefer*rvt*predict-no*H0*4*H1
  7328. -->
  7329. Firing prefer*rvt*predict-yes*H0*3*H1
  7330. -->
  7331. Firing elaborate*copy-see-to-output-link
  7332. -->
  7333. (I3 ^see 0 +)
  7334. Firing elaborate*reward*based*on*reward
  7335. -->
  7336. (R958 ^value 1 +)
  7337. (R1 ^reward R958 +)
  7338. Firing propose*predict-yes
  7339. -->
  7340. (O1909 ^name predict-yes +)
  7341. (S1 ^operator O1909 +)
  7342. Firing propose*predict-no
  7343. -->
  7344. (O1910 ^name predict-no +)
  7345. (S1 ^operator O1910 +)
  7346. Firing rl*prefer*rvt*predict-no*H0*4
  7347. -->
  7348. (S1 ^operator O1908 = 0.3145217607813431)
  7349. Firing rl*prefer*rvt*predict-yes*H0*3
  7350. -->
  7351. (S1 ^operator O1907 = 0.3908143935841644)
  7352. Firing prefer*rvt*predict-yes*H0
  7353. -->
  7354. Firing prefer*rvt*predict-no*H0
  7355. -->
  7356. Firing elaborate*copy-dir-to-output-link
  7357. -->
  7358. (I3 ^dir L +)
  7359. inner elaboration loop at bottom goal.
  7360. Retracting elaborate*copy-see-to-output-link
  7361. -->
  7362. (I3 ^see 1 +)
  7363. Retracting propose*predict-no
  7364. -->
  7365. (O1908 ^name predict-no +)
  7366. (S1 ^operator O1908 +)
  7367. Retracting propose*predict-yes
  7368. -->
  7369. (O1907 ^name predict-yes +)
  7370. (S1 ^operator O1907 +)
  7371. Retracting elaborate*reward*based*on*reward
  7372. -->
  7373. (R957 ^value 1 +)
  7374. (R1 ^reward R957 +)
  7375. Retracting elaborate*copy-dir-to-output-link
  7376. -->
  7377. (I3 ^dir U +)
  7378. Retracting rl*prefer*rvt*predict-no*H0*2
  7379. -->
  7380. (S1 ^operator O1908 = 1.)
  7381. Retracting rl*prefer*rvt*predict-yes*H0*1
  7382. -->
  7383. (S1 ^operator O1907 = 0.)
  7384. =>WM: (13391: S1 ^operator O1910 +)
  7385. =>WM: (13390: S1 ^operator O1909 +)
  7386. =>WM: (13389: I3 ^dir L)
  7387. =>WM: (13388: O1910 ^name predict-no)
  7388. =>WM: (13387: O1909 ^name predict-yes)
  7389. =>WM: (13386: R958 ^value 1)
  7390. =>WM: (13385: R1 ^reward R958)
  7391. =>WM: (13384: I3 ^see 0)
  7392. <=WM: (13375: S1 ^operator O1907 +)
  7393. <=WM: (13376: S1 ^operator O1908 +)
  7394. <=WM: (13377: S1 ^operator O1908)
  7395. <=WM: (13374: I3 ^dir U)
  7396. <=WM: (13370: R1 ^reward R957)
  7397. <=WM: (13369: I3 ^see 1)
  7398. <=WM: (13373: O1908 ^name predict-no)
  7399. <=WM: (13372: O1907 ^name predict-yes)
  7400. <=WM: (13371: R957 ^value 1)
  7401. --- Inner Elaboration Phase, active level 1 (S1) ---
  7402. Firing prefer*rvt*predict-yes*H0
  7403. -->
  7404. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  7405. -->
  7406. (S1 ^operator O1909 = 0.6093893278107597)
  7407. Firing rl*prefer*rvt*predict-yes*H0*3
  7408. -->
  7409. (S1 ^operator O1909 = 0.3908143935841644)
  7410. Firing prefer*rvt*predict-yes*H0*3*H1
  7411. -->
  7412. Firing prefer*rvt*predict-no*H0
  7413. -->
  7414. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  7415. -->
  7416. (S1 ^operator O1910 = -0.168718511744511)
  7417. Firing rl*prefer*rvt*predict-no*H0*4
  7418. -->
  7419. (S1 ^operator O1910 = 0.3145217607813431)
  7420. Firing prefer*rvt*predict-no*H0*4*H1
  7421. -->
  7422. inner elaboration loop at bottom goal.
  7423. Retracting rl*prefer*rvt*predict-no*H0*4
  7424. -->
  7425. (S1 ^operator O1908 = 0.3145217607813431)
  7426. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  7427. -->
  7428. (S1 ^operator O1908 = -0.168718511744511)
  7429. Retracting rl*prefer*rvt*predict-yes*H0*3
  7430. -->
  7431. (S1 ^operator O1907 = 0.3908143935841644)
  7432. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  7433. -->
  7434. (S1 ^operator O1907 = 0.6093893278107597)
  7435. --- END Proposal Phase ---
  7436. --- Decision Phase ---
  7437. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7438. =>WM: (13392: S1 ^operator O1909)
  7439. 955: O: O1909 (predict-yes)
  7440. --- END Decision Phase ---
  7441. --- Application Phase ---
  7442. --- Firing Productions (PE) For State At Depth 1 ---
  7443. --- Inner Elaboration Phase, active level 1 (S1) ---
  7444. Firing apply*operator
  7445. -->
  7446. (I3 ^predict-yes N955 + :O )
  7447. Firing apply*operator*complete
  7448. -->
  7449. (I3 ^predict-no N954 - :O )
  7450. inner elaboration loop at bottom goal.
  7451. --- Change Working Memory (PE) ---
  7452. =>WM: (13393: I3 ^predict-yes N955)
  7453. <=WM: (13379: N954 ^status complete)
  7454. <=WM: (13378: I3 ^predict-no N954)
  7455. --- Firing Productions (IE) For State At Depth 1 ---
  7456. --- Inner Elaboration Phase, active level 1 (S1) ---
  7457. Firing monitor*world
  7458. -->
  7459. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7460. --- Change Working Memory (IE) ---
  7461. --- END Application Phase ---
  7462. --- Output Phase ---
  7463. ENV: Agent did: predict-yes for direction L in state State-B
  7464. In State-B moving L
  7465. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  7466. predict error 0
  7467. dir: dir isU
  7468. --- END Output Phase ---
  7469. |\---- Input Phase ---
  7470. =>WM: (13397: I2 ^dir U)
  7471. =>WM: (13396: I2 ^reward 1)
  7472. =>WM: (13395: I2 ^see 1)
  7473. =>WM: (13394: N955 ^status complete)
  7474. <=WM: (13382: I2 ^dir L)
  7475. <=WM: (13381: I2 ^reward 1)
  7476. <=WM: (13380: I2 ^see 0)
  7477. =>WM: (13398: I2 ^level-1 L1-root)
  7478. <=WM: (13383: I2 ^level-1 R1-root)
  7479. --- END Input Phase ---
  7480. --- Proposal Phase ---
  7481. --- Inner Elaboration Phase, active level 1 (S1) ---
  7482. Firing elaborate*copy-see-to-output-link
  7483. -->
  7484. (I3 ^see 1 +)
  7485. Firing elaborate*reward*based*on*reward
  7486. -->
  7487. (R959 ^value 1 +)
  7488. (R1 ^reward R959 +)
  7489. Firing propose*predict-yes
  7490. -->
  7491. (O1911 ^name predict-yes +)
  7492. (S1 ^operator O1911 +)
  7493. Firing propose*predict-no
  7494. -->
  7495. (O1912 ^name predict-no +)
  7496. (S1 ^operator O1912 +)
  7497. Firing rl*prefer*rvt*predict-no*H0*2
  7498. -->
  7499. (S1 ^operator O1910 = 1.)
  7500. Firing rl*prefer*rvt*predict-yes*H0*1
  7501. -->
  7502. (S1 ^operator O1909 = 0.)
  7503. Firing prefer*rvt*predict-yes*H0
  7504. -->
  7505. Firing prefer*rvt*predict-no*H0
  7506. -->
  7507. Firing elaborate*copy-dir-to-output-link
  7508. -->
  7509. (I3 ^dir U +)
  7510. inner elaboration loop at bottom goal.
  7511. Retracting elaborate*copy-see-to-output-link
  7512. -->
  7513. (I3 ^see 0 +)
  7514. Retracting propose*predict-no
  7515. -->
  7516. (O1910 ^name predict-no +)
  7517. (S1 ^operator O1910 +)
  7518. Retracting propose*predict-yes
  7519. -->
  7520. (O1909 ^name predict-yes +)
  7521. (S1 ^operator O1909 +)
  7522. Retracting elaborate*reward*based*on*reward
  7523. -->
  7524. (R958 ^value 1 +)
  7525. (R1 ^reward R958 +)
  7526. Retracting elaborate*copy-dir-to-output-link
  7527. -->
  7528. (I3 ^dir L +)
  7529. Retracting rl*prefer*rvt*predict-no*H0*4
  7530. -->
  7531. (S1 ^operator O1910 = 0.3145217607813431)
  7532. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  7533. -->
  7534. (S1 ^operator O1910 = -0.168718511744511)
  7535. Retracting rl*prefer*rvt*predict-yes*H0*3
  7536. -->
  7537. (S1 ^operator O1909 = 0.3908143935841644)
  7538. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  7539. -->
  7540. (S1 ^operator O1909 = 0.6093893278107597)
  7541. =>WM: (13406: S1 ^operator O1912 +)
  7542. =>WM: (13405: S1 ^operator O1911 +)
  7543. =>WM: (13404: I3 ^dir U)
  7544. =>WM: (13403: O1912 ^name predict-no)
  7545. =>WM: (13402: O1911 ^name predict-yes)
  7546. =>WM: (13401: R959 ^value 1)
  7547. =>WM: (13400: R1 ^reward R959)
  7548. =>WM: (13399: I3 ^see 1)
  7549. <=WM: (13390: S1 ^operator O1909 +)
  7550. <=WM: (13392: S1 ^operator O1909)
  7551. <=WM: (13391: S1 ^operator O1910 +)
  7552. <=WM: (13389: I3 ^dir L)
  7553. <=WM: (13385: R1 ^reward R958)
  7554. <=WM: (13384: I3 ^see 0)
  7555. <=WM: (13388: O1910 ^name predict-no)
  7556. <=WM: (13387: O1909 ^name predict-yes)
  7557. <=WM: (13386: R958 ^value 1)
  7558. --- Inner Elaboration Phase, active level 1 (S1) ---
  7559. Firing prefer*rvt*predict-yes*H0
  7560. -->
  7561. Firing rl*prefer*rvt*predict-yes*H0*1
  7562. -->
  7563. (S1 ^operator O1911 = 0.)
  7564. Firing prefer*rvt*predict-no*H0
  7565. -->
  7566. Firing rl*prefer*rvt*predict-no*H0*2
  7567. -->
  7568. (S1 ^operator O1912 = 1.)
  7569. inner elaboration loop at bottom goal.
  7570. Retracting rl*prefer*rvt*predict-no*H0*2
  7571. -->
  7572. (S1 ^operator O1910 = 1.)
  7573. Retracting rl*prefer*rvt*predict-yes*H0*1
  7574. -->
  7575. (S1 ^operator O1909 = 0.)
  7576. --- END Proposal Phase ---
  7577. --- Decision Phase ---
  7578. RL update rl*prefer*rvt*predict-yes*H0*3 0.472355 -0.0815405 0.390814 -> 0.47234 -0.081543 0.390797(R,m,v=1,0.940789,0.0560735)
  7579. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527819 0.0815706 0.609389 -> 0.527802 0.0815677 0.60937(R,m,v=1,1,0)
  7580. =>WM: (13407: S1 ^operator O1912)
  7581. 956: O: O1912 (predict-no)
  7582. --- END Decision Phase ---
  7583. --- Application Phase ---
  7584. --- Firing Productions (PE) For State At Depth 1 ---
  7585. --- Inner Elaboration Phase, active level 1 (S1) ---
  7586. Firing apply*operator
  7587. -->
  7588. (I3 ^predict-no N956 + :O )
  7589. Firing apply*operator*complete
  7590. -->
  7591. (I3 ^predict-yes N955 - :O )
  7592. inner elaboration loop at bottom goal.
  7593. --- Change Working Memory (PE) ---
  7594. =>WM: (13408: I3 ^predict-no N956)
  7595. <=WM: (13394: N955 ^status complete)
  7596. <=WM: (13393: I3 ^predict-yes N955)
  7597. --- Firing Productions (IE) For State At Depth 1 ---
  7598. --- Inner Elaboration Phase, active level 1 (S1) ---
  7599. Firing monitor*world
  7600. -->
  7601. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7602. --- Change Working Memory (IE) ---
  7603. --- END Application Phase ---
  7604. --- Output Phase ---
  7605. ENV: Agent did: predict-no for direction U in state State-A
  7606. In State-A moving U
  7607. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7608. predict error 0
  7609. dir: dir isL
  7610. --- END Output Phase ---
  7611. /|\--- Input Phase ---
  7612. =>WM: (13412: I2 ^dir L)
  7613. =>WM: (13411: I2 ^reward 1)
  7614. =>WM: (13410: I2 ^see 0)
  7615. =>WM: (13409: N956 ^status complete)
  7616. <=WM: (13397: I2 ^dir U)
  7617. <=WM: (13396: I2 ^reward 1)
  7618. <=WM: (13395: I2 ^see 1)
  7619. =>WM: (13413: I2 ^level-1 L1-root)
  7620. <=WM: (13398: I2 ^level-1 L1-root)
  7621. --- END Input Phase ---
  7622. --- Proposal Phase ---
  7623. --- Inner Elaboration Phase, active level 1 (S1) ---
  7624. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  7625. -->
  7626. (S1 ^operator O1911 = -0.2062723012911647)
  7627. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  7628. -->
  7629. (S1 ^operator O1912 = 0.6855673437364445)
  7630. Firing prefer*rvt*predict-no*H0*4*H1
  7631. -->
  7632. Firing prefer*rvt*predict-yes*H0*3*H1
  7633. -->
  7634. Firing elaborate*copy-see-to-output-link
  7635. -->
  7636. (I3 ^see 0 +)
  7637. Firing elaborate*reward*based*on*reward
  7638. -->
  7639. (R960 ^value 1 +)
  7640. (R1 ^reward R960 +)
  7641. Firing propose*predict-yes
  7642. -->
  7643. (O1913 ^name predict-yes +)
  7644. (S1 ^operator O1913 +)
  7645. Firing propose*predict-no
  7646. -->
  7647. (O1914 ^name predict-no +)
  7648. (S1 ^operator O1914 +)
  7649. Firing rl*prefer*rvt*predict-no*H0*4
  7650. -->
  7651. (S1 ^operator O1912 = 0.3145217607813431)
  7652. Firing rl*prefer*rvt*predict-yes*H0*3
  7653. -->
  7654. (S1 ^operator O1911 = 0.3907974841024591)
  7655. Firing prefer*rvt*predict-yes*H0
  7656. -->
  7657. Firing prefer*rvt*predict-no*H0
  7658. -->
  7659. Firing elaborate*copy-dir-to-output-link
  7660. -->
  7661. (I3 ^dir L +)
  7662. inner elaboration loop at bottom goal.
  7663. Retracting elaborate*copy-see-to-output-link
  7664. -->
  7665. (I3 ^see 1 +)
  7666. Retracting propose*predict-no
  7667. -->
  7668. (O1912 ^name predict-no +)
  7669. (S1 ^operator O1912 +)
  7670. Retracting propose*predict-yes
  7671. -->
  7672. (O1911 ^name predict-yes +)
  7673. (S1 ^operator O1911 +)
  7674. Retracting elaborate*reward*based*on*reward
  7675. -->
  7676. (R959 ^value 1 +)
  7677. (R1 ^reward R959 +)
  7678. Retracting elaborate*copy-dir-to-output-link
  7679. -->
  7680. (I3 ^dir U +)
  7681. Retracting rl*prefer*rvt*predict-no*H0*2
  7682. -->
  7683. (S1 ^operator O1912 = 1.)
  7684. Retracting rl*prefer*rvt*predict-yes*H0*1
  7685. -->
  7686. (S1 ^operator O1911 = 0.)
  7687. =>WM: (13421: S1 ^operator O1914 +)
  7688. =>WM: (13420: S1 ^operator O1913 +)
  7689. =>WM: (13419: I3 ^dir L)
  7690. =>WM: (13418: O1914 ^name predict-no)
  7691. =>WM: (13417: O1913 ^name predict-yes)
  7692. =>WM: (13416: R960 ^value 1)
  7693. =>WM: (13415: R1 ^reward R960)
  7694. =>WM: (13414: I3 ^see 0)
  7695. <=WM: (13405: S1 ^operator O1911 +)
  7696. <=WM: (13406: S1 ^operator O1912 +)
  7697. <=WM: (13407: S1 ^operator O1912)
  7698. <=WM: (13404: I3 ^dir U)
  7699. <=WM: (13400: R1 ^reward R959)
  7700. <=WM: (13399: I3 ^see 1)
  7701. <=WM: (13403: O1912 ^name predict-no)
  7702. <=WM: (13402: O1911 ^name predict-yes)
  7703. <=WM: (13401: R959 ^value 1)
  7704. --- Inner Elaboration Phase, active level 1 (S1) ---
  7705. Firing prefer*rvt*predict-yes*H0
  7706. -->
  7707. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  7708. -->
  7709. (S1 ^operator O1913 = -0.2062723012911647)
  7710. Firing rl*prefer*rvt*predict-yes*H0*3
  7711. -->
  7712. (S1 ^operator O1913 = 0.3907974841024591)
  7713. Firing prefer*rvt*predict-yes*H0*3*H1
  7714. -->
  7715. Firing prefer*rvt*predict-no*H0
  7716. -->
  7717. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  7718. -->
  7719. (S1 ^operator O1914 = 0.6855673437364445)
  7720. Firing rl*prefer*rvt*predict-no*H0*4
  7721. -->
  7722. (S1 ^operator O1914 = 0.3145217607813431)
  7723. Firing prefer*rvt*predict-no*H0*4*H1
  7724. -->
  7725. inner elaboration loop at bottom goal.
  7726. Retracting rl*prefer*rvt*predict-no*H0*4
  7727. -->
  7728. (S1 ^operator O1912 = 0.3145217607813431)
  7729. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  7730. -->
  7731. (S1 ^operator O1912 = 0.6855673437364445)
  7732. Retracting rl*prefer*rvt*predict-yes*H0*3
  7733. -->
  7734. (S1 ^operator O1911 = 0.3907974841024591)
  7735. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  7736. -->
  7737. (S1 ^operator O1911 = -0.2062723012911647)
  7738. --- END Proposal Phase ---
  7739. --- Decision Phase ---
  7740. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  7741. =>WM: (13422: S1 ^operator O1914)
  7742. 957: O: O1914 (predict-no)
  7743. --- END Decision Phase ---
  7744. --- Application Phase ---
  7745. --- Firing Productions (PE) For State At Depth 1 ---
  7746. --- Inner Elaboration Phase, active level 1 (S1) ---
  7747. Firing apply*operator
  7748. -->
  7749. (I3 ^predict-no N957 + :O )
  7750. Firing apply*operator*complete
  7751. -->
  7752. (I3 ^predict-no N956 - :O )
  7753. inner elaboration loop at bottom goal.
  7754. --- Change Working Memory (PE) ---
  7755. =>WM: (13423: I3 ^predict-no N957)
  7756. <=WM: (13409: N956 ^status complete)
  7757. <=WM: (13408: I3 ^predict-no N956)
  7758. --- Firing Productions (IE) For State At Depth 1 ---
  7759. --- Inner Elaboration Phase, active level 1 (S1) ---
  7760. Firing monitor*world
  7761. -->
  7762. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  7763. --- Change Working Memory (IE) ---
  7764. --- END Application Phase ---
  7765. --- Output Phase ---
  7766. ENV: Agent did: predict-no for direction L in state State-A
  7767. In State-A moving L
  7768. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  7769. predict error 0
  7770. dir: dir isR
  7771. --- END Output Phase ---
  7772. --- Input Phase ---
  7773. =>WM: (13427: I2 ^dir R)
  7774. =>WM: (13426: I2 ^reward 1)
  7775. =>WM: (13425: I2 ^see 0)
  7776. =>WM: (13424: N957 ^status complete)
  7777. <=WM: (13412: I2 ^dir L)
  7778. <=WM: (13411: I2 ^reward 1)
  7779. <=WM: (13410: I2 ^see 0)
  7780. =>WM: (13428: I2 ^level-1 L0-root)
  7781. <=WM: (13413: I2 ^level-1 L1-root)
  7782. --- END Input Phase ---
  7783. --- Proposal Phase ---
  7784. --- Inner Elaboration Phase, active level 1 (S1) ---
  7785. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7786. -->
  7787. (S1 ^operator O1913 = 0.8783894024939338)
  7788. Firing prefer*rvt*predict-yes*H0*5*H1
  7789. -->
  7790. Firing elaborate*copy-see-to-output-link
  7791. -->
  7792. (I3 ^see 0 +)
  7793. Firing elaborate*reward*based*on*reward
  7794. -->
  7795. (R961 ^value 1 +)
  7796. (R1 ^reward R961 +)
  7797. Firing propose*predict-yes
  7798. -->
  7799. (O1915 ^name predict-yes +)
  7800. (S1 ^operator O1915 +)
  7801. Firing propose*predict-no
  7802. -->
  7803. (O1916 ^name predict-no +)
  7804. (S1 ^operator O1916 +)
  7805. Firing rl*prefer*rvt*predict-no*H0*6
  7806. -->
  7807. (S1 ^operator O1914 = 0.999977424773942)
  7808. Firing rl*prefer*rvt*predict-yes*H0*5
  7809. -->
  7810. (S1 ^operator O1913 = 0.1215965434178113)
  7811. Firing prefer*rvt*predict-yes*H0
  7812. -->
  7813. Firing prefer*rvt*predict-no*H0
  7814. -->
  7815. Firing elaborate*copy-dir-to-output-link
  7816. -->
  7817. (I3 ^dir R +)
  7818. inner elaboration loop at bottom goal.
  7819. Retracting elaborate*copy-see-to-output-link
  7820. -->
  7821. (I3 ^see 0 +)
  7822. Retracting propose*predict-no
  7823. -->
  7824. (O1914 ^name predict-no +)
  7825. (S1 ^operator O1914 +)
  7826. Retracting propose*predict-yes
  7827. -->
  7828. (O1913 ^name predict-yes +)
  7829. (S1 ^operator O1913 +)
  7830. Retracting elaborate*reward*based*on*reward
  7831. -->
  7832. (R960 ^value 1 +)
  7833. (R1 ^reward R960 +)
  7834. Retracting elaborate*copy-dir-to-output-link
  7835. -->
  7836. (I3 ^dir L +)
  7837. Retracting rl*prefer*rvt*predict-no*H0*4
  7838. -->
  7839. (S1 ^operator O1914 = 0.3145217607813431)
  7840. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  7841. -->
  7842. (S1 ^operator O1914 = 0.6855673437364445)
  7843. Retracting rl*prefer*rvt*predict-yes*H0*3
  7844. -->
  7845. (S1 ^operator O1913 = 0.3907974841024591)
  7846. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  7847. -->
  7848. (S1 ^operator O1913 = -0.2062723012911647)
  7849. =>WM: (13435: S1 ^operator O1916 +)
  7850. =>WM: (13434: S1 ^operator O1915 +)
  7851. =>WM: (13433: I3 ^dir R)
  7852. =>WM: (13432: O1916 ^name predict-no)
  7853. =>WM: (13431: O1915 ^name predict-yes)
  7854. =>WM: (13430: R961 ^value 1)
  7855. =>WM: (13429: R1 ^reward R961)
  7856. <=WM: (13420: S1 ^operator O1913 +)
  7857. <=WM: (13421: S1 ^operator O1914 +)
  7858. <=WM: (13422: S1 ^operator O1914)
  7859. <=WM: (13419: I3 ^dir L)
  7860. <=WM: (13415: R1 ^reward R960)
  7861. <=WM: (13418: O1914 ^name predict-no)
  7862. <=WM: (13417: O1913 ^name predict-yes)
  7863. <=WM: (13416: R960 ^value 1)
  7864. --- Inner Elaboration Phase, active level 1 (S1) ---
  7865. Firing prefer*rvt*predict-yes*H0
  7866. -->
  7867. Firing rl*prefer*rvt*predict-yes*H0*5
  7868. -->
  7869. (S1 ^operator O1915 = 0.1215965434178113)
  7870. Firing prefer*rvt*predict-yes*H0*5*H1
  7871. -->
  7872. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  7873. -->
  7874. (S1 ^operator O1915 = 0.8783894024939338)
  7875. Firing prefer*rvt*predict-no*H0
  7876. -->
  7877. Firing rl*prefer*rvt*predict-no*H0*6
  7878. -->
  7879. (S1 ^operator O1916 = 0.999977424773942)
  7880. inner elaboration loop at bottom goal.
  7881. Retracting rl*prefer*rvt*predict-no*H0*6
  7882. -->
  7883. (S1 ^operator O1914 = 0.999977424773942)
  7884. Retracting rl*prefer*rvt*predict-yes*H0*5
  7885. -->
  7886. (S1 ^operator O1913 = 0.1215965434178113)
  7887. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7888. -->
  7889. (S1 ^operator O1913 = 0.8783894024939338)
  7890. --- END Proposal Phase ---
  7891. --- Decision Phase ---
  7892. RL update rl*prefer*rvt*predict-no*H0*4 0.478568 -0.164047 0.314522 -> 0.478562 -0.164047 0.314514(R,m,v=1,0.918919,0.0750138)
  7893. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521513 0.164055 0.685567 -> 0.521505 0.164054 0.685559(R,m,v=1,1,0)
  7894. =>WM: (13436: S1 ^operator O1915)
  7895. 958: O: O1915 (predict-yes)
  7896. --- END Decision Phase ---
  7897. --- Application Phase ---
  7898. --- Firing Productions (PE) For State At Depth 1 ---
  7899. --- Inner Elaboration Phase, active level 1 (S1) ---
  7900. Firing apply*operator
  7901. -->
  7902. (I3 ^predict-yes N958 + :O )
  7903. Firing apply*operator*complete
  7904. -->
  7905. (I3 ^predict-no N957 - :O )
  7906. inner elaboration loop at bottom goal.
  7907. --- Change Working Memory (PE) ---
  7908. =>WM: (13437: I3 ^predict-yes N958)
  7909. <=WM: (13424: N957 ^status complete)
  7910. <=WM: (13423: I3 ^predict-no N957)
  7911. --- Firing Productions (IE) For State At Depth 1 ---
  7912. --- Inner Elaboration Phase, active level 1 (S1) ---
  7913. Firing monitor*world
  7914. -->
  7915. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  7916. --- Change Working Memory (IE) ---
  7917. --- END Application Phase ---
  7918. --- Output Phase ---
  7919. ENV: Agent did: predict-yes for direction R in state State-A
  7920. In State-A moving R
  7921. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  7922. predict error 0
  7923. dir: dir isR
  7924. --- END Output Phase ---
  7925. --- Input Phase ---
  7926. =>WM: (13441: I2 ^dir R)
  7927. =>WM: (13440: I2 ^reward 1)
  7928. =>WM: (13439: I2 ^see 1)
  7929. =>WM: (13438: N958 ^status complete)
  7930. <=WM: (13427: I2 ^dir R)
  7931. <=WM: (13426: I2 ^reward 1)
  7932. <=WM: (13425: I2 ^see 0)
  7933. =>WM: (13442: I2 ^level-1 R1-root)
  7934. <=WM: (13428: I2 ^level-1 L0-root)
  7935. --- END Input Phase ---
  7936. --- Proposal Phase ---
  7937. --- Inner Elaboration Phase, active level 1 (S1) ---
  7938. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  7939. -->
  7940. (S1 ^operator O1915 = -0.04253361215288998)
  7941. Firing prefer*rvt*predict-yes*H0*5*H1
  7942. -->
  7943. Firing elaborate*copy-see-to-output-link
  7944. -->
  7945. (I3 ^see 1 +)
  7946. Firing elaborate*reward*based*on*reward
  7947. -->
  7948. (R962 ^value 1 +)
  7949. (R1 ^reward R962 +)
  7950. Firing propose*predict-yes
  7951. -->
  7952. (O1917 ^name predict-yes +)
  7953. (S1 ^operator O1917 +)
  7954. Firing propose*predict-no
  7955. -->
  7956. (O1918 ^name predict-no +)
  7957. (S1 ^operator O1918 +)
  7958. Firing rl*prefer*rvt*predict-no*H0*6
  7959. -->
  7960. (S1 ^operator O1916 = 0.999977424773942)
  7961. Firing rl*prefer*rvt*predict-yes*H0*5
  7962. -->
  7963. (S1 ^operator O1915 = 0.1215965434178113)
  7964. Firing prefer*rvt*predict-yes*H0
  7965. -->
  7966. Firing prefer*rvt*predict-no*H0
  7967. -->
  7968. Firing elaborate*copy-dir-to-output-link
  7969. -->
  7970. (I3 ^dir R +)
  7971. inner elaboration loop at bottom goal.
  7972. Retracting elaborate*copy-see-to-output-link
  7973. -->
  7974. (I3 ^see 0 +)
  7975. Retracting propose*predict-no
  7976. -->
  7977. (O1916 ^name predict-no +)
  7978. (S1 ^operator O1916 +)
  7979. Retracting propose*predict-yes
  7980. -->
  7981. (O1915 ^name predict-yes +)
  7982. (S1 ^operator O1915 +)
  7983. Retracting elaborate*reward*based*on*reward
  7984. -->
  7985. (R961 ^value 1 +)
  7986. (R1 ^reward R961 +)
  7987. Retracting elaborate*copy-dir-to-output-link
  7988. -->
  7989. (I3 ^dir R +)
  7990. Retracting rl*prefer*rvt*predict-no*H0*6
  7991. -->
  7992. (S1 ^operator O1916 = 0.999977424773942)
  7993. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  7994. -->
  7995. (S1 ^operator O1915 = 0.8783894024939338)
  7996. Retracting rl*prefer*rvt*predict-yes*H0*5
  7997. -->
  7998. (S1 ^operator O1915 = 0.1215965434178113)
  7999. =>WM: (13449: S1 ^operator O1918 +)
  8000. =>WM: (13448: S1 ^operator O1917 +)
  8001. =>WM: (13447: O1918 ^name predict-no)
  8002. =>WM: (13446: O1917 ^name predict-yes)
  8003. =>WM: (13445: R962 ^value 1)
  8004. =>WM: (13444: R1 ^reward R962)
  8005. =>WM: (13443: I3 ^see 1)
  8006. <=WM: (13434: S1 ^operator O1915 +)
  8007. <=WM: (13436: S1 ^operator O1915)
  8008. <=WM: (13435: S1 ^operator O1916 +)
  8009. <=WM: (13429: R1 ^reward R961)
  8010. <=WM: (13414: I3 ^see 0)
  8011. <=WM: (13432: O1916 ^name predict-no)
  8012. <=WM: (13431: O1915 ^name predict-yes)
  8013. <=WM: (13430: R961 ^value 1)
  8014. --- Inner Elaboration Phase, active level 1 (S1) ---
  8015. Firing prefer*rvt*predict-yes*H0
  8016. -->
  8017. Firing rl*prefer*rvt*predict-yes*H0*5
  8018. -->
  8019. (S1 ^operator O1917 = 0.1215965434178113)
  8020. Firing prefer*rvt*predict-yes*H0*5*H1
  8021. -->
  8022. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  8023. -->
  8024. (S1 ^operator O1917 = -0.04253361215288998)
  8025. Firing prefer*rvt*predict-no*H0
  8026. -->
  8027. Firing rl*prefer*rvt*predict-no*H0*6
  8028. -->
  8029. (S1 ^operator O1918 = 0.999977424773942)
  8030. inner elaboration loop at bottom goal.
  8031. Retracting rl*prefer*rvt*predict-no*H0*6
  8032. -->
  8033. (S1 ^operator O1916 = 0.999977424773942)
  8034. Retracting rl*prefer*rvt*predict-yes*H0*5
  8035. -->
  8036. (S1 ^operator O1915 = 0.1215965434178113)
  8037. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  8038. -->
  8039. (S1 ^operator O1915 = -0.04253361215288998)
  8040. --- END Proposal Phase ---
  8041. --- Decision Phase ---
  8042. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.857988,0.12257)
  8043. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465465 0.412924 0.878389 -> 0.465467 0.412924 0.878391(R,m,v=1,1,0)
  8044. =>WM: (13450: S1 ^operator O1918)
  8045. 959: O: O1918 (predict-no)
  8046. --- END Decision Phase ---
  8047. --- Application Phase ---
  8048. --- Firing Productions (PE) For State At Depth 1 ---
  8049. --- Inner Elaboration Phase, active level 1 (S1) ---
  8050. Firing apply*operator
  8051. -->
  8052. (I3 ^predict-no N959 + :O )
  8053. Firing apply*operator*complete
  8054. -->
  8055. (I3 ^predict-yes N958 - :O )
  8056. inner elaboration loop at bottom goal.
  8057. --- Change Working Memory (PE) ---
  8058. =>WM: (13451: I3 ^predict-no N959)
  8059. <=WM: (13438: N958 ^status complete)
  8060. <=WM: (13437: I3 ^predict-yes N958)
  8061. --- Firing Productions (IE) For State At Depth 1 ---
  8062. --- Inner Elaboration Phase, active level 1 (S1) ---
  8063. Firing monitor*world
  8064. -->
  8065. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8066. --- Change Working Memory (IE) ---
  8067. --- END Application Phase ---
  8068. --- Output Phase ---
  8069. ENV: Agent did: predict-no for direction R in state State-B
  8070. In State-B moving R
  8071. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  8072. predict error 0
  8073. dir: dir isL
  8074. --- END Output Phase ---
  8075. --- Input Phase ---
  8076. =>WM: (13455: I2 ^dir L)
  8077. =>WM: (13454: I2 ^reward 1)
  8078. =>WM: (13453: I2 ^see 0)
  8079. =>WM: (13452: N959 ^status complete)
  8080. <=WM: (13441: I2 ^dir R)
  8081. <=WM: (13440: I2 ^reward 1)
  8082. <=WM: (13439: I2 ^see 1)
  8083. =>WM: (13456: I2 ^level-1 R0-root)
  8084. <=WM: (13442: I2 ^level-1 R1-root)
  8085. --- END Input Phase ---
  8086. --- Proposal Phase ---
  8087. --- Inner Elaboration Phase, active level 1 (S1) ---
  8088. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8089. -->
  8090. (S1 ^operator O1918 = -0.1984300550322165)
  8091. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8092. -->
  8093. (S1 ^operator O1917 = 0.6090773459257411)
  8094. Firing prefer*rvt*predict-no*H0*4*H1
  8095. -->
  8096. Firing prefer*rvt*predict-yes*H0*3*H1
  8097. -->
  8098. Firing elaborate*copy-see-to-output-link
  8099. -->
  8100. (I3 ^see 0 +)
  8101. Firing elaborate*reward*based*on*reward
  8102. -->
  8103. (R963 ^value 1 +)
  8104. (R1 ^reward R963 +)
  8105. Firing propose*predict-yes
  8106. -->
  8107. (O1919 ^name predict-yes +)
  8108. (S1 ^operator O1919 +)
  8109. Firing propose*predict-no
  8110. -->
  8111. (O1920 ^name predict-no +)
  8112. (S1 ^operator O1920 +)
  8113. Firing rl*prefer*rvt*predict-no*H0*4
  8114. -->
  8115. (S1 ^operator O1918 = 0.3145143319532709)
  8116. Firing rl*prefer*rvt*predict-yes*H0*3
  8117. -->
  8118. (S1 ^operator O1917 = 0.3907974841024591)
  8119. Firing prefer*rvt*predict-yes*H0
  8120. -->
  8121. Firing prefer*rvt*predict-no*H0
  8122. -->
  8123. Firing elaborate*copy-dir-to-output-link
  8124. -->
  8125. (I3 ^dir L +)
  8126. inner elaboration loop at bottom goal.
  8127. Retracting elaborate*copy-see-to-output-link
  8128. -->
  8129. (I3 ^see 1 +)
  8130. Retracting propose*predict-no
  8131. -->
  8132. (O1918 ^name predict-no +)
  8133. (S1 ^operator O1918 +)
  8134. Retracting propose*predict-yes
  8135. -->
  8136. (O1917 ^name predict-yes +)
  8137. (S1 ^operator O1917 +)
  8138. Retracting elaborate*reward*based*on*reward
  8139. -->
  8140. (R962 ^value 1 +)
  8141. (R1 ^reward R962 +)
  8142. Retracting elaborate*copy-dir-to-output-link
  8143. -->
  8144. (I3 ^dir R +)
  8145. Retracting rl*prefer*rvt*predict-no*H0*6
  8146. -->
  8147. (S1 ^operator O1918 = 0.999977424773942)
  8148. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  8149. -->
  8150. (S1 ^operator O1917 = -0.04253361215288998)
  8151. Retracting rl*prefer*rvt*predict-yes*H0*5
  8152. -->
  8153. (S1 ^operator O1917 = 0.121597689773478)
  8154. =>WM: (13464: S1 ^operator O1920 +)
  8155. =>WM: (13463: S1 ^operator O1919 +)
  8156. =>WM: (13462: I3 ^dir L)
  8157. =>WM: (13461: O1920 ^name predict-no)
  8158. =>WM: (13460: O1919 ^name predict-yes)
  8159. =>WM: (13459: R963 ^value 1)
  8160. =>WM: (13458: R1 ^reward R963)
  8161. =>WM: (13457: I3 ^see 0)
  8162. <=WM: (13448: S1 ^operator O1917 +)
  8163. <=WM: (13449: S1 ^operator O1918 +)
  8164. <=WM: (13450: S1 ^operator O1918)
  8165. <=WM: (13433: I3 ^dir R)
  8166. <=WM: (13444: R1 ^reward R962)
  8167. <=WM: (13443: I3 ^see 1)
  8168. <=WM: (13447: O1918 ^name predict-no)
  8169. <=WM: (13446: O1917 ^name predict-yes)
  8170. <=WM: (13445: R962 ^value 1)
  8171. --- Inner Elaboration Phase, active level 1 (S1) ---
  8172. Firing prefer*rvt*predict-yes*H0
  8173. -->
  8174. Firing rl*prefer*rvt*predict-yes*H0*3
  8175. -->
  8176. (S1 ^operator O1919 = 0.3907974841024591)
  8177. Firing prefer*rvt*predict-yes*H0*3*H1
  8178. -->
  8179. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  8180. -->
  8181. (S1 ^operator O1919 = 0.6090773459257411)
  8182. Firing prefer*rvt*predict-no*H0
  8183. -->
  8184. Firing rl*prefer*rvt*predict-no*H0*4
  8185. -->
  8186. (S1 ^operator O1920 = 0.3145143319532709)
  8187. Firing prefer*rvt*predict-no*H0*4*H1
  8188. -->
  8189. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  8190. -->
  8191. (S1 ^operator O1920 = -0.1984300550322165)
  8192. inner elaboration loop at bottom goal.
  8193. Retracting rl*prefer*rvt*predict-no*H0*4
  8194. -->
  8195. (S1 ^operator O1918 = 0.3145143319532709)
  8196. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8197. -->
  8198. (S1 ^operator O1918 = -0.1984300550322165)
  8199. Retracting rl*prefer*rvt*predict-yes*H0*3
  8200. -->
  8201. (S1 ^operator O1917 = 0.3907974841024591)
  8202. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8203. -->
  8204. (S1 ^operator O1917 = 0.6090773459257411)
  8205. --- END Proposal Phase ---
  8206. --- Decision Phase ---
  8207. RL update rl*prefer*rvt*predict-no*H0*6 0.999977 0 0.999977 -> 0.999981 0 0.999981(R,m,v=1,0.936782,0.0595641)
  8208. =>WM: (13465: S1 ^operator O1919)
  8209. 960: O: O1919 (predict-yes)
  8210. --- END Decision Phase ---
  8211. --- Application Phase ---
  8212. --- Firing Productions (PE) For State At Depth 1 ---
  8213. --- Inner Elaboration Phase, active level 1 (S1) ---
  8214. Firing apply*operator
  8215. -->
  8216. (I3 ^predict-yes N960 + :O )
  8217. Firing apply*operator*complete
  8218. -->
  8219. (I3 ^predict-no N959 - :O )
  8220. inner elaboration loop at bottom goal.
  8221. --- Change Working Memory (PE) ---
  8222. =>WM: (13466: I3 ^predict-yes N960)
  8223. <=WM: (13452: N959 ^status complete)
  8224. <=WM: (13451: I3 ^predict-no N959)
  8225. --- Firing Productions (IE) For State At Depth 1 ---
  8226. --- Inner Elaboration Phase, active level 1 (S1) ---
  8227. Firing monitor*world
  8228. -->
  8229. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8230. --- Change Working Memory (IE) ---
  8231. --- END Application Phase ---
  8232. --- Output Phase ---
  8233. ENV: Agent did: predict-yes for direction L in state State-B
  8234. In State-B moving L
  8235. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  8236. predict error 0
  8237. dir: dir isU
  8238. --- END Output Phase ---
  8239. --- Input Phase ---
  8240. =>WM: (13470: I2 ^dir U)
  8241. =>WM: (13469: I2 ^reward 1)
  8242. =>WM: (13468: I2 ^see 1)
  8243. =>WM: (13467: N960 ^status complete)
  8244. <=WM: (13455: I2 ^dir L)
  8245. <=WM: (13454: I2 ^reward 1)
  8246. <=WM: (13453: I2 ^see 0)
  8247. =>WM: (13471: I2 ^level-1 L1-root)
  8248. <=WM: (13456: I2 ^level-1 R0-root)
  8249. --- END Input Phase ---
  8250. --- Proposal Phase ---
  8251. --- Inner Elaboration Phase, active level 1 (S1) ---
  8252. Firing elaborate*copy-see-to-output-link
  8253. -->
  8254. (I3 ^see 1 +)
  8255. Firing elaborate*reward*based*on*reward
  8256. -->
  8257. (R964 ^value 1 +)
  8258. (R1 ^reward R964 +)
  8259. Firing propose*predict-yes
  8260. -->
  8261. (O1921 ^name predict-yes +)
  8262. (S1 ^operator O1921 +)
  8263. Firing propose*predict-no
  8264. -->
  8265. (O1922 ^name predict-no +)
  8266. (S1 ^operator O1922 +)
  8267. Firing rl*prefer*rvt*predict-no*H0*2
  8268. -->
  8269. (S1 ^operator O1920 = 1.)
  8270. Firing rl*prefer*rvt*predict-yes*H0*1
  8271. -->
  8272. (S1 ^operator O1919 = 0.)
  8273. Firing prefer*rvt*predict-yes*H0
  8274. -->
  8275. Firing prefer*rvt*predict-no*H0
  8276. -->
  8277. Firing elaborate*copy-dir-to-output-link
  8278. -->
  8279. (I3 ^dir U +)
  8280. inner elaboration loop at bottom goal.
  8281. Retracting elaborate*copy-see-to-output-link
  8282. -->
  8283. (I3 ^see 0 +)
  8284. Retracting propose*predict-no
  8285. -->
  8286. (O1920 ^name predict-no +)
  8287. (S1 ^operator O1920 +)
  8288. Retracting propose*predict-yes
  8289. -->
  8290. (O1919 ^name predict-yes +)
  8291. (S1 ^operator O1919 +)
  8292. Retracting elaborate*reward*based*on*reward
  8293. -->
  8294. (R963 ^value 1 +)
  8295. (R1 ^reward R963 +)
  8296. Retracting elaborate*copy-dir-to-output-link
  8297. -->
  8298. (I3 ^dir L +)
  8299. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  8300. -->
  8301. (S1 ^operator O1920 = -0.1984300550322165)
  8302. Retracting rl*prefer*rvt*predict-no*H0*4
  8303. -->
  8304. (S1 ^operator O1920 = 0.3145143319532709)
  8305. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  8306. -->
  8307. (S1 ^operator O1919 = 0.6090773459257411)
  8308. Retracting rl*prefer*rvt*predict-yes*H0*3
  8309. -->
  8310. (S1 ^operator O1919 = 0.3907974841024591)
  8311. =>WM: (13479: S1 ^operator O1922 +)
  8312. =>WM: (13478: S1 ^operator O1921 +)
  8313. =>WM: (13477: I3 ^dir U)
  8314. =>WM: (13476: O1922 ^name predict-no)
  8315. =>WM: (13475: O1921 ^name predict-yes)
  8316. =>WM: (13474: R964 ^value 1)
  8317. =>WM: (13473: R1 ^reward R964)
  8318. =>WM: (13472: I3 ^see 1)
  8319. <=WM: (13463: S1 ^operator O1919 +)
  8320. <=WM: (13465: S1 ^operator O1919)
  8321. <=WM: (13464: S1 ^operator O1920 +)
  8322. <=WM: (13462: I3 ^dir L)
  8323. <=WM: (13458: R1 ^reward R963)
  8324. <=WM: (13457: I3 ^see 0)
  8325. <=WM: (13461: O1920 ^name predict-no)
  8326. <=WM: (13460: O1919 ^name predict-yes)
  8327. <=WM: (13459: R963 ^value 1)
  8328. --- Inner Elaboration Phase, active level 1 (S1) ---
  8329. Firing prefer*rvt*predict-yes*H0
  8330. -->
  8331. Firing rl*prefer*rvt*predict-yes*H0*1
  8332. -->
  8333. (S1 ^operator O1921 = 0.)
  8334. Firing prefer*rvt*predict-no*H0
  8335. -->
  8336. Firing rl*prefer*rvt*predict-no*H0*2
  8337. -->
  8338. (S1 ^operator O1922 = 1.)
  8339. inner elaboration loop at bottom goal.
  8340. Retracting rl*prefer*rvt*predict-no*H0*2
  8341. -->
  8342. (S1 ^operator O1920 = 1.)
  8343. Retracting rl*prefer*rvt*predict-yes*H0*1
  8344. -->
  8345. (S1 ^operator O1919 = 0.)
  8346. --- END Proposal Phase ---
  8347. --- Decision Phase ---
  8348. RL update rl*prefer*rvt*predict-yes*H0*3 0.47234 -0.081543 0.390797 -> 0.472349 -0.0815415 0.390808(R,m,v=1,0.941176,0.0557276)
  8349. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527553 0.0815245 0.609077 -> 0.527563 0.0815262 0.609089(R,m,v=1,1,0)
  8350. =>WM: (13480: S1 ^operator O1922)
  8351. 961: O: O1922 (predict-no)
  8352. --- END Decision Phase ---
  8353. --- Application Phase ---
  8354. --- Firing Productions (PE) For State At Depth 1 ---
  8355. --- Inner Elaboration Phase, active level 1 (S1) ---
  8356. Firing apply*operator
  8357. -->
  8358. (I3 ^predict-no N961 + :O )
  8359. Firing apply*operator*complete
  8360. -->
  8361. (I3 ^predict-yes N960 - :O )
  8362. inner elaboration loop at bottom goal.
  8363. --- Change Working Memory (PE) ---
  8364. =>WM: (13481: I3 ^predict-no N961)
  8365. <=WM: (13467: N960 ^status complete)
  8366. <=WM: (13466: I3 ^predict-yes N960)
  8367. --- Firing Productions (IE) For State At Depth 1 ---
  8368. --- Inner Elaboration Phase, active level 1 (S1) ---
  8369. Firing monitor*world
  8370. -->
  8371. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8372. --- Change Working Memory (IE) ---
  8373. --- END Application Phase ---
  8374. --- Output Phase ---
  8375. ENV: Agent did: predict-no for direction U in state State-A
  8376. In State-A moving U
  8377. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8378. predict error 0
  8379. dir: dir isL
  8380. --- END Output Phase ---
  8381. --- Input Phase ---
  8382. =>WM: (13485: I2 ^dir L)
  8383. =>WM: (13484: I2 ^reward 1)
  8384. =>WM: (13483: I2 ^see 0)
  8385. =>WM: (13482: N961 ^status complete)
  8386. <=WM: (13470: I2 ^dir U)
  8387. <=WM: (13469: I2 ^reward 1)
  8388. <=WM: (13468: I2 ^see 1)
  8389. =>WM: (13486: I2 ^level-1 L1-root)
  8390. <=WM: (13471: I2 ^level-1 L1-root)
  8391. --- END Input Phase ---
  8392. --- Proposal Phase ---
  8393. --- Inner Elaboration Phase, active level 1 (S1) ---
  8394. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  8395. -->
  8396. (S1 ^operator O1921 = -0.2062723012911647)
  8397. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  8398. -->
  8399. (S1 ^operator O1922 = 0.685558831823503)
  8400. Firing prefer*rvt*predict-no*H0*4*H1
  8401. -->
  8402. Firing prefer*rvt*predict-yes*H0*3*H1
  8403. -->
  8404. Firing elaborate*copy-see-to-output-link
  8405. -->
  8406. (I3 ^see 0 +)
  8407. Firing elaborate*reward*based*on*reward
  8408. -->
  8409. (R965 ^value 1 +)
  8410. (R1 ^reward R965 +)
  8411. Firing propose*predict-yes
  8412. -->
  8413. (O1923 ^name predict-yes +)
  8414. (S1 ^operator O1923 +)
  8415. Firing propose*predict-no
  8416. -->
  8417. (O1924 ^name predict-no +)
  8418. (S1 ^operator O1924 +)
  8419. Firing rl*prefer*rvt*predict-no*H0*4
  8420. -->
  8421. (S1 ^operator O1922 = 0.3145143319532709)
  8422. Firing rl*prefer*rvt*predict-yes*H0*3
  8423. -->
  8424. (S1 ^operator O1921 = 0.390807862285058)
  8425. Firing prefer*rvt*predict-yes*H0
  8426. -->
  8427. Firing prefer*rvt*predict-no*H0
  8428. -->
  8429. Firing elaborate*copy-dir-to-output-link
  8430. -->
  8431. (I3 ^dir L +)
  8432. inner elaboration loop at bottom goal.
  8433. Retracting elaborate*copy-see-to-output-link
  8434. -->
  8435. (I3 ^see 1 +)
  8436. Retracting propose*predict-no
  8437. -->
  8438. (O1922 ^name predict-no +)
  8439. (S1 ^operator O1922 +)
  8440. Retracting propose*predict-yes
  8441. -->
  8442. (O1921 ^name predict-yes +)
  8443. (S1 ^operator O1921 +)
  8444. Retracting elaborate*reward*based*on*reward
  8445. -->
  8446. (R964 ^value 1 +)
  8447. (R1 ^reward R964 +)
  8448. Retracting elaborate*copy-dir-to-output-link
  8449. -->
  8450. (I3 ^dir U +)
  8451. Retracting rl*prefer*rvt*predict-no*H0*2
  8452. -->
  8453. (S1 ^operator O1922 = 1.)
  8454. Retracting rl*prefer*rvt*predict-yes*H0*1
  8455. -->
  8456. (S1 ^operator O1921 = 0.)
  8457. =>WM: (13494: S1 ^operator O1924 +)
  8458. =>WM: (13493: S1 ^operator O1923 +)
  8459. =>WM: (13492: I3 ^dir L)
  8460. =>WM: (13491: O1924 ^name predict-no)
  8461. =>WM: (13490: O1923 ^name predict-yes)
  8462. =>WM: (13489: R965 ^value 1)
  8463. =>WM: (13488: R1 ^reward R965)
  8464. =>WM: (13487: I3 ^see 0)
  8465. <=WM: (13478: S1 ^operator O1921 +)
  8466. <=WM: (13479: S1 ^operator O1922 +)
  8467. <=WM: (13480: S1 ^operator O1922)
  8468. <=WM: (13477: I3 ^dir U)
  8469. <=WM: (13473: R1 ^reward R964)
  8470. <=WM: (13472: I3 ^see 1)
  8471. <=WM: (13476: O1922 ^name predict-no)
  8472. <=WM: (13475: O1921 ^name predict-yes)
  8473. <=WM: (13474: R964 ^value 1)
  8474. --- Inner Elaboration Phase, active level 1 (S1) ---
  8475. Firing prefer*rvt*predict-yes*H0
  8476. -->
  8477. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  8478. -->
  8479. (S1 ^operator O1923 = -0.2062723012911647)
  8480. Firing rl*prefer*rvt*predict-yes*H0*3
  8481. -->
  8482. (S1 ^operator O1923 = 0.390807862285058)
  8483. Firing prefer*rvt*predict-yes*H0*3*H1
  8484. -->
  8485. Firing prefer*rvt*predict-no*H0
  8486. -->
  8487. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  8488. -->
  8489. (S1 ^operator O1924 = 0.685558831823503)
  8490. Firing rl*prefer*rvt*predict-no*H0*4
  8491. -->
  8492. (S1 ^operator O1924 = 0.3145143319532709)
  8493. Firing prefer*rvt*predict-no*H0*4*H1
  8494. -->
  8495. inner elaboration loop at bottom goal.
  8496. Retracting rl*prefer*rvt*predict-no*H0*4
  8497. -->
  8498. (S1 ^operator O1922 = 0.3145143319532709)
  8499. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  8500. -->
  8501. (S1 ^operator O1922 = 0.685558831823503)
  8502. Retracting rl*prefer*rvt*predict-yes*H0*3
  8503. -->
  8504. (S1 ^operator O1921 = 0.390807862285058)
  8505. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  8506. -->
  8507. (S1 ^operator O1921 = -0.2062723012911647)
  8508. --- END Proposal Phase ---
  8509. --- Decision Phase ---
  8510. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8511. =>WM: (13495: S1 ^operator O1924)
  8512. 962: O: O1924 (predict-no)
  8513. --- END Decision Phase ---
  8514. --- Application Phase ---
  8515. --- Firing Productions (PE) For State At Depth 1 ---
  8516. --- Inner Elaboration Phase, active level 1 (S1) ---
  8517. Firing apply*operator
  8518. -->
  8519. (I3 ^predict-no N962 + :O )
  8520. Firing apply*operator*complete
  8521. -->
  8522. (I3 ^predict-no N961 - :O )
  8523. inner elaboration loop at bottom goal.
  8524. --- Change Working Memory (PE) ---
  8525. =>WM: (13496: I3 ^predict-no N962)
  8526. <=WM: (13482: N961 ^status complete)
  8527. <=WM: (13481: I3 ^predict-no N961)
  8528. --- Firing Productions (IE) For State At Depth 1 ---
  8529. --- Inner Elaboration Phase, active level 1 (S1) ---
  8530. Firing monitor*world
  8531. -->
  8532. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8533. --- Change Working Memory (IE) ---
  8534. --- END Application Phase ---
  8535. --- Output Phase ---
  8536. ENV: Agent did: predict-no for direction L in state State-A
  8537. In State-A moving L
  8538. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8539. predict error 0
  8540. dir: dir isU
  8541. --- END Output Phase ---
  8542. /|\--- Input Phase ---
  8543. =>WM: (13500: I2 ^dir U)
  8544. =>WM: (13499: I2 ^reward 1)
  8545. =>WM: (13498: I2 ^see 0)
  8546. =>WM: (13497: N962 ^status complete)
  8547. <=WM: (13485: I2 ^dir L)
  8548. <=WM: (13484: I2 ^reward 1)
  8549. <=WM: (13483: I2 ^see 0)
  8550. =>WM: (13501: I2 ^level-1 L0-root)
  8551. <=WM: (13486: I2 ^level-1 L1-root)
  8552. --- END Input Phase ---
  8553. --- Proposal Phase ---
  8554. --- Inner Elaboration Phase, active level 1 (S1) ---
  8555. Firing elaborate*copy-see-to-output-link
  8556. -->
  8557. (I3 ^see 0 +)
  8558. Firing elaborate*reward*based*on*reward
  8559. -->
  8560. (R966 ^value 1 +)
  8561. (R1 ^reward R966 +)
  8562. Firing propose*predict-yes
  8563. -->
  8564. (O1925 ^name predict-yes +)
  8565. (S1 ^operator O1925 +)
  8566. Firing propose*predict-no
  8567. -->
  8568. (O1926 ^name predict-no +)
  8569. (S1 ^operator O1926 +)
  8570. Firing rl*prefer*rvt*predict-no*H0*2
  8571. -->
  8572. (S1 ^operator O1924 = 1.)
  8573. Firing rl*prefer*rvt*predict-yes*H0*1
  8574. -->
  8575. (S1 ^operator O1923 = 0.)
  8576. Firing prefer*rvt*predict-yes*H0
  8577. -->
  8578. Firing prefer*rvt*predict-no*H0
  8579. -->
  8580. Firing elaborate*copy-dir-to-output-link
  8581. -->
  8582. (I3 ^dir U +)
  8583. inner elaboration loop at bottom goal.
  8584. Retracting elaborate*copy-see-to-output-link
  8585. -->
  8586. (I3 ^see 0 +)
  8587. Retracting propose*predict-no
  8588. -->
  8589. (O1924 ^name predict-no +)
  8590. (S1 ^operator O1924 +)
  8591. Retracting propose*predict-yes
  8592. -->
  8593. (O1923 ^name predict-yes +)
  8594. (S1 ^operator O1923 +)
  8595. Retracting elaborate*reward*based*on*reward
  8596. -->
  8597. (R965 ^value 1 +)
  8598. (R1 ^reward R965 +)
  8599. Retracting elaborate*copy-dir-to-output-link
  8600. -->
  8601. (I3 ^dir L +)
  8602. Retracting rl*prefer*rvt*predict-no*H0*4
  8603. -->
  8604. (S1 ^operator O1924 = 0.3145143319532709)
  8605. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  8606. -->
  8607. (S1 ^operator O1924 = 0.685558831823503)
  8608. Retracting rl*prefer*rvt*predict-yes*H0*3
  8609. -->
  8610. (S1 ^operator O1923 = 0.390807862285058)
  8611. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  8612. -->
  8613. (S1 ^operator O1923 = -0.2062723012911647)
  8614. =>WM: (13508: S1 ^operator O1926 +)
  8615. =>WM: (13507: S1 ^operator O1925 +)
  8616. =>WM: (13506: I3 ^dir U)
  8617. =>WM: (13505: O1926 ^name predict-no)
  8618. =>WM: (13504: O1925 ^name predict-yes)
  8619. =>WM: (13503: R966 ^value 1)
  8620. =>WM: (13502: R1 ^reward R966)
  8621. <=WM: (13493: S1 ^operator O1923 +)
  8622. <=WM: (13494: S1 ^operator O1924 +)
  8623. <=WM: (13495: S1 ^operator O1924)
  8624. <=WM: (13492: I3 ^dir L)
  8625. <=WM: (13488: R1 ^reward R965)
  8626. <=WM: (13491: O1924 ^name predict-no)
  8627. <=WM: (13490: O1923 ^name predict-yes)
  8628. <=WM: (13489: R965 ^value 1)
  8629. --- Inner Elaboration Phase, active level 1 (S1) ---
  8630. Firing prefer*rvt*predict-yes*H0
  8631. -->
  8632. Firing rl*prefer*rvt*predict-yes*H0*1
  8633. -->
  8634. (S1 ^operator O1925 = 0.)
  8635. Firing prefer*rvt*predict-no*H0
  8636. -->
  8637. Firing rl*prefer*rvt*predict-no*H0*2
  8638. -->
  8639. (S1 ^operator O1926 = 1.)
  8640. inner elaboration loop at bottom goal.
  8641. Retracting rl*prefer*rvt*predict-no*H0*2
  8642. -->
  8643. (S1 ^operator O1924 = 1.)
  8644. Retracting rl*prefer*rvt*predict-yes*H0*1
  8645. -->
  8646. (S1 ^operator O1923 = 0.)
  8647. --- END Proposal Phase ---
  8648. --- Decision Phase ---
  8649. RL update rl*prefer*rvt*predict-no*H0*4 0.478562 -0.164047 0.314514 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.919463,0.0745511)
  8650. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521505 0.164054 0.685559 -> 0.521498 0.164053 0.685552(R,m,v=1,1,0)
  8651. =>WM: (13509: S1 ^operator O1926)
  8652. 963: O: O1926 (predict-no)
  8653. --- END Decision Phase ---
  8654. --- Application Phase ---
  8655. --- Firing Productions (PE) For State At Depth 1 ---
  8656. --- Inner Elaboration Phase, active level 1 (S1) ---
  8657. Firing apply*operator
  8658. -->
  8659. (I3 ^predict-no N963 + :O )
  8660. Firing apply*operator*complete
  8661. -->
  8662. (I3 ^predict-no N962 - :O )
  8663. inner elaboration loop at bottom goal.
  8664. --- Change Working Memory (PE) ---
  8665. =>WM: (13510: I3 ^predict-no N963)
  8666. <=WM: (13497: N962 ^status complete)
  8667. <=WM: (13496: I3 ^predict-no N962)
  8668. --- Firing Productions (IE) For State At Depth 1 ---
  8669. --- Inner Elaboration Phase, active level 1 (S1) ---
  8670. Firing monitor*world
  8671. -->
  8672. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8673. --- Change Working Memory (IE) ---
  8674. --- END Application Phase ---
  8675. --- Output Phase ---
  8676. ENV: Agent did: predict-no for direction U in state State-A
  8677. In State-A moving U
  8678. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8679. predict error 0
  8680. dir: dir isU
  8681. --- END Output Phase ---
  8682. -/--- Input Phase ---
  8683. =>WM: (13514: I2 ^dir U)
  8684. =>WM: (13513: I2 ^reward 1)
  8685. =>WM: (13512: I2 ^see 0)
  8686. =>WM: (13511: N963 ^status complete)
  8687. <=WM: (13500: I2 ^dir U)
  8688. <=WM: (13499: I2 ^reward 1)
  8689. <=WM: (13498: I2 ^see 0)
  8690. =>WM: (13515: I2 ^level-1 L0-root)
  8691. <=WM: (13501: I2 ^level-1 L0-root)
  8692. --- END Input Phase ---
  8693. --- Proposal Phase ---
  8694. --- Inner Elaboration Phase, active level 1 (S1) ---
  8695. Firing elaborate*copy-see-to-output-link
  8696. -->
  8697. (I3 ^see 0 +)
  8698. Firing elaborate*reward*based*on*reward
  8699. -->
  8700. (R967 ^value 1 +)
  8701. (R1 ^reward R967 +)
  8702. Firing propose*predict-yes
  8703. -->
  8704. (O1927 ^name predict-yes +)
  8705. (S1 ^operator O1927 +)
  8706. Firing propose*predict-no
  8707. -->
  8708. (O1928 ^name predict-no +)
  8709. (S1 ^operator O1928 +)
  8710. Firing rl*prefer*rvt*predict-no*H0*2
  8711. -->
  8712. (S1 ^operator O1926 = 1.)
  8713. Firing rl*prefer*rvt*predict-yes*H0*1
  8714. -->
  8715. (S1 ^operator O1925 = 0.)
  8716. Firing prefer*rvt*predict-yes*H0
  8717. -->
  8718. Firing prefer*rvt*predict-no*H0
  8719. -->
  8720. Firing elaborate*copy-dir-to-output-link
  8721. -->
  8722. (I3 ^dir U +)
  8723. inner elaboration loop at bottom goal.
  8724. Retracting elaborate*copy-see-to-output-link
  8725. -->
  8726. (I3 ^see 0 +)
  8727. Retracting propose*predict-no
  8728. -->
  8729. (O1926 ^name predict-no +)
  8730. (S1 ^operator O1926 +)
  8731. Retracting propose*predict-yes
  8732. -->
  8733. (O1925 ^name predict-yes +)
  8734. (S1 ^operator O1925 +)
  8735. Retracting elaborate*reward*based*on*reward
  8736. -->
  8737. (R966 ^value 1 +)
  8738. (R1 ^reward R966 +)
  8739. Retracting elaborate*copy-dir-to-output-link
  8740. -->
  8741. (I3 ^dir U +)
  8742. Retracting rl*prefer*rvt*predict-no*H0*2
  8743. -->
  8744. (S1 ^operator O1926 = 1.)
  8745. Retracting rl*prefer*rvt*predict-yes*H0*1
  8746. -->
  8747. (S1 ^operator O1925 = 0.)
  8748. =>WM: (13521: S1 ^operator O1928 +)
  8749. =>WM: (13520: S1 ^operator O1927 +)
  8750. =>WM: (13519: O1928 ^name predict-no)
  8751. =>WM: (13518: O1927 ^name predict-yes)
  8752. =>WM: (13517: R967 ^value 1)
  8753. =>WM: (13516: R1 ^reward R967)
  8754. <=WM: (13507: S1 ^operator O1925 +)
  8755. <=WM: (13508: S1 ^operator O1926 +)
  8756. <=WM: (13509: S1 ^operator O1926)
  8757. <=WM: (13502: R1 ^reward R966)
  8758. <=WM: (13505: O1926 ^name predict-no)
  8759. <=WM: (13504: O1925 ^name predict-yes)
  8760. <=WM: (13503: R966 ^value 1)
  8761. --- Inner Elaboration Phase, active level 1 (S1) ---
  8762. Firing prefer*rvt*predict-yes*H0
  8763. -->
  8764. Firing rl*prefer*rvt*predict-yes*H0*1
  8765. -->
  8766. (S1 ^operator O1927 = 0.)
  8767. Firing prefer*rvt*predict-no*H0
  8768. -->
  8769. Firing rl*prefer*rvt*predict-no*H0*2
  8770. -->
  8771. (S1 ^operator O1928 = 1.)
  8772. inner elaboration loop at bottom goal.
  8773. Retracting rl*prefer*rvt*predict-no*H0*2
  8774. -->
  8775. (S1 ^operator O1926 = 1.)
  8776. Retracting rl*prefer*rvt*predict-yes*H0*1
  8777. -->
  8778. (S1 ^operator O1925 = 0.)
  8779. --- END Proposal Phase ---
  8780. --- Decision Phase ---
  8781. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8782. =>WM: (13522: S1 ^operator O1928)
  8783. 964: O: O1928 (predict-no)
  8784. --- END Decision Phase ---
  8785. --- Application Phase ---
  8786. --- Firing Productions (PE) For State At Depth 1 ---
  8787. --- Inner Elaboration Phase, active level 1 (S1) ---
  8788. Firing apply*operator
  8789. -->
  8790. (I3 ^predict-no N964 + :O )
  8791. Firing apply*operator*complete
  8792. -->
  8793. (I3 ^predict-no N963 - :O )
  8794. inner elaboration loop at bottom goal.
  8795. --- Change Working Memory (PE) ---
  8796. =>WM: (13523: I3 ^predict-no N964)
  8797. <=WM: (13511: N963 ^status complete)
  8798. <=WM: (13510: I3 ^predict-no N963)
  8799. --- Firing Productions (IE) For State At Depth 1 ---
  8800. --- Inner Elaboration Phase, active level 1 (S1) ---
  8801. Firing monitor*world
  8802. -->
  8803. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  8804. --- Change Working Memory (IE) ---
  8805. --- END Application Phase ---
  8806. --- Output Phase ---
  8807. ENV: Agent did: predict-no for direction U in state State-A
  8808. In State-A moving U
  8809. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  8810. predict error 0
  8811. dir: dir isR
  8812. --- END Output Phase ---
  8813. |\--- Input Phase ---
  8814. =>WM: (13527: I2 ^dir R)
  8815. =>WM: (13526: I2 ^reward 1)
  8816. =>WM: (13525: I2 ^see 0)
  8817. =>WM: (13524: N964 ^status complete)
  8818. <=WM: (13514: I2 ^dir U)
  8819. <=WM: (13513: I2 ^reward 1)
  8820. <=WM: (13512: I2 ^see 0)
  8821. =>WM: (13528: I2 ^level-1 L0-root)
  8822. <=WM: (13515: I2 ^level-1 L0-root)
  8823. --- END Input Phase ---
  8824. --- Proposal Phase ---
  8825. --- Inner Elaboration Phase, active level 1 (S1) ---
  8826. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  8827. -->
  8828. (S1 ^operator O1927 = 0.878390760537652)
  8829. Firing prefer*rvt*predict-yes*H0*5*H1
  8830. -->
  8831. Firing elaborate*copy-see-to-output-link
  8832. -->
  8833. (I3 ^see 0 +)
  8834. Firing elaborate*reward*based*on*reward
  8835. -->
  8836. (R968 ^value 1 +)
  8837. (R1 ^reward R968 +)
  8838. Firing propose*predict-yes
  8839. -->
  8840. (O1929 ^name predict-yes +)
  8841. (S1 ^operator O1929 +)
  8842. Firing propose*predict-no
  8843. -->
  8844. (O1930 ^name predict-no +)
  8845. (S1 ^operator O1930 +)
  8846. Firing rl*prefer*rvt*predict-no*H0*6
  8847. -->
  8848. (S1 ^operator O1928 = 0.9999810901454903)
  8849. Firing rl*prefer*rvt*predict-yes*H0*5
  8850. -->
  8851. (S1 ^operator O1927 = 0.121597689773478)
  8852. Firing prefer*rvt*predict-yes*H0
  8853. -->
  8854. Firing prefer*rvt*predict-no*H0
  8855. -->
  8856. Firing elaborate*copy-dir-to-output-link
  8857. -->
  8858. (I3 ^dir R +)
  8859. inner elaboration loop at bottom goal.
  8860. Retracting elaborate*copy-see-to-output-link
  8861. -->
  8862. (I3 ^see 0 +)
  8863. Retracting propose*predict-no
  8864. -->
  8865. (O1928 ^name predict-no +)
  8866. (S1 ^operator O1928 +)
  8867. Retracting propose*predict-yes
  8868. -->
  8869. (O1927 ^name predict-yes +)
  8870. (S1 ^operator O1927 +)
  8871. Retracting elaborate*reward*based*on*reward
  8872. -->
  8873. (R967 ^value 1 +)
  8874. (R1 ^reward R967 +)
  8875. Retracting elaborate*copy-dir-to-output-link
  8876. -->
  8877. (I3 ^dir U +)
  8878. Retracting rl*prefer*rvt*predict-no*H0*2
  8879. -->
  8880. (S1 ^operator O1928 = 1.)
  8881. Retracting rl*prefer*rvt*predict-yes*H0*1
  8882. -->
  8883. (S1 ^operator O1927 = 0.)
  8884. =>WM: (13535: S1 ^operator O1930 +)
  8885. =>WM: (13534: S1 ^operator O1929 +)
  8886. =>WM: (13533: I3 ^dir R)
  8887. =>WM: (13532: O1930 ^name predict-no)
  8888. =>WM: (13531: O1929 ^name predict-yes)
  8889. =>WM: (13530: R968 ^value 1)
  8890. =>WM: (13529: R1 ^reward R968)
  8891. <=WM: (13520: S1 ^operator O1927 +)
  8892. <=WM: (13521: S1 ^operator O1928 +)
  8893. <=WM: (13522: S1 ^operator O1928)
  8894. <=WM: (13506: I3 ^dir U)
  8895. <=WM: (13516: R1 ^reward R967)
  8896. <=WM: (13519: O1928 ^name predict-no)
  8897. <=WM: (13518: O1927 ^name predict-yes)
  8898. <=WM: (13517: R967 ^value 1)
  8899. --- Inner Elaboration Phase, active level 1 (S1) ---
  8900. Firing prefer*rvt*predict-yes*H0
  8901. -->
  8902. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  8903. -->
  8904. (S1 ^operator O1929 = 0.878390760537652)
  8905. Firing rl*prefer*rvt*predict-yes*H0*5
  8906. -->
  8907. (S1 ^operator O1929 = 0.121597689773478)
  8908. Firing prefer*rvt*predict-yes*H0*5*H1
  8909. -->
  8910. Firing prefer*rvt*predict-no*H0
  8911. -->
  8912. Firing rl*prefer*rvt*predict-no*H0*6
  8913. -->
  8914. (S1 ^operator O1930 = 0.9999810901454903)
  8915. inner elaboration loop at bottom goal.
  8916. Retracting rl*prefer*rvt*predict-no*H0*6
  8917. -->
  8918. (S1 ^operator O1928 = 0.9999810901454903)
  8919. Retracting rl*prefer*rvt*predict-yes*H0*5
  8920. -->
  8921. (S1 ^operator O1927 = 0.121597689773478)
  8922. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  8923. -->
  8924. (S1 ^operator O1927 = 0.878390760537652)
  8925. --- END Proposal Phase ---
  8926. --- Decision Phase ---
  8927. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  8928. =>WM: (13536: S1 ^operator O1929)
  8929. 965: O: O1929 (predict-yes)
  8930. --- END Decision Phase ---
  8931. --- Application Phase ---
  8932. --- Firing Productions (PE) For State At Depth 1 ---
  8933. --- Inner Elaboration Phase, active level 1 (S1) ---
  8934. Firing apply*operator
  8935. -->
  8936. (I3 ^predict-yes N965 + :O )
  8937. Firing apply*operator*complete
  8938. -->
  8939. (I3 ^predict-no N964 - :O )
  8940. inner elaboration loop at bottom goal.
  8941. --- Change Working Memory (PE) ---
  8942. =>WM: (13537: I3 ^predict-yes N965)
  8943. <=WM: (13524: N964 ^status complete)
  8944. <=WM: (13523: I3 ^predict-no N964)
  8945. --- Firing Productions (IE) For State At Depth 1 ---
  8946. --- Inner Elaboration Phase, active level 1 (S1) ---
  8947. Firing monitor*world
  8948. -->
  8949. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  8950. --- Change Working Memory (IE) ---
  8951. --- END Application Phase ---
  8952. --- Output Phase ---
  8953. ENV: Agent did: predict-yes for direction R in state State-A
  8954. In State-A moving R
  8955. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  8956. predict error 0
  8957. dir: dir isU
  8958. --- END Output Phase ---
  8959. -/|--- Input Phase ---
  8960. =>WM: (13541: I2 ^dir U)
  8961. =>WM: (13540: I2 ^reward 1)
  8962. =>WM: (13539: I2 ^see 1)
  8963. =>WM: (13538: N965 ^status complete)
  8964. <=WM: (13527: I2 ^dir R)
  8965. <=WM: (13526: I2 ^reward 1)
  8966. <=WM: (13525: I2 ^see 0)
  8967. =>WM: (13542: I2 ^level-1 R1-root)
  8968. <=WM: (13528: I2 ^level-1 L0-root)
  8969. --- END Input Phase ---
  8970. --- Proposal Phase ---
  8971. --- Inner Elaboration Phase, active level 1 (S1) ---
  8972. Firing elaborate*copy-see-to-output-link
  8973. -->
  8974. (I3 ^see 1 +)
  8975. Firing elaborate*reward*based*on*reward
  8976. -->
  8977. (R969 ^value 1 +)
  8978. (R1 ^reward R969 +)
  8979. Firing propose*predict-yes
  8980. -->
  8981. (O1931 ^name predict-yes +)
  8982. (S1 ^operator O1931 +)
  8983. Firing propose*predict-no
  8984. -->
  8985. (O1932 ^name predict-no +)
  8986. (S1 ^operator O1932 +)
  8987. Firing rl*prefer*rvt*predict-no*H0*2
  8988. -->
  8989. (S1 ^operator O1930 = 1.)
  8990. Firing rl*prefer*rvt*predict-yes*H0*1
  8991. -->
  8992. (S1 ^operator O1929 = 0.)
  8993. Firing prefer*rvt*predict-yes*H0
  8994. -->
  8995. Firing prefer*rvt*predict-no*H0
  8996. -->
  8997. Firing elaborate*copy-dir-to-output-link
  8998. -->
  8999. (I3 ^dir U +)
  9000. inner elaboration loop at bottom goal.
  9001. Retracting elaborate*copy-see-to-output-link
  9002. -->
  9003. (I3 ^see 0 +)
  9004. Retracting propose*predict-no
  9005. -->
  9006. (O1930 ^name predict-no +)
  9007. (S1 ^operator O1930 +)
  9008. Retracting propose*predict-yes
  9009. -->
  9010. (O1929 ^name predict-yes +)
  9011. (S1 ^operator O1929 +)
  9012. Retracting elaborate*reward*based*on*reward
  9013. -->
  9014. (R968 ^value 1 +)
  9015. (R1 ^reward R968 +)
  9016. Retracting elaborate*copy-dir-to-output-link
  9017. -->
  9018. (I3 ^dir R +)
  9019. Retracting rl*prefer*rvt*predict-no*H0*6
  9020. -->
  9021. (S1 ^operator O1930 = 0.9999810901454903)
  9022. Retracting rl*prefer*rvt*predict-yes*H0*5
  9023. -->
  9024. (S1 ^operator O1929 = 0.121597689773478)
  9025. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9026. -->
  9027. (S1 ^operator O1929 = 0.878390760537652)
  9028. =>WM: (13550: S1 ^operator O1932 +)
  9029. =>WM: (13549: S1 ^operator O1931 +)
  9030. =>WM: (13548: I3 ^dir U)
  9031. =>WM: (13547: O1932 ^name predict-no)
  9032. =>WM: (13546: O1931 ^name predict-yes)
  9033. =>WM: (13545: R969 ^value 1)
  9034. =>WM: (13544: R1 ^reward R969)
  9035. =>WM: (13543: I3 ^see 1)
  9036. <=WM: (13534: S1 ^operator O1929 +)
  9037. <=WM: (13536: S1 ^operator O1929)
  9038. <=WM: (13535: S1 ^operator O1930 +)
  9039. <=WM: (13533: I3 ^dir R)
  9040. <=WM: (13529: R1 ^reward R968)
  9041. <=WM: (13487: I3 ^see 0)
  9042. <=WM: (13532: O1930 ^name predict-no)
  9043. <=WM: (13531: O1929 ^name predict-yes)
  9044. <=WM: (13530: R968 ^value 1)
  9045. --- Inner Elaboration Phase, active level 1 (S1) ---
  9046. Firing prefer*rvt*predict-yes*H0
  9047. -->
  9048. Firing rl*prefer*rvt*predict-yes*H0*1
  9049. -->
  9050. (S1 ^operator O1931 = 0.)
  9051. Firing prefer*rvt*predict-no*H0
  9052. -->
  9053. Firing rl*prefer*rvt*predict-no*H0*2
  9054. -->
  9055. (S1 ^operator O1932 = 1.)
  9056. inner elaboration loop at bottom goal.
  9057. Retracting rl*prefer*rvt*predict-no*H0*2
  9058. -->
  9059. (S1 ^operator O1930 = 1.)
  9060. Retracting rl*prefer*rvt*predict-yes*H0*1
  9061. -->
  9062. (S1 ^operator O1929 = 0.)
  9063. --- END Proposal Phase ---
  9064. --- Decision Phase ---
  9065. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.858824,0.121963)
  9066. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878391 -> 0.465467 0.412924 0.878392(R,m,v=1,1,0)
  9067. =>WM: (13551: S1 ^operator O1932)
  9068. 966: O: O1932 (predict-no)
  9069. --- END Decision Phase ---
  9070. --- Application Phase ---
  9071. --- Firing Productions (PE) For State At Depth 1 ---
  9072. --- Inner Elaboration Phase, active level 1 (S1) ---
  9073. Firing apply*operator
  9074. -->
  9075. (I3 ^predict-no N966 + :O )
  9076. Firing apply*operator*complete
  9077. -->
  9078. (I3 ^predict-yes N965 - :O )
  9079. inner elaboration loop at bottom goal.
  9080. --- Change Working Memory (PE) ---
  9081. =>WM: (13552: I3 ^predict-no N966)
  9082. <=WM: (13538: N965 ^status complete)
  9083. <=WM: (13537: I3 ^predict-yes N965)
  9084. --- Firing Productions (IE) For State At Depth 1 ---
  9085. --- Inner Elaboration Phase, active level 1 (S1) ---
  9086. Firing monitor*world
  9087. -->
  9088. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9089. --- Change Working Memory (IE) ---
  9090. --- END Application Phase ---
  9091. --- Output Phase ---
  9092. ENV: Agent did: predict-no for direction U in state State-B
  9093. In State-B moving U
  9094. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  9095. predict error 0
  9096. dir: dir isL
  9097. --- END Output Phase ---
  9098. \---- Input Phase ---
  9099. =>WM: (13556: I2 ^dir L)
  9100. =>WM: (13555: I2 ^reward 1)
  9101. =>WM: (13554: I2 ^see 0)
  9102. =>WM: (13553: N966 ^status complete)
  9103. <=WM: (13541: I2 ^dir U)
  9104. <=WM: (13540: I2 ^reward 1)
  9105. <=WM: (13539: I2 ^see 1)
  9106. =>WM: (13557: I2 ^level-1 R1-root)
  9107. <=WM: (13542: I2 ^level-1 R1-root)
  9108. --- END Input Phase ---
  9109. --- Proposal Phase ---
  9110. --- Inner Elaboration Phase, active level 1 (S1) ---
  9111. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9112. -->
  9113. (S1 ^operator O1932 = -0.168718511744511)
  9114. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9115. -->
  9116. (S1 ^operator O1931 = 0.6093697568764296)
  9117. Firing prefer*rvt*predict-no*H0*4*H1
  9118. -->
  9119. Firing prefer*rvt*predict-yes*H0*3*H1
  9120. -->
  9121. Firing elaborate*copy-see-to-output-link
  9122. -->
  9123. (I3 ^see 0 +)
  9124. Firing elaborate*reward*based*on*reward
  9125. -->
  9126. (R970 ^value 1 +)
  9127. (R1 ^reward R970 +)
  9128. Firing propose*predict-yes
  9129. -->
  9130. (O1933 ^name predict-yes +)
  9131. (S1 ^operator O1933 +)
  9132. Firing propose*predict-no
  9133. -->
  9134. (O1934 ^name predict-no +)
  9135. (S1 ^operator O1934 +)
  9136. Firing rl*prefer*rvt*predict-no*H0*4
  9137. -->
  9138. (S1 ^operator O1932 = 0.3145082389793297)
  9139. Firing rl*prefer*rvt*predict-yes*H0*3
  9140. -->
  9141. (S1 ^operator O1931 = 0.390807862285058)
  9142. Firing prefer*rvt*predict-yes*H0
  9143. -->
  9144. Firing prefer*rvt*predict-no*H0
  9145. -->
  9146. Firing elaborate*copy-dir-to-output-link
  9147. -->
  9148. (I3 ^dir L +)
  9149. inner elaboration loop at bottom goal.
  9150. Retracting elaborate*copy-see-to-output-link
  9151. -->
  9152. (I3 ^see 1 +)
  9153. Retracting propose*predict-no
  9154. -->
  9155. (O1932 ^name predict-no +)
  9156. (S1 ^operator O1932 +)
  9157. Retracting propose*predict-yes
  9158. -->
  9159. (O1931 ^name predict-yes +)
  9160. (S1 ^operator O1931 +)
  9161. Retracting elaborate*reward*based*on*reward
  9162. -->
  9163. (R969 ^value 1 +)
  9164. (R1 ^reward R969 +)
  9165. Retracting elaborate*copy-dir-to-output-link
  9166. -->
  9167. (I3 ^dir U +)
  9168. Retracting rl*prefer*rvt*predict-no*H0*2
  9169. -->
  9170. (S1 ^operator O1932 = 1.)
  9171. Retracting rl*prefer*rvt*predict-yes*H0*1
  9172. -->
  9173. (S1 ^operator O1931 = 0.)
  9174. =>WM: (13565: S1 ^operator O1934 +)
  9175. =>WM: (13564: S1 ^operator O1933 +)
  9176. =>WM: (13563: I3 ^dir L)
  9177. =>WM: (13562: O1934 ^name predict-no)
  9178. =>WM: (13561: O1933 ^name predict-yes)
  9179. =>WM: (13560: R970 ^value 1)
  9180. =>WM: (13559: R1 ^reward R970)
  9181. =>WM: (13558: I3 ^see 0)
  9182. <=WM: (13549: S1 ^operator O1931 +)
  9183. <=WM: (13550: S1 ^operator O1932 +)
  9184. <=WM: (13551: S1 ^operator O1932)
  9185. <=WM: (13548: I3 ^dir U)
  9186. <=WM: (13544: R1 ^reward R969)
  9187. <=WM: (13543: I3 ^see 1)
  9188. <=WM: (13547: O1932 ^name predict-no)
  9189. <=WM: (13546: O1931 ^name predict-yes)
  9190. <=WM: (13545: R969 ^value 1)
  9191. --- Inner Elaboration Phase, active level 1 (S1) ---
  9192. Firing prefer*rvt*predict-yes*H0
  9193. -->
  9194. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9195. -->
  9196. (S1 ^operator O1933 = 0.6093697568764296)
  9197. Firing rl*prefer*rvt*predict-yes*H0*3
  9198. -->
  9199. (S1 ^operator O1933 = 0.390807862285058)
  9200. Firing prefer*rvt*predict-yes*H0*3*H1
  9201. -->
  9202. Firing prefer*rvt*predict-no*H0
  9203. -->
  9204. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9205. -->
  9206. (S1 ^operator O1934 = -0.168718511744511)
  9207. Firing rl*prefer*rvt*predict-no*H0*4
  9208. -->
  9209. (S1 ^operator O1934 = 0.3145082389793297)
  9210. Firing prefer*rvt*predict-no*H0*4*H1
  9211. -->
  9212. inner elaboration loop at bottom goal.
  9213. Retracting rl*prefer*rvt*predict-no*H0*4
  9214. -->
  9215. (S1 ^operator O1932 = 0.3145082389793297)
  9216. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9217. -->
  9218. (S1 ^operator O1932 = -0.168718511744511)
  9219. Retracting rl*prefer*rvt*predict-yes*H0*3
  9220. -->
  9221. (S1 ^operator O1931 = 0.390807862285058)
  9222. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9223. -->
  9224. (S1 ^operator O1931 = 0.6093697568764296)
  9225. --- END Proposal Phase ---
  9226. --- Decision Phase ---
  9227. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  9228. =>WM: (13566: S1 ^operator O1933)
  9229. 967: O: O1933 (predict-yes)
  9230. --- END Decision Phase ---
  9231. --- Application Phase ---
  9232. --- Firing Productions (PE) For State At Depth 1 ---
  9233. --- Inner Elaboration Phase, active level 1 (S1) ---
  9234. Firing apply*operator
  9235. -->
  9236. (I3 ^predict-yes N967 + :O )
  9237. Firing apply*operator*complete
  9238. -->
  9239. (I3 ^predict-no N966 - :O )
  9240. inner elaboration loop at bottom goal.
  9241. --- Change Working Memory (PE) ---
  9242. =>WM: (13567: I3 ^predict-yes N967)
  9243. <=WM: (13553: N966 ^status complete)
  9244. <=WM: (13552: I3 ^predict-no N966)
  9245. --- Firing Productions (IE) For State At Depth 1 ---
  9246. --- Inner Elaboration Phase, active level 1 (S1) ---
  9247. Firing monitor*world
  9248. -->
  9249. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9250. --- Change Working Memory (IE) ---
  9251. --- END Application Phase ---
  9252. --- Output Phase ---
  9253. ENV: Agent did: predict-yes for direction L in state State-B
  9254. In State-B moving L
  9255. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9256. predict error 0
  9257. dir: dir isL
  9258. --- END Output Phase ---
  9259. /|\--- Input Phase ---
  9260. =>WM: (13571: I2 ^dir L)
  9261. =>WM: (13570: I2 ^reward 1)
  9262. =>WM: (13569: I2 ^see 1)
  9263. =>WM: (13568: N967 ^status complete)
  9264. <=WM: (13556: I2 ^dir L)
  9265. <=WM: (13555: I2 ^reward 1)
  9266. <=WM: (13554: I2 ^see 0)
  9267. =>WM: (13572: I2 ^level-1 L1-root)
  9268. <=WM: (13557: I2 ^level-1 R1-root)
  9269. --- END Input Phase ---
  9270. --- Proposal Phase ---
  9271. --- Inner Elaboration Phase, active level 1 (S1) ---
  9272. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  9273. -->
  9274. (S1 ^operator O1933 = -0.2062723012911647)
  9275. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  9276. -->
  9277. (S1 ^operator O1934 = 0.685551861847024)
  9278. Firing prefer*rvt*predict-no*H0*4*H1
  9279. -->
  9280. Firing prefer*rvt*predict-yes*H0*3*H1
  9281. -->
  9282. Firing elaborate*copy-see-to-output-link
  9283. -->
  9284. (I3 ^see 1 +)
  9285. Firing elaborate*reward*based*on*reward
  9286. -->
  9287. (R971 ^value 1 +)
  9288. (R1 ^reward R971 +)
  9289. Firing propose*predict-yes
  9290. -->
  9291. (O1935 ^name predict-yes +)
  9292. (S1 ^operator O1935 +)
  9293. Firing propose*predict-no
  9294. -->
  9295. (O1936 ^name predict-no +)
  9296. (S1 ^operator O1936 +)
  9297. Firing rl*prefer*rvt*predict-no*H0*4
  9298. -->
  9299. (S1 ^operator O1934 = 0.3145082389793297)
  9300. Firing rl*prefer*rvt*predict-yes*H0*3
  9301. -->
  9302. (S1 ^operator O1933 = 0.390807862285058)
  9303. Firing prefer*rvt*predict-yes*H0
  9304. -->
  9305. Firing prefer*rvt*predict-no*H0
  9306. -->
  9307. Firing elaborate*copy-dir-to-output-link
  9308. -->
  9309. (I3 ^dir L +)
  9310. inner elaboration loop at bottom goal.
  9311. Retracting elaborate*copy-see-to-output-link
  9312. -->
  9313. (I3 ^see 0 +)
  9314. Retracting propose*predict-no
  9315. -->
  9316. (O1934 ^name predict-no +)
  9317. (S1 ^operator O1934 +)
  9318. Retracting propose*predict-yes
  9319. -->
  9320. (O1933 ^name predict-yes +)
  9321. (S1 ^operator O1933 +)
  9322. Retracting elaborate*reward*based*on*reward
  9323. -->
  9324. (R970 ^value 1 +)
  9325. (R1 ^reward R970 +)
  9326. Retracting elaborate*copy-dir-to-output-link
  9327. -->
  9328. (I3 ^dir L +)
  9329. Retracting rl*prefer*rvt*predict-no*H0*4
  9330. -->
  9331. (S1 ^operator O1934 = 0.3145082389793297)
  9332. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9333. -->
  9334. (S1 ^operator O1934 = -0.168718511744511)
  9335. Retracting rl*prefer*rvt*predict-yes*H0*3
  9336. -->
  9337. (S1 ^operator O1933 = 0.390807862285058)
  9338. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9339. -->
  9340. (S1 ^operator O1933 = 0.6093697568764296)
  9341. =>WM: (13579: S1 ^operator O1936 +)
  9342. =>WM: (13578: S1 ^operator O1935 +)
  9343. =>WM: (13577: O1936 ^name predict-no)
  9344. =>WM: (13576: O1935 ^name predict-yes)
  9345. =>WM: (13575: R971 ^value 1)
  9346. =>WM: (13574: R1 ^reward R971)
  9347. =>WM: (13573: I3 ^see 1)
  9348. <=WM: (13564: S1 ^operator O1933 +)
  9349. <=WM: (13566: S1 ^operator O1933)
  9350. <=WM: (13565: S1 ^operator O1934 +)
  9351. <=WM: (13559: R1 ^reward R970)
  9352. <=WM: (13558: I3 ^see 0)
  9353. <=WM: (13562: O1934 ^name predict-no)
  9354. <=WM: (13561: O1933 ^name predict-yes)
  9355. <=WM: (13560: R970 ^value 1)
  9356. --- Inner Elaboration Phase, active level 1 (S1) ---
  9357. Firing prefer*rvt*predict-yes*H0
  9358. -->
  9359. Firing rl*prefer*rvt*predict-yes*H0*3
  9360. -->
  9361. (S1 ^operator O1935 = 0.390807862285058)
  9362. Firing prefer*rvt*predict-yes*H0*3*H1
  9363. -->
  9364. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  9365. -->
  9366. (S1 ^operator O1935 = -0.2062723012911647)
  9367. Firing prefer*rvt*predict-no*H0
  9368. -->
  9369. Firing rl*prefer*rvt*predict-no*H0*4
  9370. -->
  9371. (S1 ^operator O1936 = 0.3145082389793297)
  9372. Firing prefer*rvt*predict-no*H0*4*H1
  9373. -->
  9374. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  9375. -->
  9376. (S1 ^operator O1936 = 0.685551861847024)
  9377. inner elaboration loop at bottom goal.
  9378. Retracting rl*prefer*rvt*predict-no*H0*4
  9379. -->
  9380. (S1 ^operator O1934 = 0.3145082389793297)
  9381. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  9382. -->
  9383. (S1 ^operator O1934 = 0.685551861847024)
  9384. Retracting rl*prefer*rvt*predict-yes*H0*3
  9385. -->
  9386. (S1 ^operator O1933 = 0.390807862285058)
  9387. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  9388. -->
  9389. (S1 ^operator O1933 = -0.2062723012911647)
  9390. --- END Proposal Phase ---
  9391. --- Decision Phase ---
  9392. RL update rl*prefer*rvt*predict-yes*H0*3 0.472349 -0.0815415 0.390808 -> 0.472337 -0.0815436 0.390793(R,m,v=1,0.941558,0.0553858)
  9393. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527802 0.0815677 0.60937 -> 0.527788 0.0815652 0.609353(R,m,v=1,1,0)
  9394. =>WM: (13580: S1 ^operator O1936)
  9395. 968: O: O1936 (predict-no)
  9396. --- END Decision Phase ---
  9397. --- Application Phase ---
  9398. --- Firing Productions (PE) For State At Depth 1 ---
  9399. --- Inner Elaboration Phase, active level 1 (S1) ---
  9400. Firing apply*operator
  9401. -->
  9402. (I3 ^predict-no N968 + :O )
  9403. Firing apply*operator*complete
  9404. -->
  9405. (I3 ^predict-yes N967 - :O )
  9406. inner elaboration loop at bottom goal.
  9407. --- Change Working Memory (PE) ---
  9408. =>WM: (13581: I3 ^predict-no N968)
  9409. <=WM: (13568: N967 ^status complete)
  9410. <=WM: (13567: I3 ^predict-yes N967)
  9411. --- Firing Productions (IE) For State At Depth 1 ---
  9412. --- Inner Elaboration Phase, active level 1 (S1) ---
  9413. Firing monitor*world
  9414. -->
  9415. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9416. --- Change Working Memory (IE) ---
  9417. --- END Application Phase ---
  9418. --- Output Phase ---
  9419. ENV: Agent did: predict-no for direction L in state State-A
  9420. In State-A moving L
  9421. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9422. predict error 0
  9423. dir: dir isR
  9424. --- END Output Phase ---
  9425. -/|--- Input Phase ---
  9426. =>WM: (13585: I2 ^dir R)
  9427. =>WM: (13584: I2 ^reward 1)
  9428. =>WM: (13583: I2 ^see 0)
  9429. =>WM: (13582: N968 ^status complete)
  9430. <=WM: (13571: I2 ^dir L)
  9431. <=WM: (13570: I2 ^reward 1)
  9432. <=WM: (13569: I2 ^see 1)
  9433. =>WM: (13586: I2 ^level-1 L0-root)
  9434. <=WM: (13572: I2 ^level-1 L1-root)
  9435. --- END Input Phase ---
  9436. --- Proposal Phase ---
  9437. --- Inner Elaboration Phase, active level 1 (S1) ---
  9438. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  9439. -->
  9440. (S1 ^operator O1935 = 0.8783918732984659)
  9441. Firing prefer*rvt*predict-yes*H0*5*H1
  9442. -->
  9443. Firing elaborate*copy-see-to-output-link
  9444. -->
  9445. (I3 ^see 0 +)
  9446. Firing elaborate*reward*based*on*reward
  9447. -->
  9448. (R972 ^value 1 +)
  9449. (R1 ^reward R972 +)
  9450. Firing propose*predict-yes
  9451. -->
  9452. (O1937 ^name predict-yes +)
  9453. (S1 ^operator O1937 +)
  9454. Firing propose*predict-no
  9455. -->
  9456. (O1938 ^name predict-no +)
  9457. (S1 ^operator O1938 +)
  9458. Firing rl*prefer*rvt*predict-no*H0*6
  9459. -->
  9460. (S1 ^operator O1936 = 0.9999810901454903)
  9461. Firing rl*prefer*rvt*predict-yes*H0*5
  9462. -->
  9463. (S1 ^operator O1935 = 0.1215986309459259)
  9464. Firing prefer*rvt*predict-yes*H0
  9465. -->
  9466. Firing prefer*rvt*predict-no*H0
  9467. -->
  9468. Firing elaborate*copy-dir-to-output-link
  9469. -->
  9470. (I3 ^dir R +)
  9471. inner elaboration loop at bottom goal.
  9472. Retracting elaborate*copy-see-to-output-link
  9473. -->
  9474. (I3 ^see 1 +)
  9475. Retracting propose*predict-no
  9476. -->
  9477. (O1936 ^name predict-no +)
  9478. (S1 ^operator O1936 +)
  9479. Retracting propose*predict-yes
  9480. -->
  9481. (O1935 ^name predict-yes +)
  9482. (S1 ^operator O1935 +)
  9483. Retracting elaborate*reward*based*on*reward
  9484. -->
  9485. (R971 ^value 1 +)
  9486. (R1 ^reward R971 +)
  9487. Retracting elaborate*copy-dir-to-output-link
  9488. -->
  9489. (I3 ^dir L +)
  9490. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  9491. -->
  9492. (S1 ^operator O1936 = 0.685551861847024)
  9493. Retracting rl*prefer*rvt*predict-no*H0*4
  9494. -->
  9495. (S1 ^operator O1936 = 0.3145082389793297)
  9496. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  9497. -->
  9498. (S1 ^operator O1935 = -0.2062723012911647)
  9499. Retracting rl*prefer*rvt*predict-yes*H0*3
  9500. -->
  9501. (S1 ^operator O1935 = 0.3907931512898603)
  9502. =>WM: (13594: S1 ^operator O1938 +)
  9503. =>WM: (13593: S1 ^operator O1937 +)
  9504. =>WM: (13592: I3 ^dir R)
  9505. =>WM: (13591: O1938 ^name predict-no)
  9506. =>WM: (13590: O1937 ^name predict-yes)
  9507. =>WM: (13589: R972 ^value 1)
  9508. =>WM: (13588: R1 ^reward R972)
  9509. =>WM: (13587: I3 ^see 0)
  9510. <=WM: (13578: S1 ^operator O1935 +)
  9511. <=WM: (13579: S1 ^operator O1936 +)
  9512. <=WM: (13580: S1 ^operator O1936)
  9513. <=WM: (13563: I3 ^dir L)
  9514. <=WM: (13574: R1 ^reward R971)
  9515. <=WM: (13573: I3 ^see 1)
  9516. <=WM: (13577: O1936 ^name predict-no)
  9517. <=WM: (13576: O1935 ^name predict-yes)
  9518. <=WM: (13575: R971 ^value 1)
  9519. --- Inner Elaboration Phase, active level 1 (S1) ---
  9520. Firing prefer*rvt*predict-yes*H0
  9521. -->
  9522. Firing rl*prefer*rvt*predict-yes*H0*5
  9523. -->
  9524. (S1 ^operator O1937 = 0.1215986309459259)
  9525. Firing prefer*rvt*predict-yes*H0*5*H1
  9526. -->
  9527. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  9528. -->
  9529. (S1 ^operator O1937 = 0.8783918732984659)
  9530. Firing prefer*rvt*predict-no*H0
  9531. -->
  9532. Firing rl*prefer*rvt*predict-no*H0*6
  9533. -->
  9534. (S1 ^operator O1938 = 0.9999810901454903)
  9535. inner elaboration loop at bottom goal.
  9536. Retracting rl*prefer*rvt*predict-no*H0*6
  9537. -->
  9538. (S1 ^operator O1936 = 0.9999810901454903)
  9539. Retracting rl*prefer*rvt*predict-yes*H0*5
  9540. -->
  9541. (S1 ^operator O1935 = 0.1215986309459259)
  9542. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9543. -->
  9544. (S1 ^operator O1935 = 0.8783918732984659)
  9545. --- END Proposal Phase ---
  9546. --- Decision Phase ---
  9547. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478552 -0.164048 0.314503(R,m,v=1,0.92,0.074094)
  9548. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521498 0.164053 0.685552 -> 0.521493 0.164053 0.685546(R,m,v=1,1,0)
  9549. =>WM: (13595: S1 ^operator O1937)
  9550. 969: O: O1937 (predict-yes)
  9551. --- END Decision Phase ---
  9552. --- Application Phase ---
  9553. --- Firing Productions (PE) For State At Depth 1 ---
  9554. --- Inner Elaboration Phase, active level 1 (S1) ---
  9555. Firing apply*operator
  9556. -->
  9557. (I3 ^predict-yes N969 + :O )
  9558. Firing apply*operator*complete
  9559. -->
  9560. (I3 ^predict-no N968 - :O )
  9561. inner elaboration loop at bottom goal.
  9562. --- Change Working Memory (PE) ---
  9563. =>WM: (13596: I3 ^predict-yes N969)
  9564. <=WM: (13582: N968 ^status complete)
  9565. <=WM: (13581: I3 ^predict-no N968)
  9566. --- Firing Productions (IE) For State At Depth 1 ---
  9567. --- Inner Elaboration Phase, active level 1 (S1) ---
  9568. Firing monitor*world
  9569. -->
  9570. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9571. --- Change Working Memory (IE) ---
  9572. --- END Application Phase ---
  9573. --- Output Phase ---
  9574. ENV: Agent did: predict-yes for direction R in state State-A
  9575. In State-A moving R
  9576. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  9577. predict error 0
  9578. dir: dir isL
  9579. --- END Output Phase ---
  9580. --- Input Phase ---
  9581. =>WM: (13600: I2 ^dir L)
  9582. =>WM: (13599: I2 ^reward 1)
  9583. =>WM: (13598: I2 ^see 1)
  9584. =>WM: (13597: N969 ^status complete)
  9585. <=WM: (13585: I2 ^dir R)
  9586. <=WM: (13584: I2 ^reward 1)
  9587. <=WM: (13583: I2 ^see 0)
  9588. =>WM: (13601: I2 ^level-1 R1-root)
  9589. <=WM: (13586: I2 ^level-1 L0-root)
  9590. --- END Input Phase ---
  9591. --- Proposal Phase ---
  9592. --- Inner Elaboration Phase, active level 1 (S1) ---
  9593. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9594. -->
  9595. (S1 ^operator O1938 = -0.168718511744511)
  9596. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9597. -->
  9598. (S1 ^operator O1937 = 0.6093527419421177)
  9599. Firing prefer*rvt*predict-no*H0*4*H1
  9600. -->
  9601. Firing prefer*rvt*predict-yes*H0*3*H1
  9602. -->
  9603. Firing elaborate*copy-see-to-output-link
  9604. -->
  9605. (I3 ^see 1 +)
  9606. Firing elaborate*reward*based*on*reward
  9607. -->
  9608. (R973 ^value 1 +)
  9609. (R1 ^reward R973 +)
  9610. Firing propose*predict-yes
  9611. -->
  9612. (O1939 ^name predict-yes +)
  9613. (S1 ^operator O1939 +)
  9614. Firing propose*predict-no
  9615. -->
  9616. (O1940 ^name predict-no +)
  9617. (S1 ^operator O1940 +)
  9618. Firing rl*prefer*rvt*predict-no*H0*4
  9619. -->
  9620. (S1 ^operator O1938 = 0.3145032394390637)
  9621. Firing rl*prefer*rvt*predict-yes*H0*3
  9622. -->
  9623. (S1 ^operator O1937 = 0.3907931512898603)
  9624. Firing prefer*rvt*predict-yes*H0
  9625. -->
  9626. Firing prefer*rvt*predict-no*H0
  9627. -->
  9628. Firing elaborate*copy-dir-to-output-link
  9629. -->
  9630. (I3 ^dir L +)
  9631. inner elaboration loop at bottom goal.
  9632. Retracting elaborate*copy-see-to-output-link
  9633. -->
  9634. (I3 ^see 0 +)
  9635. Retracting propose*predict-no
  9636. -->
  9637. (O1938 ^name predict-no +)
  9638. (S1 ^operator O1938 +)
  9639. Retracting propose*predict-yes
  9640. -->
  9641. (O1937 ^name predict-yes +)
  9642. (S1 ^operator O1937 +)
  9643. Retracting elaborate*reward*based*on*reward
  9644. -->
  9645. (R972 ^value 1 +)
  9646. (R1 ^reward R972 +)
  9647. Retracting elaborate*copy-dir-to-output-link
  9648. -->
  9649. (I3 ^dir R +)
  9650. Retracting rl*prefer*rvt*predict-no*H0*6
  9651. -->
  9652. (S1 ^operator O1938 = 0.9999810901454903)
  9653. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  9654. -->
  9655. (S1 ^operator O1937 = 0.8783918732984659)
  9656. Retracting rl*prefer*rvt*predict-yes*H0*5
  9657. -->
  9658. (S1 ^operator O1937 = 0.1215986309459259)
  9659. =>WM: (13609: S1 ^operator O1940 +)
  9660. =>WM: (13608: S1 ^operator O1939 +)
  9661. =>WM: (13607: I3 ^dir L)
  9662. =>WM: (13606: O1940 ^name predict-no)
  9663. =>WM: (13605: O1939 ^name predict-yes)
  9664. =>WM: (13604: R973 ^value 1)
  9665. =>WM: (13603: R1 ^reward R973)
  9666. =>WM: (13602: I3 ^see 1)
  9667. <=WM: (13593: S1 ^operator O1937 +)
  9668. <=WM: (13595: S1 ^operator O1937)
  9669. <=WM: (13594: S1 ^operator O1938 +)
  9670. <=WM: (13592: I3 ^dir R)
  9671. <=WM: (13588: R1 ^reward R972)
  9672. <=WM: (13587: I3 ^see 0)
  9673. <=WM: (13591: O1938 ^name predict-no)
  9674. <=WM: (13590: O1937 ^name predict-yes)
  9675. <=WM: (13589: R972 ^value 1)
  9676. --- Inner Elaboration Phase, active level 1 (S1) ---
  9677. Firing prefer*rvt*predict-yes*H0
  9678. -->
  9679. Firing rl*prefer*rvt*predict-yes*H0*3
  9680. -->
  9681. (S1 ^operator O1939 = 0.3907931512898603)
  9682. Firing prefer*rvt*predict-yes*H0*3*H1
  9683. -->
  9684. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  9685. -->
  9686. (S1 ^operator O1939 = 0.6093527419421177)
  9687. Firing prefer*rvt*predict-no*H0
  9688. -->
  9689. Firing rl*prefer*rvt*predict-no*H0*4
  9690. -->
  9691. (S1 ^operator O1940 = 0.3145032394390637)
  9692. Firing prefer*rvt*predict-no*H0*4*H1
  9693. -->
  9694. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  9695. -->
  9696. (S1 ^operator O1940 = -0.168718511744511)
  9697. inner elaboration loop at bottom goal.
  9698. Retracting rl*prefer*rvt*predict-no*H0*4
  9699. -->
  9700. (S1 ^operator O1938 = 0.3145032394390637)
  9701. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9702. -->
  9703. (S1 ^operator O1938 = -0.168718511744511)
  9704. Retracting rl*prefer*rvt*predict-yes*H0*3
  9705. -->
  9706. (S1 ^operator O1937 = 0.3907931512898603)
  9707. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9708. -->
  9709. (S1 ^operator O1937 = 0.6093527419421177)
  9710. --- END Proposal Phase ---
  9711. --- Decision Phase ---
  9712. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.859649,0.121362)
  9713. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465467 0.412924 0.878392 -> 0.465468 0.412925 0.878393(R,m,v=1,1,0)
  9714. =>WM: (13610: S1 ^operator O1939)
  9715. 970: O: O1939 (predict-yes)
  9716. --- END Decision Phase ---
  9717. --- Application Phase ---
  9718. --- Firing Productions (PE) For State At Depth 1 ---
  9719. --- Inner Elaboration Phase, active level 1 (S1) ---
  9720. Firing apply*operator
  9721. -->
  9722. (I3 ^predict-yes N970 + :O )
  9723. Firing apply*operator*complete
  9724. -->
  9725. (I3 ^predict-yes N969 - :O )
  9726. inner elaboration loop at bottom goal.
  9727. --- Change Working Memory (PE) ---
  9728. =>WM: (13611: I3 ^predict-yes N970)
  9729. <=WM: (13597: N969 ^status complete)
  9730. <=WM: (13596: I3 ^predict-yes N969)
  9731. --- Firing Productions (IE) For State At Depth 1 ---
  9732. --- Inner Elaboration Phase, active level 1 (S1) ---
  9733. Firing monitor*world
  9734. -->
  9735. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  9736. --- Change Working Memory (IE) ---
  9737. --- END Application Phase ---
  9738. --- Output Phase ---
  9739. ENV: Agent did: predict-yes for direction L in state State-B
  9740. In State-B moving L
  9741. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  9742. predict error 0
  9743. dir: dir isU
  9744. --- END Output Phase ---
  9745. --- Input Phase ---
  9746. =>WM: (13615: I2 ^dir U)
  9747. =>WM: (13614: I2 ^reward 1)
  9748. =>WM: (13613: I2 ^see 1)
  9749. =>WM: (13612: N970 ^status complete)
  9750. <=WM: (13600: I2 ^dir L)
  9751. <=WM: (13599: I2 ^reward 1)
  9752. <=WM: (13598: I2 ^see 1)
  9753. =>WM: (13616: I2 ^level-1 L1-root)
  9754. <=WM: (13601: I2 ^level-1 R1-root)
  9755. --- END Input Phase ---
  9756. --- Proposal Phase ---
  9757. --- Inner Elaboration Phase, active level 1 (S1) ---
  9758. Firing elaborate*copy-see-to-output-link
  9759. -->
  9760. (I3 ^see 1 +)
  9761. Firing elaborate*reward*based*on*reward
  9762. -->
  9763. (R974 ^value 1 +)
  9764. (R1 ^reward R974 +)
  9765. Firing propose*predict-yes
  9766. -->
  9767. (O1941 ^name predict-yes +)
  9768. (S1 ^operator O1941 +)
  9769. Firing propose*predict-no
  9770. -->
  9771. (O1942 ^name predict-no +)
  9772. (S1 ^operator O1942 +)
  9773. Firing rl*prefer*rvt*predict-no*H0*2
  9774. -->
  9775. (S1 ^operator O1940 = 1.)
  9776. Firing rl*prefer*rvt*predict-yes*H0*1
  9777. -->
  9778. (S1 ^operator O1939 = 0.)
  9779. Firing prefer*rvt*predict-yes*H0
  9780. -->
  9781. Firing prefer*rvt*predict-no*H0
  9782. -->
  9783. Firing elaborate*copy-dir-to-output-link
  9784. -->
  9785. (I3 ^dir U +)
  9786. inner elaboration loop at bottom goal.
  9787. Retracting elaborate*copy-see-to-output-link
  9788. -->
  9789. (I3 ^see 1 +)
  9790. Retracting propose*predict-no
  9791. -->
  9792. (O1940 ^name predict-no +)
  9793. (S1 ^operator O1940 +)
  9794. Retracting propose*predict-yes
  9795. -->
  9796. (O1939 ^name predict-yes +)
  9797. (S1 ^operator O1939 +)
  9798. Retracting elaborate*reward*based*on*reward
  9799. -->
  9800. (R973 ^value 1 +)
  9801. (R1 ^reward R973 +)
  9802. Retracting elaborate*copy-dir-to-output-link
  9803. -->
  9804. (I3 ^dir L +)
  9805. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  9806. -->
  9807. (S1 ^operator O1940 = -0.168718511744511)
  9808. Retracting rl*prefer*rvt*predict-no*H0*4
  9809. -->
  9810. (S1 ^operator O1940 = 0.3145032394390637)
  9811. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  9812. -->
  9813. (S1 ^operator O1939 = 0.6093527419421177)
  9814. Retracting rl*prefer*rvt*predict-yes*H0*3
  9815. -->
  9816. (S1 ^operator O1939 = 0.3907931512898603)
  9817. =>WM: (13623: S1 ^operator O1942 +)
  9818. =>WM: (13622: S1 ^operator O1941 +)
  9819. =>WM: (13621: I3 ^dir U)
  9820. =>WM: (13620: O1942 ^name predict-no)
  9821. =>WM: (13619: O1941 ^name predict-yes)
  9822. =>WM: (13618: R974 ^value 1)
  9823. =>WM: (13617: R1 ^reward R974)
  9824. <=WM: (13608: S1 ^operator O1939 +)
  9825. <=WM: (13610: S1 ^operator O1939)
  9826. <=WM: (13609: S1 ^operator O1940 +)
  9827. <=WM: (13607: I3 ^dir L)
  9828. <=WM: (13603: R1 ^reward R973)
  9829. <=WM: (13606: O1940 ^name predict-no)
  9830. <=WM: (13605: O1939 ^name predict-yes)
  9831. <=WM: (13604: R973 ^value 1)
  9832. --- Inner Elaboration Phase, active level 1 (S1) ---
  9833. Firing prefer*rvt*predict-yes*H0
  9834. -->
  9835. Firing rl*prefer*rvt*predict-yes*H0*1
  9836. -->
  9837. (S1 ^operator O1941 = 0.)
  9838. Firing prefer*rvt*predict-no*H0
  9839. -->
  9840. Firing rl*prefer*rvt*predict-no*H0*2
  9841. -->
  9842. (S1 ^operator O1942 = 1.)
  9843. inner elaboration loop at bottom goal.
  9844. Retracting rl*prefer*rvt*predict-no*H0*2
  9845. -->
  9846. (S1 ^operator O1940 = 1.)
  9847. Retracting rl*prefer*rvt*predict-yes*H0*1
  9848. -->
  9849. (S1 ^operator O1939 = 0.)
  9850. --- END Proposal Phase ---
  9851. --- Decision Phase ---
  9852. RL update rl*prefer*rvt*predict-yes*H0*3 0.472337 -0.0815436 0.390793 -> 0.472327 -0.0815454 0.390781(R,m,v=1,0.941935,0.0550482)
  9853. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527788 0.0815652 0.609353 -> 0.527776 0.0815632 0.609339(R,m,v=1,1,0)
  9854. =>WM: (13624: S1 ^operator O1942)
  9855. 971: O: O1942 (predict-no)
  9856. --- END Decision Phase ---
  9857. --- Application Phase ---
  9858. --- Firing Productions (PE) For State At Depth 1 ---
  9859. --- Inner Elaboration Phase, active level 1 (S1) ---
  9860. Firing apply*operator
  9861. -->
  9862. (I3 ^predict-no N971 + :O )
  9863. Firing apply*operator*complete
  9864. -->
  9865. (I3 ^predict-yes N970 - :O )
  9866. inner elaboration loop at bottom goal.
  9867. --- Change Working Memory (PE) ---
  9868. =>WM: (13625: I3 ^predict-no N971)
  9869. <=WM: (13612: N970 ^status complete)
  9870. <=WM: (13611: I3 ^predict-yes N970)
  9871. --- Firing Productions (IE) For State At Depth 1 ---
  9872. --- Inner Elaboration Phase, active level 1 (S1) ---
  9873. Firing monitor*world
  9874. -->
  9875. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  9876. --- Change Working Memory (IE) ---
  9877. --- END Application Phase ---
  9878. --- Output Phase ---
  9879. ENV: Agent did: predict-no for direction U in state State-A
  9880. In State-A moving U
  9881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  9882. predict error 0
  9883. dir: dir isR
  9884. --- END Output Phase ---
  9885. --- Input Phase ---
  9886. =>WM: (13629: I2 ^dir R)
  9887. =>WM: (13628: I2 ^reward 1)
  9888. =>WM: (13627: I2 ^see 0)
  9889. =>WM: (13626: N971 ^status complete)
  9890. <=WM: (13615: I2 ^dir U)
  9891. <=WM: (13614: I2 ^reward 1)
  9892. <=WM: (13613: I2 ^see 1)
  9893. =>WM: (13630: I2 ^level-1 L1-root)
  9894. <=WM: (13616: I2 ^level-1 L1-root)
  9895. --- END Input Phase ---
  9896. --- Proposal Phase ---
  9897. --- Inner Elaboration Phase, active level 1 (S1) ---
  9898. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  9899. -->
  9900. (S1 ^operator O1941 = 0.8784169509457307)
  9901. Firing prefer*rvt*predict-yes*H0*5*H1
  9902. -->
  9903. Firing elaborate*copy-see-to-output-link
  9904. -->
  9905. (I3 ^see 0 +)
  9906. Firing elaborate*reward*based*on*reward
  9907. -->
  9908. (R975 ^value 1 +)
  9909. (R1 ^reward R975 +)
  9910. Firing propose*predict-yes
  9911. -->
  9912. (O1943 ^name predict-yes +)
  9913. (S1 ^operator O1943 +)
  9914. Firing propose*predict-no
  9915. -->
  9916. (O1944 ^name predict-no +)
  9917. (S1 ^operator O1944 +)
  9918. Firing rl*prefer*rvt*predict-no*H0*6
  9919. -->
  9920. (S1 ^operator O1942 = 0.9999810901454903)
  9921. Firing rl*prefer*rvt*predict-yes*H0*5
  9922. -->
  9923. (S1 ^operator O1941 = 0.1215994040064755)
  9924. Firing prefer*rvt*predict-yes*H0
  9925. -->
  9926. Firing prefer*rvt*predict-no*H0
  9927. -->
  9928. Firing elaborate*copy-dir-to-output-link
  9929. -->
  9930. (I3 ^dir R +)
  9931. inner elaboration loop at bottom goal.
  9932. Retracting elaborate*copy-see-to-output-link
  9933. -->
  9934. (I3 ^see 1 +)
  9935. Retracting propose*predict-no
  9936. -->
  9937. (O1942 ^name predict-no +)
  9938. (S1 ^operator O1942 +)
  9939. Retracting propose*predict-yes
  9940. -->
  9941. (O1941 ^name predict-yes +)
  9942. (S1 ^operator O1941 +)
  9943. Retracting elaborate*reward*based*on*reward
  9944. -->
  9945. (R974 ^value 1 +)
  9946. (R1 ^reward R974 +)
  9947. Retracting elaborate*copy-dir-to-output-link
  9948. -->
  9949. (I3 ^dir U +)
  9950. Retracting rl*prefer*rvt*predict-no*H0*2
  9951. -->
  9952. (S1 ^operator O1942 = 1.)
  9953. Retracting rl*prefer*rvt*predict-yes*H0*1
  9954. -->
  9955. (S1 ^operator O1941 = 0.)
  9956. =>WM: (13638: S1 ^operator O1944 +)
  9957. =>WM: (13637: S1 ^operator O1943 +)
  9958. =>WM: (13636: I3 ^dir R)
  9959. =>WM: (13635: O1944 ^name predict-no)
  9960. =>WM: (13634: O1943 ^name predict-yes)
  9961. =>WM: (13633: R975 ^value 1)
  9962. =>WM: (13632: R1 ^reward R975)
  9963. =>WM: (13631: I3 ^see 0)
  9964. <=WM: (13622: S1 ^operator O1941 +)
  9965. <=WM: (13623: S1 ^operator O1942 +)
  9966. <=WM: (13624: S1 ^operator O1942)
  9967. <=WM: (13621: I3 ^dir U)
  9968. <=WM: (13617: R1 ^reward R974)
  9969. <=WM: (13602: I3 ^see 1)
  9970. <=WM: (13620: O1942 ^name predict-no)
  9971. <=WM: (13619: O1941 ^name predict-yes)
  9972. <=WM: (13618: R974 ^value 1)
  9973. --- Inner Elaboration Phase, active level 1 (S1) ---
  9974. Firing prefer*rvt*predict-yes*H0
  9975. -->
  9976. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  9977. -->
  9978. (S1 ^operator O1943 = 0.8784169509457307)
  9979. Firing rl*prefer*rvt*predict-yes*H0*5
  9980. -->
  9981. (S1 ^operator O1943 = 0.1215994040064755)
  9982. Firing prefer*rvt*predict-yes*H0*5*H1
  9983. -->
  9984. Firing prefer*rvt*predict-no*H0
  9985. -->
  9986. Firing rl*prefer*rvt*predict-no*H0*6
  9987. -->
  9988. (S1 ^operator O1944 = 0.9999810901454903)
  9989. inner elaboration loop at bottom goal.
  9990. Retracting rl*prefer*rvt*predict-no*H0*6
  9991. -->
  9992. (S1 ^operator O1942 = 0.9999810901454903)
  9993. Retracting rl*prefer*rvt*predict-yes*H0*5
  9994. -->
  9995. (S1 ^operator O1941 = 0.1215994040064755)
  9996. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  9997. -->
  9998. (S1 ^operator O1941 = 0.8784169509457307)
  9999. --- END Proposal Phase ---
  10000. --- Decision Phase ---
  10001. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10002. =>WM: (13639: S1 ^operator O1943)
  10003. 972: O: O1943 (predict-yes)
  10004. --- END Decision Phase ---
  10005. --- Application Phase ---
  10006. --- Firing Productions (PE) For State At Depth 1 ---
  10007. --- Inner Elaboration Phase, active level 1 (S1) ---
  10008. Firing apply*operator
  10009. -->
  10010. (I3 ^predict-yes N972 + :O )
  10011. Firing apply*operator*complete
  10012. -->
  10013. (I3 ^predict-no N971 - :O )
  10014. inner elaboration loop at bottom goal.
  10015. --- Change Working Memory (PE) ---
  10016. =>WM: (13640: I3 ^predict-yes N972)
  10017. <=WM: (13626: N971 ^status complete)
  10018. <=WM: (13625: I3 ^predict-no N971)
  10019. --- Firing Productions (IE) For State At Depth 1 ---
  10020. --- Inner Elaboration Phase, active level 1 (S1) ---
  10021. Firing monitor*world
  10022. -->
  10023. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10024. --- Change Working Memory (IE) ---
  10025. --- END Application Phase ---
  10026. --- Output Phase ---
  10027. ENV: Agent did: predict-yes for direction R in state State-A
  10028. In State-A moving R
  10029. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10030. predict error 0
  10031. dir: dir isU
  10032. --- END Output Phase ---
  10033. --- Input Phase ---
  10034. =>WM: (13644: I2 ^dir U)
  10035. =>WM: (13643: I2 ^reward 1)
  10036. =>WM: (13642: I2 ^see 1)
  10037. =>WM: (13641: N972 ^status complete)
  10038. <=WM: (13629: I2 ^dir R)
  10039. <=WM: (13628: I2 ^reward 1)
  10040. <=WM: (13627: I2 ^see 0)
  10041. =>WM: (13645: I2 ^level-1 R1-root)
  10042. <=WM: (13630: I2 ^level-1 L1-root)
  10043. --- END Input Phase ---
  10044. --- Proposal Phase ---
  10045. --- Inner Elaboration Phase, active level 1 (S1) ---
  10046. Firing elaborate*copy-see-to-output-link
  10047. -->
  10048. (I3 ^see 1 +)
  10049. Firing elaborate*reward*based*on*reward
  10050. -->
  10051. (R976 ^value 1 +)
  10052. (R1 ^reward R976 +)
  10053. Firing propose*predict-yes
  10054. -->
  10055. (O1945 ^name predict-yes +)
  10056. (S1 ^operator O1945 +)
  10057. Firing propose*predict-no
  10058. -->
  10059. (O1946 ^name predict-no +)
  10060. (S1 ^operator O1946 +)
  10061. Firing rl*prefer*rvt*predict-no*H0*2
  10062. -->
  10063. (S1 ^operator O1944 = 1.)
  10064. Firing rl*prefer*rvt*predict-yes*H0*1
  10065. -->
  10066. (S1 ^operator O1943 = 0.)
  10067. Firing prefer*rvt*predict-yes*H0
  10068. -->
  10069. Firing prefer*rvt*predict-no*H0
  10070. -->
  10071. Firing elaborate*copy-dir-to-output-link
  10072. -->
  10073. (I3 ^dir U +)
  10074. inner elaboration loop at bottom goal.
  10075. Retracting elaborate*copy-see-to-output-link
  10076. -->
  10077. (I3 ^see 0 +)
  10078. Retracting propose*predict-no
  10079. -->
  10080. (O1944 ^name predict-no +)
  10081. (S1 ^operator O1944 +)
  10082. Retracting propose*predict-yes
  10083. -->
  10084. (O1943 ^name predict-yes +)
  10085. (S1 ^operator O1943 +)
  10086. Retracting elaborate*reward*based*on*reward
  10087. -->
  10088. (R975 ^value 1 +)
  10089. (R1 ^reward R975 +)
  10090. Retracting elaborate*copy-dir-to-output-link
  10091. -->
  10092. (I3 ^dir R +)
  10093. Retracting rl*prefer*rvt*predict-no*H0*6
  10094. -->
  10095. (S1 ^operator O1944 = 0.9999810901454903)
  10096. Retracting rl*prefer*rvt*predict-yes*H0*5
  10097. -->
  10098. (S1 ^operator O1943 = 0.1215994040064755)
  10099. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  10100. -->
  10101. (S1 ^operator O1943 = 0.8784169509457307)
  10102. =>WM: (13653: S1 ^operator O1946 +)
  10103. =>WM: (13652: S1 ^operator O1945 +)
  10104. =>WM: (13651: I3 ^dir U)
  10105. =>WM: (13650: O1946 ^name predict-no)
  10106. =>WM: (13649: O1945 ^name predict-yes)
  10107. =>WM: (13648: R976 ^value 1)
  10108. =>WM: (13647: R1 ^reward R976)
  10109. =>WM: (13646: I3 ^see 1)
  10110. <=WM: (13637: S1 ^operator O1943 +)
  10111. <=WM: (13639: S1 ^operator O1943)
  10112. <=WM: (13638: S1 ^operator O1944 +)
  10113. <=WM: (13636: I3 ^dir R)
  10114. <=WM: (13632: R1 ^reward R975)
  10115. <=WM: (13631: I3 ^see 0)
  10116. <=WM: (13635: O1944 ^name predict-no)
  10117. <=WM: (13634: O1943 ^name predict-yes)
  10118. <=WM: (13633: R975 ^value 1)
  10119. --- Inner Elaboration Phase, active level 1 (S1) ---
  10120. Firing prefer*rvt*predict-yes*H0
  10121. -->
  10122. Firing rl*prefer*rvt*predict-yes*H0*1
  10123. -->
  10124. (S1 ^operator O1945 = 0.)
  10125. Firing prefer*rvt*predict-no*H0
  10126. -->
  10127. Firing rl*prefer*rvt*predict-no*H0*2
  10128. -->
  10129. (S1 ^operator O1946 = 1.)
  10130. inner elaboration loop at bottom goal.
  10131. Retracting rl*prefer*rvt*predict-no*H0*2
  10132. -->
  10133. (S1 ^operator O1944 = 1.)
  10134. Retracting rl*prefer*rvt*predict-yes*H0*1
  10135. -->
  10136. (S1 ^operator O1943 = 0.)
  10137. --- END Proposal Phase ---
  10138. --- Decision Phase ---
  10139. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.860465,0.120767)
  10140. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465488 0.412929 0.878417 -> 0.465487 0.412928 0.878415(R,m,v=1,1,0)
  10141. =>WM: (13654: S1 ^operator O1946)
  10142. 973: O: O1946 (predict-no)
  10143. --- END Decision Phase ---
  10144. --- Application Phase ---
  10145. --- Firing Productions (PE) For State At Depth 1 ---
  10146. --- Inner Elaboration Phase, active level 1 (S1) ---
  10147. Firing apply*operator
  10148. -->
  10149. (I3 ^predict-no N973 + :O )
  10150. Firing apply*operator*complete
  10151. -->
  10152. (I3 ^predict-yes N972 - :O )
  10153. inner elaboration loop at bottom goal.
  10154. --- Change Working Memory (PE) ---
  10155. =>WM: (13655: I3 ^predict-no N973)
  10156. <=WM: (13641: N972 ^status complete)
  10157. <=WM: (13640: I3 ^predict-yes N972)
  10158. --- Firing Productions (IE) For State At Depth 1 ---
  10159. --- Inner Elaboration Phase, active level 1 (S1) ---
  10160. Firing monitor*world
  10161. -->
  10162. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10163. --- Change Working Memory (IE) ---
  10164. --- END Application Phase ---
  10165. --- Output Phase ---
  10166. ENV: Agent did: predict-no for direction U in state State-B
  10167. In State-B moving U
  10168. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  10169. predict error 0
  10170. dir: dir isL
  10171. --- END Output Phase ---
  10172. --- Input Phase ---
  10173. =>WM: (13659: I2 ^dir L)
  10174. =>WM: (13658: I2 ^reward 1)
  10175. =>WM: (13657: I2 ^see 0)
  10176. =>WM: (13656: N973 ^status complete)
  10177. <=WM: (13644: I2 ^dir U)
  10178. <=WM: (13643: I2 ^reward 1)
  10179. <=WM: (13642: I2 ^see 1)
  10180. =>WM: (13660: I2 ^level-1 R1-root)
  10181. <=WM: (13645: I2 ^level-1 R1-root)
  10182. --- END Input Phase ---
  10183. --- Proposal Phase ---
  10184. --- Inner Elaboration Phase, active level 1 (S1) ---
  10185. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10186. -->
  10187. (S1 ^operator O1946 = -0.168718511744511)
  10188. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10189. -->
  10190. (S1 ^operator O1945 = 0.609338805157315)
  10191. Firing prefer*rvt*predict-no*H0*4*H1
  10192. -->
  10193. Firing prefer*rvt*predict-yes*H0*3*H1
  10194. -->
  10195. Firing elaborate*copy-see-to-output-link
  10196. -->
  10197. (I3 ^see 0 +)
  10198. Firing elaborate*reward*based*on*reward
  10199. -->
  10200. (R977 ^value 1 +)
  10201. (R1 ^reward R977 +)
  10202. Firing propose*predict-yes
  10203. -->
  10204. (O1947 ^name predict-yes +)
  10205. (S1 ^operator O1947 +)
  10206. Firing propose*predict-no
  10207. -->
  10208. (O1948 ^name predict-no +)
  10209. (S1 ^operator O1948 +)
  10210. Firing rl*prefer*rvt*predict-no*H0*4
  10211. -->
  10212. (S1 ^operator O1946 = 0.3145032394390637)
  10213. Firing rl*prefer*rvt*predict-yes*H0*3
  10214. -->
  10215. (S1 ^operator O1945 = 0.3907810808803528)
  10216. Firing prefer*rvt*predict-yes*H0
  10217. -->
  10218. Firing prefer*rvt*predict-no*H0
  10219. -->
  10220. Firing elaborate*copy-dir-to-output-link
  10221. -->
  10222. (I3 ^dir L +)
  10223. inner elaboration loop at bottom goal.
  10224. Retracting elaborate*copy-see-to-output-link
  10225. -->
  10226. (I3 ^see 1 +)
  10227. Retracting propose*predict-no
  10228. -->
  10229. (O1946 ^name predict-no +)
  10230. (S1 ^operator O1946 +)
  10231. Retracting propose*predict-yes
  10232. -->
  10233. (O1945 ^name predict-yes +)
  10234. (S1 ^operator O1945 +)
  10235. Retracting elaborate*reward*based*on*reward
  10236. -->
  10237. (R976 ^value 1 +)
  10238. (R1 ^reward R976 +)
  10239. Retracting elaborate*copy-dir-to-output-link
  10240. -->
  10241. (I3 ^dir U +)
  10242. Retracting rl*prefer*rvt*predict-no*H0*2
  10243. -->
  10244. (S1 ^operator O1946 = 1.)
  10245. Retracting rl*prefer*rvt*predict-yes*H0*1
  10246. -->
  10247. (S1 ^operator O1945 = 0.)
  10248. =>WM: (13668: S1 ^operator O1948 +)
  10249. =>WM: (13667: S1 ^operator O1947 +)
  10250. =>WM: (13666: I3 ^dir L)
  10251. =>WM: (13665: O1948 ^name predict-no)
  10252. =>WM: (13664: O1947 ^name predict-yes)
  10253. =>WM: (13663: R977 ^value 1)
  10254. =>WM: (13662: R1 ^reward R977)
  10255. =>WM: (13661: I3 ^see 0)
  10256. <=WM: (13652: S1 ^operator O1945 +)
  10257. <=WM: (13653: S1 ^operator O1946 +)
  10258. <=WM: (13654: S1 ^operator O1946)
  10259. <=WM: (13651: I3 ^dir U)
  10260. <=WM: (13647: R1 ^reward R976)
  10261. <=WM: (13646: I3 ^see 1)
  10262. <=WM: (13650: O1946 ^name predict-no)
  10263. <=WM: (13649: O1945 ^name predict-yes)
  10264. <=WM: (13648: R976 ^value 1)
  10265. --- Inner Elaboration Phase, active level 1 (S1) ---
  10266. Firing prefer*rvt*predict-yes*H0
  10267. -->
  10268. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10269. -->
  10270. (S1 ^operator O1947 = 0.609338805157315)
  10271. Firing rl*prefer*rvt*predict-yes*H0*3
  10272. -->
  10273. (S1 ^operator O1947 = 0.3907810808803528)
  10274. Firing prefer*rvt*predict-yes*H0*3*H1
  10275. -->
  10276. Firing prefer*rvt*predict-no*H0
  10277. -->
  10278. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10279. -->
  10280. (S1 ^operator O1948 = -0.168718511744511)
  10281. Firing rl*prefer*rvt*predict-no*H0*4
  10282. -->
  10283. (S1 ^operator O1948 = 0.3145032394390637)
  10284. Firing prefer*rvt*predict-no*H0*4*H1
  10285. -->
  10286. inner elaboration loop at bottom goal.
  10287. Retracting rl*prefer*rvt*predict-no*H0*4
  10288. -->
  10289. (S1 ^operator O1946 = 0.3145032394390637)
  10290. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  10291. -->
  10292. (S1 ^operator O1946 = -0.168718511744511)
  10293. Retracting rl*prefer*rvt*predict-yes*H0*3
  10294. -->
  10295. (S1 ^operator O1945 = 0.3907810808803528)
  10296. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  10297. -->
  10298. (S1 ^operator O1945 = 0.609338805157315)
  10299. --- END Proposal Phase ---
  10300. --- Decision Phase ---
  10301. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10302. =>WM: (13669: S1 ^operator O1947)
  10303. 974: O: O1947 (predict-yes)
  10304. --- END Decision Phase ---
  10305. --- Application Phase ---
  10306. --- Firing Productions (PE) For State At Depth 1 ---
  10307. --- Inner Elaboration Phase, active level 1 (S1) ---
  10308. Firing apply*operator
  10309. -->
  10310. (I3 ^predict-yes N974 + :O )
  10311. Firing apply*operator*complete
  10312. -->
  10313. (I3 ^predict-no N973 - :O )
  10314. inner elaboration loop at bottom goal.
  10315. --- Change Working Memory (PE) ---
  10316. =>WM: (13670: I3 ^predict-yes N974)
  10317. <=WM: (13656: N973 ^status complete)
  10318. <=WM: (13655: I3 ^predict-no N973)
  10319. --- Firing Productions (IE) For State At Depth 1 ---
  10320. --- Inner Elaboration Phase, active level 1 (S1) ---
  10321. Firing monitor*world
  10322. -->
  10323. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10324. --- Change Working Memory (IE) ---
  10325. --- END Application Phase ---
  10326. --- Output Phase ---
  10327. ENV: Agent did: predict-yes for direction L in state State-B
  10328. In State-B moving L
  10329. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  10330. predict error 0
  10331. dir: dir isL
  10332. --- END Output Phase ---
  10333. |\--- Input Phase ---
  10334. =>WM: (13674: I2 ^dir L)
  10335. =>WM: (13673: I2 ^reward 1)
  10336. =>WM: (13672: I2 ^see 1)
  10337. =>WM: (13671: N974 ^status complete)
  10338. <=WM: (13659: I2 ^dir L)
  10339. <=WM: (13658: I2 ^reward 1)
  10340. <=WM: (13657: I2 ^see 0)
  10341. =>WM: (13675: I2 ^level-1 L1-root)
  10342. <=WM: (13660: I2 ^level-1 R1-root)
  10343. --- END Input Phase ---
  10344. --- Proposal Phase ---
  10345. --- Inner Elaboration Phase, active level 1 (S1) ---
  10346. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  10347. -->
  10348. (S1 ^operator O1947 = -0.2062723012911647)
  10349. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  10350. -->
  10351. (S1 ^operator O1948 = 0.6855461517499103)
  10352. Firing prefer*rvt*predict-no*H0*4*H1
  10353. -->
  10354. Firing prefer*rvt*predict-yes*H0*3*H1
  10355. -->
  10356. Firing elaborate*copy-see-to-output-link
  10357. -->
  10358. (I3 ^see 1 +)
  10359. Firing elaborate*reward*based*on*reward
  10360. -->
  10361. (R978 ^value 1 +)
  10362. (R1 ^reward R978 +)
  10363. Firing propose*predict-yes
  10364. -->
  10365. (O1949 ^name predict-yes +)
  10366. (S1 ^operator O1949 +)
  10367. Firing propose*predict-no
  10368. -->
  10369. (O1950 ^name predict-no +)
  10370. (S1 ^operator O1950 +)
  10371. Firing rl*prefer*rvt*predict-no*H0*4
  10372. -->
  10373. (S1 ^operator O1948 = 0.3145032394390637)
  10374. Firing rl*prefer*rvt*predict-yes*H0*3
  10375. -->
  10376. (S1 ^operator O1947 = 0.3907810808803528)
  10377. Firing prefer*rvt*predict-yes*H0
  10378. -->
  10379. Firing prefer*rvt*predict-no*H0
  10380. -->
  10381. Firing elaborate*copy-dir-to-output-link
  10382. -->
  10383. (I3 ^dir L +)
  10384. inner elaboration loop at bottom goal.
  10385. Retracting elaborate*copy-see-to-output-link
  10386. -->
  10387. (I3 ^see 0 +)
  10388. Retracting propose*predict-no
  10389. -->
  10390. (O1948 ^name predict-no +)
  10391. (S1 ^operator O1948 +)
  10392. Retracting propose*predict-yes
  10393. -->
  10394. (O1947 ^name predict-yes +)
  10395. (S1 ^operator O1947 +)
  10396. Retracting elaborate*reward*based*on*reward
  10397. -->
  10398. (R977 ^value 1 +)
  10399. (R1 ^reward R977 +)
  10400. Retracting elaborate*copy-dir-to-output-link
  10401. -->
  10402. (I3 ^dir L +)
  10403. Retracting rl*prefer*rvt*predict-no*H0*4
  10404. -->
  10405. (S1 ^operator O1948 = 0.3145032394390637)
  10406. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  10407. -->
  10408. (S1 ^operator O1948 = -0.168718511744511)
  10409. Retracting rl*prefer*rvt*predict-yes*H0*3
  10410. -->
  10411. (S1 ^operator O1947 = 0.3907810808803528)
  10412. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  10413. -->
  10414. (S1 ^operator O1947 = 0.609338805157315)
  10415. =>WM: (13682: S1 ^operator O1950 +)
  10416. =>WM: (13681: S1 ^operator O1949 +)
  10417. =>WM: (13680: O1950 ^name predict-no)
  10418. =>WM: (13679: O1949 ^name predict-yes)
  10419. =>WM: (13678: R978 ^value 1)
  10420. =>WM: (13677: R1 ^reward R978)
  10421. =>WM: (13676: I3 ^see 1)
  10422. <=WM: (13667: S1 ^operator O1947 +)
  10423. <=WM: (13669: S1 ^operator O1947)
  10424. <=WM: (13668: S1 ^operator O1948 +)
  10425. <=WM: (13662: R1 ^reward R977)
  10426. <=WM: (13661: I3 ^see 0)
  10427. <=WM: (13665: O1948 ^name predict-no)
  10428. <=WM: (13664: O1947 ^name predict-yes)
  10429. <=WM: (13663: R977 ^value 1)
  10430. --- Inner Elaboration Phase, active level 1 (S1) ---
  10431. Firing prefer*rvt*predict-yes*H0
  10432. -->
  10433. Firing rl*prefer*rvt*predict-yes*H0*3
  10434. -->
  10435. (S1 ^operator O1949 = 0.3907810808803528)
  10436. Firing prefer*rvt*predict-yes*H0*3*H1
  10437. -->
  10438. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  10439. -->
  10440. (S1 ^operator O1949 = -0.2062723012911647)
  10441. Firing prefer*rvt*predict-no*H0
  10442. -->
  10443. Firing rl*prefer*rvt*predict-no*H0*4
  10444. -->
  10445. (S1 ^operator O1950 = 0.3145032394390637)
  10446. Firing prefer*rvt*predict-no*H0*4*H1
  10447. -->
  10448. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  10449. -->
  10450. (S1 ^operator O1950 = 0.6855461517499103)
  10451. inner elaboration loop at bottom goal.
  10452. Retracting rl*prefer*rvt*predict-no*H0*4
  10453. -->
  10454. (S1 ^operator O1948 = 0.3145032394390637)
  10455. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  10456. -->
  10457. (S1 ^operator O1948 = 0.6855461517499103)
  10458. Retracting rl*prefer*rvt*predict-yes*H0*3
  10459. -->
  10460. (S1 ^operator O1947 = 0.3907810808803528)
  10461. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  10462. -->
  10463. (S1 ^operator O1947 = -0.2062723012911647)
  10464. --- END Proposal Phase ---
  10465. --- Decision Phase ---
  10466. RL update rl*prefer*rvt*predict-yes*H0*3 0.472327 -0.0815454 0.390781 -> 0.472318 -0.0815469 0.390771(R,m,v=1,0.942308,0.0547146)
  10467. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527776 0.0815632 0.609339 -> 0.527766 0.0815615 0.609327(R,m,v=1,1,0)
  10468. =>WM: (13683: S1 ^operator O1950)
  10469. 975: O: O1950 (predict-no)
  10470. --- END Decision Phase ---
  10471. --- Application Phase ---
  10472. --- Firing Productions (PE) For State At Depth 1 ---
  10473. --- Inner Elaboration Phase, active level 1 (S1) ---
  10474. Firing apply*operator
  10475. -->
  10476. (I3 ^predict-no N975 + :O )
  10477. Firing apply*operator*complete
  10478. -->
  10479. (I3 ^predict-yes N974 - :O )
  10480. inner elaboration loop at bottom goal.
  10481. --- Change Working Memory (PE) ---
  10482. =>WM: (13684: I3 ^predict-no N975)
  10483. <=WM: (13671: N974 ^status complete)
  10484. <=WM: (13670: I3 ^predict-yes N974)
  10485. --- Firing Productions (IE) For State At Depth 1 ---
  10486. --- Inner Elaboration Phase, active level 1 (S1) ---
  10487. Firing monitor*world
  10488. -->
  10489. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10490. --- Change Working Memory (IE) ---
  10491. --- END Application Phase ---
  10492. --- Output Phase ---
  10493. ENV: Agent did: predict-no for direction L in state State-A
  10494. In State-A moving L
  10495. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10496. predict error 0
  10497. dir: dir isU
  10498. --- END Output Phase ---
  10499. -/|--- Input Phase ---
  10500. =>WM: (13688: I2 ^dir U)
  10501. =>WM: (13687: I2 ^reward 1)
  10502. =>WM: (13686: I2 ^see 0)
  10503. =>WM: (13685: N975 ^status complete)
  10504. <=WM: (13674: I2 ^dir L)
  10505. <=WM: (13673: I2 ^reward 1)
  10506. <=WM: (13672: I2 ^see 1)
  10507. =>WM: (13689: I2 ^level-1 L0-root)
  10508. <=WM: (13675: I2 ^level-1 L1-root)
  10509. --- END Input Phase ---
  10510. --- Proposal Phase ---
  10511. --- Inner Elaboration Phase, active level 1 (S1) ---
  10512. Firing elaborate*copy-see-to-output-link
  10513. -->
  10514. (I3 ^see 0 +)
  10515. Firing elaborate*reward*based*on*reward
  10516. -->
  10517. (R979 ^value 1 +)
  10518. (R1 ^reward R979 +)
  10519. Firing propose*predict-yes
  10520. -->
  10521. (O1951 ^name predict-yes +)
  10522. (S1 ^operator O1951 +)
  10523. Firing propose*predict-no
  10524. -->
  10525. (O1952 ^name predict-no +)
  10526. (S1 ^operator O1952 +)
  10527. Firing rl*prefer*rvt*predict-no*H0*2
  10528. -->
  10529. (S1 ^operator O1950 = 1.)
  10530. Firing rl*prefer*rvt*predict-yes*H0*1
  10531. -->
  10532. (S1 ^operator O1949 = 0.)
  10533. Firing prefer*rvt*predict-yes*H0
  10534. -->
  10535. Firing prefer*rvt*predict-no*H0
  10536. -->
  10537. Firing elaborate*copy-dir-to-output-link
  10538. -->
  10539. (I3 ^dir U +)
  10540. inner elaboration loop at bottom goal.
  10541. Retracting elaborate*copy-see-to-output-link
  10542. -->
  10543. (I3 ^see 1 +)
  10544. Retracting propose*predict-no
  10545. -->
  10546. (O1950 ^name predict-no +)
  10547. (S1 ^operator O1950 +)
  10548. Retracting propose*predict-yes
  10549. -->
  10550. (O1949 ^name predict-yes +)
  10551. (S1 ^operator O1949 +)
  10552. Retracting elaborate*reward*based*on*reward
  10553. -->
  10554. (R978 ^value 1 +)
  10555. (R1 ^reward R978 +)
  10556. Retracting elaborate*copy-dir-to-output-link
  10557. -->
  10558. (I3 ^dir L +)
  10559. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  10560. -->
  10561. (S1 ^operator O1950 = 0.6855461517499103)
  10562. Retracting rl*prefer*rvt*predict-no*H0*4
  10563. -->
  10564. (S1 ^operator O1950 = 0.3145032394390637)
  10565. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  10566. -->
  10567. (S1 ^operator O1949 = -0.2062723012911647)
  10568. Retracting rl*prefer*rvt*predict-yes*H0*3
  10569. -->
  10570. (S1 ^operator O1949 = 0.3907711727075364)
  10571. =>WM: (13697: S1 ^operator O1952 +)
  10572. =>WM: (13696: S1 ^operator O1951 +)
  10573. =>WM: (13695: I3 ^dir U)
  10574. =>WM: (13694: O1952 ^name predict-no)
  10575. =>WM: (13693: O1951 ^name predict-yes)
  10576. =>WM: (13692: R979 ^value 1)
  10577. =>WM: (13691: R1 ^reward R979)
  10578. =>WM: (13690: I3 ^see 0)
  10579. <=WM: (13681: S1 ^operator O1949 +)
  10580. <=WM: (13682: S1 ^operator O1950 +)
  10581. <=WM: (13683: S1 ^operator O1950)
  10582. <=WM: (13666: I3 ^dir L)
  10583. <=WM: (13677: R1 ^reward R978)
  10584. <=WM: (13676: I3 ^see 1)
  10585. <=WM: (13680: O1950 ^name predict-no)
  10586. <=WM: (13679: O1949 ^name predict-yes)
  10587. <=WM: (13678: R978 ^value 1)
  10588. --- Inner Elaboration Phase, active level 1 (S1) ---
  10589. Firing prefer*rvt*predict-yes*H0
  10590. -->
  10591. Firing rl*prefer*rvt*predict-yes*H0*1
  10592. -->
  10593. (S1 ^operator O1951 = 0.)
  10594. Firing prefer*rvt*predict-no*H0
  10595. -->
  10596. Firing rl*prefer*rvt*predict-no*H0*2
  10597. -->
  10598. (S1 ^operator O1952 = 1.)
  10599. inner elaboration loop at bottom goal.
  10600. Retracting rl*prefer*rvt*predict-no*H0*2
  10601. -->
  10602. (S1 ^operator O1950 = 1.)
  10603. Retracting rl*prefer*rvt*predict-yes*H0*1
  10604. -->
  10605. (S1 ^operator O1949 = 0.)
  10606. --- END Proposal Phase ---
  10607. --- Decision Phase ---
  10608. RL update rl*prefer*rvt*predict-no*H0*4 0.478552 -0.164048 0.314503 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.92053,0.0736424)
  10609. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521493 0.164053 0.685546 -> 0.521489 0.164052 0.685541(R,m,v=1,1,0)
  10610. =>WM: (13698: S1 ^operator O1952)
  10611. 976: O: O1952 (predict-no)
  10612. --- END Decision Phase ---
  10613. --- Application Phase ---
  10614. --- Firing Productions (PE) For State At Depth 1 ---
  10615. --- Inner Elaboration Phase, active level 1 (S1) ---
  10616. Firing apply*operator
  10617. -->
  10618. (I3 ^predict-no N976 + :O )
  10619. Firing apply*operator*complete
  10620. -->
  10621. (I3 ^predict-no N975 - :O )
  10622. inner elaboration loop at bottom goal.
  10623. --- Change Working Memory (PE) ---
  10624. =>WM: (13699: I3 ^predict-no N976)
  10625. <=WM: (13685: N975 ^status complete)
  10626. <=WM: (13684: I3 ^predict-no N975)
  10627. --- Firing Productions (IE) For State At Depth 1 ---
  10628. --- Inner Elaboration Phase, active level 1 (S1) ---
  10629. Firing monitor*world
  10630. -->
  10631. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10632. --- Change Working Memory (IE) ---
  10633. --- END Application Phase ---
  10634. --- Output Phase ---
  10635. ENV: Agent did: predict-no for direction U in state State-A
  10636. In State-A moving U
  10637. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10638. predict error 0
  10639. dir: dir isL
  10640. --- END Output Phase ---
  10641. \-/--- Input Phase ---
  10642. =>WM: (13703: I2 ^dir L)
  10643. =>WM: (13702: I2 ^reward 1)
  10644. =>WM: (13701: I2 ^see 0)
  10645. =>WM: (13700: N976 ^status complete)
  10646. <=WM: (13688: I2 ^dir U)
  10647. <=WM: (13687: I2 ^reward 1)
  10648. <=WM: (13686: I2 ^see 0)
  10649. =>WM: (13704: I2 ^level-1 L0-root)
  10650. <=WM: (13689: I2 ^level-1 L0-root)
  10651. --- END Input Phase ---
  10652. --- Proposal Phase ---
  10653. --- Inner Elaboration Phase, active level 1 (S1) ---
  10654. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  10655. -->
  10656. (S1 ^operator O1951 = -0.208713043145708)
  10657. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  10658. -->
  10659. (S1 ^operator O1952 = 0.6854177156873388)
  10660. Firing prefer*rvt*predict-no*H0*4*H1
  10661. -->
  10662. Firing prefer*rvt*predict-yes*H0*3*H1
  10663. -->
  10664. Firing elaborate*copy-see-to-output-link
  10665. -->
  10666. (I3 ^see 0 +)
  10667. Firing elaborate*reward*based*on*reward
  10668. -->
  10669. (R980 ^value 1 +)
  10670. (R1 ^reward R980 +)
  10671. Firing propose*predict-yes
  10672. -->
  10673. (O1953 ^name predict-yes +)
  10674. (S1 ^operator O1953 +)
  10675. Firing propose*predict-no
  10676. -->
  10677. (O1954 ^name predict-no +)
  10678. (S1 ^operator O1954 +)
  10679. Firing rl*prefer*rvt*predict-no*H0*4
  10680. -->
  10681. (S1 ^operator O1952 = 0.3144991353263821)
  10682. Firing rl*prefer*rvt*predict-yes*H0*3
  10683. -->
  10684. (S1 ^operator O1951 = 0.3907711727075364)
  10685. Firing prefer*rvt*predict-yes*H0
  10686. -->
  10687. Firing prefer*rvt*predict-no*H0
  10688. -->
  10689. Firing elaborate*copy-dir-to-output-link
  10690. -->
  10691. (I3 ^dir L +)
  10692. inner elaboration loop at bottom goal.
  10693. Retracting elaborate*copy-see-to-output-link
  10694. -->
  10695. (I3 ^see 0 +)
  10696. Retracting propose*predict-no
  10697. -->
  10698. (O1952 ^name predict-no +)
  10699. (S1 ^operator O1952 +)
  10700. Retracting propose*predict-yes
  10701. -->
  10702. (O1951 ^name predict-yes +)
  10703. (S1 ^operator O1951 +)
  10704. Retracting elaborate*reward*based*on*reward
  10705. -->
  10706. (R979 ^value 1 +)
  10707. (R1 ^reward R979 +)
  10708. Retracting elaborate*copy-dir-to-output-link
  10709. -->
  10710. (I3 ^dir U +)
  10711. Retracting rl*prefer*rvt*predict-no*H0*2
  10712. -->
  10713. (S1 ^operator O1952 = 1.)
  10714. Retracting rl*prefer*rvt*predict-yes*H0*1
  10715. -->
  10716. (S1 ^operator O1951 = 0.)
  10717. =>WM: (13711: S1 ^operator O1954 +)
  10718. =>WM: (13710: S1 ^operator O1953 +)
  10719. =>WM: (13709: I3 ^dir L)
  10720. =>WM: (13708: O1954 ^name predict-no)
  10721. =>WM: (13707: O1953 ^name predict-yes)
  10722. =>WM: (13706: R980 ^value 1)
  10723. =>WM: (13705: R1 ^reward R980)
  10724. <=WM: (13696: S1 ^operator O1951 +)
  10725. <=WM: (13697: S1 ^operator O1952 +)
  10726. <=WM: (13698: S1 ^operator O1952)
  10727. <=WM: (13695: I3 ^dir U)
  10728. <=WM: (13691: R1 ^reward R979)
  10729. <=WM: (13694: O1952 ^name predict-no)
  10730. <=WM: (13693: O1951 ^name predict-yes)
  10731. <=WM: (13692: R979 ^value 1)
  10732. --- Inner Elaboration Phase, active level 1 (S1) ---
  10733. Firing prefer*rvt*predict-yes*H0
  10734. -->
  10735. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  10736. -->
  10737. (S1 ^operator O1953 = -0.208713043145708)
  10738. Firing rl*prefer*rvt*predict-yes*H0*3
  10739. -->
  10740. (S1 ^operator O1953 = 0.3907711727075364)
  10741. Firing prefer*rvt*predict-yes*H0*3*H1
  10742. -->
  10743. Firing prefer*rvt*predict-no*H0
  10744. -->
  10745. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  10746. -->
  10747. (S1 ^operator O1954 = 0.6854177156873388)
  10748. Firing rl*prefer*rvt*predict-no*H0*4
  10749. -->
  10750. (S1 ^operator O1954 = 0.3144991353263821)
  10751. Firing prefer*rvt*predict-no*H0*4*H1
  10752. -->
  10753. inner elaboration loop at bottom goal.
  10754. Retracting rl*prefer*rvt*predict-no*H0*4
  10755. -->
  10756. (S1 ^operator O1952 = 0.3144991353263821)
  10757. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  10758. -->
  10759. (S1 ^operator O1952 = 0.6854177156873388)
  10760. Retracting rl*prefer*rvt*predict-yes*H0*3
  10761. -->
  10762. (S1 ^operator O1951 = 0.3907711727075364)
  10763. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  10764. -->
  10765. (S1 ^operator O1951 = -0.208713043145708)
  10766. --- END Proposal Phase ---
  10767. --- Decision Phase ---
  10768. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  10769. =>WM: (13712: S1 ^operator O1954)
  10770. 977: O: O1954 (predict-no)
  10771. --- END Decision Phase ---
  10772. --- Application Phase ---
  10773. --- Firing Productions (PE) For State At Depth 1 ---
  10774. --- Inner Elaboration Phase, active level 1 (S1) ---
  10775. Firing apply*operator
  10776. -->
  10777. (I3 ^predict-no N977 + :O )
  10778. Firing apply*operator*complete
  10779. -->
  10780. (I3 ^predict-no N976 - :O )
  10781. inner elaboration loop at bottom goal.
  10782. --- Change Working Memory (PE) ---
  10783. =>WM: (13713: I3 ^predict-no N977)
  10784. <=WM: (13700: N976 ^status complete)
  10785. <=WM: (13699: I3 ^predict-no N976)
  10786. --- Firing Productions (IE) For State At Depth 1 ---
  10787. --- Inner Elaboration Phase, active level 1 (S1) ---
  10788. Firing monitor*world
  10789. -->
  10790. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  10791. --- Change Working Memory (IE) ---
  10792. --- END Application Phase ---
  10793. --- Output Phase ---
  10794. ENV: Agent did: predict-no for direction L in state State-A
  10795. In State-A moving L
  10796. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  10797. predict error 0
  10798. dir: dir isR
  10799. --- END Output Phase ---
  10800. |\---- Input Phase ---
  10801. =>WM: (13717: I2 ^dir R)
  10802. =>WM: (13716: I2 ^reward 1)
  10803. =>WM: (13715: I2 ^see 0)
  10804. =>WM: (13714: N977 ^status complete)
  10805. <=WM: (13703: I2 ^dir L)
  10806. <=WM: (13702: I2 ^reward 1)
  10807. <=WM: (13701: I2 ^see 0)
  10808. =>WM: (13718: I2 ^level-1 L0-root)
  10809. <=WM: (13704: I2 ^level-1 L0-root)
  10810. --- END Input Phase ---
  10811. --- Proposal Phase ---
  10812. --- Inner Elaboration Phase, active level 1 (S1) ---
  10813. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  10814. -->
  10815. (S1 ^operator O1953 = 0.8783927855286688)
  10816. Firing prefer*rvt*predict-yes*H0*5*H1
  10817. -->
  10818. Firing elaborate*copy-see-to-output-link
  10819. -->
  10820. (I3 ^see 0 +)
  10821. Firing elaborate*reward*based*on*reward
  10822. -->
  10823. (R981 ^value 1 +)
  10824. (R1 ^reward R981 +)
  10825. Firing propose*predict-yes
  10826. -->
  10827. (O1955 ^name predict-yes +)
  10828. (S1 ^operator O1955 +)
  10829. Firing propose*predict-no
  10830. -->
  10831. (O1956 ^name predict-no +)
  10832. (S1 ^operator O1956 +)
  10833. Firing rl*prefer*rvt*predict-no*H0*6
  10834. -->
  10835. (S1 ^operator O1954 = 0.9999810901454903)
  10836. Firing rl*prefer*rvt*predict-yes*H0*5
  10837. -->
  10838. (S1 ^operator O1953 = 0.1215980737936329)
  10839. Firing prefer*rvt*predict-yes*H0
  10840. -->
  10841. Firing prefer*rvt*predict-no*H0
  10842. -->
  10843. Firing elaborate*copy-dir-to-output-link
  10844. -->
  10845. (I3 ^dir R +)
  10846. inner elaboration loop at bottom goal.
  10847. Retracting elaborate*copy-see-to-output-link
  10848. -->
  10849. (I3 ^see 0 +)
  10850. Retracting propose*predict-no
  10851. -->
  10852. (O1954 ^name predict-no +)
  10853. (S1 ^operator O1954 +)
  10854. Retracting propose*predict-yes
  10855. -->
  10856. (O1953 ^name predict-yes +)
  10857. (S1 ^operator O1953 +)
  10858. Retracting elaborate*reward*based*on*reward
  10859. -->
  10860. (R980 ^value 1 +)
  10861. (R1 ^reward R980 +)
  10862. Retracting elaborate*copy-dir-to-output-link
  10863. -->
  10864. (I3 ^dir L +)
  10865. Retracting rl*prefer*rvt*predict-no*H0*4
  10866. -->
  10867. (S1 ^operator O1954 = 0.3144991353263821)
  10868. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  10869. -->
  10870. (S1 ^operator O1954 = 0.6854177156873388)
  10871. Retracting rl*prefer*rvt*predict-yes*H0*3
  10872. -->
  10873. (S1 ^operator O1953 = 0.3907711727075364)
  10874. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  10875. -->
  10876. (S1 ^operator O1953 = -0.208713043145708)
  10877. =>WM: (13725: S1 ^operator O1956 +)
  10878. =>WM: (13724: S1 ^operator O1955 +)
  10879. =>WM: (13723: I3 ^dir R)
  10880. =>WM: (13722: O1956 ^name predict-no)
  10881. =>WM: (13721: O1955 ^name predict-yes)
  10882. =>WM: (13720: R981 ^value 1)
  10883. =>WM: (13719: R1 ^reward R981)
  10884. <=WM: (13710: S1 ^operator O1953 +)
  10885. <=WM: (13711: S1 ^operator O1954 +)
  10886. <=WM: (13712: S1 ^operator O1954)
  10887. <=WM: (13709: I3 ^dir L)
  10888. <=WM: (13705: R1 ^reward R980)
  10889. <=WM: (13708: O1954 ^name predict-no)
  10890. <=WM: (13707: O1953 ^name predict-yes)
  10891. <=WM: (13706: R980 ^value 1)
  10892. --- Inner Elaboration Phase, active level 1 (S1) ---
  10893. Firing prefer*rvt*predict-yes*H0
  10894. -->
  10895. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  10896. -->
  10897. (S1 ^operator O1955 = 0.8783927855286688)
  10898. Firing rl*prefer*rvt*predict-yes*H0*5
  10899. -->
  10900. (S1 ^operator O1955 = 0.1215980737936329)
  10901. Firing prefer*rvt*predict-yes*H0*5*H1
  10902. -->
  10903. Firing prefer*rvt*predict-no*H0
  10904. -->
  10905. Firing rl*prefer*rvt*predict-no*H0*6
  10906. -->
  10907. (S1 ^operator O1956 = 0.9999810901454903)
  10908. inner elaboration loop at bottom goal.
  10909. Retracting rl*prefer*rvt*predict-no*H0*6
  10910. -->
  10911. (S1 ^operator O1954 = 0.9999810901454903)
  10912. Retracting rl*prefer*rvt*predict-yes*H0*5
  10913. -->
  10914. (S1 ^operator O1953 = 0.1215980737936329)
  10915. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  10916. -->
  10917. (S1 ^operator O1953 = 0.8783927855286688)
  10918. --- END Proposal Phase ---
  10919. --- Decision Phase ---
  10920. RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478554 -0.164048 0.314506(R,m,v=1,0.921053,0.0731962)
  10921. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521377 0.164041 0.685418 -> 0.521384 0.164042 0.685426(R,m,v=1,1,0)
  10922. =>WM: (13726: S1 ^operator O1955)
  10923. 978: O: O1955 (predict-yes)
  10924. --- END Decision Phase ---
  10925. --- Application Phase ---
  10926. --- Firing Productions (PE) For State At Depth 1 ---
  10927. --- Inner Elaboration Phase, active level 1 (S1) ---
  10928. Firing apply*operator
  10929. -->
  10930. (I3 ^predict-yes N978 + :O )
  10931. Firing apply*operator*complete
  10932. -->
  10933. (I3 ^predict-no N977 - :O )
  10934. inner elaboration loop at bottom goal.
  10935. --- Change Working Memory (PE) ---
  10936. =>WM: (13727: I3 ^predict-yes N978)
  10937. <=WM: (13714: N977 ^status complete)
  10938. <=WM: (13713: I3 ^predict-no N977)
  10939. --- Firing Productions (IE) For State At Depth 1 ---
  10940. --- Inner Elaboration Phase, active level 1 (S1) ---
  10941. Firing monitor*world
  10942. -->
  10943. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  10944. --- Change Working Memory (IE) ---
  10945. --- END Application Phase ---
  10946. --- Output Phase ---
  10947. ENV: Agent did: predict-yes for direction R in state State-A
  10948. In State-A moving R
  10949. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  10950. predict error 0
  10951. dir: dir isL
  10952. --- END Output Phase ---
  10953. /|\--- Input Phase ---
  10954. =>WM: (13731: I2 ^dir L)
  10955. =>WM: (13730: I2 ^reward 1)
  10956. =>WM: (13729: I2 ^see 1)
  10957. =>WM: (13728: N978 ^status complete)
  10958. <=WM: (13717: I2 ^dir R)
  10959. <=WM: (13716: I2 ^reward 1)
  10960. <=WM: (13715: I2 ^see 0)
  10961. =>WM: (13732: I2 ^level-1 R1-root)
  10962. <=WM: (13718: I2 ^level-1 L0-root)
  10963. --- END Input Phase ---
  10964. --- Proposal Phase ---
  10965. --- Inner Elaboration Phase, active level 1 (S1) ---
  10966. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  10967. -->
  10968. (S1 ^operator O1956 = -0.168718511744511)
  10969. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  10970. -->
  10971. (S1 ^operator O1955 = 0.6093273841659509)
  10972. Firing prefer*rvt*predict-no*H0*4*H1
  10973. -->
  10974. Firing prefer*rvt*predict-yes*H0*3*H1
  10975. -->
  10976. Firing elaborate*copy-see-to-output-link
  10977. -->
  10978. (I3 ^see 1 +)
  10979. Firing elaborate*reward*based*on*reward
  10980. -->
  10981. (R982 ^value 1 +)
  10982. (R1 ^reward R982 +)
  10983. Firing propose*predict-yes
  10984. -->
  10985. (O1957 ^name predict-yes +)
  10986. (S1 ^operator O1957 +)
  10987. Firing propose*predict-no
  10988. -->
  10989. (O1958 ^name predict-no +)
  10990. (S1 ^operator O1958 +)
  10991. Firing rl*prefer*rvt*predict-no*H0*4
  10992. -->
  10993. (S1 ^operator O1956 = 0.3145060369395525)
  10994. Firing rl*prefer*rvt*predict-yes*H0*3
  10995. -->
  10996. (S1 ^operator O1955 = 0.3907711727075364)
  10997. Firing prefer*rvt*predict-yes*H0
  10998. -->
  10999. Firing prefer*rvt*predict-no*H0
  11000. -->
  11001. Firing elaborate*copy-dir-to-output-link
  11002. -->
  11003. (I3 ^dir L +)
  11004. inner elaboration loop at bottom goal.
  11005. Retracting elaborate*copy-see-to-output-link
  11006. -->
  11007. (I3 ^see 0 +)
  11008. Retracting propose*predict-no
  11009. -->
  11010. (O1956 ^name predict-no +)
  11011. (S1 ^operator O1956 +)
  11012. Retracting propose*predict-yes
  11013. -->
  11014. (O1955 ^name predict-yes +)
  11015. (S1 ^operator O1955 +)
  11016. Retracting elaborate*reward*based*on*reward
  11017. -->
  11018. (R981 ^value 1 +)
  11019. (R1 ^reward R981 +)
  11020. Retracting elaborate*copy-dir-to-output-link
  11021. -->
  11022. (I3 ^dir R +)
  11023. Retracting rl*prefer*rvt*predict-no*H0*6
  11024. -->
  11025. (S1 ^operator O1956 = 0.9999810901454903)
  11026. Retracting rl*prefer*rvt*predict-yes*H0*5
  11027. -->
  11028. (S1 ^operator O1955 = 0.1215980737936329)
  11029. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11030. -->
  11031. (S1 ^operator O1955 = 0.8783927855286688)
  11032. =>WM: (13740: S1 ^operator O1958 +)
  11033. =>WM: (13739: S1 ^operator O1957 +)
  11034. =>WM: (13738: I3 ^dir L)
  11035. =>WM: (13737: O1958 ^name predict-no)
  11036. =>WM: (13736: O1957 ^name predict-yes)
  11037. =>WM: (13735: R982 ^value 1)
  11038. =>WM: (13734: R1 ^reward R982)
  11039. =>WM: (13733: I3 ^see 1)
  11040. <=WM: (13724: S1 ^operator O1955 +)
  11041. <=WM: (13726: S1 ^operator O1955)
  11042. <=WM: (13725: S1 ^operator O1956 +)
  11043. <=WM: (13723: I3 ^dir R)
  11044. <=WM: (13719: R1 ^reward R981)
  11045. <=WM: (13690: I3 ^see 0)
  11046. <=WM: (13722: O1956 ^name predict-no)
  11047. <=WM: (13721: O1955 ^name predict-yes)
  11048. <=WM: (13720: R981 ^value 1)
  11049. --- Inner Elaboration Phase, active level 1 (S1) ---
  11050. Firing prefer*rvt*predict-yes*H0
  11051. -->
  11052. Firing rl*prefer*rvt*predict-yes*H0*3
  11053. -->
  11054. (S1 ^operator O1957 = 0.3907711727075364)
  11055. Firing prefer*rvt*predict-yes*H0*3*H1
  11056. -->
  11057. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  11058. -->
  11059. (S1 ^operator O1957 = 0.6093273841659509)
  11060. Firing prefer*rvt*predict-no*H0
  11061. -->
  11062. Firing rl*prefer*rvt*predict-no*H0*4
  11063. -->
  11064. (S1 ^operator O1958 = 0.3145060369395525)
  11065. Firing prefer*rvt*predict-no*H0*4*H1
  11066. -->
  11067. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  11068. -->
  11069. (S1 ^operator O1958 = -0.168718511744511)
  11070. inner elaboration loop at bottom goal.
  11071. Retracting rl*prefer*rvt*predict-no*H0*4
  11072. -->
  11073. (S1 ^operator O1956 = 0.3145060369395525)
  11074. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  11075. -->
  11076. (S1 ^operator O1956 = -0.168718511744511)
  11077. Retracting rl*prefer*rvt*predict-yes*H0*3
  11078. -->
  11079. (S1 ^operator O1955 = 0.3907711727075364)
  11080. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  11081. -->
  11082. (S1 ^operator O1955 = 0.6093273841659509)
  11083. --- END Proposal Phase ---
  11084. --- Decision Phase ---
  11085. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.861272,0.120177)
  11086. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465468 0.412925 0.878393 -> 0.465469 0.412925 0.878394(R,m,v=1,1,0)
  11087. =>WM: (13741: S1 ^operator O1957)
  11088. 979: O: O1957 (predict-yes)
  11089. --- END Decision Phase ---
  11090. --- Application Phase ---
  11091. --- Firing Productions (PE) For State At Depth 1 ---
  11092. --- Inner Elaboration Phase, active level 1 (S1) ---
  11093. Firing apply*operator
  11094. -->
  11095. (I3 ^predict-yes N979 + :O )
  11096. Firing apply*operator*complete
  11097. -->
  11098. (I3 ^predict-yes N978 - :O )
  11099. inner elaboration loop at bottom goal.
  11100. --- Change Working Memory (PE) ---
  11101. =>WM: (13742: I3 ^predict-yes N979)
  11102. <=WM: (13728: N978 ^status complete)
  11103. <=WM: (13727: I3 ^predict-yes N978)
  11104. --- Firing Productions (IE) For State At Depth 1 ---
  11105. --- Inner Elaboration Phase, active level 1 (S1) ---
  11106. Firing monitor*world
  11107. -->
  11108. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11109. --- Change Working Memory (IE) ---
  11110. --- END Application Phase ---
  11111. --- Output Phase ---
  11112. ENV: Agent did: predict-yes for direction L in state State-B
  11113. In State-B moving L
  11114. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11115. predict error 0
  11116. dir: dir isR
  11117. --- END Output Phase ---
  11118. -/|--- Input Phase ---
  11119. =>WM: (13746: I2 ^dir R)
  11120. =>WM: (13745: I2 ^reward 1)
  11121. =>WM: (13744: I2 ^see 1)
  11122. =>WM: (13743: N979 ^status complete)
  11123. <=WM: (13731: I2 ^dir L)
  11124. <=WM: (13730: I2 ^reward 1)
  11125. <=WM: (13729: I2 ^see 1)
  11126. =>WM: (13747: I2 ^level-1 L1-root)
  11127. <=WM: (13732: I2 ^level-1 R1-root)
  11128. --- END Input Phase ---
  11129. --- Proposal Phase ---
  11130. --- Inner Elaboration Phase, active level 1 (S1) ---
  11131. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  11132. -->
  11133. (S1 ^operator O1957 = 0.8784154092082219)
  11134. Firing prefer*rvt*predict-yes*H0*5*H1
  11135. -->
  11136. Firing elaborate*copy-see-to-output-link
  11137. -->
  11138. (I3 ^see 1 +)
  11139. Firing elaborate*reward*based*on*reward
  11140. -->
  11141. (R983 ^value 1 +)
  11142. (R1 ^reward R983 +)
  11143. Firing propose*predict-yes
  11144. -->
  11145. (O1959 ^name predict-yes +)
  11146. (S1 ^operator O1959 +)
  11147. Firing propose*predict-no
  11148. -->
  11149. (O1960 ^name predict-no +)
  11150. (S1 ^operator O1960 +)
  11151. Firing rl*prefer*rvt*predict-no*H0*6
  11152. -->
  11153. (S1 ^operator O1958 = 0.9999810901454903)
  11154. Firing rl*prefer*rvt*predict-yes*H0*5
  11155. -->
  11156. (S1 ^operator O1957 = 0.1215988165406292)
  11157. Firing prefer*rvt*predict-yes*H0
  11158. -->
  11159. Firing prefer*rvt*predict-no*H0
  11160. -->
  11161. Firing elaborate*copy-dir-to-output-link
  11162. -->
  11163. (I3 ^dir R +)
  11164. inner elaboration loop at bottom goal.
  11165. Retracting elaborate*copy-see-to-output-link
  11166. -->
  11167. (I3 ^see 1 +)
  11168. Retracting propose*predict-no
  11169. -->
  11170. (O1958 ^name predict-no +)
  11171. (S1 ^operator O1958 +)
  11172. Retracting propose*predict-yes
  11173. -->
  11174. (O1957 ^name predict-yes +)
  11175. (S1 ^operator O1957 +)
  11176. Retracting elaborate*reward*based*on*reward
  11177. -->
  11178. (R982 ^value 1 +)
  11179. (R1 ^reward R982 +)
  11180. Retracting elaborate*copy-dir-to-output-link
  11181. -->
  11182. (I3 ^dir L +)
  11183. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  11184. -->
  11185. (S1 ^operator O1958 = -0.168718511744511)
  11186. Retracting rl*prefer*rvt*predict-no*H0*4
  11187. -->
  11188. (S1 ^operator O1958 = 0.3145060369395525)
  11189. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  11190. -->
  11191. (S1 ^operator O1957 = 0.6093273841659509)
  11192. Retracting rl*prefer*rvt*predict-yes*H0*3
  11193. -->
  11194. (S1 ^operator O1957 = 0.3907711727075364)
  11195. =>WM: (13754: S1 ^operator O1960 +)
  11196. =>WM: (13753: S1 ^operator O1959 +)
  11197. =>WM: (13752: I3 ^dir R)
  11198. =>WM: (13751: O1960 ^name predict-no)
  11199. =>WM: (13750: O1959 ^name predict-yes)
  11200. =>WM: (13749: R983 ^value 1)
  11201. =>WM: (13748: R1 ^reward R983)
  11202. <=WM: (13739: S1 ^operator O1957 +)
  11203. <=WM: (13741: S1 ^operator O1957)
  11204. <=WM: (13740: S1 ^operator O1958 +)
  11205. <=WM: (13738: I3 ^dir L)
  11206. <=WM: (13734: R1 ^reward R982)
  11207. <=WM: (13737: O1958 ^name predict-no)
  11208. <=WM: (13736: O1957 ^name predict-yes)
  11209. <=WM: (13735: R982 ^value 1)
  11210. --- Inner Elaboration Phase, active level 1 (S1) ---
  11211. Firing prefer*rvt*predict-yes*H0
  11212. -->
  11213. Firing rl*prefer*rvt*predict-yes*H0*5
  11214. -->
  11215. (S1 ^operator O1959 = 0.1215988165406292)
  11216. Firing prefer*rvt*predict-yes*H0*5*H1
  11217. -->
  11218. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  11219. -->
  11220. (S1 ^operator O1959 = 0.8784154092082219)
  11221. Firing prefer*rvt*predict-no*H0
  11222. -->
  11223. Firing rl*prefer*rvt*predict-no*H0*6
  11224. -->
  11225. (S1 ^operator O1960 = 0.9999810901454903)
  11226. inner elaboration loop at bottom goal.
  11227. Retracting rl*prefer*rvt*predict-no*H0*6
  11228. -->
  11229. (S1 ^operator O1958 = 0.9999810901454903)
  11230. Retracting rl*prefer*rvt*predict-yes*H0*5
  11231. -->
  11232. (S1 ^operator O1957 = 0.1215988165406292)
  11233. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  11234. -->
  11235. (S1 ^operator O1957 = 0.8784154092082219)
  11236. --- END Proposal Phase ---
  11237. --- Decision Phase ---
  11238. RL update rl*prefer*rvt*predict-yes*H0*3 0.472318 -0.0815469 0.390771 -> 0.472311 -0.0815481 0.390763(R,m,v=1,0.942675,0.0543851)
  11239. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527766 0.0815615 0.609327 -> 0.527758 0.0815601 0.609318(R,m,v=1,1,0)
  11240. =>WM: (13755: S1 ^operator O1959)
  11241. 980: O: O1959 (predict-yes)
  11242. --- END Decision Phase ---
  11243. --- Application Phase ---
  11244. --- Firing Productions (PE) For State At Depth 1 ---
  11245. --- Inner Elaboration Phase, active level 1 (S1) ---
  11246. Firing apply*operator
  11247. -->
  11248. (I3 ^predict-yes N980 + :O )
  11249. Firing apply*operator*complete
  11250. -->
  11251. (I3 ^predict-yes N979 - :O )
  11252. inner elaboration loop at bottom goal.
  11253. --- Change Working Memory (PE) ---
  11254. =>WM: (13756: I3 ^predict-yes N980)
  11255. <=WM: (13743: N979 ^status complete)
  11256. <=WM: (13742: I3 ^predict-yes N979)
  11257. --- Firing Productions (IE) For State At Depth 1 ---
  11258. --- Inner Elaboration Phase, active level 1 (S1) ---
  11259. Firing monitor*world
  11260. -->
  11261. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11262. --- Change Working Memory (IE) ---
  11263. --- END Application Phase ---
  11264. --- Output Phase ---
  11265. ENV: Agent did: predict-yes for direction R in state State-A
  11266. In State-A moving R
  11267. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11268. predict error 0
  11269. dir: dir isR
  11270. --- END Output Phase ---
  11271. \---- Input Phase ---
  11272. =>WM: (13760: I2 ^dir R)
  11273. =>WM: (13759: I2 ^reward 1)
  11274. =>WM: (13758: I2 ^see 1)
  11275. =>WM: (13757: N980 ^status complete)
  11276. <=WM: (13746: I2 ^dir R)
  11277. <=WM: (13745: I2 ^reward 1)
  11278. <=WM: (13744: I2 ^see 1)
  11279. =>WM: (13761: I2 ^level-1 R1-root)
  11280. <=WM: (13747: I2 ^level-1 L1-root)
  11281. --- END Input Phase ---
  11282. --- Proposal Phase ---
  11283. --- Inner Elaboration Phase, active level 1 (S1) ---
  11284. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  11285. -->
  11286. (S1 ^operator O1959 = -0.04253361215288998)
  11287. Firing prefer*rvt*predict-yes*H0*5*H1
  11288. -->
  11289. Firing elaborate*copy-see-to-output-link
  11290. -->
  11291. (I3 ^see 1 +)
  11292. Firing elaborate*reward*based*on*reward
  11293. -->
  11294. (R984 ^value 1 +)
  11295. (R1 ^reward R984 +)
  11296. Firing propose*predict-yes
  11297. -->
  11298. (O1961 ^name predict-yes +)
  11299. (S1 ^operator O1961 +)
  11300. Firing propose*predict-no
  11301. -->
  11302. (O1962 ^name predict-no +)
  11303. (S1 ^operator O1962 +)
  11304. Firing rl*prefer*rvt*predict-no*H0*6
  11305. -->
  11306. (S1 ^operator O1960 = 0.9999810901454903)
  11307. Firing rl*prefer*rvt*predict-yes*H0*5
  11308. -->
  11309. (S1 ^operator O1959 = 0.1215988165406292)
  11310. Firing prefer*rvt*predict-yes*H0
  11311. -->
  11312. Firing prefer*rvt*predict-no*H0
  11313. -->
  11314. Firing elaborate*copy-dir-to-output-link
  11315. -->
  11316. (I3 ^dir R +)
  11317. inner elaboration loop at bottom goal.
  11318. Retracting elaborate*copy-see-to-output-link
  11319. -->
  11320. (I3 ^see 1 +)
  11321. Retracting propose*predict-no
  11322. -->
  11323. (O1960 ^name predict-no +)
  11324. (S1 ^operator O1960 +)
  11325. Retracting propose*predict-yes
  11326. -->
  11327. (O1959 ^name predict-yes +)
  11328. (S1 ^operator O1959 +)
  11329. Retracting elaborate*reward*based*on*reward
  11330. -->
  11331. (R983 ^value 1 +)
  11332. (R1 ^reward R983 +)
  11333. Retracting elaborate*copy-dir-to-output-link
  11334. -->
  11335. (I3 ^dir R +)
  11336. Retracting rl*prefer*rvt*predict-no*H0*6
  11337. -->
  11338. (S1 ^operator O1960 = 0.9999810901454903)
  11339. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  11340. -->
  11341. (S1 ^operator O1959 = 0.8784154092082219)
  11342. Retracting rl*prefer*rvt*predict-yes*H0*5
  11343. -->
  11344. (S1 ^operator O1959 = 0.1215988165406292)
  11345. =>WM: (13767: S1 ^operator O1962 +)
  11346. =>WM: (13766: S1 ^operator O1961 +)
  11347. =>WM: (13765: O1962 ^name predict-no)
  11348. =>WM: (13764: O1961 ^name predict-yes)
  11349. =>WM: (13763: R984 ^value 1)
  11350. =>WM: (13762: R1 ^reward R984)
  11351. <=WM: (13753: S1 ^operator O1959 +)
  11352. <=WM: (13755: S1 ^operator O1959)
  11353. <=WM: (13754: S1 ^operator O1960 +)
  11354. <=WM: (13748: R1 ^reward R983)
  11355. <=WM: (13751: O1960 ^name predict-no)
  11356. <=WM: (13750: O1959 ^name predict-yes)
  11357. <=WM: (13749: R983 ^value 1)
  11358. --- Inner Elaboration Phase, active level 1 (S1) ---
  11359. Firing prefer*rvt*predict-yes*H0
  11360. -->
  11361. Firing rl*prefer*rvt*predict-yes*H0*5
  11362. -->
  11363. (S1 ^operator O1961 = 0.1215988165406292)
  11364. Firing prefer*rvt*predict-yes*H0*5*H1
  11365. -->
  11366. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  11367. -->
  11368. (S1 ^operator O1961 = -0.04253361215288998)
  11369. Firing prefer*rvt*predict-no*H0
  11370. -->
  11371. Firing rl*prefer*rvt*predict-no*H0*6
  11372. -->
  11373. (S1 ^operator O1962 = 0.9999810901454903)
  11374. inner elaboration loop at bottom goal.
  11375. Retracting rl*prefer*rvt*predict-no*H0*6
  11376. -->
  11377. (S1 ^operator O1960 = 0.9999810901454903)
  11378. Retracting rl*prefer*rvt*predict-yes*H0*5
  11379. -->
  11380. (S1 ^operator O1959 = 0.1215988165406292)
  11381. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  11382. -->
  11383. (S1 ^operator O1959 = -0.04253361215288998)
  11384. --- END Proposal Phase ---
  11385. --- Decision Phase ---
  11386. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862069,0.119593)
  11387. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465487 0.412928 0.878415 -> 0.465486 0.412928 0.878414(R,m,v=1,1,0)
  11388. =>WM: (13768: S1 ^operator O1962)
  11389. 981: O: O1962 (predict-no)
  11390. --- END Decision Phase ---
  11391. --- Application Phase ---
  11392. --- Firing Productions (PE) For State At Depth 1 ---
  11393. --- Inner Elaboration Phase, active level 1 (S1) ---
  11394. Firing apply*operator
  11395. -->
  11396. (I3 ^predict-no N981 + :O )
  11397. Firing apply*operator*complete
  11398. -->
  11399. (I3 ^predict-yes N980 - :O )
  11400. inner elaboration loop at bottom goal.
  11401. --- Change Working Memory (PE) ---
  11402. =>WM: (13769: I3 ^predict-no N981)
  11403. <=WM: (13757: N980 ^status complete)
  11404. <=WM: (13756: I3 ^predict-yes N980)
  11405. --- Firing Productions (IE) For State At Depth 1 ---
  11406. --- Inner Elaboration Phase, active level 1 (S1) ---
  11407. Firing monitor*world
  11408. -->
  11409. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11410. --- Change Working Memory (IE) ---
  11411. --- END Application Phase ---
  11412. --- Output Phase ---
  11413. ENV: Agent did: predict-no for direction R in state State-B
  11414. In State-B moving R
  11415. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  11416. predict error 0
  11417. dir: dir isL
  11418. --- END Output Phase ---
  11419. /--- Input Phase ---
  11420. =>WM: (13773: I2 ^dir L)
  11421. =>WM: (13772: I2 ^reward 1)
  11422. =>WM: (13771: I2 ^see 0)
  11423. =>WM: (13770: N981 ^status complete)
  11424. <=WM: (13760: I2 ^dir R)
  11425. <=WM: (13759: I2 ^reward 1)
  11426. <=WM: (13758: I2 ^see 1)
  11427. =>WM: (13774: I2 ^level-1 R0-root)
  11428. <=WM: (13761: I2 ^level-1 R1-root)
  11429. --- END Input Phase ---
  11430. --- Proposal Phase ---
  11431. --- Inner Elaboration Phase, active level 1 (S1) ---
  11432. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  11433. -->
  11434. (S1 ^operator O1962 = -0.1984300550322165)
  11435. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  11436. -->
  11437. (S1 ^operator O1961 = 0.609089086334031)
  11438. Firing prefer*rvt*predict-no*H0*4*H1
  11439. -->
  11440. Firing prefer*rvt*predict-yes*H0*3*H1
  11441. -->
  11442. Firing elaborate*copy-see-to-output-link
  11443. -->
  11444. (I3 ^see 0 +)
  11445. Firing elaborate*reward*based*on*reward
  11446. -->
  11447. (R985 ^value 1 +)
  11448. (R1 ^reward R985 +)
  11449. Firing propose*predict-yes
  11450. -->
  11451. (O1963 ^name predict-yes +)
  11452. (S1 ^operator O1963 +)
  11453. Firing propose*predict-no
  11454. -->
  11455. (O1964 ^name predict-no +)
  11456. (S1 ^operator O1964 +)
  11457. Firing rl*prefer*rvt*predict-no*H0*4
  11458. -->
  11459. (S1 ^operator O1962 = 0.3145060369395525)
  11460. Firing rl*prefer*rvt*predict-yes*H0*3
  11461. -->
  11462. (S1 ^operator O1961 = 0.39076303591152)
  11463. Firing prefer*rvt*predict-yes*H0
  11464. -->
  11465. Firing prefer*rvt*predict-no*H0
  11466. -->
  11467. Firing elaborate*copy-dir-to-output-link
  11468. -->
  11469. (I3 ^dir L +)
  11470. inner elaboration loop at bottom goal.
  11471. Retracting elaborate*copy-see-to-output-link
  11472. -->
  11473. (I3 ^see 1 +)
  11474. Retracting propose*predict-no
  11475. -->
  11476. (O1962 ^name predict-no +)
  11477. (S1 ^operator O1962 +)
  11478. Retracting propose*predict-yes
  11479. -->
  11480. (O1961 ^name predict-yes +)
  11481. (S1 ^operator O1961 +)
  11482. Retracting elaborate*reward*based*on*reward
  11483. -->
  11484. (R984 ^value 1 +)
  11485. (R1 ^reward R984 +)
  11486. Retracting elaborate*copy-dir-to-output-link
  11487. -->
  11488. (I3 ^dir R +)
  11489. Retracting rl*prefer*rvt*predict-no*H0*6
  11490. -->
  11491. (S1 ^operator O1962 = 0.9999810901454903)
  11492. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  11493. -->
  11494. (S1 ^operator O1961 = -0.04253361215288998)
  11495. Retracting rl*prefer*rvt*predict-yes*H0*5
  11496. -->
  11497. (S1 ^operator O1961 = 0.1215976616761118)
  11498. =>WM: (13782: S1 ^operator O1964 +)
  11499. =>WM: (13781: S1 ^operator O1963 +)
  11500. =>WM: (13780: I3 ^dir L)
  11501. =>WM: (13779: O1964 ^name predict-no)
  11502. =>WM: (13778: O1963 ^name predict-yes)
  11503. =>WM: (13777: R985 ^value 1)
  11504. =>WM: (13776: R1 ^reward R985)
  11505. =>WM: (13775: I3 ^see 0)
  11506. <=WM: (13766: S1 ^operator O1961 +)
  11507. <=WM: (13767: S1 ^operator O1962 +)
  11508. <=WM: (13768: S1 ^operator O1962)
  11509. <=WM: (13752: I3 ^dir R)
  11510. <=WM: (13762: R1 ^reward R984)
  11511. <=WM: (13733: I3 ^see 1)
  11512. <=WM: (13765: O1962 ^name predict-no)
  11513. <=WM: (13764: O1961 ^name predict-yes)
  11514. <=WM: (13763: R984 ^value 1)
  11515. --- Inner Elaboration Phase, active level 1 (S1) ---
  11516. Firing prefer*rvt*predict-yes*H0
  11517. -->
  11518. Firing rl*prefer*rvt*predict-yes*H0*3
  11519. -->
  11520. (S1 ^operator O1963 = 0.39076303591152)
  11521. Firing prefer*rvt*predict-yes*H0*3*H1
  11522. -->
  11523. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  11524. -->
  11525. (S1 ^operator O1963 = 0.609089086334031)
  11526. Firing prefer*rvt*predict-no*H0
  11527. -->
  11528. Firing rl*prefer*rvt*predict-no*H0*4
  11529. -->
  11530. (S1 ^operator O1964 = 0.3145060369395525)
  11531. Firing prefer*rvt*predict-no*H0*4*H1
  11532. -->
  11533. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  11534. -->
  11535. (S1 ^operator O1964 = -0.1984300550322165)
  11536. inner elaboration loop at bottom goal.
  11537. Retracting rl*prefer*rvt*predict-no*H0*4
  11538. -->
  11539. (S1 ^operator O1962 = 0.3145060369395525)
  11540. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  11541. -->
  11542. (S1 ^operator O1962 = -0.1984300550322165)
  11543. Retracting rl*prefer*rvt*predict-yes*H0*3
  11544. -->
  11545. (S1 ^operator O1961 = 0.39076303591152)
  11546. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  11547. -->
  11548. (S1 ^operator O1961 = 0.609089086334031)
  11549. --- END Proposal Phase ---
  11550. --- Decision Phase ---
  11551. RL update rl*prefer*rvt*predict-no*H0*6 0.999981 0 0.999981 -> 0.999984 0 0.999984(R,m,v=1,0.937143,0.0592447)
  11552. =>WM: (13783: S1 ^operator O1963)
  11553. 982: O: O1963 (predict-yes)
  11554. --- END Decision Phase ---
  11555. --- Application Phase ---
  11556. --- Firing Productions (PE) For State At Depth 1 ---
  11557. --- Inner Elaboration Phase, active level 1 (S1) ---
  11558. Firing apply*operator
  11559. -->
  11560. (I3 ^predict-yes N982 + :O )
  11561. Firing apply*operator*complete
  11562. -->
  11563. (I3 ^predict-no N981 - :O )
  11564. inner elaboration loop at bottom goal.
  11565. --- Change Working Memory (PE) ---
  11566. =>WM: (13784: I3 ^predict-yes N982)
  11567. <=WM: (13770: N981 ^status complete)
  11568. <=WM: (13769: I3 ^predict-no N981)
  11569. --- Firing Productions (IE) For State At Depth 1 ---
  11570. --- Inner Elaboration Phase, active level 1 (S1) ---
  11571. Firing monitor*world
  11572. -->
  11573. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11574. --- Change Working Memory (IE) ---
  11575. --- END Application Phase ---
  11576. --- Output Phase ---
  11577. ENV: Agent did: predict-yes for direction L in state State-B
  11578. In State-B moving L
  11579. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  11580. predict error 0
  11581. dir: dir isL
  11582. --- END Output Phase ---
  11583. |\--- Input Phase ---
  11584. =>WM: (13788: I2 ^dir L)
  11585. =>WM: (13787: I2 ^reward 1)
  11586. =>WM: (13786: I2 ^see 1)
  11587. =>WM: (13785: N982 ^status complete)
  11588. <=WM: (13773: I2 ^dir L)
  11589. <=WM: (13772: I2 ^reward 1)
  11590. <=WM: (13771: I2 ^see 0)
  11591. =>WM: (13789: I2 ^level-1 L1-root)
  11592. <=WM: (13774: I2 ^level-1 R0-root)
  11593. --- END Input Phase ---
  11594. --- Proposal Phase ---
  11595. --- Inner Elaboration Phase, active level 1 (S1) ---
  11596. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  11597. -->
  11598. (S1 ^operator O1963 = -0.2062723012911647)
  11599. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  11600. -->
  11601. (S1 ^operator O1964 = 0.6855414715988584)
  11602. Firing prefer*rvt*predict-no*H0*4*H1
  11603. -->
  11604. Firing prefer*rvt*predict-yes*H0*3*H1
  11605. -->
  11606. Firing elaborate*copy-see-to-output-link
  11607. -->
  11608. (I3 ^see 1 +)
  11609. Firing elaborate*reward*based*on*reward
  11610. -->
  11611. (R986 ^value 1 +)
  11612. (R1 ^reward R986 +)
  11613. Firing propose*predict-yes
  11614. -->
  11615. (O1965 ^name predict-yes +)
  11616. (S1 ^operator O1965 +)
  11617. Firing propose*predict-no
  11618. -->
  11619. (O1966 ^name predict-no +)
  11620. (S1 ^operator O1966 +)
  11621. Firing rl*prefer*rvt*predict-no*H0*4
  11622. -->
  11623. (S1 ^operator O1964 = 0.3145060369395525)
  11624. Firing rl*prefer*rvt*predict-yes*H0*3
  11625. -->
  11626. (S1 ^operator O1963 = 0.39076303591152)
  11627. Firing prefer*rvt*predict-yes*H0
  11628. -->
  11629. Firing prefer*rvt*predict-no*H0
  11630. -->
  11631. Firing elaborate*copy-dir-to-output-link
  11632. -->
  11633. (I3 ^dir L +)
  11634. inner elaboration loop at bottom goal.
  11635. Retracting elaborate*copy-see-to-output-link
  11636. -->
  11637. (I3 ^see 0 +)
  11638. Retracting propose*predict-no
  11639. -->
  11640. (O1964 ^name predict-no +)
  11641. (S1 ^operator O1964 +)
  11642. Retracting propose*predict-yes
  11643. -->
  11644. (O1963 ^name predict-yes +)
  11645. (S1 ^operator O1963 +)
  11646. Retracting elaborate*reward*based*on*reward
  11647. -->
  11648. (R985 ^value 1 +)
  11649. (R1 ^reward R985 +)
  11650. Retracting elaborate*copy-dir-to-output-link
  11651. -->
  11652. (I3 ^dir L +)
  11653. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  11654. -->
  11655. (S1 ^operator O1964 = -0.1984300550322165)
  11656. Retracting rl*prefer*rvt*predict-no*H0*4
  11657. -->
  11658. (S1 ^operator O1964 = 0.3145060369395525)
  11659. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  11660. -->
  11661. (S1 ^operator O1963 = 0.609089086334031)
  11662. Retracting rl*prefer*rvt*predict-yes*H0*3
  11663. -->
  11664. (S1 ^operator O1963 = 0.39076303591152)
  11665. =>WM: (13796: S1 ^operator O1966 +)
  11666. =>WM: (13795: S1 ^operator O1965 +)
  11667. =>WM: (13794: O1966 ^name predict-no)
  11668. =>WM: (13793: O1965 ^name predict-yes)
  11669. =>WM: (13792: R986 ^value 1)
  11670. =>WM: (13791: R1 ^reward R986)
  11671. =>WM: (13790: I3 ^see 1)
  11672. <=WM: (13781: S1 ^operator O1963 +)
  11673. <=WM: (13783: S1 ^operator O1963)
  11674. <=WM: (13782: S1 ^operator O1964 +)
  11675. <=WM: (13776: R1 ^reward R985)
  11676. <=WM: (13775: I3 ^see 0)
  11677. <=WM: (13779: O1964 ^name predict-no)
  11678. <=WM: (13778: O1963 ^name predict-yes)
  11679. <=WM: (13777: R985 ^value 1)
  11680. --- Inner Elaboration Phase, active level 1 (S1) ---
  11681. Firing prefer*rvt*predict-yes*H0
  11682. -->
  11683. Firing rl*prefer*rvt*predict-yes*H0*3
  11684. -->
  11685. (S1 ^operator O1965 = 0.39076303591152)
  11686. Firing prefer*rvt*predict-yes*H0*3*H1
  11687. -->
  11688. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  11689. -->
  11690. (S1 ^operator O1965 = -0.2062723012911647)
  11691. Firing prefer*rvt*predict-no*H0
  11692. -->
  11693. Firing rl*prefer*rvt*predict-no*H0*4
  11694. -->
  11695. (S1 ^operator O1966 = 0.3145060369395525)
  11696. Firing prefer*rvt*predict-no*H0*4*H1
  11697. -->
  11698. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  11699. -->
  11700. (S1 ^operator O1966 = 0.6855414715988584)
  11701. inner elaboration loop at bottom goal.
  11702. Retracting rl*prefer*rvt*predict-no*H0*4
  11703. -->
  11704. (S1 ^operator O1964 = 0.3145060369395525)
  11705. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  11706. -->
  11707. (S1 ^operator O1964 = 0.6855414715988584)
  11708. Retracting rl*prefer*rvt*predict-yes*H0*3
  11709. -->
  11710. (S1 ^operator O1963 = 0.39076303591152)
  11711. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  11712. -->
  11713. (S1 ^operator O1963 = -0.2062723012911647)
  11714. --- END Proposal Phase ---
  11715. --- Decision Phase ---
  11716. RL update rl*prefer*rvt*predict-yes*H0*3 0.472311 -0.0815481 0.390763 -> 0.472322 -0.0815463 0.390775(R,m,v=1,0.943038,0.0540595)
  11717. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527563 0.0815262 0.609089 -> 0.527575 0.0815283 0.609103(R,m,v=1,1,0)
  11718. =>WM: (13797: S1 ^operator O1966)
  11719. 983: O: O1966 (predict-no)
  11720. --- END Decision Phase ---
  11721. --- Application Phase ---
  11722. --- Firing Productions (PE) For State At Depth 1 ---
  11723. --- Inner Elaboration Phase, active level 1 (S1) ---
  11724. Firing apply*operator
  11725. -->
  11726. (I3 ^predict-no N983 + :O )
  11727. Firing apply*operator*complete
  11728. -->
  11729. (I3 ^predict-yes N982 - :O )
  11730. inner elaboration loop at bottom goal.
  11731. --- Change Working Memory (PE) ---
  11732. =>WM: (13798: I3 ^predict-no N983)
  11733. <=WM: (13785: N982 ^status complete)
  11734. <=WM: (13784: I3 ^predict-yes N982)
  11735. --- Firing Productions (IE) For State At Depth 1 ---
  11736. --- Inner Elaboration Phase, active level 1 (S1) ---
  11737. Firing monitor*world
  11738. -->
  11739. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  11740. --- Change Working Memory (IE) ---
  11741. --- END Application Phase ---
  11742. --- Output Phase ---
  11743. ENV: Agent did: predict-no for direction L in state State-A
  11744. In State-A moving L
  11745. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  11746. predict error 0
  11747. dir: dir isR
  11748. --- END Output Phase ---
  11749. -/--- Input Phase ---
  11750. =>WM: (13802: I2 ^dir R)
  11751. =>WM: (13801: I2 ^reward 1)
  11752. =>WM: (13800: I2 ^see 0)
  11753. =>WM: (13799: N983 ^status complete)
  11754. <=WM: (13788: I2 ^dir L)
  11755. <=WM: (13787: I2 ^reward 1)
  11756. <=WM: (13786: I2 ^see 1)
  11757. =>WM: (13803: I2 ^level-1 L0-root)
  11758. <=WM: (13789: I2 ^level-1 L1-root)
  11759. --- END Input Phase ---
  11760. --- Proposal Phase ---
  11761. --- Inner Elaboration Phase, active level 1 (S1) ---
  11762. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  11763. -->
  11764. (S1 ^operator O1965 = 0.8783936611550894)
  11765. Firing prefer*rvt*predict-yes*H0*5*H1
  11766. -->
  11767. Firing elaborate*copy-see-to-output-link
  11768. -->
  11769. (I3 ^see 0 +)
  11770. Firing elaborate*reward*based*on*reward
  11771. -->
  11772. (R987 ^value 1 +)
  11773. (R1 ^reward R987 +)
  11774. Firing propose*predict-yes
  11775. -->
  11776. (O1967 ^name predict-yes +)
  11777. (S1 ^operator O1967 +)
  11778. Firing propose*predict-no
  11779. -->
  11780. (O1968 ^name predict-no +)
  11781. (S1 ^operator O1968 +)
  11782. Firing rl*prefer*rvt*predict-no*H0*6
  11783. -->
  11784. (S1 ^operator O1966 = 0.9999841575438704)
  11785. Firing rl*prefer*rvt*predict-yes*H0*5
  11786. -->
  11787. (S1 ^operator O1965 = 0.1215976616761118)
  11788. Firing prefer*rvt*predict-yes*H0
  11789. -->
  11790. Firing prefer*rvt*predict-no*H0
  11791. -->
  11792. Firing elaborate*copy-dir-to-output-link
  11793. -->
  11794. (I3 ^dir R +)
  11795. inner elaboration loop at bottom goal.
  11796. Retracting elaborate*copy-see-to-output-link
  11797. -->
  11798. (I3 ^see 1 +)
  11799. Retracting propose*predict-no
  11800. -->
  11801. (O1966 ^name predict-no +)
  11802. (S1 ^operator O1966 +)
  11803. Retracting propose*predict-yes
  11804. -->
  11805. (O1965 ^name predict-yes +)
  11806. (S1 ^operator O1965 +)
  11807. Retracting elaborate*reward*based*on*reward
  11808. -->
  11809. (R986 ^value 1 +)
  11810. (R1 ^reward R986 +)
  11811. Retracting elaborate*copy-dir-to-output-link
  11812. -->
  11813. (I3 ^dir L +)
  11814. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  11815. -->
  11816. (S1 ^operator O1966 = 0.6855414715988584)
  11817. Retracting rl*prefer*rvt*predict-no*H0*4
  11818. -->
  11819. (S1 ^operator O1966 = 0.3145060369395525)
  11820. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  11821. -->
  11822. (S1 ^operator O1965 = -0.2062723012911647)
  11823. Retracting rl*prefer*rvt*predict-yes*H0*3
  11824. -->
  11825. (S1 ^operator O1965 = 0.390775231823802)
  11826. =>WM: (13811: S1 ^operator O1968 +)
  11827. =>WM: (13810: S1 ^operator O1967 +)
  11828. =>WM: (13809: I3 ^dir R)
  11829. =>WM: (13808: O1968 ^name predict-no)
  11830. =>WM: (13807: O1967 ^name predict-yes)
  11831. =>WM: (13806: R987 ^value 1)
  11832. =>WM: (13805: R1 ^reward R987)
  11833. =>WM: (13804: I3 ^see 0)
  11834. <=WM: (13795: S1 ^operator O1965 +)
  11835. <=WM: (13796: S1 ^operator O1966 +)
  11836. <=WM: (13797: S1 ^operator O1966)
  11837. <=WM: (13780: I3 ^dir L)
  11838. <=WM: (13791: R1 ^reward R986)
  11839. <=WM: (13790: I3 ^see 1)
  11840. <=WM: (13794: O1966 ^name predict-no)
  11841. <=WM: (13793: O1965 ^name predict-yes)
  11842. <=WM: (13792: R986 ^value 1)
  11843. --- Inner Elaboration Phase, active level 1 (S1) ---
  11844. Firing prefer*rvt*predict-yes*H0
  11845. -->
  11846. Firing rl*prefer*rvt*predict-yes*H0*5
  11847. -->
  11848. (S1 ^operator O1967 = 0.1215976616761118)
  11849. Firing prefer*rvt*predict-yes*H0*5*H1
  11850. -->
  11851. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  11852. -->
  11853. (S1 ^operator O1967 = 0.8783936611550894)
  11854. Firing prefer*rvt*predict-no*H0
  11855. -->
  11856. Firing rl*prefer*rvt*predict-no*H0*6
  11857. -->
  11858. (S1 ^operator O1968 = 0.9999841575438704)
  11859. inner elaboration loop at bottom goal.
  11860. Retracting rl*prefer*rvt*predict-no*H0*6
  11861. -->
  11862. (S1 ^operator O1966 = 0.9999841575438704)
  11863. Retracting rl*prefer*rvt*predict-yes*H0*5
  11864. -->
  11865. (S1 ^operator O1965 = 0.1215976616761118)
  11866. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11867. -->
  11868. (S1 ^operator O1965 = 0.8783936611550894)
  11869. --- END Proposal Phase ---
  11870. --- Decision Phase ---
  11871. RL update rl*prefer*rvt*predict-no*H0*4 0.478554 -0.164048 0.314506 -> 0.478551 -0.164048 0.314502(R,m,v=1,0.921569,0.0727554)
  11872. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521489 0.164052 0.685541 -> 0.521485 0.164052 0.685537(R,m,v=1,1,0)
  11873. =>WM: (13812: S1 ^operator O1967)
  11874. 984: O: O1967 (predict-yes)
  11875. --- END Decision Phase ---
  11876. --- Application Phase ---
  11877. --- Firing Productions (PE) For State At Depth 1 ---
  11878. --- Inner Elaboration Phase, active level 1 (S1) ---
  11879. Firing apply*operator
  11880. -->
  11881. (I3 ^predict-yes N984 + :O )
  11882. Firing apply*operator*complete
  11883. -->
  11884. (I3 ^predict-no N983 - :O )
  11885. inner elaboration loop at bottom goal.
  11886. --- Change Working Memory (PE) ---
  11887. =>WM: (13813: I3 ^predict-yes N984)
  11888. <=WM: (13799: N983 ^status complete)
  11889. <=WM: (13798: I3 ^predict-no N983)
  11890. --- Firing Productions (IE) For State At Depth 1 ---
  11891. --- Inner Elaboration Phase, active level 1 (S1) ---
  11892. Firing monitor*world
  11893. -->
  11894. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  11895. --- Change Working Memory (IE) ---
  11896. --- END Application Phase ---
  11897. --- Output Phase ---
  11898. ENV: Agent did: predict-yes for direction R in state State-A
  11899. In State-A moving R
  11900. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  11901. predict error 0
  11902. dir: dir isU
  11903. --- END Output Phase ---
  11904. |\---- Input Phase ---
  11905. =>WM: (13817: I2 ^dir U)
  11906. =>WM: (13816: I2 ^reward 1)
  11907. =>WM: (13815: I2 ^see 1)
  11908. =>WM: (13814: N984 ^status complete)
  11909. <=WM: (13802: I2 ^dir R)
  11910. <=WM: (13801: I2 ^reward 1)
  11911. <=WM: (13800: I2 ^see 0)
  11912. =>WM: (13818: I2 ^level-1 R1-root)
  11913. <=WM: (13803: I2 ^level-1 L0-root)
  11914. --- END Input Phase ---
  11915. --- Proposal Phase ---
  11916. --- Inner Elaboration Phase, active level 1 (S1) ---
  11917. Firing elaborate*copy-see-to-output-link
  11918. -->
  11919. (I3 ^see 1 +)
  11920. Firing elaborate*reward*based*on*reward
  11921. -->
  11922. (R988 ^value 1 +)
  11923. (R1 ^reward R988 +)
  11924. Firing propose*predict-yes
  11925. -->
  11926. (O1969 ^name predict-yes +)
  11927. (S1 ^operator O1969 +)
  11928. Firing propose*predict-no
  11929. -->
  11930. (O1970 ^name predict-no +)
  11931. (S1 ^operator O1970 +)
  11932. Firing rl*prefer*rvt*predict-no*H0*2
  11933. -->
  11934. (S1 ^operator O1968 = 1.)
  11935. Firing rl*prefer*rvt*predict-yes*H0*1
  11936. -->
  11937. (S1 ^operator O1967 = 0.)
  11938. Firing prefer*rvt*predict-yes*H0
  11939. -->
  11940. Firing prefer*rvt*predict-no*H0
  11941. -->
  11942. Firing elaborate*copy-dir-to-output-link
  11943. -->
  11944. (I3 ^dir U +)
  11945. inner elaboration loop at bottom goal.
  11946. Retracting elaborate*copy-see-to-output-link
  11947. -->
  11948. (I3 ^see 0 +)
  11949. Retracting propose*predict-no
  11950. -->
  11951. (O1968 ^name predict-no +)
  11952. (S1 ^operator O1968 +)
  11953. Retracting propose*predict-yes
  11954. -->
  11955. (O1967 ^name predict-yes +)
  11956. (S1 ^operator O1967 +)
  11957. Retracting elaborate*reward*based*on*reward
  11958. -->
  11959. (R987 ^value 1 +)
  11960. (R1 ^reward R987 +)
  11961. Retracting elaborate*copy-dir-to-output-link
  11962. -->
  11963. (I3 ^dir R +)
  11964. Retracting rl*prefer*rvt*predict-no*H0*6
  11965. -->
  11966. (S1 ^operator O1968 = 0.9999841575438704)
  11967. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  11968. -->
  11969. (S1 ^operator O1967 = 0.8783936611550894)
  11970. Retracting rl*prefer*rvt*predict-yes*H0*5
  11971. -->
  11972. (S1 ^operator O1967 = 0.1215976616761118)
  11973. =>WM: (13826: S1 ^operator O1970 +)
  11974. =>WM: (13825: S1 ^operator O1969 +)
  11975. =>WM: (13824: I3 ^dir U)
  11976. =>WM: (13823: O1970 ^name predict-no)
  11977. =>WM: (13822: O1969 ^name predict-yes)
  11978. =>WM: (13821: R988 ^value 1)
  11979. =>WM: (13820: R1 ^reward R988)
  11980. =>WM: (13819: I3 ^see 1)
  11981. <=WM: (13810: S1 ^operator O1967 +)
  11982. <=WM: (13812: S1 ^operator O1967)
  11983. <=WM: (13811: S1 ^operator O1968 +)
  11984. <=WM: (13809: I3 ^dir R)
  11985. <=WM: (13805: R1 ^reward R987)
  11986. <=WM: (13804: I3 ^see 0)
  11987. <=WM: (13808: O1968 ^name predict-no)
  11988. <=WM: (13807: O1967 ^name predict-yes)
  11989. <=WM: (13806: R987 ^value 1)
  11990. --- Inner Elaboration Phase, active level 1 (S1) ---
  11991. Firing prefer*rvt*predict-yes*H0
  11992. -->
  11993. Firing rl*prefer*rvt*predict-yes*H0*1
  11994. -->
  11995. (S1 ^operator O1969 = 0.)
  11996. Firing prefer*rvt*predict-no*H0
  11997. -->
  11998. Firing rl*prefer*rvt*predict-no*H0*2
  11999. -->
  12000. (S1 ^operator O1970 = 1.)
  12001. inner elaboration loop at bottom goal.
  12002. Retracting rl*prefer*rvt*predict-no*H0*2
  12003. -->
  12004. (S1 ^operator O1968 = 1.)
  12005. Retracting rl*prefer*rvt*predict-yes*H0*1
  12006. -->
  12007. (S1 ^operator O1967 = 0.)
  12008. --- END Proposal Phase ---
  12009. --- Decision Phase ---
  12010. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.862857,0.119015)
  12011. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465469 0.412925 0.878394 -> 0.46547 0.412925 0.878394(R,m,v=1,1,0)
  12012. =>WM: (13827: S1 ^operator O1970)
  12013. 985: O: O1970 (predict-no)
  12014. --- END Decision Phase ---
  12015. --- Application Phase ---
  12016. --- Firing Productions (PE) For State At Depth 1 ---
  12017. --- Inner Elaboration Phase, active level 1 (S1) ---
  12018. Firing apply*operator
  12019. -->
  12020. (I3 ^predict-no N985 + :O )
  12021. Firing apply*operator*complete
  12022. -->
  12023. (I3 ^predict-yes N984 - :O )
  12024. inner elaboration loop at bottom goal.
  12025. --- Change Working Memory (PE) ---
  12026. =>WM: (13828: I3 ^predict-no N985)
  12027. <=WM: (13814: N984 ^status complete)
  12028. <=WM: (13813: I3 ^predict-yes N984)
  12029. --- Firing Productions (IE) For State At Depth 1 ---
  12030. --- Inner Elaboration Phase, active level 1 (S1) ---
  12031. Firing monitor*world
  12032. -->
  12033. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12034. --- Change Working Memory (IE) ---
  12035. --- END Application Phase ---
  12036. --- Output Phase ---
  12037. ENV: Agent did: predict-no for direction U in state State-B
  12038. In State-B moving U
  12039. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12040. predict error 0
  12041. dir: dir isL
  12042. --- END Output Phase ---
  12043. /|--- Input Phase ---
  12044. =>WM: (13832: I2 ^dir L)
  12045. =>WM: (13831: I2 ^reward 1)
  12046. =>WM: (13830: I2 ^see 0)
  12047. =>WM: (13829: N985 ^status complete)
  12048. <=WM: (13817: I2 ^dir U)
  12049. <=WM: (13816: I2 ^reward 1)
  12050. <=WM: (13815: I2 ^see 1)
  12051. =>WM: (13833: I2 ^level-1 R1-root)
  12052. <=WM: (13818: I2 ^level-1 R1-root)
  12053. --- END Input Phase ---
  12054. --- Proposal Phase ---
  12055. --- Inner Elaboration Phase, active level 1 (S1) ---
  12056. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  12057. -->
  12058. (S1 ^operator O1970 = -0.168718511744511)
  12059. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  12060. -->
  12061. (S1 ^operator O1969 = 0.6093180204125221)
  12062. Firing prefer*rvt*predict-no*H0*4*H1
  12063. -->
  12064. Firing prefer*rvt*predict-yes*H0*3*H1
  12065. -->
  12066. Firing elaborate*copy-see-to-output-link
  12067. -->
  12068. (I3 ^see 0 +)
  12069. Firing elaborate*reward*based*on*reward
  12070. -->
  12071. (R989 ^value 1 +)
  12072. (R1 ^reward R989 +)
  12073. Firing propose*predict-yes
  12074. -->
  12075. (O1971 ^name predict-yes +)
  12076. (S1 ^operator O1971 +)
  12077. Firing propose*predict-no
  12078. -->
  12079. (O1972 ^name predict-no +)
  12080. (S1 ^operator O1972 +)
  12081. Firing rl*prefer*rvt*predict-no*H0*4
  12082. -->
  12083. (S1 ^operator O1970 = 0.3145020978774952)
  12084. Firing rl*prefer*rvt*predict-yes*H0*3
  12085. -->
  12086. (S1 ^operator O1969 = 0.390775231823802)
  12087. Firing prefer*rvt*predict-yes*H0
  12088. -->
  12089. Firing prefer*rvt*predict-no*H0
  12090. -->
  12091. Firing elaborate*copy-dir-to-output-link
  12092. -->
  12093. (I3 ^dir L +)
  12094. inner elaboration loop at bottom goal.
  12095. Retracting elaborate*copy-see-to-output-link
  12096. -->
  12097. (I3 ^see 1 +)
  12098. Retracting propose*predict-no
  12099. -->
  12100. (O1970 ^name predict-no +)
  12101. (S1 ^operator O1970 +)
  12102. Retracting propose*predict-yes
  12103. -->
  12104. (O1969 ^name predict-yes +)
  12105. (S1 ^operator O1969 +)
  12106. Retracting elaborate*reward*based*on*reward
  12107. -->
  12108. (R988 ^value 1 +)
  12109. (R1 ^reward R988 +)
  12110. Retracting elaborate*copy-dir-to-output-link
  12111. -->
  12112. (I3 ^dir U +)
  12113. Retracting rl*prefer*rvt*predict-no*H0*2
  12114. -->
  12115. (S1 ^operator O1970 = 1.)
  12116. Retracting rl*prefer*rvt*predict-yes*H0*1
  12117. -->
  12118. (S1 ^operator O1969 = 0.)
  12119. =>WM: (13841: S1 ^operator O1972 +)
  12120. =>WM: (13840: S1 ^operator O1971 +)
  12121. =>WM: (13839: I3 ^dir L)
  12122. =>WM: (13838: O1972 ^name predict-no)
  12123. =>WM: (13837: O1971 ^name predict-yes)
  12124. =>WM: (13836: R989 ^value 1)
  12125. =>WM: (13835: R1 ^reward R989)
  12126. =>WM: (13834: I3 ^see 0)
  12127. <=WM: (13825: S1 ^operator O1969 +)
  12128. <=WM: (13826: S1 ^operator O1970 +)
  12129. <=WM: (13827: S1 ^operator O1970)
  12130. <=WM: (13824: I3 ^dir U)
  12131. <=WM: (13820: R1 ^reward R988)
  12132. <=WM: (13819: I3 ^see 1)
  12133. <=WM: (13823: O1970 ^name predict-no)
  12134. <=WM: (13822: O1969 ^name predict-yes)
  12135. <=WM: (13821: R988 ^value 1)
  12136. --- Inner Elaboration Phase, active level 1 (S1) ---
  12137. Firing prefer*rvt*predict-yes*H0
  12138. -->
  12139. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  12140. -->
  12141. (S1 ^operator O1971 = 0.6093180204125221)
  12142. Firing rl*prefer*rvt*predict-yes*H0*3
  12143. -->
  12144. (S1 ^operator O1971 = 0.390775231823802)
  12145. Firing prefer*rvt*predict-yes*H0*3*H1
  12146. -->
  12147. Firing prefer*rvt*predict-no*H0
  12148. -->
  12149. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  12150. -->
  12151. (S1 ^operator O1972 = -0.168718511744511)
  12152. Firing rl*prefer*rvt*predict-no*H0*4
  12153. -->
  12154. (S1 ^operator O1972 = 0.3145020978774952)
  12155. Firing prefer*rvt*predict-no*H0*4*H1
  12156. -->
  12157. inner elaboration loop at bottom goal.
  12158. Retracting rl*prefer*rvt*predict-no*H0*4
  12159. -->
  12160. (S1 ^operator O1970 = 0.3145020978774952)
  12161. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  12162. -->
  12163. (S1 ^operator O1970 = -0.168718511744511)
  12164. Retracting rl*prefer*rvt*predict-yes*H0*3
  12165. -->
  12166. (S1 ^operator O1969 = 0.390775231823802)
  12167. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  12168. -->
  12169. (S1 ^operator O1969 = 0.6093180204125221)
  12170. --- END Proposal Phase ---
  12171. --- Decision Phase ---
  12172. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12173. =>WM: (13842: S1 ^operator O1971)
  12174. 986: O: O1971 (predict-yes)
  12175. --- END Decision Phase ---
  12176. --- Application Phase ---
  12177. --- Firing Productions (PE) For State At Depth 1 ---
  12178. --- Inner Elaboration Phase, active level 1 (S1) ---
  12179. Firing apply*operator
  12180. -->
  12181. (I3 ^predict-yes N986 + :O )
  12182. Firing apply*operator*complete
  12183. -->
  12184. (I3 ^predict-no N985 - :O )
  12185. inner elaboration loop at bottom goal.
  12186. --- Change Working Memory (PE) ---
  12187. =>WM: (13843: I3 ^predict-yes N986)
  12188. <=WM: (13829: N985 ^status complete)
  12189. <=WM: (13828: I3 ^predict-no N985)
  12190. --- Firing Productions (IE) For State At Depth 1 ---
  12191. --- Inner Elaboration Phase, active level 1 (S1) ---
  12192. Firing monitor*world
  12193. -->
  12194. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12195. --- Change Working Memory (IE) ---
  12196. --- END Application Phase ---
  12197. --- Output Phase ---
  12198. ENV: Agent did: predict-yes for direction L in state State-B
  12199. In State-B moving L
  12200. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  12201. predict error 0
  12202. dir: dir isL
  12203. --- END Output Phase ---
  12204. \-/--- Input Phase ---
  12205. =>WM: (13847: I2 ^dir L)
  12206. =>WM: (13846: I2 ^reward 1)
  12207. =>WM: (13845: I2 ^see 1)
  12208. =>WM: (13844: N986 ^status complete)
  12209. <=WM: (13832: I2 ^dir L)
  12210. <=WM: (13831: I2 ^reward 1)
  12211. <=WM: (13830: I2 ^see 0)
  12212. =>WM: (13848: I2 ^level-1 L1-root)
  12213. <=WM: (13833: I2 ^level-1 R1-root)
  12214. --- END Input Phase ---
  12215. --- Proposal Phase ---
  12216. --- Inner Elaboration Phase, active level 1 (S1) ---
  12217. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  12218. -->
  12219. (S1 ^operator O1971 = -0.2062723012911647)
  12220. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  12221. -->
  12222. (S1 ^operator O1972 = 0.6855369815787629)
  12223. Firing prefer*rvt*predict-no*H0*4*H1
  12224. -->
  12225. Firing prefer*rvt*predict-yes*H0*3*H1
  12226. -->
  12227. Firing elaborate*copy-see-to-output-link
  12228. -->
  12229. (I3 ^see 1 +)
  12230. Firing elaborate*reward*based*on*reward
  12231. -->
  12232. (R990 ^value 1 +)
  12233. (R1 ^reward R990 +)
  12234. Firing propose*predict-yes
  12235. -->
  12236. (O1973 ^name predict-yes +)
  12237. (S1 ^operator O1973 +)
  12238. Firing propose*predict-no
  12239. -->
  12240. (O1974 ^name predict-no +)
  12241. (S1 ^operator O1974 +)
  12242. Firing rl*prefer*rvt*predict-no*H0*4
  12243. -->
  12244. (S1 ^operator O1972 = 0.3145020978774952)
  12245. Firing rl*prefer*rvt*predict-yes*H0*3
  12246. -->
  12247. (S1 ^operator O1971 = 0.390775231823802)
  12248. Firing prefer*rvt*predict-yes*H0
  12249. -->
  12250. Firing prefer*rvt*predict-no*H0
  12251. -->
  12252. Firing elaborate*copy-dir-to-output-link
  12253. -->
  12254. (I3 ^dir L +)
  12255. inner elaboration loop at bottom goal.
  12256. Retracting elaborate*copy-see-to-output-link
  12257. -->
  12258. (I3 ^see 0 +)
  12259. Retracting propose*predict-no
  12260. -->
  12261. (O1972 ^name predict-no +)
  12262. (S1 ^operator O1972 +)
  12263. Retracting propose*predict-yes
  12264. -->
  12265. (O1971 ^name predict-yes +)
  12266. (S1 ^operator O1971 +)
  12267. Retracting elaborate*reward*based*on*reward
  12268. -->
  12269. (R989 ^value 1 +)
  12270. (R1 ^reward R989 +)
  12271. Retracting elaborate*copy-dir-to-output-link
  12272. -->
  12273. (I3 ^dir L +)
  12274. Retracting rl*prefer*rvt*predict-no*H0*4
  12275. -->
  12276. (S1 ^operator O1972 = 0.3145020978774952)
  12277. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  12278. -->
  12279. (S1 ^operator O1972 = -0.168718511744511)
  12280. Retracting rl*prefer*rvt*predict-yes*H0*3
  12281. -->
  12282. (S1 ^operator O1971 = 0.390775231823802)
  12283. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  12284. -->
  12285. (S1 ^operator O1971 = 0.6093180204125221)
  12286. =>WM: (13855: S1 ^operator O1974 +)
  12287. =>WM: (13854: S1 ^operator O1973 +)
  12288. =>WM: (13853: O1974 ^name predict-no)
  12289. =>WM: (13852: O1973 ^name predict-yes)
  12290. =>WM: (13851: R990 ^value 1)
  12291. =>WM: (13850: R1 ^reward R990)
  12292. =>WM: (13849: I3 ^see 1)
  12293. <=WM: (13840: S1 ^operator O1971 +)
  12294. <=WM: (13842: S1 ^operator O1971)
  12295. <=WM: (13841: S1 ^operator O1972 +)
  12296. <=WM: (13835: R1 ^reward R989)
  12297. <=WM: (13834: I3 ^see 0)
  12298. <=WM: (13838: O1972 ^name predict-no)
  12299. <=WM: (13837: O1971 ^name predict-yes)
  12300. <=WM: (13836: R989 ^value 1)
  12301. --- Inner Elaboration Phase, active level 1 (S1) ---
  12302. Firing prefer*rvt*predict-yes*H0
  12303. -->
  12304. Firing rl*prefer*rvt*predict-yes*H0*3
  12305. -->
  12306. (S1 ^operator O1973 = 0.390775231823802)
  12307. Firing prefer*rvt*predict-yes*H0*3*H1
  12308. -->
  12309. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  12310. -->
  12311. (S1 ^operator O1973 = -0.2062723012911647)
  12312. Firing prefer*rvt*predict-no*H0
  12313. -->
  12314. Firing rl*prefer*rvt*predict-no*H0*4
  12315. -->
  12316. (S1 ^operator O1974 = 0.3145020978774952)
  12317. Firing prefer*rvt*predict-no*H0*4*H1
  12318. -->
  12319. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  12320. -->
  12321. (S1 ^operator O1974 = 0.6855369815787629)
  12322. inner elaboration loop at bottom goal.
  12323. Retracting rl*prefer*rvt*predict-no*H0*4
  12324. -->
  12325. (S1 ^operator O1972 = 0.3145020978774952)
  12326. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  12327. -->
  12328. (S1 ^operator O1972 = 0.6855369815787629)
  12329. Retracting rl*prefer*rvt*predict-yes*H0*3
  12330. -->
  12331. (S1 ^operator O1971 = 0.390775231823802)
  12332. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  12333. -->
  12334. (S1 ^operator O1971 = -0.2062723012911647)
  12335. --- END Proposal Phase ---
  12336. --- Decision Phase ---
  12337. RL update rl*prefer*rvt*predict-yes*H0*3 0.472322 -0.0815463 0.390775 -> 0.472315 -0.0815474 0.390768(R,m,v=1,0.943396,0.0537378)
  12338. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527758 0.0815601 0.609318 -> 0.52775 0.0815588 0.609309(R,m,v=1,1,0)
  12339. =>WM: (13856: S1 ^operator O1974)
  12340. 987: O: O1974 (predict-no)
  12341. --- END Decision Phase ---
  12342. --- Application Phase ---
  12343. --- Firing Productions (PE) For State At Depth 1 ---
  12344. --- Inner Elaboration Phase, active level 1 (S1) ---
  12345. Firing apply*operator
  12346. -->
  12347. (I3 ^predict-no N987 + :O )
  12348. Firing apply*operator*complete
  12349. -->
  12350. (I3 ^predict-yes N986 - :O )
  12351. inner elaboration loop at bottom goal.
  12352. --- Change Working Memory (PE) ---
  12353. =>WM: (13857: I3 ^predict-no N987)
  12354. <=WM: (13844: N986 ^status complete)
  12355. <=WM: (13843: I3 ^predict-yes N986)
  12356. --- Firing Productions (IE) For State At Depth 1 ---
  12357. --- Inner Elaboration Phase, active level 1 (S1) ---
  12358. Firing monitor*world
  12359. -->
  12360. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12361. --- Change Working Memory (IE) ---
  12362. --- END Application Phase ---
  12363. --- Output Phase ---
  12364. ENV: Agent did: predict-no for direction L in state State-A
  12365. In State-A moving L
  12366. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  12367. predict error 0
  12368. dir: dir isR
  12369. --- END Output Phase ---
  12370. |\---- Input Phase ---
  12371. =>WM: (13861: I2 ^dir R)
  12372. =>WM: (13860: I2 ^reward 1)
  12373. =>WM: (13859: I2 ^see 0)
  12374. =>WM: (13858: N987 ^status complete)
  12375. <=WM: (13847: I2 ^dir L)
  12376. <=WM: (13846: I2 ^reward 1)
  12377. <=WM: (13845: I2 ^see 1)
  12378. =>WM: (13862: I2 ^level-1 L0-root)
  12379. <=WM: (13848: I2 ^level-1 L1-root)
  12380. --- END Input Phase ---
  12381. --- Proposal Phase ---
  12382. --- Inner Elaboration Phase, active level 1 (S1) ---
  12383. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  12384. -->
  12385. (S1 ^operator O1973 = 0.8783944900614931)
  12386. Firing prefer*rvt*predict-yes*H0*5*H1
  12387. -->
  12388. Firing elaborate*copy-see-to-output-link
  12389. -->
  12390. (I3 ^see 0 +)
  12391. Firing elaborate*reward*based*on*reward
  12392. -->
  12393. (R991 ^value 1 +)
  12394. (R1 ^reward R991 +)
  12395. Firing propose*predict-yes
  12396. -->
  12397. (O1975 ^name predict-yes +)
  12398. (S1 ^operator O1975 +)
  12399. Firing propose*predict-no
  12400. -->
  12401. (O1976 ^name predict-no +)
  12402. (S1 ^operator O1976 +)
  12403. Firing rl*prefer*rvt*predict-no*H0*6
  12404. -->
  12405. (S1 ^operator O1974 = 0.9999841575438704)
  12406. Firing rl*prefer*rvt*predict-yes*H0*5
  12407. -->
  12408. (S1 ^operator O1973 = 0.1215983654449722)
  12409. Firing prefer*rvt*predict-yes*H0
  12410. -->
  12411. Firing prefer*rvt*predict-no*H0
  12412. -->
  12413. Firing elaborate*copy-dir-to-output-link
  12414. -->
  12415. (I3 ^dir R +)
  12416. inner elaboration loop at bottom goal.
  12417. Retracting elaborate*copy-see-to-output-link
  12418. -->
  12419. (I3 ^see 1 +)
  12420. Retracting propose*predict-no
  12421. -->
  12422. (O1974 ^name predict-no +)
  12423. (S1 ^operator O1974 +)
  12424. Retracting propose*predict-yes
  12425. -->
  12426. (O1973 ^name predict-yes +)
  12427. (S1 ^operator O1973 +)
  12428. Retracting elaborate*reward*based*on*reward
  12429. -->
  12430. (R990 ^value 1 +)
  12431. (R1 ^reward R990 +)
  12432. Retracting elaborate*copy-dir-to-output-link
  12433. -->
  12434. (I3 ^dir L +)
  12435. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  12436. -->
  12437. (S1 ^operator O1974 = 0.6855369815787629)
  12438. Retracting rl*prefer*rvt*predict-no*H0*4
  12439. -->
  12440. (S1 ^operator O1974 = 0.3145020978774952)
  12441. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  12442. -->
  12443. (S1 ^operator O1973 = -0.2062723012911647)
  12444. Retracting rl*prefer*rvt*predict-yes*H0*3
  12445. -->
  12446. (S1 ^operator O1973 = 0.3907675490335307)
  12447. =>WM: (13870: S1 ^operator O1976 +)
  12448. =>WM: (13869: S1 ^operator O1975 +)
  12449. =>WM: (13868: I3 ^dir R)
  12450. =>WM: (13867: O1976 ^name predict-no)
  12451. =>WM: (13866: O1975 ^name predict-yes)
  12452. =>WM: (13865: R991 ^value 1)
  12453. =>WM: (13864: R1 ^reward R991)
  12454. =>WM: (13863: I3 ^see 0)
  12455. <=WM: (13854: S1 ^operator O1973 +)
  12456. <=WM: (13855: S1 ^operator O1974 +)
  12457. <=WM: (13856: S1 ^operator O1974)
  12458. <=WM: (13839: I3 ^dir L)
  12459. <=WM: (13850: R1 ^reward R990)
  12460. <=WM: (13849: I3 ^see 1)
  12461. <=WM: (13853: O1974 ^name predict-no)
  12462. <=WM: (13852: O1973 ^name predict-yes)
  12463. <=WM: (13851: R990 ^value 1)
  12464. --- Inner Elaboration Phase, active level 1 (S1) ---
  12465. Firing prefer*rvt*predict-yes*H0
  12466. -->
  12467. Firing rl*prefer*rvt*predict-yes*H0*5
  12468. -->
  12469. (S1 ^operator O1975 = 0.1215983654449722)
  12470. Firing prefer*rvt*predict-yes*H0*5*H1
  12471. -->
  12472. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  12473. -->
  12474. (S1 ^operator O1975 = 0.8783944900614931)
  12475. Firing prefer*rvt*predict-no*H0
  12476. -->
  12477. Firing rl*prefer*rvt*predict-no*H0*6
  12478. -->
  12479. (S1 ^operator O1976 = 0.9999841575438704)
  12480. inner elaboration loop at bottom goal.
  12481. Retracting rl*prefer*rvt*predict-no*H0*6
  12482. -->
  12483. (S1 ^operator O1974 = 0.9999841575438704)
  12484. Retracting rl*prefer*rvt*predict-yes*H0*5
  12485. -->
  12486. (S1 ^operator O1973 = 0.1215983654449722)
  12487. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  12488. -->
  12489. (S1 ^operator O1973 = 0.8783944900614931)
  12490. --- END Proposal Phase ---
  12491. --- Decision Phase ---
  12492. RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314502 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.922078,0.0723198)
  12493. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521485 0.164052 0.685537 -> 0.521482 0.164052 0.685533(R,m,v=1,1,0)
  12494. =>WM: (13871: S1 ^operator O1975)
  12495. 988: O: O1975 (predict-yes)
  12496. --- END Decision Phase ---
  12497. --- Application Phase ---
  12498. --- Firing Productions (PE) For State At Depth 1 ---
  12499. --- Inner Elaboration Phase, active level 1 (S1) ---
  12500. Firing apply*operator
  12501. -->
  12502. (I3 ^predict-yes N988 + :O )
  12503. Firing apply*operator*complete
  12504. -->
  12505. (I3 ^predict-no N987 - :O )
  12506. inner elaboration loop at bottom goal.
  12507. --- Change Working Memory (PE) ---
  12508. =>WM: (13872: I3 ^predict-yes N988)
  12509. <=WM: (13858: N987 ^status complete)
  12510. <=WM: (13857: I3 ^predict-no N987)
  12511. --- Firing Productions (IE) For State At Depth 1 ---
  12512. --- Inner Elaboration Phase, active level 1 (S1) ---
  12513. Firing monitor*world
  12514. -->
  12515. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  12516. --- Change Working Memory (IE) ---
  12517. --- END Application Phase ---
  12518. --- Output Phase ---
  12519. ENV: Agent did: predict-yes for direction R in state State-A
  12520. In State-A moving R
  12521. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  12522. predict error 0
  12523. dir: dir isR
  12524. --- END Output Phase ---
  12525. /|\--- Input Phase ---
  12526. =>WM: (13876: I2 ^dir R)
  12527. =>WM: (13875: I2 ^reward 1)
  12528. =>WM: (13874: I2 ^see 1)
  12529. =>WM: (13873: N988 ^status complete)
  12530. <=WM: (13861: I2 ^dir R)
  12531. <=WM: (13860: I2 ^reward 1)
  12532. <=WM: (13859: I2 ^see 0)
  12533. =>WM: (13877: I2 ^level-1 R1-root)
  12534. <=WM: (13862: I2 ^level-1 L0-root)
  12535. --- END Input Phase ---
  12536. --- Proposal Phase ---
  12537. --- Inner Elaboration Phase, active level 1 (S1) ---
  12538. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  12539. -->
  12540. (S1 ^operator O1975 = -0.04253361215288998)
  12541. Firing prefer*rvt*predict-yes*H0*5*H1
  12542. -->
  12543. Firing elaborate*copy-see-to-output-link
  12544. -->
  12545. (I3 ^see 1 +)
  12546. Firing elaborate*reward*based*on*reward
  12547. -->
  12548. (R992 ^value 1 +)
  12549. (R1 ^reward R992 +)
  12550. Firing propose*predict-yes
  12551. -->
  12552. (O1977 ^name predict-yes +)
  12553. (S1 ^operator O1977 +)
  12554. Firing propose*predict-no
  12555. -->
  12556. (O1978 ^name predict-no +)
  12557. (S1 ^operator O1978 +)
  12558. Firing rl*prefer*rvt*predict-no*H0*6
  12559. -->
  12560. (S1 ^operator O1976 = 0.9999841575438704)
  12561. Firing rl*prefer*rvt*predict-yes*H0*5
  12562. -->
  12563. (S1 ^operator O1975 = 0.1215983654449722)
  12564. Firing prefer*rvt*predict-yes*H0
  12565. -->
  12566. Firing prefer*rvt*predict-no*H0
  12567. -->
  12568. Firing elaborate*copy-dir-to-output-link
  12569. -->
  12570. (I3 ^dir R +)
  12571. inner elaboration loop at bottom goal.
  12572. Retracting elaborate*copy-see-to-output-link
  12573. -->
  12574. (I3 ^see 0 +)
  12575. Retracting propose*predict-no
  12576. -->
  12577. (O1976 ^name predict-no +)
  12578. (S1 ^operator O1976 +)
  12579. Retracting propose*predict-yes
  12580. -->
  12581. (O1975 ^name predict-yes +)
  12582. (S1 ^operator O1975 +)
  12583. Retracting elaborate*reward*based*on*reward
  12584. -->
  12585. (R991 ^value 1 +)
  12586. (R1 ^reward R991 +)
  12587. Retracting elaborate*copy-dir-to-output-link
  12588. -->
  12589. (I3 ^dir R +)
  12590. Retracting rl*prefer*rvt*predict-no*H0*6
  12591. -->
  12592. (S1 ^operator O1976 = 0.9999841575438704)
  12593. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  12594. -->
  12595. (S1 ^operator O1975 = 0.8783944900614931)
  12596. Retracting rl*prefer*rvt*predict-yes*H0*5
  12597. -->
  12598. (S1 ^operator O1975 = 0.1215983654449722)
  12599. =>WM: (13884: S1 ^operator O1978 +)
  12600. =>WM: (13883: S1 ^operator O1977 +)
  12601. =>WM: (13882: O1978 ^name predict-no)
  12602. =>WM: (13881: O1977 ^name predict-yes)
  12603. =>WM: (13880: R992 ^value 1)
  12604. =>WM: (13879: R1 ^reward R992)
  12605. =>WM: (13878: I3 ^see 1)
  12606. <=WM: (13869: S1 ^operator O1975 +)
  12607. <=WM: (13871: S1 ^operator O1975)
  12608. <=WM: (13870: S1 ^operator O1976 +)
  12609. <=WM: (13864: R1 ^reward R991)
  12610. <=WM: (13863: I3 ^see 0)
  12611. <=WM: (13867: O1976 ^name predict-no)
  12612. <=WM: (13866: O1975 ^name predict-yes)
  12613. <=WM: (13865: R991 ^value 1)
  12614. --- Inner Elaboration Phase, active level 1 (S1) ---
  12615. Firing prefer*rvt*predict-yes*H0
  12616. -->
  12617. Firing rl*prefer*rvt*predict-yes*H0*5
  12618. -->
  12619. (S1 ^operator O1977 = 0.1215983654449722)
  12620. Firing prefer*rvt*predict-yes*H0*5*H1
  12621. -->
  12622. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  12623. -->
  12624. (S1 ^operator O1977 = -0.04253361215288998)
  12625. Firing prefer*rvt*predict-no*H0
  12626. -->
  12627. Firing rl*prefer*rvt*predict-no*H0*6
  12628. -->
  12629. (S1 ^operator O1978 = 0.9999841575438704)
  12630. inner elaboration loop at bottom goal.
  12631. Retracting rl*prefer*rvt*predict-no*H0*6
  12632. -->
  12633. (S1 ^operator O1976 = 0.9999841575438704)
  12634. Retracting rl*prefer*rvt*predict-yes*H0*5
  12635. -->
  12636. (S1 ^operator O1975 = 0.1215983654449722)
  12637. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  12638. -->
  12639. (S1 ^operator O1975 = -0.04253361215288998)
  12640. --- END Proposal Phase ---
  12641. --- Decision Phase ---
  12642. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.863636,0.118442)
  12643. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878394 -> 0.46547 0.412925 0.878395(R,m,v=1,1,0)
  12644. =>WM: (13885: S1 ^operator O1978)
  12645. 989: O: O1978 (predict-no)
  12646. --- END Decision Phase ---
  12647. --- Application Phase ---
  12648. --- Firing Productions (PE) For State At Depth 1 ---
  12649. --- Inner Elaboration Phase, active level 1 (S1) ---
  12650. Firing apply*operator
  12651. -->
  12652. (I3 ^predict-no N989 + :O )
  12653. Firing apply*operator*complete
  12654. -->
  12655. (I3 ^predict-yes N988 - :O )
  12656. inner elaboration loop at bottom goal.
  12657. --- Change Working Memory (PE) ---
  12658. =>WM: (13886: I3 ^predict-no N989)
  12659. <=WM: (13873: N988 ^status complete)
  12660. <=WM: (13872: I3 ^predict-yes N988)
  12661. --- Firing Productions (IE) For State At Depth 1 ---
  12662. --- Inner Elaboration Phase, active level 1 (S1) ---
  12663. Firing monitor*world
  12664. -->
  12665. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12666. --- Change Working Memory (IE) ---
  12667. --- END Application Phase ---
  12668. --- Output Phase ---
  12669. ENV: Agent did: predict-no for direction R in state State-B
  12670. In State-B moving R
  12671. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12672. predict error 0
  12673. dir: dir isU
  12674. --- END Output Phase ---
  12675. -/--- Input Phase ---
  12676. =>WM: (13890: I2 ^dir U)
  12677. =>WM: (13889: I2 ^reward 1)
  12678. =>WM: (13888: I2 ^see 0)
  12679. =>WM: (13887: N989 ^status complete)
  12680. <=WM: (13876: I2 ^dir R)
  12681. <=WM: (13875: I2 ^reward 1)
  12682. <=WM: (13874: I2 ^see 1)
  12683. =>WM: (13891: I2 ^level-1 R0-root)
  12684. <=WM: (13877: I2 ^level-1 R1-root)
  12685. --- END Input Phase ---
  12686. --- Proposal Phase ---
  12687. --- Inner Elaboration Phase, active level 1 (S1) ---
  12688. Firing elaborate*copy-see-to-output-link
  12689. -->
  12690. (I3 ^see 0 +)
  12691. Firing elaborate*reward*based*on*reward
  12692. -->
  12693. (R993 ^value 1 +)
  12694. (R1 ^reward R993 +)
  12695. Firing propose*predict-yes
  12696. -->
  12697. (O1979 ^name predict-yes +)
  12698. (S1 ^operator O1979 +)
  12699. Firing propose*predict-no
  12700. -->
  12701. (O1980 ^name predict-no +)
  12702. (S1 ^operator O1980 +)
  12703. Firing rl*prefer*rvt*predict-no*H0*2
  12704. -->
  12705. (S1 ^operator O1978 = 1.)
  12706. Firing rl*prefer*rvt*predict-yes*H0*1
  12707. -->
  12708. (S1 ^operator O1977 = 0.)
  12709. Firing prefer*rvt*predict-yes*H0
  12710. -->
  12711. Firing prefer*rvt*predict-no*H0
  12712. -->
  12713. Firing elaborate*copy-dir-to-output-link
  12714. -->
  12715. (I3 ^dir U +)
  12716. inner elaboration loop at bottom goal.
  12717. Retracting elaborate*copy-see-to-output-link
  12718. -->
  12719. (I3 ^see 1 +)
  12720. Retracting propose*predict-no
  12721. -->
  12722. (O1978 ^name predict-no +)
  12723. (S1 ^operator O1978 +)
  12724. Retracting propose*predict-yes
  12725. -->
  12726. (O1977 ^name predict-yes +)
  12727. (S1 ^operator O1977 +)
  12728. Retracting elaborate*reward*based*on*reward
  12729. -->
  12730. (R992 ^value 1 +)
  12731. (R1 ^reward R992 +)
  12732. Retracting elaborate*copy-dir-to-output-link
  12733. -->
  12734. (I3 ^dir R +)
  12735. Retracting rl*prefer*rvt*predict-no*H0*6
  12736. -->
  12737. (S1 ^operator O1978 = 0.9999841575438704)
  12738. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  12739. -->
  12740. (S1 ^operator O1977 = -0.04253361215288998)
  12741. Retracting rl*prefer*rvt*predict-yes*H0*5
  12742. -->
  12743. (S1 ^operator O1977 = 0.1215989443698621)
  12744. =>WM: (13899: S1 ^operator O1980 +)
  12745. =>WM: (13898: S1 ^operator O1979 +)
  12746. =>WM: (13897: I3 ^dir U)
  12747. =>WM: (13896: O1980 ^name predict-no)
  12748. =>WM: (13895: O1979 ^name predict-yes)
  12749. =>WM: (13894: R993 ^value 1)
  12750. =>WM: (13893: R1 ^reward R993)
  12751. =>WM: (13892: I3 ^see 0)
  12752. <=WM: (13883: S1 ^operator O1977 +)
  12753. <=WM: (13884: S1 ^operator O1978 +)
  12754. <=WM: (13885: S1 ^operator O1978)
  12755. <=WM: (13868: I3 ^dir R)
  12756. <=WM: (13879: R1 ^reward R992)
  12757. <=WM: (13878: I3 ^see 1)
  12758. <=WM: (13882: O1978 ^name predict-no)
  12759. <=WM: (13881: O1977 ^name predict-yes)
  12760. <=WM: (13880: R992 ^value 1)
  12761. --- Inner Elaboration Phase, active level 1 (S1) ---
  12762. Firing prefer*rvt*predict-yes*H0
  12763. -->
  12764. Firing rl*prefer*rvt*predict-yes*H0*1
  12765. -->
  12766. (S1 ^operator O1979 = 0.)
  12767. Firing prefer*rvt*predict-no*H0
  12768. -->
  12769. Firing rl*prefer*rvt*predict-no*H0*2
  12770. -->
  12771. (S1 ^operator O1980 = 1.)
  12772. inner elaboration loop at bottom goal.
  12773. Retracting rl*prefer*rvt*predict-no*H0*2
  12774. -->
  12775. (S1 ^operator O1978 = 1.)
  12776. Retracting rl*prefer*rvt*predict-yes*H0*1
  12777. -->
  12778. (S1 ^operator O1977 = 0.)
  12779. --- END Proposal Phase ---
  12780. --- Decision Phase ---
  12781. RL update rl*prefer*rvt*predict-no*H0*6 0.999984 0 0.999984 -> 0.999987 0 0.999987(R,m,v=1,0.9375,0.0589286)
  12782. =>WM: (13900: S1 ^operator O1980)
  12783. 990: O: O1980 (predict-no)
  12784. --- END Decision Phase ---
  12785. --- Application Phase ---
  12786. --- Firing Productions (PE) For State At Depth 1 ---
  12787. --- Inner Elaboration Phase, active level 1 (S1) ---
  12788. Firing apply*operator
  12789. -->
  12790. (I3 ^predict-no N990 + :O )
  12791. Firing apply*operator*complete
  12792. -->
  12793. (I3 ^predict-no N989 - :O )
  12794. inner elaboration loop at bottom goal.
  12795. --- Change Working Memory (PE) ---
  12796. =>WM: (13901: I3 ^predict-no N990)
  12797. <=WM: (13887: N989 ^status complete)
  12798. <=WM: (13886: I3 ^predict-no N989)
  12799. --- Firing Productions (IE) For State At Depth 1 ---
  12800. --- Inner Elaboration Phase, active level 1 (S1) ---
  12801. Firing monitor*world
  12802. -->
  12803. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12804. --- Change Working Memory (IE) ---
  12805. --- END Application Phase ---
  12806. --- Output Phase ---
  12807. ENV: Agent did: predict-no for direction U in state State-B
  12808. In State-B moving U
  12809. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12810. predict error 0
  12811. dir: dir isR
  12812. --- END Output Phase ---
  12813. |\---- Input Phase ---
  12814. =>WM: (13905: I2 ^dir R)
  12815. =>WM: (13904: I2 ^reward 1)
  12816. =>WM: (13903: I2 ^see 0)
  12817. =>WM: (13902: N990 ^status complete)
  12818. <=WM: (13890: I2 ^dir U)
  12819. <=WM: (13889: I2 ^reward 1)
  12820. <=WM: (13888: I2 ^see 0)
  12821. =>WM: (13906: I2 ^level-1 R0-root)
  12822. <=WM: (13891: I2 ^level-1 R0-root)
  12823. --- END Input Phase ---
  12824. --- Proposal Phase ---
  12825. --- Inner Elaboration Phase, active level 1 (S1) ---
  12826. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  12827. -->
  12828. (S1 ^operator O1979 = -0.1512366769350551)
  12829. Firing prefer*rvt*predict-yes*H0*5*H1
  12830. -->
  12831. Firing elaborate*copy-see-to-output-link
  12832. -->
  12833. (I3 ^see 0 +)
  12834. Firing elaborate*reward*based*on*reward
  12835. -->
  12836. (R994 ^value 1 +)
  12837. (R1 ^reward R994 +)
  12838. Firing propose*predict-yes
  12839. -->
  12840. (O1981 ^name predict-yes +)
  12841. (S1 ^operator O1981 +)
  12842. Firing propose*predict-no
  12843. -->
  12844. (O1982 ^name predict-no +)
  12845. (S1 ^operator O1982 +)
  12846. Firing rl*prefer*rvt*predict-no*H0*6
  12847. -->
  12848. (S1 ^operator O1980 = 0.9999867250014868)
  12849. Firing rl*prefer*rvt*predict-yes*H0*5
  12850. -->
  12851. (S1 ^operator O1979 = 0.1215989443698621)
  12852. Firing prefer*rvt*predict-yes*H0
  12853. -->
  12854. Firing prefer*rvt*predict-no*H0
  12855. -->
  12856. Firing elaborate*copy-dir-to-output-link
  12857. -->
  12858. (I3 ^dir R +)
  12859. inner elaboration loop at bottom goal.
  12860. Retracting elaborate*copy-see-to-output-link
  12861. -->
  12862. (I3 ^see 0 +)
  12863. Retracting propose*predict-no
  12864. -->
  12865. (O1980 ^name predict-no +)
  12866. (S1 ^operator O1980 +)
  12867. Retracting propose*predict-yes
  12868. -->
  12869. (O1979 ^name predict-yes +)
  12870. (S1 ^operator O1979 +)
  12871. Retracting elaborate*reward*based*on*reward
  12872. -->
  12873. (R993 ^value 1 +)
  12874. (R1 ^reward R993 +)
  12875. Retracting elaborate*copy-dir-to-output-link
  12876. -->
  12877. (I3 ^dir U +)
  12878. Retracting rl*prefer*rvt*predict-no*H0*2
  12879. -->
  12880. (S1 ^operator O1980 = 1.)
  12881. Retracting rl*prefer*rvt*predict-yes*H0*1
  12882. -->
  12883. (S1 ^operator O1979 = 0.)
  12884. =>WM: (13913: S1 ^operator O1982 +)
  12885. =>WM: (13912: S1 ^operator O1981 +)
  12886. =>WM: (13911: I3 ^dir R)
  12887. =>WM: (13910: O1982 ^name predict-no)
  12888. =>WM: (13909: O1981 ^name predict-yes)
  12889. =>WM: (13908: R994 ^value 1)
  12890. =>WM: (13907: R1 ^reward R994)
  12891. <=WM: (13898: S1 ^operator O1979 +)
  12892. <=WM: (13899: S1 ^operator O1980 +)
  12893. <=WM: (13900: S1 ^operator O1980)
  12894. <=WM: (13897: I3 ^dir U)
  12895. <=WM: (13893: R1 ^reward R993)
  12896. <=WM: (13896: O1980 ^name predict-no)
  12897. <=WM: (13895: O1979 ^name predict-yes)
  12898. <=WM: (13894: R993 ^value 1)
  12899. --- Inner Elaboration Phase, active level 1 (S1) ---
  12900. Firing prefer*rvt*predict-yes*H0
  12901. -->
  12902. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  12903. -->
  12904. (S1 ^operator O1981 = -0.1512366769350551)
  12905. Firing rl*prefer*rvt*predict-yes*H0*5
  12906. -->
  12907. (S1 ^operator O1981 = 0.1215989443698621)
  12908. Firing prefer*rvt*predict-yes*H0*5*H1
  12909. -->
  12910. Firing prefer*rvt*predict-no*H0
  12911. -->
  12912. Firing rl*prefer*rvt*predict-no*H0*6
  12913. -->
  12914. (S1 ^operator O1982 = 0.9999867250014868)
  12915. inner elaboration loop at bottom goal.
  12916. Retracting rl*prefer*rvt*predict-no*H0*6
  12917. -->
  12918. (S1 ^operator O1980 = 0.9999867250014868)
  12919. Retracting rl*prefer*rvt*predict-yes*H0*5
  12920. -->
  12921. (S1 ^operator O1979 = 0.1215989443698621)
  12922. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  12923. -->
  12924. (S1 ^operator O1979 = -0.1512366769350551)
  12925. --- END Proposal Phase ---
  12926. --- Decision Phase ---
  12927. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  12928. =>WM: (13914: S1 ^operator O1982)
  12929. 991: O: O1982 (predict-no)
  12930. --- END Decision Phase ---
  12931. --- Application Phase ---
  12932. --- Firing Productions (PE) For State At Depth 1 ---
  12933. --- Inner Elaboration Phase, active level 1 (S1) ---
  12934. Firing apply*operator
  12935. -->
  12936. (I3 ^predict-no N991 + :O )
  12937. Firing apply*operator*complete
  12938. -->
  12939. (I3 ^predict-no N990 - :O )
  12940. inner elaboration loop at bottom goal.
  12941. --- Change Working Memory (PE) ---
  12942. =>WM: (13915: I3 ^predict-no N991)
  12943. <=WM: (13902: N990 ^status complete)
  12944. <=WM: (13901: I3 ^predict-no N990)
  12945. --- Firing Productions (IE) For State At Depth 1 ---
  12946. --- Inner Elaboration Phase, active level 1 (S1) ---
  12947. Firing monitor*world
  12948. -->
  12949. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  12950. --- Change Working Memory (IE) ---
  12951. --- END Application Phase ---
  12952. --- Output Phase ---
  12953. ENV: Agent did: predict-no for direction R in state State-B
  12954. In State-B moving R
  12955. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  12956. predict error 0
  12957. dir: dir isU
  12958. --- END Output Phase ---
  12959. --- Input Phase ---
  12960. =>WM: (13919: I2 ^dir U)
  12961. =>WM: (13918: I2 ^reward 1)
  12962. =>WM: (13917: I2 ^see 0)
  12963. =>WM: (13916: N991 ^status complete)
  12964. <=WM: (13905: I2 ^dir R)
  12965. <=WM: (13904: I2 ^reward 1)
  12966. <=WM: (13903: I2 ^see 0)
  12967. =>WM: (13920: I2 ^level-1 R0-root)
  12968. <=WM: (13906: I2 ^level-1 R0-root)
  12969. --- END Input Phase ---
  12970. --- Proposal Phase ---
  12971. --- Inner Elaboration Phase, active level 1 (S1) ---
  12972. Firing elaborate*copy-see-to-output-link
  12973. -->
  12974. (I3 ^see 0 +)
  12975. Firing elaborate*reward*based*on*reward
  12976. -->
  12977. (R995 ^value 1 +)
  12978. (R1 ^reward R995 +)
  12979. Firing propose*predict-yes
  12980. -->
  12981. (O1983 ^name predict-yes +)
  12982. (S1 ^operator O1983 +)
  12983. Firing propose*predict-no
  12984. -->
  12985. (O1984 ^name predict-no +)
  12986. (S1 ^operator O1984 +)
  12987. Firing rl*prefer*rvt*predict-no*H0*2
  12988. -->
  12989. (S1 ^operator O1982 = 1.)
  12990. Firing rl*prefer*rvt*predict-yes*H0*1
  12991. -->
  12992. (S1 ^operator O1981 = 0.)
  12993. Firing prefer*rvt*predict-yes*H0
  12994. -->
  12995. Firing prefer*rvt*predict-no*H0
  12996. -->
  12997. Firing elaborate*copy-dir-to-output-link
  12998. -->
  12999. (I3 ^dir U +)
  13000. inner elaboration loop at bottom goal.
  13001. Retracting elaborate*copy-see-to-output-link
  13002. -->
  13003. (I3 ^see 0 +)
  13004. Retracting propose*predict-no
  13005. -->
  13006. (O1982 ^name predict-no +)
  13007. (S1 ^operator O1982 +)
  13008. Retracting propose*predict-yes
  13009. -->
  13010. (O1981 ^name predict-yes +)
  13011. (S1 ^operator O1981 +)
  13012. Retracting elaborate*reward*based*on*reward
  13013. -->
  13014. (R994 ^value 1 +)
  13015. (R1 ^reward R994 +)
  13016. Retracting elaborate*copy-dir-to-output-link
  13017. -->
  13018. (I3 ^dir R +)
  13019. Retracting rl*prefer*rvt*predict-no*H0*6
  13020. -->
  13021. (S1 ^operator O1982 = 0.9999867250014868)
  13022. Retracting rl*prefer*rvt*predict-yes*H0*5
  13023. -->
  13024. (S1 ^operator O1981 = 0.1215989443698621)
  13025. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  13026. -->
  13027. (S1 ^operator O1981 = -0.1512366769350551)
  13028. =>WM: (13927: S1 ^operator O1984 +)
  13029. =>WM: (13926: S1 ^operator O1983 +)
  13030. =>WM: (13925: I3 ^dir U)
  13031. =>WM: (13924: O1984 ^name predict-no)
  13032. =>WM: (13923: O1983 ^name predict-yes)
  13033. =>WM: (13922: R995 ^value 1)
  13034. =>WM: (13921: R1 ^reward R995)
  13035. <=WM: (13912: S1 ^operator O1981 +)
  13036. <=WM: (13913: S1 ^operator O1982 +)
  13037. <=WM: (13914: S1 ^operator O1982)
  13038. <=WM: (13911: I3 ^dir R)
  13039. <=WM: (13907: R1 ^reward R994)
  13040. <=WM: (13910: O1982 ^name predict-no)
  13041. <=WM: (13909: O1981 ^name predict-yes)
  13042. <=WM: (13908: R994 ^value 1)
  13043. --- Inner Elaboration Phase, active level 1 (S1) ---
  13044. Firing prefer*rvt*predict-yes*H0
  13045. -->
  13046. Firing rl*prefer*rvt*predict-yes*H0*1
  13047. -->
  13048. (S1 ^operator O1983 = 0.)
  13049. Firing prefer*rvt*predict-no*H0
  13050. -->
  13051. Firing rl*prefer*rvt*predict-no*H0*2
  13052. -->
  13053. (S1 ^operator O1984 = 1.)
  13054. inner elaboration loop at bottom goal.
  13055. Retracting rl*prefer*rvt*predict-no*H0*2
  13056. -->
  13057. (S1 ^operator O1982 = 1.)
  13058. Retracting rl*prefer*rvt*predict-yes*H0*1
  13059. -->
  13060. (S1 ^operator O1981 = 0.)
  13061. --- END Proposal Phase ---
  13062. --- Decision Phase ---
  13063. RL update rl*prefer*rvt*predict-no*H0*6 0.999987 0 0.999987 -> 0.999989 0 0.999989(R,m,v=1,0.937853,0.0586158)
  13064. =>WM: (13928: S1 ^operator O1984)
  13065. 992: O: O1984 (predict-no)
  13066. --- END Decision Phase ---
  13067. --- Application Phase ---
  13068. --- Firing Productions (PE) For State At Depth 1 ---
  13069. --- Inner Elaboration Phase, active level 1 (S1) ---
  13070. Firing apply*operator
  13071. -->
  13072. (I3 ^predict-no N992 + :O )
  13073. Firing apply*operator*complete
  13074. -->
  13075. (I3 ^predict-no N991 - :O )
  13076. inner elaboration loop at bottom goal.
  13077. --- Change Working Memory (PE) ---
  13078. =>WM: (13929: I3 ^predict-no N992)
  13079. <=WM: (13916: N991 ^status complete)
  13080. <=WM: (13915: I3 ^predict-no N991)
  13081. --- Firing Productions (IE) For State At Depth 1 ---
  13082. --- Inner Elaboration Phase, active level 1 (S1) ---
  13083. Firing monitor*world
  13084. -->
  13085. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13086. --- Change Working Memory (IE) ---
  13087. --- END Application Phase ---
  13088. --- Output Phase ---
  13089. ENV: Agent did: predict-no for direction U in state State-B
  13090. In State-B moving U
  13091. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  13092. predict error 0
  13093. dir: dir isL
  13094. --- END Output Phase ---
  13095. --- Input Phase ---
  13096. =>WM: (13933: I2 ^dir L)
  13097. =>WM: (13932: I2 ^reward 1)
  13098. =>WM: (13931: I2 ^see 0)
  13099. =>WM: (13930: N992 ^status complete)
  13100. <=WM: (13919: I2 ^dir U)
  13101. <=WM: (13918: I2 ^reward 1)
  13102. <=WM: (13917: I2 ^see 0)
  13103. =>WM: (13934: I2 ^level-1 R0-root)
  13104. <=WM: (13920: I2 ^level-1 R0-root)
  13105. --- END Input Phase ---
  13106. --- Proposal Phase ---
  13107. --- Inner Elaboration Phase, active level 1 (S1) ---
  13108. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13109. -->
  13110. (S1 ^operator O1984 = -0.1984300550322165)
  13111. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13112. -->
  13113. (S1 ^operator O1983 = 0.6091029227055655)
  13114. Firing prefer*rvt*predict-no*H0*4*H1
  13115. -->
  13116. Firing prefer*rvt*predict-yes*H0*3*H1
  13117. -->
  13118. Firing elaborate*copy-see-to-output-link
  13119. -->
  13120. (I3 ^see 0 +)
  13121. Firing elaborate*reward*based*on*reward
  13122. -->
  13123. (R996 ^value 1 +)
  13124. (R1 ^reward R996 +)
  13125. Firing propose*predict-yes
  13126. -->
  13127. (O1985 ^name predict-yes +)
  13128. (S1 ^operator O1985 +)
  13129. Firing propose*predict-no
  13130. -->
  13131. (O1986 ^name predict-no +)
  13132. (S1 ^operator O1986 +)
  13133. Firing rl*prefer*rvt*predict-no*H0*4
  13134. -->
  13135. (S1 ^operator O1984 = 0.3144988611901438)
  13136. Firing rl*prefer*rvt*predict-yes*H0*3
  13137. -->
  13138. (S1 ^operator O1983 = 0.3907675490335307)
  13139. Firing prefer*rvt*predict-yes*H0
  13140. -->
  13141. Firing prefer*rvt*predict-no*H0
  13142. -->
  13143. Firing elaborate*copy-dir-to-output-link
  13144. -->
  13145. (I3 ^dir L +)
  13146. inner elaboration loop at bottom goal.
  13147. Retracting elaborate*copy-see-to-output-link
  13148. -->
  13149. (I3 ^see 0 +)
  13150. Retracting propose*predict-no
  13151. -->
  13152. (O1984 ^name predict-no +)
  13153. (S1 ^operator O1984 +)
  13154. Retracting propose*predict-yes
  13155. -->
  13156. (O1983 ^name predict-yes +)
  13157. (S1 ^operator O1983 +)
  13158. Retracting elaborate*reward*based*on*reward
  13159. -->
  13160. (R995 ^value 1 +)
  13161. (R1 ^reward R995 +)
  13162. Retracting elaborate*copy-dir-to-output-link
  13163. -->
  13164. (I3 ^dir U +)
  13165. Retracting rl*prefer*rvt*predict-no*H0*2
  13166. -->
  13167. (S1 ^operator O1984 = 1.)
  13168. Retracting rl*prefer*rvt*predict-yes*H0*1
  13169. -->
  13170. (S1 ^operator O1983 = 0.)
  13171. =>WM: (13941: S1 ^operator O1986 +)
  13172. =>WM: (13940: S1 ^operator O1985 +)
  13173. =>WM: (13939: I3 ^dir L)
  13174. =>WM: (13938: O1986 ^name predict-no)
  13175. =>WM: (13937: O1985 ^name predict-yes)
  13176. =>WM: (13936: R996 ^value 1)
  13177. =>WM: (13935: R1 ^reward R996)
  13178. <=WM: (13926: S1 ^operator O1983 +)
  13179. <=WM: (13927: S1 ^operator O1984 +)
  13180. <=WM: (13928: S1 ^operator O1984)
  13181. <=WM: (13925: I3 ^dir U)
  13182. <=WM: (13921: R1 ^reward R995)
  13183. <=WM: (13924: O1984 ^name predict-no)
  13184. <=WM: (13923: O1983 ^name predict-yes)
  13185. <=WM: (13922: R995 ^value 1)
  13186. --- Inner Elaboration Phase, active level 1 (S1) ---
  13187. Firing prefer*rvt*predict-yes*H0
  13188. -->
  13189. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  13190. -->
  13191. (S1 ^operator O1985 = 0.6091029227055655)
  13192. Firing rl*prefer*rvt*predict-yes*H0*3
  13193. -->
  13194. (S1 ^operator O1985 = 0.3907675490335307)
  13195. Firing prefer*rvt*predict-yes*H0*3*H1
  13196. -->
  13197. Firing prefer*rvt*predict-no*H0
  13198. -->
  13199. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  13200. -->
  13201. (S1 ^operator O1986 = -0.1984300550322165)
  13202. Firing rl*prefer*rvt*predict-no*H0*4
  13203. -->
  13204. (S1 ^operator O1986 = 0.3144988611901438)
  13205. Firing prefer*rvt*predict-no*H0*4*H1
  13206. -->
  13207. inner elaboration loop at bottom goal.
  13208. Retracting rl*prefer*rvt*predict-no*H0*4
  13209. -->
  13210. (S1 ^operator O1984 = 0.3144988611901438)
  13211. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13212. -->
  13213. (S1 ^operator O1984 = -0.1984300550322165)
  13214. Retracting rl*prefer*rvt*predict-yes*H0*3
  13215. -->
  13216. (S1 ^operator O1983 = 0.3907675490335307)
  13217. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13218. -->
  13219. (S1 ^operator O1983 = 0.6091029227055655)
  13220. --- END Proposal Phase ---
  13221. --- Decision Phase ---
  13222. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13223. =>WM: (13942: S1 ^operator O1985)
  13224. 993: O: O1985 (predict-yes)
  13225. --- END Decision Phase ---
  13226. --- Application Phase ---
  13227. --- Firing Productions (PE) For State At Depth 1 ---
  13228. --- Inner Elaboration Phase, active level 1 (S1) ---
  13229. Firing apply*operator
  13230. -->
  13231. (I3 ^predict-yes N993 + :O )
  13232. Firing apply*operator*complete
  13233. -->
  13234. (I3 ^predict-no N992 - :O )
  13235. inner elaboration loop at bottom goal.
  13236. --- Change Working Memory (PE) ---
  13237. =>WM: (13943: I3 ^predict-yes N993)
  13238. <=WM: (13930: N992 ^status complete)
  13239. <=WM: (13929: I3 ^predict-no N992)
  13240. --- Firing Productions (IE) For State At Depth 1 ---
  13241. --- Inner Elaboration Phase, active level 1 (S1) ---
  13242. Firing monitor*world
  13243. -->
  13244. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  13245. --- Change Working Memory (IE) ---
  13246. --- END Application Phase ---
  13247. --- Output Phase ---
  13248. ENV: Agent did: predict-yes for direction L in state State-B
  13249. In State-B moving L
  13250. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  13251. predict error 0
  13252. dir: dir isU
  13253. --- END Output Phase ---
  13254. --- Input Phase ---
  13255. =>WM: (13947: I2 ^dir U)
  13256. =>WM: (13946: I2 ^reward 1)
  13257. =>WM: (13945: I2 ^see 1)
  13258. =>WM: (13944: N993 ^status complete)
  13259. <=WM: (13933: I2 ^dir L)
  13260. <=WM: (13932: I2 ^reward 1)
  13261. <=WM: (13931: I2 ^see 0)
  13262. =>WM: (13948: I2 ^level-1 L1-root)
  13263. <=WM: (13934: I2 ^level-1 R0-root)
  13264. --- END Input Phase ---
  13265. --- Proposal Phase ---
  13266. --- Inner Elaboration Phase, active level 1 (S1) ---
  13267. Firing elaborate*copy-see-to-output-link
  13268. -->
  13269. (I3 ^see 1 +)
  13270. Firing elaborate*reward*based*on*reward
  13271. -->
  13272. (R997 ^value 1 +)
  13273. (R1 ^reward R997 +)
  13274. Firing propose*predict-yes
  13275. -->
  13276. (O1987 ^name predict-yes +)
  13277. (S1 ^operator O1987 +)
  13278. Firing propose*predict-no
  13279. -->
  13280. (O1988 ^name predict-no +)
  13281. (S1 ^operator O1988 +)
  13282. Firing rl*prefer*rvt*predict-no*H0*2
  13283. -->
  13284. (S1 ^operator O1986 = 1.)
  13285. Firing rl*prefer*rvt*predict-yes*H0*1
  13286. -->
  13287. (S1 ^operator O1985 = 0.)
  13288. Firing prefer*rvt*predict-yes*H0
  13289. -->
  13290. Firing prefer*rvt*predict-no*H0
  13291. -->
  13292. Firing elaborate*copy-dir-to-output-link
  13293. -->
  13294. (I3 ^dir U +)
  13295. inner elaboration loop at bottom goal.
  13296. Retracting elaborate*copy-see-to-output-link
  13297. -->
  13298. (I3 ^see 0 +)
  13299. Retracting propose*predict-no
  13300. -->
  13301. (O1986 ^name predict-no +)
  13302. (S1 ^operator O1986 +)
  13303. Retracting propose*predict-yes
  13304. -->
  13305. (O1985 ^name predict-yes +)
  13306. (S1 ^operator O1985 +)
  13307. Retracting elaborate*reward*based*on*reward
  13308. -->
  13309. (R996 ^value 1 +)
  13310. (R1 ^reward R996 +)
  13311. Retracting elaborate*copy-dir-to-output-link
  13312. -->
  13313. (I3 ^dir L +)
  13314. Retracting rl*prefer*rvt*predict-no*H0*4
  13315. -->
  13316. (S1 ^operator O1986 = 0.3144988611901438)
  13317. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  13318. -->
  13319. (S1 ^operator O1986 = -0.1984300550322165)
  13320. Retracting rl*prefer*rvt*predict-yes*H0*3
  13321. -->
  13322. (S1 ^operator O1985 = 0.3907675490335307)
  13323. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  13324. -->
  13325. (S1 ^operator O1985 = 0.6091029227055655)
  13326. =>WM: (13956: S1 ^operator O1988 +)
  13327. =>WM: (13955: S1 ^operator O1987 +)
  13328. =>WM: (13954: I3 ^dir U)
  13329. =>WM: (13953: O1988 ^name predict-no)
  13330. =>WM: (13952: O1987 ^name predict-yes)
  13331. =>WM: (13951: R997 ^value 1)
  13332. =>WM: (13950: R1 ^reward R997)
  13333. =>WM: (13949: I3 ^see 1)
  13334. <=WM: (13940: S1 ^operator O1985 +)
  13335. <=WM: (13942: S1 ^operator O1985)
  13336. <=WM: (13941: S1 ^operator O1986 +)
  13337. <=WM: (13939: I3 ^dir L)
  13338. <=WM: (13935: R1 ^reward R996)
  13339. <=WM: (13892: I3 ^see 0)
  13340. <=WM: (13938: O1986 ^name predict-no)
  13341. <=WM: (13937: O1985 ^name predict-yes)
  13342. <=WM: (13936: R996 ^value 1)
  13343. --- Inner Elaboration Phase, active level 1 (S1) ---
  13344. Firing prefer*rvt*predict-yes*H0
  13345. -->
  13346. Firing rl*prefer*rvt*predict-yes*H0*1
  13347. -->
  13348. (S1 ^operator O1987 = 0.)
  13349. Firing prefer*rvt*predict-no*H0
  13350. -->
  13351. Firing rl*prefer*rvt*predict-no*H0*2
  13352. -->
  13353. (S1 ^operator O1988 = 1.)
  13354. inner elaboration loop at bottom goal.
  13355. Retracting rl*prefer*rvt*predict-no*H0*2
  13356. -->
  13357. (S1 ^operator O1986 = 1.)
  13358. Retracting rl*prefer*rvt*predict-yes*H0*1
  13359. -->
  13360. (S1 ^operator O1985 = 0.)
  13361. --- END Proposal Phase ---
  13362. --- Decision Phase ---
  13363. RL update rl*prefer*rvt*predict-yes*H0*3 0.472315 -0.0815474 0.390768 -> 0.472324 -0.0815458 0.390778(R,m,v=1,0.94375,0.0534198)
  13364. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527575 0.0815283 0.609103 -> 0.527585 0.0815301 0.609115(R,m,v=1,1,0)
  13365. =>WM: (13957: S1 ^operator O1988)
  13366. 994: O: O1988 (predict-no)
  13367. --- END Decision Phase ---
  13368. --- Application Phase ---
  13369. --- Firing Productions (PE) For State At Depth 1 ---
  13370. --- Inner Elaboration Phase, active level 1 (S1) ---
  13371. Firing apply*operator
  13372. -->
  13373. (I3 ^predict-no N994 + :O )
  13374. Firing apply*operator*complete
  13375. -->
  13376. (I3 ^predict-yes N993 - :O )
  13377. inner elaboration loop at bottom goal.
  13378. --- Change Working Memory (PE) ---
  13379. =>WM: (13958: I3 ^predict-no N994)
  13380. <=WM: (13944: N993 ^status complete)
  13381. <=WM: (13943: I3 ^predict-yes N993)
  13382. --- Firing Productions (IE) For State At Depth 1 ---
  13383. --- Inner Elaboration Phase, active level 1 (S1) ---
  13384. Firing monitor*world
  13385. -->
  13386. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13387. --- Change Working Memory (IE) ---
  13388. --- END Application Phase ---
  13389. --- Output Phase ---
  13390. ENV: Agent did: predict-no for direction U in state State-A
  13391. In State-A moving U
  13392. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13393. predict error 0
  13394. dir: dir isL
  13395. --- END Output Phase ---
  13396. --- Input Phase ---
  13397. =>WM: (13962: I2 ^dir L)
  13398. =>WM: (13961: I2 ^reward 1)
  13399. =>WM: (13960: I2 ^see 0)
  13400. =>WM: (13959: N994 ^status complete)
  13401. <=WM: (13947: I2 ^dir U)
  13402. <=WM: (13946: I2 ^reward 1)
  13403. <=WM: (13945: I2 ^see 1)
  13404. =>WM: (13963: I2 ^level-1 L1-root)
  13405. <=WM: (13948: I2 ^level-1 L1-root)
  13406. --- END Input Phase ---
  13407. --- Proposal Phase ---
  13408. --- Inner Elaboration Phase, active level 1 (S1) ---
  13409. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  13410. -->
  13411. (S1 ^operator O1987 = -0.2062723012911647)
  13412. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  13413. -->
  13414. (S1 ^operator O1988 = 0.685533297663165)
  13415. Firing prefer*rvt*predict-no*H0*4*H1
  13416. -->
  13417. Firing prefer*rvt*predict-yes*H0*3*H1
  13418. -->
  13419. Firing elaborate*copy-see-to-output-link
  13420. -->
  13421. (I3 ^see 0 +)
  13422. Firing elaborate*reward*based*on*reward
  13423. -->
  13424. (R998 ^value 1 +)
  13425. (R1 ^reward R998 +)
  13426. Firing propose*predict-yes
  13427. -->
  13428. (O1989 ^name predict-yes +)
  13429. (S1 ^operator O1989 +)
  13430. Firing propose*predict-no
  13431. -->
  13432. (O1990 ^name predict-no +)
  13433. (S1 ^operator O1990 +)
  13434. Firing rl*prefer*rvt*predict-no*H0*4
  13435. -->
  13436. (S1 ^operator O1988 = 0.3144988611901438)
  13437. Firing rl*prefer*rvt*predict-yes*H0*3
  13438. -->
  13439. (S1 ^operator O1987 = 0.3907782094907327)
  13440. Firing prefer*rvt*predict-yes*H0
  13441. -->
  13442. Firing prefer*rvt*predict-no*H0
  13443. -->
  13444. Firing elaborate*copy-dir-to-output-link
  13445. -->
  13446. (I3 ^dir L +)
  13447. inner elaboration loop at bottom goal.
  13448. Retracting elaborate*copy-see-to-output-link
  13449. -->
  13450. (I3 ^see 1 +)
  13451. Retracting propose*predict-no
  13452. -->
  13453. (O1988 ^name predict-no +)
  13454. (S1 ^operator O1988 +)
  13455. Retracting propose*predict-yes
  13456. -->
  13457. (O1987 ^name predict-yes +)
  13458. (S1 ^operator O1987 +)
  13459. Retracting elaborate*reward*based*on*reward
  13460. -->
  13461. (R997 ^value 1 +)
  13462. (R1 ^reward R997 +)
  13463. Retracting elaborate*copy-dir-to-output-link
  13464. -->
  13465. (I3 ^dir U +)
  13466. Retracting rl*prefer*rvt*predict-no*H0*2
  13467. -->
  13468. (S1 ^operator O1988 = 1.)
  13469. Retracting rl*prefer*rvt*predict-yes*H0*1
  13470. -->
  13471. (S1 ^operator O1987 = 0.)
  13472. =>WM: (13971: S1 ^operator O1990 +)
  13473. =>WM: (13970: S1 ^operator O1989 +)
  13474. =>WM: (13969: I3 ^dir L)
  13475. =>WM: (13968: O1990 ^name predict-no)
  13476. =>WM: (13967: O1989 ^name predict-yes)
  13477. =>WM: (13966: R998 ^value 1)
  13478. =>WM: (13965: R1 ^reward R998)
  13479. =>WM: (13964: I3 ^see 0)
  13480. <=WM: (13955: S1 ^operator O1987 +)
  13481. <=WM: (13956: S1 ^operator O1988 +)
  13482. <=WM: (13957: S1 ^operator O1988)
  13483. <=WM: (13954: I3 ^dir U)
  13484. <=WM: (13950: R1 ^reward R997)
  13485. <=WM: (13949: I3 ^see 1)
  13486. <=WM: (13953: O1988 ^name predict-no)
  13487. <=WM: (13952: O1987 ^name predict-yes)
  13488. <=WM: (13951: R997 ^value 1)
  13489. --- Inner Elaboration Phase, active level 1 (S1) ---
  13490. Firing prefer*rvt*predict-yes*H0
  13491. -->
  13492. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  13493. -->
  13494. (S1 ^operator O1989 = -0.2062723012911647)
  13495. Firing rl*prefer*rvt*predict-yes*H0*3
  13496. -->
  13497. (S1 ^operator O1989 = 0.3907782094907327)
  13498. Firing prefer*rvt*predict-yes*H0*3*H1
  13499. -->
  13500. Firing prefer*rvt*predict-no*H0
  13501. -->
  13502. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  13503. -->
  13504. (S1 ^operator O1990 = 0.685533297663165)
  13505. Firing rl*prefer*rvt*predict-no*H0*4
  13506. -->
  13507. (S1 ^operator O1990 = 0.3144988611901438)
  13508. Firing prefer*rvt*predict-no*H0*4*H1
  13509. -->
  13510. inner elaboration loop at bottom goal.
  13511. Retracting rl*prefer*rvt*predict-no*H0*4
  13512. -->
  13513. (S1 ^operator O1988 = 0.3144988611901438)
  13514. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  13515. -->
  13516. (S1 ^operator O1988 = 0.685533297663165)
  13517. Retracting rl*prefer*rvt*predict-yes*H0*3
  13518. -->
  13519. (S1 ^operator O1987 = 0.3907782094907327)
  13520. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  13521. -->
  13522. (S1 ^operator O1987 = -0.2062723012911647)
  13523. --- END Proposal Phase ---
  13524. --- Decision Phase ---
  13525. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  13526. =>WM: (13972: S1 ^operator O1990)
  13527. 995: O: O1990 (predict-no)
  13528. --- END Decision Phase ---
  13529. --- Application Phase ---
  13530. --- Firing Productions (PE) For State At Depth 1 ---
  13531. --- Inner Elaboration Phase, active level 1 (S1) ---
  13532. Firing apply*operator
  13533. -->
  13534. (I3 ^predict-no N995 + :O )
  13535. Firing apply*operator*complete
  13536. -->
  13537. (I3 ^predict-no N994 - :O )
  13538. inner elaboration loop at bottom goal.
  13539. --- Change Working Memory (PE) ---
  13540. =>WM: (13973: I3 ^predict-no N995)
  13541. <=WM: (13959: N994 ^status complete)
  13542. <=WM: (13958: I3 ^predict-no N994)
  13543. --- Firing Productions (IE) For State At Depth 1 ---
  13544. --- Inner Elaboration Phase, active level 1 (S1) ---
  13545. Firing monitor*world
  13546. -->
  13547. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13548. --- Change Working Memory (IE) ---
  13549. --- END Application Phase ---
  13550. --- Output Phase ---
  13551. ENV: Agent did: predict-no for direction L in state State-A
  13552. In State-A moving L
  13553. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13554. predict error 0
  13555. dir: dir isL
  13556. --- END Output Phase ---
  13557. --- Input Phase ---
  13558. =>WM: (13977: I2 ^dir L)
  13559. =>WM: (13976: I2 ^reward 1)
  13560. =>WM: (13975: I2 ^see 0)
  13561. =>WM: (13974: N995 ^status complete)
  13562. <=WM: (13962: I2 ^dir L)
  13563. <=WM: (13961: I2 ^reward 1)
  13564. <=WM: (13960: I2 ^see 0)
  13565. =>WM: (13978: I2 ^level-1 L0-root)
  13566. <=WM: (13963: I2 ^level-1 L1-root)
  13567. --- END Input Phase ---
  13568. --- Proposal Phase ---
  13569. --- Inner Elaboration Phase, active level 1 (S1) ---
  13570. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13571. -->
  13572. (S1 ^operator O1989 = -0.208713043145708)
  13573. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13574. -->
  13575. (S1 ^operator O1990 = 0.6854257503571404)
  13576. Firing prefer*rvt*predict-no*H0*4*H1
  13577. -->
  13578. Firing prefer*rvt*predict-yes*H0*3*H1
  13579. -->
  13580. Firing elaborate*copy-see-to-output-link
  13581. -->
  13582. (I3 ^see 0 +)
  13583. Firing elaborate*reward*based*on*reward
  13584. -->
  13585. (R999 ^value 1 +)
  13586. (R1 ^reward R999 +)
  13587. Firing propose*predict-yes
  13588. -->
  13589. (O1991 ^name predict-yes +)
  13590. (S1 ^operator O1991 +)
  13591. Firing propose*predict-no
  13592. -->
  13593. (O1992 ^name predict-no +)
  13594. (S1 ^operator O1992 +)
  13595. Firing rl*prefer*rvt*predict-no*H0*4
  13596. -->
  13597. (S1 ^operator O1990 = 0.3144988611901438)
  13598. Firing rl*prefer*rvt*predict-yes*H0*3
  13599. -->
  13600. (S1 ^operator O1989 = 0.3907782094907327)
  13601. Firing prefer*rvt*predict-yes*H0
  13602. -->
  13603. Firing prefer*rvt*predict-no*H0
  13604. -->
  13605. Firing elaborate*copy-dir-to-output-link
  13606. -->
  13607. (I3 ^dir L +)
  13608. inner elaboration loop at bottom goal.
  13609. Retracting elaborate*copy-see-to-output-link
  13610. -->
  13611. (I3 ^see 0 +)
  13612. Retracting propose*predict-no
  13613. -->
  13614. (O1990 ^name predict-no +)
  13615. (S1 ^operator O1990 +)
  13616. Retracting propose*predict-yes
  13617. -->
  13618. (O1989 ^name predict-yes +)
  13619. (S1 ^operator O1989 +)
  13620. Retracting elaborate*reward*based*on*reward
  13621. -->
  13622. (R998 ^value 1 +)
  13623. (R1 ^reward R998 +)
  13624. Retracting elaborate*copy-dir-to-output-link
  13625. -->
  13626. (I3 ^dir L +)
  13627. Retracting rl*prefer*rvt*predict-no*H0*4
  13628. -->
  13629. (S1 ^operator O1990 = 0.3144988611901438)
  13630. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  13631. -->
  13632. (S1 ^operator O1990 = 0.685533297663165)
  13633. Retracting rl*prefer*rvt*predict-yes*H0*3
  13634. -->
  13635. (S1 ^operator O1989 = 0.3907782094907327)
  13636. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  13637. -->
  13638. (S1 ^operator O1989 = -0.2062723012911647)
  13639. =>WM: (13984: S1 ^operator O1992 +)
  13640. =>WM: (13983: S1 ^operator O1991 +)
  13641. =>WM: (13982: O1992 ^name predict-no)
  13642. =>WM: (13981: O1991 ^name predict-yes)
  13643. =>WM: (13980: R999 ^value 1)
  13644. =>WM: (13979: R1 ^reward R999)
  13645. <=WM: (13970: S1 ^operator O1989 +)
  13646. <=WM: (13971: S1 ^operator O1990 +)
  13647. <=WM: (13972: S1 ^operator O1990)
  13648. <=WM: (13965: R1 ^reward R998)
  13649. <=WM: (13968: O1990 ^name predict-no)
  13650. <=WM: (13967: O1989 ^name predict-yes)
  13651. <=WM: (13966: R998 ^value 1)
  13652. --- Inner Elaboration Phase, active level 1 (S1) ---
  13653. Firing prefer*rvt*predict-yes*H0
  13654. -->
  13655. Firing rl*prefer*rvt*predict-yes*H0*3
  13656. -->
  13657. (S1 ^operator O1991 = 0.3907782094907327)
  13658. Firing prefer*rvt*predict-yes*H0*3*H1
  13659. -->
  13660. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13661. -->
  13662. (S1 ^operator O1991 = -0.208713043145708)
  13663. Firing prefer*rvt*predict-no*H0
  13664. -->
  13665. Firing rl*prefer*rvt*predict-no*H0*4
  13666. -->
  13667. (S1 ^operator O1992 = 0.3144988611901438)
  13668. Firing prefer*rvt*predict-no*H0*4*H1
  13669. -->
  13670. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13671. -->
  13672. (S1 ^operator O1992 = 0.6854257503571404)
  13673. inner elaboration loop at bottom goal.
  13674. Retracting rl*prefer*rvt*predict-no*H0*4
  13675. -->
  13676. (S1 ^operator O1990 = 0.3144988611901438)
  13677. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13678. -->
  13679. (S1 ^operator O1990 = 0.6854257503571404)
  13680. Retracting rl*prefer*rvt*predict-yes*H0*3
  13681. -->
  13682. (S1 ^operator O1989 = 0.3907782094907327)
  13683. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13684. -->
  13685. (S1 ^operator O1989 = -0.208713043145708)
  13686. --- END Proposal Phase ---
  13687. --- Decision Phase ---
  13688. RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478545 -0.164049 0.314496(R,m,v=1,0.922581,0.0718894)
  13689. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521482 0.164052 0.685533 -> 0.521479 0.164051 0.68553(R,m,v=1,1,0)
  13690. =>WM: (13985: S1 ^operator O1992)
  13691. 996: O: O1992 (predict-no)
  13692. --- END Decision Phase ---
  13693. --- Application Phase ---
  13694. --- Firing Productions (PE) For State At Depth 1 ---
  13695. --- Inner Elaboration Phase, active level 1 (S1) ---
  13696. Firing apply*operator
  13697. -->
  13698. (I3 ^predict-no N996 + :O )
  13699. Firing apply*operator*complete
  13700. -->
  13701. (I3 ^predict-no N995 - :O )
  13702. inner elaboration loop at bottom goal.
  13703. --- Change Working Memory (PE) ---
  13704. =>WM: (13986: I3 ^predict-no N996)
  13705. <=WM: (13974: N995 ^status complete)
  13706. <=WM: (13973: I3 ^predict-no N995)
  13707. --- Firing Productions (IE) For State At Depth 1 ---
  13708. --- Inner Elaboration Phase, active level 1 (S1) ---
  13709. Firing monitor*world
  13710. -->
  13711. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13712. --- Change Working Memory (IE) ---
  13713. --- END Application Phase ---
  13714. --- Output Phase ---
  13715. ENV: Agent did: predict-no for direction L in state State-A
  13716. In State-A moving L
  13717. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13718. predict error 0
  13719. dir: dir isL
  13720. --- END Output Phase ---
  13721. /|\--- Input Phase ---
  13722. =>WM: (13990: I2 ^dir L)
  13723. =>WM: (13989: I2 ^reward 1)
  13724. =>WM: (13988: I2 ^see 0)
  13725. =>WM: (13987: N996 ^status complete)
  13726. <=WM: (13977: I2 ^dir L)
  13727. <=WM: (13976: I2 ^reward 1)
  13728. <=WM: (13975: I2 ^see 0)
  13729. =>WM: (13991: I2 ^level-1 L0-root)
  13730. <=WM: (13978: I2 ^level-1 L0-root)
  13731. --- END Input Phase ---
  13732. --- Proposal Phase ---
  13733. --- Inner Elaboration Phase, active level 1 (S1) ---
  13734. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13735. -->
  13736. (S1 ^operator O1991 = -0.208713043145708)
  13737. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13738. -->
  13739. (S1 ^operator O1992 = 0.6854257503571404)
  13740. Firing prefer*rvt*predict-no*H0*4*H1
  13741. -->
  13742. Firing prefer*rvt*predict-yes*H0*3*H1
  13743. -->
  13744. Firing elaborate*copy-see-to-output-link
  13745. -->
  13746. (I3 ^see 0 +)
  13747. Firing elaborate*reward*based*on*reward
  13748. -->
  13749. (R1000 ^value 1 +)
  13750. (R1 ^reward R1000 +)
  13751. Firing propose*predict-yes
  13752. -->
  13753. (O1993 ^name predict-yes +)
  13754. (S1 ^operator O1993 +)
  13755. Firing propose*predict-no
  13756. -->
  13757. (O1994 ^name predict-no +)
  13758. (S1 ^operator O1994 +)
  13759. Firing rl*prefer*rvt*predict-no*H0*4
  13760. -->
  13761. (S1 ^operator O1992 = 0.3144962005421928)
  13762. Firing rl*prefer*rvt*predict-yes*H0*3
  13763. -->
  13764. (S1 ^operator O1991 = 0.3907782094907327)
  13765. Firing prefer*rvt*predict-yes*H0
  13766. -->
  13767. Firing prefer*rvt*predict-no*H0
  13768. -->
  13769. Firing elaborate*copy-dir-to-output-link
  13770. -->
  13771. (I3 ^dir L +)
  13772. inner elaboration loop at bottom goal.
  13773. Retracting elaborate*copy-see-to-output-link
  13774. -->
  13775. (I3 ^see 0 +)
  13776. Retracting propose*predict-no
  13777. -->
  13778. (O1992 ^name predict-no +)
  13779. (S1 ^operator O1992 +)
  13780. Retracting propose*predict-yes
  13781. -->
  13782. (O1991 ^name predict-yes +)
  13783. (S1 ^operator O1991 +)
  13784. Retracting elaborate*reward*based*on*reward
  13785. -->
  13786. (R999 ^value 1 +)
  13787. (R1 ^reward R999 +)
  13788. Retracting elaborate*copy-dir-to-output-link
  13789. -->
  13790. (I3 ^dir L +)
  13791. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13792. -->
  13793. (S1 ^operator O1992 = 0.6854257503571404)
  13794. Retracting rl*prefer*rvt*predict-no*H0*4
  13795. -->
  13796. (S1 ^operator O1992 = 0.3144962005421928)
  13797. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13798. -->
  13799. (S1 ^operator O1991 = -0.208713043145708)
  13800. Retracting rl*prefer*rvt*predict-yes*H0*3
  13801. -->
  13802. (S1 ^operator O1991 = 0.3907782094907327)
  13803. =>WM: (13997: S1 ^operator O1994 +)
  13804. =>WM: (13996: S1 ^operator O1993 +)
  13805. =>WM: (13995: O1994 ^name predict-no)
  13806. =>WM: (13994: O1993 ^name predict-yes)
  13807. =>WM: (13993: R1000 ^value 1)
  13808. =>WM: (13992: R1 ^reward R1000)
  13809. <=WM: (13983: S1 ^operator O1991 +)
  13810. <=WM: (13984: S1 ^operator O1992 +)
  13811. <=WM: (13985: S1 ^operator O1992)
  13812. <=WM: (13979: R1 ^reward R999)
  13813. <=WM: (13982: O1992 ^name predict-no)
  13814. <=WM: (13981: O1991 ^name predict-yes)
  13815. <=WM: (13980: R999 ^value 1)
  13816. --- Inner Elaboration Phase, active level 1 (S1) ---
  13817. Firing prefer*rvt*predict-yes*H0
  13818. -->
  13819. Firing rl*prefer*rvt*predict-yes*H0*3
  13820. -->
  13821. (S1 ^operator O1993 = 0.3907782094907327)
  13822. Firing prefer*rvt*predict-yes*H0*3*H1
  13823. -->
  13824. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  13825. -->
  13826. (S1 ^operator O1993 = -0.208713043145708)
  13827. Firing prefer*rvt*predict-no*H0
  13828. -->
  13829. Firing rl*prefer*rvt*predict-no*H0*4
  13830. -->
  13831. (S1 ^operator O1994 = 0.3144962005421928)
  13832. Firing prefer*rvt*predict-no*H0*4*H1
  13833. -->
  13834. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  13835. -->
  13836. (S1 ^operator O1994 = 0.6854257503571404)
  13837. inner elaboration loop at bottom goal.
  13838. Retracting rl*prefer*rvt*predict-no*H0*4
  13839. -->
  13840. (S1 ^operator O1992 = 0.3144962005421928)
  13841. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13842. -->
  13843. (S1 ^operator O1992 = 0.6854257503571404)
  13844. Retracting rl*prefer*rvt*predict-yes*H0*3
  13845. -->
  13846. (S1 ^operator O1991 = 0.3907782094907327)
  13847. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13848. -->
  13849. (S1 ^operator O1991 = -0.208713043145708)
  13850. --- END Proposal Phase ---
  13851. --- Decision Phase ---
  13852. RL update rl*prefer*rvt*predict-no*H0*4 0.478545 -0.164049 0.314496 -> 0.478551 -0.164048 0.314503(R,m,v=1,0.923077,0.071464)
  13853. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521384 0.164042 0.685426 -> 0.521391 0.164042 0.685433(R,m,v=1,1,0)
  13854. =>WM: (13998: S1 ^operator O1994)
  13855. 997: O: O1994 (predict-no)
  13856. --- END Decision Phase ---
  13857. --- Application Phase ---
  13858. --- Firing Productions (PE) For State At Depth 1 ---
  13859. --- Inner Elaboration Phase, active level 1 (S1) ---
  13860. Firing apply*operator
  13861. -->
  13862. (I3 ^predict-no N997 + :O )
  13863. Firing apply*operator*complete
  13864. -->
  13865. (I3 ^predict-no N996 - :O )
  13866. inner elaboration loop at bottom goal.
  13867. --- Change Working Memory (PE) ---
  13868. =>WM: (13999: I3 ^predict-no N997)
  13869. <=WM: (13987: N996 ^status complete)
  13870. <=WM: (13986: I3 ^predict-no N996)
  13871. --- Firing Productions (IE) For State At Depth 1 ---
  13872. --- Inner Elaboration Phase, active level 1 (S1) ---
  13873. Firing monitor*world
  13874. -->
  13875. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  13876. --- Change Working Memory (IE) ---
  13877. --- END Application Phase ---
  13878. --- Output Phase ---
  13879. ENV: Agent did: predict-no for direction L in state State-A
  13880. In State-A moving L
  13881. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  13882. predict error 0
  13883. dir: dir isU
  13884. --- END Output Phase ---
  13885. -/--- Input Phase ---
  13886. =>WM: (14003: I2 ^dir U)
  13887. =>WM: (14002: I2 ^reward 1)
  13888. =>WM: (14001: I2 ^see 0)
  13889. =>WM: (14000: N997 ^status complete)
  13890. <=WM: (13990: I2 ^dir L)
  13891. <=WM: (13989: I2 ^reward 1)
  13892. <=WM: (13988: I2 ^see 0)
  13893. =>WM: (14004: I2 ^level-1 L0-root)
  13894. <=WM: (13991: I2 ^level-1 L0-root)
  13895. --- END Input Phase ---
  13896. --- Proposal Phase ---
  13897. --- Inner Elaboration Phase, active level 1 (S1) ---
  13898. Firing elaborate*copy-see-to-output-link
  13899. -->
  13900. (I3 ^see 0 +)
  13901. Firing elaborate*reward*based*on*reward
  13902. -->
  13903. (R1001 ^value 1 +)
  13904. (R1 ^reward R1001 +)
  13905. Firing propose*predict-yes
  13906. -->
  13907. (O1995 ^name predict-yes +)
  13908. (S1 ^operator O1995 +)
  13909. Firing propose*predict-no
  13910. -->
  13911. (O1996 ^name predict-no +)
  13912. (S1 ^operator O1996 +)
  13913. Firing rl*prefer*rvt*predict-no*H0*2
  13914. -->
  13915. (S1 ^operator O1994 = 1.)
  13916. Firing rl*prefer*rvt*predict-yes*H0*1
  13917. -->
  13918. (S1 ^operator O1993 = 0.)
  13919. Firing prefer*rvt*predict-yes*H0
  13920. -->
  13921. Firing prefer*rvt*predict-no*H0
  13922. -->
  13923. Firing elaborate*copy-dir-to-output-link
  13924. -->
  13925. (I3 ^dir U +)
  13926. inner elaboration loop at bottom goal.
  13927. Retracting elaborate*copy-see-to-output-link
  13928. -->
  13929. (I3 ^see 0 +)
  13930. Retracting propose*predict-no
  13931. -->
  13932. (O1994 ^name predict-no +)
  13933. (S1 ^operator O1994 +)
  13934. Retracting propose*predict-yes
  13935. -->
  13936. (O1993 ^name predict-yes +)
  13937. (S1 ^operator O1993 +)
  13938. Retracting elaborate*reward*based*on*reward
  13939. -->
  13940. (R1000 ^value 1 +)
  13941. (R1 ^reward R1000 +)
  13942. Retracting elaborate*copy-dir-to-output-link
  13943. -->
  13944. (I3 ^dir L +)
  13945. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  13946. -->
  13947. (S1 ^operator O1994 = 0.6854332700385593)
  13948. Retracting rl*prefer*rvt*predict-no*H0*4
  13949. -->
  13950. (S1 ^operator O1994 = 0.3145026510346156)
  13951. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  13952. -->
  13953. (S1 ^operator O1993 = -0.208713043145708)
  13954. Retracting rl*prefer*rvt*predict-yes*H0*3
  13955. -->
  13956. (S1 ^operator O1993 = 0.3907782094907327)
  13957. =>WM: (14011: S1 ^operator O1996 +)
  13958. =>WM: (14010: S1 ^operator O1995 +)
  13959. =>WM: (14009: I3 ^dir U)
  13960. =>WM: (14008: O1996 ^name predict-no)
  13961. =>WM: (14007: O1995 ^name predict-yes)
  13962. =>WM: (14006: R1001 ^value 1)
  13963. =>WM: (14005: R1 ^reward R1001)
  13964. <=WM: (13996: S1 ^operator O1993 +)
  13965. <=WM: (13997: S1 ^operator O1994 +)
  13966. <=WM: (13998: S1 ^operator O1994)
  13967. <=WM: (13969: I3 ^dir L)
  13968. <=WM: (13992: R1 ^reward R1000)
  13969. <=WM: (13995: O1994 ^name predict-no)
  13970. <=WM: (13994: O1993 ^name predict-yes)
  13971. <=WM: (13993: R1000 ^value 1)
  13972. --- Inner Elaboration Phase, active level 1 (S1) ---
  13973. Firing prefer*rvt*predict-yes*H0
  13974. -->
  13975. Firing rl*prefer*rvt*predict-yes*H0*1
  13976. -->
  13977. (S1 ^operator O1995 = 0.)
  13978. Firing prefer*rvt*predict-no*H0
  13979. -->
  13980. Firing rl*prefer*rvt*predict-no*H0*2
  13981. -->
  13982. (S1 ^operator O1996 = 1.)
  13983. inner elaboration loop at bottom goal.
  13984. Retracting rl*prefer*rvt*predict-no*H0*2
  13985. -->
  13986. (S1 ^operator O1994 = 1.)
  13987. Retracting rl*prefer*rvt*predict-yes*H0*1
  13988. -->
  13989. (S1 ^operator O1993 = 0.)
  13990. --- END Proposal Phase ---
  13991. --- Decision Phase ---
  13992. RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314503 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.923567,0.0710436)
  13993. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521391 0.164042 0.685433 -> 0.521396 0.164043 0.685439(R,m,v=1,1,0)
  13994. =>WM: (14012: S1 ^operator O1996)
  13995. 998: O: O1996 (predict-no)
  13996. --- END Decision Phase ---
  13997. --- Application Phase ---
  13998. --- Firing Productions (PE) For State At Depth 1 ---
  13999. --- Inner Elaboration Phase, active level 1 (S1) ---
  14000. Firing apply*operator
  14001. -->
  14002. (I3 ^predict-no N998 + :O )
  14003. Firing apply*operator*complete
  14004. -->
  14005. (I3 ^predict-no N997 - :O )
  14006. inner elaboration loop at bottom goal.
  14007. --- Change Working Memory (PE) ---
  14008. =>WM: (14013: I3 ^predict-no N998)
  14009. <=WM: (14000: N997 ^status complete)
  14010. <=WM: (13999: I3 ^predict-no N997)
  14011. --- Firing Productions (IE) For State At Depth 1 ---
  14012. --- Inner Elaboration Phase, active level 1 (S1) ---
  14013. Firing monitor*world
  14014. -->
  14015. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14016. --- Change Working Memory (IE) ---
  14017. --- END Application Phase ---
  14018. --- Output Phase ---
  14019. ENV: Agent did: predict-no for direction U in state State-A
  14020. In State-A moving U
  14021. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  14022. predict error 0
  14023. dir: dir isR
  14024. --- END Output Phase ---
  14025. |\--- Input Phase ---
  14026. =>WM: (14017: I2 ^dir R)
  14027. =>WM: (14016: I2 ^reward 1)
  14028. =>WM: (14015: I2 ^see 0)
  14029. =>WM: (14014: N998 ^status complete)
  14030. <=WM: (14003: I2 ^dir U)
  14031. <=WM: (14002: I2 ^reward 1)
  14032. <=WM: (14001: I2 ^see 0)
  14033. =>WM: (14018: I2 ^level-1 L0-root)
  14034. <=WM: (14004: I2 ^level-1 L0-root)
  14035. --- END Input Phase ---
  14036. --- Proposal Phase ---
  14037. --- Inner Elaboration Phase, active level 1 (S1) ---
  14038. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  14039. -->
  14040. (S1 ^operator O1995 = 0.8783951706845293)
  14041. Firing prefer*rvt*predict-yes*H0*5*H1
  14042. -->
  14043. Firing elaborate*copy-see-to-output-link
  14044. -->
  14045. (I3 ^see 0 +)
  14046. Firing elaborate*reward*based*on*reward
  14047. -->
  14048. (R1002 ^value 1 +)
  14049. (R1 ^reward R1002 +)
  14050. Firing propose*predict-yes
  14051. -->
  14052. (O1997 ^name predict-yes +)
  14053. (S1 ^operator O1997 +)
  14054. Firing propose*predict-no
  14055. -->
  14056. (O1998 ^name predict-no +)
  14057. (S1 ^operator O1998 +)
  14058. Firing rl*prefer*rvt*predict-no*H0*6
  14059. -->
  14060. (S1 ^operator O1996 = 0.9999888743986174)
  14061. Firing rl*prefer*rvt*predict-yes*H0*5
  14062. -->
  14063. (S1 ^operator O1995 = 0.1215989443698621)
  14064. Firing prefer*rvt*predict-yes*H0
  14065. -->
  14066. Firing prefer*rvt*predict-no*H0
  14067. -->
  14068. Firing elaborate*copy-dir-to-output-link
  14069. -->
  14070. (I3 ^dir R +)
  14071. inner elaboration loop at bottom goal.
  14072. Retracting elaborate*copy-see-to-output-link
  14073. -->
  14074. (I3 ^see 0 +)
  14075. Retracting propose*predict-no
  14076. -->
  14077. (O1996 ^name predict-no +)
  14078. (S1 ^operator O1996 +)
  14079. Retracting propose*predict-yes
  14080. -->
  14081. (O1995 ^name predict-yes +)
  14082. (S1 ^operator O1995 +)
  14083. Retracting elaborate*reward*based*on*reward
  14084. -->
  14085. (R1001 ^value 1 +)
  14086. (R1 ^reward R1001 +)
  14087. Retracting elaborate*copy-dir-to-output-link
  14088. -->
  14089. (I3 ^dir U +)
  14090. Retracting rl*prefer*rvt*predict-no*H0*2
  14091. -->
  14092. (S1 ^operator O1996 = 1.)
  14093. Retracting rl*prefer*rvt*predict-yes*H0*1
  14094. -->
  14095. (S1 ^operator O1995 = 0.)
  14096. =>WM: (14025: S1 ^operator O1998 +)
  14097. =>WM: (14024: S1 ^operator O1997 +)
  14098. =>WM: (14023: I3 ^dir R)
  14099. =>WM: (14022: O1998 ^name predict-no)
  14100. =>WM: (14021: O1997 ^name predict-yes)
  14101. =>WM: (14020: R1002 ^value 1)
  14102. =>WM: (14019: R1 ^reward R1002)
  14103. <=WM: (14010: S1 ^operator O1995 +)
  14104. <=WM: (14011: S1 ^operator O1996 +)
  14105. <=WM: (14012: S1 ^operator O1996)
  14106. <=WM: (14009: I3 ^dir U)
  14107. <=WM: (14005: R1 ^reward R1001)
  14108. <=WM: (14008: O1996 ^name predict-no)
  14109. <=WM: (14007: O1995 ^name predict-yes)
  14110. <=WM: (14006: R1001 ^value 1)
  14111. --- Inner Elaboration Phase, active level 1 (S1) ---
  14112. Firing prefer*rvt*predict-yes*H0
  14113. -->
  14114. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  14115. -->
  14116. (S1 ^operator O1997 = 0.8783951706845293)
  14117. Firing rl*prefer*rvt*predict-yes*H0*5
  14118. -->
  14119. (S1 ^operator O1997 = 0.1215989443698621)
  14120. Firing prefer*rvt*predict-yes*H0*5*H1
  14121. -->
  14122. Firing prefer*rvt*predict-no*H0
  14123. -->
  14124. Firing rl*prefer*rvt*predict-no*H0*6
  14125. -->
  14126. (S1 ^operator O1998 = 0.9999888743986174)
  14127. inner elaboration loop at bottom goal.
  14128. Retracting rl*prefer*rvt*predict-no*H0*6
  14129. -->
  14130. (S1 ^operator O1996 = 0.9999888743986174)
  14131. Retracting rl*prefer*rvt*predict-yes*H0*5
  14132. -->
  14133. (S1 ^operator O1995 = 0.1215989443698621)
  14134. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  14135. -->
  14136. (S1 ^operator O1995 = 0.8783951706845293)
  14137. --- END Proposal Phase ---
  14138. --- Decision Phase ---
  14139. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14140. =>WM: (14026: S1 ^operator O1997)
  14141. 999: O: O1997 (predict-yes)
  14142. --- END Decision Phase ---
  14143. --- Application Phase ---
  14144. --- Firing Productions (PE) For State At Depth 1 ---
  14145. --- Inner Elaboration Phase, active level 1 (S1) ---
  14146. Firing apply*operator
  14147. -->
  14148. (I3 ^predict-yes N999 + :O )
  14149. Firing apply*operator*complete
  14150. -->
  14151. (I3 ^predict-no N998 - :O )
  14152. inner elaboration loop at bottom goal.
  14153. --- Change Working Memory (PE) ---
  14154. =>WM: (14027: I3 ^predict-yes N999)
  14155. <=WM: (14014: N998 ^status complete)
  14156. <=WM: (14013: I3 ^predict-no N998)
  14157. --- Firing Productions (IE) For State At Depth 1 ---
  14158. --- Inner Elaboration Phase, active level 1 (S1) ---
  14159. Firing monitor*world
  14160. -->
  14161. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  14162. --- Change Working Memory (IE) ---
  14163. --- END Application Phase ---
  14164. --- Output Phase ---
  14165. ENV: Agent did: predict-yes for direction R in state State-A
  14166. In State-A moving R
  14167. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  14168. predict error 0
  14169. dir: dir isU
  14170. --- END Output Phase ---
  14171. -/|--- Input Phase ---
  14172. =>WM: (14031: I2 ^dir U)
  14173. =>WM: (14030: I2 ^reward 1)
  14174. =>WM: (14029: I2 ^see 1)
  14175. =>WM: (14028: N999 ^status complete)
  14176. <=WM: (14017: I2 ^dir R)
  14177. <=WM: (14016: I2 ^reward 1)
  14178. <=WM: (14015: I2 ^see 0)
  14179. =>WM: (14032: I2 ^level-1 R1-root)
  14180. <=WM: (14018: I2 ^level-1 L0-root)
  14181. --- END Input Phase ---
  14182. --- Proposal Phase ---
  14183. --- Inner Elaboration Phase, active level 1 (S1) ---
  14184. Firing elaborate*copy-see-to-output-link
  14185. -->
  14186. (I3 ^see 1 +)
  14187. Firing elaborate*reward*based*on*reward
  14188. -->
  14189. (R1003 ^value 1 +)
  14190. (R1 ^reward R1003 +)
  14191. Firing propose*predict-yes
  14192. -->
  14193. (O1999 ^name predict-yes +)
  14194. (S1 ^operator O1999 +)
  14195. Firing propose*predict-no
  14196. -->
  14197. (O2000 ^name predict-no +)
  14198. (S1 ^operator O2000 +)
  14199. Firing rl*prefer*rvt*predict-no*H0*2
  14200. -->
  14201. (S1 ^operator O1998 = 1.)
  14202. Firing rl*prefer*rvt*predict-yes*H0*1
  14203. -->
  14204. (S1 ^operator O1997 = 0.)
  14205. Firing prefer*rvt*predict-yes*H0
  14206. -->
  14207. Firing prefer*rvt*predict-no*H0
  14208. -->
  14209. Firing elaborate*copy-dir-to-output-link
  14210. -->
  14211. (I3 ^dir U +)
  14212. inner elaboration loop at bottom goal.
  14213. Retracting elaborate*copy-see-to-output-link
  14214. -->
  14215. (I3 ^see 0 +)
  14216. Retracting propose*predict-no
  14217. -->
  14218. (O1998 ^name predict-no +)
  14219. (S1 ^operator O1998 +)
  14220. Retracting propose*predict-yes
  14221. -->
  14222. (O1997 ^name predict-yes +)
  14223. (S1 ^operator O1997 +)
  14224. Retracting elaborate*reward*based*on*reward
  14225. -->
  14226. (R1002 ^value 1 +)
  14227. (R1 ^reward R1002 +)
  14228. Retracting elaborate*copy-dir-to-output-link
  14229. -->
  14230. (I3 ^dir R +)
  14231. Retracting rl*prefer*rvt*predict-no*H0*6
  14232. -->
  14233. (S1 ^operator O1998 = 0.9999888743986174)
  14234. Retracting rl*prefer*rvt*predict-yes*H0*5
  14235. -->
  14236. (S1 ^operator O1997 = 0.1215989443698621)
  14237. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  14238. -->
  14239. (S1 ^operator O1997 = 0.8783951706845293)
  14240. =>WM: (14040: S1 ^operator O2000 +)
  14241. =>WM: (14039: S1 ^operator O1999 +)
  14242. =>WM: (14038: I3 ^dir U)
  14243. =>WM: (14037: O2000 ^name predict-no)
  14244. =>WM: (14036: O1999 ^name predict-yes)
  14245. =>WM: (14035: R1003 ^value 1)
  14246. =>WM: (14034: R1 ^reward R1003)
  14247. =>WM: (14033: I3 ^see 1)
  14248. <=WM: (14024: S1 ^operator O1997 +)
  14249. <=WM: (14026: S1 ^operator O1997)
  14250. <=WM: (14025: S1 ^operator O1998 +)
  14251. <=WM: (14023: I3 ^dir R)
  14252. <=WM: (14019: R1 ^reward R1002)
  14253. <=WM: (13964: I3 ^see 0)
  14254. <=WM: (14022: O1998 ^name predict-no)
  14255. <=WM: (14021: O1997 ^name predict-yes)
  14256. <=WM: (14020: R1002 ^value 1)
  14257. --- Inner Elaboration Phase, active level 1 (S1) ---
  14258. Firing prefer*rvt*predict-yes*H0
  14259. -->
  14260. Firing rl*prefer*rvt*predict-yes*H0*1
  14261. -->
  14262. (S1 ^operator O1999 = 0.)
  14263. Firing prefer*rvt*predict-no*H0
  14264. -->
  14265. Firing rl*prefer*rvt*predict-no*H0*2
  14266. -->
  14267. (S1 ^operator O2000 = 1.)
  14268. inner elaboration loop at bottom goal.
  14269. Retracting rl*prefer*rvt*predict-no*H0*2
  14270. -->
  14271. (S1 ^operator O1998 = 1.)
  14272. Retracting rl*prefer*rvt*predict-yes*H0*1
  14273. -->
  14274. (S1 ^operator O1997 = 0.)
  14275. --- END Proposal Phase ---
  14276. --- Decision Phase ---
  14277. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.864407,0.117874)
  14278. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.46547 0.412925 0.878395 -> 0.465471 0.412925 0.878396(R,m,v=1,1,0)
  14279. =>WM: (14041: S1 ^operator O2000)
  14280. 1000: O: O2000 (predict-no)
  14281. --- END Decision Phase ---
  14282. --- Application Phase ---
  14283. --- Firing Productions (PE) For State At Depth 1 ---
  14284. --- Inner Elaboration Phase, active level 1 (S1) ---
  14285. Firing apply*operator
  14286. -->
  14287. (I3 ^predict-no N1000 + :O )
  14288. Firing apply*operator*complete
  14289. -->
  14290. (I3 ^predict-yes N999 - :O )
  14291. inner elaboration loop at bottom goal.
  14292. --- Change Working Memory (PE) ---
  14293. =>WM: (14042: I3 ^predict-no N1000)
  14294. <=WM: (14028: N999 ^status complete)
  14295. <=WM: (14027: I3 ^predict-yes N999)
  14296. --- Firing Productions (IE) For State At Depth 1 ---
  14297. --- Inner Elaboration Phase, active level 1 (S1) ---
  14298. Firing monitor*world
  14299. -->
  14300. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14301. --- Change Working Memory (IE) ---
  14302. --- END Application Phase ---
  14303. --- Output Phase ---
  14304. ENV: Agent did: predict-no for direction U in state State-B
  14305. In State-B moving U
  14306. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14307. predict error 0
  14308. dir: dir isU
  14309. --- END Output Phase ---
  14310. \-/|\-/|--- Input Phase ---
  14311. =>WM: (14046: I2 ^dir U)
  14312. =>WM: (14045: I2 ^reward 1)
  14313. =>WM: (14044: I2 ^see 0)
  14314. =>WM: (14043: N1000 ^status complete)
  14315. <=WM: (14031: I2 ^dir U)
  14316. <=WM: (14030: I2 ^reward 1)
  14317. <=WM: (14029: I2 ^see 1)
  14318. =>WM: (14047: I2 ^level-1 R1-root)
  14319. <=WM: (14032: I2 ^level-1 R1-root)
  14320. --- END Input Phase ---
  14321. --- Proposal Phase ---
  14322. --- Inner Elaboration Phase, active level 1 (S1) ---
  14323. Firing elaborate*copy-see-to-output-link
  14324. -->
  14325. (I3 ^see 0 +)
  14326. Firing elaborate*reward*based*on*reward
  14327. -->
  14328. (R1004 ^value 1 +)
  14329. (R1 ^reward R1004 +)
  14330. Firing propose*predict-yes
  14331. -->
  14332. (O2001 ^name predict-yes +)
  14333. (S1 ^operator O2001 +)
  14334. Firing propose*predict-no
  14335. -->
  14336. (O2002 ^name predict-no +)
  14337. (S1 ^operator O2002 +)
  14338. Firing rl*prefer*rvt*predict-no*H0*2
  14339. -->
  14340. (S1 ^operator O2000 = 1.)
  14341. Firing rl*prefer*rvt*predict-yes*H0*1
  14342. -->
  14343. (S1 ^operator O1999 = 0.)
  14344. Firing prefer*rvt*predict-yes*H0
  14345. -->
  14346. Firing prefer*rvt*predict-no*H0
  14347. -->
  14348. Firing elaborate*copy-dir-to-output-link
  14349. -->
  14350. (I3 ^dir U +)
  14351. inner elaboration loop at bottom goal.
  14352. Retracting elaborate*copy-see-to-output-link
  14353. -->
  14354. (I3 ^see 1 +)
  14355. Retracting propose*predict-no
  14356. -->
  14357. (O2000 ^name predict-no +)
  14358. (S1 ^operator O2000 +)
  14359. Retracting propose*predict-yes
  14360. -->
  14361. (O1999 ^name predict-yes +)
  14362. (S1 ^operator O1999 +)
  14363. Retracting elaborate*reward*based*on*reward
  14364. -->
  14365. (R1003 ^value 1 +)
  14366. (R1 ^reward R1003 +)
  14367. Retracting elaborate*copy-dir-to-output-link
  14368. -->
  14369. (I3 ^dir U +)
  14370. Retracting rl*prefer*rvt*predict-no*H0*2
  14371. -->
  14372. (S1 ^operator O2000 = 1.)
  14373. Retracting rl*prefer*rvt*predict-yes*H0*1
  14374. -->
  14375. (S1 ^operator O1999 = 0.)
  14376. =>WM: (14054: S1 ^operator O2002 +)
  14377. =>WM: (14053: S1 ^operator O2001 +)
  14378. =>WM: (14052: O2002 ^name predict-no)
  14379. =>WM: (14051: O2001 ^name predict-yes)
  14380. =>WM: (14050: R1004 ^value 1)
  14381. =>WM: (14049: R1 ^reward R1004)
  14382. =>WM: (14048: I3 ^see 0)
  14383. <=WM: (14039: S1 ^operator O1999 +)
  14384. <=WM: (14040: S1 ^operator O2000 +)
  14385. <=WM: (14041: S1 ^operator O2000)
  14386. <=WM: (14034: R1 ^reward R1003)
  14387. <=WM: (14033: I3 ^see 1)
  14388. <=WM: (14037: O2000 ^name predict-no)
  14389. <=WM: (14036: O1999 ^name predict-yes)
  14390. <=WM: (14035: R1003 ^value 1)
  14391. --- Inner Elaboration Phase, active level 1 (S1) ---
  14392. Firing prefer*rvt*predict-yes*H0
  14393. -->
  14394. Firing rl*prefer*rvt*predict-yes*H0*1
  14395. -->
  14396. (S1 ^operator O2001 = 0.)
  14397. Firing prefer*rvt*predict-no*H0
  14398. -->
  14399. Firing rl*prefer*rvt*predict-no*H0*2
  14400. -->
  14401. (S1 ^operator O2002 = 1.)
  14402. inner elaboration loop at bottom goal.
  14403. Retracting rl*prefer*rvt*predict-no*H0*2
  14404. -->
  14405. (S1 ^operator O2000 = 1.)
  14406. Retracting rl*prefer*rvt*predict-yes*H0*1
  14407. -->
  14408. (S1 ^operator O1999 = 0.)
  14409. --- END Proposal Phase ---
  14410. --- Decision Phase ---
  14411. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14412. =>WM: (14055: S1 ^operator O2002)
  14413. 1001: O: O2002 (predict-no)
  14414. --- END Decision Phase ---
  14415. --- Application Phase ---
  14416. --- Firing Productions (PE) For State At Depth 1 ---
  14417. --- Inner Elaboration Phase, active level 1 (S1) ---
  14418. Firing apply*operator
  14419. -->
  14420. (I3 ^predict-no N1001 + :O )
  14421. Firing apply*operator*complete
  14422. -->
  14423. (I3 ^predict-no N1000 - :O )
  14424. inner elaboration loop at bottom goal.
  14425. --- Change Working Memory (PE) ---
  14426. =>WM: (14056: I3 ^predict-no N1001)
  14427. <=WM: (14043: N1000 ^status complete)
  14428. <=WM: (14042: I3 ^predict-no N1000)
  14429. --- Firing Productions (IE) For State At Depth 1 ---
  14430. --- Inner Elaboration Phase, active level 1 (S1) ---
  14431. Firing monitor*world
  14432. -->
  14433. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14434. --- Change Working Memory (IE) ---
  14435. --- END Application Phase ---
  14436. --- Output Phase ---
  14437. ENV: Agent did: predict-no for direction U in state State-B
  14438. In State-B moving U
  14439. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14440. predict error 0
  14441. dir: dir isU
  14442. --- END Output Phase ---
  14443. \--- Input Phase ---
  14444. =>WM: (14060: I2 ^dir U)
  14445. =>WM: (14059: I2 ^reward 1)
  14446. =>WM: (14058: I2 ^see 0)
  14447. =>WM: (14057: N1001 ^status complete)
  14448. <=WM: (14046: I2 ^dir U)
  14449. <=WM: (14045: I2 ^reward 1)
  14450. <=WM: (14044: I2 ^see 0)
  14451. =>WM: (14061: I2 ^level-1 R1-root)
  14452. <=WM: (14047: I2 ^level-1 R1-root)
  14453. --- END Input Phase ---
  14454. --- Proposal Phase ---
  14455. --- Inner Elaboration Phase, active level 1 (S1) ---
  14456. Firing elaborate*copy-see-to-output-link
  14457. -->
  14458. (I3 ^see 0 +)
  14459. Firing elaborate*reward*based*on*reward
  14460. -->
  14461. (R1005 ^value 1 +)
  14462. (R1 ^reward R1005 +)
  14463. Firing propose*predict-yes
  14464. -->
  14465. (O2003 ^name predict-yes +)
  14466. (S1 ^operator O2003 +)
  14467. Firing propose*predict-no
  14468. -->
  14469. (O2004 ^name predict-no +)
  14470. (S1 ^operator O2004 +)
  14471. Firing rl*prefer*rvt*predict-no*H0*2
  14472. -->
  14473. (S1 ^operator O2002 = 1.)
  14474. Firing rl*prefer*rvt*predict-yes*H0*1
  14475. -->
  14476. (S1 ^operator O2001 = 0.)
  14477. Firing prefer*rvt*predict-yes*H0
  14478. -->
  14479. Firing prefer*rvt*predict-no*H0
  14480. -->
  14481. Firing elaborate*copy-dir-to-output-link
  14482. -->
  14483. (I3 ^dir U +)
  14484. inner elaboration loop at bottom goal.
  14485. Retracting elaborate*copy-see-to-output-link
  14486. -->
  14487. (I3 ^see 0 +)
  14488. Retracting propose*predict-no
  14489. -->
  14490. (O2002 ^name predict-no +)
  14491. (S1 ^operator O2002 +)
  14492. Retracting propose*predict-yes
  14493. -->
  14494. (O2001 ^name predict-yes +)
  14495. (S1 ^operator O2001 +)
  14496. Retracting elaborate*reward*based*on*reward
  14497. -->
  14498. (R1004 ^value 1 +)
  14499. (R1 ^reward R1004 +)
  14500. Retracting elaborate*copy-dir-to-output-link
  14501. -->
  14502. (I3 ^dir U +)
  14503. Retracting rl*prefer*rvt*predict-no*H0*2
  14504. -->
  14505. (S1 ^operator O2002 = 1.)
  14506. Retracting rl*prefer*rvt*predict-yes*H0*1
  14507. -->
  14508. (S1 ^operator O2001 = 0.)
  14509. =>WM: (14067: S1 ^operator O2004 +)
  14510. =>WM: (14066: S1 ^operator O2003 +)
  14511. =>WM: (14065: O2004 ^name predict-no)
  14512. =>WM: (14064: O2003 ^name predict-yes)
  14513. =>WM: (14063: R1005 ^value 1)
  14514. =>WM: (14062: R1 ^reward R1005)
  14515. <=WM: (14053: S1 ^operator O2001 +)
  14516. <=WM: (14054: S1 ^operator O2002 +)
  14517. <=WM: (14055: S1 ^operator O2002)
  14518. <=WM: (14049: R1 ^reward R1004)
  14519. <=WM: (14052: O2002 ^name predict-no)
  14520. <=WM: (14051: O2001 ^name predict-yes)
  14521. <=WM: (14050: R1004 ^value 1)
  14522. --- Inner Elaboration Phase, active level 1 (S1) ---
  14523. Firing prefer*rvt*predict-yes*H0
  14524. -->
  14525. Firing rl*prefer*rvt*predict-yes*H0*1
  14526. -->
  14527. (S1 ^operator O2003 = 0.)
  14528. Firing prefer*rvt*predict-no*H0
  14529. -->
  14530. Firing rl*prefer*rvt*predict-no*H0*2
  14531. -->
  14532. (S1 ^operator O2004 = 1.)
  14533. inner elaboration loop at bottom goal.
  14534. Retracting rl*prefer*rvt*predict-no*H0*2
  14535. -->
  14536. (S1 ^operator O2002 = 1.)
  14537. Retracting rl*prefer*rvt*predict-yes*H0*1
  14538. -->
  14539. (S1 ^operator O2001 = 0.)
  14540. --- END Proposal Phase ---
  14541. --- Decision Phase ---
  14542. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14543. =>WM: (14068: S1 ^operator O2004)
  14544. 1002: O: O2004 (predict-no)
  14545. --- END Decision Phase ---
  14546. --- Application Phase ---
  14547. --- Firing Productions (PE) For State At Depth 1 ---
  14548. --- Inner Elaboration Phase, active level 1 (S1) ---
  14549. Firing apply*operator
  14550. -->
  14551. (I3 ^predict-no N1002 + :O )
  14552. Firing apply*operator*complete
  14553. -->
  14554. (I3 ^predict-no N1001 - :O )
  14555. inner elaboration loop at bottom goal.
  14556. --- Change Working Memory (PE) ---
  14557. =>WM: (14069: I3 ^predict-no N1002)
  14558. <=WM: (14057: N1001 ^status complete)
  14559. <=WM: (14056: I3 ^predict-no N1001)
  14560. --- Firing Productions (IE) For State At Depth 1 ---
  14561. --- Inner Elaboration Phase, active level 1 (S1) ---
  14562. Firing monitor*world
  14563. -->
  14564. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14565. --- Change Working Memory (IE) ---
  14566. --- END Application Phase ---
  14567. --- Output Phase ---
  14568. ENV: Agent did: predict-no for direction U in state State-B
  14569. In State-B moving U
  14570. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14571. predict error 0
  14572. dir: dir isR
  14573. --- END Output Phase ---
  14574. -/--- Input Phase ---
  14575. =>WM: (14073: I2 ^dir R)
  14576. =>WM: (14072: I2 ^reward 1)
  14577. =>WM: (14071: I2 ^see 0)
  14578. =>WM: (14070: N1002 ^status complete)
  14579. <=WM: (14060: I2 ^dir U)
  14580. <=WM: (14059: I2 ^reward 1)
  14581. <=WM: (14058: I2 ^see 0)
  14582. =>WM: (14074: I2 ^level-1 R1-root)
  14583. <=WM: (14061: I2 ^level-1 R1-root)
  14584. --- END Input Phase ---
  14585. --- Proposal Phase ---
  14586. --- Inner Elaboration Phase, active level 1 (S1) ---
  14587. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  14588. -->
  14589. (S1 ^operator O2003 = -0.04253361215288998)
  14590. Firing prefer*rvt*predict-yes*H0*5*H1
  14591. -->
  14592. Firing elaborate*copy-see-to-output-link
  14593. -->
  14594. (I3 ^see 0 +)
  14595. Firing elaborate*reward*based*on*reward
  14596. -->
  14597. (R1006 ^value 1 +)
  14598. (R1 ^reward R1006 +)
  14599. Firing propose*predict-yes
  14600. -->
  14601. (O2005 ^name predict-yes +)
  14602. (S1 ^operator O2005 +)
  14603. Firing propose*predict-no
  14604. -->
  14605. (O2006 ^name predict-no +)
  14606. (S1 ^operator O2006 +)
  14607. Firing rl*prefer*rvt*predict-no*H0*6
  14608. -->
  14609. (S1 ^operator O2004 = 0.9999888743986174)
  14610. Firing rl*prefer*rvt*predict-yes*H0*5
  14611. -->
  14612. (S1 ^operator O2003 = 0.1215994207949702)
  14613. Firing prefer*rvt*predict-yes*H0
  14614. -->
  14615. Firing prefer*rvt*predict-no*H0
  14616. -->
  14617. Firing elaborate*copy-dir-to-output-link
  14618. -->
  14619. (I3 ^dir R +)
  14620. inner elaboration loop at bottom goal.
  14621. Retracting elaborate*copy-see-to-output-link
  14622. -->
  14623. (I3 ^see 0 +)
  14624. Retracting propose*predict-no
  14625. -->
  14626. (O2004 ^name predict-no +)
  14627. (S1 ^operator O2004 +)
  14628. Retracting propose*predict-yes
  14629. -->
  14630. (O2003 ^name predict-yes +)
  14631. (S1 ^operator O2003 +)
  14632. Retracting elaborate*reward*based*on*reward
  14633. -->
  14634. (R1005 ^value 1 +)
  14635. (R1 ^reward R1005 +)
  14636. Retracting elaborate*copy-dir-to-output-link
  14637. -->
  14638. (I3 ^dir U +)
  14639. Retracting rl*prefer*rvt*predict-no*H0*2
  14640. -->
  14641. (S1 ^operator O2004 = 1.)
  14642. Retracting rl*prefer*rvt*predict-yes*H0*1
  14643. -->
  14644. (S1 ^operator O2003 = 0.)
  14645. =>WM: (14081: S1 ^operator O2006 +)
  14646. =>WM: (14080: S1 ^operator O2005 +)
  14647. =>WM: (14079: I3 ^dir R)
  14648. =>WM: (14078: O2006 ^name predict-no)
  14649. =>WM: (14077: O2005 ^name predict-yes)
  14650. =>WM: (14076: R1006 ^value 1)
  14651. =>WM: (14075: R1 ^reward R1006)
  14652. <=WM: (14066: S1 ^operator O2003 +)
  14653. <=WM: (14067: S1 ^operator O2004 +)
  14654. <=WM: (14068: S1 ^operator O2004)
  14655. <=WM: (14038: I3 ^dir U)
  14656. <=WM: (14062: R1 ^reward R1005)
  14657. <=WM: (14065: O2004 ^name predict-no)
  14658. <=WM: (14064: O2003 ^name predict-yes)
  14659. <=WM: (14063: R1005 ^value 1)
  14660. --- Inner Elaboration Phase, active level 1 (S1) ---
  14661. Firing prefer*rvt*predict-yes*H0
  14662. -->
  14663. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  14664. -->
  14665. (S1 ^operator O2005 = -0.04253361215288998)
  14666. Firing rl*prefer*rvt*predict-yes*H0*5
  14667. -->
  14668. (S1 ^operator O2005 = 0.1215994207949702)
  14669. Firing prefer*rvt*predict-yes*H0*5*H1
  14670. -->
  14671. Firing prefer*rvt*predict-no*H0
  14672. -->
  14673. Firing rl*prefer*rvt*predict-no*H0*6
  14674. -->
  14675. (S1 ^operator O2006 = 0.9999888743986174)
  14676. inner elaboration loop at bottom goal.
  14677. Retracting rl*prefer*rvt*predict-no*H0*6
  14678. -->
  14679. (S1 ^operator O2004 = 0.9999888743986174)
  14680. Retracting rl*prefer*rvt*predict-yes*H0*5
  14681. -->
  14682. (S1 ^operator O2003 = 0.1215994207949702)
  14683. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  14684. -->
  14685. (S1 ^operator O2003 = -0.04253361215288998)
  14686. --- END Proposal Phase ---
  14687. --- Decision Phase ---
  14688. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  14689. =>WM: (14082: S1 ^operator O2006)
  14690. 1003: O: O2006 (predict-no)
  14691. --- END Decision Phase ---
  14692. --- Application Phase ---
  14693. --- Firing Productions (PE) For State At Depth 1 ---
  14694. --- Inner Elaboration Phase, active level 1 (S1) ---
  14695. Firing apply*operator
  14696. -->
  14697. (I3 ^predict-no N1003 + :O )
  14698. Firing apply*operator*complete
  14699. -->
  14700. (I3 ^predict-no N1002 - :O )
  14701. inner elaboration loop at bottom goal.
  14702. --- Change Working Memory (PE) ---
  14703. =>WM: (14083: I3 ^predict-no N1003)
  14704. <=WM: (14070: N1002 ^status complete)
  14705. <=WM: (14069: I3 ^predict-no N1002)
  14706. --- Firing Productions (IE) For State At Depth 1 ---
  14707. --- Inner Elaboration Phase, active level 1 (S1) ---
  14708. Firing monitor*world
  14709. -->
  14710. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14711. --- Change Working Memory (IE) ---
  14712. --- END Application Phase ---
  14713. --- Output Phase ---
  14714. ENV: Agent did: predict-no for direction R in state State-B
  14715. In State-B moving R
  14716. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14717. predict error 0
  14718. dir: dir isR
  14719. --- END Output Phase ---
  14720. |\---- Input Phase ---
  14721. =>WM: (14087: I2 ^dir R)
  14722. =>WM: (14086: I2 ^reward 1)
  14723. =>WM: (14085: I2 ^see 0)
  14724. =>WM: (14084: N1003 ^status complete)
  14725. <=WM: (14073: I2 ^dir R)
  14726. <=WM: (14072: I2 ^reward 1)
  14727. <=WM: (14071: I2 ^see 0)
  14728. =>WM: (14088: I2 ^level-1 R0-root)
  14729. <=WM: (14074: I2 ^level-1 R1-root)
  14730. --- END Input Phase ---
  14731. --- Proposal Phase ---
  14732. --- Inner Elaboration Phase, active level 1 (S1) ---
  14733. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  14734. -->
  14735. (S1 ^operator O2005 = -0.1512366769350551)
  14736. Firing prefer*rvt*predict-yes*H0*5*H1
  14737. -->
  14738. Firing elaborate*copy-see-to-output-link
  14739. -->
  14740. (I3 ^see 0 +)
  14741. Firing elaborate*reward*based*on*reward
  14742. -->
  14743. (R1007 ^value 1 +)
  14744. (R1 ^reward R1007 +)
  14745. Firing propose*predict-yes
  14746. -->
  14747. (O2007 ^name predict-yes +)
  14748. (S1 ^operator O2007 +)
  14749. Firing propose*predict-no
  14750. -->
  14751. (O2008 ^name predict-no +)
  14752. (S1 ^operator O2008 +)
  14753. Firing rl*prefer*rvt*predict-no*H0*6
  14754. -->
  14755. (S1 ^operator O2006 = 0.9999888743986174)
  14756. Firing rl*prefer*rvt*predict-yes*H0*5
  14757. -->
  14758. (S1 ^operator O2005 = 0.1215994207949702)
  14759. Firing prefer*rvt*predict-yes*H0
  14760. -->
  14761. Firing prefer*rvt*predict-no*H0
  14762. -->
  14763. Firing elaborate*copy-dir-to-output-link
  14764. -->
  14765. (I3 ^dir R +)
  14766. inner elaboration loop at bottom goal.
  14767. Retracting elaborate*copy-see-to-output-link
  14768. -->
  14769. (I3 ^see 0 +)
  14770. Retracting propose*predict-no
  14771. -->
  14772. (O2006 ^name predict-no +)
  14773. (S1 ^operator O2006 +)
  14774. Retracting propose*predict-yes
  14775. -->
  14776. (O2005 ^name predict-yes +)
  14777. (S1 ^operator O2005 +)
  14778. Retracting elaborate*reward*based*on*reward
  14779. -->
  14780. (R1006 ^value 1 +)
  14781. (R1 ^reward R1006 +)
  14782. Retracting elaborate*copy-dir-to-output-link
  14783. -->
  14784. (I3 ^dir R +)
  14785. Retracting rl*prefer*rvt*predict-no*H0*6
  14786. -->
  14787. (S1 ^operator O2006 = 0.9999888743986174)
  14788. Retracting rl*prefer*rvt*predict-yes*H0*5
  14789. -->
  14790. (S1 ^operator O2005 = 0.1215994207949702)
  14791. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  14792. -->
  14793. (S1 ^operator O2005 = -0.04253361215288998)
  14794. =>WM: (14094: S1 ^operator O2008 +)
  14795. =>WM: (14093: S1 ^operator O2007 +)
  14796. =>WM: (14092: O2008 ^name predict-no)
  14797. =>WM: (14091: O2007 ^name predict-yes)
  14798. =>WM: (14090: R1007 ^value 1)
  14799. =>WM: (14089: R1 ^reward R1007)
  14800. <=WM: (14080: S1 ^operator O2005 +)
  14801. <=WM: (14081: S1 ^operator O2006 +)
  14802. <=WM: (14082: S1 ^operator O2006)
  14803. <=WM: (14075: R1 ^reward R1006)
  14804. <=WM: (14078: O2006 ^name predict-no)
  14805. <=WM: (14077: O2005 ^name predict-yes)
  14806. <=WM: (14076: R1006 ^value 1)
  14807. --- Inner Elaboration Phase, active level 1 (S1) ---
  14808. Firing prefer*rvt*predict-yes*H0
  14809. -->
  14810. Firing rl*prefer*rvt*predict-yes*H0*5
  14811. -->
  14812. (S1 ^operator O2007 = 0.1215994207949702)
  14813. Firing prefer*rvt*predict-yes*H0*5*H1
  14814. -->
  14815. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  14816. -->
  14817. (S1 ^operator O2007 = -0.1512366769350551)
  14818. Firing prefer*rvt*predict-no*H0
  14819. -->
  14820. Firing rl*prefer*rvt*predict-no*H0*6
  14821. -->
  14822. (S1 ^operator O2008 = 0.9999888743986174)
  14823. inner elaboration loop at bottom goal.
  14824. Retracting rl*prefer*rvt*predict-no*H0*6
  14825. -->
  14826. (S1 ^operator O2006 = 0.9999888743986174)
  14827. Retracting rl*prefer*rvt*predict-yes*H0*5
  14828. -->
  14829. (S1 ^operator O2005 = 0.1215994207949702)
  14830. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  14831. -->
  14832. (S1 ^operator O2005 = -0.1512366769350551)
  14833. --- END Proposal Phase ---
  14834. --- Decision Phase ---
  14835. RL update rl*prefer*rvt*predict-no*H0*6 0.999989 0 0.999989 -> 0.999991 0 0.999991(R,m,v=1,0.938202,0.0583064)
  14836. =>WM: (14095: S1 ^operator O2008)
  14837. 1004: O: O2008 (predict-no)
  14838. --- END Decision Phase ---
  14839. --- Application Phase ---
  14840. --- Firing Productions (PE) For State At Depth 1 ---
  14841. --- Inner Elaboration Phase, active level 1 (S1) ---
  14842. Firing apply*operator
  14843. -->
  14844. (I3 ^predict-no N1004 + :O )
  14845. Firing apply*operator*complete
  14846. -->
  14847. (I3 ^predict-no N1003 - :O )
  14848. inner elaboration loop at bottom goal.
  14849. --- Change Working Memory (PE) ---
  14850. =>WM: (14096: I3 ^predict-no N1004)
  14851. <=WM: (14084: N1003 ^status complete)
  14852. <=WM: (14083: I3 ^predict-no N1003)
  14853. --- Firing Productions (IE) For State At Depth 1 ---
  14854. --- Inner Elaboration Phase, active level 1 (S1) ---
  14855. Firing monitor*world
  14856. -->
  14857. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14858. --- Change Working Memory (IE) ---
  14859. --- END Application Phase ---
  14860. --- Output Phase ---
  14861. ENV: Agent did: predict-no for direction R in state State-B
  14862. In State-B moving R
  14863. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  14864. predict error 0
  14865. dir: dir isU
  14866. --- END Output Phase ---
  14867. /|\--- Input Phase ---
  14868. =>WM: (14100: I2 ^dir U)
  14869. =>WM: (14099: I2 ^reward 1)
  14870. =>WM: (14098: I2 ^see 0)
  14871. =>WM: (14097: N1004 ^status complete)
  14872. <=WM: (14087: I2 ^dir R)
  14873. <=WM: (14086: I2 ^reward 1)
  14874. <=WM: (14085: I2 ^see 0)
  14875. =>WM: (14101: I2 ^level-1 R0-root)
  14876. <=WM: (14088: I2 ^level-1 R0-root)
  14877. --- END Input Phase ---
  14878. --- Proposal Phase ---
  14879. --- Inner Elaboration Phase, active level 1 (S1) ---
  14880. Firing elaborate*copy-see-to-output-link
  14881. -->
  14882. (I3 ^see 0 +)
  14883. Firing elaborate*reward*based*on*reward
  14884. -->
  14885. (R1008 ^value 1 +)
  14886. (R1 ^reward R1008 +)
  14887. Firing propose*predict-yes
  14888. -->
  14889. (O2009 ^name predict-yes +)
  14890. (S1 ^operator O2009 +)
  14891. Firing propose*predict-no
  14892. -->
  14893. (O2010 ^name predict-no +)
  14894. (S1 ^operator O2010 +)
  14895. Firing rl*prefer*rvt*predict-no*H0*2
  14896. -->
  14897. (S1 ^operator O2008 = 1.)
  14898. Firing rl*prefer*rvt*predict-yes*H0*1
  14899. -->
  14900. (S1 ^operator O2007 = 0.)
  14901. Firing prefer*rvt*predict-yes*H0
  14902. -->
  14903. Firing prefer*rvt*predict-no*H0
  14904. -->
  14905. Firing elaborate*copy-dir-to-output-link
  14906. -->
  14907. (I3 ^dir U +)
  14908. inner elaboration loop at bottom goal.
  14909. Retracting elaborate*copy-see-to-output-link
  14910. -->
  14911. (I3 ^see 0 +)
  14912. Retracting propose*predict-no
  14913. -->
  14914. (O2008 ^name predict-no +)
  14915. (S1 ^operator O2008 +)
  14916. Retracting propose*predict-yes
  14917. -->
  14918. (O2007 ^name predict-yes +)
  14919. (S1 ^operator O2007 +)
  14920. Retracting elaborate*reward*based*on*reward
  14921. -->
  14922. (R1007 ^value 1 +)
  14923. (R1 ^reward R1007 +)
  14924. Retracting elaborate*copy-dir-to-output-link
  14925. -->
  14926. (I3 ^dir R +)
  14927. Retracting rl*prefer*rvt*predict-no*H0*6
  14928. -->
  14929. (S1 ^operator O2008 = 0.9999906741383352)
  14930. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  14931. -->
  14932. (S1 ^operator O2007 = -0.1512366769350551)
  14933. Retracting rl*prefer*rvt*predict-yes*H0*5
  14934. -->
  14935. (S1 ^operator O2007 = 0.1215994207949702)
  14936. =>WM: (14108: S1 ^operator O2010 +)
  14937. =>WM: (14107: S1 ^operator O2009 +)
  14938. =>WM: (14106: I3 ^dir U)
  14939. =>WM: (14105: O2010 ^name predict-no)
  14940. =>WM: (14104: O2009 ^name predict-yes)
  14941. =>WM: (14103: R1008 ^value 1)
  14942. =>WM: (14102: R1 ^reward R1008)
  14943. <=WM: (14093: S1 ^operator O2007 +)
  14944. <=WM: (14094: S1 ^operator O2008 +)
  14945. <=WM: (14095: S1 ^operator O2008)
  14946. <=WM: (14079: I3 ^dir R)
  14947. <=WM: (14089: R1 ^reward R1007)
  14948. <=WM: (14092: O2008 ^name predict-no)
  14949. <=WM: (14091: O2007 ^name predict-yes)
  14950. <=WM: (14090: R1007 ^value 1)
  14951. --- Inner Elaboration Phase, active level 1 (S1) ---
  14952. Firing prefer*rvt*predict-yes*H0
  14953. -->
  14954. Firing rl*prefer*rvt*predict-yes*H0*1
  14955. -->
  14956. (S1 ^operator O2009 = 0.)
  14957. Firing prefer*rvt*predict-no*H0
  14958. -->
  14959. Firing rl*prefer*rvt*predict-no*H0*2
  14960. -->
  14961. (S1 ^operator O2010 = 1.)
  14962. inner elaboration loop at bottom goal.
  14963. Retracting rl*prefer*rvt*predict-no*H0*2
  14964. -->
  14965. (S1 ^operator O2008 = 1.)
  14966. Retracting rl*prefer*rvt*predict-yes*H0*1
  14967. -->
  14968. (S1 ^operator O2007 = 0.)
  14969. --- END Proposal Phase ---
  14970. --- Decision Phase ---
  14971. RL update rl*prefer*rvt*predict-no*H0*6 0.999991 0 0.999991 -> 0.999992 0 0.999992(R,m,v=1,0.938547,0.0580001)
  14972. =>WM: (14109: S1 ^operator O2010)
  14973. 1005: O: O2010 (predict-no)
  14974. --- END Decision Phase ---
  14975. --- Application Phase ---
  14976. --- Firing Productions (PE) For State At Depth 1 ---
  14977. --- Inner Elaboration Phase, active level 1 (S1) ---
  14978. Firing apply*operator
  14979. -->
  14980. (I3 ^predict-no N1005 + :O )
  14981. Firing apply*operator*complete
  14982. -->
  14983. (I3 ^predict-no N1004 - :O )
  14984. inner elaboration loop at bottom goal.
  14985. --- Change Working Memory (PE) ---
  14986. =>WM: (14110: I3 ^predict-no N1005)
  14987. <=WM: (14097: N1004 ^status complete)
  14988. <=WM: (14096: I3 ^predict-no N1004)
  14989. --- Firing Productions (IE) For State At Depth 1 ---
  14990. --- Inner Elaboration Phase, active level 1 (S1) ---
  14991. Firing monitor*world
  14992. -->
  14993. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  14994. --- Change Working Memory (IE) ---
  14995. --- END Application Phase ---
  14996. --- Output Phase ---
  14997. ENV: Agent did: predict-no for direction U in state State-B
  14998. In State-B moving U
  14999. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15000. predict error 0
  15001. dir: dir isU
  15002. --- END Output Phase ---
  15003. -/|--- Input Phase ---
  15004. =>WM: (14114: I2 ^dir U)
  15005. =>WM: (14113: I2 ^reward 1)
  15006. =>WM: (14112: I2 ^see 0)
  15007. =>WM: (14111: N1005 ^status complete)
  15008. <=WM: (14100: I2 ^dir U)
  15009. <=WM: (14099: I2 ^reward 1)
  15010. <=WM: (14098: I2 ^see 0)
  15011. =>WM: (14115: I2 ^level-1 R0-root)
  15012. <=WM: (14101: I2 ^level-1 R0-root)
  15013. --- END Input Phase ---
  15014. --- Proposal Phase ---
  15015. --- Inner Elaboration Phase, active level 1 (S1) ---
  15016. Firing elaborate*copy-see-to-output-link
  15017. -->
  15018. (I3 ^see 0 +)
  15019. Firing elaborate*reward*based*on*reward
  15020. -->
  15021. (R1009 ^value 1 +)
  15022. (R1 ^reward R1009 +)
  15023. Firing propose*predict-yes
  15024. -->
  15025. (O2011 ^name predict-yes +)
  15026. (S1 ^operator O2011 +)
  15027. Firing propose*predict-no
  15028. -->
  15029. (O2012 ^name predict-no +)
  15030. (S1 ^operator O2012 +)
  15031. Firing rl*prefer*rvt*predict-no*H0*2
  15032. -->
  15033. (S1 ^operator O2010 = 1.)
  15034. Firing rl*prefer*rvt*predict-yes*H0*1
  15035. -->
  15036. (S1 ^operator O2009 = 0.)
  15037. Firing prefer*rvt*predict-yes*H0
  15038. -->
  15039. Firing prefer*rvt*predict-no*H0
  15040. -->
  15041. Firing elaborate*copy-dir-to-output-link
  15042. -->
  15043. (I3 ^dir U +)
  15044. inner elaboration loop at bottom goal.
  15045. Retracting elaborate*copy-see-to-output-link
  15046. -->
  15047. (I3 ^see 0 +)
  15048. Retracting propose*predict-no
  15049. -->
  15050. (O2010 ^name predict-no +)
  15051. (S1 ^operator O2010 +)
  15052. Retracting propose*predict-yes
  15053. -->
  15054. (O2009 ^name predict-yes +)
  15055. (S1 ^operator O2009 +)
  15056. Retracting elaborate*reward*based*on*reward
  15057. -->
  15058. (R1008 ^value 1 +)
  15059. (R1 ^reward R1008 +)
  15060. Retracting elaborate*copy-dir-to-output-link
  15061. -->
  15062. (I3 ^dir U +)
  15063. Retracting rl*prefer*rvt*predict-no*H0*2
  15064. -->
  15065. (S1 ^operator O2010 = 1.)
  15066. Retracting rl*prefer*rvt*predict-yes*H0*1
  15067. -->
  15068. (S1 ^operator O2009 = 0.)
  15069. =>WM: (14121: S1 ^operator O2012 +)
  15070. =>WM: (14120: S1 ^operator O2011 +)
  15071. =>WM: (14119: O2012 ^name predict-no)
  15072. =>WM: (14118: O2011 ^name predict-yes)
  15073. =>WM: (14117: R1009 ^value 1)
  15074. =>WM: (14116: R1 ^reward R1009)
  15075. <=WM: (14107: S1 ^operator O2009 +)
  15076. <=WM: (14108: S1 ^operator O2010 +)
  15077. <=WM: (14109: S1 ^operator O2010)
  15078. <=WM: (14102: R1 ^reward R1008)
  15079. <=WM: (14105: O2010 ^name predict-no)
  15080. <=WM: (14104: O2009 ^name predict-yes)
  15081. <=WM: (14103: R1008 ^value 1)
  15082. --- Inner Elaboration Phase, active level 1 (S1) ---
  15083. Firing prefer*rvt*predict-yes*H0
  15084. -->
  15085. Firing rl*prefer*rvt*predict-yes*H0*1
  15086. -->
  15087. (S1 ^operator O2011 = 0.)
  15088. Firing prefer*rvt*predict-no*H0
  15089. -->
  15090. Firing rl*prefer*rvt*predict-no*H0*2
  15091. -->
  15092. (S1 ^operator O2012 = 1.)
  15093. inner elaboration loop at bottom goal.
  15094. Retracting rl*prefer*rvt*predict-no*H0*2
  15095. -->
  15096. (S1 ^operator O2010 = 1.)
  15097. Retracting rl*prefer*rvt*predict-yes*H0*1
  15098. -->
  15099. (S1 ^operator O2009 = 0.)
  15100. --- END Proposal Phase ---
  15101. --- Decision Phase ---
  15102. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15103. =>WM: (14122: S1 ^operator O2012)
  15104. 1006: O: O2012 (predict-no)
  15105. --- END Decision Phase ---
  15106. --- Application Phase ---
  15107. --- Firing Productions (PE) For State At Depth 1 ---
  15108. --- Inner Elaboration Phase, active level 1 (S1) ---
  15109. Firing apply*operator
  15110. -->
  15111. (I3 ^predict-no N1006 + :O )
  15112. Firing apply*operator*complete
  15113. -->
  15114. (I3 ^predict-no N1005 - :O )
  15115. inner elaboration loop at bottom goal.
  15116. --- Change Working Memory (PE) ---
  15117. =>WM: (14123: I3 ^predict-no N1006)
  15118. <=WM: (14111: N1005 ^status complete)
  15119. <=WM: (14110: I3 ^predict-no N1005)
  15120. --- Firing Productions (IE) For State At Depth 1 ---
  15121. --- Inner Elaboration Phase, active level 1 (S1) ---
  15122. Firing monitor*world
  15123. -->
  15124. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15125. --- Change Working Memory (IE) ---
  15126. --- END Application Phase ---
  15127. --- Output Phase ---
  15128. ENV: Agent did: predict-no for direction U in state State-B
  15129. In State-B moving U
  15130. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  15131. predict error 0
  15132. dir: dir isL
  15133. --- END Output Phase ---
  15134. \-/--- Input Phase ---
  15135. =>WM: (14127: I2 ^dir L)
  15136. =>WM: (14126: I2 ^reward 1)
  15137. =>WM: (14125: I2 ^see 0)
  15138. =>WM: (14124: N1006 ^status complete)
  15139. <=WM: (14114: I2 ^dir U)
  15140. <=WM: (14113: I2 ^reward 1)
  15141. <=WM: (14112: I2 ^see 0)
  15142. =>WM: (14128: I2 ^level-1 R0-root)
  15143. <=WM: (14115: I2 ^level-1 R0-root)
  15144. --- END Input Phase ---
  15145. --- Proposal Phase ---
  15146. --- Inner Elaboration Phase, active level 1 (S1) ---
  15147. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  15148. -->
  15149. (S1 ^operator O2012 = -0.1984300550322165)
  15150. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  15151. -->
  15152. (S1 ^operator O2011 = 0.6091150129894595)
  15153. Firing prefer*rvt*predict-no*H0*4*H1
  15154. -->
  15155. Firing prefer*rvt*predict-yes*H0*3*H1
  15156. -->
  15157. Firing elaborate*copy-see-to-output-link
  15158. -->
  15159. (I3 ^see 0 +)
  15160. Firing elaborate*reward*based*on*reward
  15161. -->
  15162. (R1010 ^value 1 +)
  15163. (R1 ^reward R1010 +)
  15164. Firing propose*predict-yes
  15165. -->
  15166. (O2013 ^name predict-yes +)
  15167. (S1 ^operator O2013 +)
  15168. Firing propose*predict-no
  15169. -->
  15170. (O2014 ^name predict-no +)
  15171. (S1 ^operator O2014 +)
  15172. Firing rl*prefer*rvt*predict-no*H0*4
  15173. -->
  15174. (S1 ^operator O2012 = 0.3145079413521559)
  15175. Firing rl*prefer*rvt*predict-yes*H0*3
  15176. -->
  15177. (S1 ^operator O2011 = 0.3907782094907327)
  15178. Firing prefer*rvt*predict-yes*H0
  15179. -->
  15180. Firing prefer*rvt*predict-no*H0
  15181. -->
  15182. Firing elaborate*copy-dir-to-output-link
  15183. -->
  15184. (I3 ^dir L +)
  15185. inner elaboration loop at bottom goal.
  15186. Retracting elaborate*copy-see-to-output-link
  15187. -->
  15188. (I3 ^see 0 +)
  15189. Retracting propose*predict-no
  15190. -->
  15191. (O2012 ^name predict-no +)
  15192. (S1 ^operator O2012 +)
  15193. Retracting propose*predict-yes
  15194. -->
  15195. (O2011 ^name predict-yes +)
  15196. (S1 ^operator O2011 +)
  15197. Retracting elaborate*reward*based*on*reward
  15198. -->
  15199. (R1009 ^value 1 +)
  15200. (R1 ^reward R1009 +)
  15201. Retracting elaborate*copy-dir-to-output-link
  15202. -->
  15203. (I3 ^dir U +)
  15204. Retracting rl*prefer*rvt*predict-no*H0*2
  15205. -->
  15206. (S1 ^operator O2012 = 1.)
  15207. Retracting rl*prefer*rvt*predict-yes*H0*1
  15208. -->
  15209. (S1 ^operator O2011 = 0.)
  15210. =>WM: (14135: S1 ^operator O2014 +)
  15211. =>WM: (14134: S1 ^operator O2013 +)
  15212. =>WM: (14133: I3 ^dir L)
  15213. =>WM: (14132: O2014 ^name predict-no)
  15214. =>WM: (14131: O2013 ^name predict-yes)
  15215. =>WM: (14130: R1010 ^value 1)
  15216. =>WM: (14129: R1 ^reward R1010)
  15217. <=WM: (14120: S1 ^operator O2011 +)
  15218. <=WM: (14121: S1 ^operator O2012 +)
  15219. <=WM: (14122: S1 ^operator O2012)
  15220. <=WM: (14106: I3 ^dir U)
  15221. <=WM: (14116: R1 ^reward R1009)
  15222. <=WM: (14119: O2012 ^name predict-no)
  15223. <=WM: (14118: O2011 ^name predict-yes)
  15224. <=WM: (14117: R1009 ^value 1)
  15225. --- Inner Elaboration Phase, active level 1 (S1) ---
  15226. Firing prefer*rvt*predict-yes*H0
  15227. -->
  15228. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  15229. -->
  15230. (S1 ^operator O2013 = 0.6091150129894595)
  15231. Firing rl*prefer*rvt*predict-yes*H0*3
  15232. -->
  15233. (S1 ^operator O2013 = 0.3907782094907327)
  15234. Firing prefer*rvt*predict-yes*H0*3*H1
  15235. -->
  15236. Firing prefer*rvt*predict-no*H0
  15237. -->
  15238. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  15239. -->
  15240. (S1 ^operator O2014 = -0.1984300550322165)
  15241. Firing rl*prefer*rvt*predict-no*H0*4
  15242. -->
  15243. (S1 ^operator O2014 = 0.3145079413521559)
  15244. Firing prefer*rvt*predict-no*H0*4*H1
  15245. -->
  15246. inner elaboration loop at bottom goal.
  15247. Retracting rl*prefer*rvt*predict-no*H0*4
  15248. -->
  15249. (S1 ^operator O2012 = 0.3145079413521559)
  15250. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  15251. -->
  15252. (S1 ^operator O2012 = -0.1984300550322165)
  15253. Retracting rl*prefer*rvt*predict-yes*H0*3
  15254. -->
  15255. (S1 ^operator O2011 = 0.3907782094907327)
  15256. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  15257. -->
  15258. (S1 ^operator O2011 = 0.6091150129894595)
  15259. --- END Proposal Phase ---
  15260. --- Decision Phase ---
  15261. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  15262. =>WM: (14136: S1 ^operator O2013)
  15263. 1007: O: O2013 (predict-yes)
  15264. --- END Decision Phase ---
  15265. --- Application Phase ---
  15266. --- Firing Productions (PE) For State At Depth 1 ---
  15267. --- Inner Elaboration Phase, active level 1 (S1) ---
  15268. Firing apply*operator
  15269. -->
  15270. (I3 ^predict-yes N1007 + :O )
  15271. Firing apply*operator*complete
  15272. -->
  15273. (I3 ^predict-no N1006 - :O )
  15274. inner elaboration loop at bottom goal.
  15275. --- Change Working Memory (PE) ---
  15276. =>WM: (14137: I3 ^predict-yes N1007)
  15277. <=WM: (14124: N1006 ^status complete)
  15278. <=WM: (14123: I3 ^predict-no N1006)
  15279. --- Firing Productions (IE) For State At Depth 1 ---
  15280. --- Inner Elaboration Phase, active level 1 (S1) ---
  15281. Firing monitor*world
  15282. -->
  15283. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15284. --- Change Working Memory (IE) ---
  15285. --- END Application Phase ---
  15286. --- Output Phase ---
  15287. ENV: Agent did: predict-yes for direction L in state State-B
  15288. In State-B moving L
  15289. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15290. predict error 0
  15291. dir: dir isR
  15292. --- END Output Phase ---
  15293. |\---- Input Phase ---
  15294. =>WM: (14141: I2 ^dir R)
  15295. =>WM: (14140: I2 ^reward 1)
  15296. =>WM: (14139: I2 ^see 1)
  15297. =>WM: (14138: N1007 ^status complete)
  15298. <=WM: (14127: I2 ^dir L)
  15299. <=WM: (14126: I2 ^reward 1)
  15300. <=WM: (14125: I2 ^see 0)
  15301. =>WM: (14142: I2 ^level-1 L1-root)
  15302. <=WM: (14128: I2 ^level-1 R0-root)
  15303. --- END Input Phase ---
  15304. --- Proposal Phase ---
  15305. --- Inner Elaboration Phase, active level 1 (S1) ---
  15306. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  15307. -->
  15308. (S1 ^operator O2013 = 0.8784140715701729)
  15309. Firing prefer*rvt*predict-yes*H0*5*H1
  15310. -->
  15311. Firing elaborate*copy-see-to-output-link
  15312. -->
  15313. (I3 ^see 1 +)
  15314. Firing elaborate*reward*based*on*reward
  15315. -->
  15316. (R1011 ^value 1 +)
  15317. (R1 ^reward R1011 +)
  15318. Firing propose*predict-yes
  15319. -->
  15320. (O2015 ^name predict-yes +)
  15321. (S1 ^operator O2015 +)
  15322. Firing propose*predict-no
  15323. -->
  15324. (O2016 ^name predict-no +)
  15325. (S1 ^operator O2016 +)
  15326. Firing rl*prefer*rvt*predict-no*H0*6
  15327. -->
  15328. (S1 ^operator O2014 = 0.9999921813761182)
  15329. Firing rl*prefer*rvt*predict-yes*H0*5
  15330. -->
  15331. (S1 ^operator O2013 = 0.1215994207949702)
  15332. Firing prefer*rvt*predict-yes*H0
  15333. -->
  15334. Firing prefer*rvt*predict-no*H0
  15335. -->
  15336. Firing elaborate*copy-dir-to-output-link
  15337. -->
  15338. (I3 ^dir R +)
  15339. inner elaboration loop at bottom goal.
  15340. Retracting elaborate*copy-see-to-output-link
  15341. -->
  15342. (I3 ^see 0 +)
  15343. Retracting propose*predict-no
  15344. -->
  15345. (O2014 ^name predict-no +)
  15346. (S1 ^operator O2014 +)
  15347. Retracting propose*predict-yes
  15348. -->
  15349. (O2013 ^name predict-yes +)
  15350. (S1 ^operator O2013 +)
  15351. Retracting elaborate*reward*based*on*reward
  15352. -->
  15353. (R1010 ^value 1 +)
  15354. (R1 ^reward R1010 +)
  15355. Retracting elaborate*copy-dir-to-output-link
  15356. -->
  15357. (I3 ^dir L +)
  15358. Retracting rl*prefer*rvt*predict-no*H0*4
  15359. -->
  15360. (S1 ^operator O2014 = 0.3145079413521559)
  15361. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  15362. -->
  15363. (S1 ^operator O2014 = -0.1984300550322165)
  15364. Retracting rl*prefer*rvt*predict-yes*H0*3
  15365. -->
  15366. (S1 ^operator O2013 = 0.3907782094907327)
  15367. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  15368. -->
  15369. (S1 ^operator O2013 = 0.6091150129894595)
  15370. =>WM: (14150: S1 ^operator O2016 +)
  15371. =>WM: (14149: S1 ^operator O2015 +)
  15372. =>WM: (14148: I3 ^dir R)
  15373. =>WM: (14147: O2016 ^name predict-no)
  15374. =>WM: (14146: O2015 ^name predict-yes)
  15375. =>WM: (14145: R1011 ^value 1)
  15376. =>WM: (14144: R1 ^reward R1011)
  15377. =>WM: (14143: I3 ^see 1)
  15378. <=WM: (14134: S1 ^operator O2013 +)
  15379. <=WM: (14136: S1 ^operator O2013)
  15380. <=WM: (14135: S1 ^operator O2014 +)
  15381. <=WM: (14133: I3 ^dir L)
  15382. <=WM: (14129: R1 ^reward R1010)
  15383. <=WM: (14048: I3 ^see 0)
  15384. <=WM: (14132: O2014 ^name predict-no)
  15385. <=WM: (14131: O2013 ^name predict-yes)
  15386. <=WM: (14130: R1010 ^value 1)
  15387. --- Inner Elaboration Phase, active level 1 (S1) ---
  15388. Firing prefer*rvt*predict-yes*H0
  15389. -->
  15390. Firing rl*prefer*rvt*predict-yes*H0*5
  15391. -->
  15392. (S1 ^operator O2015 = 0.1215994207949702)
  15393. Firing prefer*rvt*predict-yes*H0*5*H1
  15394. -->
  15395. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  15396. -->
  15397. (S1 ^operator O2015 = 0.8784140715701729)
  15398. Firing prefer*rvt*predict-no*H0
  15399. -->
  15400. Firing rl*prefer*rvt*predict-no*H0*6
  15401. -->
  15402. (S1 ^operator O2016 = 0.9999921813761182)
  15403. inner elaboration loop at bottom goal.
  15404. Retracting rl*prefer*rvt*predict-no*H0*6
  15405. -->
  15406. (S1 ^operator O2014 = 0.9999921813761182)
  15407. Retracting rl*prefer*rvt*predict-yes*H0*5
  15408. -->
  15409. (S1 ^operator O2013 = 0.1215994207949702)
  15410. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  15411. -->
  15412. (S1 ^operator O2013 = 0.8784140715701729)
  15413. --- END Proposal Phase ---
  15414. --- Decision Phase ---
  15415. RL update rl*prefer*rvt*predict-yes*H0*3 0.472324 -0.0815458 0.390778 -> 0.472332 -0.0815445 0.390787(R,m,v=1,0.944099,0.0531056)
  15416. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527585 0.0815301 0.609115 -> 0.527593 0.0815315 0.609125(R,m,v=1,1,0)
  15417. =>WM: (14151: S1 ^operator O2015)
  15418. 1008: O: O2015 (predict-yes)
  15419. --- END Decision Phase ---
  15420. --- Application Phase ---
  15421. --- Firing Productions (PE) For State At Depth 1 ---
  15422. --- Inner Elaboration Phase, active level 1 (S1) ---
  15423. Firing apply*operator
  15424. -->
  15425. (I3 ^predict-yes N1008 + :O )
  15426. Firing apply*operator*complete
  15427. -->
  15428. (I3 ^predict-yes N1007 - :O )
  15429. inner elaboration loop at bottom goal.
  15430. --- Change Working Memory (PE) ---
  15431. =>WM: (14152: I3 ^predict-yes N1008)
  15432. <=WM: (14138: N1007 ^status complete)
  15433. <=WM: (14137: I3 ^predict-yes N1007)
  15434. --- Firing Productions (IE) For State At Depth 1 ---
  15435. --- Inner Elaboration Phase, active level 1 (S1) ---
  15436. Firing monitor*world
  15437. -->
  15438. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15439. --- Change Working Memory (IE) ---
  15440. --- END Application Phase ---
  15441. --- Output Phase ---
  15442. ENV: Agent did: predict-yes for direction R in state State-A
  15443. In State-A moving R
  15444. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15445. predict error 0
  15446. dir: dir isL
  15447. --- END Output Phase ---
  15448. --- Input Phase ---
  15449. =>WM: (14156: I2 ^dir L)
  15450. =>WM: (14155: I2 ^reward 1)
  15451. =>WM: (14154: I2 ^see 1)
  15452. =>WM: (14153: N1008 ^status complete)
  15453. <=WM: (14141: I2 ^dir R)
  15454. <=WM: (14140: I2 ^reward 1)
  15455. <=WM: (14139: I2 ^see 1)
  15456. =>WM: (14157: I2 ^level-1 R1-root)
  15457. <=WM: (14142: I2 ^level-1 L1-root)
  15458. --- END Input Phase ---
  15459. --- Proposal Phase ---
  15460. --- Inner Elaboration Phase, active level 1 (S1) ---
  15461. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  15462. -->
  15463. (S1 ^operator O2016 = -0.168718511744511)
  15464. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  15465. -->
  15466. (S1 ^operator O2015 = 0.6093091841289463)
  15467. Firing prefer*rvt*predict-no*H0*4*H1
  15468. -->
  15469. Firing prefer*rvt*predict-yes*H0*3*H1
  15470. -->
  15471. Firing elaborate*copy-see-to-output-link
  15472. -->
  15473. (I3 ^see 1 +)
  15474. Firing elaborate*reward*based*on*reward
  15475. -->
  15476. (R1012 ^value 1 +)
  15477. (R1 ^reward R1012 +)
  15478. Firing propose*predict-yes
  15479. -->
  15480. (O2017 ^name predict-yes +)
  15481. (S1 ^operator O2017 +)
  15482. Firing propose*predict-no
  15483. -->
  15484. (O2018 ^name predict-no +)
  15485. (S1 ^operator O2018 +)
  15486. Firing rl*prefer*rvt*predict-no*H0*4
  15487. -->
  15488. (S1 ^operator O2016 = 0.3145079413521559)
  15489. Firing rl*prefer*rvt*predict-yes*H0*3
  15490. -->
  15491. (S1 ^operator O2015 = 0.3907869885089824)
  15492. Firing prefer*rvt*predict-yes*H0
  15493. -->
  15494. Firing prefer*rvt*predict-no*H0
  15495. -->
  15496. Firing elaborate*copy-dir-to-output-link
  15497. -->
  15498. (I3 ^dir L +)
  15499. inner elaboration loop at bottom goal.
  15500. Retracting elaborate*copy-see-to-output-link
  15501. -->
  15502. (I3 ^see 1 +)
  15503. Retracting propose*predict-no
  15504. -->
  15505. (O2016 ^name predict-no +)
  15506. (S1 ^operator O2016 +)
  15507. Retracting propose*predict-yes
  15508. -->
  15509. (O2015 ^name predict-yes +)
  15510. (S1 ^operator O2015 +)
  15511. Retracting elaborate*reward*based*on*reward
  15512. -->
  15513. (R1011 ^value 1 +)
  15514. (R1 ^reward R1011 +)
  15515. Retracting elaborate*copy-dir-to-output-link
  15516. -->
  15517. (I3 ^dir R +)
  15518. Retracting rl*prefer*rvt*predict-no*H0*6
  15519. -->
  15520. (S1 ^operator O2016 = 0.9999921813761182)
  15521. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  15522. -->
  15523. (S1 ^operator O2015 = 0.8784140715701729)
  15524. Retracting rl*prefer*rvt*predict-yes*H0*5
  15525. -->
  15526. (S1 ^operator O2015 = 0.1215994207949702)
  15527. =>WM: (14164: S1 ^operator O2018 +)
  15528. =>WM: (14163: S1 ^operator O2017 +)
  15529. =>WM: (14162: I3 ^dir L)
  15530. =>WM: (14161: O2018 ^name predict-no)
  15531. =>WM: (14160: O2017 ^name predict-yes)
  15532. =>WM: (14159: R1012 ^value 1)
  15533. =>WM: (14158: R1 ^reward R1012)
  15534. <=WM: (14149: S1 ^operator O2015 +)
  15535. <=WM: (14151: S1 ^operator O2015)
  15536. <=WM: (14150: S1 ^operator O2016 +)
  15537. <=WM: (14148: I3 ^dir R)
  15538. <=WM: (14144: R1 ^reward R1011)
  15539. <=WM: (14147: O2016 ^name predict-no)
  15540. <=WM: (14146: O2015 ^name predict-yes)
  15541. <=WM: (14145: R1011 ^value 1)
  15542. --- Inner Elaboration Phase, active level 1 (S1) ---
  15543. Firing prefer*rvt*predict-yes*H0
  15544. -->
  15545. Firing rl*prefer*rvt*predict-yes*H0*3
  15546. -->
  15547. (S1 ^operator O2017 = 0.3907869885089824)
  15548. Firing prefer*rvt*predict-yes*H0*3*H1
  15549. -->
  15550. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  15551. -->
  15552. (S1 ^operator O2017 = 0.6093091841289463)
  15553. Firing prefer*rvt*predict-no*H0
  15554. -->
  15555. Firing rl*prefer*rvt*predict-no*H0*4
  15556. -->
  15557. (S1 ^operator O2018 = 0.3145079413521559)
  15558. Firing prefer*rvt*predict-no*H0*4*H1
  15559. -->
  15560. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  15561. -->
  15562. (S1 ^operator O2018 = -0.168718511744511)
  15563. inner elaboration loop at bottom goal.
  15564. Retracting rl*prefer*rvt*predict-no*H0*4
  15565. -->
  15566. (S1 ^operator O2016 = 0.3145079413521559)
  15567. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  15568. -->
  15569. (S1 ^operator O2016 = -0.168718511744511)
  15570. Retracting rl*prefer*rvt*predict-yes*H0*3
  15571. -->
  15572. (S1 ^operator O2015 = 0.3907869885089824)
  15573. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  15574. -->
  15575. (S1 ^operator O2015 = 0.6093091841289463)
  15576. --- END Proposal Phase ---
  15577. --- Decision Phase ---
  15578. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.865169,0.117311)
  15579. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465486 0.412928 0.878414 -> 0.465485 0.412928 0.878413(R,m,v=1,1,0)
  15580. =>WM: (14165: S1 ^operator O2017)
  15581. 1009: O: O2017 (predict-yes)
  15582. --- END Decision Phase ---
  15583. --- Application Phase ---
  15584. --- Firing Productions (PE) For State At Depth 1 ---
  15585. --- Inner Elaboration Phase, active level 1 (S1) ---
  15586. Firing apply*operator
  15587. -->
  15588. (I3 ^predict-yes N1009 + :O )
  15589. Firing apply*operator*complete
  15590. -->
  15591. (I3 ^predict-yes N1008 - :O )
  15592. inner elaboration loop at bottom goal.
  15593. --- Change Working Memory (PE) ---
  15594. =>WM: (14166: I3 ^predict-yes N1009)
  15595. <=WM: (14153: N1008 ^status complete)
  15596. <=WM: (14152: I3 ^predict-yes N1008)
  15597. --- Firing Productions (IE) For State At Depth 1 ---
  15598. --- Inner Elaboration Phase, active level 1 (S1) ---
  15599. Firing monitor*world
  15600. -->
  15601. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15602. --- Change Working Memory (IE) ---
  15603. --- END Application Phase ---
  15604. --- Output Phase ---
  15605. ENV: Agent did: predict-yes for direction L in state State-B
  15606. In State-B moving L
  15607. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  15608. predict error 0
  15609. dir: dir isL
  15610. --- END Output Phase ---
  15611. --- Input Phase ---
  15612. =>WM: (14170: I2 ^dir L)
  15613. =>WM: (14169: I2 ^reward 1)
  15614. =>WM: (14168: I2 ^see 1)
  15615. =>WM: (14167: N1009 ^status complete)
  15616. <=WM: (14156: I2 ^dir L)
  15617. <=WM: (14155: I2 ^reward 1)
  15618. <=WM: (14154: I2 ^see 1)
  15619. =>WM: (14171: I2 ^level-1 L1-root)
  15620. <=WM: (14157: I2 ^level-1 R1-root)
  15621. --- END Input Phase ---
  15622. --- Proposal Phase ---
  15623. --- Inner Elaboration Phase, active level 1 (S1) ---
  15624. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  15625. -->
  15626. (S1 ^operator O2017 = -0.2062723012911647)
  15627. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  15628. -->
  15629. (S1 ^operator O2018 = 0.685530273786795)
  15630. Firing prefer*rvt*predict-no*H0*4*H1
  15631. -->
  15632. Firing prefer*rvt*predict-yes*H0*3*H1
  15633. -->
  15634. Firing elaborate*copy-see-to-output-link
  15635. -->
  15636. (I3 ^see 1 +)
  15637. Firing elaborate*reward*based*on*reward
  15638. -->
  15639. (R1013 ^value 1 +)
  15640. (R1 ^reward R1013 +)
  15641. Firing propose*predict-yes
  15642. -->
  15643. (O2019 ^name predict-yes +)
  15644. (S1 ^operator O2019 +)
  15645. Firing propose*predict-no
  15646. -->
  15647. (O2020 ^name predict-no +)
  15648. (S1 ^operator O2020 +)
  15649. Firing rl*prefer*rvt*predict-no*H0*4
  15650. -->
  15651. (S1 ^operator O2018 = 0.3145079413521559)
  15652. Firing rl*prefer*rvt*predict-yes*H0*3
  15653. -->
  15654. (S1 ^operator O2017 = 0.3907869885089824)
  15655. Firing prefer*rvt*predict-yes*H0
  15656. -->
  15657. Firing prefer*rvt*predict-no*H0
  15658. -->
  15659. Firing elaborate*copy-dir-to-output-link
  15660. -->
  15661. (I3 ^dir L +)
  15662. inner elaboration loop at bottom goal.
  15663. Retracting elaborate*copy-see-to-output-link
  15664. -->
  15665. (I3 ^see 1 +)
  15666. Retracting propose*predict-no
  15667. -->
  15668. (O2018 ^name predict-no +)
  15669. (S1 ^operator O2018 +)
  15670. Retracting propose*predict-yes
  15671. -->
  15672. (O2017 ^name predict-yes +)
  15673. (S1 ^operator O2017 +)
  15674. Retracting elaborate*reward*based*on*reward
  15675. -->
  15676. (R1012 ^value 1 +)
  15677. (R1 ^reward R1012 +)
  15678. Retracting elaborate*copy-dir-to-output-link
  15679. -->
  15680. (I3 ^dir L +)
  15681. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  15682. -->
  15683. (S1 ^operator O2018 = -0.168718511744511)
  15684. Retracting rl*prefer*rvt*predict-no*H0*4
  15685. -->
  15686. (S1 ^operator O2018 = 0.3145079413521559)
  15687. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  15688. -->
  15689. (S1 ^operator O2017 = 0.6093091841289463)
  15690. Retracting rl*prefer*rvt*predict-yes*H0*3
  15691. -->
  15692. (S1 ^operator O2017 = 0.3907869885089824)
  15693. =>WM: (14177: S1 ^operator O2020 +)
  15694. =>WM: (14176: S1 ^operator O2019 +)
  15695. =>WM: (14175: O2020 ^name predict-no)
  15696. =>WM: (14174: O2019 ^name predict-yes)
  15697. =>WM: (14173: R1013 ^value 1)
  15698. =>WM: (14172: R1 ^reward R1013)
  15699. <=WM: (14163: S1 ^operator O2017 +)
  15700. <=WM: (14165: S1 ^operator O2017)
  15701. <=WM: (14164: S1 ^operator O2018 +)
  15702. <=WM: (14158: R1 ^reward R1012)
  15703. <=WM: (14161: O2018 ^name predict-no)
  15704. <=WM: (14160: O2017 ^name predict-yes)
  15705. <=WM: (14159: R1012 ^value 1)
  15706. --- Inner Elaboration Phase, active level 1 (S1) ---
  15707. Firing prefer*rvt*predict-yes*H0
  15708. -->
  15709. Firing rl*prefer*rvt*predict-yes*H0*3
  15710. -->
  15711. (S1 ^operator O2019 = 0.3907869885089824)
  15712. Firing prefer*rvt*predict-yes*H0*3*H1
  15713. -->
  15714. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  15715. -->
  15716. (S1 ^operator O2019 = -0.2062723012911647)
  15717. Firing prefer*rvt*predict-no*H0
  15718. -->
  15719. Firing rl*prefer*rvt*predict-no*H0*4
  15720. -->
  15721. (S1 ^operator O2020 = 0.3145079413521559)
  15722. Firing prefer*rvt*predict-no*H0*4*H1
  15723. -->
  15724. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  15725. -->
  15726. (S1 ^operator O2020 = 0.685530273786795)
  15727. inner elaboration loop at bottom goal.
  15728. Retracting rl*prefer*rvt*predict-no*H0*4
  15729. -->
  15730. (S1 ^operator O2018 = 0.3145079413521559)
  15731. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  15732. -->
  15733. (S1 ^operator O2018 = 0.685530273786795)
  15734. Retracting rl*prefer*rvt*predict-yes*H0*3
  15735. -->
  15736. (S1 ^operator O2017 = 0.3907869885089824)
  15737. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  15738. -->
  15739. (S1 ^operator O2017 = -0.2062723012911647)
  15740. --- END Proposal Phase ---
  15741. --- Decision Phase ---
  15742. RL update rl*prefer*rvt*predict-yes*H0*3 0.472332 -0.0815445 0.390787 -> 0.472325 -0.0815457 0.390779(R,m,v=1,0.944444,0.052795)
  15743. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52775 0.0815588 0.609309 -> 0.527743 0.0815574 0.6093(R,m,v=1,1,0)
  15744. =>WM: (14178: S1 ^operator O2020)
  15745. 1010: O: O2020 (predict-no)
  15746. --- END Decision Phase ---
  15747. --- Application Phase ---
  15748. --- Firing Productions (PE) For State At Depth 1 ---
  15749. --- Inner Elaboration Phase, active level 1 (S1) ---
  15750. Firing apply*operator
  15751. -->
  15752. (I3 ^predict-no N1010 + :O )
  15753. Firing apply*operator*complete
  15754. -->
  15755. (I3 ^predict-yes N1009 - :O )
  15756. inner elaboration loop at bottom goal.
  15757. --- Change Working Memory (PE) ---
  15758. =>WM: (14179: I3 ^predict-no N1010)
  15759. <=WM: (14167: N1009 ^status complete)
  15760. <=WM: (14166: I3 ^predict-yes N1009)
  15761. --- Firing Productions (IE) For State At Depth 1 ---
  15762. --- Inner Elaboration Phase, active level 1 (S1) ---
  15763. Firing monitor*world
  15764. -->
  15765. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  15766. --- Change Working Memory (IE) ---
  15767. --- END Application Phase ---
  15768. --- Output Phase ---
  15769. ENV: Agent did: predict-no for direction L in state State-A
  15770. In State-A moving L
  15771. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  15772. predict error 0
  15773. dir: dir isR
  15774. --- END Output Phase ---
  15775. --- Input Phase ---
  15776. =>WM: (14183: I2 ^dir R)
  15777. =>WM: (14182: I2 ^reward 1)
  15778. =>WM: (14181: I2 ^see 0)
  15779. =>WM: (14180: N1010 ^status complete)
  15780. <=WM: (14170: I2 ^dir L)
  15781. <=WM: (14169: I2 ^reward 1)
  15782. <=WM: (14168: I2 ^see 1)
  15783. =>WM: (14184: I2 ^level-1 L0-root)
  15784. <=WM: (14171: I2 ^level-1 L1-root)
  15785. --- END Input Phase ---
  15786. --- Proposal Phase ---
  15787. --- Inner Elaboration Phase, active level 1 (S1) ---
  15788. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15789. -->
  15790. (S1 ^operator O2019 = 0.8783957298051434)
  15791. Firing prefer*rvt*predict-yes*H0*5*H1
  15792. -->
  15793. Firing elaborate*copy-see-to-output-link
  15794. -->
  15795. (I3 ^see 0 +)
  15796. Firing elaborate*reward*based*on*reward
  15797. -->
  15798. (R1014 ^value 1 +)
  15799. (R1 ^reward R1014 +)
  15800. Firing propose*predict-yes
  15801. -->
  15802. (O2021 ^name predict-yes +)
  15803. (S1 ^operator O2021 +)
  15804. Firing propose*predict-no
  15805. -->
  15806. (O2022 ^name predict-no +)
  15807. (S1 ^operator O2022 +)
  15808. Firing rl*prefer*rvt*predict-no*H0*6
  15809. -->
  15810. (S1 ^operator O2020 = 0.9999921813761182)
  15811. Firing rl*prefer*rvt*predict-yes*H0*5
  15812. -->
  15813. (S1 ^operator O2019 = 0.121598329494617)
  15814. Firing prefer*rvt*predict-yes*H0
  15815. -->
  15816. Firing prefer*rvt*predict-no*H0
  15817. -->
  15818. Firing elaborate*copy-dir-to-output-link
  15819. -->
  15820. (I3 ^dir R +)
  15821. inner elaboration loop at bottom goal.
  15822. Retracting elaborate*copy-see-to-output-link
  15823. -->
  15824. (I3 ^see 1 +)
  15825. Retracting propose*predict-no
  15826. -->
  15827. (O2020 ^name predict-no +)
  15828. (S1 ^operator O2020 +)
  15829. Retracting propose*predict-yes
  15830. -->
  15831. (O2019 ^name predict-yes +)
  15832. (S1 ^operator O2019 +)
  15833. Retracting elaborate*reward*based*on*reward
  15834. -->
  15835. (R1013 ^value 1 +)
  15836. (R1 ^reward R1013 +)
  15837. Retracting elaborate*copy-dir-to-output-link
  15838. -->
  15839. (I3 ^dir L +)
  15840. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  15841. -->
  15842. (S1 ^operator O2020 = 0.685530273786795)
  15843. Retracting rl*prefer*rvt*predict-no*H0*4
  15844. -->
  15845. (S1 ^operator O2020 = 0.3145079413521559)
  15846. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  15847. -->
  15848. (S1 ^operator O2019 = -0.2062723012911647)
  15849. Retracting rl*prefer*rvt*predict-yes*H0*3
  15850. -->
  15851. (S1 ^operator O2019 = 0.3907790894440122)
  15852. =>WM: (14192: S1 ^operator O2022 +)
  15853. =>WM: (14191: S1 ^operator O2021 +)
  15854. =>WM: (14190: I3 ^dir R)
  15855. =>WM: (14189: O2022 ^name predict-no)
  15856. =>WM: (14188: O2021 ^name predict-yes)
  15857. =>WM: (14187: R1014 ^value 1)
  15858. =>WM: (14186: R1 ^reward R1014)
  15859. =>WM: (14185: I3 ^see 0)
  15860. <=WM: (14176: S1 ^operator O2019 +)
  15861. <=WM: (14177: S1 ^operator O2020 +)
  15862. <=WM: (14178: S1 ^operator O2020)
  15863. <=WM: (14162: I3 ^dir L)
  15864. <=WM: (14172: R1 ^reward R1013)
  15865. <=WM: (14143: I3 ^see 1)
  15866. <=WM: (14175: O2020 ^name predict-no)
  15867. <=WM: (14174: O2019 ^name predict-yes)
  15868. <=WM: (14173: R1013 ^value 1)
  15869. --- Inner Elaboration Phase, active level 1 (S1) ---
  15870. Firing prefer*rvt*predict-yes*H0
  15871. -->
  15872. Firing rl*prefer*rvt*predict-yes*H0*5
  15873. -->
  15874. (S1 ^operator O2021 = 0.121598329494617)
  15875. Firing prefer*rvt*predict-yes*H0*5*H1
  15876. -->
  15877. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  15878. -->
  15879. (S1 ^operator O2021 = 0.8783957298051434)
  15880. Firing prefer*rvt*predict-no*H0
  15881. -->
  15882. Firing rl*prefer*rvt*predict-no*H0*6
  15883. -->
  15884. (S1 ^operator O2022 = 0.9999921813761182)
  15885. inner elaboration loop at bottom goal.
  15886. Retracting rl*prefer*rvt*predict-no*H0*6
  15887. -->
  15888. (S1 ^operator O2020 = 0.9999921813761182)
  15889. Retracting rl*prefer*rvt*predict-yes*H0*5
  15890. -->
  15891. (S1 ^operator O2019 = 0.121598329494617)
  15892. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  15893. -->
  15894. (S1 ^operator O2019 = 0.8783957298051434)
  15895. --- END Proposal Phase ---
  15896. --- Decision Phase ---
  15897. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478553 -0.164048 0.314505(R,m,v=1,0.924051,0.0706281)
  15898. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521479 0.164051 0.68553 -> 0.521476 0.164051 0.685527(R,m,v=1,1,0)
  15899. =>WM: (14193: S1 ^operator O2021)
  15900. 1011: O: O2021 (predict-yes)
  15901. --- END Decision Phase ---
  15902. --- Application Phase ---
  15903. --- Firing Productions (PE) For State At Depth 1 ---
  15904. --- Inner Elaboration Phase, active level 1 (S1) ---
  15905. Firing apply*operator
  15906. -->
  15907. (I3 ^predict-yes N1011 + :O )
  15908. Firing apply*operator*complete
  15909. -->
  15910. (I3 ^predict-no N1010 - :O )
  15911. inner elaboration loop at bottom goal.
  15912. --- Change Working Memory (PE) ---
  15913. =>WM: (14194: I3 ^predict-yes N1011)
  15914. <=WM: (14180: N1010 ^status complete)
  15915. <=WM: (14179: I3 ^predict-no N1010)
  15916. --- Firing Productions (IE) For State At Depth 1 ---
  15917. --- Inner Elaboration Phase, active level 1 (S1) ---
  15918. Firing monitor*world
  15919. -->
  15920. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  15921. --- Change Working Memory (IE) ---
  15922. --- END Application Phase ---
  15923. --- Output Phase ---
  15924. ENV: Agent did: predict-yes for direction R in state State-A
  15925. In State-A moving R
  15926. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  15927. predict error 0
  15928. dir: dir isU
  15929. --- END Output Phase ---
  15930. --- Input Phase ---
  15931. =>WM: (14198: I2 ^dir U)
  15932. =>WM: (14197: I2 ^reward 1)
  15933. =>WM: (14196: I2 ^see 1)
  15934. =>WM: (14195: N1011 ^status complete)
  15935. <=WM: (14183: I2 ^dir R)
  15936. <=WM: (14182: I2 ^reward 1)
  15937. <=WM: (14181: I2 ^see 0)
  15938. =>WM: (14199: I2 ^level-1 R1-root)
  15939. <=WM: (14184: I2 ^level-1 L0-root)
  15940. --- END Input Phase ---
  15941. --- Proposal Phase ---
  15942. --- Inner Elaboration Phase, active level 1 (S1) ---
  15943. Firing elaborate*copy-see-to-output-link
  15944. -->
  15945. (I3 ^see 1 +)
  15946. Firing elaborate*reward*based*on*reward
  15947. -->
  15948. (R1015 ^value 1 +)
  15949. (R1 ^reward R1015 +)
  15950. Firing propose*predict-yes
  15951. -->
  15952. (O2023 ^name predict-yes +)
  15953. (S1 ^operator O2023 +)
  15954. Firing propose*predict-no
  15955. -->
  15956. (O2024 ^name predict-no +)
  15957. (S1 ^operator O2024 +)
  15958. Firing rl*prefer*rvt*predict-no*H0*2
  15959. -->
  15960. (S1 ^operator O2022 = 1.)
  15961. Firing rl*prefer*rvt*predict-yes*H0*1
  15962. -->
  15963. (S1 ^operator O2021 = 0.)
  15964. Firing prefer*rvt*predict-yes*H0
  15965. -->
  15966. Firing prefer*rvt*predict-no*H0
  15967. -->
  15968. Firing elaborate*copy-dir-to-output-link
  15969. -->
  15970. (I3 ^dir U +)
  15971. inner elaboration loop at bottom goal.
  15972. Retracting elaborate*copy-see-to-output-link
  15973. -->
  15974. (I3 ^see 0 +)
  15975. Retracting propose*predict-no
  15976. -->
  15977. (O2022 ^name predict-no +)
  15978. (S1 ^operator O2022 +)
  15979. Retracting propose*predict-yes
  15980. -->
  15981. (O2021 ^name predict-yes +)
  15982. (S1 ^operator O2021 +)
  15983. Retracting elaborate*reward*based*on*reward
  15984. -->
  15985. (R1014 ^value 1 +)
  15986. (R1 ^reward R1014 +)
  15987. Retracting elaborate*copy-dir-to-output-link
  15988. -->
  15989. (I3 ^dir R +)
  15990. Retracting rl*prefer*rvt*predict-no*H0*6
  15991. -->
  15992. (S1 ^operator O2022 = 0.9999921813761182)
  15993. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  15994. -->
  15995. (S1 ^operator O2021 = 0.8783957298051434)
  15996. Retracting rl*prefer*rvt*predict-yes*H0*5
  15997. -->
  15998. (S1 ^operator O2021 = 0.121598329494617)
  15999. =>WM: (14207: S1 ^operator O2024 +)
  16000. =>WM: (14206: S1 ^operator O2023 +)
  16001. =>WM: (14205: I3 ^dir U)
  16002. =>WM: (14204: O2024 ^name predict-no)
  16003. =>WM: (14203: O2023 ^name predict-yes)
  16004. =>WM: (14202: R1015 ^value 1)
  16005. =>WM: (14201: R1 ^reward R1015)
  16006. =>WM: (14200: I3 ^see 1)
  16007. <=WM: (14191: S1 ^operator O2021 +)
  16008. <=WM: (14193: S1 ^operator O2021)
  16009. <=WM: (14192: S1 ^operator O2022 +)
  16010. <=WM: (14190: I3 ^dir R)
  16011. <=WM: (14186: R1 ^reward R1014)
  16012. <=WM: (14185: I3 ^see 0)
  16013. <=WM: (14189: O2022 ^name predict-no)
  16014. <=WM: (14188: O2021 ^name predict-yes)
  16015. <=WM: (14187: R1014 ^value 1)
  16016. --- Inner Elaboration Phase, active level 1 (S1) ---
  16017. Firing prefer*rvt*predict-yes*H0
  16018. -->
  16019. Firing rl*prefer*rvt*predict-yes*H0*1
  16020. -->
  16021. (S1 ^operator O2023 = 0.)
  16022. Firing prefer*rvt*predict-no*H0
  16023. -->
  16024. Firing rl*prefer*rvt*predict-no*H0*2
  16025. -->
  16026. (S1 ^operator O2024 = 1.)
  16027. inner elaboration loop at bottom goal.
  16028. Retracting rl*prefer*rvt*predict-no*H0*2
  16029. -->
  16030. (S1 ^operator O2022 = 1.)
  16031. Retracting rl*prefer*rvt*predict-yes*H0*1
  16032. -->
  16033. (S1 ^operator O2021 = 0.)
  16034. --- END Proposal Phase ---
  16035. --- Decision Phase ---
  16036. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534525 -0.412926 0.121599(R,m,v=1,0.865922,0.116753)
  16037. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465471 0.412925 0.878396 -> 0.465471 0.412925 0.878396(R,m,v=1,1,0)
  16038. =>WM: (14208: S1 ^operator O2024)
  16039. 1012: O: O2024 (predict-no)
  16040. --- END Decision Phase ---
  16041. --- Application Phase ---
  16042. --- Firing Productions (PE) For State At Depth 1 ---
  16043. --- Inner Elaboration Phase, active level 1 (S1) ---
  16044. Firing apply*operator
  16045. -->
  16046. (I3 ^predict-no N1012 + :O )
  16047. Firing apply*operator*complete
  16048. -->
  16049. (I3 ^predict-yes N1011 - :O )
  16050. inner elaboration loop at bottom goal.
  16051. --- Change Working Memory (PE) ---
  16052. =>WM: (14209: I3 ^predict-no N1012)
  16053. <=WM: (14195: N1011 ^status complete)
  16054. <=WM: (14194: I3 ^predict-yes N1011)
  16055. --- Firing Productions (IE) For State At Depth 1 ---
  16056. --- Inner Elaboration Phase, active level 1 (S1) ---
  16057. Firing monitor*world
  16058. -->
  16059. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16060. --- Change Working Memory (IE) ---
  16061. --- END Application Phase ---
  16062. --- Output Phase ---
  16063. ENV: Agent did: predict-no for direction U in state State-B
  16064. In State-B moving U
  16065. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16066. predict error 0
  16067. dir: dir isU
  16068. --- END Output Phase ---
  16069. --- Input Phase ---
  16070. =>WM: (14213: I2 ^dir U)
  16071. =>WM: (14212: I2 ^reward 1)
  16072. =>WM: (14211: I2 ^see 0)
  16073. =>WM: (14210: N1012 ^status complete)
  16074. <=WM: (14198: I2 ^dir U)
  16075. <=WM: (14197: I2 ^reward 1)
  16076. <=WM: (14196: I2 ^see 1)
  16077. =>WM: (14214: I2 ^level-1 R1-root)
  16078. <=WM: (14199: I2 ^level-1 R1-root)
  16079. --- END Input Phase ---
  16080. --- Proposal Phase ---
  16081. --- Inner Elaboration Phase, active level 1 (S1) ---
  16082. Firing elaborate*copy-see-to-output-link
  16083. -->
  16084. (I3 ^see 0 +)
  16085. Firing elaborate*reward*based*on*reward
  16086. -->
  16087. (R1016 ^value 1 +)
  16088. (R1 ^reward R1016 +)
  16089. Firing propose*predict-yes
  16090. -->
  16091. (O2025 ^name predict-yes +)
  16092. (S1 ^operator O2025 +)
  16093. Firing propose*predict-no
  16094. -->
  16095. (O2026 ^name predict-no +)
  16096. (S1 ^operator O2026 +)
  16097. Firing rl*prefer*rvt*predict-no*H0*2
  16098. -->
  16099. (S1 ^operator O2024 = 1.)
  16100. Firing rl*prefer*rvt*predict-yes*H0*1
  16101. -->
  16102. (S1 ^operator O2023 = 0.)
  16103. Firing prefer*rvt*predict-yes*H0
  16104. -->
  16105. Firing prefer*rvt*predict-no*H0
  16106. -->
  16107. Firing elaborate*copy-dir-to-output-link
  16108. -->
  16109. (I3 ^dir U +)
  16110. inner elaboration loop at bottom goal.
  16111. Retracting elaborate*copy-see-to-output-link
  16112. -->
  16113. (I3 ^see 1 +)
  16114. Retracting propose*predict-no
  16115. -->
  16116. (O2024 ^name predict-no +)
  16117. (S1 ^operator O2024 +)
  16118. Retracting propose*predict-yes
  16119. -->
  16120. (O2023 ^name predict-yes +)
  16121. (S1 ^operator O2023 +)
  16122. Retracting elaborate*reward*based*on*reward
  16123. -->
  16124. (R1015 ^value 1 +)
  16125. (R1 ^reward R1015 +)
  16126. Retracting elaborate*copy-dir-to-output-link
  16127. -->
  16128. (I3 ^dir U +)
  16129. Retracting rl*prefer*rvt*predict-no*H0*2
  16130. -->
  16131. (S1 ^operator O2024 = 1.)
  16132. Retracting rl*prefer*rvt*predict-yes*H0*1
  16133. -->
  16134. (S1 ^operator O2023 = 0.)
  16135. =>WM: (14221: S1 ^operator O2026 +)
  16136. =>WM: (14220: S1 ^operator O2025 +)
  16137. =>WM: (14219: O2026 ^name predict-no)
  16138. =>WM: (14218: O2025 ^name predict-yes)
  16139. =>WM: (14217: R1016 ^value 1)
  16140. =>WM: (14216: R1 ^reward R1016)
  16141. =>WM: (14215: I3 ^see 0)
  16142. <=WM: (14206: S1 ^operator O2023 +)
  16143. <=WM: (14207: S1 ^operator O2024 +)
  16144. <=WM: (14208: S1 ^operator O2024)
  16145. <=WM: (14201: R1 ^reward R1015)
  16146. <=WM: (14200: I3 ^see 1)
  16147. <=WM: (14204: O2024 ^name predict-no)
  16148. <=WM: (14203: O2023 ^name predict-yes)
  16149. <=WM: (14202: R1015 ^value 1)
  16150. --- Inner Elaboration Phase, active level 1 (S1) ---
  16151. Firing prefer*rvt*predict-yes*H0
  16152. -->
  16153. Firing rl*prefer*rvt*predict-yes*H0*1
  16154. -->
  16155. (S1 ^operator O2025 = 0.)
  16156. Firing prefer*rvt*predict-no*H0
  16157. -->
  16158. Firing rl*prefer*rvt*predict-no*H0*2
  16159. -->
  16160. (S1 ^operator O2026 = 1.)
  16161. inner elaboration loop at bottom goal.
  16162. Retracting rl*prefer*rvt*predict-no*H0*2
  16163. -->
  16164. (S1 ^operator O2024 = 1.)
  16165. Retracting rl*prefer*rvt*predict-yes*H0*1
  16166. -->
  16167. (S1 ^operator O2023 = 0.)
  16168. --- END Proposal Phase ---
  16169. --- Decision Phase ---
  16170. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16171. =>WM: (14222: S1 ^operator O2026)
  16172. 1013: O: O2026 (predict-no)
  16173. --- END Decision Phase ---
  16174. --- Application Phase ---
  16175. --- Firing Productions (PE) For State At Depth 1 ---
  16176. --- Inner Elaboration Phase, active level 1 (S1) ---
  16177. Firing apply*operator
  16178. -->
  16179. (I3 ^predict-no N1013 + :O )
  16180. Firing apply*operator*complete
  16181. -->
  16182. (I3 ^predict-no N1012 - :O )
  16183. inner elaboration loop at bottom goal.
  16184. --- Change Working Memory (PE) ---
  16185. =>WM: (14223: I3 ^predict-no N1013)
  16186. <=WM: (14210: N1012 ^status complete)
  16187. <=WM: (14209: I3 ^predict-no N1012)
  16188. --- Firing Productions (IE) For State At Depth 1 ---
  16189. --- Inner Elaboration Phase, active level 1 (S1) ---
  16190. Firing monitor*world
  16191. -->
  16192. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16193. --- Change Working Memory (IE) ---
  16194. --- END Application Phase ---
  16195. --- Output Phase ---
  16196. ENV: Agent did: predict-no for direction U in state State-B
  16197. In State-B moving U
  16198. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16199. predict error 0
  16200. dir: dir isU
  16201. --- END Output Phase ---
  16202. --- Input Phase ---
  16203. =>WM: (14227: I2 ^dir U)
  16204. =>WM: (14226: I2 ^reward 1)
  16205. =>WM: (14225: I2 ^see 0)
  16206. =>WM: (14224: N1013 ^status complete)
  16207. <=WM: (14213: I2 ^dir U)
  16208. <=WM: (14212: I2 ^reward 1)
  16209. <=WM: (14211: I2 ^see 0)
  16210. =>WM: (14228: I2 ^level-1 R1-root)
  16211. <=WM: (14214: I2 ^level-1 R1-root)
  16212. --- END Input Phase ---
  16213. --- Proposal Phase ---
  16214. --- Inner Elaboration Phase, active level 1 (S1) ---
  16215. Firing elaborate*copy-see-to-output-link
  16216. -->
  16217. (I3 ^see 0 +)
  16218. Firing elaborate*reward*based*on*reward
  16219. -->
  16220. (R1017 ^value 1 +)
  16221. (R1 ^reward R1017 +)
  16222. Firing propose*predict-yes
  16223. -->
  16224. (O2027 ^name predict-yes +)
  16225. (S1 ^operator O2027 +)
  16226. Firing propose*predict-no
  16227. -->
  16228. (O2028 ^name predict-no +)
  16229. (S1 ^operator O2028 +)
  16230. Firing rl*prefer*rvt*predict-no*H0*2
  16231. -->
  16232. (S1 ^operator O2026 = 1.)
  16233. Firing rl*prefer*rvt*predict-yes*H0*1
  16234. -->
  16235. (S1 ^operator O2025 = 0.)
  16236. Firing prefer*rvt*predict-yes*H0
  16237. -->
  16238. Firing prefer*rvt*predict-no*H0
  16239. -->
  16240. Firing elaborate*copy-dir-to-output-link
  16241. -->
  16242. (I3 ^dir U +)
  16243. inner elaboration loop at bottom goal.
  16244. Retracting elaborate*copy-see-to-output-link
  16245. -->
  16246. (I3 ^see 0 +)
  16247. Retracting propose*predict-no
  16248. -->
  16249. (O2026 ^name predict-no +)
  16250. (S1 ^operator O2026 +)
  16251. Retracting propose*predict-yes
  16252. -->
  16253. (O2025 ^name predict-yes +)
  16254. (S1 ^operator O2025 +)
  16255. Retracting elaborate*reward*based*on*reward
  16256. -->
  16257. (R1016 ^value 1 +)
  16258. (R1 ^reward R1016 +)
  16259. Retracting elaborate*copy-dir-to-output-link
  16260. -->
  16261. (I3 ^dir U +)
  16262. Retracting rl*prefer*rvt*predict-no*H0*2
  16263. -->
  16264. (S1 ^operator O2026 = 1.)
  16265. Retracting rl*prefer*rvt*predict-yes*H0*1
  16266. -->
  16267. (S1 ^operator O2025 = 0.)
  16268. =>WM: (14234: S1 ^operator O2028 +)
  16269. =>WM: (14233: S1 ^operator O2027 +)
  16270. =>WM: (14232: O2028 ^name predict-no)
  16271. =>WM: (14231: O2027 ^name predict-yes)
  16272. =>WM: (14230: R1017 ^value 1)
  16273. =>WM: (14229: R1 ^reward R1017)
  16274. <=WM: (14220: S1 ^operator O2025 +)
  16275. <=WM: (14221: S1 ^operator O2026 +)
  16276. <=WM: (14222: S1 ^operator O2026)
  16277. <=WM: (14216: R1 ^reward R1016)
  16278. <=WM: (14219: O2026 ^name predict-no)
  16279. <=WM: (14218: O2025 ^name predict-yes)
  16280. <=WM: (14217: R1016 ^value 1)
  16281. --- Inner Elaboration Phase, active level 1 (S1) ---
  16282. Firing prefer*rvt*predict-yes*H0
  16283. -->
  16284. Firing rl*prefer*rvt*predict-yes*H0*1
  16285. -->
  16286. (S1 ^operator O2027 = 0.)
  16287. Firing prefer*rvt*predict-no*H0
  16288. -->
  16289. Firing rl*prefer*rvt*predict-no*H0*2
  16290. -->
  16291. (S1 ^operator O2028 = 1.)
  16292. inner elaboration loop at bottom goal.
  16293. Retracting rl*prefer*rvt*predict-no*H0*2
  16294. -->
  16295. (S1 ^operator O2026 = 1.)
  16296. Retracting rl*prefer*rvt*predict-yes*H0*1
  16297. -->
  16298. (S1 ^operator O2025 = 0.)
  16299. --- END Proposal Phase ---
  16300. --- Decision Phase ---
  16301. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16302. =>WM: (14235: S1 ^operator O2028)
  16303. 1014: O: O2028 (predict-no)
  16304. --- END Decision Phase ---
  16305. --- Application Phase ---
  16306. --- Firing Productions (PE) For State At Depth 1 ---
  16307. --- Inner Elaboration Phase, active level 1 (S1) ---
  16308. Firing apply*operator
  16309. -->
  16310. (I3 ^predict-no N1014 + :O )
  16311. Firing apply*operator*complete
  16312. -->
  16313. (I3 ^predict-no N1013 - :O )
  16314. inner elaboration loop at bottom goal.
  16315. --- Change Working Memory (PE) ---
  16316. =>WM: (14236: I3 ^predict-no N1014)
  16317. <=WM: (14224: N1013 ^status complete)
  16318. <=WM: (14223: I3 ^predict-no N1013)
  16319. --- Firing Productions (IE) For State At Depth 1 ---
  16320. --- Inner Elaboration Phase, active level 1 (S1) ---
  16321. Firing monitor*world
  16322. -->
  16323. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16324. --- Change Working Memory (IE) ---
  16325. --- END Application Phase ---
  16326. --- Output Phase ---
  16327. ENV: Agent did: predict-no for direction U in state State-B
  16328. In State-B moving U
  16329. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  16330. predict error 0
  16331. dir: dir isL
  16332. --- END Output Phase ---
  16333. \---- Input Phase ---
  16334. =>WM: (14240: I2 ^dir L)
  16335. =>WM: (14239: I2 ^reward 1)
  16336. =>WM: (14238: I2 ^see 0)
  16337. =>WM: (14237: N1014 ^status complete)
  16338. <=WM: (14227: I2 ^dir U)
  16339. <=WM: (14226: I2 ^reward 1)
  16340. <=WM: (14225: I2 ^see 0)
  16341. =>WM: (14241: I2 ^level-1 R1-root)
  16342. <=WM: (14228: I2 ^level-1 R1-root)
  16343. --- END Input Phase ---
  16344. --- Proposal Phase ---
  16345. --- Inner Elaboration Phase, active level 1 (S1) ---
  16346. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  16347. -->
  16348. (S1 ^operator O2028 = -0.168718511744511)
  16349. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  16350. -->
  16351. (S1 ^operator O2027 = 0.6093000948769637)
  16352. Firing prefer*rvt*predict-no*H0*4*H1
  16353. -->
  16354. Firing prefer*rvt*predict-yes*H0*3*H1
  16355. -->
  16356. Firing elaborate*copy-see-to-output-link
  16357. -->
  16358. (I3 ^see 0 +)
  16359. Firing elaborate*reward*based*on*reward
  16360. -->
  16361. (R1018 ^value 1 +)
  16362. (R1 ^reward R1018 +)
  16363. Firing propose*predict-yes
  16364. -->
  16365. (O2029 ^name predict-yes +)
  16366. (S1 ^operator O2029 +)
  16367. Firing propose*predict-no
  16368. -->
  16369. (O2030 ^name predict-no +)
  16370. (S1 ^operator O2030 +)
  16371. Firing rl*prefer*rvt*predict-no*H0*4
  16372. -->
  16373. (S1 ^operator O2028 = 0.3145047896375236)
  16374. Firing rl*prefer*rvt*predict-yes*H0*3
  16375. -->
  16376. (S1 ^operator O2027 = 0.3907790894440122)
  16377. Firing prefer*rvt*predict-yes*H0
  16378. -->
  16379. Firing prefer*rvt*predict-no*H0
  16380. -->
  16381. Firing elaborate*copy-dir-to-output-link
  16382. -->
  16383. (I3 ^dir L +)
  16384. inner elaboration loop at bottom goal.
  16385. Retracting elaborate*copy-see-to-output-link
  16386. -->
  16387. (I3 ^see 0 +)
  16388. Retracting propose*predict-no
  16389. -->
  16390. (O2028 ^name predict-no +)
  16391. (S1 ^operator O2028 +)
  16392. Retracting propose*predict-yes
  16393. -->
  16394. (O2027 ^name predict-yes +)
  16395. (S1 ^operator O2027 +)
  16396. Retracting elaborate*reward*based*on*reward
  16397. -->
  16398. (R1017 ^value 1 +)
  16399. (R1 ^reward R1017 +)
  16400. Retracting elaborate*copy-dir-to-output-link
  16401. -->
  16402. (I3 ^dir U +)
  16403. Retracting rl*prefer*rvt*predict-no*H0*2
  16404. -->
  16405. (S1 ^operator O2028 = 1.)
  16406. Retracting rl*prefer*rvt*predict-yes*H0*1
  16407. -->
  16408. (S1 ^operator O2027 = 0.)
  16409. =>WM: (14248: S1 ^operator O2030 +)
  16410. =>WM: (14247: S1 ^operator O2029 +)
  16411. =>WM: (14246: I3 ^dir L)
  16412. =>WM: (14245: O2030 ^name predict-no)
  16413. =>WM: (14244: O2029 ^name predict-yes)
  16414. =>WM: (14243: R1018 ^value 1)
  16415. =>WM: (14242: R1 ^reward R1018)
  16416. <=WM: (14233: S1 ^operator O2027 +)
  16417. <=WM: (14234: S1 ^operator O2028 +)
  16418. <=WM: (14235: S1 ^operator O2028)
  16419. <=WM: (14205: I3 ^dir U)
  16420. <=WM: (14229: R1 ^reward R1017)
  16421. <=WM: (14232: O2028 ^name predict-no)
  16422. <=WM: (14231: O2027 ^name predict-yes)
  16423. <=WM: (14230: R1017 ^value 1)
  16424. --- Inner Elaboration Phase, active level 1 (S1) ---
  16425. Firing prefer*rvt*predict-yes*H0
  16426. -->
  16427. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  16428. -->
  16429. (S1 ^operator O2029 = 0.6093000948769637)
  16430. Firing rl*prefer*rvt*predict-yes*H0*3
  16431. -->
  16432. (S1 ^operator O2029 = 0.3907790894440122)
  16433. Firing prefer*rvt*predict-yes*H0*3*H1
  16434. -->
  16435. Firing prefer*rvt*predict-no*H0
  16436. -->
  16437. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  16438. -->
  16439. (S1 ^operator O2030 = -0.168718511744511)
  16440. Firing rl*prefer*rvt*predict-no*H0*4
  16441. -->
  16442. (S1 ^operator O2030 = 0.3145047896375236)
  16443. Firing prefer*rvt*predict-no*H0*4*H1
  16444. -->
  16445. inner elaboration loop at bottom goal.
  16446. Retracting rl*prefer*rvt*predict-no*H0*4
  16447. -->
  16448. (S1 ^operator O2028 = 0.3145047896375236)
  16449. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  16450. -->
  16451. (S1 ^operator O2028 = -0.168718511744511)
  16452. Retracting rl*prefer*rvt*predict-yes*H0*3
  16453. -->
  16454. (S1 ^operator O2027 = 0.3907790894440122)
  16455. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  16456. -->
  16457. (S1 ^operator O2027 = 0.6093000948769637)
  16458. --- END Proposal Phase ---
  16459. --- Decision Phase ---
  16460. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16461. =>WM: (14249: S1 ^operator O2029)
  16462. 1015: O: O2029 (predict-yes)
  16463. --- END Decision Phase ---
  16464. --- Application Phase ---
  16465. --- Firing Productions (PE) For State At Depth 1 ---
  16466. --- Inner Elaboration Phase, active level 1 (S1) ---
  16467. Firing apply*operator
  16468. -->
  16469. (I3 ^predict-yes N1015 + :O )
  16470. Firing apply*operator*complete
  16471. -->
  16472. (I3 ^predict-no N1014 - :O )
  16473. inner elaboration loop at bottom goal.
  16474. --- Change Working Memory (PE) ---
  16475. =>WM: (14250: I3 ^predict-yes N1015)
  16476. <=WM: (14237: N1014 ^status complete)
  16477. <=WM: (14236: I3 ^predict-no N1014)
  16478. --- Firing Productions (IE) For State At Depth 1 ---
  16479. --- Inner Elaboration Phase, active level 1 (S1) ---
  16480. Firing monitor*world
  16481. -->
  16482. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  16483. --- Change Working Memory (IE) ---
  16484. --- END Application Phase ---
  16485. --- Output Phase ---
  16486. ENV: Agent did: predict-yes for direction L in state State-B
  16487. In State-B moving L
  16488. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  16489. predict error 0
  16490. dir: dir isU
  16491. --- END Output Phase ---
  16492. /|--- Input Phase ---
  16493. =>WM: (14254: I2 ^dir U)
  16494. =>WM: (14253: I2 ^reward 1)
  16495. =>WM: (14252: I2 ^see 1)
  16496. =>WM: (14251: N1015 ^status complete)
  16497. <=WM: (14240: I2 ^dir L)
  16498. <=WM: (14239: I2 ^reward 1)
  16499. <=WM: (14238: I2 ^see 0)
  16500. =>WM: (14255: I2 ^level-1 L1-root)
  16501. <=WM: (14241: I2 ^level-1 R1-root)
  16502. --- END Input Phase ---
  16503. --- Proposal Phase ---
  16504. --- Inner Elaboration Phase, active level 1 (S1) ---
  16505. Firing elaborate*copy-see-to-output-link
  16506. -->
  16507. (I3 ^see 1 +)
  16508. Firing elaborate*reward*based*on*reward
  16509. -->
  16510. (R1019 ^value 1 +)
  16511. (R1 ^reward R1019 +)
  16512. Firing propose*predict-yes
  16513. -->
  16514. (O2031 ^name predict-yes +)
  16515. (S1 ^operator O2031 +)
  16516. Firing propose*predict-no
  16517. -->
  16518. (O2032 ^name predict-no +)
  16519. (S1 ^operator O2032 +)
  16520. Firing rl*prefer*rvt*predict-no*H0*2
  16521. -->
  16522. (S1 ^operator O2030 = 1.)
  16523. Firing rl*prefer*rvt*predict-yes*H0*1
  16524. -->
  16525. (S1 ^operator O2029 = 0.)
  16526. Firing prefer*rvt*predict-yes*H0
  16527. -->
  16528. Firing prefer*rvt*predict-no*H0
  16529. -->
  16530. Firing elaborate*copy-dir-to-output-link
  16531. -->
  16532. (I3 ^dir U +)
  16533. inner elaboration loop at bottom goal.
  16534. Retracting elaborate*copy-see-to-output-link
  16535. -->
  16536. (I3 ^see 0 +)
  16537. Retracting propose*predict-no
  16538. -->
  16539. (O2030 ^name predict-no +)
  16540. (S1 ^operator O2030 +)
  16541. Retracting propose*predict-yes
  16542. -->
  16543. (O2029 ^name predict-yes +)
  16544. (S1 ^operator O2029 +)
  16545. Retracting elaborate*reward*based*on*reward
  16546. -->
  16547. (R1018 ^value 1 +)
  16548. (R1 ^reward R1018 +)
  16549. Retracting elaborate*copy-dir-to-output-link
  16550. -->
  16551. (I3 ^dir L +)
  16552. Retracting rl*prefer*rvt*predict-no*H0*4
  16553. -->
  16554. (S1 ^operator O2030 = 0.3145047896375236)
  16555. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  16556. -->
  16557. (S1 ^operator O2030 = -0.168718511744511)
  16558. Retracting rl*prefer*rvt*predict-yes*H0*3
  16559. -->
  16560. (S1 ^operator O2029 = 0.3907790894440122)
  16561. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  16562. -->
  16563. (S1 ^operator O2029 = 0.6093000948769637)
  16564. =>WM: (14263: S1 ^operator O2032 +)
  16565. =>WM: (14262: S1 ^operator O2031 +)
  16566. =>WM: (14261: I3 ^dir U)
  16567. =>WM: (14260: O2032 ^name predict-no)
  16568. =>WM: (14259: O2031 ^name predict-yes)
  16569. =>WM: (14258: R1019 ^value 1)
  16570. =>WM: (14257: R1 ^reward R1019)
  16571. =>WM: (14256: I3 ^see 1)
  16572. <=WM: (14247: S1 ^operator O2029 +)
  16573. <=WM: (14249: S1 ^operator O2029)
  16574. <=WM: (14248: S1 ^operator O2030 +)
  16575. <=WM: (14246: I3 ^dir L)
  16576. <=WM: (14242: R1 ^reward R1018)
  16577. <=WM: (14215: I3 ^see 0)
  16578. <=WM: (14245: O2030 ^name predict-no)
  16579. <=WM: (14244: O2029 ^name predict-yes)
  16580. <=WM: (14243: R1018 ^value 1)
  16581. --- Inner Elaboration Phase, active level 1 (S1) ---
  16582. Firing prefer*rvt*predict-yes*H0
  16583. -->
  16584. Firing rl*prefer*rvt*predict-yes*H0*1
  16585. -->
  16586. (S1 ^operator O2031 = 0.)
  16587. Firing prefer*rvt*predict-no*H0
  16588. -->
  16589. Firing rl*prefer*rvt*predict-no*H0*2
  16590. -->
  16591. (S1 ^operator O2032 = 1.)
  16592. inner elaboration loop at bottom goal.
  16593. Retracting rl*prefer*rvt*predict-no*H0*2
  16594. -->
  16595. (S1 ^operator O2030 = 1.)
  16596. Retracting rl*prefer*rvt*predict-yes*H0*1
  16597. -->
  16598. (S1 ^operator O2029 = 0.)
  16599. --- END Proposal Phase ---
  16600. --- Decision Phase ---
  16601. RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815457 0.390779 -> 0.472319 -0.0815467 0.390773(R,m,v=1,0.944785,0.0524881)
  16602. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527743 0.0815574 0.6093 -> 0.527736 0.0815563 0.609293(R,m,v=1,1,0)
  16603. =>WM: (14264: S1 ^operator O2032)
  16604. 1016: O: O2032 (predict-no)
  16605. --- END Decision Phase ---
  16606. --- Application Phase ---
  16607. --- Firing Productions (PE) For State At Depth 1 ---
  16608. --- Inner Elaboration Phase, active level 1 (S1) ---
  16609. Firing apply*operator
  16610. -->
  16611. (I3 ^predict-no N1016 + :O )
  16612. Firing apply*operator*complete
  16613. -->
  16614. (I3 ^predict-yes N1015 - :O )
  16615. inner elaboration loop at bottom goal.
  16616. --- Change Working Memory (PE) ---
  16617. =>WM: (14265: I3 ^predict-no N1016)
  16618. <=WM: (14251: N1015 ^status complete)
  16619. <=WM: (14250: I3 ^predict-yes N1015)
  16620. --- Firing Productions (IE) For State At Depth 1 ---
  16621. --- Inner Elaboration Phase, active level 1 (S1) ---
  16622. Firing monitor*world
  16623. -->
  16624. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16625. --- Change Working Memory (IE) ---
  16626. --- END Application Phase ---
  16627. --- Output Phase ---
  16628. ENV: Agent did: predict-no for direction U in state State-A
  16629. In State-A moving U
  16630. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16631. predict error 0
  16632. dir: dir isU
  16633. --- END Output Phase ---
  16634. \-/--- Input Phase ---
  16635. =>WM: (14269: I2 ^dir U)
  16636. =>WM: (14268: I2 ^reward 1)
  16637. =>WM: (14267: I2 ^see 0)
  16638. =>WM: (14266: N1016 ^status complete)
  16639. <=WM: (14254: I2 ^dir U)
  16640. <=WM: (14253: I2 ^reward 1)
  16641. <=WM: (14252: I2 ^see 1)
  16642. =>WM: (14270: I2 ^level-1 L1-root)
  16643. <=WM: (14255: I2 ^level-1 L1-root)
  16644. --- END Input Phase ---
  16645. --- Proposal Phase ---
  16646. --- Inner Elaboration Phase, active level 1 (S1) ---
  16647. Firing elaborate*copy-see-to-output-link
  16648. -->
  16649. (I3 ^see 0 +)
  16650. Firing elaborate*reward*based*on*reward
  16651. -->
  16652. (R1020 ^value 1 +)
  16653. (R1 ^reward R1020 +)
  16654. Firing propose*predict-yes
  16655. -->
  16656. (O2033 ^name predict-yes +)
  16657. (S1 ^operator O2033 +)
  16658. Firing propose*predict-no
  16659. -->
  16660. (O2034 ^name predict-no +)
  16661. (S1 ^operator O2034 +)
  16662. Firing rl*prefer*rvt*predict-no*H0*2
  16663. -->
  16664. (S1 ^operator O2032 = 1.)
  16665. Firing rl*prefer*rvt*predict-yes*H0*1
  16666. -->
  16667. (S1 ^operator O2031 = 0.)
  16668. Firing prefer*rvt*predict-yes*H0
  16669. -->
  16670. Firing prefer*rvt*predict-no*H0
  16671. -->
  16672. Firing elaborate*copy-dir-to-output-link
  16673. -->
  16674. (I3 ^dir U +)
  16675. inner elaboration loop at bottom goal.
  16676. Retracting elaborate*copy-see-to-output-link
  16677. -->
  16678. (I3 ^see 1 +)
  16679. Retracting propose*predict-no
  16680. -->
  16681. (O2032 ^name predict-no +)
  16682. (S1 ^operator O2032 +)
  16683. Retracting propose*predict-yes
  16684. -->
  16685. (O2031 ^name predict-yes +)
  16686. (S1 ^operator O2031 +)
  16687. Retracting elaborate*reward*based*on*reward
  16688. -->
  16689. (R1019 ^value 1 +)
  16690. (R1 ^reward R1019 +)
  16691. Retracting elaborate*copy-dir-to-output-link
  16692. -->
  16693. (I3 ^dir U +)
  16694. Retracting rl*prefer*rvt*predict-no*H0*2
  16695. -->
  16696. (S1 ^operator O2032 = 1.)
  16697. Retracting rl*prefer*rvt*predict-yes*H0*1
  16698. -->
  16699. (S1 ^operator O2031 = 0.)
  16700. =>WM: (14277: S1 ^operator O2034 +)
  16701. =>WM: (14276: S1 ^operator O2033 +)
  16702. =>WM: (14275: O2034 ^name predict-no)
  16703. =>WM: (14274: O2033 ^name predict-yes)
  16704. =>WM: (14273: R1020 ^value 1)
  16705. =>WM: (14272: R1 ^reward R1020)
  16706. =>WM: (14271: I3 ^see 0)
  16707. <=WM: (14262: S1 ^operator O2031 +)
  16708. <=WM: (14263: S1 ^operator O2032 +)
  16709. <=WM: (14264: S1 ^operator O2032)
  16710. <=WM: (14257: R1 ^reward R1019)
  16711. <=WM: (14256: I3 ^see 1)
  16712. <=WM: (14260: O2032 ^name predict-no)
  16713. <=WM: (14259: O2031 ^name predict-yes)
  16714. <=WM: (14258: R1019 ^value 1)
  16715. --- Inner Elaboration Phase, active level 1 (S1) ---
  16716. Firing prefer*rvt*predict-yes*H0
  16717. -->
  16718. Firing rl*prefer*rvt*predict-yes*H0*1
  16719. -->
  16720. (S1 ^operator O2033 = 0.)
  16721. Firing prefer*rvt*predict-no*H0
  16722. -->
  16723. Firing rl*prefer*rvt*predict-no*H0*2
  16724. -->
  16725. (S1 ^operator O2034 = 1.)
  16726. inner elaboration loop at bottom goal.
  16727. Retracting rl*prefer*rvt*predict-no*H0*2
  16728. -->
  16729. (S1 ^operator O2032 = 1.)
  16730. Retracting rl*prefer*rvt*predict-yes*H0*1
  16731. -->
  16732. (S1 ^operator O2031 = 0.)
  16733. --- END Proposal Phase ---
  16734. --- Decision Phase ---
  16735. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16736. =>WM: (14278: S1 ^operator O2034)
  16737. 1017: O: O2034 (predict-no)
  16738. --- END Decision Phase ---
  16739. --- Application Phase ---
  16740. --- Firing Productions (PE) For State At Depth 1 ---
  16741. --- Inner Elaboration Phase, active level 1 (S1) ---
  16742. Firing apply*operator
  16743. -->
  16744. (I3 ^predict-no N1017 + :O )
  16745. Firing apply*operator*complete
  16746. -->
  16747. (I3 ^predict-no N1016 - :O )
  16748. inner elaboration loop at bottom goal.
  16749. --- Change Working Memory (PE) ---
  16750. =>WM: (14279: I3 ^predict-no N1017)
  16751. <=WM: (14266: N1016 ^status complete)
  16752. <=WM: (14265: I3 ^predict-no N1016)
  16753. --- Firing Productions (IE) For State At Depth 1 ---
  16754. --- Inner Elaboration Phase, active level 1 (S1) ---
  16755. Firing monitor*world
  16756. -->
  16757. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16758. --- Change Working Memory (IE) ---
  16759. --- END Application Phase ---
  16760. --- Output Phase ---
  16761. ENV: Agent did: predict-no for direction U in state State-A
  16762. In State-A moving U
  16763. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16764. predict error 0
  16765. dir: dir isU
  16766. --- END Output Phase ---
  16767. |\--- Input Phase ---
  16768. =>WM: (14283: I2 ^dir U)
  16769. =>WM: (14282: I2 ^reward 1)
  16770. =>WM: (14281: I2 ^see 0)
  16771. =>WM: (14280: N1017 ^status complete)
  16772. <=WM: (14269: I2 ^dir U)
  16773. <=WM: (14268: I2 ^reward 1)
  16774. <=WM: (14267: I2 ^see 0)
  16775. =>WM: (14284: I2 ^level-1 L1-root)
  16776. <=WM: (14270: I2 ^level-1 L1-root)
  16777. --- END Input Phase ---
  16778. --- Proposal Phase ---
  16779. --- Inner Elaboration Phase, active level 1 (S1) ---
  16780. Firing elaborate*copy-see-to-output-link
  16781. -->
  16782. (I3 ^see 0 +)
  16783. Firing elaborate*reward*based*on*reward
  16784. -->
  16785. (R1021 ^value 1 +)
  16786. (R1 ^reward R1021 +)
  16787. Firing propose*predict-yes
  16788. -->
  16789. (O2035 ^name predict-yes +)
  16790. (S1 ^operator O2035 +)
  16791. Firing propose*predict-no
  16792. -->
  16793. (O2036 ^name predict-no +)
  16794. (S1 ^operator O2036 +)
  16795. Firing rl*prefer*rvt*predict-no*H0*2
  16796. -->
  16797. (S1 ^operator O2034 = 1.)
  16798. Firing rl*prefer*rvt*predict-yes*H0*1
  16799. -->
  16800. (S1 ^operator O2033 = 0.)
  16801. Firing prefer*rvt*predict-yes*H0
  16802. -->
  16803. Firing prefer*rvt*predict-no*H0
  16804. -->
  16805. Firing elaborate*copy-dir-to-output-link
  16806. -->
  16807. (I3 ^dir U +)
  16808. inner elaboration loop at bottom goal.
  16809. Retracting elaborate*copy-see-to-output-link
  16810. -->
  16811. (I3 ^see 0 +)
  16812. Retracting propose*predict-no
  16813. -->
  16814. (O2034 ^name predict-no +)
  16815. (S1 ^operator O2034 +)
  16816. Retracting propose*predict-yes
  16817. -->
  16818. (O2033 ^name predict-yes +)
  16819. (S1 ^operator O2033 +)
  16820. Retracting elaborate*reward*based*on*reward
  16821. -->
  16822. (R1020 ^value 1 +)
  16823. (R1 ^reward R1020 +)
  16824. Retracting elaborate*copy-dir-to-output-link
  16825. -->
  16826. (I3 ^dir U +)
  16827. Retracting rl*prefer*rvt*predict-no*H0*2
  16828. -->
  16829. (S1 ^operator O2034 = 1.)
  16830. Retracting rl*prefer*rvt*predict-yes*H0*1
  16831. -->
  16832. (S1 ^operator O2033 = 0.)
  16833. =>WM: (14290: S1 ^operator O2036 +)
  16834. =>WM: (14289: S1 ^operator O2035 +)
  16835. =>WM: (14288: O2036 ^name predict-no)
  16836. =>WM: (14287: O2035 ^name predict-yes)
  16837. =>WM: (14286: R1021 ^value 1)
  16838. =>WM: (14285: R1 ^reward R1021)
  16839. <=WM: (14276: S1 ^operator O2033 +)
  16840. <=WM: (14277: S1 ^operator O2034 +)
  16841. <=WM: (14278: S1 ^operator O2034)
  16842. <=WM: (14272: R1 ^reward R1020)
  16843. <=WM: (14275: O2034 ^name predict-no)
  16844. <=WM: (14274: O2033 ^name predict-yes)
  16845. <=WM: (14273: R1020 ^value 1)
  16846. --- Inner Elaboration Phase, active level 1 (S1) ---
  16847. Firing prefer*rvt*predict-yes*H0
  16848. -->
  16849. Firing rl*prefer*rvt*predict-yes*H0*1
  16850. -->
  16851. (S1 ^operator O2035 = 0.)
  16852. Firing prefer*rvt*predict-no*H0
  16853. -->
  16854. Firing rl*prefer*rvt*predict-no*H0*2
  16855. -->
  16856. (S1 ^operator O2036 = 1.)
  16857. inner elaboration loop at bottom goal.
  16858. Retracting rl*prefer*rvt*predict-no*H0*2
  16859. -->
  16860. (S1 ^operator O2034 = 1.)
  16861. Retracting rl*prefer*rvt*predict-yes*H0*1
  16862. -->
  16863. (S1 ^operator O2033 = 0.)
  16864. --- END Proposal Phase ---
  16865. --- Decision Phase ---
  16866. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  16867. =>WM: (14291: S1 ^operator O2036)
  16868. 1018: O: O2036 (predict-no)
  16869. --- END Decision Phase ---
  16870. --- Application Phase ---
  16871. --- Firing Productions (PE) For State At Depth 1 ---
  16872. --- Inner Elaboration Phase, active level 1 (S1) ---
  16873. Firing apply*operator
  16874. -->
  16875. (I3 ^predict-no N1018 + :O )
  16876. Firing apply*operator*complete
  16877. -->
  16878. (I3 ^predict-no N1017 - :O )
  16879. inner elaboration loop at bottom goal.
  16880. --- Change Working Memory (PE) ---
  16881. =>WM: (14292: I3 ^predict-no N1018)
  16882. <=WM: (14280: N1017 ^status complete)
  16883. <=WM: (14279: I3 ^predict-no N1017)
  16884. --- Firing Productions (IE) For State At Depth 1 ---
  16885. --- Inner Elaboration Phase, active level 1 (S1) ---
  16886. Firing monitor*world
  16887. -->
  16888. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  16889. --- Change Working Memory (IE) ---
  16890. --- END Application Phase ---
  16891. --- Output Phase ---
  16892. ENV: Agent did: predict-no for direction U in state State-A
  16893. In State-A moving U
  16894. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  16895. predict error 0
  16896. dir: dir isR
  16897. --- END Output Phase ---
  16898. -/--- Input Phase ---
  16899. =>WM: (14296: I2 ^dir R)
  16900. =>WM: (14295: I2 ^reward 1)
  16901. =>WM: (14294: I2 ^see 0)
  16902. =>WM: (14293: N1018 ^status complete)
  16903. <=WM: (14283: I2 ^dir U)
  16904. <=WM: (14282: I2 ^reward 1)
  16905. <=WM: (14281: I2 ^see 0)
  16906. =>WM: (14297: I2 ^level-1 L1-root)
  16907. <=WM: (14284: I2 ^level-1 L1-root)
  16908. --- END Input Phase ---
  16909. --- Proposal Phase ---
  16910. --- Inner Elaboration Phase, active level 1 (S1) ---
  16911. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  16912. -->
  16913. (S1 ^operator O2035 = 0.8784128060439984)
  16914. Firing prefer*rvt*predict-yes*H0*5*H1
  16915. -->
  16916. Firing elaborate*copy-see-to-output-link
  16917. -->
  16918. (I3 ^see 0 +)
  16919. Firing elaborate*reward*based*on*reward
  16920. -->
  16921. (R1022 ^value 1 +)
  16922. (R1 ^reward R1022 +)
  16923. Firing propose*predict-yes
  16924. -->
  16925. (O2037 ^name predict-yes +)
  16926. (S1 ^operator O2037 +)
  16927. Firing propose*predict-no
  16928. -->
  16929. (O2038 ^name predict-no +)
  16930. (S1 ^operator O2038 +)
  16931. Firing rl*prefer*rvt*predict-no*H0*6
  16932. -->
  16933. (S1 ^operator O2036 = 0.9999921813761182)
  16934. Firing rl*prefer*rvt*predict-yes*H0*5
  16935. -->
  16936. (S1 ^operator O2035 = 0.1215988095600619)
  16937. Firing prefer*rvt*predict-yes*H0
  16938. -->
  16939. Firing prefer*rvt*predict-no*H0
  16940. -->
  16941. Firing elaborate*copy-dir-to-output-link
  16942. -->
  16943. (I3 ^dir R +)
  16944. inner elaboration loop at bottom goal.
  16945. Retracting elaborate*copy-see-to-output-link
  16946. -->
  16947. (I3 ^see 0 +)
  16948. Retracting propose*predict-no
  16949. -->
  16950. (O2036 ^name predict-no +)
  16951. (S1 ^operator O2036 +)
  16952. Retracting propose*predict-yes
  16953. -->
  16954. (O2035 ^name predict-yes +)
  16955. (S1 ^operator O2035 +)
  16956. Retracting elaborate*reward*based*on*reward
  16957. -->
  16958. (R1021 ^value 1 +)
  16959. (R1 ^reward R1021 +)
  16960. Retracting elaborate*copy-dir-to-output-link
  16961. -->
  16962. (I3 ^dir U +)
  16963. Retracting rl*prefer*rvt*predict-no*H0*2
  16964. -->
  16965. (S1 ^operator O2036 = 1.)
  16966. Retracting rl*prefer*rvt*predict-yes*H0*1
  16967. -->
  16968. (S1 ^operator O2035 = 0.)
  16969. =>WM: (14304: S1 ^operator O2038 +)
  16970. =>WM: (14303: S1 ^operator O2037 +)
  16971. =>WM: (14302: I3 ^dir R)
  16972. =>WM: (14301: O2038 ^name predict-no)
  16973. =>WM: (14300: O2037 ^name predict-yes)
  16974. =>WM: (14299: R1022 ^value 1)
  16975. =>WM: (14298: R1 ^reward R1022)
  16976. <=WM: (14289: S1 ^operator O2035 +)
  16977. <=WM: (14290: S1 ^operator O2036 +)
  16978. <=WM: (14291: S1 ^operator O2036)
  16979. <=WM: (14261: I3 ^dir U)
  16980. <=WM: (14285: R1 ^reward R1021)
  16981. <=WM: (14288: O2036 ^name predict-no)
  16982. <=WM: (14287: O2035 ^name predict-yes)
  16983. <=WM: (14286: R1021 ^value 1)
  16984. --- Inner Elaboration Phase, active level 1 (S1) ---
  16985. Firing prefer*rvt*predict-yes*H0
  16986. -->
  16987. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  16988. -->
  16989. (S1 ^operator O2037 = 0.8784128060439984)
  16990. Firing rl*prefer*rvt*predict-yes*H0*5
  16991. -->
  16992. (S1 ^operator O2037 = 0.1215988095600619)
  16993. Firing prefer*rvt*predict-yes*H0*5*H1
  16994. -->
  16995. Firing prefer*rvt*predict-no*H0
  16996. -->
  16997. Firing rl*prefer*rvt*predict-no*H0*6
  16998. -->
  16999. (S1 ^operator O2038 = 0.9999921813761182)
  17000. inner elaboration loop at bottom goal.
  17001. Retracting rl*prefer*rvt*predict-no*H0*6
  17002. -->
  17003. (S1 ^operator O2036 = 0.9999921813761182)
  17004. Retracting rl*prefer*rvt*predict-yes*H0*5
  17005. -->
  17006. (S1 ^operator O2035 = 0.1215988095600619)
  17007. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  17008. -->
  17009. (S1 ^operator O2035 = 0.8784128060439984)
  17010. --- END Proposal Phase ---
  17011. --- Decision Phase ---
  17012. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17013. =>WM: (14305: S1 ^operator O2037)
  17014. 1019: O: O2037 (predict-yes)
  17015. --- END Decision Phase ---
  17016. --- Application Phase ---
  17017. --- Firing Productions (PE) For State At Depth 1 ---
  17018. --- Inner Elaboration Phase, active level 1 (S1) ---
  17019. Firing apply*operator
  17020. -->
  17021. (I3 ^predict-yes N1019 + :O )
  17022. Firing apply*operator*complete
  17023. -->
  17024. (I3 ^predict-no N1018 - :O )
  17025. inner elaboration loop at bottom goal.
  17026. --- Change Working Memory (PE) ---
  17027. =>WM: (14306: I3 ^predict-yes N1019)
  17028. <=WM: (14293: N1018 ^status complete)
  17029. <=WM: (14292: I3 ^predict-no N1018)
  17030. --- Firing Productions (IE) For State At Depth 1 ---
  17031. --- Inner Elaboration Phase, active level 1 (S1) ---
  17032. Firing monitor*world
  17033. -->
  17034. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17035. --- Change Working Memory (IE) ---
  17036. --- END Application Phase ---
  17037. --- Output Phase ---
  17038. ENV: Agent did: predict-yes for direction R in state State-A
  17039. In State-A moving R
  17040. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17041. predict error 0
  17042. dir: dir isL
  17043. --- END Output Phase ---
  17044. |\---- Input Phase ---
  17045. =>WM: (14310: I2 ^dir L)
  17046. =>WM: (14309: I2 ^reward 1)
  17047. =>WM: (14308: I2 ^see 1)
  17048. =>WM: (14307: N1019 ^status complete)
  17049. <=WM: (14296: I2 ^dir R)
  17050. <=WM: (14295: I2 ^reward 1)
  17051. <=WM: (14294: I2 ^see 0)
  17052. =>WM: (14311: I2 ^level-1 R1-root)
  17053. <=WM: (14297: I2 ^level-1 L1-root)
  17054. --- END Input Phase ---
  17055. --- Proposal Phase ---
  17056. --- Inner Elaboration Phase, active level 1 (S1) ---
  17057. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  17058. -->
  17059. (S1 ^operator O2038 = -0.168718511744511)
  17060. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  17061. -->
  17062. (S1 ^operator O2037 = 0.6092926303832609)
  17063. Firing prefer*rvt*predict-no*H0*4*H1
  17064. -->
  17065. Firing prefer*rvt*predict-yes*H0*3*H1
  17066. -->
  17067. Firing elaborate*copy-see-to-output-link
  17068. -->
  17069. (I3 ^see 1 +)
  17070. Firing elaborate*reward*based*on*reward
  17071. -->
  17072. (R1023 ^value 1 +)
  17073. (R1 ^reward R1023 +)
  17074. Firing propose*predict-yes
  17075. -->
  17076. (O2039 ^name predict-yes +)
  17077. (S1 ^operator O2039 +)
  17078. Firing propose*predict-no
  17079. -->
  17080. (O2040 ^name predict-no +)
  17081. (S1 ^operator O2040 +)
  17082. Firing rl*prefer*rvt*predict-no*H0*4
  17083. -->
  17084. (S1 ^operator O2038 = 0.3145047896375236)
  17085. Firing rl*prefer*rvt*predict-yes*H0*3
  17086. -->
  17087. (S1 ^operator O2037 = 0.3907725922691719)
  17088. Firing prefer*rvt*predict-yes*H0
  17089. -->
  17090. Firing prefer*rvt*predict-no*H0
  17091. -->
  17092. Firing elaborate*copy-dir-to-output-link
  17093. -->
  17094. (I3 ^dir L +)
  17095. inner elaboration loop at bottom goal.
  17096. Retracting elaborate*copy-see-to-output-link
  17097. -->
  17098. (I3 ^see 0 +)
  17099. Retracting propose*predict-no
  17100. -->
  17101. (O2038 ^name predict-no +)
  17102. (S1 ^operator O2038 +)
  17103. Retracting propose*predict-yes
  17104. -->
  17105. (O2037 ^name predict-yes +)
  17106. (S1 ^operator O2037 +)
  17107. Retracting elaborate*reward*based*on*reward
  17108. -->
  17109. (R1022 ^value 1 +)
  17110. (R1 ^reward R1022 +)
  17111. Retracting elaborate*copy-dir-to-output-link
  17112. -->
  17113. (I3 ^dir R +)
  17114. Retracting rl*prefer*rvt*predict-no*H0*6
  17115. -->
  17116. (S1 ^operator O2038 = 0.9999921813761182)
  17117. Retracting rl*prefer*rvt*predict-yes*H0*5
  17118. -->
  17119. (S1 ^operator O2037 = 0.1215988095600619)
  17120. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  17121. -->
  17122. (S1 ^operator O2037 = 0.8784128060439984)
  17123. =>WM: (14319: S1 ^operator O2040 +)
  17124. =>WM: (14318: S1 ^operator O2039 +)
  17125. =>WM: (14317: I3 ^dir L)
  17126. =>WM: (14316: O2040 ^name predict-no)
  17127. =>WM: (14315: O2039 ^name predict-yes)
  17128. =>WM: (14314: R1023 ^value 1)
  17129. =>WM: (14313: R1 ^reward R1023)
  17130. =>WM: (14312: I3 ^see 1)
  17131. <=WM: (14303: S1 ^operator O2037 +)
  17132. <=WM: (14305: S1 ^operator O2037)
  17133. <=WM: (14304: S1 ^operator O2038 +)
  17134. <=WM: (14302: I3 ^dir R)
  17135. <=WM: (14298: R1 ^reward R1022)
  17136. <=WM: (14271: I3 ^see 0)
  17137. <=WM: (14301: O2038 ^name predict-no)
  17138. <=WM: (14300: O2037 ^name predict-yes)
  17139. <=WM: (14299: R1022 ^value 1)
  17140. --- Inner Elaboration Phase, active level 1 (S1) ---
  17141. Firing prefer*rvt*predict-yes*H0
  17142. -->
  17143. Firing rl*prefer*rvt*predict-yes*H0*3
  17144. -->
  17145. (S1 ^operator O2039 = 0.3907725922691719)
  17146. Firing prefer*rvt*predict-yes*H0*3*H1
  17147. -->
  17148. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  17149. -->
  17150. (S1 ^operator O2039 = 0.6092926303832609)
  17151. Firing prefer*rvt*predict-no*H0
  17152. -->
  17153. Firing rl*prefer*rvt*predict-no*H0*4
  17154. -->
  17155. (S1 ^operator O2040 = 0.3145047896375236)
  17156. Firing prefer*rvt*predict-no*H0*4*H1
  17157. -->
  17158. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  17159. -->
  17160. (S1 ^operator O2040 = -0.168718511744511)
  17161. inner elaboration loop at bottom goal.
  17162. Retracting rl*prefer*rvt*predict-no*H0*4
  17163. -->
  17164. (S1 ^operator O2038 = 0.3145047896375236)
  17165. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  17166. -->
  17167. (S1 ^operator O2038 = -0.168718511744511)
  17168. Retracting rl*prefer*rvt*predict-yes*H0*3
  17169. -->
  17170. (S1 ^operator O2037 = 0.3907725922691719)
  17171. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  17172. -->
  17173. (S1 ^operator O2037 = 0.6092926303832609)
  17174. --- END Proposal Phase ---
  17175. --- Decision Phase ---
  17176. RL update rl*prefer*rvt*predict-yes*H0*5 0.534525 -0.412926 0.121599 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.866667,0.116201)
  17177. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465485 0.412928 0.878413 -> 0.465484 0.412928 0.878412(R,m,v=1,1,0)
  17178. =>WM: (14320: S1 ^operator O2039)
  17179. 1020: O: O2039 (predict-yes)
  17180. --- END Decision Phase ---
  17181. --- Application Phase ---
  17182. --- Firing Productions (PE) For State At Depth 1 ---
  17183. --- Inner Elaboration Phase, active level 1 (S1) ---
  17184. Firing apply*operator
  17185. -->
  17186. (I3 ^predict-yes N1020 + :O )
  17187. Firing apply*operator*complete
  17188. -->
  17189. (I3 ^predict-yes N1019 - :O )
  17190. inner elaboration loop at bottom goal.
  17191. --- Change Working Memory (PE) ---
  17192. =>WM: (14321: I3 ^predict-yes N1020)
  17193. <=WM: (14307: N1019 ^status complete)
  17194. <=WM: (14306: I3 ^predict-yes N1019)
  17195. --- Firing Productions (IE) For State At Depth 1 ---
  17196. --- Inner Elaboration Phase, active level 1 (S1) ---
  17197. Firing monitor*world
  17198. -->
  17199. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17200. --- Change Working Memory (IE) ---
  17201. --- END Application Phase ---
  17202. --- Output Phase ---
  17203. ENV: Agent did: predict-yes for direction L in state State-B
  17204. In State-B moving L
  17205. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17206. predict error 0
  17207. dir: dir isL
  17208. --- END Output Phase ---
  17209. --- Input Phase ---
  17210. =>WM: (14325: I2 ^dir L)
  17211. =>WM: (14324: I2 ^reward 1)
  17212. =>WM: (14323: I2 ^see 1)
  17213. =>WM: (14322: N1020 ^status complete)
  17214. <=WM: (14310: I2 ^dir L)
  17215. <=WM: (14309: I2 ^reward 1)
  17216. <=WM: (14308: I2 ^see 1)
  17217. =>WM: (14326: I2 ^level-1 L1-root)
  17218. <=WM: (14311: I2 ^level-1 R1-root)
  17219. --- END Input Phase ---
  17220. --- Proposal Phase ---
  17221. --- Inner Elaboration Phase, active level 1 (S1) ---
  17222. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  17223. -->
  17224. (S1 ^operator O2039 = -0.2062723012911647)
  17225. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  17226. -->
  17227. (S1 ^operator O2040 = 0.6855266893701198)
  17228. Firing prefer*rvt*predict-no*H0*4*H1
  17229. -->
  17230. Firing prefer*rvt*predict-yes*H0*3*H1
  17231. -->
  17232. Firing elaborate*copy-see-to-output-link
  17233. -->
  17234. (I3 ^see 1 +)
  17235. Firing elaborate*reward*based*on*reward
  17236. -->
  17237. (R1024 ^value 1 +)
  17238. (R1 ^reward R1024 +)
  17239. Firing propose*predict-yes
  17240. -->
  17241. (O2041 ^name predict-yes +)
  17242. (S1 ^operator O2041 +)
  17243. Firing propose*predict-no
  17244. -->
  17245. (O2042 ^name predict-no +)
  17246. (S1 ^operator O2042 +)
  17247. Firing rl*prefer*rvt*predict-no*H0*4
  17248. -->
  17249. (S1 ^operator O2040 = 0.3145047896375236)
  17250. Firing rl*prefer*rvt*predict-yes*H0*3
  17251. -->
  17252. (S1 ^operator O2039 = 0.3907725922691719)
  17253. Firing prefer*rvt*predict-yes*H0
  17254. -->
  17255. Firing prefer*rvt*predict-no*H0
  17256. -->
  17257. Firing elaborate*copy-dir-to-output-link
  17258. -->
  17259. (I3 ^dir L +)
  17260. inner elaboration loop at bottom goal.
  17261. Retracting elaborate*copy-see-to-output-link
  17262. -->
  17263. (I3 ^see 1 +)
  17264. Retracting propose*predict-no
  17265. -->
  17266. (O2040 ^name predict-no +)
  17267. (S1 ^operator O2040 +)
  17268. Retracting propose*predict-yes
  17269. -->
  17270. (O2039 ^name predict-yes +)
  17271. (S1 ^operator O2039 +)
  17272. Retracting elaborate*reward*based*on*reward
  17273. -->
  17274. (R1023 ^value 1 +)
  17275. (R1 ^reward R1023 +)
  17276. Retracting elaborate*copy-dir-to-output-link
  17277. -->
  17278. (I3 ^dir L +)
  17279. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  17280. -->
  17281. (S1 ^operator O2040 = -0.168718511744511)
  17282. Retracting rl*prefer*rvt*predict-no*H0*4
  17283. -->
  17284. (S1 ^operator O2040 = 0.3145047896375236)
  17285. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  17286. -->
  17287. (S1 ^operator O2039 = 0.6092926303832609)
  17288. Retracting rl*prefer*rvt*predict-yes*H0*3
  17289. -->
  17290. (S1 ^operator O2039 = 0.3907725922691719)
  17291. =>WM: (14332: S1 ^operator O2042 +)
  17292. =>WM: (14331: S1 ^operator O2041 +)
  17293. =>WM: (14330: O2042 ^name predict-no)
  17294. =>WM: (14329: O2041 ^name predict-yes)
  17295. =>WM: (14328: R1024 ^value 1)
  17296. =>WM: (14327: R1 ^reward R1024)
  17297. <=WM: (14318: S1 ^operator O2039 +)
  17298. <=WM: (14320: S1 ^operator O2039)
  17299. <=WM: (14319: S1 ^operator O2040 +)
  17300. <=WM: (14313: R1 ^reward R1023)
  17301. <=WM: (14316: O2040 ^name predict-no)
  17302. <=WM: (14315: O2039 ^name predict-yes)
  17303. <=WM: (14314: R1023 ^value 1)
  17304. --- Inner Elaboration Phase, active level 1 (S1) ---
  17305. Firing prefer*rvt*predict-yes*H0
  17306. -->
  17307. Firing rl*prefer*rvt*predict-yes*H0*3
  17308. -->
  17309. (S1 ^operator O2041 = 0.3907725922691719)
  17310. Firing prefer*rvt*predict-yes*H0*3*H1
  17311. -->
  17312. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  17313. -->
  17314. (S1 ^operator O2041 = -0.2062723012911647)
  17315. Firing prefer*rvt*predict-no*H0
  17316. -->
  17317. Firing rl*prefer*rvt*predict-no*H0*4
  17318. -->
  17319. (S1 ^operator O2042 = 0.3145047896375236)
  17320. Firing prefer*rvt*predict-no*H0*4*H1
  17321. -->
  17322. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  17323. -->
  17324. (S1 ^operator O2042 = 0.6855266893701198)
  17325. inner elaboration loop at bottom goal.
  17326. Retracting rl*prefer*rvt*predict-no*H0*4
  17327. -->
  17328. (S1 ^operator O2040 = 0.3145047896375236)
  17329. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  17330. -->
  17331. (S1 ^operator O2040 = 0.6855266893701198)
  17332. Retracting rl*prefer*rvt*predict-yes*H0*3
  17333. -->
  17334. (S1 ^operator O2039 = 0.3907725922691719)
  17335. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  17336. -->
  17337. (S1 ^operator O2039 = -0.2062723012911647)
  17338. --- END Proposal Phase ---
  17339. --- Decision Phase ---
  17340. RL update rl*prefer*rvt*predict-yes*H0*3 0.472319 -0.0815467 0.390773 -> 0.472315 -0.0815475 0.390767(R,m,v=1,0.945122,0.0521846)
  17341. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527736 0.0815563 0.609293 -> 0.527731 0.0815554 0.609286(R,m,v=1,1,0)
  17342. =>WM: (14333: S1 ^operator O2042)
  17343. 1021: O: O2042 (predict-no)
  17344. --- END Decision Phase ---
  17345. --- Application Phase ---
  17346. --- Firing Productions (PE) For State At Depth 1 ---
  17347. --- Inner Elaboration Phase, active level 1 (S1) ---
  17348. Firing apply*operator
  17349. -->
  17350. (I3 ^predict-no N1021 + :O )
  17351. Firing apply*operator*complete
  17352. -->
  17353. (I3 ^predict-yes N1020 - :O )
  17354. inner elaboration loop at bottom goal.
  17355. --- Change Working Memory (PE) ---
  17356. =>WM: (14334: I3 ^predict-no N1021)
  17357. <=WM: (14322: N1020 ^status complete)
  17358. <=WM: (14321: I3 ^predict-yes N1020)
  17359. --- Firing Productions (IE) For State At Depth 1 ---
  17360. --- Inner Elaboration Phase, active level 1 (S1) ---
  17361. Firing monitor*world
  17362. -->
  17363. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17364. --- Change Working Memory (IE) ---
  17365. --- END Application Phase ---
  17366. --- Output Phase ---
  17367. ENV: Agent did: predict-no for direction L in state State-A
  17368. In State-A moving L
  17369. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17370. predict error 0
  17371. dir: dir isR
  17372. --- END Output Phase ---
  17373. --- Input Phase ---
  17374. =>WM: (14338: I2 ^dir R)
  17375. =>WM: (14337: I2 ^reward 1)
  17376. =>WM: (14336: I2 ^see 0)
  17377. =>WM: (14335: N1021 ^status complete)
  17378. <=WM: (14325: I2 ^dir L)
  17379. <=WM: (14324: I2 ^reward 1)
  17380. <=WM: (14323: I2 ^see 1)
  17381. =>WM: (14339: I2 ^level-1 L0-root)
  17382. <=WM: (14326: I2 ^level-1 L1-root)
  17383. --- END Input Phase ---
  17384. --- Proposal Phase ---
  17385. --- Inner Elaboration Phase, active level 1 (S1) ---
  17386. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  17387. -->
  17388. (S1 ^operator O2041 = 0.8783962927268922)
  17389. Firing prefer*rvt*predict-yes*H0*5*H1
  17390. -->
  17391. Firing elaborate*copy-see-to-output-link
  17392. -->
  17393. (I3 ^see 0 +)
  17394. Firing elaborate*reward*based*on*reward
  17395. -->
  17396. (R1025 ^value 1 +)
  17397. (R1 ^reward R1025 +)
  17398. Firing propose*predict-yes
  17399. -->
  17400. (O2043 ^name predict-yes +)
  17401. (S1 ^operator O2043 +)
  17402. Firing propose*predict-no
  17403. -->
  17404. (O2044 ^name predict-no +)
  17405. (S1 ^operator O2044 +)
  17406. Firing rl*prefer*rvt*predict-no*H0*6
  17407. -->
  17408. (S1 ^operator O2042 = 0.9999921813761182)
  17409. Firing rl*prefer*rvt*predict-yes*H0*5
  17410. -->
  17411. (S1 ^operator O2041 = 0.1215978717524572)
  17412. Firing prefer*rvt*predict-yes*H0
  17413. -->
  17414. Firing prefer*rvt*predict-no*H0
  17415. -->
  17416. Firing elaborate*copy-dir-to-output-link
  17417. -->
  17418. (I3 ^dir R +)
  17419. inner elaboration loop at bottom goal.
  17420. Retracting elaborate*copy-see-to-output-link
  17421. -->
  17422. (I3 ^see 1 +)
  17423. Retracting propose*predict-no
  17424. -->
  17425. (O2042 ^name predict-no +)
  17426. (S1 ^operator O2042 +)
  17427. Retracting propose*predict-yes
  17428. -->
  17429. (O2041 ^name predict-yes +)
  17430. (S1 ^operator O2041 +)
  17431. Retracting elaborate*reward*based*on*reward
  17432. -->
  17433. (R1024 ^value 1 +)
  17434. (R1 ^reward R1024 +)
  17435. Retracting elaborate*copy-dir-to-output-link
  17436. -->
  17437. (I3 ^dir L +)
  17438. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  17439. -->
  17440. (S1 ^operator O2042 = 0.6855266893701198)
  17441. Retracting rl*prefer*rvt*predict-no*H0*4
  17442. -->
  17443. (S1 ^operator O2042 = 0.3145047896375236)
  17444. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  17445. -->
  17446. (S1 ^operator O2041 = -0.2062723012911647)
  17447. Retracting rl*prefer*rvt*predict-yes*H0*3
  17448. -->
  17449. (S1 ^operator O2041 = 0.3907672460330531)
  17450. =>WM: (14347: S1 ^operator O2044 +)
  17451. =>WM: (14346: S1 ^operator O2043 +)
  17452. =>WM: (14345: I3 ^dir R)
  17453. =>WM: (14344: O2044 ^name predict-no)
  17454. =>WM: (14343: O2043 ^name predict-yes)
  17455. =>WM: (14342: R1025 ^value 1)
  17456. =>WM: (14341: R1 ^reward R1025)
  17457. =>WM: (14340: I3 ^see 0)
  17458. <=WM: (14331: S1 ^operator O2041 +)
  17459. <=WM: (14332: S1 ^operator O2042 +)
  17460. <=WM: (14333: S1 ^operator O2042)
  17461. <=WM: (14317: I3 ^dir L)
  17462. <=WM: (14327: R1 ^reward R1024)
  17463. <=WM: (14312: I3 ^see 1)
  17464. <=WM: (14330: O2042 ^name predict-no)
  17465. <=WM: (14329: O2041 ^name predict-yes)
  17466. <=WM: (14328: R1024 ^value 1)
  17467. --- Inner Elaboration Phase, active level 1 (S1) ---
  17468. Firing prefer*rvt*predict-yes*H0
  17469. -->
  17470. Firing rl*prefer*rvt*predict-yes*H0*5
  17471. -->
  17472. (S1 ^operator O2043 = 0.1215978717524572)
  17473. Firing prefer*rvt*predict-yes*H0*5*H1
  17474. -->
  17475. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  17476. -->
  17477. (S1 ^operator O2043 = 0.8783962927268922)
  17478. Firing prefer*rvt*predict-no*H0
  17479. -->
  17480. Firing rl*prefer*rvt*predict-no*H0*6
  17481. -->
  17482. (S1 ^operator O2044 = 0.9999921813761182)
  17483. inner elaboration loop at bottom goal.
  17484. Retracting rl*prefer*rvt*predict-no*H0*6
  17485. -->
  17486. (S1 ^operator O2042 = 0.9999921813761182)
  17487. Retracting rl*prefer*rvt*predict-yes*H0*5
  17488. -->
  17489. (S1 ^operator O2041 = 0.1215978717524572)
  17490. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  17491. -->
  17492. (S1 ^operator O2041 = 0.8783962927268922)
  17493. --- END Proposal Phase ---
  17494. --- Decision Phase ---
  17495. RL update rl*prefer*rvt*predict-no*H0*4 0.478553 -0.164048 0.314505 -> 0.478551 -0.164048 0.314502(R,m,v=1,0.924528,0.0702173)
  17496. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521476 0.164051 0.685527 -> 0.521473 0.164051 0.685524(R,m,v=1,1,0)
  17497. =>WM: (14348: S1 ^operator O2043)
  17498. 1022: O: O2043 (predict-yes)
  17499. --- END Decision Phase ---
  17500. --- Application Phase ---
  17501. --- Firing Productions (PE) For State At Depth 1 ---
  17502. --- Inner Elaboration Phase, active level 1 (S1) ---
  17503. Firing apply*operator
  17504. -->
  17505. (I3 ^predict-yes N1022 + :O )
  17506. Firing apply*operator*complete
  17507. -->
  17508. (I3 ^predict-no N1021 - :O )
  17509. inner elaboration loop at bottom goal.
  17510. --- Change Working Memory (PE) ---
  17511. =>WM: (14349: I3 ^predict-yes N1022)
  17512. <=WM: (14335: N1021 ^status complete)
  17513. <=WM: (14334: I3 ^predict-no N1021)
  17514. --- Firing Productions (IE) For State At Depth 1 ---
  17515. --- Inner Elaboration Phase, active level 1 (S1) ---
  17516. Firing monitor*world
  17517. -->
  17518. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17519. --- Change Working Memory (IE) ---
  17520. --- END Application Phase ---
  17521. --- Output Phase ---
  17522. ENV: Agent did: predict-yes for direction R in state State-A
  17523. In State-A moving R
  17524. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  17525. predict error 0
  17526. dir: dir isU
  17527. --- END Output Phase ---
  17528. --- Input Phase ---
  17529. =>WM: (14353: I2 ^dir U)
  17530. =>WM: (14352: I2 ^reward 1)
  17531. =>WM: (14351: I2 ^see 1)
  17532. =>WM: (14350: N1022 ^status complete)
  17533. <=WM: (14338: I2 ^dir R)
  17534. <=WM: (14337: I2 ^reward 1)
  17535. <=WM: (14336: I2 ^see 0)
  17536. =>WM: (14354: I2 ^level-1 R1-root)
  17537. <=WM: (14339: I2 ^level-1 L0-root)
  17538. --- END Input Phase ---
  17539. --- Proposal Phase ---
  17540. --- Inner Elaboration Phase, active level 1 (S1) ---
  17541. Firing elaborate*copy-see-to-output-link
  17542. -->
  17543. (I3 ^see 1 +)
  17544. Firing elaborate*reward*based*on*reward
  17545. -->
  17546. (R1026 ^value 1 +)
  17547. (R1 ^reward R1026 +)
  17548. Firing propose*predict-yes
  17549. -->
  17550. (O2045 ^name predict-yes +)
  17551. (S1 ^operator O2045 +)
  17552. Firing propose*predict-no
  17553. -->
  17554. (O2046 ^name predict-no +)
  17555. (S1 ^operator O2046 +)
  17556. Firing rl*prefer*rvt*predict-no*H0*2
  17557. -->
  17558. (S1 ^operator O2044 = 1.)
  17559. Firing rl*prefer*rvt*predict-yes*H0*1
  17560. -->
  17561. (S1 ^operator O2043 = 0.)
  17562. Firing prefer*rvt*predict-yes*H0
  17563. -->
  17564. Firing prefer*rvt*predict-no*H0
  17565. -->
  17566. Firing elaborate*copy-dir-to-output-link
  17567. -->
  17568. (I3 ^dir U +)
  17569. inner elaboration loop at bottom goal.
  17570. Retracting elaborate*copy-see-to-output-link
  17571. -->
  17572. (I3 ^see 0 +)
  17573. Retracting propose*predict-no
  17574. -->
  17575. (O2044 ^name predict-no +)
  17576. (S1 ^operator O2044 +)
  17577. Retracting propose*predict-yes
  17578. -->
  17579. (O2043 ^name predict-yes +)
  17580. (S1 ^operator O2043 +)
  17581. Retracting elaborate*reward*based*on*reward
  17582. -->
  17583. (R1025 ^value 1 +)
  17584. (R1 ^reward R1025 +)
  17585. Retracting elaborate*copy-dir-to-output-link
  17586. -->
  17587. (I3 ^dir R +)
  17588. Retracting rl*prefer*rvt*predict-no*H0*6
  17589. -->
  17590. (S1 ^operator O2044 = 0.9999921813761182)
  17591. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  17592. -->
  17593. (S1 ^operator O2043 = 0.8783962927268922)
  17594. Retracting rl*prefer*rvt*predict-yes*H0*5
  17595. -->
  17596. (S1 ^operator O2043 = 0.1215978717524572)
  17597. =>WM: (14362: S1 ^operator O2046 +)
  17598. =>WM: (14361: S1 ^operator O2045 +)
  17599. =>WM: (14360: I3 ^dir U)
  17600. =>WM: (14359: O2046 ^name predict-no)
  17601. =>WM: (14358: O2045 ^name predict-yes)
  17602. =>WM: (14357: R1026 ^value 1)
  17603. =>WM: (14356: R1 ^reward R1026)
  17604. =>WM: (14355: I3 ^see 1)
  17605. <=WM: (14346: S1 ^operator O2043 +)
  17606. <=WM: (14348: S1 ^operator O2043)
  17607. <=WM: (14347: S1 ^operator O2044 +)
  17608. <=WM: (14345: I3 ^dir R)
  17609. <=WM: (14341: R1 ^reward R1025)
  17610. <=WM: (14340: I3 ^see 0)
  17611. <=WM: (14344: O2044 ^name predict-no)
  17612. <=WM: (14343: O2043 ^name predict-yes)
  17613. <=WM: (14342: R1025 ^value 1)
  17614. --- Inner Elaboration Phase, active level 1 (S1) ---
  17615. Firing prefer*rvt*predict-yes*H0
  17616. -->
  17617. Firing rl*prefer*rvt*predict-yes*H0*1
  17618. -->
  17619. (S1 ^operator O2045 = 0.)
  17620. Firing prefer*rvt*predict-no*H0
  17621. -->
  17622. Firing rl*prefer*rvt*predict-no*H0*2
  17623. -->
  17624. (S1 ^operator O2046 = 1.)
  17625. inner elaboration loop at bottom goal.
  17626. Retracting rl*prefer*rvt*predict-no*H0*2
  17627. -->
  17628. (S1 ^operator O2044 = 1.)
  17629. Retracting rl*prefer*rvt*predict-yes*H0*1
  17630. -->
  17631. (S1 ^operator O2043 = 0.)
  17632. --- END Proposal Phase ---
  17633. --- Decision Phase ---
  17634. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.867403,0.115654)
  17635. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465471 0.412925 0.878396 -> 0.465472 0.412925 0.878397(R,m,v=1,1,0)
  17636. =>WM: (14363: S1 ^operator O2046)
  17637. 1023: O: O2046 (predict-no)
  17638. --- END Decision Phase ---
  17639. --- Application Phase ---
  17640. --- Firing Productions (PE) For State At Depth 1 ---
  17641. --- Inner Elaboration Phase, active level 1 (S1) ---
  17642. Firing apply*operator
  17643. -->
  17644. (I3 ^predict-no N1023 + :O )
  17645. Firing apply*operator*complete
  17646. -->
  17647. (I3 ^predict-yes N1022 - :O )
  17648. inner elaboration loop at bottom goal.
  17649. --- Change Working Memory (PE) ---
  17650. =>WM: (14364: I3 ^predict-no N1023)
  17651. <=WM: (14350: N1022 ^status complete)
  17652. <=WM: (14349: I3 ^predict-yes N1022)
  17653. --- Firing Productions (IE) For State At Depth 1 ---
  17654. --- Inner Elaboration Phase, active level 1 (S1) ---
  17655. Firing monitor*world
  17656. -->
  17657. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17658. --- Change Working Memory (IE) ---
  17659. --- END Application Phase ---
  17660. --- Output Phase ---
  17661. ENV: Agent did: predict-no for direction U in state State-B
  17662. In State-B moving U
  17663. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  17664. predict error 0
  17665. dir: dir isL
  17666. --- END Output Phase ---
  17667. --- Input Phase ---
  17668. =>WM: (14368: I2 ^dir L)
  17669. =>WM: (14367: I2 ^reward 1)
  17670. =>WM: (14366: I2 ^see 0)
  17671. =>WM: (14365: N1023 ^status complete)
  17672. <=WM: (14353: I2 ^dir U)
  17673. <=WM: (14352: I2 ^reward 1)
  17674. <=WM: (14351: I2 ^see 1)
  17675. =>WM: (14369: I2 ^level-1 R1-root)
  17676. <=WM: (14354: I2 ^level-1 R1-root)
  17677. --- END Input Phase ---
  17678. --- Proposal Phase ---
  17679. --- Inner Elaboration Phase, active level 1 (S1) ---
  17680. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  17681. -->
  17682. (S1 ^operator O2046 = -0.168718511744511)
  17683. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  17684. -->
  17685. (S1 ^operator O2045 = 0.6092864975390457)
  17686. Firing prefer*rvt*predict-no*H0*4*H1
  17687. -->
  17688. Firing prefer*rvt*predict-yes*H0*3*H1
  17689. -->
  17690. Firing elaborate*copy-see-to-output-link
  17691. -->
  17692. (I3 ^see 0 +)
  17693. Firing elaborate*reward*based*on*reward
  17694. -->
  17695. (R1027 ^value 1 +)
  17696. (R1 ^reward R1027 +)
  17697. Firing propose*predict-yes
  17698. -->
  17699. (O2047 ^name predict-yes +)
  17700. (S1 ^operator O2047 +)
  17701. Firing propose*predict-no
  17702. -->
  17703. (O2048 ^name predict-no +)
  17704. (S1 ^operator O2048 +)
  17705. Firing rl*prefer*rvt*predict-no*H0*4
  17706. -->
  17707. (S1 ^operator O2046 = 0.314502196170351)
  17708. Firing rl*prefer*rvt*predict-yes*H0*3
  17709. -->
  17710. (S1 ^operator O2045 = 0.3907672460330531)
  17711. Firing prefer*rvt*predict-yes*H0
  17712. -->
  17713. Firing prefer*rvt*predict-no*H0
  17714. -->
  17715. Firing elaborate*copy-dir-to-output-link
  17716. -->
  17717. (I3 ^dir L +)
  17718. inner elaboration loop at bottom goal.
  17719. Retracting elaborate*copy-see-to-output-link
  17720. -->
  17721. (I3 ^see 1 +)
  17722. Retracting propose*predict-no
  17723. -->
  17724. (O2046 ^name predict-no +)
  17725. (S1 ^operator O2046 +)
  17726. Retracting propose*predict-yes
  17727. -->
  17728. (O2045 ^name predict-yes +)
  17729. (S1 ^operator O2045 +)
  17730. Retracting elaborate*reward*based*on*reward
  17731. -->
  17732. (R1026 ^value 1 +)
  17733. (R1 ^reward R1026 +)
  17734. Retracting elaborate*copy-dir-to-output-link
  17735. -->
  17736. (I3 ^dir U +)
  17737. Retracting rl*prefer*rvt*predict-no*H0*2
  17738. -->
  17739. (S1 ^operator O2046 = 1.)
  17740. Retracting rl*prefer*rvt*predict-yes*H0*1
  17741. -->
  17742. (S1 ^operator O2045 = 0.)
  17743. =>WM: (14377: S1 ^operator O2048 +)
  17744. =>WM: (14376: S1 ^operator O2047 +)
  17745. =>WM: (14375: I3 ^dir L)
  17746. =>WM: (14374: O2048 ^name predict-no)
  17747. =>WM: (14373: O2047 ^name predict-yes)
  17748. =>WM: (14372: R1027 ^value 1)
  17749. =>WM: (14371: R1 ^reward R1027)
  17750. =>WM: (14370: I3 ^see 0)
  17751. <=WM: (14361: S1 ^operator O2045 +)
  17752. <=WM: (14362: S1 ^operator O2046 +)
  17753. <=WM: (14363: S1 ^operator O2046)
  17754. <=WM: (14360: I3 ^dir U)
  17755. <=WM: (14356: R1 ^reward R1026)
  17756. <=WM: (14355: I3 ^see 1)
  17757. <=WM: (14359: O2046 ^name predict-no)
  17758. <=WM: (14358: O2045 ^name predict-yes)
  17759. <=WM: (14357: R1026 ^value 1)
  17760. --- Inner Elaboration Phase, active level 1 (S1) ---
  17761. Firing prefer*rvt*predict-yes*H0
  17762. -->
  17763. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  17764. -->
  17765. (S1 ^operator O2047 = 0.6092864975390457)
  17766. Firing rl*prefer*rvt*predict-yes*H0*3
  17767. -->
  17768. (S1 ^operator O2047 = 0.3907672460330531)
  17769. Firing prefer*rvt*predict-yes*H0*3*H1
  17770. -->
  17771. Firing prefer*rvt*predict-no*H0
  17772. -->
  17773. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  17774. -->
  17775. (S1 ^operator O2048 = -0.168718511744511)
  17776. Firing rl*prefer*rvt*predict-no*H0*4
  17777. -->
  17778. (S1 ^operator O2048 = 0.314502196170351)
  17779. Firing prefer*rvt*predict-no*H0*4*H1
  17780. -->
  17781. inner elaboration loop at bottom goal.
  17782. Retracting rl*prefer*rvt*predict-no*H0*4
  17783. -->
  17784. (S1 ^operator O2046 = 0.314502196170351)
  17785. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  17786. -->
  17787. (S1 ^operator O2046 = -0.168718511744511)
  17788. Retracting rl*prefer*rvt*predict-yes*H0*3
  17789. -->
  17790. (S1 ^operator O2045 = 0.3907672460330531)
  17791. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  17792. -->
  17793. (S1 ^operator O2045 = 0.6092864975390457)
  17794. --- END Proposal Phase ---
  17795. --- Decision Phase ---
  17796. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  17797. =>WM: (14378: S1 ^operator O2047)
  17798. 1024: O: O2047 (predict-yes)
  17799. --- END Decision Phase ---
  17800. --- Application Phase ---
  17801. --- Firing Productions (PE) For State At Depth 1 ---
  17802. --- Inner Elaboration Phase, active level 1 (S1) ---
  17803. Firing apply*operator
  17804. -->
  17805. (I3 ^predict-yes N1024 + :O )
  17806. Firing apply*operator*complete
  17807. -->
  17808. (I3 ^predict-no N1023 - :O )
  17809. inner elaboration loop at bottom goal.
  17810. --- Change Working Memory (PE) ---
  17811. =>WM: (14379: I3 ^predict-yes N1024)
  17812. <=WM: (14365: N1023 ^status complete)
  17813. <=WM: (14364: I3 ^predict-no N1023)
  17814. --- Firing Productions (IE) For State At Depth 1 ---
  17815. --- Inner Elaboration Phase, active level 1 (S1) ---
  17816. Firing monitor*world
  17817. -->
  17818. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  17819. --- Change Working Memory (IE) ---
  17820. --- END Application Phase ---
  17821. --- Output Phase ---
  17822. ENV: Agent did: predict-yes for direction L in state State-B
  17823. In State-B moving L
  17824. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  17825. predict error 0
  17826. dir: dir isU
  17827. --- END Output Phase ---
  17828. --- Input Phase ---
  17829. =>WM: (14383: I2 ^dir U)
  17830. =>WM: (14382: I2 ^reward 1)
  17831. =>WM: (14381: I2 ^see 1)
  17832. =>WM: (14380: N1024 ^status complete)
  17833. <=WM: (14368: I2 ^dir L)
  17834. <=WM: (14367: I2 ^reward 1)
  17835. <=WM: (14366: I2 ^see 0)
  17836. =>WM: (14384: I2 ^level-1 L1-root)
  17837. <=WM: (14369: I2 ^level-1 R1-root)
  17838. --- END Input Phase ---
  17839. --- Proposal Phase ---
  17840. --- Inner Elaboration Phase, active level 1 (S1) ---
  17841. Firing elaborate*copy-see-to-output-link
  17842. -->
  17843. (I3 ^see 1 +)
  17844. Firing elaborate*reward*based*on*reward
  17845. -->
  17846. (R1028 ^value 1 +)
  17847. (R1 ^reward R1028 +)
  17848. Firing propose*predict-yes
  17849. -->
  17850. (O2049 ^name predict-yes +)
  17851. (S1 ^operator O2049 +)
  17852. Firing propose*predict-no
  17853. -->
  17854. (O2050 ^name predict-no +)
  17855. (S1 ^operator O2050 +)
  17856. Firing rl*prefer*rvt*predict-no*H0*2
  17857. -->
  17858. (S1 ^operator O2048 = 1.)
  17859. Firing rl*prefer*rvt*predict-yes*H0*1
  17860. -->
  17861. (S1 ^operator O2047 = 0.)
  17862. Firing prefer*rvt*predict-yes*H0
  17863. -->
  17864. Firing prefer*rvt*predict-no*H0
  17865. -->
  17866. Firing elaborate*copy-dir-to-output-link
  17867. -->
  17868. (I3 ^dir U +)
  17869. inner elaboration loop at bottom goal.
  17870. Retracting elaborate*copy-see-to-output-link
  17871. -->
  17872. (I3 ^see 0 +)
  17873. Retracting propose*predict-no
  17874. -->
  17875. (O2048 ^name predict-no +)
  17876. (S1 ^operator O2048 +)
  17877. Retracting propose*predict-yes
  17878. -->
  17879. (O2047 ^name predict-yes +)
  17880. (S1 ^operator O2047 +)
  17881. Retracting elaborate*reward*based*on*reward
  17882. -->
  17883. (R1027 ^value 1 +)
  17884. (R1 ^reward R1027 +)
  17885. Retracting elaborate*copy-dir-to-output-link
  17886. -->
  17887. (I3 ^dir L +)
  17888. Retracting rl*prefer*rvt*predict-no*H0*4
  17889. -->
  17890. (S1 ^operator O2048 = 0.314502196170351)
  17891. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  17892. -->
  17893. (S1 ^operator O2048 = -0.168718511744511)
  17894. Retracting rl*prefer*rvt*predict-yes*H0*3
  17895. -->
  17896. (S1 ^operator O2047 = 0.3907672460330531)
  17897. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  17898. -->
  17899. (S1 ^operator O2047 = 0.6092864975390457)
  17900. =>WM: (14392: S1 ^operator O2050 +)
  17901. =>WM: (14391: S1 ^operator O2049 +)
  17902. =>WM: (14390: I3 ^dir U)
  17903. =>WM: (14389: O2050 ^name predict-no)
  17904. =>WM: (14388: O2049 ^name predict-yes)
  17905. =>WM: (14387: R1028 ^value 1)
  17906. =>WM: (14386: R1 ^reward R1028)
  17907. =>WM: (14385: I3 ^see 1)
  17908. <=WM: (14376: S1 ^operator O2047 +)
  17909. <=WM: (14378: S1 ^operator O2047)
  17910. <=WM: (14377: S1 ^operator O2048 +)
  17911. <=WM: (14375: I3 ^dir L)
  17912. <=WM: (14371: R1 ^reward R1027)
  17913. <=WM: (14370: I3 ^see 0)
  17914. <=WM: (14374: O2048 ^name predict-no)
  17915. <=WM: (14373: O2047 ^name predict-yes)
  17916. <=WM: (14372: R1027 ^value 1)
  17917. --- Inner Elaboration Phase, active level 1 (S1) ---
  17918. Firing prefer*rvt*predict-yes*H0
  17919. -->
  17920. Firing rl*prefer*rvt*predict-yes*H0*1
  17921. -->
  17922. (S1 ^operator O2049 = 0.)
  17923. Firing prefer*rvt*predict-no*H0
  17924. -->
  17925. Firing rl*prefer*rvt*predict-no*H0*2
  17926. -->
  17927. (S1 ^operator O2050 = 1.)
  17928. inner elaboration loop at bottom goal.
  17929. Retracting rl*prefer*rvt*predict-no*H0*2
  17930. -->
  17931. (S1 ^operator O2048 = 1.)
  17932. Retracting rl*prefer*rvt*predict-yes*H0*1
  17933. -->
  17934. (S1 ^operator O2047 = 0.)
  17935. --- END Proposal Phase ---
  17936. --- Decision Phase ---
  17937. RL update rl*prefer*rvt*predict-yes*H0*3 0.472315 -0.0815475 0.390767 -> 0.472311 -0.0815481 0.390763(R,m,v=1,0.945455,0.0518847)
  17938. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527731 0.0815554 0.609286 -> 0.527727 0.0815547 0.609281(R,m,v=1,1,0)
  17939. =>WM: (14393: S1 ^operator O2050)
  17940. 1025: O: O2050 (predict-no)
  17941. --- END Decision Phase ---
  17942. --- Application Phase ---
  17943. --- Firing Productions (PE) For State At Depth 1 ---
  17944. --- Inner Elaboration Phase, active level 1 (S1) ---
  17945. Firing apply*operator
  17946. -->
  17947. (I3 ^predict-no N1025 + :O )
  17948. Firing apply*operator*complete
  17949. -->
  17950. (I3 ^predict-yes N1024 - :O )
  17951. inner elaboration loop at bottom goal.
  17952. --- Change Working Memory (PE) ---
  17953. =>WM: (14394: I3 ^predict-no N1025)
  17954. <=WM: (14380: N1024 ^status complete)
  17955. <=WM: (14379: I3 ^predict-yes N1024)
  17956. --- Firing Productions (IE) For State At Depth 1 ---
  17957. --- Inner Elaboration Phase, active level 1 (S1) ---
  17958. Firing monitor*world
  17959. -->
  17960. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  17961. --- Change Working Memory (IE) ---
  17962. --- END Application Phase ---
  17963. --- Output Phase ---
  17964. ENV: Agent did: predict-no for direction U in state State-A
  17965. In State-A moving U
  17966. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  17967. predict error 0
  17968. dir: dir isU
  17969. --- END Output Phase ---
  17970. --- Input Phase ---
  17971. =>WM: (14398: I2 ^dir U)
  17972. =>WM: (14397: I2 ^reward 1)
  17973. =>WM: (14396: I2 ^see 0)
  17974. =>WM: (14395: N1025 ^status complete)
  17975. <=WM: (14383: I2 ^dir U)
  17976. <=WM: (14382: I2 ^reward 1)
  17977. <=WM: (14381: I2 ^see 1)
  17978. =>WM: (14399: I2 ^level-1 L1-root)
  17979. <=WM: (14384: I2 ^level-1 L1-root)
  17980. --- END Input Phase ---
  17981. --- Proposal Phase ---
  17982. --- Inner Elaboration Phase, active level 1 (S1) ---
  17983. Firing elaborate*copy-see-to-output-link
  17984. -->
  17985. (I3 ^see 0 +)
  17986. Firing elaborate*reward*based*on*reward
  17987. -->
  17988. (R1029 ^value 1 +)
  17989. (R1 ^reward R1029 +)
  17990. Firing propose*predict-yes
  17991. -->
  17992. (O2051 ^name predict-yes +)
  17993. (S1 ^operator O2051 +)
  17994. Firing propose*predict-no
  17995. -->
  17996. (O2052 ^name predict-no +)
  17997. (S1 ^operator O2052 +)
  17998. Firing rl*prefer*rvt*predict-no*H0*2
  17999. -->
  18000. (S1 ^operator O2050 = 1.)
  18001. Firing rl*prefer*rvt*predict-yes*H0*1
  18002. -->
  18003. (S1 ^operator O2049 = 0.)
  18004. Firing prefer*rvt*predict-yes*H0
  18005. -->
  18006. Firing prefer*rvt*predict-no*H0
  18007. -->
  18008. Firing elaborate*copy-dir-to-output-link
  18009. -->
  18010. (I3 ^dir U +)
  18011. inner elaboration loop at bottom goal.
  18012. Retracting elaborate*copy-see-to-output-link
  18013. -->
  18014. (I3 ^see 1 +)
  18015. Retracting propose*predict-no
  18016. -->
  18017. (O2050 ^name predict-no +)
  18018. (S1 ^operator O2050 +)
  18019. Retracting propose*predict-yes
  18020. -->
  18021. (O2049 ^name predict-yes +)
  18022. (S1 ^operator O2049 +)
  18023. Retracting elaborate*reward*based*on*reward
  18024. -->
  18025. (R1028 ^value 1 +)
  18026. (R1 ^reward R1028 +)
  18027. Retracting elaborate*copy-dir-to-output-link
  18028. -->
  18029. (I3 ^dir U +)
  18030. Retracting rl*prefer*rvt*predict-no*H0*2
  18031. -->
  18032. (S1 ^operator O2050 = 1.)
  18033. Retracting rl*prefer*rvt*predict-yes*H0*1
  18034. -->
  18035. (S1 ^operator O2049 = 0.)
  18036. =>WM: (14406: S1 ^operator O2052 +)
  18037. =>WM: (14405: S1 ^operator O2051 +)
  18038. =>WM: (14404: O2052 ^name predict-no)
  18039. =>WM: (14403: O2051 ^name predict-yes)
  18040. =>WM: (14402: R1029 ^value 1)
  18041. =>WM: (14401: R1 ^reward R1029)
  18042. =>WM: (14400: I3 ^see 0)
  18043. <=WM: (14391: S1 ^operator O2049 +)
  18044. <=WM: (14392: S1 ^operator O2050 +)
  18045. <=WM: (14393: S1 ^operator O2050)
  18046. <=WM: (14386: R1 ^reward R1028)
  18047. <=WM: (14385: I3 ^see 1)
  18048. <=WM: (14389: O2050 ^name predict-no)
  18049. <=WM: (14388: O2049 ^name predict-yes)
  18050. <=WM: (14387: R1028 ^value 1)
  18051. --- Inner Elaboration Phase, active level 1 (S1) ---
  18052. Firing prefer*rvt*predict-yes*H0
  18053. -->
  18054. Firing rl*prefer*rvt*predict-yes*H0*1
  18055. -->
  18056. (S1 ^operator O2051 = 0.)
  18057. Firing prefer*rvt*predict-no*H0
  18058. -->
  18059. Firing rl*prefer*rvt*predict-no*H0*2
  18060. -->
  18061. (S1 ^operator O2052 = 1.)
  18062. inner elaboration loop at bottom goal.
  18063. Retracting rl*prefer*rvt*predict-no*H0*2
  18064. -->
  18065. (S1 ^operator O2050 = 1.)
  18066. Retracting rl*prefer*rvt*predict-yes*H0*1
  18067. -->
  18068. (S1 ^operator O2049 = 0.)
  18069. --- END Proposal Phase ---
  18070. --- Decision Phase ---
  18071. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18072. =>WM: (14407: S1 ^operator O2052)
  18073. 1026: O: O2052 (predict-no)
  18074. --- END Decision Phase ---
  18075. --- Application Phase ---
  18076. --- Firing Productions (PE) For State At Depth 1 ---
  18077. --- Inner Elaboration Phase, active level 1 (S1) ---
  18078. Firing apply*operator
  18079. -->
  18080. (I3 ^predict-no N1026 + :O )
  18081. Firing apply*operator*complete
  18082. -->
  18083. (I3 ^predict-no N1025 - :O )
  18084. inner elaboration loop at bottom goal.
  18085. --- Change Working Memory (PE) ---
  18086. =>WM: (14408: I3 ^predict-no N1026)
  18087. <=WM: (14395: N1025 ^status complete)
  18088. <=WM: (14394: I3 ^predict-no N1025)
  18089. --- Firing Productions (IE) For State At Depth 1 ---
  18090. --- Inner Elaboration Phase, active level 1 (S1) ---
  18091. Firing monitor*world
  18092. -->
  18093. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18094. --- Change Working Memory (IE) ---
  18095. --- END Application Phase ---
  18096. --- Output Phase ---
  18097. ENV: Agent did: predict-no for direction U in state State-A
  18098. In State-A moving U
  18099. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18100. predict error 0
  18101. dir: dir isR
  18102. --- END Output Phase ---
  18103. --- Input Phase ---
  18104. =>WM: (14412: I2 ^dir R)
  18105. =>WM: (14411: I2 ^reward 1)
  18106. =>WM: (14410: I2 ^see 0)
  18107. =>WM: (14409: N1026 ^status complete)
  18108. <=WM: (14398: I2 ^dir U)
  18109. <=WM: (14397: I2 ^reward 1)
  18110. <=WM: (14396: I2 ^see 0)
  18111. =>WM: (14413: I2 ^level-1 L1-root)
  18112. <=WM: (14399: I2 ^level-1 L1-root)
  18113. --- END Input Phase ---
  18114. --- Proposal Phase ---
  18115. --- Inner Elaboration Phase, active level 1 (S1) ---
  18116. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  18117. -->
  18118. (S1 ^operator O2051 = 0.8784117192151244)
  18119. Firing prefer*rvt*predict-yes*H0*5*H1
  18120. -->
  18121. Firing elaborate*copy-see-to-output-link
  18122. -->
  18123. (I3 ^see 0 +)
  18124. Firing elaborate*reward*based*on*reward
  18125. -->
  18126. (R1030 ^value 1 +)
  18127. (R1 ^reward R1030 +)
  18128. Firing propose*predict-yes
  18129. -->
  18130. (O2053 ^name predict-yes +)
  18131. (S1 ^operator O2053 +)
  18132. Firing propose*predict-no
  18133. -->
  18134. (O2054 ^name predict-no +)
  18135. (S1 ^operator O2054 +)
  18136. Firing rl*prefer*rvt*predict-no*H0*6
  18137. -->
  18138. (S1 ^operator O2052 = 0.9999921813761182)
  18139. Firing rl*prefer*rvt*predict-yes*H0*5
  18140. -->
  18141. (S1 ^operator O2051 = 0.1215983424730706)
  18142. Firing prefer*rvt*predict-yes*H0
  18143. -->
  18144. Firing prefer*rvt*predict-no*H0
  18145. -->
  18146. Firing elaborate*copy-dir-to-output-link
  18147. -->
  18148. (I3 ^dir R +)
  18149. inner elaboration loop at bottom goal.
  18150. Retracting elaborate*copy-see-to-output-link
  18151. -->
  18152. (I3 ^see 0 +)
  18153. Retracting propose*predict-no
  18154. -->
  18155. (O2052 ^name predict-no +)
  18156. (S1 ^operator O2052 +)
  18157. Retracting propose*predict-yes
  18158. -->
  18159. (O2051 ^name predict-yes +)
  18160. (S1 ^operator O2051 +)
  18161. Retracting elaborate*reward*based*on*reward
  18162. -->
  18163. (R1029 ^value 1 +)
  18164. (R1 ^reward R1029 +)
  18165. Retracting elaborate*copy-dir-to-output-link
  18166. -->
  18167. (I3 ^dir U +)
  18168. Retracting rl*prefer*rvt*predict-no*H0*2
  18169. -->
  18170. (S1 ^operator O2052 = 1.)
  18171. Retracting rl*prefer*rvt*predict-yes*H0*1
  18172. -->
  18173. (S1 ^operator O2051 = 0.)
  18174. =>WM: (14420: S1 ^operator O2054 +)
  18175. =>WM: (14419: S1 ^operator O2053 +)
  18176. =>WM: (14418: I3 ^dir R)
  18177. =>WM: (14417: O2054 ^name predict-no)
  18178. =>WM: (14416: O2053 ^name predict-yes)
  18179. =>WM: (14415: R1030 ^value 1)
  18180. =>WM: (14414: R1 ^reward R1030)
  18181. <=WM: (14405: S1 ^operator O2051 +)
  18182. <=WM: (14406: S1 ^operator O2052 +)
  18183. <=WM: (14407: S1 ^operator O2052)
  18184. <=WM: (14390: I3 ^dir U)
  18185. <=WM: (14401: R1 ^reward R1029)
  18186. <=WM: (14404: O2052 ^name predict-no)
  18187. <=WM: (14403: O2051 ^name predict-yes)
  18188. <=WM: (14402: R1029 ^value 1)
  18189. --- Inner Elaboration Phase, active level 1 (S1) ---
  18190. Firing prefer*rvt*predict-yes*H0
  18191. -->
  18192. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  18193. -->
  18194. (S1 ^operator O2053 = 0.8784117192151244)
  18195. Firing rl*prefer*rvt*predict-yes*H0*5
  18196. -->
  18197. (S1 ^operator O2053 = 0.1215983424730706)
  18198. Firing prefer*rvt*predict-yes*H0*5*H1
  18199. -->
  18200. Firing prefer*rvt*predict-no*H0
  18201. -->
  18202. Firing rl*prefer*rvt*predict-no*H0*6
  18203. -->
  18204. (S1 ^operator O2054 = 0.9999921813761182)
  18205. inner elaboration loop at bottom goal.
  18206. Retracting rl*prefer*rvt*predict-no*H0*6
  18207. -->
  18208. (S1 ^operator O2052 = 0.9999921813761182)
  18209. Retracting rl*prefer*rvt*predict-yes*H0*5
  18210. -->
  18211. (S1 ^operator O2051 = 0.1215983424730706)
  18212. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  18213. -->
  18214. (S1 ^operator O2051 = 0.8784117192151244)
  18215. --- END Proposal Phase ---
  18216. --- Decision Phase ---
  18217. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18218. =>WM: (14421: S1 ^operator O2053)
  18219. 1027: O: O2053 (predict-yes)
  18220. --- END Decision Phase ---
  18221. --- Application Phase ---
  18222. --- Firing Productions (PE) For State At Depth 1 ---
  18223. --- Inner Elaboration Phase, active level 1 (S1) ---
  18224. Firing apply*operator
  18225. -->
  18226. (I3 ^predict-yes N1027 + :O )
  18227. Firing apply*operator*complete
  18228. -->
  18229. (I3 ^predict-no N1026 - :O )
  18230. inner elaboration loop at bottom goal.
  18231. --- Change Working Memory (PE) ---
  18232. =>WM: (14422: I3 ^predict-yes N1027)
  18233. <=WM: (14409: N1026 ^status complete)
  18234. <=WM: (14408: I3 ^predict-no N1026)
  18235. --- Firing Productions (IE) For State At Depth 1 ---
  18236. --- Inner Elaboration Phase, active level 1 (S1) ---
  18237. Firing monitor*world
  18238. -->
  18239. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18240. --- Change Working Memory (IE) ---
  18241. --- END Application Phase ---
  18242. --- Output Phase ---
  18243. ENV: Agent did: predict-yes for direction R in state State-A
  18244. In State-A moving R
  18245. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18246. predict error 0
  18247. dir: dir isU
  18248. --- END Output Phase ---
  18249. --- Input Phase ---
  18250. =>WM: (14426: I2 ^dir U)
  18251. =>WM: (14425: I2 ^reward 1)
  18252. =>WM: (14424: I2 ^see 1)
  18253. =>WM: (14423: N1027 ^status complete)
  18254. <=WM: (14412: I2 ^dir R)
  18255. <=WM: (14411: I2 ^reward 1)
  18256. <=WM: (14410: I2 ^see 0)
  18257. =>WM: (14427: I2 ^level-1 R1-root)
  18258. <=WM: (14413: I2 ^level-1 L1-root)
  18259. --- END Input Phase ---
  18260. --- Proposal Phase ---
  18261. --- Inner Elaboration Phase, active level 1 (S1) ---
  18262. Firing elaborate*copy-see-to-output-link
  18263. -->
  18264. (I3 ^see 1 +)
  18265. Firing elaborate*reward*based*on*reward
  18266. -->
  18267. (R1031 ^value 1 +)
  18268. (R1 ^reward R1031 +)
  18269. Firing propose*predict-yes
  18270. -->
  18271. (O2055 ^name predict-yes +)
  18272. (S1 ^operator O2055 +)
  18273. Firing propose*predict-no
  18274. -->
  18275. (O2056 ^name predict-no +)
  18276. (S1 ^operator O2056 +)
  18277. Firing rl*prefer*rvt*predict-no*H0*2
  18278. -->
  18279. (S1 ^operator O2054 = 1.)
  18280. Firing rl*prefer*rvt*predict-yes*H0*1
  18281. -->
  18282. (S1 ^operator O2053 = 0.)
  18283. Firing prefer*rvt*predict-yes*H0
  18284. -->
  18285. Firing prefer*rvt*predict-no*H0
  18286. -->
  18287. Firing elaborate*copy-dir-to-output-link
  18288. -->
  18289. (I3 ^dir U +)
  18290. inner elaboration loop at bottom goal.
  18291. Retracting elaborate*copy-see-to-output-link
  18292. -->
  18293. (I3 ^see 0 +)
  18294. Retracting propose*predict-no
  18295. -->
  18296. (O2054 ^name predict-no +)
  18297. (S1 ^operator O2054 +)
  18298. Retracting propose*predict-yes
  18299. -->
  18300. (O2053 ^name predict-yes +)
  18301. (S1 ^operator O2053 +)
  18302. Retracting elaborate*reward*based*on*reward
  18303. -->
  18304. (R1030 ^value 1 +)
  18305. (R1 ^reward R1030 +)
  18306. Retracting elaborate*copy-dir-to-output-link
  18307. -->
  18308. (I3 ^dir R +)
  18309. Retracting rl*prefer*rvt*predict-no*H0*6
  18310. -->
  18311. (S1 ^operator O2054 = 0.9999921813761182)
  18312. Retracting rl*prefer*rvt*predict-yes*H0*5
  18313. -->
  18314. (S1 ^operator O2053 = 0.1215983424730706)
  18315. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  18316. -->
  18317. (S1 ^operator O2053 = 0.8784117192151244)
  18318. =>WM: (14435: S1 ^operator O2056 +)
  18319. =>WM: (14434: S1 ^operator O2055 +)
  18320. =>WM: (14433: I3 ^dir U)
  18321. =>WM: (14432: O2056 ^name predict-no)
  18322. =>WM: (14431: O2055 ^name predict-yes)
  18323. =>WM: (14430: R1031 ^value 1)
  18324. =>WM: (14429: R1 ^reward R1031)
  18325. =>WM: (14428: I3 ^see 1)
  18326. <=WM: (14419: S1 ^operator O2053 +)
  18327. <=WM: (14421: S1 ^operator O2053)
  18328. <=WM: (14420: S1 ^operator O2054 +)
  18329. <=WM: (14418: I3 ^dir R)
  18330. <=WM: (14414: R1 ^reward R1030)
  18331. <=WM: (14400: I3 ^see 0)
  18332. <=WM: (14417: O2054 ^name predict-no)
  18333. <=WM: (14416: O2053 ^name predict-yes)
  18334. <=WM: (14415: R1030 ^value 1)
  18335. --- Inner Elaboration Phase, active level 1 (S1) ---
  18336. Firing prefer*rvt*predict-yes*H0
  18337. -->
  18338. Firing rl*prefer*rvt*predict-yes*H0*1
  18339. -->
  18340. (S1 ^operator O2055 = 0.)
  18341. Firing prefer*rvt*predict-no*H0
  18342. -->
  18343. Firing rl*prefer*rvt*predict-no*H0*2
  18344. -->
  18345. (S1 ^operator O2056 = 1.)
  18346. inner elaboration loop at bottom goal.
  18347. Retracting rl*prefer*rvt*predict-no*H0*2
  18348. -->
  18349. (S1 ^operator O2054 = 1.)
  18350. Retracting rl*prefer*rvt*predict-yes*H0*1
  18351. -->
  18352. (S1 ^operator O2053 = 0.)
  18353. --- END Proposal Phase ---
  18354. --- Decision Phase ---
  18355. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.868132,0.115111)
  18356. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465484 0.412928 0.878412 -> 0.465483 0.412928 0.878411(R,m,v=1,1,0)
  18357. =>WM: (14436: S1 ^operator O2056)
  18358. 1028: O: O2056 (predict-no)
  18359. --- END Decision Phase ---
  18360. --- Application Phase ---
  18361. --- Firing Productions (PE) For State At Depth 1 ---
  18362. --- Inner Elaboration Phase, active level 1 (S1) ---
  18363. Firing apply*operator
  18364. -->
  18365. (I3 ^predict-no N1028 + :O )
  18366. Firing apply*operator*complete
  18367. -->
  18368. (I3 ^predict-yes N1027 - :O )
  18369. inner elaboration loop at bottom goal.
  18370. --- Change Working Memory (PE) ---
  18371. =>WM: (14437: I3 ^predict-no N1028)
  18372. <=WM: (14423: N1027 ^status complete)
  18373. <=WM: (14422: I3 ^predict-yes N1027)
  18374. --- Firing Productions (IE) For State At Depth 1 ---
  18375. --- Inner Elaboration Phase, active level 1 (S1) ---
  18376. Firing monitor*world
  18377. -->
  18378. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18379. --- Change Working Memory (IE) ---
  18380. --- END Application Phase ---
  18381. --- Output Phase ---
  18382. ENV: Agent did: predict-no for direction U in state State-B
  18383. In State-B moving U
  18384. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  18385. predict error 0
  18386. dir: dir isL
  18387. --- END Output Phase ---
  18388. --- Input Phase ---
  18389. =>WM: (14441: I2 ^dir L)
  18390. =>WM: (14440: I2 ^reward 1)
  18391. =>WM: (14439: I2 ^see 0)
  18392. =>WM: (14438: N1028 ^status complete)
  18393. <=WM: (14426: I2 ^dir U)
  18394. <=WM: (14425: I2 ^reward 1)
  18395. <=WM: (14424: I2 ^see 1)
  18396. =>WM: (14442: I2 ^level-1 R1-root)
  18397. <=WM: (14427: I2 ^level-1 R1-root)
  18398. --- END Input Phase ---
  18399. --- Proposal Phase ---
  18400. --- Inner Elaboration Phase, active level 1 (S1) ---
  18401. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  18402. -->
  18403. (S1 ^operator O2056 = -0.168718511744511)
  18404. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  18405. -->
  18406. (S1 ^operator O2055 = 0.6092814566217208)
  18407. Firing prefer*rvt*predict-no*H0*4*H1
  18408. -->
  18409. Firing prefer*rvt*predict-yes*H0*3*H1
  18410. -->
  18411. Firing elaborate*copy-see-to-output-link
  18412. -->
  18413. (I3 ^see 0 +)
  18414. Firing elaborate*reward*based*on*reward
  18415. -->
  18416. (R1032 ^value 1 +)
  18417. (R1 ^reward R1032 +)
  18418. Firing propose*predict-yes
  18419. -->
  18420. (O2057 ^name predict-yes +)
  18421. (S1 ^operator O2057 +)
  18422. Firing propose*predict-no
  18423. -->
  18424. (O2058 ^name predict-no +)
  18425. (S1 ^operator O2058 +)
  18426. Firing rl*prefer*rvt*predict-no*H0*4
  18427. -->
  18428. (S1 ^operator O2056 = 0.314502196170351)
  18429. Firing rl*prefer*rvt*predict-yes*H0*3
  18430. -->
  18431. (S1 ^operator O2055 = 0.3907628451116619)
  18432. Firing prefer*rvt*predict-yes*H0
  18433. -->
  18434. Firing prefer*rvt*predict-no*H0
  18435. -->
  18436. Firing elaborate*copy-dir-to-output-link
  18437. -->
  18438. (I3 ^dir L +)
  18439. inner elaboration loop at bottom goal.
  18440. Retracting elaborate*copy-see-to-output-link
  18441. -->
  18442. (I3 ^see 1 +)
  18443. Retracting propose*predict-no
  18444. -->
  18445. (O2056 ^name predict-no +)
  18446. (S1 ^operator O2056 +)
  18447. Retracting propose*predict-yes
  18448. -->
  18449. (O2055 ^name predict-yes +)
  18450. (S1 ^operator O2055 +)
  18451. Retracting elaborate*reward*based*on*reward
  18452. -->
  18453. (R1031 ^value 1 +)
  18454. (R1 ^reward R1031 +)
  18455. Retracting elaborate*copy-dir-to-output-link
  18456. -->
  18457. (I3 ^dir U +)
  18458. Retracting rl*prefer*rvt*predict-no*H0*2
  18459. -->
  18460. (S1 ^operator O2056 = 1.)
  18461. Retracting rl*prefer*rvt*predict-yes*H0*1
  18462. -->
  18463. (S1 ^operator O2055 = 0.)
  18464. =>WM: (14450: S1 ^operator O2058 +)
  18465. =>WM: (14449: S1 ^operator O2057 +)
  18466. =>WM: (14448: I3 ^dir L)
  18467. =>WM: (14447: O2058 ^name predict-no)
  18468. =>WM: (14446: O2057 ^name predict-yes)
  18469. =>WM: (14445: R1032 ^value 1)
  18470. =>WM: (14444: R1 ^reward R1032)
  18471. =>WM: (14443: I3 ^see 0)
  18472. <=WM: (14434: S1 ^operator O2055 +)
  18473. <=WM: (14435: S1 ^operator O2056 +)
  18474. <=WM: (14436: S1 ^operator O2056)
  18475. <=WM: (14433: I3 ^dir U)
  18476. <=WM: (14429: R1 ^reward R1031)
  18477. <=WM: (14428: I3 ^see 1)
  18478. <=WM: (14432: O2056 ^name predict-no)
  18479. <=WM: (14431: O2055 ^name predict-yes)
  18480. <=WM: (14430: R1031 ^value 1)
  18481. --- Inner Elaboration Phase, active level 1 (S1) ---
  18482. Firing prefer*rvt*predict-yes*H0
  18483. -->
  18484. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  18485. -->
  18486. (S1 ^operator O2057 = 0.6092814566217208)
  18487. Firing rl*prefer*rvt*predict-yes*H0*3
  18488. -->
  18489. (S1 ^operator O2057 = 0.3907628451116619)
  18490. Firing prefer*rvt*predict-yes*H0*3*H1
  18491. -->
  18492. Firing prefer*rvt*predict-no*H0
  18493. -->
  18494. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  18495. -->
  18496. (S1 ^operator O2058 = -0.168718511744511)
  18497. Firing rl*prefer*rvt*predict-no*H0*4
  18498. -->
  18499. (S1 ^operator O2058 = 0.314502196170351)
  18500. Firing prefer*rvt*predict-no*H0*4*H1
  18501. -->
  18502. inner elaboration loop at bottom goal.
  18503. Retracting rl*prefer*rvt*predict-no*H0*4
  18504. -->
  18505. (S1 ^operator O2056 = 0.314502196170351)
  18506. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  18507. -->
  18508. (S1 ^operator O2056 = -0.168718511744511)
  18509. Retracting rl*prefer*rvt*predict-yes*H0*3
  18510. -->
  18511. (S1 ^operator O2055 = 0.3907628451116619)
  18512. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  18513. -->
  18514. (S1 ^operator O2055 = 0.6092814566217208)
  18515. --- END Proposal Phase ---
  18516. --- Decision Phase ---
  18517. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  18518. =>WM: (14451: S1 ^operator O2057)
  18519. 1029: O: O2057 (predict-yes)
  18520. --- END Decision Phase ---
  18521. --- Application Phase ---
  18522. --- Firing Productions (PE) For State At Depth 1 ---
  18523. --- Inner Elaboration Phase, active level 1 (S1) ---
  18524. Firing apply*operator
  18525. -->
  18526. (I3 ^predict-yes N1029 + :O )
  18527. Firing apply*operator*complete
  18528. -->
  18529. (I3 ^predict-no N1028 - :O )
  18530. inner elaboration loop at bottom goal.
  18531. --- Change Working Memory (PE) ---
  18532. =>WM: (14452: I3 ^predict-yes N1029)
  18533. <=WM: (14438: N1028 ^status complete)
  18534. <=WM: (14437: I3 ^predict-no N1028)
  18535. --- Firing Productions (IE) For State At Depth 1 ---
  18536. --- Inner Elaboration Phase, active level 1 (S1) ---
  18537. Firing monitor*world
  18538. -->
  18539. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18540. --- Change Working Memory (IE) ---
  18541. --- END Application Phase ---
  18542. --- Output Phase ---
  18543. ENV: Agent did: predict-yes for direction L in state State-B
  18544. In State-B moving L
  18545. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  18546. predict error 0
  18547. dir: dir isL
  18548. --- END Output Phase ---
  18549. --- Input Phase ---
  18550. =>WM: (14456: I2 ^dir L)
  18551. =>WM: (14455: I2 ^reward 1)
  18552. =>WM: (14454: I2 ^see 1)
  18553. =>WM: (14453: N1029 ^status complete)
  18554. <=WM: (14441: I2 ^dir L)
  18555. <=WM: (14440: I2 ^reward 1)
  18556. <=WM: (14439: I2 ^see 0)
  18557. =>WM: (14457: I2 ^level-1 L1-root)
  18558. <=WM: (14442: I2 ^level-1 R1-root)
  18559. --- END Input Phase ---
  18560. --- Proposal Phase ---
  18561. --- Inner Elaboration Phase, active level 1 (S1) ---
  18562. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  18563. -->
  18564. (S1 ^operator O2057 = -0.2062723012911647)
  18565. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  18566. -->
  18567. (S1 ^operator O2058 = 0.6855237439964433)
  18568. Firing prefer*rvt*predict-no*H0*4*H1
  18569. -->
  18570. Firing prefer*rvt*predict-yes*H0*3*H1
  18571. -->
  18572. Firing elaborate*copy-see-to-output-link
  18573. -->
  18574. (I3 ^see 1 +)
  18575. Firing elaborate*reward*based*on*reward
  18576. -->
  18577. (R1033 ^value 1 +)
  18578. (R1 ^reward R1033 +)
  18579. Firing propose*predict-yes
  18580. -->
  18581. (O2059 ^name predict-yes +)
  18582. (S1 ^operator O2059 +)
  18583. Firing propose*predict-no
  18584. -->
  18585. (O2060 ^name predict-no +)
  18586. (S1 ^operator O2060 +)
  18587. Firing rl*prefer*rvt*predict-no*H0*4
  18588. -->
  18589. (S1 ^operator O2058 = 0.314502196170351)
  18590. Firing rl*prefer*rvt*predict-yes*H0*3
  18591. -->
  18592. (S1 ^operator O2057 = 0.3907628451116619)
  18593. Firing prefer*rvt*predict-yes*H0
  18594. -->
  18595. Firing prefer*rvt*predict-no*H0
  18596. -->
  18597. Firing elaborate*copy-dir-to-output-link
  18598. -->
  18599. (I3 ^dir L +)
  18600. inner elaboration loop at bottom goal.
  18601. Retracting elaborate*copy-see-to-output-link
  18602. -->
  18603. (I3 ^see 0 +)
  18604. Retracting propose*predict-no
  18605. -->
  18606. (O2058 ^name predict-no +)
  18607. (S1 ^operator O2058 +)
  18608. Retracting propose*predict-yes
  18609. -->
  18610. (O2057 ^name predict-yes +)
  18611. (S1 ^operator O2057 +)
  18612. Retracting elaborate*reward*based*on*reward
  18613. -->
  18614. (R1032 ^value 1 +)
  18615. (R1 ^reward R1032 +)
  18616. Retracting elaborate*copy-dir-to-output-link
  18617. -->
  18618. (I3 ^dir L +)
  18619. Retracting rl*prefer*rvt*predict-no*H0*4
  18620. -->
  18621. (S1 ^operator O2058 = 0.314502196170351)
  18622. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  18623. -->
  18624. (S1 ^operator O2058 = -0.168718511744511)
  18625. Retracting rl*prefer*rvt*predict-yes*H0*3
  18626. -->
  18627. (S1 ^operator O2057 = 0.3907628451116619)
  18628. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  18629. -->
  18630. (S1 ^operator O2057 = 0.6092814566217208)
  18631. =>WM: (14464: S1 ^operator O2060 +)
  18632. =>WM: (14463: S1 ^operator O2059 +)
  18633. =>WM: (14462: O2060 ^name predict-no)
  18634. =>WM: (14461: O2059 ^name predict-yes)
  18635. =>WM: (14460: R1033 ^value 1)
  18636. =>WM: (14459: R1 ^reward R1033)
  18637. =>WM: (14458: I3 ^see 1)
  18638. <=WM: (14449: S1 ^operator O2057 +)
  18639. <=WM: (14451: S1 ^operator O2057)
  18640. <=WM: (14450: S1 ^operator O2058 +)
  18641. <=WM: (14444: R1 ^reward R1032)
  18642. <=WM: (14443: I3 ^see 0)
  18643. <=WM: (14447: O2058 ^name predict-no)
  18644. <=WM: (14446: O2057 ^name predict-yes)
  18645. <=WM: (14445: R1032 ^value 1)
  18646. --- Inner Elaboration Phase, active level 1 (S1) ---
  18647. Firing prefer*rvt*predict-yes*H0
  18648. -->
  18649. Firing rl*prefer*rvt*predict-yes*H0*3
  18650. -->
  18651. (S1 ^operator O2059 = 0.3907628451116619)
  18652. Firing prefer*rvt*predict-yes*H0*3*H1
  18653. -->
  18654. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  18655. -->
  18656. (S1 ^operator O2059 = -0.2062723012911647)
  18657. Firing prefer*rvt*predict-no*H0
  18658. -->
  18659. Firing rl*prefer*rvt*predict-no*H0*4
  18660. -->
  18661. (S1 ^operator O2060 = 0.314502196170351)
  18662. Firing prefer*rvt*predict-no*H0*4*H1
  18663. -->
  18664. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  18665. -->
  18666. (S1 ^operator O2060 = 0.6855237439964433)
  18667. inner elaboration loop at bottom goal.
  18668. Retracting rl*prefer*rvt*predict-no*H0*4
  18669. -->
  18670. (S1 ^operator O2058 = 0.314502196170351)
  18671. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  18672. -->
  18673. (S1 ^operator O2058 = 0.6855237439964433)
  18674. Retracting rl*prefer*rvt*predict-yes*H0*3
  18675. -->
  18676. (S1 ^operator O2057 = 0.3907628451116619)
  18677. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  18678. -->
  18679. (S1 ^operator O2057 = -0.2062723012911647)
  18680. --- END Proposal Phase ---
  18681. --- Decision Phase ---
  18682. RL update rl*prefer*rvt*predict-yes*H0*3 0.472311 -0.0815481 0.390763 -> 0.472308 -0.0815487 0.390759(R,m,v=1,0.945783,0.0515882)
  18683. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527727 0.0815547 0.609281 -> 0.527723 0.0815541 0.609277(R,m,v=1,1,0)
  18684. =>WM: (14465: S1 ^operator O2060)
  18685. 1030: O: O2060 (predict-no)
  18686. --- END Decision Phase ---
  18687. --- Application Phase ---
  18688. --- Firing Productions (PE) For State At Depth 1 ---
  18689. --- Inner Elaboration Phase, active level 1 (S1) ---
  18690. Firing apply*operator
  18691. -->
  18692. (I3 ^predict-no N1030 + :O )
  18693. Firing apply*operator*complete
  18694. -->
  18695. (I3 ^predict-yes N1029 - :O )
  18696. inner elaboration loop at bottom goal.
  18697. --- Change Working Memory (PE) ---
  18698. =>WM: (14466: I3 ^predict-no N1030)
  18699. <=WM: (14453: N1029 ^status complete)
  18700. <=WM: (14452: I3 ^predict-yes N1029)
  18701. --- Firing Productions (IE) For State At Depth 1 ---
  18702. --- Inner Elaboration Phase, active level 1 (S1) ---
  18703. Firing monitor*world
  18704. -->
  18705. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  18706. --- Change Working Memory (IE) ---
  18707. --- END Application Phase ---
  18708. --- Output Phase ---
  18709. ENV: Agent did: predict-no for direction L in state State-A
  18710. In State-A moving L
  18711. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  18712. predict error 0
  18713. dir: dir isR
  18714. --- END Output Phase ---
  18715. --- Input Phase ---
  18716. =>WM: (14470: I2 ^dir R)
  18717. =>WM: (14469: I2 ^reward 1)
  18718. =>WM: (14468: I2 ^see 0)
  18719. =>WM: (14467: N1030 ^status complete)
  18720. <=WM: (14456: I2 ^dir L)
  18721. <=WM: (14455: I2 ^reward 1)
  18722. <=WM: (14454: I2 ^see 1)
  18723. =>WM: (14471: I2 ^level-1 L0-root)
  18724. <=WM: (14457: I2 ^level-1 L1-root)
  18725. --- END Input Phase ---
  18726. --- Proposal Phase ---
  18727. --- Inner Elaboration Phase, active level 1 (S1) ---
  18728. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  18729. -->
  18730. (S1 ^operator O2059 = 0.8783968442404908)
  18731. Firing prefer*rvt*predict-yes*H0*5*H1
  18732. -->
  18733. Firing elaborate*copy-see-to-output-link
  18734. -->
  18735. (I3 ^see 0 +)
  18736. Firing elaborate*reward*based*on*reward
  18737. -->
  18738. (R1034 ^value 1 +)
  18739. (R1 ^reward R1034 +)
  18740. Firing propose*predict-yes
  18741. -->
  18742. (O2061 ^name predict-yes +)
  18743. (S1 ^operator O2061 +)
  18744. Firing propose*predict-no
  18745. -->
  18746. (O2062 ^name predict-no +)
  18747. (S1 ^operator O2062 +)
  18748. Firing rl*prefer*rvt*predict-no*H0*6
  18749. -->
  18750. (S1 ^operator O2060 = 0.9999921813761182)
  18751. Firing rl*prefer*rvt*predict-yes*H0*5
  18752. -->
  18753. (S1 ^operator O2059 = 0.1215975315706407)
  18754. Firing prefer*rvt*predict-yes*H0
  18755. -->
  18756. Firing prefer*rvt*predict-no*H0
  18757. -->
  18758. Firing elaborate*copy-dir-to-output-link
  18759. -->
  18760. (I3 ^dir R +)
  18761. inner elaboration loop at bottom goal.
  18762. Retracting elaborate*copy-see-to-output-link
  18763. -->
  18764. (I3 ^see 1 +)
  18765. Retracting propose*predict-no
  18766. -->
  18767. (O2060 ^name predict-no +)
  18768. (S1 ^operator O2060 +)
  18769. Retracting propose*predict-yes
  18770. -->
  18771. (O2059 ^name predict-yes +)
  18772. (S1 ^operator O2059 +)
  18773. Retracting elaborate*reward*based*on*reward
  18774. -->
  18775. (R1033 ^value 1 +)
  18776. (R1 ^reward R1033 +)
  18777. Retracting elaborate*copy-dir-to-output-link
  18778. -->
  18779. (I3 ^dir L +)
  18780. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  18781. -->
  18782. (S1 ^operator O2060 = 0.6855237439964433)
  18783. Retracting rl*prefer*rvt*predict-no*H0*4
  18784. -->
  18785. (S1 ^operator O2060 = 0.314502196170351)
  18786. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  18787. -->
  18788. (S1 ^operator O2059 = -0.2062723012911647)
  18789. Retracting rl*prefer*rvt*predict-yes*H0*3
  18790. -->
  18791. (S1 ^operator O2059 = 0.3907592209442947)
  18792. =>WM: (14479: S1 ^operator O2062 +)
  18793. =>WM: (14478: S1 ^operator O2061 +)
  18794. =>WM: (14477: I3 ^dir R)
  18795. =>WM: (14476: O2062 ^name predict-no)
  18796. =>WM: (14475: O2061 ^name predict-yes)
  18797. =>WM: (14474: R1034 ^value 1)
  18798. =>WM: (14473: R1 ^reward R1034)
  18799. =>WM: (14472: I3 ^see 0)
  18800. <=WM: (14463: S1 ^operator O2059 +)
  18801. <=WM: (14464: S1 ^operator O2060 +)
  18802. <=WM: (14465: S1 ^operator O2060)
  18803. <=WM: (14448: I3 ^dir L)
  18804. <=WM: (14459: R1 ^reward R1033)
  18805. <=WM: (14458: I3 ^see 1)
  18806. <=WM: (14462: O2060 ^name predict-no)
  18807. <=WM: (14461: O2059 ^name predict-yes)
  18808. <=WM: (14460: R1033 ^value 1)
  18809. --- Inner Elaboration Phase, active level 1 (S1) ---
  18810. Firing prefer*rvt*predict-yes*H0
  18811. -->
  18812. Firing rl*prefer*rvt*predict-yes*H0*5
  18813. -->
  18814. (S1 ^operator O2061 = 0.1215975315706407)
  18815. Firing prefer*rvt*predict-yes*H0*5*H1
  18816. -->
  18817. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  18818. -->
  18819. (S1 ^operator O2061 = 0.8783968442404908)
  18820. Firing prefer*rvt*predict-no*H0
  18821. -->
  18822. Firing rl*prefer*rvt*predict-no*H0*6
  18823. -->
  18824. (S1 ^operator O2062 = 0.9999921813761182)
  18825. inner elaboration loop at bottom goal.
  18826. Retracting rl*prefer*rvt*predict-no*H0*6
  18827. -->
  18828. (S1 ^operator O2060 = 0.9999921813761182)
  18829. Retracting rl*prefer*rvt*predict-yes*H0*5
  18830. -->
  18831. (S1 ^operator O2059 = 0.1215975315706407)
  18832. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  18833. -->
  18834. (S1 ^operator O2059 = 0.8783968442404908)
  18835. --- END Proposal Phase ---
  18836. --- Decision Phase ---
  18837. RL update rl*prefer*rvt*predict-no*H0*4 0.478551 -0.164048 0.314502 -> 0.478549 -0.164049 0.3145(R,m,v=1,0.925,0.0698113)
  18838. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521473 0.164051 0.685524 -> 0.521471 0.164051 0.685521(R,m,v=1,1,0)
  18839. =>WM: (14480: S1 ^operator O2061)
  18840. 1031: O: O2061 (predict-yes)
  18841. --- END Decision Phase ---
  18842. --- Application Phase ---
  18843. --- Firing Productions (PE) For State At Depth 1 ---
  18844. --- Inner Elaboration Phase, active level 1 (S1) ---
  18845. Firing apply*operator
  18846. -->
  18847. (I3 ^predict-yes N1031 + :O )
  18848. Firing apply*operator*complete
  18849. -->
  18850. (I3 ^predict-no N1030 - :O )
  18851. inner elaboration loop at bottom goal.
  18852. --- Change Working Memory (PE) ---
  18853. =>WM: (14481: I3 ^predict-yes N1031)
  18854. <=WM: (14467: N1030 ^status complete)
  18855. <=WM: (14466: I3 ^predict-no N1030)
  18856. --- Firing Productions (IE) For State At Depth 1 ---
  18857. --- Inner Elaboration Phase, active level 1 (S1) ---
  18858. Firing monitor*world
  18859. -->
  18860. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  18861. --- Change Working Memory (IE) ---
  18862. --- END Application Phase ---
  18863. --- Output Phase ---
  18864. ENV: Agent did: predict-yes for direction R in state State-A
  18865. In State-A moving R
  18866. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  18867. predict error 0
  18868. dir: dir isU
  18869. --- END Output Phase ---
  18870. --- Input Phase ---
  18871. =>WM: (14485: I2 ^dir U)
  18872. =>WM: (14484: I2 ^reward 1)
  18873. =>WM: (14483: I2 ^see 1)
  18874. =>WM: (14482: N1031 ^status complete)
  18875. <=WM: (14470: I2 ^dir R)
  18876. <=WM: (14469: I2 ^reward 1)
  18877. <=WM: (14468: I2 ^see 0)
  18878. =>WM: (14486: I2 ^level-1 R1-root)
  18879. <=WM: (14471: I2 ^level-1 L0-root)
  18880. --- END Input Phase ---
  18881. --- Proposal Phase ---
  18882. --- Inner Elaboration Phase, active level 1 (S1) ---
  18883. Firing elaborate*copy-see-to-output-link
  18884. -->
  18885. (I3 ^see 1 +)
  18886. Firing elaborate*reward*based*on*reward
  18887. -->
  18888. (R1035 ^value 1 +)
  18889. (R1 ^reward R1035 +)
  18890. Firing propose*predict-yes
  18891. -->
  18892. (O2063 ^name predict-yes +)
  18893. (S1 ^operator O2063 +)
  18894. Firing propose*predict-no
  18895. -->
  18896. (O2064 ^name predict-no +)
  18897. (S1 ^operator O2064 +)
  18898. Firing rl*prefer*rvt*predict-no*H0*2
  18899. -->
  18900. (S1 ^operator O2062 = 1.)
  18901. Firing rl*prefer*rvt*predict-yes*H0*1
  18902. -->
  18903. (S1 ^operator O2061 = 0.)
  18904. Firing prefer*rvt*predict-yes*H0
  18905. -->
  18906. Firing prefer*rvt*predict-no*H0
  18907. -->
  18908. Firing elaborate*copy-dir-to-output-link
  18909. -->
  18910. (I3 ^dir U +)
  18911. inner elaboration loop at bottom goal.
  18912. Retracting elaborate*copy-see-to-output-link
  18913. -->
  18914. (I3 ^see 0 +)
  18915. Retracting propose*predict-no
  18916. -->
  18917. (O2062 ^name predict-no +)
  18918. (S1 ^operator O2062 +)
  18919. Retracting propose*predict-yes
  18920. -->
  18921. (O2061 ^name predict-yes +)
  18922. (S1 ^operator O2061 +)
  18923. Retracting elaborate*reward*based*on*reward
  18924. -->
  18925. (R1034 ^value 1 +)
  18926. (R1 ^reward R1034 +)
  18927. Retracting elaborate*copy-dir-to-output-link
  18928. -->
  18929. (I3 ^dir R +)
  18930. Retracting rl*prefer*rvt*predict-no*H0*6
  18931. -->
  18932. (S1 ^operator O2062 = 0.9999921813761182)
  18933. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  18934. -->
  18935. (S1 ^operator O2061 = 0.8783968442404908)
  18936. Retracting rl*prefer*rvt*predict-yes*H0*5
  18937. -->
  18938. (S1 ^operator O2061 = 0.1215975315706407)
  18939. =>WM: (14494: S1 ^operator O2064 +)
  18940. =>WM: (14493: S1 ^operator O2063 +)
  18941. =>WM: (14492: I3 ^dir U)
  18942. =>WM: (14491: O2064 ^name predict-no)
  18943. =>WM: (14490: O2063 ^name predict-yes)
  18944. =>WM: (14489: R1035 ^value 1)
  18945. =>WM: (14488: R1 ^reward R1035)
  18946. =>WM: (14487: I3 ^see 1)
  18947. <=WM: (14478: S1 ^operator O2061 +)
  18948. <=WM: (14480: S1 ^operator O2061)
  18949. <=WM: (14479: S1 ^operator O2062 +)
  18950. <=WM: (14477: I3 ^dir R)
  18951. <=WM: (14473: R1 ^reward R1034)
  18952. <=WM: (14472: I3 ^see 0)
  18953. <=WM: (14476: O2062 ^name predict-no)
  18954. <=WM: (14475: O2061 ^name predict-yes)
  18955. <=WM: (14474: R1034 ^value 1)
  18956. --- Inner Elaboration Phase, active level 1 (S1) ---
  18957. Firing prefer*rvt*predict-yes*H0
  18958. -->
  18959. Firing rl*prefer*rvt*predict-yes*H0*1
  18960. -->
  18961. (S1 ^operator O2063 = 0.)
  18962. Firing prefer*rvt*predict-no*H0
  18963. -->
  18964. Firing rl*prefer*rvt*predict-no*H0*2
  18965. -->
  18966. (S1 ^operator O2064 = 1.)
  18967. inner elaboration loop at bottom goal.
  18968. Retracting rl*prefer*rvt*predict-no*H0*2
  18969. -->
  18970. (S1 ^operator O2062 = 1.)
  18971. Retracting rl*prefer*rvt*predict-yes*H0*1
  18972. -->
  18973. (S1 ^operator O2061 = 0.)
  18974. --- END Proposal Phase ---
  18975. --- Decision Phase ---
  18976. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121598(R,m,v=1,0.868852,0.114574)
  18977. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465472 0.412925 0.878397 -> 0.465472 0.412925 0.878397(R,m,v=1,1,0)
  18978. =>WM: (14495: S1 ^operator O2064)
  18979. 1032: O: O2064 (predict-no)
  18980. --- END Decision Phase ---
  18981. --- Application Phase ---
  18982. --- Firing Productions (PE) For State At Depth 1 ---
  18983. --- Inner Elaboration Phase, active level 1 (S1) ---
  18984. Firing apply*operator
  18985. -->
  18986. (I3 ^predict-no N1032 + :O )
  18987. Firing apply*operator*complete
  18988. -->
  18989. (I3 ^predict-yes N1031 - :O )
  18990. inner elaboration loop at bottom goal.
  18991. --- Change Working Memory (PE) ---
  18992. =>WM: (14496: I3 ^predict-no N1032)
  18993. <=WM: (14482: N1031 ^status complete)
  18994. <=WM: (14481: I3 ^predict-yes N1031)
  18995. --- Firing Productions (IE) For State At Depth 1 ---
  18996. --- Inner Elaboration Phase, active level 1 (S1) ---
  18997. Firing monitor*world
  18998. -->
  18999. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19000. --- Change Working Memory (IE) ---
  19001. --- END Application Phase ---
  19002. --- Output Phase ---
  19003. ENV: Agent did: predict-no for direction U in state State-B
  19004. In State-B moving U
  19005. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19006. predict error 0
  19007. dir: dir isR
  19008. --- END Output Phase ---
  19009. --- Input Phase ---
  19010. =>WM: (14500: I2 ^dir R)
  19011. =>WM: (14499: I2 ^reward 1)
  19012. =>WM: (14498: I2 ^see 0)
  19013. =>WM: (14497: N1032 ^status complete)
  19014. <=WM: (14485: I2 ^dir U)
  19015. <=WM: (14484: I2 ^reward 1)
  19016. <=WM: (14483: I2 ^see 1)
  19017. =>WM: (14501: I2 ^level-1 R1-root)
  19018. <=WM: (14486: I2 ^level-1 R1-root)
  19019. --- END Input Phase ---
  19020. --- Proposal Phase ---
  19021. --- Inner Elaboration Phase, active level 1 (S1) ---
  19022. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  19023. -->
  19024. (S1 ^operator O2063 = -0.04253361215288998)
  19025. Firing prefer*rvt*predict-yes*H0*5*H1
  19026. -->
  19027. Firing elaborate*copy-see-to-output-link
  19028. -->
  19029. (I3 ^see 0 +)
  19030. Firing elaborate*reward*based*on*reward
  19031. -->
  19032. (R1036 ^value 1 +)
  19033. (R1 ^reward R1036 +)
  19034. Firing propose*predict-yes
  19035. -->
  19036. (O2065 ^name predict-yes +)
  19037. (S1 ^operator O2065 +)
  19038. Firing propose*predict-no
  19039. -->
  19040. (O2066 ^name predict-no +)
  19041. (S1 ^operator O2066 +)
  19042. Firing rl*prefer*rvt*predict-no*H0*6
  19043. -->
  19044. (S1 ^operator O2064 = 0.9999921813761182)
  19045. Firing rl*prefer*rvt*predict-yes*H0*5
  19046. -->
  19047. (S1 ^operator O2063 = 0.1215979844413558)
  19048. Firing prefer*rvt*predict-yes*H0
  19049. -->
  19050. Firing prefer*rvt*predict-no*H0
  19051. -->
  19052. Firing elaborate*copy-dir-to-output-link
  19053. -->
  19054. (I3 ^dir R +)
  19055. inner elaboration loop at bottom goal.
  19056. Retracting elaborate*copy-see-to-output-link
  19057. -->
  19058. (I3 ^see 1 +)
  19059. Retracting propose*predict-no
  19060. -->
  19061. (O2064 ^name predict-no +)
  19062. (S1 ^operator O2064 +)
  19063. Retracting propose*predict-yes
  19064. -->
  19065. (O2063 ^name predict-yes +)
  19066. (S1 ^operator O2063 +)
  19067. Retracting elaborate*reward*based*on*reward
  19068. -->
  19069. (R1035 ^value 1 +)
  19070. (R1 ^reward R1035 +)
  19071. Retracting elaborate*copy-dir-to-output-link
  19072. -->
  19073. (I3 ^dir U +)
  19074. Retracting rl*prefer*rvt*predict-no*H0*2
  19075. -->
  19076. (S1 ^operator O2064 = 1.)
  19077. Retracting rl*prefer*rvt*predict-yes*H0*1
  19078. -->
  19079. (S1 ^operator O2063 = 0.)
  19080. =>WM: (14509: S1 ^operator O2066 +)
  19081. =>WM: (14508: S1 ^operator O2065 +)
  19082. =>WM: (14507: I3 ^dir R)
  19083. =>WM: (14506: O2066 ^name predict-no)
  19084. =>WM: (14505: O2065 ^name predict-yes)
  19085. =>WM: (14504: R1036 ^value 1)
  19086. =>WM: (14503: R1 ^reward R1036)
  19087. =>WM: (14502: I3 ^see 0)
  19088. <=WM: (14493: S1 ^operator O2063 +)
  19089. <=WM: (14494: S1 ^operator O2064 +)
  19090. <=WM: (14495: S1 ^operator O2064)
  19091. <=WM: (14492: I3 ^dir U)
  19092. <=WM: (14488: R1 ^reward R1035)
  19093. <=WM: (14487: I3 ^see 1)
  19094. <=WM: (14491: O2064 ^name predict-no)
  19095. <=WM: (14490: O2063 ^name predict-yes)
  19096. <=WM: (14489: R1035 ^value 1)
  19097. --- Inner Elaboration Phase, active level 1 (S1) ---
  19098. Firing prefer*rvt*predict-yes*H0
  19099. -->
  19100. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  19101. -->
  19102. (S1 ^operator O2065 = -0.04253361215288998)
  19103. Firing rl*prefer*rvt*predict-yes*H0*5
  19104. -->
  19105. (S1 ^operator O2065 = 0.1215979844413558)
  19106. Firing prefer*rvt*predict-yes*H0*5*H1
  19107. -->
  19108. Firing prefer*rvt*predict-no*H0
  19109. -->
  19110. Firing rl*prefer*rvt*predict-no*H0*6
  19111. -->
  19112. (S1 ^operator O2066 = 0.9999921813761182)
  19113. inner elaboration loop at bottom goal.
  19114. Retracting rl*prefer*rvt*predict-no*H0*6
  19115. -->
  19116. (S1 ^operator O2064 = 0.9999921813761182)
  19117. Retracting rl*prefer*rvt*predict-yes*H0*5
  19118. -->
  19119. (S1 ^operator O2063 = 0.1215979844413558)
  19120. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  19121. -->
  19122. (S1 ^operator O2063 = -0.04253361215288998)
  19123. --- END Proposal Phase ---
  19124. --- Decision Phase ---
  19125. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19126. =>WM: (14510: S1 ^operator O2066)
  19127. 1033: O: O2066 (predict-no)
  19128. --- END Decision Phase ---
  19129. --- Application Phase ---
  19130. --- Firing Productions (PE) For State At Depth 1 ---
  19131. --- Inner Elaboration Phase, active level 1 (S1) ---
  19132. Firing apply*operator
  19133. -->
  19134. (I3 ^predict-no N1033 + :O )
  19135. Firing apply*operator*complete
  19136. -->
  19137. (I3 ^predict-no N1032 - :O )
  19138. inner elaboration loop at bottom goal.
  19139. --- Change Working Memory (PE) ---
  19140. =>WM: (14511: I3 ^predict-no N1033)
  19141. <=WM: (14497: N1032 ^status complete)
  19142. <=WM: (14496: I3 ^predict-no N1032)
  19143. --- Firing Productions (IE) For State At Depth 1 ---
  19144. --- Inner Elaboration Phase, active level 1 (S1) ---
  19145. Firing monitor*world
  19146. -->
  19147. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19148. --- Change Working Memory (IE) ---
  19149. --- END Application Phase ---
  19150. --- Output Phase ---
  19151. ENV: Agent did: predict-no for direction R in state State-B
  19152. In State-B moving R
  19153. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19154. predict error 0
  19155. dir: dir isL
  19156. --- END Output Phase ---
  19157. --- Input Phase ---
  19158. =>WM: (14515: I2 ^dir L)
  19159. =>WM: (14514: I2 ^reward 1)
  19160. =>WM: (14513: I2 ^see 0)
  19161. =>WM: (14512: N1033 ^status complete)
  19162. <=WM: (14500: I2 ^dir R)
  19163. <=WM: (14499: I2 ^reward 1)
  19164. <=WM: (14498: I2 ^see 0)
  19165. =>WM: (14516: I2 ^level-1 R0-root)
  19166. <=WM: (14501: I2 ^level-1 R1-root)
  19167. --- END Input Phase ---
  19168. --- Proposal Phase ---
  19169. --- Inner Elaboration Phase, active level 1 (S1) ---
  19170. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  19171. -->
  19172. (S1 ^operator O2066 = -0.1984300550322165)
  19173. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  19174. -->
  19175. (S1 ^operator O2065 = 0.6091249560527634)
  19176. Firing prefer*rvt*predict-no*H0*4*H1
  19177. -->
  19178. Firing prefer*rvt*predict-yes*H0*3*H1
  19179. -->
  19180. Firing elaborate*copy-see-to-output-link
  19181. -->
  19182. (I3 ^see 0 +)
  19183. Firing elaborate*reward*based*on*reward
  19184. -->
  19185. (R1037 ^value 1 +)
  19186. (R1 ^reward R1037 +)
  19187. Firing propose*predict-yes
  19188. -->
  19189. (O2067 ^name predict-yes +)
  19190. (S1 ^operator O2067 +)
  19191. Firing propose*predict-no
  19192. -->
  19193. (O2068 ^name predict-no +)
  19194. (S1 ^operator O2068 +)
  19195. Firing rl*prefer*rvt*predict-no*H0*4
  19196. -->
  19197. (S1 ^operator O2066 = 0.314500061238283)
  19198. Firing rl*prefer*rvt*predict-yes*H0*3
  19199. -->
  19200. (S1 ^operator O2065 = 0.3907592209442947)
  19201. Firing prefer*rvt*predict-yes*H0
  19202. -->
  19203. Firing prefer*rvt*predict-no*H0
  19204. -->
  19205. Firing elaborate*copy-dir-to-output-link
  19206. -->
  19207. (I3 ^dir L +)
  19208. inner elaboration loop at bottom goal.
  19209. Retracting elaborate*copy-see-to-output-link
  19210. -->
  19211. (I3 ^see 0 +)
  19212. Retracting propose*predict-no
  19213. -->
  19214. (O2066 ^name predict-no +)
  19215. (S1 ^operator O2066 +)
  19216. Retracting propose*predict-yes
  19217. -->
  19218. (O2065 ^name predict-yes +)
  19219. (S1 ^operator O2065 +)
  19220. Retracting elaborate*reward*based*on*reward
  19221. -->
  19222. (R1036 ^value 1 +)
  19223. (R1 ^reward R1036 +)
  19224. Retracting elaborate*copy-dir-to-output-link
  19225. -->
  19226. (I3 ^dir R +)
  19227. Retracting rl*prefer*rvt*predict-no*H0*6
  19228. -->
  19229. (S1 ^operator O2066 = 0.9999921813761182)
  19230. Retracting rl*prefer*rvt*predict-yes*H0*5
  19231. -->
  19232. (S1 ^operator O2065 = 0.1215979844413558)
  19233. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  19234. -->
  19235. (S1 ^operator O2065 = -0.04253361215288998)
  19236. =>WM: (14523: S1 ^operator O2068 +)
  19237. =>WM: (14522: S1 ^operator O2067 +)
  19238. =>WM: (14521: I3 ^dir L)
  19239. =>WM: (14520: O2068 ^name predict-no)
  19240. =>WM: (14519: O2067 ^name predict-yes)
  19241. =>WM: (14518: R1037 ^value 1)
  19242. =>WM: (14517: R1 ^reward R1037)
  19243. <=WM: (14508: S1 ^operator O2065 +)
  19244. <=WM: (14509: S1 ^operator O2066 +)
  19245. <=WM: (14510: S1 ^operator O2066)
  19246. <=WM: (14507: I3 ^dir R)
  19247. <=WM: (14503: R1 ^reward R1036)
  19248. <=WM: (14506: O2066 ^name predict-no)
  19249. <=WM: (14505: O2065 ^name predict-yes)
  19250. <=WM: (14504: R1036 ^value 1)
  19251. --- Inner Elaboration Phase, active level 1 (S1) ---
  19252. Firing prefer*rvt*predict-yes*H0
  19253. -->
  19254. Firing rl*prefer*rvt*predict-yes*H0*3
  19255. -->
  19256. (S1 ^operator O2067 = 0.3907592209442947)
  19257. Firing prefer*rvt*predict-yes*H0*3*H1
  19258. -->
  19259. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  19260. -->
  19261. (S1 ^operator O2067 = 0.6091249560527634)
  19262. Firing prefer*rvt*predict-no*H0
  19263. -->
  19264. Firing rl*prefer*rvt*predict-no*H0*4
  19265. -->
  19266. (S1 ^operator O2068 = 0.314500061238283)
  19267. Firing prefer*rvt*predict-no*H0*4*H1
  19268. -->
  19269. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  19270. -->
  19271. (S1 ^operator O2068 = -0.1984300550322165)
  19272. inner elaboration loop at bottom goal.
  19273. Retracting rl*prefer*rvt*predict-no*H0*4
  19274. -->
  19275. (S1 ^operator O2066 = 0.314500061238283)
  19276. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  19277. -->
  19278. (S1 ^operator O2066 = -0.1984300550322165)
  19279. Retracting rl*prefer*rvt*predict-yes*H0*3
  19280. -->
  19281. (S1 ^operator O2065 = 0.3907592209442947)
  19282. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  19283. -->
  19284. (S1 ^operator O2065 = 0.6091249560527634)
  19285. --- END Proposal Phase ---
  19286. --- Decision Phase ---
  19287. RL update rl*prefer*rvt*predict-no*H0*6 0.999992 0 0.999992 -> 0.999993 0 0.999993(R,m,v=1,0.938889,0.0576971)
  19288. =>WM: (14524: S1 ^operator O2067)
  19289. 1034: O: O2067 (predict-yes)
  19290. --- END Decision Phase ---
  19291. --- Application Phase ---
  19292. --- Firing Productions (PE) For State At Depth 1 ---
  19293. --- Inner Elaboration Phase, active level 1 (S1) ---
  19294. Firing apply*operator
  19295. -->
  19296. (I3 ^predict-yes N1034 + :O )
  19297. Firing apply*operator*complete
  19298. -->
  19299. (I3 ^predict-no N1033 - :O )
  19300. inner elaboration loop at bottom goal.
  19301. --- Change Working Memory (PE) ---
  19302. =>WM: (14525: I3 ^predict-yes N1034)
  19303. <=WM: (14512: N1033 ^status complete)
  19304. <=WM: (14511: I3 ^predict-no N1033)
  19305. --- Firing Productions (IE) For State At Depth 1 ---
  19306. --- Inner Elaboration Phase, active level 1 (S1) ---
  19307. Firing monitor*world
  19308. -->
  19309. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19310. --- Change Working Memory (IE) ---
  19311. --- END Application Phase ---
  19312. --- Output Phase ---
  19313. ENV: Agent did: predict-yes for direction L in state State-B
  19314. In State-B moving L
  19315. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  19316. predict error 0
  19317. dir: dir isU
  19318. --- END Output Phase ---
  19319. --- Input Phase ---
  19320. =>WM: (14529: I2 ^dir U)
  19321. =>WM: (14528: I2 ^reward 1)
  19322. =>WM: (14527: I2 ^see 1)
  19323. =>WM: (14526: N1034 ^status complete)
  19324. <=WM: (14515: I2 ^dir L)
  19325. <=WM: (14514: I2 ^reward 1)
  19326. <=WM: (14513: I2 ^see 0)
  19327. =>WM: (14530: I2 ^level-1 L1-root)
  19328. <=WM: (14516: I2 ^level-1 R0-root)
  19329. --- END Input Phase ---
  19330. --- Proposal Phase ---
  19331. --- Inner Elaboration Phase, active level 1 (S1) ---
  19332. Firing elaborate*copy-see-to-output-link
  19333. -->
  19334. (I3 ^see 1 +)
  19335. Firing elaborate*reward*based*on*reward
  19336. -->
  19337. (R1038 ^value 1 +)
  19338. (R1 ^reward R1038 +)
  19339. Firing propose*predict-yes
  19340. -->
  19341. (O2069 ^name predict-yes +)
  19342. (S1 ^operator O2069 +)
  19343. Firing propose*predict-no
  19344. -->
  19345. (O2070 ^name predict-no +)
  19346. (S1 ^operator O2070 +)
  19347. Firing rl*prefer*rvt*predict-no*H0*2
  19348. -->
  19349. (S1 ^operator O2068 = 1.)
  19350. Firing rl*prefer*rvt*predict-yes*H0*1
  19351. -->
  19352. (S1 ^operator O2067 = 0.)
  19353. Firing prefer*rvt*predict-yes*H0
  19354. -->
  19355. Firing prefer*rvt*predict-no*H0
  19356. -->
  19357. Firing elaborate*copy-dir-to-output-link
  19358. -->
  19359. (I3 ^dir U +)
  19360. inner elaboration loop at bottom goal.
  19361. Retracting elaborate*copy-see-to-output-link
  19362. -->
  19363. (I3 ^see 0 +)
  19364. Retracting propose*predict-no
  19365. -->
  19366. (O2068 ^name predict-no +)
  19367. (S1 ^operator O2068 +)
  19368. Retracting propose*predict-yes
  19369. -->
  19370. (O2067 ^name predict-yes +)
  19371. (S1 ^operator O2067 +)
  19372. Retracting elaborate*reward*based*on*reward
  19373. -->
  19374. (R1037 ^value 1 +)
  19375. (R1 ^reward R1037 +)
  19376. Retracting elaborate*copy-dir-to-output-link
  19377. -->
  19378. (I3 ^dir L +)
  19379. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  19380. -->
  19381. (S1 ^operator O2068 = -0.1984300550322165)
  19382. Retracting rl*prefer*rvt*predict-no*H0*4
  19383. -->
  19384. (S1 ^operator O2068 = 0.314500061238283)
  19385. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  19386. -->
  19387. (S1 ^operator O2067 = 0.6091249560527634)
  19388. Retracting rl*prefer*rvt*predict-yes*H0*3
  19389. -->
  19390. (S1 ^operator O2067 = 0.3907592209442947)
  19391. =>WM: (14538: S1 ^operator O2070 +)
  19392. =>WM: (14537: S1 ^operator O2069 +)
  19393. =>WM: (14536: I3 ^dir U)
  19394. =>WM: (14535: O2070 ^name predict-no)
  19395. =>WM: (14534: O2069 ^name predict-yes)
  19396. =>WM: (14533: R1038 ^value 1)
  19397. =>WM: (14532: R1 ^reward R1038)
  19398. =>WM: (14531: I3 ^see 1)
  19399. <=WM: (14522: S1 ^operator O2067 +)
  19400. <=WM: (14524: S1 ^operator O2067)
  19401. <=WM: (14523: S1 ^operator O2068 +)
  19402. <=WM: (14521: I3 ^dir L)
  19403. <=WM: (14517: R1 ^reward R1037)
  19404. <=WM: (14502: I3 ^see 0)
  19405. <=WM: (14520: O2068 ^name predict-no)
  19406. <=WM: (14519: O2067 ^name predict-yes)
  19407. <=WM: (14518: R1037 ^value 1)
  19408. --- Inner Elaboration Phase, active level 1 (S1) ---
  19409. Firing prefer*rvt*predict-yes*H0
  19410. -->
  19411. Firing rl*prefer*rvt*predict-yes*H0*1
  19412. -->
  19413. (S1 ^operator O2069 = 0.)
  19414. Firing prefer*rvt*predict-no*H0
  19415. -->
  19416. Firing rl*prefer*rvt*predict-no*H0*2
  19417. -->
  19418. (S1 ^operator O2070 = 1.)
  19419. inner elaboration loop at bottom goal.
  19420. Retracting rl*prefer*rvt*predict-no*H0*2
  19421. -->
  19422. (S1 ^operator O2068 = 1.)
  19423. Retracting rl*prefer*rvt*predict-yes*H0*1
  19424. -->
  19425. (S1 ^operator O2067 = 0.)
  19426. --- END Proposal Phase ---
  19427. --- Decision Phase ---
  19428. RL update rl*prefer*rvt*predict-yes*H0*3 0.472308 -0.0815487 0.390759 -> 0.472316 -0.0815473 0.390769(R,m,v=1,0.946108,0.051295)
  19429. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527593 0.0815315 0.609125 -> 0.527603 0.0815331 0.609136(R,m,v=1,1,0)
  19430. =>WM: (14539: S1 ^operator O2070)
  19431. 1035: O: O2070 (predict-no)
  19432. --- END Decision Phase ---
  19433. --- Application Phase ---
  19434. --- Firing Productions (PE) For State At Depth 1 ---
  19435. --- Inner Elaboration Phase, active level 1 (S1) ---
  19436. Firing apply*operator
  19437. -->
  19438. (I3 ^predict-no N1035 + :O )
  19439. Firing apply*operator*complete
  19440. -->
  19441. (I3 ^predict-yes N1034 - :O )
  19442. inner elaboration loop at bottom goal.
  19443. --- Change Working Memory (PE) ---
  19444. =>WM: (14540: I3 ^predict-no N1035)
  19445. <=WM: (14526: N1034 ^status complete)
  19446. <=WM: (14525: I3 ^predict-yes N1034)
  19447. --- Firing Productions (IE) For State At Depth 1 ---
  19448. --- Inner Elaboration Phase, active level 1 (S1) ---
  19449. Firing monitor*world
  19450. -->
  19451. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19452. --- Change Working Memory (IE) ---
  19453. --- END Application Phase ---
  19454. --- Output Phase ---
  19455. ENV: Agent did: predict-no for direction U in state State-A
  19456. In State-A moving U
  19457. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  19458. predict error 0
  19459. dir: dir isR
  19460. --- END Output Phase ---
  19461. --- Input Phase ---
  19462. =>WM: (14544: I2 ^dir R)
  19463. =>WM: (14543: I2 ^reward 1)
  19464. =>WM: (14542: I2 ^see 0)
  19465. =>WM: (14541: N1035 ^status complete)
  19466. <=WM: (14529: I2 ^dir U)
  19467. <=WM: (14528: I2 ^reward 1)
  19468. <=WM: (14527: I2 ^see 1)
  19469. =>WM: (14545: I2 ^level-1 L1-root)
  19470. <=WM: (14530: I2 ^level-1 L1-root)
  19471. --- END Input Phase ---
  19472. --- Proposal Phase ---
  19473. --- Inner Elaboration Phase, active level 1 (S1) ---
  19474. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  19475. -->
  19476. (S1 ^operator O2069 = 0.8784107800481358)
  19477. Firing prefer*rvt*predict-yes*H0*5*H1
  19478. -->
  19479. Firing elaborate*copy-see-to-output-link
  19480. -->
  19481. (I3 ^see 0 +)
  19482. Firing elaborate*reward*based*on*reward
  19483. -->
  19484. (R1039 ^value 1 +)
  19485. (R1 ^reward R1039 +)
  19486. Firing propose*predict-yes
  19487. -->
  19488. (O2071 ^name predict-yes +)
  19489. (S1 ^operator O2071 +)
  19490. Firing propose*predict-no
  19491. -->
  19492. (O2072 ^name predict-no +)
  19493. (S1 ^operator O2072 +)
  19494. Firing rl*prefer*rvt*predict-no*H0*6
  19495. -->
  19496. (S1 ^operator O2070 = 0.9999934438786788)
  19497. Firing rl*prefer*rvt*predict-yes*H0*5
  19498. -->
  19499. (S1 ^operator O2069 = 0.1215979844413558)
  19500. Firing prefer*rvt*predict-yes*H0
  19501. -->
  19502. Firing prefer*rvt*predict-no*H0
  19503. -->
  19504. Firing elaborate*copy-dir-to-output-link
  19505. -->
  19506. (I3 ^dir R +)
  19507. inner elaboration loop at bottom goal.
  19508. Retracting elaborate*copy-see-to-output-link
  19509. -->
  19510. (I3 ^see 1 +)
  19511. Retracting propose*predict-no
  19512. -->
  19513. (O2070 ^name predict-no +)
  19514. (S1 ^operator O2070 +)
  19515. Retracting propose*predict-yes
  19516. -->
  19517. (O2069 ^name predict-yes +)
  19518. (S1 ^operator O2069 +)
  19519. Retracting elaborate*reward*based*on*reward
  19520. -->
  19521. (R1038 ^value 1 +)
  19522. (R1 ^reward R1038 +)
  19523. Retracting elaborate*copy-dir-to-output-link
  19524. -->
  19525. (I3 ^dir U +)
  19526. Retracting rl*prefer*rvt*predict-no*H0*2
  19527. -->
  19528. (S1 ^operator O2070 = 1.)
  19529. Retracting rl*prefer*rvt*predict-yes*H0*1
  19530. -->
  19531. (S1 ^operator O2069 = 0.)
  19532. =>WM: (14553: S1 ^operator O2072 +)
  19533. =>WM: (14552: S1 ^operator O2071 +)
  19534. =>WM: (14551: I3 ^dir R)
  19535. =>WM: (14550: O2072 ^name predict-no)
  19536. =>WM: (14549: O2071 ^name predict-yes)
  19537. =>WM: (14548: R1039 ^value 1)
  19538. =>WM: (14547: R1 ^reward R1039)
  19539. =>WM: (14546: I3 ^see 0)
  19540. <=WM: (14537: S1 ^operator O2069 +)
  19541. <=WM: (14538: S1 ^operator O2070 +)
  19542. <=WM: (14539: S1 ^operator O2070)
  19543. <=WM: (14536: I3 ^dir U)
  19544. <=WM: (14532: R1 ^reward R1038)
  19545. <=WM: (14531: I3 ^see 1)
  19546. <=WM: (14535: O2070 ^name predict-no)
  19547. <=WM: (14534: O2069 ^name predict-yes)
  19548. <=WM: (14533: R1038 ^value 1)
  19549. --- Inner Elaboration Phase, active level 1 (S1) ---
  19550. Firing prefer*rvt*predict-yes*H0
  19551. -->
  19552. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  19553. -->
  19554. (S1 ^operator O2071 = 0.8784107800481358)
  19555. Firing rl*prefer*rvt*predict-yes*H0*5
  19556. -->
  19557. (S1 ^operator O2071 = 0.1215979844413558)
  19558. Firing prefer*rvt*predict-yes*H0*5*H1
  19559. -->
  19560. Firing prefer*rvt*predict-no*H0
  19561. -->
  19562. Firing rl*prefer*rvt*predict-no*H0*6
  19563. -->
  19564. (S1 ^operator O2072 = 0.9999934438786788)
  19565. inner elaboration loop at bottom goal.
  19566. Retracting rl*prefer*rvt*predict-no*H0*6
  19567. -->
  19568. (S1 ^operator O2070 = 0.9999934438786788)
  19569. Retracting rl*prefer*rvt*predict-yes*H0*5
  19570. -->
  19571. (S1 ^operator O2069 = 0.1215979844413558)
  19572. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  19573. -->
  19574. (S1 ^operator O2069 = 0.8784107800481358)
  19575. --- END Proposal Phase ---
  19576. --- Decision Phase ---
  19577. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19578. =>WM: (14554: S1 ^operator O2071)
  19579. 1036: O: O2071 (predict-yes)
  19580. --- END Decision Phase ---
  19581. --- Application Phase ---
  19582. --- Firing Productions (PE) For State At Depth 1 ---
  19583. --- Inner Elaboration Phase, active level 1 (S1) ---
  19584. Firing apply*operator
  19585. -->
  19586. (I3 ^predict-yes N1036 + :O )
  19587. Firing apply*operator*complete
  19588. -->
  19589. (I3 ^predict-no N1035 - :O )
  19590. inner elaboration loop at bottom goal.
  19591. --- Change Working Memory (PE) ---
  19592. =>WM: (14555: I3 ^predict-yes N1036)
  19593. <=WM: (14541: N1035 ^status complete)
  19594. <=WM: (14540: I3 ^predict-no N1035)
  19595. --- Firing Productions (IE) For State At Depth 1 ---
  19596. --- Inner Elaboration Phase, active level 1 (S1) ---
  19597. Firing monitor*world
  19598. -->
  19599. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19600. --- Change Working Memory (IE) ---
  19601. --- END Application Phase ---
  19602. --- Output Phase ---
  19603. ENV: Agent did: predict-yes for direction R in state State-A
  19604. In State-A moving R
  19605. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  19606. predict error 0
  19607. dir: dir isU
  19608. --- END Output Phase ---
  19609. --- Input Phase ---
  19610. =>WM: (14559: I2 ^dir U)
  19611. =>WM: (14558: I2 ^reward 1)
  19612. =>WM: (14557: I2 ^see 1)
  19613. =>WM: (14556: N1036 ^status complete)
  19614. <=WM: (14544: I2 ^dir R)
  19615. <=WM: (14543: I2 ^reward 1)
  19616. <=WM: (14542: I2 ^see 0)
  19617. =>WM: (14560: I2 ^level-1 R1-root)
  19618. <=WM: (14545: I2 ^level-1 L1-root)
  19619. --- END Input Phase ---
  19620. --- Proposal Phase ---
  19621. --- Inner Elaboration Phase, active level 1 (S1) ---
  19622. Firing elaborate*copy-see-to-output-link
  19623. -->
  19624. (I3 ^see 1 +)
  19625. Firing elaborate*reward*based*on*reward
  19626. -->
  19627. (R1040 ^value 1 +)
  19628. (R1 ^reward R1040 +)
  19629. Firing propose*predict-yes
  19630. -->
  19631. (O2073 ^name predict-yes +)
  19632. (S1 ^operator O2073 +)
  19633. Firing propose*predict-no
  19634. -->
  19635. (O2074 ^name predict-no +)
  19636. (S1 ^operator O2074 +)
  19637. Firing rl*prefer*rvt*predict-no*H0*2
  19638. -->
  19639. (S1 ^operator O2072 = 1.)
  19640. Firing rl*prefer*rvt*predict-yes*H0*1
  19641. -->
  19642. (S1 ^operator O2071 = 0.)
  19643. Firing prefer*rvt*predict-yes*H0
  19644. -->
  19645. Firing prefer*rvt*predict-no*H0
  19646. -->
  19647. Firing elaborate*copy-dir-to-output-link
  19648. -->
  19649. (I3 ^dir U +)
  19650. inner elaboration loop at bottom goal.
  19651. Retracting elaborate*copy-see-to-output-link
  19652. -->
  19653. (I3 ^see 0 +)
  19654. Retracting propose*predict-no
  19655. -->
  19656. (O2072 ^name predict-no +)
  19657. (S1 ^operator O2072 +)
  19658. Retracting propose*predict-yes
  19659. -->
  19660. (O2071 ^name predict-yes +)
  19661. (S1 ^operator O2071 +)
  19662. Retracting elaborate*reward*based*on*reward
  19663. -->
  19664. (R1039 ^value 1 +)
  19665. (R1 ^reward R1039 +)
  19666. Retracting elaborate*copy-dir-to-output-link
  19667. -->
  19668. (I3 ^dir R +)
  19669. Retracting rl*prefer*rvt*predict-no*H0*6
  19670. -->
  19671. (S1 ^operator O2072 = 0.9999934438786788)
  19672. Retracting rl*prefer*rvt*predict-yes*H0*5
  19673. -->
  19674. (S1 ^operator O2071 = 0.1215979844413558)
  19675. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  19676. -->
  19677. (S1 ^operator O2071 = 0.8784107800481358)
  19678. =>WM: (14568: S1 ^operator O2074 +)
  19679. =>WM: (14567: S1 ^operator O2073 +)
  19680. =>WM: (14566: I3 ^dir U)
  19681. =>WM: (14565: O2074 ^name predict-no)
  19682. =>WM: (14564: O2073 ^name predict-yes)
  19683. =>WM: (14563: R1040 ^value 1)
  19684. =>WM: (14562: R1 ^reward R1040)
  19685. =>WM: (14561: I3 ^see 1)
  19686. <=WM: (14552: S1 ^operator O2071 +)
  19687. <=WM: (14554: S1 ^operator O2071)
  19688. <=WM: (14553: S1 ^operator O2072 +)
  19689. <=WM: (14551: I3 ^dir R)
  19690. <=WM: (14547: R1 ^reward R1039)
  19691. <=WM: (14546: I3 ^see 0)
  19692. <=WM: (14550: O2072 ^name predict-no)
  19693. <=WM: (14549: O2071 ^name predict-yes)
  19694. <=WM: (14548: R1039 ^value 1)
  19695. --- Inner Elaboration Phase, active level 1 (S1) ---
  19696. Firing prefer*rvt*predict-yes*H0
  19697. -->
  19698. Firing rl*prefer*rvt*predict-yes*H0*1
  19699. -->
  19700. (S1 ^operator O2073 = 0.)
  19701. Firing prefer*rvt*predict-no*H0
  19702. -->
  19703. Firing rl*prefer*rvt*predict-no*H0*2
  19704. -->
  19705. (S1 ^operator O2074 = 1.)
  19706. inner elaboration loop at bottom goal.
  19707. Retracting rl*prefer*rvt*predict-no*H0*2
  19708. -->
  19709. (S1 ^operator O2072 = 1.)
  19710. Retracting rl*prefer*rvt*predict-yes*H0*1
  19711. -->
  19712. (S1 ^operator O2071 = 0.)
  19713. --- END Proposal Phase ---
  19714. --- Decision Phase ---
  19715. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121598 -> 0.534524 -0.412926 0.121597(R,m,v=1,0.869565,0.114041)
  19716. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465483 0.412928 0.878411 -> 0.465483 0.412927 0.87841(R,m,v=1,1,0)
  19717. =>WM: (14569: S1 ^operator O2074)
  19718. 1037: O: O2074 (predict-no)
  19719. --- END Decision Phase ---
  19720. --- Application Phase ---
  19721. --- Firing Productions (PE) For State At Depth 1 ---
  19722. --- Inner Elaboration Phase, active level 1 (S1) ---
  19723. Firing apply*operator
  19724. -->
  19725. (I3 ^predict-no N1037 + :O )
  19726. Firing apply*operator*complete
  19727. -->
  19728. (I3 ^predict-yes N1036 - :O )
  19729. inner elaboration loop at bottom goal.
  19730. --- Change Working Memory (PE) ---
  19731. =>WM: (14570: I3 ^predict-no N1037)
  19732. <=WM: (14556: N1036 ^status complete)
  19733. <=WM: (14555: I3 ^predict-yes N1036)
  19734. --- Firing Productions (IE) For State At Depth 1 ---
  19735. --- Inner Elaboration Phase, active level 1 (S1) ---
  19736. Firing monitor*world
  19737. -->
  19738. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  19739. --- Change Working Memory (IE) ---
  19740. --- END Application Phase ---
  19741. --- Output Phase ---
  19742. ENV: Agent did: predict-no for direction U in state State-B
  19743. In State-B moving U
  19744. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  19745. predict error 0
  19746. dir: dir isL
  19747. --- END Output Phase ---
  19748. |\--- Input Phase ---
  19749. =>WM: (14574: I2 ^dir L)
  19750. =>WM: (14573: I2 ^reward 1)
  19751. =>WM: (14572: I2 ^see 0)
  19752. =>WM: (14571: N1037 ^status complete)
  19753. <=WM: (14559: I2 ^dir U)
  19754. <=WM: (14558: I2 ^reward 1)
  19755. <=WM: (14557: I2 ^see 1)
  19756. =>WM: (14575: I2 ^level-1 R1-root)
  19757. <=WM: (14560: I2 ^level-1 R1-root)
  19758. --- END Input Phase ---
  19759. --- Proposal Phase ---
  19760. --- Inner Elaboration Phase, active level 1 (S1) ---
  19761. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  19762. -->
  19763. (S1 ^operator O2074 = -0.168718511744511)
  19764. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  19765. -->
  19766. (S1 ^operator O2073 = 0.6092773114732839)
  19767. Firing prefer*rvt*predict-no*H0*4*H1
  19768. -->
  19769. Firing prefer*rvt*predict-yes*H0*3*H1
  19770. -->
  19771. Firing elaborate*copy-see-to-output-link
  19772. -->
  19773. (I3 ^see 0 +)
  19774. Firing elaborate*reward*based*on*reward
  19775. -->
  19776. (R1041 ^value 1 +)
  19777. (R1 ^reward R1041 +)
  19778. Firing propose*predict-yes
  19779. -->
  19780. (O2075 ^name predict-yes +)
  19781. (S1 ^operator O2075 +)
  19782. Firing propose*predict-no
  19783. -->
  19784. (O2076 ^name predict-no +)
  19785. (S1 ^operator O2076 +)
  19786. Firing rl*prefer*rvt*predict-no*H0*4
  19787. -->
  19788. (S1 ^operator O2074 = 0.314500061238283)
  19789. Firing rl*prefer*rvt*predict-yes*H0*3
  19790. -->
  19791. (S1 ^operator O2073 = 0.3907686867108918)
  19792. Firing prefer*rvt*predict-yes*H0
  19793. -->
  19794. Firing prefer*rvt*predict-no*H0
  19795. -->
  19796. Firing elaborate*copy-dir-to-output-link
  19797. -->
  19798. (I3 ^dir L +)
  19799. inner elaboration loop at bottom goal.
  19800. Retracting elaborate*copy-see-to-output-link
  19801. -->
  19802. (I3 ^see 1 +)
  19803. Retracting propose*predict-no
  19804. -->
  19805. (O2074 ^name predict-no +)
  19806. (S1 ^operator O2074 +)
  19807. Retracting propose*predict-yes
  19808. -->
  19809. (O2073 ^name predict-yes +)
  19810. (S1 ^operator O2073 +)
  19811. Retracting elaborate*reward*based*on*reward
  19812. -->
  19813. (R1040 ^value 1 +)
  19814. (R1 ^reward R1040 +)
  19815. Retracting elaborate*copy-dir-to-output-link
  19816. -->
  19817. (I3 ^dir U +)
  19818. Retracting rl*prefer*rvt*predict-no*H0*2
  19819. -->
  19820. (S1 ^operator O2074 = 1.)
  19821. Retracting rl*prefer*rvt*predict-yes*H0*1
  19822. -->
  19823. (S1 ^operator O2073 = 0.)
  19824. =>WM: (14583: S1 ^operator O2076 +)
  19825. =>WM: (14582: S1 ^operator O2075 +)
  19826. =>WM: (14581: I3 ^dir L)
  19827. =>WM: (14580: O2076 ^name predict-no)
  19828. =>WM: (14579: O2075 ^name predict-yes)
  19829. =>WM: (14578: R1041 ^value 1)
  19830. =>WM: (14577: R1 ^reward R1041)
  19831. =>WM: (14576: I3 ^see 0)
  19832. <=WM: (14567: S1 ^operator O2073 +)
  19833. <=WM: (14568: S1 ^operator O2074 +)
  19834. <=WM: (14569: S1 ^operator O2074)
  19835. <=WM: (14566: I3 ^dir U)
  19836. <=WM: (14562: R1 ^reward R1040)
  19837. <=WM: (14561: I3 ^see 1)
  19838. <=WM: (14565: O2074 ^name predict-no)
  19839. <=WM: (14564: O2073 ^name predict-yes)
  19840. <=WM: (14563: R1040 ^value 1)
  19841. --- Inner Elaboration Phase, active level 1 (S1) ---
  19842. Firing prefer*rvt*predict-yes*H0
  19843. -->
  19844. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  19845. -->
  19846. (S1 ^operator O2075 = 0.6092773114732839)
  19847. Firing rl*prefer*rvt*predict-yes*H0*3
  19848. -->
  19849. (S1 ^operator O2075 = 0.3907686867108918)
  19850. Firing prefer*rvt*predict-yes*H0*3*H1
  19851. -->
  19852. Firing prefer*rvt*predict-no*H0
  19853. -->
  19854. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  19855. -->
  19856. (S1 ^operator O2076 = -0.168718511744511)
  19857. Firing rl*prefer*rvt*predict-no*H0*4
  19858. -->
  19859. (S1 ^operator O2076 = 0.314500061238283)
  19860. Firing prefer*rvt*predict-no*H0*4*H1
  19861. -->
  19862. inner elaboration loop at bottom goal.
  19863. Retracting rl*prefer*rvt*predict-no*H0*4
  19864. -->
  19865. (S1 ^operator O2074 = 0.314500061238283)
  19866. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  19867. -->
  19868. (S1 ^operator O2074 = -0.168718511744511)
  19869. Retracting rl*prefer*rvt*predict-yes*H0*3
  19870. -->
  19871. (S1 ^operator O2073 = 0.3907686867108918)
  19872. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  19873. -->
  19874. (S1 ^operator O2073 = 0.6092773114732839)
  19875. --- END Proposal Phase ---
  19876. --- Decision Phase ---
  19877. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  19878. =>WM: (14584: S1 ^operator O2075)
  19879. 1038: O: O2075 (predict-yes)
  19880. --- END Decision Phase ---
  19881. --- Application Phase ---
  19882. --- Firing Productions (PE) For State At Depth 1 ---
  19883. --- Inner Elaboration Phase, active level 1 (S1) ---
  19884. Firing apply*operator
  19885. -->
  19886. (I3 ^predict-yes N1038 + :O )
  19887. Firing apply*operator*complete
  19888. -->
  19889. (I3 ^predict-no N1037 - :O )
  19890. inner elaboration loop at bottom goal.
  19891. --- Change Working Memory (PE) ---
  19892. =>WM: (14585: I3 ^predict-yes N1038)
  19893. <=WM: (14571: N1037 ^status complete)
  19894. <=WM: (14570: I3 ^predict-no N1037)
  19895. --- Firing Productions (IE) For State At Depth 1 ---
  19896. --- Inner Elaboration Phase, active level 1 (S1) ---
  19897. Firing monitor*world
  19898. -->
  19899. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  19900. --- Change Working Memory (IE) ---
  19901. --- END Application Phase ---
  19902. --- Output Phase ---
  19903. ENV: Agent did: predict-yes for direction L in state State-B
  19904. In State-B moving L
  19905. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  19906. predict error 0
  19907. dir: dir isR
  19908. --- END Output Phase ---
  19909. -/|--- Input Phase ---
  19910. =>WM: (14589: I2 ^dir R)
  19911. =>WM: (14588: I2 ^reward 1)
  19912. =>WM: (14587: I2 ^see 1)
  19913. =>WM: (14586: N1038 ^status complete)
  19914. <=WM: (14574: I2 ^dir L)
  19915. <=WM: (14573: I2 ^reward 1)
  19916. <=WM: (14572: I2 ^see 0)
  19917. =>WM: (14590: I2 ^level-1 L1-root)
  19918. <=WM: (14575: I2 ^level-1 R1-root)
  19919. --- END Input Phase ---
  19920. --- Proposal Phase ---
  19921. --- Inner Elaboration Phase, active level 1 (S1) ---
  19922. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  19923. -->
  19924. (S1 ^operator O2075 = 0.8784099639037452)
  19925. Firing prefer*rvt*predict-yes*H0*5*H1
  19926. -->
  19927. Firing elaborate*copy-see-to-output-link
  19928. -->
  19929. (I3 ^see 1 +)
  19930. Firing elaborate*reward*based*on*reward
  19931. -->
  19932. (R1042 ^value 1 +)
  19933. (R1 ^reward R1042 +)
  19934. Firing propose*predict-yes
  19935. -->
  19936. (O2077 ^name predict-yes +)
  19937. (S1 ^operator O2077 +)
  19938. Firing propose*predict-no
  19939. -->
  19940. (O2078 ^name predict-no +)
  19941. (S1 ^operator O2078 +)
  19942. Firing rl*prefer*rvt*predict-no*H0*6
  19943. -->
  19944. (S1 ^operator O2076 = 0.9999934438786788)
  19945. Firing rl*prefer*rvt*predict-yes*H0*5
  19946. -->
  19947. (S1 ^operator O2075 = 0.1215972793263044)
  19948. Firing prefer*rvt*predict-yes*H0
  19949. -->
  19950. Firing prefer*rvt*predict-no*H0
  19951. -->
  19952. Firing elaborate*copy-dir-to-output-link
  19953. -->
  19954. (I3 ^dir R +)
  19955. inner elaboration loop at bottom goal.
  19956. Retracting elaborate*copy-see-to-output-link
  19957. -->
  19958. (I3 ^see 0 +)
  19959. Retracting propose*predict-no
  19960. -->
  19961. (O2076 ^name predict-no +)
  19962. (S1 ^operator O2076 +)
  19963. Retracting propose*predict-yes
  19964. -->
  19965. (O2075 ^name predict-yes +)
  19966. (S1 ^operator O2075 +)
  19967. Retracting elaborate*reward*based*on*reward
  19968. -->
  19969. (R1041 ^value 1 +)
  19970. (R1 ^reward R1041 +)
  19971. Retracting elaborate*copy-dir-to-output-link
  19972. -->
  19973. (I3 ^dir L +)
  19974. Retracting rl*prefer*rvt*predict-no*H0*4
  19975. -->
  19976. (S1 ^operator O2076 = 0.314500061238283)
  19977. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  19978. -->
  19979. (S1 ^operator O2076 = -0.168718511744511)
  19980. Retracting rl*prefer*rvt*predict-yes*H0*3
  19981. -->
  19982. (S1 ^operator O2075 = 0.3907686867108918)
  19983. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  19984. -->
  19985. (S1 ^operator O2075 = 0.6092773114732839)
  19986. =>WM: (14598: S1 ^operator O2078 +)
  19987. =>WM: (14597: S1 ^operator O2077 +)
  19988. =>WM: (14596: I3 ^dir R)
  19989. =>WM: (14595: O2078 ^name predict-no)
  19990. =>WM: (14594: O2077 ^name predict-yes)
  19991. =>WM: (14593: R1042 ^value 1)
  19992. =>WM: (14592: R1 ^reward R1042)
  19993. =>WM: (14591: I3 ^see 1)
  19994. <=WM: (14582: S1 ^operator O2075 +)
  19995. <=WM: (14584: S1 ^operator O2075)
  19996. <=WM: (14583: S1 ^operator O2076 +)
  19997. <=WM: (14581: I3 ^dir L)
  19998. <=WM: (14577: R1 ^reward R1041)
  19999. <=WM: (14576: I3 ^see 0)
  20000. <=WM: (14580: O2076 ^name predict-no)
  20001. <=WM: (14579: O2075 ^name predict-yes)
  20002. <=WM: (14578: R1041 ^value 1)
  20003. --- Inner Elaboration Phase, active level 1 (S1) ---
  20004. Firing prefer*rvt*predict-yes*H0
  20005. -->
  20006. Firing rl*prefer*rvt*predict-yes*H0*5
  20007. -->
  20008. (S1 ^operator O2077 = 0.1215972793263044)
  20009. Firing prefer*rvt*predict-yes*H0*5*H1
  20010. -->
  20011. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  20012. -->
  20013. (S1 ^operator O2077 = 0.8784099639037452)
  20014. Firing prefer*rvt*predict-no*H0
  20015. -->
  20016. Firing rl*prefer*rvt*predict-no*H0*6
  20017. -->
  20018. (S1 ^operator O2078 = 0.9999934438786788)
  20019. inner elaboration loop at bottom goal.
  20020. Retracting rl*prefer*rvt*predict-no*H0*6
  20021. -->
  20022. (S1 ^operator O2076 = 0.9999934438786788)
  20023. Retracting rl*prefer*rvt*predict-yes*H0*5
  20024. -->
  20025. (S1 ^operator O2075 = 0.1215972793263044)
  20026. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  20027. -->
  20028. (S1 ^operator O2075 = 0.8784099639037452)
  20029. --- END Proposal Phase ---
  20030. --- Decision Phase ---
  20031. RL update rl*prefer*rvt*predict-yes*H0*3 0.472316 -0.0815473 0.390769 -> 0.472313 -0.0815478 0.390765(R,m,v=1,0.946429,0.0510051)
  20032. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527723 0.0815541 0.609277 -> 0.52772 0.0815534 0.609273(R,m,v=1,1,0)
  20033. =>WM: (14599: S1 ^operator O2077)
  20034. 1039: O: O2077 (predict-yes)
  20035. --- END Decision Phase ---
  20036. --- Application Phase ---
  20037. --- Firing Productions (PE) For State At Depth 1 ---
  20038. --- Inner Elaboration Phase, active level 1 (S1) ---
  20039. Firing apply*operator
  20040. -->
  20041. (I3 ^predict-yes N1039 + :O )
  20042. Firing apply*operator*complete
  20043. -->
  20044. (I3 ^predict-yes N1038 - :O )
  20045. inner elaboration loop at bottom goal.
  20046. --- Change Working Memory (PE) ---
  20047. =>WM: (14600: I3 ^predict-yes N1039)
  20048. <=WM: (14586: N1038 ^status complete)
  20049. <=WM: (14585: I3 ^predict-yes N1038)
  20050. --- Firing Productions (IE) For State At Depth 1 ---
  20051. --- Inner Elaboration Phase, active level 1 (S1) ---
  20052. Firing monitor*world
  20053. -->
  20054. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20055. --- Change Working Memory (IE) ---
  20056. --- END Application Phase ---
  20057. --- Output Phase ---
  20058. ENV: Agent did: predict-yes for direction R in state State-A
  20059. In State-A moving R
  20060. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20061. predict error 0
  20062. dir: dir isL
  20063. --- END Output Phase ---
  20064. \-/--- Input Phase ---
  20065. =>WM: (14604: I2 ^dir L)
  20066. =>WM: (14603: I2 ^reward 1)
  20067. =>WM: (14602: I2 ^see 1)
  20068. =>WM: (14601: N1039 ^status complete)
  20069. <=WM: (14589: I2 ^dir R)
  20070. <=WM: (14588: I2 ^reward 1)
  20071. <=WM: (14587: I2 ^see 1)
  20072. =>WM: (14605: I2 ^level-1 R1-root)
  20073. <=WM: (14590: I2 ^level-1 L1-root)
  20074. --- END Input Phase ---
  20075. --- Proposal Phase ---
  20076. --- Inner Elaboration Phase, active level 1 (S1) ---
  20077. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  20078. -->
  20079. (S1 ^operator O2078 = -0.168718511744511)
  20080. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  20081. -->
  20082. (S1 ^operator O2077 = 0.6092730179615714)
  20083. Firing prefer*rvt*predict-no*H0*4*H1
  20084. -->
  20085. Firing prefer*rvt*predict-yes*H0*3*H1
  20086. -->
  20087. Firing elaborate*copy-see-to-output-link
  20088. -->
  20089. (I3 ^see 1 +)
  20090. Firing elaborate*reward*based*on*reward
  20091. -->
  20092. (R1043 ^value 1 +)
  20093. (R1 ^reward R1043 +)
  20094. Firing propose*predict-yes
  20095. -->
  20096. (O2079 ^name predict-yes +)
  20097. (S1 ^operator O2079 +)
  20098. Firing propose*predict-no
  20099. -->
  20100. (O2080 ^name predict-no +)
  20101. (S1 ^operator O2080 +)
  20102. Firing rl*prefer*rvt*predict-no*H0*4
  20103. -->
  20104. (S1 ^operator O2078 = 0.314500061238283)
  20105. Firing rl*prefer*rvt*predict-yes*H0*3
  20106. -->
  20107. (S1 ^operator O2077 = 0.3907649311218379)
  20108. Firing prefer*rvt*predict-yes*H0
  20109. -->
  20110. Firing prefer*rvt*predict-no*H0
  20111. -->
  20112. Firing elaborate*copy-dir-to-output-link
  20113. -->
  20114. (I3 ^dir L +)
  20115. inner elaboration loop at bottom goal.
  20116. Retracting elaborate*copy-see-to-output-link
  20117. -->
  20118. (I3 ^see 1 +)
  20119. Retracting propose*predict-no
  20120. -->
  20121. (O2078 ^name predict-no +)
  20122. (S1 ^operator O2078 +)
  20123. Retracting propose*predict-yes
  20124. -->
  20125. (O2077 ^name predict-yes +)
  20126. (S1 ^operator O2077 +)
  20127. Retracting elaborate*reward*based*on*reward
  20128. -->
  20129. (R1042 ^value 1 +)
  20130. (R1 ^reward R1042 +)
  20131. Retracting elaborate*copy-dir-to-output-link
  20132. -->
  20133. (I3 ^dir R +)
  20134. Retracting rl*prefer*rvt*predict-no*H0*6
  20135. -->
  20136. (S1 ^operator O2078 = 0.9999934438786788)
  20137. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  20138. -->
  20139. (S1 ^operator O2077 = 0.8784099639037452)
  20140. Retracting rl*prefer*rvt*predict-yes*H0*5
  20141. -->
  20142. (S1 ^operator O2077 = 0.1215972793263044)
  20143. =>WM: (14612: S1 ^operator O2080 +)
  20144. =>WM: (14611: S1 ^operator O2079 +)
  20145. =>WM: (14610: I3 ^dir L)
  20146. =>WM: (14609: O2080 ^name predict-no)
  20147. =>WM: (14608: O2079 ^name predict-yes)
  20148. =>WM: (14607: R1043 ^value 1)
  20149. =>WM: (14606: R1 ^reward R1043)
  20150. <=WM: (14597: S1 ^operator O2077 +)
  20151. <=WM: (14599: S1 ^operator O2077)
  20152. <=WM: (14598: S1 ^operator O2078 +)
  20153. <=WM: (14596: I3 ^dir R)
  20154. <=WM: (14592: R1 ^reward R1042)
  20155. <=WM: (14595: O2078 ^name predict-no)
  20156. <=WM: (14594: O2077 ^name predict-yes)
  20157. <=WM: (14593: R1042 ^value 1)
  20158. --- Inner Elaboration Phase, active level 1 (S1) ---
  20159. Firing prefer*rvt*predict-yes*H0
  20160. -->
  20161. Firing rl*prefer*rvt*predict-yes*H0*3
  20162. -->
  20163. (S1 ^operator O2079 = 0.3907649311218379)
  20164. Firing prefer*rvt*predict-yes*H0*3*H1
  20165. -->
  20166. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  20167. -->
  20168. (S1 ^operator O2079 = 0.6092730179615714)
  20169. Firing prefer*rvt*predict-no*H0
  20170. -->
  20171. Firing rl*prefer*rvt*predict-no*H0*4
  20172. -->
  20173. (S1 ^operator O2080 = 0.314500061238283)
  20174. Firing prefer*rvt*predict-no*H0*4*H1
  20175. -->
  20176. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  20177. -->
  20178. (S1 ^operator O2080 = -0.168718511744511)
  20179. inner elaboration loop at bottom goal.
  20180. Retracting rl*prefer*rvt*predict-no*H0*4
  20181. -->
  20182. (S1 ^operator O2078 = 0.314500061238283)
  20183. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  20184. -->
  20185. (S1 ^operator O2078 = -0.168718511744511)
  20186. Retracting rl*prefer*rvt*predict-yes*H0*3
  20187. -->
  20188. (S1 ^operator O2077 = 0.3907649311218379)
  20189. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  20190. -->
  20191. (S1 ^operator O2077 = 0.6092730179615714)
  20192. --- END Proposal Phase ---
  20193. --- Decision Phase ---
  20194. RL update rl*prefer*rvt*predict-yes*H0*5 0.534524 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.87027,0.113514)
  20195. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465483 0.412927 0.87841 -> 0.465482 0.412927 0.878409(R,m,v=1,1,0)
  20196. =>WM: (14613: S1 ^operator O2079)
  20197. 1040: O: O2079 (predict-yes)
  20198. --- END Decision Phase ---
  20199. --- Application Phase ---
  20200. --- Firing Productions (PE) For State At Depth 1 ---
  20201. --- Inner Elaboration Phase, active level 1 (S1) ---
  20202. Firing apply*operator
  20203. -->
  20204. (I3 ^predict-yes N1040 + :O )
  20205. Firing apply*operator*complete
  20206. -->
  20207. (I3 ^predict-yes N1039 - :O )
  20208. inner elaboration loop at bottom goal.
  20209. --- Change Working Memory (PE) ---
  20210. =>WM: (14614: I3 ^predict-yes N1040)
  20211. <=WM: (14601: N1039 ^status complete)
  20212. <=WM: (14600: I3 ^predict-yes N1039)
  20213. --- Firing Productions (IE) For State At Depth 1 ---
  20214. --- Inner Elaboration Phase, active level 1 (S1) ---
  20215. Firing monitor*world
  20216. -->
  20217. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20218. --- Change Working Memory (IE) ---
  20219. --- END Application Phase ---
  20220. --- Output Phase ---
  20221. ENV: Agent did: predict-yes for direction L in state State-B
  20222. In State-B moving L
  20223. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  20224. predict error 0
  20225. dir: dir isL
  20226. --- END Output Phase ---
  20227. |\---- Input Phase ---
  20228. =>WM: (14618: I2 ^dir L)
  20229. =>WM: (14617: I2 ^reward 1)
  20230. =>WM: (14616: I2 ^see 1)
  20231. =>WM: (14615: N1040 ^status complete)
  20232. <=WM: (14604: I2 ^dir L)
  20233. <=WM: (14603: I2 ^reward 1)
  20234. <=WM: (14602: I2 ^see 1)
  20235. =>WM: (14619: I2 ^level-1 L1-root)
  20236. <=WM: (14605: I2 ^level-1 R1-root)
  20237. --- END Input Phase ---
  20238. --- Proposal Phase ---
  20239. --- Inner Elaboration Phase, active level 1 (S1) ---
  20240. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  20241. -->
  20242. (S1 ^operator O2079 = -0.2062723012911647)
  20243. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  20244. -->
  20245. (S1 ^operator O2080 = 0.6855213227180397)
  20246. Firing prefer*rvt*predict-no*H0*4*H1
  20247. -->
  20248. Firing prefer*rvt*predict-yes*H0*3*H1
  20249. -->
  20250. Firing elaborate*copy-see-to-output-link
  20251. -->
  20252. (I3 ^see 1 +)
  20253. Firing elaborate*reward*based*on*reward
  20254. -->
  20255. (R1044 ^value 1 +)
  20256. (R1 ^reward R1044 +)
  20257. Firing propose*predict-yes
  20258. -->
  20259. (O2081 ^name predict-yes +)
  20260. (S1 ^operator O2081 +)
  20261. Firing propose*predict-no
  20262. -->
  20263. (O2082 ^name predict-no +)
  20264. (S1 ^operator O2082 +)
  20265. Firing rl*prefer*rvt*predict-no*H0*4
  20266. -->
  20267. (S1 ^operator O2080 = 0.314500061238283)
  20268. Firing rl*prefer*rvt*predict-yes*H0*3
  20269. -->
  20270. (S1 ^operator O2079 = 0.3907649311218379)
  20271. Firing prefer*rvt*predict-yes*H0
  20272. -->
  20273. Firing prefer*rvt*predict-no*H0
  20274. -->
  20275. Firing elaborate*copy-dir-to-output-link
  20276. -->
  20277. (I3 ^dir L +)
  20278. inner elaboration loop at bottom goal.
  20279. Retracting elaborate*copy-see-to-output-link
  20280. -->
  20281. (I3 ^see 1 +)
  20282. Retracting propose*predict-no
  20283. -->
  20284. (O2080 ^name predict-no +)
  20285. (S1 ^operator O2080 +)
  20286. Retracting propose*predict-yes
  20287. -->
  20288. (O2079 ^name predict-yes +)
  20289. (S1 ^operator O2079 +)
  20290. Retracting elaborate*reward*based*on*reward
  20291. -->
  20292. (R1043 ^value 1 +)
  20293. (R1 ^reward R1043 +)
  20294. Retracting elaborate*copy-dir-to-output-link
  20295. -->
  20296. (I3 ^dir L +)
  20297. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  20298. -->
  20299. (S1 ^operator O2080 = -0.168718511744511)
  20300. Retracting rl*prefer*rvt*predict-no*H0*4
  20301. -->
  20302. (S1 ^operator O2080 = 0.314500061238283)
  20303. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  20304. -->
  20305. (S1 ^operator O2079 = 0.6092730179615714)
  20306. Retracting rl*prefer*rvt*predict-yes*H0*3
  20307. -->
  20308. (S1 ^operator O2079 = 0.3907649311218379)
  20309. =>WM: (14625: S1 ^operator O2082 +)
  20310. =>WM: (14624: S1 ^operator O2081 +)
  20311. =>WM: (14623: O2082 ^name predict-no)
  20312. =>WM: (14622: O2081 ^name predict-yes)
  20313. =>WM: (14621: R1044 ^value 1)
  20314. =>WM: (14620: R1 ^reward R1044)
  20315. <=WM: (14611: S1 ^operator O2079 +)
  20316. <=WM: (14613: S1 ^operator O2079)
  20317. <=WM: (14612: S1 ^operator O2080 +)
  20318. <=WM: (14606: R1 ^reward R1043)
  20319. <=WM: (14609: O2080 ^name predict-no)
  20320. <=WM: (14608: O2079 ^name predict-yes)
  20321. <=WM: (14607: R1043 ^value 1)
  20322. --- Inner Elaboration Phase, active level 1 (S1) ---
  20323. Firing prefer*rvt*predict-yes*H0
  20324. -->
  20325. Firing rl*prefer*rvt*predict-yes*H0*3
  20326. -->
  20327. (S1 ^operator O2081 = 0.3907649311218379)
  20328. Firing prefer*rvt*predict-yes*H0*3*H1
  20329. -->
  20330. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  20331. -->
  20332. (S1 ^operator O2081 = -0.2062723012911647)
  20333. Firing prefer*rvt*predict-no*H0
  20334. -->
  20335. Firing rl*prefer*rvt*predict-no*H0*4
  20336. -->
  20337. (S1 ^operator O2082 = 0.314500061238283)
  20338. Firing prefer*rvt*predict-no*H0*4*H1
  20339. -->
  20340. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  20341. -->
  20342. (S1 ^operator O2082 = 0.6855213227180397)
  20343. inner elaboration loop at bottom goal.
  20344. Retracting rl*prefer*rvt*predict-no*H0*4
  20345. -->
  20346. (S1 ^operator O2080 = 0.314500061238283)
  20347. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  20348. -->
  20349. (S1 ^operator O2080 = 0.6855213227180397)
  20350. Retracting rl*prefer*rvt*predict-yes*H0*3
  20351. -->
  20352. (S1 ^operator O2079 = 0.3907649311218379)
  20353. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  20354. -->
  20355. (S1 ^operator O2079 = -0.2062723012911647)
  20356. --- END Proposal Phase ---
  20357. --- Decision Phase ---
  20358. RL update rl*prefer*rvt*predict-yes*H0*3 0.472313 -0.0815478 0.390765 -> 0.47231 -0.0815483 0.390762(R,m,v=1,0.946746,0.0507185)
  20359. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52772 0.0815534 0.609273 -> 0.527717 0.0815529 0.609269(R,m,v=1,1,0)
  20360. =>WM: (14626: S1 ^operator O2082)
  20361. 1041: O: O2082 (predict-no)
  20362. --- END Decision Phase ---
  20363. --- Application Phase ---
  20364. --- Firing Productions (PE) For State At Depth 1 ---
  20365. --- Inner Elaboration Phase, active level 1 (S1) ---
  20366. Firing apply*operator
  20367. -->
  20368. (I3 ^predict-no N1041 + :O )
  20369. Firing apply*operator*complete
  20370. -->
  20371. (I3 ^predict-yes N1040 - :O )
  20372. inner elaboration loop at bottom goal.
  20373. --- Change Working Memory (PE) ---
  20374. =>WM: (14627: I3 ^predict-no N1041)
  20375. <=WM: (14615: N1040 ^status complete)
  20376. <=WM: (14614: I3 ^predict-yes N1040)
  20377. --- Firing Productions (IE) For State At Depth 1 ---
  20378. --- Inner Elaboration Phase, active level 1 (S1) ---
  20379. Firing monitor*world
  20380. -->
  20381. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20382. --- Change Working Memory (IE) ---
  20383. --- END Application Phase ---
  20384. --- Output Phase ---
  20385. ENV: Agent did: predict-no for direction L in state State-A
  20386. In State-A moving L
  20387. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  20388. predict error 0
  20389. dir: dir isR
  20390. --- END Output Phase ---
  20391. /--- Input Phase ---
  20392. =>WM: (14631: I2 ^dir R)
  20393. =>WM: (14630: I2 ^reward 1)
  20394. =>WM: (14629: I2 ^see 0)
  20395. =>WM: (14628: N1041 ^status complete)
  20396. <=WM: (14618: I2 ^dir L)
  20397. <=WM: (14617: I2 ^reward 1)
  20398. <=WM: (14616: I2 ^see 1)
  20399. =>WM: (14632: I2 ^level-1 L0-root)
  20400. <=WM: (14619: I2 ^level-1 L1-root)
  20401. --- END Input Phase ---
  20402. --- Proposal Phase ---
  20403. --- Inner Elaboration Phase, active level 1 (S1) ---
  20404. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  20405. -->
  20406. (S1 ^operator O2081 = 0.8783973744177012)
  20407. Firing prefer*rvt*predict-yes*H0*5*H1
  20408. -->
  20409. Firing elaborate*copy-see-to-output-link
  20410. -->
  20411. (I3 ^see 0 +)
  20412. Firing elaborate*reward*based*on*reward
  20413. -->
  20414. (R1045 ^value 1 +)
  20415. (R1 ^reward R1045 +)
  20416. Firing propose*predict-yes
  20417. -->
  20418. (O2083 ^name predict-yes +)
  20419. (S1 ^operator O2083 +)
  20420. Firing propose*predict-no
  20421. -->
  20422. (O2084 ^name predict-no +)
  20423. (S1 ^operator O2084 +)
  20424. Firing rl*prefer*rvt*predict-no*H0*6
  20425. -->
  20426. (S1 ^operator O2082 = 0.9999934438786788)
  20427. Firing rl*prefer*rvt*predict-yes*H0*5
  20428. -->
  20429. (S1 ^operator O2081 = 0.1215966971063918)
  20430. Firing prefer*rvt*predict-yes*H0
  20431. -->
  20432. Firing prefer*rvt*predict-no*H0
  20433. -->
  20434. Firing elaborate*copy-dir-to-output-link
  20435. -->
  20436. (I3 ^dir R +)
  20437. inner elaboration loop at bottom goal.
  20438. Retracting elaborate*copy-see-to-output-link
  20439. -->
  20440. (I3 ^see 1 +)
  20441. Retracting propose*predict-no
  20442. -->
  20443. (O2082 ^name predict-no +)
  20444. (S1 ^operator O2082 +)
  20445. Retracting propose*predict-yes
  20446. -->
  20447. (O2081 ^name predict-yes +)
  20448. (S1 ^operator O2081 +)
  20449. Retracting elaborate*reward*based*on*reward
  20450. -->
  20451. (R1044 ^value 1 +)
  20452. (R1 ^reward R1044 +)
  20453. Retracting elaborate*copy-dir-to-output-link
  20454. -->
  20455. (I3 ^dir L +)
  20456. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  20457. -->
  20458. (S1 ^operator O2082 = 0.6855213227180397)
  20459. Retracting rl*prefer*rvt*predict-no*H0*4
  20460. -->
  20461. (S1 ^operator O2082 = 0.314500061238283)
  20462. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  20463. -->
  20464. (S1 ^operator O2081 = -0.2062723012911647)
  20465. Retracting rl*prefer*rvt*predict-yes*H0*3
  20466. -->
  20467. (S1 ^operator O2081 = 0.3907618357131554)
  20468. =>WM: (14640: S1 ^operator O2084 +)
  20469. =>WM: (14639: S1 ^operator O2083 +)
  20470. =>WM: (14638: I3 ^dir R)
  20471. =>WM: (14637: O2084 ^name predict-no)
  20472. =>WM: (14636: O2083 ^name predict-yes)
  20473. =>WM: (14635: R1045 ^value 1)
  20474. =>WM: (14634: R1 ^reward R1045)
  20475. =>WM: (14633: I3 ^see 0)
  20476. <=WM: (14624: S1 ^operator O2081 +)
  20477. <=WM: (14625: S1 ^operator O2082 +)
  20478. <=WM: (14626: S1 ^operator O2082)
  20479. <=WM: (14610: I3 ^dir L)
  20480. <=WM: (14620: R1 ^reward R1044)
  20481. <=WM: (14591: I3 ^see 1)
  20482. <=WM: (14623: O2082 ^name predict-no)
  20483. <=WM: (14622: O2081 ^name predict-yes)
  20484. <=WM: (14621: R1044 ^value 1)
  20485. --- Inner Elaboration Phase, active level 1 (S1) ---
  20486. Firing prefer*rvt*predict-yes*H0
  20487. -->
  20488. Firing rl*prefer*rvt*predict-yes*H0*5
  20489. -->
  20490. (S1 ^operator O2083 = 0.1215966971063918)
  20491. Firing prefer*rvt*predict-yes*H0*5*H1
  20492. -->
  20493. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  20494. -->
  20495. (S1 ^operator O2083 = 0.8783973744177012)
  20496. Firing prefer*rvt*predict-no*H0
  20497. -->
  20498. Firing rl*prefer*rvt*predict-no*H0*6
  20499. -->
  20500. (S1 ^operator O2084 = 0.9999934438786788)
  20501. inner elaboration loop at bottom goal.
  20502. Retracting rl*prefer*rvt*predict-no*H0*6
  20503. -->
  20504. (S1 ^operator O2082 = 0.9999934438786788)
  20505. Retracting rl*prefer*rvt*predict-yes*H0*5
  20506. -->
  20507. (S1 ^operator O2081 = 0.1215966971063918)
  20508. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  20509. -->
  20510. (S1 ^operator O2081 = 0.8783973744177012)
  20511. --- END Proposal Phase ---
  20512. --- Decision Phase ---
  20513. RL update rl*prefer*rvt*predict-no*H0*4 0.478549 -0.164049 0.3145 -> 0.478547 -0.164049 0.314498(R,m,v=1,0.925466,0.0694099)
  20514. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521471 0.164051 0.685521 -> 0.521469 0.16405 0.685519(R,m,v=1,1,0)
  20515. =>WM: (14641: S1 ^operator O2083)
  20516. 1042: O: O2083 (predict-yes)
  20517. --- END Decision Phase ---
  20518. --- Application Phase ---
  20519. --- Firing Productions (PE) For State At Depth 1 ---
  20520. --- Inner Elaboration Phase, active level 1 (S1) ---
  20521. Firing apply*operator
  20522. -->
  20523. (I3 ^predict-yes N1042 + :O )
  20524. Firing apply*operator*complete
  20525. -->
  20526. (I3 ^predict-no N1041 - :O )
  20527. inner elaboration loop at bottom goal.
  20528. --- Change Working Memory (PE) ---
  20529. =>WM: (14642: I3 ^predict-yes N1042)
  20530. <=WM: (14628: N1041 ^status complete)
  20531. <=WM: (14627: I3 ^predict-no N1041)
  20532. --- Firing Productions (IE) For State At Depth 1 ---
  20533. --- Inner Elaboration Phase, active level 1 (S1) ---
  20534. Firing monitor*world
  20535. -->
  20536. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  20537. --- Change Working Memory (IE) ---
  20538. --- END Application Phase ---
  20539. --- Output Phase ---
  20540. ENV: Agent did: predict-yes for direction R in state State-A
  20541. In State-A moving R
  20542. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  20543. predict error 0
  20544. dir: dir isR
  20545. --- END Output Phase ---
  20546. |\--- Input Phase ---
  20547. =>WM: (14646: I2 ^dir R)
  20548. =>WM: (14645: I2 ^reward 1)
  20549. =>WM: (14644: I2 ^see 1)
  20550. =>WM: (14643: N1042 ^status complete)
  20551. <=WM: (14631: I2 ^dir R)
  20552. <=WM: (14630: I2 ^reward 1)
  20553. <=WM: (14629: I2 ^see 0)
  20554. =>WM: (14647: I2 ^level-1 R1-root)
  20555. <=WM: (14632: I2 ^level-1 L0-root)
  20556. --- END Input Phase ---
  20557. --- Proposal Phase ---
  20558. --- Inner Elaboration Phase, active level 1 (S1) ---
  20559. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  20560. -->
  20561. (S1 ^operator O2083 = -0.04253361215288998)
  20562. Firing prefer*rvt*predict-yes*H0*5*H1
  20563. -->
  20564. Firing elaborate*copy-see-to-output-link
  20565. -->
  20566. (I3 ^see 1 +)
  20567. Firing elaborate*reward*based*on*reward
  20568. -->
  20569. (R1046 ^value 1 +)
  20570. (R1 ^reward R1046 +)
  20571. Firing propose*predict-yes
  20572. -->
  20573. (O2085 ^name predict-yes +)
  20574. (S1 ^operator O2085 +)
  20575. Firing propose*predict-no
  20576. -->
  20577. (O2086 ^name predict-no +)
  20578. (S1 ^operator O2086 +)
  20579. Firing rl*prefer*rvt*predict-no*H0*6
  20580. -->
  20581. (S1 ^operator O2084 = 0.9999934438786788)
  20582. Firing rl*prefer*rvt*predict-yes*H0*5
  20583. -->
  20584. (S1 ^operator O2083 = 0.1215966971063918)
  20585. Firing prefer*rvt*predict-yes*H0
  20586. -->
  20587. Firing prefer*rvt*predict-no*H0
  20588. -->
  20589. Firing elaborate*copy-dir-to-output-link
  20590. -->
  20591. (I3 ^dir R +)
  20592. inner elaboration loop at bottom goal.
  20593. Retracting elaborate*copy-see-to-output-link
  20594. -->
  20595. (I3 ^see 0 +)
  20596. Retracting propose*predict-no
  20597. -->
  20598. (O2084 ^name predict-no +)
  20599. (S1 ^operator O2084 +)
  20600. Retracting propose*predict-yes
  20601. -->
  20602. (O2083 ^name predict-yes +)
  20603. (S1 ^operator O2083 +)
  20604. Retracting elaborate*reward*based*on*reward
  20605. -->
  20606. (R1045 ^value 1 +)
  20607. (R1 ^reward R1045 +)
  20608. Retracting elaborate*copy-dir-to-output-link
  20609. -->
  20610. (I3 ^dir R +)
  20611. Retracting rl*prefer*rvt*predict-no*H0*6
  20612. -->
  20613. (S1 ^operator O2084 = 0.9999934438786788)
  20614. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  20615. -->
  20616. (S1 ^operator O2083 = 0.8783973744177012)
  20617. Retracting rl*prefer*rvt*predict-yes*H0*5
  20618. -->
  20619. (S1 ^operator O2083 = 0.1215966971063918)
  20620. =>WM: (14654: S1 ^operator O2086 +)
  20621. =>WM: (14653: S1 ^operator O2085 +)
  20622. =>WM: (14652: O2086 ^name predict-no)
  20623. =>WM: (14651: O2085 ^name predict-yes)
  20624. =>WM: (14650: R1046 ^value 1)
  20625. =>WM: (14649: R1 ^reward R1046)
  20626. =>WM: (14648: I3 ^see 1)
  20627. <=WM: (14639: S1 ^operator O2083 +)
  20628. <=WM: (14641: S1 ^operator O2083)
  20629. <=WM: (14640: S1 ^operator O2084 +)
  20630. <=WM: (14634: R1 ^reward R1045)
  20631. <=WM: (14633: I3 ^see 0)
  20632. <=WM: (14637: O2084 ^name predict-no)
  20633. <=WM: (14636: O2083 ^name predict-yes)
  20634. <=WM: (14635: R1045 ^value 1)
  20635. --- Inner Elaboration Phase, active level 1 (S1) ---
  20636. Firing prefer*rvt*predict-yes*H0
  20637. -->
  20638. Firing rl*prefer*rvt*predict-yes*H0*5
  20639. -->
  20640. (S1 ^operator O2085 = 0.1215966971063918)
  20641. Firing prefer*rvt*predict-yes*H0*5*H1
  20642. -->
  20643. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  20644. -->
  20645. (S1 ^operator O2085 = -0.04253361215288998)
  20646. Firing prefer*rvt*predict-no*H0
  20647. -->
  20648. Firing rl*prefer*rvt*predict-no*H0*6
  20649. -->
  20650. (S1 ^operator O2086 = 0.9999934438786788)
  20651. inner elaboration loop at bottom goal.
  20652. Retracting rl*prefer*rvt*predict-no*H0*6
  20653. -->
  20654. (S1 ^operator O2084 = 0.9999934438786788)
  20655. Retracting rl*prefer*rvt*predict-yes*H0*5
  20656. -->
  20657. (S1 ^operator O2083 = 0.1215966971063918)
  20658. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  20659. -->
  20660. (S1 ^operator O2083 = -0.04253361215288998)
  20661. --- END Proposal Phase ---
  20662. --- Decision Phase ---
  20663. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.870968,0.11299)
  20664. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465472 0.412925 0.878397 -> 0.465473 0.412925 0.878398(R,m,v=1,1,0)
  20665. =>WM: (14655: S1 ^operator O2086)
  20666. 1043: O: O2086 (predict-no)
  20667. --- END Decision Phase ---
  20668. --- Application Phase ---
  20669. --- Firing Productions (PE) For State At Depth 1 ---
  20670. --- Inner Elaboration Phase, active level 1 (S1) ---
  20671. Firing apply*operator
  20672. -->
  20673. (I3 ^predict-no N1043 + :O )
  20674. Firing apply*operator*complete
  20675. -->
  20676. (I3 ^predict-yes N1042 - :O )
  20677. inner elaboration loop at bottom goal.
  20678. --- Change Working Memory (PE) ---
  20679. =>WM: (14656: I3 ^predict-no N1043)
  20680. <=WM: (14643: N1042 ^status complete)
  20681. <=WM: (14642: I3 ^predict-yes N1042)
  20682. --- Firing Productions (IE) For State At Depth 1 ---
  20683. --- Inner Elaboration Phase, active level 1 (S1) ---
  20684. Firing monitor*world
  20685. -->
  20686. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20687. --- Change Working Memory (IE) ---
  20688. --- END Application Phase ---
  20689. --- Output Phase ---
  20690. ENV: Agent did: predict-no for direction R in state State-B
  20691. In State-B moving R
  20692. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20693. predict error 0
  20694. dir: dir isR
  20695. --- END Output Phase ---
  20696. --- Input Phase ---
  20697. =>WM: (14660: I2 ^dir R)
  20698. =>WM: (14659: I2 ^reward 1)
  20699. =>WM: (14658: I2 ^see 0)
  20700. =>WM: (14657: N1043 ^status complete)
  20701. <=WM: (14646: I2 ^dir R)
  20702. <=WM: (14645: I2 ^reward 1)
  20703. <=WM: (14644: I2 ^see 1)
  20704. =>WM: (14661: I2 ^level-1 R0-root)
  20705. <=WM: (14647: I2 ^level-1 R1-root)
  20706. --- END Input Phase ---
  20707. --- Proposal Phase ---
  20708. --- Inner Elaboration Phase, active level 1 (S1) ---
  20709. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  20710. -->
  20711. (S1 ^operator O2085 = -0.1512366769350551)
  20712. Firing prefer*rvt*predict-yes*H0*5*H1
  20713. -->
  20714. Firing elaborate*copy-see-to-output-link
  20715. -->
  20716. (I3 ^see 0 +)
  20717. Firing elaborate*reward*based*on*reward
  20718. -->
  20719. (R1047 ^value 1 +)
  20720. (R1 ^reward R1047 +)
  20721. Firing propose*predict-yes
  20722. -->
  20723. (O2087 ^name predict-yes +)
  20724. (S1 ^operator O2087 +)
  20725. Firing propose*predict-no
  20726. -->
  20727. (O2088 ^name predict-no +)
  20728. (S1 ^operator O2088 +)
  20729. Firing rl*prefer*rvt*predict-no*H0*6
  20730. -->
  20731. (S1 ^operator O2086 = 0.9999934438786788)
  20732. Firing rl*prefer*rvt*predict-yes*H0*5
  20733. -->
  20734. (S1 ^operator O2085 = 0.1215971732320855)
  20735. Firing prefer*rvt*predict-yes*H0
  20736. -->
  20737. Firing prefer*rvt*predict-no*H0
  20738. -->
  20739. Firing elaborate*copy-dir-to-output-link
  20740. -->
  20741. (I3 ^dir R +)
  20742. inner elaboration loop at bottom goal.
  20743. Retracting elaborate*copy-see-to-output-link
  20744. -->
  20745. (I3 ^see 1 +)
  20746. Retracting propose*predict-no
  20747. -->
  20748. (O2086 ^name predict-no +)
  20749. (S1 ^operator O2086 +)
  20750. Retracting propose*predict-yes
  20751. -->
  20752. (O2085 ^name predict-yes +)
  20753. (S1 ^operator O2085 +)
  20754. Retracting elaborate*reward*based*on*reward
  20755. -->
  20756. (R1046 ^value 1 +)
  20757. (R1 ^reward R1046 +)
  20758. Retracting elaborate*copy-dir-to-output-link
  20759. -->
  20760. (I3 ^dir R +)
  20761. Retracting rl*prefer*rvt*predict-no*H0*6
  20762. -->
  20763. (S1 ^operator O2086 = 0.9999934438786788)
  20764. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  20765. -->
  20766. (S1 ^operator O2085 = -0.04253361215288998)
  20767. Retracting rl*prefer*rvt*predict-yes*H0*5
  20768. -->
  20769. (S1 ^operator O2085 = 0.1215971732320855)
  20770. =>WM: (14668: S1 ^operator O2088 +)
  20771. =>WM: (14667: S1 ^operator O2087 +)
  20772. =>WM: (14666: O2088 ^name predict-no)
  20773. =>WM: (14665: O2087 ^name predict-yes)
  20774. =>WM: (14664: R1047 ^value 1)
  20775. =>WM: (14663: R1 ^reward R1047)
  20776. =>WM: (14662: I3 ^see 0)
  20777. <=WM: (14653: S1 ^operator O2085 +)
  20778. <=WM: (14654: S1 ^operator O2086 +)
  20779. <=WM: (14655: S1 ^operator O2086)
  20780. <=WM: (14649: R1 ^reward R1046)
  20781. <=WM: (14648: I3 ^see 1)
  20782. <=WM: (14652: O2086 ^name predict-no)
  20783. <=WM: (14651: O2085 ^name predict-yes)
  20784. <=WM: (14650: R1046 ^value 1)
  20785. --- Inner Elaboration Phase, active level 1 (S1) ---
  20786. Firing prefer*rvt*predict-yes*H0
  20787. -->
  20788. Firing rl*prefer*rvt*predict-yes*H0*5
  20789. -->
  20790. (S1 ^operator O2087 = 0.1215971732320855)
  20791. Firing prefer*rvt*predict-yes*H0*5*H1
  20792. -->
  20793. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  20794. -->
  20795. (S1 ^operator O2087 = -0.1512366769350551)
  20796. Firing prefer*rvt*predict-no*H0
  20797. -->
  20798. Firing rl*prefer*rvt*predict-no*H0*6
  20799. -->
  20800. (S1 ^operator O2088 = 0.9999934438786788)
  20801. inner elaboration loop at bottom goal.
  20802. Retracting rl*prefer*rvt*predict-no*H0*6
  20803. -->
  20804. (S1 ^operator O2086 = 0.9999934438786788)
  20805. Retracting rl*prefer*rvt*predict-yes*H0*5
  20806. -->
  20807. (S1 ^operator O2085 = 0.1215971732320855)
  20808. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  20809. -->
  20810. (S1 ^operator O2085 = -0.1512366769350551)
  20811. --- END Proposal Phase ---
  20812. --- Decision Phase ---
  20813. RL update rl*prefer*rvt*predict-no*H0*6 0.999993 0 0.999993 -> 0.999995 0 0.999995(R,m,v=1,0.939227,0.0573972)
  20814. =>WM: (14669: S1 ^operator O2088)
  20815. 1044: O: O2088 (predict-no)
  20816. --- END Decision Phase ---
  20817. --- Application Phase ---
  20818. --- Firing Productions (PE) For State At Depth 1 ---
  20819. --- Inner Elaboration Phase, active level 1 (S1) ---
  20820. Firing apply*operator
  20821. -->
  20822. (I3 ^predict-no N1044 + :O )
  20823. Firing apply*operator*complete
  20824. -->
  20825. (I3 ^predict-no N1043 - :O )
  20826. inner elaboration loop at bottom goal.
  20827. --- Change Working Memory (PE) ---
  20828. =>WM: (14670: I3 ^predict-no N1044)
  20829. <=WM: (14657: N1043 ^status complete)
  20830. <=WM: (14656: I3 ^predict-no N1043)
  20831. --- Firing Productions (IE) For State At Depth 1 ---
  20832. --- Inner Elaboration Phase, active level 1 (S1) ---
  20833. Firing monitor*world
  20834. -->
  20835. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20836. --- Change Working Memory (IE) ---
  20837. --- END Application Phase ---
  20838. --- Output Phase ---
  20839. ENV: Agent did: predict-no for direction R in state State-B
  20840. In State-B moving R
  20841. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20842. predict error 0
  20843. dir: dir isU
  20844. --- END Output Phase ---
  20845. --- Input Phase ---
  20846. =>WM: (14674: I2 ^dir U)
  20847. =>WM: (14673: I2 ^reward 1)
  20848. =>WM: (14672: I2 ^see 0)
  20849. =>WM: (14671: N1044 ^status complete)
  20850. <=WM: (14660: I2 ^dir R)
  20851. <=WM: (14659: I2 ^reward 1)
  20852. <=WM: (14658: I2 ^see 0)
  20853. =>WM: (14675: I2 ^level-1 R0-root)
  20854. <=WM: (14661: I2 ^level-1 R0-root)
  20855. --- END Input Phase ---
  20856. --- Proposal Phase ---
  20857. --- Inner Elaboration Phase, active level 1 (S1) ---
  20858. Firing elaborate*copy-see-to-output-link
  20859. -->
  20860. (I3 ^see 0 +)
  20861. Firing elaborate*reward*based*on*reward
  20862. -->
  20863. (R1048 ^value 1 +)
  20864. (R1 ^reward R1048 +)
  20865. Firing propose*predict-yes
  20866. -->
  20867. (O2089 ^name predict-yes +)
  20868. (S1 ^operator O2089 +)
  20869. Firing propose*predict-no
  20870. -->
  20871. (O2090 ^name predict-no +)
  20872. (S1 ^operator O2090 +)
  20873. Firing rl*prefer*rvt*predict-no*H0*2
  20874. -->
  20875. (S1 ^operator O2088 = 1.)
  20876. Firing rl*prefer*rvt*predict-yes*H0*1
  20877. -->
  20878. (S1 ^operator O2087 = 0.)
  20879. Firing prefer*rvt*predict-yes*H0
  20880. -->
  20881. Firing prefer*rvt*predict-no*H0
  20882. -->
  20883. Firing elaborate*copy-dir-to-output-link
  20884. -->
  20885. (I3 ^dir U +)
  20886. inner elaboration loop at bottom goal.
  20887. Retracting elaborate*copy-see-to-output-link
  20888. -->
  20889. (I3 ^see 0 +)
  20890. Retracting propose*predict-no
  20891. -->
  20892. (O2088 ^name predict-no +)
  20893. (S1 ^operator O2088 +)
  20894. Retracting propose*predict-yes
  20895. -->
  20896. (O2087 ^name predict-yes +)
  20897. (S1 ^operator O2087 +)
  20898. Retracting elaborate*reward*based*on*reward
  20899. -->
  20900. (R1047 ^value 1 +)
  20901. (R1 ^reward R1047 +)
  20902. Retracting elaborate*copy-dir-to-output-link
  20903. -->
  20904. (I3 ^dir R +)
  20905. Retracting rl*prefer*rvt*predict-no*H0*6
  20906. -->
  20907. (S1 ^operator O2088 = 0.999994501574002)
  20908. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  20909. -->
  20910. (S1 ^operator O2087 = -0.1512366769350551)
  20911. Retracting rl*prefer*rvt*predict-yes*H0*5
  20912. -->
  20913. (S1 ^operator O2087 = 0.1215971732320855)
  20914. =>WM: (14682: S1 ^operator O2090 +)
  20915. =>WM: (14681: S1 ^operator O2089 +)
  20916. =>WM: (14680: I3 ^dir U)
  20917. =>WM: (14679: O2090 ^name predict-no)
  20918. =>WM: (14678: O2089 ^name predict-yes)
  20919. =>WM: (14677: R1048 ^value 1)
  20920. =>WM: (14676: R1 ^reward R1048)
  20921. <=WM: (14667: S1 ^operator O2087 +)
  20922. <=WM: (14668: S1 ^operator O2088 +)
  20923. <=WM: (14669: S1 ^operator O2088)
  20924. <=WM: (14638: I3 ^dir R)
  20925. <=WM: (14663: R1 ^reward R1047)
  20926. <=WM: (14666: O2088 ^name predict-no)
  20927. <=WM: (14665: O2087 ^name predict-yes)
  20928. <=WM: (14664: R1047 ^value 1)
  20929. --- Inner Elaboration Phase, active level 1 (S1) ---
  20930. Firing prefer*rvt*predict-yes*H0
  20931. -->
  20932. Firing rl*prefer*rvt*predict-yes*H0*1
  20933. -->
  20934. (S1 ^operator O2089 = 0.)
  20935. Firing prefer*rvt*predict-no*H0
  20936. -->
  20937. Firing rl*prefer*rvt*predict-no*H0*2
  20938. -->
  20939. (S1 ^operator O2090 = 1.)
  20940. inner elaboration loop at bottom goal.
  20941. Retracting rl*prefer*rvt*predict-no*H0*2
  20942. -->
  20943. (S1 ^operator O2088 = 1.)
  20944. Retracting rl*prefer*rvt*predict-yes*H0*1
  20945. -->
  20946. (S1 ^operator O2087 = 0.)
  20947. --- END Proposal Phase ---
  20948. --- Decision Phase ---
  20949. RL update rl*prefer*rvt*predict-no*H0*6 0.999995 0 0.999995 -> 0.999995 0 0.999995(R,m,v=1,0.93956,0.0571004)
  20950. =>WM: (14683: S1 ^operator O2090)
  20951. 1045: O: O2090 (predict-no)
  20952. --- END Decision Phase ---
  20953. --- Application Phase ---
  20954. --- Firing Productions (PE) For State At Depth 1 ---
  20955. --- Inner Elaboration Phase, active level 1 (S1) ---
  20956. Firing apply*operator
  20957. -->
  20958. (I3 ^predict-no N1045 + :O )
  20959. Firing apply*operator*complete
  20960. -->
  20961. (I3 ^predict-no N1044 - :O )
  20962. inner elaboration loop at bottom goal.
  20963. --- Change Working Memory (PE) ---
  20964. =>WM: (14684: I3 ^predict-no N1045)
  20965. <=WM: (14671: N1044 ^status complete)
  20966. <=WM: (14670: I3 ^predict-no N1044)
  20967. --- Firing Productions (IE) For State At Depth 1 ---
  20968. --- Inner Elaboration Phase, active level 1 (S1) ---
  20969. Firing monitor*world
  20970. -->
  20971. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  20972. --- Change Working Memory (IE) ---
  20973. --- END Application Phase ---
  20974. --- Output Phase ---
  20975. ENV: Agent did: predict-no for direction U in state State-B
  20976. In State-B moving U
  20977. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  20978. predict error 0
  20979. dir: dir isR
  20980. --- END Output Phase ---
  20981. --- Input Phase ---
  20982. =>WM: (14688: I2 ^dir R)
  20983. =>WM: (14687: I2 ^reward 1)
  20984. =>WM: (14686: I2 ^see 0)
  20985. =>WM: (14685: N1045 ^status complete)
  20986. <=WM: (14674: I2 ^dir U)
  20987. <=WM: (14673: I2 ^reward 1)
  20988. <=WM: (14672: I2 ^see 0)
  20989. =>WM: (14689: I2 ^level-1 R0-root)
  20990. <=WM: (14675: I2 ^level-1 R0-root)
  20991. --- END Input Phase ---
  20992. --- Proposal Phase ---
  20993. --- Inner Elaboration Phase, active level 1 (S1) ---
  20994. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  20995. -->
  20996. (S1 ^operator O2089 = -0.1512366769350551)
  20997. Firing prefer*rvt*predict-yes*H0*5*H1
  20998. -->
  20999. Firing elaborate*copy-see-to-output-link
  21000. -->
  21001. (I3 ^see 0 +)
  21002. Firing elaborate*reward*based*on*reward
  21003. -->
  21004. (R1049 ^value 1 +)
  21005. (R1 ^reward R1049 +)
  21006. Firing propose*predict-yes
  21007. -->
  21008. (O2091 ^name predict-yes +)
  21009. (S1 ^operator O2091 +)
  21010. Firing propose*predict-no
  21011. -->
  21012. (O2092 ^name predict-no +)
  21013. (S1 ^operator O2092 +)
  21014. Firing rl*prefer*rvt*predict-no*H0*6
  21015. -->
  21016. (S1 ^operator O2090 = 0.9999953878441619)
  21017. Firing rl*prefer*rvt*predict-yes*H0*5
  21018. -->
  21019. (S1 ^operator O2089 = 0.1215971732320855)
  21020. Firing prefer*rvt*predict-yes*H0
  21021. -->
  21022. Firing prefer*rvt*predict-no*H0
  21023. -->
  21024. Firing elaborate*copy-dir-to-output-link
  21025. -->
  21026. (I3 ^dir R +)
  21027. inner elaboration loop at bottom goal.
  21028. Retracting elaborate*copy-see-to-output-link
  21029. -->
  21030. (I3 ^see 0 +)
  21031. Retracting propose*predict-no
  21032. -->
  21033. (O2090 ^name predict-no +)
  21034. (S1 ^operator O2090 +)
  21035. Retracting propose*predict-yes
  21036. -->
  21037. (O2089 ^name predict-yes +)
  21038. (S1 ^operator O2089 +)
  21039. Retracting elaborate*reward*based*on*reward
  21040. -->
  21041. (R1048 ^value 1 +)
  21042. (R1 ^reward R1048 +)
  21043. Retracting elaborate*copy-dir-to-output-link
  21044. -->
  21045. (I3 ^dir U +)
  21046. Retracting rl*prefer*rvt*predict-no*H0*2
  21047. -->
  21048. (S1 ^operator O2090 = 1.)
  21049. Retracting rl*prefer*rvt*predict-yes*H0*1
  21050. -->
  21051. (S1 ^operator O2089 = 0.)
  21052. =>WM: (14696: S1 ^operator O2092 +)
  21053. =>WM: (14695: S1 ^operator O2091 +)
  21054. =>WM: (14694: I3 ^dir R)
  21055. =>WM: (14693: O2092 ^name predict-no)
  21056. =>WM: (14692: O2091 ^name predict-yes)
  21057. =>WM: (14691: R1049 ^value 1)
  21058. =>WM: (14690: R1 ^reward R1049)
  21059. <=WM: (14681: S1 ^operator O2089 +)
  21060. <=WM: (14682: S1 ^operator O2090 +)
  21061. <=WM: (14683: S1 ^operator O2090)
  21062. <=WM: (14680: I3 ^dir U)
  21063. <=WM: (14676: R1 ^reward R1048)
  21064. <=WM: (14679: O2090 ^name predict-no)
  21065. <=WM: (14678: O2089 ^name predict-yes)
  21066. <=WM: (14677: R1048 ^value 1)
  21067. --- Inner Elaboration Phase, active level 1 (S1) ---
  21068. Firing prefer*rvt*predict-yes*H0
  21069. -->
  21070. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  21071. -->
  21072. (S1 ^operator O2091 = -0.1512366769350551)
  21073. Firing rl*prefer*rvt*predict-yes*H0*5
  21074. -->
  21075. (S1 ^operator O2091 = 0.1215971732320855)
  21076. Firing prefer*rvt*predict-yes*H0*5*H1
  21077. -->
  21078. Firing prefer*rvt*predict-no*H0
  21079. -->
  21080. Firing rl*prefer*rvt*predict-no*H0*6
  21081. -->
  21082. (S1 ^operator O2092 = 0.9999953878441619)
  21083. inner elaboration loop at bottom goal.
  21084. Retracting rl*prefer*rvt*predict-no*H0*6
  21085. -->
  21086. (S1 ^operator O2090 = 0.9999953878441619)
  21087. Retracting rl*prefer*rvt*predict-yes*H0*5
  21088. -->
  21089. (S1 ^operator O2089 = 0.1215971732320855)
  21090. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21091. -->
  21092. (S1 ^operator O2089 = -0.1512366769350551)
  21093. --- END Proposal Phase ---
  21094. --- Decision Phase ---
  21095. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21096. =>WM: (14697: S1 ^operator O2092)
  21097. 1046: O: O2092 (predict-no)
  21098. --- END Decision Phase ---
  21099. --- Application Phase ---
  21100. --- Firing Productions (PE) For State At Depth 1 ---
  21101. --- Inner Elaboration Phase, active level 1 (S1) ---
  21102. Firing apply*operator
  21103. -->
  21104. (I3 ^predict-no N1046 + :O )
  21105. Firing apply*operator*complete
  21106. -->
  21107. (I3 ^predict-no N1045 - :O )
  21108. inner elaboration loop at bottom goal.
  21109. --- Change Working Memory (PE) ---
  21110. =>WM: (14698: I3 ^predict-no N1046)
  21111. <=WM: (14685: N1045 ^status complete)
  21112. <=WM: (14684: I3 ^predict-no N1045)
  21113. --- Firing Productions (IE) For State At Depth 1 ---
  21114. --- Inner Elaboration Phase, active level 1 (S1) ---
  21115. Firing monitor*world
  21116. -->
  21117. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21118. --- Change Working Memory (IE) ---
  21119. --- END Application Phase ---
  21120. --- Output Phase ---
  21121. ENV: Agent did: predict-no for direction R in state State-B
  21122. In State-B moving R
  21123. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21124. predict error 0
  21125. dir: dir isU
  21126. --- END Output Phase ---
  21127. --- Input Phase ---
  21128. =>WM: (14702: I2 ^dir U)
  21129. =>WM: (14701: I2 ^reward 1)
  21130. =>WM: (14700: I2 ^see 0)
  21131. =>WM: (14699: N1046 ^status complete)
  21132. <=WM: (14688: I2 ^dir R)
  21133. <=WM: (14687: I2 ^reward 1)
  21134. <=WM: (14686: I2 ^see 0)
  21135. =>WM: (14703: I2 ^level-1 R0-root)
  21136. <=WM: (14689: I2 ^level-1 R0-root)
  21137. --- END Input Phase ---
  21138. --- Proposal Phase ---
  21139. --- Inner Elaboration Phase, active level 1 (S1) ---
  21140. Firing elaborate*copy-see-to-output-link
  21141. -->
  21142. (I3 ^see 0 +)
  21143. Firing elaborate*reward*based*on*reward
  21144. -->
  21145. (R1050 ^value 1 +)
  21146. (R1 ^reward R1050 +)
  21147. Firing propose*predict-yes
  21148. -->
  21149. (O2093 ^name predict-yes +)
  21150. (S1 ^operator O2093 +)
  21151. Firing propose*predict-no
  21152. -->
  21153. (O2094 ^name predict-no +)
  21154. (S1 ^operator O2094 +)
  21155. Firing rl*prefer*rvt*predict-no*H0*2
  21156. -->
  21157. (S1 ^operator O2092 = 1.)
  21158. Firing rl*prefer*rvt*predict-yes*H0*1
  21159. -->
  21160. (S1 ^operator O2091 = 0.)
  21161. Firing prefer*rvt*predict-yes*H0
  21162. -->
  21163. Firing prefer*rvt*predict-no*H0
  21164. -->
  21165. Firing elaborate*copy-dir-to-output-link
  21166. -->
  21167. (I3 ^dir U +)
  21168. inner elaboration loop at bottom goal.
  21169. Retracting elaborate*copy-see-to-output-link
  21170. -->
  21171. (I3 ^see 0 +)
  21172. Retracting propose*predict-no
  21173. -->
  21174. (O2092 ^name predict-no +)
  21175. (S1 ^operator O2092 +)
  21176. Retracting propose*predict-yes
  21177. -->
  21178. (O2091 ^name predict-yes +)
  21179. (S1 ^operator O2091 +)
  21180. Retracting elaborate*reward*based*on*reward
  21181. -->
  21182. (R1049 ^value 1 +)
  21183. (R1 ^reward R1049 +)
  21184. Retracting elaborate*copy-dir-to-output-link
  21185. -->
  21186. (I3 ^dir R +)
  21187. Retracting rl*prefer*rvt*predict-no*H0*6
  21188. -->
  21189. (S1 ^operator O2092 = 0.9999953878441619)
  21190. Retracting rl*prefer*rvt*predict-yes*H0*5
  21191. -->
  21192. (S1 ^operator O2091 = 0.1215971732320855)
  21193. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21194. -->
  21195. (S1 ^operator O2091 = -0.1512366769350551)
  21196. =>WM: (14710: S1 ^operator O2094 +)
  21197. =>WM: (14709: S1 ^operator O2093 +)
  21198. =>WM: (14708: I3 ^dir U)
  21199. =>WM: (14707: O2094 ^name predict-no)
  21200. =>WM: (14706: O2093 ^name predict-yes)
  21201. =>WM: (14705: R1050 ^value 1)
  21202. =>WM: (14704: R1 ^reward R1050)
  21203. <=WM: (14695: S1 ^operator O2091 +)
  21204. <=WM: (14696: S1 ^operator O2092 +)
  21205. <=WM: (14697: S1 ^operator O2092)
  21206. <=WM: (14694: I3 ^dir R)
  21207. <=WM: (14690: R1 ^reward R1049)
  21208. <=WM: (14693: O2092 ^name predict-no)
  21209. <=WM: (14692: O2091 ^name predict-yes)
  21210. <=WM: (14691: R1049 ^value 1)
  21211. --- Inner Elaboration Phase, active level 1 (S1) ---
  21212. Firing prefer*rvt*predict-yes*H0
  21213. -->
  21214. Firing rl*prefer*rvt*predict-yes*H0*1
  21215. -->
  21216. (S1 ^operator O2093 = 0.)
  21217. Firing prefer*rvt*predict-no*H0
  21218. -->
  21219. Firing rl*prefer*rvt*predict-no*H0*2
  21220. -->
  21221. (S1 ^operator O2094 = 1.)
  21222. inner elaboration loop at bottom goal.
  21223. Retracting rl*prefer*rvt*predict-no*H0*2
  21224. -->
  21225. (S1 ^operator O2092 = 1.)
  21226. Retracting rl*prefer*rvt*predict-yes*H0*1
  21227. -->
  21228. (S1 ^operator O2091 = 0.)
  21229. --- END Proposal Phase ---
  21230. --- Decision Phase ---
  21231. RL update rl*prefer*rvt*predict-no*H0*6 0.999995 0 0.999995 -> 0.999996 0 0.999996(R,m,v=1,0.939891,0.0568066)
  21232. =>WM: (14711: S1 ^operator O2094)
  21233. 1047: O: O2094 (predict-no)
  21234. --- END Decision Phase ---
  21235. --- Application Phase ---
  21236. --- Firing Productions (PE) For State At Depth 1 ---
  21237. --- Inner Elaboration Phase, active level 1 (S1) ---
  21238. Firing apply*operator
  21239. -->
  21240. (I3 ^predict-no N1047 + :O )
  21241. Firing apply*operator*complete
  21242. -->
  21243. (I3 ^predict-no N1046 - :O )
  21244. inner elaboration loop at bottom goal.
  21245. --- Change Working Memory (PE) ---
  21246. =>WM: (14712: I3 ^predict-no N1047)
  21247. <=WM: (14699: N1046 ^status complete)
  21248. <=WM: (14698: I3 ^predict-no N1046)
  21249. --- Firing Productions (IE) For State At Depth 1 ---
  21250. --- Inner Elaboration Phase, active level 1 (S1) ---
  21251. Firing monitor*world
  21252. -->
  21253. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21254. --- Change Working Memory (IE) ---
  21255. --- END Application Phase ---
  21256. --- Output Phase ---
  21257. ENV: Agent did: predict-no for direction U in state State-B
  21258. In State-B moving U
  21259. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21260. predict error 0
  21261. dir: dir isR
  21262. --- END Output Phase ---
  21263. --- Input Phase ---
  21264. =>WM: (14716: I2 ^dir R)
  21265. =>WM: (14715: I2 ^reward 1)
  21266. =>WM: (14714: I2 ^see 0)
  21267. =>WM: (14713: N1047 ^status complete)
  21268. <=WM: (14702: I2 ^dir U)
  21269. <=WM: (14701: I2 ^reward 1)
  21270. <=WM: (14700: I2 ^see 0)
  21271. =>WM: (14717: I2 ^level-1 R0-root)
  21272. <=WM: (14703: I2 ^level-1 R0-root)
  21273. --- END Input Phase ---
  21274. --- Proposal Phase ---
  21275. --- Inner Elaboration Phase, active level 1 (S1) ---
  21276. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  21277. -->
  21278. (S1 ^operator O2093 = -0.1512366769350551)
  21279. Firing prefer*rvt*predict-yes*H0*5*H1
  21280. -->
  21281. Firing elaborate*copy-see-to-output-link
  21282. -->
  21283. (I3 ^see 0 +)
  21284. Firing elaborate*reward*based*on*reward
  21285. -->
  21286. (R1051 ^value 1 +)
  21287. (R1 ^reward R1051 +)
  21288. Firing propose*predict-yes
  21289. -->
  21290. (O2095 ^name predict-yes +)
  21291. (S1 ^operator O2095 +)
  21292. Firing propose*predict-no
  21293. -->
  21294. (O2096 ^name predict-no +)
  21295. (S1 ^operator O2096 +)
  21296. Firing rl*prefer*rvt*predict-no*H0*6
  21297. -->
  21298. (S1 ^operator O2094 = 0.9999961306038242)
  21299. Firing rl*prefer*rvt*predict-yes*H0*5
  21300. -->
  21301. (S1 ^operator O2093 = 0.1215971732320855)
  21302. Firing prefer*rvt*predict-yes*H0
  21303. -->
  21304. Firing prefer*rvt*predict-no*H0
  21305. -->
  21306. Firing elaborate*copy-dir-to-output-link
  21307. -->
  21308. (I3 ^dir R +)
  21309. inner elaboration loop at bottom goal.
  21310. Retracting elaborate*copy-see-to-output-link
  21311. -->
  21312. (I3 ^see 0 +)
  21313. Retracting propose*predict-no
  21314. -->
  21315. (O2094 ^name predict-no +)
  21316. (S1 ^operator O2094 +)
  21317. Retracting propose*predict-yes
  21318. -->
  21319. (O2093 ^name predict-yes +)
  21320. (S1 ^operator O2093 +)
  21321. Retracting elaborate*reward*based*on*reward
  21322. -->
  21323. (R1050 ^value 1 +)
  21324. (R1 ^reward R1050 +)
  21325. Retracting elaborate*copy-dir-to-output-link
  21326. -->
  21327. (I3 ^dir U +)
  21328. Retracting rl*prefer*rvt*predict-no*H0*2
  21329. -->
  21330. (S1 ^operator O2094 = 1.)
  21331. Retracting rl*prefer*rvt*predict-yes*H0*1
  21332. -->
  21333. (S1 ^operator O2093 = 0.)
  21334. =>WM: (14724: S1 ^operator O2096 +)
  21335. =>WM: (14723: S1 ^operator O2095 +)
  21336. =>WM: (14722: I3 ^dir R)
  21337. =>WM: (14721: O2096 ^name predict-no)
  21338. =>WM: (14720: O2095 ^name predict-yes)
  21339. =>WM: (14719: R1051 ^value 1)
  21340. =>WM: (14718: R1 ^reward R1051)
  21341. <=WM: (14709: S1 ^operator O2093 +)
  21342. <=WM: (14710: S1 ^operator O2094 +)
  21343. <=WM: (14711: S1 ^operator O2094)
  21344. <=WM: (14708: I3 ^dir U)
  21345. <=WM: (14704: R1 ^reward R1050)
  21346. <=WM: (14707: O2094 ^name predict-no)
  21347. <=WM: (14706: O2093 ^name predict-yes)
  21348. <=WM: (14705: R1050 ^value 1)
  21349. --- Inner Elaboration Phase, active level 1 (S1) ---
  21350. Firing prefer*rvt*predict-yes*H0
  21351. -->
  21352. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  21353. -->
  21354. (S1 ^operator O2095 = -0.1512366769350551)
  21355. Firing rl*prefer*rvt*predict-yes*H0*5
  21356. -->
  21357. (S1 ^operator O2095 = 0.1215971732320855)
  21358. Firing prefer*rvt*predict-yes*H0*5*H1
  21359. -->
  21360. Firing prefer*rvt*predict-no*H0
  21361. -->
  21362. Firing rl*prefer*rvt*predict-no*H0*6
  21363. -->
  21364. (S1 ^operator O2096 = 0.9999961306038242)
  21365. inner elaboration loop at bottom goal.
  21366. Retracting rl*prefer*rvt*predict-no*H0*6
  21367. -->
  21368. (S1 ^operator O2094 = 0.9999961306038242)
  21369. Retracting rl*prefer*rvt*predict-yes*H0*5
  21370. -->
  21371. (S1 ^operator O2093 = 0.1215971732320855)
  21372. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21373. -->
  21374. (S1 ^operator O2093 = -0.1512366769350551)
  21375. --- END Proposal Phase ---
  21376. --- Decision Phase ---
  21377. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21378. =>WM: (14725: S1 ^operator O2096)
  21379. 1048: O: O2096 (predict-no)
  21380. --- END Decision Phase ---
  21381. --- Application Phase ---
  21382. --- Firing Productions (PE) For State At Depth 1 ---
  21383. --- Inner Elaboration Phase, active level 1 (S1) ---
  21384. Firing apply*operator
  21385. -->
  21386. (I3 ^predict-no N1048 + :O )
  21387. Firing apply*operator*complete
  21388. -->
  21389. (I3 ^predict-no N1047 - :O )
  21390. inner elaboration loop at bottom goal.
  21391. --- Change Working Memory (PE) ---
  21392. =>WM: (14726: I3 ^predict-no N1048)
  21393. <=WM: (14713: N1047 ^status complete)
  21394. <=WM: (14712: I3 ^predict-no N1047)
  21395. --- Firing Productions (IE) For State At Depth 1 ---
  21396. --- Inner Elaboration Phase, active level 1 (S1) ---
  21397. Firing monitor*world
  21398. -->
  21399. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21400. --- Change Working Memory (IE) ---
  21401. --- END Application Phase ---
  21402. --- Output Phase ---
  21403. ENV: Agent did: predict-no for direction R in state State-B
  21404. In State-B moving R
  21405. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21406. predict error 0
  21407. dir: dir isR
  21408. --- END Output Phase ---
  21409. --- Input Phase ---
  21410. =>WM: (14730: I2 ^dir R)
  21411. =>WM: (14729: I2 ^reward 1)
  21412. =>WM: (14728: I2 ^see 0)
  21413. =>WM: (14727: N1048 ^status complete)
  21414. <=WM: (14716: I2 ^dir R)
  21415. <=WM: (14715: I2 ^reward 1)
  21416. <=WM: (14714: I2 ^see 0)
  21417. =>WM: (14731: I2 ^level-1 R0-root)
  21418. <=WM: (14717: I2 ^level-1 R0-root)
  21419. --- END Input Phase ---
  21420. --- Proposal Phase ---
  21421. --- Inner Elaboration Phase, active level 1 (S1) ---
  21422. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  21423. -->
  21424. (S1 ^operator O2095 = -0.1512366769350551)
  21425. Firing prefer*rvt*predict-yes*H0*5*H1
  21426. -->
  21427. Firing elaborate*copy-see-to-output-link
  21428. -->
  21429. (I3 ^see 0 +)
  21430. Firing elaborate*reward*based*on*reward
  21431. -->
  21432. (R1052 ^value 1 +)
  21433. (R1 ^reward R1052 +)
  21434. Firing propose*predict-yes
  21435. -->
  21436. (O2097 ^name predict-yes +)
  21437. (S1 ^operator O2097 +)
  21438. Firing propose*predict-no
  21439. -->
  21440. (O2098 ^name predict-no +)
  21441. (S1 ^operator O2098 +)
  21442. Firing rl*prefer*rvt*predict-no*H0*6
  21443. -->
  21444. (S1 ^operator O2096 = 0.9999961306038242)
  21445. Firing rl*prefer*rvt*predict-yes*H0*5
  21446. -->
  21447. (S1 ^operator O2095 = 0.1215971732320855)
  21448. Firing prefer*rvt*predict-yes*H0
  21449. -->
  21450. Firing prefer*rvt*predict-no*H0
  21451. -->
  21452. Firing elaborate*copy-dir-to-output-link
  21453. -->
  21454. (I3 ^dir R +)
  21455. inner elaboration loop at bottom goal.
  21456. Retracting elaborate*copy-see-to-output-link
  21457. -->
  21458. (I3 ^see 0 +)
  21459. Retracting propose*predict-no
  21460. -->
  21461. (O2096 ^name predict-no +)
  21462. (S1 ^operator O2096 +)
  21463. Retracting propose*predict-yes
  21464. -->
  21465. (O2095 ^name predict-yes +)
  21466. (S1 ^operator O2095 +)
  21467. Retracting elaborate*reward*based*on*reward
  21468. -->
  21469. (R1051 ^value 1 +)
  21470. (R1 ^reward R1051 +)
  21471. Retracting elaborate*copy-dir-to-output-link
  21472. -->
  21473. (I3 ^dir R +)
  21474. Retracting rl*prefer*rvt*predict-no*H0*6
  21475. -->
  21476. (S1 ^operator O2096 = 0.9999961306038242)
  21477. Retracting rl*prefer*rvt*predict-yes*H0*5
  21478. -->
  21479. (S1 ^operator O2095 = 0.1215971732320855)
  21480. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21481. -->
  21482. (S1 ^operator O2095 = -0.1512366769350551)
  21483. =>WM: (14737: S1 ^operator O2098 +)
  21484. =>WM: (14736: S1 ^operator O2097 +)
  21485. =>WM: (14735: O2098 ^name predict-no)
  21486. =>WM: (14734: O2097 ^name predict-yes)
  21487. =>WM: (14733: R1052 ^value 1)
  21488. =>WM: (14732: R1 ^reward R1052)
  21489. <=WM: (14723: S1 ^operator O2095 +)
  21490. <=WM: (14724: S1 ^operator O2096 +)
  21491. <=WM: (14725: S1 ^operator O2096)
  21492. <=WM: (14718: R1 ^reward R1051)
  21493. <=WM: (14721: O2096 ^name predict-no)
  21494. <=WM: (14720: O2095 ^name predict-yes)
  21495. <=WM: (14719: R1051 ^value 1)
  21496. --- Inner Elaboration Phase, active level 1 (S1) ---
  21497. Firing prefer*rvt*predict-yes*H0
  21498. -->
  21499. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  21500. -->
  21501. (S1 ^operator O2097 = -0.1512366769350551)
  21502. Firing rl*prefer*rvt*predict-yes*H0*5
  21503. -->
  21504. (S1 ^operator O2097 = 0.1215971732320855)
  21505. Firing prefer*rvt*predict-yes*H0*5*H1
  21506. -->
  21507. Firing prefer*rvt*predict-no*H0
  21508. -->
  21509. Firing rl*prefer*rvt*predict-no*H0*6
  21510. -->
  21511. (S1 ^operator O2098 = 0.9999961306038242)
  21512. inner elaboration loop at bottom goal.
  21513. Retracting rl*prefer*rvt*predict-no*H0*6
  21514. -->
  21515. (S1 ^operator O2096 = 0.9999961306038242)
  21516. Retracting rl*prefer*rvt*predict-yes*H0*5
  21517. -->
  21518. (S1 ^operator O2095 = 0.1215971732320855)
  21519. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21520. -->
  21521. (S1 ^operator O2095 = -0.1512366769350551)
  21522. --- END Proposal Phase ---
  21523. --- Decision Phase ---
  21524. RL update rl*prefer*rvt*predict-no*H0*6 0.999996 0 0.999996 -> 0.999997 0 0.999997(R,m,v=1,0.940217,0.0565158)
  21525. =>WM: (14738: S1 ^operator O2098)
  21526. 1049: O: O2098 (predict-no)
  21527. --- END Decision Phase ---
  21528. --- Application Phase ---
  21529. --- Firing Productions (PE) For State At Depth 1 ---
  21530. --- Inner Elaboration Phase, active level 1 (S1) ---
  21531. Firing apply*operator
  21532. -->
  21533. (I3 ^predict-no N1049 + :O )
  21534. Firing apply*operator*complete
  21535. -->
  21536. (I3 ^predict-no N1048 - :O )
  21537. inner elaboration loop at bottom goal.
  21538. --- Change Working Memory (PE) ---
  21539. =>WM: (14739: I3 ^predict-no N1049)
  21540. <=WM: (14727: N1048 ^status complete)
  21541. <=WM: (14726: I3 ^predict-no N1048)
  21542. --- Firing Productions (IE) For State At Depth 1 ---
  21543. --- Inner Elaboration Phase, active level 1 (S1) ---
  21544. Firing monitor*world
  21545. -->
  21546. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21547. --- Change Working Memory (IE) ---
  21548. --- END Application Phase ---
  21549. --- Output Phase ---
  21550. ENV: Agent did: predict-no for direction R in state State-B
  21551. In State-B moving R
  21552. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  21553. predict error 0
  21554. dir: dir isL
  21555. --- END Output Phase ---
  21556. /|--- Input Phase ---
  21557. =>WM: (14743: I2 ^dir L)
  21558. =>WM: (14742: I2 ^reward 1)
  21559. =>WM: (14741: I2 ^see 0)
  21560. =>WM: (14740: N1049 ^status complete)
  21561. <=WM: (14730: I2 ^dir R)
  21562. <=WM: (14729: I2 ^reward 1)
  21563. <=WM: (14728: I2 ^see 0)
  21564. =>WM: (14744: I2 ^level-1 R0-root)
  21565. <=WM: (14731: I2 ^level-1 R0-root)
  21566. --- END Input Phase ---
  21567. --- Proposal Phase ---
  21568. --- Inner Elaboration Phase, active level 1 (S1) ---
  21569. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  21570. -->
  21571. (S1 ^operator O2098 = -0.1984300550322165)
  21572. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  21573. -->
  21574. (S1 ^operator O2097 = 0.6091357162190356)
  21575. Firing prefer*rvt*predict-no*H0*4*H1
  21576. -->
  21577. Firing prefer*rvt*predict-yes*H0*3*H1
  21578. -->
  21579. Firing elaborate*copy-see-to-output-link
  21580. -->
  21581. (I3 ^see 0 +)
  21582. Firing elaborate*reward*based*on*reward
  21583. -->
  21584. (R1053 ^value 1 +)
  21585. (R1 ^reward R1053 +)
  21586. Firing propose*predict-yes
  21587. -->
  21588. (O2099 ^name predict-yes +)
  21589. (S1 ^operator O2099 +)
  21590. Firing propose*predict-no
  21591. -->
  21592. (O2100 ^name predict-no +)
  21593. (S1 ^operator O2100 +)
  21594. Firing rl*prefer*rvt*predict-no*H0*4
  21595. -->
  21596. (S1 ^operator O2098 = 0.314498303095341)
  21597. Firing rl*prefer*rvt*predict-yes*H0*3
  21598. -->
  21599. (S1 ^operator O2097 = 0.3907618357131554)
  21600. Firing prefer*rvt*predict-yes*H0
  21601. -->
  21602. Firing prefer*rvt*predict-no*H0
  21603. -->
  21604. Firing elaborate*copy-dir-to-output-link
  21605. -->
  21606. (I3 ^dir L +)
  21607. inner elaboration loop at bottom goal.
  21608. Retracting elaborate*copy-see-to-output-link
  21609. -->
  21610. (I3 ^see 0 +)
  21611. Retracting propose*predict-no
  21612. -->
  21613. (O2098 ^name predict-no +)
  21614. (S1 ^operator O2098 +)
  21615. Retracting propose*predict-yes
  21616. -->
  21617. (O2097 ^name predict-yes +)
  21618. (S1 ^operator O2097 +)
  21619. Retracting elaborate*reward*based*on*reward
  21620. -->
  21621. (R1052 ^value 1 +)
  21622. (R1 ^reward R1052 +)
  21623. Retracting elaborate*copy-dir-to-output-link
  21624. -->
  21625. (I3 ^dir R +)
  21626. Retracting rl*prefer*rvt*predict-no*H0*6
  21627. -->
  21628. (S1 ^operator O2098 = 0.9999967532001512)
  21629. Retracting rl*prefer*rvt*predict-yes*H0*5
  21630. -->
  21631. (S1 ^operator O2097 = 0.1215971732320855)
  21632. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  21633. -->
  21634. (S1 ^operator O2097 = -0.1512366769350551)
  21635. =>WM: (14751: S1 ^operator O2100 +)
  21636. =>WM: (14750: S1 ^operator O2099 +)
  21637. =>WM: (14749: I3 ^dir L)
  21638. =>WM: (14748: O2100 ^name predict-no)
  21639. =>WM: (14747: O2099 ^name predict-yes)
  21640. =>WM: (14746: R1053 ^value 1)
  21641. =>WM: (14745: R1 ^reward R1053)
  21642. <=WM: (14736: S1 ^operator O2097 +)
  21643. <=WM: (14737: S1 ^operator O2098 +)
  21644. <=WM: (14738: S1 ^operator O2098)
  21645. <=WM: (14722: I3 ^dir R)
  21646. <=WM: (14732: R1 ^reward R1052)
  21647. <=WM: (14735: O2098 ^name predict-no)
  21648. <=WM: (14734: O2097 ^name predict-yes)
  21649. <=WM: (14733: R1052 ^value 1)
  21650. --- Inner Elaboration Phase, active level 1 (S1) ---
  21651. Firing prefer*rvt*predict-yes*H0
  21652. -->
  21653. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  21654. -->
  21655. (S1 ^operator O2099 = 0.6091357162190356)
  21656. Firing rl*prefer*rvt*predict-yes*H0*3
  21657. -->
  21658. (S1 ^operator O2099 = 0.3907618357131554)
  21659. Firing prefer*rvt*predict-yes*H0*3*H1
  21660. -->
  21661. Firing prefer*rvt*predict-no*H0
  21662. -->
  21663. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  21664. -->
  21665. (S1 ^operator O2100 = -0.1984300550322165)
  21666. Firing rl*prefer*rvt*predict-no*H0*4
  21667. -->
  21668. (S1 ^operator O2100 = 0.314498303095341)
  21669. Firing prefer*rvt*predict-no*H0*4*H1
  21670. -->
  21671. inner elaboration loop at bottom goal.
  21672. Retracting rl*prefer*rvt*predict-no*H0*4
  21673. -->
  21674. (S1 ^operator O2098 = 0.314498303095341)
  21675. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  21676. -->
  21677. (S1 ^operator O2098 = -0.1984300550322165)
  21678. Retracting rl*prefer*rvt*predict-yes*H0*3
  21679. -->
  21680. (S1 ^operator O2097 = 0.3907618357131554)
  21681. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  21682. -->
  21683. (S1 ^operator O2097 = 0.6091357162190356)
  21684. --- END Proposal Phase ---
  21685. --- Decision Phase ---
  21686. RL update rl*prefer*rvt*predict-no*H0*6 0.999997 0 0.999997 -> 0.999997 0 0.999997(R,m,v=1,0.940541,0.056228)
  21687. =>WM: (14752: S1 ^operator O2099)
  21688. 1050: O: O2099 (predict-yes)
  21689. --- END Decision Phase ---
  21690. --- Application Phase ---
  21691. --- Firing Productions (PE) For State At Depth 1 ---
  21692. --- Inner Elaboration Phase, active level 1 (S1) ---
  21693. Firing apply*operator
  21694. -->
  21695. (I3 ^predict-yes N1050 + :O )
  21696. Firing apply*operator*complete
  21697. -->
  21698. (I3 ^predict-no N1049 - :O )
  21699. inner elaboration loop at bottom goal.
  21700. --- Change Working Memory (PE) ---
  21701. =>WM: (14753: I3 ^predict-yes N1050)
  21702. <=WM: (14740: N1049 ^status complete)
  21703. <=WM: (14739: I3 ^predict-no N1049)
  21704. --- Firing Productions (IE) For State At Depth 1 ---
  21705. --- Inner Elaboration Phase, active level 1 (S1) ---
  21706. Firing monitor*world
  21707. -->
  21708. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21709. --- Change Working Memory (IE) ---
  21710. --- END Application Phase ---
  21711. --- Output Phase ---
  21712. ENV: Agent did: predict-yes for direction L in state State-B
  21713. In State-B moving L
  21714. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  21715. predict error 0
  21716. dir: dir isU
  21717. --- END Output Phase ---
  21718. \-/--- Input Phase ---
  21719. =>WM: (14757: I2 ^dir U)
  21720. =>WM: (14756: I2 ^reward 1)
  21721. =>WM: (14755: I2 ^see 1)
  21722. =>WM: (14754: N1050 ^status complete)
  21723. <=WM: (14743: I2 ^dir L)
  21724. <=WM: (14742: I2 ^reward 1)
  21725. <=WM: (14741: I2 ^see 0)
  21726. =>WM: (14758: I2 ^level-1 L1-root)
  21727. <=WM: (14744: I2 ^level-1 R0-root)
  21728. --- END Input Phase ---
  21729. --- Proposal Phase ---
  21730. --- Inner Elaboration Phase, active level 1 (S1) ---
  21731. Firing elaborate*copy-see-to-output-link
  21732. -->
  21733. (I3 ^see 1 +)
  21734. Firing elaborate*reward*based*on*reward
  21735. -->
  21736. (R1054 ^value 1 +)
  21737. (R1 ^reward R1054 +)
  21738. Firing propose*predict-yes
  21739. -->
  21740. (O2101 ^name predict-yes +)
  21741. (S1 ^operator O2101 +)
  21742. Firing propose*predict-no
  21743. -->
  21744. (O2102 ^name predict-no +)
  21745. (S1 ^operator O2102 +)
  21746. Firing rl*prefer*rvt*predict-no*H0*2
  21747. -->
  21748. (S1 ^operator O2100 = 1.)
  21749. Firing rl*prefer*rvt*predict-yes*H0*1
  21750. -->
  21751. (S1 ^operator O2099 = 0.)
  21752. Firing prefer*rvt*predict-yes*H0
  21753. -->
  21754. Firing prefer*rvt*predict-no*H0
  21755. -->
  21756. Firing elaborate*copy-dir-to-output-link
  21757. -->
  21758. (I3 ^dir U +)
  21759. inner elaboration loop at bottom goal.
  21760. Retracting elaborate*copy-see-to-output-link
  21761. -->
  21762. (I3 ^see 0 +)
  21763. Retracting propose*predict-no
  21764. -->
  21765. (O2100 ^name predict-no +)
  21766. (S1 ^operator O2100 +)
  21767. Retracting propose*predict-yes
  21768. -->
  21769. (O2099 ^name predict-yes +)
  21770. (S1 ^operator O2099 +)
  21771. Retracting elaborate*reward*based*on*reward
  21772. -->
  21773. (R1053 ^value 1 +)
  21774. (R1 ^reward R1053 +)
  21775. Retracting elaborate*copy-dir-to-output-link
  21776. -->
  21777. (I3 ^dir L +)
  21778. Retracting rl*prefer*rvt*predict-no*H0*4
  21779. -->
  21780. (S1 ^operator O2100 = 0.314498303095341)
  21781. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  21782. -->
  21783. (S1 ^operator O2100 = -0.1984300550322165)
  21784. Retracting rl*prefer*rvt*predict-yes*H0*3
  21785. -->
  21786. (S1 ^operator O2099 = 0.3907618357131554)
  21787. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  21788. -->
  21789. (S1 ^operator O2099 = 0.6091357162190356)
  21790. =>WM: (14766: S1 ^operator O2102 +)
  21791. =>WM: (14765: S1 ^operator O2101 +)
  21792. =>WM: (14764: I3 ^dir U)
  21793. =>WM: (14763: O2102 ^name predict-no)
  21794. =>WM: (14762: O2101 ^name predict-yes)
  21795. =>WM: (14761: R1054 ^value 1)
  21796. =>WM: (14760: R1 ^reward R1054)
  21797. =>WM: (14759: I3 ^see 1)
  21798. <=WM: (14750: S1 ^operator O2099 +)
  21799. <=WM: (14752: S1 ^operator O2099)
  21800. <=WM: (14751: S1 ^operator O2100 +)
  21801. <=WM: (14749: I3 ^dir L)
  21802. <=WM: (14745: R1 ^reward R1053)
  21803. <=WM: (14662: I3 ^see 0)
  21804. <=WM: (14748: O2100 ^name predict-no)
  21805. <=WM: (14747: O2099 ^name predict-yes)
  21806. <=WM: (14746: R1053 ^value 1)
  21807. --- Inner Elaboration Phase, active level 1 (S1) ---
  21808. Firing prefer*rvt*predict-yes*H0
  21809. -->
  21810. Firing rl*prefer*rvt*predict-yes*H0*1
  21811. -->
  21812. (S1 ^operator O2101 = 0.)
  21813. Firing prefer*rvt*predict-no*H0
  21814. -->
  21815. Firing rl*prefer*rvt*predict-no*H0*2
  21816. -->
  21817. (S1 ^operator O2102 = 1.)
  21818. inner elaboration loop at bottom goal.
  21819. Retracting rl*prefer*rvt*predict-no*H0*2
  21820. -->
  21821. (S1 ^operator O2100 = 1.)
  21822. Retracting rl*prefer*rvt*predict-yes*H0*1
  21823. -->
  21824. (S1 ^operator O2099 = 0.)
  21825. --- END Proposal Phase ---
  21826. --- Decision Phase ---
  21827. RL update rl*prefer*rvt*predict-yes*H0*3 0.47231 -0.0815483 0.390762 -> 0.472317 -0.081547 0.39077(R,m,v=1,0.947059,0.0504351)
  21828. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527603 0.0815331 0.609136 -> 0.527611 0.0815345 0.609145(R,m,v=1,1,0)
  21829. =>WM: (14767: S1 ^operator O2102)
  21830. 1051: O: O2102 (predict-no)
  21831. --- END Decision Phase ---
  21832. --- Application Phase ---
  21833. --- Firing Productions (PE) For State At Depth 1 ---
  21834. --- Inner Elaboration Phase, active level 1 (S1) ---
  21835. Firing apply*operator
  21836. -->
  21837. (I3 ^predict-no N1051 + :O )
  21838. Firing apply*operator*complete
  21839. -->
  21840. (I3 ^predict-yes N1050 - :O )
  21841. inner elaboration loop at bottom goal.
  21842. --- Change Working Memory (PE) ---
  21843. =>WM: (14768: I3 ^predict-no N1051)
  21844. <=WM: (14754: N1050 ^status complete)
  21845. <=WM: (14753: I3 ^predict-yes N1050)
  21846. --- Firing Productions (IE) For State At Depth 1 ---
  21847. --- Inner Elaboration Phase, active level 1 (S1) ---
  21848. Firing monitor*world
  21849. -->
  21850. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  21851. --- Change Working Memory (IE) ---
  21852. --- END Application Phase ---
  21853. --- Output Phase ---
  21854. ENV: Agent did: predict-no for direction U in state State-A
  21855. In State-A moving U
  21856. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  21857. predict error 0
  21858. dir: dir isR
  21859. --- END Output Phase ---
  21860. |--- Input Phase ---
  21861. =>WM: (14772: I2 ^dir R)
  21862. =>WM: (14771: I2 ^reward 1)
  21863. =>WM: (14770: I2 ^see 0)
  21864. =>WM: (14769: N1051 ^status complete)
  21865. <=WM: (14757: I2 ^dir U)
  21866. <=WM: (14756: I2 ^reward 1)
  21867. <=WM: (14755: I2 ^see 1)
  21868. =>WM: (14773: I2 ^level-1 L1-root)
  21869. <=WM: (14758: I2 ^level-1 L1-root)
  21870. --- END Input Phase ---
  21871. --- Proposal Phase ---
  21872. --- Inner Elaboration Phase, active level 1 (S1) ---
  21873. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  21874. -->
  21875. (S1 ^operator O2101 = 0.8784092909945846)
  21876. Firing prefer*rvt*predict-yes*H0*5*H1
  21877. -->
  21878. Firing elaborate*copy-see-to-output-link
  21879. -->
  21880. (I3 ^see 0 +)
  21881. Firing elaborate*reward*based*on*reward
  21882. -->
  21883. (R1055 ^value 1 +)
  21884. (R1 ^reward R1055 +)
  21885. Firing propose*predict-yes
  21886. -->
  21887. (O2103 ^name predict-yes +)
  21888. (S1 ^operator O2103 +)
  21889. Firing propose*predict-no
  21890. -->
  21891. (O2104 ^name predict-no +)
  21892. (S1 ^operator O2104 +)
  21893. Firing rl*prefer*rvt*predict-no*H0*6
  21894. -->
  21895. (S1 ^operator O2102 = 0.9999972751638363)
  21896. Firing rl*prefer*rvt*predict-yes*H0*5
  21897. -->
  21898. (S1 ^operator O2101 = 0.1215971732320855)
  21899. Firing prefer*rvt*predict-yes*H0
  21900. -->
  21901. Firing prefer*rvt*predict-no*H0
  21902. -->
  21903. Firing elaborate*copy-dir-to-output-link
  21904. -->
  21905. (I3 ^dir R +)
  21906. inner elaboration loop at bottom goal.
  21907. Retracting elaborate*copy-see-to-output-link
  21908. -->
  21909. (I3 ^see 1 +)
  21910. Retracting propose*predict-no
  21911. -->
  21912. (O2102 ^name predict-no +)
  21913. (S1 ^operator O2102 +)
  21914. Retracting propose*predict-yes
  21915. -->
  21916. (O2101 ^name predict-yes +)
  21917. (S1 ^operator O2101 +)
  21918. Retracting elaborate*reward*based*on*reward
  21919. -->
  21920. (R1054 ^value 1 +)
  21921. (R1 ^reward R1054 +)
  21922. Retracting elaborate*copy-dir-to-output-link
  21923. -->
  21924. (I3 ^dir U +)
  21925. Retracting rl*prefer*rvt*predict-no*H0*2
  21926. -->
  21927. (S1 ^operator O2102 = 1.)
  21928. Retracting rl*prefer*rvt*predict-yes*H0*1
  21929. -->
  21930. (S1 ^operator O2101 = 0.)
  21931. =>WM: (14781: S1 ^operator O2104 +)
  21932. =>WM: (14780: S1 ^operator O2103 +)
  21933. =>WM: (14779: I3 ^dir R)
  21934. =>WM: (14778: O2104 ^name predict-no)
  21935. =>WM: (14777: O2103 ^name predict-yes)
  21936. =>WM: (14776: R1055 ^value 1)
  21937. =>WM: (14775: R1 ^reward R1055)
  21938. =>WM: (14774: I3 ^see 0)
  21939. <=WM: (14765: S1 ^operator O2101 +)
  21940. <=WM: (14766: S1 ^operator O2102 +)
  21941. <=WM: (14767: S1 ^operator O2102)
  21942. <=WM: (14764: I3 ^dir U)
  21943. <=WM: (14760: R1 ^reward R1054)
  21944. <=WM: (14759: I3 ^see 1)
  21945. <=WM: (14763: O2102 ^name predict-no)
  21946. <=WM: (14762: O2101 ^name predict-yes)
  21947. <=WM: (14761: R1054 ^value 1)
  21948. --- Inner Elaboration Phase, active level 1 (S1) ---
  21949. Firing prefer*rvt*predict-yes*H0
  21950. -->
  21951. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  21952. -->
  21953. (S1 ^operator O2103 = 0.8784092909945846)
  21954. Firing rl*prefer*rvt*predict-yes*H0*5
  21955. -->
  21956. (S1 ^operator O2103 = 0.1215971732320855)
  21957. Firing prefer*rvt*predict-yes*H0*5*H1
  21958. -->
  21959. Firing prefer*rvt*predict-no*H0
  21960. -->
  21961. Firing rl*prefer*rvt*predict-no*H0*6
  21962. -->
  21963. (S1 ^operator O2104 = 0.9999972751638363)
  21964. inner elaboration loop at bottom goal.
  21965. Retracting rl*prefer*rvt*predict-no*H0*6
  21966. -->
  21967. (S1 ^operator O2102 = 0.9999972751638363)
  21968. Retracting rl*prefer*rvt*predict-yes*H0*5
  21969. -->
  21970. (S1 ^operator O2101 = 0.1215971732320855)
  21971. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  21972. -->
  21973. (S1 ^operator O2101 = 0.8784092909945846)
  21974. --- END Proposal Phase ---
  21975. --- Decision Phase ---
  21976. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  21977. =>WM: (14782: S1 ^operator O2103)
  21978. 1052: O: O2103 (predict-yes)
  21979. --- END Decision Phase ---
  21980. --- Application Phase ---
  21981. --- Firing Productions (PE) For State At Depth 1 ---
  21982. --- Inner Elaboration Phase, active level 1 (S1) ---
  21983. Firing apply*operator
  21984. -->
  21985. (I3 ^predict-yes N1052 + :O )
  21986. Firing apply*operator*complete
  21987. -->
  21988. (I3 ^predict-no N1051 - :O )
  21989. inner elaboration loop at bottom goal.
  21990. --- Change Working Memory (PE) ---
  21991. =>WM: (14783: I3 ^predict-yes N1052)
  21992. <=WM: (14769: N1051 ^status complete)
  21993. <=WM: (14768: I3 ^predict-no N1051)
  21994. --- Firing Productions (IE) For State At Depth 1 ---
  21995. --- Inner Elaboration Phase, active level 1 (S1) ---
  21996. Firing monitor*world
  21997. -->
  21998. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  21999. --- Change Working Memory (IE) ---
  22000. --- END Application Phase ---
  22001. --- Output Phase ---
  22002. ENV: Agent did: predict-yes for direction R in state State-A
  22003. In State-A moving R
  22004. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22005. predict error 0
  22006. dir: dir isL
  22007. --- END Output Phase ---
  22008. \-/--- Input Phase ---
  22009. =>WM: (14787: I2 ^dir L)
  22010. =>WM: (14786: I2 ^reward 1)
  22011. =>WM: (14785: I2 ^see 1)
  22012. =>WM: (14784: N1052 ^status complete)
  22013. <=WM: (14772: I2 ^dir R)
  22014. <=WM: (14771: I2 ^reward 1)
  22015. <=WM: (14770: I2 ^see 0)
  22016. =>WM: (14788: I2 ^level-1 R1-root)
  22017. <=WM: (14773: I2 ^level-1 L1-root)
  22018. --- END Input Phase ---
  22019. --- Proposal Phase ---
  22020. --- Inner Elaboration Phase, active level 1 (S1) ---
  22021. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  22022. -->
  22023. (S1 ^operator O2104 = -0.168718511744511)
  22024. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  22025. -->
  22026. (S1 ^operator O2103 = 0.6092694841640142)
  22027. Firing prefer*rvt*predict-no*H0*4*H1
  22028. -->
  22029. Firing prefer*rvt*predict-yes*H0*3*H1
  22030. -->
  22031. Firing elaborate*copy-see-to-output-link
  22032. -->
  22033. (I3 ^see 1 +)
  22034. Firing elaborate*reward*based*on*reward
  22035. -->
  22036. (R1056 ^value 1 +)
  22037. (R1 ^reward R1056 +)
  22038. Firing propose*predict-yes
  22039. -->
  22040. (O2105 ^name predict-yes +)
  22041. (S1 ^operator O2105 +)
  22042. Firing propose*predict-no
  22043. -->
  22044. (O2106 ^name predict-no +)
  22045. (S1 ^operator O2106 +)
  22046. Firing rl*prefer*rvt*predict-no*H0*4
  22047. -->
  22048. (S1 ^operator O2104 = 0.314498303095341)
  22049. Firing rl*prefer*rvt*predict-yes*H0*3
  22050. -->
  22051. (S1 ^operator O2103 = 0.3907701841024368)
  22052. Firing prefer*rvt*predict-yes*H0
  22053. -->
  22054. Firing prefer*rvt*predict-no*H0
  22055. -->
  22056. Firing elaborate*copy-dir-to-output-link
  22057. -->
  22058. (I3 ^dir L +)
  22059. inner elaboration loop at bottom goal.
  22060. Retracting elaborate*copy-see-to-output-link
  22061. -->
  22062. (I3 ^see 0 +)
  22063. Retracting propose*predict-no
  22064. -->
  22065. (O2104 ^name predict-no +)
  22066. (S1 ^operator O2104 +)
  22067. Retracting propose*predict-yes
  22068. -->
  22069. (O2103 ^name predict-yes +)
  22070. (S1 ^operator O2103 +)
  22071. Retracting elaborate*reward*based*on*reward
  22072. -->
  22073. (R1055 ^value 1 +)
  22074. (R1 ^reward R1055 +)
  22075. Retracting elaborate*copy-dir-to-output-link
  22076. -->
  22077. (I3 ^dir R +)
  22078. Retracting rl*prefer*rvt*predict-no*H0*6
  22079. -->
  22080. (S1 ^operator O2104 = 0.9999972751638363)
  22081. Retracting rl*prefer*rvt*predict-yes*H0*5
  22082. -->
  22083. (S1 ^operator O2103 = 0.1215971732320855)
  22084. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  22085. -->
  22086. (S1 ^operator O2103 = 0.8784092909945846)
  22087. =>WM: (14796: S1 ^operator O2106 +)
  22088. =>WM: (14795: S1 ^operator O2105 +)
  22089. =>WM: (14794: I3 ^dir L)
  22090. =>WM: (14793: O2106 ^name predict-no)
  22091. =>WM: (14792: O2105 ^name predict-yes)
  22092. =>WM: (14791: R1056 ^value 1)
  22093. =>WM: (14790: R1 ^reward R1056)
  22094. =>WM: (14789: I3 ^see 1)
  22095. <=WM: (14780: S1 ^operator O2103 +)
  22096. <=WM: (14782: S1 ^operator O2103)
  22097. <=WM: (14781: S1 ^operator O2104 +)
  22098. <=WM: (14779: I3 ^dir R)
  22099. <=WM: (14775: R1 ^reward R1055)
  22100. <=WM: (14774: I3 ^see 0)
  22101. <=WM: (14778: O2104 ^name predict-no)
  22102. <=WM: (14777: O2103 ^name predict-yes)
  22103. <=WM: (14776: R1055 ^value 1)
  22104. --- Inner Elaboration Phase, active level 1 (S1) ---
  22105. Firing prefer*rvt*predict-yes*H0
  22106. -->
  22107. Firing rl*prefer*rvt*predict-yes*H0*3
  22108. -->
  22109. (S1 ^operator O2105 = 0.3907701841024368)
  22110. Firing prefer*rvt*predict-yes*H0*3*H1
  22111. -->
  22112. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  22113. -->
  22114. (S1 ^operator O2105 = 0.6092694841640142)
  22115. Firing prefer*rvt*predict-no*H0
  22116. -->
  22117. Firing rl*prefer*rvt*predict-no*H0*4
  22118. -->
  22119. (S1 ^operator O2106 = 0.314498303095341)
  22120. Firing prefer*rvt*predict-no*H0*4*H1
  22121. -->
  22122. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  22123. -->
  22124. (S1 ^operator O2106 = -0.168718511744511)
  22125. inner elaboration loop at bottom goal.
  22126. Retracting rl*prefer*rvt*predict-no*H0*4
  22127. -->
  22128. (S1 ^operator O2104 = 0.314498303095341)
  22129. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  22130. -->
  22131. (S1 ^operator O2104 = -0.168718511744511)
  22132. Retracting rl*prefer*rvt*predict-yes*H0*3
  22133. -->
  22134. (S1 ^operator O2103 = 0.3907701841024368)
  22135. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  22136. -->
  22137. (S1 ^operator O2103 = 0.6092694841640142)
  22138. --- END Proposal Phase ---
  22139. --- Decision Phase ---
  22140. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.871658,0.112472)
  22141. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465482 0.412927 0.878409 -> 0.465481 0.412927 0.878409(R,m,v=1,1,0)
  22142. =>WM: (14797: S1 ^operator O2105)
  22143. 1053: O: O2105 (predict-yes)
  22144. --- END Decision Phase ---
  22145. --- Application Phase ---
  22146. --- Firing Productions (PE) For State At Depth 1 ---
  22147. --- Inner Elaboration Phase, active level 1 (S1) ---
  22148. Firing apply*operator
  22149. -->
  22150. (I3 ^predict-yes N1053 + :O )
  22151. Firing apply*operator*complete
  22152. -->
  22153. (I3 ^predict-yes N1052 - :O )
  22154. inner elaboration loop at bottom goal.
  22155. --- Change Working Memory (PE) ---
  22156. =>WM: (14798: I3 ^predict-yes N1053)
  22157. <=WM: (14784: N1052 ^status complete)
  22158. <=WM: (14783: I3 ^predict-yes N1052)
  22159. --- Firing Productions (IE) For State At Depth 1 ---
  22160. --- Inner Elaboration Phase, active level 1 (S1) ---
  22161. Firing monitor*world
  22162. -->
  22163. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22164. --- Change Working Memory (IE) ---
  22165. --- END Application Phase ---
  22166. --- Output Phase ---
  22167. ENV: Agent did: predict-yes for direction L in state State-B
  22168. In State-B moving L
  22169. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  22170. predict error 0
  22171. dir: dir isU
  22172. --- END Output Phase ---
  22173. |\---- Input Phase ---
  22174. =>WM: (14802: I2 ^dir U)
  22175. =>WM: (14801: I2 ^reward 1)
  22176. =>WM: (14800: I2 ^see 1)
  22177. =>WM: (14799: N1053 ^status complete)
  22178. <=WM: (14787: I2 ^dir L)
  22179. <=WM: (14786: I2 ^reward 1)
  22180. <=WM: (14785: I2 ^see 1)
  22181. =>WM: (14803: I2 ^level-1 L1-root)
  22182. <=WM: (14788: I2 ^level-1 R1-root)
  22183. --- END Input Phase ---
  22184. --- Proposal Phase ---
  22185. --- Inner Elaboration Phase, active level 1 (S1) ---
  22186. Firing elaborate*copy-see-to-output-link
  22187. -->
  22188. (I3 ^see 1 +)
  22189. Firing elaborate*reward*based*on*reward
  22190. -->
  22191. (R1057 ^value 1 +)
  22192. (R1 ^reward R1057 +)
  22193. Firing propose*predict-yes
  22194. -->
  22195. (O2107 ^name predict-yes +)
  22196. (S1 ^operator O2107 +)
  22197. Firing propose*predict-no
  22198. -->
  22199. (O2108 ^name predict-no +)
  22200. (S1 ^operator O2108 +)
  22201. Firing rl*prefer*rvt*predict-no*H0*2
  22202. -->
  22203. (S1 ^operator O2106 = 1.)
  22204. Firing rl*prefer*rvt*predict-yes*H0*1
  22205. -->
  22206. (S1 ^operator O2105 = 0.)
  22207. Firing prefer*rvt*predict-yes*H0
  22208. -->
  22209. Firing prefer*rvt*predict-no*H0
  22210. -->
  22211. Firing elaborate*copy-dir-to-output-link
  22212. -->
  22213. (I3 ^dir U +)
  22214. inner elaboration loop at bottom goal.
  22215. Retracting elaborate*copy-see-to-output-link
  22216. -->
  22217. (I3 ^see 1 +)
  22218. Retracting propose*predict-no
  22219. -->
  22220. (O2106 ^name predict-no +)
  22221. (S1 ^operator O2106 +)
  22222. Retracting propose*predict-yes
  22223. -->
  22224. (O2105 ^name predict-yes +)
  22225. (S1 ^operator O2105 +)
  22226. Retracting elaborate*reward*based*on*reward
  22227. -->
  22228. (R1056 ^value 1 +)
  22229. (R1 ^reward R1056 +)
  22230. Retracting elaborate*copy-dir-to-output-link
  22231. -->
  22232. (I3 ^dir L +)
  22233. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  22234. -->
  22235. (S1 ^operator O2106 = -0.168718511744511)
  22236. Retracting rl*prefer*rvt*predict-no*H0*4
  22237. -->
  22238. (S1 ^operator O2106 = 0.314498303095341)
  22239. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  22240. -->
  22241. (S1 ^operator O2105 = 0.6092694841640142)
  22242. Retracting rl*prefer*rvt*predict-yes*H0*3
  22243. -->
  22244. (S1 ^operator O2105 = 0.3907701841024368)
  22245. =>WM: (14810: S1 ^operator O2108 +)
  22246. =>WM: (14809: S1 ^operator O2107 +)
  22247. =>WM: (14808: I3 ^dir U)
  22248. =>WM: (14807: O2108 ^name predict-no)
  22249. =>WM: (14806: O2107 ^name predict-yes)
  22250. =>WM: (14805: R1057 ^value 1)
  22251. =>WM: (14804: R1 ^reward R1057)
  22252. <=WM: (14795: S1 ^operator O2105 +)
  22253. <=WM: (14797: S1 ^operator O2105)
  22254. <=WM: (14796: S1 ^operator O2106 +)
  22255. <=WM: (14794: I3 ^dir L)
  22256. <=WM: (14790: R1 ^reward R1056)
  22257. <=WM: (14793: O2106 ^name predict-no)
  22258. <=WM: (14792: O2105 ^name predict-yes)
  22259. <=WM: (14791: R1056 ^value 1)
  22260. --- Inner Elaboration Phase, active level 1 (S1) ---
  22261. Firing prefer*rvt*predict-yes*H0
  22262. -->
  22263. Firing rl*prefer*rvt*predict-yes*H0*1
  22264. -->
  22265. (S1 ^operator O2107 = 0.)
  22266. Firing prefer*rvt*predict-no*H0
  22267. -->
  22268. Firing rl*prefer*rvt*predict-no*H0*2
  22269. -->
  22270. (S1 ^operator O2108 = 1.)
  22271. inner elaboration loop at bottom goal.
  22272. Retracting rl*prefer*rvt*predict-no*H0*2
  22273. -->
  22274. (S1 ^operator O2106 = 1.)
  22275. Retracting rl*prefer*rvt*predict-yes*H0*1
  22276. -->
  22277. (S1 ^operator O2105 = 0.)
  22278. --- END Proposal Phase ---
  22279. --- Decision Phase ---
  22280. RL update rl*prefer*rvt*predict-yes*H0*3 0.472317 -0.081547 0.39077 -> 0.472314 -0.0815475 0.390767(R,m,v=1,0.947368,0.0501548)
  22281. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527717 0.0815529 0.609269 -> 0.527713 0.0815524 0.609266(R,m,v=1,1,0)
  22282. =>WM: (14811: S1 ^operator O2108)
  22283. 1054: O: O2108 (predict-no)
  22284. --- END Decision Phase ---
  22285. --- Application Phase ---
  22286. --- Firing Productions (PE) For State At Depth 1 ---
  22287. --- Inner Elaboration Phase, active level 1 (S1) ---
  22288. Firing apply*operator
  22289. -->
  22290. (I3 ^predict-no N1054 + :O )
  22291. Firing apply*operator*complete
  22292. -->
  22293. (I3 ^predict-yes N1053 - :O )
  22294. inner elaboration loop at bottom goal.
  22295. --- Change Working Memory (PE) ---
  22296. =>WM: (14812: I3 ^predict-no N1054)
  22297. <=WM: (14799: N1053 ^status complete)
  22298. <=WM: (14798: I3 ^predict-yes N1053)
  22299. --- Firing Productions (IE) For State At Depth 1 ---
  22300. --- Inner Elaboration Phase, active level 1 (S1) ---
  22301. Firing monitor*world
  22302. -->
  22303. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22304. --- Change Working Memory (IE) ---
  22305. --- END Application Phase ---
  22306. --- Output Phase ---
  22307. ENV: Agent did: predict-no for direction U in state State-A
  22308. In State-A moving U
  22309. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  22310. predict error 0
  22311. dir: dir isR
  22312. --- END Output Phase ---
  22313. --- Input Phase ---
  22314. =>WM: (14816: I2 ^dir R)
  22315. =>WM: (14815: I2 ^reward 1)
  22316. =>WM: (14814: I2 ^see 0)
  22317. =>WM: (14813: N1054 ^status complete)
  22318. <=WM: (14802: I2 ^dir U)
  22319. <=WM: (14801: I2 ^reward 1)
  22320. <=WM: (14800: I2 ^see 1)
  22321. =>WM: (14817: I2 ^level-1 L1-root)
  22322. <=WM: (14803: I2 ^level-1 L1-root)
  22323. --- END Input Phase ---
  22324. --- Proposal Phase ---
  22325. --- Inner Elaboration Phase, active level 1 (S1) ---
  22326. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  22327. -->
  22328. (S1 ^operator O2107 = 0.8784086918391858)
  22329. Firing prefer*rvt*predict-yes*H0*5*H1
  22330. -->
  22331. Firing elaborate*copy-see-to-output-link
  22332. -->
  22333. (I3 ^see 0 +)
  22334. Firing elaborate*reward*based*on*reward
  22335. -->
  22336. (R1058 ^value 1 +)
  22337. (R1 ^reward R1058 +)
  22338. Firing propose*predict-yes
  22339. -->
  22340. (O2109 ^name predict-yes +)
  22341. (S1 ^operator O2109 +)
  22342. Firing propose*predict-no
  22343. -->
  22344. (O2110 ^name predict-no +)
  22345. (S1 ^operator O2110 +)
  22346. Firing rl*prefer*rvt*predict-no*H0*6
  22347. -->
  22348. (S1 ^operator O2108 = 0.9999972751638363)
  22349. Firing rl*prefer*rvt*predict-yes*H0*5
  22350. -->
  22351. (S1 ^operator O2107 = 0.1215966545261001)
  22352. Firing prefer*rvt*predict-yes*H0
  22353. -->
  22354. Firing prefer*rvt*predict-no*H0
  22355. -->
  22356. Firing elaborate*copy-dir-to-output-link
  22357. -->
  22358. (I3 ^dir R +)
  22359. inner elaboration loop at bottom goal.
  22360. Retracting elaborate*copy-see-to-output-link
  22361. -->
  22362. (I3 ^see 1 +)
  22363. Retracting propose*predict-no
  22364. -->
  22365. (O2108 ^name predict-no +)
  22366. (S1 ^operator O2108 +)
  22367. Retracting propose*predict-yes
  22368. -->
  22369. (O2107 ^name predict-yes +)
  22370. (S1 ^operator O2107 +)
  22371. Retracting elaborate*reward*based*on*reward
  22372. -->
  22373. (R1057 ^value 1 +)
  22374. (R1 ^reward R1057 +)
  22375. Retracting elaborate*copy-dir-to-output-link
  22376. -->
  22377. (I3 ^dir U +)
  22378. Retracting rl*prefer*rvt*predict-no*H0*2
  22379. -->
  22380. (S1 ^operator O2108 = 1.)
  22381. Retracting rl*prefer*rvt*predict-yes*H0*1
  22382. -->
  22383. (S1 ^operator O2107 = 0.)
  22384. =>WM: (14825: S1 ^operator O2110 +)
  22385. =>WM: (14824: S1 ^operator O2109 +)
  22386. =>WM: (14823: I3 ^dir R)
  22387. =>WM: (14822: O2110 ^name predict-no)
  22388. =>WM: (14821: O2109 ^name predict-yes)
  22389. =>WM: (14820: R1058 ^value 1)
  22390. =>WM: (14819: R1 ^reward R1058)
  22391. =>WM: (14818: I3 ^see 0)
  22392. <=WM: (14809: S1 ^operator O2107 +)
  22393. <=WM: (14810: S1 ^operator O2108 +)
  22394. <=WM: (14811: S1 ^operator O2108)
  22395. <=WM: (14808: I3 ^dir U)
  22396. <=WM: (14804: R1 ^reward R1057)
  22397. <=WM: (14789: I3 ^see 1)
  22398. <=WM: (14807: O2108 ^name predict-no)
  22399. <=WM: (14806: O2107 ^name predict-yes)
  22400. <=WM: (14805: R1057 ^value 1)
  22401. --- Inner Elaboration Phase, active level 1 (S1) ---
  22402. Firing prefer*rvt*predict-yes*H0
  22403. -->
  22404. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  22405. -->
  22406. (S1 ^operator O2109 = 0.8784086918391858)
  22407. Firing rl*prefer*rvt*predict-yes*H0*5
  22408. -->
  22409. (S1 ^operator O2109 = 0.1215966545261001)
  22410. Firing prefer*rvt*predict-yes*H0*5*H1
  22411. -->
  22412. Firing prefer*rvt*predict-no*H0
  22413. -->
  22414. Firing rl*prefer*rvt*predict-no*H0*6
  22415. -->
  22416. (S1 ^operator O2110 = 0.9999972751638363)
  22417. inner elaboration loop at bottom goal.
  22418. Retracting rl*prefer*rvt*predict-no*H0*6
  22419. -->
  22420. (S1 ^operator O2108 = 0.9999972751638363)
  22421. Retracting rl*prefer*rvt*predict-yes*H0*5
  22422. -->
  22423. (S1 ^operator O2107 = 0.1215966545261001)
  22424. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  22425. -->
  22426. (S1 ^operator O2107 = 0.8784086918391858)
  22427. --- END Proposal Phase ---
  22428. --- Decision Phase ---
  22429. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22430. =>WM: (14826: S1 ^operator O2109)
  22431. 1055: O: O2109 (predict-yes)
  22432. --- END Decision Phase ---
  22433. --- Application Phase ---
  22434. --- Firing Productions (PE) For State At Depth 1 ---
  22435. --- Inner Elaboration Phase, active level 1 (S1) ---
  22436. Firing apply*operator
  22437. -->
  22438. (I3 ^predict-yes N1055 + :O )
  22439. Firing apply*operator*complete
  22440. -->
  22441. (I3 ^predict-no N1054 - :O )
  22442. inner elaboration loop at bottom goal.
  22443. --- Change Working Memory (PE) ---
  22444. =>WM: (14827: I3 ^predict-yes N1055)
  22445. <=WM: (14813: N1054 ^status complete)
  22446. <=WM: (14812: I3 ^predict-no N1054)
  22447. --- Firing Productions (IE) For State At Depth 1 ---
  22448. --- Inner Elaboration Phase, active level 1 (S1) ---
  22449. Firing monitor*world
  22450. -->
  22451. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22452. --- Change Working Memory (IE) ---
  22453. --- END Application Phase ---
  22454. --- Output Phase ---
  22455. ENV: Agent did: predict-yes for direction R in state State-A
  22456. In State-A moving R
  22457. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  22458. predict error 0
  22459. dir: dir isR
  22460. --- END Output Phase ---
  22461. --- Input Phase ---
  22462. =>WM: (14831: I2 ^dir R)
  22463. =>WM: (14830: I2 ^reward 1)
  22464. =>WM: (14829: I2 ^see 1)
  22465. =>WM: (14828: N1055 ^status complete)
  22466. <=WM: (14816: I2 ^dir R)
  22467. <=WM: (14815: I2 ^reward 1)
  22468. <=WM: (14814: I2 ^see 0)
  22469. =>WM: (14832: I2 ^level-1 R1-root)
  22470. <=WM: (14817: I2 ^level-1 L1-root)
  22471. --- END Input Phase ---
  22472. --- Proposal Phase ---
  22473. --- Inner Elaboration Phase, active level 1 (S1) ---
  22474. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  22475. -->
  22476. (S1 ^operator O2109 = -0.04253361215288998)
  22477. Firing prefer*rvt*predict-yes*H0*5*H1
  22478. -->
  22479. Firing elaborate*copy-see-to-output-link
  22480. -->
  22481. (I3 ^see 1 +)
  22482. Firing elaborate*reward*based*on*reward
  22483. -->
  22484. (R1059 ^value 1 +)
  22485. (R1 ^reward R1059 +)
  22486. Firing propose*predict-yes
  22487. -->
  22488. (O2111 ^name predict-yes +)
  22489. (S1 ^operator O2111 +)
  22490. Firing propose*predict-no
  22491. -->
  22492. (O2112 ^name predict-no +)
  22493. (S1 ^operator O2112 +)
  22494. Firing rl*prefer*rvt*predict-no*H0*6
  22495. -->
  22496. (S1 ^operator O2110 = 0.9999972751638363)
  22497. Firing rl*prefer*rvt*predict-yes*H0*5
  22498. -->
  22499. (S1 ^operator O2109 = 0.1215966545261001)
  22500. Firing prefer*rvt*predict-yes*H0
  22501. -->
  22502. Firing prefer*rvt*predict-no*H0
  22503. -->
  22504. Firing elaborate*copy-dir-to-output-link
  22505. -->
  22506. (I3 ^dir R +)
  22507. inner elaboration loop at bottom goal.
  22508. Retracting elaborate*copy-see-to-output-link
  22509. -->
  22510. (I3 ^see 0 +)
  22511. Retracting propose*predict-no
  22512. -->
  22513. (O2110 ^name predict-no +)
  22514. (S1 ^operator O2110 +)
  22515. Retracting propose*predict-yes
  22516. -->
  22517. (O2109 ^name predict-yes +)
  22518. (S1 ^operator O2109 +)
  22519. Retracting elaborate*reward*based*on*reward
  22520. -->
  22521. (R1058 ^value 1 +)
  22522. (R1 ^reward R1058 +)
  22523. Retracting elaborate*copy-dir-to-output-link
  22524. -->
  22525. (I3 ^dir R +)
  22526. Retracting rl*prefer*rvt*predict-no*H0*6
  22527. -->
  22528. (S1 ^operator O2110 = 0.9999972751638363)
  22529. Retracting rl*prefer*rvt*predict-yes*H0*5
  22530. -->
  22531. (S1 ^operator O2109 = 0.1215966545261001)
  22532. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  22533. -->
  22534. (S1 ^operator O2109 = 0.8784086918391858)
  22535. =>WM: (14839: S1 ^operator O2112 +)
  22536. =>WM: (14838: S1 ^operator O2111 +)
  22537. =>WM: (14837: O2112 ^name predict-no)
  22538. =>WM: (14836: O2111 ^name predict-yes)
  22539. =>WM: (14835: R1059 ^value 1)
  22540. =>WM: (14834: R1 ^reward R1059)
  22541. =>WM: (14833: I3 ^see 1)
  22542. <=WM: (14824: S1 ^operator O2109 +)
  22543. <=WM: (14826: S1 ^operator O2109)
  22544. <=WM: (14825: S1 ^operator O2110 +)
  22545. <=WM: (14819: R1 ^reward R1058)
  22546. <=WM: (14818: I3 ^see 0)
  22547. <=WM: (14822: O2110 ^name predict-no)
  22548. <=WM: (14821: O2109 ^name predict-yes)
  22549. <=WM: (14820: R1058 ^value 1)
  22550. --- Inner Elaboration Phase, active level 1 (S1) ---
  22551. Firing prefer*rvt*predict-yes*H0
  22552. -->
  22553. Firing rl*prefer*rvt*predict-yes*H0*5
  22554. -->
  22555. (S1 ^operator O2111 = 0.1215966545261001)
  22556. Firing prefer*rvt*predict-yes*H0*5*H1
  22557. -->
  22558. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  22559. -->
  22560. (S1 ^operator O2111 = -0.04253361215288998)
  22561. Firing prefer*rvt*predict-no*H0
  22562. -->
  22563. Firing rl*prefer*rvt*predict-no*H0*6
  22564. -->
  22565. (S1 ^operator O2112 = 0.9999972751638363)
  22566. inner elaboration loop at bottom goal.
  22567. Retracting rl*prefer*rvt*predict-no*H0*6
  22568. -->
  22569. (S1 ^operator O2110 = 0.9999972751638363)
  22570. Retracting rl*prefer*rvt*predict-yes*H0*5
  22571. -->
  22572. (S1 ^operator O2109 = 0.1215966545261001)
  22573. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  22574. -->
  22575. (S1 ^operator O2109 = -0.04253361215288998)
  22576. --- END Proposal Phase ---
  22577. --- Decision Phase ---
  22578. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.87234,0.111958)
  22579. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878409 -> 0.465481 0.412927 0.878408(R,m,v=1,1,0)
  22580. =>WM: (14840: S1 ^operator O2112)
  22581. 1056: O: O2112 (predict-no)
  22582. --- END Decision Phase ---
  22583. --- Application Phase ---
  22584. --- Firing Productions (PE) For State At Depth 1 ---
  22585. --- Inner Elaboration Phase, active level 1 (S1) ---
  22586. Firing apply*operator
  22587. -->
  22588. (I3 ^predict-no N1056 + :O )
  22589. Firing apply*operator*complete
  22590. -->
  22591. (I3 ^predict-yes N1055 - :O )
  22592. inner elaboration loop at bottom goal.
  22593. --- Change Working Memory (PE) ---
  22594. =>WM: (14841: I3 ^predict-no N1056)
  22595. <=WM: (14828: N1055 ^status complete)
  22596. <=WM: (14827: I3 ^predict-yes N1055)
  22597. --- Firing Productions (IE) For State At Depth 1 ---
  22598. --- Inner Elaboration Phase, active level 1 (S1) ---
  22599. Firing monitor*world
  22600. -->
  22601. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22602. --- Change Working Memory (IE) ---
  22603. --- END Application Phase ---
  22604. --- Output Phase ---
  22605. ENV: Agent did: predict-no for direction R in state State-B
  22606. In State-B moving R
  22607. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22608. predict error 0
  22609. dir: dir isU
  22610. --- END Output Phase ---
  22611. --- Input Phase ---
  22612. =>WM: (14845: I2 ^dir U)
  22613. =>WM: (14844: I2 ^reward 1)
  22614. =>WM: (14843: I2 ^see 0)
  22615. =>WM: (14842: N1056 ^status complete)
  22616. <=WM: (14831: I2 ^dir R)
  22617. <=WM: (14830: I2 ^reward 1)
  22618. <=WM: (14829: I2 ^see 1)
  22619. =>WM: (14846: I2 ^level-1 R0-root)
  22620. <=WM: (14832: I2 ^level-1 R1-root)
  22621. --- END Input Phase ---
  22622. --- Proposal Phase ---
  22623. --- Inner Elaboration Phase, active level 1 (S1) ---
  22624. Firing elaborate*copy-see-to-output-link
  22625. -->
  22626. (I3 ^see 0 +)
  22627. Firing elaborate*reward*based*on*reward
  22628. -->
  22629. (R1060 ^value 1 +)
  22630. (R1 ^reward R1060 +)
  22631. Firing propose*predict-yes
  22632. -->
  22633. (O2113 ^name predict-yes +)
  22634. (S1 ^operator O2113 +)
  22635. Firing propose*predict-no
  22636. -->
  22637. (O2114 ^name predict-no +)
  22638. (S1 ^operator O2114 +)
  22639. Firing rl*prefer*rvt*predict-no*H0*2
  22640. -->
  22641. (S1 ^operator O2112 = 1.)
  22642. Firing rl*prefer*rvt*predict-yes*H0*1
  22643. -->
  22644. (S1 ^operator O2111 = 0.)
  22645. Firing prefer*rvt*predict-yes*H0
  22646. -->
  22647. Firing prefer*rvt*predict-no*H0
  22648. -->
  22649. Firing elaborate*copy-dir-to-output-link
  22650. -->
  22651. (I3 ^dir U +)
  22652. inner elaboration loop at bottom goal.
  22653. Retracting elaborate*copy-see-to-output-link
  22654. -->
  22655. (I3 ^see 1 +)
  22656. Retracting propose*predict-no
  22657. -->
  22658. (O2112 ^name predict-no +)
  22659. (S1 ^operator O2112 +)
  22660. Retracting propose*predict-yes
  22661. -->
  22662. (O2111 ^name predict-yes +)
  22663. (S1 ^operator O2111 +)
  22664. Retracting elaborate*reward*based*on*reward
  22665. -->
  22666. (R1059 ^value 1 +)
  22667. (R1 ^reward R1059 +)
  22668. Retracting elaborate*copy-dir-to-output-link
  22669. -->
  22670. (I3 ^dir R +)
  22671. Retracting rl*prefer*rvt*predict-no*H0*6
  22672. -->
  22673. (S1 ^operator O2112 = 0.9999972751638363)
  22674. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  22675. -->
  22676. (S1 ^operator O2111 = -0.04253361215288998)
  22677. Retracting rl*prefer*rvt*predict-yes*H0*5
  22678. -->
  22679. (S1 ^operator O2111 = 0.1215962258870366)
  22680. =>WM: (14854: S1 ^operator O2114 +)
  22681. =>WM: (14853: S1 ^operator O2113 +)
  22682. =>WM: (14852: I3 ^dir U)
  22683. =>WM: (14851: O2114 ^name predict-no)
  22684. =>WM: (14850: O2113 ^name predict-yes)
  22685. =>WM: (14849: R1060 ^value 1)
  22686. =>WM: (14848: R1 ^reward R1060)
  22687. =>WM: (14847: I3 ^see 0)
  22688. <=WM: (14838: S1 ^operator O2111 +)
  22689. <=WM: (14839: S1 ^operator O2112 +)
  22690. <=WM: (14840: S1 ^operator O2112)
  22691. <=WM: (14823: I3 ^dir R)
  22692. <=WM: (14834: R1 ^reward R1059)
  22693. <=WM: (14833: I3 ^see 1)
  22694. <=WM: (14837: O2112 ^name predict-no)
  22695. <=WM: (14836: O2111 ^name predict-yes)
  22696. <=WM: (14835: R1059 ^value 1)
  22697. --- Inner Elaboration Phase, active level 1 (S1) ---
  22698. Firing prefer*rvt*predict-yes*H0
  22699. -->
  22700. Firing rl*prefer*rvt*predict-yes*H0*1
  22701. -->
  22702. (S1 ^operator O2113 = 0.)
  22703. Firing prefer*rvt*predict-no*H0
  22704. -->
  22705. Firing rl*prefer*rvt*predict-no*H0*2
  22706. -->
  22707. (S1 ^operator O2114 = 1.)
  22708. inner elaboration loop at bottom goal.
  22709. Retracting rl*prefer*rvt*predict-no*H0*2
  22710. -->
  22711. (S1 ^operator O2112 = 1.)
  22712. Retracting rl*prefer*rvt*predict-yes*H0*1
  22713. -->
  22714. (S1 ^operator O2111 = 0.)
  22715. --- END Proposal Phase ---
  22716. --- Decision Phase ---
  22717. RL update rl*prefer*rvt*predict-no*H0*6 0.999997 0 0.999997 -> 0.999998 0 0.999998(R,m,v=1,0.94086,0.055943)
  22718. =>WM: (14855: S1 ^operator O2114)
  22719. 1057: O: O2114 (predict-no)
  22720. --- END Decision Phase ---
  22721. --- Application Phase ---
  22722. --- Firing Productions (PE) For State At Depth 1 ---
  22723. --- Inner Elaboration Phase, active level 1 (S1) ---
  22724. Firing apply*operator
  22725. -->
  22726. (I3 ^predict-no N1057 + :O )
  22727. Firing apply*operator*complete
  22728. -->
  22729. (I3 ^predict-no N1056 - :O )
  22730. inner elaboration loop at bottom goal.
  22731. --- Change Working Memory (PE) ---
  22732. =>WM: (14856: I3 ^predict-no N1057)
  22733. <=WM: (14842: N1056 ^status complete)
  22734. <=WM: (14841: I3 ^predict-no N1056)
  22735. --- Firing Productions (IE) For State At Depth 1 ---
  22736. --- Inner Elaboration Phase, active level 1 (S1) ---
  22737. Firing monitor*world
  22738. -->
  22739. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  22740. --- Change Working Memory (IE) ---
  22741. --- END Application Phase ---
  22742. --- Output Phase ---
  22743. ENV: Agent did: predict-no for direction U in state State-B
  22744. In State-B moving U
  22745. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  22746. predict error 0
  22747. dir: dir isL
  22748. --- END Output Phase ---
  22749. --- Input Phase ---
  22750. =>WM: (14860: I2 ^dir L)
  22751. =>WM: (14859: I2 ^reward 1)
  22752. =>WM: (14858: I2 ^see 0)
  22753. =>WM: (14857: N1057 ^status complete)
  22754. <=WM: (14845: I2 ^dir U)
  22755. <=WM: (14844: I2 ^reward 1)
  22756. <=WM: (14843: I2 ^see 0)
  22757. =>WM: (14861: I2 ^level-1 R0-root)
  22758. <=WM: (14846: I2 ^level-1 R0-root)
  22759. --- END Input Phase ---
  22760. --- Proposal Phase ---
  22761. --- Inner Elaboration Phase, active level 1 (S1) ---
  22762. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  22763. -->
  22764. (S1 ^operator O2114 = -0.1984300550322165)
  22765. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  22766. -->
  22767. (S1 ^operator O2113 = 0.6091452119121891)
  22768. Firing prefer*rvt*predict-no*H0*4*H1
  22769. -->
  22770. Firing prefer*rvt*predict-yes*H0*3*H1
  22771. -->
  22772. Firing elaborate*copy-see-to-output-link
  22773. -->
  22774. (I3 ^see 0 +)
  22775. Firing elaborate*reward*based*on*reward
  22776. -->
  22777. (R1061 ^value 1 +)
  22778. (R1 ^reward R1061 +)
  22779. Firing propose*predict-yes
  22780. -->
  22781. (O2115 ^name predict-yes +)
  22782. (S1 ^operator O2115 +)
  22783. Firing propose*predict-no
  22784. -->
  22785. (O2116 ^name predict-no +)
  22786. (S1 ^operator O2116 +)
  22787. Firing rl*prefer*rvt*predict-no*H0*4
  22788. -->
  22789. (S1 ^operator O2114 = 0.314498303095341)
  22790. Firing rl*prefer*rvt*predict-yes*H0*3
  22791. -->
  22792. (S1 ^operator O2113 = 0.3907669546625557)
  22793. Firing prefer*rvt*predict-yes*H0
  22794. -->
  22795. Firing prefer*rvt*predict-no*H0
  22796. -->
  22797. Firing elaborate*copy-dir-to-output-link
  22798. -->
  22799. (I3 ^dir L +)
  22800. inner elaboration loop at bottom goal.
  22801. Retracting elaborate*copy-see-to-output-link
  22802. -->
  22803. (I3 ^see 0 +)
  22804. Retracting propose*predict-no
  22805. -->
  22806. (O2114 ^name predict-no +)
  22807. (S1 ^operator O2114 +)
  22808. Retracting propose*predict-yes
  22809. -->
  22810. (O2113 ^name predict-yes +)
  22811. (S1 ^operator O2113 +)
  22812. Retracting elaborate*reward*based*on*reward
  22813. -->
  22814. (R1060 ^value 1 +)
  22815. (R1 ^reward R1060 +)
  22816. Retracting elaborate*copy-dir-to-output-link
  22817. -->
  22818. (I3 ^dir U +)
  22819. Retracting rl*prefer*rvt*predict-no*H0*2
  22820. -->
  22821. (S1 ^operator O2114 = 1.)
  22822. Retracting rl*prefer*rvt*predict-yes*H0*1
  22823. -->
  22824. (S1 ^operator O2113 = 0.)
  22825. =>WM: (14868: S1 ^operator O2116 +)
  22826. =>WM: (14867: S1 ^operator O2115 +)
  22827. =>WM: (14866: I3 ^dir L)
  22828. =>WM: (14865: O2116 ^name predict-no)
  22829. =>WM: (14864: O2115 ^name predict-yes)
  22830. =>WM: (14863: R1061 ^value 1)
  22831. =>WM: (14862: R1 ^reward R1061)
  22832. <=WM: (14853: S1 ^operator O2113 +)
  22833. <=WM: (14854: S1 ^operator O2114 +)
  22834. <=WM: (14855: S1 ^operator O2114)
  22835. <=WM: (14852: I3 ^dir U)
  22836. <=WM: (14848: R1 ^reward R1060)
  22837. <=WM: (14851: O2114 ^name predict-no)
  22838. <=WM: (14850: O2113 ^name predict-yes)
  22839. <=WM: (14849: R1060 ^value 1)
  22840. --- Inner Elaboration Phase, active level 1 (S1) ---
  22841. Firing prefer*rvt*predict-yes*H0
  22842. -->
  22843. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  22844. -->
  22845. (S1 ^operator O2115 = 0.6091452119121891)
  22846. Firing rl*prefer*rvt*predict-yes*H0*3
  22847. -->
  22848. (S1 ^operator O2115 = 0.3907669546625557)
  22849. Firing prefer*rvt*predict-yes*H0*3*H1
  22850. -->
  22851. Firing prefer*rvt*predict-no*H0
  22852. -->
  22853. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  22854. -->
  22855. (S1 ^operator O2116 = -0.1984300550322165)
  22856. Firing rl*prefer*rvt*predict-no*H0*4
  22857. -->
  22858. (S1 ^operator O2116 = 0.314498303095341)
  22859. Firing prefer*rvt*predict-no*H0*4*H1
  22860. -->
  22861. inner elaboration loop at bottom goal.
  22862. Retracting rl*prefer*rvt*predict-no*H0*4
  22863. -->
  22864. (S1 ^operator O2114 = 0.314498303095341)
  22865. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  22866. -->
  22867. (S1 ^operator O2114 = -0.1984300550322165)
  22868. Retracting rl*prefer*rvt*predict-yes*H0*3
  22869. -->
  22870. (S1 ^operator O2113 = 0.3907669546625557)
  22871. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  22872. -->
  22873. (S1 ^operator O2113 = 0.6091452119121891)
  22874. --- END Proposal Phase ---
  22875. --- Decision Phase ---
  22876. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  22877. =>WM: (14869: S1 ^operator O2115)
  22878. 1058: O: O2115 (predict-yes)
  22879. --- END Decision Phase ---
  22880. --- Application Phase ---
  22881. --- Firing Productions (PE) For State At Depth 1 ---
  22882. --- Inner Elaboration Phase, active level 1 (S1) ---
  22883. Firing apply*operator
  22884. -->
  22885. (I3 ^predict-yes N1058 + :O )
  22886. Firing apply*operator*complete
  22887. -->
  22888. (I3 ^predict-no N1057 - :O )
  22889. inner elaboration loop at bottom goal.
  22890. --- Change Working Memory (PE) ---
  22891. =>WM: (14870: I3 ^predict-yes N1058)
  22892. <=WM: (14857: N1057 ^status complete)
  22893. <=WM: (14856: I3 ^predict-no N1057)
  22894. --- Firing Productions (IE) For State At Depth 1 ---
  22895. --- Inner Elaboration Phase, active level 1 (S1) ---
  22896. Firing monitor*world
  22897. -->
  22898. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  22899. --- Change Working Memory (IE) ---
  22900. --- END Application Phase ---
  22901. --- Output Phase ---
  22902. ENV: Agent did: predict-yes for direction L in state State-B
  22903. In State-B moving L
  22904. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  22905. predict error 0
  22906. dir: dir isL
  22907. --- END Output Phase ---
  22908. --- Input Phase ---
  22909. =>WM: (14874: I2 ^dir L)
  22910. =>WM: (14873: I2 ^reward 1)
  22911. =>WM: (14872: I2 ^see 1)
  22912. =>WM: (14871: N1058 ^status complete)
  22913. <=WM: (14860: I2 ^dir L)
  22914. <=WM: (14859: I2 ^reward 1)
  22915. <=WM: (14858: I2 ^see 0)
  22916. =>WM: (14875: I2 ^level-1 L1-root)
  22917. <=WM: (14861: I2 ^level-1 R0-root)
  22918. --- END Input Phase ---
  22919. --- Proposal Phase ---
  22920. --- Inner Elaboration Phase, active level 1 (S1) ---
  22921. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  22922. -->
  22923. (S1 ^operator O2115 = -0.2062723012911647)
  22924. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  22925. -->
  22926. (S1 ^operator O2116 = 0.6855193314559108)
  22927. Firing prefer*rvt*predict-no*H0*4*H1
  22928. -->
  22929. Firing prefer*rvt*predict-yes*H0*3*H1
  22930. -->
  22931. Firing elaborate*copy-see-to-output-link
  22932. -->
  22933. (I3 ^see 1 +)
  22934. Firing elaborate*reward*based*on*reward
  22935. -->
  22936. (R1062 ^value 1 +)
  22937. (R1 ^reward R1062 +)
  22938. Firing propose*predict-yes
  22939. -->
  22940. (O2117 ^name predict-yes +)
  22941. (S1 ^operator O2117 +)
  22942. Firing propose*predict-no
  22943. -->
  22944. (O2118 ^name predict-no +)
  22945. (S1 ^operator O2118 +)
  22946. Firing rl*prefer*rvt*predict-no*H0*4
  22947. -->
  22948. (S1 ^operator O2116 = 0.314498303095341)
  22949. Firing rl*prefer*rvt*predict-yes*H0*3
  22950. -->
  22951. (S1 ^operator O2115 = 0.3907669546625557)
  22952. Firing prefer*rvt*predict-yes*H0
  22953. -->
  22954. Firing prefer*rvt*predict-no*H0
  22955. -->
  22956. Firing elaborate*copy-dir-to-output-link
  22957. -->
  22958. (I3 ^dir L +)
  22959. inner elaboration loop at bottom goal.
  22960. Retracting elaborate*copy-see-to-output-link
  22961. -->
  22962. (I3 ^see 0 +)
  22963. Retracting propose*predict-no
  22964. -->
  22965. (O2116 ^name predict-no +)
  22966. (S1 ^operator O2116 +)
  22967. Retracting propose*predict-yes
  22968. -->
  22969. (O2115 ^name predict-yes +)
  22970. (S1 ^operator O2115 +)
  22971. Retracting elaborate*reward*based*on*reward
  22972. -->
  22973. (R1061 ^value 1 +)
  22974. (R1 ^reward R1061 +)
  22975. Retracting elaborate*copy-dir-to-output-link
  22976. -->
  22977. (I3 ^dir L +)
  22978. Retracting rl*prefer*rvt*predict-no*H0*4
  22979. -->
  22980. (S1 ^operator O2116 = 0.314498303095341)
  22981. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  22982. -->
  22983. (S1 ^operator O2116 = -0.1984300550322165)
  22984. Retracting rl*prefer*rvt*predict-yes*H0*3
  22985. -->
  22986. (S1 ^operator O2115 = 0.3907669546625557)
  22987. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  22988. -->
  22989. (S1 ^operator O2115 = 0.6091452119121891)
  22990. =>WM: (14882: S1 ^operator O2118 +)
  22991. =>WM: (14881: S1 ^operator O2117 +)
  22992. =>WM: (14880: O2118 ^name predict-no)
  22993. =>WM: (14879: O2117 ^name predict-yes)
  22994. =>WM: (14878: R1062 ^value 1)
  22995. =>WM: (14877: R1 ^reward R1062)
  22996. =>WM: (14876: I3 ^see 1)
  22997. <=WM: (14867: S1 ^operator O2115 +)
  22998. <=WM: (14869: S1 ^operator O2115)
  22999. <=WM: (14868: S1 ^operator O2116 +)
  23000. <=WM: (14862: R1 ^reward R1061)
  23001. <=WM: (14847: I3 ^see 0)
  23002. <=WM: (14865: O2116 ^name predict-no)
  23003. <=WM: (14864: O2115 ^name predict-yes)
  23004. <=WM: (14863: R1061 ^value 1)
  23005. --- Inner Elaboration Phase, active level 1 (S1) ---
  23006. Firing prefer*rvt*predict-yes*H0
  23007. -->
  23008. Firing rl*prefer*rvt*predict-yes*H0*3
  23009. -->
  23010. (S1 ^operator O2117 = 0.3907669546625557)
  23011. Firing prefer*rvt*predict-yes*H0*3*H1
  23012. -->
  23013. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  23014. -->
  23015. (S1 ^operator O2117 = -0.2062723012911647)
  23016. Firing prefer*rvt*predict-no*H0
  23017. -->
  23018. Firing rl*prefer*rvt*predict-no*H0*4
  23019. -->
  23020. (S1 ^operator O2118 = 0.314498303095341)
  23021. Firing prefer*rvt*predict-no*H0*4*H1
  23022. -->
  23023. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  23024. -->
  23025. (S1 ^operator O2118 = 0.6855193314559108)
  23026. inner elaboration loop at bottom goal.
  23027. Retracting rl*prefer*rvt*predict-no*H0*4
  23028. -->
  23029. (S1 ^operator O2116 = 0.314498303095341)
  23030. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  23031. -->
  23032. (S1 ^operator O2116 = 0.6855193314559108)
  23033. Retracting rl*prefer*rvt*predict-yes*H0*3
  23034. -->
  23035. (S1 ^operator O2115 = 0.3907669546625557)
  23036. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  23037. -->
  23038. (S1 ^operator O2115 = -0.2062723012911647)
  23039. --- END Proposal Phase ---
  23040. --- Decision Phase ---
  23041. RL update rl*prefer*rvt*predict-yes*H0*3 0.472314 -0.0815475 0.390767 -> 0.472321 -0.0815465 0.390774(R,m,v=1,0.947674,0.0498776)
  23042. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527611 0.0815345 0.609145 -> 0.527618 0.0815357 0.609153(R,m,v=1,1,0)
  23043. =>WM: (14883: S1 ^operator O2118)
  23044. 1059: O: O2118 (predict-no)
  23045. --- END Decision Phase ---
  23046. --- Application Phase ---
  23047. --- Firing Productions (PE) For State At Depth 1 ---
  23048. --- Inner Elaboration Phase, active level 1 (S1) ---
  23049. Firing apply*operator
  23050. -->
  23051. (I3 ^predict-no N1059 + :O )
  23052. Firing apply*operator*complete
  23053. -->
  23054. (I3 ^predict-yes N1058 - :O )
  23055. inner elaboration loop at bottom goal.
  23056. --- Change Working Memory (PE) ---
  23057. =>WM: (14884: I3 ^predict-no N1059)
  23058. <=WM: (14871: N1058 ^status complete)
  23059. <=WM: (14870: I3 ^predict-yes N1058)
  23060. --- Firing Productions (IE) For State At Depth 1 ---
  23061. --- Inner Elaboration Phase, active level 1 (S1) ---
  23062. Firing monitor*world
  23063. -->
  23064. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23065. --- Change Working Memory (IE) ---
  23066. --- END Application Phase ---
  23067. --- Output Phase ---
  23068. ENV: Agent did: predict-no for direction L in state State-A
  23069. In State-A moving L
  23070. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23071. predict error 0
  23072. dir: dir isR
  23073. --- END Output Phase ---
  23074. --- Input Phase ---
  23075. =>WM: (14888: I2 ^dir R)
  23076. =>WM: (14887: I2 ^reward 1)
  23077. =>WM: (14886: I2 ^see 0)
  23078. =>WM: (14885: N1059 ^status complete)
  23079. <=WM: (14874: I2 ^dir L)
  23080. <=WM: (14873: I2 ^reward 1)
  23081. <=WM: (14872: I2 ^see 1)
  23082. =>WM: (14889: I2 ^level-1 L0-root)
  23083. <=WM: (14875: I2 ^level-1 L1-root)
  23084. --- END Input Phase ---
  23085. --- Proposal Phase ---
  23086. --- Inner Elaboration Phase, active level 1 (S1) ---
  23087. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  23088. -->
  23089. (S1 ^operator O2117 = 0.8783979318684918)
  23090. Firing prefer*rvt*predict-yes*H0*5*H1
  23091. -->
  23092. Firing elaborate*copy-see-to-output-link
  23093. -->
  23094. (I3 ^see 0 +)
  23095. Firing elaborate*reward*based*on*reward
  23096. -->
  23097. (R1063 ^value 1 +)
  23098. (R1 ^reward R1063 +)
  23099. Firing propose*predict-yes
  23100. -->
  23101. (O2119 ^name predict-yes +)
  23102. (S1 ^operator O2119 +)
  23103. Firing propose*predict-no
  23104. -->
  23105. (O2120 ^name predict-no +)
  23106. (S1 ^operator O2120 +)
  23107. Firing rl*prefer*rvt*predict-no*H0*6
  23108. -->
  23109. (S1 ^operator O2118 = 0.9999977128360235)
  23110. Firing rl*prefer*rvt*predict-yes*H0*5
  23111. -->
  23112. (S1 ^operator O2117 = 0.1215962258870366)
  23113. Firing prefer*rvt*predict-yes*H0
  23114. -->
  23115. Firing prefer*rvt*predict-no*H0
  23116. -->
  23117. Firing elaborate*copy-dir-to-output-link
  23118. -->
  23119. (I3 ^dir R +)
  23120. inner elaboration loop at bottom goal.
  23121. Retracting elaborate*copy-see-to-output-link
  23122. -->
  23123. (I3 ^see 1 +)
  23124. Retracting propose*predict-no
  23125. -->
  23126. (O2118 ^name predict-no +)
  23127. (S1 ^operator O2118 +)
  23128. Retracting propose*predict-yes
  23129. -->
  23130. (O2117 ^name predict-yes +)
  23131. (S1 ^operator O2117 +)
  23132. Retracting elaborate*reward*based*on*reward
  23133. -->
  23134. (R1062 ^value 1 +)
  23135. (R1 ^reward R1062 +)
  23136. Retracting elaborate*copy-dir-to-output-link
  23137. -->
  23138. (I3 ^dir L +)
  23139. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  23140. -->
  23141. (S1 ^operator O2118 = 0.6855193314559108)
  23142. Retracting rl*prefer*rvt*predict-no*H0*4
  23143. -->
  23144. (S1 ^operator O2118 = 0.314498303095341)
  23145. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  23146. -->
  23147. (S1 ^operator O2117 = -0.2062723012911647)
  23148. Retracting rl*prefer*rvt*predict-yes*H0*3
  23149. -->
  23150. (S1 ^operator O2117 = 0.3907740985018537)
  23151. =>WM: (14897: S1 ^operator O2120 +)
  23152. =>WM: (14896: S1 ^operator O2119 +)
  23153. =>WM: (14895: I3 ^dir R)
  23154. =>WM: (14894: O2120 ^name predict-no)
  23155. =>WM: (14893: O2119 ^name predict-yes)
  23156. =>WM: (14892: R1063 ^value 1)
  23157. =>WM: (14891: R1 ^reward R1063)
  23158. =>WM: (14890: I3 ^see 0)
  23159. <=WM: (14881: S1 ^operator O2117 +)
  23160. <=WM: (14882: S1 ^operator O2118 +)
  23161. <=WM: (14883: S1 ^operator O2118)
  23162. <=WM: (14866: I3 ^dir L)
  23163. <=WM: (14877: R1 ^reward R1062)
  23164. <=WM: (14876: I3 ^see 1)
  23165. <=WM: (14880: O2118 ^name predict-no)
  23166. <=WM: (14879: O2117 ^name predict-yes)
  23167. <=WM: (14878: R1062 ^value 1)
  23168. --- Inner Elaboration Phase, active level 1 (S1) ---
  23169. Firing prefer*rvt*predict-yes*H0
  23170. -->
  23171. Firing rl*prefer*rvt*predict-yes*H0*5
  23172. -->
  23173. (S1 ^operator O2119 = 0.1215962258870366)
  23174. Firing prefer*rvt*predict-yes*H0*5*H1
  23175. -->
  23176. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  23177. -->
  23178. (S1 ^operator O2119 = 0.8783979318684918)
  23179. Firing prefer*rvt*predict-no*H0
  23180. -->
  23181. Firing rl*prefer*rvt*predict-no*H0*6
  23182. -->
  23183. (S1 ^operator O2120 = 0.9999977128360235)
  23184. inner elaboration loop at bottom goal.
  23185. Retracting rl*prefer*rvt*predict-no*H0*6
  23186. -->
  23187. (S1 ^operator O2118 = 0.9999977128360235)
  23188. Retracting rl*prefer*rvt*predict-yes*H0*5
  23189. -->
  23190. (S1 ^operator O2117 = 0.1215962258870366)
  23191. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  23192. -->
  23193. (S1 ^operator O2117 = 0.8783979318684918)
  23194. --- END Proposal Phase ---
  23195. --- Decision Phase ---
  23196. RL update rl*prefer*rvt*predict-no*H0*4 0.478547 -0.164049 0.314498 -> 0.478546 -0.164049 0.314497(R,m,v=1,0.925926,0.0690131)
  23197. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521469 0.16405 0.685519 -> 0.521467 0.16405 0.685518(R,m,v=1,1,0)
  23198. =>WM: (14898: S1 ^operator O2120)
  23199. 1060: O: O2120 (predict-no)
  23200. --- END Decision Phase ---
  23201. --- Application Phase ---
  23202. --- Firing Productions (PE) For State At Depth 1 ---
  23203. --- Inner Elaboration Phase, active level 1 (S1) ---
  23204. Firing apply*operator
  23205. -->
  23206. (I3 ^predict-no N1060 + :O )
  23207. Firing apply*operator*complete
  23208. -->
  23209. (I3 ^predict-no N1059 - :O )
  23210. inner elaboration loop at bottom goal.
  23211. --- Change Working Memory (PE) ---
  23212. =>WM: (14899: I3 ^predict-no N1060)
  23213. <=WM: (14885: N1059 ^status complete)
  23214. <=WM: (14884: I3 ^predict-no N1059)
  23215. --- Firing Productions (IE) For State At Depth 1 ---
  23216. --- Inner Elaboration Phase, active level 1 (S1) ---
  23217. Firing monitor*world
  23218. -->
  23219. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23220. --- Change Working Memory (IE) ---
  23221. --- END Application Phase ---
  23222. --- Output Phase ---
  23223. ENV: Agent did: predict-no for direction R in state State-A
  23224. In State-A moving R
  23225. ENV: (next state, see, prediction correct?) = (State-B, 1, False)
  23226. predict error 1
  23227. dir: dir isU
  23228. --- END Output Phase ---
  23229. /|\--- Input Phase ---
  23230. =>WM: (14903: I2 ^dir U)
  23231. =>WM: (14902: I2 ^reward 0)
  23232. =>WM: (14901: I2 ^see 1)
  23233. =>WM: (14900: N1060 ^status complete)
  23234. <=WM: (14888: I2 ^dir R)
  23235. <=WM: (14887: I2 ^reward 1)
  23236. <=WM: (14886: I2 ^see 0)
  23237. =>WM: (14904: I2 ^level-1 R1-root)
  23238. <=WM: (14889: I2 ^level-1 L0-root)
  23239. --- END Input Phase ---
  23240. --- Proposal Phase ---
  23241. --- Inner Elaboration Phase, active level 1 (S1) ---
  23242. Firing elaborate*copy-see-to-output-link
  23243. -->
  23244. (I3 ^see 1 +)
  23245. Firing elaborate*reward*based*on*reward
  23246. -->
  23247. (R1064 ^value 0 +)
  23248. (R1 ^reward R1064 +)
  23249. Firing propose*predict-yes
  23250. -->
  23251. (O2121 ^name predict-yes +)
  23252. (S1 ^operator O2121 +)
  23253. Firing propose*predict-no
  23254. -->
  23255. (O2122 ^name predict-no +)
  23256. (S1 ^operator O2122 +)
  23257. Firing rl*prefer*rvt*predict-no*H0*2
  23258. -->
  23259. (S1 ^operator O2120 = 1.)
  23260. Firing rl*prefer*rvt*predict-yes*H0*1
  23261. -->
  23262. (S1 ^operator O2119 = 0.)
  23263. Firing prefer*rvt*predict-yes*H0
  23264. -->
  23265. Firing prefer*rvt*predict-no*H0
  23266. -->
  23267. Firing elaborate*copy-dir-to-output-link
  23268. -->
  23269. (I3 ^dir U +)
  23270. inner elaboration loop at bottom goal.
  23271. Retracting elaborate*copy-see-to-output-link
  23272. -->
  23273. (I3 ^see 0 +)
  23274. Retracting propose*predict-no
  23275. -->
  23276. (O2120 ^name predict-no +)
  23277. (S1 ^operator O2120 +)
  23278. Retracting propose*predict-yes
  23279. -->
  23280. (O2119 ^name predict-yes +)
  23281. (S1 ^operator O2119 +)
  23282. Retracting elaborate*reward*based*on*reward
  23283. -->
  23284. (R1063 ^value 1 +)
  23285. (R1 ^reward R1063 +)
  23286. Retracting elaborate*copy-dir-to-output-link
  23287. -->
  23288. (I3 ^dir R +)
  23289. Retracting rl*prefer*rvt*predict-no*H0*6
  23290. -->
  23291. (S1 ^operator O2120 = 0.9999977128360235)
  23292. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  23293. -->
  23294. (S1 ^operator O2119 = 0.8783979318684918)
  23295. Retracting rl*prefer*rvt*predict-yes*H0*5
  23296. -->
  23297. (S1 ^operator O2119 = 0.1215962258870366)
  23298. =>WM: (14912: S1 ^operator O2122 +)
  23299. =>WM: (14911: S1 ^operator O2121 +)
  23300. =>WM: (14910: I3 ^dir U)
  23301. =>WM: (14909: O2122 ^name predict-no)
  23302. =>WM: (14908: O2121 ^name predict-yes)
  23303. =>WM: (14907: R1064 ^value 0)
  23304. =>WM: (14906: R1 ^reward R1064)
  23305. =>WM: (14905: I3 ^see 1)
  23306. <=WM: (14896: S1 ^operator O2119 +)
  23307. <=WM: (14897: S1 ^operator O2120 +)
  23308. <=WM: (14898: S1 ^operator O2120)
  23309. <=WM: (14895: I3 ^dir R)
  23310. <=WM: (14891: R1 ^reward R1063)
  23311. <=WM: (14890: I3 ^see 0)
  23312. <=WM: (14894: O2120 ^name predict-no)
  23313. <=WM: (14893: O2119 ^name predict-yes)
  23314. <=WM: (14892: R1063 ^value 1)
  23315. --- Inner Elaboration Phase, active level 1 (S1) ---
  23316. Firing prefer*rvt*predict-yes*H0
  23317. -->
  23318. Firing rl*prefer*rvt*predict-yes*H0*1
  23319. -->
  23320. (S1 ^operator O2121 = 0.)
  23321. Firing prefer*rvt*predict-no*H0
  23322. -->
  23323. Firing rl*prefer*rvt*predict-no*H0*2
  23324. -->
  23325. (S1 ^operator O2122 = 1.)
  23326. inner elaboration loop at bottom goal.
  23327. Retracting rl*prefer*rvt*predict-no*H0*2
  23328. -->
  23329. (S1 ^operator O2120 = 1.)
  23330. Retracting rl*prefer*rvt*predict-yes*H0*1
  23331. -->
  23332. (S1 ^operator O2119 = 0.)
  23333. --- END Proposal Phase ---
  23334. --- Decision Phase ---
  23335. RL update rl*prefer*rvt*predict-no*H0*6 0.999998 0 0.999998 -> 0.839513 0 0.839513(R,m,v=0,0.935829,0.0603761)
  23336. =>WM: (14913: S1 ^operator O2122)
  23337. 1061: O: O2122 (predict-no)
  23338. --- END Decision Phase ---
  23339. --- Application Phase ---
  23340. --- Firing Productions (PE) For State At Depth 1 ---
  23341. --- Inner Elaboration Phase, active level 1 (S1) ---
  23342. Firing apply*operator
  23343. -->
  23344. (I3 ^predict-no N1061 + :O )
  23345. Firing apply*operator*complete
  23346. -->
  23347. (I3 ^predict-no N1060 - :O )
  23348. inner elaboration loop at bottom goal.
  23349. --- Change Working Memory (PE) ---
  23350. =>WM: (14914: I3 ^predict-no N1061)
  23351. <=WM: (14900: N1060 ^status complete)
  23352. <=WM: (14899: I3 ^predict-no N1060)
  23353. --- Firing Productions (IE) For State At Depth 1 ---
  23354. --- Inner Elaboration Phase, active level 1 (S1) ---
  23355. Firing monitor*world
  23356. -->
  23357. I see 0 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23358. --- Change Working Memory (IE) ---
  23359. --- END Application Phase ---
  23360. --- Output Phase ---
  23361. ENV: Agent did: predict-no for direction U in state State-B
  23362. In State-B moving U
  23363. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  23364. predict error 0
  23365. dir: dir isL
  23366. --- END Output Phase ---
  23367. ---- Input Phase ---
  23368. =>WM: (14918: I2 ^dir L)
  23369. =>WM: (14917: I2 ^reward 1)
  23370. =>WM: (14916: I2 ^see 0)
  23371. =>WM: (14915: N1061 ^status complete)
  23372. <=WM: (14903: I2 ^dir U)
  23373. <=WM: (14902: I2 ^reward 0)
  23374. <=WM: (14901: I2 ^see 1)
  23375. =>WM: (14919: I2 ^level-1 R1-root)
  23376. <=WM: (14904: I2 ^level-1 R1-root)
  23377. --- END Input Phase ---
  23378. --- Proposal Phase ---
  23379. --- Inner Elaboration Phase, active level 1 (S1) ---
  23380. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  23381. -->
  23382. (S1 ^operator O2122 = -0.168718511744511)
  23383. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  23384. -->
  23385. (S1 ^operator O2121 = 0.609265798910378)
  23386. Firing prefer*rvt*predict-no*H0*4*H1
  23387. -->
  23388. Firing prefer*rvt*predict-yes*H0*3*H1
  23389. -->
  23390. Firing elaborate*copy-see-to-output-link
  23391. -->
  23392. (I3 ^see 0 +)
  23393. Firing elaborate*reward*based*on*reward
  23394. -->
  23395. (R1065 ^value 1 +)
  23396. (R1 ^reward R1065 +)
  23397. Firing propose*predict-yes
  23398. -->
  23399. (O2123 ^name predict-yes +)
  23400. (S1 ^operator O2123 +)
  23401. Firing propose*predict-no
  23402. -->
  23403. (O2124 ^name predict-no +)
  23404. (S1 ^operator O2124 +)
  23405. Firing rl*prefer*rvt*predict-no*H0*4
  23406. -->
  23407. (S1 ^operator O2122 = 0.3144968546951614)
  23408. Firing rl*prefer*rvt*predict-yes*H0*3
  23409. -->
  23410. (S1 ^operator O2121 = 0.3907740985018537)
  23411. Firing prefer*rvt*predict-yes*H0
  23412. -->
  23413. Firing prefer*rvt*predict-no*H0
  23414. -->
  23415. Firing elaborate*copy-dir-to-output-link
  23416. -->
  23417. (I3 ^dir L +)
  23418. inner elaboration loop at bottom goal.
  23419. Retracting elaborate*copy-see-to-output-link
  23420. -->
  23421. (I3 ^see 1 +)
  23422. Retracting propose*predict-no
  23423. -->
  23424. (O2122 ^name predict-no +)
  23425. (S1 ^operator O2122 +)
  23426. Retracting propose*predict-yes
  23427. -->
  23428. (O2121 ^name predict-yes +)
  23429. (S1 ^operator O2121 +)
  23430. Retracting elaborate*reward*based*on*reward
  23431. -->
  23432. (R1064 ^value 0 +)
  23433. (R1 ^reward R1064 +)
  23434. Retracting elaborate*copy-dir-to-output-link
  23435. -->
  23436. (I3 ^dir U +)
  23437. Retracting rl*prefer*rvt*predict-no*H0*2
  23438. -->
  23439. (S1 ^operator O2122 = 1.)
  23440. Retracting rl*prefer*rvt*predict-yes*H0*1
  23441. -->
  23442. (S1 ^operator O2121 = 0.)
  23443. =>WM: (14927: S1 ^operator O2124 +)
  23444. =>WM: (14926: S1 ^operator O2123 +)
  23445. =>WM: (14925: I3 ^dir L)
  23446. =>WM: (14924: O2124 ^name predict-no)
  23447. =>WM: (14923: O2123 ^name predict-yes)
  23448. =>WM: (14922: R1065 ^value 1)
  23449. =>WM: (14921: R1 ^reward R1065)
  23450. =>WM: (14920: I3 ^see 0)
  23451. <=WM: (14911: S1 ^operator O2121 +)
  23452. <=WM: (14912: S1 ^operator O2122 +)
  23453. <=WM: (14913: S1 ^operator O2122)
  23454. <=WM: (14910: I3 ^dir U)
  23455. <=WM: (14906: R1 ^reward R1064)
  23456. <=WM: (14905: I3 ^see 1)
  23457. <=WM: (14909: O2122 ^name predict-no)
  23458. <=WM: (14908: O2121 ^name predict-yes)
  23459. <=WM: (14907: R1064 ^value 0)
  23460. --- Inner Elaboration Phase, active level 1 (S1) ---
  23461. Firing prefer*rvt*predict-yes*H0
  23462. -->
  23463. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  23464. -->
  23465. (S1 ^operator O2123 = 0.609265798910378)
  23466. Firing rl*prefer*rvt*predict-yes*H0*3
  23467. -->
  23468. (S1 ^operator O2123 = 0.3907740985018537)
  23469. Firing prefer*rvt*predict-yes*H0*3*H1
  23470. -->
  23471. Firing prefer*rvt*predict-no*H0
  23472. -->
  23473. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  23474. -->
  23475. (S1 ^operator O2124 = -0.168718511744511)
  23476. Firing rl*prefer*rvt*predict-no*H0*4
  23477. -->
  23478. (S1 ^operator O2124 = 0.3144968546951614)
  23479. Firing prefer*rvt*predict-no*H0*4*H1
  23480. -->
  23481. inner elaboration loop at bottom goal.
  23482. Retracting rl*prefer*rvt*predict-no*H0*4
  23483. -->
  23484. (S1 ^operator O2122 = 0.3144968546951614)
  23485. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  23486. -->
  23487. (S1 ^operator O2122 = -0.168718511744511)
  23488. Retracting rl*prefer*rvt*predict-yes*H0*3
  23489. -->
  23490. (S1 ^operator O2121 = 0.3907740985018537)
  23491. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  23492. -->
  23493. (S1 ^operator O2121 = 0.609265798910378)
  23494. --- END Proposal Phase ---
  23495. --- Decision Phase ---
  23496. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23497. =>WM: (14928: S1 ^operator O2123)
  23498. 1062: O: O2123 (predict-yes)
  23499. --- END Decision Phase ---
  23500. --- Application Phase ---
  23501. --- Firing Productions (PE) For State At Depth 1 ---
  23502. --- Inner Elaboration Phase, active level 1 (S1) ---
  23503. Firing apply*operator
  23504. -->
  23505. (I3 ^predict-yes N1062 + :O )
  23506. Firing apply*operator*complete
  23507. -->
  23508. (I3 ^predict-no N1061 - :O )
  23509. inner elaboration loop at bottom goal.
  23510. --- Change Working Memory (PE) ---
  23511. =>WM: (14929: I3 ^predict-yes N1062)
  23512. <=WM: (14915: N1061 ^status complete)
  23513. <=WM: (14914: I3 ^predict-no N1061)
  23514. --- Firing Productions (IE) For State At Depth 1 ---
  23515. --- Inner Elaboration Phase, active level 1 (S1) ---
  23516. Firing monitor*world
  23517. -->
  23518. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  23519. --- Change Working Memory (IE) ---
  23520. --- END Application Phase ---
  23521. --- Output Phase ---
  23522. ENV: Agent did: predict-yes for direction L in state State-B
  23523. In State-B moving L
  23524. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  23525. predict error 0
  23526. dir: dir isU
  23527. --- END Output Phase ---
  23528. /|\--- Input Phase ---
  23529. =>WM: (14933: I2 ^dir U)
  23530. =>WM: (14932: I2 ^reward 1)
  23531. =>WM: (14931: I2 ^see 1)
  23532. =>WM: (14930: N1062 ^status complete)
  23533. <=WM: (14918: I2 ^dir L)
  23534. <=WM: (14917: I2 ^reward 1)
  23535. <=WM: (14916: I2 ^see 0)
  23536. =>WM: (14934: I2 ^level-1 L1-root)
  23537. <=WM: (14919: I2 ^level-1 R1-root)
  23538. --- END Input Phase ---
  23539. --- Proposal Phase ---
  23540. --- Inner Elaboration Phase, active level 1 (S1) ---
  23541. Firing elaborate*copy-see-to-output-link
  23542. -->
  23543. (I3 ^see 1 +)
  23544. Firing elaborate*reward*based*on*reward
  23545. -->
  23546. (R1066 ^value 1 +)
  23547. (R1 ^reward R1066 +)
  23548. Firing propose*predict-yes
  23549. -->
  23550. (O2125 ^name predict-yes +)
  23551. (S1 ^operator O2125 +)
  23552. Firing propose*predict-no
  23553. -->
  23554. (O2126 ^name predict-no +)
  23555. (S1 ^operator O2126 +)
  23556. Firing rl*prefer*rvt*predict-no*H0*2
  23557. -->
  23558. (S1 ^operator O2124 = 1.)
  23559. Firing rl*prefer*rvt*predict-yes*H0*1
  23560. -->
  23561. (S1 ^operator O2123 = 0.)
  23562. Firing prefer*rvt*predict-yes*H0
  23563. -->
  23564. Firing prefer*rvt*predict-no*H0
  23565. -->
  23566. Firing elaborate*copy-dir-to-output-link
  23567. -->
  23568. (I3 ^dir U +)
  23569. inner elaboration loop at bottom goal.
  23570. Retracting elaborate*copy-see-to-output-link
  23571. -->
  23572. (I3 ^see 0 +)
  23573. Retracting propose*predict-no
  23574. -->
  23575. (O2124 ^name predict-no +)
  23576. (S1 ^operator O2124 +)
  23577. Retracting propose*predict-yes
  23578. -->
  23579. (O2123 ^name predict-yes +)
  23580. (S1 ^operator O2123 +)
  23581. Retracting elaborate*reward*based*on*reward
  23582. -->
  23583. (R1065 ^value 1 +)
  23584. (R1 ^reward R1065 +)
  23585. Retracting elaborate*copy-dir-to-output-link
  23586. -->
  23587. (I3 ^dir L +)
  23588. Retracting rl*prefer*rvt*predict-no*H0*4
  23589. -->
  23590. (S1 ^operator O2124 = 0.3144968546951614)
  23591. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  23592. -->
  23593. (S1 ^operator O2124 = -0.168718511744511)
  23594. Retracting rl*prefer*rvt*predict-yes*H0*3
  23595. -->
  23596. (S1 ^operator O2123 = 0.3907740985018537)
  23597. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  23598. -->
  23599. (S1 ^operator O2123 = 0.609265798910378)
  23600. =>WM: (14942: S1 ^operator O2126 +)
  23601. =>WM: (14941: S1 ^operator O2125 +)
  23602. =>WM: (14940: I3 ^dir U)
  23603. =>WM: (14939: O2126 ^name predict-no)
  23604. =>WM: (14938: O2125 ^name predict-yes)
  23605. =>WM: (14937: R1066 ^value 1)
  23606. =>WM: (14936: R1 ^reward R1066)
  23607. =>WM: (14935: I3 ^see 1)
  23608. <=WM: (14926: S1 ^operator O2123 +)
  23609. <=WM: (14928: S1 ^operator O2123)
  23610. <=WM: (14927: S1 ^operator O2124 +)
  23611. <=WM: (14925: I3 ^dir L)
  23612. <=WM: (14921: R1 ^reward R1065)
  23613. <=WM: (14920: I3 ^see 0)
  23614. <=WM: (14924: O2124 ^name predict-no)
  23615. <=WM: (14923: O2123 ^name predict-yes)
  23616. <=WM: (14922: R1065 ^value 1)
  23617. --- Inner Elaboration Phase, active level 1 (S1) ---
  23618. Firing prefer*rvt*predict-yes*H0
  23619. -->
  23620. Firing rl*prefer*rvt*predict-yes*H0*1
  23621. -->
  23622. (S1 ^operator O2125 = 0.)
  23623. Firing prefer*rvt*predict-no*H0
  23624. -->
  23625. Firing rl*prefer*rvt*predict-no*H0*2
  23626. -->
  23627. (S1 ^operator O2126 = 1.)
  23628. inner elaboration loop at bottom goal.
  23629. Retracting rl*prefer*rvt*predict-no*H0*2
  23630. -->
  23631. (S1 ^operator O2124 = 1.)
  23632. Retracting rl*prefer*rvt*predict-yes*H0*1
  23633. -->
  23634. (S1 ^operator O2123 = 0.)
  23635. --- END Proposal Phase ---
  23636. --- Decision Phase ---
  23637. RL update rl*prefer*rvt*predict-yes*H0*3 0.472321 -0.0815465 0.390774 -> 0.472318 -0.0815469 0.390771(R,m,v=1,0.947977,0.0496034)
  23638. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527713 0.0815524 0.609266 -> 0.52771 0.0815518 0.609262(R,m,v=1,1,0)
  23639. =>WM: (14943: S1 ^operator O2126)
  23640. 1063: O: O2126 (predict-no)
  23641. --- END Decision Phase ---
  23642. --- Application Phase ---
  23643. --- Firing Productions (PE) For State At Depth 1 ---
  23644. --- Inner Elaboration Phase, active level 1 (S1) ---
  23645. Firing apply*operator
  23646. -->
  23647. (I3 ^predict-no N1063 + :O )
  23648. Firing apply*operator*complete
  23649. -->
  23650. (I3 ^predict-yes N1062 - :O )
  23651. inner elaboration loop at bottom goal.
  23652. --- Change Working Memory (PE) ---
  23653. =>WM: (14944: I3 ^predict-no N1063)
  23654. <=WM: (14930: N1062 ^status complete)
  23655. <=WM: (14929: I3 ^predict-yes N1062)
  23656. --- Firing Productions (IE) For State At Depth 1 ---
  23657. --- Inner Elaboration Phase, active level 1 (S1) ---
  23658. Firing monitor*world
  23659. -->
  23660. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23661. --- Change Working Memory (IE) ---
  23662. --- END Application Phase ---
  23663. --- Output Phase ---
  23664. ENV: Agent did: predict-no for direction U in state State-A
  23665. In State-A moving U
  23666. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23667. predict error 0
  23668. dir: dir isU
  23669. --- END Output Phase ---
  23670. -/|--- Input Phase ---
  23671. =>WM: (14948: I2 ^dir U)
  23672. =>WM: (14947: I2 ^reward 1)
  23673. =>WM: (14946: I2 ^see 0)
  23674. =>WM: (14945: N1063 ^status complete)
  23675. <=WM: (14933: I2 ^dir U)
  23676. <=WM: (14932: I2 ^reward 1)
  23677. <=WM: (14931: I2 ^see 1)
  23678. =>WM: (14949: I2 ^level-1 L1-root)
  23679. <=WM: (14934: I2 ^level-1 L1-root)
  23680. --- END Input Phase ---
  23681. --- Proposal Phase ---
  23682. --- Inner Elaboration Phase, active level 1 (S1) ---
  23683. Firing elaborate*copy-see-to-output-link
  23684. -->
  23685. (I3 ^see 0 +)
  23686. Firing elaborate*reward*based*on*reward
  23687. -->
  23688. (R1067 ^value 1 +)
  23689. (R1 ^reward R1067 +)
  23690. Firing propose*predict-yes
  23691. -->
  23692. (O2127 ^name predict-yes +)
  23693. (S1 ^operator O2127 +)
  23694. Firing propose*predict-no
  23695. -->
  23696. (O2128 ^name predict-no +)
  23697. (S1 ^operator O2128 +)
  23698. Firing rl*prefer*rvt*predict-no*H0*2
  23699. -->
  23700. (S1 ^operator O2126 = 1.)
  23701. Firing rl*prefer*rvt*predict-yes*H0*1
  23702. -->
  23703. (S1 ^operator O2125 = 0.)
  23704. Firing prefer*rvt*predict-yes*H0
  23705. -->
  23706. Firing prefer*rvt*predict-no*H0
  23707. -->
  23708. Firing elaborate*copy-dir-to-output-link
  23709. -->
  23710. (I3 ^dir U +)
  23711. inner elaboration loop at bottom goal.
  23712. Retracting elaborate*copy-see-to-output-link
  23713. -->
  23714. (I3 ^see 1 +)
  23715. Retracting propose*predict-no
  23716. -->
  23717. (O2126 ^name predict-no +)
  23718. (S1 ^operator O2126 +)
  23719. Retracting propose*predict-yes
  23720. -->
  23721. (O2125 ^name predict-yes +)
  23722. (S1 ^operator O2125 +)
  23723. Retracting elaborate*reward*based*on*reward
  23724. -->
  23725. (R1066 ^value 1 +)
  23726. (R1 ^reward R1066 +)
  23727. Retracting elaborate*copy-dir-to-output-link
  23728. -->
  23729. (I3 ^dir U +)
  23730. Retracting rl*prefer*rvt*predict-no*H0*2
  23731. -->
  23732. (S1 ^operator O2126 = 1.)
  23733. Retracting rl*prefer*rvt*predict-yes*H0*1
  23734. -->
  23735. (S1 ^operator O2125 = 0.)
  23736. =>WM: (14956: S1 ^operator O2128 +)
  23737. =>WM: (14955: S1 ^operator O2127 +)
  23738. =>WM: (14954: O2128 ^name predict-no)
  23739. =>WM: (14953: O2127 ^name predict-yes)
  23740. =>WM: (14952: R1067 ^value 1)
  23741. =>WM: (14951: R1 ^reward R1067)
  23742. =>WM: (14950: I3 ^see 0)
  23743. <=WM: (14941: S1 ^operator O2125 +)
  23744. <=WM: (14942: S1 ^operator O2126 +)
  23745. <=WM: (14943: S1 ^operator O2126)
  23746. <=WM: (14936: R1 ^reward R1066)
  23747. <=WM: (14935: I3 ^see 1)
  23748. <=WM: (14939: O2126 ^name predict-no)
  23749. <=WM: (14938: O2125 ^name predict-yes)
  23750. <=WM: (14937: R1066 ^value 1)
  23751. --- Inner Elaboration Phase, active level 1 (S1) ---
  23752. Firing prefer*rvt*predict-yes*H0
  23753. -->
  23754. Firing rl*prefer*rvt*predict-yes*H0*1
  23755. -->
  23756. (S1 ^operator O2127 = 0.)
  23757. Firing prefer*rvt*predict-no*H0
  23758. -->
  23759. Firing rl*prefer*rvt*predict-no*H0*2
  23760. -->
  23761. (S1 ^operator O2128 = 1.)
  23762. inner elaboration loop at bottom goal.
  23763. Retracting rl*prefer*rvt*predict-no*H0*2
  23764. -->
  23765. (S1 ^operator O2126 = 1.)
  23766. Retracting rl*prefer*rvt*predict-yes*H0*1
  23767. -->
  23768. (S1 ^operator O2125 = 0.)
  23769. --- END Proposal Phase ---
  23770. --- Decision Phase ---
  23771. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23772. =>WM: (14957: S1 ^operator O2128)
  23773. 1064: O: O2128 (predict-no)
  23774. --- END Decision Phase ---
  23775. --- Application Phase ---
  23776. --- Firing Productions (PE) For State At Depth 1 ---
  23777. --- Inner Elaboration Phase, active level 1 (S1) ---
  23778. Firing apply*operator
  23779. -->
  23780. (I3 ^predict-no N1064 + :O )
  23781. Firing apply*operator*complete
  23782. -->
  23783. (I3 ^predict-no N1063 - :O )
  23784. inner elaboration loop at bottom goal.
  23785. --- Change Working Memory (PE) ---
  23786. =>WM: (14958: I3 ^predict-no N1064)
  23787. <=WM: (14945: N1063 ^status complete)
  23788. <=WM: (14944: I3 ^predict-no N1063)
  23789. --- Firing Productions (IE) For State At Depth 1 ---
  23790. --- Inner Elaboration Phase, active level 1 (S1) ---
  23791. Firing monitor*world
  23792. -->
  23793. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23794. --- Change Working Memory (IE) ---
  23795. --- END Application Phase ---
  23796. --- Output Phase ---
  23797. ENV: Agent did: predict-no for direction U in state State-A
  23798. In State-A moving U
  23799. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23800. predict error 0
  23801. dir: dir isL
  23802. --- END Output Phase ---
  23803. \-/--- Input Phase ---
  23804. =>WM: (14962: I2 ^dir L)
  23805. =>WM: (14961: I2 ^reward 1)
  23806. =>WM: (14960: I2 ^see 0)
  23807. =>WM: (14959: N1064 ^status complete)
  23808. <=WM: (14948: I2 ^dir U)
  23809. <=WM: (14947: I2 ^reward 1)
  23810. <=WM: (14946: I2 ^see 0)
  23811. =>WM: (14963: I2 ^level-1 L1-root)
  23812. <=WM: (14949: I2 ^level-1 L1-root)
  23813. --- END Input Phase ---
  23814. --- Proposal Phase ---
  23815. --- Inner Elaboration Phase, active level 1 (S1) ---
  23816. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  23817. -->
  23818. (S1 ^operator O2127 = -0.2062723012911647)
  23819. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  23820. -->
  23821. (S1 ^operator O2128 = 0.6855176931742328)
  23822. Firing prefer*rvt*predict-no*H0*4*H1
  23823. -->
  23824. Firing prefer*rvt*predict-yes*H0*3*H1
  23825. -->
  23826. Firing elaborate*copy-see-to-output-link
  23827. -->
  23828. (I3 ^see 0 +)
  23829. Firing elaborate*reward*based*on*reward
  23830. -->
  23831. (R1068 ^value 1 +)
  23832. (R1 ^reward R1068 +)
  23833. Firing propose*predict-yes
  23834. -->
  23835. (O2129 ^name predict-yes +)
  23836. (S1 ^operator O2129 +)
  23837. Firing propose*predict-no
  23838. -->
  23839. (O2130 ^name predict-no +)
  23840. (S1 ^operator O2130 +)
  23841. Firing rl*prefer*rvt*predict-no*H0*4
  23842. -->
  23843. (S1 ^operator O2128 = 0.3144968546951614)
  23844. Firing rl*prefer*rvt*predict-yes*H0*3
  23845. -->
  23846. (S1 ^operator O2127 = 0.390770856544958)
  23847. Firing prefer*rvt*predict-yes*H0
  23848. -->
  23849. Firing prefer*rvt*predict-no*H0
  23850. -->
  23851. Firing elaborate*copy-dir-to-output-link
  23852. -->
  23853. (I3 ^dir L +)
  23854. inner elaboration loop at bottom goal.
  23855. Retracting elaborate*copy-see-to-output-link
  23856. -->
  23857. (I3 ^see 0 +)
  23858. Retracting propose*predict-no
  23859. -->
  23860. (O2128 ^name predict-no +)
  23861. (S1 ^operator O2128 +)
  23862. Retracting propose*predict-yes
  23863. -->
  23864. (O2127 ^name predict-yes +)
  23865. (S1 ^operator O2127 +)
  23866. Retracting elaborate*reward*based*on*reward
  23867. -->
  23868. (R1067 ^value 1 +)
  23869. (R1 ^reward R1067 +)
  23870. Retracting elaborate*copy-dir-to-output-link
  23871. -->
  23872. (I3 ^dir U +)
  23873. Retracting rl*prefer*rvt*predict-no*H0*2
  23874. -->
  23875. (S1 ^operator O2128 = 1.)
  23876. Retracting rl*prefer*rvt*predict-yes*H0*1
  23877. -->
  23878. (S1 ^operator O2127 = 0.)
  23879. =>WM: (14970: S1 ^operator O2130 +)
  23880. =>WM: (14969: S1 ^operator O2129 +)
  23881. =>WM: (14968: I3 ^dir L)
  23882. =>WM: (14967: O2130 ^name predict-no)
  23883. =>WM: (14966: O2129 ^name predict-yes)
  23884. =>WM: (14965: R1068 ^value 1)
  23885. =>WM: (14964: R1 ^reward R1068)
  23886. <=WM: (14955: S1 ^operator O2127 +)
  23887. <=WM: (14956: S1 ^operator O2128 +)
  23888. <=WM: (14957: S1 ^operator O2128)
  23889. <=WM: (14940: I3 ^dir U)
  23890. <=WM: (14951: R1 ^reward R1067)
  23891. <=WM: (14954: O2128 ^name predict-no)
  23892. <=WM: (14953: O2127 ^name predict-yes)
  23893. <=WM: (14952: R1067 ^value 1)
  23894. --- Inner Elaboration Phase, active level 1 (S1) ---
  23895. Firing prefer*rvt*predict-yes*H0
  23896. -->
  23897. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  23898. -->
  23899. (S1 ^operator O2129 = -0.2062723012911647)
  23900. Firing rl*prefer*rvt*predict-yes*H0*3
  23901. -->
  23902. (S1 ^operator O2129 = 0.390770856544958)
  23903. Firing prefer*rvt*predict-yes*H0*3*H1
  23904. -->
  23905. Firing prefer*rvt*predict-no*H0
  23906. -->
  23907. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  23908. -->
  23909. (S1 ^operator O2130 = 0.6855176931742328)
  23910. Firing rl*prefer*rvt*predict-no*H0*4
  23911. -->
  23912. (S1 ^operator O2130 = 0.3144968546951614)
  23913. Firing prefer*rvt*predict-no*H0*4*H1
  23914. -->
  23915. inner elaboration loop at bottom goal.
  23916. Retracting rl*prefer*rvt*predict-no*H0*4
  23917. -->
  23918. (S1 ^operator O2128 = 0.3144968546951614)
  23919. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  23920. -->
  23921. (S1 ^operator O2128 = 0.6855176931742328)
  23922. Retracting rl*prefer*rvt*predict-yes*H0*3
  23923. -->
  23924. (S1 ^operator O2127 = 0.390770856544958)
  23925. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  23926. -->
  23927. (S1 ^operator O2127 = -0.2062723012911647)
  23928. --- END Proposal Phase ---
  23929. --- Decision Phase ---
  23930. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  23931. =>WM: (14971: S1 ^operator O2130)
  23932. 1065: O: O2130 (predict-no)
  23933. --- END Decision Phase ---
  23934. --- Application Phase ---
  23935. --- Firing Productions (PE) For State At Depth 1 ---
  23936. --- Inner Elaboration Phase, active level 1 (S1) ---
  23937. Firing apply*operator
  23938. -->
  23939. (I3 ^predict-no N1065 + :O )
  23940. Firing apply*operator*complete
  23941. -->
  23942. (I3 ^predict-no N1064 - :O )
  23943. inner elaboration loop at bottom goal.
  23944. --- Change Working Memory (PE) ---
  23945. =>WM: (14972: I3 ^predict-no N1065)
  23946. <=WM: (14959: N1064 ^status complete)
  23947. <=WM: (14958: I3 ^predict-no N1064)
  23948. --- Firing Productions (IE) For State At Depth 1 ---
  23949. --- Inner Elaboration Phase, active level 1 (S1) ---
  23950. Firing monitor*world
  23951. -->
  23952. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  23953. --- Change Working Memory (IE) ---
  23954. --- END Application Phase ---
  23955. --- Output Phase ---
  23956. ENV: Agent did: predict-no for direction L in state State-A
  23957. In State-A moving L
  23958. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  23959. predict error 0
  23960. dir: dir isR
  23961. --- END Output Phase ---
  23962. |\---- Input Phase ---
  23963. =>WM: (14976: I2 ^dir R)
  23964. =>WM: (14975: I2 ^reward 1)
  23965. =>WM: (14974: I2 ^see 0)
  23966. =>WM: (14973: N1065 ^status complete)
  23967. <=WM: (14962: I2 ^dir L)
  23968. <=WM: (14961: I2 ^reward 1)
  23969. <=WM: (14960: I2 ^see 0)
  23970. =>WM: (14977: I2 ^level-1 L0-root)
  23971. <=WM: (14963: I2 ^level-1 L1-root)
  23972. --- END Input Phase ---
  23973. --- Proposal Phase ---
  23974. --- Inner Elaboration Phase, active level 1 (S1) ---
  23975. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  23976. -->
  23977. (S1 ^operator O2129 = 0.8783979318684918)
  23978. Firing prefer*rvt*predict-yes*H0*5*H1
  23979. -->
  23980. Firing elaborate*copy-see-to-output-link
  23981. -->
  23982. (I3 ^see 0 +)
  23983. Firing elaborate*reward*based*on*reward
  23984. -->
  23985. (R1069 ^value 1 +)
  23986. (R1 ^reward R1069 +)
  23987. Firing propose*predict-yes
  23988. -->
  23989. (O2131 ^name predict-yes +)
  23990. (S1 ^operator O2131 +)
  23991. Firing propose*predict-no
  23992. -->
  23993. (O2132 ^name predict-no +)
  23994. (S1 ^operator O2132 +)
  23995. Firing rl*prefer*rvt*predict-no*H0*6
  23996. -->
  23997. (S1 ^operator O2130 = 0.8395129942530221)
  23998. Firing rl*prefer*rvt*predict-yes*H0*5
  23999. -->
  24000. (S1 ^operator O2129 = 0.1215962258870366)
  24001. Firing prefer*rvt*predict-yes*H0
  24002. -->
  24003. Firing prefer*rvt*predict-no*H0
  24004. -->
  24005. Firing elaborate*copy-dir-to-output-link
  24006. -->
  24007. (I3 ^dir R +)
  24008. inner elaboration loop at bottom goal.
  24009. Retracting elaborate*copy-see-to-output-link
  24010. -->
  24011. (I3 ^see 0 +)
  24012. Retracting propose*predict-no
  24013. -->
  24014. (O2130 ^name predict-no +)
  24015. (S1 ^operator O2130 +)
  24016. Retracting propose*predict-yes
  24017. -->
  24018. (O2129 ^name predict-yes +)
  24019. (S1 ^operator O2129 +)
  24020. Retracting elaborate*reward*based*on*reward
  24021. -->
  24022. (R1068 ^value 1 +)
  24023. (R1 ^reward R1068 +)
  24024. Retracting elaborate*copy-dir-to-output-link
  24025. -->
  24026. (I3 ^dir L +)
  24027. Retracting rl*prefer*rvt*predict-no*H0*4
  24028. -->
  24029. (S1 ^operator O2130 = 0.3144968546951614)
  24030. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  24031. -->
  24032. (S1 ^operator O2130 = 0.6855176931742328)
  24033. Retracting rl*prefer*rvt*predict-yes*H0*3
  24034. -->
  24035. (S1 ^operator O2129 = 0.390770856544958)
  24036. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  24037. -->
  24038. (S1 ^operator O2129 = -0.2062723012911647)
  24039. =>WM: (14984: S1 ^operator O2132 +)
  24040. =>WM: (14983: S1 ^operator O2131 +)
  24041. =>WM: (14982: I3 ^dir R)
  24042. =>WM: (14981: O2132 ^name predict-no)
  24043. =>WM: (14980: O2131 ^name predict-yes)
  24044. =>WM: (14979: R1069 ^value 1)
  24045. =>WM: (14978: R1 ^reward R1069)
  24046. <=WM: (14969: S1 ^operator O2129 +)
  24047. <=WM: (14970: S1 ^operator O2130 +)
  24048. <=WM: (14971: S1 ^operator O2130)
  24049. <=WM: (14968: I3 ^dir L)
  24050. <=WM: (14964: R1 ^reward R1068)
  24051. <=WM: (14967: O2130 ^name predict-no)
  24052. <=WM: (14966: O2129 ^name predict-yes)
  24053. <=WM: (14965: R1068 ^value 1)
  24054. --- Inner Elaboration Phase, active level 1 (S1) ---
  24055. Firing prefer*rvt*predict-yes*H0
  24056. -->
  24057. Firing rl*prefer*rvt*predict-yes*H0*5
  24058. -->
  24059. (S1 ^operator O2131 = 0.1215962258870366)
  24060. Firing prefer*rvt*predict-yes*H0*5*H1
  24061. -->
  24062. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  24063. -->
  24064. (S1 ^operator O2131 = 0.8783979318684918)
  24065. Firing prefer*rvt*predict-no*H0
  24066. -->
  24067. Firing rl*prefer*rvt*predict-no*H0*6
  24068. -->
  24069. (S1 ^operator O2132 = 0.8395129942530221)
  24070. inner elaboration loop at bottom goal.
  24071. Retracting rl*prefer*rvt*predict-no*H0*6
  24072. -->
  24073. (S1 ^operator O2130 = 0.8395129942530221)
  24074. Retracting rl*prefer*rvt*predict-yes*H0*5
  24075. -->
  24076. (S1 ^operator O2129 = 0.1215962258870366)
  24077. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  24078. -->
  24079. (S1 ^operator O2129 = 0.8783979318684918)
  24080. --- END Proposal Phase ---
  24081. --- Decision Phase ---
  24082. RL update rl*prefer*rvt*predict-no*H0*4 0.478546 -0.164049 0.314497 -> 0.478545 -0.164049 0.314496(R,m,v=1,0.92638,0.0686208)
  24083. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521467 0.16405 0.685518 -> 0.521466 0.16405 0.685516(R,m,v=1,1,0)
  24084. =>WM: (14985: S1 ^operator O2131)
  24085. 1066: O: O2131 (predict-yes)
  24086. --- END Decision Phase ---
  24087. --- Application Phase ---
  24088. --- Firing Productions (PE) For State At Depth 1 ---
  24089. --- Inner Elaboration Phase, active level 1 (S1) ---
  24090. Firing apply*operator
  24091. -->
  24092. (I3 ^predict-yes N1066 + :O )
  24093. Firing apply*operator*complete
  24094. -->
  24095. (I3 ^predict-no N1065 - :O )
  24096. inner elaboration loop at bottom goal.
  24097. --- Change Working Memory (PE) ---
  24098. =>WM: (14986: I3 ^predict-yes N1066)
  24099. <=WM: (14973: N1065 ^status complete)
  24100. <=WM: (14972: I3 ^predict-no N1065)
  24101. --- Firing Productions (IE) For State At Depth 1 ---
  24102. --- Inner Elaboration Phase, active level 1 (S1) ---
  24103. Firing monitor*world
  24104. -->
  24105. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24106. --- Change Working Memory (IE) ---
  24107. --- END Application Phase ---
  24108. --- Output Phase ---
  24109. ENV: Agent did: predict-yes for direction R in state State-A
  24110. In State-A moving R
  24111. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  24112. predict error 0
  24113. dir: dir isR
  24114. --- END Output Phase ---
  24115. /|--- Input Phase ---
  24116. =>WM: (14990: I2 ^dir R)
  24117. =>WM: (14989: I2 ^reward 1)
  24118. =>WM: (14988: I2 ^see 1)
  24119. =>WM: (14987: N1066 ^status complete)
  24120. <=WM: (14976: I2 ^dir R)
  24121. <=WM: (14975: I2 ^reward 1)
  24122. <=WM: (14974: I2 ^see 0)
  24123. =>WM: (14991: I2 ^level-1 R1-root)
  24124. <=WM: (14977: I2 ^level-1 L0-root)
  24125. --- END Input Phase ---
  24126. --- Proposal Phase ---
  24127. --- Inner Elaboration Phase, active level 1 (S1) ---
  24128. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  24129. -->
  24130. (S1 ^operator O2131 = -0.04253361215288998)
  24131. Firing prefer*rvt*predict-yes*H0*5*H1
  24132. -->
  24133. Firing elaborate*copy-see-to-output-link
  24134. -->
  24135. (I3 ^see 1 +)
  24136. Firing elaborate*reward*based*on*reward
  24137. -->
  24138. (R1070 ^value 1 +)
  24139. (R1 ^reward R1070 +)
  24140. Firing propose*predict-yes
  24141. -->
  24142. (O2133 ^name predict-yes +)
  24143. (S1 ^operator O2133 +)
  24144. Firing propose*predict-no
  24145. -->
  24146. (O2134 ^name predict-no +)
  24147. (S1 ^operator O2134 +)
  24148. Firing rl*prefer*rvt*predict-no*H0*6
  24149. -->
  24150. (S1 ^operator O2132 = 0.8395129942530221)
  24151. Firing rl*prefer*rvt*predict-yes*H0*5
  24152. -->
  24153. (S1 ^operator O2131 = 0.1215962258870366)
  24154. Firing prefer*rvt*predict-yes*H0
  24155. -->
  24156. Firing prefer*rvt*predict-no*H0
  24157. -->
  24158. Firing elaborate*copy-dir-to-output-link
  24159. -->
  24160. (I3 ^dir R +)
  24161. inner elaboration loop at bottom goal.
  24162. Retracting elaborate*copy-see-to-output-link
  24163. -->
  24164. (I3 ^see 0 +)
  24165. Retracting propose*predict-no
  24166. -->
  24167. (O2132 ^name predict-no +)
  24168. (S1 ^operator O2132 +)
  24169. Retracting propose*predict-yes
  24170. -->
  24171. (O2131 ^name predict-yes +)
  24172. (S1 ^operator O2131 +)
  24173. Retracting elaborate*reward*based*on*reward
  24174. -->
  24175. (R1069 ^value 1 +)
  24176. (R1 ^reward R1069 +)
  24177. Retracting elaborate*copy-dir-to-output-link
  24178. -->
  24179. (I3 ^dir R +)
  24180. Retracting rl*prefer*rvt*predict-no*H0*6
  24181. -->
  24182. (S1 ^operator O2132 = 0.8395129942530221)
  24183. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  24184. -->
  24185. (S1 ^operator O2131 = 0.8783979318684918)
  24186. Retracting rl*prefer*rvt*predict-yes*H0*5
  24187. -->
  24188. (S1 ^operator O2131 = 0.1215962258870366)
  24189. =>WM: (14998: S1 ^operator O2134 +)
  24190. =>WM: (14997: S1 ^operator O2133 +)
  24191. =>WM: (14996: O2134 ^name predict-no)
  24192. =>WM: (14995: O2133 ^name predict-yes)
  24193. =>WM: (14994: R1070 ^value 1)
  24194. =>WM: (14993: R1 ^reward R1070)
  24195. =>WM: (14992: I3 ^see 1)
  24196. <=WM: (14983: S1 ^operator O2131 +)
  24197. <=WM: (14985: S1 ^operator O2131)
  24198. <=WM: (14984: S1 ^operator O2132 +)
  24199. <=WM: (14978: R1 ^reward R1069)
  24200. <=WM: (14950: I3 ^see 0)
  24201. <=WM: (14981: O2132 ^name predict-no)
  24202. <=WM: (14980: O2131 ^name predict-yes)
  24203. <=WM: (14979: R1069 ^value 1)
  24204. --- Inner Elaboration Phase, active level 1 (S1) ---
  24205. Firing prefer*rvt*predict-yes*H0
  24206. -->
  24207. Firing rl*prefer*rvt*predict-yes*H0*5
  24208. -->
  24209. (S1 ^operator O2133 = 0.1215962258870366)
  24210. Firing prefer*rvt*predict-yes*H0*5*H1
  24211. -->
  24212. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  24213. -->
  24214. (S1 ^operator O2133 = -0.04253361215288998)
  24215. Firing prefer*rvt*predict-no*H0
  24216. -->
  24217. Firing rl*prefer*rvt*predict-no*H0*6
  24218. -->
  24219. (S1 ^operator O2134 = 0.8395129942530221)
  24220. inner elaboration loop at bottom goal.
  24221. Retracting rl*prefer*rvt*predict-no*H0*6
  24222. -->
  24223. (S1 ^operator O2132 = 0.8395129942530221)
  24224. Retracting rl*prefer*rvt*predict-yes*H0*5
  24225. -->
  24226. (S1 ^operator O2131 = 0.1215962258870366)
  24227. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  24228. -->
  24229. (S1 ^operator O2131 = -0.04253361215288998)
  24230. --- END Proposal Phase ---
  24231. --- Decision Phase ---
  24232. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.873016,0.111449)
  24233. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412925 0.878398 -> 0.465473 0.412926 0.878398(R,m,v=1,1,0)
  24234. =>WM: (14999: S1 ^operator O2134)
  24235. 1067: O: O2134 (predict-no)
  24236. --- END Decision Phase ---
  24237. --- Application Phase ---
  24238. --- Firing Productions (PE) For State At Depth 1 ---
  24239. --- Inner Elaboration Phase, active level 1 (S1) ---
  24240. Firing apply*operator
  24241. -->
  24242. (I3 ^predict-no N1067 + :O )
  24243. Firing apply*operator*complete
  24244. -->
  24245. (I3 ^predict-yes N1066 - :O )
  24246. inner elaboration loop at bottom goal.
  24247. --- Change Working Memory (PE) ---
  24248. =>WM: (15000: I3 ^predict-no N1067)
  24249. <=WM: (14987: N1066 ^status complete)
  24250. <=WM: (14986: I3 ^predict-yes N1066)
  24251. --- Firing Productions (IE) For State At Depth 1 ---
  24252. --- Inner Elaboration Phase, active level 1 (S1) ---
  24253. Firing monitor*world
  24254. -->
  24255. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24256. --- Change Working Memory (IE) ---
  24257. --- END Application Phase ---
  24258. --- Output Phase ---
  24259. ENV: Agent did: predict-no for direction R in state State-B
  24260. In State-B moving R
  24261. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24262. predict error 0
  24263. dir: dir isU
  24264. --- END Output Phase ---
  24265. \-/--- Input Phase ---
  24266. =>WM: (15004: I2 ^dir U)
  24267. =>WM: (15003: I2 ^reward 1)
  24268. =>WM: (15002: I2 ^see 0)
  24269. =>WM: (15001: N1067 ^status complete)
  24270. <=WM: (14990: I2 ^dir R)
  24271. <=WM: (14989: I2 ^reward 1)
  24272. <=WM: (14988: I2 ^see 1)
  24273. =>WM: (15005: I2 ^level-1 R0-root)
  24274. <=WM: (14991: I2 ^level-1 R1-root)
  24275. --- END Input Phase ---
  24276. --- Proposal Phase ---
  24277. --- Inner Elaboration Phase, active level 1 (S1) ---
  24278. Firing elaborate*copy-see-to-output-link
  24279. -->
  24280. (I3 ^see 0 +)
  24281. Firing elaborate*reward*based*on*reward
  24282. -->
  24283. (R1071 ^value 1 +)
  24284. (R1 ^reward R1071 +)
  24285. Firing propose*predict-yes
  24286. -->
  24287. (O2135 ^name predict-yes +)
  24288. (S1 ^operator O2135 +)
  24289. Firing propose*predict-no
  24290. -->
  24291. (O2136 ^name predict-no +)
  24292. (S1 ^operator O2136 +)
  24293. Firing rl*prefer*rvt*predict-no*H0*2
  24294. -->
  24295. (S1 ^operator O2134 = 1.)
  24296. Firing rl*prefer*rvt*predict-yes*H0*1
  24297. -->
  24298. (S1 ^operator O2133 = 0.)
  24299. Firing prefer*rvt*predict-yes*H0
  24300. -->
  24301. Firing prefer*rvt*predict-no*H0
  24302. -->
  24303. Firing elaborate*copy-dir-to-output-link
  24304. -->
  24305. (I3 ^dir U +)
  24306. inner elaboration loop at bottom goal.
  24307. Retracting elaborate*copy-see-to-output-link
  24308. -->
  24309. (I3 ^see 1 +)
  24310. Retracting propose*predict-no
  24311. -->
  24312. (O2134 ^name predict-no +)
  24313. (S1 ^operator O2134 +)
  24314. Retracting propose*predict-yes
  24315. -->
  24316. (O2133 ^name predict-yes +)
  24317. (S1 ^operator O2133 +)
  24318. Retracting elaborate*reward*based*on*reward
  24319. -->
  24320. (R1070 ^value 1 +)
  24321. (R1 ^reward R1070 +)
  24322. Retracting elaborate*copy-dir-to-output-link
  24323. -->
  24324. (I3 ^dir R +)
  24325. Retracting rl*prefer*rvt*predict-no*H0*6
  24326. -->
  24327. (S1 ^operator O2134 = 0.8395129942530221)
  24328. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  24329. -->
  24330. (S1 ^operator O2133 = -0.04253361215288998)
  24331. Retracting rl*prefer*rvt*predict-yes*H0*5
  24332. -->
  24333. (S1 ^operator O2133 = 0.1215966938845745)
  24334. =>WM: (15013: S1 ^operator O2136 +)
  24335. =>WM: (15012: S1 ^operator O2135 +)
  24336. =>WM: (15011: I3 ^dir U)
  24337. =>WM: (15010: O2136 ^name predict-no)
  24338. =>WM: (15009: O2135 ^name predict-yes)
  24339. =>WM: (15008: R1071 ^value 1)
  24340. =>WM: (15007: R1 ^reward R1071)
  24341. =>WM: (15006: I3 ^see 0)
  24342. <=WM: (14997: S1 ^operator O2133 +)
  24343. <=WM: (14998: S1 ^operator O2134 +)
  24344. <=WM: (14999: S1 ^operator O2134)
  24345. <=WM: (14982: I3 ^dir R)
  24346. <=WM: (14993: R1 ^reward R1070)
  24347. <=WM: (14992: I3 ^see 1)
  24348. <=WM: (14996: O2134 ^name predict-no)
  24349. <=WM: (14995: O2133 ^name predict-yes)
  24350. <=WM: (14994: R1070 ^value 1)
  24351. --- Inner Elaboration Phase, active level 1 (S1) ---
  24352. Firing prefer*rvt*predict-yes*H0
  24353. -->
  24354. Firing rl*prefer*rvt*predict-yes*H0*1
  24355. -->
  24356. (S1 ^operator O2135 = 0.)
  24357. Firing prefer*rvt*predict-no*H0
  24358. -->
  24359. Firing rl*prefer*rvt*predict-no*H0*2
  24360. -->
  24361. (S1 ^operator O2136 = 1.)
  24362. inner elaboration loop at bottom goal.
  24363. Retracting rl*prefer*rvt*predict-no*H0*2
  24364. -->
  24365. (S1 ^operator O2134 = 1.)
  24366. Retracting rl*prefer*rvt*predict-yes*H0*1
  24367. -->
  24368. (S1 ^operator O2133 = 0.)
  24369. --- END Proposal Phase ---
  24370. --- Decision Phase ---
  24371. RL update rl*prefer*rvt*predict-no*H0*6 0.839513 0 0.839513 -> 0.865247 0 0.865247(R,m,v=1,0.93617,0.0600751)
  24372. =>WM: (15014: S1 ^operator O2136)
  24373. 1068: O: O2136 (predict-no)
  24374. --- END Decision Phase ---
  24375. --- Application Phase ---
  24376. --- Firing Productions (PE) For State At Depth 1 ---
  24377. --- Inner Elaboration Phase, active level 1 (S1) ---
  24378. Firing apply*operator
  24379. -->
  24380. (I3 ^predict-no N1068 + :O )
  24381. Firing apply*operator*complete
  24382. -->
  24383. (I3 ^predict-no N1067 - :O )
  24384. inner elaboration loop at bottom goal.
  24385. --- Change Working Memory (PE) ---
  24386. =>WM: (15015: I3 ^predict-no N1068)
  24387. <=WM: (15001: N1067 ^status complete)
  24388. <=WM: (15000: I3 ^predict-no N1067)
  24389. --- Firing Productions (IE) For State At Depth 1 ---
  24390. --- Inner Elaboration Phase, active level 1 (S1) ---
  24391. Firing monitor*world
  24392. -->
  24393. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24394. --- Change Working Memory (IE) ---
  24395. --- END Application Phase ---
  24396. --- Output Phase ---
  24397. ENV: Agent did: predict-no for direction U in state State-B
  24398. In State-B moving U
  24399. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24400. predict error 0
  24401. dir: dir isR
  24402. --- END Output Phase ---
  24403. |\--- Input Phase ---
  24404. =>WM: (15019: I2 ^dir R)
  24405. =>WM: (15018: I2 ^reward 1)
  24406. =>WM: (15017: I2 ^see 0)
  24407. =>WM: (15016: N1068 ^status complete)
  24408. <=WM: (15004: I2 ^dir U)
  24409. <=WM: (15003: I2 ^reward 1)
  24410. <=WM: (15002: I2 ^see 0)
  24411. =>WM: (15020: I2 ^level-1 R0-root)
  24412. <=WM: (15005: I2 ^level-1 R0-root)
  24413. --- END Input Phase ---
  24414. --- Proposal Phase ---
  24415. --- Inner Elaboration Phase, active level 1 (S1) ---
  24416. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  24417. -->
  24418. (S1 ^operator O2135 = -0.1512366769350551)
  24419. Firing prefer*rvt*predict-yes*H0*5*H1
  24420. -->
  24421. Firing elaborate*copy-see-to-output-link
  24422. -->
  24423. (I3 ^see 0 +)
  24424. Firing elaborate*reward*based*on*reward
  24425. -->
  24426. (R1072 ^value 1 +)
  24427. (R1 ^reward R1072 +)
  24428. Firing propose*predict-yes
  24429. -->
  24430. (O2137 ^name predict-yes +)
  24431. (S1 ^operator O2137 +)
  24432. Firing propose*predict-no
  24433. -->
  24434. (O2138 ^name predict-no +)
  24435. (S1 ^operator O2138 +)
  24436. Firing rl*prefer*rvt*predict-no*H0*6
  24437. -->
  24438. (S1 ^operator O2136 = 0.8652467390234381)
  24439. Firing rl*prefer*rvt*predict-yes*H0*5
  24440. -->
  24441. (S1 ^operator O2135 = 0.1215966938845745)
  24442. Firing prefer*rvt*predict-yes*H0
  24443. -->
  24444. Firing prefer*rvt*predict-no*H0
  24445. -->
  24446. Firing elaborate*copy-dir-to-output-link
  24447. -->
  24448. (I3 ^dir R +)
  24449. inner elaboration loop at bottom goal.
  24450. Retracting elaborate*copy-see-to-output-link
  24451. -->
  24452. (I3 ^see 0 +)
  24453. Retracting propose*predict-no
  24454. -->
  24455. (O2136 ^name predict-no +)
  24456. (S1 ^operator O2136 +)
  24457. Retracting propose*predict-yes
  24458. -->
  24459. (O2135 ^name predict-yes +)
  24460. (S1 ^operator O2135 +)
  24461. Retracting elaborate*reward*based*on*reward
  24462. -->
  24463. (R1071 ^value 1 +)
  24464. (R1 ^reward R1071 +)
  24465. Retracting elaborate*copy-dir-to-output-link
  24466. -->
  24467. (I3 ^dir U +)
  24468. Retracting rl*prefer*rvt*predict-no*H0*2
  24469. -->
  24470. (S1 ^operator O2136 = 1.)
  24471. Retracting rl*prefer*rvt*predict-yes*H0*1
  24472. -->
  24473. (S1 ^operator O2135 = 0.)
  24474. =>WM: (15027: S1 ^operator O2138 +)
  24475. =>WM: (15026: S1 ^operator O2137 +)
  24476. =>WM: (15025: I3 ^dir R)
  24477. =>WM: (15024: O2138 ^name predict-no)
  24478. =>WM: (15023: O2137 ^name predict-yes)
  24479. =>WM: (15022: R1072 ^value 1)
  24480. =>WM: (15021: R1 ^reward R1072)
  24481. <=WM: (15012: S1 ^operator O2135 +)
  24482. <=WM: (15013: S1 ^operator O2136 +)
  24483. <=WM: (15014: S1 ^operator O2136)
  24484. <=WM: (15011: I3 ^dir U)
  24485. <=WM: (15007: R1 ^reward R1071)
  24486. <=WM: (15010: O2136 ^name predict-no)
  24487. <=WM: (15009: O2135 ^name predict-yes)
  24488. <=WM: (15008: R1071 ^value 1)
  24489. --- Inner Elaboration Phase, active level 1 (S1) ---
  24490. Firing prefer*rvt*predict-yes*H0
  24491. -->
  24492. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  24493. -->
  24494. (S1 ^operator O2137 = -0.1512366769350551)
  24495. Firing rl*prefer*rvt*predict-yes*H0*5
  24496. -->
  24497. (S1 ^operator O2137 = 0.1215966938845745)
  24498. Firing prefer*rvt*predict-yes*H0*5*H1
  24499. -->
  24500. Firing prefer*rvt*predict-no*H0
  24501. -->
  24502. Firing rl*prefer*rvt*predict-no*H0*6
  24503. -->
  24504. (S1 ^operator O2138 = 0.8652467390234381)
  24505. inner elaboration loop at bottom goal.
  24506. Retracting rl*prefer*rvt*predict-no*H0*6
  24507. -->
  24508. (S1 ^operator O2136 = 0.8652467390234381)
  24509. Retracting rl*prefer*rvt*predict-yes*H0*5
  24510. -->
  24511. (S1 ^operator O2135 = 0.1215966938845745)
  24512. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  24513. -->
  24514. (S1 ^operator O2135 = -0.1512366769350551)
  24515. --- END Proposal Phase ---
  24516. --- Decision Phase ---
  24517. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  24518. =>WM: (15028: S1 ^operator O2138)
  24519. 1069: O: O2138 (predict-no)
  24520. --- END Decision Phase ---
  24521. --- Application Phase ---
  24522. --- Firing Productions (PE) For State At Depth 1 ---
  24523. --- Inner Elaboration Phase, active level 1 (S1) ---
  24524. Firing apply*operator
  24525. -->
  24526. (I3 ^predict-no N1069 + :O )
  24527. Firing apply*operator*complete
  24528. -->
  24529. (I3 ^predict-no N1068 - :O )
  24530. inner elaboration loop at bottom goal.
  24531. --- Change Working Memory (PE) ---
  24532. =>WM: (15029: I3 ^predict-no N1069)
  24533. <=WM: (15016: N1068 ^status complete)
  24534. <=WM: (15015: I3 ^predict-no N1068)
  24535. --- Firing Productions (IE) For State At Depth 1 ---
  24536. --- Inner Elaboration Phase, active level 1 (S1) ---
  24537. Firing monitor*world
  24538. -->
  24539. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  24540. --- Change Working Memory (IE) ---
  24541. --- END Application Phase ---
  24542. --- Output Phase ---
  24543. ENV: Agent did: predict-no for direction R in state State-B
  24544. In State-B moving R
  24545. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  24546. predict error 0
  24547. dir: dir isL
  24548. --- END Output Phase ---
  24549. -/|\--- Input Phase ---
  24550. =>WM: (15033: I2 ^dir L)
  24551. =>WM: (15032: I2 ^reward 1)
  24552. =>WM: (15031: I2 ^see 0)
  24553. =>WM: (15030: N1069 ^status complete)
  24554. <=WM: (15019: I2 ^dir R)
  24555. <=WM: (15018: I2 ^reward 1)
  24556. <=WM: (15017: I2 ^see 0)
  24557. =>WM: (15034: I2 ^level-1 R0-root)
  24558. <=WM: (15020: I2 ^level-1 R0-root)
  24559. --- END Input Phase ---
  24560. --- Proposal Phase ---
  24561. --- Inner Elaboration Phase, active level 1 (S1) ---
  24562. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  24563. -->
  24564. (S1 ^operator O2138 = -0.1984300550322165)
  24565. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  24566. -->
  24567. (S1 ^operator O2137 = 0.6091533345297356)
  24568. Firing prefer*rvt*predict-no*H0*4*H1
  24569. -->
  24570. Firing prefer*rvt*predict-yes*H0*3*H1
  24571. -->
  24572. Firing elaborate*copy-see-to-output-link
  24573. -->
  24574. (I3 ^see 0 +)
  24575. Firing elaborate*reward*based*on*reward
  24576. -->
  24577. (R1073 ^value 1 +)
  24578. (R1 ^reward R1073 +)
  24579. Firing propose*predict-yes
  24580. -->
  24581. (O2139 ^name predict-yes +)
  24582. (S1 ^operator O2139 +)
  24583. Firing propose*predict-no
  24584. -->
  24585. (O2140 ^name predict-no +)
  24586. (S1 ^operator O2140 +)
  24587. Firing rl*prefer*rvt*predict-no*H0*4
  24588. -->
  24589. (S1 ^operator O2138 = 0.3144956610238658)
  24590. Firing rl*prefer*rvt*predict-yes*H0*3
  24591. -->
  24592. (S1 ^operator O2137 = 0.390770856544958)
  24593. Firing prefer*rvt*predict-yes*H0
  24594. -->
  24595. Firing prefer*rvt*predict-no*H0
  24596. -->
  24597. Firing elaborate*copy-dir-to-output-link
  24598. -->
  24599. (I3 ^dir L +)
  24600. inner elaboration loop at bottom goal.
  24601. Retracting elaborate*copy-see-to-output-link
  24602. -->
  24603. (I3 ^see 0 +)
  24604. Retracting propose*predict-no
  24605. -->
  24606. (O2138 ^name predict-no +)
  24607. (S1 ^operator O2138 +)
  24608. Retracting propose*predict-yes
  24609. -->
  24610. (O2137 ^name predict-yes +)
  24611. (S1 ^operator O2137 +)
  24612. Retracting elaborate*reward*based*on*reward
  24613. -->
  24614. (R1072 ^value 1 +)
  24615. (R1 ^reward R1072 +)
  24616. Retracting elaborate*copy-dir-to-output-link
  24617. -->
  24618. (I3 ^dir R +)
  24619. Retracting rl*prefer*rvt*predict-no*H0*6
  24620. -->
  24621. (S1 ^operator O2138 = 0.8652467390234381)
  24622. Retracting rl*prefer*rvt*predict-yes*H0*5
  24623. -->
  24624. (S1 ^operator O2137 = 0.1215966938845745)
  24625. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  24626. -->
  24627. (S1 ^operator O2137 = -0.1512366769350551)
  24628. =>WM: (15041: S1 ^operator O2140 +)
  24629. =>WM: (15040: S1 ^operator O2139 +)
  24630. =>WM: (15039: I3 ^dir L)
  24631. =>WM: (15038: O2140 ^name predict-no)
  24632. =>WM: (15037: O2139 ^name predict-yes)
  24633. =>WM: (15036: R1073 ^value 1)
  24634. =>WM: (15035: R1 ^reward R1073)
  24635. <=WM: (15026: S1 ^operator O2137 +)
  24636. <=WM: (15027: S1 ^operator O2138 +)
  24637. <=WM: (15028: S1 ^operator O2138)
  24638. <=WM: (15025: I3 ^dir R)
  24639. <=WM: (15021: R1 ^reward R1072)
  24640. <=WM: (15024: O2138 ^name predict-no)
  24641. <=WM: (15023: O2137 ^name predict-yes)
  24642. <=WM: (15022: R1072 ^value 1)
  24643. --- Inner Elaboration Phase, active level 1 (S1) ---
  24644. Firing prefer*rvt*predict-yes*H0
  24645. -->
  24646. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  24647. -->
  24648. (S1 ^operator O2139 = 0.6091533345297356)
  24649. Firing rl*prefer*rvt*predict-yes*H0*3
  24650. -->
  24651. (S1 ^operator O2139 = 0.390770856544958)
  24652. Firing prefer*rvt*predict-yes*H0*3*H1
  24653. -->
  24654. Firing prefer*rvt*predict-no*H0
  24655. -->
  24656. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  24657. -->
  24658. (S1 ^operator O2140 = -0.1984300550322165)
  24659. Firing rl*prefer*rvt*predict-no*H0*4
  24660. -->
  24661. (S1 ^operator O2140 = 0.3144956610238658)
  24662. Firing prefer*rvt*predict-no*H0*4*H1
  24663. -->
  24664. inner elaboration loop at bottom goal.
  24665. Retracting rl*prefer*rvt*predict-no*H0*4
  24666. -->
  24667. (S1 ^operator O2138 = 0.3144956610238658)
  24668. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  24669. -->
  24670. (S1 ^operator O2138 = -0.1984300550322165)
  24671. Retracting rl*prefer*rvt*predict-yes*H0*3
  24672. -->
  24673. (S1 ^operator O2137 = 0.390770856544958)
  24674. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  24675. -->
  24676. (S1 ^operator O2137 = 0.6091533345297356)
  24677. --- END Proposal Phase ---
  24678. --- Decision Phase ---
  24679. RL update rl*prefer*rvt*predict-no*H0*6 0.865247 0 0.865247 -> 0.886836 0 0.886836(R,m,v=1,0.936508,0.0597771)
  24680. =>WM: (15042: S1 ^operator O2139)
  24681. 1070: O: O2139 (predict-yes)
  24682. --- END Decision Phase ---
  24683. --- Application Phase ---
  24684. --- Firing Productions (PE) For State At Depth 1 ---
  24685. --- Inner Elaboration Phase, active level 1 (S1) ---
  24686. Firing apply*operator
  24687. -->
  24688. (I3 ^predict-yes N1070 + :O )
  24689. Firing apply*operator*complete
  24690. -->
  24691. (I3 ^predict-no N1069 - :O )
  24692. inner elaboration loop at bottom goal.
  24693. --- Change Working Memory (PE) ---
  24694. =>WM: (15043: I3 ^predict-yes N1070)
  24695. <=WM: (15030: N1069 ^status complete)
  24696. <=WM: (15029: I3 ^predict-no N1069)
  24697. --- Firing Productions (IE) For State At Depth 1 ---
  24698. --- Inner Elaboration Phase, active level 1 (S1) ---
  24699. Firing monitor*world
  24700. -->
  24701. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24702. --- Change Working Memory (IE) ---
  24703. --- END Application Phase ---
  24704. --- Output Phase ---
  24705. ENV: Agent did: predict-yes for direction L in state State-B
  24706. In State-B moving L
  24707. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  24708. predict error 0
  24709. dir: dir isR
  24710. --- END Output Phase ---
  24711. -/--- Input Phase ---
  24712. =>WM: (15047: I2 ^dir R)
  24713. =>WM: (15046: I2 ^reward 1)
  24714. =>WM: (15045: I2 ^see 1)
  24715. =>WM: (15044: N1070 ^status complete)
  24716. <=WM: (15033: I2 ^dir L)
  24717. <=WM: (15032: I2 ^reward 1)
  24718. <=WM: (15031: I2 ^see 0)
  24719. =>WM: (15048: I2 ^level-1 L1-root)
  24720. <=WM: (15034: I2 ^level-1 R0-root)
  24721. --- END Input Phase ---
  24722. --- Proposal Phase ---
  24723. --- Inner Elaboration Phase, active level 1 (S1) ---
  24724. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  24725. -->
  24726. (S1 ^operator O2139 = 0.8784081974205705)
  24727. Firing prefer*rvt*predict-yes*H0*5*H1
  24728. -->
  24729. Firing elaborate*copy-see-to-output-link
  24730. -->
  24731. (I3 ^see 1 +)
  24732. Firing elaborate*reward*based*on*reward
  24733. -->
  24734. (R1074 ^value 1 +)
  24735. (R1 ^reward R1074 +)
  24736. Firing propose*predict-yes
  24737. -->
  24738. (O2141 ^name predict-yes +)
  24739. (S1 ^operator O2141 +)
  24740. Firing propose*predict-no
  24741. -->
  24742. (O2142 ^name predict-no +)
  24743. (S1 ^operator O2142 +)
  24744. Firing rl*prefer*rvt*predict-no*H0*6
  24745. -->
  24746. (S1 ^operator O2140 = 0.886835768609456)
  24747. Firing rl*prefer*rvt*predict-yes*H0*5
  24748. -->
  24749. (S1 ^operator O2139 = 0.1215966938845745)
  24750. Firing prefer*rvt*predict-yes*H0
  24751. -->
  24752. Firing prefer*rvt*predict-no*H0
  24753. -->
  24754. Firing elaborate*copy-dir-to-output-link
  24755. -->
  24756. (I3 ^dir R +)
  24757. inner elaboration loop at bottom goal.
  24758. Retracting elaborate*copy-see-to-output-link
  24759. -->
  24760. (I3 ^see 0 +)
  24761. Retracting propose*predict-no
  24762. -->
  24763. (O2140 ^name predict-no +)
  24764. (S1 ^operator O2140 +)
  24765. Retracting propose*predict-yes
  24766. -->
  24767. (O2139 ^name predict-yes +)
  24768. (S1 ^operator O2139 +)
  24769. Retracting elaborate*reward*based*on*reward
  24770. -->
  24771. (R1073 ^value 1 +)
  24772. (R1 ^reward R1073 +)
  24773. Retracting elaborate*copy-dir-to-output-link
  24774. -->
  24775. (I3 ^dir L +)
  24776. Retracting rl*prefer*rvt*predict-no*H0*4
  24777. -->
  24778. (S1 ^operator O2140 = 0.3144956610238658)
  24779. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  24780. -->
  24781. (S1 ^operator O2140 = -0.1984300550322165)
  24782. Retracting rl*prefer*rvt*predict-yes*H0*3
  24783. -->
  24784. (S1 ^operator O2139 = 0.390770856544958)
  24785. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  24786. -->
  24787. (S1 ^operator O2139 = 0.6091533345297356)
  24788. =>WM: (15056: S1 ^operator O2142 +)
  24789. =>WM: (15055: S1 ^operator O2141 +)
  24790. =>WM: (15054: I3 ^dir R)
  24791. =>WM: (15053: O2142 ^name predict-no)
  24792. =>WM: (15052: O2141 ^name predict-yes)
  24793. =>WM: (15051: R1074 ^value 1)
  24794. =>WM: (15050: R1 ^reward R1074)
  24795. =>WM: (15049: I3 ^see 1)
  24796. <=WM: (15040: S1 ^operator O2139 +)
  24797. <=WM: (15042: S1 ^operator O2139)
  24798. <=WM: (15041: S1 ^operator O2140 +)
  24799. <=WM: (15039: I3 ^dir L)
  24800. <=WM: (15035: R1 ^reward R1073)
  24801. <=WM: (15006: I3 ^see 0)
  24802. <=WM: (15038: O2140 ^name predict-no)
  24803. <=WM: (15037: O2139 ^name predict-yes)
  24804. <=WM: (15036: R1073 ^value 1)
  24805. --- Inner Elaboration Phase, active level 1 (S1) ---
  24806. Firing prefer*rvt*predict-yes*H0
  24807. -->
  24808. Firing rl*prefer*rvt*predict-yes*H0*5
  24809. -->
  24810. (S1 ^operator O2141 = 0.1215966938845745)
  24811. Firing prefer*rvt*predict-yes*H0*5*H1
  24812. -->
  24813. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  24814. -->
  24815. (S1 ^operator O2141 = 0.8784081974205705)
  24816. Firing prefer*rvt*predict-no*H0
  24817. -->
  24818. Firing rl*prefer*rvt*predict-no*H0*6
  24819. -->
  24820. (S1 ^operator O2142 = 0.886835768609456)
  24821. inner elaboration loop at bottom goal.
  24822. Retracting rl*prefer*rvt*predict-no*H0*6
  24823. -->
  24824. (S1 ^operator O2140 = 0.886835768609456)
  24825. Retracting rl*prefer*rvt*predict-yes*H0*5
  24826. -->
  24827. (S1 ^operator O2139 = 0.1215966938845745)
  24828. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  24829. -->
  24830. (S1 ^operator O2139 = 0.8784081974205705)
  24831. --- END Proposal Phase ---
  24832. --- Decision Phase ---
  24833. RL update rl*prefer*rvt*predict-yes*H0*3 0.472318 -0.0815469 0.390771 -> 0.472323 -0.081546 0.390777(R,m,v=1,0.948276,0.0493323)
  24834. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527618 0.0815357 0.609153 -> 0.527624 0.0815368 0.60916(R,m,v=1,1,0)
  24835. =>WM: (15057: S1 ^operator O2141)
  24836. 1071: O: O2141 (predict-yes)
  24837. --- END Decision Phase ---
  24838. --- Application Phase ---
  24839. --- Firing Productions (PE) For State At Depth 1 ---
  24840. --- Inner Elaboration Phase, active level 1 (S1) ---
  24841. Firing apply*operator
  24842. -->
  24843. (I3 ^predict-yes N1071 + :O )
  24844. Firing apply*operator*complete
  24845. -->
  24846. (I3 ^predict-yes N1070 - :O )
  24847. inner elaboration loop at bottom goal.
  24848. --- Change Working Memory (PE) ---
  24849. =>WM: (15058: I3 ^predict-yes N1071)
  24850. <=WM: (15044: N1070 ^status complete)
  24851. <=WM: (15043: I3 ^predict-yes N1070)
  24852. --- Firing Productions (IE) For State At Depth 1 ---
  24853. --- Inner Elaboration Phase, active level 1 (S1) ---
  24854. Firing monitor*world
  24855. -->
  24856. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  24857. --- Change Working Memory (IE) ---
  24858. --- END Application Phase ---
  24859. --- Output Phase ---
  24860. ENV: Agent did: predict-yes for direction R in state State-A
  24861. In State-A moving R
  24862. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  24863. predict error 0
  24864. dir: dir isL
  24865. --- END Output Phase ---
  24866. |--- Input Phase ---
  24867. =>WM: (15062: I2 ^dir L)
  24868. =>WM: (15061: I2 ^reward 1)
  24869. =>WM: (15060: I2 ^see 1)
  24870. =>WM: (15059: N1071 ^status complete)
  24871. <=WM: (15047: I2 ^dir R)
  24872. <=WM: (15046: I2 ^reward 1)
  24873. <=WM: (15045: I2 ^see 1)
  24874. =>WM: (15063: I2 ^level-1 R1-root)
  24875. <=WM: (15048: I2 ^level-1 L1-root)
  24876. --- END Input Phase ---
  24877. --- Proposal Phase ---
  24878. --- Inner Elaboration Phase, active level 1 (S1) ---
  24879. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  24880. -->
  24881. (S1 ^operator O2142 = -0.168718511744511)
  24882. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  24883. -->
  24884. (S1 ^operator O2141 = 0.6092621009042343)
  24885. Firing prefer*rvt*predict-no*H0*4*H1
  24886. -->
  24887. Firing prefer*rvt*predict-yes*H0*3*H1
  24888. -->
  24889. Firing elaborate*copy-see-to-output-link
  24890. -->
  24891. (I3 ^see 1 +)
  24892. Firing elaborate*reward*based*on*reward
  24893. -->
  24894. (R1075 ^value 1 +)
  24895. (R1 ^reward R1075 +)
  24896. Firing propose*predict-yes
  24897. -->
  24898. (O2143 ^name predict-yes +)
  24899. (S1 ^operator O2143 +)
  24900. Firing propose*predict-no
  24901. -->
  24902. (O2144 ^name predict-no +)
  24903. (S1 ^operator O2144 +)
  24904. Firing rl*prefer*rvt*predict-no*H0*4
  24905. -->
  24906. (S1 ^operator O2142 = 0.3144956610238658)
  24907. Firing rl*prefer*rvt*predict-yes*H0*3
  24908. -->
  24909. (S1 ^operator O2141 = 0.3907770108106386)
  24910. Firing prefer*rvt*predict-yes*H0
  24911. -->
  24912. Firing prefer*rvt*predict-no*H0
  24913. -->
  24914. Firing elaborate*copy-dir-to-output-link
  24915. -->
  24916. (I3 ^dir L +)
  24917. inner elaboration loop at bottom goal.
  24918. Retracting elaborate*copy-see-to-output-link
  24919. -->
  24920. (I3 ^see 1 +)
  24921. Retracting propose*predict-no
  24922. -->
  24923. (O2142 ^name predict-no +)
  24924. (S1 ^operator O2142 +)
  24925. Retracting propose*predict-yes
  24926. -->
  24927. (O2141 ^name predict-yes +)
  24928. (S1 ^operator O2141 +)
  24929. Retracting elaborate*reward*based*on*reward
  24930. -->
  24931. (R1074 ^value 1 +)
  24932. (R1 ^reward R1074 +)
  24933. Retracting elaborate*copy-dir-to-output-link
  24934. -->
  24935. (I3 ^dir R +)
  24936. Retracting rl*prefer*rvt*predict-no*H0*6
  24937. -->
  24938. (S1 ^operator O2142 = 0.886835768609456)
  24939. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  24940. -->
  24941. (S1 ^operator O2141 = 0.8784081974205705)
  24942. Retracting rl*prefer*rvt*predict-yes*H0*5
  24943. -->
  24944. (S1 ^operator O2141 = 0.1215966938845745)
  24945. =>WM: (15070: S1 ^operator O2144 +)
  24946. =>WM: (15069: S1 ^operator O2143 +)
  24947. =>WM: (15068: I3 ^dir L)
  24948. =>WM: (15067: O2144 ^name predict-no)
  24949. =>WM: (15066: O2143 ^name predict-yes)
  24950. =>WM: (15065: R1075 ^value 1)
  24951. =>WM: (15064: R1 ^reward R1075)
  24952. <=WM: (15055: S1 ^operator O2141 +)
  24953. <=WM: (15057: S1 ^operator O2141)
  24954. <=WM: (15056: S1 ^operator O2142 +)
  24955. <=WM: (15054: I3 ^dir R)
  24956. <=WM: (15050: R1 ^reward R1074)
  24957. <=WM: (15053: O2142 ^name predict-no)
  24958. <=WM: (15052: O2141 ^name predict-yes)
  24959. <=WM: (15051: R1074 ^value 1)
  24960. --- Inner Elaboration Phase, active level 1 (S1) ---
  24961. Firing prefer*rvt*predict-yes*H0
  24962. -->
  24963. Firing rl*prefer*rvt*predict-yes*H0*3
  24964. -->
  24965. (S1 ^operator O2143 = 0.3907770108106386)
  24966. Firing prefer*rvt*predict-yes*H0*3*H1
  24967. -->
  24968. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  24969. -->
  24970. (S1 ^operator O2143 = 0.6092621009042343)
  24971. Firing prefer*rvt*predict-no*H0
  24972. -->
  24973. Firing rl*prefer*rvt*predict-no*H0*4
  24974. -->
  24975. (S1 ^operator O2144 = 0.3144956610238658)
  24976. Firing prefer*rvt*predict-no*H0*4*H1
  24977. -->
  24978. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  24979. -->
  24980. (S1 ^operator O2144 = -0.168718511744511)
  24981. inner elaboration loop at bottom goal.
  24982. Retracting rl*prefer*rvt*predict-no*H0*4
  24983. -->
  24984. (S1 ^operator O2142 = 0.3144956610238658)
  24985. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  24986. -->
  24987. (S1 ^operator O2142 = -0.168718511744511)
  24988. Retracting rl*prefer*rvt*predict-yes*H0*3
  24989. -->
  24990. (S1 ^operator O2141 = 0.3907770108106386)
  24991. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  24992. -->
  24993. (S1 ^operator O2141 = 0.6092621009042343)
  24994. --- END Proposal Phase ---
  24995. --- Decision Phase ---
  24996. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.873684,0.110944)
  24997. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878408 -> 0.465481 0.412927 0.878408(R,m,v=1,1,0)
  24998. =>WM: (15071: S1 ^operator O2143)
  24999. 1072: O: O2143 (predict-yes)
  25000. --- END Decision Phase ---
  25001. --- Application Phase ---
  25002. --- Firing Productions (PE) For State At Depth 1 ---
  25003. --- Inner Elaboration Phase, active level 1 (S1) ---
  25004. Firing apply*operator
  25005. -->
  25006. (I3 ^predict-yes N1072 + :O )
  25007. Firing apply*operator*complete
  25008. -->
  25009. (I3 ^predict-yes N1071 - :O )
  25010. inner elaboration loop at bottom goal.
  25011. --- Change Working Memory (PE) ---
  25012. =>WM: (15072: I3 ^predict-yes N1072)
  25013. <=WM: (15059: N1071 ^status complete)
  25014. <=WM: (15058: I3 ^predict-yes N1071)
  25015. --- Firing Productions (IE) For State At Depth 1 ---
  25016. --- Inner Elaboration Phase, active level 1 (S1) ---
  25017. Firing monitor*world
  25018. -->
  25019. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25020. --- Change Working Memory (IE) ---
  25021. --- END Application Phase ---
  25022. --- Output Phase ---
  25023. ENV: Agent did: predict-yes for direction L in state State-B
  25024. In State-B moving L
  25025. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25026. predict error 0
  25027. dir: dir isR
  25028. --- END Output Phase ---
  25029. \---- Input Phase ---
  25030. =>WM: (15076: I2 ^dir R)
  25031. =>WM: (15075: I2 ^reward 1)
  25032. =>WM: (15074: I2 ^see 1)
  25033. =>WM: (15073: N1072 ^status complete)
  25034. <=WM: (15062: I2 ^dir L)
  25035. <=WM: (15061: I2 ^reward 1)
  25036. <=WM: (15060: I2 ^see 1)
  25037. =>WM: (15077: I2 ^level-1 L1-root)
  25038. <=WM: (15063: I2 ^level-1 R1-root)
  25039. --- END Input Phase ---
  25040. --- Proposal Phase ---
  25041. --- Inner Elaboration Phase, active level 1 (S1) ---
  25042. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  25043. -->
  25044. (S1 ^operator O2143 = 0.878407746096616)
  25045. Firing prefer*rvt*predict-yes*H0*5*H1
  25046. -->
  25047. Firing elaborate*copy-see-to-output-link
  25048. -->
  25049. (I3 ^see 1 +)
  25050. Firing elaborate*reward*based*on*reward
  25051. -->
  25052. (R1076 ^value 1 +)
  25053. (R1 ^reward R1076 +)
  25054. Firing propose*predict-yes
  25055. -->
  25056. (O2145 ^name predict-yes +)
  25057. (S1 ^operator O2145 +)
  25058. Firing propose*predict-no
  25059. -->
  25060. (O2146 ^name predict-no +)
  25061. (S1 ^operator O2146 +)
  25062. Firing rl*prefer*rvt*predict-no*H0*6
  25063. -->
  25064. (S1 ^operator O2144 = 0.886835768609456)
  25065. Firing rl*prefer*rvt*predict-yes*H0*5
  25066. -->
  25067. (S1 ^operator O2143 = 0.1215963023937551)
  25068. Firing prefer*rvt*predict-yes*H0
  25069. -->
  25070. Firing prefer*rvt*predict-no*H0
  25071. -->
  25072. Firing elaborate*copy-dir-to-output-link
  25073. -->
  25074. (I3 ^dir R +)
  25075. inner elaboration loop at bottom goal.
  25076. Retracting elaborate*copy-see-to-output-link
  25077. -->
  25078. (I3 ^see 1 +)
  25079. Retracting propose*predict-no
  25080. -->
  25081. (O2144 ^name predict-no +)
  25082. (S1 ^operator O2144 +)
  25083. Retracting propose*predict-yes
  25084. -->
  25085. (O2143 ^name predict-yes +)
  25086. (S1 ^operator O2143 +)
  25087. Retracting elaborate*reward*based*on*reward
  25088. -->
  25089. (R1075 ^value 1 +)
  25090. (R1 ^reward R1075 +)
  25091. Retracting elaborate*copy-dir-to-output-link
  25092. -->
  25093. (I3 ^dir L +)
  25094. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  25095. -->
  25096. (S1 ^operator O2144 = -0.168718511744511)
  25097. Retracting rl*prefer*rvt*predict-no*H0*4
  25098. -->
  25099. (S1 ^operator O2144 = 0.3144956610238658)
  25100. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  25101. -->
  25102. (S1 ^operator O2143 = 0.6092621009042343)
  25103. Retracting rl*prefer*rvt*predict-yes*H0*3
  25104. -->
  25105. (S1 ^operator O2143 = 0.3907770108106386)
  25106. =>WM: (15084: S1 ^operator O2146 +)
  25107. =>WM: (15083: S1 ^operator O2145 +)
  25108. =>WM: (15082: I3 ^dir R)
  25109. =>WM: (15081: O2146 ^name predict-no)
  25110. =>WM: (15080: O2145 ^name predict-yes)
  25111. =>WM: (15079: R1076 ^value 1)
  25112. =>WM: (15078: R1 ^reward R1076)
  25113. <=WM: (15069: S1 ^operator O2143 +)
  25114. <=WM: (15071: S1 ^operator O2143)
  25115. <=WM: (15070: S1 ^operator O2144 +)
  25116. <=WM: (15068: I3 ^dir L)
  25117. <=WM: (15064: R1 ^reward R1075)
  25118. <=WM: (15067: O2144 ^name predict-no)
  25119. <=WM: (15066: O2143 ^name predict-yes)
  25120. <=WM: (15065: R1075 ^value 1)
  25121. --- Inner Elaboration Phase, active level 1 (S1) ---
  25122. Firing prefer*rvt*predict-yes*H0
  25123. -->
  25124. Firing rl*prefer*rvt*predict-yes*H0*5
  25125. -->
  25126. (S1 ^operator O2145 = 0.1215963023937551)
  25127. Firing prefer*rvt*predict-yes*H0*5*H1
  25128. -->
  25129. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  25130. -->
  25131. (S1 ^operator O2145 = 0.878407746096616)
  25132. Firing prefer*rvt*predict-no*H0
  25133. -->
  25134. Firing rl*prefer*rvt*predict-no*H0*6
  25135. -->
  25136. (S1 ^operator O2146 = 0.886835768609456)
  25137. inner elaboration loop at bottom goal.
  25138. Retracting rl*prefer*rvt*predict-no*H0*6
  25139. -->
  25140. (S1 ^operator O2144 = 0.886835768609456)
  25141. Retracting rl*prefer*rvt*predict-yes*H0*5
  25142. -->
  25143. (S1 ^operator O2143 = 0.1215963023937551)
  25144. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  25145. -->
  25146. (S1 ^operator O2143 = 0.878407746096616)
  25147. --- END Proposal Phase ---
  25148. --- Decision Phase ---
  25149. RL update rl*prefer*rvt*predict-yes*H0*3 0.472323 -0.081546 0.390777 -> 0.47232 -0.0815465 0.390774(R,m,v=1,0.948571,0.049064)
  25150. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.52771 0.0815518 0.609262 -> 0.527707 0.0815513 0.609258(R,m,v=1,1,0)
  25151. =>WM: (15085: S1 ^operator O2145)
  25152. 1073: O: O2145 (predict-yes)
  25153. --- END Decision Phase ---
  25154. --- Application Phase ---
  25155. --- Firing Productions (PE) For State At Depth 1 ---
  25156. --- Inner Elaboration Phase, active level 1 (S1) ---
  25157. Firing apply*operator
  25158. -->
  25159. (I3 ^predict-yes N1073 + :O )
  25160. Firing apply*operator*complete
  25161. -->
  25162. (I3 ^predict-yes N1072 - :O )
  25163. inner elaboration loop at bottom goal.
  25164. --- Change Working Memory (PE) ---
  25165. =>WM: (15086: I3 ^predict-yes N1073)
  25166. <=WM: (15073: N1072 ^status complete)
  25167. <=WM: (15072: I3 ^predict-yes N1072)
  25168. --- Firing Productions (IE) For State At Depth 1 ---
  25169. --- Inner Elaboration Phase, active level 1 (S1) ---
  25170. Firing monitor*world
  25171. -->
  25172. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25173. --- Change Working Memory (IE) ---
  25174. --- END Application Phase ---
  25175. --- Output Phase ---
  25176. ENV: Agent did: predict-yes for direction R in state State-A
  25177. In State-A moving R
  25178. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25179. predict error 0
  25180. dir: dir isR
  25181. --- END Output Phase ---
  25182. /|\--- Input Phase ---
  25183. =>WM: (15090: I2 ^dir R)
  25184. =>WM: (15089: I2 ^reward 1)
  25185. =>WM: (15088: I2 ^see 1)
  25186. =>WM: (15087: N1073 ^status complete)
  25187. <=WM: (15076: I2 ^dir R)
  25188. <=WM: (15075: I2 ^reward 1)
  25189. <=WM: (15074: I2 ^see 1)
  25190. =>WM: (15091: I2 ^level-1 R1-root)
  25191. <=WM: (15077: I2 ^level-1 L1-root)
  25192. --- END Input Phase ---
  25193. --- Proposal Phase ---
  25194. --- Inner Elaboration Phase, active level 1 (S1) ---
  25195. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  25196. -->
  25197. (S1 ^operator O2145 = -0.04253361215288998)
  25198. Firing prefer*rvt*predict-yes*H0*5*H1
  25199. -->
  25200. Firing elaborate*copy-see-to-output-link
  25201. -->
  25202. (I3 ^see 1 +)
  25203. Firing elaborate*reward*based*on*reward
  25204. -->
  25205. (R1077 ^value 1 +)
  25206. (R1 ^reward R1077 +)
  25207. Firing propose*predict-yes
  25208. -->
  25209. (O2147 ^name predict-yes +)
  25210. (S1 ^operator O2147 +)
  25211. Firing propose*predict-no
  25212. -->
  25213. (O2148 ^name predict-no +)
  25214. (S1 ^operator O2148 +)
  25215. Firing rl*prefer*rvt*predict-no*H0*6
  25216. -->
  25217. (S1 ^operator O2146 = 0.886835768609456)
  25218. Firing rl*prefer*rvt*predict-yes*H0*5
  25219. -->
  25220. (S1 ^operator O2145 = 0.1215963023937551)
  25221. Firing prefer*rvt*predict-yes*H0
  25222. -->
  25223. Firing prefer*rvt*predict-no*H0
  25224. -->
  25225. Firing elaborate*copy-dir-to-output-link
  25226. -->
  25227. (I3 ^dir R +)
  25228. inner elaboration loop at bottom goal.
  25229. Retracting elaborate*copy-see-to-output-link
  25230. -->
  25231. (I3 ^see 1 +)
  25232. Retracting propose*predict-no
  25233. -->
  25234. (O2146 ^name predict-no +)
  25235. (S1 ^operator O2146 +)
  25236. Retracting propose*predict-yes
  25237. -->
  25238. (O2145 ^name predict-yes +)
  25239. (S1 ^operator O2145 +)
  25240. Retracting elaborate*reward*based*on*reward
  25241. -->
  25242. (R1076 ^value 1 +)
  25243. (R1 ^reward R1076 +)
  25244. Retracting elaborate*copy-dir-to-output-link
  25245. -->
  25246. (I3 ^dir R +)
  25247. Retracting rl*prefer*rvt*predict-no*H0*6
  25248. -->
  25249. (S1 ^operator O2146 = 0.886835768609456)
  25250. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  25251. -->
  25252. (S1 ^operator O2145 = 0.878407746096616)
  25253. Retracting rl*prefer*rvt*predict-yes*H0*5
  25254. -->
  25255. (S1 ^operator O2145 = 0.1215963023937551)
  25256. =>WM: (15097: S1 ^operator O2148 +)
  25257. =>WM: (15096: S1 ^operator O2147 +)
  25258. =>WM: (15095: O2148 ^name predict-no)
  25259. =>WM: (15094: O2147 ^name predict-yes)
  25260. =>WM: (15093: R1077 ^value 1)
  25261. =>WM: (15092: R1 ^reward R1077)
  25262. <=WM: (15083: S1 ^operator O2145 +)
  25263. <=WM: (15085: S1 ^operator O2145)
  25264. <=WM: (15084: S1 ^operator O2146 +)
  25265. <=WM: (15078: R1 ^reward R1076)
  25266. <=WM: (15081: O2146 ^name predict-no)
  25267. <=WM: (15080: O2145 ^name predict-yes)
  25268. <=WM: (15079: R1076 ^value 1)
  25269. --- Inner Elaboration Phase, active level 1 (S1) ---
  25270. Firing prefer*rvt*predict-yes*H0
  25271. -->
  25272. Firing rl*prefer*rvt*predict-yes*H0*5
  25273. -->
  25274. (S1 ^operator O2147 = 0.1215963023937551)
  25275. Firing prefer*rvt*predict-yes*H0*5*H1
  25276. -->
  25277. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  25278. -->
  25279. (S1 ^operator O2147 = -0.04253361215288998)
  25280. Firing prefer*rvt*predict-no*H0
  25281. -->
  25282. Firing rl*prefer*rvt*predict-no*H0*6
  25283. -->
  25284. (S1 ^operator O2148 = 0.886835768609456)
  25285. inner elaboration loop at bottom goal.
  25286. Retracting rl*prefer*rvt*predict-no*H0*6
  25287. -->
  25288. (S1 ^operator O2146 = 0.886835768609456)
  25289. Retracting rl*prefer*rvt*predict-yes*H0*5
  25290. -->
  25291. (S1 ^operator O2145 = 0.1215963023937551)
  25292. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  25293. -->
  25294. (S1 ^operator O2145 = -0.04253361215288998)
  25295. --- END Proposal Phase ---
  25296. --- Decision Phase ---
  25297. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534522 -0.412926 0.121596(R,m,v=1,0.874346,0.110444)
  25298. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.465481 0.412927 0.878408 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
  25299. =>WM: (15098: S1 ^operator O2148)
  25300. 1074: O: O2148 (predict-no)
  25301. --- END Decision Phase ---
  25302. --- Application Phase ---
  25303. --- Firing Productions (PE) For State At Depth 1 ---
  25304. --- Inner Elaboration Phase, active level 1 (S1) ---
  25305. Firing apply*operator
  25306. -->
  25307. (I3 ^predict-no N1074 + :O )
  25308. Firing apply*operator*complete
  25309. -->
  25310. (I3 ^predict-yes N1073 - :O )
  25311. inner elaboration loop at bottom goal.
  25312. --- Change Working Memory (PE) ---
  25313. =>WM: (15099: I3 ^predict-no N1074)
  25314. <=WM: (15087: N1073 ^status complete)
  25315. <=WM: (15086: I3 ^predict-yes N1073)
  25316. --- Firing Productions (IE) For State At Depth 1 ---
  25317. --- Inner Elaboration Phase, active level 1 (S1) ---
  25318. Firing monitor*world
  25319. -->
  25320. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25321. --- Change Working Memory (IE) ---
  25322. --- END Application Phase ---
  25323. --- Output Phase ---
  25324. ENV: Agent did: predict-no for direction R in state State-B
  25325. In State-B moving R
  25326. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25327. predict error 0
  25328. dir: dir isL
  25329. --- END Output Phase ---
  25330. -/|\--- Input Phase ---
  25331. =>WM: (15103: I2 ^dir L)
  25332. =>WM: (15102: I2 ^reward 1)
  25333. =>WM: (15101: I2 ^see 0)
  25334. =>WM: (15100: N1074 ^status complete)
  25335. <=WM: (15090: I2 ^dir R)
  25336. <=WM: (15089: I2 ^reward 1)
  25337. <=WM: (15088: I2 ^see 1)
  25338. =>WM: (15104: I2 ^level-1 R0-root)
  25339. <=WM: (15091: I2 ^level-1 R1-root)
  25340. --- END Input Phase ---
  25341. --- Proposal Phase ---
  25342. --- Inner Elaboration Phase, active level 1 (S1) ---
  25343. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  25344. -->
  25345. (S1 ^operator O2148 = -0.1984300550322165)
  25346. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  25347. -->
  25348. (S1 ^operator O2147 = 0.6091603294693171)
  25349. Firing prefer*rvt*predict-no*H0*4*H1
  25350. -->
  25351. Firing prefer*rvt*predict-yes*H0*3*H1
  25352. -->
  25353. Firing elaborate*copy-see-to-output-link
  25354. -->
  25355. (I3 ^see 0 +)
  25356. Firing elaborate*reward*based*on*reward
  25357. -->
  25358. (R1078 ^value 1 +)
  25359. (R1 ^reward R1078 +)
  25360. Firing propose*predict-yes
  25361. -->
  25362. (O2149 ^name predict-yes +)
  25363. (S1 ^operator O2149 +)
  25364. Firing propose*predict-no
  25365. -->
  25366. (O2150 ^name predict-no +)
  25367. (S1 ^operator O2150 +)
  25368. Firing rl*prefer*rvt*predict-no*H0*4
  25369. -->
  25370. (S1 ^operator O2148 = 0.3144956610238658)
  25371. Firing rl*prefer*rvt*predict-yes*H0*3
  25372. -->
  25373. (S1 ^operator O2147 = 0.3907738386230689)
  25374. Firing prefer*rvt*predict-yes*H0
  25375. -->
  25376. Firing prefer*rvt*predict-no*H0
  25377. -->
  25378. Firing elaborate*copy-dir-to-output-link
  25379. -->
  25380. (I3 ^dir L +)
  25381. inner elaboration loop at bottom goal.
  25382. Retracting elaborate*copy-see-to-output-link
  25383. -->
  25384. (I3 ^see 1 +)
  25385. Retracting propose*predict-no
  25386. -->
  25387. (O2148 ^name predict-no +)
  25388. (S1 ^operator O2148 +)
  25389. Retracting propose*predict-yes
  25390. -->
  25391. (O2147 ^name predict-yes +)
  25392. (S1 ^operator O2147 +)
  25393. Retracting elaborate*reward*based*on*reward
  25394. -->
  25395. (R1077 ^value 1 +)
  25396. (R1 ^reward R1077 +)
  25397. Retracting elaborate*copy-dir-to-output-link
  25398. -->
  25399. (I3 ^dir R +)
  25400. Retracting rl*prefer*rvt*predict-no*H0*6
  25401. -->
  25402. (S1 ^operator O2148 = 0.886835768609456)
  25403. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  25404. -->
  25405. (S1 ^operator O2147 = -0.04253361215288998)
  25406. Retracting rl*prefer*rvt*predict-yes*H0*5
  25407. -->
  25408. (S1 ^operator O2147 = 0.1215959786322932)
  25409. =>WM: (15112: S1 ^operator O2150 +)
  25410. =>WM: (15111: S1 ^operator O2149 +)
  25411. =>WM: (15110: I3 ^dir L)
  25412. =>WM: (15109: O2150 ^name predict-no)
  25413. =>WM: (15108: O2149 ^name predict-yes)
  25414. =>WM: (15107: R1078 ^value 1)
  25415. =>WM: (15106: R1 ^reward R1078)
  25416. =>WM: (15105: I3 ^see 0)
  25417. <=WM: (15096: S1 ^operator O2147 +)
  25418. <=WM: (15097: S1 ^operator O2148 +)
  25419. <=WM: (15098: S1 ^operator O2148)
  25420. <=WM: (15082: I3 ^dir R)
  25421. <=WM: (15092: R1 ^reward R1077)
  25422. <=WM: (15049: I3 ^see 1)
  25423. <=WM: (15095: O2148 ^name predict-no)
  25424. <=WM: (15094: O2147 ^name predict-yes)
  25425. <=WM: (15093: R1077 ^value 1)
  25426. --- Inner Elaboration Phase, active level 1 (S1) ---
  25427. Firing prefer*rvt*predict-yes*H0
  25428. -->
  25429. Firing rl*prefer*rvt*predict-yes*H0*3
  25430. -->
  25431. (S1 ^operator O2149 = 0.3907738386230689)
  25432. Firing prefer*rvt*predict-yes*H0*3*H1
  25433. -->
  25434. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  25435. -->
  25436. (S1 ^operator O2149 = 0.6091603294693171)
  25437. Firing prefer*rvt*predict-no*H0
  25438. -->
  25439. Firing rl*prefer*rvt*predict-no*H0*4
  25440. -->
  25441. (S1 ^operator O2150 = 0.3144956610238658)
  25442. Firing prefer*rvt*predict-no*H0*4*H1
  25443. -->
  25444. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  25445. -->
  25446. (S1 ^operator O2150 = -0.1984300550322165)
  25447. inner elaboration loop at bottom goal.
  25448. Retracting rl*prefer*rvt*predict-no*H0*4
  25449. -->
  25450. (S1 ^operator O2148 = 0.3144956610238658)
  25451. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  25452. -->
  25453. (S1 ^operator O2148 = -0.1984300550322165)
  25454. Retracting rl*prefer*rvt*predict-yes*H0*3
  25455. -->
  25456. (S1 ^operator O2147 = 0.3907738386230689)
  25457. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  25458. -->
  25459. (S1 ^operator O2147 = 0.6091603294693171)
  25460. --- END Proposal Phase ---
  25461. --- Decision Phase ---
  25462. RL update rl*prefer*rvt*predict-no*H0*6 0.886836 0 0.886836 -> 0.904951 0 0.904951(R,m,v=1,0.936842,0.059482)
  25463. =>WM: (15113: S1 ^operator O2149)
  25464. 1075: O: O2149 (predict-yes)
  25465. --- END Decision Phase ---
  25466. --- Application Phase ---
  25467. --- Firing Productions (PE) For State At Depth 1 ---
  25468. --- Inner Elaboration Phase, active level 1 (S1) ---
  25469. Firing apply*operator
  25470. -->
  25471. (I3 ^predict-yes N1075 + :O )
  25472. Firing apply*operator*complete
  25473. -->
  25474. (I3 ^predict-no N1074 - :O )
  25475. inner elaboration loop at bottom goal.
  25476. --- Change Working Memory (PE) ---
  25477. =>WM: (15114: I3 ^predict-yes N1075)
  25478. <=WM: (15100: N1074 ^status complete)
  25479. <=WM: (15099: I3 ^predict-no N1074)
  25480. --- Firing Productions (IE) For State At Depth 1 ---
  25481. --- Inner Elaboration Phase, active level 1 (S1) ---
  25482. Firing monitor*world
  25483. -->
  25484. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25485. --- Change Working Memory (IE) ---
  25486. --- END Application Phase ---
  25487. --- Output Phase ---
  25488. ENV: Agent did: predict-yes for direction L in state State-B
  25489. In State-B moving L
  25490. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  25491. predict error 0
  25492. dir: dir isL
  25493. --- END Output Phase ---
  25494. -/|--- Input Phase ---
  25495. =>WM: (15118: I2 ^dir L)
  25496. =>WM: (15117: I2 ^reward 1)
  25497. =>WM: (15116: I2 ^see 1)
  25498. =>WM: (15115: N1075 ^status complete)
  25499. <=WM: (15103: I2 ^dir L)
  25500. <=WM: (15102: I2 ^reward 1)
  25501. <=WM: (15101: I2 ^see 0)
  25502. =>WM: (15119: I2 ^level-1 L1-root)
  25503. <=WM: (15104: I2 ^level-1 R0-root)
  25504. --- END Input Phase ---
  25505. --- Proposal Phase ---
  25506. --- Inner Elaboration Phase, active level 1 (S1) ---
  25507. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  25508. -->
  25509. (S1 ^operator O2149 = -0.2062723012911647)
  25510. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  25511. -->
  25512. (S1 ^operator O2150 = 0.6855163447632109)
  25513. Firing prefer*rvt*predict-no*H0*4*H1
  25514. -->
  25515. Firing prefer*rvt*predict-yes*H0*3*H1
  25516. -->
  25517. Firing elaborate*copy-see-to-output-link
  25518. -->
  25519. (I3 ^see 1 +)
  25520. Firing elaborate*reward*based*on*reward
  25521. -->
  25522. (R1079 ^value 1 +)
  25523. (R1 ^reward R1079 +)
  25524. Firing propose*predict-yes
  25525. -->
  25526. (O2151 ^name predict-yes +)
  25527. (S1 ^operator O2151 +)
  25528. Firing propose*predict-no
  25529. -->
  25530. (O2152 ^name predict-no +)
  25531. (S1 ^operator O2152 +)
  25532. Firing rl*prefer*rvt*predict-no*H0*4
  25533. -->
  25534. (S1 ^operator O2150 = 0.3144956610238658)
  25535. Firing rl*prefer*rvt*predict-yes*H0*3
  25536. -->
  25537. (S1 ^operator O2149 = 0.3907738386230689)
  25538. Firing prefer*rvt*predict-yes*H0
  25539. -->
  25540. Firing prefer*rvt*predict-no*H0
  25541. -->
  25542. Firing elaborate*copy-dir-to-output-link
  25543. -->
  25544. (I3 ^dir L +)
  25545. inner elaboration loop at bottom goal.
  25546. Retracting elaborate*copy-see-to-output-link
  25547. -->
  25548. (I3 ^see 0 +)
  25549. Retracting propose*predict-no
  25550. -->
  25551. (O2150 ^name predict-no +)
  25552. (S1 ^operator O2150 +)
  25553. Retracting propose*predict-yes
  25554. -->
  25555. (O2149 ^name predict-yes +)
  25556. (S1 ^operator O2149 +)
  25557. Retracting elaborate*reward*based*on*reward
  25558. -->
  25559. (R1078 ^value 1 +)
  25560. (R1 ^reward R1078 +)
  25561. Retracting elaborate*copy-dir-to-output-link
  25562. -->
  25563. (I3 ^dir L +)
  25564. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  25565. -->
  25566. (S1 ^operator O2150 = -0.1984300550322165)
  25567. Retracting rl*prefer*rvt*predict-no*H0*4
  25568. -->
  25569. (S1 ^operator O2150 = 0.3144956610238658)
  25570. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  25571. -->
  25572. (S1 ^operator O2149 = 0.6091603294693171)
  25573. Retracting rl*prefer*rvt*predict-yes*H0*3
  25574. -->
  25575. (S1 ^operator O2149 = 0.3907738386230689)
  25576. =>WM: (15126: S1 ^operator O2152 +)
  25577. =>WM: (15125: S1 ^operator O2151 +)
  25578. =>WM: (15124: O2152 ^name predict-no)
  25579. =>WM: (15123: O2151 ^name predict-yes)
  25580. =>WM: (15122: R1079 ^value 1)
  25581. =>WM: (15121: R1 ^reward R1079)
  25582. =>WM: (15120: I3 ^see 1)
  25583. <=WM: (15111: S1 ^operator O2149 +)
  25584. <=WM: (15113: S1 ^operator O2149)
  25585. <=WM: (15112: S1 ^operator O2150 +)
  25586. <=WM: (15106: R1 ^reward R1078)
  25587. <=WM: (15105: I3 ^see 0)
  25588. <=WM: (15109: O2150 ^name predict-no)
  25589. <=WM: (15108: O2149 ^name predict-yes)
  25590. <=WM: (15107: R1078 ^value 1)
  25591. --- Inner Elaboration Phase, active level 1 (S1) ---
  25592. Firing prefer*rvt*predict-yes*H0
  25593. -->
  25594. Firing rl*prefer*rvt*predict-yes*H0*3
  25595. -->
  25596. (S1 ^operator O2151 = 0.3907738386230689)
  25597. Firing prefer*rvt*predict-yes*H0*3*H1
  25598. -->
  25599. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  25600. -->
  25601. (S1 ^operator O2151 = -0.2062723012911647)
  25602. Firing prefer*rvt*predict-no*H0
  25603. -->
  25604. Firing rl*prefer*rvt*predict-no*H0*4
  25605. -->
  25606. (S1 ^operator O2152 = 0.3144956610238658)
  25607. Firing prefer*rvt*predict-no*H0*4*H1
  25608. -->
  25609. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  25610. -->
  25611. (S1 ^operator O2152 = 0.6855163447632109)
  25612. inner elaboration loop at bottom goal.
  25613. Retracting rl*prefer*rvt*predict-no*H0*4
  25614. -->
  25615. (S1 ^operator O2150 = 0.3144956610238658)
  25616. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  25617. -->
  25618. (S1 ^operator O2150 = 0.6855163447632109)
  25619. Retracting rl*prefer*rvt*predict-yes*H0*3
  25620. -->
  25621. (S1 ^operator O2149 = 0.3907738386230689)
  25622. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  25623. -->
  25624. (S1 ^operator O2149 = -0.2062723012911647)
  25625. --- END Proposal Phase ---
  25626. --- Decision Phase ---
  25627. RL update rl*prefer*rvt*predict-yes*H0*3 0.47232 -0.0815465 0.390774 -> 0.472325 -0.0815457 0.390779(R,m,v=1,0.948864,0.0487987)
  25628. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527624 0.0815368 0.60916 -> 0.527629 0.0815377 0.609166(R,m,v=1,1,0)
  25629. =>WM: (15127: S1 ^operator O2152)
  25630. 1076: O: O2152 (predict-no)
  25631. --- END Decision Phase ---
  25632. --- Application Phase ---
  25633. --- Firing Productions (PE) For State At Depth 1 ---
  25634. --- Inner Elaboration Phase, active level 1 (S1) ---
  25635. Firing apply*operator
  25636. -->
  25637. (I3 ^predict-no N1076 + :O )
  25638. Firing apply*operator*complete
  25639. -->
  25640. (I3 ^predict-yes N1075 - :O )
  25641. inner elaboration loop at bottom goal.
  25642. --- Change Working Memory (PE) ---
  25643. =>WM: (15128: I3 ^predict-no N1076)
  25644. <=WM: (15115: N1075 ^status complete)
  25645. <=WM: (15114: I3 ^predict-yes N1075)
  25646. --- Firing Productions (IE) For State At Depth 1 ---
  25647. --- Inner Elaboration Phase, active level 1 (S1) ---
  25648. Firing monitor*world
  25649. -->
  25650. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25651. --- Change Working Memory (IE) ---
  25652. --- END Application Phase ---
  25653. --- Output Phase ---
  25654. ENV: Agent did: predict-no for direction L in state State-A
  25655. In State-A moving L
  25656. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  25657. predict error 0
  25658. dir: dir isR
  25659. --- END Output Phase ---
  25660. \-/--- Input Phase ---
  25661. =>WM: (15132: I2 ^dir R)
  25662. =>WM: (15131: I2 ^reward 1)
  25663. =>WM: (15130: I2 ^see 0)
  25664. =>WM: (15129: N1076 ^status complete)
  25665. <=WM: (15118: I2 ^dir L)
  25666. <=WM: (15117: I2 ^reward 1)
  25667. <=WM: (15116: I2 ^see 1)
  25668. =>WM: (15133: I2 ^level-1 L0-root)
  25669. <=WM: (15119: I2 ^level-1 L1-root)
  25670. --- END Input Phase ---
  25671. --- Proposal Phase ---
  25672. --- Inner Elaboration Phase, active level 1 (S1) ---
  25673. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  25674. -->
  25675. (S1 ^operator O2151 = 0.8783984798460494)
  25676. Firing prefer*rvt*predict-yes*H0*5*H1
  25677. -->
  25678. Firing elaborate*copy-see-to-output-link
  25679. -->
  25680. (I3 ^see 0 +)
  25681. Firing elaborate*reward*based*on*reward
  25682. -->
  25683. (R1080 ^value 1 +)
  25684. (R1 ^reward R1080 +)
  25685. Firing propose*predict-yes
  25686. -->
  25687. (O2153 ^name predict-yes +)
  25688. (S1 ^operator O2153 +)
  25689. Firing propose*predict-no
  25690. -->
  25691. (O2154 ^name predict-no +)
  25692. (S1 ^operator O2154 +)
  25693. Firing rl*prefer*rvt*predict-no*H0*6
  25694. -->
  25695. (S1 ^operator O2152 = 0.9049506710147235)
  25696. Firing rl*prefer*rvt*predict-yes*H0*5
  25697. -->
  25698. (S1 ^operator O2151 = 0.1215959786322932)
  25699. Firing prefer*rvt*predict-yes*H0
  25700. -->
  25701. Firing prefer*rvt*predict-no*H0
  25702. -->
  25703. Firing elaborate*copy-dir-to-output-link
  25704. -->
  25705. (I3 ^dir R +)
  25706. inner elaboration loop at bottom goal.
  25707. Retracting elaborate*copy-see-to-output-link
  25708. -->
  25709. (I3 ^see 1 +)
  25710. Retracting propose*predict-no
  25711. -->
  25712. (O2152 ^name predict-no +)
  25713. (S1 ^operator O2152 +)
  25714. Retracting propose*predict-yes
  25715. -->
  25716. (O2151 ^name predict-yes +)
  25717. (S1 ^operator O2151 +)
  25718. Retracting elaborate*reward*based*on*reward
  25719. -->
  25720. (R1079 ^value 1 +)
  25721. (R1 ^reward R1079 +)
  25722. Retracting elaborate*copy-dir-to-output-link
  25723. -->
  25724. (I3 ^dir L +)
  25725. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  25726. -->
  25727. (S1 ^operator O2152 = 0.6855163447632109)
  25728. Retracting rl*prefer*rvt*predict-no*H0*4
  25729. -->
  25730. (S1 ^operator O2152 = 0.3144956610238658)
  25731. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  25732. -->
  25733. (S1 ^operator O2151 = -0.2062723012911647)
  25734. Retracting rl*prefer*rvt*predict-yes*H0*3
  25735. -->
  25736. (S1 ^operator O2151 = 0.390779173043162)
  25737. =>WM: (15141: S1 ^operator O2154 +)
  25738. =>WM: (15140: S1 ^operator O2153 +)
  25739. =>WM: (15139: I3 ^dir R)
  25740. =>WM: (15138: O2154 ^name predict-no)
  25741. =>WM: (15137: O2153 ^name predict-yes)
  25742. =>WM: (15136: R1080 ^value 1)
  25743. =>WM: (15135: R1 ^reward R1080)
  25744. =>WM: (15134: I3 ^see 0)
  25745. <=WM: (15125: S1 ^operator O2151 +)
  25746. <=WM: (15126: S1 ^operator O2152 +)
  25747. <=WM: (15127: S1 ^operator O2152)
  25748. <=WM: (15110: I3 ^dir L)
  25749. <=WM: (15121: R1 ^reward R1079)
  25750. <=WM: (15120: I3 ^see 1)
  25751. <=WM: (15124: O2152 ^name predict-no)
  25752. <=WM: (15123: O2151 ^name predict-yes)
  25753. <=WM: (15122: R1079 ^value 1)
  25754. --- Inner Elaboration Phase, active level 1 (S1) ---
  25755. Firing prefer*rvt*predict-yes*H0
  25756. -->
  25757. Firing rl*prefer*rvt*predict-yes*H0*5
  25758. -->
  25759. (S1 ^operator O2153 = 0.1215959786322932)
  25760. Firing prefer*rvt*predict-yes*H0*5*H1
  25761. -->
  25762. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  25763. -->
  25764. (S1 ^operator O2153 = 0.8783984798460494)
  25765. Firing prefer*rvt*predict-no*H0
  25766. -->
  25767. Firing rl*prefer*rvt*predict-no*H0*6
  25768. -->
  25769. (S1 ^operator O2154 = 0.9049506710147235)
  25770. inner elaboration loop at bottom goal.
  25771. Retracting rl*prefer*rvt*predict-no*H0*6
  25772. -->
  25773. (S1 ^operator O2152 = 0.9049506710147235)
  25774. Retracting rl*prefer*rvt*predict-yes*H0*5
  25775. -->
  25776. (S1 ^operator O2151 = 0.1215959786322932)
  25777. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  25778. -->
  25779. (S1 ^operator O2151 = 0.8783984798460494)
  25780. --- END Proposal Phase ---
  25781. --- Decision Phase ---
  25782. RL update rl*prefer*rvt*predict-no*H0*4 0.478545 -0.164049 0.314496 -> 0.478544 -0.164049 0.314495(R,m,v=1,0.926829,0.0682328)
  25783. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521466 0.16405 0.685516 -> 0.521465 0.16405 0.685515(R,m,v=1,1,0)
  25784. =>WM: (15142: S1 ^operator O2153)
  25785. 1077: O: O2153 (predict-yes)
  25786. --- END Decision Phase ---
  25787. --- Application Phase ---
  25788. --- Firing Productions (PE) For State At Depth 1 ---
  25789. --- Inner Elaboration Phase, active level 1 (S1) ---
  25790. Firing apply*operator
  25791. -->
  25792. (I3 ^predict-yes N1077 + :O )
  25793. Firing apply*operator*complete
  25794. -->
  25795. (I3 ^predict-no N1076 - :O )
  25796. inner elaboration loop at bottom goal.
  25797. --- Change Working Memory (PE) ---
  25798. =>WM: (15143: I3 ^predict-yes N1077)
  25799. <=WM: (15129: N1076 ^status complete)
  25800. <=WM: (15128: I3 ^predict-no N1076)
  25801. --- Firing Productions (IE) For State At Depth 1 ---
  25802. --- Inner Elaboration Phase, active level 1 (S1) ---
  25803. Firing monitor*world
  25804. -->
  25805. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  25806. --- Change Working Memory (IE) ---
  25807. --- END Application Phase ---
  25808. --- Output Phase ---
  25809. ENV: Agent did: predict-yes for direction R in state State-A
  25810. In State-A moving R
  25811. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  25812. predict error 0
  25813. dir: dir isR
  25814. --- END Output Phase ---
  25815. --- Input Phase ---
  25816. =>WM: (15147: I2 ^dir R)
  25817. =>WM: (15146: I2 ^reward 1)
  25818. =>WM: (15145: I2 ^see 1)
  25819. =>WM: (15144: N1077 ^status complete)
  25820. <=WM: (15132: I2 ^dir R)
  25821. <=WM: (15131: I2 ^reward 1)
  25822. <=WM: (15130: I2 ^see 0)
  25823. =>WM: (15148: I2 ^level-1 R1-root)
  25824. <=WM: (15133: I2 ^level-1 L0-root)
  25825. --- END Input Phase ---
  25826. --- Proposal Phase ---
  25827. --- Inner Elaboration Phase, active level 1 (S1) ---
  25828. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  25829. -->
  25830. (S1 ^operator O2153 = -0.04253361215288998)
  25831. Firing prefer*rvt*predict-yes*H0*5*H1
  25832. -->
  25833. Firing elaborate*copy-see-to-output-link
  25834. -->
  25835. (I3 ^see 1 +)
  25836. Firing elaborate*reward*based*on*reward
  25837. -->
  25838. (R1081 ^value 1 +)
  25839. (R1 ^reward R1081 +)
  25840. Firing propose*predict-yes
  25841. -->
  25842. (O2155 ^name predict-yes +)
  25843. (S1 ^operator O2155 +)
  25844. Firing propose*predict-no
  25845. -->
  25846. (O2156 ^name predict-no +)
  25847. (S1 ^operator O2156 +)
  25848. Firing rl*prefer*rvt*predict-no*H0*6
  25849. -->
  25850. (S1 ^operator O2154 = 0.9049506710147235)
  25851. Firing rl*prefer*rvt*predict-yes*H0*5
  25852. -->
  25853. (S1 ^operator O2153 = 0.1215959786322932)
  25854. Firing prefer*rvt*predict-yes*H0
  25855. -->
  25856. Firing prefer*rvt*predict-no*H0
  25857. -->
  25858. Firing elaborate*copy-dir-to-output-link
  25859. -->
  25860. (I3 ^dir R +)
  25861. inner elaboration loop at bottom goal.
  25862. Retracting elaborate*copy-see-to-output-link
  25863. -->
  25864. (I3 ^see 0 +)
  25865. Retracting propose*predict-no
  25866. -->
  25867. (O2154 ^name predict-no +)
  25868. (S1 ^operator O2154 +)
  25869. Retracting propose*predict-yes
  25870. -->
  25871. (O2153 ^name predict-yes +)
  25872. (S1 ^operator O2153 +)
  25873. Retracting elaborate*reward*based*on*reward
  25874. -->
  25875. (R1080 ^value 1 +)
  25876. (R1 ^reward R1080 +)
  25877. Retracting elaborate*copy-dir-to-output-link
  25878. -->
  25879. (I3 ^dir R +)
  25880. Retracting rl*prefer*rvt*predict-no*H0*6
  25881. -->
  25882. (S1 ^operator O2154 = 0.9049506710147235)
  25883. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  25884. -->
  25885. (S1 ^operator O2153 = 0.8783984798460494)
  25886. Retracting rl*prefer*rvt*predict-yes*H0*5
  25887. -->
  25888. (S1 ^operator O2153 = 0.1215959786322932)
  25889. =>WM: (15155: S1 ^operator O2156 +)
  25890. =>WM: (15154: S1 ^operator O2155 +)
  25891. =>WM: (15153: O2156 ^name predict-no)
  25892. =>WM: (15152: O2155 ^name predict-yes)
  25893. =>WM: (15151: R1081 ^value 1)
  25894. =>WM: (15150: R1 ^reward R1081)
  25895. =>WM: (15149: I3 ^see 1)
  25896. <=WM: (15140: S1 ^operator O2153 +)
  25897. <=WM: (15142: S1 ^operator O2153)
  25898. <=WM: (15141: S1 ^operator O2154 +)
  25899. <=WM: (15135: R1 ^reward R1080)
  25900. <=WM: (15134: I3 ^see 0)
  25901. <=WM: (15138: O2154 ^name predict-no)
  25902. <=WM: (15137: O2153 ^name predict-yes)
  25903. <=WM: (15136: R1080 ^value 1)
  25904. --- Inner Elaboration Phase, active level 1 (S1) ---
  25905. Firing prefer*rvt*predict-yes*H0
  25906. -->
  25907. Firing rl*prefer*rvt*predict-yes*H0*5
  25908. -->
  25909. (S1 ^operator O2155 = 0.1215959786322932)
  25910. Firing prefer*rvt*predict-yes*H0*5*H1
  25911. -->
  25912. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  25913. -->
  25914. (S1 ^operator O2155 = -0.04253361215288998)
  25915. Firing prefer*rvt*predict-no*H0
  25916. -->
  25917. Firing rl*prefer*rvt*predict-no*H0*6
  25918. -->
  25919. (S1 ^operator O2156 = 0.9049506710147235)
  25920. inner elaboration loop at bottom goal.
  25921. Retracting rl*prefer*rvt*predict-no*H0*6
  25922. -->
  25923. (S1 ^operator O2154 = 0.9049506710147235)
  25924. Retracting rl*prefer*rvt*predict-yes*H0*5
  25925. -->
  25926. (S1 ^operator O2153 = 0.1215959786322932)
  25927. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  25928. -->
  25929. (S1 ^operator O2153 = -0.04253361215288998)
  25930. --- END Proposal Phase ---
  25931. --- Decision Phase ---
  25932. RL update rl*prefer*rvt*predict-yes*H0*5 0.534522 -0.412926 0.121596 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.875,0.109948)
  25933. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412926 0.878398 -> 0.465473 0.412926 0.878399(R,m,v=1,1,0)
  25934. =>WM: (15156: S1 ^operator O2156)
  25935. 1078: O: O2156 (predict-no)
  25936. --- END Decision Phase ---
  25937. --- Application Phase ---
  25938. --- Firing Productions (PE) For State At Depth 1 ---
  25939. --- Inner Elaboration Phase, active level 1 (S1) ---
  25940. Firing apply*operator
  25941. -->
  25942. (I3 ^predict-no N1078 + :O )
  25943. Firing apply*operator*complete
  25944. -->
  25945. (I3 ^predict-yes N1077 - :O )
  25946. inner elaboration loop at bottom goal.
  25947. --- Change Working Memory (PE) ---
  25948. =>WM: (15157: I3 ^predict-no N1078)
  25949. <=WM: (15144: N1077 ^status complete)
  25950. <=WM: (15143: I3 ^predict-yes N1077)
  25951. --- Firing Productions (IE) For State At Depth 1 ---
  25952. --- Inner Elaboration Phase, active level 1 (S1) ---
  25953. Firing monitor*world
  25954. -->
  25955. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  25956. --- Change Working Memory (IE) ---
  25957. --- END Application Phase ---
  25958. --- Output Phase ---
  25959. ENV: Agent did: predict-no for direction R in state State-B
  25960. In State-B moving R
  25961. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  25962. predict error 0
  25963. dir: dir isR
  25964. --- END Output Phase ---
  25965. --- Input Phase ---
  25966. =>WM: (15161: I2 ^dir R)
  25967. =>WM: (15160: I2 ^reward 1)
  25968. =>WM: (15159: I2 ^see 0)
  25969. =>WM: (15158: N1078 ^status complete)
  25970. <=WM: (15147: I2 ^dir R)
  25971. <=WM: (15146: I2 ^reward 1)
  25972. <=WM: (15145: I2 ^see 1)
  25973. =>WM: (15162: I2 ^level-1 R0-root)
  25974. <=WM: (15148: I2 ^level-1 R1-root)
  25975. --- END Input Phase ---
  25976. --- Proposal Phase ---
  25977. --- Inner Elaboration Phase, active level 1 (S1) ---
  25978. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  25979. -->
  25980. (S1 ^operator O2155 = -0.1512366769350551)
  25981. Firing prefer*rvt*predict-yes*H0*5*H1
  25982. -->
  25983. Firing elaborate*copy-see-to-output-link
  25984. -->
  25985. (I3 ^see 0 +)
  25986. Firing elaborate*reward*based*on*reward
  25987. -->
  25988. (R1082 ^value 1 +)
  25989. (R1 ^reward R1082 +)
  25990. Firing propose*predict-yes
  25991. -->
  25992. (O2157 ^name predict-yes +)
  25993. (S1 ^operator O2157 +)
  25994. Firing propose*predict-no
  25995. -->
  25996. (O2158 ^name predict-no +)
  25997. (S1 ^operator O2158 +)
  25998. Firing rl*prefer*rvt*predict-no*H0*6
  25999. -->
  26000. (S1 ^operator O2156 = 0.9049506710147235)
  26001. Firing rl*prefer*rvt*predict-yes*H0*5
  26002. -->
  26003. (S1 ^operator O2155 = 0.1215964214230049)
  26004. Firing prefer*rvt*predict-yes*H0
  26005. -->
  26006. Firing prefer*rvt*predict-no*H0
  26007. -->
  26008. Firing elaborate*copy-dir-to-output-link
  26009. -->
  26010. (I3 ^dir R +)
  26011. inner elaboration loop at bottom goal.
  26012. Retracting elaborate*copy-see-to-output-link
  26013. -->
  26014. (I3 ^see 1 +)
  26015. Retracting propose*predict-no
  26016. -->
  26017. (O2156 ^name predict-no +)
  26018. (S1 ^operator O2156 +)
  26019. Retracting propose*predict-yes
  26020. -->
  26021. (O2155 ^name predict-yes +)
  26022. (S1 ^operator O2155 +)
  26023. Retracting elaborate*reward*based*on*reward
  26024. -->
  26025. (R1081 ^value 1 +)
  26026. (R1 ^reward R1081 +)
  26027. Retracting elaborate*copy-dir-to-output-link
  26028. -->
  26029. (I3 ^dir R +)
  26030. Retracting rl*prefer*rvt*predict-no*H0*6
  26031. -->
  26032. (S1 ^operator O2156 = 0.9049506710147235)
  26033. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  26034. -->
  26035. (S1 ^operator O2155 = -0.04253361215288998)
  26036. Retracting rl*prefer*rvt*predict-yes*H0*5
  26037. -->
  26038. (S1 ^operator O2155 = 0.1215964214230049)
  26039. =>WM: (15169: S1 ^operator O2158 +)
  26040. =>WM: (15168: S1 ^operator O2157 +)
  26041. =>WM: (15167: O2158 ^name predict-no)
  26042. =>WM: (15166: O2157 ^name predict-yes)
  26043. =>WM: (15165: R1082 ^value 1)
  26044. =>WM: (15164: R1 ^reward R1082)
  26045. =>WM: (15163: I3 ^see 0)
  26046. <=WM: (15154: S1 ^operator O2155 +)
  26047. <=WM: (15155: S1 ^operator O2156 +)
  26048. <=WM: (15156: S1 ^operator O2156)
  26049. <=WM: (15150: R1 ^reward R1081)
  26050. <=WM: (15149: I3 ^see 1)
  26051. <=WM: (15153: O2156 ^name predict-no)
  26052. <=WM: (15152: O2155 ^name predict-yes)
  26053. <=WM: (15151: R1081 ^value 1)
  26054. --- Inner Elaboration Phase, active level 1 (S1) ---
  26055. Firing prefer*rvt*predict-yes*H0
  26056. -->
  26057. Firing rl*prefer*rvt*predict-yes*H0*5
  26058. -->
  26059. (S1 ^operator O2157 = 0.1215964214230049)
  26060. Firing prefer*rvt*predict-yes*H0*5*H1
  26061. -->
  26062. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  26063. -->
  26064. (S1 ^operator O2157 = -0.1512366769350551)
  26065. Firing prefer*rvt*predict-no*H0
  26066. -->
  26067. Firing rl*prefer*rvt*predict-no*H0*6
  26068. -->
  26069. (S1 ^operator O2158 = 0.9049506710147235)
  26070. inner elaboration loop at bottom goal.
  26071. Retracting rl*prefer*rvt*predict-no*H0*6
  26072. -->
  26073. (S1 ^operator O2156 = 0.9049506710147235)
  26074. Retracting rl*prefer*rvt*predict-yes*H0*5
  26075. -->
  26076. (S1 ^operator O2155 = 0.1215964214230049)
  26077. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  26078. -->
  26079. (S1 ^operator O2155 = -0.1512366769350551)
  26080. --- END Proposal Phase ---
  26081. --- Decision Phase ---
  26082. RL update rl*prefer*rvt*predict-no*H0*6 0.904951 0 0.904951 -> 0.920153 0 0.920153(R,m,v=1,0.937173,0.0591899)
  26083. =>WM: (15170: S1 ^operator O2158)
  26084. 1079: O: O2158 (predict-no)
  26085. --- END Decision Phase ---
  26086. --- Application Phase ---
  26087. --- Firing Productions (PE) For State At Depth 1 ---
  26088. --- Inner Elaboration Phase, active level 1 (S1) ---
  26089. Firing apply*operator
  26090. -->
  26091. (I3 ^predict-no N1079 + :O )
  26092. Firing apply*operator*complete
  26093. -->
  26094. (I3 ^predict-no N1078 - :O )
  26095. inner elaboration loop at bottom goal.
  26096. --- Change Working Memory (PE) ---
  26097. =>WM: (15171: I3 ^predict-no N1079)
  26098. <=WM: (15158: N1078 ^status complete)
  26099. <=WM: (15157: I3 ^predict-no N1078)
  26100. --- Firing Productions (IE) For State At Depth 1 ---
  26101. --- Inner Elaboration Phase, active level 1 (S1) ---
  26102. Firing monitor*world
  26103. -->
  26104. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26105. --- Change Working Memory (IE) ---
  26106. --- END Application Phase ---
  26107. --- Output Phase ---
  26108. ENV: Agent did: predict-no for direction R in state State-B
  26109. In State-B moving R
  26110. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26111. predict error 0
  26112. dir: dir isU
  26113. --- END Output Phase ---
  26114. --- Input Phase ---
  26115. =>WM: (15175: I2 ^dir U)
  26116. =>WM: (15174: I2 ^reward 1)
  26117. =>WM: (15173: I2 ^see 0)
  26118. =>WM: (15172: N1079 ^status complete)
  26119. <=WM: (15161: I2 ^dir R)
  26120. <=WM: (15160: I2 ^reward 1)
  26121. <=WM: (15159: I2 ^see 0)
  26122. =>WM: (15176: I2 ^level-1 R0-root)
  26123. <=WM: (15162: I2 ^level-1 R0-root)
  26124. --- END Input Phase ---
  26125. --- Proposal Phase ---
  26126. --- Inner Elaboration Phase, active level 1 (S1) ---
  26127. Firing elaborate*copy-see-to-output-link
  26128. -->
  26129. (I3 ^see 0 +)
  26130. Firing elaborate*reward*based*on*reward
  26131. -->
  26132. (R1083 ^value 1 +)
  26133. (R1 ^reward R1083 +)
  26134. Firing propose*predict-yes
  26135. -->
  26136. (O2159 ^name predict-yes +)
  26137. (S1 ^operator O2159 +)
  26138. Firing propose*predict-no
  26139. -->
  26140. (O2160 ^name predict-no +)
  26141. (S1 ^operator O2160 +)
  26142. Firing rl*prefer*rvt*predict-no*H0*2
  26143. -->
  26144. (S1 ^operator O2158 = 1.)
  26145. Firing rl*prefer*rvt*predict-yes*H0*1
  26146. -->
  26147. (S1 ^operator O2157 = 0.)
  26148. Firing prefer*rvt*predict-yes*H0
  26149. -->
  26150. Firing prefer*rvt*predict-no*H0
  26151. -->
  26152. Firing elaborate*copy-dir-to-output-link
  26153. -->
  26154. (I3 ^dir U +)
  26155. inner elaboration loop at bottom goal.
  26156. Retracting elaborate*copy-see-to-output-link
  26157. -->
  26158. (I3 ^see 0 +)
  26159. Retracting propose*predict-no
  26160. -->
  26161. (O2158 ^name predict-no +)
  26162. (S1 ^operator O2158 +)
  26163. Retracting propose*predict-yes
  26164. -->
  26165. (O2157 ^name predict-yes +)
  26166. (S1 ^operator O2157 +)
  26167. Retracting elaborate*reward*based*on*reward
  26168. -->
  26169. (R1082 ^value 1 +)
  26170. (R1 ^reward R1082 +)
  26171. Retracting elaborate*copy-dir-to-output-link
  26172. -->
  26173. (I3 ^dir R +)
  26174. Retracting rl*prefer*rvt*predict-no*H0*6
  26175. -->
  26176. (S1 ^operator O2158 = 0.920153033815893)
  26177. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  26178. -->
  26179. (S1 ^operator O2157 = -0.1512366769350551)
  26180. Retracting rl*prefer*rvt*predict-yes*H0*5
  26181. -->
  26182. (S1 ^operator O2157 = 0.1215964214230049)
  26183. =>WM: (15183: S1 ^operator O2160 +)
  26184. =>WM: (15182: S1 ^operator O2159 +)
  26185. =>WM: (15181: I3 ^dir U)
  26186. =>WM: (15180: O2160 ^name predict-no)
  26187. =>WM: (15179: O2159 ^name predict-yes)
  26188. =>WM: (15178: R1083 ^value 1)
  26189. =>WM: (15177: R1 ^reward R1083)
  26190. <=WM: (15168: S1 ^operator O2157 +)
  26191. <=WM: (15169: S1 ^operator O2158 +)
  26192. <=WM: (15170: S1 ^operator O2158)
  26193. <=WM: (15139: I3 ^dir R)
  26194. <=WM: (15164: R1 ^reward R1082)
  26195. <=WM: (15167: O2158 ^name predict-no)
  26196. <=WM: (15166: O2157 ^name predict-yes)
  26197. <=WM: (15165: R1082 ^value 1)
  26198. --- Inner Elaboration Phase, active level 1 (S1) ---
  26199. Firing prefer*rvt*predict-yes*H0
  26200. -->
  26201. Firing rl*prefer*rvt*predict-yes*H0*1
  26202. -->
  26203. (S1 ^operator O2159 = 0.)
  26204. Firing prefer*rvt*predict-no*H0
  26205. -->
  26206. Firing rl*prefer*rvt*predict-no*H0*2
  26207. -->
  26208. (S1 ^operator O2160 = 1.)
  26209. inner elaboration loop at bottom goal.
  26210. Retracting rl*prefer*rvt*predict-no*H0*2
  26211. -->
  26212. (S1 ^operator O2158 = 1.)
  26213. Retracting rl*prefer*rvt*predict-yes*H0*1
  26214. -->
  26215. (S1 ^operator O2157 = 0.)
  26216. --- END Proposal Phase ---
  26217. --- Decision Phase ---
  26218. RL update rl*prefer*rvt*predict-no*H0*6 0.920153 0 0.920153 -> 0.932913 0 0.932913(R,m,v=1,0.9375,0.0589005)
  26219. =>WM: (15184: S1 ^operator O2160)
  26220. 1080: O: O2160 (predict-no)
  26221. --- END Decision Phase ---
  26222. --- Application Phase ---
  26223. --- Firing Productions (PE) For State At Depth 1 ---
  26224. --- Inner Elaboration Phase, active level 1 (S1) ---
  26225. Firing apply*operator
  26226. -->
  26227. (I3 ^predict-no N1080 + :O )
  26228. Firing apply*operator*complete
  26229. -->
  26230. (I3 ^predict-no N1079 - :O )
  26231. inner elaboration loop at bottom goal.
  26232. --- Change Working Memory (PE) ---
  26233. =>WM: (15185: I3 ^predict-no N1080)
  26234. <=WM: (15172: N1079 ^status complete)
  26235. <=WM: (15171: I3 ^predict-no N1079)
  26236. --- Firing Productions (IE) For State At Depth 1 ---
  26237. --- Inner Elaboration Phase, active level 1 (S1) ---
  26238. Firing monitor*world
  26239. -->
  26240. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26241. --- Change Working Memory (IE) ---
  26242. --- END Application Phase ---
  26243. --- Output Phase ---
  26244. ENV: Agent did: predict-no for direction U in state State-B
  26245. In State-B moving U
  26246. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26247. predict error 0
  26248. dir: dir isL
  26249. --- END Output Phase ---
  26250. --- Input Phase ---
  26251. =>WM: (15189: I2 ^dir L)
  26252. =>WM: (15188: I2 ^reward 1)
  26253. =>WM: (15187: I2 ^see 0)
  26254. =>WM: (15186: N1080 ^status complete)
  26255. <=WM: (15175: I2 ^dir U)
  26256. <=WM: (15174: I2 ^reward 1)
  26257. <=WM: (15173: I2 ^see 0)
  26258. =>WM: (15190: I2 ^level-1 R0-root)
  26259. <=WM: (15176: I2 ^level-1 R0-root)
  26260. --- END Input Phase ---
  26261. --- Proposal Phase ---
  26262. --- Inner Elaboration Phase, active level 1 (S1) ---
  26263. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  26264. -->
  26265. (S1 ^operator O2160 = -0.1984300550322165)
  26266. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  26267. -->
  26268. (S1 ^operator O2159 = 0.6091663904275534)
  26269. Firing prefer*rvt*predict-no*H0*4*H1
  26270. -->
  26271. Firing prefer*rvt*predict-yes*H0*3*H1
  26272. -->
  26273. Firing elaborate*copy-see-to-output-link
  26274. -->
  26275. (I3 ^see 0 +)
  26276. Firing elaborate*reward*based*on*reward
  26277. -->
  26278. (R1084 ^value 1 +)
  26279. (R1 ^reward R1084 +)
  26280. Firing propose*predict-yes
  26281. -->
  26282. (O2161 ^name predict-yes +)
  26283. (S1 ^operator O2161 +)
  26284. Firing propose*predict-no
  26285. -->
  26286. (O2162 ^name predict-no +)
  26287. (S1 ^operator O2162 +)
  26288. Firing rl*prefer*rvt*predict-no*H0*4
  26289. -->
  26290. (S1 ^operator O2160 = 0.3144946769214089)
  26291. Firing rl*prefer*rvt*predict-yes*H0*3
  26292. -->
  26293. (S1 ^operator O2159 = 0.390779173043162)
  26294. Firing prefer*rvt*predict-yes*H0
  26295. -->
  26296. Firing prefer*rvt*predict-no*H0
  26297. -->
  26298. Firing elaborate*copy-dir-to-output-link
  26299. -->
  26300. (I3 ^dir L +)
  26301. inner elaboration loop at bottom goal.
  26302. Retracting elaborate*copy-see-to-output-link
  26303. -->
  26304. (I3 ^see 0 +)
  26305. Retracting propose*predict-no
  26306. -->
  26307. (O2160 ^name predict-no +)
  26308. (S1 ^operator O2160 +)
  26309. Retracting propose*predict-yes
  26310. -->
  26311. (O2159 ^name predict-yes +)
  26312. (S1 ^operator O2159 +)
  26313. Retracting elaborate*reward*based*on*reward
  26314. -->
  26315. (R1083 ^value 1 +)
  26316. (R1 ^reward R1083 +)
  26317. Retracting elaborate*copy-dir-to-output-link
  26318. -->
  26319. (I3 ^dir U +)
  26320. Retracting rl*prefer*rvt*predict-no*H0*2
  26321. -->
  26322. (S1 ^operator O2160 = 1.)
  26323. Retracting rl*prefer*rvt*predict-yes*H0*1
  26324. -->
  26325. (S1 ^operator O2159 = 0.)
  26326. =>WM: (15197: S1 ^operator O2162 +)
  26327. =>WM: (15196: S1 ^operator O2161 +)
  26328. =>WM: (15195: I3 ^dir L)
  26329. =>WM: (15194: O2162 ^name predict-no)
  26330. =>WM: (15193: O2161 ^name predict-yes)
  26331. =>WM: (15192: R1084 ^value 1)
  26332. =>WM: (15191: R1 ^reward R1084)
  26333. <=WM: (15182: S1 ^operator O2159 +)
  26334. <=WM: (15183: S1 ^operator O2160 +)
  26335. <=WM: (15184: S1 ^operator O2160)
  26336. <=WM: (15181: I3 ^dir U)
  26337. <=WM: (15177: R1 ^reward R1083)
  26338. <=WM: (15180: O2160 ^name predict-no)
  26339. <=WM: (15179: O2159 ^name predict-yes)
  26340. <=WM: (15178: R1083 ^value 1)
  26341. --- Inner Elaboration Phase, active level 1 (S1) ---
  26342. Firing prefer*rvt*predict-yes*H0
  26343. -->
  26344. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  26345. -->
  26346. (S1 ^operator O2161 = 0.6091663904275534)
  26347. Firing rl*prefer*rvt*predict-yes*H0*3
  26348. -->
  26349. (S1 ^operator O2161 = 0.390779173043162)
  26350. Firing prefer*rvt*predict-yes*H0*3*H1
  26351. -->
  26352. Firing prefer*rvt*predict-no*H0
  26353. -->
  26354. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  26355. -->
  26356. (S1 ^operator O2162 = -0.1984300550322165)
  26357. Firing rl*prefer*rvt*predict-no*H0*4
  26358. -->
  26359. (S1 ^operator O2162 = 0.3144946769214089)
  26360. Firing prefer*rvt*predict-no*H0*4*H1
  26361. -->
  26362. inner elaboration loop at bottom goal.
  26363. Retracting rl*prefer*rvt*predict-no*H0*4
  26364. -->
  26365. (S1 ^operator O2160 = 0.3144946769214089)
  26366. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  26367. -->
  26368. (S1 ^operator O2160 = -0.1984300550322165)
  26369. Retracting rl*prefer*rvt*predict-yes*H0*3
  26370. -->
  26371. (S1 ^operator O2159 = 0.390779173043162)
  26372. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  26373. -->
  26374. (S1 ^operator O2159 = 0.6091663904275534)
  26375. --- END Proposal Phase ---
  26376. --- Decision Phase ---
  26377. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  26378. =>WM: (15198: S1 ^operator O2161)
  26379. 1081: O: O2161 (predict-yes)
  26380. --- END Decision Phase ---
  26381. --- Application Phase ---
  26382. --- Firing Productions (PE) For State At Depth 1 ---
  26383. --- Inner Elaboration Phase, active level 1 (S1) ---
  26384. Firing apply*operator
  26385. -->
  26386. (I3 ^predict-yes N1081 + :O )
  26387. Firing apply*operator*complete
  26388. -->
  26389. (I3 ^predict-no N1080 - :O )
  26390. inner elaboration loop at bottom goal.
  26391. --- Change Working Memory (PE) ---
  26392. =>WM: (15199: I3 ^predict-yes N1081)
  26393. <=WM: (15186: N1080 ^status complete)
  26394. <=WM: (15185: I3 ^predict-no N1080)
  26395. --- Firing Productions (IE) For State At Depth 1 ---
  26396. --- Inner Elaboration Phase, active level 1 (S1) ---
  26397. Firing monitor*world
  26398. -->
  26399. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26400. --- Change Working Memory (IE) ---
  26401. --- END Application Phase ---
  26402. --- Output Phase ---
  26403. ENV: Agent did: predict-yes for direction L in state State-B
  26404. In State-B moving L
  26405. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26406. predict error 0
  26407. dir: dir isR
  26408. --- END Output Phase ---
  26409. --- Input Phase ---
  26410. =>WM: (15203: I2 ^dir R)
  26411. =>WM: (15202: I2 ^reward 1)
  26412. =>WM: (15201: I2 ^see 1)
  26413. =>WM: (15200: N1081 ^status complete)
  26414. <=WM: (15189: I2 ^dir L)
  26415. <=WM: (15188: I2 ^reward 1)
  26416. <=WM: (15187: I2 ^see 0)
  26417. =>WM: (15204: I2 ^level-1 L1-root)
  26418. <=WM: (15190: I2 ^level-1 R0-root)
  26419. --- END Input Phase ---
  26420. --- Proposal Phase ---
  26421. --- Inner Elaboration Phase, active level 1 (S1) ---
  26422. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  26423. -->
  26424. (S1 ^operator O2161 = 0.8784073733635152)
  26425. Firing prefer*rvt*predict-yes*H0*5*H1
  26426. -->
  26427. Firing elaborate*copy-see-to-output-link
  26428. -->
  26429. (I3 ^see 1 +)
  26430. Firing elaborate*reward*based*on*reward
  26431. -->
  26432. (R1085 ^value 1 +)
  26433. (R1 ^reward R1085 +)
  26434. Firing propose*predict-yes
  26435. -->
  26436. (O2163 ^name predict-yes +)
  26437. (S1 ^operator O2163 +)
  26438. Firing propose*predict-no
  26439. -->
  26440. (O2164 ^name predict-no +)
  26441. (S1 ^operator O2164 +)
  26442. Firing rl*prefer*rvt*predict-no*H0*6
  26443. -->
  26444. (S1 ^operator O2162 = 0.9329132455998342)
  26445. Firing rl*prefer*rvt*predict-yes*H0*5
  26446. -->
  26447. (S1 ^operator O2161 = 0.1215964214230049)
  26448. Firing prefer*rvt*predict-yes*H0
  26449. -->
  26450. Firing prefer*rvt*predict-no*H0
  26451. -->
  26452. Firing elaborate*copy-dir-to-output-link
  26453. -->
  26454. (I3 ^dir R +)
  26455. inner elaboration loop at bottom goal.
  26456. Retracting elaborate*copy-see-to-output-link
  26457. -->
  26458. (I3 ^see 0 +)
  26459. Retracting propose*predict-no
  26460. -->
  26461. (O2162 ^name predict-no +)
  26462. (S1 ^operator O2162 +)
  26463. Retracting propose*predict-yes
  26464. -->
  26465. (O2161 ^name predict-yes +)
  26466. (S1 ^operator O2161 +)
  26467. Retracting elaborate*reward*based*on*reward
  26468. -->
  26469. (R1084 ^value 1 +)
  26470. (R1 ^reward R1084 +)
  26471. Retracting elaborate*copy-dir-to-output-link
  26472. -->
  26473. (I3 ^dir L +)
  26474. Retracting rl*prefer*rvt*predict-no*H0*4
  26475. -->
  26476. (S1 ^operator O2162 = 0.3144946769214089)
  26477. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  26478. -->
  26479. (S1 ^operator O2162 = -0.1984300550322165)
  26480. Retracting rl*prefer*rvt*predict-yes*H0*3
  26481. -->
  26482. (S1 ^operator O2161 = 0.390779173043162)
  26483. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  26484. -->
  26485. (S1 ^operator O2161 = 0.6091663904275534)
  26486. =>WM: (15212: S1 ^operator O2164 +)
  26487. =>WM: (15211: S1 ^operator O2163 +)
  26488. =>WM: (15210: I3 ^dir R)
  26489. =>WM: (15209: O2164 ^name predict-no)
  26490. =>WM: (15208: O2163 ^name predict-yes)
  26491. =>WM: (15207: R1085 ^value 1)
  26492. =>WM: (15206: R1 ^reward R1085)
  26493. =>WM: (15205: I3 ^see 1)
  26494. <=WM: (15196: S1 ^operator O2161 +)
  26495. <=WM: (15198: S1 ^operator O2161)
  26496. <=WM: (15197: S1 ^operator O2162 +)
  26497. <=WM: (15195: I3 ^dir L)
  26498. <=WM: (15191: R1 ^reward R1084)
  26499. <=WM: (15163: I3 ^see 0)
  26500. <=WM: (15194: O2162 ^name predict-no)
  26501. <=WM: (15193: O2161 ^name predict-yes)
  26502. <=WM: (15192: R1084 ^value 1)
  26503. --- Inner Elaboration Phase, active level 1 (S1) ---
  26504. Firing prefer*rvt*predict-yes*H0
  26505. -->
  26506. Firing rl*prefer*rvt*predict-yes*H0*5
  26507. -->
  26508. (S1 ^operator O2163 = 0.1215964214230049)
  26509. Firing prefer*rvt*predict-yes*H0*5*H1
  26510. -->
  26511. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  26512. -->
  26513. (S1 ^operator O2163 = 0.8784073733635152)
  26514. Firing prefer*rvt*predict-no*H0
  26515. -->
  26516. Firing rl*prefer*rvt*predict-no*H0*6
  26517. -->
  26518. (S1 ^operator O2164 = 0.9329132455998342)
  26519. inner elaboration loop at bottom goal.
  26520. Retracting rl*prefer*rvt*predict-no*H0*6
  26521. -->
  26522. (S1 ^operator O2162 = 0.9329132455998342)
  26523. Retracting rl*prefer*rvt*predict-yes*H0*5
  26524. -->
  26525. (S1 ^operator O2161 = 0.1215964214230049)
  26526. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  26527. -->
  26528. (S1 ^operator O2161 = 0.8784073733635152)
  26529. --- END Proposal Phase ---
  26530. --- Decision Phase ---
  26531. RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815457 0.390779 -> 0.472329 -0.0815451 0.390784(R,m,v=1,0.949153,0.0485362)
  26532. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527629 0.0815377 0.609166 -> 0.527633 0.0815384 0.609171(R,m,v=1,1,0)
  26533. =>WM: (15213: S1 ^operator O2163)
  26534. 1082: O: O2163 (predict-yes)
  26535. --- END Decision Phase ---
  26536. --- Application Phase ---
  26537. --- Firing Productions (PE) For State At Depth 1 ---
  26538. --- Inner Elaboration Phase, active level 1 (S1) ---
  26539. Firing apply*operator
  26540. -->
  26541. (I3 ^predict-yes N1082 + :O )
  26542. Firing apply*operator*complete
  26543. -->
  26544. (I3 ^predict-yes N1081 - :O )
  26545. inner elaboration loop at bottom goal.
  26546. --- Change Working Memory (PE) ---
  26547. =>WM: (15214: I3 ^predict-yes N1082)
  26548. <=WM: (15200: N1081 ^status complete)
  26549. <=WM: (15199: I3 ^predict-yes N1081)
  26550. --- Firing Productions (IE) For State At Depth 1 ---
  26551. --- Inner Elaboration Phase, active level 1 (S1) ---
  26552. Firing monitor*world
  26553. -->
  26554. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26555. --- Change Working Memory (IE) ---
  26556. --- END Application Phase ---
  26557. --- Output Phase ---
  26558. ENV: Agent did: predict-yes for direction R in state State-A
  26559. In State-A moving R
  26560. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  26561. predict error 0
  26562. dir: dir isR
  26563. --- END Output Phase ---
  26564. --- Input Phase ---
  26565. =>WM: (15218: I2 ^dir R)
  26566. =>WM: (15217: I2 ^reward 1)
  26567. =>WM: (15216: I2 ^see 1)
  26568. =>WM: (15215: N1082 ^status complete)
  26569. <=WM: (15203: I2 ^dir R)
  26570. <=WM: (15202: I2 ^reward 1)
  26571. <=WM: (15201: I2 ^see 1)
  26572. =>WM: (15219: I2 ^level-1 R1-root)
  26573. <=WM: (15204: I2 ^level-1 L1-root)
  26574. --- END Input Phase ---
  26575. --- Proposal Phase ---
  26576. --- Inner Elaboration Phase, active level 1 (S1) ---
  26577. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  26578. -->
  26579. (S1 ^operator O2163 = -0.04253361215288998)
  26580. Firing prefer*rvt*predict-yes*H0*5*H1
  26581. -->
  26582. Firing elaborate*copy-see-to-output-link
  26583. -->
  26584. (I3 ^see 1 +)
  26585. Firing elaborate*reward*based*on*reward
  26586. -->
  26587. (R1086 ^value 1 +)
  26588. (R1 ^reward R1086 +)
  26589. Firing propose*predict-yes
  26590. -->
  26591. (O2165 ^name predict-yes +)
  26592. (S1 ^operator O2165 +)
  26593. Firing propose*predict-no
  26594. -->
  26595. (O2166 ^name predict-no +)
  26596. (S1 ^operator O2166 +)
  26597. Firing rl*prefer*rvt*predict-no*H0*6
  26598. -->
  26599. (S1 ^operator O2164 = 0.9329132455998342)
  26600. Firing rl*prefer*rvt*predict-yes*H0*5
  26601. -->
  26602. (S1 ^operator O2163 = 0.1215964214230049)
  26603. Firing prefer*rvt*predict-yes*H0
  26604. -->
  26605. Firing prefer*rvt*predict-no*H0
  26606. -->
  26607. Firing elaborate*copy-dir-to-output-link
  26608. -->
  26609. (I3 ^dir R +)
  26610. inner elaboration loop at bottom goal.
  26611. Retracting elaborate*copy-see-to-output-link
  26612. -->
  26613. (I3 ^see 1 +)
  26614. Retracting propose*predict-no
  26615. -->
  26616. (O2164 ^name predict-no +)
  26617. (S1 ^operator O2164 +)
  26618. Retracting propose*predict-yes
  26619. -->
  26620. (O2163 ^name predict-yes +)
  26621. (S1 ^operator O2163 +)
  26622. Retracting elaborate*reward*based*on*reward
  26623. -->
  26624. (R1085 ^value 1 +)
  26625. (R1 ^reward R1085 +)
  26626. Retracting elaborate*copy-dir-to-output-link
  26627. -->
  26628. (I3 ^dir R +)
  26629. Retracting rl*prefer*rvt*predict-no*H0*6
  26630. -->
  26631. (S1 ^operator O2164 = 0.9329132455998342)
  26632. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  26633. -->
  26634. (S1 ^operator O2163 = 0.8784073733635152)
  26635. Retracting rl*prefer*rvt*predict-yes*H0*5
  26636. -->
  26637. (S1 ^operator O2163 = 0.1215964214230049)
  26638. =>WM: (15225: S1 ^operator O2166 +)
  26639. =>WM: (15224: S1 ^operator O2165 +)
  26640. =>WM: (15223: O2166 ^name predict-no)
  26641. =>WM: (15222: O2165 ^name predict-yes)
  26642. =>WM: (15221: R1086 ^value 1)
  26643. =>WM: (15220: R1 ^reward R1086)
  26644. <=WM: (15211: S1 ^operator O2163 +)
  26645. <=WM: (15213: S1 ^operator O2163)
  26646. <=WM: (15212: S1 ^operator O2164 +)
  26647. <=WM: (15206: R1 ^reward R1085)
  26648. <=WM: (15209: O2164 ^name predict-no)
  26649. <=WM: (15208: O2163 ^name predict-yes)
  26650. <=WM: (15207: R1085 ^value 1)
  26651. --- Inner Elaboration Phase, active level 1 (S1) ---
  26652. Firing prefer*rvt*predict-yes*H0
  26653. -->
  26654. Firing rl*prefer*rvt*predict-yes*H0*5
  26655. -->
  26656. (S1 ^operator O2165 = 0.1215964214230049)
  26657. Firing prefer*rvt*predict-yes*H0*5*H1
  26658. -->
  26659. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  26660. -->
  26661. (S1 ^operator O2165 = -0.04253361215288998)
  26662. Firing prefer*rvt*predict-no*H0
  26663. -->
  26664. Firing rl*prefer*rvt*predict-no*H0*6
  26665. -->
  26666. (S1 ^operator O2166 = 0.9329132455998342)
  26667. inner elaboration loop at bottom goal.
  26668. Retracting rl*prefer*rvt*predict-no*H0*6
  26669. -->
  26670. (S1 ^operator O2164 = 0.9329132455998342)
  26671. Retracting rl*prefer*rvt*predict-yes*H0*5
  26672. -->
  26673. (S1 ^operator O2163 = 0.1215964214230049)
  26674. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  26675. -->
  26676. (S1 ^operator O2163 = -0.04253361215288998)
  26677. --- END Proposal Phase ---
  26678. --- Decision Phase ---
  26679. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.875648,0.109456)
  26680. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
  26681. =>WM: (15226: S1 ^operator O2166)
  26682. 1083: O: O2166 (predict-no)
  26683. --- END Decision Phase ---
  26684. --- Application Phase ---
  26685. --- Firing Productions (PE) For State At Depth 1 ---
  26686. --- Inner Elaboration Phase, active level 1 (S1) ---
  26687. Firing apply*operator
  26688. -->
  26689. (I3 ^predict-no N1083 + :O )
  26690. Firing apply*operator*complete
  26691. -->
  26692. (I3 ^predict-yes N1082 - :O )
  26693. inner elaboration loop at bottom goal.
  26694. --- Change Working Memory (PE) ---
  26695. =>WM: (15227: I3 ^predict-no N1083)
  26696. <=WM: (15215: N1082 ^status complete)
  26697. <=WM: (15214: I3 ^predict-yes N1082)
  26698. --- Firing Productions (IE) For State At Depth 1 ---
  26699. --- Inner Elaboration Phase, active level 1 (S1) ---
  26700. Firing monitor*world
  26701. -->
  26702. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  26703. --- Change Working Memory (IE) ---
  26704. --- END Application Phase ---
  26705. --- Output Phase ---
  26706. ENV: Agent did: predict-no for direction R in state State-B
  26707. In State-B moving R
  26708. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  26709. predict error 0
  26710. dir: dir isL
  26711. --- END Output Phase ---
  26712. -/--- Input Phase ---
  26713. =>WM: (15231: I2 ^dir L)
  26714. =>WM: (15230: I2 ^reward 1)
  26715. =>WM: (15229: I2 ^see 0)
  26716. =>WM: (15228: N1083 ^status complete)
  26717. <=WM: (15218: I2 ^dir R)
  26718. <=WM: (15217: I2 ^reward 1)
  26719. <=WM: (15216: I2 ^see 1)
  26720. =>WM: (15232: I2 ^level-1 R0-root)
  26721. <=WM: (15219: I2 ^level-1 R1-root)
  26722. --- END Input Phase ---
  26723. --- Proposal Phase ---
  26724. --- Inner Elaboration Phase, active level 1 (S1) ---
  26725. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  26726. -->
  26727. (S1 ^operator O2166 = -0.1984300550322165)
  26728. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  26729. -->
  26730. (S1 ^operator O2165 = 0.6091713913477592)
  26731. Firing prefer*rvt*predict-no*H0*4*H1
  26732. -->
  26733. Firing prefer*rvt*predict-yes*H0*3*H1
  26734. -->
  26735. Firing elaborate*copy-see-to-output-link
  26736. -->
  26737. (I3 ^see 0 +)
  26738. Firing elaborate*reward*based*on*reward
  26739. -->
  26740. (R1087 ^value 1 +)
  26741. (R1 ^reward R1087 +)
  26742. Firing propose*predict-yes
  26743. -->
  26744. (O2167 ^name predict-yes +)
  26745. (S1 ^operator O2167 +)
  26746. Firing propose*predict-no
  26747. -->
  26748. (O2168 ^name predict-no +)
  26749. (S1 ^operator O2168 +)
  26750. Firing rl*prefer*rvt*predict-no*H0*4
  26751. -->
  26752. (S1 ^operator O2166 = 0.3144946769214089)
  26753. Firing rl*prefer*rvt*predict-yes*H0*3
  26754. -->
  26755. (S1 ^operator O2165 = 0.3907835800387532)
  26756. Firing prefer*rvt*predict-yes*H0
  26757. -->
  26758. Firing prefer*rvt*predict-no*H0
  26759. -->
  26760. Firing elaborate*copy-dir-to-output-link
  26761. -->
  26762. (I3 ^dir L +)
  26763. inner elaboration loop at bottom goal.
  26764. Retracting elaborate*copy-see-to-output-link
  26765. -->
  26766. (I3 ^see 1 +)
  26767. Retracting propose*predict-no
  26768. -->
  26769. (O2166 ^name predict-no +)
  26770. (S1 ^operator O2166 +)
  26771. Retracting propose*predict-yes
  26772. -->
  26773. (O2165 ^name predict-yes +)
  26774. (S1 ^operator O2165 +)
  26775. Retracting elaborate*reward*based*on*reward
  26776. -->
  26777. (R1086 ^value 1 +)
  26778. (R1 ^reward R1086 +)
  26779. Retracting elaborate*copy-dir-to-output-link
  26780. -->
  26781. (I3 ^dir R +)
  26782. Retracting rl*prefer*rvt*predict-no*H0*6
  26783. -->
  26784. (S1 ^operator O2166 = 0.9329132455998342)
  26785. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  26786. -->
  26787. (S1 ^operator O2165 = -0.04253361215288998)
  26788. Retracting rl*prefer*rvt*predict-yes*H0*5
  26789. -->
  26790. (S1 ^operator O2165 = 0.1215961184552382)
  26791. =>WM: (15240: S1 ^operator O2168 +)
  26792. =>WM: (15239: S1 ^operator O2167 +)
  26793. =>WM: (15238: I3 ^dir L)
  26794. =>WM: (15237: O2168 ^name predict-no)
  26795. =>WM: (15236: O2167 ^name predict-yes)
  26796. =>WM: (15235: R1087 ^value 1)
  26797. =>WM: (15234: R1 ^reward R1087)
  26798. =>WM: (15233: I3 ^see 0)
  26799. <=WM: (15224: S1 ^operator O2165 +)
  26800. <=WM: (15225: S1 ^operator O2166 +)
  26801. <=WM: (15226: S1 ^operator O2166)
  26802. <=WM: (15210: I3 ^dir R)
  26803. <=WM: (15220: R1 ^reward R1086)
  26804. <=WM: (15205: I3 ^see 1)
  26805. <=WM: (15223: O2166 ^name predict-no)
  26806. <=WM: (15222: O2165 ^name predict-yes)
  26807. <=WM: (15221: R1086 ^value 1)
  26808. --- Inner Elaboration Phase, active level 1 (S1) ---
  26809. Firing prefer*rvt*predict-yes*H0
  26810. -->
  26811. Firing rl*prefer*rvt*predict-yes*H0*3
  26812. -->
  26813. (S1 ^operator O2167 = 0.3907835800387532)
  26814. Firing prefer*rvt*predict-yes*H0*3*H1
  26815. -->
  26816. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  26817. -->
  26818. (S1 ^operator O2167 = 0.6091713913477592)
  26819. Firing prefer*rvt*predict-no*H0
  26820. -->
  26821. Firing rl*prefer*rvt*predict-no*H0*4
  26822. -->
  26823. (S1 ^operator O2168 = 0.3144946769214089)
  26824. Firing prefer*rvt*predict-no*H0*4*H1
  26825. -->
  26826. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  26827. -->
  26828. (S1 ^operator O2168 = -0.1984300550322165)
  26829. inner elaboration loop at bottom goal.
  26830. Retracting rl*prefer*rvt*predict-no*H0*4
  26831. -->
  26832. (S1 ^operator O2166 = 0.3144946769214089)
  26833. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  26834. -->
  26835. (S1 ^operator O2166 = -0.1984300550322165)
  26836. Retracting rl*prefer*rvt*predict-yes*H0*3
  26837. -->
  26838. (S1 ^operator O2165 = 0.3907835800387532)
  26839. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  26840. -->
  26841. (S1 ^operator O2165 = 0.6091713913477592)
  26842. --- END Proposal Phase ---
  26843. --- Decision Phase ---
  26844. RL update rl*prefer*rvt*predict-no*H0*6 0.932913 0 0.932913 -> 0.943625 0 0.943625(R,m,v=1,0.937824,0.058614)
  26845. =>WM: (15241: S1 ^operator O2167)
  26846. 1084: O: O2167 (predict-yes)
  26847. --- END Decision Phase ---
  26848. --- Application Phase ---
  26849. --- Firing Productions (PE) For State At Depth 1 ---
  26850. --- Inner Elaboration Phase, active level 1 (S1) ---
  26851. Firing apply*operator
  26852. -->
  26853. (I3 ^predict-yes N1084 + :O )
  26854. Firing apply*operator*complete
  26855. -->
  26856. (I3 ^predict-no N1083 - :O )
  26857. inner elaboration loop at bottom goal.
  26858. --- Change Working Memory (PE) ---
  26859. =>WM: (15242: I3 ^predict-yes N1084)
  26860. <=WM: (15228: N1083 ^status complete)
  26861. <=WM: (15227: I3 ^predict-no N1083)
  26862. --- Firing Productions (IE) For State At Depth 1 ---
  26863. --- Inner Elaboration Phase, active level 1 (S1) ---
  26864. Firing monitor*world
  26865. -->
  26866. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  26867. --- Change Working Memory (IE) ---
  26868. --- END Application Phase ---
  26869. --- Output Phase ---
  26870. ENV: Agent did: predict-yes for direction L in state State-B
  26871. In State-B moving L
  26872. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  26873. predict error 0
  26874. dir: dir isL
  26875. --- END Output Phase ---
  26876. |\--- Input Phase ---
  26877. =>WM: (15246: I2 ^dir L)
  26878. =>WM: (15245: I2 ^reward 1)
  26879. =>WM: (15244: I2 ^see 1)
  26880. =>WM: (15243: N1084 ^status complete)
  26881. <=WM: (15231: I2 ^dir L)
  26882. <=WM: (15230: I2 ^reward 1)
  26883. <=WM: (15229: I2 ^see 0)
  26884. =>WM: (15247: I2 ^level-1 L1-root)
  26885. <=WM: (15232: I2 ^level-1 R0-root)
  26886. --- END Input Phase ---
  26887. --- Proposal Phase ---
  26888. --- Inner Elaboration Phase, active level 1 (S1) ---
  26889. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  26890. -->
  26891. (S1 ^operator O2167 = -0.2062723012911647)
  26892. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  26893. -->
  26894. (S1 ^operator O2168 = 0.6855152344977683)
  26895. Firing prefer*rvt*predict-no*H0*4*H1
  26896. -->
  26897. Firing prefer*rvt*predict-yes*H0*3*H1
  26898. -->
  26899. Firing elaborate*copy-see-to-output-link
  26900. -->
  26901. (I3 ^see 1 +)
  26902. Firing elaborate*reward*based*on*reward
  26903. -->
  26904. (R1088 ^value 1 +)
  26905. (R1 ^reward R1088 +)
  26906. Firing propose*predict-yes
  26907. -->
  26908. (O2169 ^name predict-yes +)
  26909. (S1 ^operator O2169 +)
  26910. Firing propose*predict-no
  26911. -->
  26912. (O2170 ^name predict-no +)
  26913. (S1 ^operator O2170 +)
  26914. Firing rl*prefer*rvt*predict-no*H0*4
  26915. -->
  26916. (S1 ^operator O2168 = 0.3144946769214089)
  26917. Firing rl*prefer*rvt*predict-yes*H0*3
  26918. -->
  26919. (S1 ^operator O2167 = 0.3907835800387532)
  26920. Firing prefer*rvt*predict-yes*H0
  26921. -->
  26922. Firing prefer*rvt*predict-no*H0
  26923. -->
  26924. Firing elaborate*copy-dir-to-output-link
  26925. -->
  26926. (I3 ^dir L +)
  26927. inner elaboration loop at bottom goal.
  26928. Retracting elaborate*copy-see-to-output-link
  26929. -->
  26930. (I3 ^see 0 +)
  26931. Retracting propose*predict-no
  26932. -->
  26933. (O2168 ^name predict-no +)
  26934. (S1 ^operator O2168 +)
  26935. Retracting propose*predict-yes
  26936. -->
  26937. (O2167 ^name predict-yes +)
  26938. (S1 ^operator O2167 +)
  26939. Retracting elaborate*reward*based*on*reward
  26940. -->
  26941. (R1087 ^value 1 +)
  26942. (R1 ^reward R1087 +)
  26943. Retracting elaborate*copy-dir-to-output-link
  26944. -->
  26945. (I3 ^dir L +)
  26946. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  26947. -->
  26948. (S1 ^operator O2168 = -0.1984300550322165)
  26949. Retracting rl*prefer*rvt*predict-no*H0*4
  26950. -->
  26951. (S1 ^operator O2168 = 0.3144946769214089)
  26952. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  26953. -->
  26954. (S1 ^operator O2167 = 0.6091713913477592)
  26955. Retracting rl*prefer*rvt*predict-yes*H0*3
  26956. -->
  26957. (S1 ^operator O2167 = 0.3907835800387532)
  26958. =>WM: (15254: S1 ^operator O2170 +)
  26959. =>WM: (15253: S1 ^operator O2169 +)
  26960. =>WM: (15252: O2170 ^name predict-no)
  26961. =>WM: (15251: O2169 ^name predict-yes)
  26962. =>WM: (15250: R1088 ^value 1)
  26963. =>WM: (15249: R1 ^reward R1088)
  26964. =>WM: (15248: I3 ^see 1)
  26965. <=WM: (15239: S1 ^operator O2167 +)
  26966. <=WM: (15241: S1 ^operator O2167)
  26967. <=WM: (15240: S1 ^operator O2168 +)
  26968. <=WM: (15234: R1 ^reward R1087)
  26969. <=WM: (15233: I3 ^see 0)
  26970. <=WM: (15237: O2168 ^name predict-no)
  26971. <=WM: (15236: O2167 ^name predict-yes)
  26972. <=WM: (15235: R1087 ^value 1)
  26973. --- Inner Elaboration Phase, active level 1 (S1) ---
  26974. Firing prefer*rvt*predict-yes*H0
  26975. -->
  26976. Firing rl*prefer*rvt*predict-yes*H0*3
  26977. -->
  26978. (S1 ^operator O2169 = 0.3907835800387532)
  26979. Firing prefer*rvt*predict-yes*H0*3*H1
  26980. -->
  26981. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  26982. -->
  26983. (S1 ^operator O2169 = -0.2062723012911647)
  26984. Firing prefer*rvt*predict-no*H0
  26985. -->
  26986. Firing rl*prefer*rvt*predict-no*H0*4
  26987. -->
  26988. (S1 ^operator O2170 = 0.3144946769214089)
  26989. Firing prefer*rvt*predict-no*H0*4*H1
  26990. -->
  26991. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  26992. -->
  26993. (S1 ^operator O2170 = 0.6855152344977683)
  26994. inner elaboration loop at bottom goal.
  26995. Retracting rl*prefer*rvt*predict-no*H0*4
  26996. -->
  26997. (S1 ^operator O2168 = 0.3144946769214089)
  26998. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  26999. -->
  27000. (S1 ^operator O2168 = 0.6855152344977683)
  27001. Retracting rl*prefer*rvt*predict-yes*H0*3
  27002. -->
  27003. (S1 ^operator O2167 = 0.3907835800387532)
  27004. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  27005. -->
  27006. (S1 ^operator O2167 = -0.2062723012911647)
  27007. --- END Proposal Phase ---
  27008. --- Decision Phase ---
  27009. RL update rl*prefer*rvt*predict-yes*H0*3 0.472329 -0.0815451 0.390784 -> 0.472332 -0.0815445 0.390787(R,m,v=1,0.949438,0.0482765)
  27010. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527633 0.0815384 0.609171 -> 0.527637 0.081539 0.609176(R,m,v=1,1,0)
  27011. =>WM: (15255: S1 ^operator O2170)
  27012. 1085: O: O2170 (predict-no)
  27013. --- END Decision Phase ---
  27014. --- Application Phase ---
  27015. --- Firing Productions (PE) For State At Depth 1 ---
  27016. --- Inner Elaboration Phase, active level 1 (S1) ---
  27017. Firing apply*operator
  27018. -->
  27019. (I3 ^predict-no N1085 + :O )
  27020. Firing apply*operator*complete
  27021. -->
  27022. (I3 ^predict-yes N1084 - :O )
  27023. inner elaboration loop at bottom goal.
  27024. --- Change Working Memory (PE) ---
  27025. =>WM: (15256: I3 ^predict-no N1085)
  27026. <=WM: (15243: N1084 ^status complete)
  27027. <=WM: (15242: I3 ^predict-yes N1084)
  27028. --- Firing Productions (IE) For State At Depth 1 ---
  27029. --- Inner Elaboration Phase, active level 1 (S1) ---
  27030. Firing monitor*world
  27031. -->
  27032. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27033. --- Change Working Memory (IE) ---
  27034. --- END Application Phase ---
  27035. --- Output Phase ---
  27036. ENV: Agent did: predict-no for direction L in state State-A
  27037. In State-A moving L
  27038. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27039. predict error 0
  27040. dir: dir isU
  27041. --- END Output Phase ---
  27042. -/|--- Input Phase ---
  27043. =>WM: (15260: I2 ^dir U)
  27044. =>WM: (15259: I2 ^reward 1)
  27045. =>WM: (15258: I2 ^see 0)
  27046. =>WM: (15257: N1085 ^status complete)
  27047. <=WM: (15246: I2 ^dir L)
  27048. <=WM: (15245: I2 ^reward 1)
  27049. <=WM: (15244: I2 ^see 1)
  27050. =>WM: (15261: I2 ^level-1 L0-root)
  27051. <=WM: (15247: I2 ^level-1 L1-root)
  27052. --- END Input Phase ---
  27053. --- Proposal Phase ---
  27054. --- Inner Elaboration Phase, active level 1 (S1) ---
  27055. Firing elaborate*copy-see-to-output-link
  27056. -->
  27057. (I3 ^see 0 +)
  27058. Firing elaborate*reward*based*on*reward
  27059. -->
  27060. (R1089 ^value 1 +)
  27061. (R1 ^reward R1089 +)
  27062. Firing propose*predict-yes
  27063. -->
  27064. (O2171 ^name predict-yes +)
  27065. (S1 ^operator O2171 +)
  27066. Firing propose*predict-no
  27067. -->
  27068. (O2172 ^name predict-no +)
  27069. (S1 ^operator O2172 +)
  27070. Firing rl*prefer*rvt*predict-no*H0*2
  27071. -->
  27072. (S1 ^operator O2170 = 1.)
  27073. Firing rl*prefer*rvt*predict-yes*H0*1
  27074. -->
  27075. (S1 ^operator O2169 = 0.)
  27076. Firing prefer*rvt*predict-yes*H0
  27077. -->
  27078. Firing prefer*rvt*predict-no*H0
  27079. -->
  27080. Firing elaborate*copy-dir-to-output-link
  27081. -->
  27082. (I3 ^dir U +)
  27083. inner elaboration loop at bottom goal.
  27084. Retracting elaborate*copy-see-to-output-link
  27085. -->
  27086. (I3 ^see 1 +)
  27087. Retracting propose*predict-no
  27088. -->
  27089. (O2170 ^name predict-no +)
  27090. (S1 ^operator O2170 +)
  27091. Retracting propose*predict-yes
  27092. -->
  27093. (O2169 ^name predict-yes +)
  27094. (S1 ^operator O2169 +)
  27095. Retracting elaborate*reward*based*on*reward
  27096. -->
  27097. (R1088 ^value 1 +)
  27098. (R1 ^reward R1088 +)
  27099. Retracting elaborate*copy-dir-to-output-link
  27100. -->
  27101. (I3 ^dir L +)
  27102. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  27103. -->
  27104. (S1 ^operator O2170 = 0.6855152344977683)
  27105. Retracting rl*prefer*rvt*predict-no*H0*4
  27106. -->
  27107. (S1 ^operator O2170 = 0.3144946769214089)
  27108. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  27109. -->
  27110. (S1 ^operator O2169 = -0.2062723012911647)
  27111. Retracting rl*prefer*rvt*predict-yes*H0*3
  27112. -->
  27113. (S1 ^operator O2169 = 0.3907872220793651)
  27114. =>WM: (15269: S1 ^operator O2172 +)
  27115. =>WM: (15268: S1 ^operator O2171 +)
  27116. =>WM: (15267: I3 ^dir U)
  27117. =>WM: (15266: O2172 ^name predict-no)
  27118. =>WM: (15265: O2171 ^name predict-yes)
  27119. =>WM: (15264: R1089 ^value 1)
  27120. =>WM: (15263: R1 ^reward R1089)
  27121. =>WM: (15262: I3 ^see 0)
  27122. <=WM: (15253: S1 ^operator O2169 +)
  27123. <=WM: (15254: S1 ^operator O2170 +)
  27124. <=WM: (15255: S1 ^operator O2170)
  27125. <=WM: (15238: I3 ^dir L)
  27126. <=WM: (15249: R1 ^reward R1088)
  27127. <=WM: (15248: I3 ^see 1)
  27128. <=WM: (15252: O2170 ^name predict-no)
  27129. <=WM: (15251: O2169 ^name predict-yes)
  27130. <=WM: (15250: R1088 ^value 1)
  27131. --- Inner Elaboration Phase, active level 1 (S1) ---
  27132. Firing prefer*rvt*predict-yes*H0
  27133. -->
  27134. Firing rl*prefer*rvt*predict-yes*H0*1
  27135. -->
  27136. (S1 ^operator O2171 = 0.)
  27137. Firing prefer*rvt*predict-no*H0
  27138. -->
  27139. Firing rl*prefer*rvt*predict-no*H0*2
  27140. -->
  27141. (S1 ^operator O2172 = 1.)
  27142. inner elaboration loop at bottom goal.
  27143. Retracting rl*prefer*rvt*predict-no*H0*2
  27144. -->
  27145. (S1 ^operator O2170 = 1.)
  27146. Retracting rl*prefer*rvt*predict-yes*H0*1
  27147. -->
  27148. (S1 ^operator O2169 = 0.)
  27149. --- END Proposal Phase ---
  27150. --- Decision Phase ---
  27151. RL update rl*prefer*rvt*predict-no*H0*4 0.478544 -0.164049 0.314495 -> 0.478543 -0.164049 0.314494(R,m,v=1,0.927273,0.0678492)
  27152. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521465 0.16405 0.685515 -> 0.521464 0.16405 0.685514(R,m,v=1,1,0)
  27153. =>WM: (15270: S1 ^operator O2172)
  27154. 1086: O: O2172 (predict-no)
  27155. --- END Decision Phase ---
  27156. --- Application Phase ---
  27157. --- Firing Productions (PE) For State At Depth 1 ---
  27158. --- Inner Elaboration Phase, active level 1 (S1) ---
  27159. Firing apply*operator
  27160. -->
  27161. (I3 ^predict-no N1086 + :O )
  27162. Firing apply*operator*complete
  27163. -->
  27164. (I3 ^predict-no N1085 - :O )
  27165. inner elaboration loop at bottom goal.
  27166. --- Change Working Memory (PE) ---
  27167. =>WM: (15271: I3 ^predict-no N1086)
  27168. <=WM: (15257: N1085 ^status complete)
  27169. <=WM: (15256: I3 ^predict-no N1085)
  27170. --- Firing Productions (IE) For State At Depth 1 ---
  27171. --- Inner Elaboration Phase, active level 1 (S1) ---
  27172. Firing monitor*world
  27173. -->
  27174. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27175. --- Change Working Memory (IE) ---
  27176. --- END Application Phase ---
  27177. --- Output Phase ---
  27178. ENV: Agent did: predict-no for direction U in state State-A
  27179. In State-A moving U
  27180. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27181. predict error 0
  27182. dir: dir isU
  27183. --- END Output Phase ---
  27184. \-/--- Input Phase ---
  27185. =>WM: (15275: I2 ^dir U)
  27186. =>WM: (15274: I2 ^reward 1)
  27187. =>WM: (15273: I2 ^see 0)
  27188. =>WM: (15272: N1086 ^status complete)
  27189. <=WM: (15260: I2 ^dir U)
  27190. <=WM: (15259: I2 ^reward 1)
  27191. <=WM: (15258: I2 ^see 0)
  27192. =>WM: (15276: I2 ^level-1 L0-root)
  27193. <=WM: (15261: I2 ^level-1 L0-root)
  27194. --- END Input Phase ---
  27195. --- Proposal Phase ---
  27196. --- Inner Elaboration Phase, active level 1 (S1) ---
  27197. Firing elaborate*copy-see-to-output-link
  27198. -->
  27199. (I3 ^see 0 +)
  27200. Firing elaborate*reward*based*on*reward
  27201. -->
  27202. (R1090 ^value 1 +)
  27203. (R1 ^reward R1090 +)
  27204. Firing propose*predict-yes
  27205. -->
  27206. (O2173 ^name predict-yes +)
  27207. (S1 ^operator O2173 +)
  27208. Firing propose*predict-no
  27209. -->
  27210. (O2174 ^name predict-no +)
  27211. (S1 ^operator O2174 +)
  27212. Firing rl*prefer*rvt*predict-no*H0*2
  27213. -->
  27214. (S1 ^operator O2172 = 1.)
  27215. Firing rl*prefer*rvt*predict-yes*H0*1
  27216. -->
  27217. (S1 ^operator O2171 = 0.)
  27218. Firing prefer*rvt*predict-yes*H0
  27219. -->
  27220. Firing prefer*rvt*predict-no*H0
  27221. -->
  27222. Firing elaborate*copy-dir-to-output-link
  27223. -->
  27224. (I3 ^dir U +)
  27225. inner elaboration loop at bottom goal.
  27226. Retracting elaborate*copy-see-to-output-link
  27227. -->
  27228. (I3 ^see 0 +)
  27229. Retracting propose*predict-no
  27230. -->
  27231. (O2172 ^name predict-no +)
  27232. (S1 ^operator O2172 +)
  27233. Retracting propose*predict-yes
  27234. -->
  27235. (O2171 ^name predict-yes +)
  27236. (S1 ^operator O2171 +)
  27237. Retracting elaborate*reward*based*on*reward
  27238. -->
  27239. (R1089 ^value 1 +)
  27240. (R1 ^reward R1089 +)
  27241. Retracting elaborate*copy-dir-to-output-link
  27242. -->
  27243. (I3 ^dir U +)
  27244. Retracting rl*prefer*rvt*predict-no*H0*2
  27245. -->
  27246. (S1 ^operator O2172 = 1.)
  27247. Retracting rl*prefer*rvt*predict-yes*H0*1
  27248. -->
  27249. (S1 ^operator O2171 = 0.)
  27250. =>WM: (15282: S1 ^operator O2174 +)
  27251. =>WM: (15281: S1 ^operator O2173 +)
  27252. =>WM: (15280: O2174 ^name predict-no)
  27253. =>WM: (15279: O2173 ^name predict-yes)
  27254. =>WM: (15278: R1090 ^value 1)
  27255. =>WM: (15277: R1 ^reward R1090)
  27256. <=WM: (15268: S1 ^operator O2171 +)
  27257. <=WM: (15269: S1 ^operator O2172 +)
  27258. <=WM: (15270: S1 ^operator O2172)
  27259. <=WM: (15263: R1 ^reward R1089)
  27260. <=WM: (15266: O2172 ^name predict-no)
  27261. <=WM: (15265: O2171 ^name predict-yes)
  27262. <=WM: (15264: R1089 ^value 1)
  27263. --- Inner Elaboration Phase, active level 1 (S1) ---
  27264. Firing prefer*rvt*predict-yes*H0
  27265. -->
  27266. Firing rl*prefer*rvt*predict-yes*H0*1
  27267. -->
  27268. (S1 ^operator O2173 = 0.)
  27269. Firing prefer*rvt*predict-no*H0
  27270. -->
  27271. Firing rl*prefer*rvt*predict-no*H0*2
  27272. -->
  27273. (S1 ^operator O2174 = 1.)
  27274. inner elaboration loop at bottom goal.
  27275. Retracting rl*prefer*rvt*predict-no*H0*2
  27276. -->
  27277. (S1 ^operator O2172 = 1.)
  27278. Retracting rl*prefer*rvt*predict-yes*H0*1
  27279. -->
  27280. (S1 ^operator O2171 = 0.)
  27281. --- END Proposal Phase ---
  27282. --- Decision Phase ---
  27283. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27284. =>WM: (15283: S1 ^operator O2174)
  27285. 1087: O: O2174 (predict-no)
  27286. --- END Decision Phase ---
  27287. --- Application Phase ---
  27288. --- Firing Productions (PE) For State At Depth 1 ---
  27289. --- Inner Elaboration Phase, active level 1 (S1) ---
  27290. Firing apply*operator
  27291. -->
  27292. (I3 ^predict-no N1087 + :O )
  27293. Firing apply*operator*complete
  27294. -->
  27295. (I3 ^predict-no N1086 - :O )
  27296. inner elaboration loop at bottom goal.
  27297. --- Change Working Memory (PE) ---
  27298. =>WM: (15284: I3 ^predict-no N1087)
  27299. <=WM: (15272: N1086 ^status complete)
  27300. <=WM: (15271: I3 ^predict-no N1086)
  27301. --- Firing Productions (IE) For State At Depth 1 ---
  27302. --- Inner Elaboration Phase, active level 1 (S1) ---
  27303. Firing monitor*world
  27304. -->
  27305. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27306. --- Change Working Memory (IE) ---
  27307. --- END Application Phase ---
  27308. --- Output Phase ---
  27309. ENV: Agent did: predict-no for direction U in state State-A
  27310. In State-A moving U
  27311. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27312. predict error 0
  27313. dir: dir isU
  27314. --- END Output Phase ---
  27315. |--- Input Phase ---
  27316. =>WM: (15288: I2 ^dir U)
  27317. =>WM: (15287: I2 ^reward 1)
  27318. =>WM: (15286: I2 ^see 0)
  27319. =>WM: (15285: N1087 ^status complete)
  27320. <=WM: (15275: I2 ^dir U)
  27321. <=WM: (15274: I2 ^reward 1)
  27322. <=WM: (15273: I2 ^see 0)
  27323. =>WM: (15289: I2 ^level-1 L0-root)
  27324. <=WM: (15276: I2 ^level-1 L0-root)
  27325. --- END Input Phase ---
  27326. --- Proposal Phase ---
  27327. --- Inner Elaboration Phase, active level 1 (S1) ---
  27328. Firing elaborate*copy-see-to-output-link
  27329. -->
  27330. (I3 ^see 0 +)
  27331. Firing elaborate*reward*based*on*reward
  27332. -->
  27333. (R1091 ^value 1 +)
  27334. (R1 ^reward R1091 +)
  27335. Firing propose*predict-yes
  27336. -->
  27337. (O2175 ^name predict-yes +)
  27338. (S1 ^operator O2175 +)
  27339. Firing propose*predict-no
  27340. -->
  27341. (O2176 ^name predict-no +)
  27342. (S1 ^operator O2176 +)
  27343. Firing rl*prefer*rvt*predict-no*H0*2
  27344. -->
  27345. (S1 ^operator O2174 = 1.)
  27346. Firing rl*prefer*rvt*predict-yes*H0*1
  27347. -->
  27348. (S1 ^operator O2173 = 0.)
  27349. Firing prefer*rvt*predict-yes*H0
  27350. -->
  27351. Firing prefer*rvt*predict-no*H0
  27352. -->
  27353. Firing elaborate*copy-dir-to-output-link
  27354. -->
  27355. (I3 ^dir U +)
  27356. inner elaboration loop at bottom goal.
  27357. Retracting elaborate*copy-see-to-output-link
  27358. -->
  27359. (I3 ^see 0 +)
  27360. Retracting propose*predict-no
  27361. -->
  27362. (O2174 ^name predict-no +)
  27363. (S1 ^operator O2174 +)
  27364. Retracting propose*predict-yes
  27365. -->
  27366. (O2173 ^name predict-yes +)
  27367. (S1 ^operator O2173 +)
  27368. Retracting elaborate*reward*based*on*reward
  27369. -->
  27370. (R1090 ^value 1 +)
  27371. (R1 ^reward R1090 +)
  27372. Retracting elaborate*copy-dir-to-output-link
  27373. -->
  27374. (I3 ^dir U +)
  27375. Retracting rl*prefer*rvt*predict-no*H0*2
  27376. -->
  27377. (S1 ^operator O2174 = 1.)
  27378. Retracting rl*prefer*rvt*predict-yes*H0*1
  27379. -->
  27380. (S1 ^operator O2173 = 0.)
  27381. =>WM: (15295: S1 ^operator O2176 +)
  27382. =>WM: (15294: S1 ^operator O2175 +)
  27383. =>WM: (15293: O2176 ^name predict-no)
  27384. =>WM: (15292: O2175 ^name predict-yes)
  27385. =>WM: (15291: R1091 ^value 1)
  27386. =>WM: (15290: R1 ^reward R1091)
  27387. <=WM: (15281: S1 ^operator O2173 +)
  27388. <=WM: (15282: S1 ^operator O2174 +)
  27389. <=WM: (15283: S1 ^operator O2174)
  27390. <=WM: (15277: R1 ^reward R1090)
  27391. <=WM: (15280: O2174 ^name predict-no)
  27392. <=WM: (15279: O2173 ^name predict-yes)
  27393. <=WM: (15278: R1090 ^value 1)
  27394. --- Inner Elaboration Phase, active level 1 (S1) ---
  27395. Firing prefer*rvt*predict-yes*H0
  27396. -->
  27397. Firing rl*prefer*rvt*predict-yes*H0*1
  27398. -->
  27399. (S1 ^operator O2175 = 0.)
  27400. Firing prefer*rvt*predict-no*H0
  27401. -->
  27402. Firing rl*prefer*rvt*predict-no*H0*2
  27403. -->
  27404. (S1 ^operator O2176 = 1.)
  27405. inner elaboration loop at bottom goal.
  27406. Retracting rl*prefer*rvt*predict-no*H0*2
  27407. -->
  27408. (S1 ^operator O2174 = 1.)
  27409. Retracting rl*prefer*rvt*predict-yes*H0*1
  27410. -->
  27411. (S1 ^operator O2173 = 0.)
  27412. --- END Proposal Phase ---
  27413. --- Decision Phase ---
  27414. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27415. =>WM: (15296: S1 ^operator O2176)
  27416. 1088: O: O2176 (predict-no)
  27417. --- END Decision Phase ---
  27418. --- Application Phase ---
  27419. --- Firing Productions (PE) For State At Depth 1 ---
  27420. --- Inner Elaboration Phase, active level 1 (S1) ---
  27421. Firing apply*operator
  27422. -->
  27423. (I3 ^predict-no N1088 + :O )
  27424. Firing apply*operator*complete
  27425. -->
  27426. (I3 ^predict-no N1087 - :O )
  27427. inner elaboration loop at bottom goal.
  27428. --- Change Working Memory (PE) ---
  27429. =>WM: (15297: I3 ^predict-no N1088)
  27430. <=WM: (15285: N1087 ^status complete)
  27431. <=WM: (15284: I3 ^predict-no N1087)
  27432. --- Firing Productions (IE) For State At Depth 1 ---
  27433. --- Inner Elaboration Phase, active level 1 (S1) ---
  27434. Firing monitor*world
  27435. -->
  27436. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27437. --- Change Working Memory (IE) ---
  27438. --- END Application Phase ---
  27439. --- Output Phase ---
  27440. ENV: Agent did: predict-no for direction U in state State-A
  27441. In State-A moving U
  27442. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27443. predict error 0
  27444. dir: dir isL
  27445. --- END Output Phase ---
  27446. --- Input Phase ---
  27447. =>WM: (15301: I2 ^dir L)
  27448. =>WM: (15300: I2 ^reward 1)
  27449. =>WM: (15299: I2 ^see 0)
  27450. =>WM: (15298: N1088 ^status complete)
  27451. <=WM: (15288: I2 ^dir U)
  27452. <=WM: (15287: I2 ^reward 1)
  27453. <=WM: (15286: I2 ^see 0)
  27454. =>WM: (15302: I2 ^level-1 L0-root)
  27455. <=WM: (15289: I2 ^level-1 L0-root)
  27456. --- END Input Phase ---
  27457. --- Proposal Phase ---
  27458. --- Inner Elaboration Phase, active level 1 (S1) ---
  27459. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27460. -->
  27461. (S1 ^operator O2175 = -0.208713043145708)
  27462. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27463. -->
  27464. (S1 ^operator O2176 = 0.6854394259185996)
  27465. Firing prefer*rvt*predict-no*H0*4*H1
  27466. -->
  27467. Firing prefer*rvt*predict-yes*H0*3*H1
  27468. -->
  27469. Firing elaborate*copy-see-to-output-link
  27470. -->
  27471. (I3 ^see 0 +)
  27472. Firing elaborate*reward*based*on*reward
  27473. -->
  27474. (R1092 ^value 1 +)
  27475. (R1 ^reward R1092 +)
  27476. Firing propose*predict-yes
  27477. -->
  27478. (O2177 ^name predict-yes +)
  27479. (S1 ^operator O2177 +)
  27480. Firing propose*predict-no
  27481. -->
  27482. (O2178 ^name predict-no +)
  27483. (S1 ^operator O2178 +)
  27484. Firing rl*prefer*rvt*predict-no*H0*4
  27485. -->
  27486. (S1 ^operator O2176 = 0.3144938653010612)
  27487. Firing rl*prefer*rvt*predict-yes*H0*3
  27488. -->
  27489. (S1 ^operator O2175 = 0.3907872220793651)
  27490. Firing prefer*rvt*predict-yes*H0
  27491. -->
  27492. Firing prefer*rvt*predict-no*H0
  27493. -->
  27494. Firing elaborate*copy-dir-to-output-link
  27495. -->
  27496. (I3 ^dir L +)
  27497. inner elaboration loop at bottom goal.
  27498. Retracting elaborate*copy-see-to-output-link
  27499. -->
  27500. (I3 ^see 0 +)
  27501. Retracting propose*predict-no
  27502. -->
  27503. (O2176 ^name predict-no +)
  27504. (S1 ^operator O2176 +)
  27505. Retracting propose*predict-yes
  27506. -->
  27507. (O2175 ^name predict-yes +)
  27508. (S1 ^operator O2175 +)
  27509. Retracting elaborate*reward*based*on*reward
  27510. -->
  27511. (R1091 ^value 1 +)
  27512. (R1 ^reward R1091 +)
  27513. Retracting elaborate*copy-dir-to-output-link
  27514. -->
  27515. (I3 ^dir U +)
  27516. Retracting rl*prefer*rvt*predict-no*H0*2
  27517. -->
  27518. (S1 ^operator O2176 = 1.)
  27519. Retracting rl*prefer*rvt*predict-yes*H0*1
  27520. -->
  27521. (S1 ^operator O2175 = 0.)
  27522. =>WM: (15309: S1 ^operator O2178 +)
  27523. =>WM: (15308: S1 ^operator O2177 +)
  27524. =>WM: (15307: I3 ^dir L)
  27525. =>WM: (15306: O2178 ^name predict-no)
  27526. =>WM: (15305: O2177 ^name predict-yes)
  27527. =>WM: (15304: R1092 ^value 1)
  27528. =>WM: (15303: R1 ^reward R1092)
  27529. <=WM: (15294: S1 ^operator O2175 +)
  27530. <=WM: (15295: S1 ^operator O2176 +)
  27531. <=WM: (15296: S1 ^operator O2176)
  27532. <=WM: (15267: I3 ^dir U)
  27533. <=WM: (15290: R1 ^reward R1091)
  27534. <=WM: (15293: O2176 ^name predict-no)
  27535. <=WM: (15292: O2175 ^name predict-yes)
  27536. <=WM: (15291: R1091 ^value 1)
  27537. --- Inner Elaboration Phase, active level 1 (S1) ---
  27538. Firing prefer*rvt*predict-yes*H0
  27539. -->
  27540. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27541. -->
  27542. (S1 ^operator O2177 = -0.208713043145708)
  27543. Firing rl*prefer*rvt*predict-yes*H0*3
  27544. -->
  27545. (S1 ^operator O2177 = 0.3907872220793651)
  27546. Firing prefer*rvt*predict-yes*H0*3*H1
  27547. -->
  27548. Firing prefer*rvt*predict-no*H0
  27549. -->
  27550. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27551. -->
  27552. (S1 ^operator O2178 = 0.6854394259185996)
  27553. Firing rl*prefer*rvt*predict-no*H0*4
  27554. -->
  27555. (S1 ^operator O2178 = 0.3144938653010612)
  27556. Firing prefer*rvt*predict-no*H0*4*H1
  27557. -->
  27558. inner elaboration loop at bottom goal.
  27559. Retracting rl*prefer*rvt*predict-no*H0*4
  27560. -->
  27561. (S1 ^operator O2176 = 0.3144938653010612)
  27562. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  27563. -->
  27564. (S1 ^operator O2176 = 0.6854394259185996)
  27565. Retracting rl*prefer*rvt*predict-yes*H0*3
  27566. -->
  27567. (S1 ^operator O2175 = 0.3907872220793651)
  27568. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  27569. -->
  27570. (S1 ^operator O2175 = -0.208713043145708)
  27571. --- END Proposal Phase ---
  27572. --- Decision Phase ---
  27573. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  27574. =>WM: (15310: S1 ^operator O2178)
  27575. 1089: O: O2178 (predict-no)
  27576. --- END Decision Phase ---
  27577. --- Application Phase ---
  27578. --- Firing Productions (PE) For State At Depth 1 ---
  27579. --- Inner Elaboration Phase, active level 1 (S1) ---
  27580. Firing apply*operator
  27581. -->
  27582. (I3 ^predict-no N1089 + :O )
  27583. Firing apply*operator*complete
  27584. -->
  27585. (I3 ^predict-no N1088 - :O )
  27586. inner elaboration loop at bottom goal.
  27587. --- Change Working Memory (PE) ---
  27588. =>WM: (15311: I3 ^predict-no N1089)
  27589. <=WM: (15298: N1088 ^status complete)
  27590. <=WM: (15297: I3 ^predict-no N1088)
  27591. --- Firing Productions (IE) For State At Depth 1 ---
  27592. --- Inner Elaboration Phase, active level 1 (S1) ---
  27593. Firing monitor*world
  27594. -->
  27595. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27596. --- Change Working Memory (IE) ---
  27597. --- END Application Phase ---
  27598. --- Output Phase ---
  27599. ENV: Agent did: predict-no for direction L in state State-A
  27600. In State-A moving L
  27601. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27602. predict error 0
  27603. dir: dir isL
  27604. --- END Output Phase ---
  27605. --- Input Phase ---
  27606. =>WM: (15315: I2 ^dir L)
  27607. =>WM: (15314: I2 ^reward 1)
  27608. =>WM: (15313: I2 ^see 0)
  27609. =>WM: (15312: N1089 ^status complete)
  27610. <=WM: (15301: I2 ^dir L)
  27611. <=WM: (15300: I2 ^reward 1)
  27612. <=WM: (15299: I2 ^see 0)
  27613. =>WM: (15316: I2 ^level-1 L0-root)
  27614. <=WM: (15302: I2 ^level-1 L0-root)
  27615. --- END Input Phase ---
  27616. --- Proposal Phase ---
  27617. --- Inner Elaboration Phase, active level 1 (S1) ---
  27618. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27619. -->
  27620. (S1 ^operator O2177 = -0.208713043145708)
  27621. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27622. -->
  27623. (S1 ^operator O2178 = 0.6854394259185996)
  27624. Firing prefer*rvt*predict-no*H0*4*H1
  27625. -->
  27626. Firing prefer*rvt*predict-yes*H0*3*H1
  27627. -->
  27628. Firing elaborate*copy-see-to-output-link
  27629. -->
  27630. (I3 ^see 0 +)
  27631. Firing elaborate*reward*based*on*reward
  27632. -->
  27633. (R1093 ^value 1 +)
  27634. (R1 ^reward R1093 +)
  27635. Firing propose*predict-yes
  27636. -->
  27637. (O2179 ^name predict-yes +)
  27638. (S1 ^operator O2179 +)
  27639. Firing propose*predict-no
  27640. -->
  27641. (O2180 ^name predict-no +)
  27642. (S1 ^operator O2180 +)
  27643. Firing rl*prefer*rvt*predict-no*H0*4
  27644. -->
  27645. (S1 ^operator O2178 = 0.3144938653010612)
  27646. Firing rl*prefer*rvt*predict-yes*H0*3
  27647. -->
  27648. (S1 ^operator O2177 = 0.3907872220793651)
  27649. Firing prefer*rvt*predict-yes*H0
  27650. -->
  27651. Firing prefer*rvt*predict-no*H0
  27652. -->
  27653. Firing elaborate*copy-dir-to-output-link
  27654. -->
  27655. (I3 ^dir L +)
  27656. inner elaboration loop at bottom goal.
  27657. Retracting elaborate*copy-see-to-output-link
  27658. -->
  27659. (I3 ^see 0 +)
  27660. Retracting propose*predict-no
  27661. -->
  27662. (O2178 ^name predict-no +)
  27663. (S1 ^operator O2178 +)
  27664. Retracting propose*predict-yes
  27665. -->
  27666. (O2177 ^name predict-yes +)
  27667. (S1 ^operator O2177 +)
  27668. Retracting elaborate*reward*based*on*reward
  27669. -->
  27670. (R1092 ^value 1 +)
  27671. (R1 ^reward R1092 +)
  27672. Retracting elaborate*copy-dir-to-output-link
  27673. -->
  27674. (I3 ^dir L +)
  27675. Retracting rl*prefer*rvt*predict-no*H0*4
  27676. -->
  27677. (S1 ^operator O2178 = 0.3144938653010612)
  27678. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  27679. -->
  27680. (S1 ^operator O2178 = 0.6854394259185996)
  27681. Retracting rl*prefer*rvt*predict-yes*H0*3
  27682. -->
  27683. (S1 ^operator O2177 = 0.3907872220793651)
  27684. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  27685. -->
  27686. (S1 ^operator O2177 = -0.208713043145708)
  27687. =>WM: (15322: S1 ^operator O2180 +)
  27688. =>WM: (15321: S1 ^operator O2179 +)
  27689. =>WM: (15320: O2180 ^name predict-no)
  27690. =>WM: (15319: O2179 ^name predict-yes)
  27691. =>WM: (15318: R1093 ^value 1)
  27692. =>WM: (15317: R1 ^reward R1093)
  27693. <=WM: (15308: S1 ^operator O2177 +)
  27694. <=WM: (15309: S1 ^operator O2178 +)
  27695. <=WM: (15310: S1 ^operator O2178)
  27696. <=WM: (15303: R1 ^reward R1092)
  27697. <=WM: (15306: O2178 ^name predict-no)
  27698. <=WM: (15305: O2177 ^name predict-yes)
  27699. <=WM: (15304: R1092 ^value 1)
  27700. --- Inner Elaboration Phase, active level 1 (S1) ---
  27701. Firing prefer*rvt*predict-yes*H0
  27702. -->
  27703. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27704. -->
  27705. (S1 ^operator O2179 = -0.208713043145708)
  27706. Firing rl*prefer*rvt*predict-yes*H0*3
  27707. -->
  27708. (S1 ^operator O2179 = 0.3907872220793651)
  27709. Firing prefer*rvt*predict-yes*H0*3*H1
  27710. -->
  27711. Firing prefer*rvt*predict-no*H0
  27712. -->
  27713. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27714. -->
  27715. (S1 ^operator O2180 = 0.6854394259185996)
  27716. Firing rl*prefer*rvt*predict-no*H0*4
  27717. -->
  27718. (S1 ^operator O2180 = 0.3144938653010612)
  27719. Firing prefer*rvt*predict-no*H0*4*H1
  27720. -->
  27721. inner elaboration loop at bottom goal.
  27722. Retracting rl*prefer*rvt*predict-no*H0*4
  27723. -->
  27724. (S1 ^operator O2178 = 0.3144938653010612)
  27725. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  27726. -->
  27727. (S1 ^operator O2178 = 0.6854394259185996)
  27728. Retracting rl*prefer*rvt*predict-yes*H0*3
  27729. -->
  27730. (S1 ^operator O2177 = 0.3907872220793651)
  27731. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  27732. -->
  27733. (S1 ^operator O2177 = -0.208713043145708)
  27734. --- END Proposal Phase ---
  27735. --- Decision Phase ---
  27736. RL update rl*prefer*rvt*predict-no*H0*4 0.478543 -0.164049 0.314494 -> 0.478548 -0.164049 0.314499(R,m,v=1,0.927711,0.0674699)
  27737. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521396 0.164043 0.685439 -> 0.521402 0.164044 0.685446(R,m,v=1,1,0)
  27738. =>WM: (15323: S1 ^operator O2180)
  27739. 1090: O: O2180 (predict-no)
  27740. --- END Decision Phase ---
  27741. --- Application Phase ---
  27742. --- Firing Productions (PE) For State At Depth 1 ---
  27743. --- Inner Elaboration Phase, active level 1 (S1) ---
  27744. Firing apply*operator
  27745. -->
  27746. (I3 ^predict-no N1090 + :O )
  27747. Firing apply*operator*complete
  27748. -->
  27749. (I3 ^predict-no N1089 - :O )
  27750. inner elaboration loop at bottom goal.
  27751. --- Change Working Memory (PE) ---
  27752. =>WM: (15324: I3 ^predict-no N1090)
  27753. <=WM: (15312: N1089 ^status complete)
  27754. <=WM: (15311: I3 ^predict-no N1089)
  27755. --- Firing Productions (IE) For State At Depth 1 ---
  27756. --- Inner Elaboration Phase, active level 1 (S1) ---
  27757. Firing monitor*world
  27758. -->
  27759. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27760. --- Change Working Memory (IE) ---
  27761. --- END Application Phase ---
  27762. --- Output Phase ---
  27763. ENV: Agent did: predict-no for direction L in state State-A
  27764. In State-A moving L
  27765. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27766. predict error 0
  27767. dir: dir isL
  27768. --- END Output Phase ---
  27769. --- Input Phase ---
  27770. =>WM: (15328: I2 ^dir L)
  27771. =>WM: (15327: I2 ^reward 1)
  27772. =>WM: (15326: I2 ^see 0)
  27773. =>WM: (15325: N1090 ^status complete)
  27774. <=WM: (15315: I2 ^dir L)
  27775. <=WM: (15314: I2 ^reward 1)
  27776. <=WM: (15313: I2 ^see 0)
  27777. =>WM: (15329: I2 ^level-1 L0-root)
  27778. <=WM: (15316: I2 ^level-1 L0-root)
  27779. --- END Input Phase ---
  27780. --- Proposal Phase ---
  27781. --- Inner Elaboration Phase, active level 1 (S1) ---
  27782. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27783. -->
  27784. (S1 ^operator O2179 = -0.208713043145708)
  27785. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27786. -->
  27787. (S1 ^operator O2180 = 0.6854458162511854)
  27788. Firing prefer*rvt*predict-no*H0*4*H1
  27789. -->
  27790. Firing prefer*rvt*predict-yes*H0*3*H1
  27791. -->
  27792. Firing elaborate*copy-see-to-output-link
  27793. -->
  27794. (I3 ^see 0 +)
  27795. Firing elaborate*reward*based*on*reward
  27796. -->
  27797. (R1094 ^value 1 +)
  27798. (R1 ^reward R1094 +)
  27799. Firing propose*predict-yes
  27800. -->
  27801. (O2181 ^name predict-yes +)
  27802. (S1 ^operator O2181 +)
  27803. Firing propose*predict-no
  27804. -->
  27805. (O2182 ^name predict-no +)
  27806. (S1 ^operator O2182 +)
  27807. Firing rl*prefer*rvt*predict-no*H0*4
  27808. -->
  27809. (S1 ^operator O2180 = 0.3144993225093091)
  27810. Firing rl*prefer*rvt*predict-yes*H0*3
  27811. -->
  27812. (S1 ^operator O2179 = 0.3907872220793651)
  27813. Firing prefer*rvt*predict-yes*H0
  27814. -->
  27815. Firing prefer*rvt*predict-no*H0
  27816. -->
  27817. Firing elaborate*copy-dir-to-output-link
  27818. -->
  27819. (I3 ^dir L +)
  27820. inner elaboration loop at bottom goal.
  27821. Retracting elaborate*copy-see-to-output-link
  27822. -->
  27823. (I3 ^see 0 +)
  27824. Retracting propose*predict-no
  27825. -->
  27826. (O2180 ^name predict-no +)
  27827. (S1 ^operator O2180 +)
  27828. Retracting propose*predict-yes
  27829. -->
  27830. (O2179 ^name predict-yes +)
  27831. (S1 ^operator O2179 +)
  27832. Retracting elaborate*reward*based*on*reward
  27833. -->
  27834. (R1093 ^value 1 +)
  27835. (R1 ^reward R1093 +)
  27836. Retracting elaborate*copy-dir-to-output-link
  27837. -->
  27838. (I3 ^dir L +)
  27839. Retracting rl*prefer*rvt*predict-no*H0*4
  27840. -->
  27841. (S1 ^operator O2180 = 0.3144993225093091)
  27842. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  27843. -->
  27844. (S1 ^operator O2180 = 0.6854458162511854)
  27845. Retracting rl*prefer*rvt*predict-yes*H0*3
  27846. -->
  27847. (S1 ^operator O2179 = 0.3907872220793651)
  27848. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  27849. -->
  27850. (S1 ^operator O2179 = -0.208713043145708)
  27851. =>WM: (15335: S1 ^operator O2182 +)
  27852. =>WM: (15334: S1 ^operator O2181 +)
  27853. =>WM: (15333: O2182 ^name predict-no)
  27854. =>WM: (15332: O2181 ^name predict-yes)
  27855. =>WM: (15331: R1094 ^value 1)
  27856. =>WM: (15330: R1 ^reward R1094)
  27857. <=WM: (15321: S1 ^operator O2179 +)
  27858. <=WM: (15322: S1 ^operator O2180 +)
  27859. <=WM: (15323: S1 ^operator O2180)
  27860. <=WM: (15317: R1 ^reward R1093)
  27861. <=WM: (15320: O2180 ^name predict-no)
  27862. <=WM: (15319: O2179 ^name predict-yes)
  27863. <=WM: (15318: R1093 ^value 1)
  27864. --- Inner Elaboration Phase, active level 1 (S1) ---
  27865. Firing prefer*rvt*predict-yes*H0
  27866. -->
  27867. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27868. -->
  27869. (S1 ^operator O2181 = -0.208713043145708)
  27870. Firing rl*prefer*rvt*predict-yes*H0*3
  27871. -->
  27872. (S1 ^operator O2181 = 0.3907872220793651)
  27873. Firing prefer*rvt*predict-yes*H0*3*H1
  27874. -->
  27875. Firing prefer*rvt*predict-no*H0
  27876. -->
  27877. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27878. -->
  27879. (S1 ^operator O2182 = 0.6854458162511854)
  27880. Firing rl*prefer*rvt*predict-no*H0*4
  27881. -->
  27882. (S1 ^operator O2182 = 0.3144993225093091)
  27883. Firing prefer*rvt*predict-no*H0*4*H1
  27884. -->
  27885. inner elaboration loop at bottom goal.
  27886. Retracting rl*prefer*rvt*predict-no*H0*4
  27887. -->
  27888. (S1 ^operator O2180 = 0.3144993225093091)
  27889. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  27890. -->
  27891. (S1 ^operator O2180 = 0.6854458162511854)
  27892. Retracting rl*prefer*rvt*predict-yes*H0*3
  27893. -->
  27894. (S1 ^operator O2179 = 0.3907872220793651)
  27895. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  27896. -->
  27897. (S1 ^operator O2179 = -0.208713043145708)
  27898. --- END Proposal Phase ---
  27899. --- Decision Phase ---
  27900. RL update rl*prefer*rvt*predict-no*H0*4 0.478548 -0.164049 0.314499 -> 0.478552 -0.164048 0.314504(R,m,v=1,0.928144,0.0670947)
  27901. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521402 0.164044 0.685446 -> 0.521407 0.164044 0.685451(R,m,v=1,1,0)
  27902. =>WM: (15336: S1 ^operator O2182)
  27903. 1091: O: O2182 (predict-no)
  27904. --- END Decision Phase ---
  27905. --- Application Phase ---
  27906. --- Firing Productions (PE) For State At Depth 1 ---
  27907. --- Inner Elaboration Phase, active level 1 (S1) ---
  27908. Firing apply*operator
  27909. -->
  27910. (I3 ^predict-no N1091 + :O )
  27911. Firing apply*operator*complete
  27912. -->
  27913. (I3 ^predict-no N1090 - :O )
  27914. inner elaboration loop at bottom goal.
  27915. --- Change Working Memory (PE) ---
  27916. =>WM: (15337: I3 ^predict-no N1091)
  27917. <=WM: (15325: N1090 ^status complete)
  27918. <=WM: (15324: I3 ^predict-no N1090)
  27919. --- Firing Productions (IE) For State At Depth 1 ---
  27920. --- Inner Elaboration Phase, active level 1 (S1) ---
  27921. Firing monitor*world
  27922. -->
  27923. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  27924. --- Change Working Memory (IE) ---
  27925. --- END Application Phase ---
  27926. --- Output Phase ---
  27927. ENV: Agent did: predict-no for direction L in state State-A
  27928. In State-A moving L
  27929. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  27930. predict error 0
  27931. dir: dir isL
  27932. --- END Output Phase ---
  27933. --- Input Phase ---
  27934. =>WM: (15341: I2 ^dir L)
  27935. =>WM: (15340: I2 ^reward 1)
  27936. =>WM: (15339: I2 ^see 0)
  27937. =>WM: (15338: N1091 ^status complete)
  27938. <=WM: (15328: I2 ^dir L)
  27939. <=WM: (15327: I2 ^reward 1)
  27940. <=WM: (15326: I2 ^see 0)
  27941. =>WM: (15342: I2 ^level-1 L0-root)
  27942. <=WM: (15329: I2 ^level-1 L0-root)
  27943. --- END Input Phase ---
  27944. --- Proposal Phase ---
  27945. --- Inner Elaboration Phase, active level 1 (S1) ---
  27946. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  27947. -->
  27948. (S1 ^operator O2181 = -0.208713043145708)
  27949. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  27950. -->
  27951. (S1 ^operator O2182 = 0.685451056996617)
  27952. Firing prefer*rvt*predict-no*H0*4*H1
  27953. -->
  27954. Firing prefer*rvt*predict-yes*H0*3*H1
  27955. -->
  27956. Firing elaborate*copy-see-to-output-link
  27957. -->
  27958. (I3 ^see 0 +)
  27959. Firing elaborate*reward*based*on*reward
  27960. -->
  27961. (R1095 ^value 1 +)
  27962. (R1 ^reward R1095 +)
  27963. Firing propose*predict-yes
  27964. -->
  27965. (O2183 ^name predict-yes +)
  27966. (S1 ^operator O2183 +)
  27967. Firing propose*predict-no
  27968. -->
  27969. (O2184 ^name predict-no +)
  27970. (S1 ^operator O2184 +)
  27971. Firing rl*prefer*rvt*predict-no*H0*4
  27972. -->
  27973. (S1 ^operator O2182 = 0.3145038061064807)
  27974. Firing rl*prefer*rvt*predict-yes*H0*3
  27975. -->
  27976. (S1 ^operator O2181 = 0.3907872220793651)
  27977. Firing prefer*rvt*predict-yes*H0
  27978. -->
  27979. Firing prefer*rvt*predict-no*H0
  27980. -->
  27981. Firing elaborate*copy-dir-to-output-link
  27982. -->
  27983. (I3 ^dir L +)
  27984. inner elaboration loop at bottom goal.
  27985. Retracting elaborate*copy-see-to-output-link
  27986. -->
  27987. (I3 ^see 0 +)
  27988. Retracting propose*predict-no
  27989. -->
  27990. (O2182 ^name predict-no +)
  27991. (S1 ^operator O2182 +)
  27992. Retracting propose*predict-yes
  27993. -->
  27994. (O2181 ^name predict-yes +)
  27995. (S1 ^operator O2181 +)
  27996. Retracting elaborate*reward*based*on*reward
  27997. -->
  27998. (R1094 ^value 1 +)
  27999. (R1 ^reward R1094 +)
  28000. Retracting elaborate*copy-dir-to-output-link
  28001. -->
  28002. (I3 ^dir L +)
  28003. Retracting rl*prefer*rvt*predict-no*H0*4
  28004. -->
  28005. (S1 ^operator O2182 = 0.3145038061064807)
  28006. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  28007. -->
  28008. (S1 ^operator O2182 = 0.685451056996617)
  28009. Retracting rl*prefer*rvt*predict-yes*H0*3
  28010. -->
  28011. (S1 ^operator O2181 = 0.3907872220793651)
  28012. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  28013. -->
  28014. (S1 ^operator O2181 = -0.208713043145708)
  28015. =>WM: (15348: S1 ^operator O2184 +)
  28016. =>WM: (15347: S1 ^operator O2183 +)
  28017. =>WM: (15346: O2184 ^name predict-no)
  28018. =>WM: (15345: O2183 ^name predict-yes)
  28019. =>WM: (15344: R1095 ^value 1)
  28020. =>WM: (15343: R1 ^reward R1095)
  28021. <=WM: (15334: S1 ^operator O2181 +)
  28022. <=WM: (15335: S1 ^operator O2182 +)
  28023. <=WM: (15336: S1 ^operator O2182)
  28024. <=WM: (15330: R1 ^reward R1094)
  28025. <=WM: (15333: O2182 ^name predict-no)
  28026. <=WM: (15332: O2181 ^name predict-yes)
  28027. <=WM: (15331: R1094 ^value 1)
  28028. --- Inner Elaboration Phase, active level 1 (S1) ---
  28029. Firing prefer*rvt*predict-yes*H0
  28030. -->
  28031. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  28032. -->
  28033. (S1 ^operator O2183 = -0.208713043145708)
  28034. Firing rl*prefer*rvt*predict-yes*H0*3
  28035. -->
  28036. (S1 ^operator O2183 = 0.3907872220793651)
  28037. Firing prefer*rvt*predict-yes*H0*3*H1
  28038. -->
  28039. Firing prefer*rvt*predict-no*H0
  28040. -->
  28041. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  28042. -->
  28043. (S1 ^operator O2184 = 0.685451056996617)
  28044. Firing rl*prefer*rvt*predict-no*H0*4
  28045. -->
  28046. (S1 ^operator O2184 = 0.3145038061064807)
  28047. Firing prefer*rvt*predict-no*H0*4*H1
  28048. -->
  28049. inner elaboration loop at bottom goal.
  28050. Retracting rl*prefer*rvt*predict-no*H0*4
  28051. -->
  28052. (S1 ^operator O2182 = 0.3145038061064807)
  28053. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  28054. -->
  28055. (S1 ^operator O2182 = 0.685451056996617)
  28056. Retracting rl*prefer*rvt*predict-yes*H0*3
  28057. -->
  28058. (S1 ^operator O2181 = 0.3907872220793651)
  28059. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  28060. -->
  28061. (S1 ^operator O2181 = -0.208713043145708)
  28062. --- END Proposal Phase ---
  28063. --- Decision Phase ---
  28064. RL update rl*prefer*rvt*predict-no*H0*4 0.478552 -0.164048 0.314504 -> 0.478555 -0.164048 0.314507(R,m,v=1,0.928571,0.0667237)
  28065. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521407 0.164044 0.685451 -> 0.521411 0.164044 0.685455(R,m,v=1,1,0)
  28066. =>WM: (15349: S1 ^operator O2184)
  28067. 1092: O: O2184 (predict-no)
  28068. --- END Decision Phase ---
  28069. --- Application Phase ---
  28070. --- Firing Productions (PE) For State At Depth 1 ---
  28071. --- Inner Elaboration Phase, active level 1 (S1) ---
  28072. Firing apply*operator
  28073. -->
  28074. (I3 ^predict-no N1092 + :O )
  28075. Firing apply*operator*complete
  28076. -->
  28077. (I3 ^predict-no N1091 - :O )
  28078. inner elaboration loop at bottom goal.
  28079. --- Change Working Memory (PE) ---
  28080. =>WM: (15350: I3 ^predict-no N1092)
  28081. <=WM: (15338: N1091 ^status complete)
  28082. <=WM: (15337: I3 ^predict-no N1091)
  28083. --- Firing Productions (IE) For State At Depth 1 ---
  28084. --- Inner Elaboration Phase, active level 1 (S1) ---
  28085. Firing monitor*world
  28086. -->
  28087. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28088. --- Change Working Memory (IE) ---
  28089. --- END Application Phase ---
  28090. --- Output Phase ---
  28091. ENV: Agent did: predict-no for direction L in state State-A
  28092. In State-A moving L
  28093. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  28094. predict error 0
  28095. dir: dir isR
  28096. --- END Output Phase ---
  28097. --- Input Phase ---
  28098. =>WM: (15354: I2 ^dir R)
  28099. =>WM: (15353: I2 ^reward 1)
  28100. =>WM: (15352: I2 ^see 0)
  28101. =>WM: (15351: N1092 ^status complete)
  28102. <=WM: (15341: I2 ^dir L)
  28103. <=WM: (15340: I2 ^reward 1)
  28104. <=WM: (15339: I2 ^see 0)
  28105. =>WM: (15355: I2 ^level-1 L0-root)
  28106. <=WM: (15342: I2 ^level-1 L0-root)
  28107. --- END Input Phase ---
  28108. --- Proposal Phase ---
  28109. --- Inner Elaboration Phase, active level 1 (S1) ---
  28110. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  28111. -->
  28112. (S1 ^operator O2183 = 0.8783989983456222)
  28113. Firing prefer*rvt*predict-yes*H0*5*H1
  28114. -->
  28115. Firing elaborate*copy-see-to-output-link
  28116. -->
  28117. (I3 ^see 0 +)
  28118. Firing elaborate*reward*based*on*reward
  28119. -->
  28120. (R1096 ^value 1 +)
  28121. (R1 ^reward R1096 +)
  28122. Firing propose*predict-yes
  28123. -->
  28124. (O2185 ^name predict-yes +)
  28125. (S1 ^operator O2185 +)
  28126. Firing propose*predict-no
  28127. -->
  28128. (O2186 ^name predict-no +)
  28129. (S1 ^operator O2186 +)
  28130. Firing rl*prefer*rvt*predict-no*H0*6
  28131. -->
  28132. (S1 ^operator O2184 = 0.9436253760703815)
  28133. Firing rl*prefer*rvt*predict-yes*H0*5
  28134. -->
  28135. (S1 ^operator O2183 = 0.1215961184552382)
  28136. Firing prefer*rvt*predict-yes*H0
  28137. -->
  28138. Firing prefer*rvt*predict-no*H0
  28139. -->
  28140. Firing elaborate*copy-dir-to-output-link
  28141. -->
  28142. (I3 ^dir R +)
  28143. inner elaboration loop at bottom goal.
  28144. Retracting elaborate*copy-see-to-output-link
  28145. -->
  28146. (I3 ^see 0 +)
  28147. Retracting propose*predict-no
  28148. -->
  28149. (O2184 ^name predict-no +)
  28150. (S1 ^operator O2184 +)
  28151. Retracting propose*predict-yes
  28152. -->
  28153. (O2183 ^name predict-yes +)
  28154. (S1 ^operator O2183 +)
  28155. Retracting elaborate*reward*based*on*reward
  28156. -->
  28157. (R1095 ^value 1 +)
  28158. (R1 ^reward R1095 +)
  28159. Retracting elaborate*copy-dir-to-output-link
  28160. -->
  28161. (I3 ^dir L +)
  28162. Retracting rl*prefer*rvt*predict-no*H0*4
  28163. -->
  28164. (S1 ^operator O2184 = 0.3145074913744749)
  28165. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  28166. -->
  28167. (S1 ^operator O2184 = 0.685455356981167)
  28168. Retracting rl*prefer*rvt*predict-yes*H0*3
  28169. -->
  28170. (S1 ^operator O2183 = 0.3907872220793651)
  28171. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  28172. -->
  28173. (S1 ^operator O2183 = -0.208713043145708)
  28174. =>WM: (15362: S1 ^operator O2186 +)
  28175. =>WM: (15361: S1 ^operator O2185 +)
  28176. =>WM: (15360: I3 ^dir R)
  28177. =>WM: (15359: O2186 ^name predict-no)
  28178. =>WM: (15358: O2185 ^name predict-yes)
  28179. =>WM: (15357: R1096 ^value 1)
  28180. =>WM: (15356: R1 ^reward R1096)
  28181. <=WM: (15347: S1 ^operator O2183 +)
  28182. <=WM: (15348: S1 ^operator O2184 +)
  28183. <=WM: (15349: S1 ^operator O2184)
  28184. <=WM: (15307: I3 ^dir L)
  28185. <=WM: (15343: R1 ^reward R1095)
  28186. <=WM: (15346: O2184 ^name predict-no)
  28187. <=WM: (15345: O2183 ^name predict-yes)
  28188. <=WM: (15344: R1095 ^value 1)
  28189. --- Inner Elaboration Phase, active level 1 (S1) ---
  28190. Firing prefer*rvt*predict-yes*H0
  28191. -->
  28192. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  28193. -->
  28194. (S1 ^operator O2185 = 0.8783989983456222)
  28195. Firing rl*prefer*rvt*predict-yes*H0*5
  28196. -->
  28197. (S1 ^operator O2185 = 0.1215961184552382)
  28198. Firing prefer*rvt*predict-yes*H0*5*H1
  28199. -->
  28200. Firing prefer*rvt*predict-no*H0
  28201. -->
  28202. Firing rl*prefer*rvt*predict-no*H0*6
  28203. -->
  28204. (S1 ^operator O2186 = 0.9436253760703815)
  28205. inner elaboration loop at bottom goal.
  28206. Retracting rl*prefer*rvt*predict-no*H0*6
  28207. -->
  28208. (S1 ^operator O2184 = 0.9436253760703815)
  28209. Retracting rl*prefer*rvt*predict-yes*H0*5
  28210. -->
  28211. (S1 ^operator O2183 = 0.1215961184552382)
  28212. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  28213. -->
  28214. (S1 ^operator O2183 = 0.8783989983456222)
  28215. --- END Proposal Phase ---
  28216. --- Decision Phase ---
  28217. RL update rl*prefer*rvt*predict-no*H0*4 0.478555 -0.164048 0.314507 -> 0.478558 -0.164048 0.314511(R,m,v=1,0.928994,0.0663567)
  28218. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521411 0.164044 0.685455 -> 0.521414 0.164045 0.685459(R,m,v=1,1,0)
  28219. =>WM: (15363: S1 ^operator O2185)
  28220. 1093: O: O2185 (predict-yes)
  28221. --- END Decision Phase ---
  28222. --- Application Phase ---
  28223. --- Firing Productions (PE) For State At Depth 1 ---
  28224. --- Inner Elaboration Phase, active level 1 (S1) ---
  28225. Firing apply*operator
  28226. -->
  28227. (I3 ^predict-yes N1093 + :O )
  28228. Firing apply*operator*complete
  28229. -->
  28230. (I3 ^predict-no N1092 - :O )
  28231. inner elaboration loop at bottom goal.
  28232. --- Change Working Memory (PE) ---
  28233. =>WM: (15364: I3 ^predict-yes N1093)
  28234. <=WM: (15351: N1092 ^status complete)
  28235. <=WM: (15350: I3 ^predict-no N1092)
  28236. --- Firing Productions (IE) For State At Depth 1 ---
  28237. --- Inner Elaboration Phase, active level 1 (S1) ---
  28238. Firing monitor*world
  28239. -->
  28240. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28241. --- Change Working Memory (IE) ---
  28242. --- END Application Phase ---
  28243. --- Output Phase ---
  28244. ENV: Agent did: predict-yes for direction R in state State-A
  28245. In State-A moving R
  28246. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28247. predict error 0
  28248. dir: dir isL
  28249. --- END Output Phase ---
  28250. --- Input Phase ---
  28251. =>WM: (15368: I2 ^dir L)
  28252. =>WM: (15367: I2 ^reward 1)
  28253. =>WM: (15366: I2 ^see 1)
  28254. =>WM: (15365: N1093 ^status complete)
  28255. <=WM: (15354: I2 ^dir R)
  28256. <=WM: (15353: I2 ^reward 1)
  28257. <=WM: (15352: I2 ^see 0)
  28258. =>WM: (15369: I2 ^level-1 R1-root)
  28259. <=WM: (15355: I2 ^level-1 L0-root)
  28260. --- END Input Phase ---
  28261. --- Proposal Phase ---
  28262. --- Inner Elaboration Phase, active level 1 (S1) ---
  28263. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  28264. -->
  28265. (S1 ^operator O2186 = -0.168718511744511)
  28266. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  28267. -->
  28268. (S1 ^operator O2185 = 0.6092584839497481)
  28269. Firing prefer*rvt*predict-no*H0*4*H1
  28270. -->
  28271. Firing prefer*rvt*predict-yes*H0*3*H1
  28272. -->
  28273. Firing elaborate*copy-see-to-output-link
  28274. -->
  28275. (I3 ^see 1 +)
  28276. Firing elaborate*reward*based*on*reward
  28277. -->
  28278. (R1097 ^value 1 +)
  28279. (R1 ^reward R1097 +)
  28280. Firing propose*predict-yes
  28281. -->
  28282. (O2187 ^name predict-yes +)
  28283. (S1 ^operator O2187 +)
  28284. Firing propose*predict-no
  28285. -->
  28286. (O2188 ^name predict-no +)
  28287. (S1 ^operator O2188 +)
  28288. Firing rl*prefer*rvt*predict-no*H0*4
  28289. -->
  28290. (S1 ^operator O2186 = 0.3145105217381143)
  28291. Firing rl*prefer*rvt*predict-yes*H0*3
  28292. -->
  28293. (S1 ^operator O2185 = 0.3907872220793651)
  28294. Firing prefer*rvt*predict-yes*H0
  28295. -->
  28296. Firing prefer*rvt*predict-no*H0
  28297. -->
  28298. Firing elaborate*copy-dir-to-output-link
  28299. -->
  28300. (I3 ^dir L +)
  28301. inner elaboration loop at bottom goal.
  28302. Retracting elaborate*copy-see-to-output-link
  28303. -->
  28304. (I3 ^see 0 +)
  28305. Retracting propose*predict-no
  28306. -->
  28307. (O2186 ^name predict-no +)
  28308. (S1 ^operator O2186 +)
  28309. Retracting propose*predict-yes
  28310. -->
  28311. (O2185 ^name predict-yes +)
  28312. (S1 ^operator O2185 +)
  28313. Retracting elaborate*reward*based*on*reward
  28314. -->
  28315. (R1096 ^value 1 +)
  28316. (R1 ^reward R1096 +)
  28317. Retracting elaborate*copy-dir-to-output-link
  28318. -->
  28319. (I3 ^dir R +)
  28320. Retracting rl*prefer*rvt*predict-no*H0*6
  28321. -->
  28322. (S1 ^operator O2186 = 0.9436253760703815)
  28323. Retracting rl*prefer*rvt*predict-yes*H0*5
  28324. -->
  28325. (S1 ^operator O2185 = 0.1215961184552382)
  28326. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  28327. -->
  28328. (S1 ^operator O2185 = 0.8783989983456222)
  28329. =>WM: (15377: S1 ^operator O2188 +)
  28330. =>WM: (15376: S1 ^operator O2187 +)
  28331. =>WM: (15375: I3 ^dir L)
  28332. =>WM: (15374: O2188 ^name predict-no)
  28333. =>WM: (15373: O2187 ^name predict-yes)
  28334. =>WM: (15372: R1097 ^value 1)
  28335. =>WM: (15371: R1 ^reward R1097)
  28336. =>WM: (15370: I3 ^see 1)
  28337. <=WM: (15361: S1 ^operator O2185 +)
  28338. <=WM: (15363: S1 ^operator O2185)
  28339. <=WM: (15362: S1 ^operator O2186 +)
  28340. <=WM: (15360: I3 ^dir R)
  28341. <=WM: (15356: R1 ^reward R1096)
  28342. <=WM: (15262: I3 ^see 0)
  28343. <=WM: (15359: O2186 ^name predict-no)
  28344. <=WM: (15358: O2185 ^name predict-yes)
  28345. <=WM: (15357: R1096 ^value 1)
  28346. --- Inner Elaboration Phase, active level 1 (S1) ---
  28347. Firing prefer*rvt*predict-yes*H0
  28348. -->
  28349. Firing rl*prefer*rvt*predict-yes*H0*3
  28350. -->
  28351. (S1 ^operator O2187 = 0.3907872220793651)
  28352. Firing prefer*rvt*predict-yes*H0*3*H1
  28353. -->
  28354. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  28355. -->
  28356. (S1 ^operator O2187 = 0.6092584839497481)
  28357. Firing prefer*rvt*predict-no*H0
  28358. -->
  28359. Firing rl*prefer*rvt*predict-no*H0*4
  28360. -->
  28361. (S1 ^operator O2188 = 0.3145105217381143)
  28362. Firing prefer*rvt*predict-no*H0*4*H1
  28363. -->
  28364. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  28365. -->
  28366. (S1 ^operator O2188 = -0.168718511744511)
  28367. inner elaboration loop at bottom goal.
  28368. Retracting rl*prefer*rvt*predict-no*H0*4
  28369. -->
  28370. (S1 ^operator O2186 = 0.3145105217381143)
  28371. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  28372. -->
  28373. (S1 ^operator O2186 = -0.168718511744511)
  28374. Retracting rl*prefer*rvt*predict-yes*H0*3
  28375. -->
  28376. (S1 ^operator O2185 = 0.3907872220793651)
  28377. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  28378. -->
  28379. (S1 ^operator O2185 = 0.6092584839497481)
  28380. --- END Proposal Phase ---
  28381. --- Decision Phase ---
  28382. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.876289,0.108969)
  28383. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465473 0.412926 0.878399 -> 0.465474 0.412926 0.878399(R,m,v=1,1,0)
  28384. =>WM: (15378: S1 ^operator O2187)
  28385. 1094: O: O2187 (predict-yes)
  28386. --- END Decision Phase ---
  28387. --- Application Phase ---
  28388. --- Firing Productions (PE) For State At Depth 1 ---
  28389. --- Inner Elaboration Phase, active level 1 (S1) ---
  28390. Firing apply*operator
  28391. -->
  28392. (I3 ^predict-yes N1094 + :O )
  28393. Firing apply*operator*complete
  28394. -->
  28395. (I3 ^predict-yes N1093 - :O )
  28396. inner elaboration loop at bottom goal.
  28397. --- Change Working Memory (PE) ---
  28398. =>WM: (15379: I3 ^predict-yes N1094)
  28399. <=WM: (15365: N1093 ^status complete)
  28400. <=WM: (15364: I3 ^predict-yes N1093)
  28401. --- Firing Productions (IE) For State At Depth 1 ---
  28402. --- Inner Elaboration Phase, active level 1 (S1) ---
  28403. Firing monitor*world
  28404. -->
  28405. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28406. --- Change Working Memory (IE) ---
  28407. --- END Application Phase ---
  28408. --- Output Phase ---
  28409. ENV: Agent did: predict-yes for direction L in state State-B
  28410. In State-B moving L
  28411. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28412. predict error 0
  28413. dir: dir isR
  28414. --- END Output Phase ---
  28415. --- Input Phase ---
  28416. =>WM: (15383: I2 ^dir R)
  28417. =>WM: (15382: I2 ^reward 1)
  28418. =>WM: (15381: I2 ^see 1)
  28419. =>WM: (15380: N1094 ^status complete)
  28420. <=WM: (15368: I2 ^dir L)
  28421. <=WM: (15367: I2 ^reward 1)
  28422. <=WM: (15366: I2 ^see 1)
  28423. =>WM: (15384: I2 ^level-1 L1-root)
  28424. <=WM: (15369: I2 ^level-1 R1-root)
  28425. --- END Input Phase ---
  28426. --- Proposal Phase ---
  28427. --- Inner Elaboration Phase, active level 1 (S1) ---
  28428. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  28429. -->
  28430. (S1 ^operator O2187 = 0.8784070247478919)
  28431. Firing prefer*rvt*predict-yes*H0*5*H1
  28432. -->
  28433. Firing elaborate*copy-see-to-output-link
  28434. -->
  28435. (I3 ^see 1 +)
  28436. Firing elaborate*reward*based*on*reward
  28437. -->
  28438. (R1098 ^value 1 +)
  28439. (R1 ^reward R1098 +)
  28440. Firing propose*predict-yes
  28441. -->
  28442. (O2189 ^name predict-yes +)
  28443. (S1 ^operator O2189 +)
  28444. Firing propose*predict-no
  28445. -->
  28446. (O2190 ^name predict-no +)
  28447. (S1 ^operator O2190 +)
  28448. Firing rl*prefer*rvt*predict-no*H0*6
  28449. -->
  28450. (S1 ^operator O2188 = 0.9436253760703815)
  28451. Firing rl*prefer*rvt*predict-yes*H0*5
  28452. -->
  28453. (S1 ^operator O2187 = 0.1215965079981263)
  28454. Firing prefer*rvt*predict-yes*H0
  28455. -->
  28456. Firing prefer*rvt*predict-no*H0
  28457. -->
  28458. Firing elaborate*copy-dir-to-output-link
  28459. -->
  28460. (I3 ^dir R +)
  28461. inner elaboration loop at bottom goal.
  28462. Retracting elaborate*copy-see-to-output-link
  28463. -->
  28464. (I3 ^see 1 +)
  28465. Retracting propose*predict-no
  28466. -->
  28467. (O2188 ^name predict-no +)
  28468. (S1 ^operator O2188 +)
  28469. Retracting propose*predict-yes
  28470. -->
  28471. (O2187 ^name predict-yes +)
  28472. (S1 ^operator O2187 +)
  28473. Retracting elaborate*reward*based*on*reward
  28474. -->
  28475. (R1097 ^value 1 +)
  28476. (R1 ^reward R1097 +)
  28477. Retracting elaborate*copy-dir-to-output-link
  28478. -->
  28479. (I3 ^dir L +)
  28480. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  28481. -->
  28482. (S1 ^operator O2188 = -0.168718511744511)
  28483. Retracting rl*prefer*rvt*predict-no*H0*4
  28484. -->
  28485. (S1 ^operator O2188 = 0.3145105217381143)
  28486. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  28487. -->
  28488. (S1 ^operator O2187 = 0.6092584839497481)
  28489. Retracting rl*prefer*rvt*predict-yes*H0*3
  28490. -->
  28491. (S1 ^operator O2187 = 0.3907872220793651)
  28492. =>WM: (15391: S1 ^operator O2190 +)
  28493. =>WM: (15390: S1 ^operator O2189 +)
  28494. =>WM: (15389: I3 ^dir R)
  28495. =>WM: (15388: O2190 ^name predict-no)
  28496. =>WM: (15387: O2189 ^name predict-yes)
  28497. =>WM: (15386: R1098 ^value 1)
  28498. =>WM: (15385: R1 ^reward R1098)
  28499. <=WM: (15376: S1 ^operator O2187 +)
  28500. <=WM: (15378: S1 ^operator O2187)
  28501. <=WM: (15377: S1 ^operator O2188 +)
  28502. <=WM: (15375: I3 ^dir L)
  28503. <=WM: (15371: R1 ^reward R1097)
  28504. <=WM: (15374: O2188 ^name predict-no)
  28505. <=WM: (15373: O2187 ^name predict-yes)
  28506. <=WM: (15372: R1097 ^value 1)
  28507. --- Inner Elaboration Phase, active level 1 (S1) ---
  28508. Firing prefer*rvt*predict-yes*H0
  28509. -->
  28510. Firing rl*prefer*rvt*predict-yes*H0*5
  28511. -->
  28512. (S1 ^operator O2189 = 0.1215965079981263)
  28513. Firing prefer*rvt*predict-yes*H0*5*H1
  28514. -->
  28515. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  28516. -->
  28517. (S1 ^operator O2189 = 0.8784070247478919)
  28518. Firing prefer*rvt*predict-no*H0
  28519. -->
  28520. Firing rl*prefer*rvt*predict-no*H0*6
  28521. -->
  28522. (S1 ^operator O2190 = 0.9436253760703815)
  28523. inner elaboration loop at bottom goal.
  28524. Retracting rl*prefer*rvt*predict-no*H0*6
  28525. -->
  28526. (S1 ^operator O2188 = 0.9436253760703815)
  28527. Retracting rl*prefer*rvt*predict-yes*H0*5
  28528. -->
  28529. (S1 ^operator O2187 = 0.1215965079981263)
  28530. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  28531. -->
  28532. (S1 ^operator O2187 = 0.8784070247478919)
  28533. --- END Proposal Phase ---
  28534. --- Decision Phase ---
  28535. RL update rl*prefer*rvt*predict-yes*H0*3 0.472332 -0.0815445 0.390787 -> 0.472329 -0.0815451 0.390784(R,m,v=1,0.949721,0.0480196)
  28536. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527707 0.0815513 0.609258 -> 0.527704 0.0815506 0.609254(R,m,v=1,1,0)
  28537. =>WM: (15392: S1 ^operator O2189)
  28538. 1095: O: O2189 (predict-yes)
  28539. --- END Decision Phase ---
  28540. --- Application Phase ---
  28541. --- Firing Productions (PE) For State At Depth 1 ---
  28542. --- Inner Elaboration Phase, active level 1 (S1) ---
  28543. Firing apply*operator
  28544. -->
  28545. (I3 ^predict-yes N1095 + :O )
  28546. Firing apply*operator*complete
  28547. -->
  28548. (I3 ^predict-yes N1094 - :O )
  28549. inner elaboration loop at bottom goal.
  28550. --- Change Working Memory (PE) ---
  28551. =>WM: (15393: I3 ^predict-yes N1095)
  28552. <=WM: (15380: N1094 ^status complete)
  28553. <=WM: (15379: I3 ^predict-yes N1094)
  28554. --- Firing Productions (IE) For State At Depth 1 ---
  28555. --- Inner Elaboration Phase, active level 1 (S1) ---
  28556. Firing monitor*world
  28557. -->
  28558. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28559. --- Change Working Memory (IE) ---
  28560. --- END Application Phase ---
  28561. --- Output Phase ---
  28562. ENV: Agent did: predict-yes for direction R in state State-A
  28563. In State-A moving R
  28564. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  28565. predict error 0
  28566. dir: dir isU
  28567. --- END Output Phase ---
  28568. --- Input Phase ---
  28569. =>WM: (15397: I2 ^dir U)
  28570. =>WM: (15396: I2 ^reward 1)
  28571. =>WM: (15395: I2 ^see 1)
  28572. =>WM: (15394: N1095 ^status complete)
  28573. <=WM: (15383: I2 ^dir R)
  28574. <=WM: (15382: I2 ^reward 1)
  28575. <=WM: (15381: I2 ^see 1)
  28576. =>WM: (15398: I2 ^level-1 R1-root)
  28577. <=WM: (15384: I2 ^level-1 L1-root)
  28578. --- END Input Phase ---
  28579. --- Proposal Phase ---
  28580. --- Inner Elaboration Phase, active level 1 (S1) ---
  28581. Firing elaborate*copy-see-to-output-link
  28582. -->
  28583. (I3 ^see 1 +)
  28584. Firing elaborate*reward*based*on*reward
  28585. -->
  28586. (R1099 ^value 1 +)
  28587. (R1 ^reward R1099 +)
  28588. Firing propose*predict-yes
  28589. -->
  28590. (O2191 ^name predict-yes +)
  28591. (S1 ^operator O2191 +)
  28592. Firing propose*predict-no
  28593. -->
  28594. (O2192 ^name predict-no +)
  28595. (S1 ^operator O2192 +)
  28596. Firing rl*prefer*rvt*predict-no*H0*2
  28597. -->
  28598. (S1 ^operator O2190 = 1.)
  28599. Firing rl*prefer*rvt*predict-yes*H0*1
  28600. -->
  28601. (S1 ^operator O2189 = 0.)
  28602. Firing prefer*rvt*predict-yes*H0
  28603. -->
  28604. Firing prefer*rvt*predict-no*H0
  28605. -->
  28606. Firing elaborate*copy-dir-to-output-link
  28607. -->
  28608. (I3 ^dir U +)
  28609. inner elaboration loop at bottom goal.
  28610. Retracting elaborate*copy-see-to-output-link
  28611. -->
  28612. (I3 ^see 1 +)
  28613. Retracting propose*predict-no
  28614. -->
  28615. (O2190 ^name predict-no +)
  28616. (S1 ^operator O2190 +)
  28617. Retracting propose*predict-yes
  28618. -->
  28619. (O2189 ^name predict-yes +)
  28620. (S1 ^operator O2189 +)
  28621. Retracting elaborate*reward*based*on*reward
  28622. -->
  28623. (R1098 ^value 1 +)
  28624. (R1 ^reward R1098 +)
  28625. Retracting elaborate*copy-dir-to-output-link
  28626. -->
  28627. (I3 ^dir R +)
  28628. Retracting rl*prefer*rvt*predict-no*H0*6
  28629. -->
  28630. (S1 ^operator O2190 = 0.9436253760703815)
  28631. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  28632. -->
  28633. (S1 ^operator O2189 = 0.8784070247478919)
  28634. Retracting rl*prefer*rvt*predict-yes*H0*5
  28635. -->
  28636. (S1 ^operator O2189 = 0.1215965079981263)
  28637. =>WM: (15405: S1 ^operator O2192 +)
  28638. =>WM: (15404: S1 ^operator O2191 +)
  28639. =>WM: (15403: I3 ^dir U)
  28640. =>WM: (15402: O2192 ^name predict-no)
  28641. =>WM: (15401: O2191 ^name predict-yes)
  28642. =>WM: (15400: R1099 ^value 1)
  28643. =>WM: (15399: R1 ^reward R1099)
  28644. <=WM: (15390: S1 ^operator O2189 +)
  28645. <=WM: (15392: S1 ^operator O2189)
  28646. <=WM: (15391: S1 ^operator O2190 +)
  28647. <=WM: (15389: I3 ^dir R)
  28648. <=WM: (15385: R1 ^reward R1098)
  28649. <=WM: (15388: O2190 ^name predict-no)
  28650. <=WM: (15387: O2189 ^name predict-yes)
  28651. <=WM: (15386: R1098 ^value 1)
  28652. --- Inner Elaboration Phase, active level 1 (S1) ---
  28653. Firing prefer*rvt*predict-yes*H0
  28654. -->
  28655. Firing rl*prefer*rvt*predict-yes*H0*1
  28656. -->
  28657. (S1 ^operator O2191 = 0.)
  28658. Firing prefer*rvt*predict-no*H0
  28659. -->
  28660. Firing rl*prefer*rvt*predict-no*H0*2
  28661. -->
  28662. (S1 ^operator O2192 = 1.)
  28663. inner elaboration loop at bottom goal.
  28664. Retracting rl*prefer*rvt*predict-no*H0*2
  28665. -->
  28666. (S1 ^operator O2190 = 1.)
  28667. Retracting rl*prefer*rvt*predict-yes*H0*1
  28668. -->
  28669. (S1 ^operator O2189 = 0.)
  28670. --- END Proposal Phase ---
  28671. --- Decision Phase ---
  28672. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121596(R,m,v=1,0.876923,0.108485)
  28673. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878407(R,m,v=1,1,0)
  28674. =>WM: (15406: S1 ^operator O2192)
  28675. 1096: O: O2192 (predict-no)
  28676. --- END Decision Phase ---
  28677. --- Application Phase ---
  28678. --- Firing Productions (PE) For State At Depth 1 ---
  28679. --- Inner Elaboration Phase, active level 1 (S1) ---
  28680. Firing apply*operator
  28681. -->
  28682. (I3 ^predict-no N1096 + :O )
  28683. Firing apply*operator*complete
  28684. -->
  28685. (I3 ^predict-yes N1095 - :O )
  28686. inner elaboration loop at bottom goal.
  28687. --- Change Working Memory (PE) ---
  28688. =>WM: (15407: I3 ^predict-no N1096)
  28689. <=WM: (15394: N1095 ^status complete)
  28690. <=WM: (15393: I3 ^predict-yes N1095)
  28691. --- Firing Productions (IE) For State At Depth 1 ---
  28692. --- Inner Elaboration Phase, active level 1 (S1) ---
  28693. Firing monitor*world
  28694. -->
  28695. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28696. --- Change Working Memory (IE) ---
  28697. --- END Application Phase ---
  28698. --- Output Phase ---
  28699. ENV: Agent did: predict-no for direction U in state State-B
  28700. In State-B moving U
  28701. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28702. predict error 0
  28703. dir: dir isU
  28704. --- END Output Phase ---
  28705. --- Input Phase ---
  28706. =>WM: (15411: I2 ^dir U)
  28707. =>WM: (15410: I2 ^reward 1)
  28708. =>WM: (15409: I2 ^see 0)
  28709. =>WM: (15408: N1096 ^status complete)
  28710. <=WM: (15397: I2 ^dir U)
  28711. <=WM: (15396: I2 ^reward 1)
  28712. <=WM: (15395: I2 ^see 1)
  28713. =>WM: (15412: I2 ^level-1 R1-root)
  28714. <=WM: (15398: I2 ^level-1 R1-root)
  28715. --- END Input Phase ---
  28716. --- Proposal Phase ---
  28717. --- Inner Elaboration Phase, active level 1 (S1) ---
  28718. Firing elaborate*copy-see-to-output-link
  28719. -->
  28720. (I3 ^see 0 +)
  28721. Firing elaborate*reward*based*on*reward
  28722. -->
  28723. (R1100 ^value 1 +)
  28724. (R1 ^reward R1100 +)
  28725. Firing propose*predict-yes
  28726. -->
  28727. (O2193 ^name predict-yes +)
  28728. (S1 ^operator O2193 +)
  28729. Firing propose*predict-no
  28730. -->
  28731. (O2194 ^name predict-no +)
  28732. (S1 ^operator O2194 +)
  28733. Firing rl*prefer*rvt*predict-no*H0*2
  28734. -->
  28735. (S1 ^operator O2192 = 1.)
  28736. Firing rl*prefer*rvt*predict-yes*H0*1
  28737. -->
  28738. (S1 ^operator O2191 = 0.)
  28739. Firing prefer*rvt*predict-yes*H0
  28740. -->
  28741. Firing prefer*rvt*predict-no*H0
  28742. -->
  28743. Firing elaborate*copy-dir-to-output-link
  28744. -->
  28745. (I3 ^dir U +)
  28746. inner elaboration loop at bottom goal.
  28747. Retracting elaborate*copy-see-to-output-link
  28748. -->
  28749. (I3 ^see 1 +)
  28750. Retracting propose*predict-no
  28751. -->
  28752. (O2192 ^name predict-no +)
  28753. (S1 ^operator O2192 +)
  28754. Retracting propose*predict-yes
  28755. -->
  28756. (O2191 ^name predict-yes +)
  28757. (S1 ^operator O2191 +)
  28758. Retracting elaborate*reward*based*on*reward
  28759. -->
  28760. (R1099 ^value 1 +)
  28761. (R1 ^reward R1099 +)
  28762. Retracting elaborate*copy-dir-to-output-link
  28763. -->
  28764. (I3 ^dir U +)
  28765. Retracting rl*prefer*rvt*predict-no*H0*2
  28766. -->
  28767. (S1 ^operator O2192 = 1.)
  28768. Retracting rl*prefer*rvt*predict-yes*H0*1
  28769. -->
  28770. (S1 ^operator O2191 = 0.)
  28771. =>WM: (15419: S1 ^operator O2194 +)
  28772. =>WM: (15418: S1 ^operator O2193 +)
  28773. =>WM: (15417: O2194 ^name predict-no)
  28774. =>WM: (15416: O2193 ^name predict-yes)
  28775. =>WM: (15415: R1100 ^value 1)
  28776. =>WM: (15414: R1 ^reward R1100)
  28777. =>WM: (15413: I3 ^see 0)
  28778. <=WM: (15404: S1 ^operator O2191 +)
  28779. <=WM: (15405: S1 ^operator O2192 +)
  28780. <=WM: (15406: S1 ^operator O2192)
  28781. <=WM: (15399: R1 ^reward R1099)
  28782. <=WM: (15370: I3 ^see 1)
  28783. <=WM: (15402: O2192 ^name predict-no)
  28784. <=WM: (15401: O2191 ^name predict-yes)
  28785. <=WM: (15400: R1099 ^value 1)
  28786. --- Inner Elaboration Phase, active level 1 (S1) ---
  28787. Firing prefer*rvt*predict-yes*H0
  28788. -->
  28789. Firing rl*prefer*rvt*predict-yes*H0*1
  28790. -->
  28791. (S1 ^operator O2193 = 0.)
  28792. Firing prefer*rvt*predict-no*H0
  28793. -->
  28794. Firing rl*prefer*rvt*predict-no*H0*2
  28795. -->
  28796. (S1 ^operator O2194 = 1.)
  28797. inner elaboration loop at bottom goal.
  28798. Retracting rl*prefer*rvt*predict-no*H0*2
  28799. -->
  28800. (S1 ^operator O2192 = 1.)
  28801. Retracting rl*prefer*rvt*predict-yes*H0*1
  28802. -->
  28803. (S1 ^operator O2191 = 0.)
  28804. --- END Proposal Phase ---
  28805. --- Decision Phase ---
  28806. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28807. =>WM: (15420: S1 ^operator O2194)
  28808. 1097: O: O2194 (predict-no)
  28809. --- END Decision Phase ---
  28810. --- Application Phase ---
  28811. --- Firing Productions (PE) For State At Depth 1 ---
  28812. --- Inner Elaboration Phase, active level 1 (S1) ---
  28813. Firing apply*operator
  28814. -->
  28815. (I3 ^predict-no N1097 + :O )
  28816. Firing apply*operator*complete
  28817. -->
  28818. (I3 ^predict-no N1096 - :O )
  28819. inner elaboration loop at bottom goal.
  28820. --- Change Working Memory (PE) ---
  28821. =>WM: (15421: I3 ^predict-no N1097)
  28822. <=WM: (15408: N1096 ^status complete)
  28823. <=WM: (15407: I3 ^predict-no N1096)
  28824. --- Firing Productions (IE) For State At Depth 1 ---
  28825. --- Inner Elaboration Phase, active level 1 (S1) ---
  28826. Firing monitor*world
  28827. -->
  28828. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  28829. --- Change Working Memory (IE) ---
  28830. --- END Application Phase ---
  28831. --- Output Phase ---
  28832. ENV: Agent did: predict-no for direction U in state State-B
  28833. In State-B moving U
  28834. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  28835. predict error 0
  28836. dir: dir isL
  28837. --- END Output Phase ---
  28838. --- Input Phase ---
  28839. =>WM: (15425: I2 ^dir L)
  28840. =>WM: (15424: I2 ^reward 1)
  28841. =>WM: (15423: I2 ^see 0)
  28842. =>WM: (15422: N1097 ^status complete)
  28843. <=WM: (15411: I2 ^dir U)
  28844. <=WM: (15410: I2 ^reward 1)
  28845. <=WM: (15409: I2 ^see 0)
  28846. =>WM: (15426: I2 ^level-1 R1-root)
  28847. <=WM: (15412: I2 ^level-1 R1-root)
  28848. --- END Input Phase ---
  28849. --- Proposal Phase ---
  28850. --- Inner Elaboration Phase, active level 1 (S1) ---
  28851. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  28852. -->
  28853. (S1 ^operator O2194 = -0.168718511744511)
  28854. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  28855. -->
  28856. (S1 ^operator O2193 = 0.6092542666242702)
  28857. Firing prefer*rvt*predict-no*H0*4*H1
  28858. -->
  28859. Firing prefer*rvt*predict-yes*H0*3*H1
  28860. -->
  28861. Firing elaborate*copy-see-to-output-link
  28862. -->
  28863. (I3 ^see 0 +)
  28864. Firing elaborate*reward*based*on*reward
  28865. -->
  28866. (R1101 ^value 1 +)
  28867. (R1 ^reward R1101 +)
  28868. Firing propose*predict-yes
  28869. -->
  28870. (O2195 ^name predict-yes +)
  28871. (S1 ^operator O2195 +)
  28872. Firing propose*predict-no
  28873. -->
  28874. (O2196 ^name predict-no +)
  28875. (S1 ^operator O2196 +)
  28876. Firing rl*prefer*rvt*predict-no*H0*4
  28877. -->
  28878. (S1 ^operator O2194 = 0.3145105217381143)
  28879. Firing rl*prefer*rvt*predict-yes*H0*3
  28880. -->
  28881. (S1 ^operator O2193 = 0.3907835285947055)
  28882. Firing prefer*rvt*predict-yes*H0
  28883. -->
  28884. Firing prefer*rvt*predict-no*H0
  28885. -->
  28886. Firing elaborate*copy-dir-to-output-link
  28887. -->
  28888. (I3 ^dir L +)
  28889. inner elaboration loop at bottom goal.
  28890. Retracting elaborate*copy-see-to-output-link
  28891. -->
  28892. (I3 ^see 0 +)
  28893. Retracting propose*predict-no
  28894. -->
  28895. (O2194 ^name predict-no +)
  28896. (S1 ^operator O2194 +)
  28897. Retracting propose*predict-yes
  28898. -->
  28899. (O2193 ^name predict-yes +)
  28900. (S1 ^operator O2193 +)
  28901. Retracting elaborate*reward*based*on*reward
  28902. -->
  28903. (R1100 ^value 1 +)
  28904. (R1 ^reward R1100 +)
  28905. Retracting elaborate*copy-dir-to-output-link
  28906. -->
  28907. (I3 ^dir U +)
  28908. Retracting rl*prefer*rvt*predict-no*H0*2
  28909. -->
  28910. (S1 ^operator O2194 = 1.)
  28911. Retracting rl*prefer*rvt*predict-yes*H0*1
  28912. -->
  28913. (S1 ^operator O2193 = 0.)
  28914. =>WM: (15433: S1 ^operator O2196 +)
  28915. =>WM: (15432: S1 ^operator O2195 +)
  28916. =>WM: (15431: I3 ^dir L)
  28917. =>WM: (15430: O2196 ^name predict-no)
  28918. =>WM: (15429: O2195 ^name predict-yes)
  28919. =>WM: (15428: R1101 ^value 1)
  28920. =>WM: (15427: R1 ^reward R1101)
  28921. <=WM: (15418: S1 ^operator O2193 +)
  28922. <=WM: (15419: S1 ^operator O2194 +)
  28923. <=WM: (15420: S1 ^operator O2194)
  28924. <=WM: (15403: I3 ^dir U)
  28925. <=WM: (15414: R1 ^reward R1100)
  28926. <=WM: (15417: O2194 ^name predict-no)
  28927. <=WM: (15416: O2193 ^name predict-yes)
  28928. <=WM: (15415: R1100 ^value 1)
  28929. --- Inner Elaboration Phase, active level 1 (S1) ---
  28930. Firing prefer*rvt*predict-yes*H0
  28931. -->
  28932. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  28933. -->
  28934. (S1 ^operator O2195 = 0.6092542666242702)
  28935. Firing rl*prefer*rvt*predict-yes*H0*3
  28936. -->
  28937. (S1 ^operator O2195 = 0.3907835285947055)
  28938. Firing prefer*rvt*predict-yes*H0*3*H1
  28939. -->
  28940. Firing prefer*rvt*predict-no*H0
  28941. -->
  28942. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  28943. -->
  28944. (S1 ^operator O2196 = -0.168718511744511)
  28945. Firing rl*prefer*rvt*predict-no*H0*4
  28946. -->
  28947. (S1 ^operator O2196 = 0.3145105217381143)
  28948. Firing prefer*rvt*predict-no*H0*4*H1
  28949. -->
  28950. inner elaboration loop at bottom goal.
  28951. Retracting rl*prefer*rvt*predict-no*H0*4
  28952. -->
  28953. (S1 ^operator O2194 = 0.3145105217381143)
  28954. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  28955. -->
  28956. (S1 ^operator O2194 = -0.168718511744511)
  28957. Retracting rl*prefer*rvt*predict-yes*H0*3
  28958. -->
  28959. (S1 ^operator O2193 = 0.3907835285947055)
  28960. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  28961. -->
  28962. (S1 ^operator O2193 = 0.6092542666242702)
  28963. --- END Proposal Phase ---
  28964. --- Decision Phase ---
  28965. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  28966. =>WM: (15434: S1 ^operator O2195)
  28967. 1098: O: O2195 (predict-yes)
  28968. --- END Decision Phase ---
  28969. --- Application Phase ---
  28970. --- Firing Productions (PE) For State At Depth 1 ---
  28971. --- Inner Elaboration Phase, active level 1 (S1) ---
  28972. Firing apply*operator
  28973. -->
  28974. (I3 ^predict-yes N1098 + :O )
  28975. Firing apply*operator*complete
  28976. -->
  28977. (I3 ^predict-no N1097 - :O )
  28978. inner elaboration loop at bottom goal.
  28979. --- Change Working Memory (PE) ---
  28980. =>WM: (15435: I3 ^predict-yes N1098)
  28981. <=WM: (15422: N1097 ^status complete)
  28982. <=WM: (15421: I3 ^predict-no N1097)
  28983. --- Firing Productions (IE) For State At Depth 1 ---
  28984. --- Inner Elaboration Phase, active level 1 (S1) ---
  28985. Firing monitor*world
  28986. -->
  28987. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  28988. --- Change Working Memory (IE) ---
  28989. --- END Application Phase ---
  28990. --- Output Phase ---
  28991. ENV: Agent did: predict-yes for direction L in state State-B
  28992. In State-B moving L
  28993. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  28994. predict error 0
  28995. dir: dir isL
  28996. --- END Output Phase ---
  28997. --- Input Phase ---
  28998. =>WM: (15439: I2 ^dir L)
  28999. =>WM: (15438: I2 ^reward 1)
  29000. =>WM: (15437: I2 ^see 1)
  29001. =>WM: (15436: N1098 ^status complete)
  29002. <=WM: (15425: I2 ^dir L)
  29003. <=WM: (15424: I2 ^reward 1)
  29004. <=WM: (15423: I2 ^see 0)
  29005. =>WM: (15440: I2 ^level-1 L1-root)
  29006. <=WM: (15426: I2 ^level-1 R1-root)
  29007. --- END Input Phase ---
  29008. --- Proposal Phase ---
  29009. --- Inner Elaboration Phase, active level 1 (S1) ---
  29010. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  29011. -->
  29012. (S1 ^operator O2195 = -0.2062723012911647)
  29013. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  29014. -->
  29015. (S1 ^operator O2196 = 0.685514319964578)
  29016. Firing prefer*rvt*predict-no*H0*4*H1
  29017. -->
  29018. Firing prefer*rvt*predict-yes*H0*3*H1
  29019. -->
  29020. Firing elaborate*copy-see-to-output-link
  29021. -->
  29022. (I3 ^see 1 +)
  29023. Firing elaborate*reward*based*on*reward
  29024. -->
  29025. (R1102 ^value 1 +)
  29026. (R1 ^reward R1102 +)
  29027. Firing propose*predict-yes
  29028. -->
  29029. (O2197 ^name predict-yes +)
  29030. (S1 ^operator O2197 +)
  29031. Firing propose*predict-no
  29032. -->
  29033. (O2198 ^name predict-no +)
  29034. (S1 ^operator O2198 +)
  29035. Firing rl*prefer*rvt*predict-no*H0*4
  29036. -->
  29037. (S1 ^operator O2196 = 0.3145105217381143)
  29038. Firing rl*prefer*rvt*predict-yes*H0*3
  29039. -->
  29040. (S1 ^operator O2195 = 0.3907835285947055)
  29041. Firing prefer*rvt*predict-yes*H0
  29042. -->
  29043. Firing prefer*rvt*predict-no*H0
  29044. -->
  29045. Firing elaborate*copy-dir-to-output-link
  29046. -->
  29047. (I3 ^dir L +)
  29048. inner elaboration loop at bottom goal.
  29049. Retracting elaborate*copy-see-to-output-link
  29050. -->
  29051. (I3 ^see 0 +)
  29052. Retracting propose*predict-no
  29053. -->
  29054. (O2196 ^name predict-no +)
  29055. (S1 ^operator O2196 +)
  29056. Retracting propose*predict-yes
  29057. -->
  29058. (O2195 ^name predict-yes +)
  29059. (S1 ^operator O2195 +)
  29060. Retracting elaborate*reward*based*on*reward
  29061. -->
  29062. (R1101 ^value 1 +)
  29063. (R1 ^reward R1101 +)
  29064. Retracting elaborate*copy-dir-to-output-link
  29065. -->
  29066. (I3 ^dir L +)
  29067. Retracting rl*prefer*rvt*predict-no*H0*4
  29068. -->
  29069. (S1 ^operator O2196 = 0.3145105217381143)
  29070. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  29071. -->
  29072. (S1 ^operator O2196 = -0.168718511744511)
  29073. Retracting rl*prefer*rvt*predict-yes*H0*3
  29074. -->
  29075. (S1 ^operator O2195 = 0.3907835285947055)
  29076. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  29077. -->
  29078. (S1 ^operator O2195 = 0.6092542666242702)
  29079. =>WM: (15447: S1 ^operator O2198 +)
  29080. =>WM: (15446: S1 ^operator O2197 +)
  29081. =>WM: (15445: O2198 ^name predict-no)
  29082. =>WM: (15444: O2197 ^name predict-yes)
  29083. =>WM: (15443: R1102 ^value 1)
  29084. =>WM: (15442: R1 ^reward R1102)
  29085. =>WM: (15441: I3 ^see 1)
  29086. <=WM: (15432: S1 ^operator O2195 +)
  29087. <=WM: (15434: S1 ^operator O2195)
  29088. <=WM: (15433: S1 ^operator O2196 +)
  29089. <=WM: (15427: R1 ^reward R1101)
  29090. <=WM: (15413: I3 ^see 0)
  29091. <=WM: (15430: O2196 ^name predict-no)
  29092. <=WM: (15429: O2195 ^name predict-yes)
  29093. <=WM: (15428: R1101 ^value 1)
  29094. --- Inner Elaboration Phase, active level 1 (S1) ---
  29095. Firing prefer*rvt*predict-yes*H0
  29096. -->
  29097. Firing rl*prefer*rvt*predict-yes*H0*3
  29098. -->
  29099. (S1 ^operator O2197 = 0.3907835285947055)
  29100. Firing prefer*rvt*predict-yes*H0*3*H1
  29101. -->
  29102. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  29103. -->
  29104. (S1 ^operator O2197 = -0.2062723012911647)
  29105. Firing prefer*rvt*predict-no*H0
  29106. -->
  29107. Firing rl*prefer*rvt*predict-no*H0*4
  29108. -->
  29109. (S1 ^operator O2198 = 0.3145105217381143)
  29110. Firing prefer*rvt*predict-no*H0*4*H1
  29111. -->
  29112. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  29113. -->
  29114. (S1 ^operator O2198 = 0.685514319964578)
  29115. inner elaboration loop at bottom goal.
  29116. Retracting rl*prefer*rvt*predict-no*H0*4
  29117. -->
  29118. (S1 ^operator O2196 = 0.3145105217381143)
  29119. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  29120. -->
  29121. (S1 ^operator O2196 = 0.685514319964578)
  29122. Retracting rl*prefer*rvt*predict-yes*H0*3
  29123. -->
  29124. (S1 ^operator O2195 = 0.3907835285947055)
  29125. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  29126. -->
  29127. (S1 ^operator O2195 = -0.2062723012911647)
  29128. --- END Proposal Phase ---
  29129. --- Decision Phase ---
  29130. RL update rl*prefer*rvt*predict-yes*H0*3 0.472329 -0.0815451 0.390784 -> 0.472326 -0.0815455 0.39078(R,m,v=1,0.95,0.0477654)
  29131. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527704 0.0815506 0.609254 -> 0.527701 0.0815501 0.609251(R,m,v=1,1,0)
  29132. =>WM: (15448: S1 ^operator O2198)
  29133. 1099: O: O2198 (predict-no)
  29134. --- END Decision Phase ---
  29135. --- Application Phase ---
  29136. --- Firing Productions (PE) For State At Depth 1 ---
  29137. --- Inner Elaboration Phase, active level 1 (S1) ---
  29138. Firing apply*operator
  29139. -->
  29140. (I3 ^predict-no N1099 + :O )
  29141. Firing apply*operator*complete
  29142. -->
  29143. (I3 ^predict-yes N1098 - :O )
  29144. inner elaboration loop at bottom goal.
  29145. --- Change Working Memory (PE) ---
  29146. =>WM: (15449: I3 ^predict-no N1099)
  29147. <=WM: (15436: N1098 ^status complete)
  29148. <=WM: (15435: I3 ^predict-yes N1098)
  29149. --- Firing Productions (IE) For State At Depth 1 ---
  29150. --- Inner Elaboration Phase, active level 1 (S1) ---
  29151. Firing monitor*world
  29152. -->
  29153. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29154. --- Change Working Memory (IE) ---
  29155. --- END Application Phase ---
  29156. --- Output Phase ---
  29157. ENV: Agent did: predict-no for direction L in state State-A
  29158. In State-A moving L
  29159. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29160. predict error 0
  29161. dir: dir isU
  29162. --- END Output Phase ---
  29163. |\---- Input Phase ---
  29164. =>WM: (15453: I2 ^dir U)
  29165. =>WM: (15452: I2 ^reward 1)
  29166. =>WM: (15451: I2 ^see 0)
  29167. =>WM: (15450: N1099 ^status complete)
  29168. <=WM: (15439: I2 ^dir L)
  29169. <=WM: (15438: I2 ^reward 1)
  29170. <=WM: (15437: I2 ^see 1)
  29171. =>WM: (15454: I2 ^level-1 L0-root)
  29172. <=WM: (15440: I2 ^level-1 L1-root)
  29173. --- END Input Phase ---
  29174. --- Proposal Phase ---
  29175. --- Inner Elaboration Phase, active level 1 (S1) ---
  29176. Firing elaborate*copy-see-to-output-link
  29177. -->
  29178. (I3 ^see 0 +)
  29179. Firing elaborate*reward*based*on*reward
  29180. -->
  29181. (R1103 ^value 1 +)
  29182. (R1 ^reward R1103 +)
  29183. Firing propose*predict-yes
  29184. -->
  29185. (O2199 ^name predict-yes +)
  29186. (S1 ^operator O2199 +)
  29187. Firing propose*predict-no
  29188. -->
  29189. (O2200 ^name predict-no +)
  29190. (S1 ^operator O2200 +)
  29191. Firing rl*prefer*rvt*predict-no*H0*2
  29192. -->
  29193. (S1 ^operator O2198 = 1.)
  29194. Firing rl*prefer*rvt*predict-yes*H0*1
  29195. -->
  29196. (S1 ^operator O2197 = 0.)
  29197. Firing prefer*rvt*predict-yes*H0
  29198. -->
  29199. Firing prefer*rvt*predict-no*H0
  29200. -->
  29201. Firing elaborate*copy-dir-to-output-link
  29202. -->
  29203. (I3 ^dir U +)
  29204. inner elaboration loop at bottom goal.
  29205. Retracting elaborate*copy-see-to-output-link
  29206. -->
  29207. (I3 ^see 1 +)
  29208. Retracting propose*predict-no
  29209. -->
  29210. (O2198 ^name predict-no +)
  29211. (S1 ^operator O2198 +)
  29212. Retracting propose*predict-yes
  29213. -->
  29214. (O2197 ^name predict-yes +)
  29215. (S1 ^operator O2197 +)
  29216. Retracting elaborate*reward*based*on*reward
  29217. -->
  29218. (R1102 ^value 1 +)
  29219. (R1 ^reward R1102 +)
  29220. Retracting elaborate*copy-dir-to-output-link
  29221. -->
  29222. (I3 ^dir L +)
  29223. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  29224. -->
  29225. (S1 ^operator O2198 = 0.685514319964578)
  29226. Retracting rl*prefer*rvt*predict-no*H0*4
  29227. -->
  29228. (S1 ^operator O2198 = 0.3145105217381143)
  29229. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  29230. -->
  29231. (S1 ^operator O2197 = -0.2062723012911647)
  29232. Retracting rl*prefer*rvt*predict-yes*H0*3
  29233. -->
  29234. (S1 ^operator O2197 = 0.3907804771267326)
  29235. =>WM: (15462: S1 ^operator O2200 +)
  29236. =>WM: (15461: S1 ^operator O2199 +)
  29237. =>WM: (15460: I3 ^dir U)
  29238. =>WM: (15459: O2200 ^name predict-no)
  29239. =>WM: (15458: O2199 ^name predict-yes)
  29240. =>WM: (15457: R1103 ^value 1)
  29241. =>WM: (15456: R1 ^reward R1103)
  29242. =>WM: (15455: I3 ^see 0)
  29243. <=WM: (15446: S1 ^operator O2197 +)
  29244. <=WM: (15447: S1 ^operator O2198 +)
  29245. <=WM: (15448: S1 ^operator O2198)
  29246. <=WM: (15431: I3 ^dir L)
  29247. <=WM: (15442: R1 ^reward R1102)
  29248. <=WM: (15441: I3 ^see 1)
  29249. <=WM: (15445: O2198 ^name predict-no)
  29250. <=WM: (15444: O2197 ^name predict-yes)
  29251. <=WM: (15443: R1102 ^value 1)
  29252. --- Inner Elaboration Phase, active level 1 (S1) ---
  29253. Firing prefer*rvt*predict-yes*H0
  29254. -->
  29255. Firing rl*prefer*rvt*predict-yes*H0*1
  29256. -->
  29257. (S1 ^operator O2199 = 0.)
  29258. Firing prefer*rvt*predict-no*H0
  29259. -->
  29260. Firing rl*prefer*rvt*predict-no*H0*2
  29261. -->
  29262. (S1 ^operator O2200 = 1.)
  29263. inner elaboration loop at bottom goal.
  29264. Retracting rl*prefer*rvt*predict-no*H0*2
  29265. -->
  29266. (S1 ^operator O2198 = 1.)
  29267. Retracting rl*prefer*rvt*predict-yes*H0*1
  29268. -->
  29269. (S1 ^operator O2197 = 0.)
  29270. --- END Proposal Phase ---
  29271. --- Decision Phase ---
  29272. RL update rl*prefer*rvt*predict-no*H0*4 0.478558 -0.164048 0.314511 -> 0.478556 -0.164048 0.314508(R,m,v=1,0.929412,0.0659937)
  29273. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521464 0.16405 0.685514 -> 0.521462 0.16405 0.685512(R,m,v=1,1,0)
  29274. =>WM: (15463: S1 ^operator O2200)
  29275. 1100: O: O2200 (predict-no)
  29276. --- END Decision Phase ---
  29277. --- Application Phase ---
  29278. --- Firing Productions (PE) For State At Depth 1 ---
  29279. --- Inner Elaboration Phase, active level 1 (S1) ---
  29280. Firing apply*operator
  29281. -->
  29282. (I3 ^predict-no N1100 + :O )
  29283. Firing apply*operator*complete
  29284. -->
  29285. (I3 ^predict-no N1099 - :O )
  29286. inner elaboration loop at bottom goal.
  29287. --- Change Working Memory (PE) ---
  29288. =>WM: (15464: I3 ^predict-no N1100)
  29289. <=WM: (15450: N1099 ^status complete)
  29290. <=WM: (15449: I3 ^predict-no N1099)
  29291. --- Firing Productions (IE) For State At Depth 1 ---
  29292. --- Inner Elaboration Phase, active level 1 (S1) ---
  29293. Firing monitor*world
  29294. -->
  29295. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29296. --- Change Working Memory (IE) ---
  29297. --- END Application Phase ---
  29298. --- Output Phase ---
  29299. ENV: Agent did: predict-no for direction U in state State-A
  29300. In State-A moving U
  29301. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29302. predict error 0
  29303. dir: dir isR
  29304. --- END Output Phase ---
  29305. /|\--- Input Phase ---
  29306. =>WM: (15468: I2 ^dir R)
  29307. =>WM: (15467: I2 ^reward 1)
  29308. =>WM: (15466: I2 ^see 0)
  29309. =>WM: (15465: N1100 ^status complete)
  29310. <=WM: (15453: I2 ^dir U)
  29311. <=WM: (15452: I2 ^reward 1)
  29312. <=WM: (15451: I2 ^see 0)
  29313. =>WM: (15469: I2 ^level-1 L0-root)
  29314. <=WM: (15454: I2 ^level-1 L0-root)
  29315. --- END Input Phase ---
  29316. --- Proposal Phase ---
  29317. --- Inner Elaboration Phase, active level 1 (S1) ---
  29318. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  29319. -->
  29320. (S1 ^operator O2199 = 0.878399454147804)
  29321. Firing prefer*rvt*predict-yes*H0*5*H1
  29322. -->
  29323. Firing elaborate*copy-see-to-output-link
  29324. -->
  29325. (I3 ^see 0 +)
  29326. Firing elaborate*reward*based*on*reward
  29327. -->
  29328. (R1104 ^value 1 +)
  29329. (R1 ^reward R1104 +)
  29330. Firing propose*predict-yes
  29331. -->
  29332. (O2201 ^name predict-yes +)
  29333. (S1 ^operator O2201 +)
  29334. Firing propose*predict-no
  29335. -->
  29336. (O2202 ^name predict-no +)
  29337. (S1 ^operator O2202 +)
  29338. Firing rl*prefer*rvt*predict-no*H0*6
  29339. -->
  29340. (S1 ^operator O2200 = 0.9436253760703815)
  29341. Firing rl*prefer*rvt*predict-yes*H0*5
  29342. -->
  29343. (S1 ^operator O2199 = 0.1215962264146522)
  29344. Firing prefer*rvt*predict-yes*H0
  29345. -->
  29346. Firing prefer*rvt*predict-no*H0
  29347. -->
  29348. Firing elaborate*copy-dir-to-output-link
  29349. -->
  29350. (I3 ^dir R +)
  29351. inner elaboration loop at bottom goal.
  29352. Retracting elaborate*copy-see-to-output-link
  29353. -->
  29354. (I3 ^see 0 +)
  29355. Retracting propose*predict-no
  29356. -->
  29357. (O2200 ^name predict-no +)
  29358. (S1 ^operator O2200 +)
  29359. Retracting propose*predict-yes
  29360. -->
  29361. (O2199 ^name predict-yes +)
  29362. (S1 ^operator O2199 +)
  29363. Retracting elaborate*reward*based*on*reward
  29364. -->
  29365. (R1103 ^value 1 +)
  29366. (R1 ^reward R1103 +)
  29367. Retracting elaborate*copy-dir-to-output-link
  29368. -->
  29369. (I3 ^dir U +)
  29370. Retracting rl*prefer*rvt*predict-no*H0*2
  29371. -->
  29372. (S1 ^operator O2200 = 1.)
  29373. Retracting rl*prefer*rvt*predict-yes*H0*1
  29374. -->
  29375. (S1 ^operator O2199 = 0.)
  29376. =>WM: (15476: S1 ^operator O2202 +)
  29377. =>WM: (15475: S1 ^operator O2201 +)
  29378. =>WM: (15474: I3 ^dir R)
  29379. =>WM: (15473: O2202 ^name predict-no)
  29380. =>WM: (15472: O2201 ^name predict-yes)
  29381. =>WM: (15471: R1104 ^value 1)
  29382. =>WM: (15470: R1 ^reward R1104)
  29383. <=WM: (15461: S1 ^operator O2199 +)
  29384. <=WM: (15462: S1 ^operator O2200 +)
  29385. <=WM: (15463: S1 ^operator O2200)
  29386. <=WM: (15460: I3 ^dir U)
  29387. <=WM: (15456: R1 ^reward R1103)
  29388. <=WM: (15459: O2200 ^name predict-no)
  29389. <=WM: (15458: O2199 ^name predict-yes)
  29390. <=WM: (15457: R1103 ^value 1)
  29391. --- Inner Elaboration Phase, active level 1 (S1) ---
  29392. Firing prefer*rvt*predict-yes*H0
  29393. -->
  29394. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  29395. -->
  29396. (S1 ^operator O2201 = 0.878399454147804)
  29397. Firing rl*prefer*rvt*predict-yes*H0*5
  29398. -->
  29399. (S1 ^operator O2201 = 0.1215962264146522)
  29400. Firing prefer*rvt*predict-yes*H0*5*H1
  29401. -->
  29402. Firing prefer*rvt*predict-no*H0
  29403. -->
  29404. Firing rl*prefer*rvt*predict-no*H0*6
  29405. -->
  29406. (S1 ^operator O2202 = 0.9436253760703815)
  29407. inner elaboration loop at bottom goal.
  29408. Retracting rl*prefer*rvt*predict-no*H0*6
  29409. -->
  29410. (S1 ^operator O2200 = 0.9436253760703815)
  29411. Retracting rl*prefer*rvt*predict-yes*H0*5
  29412. -->
  29413. (S1 ^operator O2199 = 0.1215962264146522)
  29414. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  29415. -->
  29416. (S1 ^operator O2199 = 0.878399454147804)
  29417. --- END Proposal Phase ---
  29418. --- Decision Phase ---
  29419. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  29420. =>WM: (15477: S1 ^operator O2201)
  29421. 1101: O: O2201 (predict-yes)
  29422. --- END Decision Phase ---
  29423. --- Application Phase ---
  29424. --- Firing Productions (PE) For State At Depth 1 ---
  29425. --- Inner Elaboration Phase, active level 1 (S1) ---
  29426. Firing apply*operator
  29427. -->
  29428. (I3 ^predict-yes N1101 + :O )
  29429. Firing apply*operator*complete
  29430. -->
  29431. (I3 ^predict-no N1100 - :O )
  29432. inner elaboration loop at bottom goal.
  29433. --- Change Working Memory (PE) ---
  29434. =>WM: (15478: I3 ^predict-yes N1101)
  29435. <=WM: (15465: N1100 ^status complete)
  29436. <=WM: (15464: I3 ^predict-no N1100)
  29437. --- Firing Productions (IE) For State At Depth 1 ---
  29438. --- Inner Elaboration Phase, active level 1 (S1) ---
  29439. Firing monitor*world
  29440. -->
  29441. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29442. --- Change Working Memory (IE) ---
  29443. --- END Application Phase ---
  29444. --- Output Phase ---
  29445. ENV: Agent did: predict-yes for direction R in state State-A
  29446. In State-A moving R
  29447. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  29448. predict error 0
  29449. dir: dir isL
  29450. --- END Output Phase ---
  29451. ---- Input Phase ---
  29452. =>WM: (15482: I2 ^dir L)
  29453. =>WM: (15481: I2 ^reward 1)
  29454. =>WM: (15480: I2 ^see 1)
  29455. =>WM: (15479: N1101 ^status complete)
  29456. <=WM: (15468: I2 ^dir R)
  29457. <=WM: (15467: I2 ^reward 1)
  29458. <=WM: (15466: I2 ^see 0)
  29459. =>WM: (15483: I2 ^level-1 R1-root)
  29460. <=WM: (15469: I2 ^level-1 L0-root)
  29461. --- END Input Phase ---
  29462. --- Proposal Phase ---
  29463. --- Inner Elaboration Phase, active level 1 (S1) ---
  29464. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  29465. -->
  29466. (S1 ^operator O2202 = -0.168718511744511)
  29467. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  29468. -->
  29469. (S1 ^operator O2201 = 0.6092507869249565)
  29470. Firing prefer*rvt*predict-no*H0*4*H1
  29471. -->
  29472. Firing prefer*rvt*predict-yes*H0*3*H1
  29473. -->
  29474. Firing elaborate*copy-see-to-output-link
  29475. -->
  29476. (I3 ^see 1 +)
  29477. Firing elaborate*reward*based*on*reward
  29478. -->
  29479. (R1105 ^value 1 +)
  29480. (R1 ^reward R1105 +)
  29481. Firing propose*predict-yes
  29482. -->
  29483. (O2203 ^name predict-yes +)
  29484. (S1 ^operator O2203 +)
  29485. Firing propose*predict-no
  29486. -->
  29487. (O2204 ^name predict-no +)
  29488. (S1 ^operator O2204 +)
  29489. Firing rl*prefer*rvt*predict-no*H0*4
  29490. -->
  29491. (S1 ^operator O2202 = 0.3145084974129228)
  29492. Firing rl*prefer*rvt*predict-yes*H0*3
  29493. -->
  29494. (S1 ^operator O2201 = 0.3907804771267326)
  29495. Firing prefer*rvt*predict-yes*H0
  29496. -->
  29497. Firing prefer*rvt*predict-no*H0
  29498. -->
  29499. Firing elaborate*copy-dir-to-output-link
  29500. -->
  29501. (I3 ^dir L +)
  29502. inner elaboration loop at bottom goal.
  29503. Retracting elaborate*copy-see-to-output-link
  29504. -->
  29505. (I3 ^see 0 +)
  29506. Retracting propose*predict-no
  29507. -->
  29508. (O2202 ^name predict-no +)
  29509. (S1 ^operator O2202 +)
  29510. Retracting propose*predict-yes
  29511. -->
  29512. (O2201 ^name predict-yes +)
  29513. (S1 ^operator O2201 +)
  29514. Retracting elaborate*reward*based*on*reward
  29515. -->
  29516. (R1104 ^value 1 +)
  29517. (R1 ^reward R1104 +)
  29518. Retracting elaborate*copy-dir-to-output-link
  29519. -->
  29520. (I3 ^dir R +)
  29521. Retracting rl*prefer*rvt*predict-no*H0*6
  29522. -->
  29523. (S1 ^operator O2202 = 0.9436253760703815)
  29524. Retracting rl*prefer*rvt*predict-yes*H0*5
  29525. -->
  29526. (S1 ^operator O2201 = 0.1215962264146522)
  29527. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  29528. -->
  29529. (S1 ^operator O2201 = 0.878399454147804)
  29530. =>WM: (15491: S1 ^operator O2204 +)
  29531. =>WM: (15490: S1 ^operator O2203 +)
  29532. =>WM: (15489: I3 ^dir L)
  29533. =>WM: (15488: O2204 ^name predict-no)
  29534. =>WM: (15487: O2203 ^name predict-yes)
  29535. =>WM: (15486: R1105 ^value 1)
  29536. =>WM: (15485: R1 ^reward R1105)
  29537. =>WM: (15484: I3 ^see 1)
  29538. <=WM: (15475: S1 ^operator O2201 +)
  29539. <=WM: (15477: S1 ^operator O2201)
  29540. <=WM: (15476: S1 ^operator O2202 +)
  29541. <=WM: (15474: I3 ^dir R)
  29542. <=WM: (15470: R1 ^reward R1104)
  29543. <=WM: (15455: I3 ^see 0)
  29544. <=WM: (15473: O2202 ^name predict-no)
  29545. <=WM: (15472: O2201 ^name predict-yes)
  29546. <=WM: (15471: R1104 ^value 1)
  29547. --- Inner Elaboration Phase, active level 1 (S1) ---
  29548. Firing prefer*rvt*predict-yes*H0
  29549. -->
  29550. Firing rl*prefer*rvt*predict-yes*H0*3
  29551. -->
  29552. (S1 ^operator O2203 = 0.3907804771267326)
  29553. Firing prefer*rvt*predict-yes*H0*3*H1
  29554. -->
  29555. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  29556. -->
  29557. (S1 ^operator O2203 = 0.6092507869249565)
  29558. Firing prefer*rvt*predict-no*H0
  29559. -->
  29560. Firing rl*prefer*rvt*predict-no*H0*4
  29561. -->
  29562. (S1 ^operator O2204 = 0.3145084974129228)
  29563. Firing prefer*rvt*predict-no*H0*4*H1
  29564. -->
  29565. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  29566. -->
  29567. (S1 ^operator O2204 = -0.168718511744511)
  29568. inner elaboration loop at bottom goal.
  29569. Retracting rl*prefer*rvt*predict-no*H0*4
  29570. -->
  29571. (S1 ^operator O2202 = 0.3145084974129228)
  29572. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  29573. -->
  29574. (S1 ^operator O2202 = -0.168718511744511)
  29575. Retracting rl*prefer*rvt*predict-yes*H0*3
  29576. -->
  29577. (S1 ^operator O2201 = 0.3907804771267326)
  29578. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  29579. -->
  29580. (S1 ^operator O2201 = 0.6092507869249565)
  29581. --- END Proposal Phase ---
  29582. --- Decision Phase ---
  29583. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121596 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.877551,0.108006)
  29584. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.878399 -> 0.465474 0.412926 0.8784(R,m,v=1,1,0)
  29585. =>WM: (15492: S1 ^operator O2203)
  29586. 1102: O: O2203 (predict-yes)
  29587. --- END Decision Phase ---
  29588. --- Application Phase ---
  29589. --- Firing Productions (PE) For State At Depth 1 ---
  29590. --- Inner Elaboration Phase, active level 1 (S1) ---
  29591. Firing apply*operator
  29592. -->
  29593. (I3 ^predict-yes N1102 + :O )
  29594. Firing apply*operator*complete
  29595. -->
  29596. (I3 ^predict-yes N1101 - :O )
  29597. inner elaboration loop at bottom goal.
  29598. --- Change Working Memory (PE) ---
  29599. =>WM: (15493: I3 ^predict-yes N1102)
  29600. <=WM: (15479: N1101 ^status complete)
  29601. <=WM: (15478: I3 ^predict-yes N1101)
  29602. --- Firing Productions (IE) For State At Depth 1 ---
  29603. --- Inner Elaboration Phase, active level 1 (S1) ---
  29604. Firing monitor*world
  29605. -->
  29606. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  29607. --- Change Working Memory (IE) ---
  29608. --- END Application Phase ---
  29609. --- Output Phase ---
  29610. ENV: Agent did: predict-yes for direction L in state State-B
  29611. In State-B moving L
  29612. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  29613. predict error 0
  29614. dir: dir isL
  29615. --- END Output Phase ---
  29616. /|--- Input Phase ---
  29617. =>WM: (15497: I2 ^dir L)
  29618. =>WM: (15496: I2 ^reward 1)
  29619. =>WM: (15495: I2 ^see 1)
  29620. =>WM: (15494: N1102 ^status complete)
  29621. <=WM: (15482: I2 ^dir L)
  29622. <=WM: (15481: I2 ^reward 1)
  29623. <=WM: (15480: I2 ^see 1)
  29624. =>WM: (15498: I2 ^level-1 L1-root)
  29625. <=WM: (15483: I2 ^level-1 R1-root)
  29626. --- END Input Phase ---
  29627. --- Proposal Phase ---
  29628. --- Inner Elaboration Phase, active level 1 (S1) ---
  29629. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  29630. -->
  29631. (S1 ^operator O2203 = -0.2062723012911647)
  29632. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  29633. -->
  29634. (S1 ^operator O2204 = 0.6855120328590087)
  29635. Firing prefer*rvt*predict-no*H0*4*H1
  29636. -->
  29637. Firing prefer*rvt*predict-yes*H0*3*H1
  29638. -->
  29639. Firing elaborate*copy-see-to-output-link
  29640. -->
  29641. (I3 ^see 1 +)
  29642. Firing elaborate*reward*based*on*reward
  29643. -->
  29644. (R1106 ^value 1 +)
  29645. (R1 ^reward R1106 +)
  29646. Firing propose*predict-yes
  29647. -->
  29648. (O2205 ^name predict-yes +)
  29649. (S1 ^operator O2205 +)
  29650. Firing propose*predict-no
  29651. -->
  29652. (O2206 ^name predict-no +)
  29653. (S1 ^operator O2206 +)
  29654. Firing rl*prefer*rvt*predict-no*H0*4
  29655. -->
  29656. (S1 ^operator O2204 = 0.3145084974129228)
  29657. Firing rl*prefer*rvt*predict-yes*H0*3
  29658. -->
  29659. (S1 ^operator O2203 = 0.3907804771267326)
  29660. Firing prefer*rvt*predict-yes*H0
  29661. -->
  29662. Firing prefer*rvt*predict-no*H0
  29663. -->
  29664. Firing elaborate*copy-dir-to-output-link
  29665. -->
  29666. (I3 ^dir L +)
  29667. inner elaboration loop at bottom goal.
  29668. Retracting elaborate*copy-see-to-output-link
  29669. -->
  29670. (I3 ^see 1 +)
  29671. Retracting propose*predict-no
  29672. -->
  29673. (O2204 ^name predict-no +)
  29674. (S1 ^operator O2204 +)
  29675. Retracting propose*predict-yes
  29676. -->
  29677. (O2203 ^name predict-yes +)
  29678. (S1 ^operator O2203 +)
  29679. Retracting elaborate*reward*based*on*reward
  29680. -->
  29681. (R1105 ^value 1 +)
  29682. (R1 ^reward R1105 +)
  29683. Retracting elaborate*copy-dir-to-output-link
  29684. -->
  29685. (I3 ^dir L +)
  29686. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  29687. -->
  29688. (S1 ^operator O2204 = -0.168718511744511)
  29689. Retracting rl*prefer*rvt*predict-no*H0*4
  29690. -->
  29691. (S1 ^operator O2204 = 0.3145084974129228)
  29692. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  29693. -->
  29694. (S1 ^operator O2203 = 0.6092507869249565)
  29695. Retracting rl*prefer*rvt*predict-yes*H0*3
  29696. -->
  29697. (S1 ^operator O2203 = 0.3907804771267326)
  29698. =>WM: (15504: S1 ^operator O2206 +)
  29699. =>WM: (15503: S1 ^operator O2205 +)
  29700. =>WM: (15502: O2206 ^name predict-no)
  29701. =>WM: (15501: O2205 ^name predict-yes)
  29702. =>WM: (15500: R1106 ^value 1)
  29703. =>WM: (15499: R1 ^reward R1106)
  29704. <=WM: (15490: S1 ^operator O2203 +)
  29705. <=WM: (15492: S1 ^operator O2203)
  29706. <=WM: (15491: S1 ^operator O2204 +)
  29707. <=WM: (15485: R1 ^reward R1105)
  29708. <=WM: (15488: O2204 ^name predict-no)
  29709. <=WM: (15487: O2203 ^name predict-yes)
  29710. <=WM: (15486: R1105 ^value 1)
  29711. --- Inner Elaboration Phase, active level 1 (S1) ---
  29712. Firing prefer*rvt*predict-yes*H0
  29713. -->
  29714. Firing rl*prefer*rvt*predict-yes*H0*3
  29715. -->
  29716. (S1 ^operator O2205 = 0.3907804771267326)
  29717. Firing prefer*rvt*predict-yes*H0*3*H1
  29718. -->
  29719. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  29720. -->
  29721. (S1 ^operator O2205 = -0.2062723012911647)
  29722. Firing prefer*rvt*predict-no*H0
  29723. -->
  29724. Firing rl*prefer*rvt*predict-no*H0*4
  29725. -->
  29726. (S1 ^operator O2206 = 0.3145084974129228)
  29727. Firing prefer*rvt*predict-no*H0*4*H1
  29728. -->
  29729. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  29730. -->
  29731. (S1 ^operator O2206 = 0.6855120328590087)
  29732. inner elaboration loop at bottom goal.
  29733. Retracting rl*prefer*rvt*predict-no*H0*4
  29734. -->
  29735. (S1 ^operator O2204 = 0.3145084974129228)
  29736. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  29737. -->
  29738. (S1 ^operator O2204 = 0.6855120328590087)
  29739. Retracting rl*prefer*rvt*predict-yes*H0*3
  29740. -->
  29741. (S1 ^operator O2203 = 0.3907804771267326)
  29742. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  29743. -->
  29744. (S1 ^operator O2203 = -0.2062723012911647)
  29745. --- END Proposal Phase ---
  29746. --- Decision Phase ---
  29747. RL update rl*prefer*rvt*predict-yes*H0*3 0.472326 -0.0815455 0.39078 -> 0.472324 -0.0815459 0.390778(R,m,v=1,0.950276,0.0475138)
  29748. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527701 0.0815501 0.609251 -> 0.527698 0.0815497 0.609248(R,m,v=1,1,0)
  29749. =>WM: (15505: S1 ^operator O2206)
  29750. 1103: O: O2206 (predict-no)
  29751. --- END Decision Phase ---
  29752. --- Application Phase ---
  29753. --- Firing Productions (PE) For State At Depth 1 ---
  29754. --- Inner Elaboration Phase, active level 1 (S1) ---
  29755. Firing apply*operator
  29756. -->
  29757. (I3 ^predict-no N1103 + :O )
  29758. Firing apply*operator*complete
  29759. -->
  29760. (I3 ^predict-yes N1102 - :O )
  29761. inner elaboration loop at bottom goal.
  29762. --- Change Working Memory (PE) ---
  29763. =>WM: (15506: I3 ^predict-no N1103)
  29764. <=WM: (15494: N1102 ^status complete)
  29765. <=WM: (15493: I3 ^predict-yes N1102)
  29766. --- Firing Productions (IE) For State At Depth 1 ---
  29767. --- Inner Elaboration Phase, active level 1 (S1) ---
  29768. Firing monitor*world
  29769. -->
  29770. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29771. --- Change Working Memory (IE) ---
  29772. --- END Application Phase ---
  29773. --- Output Phase ---
  29774. ENV: Agent did: predict-no for direction L in state State-A
  29775. In State-A moving L
  29776. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29777. predict error 0
  29778. dir: dir isL
  29779. --- END Output Phase ---
  29780. \-/--- Input Phase ---
  29781. =>WM: (15510: I2 ^dir L)
  29782. =>WM: (15509: I2 ^reward 1)
  29783. =>WM: (15508: I2 ^see 0)
  29784. =>WM: (15507: N1103 ^status complete)
  29785. <=WM: (15497: I2 ^dir L)
  29786. <=WM: (15496: I2 ^reward 1)
  29787. <=WM: (15495: I2 ^see 1)
  29788. =>WM: (15511: I2 ^level-1 L0-root)
  29789. <=WM: (15498: I2 ^level-1 L1-root)
  29790. --- END Input Phase ---
  29791. --- Proposal Phase ---
  29792. --- Inner Elaboration Phase, active level 1 (S1) ---
  29793. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  29794. -->
  29795. (S1 ^operator O2205 = -0.208713043145708)
  29796. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  29797. -->
  29798. (S1 ^operator O2206 = 0.6854588867079627)
  29799. Firing prefer*rvt*predict-no*H0*4*H1
  29800. -->
  29801. Firing prefer*rvt*predict-yes*H0*3*H1
  29802. -->
  29803. Firing elaborate*copy-see-to-output-link
  29804. -->
  29805. (I3 ^see 0 +)
  29806. Firing elaborate*reward*based*on*reward
  29807. -->
  29808. (R1107 ^value 1 +)
  29809. (R1 ^reward R1107 +)
  29810. Firing propose*predict-yes
  29811. -->
  29812. (O2207 ^name predict-yes +)
  29813. (S1 ^operator O2207 +)
  29814. Firing propose*predict-no
  29815. -->
  29816. (O2208 ^name predict-no +)
  29817. (S1 ^operator O2208 +)
  29818. Firing rl*prefer*rvt*predict-no*H0*4
  29819. -->
  29820. (S1 ^operator O2206 = 0.3145084974129228)
  29821. Firing rl*prefer*rvt*predict-yes*H0*3
  29822. -->
  29823. (S1 ^operator O2205 = 0.3907779552208955)
  29824. Firing prefer*rvt*predict-yes*H0
  29825. -->
  29826. Firing prefer*rvt*predict-no*H0
  29827. -->
  29828. Firing elaborate*copy-dir-to-output-link
  29829. -->
  29830. (I3 ^dir L +)
  29831. inner elaboration loop at bottom goal.
  29832. Retracting elaborate*copy-see-to-output-link
  29833. -->
  29834. (I3 ^see 1 +)
  29835. Retracting propose*predict-no
  29836. -->
  29837. (O2206 ^name predict-no +)
  29838. (S1 ^operator O2206 +)
  29839. Retracting propose*predict-yes
  29840. -->
  29841. (O2205 ^name predict-yes +)
  29842. (S1 ^operator O2205 +)
  29843. Retracting elaborate*reward*based*on*reward
  29844. -->
  29845. (R1106 ^value 1 +)
  29846. (R1 ^reward R1106 +)
  29847. Retracting elaborate*copy-dir-to-output-link
  29848. -->
  29849. (I3 ^dir L +)
  29850. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  29851. -->
  29852. (S1 ^operator O2206 = 0.6855120328590087)
  29853. Retracting rl*prefer*rvt*predict-no*H0*4
  29854. -->
  29855. (S1 ^operator O2206 = 0.3145084974129228)
  29856. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  29857. -->
  29858. (S1 ^operator O2205 = -0.2062723012911647)
  29859. Retracting rl*prefer*rvt*predict-yes*H0*3
  29860. -->
  29861. (S1 ^operator O2205 = 0.3907779552208955)
  29862. =>WM: (15518: S1 ^operator O2208 +)
  29863. =>WM: (15517: S1 ^operator O2207 +)
  29864. =>WM: (15516: O2208 ^name predict-no)
  29865. =>WM: (15515: O2207 ^name predict-yes)
  29866. =>WM: (15514: R1107 ^value 1)
  29867. =>WM: (15513: R1 ^reward R1107)
  29868. =>WM: (15512: I3 ^see 0)
  29869. <=WM: (15503: S1 ^operator O2205 +)
  29870. <=WM: (15504: S1 ^operator O2206 +)
  29871. <=WM: (15505: S1 ^operator O2206)
  29872. <=WM: (15499: R1 ^reward R1106)
  29873. <=WM: (15484: I3 ^see 1)
  29874. <=WM: (15502: O2206 ^name predict-no)
  29875. <=WM: (15501: O2205 ^name predict-yes)
  29876. <=WM: (15500: R1106 ^value 1)
  29877. --- Inner Elaboration Phase, active level 1 (S1) ---
  29878. Firing prefer*rvt*predict-yes*H0
  29879. -->
  29880. Firing rl*prefer*rvt*predict-yes*H0*3
  29881. -->
  29882. (S1 ^operator O2207 = 0.3907779552208955)
  29883. Firing prefer*rvt*predict-yes*H0*3*H1
  29884. -->
  29885. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  29886. -->
  29887. (S1 ^operator O2207 = -0.208713043145708)
  29888. Firing prefer*rvt*predict-no*H0
  29889. -->
  29890. Firing rl*prefer*rvt*predict-no*H0*4
  29891. -->
  29892. (S1 ^operator O2208 = 0.3145084974129228)
  29893. Firing prefer*rvt*predict-no*H0*4*H1
  29894. -->
  29895. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  29896. -->
  29897. (S1 ^operator O2208 = 0.6854588867079627)
  29898. inner elaboration loop at bottom goal.
  29899. Retracting rl*prefer*rvt*predict-no*H0*4
  29900. -->
  29901. (S1 ^operator O2206 = 0.3145084974129228)
  29902. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  29903. -->
  29904. (S1 ^operator O2206 = 0.6854588867079627)
  29905. Retracting rl*prefer*rvt*predict-yes*H0*3
  29906. -->
  29907. (S1 ^operator O2205 = 0.3907779552208955)
  29908. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  29909. -->
  29910. (S1 ^operator O2205 = -0.208713043145708)
  29911. --- END Proposal Phase ---
  29912. --- Decision Phase ---
  29913. RL update rl*prefer*rvt*predict-no*H0*4 0.478556 -0.164048 0.314508 -> 0.478555 -0.164048 0.314507(R,m,v=1,0.929825,0.0656347)
  29914. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521462 0.16405 0.685512 -> 0.521461 0.16405 0.68551(R,m,v=1,1,0)
  29915. =>WM: (15519: S1 ^operator O2208)
  29916. 1104: O: O2208 (predict-no)
  29917. --- END Decision Phase ---
  29918. --- Application Phase ---
  29919. --- Firing Productions (PE) For State At Depth 1 ---
  29920. --- Inner Elaboration Phase, active level 1 (S1) ---
  29921. Firing apply*operator
  29922. -->
  29923. (I3 ^predict-no N1104 + :O )
  29924. Firing apply*operator*complete
  29925. -->
  29926. (I3 ^predict-no N1103 - :O )
  29927. inner elaboration loop at bottom goal.
  29928. --- Change Working Memory (PE) ---
  29929. =>WM: (15520: I3 ^predict-no N1104)
  29930. <=WM: (15507: N1103 ^status complete)
  29931. <=WM: (15506: I3 ^predict-no N1103)
  29932. --- Firing Productions (IE) For State At Depth 1 ---
  29933. --- Inner Elaboration Phase, active level 1 (S1) ---
  29934. Firing monitor*world
  29935. -->
  29936. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  29937. --- Change Working Memory (IE) ---
  29938. --- END Application Phase ---
  29939. --- Output Phase ---
  29940. ENV: Agent did: predict-no for direction L in state State-A
  29941. In State-A moving L
  29942. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  29943. predict error 0
  29944. dir: dir isL
  29945. --- END Output Phase ---
  29946. |\---- Input Phase ---
  29947. =>WM: (15524: I2 ^dir L)
  29948. =>WM: (15523: I2 ^reward 1)
  29949. =>WM: (15522: I2 ^see 0)
  29950. =>WM: (15521: N1104 ^status complete)
  29951. <=WM: (15510: I2 ^dir L)
  29952. <=WM: (15509: I2 ^reward 1)
  29953. <=WM: (15508: I2 ^see 0)
  29954. =>WM: (15525: I2 ^level-1 L0-root)
  29955. <=WM: (15511: I2 ^level-1 L0-root)
  29956. --- END Input Phase ---
  29957. --- Proposal Phase ---
  29958. --- Inner Elaboration Phase, active level 1 (S1) ---
  29959. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  29960. -->
  29961. (S1 ^operator O2207 = -0.208713043145708)
  29962. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  29963. -->
  29964. (S1 ^operator O2208 = 0.6854588867079627)
  29965. Firing prefer*rvt*predict-no*H0*4*H1
  29966. -->
  29967. Firing prefer*rvt*predict-yes*H0*3*H1
  29968. -->
  29969. Firing elaborate*copy-see-to-output-link
  29970. -->
  29971. (I3 ^see 0 +)
  29972. Firing elaborate*reward*based*on*reward
  29973. -->
  29974. (R1108 ^value 1 +)
  29975. (R1 ^reward R1108 +)
  29976. Firing propose*predict-yes
  29977. -->
  29978. (O2209 ^name predict-yes +)
  29979. (S1 ^operator O2209 +)
  29980. Firing propose*predict-no
  29981. -->
  29982. (O2210 ^name predict-no +)
  29983. (S1 ^operator O2210 +)
  29984. Firing rl*prefer*rvt*predict-no*H0*4
  29985. -->
  29986. (S1 ^operator O2208 = 0.3145068260195175)
  29987. Firing rl*prefer*rvt*predict-yes*H0*3
  29988. -->
  29989. (S1 ^operator O2207 = 0.3907779552208955)
  29990. Firing prefer*rvt*predict-yes*H0
  29991. -->
  29992. Firing prefer*rvt*predict-no*H0
  29993. -->
  29994. Firing elaborate*copy-dir-to-output-link
  29995. -->
  29996. (I3 ^dir L +)
  29997. inner elaboration loop at bottom goal.
  29998. Retracting elaborate*copy-see-to-output-link
  29999. -->
  30000. (I3 ^see 0 +)
  30001. Retracting propose*predict-no
  30002. -->
  30003. (O2208 ^name predict-no +)
  30004. (S1 ^operator O2208 +)
  30005. Retracting propose*predict-yes
  30006. -->
  30007. (O2207 ^name predict-yes +)
  30008. (S1 ^operator O2207 +)
  30009. Retracting elaborate*reward*based*on*reward
  30010. -->
  30011. (R1107 ^value 1 +)
  30012. (R1 ^reward R1107 +)
  30013. Retracting elaborate*copy-dir-to-output-link
  30014. -->
  30015. (I3 ^dir L +)
  30016. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30017. -->
  30018. (S1 ^operator O2208 = 0.6854588867079627)
  30019. Retracting rl*prefer*rvt*predict-no*H0*4
  30020. -->
  30021. (S1 ^operator O2208 = 0.3145068260195175)
  30022. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30023. -->
  30024. (S1 ^operator O2207 = -0.208713043145708)
  30025. Retracting rl*prefer*rvt*predict-yes*H0*3
  30026. -->
  30027. (S1 ^operator O2207 = 0.3907779552208955)
  30028. =>WM: (15531: S1 ^operator O2210 +)
  30029. =>WM: (15530: S1 ^operator O2209 +)
  30030. =>WM: (15529: O2210 ^name predict-no)
  30031. =>WM: (15528: O2209 ^name predict-yes)
  30032. =>WM: (15527: R1108 ^value 1)
  30033. =>WM: (15526: R1 ^reward R1108)
  30034. <=WM: (15517: S1 ^operator O2207 +)
  30035. <=WM: (15518: S1 ^operator O2208 +)
  30036. <=WM: (15519: S1 ^operator O2208)
  30037. <=WM: (15513: R1 ^reward R1107)
  30038. <=WM: (15516: O2208 ^name predict-no)
  30039. <=WM: (15515: O2207 ^name predict-yes)
  30040. <=WM: (15514: R1107 ^value 1)
  30041. --- Inner Elaboration Phase, active level 1 (S1) ---
  30042. Firing prefer*rvt*predict-yes*H0
  30043. -->
  30044. Firing rl*prefer*rvt*predict-yes*H0*3
  30045. -->
  30046. (S1 ^operator O2209 = 0.3907779552208955)
  30047. Firing prefer*rvt*predict-yes*H0*3*H1
  30048. -->
  30049. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  30050. -->
  30051. (S1 ^operator O2209 = -0.208713043145708)
  30052. Firing prefer*rvt*predict-no*H0
  30053. -->
  30054. Firing rl*prefer*rvt*predict-no*H0*4
  30055. -->
  30056. (S1 ^operator O2210 = 0.3145068260195175)
  30057. Firing prefer*rvt*predict-no*H0*4*H1
  30058. -->
  30059. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  30060. -->
  30061. (S1 ^operator O2210 = 0.6854588867079627)
  30062. inner elaboration loop at bottom goal.
  30063. Retracting rl*prefer*rvt*predict-no*H0*4
  30064. -->
  30065. (S1 ^operator O2208 = 0.3145068260195175)
  30066. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30067. -->
  30068. (S1 ^operator O2208 = 0.6854588867079627)
  30069. Retracting rl*prefer*rvt*predict-yes*H0*3
  30070. -->
  30071. (S1 ^operator O2207 = 0.3907779552208955)
  30072. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30073. -->
  30074. (S1 ^operator O2207 = -0.208713043145708)
  30075. --- END Proposal Phase ---
  30076. --- Decision Phase ---
  30077. RL update rl*prefer*rvt*predict-no*H0*4 0.478555 -0.164048 0.314507 -> 0.478557 -0.164048 0.31451(R,m,v=1,0.930233,0.0652795)
  30078. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521414 0.164045 0.685459 -> 0.521417 0.164045 0.685462(R,m,v=1,1,0)
  30079. =>WM: (15532: S1 ^operator O2210)
  30080. 1105: O: O2210 (predict-no)
  30081. --- END Decision Phase ---
  30082. --- Application Phase ---
  30083. --- Firing Productions (PE) For State At Depth 1 ---
  30084. --- Inner Elaboration Phase, active level 1 (S1) ---
  30085. Firing apply*operator
  30086. -->
  30087. (I3 ^predict-no N1105 + :O )
  30088. Firing apply*operator*complete
  30089. -->
  30090. (I3 ^predict-no N1104 - :O )
  30091. inner elaboration loop at bottom goal.
  30092. --- Change Working Memory (PE) ---
  30093. =>WM: (15533: I3 ^predict-no N1105)
  30094. <=WM: (15521: N1104 ^status complete)
  30095. <=WM: (15520: I3 ^predict-no N1104)
  30096. --- Firing Productions (IE) For State At Depth 1 ---
  30097. --- Inner Elaboration Phase, active level 1 (S1) ---
  30098. Firing monitor*world
  30099. -->
  30100. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30101. --- Change Working Memory (IE) ---
  30102. --- END Application Phase ---
  30103. --- Output Phase ---
  30104. ENV: Agent did: predict-no for direction L in state State-A
  30105. In State-A moving L
  30106. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30107. predict error 0
  30108. dir: dir isL
  30109. --- END Output Phase ---
  30110. --- Input Phase ---
  30111. =>WM: (15537: I2 ^dir L)
  30112. =>WM: (15536: I2 ^reward 1)
  30113. =>WM: (15535: I2 ^see 0)
  30114. =>WM: (15534: N1105 ^status complete)
  30115. <=WM: (15524: I2 ^dir L)
  30116. <=WM: (15523: I2 ^reward 1)
  30117. <=WM: (15522: I2 ^see 0)
  30118. =>WM: (15538: I2 ^level-1 L0-root)
  30119. <=WM: (15525: I2 ^level-1 L0-root)
  30120. --- END Input Phase ---
  30121. --- Proposal Phase ---
  30122. --- Inner Elaboration Phase, active level 1 (S1) ---
  30123. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  30124. -->
  30125. (S1 ^operator O2209 = -0.208713043145708)
  30126. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  30127. -->
  30128. (S1 ^operator O2210 = 0.6854621356602126)
  30129. Firing prefer*rvt*predict-no*H0*4*H1
  30130. -->
  30131. Firing prefer*rvt*predict-yes*H0*3*H1
  30132. -->
  30133. Firing elaborate*copy-see-to-output-link
  30134. -->
  30135. (I3 ^see 0 +)
  30136. Firing elaborate*reward*based*on*reward
  30137. -->
  30138. (R1109 ^value 1 +)
  30139. (R1 ^reward R1109 +)
  30140. Firing propose*predict-yes
  30141. -->
  30142. (O2211 ^name predict-yes +)
  30143. (S1 ^operator O2211 +)
  30144. Firing propose*predict-no
  30145. -->
  30146. (O2212 ^name predict-no +)
  30147. (S1 ^operator O2212 +)
  30148. Firing rl*prefer*rvt*predict-no*H0*4
  30149. -->
  30150. (S1 ^operator O2210 = 0.3145096147387795)
  30151. Firing rl*prefer*rvt*predict-yes*H0*3
  30152. -->
  30153. (S1 ^operator O2209 = 0.3907779552208955)
  30154. Firing prefer*rvt*predict-yes*H0
  30155. -->
  30156. Firing prefer*rvt*predict-no*H0
  30157. -->
  30158. Firing elaborate*copy-dir-to-output-link
  30159. -->
  30160. (I3 ^dir L +)
  30161. inner elaboration loop at bottom goal.
  30162. Retracting elaborate*copy-see-to-output-link
  30163. -->
  30164. (I3 ^see 0 +)
  30165. Retracting propose*predict-no
  30166. -->
  30167. (O2210 ^name predict-no +)
  30168. (S1 ^operator O2210 +)
  30169. Retracting propose*predict-yes
  30170. -->
  30171. (O2209 ^name predict-yes +)
  30172. (S1 ^operator O2209 +)
  30173. Retracting elaborate*reward*based*on*reward
  30174. -->
  30175. (R1108 ^value 1 +)
  30176. (R1 ^reward R1108 +)
  30177. Retracting elaborate*copy-dir-to-output-link
  30178. -->
  30179. (I3 ^dir L +)
  30180. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30181. -->
  30182. (S1 ^operator O2210 = 0.6854621356602126)
  30183. Retracting rl*prefer*rvt*predict-no*H0*4
  30184. -->
  30185. (S1 ^operator O2210 = 0.3145096147387795)
  30186. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30187. -->
  30188. (S1 ^operator O2209 = -0.208713043145708)
  30189. Retracting rl*prefer*rvt*predict-yes*H0*3
  30190. -->
  30191. (S1 ^operator O2209 = 0.3907779552208955)
  30192. =>WM: (15544: S1 ^operator O2212 +)
  30193. =>WM: (15543: S1 ^operator O2211 +)
  30194. =>WM: (15542: O2212 ^name predict-no)
  30195. =>WM: (15541: O2211 ^name predict-yes)
  30196. =>WM: (15540: R1109 ^value 1)
  30197. =>WM: (15539: R1 ^reward R1109)
  30198. <=WM: (15530: S1 ^operator O2209 +)
  30199. <=WM: (15531: S1 ^operator O2210 +)
  30200. <=WM: (15532: S1 ^operator O2210)
  30201. <=WM: (15526: R1 ^reward R1108)
  30202. <=WM: (15529: O2210 ^name predict-no)
  30203. <=WM: (15528: O2209 ^name predict-yes)
  30204. <=WM: (15527: R1108 ^value 1)
  30205. --- Inner Elaboration Phase, active level 1 (S1) ---
  30206. Firing prefer*rvt*predict-yes*H0
  30207. -->
  30208. Firing rl*prefer*rvt*predict-yes*H0*3
  30209. -->
  30210. (S1 ^operator O2211 = 0.3907779552208955)
  30211. Firing prefer*rvt*predict-yes*H0*3*H1
  30212. -->
  30213. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  30214. -->
  30215. (S1 ^operator O2211 = -0.208713043145708)
  30216. Firing prefer*rvt*predict-no*H0
  30217. -->
  30218. Firing rl*prefer*rvt*predict-no*H0*4
  30219. -->
  30220. (S1 ^operator O2212 = 0.3145096147387795)
  30221. Firing prefer*rvt*predict-no*H0*4*H1
  30222. -->
  30223. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  30224. -->
  30225. (S1 ^operator O2212 = 0.6854621356602126)
  30226. inner elaboration loop at bottom goal.
  30227. Retracting rl*prefer*rvt*predict-no*H0*4
  30228. -->
  30229. (S1 ^operator O2210 = 0.3145096147387795)
  30230. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30231. -->
  30232. (S1 ^operator O2210 = 0.6854621356602126)
  30233. Retracting rl*prefer*rvt*predict-yes*H0*3
  30234. -->
  30235. (S1 ^operator O2209 = 0.3907779552208955)
  30236. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30237. -->
  30238. (S1 ^operator O2209 = -0.208713043145708)
  30239. --- END Proposal Phase ---
  30240. --- Decision Phase ---
  30241. RL update rl*prefer*rvt*predict-no*H0*4 0.478557 -0.164048 0.31451 -> 0.478559 -0.164048 0.314512(R,m,v=1,0.930636,0.0649281)
  30242. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521417 0.164045 0.685462 -> 0.521419 0.164045 0.685465(R,m,v=1,1,0)
  30243. =>WM: (15545: S1 ^operator O2212)
  30244. 1106: O: O2212 (predict-no)
  30245. --- END Decision Phase ---
  30246. --- Application Phase ---
  30247. --- Firing Productions (PE) For State At Depth 1 ---
  30248. --- Inner Elaboration Phase, active level 1 (S1) ---
  30249. Firing apply*operator
  30250. -->
  30251. (I3 ^predict-no N1106 + :O )
  30252. Firing apply*operator*complete
  30253. -->
  30254. (I3 ^predict-no N1105 - :O )
  30255. inner elaboration loop at bottom goal.
  30256. --- Change Working Memory (PE) ---
  30257. =>WM: (15546: I3 ^predict-no N1106)
  30258. <=WM: (15534: N1105 ^status complete)
  30259. <=WM: (15533: I3 ^predict-no N1105)
  30260. --- Firing Productions (IE) For State At Depth 1 ---
  30261. --- Inner Elaboration Phase, active level 1 (S1) ---
  30262. Firing monitor*world
  30263. -->
  30264. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30265. --- Change Working Memory (IE) ---
  30266. --- END Application Phase ---
  30267. --- Output Phase ---
  30268. ENV: Agent did: predict-no for direction L in state State-A
  30269. In State-A moving L
  30270. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30271. predict error 0
  30272. dir: dir isL
  30273. --- END Output Phase ---
  30274. --- Input Phase ---
  30275. =>WM: (15550: I2 ^dir L)
  30276. =>WM: (15549: I2 ^reward 1)
  30277. =>WM: (15548: I2 ^see 0)
  30278. =>WM: (15547: N1106 ^status complete)
  30279. <=WM: (15537: I2 ^dir L)
  30280. <=WM: (15536: I2 ^reward 1)
  30281. <=WM: (15535: I2 ^see 0)
  30282. =>WM: (15551: I2 ^level-1 L0-root)
  30283. <=WM: (15538: I2 ^level-1 L0-root)
  30284. --- END Input Phase ---
  30285. --- Proposal Phase ---
  30286. --- Inner Elaboration Phase, active level 1 (S1) ---
  30287. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  30288. -->
  30289. (S1 ^operator O2211 = -0.208713043145708)
  30290. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  30291. -->
  30292. (S1 ^operator O2212 = 0.685464805522946)
  30293. Firing prefer*rvt*predict-no*H0*4*H1
  30294. -->
  30295. Firing prefer*rvt*predict-yes*H0*3*H1
  30296. -->
  30297. Firing elaborate*copy-see-to-output-link
  30298. -->
  30299. (I3 ^see 0 +)
  30300. Firing elaborate*reward*based*on*reward
  30301. -->
  30302. (R1110 ^value 1 +)
  30303. (R1 ^reward R1110 +)
  30304. Firing propose*predict-yes
  30305. -->
  30306. (O2213 ^name predict-yes +)
  30307. (S1 ^operator O2213 +)
  30308. Firing propose*predict-no
  30309. -->
  30310. (O2214 ^name predict-no +)
  30311. (S1 ^operator O2214 +)
  30312. Firing rl*prefer*rvt*predict-no*H0*4
  30313. -->
  30314. (S1 ^operator O2212 = 0.3145119102257212)
  30315. Firing rl*prefer*rvt*predict-yes*H0*3
  30316. -->
  30317. (S1 ^operator O2211 = 0.3907779552208955)
  30318. Firing prefer*rvt*predict-yes*H0
  30319. -->
  30320. Firing prefer*rvt*predict-no*H0
  30321. -->
  30322. Firing elaborate*copy-dir-to-output-link
  30323. -->
  30324. (I3 ^dir L +)
  30325. inner elaboration loop at bottom goal.
  30326. Retracting elaborate*copy-see-to-output-link
  30327. -->
  30328. (I3 ^see 0 +)
  30329. Retracting propose*predict-no
  30330. -->
  30331. (O2212 ^name predict-no +)
  30332. (S1 ^operator O2212 +)
  30333. Retracting propose*predict-yes
  30334. -->
  30335. (O2211 ^name predict-yes +)
  30336. (S1 ^operator O2211 +)
  30337. Retracting elaborate*reward*based*on*reward
  30338. -->
  30339. (R1109 ^value 1 +)
  30340. (R1 ^reward R1109 +)
  30341. Retracting elaborate*copy-dir-to-output-link
  30342. -->
  30343. (I3 ^dir L +)
  30344. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30345. -->
  30346. (S1 ^operator O2212 = 0.685464805522946)
  30347. Retracting rl*prefer*rvt*predict-no*H0*4
  30348. -->
  30349. (S1 ^operator O2212 = 0.3145119102257212)
  30350. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30351. -->
  30352. (S1 ^operator O2211 = -0.208713043145708)
  30353. Retracting rl*prefer*rvt*predict-yes*H0*3
  30354. -->
  30355. (S1 ^operator O2211 = 0.3907779552208955)
  30356. =>WM: (15557: S1 ^operator O2214 +)
  30357. =>WM: (15556: S1 ^operator O2213 +)
  30358. =>WM: (15555: O2214 ^name predict-no)
  30359. =>WM: (15554: O2213 ^name predict-yes)
  30360. =>WM: (15553: R1110 ^value 1)
  30361. =>WM: (15552: R1 ^reward R1110)
  30362. <=WM: (15543: S1 ^operator O2211 +)
  30363. <=WM: (15544: S1 ^operator O2212 +)
  30364. <=WM: (15545: S1 ^operator O2212)
  30365. <=WM: (15539: R1 ^reward R1109)
  30366. <=WM: (15542: O2212 ^name predict-no)
  30367. <=WM: (15541: O2211 ^name predict-yes)
  30368. <=WM: (15540: R1109 ^value 1)
  30369. --- Inner Elaboration Phase, active level 1 (S1) ---
  30370. Firing prefer*rvt*predict-yes*H0
  30371. -->
  30372. Firing rl*prefer*rvt*predict-yes*H0*3
  30373. -->
  30374. (S1 ^operator O2213 = 0.3907779552208955)
  30375. Firing prefer*rvt*predict-yes*H0*3*H1
  30376. -->
  30377. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  30378. -->
  30379. (S1 ^operator O2213 = -0.208713043145708)
  30380. Firing prefer*rvt*predict-no*H0
  30381. -->
  30382. Firing rl*prefer*rvt*predict-no*H0*4
  30383. -->
  30384. (S1 ^operator O2214 = 0.3145119102257212)
  30385. Firing prefer*rvt*predict-no*H0*4*H1
  30386. -->
  30387. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  30388. -->
  30389. (S1 ^operator O2214 = 0.685464805522946)
  30390. inner elaboration loop at bottom goal.
  30391. Retracting rl*prefer*rvt*predict-no*H0*4
  30392. -->
  30393. (S1 ^operator O2212 = 0.3145119102257212)
  30394. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30395. -->
  30396. (S1 ^operator O2212 = 0.685464805522946)
  30397. Retracting rl*prefer*rvt*predict-yes*H0*3
  30398. -->
  30399. (S1 ^operator O2211 = 0.3907779552208955)
  30400. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30401. -->
  30402. (S1 ^operator O2211 = -0.208713043145708)
  30403. --- END Proposal Phase ---
  30404. --- Decision Phase ---
  30405. RL update rl*prefer*rvt*predict-no*H0*4 0.478559 -0.164048 0.314512 -> 0.478561 -0.164047 0.314514(R,m,v=1,0.931034,0.0645804)
  30406. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521419 0.164045 0.685465 -> 0.521421 0.164046 0.685467(R,m,v=1,1,0)
  30407. =>WM: (15558: S1 ^operator O2214)
  30408. 1107: O: O2214 (predict-no)
  30409. --- END Decision Phase ---
  30410. --- Application Phase ---
  30411. --- Firing Productions (PE) For State At Depth 1 ---
  30412. --- Inner Elaboration Phase, active level 1 (S1) ---
  30413. Firing apply*operator
  30414. -->
  30415. (I3 ^predict-no N1107 + :O )
  30416. Firing apply*operator*complete
  30417. -->
  30418. (I3 ^predict-no N1106 - :O )
  30419. inner elaboration loop at bottom goal.
  30420. --- Change Working Memory (PE) ---
  30421. =>WM: (15559: I3 ^predict-no N1107)
  30422. <=WM: (15547: N1106 ^status complete)
  30423. <=WM: (15546: I3 ^predict-no N1106)
  30424. --- Firing Productions (IE) For State At Depth 1 ---
  30425. --- Inner Elaboration Phase, active level 1 (S1) ---
  30426. Firing monitor*world
  30427. -->
  30428. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30429. --- Change Working Memory (IE) ---
  30430. --- END Application Phase ---
  30431. --- Output Phase ---
  30432. ENV: Agent did: predict-no for direction L in state State-A
  30433. In State-A moving L
  30434. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  30435. predict error 0
  30436. dir: dir isR
  30437. --- END Output Phase ---
  30438. --- Input Phase ---
  30439. =>WM: (15563: I2 ^dir R)
  30440. =>WM: (15562: I2 ^reward 1)
  30441. =>WM: (15561: I2 ^see 0)
  30442. =>WM: (15560: N1107 ^status complete)
  30443. <=WM: (15550: I2 ^dir L)
  30444. <=WM: (15549: I2 ^reward 1)
  30445. <=WM: (15548: I2 ^see 0)
  30446. =>WM: (15564: I2 ^level-1 L0-root)
  30447. <=WM: (15551: I2 ^level-1 L0-root)
  30448. --- END Input Phase ---
  30449. --- Proposal Phase ---
  30450. --- Inner Elaboration Phase, active level 1 (S1) ---
  30451. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  30452. -->
  30453. (S1 ^operator O2213 = 0.8783998563714275)
  30454. Firing prefer*rvt*predict-yes*H0*5*H1
  30455. -->
  30456. Firing elaborate*copy-see-to-output-link
  30457. -->
  30458. (I3 ^see 0 +)
  30459. Firing elaborate*reward*based*on*reward
  30460. -->
  30461. (R1111 ^value 1 +)
  30462. (R1 ^reward R1111 +)
  30463. Firing propose*predict-yes
  30464. -->
  30465. (O2215 ^name predict-yes +)
  30466. (S1 ^operator O2215 +)
  30467. Firing propose*predict-no
  30468. -->
  30469. (O2216 ^name predict-no +)
  30470. (S1 ^operator O2216 +)
  30471. Firing rl*prefer*rvt*predict-no*H0*6
  30472. -->
  30473. (S1 ^operator O2214 = 0.9436253760703815)
  30474. Firing rl*prefer*rvt*predict-yes*H0*5
  30475. -->
  30476. (S1 ^operator O2213 = 0.1215965704221909)
  30477. Firing prefer*rvt*predict-yes*H0
  30478. -->
  30479. Firing prefer*rvt*predict-no*H0
  30480. -->
  30481. Firing elaborate*copy-dir-to-output-link
  30482. -->
  30483. (I3 ^dir R +)
  30484. inner elaboration loop at bottom goal.
  30485. Retracting elaborate*copy-see-to-output-link
  30486. -->
  30487. (I3 ^see 0 +)
  30488. Retracting propose*predict-no
  30489. -->
  30490. (O2214 ^name predict-no +)
  30491. (S1 ^operator O2214 +)
  30492. Retracting propose*predict-yes
  30493. -->
  30494. (O2213 ^name predict-yes +)
  30495. (S1 ^operator O2213 +)
  30496. Retracting elaborate*reward*based*on*reward
  30497. -->
  30498. (R1110 ^value 1 +)
  30499. (R1 ^reward R1110 +)
  30500. Retracting elaborate*copy-dir-to-output-link
  30501. -->
  30502. (I3 ^dir L +)
  30503. Retracting rl*prefer*rvt*predict-no*H0*4*H1*12
  30504. -->
  30505. (S1 ^operator O2214 = 0.685467000466911)
  30506. Retracting rl*prefer*rvt*predict-no*H0*4
  30507. -->
  30508. (S1 ^operator O2214 = 0.3145138004710756)
  30509. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*13
  30510. -->
  30511. (S1 ^operator O2213 = -0.208713043145708)
  30512. Retracting rl*prefer*rvt*predict-yes*H0*3
  30513. -->
  30514. (S1 ^operator O2213 = 0.3907779552208955)
  30515. =>WM: (15571: S1 ^operator O2216 +)
  30516. =>WM: (15570: S1 ^operator O2215 +)
  30517. =>WM: (15569: I3 ^dir R)
  30518. =>WM: (15568: O2216 ^name predict-no)
  30519. =>WM: (15567: O2215 ^name predict-yes)
  30520. =>WM: (15566: R1111 ^value 1)
  30521. =>WM: (15565: R1 ^reward R1111)
  30522. <=WM: (15556: S1 ^operator O2213 +)
  30523. <=WM: (15557: S1 ^operator O2214 +)
  30524. <=WM: (15558: S1 ^operator O2214)
  30525. <=WM: (15489: I3 ^dir L)
  30526. <=WM: (15552: R1 ^reward R1110)
  30527. <=WM: (15555: O2214 ^name predict-no)
  30528. <=WM: (15554: O2213 ^name predict-yes)
  30529. <=WM: (15553: R1110 ^value 1)
  30530. --- Inner Elaboration Phase, active level 1 (S1) ---
  30531. Firing prefer*rvt*predict-yes*H0
  30532. -->
  30533. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  30534. -->
  30535. (S1 ^operator O2215 = 0.8783998563714275)
  30536. Firing rl*prefer*rvt*predict-yes*H0*5
  30537. -->
  30538. (S1 ^operator O2215 = 0.1215965704221909)
  30539. Firing prefer*rvt*predict-yes*H0*5*H1
  30540. -->
  30541. Firing prefer*rvt*predict-no*H0
  30542. -->
  30543. Firing rl*prefer*rvt*predict-no*H0*6
  30544. -->
  30545. (S1 ^operator O2216 = 0.9436253760703815)
  30546. inner elaboration loop at bottom goal.
  30547. Retracting rl*prefer*rvt*predict-no*H0*6
  30548. -->
  30549. (S1 ^operator O2214 = 0.9436253760703815)
  30550. Retracting rl*prefer*rvt*predict-yes*H0*5
  30551. -->
  30552. (S1 ^operator O2213 = 0.1215965704221909)
  30553. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  30554. -->
  30555. (S1 ^operator O2213 = 0.8783998563714275)
  30556. --- END Proposal Phase ---
  30557. --- Decision Phase ---
  30558. RL update rl*prefer*rvt*predict-no*H0*4 0.478561 -0.164047 0.314514 -> 0.478563 -0.164047 0.314515(R,m,v=1,0.931429,0.0642365)
  30559. RL update rl*prefer*rvt*predict-no*H0*4*H1*12 0.521421 0.164046 0.685467 -> 0.521423 0.164046 0.685469(R,m,v=1,1,0)
  30560. =>WM: (15572: S1 ^operator O2215)
  30561. 1108: O: O2215 (predict-yes)
  30562. --- END Decision Phase ---
  30563. --- Application Phase ---
  30564. --- Firing Productions (PE) For State At Depth 1 ---
  30565. --- Inner Elaboration Phase, active level 1 (S1) ---
  30566. Firing apply*operator
  30567. -->
  30568. (I3 ^predict-yes N1108 + :O )
  30569. Firing apply*operator*complete
  30570. -->
  30571. (I3 ^predict-no N1107 - :O )
  30572. inner elaboration loop at bottom goal.
  30573. --- Change Working Memory (PE) ---
  30574. =>WM: (15573: I3 ^predict-yes N1108)
  30575. <=WM: (15560: N1107 ^status complete)
  30576. <=WM: (15559: I3 ^predict-no N1107)
  30577. --- Firing Productions (IE) For State At Depth 1 ---
  30578. --- Inner Elaboration Phase, active level 1 (S1) ---
  30579. Firing monitor*world
  30580. -->
  30581. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30582. --- Change Working Memory (IE) ---
  30583. --- END Application Phase ---
  30584. --- Output Phase ---
  30585. ENV: Agent did: predict-yes for direction R in state State-A
  30586. In State-A moving R
  30587. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  30588. predict error 0
  30589. dir: dir isU
  30590. --- END Output Phase ---
  30591. --- Input Phase ---
  30592. =>WM: (15577: I2 ^dir U)
  30593. =>WM: (15576: I2 ^reward 1)
  30594. =>WM: (15575: I2 ^see 1)
  30595. =>WM: (15574: N1108 ^status complete)
  30596. <=WM: (15563: I2 ^dir R)
  30597. <=WM: (15562: I2 ^reward 1)
  30598. <=WM: (15561: I2 ^see 0)
  30599. =>WM: (15578: I2 ^level-1 R1-root)
  30600. <=WM: (15564: I2 ^level-1 L0-root)
  30601. --- END Input Phase ---
  30602. --- Proposal Phase ---
  30603. --- Inner Elaboration Phase, active level 1 (S1) ---
  30604. Firing elaborate*copy-see-to-output-link
  30605. -->
  30606. (I3 ^see 1 +)
  30607. Firing elaborate*reward*based*on*reward
  30608. -->
  30609. (R1112 ^value 1 +)
  30610. (R1 ^reward R1112 +)
  30611. Firing propose*predict-yes
  30612. -->
  30613. (O2217 ^name predict-yes +)
  30614. (S1 ^operator O2217 +)
  30615. Firing propose*predict-no
  30616. -->
  30617. (O2218 ^name predict-no +)
  30618. (S1 ^operator O2218 +)
  30619. Firing rl*prefer*rvt*predict-no*H0*2
  30620. -->
  30621. (S1 ^operator O2216 = 1.)
  30622. Firing rl*prefer*rvt*predict-yes*H0*1
  30623. -->
  30624. (S1 ^operator O2215 = 0.)
  30625. Firing prefer*rvt*predict-yes*H0
  30626. -->
  30627. Firing prefer*rvt*predict-no*H0
  30628. -->
  30629. Firing elaborate*copy-dir-to-output-link
  30630. -->
  30631. (I3 ^dir U +)
  30632. inner elaboration loop at bottom goal.
  30633. Retracting elaborate*copy-see-to-output-link
  30634. -->
  30635. (I3 ^see 0 +)
  30636. Retracting propose*predict-no
  30637. -->
  30638. (O2216 ^name predict-no +)
  30639. (S1 ^operator O2216 +)
  30640. Retracting propose*predict-yes
  30641. -->
  30642. (O2215 ^name predict-yes +)
  30643. (S1 ^operator O2215 +)
  30644. Retracting elaborate*reward*based*on*reward
  30645. -->
  30646. (R1111 ^value 1 +)
  30647. (R1 ^reward R1111 +)
  30648. Retracting elaborate*copy-dir-to-output-link
  30649. -->
  30650. (I3 ^dir R +)
  30651. Retracting rl*prefer*rvt*predict-no*H0*6
  30652. -->
  30653. (S1 ^operator O2216 = 0.9436253760703815)
  30654. Retracting rl*prefer*rvt*predict-yes*H0*5
  30655. -->
  30656. (S1 ^operator O2215 = 0.1215965704221909)
  30657. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  30658. -->
  30659. (S1 ^operator O2215 = 0.8783998563714275)
  30660. =>WM: (15586: S1 ^operator O2218 +)
  30661. =>WM: (15585: S1 ^operator O2217 +)
  30662. =>WM: (15584: I3 ^dir U)
  30663. =>WM: (15583: O2218 ^name predict-no)
  30664. =>WM: (15582: O2217 ^name predict-yes)
  30665. =>WM: (15581: R1112 ^value 1)
  30666. =>WM: (15580: R1 ^reward R1112)
  30667. =>WM: (15579: I3 ^see 1)
  30668. <=WM: (15570: S1 ^operator O2215 +)
  30669. <=WM: (15572: S1 ^operator O2215)
  30670. <=WM: (15571: S1 ^operator O2216 +)
  30671. <=WM: (15569: I3 ^dir R)
  30672. <=WM: (15565: R1 ^reward R1111)
  30673. <=WM: (15512: I3 ^see 0)
  30674. <=WM: (15568: O2216 ^name predict-no)
  30675. <=WM: (15567: O2215 ^name predict-yes)
  30676. <=WM: (15566: R1111 ^value 1)
  30677. --- Inner Elaboration Phase, active level 1 (S1) ---
  30678. Firing prefer*rvt*predict-yes*H0
  30679. -->
  30680. Firing rl*prefer*rvt*predict-yes*H0*1
  30681. -->
  30682. (S1 ^operator O2217 = 0.)
  30683. Firing prefer*rvt*predict-no*H0
  30684. -->
  30685. Firing rl*prefer*rvt*predict-no*H0*2
  30686. -->
  30687. (S1 ^operator O2218 = 1.)
  30688. inner elaboration loop at bottom goal.
  30689. Retracting rl*prefer*rvt*predict-no*H0*2
  30690. -->
  30691. (S1 ^operator O2216 = 1.)
  30692. Retracting rl*prefer*rvt*predict-yes*H0*1
  30693. -->
  30694. (S1 ^operator O2215 = 0.)
  30695. --- END Proposal Phase ---
  30696. --- Decision Phase ---
  30697. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.878173,0.107531)
  30698. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.8784 -> 0.465474 0.412926 0.8784(R,m,v=1,1,0)
  30699. =>WM: (15587: S1 ^operator O2218)
  30700. 1109: O: O2218 (predict-no)
  30701. --- END Decision Phase ---
  30702. --- Application Phase ---
  30703. --- Firing Productions (PE) For State At Depth 1 ---
  30704. --- Inner Elaboration Phase, active level 1 (S1) ---
  30705. Firing apply*operator
  30706. -->
  30707. (I3 ^predict-no N1109 + :O )
  30708. Firing apply*operator*complete
  30709. -->
  30710. (I3 ^predict-yes N1108 - :O )
  30711. inner elaboration loop at bottom goal.
  30712. --- Change Working Memory (PE) ---
  30713. =>WM: (15588: I3 ^predict-no N1109)
  30714. <=WM: (15574: N1108 ^status complete)
  30715. <=WM: (15573: I3 ^predict-yes N1108)
  30716. --- Firing Productions (IE) For State At Depth 1 ---
  30717. --- Inner Elaboration Phase, active level 1 (S1) ---
  30718. Firing monitor*world
  30719. -->
  30720. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  30721. --- Change Working Memory (IE) ---
  30722. --- END Application Phase ---
  30723. --- Output Phase ---
  30724. ENV: Agent did: predict-no for direction U in state State-B
  30725. In State-B moving U
  30726. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  30727. predict error 0
  30728. dir: dir isL
  30729. --- END Output Phase ---
  30730. --- Input Phase ---
  30731. =>WM: (15592: I2 ^dir L)
  30732. =>WM: (15591: I2 ^reward 1)
  30733. =>WM: (15590: I2 ^see 0)
  30734. =>WM: (15589: N1109 ^status complete)
  30735. <=WM: (15577: I2 ^dir U)
  30736. <=WM: (15576: I2 ^reward 1)
  30737. <=WM: (15575: I2 ^see 1)
  30738. =>WM: (15593: I2 ^level-1 R1-root)
  30739. <=WM: (15578: I2 ^level-1 R1-root)
  30740. --- END Input Phase ---
  30741. --- Proposal Phase ---
  30742. --- Inner Elaboration Phase, active level 1 (S1) ---
  30743. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  30744. -->
  30745. (S1 ^operator O2218 = -0.168718511744511)
  30746. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  30747. -->
  30748. (S1 ^operator O2217 = 0.6092479147905668)
  30749. Firing prefer*rvt*predict-no*H0*4*H1
  30750. -->
  30751. Firing prefer*rvt*predict-yes*H0*3*H1
  30752. -->
  30753. Firing elaborate*copy-see-to-output-link
  30754. -->
  30755. (I3 ^see 0 +)
  30756. Firing elaborate*reward*based*on*reward
  30757. -->
  30758. (R1113 ^value 1 +)
  30759. (R1 ^reward R1113 +)
  30760. Firing propose*predict-yes
  30761. -->
  30762. (O2219 ^name predict-yes +)
  30763. (S1 ^operator O2219 +)
  30764. Firing propose*predict-no
  30765. -->
  30766. (O2220 ^name predict-no +)
  30767. (S1 ^operator O2220 +)
  30768. Firing rl*prefer*rvt*predict-no*H0*4
  30769. -->
  30770. (S1 ^operator O2218 = 0.3145153576266763)
  30771. Firing rl*prefer*rvt*predict-yes*H0*3
  30772. -->
  30773. (S1 ^operator O2217 = 0.3907779552208955)
  30774. Firing prefer*rvt*predict-yes*H0
  30775. -->
  30776. Firing prefer*rvt*predict-no*H0
  30777. -->
  30778. Firing elaborate*copy-dir-to-output-link
  30779. -->
  30780. (I3 ^dir L +)
  30781. inner elaboration loop at bottom goal.
  30782. Retracting elaborate*copy-see-to-output-link
  30783. -->
  30784. (I3 ^see 1 +)
  30785. Retracting propose*predict-no
  30786. -->
  30787. (O2218 ^name predict-no +)
  30788. (S1 ^operator O2218 +)
  30789. Retracting propose*predict-yes
  30790. -->
  30791. (O2217 ^name predict-yes +)
  30792. (S1 ^operator O2217 +)
  30793. Retracting elaborate*reward*based*on*reward
  30794. -->
  30795. (R1112 ^value 1 +)
  30796. (R1 ^reward R1112 +)
  30797. Retracting elaborate*copy-dir-to-output-link
  30798. -->
  30799. (I3 ^dir U +)
  30800. Retracting rl*prefer*rvt*predict-no*H0*2
  30801. -->
  30802. (S1 ^operator O2218 = 1.)
  30803. Retracting rl*prefer*rvt*predict-yes*H0*1
  30804. -->
  30805. (S1 ^operator O2217 = 0.)
  30806. =>WM: (15601: S1 ^operator O2220 +)
  30807. =>WM: (15600: S1 ^operator O2219 +)
  30808. =>WM: (15599: I3 ^dir L)
  30809. =>WM: (15598: O2220 ^name predict-no)
  30810. =>WM: (15597: O2219 ^name predict-yes)
  30811. =>WM: (15596: R1113 ^value 1)
  30812. =>WM: (15595: R1 ^reward R1113)
  30813. =>WM: (15594: I3 ^see 0)
  30814. <=WM: (15585: S1 ^operator O2217 +)
  30815. <=WM: (15586: S1 ^operator O2218 +)
  30816. <=WM: (15587: S1 ^operator O2218)
  30817. <=WM: (15584: I3 ^dir U)
  30818. <=WM: (15580: R1 ^reward R1112)
  30819. <=WM: (15579: I3 ^see 1)
  30820. <=WM: (15583: O2218 ^name predict-no)
  30821. <=WM: (15582: O2217 ^name predict-yes)
  30822. <=WM: (15581: R1112 ^value 1)
  30823. --- Inner Elaboration Phase, active level 1 (S1) ---
  30824. Firing prefer*rvt*predict-yes*H0
  30825. -->
  30826. Firing rl*prefer*rvt*predict-yes*H0*3*H1*17
  30827. -->
  30828. (S1 ^operator O2219 = 0.6092479147905668)
  30829. Firing rl*prefer*rvt*predict-yes*H0*3
  30830. -->
  30831. (S1 ^operator O2219 = 0.3907779552208955)
  30832. Firing prefer*rvt*predict-yes*H0*3*H1
  30833. -->
  30834. Firing prefer*rvt*predict-no*H0
  30835. -->
  30836. Firing rl*prefer*rvt*predict-no*H0*4*H1*16
  30837. -->
  30838. (S1 ^operator O2220 = -0.168718511744511)
  30839. Firing rl*prefer*rvt*predict-no*H0*4
  30840. -->
  30841. (S1 ^operator O2220 = 0.3145153576266763)
  30842. Firing prefer*rvt*predict-no*H0*4*H1
  30843. -->
  30844. inner elaboration loop at bottom goal.
  30845. Retracting rl*prefer*rvt*predict-no*H0*4
  30846. -->
  30847. (S1 ^operator O2218 = 0.3145153576266763)
  30848. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  30849. -->
  30850. (S1 ^operator O2218 = -0.168718511744511)
  30851. Retracting rl*prefer*rvt*predict-yes*H0*3
  30852. -->
  30853. (S1 ^operator O2217 = 0.3907779552208955)
  30854. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  30855. -->
  30856. (S1 ^operator O2217 = 0.6092479147905668)
  30857. --- END Proposal Phase ---
  30858. --- Decision Phase ---
  30859. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  30860. =>WM: (15602: S1 ^operator O2219)
  30861. 1110: O: O2219 (predict-yes)
  30862. --- END Decision Phase ---
  30863. --- Application Phase ---
  30864. --- Firing Productions (PE) For State At Depth 1 ---
  30865. --- Inner Elaboration Phase, active level 1 (S1) ---
  30866. Firing apply*operator
  30867. -->
  30868. (I3 ^predict-yes N1110 + :O )
  30869. Firing apply*operator*complete
  30870. -->
  30871. (I3 ^predict-no N1109 - :O )
  30872. inner elaboration loop at bottom goal.
  30873. --- Change Working Memory (PE) ---
  30874. =>WM: (15603: I3 ^predict-yes N1110)
  30875. <=WM: (15589: N1109 ^status complete)
  30876. <=WM: (15588: I3 ^predict-no N1109)
  30877. --- Firing Productions (IE) For State At Depth 1 ---
  30878. --- Inner Elaboration Phase, active level 1 (S1) ---
  30879. Firing monitor*world
  30880. -->
  30881. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  30882. --- Change Working Memory (IE) ---
  30883. --- END Application Phase ---
  30884. --- Output Phase ---
  30885. ENV: Agent did: predict-yes for direction L in state State-B
  30886. In State-B moving L
  30887. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  30888. predict error 0
  30889. dir: dir isR
  30890. --- END Output Phase ---
  30891. |\--- Input Phase ---
  30892. =>WM: (15607: I2 ^dir R)
  30893. =>WM: (15606: I2 ^reward 1)
  30894. =>WM: (15605: I2 ^see 1)
  30895. =>WM: (15604: N1110 ^status complete)
  30896. <=WM: (15592: I2 ^dir L)
  30897. <=WM: (15591: I2 ^reward 1)
  30898. <=WM: (15590: I2 ^see 0)
  30899. =>WM: (15608: I2 ^level-1 L1-root)
  30900. <=WM: (15593: I2 ^level-1 R1-root)
  30901. --- END Input Phase ---
  30902. --- Proposal Phase ---
  30903. --- Inner Elaboration Phase, active level 1 (S1) ---
  30904. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  30905. -->
  30906. (S1 ^operator O2219 = 0.8784067009010752)
  30907. Firing prefer*rvt*predict-yes*H0*5*H1
  30908. -->
  30909. Firing elaborate*copy-see-to-output-link
  30910. -->
  30911. (I3 ^see 1 +)
  30912. Firing elaborate*reward*based*on*reward
  30913. -->
  30914. (R1114 ^value 1 +)
  30915. (R1 ^reward R1114 +)
  30916. Firing propose*predict-yes
  30917. -->
  30918. (O2221 ^name predict-yes +)
  30919. (S1 ^operator O2221 +)
  30920. Firing propose*predict-no
  30921. -->
  30922. (O2222 ^name predict-no +)
  30923. (S1 ^operator O2222 +)
  30924. Firing rl*prefer*rvt*predict-no*H0*6
  30925. -->
  30926. (S1 ^operator O2220 = 0.9436253760703815)
  30927. Firing rl*prefer*rvt*predict-yes*H0*5
  30928. -->
  30929. (S1 ^operator O2219 = 0.1215968547680865)
  30930. Firing prefer*rvt*predict-yes*H0
  30931. -->
  30932. Firing prefer*rvt*predict-no*H0
  30933. -->
  30934. Firing elaborate*copy-dir-to-output-link
  30935. -->
  30936. (I3 ^dir R +)
  30937. inner elaboration loop at bottom goal.
  30938. Retracting elaborate*copy-see-to-output-link
  30939. -->
  30940. (I3 ^see 0 +)
  30941. Retracting propose*predict-no
  30942. -->
  30943. (O2220 ^name predict-no +)
  30944. (S1 ^operator O2220 +)
  30945. Retracting propose*predict-yes
  30946. -->
  30947. (O2219 ^name predict-yes +)
  30948. (S1 ^operator O2219 +)
  30949. Retracting elaborate*reward*based*on*reward
  30950. -->
  30951. (R1113 ^value 1 +)
  30952. (R1 ^reward R1113 +)
  30953. Retracting elaborate*copy-dir-to-output-link
  30954. -->
  30955. (I3 ^dir L +)
  30956. Retracting rl*prefer*rvt*predict-no*H0*4
  30957. -->
  30958. (S1 ^operator O2220 = 0.3145153576266763)
  30959. Retracting rl*prefer*rvt*predict-no*H0*4*H1*16
  30960. -->
  30961. (S1 ^operator O2220 = -0.168718511744511)
  30962. Retracting rl*prefer*rvt*predict-yes*H0*3
  30963. -->
  30964. (S1 ^operator O2219 = 0.3907779552208955)
  30965. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*17
  30966. -->
  30967. (S1 ^operator O2219 = 0.6092479147905668)
  30968. =>WM: (15616: S1 ^operator O2222 +)
  30969. =>WM: (15615: S1 ^operator O2221 +)
  30970. =>WM: (15614: I3 ^dir R)
  30971. =>WM: (15613: O2222 ^name predict-no)
  30972. =>WM: (15612: O2221 ^name predict-yes)
  30973. =>WM: (15611: R1114 ^value 1)
  30974. =>WM: (15610: R1 ^reward R1114)
  30975. =>WM: (15609: I3 ^see 1)
  30976. <=WM: (15600: S1 ^operator O2219 +)
  30977. <=WM: (15602: S1 ^operator O2219)
  30978. <=WM: (15601: S1 ^operator O2220 +)
  30979. <=WM: (15599: I3 ^dir L)
  30980. <=WM: (15595: R1 ^reward R1113)
  30981. <=WM: (15594: I3 ^see 0)
  30982. <=WM: (15598: O2220 ^name predict-no)
  30983. <=WM: (15597: O2219 ^name predict-yes)
  30984. <=WM: (15596: R1113 ^value 1)
  30985. --- Inner Elaboration Phase, active level 1 (S1) ---
  30986. Firing prefer*rvt*predict-yes*H0
  30987. -->
  30988. Firing rl*prefer*rvt*predict-yes*H0*5
  30989. -->
  30990. (S1 ^operator O2221 = 0.1215968547680865)
  30991. Firing prefer*rvt*predict-yes*H0*5*H1
  30992. -->
  30993. Firing rl*prefer*rvt*predict-yes*H0*5*H1*18
  30994. -->
  30995. (S1 ^operator O2221 = 0.8784067009010752)
  30996. Firing prefer*rvt*predict-no*H0
  30997. -->
  30998. Firing rl*prefer*rvt*predict-no*H0*6
  30999. -->
  31000. (S1 ^operator O2222 = 0.9436253760703815)
  31001. inner elaboration loop at bottom goal.
  31002. Retracting rl*prefer*rvt*predict-no*H0*6
  31003. -->
  31004. (S1 ^operator O2220 = 0.9436253760703815)
  31005. Retracting rl*prefer*rvt*predict-yes*H0*5
  31006. -->
  31007. (S1 ^operator O2219 = 0.1215968547680865)
  31008. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  31009. -->
  31010. (S1 ^operator O2219 = 0.8784067009010752)
  31011. --- END Proposal Phase ---
  31012. --- Decision Phase ---
  31013. RL update rl*prefer*rvt*predict-yes*H0*3 0.472324 -0.0815459 0.390778 -> 0.472322 -0.0815462 0.390776(R,m,v=1,0.950549,0.0472649)
  31014. RL update rl*prefer*rvt*predict-yes*H0*3*H1*17 0.527698 0.0815497 0.609248 -> 0.527696 0.0815494 0.609246(R,m,v=1,1,0)
  31015. =>WM: (15617: S1 ^operator O2221)
  31016. 1111: O: O2221 (predict-yes)
  31017. --- END Decision Phase ---
  31018. --- Application Phase ---
  31019. --- Firing Productions (PE) For State At Depth 1 ---
  31020. --- Inner Elaboration Phase, active level 1 (S1) ---
  31021. Firing apply*operator
  31022. -->
  31023. (I3 ^predict-yes N1111 + :O )
  31024. Firing apply*operator*complete
  31025. -->
  31026. (I3 ^predict-yes N1110 - :O )
  31027. inner elaboration loop at bottom goal.
  31028. --- Change Working Memory (PE) ---
  31029. =>WM: (15618: I3 ^predict-yes N1111)
  31030. <=WM: (15604: N1110 ^status complete)
  31031. <=WM: (15603: I3 ^predict-yes N1110)
  31032. --- Firing Productions (IE) For State At Depth 1 ---
  31033. --- Inner Elaboration Phase, active level 1 (S1) ---
  31034. Firing monitor*world
  31035. -->
  31036. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31037. --- Change Working Memory (IE) ---
  31038. --- END Application Phase ---
  31039. --- Output Phase ---
  31040. ENV: Agent did: predict-yes for direction R in state State-A
  31041. In State-A moving R
  31042. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  31043. predict error 0
  31044. dir: dir isR
  31045. --- END Output Phase ---
  31046. ---- Input Phase ---
  31047. =>WM: (15622: I2 ^dir R)
  31048. =>WM: (15621: I2 ^reward 1)
  31049. =>WM: (15620: I2 ^see 1)
  31050. =>WM: (15619: N1111 ^status complete)
  31051. <=WM: (15607: I2 ^dir R)
  31052. <=WM: (15606: I2 ^reward 1)
  31053. <=WM: (15605: I2 ^see 1)
  31054. =>WM: (15623: I2 ^level-1 R1-root)
  31055. <=WM: (15608: I2 ^level-1 L1-root)
  31056. --- END Input Phase ---
  31057. --- Proposal Phase ---
  31058. --- Inner Elaboration Phase, active level 1 (S1) ---
  31059. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  31060. -->
  31061. (S1 ^operator O2221 = -0.04253361215288998)
  31062. Firing prefer*rvt*predict-yes*H0*5*H1
  31063. -->
  31064. Firing elaborate*copy-see-to-output-link
  31065. -->
  31066. (I3 ^see 1 +)
  31067. Firing elaborate*reward*based*on*reward
  31068. -->
  31069. (R1115 ^value 1 +)
  31070. (R1 ^reward R1115 +)
  31071. Firing propose*predict-yes
  31072. -->
  31073. (O2223 ^name predict-yes +)
  31074. (S1 ^operator O2223 +)
  31075. Firing propose*predict-no
  31076. -->
  31077. (O2224 ^name predict-no +)
  31078. (S1 ^operator O2224 +)
  31079. Firing rl*prefer*rvt*predict-no*H0*6
  31080. -->
  31081. (S1 ^operator O2222 = 0.9436253760703815)
  31082. Firing rl*prefer*rvt*predict-yes*H0*5
  31083. -->
  31084. (S1 ^operator O2221 = 0.1215968547680865)
  31085. Firing prefer*rvt*predict-yes*H0
  31086. -->
  31087. Firing prefer*rvt*predict-no*H0
  31088. -->
  31089. Firing elaborate*copy-dir-to-output-link
  31090. -->
  31091. (I3 ^dir R +)
  31092. inner elaboration loop at bottom goal.
  31093. Retracting elaborate*copy-see-to-output-link
  31094. -->
  31095. (I3 ^see 1 +)
  31096. Retracting propose*predict-no
  31097. -->
  31098. (O2222 ^name predict-no +)
  31099. (S1 ^operator O2222 +)
  31100. Retracting propose*predict-yes
  31101. -->
  31102. (O2221 ^name predict-yes +)
  31103. (S1 ^operator O2221 +)
  31104. Retracting elaborate*reward*based*on*reward
  31105. -->
  31106. (R1114 ^value 1 +)
  31107. (R1 ^reward R1114 +)
  31108. Retracting elaborate*copy-dir-to-output-link
  31109. -->
  31110. (I3 ^dir R +)
  31111. Retracting rl*prefer*rvt*predict-no*H0*6
  31112. -->
  31113. (S1 ^operator O2222 = 0.9436253760703815)
  31114. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*18
  31115. -->
  31116. (S1 ^operator O2221 = 0.8784067009010752)
  31117. Retracting rl*prefer*rvt*predict-yes*H0*5
  31118. -->
  31119. (S1 ^operator O2221 = 0.1215968547680865)
  31120. =>WM: (15629: S1 ^operator O2224 +)
  31121. =>WM: (15628: S1 ^operator O2223 +)
  31122. =>WM: (15627: O2224 ^name predict-no)
  31123. =>WM: (15626: O2223 ^name predict-yes)
  31124. =>WM: (15625: R1115 ^value 1)
  31125. =>WM: (15624: R1 ^reward R1115)
  31126. <=WM: (15615: S1 ^operator O2221 +)
  31127. <=WM: (15617: S1 ^operator O2221)
  31128. <=WM: (15616: S1 ^operator O2222 +)
  31129. <=WM: (15610: R1 ^reward R1114)
  31130. <=WM: (15613: O2222 ^name predict-no)
  31131. <=WM: (15612: O2221 ^name predict-yes)
  31132. <=WM: (15611: R1114 ^value 1)
  31133. --- Inner Elaboration Phase, active level 1 (S1) ---
  31134. Firing prefer*rvt*predict-yes*H0
  31135. -->
  31136. Firing rl*prefer*rvt*predict-yes*H0*5
  31137. -->
  31138. (S1 ^operator O2223 = 0.1215968547680865)
  31139. Firing prefer*rvt*predict-yes*H0*5*H1
  31140. -->
  31141. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  31142. -->
  31143. (S1 ^operator O2223 = -0.04253361215288998)
  31144. Firing prefer*rvt*predict-no*H0
  31145. -->
  31146. Firing rl*prefer*rvt*predict-no*H0*6
  31147. -->
  31148. (S1 ^operator O2224 = 0.9436253760703815)
  31149. inner elaboration loop at bottom goal.
  31150. Retracting rl*prefer*rvt*predict-no*H0*6
  31151. -->
  31152. (S1 ^operator O2222 = 0.9436253760703815)
  31153. Retracting rl*prefer*rvt*predict-yes*H0*5
  31154. -->
  31155. (S1 ^operator O2221 = 0.1215968547680865)
  31156. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  31157. -->
  31158. (S1 ^operator O2221 = -0.04253361215288998)
  31159. --- END Proposal Phase ---
  31160. --- Decision Phase ---
  31161. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.878788,0.10706)
  31162. RL update rl*prefer*rvt*predict-yes*H0*5*H1*18 0.46548 0.412927 0.878407 -> 0.46548 0.412927 0.878406(R,m,v=1,1,0)
  31163. =>WM: (15630: S1 ^operator O2224)
  31164. 1112: O: O2224 (predict-no)
  31165. --- END Decision Phase ---
  31166. --- Application Phase ---
  31167. --- Firing Productions (PE) For State At Depth 1 ---
  31168. --- Inner Elaboration Phase, active level 1 (S1) ---
  31169. Firing apply*operator
  31170. -->
  31171. (I3 ^predict-no N1112 + :O )
  31172. Firing apply*operator*complete
  31173. -->
  31174. (I3 ^predict-yes N1111 - :O )
  31175. inner elaboration loop at bottom goal.
  31176. --- Change Working Memory (PE) ---
  31177. =>WM: (15631: I3 ^predict-no N1112)
  31178. <=WM: (15619: N1111 ^status complete)
  31179. <=WM: (15618: I3 ^predict-yes N1111)
  31180. --- Firing Productions (IE) For State At Depth 1 ---
  31181. --- Inner Elaboration Phase, active level 1 (S1) ---
  31182. Firing monitor*world
  31183. -->
  31184. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31185. --- Change Working Memory (IE) ---
  31186. --- END Application Phase ---
  31187. --- Output Phase ---
  31188. ENV: Agent did: predict-no for direction R in state State-B
  31189. In State-B moving R
  31190. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31191. predict error 0
  31192. dir: dir isU
  31193. --- END Output Phase ---
  31194. /|\--- Input Phase ---
  31195. =>WM: (15635: I2 ^dir U)
  31196. =>WM: (15634: I2 ^reward 1)
  31197. =>WM: (15633: I2 ^see 0)
  31198. =>WM: (15632: N1112 ^status complete)
  31199. <=WM: (15622: I2 ^dir R)
  31200. <=WM: (15621: I2 ^reward 1)
  31201. <=WM: (15620: I2 ^see 1)
  31202. =>WM: (15636: I2 ^level-1 R0-root)
  31203. <=WM: (15623: I2 ^level-1 R1-root)
  31204. --- END Input Phase ---
  31205. --- Proposal Phase ---
  31206. --- Inner Elaboration Phase, active level 1 (S1) ---
  31207. Firing elaborate*copy-see-to-output-link
  31208. -->
  31209. (I3 ^see 0 +)
  31210. Firing elaborate*reward*based*on*reward
  31211. -->
  31212. (R1116 ^value 1 +)
  31213. (R1 ^reward R1116 +)
  31214. Firing propose*predict-yes
  31215. -->
  31216. (O2225 ^name predict-yes +)
  31217. (S1 ^operator O2225 +)
  31218. Firing propose*predict-no
  31219. -->
  31220. (O2226 ^name predict-no +)
  31221. (S1 ^operator O2226 +)
  31222. Firing rl*prefer*rvt*predict-no*H0*2
  31223. -->
  31224. (S1 ^operator O2224 = 1.)
  31225. Firing rl*prefer*rvt*predict-yes*H0*1
  31226. -->
  31227. (S1 ^operator O2223 = 0.)
  31228. Firing prefer*rvt*predict-yes*H0
  31229. -->
  31230. Firing prefer*rvt*predict-no*H0
  31231. -->
  31232. Firing elaborate*copy-dir-to-output-link
  31233. -->
  31234. (I3 ^dir U +)
  31235. inner elaboration loop at bottom goal.
  31236. Retracting elaborate*copy-see-to-output-link
  31237. -->
  31238. (I3 ^see 1 +)
  31239. Retracting propose*predict-no
  31240. -->
  31241. (O2224 ^name predict-no +)
  31242. (S1 ^operator O2224 +)
  31243. Retracting propose*predict-yes
  31244. -->
  31245. (O2223 ^name predict-yes +)
  31246. (S1 ^operator O2223 +)
  31247. Retracting elaborate*reward*based*on*reward
  31248. -->
  31249. (R1115 ^value 1 +)
  31250. (R1 ^reward R1115 +)
  31251. Retracting elaborate*copy-dir-to-output-link
  31252. -->
  31253. (I3 ^dir R +)
  31254. Retracting rl*prefer*rvt*predict-no*H0*6
  31255. -->
  31256. (S1 ^operator O2224 = 0.9436253760703815)
  31257. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  31258. -->
  31259. (S1 ^operator O2223 = -0.04253361215288998)
  31260. Retracting rl*prefer*rvt*predict-yes*H0*5
  31261. -->
  31262. (S1 ^operator O2223 = 0.1215965720455857)
  31263. =>WM: (15644: S1 ^operator O2226 +)
  31264. =>WM: (15643: S1 ^operator O2225 +)
  31265. =>WM: (15642: I3 ^dir U)
  31266. =>WM: (15641: O2226 ^name predict-no)
  31267. =>WM: (15640: O2225 ^name predict-yes)
  31268. =>WM: (15639: R1116 ^value 1)
  31269. =>WM: (15638: R1 ^reward R1116)
  31270. =>WM: (15637: I3 ^see 0)
  31271. <=WM: (15628: S1 ^operator O2223 +)
  31272. <=WM: (15629: S1 ^operator O2224 +)
  31273. <=WM: (15630: S1 ^operator O2224)
  31274. <=WM: (15614: I3 ^dir R)
  31275. <=WM: (15624: R1 ^reward R1115)
  31276. <=WM: (15609: I3 ^see 1)
  31277. <=WM: (15627: O2224 ^name predict-no)
  31278. <=WM: (15626: O2223 ^name predict-yes)
  31279. <=WM: (15625: R1115 ^value 1)
  31280. --- Inner Elaboration Phase, active level 1 (S1) ---
  31281. Firing prefer*rvt*predict-yes*H0
  31282. -->
  31283. Firing rl*prefer*rvt*predict-yes*H0*1
  31284. -->
  31285. (S1 ^operator O2225 = 0.)
  31286. Firing prefer*rvt*predict-no*H0
  31287. -->
  31288. Firing rl*prefer*rvt*predict-no*H0*2
  31289. -->
  31290. (S1 ^operator O2226 = 1.)
  31291. inner elaboration loop at bottom goal.
  31292. Retracting rl*prefer*rvt*predict-no*H0*2
  31293. -->
  31294. (S1 ^operator O2224 = 1.)
  31295. Retracting rl*prefer*rvt*predict-yes*H0*1
  31296. -->
  31297. (S1 ^operator O2223 = 0.)
  31298. --- END Proposal Phase ---
  31299. --- Decision Phase ---
  31300. RL update rl*prefer*rvt*predict-no*H0*6 0.943625 0 0.943625 -> 0.95262 0 0.95262(R,m,v=1,0.938144,0.0583302)
  31301. =>WM: (15645: S1 ^operator O2226)
  31302. 1113: O: O2226 (predict-no)
  31303. --- END Decision Phase ---
  31304. --- Application Phase ---
  31305. --- Firing Productions (PE) For State At Depth 1 ---
  31306. --- Inner Elaboration Phase, active level 1 (S1) ---
  31307. Firing apply*operator
  31308. -->
  31309. (I3 ^predict-no N1113 + :O )
  31310. Firing apply*operator*complete
  31311. -->
  31312. (I3 ^predict-no N1112 - :O )
  31313. inner elaboration loop at bottom goal.
  31314. --- Change Working Memory (PE) ---
  31315. =>WM: (15646: I3 ^predict-no N1113)
  31316. <=WM: (15632: N1112 ^status complete)
  31317. <=WM: (15631: I3 ^predict-no N1112)
  31318. --- Firing Productions (IE) For State At Depth 1 ---
  31319. --- Inner Elaboration Phase, active level 1 (S1) ---
  31320. Firing monitor*world
  31321. -->
  31322. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31323. --- Change Working Memory (IE) ---
  31324. --- END Application Phase ---
  31325. --- Output Phase ---
  31326. ENV: Agent did: predict-no for direction U in state State-B
  31327. In State-B moving U
  31328. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31329. predict error 0
  31330. dir: dir isU
  31331. --- END Output Phase ---
  31332. -/--- Input Phase ---
  31333. =>WM: (15650: I2 ^dir U)
  31334. =>WM: (15649: I2 ^reward 1)
  31335. =>WM: (15648: I2 ^see 0)
  31336. =>WM: (15647: N1113 ^status complete)
  31337. <=WM: (15635: I2 ^dir U)
  31338. <=WM: (15634: I2 ^reward 1)
  31339. <=WM: (15633: I2 ^see 0)
  31340. =>WM: (15651: I2 ^level-1 R0-root)
  31341. <=WM: (15636: I2 ^level-1 R0-root)
  31342. --- END Input Phase ---
  31343. --- Proposal Phase ---
  31344. --- Inner Elaboration Phase, active level 1 (S1) ---
  31345. Firing elaborate*copy-see-to-output-link
  31346. -->
  31347. (I3 ^see 0 +)
  31348. Firing elaborate*reward*based*on*reward
  31349. -->
  31350. (R1117 ^value 1 +)
  31351. (R1 ^reward R1117 +)
  31352. Firing propose*predict-yes
  31353. -->
  31354. (O2227 ^name predict-yes +)
  31355. (S1 ^operator O2227 +)
  31356. Firing propose*predict-no
  31357. -->
  31358. (O2228 ^name predict-no +)
  31359. (S1 ^operator O2228 +)
  31360. Firing rl*prefer*rvt*predict-no*H0*2
  31361. -->
  31362. (S1 ^operator O2226 = 1.)
  31363. Firing rl*prefer*rvt*predict-yes*H0*1
  31364. -->
  31365. (S1 ^operator O2225 = 0.)
  31366. Firing prefer*rvt*predict-yes*H0
  31367. -->
  31368. Firing prefer*rvt*predict-no*H0
  31369. -->
  31370. Firing elaborate*copy-dir-to-output-link
  31371. -->
  31372. (I3 ^dir U +)
  31373. inner elaboration loop at bottom goal.
  31374. Retracting elaborate*copy-see-to-output-link
  31375. -->
  31376. (I3 ^see 0 +)
  31377. Retracting propose*predict-no
  31378. -->
  31379. (O2226 ^name predict-no +)
  31380. (S1 ^operator O2226 +)
  31381. Retracting propose*predict-yes
  31382. -->
  31383. (O2225 ^name predict-yes +)
  31384. (S1 ^operator O2225 +)
  31385. Retracting elaborate*reward*based*on*reward
  31386. -->
  31387. (R1116 ^value 1 +)
  31388. (R1 ^reward R1116 +)
  31389. Retracting elaborate*copy-dir-to-output-link
  31390. -->
  31391. (I3 ^dir U +)
  31392. Retracting rl*prefer*rvt*predict-no*H0*2
  31393. -->
  31394. (S1 ^operator O2226 = 1.)
  31395. Retracting rl*prefer*rvt*predict-yes*H0*1
  31396. -->
  31397. (S1 ^operator O2225 = 0.)
  31398. =>WM: (15657: S1 ^operator O2228 +)
  31399. =>WM: (15656: S1 ^operator O2227 +)
  31400. =>WM: (15655: O2228 ^name predict-no)
  31401. =>WM: (15654: O2227 ^name predict-yes)
  31402. =>WM: (15653: R1117 ^value 1)
  31403. =>WM: (15652: R1 ^reward R1117)
  31404. <=WM: (15643: S1 ^operator O2225 +)
  31405. <=WM: (15644: S1 ^operator O2226 +)
  31406. <=WM: (15645: S1 ^operator O2226)
  31407. <=WM: (15638: R1 ^reward R1116)
  31408. <=WM: (15641: O2226 ^name predict-no)
  31409. <=WM: (15640: O2225 ^name predict-yes)
  31410. <=WM: (15639: R1116 ^value 1)
  31411. --- Inner Elaboration Phase, active level 1 (S1) ---
  31412. Firing prefer*rvt*predict-yes*H0
  31413. -->
  31414. Firing rl*prefer*rvt*predict-yes*H0*1
  31415. -->
  31416. (S1 ^operator O2227 = 0.)
  31417. Firing prefer*rvt*predict-no*H0
  31418. -->
  31419. Firing rl*prefer*rvt*predict-no*H0*2
  31420. -->
  31421. (S1 ^operator O2228 = 1.)
  31422. inner elaboration loop at bottom goal.
  31423. Retracting rl*prefer*rvt*predict-no*H0*2
  31424. -->
  31425. (S1 ^operator O2226 = 1.)
  31426. Retracting rl*prefer*rvt*predict-yes*H0*1
  31427. -->
  31428. (S1 ^operator O2225 = 0.)
  31429. --- END Proposal Phase ---
  31430. --- Decision Phase ---
  31431. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31432. =>WM: (15658: S1 ^operator O2228)
  31433. 1114: O: O2228 (predict-no)
  31434. --- END Decision Phase ---
  31435. --- Application Phase ---
  31436. --- Firing Productions (PE) For State At Depth 1 ---
  31437. --- Inner Elaboration Phase, active level 1 (S1) ---
  31438. Firing apply*operator
  31439. -->
  31440. (I3 ^predict-no N1114 + :O )
  31441. Firing apply*operator*complete
  31442. -->
  31443. (I3 ^predict-no N1113 - :O )
  31444. inner elaboration loop at bottom goal.
  31445. --- Change Working Memory (PE) ---
  31446. =>WM: (15659: I3 ^predict-no N1114)
  31447. <=WM: (15647: N1113 ^status complete)
  31448. <=WM: (15646: I3 ^predict-no N1113)
  31449. --- Firing Productions (IE) For State At Depth 1 ---
  31450. --- Inner Elaboration Phase, active level 1 (S1) ---
  31451. Firing monitor*world
  31452. -->
  31453. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31454. --- Change Working Memory (IE) ---
  31455. --- END Application Phase ---
  31456. --- Output Phase ---
  31457. ENV: Agent did: predict-no for direction U in state State-B
  31458. In State-B moving U
  31459. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31460. predict error 0
  31461. dir: dir isU
  31462. --- END Output Phase ---
  31463. |\--- Input Phase ---
  31464. =>WM: (15663: I2 ^dir U)
  31465. =>WM: (15662: I2 ^reward 1)
  31466. =>WM: (15661: I2 ^see 0)
  31467. =>WM: (15660: N1114 ^status complete)
  31468. <=WM: (15650: I2 ^dir U)
  31469. <=WM: (15649: I2 ^reward 1)
  31470. <=WM: (15648: I2 ^see 0)
  31471. =>WM: (15664: I2 ^level-1 R0-root)
  31472. <=WM: (15651: I2 ^level-1 R0-root)
  31473. --- END Input Phase ---
  31474. --- Proposal Phase ---
  31475. --- Inner Elaboration Phase, active level 1 (S1) ---
  31476. Firing elaborate*copy-see-to-output-link
  31477. -->
  31478. (I3 ^see 0 +)
  31479. Firing elaborate*reward*based*on*reward
  31480. -->
  31481. (R1118 ^value 1 +)
  31482. (R1 ^reward R1118 +)
  31483. Firing propose*predict-yes
  31484. -->
  31485. (O2229 ^name predict-yes +)
  31486. (S1 ^operator O2229 +)
  31487. Firing propose*predict-no
  31488. -->
  31489. (O2230 ^name predict-no +)
  31490. (S1 ^operator O2230 +)
  31491. Firing rl*prefer*rvt*predict-no*H0*2
  31492. -->
  31493. (S1 ^operator O2228 = 1.)
  31494. Firing rl*prefer*rvt*predict-yes*H0*1
  31495. -->
  31496. (S1 ^operator O2227 = 0.)
  31497. Firing prefer*rvt*predict-yes*H0
  31498. -->
  31499. Firing prefer*rvt*predict-no*H0
  31500. -->
  31501. Firing elaborate*copy-dir-to-output-link
  31502. -->
  31503. (I3 ^dir U +)
  31504. inner elaboration loop at bottom goal.
  31505. Retracting elaborate*copy-see-to-output-link
  31506. -->
  31507. (I3 ^see 0 +)
  31508. Retracting propose*predict-no
  31509. -->
  31510. (O2228 ^name predict-no +)
  31511. (S1 ^operator O2228 +)
  31512. Retracting propose*predict-yes
  31513. -->
  31514. (O2227 ^name predict-yes +)
  31515. (S1 ^operator O2227 +)
  31516. Retracting elaborate*reward*based*on*reward
  31517. -->
  31518. (R1117 ^value 1 +)
  31519. (R1 ^reward R1117 +)
  31520. Retracting elaborate*copy-dir-to-output-link
  31521. -->
  31522. (I3 ^dir U +)
  31523. Retracting rl*prefer*rvt*predict-no*H0*2
  31524. -->
  31525. (S1 ^operator O2228 = 1.)
  31526. Retracting rl*prefer*rvt*predict-yes*H0*1
  31527. -->
  31528. (S1 ^operator O2227 = 0.)
  31529. =>WM: (15670: S1 ^operator O2230 +)
  31530. =>WM: (15669: S1 ^operator O2229 +)
  31531. =>WM: (15668: O2230 ^name predict-no)
  31532. =>WM: (15667: O2229 ^name predict-yes)
  31533. =>WM: (15666: R1118 ^value 1)
  31534. =>WM: (15665: R1 ^reward R1118)
  31535. <=WM: (15656: S1 ^operator O2227 +)
  31536. <=WM: (15657: S1 ^operator O2228 +)
  31537. <=WM: (15658: S1 ^operator O2228)
  31538. <=WM: (15652: R1 ^reward R1117)
  31539. <=WM: (15655: O2228 ^name predict-no)
  31540. <=WM: (15654: O2227 ^name predict-yes)
  31541. <=WM: (15653: R1117 ^value 1)
  31542. --- Inner Elaboration Phase, active level 1 (S1) ---
  31543. Firing prefer*rvt*predict-yes*H0
  31544. -->
  31545. Firing rl*prefer*rvt*predict-yes*H0*1
  31546. -->
  31547. (S1 ^operator O2229 = 0.)
  31548. Firing prefer*rvt*predict-no*H0
  31549. -->
  31550. Firing rl*prefer*rvt*predict-no*H0*2
  31551. -->
  31552. (S1 ^operator O2230 = 1.)
  31553. inner elaboration loop at bottom goal.
  31554. Retracting rl*prefer*rvt*predict-no*H0*2
  31555. -->
  31556. (S1 ^operator O2228 = 1.)
  31557. Retracting rl*prefer*rvt*predict-yes*H0*1
  31558. -->
  31559. (S1 ^operator O2227 = 0.)
  31560. --- END Proposal Phase ---
  31561. --- Decision Phase ---
  31562. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31563. =>WM: (15671: S1 ^operator O2230)
  31564. 1115: O: O2230 (predict-no)
  31565. --- END Decision Phase ---
  31566. --- Application Phase ---
  31567. --- Firing Productions (PE) For State At Depth 1 ---
  31568. --- Inner Elaboration Phase, active level 1 (S1) ---
  31569. Firing apply*operator
  31570. -->
  31571. (I3 ^predict-no N1115 + :O )
  31572. Firing apply*operator*complete
  31573. -->
  31574. (I3 ^predict-no N1114 - :O )
  31575. inner elaboration loop at bottom goal.
  31576. --- Change Working Memory (PE) ---
  31577. =>WM: (15672: I3 ^predict-no N1115)
  31578. <=WM: (15660: N1114 ^status complete)
  31579. <=WM: (15659: I3 ^predict-no N1114)
  31580. --- Firing Productions (IE) For State At Depth 1 ---
  31581. --- Inner Elaboration Phase, active level 1 (S1) ---
  31582. Firing monitor*world
  31583. -->
  31584. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31585. --- Change Working Memory (IE) ---
  31586. --- END Application Phase ---
  31587. --- Output Phase ---
  31588. ENV: Agent did: predict-no for direction U in state State-B
  31589. In State-B moving U
  31590. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  31591. predict error 0
  31592. dir: dir isL
  31593. --- END Output Phase ---
  31594. -/|--- Input Phase ---
  31595. =>WM: (15676: I2 ^dir L)
  31596. =>WM: (15675: I2 ^reward 1)
  31597. =>WM: (15674: I2 ^see 0)
  31598. =>WM: (15673: N1115 ^status complete)
  31599. <=WM: (15663: I2 ^dir U)
  31600. <=WM: (15662: I2 ^reward 1)
  31601. <=WM: (15661: I2 ^see 0)
  31602. =>WM: (15677: I2 ^level-1 R0-root)
  31603. <=WM: (15664: I2 ^level-1 R0-root)
  31604. --- END Input Phase ---
  31605. --- Proposal Phase ---
  31606. --- Inner Elaboration Phase, active level 1 (S1) ---
  31607. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  31608. -->
  31609. (S1 ^operator O2230 = -0.1984300550322165)
  31610. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  31611. -->
  31612. (S1 ^operator O2229 = 0.6091755191206203)
  31613. Firing prefer*rvt*predict-no*H0*4*H1
  31614. -->
  31615. Firing prefer*rvt*predict-yes*H0*3*H1
  31616. -->
  31617. Firing elaborate*copy-see-to-output-link
  31618. -->
  31619. (I3 ^see 0 +)
  31620. Firing elaborate*reward*based*on*reward
  31621. -->
  31622. (R1119 ^value 1 +)
  31623. (R1 ^reward R1119 +)
  31624. Firing propose*predict-yes
  31625. -->
  31626. (O2231 ^name predict-yes +)
  31627. (S1 ^operator O2231 +)
  31628. Firing propose*predict-no
  31629. -->
  31630. (O2232 ^name predict-no +)
  31631. (S1 ^operator O2232 +)
  31632. Firing rl*prefer*rvt*predict-no*H0*4
  31633. -->
  31634. (S1 ^operator O2230 = 0.3145153576266763)
  31635. Firing rl*prefer*rvt*predict-yes*H0*3
  31636. -->
  31637. (S1 ^operator O2229 = 0.3907758702770224)
  31638. Firing prefer*rvt*predict-yes*H0
  31639. -->
  31640. Firing prefer*rvt*predict-no*H0
  31641. -->
  31642. Firing elaborate*copy-dir-to-output-link
  31643. -->
  31644. (I3 ^dir L +)
  31645. inner elaboration loop at bottom goal.
  31646. Retracting elaborate*copy-see-to-output-link
  31647. -->
  31648. (I3 ^see 0 +)
  31649. Retracting propose*predict-no
  31650. -->
  31651. (O2230 ^name predict-no +)
  31652. (S1 ^operator O2230 +)
  31653. Retracting propose*predict-yes
  31654. -->
  31655. (O2229 ^name predict-yes +)
  31656. (S1 ^operator O2229 +)
  31657. Retracting elaborate*reward*based*on*reward
  31658. -->
  31659. (R1118 ^value 1 +)
  31660. (R1 ^reward R1118 +)
  31661. Retracting elaborate*copy-dir-to-output-link
  31662. -->
  31663. (I3 ^dir U +)
  31664. Retracting rl*prefer*rvt*predict-no*H0*2
  31665. -->
  31666. (S1 ^operator O2230 = 1.)
  31667. Retracting rl*prefer*rvt*predict-yes*H0*1
  31668. -->
  31669. (S1 ^operator O2229 = 0.)
  31670. =>WM: (15684: S1 ^operator O2232 +)
  31671. =>WM: (15683: S1 ^operator O2231 +)
  31672. =>WM: (15682: I3 ^dir L)
  31673. =>WM: (15681: O2232 ^name predict-no)
  31674. =>WM: (15680: O2231 ^name predict-yes)
  31675. =>WM: (15679: R1119 ^value 1)
  31676. =>WM: (15678: R1 ^reward R1119)
  31677. <=WM: (15669: S1 ^operator O2229 +)
  31678. <=WM: (15670: S1 ^operator O2230 +)
  31679. <=WM: (15671: S1 ^operator O2230)
  31680. <=WM: (15642: I3 ^dir U)
  31681. <=WM: (15665: R1 ^reward R1118)
  31682. <=WM: (15668: O2230 ^name predict-no)
  31683. <=WM: (15667: O2229 ^name predict-yes)
  31684. <=WM: (15666: R1118 ^value 1)
  31685. --- Inner Elaboration Phase, active level 1 (S1) ---
  31686. Firing prefer*rvt*predict-yes*H0
  31687. -->
  31688. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  31689. -->
  31690. (S1 ^operator O2231 = 0.6091755191206203)
  31691. Firing rl*prefer*rvt*predict-yes*H0*3
  31692. -->
  31693. (S1 ^operator O2231 = 0.3907758702770224)
  31694. Firing prefer*rvt*predict-yes*H0*3*H1
  31695. -->
  31696. Firing prefer*rvt*predict-no*H0
  31697. -->
  31698. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  31699. -->
  31700. (S1 ^operator O2232 = -0.1984300550322165)
  31701. Firing rl*prefer*rvt*predict-no*H0*4
  31702. -->
  31703. (S1 ^operator O2232 = 0.3145153576266763)
  31704. Firing prefer*rvt*predict-no*H0*4*H1
  31705. -->
  31706. inner elaboration loop at bottom goal.
  31707. Retracting rl*prefer*rvt*predict-no*H0*4
  31708. -->
  31709. (S1 ^operator O2230 = 0.3145153576266763)
  31710. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  31711. -->
  31712. (S1 ^operator O2230 = -0.1984300550322165)
  31713. Retracting rl*prefer*rvt*predict-yes*H0*3
  31714. -->
  31715. (S1 ^operator O2229 = 0.3907758702770224)
  31716. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  31717. -->
  31718. (S1 ^operator O2229 = 0.6091755191206203)
  31719. --- END Proposal Phase ---
  31720. --- Decision Phase ---
  31721. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  31722. =>WM: (15685: S1 ^operator O2231)
  31723. 1116: O: O2231 (predict-yes)
  31724. --- END Decision Phase ---
  31725. --- Application Phase ---
  31726. --- Firing Productions (PE) For State At Depth 1 ---
  31727. --- Inner Elaboration Phase, active level 1 (S1) ---
  31728. Firing apply*operator
  31729. -->
  31730. (I3 ^predict-yes N1116 + :O )
  31731. Firing apply*operator*complete
  31732. -->
  31733. (I3 ^predict-no N1115 - :O )
  31734. inner elaboration loop at bottom goal.
  31735. --- Change Working Memory (PE) ---
  31736. =>WM: (15686: I3 ^predict-yes N1116)
  31737. <=WM: (15673: N1115 ^status complete)
  31738. <=WM: (15672: I3 ^predict-no N1115)
  31739. --- Firing Productions (IE) For State At Depth 1 ---
  31740. --- Inner Elaboration Phase, active level 1 (S1) ---
  31741. Firing monitor*world
  31742. -->
  31743. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  31744. --- Change Working Memory (IE) ---
  31745. --- END Application Phase ---
  31746. --- Output Phase ---
  31747. ENV: Agent did: predict-yes for direction L in state State-B
  31748. In State-B moving L
  31749. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  31750. predict error 0
  31751. dir: dir isU
  31752. --- END Output Phase ---
  31753. \-/--- Input Phase ---
  31754. =>WM: (15690: I2 ^dir U)
  31755. =>WM: (15689: I2 ^reward 1)
  31756. =>WM: (15688: I2 ^see 1)
  31757. =>WM: (15687: N1116 ^status complete)
  31758. <=WM: (15676: I2 ^dir L)
  31759. <=WM: (15675: I2 ^reward 1)
  31760. <=WM: (15674: I2 ^see 0)
  31761. =>WM: (15691: I2 ^level-1 L1-root)
  31762. <=WM: (15677: I2 ^level-1 R0-root)
  31763. --- END Input Phase ---
  31764. --- Proposal Phase ---
  31765. --- Inner Elaboration Phase, active level 1 (S1) ---
  31766. Firing elaborate*copy-see-to-output-link
  31767. -->
  31768. (I3 ^see 1 +)
  31769. Firing elaborate*reward*based*on*reward
  31770. -->
  31771. (R1120 ^value 1 +)
  31772. (R1 ^reward R1120 +)
  31773. Firing propose*predict-yes
  31774. -->
  31775. (O2233 ^name predict-yes +)
  31776. (S1 ^operator O2233 +)
  31777. Firing propose*predict-no
  31778. -->
  31779. (O2234 ^name predict-no +)
  31780. (S1 ^operator O2234 +)
  31781. Firing rl*prefer*rvt*predict-no*H0*2
  31782. -->
  31783. (S1 ^operator O2232 = 1.)
  31784. Firing rl*prefer*rvt*predict-yes*H0*1
  31785. -->
  31786. (S1 ^operator O2231 = 0.)
  31787. Firing prefer*rvt*predict-yes*H0
  31788. -->
  31789. Firing prefer*rvt*predict-no*H0
  31790. -->
  31791. Firing elaborate*copy-dir-to-output-link
  31792. -->
  31793. (I3 ^dir U +)
  31794. inner elaboration loop at bottom goal.
  31795. Retracting elaborate*copy-see-to-output-link
  31796. -->
  31797. (I3 ^see 0 +)
  31798. Retracting propose*predict-no
  31799. -->
  31800. (O2232 ^name predict-no +)
  31801. (S1 ^operator O2232 +)
  31802. Retracting propose*predict-yes
  31803. -->
  31804. (O2231 ^name predict-yes +)
  31805. (S1 ^operator O2231 +)
  31806. Retracting elaborate*reward*based*on*reward
  31807. -->
  31808. (R1119 ^value 1 +)
  31809. (R1 ^reward R1119 +)
  31810. Retracting elaborate*copy-dir-to-output-link
  31811. -->
  31812. (I3 ^dir L +)
  31813. Retracting rl*prefer*rvt*predict-no*H0*4
  31814. -->
  31815. (S1 ^operator O2232 = 0.3145153576266763)
  31816. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  31817. -->
  31818. (S1 ^operator O2232 = -0.1984300550322165)
  31819. Retracting rl*prefer*rvt*predict-yes*H0*3
  31820. -->
  31821. (S1 ^operator O2231 = 0.3907758702770224)
  31822. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  31823. -->
  31824. (S1 ^operator O2231 = 0.6091755191206203)
  31825. =>WM: (15699: S1 ^operator O2234 +)
  31826. =>WM: (15698: S1 ^operator O2233 +)
  31827. =>WM: (15697: I3 ^dir U)
  31828. =>WM: (15696: O2234 ^name predict-no)
  31829. =>WM: (15695: O2233 ^name predict-yes)
  31830. =>WM: (15694: R1120 ^value 1)
  31831. =>WM: (15693: R1 ^reward R1120)
  31832. =>WM: (15692: I3 ^see 1)
  31833. <=WM: (15683: S1 ^operator O2231 +)
  31834. <=WM: (15685: S1 ^operator O2231)
  31835. <=WM: (15684: S1 ^operator O2232 +)
  31836. <=WM: (15682: I3 ^dir L)
  31837. <=WM: (15678: R1 ^reward R1119)
  31838. <=WM: (15637: I3 ^see 0)
  31839. <=WM: (15681: O2232 ^name predict-no)
  31840. <=WM: (15680: O2231 ^name predict-yes)
  31841. <=WM: (15679: R1119 ^value 1)
  31842. --- Inner Elaboration Phase, active level 1 (S1) ---
  31843. Firing prefer*rvt*predict-yes*H0
  31844. -->
  31845. Firing rl*prefer*rvt*predict-yes*H0*1
  31846. -->
  31847. (S1 ^operator O2233 = 0.)
  31848. Firing prefer*rvt*predict-no*H0
  31849. -->
  31850. Firing rl*prefer*rvt*predict-no*H0*2
  31851. -->
  31852. (S1 ^operator O2234 = 1.)
  31853. inner elaboration loop at bottom goal.
  31854. Retracting rl*prefer*rvt*predict-no*H0*2
  31855. -->
  31856. (S1 ^operator O2232 = 1.)
  31857. Retracting rl*prefer*rvt*predict-yes*H0*1
  31858. -->
  31859. (S1 ^operator O2231 = 0.)
  31860. --- END Proposal Phase ---
  31861. --- Decision Phase ---
  31862. RL update rl*prefer*rvt*predict-yes*H0*3 0.472322 -0.0815462 0.390776 -> 0.472325 -0.0815456 0.39078(R,m,v=1,0.95082,0.0470186)
  31863. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.527637 0.081539 0.609176 -> 0.52764 0.0815397 0.60918(R,m,v=1,1,0)
  31864. =>WM: (15700: S1 ^operator O2234)
  31865. 1117: O: O2234 (predict-no)
  31866. --- END Decision Phase ---
  31867. --- Application Phase ---
  31868. --- Firing Productions (PE) For State At Depth 1 ---
  31869. --- Inner Elaboration Phase, active level 1 (S1) ---
  31870. Firing apply*operator
  31871. -->
  31872. (I3 ^predict-no N1117 + :O )
  31873. Firing apply*operator*complete
  31874. -->
  31875. (I3 ^predict-yes N1116 - :O )
  31876. inner elaboration loop at bottom goal.
  31877. --- Change Working Memory (PE) ---
  31878. =>WM: (15701: I3 ^predict-no N1117)
  31879. <=WM: (15687: N1116 ^status complete)
  31880. <=WM: (15686: I3 ^predict-yes N1116)
  31881. --- Firing Productions (IE) For State At Depth 1 ---
  31882. --- Inner Elaboration Phase, active level 1 (S1) ---
  31883. Firing monitor*world
  31884. -->
  31885. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  31886. --- Change Working Memory (IE) ---
  31887. --- END Application Phase ---
  31888. --- Output Phase ---
  31889. ENV: Agent did: predict-no for direction U in state State-A
  31890. In State-A moving U
  31891. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  31892. predict error 0
  31893. dir: dir isL
  31894. --- END Output Phase ---
  31895. |\--- Input Phase ---
  31896. =>WM: (15705: I2 ^dir L)
  31897. =>WM: (15704: I2 ^reward 1)
  31898. =>WM: (15703: I2 ^see 0)
  31899. =>WM: (15702: N1117 ^status complete)
  31900. <=WM: (15690: I2 ^dir U)
  31901. <=WM: (15689: I2 ^reward 1)
  31902. <=WM: (15688: I2 ^see 1)
  31903. =>WM: (15706: I2 ^level-1 L1-root)
  31904. <=WM: (15691: I2 ^level-1 L1-root)
  31905. --- END Input Phase ---
  31906. --- Proposal Phase ---
  31907. --- Inner Elaboration Phase, active level 1 (S1) ---
  31908. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  31909. -->
  31910. (S1 ^operator O2233 = -0.2062723012911647)
  31911. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  31912. -->
  31913. (S1 ^operator O2234 = 0.6855101468046794)
  31914. Firing prefer*rvt*predict-no*H0*4*H1
  31915. -->
  31916. Firing prefer*rvt*predict-yes*H0*3*H1
  31917. -->
  31918. Firing elaborate*copy-see-to-output-link
  31919. -->
  31920. (I3 ^see 0 +)
  31921. Firing elaborate*reward*based*on*reward
  31922. -->
  31923. (R1121 ^value 1 +)
  31924. (R1 ^reward R1121 +)
  31925. Firing propose*predict-yes
  31926. -->
  31927. (O2235 ^name predict-yes +)
  31928. (S1 ^operator O2235 +)
  31929. Firing propose*predict-no
  31930. -->
  31931. (O2236 ^name predict-no +)
  31932. (S1 ^operator O2236 +)
  31933. Firing rl*prefer*rvt*predict-no*H0*4
  31934. -->
  31935. (S1 ^operator O2234 = 0.3145153576266763)
  31936. Firing rl*prefer*rvt*predict-yes*H0*3
  31937. -->
  31938. (S1 ^operator O2233 = 0.3907797844980353)
  31939. Firing prefer*rvt*predict-yes*H0
  31940. -->
  31941. Firing prefer*rvt*predict-no*H0
  31942. -->
  31943. Firing elaborate*copy-dir-to-output-link
  31944. -->
  31945. (I3 ^dir L +)
  31946. inner elaboration loop at bottom goal.
  31947. Retracting elaborate*copy-see-to-output-link
  31948. -->
  31949. (I3 ^see 1 +)
  31950. Retracting propose*predict-no
  31951. -->
  31952. (O2234 ^name predict-no +)
  31953. (S1 ^operator O2234 +)
  31954. Retracting propose*predict-yes
  31955. -->
  31956. (O2233 ^name predict-yes +)
  31957. (S1 ^operator O2233 +)
  31958. Retracting elaborate*reward*based*on*reward
  31959. -->
  31960. (R1120 ^value 1 +)
  31961. (R1 ^reward R1120 +)
  31962. Retracting elaborate*copy-dir-to-output-link
  31963. -->
  31964. (I3 ^dir U +)
  31965. Retracting rl*prefer*rvt*predict-no*H0*2
  31966. -->
  31967. (S1 ^operator O2234 = 1.)
  31968. Retracting rl*prefer*rvt*predict-yes*H0*1
  31969. -->
  31970. (S1 ^operator O2233 = 0.)
  31971. =>WM: (15714: S1 ^operator O2236 +)
  31972. =>WM: (15713: S1 ^operator O2235 +)
  31973. =>WM: (15712: I3 ^dir L)
  31974. =>WM: (15711: O2236 ^name predict-no)
  31975. =>WM: (15710: O2235 ^name predict-yes)
  31976. =>WM: (15709: R1121 ^value 1)
  31977. =>WM: (15708: R1 ^reward R1121)
  31978. =>WM: (15707: I3 ^see 0)
  31979. <=WM: (15698: S1 ^operator O2233 +)
  31980. <=WM: (15699: S1 ^operator O2234 +)
  31981. <=WM: (15700: S1 ^operator O2234)
  31982. <=WM: (15697: I3 ^dir U)
  31983. <=WM: (15693: R1 ^reward R1120)
  31984. <=WM: (15692: I3 ^see 1)
  31985. <=WM: (15696: O2234 ^name predict-no)
  31986. <=WM: (15695: O2233 ^name predict-yes)
  31987. <=WM: (15694: R1120 ^value 1)
  31988. --- Inner Elaboration Phase, active level 1 (S1) ---
  31989. Firing prefer*rvt*predict-yes*H0
  31990. -->
  31991. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  31992. -->
  31993. (S1 ^operator O2235 = -0.2062723012911647)
  31994. Firing rl*prefer*rvt*predict-yes*H0*3
  31995. -->
  31996. (S1 ^operator O2235 = 0.3907797844980353)
  31997. Firing prefer*rvt*predict-yes*H0*3*H1
  31998. -->
  31999. Firing prefer*rvt*predict-no*H0
  32000. -->
  32001. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  32002. -->
  32003. (S1 ^operator O2236 = 0.6855101468046794)
  32004. Firing rl*prefer*rvt*predict-no*H0*4
  32005. -->
  32006. (S1 ^operator O2236 = 0.3145153576266763)
  32007. Firing prefer*rvt*predict-no*H0*4*H1
  32008. -->
  32009. inner elaboration loop at bottom goal.
  32010. Retracting rl*prefer*rvt*predict-no*H0*4
  32011. -->
  32012. (S1 ^operator O2234 = 0.3145153576266763)
  32013. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  32014. -->
  32015. (S1 ^operator O2234 = 0.6855101468046794)
  32016. Retracting rl*prefer*rvt*predict-yes*H0*3
  32017. -->
  32018. (S1 ^operator O2233 = 0.3907797844980353)
  32019. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  32020. -->
  32021. (S1 ^operator O2233 = -0.2062723012911647)
  32022. --- END Proposal Phase ---
  32023. --- Decision Phase ---
  32024. RL update rl*prefer*rvt*predict-no*H0*2 1 0 1 -> 1 0 1(R,m,v=1,1,0)
  32025. =>WM: (15715: S1 ^operator O2236)
  32026. 1118: O: O2236 (predict-no)
  32027. --- END Decision Phase ---
  32028. --- Application Phase ---
  32029. --- Firing Productions (PE) For State At Depth 1 ---
  32030. --- Inner Elaboration Phase, active level 1 (S1) ---
  32031. Firing apply*operator
  32032. -->
  32033. (I3 ^predict-no N1118 + :O )
  32034. Firing apply*operator*complete
  32035. -->
  32036. (I3 ^predict-no N1117 - :O )
  32037. inner elaboration loop at bottom goal.
  32038. --- Change Working Memory (PE) ---
  32039. =>WM: (15716: I3 ^predict-no N1118)
  32040. <=WM: (15702: N1117 ^status complete)
  32041. <=WM: (15701: I3 ^predict-no N1117)
  32042. --- Firing Productions (IE) For State At Depth 1 ---
  32043. --- Inner Elaboration Phase, active level 1 (S1) ---
  32044. Firing monitor*world
  32045. -->
  32046. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32047. --- Change Working Memory (IE) ---
  32048. --- END Application Phase ---
  32049. --- Output Phase ---
  32050. ENV: Agent did: predict-no for direction L in state State-A
  32051. In State-A moving L
  32052. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  32053. predict error 0
  32054. dir: dir isR
  32055. --- END Output Phase ---
  32056. -/|--- Input Phase ---
  32057. =>WM: (15720: I2 ^dir R)
  32058. =>WM: (15719: I2 ^reward 1)
  32059. =>WM: (15718: I2 ^see 0)
  32060. =>WM: (15717: N1118 ^status complete)
  32061. <=WM: (15705: I2 ^dir L)
  32062. <=WM: (15704: I2 ^reward 1)
  32063. <=WM: (15703: I2 ^see 0)
  32064. =>WM: (15721: I2 ^level-1 L0-root)
  32065. <=WM: (15706: I2 ^level-1 L1-root)
  32066. --- END Input Phase ---
  32067. --- Proposal Phase ---
  32068. --- Inner Elaboration Phase, active level 1 (S1) ---
  32069. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  32070. -->
  32071. (S1 ^operator O2235 = 0.8784001883287573)
  32072. Firing prefer*rvt*predict-yes*H0*5*H1
  32073. -->
  32074. Firing elaborate*copy-see-to-output-link
  32075. -->
  32076. (I3 ^see 0 +)
  32077. Firing elaborate*reward*based*on*reward
  32078. -->
  32079. (R1122 ^value 1 +)
  32080. (R1 ^reward R1122 +)
  32081. Firing propose*predict-yes
  32082. -->
  32083. (O2237 ^name predict-yes +)
  32084. (S1 ^operator O2237 +)
  32085. Firing propose*predict-no
  32086. -->
  32087. (O2238 ^name predict-no +)
  32088. (S1 ^operator O2238 +)
  32089. Firing rl*prefer*rvt*predict-no*H0*6
  32090. -->
  32091. (S1 ^operator O2236 = 0.9526196166066165)
  32092. Firing rl*prefer*rvt*predict-yes*H0*5
  32093. -->
  32094. (S1 ^operator O2235 = 0.1215965720455857)
  32095. Firing prefer*rvt*predict-yes*H0
  32096. -->
  32097. Firing prefer*rvt*predict-no*H0
  32098. -->
  32099. Firing elaborate*copy-dir-to-output-link
  32100. -->
  32101. (I3 ^dir R +)
  32102. inner elaboration loop at bottom goal.
  32103. Retracting elaborate*copy-see-to-output-link
  32104. -->
  32105. (I3 ^see 0 +)
  32106. Retracting propose*predict-no
  32107. -->
  32108. (O2236 ^name predict-no +)
  32109. (S1 ^operator O2236 +)
  32110. Retracting propose*predict-yes
  32111. -->
  32112. (O2235 ^name predict-yes +)
  32113. (S1 ^operator O2235 +)
  32114. Retracting elaborate*reward*based*on*reward
  32115. -->
  32116. (R1121 ^value 1 +)
  32117. (R1 ^reward R1121 +)
  32118. Retracting elaborate*copy-dir-to-output-link
  32119. -->
  32120. (I3 ^dir L +)
  32121. Retracting rl*prefer*rvt*predict-no*H0*4
  32122. -->
  32123. (S1 ^operator O2236 = 0.3145153576266763)
  32124. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  32125. -->
  32126. (S1 ^operator O2236 = 0.6855101468046794)
  32127. Retracting rl*prefer*rvt*predict-yes*H0*3
  32128. -->
  32129. (S1 ^operator O2235 = 0.3907797844980353)
  32130. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  32131. -->
  32132. (S1 ^operator O2235 = -0.2062723012911647)
  32133. =>WM: (15728: S1 ^operator O2238 +)
  32134. =>WM: (15727: S1 ^operator O2237 +)
  32135. =>WM: (15726: I3 ^dir R)
  32136. =>WM: (15725: O2238 ^name predict-no)
  32137. =>WM: (15724: O2237 ^name predict-yes)
  32138. =>WM: (15723: R1122 ^value 1)
  32139. =>WM: (15722: R1 ^reward R1122)
  32140. <=WM: (15713: S1 ^operator O2235 +)
  32141. <=WM: (15714: S1 ^operator O2236 +)
  32142. <=WM: (15715: S1 ^operator O2236)
  32143. <=WM: (15712: I3 ^dir L)
  32144. <=WM: (15708: R1 ^reward R1121)
  32145. <=WM: (15711: O2236 ^name predict-no)
  32146. <=WM: (15710: O2235 ^name predict-yes)
  32147. <=WM: (15709: R1121 ^value 1)
  32148. --- Inner Elaboration Phase, active level 1 (S1) ---
  32149. Firing prefer*rvt*predict-yes*H0
  32150. -->
  32151. Firing rl*prefer*rvt*predict-yes*H0*5
  32152. -->
  32153. (S1 ^operator O2237 = 0.1215965720455857)
  32154. Firing prefer*rvt*predict-yes*H0*5*H1
  32155. -->
  32156. Firing rl*prefer*rvt*predict-yes*H0*5*H1*14
  32157. -->
  32158. (S1 ^operator O2237 = 0.8784001883287573)
  32159. Firing prefer*rvt*predict-no*H0
  32160. -->
  32161. Firing rl*prefer*rvt*predict-no*H0*6
  32162. -->
  32163. (S1 ^operator O2238 = 0.9526196166066165)
  32164. inner elaboration loop at bottom goal.
  32165. Retracting rl*prefer*rvt*predict-no*H0*6
  32166. -->
  32167. (S1 ^operator O2236 = 0.9526196166066165)
  32168. Retracting rl*prefer*rvt*predict-yes*H0*5
  32169. -->
  32170. (S1 ^operator O2235 = 0.1215965720455857)
  32171. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  32172. -->
  32173. (S1 ^operator O2235 = 0.8784001883287573)
  32174. --- END Proposal Phase ---
  32175. --- Decision Phase ---
  32176. RL update rl*prefer*rvt*predict-no*H0*4 0.478563 -0.164047 0.314515 -> 0.478561 -0.164047 0.314513(R,m,v=1,0.931818,0.0638961)
  32177. RL update rl*prefer*rvt*predict-no*H0*4*H1*10 0.521461 0.16405 0.68551 -> 0.521458 0.164049 0.685508(R,m,v=1,1,0)
  32178. =>WM: (15729: S1 ^operator O2237)
  32179. 1119: O: O2237 (predict-yes)
  32180. --- END Decision Phase ---
  32181. --- Application Phase ---
  32182. --- Firing Productions (PE) For State At Depth 1 ---
  32183. --- Inner Elaboration Phase, active level 1 (S1) ---
  32184. Firing apply*operator
  32185. -->
  32186. (I3 ^predict-yes N1119 + :O )
  32187. Firing apply*operator*complete
  32188. -->
  32189. (I3 ^predict-no N1118 - :O )
  32190. inner elaboration loop at bottom goal.
  32191. --- Change Working Memory (PE) ---
  32192. =>WM: (15730: I3 ^predict-yes N1119)
  32193. <=WM: (15717: N1118 ^status complete)
  32194. <=WM: (15716: I3 ^predict-no N1118)
  32195. --- Firing Productions (IE) For State At Depth 1 ---
  32196. --- Inner Elaboration Phase, active level 1 (S1) ---
  32197. Firing monitor*world
  32198. -->
  32199. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32200. --- Change Working Memory (IE) ---
  32201. --- END Application Phase ---
  32202. --- Output Phase ---
  32203. ENV: Agent did: predict-yes for direction R in state State-A
  32204. In State-A moving R
  32205. ENV: (next state, see, prediction correct?) = (State-B, 1, True)
  32206. predict error 0
  32207. dir: dir isR
  32208. --- END Output Phase ---
  32209. \---- Input Phase ---
  32210. =>WM: (15734: I2 ^dir R)
  32211. =>WM: (15733: I2 ^reward 1)
  32212. =>WM: (15732: I2 ^see 1)
  32213. =>WM: (15731: N1119 ^status complete)
  32214. <=WM: (15720: I2 ^dir R)
  32215. <=WM: (15719: I2 ^reward 1)
  32216. <=WM: (15718: I2 ^see 0)
  32217. =>WM: (15735: I2 ^level-1 R1-root)
  32218. <=WM: (15721: I2 ^level-1 L0-root)
  32219. --- END Input Phase ---
  32220. --- Proposal Phase ---
  32221. --- Inner Elaboration Phase, active level 1 (S1) ---
  32222. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  32223. -->
  32224. (S1 ^operator O2237 = -0.04253361215288998)
  32225. Firing prefer*rvt*predict-yes*H0*5*H1
  32226. -->
  32227. Firing elaborate*copy-see-to-output-link
  32228. -->
  32229. (I3 ^see 1 +)
  32230. Firing elaborate*reward*based*on*reward
  32231. -->
  32232. (R1123 ^value 1 +)
  32233. (R1 ^reward R1123 +)
  32234. Firing propose*predict-yes
  32235. -->
  32236. (O2239 ^name predict-yes +)
  32237. (S1 ^operator O2239 +)
  32238. Firing propose*predict-no
  32239. -->
  32240. (O2240 ^name predict-no +)
  32241. (S1 ^operator O2240 +)
  32242. Firing rl*prefer*rvt*predict-no*H0*6
  32243. -->
  32244. (S1 ^operator O2238 = 0.9526196166066165)
  32245. Firing rl*prefer*rvt*predict-yes*H0*5
  32246. -->
  32247. (S1 ^operator O2237 = 0.1215965720455857)
  32248. Firing prefer*rvt*predict-yes*H0
  32249. -->
  32250. Firing prefer*rvt*predict-no*H0
  32251. -->
  32252. Firing elaborate*copy-dir-to-output-link
  32253. -->
  32254. (I3 ^dir R +)
  32255. inner elaboration loop at bottom goal.
  32256. Retracting elaborate*copy-see-to-output-link
  32257. -->
  32258. (I3 ^see 0 +)
  32259. Retracting propose*predict-no
  32260. -->
  32261. (O2238 ^name predict-no +)
  32262. (S1 ^operator O2238 +)
  32263. Retracting propose*predict-yes
  32264. -->
  32265. (O2237 ^name predict-yes +)
  32266. (S1 ^operator O2237 +)
  32267. Retracting elaborate*reward*based*on*reward
  32268. -->
  32269. (R1122 ^value 1 +)
  32270. (R1 ^reward R1122 +)
  32271. Retracting elaborate*copy-dir-to-output-link
  32272. -->
  32273. (I3 ^dir R +)
  32274. Retracting rl*prefer*rvt*predict-no*H0*6
  32275. -->
  32276. (S1 ^operator O2238 = 0.9526196166066165)
  32277. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*14
  32278. -->
  32279. (S1 ^operator O2237 = 0.8784001883287573)
  32280. Retracting rl*prefer*rvt*predict-yes*H0*5
  32281. -->
  32282. (S1 ^operator O2237 = 0.1215965720455857)
  32283. =>WM: (15742: S1 ^operator O2240 +)
  32284. =>WM: (15741: S1 ^operator O2239 +)
  32285. =>WM: (15740: O2240 ^name predict-no)
  32286. =>WM: (15739: O2239 ^name predict-yes)
  32287. =>WM: (15738: R1123 ^value 1)
  32288. =>WM: (15737: R1 ^reward R1123)
  32289. =>WM: (15736: I3 ^see 1)
  32290. <=WM: (15727: S1 ^operator O2237 +)
  32291. <=WM: (15729: S1 ^operator O2237)
  32292. <=WM: (15728: S1 ^operator O2238 +)
  32293. <=WM: (15722: R1 ^reward R1122)
  32294. <=WM: (15707: I3 ^see 0)
  32295. <=WM: (15725: O2238 ^name predict-no)
  32296. <=WM: (15724: O2237 ^name predict-yes)
  32297. <=WM: (15723: R1122 ^value 1)
  32298. --- Inner Elaboration Phase, active level 1 (S1) ---
  32299. Firing prefer*rvt*predict-yes*H0
  32300. -->
  32301. Firing rl*prefer*rvt*predict-yes*H0*5
  32302. -->
  32303. (S1 ^operator O2239 = 0.1215965720455857)
  32304. Firing prefer*rvt*predict-yes*H0*5*H1
  32305. -->
  32306. Firing rl*prefer*rvt*predict-yes*H0*5*H1*15
  32307. -->
  32308. (S1 ^operator O2239 = -0.04253361215288998)
  32309. Firing prefer*rvt*predict-no*H0
  32310. -->
  32311. Firing rl*prefer*rvt*predict-no*H0*6
  32312. -->
  32313. (S1 ^operator O2240 = 0.9526196166066165)
  32314. inner elaboration loop at bottom goal.
  32315. Retracting rl*prefer*rvt*predict-no*H0*6
  32316. -->
  32317. (S1 ^operator O2238 = 0.9526196166066165)
  32318. Retracting rl*prefer*rvt*predict-yes*H0*5
  32319. -->
  32320. (S1 ^operator O2237 = 0.1215965720455857)
  32321. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  32322. -->
  32323. (S1 ^operator O2237 = -0.04253361215288998)
  32324. --- END Proposal Phase ---
  32325. --- Decision Phase ---
  32326. RL update rl*prefer*rvt*predict-yes*H0*5 0.534523 -0.412926 0.121597 -> 0.534523 -0.412926 0.121597(R,m,v=1,0.879397,0.106594)
  32327. RL update rl*prefer*rvt*predict-yes*H0*5*H1*14 0.465474 0.412926 0.8784 -> 0.465475 0.412926 0.8784(R,m,v=1,1,0)
  32328. =>WM: (15743: S1 ^operator O2240)
  32329. 1120: O: O2240 (predict-no)
  32330. --- END Decision Phase ---
  32331. --- Application Phase ---
  32332. --- Firing Productions (PE) For State At Depth 1 ---
  32333. --- Inner Elaboration Phase, active level 1 (S1) ---
  32334. Firing apply*operator
  32335. -->
  32336. (I3 ^predict-no N1120 + :O )
  32337. Firing apply*operator*complete
  32338. -->
  32339. (I3 ^predict-yes N1119 - :O )
  32340. inner elaboration loop at bottom goal.
  32341. --- Change Working Memory (PE) ---
  32342. =>WM: (15744: I3 ^predict-no N1120)
  32343. <=WM: (15731: N1119 ^status complete)
  32344. <=WM: (15730: I3 ^predict-yes N1119)
  32345. --- Firing Productions (IE) For State At Depth 1 ---
  32346. --- Inner Elaboration Phase, active level 1 (S1) ---
  32347. Firing monitor*world
  32348. -->
  32349. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32350. --- Change Working Memory (IE) ---
  32351. --- END Application Phase ---
  32352. --- Output Phase ---
  32353. ENV: Agent did: predict-no for direction R in state State-B
  32354. In State-B moving R
  32355. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32356. predict error 0
  32357. dir: dir isR
  32358. --- END Output Phase ---
  32359. /|\--- Input Phase ---
  32360. =>WM: (15748: I2 ^dir R)
  32361. =>WM: (15747: I2 ^reward 1)
  32362. =>WM: (15746: I2 ^see 0)
  32363. =>WM: (15745: N1120 ^status complete)
  32364. <=WM: (15734: I2 ^dir R)
  32365. <=WM: (15733: I2 ^reward 1)
  32366. <=WM: (15732: I2 ^see 1)
  32367. =>WM: (15749: I2 ^level-1 R0-root)
  32368. <=WM: (15735: I2 ^level-1 R1-root)
  32369. --- END Input Phase ---
  32370. --- Proposal Phase ---
  32371. --- Inner Elaboration Phase, active level 1 (S1) ---
  32372. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  32373. -->
  32374. (S1 ^operator O2239 = -0.1512366769350551)
  32375. Firing prefer*rvt*predict-yes*H0*5*H1
  32376. -->
  32377. Firing elaborate*copy-see-to-output-link
  32378. -->
  32379. (I3 ^see 0 +)
  32380. Firing elaborate*reward*based*on*reward
  32381. -->
  32382. (R1124 ^value 1 +)
  32383. (R1 ^reward R1124 +)
  32384. Firing propose*predict-yes
  32385. -->
  32386. (O2241 ^name predict-yes +)
  32387. (S1 ^operator O2241 +)
  32388. Firing propose*predict-no
  32389. -->
  32390. (O2242 ^name predict-no +)
  32391. (S1 ^operator O2242 +)
  32392. Firing rl*prefer*rvt*predict-no*H0*6
  32393. -->
  32394. (S1 ^operator O2240 = 0.9526196166066165)
  32395. Firing rl*prefer*rvt*predict-yes*H0*5
  32396. -->
  32397. (S1 ^operator O2239 = 0.1215968294322646)
  32398. Firing prefer*rvt*predict-yes*H0
  32399. -->
  32400. Firing prefer*rvt*predict-no*H0
  32401. -->
  32402. Firing elaborate*copy-dir-to-output-link
  32403. -->
  32404. (I3 ^dir R +)
  32405. inner elaboration loop at bottom goal.
  32406. Retracting elaborate*copy-see-to-output-link
  32407. -->
  32408. (I3 ^see 1 +)
  32409. Retracting propose*predict-no
  32410. -->
  32411. (O2240 ^name predict-no +)
  32412. (S1 ^operator O2240 +)
  32413. Retracting propose*predict-yes
  32414. -->
  32415. (O2239 ^name predict-yes +)
  32416. (S1 ^operator O2239 +)
  32417. Retracting elaborate*reward*based*on*reward
  32418. -->
  32419. (R1123 ^value 1 +)
  32420. (R1 ^reward R1123 +)
  32421. Retracting elaborate*copy-dir-to-output-link
  32422. -->
  32423. (I3 ^dir R +)
  32424. Retracting rl*prefer*rvt*predict-no*H0*6
  32425. -->
  32426. (S1 ^operator O2240 = 0.9526196166066165)
  32427. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*15
  32428. -->
  32429. (S1 ^operator O2239 = -0.04253361215288998)
  32430. Retracting rl*prefer*rvt*predict-yes*H0*5
  32431. -->
  32432. (S1 ^operator O2239 = 0.1215968294322646)
  32433. =>WM: (15756: S1 ^operator O2242 +)
  32434. =>WM: (15755: S1 ^operator O2241 +)
  32435. =>WM: (15754: O2242 ^name predict-no)
  32436. =>WM: (15753: O2241 ^name predict-yes)
  32437. =>WM: (15752: R1124 ^value 1)
  32438. =>WM: (15751: R1 ^reward R1124)
  32439. =>WM: (15750: I3 ^see 0)
  32440. <=WM: (15741: S1 ^operator O2239 +)
  32441. <=WM: (15742: S1 ^operator O2240 +)
  32442. <=WM: (15743: S1 ^operator O2240)
  32443. <=WM: (15737: R1 ^reward R1123)
  32444. <=WM: (15736: I3 ^see 1)
  32445. <=WM: (15740: O2240 ^name predict-no)
  32446. <=WM: (15739: O2239 ^name predict-yes)
  32447. <=WM: (15738: R1123 ^value 1)
  32448. --- Inner Elaboration Phase, active level 1 (S1) ---
  32449. Firing prefer*rvt*predict-yes*H0
  32450. -->
  32451. Firing rl*prefer*rvt*predict-yes*H0*5
  32452. -->
  32453. (S1 ^operator O2241 = 0.1215968294322646)
  32454. Firing prefer*rvt*predict-yes*H0*5*H1
  32455. -->
  32456. Firing rl*prefer*rvt*predict-yes*H0*5*H1*7
  32457. -->
  32458. (S1 ^operator O2241 = -0.1512366769350551)
  32459. Firing prefer*rvt*predict-no*H0
  32460. -->
  32461. Firing rl*prefer*rvt*predict-no*H0*6
  32462. -->
  32463. (S1 ^operator O2242 = 0.9526196166066165)
  32464. inner elaboration loop at bottom goal.
  32465. Retracting rl*prefer*rvt*predict-no*H0*6
  32466. -->
  32467. (S1 ^operator O2240 = 0.9526196166066165)
  32468. Retracting rl*prefer*rvt*predict-yes*H0*5
  32469. -->
  32470. (S1 ^operator O2239 = 0.1215968294322646)
  32471. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  32472. -->
  32473. (S1 ^operator O2239 = -0.1512366769350551)
  32474. --- END Proposal Phase ---
  32475. --- Decision Phase ---
  32476. RL update rl*prefer*rvt*predict-no*H0*6 0.95262 0 0.95262 -> 0.960173 0 0.960173(R,m,v=1,0.938462,0.0580492)
  32477. =>WM: (15757: S1 ^operator O2242)
  32478. 1121: O: O2242 (predict-no)
  32479. --- END Decision Phase ---
  32480. --- Application Phase ---
  32481. --- Firing Productions (PE) For State At Depth 1 ---
  32482. --- Inner Elaboration Phase, active level 1 (S1) ---
  32483. Firing apply*operator
  32484. -->
  32485. (I3 ^predict-no N1121 + :O )
  32486. Firing apply*operator*complete
  32487. -->
  32488. (I3 ^predict-no N1120 - :O )
  32489. inner elaboration loop at bottom goal.
  32490. --- Change Working Memory (PE) ---
  32491. =>WM: (15758: I3 ^predict-no N1121)
  32492. <=WM: (15745: N1120 ^status complete)
  32493. <=WM: (15744: I3 ^predict-no N1120)
  32494. --- Firing Productions (IE) For State At Depth 1 ---
  32495. --- Inner Elaboration Phase, active level 1 (S1) ---
  32496. Firing monitor*world
  32497. -->
  32498. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32499. --- Change Working Memory (IE) ---
  32500. --- END Application Phase ---
  32501. --- Output Phase ---
  32502. ENV: Agent did: predict-no for direction R in state State-B
  32503. In State-B moving R
  32504. ENV: (next state, see, prediction correct?) = (State-B, 0, True)
  32505. predict error 0
  32506. dir: dir isL
  32507. --- END Output Phase ---
  32508. ---- Input Phase ---
  32509. =>WM: (15762: I2 ^dir L)
  32510. =>WM: (15761: I2 ^reward 1)
  32511. =>WM: (15760: I2 ^see 0)
  32512. =>WM: (15759: N1121 ^status complete)
  32513. <=WM: (15748: I2 ^dir R)
  32514. <=WM: (15747: I2 ^reward 1)
  32515. <=WM: (15746: I2 ^see 0)
  32516. =>WM: (15763: I2 ^level-1 R0-root)
  32517. <=WM: (15749: I2 ^level-1 R0-root)
  32518. --- END Input Phase ---
  32519. --- Proposal Phase ---
  32520. --- Inner Elaboration Phase, active level 1 (S1) ---
  32521. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  32522. -->
  32523. (S1 ^operator O2242 = -0.1984300550322165)
  32524. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  32525. -->
  32526. (S1 ^operator O2241 = 0.6091799658293192)
  32527. Firing prefer*rvt*predict-no*H0*4*H1
  32528. -->
  32529. Firing prefer*rvt*predict-yes*H0*3*H1
  32530. -->
  32531. Firing elaborate*copy-see-to-output-link
  32532. -->
  32533. (I3 ^see 0 +)
  32534. Firing elaborate*reward*based*on*reward
  32535. -->
  32536. (R1125 ^value 1 +)
  32537. (R1 ^reward R1125 +)
  32538. Firing propose*predict-yes
  32539. -->
  32540. (O2243 ^name predict-yes +)
  32541. (S1 ^operator O2243 +)
  32542. Firing propose*predict-no
  32543. -->
  32544. (O2244 ^name predict-no +)
  32545. (S1 ^operator O2244 +)
  32546. Firing rl*prefer*rvt*predict-no*H0*4
  32547. -->
  32548. (S1 ^operator O2242 = 0.3145132909791186)
  32549. Firing rl*prefer*rvt*predict-yes*H0*3
  32550. -->
  32551. (S1 ^operator O2241 = 0.3907797844980353)
  32552. Firing prefer*rvt*predict-yes*H0
  32553. -->
  32554. Firing prefer*rvt*predict-no*H0
  32555. -->
  32556. Firing elaborate*copy-dir-to-output-link
  32557. -->
  32558. (I3 ^dir L +)
  32559. inner elaboration loop at bottom goal.
  32560. Retracting elaborate*copy-see-to-output-link
  32561. -->
  32562. (I3 ^see 0 +)
  32563. Retracting propose*predict-no
  32564. -->
  32565. (O2242 ^name predict-no +)
  32566. (S1 ^operator O2242 +)
  32567. Retracting propose*predict-yes
  32568. -->
  32569. (O2241 ^name predict-yes +)
  32570. (S1 ^operator O2241 +)
  32571. Retracting elaborate*reward*based*on*reward
  32572. -->
  32573. (R1124 ^value 1 +)
  32574. (R1 ^reward R1124 +)
  32575. Retracting elaborate*copy-dir-to-output-link
  32576. -->
  32577. (I3 ^dir R +)
  32578. Retracting rl*prefer*rvt*predict-no*H0*6
  32579. -->
  32580. (S1 ^operator O2242 = 0.9601726831979733)
  32581. Retracting rl*prefer*rvt*predict-yes*H0*5*H1*7
  32582. -->
  32583. (S1 ^operator O2241 = -0.1512366769350551)
  32584. Retracting rl*prefer*rvt*predict-yes*H0*5
  32585. -->
  32586. (S1 ^operator O2241 = 0.1215968294322646)
  32587. =>WM: (15770: S1 ^operator O2244 +)
  32588. =>WM: (15769: S1 ^operator O2243 +)
  32589. =>WM: (15768: I3 ^dir L)
  32590. =>WM: (15767: O2244 ^name predict-no)
  32591. =>WM: (15766: O2243 ^name predict-yes)
  32592. =>WM: (15765: R1125 ^value 1)
  32593. =>WM: (15764: R1 ^reward R1125)
  32594. <=WM: (15755: S1 ^operator O2241 +)
  32595. <=WM: (15756: S1 ^operator O2242 +)
  32596. <=WM: (15757: S1 ^operator O2242)
  32597. <=WM: (15726: I3 ^dir R)
  32598. <=WM: (15751: R1 ^reward R1124)
  32599. <=WM: (15754: O2242 ^name predict-no)
  32600. <=WM: (15753: O2241 ^name predict-yes)
  32601. <=WM: (15752: R1124 ^value 1)
  32602. --- Inner Elaboration Phase, active level 1 (S1) ---
  32603. Firing prefer*rvt*predict-yes*H0
  32604. -->
  32605. Firing rl*prefer*rvt*predict-yes*H0*3*H1*9
  32606. -->
  32607. (S1 ^operator O2243 = 0.6091799658293192)
  32608. Firing rl*prefer*rvt*predict-yes*H0*3
  32609. -->
  32610. (S1 ^operator O2243 = 0.3907797844980353)
  32611. Firing prefer*rvt*predict-yes*H0*3*H1
  32612. -->
  32613. Firing prefer*rvt*predict-no*H0
  32614. -->
  32615. Firing rl*prefer*rvt*predict-no*H0*4*H1*8
  32616. -->
  32617. (S1 ^operator O2244 = -0.1984300550322165)
  32618. Firing rl*prefer*rvt*predict-no*H0*4
  32619. -->
  32620. (S1 ^operator O2244 = 0.3145132909791186)
  32621. Firing prefer*rvt*predict-no*H0*4*H1
  32622. -->
  32623. inner elaboration loop at bottom goal.
  32624. Retracting rl*prefer*rvt*predict-no*H0*4
  32625. -->
  32626. (S1 ^operator O2242 = 0.3145132909791186)
  32627. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  32628. -->
  32629. (S1 ^operator O2242 = -0.1984300550322165)
  32630. Retracting rl*prefer*rvt*predict-yes*H0*3
  32631. -->
  32632. (S1 ^operator O2241 = 0.3907797844980353)
  32633. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  32634. -->
  32635. (S1 ^operator O2241 = 0.6091799658293192)
  32636. --- END Proposal Phase ---
  32637. --- Decision Phase ---
  32638. RL update rl*prefer*rvt*predict-no*H0*6 0.960173 0 0.960173 -> 0.966517 0 0.966517(R,m,v=1,0.938776,0.0577708)
  32639. =>WM: (15771: S1 ^operator O2243)
  32640. 1122: O: O2243 (predict-yes)
  32641. --- END Decision Phase ---
  32642. --- Application Phase ---
  32643. --- Firing Productions (PE) For State At Depth 1 ---
  32644. --- Inner Elaboration Phase, active level 1 (S1) ---
  32645. Firing apply*operator
  32646. -->
  32647. (I3 ^predict-yes N1122 + :O )
  32648. Firing apply*operator*complete
  32649. -->
  32650. (I3 ^predict-no N1121 - :O )
  32651. inner elaboration loop at bottom goal.
  32652. --- Change Working Memory (PE) ---
  32653. =>WM: (15772: I3 ^predict-yes N1122)
  32654. <=WM: (15759: N1121 ^status complete)
  32655. <=WM: (15758: I3 ^predict-no N1121)
  32656. --- Firing Productions (IE) For State At Depth 1 ---
  32657. --- Inner Elaboration Phase, active level 1 (S1) ---
  32658. Firing monitor*world
  32659. -->
  32660. I see 1 and I'm going to do: predict-yes inner elaboration loop at bottom goal.
  32661. --- Change Working Memory (IE) ---
  32662. --- END Application Phase ---
  32663. --- Output Phase ---
  32664. ENV: Agent did: predict-yes for direction L in state State-B
  32665. In State-B moving L
  32666. ENV: (next state, see, prediction correct?) = (State-A, 1, True)
  32667. predict error 0
  32668. dir: dir isL
  32669. --- END Output Phase ---
  32670. --- Input Phase ---
  32671. =>WM: (15776: I2 ^dir L)
  32672. =>WM: (15775: I2 ^reward 1)
  32673. =>WM: (15774: I2 ^see 1)
  32674. =>WM: (15773: N1122 ^status complete)
  32675. <=WM: (15762: I2 ^dir L)
  32676. <=WM: (15761: I2 ^reward 1)
  32677. <=WM: (15760: I2 ^see 0)
  32678. =>WM: (15777: I2 ^level-1 L1-root)
  32679. <=WM: (15763: I2 ^level-1 R0-root)
  32680. --- END Input Phase ---
  32681. --- Proposal Phase ---
  32682. --- Inner Elaboration Phase, active level 1 (S1) ---
  32683. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  32684. -->
  32685. (S1 ^operator O2243 = -0.2062723012911647)
  32686. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  32687. -->
  32688. (S1 ^operator O2244 = 0.6855078088135349)
  32689. Firing prefer*rvt*predict-no*H0*4*H1
  32690. -->
  32691. Firing prefer*rvt*predict-yes*H0*3*H1
  32692. -->
  32693. Firing elaborate*copy-see-to-output-link
  32694. -->
  32695. (I3 ^see 1 +)
  32696. Firing elaborate*reward*based*on*reward
  32697. -->
  32698. (R1126 ^value 1 +)
  32699. (R1 ^reward R1126 +)
  32700. Firing propose*predict-yes
  32701. -->
  32702. (O2245 ^name predict-yes +)
  32703. (S1 ^operator O2245 +)
  32704. Firing propose*predict-no
  32705. -->
  32706. (O2246 ^name predict-no +)
  32707. (S1 ^operator O2246 +)
  32708. Firing rl*prefer*rvt*predict-no*H0*4
  32709. -->
  32710. (S1 ^operator O2244 = 0.3145132909791186)
  32711. Firing rl*prefer*rvt*predict-yes*H0*3
  32712. -->
  32713. (S1 ^operator O2243 = 0.3907797844980353)
  32714. Firing prefer*rvt*predict-yes*H0
  32715. -->
  32716. Firing prefer*rvt*predict-no*H0
  32717. -->
  32718. Firing elaborate*copy-dir-to-output-link
  32719. -->
  32720. (I3 ^dir L +)
  32721. inner elaboration loop at bottom goal.
  32722. Retracting elaborate*copy-see-to-output-link
  32723. -->
  32724. (I3 ^see 0 +)
  32725. Retracting propose*predict-no
  32726. -->
  32727. (O2244 ^name predict-no +)
  32728. (S1 ^operator O2244 +)
  32729. Retracting propose*predict-yes
  32730. -->
  32731. (O2243 ^name predict-yes +)
  32732. (S1 ^operator O2243 +)
  32733. Retracting elaborate*reward*based*on*reward
  32734. -->
  32735. (R1125 ^value 1 +)
  32736. (R1 ^reward R1125 +)
  32737. Retracting elaborate*copy-dir-to-output-link
  32738. -->
  32739. (I3 ^dir L +)
  32740. Retracting rl*prefer*rvt*predict-no*H0*4
  32741. -->
  32742. (S1 ^operator O2244 = 0.3145132909791186)
  32743. Retracting rl*prefer*rvt*predict-no*H0*4*H1*8
  32744. -->
  32745. (S1 ^operator O2244 = -0.1984300550322165)
  32746. Retracting rl*prefer*rvt*predict-yes*H0*3
  32747. -->
  32748. (S1 ^operator O2243 = 0.3907797844980353)
  32749. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*9
  32750. -->
  32751. (S1 ^operator O2243 = 0.6091799658293192)
  32752. =>WM: (15784: S1 ^operator O2246 +)
  32753. =>WM: (15783: S1 ^operator O2245 +)
  32754. =>WM: (15782: O2246 ^name predict-no)
  32755. =>WM: (15781: O2245 ^name predict-yes)
  32756. =>WM: (15780: R1126 ^value 1)
  32757. =>WM: (15779: R1 ^reward R1126)
  32758. =>WM: (15778: I3 ^see 1)
  32759. <=WM: (15769: S1 ^operator O2243 +)
  32760. <=WM: (15771: S1 ^operator O2243)
  32761. <=WM: (15770: S1 ^operator O2244 +)
  32762. <=WM: (15764: R1 ^reward R1125)
  32763. <=WM: (15750: I3 ^see 0)
  32764. <=WM: (15767: O2244 ^name predict-no)
  32765. <=WM: (15766: O2243 ^name predict-yes)
  32766. <=WM: (15765: R1125 ^value 1)
  32767. --- Inner Elaboration Phase, active level 1 (S1) ---
  32768. Firing prefer*rvt*predict-yes*H0
  32769. -->
  32770. Firing rl*prefer*rvt*predict-yes*H0*3
  32771. -->
  32772. (S1 ^operator O2245 = 0.3907797844980353)
  32773. Firing prefer*rvt*predict-yes*H0*3*H1
  32774. -->
  32775. Firing rl*prefer*rvt*predict-yes*H0*3*H1*11
  32776. -->
  32777. (S1 ^operator O2245 = -0.2062723012911647)
  32778. Firing prefer*rvt*predict-no*H0
  32779. -->
  32780. Firing rl*prefer*rvt*predict-no*H0*4
  32781. -->
  32782. (S1 ^operator O2246 = 0.3145132909791186)
  32783. Firing prefer*rvt*predict-no*H0*4*H1
  32784. -->
  32785. Firing rl*prefer*rvt*predict-no*H0*4*H1*10
  32786. -->
  32787. (S1 ^operator O2246 = 0.6855078088135349)
  32788. inner elaboration loop at bottom goal.
  32789. Retracting rl*prefer*rvt*predict-no*H0*4
  32790. -->
  32791. (S1 ^operator O2244 = 0.3145132909791186)
  32792. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  32793. -->
  32794. (S1 ^operator O2244 = 0.6855078088135349)
  32795. Retracting rl*prefer*rvt*predict-yes*H0*3
  32796. -->
  32797. (S1 ^operator O2243 = 0.3907797844980353)
  32798. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  32799. -->
  32800. (S1 ^operator O2243 = -0.2062723012911647)
  32801. --- END Proposal Phase ---
  32802. --- Decision Phase ---
  32803. RL update rl*prefer*rvt*predict-yes*H0*3 0.472325 -0.0815456 0.39078 -> 0.472328 -0.0815451 0.390783(R,m,v=1,0.951087,0.0467748)
  32804. RL update rl*prefer*rvt*predict-yes*H0*3*H1*9 0.52764 0.0815397 0.60918 -> 0.527643 0.0815402 0.609184(R,m,v=1,1,0)
  32805. =>WM: (15785: S1 ^operator O2246)
  32806. 1123: O: O2246 (predict-no)
  32807. --- END Decision Phase ---
  32808. --- Application Phase ---
  32809. --- Firing Productions (PE) For State At Depth 1 ---
  32810. --- Inner Elaboration Phase, active level 1 (S1) ---
  32811. Firing apply*operator
  32812. -->
  32813. (I3 ^predict-no N1123 + :O )
  32814. Firing apply*operator*complete
  32815. -->
  32816. (I3 ^predict-yes N1122 - :O )
  32817. inner elaboration loop at bottom goal.
  32818. --- Change Working Memory (PE) ---
  32819. =>WM: (15786: I3 ^predict-no N1123)
  32820. <=WM: (15773: N1122 ^status complete)
  32821. <=WM: (15772: I3 ^predict-yes N1122)
  32822. --- Firing Productions (IE) For State At Depth 1 ---
  32823. --- Inner Elaboration Phase, active level 1 (S1) ---
  32824. Firing monitor*world
  32825. -->
  32826. I see 1 and I'm going to do: predict-no inner elaboration loop at bottom goal.
  32827. --- Change Working Memory (IE) ---
  32828. --- END Application Phase ---
  32829. --- Output Phase ---
  32830. ENV: Agent did: predict-no for direction L in state State-A
  32831. In State-A moving L
  32832. ENV: (next state, see, prediction correct?) = (State-A, 0, True)
  32833. predict error 0
  32834. dir: dir isL
  32835. --- END Output Phase ---
  32836. --- Input Phase ---
  32837. =>WM: (15790: I2 ^dir L)
  32838. =>WM: (15789: I2 ^reward 1)
  32839. =>WM: (15788: I2 ^see 0)
  32840. =>WM: (15787: N1123 ^status complete)
  32841. <=WM: (15776: I2 ^dir L)
  32842. <=WM: (15775: I2 ^reward 1)
  32843. <=WM: (15774: I2 ^see 1)
  32844. =>WM: (15791: I2 ^level-1 L0-root)
  32845. <=WM: (15777: I2 ^level-1 L1-root)
  32846. --- END Input Phase ---
  32847. --- Proposal Phase ---
  32848. --- Inner Elaboration Phase, active level 1 (S1) ---
  32849. Firing rl*prefer*rvt*predict-yes*H0*3*H1*13
  32850. -->
  32851. (S1 ^operator O2245 = -0.208713043145708)
  32852. Firing rl*prefer*rvt*predict-no*H0*4*H1*12
  32853. -->
  32854. (S1 ^operator O2246 = 0.6854688057424099)
  32855. Firing prefer*rvt*predict-no*H0*4*H1
  32856. -->
  32857. Firing prefer*rvt*predict-yes*H0*3*H1
  32858. -->
  32859. Firing elaborate*copy-see-to-output-link
  32860. -->
  32861. (I3 ^see 0 +)
  32862. Firing elaborate*reward*based*on*reward
  32863. -->
  32864. (R1127 ^value 1 +)
  32865. (R1 ^reward R1127 +)
  32866. Firing propose*predict-yes
  32867. -->
  32868. (O2247 ^name predict-yes +)
  32869. (S1 ^operator O2247 +)
  32870. Firing propose*predict-no
  32871. -->
  32872. (O2248 ^name predict-no +)
  32873. (S1 ^operator O2248 +)
  32874. Firing rl*prefer*rvt*predict-no*H0*4
  32875. -->
  32876. (S1 ^operator O2246 = 0.3145132909791186)
  32877. Firing rl*prefer*rvt*predict-yes*H0*3
  32878. -->
  32879. (S1 ^operator O2245 = 0.3907830226387189)
  32880. Firing prefer*rvt*predict-yes*H0
  32881. -->
  32882. Firing prefer*rvt*predict-no*H0
  32883. -->
  32884. Firing elaborate*copy-dir-to-output-link
  32885. -->
  32886. (I3 ^dir L +)
  32887. inner elaboration loop at bottom goal.
  32888. Retracting elaborate*copy-see-to-output-link
  32889. -->
  32890. (I3 ^see 1 +)
  32891. Retracting propose*predict-no
  32892. -->
  32893. (O2246 ^name predict-no +)
  32894. (S1 ^operator O2246 +)
  32895. Retracting propose*predict-yes
  32896. -->
  32897. (O2245 ^name predict-yes +)
  32898. (S1 ^operator O2245 +)
  32899. Retracting elaborate*reward*based*on*reward
  32900. -->
  32901. (R1126 ^value 1 +)
  32902. (R1 ^reward R1126 +)
  32903. Retracting elaborate*copy-dir-to-output-link
  32904. -->
  32905. (I3 ^dir L +)
  32906. Retracting rl*prefer*rvt*predict-no*H0*4*H1*10
  32907. -->
  32908. (S1 ^operator O2246 = 0.6855078088135349)
  32909. Retracting rl*prefer*rvt*predict-no*H0*4
  32910. -->
  32911. (S1 ^operator O2246 = 0.3145132909791186)
  32912. Retracting rl*prefer*rvt*predict-yes*H0*3*H1*11
  32913. -->
  32914. (S1 ^operator O2245 = -0.2062723012911647)
  32915. Retracting rl*prefer*rvt*predict-yes*H0*3
  32916. -->
  32917. (S1 ^operator O2245 = 0.3907830226387189)
  32918. =>WM: (15798: S1 ^operator O2248 +)
  32919. =>WM: (15797: S1 ^operator O2247 +)
  32920. =>WM: (15796: O2248 ^name predict-no)
  32921. =>WM: (15795: O2247 ^name predict-yes)
  32922. =>WM: (15794: R1127 ^value 1)
  32923. =>WM: (15793: R1 ^reward R1127)
  32924. =>WM: (15792: I3 ^see 0)
  32925. <=WM: (15783: S1 ^operator O2245 +)
  32926. <=WM: (15784: S1 ^operator O2246 +)
  32927. <=WM: (15785: S1 ^operator O2246)
  32928. <=WM: (15779: R1 ^reward R1126)
  32929. <=WM: (15778: I3 ^see 1)
  32930. <=WM: (15782: O2246 ^name predict-no)
  32931. <=WM: (15781: O2245 ^name predict-yes)
  32932. <=WM: (15780: R1126 ^value 1)
  32933. --- Inner Elaborati