Search
Search Funnelback University
- Refined by:
- Date: 2018
Did you mean apc53 |u:mi.eng.cam.ac.uk?
1 -
10 of
61
search results for KA :PC53 |u:mi.eng.cam.ac.uk
where 0
match all words and 61
match some words.
Results that match 1 of 2 words
-
Reward Estimation for Dialogue Policy Optimisation Pei-Hao Su, Milica …
mi.eng.cam.ac.uk/~sjy/papers/sugy18.pdf20 Feb 2018: kA(a,a′). The. policy is optimised using an algorithm called GP-SARSA [7, 45] in which theQ-function is updated by calculating the posterior given the collected belief-action pairs ... The summary action kernel is defined as:. kA(a,a′) = δa(a. ′) -
On-line Active Reward Learning for Policy Optimisationin Spoken…
mi.eng.cam.ac.uk/~sjy/papers/sgmb16.pdf20 Feb 2018: Zhang and Chaudhuri2015] Chicheng Zhang and Ka-malika Chaudhuri. 2015. Active learning fromweak and strong labelers. -
is-05-hvs6_final
mi.eng.cam.ac.uk/~sjy/papers/seyo05.pdf20 Feb 2018: 211. 2121. 1. aannna. annnnn. ka. rd (13). Here nr specifies the number of events that occurred r times and a fixed discounting factor was used if they are zero. -
hierParsing.dvi
mi.eng.cam.ac.uk/~sjy/papers/heyo03a.pdf20 Feb 2018: bbj¢&}M"@"jj}j[&}(¢}M|ªjb}a{(|&}(}M¢¤&Ka"}Xl|&|s¡"&}¡}bj¤&|@b&¤&@}"|&}b|&bj5¢@¤[@¤&}"¤|&| b9}&}M¤@j@b -
POLICY COMMITTEE FOR ADAPTATION IN MULTI-DOMAIN SPOKEN…
mi.eng.cam.ac.uk/~sjy/papers/gmsv15.pdf20 Feb 2018: Q(b,a) GP (0,k((b,a), (b,a))) (2). where the kernel k(, ) is factored into separate kernels overbelief and action spaces kB(b,b′)kA(a,a′). ... kA(a,a′) = δa(a. ′) (7). where δa(a′) = 1 iff a = a′, 0 otherwise. -
Dialogue manager domain adaptation using Gaussian process…
mi.eng.cam.ac.uk/~sjy/papers/gmrs17.pdf20 Feb 2018: kA(a,a′). For a training sequence of belief state-action pairs B = [(b0,a0),. , ... kA(a,a′) = δa(a. ′) (6). where δa(a′) = 1 iff a = a′, 0 otherwise. -
DISTRIBUTED DIALOGUE POLICIES FOR MULTI-DOMAIN STATISTICAL…
mi.eng.cam.ac.uk/~sjy/papers/gkty15a.pdf20 Feb 2018: and action spaces kB(b,b′)kA(a,a′). ... kA(a,a′) = δa(a. ′) (5). where δa(a′) = 1 iff a = a′, 0 otherwise. -
DISTRIBUTED DIALOGUE POLICIES FOR MULTI-DOMAIN STATISTICAL…
mi.eng.cam.ac.uk/~sjy/papers/gkty15.pdf20 Feb 2018: and action spaces kB(b,b′)kA(a,a′). ... kA(a,a′) = δa(a. ′) (5). where δa(a′) = 1 iff a = a′, 0 otherwise. -
Incremental on-line adaptation of POMDP-based dialogue managers…
mi.eng.cam.ac.uk/~sjy/papers/gktb14.pdf20 Feb 2018: kA(a,a′) = δa(a. ′), (7). where δa(a′) = 1 iff a = a′, 0 otherwise.4.2. ... The kernelfunction between two sets of actions is. kA(aB,aE) = δaB(a. -
Optimisation for POMDP-based Spoken Dialogue Systems M. Gašić, F.…
mi.eng.cam.ac.uk/~sjy/papers/gjty12.pdf20 Feb 2018: Q(b,a) GP (0,k((b,a), (b,a))). (41). The kernel k(, ) is often factored into separate kernels over the belief state and actionspaces kB(b,
Search history
Recently clicked results
Recently clicked results
Your click history is empty.
Recent searches
Recent searches
Your search history is empty.