From: Further advances on Bayesian Ying-Yang harmony learning
Year | Outcomes |
---|---|
1998 | The following convex combination with 0≤η≤1 is heuristically proposed (1−η)K L(p(Y|X)p(X)∥q(Y|R)q(Y))−η H(θ),(A) as a criterion for model selection, e.g. see Eq. (49) in Xu (1998a) and Eq. (22) in Xu (1998b). The above equation (A) can be rewritten into a format that is exactly equivalent to H _{ L }(θ)=(1+η)H(θ)+η E _{ Y|X } in Equation 17. |
2000 | It is further proposed to make maxθ H _{ L }(θ) with η>0 monotonically decreased from a big value (i.e. remove the constraint η≤1), see Eq. (23) in Xu (2000a), which is further addressed for learning Gaussian mixture in Xu (2001a), e.g. see paragraphs around its Eq. (42) and Eq. (43). |
2003 | The above equation (A) has been also reexamined from a perspective of the KL- η-HL spectrum, with details referred to Eqs. (62-64) in (Xu 2003a). |