• 基于高斯混合模型的分布拟合与参数估计:高维复数域的详细推导


    基于高斯混合模型的分布拟合

    实际中,我们不知道一些样本对应的真实分布。比如我们手上有 N p N_p Np个样本 { h n } n = 1 N p \{ \boldsymbol h_n \}_{n=1}^{N_p} {hn}n=1Np,这些样本都对应到某个特定的label,比如特定的数字或者类。令真实的分布为 f ( h ) f(\boldsymbol h) f(h),我们可以通过经验产生大量样本来近似该分布。定义 f f f的近似为 p ( h ) p(\boldsymbol h) p(h),并且限制分布 p p p为Gaussian Mixture, i.e.
    p ( h ) = ∑ k = 1 K α k C N ( h ; μ k , C k ) (20) p \left (\boldsymbol h \right) = \sum_{k=1}^K \alpha_k \mathcal{CN}(\boldsymbol h; \boldsymbol \mu_k, \boldsymbol{C}_k) \tag{20} p(h)=k=1KαkCN(h;μk,Ck)(20)

    其中 h ∈ C L × 1 \boldsymbol h \in \mathbb{C}^{L \times 1} hCL×1 μ k ∈ C L × 1 \boldsymbol{\mu_k} \in \mathbb{C}^{L \times 1} μkCL×1 C k ∈ C L × L \boldsymbol{C}_k \in \mathbb{C}^{L \times L} CkCL×L。假设我们固定第 p p p个标识label,然后经验生成 N p N_p Np个样本,记为 { h n } n = 1 N p \{ \boldsymbol h_n \}_{n=1}^{N_p} {hn}n=1Np,我们要最大化似然函数:
    max ⁡ { α k , μ k , C k } ln ⁡ ( ∏ n = 1 N p ∑ k = 1 K α k C N ( h n ; μ k , C k ) ) = ∑ n = 1 N p ln ⁡ ( ∑ k = 1 K α k C N ( h n ; μ k , C k ) ) (21)

    max{αk,μk,Ck}ln(n=1Npk=1KαkCN(hn;μk,Ck))=n=1Npln(k=1KαkCN(hn;μk,Ck))" role="presentation">max{αk,μk,Ck}ln(n=1Npk=1KαkCN(hn;μk,Ck))=n=1Npln(k=1KαkCN(hn;μk,Ck))
    \tag{21} {αk,μk,Ck}max=lnn=1Npk=1KαkCN(hn;μk,Ck)n=1Npln(k=1KαkCN(hn;μk,Ck))(21)

    我们采用EM算法进行相关参数的估计。正式计算之前,先引入一个隐变量 z ∈ { 1 , ⋯   , K } z \in \{1,\cdots,K\} z{1,,K}表征GMM中的第几个模(mode)。当 K → ∞ K \rightarrow \infty K时,我们认为GMM能够很好地逼近原始分布。那么反过来,等效地来看,引入的隐变量 z z z可以解释为:先根据隐变量 z z z的先验 p ( z = k ) p(z=k) p(z=k)确定第 k k k个mode,再基于似然 p ( h n ∣ z n = k ) p(\boldsymbol h_n|z^n=k) p(hnzn=k),由第 k k k个mode来产生相应的样本生成(sample)。根据相关概率公式,我们进一步描述其概率意义:
    k-th mode:  p ( h n , z n = k ) = p ( z n = k ) p ( h n ∣ z n = k ) all modes:  p ( h n ) = ∑ z p ( h n , z n = k ) = ∑ z p ( z n = k ) p ( h n ∣ z n = k ) posterior of k-th mode:  p ( z n = k ∣ h n ) = p ( z n = k ) p ( h n ∣ z n = k ) p ( h n ) = p ( z n = k ) p ( h n ∣ z n = k ) ∑ z p ( z n = k ) p ( h n ∣ z n = k )

    k-th mode: p(hn,zn=k)=p(zn=k)p(hn|zn=k)all modes: p(hn)=zp(hn,zn=k)=zp(zn=k)p(hn|zn=k)posterior of k-th mode: p(zn=k|hn)=p(zn=k)p(hn|zn=k)p(hn)=p(zn=k)p(hn|zn=k)zp(zn=k)p(hn|zn=k)" role="presentation">k-th mode: p(hn,zn=k)=p(zn=k)p(hn|zn=k)all modes: p(hn)=zp(hn,zn=k)=zp(zn=k)p(hn|zn=k)posterior of k-th mode: p(zn=k|hn)=p(zn=k)p(hn|zn=k)p(hn)=p(zn=k)p(hn|zn=k)zp(zn=k)p(hn|zn=k)
    k-th mode: all modes: posterior of k-th mode: p(hn,zn=k)=p(zn=k)p(hnzn=k)p(hn)=zp(hn,zn=k)=zp(zn=k)p(hnzn=k)p(zn=khn)=p(hn)p(zn=k)p(hnzn=k)=zp(zn=k)p(hnzn=k)p(zn=k)p(hnzn=k)

    我们将EM算法的步骤描述为:对于第 t + 1 t+1 t+1次迭代
    (1) E步:固定第 t t t次迭代的参数 { α k t , μ k t , C k t } k = 1 K \{\alpha^t_k, \boldsymbol \mu^t_k, \boldsymbol{C}^t_k \}_{k=1}^K {αkt,μkt,Ckt}k=1K,对 ∀ n , k \forall n,k n,k,即第 n n n个sample,第 k k k个mode,计算后验后验分布,定义为
    γ n k t + 1 ≐ p ( z n = k ∣ h n ) = p ( z n = k ) p ( h n ∣ z n = k ) p ( h n ) = α k t C N ( h n ; μ k t , C k t ) ∑ k = 1 K α k t C N ( h n ; μ k t , C k t ) ,    ∀ n , k (22)

    γnkt+1p(zn=k|hn)=p(zn=k)p(hn|zn=k)p(hn)=αktCN(hn;μkt,Ckt)k=1KαktCN(hn;μkt,Ckt),  n,k" role="presentation">γnkt+1p(zn=k|hn)=p(zn=k)p(hn|zn=k)p(hn)=αktCN(hn;μkt,Ckt)k=1KαktCN(hn;μkt,Ckt),  n,k
    \tag{22} γnkt+1p(zn=khn)=p(hn)p(zn=k)p(hnzn=k)=k=1KαktCN(hn;μkt,Ckt)αktCN(hn;μkt,Ckt),  n,k(22)

    经过E步,我们便得到了一组后验分布 { γ n k t + 1 } n , k \{\gamma^{t+1}_{nk}\}_{n,k} {γnkt+1}n,k

    (2) M步:固定后验分布 { γ n k t + 1 } n , k \{\gamma^{t+1}_{nk}\}_{n,k} {γnkt+1}n,k,通过最大化证据下界(Evidence Lower BOund)(即对数边际似然 log ⁡ p ( h ; α , μ , C ) \log p(\boldsymbol h;\alpha, \boldsymbol \mu, \boldsymbol{C}) logp(h;α,μ,C)的下界),来估计第 t + 1 t+1 t+1次迭代的参数 { α k t + 1 , μ k t + 1 , C k t + 1 } k = 1 K \{ \alpha^{t+1}_k, \boldsymbol \mu^{t+1}_k, \boldsymbol{C}^{t+1}_k \}_{k=1}^K {αkt+1,μkt+1,Ckt+1}k=1K

         E L B O ( { γ n k t + 1 } n , k , { h n } n ; { α k t + 1 , μ k t + 1 , C k t + 1 } k = 1 K ) = ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 log ⁡ p ( h n , z n = k ) γ n k t + 1 = ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 log ⁡ p ( z n = k ) p ( h n ∣ z n = k ) γ n k t + 1 = ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 log ⁡ α k t + 1 ⋅ C N ( h n ; μ k t + 1 , C k t + 1 ) γ n k t + 1 = ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − ∥ ( C k t + 1 ) − 1 2 ( h n − μ k t + 1 ) ∥ 2 2 + log ⁡ α k t + 1 − log ⁡ ∣ C k t + 1 ∣ ) + C (23)

        ELBO({γnkt+1}n,k,{hn}n;{αkt+1,μkt+1,Ckt+1}k=1K)=n=1Npk=1Kγnkt+1logp(hn,zn=k)γnkt+1=n=1Npk=1Kγnkt+1logp(zn=k)p(hn|zn=k)γnkt+1=n=1Npk=1Kγnkt+1logαkt+1CN(hn;μkt+1,Ckt+1)γnkt+1=n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)+C" role="presentation">    ELBO({γnkt+1}n,k,{hn}n;{αkt+1,μkt+1,Ckt+1}k=1K)=n=1Npk=1Kγnkt+1logp(hn,zn=k)γnkt+1=n=1Npk=1Kγnkt+1logp(zn=k)p(hn|zn=k)γnkt+1=n=1Npk=1Kγnkt+1logαkt+1CN(hn;μkt+1,Ckt+1)γnkt+1=n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)+C
    \tag{23}     ELBO({γnkt+1}n,k,{hn}n;{αkt+1,μkt+1,Ckt+1}k=1K)=n=1Npk=1Kγnkt+1logγnkt+1p(hn,zn=k)=n=1Npk=1Kγnkt+1logγnkt+1p(zn=k)p(hnzn=k)=n=1Npk=1Kγnkt+1logγnkt+1αkt+1CN(hn;μkt+1,Ckt+1)=n=1Npk=1Kγnkt+1((Ckt+1)21(hnμkt+1)22+logαkt+1logCkt+1)+C(23)

    其中 C C C是与 { α k t + 1 , μ k t + 1 , C k t + 1 } k = 1 K \{ \alpha^{t+1}_k, \boldsymbol \mu^{t+1}_k, \boldsymbol{C}^{t+1}_k \}_{k=1}^K {αkt+1,μkt+1,Ckt+1}k=1K无关的常数。上述最大化ELBO转化为优化问题:
    arg max ⁡ { α k t + 1 , μ k t + 1 , C k t + 1 } ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − ∥ ( C k t + 1 ) − 1 2 ( h n − μ k t + 1 ) ∥ 2 2 + log ⁡ α k t + 1 − log ⁡ ∣ C k t + 1 ∣ ) s.t.  ∑ k = 1 K α k = 1 (24)

    \argmax{αkt+1,μkt+1,Ckt+1}n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)s.t. k=1Kαk=1" role="presentation">\argmax{αkt+1,μkt+1,Ckt+1}n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)s.t. k=1Kαk=1
    \tag{24} {αkt+1,μkt+1,Ckt+1}argmaxn=1Npk=1Kγnkt+1((Ckt+1)21(hnμkt+1)22+logαkt+1logCkt+1)s.t. k=1Kαk=1(24)

    利用拉格朗日函数:
    g : ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − ∥ ( C k t + 1 ) − 1 2 ( h n − μ k t + 1 ) ∥ 2 2 + log ⁡ α k t + 1 − log ⁡ ∣ C k t + 1 ∣ ) − λ ( ∑ k = 1 K α k − 1 ) (25) g: \sum_{n=1}^{N_p} \sum_{k=1}^K \gamma^{t+1}_{nk} \left ( - \left \Vert (\boldsymbol{C}^{t+1}_k)^{-\frac{1}{2}} \left ( \boldsymbol h_n - \boldsymbol{\mu}^{t+1}_k \right) \right \Vert^2_2 + \log \alpha^{t+1}_k - \log \left \vert \boldsymbol{{C}}^{t+1}_k \right \vert \right) - \lambda (\sum_{k=1}^K \alpha_k - 1) \tag{25} g:n=1Npk=1Kγnkt+1((Ckt+1)21(hnμkt+1)22+logαkt+1logCkt+1)λ(k=1Kαk1)(25)

    求关于 { α k t + 1 , μ k t + 1 , C k t + 1 } \{ \alpha^{t+1}_k, \boldsymbol \mu^{t+1}_k, \boldsymbol{C}^{t+1}_k \} {αkt+1,μkt+1,Ckt+1}的偏导数:
    ∂ g ∂ α k t + 1 = 0 ∂ g ∂ μ k t + 1 = 0 ∂ g ∂ C k t + 1 = 0 (26)

    gαkt+1=0gμkt+1=0gCkt+1=0" role="presentation">gαkt+1=0gμkt+1=0gCkt+1=0
    \tag{26} αkt+1gμkt+1gCkt+1g=0=0=0(26)

    考虑 C k = σ k 2 I \boldsymbol{C}_k=\sigma^2_k \boldsymbol{I} Ck=σk2I

    在实际计算时,为了简化,可以取 C k = σ k 2 I \boldsymbol{C}_k=\sigma^2_k \boldsymbol{I} Ck=σk2I,即等高线为圆的高斯,这时式(25)可以化简为
    g : ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − 1 σ k 2 ∥ ( h n − μ k t + 1 ) ∥ 2 2 + log ⁡ α k t + 1 − 2 N U E N B S log ⁡ σ k ) − λ ( ∑ k = 1 K α k − 1 ) g: \sum_{n=1}^{N_p} \sum_{k=1}^K \gamma^{t+1}_{nk} \left ( - \frac{1}{\sigma^2_k} \left \Vert \left (\boldsymbol h_n- \boldsymbol{\mu}^{t+1}_k \right) \right \Vert^2_2 + \log \alpha^{t+1}_k - 2 N_{UE} N_{BS} \log \sigma_k \right) - \lambda (\sum_{k=1}^K \alpha_k - 1) g:n=1Npk=1Kγnkt+1(σk21(hnμkt+1)22+logαkt+12NUENBSlogσk)λ(k=1Kαk1)

    将式(26)代入到上式
    首先对 α k \alpha_k αk求导
    ∂ g ∂ α k = ∑ n = 1 N p γ n k t + 1 ⋅ 1 α k − λ = 0 ⇒ α k = ∑ n = 1 N p γ n k t + 1 λ

    gαk=n=1Npγnkt+11αkλ=0αk=n=1Npγnkt+1λ" role="presentation">gαk=n=1Npγnkt+11αkλ=0αk=n=1Npγnkt+1λ
    αkg=n=1Npγnkt+1αk1λ=0αk=λn=1Npγnkt+1

    又因为 ∑ k = 1 K α k = 1 \sum_{k=1}^K \alpha_k = 1 k=1Kαk=1,因此 λ = N p \lambda=N_p λ=Np,故
    α k = ∑ n = 1 N p γ n k t + 1 N p \alpha_k = \frac{\sum_{n=1}^{N_p} \gamma^{t+1}_{nk}}{ N_p } αk=Npn=1Npγnkt+1

    然后对 μ k ∈ C L × 1 \boldsymbol{\mu}_k \in \mathbb{C}^{L \times 1} μkCL×1求导
    ∂ g ∂ μ k = ∑ n = 1 N p γ n k t + 1 ⋅ ( − 1 σ k 2 ) 2 ( μ k − h n ) = 0 ⇒ ∑ n = 1 N p γ n k t + 1 ⋅ ( μ k − h n ) = 0 ⇒ μ k ⋅ ∑ n = 1 N p γ n k t + 1 = ∑ n = 1 N p γ n k t + 1 h n ⇒ μ k = ∑ n = 1 N p γ n k t + 1 h n ∑ n = 1 N p γ n k t + 1

    gμk=n=1Npγnkt+1(1σk2)2(μkhn)=0n=1Npγnkt+1(μkhn)=0μkn=1Npγnkt+1=n=1Npγnkt+1hnμk=n=1Npγnkt+1hnn=1Npγnkt+1" role="presentation">gμk=n=1Npγnkt+1(1σk2)2(μkhn)=0n=1Npγnkt+1(μkhn)=0μkn=1Npγnkt+1=n=1Npγnkt+1hnμk=n=1Npγnkt+1hnn=1Npγnkt+1
    μkg=n=1Npγnkt+1(σk21)2(μkhn)=0n=1Npγnkt+1(μkhn)=0μkn=1Npγnkt+1=n=1Npγnkt+1hnμk=n=1Npγnkt+1n=1Npγnkt+1hn

    最后对 σ k \sigma_k σk求导
    ∂ g ∂ σ k = ∑ n = 1 N p γ n k t + 1 ( 1 σ k 3 ∥ ( h n − μ k t + 1 ) ∥ 2 2 − 2 N U E N B S 1 σ k ) = 0 ⇒ σ k 2 = 1 2 N U E N B S ⋅ ∑ n = 1 N p γ n k t + 1 ∥ ( h n − μ k t + 1 ) ∥ 2 2 ∑ n = 1 N p γ n k t + 1

    gσk=n=1Npγnkt+1(1σk3(hnμkt+1)222NUENBS1σk)=0σk2=12NUENBSn=1Npγnkt+1(hnμkt+1)22n=1Npγnkt+1" role="presentation">gσk=n=1Npγnkt+1(1σk3(hnμkt+1)222NUENBS1σk)=0σk2=12NUENBSn=1Npγnkt+1(hnμkt+1)22n=1Npγnkt+1
    σkg=n=1Npγnkt+1(σk31(hnμkt+1)222NUENBSσk1)=0σk2=2NUENBS1n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)22

    考虑一般的 C k \boldsymbol{C}_k Ck

    首先对 α k \alpha_k αk求导
    ∂ g ∂ α k = ∑ n = 1 N p γ n k t + 1 ⋅ 1 α k − λ = 0 ⇒ α k = ∑ n = 1 N p γ n k t + 1 λ

    gαk=n=1Npγnkt+11αkλ=0αk=n=1Npγnkt+1λ" role="presentation">gαk=n=1Npγnkt+11αkλ=0αk=n=1Npγnkt+1λ
    αkg=n=1Npγnkt+1αk1λ=0αk=λn=1Npγnkt+1

    又因为 ∑ k = 1 K α k = 1 \sum_{k=1}^K \alpha_k = 1 k=1Kαk=1,因此 λ = N p \lambda=N_p λ=Np

    然后对 μ k ∗ ∈ C L × 1 \boldsymbol{\mu}^{*}_k \in \mathbb{C}^{L \times 1} μkCL×1求导,根据公式(复矩阵求导):
    ∂ ( b H x ) ∂ x ∗ = 0 ,    ∂ ( x H b ) ∂ x ∗ = b ∂ ( x H A x ) ∂ x ∗ = A x

    (bHx)x=0,  (xHb)x=b(xHAx)x=Ax" role="presentation">(bHx)x=0,  (xHb)x=b(xHAx)x=Ax
    x(bHx)=0,  x(xHb)=bx(xHAx)=Ax

    进一步,
    ∂ g ∂ μ k ∗ = ∂ ∂ μ k ∗ ∑ n = 1 N p − γ n k t + 1 ∥ ( C k t + 1 ) − 1 2 ( h n − μ k t + 1 ) ∥ 2 2 = ∑ n = 1 N p − γ n k t + 1 ∂ ∂ μ k ∗ ( h n − μ k t + 1 ) H ( C k t + 1 ) − 1 ( h n − μ k t + 1 ) = ∑ n = 1 N p γ n k t + 1 ( C k t + 1 ) − 1 ( μ k t + 1 − h n ) = ( C k t + 1 ) − 1 ∑ n = 1 N p γ n k t + 1 ( μ k t + 1 − h n ) = 0 ⇒ μ k = ∑ n = 1 N p γ n k t + 1 h n ∑ n = 1 N p γ n k t + 1

    gμk=μkn=1Npγnkt+1(Ckt+1)12(hnμkt+1)22=n=1Npγnkt+1μk(hnμkt+1)H(Ckt+1)1(hnμkt+1)=n=1Npγnkt+1(Ckt+1)1(μkt+1hn)=(Ckt+1)1n=1Npγnkt+1(μkt+1hn)=0μk=n=1Npγnkt+1hnn=1Npγnkt+1" role="presentation">gμk=μkn=1Npγnkt+1(Ckt+1)12(hnμkt+1)22=n=1Npγnkt+1μk(hnμkt+1)H(Ckt+1)1(hnμkt+1)=n=1Npγnkt+1(Ckt+1)1(μkt+1hn)=(Ckt+1)1n=1Npγnkt+1(μkt+1hn)=0μk=n=1Npγnkt+1hnn=1Npγnkt+1
    μkgμk=μkn=1Npγnkt+1(Ckt+1)21(hnμkt+1)22=n=1Npγnkt+1μk(hnμkt+1)H(Ckt+1)1(hnμkt+1)=n=1Npγnkt+1(Ckt+1)1(μkt+1hn)=(Ckt+1)1n=1Npγnkt+1(μkt+1hn)=0=n=1Npγnkt+1n=1Npγnkt+1hn

    最后对 C k \boldsymbol C_k Ck求导,但是实际中,为了方便处理,我们是对 ( C k − 1 ) ∗ (\boldsymbol C^{-1}_k)^* (Ck1)进行求导,在求导之前,我们回顾一些基本概念。

    回顾:如果 A ∈ C N × N \boldsymbol{A} \in \mathbb{C}^{N \times N} ACN×N是正定矩阵,那么根据 det ( A ) = ∏ n λ n \text{det}(\boldsymbol{A})=\prod_{n} \lambda_n det(A)=nλn,我们可以得到 det ( A ) = det ( A ∗ ) = det ( A T ) = det ( A H ) \text{det}(\boldsymbol{A})=\text{det}(\boldsymbol{A}^*)=\text{det}(\boldsymbol{A}^T)=\text{det}(\boldsymbol{A}^H) det(A)=det(A)=det(AT)=det(AH)。另外, det ( A ) = 1 det ( A − 1 ) \text{det}(\boldsymbol{A})=\frac{1}{\text{det}(\boldsymbol{A}^{-1})} det(A)=det(A1)1。因此我们可以把拉格朗日函数 g g g重写为

    g :      ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − ∥ ( C k t + 1 ) − 1 2 ( h n − μ k t + 1 ) ∥ 2 2 + log ⁡ α k t + 1 − log ⁡ ∣ C k t + 1 ∣ ) − λ ( ∑ k = 1 K α k − 1 ) = ∑ n = 1 N p ∑ k = 1 K γ n k t + 1 ( − ( h n − μ k t + 1 ) H ( C k t + 1 ) − 1 ( h n − μ k t + 1 ) + log ⁡ α k t + 1 + log ⁡ ∣ ( ( C k t + 1 ) − 1 ) ∗ ∣ ) − λ ( ∑ k = 1 K α k − 1 )

    g:    n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)λ(k=1Kαk1)=n=1Npk=1Kγnkt+1((hnμkt+1)H(Ckt+1)1(hnμkt+1)+logαkt+1+log|((Ckt+1)1)|)λ(k=1Kαk1)" role="presentation">g:    n=1Npk=1Kγnkt+1((Ckt+1)12(hnμkt+1)22+logαkt+1log|Ckt+1|)λ(k=1Kαk1)=n=1Npk=1Kγnkt+1((hnμkt+1)H(Ckt+1)1(hnμkt+1)+logαkt+1+log|((Ckt+1)1)|)λ(k=1Kαk1)
    g:    n=1Npk=1Kγnkt+1((Ckt+1)21(hnμkt+1)22+logαkt+1logCkt+1)λ(k=1Kαk1)=n=1Npk=1Kγnkt+1((hnμkt+1)H(Ckt+1)1(hnμkt+1)+logαkt+1+log((Ckt+1)1))λ(k=1Kαk1)

    根据公式,若 A ∈ C N × N \boldsymbol{A} \in \mathbb{C}^{N \times N} ACN×N
    ∂ y H A H x ∂ A ∗ = x y H ∂ det ( A ) ∂ A = det ( A ) ⋅ ( A T ) − 1

    yHAHxA=xyHdet(A)A=det(A)(AT)1" role="presentation">yHAHxA=xyHdet(A)A=det(A)(AT)1
    AyHAHxAdet(A)=xyH=det(A)(AT)1

    ( C k − 1 ) ∗ (\boldsymbol C^{-1}_k)^* (Ck1)进行求导:
    ∂ g ∂ ( C k − 1 ) ∗ = ∑ n = 1 N p γ n k t + 1 ( − ( h n − μ k t + 1 ) ( h n − μ k t + 1 ) H + C k t + 1 ) = C k t + 1 ∑ n = 1 N p γ n k t + 1 − ∑ n = 1 N p γ n k t + 1 ( h n − μ k t + 1 ) ( h n − μ k t + 1 ) H = 0 ⇒ C k t + 1 = ∑ n = 1 N p γ n k t + 1 ( h n − μ k t + 1 ) ( h n − μ k t + 1 ) H ∑ n = 1 N p γ n k t + 1

    g(Ck1)=n=1Npγnkt+1((hnμkt+1)(hnμkt+1)H+Ckt+1)=Ckt+1n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)(hnμkt+1)H=0Ckt+1=n=1Npγnkt+1(hnμkt+1)(hnμkt+1)Hn=1Npγnkt+1" role="presentation">g(Ck1)=n=1Npγnkt+1((hnμkt+1)(hnμkt+1)H+Ckt+1)=Ckt+1n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)(hnμkt+1)H=0Ckt+1=n=1Npγnkt+1(hnμkt+1)(hnμkt+1)Hn=1Npγnkt+1
    (Ck1)gCkt+1=n=1Npγnkt+1((hnμkt+1)(hnμkt+1)H+Ckt+1)=Ckt+1n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)(hnμkt+1)H=0=n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)(hnμkt+1)H

    这与我们的直觉是一致的。

    小结:转化为参数估计问题

    如果考虑 C k = σ k 2 I \boldsymbol{C}_k=\sigma^2_k \boldsymbol{I} Ck=σk2I

    α k = ∑ n = 1 N p γ n k t + 1 N p μ k = ∑ n = 1 N p γ n k t + 1 h n ∑ n = 1 N p γ n k t + 1 σ k 2 = 1 2 N U E N B S ⋅ ∑ n = 1 N p γ n k t + 1 ∥ ( h n − μ k t + 1 ) ∥ 2 2 ∑ n = 1 N p γ n k t + 1

    αk=n=1Npγnkt+1Npμk=n=1Npγnkt+1hnn=1Npγnkt+1σk2=12NUENBSn=1Npγnkt+1(hnμkt+1)22n=1Npγnkt+1" role="presentation">αk=n=1Npγnkt+1Npμk=n=1Npγnkt+1hnn=1Npγnkt+1σk2=12NUENBSn=1Npγnkt+1(hnμkt+1)22n=1Npγnkt+1
    αkμkσk2=Npn=1Npγnkt+1=n=1Npγnkt+1n=1Npγnkt+1hn=2NUENBS1n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)22

    如果考虑一般的 C k \boldsymbol{C}_k Ck

    α k = ∑ n = 1 N p γ n k t + 1 N p μ k = ∑ n = 1 N p γ n k t + 1 h n ∑ n = 1 N p γ n k t + 1 C k = ∑ n = 1 N p γ n k t + 1 ( h n − μ k t + 1 ) ( h n − μ k t + 1 ) H ∑ n = 1 N p γ n k t + 1

    αk=n=1Npγnkt+1Npμk=n=1Npγnkt+1hnn=1Npγnkt+1Ck=n=1Npγnkt+1(hnμkt+1)(hnμkt+1)Hn=1Npγnkt+1" role="presentation">αk=n=1Npγnkt+1Npμk=n=1Npγnkt+1hnn=1Npγnkt+1Ck=n=1Npγnkt+1(hnμkt+1)(hnμkt+1)Hn=1Npγnkt+1
    αkμkCk=Npn=1Npγnkt+1=n=1Npγnkt+1n=1Npγnkt+1hn=n=1Npγnkt+1n=1Npγnkt+1(hnμkt+1)(hnμkt+1)H

  • 相关阅读:
    Java 版 spring cloud 工程系统管理 +二次开发 工程项目管理系统源码
    【云原生之k8s】Kubeadm搭建K8S
    MQTT,如何在SpringBoot中使用MQTT实现消息的订阅和发布
    基于springboot企业客户信息反馈平台设计与实现的源码+文档
    Bean的生命周期
    ESP 特权隔离机制介绍
    MVC模式认识
    WPF XAML中使用依赖属性
    猿创征文|枚举类增强型for循环遍历
    k8s--基础--25.4--Helm--部署
  • 原文地址:https://blog.csdn.net/weixin_43413559/article/details/128093833