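A technique that improves retrieval effectiveness by using a thesaurus to match terms related to the query (query expansion).
A thesaurus lookup is not effective for expanding a query such as tropical fish.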
Estimating the terms related to tropical fish with the Dice coefficient:
tropical fish -> [stores, pictures, live, sale, types, clipart, blue, freshwater, aquarium, supplies]
筑波大 (University of Tsukuba) -> Tsukuba City, Ibaraki, Kanto, Japan, etc.
real -> real estate
real -> real madrid
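As a rough illustration of how related terms like those above might be estimated, the following Python sketch counts co-occurrences and scores word pairs with the Dice coefficient, 2·n(a ∧ b)/(n(a) + n(b)). The function names `dice_scores` and `related_terms` and the toy windows are assumptions made for this example. Single words stand in for phrases such as tropical fish.

```python
from collections import Counter
from itertools import combinations


def dice_scores(windows):
    """Return {(a, b): Dice coefficient} for every word pair that co-occurs."""
    n = Counter()       # n[x]: number of windows containing word x
    n_pair = Counter()  # n_pair[(a, b)]: number of windows containing both a and b
    for window in windows:
        words = sorted(set(window))  # count each word once per window
        n.update(words)
        n_pair.update(combinations(words, 2))
    return {(a, b): 2 * c / (n[a] + n[b]) for (a, b), c in n_pair.items()}


def related_terms(term, windows, k=3):
    """Return the k words with the highest Dice coefficient against term."""
    ranked = []
    for (a, b), score in dice_scores(windows).items():
        if a == term:
            ranked.append((score, b))
        elif b == term:
            ranked.append((score, a))
    # Highest score first; ties are broken alphabetically.
    ranked.sort(key=lambda pair: (-pair[0], pair[1]))
    return [word for _, word in ranked[:k]]


# Toy windows (made up for illustration, not taken from a real corpus).
windows = [
    ["tropical", "fish", "aquarium"],
    ["tropical", "fish", "freshwater"],
    ["fish", "aquarium", "supplies"],
    ["tropical", "storm"],
]

print(related_terms("fish", windows))  # ['aquarium', 'tropical', 'freshwater']
```

With real data, the windows would come from the document collection or a query log, and the ranking would be only as good as those counts.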
Dice coefficient: 2·n(a ∧ b) / (n(a) + n(b)), where n(x) is the number of windows (or documents) that contain word x.
(defun ld (a b)
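  "Return the Levenshtein distance between strings A and B (naive recursion)."
  (cond
    ;; If one string is empty, the distance is the length of the other.
    ((= (length a) 0) (length b))
    ((= (length b) 0) (length a))
    ;; Matching first characters cost nothing.
    ((char= (char a 0) (char b 0))
     (ld (subseq a 1) (subseq b 1)))
    ;; Otherwise take the cheapest of insertion, deletion, and substitution.
    (t (1+ (min
            (ld a (subseq b 1))
            (ld (subseq a 1) b)
            (ld (subseq a 1) (subseq b 1)))))))
(ld "heool wrodl" "hello world") ;=> 6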
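#!/usr/bin/sed -rf
# Simplified Soundex (GNU sed); h, w, and y are treated like vowels.
# Upper-case the first letter so the lower-case rules below leave it untouched.
s/^./\U&/
# Mark vowels and h, w, y as separators, then map consonants to digit classes.
s/[aeiouyhw]/-/g;s/[bfpv]/1/g;s/[cgjkqsxz]/2/g;y/dt/33/;y/l/4/;y/mn/55/;y/r/6/
# Collapse runs of the same digit, then drop the separators.
s/([1-6])\1+/\1/g
s/-//g
# Pad with zeros or truncate to one letter plus three digits.
s/^[A-Z]$/&000/;s/^[A-Z][1-6]$/&00/;s/^[A-Z][1-6]{2}$/&0/;s/^([A-Z][1-6]{3})[1-6]+$/\1/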
lawers -> [lowers, lawyers, layers, lasers, lagers]
trial lawers -> trial lawyers
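The candidate list for lawers can be reproduced, roughly, by keeping every vocabulary word within a small edit distance of the misspelling. The sketch below uses a dynamic-programming Levenshtein distance. The `VOCAB` set, the function names, and the `max_dist` threshold are assumptions for illustration, not something the notes specify.

```python
def edit_distance(a, b):
    """Levenshtein distance computed row by row (dynamic programming)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        prev = cur
    return prev[-1]


def candidates(word, vocabulary, max_dist=1):
    """Return the vocabulary words within max_dist edits of word, sorted."""
    return sorted(w for w in vocabulary if edit_distance(word, w) <= max_dist)


# Hypothetical vocabulary; a real system would use a dictionary or query log.
VOCAB = {"lowers", "lawyers", "layers", "lasers", "lagers", "trial", "bank"}

print(candidates("lawers", VOCAB))
# ['lagers', 'lasers', 'lawyers', 'layers', 'lowers']
```

Edit distance alone cannot choose among these five words. Picking lawyers for trial lawers needs the surrounding context as well.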
mainscourcebank
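Language model: "based on P(w)". Error model: "based on P(e|w)". Here w is the intended word and e is the observed, possibly misspelled, word.
We want P(w|e), and take the words w with a high value as correction candidates. P(e|w) is assumed to be obtainable by some means.
P(w|e) ∝ P(e|w) P(w)
P(w) is estimated as:
λP(w) + (1-λ)P(w | w_p), where w_p is the preceding word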
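A minimal sketch of this ranking, under stated assumptions: every probability below is made up, the error model is a constant placeholder (the notes leave P(e|w) unspecified), and `correct`, `UNIGRAM`, and `BIGRAM` are hypothetical names. It only shows how the interpolated P(w) lets the preceding word change the chosen correction.

```python
def correct(error, prev_word, candidates, unigram, bigram, error_model, lam=0.5):
    """Return the candidate w that maximizes P(e|w) * P(w)."""
    def score(w):
        # Interpolated language model: lam * P(w) + (1 - lam) * P(w | prev_word).
        p_w = lam * unigram.get(w, 0.0) + (1 - lam) * bigram.get((prev_word, w), 0.0)
        return error_model(error, w) * p_w
    return max(candidates, key=score)


# Made-up probabilities for illustration only.
UNIGRAM = {"lowers": 0.1, "lawyers": 0.2, "layers": 0.4, "lasers": 0.2, "lagers": 0.1}
BIGRAM = {("trial", "lawyers"): 0.6}  # stands for P(lawyers | trial)
CANDIDATES = ["lowers", "lawyers", "layers", "lasers", "lagers"]


def uniform_error_model(error, word):
    """Placeholder P(e|w): every candidate is treated as equally likely."""
    return 1.0


print(correct("lawers", None, CANDIDATES, UNIGRAM, BIGRAM, uniform_error_model))
# layers
print(correct("lawers", "trial", CANDIDATES, UNIGRAM, BIGRAM, uniform_error_model))
# lawyers
```

Without context the most frequent word wins. With trial as the preceding word, the bigram term pushes lawyers to the top.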
P(e|w) is:
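筑波大 (University of Tsukuba) -> the history of the University of Tsukuba? How to get there?
The content attribute of meta name="description"
URL text fragment: #:~:text=<text>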
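Both pieces can be sketched with the Python standard library: reading the `content` attribute of a page's `meta name="description"` tag, and building a link that ends in a `#:~:text=` fragment. The class and function names and the example.com URL are assumptions for illustration. The fragment builder covers only the simplest form, a single text string.

```python
from html.parser import HTMLParser
from urllib.parse import quote


class MetaDescriptionParser(HTMLParser):
    """Collect the content attribute of the first <meta name="description">."""

    def __init__(self):
        super().__init__()
        self.description = None

    def handle_starttag(self, tag, attrs):
        if tag == "meta" and self.description is None:
            attributes = dict(attrs)
            if (attributes.get("name") or "").lower() == "description":
                self.description = attributes.get("content")


def meta_description(html):
    """Return the page's meta description, or None if there is none."""
    parser = MetaDescriptionParser()
    parser.feed(html)
    return parser.description


def text_fragment_url(url, text):
    """Append a #:~:text= fragment that points at the given text."""
    # Percent-encode everything; "-" is also escaped because it has a
    # special meaning inside text fragments.
    encoded = quote(text, safe="").replace("-", "%2D")
    return f"{url}#:~:text={encoded}"


page = '<html><head><meta name="description" content="About the university"></head></html>'
print(meta_description(page))  # About the university
print(text_fragment_url("https://example.com/page", "access map"))
# https://example.com/page#:~:text=access%20map
```

A search result could show the meta description as a generic snippet, and use the text fragment link to point at the passage that matches the query.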