機(jī)器學(xué)習(xí)入門-關(guān)聯(lián)分析與頻繁樣式勘探

關(guān)聯(lián)分析

import pandas
movie = pandas.read_csv('Data/movies.csv')
movie_dic = {}
for i in movie.iterrows():
    movie_dic[i[1].movieId] =i[1].title
import datetime 
df = pandas.read_csv('Data/ratings.csv')
from apyori import apriori
transactions = [ele for ele in df.groupby('userId')['movieId'].apply(list)]
rules = apriori(transactions,min_support = 0.2,min_confidence = 0.5,min_lift = 3,min_length=2)
results = list(rules)
for rec in results:
    print(rec)
    print(';'.join([movie_dic[item] for item in rec.items]))

頻繁樣式勘探

from pymining import itemmining
fp_input = itemmining.get_fptree(transactions)
report=itemmining.fpgrowth(fp_input,min_support=30,pruning=True)
for ele in report:
    if len(ele)>=6:
        print(';'.join([movie_dic[item] for item in ele]))
最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
平臺(tái)聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點(diǎn),簡(jiǎn)書系信息發(fā)布平臺(tái),僅提供信息存儲(chǔ)服務(wù)。

推薦閱讀更多精彩內(nèi)容