如何解决Python文本处理：attributeError：＆＃x27; list＆＃x27;对象没有属性。

发布于 2025-02-01 19:00:56 字数 2882 浏览 4 评论 0原文

我是Python的新手，可以stackoverflow和Gui Python（请保持温柔），并试图批准如何进行KNN分析。我正在使用我自己构建的代码组合的组合：Python -AttributeError：

doc = doc.lower()
AttributeError: 'list' object has no attribute 'lower'

这是我的代码：

selct = StringVar()
categorychoosen = ttk.Combobox(top, width = 27, textvariable = selct)
categorychoosen['values'] = (' Computer Science', 
                          ' computer engineering',
                          ' Information Technology',
                          ' artificial intelligence',
                          ' cyber security',
                          ' computer networks',
                          ' Information Security',
                          ' Management Information Systems',
                          ' Software engineering',
                          ' data analysis',
                          ' Data Science')
  
categorychoosen.grid(row=1, column=2)
categorychoosen.current()

s = StringVar()
choosen = ttk.Combobox(top, width = 27, textvariable = s)
choosen['values'] = (' Computer Science', 
                          ' computer engineering',
                          ' Information Technology',
                          ' artificial intelligence',
                          ' cyber security',
                          ' computer networks',
                          ' Information Security',
                          ' Management Information Systems',
                          ' Software engineering',
                          ' data analysis',
                          ' Data Science')
  
choosen.grid(row=1, column=3)
choosen.current()

def model():
    
    from sklearn.model_selection import train_test_split
    from sklearn.feature_extraction.text import TfidfVectorizer
    from scipy.sparse import hstack
    from sklearn.multiclass import OneVsRestClassifier
    from sklearn.neighbors import KNeighborsClassifier

    resume = pd.read_csv(r'/Users/asma/Desktop/UpdatedResumeDataSet.csv')

    #DATA
    x = resume['Resume'].values
    y = resume['Category'].values
    v = [[selct.get(),s.get()]]

    #transform
    word = TfidfVectorizer(sublinear_tf=True, stop_words='english')
    word.fit(x)
    wordFeatures = word.transform(x)
    
    w = TfidfVectorizer(sublinear_tf=True, stop_words='english')
    w.fit(v)
    wx = word.transform(v)

    # to 2D Array
    wx.reshape(-1, 1)
    wordFeatures.reshape(-1, 1)
    x.reshape(-1, 1)

    #KNN 
    model = KNeighborsClassifier(n_neighbors=5, metric= 'euclidean')
    model.fit(wordFeatures,y)
    x_test = wx
    y_pred = model.predict([x_test])
    jobR = Label(top,text=str([y_pred]) ,bg='light gray').grid(row=4,column=2)

but= Button(top,text="Start",bg='gray', command=model).grid(row=3,column=0)

在“转换过程”之前或之后，我可以在哪里添加“较低”，我将使用什么数据？ 简历['简历']。值或[[SELCT.GEG（），s.get（）]]。

任何帮助将不胜感激。

原文

I am new to Python to Stackoverflow and GUI Python (please be gentle) and am trying to aplaing how to do a KNN analysis. I am using a combination of codes I built it myself: Python - AttributeError:

doc = doc.lower()
AttributeError: 'list' object has no attribute 'lower'

This is my code:

selct = StringVar()
categorychoosen = ttk.Combobox(top, width = 27, textvariable = selct)
categorychoosen['values'] = (' Computer Science', 
                          ' computer engineering',
                          ' Information Technology',
                          ' artificial intelligence',
                          ' cyber security',
                          ' computer networks',
                          ' Information Security',
                          ' Management Information Systems',
                          ' Software engineering',
                          ' data analysis',
                          ' Data Science')
  
categorychoosen.grid(row=1, column=2)
categorychoosen.current()

s = StringVar()
choosen = ttk.Combobox(top, width = 27, textvariable = s)
choosen['values'] = (' Computer Science', 
                          ' computer engineering',
                          ' Information Technology',
                          ' artificial intelligence',
                          ' cyber security',
                          ' computer networks',
                          ' Information Security',
                          ' Management Information Systems',
                          ' Software engineering',
                          ' data analysis',
                          ' Data Science')
  
choosen.grid(row=1, column=3)
choosen.current()

def model():
    
    from sklearn.model_selection import train_test_split
    from sklearn.feature_extraction.text import TfidfVectorizer
    from scipy.sparse import hstack
    from sklearn.multiclass import OneVsRestClassifier
    from sklearn.neighbors import KNeighborsClassifier

    resume = pd.read_csv(r'/Users/asma/Desktop/UpdatedResumeDataSet.csv')

    #DATA
    x = resume['Resume'].values
    y = resume['Category'].values
    v = [[selct.get(),s.get()]]

    #transform
    word = TfidfVectorizer(sublinear_tf=True, stop_words='english')
    word.fit(x)
    wordFeatures = word.transform(x)
    
    w = TfidfVectorizer(sublinear_tf=True, stop_words='english')
    w.fit(v)
    wx = word.transform(v)

    # to 2D Array
    wx.reshape(-1, 1)
    wordFeatures.reshape(-1, 1)
    x.reshape(-1, 1)

    #KNN 
    model = KNeighborsClassifier(n_neighbors=5, metric= 'euclidean')
    model.fit(wordFeatures,y)
    x_test = wx
    y_pred = model.predict([x_test])
    jobR = Label(top,text=str([y_pred]) ,bg='light gray').grid(row=4,column=2)

but= Button(top,text="Start",bg='gray', command=model).grid(row=3,column=0)

Where can I add the 'lower' before or after 'the transform process', and what data will I use for it? resume['Resume'].values or [[selct.get(),s.get()]].

Any help would be massively appreciated.

分享到QQ

分享到微博