如何通过使用循环和 if 语句在字符串变量中搜索字符（子字符串）来创建变量

发布于 2025-01-13 19:34:20 字数 1071 浏览 1 评论 0原文

'大家好。该变量具有字母和字母数字字符。如果它有“m”字符，则为“百万”，如果它有“Th”。是千。

df['mkt_value']

0       €15.00m
1        €1.00m
2       €100Th.
3        €3.00m
4        €900Th.
5        Free

'我假装 i) 通过创建虚拟变量来识别字符串变量是数百万 (m) 还是数千 (Th.)。然后 ii) 使用这个虚拟变量来获取一个新的整数变量，其中百万是数千'

#Desire output
df['mi']

0       15000
1        1000
2         100
3        3000
4         900
5         nan

'所以，我首先进行设置，然后创建一个带有循环的虚拟变量，最后为数千创建一个整数：'

m = 'm'
th = 'Th'
dtype = {"money": "category"}
l_MKV = df['mkt_value'].tolist()
df['mi'] = df['mkt_value'].str.strip('mTh.€')

#loop for var dummy
for x in l_MKV:
    if m in x:
        df["money"]= 1
else:
       df["money"]= 0

# var integer for thousands: 1 million , 0 thousand
if df["money"] == 1:
        df["miles"] = int(df['mi']) * 100
        else:
                ALL['mi']

'循环（对于var dummy) 不起作用。我得到：'

df["money"]

0       0
1       0
2       0
3       0
4       0

'并且我得到 var 整数的语法错误，没有更多规范

我错过了什么？

感谢您的帮助'。

原文

'Hi, everyone. This variable has alphabetic and alphanumeric characters. If it has 'm' character is million and if it has 'Th.' is thousand.

df['mkt_value']

0       €15.00m
1        €1.00m
2       €100Th.
3        €3.00m
4        €900Th.
5        Free

'I pretend to i) identify if string variable is millions (m) or thousands (Th.) by creating a dummy variable. And then ii) use this dummy to get a new integer variable which millions be thousands'

#Desire output
df['mi']

0       15000
1        1000
2         100
3        3000
4         900
5         nan

'So, I first do a set up, then create a dummy with a loop and finally create a integer for the thousands:'

m = 'm'
th = 'Th'
dtype = {"money": "category"}
l_MKV = df['mkt_value'].tolist()
df['mi'] = df['mkt_value'].str.strip('mTh.€')

#loop for var dummy
for x in l_MKV:
    if m in x:
        df["money"]= 1
else:
       df["money"]= 0

# var integer for thousands: 1 million , 0 thousand
if df["money"] == 1:
        df["miles"] = int(df['mi']) * 100
        else:
                ALL['mi']

'The loop (for var dummy) is not working. I get:'

df["money"]

0       0
1       0
2       0
3       0
4       0

'And I get a syntax error for var integer without more specification

What I have missed?

Thanks for any help'.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

还给你自由 2025-01-20 19:34:20

您的代码的问题在于您尝试修改系列中的单行的方式。例如，df["money"] = 0 实际上会将所有行设置为零。

我不会乱搞虚拟列，而是创建一个单独的函数来解析值并使用 DataFrame.apply()：

def parse_amount(x):
    print(x[0])
    factor = {
        'm': 1000000,
        'Th': 1000
        }
    s = x['mkt_value'].strip('mTh.€')
    try:
        number = float(s)
        for f in factor.keys():
            if f in x['mkt_value']:
                number *= factor[f]
        return number           
    except:
        return 0

# import pandas as pd
# df = pd.DataFrame([{0:'€15.00m'},{0:'€1.00m'},{0:'€100Th.'},{0:'€3.00m'},{0:'€900Th'},{0:'Free'}])

df['money'] = df.apply(lambda x: parse_amount(x), axis = 1)
print(df)

The issue with your code is the way you try to modified a single row in a series. For example, df["money"] = 0 will actually set all rows to zero.

Rather than messing around with dummy columns, I would create a separate function for parsing the values and use DataFrame.apply():

def parse_amount(x):
    print(x[0])
    factor = {
        'm': 1000000,
        'Th': 1000
        }
    s = x['mkt_value'].strip('mTh.€')
    try:
        number = float(s)
        for f in factor.keys():
            if f in x['mkt_value']:
                number *= factor[f]
        return number           
    except:
        return 0

# import pandas as pd
# df = pd.DataFrame([{0:'€15.00m'},{0:'€1.00m'},{0:'€100Th.'},{0:'€3.00m'},{0:'€900Th'},{0:'Free'}])

df['money'] = df.apply(lambda x: parse_amount(x), axis = 1)
print(df)

回复收藏 0 原文

~没有更多了~