Adding columns in pandas based on conditions

Question

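## What I want to do
I want to add new columns to an existing DataFrame and fill them with values determined by specific conditions.
The data comes from a game tournament, and the game's match format is as follows:
**Set 1: 2 vs 2 battle (first to win 2 games wins the set)
Set 2: 1 vs 1 battle (first to win 2 games wins the set)
Set 3: 1 vs 1 battle (3 players per side, winner stays on; the side that eliminates all opponents wins)**
The **first team to win 2 sets wins the match**.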

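The DataFrame `df` already contains the columns [match (overall match number), set (set number within the match), game (game number within the set), gamewinner (winner of that single game), team1 (name of team 1), team2 (name of team 2)].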


**↓Current DataFrame**
```

print(df) 

　　#match  #set  #game  #gamewinner  #team1  #team2
#1    1      1     1          1         A       B
#2    1      1     2          1         A       B    
#3    1      2     1          2         A       B    
#4    1      2     2          1         A       B
#5    1      2     3          2         A       B
#6    1      3     1          1         A       B
#7    1      3     2          2         A       B
#8    1      3     3          1         A       B
#9    1      3     4          1         A       B
#10   2      1     1          2         C       D

  ...(continues)
```
I want to add two new columns, setwinner and matchwinner, to this DataFrame so that any single row shows which side won the set as a whole and the match as a whole.

**↓Desired result**
```
print(df) 

　　#match  #set  #game  #gamewinner  #team1  #team2  #setwinner  #matchwinner
#1    1      1     1          1         A       B         1            1
#2    1      1     2          1         A       B         1            1   
#3    1      2     1          2         A       B         2            1    
#4    1      2     2          1         A       B         2            1
#5    1      2     3          2         A       B         2            1
#6    1      3     1          1         A       B         1            1
#7    1      3     2          2         A       B         1            1
#8    1      3     3          1         A       B         1            1
#9    1      3     4          1         A       B         1            1
#10   2      1     1          2         C       D         2            1
```

## What I don't understand
This cannot be done without using information from the preceding and following rows, so I am stuck on how to implement it.

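## What I tried
I managed to implement the one case that does not require looking at other rows, as shown below, but I could not work out the remaining cases.
```Python
# In sets 1 and 2, if the set goes to a third game, the winner of game 3 wins the set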
df.loc[(df["game"]==3)&(df["set"]!=3),"setwinner"] = df["gamewinner"]

```

## Addendum
I tried the method magichan suggested, and got the following result.
![Image description](0235e1a535fe8e5e222ce514c52d3696.png)
![Image description](f2e64e2a733625f8708ca43c2bc6fd0d.png)
Thank you in advance for your help.

Accepted Answer

The basic idea is to group by match with groupby(), then group by set within
each match to form a double loop. For each set, take the mode (most frequent
value) of 'gamewinner' to get the set winner; the side that won more sets is
then the match winner.

```Python
import pandas as pd

# Loop over each match
for match, match_df in df.groupby('match'):
    # Loop over each set (named set_no to avoid shadowing the built-in set)
    winners = []
    for set_no, set_df in match_df.groupby('set'):
        # Set winner (computed assuming there are no ties)
        set_winner = set_df['gamewinner'].mode()[0]
        print("MATCH: {}, SET: {}, WINNER: {}".format(match, set_no, set_winner))
        winners.append(set_winner)
    # Match winner (computed assuming there are no ties);
    # take the mode over all collected set winners, not only the last one
    match_winner = pd.Series(winners).mode()[0]
    print("MATCH: {}, WINNER: {}".format(match, match_winner))
```

After that, this can be written more concisely with groupby.transform() or groupby.apply().

```Python
df['setwinner'] = df.groupby(['match','set'])['gamewinner'].transform(lambda d:d.mode()[0])
df['matchwinner'] = df.groupby(['match'])['setwinner'].transform(lambda d:d.mode()[0])
```
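One caveat worth illustrating: the second `transform` above takes the mode of `setwinner` over game rows, so a long set casts more votes than a short one. The sketch below uses made-up data and assumes (this is not stated in the question) that set 3 can still be played after one side has already won two sets; in that situation counting one vote per set gives a different result. The names `per_set`, `match_winner`, and `row_weighted` are only for illustration.

```Python
import io
import pandas as pd

# Hypothetical data: team 1 wins sets 1 and 2 in two games each,
# team 2 wins a five-game set 3.
csv = """match,set,game,gamewinner
1,1,1,1
1,1,2,1
1,2,1,1
1,2,2,1
1,3,1,2
1,3,2,1
1,3,3,2
1,3,4,1
1,3,5,2
"""
df = pd.read_csv(io.StringIO(csv))

# Set winner: most frequent game winner within each (match, set).
df['setwinner'] = df.groupby(['match', 'set'])['gamewinner'].transform(lambda d: d.mode()[0])

# Row-weighted vote (same as the one-liner above): every game row counts once.
row_weighted = df.groupby('match')['setwinner'].transform(lambda d: d.mode()[0])

# Set-weighted vote: keep one row per (match, set) so each set counts once.
per_set = df.drop_duplicates(['match', 'set'])
match_winner = per_set.groupby('match')['setwinner'].agg(lambda s: s.mode()[0])
df['matchwinner'] = df['match'].map(match_winner)

print(row_weighted.tolist())       # [2, 2, 2, 2, 2, 2, 2, 2, 2]
print(df['matchwinner'].tolist())  # [1, 1, 1, 1, 1, 1, 1, 1, 1]
```

If a match always stops as soon as one side has two sets, the two versions should agree, and the shorter one-liner is enough.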

Below is a working sample.

```Python
import pandas as pd
import io

csv = """
match,set,game,gamewinner,team1,team2
1,1,1,1,A,B
1,1,2,1,A,B
1,2,1,2,A,B
1,2,2,1,A,B
1,2,3,2,A,B
1,3,1,1,A,B
1,3,2,2,A,B
1,3,3,1,A,B
1,3,4,1,A,B
2,1,1,2,B,C
2,1,2,2,B,C
2,2,1,1,B,C
2,2,2,2,B,C
2,2,3,2,B,C
2,3,1,1,B,C
2,3,2,2,B,C
2,3,3,1,B,C
2,3,4,2,B,C
2,3,5,2,B,C
"""

df = pd.read_csv(io.StringIO(csv))

df['setwinner'] = df.groupby(['match','set'])['gamewinner'].transform(lambda d:d.mode()[0])
df['matchwinner'] = df.groupby(['match'])['setwinner'].transform(lambda d:d.mode()[0])
print(df)
#    match  set  game  gamewinner team1 team2  setwinner  matchwinner
#0       1    1     1           1     A     B          1            1
#1       1    1     2           1     A     B          1            1
#2       1    2     1           2     A     B          2            1
#3       1    2     2           1     A     B          2            1
#4       1    2     3           2     A     B          2            1
#5       1    3     1           1     A     B          1            1
#6       1    3     2           2     A     B          1            1
#7       1    3     3           1     A     B          1            1
#8       1    3     4           1     A     B          1            1
#9       2    1     1           2     B     C          2            2
#10      2    1     2           2     B     C          2            2
#11      2    2     1           1     B     C          2            2
#12      2    2     2           2     B     C          2            2
#13      2    2     3           2     B     C          2            2
#14      2    3     1           1     B     C          2            2
#15      2    3     2           2     B     C          2            2
#16      2    3     3           1     B     C          2            2
#17      2    3     4           2     B     C          2            2
#18      2    3     5           2     B     C          2            2
```

---
### [Addendum]
Working sample, part 2

```Python
import pandas as pd
import io

csv = """
,week,match,set,game,team1,team2,game winner
0,1,1,1,1,gamewith,detonation,1
1,1,1,1,2,gamewith,detonation,1
2,1,1,2,1,gamewith,detonation,2
3,1,1,2,2,gamewith,detonation,1
4,1,1,2,3,gamewith,detonation,2
5,1,1,3,1,gamewith,detonation,2
6,1,1,3,2,gamewith,detonation,1
7,1,1,3,3,gamewith,detonation,1
8,1,1,3,4,gamewith,detonation,1
9,1,2,1,1,C,talon-espo,2
10,1,2,1,2,C,talon-espo,1
11,1,2,1,3,C,talon-espo,2
12,1,2,2,1,C,talon-espo,2
13,1,2,2,2,C,talon-espo,1
14,1,2,2,3,C,talon-espo,1
15,1,2,3,1,C,talon-espo,2
16,1,2,3,2,C,talon-espo,1
17,1,2,3,3,C,talon-espo,1
18,1,2,3,4,C,talon-espo,2
19,1,2,3,5,C,talon-espo,1
20,1,3,1,1,bren-espo,sandbox,2
21,1,3,1,2,bren-espo,sandbox,2
22,1,3,2,1,bren-espo,sandbox,2
23,1,3,2,2,bren-espo,sandbox,1
24,1,3,3,1,bren-espo,sandbox,2
25,1,3,3,2,bren-espo,sandbox,1
"""

df = pd.read_csv(io.StringIO(csv), index_col=0)
print(df)

df['set winner'] = df.groupby(['match','set'])['game winner'].transform(lambda d:d.mode()[0])
df['match winner'] = df.groupby(['match'])['set winner'].transform(lambda d:d.mode()[0])
print(df)
#    week  match  set  game      team1       team2  game winner  set winner  match winner
#0      1      1    1     1   gamewith  detonation            1           1             1
#1      1      1    1     2   gamewith  detonation            1           1             1
#2      1      1    2     1   gamewith  detonation            2           2             1
#3      1      1    2     2   gamewith  detonation            1           2             1
#4      1      1    2     3   gamewith  detonation            2           2             1
#5      1      1    3     1   gamewith  detonation            2           1             1
#6      1      1    3     2   gamewith  detonation            1           1             1
#7      1      1    3     3   gamewith  detonation            1           1             1
#8      1      1    3     4   gamewith  detonation            1           1             1
#9      1      2    1     1          C  talon-espo            2           2             1
#10     1      2    1     2          C  talon-espo            1           2             1
#11     1      2    1     3          C  talon-espo            2           2             1
#12     1      2    2     1          C  talon-espo            2           1             1
#13     1      2    2     2          C  talon-espo            1           1             1
#14     1      2    2     3          C  talon-espo            1           1             1
#15     1      2    3     1          C  talon-espo            2           1             1
#16     1      2    3     2          C  talon-espo            1           1             1
#17     1      2    3     3          C  talon-espo            1           1             1
#18     1      2    3     4          C  talon-espo            2           1             1
#19     1      2    3     5          C  talon-espo            1           1             1
#20     1      3    1     1  bren-espo     sandbox            2           2             1
#21     1      3    1     2  bren-espo     sandbox            2           2             1
#22     1      3    2     1  bren-espo     sandbox            2           1             1
#23     1      3    2     2  bren-espo     sandbox            1           1             1
#24     1      3    3     1  bren-espo     sandbox            2           1             1
#25     1      3    3     2  bren-espo     sandbox            1           1             1
```

Answer

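Given the nature of the tournament, the winner of the last game in a set is the set winner, and the winner of the last set in a match is the match winner.

In other words, I think you can go row by row, copying each game winner into the set winner for all rows of the same match and set, and copying each set winner into the match winner for all rows of the same match.

A code example follows. However, because it relies on the value in the final row, I think magichan's code is more convenient.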

```Python
print(df)

for i in range(1,11):

    # Note: values are only accurate for matches that have already finished
    # Record the set winner
    df.loc[(df["#match"]==df.at['#'+str(i),'#match'])&(df["#set"]==df.at['#'+str(i),'#set']),"#setwinner"] = df.at['#'+str(i),'#gamewinner']

    # Record the match winner
    df.loc[(df["#match"]==df.at['#'+str(i),'#match']),"#matchwinner"] = df.at['#'+str(i),'#setwinner']

print(df)
```
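The same "last game decides the set, last set decides the match" rule can also be sketched without an explicit row loop, using `groupby().transform('last')`. This is only a sketch: it uses the plain column names from the question (`match`, `set`, `game`, `gamewinner`) rather than the `#`-prefixed ones above, and it assumes every match in the data has finished and stopped as soon as it was decided.

```Python
import io
import pandas as pd

# Sample data: match 1 from the question.
csv = """match,set,game,gamewinner
1,1,1,1
1,1,2,1
1,2,1,2
1,2,2,1
1,2,3,2
1,3,1,1
1,3,2,2
1,3,3,1
1,3,4,1
"""
df = pd.read_csv(io.StringIO(csv))

# Make sure rows are in playing order so that "last" means the final game.
df = df.sort_values(['match', 'set', 'game'])

# Winner of the last game in each (match, set) is broadcast to the whole set.
df['setwinner'] = df.groupby(['match', 'set'])['gamewinner'].transform('last')
# Winner of the last set in each match is broadcast to the whole match.
df['matchwinner'] = df.groupby('match')['setwinner'].transform('last')

print(df)
```

For an unfinished match this would report the side that is currently leading the last recorded game, so the same caution as in the loop version applies.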


---

Also, as a complete aside: my first implementation plan was to "count game wins, then count and record set wins and match wins according to the conditions", as shown below.


```Python
import pandas as pd
df=pd.read_excel('excel1.xlsx')
print(df)

for i in range(1,11):

    # Reset the set win counters at the start of a match
    if df.at['#'+str(i),'#set']==1:
        print('match start')
        times_won_set_team1=0
        times_won_set_team2=0
    
    # Reset the game win counters at the start of a set
    if df.at['#'+str(i),'#game']==1:
        print('set start')
        times_won_game_team1=0
        times_won_game_team2=0

    # Count game wins
    if df.at['#'+str(i),'#gamewinner']==1:
        print('team1 won')
        times_won_game_team1+=1
        print('team1:'+str(times_won_game_team1))
    else:
        print('team2 won')
        times_won_game_team2+=1
        print('team2:'+str(times_won_game_team2))

    # Count set wins and record the set winner
    # Sets 1 and 2: first to 2 game wins
    if times_won_game_team1==2 and df.at['#'+str(i),'#set']<=2:
        print('team1 set won')
        times_won_set_team1+=1
        df.at['#'+str(i),'#setwinner']=1
        
    elif times_won_game_team2==2 and df.at['#'+str(i),'#set']<=2:
        print('team2 set won')
        times_won_set_team2+=1
        df.at['#'+str(i),'#setwinner']=2
    
    # Set 3: first to 3 game wins
    elif times_won_game_team1==3 and df.at['#'+str(i),'#set']==3:
        print('team1 set won')
        times_won_set_team1+=1
        df.at['#'+str(i),'#setwinner']=1

    elif times_won_game_team2==3 and df.at['#'+str(i),'#set']==3:
        print('team2 set won')
        times_won_set_team2+=1
        # Write to the current row i, as in the other branches
        df.at['#'+str(i),'#setwinner']=2
        
    # Record the match winner
    if times_won_set_team1==2:
        print('team1 match won')
        df.at['#'+str(i),'#matchwinner']=1

    elif times_won_set_team2==2:
        print('team2 match won')
        df.at['#'+str(i),'#matchwinner']=2

print(df)

```


Using this code as a starting point and revising it in various ways led to the code at the top.
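For comparison, the counting rule used in the draft (2 game wins for sets 1 and 2, 3 game wins for set 3, 2 set wins for the match) can be sketched in vectorized form as well. This is a rough sketch under a few assumptions that are mine, not from the question: plain column names without the `#` prefix, `gamewinner` containing only 1 or 2, and `0` used as a placeholder for "not decided yet".

```Python
import io
import numpy as np
import pandas as pd

# Match 1 from the question plus the first game of an unfinished match 2.
csv = """match,set,game,gamewinner
1,1,1,1
1,1,2,1
1,2,1,2
1,2,2,1
1,2,3,2
1,3,1,1
1,3,2,2
1,3,3,1
1,3,4,1
2,1,1,2
"""
df = pd.read_csv(io.StringIO(csv))

# Game wins per team within each (match, set), broadcast to every row.
keys = [df['match'], df['set']]
team1_won = df['gamewinner'].eq(1)
wins1 = team1_won.groupby(keys).transform('sum')
wins2 = (~team1_won).groupby(keys).transform('sum')

# Game wins needed to take the set: 2 in sets 1 and 2, 3 in set 3.
needed = np.where(df['set'] <= 2, 2, 3)
df['setwinner'] = np.select([wins1 >= needed, wins2 >= needed], [1, 2], default=0)

# Set wins per team within each match, counting each set once.
per_set = df.drop_duplicates(['match', 'set'])
sets1 = per_set['setwinner'].eq(1).groupby(per_set['match']).sum()
sets2 = per_set['setwinner'].eq(2).groupby(per_set['match']).sum()
match_winner = pd.Series(
    np.select([sets1 >= 2, sets2 >= 2], [1, 2], default=0),
    index=sets1.index,
)
df['matchwinner'] = df['match'].map(match_winner)

print(df)
```

Unlike the mode-based and last-row versions, this leaves unfinished sets and matches marked as undecided instead of guessing a winner.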

If your goal is to learn programming, I think you will improve faster by first writing plain, verbose, crappy code and then refining it step by step.
