回答編集履歴

2019/03/04 05:02

投稿

スコア21956

test CHANGED Viewed

@@ -59,3 +59,77 @@
 Name: nDetections, dtype: int64
 ```
+## 追記
+> 該当する行を丸々抽出したいという場合はどうしたら良いでしょうか。
+1. 列 nDetections が大きい順にならべる。
+`df.sort_values(['nDetections'], ascending=False)`
+2. 列 ra, dec の値が同じものは削除する。
+`drop_duplicates(['ra', 'dec'])`
+drop_duplicates() のデフォルト引数が keep='first' なので、列 ra, dec の値が同じ行で列 nDetections の値が一番大きい行が残り、ほかは削除される。
+```python
+from io import StringIO
+import pandas as pd
+text = StringIO('''ra      dec      nDetections   test
+123.456      5.555      3  a
+123.456      5.555      6  b
+123.456    5.555    9  c
+456.789    6.666    10  d
+456.789    6.666    7  e
+456.789    6.666    11  f''')
+df = pd.read_csv(text, delim_whitespace=True)
+extract = df.sort_values(['nDetections'], ascending=False).drop_duplicates(['ra', 'dec'])
+extract
+```
+```
+ra	dec	nDetections	test
+5	456.789	6.666	11	f
+2	123.456	5.555	9	c
+```

2019/03/04 05:01

投稿

スコア21956

test CHANGED Viewed

@@ -42,6 +42,20 @@
-group['nDetections'].max()
+print(group['nDetections'].max())
 ```
+```txt
+ra       dec
+123.456  5.555     9
+456.789  6.666    11
+Name: nDetections, dtype: int64
+```