Answer edit history

Addendum

2019/04/24 04:32

Posted


test CHANGED Viewed

@@ -107,3 +107,47 @@
 b'\ufeff\u3075'
 ```
+----
+This picks up where KSwordOfHaste's answer leaves off:
+```plain
+% echo -n 'あ' | LANG=C python -c 'import sys; print(sys.stdin.read().encode("unicode_escape"))'
+b'\udce3\udc81\udc82'
+% LANG=C python -c 'import sys; print(sys.stdin.encoding, sys.stdout.encoding)'
+US-ASCII US-ASCII
+```
+Under `LANG=C`, `sys.stdin` is affected as well and falls back to the US-ASCII encoding, so rewrapping both streams as UTF-8, like this,
+```python
+import io
+import sys
+sys.stdin = io.TextIOWrapper(sys.stdin.buffer, encoding="utf-8")
+sys.stdout = io.TextIOWrapper(sys.stdout.buffer, encoding="utf-8")
+```
+seems to be the right approach.
+```plain
+% echo -n 'あ' | LANG=C python -c 'import io, sys; sys.stdin = io.TextIOWrapper(sys.stdin.buffer, encoding="utf-8"); print(sys.stdin.read().encode("unicode_escape"))'
+b'\u3042'
+```