To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃?????午ヨ?節??搖??嚴 10001000101000010011111100111111001111110011111100111111100011001101111110000011100010000011111110010000110111110011111100111111100111011000101000111111001111111001101010001110 88a13f3f3f3f3f8cdf83883f90df3f3f9d8a3f3f9a8e
EUC-JP 娃?????午ヨ?節??搖??嚴 10110000101000110011111100111111001111110011111100111111101110001110000110100101111010000011111111000000111000010011111100111111110110011110101000111111001111111101001111101110 b0a33f3f3f3f3fb8e1a5e83fc0e13f3fd9ea3f3fd3ee
UTF-8 娃띰쉠樂쒙슁午ヨ땽節곤슘搖얏뜿嚴 111001011010100010000011111010111001110110110000111011001000100110100000111011111010011010111111111011001001001010011001111011001000101010000001111001011000110110001000111000111000001110101000111010111001010110111101111001111010111110000000111010101011001110100100111011001000101010011000111001101001000010010110111011001001011010001111111010111001110010111111111001011001101010110100 e5a883eb9db0ec89a0efa6bfec9299ec8a81e58d88e383a8eb95bde7af80eab3a4ec8a98e69096ec968feb9cbfe59ab4
UHC 娃띰쉠樂쒙슁午ヨ땽節곤슘搖얏뜿嚴 1110100011011111101101101110111110111101101010101110100011111001100111001110111110111101101100111110011111101101101010111110100010001011100100111110111110111101101100001110111110111101101101111110100011110100101111101110011010001101101110101110010111110001 e8dfb6efbdaae8f99cefbdb3e7edabe88b93efbdb0efbdb7e8f4bee68dbae5f1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)