To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癌??獄????ジ毅 1000101011100000001111110011111110001101100101100011111100111111001111110011111110000011010101111000101101000010 8ae03f3f8d963f3f3f3f83578b42
EUC-JP 癌??獄??嫄?ジ毅 10110100111000100011111100111111101110011111011000111111001111111000111110111010101000010011111110100101101110001011010110100011 b4e23f3fb9f63f3f8fbaa13fa5b8b5a3
UTF-8 癌뺣졁獄몄옺嫄띹ジ毅 111001111001100110001100111010111011101010100011111011001010000110000001111001111000110110000100111010111010101010000100111011001001100010111010111001011010101110000100111010111001110110111001111000111000001010111000111001101010111110000101 e7998cebbaa3eca181e78d84ebaa84ec98bae5ab84eb9db9e382b8e6af85
UHC 癌뺣졁獄몄옺嫄띹ジ毅 1110010011011111100101011110101110100000101100101110100010101011101110001110110010011110101100001110101010110001100011011110100010101011101110001110101111110110 e4df95eba0b2e8abb8ec9eb0eab18de8abb8ebf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)