To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??殉?5鼇γ? 1001011101010001001111110011111110001111011111010011111110000010010101001110101010000111100000111100000100111111 97513f3f8f7d3f8254ea8783c13f
EUC-JP 猷??殉?5鼇γ? 1100110110110010001111110011111110111101110111100011111110100011101101011111001111100111101001101100001100111111 cdb23f3fbdde3fa3b5f3e7a6c33f
UTF-8 猷띰쩃殉믩5鼇γ궕 1110011110001100101101111110101110011101101100001110110010101001100000111110011010101110100010011110101110101111101010011110111110111100100101011110100110111100100001111100111010110011111010101011011010010101 e78cb7eb9db0eca983e6ae89ebafa9efbc95e9bc87ceb3eab695
UHC 猷띰쩃殉믩5鼇γ궕 111010111010001110110110111011111010010010011101111000101110011010010010111010111010001110110101111010001010100010100101111000111000001010101010 eba3b6efa49de2e692eba3b5e8a8a5e382aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)