To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 臟棒???臟棒? 111001000110011010010110010111110011111100111111001111111110010001100110100101100101111100111111 e466965f3f3f3fe466965f3f
EUC-JP 臟棒?玎?臟棒? 1110011111000111110010111100000000111111100011111100101111010010001111111110011111000111110010111100000000111111 e7c7cbc03f8fcbd23fe7c7cbc03f
UTF-8 臟棒틴玎렕臟棒텼 111010001000011110011111111001101010001110010010111011011000101110110100111001111000111010001110111010111010000010010101111010001000011110011111111001101010001110010010111011011000010110111100 e8879fe6a392ed8bb4e78e8eeba095e8879fe6a392ed85bc
UHC 臟棒틴玎렕臟棒텼 11101101111101001101110011101010110001101011111011101111111010011000111010101010111011011111010011011100111010101100010111100001 edf4dceac6beefe98eaaedf4dceac5e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)