To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 帳??辱??謠??厓 1001001010100000001111110011111110010000010010100011111100111111111001101000111100111111001111111111101010001101 92a03f3f904a3f3fe68f3f3ffa8d
EUC-JP 帳??辱??謠??厓 110001001010001000111111001111111011111110101011001111110011111111101011111011110011111100111111100011111011010011000111 c4a23f3fbfab3f3febef3f3f8fb4c7
UTF-8 帳들쎁辱덌숱謠쇽쉘厓 111001011011100010110011111010111001001110100100111011001000111010000001111010001011111010110001111010111000110110001100111011001000100010110001111010001010110010100000111011001000011110111101111011001000100110011000111001011000111010010011 e5b8b3eb93a4ec8e81e8beb1eb8d8cec88b1e8aca0ec87bdec8998e58e93
UHC 帳들쎁辱덌숱謠쇽쉘厓 1110110111100011101101011110100110011011101010111110100110110100100010001110111110111101101000101110100110101010101111001110111110111101101010011110010011101101 ede3b5e99babe9b488efbda2e9aabcefbda9e4ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)