What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥〓?誼????6嚥〓∥誼? 10011010100010111000000110101100001111111000101101100010001111110011111100111111001111111000001001010101100110101000101110000001101011001000000101100001100010110110001000111111 9a8b81ac3f8b623f3f3f3f82559a8b81ac81618b623f
EUC-JP 嚥〓?誼????6嚥〓‖誼? 11010011111010111010001010101110001111111011010111000011001111110011111100111111001111111010001110110110110100111110101110100010101011101010000111000010101101011100001100111111 d3eba2ae3fb5c33f3f3f3fa3b6d3eba2aea1c2b5c33f
UTF-8 嚥〓뜄誼숁에琉우6嚥〓∥誼킾 111001011001101010100101111000111000000010010011111010111001110010000100111010001010101010111100111011001000100010000001111011001001011110010000111011111010011110001100111011001001101010110000111011111011110010010110111001011001101010100101111000111000000010010011111000101000100010100101111010001010101010111100111011011000001010111110 e59aa5e38093eb9c84e8aabcec8881ec9790efa78cec9ab0efbc96e59aa5e38093e288a5e8aabced82be
UHC 嚥〓뜄誼숁에琉우6嚥〓∥誼킾 11100110101111111010000111101011100011011000100011101011111111101001100111100110101111111010000111101011101001001011111111101100101000111011011011100110101111111010000111101011101000011010101111101011111111101011010101101000 e6bfa1eb8d88ebfe99e6bfa1eba4bfeca3b6e6bfa1eba1abebfeb568

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
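
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the helper name `to_bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("A", "utf-8")` returns `("01000001", "41")`, and a character missing from ISO-8859-1, such as "あ", comes back as `("00111111", "3f")`.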