To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 嚥〓?誼?┸飮??嚥〓?誼?┸飮??B 1001101010001011100000011010110000111111100010110110001000111111100001001011110110011111010110100011111100111111100110101000101110000001101011000011111110001011011000100011111110000100101111011001111101011010001111110011111101000010 9a8b81ac3f8b623f84bd9f5a3f3f9a8b81ac3f8b623f84bd9f5a3f3f42
EUC-JP 嚥〓?誼?┸飮??嚥〓?誼?┸飮??B 1101001111101011101000101010111000111111101101011100001100111111101010001011111111011101101110110011111100111111110100111110101110100010101011100011111110110101110000110011111110101000101111111101110110111011001111110011111101000010 d3eba2ae3fb5c33fa8bfddbb3f3fd3eba2ae3fb5c33fa8bfddbb3f3f42
UTF-8 嚥〓돃誼쏉┸飮귣림嚥〓돃誼쏉┸飮귣림B 11100101100110101010010111100011100000001001001111101011100011111000001111101000101010101011110011101100100011111000100111100010100101001011100011101001101000111010111011101010101101111010001111101011101001101011110011100101100110101010010111100011100000001001001111101011100011111000001111101000101010101011110011101100100011111000100111100010100101001011100011101001101000111010111011101010101101111010001111101011101001101011110001000010 e59aa5e38093eb8f83e8aabcec8f89e294b8e9a3aeeab7a3eba6bce59aa5e38093eb8f83e8aabcec8f89e294b8e9a3aeeab7a3eba6bc42
UHC 嚥〓돃誼쏉┸飮귣림嚥〓돃誼쏉┸飮귣림B 11100110101111111010000111101011100010011001011011101011111111101001101111101111101001101011111111101011111001101000001011101011101110001011001011100110101111111010000111101011100010011001011011101011111111101001101111101111101001101011111111101011111001101000001011101011101110001011001001000010 e6bfa1eb8996ebfe9befa6bfebe682ebb8b2e6bfa1eb8996ebfe9befa6bfebe682ebb8b242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)