To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥〓∥誼??鷹?0嚥〓∥誼??鷹?0^ 10011010100010111000000110101100100000010110000110001011011000100011111100111111100100011110100100111111100000100100111110011010100010111000000110101100100000010110000110001011011000100011111100111111100100011110100100111111100000100100111101011110 9a8b81ac81618b623f3f91e93f824f9a8b81ac81618b623f3f91e93f824f5e
EUC-JP 嚥〓‖誼??鷹?0嚥〓‖誼??鷹?0^ 11010011111010111010001010101110101000011100001010110101110000110011111100111111110000101110101100111111101000111011000011010011111010111010001010101110101000011100001010110101110000110011111100111111110000101110101100111111101000111011000001011110 d3eba2aea1c2b5c33f3fc2eb3fa3b0d3eba2aea1c2b5c33f3fc2eb3fa3b05e
UTF-8 嚥〓∥誼썸에鷹낆0嚥〓∥誼썸에鷹낆0^ 11100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101100100011011011100011101100100101111001000011101001101101111011100111101011100000101000011011101111101111001001000011100101100110101010010111100011100000001001001111100010100010001010010111101000101010101011110011101100100011011011100011101100100101111001000011101001101101111011100111101011100000101000011011101111101111001001000001011110 e59aa5e38093e288a5e8aabcec8db8ec9790e9b7b9eb8286efbc90e59aa5e38093e288a5e8aabcec8db8ec9790e9b7b9eb8286efbc905e
UHC 嚥〓∥誼썸에鷹낆0嚥〓∥誼썸에鷹낆0^ 11100110101111111010000111101011101000011010101111101011111111101011110111100110101111111010000111101011111011011000010111101100101000111011000011100110101111111010000111101011101000011010101111101011111111101011110111100110101111111010000111101011111011011000010111101100101000111011000001011110 e6bfa1eba1abebfebde6bfa1ebed85eca3b0e6bfa1eba1abebfebde6bfa1ebed85eca3b05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)