What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥???沃??????ν?????↑ぜ 100110101000101100111111001111110011111110010111100000000011111100111111001111110011111100111111001111111000001111001011001111110011111100111111001111110011111110000001101010101000001010111010 9a8b3f3f3f97803f3f3f3f3f3f83cb3f3f3f3f3f81aa82ba
EUC-JP 嚥???沃??絪??洹ν?????↑ぜ 11010011111010110011111100111111001111111100110111100000001111110011111110001111110100111110110000111111001111111000111111000111101110101010011011001101001111110011111100111111001111110011111110100010101011001010010010111100 d3eb3f3f3fcde03f3f8fd3ec3f3f8fc7baa6cd3f3f3f3f3fa2aca4bc
UTF-8 嚥싲갭큔沃쇱뼏絪뷸뵺洹ν맇嶺뚮뿪璘↑ぜ 1110010110011010101001011110110010001011101100101110101010110000101011011110110110000001100101001110011010110010100000111110110010000111101100011110101110111100100011111110011110110101101010101110101110110111101110001110101110110101101110101110011010110100101110011100111010111101111010111010011110000111111011111010011010101011111010111001101010101110111010111011111110101010111011111010011110101111111000101000011010010001111000111000000110011100 e59aa5ec8bb2eab0aded8194e6b283ec87b1ebbc8fe7b5aaebb7b8ebb5bae6b4b9cebdeba787efa6abeb9aaeebbfaaefa7afe28691e3819c
UHC 嚥싲갭큔沃쇱뼏絪뷸뵺洹ν맇嶺뚮뿪璘↑ぜ 1110011010111111100110101110101110110000101110001100010110100110111010001010101010111100111011001001011010010111111011001101111110111010111001101001010010111000111010101011011110100101111011011001000010100001111001111010110110001100111010111001011110101010111011001101111010100001111010001010101010111100 e6bf9aebb0b8c5a6e8aabcec9697ecdfbae694b8eab7a5ed90a1e7ad8ceb97aaecdea1e8aabc

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
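
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper name `encode_in_charsets` are assumptions chosen for illustration. Replacing unencodable characters with "?" (0x3f) is likewise an assumption, chosen because the table shows 3f in those positions. Other implementations may map some characters differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent (an assumption about the converter).
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    # "↑ぜ" are the last two characters of the sample input in the table.
    for label, (bits, hex_string) in encode_in_charsets("↑ぜ").items():
        print(label, bits, hex_string)
```

For the input "↑ぜ", the hexadecimal output should match the trailing bytes of the corresponding rows in the table, for example e28691e3819c for UTF-8 and 81aa82ba for SJIS-WIN.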