To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????濡??[??????濡??[^ 0011111100111111001111110011111100111111001111111001010001000111001111110011111101011011001111110011111100111111001111110011111100111111100101000100011100111111001111110101101101011110 3f3f3f3f3f3f94473f3f5b3f3f3f3f3f3f94473f3f5b5e
EUC-JP 縯?????濡??[縯?????濡??[^ 100011111101010011001011001111110011111100111111001111110011111111000111101010000011111100111111010110111000111111010100110010110011111100111111001111110011111100111111110001111010100000111111001111110101101101011110 8fd4cb3f3f3f3f3fc7a83f3f5b8fd4cb3f3f3f3f3fc7a83f3f5b5e
UTF-8 縯ㅻ죰醴노죻濡뗨넶[縯ㅻ죰醴노죻濡뗨넶[^ 111001111011100010101111111000111000010110111011111011001010001110110000111011111010011010110111111010111000010110111000111011001010001110111011111001101011111110100001111010111001011110101000111010111000010010110110010110111110011110111000101011111110001110000101101110111110110010100011101100001110111110100110101101111110101110000101101110001110110010100011101110111110011010111111101000011110101110010111101010001110101110000100101101100101101101011110 e7b8afe385bbeca3b0efa6b7eb85b8eca3bbe6bfa1eb97a8eb84b65be7b8afe385bbeca3b0efa6b7eb85b8eca3bbe6bfa1eb97a8eb84b65b5e
UHC 縯ㅻ죰醴노죻濡뗨넶[縯ㅻ죰醴노죻濡뗨넶[^ 111001101110000010100100111010111010000110001011111001111110010010110011111010111010000110010101111010111010000110001011111010001000011010110011010110111110011011100000101001001110101110100001100010111110011111100100101100111110101110100001100101011110101110100001100010111110100010000110101100110101101101011110 e6e0a4eba18be7e4b3eba195eba18be886b35be6e0a4eba18be7e4b3eba195eba18be886b35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)