To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 壅??曜?????n}壅??曜?????n{^ 100110101101011100111111001111111001011101101010001111110011111100111111001111110011111101101110011111011001101011010111001111110011111110010111011010100011111100111111001111110011111100111111011011100111101101011110 9ad73f3f976a3f3f3f3f3f6e7d9ad73f3f976a3f3f3f3f3f6e7b5e
EUC-JP 壅??曜?????n}壅??曜?????n{^ 110101001101100100111111001111111100110111001011001111110011111100111111001111110011111101101110011111011101010011011001001111110011111111001101110010110011111100111111001111110011111100111111011011100111101101011110 d4d93f3fcdcb3f3f3f3f3f6e7dd4d93f3fcdcb3f3f3f3f3f6e7b5e
UTF-8 壅ㅻ젫曜뺣젺閱곕젙n}壅ㅻ젫曜뺣젺閱곕젙n{^ 1110010110100011100001011110001110000101101110111110110010100000101010111110011010011011100111001110101110111010101000111110110010100000101110101110100110010110101100011110101010110011100101011110110010100000100110010110111001111101111001011010001110000101111000111000010110111011111011001010000010101011111001101001101110011100111010111011101010100011111011001010000010111010111010011001011010110001111010101011001110010101111011001010000010011001011011100111101101011110 e5a385e385bbeca0abe69b9cebbaa3eca0bae996b1eab395eca0996e7de5a385e385bbeca0abe69b9cebbaa3eca0bae996b1eab395eca0996e7b5e
UHC 壅ㅻ젫曜뺣젺閱곕젙n}壅ㅻ젫曜뺣젺閱곕젙n{^ 1110100010110101101001001110101110100000101000111110100011111000100101011110101110100000101011011110011011110011101100001110101110100000100101010110111001111101111010001011010110100100111010111010000010100011111010001111100010010101111010111010000010101101111001101111001110110000111010111010000010010101011011100111101101011110 e8b5a4eba0a3e8f895eba0ade6f3b0eba0956e7de8b5a4eba0a3e8f895eba0ade6f3b0eba0956e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)