What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 堰????????n}堰????????n{^ 10001001100000010011111100111111001111110011111100111111001111110011111100111111011011100111110110001001100000010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 89813f3f3f3f3f3f3f3f6e7d89813f3f3f3f3f3f3f3f6e7b5e
EUC-JP 堰?????洧??n}堰?????洧??n{^ 1011000111100001001111110011111100111111001111110011111110001111110001111011010000111111001111110110111001111101101100011110000100111111001111110011111100111111001111111000111111000111101101000011111100111111011011100111101101011110 b1e13f3f3f3f3f8fc7b43f3f6e7db1e13f3f3f3f3f8fc7b43f3f6e7b5e
UTF-8 堰묐쓹紐룡뒔洧뺣첓n}堰묐쓹紐룡뒔洧뺣첓n{^ 1110010110100000101100001110101110101100100100001110110010010011101110011110111110100111100011111110101110100011101000011110101110010010100101001110011010110100101001111110101110111010101000111110110010110010100100110110111001111101111001011010000010110000111010111010110010010000111011001001001110111001111011111010011110001111111010111010001110100001111010111001001010010100111001101011010010100111111010111011101010100011111011001011001010010011011011100111101101011110 e5a0b0ebac90ec93b9efa78feba3a1eb9294e6b4a7ebbaa3ecb2936e7de5a0b0ebac90ec93b9efa78feba3a1eb9294e6b4a7ebbaa3ecb2936e7b5e
UHC 堰묐쓹紐룡뒔洧뺣첓n}堰묐쓹紐룡뒔洧뺣첓n{^ 1110010111101000100100011110101110011101100101011110101110101010101101111110011010001010100100011110101011111011100101011110101110101010101000000110111001111101111001011110100010010001111010111001110110010101111010111010101010110111111001101000101010010001111010101111101110010101111010111010101010100000011011100111101101011110 e5e891eb9d95ebaab7e68a91eafb95ebaaa06e7de5e891eb9d95ebaab7e68a91eafb95ebaaa06e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
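
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the charsets listed above. It also assumes that unencodable characters are replaced with "?" (0x3f), as the table appears to do. The helper name `bit_strings` and the `CHARSETS` mapping are made up for illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The correspondence is an assumption, in particular "UHC" -> "cp949".
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for name, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings("堰n}", codec)
        print(name, binary, hexadecimal)
```

Python substitutes exactly one "?" per unencodable character, so the number of 3f bytes may differ from what the page prints for the same input.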