Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 夭c???????n}夭c???????n{^ 100110101110111010000010100000110011111100111111001111110011111100111111001111110011111101101110011111011001101011101110100000101000001100111111001111110011111100111111001111110011111100111111011011100111101101011110 9aee82833f3f3f3f3f3f3f6e7d9aee82833f3f3f3f3f3f3f6e7b5e
EUC-JP 夭c?嫄?????n}夭c?嫄?????n{^ 11010100111100001010001111100011001111111000111110111010101000010011111100111111001111110011111100111111011011100111110111010100111100001010001111100011001111111000111110111010101000010011111100111111001111110011111100111111011011100111101101011110 d4f0a3e33f8fbaa13f3f3f3f3f6e7dd4f0a3e33f8fbaa13f3f3f3f3f6e7b5e
UTF-8 夭c끂嫄쇘쳽栒멸굴n}夭c끂嫄쇘쳽栒멸굴n{^ 1110010110100100101011011110111110111101100000111110101110000001100000101110010110101011100001001110110010000111100110001110110010110011101111011110011010100000100100101110101110101001101110001110101010110101101101000110111001111101111001011010010010101101111011111011110110000011111010111000000110000010111001011010101110000100111011001000011110011000111011001011001110111101111001101010000010010010111010111010100110111000111010101011010110110100011011100111101101011110 e5a4adefbd83eb8182e5ab84ec8798ecb3bde6a092eba9b8eab5b46e7de5a4adefbd83eb8182e5ab84ec8798ecb3bde6a092eba9b8eab5b46e7b5e
UHC 夭c끂嫄쇘쳽栒멸굴n}夭c끂嫄쇘쳽栒멸굴n{^ 1110100011101100101000111110001110000101101110001110101010110001101111001110011110101011101000001110001011100011101110001110101010110001101111000110111001111101111010001110110010100011111000111000010110111000111010101011000110111100111001111010101110100000111000101110001110111000111010101011000110111100011011100111101101011110 e8eca3e385b8eab1bce7aba0e2e3b8eab1bc6e7de8eca3e385b8eab1bce7aba0e2e3b8eab1bc6e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
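
The conversion shown in the table can be approximated with a short script. This is only a sketch, not the page's own implementation, which is not shown here. The function name `encode_row` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Charset labels follow the table above; the Python codec names on the
# right are assumptions chosen for this sketch.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns every unencodable character into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "n{^"  # the ASCII tail of the sample input in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

Because all five charsets are ASCII-compatible, the sample "n{^" encodes to 6e7b5e in every row, as it does at the end of each table row. Differences appear only for non-ASCII input, and a different codec implementation may map some rare characters differently.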