What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 茹????????異?????奧??怏??^ 1110010010100101001111110011111100111111001111110011111100111111001111110011111110001000110110010011111100111111001111110011111100111111100110101111101000111111001111111001110010001001001111110011111101011110 e4a53f3f3f3f3f3f3f3f88d93f3f3f3f3f9afa3f3f9c893f3f5e
EUC-JP 茹????????異??洧??奧??怏??^ 11101000101001110011111100111111001111110011111100111111001111110011111100111111101100001101101100111111001111111000111111000111101101000011111100111111110101001111110000111111001111111101011111101001001111110011111101011110 e8a73f3f3f3f3f3f3f3fb0db3f3f8fc7b43f3fd4fc3f3fd7e93f3f5e
UTF-8 茹됰씟溜김븥栒뚦뒰異덃챼洧노젿奧롢뀒怏⑹뎠^ 11101000100011001011100111101011100100001011000011101100100101001001111111101111101001111000101111101010101110011000000011101011101110001010010111100110101000001001001011101011100110101010011011101011100100101011000011100111100101011011000011101011100011011000001111101100101100011011110011100110101101001010011111101011100001011011100011101100101000001011111111100101101001011010011111101011101000011010001011101011100000001001001011100110100000001000111111100010100100011011100111101011100011101010000001011110 e88cb9eb90b0ec949fefa78beab980ebb8a5e6a092eb9aa6eb92b0e795b0eb8d83ecb1bce6b4a7eb85b8eca0bfe5a5a7eba1a2eb8092e6808fe291b9eb8ea05e
UHC 茹됰씟溜김븥栒뚦뒰異덃챼洧노젿奧롢뀒怏⑹뎠^ 11100110101010101000100111101011100111011011001111101010111111101011000111101000100101011000111011100010111000111000110011100101100010101010100111101100101101101000100011100110101010101000100111101010111110111011001111101011101000001011000111100111111100111000111011100011100001011000110011100100111010001010100111101100101101011011000101011110 e6aa89eb9db3eafeb1e8958ee2e38ce58aa9ecb688e6aa89eafbb3eba0b1e7f38ee3858ce4e8a9ecb5b15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
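
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The function name `encode_table` and the mapping of charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from the tool's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names stand in for the tool's charset labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (bit_string, hex_string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("あ^").items():
        print(label, bits, hexed)
```

For example, `encode_table("あ")["UTF-8"]` gives the hexadecimal string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no hiragana.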