To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厭?????垂?????厭?????垂?????^ 1000100101111101001111110011111100111111001111110011111110010000100000100011111100111111001111110011111100111111100010010111110100111111001111110011111100111111001111111001000010000010001111110011111100111111001111110011111101011110 897d3f3f3f3f3f90823f3f3f3f3f897d3f3f3f3f3f90823f3f3f3f3f5e
EUC-JP 厭?????垂?????厭?????垂?????^ 1011000111011110001111110011111100111111001111110011111110111111111000100011111100111111001111110011111100111111101100011101111000111111001111110011111100111111001111111011111111100010001111110011111100111111001111110011111101011110 b1de3f3f3f3f3fbfe23f3f3f3f3fb1de3f3f3f3f3fbfe23f3f3f3f3f5e
UTF-8 厭얜줃琉볧꺇垂좄럷麗볟츕厭얜줃琉볧꺇垂좄럷麗볠쭚^ 11100101100011101010110111101100100101101001110011101100101001001000001111101111101001111000110011101011101100111010011111101010101110101000011111100101100111101000001011101100101000101000010011101011100111111011011111101111101001101000100011101011101100111001111111101100101110001001010111100101100011101010110111101100100101101001110011101100101001001000001111101111101001111000110011101011101100111010011111101010101110101000011111100101100111101000001011101100101000101000010011101011100111111011011111101111101001101000100011101011101100111010000011101100101011011001101001011110 e58eadec969ceca483efa78cebb3a7eaba87e59e82eca284eb9fb7efa688ebb39fecb895e58eadec969ceca483efa78cebb3a7eaba87e59e82eca284eb9fb7efa688ebb3a0ecad9a5e
UHC 厭얜줃琉볧꺇垂좄럷麗볟츕厭얜줃琉볧꺇垂좄럷麗볠쭚^ 11100110111101001011111011101011101000011001101011101011101001001001001111101101100000111010111011100001111101111010000011101000100011101001011011100110101100001001001111100101101011101000111111100110111101001011111011101011101000011001101011101011101001001001001111101101100000111010111011100001111101111010000011101000100011101001011011100110101100001001001111100110101001111001000001011110 e6f4beeba19aeba493ed83aee1f7a0e88e96e6b093e5ae8fe6f4beeba19aeba493ed83aee1f7a0e88e96e6b093e6a7905e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)