To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厭?????????????????壹??^ 100010010111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001101011100011001111110011111101011110 897d3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f9ae33f3f5e
EUC-JP 厭?????繇???????????壹??^ 1011000111011110001111110011111100111111001111110011111110001111110101001101000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101010011100101001111110011111101011110 b1de3f3f3f3f3f8fd4d13f3f3f3f3f3f3f3f3f3f3fd4e53f3f5e
UTF-8 厭뗭컮溜곕젧繇꾪렆閱뉒뼀李붾젾溜잓즲壹좄룎^ 11100101100011101010110111101011100101111010110111101100101110111010111011101111101001111000101111101010101100111001010111101100101000001010011111100111101110011000011111101010101111101010101011101011101000001000011011101001100101101011000111101011100010011001001011101011101111001000000011101111101001111010000111101011101101101011111011101100101000001011111011101111101001111000101111101100100111101001001111101100101001101011001011100101101000111011100111101100101000101000010011101011101000111000111001011110 e58eadeb97adecbbaeefa78beab395eca0a7e7b987eabeaaeba086e996b1eb8992ebbc80efa7a1ebb6beeca0beefa78bec9e93eca6b2e5a3b9eca284eba38e5e
UHC 厭뗭컮溜곕젧繇꾪렆閱뉒뼀李붾젾溜잓즲壹좄룎^ 11100110111101001000101111101100101100001001010011101010111111101011000011101011101000001001111111101001101000111000010011101101100011101010000011100110111100111000011111100111100101101000101111101100101100001001010011101011101000001011000011101010111111101001111111101001101000111000010011101100111011001010000011101000100011111000110001011110 e6f48becb094eafeb0eba09fe9a384ed8ea0e6f387e7968becb094eba0b0eafe9fe9a384ececa0e88f8c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)