To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲l?竊??循?????韋??猿??椰 1110000110011111100000101000110000111111111000101000011000111111001111111000111101111010001111110011111100111111001111110011111111101000111010000011111100111111100010011000111000111111001111111001111010111101 e19f828c3fe2863f3f8f7a3f3f3f3f3fe8e83f3f898e3f3f9ebd
EUC-JP 癲l?竊??循??孼??韋??猿??椰 11100010101000011010001111101100001111111110001111100110001111110011111110111101110110110011111100111111100011111011101011000011001111110011111111110000111010100011111100111111101100011110111000111111001111111101110010111111 e2a1a3ec3fe3e63f3fbddb3f3f8fbac33f3ff0ea3f3fb1ee3f3fdcbf
UTF-8 癲l옓竊덂쑵循낆꽑孼꾩쉵韋껃쩂猿뗫츆椰 111001111001100110110010111011111011110110001100111011001001100010010011111001111010101110001010111010111000110110000010111011001001000110110101111001011011111010101010111010111000001010000110111010101011110110010001111001011010110110111100111010101011111010101001111011001000100110110101111010011001111110001011111010101011101110000011111011001010100110000010111001111000110010111111111010111001011110101011111011001011100010000110111001101010010010110000 e799b2efbd8cec9893e7ab8aeb8d82ec91b5e5beaaeb8286eabd91e5adbceabea9ec89b5e99f8beabb83eca982e78cbfeb97abecb886e6a4b0
UHC 癲l옓竊덂쑵循낆꽑孼꾩쉵韋껃쩂猿뗫츆椰 1110111110100110101000111110110010011110100110011110111110111100100010001110010110111110101010101110001011100000100001011110110010000100101000001110010111101101100001001110110010011010100010111110101011011111100000111110010110100100100111001110101010111011100010111110101110101110100000111110010110101011 efa6a3ec9e99efbc88e5beaae2e085ec84a0e5ed84ec9a8beadf83e5a49ceabb8bebae83e5ab

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
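
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the converter's actual implementation. The helper name `encode_table` and the `CHARSETS` mapping are hypothetical. It assumes that SJIS-WIN corresponds to Python's `cp932` codec and UHC to `cp949`, and that characters with no representation in a charset are replaced by `?` (0x3f), which is what the ISO-8859-1 row above suggests.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# SJIS-WIN -> cp932 and UHC -> cp949 are assumptions about what the page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("あ"):
    print(label, bits, hex_string)
```

For the input あ (U+3042), this yields 3f for ISO-8859-1 (the character is not representable there), 82a0 for SJIS-WIN, a4a2 for EUC-JP, and e38182 for UTF-8.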