To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲l?竊??淫?????悠??淫??沃 1110000110011111100000101000110000111111111000101000011000111111001111111000100011111010001111110011111100111111001111110011111110010111010010010011111100111111100010001111101000111111001111111001011110000000 e19f828c3fe2863f3f88fa3f3f3f3f3f97493f3f88fa3f3f9780
EUC-JP 癲l?竊??淫??孼??悠??淫??沃 11100010101000011010001111101100001111111110001111100110001111110011111110110000111111000011111100111111100011111011101011000011001111110011111111001101101010100011111100111111101100001111110000111111001111111100110111100000 e2a1a3ec3fe3e63f3fb0fc3f3f8fbac33f3fcdaa3f3fb0fc3f3fcde0
UTF-8 癲l옓竊덃쾮淫뉍뼳孼꾬퐢悠덃썫淫롫걝沃 111001111001100110110010111011111011110110001100111011001001100010010011111001111010101110001010111010111000110110000011111011001011111010101110111001101011011110101011111010111000100110001101111010111011110010110011111001011010110110111100111010101011111010101100111011011001000010100010111001101000001010100000111010111000110110000011111011001000110110101011111001101011011110101011111010111010000110101011111010101011000110011101111001101011001010000011 e799b2efbd8cec9893e7ab8aeb8d83ecbeaee6b7abeb898debbcb3e5adbceabeaced90a2e682a0eb8d83ec8dabe6b7abeba1abeab19de6b283
UHC 癲l옓竊덃쾮淫뉍뼳孼꾬퐢悠덃썫淫롫걝沃 1110111110100110101000111110110010011110100110011110111110111100100010001110011010110010100001011110101111100010100001111110001010010110101101101110010111101101100001001110111110111101100010111110101011101101100010001110011010011011100111001110101111100010100011101110101110000001100001101110100010101010 efa6a3ec9e99efbc88e6b285ebe287e296b6e5ed84efbd8beaed88e69b9cebe28eeb8186e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)