What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 逆?????淫 100010110111010000111111001111110011111100111111001111111000100011111010 8b743f3f3f3f3f88fa
EUC-JP 逆??洧??淫 1011010111010101001111110011111110001111110001111011010000111111001111111011000011111100 b5d53f3f8fc7b43f3fb0fc
UTF-8 逆곷벡洧귝꼮淫 111010011000000010000110111010101011001110110111111010111011001010100001111001101011010010100111111010101011011110011101111010101011110010101110111001101011011110101011 e98086eab3b7ebb2a1e6b4a7eab79deabcaee6b7ab
UHC 逆곷벡洧귝꼮淫 1110011010111101100000011110101110111010101001001110101011111011100000101110011010000100100010011110101111100010 e6bd81ebbaa4eafb82e68489ebe2
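The conversion shown in the table can be approximated with a short Python sketch. It is not the page's own implementation: the function name `to_bit_strings` is hypothetical, and the Python codec names (`iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which resembles the `3f` runs in the table, although the page's converter may substitute differently for some characters.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

text = "\u9006"  # U+9006, the first character in the table rows above
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings(text, codec)
    print(label, binary, hexadecimal)
```

For this single character, the UTF-8, SJIS-WIN, and EUC-JP results agree with the leading bytes of the corresponding table rows (`e98086`, `8b74`, `b5d5`).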

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).