To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤ル?熬??濡??佯??妖??轟侑?? 100110101101111110000011100010110011111111100000100100100011111100111111100101000100011100111111001111111001100011010001001111110011111110010111011001000011111100111111100011011000110010011000110100000011111100111111 9adf838b3fe0923f3f94473f3f98d13f3f97643f3f8d8c98d03f3f
EUC-JP 壤ル?熬??濡??佯??妖??轟侑?? 110101001110000110100101111010110011111111011111111100100011111100111111110001111010100000111111001111111101000011010011001111110011111111001101110001010011111100111111101110011110110011010000110100100011111100111111 d4e1a5eb3fdff23f3fc7a83f3fd0d33f3fcdc53f3fb9ecd0d23f3f
UTF-8 壤ル젶熬곷젨濡뉖ㅇ佯얇궬妖껎맓轟侑덆룂 111001011010001110100100111000111000001110101011111011001010000010110110111001111000011010101100111010101011001110110111111011001010000010101000111001101011111110100001111010111000100110010110111000111000010110000111111001001011110110101111111011001001011010000111111010101011011010101100111001011010011010010110111010101011101110001110111010111010011110010011111010001011110110011111111001001011111010010001111010111000110110000110111010111010001110000010 e5a3a4e383abeca0b6e786aceab3b7eca0a8e6bfa1eb8996e38587e4bdafec9687eab6ace5a696eabb8eeba793e8bd9fe4be91eb8d86eba382
UHC 壤ル젶熬곷젨濡뉖ㅇ佯얇궬妖껎맓轟侑덆룂 1110010110111101101010111110101110100000101010101110100010100010100000011110101110100000101000001110101110100001100001111110101110100100101101111110010110111010101111101110001110000010101111101110100011101101100000111110110110010000101001011100111011011110111010101110001010001000111010011000111110000011 e5bdabeba0aae8a281eba0a0eba187eba4b7e5babee382bee8ed83ed90a5cedeeae288e98f83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)