To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???A}v???A}vB 00111111001111110011111101000001011111010111011000111111001111110011111101000001011111010111011001000010 3f3f3f417d763f3f3f417d7642
SJIS-WIN 魯扉?A}v魯扉?A}vB 1001100001000100100101001110000000111111010000010111110101110110100110000100010010010100111000000011111101000001011111010111011001000010 984494e03f417d76984494e03f417d7642
EUC-JP 魯扉?A}v魯扉?A}vB 1100111110100101110010001110001000111111010000010111110101110110110011111010010111001000111000100011111101000001011111010111011001000010 cfa5c8e23f417d76cfa5c8e23f417d7642
UTF-8 魯扉깻A}v魯扉깻A}vB 11101001101011011010111111100110100010011000100111101010101110011011101101000001011111010111011011101001101011011010111111100110100010011000100111101010101110011011101101000001011111010111011001000010 e9adafe68989eab9bb417d76e9adafe68989eab9bb417d7642
UHC 魯扉깻A}v魯扉깻A}vB 11010110110110111101110111101010101100101010001001000001011111010111011011010110110110111101110111101010101100101010001001000001011111010111011001000010 d6dbddeab2a2417d76d6dbddeab2a2417d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)