Which bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?澣荊???澣荊??^ 001111111110000001010100100011000111010000111111001111110011111111100000010101001000110001110100001111110011111101011110 3fe0548c743f3f3fe0548c743f3f5e
EUC-JP ?澣荊???澣荊??^ 001111111101111110110101101101111101010100111111001111110011111111011111101101011011011111010101001111110011111101011110 3fdfb5b7d53f3f3fdfb5b7d53f3f5e
UTF-8 뤋澣荊쮱쯈뤋澣荊쮱쯈^ 11101011101001001000101111100110101111101010001111101000100011011000101011101100101011101011000111101100101011111000100011101011101001001000101111100110101111101010001111101000100011011000101011101100101011101011000111101100101011111000100001011110 eba48be6bea3e88d8aecaeb1ecaf88eba48be6bea3e88d8aecaeb1ecaf885e
UHC 뤋澣荊쮱쯈뤋澣荊쮱쯈^ 100011111011101111111001110101001111101110101010101010001000111010101001010001001000111110111011111110011101010011111011101010101010100010001110101010010100010001011110 8fbbf9d4fbaaa88ea9448fbbf9d4fbaaa88ea9445e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
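
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed to be Python's closest equivalents to the charset labels above, and the helper names encode_bits and encoding_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns each unencodable character into '?'.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encoding_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encoding_table("^").items():
        print(label, binary, hexadecimal)
```

Run as a script, it prints one line per charset for the input "^". Pass any other short string to encoding_table to compare its encodings across the same charsets.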