To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 塢??搖??張ц?訝??節??堯??午ヤ?B 10011010110001110011111100111111100111011000101000111111001111111001001010100011100001001000100000111111111001100110001000111111001111111001000011011111001111110011111111101010100111110011111100111111100011001101111110000011100001000011111101000010 9ac73f3f9d8a3f3f92a384883fe6623f3f90df3f3fea9f3f3f8cdf83843f42
EUC-JP 塢??搖??張ц?訝??節??堯??午ヤ?B 11010100110010010011111100111111110110011110101000111111001111111100010010100101101001111110100000111111111010111100001100111111001111111100000011100001001111110011111111110100101000010011111100111111101110001110000110100101111001000011111101000010 d4c93f3fd9ea3f3fc4a5a7e83febc33f3fc0e13f3ff4a13f3fb8e1a5e43f42
UTF-8 塢곻슁搖얗뜈張ц룶訝롨뱚節녜뼻堯뗯눐午ヤ툗B 111001011010000110100010111010101011001110111011111011001000101010000001111001101001000010010110111011001001011010010111111010111001110010001000111001011011110010110101110100011000011011101011101000111011011011101000101010001001110111101011101000011010100011101011101100011001101011100111101011111000000011101011100001011001110011101011101111001011101111100101101000001010111111101011100101111010111111101011100010001001000011100101100011011000100011100011100000111010010011101101100010001001011101000010 e5a1a2eab3bbec8a81e69096ec9697eb9c88e5bcb5d186eba3b6e8a89deba1a8ebb19ae7af80eb859cebbcbbe5a0afeb97afeb8890e58d88e383a4ed889742
UHC 塢곻슁搖얗뜈張ц룶訝롨뱚節녜뼻堯뗯눐午ヤ툗B 11100111111100011000000111101111101111011011001111101000111101001011111011101001100011011000101111101101111001011010110011101000100011111010101111100100101110001000111011101000100100111000000111101111101111011011001111101001100101101011111011101000111010111000101111101110100001111010110011100111111011011010101111100100101110001000111001000010 e7f181efbdb3e8f4bee98d8bede5ace88fabe4b88ee89381efbdb3e996bee8eb8bee87ace7edabe4b88e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)