To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ???韋??幽??域??茵??碎??壤??h 00111111001111110011111111101000111010000011111100111111100101110100100000111111001111111000100011100110001111110011111111100100100111110011111100111111111000011110101000111111001111111001101011011111001111110011111101101000 3f3f3fe8e83f3f97483f3f88e63f3fe49f3f3fe1ea3f3f9adf3f3f68
EUC-JP ???韋??幽??域??茵??碎??壤??h 00111111001111110011111111110000111010100011111100111111110011011010100100111111001111111011000011101000001111110011111111101000101000010011111100111111111000101110110000111111001111111101010011100001001111110011111101101000 3f3f3ff0ea3f3fcda93f3fb0e83f3fe8a13f3fe2ec3f3fd4e13f3f68
UTF-8 嶺뚣뢿韋됵㎕幽귙렎域㏃슙茵먨넼碎ㅼ뵰壤쎻뀧h 11101111101001101010101111101011100110101010001111101011101000101011111111101001100111111000101111101011100100001011010111100011100011101001010111100101101110011011110111101010101101111001100111101011101000001000111011100101100111111001111111100011100011111000001111101100100010101001100111101000100011001011010111101011101010001010100011101011100001001011110011100111101000101000111011100011100001011011110011101011101101011011000011100101101000111010010011101100100011101011101111101011100000001010011101101000 efa6abeb9aa3eba2bfe99f8beb90b5e38e95e5b9bdeab799eba08ee59f9fe38f83ec8a99e88cb5eba8a8eb84bce7a28ee385bcebb5b0e5a3a4ec8ebbeb80a768
UHC 嶺뚣뢿韋됵㎕幽귙렎域㏃슙茵먨넼碎ㅼ뵰壤쎻뀧h 11100111101011011000110011100011100011111000001011101010110111111000100111101111101001111010000111101010111010111000001011100011100011101010010011100110101101001010011111101100100110101010011111101100111000001001000011100101100001101011011011100001111011111010010011101100100101001010111011100101101111011001101111100010100001011001111001101000 e7ad8ce38f82eadf89efa7a1eaeb82e38ea4e6b4a7ec9aa7ece090e586b6e1efa4ec94aee5bd9be2859e68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)