To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????佚??????????B 00111111001111110011111100111111001111110011111100111111001111110011111110011000110000110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f98c33f3f3f3f3f3f3f3f3f3f42
EUC-JP ?????????佚??????????B 00111111001111110011111100111111001111110011111100111111001111110011111111010000110001010011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3fd0c53f3f3f3f3f3f3f3f3f3f42
UTF-8 溜븍젿溜븍젛溜븐뀛佚뚮젿溜븍젡溜븐뀓溜봗B 11101111101001111000101111101011101110001000110111101100101000001011111111101111101001111000101111101011101110001000110111101100101000001001101111101111101001111000101111101011101110001001000011101011100000001001101111100100101111011001101011101011100110101010111011101100101000001011111111101111101001111000101111101011101110001000110111101100101000001010000111101111101001111000101111101011101110001001000011101011100000001001001111101111101001111000101111101011101101001001011101000010 efa78bebb88deca0bfefa78bebb88deca09befa78bebb890eb809be4bd9aeb9aaeeca0bfefa78bebb88deca0a1efa78bebb890eb8093efa78bebb49742
UHC 溜븍젿溜븍젛溜븐뀛佚뚮젿溜븍젡溜븐뀓溜봗B 1110101011111110101110101110101110100000101100011110101011111110101110101110101110100000100101111110101011111110101110101110110010000101100101001110110011101010100011001110101110100000101100011110101011111110101110101110101110100000100110101110101011111110101110101110110010000101100011011110101011111110100101000101010001000010 eafebaeba0b1eafebaeba097eafebaec8594ecea8ceba0b1eafebaeba09aeafebaec858deafe945442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)