What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????L????~ | 00111111001111110011111100111111010011000011111100111111001111110011111101111110 | 3f3f3f3f4c3f3f3f3f7e |
| SJIS-WIN | 澱???L綜???~ | 100100110110001000111111001111110011111101001100100100011000111000111111001111110011111101111110 | 93623f3f3f4c918e3f3f3f7e |
| EUC-JP | 澱???L綜???~ | 110001011100001100111111001111110011111101001100110000011110111000111111001111110011111101111110 | c5c33f3f3f4cc1ee3f3f3f7e |
| UTF-8 | 澱ㆁ렰렕L綜숄렰렲~ | 1110011010111110101100011110001110000110100000011110101110100000101100001110101110100000100101010100110011100111101101101001110011101100100010001000010011101011101000001011000011101011101000001011001001111110 | e6beb1e38681eba0b0eba0954ce7b69cec8884eba0b0eba0b27e |
| UHC | 澱ㆁ렰렕L綜숄렰렲~ | 111011101111111010100100111100011000111010111101100011101010101001001100111100001111110010111100111100011000111010111101100011101011111101111110 | eefea4f18ebd8eaa4cf0fcbcf18ebd8ebf7e |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f) in the results.
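
A conversion like the one in the results can be approximated in a few lines of Python. This is only a sketch: the page does not say how it is implemented, and the codec names below (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's closest equivalents, chosen here as an assumption. The helper names `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with "?" (byte 3f), which seems to match what the results show.

```python
# Assumed mapping from the charset labels in the results to Python codec names.
# The original page may use a different library with slightly different tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = data.decode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("澱L~").items():
        print(label, shown, bits, hexstr)
```

Calling `convert` with the full sample string from the results should give the same kind of rows, although legacy code pages differ between implementations, so the CP932 and UHC bytes are not guaranteed to agree with the page in every case.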