To which bit string is a character (or a short run of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 衍??萸у?猿??嗚??衍??萸у?猿??嗚??B 1001111110100101001111110011111111100100110011101000010010000101001111111000100110001110001111110011111110011010011010100011111100111111100111111010010100111111001111111110010011001110100001001000010100111111100010011000111000111111001111111001101001101010001111110011111101000010 9fa53f3fe4ce84853f898e3f3f9a6a3f3f9fa53f3fe4ce84853f898e3f3f9a6a3f3f42
EUC-JP 衍??萸у?猿??嗚??衍??萸у?猿??嗚??B 1101111010100111001111110011111111101000110100001010011111100101001111111011000111101110001111110011111111010011110010110011111100111111110111101010011100111111001111111110100011010000101001111110010100111111101100011110111000111111001111111101001111001011001111110011111101000010 dea73f3fe8d0a7e53fb1ee3f3fd3cb3f3fdea73f3fe8d0a7e53fb1ee3f3fd3cb3f3f42
UTF-8 衍됰뎽萸у푻猿뗫떊嗚삳뵝衍됰뎽萸у푻猿뗫떊嗚삳뵝B 1110100010100001100011011110101110010000101100001110101110001110101111011110100010010000101110001101000110000011111011011001000110111011111001111000110010111111111010111001011110101011111010111001011010001010111001011001011110011010111011001000001010110011111010111011010110011101111010001010000110001101111010111001000010110000111010111000111010111101111010001001000010111000110100011000001111101101100100011011101111100111100011001011111111101011100101111010101111101011100101101000101011100101100101111001101011101100100000101011001111101011101101011001110101000010 e8a18deb90b0eb8ebde890b8d183ed91bbe78cbfeb97abeb968ae5979aec82b3ebb59de8a18deb90b0eb8ebde890b8d183ed91bbe78cbfeb97abeb968ae5979aec82b3ebb59d42
UHC 衍됰뎽萸у푻猿뗫떊嗚삳뵝衍됰뎽萸у푻猿뗫떊嗚삳뵝B 11100110111000101000100111101011100010011001000011101011101011011010110011100101101111101000011111101010101110111000101111101011100010111010000011100111111100001011101111101011100101001001110111100110111000101000100111101011100010011001000011101011101011011010110011100101101111101000011111101010101110111000101111101011100010111010000011100111111100001011101111101011100101001001110101000010 e6e289eb8990ebadace5be87eabb8beb8ba0e7f0bbeb949de6e289eb8990ebadace5be87eabb8beb8ba0e7f0bbeb949d42

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
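
The page does not show how the table above is computed, so the following is only a minimal sketch of the same idea in Python. It assumes that Python's `cp932` and `cp949` codecs correspond to the SJIS-WIN and UHC rows; the names `CODECS`, `encode_row`, and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions, not taken from the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the single ASCII character `B`, every charset in the sketch yields the same byte, `42` in hexadecimal, which matches the final byte of each row in the table.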