To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 俉??節??節b?蘖??也??繹??松??B 111110100110000100111111001111111001000011011111001111110011111110010000110111111000001010000010001111111001111101010000001111110011111110010110111001110011111100111111111000111000100000111111001111111000111110111100001111110011111101000010 fa613f3f90df3f3f90df82823f9f503f3f96e73f3fe3883f3f8fbc3f3f42
EUC-JP 俉??節??節b?蘖??也??繹??松??B 10001111101100011011101100111111001111111100000011100001001111110011111111000000111000011010001111100010001111111101110110110001001111110011111111001100111010010011111100111111111001011110100000111111001111111011111010111110001111110011111101000010 8fb1bb3f3fc0e13f3fc0e1a3e23fddb13f3fcce93f3fe5e83f3fbebe3f3f42
UTF-8 俉녑쪍節얏봄節b댌蘖뤺뜵也뉛슭繹쏙숲松득릯B 11100100101111111000100111101011100001011001000111101100101010101000110111100111101011111000000011101100100101101000111111101011101101001000010011100111101011111000000011101111101111011000001011101011100011001000110011101000100110001001011011101011101001001011101011101011100111001011010111100100101110011001111111101011100010011001101111101100100010101010110111100111101110011011100111101100100011111001100111101100100010001011001011100110100111011011111011101011100100111001110111101011101001101010111101000010 e4bf89eb8591ecaa8de7af80ec968febb484e7af80efbd82eb8c8ce89896eba4baeb9cb5e4b99feb899bec8aade7b9b9ec8f99ec88b2e69dbeeb939deba6af42
UHC 俉녑쪍節얏봄節b댌蘖뤺뜵也뉛슭繹쏙숲松득릯B 11100111111010111011001111100101101001011000011111101111101111011011111011100110101110101011110111101111101111011010001111100010100010001011010111100101111011101000111111101000100011011011001111100101101001011000011111101111101111011011111011100110101110101011110111101111101111011010001111100001111001101011010111100110100100001000111101000010 e7ebb3e5a587efbdbee6babdefbda3e288b5e5ee8fe88db3e5a587efbdbee6babdefbda3e1e6b5e6908f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)