To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 擾∽?午ヨ?褥??輿??節??五??蟯??^ 10001111111011111000000111100100001111111000110011011111100000111000100000111111111001011111000100111111001111111001011101100000001111110011111110010000110111110011111100111111100011001101110000111111001111111110010110110010001111110011111101011110 8fef81e43f8cdf83883fe5f13f3f97603f3f90df3f3f8cdc3f3fe5b23f3f5e
EUC-JP 擾∽?午ヨ?褥??輿??節??五??蟯??^ 10111110111100011010001011100110001111111011100011100001101001011110100000111111111010101111001100111111001111111100110111000001001111110011111111000000111000010011111100111111101110001101111000111111001111111110101010110100001111110011111101011110 bef1a2e63fb8e1a5e83feaf33f3fcdc13f3fc0e13f3fb8de3f3feab43f3f5e
UTF-8 擾∽슉午ヨ땽褥⑶㉭輿귡삞節듣룶五볣낑蟯얏쳜^ 11100110100100111011111011100010100010001011110111101100100010101000100111100101100011011000100011100011100000111010100011101011100101011011110111101000101001001010010111100010100100011011011011100011100010011010110111101000101111001011111111101010101101111010000111101100100000101001111011100111101011111000000011101011100100111010001111101011101000111011011011100100101110101001010011101011101100111010001111101011100000101001000111101000100111111010111111101100100101101000111111101100101100111001110001011110 e693bee288bdec8a89e58d88e383a8eb95bde8a4a5e291b6e389ade8bcbfeab7a1ec829ee7af80eb93a3eba3b6e4ba94ebb3a3eb8291e89fafec968fecb39c5e
UHC 擾∽슉午ヨ땽褥⑶㉭輿귡삞節듣룶五볣낑蟯얏쳜^ 11101000111101101010000111101111101111011011010111100111111011011010101111101000100010111001001111101001101100111010100111101001101010001011111011100110101010111000001011101001100110001010000111101111101111011011010111101000100011111010101111100111111010011001001111101001101100111010100111101001101010001011111011100110101010111000001001011110 e8f6a1efbdb5e7edabe88b93e9b3a9e9a8bee6ab82e998a1efbdb5e88fabe7e993e9b3a9e9a8bee6ab825e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)