To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???znf???zn^}Y???znf???zn^}bE 0011111100111111001111110111101001101110011001100011111100111111001111110111101001101110010111100111110101011001001111110011111100111111011110100110111001100110001111110011111100111111011110100110111001011110011111010110001001000101 3f3f3f7a6e663f3f3f7a6e5e7d593f3f3f7a6e663f3f3f7a6e5e7d6245
SJIS-WIN 勺紗猥znf勺紗猥zn^}Y勺紗猥znf勺紗猥zn^}bE 1000111011011001100011101101000111100000110011100111101001101110011001101000111011011001100011101101000111100000110011100111101001101110010111100111110101011001100011101101100110001110110100011110000011001110011110100110111001100110100011101101100110001110110100011110000011001110011110100110111001011110011111010110001001000101 8ed98ed1e0ce7a6e668ed98ed1e0ce7a6e5e7d598ed98ed1e0ce7a6e668ed98ed1e0ce7a6e5e7d6245
EUC-JP 勺紗猥znf勺紗猥zn^}Y勺紗猥znf勺紗猥zn^}bE 1011110011011011101111001101001111100000110100000111101001101110011001101011110011011011101111001101001111100000110100000111101001101110010111100111110101011001101111001101101110111100110100111110000011010000011110100110111001100110101111001101101110111100110100111110000011010000011110100110111001011110011111010110001001000101 bcdbbcd3e0d07a6e66bcdbbcd3e0d07a6e5e7d59bcdbbcd3e0d07a6e66bcdbbcd3e0d07a6e5e7d6245
UTF-8 勺紗猥znf勺紗猥zn^}Y勺紗猥znf勺紗猥zn^}bE 1110010110001011101110101110011110110100100101111110011110001100101001010111101001101110011001101110010110001011101110101110011110110100100101111110011110001100101001010111101001101110010111100111110101011001111001011000101110111010111001111011010010010111111001111000110010100101011110100110111001100110111001011000101110111010111001111011010010010111111001111000110010100101011110100110111001011110011111010110001001000101 e58bbae7b497e78ca57a6e66e58bbae7b497e78ca57a6e5e7d59e58bbae7b497e78ca57a6e66e58bbae7b497e78ca57a6e5e7d6245
UHC 勺紗猥znf勺紗猥zn^}Y勺紗猥znf勺紗猥zn^}bE 1110110111000011110111101110100111101000111001010111101001101110011001101110110111000011110111101110100111101000111001010111101001101110010111100111110101011001111011011100001111011110111010011110100011100101011110100110111001100110111011011100001111011110111010011110100011100101011110100110111001011110011111010110001001000101 edc3dee9e8e57a6e66edc3dee9e8e57a6e5e7d59edc3dee9e8e57a6e66edc3dee9e8e57a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)