To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鞜イ隱、蕘丞「イ襞掀鞜イ隱、蕘丞「イ襞掀B 111010001101111110110010111010001010101010100100111001001111101110001111111001011010001010110010111001011111110010011101011101101110100011011111101100101110100010101010101001001110010011111011100011111110010110100010101100101110010111111100100111010111011001000010 e8dfb2e8aaa4e4fb8fe5a2b2e5fc9d76e8dfb2e8aaa4e4fb8fe5a2b2e5fc9d7642
EUC-JP 鞜イ隱、蕘丞「イ襞掀鞜イ隱、蕘丞「イ襞掀B 1111000011100001100011101011001011110000101011001000111010100100111010001111110110111110111001111000111010100010100011101011001011101010111111101101100111010111111100001110000110001110101100101111000010101100100011101010010011101000111111011011111011100111100011101010001010001110101100101110101011111110110110011101011101000010 f0e18eb2f0ac8ea4e8fdbee78ea28eb2eafed9d7f0e18eb2f0ac8ea4e8fdbee78ea28eb2eafed9d742
UTF-8 鞜イ隱、蕘丞「イ襞掀鞜イ隱、蕘丞「イ襞掀B 11101001100111101001110011101111101111011011001011101001100110101011000111101111101111011010010011101000100101011001100011100100101110001001111011101111101111011010001011101111101111011011001011101000101001011001111011100110100011101000000011101001100111101001110011101111101111011011001011101001100110101011000111101111101111011010010011101000100101011001100011100100101110001001111011101111101111011010001011101111101111011011001011101000101001011001111011100110100011101000000001000010 e99e9cefbdb2e99ab1efbda4e89598e4b89eefbda2efbdb2e8a59ee68e80e99e9cefbdb2e99ab1efbda4e89598e4b89eefbda2efbdb2e8a59ee68e8042
UHC ??隱??丞??????隱??丞????B 00111111001111111110101111011111001111110011111111100011101010100011111100111111001111110011111100111111001111111110101111011111001111110011111111100011101010100011111100111111001111110011111101000010 3f3febdf3f3fe3aa3f3f3f3f3f3febdf3f3fe3aa3f3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)