To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 藥??熬?ぐ娃??藥??熬?ぐ娃??^ 111001010101101000111111001111111110000010010010001111111000001010101110100010001010000100111111001111111110010101011010001111110011111111100000100100100011111110000010101011101000100010100001001111110011111101011110 e55a3f3fe0923f82ae88a13f3fe55a3f3fe0923f82ae88a13f3f5e
EUC-JP 藥??熬?ぐ娃??藥??熬?ぐ娃??^ 111010011011101100111111001111111101111111110010001111111010010010110000101100001010001100111111001111111110100110111011001111110011111111011111111100100011111110100100101100001011000010100011001111110011111101011110 e9bb3f3fdff23fa4b0b0a33f3fe9bb3f3fdff23fa4b0b0a33f3f5e
UTF-8 藥썸㉬熬뽬ぐ娃쒏뜆藥썸㉬熬뽬ぐ娃쒏뜆^ 11101000100101111010010111101100100011011011100011100011100010011010110011100111100001101010110011101011101111011010110011100011100000011001000011100101101010001000001111101100100100101000111111101011100111001000011011101000100101111010010111101100100011011011100011100011100010011010110011100111100001101010110011101011101111011010110011100011100000011001000011100101101010001000001111101100100100101000111111101011100111001000011001011110 e897a5ec8db8e389ace786acebbdace38190e5a883ec928feb9c86e897a5ec8db8e389ace786acebbdace38190e5a883ec928feb9c865e
UHC 藥썸㉬熬뽬ぐ娃쒏뜆藥썸㉬熬뽬ぐ娃쒏뜆^ 11100101101101111011110111100110101010001011110111101000101000101001011011101000101010101011000011101000110111111001110011100110100011011000100111100101101101111011110111100110101010001011110111101000101000101001011011101000101010101011000011101000110111111001110011100110100011011000100101011110 e5b7bde6a8bde8a296e8aab0e8df9ce68d89e5b7bde6a8bde8a296e8aab0e8df9ce68d895e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)