What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 晶ァ晶諶ァ而晶諶アハ諶アメ諶ィ自 10001111101110111111010010001110101001111000111110111011111110011100010011111011101010101010011110001110101001111000111110111011111110011011000011111011101010101011000111001010111110111010101010110001110100101111101110101010101010001000111010101001 8fbbf48ea78fbbf9c4fbaaa78ea78fbbf9b0fbaab1cafbaab1d2fbaaa88ea9
EUC-JP 晶?ァ晶?諶ァ而晶?諶アハ諶アメ諶ィ自 101111101011110100111111100011101010011110111110101111010011111110001111110111101011010110001110101001111011110010101001101111101011110100111111100011111101111010110101100011101011000110001110110010101000111111011110101101011000111010110001100011101101001010001111110111101011010110001110101010001011110010101011 bebd3f8ea7bebd3f8fdeb58ea7bca9bebd3f8fdeb58eb18eca8fdeb58eb18ed28fdeb58ea8bcab
UTF-8 晶ァ晶諶ァ而晶諶アハ諶アメ諶ィ自 111001101001100110110110111011101000110010111101111011111011110110100111111001101001100110110110111011101001110010011111111010001010101110110110111011111011110110100111111010001000000010001100111001101001100110110110111011101001110010001011111010001010101110110110111011111011110110110001111011111011111010001010111010001010101110110110111011111011110110110001111011111011111010010010111010001010101110110110111011111011110110101000111010001000011110101010 e699b6ee8cbdefbda7e699b6ee9c9fe8abb6efbda7e8808ce699b6ee9c8be8abb6efbdb1efbe8ae8abb6efbdb1efbe92e8abb6efbda8e887aa
UHC 晶??晶?諶?而晶?諶??諶??諶?自 11101111110111000011111100111111111011111101110000111111111001001010011000111111111011001011101111101111110111000011111111100100101001100011111100111111111001001010011000111111001111111110010010100110001111111110110110111011 efdc3f3fefdc3fe4a63fecbbefdc3fe4a63f3fe4a63f3fe4a63fedbb

SJIS-Win, EUC-JP: legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
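
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page; it only assumes that Python's built-in codecs `cp932`, `euc_jp`, and `cp949` are reasonable stand-ins for SJIS-WIN, EUC-JP, and UHC, and that characters a charset cannot represent are replaced with `?` (byte 0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Map the labels used in the table to Python codec names.
# Assumption: these codecs approximate the charsets the page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For a single character such as "あ", the UTF-8 row is three bytes long while the SJIS-WIN and EUC-JP rows are two bytes each, and ISO-8859-1 falls back to a single `?` byte because it has no Japanese characters.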