To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 載??障?齎?裨????制ь?n?祐?喪?B 10001101110110100011111100111111100011111110000100111111111001101101100000111111111001011110100100111111001111110011111100111111100100001010011110000100100011100011111110000010100011100011111110010111010100110011111110010001011100100011111101000010 8dda3f3f8fe13fe6d83fe5e93f3f3f3f90a7848e3f828e3f97533f91723f42
EUC-JP 載??障?齎?裨????制ь?n?祐?喪?B 10111010110111000011111100111111101111101110001100111111111011001101101000111111111010101110101100111111001111110011111100111111110000001010100110100111111011100011111110100011111011100011111111001101101101000011111111000001110100110011111101000010 badc3f3fbee33fecda3feaeb3f3f3f3fc0a9a7ee3fa3ee3fcdb43fc1d33f42
UTF-8 載언ㄼ障렍齎흗裨뤱횓등갼制ь죳n꽂祐쪕喪렦B 111010001011110010001001111011001001011010111000111000111000010010111100111010011001101010011100111010111010000010001101111010011011110110001110111011011001110110010111111010001010001110101000111010111010010010110001111011011001101010010011111010111001001110110001111010101011000010111100111001011000100010110110110100011000110011101100101000111011001111101111101111011000111011101010101111011000001011100111101001011001000011101100101010101001010111100101100101101010101011101011101000001010011001000010 e8bc89ec96b8e384bce99a9ceba08de9bd8eed9d97e8a3a8eba4b1ed9a93eb93b1eab0bce588b6d18ceca3b3efbd8eeabd82e7a590ecaa95e596aaeba0a642
UHC 載언ㄼ障렍齎흗裨뤱횓등갼制ь죳n꽂祐쪕喪렦B 11101110101100001011111011110000101001001010110011101110101000011000111010100011111011101011001011001000111010011101111010100101100011111101111111000011100011101011010111101110101100001011111011110000101001001010110011101110101000011000111010100011111011101011001011001000111010011101111010100101100011111101111111000011100011101011010101000010 eeb0bef0a4aceea18ea3eeb2c8e9dea58fdfc38eb5eeb0bef0a4aceea18ea3eeb2c8e9dea58fdfc38eb542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)