What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汚??節??節????汚??節??節????B 1000100110011000001111110011111110010000110111110011111100111111100100001101111100111111001111110011111100111111100010011001100000111111001111111001000011011111001111110011111110010000110111110011111100111111001111110011111101000010 89983f3f90df3f3f90df3f3f3f3f89983f3f90df3f3f90df3f3f3f3f42
EUC-JP 汚??節?ł節????汚??節?ł節????B 101100011111100000111111001111111100000011100001001111111000111110101001110010001100000011100001001111110011111100111111001111111011000111111000001111110011111111000000111000010011111110001111101010011100100011000000111000010011111100111111001111110011111101000010 b1f83f3fc0e13f8fa9c8c0e13f3f3f3fb1f83f3fc0e13f8fa9c8c0e13f3f3f3f42
UTF-8 汚녷쮭節됮ł節면쐩廬쾖汚녷쮭節됮ł節면쐩廬쾖B 1110011010110001100110101110101110000101101101111110110010101110101011011110011110101111100000001110101110010000101011101100010110000010111001111010111110000000111010111010100110110100111011001001000010101001111011111010011010000010111011001011111010010110111001101011000110011010111010111000010110110111111011001010111010101101111001111010111110000000111010111001000010101110110001011000001011100111101011111000000011101011101010011011010011101100100100001010100111101111101001101000001011101100101111101001011001000010 e6b19aeb85b7ecaeade7af80eb90aec582e7af80eba9b4ec90a9efa682ecbe96e6b19aeb85b7ecaeade7af80eb90aec582e7af80eba9b4ec90a9efa682ecbe9642
UHC 汚녷쮭節됮ł節면쐩廬쾖汚녷쮭節됮ł節면쐩廬쾖B 111001111111110110000110111001101010100010001010111011111011110110001001111010011010100110101001111011111011110110111000111010011001110010001110111001011111111010110010011010011110011111111101100001101110011010101000100010101110111110111101100010011110100110101001101010011110111110111101101110001110100110011100100011101110010111111110101100100110100101000010 e7fd86e6a88aefbd89e9a9a9efbdb8e99c8ee5feb269e7fd86e6a88aefbd89e9a9a9efbdb8e99c8ee5feb26942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
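
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names are assumptions about which Python codecs correspond to each charset label (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` runs in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("汚B").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.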