To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 趙?義???韻???畯?楢???韻???B 111001101110001000111111100010110110000000111111001111110011111110001001010000110011111100111111001111111111101101101111001111111001001111101000001111110011111100111111100010010100001100111111001111110011111101000010 e6e23f8b603f3f3f89433f3f3ffb6f3f93e83f3f3f89433f3f3f42
EUC-JP 趙?義???韻???畯?楢???韻???B 11101100111001000011111110110101110000010011111100111111001111111011000110100100001111110011111100111111100011111100110110111011001111111100011011101010001111110011111100111111101100011010010000111111001111110011111101000010 ece43fb5c13f3f3fb1a43f3f3f8fcdbb3fc6ea3f3f3fb1a43f3f3f42
UTF-8 趙렡義얹렰렍韻펨렊렧畯렡楢얹렰렍韻펨렊렧B 11101000101101101001100111101011101000001010000111100111101111101010100111101100100101101011100111101011101000001011000011101011101000001000110111101001100111111011101111101101100011101010100011101011101000001000101011101011101000001010011111100111100101011010111111101011101000001010000111100110101001011010001011101100100101101011100111101011101000001011000011101011101000001000110111101001100111111011101111101101100011101010100011101011101000001000101011101011101000001010011101000010 e8b699eba0a1e7bea9ec96b9eba0b0eba08de99fbbed8ea8eba08aeba0a7e795afeba0a1e6a5a2ec96b9eba0b0eba08de99fbbed8ea8eba08aeba0a742
UHC 趙렡義얹렰렍韻펨렊렧畯렡楢얹렰렍韻펨렊렧B 1111000011100001100011101011001011101011111110011011111011110001100011101011110110001110101000111110101010100100110001101110100010001110101000011000111010110110111100011110000110001110101100101110101011111001101111101111000110001110101111011000111010100011111010101010010011000110111010001000111010100001100011101011011001000010 f0e18eb2ebf9bef18ebd8ea3eaa4c6e88ea18eb6f1e18eb2eaf9bef18ebd8ea3eaa4c6e88ea18eb642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)