Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鴦???額??倚??腰??踰??袁??B 1110100111110001001111110011111100111111100010100111101000111111001111111001100011011111001111110011111110001101100110000011111100111111111001101111101000111111001111111110010111001101001111110011111101000010 e9f13f3f3f8a7a3f3f98df3f3f8d983f3fe6fa3f3fe5cd3f3f42
EUC-JP 鴦???額??倚??腰??踰??袁??B 1111001011110011001111110011111100111111101100111101101100111111001111111101000011100001001111110011111110111001111110000011111100111111111011001111110000111111001111111110101011001111001111110011111101000010 f2f33f3f3fb3db3f3fd0e13f3fb9f83f3fecfc3f3feacf3f3f42
UTF-8 鴦꾆쇱쪠額됰뱺倚닸돮腰뱀뇳踰됵쭗袁⑸윜B 11101001101101001010011011101010101111101000011011101100100001111011000111101100101010101010000011101001101000011000110111101011100100001011000011101011101100011011101011100101100000001001101011101011100010111011100011101011100011111010111011101000100001011011000011101011101100011000000011101011100001111011001111101000101110001011000011101011100100001011010111101100101011011001011111101000101000101000000111100010100100011011100011101100100111001001110001000010 e9b4a6eabe86ec87b1ecaaa0e9a18deb90b0ebb1bae5809aeb8bb8eb8faee885b0ebb180eb87b3e8b8b0eb90b5ecad97e8a281e291b8ec9c9c42
UHC 鴦꾆쇱쪠額됰뱺倚닸돮腰뱀뇳踰됵쭗袁⑸윜B 111001001110110010000100110011101011110011101100101001011001100111100100111111101000100111101011100100111010000011101011111011111011010011100110100010011011000111101001101001101011100111101100100001111001011111101011101100101000100111101111101001111000111111101010101111101010100111101011100111111001111101000010 e4ec84cebceca599e4fe89eb93a0ebefb4e689b1e9a6b9ec8797ebb289efa78feabea9eb9f9f42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
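
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


# Usage: print one row per charset for a short input string.
for label, (bits, hexa) in encode_table("額B").items():
    print(label, bits, hexa)
```

For the input "額B", the UTF-8 row comes out as e9a18d42, and the ISO-8859-1 row as 3f42 because 額 has no ISO-8859-1 representation.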