To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????殃▽??????????殃▽?B 0011111100111111001111110011111100111111001111110011111100111111001111111001111101101001100000011010010000111111001111110011111100111111001111110011111100111111001111110011111100111111100111110110100110000001101001000011111101000010 3f3f3f3f3f3f3f3f3f9f6981a43f3f3f3f3f3f3f3f3f3f9f6981a43f42
EUC-JP ?????????殃▽??????????殃▽?B 0011111100111111001111110011111100111111001111110011111100111111001111111101110111001010101000101010011000111111001111110011111100111111001111110011111100111111001111110011111100111111110111011100101010100010101001100011111101000010 3f3f3f3f3f3f3f3f3fddcaa2a63f3f3f3f3f3f3f3f3f3fddcaa2a63f42
UTF-8 溜뷸븽溜뽯졋咽뽯졋殃▽섞溜뷸븽溜뽯졋咽뽯졋殃▽섞B 11101111101001111000101111101011101101111011100011101011101110001011110111101111101001111000101111101011101111011010111111101100101000011000101111101111101001101001111011101011101111011010111111101100101000011000101111100110101011101000001111100010100101101011110111101100100001001001111011101111101001111000101111101011101101111011100011101011101110001011110111101111101001111000101111101011101111011010111111101100101000011000101111101111101001101001111011101011101111011010111111101100101000011000101111100110101011101000001111100010100101101011110111101100100001001001111001000010 efa78bebb7b8ebb8bdefa78bebbdafeca18befa69eebbdafeca18be6ae83e296bdec849eefa78bebb7b8ebb8bdefa78bebbdafeca18befa69eebbdafeca18be6ae83e296bdec849e42
UHC 溜뷸븽溜뽯졋咽뽯졋殃▽섞溜뷸븽溜뽯졋咽뽯졋殃▽섞B 11101010111111101011101011100110100101011010011011101010111111101001011011101011101000001011101011100110111011001001011011101011101000001011101011100100111010101010000111100100101111001010111111101010111111101011101011100110100101011010011011101010111111101001011011101011101000001011101011100110111011001001011011101011101000001011101011100100111010101010000111100100101111001010111101000010 eafebae695a6eafe96eba0bae6ec96eba0bae4eaa1e4bcafeafebae695a6eafe96eba0bae6ec96eba0bae4eaa1e4bcaf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)