What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????~???????????? 001111110011111100111111001111110011111100111111001111110011111101111110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?賂??諸???~?賂??諸???第詐?? 001111111001100001000111001111110011111110001111100101000011111100111111001111110111111000111111100110000100011100111111001111111000111110010100001111110011111100111111100100011110011010001101101111000011111100111111 3f98473f3f8f943f3f3f7e3f98473f3f8f943f3f3f91e68dbc3f3f
EUC-JP 鋌賂??諸???~鋌賂??諸???第詐?? 10001111111001001011101111001111101010000011111100111111101111011111010000111111001111110011111101111110100011111110010010111011110011111010100000111111001111111011110111110100001111110011111100111111110000101110100010111010101111100011111100111111 8fe4bbcfa83f3fbdf43f3f3f7e8fe4bbcfa83f3fbdf43f3f3fc2e8babe3f3f
UTF-8 鋌賂렰렡諸쟉렰렮~鋌賂렰렡諸쟉렰렮第詐렰렣 11101001100010111000110011101000101100111000001011101011101000001011000011101011101000001010000111101000101010111011100011101100100111111000100111101011101000001011000011101011101000001010111001111110111010011000101110001100111010001011001110000010111010111010000010110000111010111010000010100001111010001010101110111000111011001001111110001001111010111010000010110000111010111010000010101110111001111010110010101100111010001010100110010000111010111010000010110000111010111010000010100011 e98b8ce8b382eba0b0eba0a1e8abb8ec9f89eba0b0eba0ae7ee98b8ce8b382eba0b0eba0a1e8abb8ec9f89eba0b0eba0aee7acace8a990eba0b0eba0a3
UHC 鋌賂렰렡諸쟉렰렮~鋌賂렰렡諸쟉렰렮第詐렰렣 1110111111111011110101101111000110001110101111011000111010110010111100001011001111000000111100011000111010111101100011101011101101111110111011111111101111010110111100011000111010111101100011101011001011110000101100111100000011110001100011101011110110001110101110111111000010101111110111101111000110001110101111011000111010110100 effbd6f18ebd8eb2f0b3c0f18ebd8ebb7eeffbd6f18ebd8eb2f0b3c0f18ebd8ebbf0afdef18ebd8eb4

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
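
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, the function name encode_all is made up for illustration, and individual characters may map differently than they do here. Characters a charset cannot represent are replaced by "?" (3f), as in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (charset, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("あ"):
        print(label, bits, hex_string)
```

For the input "あ", this gives e38182 for UTF-8, 82a0 for SJIS-WIN, and a4a2 for EUC-JP, while ISO-8859-1 falls back to 3f because it has no hiragana.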