To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????T?????????ZgB 00111111001111110011111100111111001111110011111100111111001111110011111101010100001111110011111100111111001111110011111100111111001111110011111100111111010110100110011101000010 3f3f3f3f3f3f3f3f3f543f3f3f3f3f3f3f3f3f5a6742
SJIS-WIN 陲ア莉呎牛魄難スコT陲ア莉呎牛魄難スコZgB 11101000101000101011000111100100101110111001100111100110100010111000110111101001101011101001001111101111101111011011101001010100111010001010001010110001111001001011101110011001111001101000101110001101111010011010111010010011111011111011110110111010010110100110011101000010 e8a2b1e4bb99e68b8de9ae93efbdba54e8a2b1e4bb99e68b8de9ae93efbdba5a6742
EUC-JP 陲ア莉呎牛魄難スコT陲ア莉呎牛魄難スコZgB 11110000101001001000111010110001111010001011110111010010111010001011010111101101111100101011000011000110111100011000111010111101100011101011101001010100111100001010010010001110101100011110100010111101110100101110100010110101111011011111001010110000110001101111000110001110101111011000111010111010010110100110011101000010 f0a48eb1e8bdd2e8b5edf2b0c6f18ebd8eba54f0a48eb1e8bdd2e8b5edf2b0c6f18ebd8eba5a6742
UTF-8 陲ア莉呎牛魄難スコT陲ア莉呎牛魄難スコZgB 11101001100110011011001011101111101111011011000111101000100011101000100111100101100100011000111011100111100010011001101111101001101011011000010011101001100110111010001111101111101111011011110111101111101111011011101001010100111010011001100110110010111011111011110110110001111010001000111010001001111001011001000110001110111001111000100110011011111010011010110110000100111010011001101110100011111011111011110110111101111011111011110110111010010110100110011101000010 e999b2efbdb1e88e89e5918ee7899be9ad84e99ba3efbdbdefbdba54e999b2efbdb1e88e89e5918ee7899be9ad84e99ba3efbdbdefbdba5a6742
UHC ??莉?牛魄難??T??莉?牛魄難??ZgB 001111110011111111010111111010010011111111101001110110101101101111011110110100011111000100111111001111110101010000111111001111111101011111101001001111111110100111011010110110111101111011010001111100010011111100111111010110100110011101000010 3f3fd7e93fe9dadbded1f13f3f543f3fd7e93fe9dadbded1f13f3f5a6742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)