To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 繒?止?窪?乙?變?‥繒?止?窪?乙?變?‥^ 1111101110001111001111111000111001111110001111111000110001000101001111111000100110110011001111111001110111001100001111111000000101100100111110111000111100111111100011100111111000111111100011000100010100111111100010011011001100111111100111011100110000111111100000010110010001011110 fb8f3f8e7e3f8c453f89b33f9dcc3f8164fb8f3f8e7e3f8c453f89b33f9dcc3f81645e
EUC-JP 繒?止?窪?乙?變蔣‥繒?止?窪?乙?變蔣‥^ 1000111111010100110101000011111110111011110111110011111110110111101001100011111110110010101101010011111111011010110011101000111111011001101101101010000111000101100011111101010011010100001111111011101111011111001111111011011110100110001111111011001010110101001111111101101011001110100011111101100110110110101000011100010101011110 8fd4d43fbbdf3fb7a63fb2b53fdace8fd9b6a1c58fd4d43fbbdf3fb7a63fb2b53fdace8fd9b6a1c55e
UTF-8 繒렰止렣窪렱乙잼變蔣‥繒렰止렣窪렱乙잼變蔣‥^ 11100111101110011001001011101011101000001011000011100110101011011010001011101011101000001010001111100111101010101010101011101011101000001011000111100100101110011001100111101100100111101011110011101000101011101000101011101000100101001010001111100010100000001010010111100111101110011001001011101011101000001011000011100110101011011010001011101011101000001010001111100111101010101010101011101011101000001011000111100100101110011001100111101100100111101011110011101000101011101000101011101000100101001010001111100010100000001010010101011110 e7b992eba0b0e6ada2eba0a3e7aaaaeba0b1e4b999ec9ebce8ae8ae894a3e280a5e7b992eba0b0e6ada2eba0a3e7aaaaeba0b1e4b999ec9ebce8ae8ae894a3e280a55e
UHC 繒렰止렣窪렱乙잼變蔣‥繒렰止렣窪렱乙잼變蔣‥^ 111100011111100110001110101111011111001010101101100011101011010011101000110000011000111010111110111010111110000011000000111010111101110010101000111011011111100010100001101001011111000111111001100011101011110111110010101011011000111010110100111010001100000110001110101111101110101111100000110000001110101111011100101010001110110111111000101000011010010101011110 f1f98ebdf2ad8eb4e8c18ebeebe0c0ebdca8edf8a1a5f1f98ebdf2ad8eb4e8c18ebeebe0c0ebdca8edf8a1a55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)