To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮??遺?????竊??恂ル?沃??裕 0011111100111111001111111110100001001010001111110011111110001000111000100011111100111111001111110011111100111111111000101000011000111111001111111001110010010110100000111000101100111111100101111000000000111111001111111001011101010100 3f3f3fe84a3f3f88e23f3f3f3f3fe2863f3f9c96838b3f97803f3f9754
EUC-JP ???鍮??遺??孼??竊??恂ル?沃??裕 00111111001111110011111111101111101010110011111100111111101100001110010000111111001111111000111110111010110000110011111100111111111000111110011000111111001111111101011111110110101001011110101100111111110011011110000000111111001111111100110110110101 3f3f3fefab3f3fb0e43f3f8fbac33f3fe3e63f3fd7f6a5eb3fcde03f3fcdb5
UTF-8 略노쵐鍮섇젆遺삠걶孼꾩뮇竊먨톹恂ル늉沃쇨엽裕 111011111010010110110110111010111000010110111000111011001011010110010000111010011000110110101110111011001000010010000111111011001010000010000110111010011000000110111010111011001000001010100000111010101011000110110110111001011010110110111100111010101011111010101001111010111010111010000111111001111010101110001010111010111010100010101000111011011000011010111001111001101000000110000010111000111000001110101011111010111000101010001001111001101011001010000011111011001000011110101000111011001001011110111101111010001010001110010101 efa5b6eb85b8ecb590e98daeec8487eca086e981baec82a0eab1b6e5adbceabea9ebae87e7ab8aeba8a8ed86b9e68182e383abeb8a89e6b283ec87a8ec97bde8a395
UHC 略노쵐鍮섇젆遺삠걶孼꾩뮇竊먨톹恂ル늉沃쇨엽裕 1110010110110010101100111110101110101100100100101110101110111001100110001110010110100000100010011110101110110110101110111110001110000001100111001110010111101101100001001110110010010010100101101110111110111100100100001110010110110111100011011110001011100001101010111110101110110100101111111110100010101010101111001110101010111111101100011110101110101110 e5b2b3ebac92ebb998e5a089ebb6bbe3819ce5ed84ec9296efbc90e5b78de2e1abebb4bfe8aabceabfb1ebae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)