To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??a^h??a^fN}??a^h??a^fN{^ 00111111001111110110000101011110011010000011111100111111011000010101111001100110010011100111110100111111001111110110000101011110011010000011111100111111011000010101111001100110010011100111101101011110 3f3f615e683f3f615e664e7d3f3f615e683f3f615e664e7b5e
SJIS-WIN 灼失a^h灼失a^fN}灼失a^h灼失a^fN{^ 100011101101110010001110101110000110000101011110011010001000111011011100100011101011100001100001010111100110011001001110011111011000111011011100100011101011100001100001010111100110100010001110110111001000111010111000011000010101111001100110010011100111101101011110 8edc8eb8615e688edc8eb8615e664e7d8edc8eb8615e688edc8eb8615e664e7b5e
EUC-JP 灼失a^h灼失a^fN}灼失a^h灼失a^fN{^ 101111001101111010111100101110100110000101011110011010001011110011011110101111001011101001100001010111100110011001001110011111011011110011011110101111001011101001100001010111100110100010111100110111101011110010111010011000010101111001100110010011100111101101011110 bcdebcba615e68bcdebcba615e664e7dbcdebcba615e68bcdebcba615e664e7b5e
UTF-8 灼失a^h灼失a^fN}灼失a^h灼失a^fN{^ 1110011110000001101111001110010110100100101100010110000101011110011010001110011110000001101111001110010110100100101100010110000101011110011001100100111001111101111001111000000110111100111001011010010010110001011000010101111001101000111001111000000110111100111001011010010010110001011000010101111001100110010011100111101101011110 e781bce5a4b1615e68e781bce5a4b1615e664e7de781bce5a4b1615e68e781bce5a4b1615e664e7b5e
UHC 灼失a^h灼失a^fN}灼失a^h灼失a^fN{^ 111011011100011111100011111101110110000101011110011010001110110111000111111000111111011101100001010111100110011001001110011111011110110111000111111000111111011101100001010111100110100011101101110001111110001111110111011000010101111001100110010011100111101101011110 edc7e3f7615e68edc7e3f7615e664e7dedc7e3f7615e68edc7e3f7615e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)