To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 瘟??岳???у?n}瘟??岳???у?n{^ 1110000110001001001111110011111110001010011110000011111100111111001111111000010010000101001111110110111001111101111000011000100100111111001111111000101001111000001111110011111100111111100001001000010100111111011011100111101101011110 e1893f3f8a783f3f3f84853f6e7de1893f3f8a783f3f3f84853f6e7b5e
EUC-JP 瘟??岳???у?n}瘟??岳???у?n{^ 1110000111101001001111110011111110110011110110010011111100111111001111111010011111100101001111110110111001111101111000011110100100111111001111111011001111011001001111110011111100111111101001111110010100111111011011100111101101011110 e1e93f3fb3d93f3f3fa7e53f6e7de1e93f3fb3d93f3f3fa7e53f6e7b5e
UTF-8 瘟룡릍岳쀨갬歷у콪n}瘟룡릍岳쀨갬歷у콪n{^ 111001111001100010011111111010111010001110100001111010111010011010001101111001011011001010110011111011001000000010101000111010101011000010101100111011111010011010001100110100011000001111101100101111011010101001101110011111011110011110011000100111111110101110100011101000011110101110100110100011011110010110110010101100111110110010000000101010001110101010110000101011001110111110100110100011001101000110000011111011001011110110101010011011100111101101011110 e7989feba3a1eba68de5b2b3ec80a8eab0acefa68cd183ecbdaa6e7de7989feba3a1eba68de5b2b3ec80a8eab0acefa68cd183ecbdaa6e7b5e
UHC 瘟룡릍岳쀨갬歷у콪n}瘟룡릍岳쀨갬歷у콪n{^ 1110100010110000101101111110011010111000101011001110010010111111100101111110100010110000101101111110011010111000101011001110010110110001100111100110111001111101111010001011000010110111111001101011100010101100111001001011111110010111111010001011000010110111111001101011100010101100111001011011000110011110011011100111101101011110 e8b0b7e6b8ace4bf97e8b0b7e6b8ace5b19e6e7de8b0b7e6b8ace4bf97e8b0b7e6b8ace5b19e6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
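
The kind of conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The function name `encode_table` is hypothetical. The codec names are Python's own: `cp932` stands in for SJIS-Win and `cp949` for UHC. It also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the table.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary string and as a hexadecimal string.
# Assumption: cp932 ~ SJIS-Win, cp949 ~ UHC (Python codec names).
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, binary_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for name, binary, hexa in encode_table("n{^"):
        print(name, binary, hexa)
```

For the ASCII tail `n{^` every charset yields the same bytes, 6e7b5e, which matches the last three bytes of each row in the table. Differences between charsets only appear for non-ASCII input.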