To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????}??????????{^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101111101001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 潮扇???眺????}潮扇???眺????{^ 1001001010101010100100001110111000111111001111110011111110010010101011010011111100111111001111110011111101111101100100101010101010010000111011100011111100111111001111111001001010101101001111110011111100111111001111110111101101011110 92aa90ee3f3f3f92ad3f3f3f3f7d92aa90ee3f3f3f92ad3f3f3f3f7b5e
EUC-JP 潮扇???眺????}潮扇???眺????{^ 1100010010101100110000001111000000111111001111110011111111000100101011110011111100111111001111110011111101111101110001001010110011000000111100000011111100111111001111111100010010101111001111110011111100111111001111110111101101011110 c4acc0f03f3f3fc4af3f3f3f3f7dc4acc0f03f3f3fc4af3f3f3f3f7b5e
UTF-8 潮扇렱렾셰眺렯롉렯렔}潮扇렱렾셰眺렯롉렯렔{^ 111001101011110110101110111001101000100110000111111010111010000010110001111010111010000010111110111011001000010110110000111001111001110010111010111010111010000010101111111010111010000110001001111010111010000010101111111010111010000010010100011111011110011010111101101011101110011010001001100001111110101110100000101100011110101110100000101111101110110010000101101100001110011110011100101110101110101110100000101011111110101110100001100010011110101110100000101011111110101110100000100101000111101101011110 e6bdaee68987eba0b1eba0beec85b0e79cbaeba0afeba189eba0afeba0947de6bdaee68987eba0b1eba0beec85b0e79cbaeba0afeba189eba0afeba0947b5e
UHC 潮扇렱렾셰眺렯롉렯렔}潮扇렱렾셰眺렯롉렯렔{^ 11110000110011011110000010111111100011101011111010001110110001101011110011001110111100001101001010001110101111001000111011001111100011101011110010001110101010010111110111110000110011011110000010111111100011101011111010001110110001101011110011001110111100001101001010001110101111001000111011001111100011101011110010001110101010010111101101011110 f0cde0bf8ebe8ec6bccef0d28ebc8ecf8ebc8ea97df0cde0bf8ebe8ec6bccef0d28ebc8ecf8ebc8ea97b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
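
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation. The function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the labels above, so results may differ from the table for some edge-case characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with b"?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "潮{^"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `encode_bits("潮", "utf-8")` yields the hex string `e6bdae`, the same leading bytes as the UTF-8 row above.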