Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歪??雍???←??????????烏?? 10011000011000110011111100111111111010001011010000111111001111110011111110000001101010010011111100111111001111110011111100111111001111110011111100111111001111110011111110001001010001110011111100111111 98633f3fe8b43f3f3f81a93f3f3f3f3f3f3f3f3f3f89473f3f
EUC-JP 歪??雍???←??????????烏?? 11001111110001000011111100111111111100001011011000111111001111110011111110100010101010110011111100111111001111110011111100111111001111110011111100111111001111110011111110110001101010000011111100111111 cfc43f3ff0b63f3f3fa2ab3f3f3f3f3f3f3f3f3f3fb1a83f3f
UTF-8 歪뺞씧雍㎪략溜←뀥溜뽬뵣溜롩굲呂삯콍烏녿젶 111001101010110110101010111010111011101010011110111011001001010010100111111010011001101110001101111000111000111010101010111010111001111010110101111011111010011110001011111000101000011010010000111010111000000010100101111011111010011110001011111010111011110110101100111010111011010110100011111011111010011110001011111010111010000110101001111010101011010110110010111011111010011010000000111011001000001010101111111011001011110110001101111001111000001110001111111010111000010110111111111011001010000010110110 e6adaaebba9eec94a7e99b8de38eaaeb9eb5efa78be28690eb80a5efa78bebbdacebb5a3efa78beba1a9eab5b2efa680ec82afecbd8de7838feb85bfeca0b6
UHC 歪뺞씧雍㎪략溜←뀥溜뽬뵣溜롩굲呂삯콍烏녿젶 111010001110000010010101111001101001110110111011111010001011110010100111111001101011011110101011111010101111111010100001111001111000010110011100111010101111111010010110111010001001010010100011111010101111111010001110111010011000001010010101111001011111101110111011111010011011000110001001111010001010000110000110111010111010000010101010 e8e095e69dbbe8bca7e6b7abeafea1e7859ceafe96e894a3eafe8ee98295e5fbbbe9b189e8a186eba0aa

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
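
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The function names `encode_row` and `convert` and the `CHARSETS` mapping are hypothetical, and the sketch assumes Python's built-in `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs are close enough stand-ins for the charsets listed above (`cp932` for SJIS-WIN, `cp949` for UHC). Characters a charset cannot represent are replaced with `?` (0x3f), which is the pattern visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # assumption: cp932 approximates SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: cp949 is Python's codec for UHC
}


def encode_row(text, codec):
    """Encode text with one codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Build one (binary, hex) pair per charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("←").items():
        print(name, bits, hex_string)
```

Keeping the charset-to-codec mapping in a single dictionary makes it easy to add or swap a codec if a different library names it differently.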