To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????oBF 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6f4246
SJIS-WIN 畏??揖??怨??畏??揖??醫??oBF 100010001101100000111111001111111001011101001011001111110011111110001001100001010011111100111111100010001101100000111111001111111001011101001011001111110011111111100111110011100011111100111111011011110100001001000110 88d83f3f974b3f3f89853f3f88d83f3f974b3f3fe7ce3f3f6f4246
EUC-JP 畏??揖??怨??畏??揖??醫??oBF 101100001101101000111111001111111100110110101100001111110011111110110001111001010011111100111111101100001101101000111111001111111100110110101100001111110011111111101110110100000011111100111111011011110100001001000110 b0da3f3fcdac3f3fb1e53f3fb0da3f3fcdac3f3feed03f3f6f4246
UTF-8 畏븐슱揖붷쮿怨ㅼ돭畏븐슱揖붹릸醫묒돁oBF 111001111001010110001111111010111011100010010000111011001000101010110001111001101000111110010110111010111011011010110111111011001010111010111111111001101000000010101000111000111000010110111100111010111000111110101101111001111001010110001111111010111011100010010000111011001000101010110001111001101000111110010110111010111011011010111001111010111010011010111000111010011000011010101011111010111010110010010010111010111000111110000001011011110100001001000110 e7958febb890ec8ab1e68f96ebb6b7ecaebfe680a8e385bceb8fade7958febb890ec8ab1e68f96ebb6b9eba6b8e986abebac92eb8f816f4246
UHC 畏븐슱揖붷쮿怨ㅼ돭畏븐슱揖붹릸醫묒돁oBF 111010001110011010111010111011001001101010111000111010111110011110010100111001011010100010011011111010101011001110100100111011001000100110110000111010001110011010111010111011001001101010111000111010111110011110010100111001101001000010010110111011001010001010010001111011001000100110010100011011110100001001000110 e8e6baec9ab8ebe794e5a89beab3a4ec89b0e8e6baec9ab8ebe794e69096eca291ec89946f4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)