To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 沃??項??純? 1001011110000000001111110011111110001101100000000011111100111111100011111000001100111111 97803f3f8d803f3f8f833f
EUC-JP 沃??項??純? 1100110111100000001111110011111110111001111000000011111100111111101111011110001100111111 cde03f3fb9e03f3fbde33f
UTF-8 沃욌ㅎ項뚧궇純앹 111001101011001010000011111011001001101010001100111000111000010110001110111010011010000010000101111010111001101010100111111010101011011010000111111001111011010010010100111011001001010110111001 e6b283ec9a8ce3858ee9a085eb9aa7eab687e7b494ec95b9
UHC 沃욌ㅎ項뚧궇純앹 11101000101010101001111011101011101001001011111011111010101000111000110011100110100000101010000011100010111011011001110111101100 e8aa9eeba4befaa38ce682a0e2ed9dec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)