To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 茹??泣??源??壓??臾?? 1110010010100101001111110011111110001011100000110011111100111111100011001011100100111111001111111001101011011000001111110011111111100100011010110011111100111111 e4a53f3f8b833f3f8cb93f3f9ad83f3fe46b3f3f
EUC-JP 茹??泣??源??壓??臾?? 1110100010100111001111110011111110110101111000110011111100111111101110001011101100111111001111111101010011011010001111110011111111100111110011000011111100111111 e8a73f3fb5e33f3fb8bb3f3fd4da3f3fe7cc3f3f
UTF-8 茹띿슜泣㏘틠源띿춷壓믩떽臾썹춯 111010001000110010111001111010111001110110111111111011001000101010011100111001101011001110100011111000111000111110011000111011011000101110100000111001101011101010010000111010111001110110111111111011001011011010110111111001011010001110010011111010111010111110101001111010111001011010111101111010001000011110111110111011001000110110111001111011001011011010101111 e88cb9eb9dbfec8a9ce6b3a3e38f98ed8ba0e6ba90eb9dbfecb6b7e5a393ebafa9eb96bde887beec8db9ecb6af
UHC 茹띿슜泣㏘틠源띿춷壓믩떽臾썹춯 111001101010101010001101111011001001101010101001111010111110100010100010111001001011101010001100111010101011100110001101111011001010110110010011111001001110001010010010111010111011011010111101111010111010110010111101111001111010110110001100 e6aa8dec9aa9ebe8a2e4ba8ceab98decad93e4e292ebb6bdebacbde7ad8c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)