To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 沃??筍?─矜誼 10010111100000000011111100111111111000101010000100111111100001001001111111100001111000001000101101100010 97803f3fe2a13f849fe1e08b62
EUC-JP 沃??筍?─矜誼 11001101111000000011111100111111111001001010001100111111101010001010000111100010111000101011010111000011 cde03f3fe4a33fa8a1e2e2b5c3
UTF-8 沃쇰벩筍됵─矜誼 111001101011001010000011111011001000011110110000111010111011001010101001111001111010110110001101111010111001000010110101111000101001010010000000111001111001111110011100111010001010101010111100 e6b283ec87b0ebb2a9e7ad8deb90b5e29480e79f9ce8aabc
UHC 沃쇰벩筍됵─矜誼 11101000101010101011110011101011100100111011111111100010111011001000100111101111101001101010000111010000111010001110101111111110 e8aabceb93bfe2ec89efa6a1d0e8ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)