To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇?????裔??竊 11101010100001110011111100111111001111110011111100111111111001011110000100111111001111111110001010000110 ea873f3f3f3f3fe5e13f3fe286
EUC-JP 鼇?????裔??竊 11110011111001110011111100111111001111110011111100111111111010101110001100111111001111111110001111100110 f3e73f3f3f3f3feae33f3fe3e6
UTF-8 鼇믢폒兩좄쾾裔뀐숲竊 111010011011110010000111111010111010111110100010111011011000111110010010111011111010010110111000111011001010001010000100111011001011111010111110111010001010001110010100111010111000000010010000111011001000100010110010111001111010101110001010 e9bc87ebafa2ed8f92efa5b8eca284ecbebee8a394eb8090ec88b2e7ab8a
UHC 鼇믢폒兩좄쾾裔뀐숲竊 1110100010101000100100101110010010111100100111001110010110111011101000001110100010110010100101001110011111100000101100101110111110111101101000111110111110111100 e8a892e4bc9ce5bba0e8b294e7e0b2efbda3efbc

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
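
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not this page's own implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is one plausible explanation for the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("鼇").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.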