To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇?????野??與 11101010100001110011111100111111001111110011111100111111100101101110110000111111001111111110010001101111 ea873f3f3f3f3f96ec3f3fe46f
EUC-JP 鼇?????野??與 11110011111001110011111100111111001111110011111100111111110011001110111000111111001111111110011111010000 f3e73f3f3f3f3fccee3f3fe7d0
UTF-8 鼇귛읃溜뀀젎野껊젨與 111010011011110010000111111010101011011110011011111011001001110110000011111011111010011110001011111010111000000010000000111011001010000010001110111010011000011110001110111010101011101110001010111011001010000010101000111010001000100010000111 e9bc87eab79bec9d83efa78beb8080eca08ee9878eeabb8aeca0a8e88887
UHC 鼇귛읃溜뀀젎野껊젨與 1110100010101000100000101110010110011111101110101110101011111110101100101110101110100000100011111110010110101111100000111110101110100000101000001110011010101000 e8a882e59fbaeafeb2eba08fe5af83eba0a0e6a8

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
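
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration and not how this page is implemented. The helper names (`CHARSETS`, `encode_row`, `convert`) are hypothetical, and mapping the table labels to Python's `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` codecs is an assumption. Characters a charset cannot represent are replaced with "?" (3f), which matches the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset; returns {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("鼇").items():
        print(label, bits, hexstr)
```

For the single character 鼇, the hex values this produces for SJIS-WIN, EUC-JP and UTF-8 should begin the same way as the corresponding rows in the table: ea87, f3e7 and e9bc87.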