To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇??蜈??預э? 11101010100001110011111100111111111001011000010100111111001111111001011101100001100001001000111100111111 ea873f3fe5853f3f9761848f3f
EUC-JP 鼇??蜈??預э? 11110011111001110011111100111111111010011110010100111111001111111100110111000010101001111110111100111111 f3e73f3fe9e53f3fcdc2a7ef3f
UTF-8 鼇루뼻蜈껓푺預э풜 1110100110111100100001111110101110100011101010001110101110111100101110111110100010011100100010001110101010111011100100111110110110010001101110101110100110100000100100001101000110001101111011011001001010011100 e9bc87eba3a8ebbcbbe89c88eabb93ed91bae9a090d18ded929c
UHC 鼇루뼻蜈껓푺預э풜 111010001010100010110111111001111001011010111110111010001010010110000011111011111011111010000110111001111110100010101100111011111011111010011111 e8a8b7e796bee8a583efbe86e7e8acefbe9f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)