To which bit string is a character encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 與????オ餓 | 11100100011011110011111100111111001111110011111110000011010010011000100111101100 | e46f3f3f3f3f834989ec
EUC-JP | 與????オ餓 | 11100111110100000011111100111111001111110011111110100101101010101011001011101110 | e7d03f3f3f3fa5aab2ee
UTF-8 | 與썸㉬呂묊オ餓 | 111010001000100010000111111011001000110110111000111000111000100110101100111011111010011010000000111010111010110010001010111000111000001010101010111010011010010010010011 | e88887ec8db8e389acefa680ebac8ae382aae9a493
UHC | 與썸㉬呂묊オ餓 | 1110011010101000101111011110011010101000101111011110010111111011100100011110011110101011101010101110010010111011 | e6a8bde6a8bde5fb91e7abaae4bb

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
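
A conversion like the one in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the `CODECS` mapping and the `to_bit_strings` helper are hypothetical names, and it assumes that Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-Win and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the rows above.

```python
# Hypothetical mapping from the charset labels used on this page
# to Python codec names (an assumption, not taken from the page itself).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters the codec cannot represent are replaced with "?".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("オ", codec)
        print(label, binary, hexadecimal)
```

For the single character オ, this yields e382aa in UTF-8, 8349 in CP932, and a5aa in EUC-JP, matching the corresponding bytes in the table.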