To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 娃?い吾??與 | 1000100010100001001111111000001010100010100011001110000100111111001111111110010001101111 | 88a13f82a28ce13f3fe46f
EUC-JP | 娃?い吾??與 | 1011000010100011001111111010010010100100101110001110001100111111001111111110011111010000 | b0a33fa4a4b8e33f3fe7d0
UTF-8 | 娃꿱い吾꾢꺅與 | 111001011010100010000011111010101011111110110001111000111000000110000100111001011001000010111110111010101011111010100010111010101011101010000101111010001000100010000111 | e5a883eabfb1e38184e590beeabea2eaba85e88887
UHC | 娃꿱い吾꾢꺅與 | 1110100011011111101100101110100010101010101001001110011111101110100001001110010110110010101001101110011010101000 | e8dfb2e8aaa4e7ee84e5b2a6e6a8
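
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names in `CODECS` are assumed Python equivalents of the table's labels (`cp932` for SJIS-WIN, `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. It also assumes that unencodable characters are replaced with `?` (byte 3f), which is consistent with the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly 8 binary digits, so leading zeros are kept.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("い").items():
        print(label, bits, hex_string)
```

For example, `encode_row("い", "utf-8")` returns `("111000111000000110000100", "e38184")`, which matches the third character of the UTF-8 row.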

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).