To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ????ゆ??? 001111110011111100111111001111111000001011100100001111110011111100111111 3f3f3f3f82e43f3f3f
EUC-JP ????ゆ??? 001111110011111100111111001111111010010011100110001111110011111100111111 3f3f3f3fa4e63f3f3f
UTF-8 料곌끝杻ゆ뒔溜켃 111011111010011010111110111010101011001110001100111010111000000110011101111011111010011110001000111000111000001010000110111010111001001010010100111011111010011110001011111011001011110010000011 efa6beeab38ceb819defa788e38286eb9294efa78becbc83
UHC 料곌끝杻ゆ뒔溜켃 11101000111101111011000011101010101100111010000111101010111101001010101011100110100010101001000111101010111111101011000101000010 e8f7b0eab3a1eaf4aae68a91eafeb142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)