To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 要??魚よぇ節 100101110111011000111111001111111000101110011011100000101110011010000010101001011001000011011111 97763f3f8b9b82e682a590df
EUC-JP 要??魚よぇ節 110011011101011100111111001111111011010111111011101001001110100010100100101001111100000011100001 cdd73f3fb5fba4e8a4a7c0e1
UTF-8 要잋튊魚よぇ節 111010001010011010000001111011001001111010001011111011011000101010001010111010011010110110011010111000111000001010001000111000111000000110000111111001111010111110000000 e8a681ec9e8bed8a8ae9ad9ae38288e38187e7af80
UHC 要잋튊魚よぇ節 1110100110101001100111111110010010111001100111101110010111100000101010101110100010101010101001111110111110111101 e9a99fe4b99ee5e0aae8aaa7efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)