To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 煜辞裙自昇鴈 1111101101010101100011101010101111100101111000111000111010101001100011111011100011110111101111001110100111101100 fb558eabe5e38ea98fb8f7bce9ec
EUC-JP 煜辞裙自昇?鴈 1000111111001001111111001011110010101101111010101110010110111100101010111011111010111010001111111111001011101110 8fc9fcbcadeae5bcabbeba3ff2ee
UTF-8 煜辞裙自昇鴈 111001111000010110011100111010001011111010011110111010001010001110011001111010001000011110101010111001101001100010000111111011101001011010011111111010011011010010001000 e7859ce8be9ee8a399e887aae69887ee969fe9b488
UHC 煜?裙自昇?? 1110100111110010001111111100111111011001111011011011101111100011101100000011111100111111 e9f23fcfd9edbbe3b03f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)