To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 曜??節??節??晤 1001011101101010001111110011111110010000110111110011111100111111100100001101111100111111001111111001110111101011 976a3f3f90df3f3f90df3f3f9deb
EUC-JP 曜??節??節??晤 1100110111001011001111110011111111000000111000010011111100111111110000001110000100111111001111111101101011101101 cdcb3f3fc0e13f3fc0e13f3fdaed
UTF-8 曜닸봄節룟끁節쏙슥晤 111001101001101110011100111010111000101110111000111010111011010010000100111001111010111110000000111010111010001110011111111010111000000110000001111001111010111110000000111011001000111110011001111011001000101010100101111001101001100110100100 e69b9ceb8bb8ebb484e7af80eba39feb8181e7af80ec8f99ec8aa5e699a4
UHC 曜닸봄節룟끁節쏙슥晤 1110100011111000101101001110011010111010101111011110111110111101101101111110010110000101101101111110111110111101101111011110111110111101101110111110011111111011 e8f8b4e6babdefbdb7e585b7efbdbdefbdbbe7fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)