To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????L?? 001111110011111100111111001111110011111100111111010011000011111100111111 3f3f3f3f3f3f4c3f3f
SJIS-WIN ?脹???竅L?脹 001111111001001010101111001111110011111100111111111000101000000101001100001111111001001010101111 3f92af3f3f3fe2814c3f92af
EUC-JP ?脹???竅L?脹 001111111100010010110001001111110011111100111111111000111110000101001100001111111100010010110001 3fc4b13f3f3fe3e14c3fc4b1
UTF-8 뤋脹쮲샘폄竅L뤋脹 11101011101001001000101111101000100001001011100111101100101011101011001011101100100000111001100011101101100011111000010011100111101010111000010101001100111010111010010010001011111010001000010010111001 eba48be884b9ecaeb2ec8398ed8f84e7ab854ceba48be884b9
UHC 뤋脹쮲샘폄竅L뤋脹 1000111110111011111100111110110010101000100011111011101111111001110001101110111111010000101010110100110010001111101110111111001111101100 8fbbf3eca88fbbf9c6efd0ab4c8fbbf3ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)