To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?諷漿??????B 001111111110011010000101100111111111011100111111001111110011111100111111001111110011111101000010 3fe6859ff73f3f3f3f3f3f42
EUC-JP ?諷漿??????B 001111111110101111100101110111101111100100111111001111110011111100111111001111110011111101000010 3febe5def93f3f3f3f3f3f42
UTF-8 뤋諷漿쮱찋샘ㅾ렓렔B 11101011101001001000101111101000101010111011011111100110101111001011111111101100101011101011000111101100101100001000101111101100100000111001100011100011100001011011111011101011101000001001001111101011101000001001010001000010 eba48be8abb7e6bcbfecaeb1ecb08bec8398e385beeba093eba09442
UHC 뤋諷漿쮱찋샘ㅾ렓렔B 10001111101110111111100110100100111011011110110010101000100011101010100110001111101110111111100110100100111011101000111010101000100011101010100101000010 8fbbf9a4edeca88ea98fbbf9a4ee8ea88ea942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)