To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????h??? 001111110011111100111111001111110011111101101000001111110011111100111111 3f3f3f3f3f683f3f3f
SJIS-WIN 險貞雀豼、h險貞雀 11101000101010001001001011100101100100001001110111100110101111111010010001101000111010001010100010010010111001011001000010011101 e8a892e5909de6bfa468e8a892e5909d
EUC-JP 險貞雀豼、h險貞雀 1111000010101010110001001110011110111111111111011110110011000001100011101010010001101000111100001010101011000100111001111011111111111101 f0aac4e7bffdecc18ea468f0aac4e7bffd
UTF-8 險貞雀豼、h險貞雀 11101001100110101010101011101000101100101001111011101001100110111000000011101000101100011011110011101111101111011010010001101000111010011001101010101010111010001011001010011110111010011001101110000000 e99aaae8b29ee99b80e8b1bcefbda468e99aaae8b29ee99b80
UHC 險貞雀??h險貞雀 111110101100111111101111111101101110110111001101001111110011111101101000111110101100111111101111111101101110110111001101 facfeff6edcd3f3f68facfeff6edcd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)