To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????h???? 00111111001111110011111100111111001111110110100000111111001111110011111100111111 3f3f3f3f3f683f3f3f3f
SJIS-WIN 險貞雀豼、h險貞雀豼 111010001010100010010010111001011001000010011101111001101011111110100100011010001110100010101000100100101110010110010000100111011110011010111111 e8a892e5909de6bfa468e8a892e5909de6bf
EUC-JP 險貞雀豼、h險貞雀豼 11110000101010101100010011100111101111111111110111101100110000011000111010100100011010001111000010101010110001001110011110111111111111011110110011000001 f0aac4e7bffdecc18ea468f0aac4e7bffdecc1
UTF-8 險貞雀豼、h險貞雀豼 11101001100110101010101011101000101100101001111011101001100110111000000011101000101100011011110011101111101111011010010001101000111010011001101010101010111010001011001010011110111010011001101110000000111010001011000110111100 e99aaae8b29ee99b80e8b1bcefbda468e99aaae8b29ee99b80e8b1bc
UHC 險貞雀??h險貞雀? 11111010110011111110111111110110111011011100110100111111001111110110100011111010110011111110111111110110111011011100110100111111 facfeff6edcd3f3f68facfeff6edcd3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)