To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 熱?????宥??[熱?????宥??[^ 10010100010011010011111100111111001111110011111100111111100101110100011100111111001111110101101110010100010011010011111100111111001111110011111100111111100101110100011100111111001111110101101101011110 944d3f3f3f3f3f97473f3f5b944d3f3f3f3f3f97473f3f5b5e
EUC-JP 熱?????宥??[熱?????宥??[^ 11000111101011100011111100111111001111110011111100111111110011011010100000111111001111110101101111000111101011100011111100111111001111110011111100111111110011011010100000111111001111110101101101011110 c7ae3f3f3f3f3fcda83f3f5bc7ae3f3f3f3f3fcda83f3f5b5e
UTF-8 熱듬떧紐좂뮫宥븍첓[熱듬떧紐좂뮫宥븍첓[^ 111001111000011010110001111010111001001110101100111010111001011010100111111011111010011110001111111011001010001010000010111010111010111010101011111001011010111010100101111010111011100010001101111011001011001010010011010110111110011110000110101100011110101110010011101011001110101110010110101001111110111110100111100011111110110010100010100000101110101110101110101010111110010110101110101001011110101110111000100011011110110010110010100100110101101101011110 e786b1eb93aceb96a7efa78feca282ebaeabe5aea5ebb88decb2935be786b1eb93aceb96a7efa78feca282ebaeabe5aea5ebb88decb2935b5e
UHC 熱듬떧紐좂뮫宥븍첓[熱듬떧紐좂뮫宥븍첓[^ 111001101111000010110101111010111000101110111010111010111010101010100000111001111001001010110101111010101110100110111010111010111010101010100000010110111110011011110000101101011110101110001011101110101110101110101010101000001110011110010010101101011110101011101001101110101110101110101010101000000101101101011110 e6f0b5eb8bbaebaaa0e792b5eae9baebaaa05be6f0b5eb8bbaebaaa0e792b5eae9baebaaa05b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)