To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????管悠??碎λ????靭??? 00111111001111110011111100111111001111110011111110001010110001111001011101001001001111110011111111100001111010101000001111001001001111110011111100111111001111111001000001111000001111110011111100111111 3f3f3f3f3f3f8ac797493f3fe1ea83c93f3f3f3f90783f3f3f
EUC-JP ???佾??管悠??碎λ????靭??絪 0011111100111111001111111000111110110000111110110011111100111111101101001100100111001101101010100011111100111111111000101110110010100110110010110011111100111111001111110011111110111111110110010011111100111111100011111101001111101100 3f3f3f8fb0fb3f3fb4c9cdaa3f3fe2eca6cb3f3f3f3fbfd93f3f8fd3ec
UTF-8 麗몃쓷佾쒏룚管悠끾뉩碎λ룵麗몃쓷靭딁뙴絪 1110111110100110100010001110101110101010100000111110110010010011101101111110010010111101101111101110110010010010100011111110101110100011100110101110011110101110101000011110011010000010101000001110101110000001101111101110101110001001101010011110011110100010100011101100111010111011111010111010001110110101111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001010010000001111010111001100110110100111001111011010110101010 efa688ebaa83ec93b7e4bdbeec928feba39ae7aea1e682a0eb81beeb89a9e7a28ecebbeba3b5efa688ebaa83ec93b7e99dadeb9481eb99b4e7b5aa
UHC 麗몃쓷佾쒏룚管悠끾뉩碎λ룵麗몃쓷靭딁뙴絪 11100110101100001011100011101011100111011001010011101100111010111001110011100110100011111001011011001110101101111110101011101101100001011110011010110100101110011110000111101111101001011110101110001111101010101110011010110000101110001110101110011101100101001110110011100101100010101110011110001100101101111110110011011111 e6b0b8eb9d94eceb9ce68f96ceb7eaed85e6b4b9e1efa5eb8faae6b0b8eb9d94ece58ae78cb7ecdf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)