To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????汲悠??碎λ?????ч.?? 001111110011111100111111001111110011111100111111100010111000001010010111010010010011111100111111111000011110101010000011110010010011111100111111001111110011111100111111100001001000100110000001010001000011111100111111 3f3f3f3f3f3f8b8297493f3fe1ea83c93f3f3f3f3f848981443f3f
EUC-JP ???佾??汲悠??碎λ?????ч.?? 0011111100111111001111111000111110110000111110110011111100111111101101011110001011001101101010100011111100111111111000101110110010100110110010110011111100111111001111110011111100111111101001111110100110100001101001010011111100111111 3f3f3f8fb0fb3f3fb5e2cdaa3f3fe2eca6cb3f3f3f3f3fa7e9a1a53f3f
UTF-8 麗몃쓷佾쒏룚汲悠끾뉩碎λ룵麗몃쓷流ч.劉쓍 11101111101001101000100011101011101010101000001111101100100100111011011111100100101111011011111011101100100100101000111111101011101000111001101011100110101100011011001011100110100000101010000011101011100000011011111011101011100010011010100111100111101000101000111011001110101110111110101110100011101101011110111110100110100010001110101110101010100000111110110010010011101101111110111110100111100010101101000110000111111011111011110010001110111011111010011110000111111011001001001110001101 efa688ebaa83ec93b7e4bdbeec928feba39ae6b1b2e682a0eb81beeb89a9e7a28ecebbeba3b5efa688ebaa83ec93b7efa78ad187efbc8eefa787ec938d
UHC 麗몃쓷佾쒏룚汲悠끾뉩碎λ룵麗몃쓷流ч.劉쓍 111001101011000010111000111010111001110110010100111011001110101110011100111001101000111110010110110100001110001111101010111011011000010111100110101101001011100111100001111011111010010111101011100011111010101011100110101100001011100011101011100111011001010011101010111111001010110011101001101000111010111011101010111001011001110101101000 e6b0b8eb9d94eceb9ce68f96d0e3eaed85e6b4b9e1efa5eb8faae6b0b8eb9d94eafcace9a3aeeae59d68

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
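
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_all` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_all("あ").items():
        print(label, bits, hexed)
```

Encoding one character this way makes it easy to see how many bytes each charset needs for it, and which charsets fall back to `?` because they have no code point for that character.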