To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????M?????M???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110100111111001111110011111100111111001111110100110100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f4d3f3f3f3f
SJIS-WIN ????????????M?????M???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110100111111001111110011111100111111001111110100110100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f4d3f3f3f3f
EUC-JP ????????????M?????M???? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110100111111001111110011111100111111001111110100110100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f4d3f3f3f3f
UTF-8 쩐횋혦횉쩌쨋혧횋척혦횊청M혦짜혦횣청M혦쩌혦쩐 1110110010101001100100001110110110011010100010111110110110011000101001101110110110011010100010011110110010101001100011001110110010101000100010111110110110011000101001111110110110011010100010111110110010110010100110011110110110011000101001101110110110011010100010101110110010110010101011010100110111101101100110001010011011101100101001111001110011101101100110001010011011101101100110101010001111101100101100101010110101001101111011011001100010100110111011001010100110001100111011011001100010100110111011001010100110010000 eca990ed9a8bed98a6ed9a89eca98ceca88bed98a7ed9a8becb299ed98a6ed9a8aecb2ad4ded98a6eca79ced98a6ed9aa3ecb2ad4ded98a6eca98ced98a6eca990
UHC 쩐횋혦횉쩌쨋혧횋척혦횊청M혦짜혦횣청M혦쩌혦쩐 1100001010111110110000111000100111000010100011101100001110000111110000101011110011000010101101101100001010001111110000111000100111000011101101001100001010001110110000111000100011000011101110110100110111000010100011101100001010100101110000101000111011000011100110101100001110111011010011011100001010001110110000101011110011000010100011101100001010111110 c2bec389c28ec387c2bcc2b6c28fc389c3b4c28ec388c3bb4dc28ec2a5c28ec39ac3bb4dc28ec2bcc28ec2be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)