To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蹄???棕?藏?????鏃???鏃??舵μ 10010010111110110011111100111111001111111001111010100001001111111110010101010101001111110011111100111111001111110011111111101000010101100011111100111111001111111110100001010110001111110011111110010001110001111000001111001010 92fb3f3f3f9ea13fe5553f3f3f3f3fe8563f3f3fe8563f3f91c783ca
EUC-JP 蹄???棕?藏?????鏃???鏃??舵μ 11000100111111010011111100111111001111111101110010100011001111111110100110110110001111110011111100111111001111110011111111101111101101110011111100111111001111111110111110110111001111110011111111000010110010011010011011001100 c4fd3f3f3fdca33fe9b63f3f3f3f3fefb73f3f3fefb73f3fc2c9a6cc
UTF-8 蹄뀜렰렲棕렏藏렜뤉칿쏠겁鏃퐥욹눠鏃퐥욺舵μ 1110100010111001100001001110101110000000100111001110101110100000101100001110101110100000101100101110011010100011100101011110101110100000100011111110100010010111100011111110101110100000100111001110101110100100100010011110110010111001101111111110110010001111101000001110101010110010100000011110100110001111100000111110110110010000101001011110110010011010101110011110101110001000101000001110100110001111100000111110110110010000101001011110110010011010101110101110100010001000101101011100111010111100 e8b984eb809ceba0b0eba0b2e6a395eba08fe8978feba09ceba489ecb9bfec8fa0eab281e98f83ed90a5ec9ab9eb88a0e98f83ed90a5ec9abae888b5cebc
UHC 蹄뀜렰렲棕렏藏렜뤉칿쏠겁鏃퐥욹눠鏃퐥욺舵μ 111100001011010010110010111100011000111010111101100011101011111111110000111101111000111010100101111011011111101010001110101011101000111110111001101011111000111010111101111100101011000011001100111100001110110010111101100011101011111111110000101101001011001011110000111011001011110110001110101111111111000111110110111011001010010111101100 f0b4b2f18ebd8ebff0f78ea5edfa8eae8fb9af8ebdf2b0ccf0ecbd8ebff0b4b2f0ecbd8ebff1f6eca5ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)