To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃??揖??兪??嶸??泣??蟻??櫻??揖 100010001010000100111111001111111001011101001011001111110011111110011001011000000011111100111111111110101011010000111111001111111000101110000011001111110011111110001011011000010011111100111111100111110100111000111111001111111001011101001011 88a13f3f974b3f3f99603f3ffab43f3f8b833f3f8b613f3f9f4e3f3f974b
EUC-JP 娃??揖??兪??嶸??泣?ˇ蟻??櫻??揖 101100001010001100111111001111111100110110101100001111110011111111010001110000010011111100111111100011111011101111110100001111110011111110110101111000110011111110001111101000101011000010110101110000100011111100111111110111011010111100111111001111111100110110101100 b0a33f3fcdac3f3fd1c13f3f8fbbf43f3fb5e33f8fa2b0b5c23f3fddaf3f3fcdac
UTF-8 娃숇쨪揖먩찄兪낆뒟嶸뗭옚泣ⓩˇ蟻욎돺櫻뗫돍揖 1110010110101000100000111110110010001000100001111110110010101000101010101110011010001111100101101110101110101000101010011110110010110000100001001110010110000101101010101110101110000010100001101110101110010010100111111110010110110110101110001110101110010111101011011110110010011000100110101110011010110011101000111110001010010011101010011100101110000111111010001001111110111011111011001001101010001110111010111000111110111010111001101010101110111011111010111001011110101011111010111000111110001101111001101000111110010110 e5a883ec8887eca8aae68f96eba8a9ecb084e585aaeb8286eb929fe5b6b8eb97adec989ae6b3a3e293a9cb87e89fbbec9a8eeb8fbae6abbbeb97abeb8f8de68f96
UHC 娃숇쨪揖먩찄兪낆뒟嶸뗭옚泣ⓩˇ蟻욎돺櫻뗫돍揖 1110100011011111100110011110101110100100100001001110101111100111100100001110011010101001100010001110101011100100100001011110110010001010100110111110011110101110100010111110110010011110100111101110101111101000101010001110011010100010101001111110101111111100100111101110110010001001101111011110010110100001100010111110101110001001100110111110101111100111 e8df99eba484ebe790e6a988eae485ec8a9be7ae8bec9e9eebe8a8e6a2a7ebfc9eec89bde5a18beb899bebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)