To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??援δ?????ル、違??飮??佯 111001010100000100111111001111111000100110000111100000111100001000111111001111110011111100111111001111111000001110001011100000010100000110001000111000010011111100111111100111110101101000111111001111111001100011010001 e5413f3f898783c23f3f3f3f3f838b814188e13f3f9f5a3f3f98d1
EUC-JP 蘂??援δ?洹???ル、違??飮??佯 1110100110100010001111110011111110110001111001111010011011000100001111111000111111000111101110100011111100111111001111111010010111101011101000011010001010110000111000110011111100111111110111011011101100111111001111111101000011010011 e9a23f3fb1e7a6c43f8fc7ba3f3f3fa5eba1a2b0e33f3fddbb3f3fd0d3
UTF-8 蘂띔퍊援δ빳洹앹뿉曆ル、違뺟솾飮뉖뼮佯 1110100010011000100000101110101110011101100101001110110110001101100010101110011010001111101101001100111010110100111010111011100110110011111001101011010010111001111011001001010110111001111010111011111110001001111011111010011010001011111000111000001110101011111000111000000010000001111010011000000110010101111010111011101010011111111011001000011010111110111010011010001110101110111010111000100110010110111010111011110010101110111001001011110110101111 e89882eb9d94ed8d8ae68fb4ceb4ebb9b3e6b4b9ec95b9ebbf89efa68be383abe38081e98195ebba9fec86bee9a3aeeb8996ebbcaee4bdaf
UHC 蘂띔퍊援δ빳洹앹뿉曆ル、違뺟솾飮뉖뼮佯 1110011111011110101101101110101010111011100000011110101010110101101001011110010010111011101001011110101010110111100111011110110010010111100100001110011010110111101010111110101110100001101000101110101011011110100101011110011110011001101100101110101111100110100001111110101110010110101100011110010110111010 e7deb6eabb81eab5a5e4bba5eab79dec9790e6b7abeba1a2eade95e799b2ebe687eb96b1e5ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)