To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厓??愉?????孃り?竊?Ⅷ擬?????? 1111101010001101001111110011111110010110111110010011111100111111001111110011111100111111100110110110111110000010111010000011111111100010100001100011111110000111010110111000101101011011001111110011111100111111001111110011111100111111 fa8d3f3f96f93f3f3f3f3f9b6f82e83fe2863f875b8b5b3f3f3f3f3f3f
EUC-JP 厓??愉??洧??孃り?竊??擬?????? 10001111101101001100011100111111001111111100110011111011001111110011111110001111110001111011010000111111001111111101010111010000101001001110101000111111111000111110011000111111001111111011010110111100001111110011111100111111001111110011111100111111 8fb4c73f3fccfb3f3f8fc7b43f3fd5d0a4ea3fe3e63f3fb5bc3f3f3f3f3f3f
UTF-8 厓쀢뼰愉꾤뙴洧좑폇孃り쑬竊뺧Ⅷ擬좊돎列룸쓬履 111001011000111010010011111011001000000010100010111010111011110010110000111001101000010010001001111010101011111010100100111010111001100110110100111001101011010010100111111011001010001010010001111011011000111110000111111001011010110110000011111000111000001010001010111011001001000110101100111001111010101110001010111010111011101010100111111000101000010110100111111001101001001110101100111011001010001010001010111010111000111110001110111011111010011010011100111010111010001110111000111011001001001110101100111011111010011110011111 e58e93ec80a2ebbcb0e68489eabea4eb99b4e6b4a7eca291ed8f87e5ad83e3828aec91ace7ab8aebbaa7e285a7e693aceca28aeb8f8eefa69ceba3b8ec93acefa79f
UHC 厓쀢뼰愉꾤뙴洧좑폇孃り쑬竊뺧Ⅷ擬좊돎列룸쓬履 1110010011101101100101111110001010010110101100111110101011110000100001001110011110001100101101111110101011111011101000001110111110111100100101001110010110111110101010101110101010111110101010001110111110111100100101011110111110100101101101111110101111110100101000001110101110110101101110101110011011101010101101111110101110011101100011001110110010101010 e4ed97e296b3eaf084e78cb7eafba0efbc94e5beaaeabea8efbc95efa5b7ebf4a0ebb5bae6eab7eb9d8cecaa

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
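
A conversion like the one in the table above can be approximated with a few lines of Python. This is only a sketch under stated assumptions, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and convert are illustrative choices. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# The page's actual implementation is unknown; these are the closest stdlib codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Characters the codec cannot represent become b"?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero padded
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    sample = "あ"
    for label, (bits, hexa) in convert(sample).items():
        print(label, sample, bits, hexa)
```

Calling convert with a short string returns one entry per charset, so the rows can be printed in the same order as the table. Using errors="replace" is a design choice made to mirror the "?" output shown above; errors="strict" would raise an exception instead when a character is not representable.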