To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??裕??袁?き筌??游??衰λ?佯 11111010110100000011111100111111100101110101010000111111001111111110010111001101001111111000001010101011111000101010001100111111001111111001111111100000001111110011111110010000100010101000001111001001001111111001100011010001 fad03f3f97543f3fe5cd3f82abe2a33f3f9fe03f3f908a83c93f98d1
EUC-JP ???裕??袁?き筌??游??衰λ?佯 001111110011111100111111110011011011010100111111001111111110101011001111001111111010010010101101111001001010010100111111001111111101111011100010001111110011111110111111111010101010011011001011001111111101000011010011 3f3f3fcdb53f3feacf3fa4ade4a53f3fdee23f3fbfeaa6cb3fd0d3
UTF-8 昻뉗떝裕뉒뙴袁ㅻき筌딄퍊游멱첑衰λ쳴佯 1110011010011000101110111110101110001001100101111110101110010110100111011110100010100011100101011110101110001001100100101110101110011001101101001110100010100010100000011110001110000101101110111110001110000001100011011110011110101101100011001110101110010100100001001110110110001101100010101110011010111000101110001110101110101001101100011110110010110010100100011110100010100001101100001100111010111011111011001011001110110100111001001011110110101111 e698bbeb8997eb969de8a395eb8992eb99b4e8a281e385bbe3818de7ad8ceb9484ed8d8ae6b8b8eba9b1ecb291e8a1b0cebbecb3b4e4bdaf
UHC 昻뉗떝裕뉒뙴袁ㅻき筌딄퍊游멱첑衰λ쳴佯 1110010011101001100001111110110010001011101100111110101110101110100001111110011110001100101101111110101010111110101001001110101110101010101011011110111110100111100010101110101010111011100000011110101011111101101110001110100010101010100111101110000111110001101001011110101110101011100101111110010110111010 e4e987ec8bb3ebae87e78cb7eabea4ebaaadefa78aeabb81eafdb8e8aa9ee1f1a5ebab97e5ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)