To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN 淨?伊豆?源?低逗工i淨?伊豆?源?低逗工iB 10011111110001000011111110001000110010011001001110100100001111111000110010111001001111111001001011100001100100001000000010001101010010000110100110011111110001000011111110001000110010011001001110100100001111111000110010111001001111111001001011100001100100001000000010001101010010000110100101000010 9fc43f88c993a43f8cb93f92e190808d48699fc43f88c993a43f8cb93f92e190808d486942
EUC-JP 淨?伊豆?源?低逗工i淨?伊豆?源?低逗工iB 11011110110001100011111110110000110010111100011010100110001111111011100010111011001111111100010011100011101111111110000010111001101010010110100111011110110001100011111110110000110010111100011010100110001111111011100010111011001111111100010011100011101111111110000010111001101010010110100101000010 dec63fb0cbc6a63fb8bb3fc4e3bfe0b9a969dec63fb0cbc6a63fb8bb3fc4e3bfe0b9a96942
UTF-8 淨렠伊豆뱌源렰低逗工i淨렠伊豆뱌源렰低逗工iB 111001101011011110101000111010111010000010100000111001001011110010001010111010001011000110000110111010111011000110001100111001101011101010010000111010111010000010110000111001001011110110001110111010011000000010010111111001011011011110100101011010011110011010110111101010001110101110100000101000001110010010111100100010101110100010110001100001101110101110110001100011001110011010111010100100001110101110100000101100001110010010111101100011101110100110000000100101111110010110110111101001010110100101000010 e6b7a8eba0a0e4bc8ae8b186ebb18ce6ba90eba0b0e4bd8ee98097e5b7a569e6b7a8eba0a0e4bc8ae8b186ebb18ce6ba90eba0b0e4bd8ee98097e5b7a56942
UHC 淨렠伊豆뱌源렰低逗工i淨렠伊豆뱌源렰低逗工iB 11101111111001001000111010110001111011001010010111010100111001111011100111110010111010101011100110001110101111011110111010111000110101001110100011001101111011110110100111101111111001001000111010110001111011001010010111010100111001111011100111110010111010101011100110001110101111011110111010111000110101001110100011001101111011110110100101000010 efe48eb1eca5d4e7b9f2eab98ebdeeb8d4e8cdef69efe48eb1eca5d4e7b9f2eab98ebdeeb8d4e8cdef6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)