To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 藥??岳??熬??塋よ?娃??耶??瘟 111001010101101000111111001111111000101001111000001111110011111111100000100100100011111100111111100110101100100010000010111001100011111110001000101000010011111100111111100101101110101100111111001111111110000110001001 e55a3f3f8a783f3fe0923f3f9ac882e63f88a13f3f96eb3f3fe189
EUC-JP 藥??岳??熬??塋よ?娃??耶??瘟 111010011011101100111111001111111011001111011001001111110011111111011111111100100011111100111111110101001100101010100100111010000011111110110000101000110011111100111111110011001110110100111111001111111110000111101001 e9bb3f3fb3d93f3fdff23f3fd4caa4e83fb0a33f3fcced3f3fe1e9
UTF-8 藥썲춼岳쀧떥熬뽪뜆塋よ떨娃쒏짎耶섉룂瘟 111010001001011110100101111011001000110110110010111011001011011010111100111001011011001010110011111011001000000010100111111010111001011010100101111001111000011010101100111010111011110110101010111010111001110010000110111001011010000110001011111000111000001010001000111010111001011010101000111001011010100010000011111011001001001010001111111011001010011110001110111010001000000010110110111011001000010010001001111010111010001110000010111001111001100010011111 e897a5ec8db2ecb6bce5b2b3ec80a7eb96a5e786acebbdaaeb9c86e5a18be38288eb96a8e5a883ec928feca78ee880b6ec8489eba382e7989f
UHC 藥썲춼岳쀧떥熬뽪뜆塋よ떨娃쒏짎耶섉룂瘟 1110010110110111101111011110010110101101100110001110010010111111100101111110011110001011101110001110100010100010100101101110011010001101100010011110011110101011101010101110100010110110101100111110100011011111100111001110011010100011100110101110010110101101100110001110011010001111100000111110100010110000 e5b7bde5ad98e4bf97e78bb8e8a296e68d89e7abaae8b6b3e8df9ce6a39ae5ad98e68f83e8b0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
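
The kind of conversion shown in the table can be approximated with a minimal Python sketch. This is not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("あ").items():
        print(name, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.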