To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 業??宥?????瑤??愉??????μ?弛 10001011110001100011111100111111100101110100011100111111001111110011111100111111001111111110101010100010001111110011111110010110111110010011111100111111001111110011111100111111001111111000001111001010001111111001001001101111 8bc63f3f97473f3f3f3f3feaa23f3f96f93f3f3f3f3f3f83ca3f926f
EUC-JP 業??宥?????瑤??愉??洧???μ?弛 101101101100100000111111001111111100110110101000001111110011111100111111001111110011111111110100101001000011111100111111110011001111101100111111001111111000111111000111101101000011111100111111001111111010011011001100001111111100001111010000 b6c83f3fcda83f3f3f3f3ff4a43f3fccfb3f3f8fc7b43f3f3fa6cc3fc3d0
UTF-8 業삳돆宥귡렟類좎턃瑤녹럩愉쎿갭洧붿뫁若μ뼲弛 1110011010100101101011011110110010000010101100111110101110001111100001101110010110101110101001011110101010110111101000011110101110100000100111111110111110100111100100001110110010100010100011101110110110000100100000111110011110010001101001001110101110000101101110011110101110011111101010011110011010000100100010011110110010001110101111111110101010110000101011011110011010110100101001111110101110110110101111111110101110101011100000011110111110100101101101001100111010111100111010111011110010110010111001011011110010011011 e6a5adec82b3eb8f86e5aea5eab7a1eba09fefa790eca28eed8483e791a4eb85b9eb9fa9e68489ec8ebfeab0ade6b4a7ebb6bfebab81efa5b4cebcebbcb2e5bc9b
UHC 業삳돆宥귡렟類좎턃瑤녹럩愉쎿갭洧붿뫁若μ뼲弛 1110010111110110101110111110101110001001100101111110101011101001100000101110100110001110101100001110101110111010101000001110110010110101100111111110100011111101101100111110110010001110100011001110101011110000100110111110011010110000101110001110101011111011100101001110110010010001101001011110010110101110101001011110110010010110101101011110110010101100 e5f6bbeb8997eae982e98eb0ebbaa0ecb59fe8fdb3ec8e8ceaf09be6b0b8eafb94ec91a5e5aea5ec96b5ecac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)