To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??猷??音??孃る?議??濡ろ?沃??壹 1000100101000111001111110011111110010111010100010011111100111111100010011011100100111111001111111001101101101111100000101110100100111111100010110110001100111111001111111001010001000111100000101110101100111111100101111000000000111111001111111001101011100011 89473f3f97513f3f89b93f3f9b6f82e93f8b633f3f944782eb3f97803f3f9ae3
EUC-JP 烏??猷??音??孃る?議??濡ろ?沃??壹 1011000110101000001111110011111111001101101100100011111100111111101100101011101100111111001111111101010111010000101001001110101100111111101101011100010000111111001111111100011110101000101001001110110100111111110011011110000000111111001111111101010011100101 b1a83f3fcdb23f3fb2bb3f3fd5d0a4eb3fb5c43f3fc7a8a4ed3fcde03f3fd4e5
UTF-8 烏띻퀣猷꾣쨫音쎌춷孃る뿭議끿뙠濡ろ뜐沃쇄몾壹 111001111000001110001111111010111001110110111011111011011000000010100011111001111000110010110111111010101011111010100011111011001010100010101011111010011001111110110011111011001000111010001100111011001011011010110111111001011010110110000011111000111000001010001011111010111011111110101101111010001010110110110000111010111000000110111111111010111001100110100000111001101011111110100001111000111000001010001101111010111001110010010000111001101011001010000011111011001000011110000100111010111010101010111110111001011010001110111001 e7838feb9dbbed80a3e78cb7eabea3eca8abe99fb3ec8e8cecb6b7e5ad83e3828bebbfade8adb0eb81bfeb99a0e6bfa1e3828deb9c90e6b283ec8784ebaabee5a3b9
UHC 烏띻퀣猷꾣쨫音쎌춷孃る뿭議끿뙠濡ろ뜐沃쇄몾壹 1110100010100001100011011110101010110011100101111110101110100011100001001110011010100100100001011110101111100101101111011110110010101101100100111110010110111110101010101110101110010111101011011110110010100001100001011110011110001100101001011110101110100001101010101110110110001101100100111110100010101010101111001110001010010001101000101110110011101100 e8a18deab397eba384e6a485ebe5bdecad93e5beaaeb97adeca185e78ca5eba1aaed8d93e8aabce291a2ecec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)