To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 魚??遊?ぜ碎??孃る?誼??柔j?沃??B 1000101110011011001111110011111110010111010101100011111110000010101110101110000111101010001111110011111110011011011011111000001011101001001111111000101101100010001111110011111110001111010111111000001010001010001111111001011110000000001111110011111101000010 8b9b3f3f97563f82bae1ea3f3f9b6f82e93f8b623f3f8f5f828a3f97803f3f42
EUC-JP 魚??遊?ぜ碎??孃る?誼??柔j?沃??B 1011010111111011001111110011111111001101101101110011111110100100101111001110001011101100001111110011111111010101110100001010010011101011001111111011010111000011001111110011111110111101110000001010001111101010001111111100110111100000001111110011111101000010 b5fb3f3fcdb73fa4bce2ec3f3fd5d0a4eb3fb5c33f3fbdc0a3ea3fcde03f3f42
UTF-8 魚잙봾遊얕ぜ碎ⓥ뵥孃る냱誼든춯柔j데沃쇱뙲B 11101001101011011001101011101100100111101001100111101011101101001011111011101001100000011000101011101100100101101001010111100011100000011001110011100111101000101000111011100010100100111010010111101011101101011010010111100101101011011000001111100011100000101000101111101011100000111011000111101000101010101011110011101011100100111010000011101100101101101010111111100110100111111001010011101111101111011000101011101011100011011011000011100110101100101000001111101100100001111011000111101011100110011011001001000010 e9ad9aec9e99ebb4bee9818aec9695e3819ce7a28ee293a5ebb5a5e5ad83e3828beb83b1e8aabceb93a0ecb6afe69f94efbd8aeb8db0e6b283ec87b1eb99b242
UHC 魚잙봾遊얕ぜ碎ⓥ뵥孃る냱誼든춯柔j데沃쇱뙲B 11100101111000001001111111101011100101001000010111101011101101001011111011101000101010101011110011100001111011111010100011100010100101001010010011100101101111101010101011101011100001101000000111101011111111101011010111100111101011011000110011101010111101011010001111101010101101011010010111101000101010101011110011101100100011001011010101000010 e5e09feb9485ebb4bee8aabce1efa8e294a4e5beaaeb8681ebfeb5e7ad8ceaf5a3eab5a5e8aabcec8cb542

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
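
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The function name `encode_to_bits` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the ISO-8859-1 row above.

```python
# Minimal sketch: encode a string in several charsets and show the
# resulting bytes as a binary bit string and as hexadecimal.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u9b5aB"  # U+9B5A (the first character in the table) followed by "B"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

Using `errors="replace"` keeps the output the same length per character for charsets that lack the input character, at the cost of losing information. Pass `errors="strict"` instead if an exception is preferable.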