To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 畑???ヨ?攸??畑???ヨ?攸??B 10010100101010000011111100111111001111111000001110001000001111111001110110111111001111110011111110010100101010000011111100111111001111111000001110001000001111111001110110111111001111110011111101000010 94a83f3f3f83883f9dbf3f3f94a83f3f3f83883f9dbf3f3f42
EUC-JP 畑??嫄ヨ?攸??畑??嫄ヨ?攸??B 1100100010101010001111110011111110001111101110101010000110100101111010000011111111011010110000010011111100111111110010001010101000111111001111111000111110111010101000011010010111101000001111111101101011000001001111110011111101000010 c8aa3f3f8fbaa1a5e83fdac13f3fc8aa3f3f8fbaa1a5e83fdac13f3f42
UTF-8 畑듭쉯嫄ヨ삃攸귣눒畑듭쉯嫄ヨ삃攸귣눒B 11100111100101011001000111101011100100111010110111101100100010011010111111100101101010111000010011100011100000111010100011101100100000101000001111100110100101001011100011101010101101111010001111101011100010001001001011100111100101011001000111101011100100111010110111101100100010011010111111100101101010111000010011100011100000111010100011101100100000101000001111100110100101001011100011101010101101111010001111101011100010001001001001000010 e79591eb93adec89afe5ab84e383a8ec8283e694b8eab7a3eb8892e79591eb93adec89afe5ab84e383a8ec8283e694b8eab7a3eb889242
UHC 畑듭쉯嫄ヨ삃攸귣눒畑듭쉯嫄ヨ삃攸귣눒B 11101111101001011011010111101100100110101000011111101010101100011010101111101000100110001000101011101010111100101000001011101011100001111010111011101111101001011011010111101100100110101000011111101010101100011010101111101000100110001000101011101010111100101000001011101011100001111010111001000010 efa5b5ec9a87eab1abe8988aeaf282eb87aeefa5b5ec9a87eab1abe8988aeaf282eb87ae42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)