To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 劑槃???精?醫?劑槃???精?醫?B 100110011001110110011110110011110011111100111111001111111001000010111000001111111110011111001110001111111001100110011101100111101100111100111111001111110011111110010000101110000011111111100111110011100011111101000010 999d9ecf3f3f3f90b83fe7ce3f999d9ecf3f3f3f90b83fe7ce3f42
EUC-JP 劑槃??輧精?醫?劑槃??輧精?醫?B 11010001111111011101110011010001001111110011111110001111111000001111001111000000101110100011111111101110110100000011111111010001111111011101110011010001001111110011111110001111111000001111001111000000101110100011111111101110110100000011111101000010 d1fddcd13f3f8fe0f3c0ba3feed03fd1fddcd13f3f8fe0f3c0ba3feed03f42
UTF-8 劑槃렗띌輧精렗醫렋劑槃렗띌輧精렗醫렋B 11100101100010101001000111100110101001111000001111101011101000001001011111101011100111011000110011101000101111001010011111100111101100101011111011101011101000001001011111101001100001101010101111101011101000001000101111100101100010101001000111100110101001111000001111101011101000001001011111101011100111011000110011101000101111001010011111100111101100101011111011101011101000001001011111101001100001101010101111101011101000001000101101000010 e58a91e6a783eba097eb9d8ce8bca7e7b2beeba097e986abeba08be58a91e6a783eba097eb9d8ce8bca7e7b2beeba097e986abeba08b42
UHC 劑槃렗띌輧精렗醫렋劑槃렗띌輧精렗醫렋B 11110000101001011101101011101001100011101010110010110110111010011101110010111110111011111111000110001110101011001110110010100010100011101010001011110000101001011101101011101001100011101010110010110110111010011101110010111110111011111111000110001110101011001110110010100010100011101010001001000010 f0a5dae98eacb6e9dcbeeff18eaceca28ea2f0a5dae98eacb6e9dcbeeff18eaceca28ea242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)