To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嶸ヲ巐倣丿巐啄丿巐卓ア麗囓嶸ヲ巐啄丿 111110101011010010100110111110101011011010010101111011011001100010100110111110101011011010010001111011011001100010100110111110101011011010010001111011001011000110010111111011011001101010010111111110101011010010100110111110101011011010010001111011011001100010100110 fab4a6fab695ed98a6fab691ed98a6fab691ecb197ed9a97fab4a6fab691ed98a6
EUC-JP 嶸ヲ巐倣丿巐啄丿巐卓ア麗囓嶸ヲ巐啄丿 100011111011101111110100100011101010011010001111101110111111100111001010111011111101000010101000100011111011101111111001110000101110111111010000101010001000111110111011111110011100001011101110100011101011000111001110111011111101001111110111100011111011101111110100100011101010011010001111101110111111100111000010111011111101000010101000 8fbbf48ea68fbbf9caefd0a88fbbf9c2efd0a88fbbf9c2ee8eb1ceefd3f78fbbf48ea68fbbf9c2efd0a8
UTF-8 嶸ヲ巐倣丿巐啄丿巐卓ア麗囓嶸ヲ巐啄丿 111001011011011010111000111011111011110110100110111001011011011110010000111001011000000010100011111001001011100010111111111001011011011110010000111001011001010110000100111001001011100010111111111001011011011110010000111001011000110110010011111011111011110110110001111010011011101010010111111001011001101110010011111001011011011010111000111011111011110110100110111001011011011110010000111001011001010110000100111001001011100010111111 e5b6b8efbda6e5b790e580a3e4b8bfe5b790e59584e4b8bfe5b790e58d93efbdb1e9ba97e59b93e5b6b8efbda6e5b790e59584e4b8bf
UHC 嶸??倣??啄??卓?麗?嶸??啄? 11100111101011100011111100111111110110111010011100111111001111111111011011110010001111110011111111110110111100010011111111010101111100100011111111100111101011100011111100111111111101101111001000111111 e7ae3f3fdba73f3ff6f23f3ff6f13fd5f23fe7ae3f3ff6f23f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)