To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譽??癌?????鳶?????央??瘟??倭 11100110101000110011111100111111100010101110000000111111001111110011111100111111001111111001001111001110001111110011111100111111001111110011111110001001100110110011111100111111111000011000100100111111001111111001100001100000 e6a33f3f8ae03f3f3f3f3f93ce3f3f3f3f3f899b3f3fe1893f3f9860
EUC-JP 譽??癌?????鳶?????央??瘟??倭 11101100101001010011111100111111101101001110001000111111001111110011111100111111001111111100011011010000001111110011111100111111001111110011111110110001111110110011111100111111111000011110100100111111001111111100111111000001 eca53f3fb4e23f3f3f3f3fc6d03f3f3f3f3fb1fb3f3fe1e93f3fcfc1
UTF-8 譽긷춼癌닸짎嶪뤻걶鳶멨끀嶪뤷렘央뉐뎴瘟룡릍倭 111010001010110110111101111010101011100010110111111011001011011010111100111001111001100110001100111010111000101110111000111011001010011110001110111001011011011010101010111010111010010010111011111010101011000110110110111010011011001110110110111010111010100110101000111010111000000110000000111001011011011010101010111010111010010010110111111010111010000010011000111001011010010010101110111010111000100110010000111010111000111010110100111001111001100010011111111010111010001110100001111010111010011010001101111001011000000010101101 e8adbdeab8b7ecb6bce7998ceb8bb8eca78ee5b6aaeba4bbeab1b6e9b3b6eba9a8eb8180e5b6aaeba4b7eba098e5a4aeeb8990eb8eb4e7989feba3a1eba68de580ad
UHC 譽긷춼癌닸짎嶪뤻걶鳶멨끀嶪뤷렘央뉐뎴瘟룡릍倭 1110011111100010101100011110010110101101100110001110010011011111101101001110011010100011100110101110010111110101100011111110100110000001100111001110011011101001101110001110010110000101101101101110010111110101100011111110010110110111101111011110010011100111100001111110010110001001100001111110100010110000101101111110011010111000101011001110100011011110 e7e2b1e5ad98e4dfb4e6a39ae5f58fe9819ce6e9b8e585b6e5f58fe5b7bde4e787e58987e8b0b7e6b8ace8de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)