What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????@R?????????DQB 0011111100111111001111110011111100111111001111110011111100111111001111110100000001010010001111110011111100111111001111110011111100111111001111110011111100111111010001000101000101000010 3f3f3f3f3f3f3f3f3f40523f3f3f3f3f3f3f3f3f445142
SJIS-WIN ?????????@R?????????DQB 0011111100111111001111110011111100111111001111110011111100111111001111110100000001010010001111110011111100111111001111110011111100111111001111110011111100111111010001000101000101000010 3f3f3f3f3f3f3f3f3f40523f3f3f3f3f3f3f3f3f445142
EUC-JP ?????????@R?????????DQB 0011111100111111001111110011111100111111001111110011111100111111001111110100000001010010001111110011111100111111001111110011111100111111001111110011111100111111010001000101000101000010 3f3f3f3f3f3f3f3f3f40523f3f3f3f3f3f3f3f3f445142
UTF-8 챔혪쨩챘짧혣챙혟혻@R챔혪쨩챘짧혣챙혟혻DQB 1110110010110001100101001110110110011000101010101110110010101000101010011110110010110001100110001110110010100111101001111110110110011000101000111110110010110001100110011110110110011000100111111110110110011000101110110100000001010010111011001011000110010100111011011001100010101010111011001010100010101001111011001011000110011000111011001010011110100111111011011001100010100011111011001011000110011001111011011001100010011111111011011001100010111011010001000101000101000010 ecb194ed98aaeca8a9ecb198eca7a7ed98a3ecb199ed989fed98bb4052ecb194ed98aaeca8a9ecb198eca7a7ed98a3ecb199ed989fed98bb445142
UHC 챔혪쨩챘짧혣챙혟혻@R챔혪쨩챘짧혣챙혟혻DQB 1100001110101000110000101001001011000010101110111100001110101011110000101010101011000010100011001100001110101100110000101000100111000010101000000100000001010010110000111010100011000010100100101100001010111011110000111010101111000010101010101100001010001100110000111010110011000010100010011100001010100000010001000101000101000010 c3a8c292c2bbc3abc2aac28cc3acc289c2a04052c3a8c292c2bbc3abc2aac28cc3acc289c2a0445142

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
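
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page; it is a minimal illustration under stated assumptions. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is how the `3f` runs in the table could arise.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("@R").items():
        print(label, bits, hex_string)
```

Because `@` and `R` are ASCII, all five charsets in this sketch encode them to the same bytes (`40 52`), which matches the `4052` run visible in every row of the table.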