To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???維??幽??巍ル???┬碎????? 001111110011111100111111100010001101101100111111001111111001011101001000001111110011111110011011110110011000001110001011001111110011111100111111100001001010011011100001111010100011111100111111001111110011111100111111 3f3f3f88db3f3f97483f3f9bd9838b3f3f3f84a6e1ea3f3f3f3f3f
EUC-JP ???維??幽??巍ル?庾?┬碎??孼?? 00111111001111110011111110110000110111010011111100111111110011011010100100111111001111111101011011011011101001011110101100111111100011111011110011001110001111111010100010101000111000101110110000111111001111111000111110111010110000110011111100111111 3f3f3fb0dd3f3fcda93f3fd6dba5eb3f8fbcce3fa8a8e2ec3f3f8fbac33f3f
UTF-8 嶺뚭봇維쒏룚幽귢텫巍ル쵐庾듸┬碎멥돘孼뽰벧 111011111010011010101011111010111001101010101101111010111011010010000111111001111011011010101101111011001001001010001111111010111010001110011010111001011011100110111101111010101011011110100010111011011000010110101011111001011011011110001101111000111000001110101011111011001011010110010000111001011011101010111110111010111001001110111000111000101001010010101100111001111010001010001110111010111010100110100101111010111000111110011000111001011010110110111100111010111011110110110000111010111011001010100111 efa6abeb9aadebb487e7b6adec928feba39ae5b9bdeab7a2ed85abe5b78de383abecb590e5babeeb93b8e294ace7a28eeba9a5eb8f98e5adbcebbdb0ebb2a7
UHC 嶺뚭봇維쒏룚幽귢텫巍ル쵐庾듸┬碎멥돘孼뽰벧 111001111010110110001100111010101011101010111111111010111010101110011100111001101000111110010110111010101110101110000010111010101011011010011111111010001110010010101011111010111010110010010010111010101110110010110101111011111010011010101000111000011110111110111000111000111000100110100001111001011110110110010110111011001011101010100110 e7ad8ceababfebab9ce68f96eaeb82eab69fe8e4abebac92eaecb5efa6a8e1efb8e389a1e5ed96ecbaa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)