What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???毅??儒??[???毅??儒??[^ 00111111001111110011111110001011010000100011111100111111100011101111001000111111001111110101101100111111001111110011111110001011010000100011111100111111100011101111001000111111001111110101101101011110 3f3f3f8b423f3f8ef23f3f5b3f3f3f8b423f3f8ef23f3f5b5e
EUC-JP ???毅??儒??[???毅??儒??[^ 00111111001111110011111110110101101000110011111100111111101111001111010000111111001111110101101100111111001111110011111110110101101000110011111100111111101111001111010000111111001111110101101101011110 3f3f3fb5a33f3fbcf43f3f5b3f3f3fb5a33f3fbcf43f3f5b5e
UTF-8 銳잙끆毅싪뇾儒븐삖[銳잙끆毅싪뇾儒븐삖[^ 111010011000101010110011111011001001111010011001111010111000000110000110111001101010111110000101111011001000101110101010111010111000011110111110111001011000010010010010111010111011100010010000111011001000001010010110010110111110100110001010101100111110110010011110100110011110101110000001100001101110011010101111100001011110110010001011101010101110101110000111101111101110010110000100100100101110101110111000100100001110110010000010100101100101101101011110 e98ab3ec9e99eb8186e6af85ec8baaeb87bee58492ebb890ec82965be98ab3ec9e99eb8186e6af85ec8baaeb87bee58492ebb890ec82965b5e
UHC 銳잙끆毅싪뇾儒븐삖[銳잙끆毅싪뇾儒븐삖[^ 111001111110010110011111111010111000010110111010111010111111011010011010111010001000011110011111111010101110001110111010111011001001100010011010010110111110011111100101100111111110101110000101101110101110101111110110100110101110100010000111100111111110101011100011101110101110110010011000100110100101101101011110 e7e59feb85baebf69ae8879feae3baec989a5be7e59feb85baebf69ae8879feae3baec989a5b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
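
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `encode_table`. The `errors="replace"` setting is also an assumption: it is chosen because the table shows unencodable characters as `?` (byte `3f`).

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("毅[^").items():
        print(label, bits, hexa)
```

For a single character such as 毅, this sketch should give `e6af85` for UTF-8, `8b42` for SJIS-WIN, and `b5a3` for EUC-JP, which agrees with the byte sequences for that character in the table above.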