To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????G??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100011100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f473f3f3f3f3f3f3f
SJIS-WIN 猷??循?5???猷???G猷??循?5誼 1001011101010001001111110011111110001111011110100011111110000010010101000011111100111111001111111001011101010001001111110011111100111111010001111001011101010001001111110011111110001111011110100011111110000010010101001000101101100010 97513f3f8f7a3f82543f3f3f97513f3f3f4797513f3f8f7a3f82548b62
EUC-JP 猷??循?5???猷???G猷??循?5誼 1100110110110010001111110011111110111101110110110011111110100011101101010011111100111111001111111100110110110010001111110011111100111111010001111100110110110010001111110011111110111101110110110011111110100011101101011011010111000011 cdb23f3fbddb3fa3b53f3f3fcdb23f3f3f47cdb23f3fbddb3fa3b5b5c3
UTF-8 猷띠퐪循용5利썬궞猷댁빴짚G猷띠퐪循용5誼 11100111100011001011011111101011100111011010000011101101100100001010101011100101101111101010101011101100100110101010100111101111101111001001010111101111101001111001110111101100100011011010110011101010101101101001111011100111100011001011011111101011100011001000000111101011101110011011010011101100101001111001101001000111111001111000110010110111111010111001110110100000111011011001000010101010111001011011111010101010111011001001101010101001111011111011110010010101111010001010101010111100 e78cb7eb9da0ed90aae5beaaec9aa9efbc95efa79dec8daceab69ee78cb7eb8c81ebb9b4eca79a47e78cb7eb9da0ed90aae5beaaec9aa9efbc95e8aabc
UHC 猷띠퐪循용5利썬궞猷댁빴짚G猷띠퐪循용5誼 1110101110100011101101101110110010111101100100111110001011100000101111111110101110100011101101011110110010100110101111011110001110000010101100011110101110100011101101001110110010111011101001101100001010100100010001111110101110100011101101101110110010111101100100111110001011100000101111111110101110100011101101011110101111111110 eba3b6ecbd93e2e0bfeba3b5eca6bde382b1eba3b4ecbba6c2a447eba3b6ecbd93e2e0bfeba3b5ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)