To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??意??猷??壓????┐誘??倭 1000100101010001001111110011111110001000110100110011111100111111100101110101000100111111001111111001101011011000001111110011111100111111001111111000010010100010100101110101010100111111001111111001100001100000 89513f3f88d33f3f97513f3f9ad83f3f3f3f84a297553f3f9860
EUC-JP 渦??意??猷??壓??馹?┐誘??倭 10110001101100100011111100111111101100001101010100111111001111111100110110110010001111110011111111010100110110100011111100111111100011111110100110100001001111111010100010100100110011011011011000111111001111111100111111000001 b1b23f3fb0d53f3fcdb23f3fd4da3f3f8fe9a13fa8a4cdb63f3fcfc1
UTF-8 渦깅맧意욄틦猷⑸펳壓믩쓹馹깍┐誘↔뭅倭 111001101011100010100110111010101011100110000101111010111010011110100111111001101000010010001111111011001001101010000100111011011000101110100110111001111000110010110111111000101001000110111000111011011000111010110011111001011010001110010011111010111010111110101001111011001001001110111001111010011010011010111001111010101011100110001101111000101001010010010000111010001010101010011000111000101000011010010100111010111010110110000101111001011000000010101101 e6b8a6eab985eba7a7e6848fec9a84ed8ba6e78cb7e291b8ed8eb3e5a393ebafa9ec93b9e9a6b9eab98de29490e8aa98e28694ebad85e580ad
UHC 渦깅맧意욄틦猷⑸펳壓믩쓹馹깍┐誘↔뭅倭 1110100010111110101100011110101110010000101100001110101111110010100111101110011010111010100100001110101110100011101010011110101110111100100001011110010011100010100100101110101110011101100101011110110011110001101100011110111110100110101001001110101110101111101000011110101010111001101101001110100011011110 e8beb1eb90b0ebf29ee6ba90eba3a9ebbc85e4e292eb9d95ecf1b1efa6a4ebafa1eab9b4e8de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)