What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 勇??猷??瑜??亦??誼??誘る?塋 100101110100010100111111001111111001011101010001001111110011111111100000111011110011111100111111100101101001001000111111001111111000101101100010001111110011111110010111010101011000001011101001001111111001101011001000 97453f3f97513f3fe0ef3f3f96923f3f8b623f3f975582e93f9ac8
EUC-JP 勇??猷??瑜??亦??誼??誘る?塋 110011011010011000111111001111111100110110110010001111110011111111100000111100010011111100111111110010111111001000111111001111111011010111000011001111110011111111001101101101101010010011101011001111111101010011001010 cda63f3fcdb23f3fe0f13f3fcbf23f3fb5c33f3fcdb6a4eb3fd4ca
UTF-8 勇싳뮇猷녷젔瑜끹럷亦뱀궡誼섇짃誘る윪塋 111001011000101110000111111011001000101110110011111010111010111010000111111001111000110010110111111010111000010110110111111011001010000010010100111001111001000110011100111010111000000110111001111010111001111110110111111001001011101010100110111010111011000110000000111010101011011010100001111010001010101010111100111011001000010010000111111011001010011110000011111010001010101010011000111000111000001010001011111011001001110010101010111001011010000110001011 e58b87ec8bb3ebae87e78cb7eb85b7eca094e7919ceb81b9eb9fb7e4baa6ebb180eab6a1e8aabcec8487eca783e8aa98e3828bec9caae5a18b
UHC 勇싳뮇猷녷젔瑜끹럷亦뱀궡誼섇짃誘る윪塋 1110100110111000100110101110110010010010100101101110101110100011100001101110011010100000100100101110101110100101100001011110001110001110100101101110011010110010101110011110110010000010101101001110101111111110100110001110010110100011100100111110101110101111101010101110101110011111101010011110011110101011 e9b89aec9296eba386e6a092eba585e38e96e6b2b9ec82b4ebfe98e5a393ebafaaeb9fa9e7ab

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
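
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced by "?" (0x3f), as in the ISO-8859-1 row. The helper name `encode_row` is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for name, codec in CHARSETS.items():
        bits, hex_string = encode_row("る", codec)
        print(name, "る", bits, hex_string)
```

For the single character "る", this yields e3828b in UTF-8, 82e9 in SJIS-WIN, and a4eb in EUC-JP, the same byte sequences that appear for that character in the rows above.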