To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??肄??誘ゃ?h筌??肄??誘ゃ? 111000101010001100111111001111111110001111100101001111110011111110010111010101011000001011100001001111110110100011100010101000110011111100111111111000111110010100111111001111111001011101010101100000101110000100111111 e2a33f3fe3e53f3f975582e13f68e2a33f3fe3e53f3f975582e13f
EUC-JP 筌™?肄??誘ゃ?h筌™?肄??誘ゃ? 11100100101001011000111110100010111011110011111111100110111001110011111100111111110011011011011010100100111000110011111101101000111001001010010110001111101000101110111100111111111001101110011100111111001111111100110110110110101001001110001100111111 e4a58fa2ef3fe6e73f3fcdb6a4e33f68e4a58fa2ef3fe6e73f3fcdb6a4e33f
UTF-8 筌™뫁肄좑㏊誘ゃ렃h筌™뫁肄좑㏊誘ゃ렃 11100111101011011000110011100010100001001010001011101011101010111000000111101000100000101000010011101100101000101001000111100011100011111000101011101000101010101001100011100011100000101000001111101011101000001000001101101000111001111010110110001100111000101000010010100010111010111010101110000001111010001000001010000100111011001010001010010001111000111000111110001010111010001010101010011000111000111000001010000011111010111010000010000011 e7ad8ce284a2ebab81e88284eca291e38f8ae8aa98e38283eba08368e7ad8ce284a2ebab81e88284eca291e38f8ae8aa98e38283eba083
UHC 筌™뫁肄좑㏊誘ゃ렃h筌™뫁肄좑㏊誘ゃ렃 11101111101001111010001011100010100100011010010111101100101111011010000011101111101001111011010111101011101011111010101011100011100011101001110101101000111011111010011110100010111000101001000110100101111011001011110110100000111011111010011110110101111010111010111110101010111000111000111010011101 efa7a2e291a5ecbda0efa7b5ebafaae38e9d68efa7a2e291a5ecbda0efa7b5ebafaae38e9d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)