To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 也よ?押?????瘟????オ言??汝?ク 1001011011100111100000101110011000111111100010011001111100111111001111110011111100111111001111111110000110001001001111110011111100111111001111111000001101001001100011001011111000111111001111111001001111110000001111111000001101001110 96e782e63f899f3f3f3f3f3fe1893f3f3f3f83498cbe3f3f93f03f834e
EUC-JP 也よ?押?????瘟????オ言??汝?ク 1100110011101001101001001110100000111111101100101010000100111111001111110011111100111111001111111110000111101001001111110011111100111111001111111010010110101010101110001100000000111111001111111100011011110010001111111010010110101111 cce9a4e83fb2a13f3f3f3f3fe1e93f3f3f3fa5aab8c03f3fc6f23fa5af
UTF-8 也よ쒼押뜻짎勵잒큹瘟룟쩀呂잏オ言됪쓳汝싩ク 111001001011100110011111111000111000001010001000111011001001001010111100111001101000101010111100111010111001110010111011111011001010011110001110111011111010010110111111111011001001111010010010111011011000000110111001111001111001100010011111111010111010001110011111111011001010100110000000111011111010011010000000111011001001111010001111111000111000001010101010111010001010100010000000111010111001000010101010111011001001001110110011111001101011000110011101111011001000101110101001111000111000001010101111 e4b99fe38288ec92bce68abceb9cbbeca78eefa5bfec9e92ed81b9e7989feba39feca980efa680ec9e8fe382aae8a880eb90aaec93b3e6b19dec8ba9e382af
UHC 也よ쒼押뜻짎勵잒큹瘟룟쩀呂잏オ言됪쓳汝싩ク 111001011010010110101010111010001011111010110000111001001110001110110110111001101010001110011010111001011111101010011111111010001011010010001000111010001011000010110111111001011010010010011010111001011111101110011111111001111010101110101010111001011110101110001001111001101001110110010001111001101010001110011010111001111010101110101111 e5a5aae8beb0e4e3b6e6a39ae5fa9fe8b488e8b0b7e5a49ae5fb9fe7abaae5eb89e69d91e6a39ae7abaf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)