To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??汚??維??喩??瑤?? 0011111100111111001111110011111100111111001111111001101001100111001111110011111110001001100110000011111100111111100010001101101100111111001111111001101001100111001111110011111111101010101000100011111100111111 3f3f3f3f3f3f9a673f3f89983f3f88db3f3f9a673f3feaa23f3f
EUC-JP ???佾??喩??汚??維??喩??瑤?? 00111111001111110011111110001111101100001111101100111111001111111101001111001000001111110011111110110001111110000011111100111111101100001101110100111111001111111101001111001000001111110011111111110100101001000011111100111111 3f3f3f8fb0fb3f3fd3c83f3fb1f83f3fb0dd3f3fd3c83f3ff4a43f3f
UTF-8 麗몃쓷佾듿칰喩믪쪞汚살쉹維볟춢喩롫걙瑤노뵭 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111010111001001110111111111011001011100110110000111001011001011010101001111010111010111110101010111011001010101010011110111001101011000110011010111011001000001010110100111011001000100110111001111001111011011010101101111010111011001110011111111011001011011010100010111001011001011010101001111010111010000110101011111010101011000110011001111001111001000110100100111010111000010110111000111010111011010110101101 efa688ebaa83ec93b7e4bdbeeb93bfecb9b0e596a9ebafaaecaa9ee6b19aec82b4ec89b9e7b6adebb39fecb6a2e596a9eba1abeab199e791a4eb85b8ebb5ad
UHC 麗몃쓷佾듿칰喩믪쪞汚살쉹維볟춢喩롫걙瑤노뵭 111001101011000010111000111010111001110110010100111011001110101110001010111001011010111110000011111010101110011110010010111011001010010110010111111001111111110110111011111011001001101010001111111010111010101110010011111001011010110110000011111010101110011110001110111010111000000110000011111010001111110110110011111010111001010010101011 e6b0b8eb9d94eceb8ae5af83eae792eca597e7fdbbec9a8febab93e5ad83eae78eeb8183e8fdb3eb94ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)