To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??歪?;維??喩??筌?? 001111110011111100111111001111110011111100111111100110100110011100111111001111111001100001100011001111111000000101000111100010001101101100111111001111111001101001100111001111110011111111100010101000110011111100111111 3f3f3f3f3f3f9a673f3f98633f814788db3f3f9a673f3fe2a33f3f
EUC-JP ???佾??喩??歪?;維??喩??筌?? 0011111100111111001111111000111110110000111110110011111100111111110100111100100000111111001111111100111111000100001111111010000110101000101100001101110100111111001111111101001111001000001111110011111111100100101001010011111100111111 3f3f3f8fb0fb3f3fd3c83f3fcfc43fa1a8b0dd3f3fd3c83f3fe4a53f3f
UTF-8 麗몃쓷佾듿칰喩묒쉔歪묐;維볟춢喩롫걖筌덉찊 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111010111001001110111111111011001011100110110000111001011001011010101001111010111010110010010010111011001000100110010100111001101010110110101010111010111010110010010000111011111011110010011011111001111011011010101101111010111011001110011111111011001011011010100010111001011001011010101001111010111010000110101011111010101011000110010110111001111010110110001100111010111000110110001001111011001011000010001010 efa688ebaa83ec93b7e4bdbeeb93bfecb9b0e596a9ebac92ec8994e6adaaebac90efbc9be7b6adebb39fecb6a2e596a9eba1abeab196e7ad8ceb8d89ecb08a
UHC 麗몃쓷佾듿칰喩묒쉔歪묐;維볟춢喩롫걖筌덉찊 111001101011000010111000111010111001110110010100111011001110101110001010111001011010111110000011111010101110011110010001111011001011110110101000111010001110000010010001111010111010001110111011111010111010101110010011111001011010110110000011111010101110011110001110111010111000000110000001111011111010011110001000111011001010100110001110 e6b0b8eb9d94eceb8ae5af83eae791ecbda8e8e091eba3bbebab93e5ad83eae78eeb8181efa788eca98e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)