To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲〓?宥∽?怨??癲?∥誼??音????? 111000011001111110000001101011000011111110010111010001111000000111100100001111111000100110000101001111110011111111100001100111110011111110000001011000011000101101100010001111110011111110001001101110010011111100111111001111110011111100111111 e19f81ac3f974781e43f89853f3fe19f3f81618b623f3f89b93f3f3f3f3f
EUC-JP 癲〓?宥∽?怨??癲?‖誼??音????? 111000101010000110100010101011100011111111001101101010001010001011100110001111111011000111100101001111110011111111100010101000010011111110100001110000101011010111000011001111110011111110110010101110110011111100111111001111110011111100111111 e2a1a2ae3fcda8a2e63fb1e53f3fe2a13fa1c2b5c33f3fb2bb3f3f3f3f3f
UTF-8 癲〓쵇宥∽쭓怨뺤젶癲㏓∥誼쀯쬂音곗젡閭잙껌 111001111001100110110010111000111000000010010011111011001011010110000111111001011010111010100101111000101000100010111101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010110110111001111001100110110010111000111000111110010011111000101000100010100101111010001010101010111100111011001000000010101111111011001010110010000010111010011001111110110011111010101011001110010111111011001010000010100001111011111010011010000110111011001001111010011001111010101011101110001100 e799b2e38093ecb587e5aea5e288bdecad93e680a8ebbaa4eca0b6e799b2e38f93e288a5e8aabcec80afecac82e99fb3eab397eca0a1efa686ec9e99eabb8c
UHC 癲〓쵇宥∽쭓怨뺤젶癲㏓∥誼쀯쬂音곗젡閭잙껌 111011111010011010100001111010111010110010001001111010101110100110100001111011111010011110001011111010101011001110010101111011001010000010101010111011111010011010100111111010111010000110101011111010111111111010010111111011111010011010011001111010111110010110110000111011001010000010011010111001101010110110011111111010111011001010101101 efa6a1ebac89eae9a1efa78beab395eca0aaefa6a7eba1abebfe97efa699ebe5b0eca09ae6ad9febb2ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)