To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN ???誼??碎??}v???誼??碎??}vB 001111110011111100111111100010110110001000111111001111111110000111101010001111110011111101111101011101100011111100111111001111111000101101100010001111110011111111100001111010100011111100111111011111010111011001000010 3f3f3f8b623f3fe1ea3f3f7d763f3f3f8b623f3fe1ea3f3f7d7642
EUC-JP ???誼??碎??}v???誼??碎??}vB 001111110011111100111111101101011100001100111111001111111110001011101100001111110011111101111101011101100011111100111111001111111011010111000011001111110011111111100010111011000011111100111111011111010111011001000010 3f3f3fb5c33f3fe2ec3f3f7d763f3f3fb5c33f3fe2ec3f3f7d7642
UTF-8 劣꾨챶誼됧슖碎몄젶}v劣꾨챶誼됧슖碎몄젶}vB 1110111110100110100111011110101010111110101010001110110010110001101101101110100010101010101111001110101110010000101001111110110010001010100101101110011110100010100011101110101110101010100001001110110010100000101101100111110101110110111011111010011010011101111010101011111010101000111011001011000110110110111010001010101010111100111010111001000010100111111011001000101010010110111001111010001010001110111010111010101010000100111011001010000010110110011111010111011001000010 efa69deabea8ecb1b6e8aabceb90a7ec8a96e7a28eebaa84eca0b67d76efa69deabea8ecb1b6e8aabceb90a7ec8a96e7a28eebaa84eca0b67d7642
UHC 劣꾨챶誼됧슖碎몄젶}v劣꾨챶誼됧슖碎몄젶}vB 1110011011101011100001001110101110101010100000111110101111111110100010011110010110011010101001011110000111101111101110001110110010100000101010100111110101110110111001101110101110000100111010111010101010000011111010111111111010001001111001011001101010100101111000011110111110111000111011001010000010101010011111010111011001000010 e6eb84ebaa83ebfe89e59aa5e1efb8eca0aa7d76e6eb84ebaa83ebfe89e59aa5e1efb8eca0aa7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)