To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 猷??徇?5茵??}v猷??徇?5茵??}vB 10010111010100010011111100111111100111000110110100111111100000100101010011100100100111110011111100111111011111010111011010010111010100010011111100111111100111000110110100111111100000100101010011100100100111110011111100111111011111010111011001000010 97513f3f9c6d3f8254e49f3f3f7d7697513f3f9c6d3f8254e49f3f3f7d7642
EUC-JP 猷??徇?5茵??}v猷??徇?5茵??}vB 11001101101100100011111100111111110101111100111000111111101000111011010111101000101000010011111100111111011111010111011011001101101100100011111100111111110101111100111000111111101000111011010111101000101000010011111100111111011111010111011001000010 cdb23f3fd7ce3fa3b5e8a13f3f7d76cdb23f3fd7ce3fa3b5e8a13f3f7d7642
UTF-8 猷띠툞徇롫5茵껁긽}v猷띠툞徇롫5茵껁긽}vB 1110011110001100101101111110101110011101101000001110110110001000100111101110010110111110100001111110101110100001101010111110111110111100100101011110100010001100101101011110101010111011100000011110101010111000101111010111110101110110111001111000110010110111111010111001110110100000111011011000100010011110111001011011111010000111111010111010000110101011111011111011110010010101111010001000110010110101111010101011101110000001111010101011100010111101011111010111011001000010 e78cb7eb9da0ed889ee5be87eba1abefbc95e88cb5eabb81eab8bd7d76e78cb7eb9da0ed889ee5be87eba1abefbc95e88cb5eabb81eab8bd7d7642
UHC 猷띠툞徇롫5茵껁긽}v猷띠툞徇롫5茵껁긽}vB 1110101110100011101101101110110010111000100101011110001011011111100011101110101110100011101101011110110011100000100000111110001110000011100000010111110101110110111010111010001110110110111011001011100010010101111000101101111110001110111010111010001110110101111011001110000010000011111000111000001110000001011111010111011001000010 eba3b6ecb895e2df8eeba3b5ece083e383817d76eba3b6ecb895e2df8eeba3b5ece083e383817d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)