To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????v????????vB 00111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f7642
SJIS-WIN 巐呱讌簿谿シ巐怡v巐呱讌簿谿シ巐怡vB 111110101011011010011001111010111110011010100101100101011110101111100110101011101011110011111010101101101001110001111101011101101111101010110110100110011110101111100110101001011001010111101011111001101010111010111100111110101011011010011100011111010111011001000010 fab699ebe6a595ebe6aebcfab69c7d76fab699ebe6a595ebe6aebcfab69c7d7642
EUC-JP 巐呱讌簿谿シ巐怡v巐呱讌簿谿シ巐怡vB 100011111011101111111001110100101110110111101100101001111100101011101101111011001011000010001110101111001000111110111011111110011101011111011110011101101000111110111011111110011101001011101101111011001010011111001010111011011110110010110000100011101011110010001111101110111111100111010111110111100111011001000010 8fbbf9d2edeca7caedecb08ebc8fbbf9d7de768fbbf9d2edeca7caedecb08ebc8fbbf9d7de7642
UTF-8 巐呱讌簿谿シ巐怡v巐呱讌簿谿シ巐怡vB 111001011011011110010000111001011001000110110001111010001010111010001100111001111011000010111111111010001011000010111111111011111011110110111100111001011011011110010000111001101000000010100001011101101110010110110111100100001110010110010001101100011110100010101110100011001110011110110000101111111110100010110000101111111110111110111101101111001110010110110111100100001110011010000000101000010111011001000010 e5b790e591b1e8ae8ce7b0bfe8b0bfefbdbce5b790e680a176e5b790e591b1e8ae8ce7b0bfe8b0bfefbdbce5b790e680a17642
UHC ?呱?簿谿??怡v?呱?簿谿??怡vB 001111111100110110110010001111111101110110101101110011011010110000111111001111111110110010101110011101100011111111001101101100100011111111011101101011011100110110101100001111110011111111101100101011100111011001000010 3fcdb23fddadcdac3f3fecae763fcdb23fddadcdac3f3fecae7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)