To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷?┰循?+?γ?猷ユ?猷?┰循?+仰 10010111010100010011111110000100101110111000111101111010001111111000000101111011001111111000001111000001001111111001011101010001100000111000011000111111100101110101000100111111100001001011101110001111011110100011111110000001011110111000101111000010 97513f84bb8f7a3f817b3f83c13f975183863f97513f84bb8f7a3f817b8bc2
EUC-JP 猷?┰循?+?γ?猷ユ?猷?┰循?+仰 11001101101100100011111110101000101111011011110111011011001111111010000111011100001111111010011011000011001111111100110110110010101001011110011000111111110011011011001000111111101010001011110110111101110110110011111110100001110111001011011011000100 cdb23fa8bdbddb3fa1dc3fa6c33fcdb2a5e63fcdb23fa8bdbddb3fa1dcb6c4
UTF-8 猷띠┰循용+嶪γ궔猷ユ젗猷띠┰循용+仰 1110011110001100101101111110101110011101101000001110001010010100101100001110010110111110101010101110110010011010101010011110111110111100100010111110010110110110101010101100111010110011111010101011011010010100111001111000110010110111111000111000001110100110111011001010000010010111111001111000110010110111111010111001110110100000111000101001010010110000111001011011111010101010111011001001101010101001111011111011110010001011111001001011101110110000 e78cb7eb9da0e294b0e5beaaec9aa9efbc8be5b6aaceb3eab694e78cb7e383a6eca097e78cb7eb9da0e294b0e5beaaec9aa9efbc8be4bbb0
UHC 猷띠┰循용+嶪γ궔猷ユ젗猷띠┰循용+仰 1110101110100011101101101110110010100110101111011110001011100000101111111110101110100011101010111110010111110101101001011110001110000010101010011110101110100011101010111110011010100000100100111110101110100011101101101110110010100110101111011110001011100000101111111110101110100011101010111110010011100110 eba3b6eca6bde2e0bfeba3abe5f5a5e382a9eba3abe6a093eba3b6eca6bde2e0bfeba3abe4e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)