To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲??猷??轅???λ?毅??攸??億??B 1110000110011111001111110011111110010111010100010011111100111111111001110111011000111111001111110011111110000011110010010011111110001011010000100011111100111111100111011011111100111111001111111000100110101101001111110011111101000010 e19f3f3f97513f3fe7763f3f3f83c93f8b423f3f9dbf3f3f89ad3f3f42
EUC-JP 癲??猷??轅???λ?毅??攸??億??B 1110001010100001001111110011111111001101101100100011111100111111111011011101011100111111001111110011111110100110110010110011111110110101101000110011111100111111110110101100000100111111001111111011001010101111001111110011111101000010 e2a13f3fcdb23f3fedd73f3f3fa6cb3fb5a33f3fdac13f3fb2af3f3f42
UTF-8 癲놁엱猷쀤뭄轅댐펿若λ챶毅쒐춯攸꾪뜑億됱갚B 111001111001100110110010111010111000011010000001111011001001011110110001111001111000110010110111111011001000000010100100111010111010110110000100111010001011110110000101111010111000110010010000111011011000111010111111111011111010010110110100110011101011101111101100101100011011011011100110101011111000010111101100100100101001000011101100101101101010111111100110100101001011100011101010101111101010101011101011100111001001000111100101100001001000010011101011100100001011000111101010101100001001101001000010 e799b2eb8681ec97b1e78cb7ec80a4ebad84e8bd85eb8c90ed8ebfefa5b4cebbecb1b6e6af85ec9290ecb6afe694b8eabeaaeb9c91e58484eb90b1eab09a42
UHC 癲놁엱猷쀤뭄轅댐펿若λ챶毅쒐춯攸꾪뜑億됱갚B 11101111101001101000011011101100100111101000011011101011101000111001011111100100101110011011001111101010101111111011010011101111101111001000111011100101101011101010010111101011101010101000001111101011111101101001110011100111101011011000110011101010111100101000010011101101100011011001010011100101111000101000100111101100101100001011000101000010 efa686ec9e86eba397e4b9b3eabfb4efbc8ee5aea5ebaa83ebf69ce7ad8ceaf284ed8d94e5e289ecb0b142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)