To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鵝??茹l???ぃ異???ユぃ苡??茹l?^ 1110101001000000001111110011111111100100101001011000001010001100001111110011111100111111100000101010000110001000110110010011111100111111001111111000001110000110100000101010000111100100100011110011111100111111111001001010010110000010100011000011111101011110 ea403f3fe4a5828c3f3f3f82a188d93f3f3f838682a1e48f3f3fe4a5828c3f5e
EUC-JP 鵝??茹l?旿?ぃ異???ユぃ苡??茹l?^ 11110011101000010011111100111111111010001010011110100011111011000011111110001111110000011111010000111111101001001010001110110000110110110011111100111111001111111010010111100110101001001010001111100111111011110011111100111111111010001010011110100011111011000011111101011110 f3a13f3fe8a7a3ec3f8fc1f43fa4a3b0db3f3f3fa5e6a4a3e7ef3f3fe8a7a3ec3f5e
UTF-8 鵝얜젷茹l풄旿섉ぃ異덁슭溜ユぃ苡뚩쐶茹l뎠^ 11101001101101011001110111101100100101101001110011101100101000001011011111101000100011001011100111101111101111011000110011101101100100101000010011100110100101111011111111101100100001001000100111100011100000011000001111100111100101011011000011101011100011011000000111101100100010101010110111101111101001111000101111100011100000111010011011100011100000011000001111101000100010111010000111101011100110101010100111101100100100001011011011101000100011001011100111101111101111011000110011101011100011101010000001011110 e9b59dec969ceca0b7e88cb9efbd8ced9284e697bfec8489e38183e795b0eb8d81ec8aadefa78be383a6e38183e88ba1eb9aa9ec90b6e88cb9efbd8ceb8ea05e
UHC 鵝얜젷茹l풄旿섉ぃ異덁슭溜ユぃ苡뚩쐶茹l뎠^ 11100100101111011011111011101011101000001010101111100110101010101010001111101100101111101000110011100111111110101001100011100110101010101010001111101100101101101000100011100100101111011011111011101010111111101010101111100110101010101010001111101100101111101000110011101000100111001001100011100110101010101010001111101100101101011011000101011110 e4bdbeeba0abe6aaa3ecbe8ce7fa98e6aaa3ecb688e4bdbeeafeabe6aaa3ecbe8ce89c98e6aaa3ecb5b15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)