What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ????轅????????儒??億?? 1110010011101000100000101110101000111111001111110011111100111111111001110111011000111111001111110011111100111111001111110011111100111111001111111000111011110010001111110011111110001001101011010011111100111111 e4e882ea3f3f3f3fe7763f3f3f3f3f3f3f3f8ef23f3f89ad3f3f
EUC-JP 蒻れ?佾??轅????????儒??億?? 11101000111010101010010011101100001111111000111110110000111110110011111100111111111011011101011100111111001111110011111100111111001111110011111100111111001111111011110011110100001111110011111110110010101011110011111100111111 e8eaa4ec3f8fb0fb3f3fedd73f3f3f3f3f3f3f3fbcf43f3fb2af3f3f
UTF-8 蒻れ슙佾롩솻轅곌숲若뗫쵐留껆땸儒대젚億뤿뜥 111010001001001010111011111000111000001010001100111011001000101010011001111001001011110110111110111010111010000110101001111011001000011010111011111010001011110110000101111010101011001110001100111011001000100010110010111011111010010110110100111010111001011110101011111011001011010110010000111011111010011110001101111010101011101110000110111010111001010110111000111001011000010010010010111010111000110010000000111011001010000010011010111001011000010010000100111010111010010010111111111010111001110010100101 e892bbe3828cec8a99e4bdbeeba1a9ec86bbe8bd85eab38cec88b2efa5b4eb97abecb590efa78deabb86eb95b8e58492eb8c80eca09ae58484eba4bfeb9ca5
UHC 蒻れ슙佾롩솻轅곌숲若뗫쵐留껆땸儒대젚億뤿뜥 111001011011011010101010111011001001101010100111111011001110101110001110111010011001100110110000111010101011111110110000111010101011110110100011111001011010111010001011111010111010110010010010111010111010011110000011111001111000101110001110111010101110001110110100111010111010000010010110111001011110001010001111111010111000110110101000 e5b6aaec9aa7eceb8ee999b0eabfb0eabda3e5ae8bebac92eba783e78b8eeae3b4eba096e5e28feb8da8

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
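
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. It assumes that the Python codec names `cp932`, `euc_jp`, and `cp949` correspond to the page's SJIS-WIN, EUC-JP, and UHC labels, and that characters a charset cannot represent are replaced with "?" (0x3f), as the table appears to do. The function name `encode_table` is hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("れ"):
        print(label, bits, hex_string)
```

For the single character "れ", this yields `e3828c` for UTF-8, `82ea` for SJIS-WIN, and `a4ec` for EUC-JP, which agrees with the corresponding bytes in the table; ISO-8859-1 cannot represent the character and falls back to `3f`.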