To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??巽θ?惟??疫??爰??揄ш?吾?? 111000011001111100111111001111111001001001000110100000111100011000111111100010001101001000111111001111111000100101110101001111110011111111100000101001110011111100111111100111011000100110000100100010100011111110001100111000010011111100111111 e19f3f3f924683c63f88d23f3f89753f3fe0a73f3f9d89848a3f8ce13f3f
EUC-JP 癲??巽θ?惟??疫??爰??揄ш?吾?? 111000101010000100111111001111111100001110100111101001101100100000111111101100001101010000111111001111111011000111010110001111110011111111100000101010010011111100111111110110011110100110100111111010100011111110111000111000110011111100111111 e2a13f3fc3a7a6c83fb0d43f3fb1d63f3fe0a93f3fd9e9a7ea3fb8e33f3f
UTF-8 癲덈챶巽θ굜惟㏃꽑疫뀀툝爰뉒븨揄ш탽吾멸퓚 11100111100110011011001011101011100011011000100011101100101100011011011011100101101101111011110111001110101110001110101010110101100111001110011010000011100111111110001110001111100000111110101010111101100100011110011110010110101010111110101110000000100000001110110110001000100111011110011110001000101100001110101110001001100100101110101110111000101010001110011010001111100001001101000110001000111011011000001110111101111001011001000010111110111010111010100110111000111011011001001110011010 e799b2eb8d88ecb1b6e5b7bdceb8eab59ce6839fe38f83eabd91e796abeb8080ed889de788b0eb8992ebb8a8e68f84d188ed83bde590beeba9b8ed939a
UHC 癲덈챶巽θ굜惟㏃꽑疫뀀툝爰뉒븨揄ш탽吾멸퓚 111011111010011010001000111010111010101010000011111000011101111010100101111010001000001010000100111010101110111010100111111011001000010010100000111001101011100110110010111010111011100010010100111010101011101010000111111001111001010110010001111010101111000110101100111010101011010110011001111001111110111010111000111010101011111110000101 efa688ebaa83e1dea5e88284eaeea7ec84a0e6b9b2ebb894eaba87e79591eaf1aceab599e7eeb8eabf85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)