To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲磁偲ハナ叱偲璽偲ニト執偲ニナ磁偲磁偲 1000111011000011100011101010010110001110110000111100101011000101100011101011011010001110110000111000111010100011100011101100001111000110110001001000111010110111100011101100001111000110110001011000111010100101100011101100001110001110101001011000111011000011 8ec38ea58ec3cac58eb68ec38ea38ec3c6c48eb78ec3c6c58ea58ec38ea58ec3
EUC-JP 偲磁偲ハナ叱偲璽偲ニト執偲ニナ磁偲磁偲 1011110011000101101111001010011110111100110001011000111011001010100011101100010110111100101110001011110011000101101111001010010110111100110001011000111011000110100011101100010010111100101110011011110011000101100011101100011010001110110001011011110010100111101111001100010110111100101001111011110011000101 bcc5bca7bcc58eca8ec5bcb8bcc5bca5bcc58ec68ec4bcb9bcc58ec68ec5bca7bcc5bca7bcc5
UTF-8 偲磁偲ハナ叱偲璽偲ニト執偲ニナ磁偲磁偲 111001011000000110110010111001111010001110000001111001011000000110110010111011111011111010001010111011111011111010000101111001011000111110110001111001011000000110110010111001111001001010111101111001011000000110110010111011111011111010000110111011111011111010000100111001011001111110110111111001011000000110110010111011111011111010000110111011111011111010000101111001111010001110000001111001011000000110110010111001111010001110000001111001011000000110110010 e581b2e7a381e581b2efbe8aefbe85e58fb1e581b2e792bde581b2efbe86efbe84e59fb7e581b2efbe86efbe85e7a381e581b2e7a381e581b2
UHC ?磁???叱?璽???執???磁?磁? 00111111111011011011100000111111001111110011111111110010111010100011111111011111110111100011111100111111001111111111001011111011001111110011111100111111111011011011100000111111111011011011100000111111 3fedb83f3f3ff2ea3fdfde3f3f3ff2fb3f3f3fedb83fedb83f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)