What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?ъ?姨?????夷????????姨?? 00111111100001001000110000111111100110110100100000111111001111110011111100111111001111111000100011001110001111110011111100111111001111110011111100111111001111110011111110011011010010000011111100111111 3f848c3f9b483f3f3f3f3f88ce3f3f3f3f3f3f3f3f9b483f3f
EUC-JP ?ъ?姨?????夷????????姨?? 00111111101001111110110000111111110101011010100100111111001111110011111100111111001111111011000011010000001111110011111100111111001111110011111100111111001111110011111111010101101010010011111100111111 3fa7ec3fd5a93f3f3f3f3fb0d03f3f3f3f3f3f3f3fd5a93f3f
UTF-8 泥ъ㏈姨먯껄吏㏃쮰夷섏껄吏㏃쭬淋덉콛姨먯쮮 1110111110100111101000111101000110001010111000111000111110001000111001011010011110101000111010111010100010101111111010101011101110000100111011111010011110011110111000111000111110000011111011001010111010110000111001011010010010110111111011001000010010001111111010101011101110000100111011111010011110011110111000111000111110000011111011001010110110101100111011111010011110110101111010111000110110001001111011001011110110011011111001011010011110101000111010111010100010101111111011001010111010101110 efa7a3d18ae38f88e5a7a8eba8afeabb84efa79ee38f83ecaeb0e5a4b7ec848feabb84efa79ee38f83ecadacefa7b5eb8d89ecbd9be5a7a8eba8afecaeae
UHC 泥ъ㏈姨먯껄吏㏃쮰夷섏껄吏㏃쭬淋덉콛姨먯쮮 111011001011001010101100111011001010011110111100111011001010100110010000111011001011001010101100111011001010011110100111111011001010100010001101111011001010100010011000111011001011001010101100111011001010011110100111111011001010011110100000111011001111100010001000111011001011000110010100111011001010100110010000111011001010100010001011 ecb2aceca7bceca990ecb2aceca7a7eca88deca898ecb2aceca7a7eca7a0ecf888ecb194eca990eca88b

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
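
The conversion shown in the table can be approximated offline. What follows is a minimal sketch in Python, not the code behind this page: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, and the mapping of each charset label to a Python codec (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each row is produced from the same Unicode input, so differences between rows reflect only the charset, not the input text.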