To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 姨??姨?????Lh姨??姨?????L 10011011010010000011111100111111100110110100100000111111001111110011111100111111001111110100110001101000100110110100100000111111001111111001101101001000001111110011111100111111001111110011111101001100 9b483f3f9b483f3f3f3f3f4c689b483f3f9b483f3f3f3f3f4c
EUC-JP 姨??姨?????Lh姨??姨?????L 11010101101010010011111100111111110101011010100100111111001111110011111100111111001111110100110001101000110101011010100100111111001111111101010110101001001111110011111100111111001111110011111101001100 d5a93f3fd5a93f3f3f3f3f4c68d5a93f3fd5a93f3f3f3f3f4c
UTF-8 姨뚰슋姨뚯쭨梨뷀쉮Lh姨뚰슋姨뚯쭨梨뷀쉮L 111001011010011110101000111010111001101010110000111011001000101010001011111001011010011110101000111010111001101010101111111011001010110110101000111011111010011110100010111010111011011110000000111011001000100110101110010011000110100011100101101001111010100011101011100110101011000011101100100010101000101111100101101001111010100011101011100110101010111111101100101011011010100011101111101001111010001011101011101101111000000011101100100010011010111001001100 e5a7a8eb9ab0ec8a8be5a7a8eb9aafecada8efa7a2ebb780ec89ae4c68e5a7a8eb9ab0ec8a8be5a7a8eb9aafecada8efa7a2ebb780ec89ae4c
UHC 姨뚰슋姨뚯쭨梨뷀쉮Lh姨뚰슋姨뚯쭨梨뷀쉮L 111011001010100110001100111011011001101010011011111011001010100110001100111011001010011110011100111011001011000110010100111011011001101010000110010011000110100011101100101010011000110011101101100110101001101111101100101010011000110011101100101001111001110011101100101100011001010011101101100110101000011001001100 eca98ced9a9beca98ceca79cecb194ed9a864c68eca98ced9a9beca98ceca79cecb194ed9a864c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)