Which bit string is a character (or a string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 陰瀨??音??謬n}陰瀨??音??謬n{^ 1000100101000001111110110101000000111111001111111000100110111001001111110011111110010101010101000110111001111101100010010100000111111011010100000011111100111111100010011011100100111111001111111001010101010100011011100111101101011110 8941fb503f3f89b93f3f95546e7d8941fb503f3f89b93f3f95546e7b5e
EUC-JP 陰???音??謬n}陰???音??謬n{^ 101100011010001000111111001111110011111110110010101110110011111100111111110010011011010101101110011111011011000110100010001111110011111100111111101100101011101100111111001111111100100110110101011011100111101101011110 b1a23f3f3fb2bb3f3fc9b56e7db1a23f3f3fb2bb3f3fc9b56e7b5e
UTF-8 陰瀨렦렏音ㅽ렋謬n}陰瀨렦렏音ㅽ렋謬n{^ 1110100110011001101100001110011110000000101010001110101110100000101001101110101110100000100011111110100110011111101100111110001110000101101111011110101110100000100010111110100010101100101011000110111001111101111010011001100110110000111001111000000010101000111010111010000010100110111010111010000010001111111010011001111110110011111000111000010110111101111010111010000010001011111010001010110010101100011011100111101101011110 e999b0e780a8eba0a6eba08fe99fb3e385bdeba08be8acac6e7de999b0e780a8eba0a6eba08fe99fb3e385bdeba08be8acac6e7b5e
UHC 陰瀨렦렏音ㅽ렋謬n}陰瀨렦렏音ㅽ렋謬n{^ 11101011111001001101011011101110100011101011010110001110101001011110101111100101101001001110110110001110101000101101011110111101011011100111110111101011111001001101011011101110100011101011010110001110101001011110101111100101101001001110110110001110101000101101011110111101011011100111101101011110 ebe4d6ee8eb58ea5ebe5a4ed8ea2d7bd6e7debe4d6ee8eb58ea5ebe5a4ed8ea2d7bd6e7b5e
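The conversion shown in the table can be approximated with a minimal Python sketch. The function name `encode_to_bits` and the `CODECS` mapping are hypothetical. The mapping assumes that Python's `cp932` and `cp949` codecs correspond to this page's SJIS-WIN and UHC, which may not hold for every character. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(CODECS[charset], errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for name in CODECS:
        binary, hexadecimal = encode_to_bits("音n}", name)
        print(name, binary, hexadecimal)
```

For example, `encode_to_bits("音", "UTF-8")` returns the hexadecimal string `e99fb3`, and `encode_to_bits("音", "EUC-JP")` returns `b2bb`, matching the corresponding bytes in the table.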

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).